WorldWideScience

Sample records for genome diversity project

  1. The Human Genome Diversity Project

    Energy Technology Data Exchange (ETDEWEB)

    Cavalli-Sforza, L. [Stanford Univ., CA (United States)]

    1994-12-31

    The Human Genome Diversity Project (HGD Project) is an international anthropology project that seeks to study the genetic richness of the entire human species. This kind of genetic information can add a unique thread to the tapestry of knowledge of humanity. Culture, environment, history, and other factors are often more important, but humanity's genetic heritage, when analyzed with recent technology, brings another type of evidence for understanding the species' past and present. The Project will deepen the understanding of this genetic richness and show both humanity's diversity and its deep and underlying unity. The HGD Project is still largely in its planning stages, seeking the best ways to reach its goals. The continuing discussions of the Project, throughout the world, should improve the plans for the Project and their implementation. The Project is as global as humanity itself; its implementation will require the kinds of partnerships among different nations and cultures that make the involvement of UNESCO and other international organizations particularly appropriate. The author will briefly discuss the Project's history, describe the Project, set out the core principles of the Project, and demonstrate how the Project will help combat the scourge of racism.

  2. The Simons Genome Diversity Project: 300 genomes from 142 diverse populations

    Science.gov (United States)

    Mallick, Swapan; Li, Heng; Lipson, Mark; Mathieson, Iain; Gymrek, Melissa; Racimo, Fernando; Zhao, Mengyao; Chennagiri, Niru; Nordenfelt, Susanne; Tandon, Arti; Skoglund, Pontus; Lazaridis, Iosif; Sankararaman, Sriram; Fu, Qiaomei; Rohland, Nadin; Renaud, Gabriel; Erlich, Yaniv; Willems, Thomas; Gallo, Carla; Spence, Jeffrey P.; Song, Yun S.; Poletti, Giovanni; Balloux, Francois; van Driem, George; de Knijff, Peter; Romero, Irene Gallego; Jha, Aashish R.; Behar, Doron M.; Bravi, Claudio M.; Capelli, Cristian; Hervig, Tor; Moreno-Estrada, Andres; Posukh, Olga L.; Balanovska, Elena; Balanovsky, Oleg; Karachanak-Yankova, Sena; Sahakyan, Hovhannes; Toncheva, Draga; Yepiskoposyan, Levon; Tyler-Smith, Chris; Xue, Yali; Abdullah, M. Syafiq; Ruiz-Linares, Andres; Beall, Cynthia M.; Di Rienzo, Anna; Jeong, Choongwon; Starikovskaya, Elena B.; Metspalu, Ene; Parik, Jüri; Villems, Richard; Henn, Brenna M.; Hodoglugil, Ugur; Mahley, Robert; Sajantila, Antti; Stamatoyannopoulos, George; Wee, Joseph T. S.; Khusainova, Rita; Khusnutdinova, Elza; Litvinov, Sergey; Ayodo, George; Comas, David; Hammer, Michael; Kivisild, Toomas; Klitz, William; Winkler, Cheryl; Labuda, Damian; Bamshad, Michael; Jorde, Lynn B.; Tishkoff, Sarah A.; Watkins, W. Scott; Metspalu, Mait; Dryomov, Stanislav; Sukernik, Rem; Singh, Lalji; Thangaraj, Kumarasamy; Pääbo, Svante; Kelso, Janet; Patterson, Nick; Reich, David

    2016-01-01

    We report the Simons Genome Diversity Project (SGDP) dataset: high quality genomes from 300 individuals from 142 diverse populations. These genomes include at least 5.8 million base pairs that are not present in the human reference genome. Our analysis reveals key features of the landscape of human genome variation, including that the rate of accumulation of mutations has accelerated by about 5% in non-Africans compared to Africans since divergence. We show that the ancestors of some pairs of present-day human populations were substantially separated by 100,000 years ago, well before the archaeologically attested onset of behavioral modernity. We also demonstrate that indigenous Australians, New Guineans and Andamanese do not derive substantial ancestry from an early dispersal of modern humans; instead, their modern human ancestry is consistent with coming from the same source as that in other non-Africans. PMID:27654912

  3. The Simons Genome Diversity Project: 300 genomes from 142 diverse populations.

    Science.gov (United States)

    Mallick, Swapan; Li, Heng; Lipson, Mark; Mathieson, Iain; Gymrek, Melissa; Racimo, Fernando; Zhao, Mengyao; Chennagiri, Niru; Nordenfelt, Susanne; Tandon, Arti; Skoglund, Pontus; Lazaridis, Iosif; Sankararaman, Sriram; Fu, Qiaomei; Rohland, Nadin; Renaud, Gabriel; Erlich, Yaniv; Willems, Thomas; Gallo, Carla; Spence, Jeffrey P; Song, Yun S; Poletti, Giovanni; Balloux, Francois; van Driem, George; de Knijff, Peter; Romero, Irene Gallego; Jha, Aashish R; Behar, Doron M; Bravi, Claudio M; Capelli, Cristian; Hervig, Tor; Moreno-Estrada, Andres; Posukh, Olga L; Balanovska, Elena; Balanovsky, Oleg; Karachanak-Yankova, Sena; Sahakyan, Hovhannes; Toncheva, Draga; Yepiskoposyan, Levon; Tyler-Smith, Chris; Xue, Yali; Abdullah, M Syafiq; Ruiz-Linares, Andres; Beall, Cynthia M; Di Rienzo, Anna; Jeong, Choongwon; Starikovskaya, Elena B; Metspalu, Ene; Parik, Jüri; Villems, Richard; Henn, Brenna M; Hodoglugil, Ugur; Mahley, Robert; Sajantila, Antti; Stamatoyannopoulos, George; Wee, Joseph T S; Khusainova, Rita; Khusnutdinova, Elza; Litvinov, Sergey; Ayodo, George; Comas, David; Hammer, Michael F; Kivisild, Toomas; Klitz, William; Winkler, Cheryl A; Labuda, Damian; Bamshad, Michael; Jorde, Lynn B; Tishkoff, Sarah A; Watkins, W Scott; Metspalu, Mait; Dryomov, Stanislav; Sukernik, Rem; Singh, Lalji; Thangaraj, Kumarasamy; Pääbo, Svante; Kelso, Janet; Patterson, Nick; Reich, David

    2016-10-13

    Here we report the Simons Genome Diversity Project data set: high quality genomes from 300 individuals from 142 diverse populations. These genomes include at least 5.8 million base pairs that are not present in the human reference genome. Our analysis reveals key features of the landscape of human genome variation, including that the rate of accumulation of mutations has accelerated by about 5% in non-Africans compared to Africans since divergence. We show that the ancestors of some pairs of present-day human populations were substantially separated by 100,000 years ago, well before the archaeologically attested onset of behavioural modernity. We also demonstrate that indigenous Australians, New Guineans and Andamanese do not derive substantial ancestry from an early dispersal of modern humans; instead, their modern human ancestry is consistent with coming from the same source as that of other non-Africans.

  4. Life in our hands? Some ethical perspectives on the human genome and human genome diversity projects

    Directory of Open Access Journals (Sweden)

    Cornelius W. du Toit

    2014-01-01

    The article dealt with implications of the human genome and the human genome diversity project. It examined some theological implications, such as: humans as the image of God, God as the creator of life, the changed role of miracles and healings in religion, the sacredness of nature, life and the genome. Ethical issues that were addressed include eugenics, germline intervention, determinism and the human genome diversity project. Economic and legal factors that play a role were also discussed. Whilst positive aspects of genome research were considered, a critical stance was adopted towards patenting the human genome and some concluding guidelines were proposed.

  5. The Human Genome Diversity (HGD) Project. Summary document

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1993-12-31

    In 1991 a group of human geneticists and molecular biologists proposed to the scientific community that a worldwide survey be undertaken of variation in the human genome. To aid their considerations, the committee therefore decided to hold a small series of international workshops to explore the major scientific issues involved. The intention was to define a framework for the project which could provide a basis for much wider and more detailed discussion and planning. It was recognized that the successful implementation of the proposed project, which has come to be known as the Human Genome Diversity (HGD) Project, would not only involve scientists but also various national and international non-scientific groups, all of which should contribute to the project's development. The international HGD workshop held in Sardinia in September 1993 was the last in the initial series of planning workshops. As such it not only explored new ground but also pulled together into a more coherent form much of the formal and informal discussion that had taken place in the preceding two years. This report presents the deliberations of the Sardinia workshop within a consideration of the overall development of the HGD Project to date.

  6. Human Genome Diversity Project. Summary of planning workshop 3(B): Ethical and human-rights implications

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1993-12-31

    The third planning workshop of the Human Genome Diversity Project was held on the campus of the US National Institutes of Health in Bethesda, Maryland, from February 16 through February 18, 1993. The second day of the workshop was devoted to an exploration of the ethical and human-rights implications of the Project. This open meeting centered on three roundtables, involving 12 invited participants, and the resulting discussions among all those present. Attendees and their affiliations are listed in the attached Appendix A. The discussion was guided by a schedule and list of possible issues, distributed to all present and attached as Appendix B. This is a relatively complete, and thus lengthy, summary of the comments at the meeting. The beginning of the summary sets out as conclusions some issues on which there appeared to be widespread agreement, but those conclusions are not intended to serve as a set of detailed recommendations. The meeting organizer is distributing his recommendations in a separate memorandum; recommendations from others who attended the meeting are welcome and will be distributed by the meeting organizer to the participants and to the Project committee.

  7. Population Stratification and Underrepresentation of Indian Subcontinent Genetic Diversity in the 1000 Genomes Project Dataset.

    Science.gov (United States)

    Sengupta, Dhriti; Choudhury, Ananyo; Basu, Analabha; Ramsay, Michèle

    2016-12-31

    Genomic variation in Indian populations is of great interest due to the diversity of ancestral components, social stratification, endogamy and complex admixture patterns. With an expanding population of 1.2 billion, India is also a treasure trove to catalogue innocuous as well as clinically relevant rare mutations. Recent studies have revealed four dominant ancestries in populations from mainland India: Ancestral North-Indian (ANI), Ancestral South-Indian (ASI), Ancestral Tibeto-Burman (ATB) and Ancestral Austro-Asiatic (AAA). The 1000 Genomes Project (KGP) Phase-3 data include about 500 genomes from five linguistically defined Indian-Subcontinent (IS) populations (Punjabi, Gujrati, Bengali, Telugu and Tamil) some of whom are recent migrants to USA or UK. Comparative analyses show that despite the distinct geographic origins of the KGP-IS populations, the ANI component is predominantly represented in this dataset. Previous studies demonstrated population substructure in the HapMap Gujrati population, and we found evidence for additional substructure in the Punjabi and Telugu populations. These substructured populations have characteristic/significant differences in heterozygosity and inbreeding coefficients. Moreover, we demonstrate that the substructure is better explained by factors like differences in proportion of ancestral components, and endogamy driven social structure rather than invoking a novel ancestral component to explain it. Therefore, using language and/or geography as a proxy for an ethnic unit is inadequate for many of the IS populations. This highlights the necessity for more nuanced sampling strategies or corrective statistical approaches, particularly for biomedical and population genetics research in India.
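The heterozygosity and inbreeding-coefficient comparison mentioned in this abstract can be illustrated with a minimal sketch. It assumes biallelic loci with genotypes coded as alternate-allele counts (0, 1, 2) and uses the textbook estimator F = 1 - Ho/He, pooled over loci. The function name and the toy genotypes are hypothetical and are not taken from the study, which may have used a different estimator.

```python
def inbreeding_coefficient(loci):
    """Estimate F = 1 - Ho/He, pooling over biallelic loci.

    `loci` is a list of loci; each locus is a list of genotypes coded as
    alternate-allele counts (0, 1 or 2), one entry per individual.
    """
    observed = 0.0  # summed observed heterozygosity (Ho)
    expected = 0.0  # summed Hardy-Weinberg expected heterozygosity (He)
    for genotypes in loci:
        n = len(genotypes)
        p = sum(genotypes) / (2 * n)        # alternate-allele frequency
        observed += genotypes.count(1) / n  # fraction of heterozygotes
        expected += 2 * p * (1 - p)
    if expected == 0:
        raise ValueError("all loci are monomorphic; F is undefined")
    return 1 - observed / expected


# Toy data (hypothetical): two loci, four individuals each.
hardy_weinberg_like = [[0, 1, 1, 2], [0, 1, 1, 2]]
no_heterozygotes = [[0, 0, 2, 2], [0, 0, 2, 2]]

print(inbreeding_coefficient(hardy_weinberg_like))
print(inbreeding_coefficient(no_heterozygotes))
```

A sample with as many heterozygotes as Hardy-Weinberg proportions predict gives F near 0, while a deficit of heterozygotes, as expected under endogamy or hidden substructure, pushes F towards 1.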

  8. Principal component analysis reveals the 1000 Genomes Project does not sufficiently cover the human genetic diversity in Asia

    Directory of Open Access Journals (Sweden)

    Dongsheng Lu

    2013-07-01

    The 1000 Genomes Project (1KG) aims to provide a comprehensive resource on human genetic variations. With an effort of sequencing 2,500 individuals, 1KG is expected to cover the majority of the human genetic diversities worldwide. In this study, using analysis of population structure based on genome-wide single nucleotide polymorphisms (SNPs) data, we examined and evaluated the coverage of genetic diversity of 1KG samples with the available genome-wide SNP data of 3,831 individuals representing 140 population samples worldwide. We developed a method to quantitatively measure and evaluate the genetic diversity revealed by population structure analysis. Our results showed that the 1KG does not have sufficient coverage of the human genetic diversity in Asia, especially in Southeast Asia. We suggested a good coverage of Southeast Asian populations be considered in 1KG or a regional effort should be initialized to provide a more comprehensive characterization of the human genetic diversity in Asia, which is important for both evolutionary and medical studies in the future.
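As a rough illustration of how principal component analysis separates samples by population structure, the sketch below extracts the first principal component of a small SNP genotype matrix. It is not the authors' coverage-measuring method: it assumes genotypes coded 0/1/2, uses plain power iteration on the sample-by-sample Gram matrix in place of a library eigen-solver, and the function name and toy data are hypothetical.

```python
import math


def first_principal_component(genotypes, iterations=200):
    """Return per-sample scores on PC1 (unit length, sign arbitrary).

    `genotypes` is a list of samples; each sample is a list of SNP
    genotypes coded as alternate-allele counts (0, 1 or 2).
    """
    n = len(genotypes)
    m = len(genotypes[0])
    # Center every SNP column so PCA describes variation around the mean.
    means = [sum(row[j] for row in genotypes) / n for j in range(m)]
    centered = [[row[j] - means[j] for j in range(m)] for row in genotypes]
    # Sample-by-sample Gram matrix; its top eigenvector gives PC1 scores.
    gram = [[sum(a * b for a, b in zip(centered[i], centered[k]))
             for k in range(n)] for i in range(n)]
    # Power iteration from a deterministic, non-constant start vector.
    vec = [float(i + 1) for i in range(n)]
    for _ in range(iterations):
        nxt = [sum(gram[i][k] * vec[k] for k in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in nxt))
        if norm == 0.0:
            return [0.0] * n  # no variation left to describe
        vec = [x / norm for x in nxt]
    return vec


# Toy data (hypothetical): samples 0-2 and samples 3-5 form two groups.
samples = [
    [0, 0, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 1],
    [2, 2, 1, 2],
    [2, 1, 2, 2],
    [2, 2, 2, 1],
]
pc1 = first_principal_component(samples)
print([round(score, 3) for score in pc1])
```

Samples from the same toy group receive PC1 scores of the same sign, which is the kind of clustering a population-structure plot relies on. A reference panel that lacks a region's populations leaves the corresponding part of that plot empty.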

  9. Wheat Landrace Genome Diversity.

    Science.gov (United States)

    Wingen, Luzie U; West, Claire; Leverington-Waite, Michelle; Collier, Sarah; Orford, Simon; Goram, Richard; Yang, Cai-Yun; King, Julie; Allen, Alexandra M; Burridge, Amanda; Edwards, Keith J; Griffiths, Simon

    2017-02-17

    Understanding the genomic complexity of bread wheat (Triticum aestivum L.) is a cornerstone in the quest to unravel the processes of domestication and the following adaptation of domesticated wheat to a wide variety of environments across the globe. Additionally, it is of importance for future improvement of the crop, particularly in the light of climate change. Focussing on the adaptation after domestication, a nested association mapping (NAM) panel of 60 segregating bi-parental populations was developed mainly involving landrace accessions from the core set of the Watkins hexaploid wheat collection optimized for genetic diversity (WINGEN et al. 2014). A modern spring elite variety, 'Paragon,' was used as common reference parent. Genetic maps were constructed following identical rules to make them comparable. In total, 1,611 linkage groups were identified, based on recombination from an estimated 126,300 crossover events over the whole NAM panel. A consensus map, named landrace consensus map (LRC) was constructed and contained 2,498 genetic loci. These newly developed genetics tools were used to investigate the rules underlying genome fluidity or rigidity, e.g. by comparing marker distances and marker orders. In general, marker order was highly correlated, which provides support for strong synteny between bread wheat accessions. However, many exceptional cases of incongruent linkage groups and increased marker distances were also found. Segregation distortion was detected for many markers, sometimes as hot-spots present in different populations. Furthermore, evidence for translocations in at least 36 of the maps was found. These translocations fell, in general, into many different translocation classes, but a few translocation classes were found in several accessions, the most frequent one being the well-known T5B:7B translocation. Loci involved in recombination rate, which is an interesting trait for plant breeding, were identified by QTL analyses using the

  10. Ethical aspects of genome diversity research: genome research into cultural diversity or cultural diversity in genome research?

    Science.gov (United States)

    Ilkilic, Ilhan; Paul, Norbert W

    2009-03-01

    The goal of the Human Genome Diversity Project (HGDP) was to reconstruct the history of human evolution and the historical and geographical distribution of populations with the help of scientific research. Through this kind of research, the entire spectrum of genetic diversity to be found in the human species was to be explored with the hope of generating a better understanding of the history of humankind. An important part of this genome diversity research consists in taking blood and tissue samples from indigenous populations. For various reasons, it has not been possible to execute this project in the planned scope and form to date. Nevertheless, genomic diversity research addresses complex issues which prove to be highly relevant from the perspective of research ethics, transcultural medical ethics, and cultural philosophy. In the article at hand, we discuss these ethical issues as illustrated by the HGDP. This investigation focuses on the confrontation of culturally diverse images of humans and their cosmologies within the framework of genome diversity research and the ethical questions it raises. We argue that in addition to complex questions pertaining to research ethics such as informed consent and autonomy of probands, genome diversity research also has a cultural-philosophical, meta-ethical, and phenomenological dimension which must be taken into account in ethical discourses. Acknowledging this fact, we attempt to show the limits of current guidelines used in international genome diversity studies, following this up by a formulation of theses designed to facilitate an appropriate inquiry and ethical evaluation of intercultural dimensions of genome research.

  11. Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Block, S. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Cornwall, J. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Dally, W. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Dyson, F. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Fortson, N. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Joyce, G. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Kimble, H. J. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Lewis, N. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Max, C. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Prince, T. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Schwitters, R. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Weinberger, P. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Woodin, W. H. [The MITRE Corporation, McLean, VA (US). JASON Program Office

    1998-01-04

    The study reviews Department of Energy supported aspects of the United States Human Genome Project, the joint National Institutes of Health/Department of Energy program to characterize all human genetic material, to discover the set of human genes, and to render them accessible for further biological study. The study concentrates on issues of technology, quality assurance/control, and informatics relevant to current effort on the genome project and needs beyond it. Recommendations are presented on areas of the genome program that are of particular interest to and supported by the Department of Energy.

  12. HLA diversity in the 1000 genomes dataset.

    Directory of Open Access Journals (Sweden)

    Pierre-Antoine Gourraud

    The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation by sequencing at a level that should allow the genome-wide detection of most variants with frequencies as low as 1%. However, in the major histocompatibility complex (MHC), only the top 10 most frequent haplotypes are in the 1% frequency range whereas thousands of haplotypes are present at lower frequencies. Given the limitation of both the coverage and the read length of the sequences generated by the 1000 Genomes Project, the highly variable positions that define HLA alleles may be difficult to identify. We used classical Sanger sequencing techniques to type the HLA-A, HLA-B, HLA-C, HLA-DRB1 and HLA-DQB1 genes in the available 1000 Genomes samples and combined the results with the 103,310 variants in the MHC region genotyped by the 1000 Genomes Project. Using pairwise identity-by-descent distances between individuals and principal component analysis, we established the relationship between ancestry and genetic diversity in the MHC region. As expected, both the MHC variants and the HLA phenotype can identify the major ancestry lineage, informed mainly by the most frequent HLA haplotypes. To some extent, regions of the genome with similar genetic or similar recombination rate have similar properties. An MHC-centric analysis underlines departures between the ancestral background of the MHC and the genome-wide picture. Our analysis of linkage disequilibrium (LD) decay in these samples suggests that overestimation of pairwise LD occurs due to a limited sampling of the MHC diversity. This collection of HLA-specific MHC variants, available on the dbMHC portal, is a valuable resource for future analyses of the role of MHC in population and disease studies.
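Pairwise linkage disequilibrium of the kind discussed in this abstract is commonly summarized with the r² statistic. The sketch below shows that calculation for two biallelic loci, assuming phased haplotypes with alleles coded 0/1. The function name and toy haplotypes are hypothetical, and the study's own LD pipeline is not described in the abstract.

```python
def ld_r_squared(haplotypes):
    """Return r^2 between two biallelic loci.

    `haplotypes` is a list of (a, b) tuples: the allele (0 or 1) carried
    at locus A and at locus B on each phased haplotype.
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n  # frequency of allele 1 at A
    p_b = sum(b for _, b in haplotypes) / n  # frequency of allele 1 at B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                     # disequilibrium coefficient D
    denominator = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denominator == 0:
        raise ValueError("a locus is monomorphic; r^2 is undefined")
    return d * d / denominator


# Toy data (hypothetical).
fully_linked = [(1, 1), (1, 1), (0, 0), (0, 0)]
independent = [(1, 1), (1, 0), (0, 1), (0, 0)]

print(ld_r_squared(fully_linked))
print(ld_r_squared(independent))
```

With only a handful of haplotypes, alleles can look perfectly associated by chance, which is one way a limited sample of a highly diverse region could inflate pairwise LD estimates.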

  13. Pseudomonas genomes: diverse and adaptable.

    Science.gov (United States)

    Silby, Mark W; Winstanley, Craig; Godfrey, Scott A C; Levy, Stuart B; Jackson, Robert W

    2011-07-01

    Members of the genus Pseudomonas inhabit a wide variety of environments, which is reflected in their versatile metabolic capacity and broad potential for adaptation to fluctuating environmental conditions. Here, we examine and compare the genomes of a range of Pseudomonas spp. encompassing plant, insect and human pathogens, and environmental saprophytes. In addition to a large number of allelic differences of common genes that confer regulatory and metabolic flexibility, genome analysis suggests that many other factors contribute to the diversity and adaptability of Pseudomonas spp. Horizontal gene transfer has impacted the capability of pathogenic Pseudomonas spp. in terms of disease severity (Pseudomonas aeruginosa) and specificity (Pseudomonas syringae). Genome rearrangements likely contribute to adaptation, and a considerable complement of unique genes undoubtedly contributes to strain- and species-specific activities by as yet unknown mechanisms. Because of the lack of conserved phenotypic differences, the classification of the genus has long been contentious. DNA hybridization and genome-based analyses show close relationships among members of P. aeruginosa, but that isolates within the Pseudomonas fluorescens and P. syringae species are less closely related and may constitute different species. Collectively, genome sequences of Pseudomonas spp. have provided insights into pathogenesis and the genetic basis for diversity and adaptation.

  14. Genome Sequences of Eight Morphologically Diverse Alphaproteobacteria

    OpenAIRE

    Brown, Pamela J.B.; Kysela, David T.; Buechlein, Aaron; Hemmerich, Chris; Brun, Yves V

    2011-01-01

    The Alphaproteobacteria comprise morphologically diverse bacteria, including many species of stalked bacteria. Here we announce the genome sequences of eight alphaproteobacteria, including the first genome sequences of species belonging to the genera Asticcacaulis, Hirschia, Hyphomicrobium, and Rhodomicrobium.

  15. Genome sequences of eight morphologically diverse Alphaproteobacteria.

    Science.gov (United States)

    Brown, Pamela J B; Kysela, David T; Buechlein, Aaron; Hemmerich, Chris; Brun, Yves V

    2011-09-01

    The Alphaproteobacteria comprise morphologically diverse bacteria, including many species of stalked bacteria. Here we announce the genome sequences of eight alphaproteobacteria, including the first genome sequences of species belonging to the genera Asticcacaulis, Hirschia, Hyphomicrobium, and Rhodomicrobium.

  16. Genome Sequences of Eight Morphologically Diverse Alphaproteobacteria

    Science.gov (United States)

    Brown, Pamela J. B.; Kysela, David T.; Buechlein, Aaron; Hemmerich, Chris; Brun, Yves V.

    2011-01-01

    The Alphaproteobacteria comprise morphologically diverse bacteria, including many species of stalked bacteria. Here we announce the genome sequences of eight alphaproteobacteria, including the first genome sequences of species belonging to the genera Asticcacaulis, Hirschia, Hyphomicrobium, and Rhodomicrobium. PMID:21705585

  17. Pseudomonas aeruginosa genomic structure and diversity

    Directory of Open Access Journals (Sweden)

    Jens Klockgether

    2011-07-01

    The Pseudomonas aeruginosa genome (G + C content 65-67%, size 5.5 – 7 Mbp) is made up of a single circular chromosome and a variable number of plasmids. Sequencing of complete genomes or blocks of the accessory genome has revealed that the genome encodes a large repertoire of transporters, transcriptional regulators and two-component regulatory systems which reflects its metabolic diversity to utilize a broad range of nutrients. The conserved core component of the genome is largely collinear among P. aeruginosa strains and exhibits an interclonal sequence diversity of 0.5 – 0.7%. Only a few loci of the core genome are subject to diversifying selection. Genome diversity is mainly caused by accessory DNA elements located in 79 regions of genome plasticity that are scattered around the genome and show an anomalous usage of mono- to tetradecanucleotides. Genomic islands of the pKLC102/PAGI-2 family that integrate into tRNALys or tRNAGly genes represent hotspots of inter- and intraclonal genomic diversity. The individual islands differ in their repertoire of metabolic genes that make a large contribution to the pangenome. In order to unravel intraclonal diversity of P. aeruginosa, the genomes of two members of the PA14 clonal complex from diverse habitats and geographic origin were compared. The genome sequences differed by less than 0.01% from each other. 198 of the 231 SNPs were non-randomly distributed in the genome. Non-synonymous SNPs were mainly found in an integrated Pf1-like phage and in genes involved in transcriptional regulation, membrane and extracellular constituents, transport and secretion. In summary, P. aeruginosa is endowed with a highly conserved core genome of low sequence diversity and a highly variable accessory genome that communicates with other pseudomonads and genera via horizontal gene transfer.
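Two of the quantities quoted in this abstract, G + C content and percent sequence difference between aligned genomes, are simple ratios. The sketch below shows both under stated assumptions: sequences are plain strings, ambiguous bases are ignored for G + C content, and the two sequences compared are already aligned and of equal length. The function names and toy sequences are hypothetical.

```python
def gc_content(sequence):
    """Return the G + C percentage of a DNA sequence, ignoring non-ACGT symbols."""
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T")
    return 100.0 * sum(1 for base in bases if base in "GC") / len(bases)


def percent_difference(seq_a, seq_b):
    """Return the percentage of mismatching positions in two aligned sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    mismatches = sum(1 for a, b in zip(seq_a, seq_b) if a != b)
    return 100.0 * mismatches / len(seq_a)


# Toy sequences (hypothetical), far shorter than a real genome.
print(gc_content("GGCCGGCCAT"))
print(percent_difference("ACGTACGTAC", "ACGTACGTAA"))
```

On real data the same ratios would be computed over whole-genome alignments; the interclonal and intraclonal diversity figures in the abstract are values of this kind.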

  18. The Materials Genome Project

    Science.gov (United States)

    Aourag, H.

    2008-09-01

    In the past, the search for new and improved materials was characterized mostly by the use of empirical, trial-and-error methods. This picture of materials science has been changing as the knowledge and understanding of fundamental processes governing a material's properties and performance (namely, composition, structure, history, and environment) have increased. In a number of cases, it is now possible to predict a material's properties before it has even been manufactured thus greatly reducing the time spent on testing and development. The objective of modern materials science is to tailor a material (starting with its chemical composition, constituent phases, and microstructure) in order to obtain a desired set of properties suitable for a given application. In the short term, the traditional "empirical" methods for developing new materials will be complemented to a greater degree by theoretical predictions. In some areas, computer simulation is already used by industry to weed out costly or improbable synthesis routes. Can novel materials with optimized properties be designed by computers? Advances in modelling methods at the atomic level coupled with rapid increases in computer capabilities over the last decade have led scientists to answer this question with a resounding "yes". The ability to design new materials from quantum mechanical principles with computers is currently one of the fastest growing and most exciting areas of theoretical research in the world. The methods allow scientists to evaluate and prescreen new materials "in silico" (in vitro), rather than through time consuming experimentation. The Materials Genome Project is to pursue the theory of large scale modeling as well as powerful methods to construct new materials, with optimized properties. Indeed, it is the intimate synergy between our ability to predict accurately from quantum theory how atoms can be assembled to form new materials and our capacity to synthesize novel materials atom

  19. Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium

    DEFF Research Database (Denmark)

    Machado, Henrique; Gram, Lone

    2017-01-01

    Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand...... the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationships using several analyses (16S rRNA, MLSA, fur, amino-acid usage, ANI), which allowed us to identify two...... misidentified strains. Genome analyses also revealed occurrence of higher and lower GC content clades, correlating with phylogenetic clusters. Pan- and core-genome analysis revealed the conservation of 25% of the genome throughout the genus, with a large and open pan-genome. The major source of genomic diversity...

  20. Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium.

    Science.gov (United States)

    Machado, Henrique; Gram, Lone

    2017-01-01

    Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationships using several analyses (16S rRNA, MLSA, fur, amino-acid usage, ANI), which allowed us to identify two misidentified strains. Genome analyses also revealed occurrence of higher and lower GC content clades, correlating with phylogenetic clusters. Pan- and core-genome analysis revealed the conservation of 25% of the genome throughout the genus, with a large and open pan-genome. The major source of genomic diversity could be traced to the smaller chromosome and plasmids. Several of the physiological traits studied in the genus did not correlate with phylogenetic data. Since horizontal gene transfer (HGT) is often suggested as a source of genetic diversity and a potential driver of genomic evolution in bacterial species, we looked into evidence of such in Photobacterium genomes. Genomic islands were the source of genomic differences between strains of the same species. Also, we found transposase genes and CRISPR arrays that suggest multiple encounters with foreign DNA. Presence of genomic exchange traits was widespread and abundant in the genus, suggesting a role in genomic evolution. The high genetic variability and indications of genetic exchange make it difficult to elucidate genome evolutionary paths and raise the awareness of the roles of foreign DNA in the genomic evolution of environmental organisms.
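The pan- and core-genome idea in this abstract can be sketched with set operations: the pan-genome is the union of gene families across strains, and the core genome is their intersection. This is only an illustration under simplifying assumptions: gene families are already clustered and named, the strain and gene names are hypothetical, and the 25% figure in the abstract may have been computed with a different denominator.

```python
def pan_and_core(genomes):
    """Return (pan_genome, core_genome) as sets of gene-family names.

    `genomes` maps a strain name to an iterable of the gene families
    found in that strain.
    """
    if not genomes:
        raise ValueError("at least one genome is required")
    gene_sets = [set(genes) for genes in genomes.values()]
    pan = set().union(*gene_sets)         # families seen in any strain
    core = set.intersection(*gene_sets)   # families shared by every strain
    return pan, core


# Toy data (hypothetical strains and gene families).
strains = {
    "strain_a": ["dnaA", "gyrB", "recA", "luxA"],
    "strain_b": ["dnaA", "gyrB", "recA", "tnpA"],
    "strain_c": ["dnaA", "gyrB", "cas1", "tnpA"],
}
pan, core = pan_and_core(strains)
print(sorted(core))
print(len(core) / len(pan))
```

A pan-genome is described as open when adding further strains keeps enlarging the union, while the core typically shrinks or levels off.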

  1. Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium

    OpenAIRE

    Henrique Machado; Lone Gram

    2017-01-01

    Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationship...

  2. Genomic diversity and evolution of the lyssaviruses.

    Directory of Open Access Journals (Sweden)

    Olivier Delmas

    Lyssaviruses are RNA viruses with single-strand, negative-sense genomes responsible for rabies-like diseases in mammals. To date, genomic and evolutionary studies have most often utilized partial genome sequences, particularly of the nucleoprotein and glycoprotein genes, with little consideration of genome-scale evolution. Herein, we report the first genomic and evolutionary analysis using complete genome sequences of all recognised lyssavirus genotypes, including 14 new complete genomes of field isolates from 6 genotypes and one genotype that is completely sequenced for the first time. In doing so we significantly increase the extent of genome sequence data available for these important viruses. Our analysis of these genome sequence data reveals that all lyssaviruses have the same genomic organization. A phylogenetic analysis reveals strong geographical structuring, with the greatest genetic diversity in Africa, and an independent origin for the two known genotypes that infect European bats. We also suggest that multiple genotypes may exist within the diversity of viruses currently classified as 'Lagos Bat'. In sum, we show that rigorous phylogenetic techniques based on full length genome sequence provide the best discriminatory power for genotype classification within the lyssaviruses.

  3. Malaria Genome Sequencing Project

    Science.gov (United States)

    2004-01-01

    million cases and up to 2.7 million deaths from malaria each year. The mortality levels are greatest in sub-Saharan Africa... A whole chromosome shotgun sequencing strategy was used to determine the genome sequence of P. falciparum clone 3D7. This...aminolevulinic acid dehydratase. Curr. Genet. 40, 391-398 (2002). 15. Lasonder, E. et al. Analysis of the Plasmodium falciparum proteome by high-accuracy mass

  4. All about the Human Genome Project (HGP)

    Science.gov (United States)

    ... Genome Resources Access to the full human sequence All About The Human Genome Project (HGP) The Human ... an international research effort to sequence and map all of the genes - together known as the genome - ...

  5. Genomic diversity within the haloalkaliphilic genus Thioalkalivibrio.

    Science.gov (United States)

    Ahn, Anne-Catherine; Meier-Kolthoff, Jan P; Overmars, Lex; Richter, Michael; Woyke, Tanja; Sorokin, Dimitry Y; Muyzer, Gerard

    2017-01-01

    Thioalkalivibrio is a genus of obligate chemolithoautotrophic haloalkaliphilic sulfur-oxidizing bacteria. Their habitats are soda lakes, which are dual extreme environments with a pH range from 9.5 to 11 and salt concentrations up to saturation. More than 100 strains of this genus have been isolated from various soda lakes all over the world, but only ten species have been effectively described so far. Therefore, the assignment of the remaining strains to either existing or novel species is important and will further elucidate their genomic diversity as well as give a better general understanding of this genus. Recently, the genomes of 76 Thioalkalivibrio strains were sequenced. To these, we applied several methods, including (i) 16S rRNA gene sequence analysis, (ii) Multilocus Sequence Analysis (MLSA) based on eight housekeeping genes, (iii) Average Nucleotide Identity based on BLAST (ANIb) and MUMmer (ANIm), (iv) Tetranucleotide frequency correlation coefficients (TETRA), (v) digital DNA:DNA hybridization (dDDH) as well as (vi) nucleotide- and amino acid-based Genome BLAST Distance Phylogeny (GBDP) analyses. We detected a high genomic diversity by revealing 15 new "genomic" species and 16 new "genomic" subspecies in addition to the ten already described species. Phylogenetic and phylogenomic analyses showed that the genus is not monophyletic, because four strains were clearly separated from the other Thioalkalivibrio by type strains from other genera. Therefore, it is recommended to classify the latter group as a novel genus. The biogeographic distribution of Thioalkalivibrio suggested that the different "genomic" species can be classified as candidate disjunct or candidate endemic species. This study is a detailed genome-based classification and identification of members within the genus Thioalkalivibrio. However, future phenotypical and chemotaxonomical studies will be needed for a full species description of this genus.
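
    Of the methods listed above, the tetranucleotide frequency correlation is the simplest to sketch. The version below is a deliberately simplified illustration, not the procedure used in the study: it correlates raw tetranucleotide frequencies on a single strand, whereas published TETRA implementations correlate z-scores of observed versus expected tetranucleotide usage. The function names are hypothetical.

```python
from itertools import product
from math import sqrt

# All 256 possible tetranucleotides, in a fixed order.
TETRAMERS = ["".join(p) for p in product("ACGT", repeat=4)]


def tetra_frequencies(seq):
    """Relative frequency of each tetranucleotide in seq (overlapping windows)."""
    seq = seq.upper()
    counts = dict.fromkeys(TETRAMERS, 0)
    for i in range(len(seq) - 3):
        kmer = seq[i:i + 4]
        if kmer in counts:  # skips windows containing N or other symbols
            counts[kmer] += 1
    total = sum(counts.values())
    return [counts[k] / total if total else 0.0 for k in TETRAMERS]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation undefined for a constant profile")
    return cov / sqrt(var_x * var_y)


def tetra_correlation(seq_a, seq_b):
    """Simplified TETRA-like score: correlation of raw tetranucleotide profiles."""
    return pearson(tetra_frequencies(seq_a), tetra_frequencies(seq_b))
```

    A genome compared with itself scores 1.0; genomes with dissimilar oligonucleotide usage score lower. Any species-level cut-off would have to come from the literature for the z-score variant rather than from this sketch.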

  6. OryzaGenome: Genome Diversity Database of Wild Oryza Species

    KAUST Repository

    Ohyanagi, Hajime

    2015-11-18

    The species in the genus Oryza, encompassing nine genome types and 23 species, are a rich genetic resource and may have applications in deeper genomic analyses aiming to understand the evolution of plant genomes. With the advancement of next-generation sequencing (NGS) technology, a flood of Oryza species reference genomes and genomic variation information has become available in recent years. This genomic information, combined with the comprehensive phenotypic information that we are accumulating in our Oryzabase, can serve as an excellent genotype-phenotype association resource for analyzing rice functional and structural evolution, and the associated diversity of the Oryza genus. Here we integrate our previous and future phenotypic/habitat information and newly determined genotype information into a united repository, named OryzaGenome, providing the variant information with hyperlinks to Oryzabase. The current version of OryzaGenome includes genotype information of 446 O. rufipogon accessions derived by imputation and of 17 accessions derived by imputation-free deep sequencing. Two variant viewers are implemented: SNP Viewer as a conventional genome browser interface and Variant Table as a text-based browser for precise inspection of each variant one by one. Portable VCF (variant call format) file or tab-delimited file download is also available. Following these SNP (single nucleotide polymorphism) data, reference pseudomolecules/scaffolds/contigs and genome-wide variation information for almost all of the closely and distantly related wild Oryza species from the NIG Wild Rice Collection will be available in future releases. All of the resources can be accessed through http://viewer.shigen.info/oryzagenome/.

  7. OryzaGenome: Genome Diversity Database of Wild Oryza Species.

    Science.gov (United States)

    Ohyanagi, Hajime; Ebata, Toshinobu; Huang, Xuehui; Gong, Hao; Fujita, Masahiro; Mochizuki, Takako; Toyoda, Atsushi; Fujiyama, Asao; Kaminuma, Eli; Nakamura, Yasukazu; Feng, Qi; Wang, Zi-Xuan; Han, Bin; Kurata, Nori

    2016-01-01

    The species in the genus Oryza, encompassing nine genome types and 23 species, are a rich genetic resource and may have applications in deeper genomic analyses aiming to understand the evolution of plant genomes. With the advancement of next-generation sequencing (NGS) technology, a flood of Oryza species reference genomes and genomic variation information has become available in recent years. This genomic information, combined with the comprehensive phenotypic information that we are accumulating in our Oryzabase, can serve as an excellent genotype-phenotype association resource for analyzing rice functional and structural evolution, and the associated diversity of the Oryza genus. Here we integrate our previous and future phenotypic/habitat information and newly determined genotype information into a united repository, named OryzaGenome, providing the variant information with hyperlinks to Oryzabase. The current version of OryzaGenome includes genotype information of 446 O. rufipogon accessions derived by imputation and of 17 accessions derived by imputation-free deep sequencing. Two variant viewers are implemented: SNP Viewer as a conventional genome browser interface and Variant Table as a text-based browser for precise inspection of each variant one by one. Portable VCF (variant call format) file or tab-delimited file download is also available. Following these SNP (single nucleotide polymorphism) data, reference pseudomolecules/scaffolds/contigs and genome-wide variation information for almost all of the closely and distantly related wild Oryza species from the NIG Wild Rice Collection will be available in future releases. All of the resources can be accessed through http://viewer.shigen.info/oryzagenome/.

  8. Organizing Diverse, Distributed Project Information

    Science.gov (United States)

    Keller, Richard M.

    2003-01-01

    SemanticOrganizer is a software application designed to organize and integrate information generated within a distributed organization or as part of a project that involves multiple, geographically dispersed collaborators. SemanticOrganizer incorporates the capabilities of database storage, document sharing, hypermedia navigation, and semantic-interlinking into a system that can be customized to satisfy the specific information-management needs of different user communities. The program provides a centralized repository of information that is both secure and accessible to project collaborators via the World Wide Web. SemanticOrganizer's repository can be used to collect diverse information (including forms, documents, notes, data, spreadsheets, images, and sounds) from computers at collaborators' work sites. The program organizes the information using a unique network-structured conceptual framework, wherein each node represents a data record that contains not only the original information but also metadata (in effect, standardized data that characterize the information). Links among nodes express semantic relationships among the data records. The program features a Web interface through which users enter, interlink, and/or search for information in the repository. By use of this repository, the collaborators have immediate access to the most recent project information, as well as to archived information. A key advantage of SemanticOrganizer is its ability to interlink information in a natural fashion using customized terminology and concepts that are familiar to a user community.

  9. Genomic diversity of Escherichia isolates from diverse habitats.

    Directory of Open Access Journals (Sweden)

    Seungdae Oh

    Our understanding of the Escherichia genus is heavily biased toward pathogenic or commensal isolates from human or animal hosts. Recent studies have recovered Escherichia isolates that persist, and even grow, outside these hosts. Although the environmental isolates are typically phylogenetically distinct, they are highly related to and phenotypically indistinguishable from their human counterparts, including for the coliform test. To gain insights into the genomic diversity of Escherichia isolates from diverse habitats, including freshwater, soil, animal, and human sources, we carried out comparative DNA-DNA hybridizations using a multi-genome E. coli DNA microarray. The microarray was validated based on hybridizations with selected strains whose genome sequences were available and used to assess the frequency of microarray false positive and negative signals. Our results showed that human fecal isolates share two sets of genes (n>90) that are rarely found among environmental isolates, including genes presumably important for evading host immune mechanisms (e.g., a multi-drug transporter for acids and antimicrobials) and adhering to epithelial cells (e.g., hemolysin E and fimbrial-like adhesin protein). These results imply that environmental isolates are characterized by decreased ability to colonize host cells relative to human isolates. Our study also provides gene markers that can distinguish human isolates from those of warm-blooded animal and environmental origins, and thus can be used to more reliably assess fecal contamination in natural ecosystems.

  10. Report of the second Human Genome Diversity workshop

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1992-12-31

    The Second Human Genome Diversity Workshop was successfully held at Penn State University from October 29-31, 1992. The Workshop was essentially organized around 7 groups, each comprising approximately 10 participants, representing the sampling issues in different regions of the world. These groups worked independently, using a common format provided by the organizers; this was adjusted as needed by the individual groups. The Workshop began with a presentation of the mandate to the participants, and of the procedures to be followed during the workshop. Dr. Feldman presented a summary of the results from the First Workshop. He and the other organizers also presented brief comments giving their perspective on the objectives of the Second Workshop. Dr. Julia Bodmer discussed the study of European genetic diversity, especially in the context of the HLA experience there, and of plans to extend such studies in the coming years. She also discussed surveys of world HLA laboratories in regard to resources related to Human Genome Diversity. Dr. Mark Weiss discussed the relevance of nonhuman primate studies for understanding how demographic processes, such as mate exchange between local groups, affected the local dispersion of genetic variation. Primate population geneticists have some relevant experience in interpreting variation at this local level, in particular, with various DNA fingerprinting methods. This experience may be relevant to the Human Genome Diversity Project, in terms of practical and statistical issues.

  11. Genomic diversity within the haloalkaliphilic genus Thioalkalivibrio

    Science.gov (United States)

    Ahn, Anne-Catherine; Meier-Kolthoff, Jan P.; Overmars, Lex; Richter, Michael; Woyke, Tanja; Sorokin, Dimitry Y.

    2017-01-01

    Thioalkalivibrio is a genus of obligate chemolithoautotrophic haloalkaliphilic sulfur-oxidizing bacteria. Their habitats are soda lakes, which are dual extreme environments with a pH range from 9.5 to 11 and salt concentrations up to saturation. More than 100 strains of this genus have been isolated from various soda lakes all over the world, but only ten species have been effectively described so far. Therefore, the assignment of the remaining strains to either existing or novel species is important and will further elucidate their genomic diversity as well as give a better general understanding of this genus. Recently, the genomes of 76 Thioalkalivibrio strains were sequenced. To these, we applied several methods, including (i) 16S rRNA gene sequence analysis, (ii) Multilocus Sequence Analysis (MLSA) based on eight housekeeping genes, (iii) Average Nucleotide Identity based on BLAST (ANIb) and MUMmer (ANIm), (iv) Tetranucleotide frequency correlation coefficients (TETRA), (v) digital DNA:DNA hybridization (dDDH) as well as (vi) nucleotide- and amino acid-based Genome BLAST Distance Phylogeny (GBDP) analyses. We detected a high genomic diversity by revealing 15 new “genomic” species and 16 new “genomic” subspecies in addition to the ten already described species. Phylogenetic and phylogenomic analyses showed that the genus is not monophyletic, because four strains were clearly separated from the other Thioalkalivibrio by type strains from other genera. Therefore, it is recommended to classify the latter group as a novel genus. The biogeographic distribution of Thioalkalivibrio suggested that the different “genomic” species can be classified as candidate disjunct or candidate endemic species. This study is a detailed genome-based classification and identification of members within the genus Thioalkalivibrio. However, future phenotypical and chemotaxonomical studies will be needed for a full species description of this genus. PMID:28282461

  12. Parasite Genome Projects and the Trypanosoma cruzi Genome Initiative

    Directory of Open Access Journals (Sweden)

    Wim Degrave

    1997-11-01

    Since the start of the human genome project, a great number of genome projects on other "model" organisms have been initiated, some of them already completed. Several initiatives have also been started on parasite genomes, mainly through support from WHO/TDR, involving North-South and South-South collaborations, and there are great hopes that these initiatives will lead to new tools for disease control and prevention, as well as to the establishment of genomic research technology in developing countries. The Trypanosoma cruzi genome project, using the clone CL-Brener as starting point, has made considerable progress through the concerted action of more than 20 laboratories, most of them in the South. A brief overview of the current state of the project is given.

  13. An epigenetic toolkit allows for diverse genome architectures in eukaryotes.

    Science.gov (United States)

    Maurer-Alcalá, Xyrus X; Katz, Laura A

    2015-12-01

    Genome architecture varies considerably among eukaryotes in terms of both size and structure (e.g. distribution of sequences within the genome, elimination of DNA during formation of somatic nuclei). The diversity in eukaryotic genome architectures and the dynamic processes are only possible due to the well-developed epigenetic toolkit, which probably existed in the Last Eukaryotic Common Ancestor (LECA). This toolkit may have arisen as a means of navigating the genomic conflict that arose from the expansion of transposable elements within the ancestral eukaryotic genome. This toolkit has been coopted to support the dynamic nature of genomes in lineages across the eukaryotic tree of life. Here we highlight how the changes in genome architecture in diverse eukaryotes are regulated by epigenetic processes, such as DNA elimination, genome rearrangements, and adaptive changes to genome architecture. The ability to epigenetically modify and regulate genomes has contributed greatly to the diversity of eukaryotes observed today.

  14. Genomic Encyclopedia of Type Strains, Phase I: The one thousand microbial genomes (KMG-I) project.

    Science.gov (United States)

    Kyrpides, Nikos C; Woyke, Tanja; Eisen, Jonathan A; Garrity, George; Lilburn, Timothy G; Beck, Brian J; Whitman, William B; Hugenholtz, Phil; Klenk, Hans-Peter

    2014-06-15

    The Genomic Encyclopedia of Bacteria and Archaea (GEBA) project was launched by the JGI in 2007 as a pilot project with the objective of sequencing 250 bacterial and archaeal genomes. The two major goals of that project were (a) to test the hypothesis that there are many benefits to the use of the phylogenetic diversity of organisms in the tree of life as a primary criterion for generating their genome sequence and (b) to develop the necessary framework, technology and organization for large-scale sequencing of microbial isolate genomes. While the GEBA pilot project has not yet been entirely completed, both of the original goals have already been successfully accomplished, leading the way for the next phase of the project. Here we propose taking the GEBA project to the next level, by generating high quality draft genomes for 1,000 bacterial and archaeal strains. This represents a combined 16-fold increase in both scale and speed as compared to the GEBA pilot project (250 isolate genomes in 4+ years). We will follow a similar approach for organism selection and sequencing prioritization as was done for the GEBA pilot project (i.e. phylogenetic novelty, availability and growth of cultures of type strains and DNA extraction capability), focusing on type strains as this ensures reproducibility of our results and provides the strongest linkage between genome sequences and other knowledge about each strain. In turn, this project will constitute a pilot phase of a larger effort that will target the genome sequences of all available type strains of the Bacteria and Archaea.

  15. The Chlamydomonas genome project: a decade on

    Science.gov (United States)

    Blaby, Ian K.; Blaby-Haas, Crysten; Tourasse, Nicolas; Hom, Erik F. Y.; Lopez, David; Aksoy, Munevver; Grossman, Arthur; Umen, James; Dutcher, Susan; Porter, Mary; King, Stephen; Witman, George; Stanke, Mario; Harris, Elizabeth H.; Goodstein, David; Grimwood, Jane; Schmutz, Jeremy; Vallon, Olivier; Merchant, Sabeeha S.; Prochnik, Simon

    2014-01-01

    The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis and micronutrient homeostasis. Ten years since its genome project was initiated, an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the “omics” era. Housed at Phytozome, the Joint Genome Institute’s (JGI) plant genomics portal, the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of RNA-Seq data. Here, we present the past, present and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement, discuss resources for gene annotations, functional predictions and locus ID mapping between versions and, importantly, outline a standardized framework for naming genes. PMID:24950814

  16. Genomic Prediction from Whole Genome Sequence in Livestock: The 1000 Bull Genomes Project

    DEFF Research Database (Denmark)

    Hayes, Benjamin J; MacLeod, Iona M; Daetwyler, Hans D

    Advantages of using whole genome sequence data to predict genomic estimated breeding values (GEBV) include better persistence of accuracy of GEBV across generations and more accurate GEBV across breeds. The 1000 Bull Genomes Project provides a database of whole genome sequenced key ancestor bulls...

  17. Genomic Diversity and the Microenvironment as Drivers of Progression in DCIS

    Science.gov (United States)

    2015-10-01

    microenvironment, mammographic biomarkers 3. ACCOMPLISHMENTS What were the major goals of the project? Aim 1. Determine whether genetic diversity...of genetic diversity, microenvironmental diversity, and/or mammographic biomarkers can be used to predict which DCIS tumors are most likely to...series of pilot experiments to determine the best resource (Washington University) that we will use to perform the genomic sequencing of our tumors. We

  18. High-Diversity Genes in the Arabidopsis Genome

    OpenAIRE

    Cork, Jennifer M.; Purugganan, Michael D.

    2005-01-01

    High-diversity genes represent an important class of loci in organismal genomes. Since elevated levels of nucleotide variation are a key component of the molecular signature for balancing selection or local adaptation, high-diversity genes may represent loci whose alleles are selectively maintained as balanced polymorphisms. Comparison of 4300 random shotgun sequence fragments of the Arabidopsis thaliana Ler ecotype genome with the whole genomic sequence of the Col-0 ecotype identified 60 gen...

  19. The life cycle of a genome project: perspectives and guidelines inspired by insect genome projects.

    Science.gov (United States)

    Papanicolaou, Alexie

    2016-01-01

    Many research programs on non-model species biology have been empowered by genomics. In turn, genomics is underpinned by a reference sequence and ancillary information created by so-called "genome projects". The most reliable genome projects are the ones created as part of an active research program and designed to address specific questions, but their life extends past publication. In this opinion paper I outline four key insights that have facilitated maintaining genomic communities: the key role of computational capability, the iterative process of building genomic resources, the value of community participation and the importance of manual curation. Taken together, these ideas can and do ensure the longevity of genome projects, and the growing non-model species community can use them to focus a discussion with regard to its future genomic infrastructure.

  20. Genomics:GTL project quarterly report April 2005.

    Energy Technology Data Exchange (ETDEWEB)

    Rintoul, Mark Daniel; Martino, Anthony A.; Palenik, Brian; Heffelfinger, Grant S.; Xu, Ying; Geist, Al; Gorin, Andrey

    2005-11-01

    This SAND report provides the technical progress through April 2005 of the Sandia-led project, "Carbon Sequestration in Synechococcus Sp.: From Molecular Machines to Hierarchical Modeling", funded by the DOE Office of Science Genomics:GTL Program. Understanding, predicting, and perhaps manipulating carbon fixation in the oceans has long been a major focus of biological oceanography and has more recently been of interest to a broader audience of scientists and policy makers. It is clear that the oceanic sinks and sources of CO2 are important terms in the global environmental response to anthropogenic atmospheric inputs of CO2 and that oceanic microorganisms play a key role in this response. However, the relationship between this global phenomenon and the biochemical mechanisms of carbon fixation in these microorganisms is poorly understood. In this project, we will investigate the carbon sequestration behavior of Synechococcus Sp., an abundant marine cyanobacterium known to be important to environmental responses to carbon dioxide levels, through experimental and computational methods. This project is a combined experimental and computational effort with emphasis on developing and applying new computational tools and methods. Our experimental effort will provide the biology and data to drive the computational efforts and include significant investment in developing new experimental methods for uncovering protein partners, characterizing protein complexes, and identifying new binding domains. We will also develop and apply new data measurement and statistical methods for analyzing microarray experiments. Computational tools will be essential to our efforts to discover and characterize the function of the molecular machines of Synechococcus. To this end, molecular simulation methods will be coupled with knowledge discovery from diverse biological data sets for high-throughput discovery and characterization of protein-protein complexes. In

  1. The African Genome Variation Project shapes medical genetics in Africa.

    Science.gov (United States)

    Gurdasani, Deepti; Carstensen, Tommy; Tekola-Ayele, Fasil; Pagani, Luca; Tachmazidou, Ioanna; Hatzikotoulas, Konstantinos; Karthikeyan, Savita; Iles, Louise; Pollard, Martin O; Choudhury, Ananyo; Ritchie, Graham R S; Xue, Yali; Asimit, Jennifer; Nsubuga, Rebecca N; Young, Elizabeth H; Pomilla, Cristina; Kivinen, Katja; Rockett, Kirk; Kamali, Anatoli; Doumatey, Ayo P; Asiki, Gershim; Seeley, Janet; Sisay-Joof, Fatoumatta; Jallow, Muminatou; Tollman, Stephen; Mekonnen, Ephrem; Ekong, Rosemary; Oljira, Tamiru; Bradman, Neil; Bojang, Kalifa; Ramsay, Michele; Adeyemo, Adebowale; Bekele, Endashaw; Motala, Ayesha; Norris, Shane A; Pirie, Fraser; Kaleebu, Pontiano; Kwiatkowski, Dominic; Tyler-Smith, Chris; Rotimi, Charles; Zeggini, Eleftheria; Sandhu, Manjinder S

    2015-01-15

    Given the importance of Africa to studies of human origins and disease susceptibility, detailed characterization of African genetic diversity is needed. The African Genome Variation Project provides a resource with which to design, implement and interpret genomic studies in sub-Saharan Africa and worldwide. The African Genome Variation Project represents dense genotypes from 1,481 individuals and whole-genome sequences from 320 individuals across sub-Saharan Africa. Using this resource, we find novel evidence of complex, regionally distinct hunter-gatherer and Eurasian admixture across sub-Saharan Africa. We identify new loci under selection, including loci related to malaria susceptibility and hypertension. We show that modern imputation panels (sets of reference genotypes from which unobserved or missing genotypes in study sets can be inferred) can identify association signals at highly differentiated loci across populations in sub-Saharan Africa. Using whole-genome sequencing, we demonstrate further improvements in imputation accuracy, strengthening the case for large-scale sequencing efforts of diverse African haplotypes. Finally, we present an efficient genotype array design capturing common genetic variation in Africa.

  2. Two Tales of Prokaryotic Genomic Diversity: Escherichia coli and Halophiles

    Directory of Open Access Journals (Sweden)

    Lejla Pašić

    2014-01-01

    Prokaryotes are generally characterized by vast genomic diversity that has been shaped by mutations, horizontal gene transfer, bacteriocins and phage predation. Enormous genetic diversity has developed as a result of stresses imposed in harsh environments and the ability of microorganisms to adapt. Two examples of prokaryotic diversity are presented: diversity at the intraspecies level, exemplified by Escherichia coli, and the diversity of the hypersaline environment, with a discussion of food-related health issues and biotechnological potential.

  3. Cancer Genome Anatomy Project | Office of Cancer Genomics

    Science.gov (United States)

    The National Cancer Institute (NCI) Cancer Genome Anatomy Project (CGAP) is an online resource designed to provide the research community access to biological tissue characterization data. Request a free copy of the CGAP Website Virtual Tour CD from ocg@mail.nih.gov.

  4. [Human genomic project and human genomic haplotype map project: opportunity, challenge and strategy in stomatology].

    Science.gov (United States)

    Wu, Rui-qing; Zeng, Xin; Wang, Zhi

    2010-08-01

    The human genomic project and the international HapMap project were designed to create a genome-wide database of patterns of human genetic variation, with the expectation that these patterns would be useful for genetic association studies of common diseases and thus lead to molecular diagnosis and personalized therapy. The article briefly reviews the creation, targets and achievements of these two projects. Furthermore, the authors offer four suggestions for meeting the opportunities and challenges brought by the two projects: improved cultivation of elite researchers, cross-disciplinary integration, stronger construction of research bases, and initiation of natural key scientific projects.

  5. The Human Genome Project and Biology Education.

    Science.gov (United States)

    McInerney, Joseph D.

    1996-01-01

    Highlights the importance of the Human Genome Project in educating the public about genetics. Discusses four challenges that science educators must address: teaching for conceptual understanding, the nature of science, the personal and social impact of science and technology, and the principles of technology. Contains 45 references. (JRH)

  6. Justice and the Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Murphy, T.F.; Lappe, M. [eds.]

    1992-12-31

    Most of the essays gathered in this volume were first presented at a conference, Justice and the Human Genome, in Chicago in early November, 1991. The goal of the conference was to consider questions of justice as they are and will be raised by the Human Genome Project. To achieve its goal of identifying and elucidating the challenges of justice inherent in genomic research and its social applications, the conference drew together in one forum members from academia, medicine, and industry with interests as divergent as rate-setting for insurance, the care of newborns, and the history of ethics. The essays in this volume address a number of theoretical and practical concerns relative to the meaning of genomic research.

  7. Justice and the Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Murphy, T.F.; Lappe, M. (eds.)

    1992-01-01

    Most of the essays gathered in this volume were first presented at a conference, Justice and the Human Genome, in Chicago in early November, 1991. The goal of the conference was to consider questions of justice as they are and will be raised by the Human Genome Project. To achieve its goal of identifying and elucidating the challenges of justice inherent in genomic research and its social applications, the conference drew together in one forum members from academia, medicine, and industry with interests as divergent as rate-setting for insurance, the care of newborns, and the history of ethics. The essays in this volume address a number of theoretical and practical concerns relative to the meaning of genomic research.

  8. Large-Scale Release of Campylobacter Draft Genomes: Resources for Food Safety and Public Health from the 100K Pathogen Genome Project

    Science.gov (United States)

    Huang, Bihua C.; Storey, Dylan B.; Kong, Nguyet; Chen, Poyin; Arabyan, Narine; Gilpin, Brent; Mason, Carl; Townsend, Andrea K.; Smith, Woutrina A.; Byrne, Barbara A.; Taff, Conor C.

    2017-01-01

    Campylobacter is a food-associated bacterium and a leading cause of foodborne illness worldwide, and it is associated with poultry in the food supply. This is the initial public release of 202 Campylobacter genome sequences as part of the 100K Pathogen Genome Project. These isolates represent global genomic diversity in the Campylobacter genus. PMID:28057746

  9. A first exploration of genome size diversity in sponges.

    Science.gov (United States)

    Jeffery, Nicholas W; Jardine, Catherine B; Gregory, T Ryan

    2013-08-01

    The phyla known as early-branching lineages of animals have become the subject of increasing interest from the perspectives of genomics and evolutionary biology. Unfortunately, data on even the most fundamental properties of their genomes, such as genome size, remain very scarce. In this study, genome size estimates are reported for 75 species of sponges (phylum Porifera) representing 33 families and 12 orders, marking the first large survey of genome size diversity for an early-branching phylum. Sponge genome sizes averaged around 0.2 pg but exhibited a 17-fold range overall (0.04-0.63 pg). In addition, the results of comparisons of two methods of genome size quantification (flow cytometry and Feulgen image analysis densitometry) are presented, thereby facilitating future work on these animals. Some particularly promising avenues for future investigation are highlighted.
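
    The genome sizes above are given in picograms. A minimal sketch of how such values can be converted to base pairs and compared follows. It assumes the widely used conversion of roughly 978 Mbp per picogram, which is not stated in the abstract, and the function names are illustrative only. Using the rounded endpoints quoted above (0.04 and 0.63 pg) gives a ratio of about 15.8; the reported 17-fold range presumably reflects unrounded measurements.

```python
# Assumption: 1 pg of DNA corresponds to about 978 Mbp (a commonly used
# conversion factor; it does not come from the abstract itself).
MBP_PER_PG = 978.0


def pg_to_mbp(c_value_pg: float) -> float:
    """Convert a genome size in picograms to megabase pairs."""
    return c_value_pg * MBP_PER_PG


def fold_range(smallest: float, largest: float) -> float:
    """Ratio between the largest and smallest genome size."""
    if smallest <= 0:
        raise ValueError("smallest genome size must be positive")
    return largest / smallest


if __name__ == "__main__":
    low_pg, high_pg = 0.04, 0.63  # rounded endpoints quoted in the abstract
    print(f"smallest: {pg_to_mbp(low_pg):.0f} Mbp")
    print(f"largest:  {pg_to_mbp(high_pg):.0f} Mbp")
    print(f"fold range from rounded values: {fold_range(low_pg, high_pg):.2f}")
```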

  10. Implications of the Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Kitcher, P.

    1998-11-01

    The Human Genome Project (HGP), launched in 1991, aims to map and sequence the human genome by 2006. During the fifteen-year life of the project, it is projected that $3 billion in federal funds will be allocated to it. The ultimate aims of spending this money are to analyze the structure of human DNA, to identify all human genes, to recognize the functions of those genes, and to prepare for the biology and medicine of the twenty-first century. The following summary examines some of the implications of the program, concentrating on its scientific import and on the ethical and social problems that it raises. Its aim is to expose principles that might be used in applying the information which the HGP will generate. There is no attempt here to translate the principles into detailed proposals for legislation. Arguments and discussion can be found in the full report, but, like this summary, that report does not contain any legislative proposals.

  11. Nucleotide diversity maps reveal variation in diversity among wheat genomes and chromosomes

    Directory of Open Access Journals (Sweden)

    McGuire Patrick E

    2010-12-01

    Background: A genome-wide assessment of nucleotide diversity in a polyploid species must minimize the inclusion of homoeologous sequences into diversity estimates and reliably allocate individual haplotypes into their respective genomes. The same requirements complicate the development and deployment of single nucleotide polymorphism (SNP) markers in polyploid species. We report here a strategy that satisfies these requirements and deploy it in the sequencing of genes in cultivated hexaploid wheat (Triticum aestivum, genomes AABBDD) and wild tetraploid wheat (Triticum turgidum ssp. dicoccoides, genomes AABB) from the putative site of wheat domestication in Turkey. Data are used to assess the distribution of diversity among and within wheat genomes and to develop a panel of SNP markers for polyploid wheat. Results: Nucleotide diversity was estimated in 2,114 wheat genes and was similar between the A and B genomes and reduced in the D genome. Within a genome, diversity was diminished on some chromosomes. Low diversity was always accompanied by an excess of rare alleles. A total of 5,471 SNPs was discovered in 1,791 wheat genes. Totals of 1,271, 1,218, and 2,203 SNPs were discovered in 488, 463, and 641 genes of wheat putative diploid ancestors, T. urartu, Aegilops speltoides, and Ae. tauschii, respectively. A public database containing genome-specific primers, SNPs, and other information was constructed. A total of 987 genes with nucleotide diversity estimated in one or more of the wheat genomes was placed on an Ae. tauschii genetic map, and the map was superimposed on wheat deletion-bin maps. The agreement between the maps was assessed. Conclusions: In a young polyploid, exemplified by T. aestivum, ancestral species are the primary source of genetic diversity. Low effective recombination due to self-pollination and a genetic mechanism precluding homoeologous chromosome pairing during polyploid meiosis can lead to the loss of diversity from large
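
    Nucleotide diversity, the quantity estimated in the record above, is commonly defined as the average number of pairwise differences per site among aligned sequences. The sketch below illustrates that textbook definition on toy data. It is not the authors' pipeline, the function name is hypothetical, and it assumes gap-free sequences of equal length drawn from a single genome.

```python
from itertools import combinations


def nucleotide_diversity(sequences):
    """Average pairwise differences per site (pi) for aligned sequences.

    Assumes every sequence has the same length and contains no gaps.
    """
    n = len(sequences)
    if n < 2:
        raise ValueError("need at least two sequences")
    length = len(sequences[0])
    if length == 0 or any(len(s) != length for s in sequences):
        raise ValueError("sequences must be non-empty and equally long")

    # Count mismatching sites over every unordered pair of sequences.
    total_differences = sum(
        sum(a != b for a, b in zip(s1, s2))
        for s1, s2 in combinations(sequences, 2)
    )
    n_pairs = n * (n - 1) // 2
    return total_differences / (n_pairs * length)


if __name__ == "__main__":
    toy_alignment = ["ACGT", "ACGA", "ACGT"]  # made-up haplotypes
    print(f"pi = {nucleotide_diversity(toy_alignment):.4f}")
```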

  12. The diversity of cyanobacterial metabolism: genome analysis of multiple phototrophic microorganisms

    Directory of Open Access Journals (Sweden)

    Beck Christian

    2012-02-01

    Background: Cyanobacteria are among the most abundant organisms on Earth and represent one of the oldest and most widespread clades known in modern phylogenetics. As the only known prokaryotes capable of oxygenic photosynthesis, cyanobacteria are considered to be a promising resource for renewable fuels and natural products. Our efforts to harness the sun's energy using cyanobacteria would greatly benefit from an increased understanding of the genomic diversity across multiple cyanobacterial strains. In this respect, the advent of novel sequencing techniques and the availability of several cyanobacterial genomes offer new opportunities for understanding microbial diversity and metabolic organization and evolution in diverse environments. Results: Here, we report a whole genome comparison of multiple phototrophic cyanobacteria. We describe genetic diversity found within cyanobacterial genomes, specifically with respect to metabolic functionality. Our results are based on pair-wise comparison of protein sequences and concomitant construction of clusters of likely ortholog genes. We differentiate between core, shared and unique genes and show that the majority of genes are associated with a single genome. In contrast, genes with metabolic function are strongly overrepresented within the core genome that is common to all considered strains. The analysis of metabolic diversity within core carbon metabolism reveals parts of the metabolic networks that are highly conserved, as well as highly fragmented pathways. Conclusions: Our results have direct implications for resource allocation and further sequencing projects. It can be extrapolated that the number of newly identified genes still significantly increases with an increasing number of newly sequenced genomes. Furthermore, genome analysis of multiple phototrophic strains allows us to obtain a detailed picture of metabolic diversity that can serve as a starting point for biotechnological
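
    The distinction between core, shared and unique genes described above can be sketched as follows. This is an illustrative classification only: it assumes ortholog clusters have already been built, represents each cluster as the set of genomes in which it occurs, and uses hypothetical cluster and strain names. The abstract does not describe the authors' implementation.

```python
from collections import Counter


def classify_clusters(clusters, n_genomes):
    """Label each ortholog cluster as 'core', 'shared' or 'unique'.

    clusters maps a cluster id to the genomes that contain a member gene.
    core   = present in every genome considered
    unique = present in exactly one genome
    shared = everything in between
    """
    labels = {}
    for cluster_id, genomes in clusters.items():
        n_present = len(set(genomes))
        if n_present == n_genomes:
            labels[cluster_id] = "core"
        elif n_present == 1:
            labels[cluster_id] = "unique"
        else:
            labels[cluster_id] = "shared"
    return labels


if __name__ == "__main__":
    # Hypothetical clusters over three hypothetical strains.
    toy_clusters = {
        "cluster_1": {"strain_A", "strain_B", "strain_C"},
        "cluster_2": {"strain_A", "strain_B"},
        "cluster_3": {"strain_C"},
        "cluster_4": {"strain_A"},
    }
    labels = classify_clusters(toy_clusters, n_genomes=3)
    print(labels)
    print(Counter(labels.values()))
```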

  13. Genomic and Genetic Diversity within the Pseudomonas fluorescens Complex.

    Directory of Open Access Journals (Sweden)

    Daniel Garrido-Sanz

    The Pseudomonas fluorescens complex includes Pseudomonas strains that have been taxonomically assigned to more than fifty different species, many of which have been described as plant growth-promoting rhizobacteria (PGPR) with potential applications in biocontrol and biofertilization. So far the phylogeny of this complex has been analyzed according to phenotypic traits, 16S rDNA, MLSA and inferred by whole-genome analysis. However, since most of the type strains have not been fully sequenced and new species are frequently described, correlation between taxonomy and phylogenomic analysis is missing. In recent years, the genomes of a large number of strains have been sequenced, showing important genomic heterogeneity and providing information suitable for genomic studies that are important to understand the genomic and genetic diversity shown by strains of this complex. Based on MLSA and several whole-genome sequence-based analyses of 93 sequenced strains, we have divided the P. fluorescens complex into eight phylogenomic groups that agree with previous works based on type strains. Digital DDH (dDDH) identified 69 species and 75 subspecies within the 93 genomes. The eight groups corresponded to clustering with a threshold of 31.8% dDDH, in full agreement with our MLSA. The Average Nucleotide Identity (ANI) approach showed inconsistencies regarding the assignment to species and to the eight groups. The small core genome of 1,334 CDSs and the large pan-genome of 30,848 CDSs show the large diversity and genetic heterogeneity of the P. fluorescens complex. However, a low number of strains were enough to explain most of the CDS diversity at core and strain-specific genomic fractions. Finally, the identification and analysis of group-specific genome and the screening for distinctive characters revealed a phylogenomic distribution of traits among the groups that provided insights into biocontrol and bioremediation applications as well as their role as

  14. Genomic and Genetic Diversity within the Pseudomonas fluorescens Complex.

    Science.gov (United States)

    Garrido-Sanz, Daniel; Meier-Kolthoff, Jan P; Göker, Markus; Martín, Marta; Rivilla, Rafael; Redondo-Nieto, Miguel

    2016-01-01

    The Pseudomonas fluorescens complex includes Pseudomonas strains that have been taxonomically assigned to more than fifty different species, many of which have been described as plant growth-promoting rhizobacteria (PGPR) with potential applications in biocontrol and biofertilization. So far the phylogeny of this complex has been analyzed according to phenotypic traits, 16S rDNA, MLSA and inferred by whole-genome analysis. However, since most of the type strains have not been fully sequenced and new species are frequently described, correlation between taxonomy and phylogenomic analysis is missing. In recent years, the genomes of a large number of strains have been sequenced, showing important genomic heterogeneity and providing information suitable for genomic studies that are important to understand the genomic and genetic diversity shown by strains of this complex. Based on MLSA and several whole-genome sequence-based analyses of 93 sequenced strains, we have divided the P. fluorescens complex into eight phylogenomic groups that agree with previous works based on type strains. Digital DDH (dDDH) identified 69 species and 75 subspecies within the 93 genomes. The eight groups corresponded to clustering with a threshold of 31.8% dDDH, in full agreement with our MLSA. The Average Nucleotide Identity (ANI) approach showed inconsistencies regarding the assignment to species and to the eight groups. The small core genome of 1,334 CDSs and the large pan-genome of 30,848 CDSs show the large diversity and genetic heterogeneity of the P. fluorescens complex. However, a low number of strains were enough to explain most of the CDS diversity at core and strain-specific genomic fractions. Finally, the identification and analysis of group-specific genome and the screening for distinctive characters revealed a phylogenomic distribution of traits among the groups that provided insights into biocontrol and bioremediation applications as well as their role as PGPR.
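
    Grouping genomes by a pairwise dDDH threshold, as described above, can be illustrated with a simple single-linkage sketch: any two genomes whose dDDH value meets the threshold end up in the same group. The abstract does not say which clustering algorithm the authors used, so single linkage is an assumption here, and the strain names and dDDH values are invented for illustration. Only the 31.8% group threshold comes from the abstract; the 70% value is the commonly cited species-level convention.

```python
def cluster_by_threshold(names, similarity, threshold):
    """Single-linkage clustering of genomes by pairwise similarity.

    similarity maps unordered pairs (a, b) to a percentage value;
    pairs at or above the threshold are merged into one group.
    """
    parent = {name: name for name in names}

    def find(x):
        # Follow parent links to the group representative (path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (a, b), value in similarity.items():
        if value >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in names:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(members) for members in groups.values())


if __name__ == "__main__":
    strains = ["s1", "s2", "s3", "s4"]  # hypothetical strains
    ddh = {  # invented dDDH percentages
        ("s1", "s2"): 85.0,
        ("s1", "s3"): 35.0,
        ("s2", "s3"): 40.0,
        ("s1", "s4"): 22.0,
        ("s2", "s4"): 21.0,
        ("s3", "s4"): 20.0,
    }
    print("species-level (70%):", cluster_by_threshold(strains, ddh, 70.0))
    print("group-level (31.8%):", cluster_by_threshold(strains, ddh, 31.8))
```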

  15. The human genome project and the future of medical practice ...

    African Journals Online (AJOL)

    The human genome project and the future of medical practice. ... the planning stages of the human genome project, the technology and sequence data ... the quality of healthcare available in the resource-rich and the resource-poor countries.

  16. The 3,000 rice genomes project: new opportunities and challenges for future rice research

    OpenAIRE

    Li, Jia-Yang; Wang, Jun; Zeigler, Robert S.

    2014-01-01

    Rice is the world’s most important staple, grown by millions of small-holder farmers. Sustaining rice production relies on the intelligent use of rice diversity. The 3,000 Rice Genomes Project is a giga-dataset of publicly available genome sequences (averaging 14× depth of coverage) derived from 3,000 accessions of rice with global representation of genetic and functional diversity. The seed of these accessions is available from the International Rice Genebank Collection. Together, they are ...

  17. Global biogeography of Prochlorococcus genome diversity in the surface ocean.

    Science.gov (United States)

    Kent, Alyssa G; Dupont, Chris L; Yooseph, Shibu; Martiny, Adam C

    2016-08-01

    Prochlorococcus, the smallest known photosynthetic bacterium, is abundant in the ocean's surface layer despite large variation in environmental conditions. There are several genetically divergent lineages within Prochlorococcus and superimposed on this phylogenetic diversity is extensive gene gain and loss. The environmental role in shaping the global ocean distribution of genome diversity in Prochlorococcus is largely unknown, particularly in a framework that considers the vertical and lateral mechanisms of evolution. Here we show that Prochlorococcus field populations from a global circumnavigation harbor extensive genome diversity across the surface ocean, but this diversity is not randomly distributed. We observed a significant correspondence between phylogenetic and gene content diversity, including regional differences in both phylogenetic composition and gene content that were related to environmental factors. Several gene families were strongly associated with specific regions and environmental factors, including the identification of a set of genes related to lower nutrient and temperature regions. Metagenomic assemblies of natural Prochlorococcus genomes reinforced this association by providing linkage of genes across genomic backbones. Overall, our results show that the phylogeography in Prochlorococcus taxonomy is echoed in its genome content. Thus environmental variation shapes the functional capabilities and associated ecosystem role of the globally abundant Prochlorococcus.

  18. Origins of the Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Cook-Deegan, Robert

    1993-07-01

    The human genome project was born of technology, grew into a science bureaucracy in the US and throughout the world, and is now being transformed into a hybrid academic and commercial enterprise. The next phase of the project promises to veer more sharply toward commercial application, harnessing both the technical prowess of molecular biology and the rapidly growing body of knowledge about DNA structure to the pursuit of practical benefits. Faith that the systematic analysis of DNA structure will prove to be a powerful research tool underlies the rationale behind the genome project. The notion that most genetic information is embedded in the sequence of DNA base pairs comprising chromosomes is a central tenet. A rough analogy is to liken an organism's genetic code to computer code. The goal of the genome project, in this parlance, is to identify and catalog 75,000 or more files (genes) in the software that directs construction of a self-modifying and self-replicating system -- a living organism.

  19. Origins of the Human Genome Project

    Science.gov (United States)

    Cook-Deegan, Robert (Affiliation: Institute of Medicine, National Academy of Sciences)

    1993-07-01

    The human genome project was born of technology, grew into a science bureaucracy in the United States and throughout the world, and is now being transformed into a hybrid academic and commercial enterprise. The next phase of the project promises to veer more sharply toward commercial application, harnessing both the technical prowess of molecular biology and the rapidly growing body of knowledge about DNA structure to the pursuit of practical benefits. Faith that the systematic analysis of DNA structure will prove to be a powerful research tool underlies the rationale behind the genome project. The notion that most genetic information is embedded in the sequence of DNA base pairs comprising chromosomes is a central tenet. A rough analogy is to liken an organism's genetic code to computer code. The goal of the genome project, in this parlance, is to identify and catalog 75,000 or more files (genes) in the software that directs construction of a self-modifying and self-replicating system -- a living organism.

  20. Cancer Genomics: Diversity and Disparity Across Ethnicity and Geography.

    Science.gov (United States)

    Tan, Daniel S W; Mok, Tony S K; Rebbeck, Timothy R

    2016-01-01

    Ethnic and geographic differences in cancer incidence, prognosis, and treatment outcomes can be attributed to diversity in the inherited (germline) and somatic genome. Although international large-scale sequencing efforts are beginning to unravel the genomic underpinnings of cancer traits, much remains to be known about the underlying mechanisms and determinants of genomic diversity. Carcinogenesis is a dynamic, complex phenomenon representing the interplay between genetic and environmental factors that results in divergent phenotypes across ethnicities and geography. For example, compared with whites, there is a higher incidence of prostate cancer among Africans and African Americans, and the disease is generally more aggressive and fatal. Genome-wide association studies have identified germline susceptibility loci that may account for differences between the African and non-African patients, but the lack of availability of appropriate cohorts for replication studies and the incomplete understanding of genomic architecture across populations pose major limitations. We further discuss the transformative potential of routine diagnostic evaluation for actionable somatic alterations, using lung cancer as an example, highlighting implications of population disparities, current hurdles in implementation, and the far-reaching potential of clinical genomics in enhancing cancer prevention, diagnosis, and treatment. As we enter the era of precision cancer medicine, a concerted multinational effort is key to addressing population and genomic diversity as well as overcoming barriers and geographical disparities in research and health care delivery.

  1. Genomic diversity within the Enterobacter cloacae complex.

    Directory of Open Access Journals (Sweden)

    Armand Paauw

    BACKGROUND: Isolates of the Enterobacter cloacae complex have been increasingly isolated as nosocomial pathogens, but phenotypic identification of the E. cloacae complex is unreliable and irreproducible. Identification of species based on currently available genotyping tools is already superior to phenotypic identification, but the taxonomy of isolates belonging to this complex is cumbersome. METHODOLOGY/PRINCIPAL FINDINGS: This study shows that multilocus sequence analysis and comparative genomic hybridization based on a mixed genome array is a powerful method for studying species assignment within the E. cloacae complex. The E. cloacae complex is shown to be evolutionarily divided into two clades that are genetically distinct from each other. The younger first clade is genetically more homogenous, contains the Enterobacter hormaechei species and is the most frequently cultured Enterobacter species in hospitals. The second and older clade consists of several (sub)species that are genetically more heterogeneous. Genetic markers were identified that could discriminate between the two clades and cluster 1. CONCLUSIONS/SIGNIFICANCE: Based on genomic differences it is concluded that some previously defined (clonal and heterogenic) (sub)species of the E. cloacae complex have to be redefined because of disagreements with known or proposed nomenclature. However, further improved identification of the redefined species will be possible based on novel markers presented here.

  2. Castor bean organelle genome sequencing and worldwide genetic diversity analysis.

    Directory of Open Access Journals (Sweden)

    Maximo Rivarola

    Castor bean is an important oil-producing plant in the Euphorbiaceae family. Its high-quality oil contains up to 90% of the unusual fatty acid ricinoleate, which has many industrial and medical applications. Castor bean seeds also contain ricin, a highly toxic Type 2 ribosome-inactivating protein, which has gained relevance in recent years due to biosafety concerns. In order to gain knowledge on global genetic diversity in castor bean and to ultimately help the development of breeding and forensic tools, we carried out an extensive chloroplast sequence diversity analysis. Taking advantage of the recently published genome sequence of castor bean, we assembled the chloroplast and mitochondrion genomes extracting selected reads from the available whole genome shotgun reads. Using the chloroplast reference genome we used the methylation filtration technique to readily obtain draft genome sequences of 7 geographically and genetically diverse castor bean accessions. These sequence data were used to identify single nucleotide polymorphism markers and phylogenetic analysis resulted in the identification of two major clades that were not apparent in previous population genetic studies using genetic markers derived from nuclear DNA. Two distinct sub-clades could be defined within each major clade and large-scale genotyping of castor bean populations worldwide confirmed previously observed low levels of genetic diversity and showed a broad geographic distribution of each sub-clade.

  3. Castor bean organelle genome sequencing and worldwide genetic diversity analysis.

    Science.gov (United States)

    Rivarola, Maximo; Foster, Jeffrey T; Chan, Agnes P; Williams, Amber L; Rice, Danny W; Liu, Xinyue; Melake-Berhan, Admasu; Huot Creasy, Heather; Puiu, Daniela; Rosovitz, M J; Khouri, Hoda M; Beckstrom-Sternberg, Stephen M; Allan, Gerard J; Keim, Paul; Ravel, Jacques; Rabinowicz, Pablo D

    2011-01-01

    Castor bean is an important oil-producing plant in the Euphorbiaceae family. Its high-quality oil contains up to 90% of the unusual fatty acid ricinoleate, which has many industrial and medical applications. Castor bean seeds also contain ricin, a highly toxic Type 2 ribosome-inactivating protein, which has gained relevance in recent years due to biosafety concerns. In order to gain knowledge on global genetic diversity in castor bean and to ultimately help the development of breeding and forensic tools, we carried out an extensive chloroplast sequence diversity analysis. Taking advantage of the recently published genome sequence of castor bean, we assembled the chloroplast and mitochondrion genomes extracting selected reads from the available whole genome shotgun reads. Using the chloroplast reference genome we used the methylation filtration technique to readily obtain draft genome sequences of 7 geographically and genetically diverse castor bean accessions. These sequence data were used to identify single nucleotide polymorphism markers and phylogenetic analysis resulted in the identification of two major clades that were not apparent in previous population genetic studies using genetic markers derived from nuclear DNA. Two distinct sub-clades could be defined within each major clade and large-scale genotyping of castor bean populations worldwide confirmed previously observed low levels of genetic diversity and showed a broad geographic distribution of each sub-clade.

  4. Castor Bean Organelle Genome Sequencing and Worldwide Genetic Diversity Analysis

    Science.gov (United States)

    Chan, Agnes P.; Williams, Amber L.; Rice, Danny W.; Liu, Xinyue; Melake-Berhan, Admasu; Huot Creasy, Heather; Puiu, Daniela; Rosovitz, M. J.; Khouri, Hoda M.; Beckstrom-Sternberg, Stephen M.; Allan, Gerard J.; Keim, Paul; Ravel, Jacques; Rabinowicz, Pablo D.

    2011-01-01

    Castor bean is an important oil-producing plant in the Euphorbiaceae family. Its high-quality oil contains up to 90% of the unusual fatty acid ricinoleate, which has many industrial and medical applications. Castor bean seeds also contain ricin, a highly toxic Type 2 ribosome-inactivating protein, which has gained relevance in recent years due to biosafety concerns. In order to gain knowledge on global genetic diversity in castor bean and to ultimately help the development of breeding and forensic tools, we carried out an extensive chloroplast sequence diversity analysis. Taking advantage of the recently published genome sequence of castor bean, we assembled the chloroplast and mitochondrion genomes extracting selected reads from the available whole genome shotgun reads. Using the chloroplast reference genome we used the methylation filtration technique to readily obtain draft genome sequences of 7 geographically and genetically diverse castor bean accessions. These sequence data were used to identify single nucleotide polymorphism markers and phylogenetic analysis resulted in the identification of two major clades that were not apparent in previous population genetic studies using genetic markers derived from nuclear DNA. Two distinct sub-clades could be defined within each major clade and large-scale genotyping of castor bean populations worldwide confirmed previously observed low levels of genetic diversity and showed a broad geographic distribution of each sub-clade. PMID:21750729

  5. Genomics and transcriptomics across the diversity of the Nematoda.

    Science.gov (United States)

    Blaxter, M; Kumar, S; Kaur, G; Koutsovoulos, G; Elsworth, B

    2012-01-01

    The diversity of biology in nematodes is reflected in the diversity of their genomes. Parasitic species in particular have evolved mechanisms to invade and outwit their hosts, and these offer opportunities for the development of control measures. Genomic analyses can reveal the molecular underpinnings of phenotypes such as parasitism and thus, initiate and support research programmes that explore the manipulation of host and parasite physiologies to achieve favourable outcomes. Wide sampling across nematode diversity allows phylogenetically informed formulation of research hypotheses, identification of core features shared by all species or important evolutionary novelties present in isolated clades. Many nematode species have been investigated through the use of the expressed sequence tag approach, which samples from the transcribed genome. Gene catalogues generated in this way can be explored to reveal the patterns of expression associated with parasitism and candidates for testing as drug targets or vaccine components. Analysis environments, such as NEMBASE facilitate exploitation of these data. The development of new high-throughput DNA-sequencing technologies has facilitated transcriptomic and genomic approaches to parasite biology. Whole genome sequencing offers more complete catalogues of genes and assists a systems approach to phenotype dissection. These efforts are being coordinated through the 959 Nematode Genomes initiative.

  6. The Global Invertebrate Genomics Alliance (GIGA). 2014. Developing Community Resources to Study Diverse Invertebrate Genomes

    NARCIS (Netherlands)

    Pomponi, S.A.

    2014-01-01

    Over 95% of all metazoan (animal) species comprise the “invertebrates,” but very few genomes from these organisms have been sequenced. We have, therefore, formed a “Global Invertebrate Genomics Alliance” (GIGA). Our intent is to build a collaborative network of diverse scientists to tackle major cha

  7. The Global Invertebrate Genomics Alliance (GIGA). 2014. Developing Community Resources to Study Diverse Invertebrate Genomes

    NARCIS (Netherlands)

    Pomponi, S.A.

    2014-01-01

    Over 95% of all metazoan (animal) species comprise the “invertebrates,” but very few genomes from these organisms have been sequenced. We have, therefore, formed a “Global Invertebrate Genomics Alliance” (GIGA). Our intent is to build a collaborative network of diverse scientists to tackle major

  8. An overview of the human genome project

    Energy Technology Data Exchange (ETDEWEB)

    Batzer, M.A.

    1994-01-01

    The human genome project is one of the most ambitious scientific projects to date, with the ultimate goal being a nucleotide sequence for all four billion bases of human DNA. In the process of determining the nucleotide sequence for each base, the location, function, and regulatory regions of the estimated 100,000 human genes will be identified. The genome project itself relies upon maps of the human genetic code derived from several different levels of resolution. Genetic linkage analysis provides a low resolution genome map. The information for genetic linkage maps is derived from the analysis of chromosome specific markers such as Sequence Tagged Sites (STSs), Variable Number of Tandem Repeats (VNTRs) or other polymorphic (highly informative) loci in a number of different families. Using this information the location of an unknown disease gene can be limited to a region comprised of one million base pairs of DNA or less. After this point, one must construct or have access to a physical map of the region of interest. Physical mapping involves the construction of an ordered overlapping (contiguous) set of recombinant DNA clones. These clones may be derived from a number of different vectors including cosmids, Bacterial Artificial Chromosomes (BACs), P1 derived Artificial Chromosomes (PACs), somatic cell hybrids, or Yeast Artificial Chromosomes (YACs). The ultimate goal for physical mapping is to establish a completely overlapping (contiguous) set of clones for the entire genome. After a gene or region of interest has been localized using physical mapping the nucleotide sequence is determined. The overlap between genetic mapping, physical mapping and DNA sequencing has proven to be a powerful tool for the isolation of disease genes through positional cloning.
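
    The idea of an ordered, overlapping set of clones can be sketched in code. The example below is a deliberate simplification: it assumes each clone's start and end coordinates are already known, whereas real physical mapping has to infer overlaps from shared markers or restriction fingerprints. The clone names and coordinates are invented for illustration.

```python
def build_contigs(clones):
    """Group clones into contigs of overlapping intervals.

    clones is a list of (name, start, end) tuples. Clones are ordered by
    start position; a clone joins the current contig if it starts at or
    before the contig's current end, otherwise it opens a new contig.
    """
    contigs = []
    for name, start, end in sorted(clones, key=lambda clone: clone[1]):
        if contigs and start <= contigs[-1]["end"]:
            contigs[-1]["clones"].append(name)
            contigs[-1]["end"] = max(contigs[-1]["end"], end)
        else:
            contigs.append({"start": start, "end": end, "clones": [name]})
    return contigs


if __name__ == "__main__":
    # Hypothetical clones with coordinates in kilobases.
    toy_clones = [
        ("YAC_1", 400, 900),
        ("BAC_1", 0, 150),
        ("cosmid_1", 850, 890),
        ("BAC_2", 100, 260),
    ]
    for contig in build_contigs(toy_clones):
        print(contig)
```

    A region is fully covered by the physical map once this procedure yields a single contig spanning it; gaps between contigs mark places where further clones are needed.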

  9. Exuberant innovation: The Human Genome Project

    CERN Document Server

    Gisler, Monika; Woodard, Ryan

    2010-01-01

    We present a detailed synthesis of the development of the Human Genome Project (HGP) from 1986 to 2003 in order to test the "social bubble" hypothesis that strong social interactions between enthusiastic supporters of the HGP weaved a network of reinforcing feedbacks that led to a widespread endorsement and extraordinary commitment by those involved in the project, beyond what would be rationalized by a standard cost-benefit analysis in the presence of extraordinary uncertainties and risks. The vigorous competition and race between the initially public project and several private initiatives is argued to support the social bubble hypothesis. We also present quantitative analyses of the concomitant financial bubble concentrated on the biotech sector. Confirmation of this hypothesis is offered by the present consensus that it will take decades to exploit the fruits of the HGP, via a slow and arduous process aiming at disentangling the extraordinary complexity of the human complex body. The HGP has ushered other...

  10. Natural Product Biosynthetic Diversity and Comparative Genomics of the Cyanobacteria.

    Science.gov (United States)

    Dittmann, Elke; Gugger, Muriel; Sivonen, Kaarina; Fewer, David P

    2015-10-01

    Cyanobacteria are an ancient lineage of slow-growing photosynthetic bacteria and a prolific source of natural products with intricate chemical structures and potent biological activities. The bulk of these natural products are known from just a handful of genera. Recent efforts have elucidated the mechanisms underpinning the biosynthesis of a diverse array of natural products from cyanobacteria. Many of the biosynthetic mechanisms are unique to cyanobacteria or rarely described from other organisms. Advances in genome sequence technology have precipitated a deluge of genome sequences for cyanobacteria. This makes it possible to link known natural products to biosynthetic gene clusters but also accelerates the discovery of new natural products through genome mining. These studies demonstrate that cyanobacteria encode a huge variety of cryptic gene clusters for the production of natural products, and the known chemical diversity is likely to be just a fraction of the true biosynthetic capabilities of this fascinating and ancient group of organisms.

  11. Whole mitochondrial genome genetic diversity in an Estonian population sample.

    Science.gov (United States)

    Stoljarova, Monika; King, Jonathan L; Takahashi, Maiko; Aaspõllu, Anu; Budowle, Bruce

    2016-01-01

    Mitochondrial DNA is a useful marker for population studies, human identification, and forensic analysis. Commonly used hypervariable regions I and II (HVI/HVII) were reported to contain as little as 25% of mitochondrial DNA variants and therefore the majority of power of discrimination of mitochondrial DNA resides in the coding region. Massively parallel sequencing technology enables entire mitochondrial genome sequencing. In this study, buccal swabs were collected from 114 unrelated Estonians and whole mitochondrial genome sequences were generated using the Illumina MiSeq system. The results are concordant with previous mtDNA control region reports of high haplogroup HV and U frequencies (47.4 and 23.7% in this study, respectively) in the Estonian population. One sample with the Northern Asian haplogroup D was detected. The genetic diversity of the Estonian population sample was estimated to be 99.67 and 95.85%, for mtGenome and HVI/HVII data, respectively. The random match probability for mtGenome data was 1.20 versus 4.99% for HVI/HVII. The nucleotide mean pairwise difference was 27 ± 11 for mtGenome and 7 ± 3 for HVI/HVII data. These data describe the genetic diversity of the Estonian population sample and emphasize the power of discrimination of the entire mitochondrial genome over the hypervariable regions.
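
    The two summary statistics quoted in this abstract can be illustrated with a short sketch. The abstract does not state which formulas were used; the code below assumes the estimators commonly used for haplotype data, namely random match probability as the sum of squared haplotype frequencies and genetic diversity as n/(n-1) times one minus that sum. The function names and the toy sample are hypothetical.

```python
from collections import Counter


def random_match_probability(haplotypes):
    """Sum of squared haplotype frequencies (assumed estimator)."""
    n = len(haplotypes)
    if n == 0:
        raise ValueError("need at least one haplotype")
    counts = Counter(haplotypes)
    return sum((count / n) ** 2 for count in counts.values())


def genetic_diversity(haplotypes):
    """Unbiased haplotype diversity: n/(n-1) * (1 - sum of p_i^2)."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two haplotypes")
    return n / (n - 1) * (1.0 - random_match_probability(haplotypes))


# Toy sample (hypothetical labels): one haplotype seen twice, two seen once.
sample = ["H1", "H1", "H2", "H3"]
rmp = random_match_probability(sample)
diversity = genetic_diversity(sample)
print(f"RMP = {rmp:.3f}, diversity = {diversity:.3f}")
```

    Under these assumed formulas the reported figures are mutually consistent: with n = 114 and a random match probability of 1.20%, 114/113 × (1 − 0.0120) ≈ 0.9967, and with 4.99% the same expression gives about 0.9585.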

  12. A Genomic Encyclopedia of the Root Nodule Bacteria: assessing genetic diversity through a systematic biogeographic survey.

    Science.gov (United States)

    Reeve, Wayne; Ardley, Julie; Tian, Rui; Eshragi, Leila; Yoon, Je Won; Ngamwisetkun, Pinyaruk; Seshadri, Rekha; Ivanova, Natalia N; Kyrpides, Nikos C

    2015-01-01

    Root nodule bacteria are free-living soil bacteria, belonging to diverse genera within the Alphaproteobacteria and Betaproteobacteria, that have the capacity to form nitrogen-fixing symbioses with legumes. The symbiosis is specific and is governed by signaling molecules produced from both host and bacteria. Sequencing of several model RNB genomes has provided valuable insights into the genetic basis of symbiosis. However, the small number of sequenced RNB genomes available does not currently reflect the phylogenetic diversity of RNB, or the variety of mechanisms that lead to symbiosis in different legume hosts. This prevents a broad understanding of symbiotic interactions and the factors that govern the biogeography of host-microbe symbioses. Here, we outline a proposal to expand the number of sequenced RNB strains, which aims to capture this phylogenetic and biogeographic diversity. Through the Vavilov centers of diversity (Proposal ID: 231) and GEBA-RNB (Proposal ID: 882) projects we will sequence 107 RNB strains, isolated from diverse legume hosts in various geographic locations around the world. The nominated strains belong to nine of the 16 currently validly described RNB genera. They include 13 type strains, as well as elite inoculant strains of high commercial importance. These projects will strongly support systematic sequence-based studies of RNB and contribute to our understanding of the effects of biogeography on the evolution of different species of RNB, as well as the mechanisms that determine the specificity and effectiveness of nodulation and symbiotic nitrogen fixation by RNB with diverse legume hosts.

  13. Genomic diversity of bacteriophages infecting the fish pathogen Flavobacterium psychrophilum.

    Science.gov (United States)

    Castillo, Daniel; Middelboe, Mathias

    2016-12-01

    Bacteriophages infecting the fish pathogen Flavobacterium psychrophilum can potentially be used to prevent and control outbreaks of this bacterium in salmonid aquaculture. However, the application of bacteriophages in disease control requires detailed knowledge on their genetic composition. To explore the diversity of F. psychrophilum bacteriophages, we have analyzed the complete genome sequences of 17 phages isolated from two distant geographic areas (Denmark and Chile), including the previously characterized temperate bacteriophage 6H. Phage genome size ranged from 39 302 to 89 010 bp with a G+C content of 27%-32%. None of the bacteriophages isolated in Denmark contained genes associated with lysogeny, whereas the Chilean isolates were all putative temperate phages and similar to bacteriophage 6H. Comparative genome analysis showed that phages grouped in three different genetic clusters based on genetic composition and gene content, indicating a limited genetic diversity of F. psychrophilum-specific bacteriophages. However, amino acid sequence dissimilarity (25%) was found in putative structural proteins, which could be related to the host specificity determinants. This study represents the first analysis of genomic diversity and composition among bacteriophages infecting the fish pathogen F. psychrophilum and discusses the implications for the application of phages in disease control. © FEMS 2016.
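
    The G+C content quoted above is the percentage of guanine and cytosine bases in a genome sequence. A minimal sketch of that calculation follows; the function name and the example sequences are hypothetical, and ambiguous bases such as N are excluded from the denominator, which is one common convention rather than necessarily the one the authors used.

```python
def gc_content(sequence):
    """Return the G+C percentage of a DNA sequence, ignoring non-ACGT symbols."""
    unambiguous = [base for base in sequence.upper() if base in "ACGT"]
    if not unambiguous:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc_count = sum(1 for base in unambiguous if base in "GC")
    return 100.0 * gc_count / len(unambiguous)


# Hypothetical toy sequences, not phage data.
print(round(gc_content("ATGCAT"), 2))   # 2 of 6 bases are G or C
print(round(gc_content("aattggcc"), 2)) # case-insensitive
```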

  14. Freedom and Responsibility in Synthetic Genomics: The Synthetic Yeast Project

    OpenAIRE

    Sliva, Anna; Yang, Huanming; Boeke, Jef D.; Debra J. H. Mathews

    2015-01-01

    First introduced in 2011, the Synthetic Yeast Genome (Sc2.0) Project is a large international synthetic genomics project that will culminate in the first eukaryotic cell (Saccharomyces cerevisiae) with a fully synthetic genome. With collaborators from across the globe and from a range of institutions spanning from do-it-yourself biology (DIYbio) to commercial enterprises, it is important that all scientists working on this project are cognizant of the ethical and policy issues associated with...

  15. A genomic scale map of genetic diversity in Trypanosoma cruzi

    Directory of Open Access Journals (Sweden)

    Ackermann Alejandro A

    2012-12-01

    Background: Trypanosoma cruzi, the causal agent of Chagas Disease, affects more than 16 million people in Latin America. The clinical outcome of the disease results from a complex interplay between environmental factors and the genetic background of both the human host and the parasite. However, knowledge of the genetic diversity of the parasite is currently limited to a number of highly studied loci. The availability of a number of genomes from different evolutionary lineages of T. cruzi provides an unprecedented opportunity to look at the genetic diversity of the parasite at a genomic scale. Results: Using a bioinformatic strategy, we have clustered T. cruzi sequence data available in the public domain and obtained multiple sequence alignments in which one or two alleles from the reference CL-Brener were included. These data cover 4 major evolutionary lineages (DTUs): TcI, TcII, TcIII, and the hybrid TcVI. Using this set of alignments we have identified 288,957 high quality single nucleotide polymorphisms and 1,480 indels. In a reduced re-sequencing study we were able to validate ~97% of high-quality SNPs identified in 47 loci. Analysis of how these changes affect encoded protein products showed a 0.77 ratio of synonymous to non-synonymous changes in the T. cruzi genome. We observed 113 changes that introduce or remove a stop codon, some causing significant functional changes, and a number of tri-allelic and tetra-allelic SNPs that could be exploited in strain typing assays. Based on an analysis of the observed nucleotide diversity we show that the T. cruzi genome contains a core set of genes that are under apparent purifying selection. Interestingly, orthologs of known druggable targets show statistically significant lower nucleotide diversity values. Conclusions: This study provides the first look at the genetic diversity of T. cruzi at a genomic scale. The analysis covers an estimated ~60% of the genetic diversity present in the

  16. Correlation exploration of metabolic and genomic diversity in rice

    Directory of Open Access Journals (Sweden)

    Shinozaki Kazuo

    2009-12-01

    Background: It is essential to elucidate the relationship between metabolic and genomic diversity to understand the genetic regulatory networks associated with the changing metabolo-phenotype among natural variation and/or populations. Recent innovations in metabolomics technologies allow us to grasp the comprehensive features of the metabolome. Metabolite quantitative trait analysis is a key approach for the identification of genetic loci involved in metabolite variation using segregated populations. Although several attempts have been made to find correlative relationships between genetic and metabolic diversity among natural populations in various organisms, it is still unclear whether it is possible to discover such correlations between each metabolite and the polymorphisms found at each chromosomal location. To assess the correlative relationship between the metabolic and genomic diversity found in rice accessions, we compared the distance matrices for these two "omics" patterns in the rice accessions. Results: We selected 18 accessions from the world rice collection based on their population structure. To determine the genomic diversity of the rice genome, we genotyped 128 restriction fragment length polymorphism (RFLP) markers to calculate the genetic distance among the accessions. To identify the variations in the metabolic fingerprint, a soluble extract from the seed grain of each accession was analyzed with one dimensional 1H-nuclear magnetic resonance (NMR). We found no correlation between global metabolic diversity and the phylogenetic relationships among the rice accessions (rs = 0.14) by analyzing the distance matrices (calculated from the pattern of the metabolic fingerprint in the 4.29- to 0.71-ppm 1H chemical shift and the genetic distance on the basis of the RFLP markers). However, local correlation analysis between the distance matrices (derived from each 0.04-ppm integral region of the 1H chemical shift against genetic
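
    The comparison of two distance matrices described in this abstract can be sketched as a Spearman rank correlation over the pairwise distances. This is an illustrative sketch only: the abstract does not give the exact procedure, all function names and the toy matrices are hypothetical, and the code is written in plain Python so that it needs no external library.

```python
def upper_triangle(matrix):
    """Flatten the strict upper triangle of a square symmetric matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def average_ranks(values):
    """Assign 1-based ranks, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda k: values[k])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (var_x * var_y) ** 0.5


def spearman_between_distance_matrices(d1, d2):
    """Spearman correlation between the pairwise distances of two matrices."""
    return pearson(average_ranks(upper_triangle(d1)),
                   average_ranks(upper_triangle(d2)))


# Toy 3x3 matrices (hypothetical values) whose distances rank identically.
genetic = [[0, 1, 2], [1, 0, 3], [2, 3, 0]]
metabolic = [[0, 10, 20], [10, 0, 40], [20, 40, 0]]
print(spearman_between_distance_matrices(genetic, metabolic))
```

    Because the entries of a distance matrix are not independent observations, the significance of such a coefficient is usually assessed with a permutation procedure such as a Mantel test rather than a standard correlation p-value.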

  17. The surprising diversity of clostridial hydrogenases: a comparative genomic perspective

    OpenAIRE

    Calusinska, Magdalena; Happe, Thomas; Joris, Bernard; Wilmotte, Annick

    2010-01-01

    Among the large variety of micro-organisms capable of fermentative hydrogen production, strict anaerobes such as members of the genus Clostridium are the most widely studied. They can produce hydrogen by a reversible reduction of protons accumulated during fermentation to dihydrogen, a reaction which is catalysed by hydrogenases. Sequenced genomes provide completely new insights into the diversity of clostridial hydrogenases. Building on previous reports, we found that [FeFe] hydrogenases are...

  18. Genome Diversity and Evolution in the Budding Yeasts (Saccharomycotina).

    Science.gov (United States)

    Dujon, Bernard A; Louis, Edward J

    2017-06-01

    Considerable progress in our understanding of yeast genomes and their evolution has been made over the last decade with the sequencing, analysis, and comparisons of numerous species, strains, or isolates of diverse origins. The role played by yeasts in natural environments as well as in artificial manufactures, combined with the importance of some species as model experimental systems sustained this effort. At the same time, their enormous evolutionary diversity (there are yeast species in every subphylum of Dikarya) sparked curiosity but necessitated further efforts to obtain appropriate reference genomes. Today, yeast genomes have been very informative about basic mechanisms of evolution, speciation, hybridization, domestication, as well as about the molecular machineries underlying them. They are also irreplaceable to investigate in detail the complex relationship between genotypes and phenotypes with both theoretical and practical implications. This review examines these questions at two distinct levels offered by the broad evolutionary range of yeasts: inside the best-studied Saccharomyces species complex, and across the entire and diversified subphylum of Saccharomycotina. While obviously revealing evolutionary histories at different scales, data converge to a remarkably coherent picture in which one can estimate the relative importance of intrinsic genome dynamics, including gene birth and loss, vs. horizontal genetic accidents in the making of populations. The facility with which novel yeast genomes can now be studied, combined with the already numerous available reference genomes, offer privileged perspectives to further examine these fundamental biological questions using yeasts both as eukaryotic models and as fungi of practical importance. Copyright © 2017 by the Genetics Society of America.

  19. Relationship between metabolic and genomic diversity in sesame (Sesamum indicum L.)

    National Research Council Canada - National Science Library

    Laurentin, Hernán; Ratzinger, Astrid; Karlovsky, Petr

    2008-01-01

    ... systematically in diversity surveys. Our objective in this study was to assess metabolic diversity in sesame by nontargeted metabolic profiling and elucidate the relationship between metabolic and genome diversity in this crop...

  20. Remarkable diversity of endogenous viruses in a crustacean genome.

    Science.gov (United States)

    Thézé, Julien; Leclercq, Sébastien; Moumen, Bouziane; Cordaux, Richard; Gilbert, Clément

    2014-08-01

    Recent studies in paleovirology have uncovered myriads of endogenous viral elements (EVEs) integrated in the genome of their eukaryotic hosts. These fragments result from endogenization, that is, integration of the viral genome into the host germline genome followed by vertical inheritance. So far, most studies have used a virus-centered approach, whereby endogenous copies of a particular group of viruses were searched in all available sequenced genomes. Here, we follow a host-centered approach whereby the genome of a given species is comprehensively screened for the presence of EVEs using all available complete viral genomes as queries. Our analyses revealed that 54 EVEs corresponding to 10 different viral lineages belonging to 5 viral families (Bunyaviridae, Circoviridae, Parvoviridae, and Totiviridae) and one viral order (Mononegavirales) became endogenized in the genome of the isopod crustacean Armadillidium vulgare. We show that viral endogenization occurred recurrently during the evolution of isopods and that A. vulgare viral lineages were involved in multiple host switches that took place between widely divergent taxa. Furthermore, 30 A. vulgare EVEs have uninterrupted open reading frames, suggesting they result from recent endogenization of viruses likely to be currently infecting isopod populations. Overall, our work shows that isopods have been and are still infected by a large variety of viruses. It also extends the host range of several families of viruses and brings new insights into their evolution. More generally, our results underline the power of paleovirology in characterizing the viral diversity currently infecting eukaryotic taxa. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  1. International network of cancer genome projects

    NARCIS (Netherlands)

    Hudson, Thomas J.; Anderson, Warwick; Aretz, Axel; Barker, Anna D.; Bell, Cindy; Bernabe, Rosa R.; Bhan, M. K.; Calvo, Fabien; Eerola, Iiro; Gerhard, Daniela S.; Guttmacher, Alan; Guyer, Mark; Hemsley, Fiona M.; Jennings, Jennifer L.; Kerr, David; Klatt, Peter; Kolar, Patrik; Kusuda, Jun; Lane, David P.; Laplace, Frank; Lu, Youyong; Nettekoven, Gerd; Ozenberger, Brad; Peterson, Jane; Rao, T. S.; Remacle, Jacques; Schafer, Alan J.; Shibata, Tatsuhiro; Stratton, Michael R.; Vockley, Joseph G.; Watanabe, Koichi; Yang, Huanming; Yuen, Matthew M. F.; Knoppers, M.; Bobrow, Martin; Cambon-Thomsen, Anne; Dressler, Lynn G.; Dyke, Stephanie O. M.; Joly, Yann; Kato, Kazuto; Kennedy, Karen L.; Nicolas, Pilar; Parker, Michael J.; Rial-Sebbag, Emmanuelle; Romeo-Casabona, Carlos M.; Shaw, Kenna M.; Wallace, Susan; Wiesner, Georgia L.; Zeps, Nikolajs; Lichter, Peter; Biankin, Andrew V.; Chabannon, Christian; Chin, Lynda; Clement, Bruno; de Alava, Enrique; Degos, Francoise; Ferguson, Martin L.; Geary, Peter; Hayes, D. Neil; Johns, Amber L.; Nakagawa, Hidewaki; Penny, Robert; Piris, Miguel A.; Sarin, Rajiv; Scarpa, Aldo; Shibata, Tatsuhiro; van de Vijver, Marc; Futreal, P. Andrew; Aburatani, Hiroyuki; Bayes, Monica; Bowtell, David D. 
L.; Campbell, Peter J.; Estivill, Xavier; Grimmond, Sean M.; Gut, Ivo; Hirst, Martin; Lopez-Otin, Carlos; Majumder, Partha; Marra, Marco; Nakagawa, Hidewaki; Ning, Zemin; Puente, Xose S.; Ruan, Yijun; Shibata, Tatsuhiro; Stratton, Michael R.; Stunnenberg, Hendrik G.; Swerdlow, Harold; Velculescu, Victor E.; Wilson, Richard K.; Xue, Hong H.; Yang, Liu; Spellman, Paul T.; Bader, Gary D.; Boutros, Paul C.; Campbell, Peter J.; Flicek, Paul; Getz, Gad; Guigo, Roderic; Guo, Guangwu; Haussler, David; Heath, Simon; Hubbard, Tim J.; Jiang, Tao; Jones, Steven M.; Li, Qibin; Lopez-Bigas, Nuria; Luo, Ruibang; Pearson, John V.; Puente, Xose S.; Quesada, Victor; Raphael, Benjamin J.; Sander, Chris; Shibata, Tatsuhiro; Speed, Terence P.; Stuart, Joshua M.; Teague, Jon W.; Totoki, Yasushi; Tsunoda, Tatsuhiko; Valencia, Alfonso; Wheeler, David A.; Wu, Honglong; Zhao, Shancen; Zhou, Guangyu; Stein, Lincoln D.; Guigo, Roderic; Hubbard, Tim J.; Joly, Yann; Jones, Steven M.; Lathrop, Mark; Lopez-Bigas, Nuria; Ouellette, B. F. Francis; Spellman, Paul T.; Teague, Jon W.; Thomas, Gilles; Valencia, Alfonso; Yoshida, Teruhiko; Kennedy, Karen L.; Axton, Myles; Dyke, Stephanie O. M.; Futreal, P. Andrew; Gunter, Chris; Guyer, Mark; McPherson, John D.; Miller, Linda J.; Ozenberger, Brad; Kasprzyk, Arek; Zhang, Junjun; Haider, Syed A.; Wang, Jianxin; Yung, Christina K.; Cross, Anthony; Liang, Yong; Gnaneshan, Saravanamuttu; Guberman, Jonathan; Hsu, Jack; Bobrow, Martin; Chalmers, Don R. C.; Hasel, Karl W.; Joly, Yann; Kaan, Terry S. H.; Kennedy, Karen L.; Knoppers, Bartha M.; Lowrance, William W.; Masui, Tohru; Nicolas, Pilar; Rial-Sebbag, Emmanuelle; Rodriguez, Laura Lyman; Vergely, Catherine; Yoshida, Teruhiko; Grimmond, Sean M.; Biankin, Andrew V.; Bowtell, David D. 
L.; Cloonan, Nicole; Defazio, Anna; Eshleman, James R.; Etemadmoghadam, Dariush; Gardiner, Brooke A.; Kench, James G.; Scarpa, Aldo; Sutherland, Robert L.; Tempero, Margaret A.; Waddell, Nicola J.; Wilson, Peter J.; Gallinger, Steve; Tsao, Ming-Sound; Shaw, Patricia A.; Petersen, Gloria M.; Mukhopadhyay, Debabrata; Chin, Lynda; DePinho, Ronald A.; Thayer, Sarah; Muthuswamy, Lakshmi; Shazand, Kamran; Beck, Timothy; Sam, Michelle; Timms, Lee; Ballin, Vanessa; Lu, Youyong; Ji, Jiafu; Zhang, Xiuqing; Chen, Feng; Hu, Xueda; Zhou, Guangyu; Yang, Qi; Tian, Geng; Zhang, Lianhai; Xing, Xiaofang; Li, Xianghong; Zhu, Zhenggang; Yu, Yingyan; Yu, Jun; Yang, Huanming; Lathrop, Mark; Tost, Joerg; Brennan, Paul; Holcatova, Ivana; Zaridze, David; Brazma, Alvis; Egevad, Lars; Prokhortchouk, Egor; Banks, Rosamonde Elizabeth; Uhlen, Mathias; Cambon-Thomsen, Anne; Viksna, Juris; Ponten, Fredrik; Skryabin, Konstantin; Stratton, Michael R.; Futreal, P. Andrew; Birney, Ewan; Borg, Ake; Borresen-Dale, Anne-Lise; Caldas, Carlos; Foekens, John A.; Martin, Sancha; Reis-Filho, Jorge S.; Richardson, Andrea L.; Sotiriou, Christos; Stunnenberg, Hendrik G.; Thomas, Gilles; van de Vijver, Marc; van't Veer, Laura; Birnbaum, Daniel; Blanche, Helene; Boucher, Pascal; Boyault, Sandrine; Chabannon, Christian; Gut, Ivo; Masson-Jacquemier, Jocelyne D.; Lathrop, Mark; Pauporte, Iris; Pivot, Xavier; Vincent-Salomon, Anne; Tabone, Eric; Theillet, Charles; Thomas, Gilles; Tost, Joerg; Treilleux, Isabelle; Bioulac-Sage, Paulette; Clement, Bruno; Decaens, Thomas; Degos, Francoise; Franco, Dominique; Gut, Ivo; Gut, Marta; Heath, Simon; Lathrop, Mark; Samuel, Didier; Thomas, Gilles; Zucman-Rossi, Jessica; Lichter, Peter; Eils, Roland; Brors, Benedikt; Korbel, Jan O.; Korshunov, Andrey; Landgraf, Pablo; Lehrach, Hans; Pfister, Stefan; Radlwimmer, Bernhard; Reifenberger, Guido; Taylor, Michael D.; von Kalle, Christof; Majumder, Partha P.; Sarin, Rajiv; Scarpa, Aldo; Pederzoli, Paolo; Lawlor, Rita T.; Delledonne, 
Massimo; Bardelli, Alberto; Biankin, Andrew V.; Grimmond, Sean M.; Gress, Thomas; Klimstra, David; Zamboni, Giuseppe; Shibata, Tatsuhiro; Nakamura, Yusuke; Nakagawa, Hidewaki; Kusuda, Jun; Tsunoda, Tatsuhiko; Miyano, Satoru; Aburatani, Hiroyuki; Kato, Kazuto; Fujimoto, Akihiro; Yoshida, Teruhiko; Campo, Elias; Lopez-Otin, Carlos; Estivill, Xavier; Guigo, Roderic; de Sanjose, Silvia; Piris, Miguel A.; Montserrat, Emili; Gonzalez-Diaz, Marcos; Puente, Xose S.; Jares, Pedro; Valencia, Alfonso; Himmelbaue, Heinz; Quesada, Victor; Bea, Silvia; Stratton, Michael R.; Futreal, P. Andrew; Campbell, Peter J.; Vincent-Salomon, Anne; Richardson, Andrea L.; Reis-Filho, Jorge S.; van de Vijver, Marc; Thomas, Gilles; Masson-Jacquemier, Jocelyne D.; Aparicio, Samuel; Borg, Ake; Borresen-Dale, Anne-Lise; Caldas, Carlos; Foekens, John A.; Stunnenberg, Hendrik G.; van't Veer, Laura; Easton, Douglas F.; Spellman, Paul T.; Martin, Sancha; Chin, Lynda; Collins, Francis S.; Compton, Carolyn C.; Ferguson, Martin L.; Getz, Gad; Gunter, Chris; Guyer, Mark; Hayes, D. Neil; Lander, Eric S.; Ozenberger, Brad; Penny, Robert; Peterson, Jane; Sander, Chris; Speed, Terence P.; Spellman, Paul T.; Wheeler, David A.; Wilson, Richard K.; Chin, Lynda; Knoppers, Bartha M.; Lander, Eric S.; Lichter, Peter; Stratton, Michael R.; Bobrow, Martin; Burke, Wylie; Collins, Francis S.; DePinho, Ronald A.; Easton, Douglas F.; Futreal, P. Andrew; Green, Anthony R.; Guyer, Mark; Hamilton, Stanley R.; Hubbard, Tim J.; Kallioniemi, Olli P.; Kennedy, Karen L.; Ley, Timothy J.; Liu, Edison T.; Lu, Youyong; Majumder, Partha; Marra, Marco; Ozenberger, Brad; Peterson, Jane; Schafer, Alan J.; Spellman, Paul T.; Stunnenberg, Hendrik G.; Wainwright, Brandon J.; Wilson, Richard K.; Yang, Huanming

    2010-01-01

    The International Cancer Genome Consortium (ICGC) was launched to coordinate large-scale cancer genome studies in tumours from 50 different cancer types and/or subtypes that are of clinical and societal importance across the globe. Systematic studies of more than 25,000 cancer genomes at the genomic

  2. The Global Invertebrate Genomics Alliance (GIGA): Developing Community Resources to Study Diverse Invertebrate Genomes

    KAUST Repository

    Bracken-Grissom, Heather

    2013-12-12

    Invertebrates comprise over 95% of all metazoan (animal) species, but very few genomes from these organisms have been sequenced. We have, therefore, formed a Global Invertebrate Genomics Alliance (GIGA). Our intent is to build a collaborative network of diverse scientists to tackle major challenges (e.g., species selection, sample collection and storage, sequence assembly, annotation, analytical tools) associated with genome/transcriptome sequencing across a large taxonomic spectrum. We aim to promote standards that will facilitate comparative approaches to invertebrate genomics and collaborations across the international scientific community. Candidate study taxa include species from Porifera, Ctenophora, Cnidaria, Placozoa, Mollusca, Arthropoda, Echinodermata, Annelida, Bryozoa, and Platyhelminthes, among others. GIGA will target 7000 noninsect/nonnematode species, with an emphasis on marine taxa because of the unrivaled phyletic diversity in the oceans. Priorities for selecting invertebrates for sequencing will include, but are not restricted to, their phylogenetic placement; relevance to organismal, ecological, and conservation research; and their importance to fisheries and human health. We highlight benefits of sequencing both whole genomes (DNA) and transcriptomes and also suggest policies for genomic-level data access and sharing based on transparency and inclusiveness. The GIGA Web site has been launched to facilitate this collaborative venture.

  3. Diversity of Pseudomonas Genomes, Including Populus-Associated Isolates, as Revealed by Comparative Genome Analysis.

    Science.gov (United States)

    Jun, Se-Ran; Wassenaar, Trudy M; Nookaew, Intawat; Hauser, Loren; Wanchai, Visanu; Land, Miriam; Timm, Collin M; Lu, Tse-Yuan S; Schadt, Christopher W; Doktycz, Mitchel J; Pelletier, Dale A; Ussery, David W

    2015-10-30

    The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches, including the rhizosphere and endosphere of many plants. Their diversity influences the phylogenetic diversity and heterogeneity of these communities. On the basis of average amino acid identity, comparative genome analysis of >1,000 Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides (eastern cottonwood) trees resulted in consistent and robust genomic clusters with phylogenetic homogeneity. All Pseudomonas aeruginosa genomes clustered together, and these were clearly distinct from other Pseudomonas species groups on the basis of pangenome and core genome analyses. In contrast, the genomes of Pseudomonas fluorescens were organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. Most of our 21 Populus-associated isolates formed three distinct subgroups within the major P. fluorescens group, supported by pathway profile analysis, while two isolates were more closely related to Pseudomonas chlororaphis and Pseudomonas putida. Genes specific to Populus-associated subgroups were identified. Genes specific to subgroup 1 include several sensory systems that act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor. Genes specific to subgroup 2 contain hypothetical genes, and genes specific to subgroup 3 were annotated with hydrolase activity. This study justifies the need to sequence multiple isolates, especially from P. fluorescens, which displays the most genetic variation, in order to study functional capabilities from a pangenomic perspective. This information will prove useful when choosing Pseudomonas strains for use to promote growth and increase disease resistance in plants.

  4. Genomic diversity of drug-resistant Mycobacterium tuberculosis isolates in Lisbon Portugal: Towards tuberculosis genomic epidemiology

    KAUST Repository

    Perdigão, João

    2015-03-01

    Multidrug- (MDR) and extensively drug-resistant (XDR) tuberculosis (TB) present a challenge to disease control and elimination goals. Lisbon, Portugal, has a high TB incidence rate and unusual and successful XDR-TB strains that have been found in circulation for almost two decades. For the last 20 years, a continued circulation of two phylogenetic clades, Lisboa3 and Q1, which are highly associated with MDR and XDR, has been observed. In recent years, these strains have been well characterized regarding the molecular basis of drug resistance and have been inclusively subjected to whole genome sequencing (WGS). Researchers have been studying the genomic diversity of strains circulating in Lisbon and its genomic determinants through cutting-edge next generation sequencing. An enormous amount of whole genome sequence data are now available for the most prevalent and clinically relevant strains circulating in Lisbon. It is the persistence, prevalence and rapid evolution towards drug resistance that has prompted researchers to investigate the properties of these strains at the genomic level and in the future at a global transcriptomic level. Seventy Mycobacterium tuberculosis (MTB) isolates, mostly recovered in Lisbon, were genotyped by 24-loci Mycobacterial Interspersed Repetitive Unit - Variable Number of Tandem Repeats (MIRU-VNTR) and the genomes sequenced using a next generation sequencing platform - Illumina HiSeq 2000. The genotyping data revealed three major clusters associated with MDR-TB (Lisboa3-A, Lisboa3-B and Q1), two of which are associated with XDR-TB (Lisboa3-B and Q1), whilst the genomic data contributed to elucidating the phylogenetic positioning of circulating MDR-TB strains, showing a high predominance of a single SNP cluster group 5. Furthermore, a genome-wide phylogeny analysis from these strains, together with 19 publicly available genomes of MTB clinical isolates, revealed two major clades responsible for MDR/XDR-TB in the region: Lisboa3 and Q

  5. The Genomic Diversity and Phylogenetic Relationship in the Family Iridoviridae

    Directory of Open Access Journals (Sweden)

    Brooke A. Ring

    2010-07-01

    The Iridoviridae family are large viruses (~120-200 nm) that contain a linear double-stranded DNA genome. The genomic size of Iridoviridae family members ranges from 105,903 bases encoding 97 open reading frames (ORFs) for frog virus 3 to 212,482 bases encoding 211 ORFs for Chilo iridescent virus. The family Iridoviridae is currently subdivided into five genera: Chloriridovirus, Iridovirus, Lymphocystivirus, Megalocytivirus, and Ranavirus. Iridoviruses have been found to infect invertebrates and poikilothermic vertebrates, including amphibians, reptiles, and fish. With such a diverse array of hosts, there is great diversity in gene content between different genera. To understand the origin of iridoviruses, we explored the phylogenetic relationship between individual iridoviruses and defined the core-set of genes shared by all members of the family. In order to further explore the evolutionary relationship between members of the Iridoviridae family, repetitive sequences were identified and compared. Each genome was found to contain a set of unique repetitive sequences that could be used in future virus identification. Repeats common to more than one virus were also identified, and changes in copy number between these repeats may provide a simple method to differentiate between very closely related virus strains. The results of this paper will be useful in identifying new iridoviruses and determining their relationship to other members of the family.

  6. Weeding out the genes: the Arabidopsis genome project.

    Science.gov (United States)

    Martienssen, R A

    2000-05-01

    The Arabidopsis genome sequence is scheduled for completion at the end of this year (December 2000). It will be the first higher plant genome to be sequenced, and will allow a detailed comparison with bacterial, yeast and animal genomes. Already, two of the five chromosomes have been sequenced, and we have had our first glimpse of higher eukaryotic centromeres, and the structure of heterochromatin. The implications for understanding plant gene function, genome structure and genome organization are profound. In this review, the lessons learned for future genome projects are reviewed as well as a summary of the initial findings in Arabidopsis.

  7. Diversity and Evolution in the Genome of Clostridium difficile

    Science.gov (United States)

    Knight, Daniel R.; Elliott, Briony; Chang, Barbara J.; Perkins, Timothy T.

    2015-01-01

    SUMMARY Clostridium difficile infection (CDI) is the leading cause of antimicrobial and health care-associated diarrhea in humans, presenting a significant burden to global health care systems. In the last 2 decades, PCR- and sequence-based techniques, particularly whole-genome sequencing (WGS), have significantly furthered our knowledge of the genetic diversity, evolution, epidemiology, and pathogenicity of this once enigmatic pathogen. C. difficile is taxonomically distinct from many other well-known clostridia, with a diverse population structure comprising hundreds of strain types spread across at least 6 phylogenetic clades. The C. difficile species is defined by a large diverse pangenome with extreme levels of evolutionary plasticity that has been shaped over long time periods by gene flux and recombination, often between divergent lineages. These evolutionary events are in response to environmental and anthropogenic activities and have led to the rapid emergence and worldwide dissemination of virulent clonal lineages. Moreover, genome analysis of large clinically relevant data sets has improved our understanding of CDI outbreaks, transmission, and recurrence. The epidemiology of CDI has changed dramatically over the last 15 years, and CDI may have a foodborne or zoonotic etiology. The WGS era promises to continue to redefine our view of this significant pathogen. PMID:26085550

  8. Diversity and Evolution in the Genome of Clostridium difficile.

    Science.gov (United States)

    Knight, Daniel R; Elliott, Briony; Chang, Barbara J; Perkins, Timothy T; Riley, Thomas V

    2015-07-01

    Clostridium difficile infection (CDI) is the leading cause of antimicrobial and health care-associated diarrhea in humans, presenting a significant burden to global health care systems. In the last 2 decades, PCR- and sequence-based techniques, particularly whole-genome sequencing (WGS), have significantly furthered our knowledge of the genetic diversity, evolution, epidemiology, and pathogenicity of this once enigmatic pathogen. C. difficile is taxonomically distinct from many other well-known clostridia, with a diverse population structure comprising hundreds of strain types spread across at least 6 phylogenetic clades. The C. difficile species is defined by a large diverse pangenome with extreme levels of evolutionary plasticity that has been shaped over long time periods by gene flux and recombination, often between divergent lineages. These evolutionary events are in response to environmental and anthropogenic activities and have led to the rapid emergence and worldwide dissemination of virulent clonal lineages. Moreover, genome analysis of large clinically relevant data sets has improved our understanding of CDI outbreaks, transmission, and recurrence. The epidemiology of CDI has changed dramatically over the last 15 years, and CDI may have a foodborne or zoonotic etiology. The WGS era promises to continue to redefine our view of this significant pathogen.

  9. Comparative genomics reveals diversity among xanthomonads infecting tomato and pepper

    LENUS (Irish Health Repository)

    Potnis, Neha

    2011-03-11

    Background: Bacterial spot of tomato and pepper is caused by four Xanthomonas species and is a major plant disease in warm humid climates. The four species are distinct from each other based on physiological and molecular characteristics. The genome sequence of strain 85-10, a member of one of the species, Xanthomonas euvesicatoria (Xcv) has been previously reported. To determine the relationship of the four species at the genome level and to investigate the molecular basis of their virulence and differing host ranges, draft genomic sequences of members of the other three species were determined and compared to strain 85-10. Results: We sequenced the genomes of X. vesicatoria (Xv) strain 1111 (ATCC 35937), X. perforans (Xp) strain 91-118 and X. gardneri (Xg) strain 101 (ATCC 19865). The genomes were compared with each other and with the previously sequenced Xcv strain 85-10. In addition, the molecular features were predicted that may be required for pathogenicity including the type III secretion apparatus, type III effectors, other secretion systems, quorum sensing systems, adhesins, extracellular polysaccharide, and lipopolysaccharide determinants. Several novel type III effectors from Xg strain 101 and Xv strain 1111 genomes were computationally identified and their translocation was validated using a reporter gene assay. A homolog to Ax21, the elicitor of XA21-mediated resistance in rice, and a functional Ax21 sulfation system were identified in Xcv. Genes encoding proteins with functions mediated by type II and type IV secretion systems have also been compared, including enzymes involved in cell wall deconstruction, as contributors to pathogenicity. Conclusions: Comparative genomic analyses revealed considerable diversity among bacterial spot pathogens, providing new insights into differences and similarities that may explain the diverse nature of these strains. Genes specific to pepper pathogens, such as the O-antigen of the lipopolysaccharide cluster

  10. Comparative genomics reveals diversity among xanthomonads infecting tomato and pepper

    Directory of Open Access Journals (Sweden)

    Koebnik Ralf

    2011-03-01

    Background: Bacterial spot of tomato and pepper is caused by four Xanthomonas species and is a major plant disease in warm humid climates. The four species are distinct from each other based on physiological and molecular characteristics. The genome sequence of strain 85-10, a member of one of the species, Xanthomonas euvesicatoria (Xcv) has been previously reported. To determine the relationship of the four species at the genome level and to investigate the molecular basis of their virulence and differing host ranges, draft genomic sequences of members of the other three species were determined and compared to strain 85-10. Results: We sequenced the genomes of X. vesicatoria (Xv) strain 1111 (ATCC 35937), X. perforans (Xp) strain 91-118 and X. gardneri (Xg) strain 101 (ATCC 19865). The genomes were compared with each other and with the previously sequenced Xcv strain 85-10. In addition, the molecular features were predicted that may be required for pathogenicity including the type III secretion apparatus, type III effectors, other secretion systems, quorum sensing systems, adhesins, extracellular polysaccharide, and lipopolysaccharide determinants. Several novel type III effectors from Xg strain 101 and Xv strain 1111 genomes were computationally identified and their translocation was validated using a reporter gene assay. A homolog to Ax21, the elicitor of XA21-mediated resistance in rice, and a functional Ax21 sulfation system were identified in Xcv. Genes encoding proteins with functions mediated by type II and type IV secretion systems have also been compared, including enzymes involved in cell wall deconstruction, as contributors to pathogenicity. Conclusions: Comparative genomic analyses revealed considerable diversity among bacterial spot pathogens, providing new insights into differences and similarities that may explain the diverse nature of these strains. Genes specific to pepper pathogens, such as the O-antigen of the

  11. National Evaluation of Diversion Projects. Executive Summary.

    Science.gov (United States)

    Dunford, Franklyn W.; And Others

    In 1976 the Special Emphasis branch of the Office of Juvenile Justice and Delinquency Prevention made $10 million available for the development of 11 diversion programs. A national evaluation of these programs was promoted in the hope of better understanding the viability of diversion as an alternative to traditional practices. The impact of…

  12. Patterns of genome size diversity in bats (order Chiroptera).

    Science.gov (United States)

    Smith, Jillian D L; Bickham, John W; Gregory, T Ryan

    2013-08-01

    Despite being a group of particular interest in considering relationships between genome size and metabolic parameters, bats have not been well studied from this perspective. This study presents new estimates for 121 "microbat" species from 12 families and complements a previous study on members of the family Pteropodidae ("megabats"). The results confirm that diversity in genome size in bats is very limited even compared with other mammals, varying approximately 2-fold from 1.63 pg in Lophostoma carrikeri to 3.17 pg in Rhinopoma hardwickii and averaging only 2.35 pg ± 0.02 SE (versus 3.5 pg overall for mammals). However, contrary to some other vertebrate groups, and perhaps owing to the narrow range observed, genome size correlations were not apparent with any chromosomal, physiological, flight-related, developmental, or ecological characteristics within the order Chiroptera. Genome size is positively correlated with measures of body size in bats, though the strength of the relationships differs between pteropodids ("megabats") and nonpteropodids ("microbats").

  13. Exceptionally diverse morphotypes and genomes of crenarchaeal hyperthermophilic viruses

    DEFF Research Database (Denmark)

    Prangishvili, D; Garrett, R A

    2004-01-01

    The remarkable diversity of the morphologies of viruses found in terrestrial hydrothermal environments with temperatures >80 degrees C is unprecedented for aquatic ecosystems. The best-studied viruses from these habitats have been assigned to novel viral families: Fuselloviridae, Lipothrixviridae...... no significant matches to sequences in public databases. This suggests that these hyperthermophilic viruses have exceptional biochemical solutions for biological functions. Specific features of genome organization, as well as strategies for DNA replication, suggest that phylogenetic relationships exist between...... crenarchaeal rudiviruses and the large eukaryal DNA viruses: poxviruses, the African swine fever virus and Chlorella viruses. Sequence patterns at the ends of the linear genome of the lipothrixvirus AFV1 are reminiscent of the telomeric ends of linear eukaryal chromosomes and suggest that a primitive telomeric...

  14. Insurance ratemaking method for risk of construction diversion project

    Institute of Scientific and Technical Information of China (English)

    Chen Zhiding; Hu Zhigen

    2012-01-01

    Based on an analysis of the risk factors of a diversion project, the synthetic risk rate, and the engineering insurance period, the frequency and distribution of losses are studied for the case in which the foundation pit is submerged after the diversion works fail. The total loss is then proposed to follow a non-homogeneous compound Poisson process. A collective risk model of the total loss for engineering insurance is established on the basis of construction diversion project risk. Finally, an insurance ratemaking method for construction engineering risk and its mathematical expression are presented, which provides, to some extent, a theoretical method for insurance ratemaking in hydropower engineering.
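
    The abstract above does not give the model's equations, so the following is only a minimal sketch of the general idea it names: a total loss driven by a non-homogeneous compound Poisson process, with a pure premium estimated as the expected total loss. It is not the authors' method. The intensity function, its upper bound, the loss distribution, and the names `simulate_total_loss` and `pure_premium` are all hypothetical choices made for illustration.

```python
import math
import random


def simulate_total_loss(intensity, intensity_max, horizon, draw_loss, rng):
    """Simulate one total loss S = X_1 + ... + X_N over [0, horizon].

    Event times come from a non-homogeneous Poisson process with rate
    intensity(t), sampled by thinning a homogeneous process of rate
    intensity_max, which must be an upper bound on intensity(t).
    """
    total, t = 0.0, 0.0
    if intensity_max <= 0:
        return total
    while True:
        t += rng.expovariate(intensity_max)  # next candidate event time
        if t > horizon:
            return total
        # Keep the candidate with probability intensity(t) / intensity_max.
        if rng.random() * intensity_max < intensity(t):
            total += draw_loss(rng)


def pure_premium(intensity, intensity_max, horizon, draw_loss,
                 n_paths=20000, seed=0):
    """Monte Carlo estimate of the expected total loss (the pure premium)."""
    rng = random.Random(seed)
    losses = [
        simulate_total_loss(intensity, intensity_max, horizon, draw_loss, rng)
        for _ in range(n_paths)
    ]
    return sum(losses) / n_paths


# Hypothetical inputs: a seasonal failure rate (events per year) over a
# 2-year insurance period, and exponentially distributed losses with mean 100.
def seasonal_rate(t):
    return 0.2 * (1.0 + math.sin(2.0 * math.pi * t))


def exponential_loss(rng):
    return rng.expovariate(1.0 / 100.0)


estimate = pure_premium(seasonal_rate, 0.4, 2.0, exponential_loss, seed=1)
```

    For these assumed inputs the integrated rate over the period is 0.4 and the mean loss is 100, so the estimate should land near 0.4 × 100 = 40. A commercial rate would add loadings on top of this pure premium, which the sketch does not attempt.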

  15. Metabolic Genes within Cyanophage Genomes: Implications for Diversity and Evolution

    Directory of Open Access Journals (Sweden)

    E-Bin Gao

    2016-09-01

    Cyanophages, a group of viruses specifically infecting cyanobacteria, are genetically diverse and extensively abundant in water environments. As a result of selective pressure, cyanophages often acquire a range of metabolic genes from host genomes. The host-derived genes make a significant contribution to the ecological success of cyanophages. In this review, we summarize the host-derived metabolic genes, as well as their origin and roles in cyanophage evolution and important host metabolic pathways, such as the light-dependent reactions of photosynthesis, the pentose phosphate pathway, nutrient acquisition and nucleotide biosynthesis. We also discuss the suitability of the host-derived metabolic genes as potential diagnostic markers for the detection of genetic diversity of cyanophages in natural environments.

  16. Diversity in research projects - A key to success?

    Science.gov (United States)

    Henkel, Daniela; Eisenhauer, Anton; Taubner, Isabelle

    2017-04-01

    According to demographers, psychologists, sociologists and economists, diverse groups, that is, groups whose members differ in race, ethnicity, gender and sexual orientation, are more innovative than homogeneous groups. This is also true for groups working together in research collaborations and international cooperation involving a culturally and functionally diverse mix of individuals who have to be integrated into an effective unit: a project team. If the goal is scientific excellence, diversity should be an essential ingredient for conducting science at a high level of productivity, quality and innovation. Effective teamwork is a key to project success and a prime responsibility of the project manager. Therefore, the project manager has to take into consideration different characteristics such as cultures, languages, and different values related to individual project partners. Here we show how diversity can affect the performance of a research project. Furthermore, the presentation indicates the skills and abilities that management requires in order to deal with the challenges of diversity in research projects. The presentation is based on insights gained in the context of an Innovative Training Network (ITN) project within the Marie Skłodowska-Curie Actions of the European HORIZON 2020 program and TRION, a Collaborative Research Project in the Framework of the Trilateral Program of the German Research Foundation.

  17. Limitations and benefits of ARISA intra-genomic diversity fingerprinting.

    Science.gov (United States)

    Popa, Radu; Popa, Rodica; Marshall, Matthew J; Nguyen, Hien; Tebo, Bradley M; Brauer, Suzanna

    2009-08-01

    Monitoring diversity changes and contamination in mixed cultures and simple microcosms is challenged by fast community structure dynamics, and the need for means allowing fast, cost-efficient and accurate identification of microorganisms at high phylogenetic resolution. The method we explored is a variant of Automated rRNA Intergenic Spacer Analysis based on Intra-Genomic Diversity Fingerprinting (ARISA-IGDF), and identifies phylotypes with multiple 16S-23S rRNA gene Intergenic Transcribed Spacers. We verified the effect of PCR conditions (annealing temperature, duration of final extension, number of cycles, group-specific primers and formamide) on ARISA-IGD fingerprints of 44 strains of Shewanella. We present a digitization algorithm and data analysis procedures needed to determine confidence in strain identification. Though using stringent PCR conditions and group-specific primers allows reasonably accurate identification of strains with three ARISA-IGD amplicons within the 82-1000 bp size range, ARISA-IGDF is best for phylotypes with ≥4 unambiguously different amplicons. This method allows monitoring the occurrence of culturable microbes and can be implemented in applications requiring high phylogenetic resolution, reproducibility, low cost and high throughput such as identifying contamination and monitoring the evolution of diversity in mixed cultures and low diversity microcosms and periodic screening of small microbial culture libraries.

  18. Limitations and Benefits of ARISA Intra-genomic Diversity Fingerprinting

    Energy Technology Data Exchange (ETDEWEB)

    Popa, Radu; Popa, Rodica; Marshall, Matthew J.; Nguyen, Hien; Tebo, Bradley M.; Brauer, Suzanna

    2009-08-01

    Monitoring diversity changes and contamination in mixed cultures and simple microcosms is challenged by fast community structure dynamics, and the need for means allowing fast, cost-efficient and accurate identification of microorganisms at high phylogenetic resolution. The method we explored is a variant of Automated rRNA Intergenic Spacer Analysis based on Intra-Genomic Diversity Fingerprinting (ARISA-IGDF), and identifies phylotypes with multiple 16S–23S rRNA gene Intergenic Transcribed Spacers. We verified the effect of PCR conditions (annealing temperature, duration of final extension, number of cycles, group-specific primers and formamide) on ARISA-IGD fingerprints of 44 strains of Shewanella. We present a digitization algorithm and data analysis procedures needed to determine confidence in strain identification. Though using stringent PCR conditions and group-specific primers allows reasonably accurate identification of strains with three ARISA-IGD amplicons within the 82–1000 bp size range, ARISA-IGDF is best for phylotypes with ≥4 unambiguously different amplicons. This method allows monitoring the occurrence of culturable microbes and can be implemented in applications requiring high phylogenetic resolution, reproducibility, low cost and high throughput such as identifying contamination and monitoring the evolution of diversity in mixed cultures and low diversity microcosms and periodic screening of small microbial culture libraries.

  19. Genomes to life project : quarterly report October 2003.

    Energy Technology Data Exchange (ETDEWEB)

    Heffelfinger, Grant S.

    2004-01-01

    This SAND report provides the technical progress through October 2003 of the Sandia-led project, 'Carbon Sequestration in Synechococcus Sp.: From Molecular Machines to Hierarchical Modeling,' funded by the DOE Office of Science Genomes to Life Program. Understanding, predicting, and perhaps manipulating carbon fixation in the oceans has long been a major focus of biological oceanography and has more recently been of interest to a broader audience of scientists and policy makers. It is clear that the oceanic sinks and sources of CO2 are important terms in the global environmental response to anthropogenic atmospheric inputs of CO2 and that oceanic microorganisms play a key role in this response. However, the relationship between this global phenomenon and the biochemical mechanisms of carbon fixation in these microorganisms is poorly understood. In this project, we will investigate the carbon sequestration behavior of Synechococcus Sp., an abundant marine cyanobacteria known to be important to environmental responses to carbon dioxide levels, through experimental and computational methods. This project is a combined experimental and computational effort with emphasis on developing and applying new computational tools and methods. Our experimental effort will provide the biology and data to drive the computational efforts and include significant investment in developing new experimental methods for uncovering protein partners, characterizing protein complexes, identifying new binding domains. We will also develop and apply new data measurement and statistical methods for analyzing microarray experiments. Computational tools will be essential to our efforts to discover and characterize the function of the molecular machines of Synechococcus. To this end, molecular simulation methods will be coupled with knowledge discovery from diverse biological data sets for high-throughput discovery and characterization of protein-protein complexes. In addition, we will

  20. Genomes to Life Project Quarterly Report October 2004.

    Energy Technology Data Exchange (ETDEWEB)

    Heffelfinger, Grant S.; Martino, Anthony; Rintoul, Mark Daniel; Geist, Al; Gorin, Andrey; Xu, Ying; Palenik, Brian

    2005-02-01

    This SAND report provides the technical progress through October 2004 of the Sandia-led project, "Carbon Sequestration in Synechococcus Sp.: From Molecular Machines to Hierarchical Modeling," funded by the DOE Office of Science Genomes to Life Program. Understanding, predicting, and perhaps manipulating carbon fixation in the oceans has long been a major focus of biological oceanography and has more recently been of interest to a broader audience of scientists and policy makers. It is clear that the oceanic sinks and sources of CO2 are important terms in the global environmental response to anthropogenic atmospheric inputs of CO2 and that oceanic microorganisms play a key role in this response. However, the relationship between this global phenomenon and the biochemical mechanisms of carbon fixation in these microorganisms is poorly understood. In this project, we will investigate the carbon sequestration behavior of Synechococcus Sp., an abundant marine cyanobacteria known to be important to environmental responses to carbon dioxide levels, through experimental and computational methods. This project is a combined experimental and computational effort with emphasis on developing and applying new computational tools and methods. Our experimental effort will provide the biology and data to drive the computational efforts and include significant investment in developing new experimental methods for uncovering protein partners, characterizing protein complexes, identifying new binding domains. We will also develop and apply new data measurement and statistical methods for analyzing microarray experiments. Computational tools will be essential to our efforts to discover and characterize the function of the molecular machines of Synechococcus. To this end, molecular simulation methods will be coupled with knowledge discovery from diverse biological data sets for high-throughput discovery and characterization of protein-protein complexes. In addition, we will develop

  1. Genomes to Life Project Quarterly Report April 2005.

    Energy Technology Data Exchange (ETDEWEB)

    Heffelfinger, Grant S.; Martino, Anthony; Rintoul, Mark Daniel; Geist, Al; Gorin, Andrey; Xu, Ying; Palenik, Brian

    2006-02-01

    This SAND report provides the technical progress through April 2005 of the Sandia-led project, "Carbon Sequestration in Synechococcus Sp.: From Molecular Machines to Hierarchical Modeling," funded by the DOE Office of Science Genomics:GTL Program. Understanding, predicting, and perhaps manipulating carbon fixation in the oceans has long been a major focus of biological oceanography and has more recently been of interest to a broader audience of scientists and policy makers. It is clear that the oceanic sinks and sources of CO2 are important terms in the global environmental response to anthropogenic atmospheric inputs of CO2 and that oceanic microorganisms play a key role in this response. However, the relationship between this global phenomenon and the biochemical mechanisms of carbon fixation in these microorganisms is poorly understood. In this project, we will investigate the carbon sequestration behavior of Synechococcus Sp., an abundant marine cyanobacteria known to be important to environmental responses to carbon dioxide levels, through experimental and computational methods. This project is a combined experimental and computational effort with emphasis on developing and applying new computational tools and methods. Our experimental effort will provide the biology and data to drive the computational efforts and include significant investment in developing new experimental methods for uncovering protein partners, characterizing protein complexes, identifying new binding domains. We will also develop and apply new data measurement and statistical methods for analyzing microarray experiments. Computational tools will be essential to our efforts to discover and characterize the function of the molecular machines of Synechococcus. To this end, molecular simulation methods will be coupled with knowledge discovery from diverse biological data sets for high-throughput discovery and characterization of protein-protein complexes. In addition, we will develop a set of

  2. The surprising diversity of clostridial hydrogenases: a comparative genomic perspective.

    Science.gov (United States)

    Calusinska, Magdalena; Happe, Thomas; Joris, Bernard; Wilmotte, Annick

    2010-06-01

    Among the large variety of micro-organisms capable of fermentative hydrogen production, strict anaerobes such as members of the genus Clostridium are the most widely studied. They can produce hydrogen by a reversible reduction of protons accumulated during fermentation to dihydrogen, a reaction which is catalysed by hydrogenases. Sequenced genomes provide completely new insights into the diversity of clostridial hydrogenases. Building on previous reports, we found that [FeFe] hydrogenases are not a homogeneous group of enzymes, but exist in multiple forms with different modular structures and are especially abundant in members of the genus Clostridium. This unusual diversity seems to support the central role of hydrogenases in cell metabolism. In particular, the presence of multiple putative operons encoding multisubunit [FeFe] hydrogenases highlights the fact that hydrogen metabolism is very complex in this genus. In contrast with [FeFe] hydrogenases, their [NiFe] hydrogenase counterparts, widely represented in other bacteria and archaea, are found in only a few clostridial species. Surprisingly, a heteromultimeric Ech hydrogenase, known to be an energy-converting [NiFe] hydrogenase and previously described only in methanogenic archaea and some sulfur-reducing bacteria, was found to be encoded by the genomes of four cellulolytic strains: Clostridium cellulolyticum, Clostridium papyrosolvens, Clostridium thermocellum and Clostridium phytofermentans.

  3. A Glimpse of the genomic diversity of haloarchaeal tailed viruses

    Directory of Open Access Journals (Sweden)

    Ana Senčilo

    2014-03-01

    Tailed viruses are the most common isolates infecting prokaryotic hosts residing in hypersaline environments. Archaeal tailed viruses represent only a small portion of all characterized tailed viruses of prokaryotes. But even this small dataset revealed that archaeal tailed viruses have many similarities to their counterparts infecting bacteria, the bacteriophages. Shared functional homologues and similar genome organizations suggested that all microbial tailed viruses have common virion architectural and assembly principles. Recent structural studies have provided evidence justifying this, thereby grouping archaeal and bacterial tailed viruses into a single lineage. Currently there are 17 haloarchaeal tailed viruses with entirely sequenced genomes. Nine viruses have at least one close relative among the 17 viruses and, according to the similarities, can be divided into three groups. Two other viruses share some homologues and therefore are distantly related, whereas the rest of the viruses are rather divergent (or singletons). Comparative genomics analysis of these viruses offers a glimpse into the genetic diversity and structure of haloarchaeal tailed virus communities.

  4. Phenotypic heterogeneity of genomically-diverse isolates of Streptococcus mutans.

    Directory of Open Access Journals (Sweden)

    Sara R Palmer

    High coverage, whole genome shotgun (WGS) sequencing of 57 geographically- and genetically-diverse isolates of Streptococcus mutans from individuals of known dental caries status was recently completed. Of the 57 sequenced strains, fifteen isolates were selected based primarily on differences in gene content and phenotypic characteristics known to affect virulence and compared with the reference strain UA159. A high degree of variability in these properties was observed between strains, with a broad spectrum of sensitivities to low pH, oxidative stress (air and paraquat) and exposure to competence stimulating peptide (CSP). Significant differences in autolytic behavior and in biofilm development in glucose or sucrose were also observed. Natural genetic competence varied among isolates, and this was correlated to the presence or absence of competence genes, comCDE and comX, and to bacteriocins. In general, strains that lacked the ability to become competent possessed fewer genes for bacteriocins and immunity proteins or contained polymorphic variants of these genes. WGS sequence analysis of the pan-genome revealed, for the first time, components of a Type VII secretion system in several S. mutans strains, as well as two putative ORFs that encode possible collagen binding proteins located upstream of the cnm gene, which is associated with host cell invasiveness. The virulence of these particular strains was assessed in a wax-worm model. This is the first study to combine a comprehensive analysis of key virulence-related phenotypes with extensive genomic analysis of a pathogen that evolved closely with humans. Our analysis highlights the phenotypic diversity of S. mutans isolates and indicates that the species has evolved a variety of adaptive strategies to persist in the human oral cavity and, when conditions are favorable, to initiate disease.

  5. The Human Genome Project: big science transforms biology and medicine.

    Science.gov (United States)

    Hood, Leroy; Rowen, Lee

    2013-01-01

    The Human Genome Project has transformed biology through its integrated big science approach to deciphering a reference human genome sequence along with the complete sequences of key model organisms. The project exemplifies the power, necessity and success of large, integrated, cross-disciplinary efforts - so-called 'big science' - directed towards complex major objectives. In this article, we discuss the ways in which this ambitious endeavor led to the development of novel technologies and analytical tools, and how it brought the expertise of engineers, computer scientists and mathematicians together with biologists. It established an open approach to data sharing and open-source software, thereby making the data resulting from the project accessible to all. The genome sequences of microbes, plants and animals have revolutionized many fields of science, including microbiology, virology, infectious disease and plant biology. Moreover, deeper knowledge of human sequence variation has begun to alter the practice of medicine. The Human Genome Project has inspired subsequent large-scale data acquisition initiatives such as the International HapMap Project, 1000 Genomes, and The Cancer Genome Atlas, as well as the recently announced Human Brain Project and the emerging Human Proteome Project.

  6. Genome mining of the genetic diversity in the Aspergillus genus - from a collection of more than 30 Aspergillus species

    DEFF Research Database (Denmark)

    Rasmussen, Jane Lind Nybo; Vesth, Tammi Camilla; Theobald, Sebastian;

    In the era of high-throughput sequencing, comparative genomics can be applied for evaluating species diversity. In this project we aim to compare the genomes of 300 species of filamentous fungi from the Aspergillus genus, a complex task. To be able to define species, clade, and core features......, this project uses BLAST on the amino acid level to discover orthologs. With a potential of 300 Aspergillus species each having ~12,000 annotated genes, traditional clustering will demand supercomputing. Instead, our approach reduces the search space by identifying isoenzymes within each genome creating...... intragenomic protein families (iPFs), and then connecting iPFs across all genomes. The initial findings in a set of 31 species show that ~48% of the annotated genes are core genes (genes shared between all species) and 2-24% of the genes are defining the individual species. The methods presented here...
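
    The two-stage grouping described above (build intragenomic protein families within each genome, then connect those families across genomes) can be sketched roughly as follows. This is an illustrative sketch only, not the project's pipeline: it assumes that pairwise hits (for example, filtered BLAST matches) have already been computed, it shows only the grouping logic rather than the search-space reduction itself, and the names `DisjointSet`, `build_families` and `core_families` and the toy data are hypothetical.

```python
from collections import defaultdict


class DisjointSet:
    """Minimal union-find structure used to merge linked items."""

    def __init__(self):
        self.parent = {}

    def find(self, x):
        self.parent.setdefault(x, x)
        root = x
        while self.parent[root] != root:
            root = self.parent[root]
        while self.parent[x] != root:  # path compression
            self.parent[x], x = root, self.parent[x]
        return root

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra != rb:
            self.parent[rb] = ra


def build_families(genes, hits):
    """genes: dict gene_id -> genome_id; hits: list of (gene_a, gene_b)."""
    # Stage 1: intragenomic protein families (iPFs) from within-genome hits.
    ipf = DisjointSet()
    for gene in genes:
        ipf.find(gene)
    for a, b in hits:
        if genes[a] == genes[b]:
            ipf.union(a, b)
    # Stage 2: connect whole iPFs (not single genes) across genomes.
    fam = DisjointSet()
    for gene in genes:
        fam.find(ipf.find(gene))
    for a, b in hits:
        if genes[a] != genes[b]:
            fam.union(ipf.find(a), ipf.find(b))
    families = defaultdict(set)
    for gene in genes:
        families[fam.find(ipf.find(gene))].add(gene)
    return list(families.values())


def core_families(families, genes):
    """Families with at least one member in every genome."""
    all_genomes = set(genes.values())
    return [f for f in families if {genes[g] for g in f} == all_genomes]


# Toy data: three genomes (A, B, C) and a handful of assumed hits.
genes = {"A1": "A", "A2": "A", "A3": "A", "B1": "B", "C1": "C"}
hits = [("A1", "A2"), ("A2", "B1"), ("B1", "C1")]
families = build_families(genes, hits)
core = core_families(families, genes)
```

    In this toy example A1 and A2 form one iPF, which is then linked to B1 and C1, giving a single core family, while A3 remains a family specific to genome A.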

  7. Cancer Genome Anatomy Project (CGAP) | Office of Cancer Genomics

    Science.gov (United States)

    CGAP generated a wide range of genomics data on cancerous cells that are accessible through easy-to-use online tools. Researchers, educators, and students can find "in silico" answers to biological questions through the CGAP website. Request a free copy of the CGAP Website Virtual Tour CD from ocg@mail.nih.gov to learn how to navigate the website.

  8. Global genomic diversity of human papillomavirus 6 based on 724 isolates and 190 complete genome sequences.

    Science.gov (United States)

    Jelen, Mateja M; Chen, Zigui; Kocjan, Boštjan J; Burt, Felicity J; Chan, Paul K S; Chouhy, Diego; Combrinck, Catharina E; Coutlée, François; Estrade, Christine; Ferenczy, Alex; Fiander, Alison; Franco, Eduardo L; Garland, Suzanne M; Giri, Adriana A; González, Joaquín Víctor; Gröning, Arndt; Heidrich, Kerstin; Hibbitts, Sam; Hošnjak, Lea; Luk, Tommy N M; Marinic, Karina; Matsukura, Toshihiko; Neumann, Anna; Oštrbenk, Anja; Picconi, Maria Alejandra; Richardson, Harriet; Sagadin, Martin; Sahli, Roland; Seedat, Riaz Y; Seme, Katja; Severini, Alberto; Sinchi, Jessica L; Smahelova, Jana; Tabrizi, Sepehr N; Tachezy, Ruth; Tohme, Sarah; Uloza, Virgilijus; Vitkauskiene, Astra; Wong, Yong Wee; Zidovec Lepej, Snježana; Burk, Robert D; Poljak, Mario

    2014-07-01

    Human papillomavirus type 6 (HPV6) is the major etiological agent of anogenital warts and laryngeal papillomas and has been included in both the quadrivalent and nonavalent prophylactic HPV vaccines. This study investigated the global genomic diversity of HPV6, using 724 isolates and 190 complete genomes from six continents, and the association of HPV6 genomic variants with geographical location, anatomical site of infection/disease, and gender. Initially, a 2,800-bp E5a-E5b-L1-LCR fragment was sequenced from 492/530 (92.8%) HPV6-positive samples collected for this study. Among them, 130 exhibited at least one single nucleotide polymorphism (SNP), indel, or amino acid change in the E5a-E5b-L1-LCR fragment and were sequenced in full. A global alignment and maximum likelihood tree of 190 complete HPV6 genomes (130 fully sequenced in this study and 60 obtained from sequence repositories) revealed two variant lineages, A and B, and five B sublineages: B1, B2, B3, B4, and B5. HPV6 (sub)lineage-specific SNPs and a 960-bp representative region for whole-genome-based phylogenetic clustering within the L2 open reading frame were identified. Multivariate logistic regression analysis revealed that lineage B predominated globally. Sublineage B3 was more common in Africa and North and South America, and lineage A was more common in Asia. Sublineages B1 and B3 were associated with anogenital infections, indicating a potential lesion-specific predilection of some HPV6 sublineages. Females had higher odds for infection with sublineage B3 than males. In conclusion, a global HPV6 phylogenetic analysis revealed the existence of two variant lineages and five sublineages, showing some degree of ethnogeographic, gender, and/or disease predilection in their distribution. This study established the largest database of globally circulating HPV6 genomic variants and contributed a total of 130 new, complete HPV6 genome sequences to available sequence repositories. 
Two HPV6 variant lineages

  9. Transposable elements and small RNAs: Genomic fuel for species diversity.

    Science.gov (United States)

    Hoffmann, Federico G; McGuire, Liam P; Counterman, Brian A; Ray, David A

    2015-01-01

    While transposable elements (TE) have long been suspected of involvement in species diversification, identifying specific roles has been difficult. We recently found evidence of TE-derived regulatory RNAs in a species-rich family of bats. The TE-derived small RNAs are temporally associated with the burst of species diversification, suggesting that they may have been involved in the processes that led to the diversification. In this commentary, we expand on the ideas that were briefly touched upon in that manuscript. Specifically, we suggest avenues of research that may help to identify the roles that TEs may play in perturbing regulatory pathways. Such research endeavors may serve to inform evolutionary biologists of the ways that TEs have influenced the genomic and taxonomic diversity around us.

  10. Challenges and strategies for implementing genomic services in diverse settings: experiences from the Implementing GeNomics In pracTicE (IGNITE) network.

    Science.gov (United States)

    Sperber, Nina R; Carpenter, Janet S; Cavallari, Larisa H; J Damschroder, Laura; Cooper-DeHoff, Rhonda M; Denny, Joshua C; Ginsburg, Geoffrey S; Guan, Yue; Horowitz, Carol R; Levy, Kenneth D; Levy, Mia A; Madden, Ebony B; Matheny, Michael E; Pollin, Toni I; Pratt, Victoria M; Rosenman, Marc; Voils, Corrine I; W Weitzel, Kristen; Wilke, Russell A; Ryanne Wu, R; Orlando, Lori A

    2017-05-22

    To realize potential public health benefits from genetic and genomic innovations, understanding how best to implement the innovations into clinical care is important. The objective of this study was to synthesize data on challenges identified by six diverse projects that are part of a National Human Genome Research Institute (NHGRI)-funded network focused on implementing genomics into practice and strategies to overcome these challenges. We used a multiple-case study approach with each project considered as a case and qualitative methods to elicit and describe themes related to implementation challenges and strategies. We describe challenges and strategies in an implementation framework and typology to enable consistent definitions and cross-case comparisons. Strategies were linked to challenges based on expert review and shared themes. Three challenges were identified by all six projects, and strategies to address these challenges varied across the projects. One common challenge was to increase the relative priority of integrating genomics within the health system electronic health record (EHR). Four projects used data warehousing techniques to accomplish the integration. The second common challenge was to strengthen clinicians' knowledge and beliefs about genomic medicine. To overcome this challenge, all projects developed educational materials and conducted meetings and outreach focused on genomic education for clinicians. The third challenge was engaging patients in the genomic medicine projects. Strategies to overcome this challenge included use of mass media to spread the word, actively involving patients in implementation (e.g., a patient advisory board), and preparing patients to be active participants in their healthcare decisions. This is the first collaborative evaluation focusing on the description of genomic medicine innovations implemented in multiple real-world clinical settings. 
Findings suggest that strategies to facilitate integration of genomic

  11. Los Alamos Science: The Human Genome Project. Number 20, 1992

    Energy Technology Data Exchange (ETDEWEB)

    Cooper, N G; Shea, N [eds.]

    1992-01-01

    This article provides a broad overview of the Human Genome Project, with particular emphasis on work being done at Los Alamos. It tries to emphasize the scientific aspects of the project, compared to the more speculative information presented in the popular press. There is a brief introduction to modern genetics, including a review of classic work. There is a broad overview of the Genome Project, describing what the project is, what are some of its major five-year goals, what are major technological challenges ahead of the project, and what can the field of biology, as well as society expect to see as benefits from this project. Specific results on the efforts directed at mapping chromosomes 16 and 5 are discussed. A brief introduction to DNA libraries is presented, bearing in mind that Los Alamos has housed such libraries for many years prior to the Genome Project. Information on efforts to do applied computational work related to the project are discussed, as well as experimental efforts to do rapid DNA sequencing by means of single-molecule detection using applied spectroscopic methods. The article introduces the Los Alamos staff which are working on the Genome Project, and concludes with brief discussions on ethical, legal, and social implications of this work; a brief glimpse of genetics as it may be practiced in the next century; and a glossary of relevant terms.

  12. Los Alamos Science: The Human Genome Project. Number 20, 1992

    Science.gov (United States)

    Cooper, N. G.; Shea, N. eds.

    1992-01-01

    This document provides a broad overview of the Human Genome Project, with particular emphasis on work being done at Los Alamos. It tries to emphasize the scientific aspects of the project, compared to the more speculative information presented in the popular press. There is a brief introduction to modern genetics, including a review of classic work. There is a broad overview of the Genome Project, describing what the project is, what are some of its major five-year goals, what are major technological challenges ahead of the project, and what can the field of biology, as well as society expect to see as benefits from this project. Specific results on the efforts directed at mapping chromosomes 16 and 5 are discussed. A brief introduction to DNA libraries is presented, bearing in mind that Los Alamos has housed such libraries for many years prior to the Genome Project. Information on efforts to do applied computational work related to the project are discussed, as well as experimental efforts to do rapid DNA sequencing by means of single-molecule detection using applied spectroscopic methods. The article introduces the Los Alamos staff which are working on the Genome Project, and concludes with brief discussions on ethical, legal, and social implications of this work; a brief glimpse of genetics as it may be practiced in the next century; and a glossary of relevant terms.

  13. Integrated analysis of whole genome and transcriptome sequencing reveals diverse transcriptomic aberrations driven by somatic genomic changes in liver cancers.

    Directory of Open Access Journals (Sweden)

    Yuichi Shiraishi

    Full Text Available Recent studies applying high-throughput sequencing technologies have identified several recurrently mutated genes and pathways in multiple cancer genomes. However, transcriptional consequences from these genomic alterations in cancer genome remain unclear. In this study, we performed integrated and comparative analyses of whole genomes and transcriptomes of 22 hepatitis B virus (HBV)-related hepatocellular carcinomas (HCCs) and their matched controls. Comparison of whole genome sequence (WGS) and RNA-Seq revealed much evidence that various types of genomic mutations triggered diverse transcriptional changes. Not only splice-site mutations, but also silent mutations in coding regions, deep intronic mutations and structural changes caused splicing aberrations. HBV integrations generated diverse patterns of virus-human fusion transcripts depending on affected gene, such as TERT, CDK15, FN1 and MLL4. Structural variations could drive over-expression of genes such as WNT ligands, with/without creating gene fusions. Furthermore, by taking account of genomic mutations causing transcriptional aberrations, we could improve the sensitivity of deleterious mutation detection in known cancer driver genes (TP53, AXIN1, ARID2, RPS6KA3), and identified recurrent disruptions in putative cancer driver genes such as HNF4A, CPS1, TSC1 and THRAP3 in HCCs. These findings indicate genomic alterations in cancer genome have diverse transcriptomic effects, and integrated analysis of WGS and RNA-Seq can facilitate the interpretation of a large number of genomic alterations detected in cancer genome.

  14. The Human Genome Project, and recent advances in personalized genomics

    Directory of Open Access Journals (Sweden)

    Wilson BJ

    2015-02-01

    Full Text Available Brenda J Wilson, Stuart G Nicholls Department of Epidemiology and Community Medicine, Faculty of Medicine, University of Ottawa, Ottawa, ON, Canada Abstract: The language of “personalized medicine” and “personal genomics” has now entered the common lexicon. The idea of personalized medicine is the integration of genomic risk assessment alongside other clinical investigations. Consistent with this approach, testing is delivered by health care professionals who are not medical geneticists, and where results represent risks, as opposed to clinical diagnosis of disease, to be interpreted alongside the entirety of a patient's health and medical data. In this review we consider the evidence concerning the application of such personalized genomics within the context of population screening, and potential implications that arise from this. We highlight two general approaches which illustrate potential uses of genomic information in screening. The first is a narrowly targeted approach in which genetic profiling is linked with standard population-based screening for diseases; the second is a broader targeting of variants associated with multiple single gene disorders, performed opportunistically on patients being investigated for unrelated conditions. In doing so we consider the organization and evaluation of tests and services, the challenge of interpretation with less targeted testing, professional confidence, barriers in practice, and education needs. We conclude by discussing several issues pertinent to health policy, namely: avoiding the conflation of genetics with biological determinism, resisting the “technological imperative”, due consideration of the organization of screening services, the need for professional education, as well as informed decision making and public understanding. Keywords: genomics, personalized medicine, ethics, population health, evidence, education

  15. Genomes to life project quarterly report June 2004.

    Energy Technology Data Exchange (ETDEWEB)

    Heffelfinger, Grant S.

    2005-01-01

    This SAND report provides the technical progress through June 2004 of the Sandia-led project, "Carbon Sequestration in Synechococcus Sp.: From Molecular Machines to Hierarchical Modeling", funded by the DOE Office of Science Genomes to Life Program. Understanding, predicting, and perhaps manipulating carbon fixation in the oceans has long been a major focus of biological oceanography and has more recently been of interest to a broader audience of scientists and policy makers. It is clear that the oceanic sinks and sources of CO2 are important terms in the global environmental response to anthropogenic atmospheric inputs of CO2 and that oceanic microorganisms play a key role in this response. However, the relationship between this global phenomenon and the biochemical mechanisms of carbon fixation in these microorganisms is poorly understood. In this project, we will investigate the carbon sequestration behavior of Synechococcus Sp., an abundant marine cyanobacteria known to be important to environmental responses to carbon dioxide levels, through experimental and computational methods. This project is a combined experimental and computational effort with emphasis on developing and applying new computational tools and methods. Our experimental effort will provide the biology and data to drive the computational efforts and include significant investment in developing new experimental methods for uncovering protein partners, characterizing protein complexes, identifying new binding domains. We will also develop and apply new data measurement and statistical methods for analyzing microarray experiments. Computational tools will be essential to our efforts to discover and characterize the function of the molecular machines of Synechococcus. To this end, molecular simulation methods will be coupled with knowledge discovery from diverse biological data sets for high-throughput discovery and characterization of protein-protein complexes.

  16. The genome diversity and karyotype evolution of mammals

    Directory of Open Access Journals (Sweden)

    Trifonov Vladimir A

    2011-10-01

    Full Text Available Abstract The past decade has witnessed an explosion of genome sequencing and mapping in evolutionary diverse species. While full genome sequencing of mammals is rapidly progressing, the ability to assemble and align orthologous whole chromosome regions from more than a few species is still not possible. The intense focus on building of comparative maps for companion (dog and cat), laboratory (mice and rat) and agricultural (cattle, pig, and horse) animals has traditionally been used as a means to understand the underlying basis of disease-related or economically important phenotypes. However, these maps also provide an unprecedented opportunity to use multispecies analysis as a tool for inferring karyotype evolution. Comparative chromosome painting and related techniques are now considered to be the most powerful approaches in comparative genome studies. Homologies can be identified with high accuracy using molecularly defined DNA probes for fluorescence in situ hybridization (FISH) on chromosomes of different species. Chromosome painting data are now available for members of nearly all mammalian orders. In most orders, there are species with rates of chromosome evolution that can be considered as 'default' rates. The number of rearrangements that have become fixed in evolutionary history seems comparatively low, bearing in mind the 180 million years of the mammalian radiation. Comparative chromosome maps record the history of karyotype changes that have occurred during evolution. The aim of this review is to provide an overview of these recent advances in our endeavor to decipher the karyotype evolution of mammals by integrating the published results together with some of our latest unpublished results.

  17. Diversity and genome dynamics of marine cyanophages using metagenomic analyses.

    Science.gov (United States)

    Ma, Yingfei; Allen, Lisa Zeigler; Palenik, Brian

    2014-12-01

    Cyanophages are abundant in the oceanic environment and directly impact cyanobacterial distributions, physiological processes and evolution. Two samples collected from coastal Maine in July and September 2009 were enriched for Synechococcus cells using flow cytometry and examined through metagenomic sequencing. Homology-based sequence prediction indicated cyanophages, largely myoviruses, accounted for almost half the reads and provided insights into environmental infection events. T4-phage core-gene phylogenetic reconstruction revealed unique diversity among uncultured cyanophages and reference isolates resulting in identification of a new phylogenetic cluster. Genomic comparison of reference cyanophage strains S-SM2 and Syn1 with putative homologous contigs recovered from metagenomes provided evidence that gene insertion, deletion and recombination have occurred among, and are likely important for diversification of, natural populations. Identification of putative genetic exchange between cyanophage and non-cyanophage viruses, i.e. Micromonas virus and Pelagibacter phage, supports hypotheses related to a significant role for viruses in mediating transfer of genetic material between taxonomically diverse organisms with overlapping ecological niches.

  18. The Genome 10K Project: a way forward.

    Science.gov (United States)

    Koepfli, Klaus-Peter; Paten, Benedict; O'Brien, Stephen J

    2015-01-01

    The Genome 10K Project was established in 2009 by a consortium of biologists and genome scientists determined to facilitate the sequencing and analysis of the complete genomes of 10,000 vertebrate species. Since then the number of selected and initiated species has risen from ∼26 to 277 sequenced or ongoing with funding, an approximately tenfold increase in five years. Here we summarize the advances and commitments that have occurred by mid-2014 and outline the achievements and present challenges of reaching the 10,000-species goal. We summarize the status of known vertebrate genome projects, recommend standards for pronouncing a genome as sequenced or completed, and provide our present and future vision of the landscape of Genome 10K. The endeavor is ambitious, bold, expensive, and uncertain, but together the Genome 10K Consortium of Scientists and the worldwide genomics community are moving toward their goal of delivering to the coming generation the gift of genome empowerment for many vertebrate species.

  19. A decade of human genome project conclusion: Scientific diffusion about our genome knowledge.

    Science.gov (United States)

    Moraes, Fernanda; Góes, Andréa

    2016-05-06

    The Human Genome Project (HGP) was initiated in 1990 and completed in 2003. It aimed to sequence the whole human genome. Although it represented an advance in understanding the human genome and its complexity, many questions remained unanswered. Other projects were launched in order to unravel the mysteries of our genome, including the ENCyclopedia of DNA Elements (ENCODE). This review aims to analyze the evolution of scientific knowledge related to both the HGP and ENCODE projects. Data were retrieved from scientific articles published in 1990-2014, a period comprising the development and the 10 years following the HGP completion. The fact that only 20,000 genes are protein and RNA-coding is one of the most striking HGP results. A new concept about the organization of genome arose. The ENCODE project was initiated in 2003 and targeted to map the functional elements of the human genome. This project revealed that the human genome is pervasively transcribed. Therefore, it was determined that a large part of the non-protein coding regions are functional. Finally, a more sophisticated view of chromatin structure emerged. The mechanistic functioning of the genome has been redrafted, revealing a much more complex picture. Besides, a gene-centric conception of the organism has to be reviewed. A number of criticisms have emerged against the ENCODE project approaches, raising the question of whether non-conserved but biochemically active regions are truly functional. Thus, HGP and ENCODE projects accomplished a great map of the human genome, but the data generated still requires further in depth analysis. © 2016 by The International Union of Biochemistry and Molecular Biology, 44:215-223, 2016.

  20. Unexpected cross-species contamination in genome sequencing projects

    Directory of Open Access Journals (Sweden)

    Samier Merchant

    2014-11-01

    Full Text Available The raw data from a genome sequencing project sometimes contains DNA from contaminating organisms, which may be introduced during sample collection or sequence preparation. In some instances, these contaminants remain in the sequence even after assembly and deposition of the genome into public databases. As a result, searches of these databases may yield erroneous and confusing results. We used efficient microbiome analysis software to scan the draft assembly of domestic cow, Bos taurus, and identify 173 small contigs that appeared to derive from microbial contaminants. In the course of verifying these findings, we discovered that one genome, Neisseria gonorrhoeae TCDC-NG08107, although putatively a complete genome, contained multiple sequences that actually derived from the cow and sheep genomes. Our findings illustrate the need to carefully validate findings of anomalous DNA that rely on comparisons to either draft or finished genomes.

  1. The Riken mouse genome encyclopedia project.

    Science.gov (United States)

    Hayashizaki, Yoshihide

    2003-01-01

    The Riken mouse genome encyclopedia is a comprehensive full-length cDNA collection and sequence database. High-level functional annotation is based on sequence homology search, expression profiling, mapping and protein-protein interactions. More than 1,000,000 clones prepared from 163 tissues were end-sequenced and classified into 128,000 clusters, and 60,000 representative clones were fully sequenced, representing 24,000 clear protein-encoding genes. The application of the mouse genome database for positional cloning and gene network regulation analysis is reported.

  2. The landscape of genomic imprinting across diverse adult human tissues

    Science.gov (United States)

    Baran, Yael; Subramaniam, Meena; Biton, Anne; Tukiainen, Taru; Tsang, Emily K.; Rivas, Manuel A.; Pirinen, Matti; Gutierrez-Arcelus, Maria; Smith, Kevin S.; Kukurba, Kim R.; Zhang, Rui; Eng, Celeste; Torgerson, Dara G.; Urbanek, Cydney; Li, Jin Billy; Rodriguez-Santana, Jose R.; Burchard, Esteban G.; Seibold, Max A.; MacArthur, Daniel G.; Montgomery, Stephen B.; Zaitlen, Noah A.; Lappalainen, Tuuli

    2015-01-01

    Genomic imprinting is an important regulatory mechanism that silences one of the parental copies of a gene. To systematically characterize this phenomenon, we analyze tissue specificity of imprinting from allelic expression data in 1582 primary tissue samples from 178 individuals from the Genotype-Tissue Expression (GTEx) project. We characterize imprinting in 42 genes, including both novel and previously identified genes. Tissue specificity of imprinting is widespread, and gender-specific effects are revealed in a small number of genes in muscle with stronger imprinting in males. IGF2 shows maternal expression in the brain instead of the canonical paternal expression elsewhere. Imprinting appears to have only a subtle impact on tissue-specific expression levels, with genes lacking a systematic expression difference between tissues with imprinted and biallelic expression. In summary, our systematic characterization of imprinting in adult tissues highlights variation in imprinting between genes, individuals, and tissues. PMID:25953952

  3. Genomic distribution and estimation of nucleotide diversity in natural populations: perspectives from the collared flycatcher (Ficedula albicollis) genome.

    Science.gov (United States)

    Dutoit, Ludovic; Burri, Reto; Nater, Alexander; Mugal, Carina F; Ellegren, Hans

    2017-07-01

    Properly estimating genetic diversity in populations of nonmodel species requires a basic understanding of how diversity is distributed across the genome and among individuals. To this end, we analysed whole-genome resequencing data from 20 collared flycatchers (genome size ≈1.1 Gb; 10.13 million single nucleotide polymorphisms detected). Genomewide nucleotide diversity was almost identical among individuals (mean = 0.00394, range = 0.00384-0.00401), but diversity levels varied extensively across the genome (95% confidence interval for 200-kb windows = 0.0013-0.0053). Diversity was related to selective constraint such that in comparison with intergenic DNA, diversity at fourfold degenerate sites was reduced to 85%, 3' UTRs to 82%, 5' UTRs to 70% and nondegenerate sites to 12%. There was a strong positive correlation between diversity and chromosome size, probably driven by a higher density of targets for selection on smaller chromosomes increasing the diversity-reducing effect of linked selection. Simulations exploring the ability of sequence data from a small number of genetic markers to capture the observed diversity clearly demonstrated that diversity estimation from finite sampling of such data is bound to be associated with large confidence intervals. Nevertheless, we show that precision in diversity estimation in large outbred population benefits from increasing the number of loci rather than the number of individuals. Simulations mimicking RAD sequencing showed that this approach gives accurate estimates of genomewide diversity. Based on the patterns of observed diversity and the performed simulations, we provide broad recommendations for how genetic diversity should be estimated in natural populations. © 2016 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
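
    The nucleotide diversity reported in the abstract above is conventionally computed as the average number of pairwise differences per site. The following is a minimal sketch of that standard estimator, assuming a small set of aligned, equal-length haploid sequences; the function name `nucleotide_diversity` and the toy sequences are illustrative assumptions and do not reproduce the authors' pipeline, which worked from whole-genome resequencing data of diploid individuals.

```python
from itertools import combinations


def nucleotide_diversity(sequences):
    """Average pairwise differences per site (pi) for aligned sequences."""
    if len(sequences) < 2:
        raise ValueError("need at least two sequences")
    length = len(sequences[0])
    if length == 0 or any(len(seq) != length for seq in sequences):
        raise ValueError("sequences must be non-empty and of equal length")
    pairs = list(combinations(sequences, 2))
    # Count mismatching sites for every pair of sequences.
    total_diffs = sum(
        sum(a != b for a, b in zip(first, second)) for first, second in pairs
    )
    # Normalise by the number of pairs and the alignment length.
    return total_diffs / (len(pairs) * length)


# Toy alignment, invented for illustration only.
toy = ["ACGTACGTAC", "ACGTACGTAT", "ACGAACGTAC", "ACGTACGTAC"]
pi = nucleotide_diversity(toy)  # 6 differences over 6 pairs x 10 sites = 0.1
```

    Applying the same function to separate windows of an alignment would give per-window values of the kind the abstract summarizes for 200-kb windows, although real analyses also need to handle missing data and genotype uncertainty.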

  4. Genomes to Life Project Quarterly Report April 2005.

    Energy Technology Data Exchange (ETDEWEB)

    Heffelfinger, Grant S.; Martino, Anthony; Rintoul, Mark Daniel; Geist, Al; Gorin, Andrey; Xu, Ying; Palenik, Brian

    2006-02-01

    This SAND report provides the technical progress through April 2005 of the Sandia-led project, "Carbon Sequestration in Synechococcus Sp.: From Molecular Machines to Hierarchical Modeling," funded by the DOE Office of Science Genomics:GTL Program. Understanding, predicting, and perhaps manipulating carbon fixation in the oceans has long been a major focus of biological oceanography and has more recently been of interest to a broader audience of scientists and policy makers. It is clear that the oceanic sinks and sources of CO2 are important terms in the global environmental response to anthropogenic atmospheric inputs of CO2 and that oceanic microorganisms play a key role in this response. However, the relationship between this global phenomenon and the biochemical mechanisms of carbon fixation in these microorganisms is poorly understood. In this project, we will investigate the carbon sequestration behavior of Synechococcus Sp., an abundant marine cyanobacteria known to be important to environmental responses to carbon dioxide levels, through experimental and computational methods. This project is a combined experimental and computational effort with emphasis on developing and applying new computational tools and methods. Our experimental effort will provide the biology and data to drive the computational efforts and include significant investment in developing new experimental methods for uncovering protein partners, characterizing protein complexes, identifying new binding domains. We will also develop and apply new data measurement and statistical methods for analyzing microarray experiments. Computational tools will be essential to our efforts to discover and characterize the function of the molecular machines of Synechococcus. To this end, molecular simulation methods will be coupled with knowledge discovery from diverse biological data sets for high-throughput discovery and characterization of protein-protein complexes. 
In addition, we will develop a set of

  5. Genomes to Life Project Quarterly Report October 2004.

    Energy Technology Data Exchange (ETDEWEB)

    Heffelfinger, Grant S.; Martino, Anthony; Rintoul, Mark Daniel; Geist, Al; Gorin, Andrey; Xu, Ying; Palenik, Brian

    2005-02-01

    This SAND report provides the technical progress through October 2004 of the Sandia-led project, "Carbon Sequestration in Synechococcus Sp.: From Molecular Machines to Hierarchical Modeling," funded by the DOE Office of Science Genomes to Life Program. Understanding, predicting, and perhaps manipulating carbon fixation in the oceans has long been a major focus of biological oceanography and has more recently been of interest to a broader audience of scientists and policy makers. It is clear that the oceanic sinks and sources of CO2 are important terms in the global environmental response to anthropogenic atmospheric inputs of CO2 and that oceanic microorganisms play a key role in this response. However, the relationship between this global phenomenon and the biochemical mechanisms of carbon fixation in these microorganisms is poorly understood. In this project, we will investigate the carbon sequestration behavior of Synechococcus Sp., an abundant marine cyanobacteria known to be important to environmental responses to carbon dioxide levels, through experimental and computational methods. This project is a combined experimental and computational effort with emphasis on developing and applying new computational tools and methods. Our experimental effort will provide the biology and data to drive the computational efforts and include significant investment in developing new experimental methods for uncovering protein partners, characterizing protein complexes, identifying new binding domains. We will also develop and apply new data measurement and statistical methods for analyzing microarray experiments. Computational tools will be essential to our efforts to discover and characterize the function of the molecular machines of Synechococcus. To this end, molecular simulation methods will be coupled with knowledge discovery from diverse biological data sets for high-throughput discovery and characterization of protein-protein complexes.
In addition, we will develop

  6. Comparative genomics of Geobacter chemotaxis genes reveals diverse signaling function

    Directory of Open Access Journals (Sweden)

    Antommattei Frances M

    2008-10-01

    Background: Geobacter species are δ-Proteobacteria and are often the predominant species in a variety of sedimentary environments where Fe(III) reduction is important. Their ability to remediate contaminated environments and produce electricity makes them attractive for further study. Cell motility, biofilm formation, and type IV pili all appear important for the growth of Geobacter in changing environments and for electricity production. Recent studies in other bacteria have demonstrated that signaling pathways homologous to the paradigm established for Escherichia coli chemotaxis can regulate type IV pili-dependent motility, the synthesis of flagella and type IV pili, the production of extracellular matrix material, and biofilm formation. The classification of these pathways by comparative genomics improves the ability to understand how Geobacter thrives in natural environments and to improve their use in microbial fuel cells. Results: The genomes of G. sulfurreducens, G. metallireducens, and G. uraniireducens contain multiple (~70) homologs of chemotaxis genes arranged in several major clusters (six, seven, and seven, respectively). Unlike the single gene cluster of E. coli, the Geobacter clusters are not all located near the flagellar genes. The probable functions of some Geobacter clusters are assignable by homology to known pathways; others appear to be unique to the Geobacter sp. and contain genes of unknown function. We identified large numbers of methyl-accepting chemotaxis protein (MCP) homologs that have diverse sensing domain architectures and generate a potential for sensing a great variety of environmental signals. We discuss mechanisms for class-specific segregation of the MCPs in the cell membrane, which serve to maintain pathway specificity and diminish crosstalk. Finally, the regulation of gene expression in Geobacter differs from E. coli. The sequences of predicted promoter elements suggest that the alternative sigma factors

  7. Singapore Genome Variation Project: a haplotype map of three Southeast Asian populations.

    Science.gov (United States)

    Teo, Yik-Ying; Sim, Xueling; Ong, Rick T H; Tan, Adrian K S; Chen, Jieming; Tantoso, Erwin; Small, Kerrin S; Ku, Chee-Seng; Lee, Edmund J D; Seielstad, Mark; Chia, Kee-Seng

    2009-11-01

    The Singapore Genome Variation Project (SGVP) provides a publicly available resource of 1.6 million single nucleotide polymorphisms (SNPs) genotyped in 268 individuals from the Chinese, Malay, and Indian population groups in Southeast Asia. This online database catalogs information and summaries on genotype and phased haplotype data, including allele frequencies, assessment of linkage disequilibrium (LD), and recombination rates in a format similar to the International HapMap Project. Here, we introduce this resource and describe the analysis of human genomic variation upon agglomerating data from the HapMap and the Human Genome Diversity Project, providing useful insights into the population structure of the three major population groups in Asia. In addition, the resource surveys the genome for variation in regional patterns of LD between the HapMap and SGVP populations, and for signatures of positive natural selection using two well-established metrics: iHS and XP-EHH. The raw and processed genetic data, together with all population genetic summaries, are publicly available for download and browsing through a web browser modeled with the Generic Genome Browser.

  8. The human genome project: Prospects and implications for clinical medicine

    Energy Technology Data Exchange (ETDEWEB)

    Green, E.D.; Waterston, R.H. (Washington Univ., St. Louis, MO (United States))

    1991-10-09

    The recently initiated human genome project is a large international effort to elucidate the genetic architecture of the genomes of man and several model organisms. The initial phases of this endeavor involve the establishment of rough blueprints (maps) of the genetic landscape of these genomes, with the long-term goal of determining their precise nucleotide sequences and identifying the genes. The knowledge gained by these studies will provide a vital tool for the study of many biologic processes and will have a profound impact on clinical medicine.

  9. The Great Migration and African-American Genomic Diversity.

    Directory of Open Access Journals (Sweden)

    Soheil Baharian

    2016-05-01

    We present a comprehensive assessment of genomic diversity in the African-American population by studying three genotyped cohorts comprising 3,726 African-Americans from across the United States that provide a representative description of the population across all US states and socioeconomic status. An estimated 82.1% of ancestors to African-Americans lived in Africa prior to the advent of transatlantic travel, 16.7% in Europe, and 1.2% in the Americas, with increased African ancestry in the southern United States compared to the North and West. Combining demographic models of ancestry and those of relatedness suggests that admixture occurred predominantly in the South prior to the Civil War and that ancestry-biased migration is responsible for regional differences in ancestry. We find that recent migrations also caused a strong increase in genetic relatedness among geographically distant African-Americans. Long-range relatedness among African-Americans and between African-Americans and European-Americans thus tracks north- and west-bound migration routes followed during the Great Migration of the twentieth century. By contrast, short-range relatedness patterns suggest comparable mobility of ∼15-16 km per generation for African-Americans and European-Americans, as estimated using a novel analytical model of isolation-by-distance.

  10. Mechanical Genomics Identifies Diverse Modulators of Bacterial Cell Stiffness.

    Science.gov (United States)

    Auer, George K; Lee, Timothy K; Rajendram, Manohary; Cesar, Spencer; Miguel, Amanda; Huang, Kerwyn Casey; Weibel, Douglas B

    2016-06-22

    Bacteria must maintain mechanical integrity to withstand the large osmotic pressure differential across the cell membrane and wall. Although maintaining mechanical integrity is critical for proper cellular function, a fact exploited by prominent cell-wall-targeting antibiotics, the proteins that contribute to cellular mechanics remain unidentified. Here, we describe a high-throughput optical method for quantifying cell stiffness and apply this technique to a genome-wide collection of ∼4,000 Escherichia coli mutants. We identify genes with roles in diverse functional processes spanning cell-wall synthesis, energy production, and DNA replication and repair that significantly change cell stiffness when deleted. We observe that proteins with biochemically redundant roles in cell-wall synthesis exhibit different stiffness defects when deleted. Correlating our data with chemical screens reveals that reducing membrane potential generally increases cell stiffness. In total, our work demonstrates that bacterial cell stiffness is a property of both the cell wall and broader cell physiology and lays the groundwork for future systematic studies of mechanoregulation.

  11. Genomic diversity of cercarial clones of Himasthla elongata (Trematoda, Echinostomatidae) determined with AFLP technique.

    Science.gov (United States)

    Galaktionov, N K; Podgornaya, O I; Strelkov, P P; Galaktionov, K V

    2016-12-01

    The aim of this study was to reveal genomic diversity formed during parthenogenetic reproduction of rediae of the trematode Himasthla elongata in its molluskan host Littorina littorea. We applied amplified fragment length polymorphism (AFLP) to determine the genomic diversity of individual cercariae within the clone, that is, the infrapopulation of parthenogenetic progeny in a single molluskan host. The level of genomic diversity of particular cercariae isolates from a single clone, detected with EcoRI/MseI AFLP reaction, was significantly lower than the variability of cercariae from different clones. The presence of intraclonal genomic diversity indicates a nonsexual shuffle of alleles during parthenogenesis in the rediae of H. elongata. The obtained polymorphic AFLP fragments were long enough to detect the sequences that may be responsible for clonal genomic variability. Based on this, AFLP can be recommended as a tool for the study of genetic mechanisms of this variability.

  12. Freedom and Responsibility in Synthetic Genomics: The Synthetic Yeast Project.

    Science.gov (United States)

    Sliva, Anna; Yang, Huanming; Boeke, Jef D; Mathews, Debra J H

    2015-08-01

    First introduced in 2011, the Synthetic Yeast Genome (Sc2.0) Project is a large international synthetic genomics project that will culminate in the first eukaryotic cell (Saccharomyces cerevisiae) with a fully synthetic genome. With collaborators from across the globe and from a range of institutions spanning from do-it-yourself biology (DIYbio) to commercial enterprises, it is important that all scientists working on this project are cognizant of the ethical and policy issues associated with this field of research and operate under a common set of principles. In this commentary, we survey the current ethics and regulatory landscape of synthetic biology and present the Sc2.0 Statement of Ethics and Governance to which all members of the project adhere. This statement focuses on four aspects of the Sc2.0 Project: societal benefit, intellectual property, safety, and self-governance. We propose that such project-level agreements are an important, valuable, and flexible model of self-regulation for similar global, large-scale synthetic biology projects in order to maximize the benefits and minimize potential harms. Copyright © 2015 by the Genetics Society of America.

  13. Freedom and Responsibility in Synthetic Genomics: The Synthetic Yeast Project

    Science.gov (United States)

    Sliva, Anna; Yang, Huanming; Boeke, Jef D.; Mathews, Debra J. H.

    2015-01-01

    First introduced in 2011, the Synthetic Yeast Genome (Sc2.0) Project is a large international synthetic genomics project that will culminate in the first eukaryotic cell (Saccharomyces cerevisiae) with a fully synthetic genome. With collaborators from across the globe and from a range of institutions spanning from do-it-yourself biology (DIYbio) to commercial enterprises, it is important that all scientists working on this project are cognizant of the ethical and policy issues associated with this field of research and operate under a common set of principles. In this commentary, we survey the current ethics and regulatory landscape of synthetic biology and present the Sc2.0 Statement of Ethics and Governance to which all members of the project adhere. This statement focuses on four aspects of the Sc2.0 Project: societal benefit, intellectual property, safety, and self-governance. We propose that such project-level agreements are an important, valuable, and flexible model of self-regulation for similar global, large-scale synthetic biology projects in order to maximize the benefits and minimize potential harms. PMID:26272997

  14. A genome-to-genome analysis of associations between human genetic variation, HIV-1 sequence diversity, and viral control.

    Science.gov (United States)

    Bartha, István; Carlson, Jonathan M; Brumme, Chanson J; McLaren, Paul J; Brumme, Zabrina L; John, Mina; Haas, David W; Martinez-Picado, Javier; Dalmau, Judith; López-Galíndez, Cecilio; Casado, Concepción; Rauch, Andri; Günthard, Huldrych F; Bernasconi, Enos; Vernazza, Pietro; Klimkait, Thomas; Yerly, Sabine; O'Brien, Stephen J; Listgarten, Jennifer; Pfeifer, Nico; Lippert, Christoph; Fusi, Nicolo; Kutalik, Zoltán; Allen, Todd M; Müller, Viktor; Harrigan, P Richard; Heckerman, David; Telenti, Amalio; Fellay, Jacques

    2013-10-29

    HIV-1 sequence diversity is affected by selection pressures arising from host genomic factors. Using paired human and viral data from 1071 individuals, we ran >3000 genome-wide scans, testing for associations between host DNA polymorphisms, HIV-1 sequence variation and plasma viral load (VL), while considering human and viral population structure. We observed significant human SNP associations to a total of 48 HIV-1 amino acid variants (p [...] genome-to-genome approach highlights sites of genomic conflict and is a strategy generally applicable to studies of host-pathogen interaction. DOI:http://dx.doi.org/10.7554/eLife.01123.001.

  15. Comparative genomics of the marine bacterial genus Glaciecola reveals the high degree of genomic diversity and genomic characteristic for cold adaptation.

    Science.gov (United States)

    Qin, Qi-Long; Xie, Bin-Bin; Yu, Yong; Shu, Yan-Li; Rong, Jin-Cheng; Zhang, Yan-Jiao; Zhao, Dian-Li; Chen, Xiu-Lan; Zhang, Xi-Ying; Chen, Bo; Zhou, Bai-Cheng; Zhang, Yu-Zhong

    2014-06-01

    To what extent the genomes of different species belonging to one genus can be diverse and the relationship between genomic differentiation and environmental factors remain unclear for oceanic bacteria. With many new bacterial genera and species being isolated from marine environments, this question warrants attention. In this study, we sequenced all the type strains of the published species of Glaciecola, a recently defined cold-adapted genus with species from diverse marine locations, to study the genomic diversity and cold-adaptation strategy in this genus. The genome size diverged widely from 3.08 to 5.96 Mb, which can be explained by massive gene gain and loss events. Horizontal gene transfer and new gene emergence contributed substantially to the genome size expansion. The genus Glaciecola had an open pan-genome. Comparative genomic research indicated that species of the genus Glaciecola had high diversity in genome size, gene content and genetic relatedness. This may be prevalent in marine bacterial genera considering the dynamic and complex environments of the ocean. Species of Glaciecola had some common genomic features related to cold adaptation, which enable them to thrive and play a role in biogeochemical cycles in cold marine environments.

  16. PREDICTS: Projecting Responses of Ecological Diversity in Changing Terrestrial Systems

    Directory of Open Access Journals (Sweden)

    Georgina Mace

    2012-12-01

    The PREDICTS project (www.predicts.org.uk) is a three-year NERC-funded project to model and predict at a global scale how local terrestrial diversity responds to human pressures such as land use, land cover, pollution, invasive species and infrastructure. PREDICTS is a collaboration between Imperial College London, the UNEP World Conservation Monitoring Centre, Microsoft Research Cambridge, UCL and the University of Sussex. In order to meet its aims, the project relies on extensive data describing the diversity and composition of biological communities at a local scale. Such data are collected on a vast scale through the committed efforts of field ecologists. If you have appropriate data that you would be willing to share with us, please get in touch (enquiries@predicts.org.uk). All contributions will be acknowledged appropriately and all data contributors will be included as co-authors on an open-access paper describing the database.

  17. Genome Project Standards in a New Era of Sequencing

    Energy Technology Data Exchange (ETDEWEB)

    GSC Consortia; HMP Jumpstart Consortia; Chain, P. S. G.; Grafham, D. V.; Fulton, R. S.; FitzGerald, M. G.; Hostetler, J.; Muzny, D.; Detter, J. C.; Ali, J.; Birren, B.; Bruce, D. C.; Buhay, C.; Cole, J. R.; Ding, Y.; Dugan, S.; Field, D.; Garrity, G. M.; Gibbs, R.; Graves, T.; Han, C. S.; Harrison, S. H.; Highlander, S.; Hugenholtz, P.; Khouri, H. M.; Kodira, C. D.; Kolker, E.; Kyrpides, N. C.; Lang, D.; Lapidus, A.; Malfatti, S. A.; Markowitz, V.; Metha, T.; Nelson, K. E.; Parkhill, J.; Pitluck, S.; Qin, X.; Read, T. D.; Schmutz, J.; Sozhamannan, S.; Strausberg, R.; Sutton, G.; Thomson, N. R.; Tiedje, J. M.; Weinstock, G.; Wollam, A.

    2009-06-01

    For over a decade, genome sequences have adhered to only two standards that are relied on for purposes of sequence analysis by interested third parties (1, 2). However, ongoing developments in revolutionary sequencing technologies have resulted in a redefinition of traditional whole genome sequencing that requires a careful reevaluation of such standards. With commercially available 454 pyrosequencing (followed by Illumina, SOLiD, and now Helicos), there has been an explosion of genomes sequenced under the moniker 'draft'; however, these can be very poor quality genomes (due to inherent errors in the sequencing technologies, and the inability of assembly programs to fully address these errors). Further, one can only infer that such draft genomes may be of poor quality by navigating through the databases to find the number and type of reads deposited in sequence trace repositories (and not all genomes have this available), or to identify the number of contigs or genome fragments deposited to the database. The difficulty in assessing the quality of such deposited genomes has created some havoc for genome analysis pipelines and contributed to many wasted hours of (mis)interpretation. These same novel sequencing technologies have also brought an exponential leap in raw sequencing capability, and at greatly reduced prices that have further skewed the time- and cost-ratios of draft data generation versus the painstaking process of improving and finishing a genome. The resulting effect is an ever-widening gap between drafted and finished genomes that only promises to continue (Figure 1); hence there is an urgent need to distinguish good and poor datasets. The sequencing institutes in the authorship, along with the NIH's Human Microbiome Project Jumpstart Consortium (3), strongly believe that a new set of standards is required for genome sequences. The following represents a set of six community-defined categories of genome sequence standards that better

  18. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project

    DEFF Research Database (Denmark)

    Birney, Ewan; Stamatoyannopoulos, John A; Dutta, Anindya

    2007-01-01

    We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses...

  19. The database of the PREDICTS (Projecting Responses of Ecological Diversity In Changing Terrestrial Systems) project

    NARCIS (Netherlands)

    Hudson, Lawrence N; Newbold, Tim; Contu, Sara; Hill, Samantha L L; Lysenko, Igor; De Palma, Adriana; Phillips, Helen R P; Alhusseini, Tamera I; Bedford, Felicity E; Bennett, Dominic J; Booth, Hollie; Burton, Victoria J; Chng, Charlotte W T; Choimes, Argyrios; Correia, David L P; Day, Julie; Echeverría-Londoño, Susy; Emerson, Susan R; Gao, Di; Garon, Morgan; Harrison, Michelle L K; Ingram, Daniel J; Jung, Martin; Kemp, Victoria; Kirkpatrick, Lucinda; Martin, Callum D; Pan, Yuan; Pask-Hale, Gwilym D; Pynegar, Edwin L; Robinson, Alexandra N; Sanchez-Ortiz, Katia; Senior, Rebecca A; Simmons, Benno I; White, Hannah J; Zhang, Hanbin; Aben, Job; Abrahamczyk, Stefan; Adum, Gilbert B; Aguilar-Barquero, Virginia; Aizen, Marcelo A; Albertos, Belén; Alcala, E L; Del Mar Alguacil, Maria; Alignier, Audrey; Ancrenaz, Marc; Andersen, Alan N; Arbeláez-Cortés, Enrique; Armbrecht, Inge; Arroyo-Rodríguez, Víctor; Aumann, Tom; Axmacher, Jan C; Azhar, Badrul; Azpiroz, Adrián B; Baeten, Lander; Bakayoko, Adama; Báldi, András; Banks, John E; Baral, Sharad K; Barlow, Jos; Barratt, Barbara I P; Barrico, Lurdes; Bartolommei, Paola; Barton, Diane M; Basset, Yves; Batáry, Péter; Bates, Adam J; Baur, Bruno; Bayne, Erin M; Beja, Pedro; Benedick, Suzan; Berg, Åke; Bernard, Henry; Berry, Nicholas J; Bhatt, Dinesh; Bicknell, Jake E; Bihn, Jochen H; Blake, Robin J; Bobo, Kadiri S; Bóçon, Roberto; Boekhout, Teun; Böhning-Gaese, Katrin; Bonham, Kevin J; Borges, Paulo A V; Borges, Sérgio H; Boutin, Céline; Bouyer, Jérémy; Bragagnolo, Cibele; Brandt, Jodi S; Brearley, Francis Q; Brito, Isabel; Bros, Vicenç; Brunet, Jörg; Buczkowski, Grzegorz; Buddle, Christopher M; Bugter, Rob; Buscardo, Erika; Buse, Jörn; Cabra-García, Jimmy; Cáceres, Nilton C; Cagle, Nicolette L; Calviño-Cancela, María; Cameron, Sydney A; Cancello, Eliana M; Caparrós, Rut; Cardoso, Pedro; Carpenter, Dan; Carrijo, Tiago F; Carvalho, Anelena L; Cassano, Camila R; Castro, Helena; Castro-Luna, Alejandro A; Rolando, Cerda B; Cerezo, 
Alexis; Chapman, Kim Alan; Chauvat, Matthieu; Christensen, Morten; Clarke, Francis M; Cleary, Daniel F R; Colombo, Giorgio; Connop, Stuart P; Craig, Michael D; Cruz-López, Leopoldo; Cunningham, Saul A; D'Aniello, Biagio; D'Cruze, Neil; da Silva, Pedro Giovâni; Dallimer, Martin; Danquah, Emmanuel; Darvill, Ben; Dauber, Jens; Davis, Adrian L V; Dawson, Jeff; de Sassi, Claudio; de Thoisy, Benoit; Deheuvels, Olivier; Dejean, Alain; Devineau, Jean-Louis; Diekötter, Tim; Dolia, Jignasu V; Domínguez, Erwin; Dominguez-Haydar, Yamileth; Dorn, Silvia; Draper, Isabel; Dreber, Niels; Dumont, Bertrand; Dures, Simon G; Dynesius, Mats; Edenius, Lars; Eggleton, Paul; Eigenbrod, Felix; Elek, Zoltán; Entling, Martin H; Esler, Karen J; de Lima, Ricardo F; Faruk, Aisyah; Farwig, Nina; Fayle, Tom M; Felicioli, Antonio; Felton, Annika M; Fensham, Roderick J; Fernandez, Ignacio C; Ferreira, Catarina C; Ficetola, Gentile F; Fiera, Cristina; Filgueiras, Bruno K C; Fırıncıoğlu, Hüseyin K; Flaspohler, David; Floren, Andreas; Fonte, Steven J; Fournier, Anne; Fowler, Robert E; Franzén, Markus; Fraser, Lauchlan H; Fredriksson, Gabriella M; Freire, Geraldo B; Frizzo, Tiago L M; Fukuda, Daisuke; Furlani, Dario; Gaigher, René; Ganzhorn, Jörg U; García, Karla P; Garcia-R, Juan C; Garden, Jenni G; Garilleti, Ricardo; Ge, Bao-Ming; Gendreau-Berthiaume, Benoit; Gerard, Philippa J; Gheler-Costa, Carla; Gilbert, Benjamin; Giordani, Paolo; Giordano, Simonetta; Golodets, Carly; Gomes, Laurens G L; Gould, Rachelle K; Goulson, Dave; Gove, Aaron D; Granjon, Laurent; Grass, Ingo; Gray, Claudia L; Grogan, James; Gu, Weibin; Guardiola, Moisès; Gunawardene, Nihara R; Gutierrez, Alvaro G; Gutiérrez-Lamus, Doris L; Haarmeyer, Daniela H; Hanley, Mick E; Hanson, Thor; Hashim, Nor R; Hassan, Shombe N; Hatfield, Richard G; Hawes, Joseph E; Hayward, Matt W; Hébert, Christian; Helden, Alvin J; Henden, John-André; Henschel, Philipp; Hernández, Lionel; Herrera, James P; Herrmann, Farina; Herzog, Felix; Higuera-Diaz, 
Diego; Hilje, Branko; Höfer, Hubert; Hoffmann, Anke; Horgan, Finbarr G; Hornung, Elisabeth; Horváth, Roland; Hylander, Kristoffer; Isaacs-Cubides, Paola; Ishida, Hiroaki; Ishitani, Masahiro; Jacobs, Carmen T; Jaramillo, Víctor J; Jauker, Birgit; Hernández, F Jiménez; Johnson, McKenzie F; Jolli, Virat; Jonsell, Mats; Juliani, S Nur; Jung, Thomas S; Kapoor, Vena; Kappes, Heike; Kati, Vassiliki; Katovai, Eric; Kellner, Klaus; Kessler, Michael; Kirby, Kathryn R; Kittle, Andrew M; Knight, Mairi E; Knop, Eva; Kohler, Florian; Koivula, Matti; Kolb, Annette; Kone, Mouhamadou; Kőrösi, Ádám; Krauss, Jochen; Kumar, Ajith; Kumar, Raman; Kurz, David J; Kutt, Alex S; Lachat, Thibault; Lantschner, Victoria; Lara, Francisco; Lasky, Jesse R; Latta, Steven C; Laurance, William F; Lavelle, Patrick; Le Féon, Violette; LeBuhn, Gretchen; Légaré, Jean-Philippe; Lehouck, Valérie; Lencinas, María V; Lentini, Pia E; Letcher, Susan G; Li, Qi; Litchwark, Simon A; Littlewood, Nick A; Liu, Yunhui; Lo-Man-Hung, Nancy; López-Quintero, Carlos A; Louhaichi, Mounir; Lövei, Gabor L; Lucas-Borja, Manuel Esteban; Luja, Victor H; Luskin, Matthew S; MacSwiney G, M Cristina; Maeto, Kaoru; Magura, Tibor; Mallari, Neil Aldrin; Malone, Louise A; Malonza, Patrick K; Malumbres-Olarte, Jagoba; Mandujano, Salvador; Måren, Inger E; Marin-Spiotta, Erika; Marsh, Charles J; Marshall, E J P; Martínez, Eliana; Martínez Pastur, Guillermo; Moreno Mateos, David; Mayfield, Margaret M; Mazimpaka, Vicente; McCarthy, Jennifer L; McCarthy, Kyle P; McFrederick, Quinn S; McNamara, Sean; Medina, Nagore G; Medina, Rafael; Mena, Jose L; Mico, Estefania; Mikusinski, Grzegorz; Milder, Jeffrey C; Miller, James R; Miranda-Esquivel, Daniel R; Moir, Melinda L; Morales, Carolina L; Muchane, Mary N; Muchane, Muchai; Mudri-Stojnic, Sonja; Munira, A Nur; Muoñz-Alonso, Antonio; Munyekenye, B F; Naidoo, Robin; Naithani, A; Nakagawa, Michiko; Nakamura, Akihiro; Nakashima, Yoshihiro; Naoe, Shoji; Nates-Parra, Guiomar; Navarrete Gutierrez, Dario 
A; Navarro-Iriarte, Luis; Ndang'ang'a, Paul K; Neuschulz, Eike L; Ngai, Jacqueline T; Nicolas, Violaine; Nilsson, Sven G; Noreika, Norbertas; Norfolk, Olivia; Noriega, Jorge Ari; Norton, David A; Nöske, Nicole M; Nowakowski, A Justin; Numa, Catherine; O'Dea, Niall; O'Farrell, Patrick J; Oduro, William; Oertli, Sabine; Ofori-Boateng, Caleb; Oke, Christopher Omamoke; Oostra, Vicencio; Osgathorpe, Lynne M; Otavo, Samuel Eduardo; Page, Navendu V; Paritsis, Juan; Parra-H, Alejandro; Parry, Luke; Pe'er, Guy; Pearman, Peter B; Pelegrin, Nicolás; Pélissier, Raphaël; Peres, Carlos A; Peri, Pablo L; Persson, Anna S; Petanidou, Theodora; Peters, Marcell K; Pethiyagoda, Rohan S; Phalan, Ben; Philips, T Keith; Pillsbury, Finn C; Pincheira-Ulbrich, Jimmy; Pineda, Eduardo; Pino, Joan; Pizarro-Araya, Jaime; Plumptre, A J; Poggio, Santiago L; Politi, Natalia; Pons, Pere; Poveda, Katja; Power, Eileen F; Presley, Steven J; Proença, Vânia; Quaranta, Marino; Quintero, Carolina; Rader, Romina; Ramesh, B R; Ramirez-Pinilla, Martha P; Ranganathan, Jai; Rasmussen, Claus; Redpath-Downing, Nicola A; Reid, J Leighton; Reis, Yana T; Rey Benayas, José M; Rey-Velasco, Juan Carlos; Reynolds, Chevonne; Ribeiro, Danilo Bandini; Richards, Miriam H; Richardson, Barbara A; Richardson, Michael J; Ríos, Rodrigo Macip; Robinson, Richard; Robles, Carolina A; Römbke, Jörg; Romero-Duque, Luz Piedad; Rös, Matthias; Rosselli, Loreta; Rossiter, Stephen J; Roth, Dana S; Roulston, T'ai H; Rousseau, Laurent; Rubio, André V; Ruel, Jean-Claude; Sadler, Jonathan P; Sáfián, Szabolcs; Saldaña-Vázquez, Romeo A; Sam, Katerina; Samnegård, Ulrika; Santana, Joana; Santos, Xavier; Savage, Jade; Schellhorn, Nancy A; Schilthuizen, Menno; Schmiedel, Ute; Schmitt, Christine B; Schon, Nicole L; Schüepp, Christof; Schumann, Katharina; Schweiger, Oliver; Scott, Dawn M; Scott, Kenneth A; Sedlock, Jodi L; Seefeldt, Steven S; Shahabuddin, Ghazala; Shannon, Graeme; Sheil, Douglas; Sheldon, Frederick H; Shochat, Eyal; Siebert, Stefan 
J; Silva, Fernando A B; Simonetti, Javier A; Slade, Eleanor M; Smith, Jo; Smith-Pardo, Allan H; Sodhi, Navjot S; Somarriba, Eduardo J; Sosa, Ramón A; Soto Quiroga, Grimaldo; St-Laurent, Martin-Hugues; Starzomski, Brian M; Stefanescu, Constanti; Steffan-Dewenter, Ingolf; Stouffer, Philip C; Stout, Jane C; Strauch, Ayron M; Struebig, Matthew J; Su, Zhimin; Suarez-Rubio, Marcela; Sugiura, Shinji; Summerville, Keith S; Sung, Yik-Hei; Sutrisno, Hari; Svenning, Jens-Christian; Teder, Tiit; Threlfall, Caragh G; Tiitsaar, Anu; Todd, Jacqui H; Tonietto, Rebecca K; Torre, Ignasi; Tóthmérész, Béla; Tscharntke, Teja; Turner, Edgar C; Tylianakis, Jason M; Uehara-Prado, Marcio; Urbina-Cardona, Nicolas; Vallan, Denis; Vanbergen, Adam J; Vasconcelos, Heraldo L; Vassilev, Kiril; Verboven, Hans A F; Verdasca, Maria João; Verdú, José R; Vergara, Carlos H; Vergara, Pablo M; Verhulst, Jort; Virgilio, Massimiliano; Vu, Lien Van; Waite, Edward M; Walker, Tony R; Wang, Hua-Feng; Wang, Yanping; Watling, James I; Weller, Britta; Wells, Konstans; Westphal, Catrin; Wiafe, Edward D; Williams, Christopher D; Willig, Michael R; Woinarski, John C Z; Wolf, Jan H D; Wolters, Volkmar; Woodcock, Ben A; Wu, Jihua; Wunderle, Joseph M; Yamaura, Yuichi; Yoshikura, Satoko; Yu, Douglas W; Zaitsev, Andrey S; Zeidler, Juliane; Zou, Fasheng; Collen, Ben; Ewers, Rob M; Mace, Georgina M; Purves, Drew W; Scharlemann, Jörn P W; Purvis, Andy

    2017-01-01

    The PREDICTS project, Projecting Responses of Ecological Diversity In Changing Terrestrial Systems (www.predicts.org.uk), has collated from published studies a large, reasonably representative database of comparable samples of biodiversity from multiple sites that differ in the nature or intensity of

  20. The database of the PREDICTS (Projecting Responses of Ecological Diversity in Changing Terrestrial Systems) project

    DEFF Research Database (Denmark)

    Hudson, Lawrence N; Newbold, Tim; Contu, Sara

    2017-01-01

    The PREDICTS project, Projecting Responses of Ecological Diversity In Changing Terrestrial Systems (www.predicts.org.uk), has collated from published studies a large, reasonably representative database of comparable samples of biodiversity from multiple sites that differ in the nature or intensity ...

  1. The database of the PREDICTS (Projecting Responses of Ecological Diversity In Changing Terrestrial Systems) project

    NARCIS (Netherlands)

    Hudson, Lawrence N; Newbold, Tim; Contu, Sara; Hill, Samantha L L; Lysenko, Igor; De Palma, Adriana; Phillips, Helen R P; Alhusseini, Tamera I; Bedford, Felicity E; Bennett, Dominic J; Booth, Hollie; Burton, Victoria J; Chng, Charlotte W T; Choimes, Argyrios; Correia, David L P; Day, Julie; Echeverría-Londoño, Susy; Emerson, Susan R; Gao, Di; Garon, Morgan; Harrison, Michelle L K; Ingram, Daniel J; Jung, Martin; Kemp, Victoria; Kirkpatrick, Lucinda; Martin, Callum D; Pan, Yuan; Pask-Hale, Gwilym D; Pynegar, Edwin L; Robinson, Alexandra N; Sanchez-Ortiz, Katia; Senior, Rebecca A; Simmons, Benno I; White, Hannah J; Zhang, Hanbin; Aben, Job; Abrahamczyk, Stefan; Adum, Gilbert B; Aguilar-Barquero, Virginia; Aizen, Marcelo A; Albertos, Belén; Alcala, E L; Del Mar Alguacil, Maria; Alignier, Audrey; Ancrenaz, Marc; Andersen, Alan N; Arbeláez-Cortés, Enrique; Armbrecht, Inge; Arroyo-Rodríguez, Víctor; Aumann, Tom; Axmacher, Jan C; Azhar, Badrul; Azpiroz, Adrián B; Baeten, Lander; Bakayoko, Adama; Báldi, András; Banks, John E; Baral, Sharad K; Barlow, Jos; Barratt, Barbara I P; Barrico, Lurdes; Bartolommei, Paola; Barton, Diane M; Basset, Yves; Batáry, Péter; Bates, Adam J; Baur, Bruno; Bayne, Erin M; Beja, Pedro; Benedick, Suzan; Berg, Åke; Bernard, Henry; Berry, Nicholas J; Bhatt, Dinesh; Bicknell, Jake E; Bihn, Jochen H; Blake, Robin J; Bobo, Kadiri S; Bóçon, Roberto; Boekhout, Teun; Böhning-Gaese, Katrin; Bonham, Kevin J; Borges, Paulo A V; Borges, Sérgio H; Boutin, Céline; Bouyer, Jérémy; Bragagnolo, Cibele; Brandt, Jodi S; Brearley, Francis Q; Brito, Isabel; Bros, Vicenç; Brunet, Jörg; Buczkowski, Grzegorz; Buddle, Christopher M; Bugter, Rob; Buscardo, Erika; Buse, Jörn; Cabra-García, Jimmy; Cáceres, Nilton C; Cagle, Nicolette L; Calviño-Cancela, María; Cameron, Sydney A; Cancello, Eliana M; Caparrós, Rut; Cardoso, Pedro; Carpenter, Dan; Carrijo, Tiago F; Carvalho, Anelena L; Cassano, Camila R; Castro, Helena; Castro-Luna, Alejandro A; Rolando, Cerda B; Cerezo, 
Alexis; Chapman, Kim Alan; Chauvat, Matthieu; Christensen, Morten; Clarke, Francis M; Cleary, Daniel F R; Colombo, Giorgio; Connop, Stuart P; Craig, Michael D; Cruz-López, Leopoldo; Cunningham, Saul A; D'Aniello, Biagio; D'Cruze, Neil; da Silva, Pedro Giovâni; Dallimer, Martin; Danquah, Emmanuel; Darvill, Ben; Dauber, Jens; Davis, Adrian L V; Dawson, Jeff; de Sassi, Claudio; de Thoisy, Benoit; Deheuvels, Olivier; Dejean, Alain; Devineau, Jean-Louis; Diekötter, Tim; Dolia, Jignasu V; Domínguez, Erwin; Dominguez-Haydar, Yamileth; Dorn, Silvia; Draper, Isabel; Dreber, Niels; Dumont, Bertrand; Dures, Simon G; Dynesius, Mats; Edenius, Lars; Eggleton, Paul; Eigenbrod, Felix; Elek, Zoltán; Entling, Martin H; Esler, Karen J; de Lima, Ricardo F; Faruk, Aisyah; Farwig, Nina; Fayle, Tom M; Felicioli, Antonio; Felton, Annika M; Fensham, Roderick J; Fernandez, Ignacio C; Ferreira, Catarina C; Ficetola, Gentile F; Fiera, Cristina; Filgueiras, Bruno K C; Fırıncıoğlu, Hüseyin K; Flaspohler, David; Floren, Andreas; Fonte, Steven J; Fournier, Anne; Fowler, Robert E; Franzén, Markus; Fraser, Lauchlan H; Fredriksson, Gabriella M; Freire, Geraldo B; Frizzo, Tiago L M; Fukuda, Daisuke; Furlani, Dario; Gaigher, René; Ganzhorn, Jörg U; García, Karla P; Garcia-R, Juan C; Garden, Jenni G; Garilleti, Ricardo; Ge, Bao-Ming; Gendreau-Berthiaume, Benoit; Gerard, Philippa J; Gheler-Costa, Carla; Gilbert, Benjamin; Giordani, Paolo; Giordano, Simonetta; Golodets, Carly; Gomes, Laurens G L; Gould, Rachelle K; Goulson, Dave; Gove, Aaron D; Granjon, Laurent; Grass, Ingo; Gray, Claudia L; Grogan, James; Gu, Weibin; Guardiola, Moisès; Gunawardene, Nihara R; Gutierrez, Alvaro G; Gutiérrez-Lamus, Doris L; Haarmeyer, Daniela H; Hanley, Mick E; Hanson, Thor; Hashim, Nor R; Hassan, Shombe N; Hatfield, Richard G; Hawes, Joseph E; Hayward, Matt W; Hébert, Christian; Helden, Alvin J; Henden, John-André; Henschel, Philipp; Hernández, Lionel; Herrera, James P; Herrmann, Farina; Herzog, Felix; Higuera-Diaz, 
Diego; Hilje, Branko; Höfer, Hubert; Hoffmann, Anke; Horgan, Finbarr G; Hornung, Elisabeth; Horváth, Roland; Hylander, Kristoffer; Isaacs-Cubides, Paola; Ishida, Hiroaki; Ishitani, Masahiro; Jacobs, Carmen T; Jaramillo, Víctor J; Jauker, Birgit; Hernández, F Jiménez; Johnson, McKenzie F; Jolli, Virat; Jonsell, Mats; Juliani, S Nur; Jung, Thomas S; Kapoor, Vena; Kappes, Heike; Kati, Vassiliki; Katovai, Eric; Kellner, Klaus; Kessler, Michael; Kirby, Kathryn R; Kittle, Andrew M; Knight, Mairi E; Knop, Eva; Kohler, Florian; Koivula, Matti; Kolb, Annette

    The PREDICTS project-Projecting Responses of Ecological Diversity In Changing Terrestrial Systems (www.predicts.org.uk)-has collated from published studies a large, reasonably representative database of comparable samples of biodiversity from multiple sites that differ in the nature or intensity of

  2. [The Human Genome Project and the right to intellectual property].

    Science.gov (United States)

    Cambrón, A

    2000-01-01

    The Human Genome Project was designed to achieve two objectives. The scientific goal was the mapping and sequencing of the human genome and the social objective was to benefit the health and well-being of humanity. Although the first objective is nearing successful conclusion, the same cannot be said for the second, mainly because the benefits will take some time to be applicable and effective, but also due to the very nature of the project. The HGP also had a clear economic dimension, which has had a major bearing on its social side. Operating in the midst of these three dimensions is the right to intellectual property (although not just this right), which has facilitated the granting of patents on human genes. Put another way, the carrying out of the HGP has required the privatisation of knowledge of the human genome, and this can be considered an attack on the genetic heritage of mankind.

  3. Enhancing Biology Instruction with the Human Genome Project

    Science.gov (United States)

    Buxeda, Rosa J.; Moore-Russo, Deborah A.

    2003-01-01

    The Human Genome Project (HGP) is a recent scientific milestone that has received notable attention. This article shows how a biology course is using the HGP to enhance students' experiences by providing awareness of cutting edge research, with information on new emerging career options, and with opportunities to consider ethical questions raised…

  4. The Human Genome Project: Biology, Computers, and Privacy.

    Science.gov (United States)

    Cutter, Mary Ann G.; Drexler, Edward; Gottesman, Kay S.; Goulding, Philip G.; McCullough, Laurence B.; McInerney, Joseph D.; Micikas, Lynda B.; Mural, Richard J.; Murray, Jeffrey C.; Zola, John

    This module, for high school teachers, is the second of two modules about the Human Genome Project (HGP) produced by the Biological Sciences Curriculum Study (BSCS). The first section of this module provides background information for teachers about the structure and objectives of the HGP, aspects of the science and technology that underlie the…

  5. Human Genome Project and cystic fibrosis--a symbiotic relationship.

    Science.gov (United States)

    Tolstoi, L G; Smith, C L

    1999-11-01

    When Watson and Crick determined the structure of DNA in 1953, a biological revolution began. One result of this revolution is the Human Genome Project. The primary goal of this international project is to obtain the complete nucleotide sequence of the human genome by the year 2005. Although molecular biologists and geneticists are most enthusiastic about the Human Genome Project, all areas of clinical medicine and fields of biology will be affected. Cystic fibrosis is the most common, inherited, lethal disease of white persons. In 1989, researchers located the cystic fibrosis gene on the long arm of chromosome 7 by a technique known as positional cloning. The most common mutation (a 3-base pair deletion) of the cystic fibrosis gene occurs in 70% of patients with cystic fibrosis. The knowledge gained from genetic research on cystic fibrosis will help researchers develop new therapies (e.g., gene) and improve standard therapies (e.g., pharmacologic) so that a patient's life span is increased and quality of life is improved. The purpose of this review is twofold. First, the article provides an overview of the Human Genome Project and its clinical significance in advancing interdisciplinary care for patients with cystic fibrosis. Second, the article includes a discussion of the genetic basis, pathophysiology, and management of cystic fibrosis.

  6. Reconsidering democracy - History of the human genome project

    NARCIS (Netherlands)

    Huijer, M

    What options are open for people-citizens, politicians, and other nonscientists-to become actively involved in and anticipate new directions in the life sciences? In addressing this question, this article focuses on the start of the Human Genome Project (1985-1990). By contrasting various models of

  7. Reconsidering democracy - History of the human genome project

    NARCIS (Netherlands)

    Huijer, M

    2003-01-01

    What options are open for people-citizens, politicians, and other nonscientists-to become actively involved in and anticipate new directions in the life sciences? In addressing this question, this article focuses on the start of the Human Genome Project (1985-1990). By contrasting various models of

  8. Mapping our genes: The genome projects: How big, how fast

    Energy Technology Data Exchange (ETDEWEB)

    1988-04-01

    For the past 2 years, scientific and technical journals in biology and medicine have extensively covered a debate about whether and how to determine the function and order of human genes on human chromosomes and when to determine the sequence of molecular building blocks that comprise DNA in those chromosomes. In 1987, these issues rose to become part of the public agenda. The debate involves science, technology, and politics. Congress is responsible for "writing the rules" of what various federal agencies do and for funding their work. This report surveys the points made so far in the debate, focusing on those that most directly influence the policy options facing the US Congress. Congressional interest focused on how to assess the rationales for conducting human genome projects, how to fund human genome projects (at what level and through which mechanisms), how to coordinate the scientific and technical programs of the several federal agencies and private interests already supporting various genome projects, and how to strike a balance regarding the impact of genome projects on international scientific cooperation and international economic competition in biotechnology. OTA prepared this report with the assistance of several hundred experts throughout the world. 342 refs., 26 figs., 11 tabs.

  9. Mapping Our Genes: The Genome Projects: How Big, How Fast

    Science.gov (United States)

    1988-04-01

    For the past 2 years, scientific and technical journals in biology and medicine have extensively covered a debate about whether and how to determine the function and order of human genes on human chromosomes and when to determine the sequence of molecular building blocks that comprise DNA in those chromosomes. In 1987, these issues rose to become part of the public agenda. The debate involves science, technology, and politics. Congress is responsible for "writing the rules" of what various federal agencies do and for funding their work. This report surveys the points made so far in the debate, focusing on those that most directly influence the policy options facing the US Congress. Congressional interest focused on how to assess the rationales for conducting human genome projects, how to fund human genome projects (at what level and through which mechanisms), how to coordinate the scientific and technical programs of the several federal agencies and private interests already supporting various genome projects, and how to strike a balance regarding the impact of genome projects on international scientific cooperation and international economic competition in biotechnology. The Office of Technology Assessment (OTA) prepared this report with the assistance of several hundred experts throughout the world.

  10. Relevance of the Human Genome Project to inherited metabolic disease.

    Science.gov (United States)

    Burn, J

    1994-01-01

    The Human Genome Project is an international effort to identify the complete structure of the human genome. HUGO, the Human Genome Organization, facilitates international cooperation and exchange of information while the Genome Data Base will act as the on-line information retrieval and storage system for the huge amount of information being accumulated. The clinical register MIM (Mendelian Inheritance in Man) established by Victor McKusick is now an on-line resource that will allow biochemists working with inborn errors of metabolism to access the rapidly expanding body of knowledge. Biochemical and molecular genetics are complementary and should draw together to find solutions to the academic and clinical problems posed by inborn errors of metabolism.

  11. Avian picornaviruses: molecular evolution, genome diversity and unusual genome features of a rapidly expanding group of viruses in birds.

    Science.gov (United States)

    Boros, Ákos; Pankovics, Péter; Reuter, Gábor

    2014-12-01

    Picornaviridae is one of the most diverse families of viruses infecting vertebrate species. In contrast to the relatively small number of mammal species compared to other vertebrates, mammal-infecting picornaviruses are significantly overrepresented among the presently known picornaviruses. Therefore, most of the current knowledge about genome diversity/organization patterns and common genome features is based on the analysis of mammal-infecting picornaviruses. Despite the well-known reservoir role of birds for several emerging viral pathogens, little is known about the diversity of picornaviruses circulating among birds, although in the last decade the number of known avian picornavirus species with complete genomes has increased from one to at least 15. However, little is known about the geographic distribution, host spectrum or pathogenic potential of the recently described picornaviruses of birds. Despite the low number of known avian picornaviruses, the phylogenetic and genome organization diversity of these viruses is remarkable. Besides the common L-4-3-4 and 4-3-4 genome layouts, unusual genome patterns (3-4-4; 3-5-4, 3-6-4; 3-8-4) with variable, multicistronic 2A genome regions were found among avian picornaviruses. The phylogenetic and genomic analysis revealed the presence of several conserved structures at the untranslated regions among phylogenetically distant avian and non-avian picornaviruses, as well as at least five different avian picornavirus phylogenetic clusters located in every main picornavirus lineage with characteristic genome layouts, which suggests a complex evolutionary history for these viruses. Based on the remarkable genetic diversity of the few known avian picornaviruses, the emergence of further divergent picornaviruses that challenge the current taxonomy and the understanding of the evolution and genome organization of picornaviruses can be expected. In this review we would like to

  12. Nile Tilapia Infectivity by Genomically Diverse Streptococcus agalactiae Isolates from Multiple Hosts

    Science.gov (United States)

    Streptococcus agalactiae, Lancefield group B Streptococcus (GBS), is recognized for causing cattle mastitis, human neonatal meningitis, and fish meningo-encephalitis. We investigated the genomic diversity of GBS isolates from different phylogenetic hosts and geographical regions using serological t...

  13. Visual Dynamic Simulation and Optimization of Zhangjiuhe Diversion Project

    Institute of Scientific and Technical Information of China (English)

    ZHONG Denghua; LIU Jianmin; XIONG Kaizhi; FU Jinqiang

    2008-01-01

    With the aim of visualizing the real-time simulation calculation of a water delivery system (WDS), a structural drawing-oriented (SDO) simulation technique was presented and applied to the Zhangjiuhe Diversion Project, a long-distance water delivery system constructed to draw water from the Zhangjiuhe River to Kunming city. Taking SIMULINK software as the simulation platform, the technique established a visual dynamic simulation model for the system. The simulation procedure was simplified, and modeling efficiency was enhanced through modularization and reuse of the simulation program. Furthermore, a self-optimization model was presented. Based on the digital simulation models, an online-controlled optimization link was added, so that the input data can be continually optimized according to feedback from the simulation output. The system was thus optimized automatically. Built upon MATLAB software, simulation optimization of the Zhangjiuhe Diversion Project was achieved, which provides a new way to research the optimal operation of WDS.

  14. Comparative assessment of genetic diversity in cytoplasmic and nuclear genome of upland cotton.

    Science.gov (United States)

    Egamberdiev, Sharof S; Saha, Sukumar; Salakhutdinov, Ilkhom; Jenkins, Johnie N; Deng, Dewayne; Y Abdurakhmonov, Ibrokhim

    2016-06-01

    The importance of the cytoplasmic genome for many economically important traits is well documented in several crop species, including cotton. There is no report on application of cotton chloroplast specific SSR markers as a diagnostic tool to study genetic diversity among improved Upland cotton lines. The complete plastome sequence information in GenBank provided us an opportunity to report on 17 chloroplast specific SSR markers using a cost-effective data mining strategy. Here we report the comparative analysis of genetic diversity among a set of 42 improved Upland cotton lines using SSR markers specific to chloroplast and nuclear genome, respectively. Our results revealed that low to moderate level of genetic diversity existed in both nuclear and cytoplasm genome among this set of cotton lines. However, the specific estimation suggested that genetic diversity is lower in cytoplasmic genome compared to the nuclear genome among this set of Upland cotton lines. In summary, this research is important from several perspectives. We detected a set of cytoplasm genome specific SSR primer pairs by using a cost-effective data mining strategy. We reported for the first time the genetic diversity in the cytoplasmic genome within a set of improved Upland cotton accessions. Results revealed that the genetic diversity in cytoplasmic genome is narrow, compared to the nuclear genome within this set of Upland cotton accessions. Our results suggested that most of these polymorphic chloroplast SSRs would be a valuable complementary tool in addition to the nuclear SSR in the study of evolution, gene flow and genetic diversity in Upland cotton.

  15. Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes Fungi

    Energy Technology Data Exchange (ETDEWEB)

    Ohm, Robin A.; Feau, Nicolas; Henrissat, Bernard; Schoch, Conrad L.; Horwitz, Benjamin A.; Barry, Kerrie W.; Condon, Bradford J.; Copeland, Alex C.; Dhillon, Braham; Glaser, Fabien; Hesse, Cedar N.; Kosti, Idit; LaButti, Kurt; Lindquist, Erika A.; Lucas, Susan; Salamov, Asaf A.; Bradshaw, Rosie E.; Ciuffetti, Lynda; Hamelin, Richard C.; Kema, Gert H. J.; Lawrence, Christopher; Scott, James A.; Spatafora, Joseph W.; Turgeon, B. Gillian; de Wit, Pierre J. G. M.; Zhong, Shaobin; Goodwin, Stephen B.; Grigoriev, Igor V.

    2012-03-13

    The class of Dothideomycetes is one of the largest and most diverse groups of fungi. Many are plant pathogens and pose a serious threat to agricultural crops grown for biofuel, food or feed. Most Dothideomycetes have only a single host and related species can have very diverse host plants. Eighteen genomes of Dothideomycetes have currently been sequenced by the Joint Genome Institute and other sequencing centers. Here we describe the results of comparative analyses of the fungi in this group.

  16. Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes

    Energy Technology Data Exchange (ETDEWEB)

    Ohm, Robin A.; Feau, Nicolas; Henrissat, Bernard; Schoch, Conrad L.; Horwitz, Benjamin A.; Barry, Kerrie W.; Condon, Bradford J.; Copeland, Alex C.; Dhillon, Braham; Glaser, Fabian; Hesse, Cedar N.; Kosti, Idit; LaButti, Kurt; Lindquist, Erika A.; Lucas, Susan; Salamov, Asaf A.; Bradshaw, Rosie E.; Ciuffetti, Lynda; Hamelin, Richard C.; Kema, Gert H. J.; Lawrence, Christopher; Scott, James A.; Spatafora, Joseph W.; Turgeon, B. Gillian; de Wit, Pierre J. G. M.; Zhong, Shaobin; Goodwin, Stephen B.; Grigoriev, Igor V.

    2013-03-05

    The class of Dothideomycetes is one of the largest and most diverse groups of fungi. Many are plant pathogens and pose a serious threat to agricultural crops that are grown for biofuel, food or feed. Most Dothideomycetes have only a single host plant, and related species can have very diverse hosts. Eighteen genomes of Dothideomycetes have currently been sequenced by the Joint Genome Institute and other sequencing centers. Here we describe the results of comparative analyses of the fungi in this group.

  17. Wild emmer genome architecture and diversity elucidate wheat evolution and domestication.

    Science.gov (United States)

    Avni, Raz; Nave, Moran; Barad, Omer; Baruch, Kobi; Twardziok, Sven O; Gundlach, Heidrun; Hale, Iago; Mascher, Martin; Spannagl, Manuel; Wiebe, Krystalee; Jordan, Katherine W; Golan, Guy; Deek, Jasline; Ben-Zvi, Batsheva; Ben-Zvi, Gil; Himmelbach, Axel; MacLachlan, Ron P; Sharpe, Andrew G; Fritz, Allan; Ben-David, Roi; Budak, Hikmet; Fahima, Tzion; Korol, Abraham; Faris, Justin D; Hernandez, Alvaro; Mikel, Mark A; Levy, Avraham A; Steffenson, Brian; Maccaferri, Marco; Tuberosa, Roberto; Cattivelli, Luigi; Faccioli, Primetta; Ceriotti, Aldo; Kashkush, Khalil; Pourkheirandish, Mohammad; Komatsuda, Takao; Eilam, Tamar; Sela, Hanan; Sharon, Amir; Ohad, Nir; Chamovitz, Daniel A; Mayer, Klaus F X; Stein, Nils; Ronen, Gil; Peleg, Zvi; Pozniak, Curtis J; Akhunov, Eduard D; Distelfeld, Assaf

    2017-07-07

    Wheat (Triticum spp.) is one of the founder crops that likely drove the Neolithic transition to sedentary agrarian societies in the Fertile Crescent more than 10,000 years ago. Identifying genetic modifications underlying wheat's domestication requires knowledge about the genome of its allo-tetraploid progenitor, wild emmer (T. turgidum ssp. dicoccoides). We report a 10.1-gigabase assembly of the 14 chromosomes of wild tetraploid wheat, as well as analyses of gene content, genome architecture, and genetic diversity. With this fully assembled polyploid wheat genome, we identified the causal mutations in Brittle Rachis 1 (TtBtr1) genes controlling shattering, a key domestication trait. A study of genomic diversity among wild and domesticated accessions revealed genomic regions bearing the signature of selection under domestication. This reference assembly will serve as a resource for accelerating the genome-assisted improvement of modern wheat varieties. Copyright © 2017, American Association for the Advancement of Science.

  18. Comparative Genomics Analysis of Streptomyces Species Reveals Their Adaptation to the Marine Environment and Their Diversity at the Genomic Level

    Science.gov (United States)

    Tian, Xinpeng; Zhang, Zhewen; Yang, Tingting; Chen, Meili; Li, Jie; Chen, Fei; Yang, Jin; Li, Wenjie; Zhang, Bing; Zhang, Zhang; Wu, Jiayan; Zhang, Changsheng; Long, Lijuan; Xiao, Jingfa

    2016-01-01

    Over 200 genomes of streptomycete strains that were isolated from various environments are available from the NCBI. However, little is known about the characteristics that are linked to marine adaptation in marine-derived streptomycetes. The particularity and complexity of the marine environment suggest that marine streptomycetes are genetically diverse. Here, we sequenced nine strains from the Streptomyces genus that were isolated from different longitudes, latitudes, and depths of the South China Sea. Then we compared these strains to 22 NCBI downloaded streptomycete strains. Thirty-one streptomycete strains are clearly grouped into a marine-derived subgroup and multiple source subgroup-based phylogenetic tree. The phylogenetic analyses have revealed the dynamic process underlying streptomycete genome evolution, and lateral gene transfer is an important driving force during the process. Pan-genomics analyses have revealed that streptomycetes have an open pan-genome, which reflects the diversity of these streptomycetes and guarantees the species a quick and economical response to diverse environments. Functional and comparative genomics analyses indicate that the marine-derived streptomycetes subgroup possesses some common characteristics of marine adaptation. Our findings have expanded our knowledge of how ocean isolates of streptomycete strains adapt to marine environments. The availability of streptomycete genomes from the South China Sea will be beneficial for further analysis on marine streptomycetes and will enrich the South China Sea’s genetic data sources. PMID:27446038

  19. Comparative Genomics Analysis of Streptomyces Species Reveals Their Adaptation to the Marine Environment and Their Diversity at the Genomic Level.

    Science.gov (United States)

    Tian, Xinpeng; Zhang, Zhewen; Yang, Tingting; Chen, Meili; Li, Jie; Chen, Fei; Yang, Jin; Li, Wenjie; Zhang, Bing; Zhang, Zhang; Wu, Jiayan; Zhang, Changsheng; Long, Lijuan; Xiao, Jingfa

    2016-01-01

    Over 200 genomes of streptomycete strains that were isolated from various environments are available from the NCBI. However, little is known about the characteristics that are linked to marine adaptation in marine-derived streptomycetes. The particularity and complexity of the marine environment suggest that marine streptomycetes are genetically diverse. Here, we sequenced nine strains from the Streptomyces genus that were isolated from different longitudes, latitudes, and depths of the South China Sea. Then we compared these strains to 22 NCBI downloaded streptomycete strains. Thirty-one streptomycete strains are clearly grouped into a marine-derived subgroup and multiple source subgroup-based phylogenetic tree. The phylogenetic analyses have revealed the dynamic process underlying streptomycete genome evolution, and lateral gene transfer is an important driving force during the process. Pan-genomics analyses have revealed that streptomycetes have an open pan-genome, which reflects the diversity of these streptomycetes and guarantees the species a quick and economical response to diverse environments. Functional and comparative genomics analyses indicate that the marine-derived streptomycetes subgroup possesses some common characteristics of marine adaptation. Our findings have expanded our knowledge of how ocean isolates of streptomycete strains adapt to marine environments. The availability of streptomycete genomes from the South China Sea will be beneficial for further analysis on marine streptomycetes and will enrich the South China Sea's genetic data sources.

  20. Genomic diversity and evolution of the head crest in the rock pigeon

    DEFF Research Database (Denmark)

    Shapiro, Michael D.; Kronenberg, Zev; Li, Cai;

    2013-01-01

    The geographic origins of breeds and the genetic basis of variation within the widely distributed and phenotypically diverse domestic rock pigeon (Columba livia) remain largely unknown. We generated a rock pigeon reference genome and additional genome sequences representing domestic and feral pop...

  1. Relationship between metabolic and genomic diversity in sesame (Sesamum indicum L.).

    Science.gov (United States)

    Laurentin, Hernán; Ratzinger, Astrid; Karlovsky, Petr

    2008-05-29

    Diversity estimates in cultivated plants provide a rationale for conservation strategies and support the selection of starting material for breeding programs. Diversity measures applied to crops usually have been limited to the assessment of genome polymorphism at the DNA level. Occasionally, selected morphological features are recorded and the content of key chemical constituents determined, but unbiased and comprehensive chemical phenotypes have not been included systematically in diversity surveys. Our objective in this study was to assess metabolic diversity in sesame by nontargeted metabolic profiling and elucidate the relationship between metabolic and genome diversity in this crop. Ten sesame accessions were selected that represent most of the genome diversity of sesame grown in India, Western Asia, Sudan and Venezuela based on previous AFLP studies. Ethanolic seed extracts were separated by HPLC, metabolites were ionized by positive and negative electrospray and ions were detected with an ion trap mass spectrometer in full-scan mode for m/z from 50 to 1000. Genome diversity was determined by Amplified Fragment Length Polymorphism (AFLP) using eight primer pair combinations. The relationship between biodiversity at the genome and at the metabolome levels was assessed by correlation analysis and multivariate statistics. Patterns of diversity at the genomic and metabolic levels differed, indicating that selection played a significant role in the evolution of metabolic diversity in sesame. This result implies that when used for the selection of genotypes in breeding and conservation, diversity assessment based on neutral DNA markers should be complemented with metabolic profiles. We hypothesize that this applies to all crops with a long history of domestication that possess commercially relevant traits affected by chemical phenotypes.

  2. Genome Size Diversity in Lilium (Liliaceae) Is Correlated with Karyotype and Environmental Traits

    Directory of Open Access Journals (Sweden)

    Yun-peng Du

    2017-07-01

    Genome size (GS) diversity is of fundamental biological importance. The occurrence of giant genomes in angiosperms is restricted to just a few lineages among the plant species whose genome size has been analyzed so far. It is still an open question whether GS diversity is shaped by neutral or natural selection. The genus Lilium, with giant genomes, is phylogenetically and horticulturally important and is distributed throughout the northern hemisphere. GS diversity in Lilium and the underlying evolutionary mechanisms are poorly understood. We performed a comprehensive study involving phylogenetically independent analysis on 71 species to explore the diversity and evolution of GS and its correlation with karyological and environmental traits within Lilium (including Nomocharis). The strong phylogenetic signal detected for GS in the genus is consistent with repetitive DNA being the primary contributor to GS diversity, while the significant positive relationships detected between GS and the haploid chromosome length (HCL) provide insights into patterns of genome evolution. The relationships between GS and karyotypes indicate that ancestral karyotypes of Lilium are likely to have exhibited small genomes, low diversity in centromeric index (CVCI) values and relatively high relative variation in chromosome length (CVCL) values. Significant relationships identified between GS and annual temperature and between GS and annual precipitation suggest that adaptation to habitat strongly influences GS diversity. We conclude that GS in Lilium is shaped by both neutral (genetic drift) and adaptive evolution. These findings will have important consequences for understanding the evolution of giant plant genomes, and exploring the role of repetitive DNA fraction and chromosome changes in a plant group with large genomes and conservation of chromosome number.

  3. Project to expand diversity in the nursing workforce.

    Science.gov (United States)

    Georges, Catherine

    2012-05-01

    The Bronx, one of the five boroughs of New York City, has a diverse population, but the largest ethnic group is Hispanic, or Latino. More than half (53 per cent) of the students at Lehman College of the City University of New York are from this group, reflecting the population demographic of the borough, but in 2006 Hispanic students comprised just 8 per cent of those enrolled in the department of nursing. To address this disparity, the department undertook a project to increase recruitment, retention and graduation of Hispanic nursing students. The project involved several activities in collaboration with a Bronx high school, Lehman College's baccalaureate nursing programme, and a partner hospital that serves thousands of people of Hispanic origin. This article describes the project and the lessons learnt.

  4. Mitochondrial genome diversity in dagger and needle nematodes (Nematoda: Longidoridae)

    Science.gov (United States)

    Palomares-Rius, J. E.; Cantalapiedra-Navarrete, C.; Archidona-Yuste, A.; Blok, V. C.; Castillo, P.

    2017-01-01

    Dagger and needle nematodes included in the family Longidoridae (viz. Longidorus, Paralongidorus, and Xiphinema) are highly polyphagous plant-parasitic nematodes in wild and cultivated plants and some of them are plant-virus vectors (nepovirus). The mitochondrial (mt) genomes of the dagger and needle nematodes, Xiphinema rivesi, Xiphinema pachtaicum, Longidorus vineacola and Paralongidorus litoralis were sequenced in this study. The four circular mt genomes have an estimated size of 12.6, 12.5, 13.5 and 12.7 kb, respectively. To date, the mt genome of X. pachtaicum is the smallest genome found in Nematoda. The four mt genomes contain 12 protein-coding genes (viz. cox1-3, nad1-6, nad4L, atp6 and cob) and two ribosomal RNA genes (rrnL and rrnS), but the atp8 gene was not detected. These mt genomes showed a gene arrangement very different within the Longidoridae species sequenced, with the exception of very closely related species (X. americanum and X. rivesi). The sizes of non-coding regions in the Longidoridae nematodes were very small and were present in a few places in the mt genome. Phylogenetic analysis of all coding genes showed a closer relationship between Longidorus and Paralongidorus and different phylogenetic possibilities for the three Xiphinema species. PMID:28150734

  5. The emergence of commercial genomics: analysis of the rise of a biotechnology subsector during the Human Genome Project, 1990 to 2004.

    Science.gov (United States)

    Wiechers, Ilse R; Perin, Noah C; Cook-Deegan, Robert

    2013-01-01

    Development of the commercial genomics sector within the biotechnology industry relied heavily on the scientific commons, public funding, and technology transfer between academic and industrial research. This study tracks financial and intellectual property data on genomics firms from 1990 through 2004, thus following these firms as they emerged in the era of the Human Genome Project and through the 2000 to 2001 market bubble. A database was created based on an early survey of genomics firms, which was expanded using three web-based biotechnology services, scientific journals, and biotechnology trade and technical publications. Financial data for publicly traded firms was collected through the use of four databases specializing in firm financials. Patent searches were conducted using firm names in the US Patent and Trademark Office website search engine and the DNA Patent Database. A biotechnology subsector of genomics firms emerged in parallel to the publicly funded Human Genome Project. Trends among top firms show that hiring, capital improvement, and research and development expenditures continued to grow after a 2000 to 2001 bubble. The majority of firms are small businesses with great diversity in type of research and development, products, and services provided. Over half the public firms holding patents have the majority of their intellectual property portfolio in DNA-based patents. These data allow estimates of investment, research and development expenditures, and jobs that paralleled the rise of genomics as a sector within biotechnology between 1990 and 2004.

  6. Draft Genome Sequences of Nine Cyanobacterial Strains from Diverse Habitats

    Science.gov (United States)

    Zhu, Tao; Hou, Shengwei

    2017-01-01

    Here, we report the annotated draft genome sequences of nine different cyanobacteria, which were originally collected from different habitats, including hot springs, terrestrial, freshwater, and marine environments, and cover four of the five morphological subsections of cyanobacteria. PMID:28254973

  7. Consequences for diversity when prioritizing animals for conservation with pedigree or genomic information.

    Science.gov (United States)

    Engelsma, K A; Veerkamp, R F; Calus, M P L; Windig, J J

    2011-12-01

    Up to now, prioritization of animals for conservation has been mainly based on pedigree information; however, genomic information may improve prioritization. In this study, we used two Holstein populations to investigate the consequences for genetic diversity when animals are prioritized with optimal contributions based on pedigree or genomic data and whether consequences are different at the chromosomal level. Selection with genomic kinships resulted in a higher conserved diversity, but differences were small. Largest differences were found when few animals were prioritized and when pedigree errors were present. We found more differences at the chromosomal level, where selection based on genomic kinships resulted in a higher conserved diversity for most chromosomes, but for some chromosomes, pedigree-based selection resulted in a higher conserved diversity. To optimize conservation strategies, genomic information can help to improve the selection of animals for conservation in those situations where pedigree information is unreliable or absent or when we want to conserve diversity at specific genome regions. © 2011 Blackwell Verlag GmbH.

  8. Ecology, Diversity and Comparative Genomics of Oceanic Cyanobacterial Viruses

    Science.gov (United States)

    2004-06-01

    genome contains a group of some genes with little homology to known bacterial proteins, but with homology to eukaryotic prion-like proteins (e-5), an...eukaryotic and prion-like genes have been made in the genomes of mycobacteriophages (Pedulla et al., 2003). The presence of these genes in a...and aseptically add filter-sterilized nutrients to bring level of top agarose nutrients to desired media of choice d. Add appropriate volume of

  9. The characterization of goat genetic diversity : Towards a genomic approach

    NARCIS (Netherlands)

    Ajmone-Marsan, P.; Colli, L.; Han, J. L.; Achilli, A.; Lancioni, H.; Joost, S.; Crepaldi, P.; Pilla, F.; Stella, A.; Taberlet, P.; Boettcher, P.; Negrini, R.; Lenstra, J. A.

    2014-01-01

    The investigation of genetic diversity at molecular level has been proposed as a valuable complement and sometimes proxy to phenotypic diversity of local breeds and is presently considered as one of the FAO priorities for breed characterization. By recommending a set of selected molecular markers fo

  10. The characterization of goat genetic diversity : Towards a genomic approach

    NARCIS (Netherlands)

    Ajmone-Marsan, P.; Colli, L.; Han, J. L.; Achilli, A.; Lancioni, H.; Joost, S.; Crepaldi, P.; Pilla, F.; Stella, A.; Taberlet, P.; Boettcher, P.; Negrini, R.; Lenstra, J. A.

    2014-01-01

    The investigation of genetic diversity at molecular level has been proposed as a valuable complement and sometimes proxy to phenotypic diversity of local breeds and is presently considered as one of the FAO priorities for breed characterization. By recommending a set of selected molecular markers

  11. The little bacteria that can - diversity, genomics and ecophysiology of 'Dehalococcoides' spp. in contaminated environments.

    Science.gov (United States)

    Taş, Neslihan; van Eekert, Miriam H A; de Vos, Willem M; Smidt, Hauke

    2010-07-01

    The fate and persistence of chlorinated organics in the environment have been a concern for the past 50 years. Industrialization and extensive agricultural activities have led to the accumulation of these pollutants in the environment, while their adverse impact on various ecosystems and human health also became evident. This review provides an update on the current knowledge of specialized anaerobic bacteria, namely 'Dehalococcoides' spp., which are dedicated to the transformation of various chlorinated organic compounds via reductive dechlorination. Advances in microbiology and molecular techniques shed light into the diversity and functioning of Dehalococcoides spp. in several different locations. Recent genome sequencing projects revealed a large number of genes that are potentially involved in reductive dechlorination. Molecular approaches towards analysis of diversity and expression especially of reductive dehalogenase-encoding genes are providing a growing body of knowledge on biodegradative pathways active in defined pure and mixed cultures as well as directly in the environment. Moreover, several successful field cases of bioremediation strengthen the notion of dedicated degraders such as Dehalococcoides spp. as key players in the restoration of contaminated environments. © 2009 The Authors. Journal compilation © 2009 Society for Applied Microbiology and Blackwell Publishing Ltd.

  12. The environmental genome project: ethical, legal, and social implications.

    OpenAIRE

    Sharp, R R; Barrett, J. C.

    2000-01-01

    The National Institute of Environmental Health Sciences is supporting a multiyear research initiative examining genetic influences on environmental response. Proponents of this new initiative, known as the Environmental Genome Project, hope that the information learned will improve our understanding of environmentally associated diseases and allow clinicians and public health officials to target disease-prevention strategies to those who are at increased risk. Despite these potential benefits...

  13. nGASP - the nematode genome annotation assessment project

    Energy Technology Data Exchange (ETDEWEB)

    Coghlan, A; Fiedler, T J; McKay, S J; Flicek, P; Harris, T W; Blasiar, D; Allen, J; Stein, L D

    2008-12-19

    While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner' algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second place. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy as reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs were the most challenging for gene-finders.
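    The gene-level sensitivity and specificity figures quoted above can be illustrated with a small sketch. It assumes the convention common in gene-prediction benchmarks, where sensitivity is the fraction of reference genes predicted exactly and specificity is the fraction of predicted genes that exactly match a reference gene; the abstract does not spell out its definitions, so this is an assumption. The function name, the tuple-of-exons gene representation, and the toy coordinates are all hypothetical.

```python
from typing import Set, Tuple

# Hypothetical representation: a gene model is a tuple of (start, end) exon
# coordinates. Two models count as the same gene only if every exon matches.
Gene = Tuple[Tuple[int, int], ...]


def gene_level_accuracy(reference: Set[Gene], predicted: Set[Gene]) -> Tuple[float, float]:
    """Return (sensitivity, specificity) at the gene level.

    Assumed definitions (not taken from the abstract):
      sensitivity = exact matches / number of reference genes
      specificity = exact matches / number of predicted genes
    """
    true_pos = len(reference & predicted)
    sensitivity = true_pos / len(reference) if reference else 0.0
    specificity = true_pos / len(predicted) if predicted else 0.0
    return sensitivity, specificity


# Toy data: three reference genes, four predictions, two exact matches.
reference = {
    ((100, 200), (300, 400)),
    ((1000, 1200),),
    ((5000, 5100), (5200, 5300)),
}
predicted = {
    ((100, 200), (300, 400)),      # exact match
    ((1000, 1250),),               # wrong exon end, so not a match
    ((5000, 5100), (5200, 5300)),  # exact match
    ((9000, 9100),),               # no corresponding reference gene
}

sn, sp = gene_level_accuracy(reference, predicted)
print(f"sensitivity={sn:.2f} specificity={sp:.2f}")
```

    Under these assumed definitions, a predictor can score well on sensitivity while scoring poorly on specificity if it emits many extra gene models, which is one way to read the 78% versus 42% figures.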

  14. The database of the PREDICTS (Projecting Responses of Ecological Diversity In Changing Terrestrial Systems) project

    OpenAIRE

    Hudson, Lawrence N; Newbold, Tim; Contu, Sara; Hill, Samantha L.L.; Lysenko, Igor; De Palma, Adriana; Phillips, Helen R. P.; Alhusseini, Tamera I.; Bedford, Felicity E.; Bennett, Dominic J.; Booth, Hollie; Burton, Victoria J.; Chng, Charlotte W. T.; Choimes, Argyrios; Correia, David L.P.

    2017-01-01

    The PREDICTS project (Projecting Responses of Ecological Diversity In Changing Terrestrial Systems; www.predicts.org.uk) has collated from published studies a large, reasonably representative database of comparable samples of biodiversity from multiple sites that differ in the nature or intensity of human impacts relating to land use. We have used this evidence base to develop global and regional statistical models of how local biodiversity responds to these measures. We describe and make free...

  15. Genome sequence and genetic diversity of European ash trees

    DEFF Research Database (Denmark)

    Sollars, Elizabeth S A; Harper, Andrea L; Kelly, Laura J;

    2016-01-01

    Ash trees (genus Fraxinus, family Oleaceae) are widespread throughout the Northern Hemisphere, but are being devastated in Europe by the fungus Hymenoscyphus fraxineus, causing ash dieback, and in North America by the herbivorous beetle Agrilus planipennis. Here we sequence the genome of a low-heterozygosity Fraxinus excelsior tree from Gloucestershire, UK, annotating 38,852 protein-coding genes of which 25% appear ash specific when compared with the genomes of ten other plant species. Analyses of paralogous genes suggest a whole-genome duplication shared with olive (Olea europaea, Oleaceae). We also re-sequence 37 F. excelsior trees from Europe, finding evidence for apparent long-term decline in effective population size. Using our reference sequence, we re-analyse association transcriptomic data, yielding improved markers for reduced susceptibility to ash dieback. Surveys of these markers in British...

  16. The B73 maize genome: complexity, diversity, and dynamics.

    Science.gov (United States)

    Schnable, Patrick S; Ware, Doreen; Fulton, Robert S; Stein, Joshua C; Wei, Fusheng; Pasternak, Shiran; Liang, Chengzhi; Zhang, Jianwei; Fulton, Lucinda; Graves, Tina A; Minx, Patrick; Reily, Amy Denise; Courtney, Laura; Kruchowski, Scott S; Tomlinson, Chad; Strong, Cindy; Delehaunty, Kim; Fronick, Catrina; Courtney, Bill; Rock, Susan M; Belter, Eddie; Du, Feiyu; Kim, Kyung; Abbott, Rachel M; Cotton, Marc; Levy, Andy; Marchetto, Pamela; Ochoa, Kerri; Jackson, Stephanie M; Gillam, Barbara; Chen, Weizu; Yan, Le; Higginbotham, Jamey; Cardenas, Marco; Waligorski, Jason; Applebaum, Elizabeth; Phelps, Lindsey; Falcone, Jason; Kanchi, Krishna; Thane, Thynn; Scimone, Adam; Thane, Nay; Henke, Jessica; Wang, Tom; Ruppert, Jessica; Shah, Neha; Rotter, Kelsi; Hodges, Jennifer; Ingenthron, Elizabeth; Cordes, Matt; Kohlberg, Sara; Sgro, Jennifer; Delgado, Brandon; Mead, Kelly; Chinwalla, Asif; Leonard, Shawn; Crouse, Kevin; Collura, Kristi; Kudrna, Dave; Currie, Jennifer; He, Ruifeng; Angelova, Angelina; Rajasekar, Shanmugam; Mueller, Teri; Lomeli, Rene; Scara, Gabriel; Ko, Ara; Delaney, Krista; Wissotski, Marina; Lopez, Georgina; Campos, David; Braidotti, Michele; Ashley, Elizabeth; Golser, Wolfgang; Kim, HyeRan; Lee, Seunghee; Lin, Jinke; Dujmic, Zeljko; Kim, Woojin; Talag, Jayson; Zuccolo, Andrea; Fan, Chuanzhu; Sebastian, Aswathy; Kramer, Melissa; Spiegel, Lori; Nascimento, Lidia; Zutavern, Theresa; Miller, Beth; Ambroise, Claude; Muller, Stephanie; Spooner, Will; Narechania, Apurva; Ren, Liya; Wei, Sharon; Kumari, Sunita; Faga, Ben; Levy, Michael J; McMahan, Linda; Van Buren, Peter; Vaughn, Matthew W; Ying, Kai; Yeh, Cheng-Ting; Emrich, Scott J; Jia, Yi; Kalyanaraman, Ananth; Hsia, An-Ping; Barbazuk, W Brad; Baucom, Regina S; Brutnell, Thomas P; Carpita, Nicholas C; Chaparro, Cristian; Chia, Jer-Ming; Deragon, Jean-Marc; Estill, James C; Fu, Yan; Jeddeloh, Jeffrey A; Han, Yujun; Lee, Hyeran; Li, Pinghua; Lisch, Damon R; Liu, Sanzhen; Liu, Zhijie; Nagel, Dawn Holligan; 
    McCann, Maureen C; SanMiguel, Phillip; Myers, Alan M; Nettleton, Dan; Nguyen, John; Penning, Bryan W; Ponnala, Lalit; Schneider, Kevin L; Schwartz, David C; Sharma, Anupma; Soderlund, Carol; Springer, Nathan M; Sun, Qi; Wang, Hao; Waterman, Michael; Westerman, Richard; Wolfgruber, Thomas K; Yang, Lixing; Yu, Yeisoo; Zhang, Lifang; Zhou, Shiguo; Zhu, Qihui; Bennetzen, Jeffrey L; Dawe, R Kelly; Jiang, Jiming; Jiang, Ning; Presting, Gernot G; Wessler, Susan R; Aluru, Srinivas; Martienssen, Robert A; Clifton, Sandra W; McCombie, W Richard; Wing, Rod A; Wilson, Richard K

    2009-11-20

    We report an improved draft nucleotide sequence of the 2.3-gigabase genome of maize, an important crop plant and model for biological research. Over 32,000 genes were predicted, of which 99.8% were placed on reference chromosomes. Nearly 85% of the genome is composed of hundreds of families of transposable elements, dispersed nonuniformly across the genome. These were responsible for the capture and amplification of numerous gene fragments and affect the composition, sizes, and positions of centromeres. We also report on the correlation of methylation-poor regions with Mu transposon insertions and recombination, and copy number variants with insertions and/or deletions, as well as how uneven gene losses between duplicated regions were involved in returning an ancient allotetraploid to a genetically diploid state. These analyses inform and set the stage for further investigations to improve our understanding of the domestication and agricultural improvements of maize.

  17. Genome sequence and genetic diversity of European ash trees

    DEFF Research Database (Denmark)

    Sollars, Elizabeth S A; Harper, Andrea L; Kelly, Laura J;

    2016-01-01

    Ash trees (genus Fraxinus, family Oleaceae) are widespread throughout the Northern Hemisphere, but are being devastated in Europe by the fungus Hymenoscyphus fraxineus, causing ash dieback, and in North America by the herbivorous beetle Agrilus planipennis. Here we sequence the genome of a low...... to an emerging health threat in a non-model organism opens the way for mitigation of the epidemic....

  18. Exceptionally diverse morphotypes and genomes of crenarchaeal hyperthermophilic viruses

    DEFF Research Database (Denmark)

    Prangishvili, D; Garrett, R A

    2004-01-01

    crenarchaeal rudiviruses and the large eukaryal DNA viruses: poxviruses, the African swine fever virus and Chlorella viruses. Sequence patterns at the ends of the linear genome of the lipothrixvirus AFV1 are reminiscent of the telomeric ends of linear eukaryal chromosomes and suggest that a primitive telomeric...

  19. Chimpanzee genomic diversity reveals ancient admixture with bonobos

    DEFF Research Database (Denmark)

    de Manuel, Marc; Kuhlwilm, Martin; Frandsen, Peter

    2016-01-01

    Our closest living relatives, chimpanzees and bonobos, have a complex demographic history. We analyzed the high-coverage whole genomes of 75 wild-born chimpanzees and bonobos from 10 countries in Africa. We found that chimpanzee population substructure makes genetic information a good predictor o...

  20. The Genomes On Line Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadata

    Energy Technology Data Exchange (ETDEWEB)

    Fenner, Marsha W; Liolios, Konstantinos; Mavromatis, Konstantinos; Tavernarakis, Nektarios; Kyrpides, Nikos C.

    2007-12-31

    The Genomes On Line Database (GOLD) is a comprehensive resource of information for genome and metagenome projects world-wide. GOLD provides access to complete and ongoing projects and their associated metadata through pre-computed lists and a search page. The database currently incorporates information for more than 2900 sequencing projects, of which 639 have been completed and the data deposited in the public databases. GOLD is constantly expanding to provide metadata information related to the project and the organism and is compliant with the Minimum Information about a Genome Sequence (MIGS) specifications.

  1. The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata

    Energy Technology Data Exchange (ETDEWEB)

    Liolios, Konstantinos; Chen, Amy; Mavromatis, Konstantinos; Tavernarakis, Nektarios; Hugenholtz, Phil; Markowitz, Victor; Kyrpides, Nikos C.

    2009-09-01

    The Genomes On Line Database (GOLD) is a comprehensive resource for centralized monitoring of genome and metagenome projects worldwide. Both complete and ongoing projects, along with their associated metadata, can be accessed in GOLD through precomputed tables and a search page. As of September 2009, GOLD contains information for more than 5800 sequencing projects, of which 1100 have been completed and their sequence data deposited in a public repository. GOLD continues to expand, moving toward the goal of providing the most comprehensive repository of metadata information related to the projects and their organisms/environments in accordance with the Minimum Information about a (Meta)Genome Sequence (MIGS/MIMS) specification.

  2. Endozoicomonas genomes reveal functional adaptation and plasticity in bacterial strains symbiotically associated with diverse marine hosts

    KAUST Repository

    Neave, Matthew J.

    2017-01-17

    Endozoicomonas bacteria are globally distributed and often abundantly associated with diverse marine hosts including reef-building corals, yet their function remains unknown. In this study we generated novel Endozoicomonas genomes from single cells and metagenomes obtained directly from the corals Stylophora pistillata, Pocillopora verrucosa, and Acropora humilis. We then compared these culture-independent genomes to existing genomes of bacterial isolates acquired from a sponge, sea slug, and coral to examine the functional landscape of this enigmatic genus. Sequencing and analysis of single cells and metagenomes resulted in four novel genomes with 60–76% and 81–90% genome completeness, respectively. These data also confirmed that Endozoicomonas genomes are large and are not streamlined for an obligate endosymbiotic lifestyle, implying that they have free-living stages. All genomes show an enrichment of genes associated with carbon sugar transport and utilization and protein secretion, potentially indicating that Endozoicomonas contribute to the cycling of carbohydrates and the provision of proteins to their respective hosts. Importantly, besides these commonalities, the genomes showed evidence for differential functional specificity and diversification, including genes for the production of amino acids. Given this metabolic diversity of Endozoicomonas we propose that different genotypes play disparate roles and have diversified in concert with their hosts.

  3. Lactobacillus paracasei comparative genomics: towards species pan-genome definition and exploitation of diversity.

    Science.gov (United States)

    Smokvina, Tamara; Wels, Michiel; Polka, Justyna; Chervaux, Christian; Brisse, Sylvain; Boekhorst, Jos; van Hylckama Vlieg, Johan E T; Siezen, Roland J

    2013-01-01

    Lactobacillus paracasei is a member of the normal human and animal gut microbiota and is used extensively in the food industry in starter cultures for dairy products or as probiotics. With the development of low-cost, high-throughput sequencing techniques it has become feasible to sequence many different strains of one species and to determine its "pan-genome". We have sequenced the genomes of 34 different L. paracasei strains, and performed a comparative genomics analysis. We analysed genome synteny and content, focussing on the pan-genome, core genome and variable genome. Each genome was shown to contain around 2800-3100 protein-coding genes, and comparative analysis identified over 4200 ortholog groups that comprise the pan-genome of this species, of which about 1800 ortholog groups make up the conserved core. Several factors previously associated with host-microbe interactions such as pili, cell-envelope proteinase, hydrolases p40 and p75 or the capacity to produce short branched-chain fatty acids (bkd operon) are part of the L. paracasei core genome present in all analysed strains. The variome consists mainly of hypothetical proteins, phages, plasmids, transposon/conjugative elements, and known functions such as sugar metabolism, cell-surface proteins, transporters, CRISPR-associated proteins, and EPS biosynthesis proteins. An enormous variety and variability of sugar utilization gene cassettes were identified, with each strain harbouring between 25 and 53 cassettes, reflecting the high adaptability of L. paracasei to different niches. A phylogenomic tree was constructed based on total genome contents, and together with an analysis of horizontal gene transfer events we conclude that evolution of these L. paracasei strains is complex and not always related to niche adaptation. The results of this genome content comparison were used, together with high-throughput growth experiments on various carbohydrates, to perform gene-trait matching analysis, in order to link
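    The pan-genome, core genome and variable genome described above map onto simple set operations once each strain is reduced to the ortholog groups it carries. The sketch below is only an illustration of that idea, not the authors' pipeline: the function name, strain labels and ortholog-group identifiers are hypothetical, and it assumes ortholog groups have already been assigned by some upstream tool.

```python
from typing import Dict, Set, Tuple


def summarize_pan_genome(strain_groups: Dict[str, Set[str]]) -> Tuple[Set[str], Set[str], Set[str]]:
    """Split ortholog groups into pan-genome, core genome and variable genome.

    pan      = groups present in at least one strain (union)
    core     = groups present in every strain (intersection)
    variable = groups in the pan-genome but missing from at least one strain
    """
    group_sets = list(strain_groups.values())
    pan = set().union(*group_sets)
    core = set.intersection(*group_sets) if group_sets else set()
    variable = pan - core
    return pan, core, variable


# Toy presence data for three hypothetical strains.
strains = {
    "strain_A": {"og1", "og2", "og3"},
    "strain_B": {"og1", "og2", "og4"},
    "strain_C": {"og1", "og2", "og3", "og5"},
}

pan, core, variable = summarize_pan_genome(strains)
print(len(pan), sorted(core), sorted(variable))
```

    With real data the same three sets would correspond to the roughly 4200 pan-genome groups, the roughly 1800 core groups, and the remainder reported as the variome.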

  4. Cajal body function in genome organization and transcriptome diversity.

    Science.gov (United States)

    Sawyer, Iain A; Sturgill, David; Sung, Myong-Hee; Hager, Gordon L; Dundr, Miroslav

    2016-12-01

    Nuclear bodies contribute to non-random organization of the human genome and nuclear function. Using a major prototypical nuclear body, the Cajal body, as an example, we suggest that these structures assemble at specific gene loci located across the genome as a result of high transcriptional activity. Subsequently, target genes are physically clustered in close proximity in Cajal body-containing cells. However, Cajal bodies are observed in only a limited number of human cell types, including neuronal and cancer cells. Ultimately, Cajal body depletion perturbs splicing kinetics by reducing target small nuclear RNA (snRNA) transcription and limiting the levels of spliceosomal snRNPs, including their modification and turnover following each round of RNA splicing. As such, Cajal bodies are capable of shaping the chromatin interaction landscape and the transcriptome by influencing spliceosome kinetics. Future studies should concentrate on characterizing the direct influence of Cajal bodies upon snRNA gene transcriptional dynamics.

  5. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects

    Directory of Open Access Journals (Sweden)

    Holt Carson

    2011-12-01

    Background: Second-generation sequencing technologies are precipitating major shifts with regards to what kinds of genomes are being sequenced and how they are annotated. While the first generation of genome projects focused on well-studied model organisms, many of today's projects involve exotic organisms whose genomes are largely terra incognita. This complicates their annotation, because unlike first-generation projects, there are no pre-existing 'gold-standard' gene-models with which to train gene-finders. Improvements in genome assembly and the wide availability of mRNA-seq data are also creating opportunities to update and re-annotate previously published genome annotations. Today's genome projects are thus in need of new genome annotation tools that can meet the challenges and opportunities presented by second-generation sequencing technologies. Results: We present MAKER2, a genome annotation and data management tool designed for second-generation genome projects. MAKER2 is a multi-threaded, parallelized application that can process second-generation datasets of virtually any size. We show that MAKER2 can produce accurate annotations for novel genomes where training-data are limited, of low quality or even non-existent. MAKER2 also provides an easy means to use mRNA-seq data to improve annotation quality; and it can use these data to update legacy annotations, significantly improving their quality. We also show that MAKER2 can evaluate the quality of genome annotations, and identify and prioritize problematic annotations for manual review. Conclusions: MAKER2 is the first annotation engine specifically designed for second-generation genome projects. MAKER2 scales to datasets of any size, requires little in the way of training data, and can use mRNA-seq data to improve annotation quality. It can also update and manage legacy genome annotation datasets.

  6. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects.

    Science.gov (United States)

    Holt, Carson; Yandell, Mark

    2011-12-22

    Second-generation sequencing technologies are precipitating major shifts with regards to what kinds of genomes are being sequenced and how they are annotated. While the first generation of genome projects focused on well-studied model organisms, many of today's projects involve exotic organisms whose genomes are largely terra incognita. This complicates their annotation, because unlike first-generation projects, there are no pre-existing 'gold-standard' gene-models with which to train gene-finders. Improvements in genome assembly and the wide availability of mRNA-seq data are also creating opportunities to update and re-annotate previously published genome annotations. Today's genome projects are thus in need of new genome annotation tools that can meet the challenges and opportunities presented by second-generation sequencing technologies. We present MAKER2, a genome annotation and data management tool designed for second-generation genome projects. MAKER2 is a multi-threaded, parallelized application that can process second-generation datasets of virtually any size. We show that MAKER2 can produce accurate annotations for novel genomes where training-data are limited, of low quality or even non-existent. MAKER2 also provides an easy means to use mRNA-seq data to improve annotation quality; and it can use these data to update legacy annotations, significantly improving their quality. We also show that MAKER2 can evaluate the quality of genome annotations, and identify and prioritize problematic annotations for manual review. MAKER2 is the first annotation engine specifically designed for second-generation genome projects. MAKER2 scales to datasets of any size, requires little in the way of training data, and can use mRNA-seq data to improve annotation quality. It can also update and manage legacy genome annotation datasets.

  7. First genomic survey of human skin fungal diversity

    Science.gov (United States)

    Fungal infections of the skin affect 29 million people in the United States. In the first study of human fungal skin diversity, National Institutes of Health researchers sequenced the DNA of fungi that thrive at different skin sites of healthy adults to d

  8. The family Rhabdoviridae: mono- and bipartite negative-sense RNA viruses with diverse genome organization and common evolutionary origins

    Science.gov (United States)

    Dietzgen, Ralf G.; Kondo, Hideki; Goodin, Michael M.; Kurath, Gael; Vasilakis, Nikos

    2017-01-01

    The family Rhabdoviridae consists of mostly enveloped, bullet-shaped or bacilliform viruses with a negative-sense, single-stranded RNA genome that infect vertebrates, invertebrates or plants. This ecological diversity is reflected by the diversity and complexity of their genomes. Five canonical structural protein genes are conserved in all rhabdoviruses, but may be overprinted, overlapped or interspersed with several novel and diverse accessory genes. This review gives an overview of the characteristics and diversity of rhabdoviruses, their taxonomic classification, replication mechanism, properties of classical rhabdoviruses such as rabies virus and rhabdoviruses with complex genomes, rhabdoviruses infecting aquatic species, and plant rhabdoviruses with both mono- and bipartite genomes.

  9. The life cycle of a genome project: perspectives and guidelines inspired by insect genome projects [version 1; referees: 2 approved, 1 approved with reservations]

    Directory of Open Access Journals (Sweden)

    Alexie Papanicolaou

    2016-01-01

    Many research programs on non-model species biology have been empowered by genomics. In turn, genomics is underpinned by a reference sequence and ancillary information created by so-called “genome projects”. The most reliable genome projects are the ones created as part of an active research program and designed to address specific questions but their life extends past publication. In this opinion paper I outline four key insights that have facilitated maintaining genomic communities: the key role of computational capability, the iterative process of building genomic resources, the value of community participation and the importance of manual curation. Taken together, these ideas can and do ensure the longevity of genome projects and the growing non-model species community can use them to focus a discussion with regards to its future genomic infrastructure.

  10. Genome sequence and genetic diversity of European ash trees.

    Science.gov (United States)

    Sollars, Elizabeth S A; Harper, Andrea L; Kelly, Laura J; Sambles, Christine M; Ramirez-Gonzalez, Ricardo H; Swarbreck, David; Kaithakottil, Gemy; Cooper, Endymion D; Uauy, Cristobal; Havlickova, Lenka; Worswick, Gemma; Studholme, David J; Zohren, Jasmin; Salmon, Deborah L; Clavijo, Bernardo J; Li, Yi; He, Zhesi; Fellgett, Alison; McKinney, Lea Vig; Nielsen, Lene Rostgaard; Douglas, Gerry C; Kjær, Erik Dahl; Downie, J Allan; Boshier, David; Lee, Steve; Clark, Jo; Grant, Murray; Bancroft, Ian; Caccamo, Mario; Buggs, Richard J A

    2017-01-12

    Ash trees (genus Fraxinus, family Oleaceae) are widespread throughout the Northern Hemisphere, but are being devastated in Europe by the fungus Hymenoscyphus fraxineus, causing ash dieback, and in North America by the herbivorous beetle Agrilus planipennis. Here we sequence the genome of a low-heterozygosity Fraxinus excelsior tree from Gloucestershire, UK, annotating 38,852 protein-coding genes of which 25% appear ash specific when compared with the genomes of ten other plant species. Analyses of paralogous genes suggest a whole-genome duplication shared with olive (Olea europaea, Oleaceae). We also re-sequence 37 F. excelsior trees from Europe, finding evidence for apparent long-term decline in effective population size. Using our reference sequence, we re-analyse association transcriptomic data, yielding improved markers for reduced susceptibility to ash dieback. Surveys of these markers in British populations suggest that reduced susceptibility to ash dieback may be more widespread in Great Britain than in Denmark. We also present evidence that susceptibility of trees to H. fraxineus is associated with their iridoid glycoside levels. This rapid, integrated, multidisciplinary research response to an emerging health threat in a non-model organism opens the way for mitigation of the epidemic.

  11. Population Genomics of sub-saharan Drosophila melanogaster: African diversity and non-African admixture.

    Directory of Open Access Journals (Sweden)

    John E Pool

    Drosophila melanogaster has played a pivotal role in the development of modern population genetics. However, many basic questions regarding the demographic and adaptive history of this species remain unresolved. We report the genome sequencing of 139 wild-derived strains of D. melanogaster, representing 22 population samples from the sub-Saharan ancestral range of this species, along with one European population. Most genomes were sequenced above 25X depth from haploid embryos. Results indicated a pervasive influence of non-African admixture in many African populations, motivating the development and application of a novel admixture detection method. Admixture proportions varied among populations, with greater admixture in urban locations. Admixture levels also varied across the genome, with localized peaks and valleys suggestive of a non-neutral introgression process. Genomes from the same location differed starkly in ancestry, suggesting that isolation mechanisms may exist within African populations. After removing putatively admixed genomic segments, the greatest genetic diversity was observed in southern Africa (e.g. Zambia), while diversity in other populations was largely consistent with a geographic expansion from this potentially ancestral region. The European population showed different levels of diversity reduction on each chromosome arm, and some African populations displayed chromosome arm-specific diversity reductions. Inversions in the European sample were associated with strong elevations in diversity across chromosome arms. Genomic scans were conducted to identify loci that may represent targets of positive selection within an African population, between African populations, and between European and African populations. A disproportionate number of candidate selective sweep regions were located near genes with varied roles in gene regulation. Outliers for Europe-Africa F(ST) were found to be enriched in genomic regions of locally

  12. SSR Analysis on Diversity of AA Genome Oryza Species in the Southeast and South Asia

    Institute of Scientific and Technical Information of China (English)

    LU Jian-zhen; ZHANG Xiao-li; WANG Hai-gang; YUAN Xiao-ping; XU Qun; WANG Yi-ping; YU Han-yong; TANG Sheng-xiang; WEI Xing-hua

    2008-01-01

    To investigate genetic diversities among the AA genome Oryza species in the Southeast and South Asia, a total of 428 accessions of the AA genome Oryza species were genotyped using 36 simple sequence repeats (SSR) markers distributed throughout the rice genome. All of the 36 SSR markers generated polymorphic bands, revealing 100% polymorphism. The number of alleles per locus ranged from 3 to 17 with a mean of 8.6. The Nei's genetic diversity index (He) ranged from 0.337 at RM455 to 0.865 at RM169 with an average value of 0.650. The genetic diversity of the AA genome Oryza species in the Southeast Asia was obviously higher than that in the South Asia. Among the detected Oryza species in the South and Southeast Asia, O. rufipogon showed the highest genetic diversity. Meanwhile, a higher genetic differentiation (Fst) was found among the detected Oryza species in the Southeast Asia than in the South Asia. The Fst value between O. nivara and O. sativa was the highest. The results from the number of specific alleles, specific loci, and allele frequency confirmed the greater genetic variation among the detected species. In addition, the specific allele in RM161 displayed higher frequency (0.193), suggesting its important function in identifying Oryza species of AA genome.

  13. SSR Analysis on Diversity of AA Genome Oryza Species in the Southeast and South Asia

    Directory of Open Access Journals (Sweden)

    Jian-zhen LU

    2008-12-01

    To investigate genetic diversities among the AA genome Oryza species in the Southeast and South Asia, a total of 428 accessions of the AA genome Oryza species were genotyped using 36 simple sequence repeats (SSR) markers distributed throughout the rice genome. All of the 36 SSR markers generated polymorphic bands, revealing 100% polymorphism. The number of alleles per locus ranged from 3 to 17 with a mean of 8.6. The Nei's genetic diversity index (He) ranged from 0.337 at RM455 to 0.865 at RM169 with an average value of 0.650. The genetic diversity of the AA genome Oryza species in the Southeast Asia was obviously higher than that in the South Asia. Among the detected Oryza species in the South and Southeast Asia, O. rufipogon showed the highest genetic diversity. Meanwhile, a higher genetic differentiation (Fst) was found among the detected Oryza species in the Southeast Asia than in the South Asia. The Fst value between O. nivara and O. sativa was the highest. The results from the number of specific alleles, specific loci, and allele frequency confirmed the greater genetic variation among the detected species. In addition, the specific allele in RM161 displayed higher frequency (0.193), suggesting its important function in identifying Oryza species of AA genome.

  14. Prospects for the Chinese Human Genome Project (HGP) at the beginning of next century

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    Chinese Human Genome Project (CHGP) as part of the international human genome research has achieved significant progress and created a solid foundation for further development. While participating in the human genome sequencing and gene discovery, the emphasis of CHGP in the next century will be laid on functional genomics. The strategy, resources and some policy issues will be addressed.

  15. Diversity Suppression-Subtractive Hybridization Array for Profiling Genomic DNA Polymorphisms

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    Genomic DNA polymorphisms are very useful for tracing genetic traits and studying biological diversity among species. Here, we present a method we call the "diversity suppression-subtractive hybridization array" for effectively profiling genomic DNA polymorphisms. The method first obtains the subtracted gDNA fragments between any two species by suppression subtraction hybridization (SSH) to establish a subtracted gDNA library, from which diversity SSH arrays are created with the selected subtracted clones. The diversity SSH array hybridizes with the DIG-labeled genomic DNA of the organism to be assayed. Six closely related Dendrobium species were studied as model samples. Four Dendrobium species as testers were used to perform SSH. A total of 617 subtracted positive clones were obtained from four Dendrobium species, and the average ratio of positive clones was 80.3%. We demonstrated that the average percentage of polymorphic fragments of pairwise comparisons of four Dendrobium species was up to 42.4%. A dendrogram of the relatedness of six Dendrobium species was produced according to their polymorphic profiles. The results revealed that the diversity SSH array is a highly effective platform for profiling genomic DNA polymorphisms and dendrograms.

  16. Diversity of 5S rRNA genes within individual prokaryotic genomes.

    Science.gov (United States)

    Pei, Anna; Li, Hongru; Oberdorf, William E; Alekseyenko, Alexander V; Parsons, Tamasha; Yang, Liying; Gerz, Erika A; Lee, Peng; Xiang, Charlie; Nossa, Carlos W; Pei, Zhiheng

    2012-10-01

    We examined intragenomic variation of paralogous 5S rRNA genes to evaluate the concept of ribosomal constraints. In a dataset containing 1161 genomes from 779 unique species, 96 species exhibited > 3% diversity. Twenty-seven species with > 10% diversity contained a total of 421 mismatches between all pairs of the most dissimilar copies of 5S rRNA genes. The large majority (401 of 421) of the diversified positions were conserved at the secondary structure level. The high diversity was associated with partial rRNA operon, split operon, or spacer length-related divergence. In total, these findings indicated that there are tight ribosomal constraints on paralogous 5S rRNA genes in a genome despite the high degree of diversity at the primary structure level. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  17. Documenting genomics: Applying archival theory to preserving the records of the Human Genome Project.

    Science.gov (United States)

    Shaw, Jennifer

    2016-02-01

    The Human Genome Archive Project (HGAP) aimed to preserve the documentary heritage of the UK's contribution to the Human Genome Project (HGP) by using archival theory to develop a suitable methodology for capturing the results of modern, collaborative science. After assessing past projects and different archival theories, the HGAP used an approach based on the theory of documentation strategy to try to capture the records of a scientific project that had an influence beyond the purely scientific sphere. The HGAP was an archival survey that ran for two years. It led to ninety scientists being contacted and has, so far, led to six collections being deposited in the Wellcome Library, with additional collections being deposited in other UK repositories. In applying documentation strategy the HGAP was attempting to move away from traditional archival approaches to science, which have generally focused on retired Nobel Prize winners. It has been partially successful in this aim, having managed to secure collections from people who are not 'big names', but who made an important contribution to the HGP. However, the attempt to redress the gender imbalance in scientific collections and to improve record-keeping in scientific organisations has continued to be difficult to achieve.

  18. Hidden Diversity Revealed : Genomic, Transcriptomic and Functional Studies of Diplomonads

    OpenAIRE

    2012-01-01

    The diplomonads are a diverse group of eukaryotic microbes found in oxygen-limited environments such as the intestine of animals, where they may cause severe disease. Among them, the prominent human parasite Giardia intestinalis non-invasively colonizes the small intestine of humans and animals where it induces the gastrointestinal disease giardiasis. Two of the eight genetic groups of G. intestinalis, assemblage A and B, are known to infect humans and have zoonotic potential. At the start of p...

  19. Corynebacterium diphtheriae: genome diversity, population structure and genotyping perspectives.

    Science.gov (United States)

    Mokrousov, Igor

    2009-01-01

    The epidemic re-emergence of diphtheria in Russia and the Newly Independent States (NIS) of the former Soviet Union in the 1990s demonstrated the continued threat of this disease, which had been thought to be rare. The bacteriophage-encoded toxin is a main virulence factor of Corynebacterium diphtheriae; however, an analysis of the first complete genome sequence of C. diphtheriae revealed a recent acquisition of other pathogenicity factors including iron-uptake systems, adhesins and fimbrial proteins, as indeed this extracellular pathogen has more possibilities for lateral gene transfer than, e.g., its close relative, the mainly intracellular Mycobacterium tuberculosis. C. diphtheriae appears to have a phylogeographical structure mainly represented by area-specific variants whose circulation is under strong influence of human host factors, including health control measures, first of all vaccination, and socioeconomic conditions. This framework core population structure may be challenged by importation of the endemic and eventually toxigenic strains from new areas, thus leading to localized or large epidemics caused directly by imported strains or by bacteriophage-lysogenized indigenous strains converted into toxin production. A feature of C. diphtheriae co-existence with humans is its periodicity: following the large epidemic in the 1990s, the present period is marked by increasing heterogeneity of the circulating populations, whereas re-emergence of new toxigenic variants along with persistent circulation of invasive non-toxigenic strains appears alarming. To identify and rapidly monitor subtle changes in the genome structure at an infraclonal level during and between epidemics, portable and discriminatory typing methods of C. diphtheriae are still needed. In this view, CRISPRs and minisatellites are promising genomic markers for development of high-resolution typing schemes and databasing of C. diphtheriae.

  20. A whole-genome microarray reveals genetic diversity among Helicobacter pylori strains

    OpenAIRE

    Salama, Nina; Guillemin, Karen; McDaniel, Timothy K.; Sherlock, Gavin; Tompkins, Lucy; Falkow, Stanley

    2000-01-01

    Helicobacter pylori colonizes the stomach of half of the world's population, causing a wide spectrum of disease ranging from asymptomatic gastritis to ulcers to gastric cancer. Although the basis for these diverse clinical outcomes is not understood, more severe disease is associated with strains harboring a pathogenicity island. To characterize the genetic diversity of more and less virulent strains, we examined the genomic content of 15 H. pylori clinical isolate...

  1. Genetic diversity in the modern horse illustrated from genome-wide SNP data.

    Directory of Open Access Journals (Sweden)

    Jessica L Petersen

    Horses were domesticated from the Eurasian steppes 5,000-6,000 years ago. Since then, the use of horses for transportation, warfare, and agriculture, as well as selection for desired traits and fitness, has resulted in diverse populations distributed across the world, many of which have become or are in the process of becoming formally organized into closed, breeding populations (breeds). This report describes the use of a genome-wide set of autosomal SNPs and 814 horses from 36 breeds to provide the first detailed description of equine breed diversity. FST calculations, parsimony, and distance analysis demonstrated relationships among the breeds that largely reflect geographic origins and known breed histories. Low levels of population divergence were observed between breeds that are relatively early on in the process of breed development, and between those with high levels of within-breed diversity, whether due to large population size, ongoing outcrossing, or large within-breed phenotypic diversity. Populations with low within-breed diversity included those which have experienced population bottlenecks, have been under intense selective pressure, or are closed populations with long breed histories. These results provide new insights into the relationships among and the diversity within breeds of horses. In addition these results will facilitate future genome-wide association studies and investigations into genomic targets of selection.

  2. Genetic Diversity in the Modern Horse Illustrated from Genome-Wide SNP Data

    Science.gov (United States)

    Petersen, Jessica L.; Mickelson, James R.; Cothran, E. Gus; Andersson, Lisa S.; Axelsson, Jeanette; Bailey, Ernie; Bannasch, Danika; Binns, Matthew M.; Borges, Alexandre S.; Brama, Pieter; da Câmara Machado, Artur; Distl, Ottmar; Felicetti, Michela; Fox-Clipsham, Laura; Graves, Kathryn T.; Guérin, Gérard; Haase, Bianca; Hasegawa, Telhisa; Hemmann, Karin; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Lohi, Hannes; Lopes, Maria Susana; McGivney, Beatrice A.; Mikko, Sofia; Orr, Nicholas; Penedo, M. Cecilia T; Piercy, Richard J.; Raekallio, Marja; Rieder, Stefan; Røed, Knut H.; Silvestrelli, Maurizio; Swinburne, June; Tozaki, Teruaki; Vaudin, Mark; M. Wade, Claire; McCue, Molly E.

    2013-01-01

    Horses were domesticated from the Eurasian steppes 5,000–6,000 years ago. Since then, the use of horses for transportation, warfare, and agriculture, as well as selection for desired traits and fitness, has resulted in diverse populations distributed across the world, many of which have become or are in the process of becoming formally organized into closed, breeding populations (breeds). This report describes the use of a genome-wide set of autosomal SNPs and 814 horses from 36 breeds to provide the first detailed description of equine breed diversity. FST calculations, parsimony, and distance analysis demonstrated relationships among the breeds that largely reflect geographic origins and known breed histories. Low levels of population divergence were observed between breeds that are relatively early on in the process of breed development, and between those with high levels of within-breed diversity, whether due to large population size, ongoing outcrossing, or large within-breed phenotypic diversity. Populations with low within-breed diversity included those which have experienced population bottlenecks, have been under intense selective pressure, or are closed populations with long breed histories. These results provide new insights into the relationships among and the diversity within breeds of horses. In addition these results will facilitate future genome-wide association studies and investigations into genomic targets of selection. PMID:23383025

  3. Tomato Fruits Show Wide Phenomic Diversity but Fruit Developmental Genes Show Low Genomic Diversity.

    Science.gov (United States)

    Mohan, Vijee; Gupta, Soni; Thomas, Sherinmol; Mickey, Hanjabam; Charakana, Chaitanya; Chauhan, Vineeta Singh; Sharma, Kapil; Kumar, Rakesh; Tyagi, Kamal; Sarma, Supriya; Gupta, Suresh Kumar; Kilambi, Himabindu Vasuki; Nongmaithem, Sapana; Kumari, Alka; Gupta, Prateek; Sreelakshmi, Yellamaraju; Sharma, Rameshwar

    2016-01-01

    Domestication of tomato has resulted in large diversity in fruit phenotypes. An intensive phenotyping of 127 tomato accessions from 20 countries revealed extensive morphological diversity in fruit traits. The diversity in fruit traits clustered the accessions into nine classes and identified certain promising lines having desirable traits pertaining to total soluble salts (TSS), carotenoids, ripening index, weight and shape. Factor analysis of the morphometric data from Tomato Analyzer showed that the fruit shape is a complex trait shared by several factors. The 100% variance between round and flat fruit shapes was explained by one discriminant function having a canonical correlation of 0.874 by stepwise discriminant analysis. A set of 10 genes (ACS2, COP1, CYC-B, RIN, MSH2, NAC-NOR, PHOT1, PHYA, PHYB and PSY1) involved in various plant developmental processes were screened for SNP polymorphism by EcoTILLING. The genetic diversity in these genes revealed a total of 36 non-synonymous and 18 synonymous changes leading to the identification of 28 haplotypes. The average frequency of polymorphism across the genes was 0.038/Kb. A significant negative Tajima's D statistic in two of the genes, ACS2 and PHOT1, indicated the presence of rare alleles in low frequency. Our study indicates that while there is low polymorphic diversity in the genes regulating plant development, the population shows wider phenotype diversity. Nonetheless, morphological and genetic diversity of the present collection can be further exploited as potential resources in future.

  4. The database of the PREDICTS (Projecting Responses of Ecological Diversity In Changing Terrestrial Systems) project

    OpenAIRE

    Hudson, Lawrence N; Newbold, Tim; Contu, Sara; Hill, Samantha L L; Lysenko, Igor; De Palma, Adriana; Phillips, Helen R P; Alhusseini, Tamera I.; Bedford, Felicity E.; Bennett, Dominic J.; Booth, Hollie; Burton, Victoria J.; Chng, Charlotte W. T.; Choimes, Argyrios; Correia, David L.P.

    2016-01-01

    The PREDICTS project—Projecting Responses of Ecological Diversity In Changing Terrestrial Systems (www.predicts.org.uk)—has collated from published studies a large, reasonably representative database of comparable samples of biodiversity from multiple sites that differ in the nature or intensity of human impacts relating to land use. We have used this evidence base to develop global and regional statistical models of how local biodiversity responds to these measures. We describe and make free...

  5. The database of the PREDICTS (Projecting Responses of Ecological Diversity In Changing Terrestrial Systems) project

    OpenAIRE

    Hudson, Lawrence N; Newbold, Tim; Contu, Sara; Hill, Samantha L.L.; Lysenko, Igor; De Palma, Adriana; Phillips, Helen R. P.; Alhusseini, Tamera I.; Bedford, Felicity E.; Bennett, Dominic J.; Booth, Hollie; Burton, Victoria J.; Chng, Charlotte W. T.; Choimes, Argyrios; Correia, David L.P.

    2016-01-01

    The PREDICTS project—Projecting Responses of Ecological Diversity In Changing Terrestrial Systems (www.predicts.org.uk)—has collated from published studies a large, reasonably representative database of comparable samples of biodiversity from multiple sites that differ in the nature or intensity of human impacts relating to land use. We have used this evidence base to develop global and regional statistical models of how local biodiversity responds to these measures. We describe and ...

  6. The database of the PREDICTS (Projecting Responses of Ecological Diversity In Changing Terrestrial Systems) project

    OpenAIRE

    Hudson, Lawrence N; Newbold, Tim; Contu, Sara; Hill, Samantha L L; Lysenko, Igor; De Palma, Adriana; Phillips, Helen R P; Alhusseini, Tamera I.; Bedford, Felicity E.; Bennett, Dominic J.; Booth, Hollie; Burton, Victoria J.; Chng, Charlotte W. T.; Choimes, Argyrios; Correia, David L.P.

    2017-01-01

    The PREDICTS project—Projecting Responses of Ecological Diversity In Changing Terrestrial Systems (www.predicts.org.uk)—has collated from published studies a large, reasonably representative database of comparable samples of biodiversity from multiple sites that differ in the nature or intensity of human impacts relating to land use. We have used this evidence base to develop global and regional statistical models of how local biodiversity responds to these measures. We describe and make free...

  7. Lactobacillus paracasei comparative genomics: towards species pan-genome definition and exploitation of diversity.

    Directory of Open Access Journals (Sweden)

    Tamara Smokvina

    Lactobacillus paracasei is a member of the normal human and animal gut microbiota and is used extensively in the food industry in starter cultures for dairy products or as probiotics. With the development of low-cost, high-throughput sequencing techniques it has become feasible to sequence many different strains of one species and to determine its "pan-genome". We have sequenced the genomes of 34 different L. paracasei strains, and performed a comparative genomics analysis. We analysed genome synteny and content, focussing on the pan-genome, core genome and variable genome. Each genome was shown to contain around 2800-3100 protein-coding genes, and comparative analysis identified over 4200 ortholog groups that comprise the pan-genome of this species, of which about 1800 ortholog groups make up the conserved core. Several factors previously associated with host-microbe interactions such as pili, cell-envelope proteinase, hydrolases p40 and p75 or the capacity to produce short branched-chain fatty acids (bkd operon) are part of the L. paracasei core genome present in all analysed strains. The variome consists mainly of hypothetical proteins, phages, plasmids, transposon/conjugative elements, and known functions such as sugar metabolism, cell-surface proteins, transporters, CRISPR-associated proteins, and EPS biosynthesis proteins. An enormous variety and variability of sugar utilization gene cassettes were identified, with each strain harbouring between 25 and 53 cassettes, reflecting the high adaptability of L. paracasei to different niches. A phylogenomic tree was constructed based on total genome contents, and together with an analysis of horizontal gene transfer events we conclude that evolution of these L. paracasei strains is complex and not always related to niche adaptation. The results of this genome content comparison were used, together with high-throughput growth experiments on various carbohydrates, to perform gene-trait matching analysis

  8. Insights into the genetic structure and diversity of 38 South Asian Indians from deep whole-genome sequencing.

    Directory of Open Access Journals (Sweden)

    Lai-Ping Wong

    2014-05-01

    South Asia possesses a significant amount of genetic diversity due to considerable intergroup differences in culture and language. There have been numerous reports on the genetic structure of Asian Indians, although these have mostly relied on genotyping microarrays or targeted sequencing of the mitochondria and Y chromosomes. Asian Indians in Singapore are primarily descendants of immigrants from Dravidian-language-speaking states in south India, and 38 individuals from the general population underwent deep whole-genome sequencing with a target coverage of 30X as part of the Singapore Sequencing Indian Project (SSIP). The genetic structure and diversity of these samples were compared against samples from the Singapore Sequencing Malay Project and populations in Phase 1 of the 1,000 Genomes Project (1 KGP). SSIP samples exhibited greater intra-population genetic diversity and possessed higher heterozygous-to-homozygous genotype ratio than other Asian populations. When compared against a panel of well-defined Asian Indians, the genetic makeup of the SSIP samples was closely related to South Indians. However, even though the SSIP samples clustered distinctly from the Europeans in the global population structure analysis with autosomal SNPs, eight samples were assigned to mitochondrial haplogroups that were predominantly present in Europeans and possessed higher European admixture than the remaining samples. An analysis of the relative relatedness between SSIP with two archaic hominins (Denisovan, Neanderthal) identified higher ancient admixture in East Asian populations than in SSIP. The data resource for these samples is publicly available and is expected to serve as a valuable complement to the South Asian samples in Phase 3 of 1 KGP.

  9. The Kipawa River versus the Tabaret River diversion projects

    Energy Technology Data Exchange (ETDEWEB)

    Karwacki, P. [Ottawa, ON (Canada)

    2003-08-01

    Hydro-Quebec wants to divert the Kipawa River in northwest Quebec from its natural streambed. While the first-time visitor is likely to emphatically proclaim the Kipawa River as the most beautiful, most serene place they have ever encountered, hydro consultants and engineers, disconnected from the attractiveness of that place, are making cost/benefit recommendations that marginalize the inherent value of a free-flowing Kipawa. This paper will discuss the following points: (1) The Kipawa River has its own inherent value, which is related to the cost of simulating threatened white-water habitats in general. (2) The costs of recreating white-water habitats are more understandable through the study of man-made white-water venues. (3) The cost to recreate or simulate a threatened white-water habitat should be factored into the cost of the hydro-project feasibility. The Kipawa River's own inherent value should be factored into the cost of the Tabaret Diversion Project. (4) Methods of gaining community acceptance should be public and open: independent third-party arbitration is recommended. Use of monetary incentives to encourage public acceptance is unethical, immoral and unjustly biased against the survival of white-water habitats. (5) Recreational uses of white-water habitats, like the Kipawa River, are increasingly important engines of economic growth in Canada and around the world. (author)

  10. Whole-genome sequencing of uropathogenic Escherichia coli reveals long evolutionary history of diversity and virulence.

    Science.gov (United States)

    Lo, Yancy; Zhang, Lixin; Foxman, Betsy; Zöllner, Sebastian

    2015-08-01

    Uropathogenic Escherichia coli (UPEC) are phenotypically and genotypically very diverse. This diversity makes it challenging to understand the evolution of UPEC adaptations responsible for causing urinary tract infections (UTI). To gain insight into the relationship between evolutionary divergence and adaptive paths to uropathogenicity, we sequenced at deep coverage (190×) the genomes of 19 E. coli strains from urinary tract infection patients from the same geographic area. Our sample consisted of 14 UPEC isolates and 5 non-UTI-causing (commensal) rectal E. coli isolates. After identifying strain variants using de novo assembly-based methods, we clustered the strains based on pairwise sequence differences using a neighbor-joining algorithm. We examined evolutionary signals on the whole-genome phylogeny and contrasted these signals with those found on gene trees constructed based on specific uropathogenic virulence factors. The whole-genome phylogeny showed that the divergence between UPEC and commensal E. coli strains without known UPEC virulence factors happened over 32 million generations ago. Pairwise diversity between any two strains was also high, suggesting multiple genetic origins of uropathogenic strains in a small geographic region. Contrasting the whole-genome phylogeny with three gene trees constructed from common uropathogenic virulence factors, we detected no selective advantage of these virulence genes over other genomic regions. These results suggest that UPEC acquired uropathogenicity a long time ago and used it opportunistically to cause extraintestinal infections.

  11. A genome-wide analysis of genetic diversity in Trypanosoma cruzi intergenic regions.

    Directory of Open Access Journals (Sweden)

    Leonardo G Panunzi

    2014-05-01

    BACKGROUND: Trypanosoma cruzi is the causal agent of Chagas Disease. Recently, the genomes of representative strains from two major evolutionary lineages were sequenced, allowing the construction of a detailed genetic diversity map for this important parasite. However, this map is focused on coding regions of the genome, leaving a vast space of regulatory regions uncharacterized in terms of their evolutionary conservation and/or divergence. METHODOLOGY: Using data from the hybrid CL Brener and Sylvio X10 genomes (from the TcVI and TcI Discrete Typing Units, respectively), we identified intergenic regions that share a common evolutionary ancestry, and are present in both CL Brener haplotypes (TcII-like and TcIII-like) and in the TcI genome; as well as intergenic regions that were conserved in only two of the three genomes/haplotypes analyzed. The genetic diversity in these regions was characterized in terms of the accumulation of indels and nucleotide changes. PRINCIPAL FINDINGS: Based on this analysis we have identified (i) a core of highly conserved intergenic regions, which remained essentially unchanged in independently evolving lineages; (ii) intergenic regions that show high diversity in spite of still retaining their corresponding upstream and downstream coding sequences; (iii) a number of defined sequence motifs that are shared by a number of unrelated intergenic regions. A fraction of indels explains the diversification of some intergenic regions by the expansion/contraction of microsatellite-like repeats.

  12. Expanding the diversity of mycobacteriophages: insights into genome architecture and evolution.

    Directory of Open Access Journals (Sweden)

    Welkin H Pope

    Mycobacteriophages are viruses that infect mycobacterial hosts such as Mycobacterium smegmatis and Mycobacterium tuberculosis. All mycobacteriophages characterized to date are dsDNA tailed phages, and have either siphoviral or myoviral morphotypes. However, their genetic diversity is considerable, and although sixty-two genomes have been sequenced and comparatively analyzed, these likely represent only a small portion of the diversity of the mycobacteriophage population at large. Here we report the isolation, sequencing and comparative genomic analysis of 18 new mycobacteriophages isolated from geographically distinct locations within the United States. Although no clear correlation between location and genome type can be discerned, these genomes expand our knowledge of mycobacteriophage diversity and enhance our understanding of the roles of mobile elements in viral evolution. Expansion of the number of mycobacteriophages grouped within Cluster A provides insights into the basis of immune specificity in these temperate phages, and we also describe a novel example of apparent immunity theft. The isolation and genomic analysis of bacteriophages by freshman college students provides an example of an authentic research experience for novice scientists.

  13. Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes Fungi

    NARCIS (Netherlands)

    Ohm, R.A.; Feau, N.; Henrissat, B.; Schoch, C.L.; Horwitz, B.A.; Barry, K.W.; Condon, B.J.; Copeland, A.C.; Dhillon, B.; Glaser, F.; Hesse, C.N.; Kosti, I.; LaButti, K.; Lindquist, E.A.; Lucas, S.; Salamov, A.A.; Bradshaw, R.E.; Ciuffetti, L.; Hamelin, R.C.; Kema, G.H.J.; Lawrence, C.; Scott, J.A.; Spatafora, J.W.; Turgeon, B.G.; Wit, de P.J.G.M.; Zhong, S.; Goodwin, S.B.; Grigoriev, I.V.

    2012-01-01

    The class Dothideomycetes is one of the largest groups of fungi with a high level of ecological diversity including many plant pathogens infecting a broad range of hosts. Here, we compare genome features of 18 members of this class, including 6 necrotrophs, 9 (hemi)biotrophs and 3 saprotrophs, to

  14. Consequences for diversity when prioritizing animals for conservation with pedigree or genomic information

    NARCIS (Netherlands)

    Engelsma, K.A.; Veerkamp, R.F.; Calus, M.P.L.; Windig, J.J.

    2011-01-01

    Up to now, prioritization of animals for conservation has been mainly based on pedigree information; however, genomic information may improve prioritization. In this study, we used two Holstein populations to investigate the consequences for genetic diversity when animals are prioritized with

  15. Genome-wide distribution of genetic diversity and linkage disequilibrium in elite sugar beet germplasm

    Directory of Open Access Journals (Sweden)

    Weißleder Knuth

    2011-10-01

    Full Text Available Abstract. Background: Characterization of population structure and genetic diversity of germplasm is essential for the efficient organization and utilization of breeding material. The objectives of this study were to (i) explore the patterns of population structure in the pollen parent heterotic pool using different methods, (ii) investigate the genome-wide distribution of genetic diversity, and (iii) assess the extent and genome-wide distribution of linkage disequilibrium (LD) in elite sugar beet germplasm. Results: A total of 264 and 238 inbred lines from the yield type and sugar type inbreds of the pollen parent heterotic gene pools, respectively, which had been genotyped with 328 SNP markers, were used in this study. Two distinct subgroups were detected based on different statistical methods within the elite sugar beet germplasm set, which was in accordance with its breeding history. MCLUST based on principal components, principal coordinates, or lapvectors had high correspondence with the germplasm type information as well as the assignment by STRUCTURE, which indicated that these methods might be alternatives to STRUCTURE for population structure analysis. Gene diversity and modified Rogers' distance between the examined germplasm types varied considerably across the genome, which might be due to artificial selection. This observation indicates that population genetic approaches could be used to identify candidate genes for the traits under selection. Because r² > 0.8 is required to detect marker-phenotype associations explaining less than 1% of the phenotypic variance, our observation of a low proportion of SNP loci pairs showing such levels of LD suggests that the number of markers has to be dramatically increased for powerful genome-wide association mapping. Conclusions: We provided a genome-wide distribution map of genetic diversity and linkage disequilibrium for the elite sugar beet germplasm, which is useful for the application of
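
The r² criterion mentioned in this abstract can be illustrated with a minimal sketch. It assumes fully homozygous inbred lines, so each SNP can be coded 0/1 per line and treated as a haplotype allele; the function names and the input layout are hypothetical and are not taken from the study.

```python
from itertools import combinations


def r_squared(locus_a, locus_b):
    """Squared allelic correlation (r^2) between two biallelic loci.

    Each locus is a list of 0/1 alleles, one entry per inbred line.
    Returns None when either locus is monomorphic (r^2 is undefined).
    """
    n = len(locus_a)
    if n == 0 or n != len(locus_b):
        raise ValueError("loci must be non-empty and of equal length")
    p_a = sum(locus_a) / n
    p_b = sum(locus_b) / n
    p_ab = sum(1 for a, b in zip(locus_a, locus_b) if a == 1 and b == 1) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return None
    d = p_ab - p_a * p_b  # linkage disequilibrium coefficient D
    return d * d / denom


def fraction_in_high_ld(loci, threshold=0.8):
    """Share of polymorphic locus pairs whose r^2 exceeds the threshold."""
    values = [r_squared(loci[i], loci[j])
              for i, j in combinations(range(len(loci)), 2)]
    values = [v for v in values if v is not None]
    if not values:
        return 0.0
    return sum(1 for v in values if v > threshold) / len(values)
```

A low value of `fraction_in_high_ld` on a marker panel corresponds to the situation the abstract describes, where few SNP pairs reach the LD level needed for association mapping.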

  16. Genetic Diversity and Reassortment of Hantaan Virus Tripartite RNA Genomes in Nature, the Republic of Korea.

    Directory of Open Access Journals (Sweden)

    Jeong-Ah Kim

    2016-06-01

    Full Text Available Hantaan virus (HTNV), a negative-sense tripartite RNA virus of the family Bunyaviridae, is the most prevalent hantavirus in the Republic of Korea (ROK). It is the causative agent of Hemorrhagic Fever with Renal Syndrome (HFRS) in humans and is maintained in the striped field mouse, Apodemus agrarius, the primary zoonotic host. Clinical HFRS cases have been reported commonly in HFRS-endemic areas of Gyeonggi province. Recently, the death of a member of the ROK military from Gangwon province due to HFRS prompted an investigation of the epidemiology and distribution of hantaviruses in Gangwon and Gyeonggi provinces, which border the demilitarized zone separating North and South Korea. To elucidate the geographic distribution and molecular diversity of HTNV, whole genome sequences of HTNV Large (L), Medium (M), and Small (S) segments were acquired from lung tissues of A. agrarius captured from 2003-2014. Consistent with the clinical incidence of HFRS established by the Korea Centers for Disease Control & Prevention (KCDC), the prevalence of HTNV in naturally infected mice in Gangwon province was lower than for Gyeonggi province. Whole genomic sequences of 34 HTNV strains were identified and a phylogenetic analysis showed geographic diversity of the virus in the limited areas. Reassortment analysis first suggested an occurrence of genetic exchange of HTNV genomes in nature in the ROK. This study is the first report to demonstrate the molecular prevalence of HTNV in Gangwon province. Whole genome sequencing of HTNV showed well-supported geographic lineages and the molecular diversity in the northern region of ROK due to a natural reassortment of HTNV genomes. These observations contribute to a better understanding of the genetic diversity and molecular evolution of hantaviruses. Also, the full-length HTNV tripartite genomes will provide a database for phylogeographic analysis of spatial and temporal outbreaks of hantavirus infection.

  17. Genetic diversity analysis of two commercial breeds of pigs using genomic and pedigree data.

    Science.gov (United States)

    Zanella, Ricardo; Peixoto, Jane O; Cardoso, Fernando F; Cardoso, Leandro L; Biegelmeyer, Patrícia; Cantão, Maurício E; Otaviano, Antonio; Freitas, Marcelo S; Caetano, Alexandre R; Ledur, Mônica C

    2016-03-30

    Genetic improvement in livestock populations can be achieved without significantly affecting genetic diversity if mating systems and selection decisions take genetic relationships among individuals into consideration. The objective of this study was to examine the genetic diversity of two commercial breeds of pigs. Genotypes from 1168 Landrace (LA) and 1094 Large White (LW) animals from a commercial breeding program in Brazil were obtained using the Illumina PorcineSNP60 Beadchip. Inbreeding estimates based on pedigree (F_x) and genomic information using runs of homozygosity (F_ROH) and the single nucleotide polymorphisms (SNP) by SNP inbreeding coefficient (F_SNP) were obtained. Linkage disequilibrium (LD), correlation of linkage phase (r) and effective population size (N_e) were also estimated. Estimates of inbreeding obtained with pedigree information were lower than those obtained with genomic data in both breeds. We observed that the extent of LD was slightly larger at shorter distances between SNPs in the LW population than in the LA population, which indicates that the LW population was derived from a smaller N_e. Estimates of N_e based on genomic data were equal to 53 and 40 for the current populations of LA and LW, respectively. The correlation of linkage phase between the two breeds was equal to 0.77 at distances up to 50 kb, which suggests that genome-wide association and selection should be performed within breed. Although selection intensities have been stronger in the LA breed than in the LW breed, levels of genomic and pedigree inbreeding were lower for the LA than for the LW breed. The use of genomic data to evaluate population diversity in livestock animals can provide new and more precise insights about the effects of intense selection for production traits. Resulting information and knowledge can be used to effectively increase response to selection by appropriately managing the rate of inbreeding, minimizing negative effects of inbreeding
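
As an illustration of the runs-of-homozygosity inbreeding coefficient named in this abstract, the sketch below uses the common definition of F_ROH as the fraction of the genome covered by ROH segments. The segment format, the minimum-length filter and the function name are assumptions for illustration; the study's own ROH-calling parameters are not given in the record.

```python
def f_roh(roh_segments, genome_length_bp, min_length_bp=0):
    """Fraction of the genome covered by runs of homozygosity.

    roh_segments: iterable of (start_bp, end_bp) tuples for one animal,
    assumed non-overlapping. Segments shorter than min_length_bp are ignored.
    """
    if genome_length_bp <= 0:
        raise ValueError("genome_length_bp must be positive")
    covered = sum(end - start
                  for start, end in roh_segments
                  if end - start >= min_length_bp)
    return covered / genome_length_bp
```

For example, two segments of 1 Mb and 2 Mb on a 100 Mb genome give an F_ROH of 3/100.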

  18. Impact of marker ascertainment bias on genomic selection accuracy and estimates of genetic diversity.

    Directory of Open Access Journals (Sweden)

    Nicolas Heslot

    Full Text Available Genome-wide molecular markers are often used to evaluate genetic diversity in germplasm collections and for making genomic selections in breeding programs. To accurately predict phenotypes and assay genetic diversity, molecular markers should assay a representative sample of the polymorphisms in the population under study. Ascertainment bias arises when marker data is not obtained from a random sample of the polymorphisms in the population of interest. Genotyping-by-sequencing (GBS) is rapidly emerging as a low-cost genotyping platform, even for the large, complex, and polyploid wheat (Triticum aestivum L.) genome. With GBS, marker discovery and genotyping occur simultaneously, resulting in minimal ascertainment bias. The previous platform of choice for whole-genome genotyping in many species such as wheat was DArT (Diversity Array Technology), which has formed the basis of most of our knowledge about cereal genetic diversity. This study compared GBS and DArT marker platforms for measuring genetic diversity and genomic selection (GS) accuracy in elite U.S. soft winter wheat. From a set of 365 breeding lines, 38,412 single nucleotide polymorphism GBS markers were discovered and genotyped. The GBS SNPs gave a higher GS accuracy than 1,544 DArT markers on the same lines, despite 43.9% missing data. Using a bootstrap approach, we observed significantly more clustering of markers and ascertainment bias with DArT relative to GBS. The minor allele frequency distribution of GBS markers had a deficit of rare variants compared to DArT markers. Despite the ascertainment bias of the DArT markers, GS accuracy for three traits out of four was not significantly different when an equal number of markers were used for each platform. This suggests that the gain in accuracy observed using GBS compared to DArT markers was mainly due to a large increase in the number of markers available for the analysis.

  19. The diversity of a distributed genome in bacterial populations

    CERN Document Server

    Baumdicker, F; Pfaffelhuber, P

    2009-01-01

    The distributed genome hypothesis states that the set of genes in a population of bacteria is distributed over all individuals that belong to the specific taxon. It implies that certain genes can be gained and lost from generation to generation. We use the random genealogy given by a Kingman coalescent in order to superimpose events of gene gain and loss along ancestral lines. Gene gains occur at constant rate along ancestral lines. We assume that gained genes have never been present in the population before. Gene losses occur at a rate proportional to the number of genes present along the ancestral line. In this "infinitely many genes model" we derive moments for several statistics within a sample: the average number of genes per individual, the average number of genes differing between individuals, the number of incongruent pairs of genes, the total number of different genes in the sample and the gene frequency spectrum. We demonstrate that the model gives a reasonable fit with gene frequency data from mari...
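
Several of the sample statistics listed in this abstract can be computed directly from observed gene content. The sketch below is only an empirical counterpart, not the paper's derivation of moments under the coalescent model. It assumes each genome is given as a set of gene identifiers and interprets "genes differing between individuals" as the size of the symmetric difference; both the function name and that interpretation are assumptions.

```python
from itertools import combinations


def gene_content_stats(genomes):
    """Summary statistics for a sample of genomes given as sets of gene IDs.

    Returns the average number of genes per individual, the average number
    of genes differing between pairs of individuals, the total number of
    distinct genes, and the gene frequency spectrum, where spectrum[k] is
    the number of genes present in exactly k individuals.
    """
    n = len(genomes)
    if n < 2:
        raise ValueError("need at least two genomes")
    avg_genes = sum(len(g) for g in genomes) / n
    pairs = list(combinations(genomes, 2))
    avg_diff = sum(len(a ^ b) for a, b in pairs) / len(pairs)
    all_genes = set().union(*genomes)
    spectrum = [0] * (n + 1)
    for gene in all_genes:
        spectrum[sum(gene in g for g in genomes)] += 1
    return {
        "avg_genes": avg_genes,
        "avg_pairwise_diff": avg_diff,
        "total_genes": len(all_genes),
        "spectrum": spectrum,
    }
```

Genes counted in `spectrum[n]` are present in every sampled individual, while `spectrum[1]` counts genes seen in a single individual.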

  20. Diversity of 23S rRNA genes within individual prokaryotic genomes.

    Directory of Open Access Journals (Sweden)

    Anna Pei

    Full Text Available BACKGROUND: The concept of ribosomal constraints on rRNA genes is deduced primarily based on the comparison of consensus rRNA sequences between closely related species, but recent advances in whole-genome sequencing allow evaluation of this concept within organisms with multiple rRNA operons. METHODOLOGY/PRINCIPAL FINDINGS: Using the 23S rRNA gene as an example, we analyzed the diversity among individual rRNA genes within a genome. Of 184 prokaryotic species containing multiple 23S rRNA genes, diversity was observed in 113 (61.4%) genomes (mean 0.40%, range 0.01%-4.04%). Significant (1.17%-4.04%) intragenomic variation was found in 8 species. In 5 of the 8 species, the diversity in the primary structure had only minimal effect on the secondary structure (stem versus loop transition). In the remaining 3 species, the diversity significantly altered local secondary structure, but the alteration appears minimized through complex rearrangement. Intervening sequences (IVS), ranging between 9 and 1471 nt in size, were found in 7 species. IVS in Deinococcus radiodurans and Nostoc sp. encode transposases. T. tengcongensis was the only species in which intragenomic diversity >3% was observed among 4 paralogous 23S rRNA genes. CONCLUSIONS/SIGNIFICANCE: These findings indicate tight ribosomal constraints on individual 23S rRNA genes within a genome. Although classification using primary 23S rRNA sequences could be erroneous, significant diversity among paralogous 23S rRNA genes was observed only once in the 184 species analyzed, indicating little overall impact on the mainstream of 23S rRNA gene-based prokaryotic taxonomy.
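
One simple way to express intragenomic diversity as a percentage is the pairwise mismatch rate between aligned gene copies from the same genome. The sketch below assumes pre-aligned sequences of equal length, skips gap columns, and reports the largest pairwise value. These choices and the function names are assumptions; the record does not state how the study defined its percentage.

```python
from itertools import combinations


def percent_difference(seq_a, seq_b, gap="-"):
    """Percent of mismatching positions between two aligned sequences.

    Columns where either sequence has a gap character are not compared.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    compared = mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * mismatches / compared


def intragenomic_diversity(copies):
    """Largest pairwise percent difference among gene copies of one genome."""
    if len(copies) < 2:
        raise ValueError("need at least two gene copies")
    return max(percent_difference(a, b) for a, b in combinations(copies, 2))
```

Under this definition a genome whose rRNA copies are all identical scores 0.0.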

  1. The GenABEL Project for statistical genomics.

    Science.gov (United States)

    Karssen, Lennart C; van Duijn, Cornelia M; Aulchenko, Yurii S

    2016-01-01

    Development of free/libre open source software is usually done by a community of people with an interest in the tool. For scientific software, however, this is less often the case. Most scientific software is written by only a few authors, often a student working on a thesis. Once the paper describing the tool has been published, the tool is no longer developed further and is left to its own devices. Here we describe the broad, multidisciplinary community we formed around a set of tools for statistical genomics. The GenABEL project for statistical omics actively promotes open interdisciplinary development of statistical methodology and its implementation in efficient and user-friendly software under an open source licence. The software tools developed within the project collectively make up the GenABEL suite, which currently consists of eleven tools. The open framework of the project actively encourages involvement of the community in all stages, from formulation of methodological ideas to application of software to specific data sets. A web forum is used to channel user questions and discussions, further promoting the use of the GenABEL suite. Developer discussions take place on a dedicated mailing list, and development is further supported by robust development practices including use of public version control, code review and continuous integration. Use of this open science model attracts contributions from users and developers outside the "core team", facilitating agile statistical omics methodology development and fast dissemination.

  2. The draft genome of watermelon (Citrullus lanatus) and resequencing of 20 diverse accessions.

    Science.gov (United States)

    Guo, Shaogui; Zhang, Jianguo; Sun, Honghe; Salse, Jerome; Lucas, William J; Zhang, Haiying; Zheng, Yi; Mao, Linyong; Ren, Yi; Wang, Zhiwen; Min, Jiumeng; Guo, Xiaosen; Murat, Florent; Ham, Byung-Kook; Zhang, Zhaoliang; Gao, Shan; Huang, Mingyun; Xu, Yimin; Zhong, Silin; Bombarely, Aureliano; Mueller, Lukas A; Zhao, Hong; He, Hongju; Zhang, Yan; Zhang, Zhonghua; Huang, Sanwen; Tan, Tao; Pang, Erli; Lin, Kui; Hu, Qun; Kuang, Hanhui; Ni, Peixiang; Wang, Bo; Liu, Jingan; Kou, Qinghe; Hou, Wenju; Zou, Xiaohua; Jiang, Jiao; Gong, Guoyi; Klee, Kathrin; Schoof, Heiko; Huang, Ying; Hu, Xuesong; Dong, Shanshan; Liang, Dequan; Wang, Juan; Wu, Kui; Xia, Yang; Zhao, Xiang; Zheng, Zequn; Xing, Miao; Liang, Xinming; Huang, Bangqing; Lv, Tian; Wang, Junyi; Yin, Ye; Yi, Hongping; Li, Ruiqiang; Wu, Mingzhu; Levi, Amnon; Zhang, Xingping; Giovannoni, James J; Wang, Jun; Li, Yunfu; Fei, Zhangjun; Xu, Yong

    2013-01-01

    Watermelon, Citrullus lanatus, is an important cucurbit crop grown throughout the world. Here we report a high-quality draft genome sequence of the east Asia watermelon cultivar 97103 (2n = 2× = 22) containing 23,440 predicted protein-coding genes. Comparative genomics analysis provided an evolutionary scenario for the origin of the 11 watermelon chromosomes derived from a 7-chromosome paleohexaploid eudicot ancestor. Resequencing of 20 watermelon accessions representing three different C. lanatus subspecies produced numerous haplotypes and identified the extent of genetic diversity and population structure of watermelon germplasm. Genomic regions that were preferentially selected during domestication were identified. Many disease-resistance genes were also found to be lost during domestication. In addition, integrative genomic and transcriptomic analyses yielded important insights into aspects of phloem-based vascular signaling in common between watermelon and cucumber and identified genes crucial to valuable fruit-quality traits, including sugar accumulation and citrulline metabolism.

  3. Rice Annotation Project Database (RAP-DB): an integrative and interactive database for rice genomics.

    Science.gov (United States)

    Sakai, Hiroaki; Lee, Sung Shin; Tanaka, Tsuyoshi; Numa, Hisataka; Kim, Jungsok; Kawahara, Yoshihiro; Wakimoto, Hironobu; Yang, Ching-chia; Iwamoto, Masao; Abe, Takashi; Yamada, Yuko; Muto, Akira; Inokuchi, Hachiro; Ikemura, Toshimichi; Matsumoto, Takashi; Sasaki, Takuji; Itoh, Takeshi

    2013-02-01

    The Rice Annotation Project Database (RAP-DB, http://rapdb.dna.affrc.go.jp/) has been providing a comprehensive set of gene annotations for the genome sequence of rice, Oryza sativa (japonica group) cv. Nipponbare. Since the first release in 2005, RAP-DB has been updated several times along with the genome assembly updates. Here, we present our newest RAP-DB based on the latest genome assembly, Os-Nipponbare-Reference-IRGSP-1.0 (IRGSP-1.0), which was released in 2011. We detected 37,869 loci by mapping transcript and protein sequences of 150 monocot species. To provide plant researchers with highly reliable and up to date rice gene annotations, we have been incorporating literature-based manually curated data, and 1,626 loci currently incorporate literature-based annotation data, including commonly used gene names or gene symbols. Transcriptional activities are shown at the nucleotide level by mapping RNA-Seq reads derived from 27 samples. We also mapped the Illumina reads of a Japanese leading japonica cultivar, Koshihikari, and a Chinese indica cultivar, Guangluai-4, to the genome and show alignments together with the single nucleotide polymorphisms (SNPs) and gene functional annotations through a newly developed browser, Short-Read Assembly Browser (S-RAB). We have developed two satellite databases, Plant Gene Family Database (PGFD) and Integrative Database of Cereal Gene Phylogeny (IDCGP), which display gene family and homologous gene relationships among diverse plant species. RAP-DB and the satellite databases offer simple and user-friendly web interfaces, enabling plant and genome researchers to access the data easily and facilitating a broad range of plant research topics.

  4. READINGS FROM THE FORMAL DISCOURSE OF PROJECT MANAGERS REGARDING DIVERSITY IN TEAMS

    Directory of Open Access Journals (Sweden)

    Sandra Regina da Rocha-Pinto

    2012-04-01

    Full Text Available Based on the viewpoint of project managers with regards to diversity, this paper used a phenomenographic method. Fifteen project managers were interviewed. The latter focused primarily on the variety of techniques, rather than on varieties of any other kind. This view of diversity extends beyond those angles generally taken in the literature on the theme which in most instances refer to diversity as based on gender, race and disadvantaged ethnic and minority groups. Additionally, the study brings to light the fact that diversities of knowledge and behavior are as beneficial for the development of projects. Furthermore, communication and the role of the project manager were raised as mitigating factors when it came to diversity. And, lastly, the conclusion arrived at was that project managers have similar discourses which correspond to the recommendation of the main project management manuals. These discourses and forms of expression are in most cases ready-made.

  5. Comparative genomics of Mycoplasma: analysis of conserved essential genes and diversity of the pan-genome.

    Directory of Open Access Journals (Sweden)

    Wei Liu

    Full Text Available Mycoplasma, the smallest self-replicating organism with a minimal metabolism and little genomic redundancy, is expected to be a close approximation to the minimal set of genes needed to sustain bacterial life. This study employs comparative evolutionary analysis of twenty Mycoplasma genomes to gain an improved understanding of essential genes. By analyzing the core genome of mycoplasmas, we finally revealed the conserved essential genes set for mycoplasma survival. Further analysis showed that the core genome set has many characteristics in common with experimentally identified essential genes. Several key genes, which are related to DNA replication and repair and can be disrupted in transposon mutagenesis studies, may be critical for bacteria survival especially over long period natural selection. Phylogenomic reconstructions based on 3,355 homologous groups allowed robust estimation of phylogenetic relatedness among mycoplasma strains. To obtain deeper insight into the relative roles of molecular evolution in pathogen adaptation to their hosts, we also analyzed the positive selection pressures on particular sites and lineages. There appears to be an approximate correlation between the divergence of species and the level of positive selection detected in corresponding lineages.

  6. Human Genome Teacher Networking Project, Final Report, April 1, 1992 - March 31, 1998

    Energy Technology Data Exchange (ETDEWEB)

    Collins, Debra

    1999-10-01

    Project to provide education regarding the ethical, legal, and social implications of the Human Genome Project to high school science teachers through two consecutive summer workshops, in-class activities, and peer teaching workshops.

  7. The UK Human Genome Mapping Project online computing service.

    Science.gov (United States)

    Rysavy, F R; Bishop, M J; Gibbs, G P; Williams, G W

    1992-04-01

    This paper presents an overview of computing and networking facilities developed by the Medical Research Council to provide online computing support to the Human Genome Mapping Project (HGMP) in the UK. The facility is connected to a number of other computing facilities in various centres of genetics and molecular biology research excellence, either directly via high-speed links or through national and international wide-area networks. The paper describes the design and implementation of the current system, a 'client/server' network of Sun, IBM, DEC and Apple servers, gateways and workstations. A short outline of online computing services currently delivered by this system to the UK human genetics research community is also provided. More information about the services and their availability could be obtained by a direct approach to the UK HGMP-RC.

  8. Genomic diversity and introgression in O. sativa reveal the impact of domestication and breeding on the rice genome.

    Directory of Open Access Journals (Sweden)

    Keyan Zhao

    Full Text Available BACKGROUND: The domestication of Asian rice (Oryza sativa) was a complex process punctuated by episodes of introgressive hybridization among and between subpopulations. Deep genetic divergence between the two main varietal groups (Indica and Japonica) suggests domestication from at least two distinct wild populations. However, genetic uniformity surrounding key domestication genes across divergent subpopulations suggests cultural exchange of genetic material among ancient farmers. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we utilize a novel 1,536 SNP panel genotyped across 395 diverse accessions of O. sativa to study genome-wide patterns of polymorphism, to characterize population structure, and to infer the introgression history of domesticated Asian rice. Our population structure analyses support the existence of five major subpopulations (indica, aus, tropical japonica, temperate japonica and GroupV) consistent with previous analyses. Our introgression analysis shows that most accessions exhibit some degree of admixture, with many individuals within a population sharing the same introgressed segment due to artificial selection. Admixture mapping and association analysis of amylose content and grain length illustrate the potential for dissecting the genetic basis of complex traits in domesticated plant populations. CONCLUSIONS/SIGNIFICANCE: Genes in these regions control a myriad of traits including plant stature, blast resistance, and amylose content. These analyses highlight the power of population genomics in agricultural systems to identify functionally important regions of the genome and to decipher the role of human-directed breeding in refashioning the genomes of a domesticated species.

  9. Inbreeding and selection shape genomic diversity in captive populations: Implications for the conservation of endangered species.

    Science.gov (United States)

    Willoughby, Janna R; Ivy, Jamie A; Lacy, Robert C; Doyle, Jacqueline M; DeWoody, J Andrew

    2017-01-01

    Captive breeding programs are often initiated to prevent species extinction until reintroduction into the wild can occur. However, the evolution of captive populations via inbreeding, drift, and selection can impair fitness, compromising reintroduction programs. To better understand the evolutionary response of species bred in captivity, we used nearly 5500 single nucleotide polymorphisms (SNPs) in populations of white-footed mice (Peromyscus leucopus) to measure the impact of breeding regimes on genomic diversity. We bred mice in captivity for 20 generations using two replicates of three protocols: random mating (RAN), selection for docile behaviors (DOC), and minimizing mean kinship (MK). The MK protocol most effectively retained genomic diversity and reduced the effects of selection. Additionally, genomic diversity was significantly related to fitness, as assessed with pedigrees and SNPs supported with genomic sequence data. Because captive-born individuals are often less fit in wild settings compared to wild-born individuals, captive-estimated fitness correlations likely underestimate the effects in wild populations. Therefore, minimizing inbreeding and selection in captive populations is critical to increasing the probability of releasing fit individuals into the wild.

  10. Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity

    Science.gov (United States)

    Pope, Welkin H; Bowman, Charles A; Russell, Daniel A; Jacobs-Sera, Deborah; Asai, David J; Cresawn, Steven G; Jacobs, William R; Hendrix, Roger W; Lawrence, Jeffrey G; Hatfull, Graham F; Abbazia, Patrick; Ababio, Amma; Adam, Naazneen

    2015-01-01

    The bacteriophage population is large, dynamic, ancient, and genetically diverse. Limited genomic information shows that phage genomes are mosaic, and the genetic architecture of phage populations remains ill-defined. To understand the population structure of phages infecting a single host strain, we isolated, sequenced, and compared 627 phages of Mycobacterium smegmatis. Their genetic diversity is considerable, and there are 28 distinct genomic types (clusters) with related nucleotide sequences. However, amino acid sequence comparisons show pervasive genomic mosaicism, and quantification of inter-cluster and intra-cluster relatedness reveals a continuum of genetic diversity, albeit with uneven representation of different phages. Furthermore, rarefaction analysis shows that the mycobacteriophage population is not closed, and there is a constant influx of genes from other sources. Phage isolation and analysis was performed by a large consortium of academic institutions, illustrating the substantial benefits of a disseminated, structured program involving large numbers of freshman undergraduates in scientific discovery. DOI: http://dx.doi.org/10.7554/eLife.06416.001 PMID:25919952

  11. HGD-Chn: The Database of Genome Diversity and Variation for Chinese Populations.

    Science.gov (United States)

    Hong-Sheng, Gui; Peng, Zhou; Cheng-Bo, Yang; Sheng-Bin, Li

    2009-04-01

    The Database of Genome Diversity and Variation for Chinese Populations aims at more efficient utilization and sharing of the valuable yet diminishing genetic resources in China (including sample information of healthy populations, healthy pedigrees, disease populations and disease pedigrees; genomic diversity data; disease-related allelic and haplotype data). The database is organized in two parts: (1) Genetic resources of healthy people. A variety of genetic markers (VNTR, STR, SNP, HLA, enzyme markers, etc.) are chosen for their diversity among populations, and their distribution among different ethnic groups in China is stored in the form of allelic frequencies. This also makes possible a further analysis and an overall description of the Chinese population genetic structure. (2) Disease genetic resources. Four categories are mainly covered: chromosomal diseases, monogenic diseases, polygenic diseases, and birth defects. For each kind of disease, a basic introduction and description, sample information, and allelic data of related genes are included. Aside from research-oriented information, introductory courses aimed at the general public, covering genomic diversity and variation and the related experimental techniques, standards and specifications, can also be accessed on the website. Furthermore, a flexible query and submission system with user-friendly interfaces is integrated in the website to simplify user queries and administrators' database maintenance work. Online data analysis and management tools are developed using bioinformatics algorithms and programming languages for a better interpretation of the biological data.

  12. Genetic diversity and genomic strategies for improving drought and waterlogging tolerance in soybeans.

    Science.gov (United States)

    Valliyodan, Babu; Ye, Heng; Song, Li; Murphy, MacKensie; Shannon, J Grover; Nguyen, Henry T

    2016-12-07

    Drought and its interaction with high temperature are the major abiotic stress factors affecting soybean yield and production stability. Ongoing climate changes are anticipated to intensify drought events, which will further impact crop production and food security. However, excessive water also limits soybean production. The success of soybean breeding programmes for crop improvement is dependent on the extent of genetic variation present in the germplasm base. Screening for natural genetic variation in drought- and flooding tolerance-related traits, including root system architecture, water and nitrogen-fixation efficiency, and yield performance indices, has helped to identify the best resources for genetic studies in soybean. Genomic resources, including whole-genome sequences of diverse germplasms, millions of single-nucleotide polymorphisms, and high-throughput marker genotyping platforms, have expedited gene and marker discovery for translational genomics in soybean. This review highlights the current knowledge of the genetic diversity and quantitative trait loci associated with root system architecture, canopy wilting, nitrogen-fixation ability, and flooding tolerance that contributes to the understanding of drought- and flooding-tolerance mechanisms in soybean. Next-generation mapping approaches and high-throughput phenotyping will facilitate a better understanding of phenotype-genotype associations and help to formulate genomic-assisted breeding strategies, including genomic selection, in soybean for tolerance to drought and flooding stress.

  13. Genome sequence diversity and clues to the evolution of variola (smallpox) virus.

    Science.gov (United States)

    Esposito, Joseph J; Sammons, Scott A; Frace, A Michael; Osborne, John D; Olsen-Rasmussen, Melissa; Zhang, Ming; Govil, Dhwani; Damon, Inger K; Kline, Richard; Laker, Miriam; Li, Yu; Smith, Geoffrey L; Meyer, Hermann; Leduc, James W; Wohlhueter, Robert M

    2006-08-11

    Comparative genomics of 45 epidemiologically varied variola virus isolates from the past 30 years of the smallpox era indicate low sequence diversity, suggesting that there is probably little difference in the isolates' functional gene content. Phylogenetic clustering inferred three clades coincident with their geographical origin and case-fatality rate; the latter implicated putative proteins that mediate viral virulence differences. Analysis of the viral linear DNA genome suggests that its evolution involved direct descent and DNA end-region recombination events. Knowing the sequences will help understand the viral proteome and improve diagnostic test precision, therapeutics, and systems for their assessment.

  14. Scanning the landscape of genome architecture of non-O1 and non-O139 Vibrio cholerae by whole genome mapping reveals extensive population genetic diversity.

    Directory of Open Access Journals (Sweden)

    Carol Chapman

    Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, ordered restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.

  15. Scanning the landscape of genome architecture of non-O1 and non-O139 Vibrio cholerae by whole genome mapping reveals extensive population genetic diversity.

    Science.gov (United States)

    Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A; Awosika, Joy; Briska, Adam; Ptashkin, Ryan N; Wagner, Trevor; Rajanna, Chythanya; Tsang, Hsinyi; Johnson, Shannon L; Mokashi, Vishwesh P; Chain, Patrick S G; Sozhamannan, Shanmuga

    2015-01-01

    Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, ordered restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.

  16. Ethical considerations of research policy for personal genome analysis: the approach of the Genome Science Project in Japan.

    Science.gov (United States)

    Minari, Jusaku; Shirai, Tetsuya; Kato, Kazuto

    2014-12-01

    As evidenced by high-throughput sequencers, genomic technologies have recently undergone radical advances. These technologies enable comprehensive sequencing of personal genomes considerably more efficiently and less expensively than before. These developments present a challenge to the conventional framework of biomedical ethics; under these changing circumstances, each research project has to develop a pragmatic research policy. Based on the experience with a new large-scale project, the Genome Science Project, this article presents a novel approach to developing a specific policy for personal genome research in the Japanese context. In creating an original informed-consent form template for the project, we present a two-tiered process: first, drafting the template following an analysis of national and international policies; second, refining the draft template in conjunction with genome project researchers for practical application. Through practical use of the template, we have gained valuable experience in addressing challenges in the ethical review process, such as the importance of sharing details of the latest developments in genomics with members of research ethics committees. We discuss certain limitations of the conventional concept of informed consent and its governance system and suggest the potential of an alternative process using information technology.

  17. Genome-Based Studies of Marine Microorganisms to Maximize the Diversity of Natural Products Discovery for Medical Treatments

    Directory of Open Access Journals (Sweden)

    Xin-Qing Zhao

    2011-01-01

    Marine microorganisms are a rich source of natural products, which play important roles in the pharmaceutical industry. Over the past decade, genome-based studies of marine microorganisms have unveiled the tremendous diversity of the producers of natural products and have also made it more efficient to harness the strain diversity, chemical diversity, and genetic diversity of marine microorganisms for the rapid discovery and generation of new natural products. In the meantime, genomic information retrieved from marine symbiotic microorganisms can also be employed for the discovery of new medical molecules from yet-unculturable microorganisms. In this paper, the recent progress in the genomic research of marine microorganisms is reviewed; new tools of genome mining as well as advances in the activation of orphan pathways and metagenomic studies are summarized. Genome-based research of marine microorganisms will maximize the biodiscovery process and solve the problems of supply and sustainability of drug molecules for medical treatments.

  18. The lawful uses of knowledge from the Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Grad, F.P.

    1994-04-15

    Part I of this study deals with the right to know or not to know personal genetic information, and examines available legal protections of the right of privacy and the adverse effect of the disclosure of genetic information both on employment and insurance interests and on self esteem and protection of personal integrity. The study examines the rationale for the legal protection of privacy as the protection of a public interest. It examines the very limited protections currently available for privacy interests, including genetic privacy interests, and concludes that there is a need for broader, more far-reaching legal protections. The second part of the study is based on the assumption that as major a project as the Human Genome Project, spending billions of dollars on science which is health related, will indeed be applied for preventive and therapeutic public health purposes, as it has been in the past. It also addresses the recurring fear that public health initiatives in the genetic area must evolve a new eugenic agenda, that we must not repeat the miserable discriminatory experiences of the past.

  19. Estimating variation within the genes and inferring the phylogeny of 186 sequenced diverse Escherichia coli genomes

    Directory of Open Access Journals (Sweden)

    Kaas Rolf S

    2012-10-01

    Background: Escherichia coli exists in commensal and pathogenic forms. By measuring the variation of individual genes across more than a hundred sequenced genomes, gene variation can be studied in detail, including the number of mutations found for any given gene. This knowledge will be useful for creating better phylogenies, for determination of molecular clocks and for improved typing techniques. Results: We find 3,051 gene clusters/families present in at least 95% of the genomes and 1,702 gene clusters present in 100% of the genomes. The former 'soft core' of about 3,000 gene families is perhaps more biologically relevant, especially considering that many of these genome sequences are draft quality. The E. coli pan-genome for this set of isolates contains 16,373 gene clusters. A core-gene tree, based on alignment, and a pan-genome tree, based on gene presence/absence, map the relatedness of the 186 sequenced E. coli genomes. The core-gene tree displays high confidence and divides the E. coli strains into the observed MLST type clades and also separates defined phylotypes. Conclusion: The results of comparing a large and diverse E. coli dataset support the theory that reliable and good resolution phylogenies can be inferred from the core-genome. The results further suggest that the resolution at the isolate level may subsequently be improved by targeting more variable genes. The use of whole genome sequencing will make it possible to eliminate, or at least reduce, the need for several typing steps used in traditional epidemiology.

  20. Extensive Genomic Diversity among Bovine-Adapted Staphylococcus aureus: Evidence for a Genomic Rearrangement within CC97.

    Science.gov (United States)

    Budd, Kathleen E; McCoy, Finola; Monecke, Stefan; Cormican, Paul; Mitchell, Jennifer; Keane, Orla M

    2015-01-01

    Staphylococcus aureus is an important pathogen associated with both human and veterinary disease and is a common cause of bovine mastitis. Genomic heterogeneity exists between S. aureus strains and has been implicated in the adaptation of specific strains to colonise particular mammalian hosts. Knowledge of the factors required for host specificity and virulence is important for understanding the pathogenesis and management of S. aureus mastitis. In this study, a panel of mastitis-associated S. aureus isolates (n = 126) was tested for resistance to antibiotics commonly used to treat mastitis. Over half of the isolates (52%) demonstrated resistance to penicillin and ampicillin but all were susceptible to the other antibiotics tested. S. aureus isolates were further examined for their clonal diversity by Multi-Locus Sequence Typing (MLST). In total, 18 different sequence types (STs) were identified and eBURST analysis demonstrated that the majority of isolates grouped into clonal complexes CC97, CC151 or sequence type (ST) 136. Analysis of the role of recombination events in determining S. aureus population structure determined that ST diversification through nucleotide substitutions was more likely to be due to recombination than to point mutation, with regions of the genome possibly acting as recombination hotspots. DNA microarray analysis revealed a large number of differences amongst S. aureus STs in their variable genome content, including genes associated with capsule and biofilm formation and adhesion factors. Finally, evidence for a genomic rearrangement was observed within isolates from CC97 with the ST71-like subgroup showing evidence of an IS431 insertion element having replaced approximately 30 kb of DNA including the ica operon and histidine biosynthesis genes, resulting in histidine auxotrophy. This genomic rearrangement may be responsible for the diversification of ST71 into an emerging bovine adapted subgroup.

  1. Extensive Genomic Diversity among Bovine-Adapted Staphylococcus aureus: Evidence for a Genomic Rearrangement within CC97.

    Directory of Open Access Journals (Sweden)

    Kathleen E Budd

    Staphylococcus aureus is an important pathogen associated with both human and veterinary disease and is a common cause of bovine mastitis. Genomic heterogeneity exists between S. aureus strains and has been implicated in the adaptation of specific strains to colonise particular mammalian hosts. Knowledge of the factors required for host specificity and virulence is important for understanding the pathogenesis and management of S. aureus mastitis. In this study, a panel of mastitis-associated S. aureus isolates (n = 126) was tested for resistance to antibiotics commonly used to treat mastitis. Over half of the isolates (52%) demonstrated resistance to penicillin and ampicillin but all were susceptible to the other antibiotics tested. S. aureus isolates were further examined for their clonal diversity by Multi-Locus Sequence Typing (MLST). In total, 18 different sequence types (STs) were identified and eBURST analysis demonstrated that the majority of isolates grouped into clonal complexes CC97, CC151 or sequence type (ST) 136. Analysis of the role of recombination events in determining S. aureus population structure determined that ST diversification through nucleotide substitutions was more likely to be due to recombination than to point mutation, with regions of the genome possibly acting as recombination hotspots. DNA microarray analysis revealed a large number of differences amongst S. aureus STs in their variable genome content, including genes associated with capsule and biofilm formation and adhesion factors. Finally, evidence for a genomic rearrangement was observed within isolates from CC97 with the ST71-like subgroup showing evidence of an IS431 insertion element having replaced approximately 30 kb of DNA including the ica operon and histidine biosynthesis genes, resulting in histidine auxotrophy. This genomic rearrangement may be responsible for the diversification of ST71 into an emerging bovine adapted subgroup.

  2. DivStat: a user-friendly tool for single nucleotide polymorphism analysis of genomic diversity.

    Directory of Open Access Journals (Sweden)

    Inês Soares

    Recent developments have led to an enormous increase of publicly available large genomic data, including complete genomes. The 1000 Genomes Project was a major contributor, releasing the results of sequencing a large number of individual genomes, and allowing for a myriad of large scale studies on human genetic variation. However, the tools currently available are insufficient when the goal concerns some analyses of data sets encompassing more than hundreds of base pairs and when considering haplotype sequences of single nucleotide polymorphisms (SNPs). Here, we present a new and potent tool to deal with large data sets allowing the computation of a variety of summary statistics of population genetic data, increasing the speed of data analysis.

  3. Diversity, genetic mapping, and signatures of domestication in the carrot (Daucus carota L.) genome, as revealed by Diversity Arrays Technology (DArT) markers

    Science.gov (United States)

    Carrot is one of the most economically important vegetables worldwide, however, genetic and genomic resources supporting carrot breeding remain limited. We developed a Diversity Arrays Technology (DArT) platform for wild and cultivated carrot and used it to investigate genetic diversity and to devel...

  4. The impact of genomics on research in diversity and evolution of archaea.

    Science.gov (United States)

    Mardanov, A V; Ravin, N V

    2012-08-01

    Since the definition of archaea as a separate domain of life along with bacteria and eukaryotes, they have become one of the most interesting objects of modern microbiology, molecular biology, and biochemistry. Sequencing and analysis of archaeal genomes were especially important for studies on archaea because of a limited availability of genetic tools for the majority of these microorganisms and problems associated with their cultivation. Fifteen years since the publication of the first genome of an archaeon, more than one hundred complete genome sequences of representatives of different phylogenetic groups have been determined. Analysis of these genomes has expanded our knowledge of biology of archaea, their diversity and evolution, and allowed identification and characterization of new deep phylogenetic lineages of archaea. The development of genome technologies has allowed sequencing the genomes of uncultivated archaea directly from enrichment cultures, metagenomic samples, and even from single cells. Insights have been gained into the evolution of key biochemical processes in archaea, such as cell division and DNA replication, the role of horizontal gene transfer in the evolution of archaea, and new relationships between archaea and eukaryotes have been revealed.

  5. Genome Microscale Heterogeneity among Wild Potatoes Revealed by Diversity Arrays Technology Marker Sequences

    Directory of Open Access Journals (Sweden)

    Alessandra Traini

    2013-01-01

    Tuber-bearing potato species possess several genes that can be exploited to improve the genetic background of the cultivated potato Solanum tuberosum. Among them, S. bulbocastanum and S. commersonii are well known for their strong resistance to environmental stresses. However, scant information is available for these species in terms of genome organization, gene function, and regulatory networks. Consequently, genomic tools to assist breeding are meager, and efficient exploitation of these species has been limited so far. In this paper, we employed the reference genome sequences from cultivated potato and tomato and a collection of sequences of 1,423 potato Diversity Arrays Technology (DArT) markers that show polymorphic representation across the genomes of S. bulbocastanum and/or S. commersonii genotypes. Our results highlighted microscale genome sequence heterogeneity that may play a significant role in functional and structural divergence between related species. Our analytical approach provides knowledge of genome structural and sequence variability that could not be detected by transcriptome and proteome approaches.

  6. Twenty-One Genome Sequences from Pseudomonas Species and 19 Genome Sequences from Diverse Bacteria Isolated from the Rhizosphere and Endosphere of Populus deltoides

    Energy Technology Data Exchange (ETDEWEB)

    Brown, Steven D [ORNL; Utturkar, Sagar M [ORNL; Klingeman, Dawn Marie [ORNL; Johnson, Courtney M [ORNL; Martin, Stanton [ORNL; Land, Miriam L [ORNL; Lu, Tse-Yuan [ORNL; Schadt, Christopher Warren [ORNL; Doktycz, Mitchel John [ORNL; Pelletier, Dale A [ORNL

    2012-01-01

    To aid in the investigation of the Populus deltoides microbiome, we generated draft genome sequences for twenty-one Pseudomonas and nineteen other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Burkholderia, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium and Variovorax were generated.

  7. Whole genome SNP discovery and analysis of genetic diversity in Turkey (Meleagris gallopavo)

    Directory of Open Access Journals (Sweden)

    Aslam Muhammad L

    2012-08-01

    Background: The turkey (Meleagris gallopavo) is an important agricultural species and the second largest contributor to the world’s poultry meat production. Genetic improvement is attributed largely to selective breeding programs that rely on highly heritable phenotypic traits, such as body size and breast muscle development. Commercial breeding with small effective population sizes and epistasis can result in loss of genetic diversity, which in turn can lead to reduced individual fitness and reduced response to selection. The presence of genomic diversity in domestic livestock species is therefore of great importance and a prerequisite for rapid and accurate genetic improvement of selected breeds in various environments, as well as to facilitate rapid adaptation to potential changes in breeding goals. Genomic selection requires a large number of genetic markers such as single nucleotide polymorphisms (SNPs), the most abundant source of genetic variation within the genome. Results: Alignment of next generation sequencing data of 32 individual turkeys from different populations was used for the discovery of 5.49 million SNPs, which subsequently were used for the analysis of genetic diversity among the different populations. All of the commercial lines branched from a single node relative to the heritage varieties and the South Mexican turkey population. Heterozygosity of all individuals from the different turkey populations ranged from 0.17-2.73 SNPs/Kb, while heterozygosity of populations ranged from 0.73-1.64 SNPs/Kb. The average frequency of heterozygous SNPs in individual turkeys was 1.07 SNPs/Kb. Five genomic regions with very low nucleotide variation were identified in domestic turkeys that showed a state of fixation towards alleles different from wild alleles. Conclusion: The turkey genome is much less diverse with a relatively low frequency of heterozygous SNPs as compared to other livestock species like chicken and pig. The

  8. Personalized evolutionary hypothesis in genomics and auxiliary lymph node through diverse subtelomeric signal profile.

    Science.gov (United States)

    Mehdipour, Parvin; Javan, Firoozeh; Savad, Shahram; Karbassian, Hamid; Atri, Morteza

    2015-01-24

    Few available data on the genomic-somatic evolution in breast cancer create limitation to provide the appropriate clinical managements. As an example, human subtelomeres (ST) are diverse-prone and variable targets. STs, as hot spots, have positive and negative impacts on the status of health and malady. We showed higher subtelomere signal copy number (SCN) of specific chromosomes in genomics than in auxiliary lymph node (ALN). Dissimilarity of signal intensity (SI) is found for all chromosomes. Significantly higher SI in genomics than in ALN cells were specified as chromosomes 5, 6, 9-12, 16-19 for weak; 1, 5-9, 19, X for medium; and 2, 5, 9, 10, 16, 18 for strong SI. For lacking, and presence of one and two SCNs; p/q ratio reflected differences for all chromosomes; but, 2, 3, 5, 7, 8, 10, 16, 18, 20, and X chromosomes were involved for three SCN. Chromosomes 1, 4, 9, 12, 17-19 lacked three SCN in ALN and lymphocytes. Weak SI ratio was higher in p- than in q-arm in majority of chromosomes. Manner of evolution and diversity in p- and q-arms is expressive of a novel definition as two diverse domains with a personalized insight. These data have been accompanied by periodic charts as ST array profiles which provide specific and individualized pattern in breast neoplasm. Such profiling at genomics level could be considered as a prediction through the patients' life. Moreover, subtelomere territory by interacting with protein expression of Ki67, cyclin D1, and cyclin E; and molecular targets including telomere length at genomics and somatic level provides package of information to bridge cancer cell biology to the cancer clinic as "puzzling paradigm." © 2015 International Federation for Cell Biology.

  9. Detection of citrus mealybug (Planococcus citri) in the cultivation of various pot plants: research within project 41203147 "Improving biological control of mealybug in various pot plants"

    NARCIS (Netherlands)

    Boertjes, B.C.; Bruin, de J.

    2003-01-01

    In 2002 and 2003, PPO Glastuinbouw carried out the project "Improving biological control of mealybug in various pot plants" (project 41203147). Within this project, research was done on, among other things, methods for detecting citrus mealybug (Planococcus citri).

  10. Acidobacteria form a coherent but highly diverse group within the bacterial domain: evidence from environmental genomics

    DEFF Research Database (Denmark)

    Quaiser, Achim; Ochsenreiter, Torsten; Lanz, Christa

    2003-01-01

    Acidobacteria have been established as a novel phylum of Bacteria that is consistently detected in many different habitats around the globe by 16S rDNA-based molecular surveys. The phylogenetic diversity, ubiquity and abundance of this group, particularly in soil habitats, suggest an important...... insert libraries directly from DNA of a calcareous grassland soil. Genomic fragments of Acidobacteria were identified with specific 16S rDNA probes and sequence analyses of six independently identified clones were performed, representing in total more than 210,000 bp. The 16S rRNA genes of the genomic...... fragments differed between 2.3% and 19.9% and were placed into two different subgroups of Acidobacteria (groups III and V). Although partial co-linearity was found between genomic fragments, the gene content around the rRNA operons was generally not conserved. Phylogenetic reconstructions with orthologues...

  11. Ma-LMM01 infecting toxic Microcystis aeruginosa illuminates diverse cyanophage genome strategies.

    Science.gov (United States)

    Yoshida, Takashi; Nagasaki, Keizo; Takashima, Yukari; Shirai, Yoko; Tomaru, Yuji; Takao, Yoshitake; Sakamoto, Shigetaka; Hiroishi, Shingo; Ogata, Hiroyuki

    2008-03-01

    Cyanobacteria and their phages are significant microbial components of the freshwater and marine environments. We identified a lytic phage, Ma-LMM01, infecting Microcystis aeruginosa, a cyanobacterium that forms toxic blooms on the surfaces of freshwater lakes. Here, we describe the first sequenced freshwater cyanomyovirus genome of Ma-LMM01. The linear, circularly permuted, and terminally redundant genome has 162,109 bp and contains 184 predicted protein-coding genes and two tRNA genes. The genome exhibits no colinearity with previously sequenced genomes of cyanomyoviruses or other Myoviridae. The majority of the predicted genes have no detectable homologues in the databases. These findings indicate that Ma-LMM01 is a member of a new lineage of the Myoviridae family. The genome lacks homologues for the photosynthetic genes that are prevalent in marine cyanophages. However, it has a homologue of nblA, which is essential for the degradation of the major cyanobacteria light-harvesting complex, the phycobilisomes. The genome codes for a site-specific recombinase and two prophage antirepressors, suggesting that it has the capacity to integrate into the host genome. Ma-LMM01 possesses six genes, including three coding for transposases, that are highly similar to homologues found in cyanobacteria, suggesting that recent gene transfers have occurred between Ma-LMM01 and its host. We propose that the Ma-LMM01 NblA homologue possibly reduces the absorption of excess light energy and confers benefits to the phage living in surface waters. This phage genome study suggests that light is central in the phage-cyanobacterium relationships where the viruses use diverse genetic strategies to control their host's photosynthesis.

  12. Identification of genome-wide copy number variations among diverse pig breeds using SNP genotyping arrays.

    Directory of Open Access Journals (Sweden)

    Jiying Wang

    Copy number variations (CNVs) are important forms of genetic variation complementary to SNPs, and can be considered as promising markers for some phenotypic and economically important traits or disease susceptibility in domestic animals. In the present study, we performed a genome-wide CNV identification in 14 individuals selected from diverse populations, including six types of Chinese indigenous breeds, one Asian wild boar population, as well as three modern commercial foreign breeds. We identified 63 CNVRs in total, which covered 9.98 Mb of polymorphic sequence and corresponded to 0.36% of the genome sequence. The length of these CNVRs ranged from 3.20 to 827.21 kb, with an average of 158.37 kb and a median of 97.85 kb. Functional annotation revealed that these identified CNVRs have important molecular functions, and may play an important role in exploring the genetic basis of phenotypic variability and disease susceptibility among pigs. Additionally, to confirm these potential CNVRs, we performed qPCR for 12 randomly selected CNVRs and 8 of them (66.67%) were confirmed successfully. The CNVs detected in diverse populations herein are an essential complement to the CNV map of the pig genome, and provide an important resource for studies of genomic variation and the association between various economically important traits and CNVs.

  13. Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes Fungi

    Energy Technology Data Exchange (ETDEWEB)

    Ohm, Robin A.; Feau, Nicolas; Henrissat, Bernard; Schoch, Conrad L.; Horwitz, Benjamin A.; Barry, Kerrie W.; Condon, Bradford J.; Copeland, Alex C.; Dhillon, Braham; Glaser, Fabian; Hesse, Cedar N.; Kosti, Idit; LaButti, Kurt; Lindquist, Erika A.; Lucas, Susan; Salamov, Asaf A.; Bradshaw, Rosie E.; Ciuffetti, Lynda; Hamelin, Richard C.; Kema, Gert H. J.; Lawrence, Christopher; Scott, James A.; Spatafora, Joseph W.; Turgeon, B. Gillian; Wit, Pierre J. G. M. de; Zhong, Shaobin; Goodwin, Stephen B.; Grigoriev, Igor V.

    2012-02-29

    The class Dothideomycetes is one of the largest groups of fungi with a high level of ecological diversity including many plant pathogens infecting a broad range of hosts. Here, we compare genome features of 18 members of this class, including 6 necrotrophs, 9 (hemi)biotrophs and 3 saprotrophs, to analyze genome structure, evolution, and the diverse strategies of pathogenesis. The Dothideomycetes most likely evolved from a common ancestor more than 280 million years ago. The 18 genome sequences differ dramatically in size due to variation in repetitive content, but show much less variation in number of (core) genes. Gene order appears to have been rearranged mostly within chromosomal boundaries by multiple inversions, in extant genomes frequently demarcated by adjacent simple repeats. Several Dothideomycetes contain one or more gene-poor, transposable element (TE)-rich putatively dispensable chromosomes of unknown function. The 18 Dothideomycetes offer an extensive catalogue of genes involved in cellulose degradation, proteolysis, secondary metabolism, and cysteine-rich small secreted proteins. Ancestors of the two major orders of plant pathogens in the Dothideomycetes, the Capnodiales and Pleosporales, may have had different modes of pathogenesis, with the former having fewer of these genes than the latter. Many of these genes are enriched in proximity to transposable elements, suggesting faster evolution because of the effects of repeat induced point (RIP) mutations. A syntenic block of genes, including oxidoreductases, is conserved in most Dothideomycetes and upregulated during infection in L. maculans, suggesting a possible function in response to oxidative stress.

  14. Diverse lifestyles and strategies of plant pathogenesis encoded in the genomes of eighteen Dothideomycetes fungi.

    Directory of Open Access Journals (Sweden)

    Robin A Ohm

    Full Text Available The class Dothideomycetes is one of the largest groups of fungi with a high level of ecological diversity including many plant pathogens infecting a broad range of hosts. Here, we compare genome features of 18 members of this class, including 6 necrotrophs, 9 (hemi)biotrophs and 3 saprotrophs, to analyze genome structure, evolution, and the diverse strategies of pathogenesis. The Dothideomycetes most likely evolved from a common ancestor more than 280 million years ago. The 18 genome sequences differ dramatically in size due to variation in repetitive content, but show much less variation in number of (core) genes. Gene order appears to have been rearranged mostly within chromosomal boundaries by multiple inversions, in extant genomes frequently demarcated by adjacent simple repeats. Several Dothideomycetes contain one or more gene-poor, transposable element (TE)-rich putatively dispensable chromosomes of unknown function. The 18 Dothideomycetes offer an extensive catalogue of genes involved in cellulose degradation, proteolysis, secondary metabolism, and cysteine-rich small secreted proteins. Ancestors of the two major orders of plant pathogens in the Dothideomycetes, the Capnodiales and Pleosporales, may have had different modes of pathogenesis, with the former having fewer of these genes than the latter. Many of these genes are enriched in proximity to transposable elements, suggesting faster evolution because of the effects of repeat induced point (RIP) mutations. A syntenic block of genes, including oxidoreductases, is conserved in most Dothideomycetes and upregulated during infection in L. maculans, suggesting a possible function in response to oxidative stress.

  15. Comparative Analysis of 35 Basidiomycete Genomes Reveals Diversity and Uniqueness of the Phylum

    Energy Technology Data Exchange (ETDEWEB)

    Riley, Robert; Salamov, Asaf; Otillar, Robert; Fagnan, Kirsten; Boussau, Bastien; Brown, Daren; Henrissat, Bernard; Levasseur, Anthony; Held, Benjamin; Nagy, Laszlo; Floudas, Dimitris; Morin, Emmanuelle; Manning, Gerard; Baker, Scott; Martin, Francis; Blanchette, Robert; Hibbett, David; Grigoriev, Igor V.

    2013-03-11

    Fungi of the phylum Basidiomycota (basidiomycetes) make up some 37 percent of the described fungi, and are important in forestry, agriculture, medicine, and bioenergy. This diverse phylum includes symbionts, pathogens, and saprobes including wood decaying fungi. To better understand the diversity of this phylum we compared the genomes of 35 basidiomycete fungi including 6 newly sequenced genomes. The genomes of basidiomycetes span extremes of genome size, gene number, and repeat content. A phylogenetic tree of Basidiomycota was generated using the Phyldog software, which uses all available protein sequence data to simultaneously infer gene and species trees. Analysis of core genes reveals that some 48 percent of basidiomycete proteins are unique to the phylum with nearly half of those (22 percent) comprising proteins found in only one organism. Phylogenetic patterns of plant biomass-degrading genes suggest a continuum rather than a sharp dichotomy between the white rot and brown rot modes of wood decay among the members of Agaricomycotina subphylum. There is a correlation of the profile of certain gene families to nutritional mode in Agaricomycotina. Based on phylogenetically-informed PCA analysis of such profiles, we predict that Botryobasidium botryosum and Jaapia argillacea have properties similar to white rot species, although neither has ligninolytic class II fungal peroxidases. Furthermore, we find that both fungi exhibit wood decay with white rot-like characteristics in growth assays. Analysis of the rate of discovery of proteins with no or few homologs suggests the high value of continued sequencing of basidiomycete fungi.

  16. Neutral Theory Predicts the Relative Abundance and Diversity of Genetic Elements in a Broad Array of Eukaryotic Genomes

    Science.gov (United States)

    Serra, François; Becher, Verónica; Dopazo, Hernán

    2013-01-01

    It is universally true in ecological communities, terrestrial or aquatic, temperate or tropical, that some species are very abundant, others are moderately common, and the majority are rare. Likewise, eukaryotic genomes also contain classes or “species” of genetic elements that vary greatly in abundance: DNA transposons, retrotransposons, satellite sequences, simple repeats and their less abundant functional sequences such as RNA or genes. Are the patterns of relative species abundance and diversity similar among ecological communities and genomes? Previous dynamical models of genomic diversity have focused on the selective forces shaping the abundance and diversity of transposable elements (TEs). However, ideally, models of genome dynamics should consider not only TEs, but also the diversity of all genetic classes or “species” populating eukaryotic genomes. Here, in an analysis of the diversity and abundance of genetic elements in >500 eukaryotic chromosomes, we show that the patterns are consistent with a neutral hypothesis of genome assembly in virtually all chromosomes tested. The distributions of relative abundance of genetic elements are quite precisely predicted by the dynamics of an ecological model for which the principle of functional equivalence is the main assumption. We hypothesize that at large temporal scales an overarching neutral or nearly neutral process governs the evolution of abundance and diversity of genetic elements in eukaryotic genomes. PMID:23798991

  17. Understanding the Human Genome Project -- A Fact Sheet

    Science.gov (United States)

    ... that contribute to human disease. In 1953, James Watson and Francis Crick described the double helix structure ... of sequencing whole exomes or genomes, groundbreaking comparative genomic studies are now identifying the causes of rare ...

  18. Secondary uses and the governance of de-identified data: Lessons from the human genome diversity panel

    Directory of Open Access Journals (Sweden)

    Lee Sandra S-J

    2011-09-01

    Full Text Available Abstract. Background: Recent changes to regulatory guidance in the US and Europe have complicated oversight of secondary research by rendering most uses of de-identified data exempt from human subjects oversight. To identify the implications of such guidelines for harms to participants and communities, this paper explores the secondary uses of one de-identified DNA sample collection with limited oversight: the Human Genome Diversity Project (HGDP)-Centre d'Etude du Polymorphisme Humain, Fondation Jean Dausset (CEPH) Human Genome Diversity Panel. Methods: Using a combination of keyword and cited reference search, we identified English-language scientific articles published between 2002 and 2009 that reported analysis of HGDP Diversity Panel samples and/or data. We then reviewed each article to identify the specific research use to which the samples and/or data was applied. Secondary uses were categorized according to the type and kind of research supported by the collection. Results: A wide variety of secondary uses were identified from 148 peer-reviewed articles. While the vast majority of these uses were consistent with the original intent of the collection, a minority of published reports described research whose primary findings could be regarded as controversial, objectionable, or potentially stigmatizing in their interpretation. Conclusions: We conclude that potential risks to participants and communities cannot be wholly eliminated by anonymization of individual data and suggest that explicit review of proposed secondary uses, by a Data Access Committee or similar internal oversight body with suitable stakeholder representation, should be a required component of the trustworthy governance of any repository of data or specimens.

  19. Entangled fates of holobiont genomes during invasion: nested bacterial and host diversities in Caulerpa taxifolia

    KAUST Repository

    Arnaud-Haond, S.

    2017-01-30

    Successful prevention and mitigation of biological invasions requires retracing the initial steps of introduction, as well as understanding key elements enhancing the adaptability of invasive species. We studied the genetic diversity of the green alga Caulerpa taxifolia and its associated bacterial communities in several areas around the world. The striking congruence of α and β diversity of the algal genome and endophytic communities reveals a tight association, supporting the holobiont concept as best describing the unit of spreading and invasion. Both genomic compartments support the hypotheses of a unique accidental introduction in the Mediterranean and of multiple invasion events in Southern Australia. In addition to helping with tracing the origin of invasion, bacterial communities exhibit metabolic functions that can potentially enhance adaptability and competitiveness of the consortium they form with their host. We thus hypothesize that low genetic diversities of both host and symbiont communities may contribute to the recent regression in the Mediterranean, in contrast with the persistence of highly diverse assemblages in southern Australia. This study supports the importance of scaling up from the host to the holobiont for a comprehensive understanding of invasions.

  20. The evolution of the Anopheles 16 genomes project

    NARCIS (Netherlands)

    Neafsey, Daniel E.; Christophides, George K.; Collins, Frank H.; Emrich, Scott J.; Fontaine, Michael C.; Gelbart, William; Hahn, Matthew W.; Howell, Paul I.; Kafatos, Fotis C.; Lawson, Daniel; Muskavitch, Marc A. T.; Waterhouse, Robert M.; Williams, Louise J.; Besansky, Nora J.

    2013-01-01

    We report the imminent completion of a set of reference genome assemblies for 16 species of Anopheles mosquitoes. In addition to providing a generally useful resource for comparative genomic analyses, these genome sequences will greatly facilitate exploration of the capacity exhibited by some Anophe

  1. Genome projects 5W1H: what, where, when, why, how and in which population?

    Directory of Open Access Journals (Sweden)

    Pelin Fidanoğlu

    2014-05-01

    Full Text Available Genome projects aim to decode an organism's complete set of deoxyribonucleic acid (DNA), which can be described as the living code of the organism. The idea of the Human Genome Project (HGP) was conceived in the early 1980s. The project started in 1990 and finished in 2003. The sequencing of the whole human genome, derived from the DNA of several anonymous volunteers, cost 3.8 billion dollars. In order to annotate the genome data, the 'topography of the genome' and the anatomy of the genes had to be revealed. For this purpose, genome projects of several model organisms were carried out in parallel with the HGP with the aim of identifying the basic structural components, organizational structure and evolutionary development of the genome. With the advent of microarray technology in the early 2000s, high-throughput screening of Single Nucleotide Polymorphisms (SNPs) and Copy Number Variations (CNVs) became feasible. After the completion of the HGP in 13 years, James D. Watson's genome was sequenced with a 1 million dollar budget in just 2 months using next generation sequencing technology. Today a human genome can be sequenced in just one day at a cost of 6,600 USD. In this review, the HGP, which created big expectations especially in medicine, will be explained from its start to the present. Then we will summarize the studies paving the road to personalized medicine, emphasizing the fact that to reveal the meaning of genomic information, it should become computable.

  2. SNiPlay: a web-based tool for detection, management and analysis of SNPs. Application to grapevine diversity projects.

    Science.gov (United States)

    Dereeper, Alexis; Nicolas, Stéphane; Le Cunff, Loïc; Bacilieri, Roberto; Doligez, Agnès; Peros, Jean-Pierre; Ruiz, Manuel; This, Patrice

    2011-05-05

    High-throughput re-sequencing, new genotyping technologies and the availability of reference genomes allow the extensive characterization of Single Nucleotide Polymorphisms (SNPs) and insertion/deletion events (indels) in many plant species. The rapidly increasing amount of re-sequencing and genotyping data generated by large-scale genetic diversity projects requires the development of integrated bioinformatics tools able to efficiently manage, analyze, and combine these genetic data with genome structure and external data. In this context, we developed SNiPlay, a flexible, user-friendly and integrative web-based tool dedicated to polymorphism discovery and analysis. It integrates: 1) a pipeline, freely accessible through the internet, combining existing software with new tools to detect SNPs and to compute different types of statistical indices and graphical layouts for SNP data. From standard sequence alignments, genotyping data or Sanger sequencing traces given as input, SNiPlay detects SNP and indel events and outputs submission files for the design of Illumina's SNP chips. Subsequently, it sends sequences and genotyping data into a series of modules in charge of various processes: physical mapping to a reference genome, annotation (genomic position, intron/exon location, synonymous/non-synonymous substitutions), SNP frequency determination in user-defined groups, haplotype reconstruction and network, linkage disequilibrium evaluation, and diversity analysis (Pi, Watterson's Theta, Tajima's D). Furthermore, the pipeline allows the use of external data (such as phenotype, geographic origin, taxa, stratification) to define groups and compare statistical indices. 2) a database storing polymorphisms, genotyping data and grapevine sequences released by public and private projects. It allows the user to retrieve SNPs using various filters (such as genomic position, missing data, polymorphism type, allele frequency), to compare SNP patterns between populations, and to
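    The diversity indices named in this record (Pi, Watterson's Theta, Tajima's D) can be illustrated with a minimal Python sketch. It is not SNiPlay's code: the function names and the toy alignment are hypothetical, and it assumes gap-free aligned sequences of equal length and per-locus (not per-site) estimates.

```python
from itertools import combinations
from math import sqrt


def segregating_sites(seqs):
    """Count alignment columns that carry more than one allele (S)."""
    return sum(1 for column in zip(*seqs) if len(set(column)) > 1)


def nucleotide_diversity(seqs):
    """Pi: mean number of pairwise differences between sequences."""
    pairs = list(combinations(seqs, 2))
    total = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return total / len(pairs)


def watterson_theta(seqs):
    """Watterson's estimator: S / a1, with a1 = sum_{i=1}^{n-1} 1/i."""
    n = len(seqs)
    a1 = sum(1.0 / i for i in range(1, n))
    return segregating_sites(seqs) / a1


def tajimas_d(seqs):
    """Tajima's D: (pi - S/a1) scaled by its estimated standard deviation."""
    n = len(seqs)
    s = segregating_sites(seqs)
    if s == 0:
        return float("nan")  # D is undefined without segregating sites
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    return (nucleotide_diversity(seqs) - s / a1) / sqrt(e1 * s + e2 * s * (s - 1))


# Hypothetical toy alignment: four sequences, three singleton variants.
toy_alignment = ["ACGTACGT", "ACGTACGA", "ACGTTCGT", "ACGAACGT"]
print(segregating_sites(toy_alignment))
print(nucleotide_diversity(toy_alignment))
print(watterson_theta(toy_alignment))
print(tajimas_d(toy_alignment))
```

    In the toy alignment every variant is a singleton, so Pi falls below Watterson's Theta and Tajima's D is negative, the pattern usually read as an excess of rare alleles.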

  3. Unraveling Mycobacterium tuberculosis genomic diversity and evolution in Lisbon, Portugal, a highly drug resistant setting

    KAUST Repository

    Perdigão, João

    2014-11-18

    Background: Multidrug- (MDR) and extensively drug-resistant (XDR) tuberculosis (TB) presents a challenge to disease control and elimination goals. In Lisbon, Portugal, specific and successful XDR-TB strains have been found in circulation for almost two decades. Results: In the present study we have genotyped and sequenced the genomes of 56 Mycobacterium tuberculosis isolates recovered mostly from Lisbon. The genotyping data revealed three major clusters associated with MDR-TB, two of which are associated with XDR-TB. The genomic data helped elucidate the phylogenetic positioning of circulating MDR-TB strains, showing a high predominance of a single SNP cluster group, group 5. Furthermore, a genome-wide phylogeny analysis from these strains, together with 19 publicly available genomes of Mycobacterium tuberculosis clinical isolates, revealed two major clades responsible for M/XDR-TB in the region: Lisboa3 and Q1 (LAM). The data presented by this study yielded insights on microevolution and identification of novel compensatory mutations associated with rifampicin resistance in rpoB and rpoC. The screening for other structural variations revealed putative clade-defining variants. One deletion in PPE41, found among Lisboa3 isolates, is proposed to contribute to immune evasion and as a selective advantage. Insertion sequence (IS) mapping has also demonstrated the role of IS6110 as a major driver in mycobacterial evolution by affecting gene integrity and regulation. Conclusions: Globally, this study contributes novel genome-wide phylogenetic data and has led to the identification of new genomic variants that support the notion of a growing genomic diversity facing both setting and host adaptation.

  4. Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains

    Science.gov (United States)

    Hansen, Lori M.; Bernick, David L.; Abedrabbo, Samar; Underwood, Jason G.; Kong, Nguyet; Huang, Bihua C.; Weis, Allison M.; Pourmand, Nader

    2017-01-01

    Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these H. pylori isolates, including movement of the transposable element IS607, large and small inversions, multiple single nucleotide polymorphisms, and variation in cagA copy number. The cagA gene was found as 1 to 4 tandem copies located off the cag island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in cagY, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains. PMID:28223462

  5. Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains

    Directory of Open Access Journals (Sweden)

    Jenny L. Draper

    2017-02-01

    Full Text Available Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these H. pylori isolates, including movement of the transposable element IS607, large and small inversions, multiple single nucleotide polymorphisms, and variation in cagA copy number. The cagA gene was found as 1 to 4 tandem copies located off the cag island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in cagY, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains.

  6. Bioethics methods in the ethical, legal, and social implications of the human genome project literature.

    Science.gov (United States)

    Walker, Rebecca L; Morrissey, Clair

    2014-11-01

    While bioethics as a field has concerned itself with methodological issues since the early years, there has been no systematic examination of how ethics is incorporated into research on the Ethical, Legal and Social Implications (ELSI) of the Human Genome Project. Yet ELSI research may bear a particular burden of investigating and substantiating its methods given public funding, an explicitly cross-disciplinary approach, and the perceived significance of adequate responsiveness to advances in genomics. We undertook a qualitative content analysis of a sample of ELSI publications appearing between 2003 and 2008 with the aim of better understanding the methods, aims, and approaches to ethics that ELSI researchers employ. We found that the aims of ethics within ELSI are largely prescriptive and address multiple groups. We also found that the bioethics methods used in the ELSI literature are both diverse between publications and multiple within publications, but are usually not themselves discussed or employed as suggested by bioethics method proponents. Ethics in ELSI is also sometimes undistinguished from related inquiries (such as social, legal, or political investigations).

  7. Evolution of sociality in spiders leads to depleted genomic diversity at both population and species levels.

    Science.gov (United States)

    Settepani, V; Schou, M F; Greve, M; Grinsted, L; Bechsgaard, J; Bilde, T

    2017-08-01

    Across several animal taxa, the evolution of sociality involves a suite of characteristics, a "social syndrome," that includes cooperative breeding, reproductive skew, primary female-biased sex ratio, and the transition from outcrossing to inbreeding mating system, factors that are expected to reduce effective population size (Ne). This social syndrome may be favoured by short-term benefits but come with long-term costs, because the reduction in Ne amplifies loss of genetic diversity by genetic drift, ultimately restricting the potential of populations to respond to environmental change. To investigate the consequences of this social life form on genetic diversity, we used a comparative RAD-sequencing approach to estimate genomewide diversity in spider species that differ in level of sociality, reproductive skew and mating system. We analysed multiple populations of three independent sister-species pairs of social inbreeding and subsocial outcrossing Stegodyphus spiders, and a subsocial outgroup. Heterozygosity and within-population diversity were sixfold to 10-fold lower in social compared to subsocial species, and demographic modelling revealed a tenfold reduction in Ne of social populations. Species-wide genetic diversity depends on population divergence and the viability of genetic lineages. Population genomic patterns were consistent with high lineage turnover, which homogenizes the genetic structure that builds up between inbreeding populations, ultimately depleting genetic diversity at the species level. Indeed, species-wide genetic diversity of social species was 5-8 times lower than that of subsocial species. The repeated evolution of species with this social syndrome is associated with severe loss of genomewide diversity, likely to limit their evolutionary potential. © 2017 John Wiley & Sons Ltd.

  8. Characterization of the Metabochip in diverse populations from the International HapMap Project in the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) project.

    Science.gov (United States)

    Crawford, Dana C; Goodloe, Robert; Brown-Gentry, Kristin; Wilson, Sarah; Roberson, Jamie; Gillani, Niloufar B; Ritchie, Marylyn D; Dilks, Holli H; Bush, William S

    2013-01-01

    Genome-wide association studies (GWAS) have identified hundreds of genomic regions associated with common human disease and quantitative traits. A major research avenue for mature genotype-phenotype associations is the identification of the true risk or functional variant for downstream molecular studies or personalized medicine applications. As part of the Population Architecture using Genomics and Epidemiology (PAGE) study, we as Epidemiologic Architecture for Genes Linked to Environment (EAGLE) are fine-mapping GWAS-identified genomic regions for common diseases and quantitative traits. We are currently genotyping the Metabochip, a custom content BeadChip designed for fine-mapping metabolic diseases and traits, in ~15,000 DNA samples from patients of African, Hispanic, and Asian ancestry linked to de-identified electronic medical records from the Vanderbilt University biorepository (BioVU). As an initial study of quality control, we report here the genotyping data for 360 samples of European, African, Asian, and Mexican descent from the International HapMap Project. In addition to quality control metrics, we report the overall allele frequency distribution, overall population differentiation (as measured by FST), and linkage disequilibrium patterns for a select GWAS-identified region associated with low-density lipoprotein cholesterol levels to illustrate the utility of the Metabochip for fine-mapping studies in the diverse populations expected in EAGLE, the PAGE study, and other efforts underway designed to characterize the complex genetic architecture underlying common human disease and quantitative traits.
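    As a rough illustration of population differentiation measured by FST, the sketch below computes Wright's FST for one biallelic SNP as (H_T - H_S) / H_T. The abstract does not say which estimator the authors used, so this is only an assumption-laden example: the function name and allele frequencies are hypothetical, populations are weighted equally, and no sample-size correction is applied.

```python
def wright_fst(allele_freqs):
    """Wright's FST for one biallelic SNP from per-population allele frequencies.

    H_S is the mean expected heterozygosity within populations and H_T the
    expected heterozygosity of the pooled population (equal weights assumed).
    """
    k = len(allele_freqs)
    p_bar = sum(allele_freqs) / k
    h_t = 2.0 * p_bar * (1.0 - p_bar)
    if h_t == 0.0:
        return 0.0  # monomorphic in every population: no differentiation to measure
    h_s = sum(2.0 * p * (1.0 - p) for p in allele_freqs) / k
    return (h_t - h_s) / h_t


# Hypothetical frequencies of one allele in two populations.
print(wright_fst([0.1, 0.9]))        # strongly differentiated
print(wright_fst([0.3, 0.3, 0.3]))   # identical frequencies
```

    Values near 0 indicate similar allele frequencies across populations; values approaching 1 indicate populations fixed for different alleles.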

  9. Comparative genomics of plant-associated Pseudomonas spp.: Insights into diversity and inheritance of traits involved in multitrophic interactions

    NARCIS (Netherlands)

    Loper, J.E.; Hassan, K.A.; Mavrodi, D.V.; Davis II, E.W.; Lim, C.K.; Shaffer, B.T.; Elbourne, L.D.H.; Stockwell, V.O.; Hartney, S.L.; Breakwell, K.; Henkels, M.D.; Tetu, S.G.; Rangel, L.I.; Kidarsa, T.A.; Wilson, N.L.; Mortel, van de J.E.; Song, C.; Blumhagen, R.; Radune, D.; Hostetler, J.B.; Brinkac, L.M.; Durkin, A.C.; Kluepfel, D.A.; Wechter, W.P.; Anderson, A.J.; Kim, Y.C.; Pierson III, L.S.; Pierson, E.A.; Lindow, S.E.; Kobayashi, D.Y.; Raaijmakers, J.; Weller, D.M.; Thomashow, L.S.; Allen, A.E.; Paulsen, I.T.

    2012-01-01

    We provide here a comparative genome analysis of ten strains within the Pseudomonas fluorescens group including seven new genomic sequences. These strains exhibit a diverse spectrum of traits involved in biological control and other multitrophic interactions with plants, microbes, and insects. Multi

  10. The diversity of shell matrix proteins: genome-wide investigation of the pearl oyster, Pinctada fucata.

    Science.gov (United States)

    Miyamoto, Hiroshi; Endo, Hirotoshi; Hashimoto, Naoki; Limura, Kurin; Isowa, Yukinobu; Kinoshita, Shigeharu; Kotaki, Tomohiro; Masaoka, Tetsuji; Miki, Takumi; Nakayama, Seiji; Nogawa, Chihiro; Notazawa, Atsuto; Ohmori, Fumito; Sarashina, Isao; Suzuki, Michio; Takagi, Ryousuke; Takahashi, Jun; Takeuchi, Takeshi; Yokoo, Naoki; Satoh, Nori; Toyohara, Haruhiko; Miyashita, Tomoyuki; Wada, Hiroshi; Samata, Tetsuro; Endo, Kazuyoshi; Nagasawa, Hiromichi; Asakawa, Shuichi; Watabe, Shugo

    2013-10-01

    In molluscs, shell matrix proteins are associated with biomineralization, a biologically controlled process that involves nucleation and growth of calcium carbonate crystals. Identification and characterization of shell matrix proteins are important for better understanding of the adaptive radiation of a large variety of molluscs. We searched the draft genome sequence of the pearl oyster Pinctada fucata and annotated 30 different kinds of shell matrix proteins. Of these, we identified Perlucin, ependymin-related protein and SPARC as common genes shared by bivalves and gastropods; however, most gastropod shell matrix proteins were not found in the P. fucata genome. Glycine-rich proteins were conserved in the genus Pinctada. Another important finding with regard to these annotated genes was that numerous shell matrix proteins are encoded by more than one gene; e.g., three ACCBP-like proteins, three CaLPs, five chitin synthase-like proteins, two N16 proteins (pearlins), 10 N19 proteins, two nacreins, four Pifs, nine shematrins, two prismalin-14 proteins, and 21 tyrosinases. This diversity of shell matrix proteins may be implicated in the morphological diversity of mollusc shells. The annotated genes reported here can be searched in P. fucata gene models version 1.1 and genome assembly version 1.0 ( http://marinegenomics.oist.jp/pinctada_fucata ). These genes should provide a useful resource for studies of the genetic basis of biomineralization and evaluation of the role of shell matrix proteins as an evolutionary toolkit among the molluscs.

  11. Genomic diversity of EPEC associated with clinical presentations of differing severity

    Science.gov (United States)

    Hazen, Tracy H.; Donnenberg, Michael S.; Panchalingam, Sandra; Antonio, Martin; Hossain, Anowar; Mandomando, Inacio; Ochieng, John Benjamin; Ramamurthy, Thandavarayan; Tamboura, Boubou; Qureshi, Shahida; Quadri, Farheen; Zaidi, Anita; Kotloff, Karen L.; Levine, Myron M.; Barry, Eileen M.; Kaper, James B.; Rasko, David A.; Nataro, James P.

    2016-01-01

    Enteropathogenic Escherichia coli (EPEC) are diarrhoeagenic E. coli, and are a significant cause of gastrointestinal illness among young children in developing countries. Typical EPEC are identified by the presence of the bundle-forming pilus encoded by a virulence plasmid, which has been linked to an increased severity of illness, while atypical EPEC lack this feature. Comparative genomics of 70 total EPEC from lethal (LI), non-lethal symptomatic (NSI) or asymptomatic (AI) cases of diarrhoeal illness in children enrolled in the Global Enteric Multicenter Study was used to investigate the genomic differences in EPEC isolates obtained from individuals with various clinical outcomes. A comparison of the genomes of isolates from different clinical outcomes identified genes that were significantly more prevalent in EPEC isolates of symptomatic and lethal outcomes than in EPEC isolates of asymptomatic outcomes. These EPEC isolates exhibited previously unappreciated phylogenomic diversity and combinations of virulence factors. These comparative results highlight the diversity of the pathogen, as well as the complexity of the EPEC virulence factor repertoire. PMID:27571975

  12. Anaplasma marginale: Diversity, Virulence, and Vaccine Landscape through a Genomics Approach

    Science.gov (United States)

    Amaro-Estrada, Itzel; Rodríguez-Camarillo, Sergio Darío

    2016-01-01

    In order to understand the genetic diversity of A. marginale, several efforts have been made around the world. This rickettsia affects a significant number of ruminants, causing bovine anaplasmosis, so its virulence and mode of transmission have drawn interest not only from a molecular point of view; recently, genomics research has also been performed to elucidate genes and proteins with potential as antigens. Unfortunately, so far, we still do not have a recombinant anaplasmosis vaccine. In this review, we present a landscape of the multiple approaches carried out from the genomic perspective to generate valuable information that could be used in a holistic way to finally develop an anaplasmosis vaccine. These approaches include the analysis of the genetic diversity of A. marginale and how this affects control measures for the disease. Anaplasmosis vaccine development is also reviewed, from conventional vaccinomics to the genome-based vaccinology approach based on reported proteomics, metabolomics, and transcriptomics analyses. The use of these new omics approaches will undoubtedly reveal new targets of interest in the near future, comprising information on potential antigens and the immunogenic effect of A. marginale proteins. PMID:27610385

  13. Challenges of metagenomics and single-cell genomics approaches for exploring cyanobacterial diversity.

    Science.gov (United States)

    Davison, Michelle; Hall, Eric; Zare, Richard; Bhaya, Devaki

    2015-10-01

    Cyanobacteria have played a crucial role in the history of early Earth and continue to be instrumental in shaping our planet, yet cutting-edge technologies have not yet been widely used to explore cyanobacterial diversity. To provide adequate background, we briefly review current sequencing technologies and their innovative uses in genomics and metagenomics. Next, we focus on current cell capture technologies and the challenges of using them with cyanobacteria. We illustrate the utility of coupling breakthroughs in DNA amplification with cell capture platforms, with an example of microfluidic isolation and subsequent targeted amplicon sequencing from individual terrestrial thermophilic cyanobacteria. Single cells of thermophilic, unicellular Synechococcus sp. JA-2-3-B'a(2-13) (Syn OS-B') were sorted in a microfluidic device, lysed, and subjected to whole genome amplification by multiple displacement amplification. We amplified regions from specific CRISPR spacer arrays, which are known to be highly diverse, contain semi-palindromic repeats that form secondary structure, and can be difficult to amplify. Cell capture, lysis, and genome amplification on a microfluidic device have been optimized, setting the stage for further investigations of individual cyanobacterial cells isolated directly from natural populations.

  14. Anaplasma marginale: Diversity, Virulence, and Vaccine Landscape through a Genomics Approach

    Directory of Open Access Journals (Sweden)

    Rosa Estela Quiroz-Castañeda

    2016-01-01

    Full Text Available Several efforts have been made around the world to understand the genetic diversity of A. marginale. This rickettsia affects a significant number of ruminants, causing bovine anaplasmosis. Its virulence and mode of transmission have drawn interest from a molecular point of view and, more recently, genomics research has been performed to identify genes and proteins with potential as antigens. Unfortunately, so far, we still do not have a recombinant anaplasmosis vaccine. In this review, we present a landscape of the multiple approaches carried out from the genomic perspective to generate valuable information that could be used in a holistic way to finally develop an anaplasmosis vaccine. These approaches include the analysis of the genetic diversity of A. marginale and how this affects control measures for the disease. Anaplasmosis vaccine development is also reviewed, from conventional vaccinomics to genome-based vaccinology approaches built on the proteomics, metabolomics, and transcriptomics analyses reported. The use of these new omics approaches will undoubtedly reveal new targets of interest in the near future, comprising information on potential antigens and the immunogenic effect of A. marginale proteins.

  15. The database of the PREDICTS (Projecting Responses of Ecological Diversity In Changing Terrestrial Systems) project.

    Science.gov (United States)

    Hudson, Lawrence N; Newbold, Tim; Contu, Sara; Hill, Samantha L L; Lysenko, Igor; De Palma, Adriana; Phillips, Helen R P; Alhusseini, Tamera I; Bedford, Felicity E; Bennett, Dominic J; Booth, Hollie; Burton, Victoria J; Chng, Charlotte W T; Choimes, Argyrios; Correia, David L P; Day, Julie; Echeverría-Londoño, Susy; Emerson, Susan R; Gao, Di; Garon, Morgan; Harrison, Michelle L K; Ingram, Daniel J; Jung, Martin; Kemp, Victoria; Kirkpatrick, Lucinda; Martin, Callum D; Pan, Yuan; Pask-Hale, Gwilym D; Pynegar, Edwin L; Robinson, Alexandra N; Sanchez-Ortiz, Katia; Senior, Rebecca A; Simmons, Benno I; White, Hannah J; Zhang, Hanbin; Aben, Job; Abrahamczyk, Stefan; Adum, Gilbert B; Aguilar-Barquero, Virginia; Aizen, Marcelo A; Albertos, Belén; Alcala, E L; Del Mar Alguacil, Maria; Alignier, Audrey; Ancrenaz, Marc; Andersen, Alan N; Arbeláez-Cortés, Enrique; Armbrecht, Inge; Arroyo-Rodríguez, Víctor; Aumann, Tom; Axmacher, Jan C; Azhar, Badrul; Azpiroz, Adrián B; Baeten, Lander; Bakayoko, Adama; Báldi, András; Banks, John E; Baral, Sharad K; Barlow, Jos; Barratt, Barbara I P; Barrico, Lurdes; Bartolommei, Paola; Barton, Diane M; Basset, Yves; Batáry, Péter; Bates, Adam J; Baur, Bruno; Bayne, Erin M; Beja, Pedro; Benedick, Suzan; Berg, Åke; Bernard, Henry; Berry, Nicholas J; Bhatt, Dinesh; Bicknell, Jake E; Bihn, Jochen H; Blake, Robin J; Bobo, Kadiri S; Bóçon, Roberto; Boekhout, Teun; Böhning-Gaese, Katrin; Bonham, Kevin J; Borges, Paulo A V; Borges, Sérgio H; Boutin, Céline; Bouyer, Jérémy; Bragagnolo, Cibele; Brandt, Jodi S; Brearley, Francis Q; Brito, Isabel; Bros, Vicenç; Brunet, Jörg; Buczkowski, Grzegorz; Buddle, Christopher M; Bugter, Rob; Buscardo, Erika; Buse, Jörn; Cabra-García, Jimmy; Cáceres, Nilton C; Cagle, Nicolette L; Calviño-Cancela, María; Cameron, Sydney A; Cancello, Eliana M; Caparrós, Rut; Cardoso, Pedro; Carpenter, Dan; Carrijo, Tiago F; Carvalho, Anelena L; Cassano, Camila R; Castro, Helena; Castro-Luna, Alejandro A; Rolando, Cerda B; Cerezo, 
Alexis; Chapman, Kim Alan; Chauvat, Matthieu; Christensen, Morten; Clarke, Francis M; Cleary, Daniel F R; Colombo, Giorgio; Connop, Stuart P; Craig, Michael D; Cruz-López, Leopoldo; Cunningham, Saul A; D'Aniello, Biagio; D'Cruze, Neil; da Silva, Pedro Giovâni; Dallimer, Martin; Danquah, Emmanuel; Darvill, Ben; Dauber, Jens; Davis, Adrian L V; Dawson, Jeff; de Sassi, Claudio; de Thoisy, Benoit; Deheuvels, Olivier; Dejean, Alain; Devineau, Jean-Louis; Diekötter, Tim; Dolia, Jignasu V; Domínguez, Erwin; Dominguez-Haydar, Yamileth; Dorn, Silvia; Draper, Isabel; Dreber, Niels; Dumont, Bertrand; Dures, Simon G; Dynesius, Mats; Edenius, Lars; Eggleton, Paul; Eigenbrod, Felix; Elek, Zoltán; Entling, Martin H; Esler, Karen J; de Lima, Ricardo F; Faruk, Aisyah; Farwig, Nina; Fayle, Tom M; Felicioli, Antonio; Felton, Annika M; Fensham, Roderick J; Fernandez, Ignacio C; Ferreira, Catarina C; Ficetola, Gentile F; Fiera, Cristina; Filgueiras, Bruno K C; Fırıncıoğlu, Hüseyin K; Flaspohler, David; Floren, Andreas; Fonte, Steven J; Fournier, Anne; Fowler, Robert E; Franzén, Markus; Fraser, Lauchlan H; Fredriksson, Gabriella M; Freire, Geraldo B; Frizzo, Tiago L M; Fukuda, Daisuke; Furlani, Dario; Gaigher, René; Ganzhorn, Jörg U; García, Karla P; Garcia-R, Juan C; Garden, Jenni G; Garilleti, Ricardo; Ge, Bao-Ming; Gendreau-Berthiaume, Benoit; Gerard, Philippa J; Gheler-Costa, Carla; Gilbert, Benjamin; Giordani, Paolo; Giordano, Simonetta; Golodets, Carly; Gomes, Laurens G L; Gould, Rachelle K; Goulson, Dave; Gove, Aaron D; Granjon, Laurent; Grass, Ingo; Gray, Claudia L; Grogan, James; Gu, Weibin; Guardiola, Moisès; Gunawardene, Nihara R; Gutierrez, Alvaro G; Gutiérrez-Lamus, Doris L; Haarmeyer, Daniela H; Hanley, Mick E; Hanson, Thor; Hashim, Nor R; Hassan, Shombe N; Hatfield, Richard G; Hawes, Joseph E; Hayward, Matt W; Hébert, Christian; Helden, Alvin J; Henden, John-André; Henschel, Philipp; Hernández, Lionel; Herrera, James P; Herrmann, Farina; Herzog, Felix; Higuera-Diaz, 
Diego; Hilje, Branko; Höfer, Hubert; Hoffmann, Anke; Horgan, Finbarr G; Hornung, Elisabeth; Horváth, Roland; Hylander, Kristoffer; Isaacs-Cubides, Paola; Ishida, Hiroaki; Ishitani, Masahiro; Jacobs, Carmen T; Jaramillo, Víctor J; Jauker, Birgit; Hernández, F Jiménez; Johnson, McKenzie F; Jolli, Virat; Jonsell, Mats; Juliani, S Nur; Jung, Thomas S; Kapoor, Vena; Kappes, Heike; Kati, Vassiliki; Katovai, Eric; Kellner, Klaus; Kessler, Michael; Kirby, Kathryn R; Kittle, Andrew M; Knight, Mairi E; Knop, Eva; Kohler, Florian; Koivula, Matti; Kolb, Annette

    2017-01-01

    The PREDICTS project (Projecting Responses of Ecological Diversity In Changing Terrestrial Systems; www.predicts.org.uk) has collated from published studies a large, reasonably representative database of comparable samples of biodiversity from multiple sites that differ in the nature or intensity of human impacts relating to land use. We have used this evidence base to develop global and regional statistical models of how local biodiversity responds to these measures. We describe and make freely available this 2016 release of the database, containing more than 3.2 million records sampled at over 26,000 locations and representing over 47,000 species. We outline how the database can help in answering a range of questions in ecology and conservation biology. To our knowledge, this is the largest and most geographically and taxonomically representative database of spatial comparisons of biodiversity that has been collated to date; it will be useful to researchers and international efforts wishing to model and understand the global status of biodiversity.

  16. Evolutionary impact of transposable elements on genomic diversity and lineage-specific innovation in vertebrates.

    Science.gov (United States)

    Warren, Ian A; Naville, Magali; Chalopin, Domitille; Levin, Perrine; Berger, Chloé Suzanne; Galiana, Delphine; Volff, Jean-Nicolas

    2015-09-01

    Since the discovery of transposable elements, a growing body of evidence has emerged demonstrating that they are important drivers of species diversity. These mobile elements exhibit a great variety in structure, size and mechanisms of transposition, making them important putative actors in organism evolution. The vertebrates represent a highly diverse and successful lineage that has adapted to a wide range of different environments. These animals also possess a rich repertoire of transposable elements, with highly diverse content between lineages and even between species. Here, we review how transposable elements are driving genomic diversity and lineage-specific innovation within vertebrates. We discuss the large differences in TE content between different vertebrate groups and then look at how they affect organisms at a variety of levels: from the structure of chromosomes to their involvement in the regulation of gene expression, as well as in the formation and evolution of non-coding RNAs and protein-coding genes. In doing so, we highlight how transposable elements have been involved in the evolution of some of the key innovations observed within the vertebrate lineage, driving the group's diversity and success.

  17. The Oryza Map Alignment Project (OMAP) introgression lines for allelic diversity and new germplasm development

    Science.gov (United States)

    The Oryza Map Alignment Project (OMAP) has developed a genus wide model system for the study of rice that will ultimately provide a complete understanding of the genus. The purpose of this project is to capitalize on the strengths of the Arizona Genomics Institute (AGI), OMAP participants and the r...

  18. Comparative genomic data of the Avian Phylogenomics Project

    DEFF Research Database (Denmark)

    Zhang, Guojie; Li, Bo; Li, Cai;

    2014-01-01

    , which include 38 newly sequenced avian genomes plus previously released or simultaneously released genomes of Chicken, Zebra finch, Turkey, Pigeon, Peregrine falcon, Duck, Budgerigar, Adelie penguin, Emperor penguin and the Medium Ground Finch. We hope that this resource will serve future efforts...... in an average N50 scaffold size of about 50 kb. Repetitive elements comprised 4%-22% of the bird genomes. The assembled scaffolds allowed the homology-based annotation of 13,000 to 17,000 protein-coding genes in each avian genome relative to chicken, zebra finch and human, as well as comparative and sequence...
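    The N50 scaffold size quoted in the record above is a standard assembly statistic: the length of the scaffold at which the running total, taken from longest to shortest, first reaches half of the assembly length. A minimal sketch follows; the function name `n50` and the example lengths are hypothetical illustrations and are not data from the Avian Phylogenomics Project.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths.

    N50 is the length L such that scaffolds of length >= L together
    cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("need at least one scaffold length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical scaffold lengths in base pairs (illustrative only).
example_lengths = [80_000, 70_000, 50_000, 40_000, 30_000, 20_000]
print(n50(example_lengths))
```

    Sorting from longest to shortest means a few very long scaffolds dominate the statistic, which is why N50 is usually reported alongside total assembly size.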

  19. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.

    Science.gov (United States)

    Birney, Ewan; Stamatoyannopoulos, John A; Dutta, Anindya; Guigó, Roderic; Gingeras, Thomas R; Margulies, Elliott H; Weng, Zhiping; Snyder, Michael; Dermitzakis, Emmanouil T; Thurman, Robert E; Kuehn, Michael S; Taylor, Christopher M; Neph, Shane; Koch, Christoph M; Asthana, Saurabh; Malhotra, Ankit; Adzhubei, Ivan; Greenbaum, Jason A; Andrews, Robert M; Flicek, Paul; Boyle, Patrick J; Cao, Hua; Carter, Nigel P; Clelland, Gayle K; Davis, Sean; Day, Nathan; Dhami, Pawandeep; Dillon, Shane C; Dorschner, Michael O; Fiegler, Heike; Giresi, Paul G; Goldy, Jeff; Hawrylycz, Michael; Haydock, Andrew; Humbert, Richard; James, Keith D; Johnson, Brett E; Johnson, Ericka M; Frum, Tristan T; Rosenzweig, Elizabeth R; Karnani, Neerja; Lee, Kirsten; Lefebvre, Gregory C; Navas, Patrick A; Neri, Fidencio; Parker, Stephen C J; Sabo, Peter J; Sandstrom, Richard; Shafer, Anthony; Vetrie, David; Weaver, Molly; Wilcox, Sarah; Yu, Man; Collins, Francis S; Dekker, Job; Lieb, Jason D; Tullius, Thomas D; Crawford, Gregory E; Sunyaev, Shamil; Noble, William S; Dunham, Ian; Denoeud, France; Reymond, Alexandre; Kapranov, Philipp; Rozowsky, Joel; Zheng, Deyou; Castelo, Robert; Frankish, Adam; Harrow, Jennifer; Ghosh, Srinka; Sandelin, Albin; Hofacker, Ivo L; Baertsch, Robert; Keefe, Damian; Dike, Sujit; Cheng, Jill; Hirsch, Heather A; Sekinger, Edward A; Lagarde, Julien; Abril, Josep F; Shahab, Atif; Flamm, Christoph; Fried, Claudia; Hackermüller, Jörg; Hertel, Jana; Lindemeyer, Manja; Missal, Kristin; Tanzer, Andrea; Washietl, Stefan; Korbel, Jan; Emanuelsson, Olof; Pedersen, Jakob S; Holroyd, Nancy; Taylor, Ruth; Swarbreck, David; Matthews, Nicholas; Dickson, Mark C; Thomas, Daryl J; Weirauch, Matthew T; Gilbert, James; Drenkow, Jorg; Bell, Ian; Zhao, XiaoDong; Srinivasan, K G; Sung, Wing-Kin; Ooi, Hong Sain; Chiu, Kuo Ping; Foissac, Sylvain; Alioto, Tyler; Brent, Michael; Pachter, Lior; Tress, Michael L; Valencia, Alfonso; Choo, Siew Woh; Choo, Chiou Yu; Ucla, Catherine; Manzano, Caroline; 
Wyss, Carine; Cheung, Evelyn; Clark, Taane G; Brown, James B; Ganesh, Madhavan; Patel, Sandeep; Tammana, Hari; Chrast, Jacqueline; Henrichsen, Charlotte N; Kai, Chikatoshi; Kawai, Jun; Nagalakshmi, Ugrappa; Wu, Jiaqian; Lian, Zheng; Lian, Jin; Newburger, Peter; Zhang, Xueqing; Bickel, Peter; Mattick, John S; Carninci, Piero; Hayashizaki, Yoshihide; Weissman, Sherman; Hubbard, Tim; Myers, Richard M; Rogers, Jane; Stadler, Peter F; Lowe, Todd M; Wei, Chia-Lin; Ruan, Yijun; Struhl, Kevin; Gerstein, Mark; Antonarakis, Stylianos E; Fu, Yutao; Green, Eric D; Karaöz, Ulaş; Siepel, Adam; Taylor, James; Liefer, Laura A; Wetterstrand, Kris A; Good, Peter J; Feingold, Elise A; Guyer, Mark S; Cooper, Gregory M; Asimenos, George; Dewey, Colin N; Hou, Minmei; Nikolaev, Sergey; Montoya-Burgos, Juan I; Löytynoja, Ari; Whelan, Simon; Pardi, Fabio; Massingham, Tim; Huang, Haiyan; Zhang, Nancy R; Holmes, Ian; Mullikin, James C; Ureta-Vidal, Abel; Paten, Benedict; Seringhaus, Michael; Church, Deanna; Rosenbloom, Kate; Kent, W James; Stone, Eric A; Batzoglou, Serafim; Goldman, Nick; Hardison, Ross C; Haussler, David; Miller, Webb; Sidow, Arend; Trinklein, Nathan D; Zhang, Zhengdong D; Barrera, Leah; Stuart, Rhona; King, David C; Ameur, Adam; Enroth, Stefan; Bieda, Mark C; Kim, Jonghwan; Bhinge, Akshay A; Jiang, Nan; Liu, Jun; Yao, Fei; Vega, Vinsensius B; Lee, Charlie W H; Ng, Patrick; Shahab, Atif; Yang, Annie; Moqtaderi, Zarmik; Zhu, Zhou; Xu, Xiaoqin; Squazzo, Sharon; Oberley, Matthew J; Inman, David; Singer, Michael A; Richmond, Todd A; Munn, Kyle J; Rada-Iglesias, Alvaro; Wallerman, Ola; Komorowski, Jan; Fowler, Joanna C; Couttet, Phillippe; Bruce, Alexander W; Dovey, Oliver M; Ellis, Peter D; Langford, Cordelia F; Nix, David A; Euskirchen, Ghia; Hartman, Stephen; Urban, Alexander E; Kraus, Peter; Van Calcar, Sara; Heintzman, Nate; Kim, Tae Hoon; Wang, Kun; Qu, Chunxu; Hon, Gary; Luna, Rosa; Glass, Christopher K; Rosenfeld, M Geoff; Aldred, Shelley Force; Cooper, Sara J; Halees, 
Anason; Lin, Jane M; Shulha, Hennady P; Zhang, Xiaoling; Xu, Mousheng; Haidar, Jaafar N S; Yu, Yong; Ruan, Yijun; Iyer, Vishwanath R; Green, Roland D; Wadelius, Claes; Farnham, Peggy J; Ren, Bing; Harte, Rachel A; Hinrichs, Angie S; Trumbower, Heather; Clawson, Hiram; Hillman-Jackson, Jennifer; Zweig, Ann S; Smith, Kayla; Thakkapallayil, Archana; Barber, Galt; Kuhn, Robert M; Karolchik, Donna; Armengol, Lluis; Bird, Christine P; de Bakker, Paul I W; Kern, Andrew D; Lopez-Bigas, Nuria; Martin, Joel D; Stranger, Barbara E; Woodroffe, Abigail; Davydov, Eugene; Dimas, Antigone; Eyras, Eduardo; Hallgrímsdóttir, Ingileif B; Huppert, Julian; Zody, Michael C; Abecasis, Gonçalo R; Estivill, Xavier; Bouffard, Gerard G; Guan, Xiaobin; Hansen, Nancy F; Idol, Jacquelyn R; Maduro, Valerie V B; Maskeri, Baishali; McDowell, Jennifer C; Park, Morgan; Thomas, Pamela J; Young, Alice C; Blakesley, Robert W; Muzny, Donna M; Sodergren, Erica; Wheeler, David A; Worley, Kim C; Jiang, Huaiyang; Weinstock, George M; Gibbs, Richard A; Graves, Tina; Fulton, Robert; Mardis, Elaine R; Wilson, Richard K; Clamp, Michele; Cuff, James; Gnerre, Sante; Jaffe, David B; Chang, Jean L; Lindblad-Toh, Kerstin; Lander, Eric S; Koriabine, Maxim; Nefedov, Mikhail; Osoegawa, Kazutoyo; Yoshinaga, Yuko; Zhu, Baoli; de Jong, Pieter J

    2007-06-14

    We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

  20. Rhipicephalus (Boophilus) microplus strain Deutsch, whole genome shotgun sequencing project first submission of genome sequence

    Science.gov (United States)

    The size and repetitive nature of the Rhipicephalus microplus genome makes obtaining a full genome sequence difficult. Cot filtration/selection techniques were used to reduce the repetitive fraction of the tick genome and enrich for the fraction of DNA with gene-containing regions. The Cot-selected ...

  1. Genomic and metabolic diversity of Marine Group I Thaumarchaeota in the mesopelagic of two subtropical gyres.

    Directory of Open Access Journals (Sweden)

    Brandon K Swan

    Full Text Available Marine Group I (MGI) Thaumarchaeota are one of the most abundant and cosmopolitan chemoautotrophs within the global dark ocean. To date, no representatives of this archaeal group retrieved from the dark ocean have been successfully cultured. We used single cell genomics to investigate the genomic and metabolic diversity of thaumarchaea within the mesopelagic of the subtropical North Pacific and South Atlantic Ocean. Phylogenetic and metagenomic recruitment analysis revealed that MGI single amplified genomes (SAGs) are genetically and biogeographically distinct from existing thaumarchaea cultures obtained from surface waters. Confirming prior studies, we found genes encoding proteins for aerobic ammonia oxidation and the hydrolysis of urea, which may be used for energy production, as well as genes involved in 3-hydroxypropionate/4-hydroxybutyrate and oxidative tricarboxylic acid pathways. A large proportion of protein sequences identified in MGI SAGs were absent in the marine cultures Cenarchaeum symbiosum and Nitrosopumilus maritimus, thus expanding the predicted protein space for this archaeal group. Identifiable genes located on genomic islands with low metagenome recruitment capacity were enriched in cellular defense functions, likely in response to viral infections or grazing. We show that MGI Thaumarchaeota in the dark ocean may have more flexibility in potential energy sources and adaptations to biotic interactions than the existing, surface-ocean cultures.

  2. Deep Assessment of Genomic Diversity in Cassava for Herbicide Tolerance and Starch Biosynthesis.

    Science.gov (United States)

    Duitama, Jorge; Kafuri, Lina; Tello, Daniel; Leiva, Ana María; Hofinger, Bernhard; Datta, Sneha; Lentini, Zaida; Aranzales, Ericson; Till, Bradley; Ceballos, Hernán

    2017-01-01

    Cassava is one of the most important food security crops in tropical countries, and a competitive resource for the starch, food, feed and ethanol industries. However, genomics research in this crop is much less developed than in other economically important crops such as rice or maize. The International Center for Tropical Agriculture (CIAT) maintains the largest cassava germplasm collection in the world. Unfortunately, the genetic potential of this diversity for breeding programs remains underexploited due to the difficulties in phenotypic screening and the lack of deep genomic information about the different accessions. A chromosome-level assembly of the cassava reference genome was released this year and only a handful of studies have been made, mainly to find quantitative trait loci (QTL) in breeding populations with limited variability. This work presents the results of pooled targeted resequencing of more than 1500 cassava accessions from the CIAT germplasm collection to obtain a dataset of more than 2000 variants within genes related to starch functional properties and herbicide tolerance. Results of twelve bioinformatic pipelines for variant detection in pooled samples were compared to ensure the quality of the variant calling process. Predictions of functional impact were performed using two separate methods to prioritize interesting variation for genotyping and cultivar selection. Targeted resequencing, either by pooled samples or by similar approaches such as Ecotilling or capture, emerges as a cost-effective alternative to whole genome sequencing to identify interesting alleles of genes related to relevant traits within large germplasm collections.

  3. Diversity of chloroplast genome among local clones of cocoa (Theobroma cacao, L.) from Central Sulawesi

    Science.gov (United States)

    Suwastika, I. Nengah; Pakawaru, Nurul Aisyah; Rifka; Rahmansyah; Muslimin; Ishizaki, Yoko; Cruz, André Freire; Basri, Zainuddin; Shiina, Takashi

    2017-02-01

    Chloroplast genomes typically range in size from 120 to 170 kilobase pairs (kb) and are relatively conserved among plant species. Recent evaluations in several species have shown that certain unique regions are highly variable and can be used in phylogenetic analysis. Many fragments of coding regions, introns, and intergenic spacers, such as atpB-rbcL, ndhF, rbcL, rpl16, trnH-psbA, trnL-F, trnS-G, etc., have been used for phylogenetic reconstructions at various taxonomic levels. On that basis, we analyze the diversity of the chloroplast genome within local cacao (Theobroma cacao L.) from Central Sulawesi. Our recent data showed that there are more than 20 clones in local farming in Central Sulawesi, which can be distinguished by phenotypic and nuclear-genome-based characterization with RAPD (Random Amplified Polymorphic DNA) and SSR (Simple Sequence Repeat) markers. In developing DNA markers for this local cacao, we also included analysis based on variation in the chloroplast genome. At least several regions, such as rpl32-trnL, can be considered as chloroplast markers for our local clones of cocoa. Furthermore, we could develop a phylogenetic analysis among clones of cocoa.

  4. A common genomic framework for a diverse assembly of plasmids in the symbiotic nitrogen fixing bacteria.

    Directory of Open Access Journals (Sweden)

    Lisa C Crossman

    Full Text Available This work centres on the genomic comparisons of two closely-related nitrogen-fixing symbiotic bacteria, Rhizobium leguminosarum biovar viciae 3841 and Rhizobium etli CFN42. These strains maintain a stable genomic core that is also common to other rhizobia species plus a very variable and significant accessory component. The chromosomes are highly syntenic, whereas plasmids are related by fewer syntenic blocks and have mosaic structures. The pairs of plasmids p42f-pRL12, p42e-pRL11 and p42b-pRL9, as well as large parts of p42c with pRL10, are shown to be similar, whereas the symbiotic plasmids (p42d and pRL10) are structurally unrelated and seem to follow distinct evolutionary paths. Even though purifying selection is acting on the whole genome, the accessory component is evolving more rapidly. This component is composed largely of proteins for transport of diverse metabolites and elements of external origin. The present analysis allows us to conclude that a heterogeneous and quickly diversifying group of plasmids co-exists in a common genomic framework.

  5. Characterization of the Genomic Diversity of Norovirus in Linked Patients Using a Metagenomic Deep Sequencing Approach

    Science.gov (United States)

    Nasheri, Neda; Petronella, Nicholas; Ronholm, Jennifer; Bidawid, Sabah; Corneau, Nathalie

    2017-01-01

    Norovirus (NoV) is the leading cause of gastroenteritis worldwide. A robust cell culture system does not exist for NoV and therefore detailed characterization of outbreak and sporadic strains relies on molecular techniques. In this study, we employed a metagenomic approach that uses non-specific amplification followed by next-generation sequencing to sequence whole NoV genomes directly from clinical samples obtained from 8 linked patients. Enough sequencing depth was obtained for each sample to allow de novo assembly of near-complete genome sequences. The resultant consensus sequences were then used to identify inter-host nucleotide variations that occur after direct transmission, analyze amino acid variations in the major capsid protein, and provide evidence of recombination events. The analysis of intra-host quasispecies diversity was possible due to the high coverage depth. We also observed a linear relationship between NoV viral load in the clinical sample and the number of sequence reads that could be attributed to NoV. The method demonstrated here has the potential for future use in whole genome sequence analyses of other RNA viruses isolated from clinical, environmental, and food specimens. PMID:28197136

  6. Genomic diversity and evolution of the head crest in the rock pigeon

    Science.gov (United States)

    Shapiro, Michael D.; Kronenberg, Zev; Li, Cai; Domyan, Eric T.; Pan, Hailin; Campbell, Michael; Tan, Hao; Huff, Chad D.; Hu, Haofu; Vickrey, Anna I.; Nielsen, Sandra C.A.; Stringham, Sydney A.; Hu, Hao; Willerslev, Eske; Gilbert, M. Thomas P.; Yandell, Mark; Zhang, Guojie; Wang, Jun

    2013-01-01

    The geographic origins of breeds and genetic basis of variation within the widely distributed and phenotypically diverse domestic rock pigeon (Columba livia) remain largely unknown. We generated a rock pigeon reference genome and additional genome sequences representing domestic and feral populations. We find evidence for the origins of major breed groups in the Middle East, and contributions from a racing breed to North American feral populations. We identify EphB2 as a strong candidate for the derived head crest phenotype shared by numerous breeds, an important trait in mate selection in many avian species. We also find evidence that this trait evolved just once and spread throughout the species, and that the crest originates early in development by the localized molecular reversal of feather bud polarity. PMID:23371554

  7. A genomic insight into diversity among tribal and nontribal population groups of Manipur, India.

    Science.gov (United States)

    Saraswathy, K N; Kiranmala, Naorem; Murry, Benrithung; Sinha, Ekata; Saksena, Deepti; Kaur, Harpreet; Sachdeva, M P; Kalla, A K

    2009-10-01

    Twenty autosomal markers, including linked markers at two genes, are used to understand the genomic similarity and diversity among three tribal communities (Paite, Thadou, and Kom) and one nontribal community (Meitei) of Manipur (Northeast India). Two of the markers (CD4 and HB9) are monomorphic in Paite and one (the CD4 marker) in Kom. Data suggest that the Meitei (the nontribal group) stand apart from the three tribal groups with respect to higher heterozygosity (0.366) and the highest frequency of ancestral DRD2 haplotypes (0.228); this is also supported by principal co-ordinate analysis. These populations are found to be genomically closer to the Chinese population than to other Indian populations.
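    The heterozygosity figure in the record above is the kind of value obtained by averaging expected heterozygosity across loci, where each locus contributes one minus the sum of squared allele frequencies. The sketch below assumes that standard definition; the record does not state which estimator the authors used, and the allele frequencies are hypothetical rather than taken from the study. A monomorphic locus, like the CD4 marker in Paite and Kom, contributes zero.

```python
def expected_heterozygosity(allele_freqs):
    """Expected heterozygosity at one locus: 1 - sum of squared allele frequencies."""
    total = sum(allele_freqs)
    if abs(total - 1.0) > 1e-6:
        raise ValueError("allele frequencies must sum to 1")
    return 1.0 - sum(p * p for p in allele_freqs)


def mean_heterozygosity(loci):
    """Average expected heterozygosity over a list of loci."""
    if not loci:
        raise ValueError("need at least one locus")
    return sum(expected_heterozygosity(freqs) for freqs in loci) / len(loci)


# Hypothetical allele frequencies for three loci (illustrative only).
# The last locus is monomorphic, so it contributes 0 to the average.
hypothetical_loci = [[0.5, 0.5], [0.9, 0.1], [1.0]]
print(round(mean_heterozygosity(hypothetical_loci), 4))
```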

  8. Genome mining expands the chemical diversity of the cyanobactin family to include highly modified linear peptides.

    Science.gov (United States)

    Leikoski, Niina; Liu, Liwei; Jokela, Jouni; Wahlsten, Matti; Gugger, Muriel; Calteau, Alexandra; Permi, Perttu; Kerfeld, Cheryl A; Sivonen, Kaarina; Fewer, David P

    2013-08-22

    Ribosomal peptides are produced through the posttranslational modification of short precursor peptides. Cyanobactins are a growing family of cyclic ribosomal peptides produced by cyanobacteria. However, a broad systematic survey of the genetic capacity to produce cyanobactins is lacking. Here we report the identification of 31 cyanobactin gene clusters from 126 genomes of cyanobacteria. Genome mining suggested a complex evolutionary history defined by horizontal gene transfer and rapid diversification of precursor genes. Extensive chemical analyses demonstrated that some cyanobacteria produce short linear cyanobactins with a chain length ranging from three to five amino acids. The linear peptides were N-prenylated and O-methylated on the N and C termini, respectively, and named aeruginosamide and viridisamide. These findings broaden the structural diversity of the cyanobactin family to include highly modified linear peptides with rare posttranslational modifications.

  9. Volunteering in a Culturally Diverse Context: Implications for Project Designers and Managers.

    Science.gov (United States)

    Martin, Jay

    1999-01-01

    The volunteer pool of social services organizations often does not reflect the cultural diversity of their clientele. Cultural values and past experiences of discrimination are among the reasons for this limited diversity in volunteers. An Australian project found that refugees were reluctant to be clients of agencies whose volunteers did not…

  10. Genomic and Metagenomic Analysis of Diversity-Generating Retroelements Associated with Treponema denticola

    Directory of Open Access Journals (Sweden)

    Sutichot Nimkulrat

    2016-06-01

    Full Text Available Diversity-generating retroelements (DGRs) are genetic cassettes that can produce massive protein sequence variation in prokaryotes. Presumably DGRs confer selective advantages to their hosts (bacteria or viruses) by generating variants of target genes, typically resulting in target proteins with altered ligand-binding specificity, through a specialized error-prone reverse transcription process. The only extensively studied DGR system is from the Bordetella phage BPP-1, although DGRs are predicted to exist in other species. Using bioinformatics analysis, we discovered that the DGR system associated with the Treponema denticola species (a human oral-associated periopathogen) is dynamic (with gains/losses of the system found in the isolates) and diverse (with multiple types found in isolated genomes and the human microbiota). The T. denticola DGR is found in only nine of the 17 sequenced T. denticola strains. Analysis of the DGR-associated template regions and reverse transcriptase gene sequences revealed two types of DGR systems in T. denticola: the ATCC35405-type shared by seven isolates including ATCC35405; and the SP32-type shared by two isolates (SP32 and SP33), suggesting multiple DGR acquisitions. We detected additional variants of the T. denticola DGR systems in the human microbiomes, and found that the SP32-type DGR is more abundant than the ATCC35405-type in the healthy human oral microbiome, although the latter is found in more sequenced isolates. This is the first comprehensive study to characterize the DGRs associated with T. denticola in individual genomes as well as human microbiomes, demonstrating the importance of utilizing both individual genomes and metagenomes for characterizing the elements, and for analyzing their diversity and distribution in human populations.

  11. Sex-biased evolutionary forces shape genomic patterns of human diversity.

    Directory of Open Access Journals (Sweden)

    Michael F Hammer

    Comparisons of levels of variability on the autosomes and X chromosome can be used to test hypotheses about factors influencing patterns of genomic variation. While a tremendous amount of nucleotide sequence data from across the genome is now available for multiple human populations, there has been no systematic effort to examine relative levels of neutral polymorphism on the X chromosome versus autosomes. We analyzed approximately 210 kb of DNA sequencing data representing 40 independent noncoding regions on the autosomes and X chromosome from each of 90 humans from six geographically diverse populations. We correct for differences in mutation rates between males and females by considering the ratio of within-human diversity to human-orangutan divergence. We find that relative levels of genetic variation are higher than expected on the X chromosome in all six human populations. We test a number of alternative hypotheses to explain the excess polymorphism on the X chromosome, including models of background selection, changes in population size, and sex-specific migration in a structured population. While each of these processes may have a small effect on the relative ratio of X-linked to autosomal diversity, our results point to a systematic difference between the sexes in the variance in reproductive success; namely, the widespread effects of polygyny in human populations. We conclude that factors leading to a lower male versus female effective population size must be considered as important demographic variables in efforts to construct models of human demographic history and for understanding the forces shaping patterns of human genomic variability.
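
    The correction described in this abstract, dividing within-human diversity by human-orangutan divergence before comparing the X chromosome with the autosomes, can be sketched as follows. This is a minimal illustration rather than the authors' pipeline: the function names and the input values are hypothetical, and the 0.75 baseline is the standard neutral expectation when male and female effective population sizes are equal.

```python
# Hypothetical sketch: divergence-normalized X-to-autosome diversity ratio.
# Function names and example values are illustrative assumptions, not taken
# from the study.

# With equal male and female effective population sizes there are three
# X chromosomes for every four autosomes, giving a neutral expectation of 3/4.
NEUTRAL_EXPECTATION = 0.75


def normalized_diversity(diversity: float, divergence: float) -> float:
    """Divide within-species diversity by between-species divergence.

    Dividing by divergence cancels out regional differences in mutation
    rate, including the difference between X-linked and autosomal regions.
    """
    if divergence <= 0:
        raise ValueError("divergence must be positive")
    return diversity / divergence


def x_to_autosome_ratio(div_x: float, dvg_x: float,
                        div_a: float, dvg_a: float) -> float:
    """Ratio of normalized X-linked diversity to normalized autosomal diversity."""
    return normalized_diversity(div_x, dvg_x) / normalized_diversity(div_a, dvg_a)


if __name__ == "__main__":
    # Made-up per-site values for one population.
    ratio = x_to_autosome_ratio(div_x=0.0006, dvg_x=0.024,
                                div_a=0.0009, dvg_a=0.030)
    print(f"X/A ratio: {ratio:.3f}, excess over neutral: {ratio > NEUTRAL_EXPECTATION}")
```

    A ratio above 0.75 in such a calculation would correspond to the excess X-linked polymorphism that the abstract reports.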

  12. Maize (Zea mays L.) genome diversity as revealed by RNA-sequencing.

    Directory of Open Access Journals (Sweden)

    Candice N Hansey

    Maize is rich in genetic and phenotypic diversity. Understanding the sequence, structural, and expression variation that contributes to phenotypic diversity would facilitate more efficient varietal improvement. RNA based sequencing (RNA-seq) is a powerful approach for transcriptional analysis, assessing sequence variation, and identifying novel transcript sequences, particularly in large, complex, repetitive genomes such as maize. In this study, we sequenced RNA from whole seedlings of 21 maize inbred lines representing diverse North American and exotic germplasm. Single nucleotide polymorphism (SNP) detection identified 351,710 polymorphic loci distributed throughout the genome covering 22,830 annotated genes. Tight clustering of two distinct heterotic groups and exotic lines was evident using these SNPs as genetic markers. Transcript abundance analysis revealed minimal variation in the total number of genes expressed across these 21 lines (57.1% to 66.0%). However, the transcribed gene set among the 21 lines varied, with 48.7% expressed in all of the lines, 27.9% expressed in one to 20 lines, and 23.4% expressed in none of the lines. De novo assembly of RNA-seq reads that did not map to the reference B73 genome sequence revealed 1,321 high confidence novel transcripts, of which 564 loci were present in all 21 lines, including B73, and 757 loci were restricted to a subset of the lines. RT-PCR validation demonstrated 87.5% concordance with the computational prediction of these expressed novel transcripts. Intriguingly, 145 of the novel de novo assembled loci were present in lines from only one of the two heterotic groups, consistent with the hypothesis that, in addition to sequence polymorphisms and transcript abundance, transcript presence/absence variation is present and, thereby, may be a mechanism contributing to the genetic basis of heterosis.

  13. Intraspecies Genomic Diversity and Long-Term Persistence of Bifidobacterium longum

    Science.gov (United States)

    Chaplin, Andrei V.; Efimov, Boris A.; Smeianov, Vladimir V.; Kafarskaia, Lyudmila I.; Pikina, Alla P.; Shkoporov, Andrei N.

    2015-01-01

    Members of genus Bifidobacterium are Gram-positive bacteria, representing a large part of the human infant microbiota and moderately common in adults. However, our knowledge about their diversity, intraspecific phylogeny and long-term persistence in humans is still limited. Bifidobacterium longum is generally considered to be the most common and prevalent species in the intestinal microbiota. In this work we studied whole genome sequences of 28 strains of B. longum, including 8 sequences described in this paper. Some of these strains were isolated from healthy children during a long observation period (up to 10 years between isolation from the same patient). The three known subspecies (longum, infantis and suis) could be clearly divided using sequence-based phylogenetic methods, gene content and the average nucleotide identity. The profiles of glycoside hydrolase genes reflected the different ecological specializations of these three subspecies. A high impact of horizontal gene transfer on genomic diversity was observed, which is possibly due to a large number of prophages and rapidly spreading plasmids. The pan-genome characteristics of the subspecies longum corresponded to the open pan-genome model. While the major part of the strain-specific genetic loci represented transposons and phage-derived regions, a large number of cell envelope synthesis genes were also observed within this category, representing high variability of cell surface molecules. We observed cases in which highly genetically similar strains of B. longum were isolated from the same patients after long periods of time; however, we did not succeed in isolating genetically identical bacteria, a fact reflecting the high plasticity of the microbiota in children. PMID:26275230

  14. Diversity, distribution and comparative genomics of Microviridae in Sphagnum-peat soils

    Directory of Open Access Journals (Sweden)

    Achim Quaiser

    2015-04-01

    Microviridae, a family of bacteria-infecting ssDNA viruses, is a member of the still poorly characterized bacteriophages, even though they include phage PhiX174, one of the main models in virology for genomic and capsid structure studies. Recent studies suggest that they are diverse and well represented in marine and freshwater virioplankton as well as in human microbiomes. Despite previous knowledge, their diversity, abundance and ecological role are completely unknown in soil ecosystems. Here we present the comparative analysis of 17 completely assembled Microviridae genomes from 12 viromes of a Sphagnum-dominated peatland. Phylogenetic analysis of the conserved major capsid protein sequences revealed the affiliation to Gokushovirinae and Pichovirinae as well as to two newly defined subfamilies, the Aravirinae and Stokavirinae. Structural modeling of the Aravirinae major capsid protein showed similarities to Alpavirinae and Pichovirinae but revealed two additional variable regions potentially involved in phage-host recognition. Two new distinct prophages were identified in the genomes of Parabacteroides merdae and Parabacteroides distasonis representing a potential new subfamily of Microviridae. The differentiation of the subfamilies was confirmed by gene order and similarity analysis. Relative abundance analysis using the affiliation of the major capsid protein (VP1) revealed that Gokushovirinae, followed by Aravirinae, are the most abundant Microviridae in 11 out of 12 peat viromes. Sequences matching Gokushovirinae and Aravirinae VP1 accounted for up to 4.19% and 0.65%, respectively, of the total number of sequences in the corresponding virome. In this study we provide new genome information of Microviridae and pave the way towards quantitative estimations of Microviridae subfamilies.

  15. Combining genomic sequencing methods to explore viral diversity and reveal potential virus-host interactions

    Directory of Open Access Journals (Sweden)

    Cheryl-Emiliane Tien Chow

    2015-04-01

    Viral diversity and virus-host interactions in oxygen-starved regions of the ocean, also known as oxygen minimum zones (OMZs), remain relatively unexplored. Microbial community metabolism in OMZs alters nutrient and energy flow through marine food webs, resulting in biological nitrogen loss and greenhouse gas production. Thus, viruses infecting OMZ microbes have the potential to modulate community metabolism with resulting feedback on ecosystem function. Here, we describe viral communities inhabiting oxic surface (10 m) and oxygen-starved basin (200 m) waters of Saanich Inlet, a seasonally anoxic fjord on the coast of Vancouver Island, British Columbia, using viral metagenomics and complete viral fosmid sequencing on samples collected between April 2007 and April 2010. Of 6459 open reading frames (ORFs) predicted across all 34 viral fosmids, 77.6% (n=5010) had no homology to reference viral genomes. These fosmids recruited a higher proportion of viral metagenomic sequences from Saanich Inlet than from nearby northeastern subarctic Pacific Ocean (Line P) waters, indicating differences in the viral communities between coastal and open ocean locations. While functional annotations of fosmid ORFs were limited, recruitment to NCBI's non-redundant 'nr' database and publicly available single-cell genomes identified putative viruses infecting marine thaumarchaeal and SUP05 proteobacteria to provide potential host linkages with relevance to coupled biogeochemical cycling processes in OMZ waters. Taken together, these results highlight the power of coupled analyses of multiple sequence data types, such as viral metagenomic and fosmid sequence data with prokaryotic single cell genomes, to chart viral diversity, elucidate genomic and ecological contexts for previously unclassifiable viral sequences, and identify novel host interactions in natural and engineered ecosystems.

  16. Intraspecies Genomic Diversity and Long-Term Persistence of Bifidobacterium longum.

    Directory of Open Access Journals (Sweden)

    Andrei V Chaplin

    Members of genus Bifidobacterium are Gram-positive bacteria, representing a large part of the human infant microbiota and moderately common in adults. However, our knowledge about their diversity, intraspecific phylogeny and long-term persistence in humans is still limited. Bifidobacterium longum is generally considered to be the most common and prevalent species in the intestinal microbiota. In this work we studied whole genome sequences of 28 strains of B. longum, including 8 sequences described in this paper. Some of these strains were isolated from healthy children during a long observation period (up to 10 years between isolation from the same patient). The three known subspecies (longum, infantis and suis) could be clearly divided using sequence-based phylogenetic methods, gene content and the average nucleotide identity. The profiles of glycoside hydrolase genes reflected the different ecological specializations of these three subspecies. A high impact of horizontal gene transfer on genomic diversity was observed, which is possibly due to a large number of prophages and rapidly spreading plasmids. The pan-genome characteristics of the subspecies longum corresponded to the open pan-genome model. While the major part of the strain-specific genetic loci represented transposons and phage-derived regions, a large number of cell envelope synthesis genes were also observed within this category, representing high variability of cell surface molecules. We observed cases in which highly genetically similar strains of B. longum were isolated from the same patients after long periods of time; however, we did not succeed in isolating genetically identical bacteria, a fact reflecting the high plasticity of the microbiota in children.

  17. Characterizing neutral genomic diversity and selection signatures in indigenous populations of Moroccan goats (Capra hircus) using WGS data

    Directory of Open Access Journals (Sweden)

    Badr Benjelloun

    2015-04-01

    Since the time of their domestication, goats (Capra hircus) have evolved in a large variety of locally adapted populations in response to different human and environmental pressures. In the present era, many indigenous populations are threatened with extinction due to their substitution by cosmopolitan breeds, while they might represent highly valuable genomic resources. It is thus crucial to characterize the neutral and adaptive genetic diversity of indigenous populations. A fine characterization of whole genome variation in farm animals is now possible by using new sequencing technologies. We sequenced the complete genome at 12X coverage of 44 goats geographically representative of the three phenotypically distinct indigenous populations in Morocco. The study of mitochondrial genomes showed a high diversity exclusively restricted to the haplogroup A. The 44 nuclear genomes showed a very high diversity (24 million variants) associated with low linkage disequilibrium. The overall genetic diversity was weakly structured according to geography and phenotypes. When looking for signals of positive selection in each population we identified many candidate genes, several of which gave insights into the metabolic pathways or biological processes involved in the adaptation to local conditions (e.g. panting in warm/desert conditions). This study highlights the interest of WGS data to characterize livestock genomic diversity. It illustrates the valuable genetic richness present in indigenous populations that have to be sustainably managed and may represent valuable genetic resources for the long-term preservation of the species.

  18. Characterizing neutral genomic diversity and selection signatures in indigenous populations of Moroccan goats (Capra hircus) using WGS data.

    Science.gov (United States)

    Benjelloun, Badr; Alberto, Florian J; Streeter, Ian; Boyer, Frédéric; Coissac, Eric; Stucki, Sylvie; BenBati, Mohammed; Ibnelbachyr, Mustapha; Chentouf, Mouad; Bechchari, Abdelmajid; Leempoel, Kevin; Alberti, Adriana; Engelen, Stefan; Chikhi, Abdelkader; Clarke, Laura; Flicek, Paul; Joost, Stéphane; Taberlet, Pierre; Pompanon, François

    2015-01-01

    Since the time of their domestication, goats (Capra hircus) have evolved in a large variety of locally adapted populations in response to different human and environmental pressures. In the present era, many indigenous populations are threatened with extinction due to their substitution by cosmopolitan breeds, while they might represent highly valuable genomic resources. It is thus crucial to characterize the neutral and adaptive genetic diversity of indigenous populations. A fine characterization of whole genome variation in farm animals is now possible by using new sequencing technologies. We sequenced the complete genome at 12× coverage of 44 goats geographically representative of the three phenotypically distinct indigenous populations in Morocco. The study of mitochondrial genomes showed a high diversity exclusively restricted to the haplogroup A. The 44 nuclear genomes showed a very high diversity (24 million variants) associated with low linkage disequilibrium. The overall genetic diversity was weakly structured according to geography and phenotypes. When looking for signals of positive selection in each population we identified many candidate genes, several of which gave insights into the metabolic pathways or biological processes involved in the adaptation to local conditions (e.g., panting in warm/desert conditions). This study highlights the interest of WGS data to characterize livestock genomic diversity. It illustrates the valuable genetic richness present in indigenous populations that have to be sustainably managed and may represent valuable genetic resources for the long-term preservation of the species.

  19. Penicillium arizonense, a new, genome sequenced fungal species, reveals a high chemical diversity in secreted metabolites

    DEFF Research Database (Denmark)

    Grijseels, Sietske; Nielsen, Jens Christian; Randelovic, Milica;

    2016-01-01

    confirmed the grouping of P. arizonense within section Canescentia. Compared to related species, P. arizonense proved to encode a high number of proteins involved in carbohydrate metabolism, in particular hemicellulases. Mining the genome for genes involved in secondary metabolite biosynthesis resulted...... of biosynthetic gene clusters in P. arizonense responsible for the synthesis of all detected compounds except curvulinic acid. The capacity to produce biomass degrading enzymes and the identification of a high chemical diversity in secreted bioactive secondary metabolites, offers a broad range of potential...

  20. Development of a custom-designed, pan genomic DNA microarray to characterize strain-level diversity among Cronobacter spp.

    Directory of Open Access Journals (Sweden)

    Ben Davies Tall

    2015-04-01

    Cronobacter species cause infections in all age groups; however, neonates are at highest risk and remain the most susceptible age group for life-threatening invasive disease. The genus contains seven species: C. sakazakii, C. malonaticus, C. turicensis, C. muytjensii, C. dublinensis, C. universalis, and C. condimenti. Despite an abundance of published genomes of these species, genomics-based epidemiology of the genus is not well established. The gene content of a diverse group of 126 unique Cronobacter and taxonomically-related isolates was determined using a pan genomic-based DNA microarray as a genotyping tool and as a means to identify outbreak isolates for food safety, environmental, and clinical surveillance purposes. The microarray constitutes 19,287 independent genes representing 15 Cronobacter genomes and 18 plasmids and 2,371 virulence factor genes of phylogenetically-related Gram-negative bacteria. The Cronobacter microarray was able to distinguish the seven Cronobacter species from one another and from non-Cronobacter species; and within each species, strains grouped into distinct clusters based on their genomic diversity. These results also support the phylogenic divergence of the genus and clearly highlight the genomic diversity among each member of the genus. The current study establishes a powerful platform for further genomics research of this diverse genus, an important prerequisite towards the development of future countermeasures against this foodborne pathogen in the food safety and clinical arenas.

  1. Sixteen new lung function signals identified through 1000 Genomes Project reference panel imputation

    NARCIS (Netherlands)

    Artigas, Maria Soler; Wain, Louise V.; Miller, Suzanne; Kheirallah, Abdul Kader; Huffman, Jennifer E.; Ntalla, Ioanna; Shrine, Nick; Obeidat, Ma'en; Trochet, Holly; McArdle, Wendy L.; Alves, Alexessander Couto; Hui, Jennie; Zhao, Jing Hua; Joshi, Peter K.; Teumer, Alexander; Albrecht, Eva; Imboden, Medea; Rawal, Rajesh; Lopez, Lorna M.; Marten, Jonathan; Enroth, Stefan; Surakka, Ida; Polasek, Ozren; Lyytikainen, Leo-Pekka; Granell, Raquel; Hysi, Pirro G.; Flexeder, Claudia; Mahajan, Anubha; Beilby, John; Bosse, Yohan; Brandsma, Corry-Anke; Campbell, Harry; Gieger, Christian; Glaeser, Sven; Gonzalez, Juan R.; Grallert, Harald; Hammond, Chris J.; Harris, Sarah E.; Hartikainen, Anna-Liisa; Heliovaara, Markku; Henderson, John; Hocking, Lynne; Horikoshi, Momoko; Hutri-Kahonen, Nina; Ingelsson, Erik; Johansson, Asa; Kemp, John P.; Kolcic, Ivana; Kumar, Ashish; Lind, Lars; Melen, Erik; Musk, Arthur W.; Navarro, Pau; Nickle, David C.; Padmanabhan, Sandosh; Raitakari, Olli T.; Ried, Janina S.; Ripatti, Samuli; Schulz, Holger; Scott, Robert A.; Sin, Don D.; Starr, John M.; Vinuela, Ana; Voelzke, Henry; Wild, Sarah H.; Wright, Alan F.; Zemunik, Tatijana; Jarvis, Deborah L.; Spector, Tim D.; Evans, David M.; Lehtimaki, Terho; Vitart, Veronique; Kahonen, Mika; Gyllensten, Ulf; Rudan, Igor; Deary, Ian J.; Karrasch, Stefan; Probst-Hensch, Nicole M.; Heinrich, Joachim; Stubbe, Beate; Wilson, James F.; Wareham, Nicholas J.; James, Alan L.; Morris, Andrew P.; Jarvelin, Marjo-Riitta; Hayward, Caroline; Sayers, Ian; Strachan, David P.; Hall, Ian P.; Tobin, Martin D.; Deloukas, Panos; Hansell, Anna L.; Hubbard, Richard; Jackson, Victoria E.; Marchini, Jonathan; Pavord, Ian; Thomson, Neil C.; Zeggini, Eleftheria

    2015-01-01

    Lung function measures are used in the diagnosis of chronic obstructive pulmonary disease. In 38,199 European ancestry individuals, we studied genome-wide association of forced expiratory volume in 1 s (FEV1), forced vital capacity (FVC) and FEV1/FVC with 1000 Genomes Project (phase 1)-imputed genot

  2. The little bacteria that can – diversity, genomics and ecophysiology of ‘Dehalococcoides’ spp. in contaminated environments

    Science.gov (United States)

    Taş, Neslihan; Van Eekert, Miriam H. A.; De Vos, Willem M.; Smidt, Hauke

    2010-01-01

    The fate and persistence of chlorinated organics in the environment have been a concern for the past 50 years. Industrialization and extensive agricultural activities have led to the accumulation of these pollutants in the environment, while their adverse impact on various ecosystems and human health also became evident. This review provides an update on the current knowledge of specialized anaerobic bacteria, namely ‘Dehalococcoides’ spp., which are dedicated to the transformation of various chlorinated organic compounds via reductive dechlorination. Advances in microbiology and molecular techniques have shed light on the diversity and functioning of Dehalococcoides spp. in several different locations. Recent genome sequencing projects revealed a large number of genes that are potentially involved in reductive dechlorination. Molecular approaches towards analysis of diversity and expression, especially of reductive dehalogenase-encoding genes, are providing a growing body of knowledge on biodegradative pathways active in defined pure and mixed cultures as well as directly in the environment. Moreover, several successful field cases of bioremediation strengthen the notion of dedicated degraders such as Dehalococcoides spp. as key players in the restoration of contaminated environments. PMID:21255338

  3. The Human Genome Project: Information access, management, and regulation. Final report

    Energy Technology Data Exchange (ETDEWEB)

    McInerney, J.D.; Micikas, L.B.

    1996-08-31

    The Human Genome Project is a large, internationally coordinated effort in biological research directed at creating a detailed map of human DNA. This report describes information access, management, and regulation in the project. The project led to the development of an instructional module titled The Human Genome Project: Biology, Computers, and Privacy, designed for use in high school biology classes. The module consists of print materials and both Macintosh and Windows versions of related computer software. Appendix A contains a copy of the print materials and discs containing the two versions of the software.

  4. A High-throughput Genomic Tool: Diversity Array Technology Complementary for Rice Genotyping

    Institute of Scientific and Technical Information of China (English)

    Yong Xie; Kenneth McNally; Cheng-Yun Li; Hei Leung; You-Yong Zhu

    2006-01-01

    Diversity array technology (DArT™) is a gel-independent, high-throughput genotyping tool. The main purpose of the present study was to validate DArT for high-throughput genotyping of rice (Oryza sativa L.). Technically, the main objective was to generate a general-purpose rice gene pool and to optimize this genomic tool for evaluating the genetic diversity of rice germplasm. To achieve this, a general-purpose DArT array was first developed. Ten representatives from 24 varieties were hybridized with the general-purpose array to determine the informativeness of the clones printed on the array. The 1,152 informative clones were re-arrayed on a slide and used to fingerprint 17 of the 24 germplasms. Hybridizing targets prepared from the germplasm to be assayed to the DNA array gave DNA fingerprints of the germplasms. Raw data were normalized and transformed into binary data, which were then analyzed using the NTSYSpc software package (Numerical Taxonomy System for cluster and ordination analysis, v. 2.02j). The dendrogram derived from the array data was matched with the simple sequence repeat genotyping outline and the pedigrees of the different varieties. Because DArT is a sequence-independent genotyping approach, it can be applied to studies of genetic diversity and gene mapping in diverse organisms, especially crops with less-developed molecular markers.
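
    The step of turning normalized hybridization signals into binary fingerprints and comparing them can be sketched as below. The abstract does not state the threshold or the similarity coefficient used in NTSYSpc, so the cutoff value, the Jaccard coefficient, and all function names here are assumptions for illustration only.

```python
# Hypothetical sketch: binary DArT-style fingerprints and pairwise similarity.
# The threshold, the Jaccard coefficient, and the sample values are assumptions;
# the abstract only says that normalized data were converted to binary form.

def binarize(signals, threshold):
    """Score each clone as present (1) or absent (0) from its normalized signal."""
    return [1 if s >= threshold else 0 for s in signals]


def jaccard(a, b):
    """Jaccard similarity: shared presences over clones present in either sample."""
    if len(a) != len(b):
        raise ValueError("fingerprints must cover the same clones")
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    # Two all-absent fingerprints are treated as identical.
    return both / either if either else 1.0


if __name__ == "__main__":
    # Made-up normalized signals for two varieties across four clones.
    variety_1 = binarize([0.9, 0.1, 0.7, 0.2], threshold=0.5)
    variety_2 = binarize([0.8, 0.6, 0.3, 0.1], threshold=0.5)
    print(variety_1, variety_2, jaccard(variety_1, variety_2))
```

    A matrix of such pairwise similarities is the kind of input from which a clustering method can build a dendrogram like the one the abstract mentions.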

  5. Assessing Genetic Diversity among Brettanomyces Yeasts by DNA Fingerprinting and Whole-Genome Sequencing

    Science.gov (United States)

    Crauwels, Sam; Zhu, Bo; Steensels, Jan; Busschaert, Pieter; De Samblanx, Gorik; Marchal, Kathleen; Willems, Kris A.

    2014-01-01

    Brettanomyces yeasts, with the species Brettanomyces (Dekkera) bruxellensis being the most important one, are generally reported to be spoilage yeasts in the beer and wine industry due to the production of phenolic off flavors. However, B. bruxellensis is also known to be a beneficial contributor in certain fermentation processes, such as the production of certain specialty beers. Nevertheless, despite its economic importance, Brettanomyces yeasts remain poorly understood at the genetic and genomic levels. In this study, the genetic relationship between more than 50 Brettanomyces strains from all presently known species and from several sources was studied using a combination of DNA fingerprinting techniques. This revealed an intriguing correlation between the B. bruxellensis fingerprints and the respective isolation source. To further explore this relationship, we sequenced a (beneficial) beer isolate of B. bruxellensis (VIB X9085; ST05.12/22) and compared its genome sequence with the genome sequences of two wine spoilage strains (AWRI 1499 and CBS 2499). ST05.12/22 was found to be substantially different from both wine strains, especially at the level of single nucleotide polymorphisms (SNPs). In addition, there were major differences in the genome structures between the strains investigated, including the presence of large duplications and deletions. Gene content analysis revealed the presence of 20 genes which were present in both wine strains but absent in the beer strain, including many genes involved in carbon and nitrogen metabolism, and vice versa, no genes that were missing in both AWRI 1499 and CBS 2499 were found in ST05.12/22. Together, this study provides tools to discriminate Brettanomyces strains and provides a first glimpse at the genetic diversity and genome plasticity of B. bruxellensis. PMID:24814796

  6. Assessing genetic diversity among Brettanomyces yeasts by DNA fingerprinting and whole-genome sequencing.

    Science.gov (United States)

    Crauwels, Sam; Zhu, Bo; Steensels, Jan; Busschaert, Pieter; De Samblanx, Gorik; Marchal, Kathleen; Willems, Kris A; Verstrepen, Kevin J; Lievens, Bart

    2014-07-01

    Brettanomyces yeasts, with the species Brettanomyces (Dekkera) bruxellensis being the most important one, are generally reported to be spoilage yeasts in the beer and wine industry due to the production of phenolic off flavors. However, B. bruxellensis is also known to be a beneficial contributor in certain fermentation processes, such as the production of certain specialty beers. Nevertheless, despite its economic importance, Brettanomyces yeasts remain poorly understood at the genetic and genomic levels. In this study, the genetic relationship between more than 50 Brettanomyces strains from all presently known species and from several sources was studied using a combination of DNA fingerprinting techniques. This revealed an intriguing correlation between the B. bruxellensis fingerprints and the respective isolation source. To further explore this relationship, we sequenced a (beneficial) beer isolate of B. bruxellensis (VIB X9085; ST05.12/22) and compared its genome sequence with the genome sequences of two wine spoilage strains (AWRI 1499 and CBS 2499). ST05.12/22 was found to be substantially different from both wine strains, especially at the level of single nucleotide polymorphisms (SNPs). In addition, there were major differences in the genome structures between the strains investigated, including the presence of large duplications and deletions. Gene content analysis revealed the presence of 20 genes which were present in both wine strains but absent in the beer strain, including many genes involved in carbon and nitrogen metabolism, and vice versa, no genes that were missing in both AWRI 1499 and CBS 2499 were found in ST05.12/22. Together, this study provides tools to discriminate Brettanomyces strains and provides a first glimpse at the genetic diversity and genome plasticity of B. bruxellensis. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  7. Leadership and Organizational Tenure Diversity as Determinants of Project Team Effectiveness

    NARCIS (Netherlands)

    de Poel, Frouke M.; Stoker, Janka I.; Van der Zee, Karen I.

    2014-01-01

    The present study reveals how leadership effectiveness in project teams is dependent on the level of organizational tenure diversity. Data from 34 project teams showed that transformational leadership is related to organizational commitment, creative behavior, and job satisfaction, but only in teams

  8. Leadership and Organizational Tenure Diversity as Determinants of Project Team Effectiveness

    NARCIS (Netherlands)

    de Poel, Frouke M.; Stoker, Janka I.; Van der Zee, Karen I.

    2014-01-01

    The present study reveals how leadership effectiveness in project teams is dependent on the level of organizational tenure diversity. Data from 34 project teams showed that transformational leadership is related to organizational commitment, creative behavior, and job satisfaction, but only in teams

  9. REVISITING MOLECULAR CLONING TO SOLVE GENOME SEQUENCING PROJECT CONFLICTS

    National Research Council Canada - National Science Library

    Hugo A Barrera-Saldaña; Aarón Daniel Ramírez-Sánchez; Tiffany Editth Palacios-Tovar; Dionicio Aguirre-Treviño; Saúl Felipe Karr-de-León

    2017-01-01

    .... Molecular cloning was chosen as the most straight-forward strategy to solve the dilemma. The initial characterization of recombinant plasmids by restriction enzyme digestion confirmed the presence of two genomic sequences...

  10. Whole Genome Sequencing of Field Isolates Reveals Extensive Genetic Diversity in Plasmodium vivax from Colombia.

    Science.gov (United States)

    Winter, David J; Pacheco, M Andreína; Vallejo, Andres F; Schwartz, Rachel S; Arevalo-Herrera, Myriam; Herrera, Socrates; Cartwright, Reed A; Escalante, Ananias A

    2015-12-01

    Plasmodium vivax is the most prevalent malarial species in South America and exerts a substantial burden on the populations it affects. The control and eventual elimination of P. vivax are global health priorities. Genomic research contributes to this objective by improving our understanding of the biology of P. vivax and through the development of new genetic markers that can be used to monitor efforts to reduce malaria transmission. Here we analyze whole-genome data from eight field samples from a region in Córdoba, Colombia where malaria is endemic. We find considerable genetic diversity within this population, a result that contrasts with earlier studies suggesting that P. vivax had limited diversity in the Americas. We also identify a selective sweep around a substitution known to confer resistance to sulphadoxine-pyrimethamine (SP). This is the first observation of a selective sweep for SP resistance in this species. These results indicate that P. vivax has been exposed to SP pressure even when the drug is not in use as a first-line treatment for patients afflicted by this parasite. We identify multiple non-synonymous substitutions in three other genes known to be involved with drug resistance in Plasmodium species. Finally, we found extensive microsatellite polymorphisms. Using this information we developed 18 polymorphic and easy-to-score microsatellite loci that can be used in epidemiological investigations in South America.

  11. Whole genome sequence analysis of Cryptococcus gattii from the Pacific Northwest reveals unexpected diversity.

    Directory of Open Access Journals (Sweden)

    John D Gillece

    A recent emergence of Cryptococcus gattii in the Pacific Northwest involves strains that fall into three primarily clonal molecular subtypes: VGIIa, VGIIb and VGIIc. Multilocus sequence typing (MLST) and variable number tandem repeat analysis appear to identify little diversity within these molecular subtypes. Given the apparent expansion of these subtypes into new geographic areas and their ability to cause disease in immunocompetent individuals, differentiation of isolates belonging to these subtypes could be very important from a public health perspective. We used whole genome sequence typing (WGST) to perform fine-scale phylogenetic analysis on 20 C. gattii isolates, 18 of which are from the VGII molecular type largely responsible for the Pacific Northwest emergence. Analysis both including and excluding (289,586 SNPs and 56,845 SNPs, respectively) molecular types VGI and VGIII isolates resulted in phylogenetic reconstructions consistent, for the most part, with MLST analysis but with far greater resolution among isolates. The WGST analysis presented here resulted in identification of over 100 SNPs among eight VGIIc isolates as well as unique genotypes for each of the VGIIa, VGIIb and VGIIc isolates. Similar levels of genetic diversity were found within each of the molecular subtype isolates, despite the fact that the VGIIb clade is thought to have emerged much earlier. The analysis presented here is the first multi-genome WGST study to focus on the C. gattii molecular subtypes involved in the Pacific Northwest emergence and describes the tools that will further our understanding of this emerging pathogen.

  12. O admirável Projeto Genoma Humano / The Brave New Human Genome Project

    Directory of Open Access Journals (Sweden)

    Marilena V. Corrêa

    2002-12-01

    research. These problems raise challenges in terms of possible inequality in access to the benefits of research. On the other hand, we have the issue of genetic information and safeguarding individual data concerning the risks and susceptibilities to human diseases and characteristics. Defining men and women as a function of genetic traits poses a clear discriminatory threat and becomes even more acute as a function of the genetic reductionism propagated by the mass media. Answers to these problems cannot be expected only from bioethics. The bioethical approach should be combined with political analyses concerning reproduction, sexuality, health, and medicine. Such a vast range of problems cannot be discussed in depth in a single article. The choice was thus made to map them in the sense of emphasizing to what extent, in reflecting on the Genome Project, genomics, and post-genomics, the challenge is met to link such diverse aspects.

  13. Analysis of genotype diversity and evolution of Dengue virus serotype 2 using complete genomes

    Science.gov (United States)

    Waman, Vaishali P.; Kolekar, Pandurang; Ramtirthkar, Mukund R.; Kale, Mohan M.

    2016-01-01

    Background Dengue is one of the most common arboviral diseases prevalent worldwide and is caused by Dengue viruses (genus Flavivirus, family Flaviviridae). There are four serotypes of Dengue Virus (DENV-1 to DENV-4), each of which is further subdivided into distinct genotypes. DENV-2 is frequently associated with severe dengue infections and epidemics. DENV-2 consists of six genotypes such as Asian/American, Asian I, Asian II, Cosmopolitan, American and sylvatic. Comparative genomic study was carried out to infer population structure of DENV-2 and to analyze the role of evolutionary and spatiotemporal factors in emergence of diversifying lineages. Methods Complete genome sequences of 990 strains of DENV-2 were analyzed using Bayesian-based population genetics and phylogenetic approaches to infer genetically distinct lineages. The role of spatiotemporal factors, genetic recombination and selection pressure in the evolution of DENV-2 is examined using the sequence-based bioinformatics approaches. Results DENV-2 genetic structure is complex and consists of fifteen subpopulations/lineages. The Asian/American genotype is observed to be diversified into seven lineages. The Asian I, Cosmopolitan and sylvatic genotypes were found to be subdivided into two lineages, each. The populations of American and Asian II genotypes were observed to be homogeneous. Significant evidence of episodic positive selection was observed in all the genes, except NS4A. Positive selection operational on a few codons in envelope gene confers antigenic and lineage diversity in the American strains of Asian/American genotype. Selection on codons of non-structural genes was observed to impact diversification of lineages in Asian I, cosmopolitan and sylvatic genotypes. Evidence of intra/inter-genotype recombination was obtained and the uncertainty in classification of recombinant strains was resolved using the population genetics approach. 
Discussion Complete genome-based analysis revealed that the

  14. Analysis of genotype diversity and evolution of Dengue virus serotype 2 using complete genomes

    Directory of Open Access Journals (Sweden)

    Vaishali P. Waman

    2016-08-01

    Background Dengue is one of the most common arboviral diseases prevalent worldwide and is caused by Dengue viruses (genus Flavivirus, family Flaviviridae). There are four serotypes of Dengue Virus (DENV-1 to DENV-4), each of which is further subdivided into distinct genotypes. DENV-2 is frequently associated with severe dengue infections and epidemics. DENV-2 consists of six genotypes such as Asian/American, Asian I, Asian II, Cosmopolitan, American and sylvatic. Comparative genomic study was carried out to infer population structure of DENV-2 and to analyze the role of evolutionary and spatiotemporal factors in emergence of diversifying lineages. Methods Complete genome sequences of 990 strains of DENV-2 were analyzed using Bayesian-based population genetics and phylogenetic approaches to infer genetically distinct lineages. The role of spatiotemporal factors, genetic recombination and selection pressure in the evolution of DENV-2 is examined using the sequence-based bioinformatics approaches. Results DENV-2 genetic structure is complex and consists of fifteen subpopulations/lineages. The Asian/American genotype is observed to be diversified into seven lineages. The Asian I, Cosmopolitan and sylvatic genotypes were found to be subdivided into two lineages, each. The populations of American and Asian II genotypes were observed to be homogeneous. Significant evidence of episodic positive selection was observed in all the genes, except NS4A. Positive selection operational on a few codons in envelope gene confers antigenic and lineage diversity in the American strains of Asian/American genotype. Selection on codons of non-structural genes was observed to impact diversification of lineages in Asian I, cosmopolitan and sylvatic genotypes. Evidence of intra/inter-genotype recombination was obtained and the uncertainty in classification of recombinant strains was resolved using the population genetics approach.
Discussion Complete genome-based analysis

  15. Genomic diversity amongst Vibrio isolates from different sources determined by fluorescent amplified fragment length polymorphism.

    Science.gov (United States)

    Thompson, F L; Hoste, B; Vandemeulebroecke, K; Swings, J

    2001-12-01

    The genomic diversity among 506 strains of the family Vibrionaceae was analysed using fluorescent amplified fragment length polymorphism (FAFLP). Isolates were from different sources (e.g. fish, mollusc, shrimp, rotifers, artemia, and their culture water) in different countries, mainly from the aquacultural environment. Clustering of the FAFLP band patterns resulted in 69 clusters. A majority of the currently known species of the family Vibrionaceae formed separate clusters. Certain species, e.g. V. alginolyticus, V. cholerae, V. cincinnatiensis, V. diabolicus, V. diazotrophicus, V. harveyi, V. logei, V. natriegens, V. nereis, V. splendidus and V. tubiashii, were found to be ubiquitous, whereas V. halioticoli, V. ichthyoenteri, V. pectenicida and V. wodanis appear to be exclusively associated with a particular host or geographical region. Three main categories of isolates could be distinguished: (1) isolates with genomes related (i.e. with ≥45% FAFLP pattern similarity) to one of the known type strains; (2) isolates clustering (≥45% pattern similarity) with more than one type strain; (3) isolates with genomes unrelated (<45% pattern similarity) to any of the type strains. The latter group consisted of 236 isolates distributed in 31 clusters, indicating that many culturable taxa of the Vibrionaceae remain to be described.

  16. Diversity and genomic insights into the uncultured Chloroflexi from the human microbiota.

    Science.gov (United States)

    Campbell, Alisha G; Schwientek, Patrick; Vishnivetskaya, Tatiana; Woyke, Tanja; Levy, Shawn; Beall, Clifford J; Griffen, Ann; Leys, Eugene; Podar, Mircea

    2014-09-01

    Many microbial phyla that are widely distributed in open environments have few or no representatives within animal-associated microbiota. Among them, the Chloroflexi comprises taxonomically and physiologically diverse lineages adapted to a wide range of aquatic and terrestrial habitats. A distinct group of uncultured chloroflexi related to free-living anaerobic Anaerolineae inhabits the mammalian gastrointestinal tract and includes low-abundance human oral bacteria that appear to proliferate in periodontitis. Using a single-cell genomics approach, we obtained the first draft genomic reconstruction for these organisms and compared their inferred metabolic potential with free-living chloroflexi. Genomic data suggest that oral chloroflexi are anaerobic heterotrophs, encoding abundant carbohydrate transport and metabolism functionalities, similar to those seen in environmental Anaerolineae isolates. The presence of genes for a unique phosphotransferase system and N-acetylglucosamine metabolism suggests an important ecological niche for oral chloroflexi in scavenging material from lysed bacterial cells and the human tissue. The inferred ability to produce sialic acid for cell membrane decoration may enable them to evade the host defence system and colonize the subgingival space. As with other low abundance but persistent members of the microbiota, discerning community and host factors that influence the proliferation of oral chloroflexi may help understand the emergence of oral pathogens and the microbiota dynamics in health and disease states.

  17. Phylogeny of a genomically diverse group of Elymus (Poaceae) allopolyploids reveals multiple levels of reticulation.

    Directory of Open Access Journals (Sweden)

    Roberta J Mason-Gamer

    The grass tribe Triticeae (=Hordeeae) comprises only about 300 species, but it is well known for the economically important crop plants wheat, barley, and rye. The group is also recognized as a fascinating example of evolutionary complexity, with a history shaped by numerous events of auto- and allopolyploidy and apparent introgression involving diploids and polyploids. The genus Elymus comprises a heterogeneous collection of allopolyploid genome combinations, all of which include at least one set of homoeologs, designated St, derived from Pseudoroegneria. The current analysis includes a geographically and genomically diverse collection of 21 tetraploid Elymus species, and a single hexaploid species. Diploid and polyploid relationships were estimated using four molecular data sets, including one that combines two regions of the chloroplast genome, and three from unlinked nuclear genes: phosphoenolpyruvate carboxylase, β-amylase, and granule-bound starch synthase I. Four gene trees were generated using maximum likelihood, and the phylogenetic placement of the polyploid sequences reveals extensive reticulation beyond allopolyploidy alone. The trees were interpreted with reference to numerous phenomena known to complicate allopolyploid phylogenies, and introgression was identified as a major factor in their history. The work illustrates the interpretation of complicated phylogenetic results through the sequential consideration of numerous possible explanations, and the results highlight the value of careful inspection of multiple independent molecular phylogenetic estimates, with particular focus on the differences among them.

  18. Phylogeny of a genomically diverse group of Elymus (Poaceae) allopolyploids reveals multiple levels of reticulation.

    Science.gov (United States)

    Mason-Gamer, Roberta J

    2013-01-01

    The grass tribe Triticeae (=Hordeeae) comprises only about 300 species, but it is well known for the economically important crop plants wheat, barley, and rye. The group is also recognized as a fascinating example of evolutionary complexity, with a history shaped by numerous events of auto- and allopolyploidy and apparent introgression involving diploids and polyploids. The genus Elymus comprises a heterogeneous collection of allopolyploid genome combinations, all of which include at least one set of homoeologs, designated St, derived from Pseudoroegneria. The current analysis includes a geographically and genomically diverse collection of 21 tetraploid Elymus species, and a single hexaploid species. Diploid and polyploid relationships were estimated using four molecular data sets, including one that combines two regions of the chloroplast genome, and three from unlinked nuclear genes: phosphoenolpyruvate carboxylase, β-amylase, and granule-bound starch synthase I. Four gene trees were generated using maximum likelihood, and the phylogenetic placement of the polyploid sequences reveals extensive reticulation beyond allopolyploidy alone. The trees were interpreted with reference to numerous phenomena known to complicate allopolyploid phylogenies, and introgression was identified as a major factor in their history. The work illustrates the interpretation of complicated phylogenetic results through the sequential consideration of numerous possible explanations, and the results highlight the value of careful inspection of multiple independent molecular phylogenetic estimates, with particular focus on the differences among them.

  19. Genome-wide SNP analysis explains coral diversity and recovery in the Ryukyu Archipelago.

    Science.gov (United States)

    Shinzato, Chuya; Mungpakdee, Sutada; Arakaki, Nana; Satoh, Noriyuki

    2015-12-10

    Following a global coral bleaching event in 1998, Acropora corals surrounding most of Okinawa island (OI) were devastated, although they are now gradually recovering. In contrast, the Kerama Islands (KIs) only 30 km west of OI, have continuously hosted a great variety of healthy corals. Taking advantage of the decoded Acropora digitifera genome and using genome-wide SNP analyses, we clarified Acropora population structure in the southern Ryukyu Archipelago (sRA). Despite small genetic distances, we identified distinct clusters corresponding to specific island groups, suggesting infrequent long-distance dispersal within the sRA. Although the KIs were believed to supply coral larvae to OI, admixture analyses showed that such dispersal is much more limited than previously realized, indicating independent recovery of OI coral populations and the necessity of local conservation efforts for each region. We detected strong historical migration from the Yaeyama Islands (YIs) to OI, and suggest that the YIs are the original source of OI corals. In addition, migration edges to the KIs suggest that they are a historical sink population in the sRA, resulting in high diversity. This population genomics study provides the highest resolution data to date regarding coral population structure and history.

  20. Gene arrangement convergence, diverse intron content, and genetic code modifications in mitochondrial genomes of Sphaeropleales (Chlorophyta).

    Science.gov (United States)

    Fučíková, Karolina; Lewis, Paul O; González-Halphen, Diego; Lewis, Louise A

    2014-08-08

    The majority of our knowledge about mitochondrial genomes of Viridiplantae comes from land plants, but much less is known about their green algal relatives. In the green algal order Sphaeropleales (Chlorophyta), only one representative mitochondrial genome is currently available, that of Acutodesmus obliquus. Our study adds nine completely sequenced and three partially sequenced mitochondrial genomes spanning the phylogenetic diversity of Sphaeropleales. We show not only a size range of 25-53 kb and variation in intron content (0-11) and gene order but also conservation of 13 core respiratory genes and fragmented ribosomal RNA genes. We also report an unusual case of gene arrangement convergence in Neochloris aquatica, where the two rns fragments were secondarily placed in close proximity. Finally, we report the unprecedented usage of UCG as stop codon in Pseudomuriella schumacherensis. In addition, phylogenetic analyses of the mitochondrial protein-coding genes yield a fully resolved, well-supported phylogeny, showing promise for addressing systematic challenges in green algae.

  1. Genomic diversity among Beijing and non-Beijing Mycobacterium tuberculosis isolates from Myanmar.

    Directory of Open Access Journals (Sweden)

    Ruth Stavrum

    BACKGROUND: The Beijing family of Mycobacterium tuberculosis is dominant in countries in East Asia. Genomic polymorphisms are a source of diversity within the M. tuberculosis genome and may account for the variation of virulence among M. tuberculosis isolates. To date, no studies have examined the genomic composition of M. tuberculosis isolates from the high TB-burden country Myanmar. METHODOLOGY/PRINCIPAL FINDINGS: Twenty-two M. tuberculosis isolates from Myanmar were screened on whole-genome arrays containing genes from M. tuberculosis H37Rv, M. tuberculosis CDC1551 and M. bovis AF22197. Screening identified 198 deletions or extra regions in the clinical isolates compared to H37Rv. Twenty-two regions differentiated between Beijing and non-Beijing isolates and were verified by PCR on an additional 40 isolates. Six regions (Rv0071-0074 [RD105], Rv1572-1576c [RD149], Rv1585c-1587c [RD149], MT1798-Rv1755c [RD152], Rv1761c [RD152] and Rv0279c) were deleted in Beijing isolates, of which 4 (Rv1572-1576c, Rv1585c-1587c, MT1798-Rv1755c and Rv1761c) were variably deleted among ST42 isolates, indicating a closer relationship between the Beijing and ST42 lineages. The TbD1 region, Mb1582-Mb1583, was deleted in Beijing and ST42 isolates. One M. bovis gene of unknown function, Mb3184c, was present in all isolates, except 11 of 13 ST42 isolates. The CDC1551 gene MT1360, coding for a putative adenylate cyclase, was present in all Beijing and ST42 isolates (except 1). The pks15/1 gene, coding for a putative virulence factor, was intact in all Beijing and non-Beijing isolates, except in ST42 and ST53 isolates. CONCLUSION: This study describes previously unreported deletions/extra regions in Beijing and non-Beijing M. tuberculosis isolates. The modern and highly frequent ST42 lineage showed a closer relationship to the hypervirulent Beijing lineage than to the ancient non-Beijing lineages. The pks15/1 gene was disrupted only in modern non

  2. Effects of the Project Approach on Preschoolers with Diverse Abilities

    Science.gov (United States)

    Beneke, Sallee; Ostrosky, Michaelene M.

    2015-01-01

    Mixed methods were used to study the impact of the Project Approach, a curriculum component that can engage and motivate children to participate in learning activities, on the play behaviors and language development of preschoolers. Participants included 4 children with disabilities and 4 children identified as at-risk. Six adults received support…

  3. Analysis of the Relationship between Genome Diversity and Adult Survive Rate of Botryllus Schlosseri by AFLP

    Institute of Scientific and Technical Information of China (English)

    FENG Xiao-rong; ZHU Jian-zhong; DENG Feng-jiao; Jacob Douek; Baruch Rinkevich

    2004-01-01

    Objective: The self-cross colonial prochordate Botryllus schlosseri (B. schlosseri) occupies a key phylogenetic position in the evolution of vertebrates. To clarify the relationship between genome diversity and survival rate, five generations of B. schlosseri were investigated by amplified fragment length polymorphism (AFLP). Methods: AFLP markers are extremely sensitive to even small sequence variation, using PCR and high-resolution electrophoresis to examine restriction fragments. Results: AFLP polymorphism was high in the parent and lower in its F1, F2, F3 and F4. Each primer combination generated from 80 to more than 120 bands, of which on average 25.85% were polymorphic loci in the parent, 15.79% in F1, and 9.16% and 5.58% in F2 and F3. The AFLP markers were transmitted from F1 to F2, F3 and F4 and were inherited and segregated in the expected Mendelian ratio. However, some of the markers were lost in F2, F3 and F4 while they disappeared in their mother. In addition, gene mutations (new loci and lost loci) among F1, F2, F3 and F4 were observed. These special fragments were cloned and sequenced. Then, the genomic DNA was analyzed by Southern hybridization with probes from these specific fragments and the mechanism of gene mutation was clarified. Conclusion: These results suggest that there is a high frequency of polymorphic loci and mutation in the genome of B. schlosseri. Gene deletion or low diversity may be the reason for the high death rate of the offspring of inbred laboratory-reared strains.

  4. Remarkable variation in maize genome structure inferred from haplotype diversity at the bz locus.

    Science.gov (United States)

    Wang, Qinghua; Dooner, Hugo K

    2006-11-21

    Maize is probably the most diverse of all crop species. Unexpectedly large differences among haplotypes were first revealed in a comparison of the bz genomic regions of two different inbred lines, McC and B73. Retrotransposon clusters, which comprise most of the repetitive DNA in maize, varied markedly in makeup and location relative to the genes in the region, and genic sequences, later shown to be carried by two helitron transposons, also differed between the inbreds. Thus, the allelic bz regions of these Corn Belt inbreds shared only a minority of the total sequence. To investigate further the variation caused by retrotransposons, helitrons, and other insertions, we have analyzed the organization of the bz genomic region in five additional cultivars selected because of their geographic and genetic diversity: the inbreds A188, CML258, and I137TN, and the land races Coroico and NalTel. This vertical comparison has revealed the existence of several new helitrons, new retrotransposons, members of every superfamily of DNA transposons, numerous miniature elements, and novel insertions flanked at either end by TA repeats, which we call TAFTs (TA-flanked transposons). The extent of variation in the region is remarkable. In pairwise comparisons of eight bz haplotypes, the percentage of shared sequences ranges from 25% to 84%. Chimeric haplotypes were identified that combine retrotransposon clusters found in different haplotypes. We propose that recombination in the common gene space greatly amplifies the variability produced by the retrotransposition explosion in the maize ancestry, creating the heterogeneity in genome organization found in modern maize.

  5. Identification of genome-wide copy number variations among diverse pig breeds by array CGH

    Directory of Open Access Journals (Sweden)

    Li Yan

    2012-12-01

    Background Recent studies have shown that copy number variation (CNV) in mammalian genomes contributes to phenotypic diversity, including health and disease status. In domestic pigs, CNV has been catalogued by several reports, but the extent of CNV and the phenotypic effects are far from clear. The goal of this study was to identify CNV regions (CNVRs) in pigs based on array comparative genome hybridization (aCGH). Results Here a custom-made tiling oligo-nucleotide array was used with a median probe spacing of 2506 bp for screening 12 pigs including 3 Chinese native pigs (one Chinese Erhualian, one Tongcheng and one Yangxin pig), 5 European pigs (one Large White, one Pietrain, one White Duroc and two Landrace pigs), 2 synthetic pigs (Chinese new line DIV pigs) and 2 crossbred pigs (Landrace × DIV pigs) with a Duroc pig as the reference. Two hundred and fifty-nine CNVRs across chromosomes 1–18 and X were identified, with an average size of 65.07 kb and a median size of 98.74 kb, covering 16.85 Mb or 0.74% of the whole genome. Concerning copy number status, 93 (35.91%) CNVRs were called as gains, 140 (54.05%) were called as losses and the remaining 26 (10.04%) were called as both gains and losses. Of all detected CNVRs, 171 (66.02%) and 34 (13.13%) CNVRs directly overlapped with Sus scrofa duplicated sequences and pig QTLs, respectively. The CNVRs encompassed 372 full length Ensembl transcripts. Two CNVRs identified by aCGH were validated using real-time quantitative PCR (qPCR). Conclusions Using 720 K array CGH (aCGH) we described a map of porcine CNVs which facilitated the identification of structural variations for important phenotypes and the assessment of the genetic diversity of pigs.

  6. SkateBase, an elasmobranch genome project and collection of molecular resources for chondrichthyan fishes [v1; ref status: indexed, http://f1000r.es/445]

    Directory of Open Access Journals (Sweden)

    Jennifer Wyffels

    2014-08-01

    Chondrichthyan fishes are a diverse class of gnathostomes that provide a valuable perspective on fundamental characteristics shared by all jawed and limbed vertebrates. Studies of phylogeny, species diversity, population structure, conservation, and physiology are accelerated by genomic, transcriptomic and protein sequence data. These data are widely available for many sarcopterygii (coelacanth, lungfish and tetrapods) and actinopterygii (ray-finned fish including teleosts) taxa, but limited for chondrichthyan fishes. In this study, we summarize available data for chondrichthyes and describe resources for one of the largest projects to characterize one of these fish, Leucoraja erinacea, the little skate. SkateBase (http://skatebase.org) serves as the skate genome project portal linking data, research tools, and teaching resources.

  7. Dynamic simulation and optimization approach to construction diversion of hydraulic and hydroelectric projects

    Institute of Scientific and Technical Information of China (English)

    ZHONG DengHua; LI MingChao; HUANG Wei; LIU Yong

    2009-01-01

    To solve the engineering and scientific problems in construction diversion and its simulation analysis, a complete scheme is presented. Firstly, the complex constraint relationship was analyzed among main buildings, diversion buildings and flow control. Secondly, the time-space relationship model of construction diversion system and the general block diagram-oriented simulation model of diversion process were set up. Then, the corresponding numerical simulation method and 3D dynamic visual simulation method were put forward. Further, the simulation and optimization platform of construction diversion control process was developed, integrated with simulation modeling, computation and visualization. Finally, these methods were applied to a practical project successfully, showing that the modeling process is convenient, the computation and the visual analysis can be coupled effectively, and the results conform to practical state. They provide new theoretical principles and technical measures for analyzing the control problems encountered in construction diversion of hydraulic and hydroelectric engineering under complex conditions.

  8. Dynamic simulation and optimization approach to construction diversion of hydraulic and hydroelectric projects

    Institute of Scientific and Technical Information of China (English)

    2009-01-01

    To solve the engineering and scientific problems in construction diversion and its simulation analysis, a complete scheme is presented. Firstly, the complex constraint relationship was analyzed among main buildings, diversion buildings and flow control. Secondly, the time-space relationship model of construction diversion system and the general block diagram-oriented simulation model of diversion process were set up. Then, the corresponding numerical simulation method and 3D dynamic visual simulation method were put forward. Further, the simulation and optimization platform of construction diversion control process was developed, integrated with simulation modeling, computation and visualization. Finally, these methods were applied to a practical project successfully, showing that the modeling process is convenient, the computation and the visual analysis can be coupled effectively, and the results conform to practical state. They provide new theoretical principles and technical measures for analyzing the control problems encountered in construction diversion of hydraulic and hydroelectric engineering under complex conditions.

  9. Low frequency variants, collapsed based on biological knowledge, uncover complexity of population stratification in 1000 genomes project data.

    Directory of Open Access Journals (Sweden)

    Carrie B Moore

    Analyses investigating low frequency variants have the potential for explaining additional genetic heritability of many complex human traits. However, the natural frequencies of rare variation between human populations strongly confound genetic analyses. We have applied a novel collapsing method to identify biological features with low frequency variant burden differences in thirteen populations sequenced by the 1000 Genomes Project. Our flexible collapsing tool utilizes expert biological knowledge from multiple publicly available database sources to direct feature selection. Variants were collapsed according to genetically driven features, such as evolutionary conserved regions, regulatory regions, genes, and pathways. We have conducted an extensive comparison of low frequency variant burden differences (MAF<0.03) between populations from 1000 Genomes Project Phase I data. We found that on average 26.87% of gene bins, 35.47% of intergenic bins, 42.85% of pathway bins, 14.86% of ORegAnno regulatory bins, and 5.97% of evolutionary conserved regions show statistically significant differences in low frequency variant burden across populations from the 1000 Genomes Project. The proportion of bins with significant differences in low frequency burden depends on the ancestral similarity of the two populations compared and types of features tested. Even closely related populations had notable differences in low frequency burden, but fewer differences than populations from different continents. Furthermore, conserved or functionally relevant regions had fewer significant differences in low frequency burden than regions under less evolutionary constraint. This degree of low frequency variant differentiation across diverse populations and feature elements highlights the critical importance of considering population stratification in the new era of DNA sequencing and low frequency variant genomic analyses.

  10. The FlyBase database of the Drosophila genome projects and community literature

    Energy Technology Data Exchange (ETDEWEB)

    Gelbart, William; Bayraktaroglu, Leyla; Bettencourt, Brian; Campbell, Kathy; Crosby, Madeline; Emmert, David; Hradecky, Pavel; Huang, Yanmei; Letovsky, Stan; Matthews, Beverly; Russo, Susan; Schroeder, Andrew; Smutniak, Frank; Zhou, Pinglei; Zytkovicz, Mark; Ashburner, Michael; Drysdale, Rachel; de Grey, Aubrey; Foulger, Rebecca; Millburn, Gillian; Yamada, Chihiro; Kaufman, Thomas; Matthews, Kathy; Gilbert, Don; Grumbling, Gary; Strelets, Victor; Shemen, C.; Rubin, Gerald; Berman, Brian; Frise, Erwin; Gibson, Mark; Harris, Nomi; Kaminker, Josh; Lewis, Suzanna; Marshall, Brad; Misra, Sima; Mungall, Christopher; Prochnik, Simon; Richter, John; Smith, Christopher; Shu, ShengQiang; Tupy, Jonathan; Wiel, Colin

    2002-09-16

    FlyBase (http://flybase.bio.indiana.edu/) provides an integrated view of the fundamental genomic and genetic data on the major genetic model Drosophila melanogaster and related species. FlyBase has primary responsibility for the continual reannotation of the D. melanogaster genome. The ultimate goal of the reannotation effort is to decorate the euchromatic sequence of the genome with as much biological information as is available from the community and from the major genome project centers. A complete revision of the annotations of the now-finished euchromatic genomic sequence has been completed. There are many points of entry to the genome within FlyBase, most notably through maps, gene products and ontologies, structured phenotypic and gene expression data, and anatomy.

  11. Genome-wide detection of copy number variations among diverse horse breeds by array CGH.

    Science.gov (United States)

    Wang, Wei; Wang, Shenyuan; Hou, Chenglin; Xing, Yanping; Cao, Junwei; Wu, Kaifeng; Liu, Chunxia; Zhang, Dong; Zhang, Li; Zhang, Yanru; Zhou, Huanmin

    2014-01-01

    Recent studies have found that copy number variations (CNVs) are widespread in human and animal genomes. CNVs are a significant source of genetic variation, and have been shown to be associated with phenotypic diversity. However, the effect of CNVs on genetic variation in horses is not well understood. In the present study, CNVs in 6 different breeds of mare horses, Mongolia horse, Abaga horse, Hequ horse and Kazakh horse (all plateau breeds) and Debao pony and Thoroughbred, were determined using aCGH. In total, seven hundred CNVs were identified ranging in size from 6.1 Kb to 0.57 Mb across all autosomes, with an average size of 43.08 Kb and a median size of 15.11 Kb. By merging overlapping CNVs, we found a total of three hundred and fifty-three CNV regions (CNVRs). The length of the CNVRs ranged from 6.1 Kb to 1.45 Mb with average and median sizes of 38.49 Kb and 13.1 Kb. Collectively, 13.59 Mb of copy number variation was identified among the horses investigated and accounted for approximately 0.61% of the horse genome sequence. Five hundred and eighteen annotated genes were affected by CNVs, which corresponded to about 2.26% of all horse genes. Through the gene ontology (GO), genetic pathway analysis and comparison of CNV genes among different breeds, we found evidence that CNVs involving 7 genes may be related to the adaptation to severe environment of these plateau horses. This study is the first report of copy number variations in Chinese horses, which indicates that CNVs are ubiquitous in the horse genome and influence many biological processes of the horse. These results will be helpful not only in mapping the horse whole-genome CNVs, but also to further research for the adaption to the high altitude severe environment for plateau horses.
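
    The step of merging overlapping CNVs into CNV regions (CNVRs) is essentially interval merging per chromosome. A minimal sketch follows, assuming each CNV call is a (chromosome, start, end) tuple; the function name merge_cnvs and the coordinates are hypothetical and are not taken from the study.

```python
from typing import List, Tuple

# Hypothetical layout of a CNV call: (chromosome, start, end)
Interval = Tuple[str, int, int]


def merge_cnvs(cnvs: List[Interval]) -> List[Interval]:
    """Merge CNV calls that overlap on the same chromosome into CNVRs."""
    regions: List[Interval] = []
    # Sorting by (chromosome, start) puts overlapping calls next to each other.
    for chrom, start, end in sorted(cnvs):
        if regions and regions[-1][0] == chrom and start <= regions[-1][2]:
            # Overlaps the last region: extend its end if needed.
            prev_chrom, prev_start, prev_end = regions[-1]
            regions[-1] = (prev_chrom, prev_start, max(prev_end, end))
        else:
            regions.append((chrom, start, end))
    return regions


# Toy calls pooled across several hypothetical animals.
calls = [
    ("chr1", 100, 500),
    ("chr1", 400, 900),
    ("chr1", 2000, 2600),
    ("chr2", 100, 300),
]
cnvrs = merge_cnvs(calls)

# Total sequence covered by the merged regions.
total_bp = sum(end - start for _, start, end in cnvrs)
print(cnvrs, total_bp)
```

    Dividing the total covered length by the genome size would give a genome fraction comparable in kind to the 0.61% quoted above.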

  12. Diversity of eukaryotic DNA replication origins revealed by genome-wide analysis of chromatin structure.

    Directory of Open Access Journals (Sweden)

    Nicolas M Berbenetz

    2010-09-01

    Eukaryotic DNA replication origins differ both in their efficiency and in the characteristic time during S phase when they become active. The biological basis for these differences remains unknown, but they could be a consequence of chromatin structure. The availability of genome-wide maps of nucleosome positions has led to an explosion of information about how nucleosomes are assembled at transcription start sites, but no similar maps exist for DNA replication origins. Here we combine high-resolution genome-wide nucleosome maps with comprehensive annotations of DNA replication origins to identify patterns of nucleosome occupancy at eukaryotic replication origins. On average, replication origins contain a nucleosome depleted region centered next to the ACS element, flanked on both sides by arrays of well-positioned nucleosomes. Our analysis identified DNA sequence properties that correlate with nucleosome occupancy at replication origins genome-wide and that are correlated with the nucleosome-depleted region. Clustering analysis of all annotated replication origins revealed a surprising diversity of nucleosome occupancy patterns. We provide evidence that the origin recognition complex, which binds to the origin, acts as a barrier element to position and phase nucleosomes on both sides of the origin. Finally, analysis of chromatin reconstituted in vitro reveals that origins are inherently nucleosome depleted. Together our data provide a comprehensive, genome-wide view of chromatin structure at replication origins and suggest a model of nucleosome positioning at replication origins in which the underlying sequence occludes nucleosomes to permit binding of the origin recognition complex, which then (likely in concert with nucleosome modifiers and remodelers) positions nucleosomes adjacent to the origin to promote replication origin function.

  13. DNA variation of the mammalian major histocompatibility complex reflects genomic diversity and population history

    Energy Technology Data Exchange (ETDEWEB)

    Yuhki, Naoya; O'Brien, S.J. (National Cancer Institute, Frederick, MD (USA))

    1990-01-01

    The major histocompatibility complex (MHC) is a multigene complex of tightly linked homologous genes that encode cell surface antigens that play a key role in immune regulation and response to foreign antigens. In most species, MHC gene products display extreme antigenic polymorphism, and their variability has been interpreted to reflect an adaptive strategy for accommodating rapidly evolving infectious agents that periodically afflict natural populations. Determination of the extent of MHC variation has been limited to populations in which skin grafting is feasible or for which serological reagents have been developed. The authors present here a quantitative analysis of restriction fragment length polymorphism of MHC class I genes in several mammalian species (cats, rodents, humans) known to have very different levels of genetic diversity based on functional MHC assays and on allozyme surveys. When homologous class I probes were employed, a notable concordance was observed between the extent of MHC restriction fragment variation and functional MHC variation detected by skin grafts or genome-wide diversity estimated by allozyme screens. These results confirm the genetically depauperate character of the African cheetah, Acinonyx jubatus, and the Asiatic lion, Panthera leo persica; further, they support the use of class I MHC molecular reagents in estimating the extent and character of genetic diversity in natural populations.

  14. Distributing intelligence and organizing diversity in new media projects

    Directory of Open Access Journals (Sweden)

    Monique Girard

    2002-06-01

    This paper examines how web design firms in the new media industry probe and experiment with possible forms and sources of value giving shape to the new economy. Focusing on the collaborative engineering of cross-disciplinary web-design project teams, we examine how websites emerge as provisional settlements among the heterogeneous disciplines as they negotiate working compromises across competing performance criteria.

  15. Calculation of ecological compensation for water sources for water diversion projects

    Science.gov (United States)

    Su, H. B.; Zhang, T. M.; Hu, C. Y.; Long, L. Y.

    2016-08-01

    This study considers the compensation of water diversion projects for the values of the terrestrial biological resources, water environment, and aquatic biological resources in water sources. An analysis of capital dynamics was conducted, and the economic development coefficient was used to correct the current method for calculating ecological compensation. A model was constructed to calculate the ecological compensation for the water sources for water diversion projects. This model was used to calculate the ecological compensation for the Niulanjiang River provided by the Niulanjiang River to the Dianchi Lake water diversion project, which was calculated to be 136,799,400 RMB. As long as we know the occupying area of the project, the change of the river net flow after diversion and the local average GDP, the ecological compensation for water sources could be calculated by the model. The proposed model for calculating the ecological compensation for water sources is simple and incorporates the compensation provided by water diversion projects for the various environmental effects on water sources. It provides a guarantee for the capital to be used for the environmental protection of water sources and facilitates the sustainable development of the ecological environments of water sources.

  16. Genetic Diversity of Marine Anaerobic Ammonium-Oxidizing Bacteria as Revealed by Genomic and Proteomic Analyses of 'Candidatus Scalindua japonica'.

    Science.gov (United States)

    Oshiki, Mamoru; Mizuto, Keisuke; Kimura, Zenichiro; Kindaichi, Tomonori; Satoh, Hisashi; Okabe, Satoshi

    2017-09-11

    Anaerobic ammonium-oxidizing (anammox) bacteria affiliated with the genus 'Candidatus Scalindua' are responsible for significant nitrogen loss in oceans, and thus their ecophysiology is of great interest. Here, we enriched a marine anammox bacterium, 'Ca. S. japonica' from a Hiroshima bay sediment in Japan, and comparative genomic and proteomic analyses of 'Ca. S. japonica' were conducted. Sequence of the 4.81-Mb genome containing 4,019 coding regions of genes (CDSs) composed of 47 contigs was determined. In the proteome, 1,762 out of 4,019 CDSs in the 'Ca. S. japonica' genome were detected. Based on the genomic and proteomic data, the core anammox process and carbon fixation of 'Ca. S. japonica' were further investigated. Additionally, the present study provides the first detailed insights into the genetic background responsible for iron acquisition and menaquinone biosynthesis in anammox bacterial cells. Comparative analysis of the 'Ca. Scalindua' genomes revealed that the 1,502 genes found in the 'Ca. S. japonica' genome were not present in the 'Ca. S. profunda' and 'Ca. S. rubra' genomes, showing a high genomic diversity. This result may reflect a high phylogenetic diversity of the genus 'Ca. Scalindua'. This article is protected by copyright. All rights reserved. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.

  17. De novo assembly of soybean wild relatives for pan-genome analysis of diversity and agronomic traits.

    Science.gov (United States)

    Li, Ying-hui; Zhou, Guangyu; Ma, Jianxin; Jiang, Wenkai; Jin, Long-guo; Zhang, Zhouhao; Guo, Yong; Zhang, Jinbo; Sui, Yi; Zheng, Liangtao; Zhang, Shan-shan; Zuo, Qiyang; Shi, Xue-hui; Li, Yan-fei; Zhang, Wan-ke; Hu, Yiyao; Kong, Guanyi; Hong, Hui-long; Tan, Bing; Song, Jian; Liu, Zhang-xiong; Wang, Yaoshen; Ruan, Hang; Yeung, Carol K L; Liu, Jian; Wang, Hailong; Zhang, Li-juan; Guan, Rong-xia; Wang, Ke-jing; Li, Wen-bin; Chen, Shou-yi; Chang, Ru-zhen; Jiang, Zhi; Jackson, Scott A; Li, Ruiqiang; Qiu, Li-juan

    2014-10-01

    Wild relatives of crops are an important source of genetic diversity for agriculture, but their gene repertoire remains largely unexplored. We report the establishment and analysis of a pan-genome of Glycine soja, the wild relative of cultivated soybean Glycine max, by sequencing and de novo assembly of seven phylogenetically and geographically representative accessions. Intergenomic comparisons identified lineage-specific genes and genes with copy number variation or large-effect mutations, some of which show evidence of positive selection and may contribute to variation of agronomic traits such as biotic resistance, seed composition, flowering and maturity time, organ size and final biomass. Approximately 80% of the pan-genome was present in all seven accessions (core), whereas the rest was dispensable and exhibited greater variation than the core genome, perhaps reflecting a role in adaptation to diverse environments. This work will facilitate the harnessing of untapped genetic diversity from wild soybean for enhancement of elite cultivars.
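
    The core versus dispensable split described in this abstract reduces to set operations once each accession has a list of gene families. The sketch below assumes presence/absence calls are already available as sets; the accession names, gene identifiers, and the function name split_pan_genome are hypothetical, and three toy accessions stand in for the seven in the study.

```python
from typing import Dict, Set, Tuple


def split_pan_genome(gene_sets: Dict[str, Set[str]]) -> Tuple[Set[str], Set[str]]:
    """Return (core, dispensable) gene sets for a group of accessions."""
    pan = set().union(*gene_sets.values())           # found in at least one accession
    core = set.intersection(*gene_sets.values())     # found in every accession
    return core, pan - core


# Toy presence calls for three hypothetical accessions.
accessions = {
    "acc1": {"g1", "g2", "g3", "g4"},
    "acc2": {"g1", "g2", "g3", "g5"},
    "acc3": {"g1", "g2", "g3"},
}

core, dispensable = split_pan_genome(accessions)
core_fraction = len(core) / (len(core) + len(dispensable))
print(sorted(core), sorted(dispensable), core_fraction)
```

    With real data the same ratio would correspond to the "approximately 80%" core share reported above.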

  18. Selection for Silage Yield and Composition Did Not Affect Genomic Diversity Within the Wisconsin Quality Synthetic Maize Population

    Science.gov (United States)

    Lorenz, Aaron J.; Beissinger, Timothy M.; Silva, Renato Rodrigues; de Leon, Natalia

    2015-01-01

    Maize silage is forage of high quality and yield, and represents the second most important use of maize in the United States. The Wisconsin Quality Synthetic (WQS) maize population has undergone five cycles of recurrent selection for silage yield and composition, resulting in a genetically improved population. The application of high-density molecular markers allows breeders and geneticists to identify important loci through association analysis and selection mapping, as well as to monitor changes in the distribution of genetic diversity across the genome. The objectives of this study were to identify loci controlling variation for maize silage traits through association analysis and the assessment of selection signatures and to describe changes in the genomic distribution of gene diversity through selection and genetic drift in the WQS recurrent selection program. We failed to find any significant marker-trait associations using the historical phenotypic data from WQS breeding trials combined with 17,719 high-quality, informative single nucleotide polymorphisms. Likewise, no strong genomic signatures were left by selection on silage yield and quality in the WQS despite genetic gain for these traits. These results could be due to the genetic complexity underlying these traits, or the role of selection on standing genetic variation. Variation in loss of diversity through drift was observed across the genome. Some large regions experienced much greater loss in diversity than what is expected, suggesting limited recombination combined with small populations in recurrent selection programs could easily lead to fixation of large swaths of the genome. PMID:25645532

  19. Selection for silage yield and composition did not affect genomic diversity within the Wisconsin Quality Synthetic maize population.

    Science.gov (United States)

    Lorenz, Aaron J; Beissinger, Timothy M; Silva, Renato Rodrigues; de Leon, Natalia

    2015-02-02

    Maize silage is forage of high quality and yield, and represents the second most important use of maize in the United States. The Wisconsin Quality Synthetic (WQS) maize population has undergone five cycles of recurrent selection for silage yield and composition, resulting in a genetically improved population. The application of high-density molecular markers allows breeders and geneticists to identify important loci through association analysis and selection mapping, as well as to monitor changes in the distribution of genetic diversity across the genome. The objectives of this study were to identify loci controlling variation for maize silage traits through association analysis and the assessment of selection signatures and to describe changes in the genomic distribution of gene diversity through selection and genetic drift in the WQS recurrent selection program. We failed to find any significant marker-trait associations using the historical phenotypic data from WQS breeding trials combined with 17,719 high-quality, informative single nucleotide polymorphisms. Likewise, no strong genomic signatures were left by selection on silage yield and quality in the WQS despite genetic gain for these traits. These results could be due to the genetic complexity underlying these traits, or the role of selection on standing genetic variation. Variation in loss of diversity through drift was observed across the genome. Some large regions experienced much greater loss in diversity than what is expected, suggesting limited recombination combined with small populations in recurrent selection programs could easily lead to fixation of large swaths of the genome.

  20. Whole genome analysis of diverse Chlamydia trachomatis strains identifies phylogenetic relationships masked by current clinical typing

    Science.gov (United States)

    Harris, Simon R.; Clarke, Ian N.; Seth-Smith, Helena M. B.; Solomon, Anthony W.; Cutcliffe, Lesley T.; Marsh, Peter; Skilton, Rachel J.; Holland, Martin J.; Mabey, David; Peeling, Rosanna W.; Lewis, David A.; Spratt, Brian G.; Unemo, Magnus; Persson, Kenneth; Bjartling, Carina; Brunham, Robert; de Vries, Henry J.C.; Morré, Servaas A.; Speksnijder, Arjen; Bébéar, Cécile M.; Clerc, Maïté; de Barbeyrac, Bertille; Parkhill, Julian; Thomson, Nicholas R.

    2012-01-01

    Chlamydia trachomatis is responsible for both trachoma and sexually transmitted infections causing substantial morbidity and economic cost globally. Despite this, our knowledge of its population and evolutionary genetics is limited. Here we present a detailed whole genome phylogeny from representative strains of both trachoma and lymphogranuloma venereum (LGV) biovars from temporally and geographically diverse sources. Our analysis demonstrates that predicting phylogenetic structure using the ompA gene, traditionally used to classify Chlamydia, is misleading because extensive recombination in this region masks true relationships. We show that in many instances ompA is a chimera that can be exchanged in part or whole, both within and between biovars. We also provide evidence for exchange of, and recombination within, the cryptic plasmid, another important diagnostic target. We have used our phylogenetic framework to show how genetic exchange has manifested itself in ocular, urogenital and LGV C. trachomatis strains, including the epidemic LGV serotype L2b. PMID:22406642

  1. Genomic diversity and differentiation of a managed island wild boar population

    DEFF Research Database (Denmark)

    Iacolina, Laura; Scandura, Massimo; J. Goedbloed, Daniel;

    2016-01-01

    The evolution of island populations in natural systems is driven by local adaptation and genetic drift. However, evolutionary pathways may be altered by humans in several ways. The wild boar (WB) (Sus scrofa) is an iconic game species occurring in several islands, where it has been strongly managed...... since prehistoric times. We examined genomic diversity at 49 803 single-nucleotide polymorphisms in 99 Sardinian WBs and compared them with 196 wild specimens from mainland Europe and 105 domestic pigs (DP; 11 breeds). High levels of genetic variation were observed in Sardinia (80.9% of the total number...... of polymorphisms), which can be only in part associated to recent genetic introgression. Both Principal Component Analysis and Bayesian clustering approach revealed that the Sardinian WB population is highly differentiated from the other European populations (FST=0.126–0.138), and from DP (FST=0...

  2. The Nephila clavipes genome highlights the diversity of spider silk genes and their complex expression.

    Science.gov (United States)

    Babb, Paul L; Lahens, Nicholas F; Correa-Garhwal, Sandra M; Nicholson, David N; Kim, Eun Ji; Hogenesch, John B; Kuntner, Matjaž; Higgins, Linden; Hayashi, Cheryl Y; Agnarsson, Ingi; Voight, Benjamin F

    2017-06-01

    Spider silks are the toughest known biological materials, yet are lightweight and virtually invisible to the human immune system, and they thus have revolutionary potential for medicine and industry. Spider silks are largely composed of spidroins, a unique family of structural proteins. To investigate spidroin genes systematically, we constructed the first genome of an orb-weaving spider: the golden orb-weaver (Nephila clavipes), which builds large webs using an extensive repertoire of silks with diverse physical properties. We cataloged 28 Nephila spidroins, representing all known orb-weaver spidroin types, and identified 394 repeated coding motif variants and higher-order repetitive cassette structures unique to specific spidroins. Characterization of spidroin expression in distinct silk gland types indicates that glands can express multiple spidroin types. We find evidence of an alternatively spliced spidroin, a spidroin expressed only in venom glands, evolutionary mechanisms for spidroin diversification, and non-spidroin genes with expression patterns that suggest roles in silk production.

  3. Diversity and selective sweep in the OsAMT1;1 genomic region of rice

    Directory of Open Access Journals (Sweden)

    Chen Sheng

    2011-03-01

    Background: Ammonium is one of the major forms in which nitrogen is available for plant growth. OsAMT1;1 is a high-affinity ammonium transporter in rice (Oryza sativa L.), responsible for ammonium uptake at low nitrogen concentration. The expression pattern of the gene has been reported. However, variations in its nucleotides and the evolutionary pathway of its descent from wild progenitors are yet to be elucidated. In this study, nucleotide diversity of the gene OsAMT1;1 and the diversity pattern of seven gene fragments spanning a genomic region approximately 150 kb long surrounding the gene were surveyed by sequencing a panel of 216 rice accessions including both cultivated rice and wild relatives. Results: Nucleotide polymorphism (Pi) of OsAMT1;1 was as low as 0.00004 in cultivated rice (Oryza sativa), only 2.3% of that in the common wild rice (O. rufipogon). A single dominant haplotype was fixed at the locus in O. sativa. The test values for neutrality were significantly negative in the entire region stretching 5' upstream and 3' downstream of the gene in all accessions. The value of linkage disequilibrium remained high across a 100 kb genomic region around OsAMT1;1 in O. sativa, but fell rapidly in O. rufipogon on either side of the promoter of OsAMT1;1, demonstrating a strong natural selection within or nearby the ammonium transporter. Conclusions: The severe reduction in nucleotide variation at OsAMT1;1 in rice was caused by a selective sweep around OsAMT1;1, which may reflect the nitrogen uptake system under strong selection by the paddy soil during the domestication of rice. Purifying selection also occurred before the wild rice diverged into its two subspecies, namely indica and japonica. These findings would provide useful insights into the processes of evolution and domestication of nitrogen uptake genes in rice.

  4. When does activating diversity alleviate, when does it increase intergroup bias? An ingroup projection perspective

    Science.gov (United States)

    Steffens, Melanie C.; Reese, Gerhard; Ehrke, Franziska; Jonas, Kai J.

    2017-01-01

    The question how intergroup bias can be alleviated is of much theoretical and practical interest. Whereas diversity training and the multiculturalism ideology are two approaches prominent in practice, most theoretical models on reducing intergroup bias are based on social-identity theory and self-categorization theory. This social-identity perspective assumes that similar processes lead to intergroup bias in very different intergroup contexts if people identify with the respective social groups. A recent prominent model based on these theories is the ingroup-projection model. As this model assumes, an ingroup’s norms and standards are applied to outgroups included in a common superordinate category (this is called ingroup projection). Intergroup bias results because the outgroup fulfils these norms and standards less than the ingroup. Importantly, if the diversity of the superordinate category is induced as the norm, ingroup projection and thus intergroup bias should be reduced. The present research delineates and tests how general this process is. We propose that ingroup prototypicality is not only an outcome variable, as the ingroup-projection model originally assumes, but can also be an important moderator. We hypothesize that for members considering their ingroup highly prototypical (“pars pro toto”, large majorities), the superordinate group’s diversity may question their ingroup’s position and thus elicit threat and intergroup bias. In contrast, for members who consider their group as less prototypical (one among several, or “una inter pares” groups), activating diversity should, as originally assumed in the ingroup-projection model, reduce intergroup bias. Three experiments (total N = 345) supported these predictions in the contexts of groups defined by gender or nationality. Taken together, the ingroup-projection model can explain under which conditions activating superordinate-category diversity induces tolerance, and when it may backfire. We

  5. Functional Genomics of Novel Secondary Metabolites from Diverse Cyanobacteria Using Untargeted Metabolomics

    Directory of Open Access Journals (Sweden)

    Muriel Gugger

    2013-09-01

    Mass spectrometry-based metabolomics has become a powerful tool for the detection of metabolites in complex biological systems and for the identification of novel metabolites. We previously identified a number of unexpected metabolites in the cyanobacterium Synechococcus sp. PCC 7002, such as histidine betaine, its derivatives and several unusual oligosaccharides. To test for the presence of these compounds and to assess the diversity of small polar metabolites in other cyanobacteria, we profiled cell extracts of nine strains representing much of the morphological and evolutionary diversification of this phylum. Spectral features in raw metabolite profiles obtained by normal phase liquid chromatography coupled to mass spectrometry (MS) were manually curated so that chemical formulae of metabolites could be assigned. For putative identification, retention times and MS/MS spectra were cross-referenced with those of standards or available spectral library records. Overall, we detected 264 distinct metabolites. These included indeed different betaines, oligosaccharides as well as additional unidentified metabolites with chemical formulae not present in databases of metabolism. Some of these metabolites were detected only in a single strain, but some were present in more than one. Genomic interrogation of the strains revealed that generally, presence of a given metabolite corresponded well with the presence of its biosynthetic genes, if known. Our results show the potential of combining metabolite profiling and genomics for the identification of novel biosynthetic genes.

  6. Functional Genomics of Novel Secondary Metabolites from Diverse Cyanobacteria Using Untargeted Metabolomics

    Science.gov (United States)

    Baran, Richard; Ivanova, Natalia N.; Jose, Nick; Garcia-Pichel, Ferran; Kyrpides, Nikos C.; Gugger, Muriel; Northen, Trent R.

    2013-01-01

    Mass spectrometry-based metabolomics has become a powerful tool for the detection of metabolites in complex biological systems and for the identification of novel metabolites. We previously identified a number of unexpected metabolites in the cyanobacterium Synechococcus sp. PCC 7002, such as histidine betaine, its derivatives and several unusual oligosaccharides. To test for the presence of these compounds and to assess the diversity of small polar metabolites in other cyanobacteria, we profiled cell extracts of nine strains representing much of the morphological and evolutionary diversification of this phylum. Spectral features in raw metabolite profiles obtained by normal phase liquid chromatography coupled to mass spectrometry (MS) were manually curated so that chemical formulae of metabolites could be assigned. For putative identification, retention times and MS/MS spectra were cross-referenced with those of standards or available spectral library records. Overall, we detected 264 distinct metabolites. These included indeed different betaines, oligosaccharides as well as additional unidentified metabolites with chemical formulae not present in databases of metabolism. Some of these metabolites were detected only in a single strain, but some were present in more than one. Genomic interrogation of the strains revealed that generally, presence of a given metabolite corresponded well with the presence of its biosynthetic genes, if known. Our results show the potential of combining metabolite profiling and genomics for the identification of novel biosynthetic genes. PMID:24084783

  7. Genetic diversity of Greek Aegilops species using different types of nuclear genome markers.

    Science.gov (United States)

    Thomas, Konstantinos G; Bebeli, Penelope J

    2010-09-01

    Random Amplified Polymorphic DNA (RAPD) and Inter-Simple Sequence Repeat (ISSR) analyses were used to evaluate genetic variability and relationships of Greek Aegilops species. Thirty-eight accessions of seven Greek Aegilops species [Ae. triuncialis (genome UC), Ae. neglecta (UM), Ae. biuncialis (UM), Ae. caudata (C), Ae. comosa (M), Ae. geniculata (MU) and Ae. umbellulata (U)] as well as Triticum accessions were studied. Nineteen RAPD and ten ISSR primers yielded 344 and 170 polymorphic bands, respectively, that were used for the construction of dendrograms. Regardless of the similarity coefficient and marker type used, UPGMA placed 38 Aegilops accessions into one branch while the other branch consisted of wheat species. Within the Aegilops cluster, subgroups were identified that included species that shared the same genome or belonged to the same botanical section. Within the Triticum cluster, two robust subgroups were formed, one including diploid wheat and another including polyploid wheat. In conclusion, results showed that there is genetic diversity in the Greek Aegilops species studied, and clustering based on genetic similarities was in agreement with botanical classifications.
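
    Dendrograms of this kind are built from pairwise similarity between band profiles, where each accession is scored 1 or 0 for the presence of each polymorphic band. The abstract does not say which similarity coefficients were used, so the sketch below takes the Jaccard coefficient as an assumption; the profiles and accession labels are invented for illustration.

```python
from typing import Dict, List, Sequence, Tuple


def jaccard(a: Sequence[int], b: Sequence[int]) -> float:
    """Jaccard similarity of two 0/1 band profiles of equal length."""
    shared = sum(1 for x, y in zip(a, b) if x == 1 and y == 1)
    either = sum(1 for x, y in zip(a, b) if x == 1 or y == 1)
    return shared / either if either else 0.0


# Hypothetical band presence/absence profiles (1 = band present).
profiles: Dict[str, List[int]] = {
    "Aegilops_1": [1, 1, 0, 1, 0],
    "Aegilops_2": [1, 1, 0, 0, 0],
    "Triticum_1": [0, 0, 1, 0, 1],
}

# Pairwise similarity for every unordered pair of accessions.
names = sorted(profiles)
similarity: Dict[Tuple[str, str], float] = {
    (p, q): jaccard(profiles[p], profiles[q])
    for i, p in enumerate(names)
    for q in names[i + 1:]
}
for pair, value in similarity.items():
    print(pair, round(value, 3))
```

    A clustering method such as UPGMA would then operate on the corresponding distances (1 minus similarity), grouping the two Aegilops profiles before joining the Triticum one.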

  8. Physiological, genomic and transcriptional diversity in responses to boron deficiency in rapeseed genotypes

    Science.gov (United States)

    Hua, Yingpeng; Zhou, Ting; Ding, Guangda; Yang, Qingyong; Shi, Lei; Xu, Fangsen

    2016-01-01

    Allotetraploid rapeseed (Brassica napus L. AnAnCnCn, 2n=4x=38) is highly susceptible to boron (B) deficiency, a widespread limiting factor that causes severe losses in seed yield. The genetic variation in the sensitivity to B deficiency found in rapeseed genotypes emphasizes the complex response architecture. In this research, a B-inefficient genotype, ‘Westar 10’ (‘W10’), responded to B deficiencies during vegetative and reproductive development with an over-accumulation of reactive oxygen species, severe lipid peroxidation, evident plasmolysis, abnormal floral organogenesis, and widespread sterility compared to a B-efficient genotype, ‘Qingyou 10’ (‘QY10’). Whole-genome re-sequencing (WGS) of ‘QY10’ and ‘W10’ revealed a total of 1 605 747 single nucleotide polymorphisms and 218 755 insertions/deletions unevenly distributed across the allotetraploid rapeseed genome (~1130Mb). Digital gene expression (DGE) profiling identified more genes related to B transporters, antioxidant enzymes, and the maintenance of cell walls and membranes with higher transcript levels in the roots of ‘QY10’ than in ‘W10’ under B deficiency. Furthermore, based on WGS and bulked segregant analysis of the doubled haploid (DH) line pools derived from ‘QY10’ and ‘W10’, two significant quantitative trait loci (QTLs) for B efficiency were characterized on chromosome C2, and DGE-assisted QTL-seq analyses then identified a nodulin 26-like intrinsic protein gene and an ATP-binding cassette (ABC) transporter gene as the corresponding candidates regulating B efficiency. This research facilitates a more comprehensive understanding of the differential physiological and transcriptional responses to B deficiency and abundant genetic diversity in rapeseed genotypes, and the DGE-assisted QTL-seq analyses provide novel insights regarding the rapid dissection of quantitative trait genes in plant species with complex genomes. PMID:27639094

  9. Probing the diversity of chloromethane-degrading bacteria by comparative genomics and isotopic fractionation.

    Science.gov (United States)

    Nadalig, Thierry; Greule, Markus; Bringel, Françoise; Keppler, Frank; Vuilleumier, Stéphane

    2014-01-01

    Chloromethane (CH3Cl) is produced on earth by a variety of abiotic and biological processes. It is the most important halogenated trace gas in the atmosphere, where it contributes to ozone destruction. Current estimates of the global CH3Cl budget are uncertain and suggest that microorganisms might play a more important role in degrading atmospheric CH3Cl than previously thought. Its degradation by bacteria has been demonstrated in marine, terrestrial, and phyllospheric environments. Improving our knowledge of these degradation processes and their magnitude is thus highly relevant for a better understanding of the global budget of CH3Cl. The cmu pathway, for chloromethane utilisation, is the only microbial pathway for CH3Cl degradation elucidated so far, and was characterized in detail in aerobic methylotrophic Alphaproteobacteria. Here, we reveal the potential of using a two-pronged approach involving a combination of comparative genomics and isotopic fractionation during CH3Cl degradation to newly address the question of the diversity of chloromethane-degrading bacteria in the environment. Analysis of available bacterial genome sequences reveals that several bacteria not yet known to degrade CH3Cl contain part or all of the complement of cmu genes required for CH3Cl degradation. These organisms, unlike bacteria shown to grow with CH3Cl using the cmu pathway, are obligate anaerobes. On the other hand, analysis of the complete genome of the chloromethane-degrading bacterium Leisingera methylohalidivorans MB2 showed that this bacterium does not contain cmu genes. Isotope fractionation experiments with L. methylohalidivorans MB2 suggest that the unknown pathway used by this bacterium for growth with CH3Cl can be differentiated from the cmu pathway. This result opens the prospect that contributions from bacteria with the cmu and Leisingera-type pathways to the atmospheric CH3Cl budget may be teased apart in the future.

  10. Probing the diversity of chloromethane-degrading bacteria by comparative genomics and isotopic fractionation

    Directory of Open Access Journals (Sweden)

    Thierry Nadalig

    2014-10-01

    Chloromethane (CH3Cl) is produced on earth by a variety of abiotic and biological processes. It is the most important halogenated trace gas in the atmosphere, where it contributes to ozone destruction. Current estimates of the global CH3Cl budget are uncertain and suggest that microorganisms might play a more important role in degrading atmospheric CH3Cl than previously thought. Its degradation by bacteria has been demonstrated in marine, terrestrial and phyllospheric environments. Improving our knowledge of these degradation processes and their magnitude is thus highly relevant for a better understanding of the global budget of CH3Cl. The cmu pathway, for chloromethane utilisation, is the only microbial pathway for CH3Cl degradation elucidated so far, and was characterised in detail in aerobic methylotrophic Alphaproteobacteria. Here, we reveal the potential of using a two-pronged approach involving a combination of comparative genomics and isotopic fractionation during CH3Cl degradation to newly address the question of the diversity of chloromethane-degrading bacteria in the environment. Analysis of available bacterial genome sequences reveals that several bacteria not yet known to degrade CH3Cl contain part or all of the complement of cmu genes required for CH3Cl degradation. These organisms, unlike bacteria shown to grow with CH3Cl using the cmu pathway, are obligate anaerobes. On the other hand, analysis of the complete genome of the chloromethane-degrading bacterium Leisingera methylohalidivorans showed that this bacterium does not contain cmu genes. Isotope fractionation experiments with L. methylohalidivorans suggest that the unknown pathway used by this bacterium for growth with CH3Cl can be differentiated from the cmu pathway. This result opens the prospect that contributions from bacteria with the cmu and Leisingera-type pathways to the atmospheric CH3Cl budget may be teased apart in the future.

  11. Contrasting Genomic Diversity in Two Closely Related Postharvest Pathogens: Penicillium digitatum and Penicillium expansum.

    Science.gov (United States)

    Julca, Irene; Droby, Samir; Sela, Noa; Marcet-Houben, Marina; Gabaldón, Toni

    2015-12-14

    Penicillium digitatum and Penicillium expansum are two closely related fungal plant pathogens causing green and blue mold in harvested fruit, respectively. The two species differ in their host specificity, with P. digitatum restricted to citrus fruits and P. expansum able to infect a wide range of fruits after harvest. Although host-specific Penicillium species have been found to have a smaller gene content, it is so far unclear whether these different host specificities impact genome variation at the intraspecific level. Here we assessed genome variation across four P. digitatum and seven P. expansum isolates from geographically distant regions. Our results show very high similarity (average 0.06 SNPs [single nucleotide polymorphisms] per kb) between globally distributed isolates of P. digitatum, pointing to a recent expansion of a single lineage. This low level of genetic variation found in our samples contrasts with the higher genetic variability observed in the similarly distributed P. expansum isolates (2.44 SNPs per kb). Patterns of polymorphism in P. expansum indicate that recombination exists between genetically diverged strains. Consistent with the existence of sexual recombination and heterothallism, which was unknown for this species, we identified the two alternative mating types in different P. expansum isolates. Patterns of polymorphism in P. digitatum indicate a recent clonal population expansion of a single lineage that has reached worldwide distribution. We suggest that the contrasting patterns of genomic variation between the two species reflect underlying differences in population dynamics related to host specificities and related agricultural practices. It should be noted, however, that these results should be confirmed with a larger sampling of strains, as new strains may broaden the diversity so far found in P. digitatum.

  12. Human genome education model project. Ethical, legal, and social implications of the human genome project: Education of interdisciplinary professionals

    Energy Technology Data Exchange (ETDEWEB)

    Weiss, J.O. [Alliance of Genetic Support Groups, Chevy Chase, MD (United States)]; Lapham, E.V. [Georgetown Univ., Washington, DC (United States). Child Development Center]

    1996-12-31

    This meeting was held June 10, 1996 at Georgetown University. The purpose of this meeting was to provide a multidisciplinary forum for exchange of state-of-the-art information on the human genome education model. Topics of discussion include the following: psychosocial issues; ethical issues for professionals; legislative issues and update; and education issues.

  13. Losing identity: structural diversity of transposable elements belonging to different classes in the genome of Anopheles gambiae

    Directory of Open Access Journals (Sweden)

    Fernández-Medina Rita D

    2012-06-01

    Background: Transposable elements (TEs), both DNA transposons and retrotransposons, are genetic elements with the main characteristic of being able to mobilize and amplify their own representation within genomes, utilizing different mechanisms of transposition. An almost universal feature of TEs in eukaryotic genomes is their inability to transpose by themselves, mainly as the result of sequence degeneration (by either mutations or deletions). Most of the elements are thus either inactive or non-autonomous. Considering that the bulk of some eukaryotic genomes derive from TEs, they have been conceived as “TE graveyards.” It has been shown that once an element has been inactivated, it progressively accumulates mutations and deletions at neutral rates until completely losing its identity or being lost from the host genome; however, it has also been shown that these “neutral sequences” might serve as raw material for domestication by host genomes. Results: We have analyzed the sequence structural variations, nucleotide divergence, and pattern of insertions and deletions of several superfamilies of TEs belonging to both class I (long terminal repeats [LTRs] and non-LTRs [NLTRs]) and II in the genome of Anopheles gambiae, aiming at describing the landscape of deterioration of these elements in this particular genome. Our results describe a great diversity in patterns of deterioration, indicating lineage-specific differences including the presence of Solo-LTRs in the LTR lineage, 5′-deleted NLTRs, and several non-autonomous and MITEs in the class II families. Interestingly, we found fragments of NLTRs corresponding to the RT domain, which preserves high identity among them, suggesting a possible remaining genomic role for these domains. Conclusions: We show here that the TEs in the An. gambiae genome deteriorate in different ways according to the class to which they belong. This diversity certainly has implications not only at the host

  14. Wildlife Impact Assessment: Anderson Ranch, Black Canyon, and Boise Diversion Projects, Idaho. Final Report.

    Energy Technology Data Exchange (ETDEWEB)

    Meuleman, G. Allyn

    1986-05-01

    This report presents an analysis of impacts on wildlife and their habitats as a result of construction and operation of the US Bureau of Reclamation's Anderson Ranch, Black Canyon, and Boise Diversion Projects in Idaho. The objectives were to: (1) determine the probable impacts of development and operation of the Anderson Ranch, Black Canyon, and Boise Diversion Projects to wildlife and their habitats; (2) determine the wildlife and habitat impacts directly attributable to hydroelectric development and operation; (3) briefly identify the current major concerns for wildlife in the vicinities of the hydroelectric projects; and (4) provide for consultation and coordination with interested agencies, tribes, and other entities expressing interest in the project.

  15. Antigenic and genomic diversity of human rotavirus VP4 in two consecutive epidemic seasons in Mexico.

    Science.gov (United States)

    Padilla-Noriega, L; Méndez-Toss, M; Menchaca, G; Contreras, J F; Romero-Guido, P; Puerto, F I; Guiscafré, H; Mota, F; Herrera, I; Cedillo, R; Muñoz, O; Calva, J; Guerrero, M L; Coulson, B S; Greenberg, H B; López, S; Arias, C F

    1998-06-01

    In the present investigation we characterized the antigenic diversity of the VP4 and VP7 proteins in 309 and 261 human rotavirus strains isolated during two consecutive epidemic seasons, respectively, in three different regions of Mexico. G3 was found to be the prevalent VP7 serotype during the first year, being superseded by serotype G1 strains during the second season. To antigenically characterize the VP4 protein of the strains isolated, we used five neutralizing monoclonal antibodies (MAbs) which showed specificity for VP4 serotypes P1A, P1B, and P2 in earlier studies. Eight different patterns of reactivity with these MAbs were found, and the prevalence of three of these patterns varied from one season to the next. The P genotype of a subset of 52 samples was determined by PCR. Among the strains characterized as genotype P[4] and P[8] there were three and five different VP4 MAb reactivity patterns, respectively, indicating that the diversity of neutralization epitopes in VP4 is greater than that previously appreciated by the genomic typing methods.

  16. Implementing sponge physiological and genomic information to enhance the diversity of its culturable associated bacteria.

    Science.gov (United States)

    Lavy, Adi; Keren, Ray; Haber, Markus; Schwartz, Inbar; Ilan, Micha

    2014-02-01

    In recent years new approaches have emerged for culturing marine environmental bacteria. They include the use of novel culture media, sometimes with very low-nutrient content, and a variety of growth conditions such as temperature, oxygen levels, and different atmospheric pressures. These approaches have largely been neglected when it came to the cultivation of sponge-associated bacteria. Here, we used physiological and environmental conditions to reflect the environment of sponge-associated bacteria along with genomic data of the prominent sponge symbiont Candidatus Poribacteria sp. WGA-4E, to cultivate bacteria from the Red Sea sponge Theonella swinhoei. Designing culturing conditions to fit the metabolic needs of major bacterial taxa present in the sponge, through a combined use of diverse culture media compositions with aerobic and microaerophilic states, and addition of antibiotics, yielded higher diversity of the cultured bacteria and led to the isolation of novel sponge-associated and sponge-specific bacteria. In this work, 59 OTUs of six phyla were isolated. Of these, 22 have no close type strains at the species level (… bacteria species), and some are probably new genera and even families.

  17. A Genome-Scale Model of Shewanella piezotolerans Simulates Mechanisms of Metabolic Diversity and Energy Conservation.

    Science.gov (United States)

    Dufault-Thompson, Keith; Jian, Huahua; Cheng, Ruixue; Li, Jiefu; Wang, Fengping; Zhang, Ying

    2017-01-01

    Shewanella piezotolerans strain WP3 belongs to the group 1 branch of the Shewanella genus and is a piezotolerant and psychrotolerant species isolated from the deep sea. In this study, a genome-scale model was constructed for WP3 using a combination of genome annotation, ortholog mapping, and physiological verification. The metabolic reconstruction contained 806 genes, 653 metabolites, and 922 reactions, including central metabolic functions that represented nonhomologous replacements between the group 1 and group 2 Shewanella species. Metabolic simulations with the WP3 model demonstrated consistency with existing knowledge about the physiology of the organism. A comparison of model simulations with experimental measurements verified the predicted growth profiles under increasing concentrations of carbon sources. The WP3 model was applied to study mechanisms of anaerobic respiration through investigating energy conservation, redox balancing, and the generation of proton motive force. Despite being an obligate respiratory organism, WP3 was predicted to use substrate-level phosphorylation as the primary source of energy conservation under anaerobic conditions, a trait previously identified in other Shewanella species. Further investigation of the ATP synthase activity revealed a positive correlation between the availability of reducing equivalents in the cell and the directionality of the ATP synthase reaction flux. Comparison of the WP3 model with an existing model of a group 2 species, Shewanella oneidensis MR-1, revealed that the WP3 model demonstrated greater flexibility in ATP production under the anaerobic conditions. Such flexibility could be advantageous to WP3 for its adaptation to fluctuating availability of organic carbon sources in the deep sea. 
IMPORTANCE The well-studied nature of the metabolic diversity of Shewanella bacteria makes species from this genus a promising platform for investigating the evolution of carbon metabolism and energy conservation

  18. Human Genome Project: an attentive reading of the book of life?

    OpenAIRE

    2010-01-01

    The idea to sequence all 3 billion bases of the human genome started in the late 1980s and the project began in the early 1990s. In June 2000, the first "draft" was announced and in February 2001 the final sequence was published by Science and Nature. Many debates about the ethical, legal and social issues originated from the Human Genome Project. The main questions are: "who should have access to an individual's genetic information?"; "will the genetic information be used as a discrimination t...

  19. Crowdfunding the Azolla fern genome project: a grassroots approach.

    Science.gov (United States)

    Li, Fay-Wei; Pryer, Kathleen M

    2014-01-01

    Much of science progresses within the tight boundaries of what is often seen as a "black box". Though familiar to funding agencies, researchers and the academic journals they publish in, it is an entity that outsiders rarely get to peek into. Crowdfunding is a novel means that allows the public to participate in, as well as to support and witness advancements in science. Here we describe our recent crowdfunding efforts to sequence the Azolla genome, a little fern with massive green potential. Crowdfunding is a worthy platform not only for obtaining seed money for exploratory research, but also for engaging directly with the general public as a rewarding form of outreach.

  20. Clinal distribution of human genomic diversity across the Netherlands despite archaeological evidence for genetic discontinuities in Dutch population history

    NARCIS (Netherlands)

    O. Lao Grueso (Oscar); E. Altena (Eveline); C.R. Becker (Christian); S. Brauer (Silke); T. Kraaijenbrink (Thirsa); M. van Oven (Mannis); P. Nürnberg (Peter); P. de Knijff (Peter); M.H. Kayser (Manfred)

    2013-01-01

    Background: The presence of a southeast to northwest gradient across Europe in human genetic diversity is a well-established observation and has recently been confirmed by genome-wide single nucleotide polymorphism (SNP) data. This pattern is traditionally explained by major prehistoric

  1. Genetic diversity revealed by genomic-SSR and EST-SSR markers among common wheat, spelt and compactum

    Institute of Scientific and Technical Information of China (English)

    YANG Xinquan; LIU Peng; HAN Zongfu; NI Zhongfu; SUN Qixin

    2005-01-01

    In this study, two SSR molecular markers, named genomic-SSR and EST-SSR, are used to measure the genetic diversity among three hexaploid wheat populations, which include 28 common wheat (Triticum aestivum L.), 13 spelt (Triticum spelta L.), and 11 compactum (Triticum compactum Host.). The results show that common wheat has the highest genetic polymorphism, followed by spelt and then compactum. The mean genetic distance between the populations is higher than that within a population, and a similar tendency is detected for individual genomes A, B and D. Therefore, spelt and compactum can be used as potential germplasms for wheat breeding, especially for enriching the genetic variation in genome D. As compared with spelt, the genetic diversity between common wheat and compactum is much smaller, indicating a closer consanguine relationship between these two species. Although the polymorphism revealed by EST-SSR is lower than that by genomic-SSR, it can effectively differentiate diverse genotypes as well. Together with our present results, it is concluded that the EST-SSR marker is an ideal marker for assessing the genetic diversity in wheat. Meanwhile, the origin and evolution of hexaploid wheat are also analyzed and discussed.

  2. Exploring the diversity of Arcobacter spp. in cattle in the UK using MLST and whole genome sequencing

    Science.gov (United States)

    Arcobacter butzleri is considered to be an emerging human foodborne pathogen. The completion of an A. butzleri genome sequence along with microarray analysis of 13 isolates in 2007 revealed a surprising amount of diversity amongst A. butzleri isolates from humans, animals and food. In order to furth...

  3. Small RNA pathways and diversity in model legumes: lessons from genomics.

    Directory of Open Access Journals (Sweden)

    Pilar Bustos-Sanmamed

    2013-07-01

    Small non-coding RNAs (smRNA) participate in the regulation of development, cell differentiation, adaptation to environmental constraints and defense responses in plants. They negatively regulate gene expression by degrading specific mRNA targets, repressing their translation or modifying chromatin conformation through homologous interaction with target loci. MicroRNAs (miRNA) and short-interfering RNAs (siRNA) are generated from long double stranded RNA (dsRNA) that are cleaved into 20- to 24-nucleotide dsRNAs by RNase III proteins called DICERs (DCL). One strand of the duplex is then loaded onto effective complexes containing different ARGONAUTE (AGO) proteins. In this review, we explored smRNA diversity in model legumes and compiled available data from miRBase, the miRNA database, and from 22 reports of smRNA deep sequencing or miRNA identification genome-wide in Medicago truncatula, Glycine max and Lotus japonicus. In addition to conserved miRNAs present in other plant species, 229, 179 and 35 novel miRNA families were identified respectively in these 3 legumes, among which several seem legume-specific. New potential functions of several miRNAs in the legume-specific nodulation process are discussed. Furthermore, a new category of siRNA, the phased siRNAs, which seems to mainly regulate disease-resistance genes, was recently discovered in legumes. Although the genome sequences of model legumes are not yet fully completed, further analysis was performed by database mining of gene families and protein characteristics of DCLs and AGOs in these genomes. Although most components of the smRNA pathways are conserved, identifiable homologs of key smRNA players from non-legumes could not yet be detected in M. truncatula available genomic and expressed sequence databases. In addition, an important gene diversification was observed in the three legumes. Functional significance of these variant isoforms may reflect peculiarities of smRNA biogenesis in

  4. Accounting for a Diverse Forest Ownership Structure in Projections of Forest Sustainability Indicators

    Directory of Open Access Journals (Sweden)

    Jeannette Eggers

    2015-11-01

    In this study, we assessed the effect of a diverse ownership structure with different management strategies within and between owner categories in long-term projections of economic, ecological and social forest sustainability indicators, representing important ecosystem services, for two contrasting Swedish municipalities. This was done by comparing two scenarios: one where the diversity of management strategies was accounted for (Diverse) and one where it was not (Simple). The Diverse scenario resulted in a 14% lower total harvested volume for the 100-year period compared to the Simple scenario, which resulted in a higher growing stock and a more favorable development of the ecological indicators. The higher proportion of sparse forests and the lower proportion of clear-felled sites made the Diverse scenario more appropriate for delivering access to common outdoor recreation activities, while the Simple scenario projected more job opportunities. Differences between the scenarios were considerable already in the medium term (after 20 years of simulation). Our results highlight the importance of accounting for the variety of management strategies employed by forest owners in medium- to long-term projections of the development of forest sustainability indicators.

  5. Using Project-Based Learning and Google Docs to Support Diversity

    Science.gov (United States)

    Leh, Amy

    2014-01-01

    A graduate course, ETEC543 ("Technology and Learning I"), was revised to better serve a growing new student population, international students, in an academic program. Project-based learning, Google Docs, and instructional strategies fostering diversity and critical thinking were incorporated into the course redesign. Observations,…

  6. Productive and Inclusive? How Documentation Concealed Racialising Practices in a Diversity Project

    Science.gov (United States)

    Miller, Melinda G.

    2014-01-01

    This article examines how documentation concealed racialising practices in a diversity project that was seen to be productive and inclusive. Documentation examples are taken from a doctoral study about embedding Indigenous perspectives in early childhood education curricula in two Australian urban childcare centres. In place of reporting examples…

  7. Genotype Imputation for Latinos Using the HapMap and 1000 Genomes Project Reference Panels

    Directory of Open Access Journals (Sweden)

    Xiaoyi Gao

    2012-06-01

    Genotype imputation is a vital tool in genome-wide association studies (GWAS) and meta-analyses of multiple GWAS results. Imputation enables researchers to increase genomic coverage and to pool data generated using different genotyping platforms. HapMap samples are often employed as the reference panel. More recently, the 1000 Genomes Project resource is becoming the primary source for reference panels. Multiple GWAS and meta-analyses are targeting Latinos, the most populous and fastest growing minority group in the US. However, genotype imputation resources for Latinos are rather limited compared to individuals of European ancestry at present, largely because of the lack of good reference data. One choice of reference panel for Latinos is one derived from the population of Mexican individuals in Los Angeles contained in the HapMap Phase 3 project and the 1000 Genomes Project. However, a detailed evaluation of the quality of the imputed genotypes derived from the public reference panels has not yet been reported. Using simulation studies, the Illumina OmniExpress GWAS data from the Los Angeles Latino Eye Study and the MACH software package, we evaluated the accuracy of genotype imputation in Latinos. Our results show that the 1000 Genomes Project AMR+CEU+YRI reference panel provides the highest imputation accuracy for Latinos, and that also including Asian samples in the panel can reduce imputation accuracy. We also provide the imputation accuracy for each autosomal chromosome using the 1000 Genomes Project panel for Latinos. Our results serve as a guide to future imputation-based analysis in Latinos.

  8. Genotype Imputation for Latinos Using the HapMap and 1000 Genomes Project Reference Panels.

    Science.gov (United States)

    Gao, Xiaoyi; Haritunians, Talin; Marjoram, Paul; McKean-Cowdin, Roberta; Torres, Mina; Taylor, Kent D; Rotter, Jerome I; Gauderman, William J; Varma, Rohit

    2012-01-01

    Genotype imputation is a vital tool in genome-wide association studies (GWAS) and meta-analyses of multiple GWAS results. Imputation enables researchers to increase genomic coverage and to pool data generated using different genotyping platforms. HapMap samples are often employed as the reference panel. More recently, the 1000 Genomes Project resource is becoming the primary source for reference panels. Multiple GWAS and meta-analyses are targeting Latinos, the most populous and fastest growing minority group in the US. However, genotype imputation resources for Latinos are rather limited compared to individuals of European ancestry at present, largely because of the lack of good reference data. One choice of reference panel for Latinos is one derived from the population of Mexican individuals in Los Angeles contained in the HapMap Phase 3 project and the 1000 Genomes Project. However, a detailed evaluation of the quality of the imputed genotypes derived from the public reference panels has not yet been reported. Using simulation studies, the Illumina OmniExpress GWAS data from the Los Angeles Latino Eye Study and the MACH software package, we evaluated the accuracy of genotype imputation in Latinos. Our results show that the 1000 Genomes Project AMR + CEU + YRI reference panel provides the highest imputation accuracy for Latinos, and that also including Asian samples in the panel can reduce imputation accuracy. We also provide the imputation accuracy for each autosomal chromosome using the 1000 Genomes Project panel for Latinos. Our results serve as a guide to future imputation-based analysis in Latinos.
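
    The abstract does not say which accuracy measure was used, so the following is only an illustrative sketch of two measures commonly applied when genotypes are masked and then imputed: the concordance of best-guess calls and the squared correlation between true allele counts and imputed dosages. The function names and the toy data are hypothetical and are not taken from the study or from the MACH package.

```python
def concordance(true_genotypes, imputed_genotypes):
    """Fraction of masked genotypes whose best-guess imputed call matches the truth."""
    if len(true_genotypes) != len(imputed_genotypes) or not true_genotypes:
        raise ValueError("inputs must be non-empty and of equal length")
    matches = sum(t == i for t, i in zip(true_genotypes, imputed_genotypes))
    return matches / len(true_genotypes)


def dosage_r2(true_genotypes, dosages):
    """Squared Pearson correlation between true allele counts and imputed dosages."""
    if len(true_genotypes) != len(dosages) or not true_genotypes:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(true_genotypes)
    mean_t = sum(true_genotypes) / n
    mean_d = sum(dosages) / n
    cov = sum((t - mean_t) * (d - mean_d) for t, d in zip(true_genotypes, dosages))
    var_t = sum((t - mean_t) ** 2 for t in true_genotypes)
    var_d = sum((d - mean_d) ** 2 for d in dosages)
    if var_t == 0 or var_d == 0:
        return float("nan")  # undefined for a monomorphic marker
    return (cov * cov) / (var_t * var_d)


# Hypothetical toy data: allele counts (0, 1 or 2) for six masked genotypes.
true_g = [0, 1, 2, 1, 0, 2]
best_guess = [0, 1, 2, 0, 0, 2]
dosages = [0.1, 0.9, 1.8, 0.6, 0.0, 2.0]

print(f"concordance: {concordance(true_g, best_guess):.3f}")
print(f"dosage r^2:  {dosage_r2(true_g, dosages):.3f}")
```

    Computing either measure separately for each candidate reference panel, or for each autosome, would give the kind of comparison the abstract describes, under the assumption that a comparable metric was used.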

  9. Comparative Genomics Reveals the Diversity of Restriction-Modification Systems and DNA Methylation Sites in Listeria monocytogenes.

    Science.gov (United States)

    Chen, Poyin; den Bakker, Henk C; Korlach, Jonas; Kong, Nguyet; Storey, Dylan B; Paxinos, Ellen E; Ashby, Meredith; Clark, Tyson; Luong, Khai; Wiedmann, Martin; Weimer, Bart C

    2017-02-01

    Listeria monocytogenes is a bacterial pathogen that is found in a wide variety of anthropogenic and natural environments. Genome sequencing technologies are rapidly becoming a powerful tool in facilitating our understanding of how genotype, classification phenotypes, and virulence phenotypes interact to predict the health risks of individual bacterial isolates. Currently, 57 closed L. monocytogenes genomes are publicly available, representing three of the four phylogenetic lineages, and they suggest that L. monocytogenes has high genomic synteny. This study contributes an additional 15 closed L. monocytogenes genomes that were used to determine the associations between the genome and methylome with host invasion magnitude. In contrast to previous findings, large chromosomal inversions and rearrangements were detected in five isolates at the chromosome terminus and within rRNA genes, including a previously undescribed inversion within rRNA-encoding regions. Each isolate's epigenome contained highly diverse methyltransferase recognition sites, even within the same serotype and methylation pattern. Eleven strains contained a single chromosomally encoded methyltransferase, one strain contained two methylation systems (one system on a plasmid), and three strains exhibited no methylation, despite the occurrence of methyltransferase genes. In three isolates a new, unknown DNA modification was observed in addition to diverse methylation patterns, accompanied by a novel methylation system. Neither chromosome rearrangement nor strain-specific patterns of epigenome modification observed within virulence genes were correlated with serotype designation, clonal complex, or in vitro infectivity. These data suggest that genome diversity is larger than previously considered in L. monocytogenes and that as more genomes are sequenced, additional structure and methylation novelty will be observed in this organism.

  10. Exceptional diversity, non-random distribution, and rapid evolution of retroelements in the B73 maize genome.

    Directory of Open Access Journals (Sweden)

    Regina S Baucom

    2009-11-01

    Recent comprehensive sequence analysis of the maize genome now permits detailed discovery and description of all transposable elements (TEs) in this complex nuclear environment. Reiteratively optimized structural and homology criteria were used in the computer-assisted search for retroelements, TEs that transpose by reverse transcription of an RNA intermediate, with the final results verified by manual inspection. Retroelements were found to occupy the majority (>75%) of the nuclear genome in maize inbred B73. Unprecedented genetic diversity was discovered in the long terminal repeat (LTR) retrotransposon class of retroelements, with >400 families (>350 newly discovered) contributing >31,000 intact elements. The two other classes of retroelements, SINEs (four families) and LINEs (at least 30 families), were observed to contribute 1,991 and approximately 35,000 copies, respectively, or a combined approximately 1% of the B73 nuclear genome. With regard to fully intact elements, the median copy number for all retroelement families in maize was 2 because >250 LTR retrotransposon families contained only one or two intact members that could be detected in the B73 draft sequence. The majority, perhaps all, of the investigated retroelement families exhibited non-random dispersal across the maize genome, with LINEs, SINEs, and many low-copy-number LTR retrotransposons exhibiting a bias for accumulation in gene-rich regions. In contrast, most (but not all) medium- and high-copy-number LTR retrotransposons were found to preferentially accumulate in gene-poor regions like pericentromeric heterochromatin, while a few high-copy-number families exhibited the opposite bias. Regions of the genome with the highest LTR retrotransposon density contained the lowest LTR retrotransposon diversity. These results indicate that the maize genome provides a great number of different niches for the survival and procreation of a great variety of retroelements that have evolved to

  11. Exceptional diversity, non-random distribution, and rapid evolution of retroelements in the B73 maize genome.

    Science.gov (United States)

    Baucom, Regina S; Estill, James C; Chaparro, Cristian; Upshaw, Naadira; Jogi, Ansuya; Deragon, Jean-Marc; Westerman, Richard P; Sanmiguel, Phillip J; Bennetzen, Jeffrey L

    2009-11-01

    Recent comprehensive sequence analysis of the maize genome now permits detailed discovery and description of all transposable elements (TEs) in this complex nuclear environment. Reiteratively optimized structural and homology criteria were used in the computer-assisted search for retroelements, TEs that transpose by reverse transcription of an RNA intermediate, with the final results verified by manual inspection. Retroelements were found to occupy the majority (>75%) of the nuclear genome in maize inbred B73. Unprecedented genetic diversity was discovered in the long terminal repeat (LTR) retrotransposon class of retroelements, with >400 families (>350 newly discovered) contributing >31,000 intact elements. The two other classes of retroelements, SINEs (four families) and LINEs (at least 30 families), were observed to contribute 1,991 and approximately 35,000 copies, respectively, or a combined approximately 1% of the B73 nuclear genome. With regard to fully intact elements, median copy numbers for all retroelement families in maize was 2 because >250 LTR retrotransposon families contained only one or two intact members that could be detected in the B73 draft sequence. The majority, perhaps all, of the investigated retroelement families exhibited non-random dispersal across the maize genome, with LINEs, SINEs, and many low-copy-number LTR retrotransposons exhibiting a bias for accumulation in gene-rich regions. In contrast, most (but not all) medium- and high-copy-number LTR retrotransposons were found to preferentially accumulate in gene-poor regions like pericentromeric heterochromatin, while a few high-copy-number families exhibited the opposite bias. Regions of the genome with the highest LTR retrotransposon density contained the lowest LTR retrotransposon diversity. These results indicate that the maize genome provides a great number of different niches for the survival and procreation of a great variety of retroelements that have evolved to differentially

  12. Comparative genomics of Brachyspira pilosicoli strains: genome rearrangements, reductions and correlation of genetic compliment with phenotypic diversity

    Directory of Open Access Journals (Sweden)

    Mappley Luke J

    2012-09-01

    Full Text Available Background: The anaerobic spirochaete Brachyspira pilosicoli causes enteric disease in avian, porcine and human hosts, amongst others. To date, the only available genome sequence of B. pilosicoli is that of strain 95/1000, a porcine isolate. In the first intra-species genome comparison within the Brachyspira genus, we report the whole genome sequence of B. pilosicoli B2904, an avian isolate, the incomplete genome sequence of B. pilosicoli WesB, a human isolate, and the comparisons with B. pilosicoli 95/1000. We also draw on incomplete genome sequences from three other Brachyspira species. Finally we report the first application of the high-throughput Biolog phenotype screening tool on the B. pilosicoli strains for detailed comparisons between genotype and phenotype. Results: Feature and sequence genome comparisons revealed a high degree of similarity between the three B. pilosicoli strains, although the genomes of B2904 and WesB were larger than that of 95/1000 (~2.765, 2.890 and 2.596 Mb, respectively). Genome rearrangements were observed which correlated largely with the positions of mobile genetic elements. Through comparison of the B2904 and WesB genomes with the 95/1000 genome, features that we propose are non-essential due to their absence from 95/1000 include a peptidase, glycine reductase complex components and transposases. Novel bacteriophages were detected in the newly-sequenced genomes, which appeared to have involvement in intra- and inter-species horizontal gene transfer. Phenotypic differences predicted from genome analysis, such as the lack of genes for glucuronate catabolism in 95/1000, were confirmed by phenotyping. Conclusions: The availability of multiple B. pilosicoli genome sequences has allowed us to demonstrate the substantial genomic variation that exists between these strains, and provides an insight into genetic events that are shaping the species. In addition, phenotype screening allowed determination of how

  13. Genomic Encyclopedia of Fungi

    Energy Technology Data Exchange (ETDEWEB)

    Grigoriev, Igor

    2012-08-10

    Genomes of fungi relevant to energy and environment are the focus of the Fungal Genomics Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 150 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web portal that integrates sequence and functional data with genome analysis tools for the user community. Sequence analysis supported by functional genomics leads to the development of parts lists for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.

  14. JGI Fungal Genomics Program

    Energy Technology Data Exchange (ETDEWEB)

    Grigoriev, Igor V.

    2011-03-14

    Genomes of energy and environment fungi are the focus of the Fungal Genomics Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 50 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web portal that integrates sequence and functional data with genome analysis tools for the user community. Sequence analysis supported by functional genomics leads to the development of parts lists for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such 'parts' suggested by comparative genomics and functional analysis in these areas are presented here.

  15. The Chlamydia suis Genome Exhibits High Levels of Diversity, Plasticity, and Mobile Antibiotic Resistance: Comparative Genomics of a Recent Livestock Cohort Shows Influence of Treatment Regimes

    Science.gov (United States)

    Wanninger, Sabrina; Bachmann, Nathan; Marti, Hanna; Qi, Weihong; Donati, Manuela; di Francesco, Antonietta; Polkinghorne, Adam; Borel, Nicole

    2017-01-01

    Chlamydia suis is an endemic pig pathogen, belonging to a fascinating genus of obligate intracellular pathogens. Of particular interest, this is the only chlamydial species to have naturally acquired genes encoding for tetracycline resistance. To date, the distribution and mobility of the Tet-island are not well understood. Our study focused on whole genome sequencing of 29 C. suis isolates from a recent porcine cohort within Switzerland, combined with data from USA tetracycline-resistant isolates. Our findings show that the genome of C. suis is very plastic, with unprecedented diversity, highly affected by recombination and plasmid exchange. A large diversity of isolates circulates within Europe, even within individual Swiss farms, suggesting that C. suis originated around Europe. New World isolates have more restricted diversity and appear to derive from European isolates, indicating that historical strain transfers to the United States have occurred. The architecture of the Tet-island is variable, but the tetA(C) gene is always intact, and recombination has been a major factor in its transmission within C. suis. Selective pressure from tetracycline use within pigs leads to a higher number of Tet-island carrying isolates, which appear to be lost in the absence of such pressure, whereas the loss or gain of the Tet-island from individual strains is not observed. The Tet-island appears to be a recent import into the genome of C. suis, with a possible American origin. PMID:28338777

  16. Genomic comparison of multi-drug resistant invasive and colonizing Acinetobacter baumannii isolated from diverse human body sites reveals genomic plasticity

    Directory of Open Access Journals (Sweden)

    Hsiao William W

    2011-06-01

    Full Text Available Background: Acinetobacter baumannii has recently emerged as a significant global pathogen, with a surprisingly rapid acquisition of antibiotic resistance and spread within hospitals and health care institutions. This study examines the genomic content of three A. baumannii strains isolated from distinct body sites. Isolates from blood, peri-anal, and wound sources were examined in an attempt to identify genetic features that could be correlated to each isolation source. Results: Pulsed-field gel electrophoresis, multi-locus sequence typing and antibiotic resistance profiles demonstrated genotypic and phenotypic variation. Each isolate was sequenced to high-quality draft status, which allowed for comparative genomic analyses with existing A. baumannii genomes. A high resolution, whole genome alignment method detailed the phylogenetic relationships of sequenced A. baumannii and found no correlation between phylogeny and body site of isolation. This method identified genomic regions unique to both those isolates found on the surface of the skin or in wounds, termed colonization isolates, and those identified from body fluids, termed invasive isolates; these regions may play a role in the pathogenesis and spread of this important pathogen. A PCR-based screen of 74 A. baumannii isolates demonstrated that these unique genes are not exclusive to either phenotype or isolation source; however, a conserved genomic region exclusive to all sequenced A. baumannii was identified and verified. Conclusions: The results of the comparative genome analysis and PCR assay show that A. baumannii is a diverse and genomically variable pathogen that appears to have the potential to cause a range of human disease regardless of the isolation source.

  17. Reflections on Mental Retardation and Eugenics, Old and New: Mensa and the Human Genome Project.

    Science.gov (United States)

    Smith, J. David

    1994-01-01

    This article addresses the moral and ethical issues of mental retardation and a continuing legacy of belief in eugenics. It discusses the involuntary sterilization of Carrie Buck in 1927, support for legalized killing of subnormal infants by 47% of respondents to a Mensa survey, and implications of the Human Genome Project for the field of mental…

  18. Democratizing Human Genome Project Information: A Model Program for Education, Information and Debate in Public Libraries.

    Science.gov (United States)

    Pollack, Miriam

    The "Mapping the Human Genome" project demonstrated that librarians can help whomever they serve in accessing information resources in the areas of biological and health information, whether it is the scientists who are developing the information or a member of the public who is using the information. Public libraries can guide library…

  20. The Human Genome Project and Eugenics: Identifying the Impact on Individuals with Mental Retardation.

    Science.gov (United States)

    Kuna, Jason

    2001-01-01

    This article explores the impact of the mapping work of the Human Genome Project on individuals with mental retardation and the negative effects of genetic testing. The potential to identify disabilities and the concept of eugenics are discussed, along with ethical issues surrounding potential genetic therapies. (Contains references.) (CR)

  1. A late origin of the extant eukaryotic diversity: divergence time estimates using rare genomic changes

    Directory of Open Access Journals (Sweden)

    Koonin Eugene V

    2011-05-01

    eukaryotes that is open to comparative-genomic study probably was preceded by hundreds of millions of years of evolution that might have included extinct diversity inaccessible to comparative approaches. Reviewers: This article was reviewed by William Martin, Herve Philippe (nominated by I. King Jordan), and Romain Derelle.

  2. Comparative genomics of plant-associated Pseudomonas spp.: insights into diversity and inheritance of traits involved in multitrophic interactions.

    Science.gov (United States)

    Loper, Joyce E; Hassan, Karl A; Mavrodi, Dmitri V; Davis, Edward W; Lim, Chee Kent; Shaffer, Brenda T; Elbourne, Liam D H; Stockwell, Virginia O; Hartney, Sierra L; Breakwell, Katy; Henkels, Marcella D; Tetu, Sasha G; Rangel, Lorena I; Kidarsa, Teresa A; Wilson, Neil L; van de Mortel, Judith E; Song, Chunxu; Blumhagen, Rachel; Radune, Diana; Hostetler, Jessica B; Brinkac, Lauren M; Durkin, A Scott; Kluepfel, Daniel A; Wechter, W Patrick; Anderson, Anne J; Kim, Young Cheol; Pierson, Leland S; Pierson, Elizabeth A; Lindow, Steven E; Kobayashi, Donald Y; Raaijmakers, Jos M; Weller, David M; Thomashow, Linda S; Allen, Andrew E; Paulsen, Ian T

    2012-07-01

    We provide here a comparative genome analysis of ten strains within the Pseudomonas fluorescens group including seven new genomic sequences. These strains exhibit a diverse spectrum of traits involved in biological control and other multitrophic interactions with plants, microbes, and insects. Multilocus sequence analysis placed the strains in three sub-clades, which was reinforced by high levels of synteny, size of core genomes, and relatedness of orthologous genes between strains within a sub-clade. The heterogeneity of the P. fluorescens group was reflected in the large size of its pan-genome, which makes up approximately 54% of the pan-genome of the genus as a whole, and a core genome representing only 45-52% of the genome of any individual strain. We discovered genes for traits that were not known previously in the strains, including genes for the biosynthesis of the siderophores achromobactin and pseudomonine and the antibiotic 2-hexyl-5-propyl-alkylresorcinol; novel bacteriocins; type II, III, and VI secretion systems; and insect toxins. Certain gene clusters, such as those for two type III secretion systems, are present only in specific sub-clades, suggesting vertical inheritance. Almost all of the genes associated with multitrophic interactions map to genomic regions present in only a subset of the strains or unique to a specific strain. To explore the evolutionary origin of these genes, we mapped their distributions relative to the locations of mobile genetic elements and repetitive extragenic palindromic (REP) elements in each genome. The mobile genetic elements and many strain-specific genes fall into regions devoid of REP elements (i.e., REP deserts) and regions displaying atypical tri-nucleotide composition, possibly indicating relatively recent acquisition of these loci. Collectively, the results of this study highlight the enormous heterogeneity of the P. fluorescens group and the importance of the variable genome in tailoring individual strains to

  3. Comparative genomics of plant-associated Pseudomonas spp.: insights into diversity and inheritance of traits involved in multitrophic interactions.

    Directory of Open Access Journals (Sweden)

    Joyce E Loper

    2012-07-01

    Full Text Available We provide here a comparative genome analysis of ten strains within the Pseudomonas fluorescens group including seven new genomic sequences. These strains exhibit a diverse spectrum of traits involved in biological control and other multitrophic interactions with plants, microbes, and insects. Multilocus sequence analysis placed the strains in three sub-clades, which was reinforced by high levels of synteny, size of core genomes, and relatedness of orthologous genes between strains within a sub-clade. The heterogeneity of the P. fluorescens group was reflected in the large size of its pan-genome, which makes up approximately 54% of the pan-genome of the genus as a whole, and a core genome representing only 45-52% of the genome of any individual strain. We discovered genes for traits that were not known previously in the strains, including genes for the biosynthesis of the siderophores achromobactin and pseudomonine and the antibiotic 2-hexyl-5-propyl-alkylresorcinol; novel bacteriocins; type II, III, and VI secretion systems; and insect toxins. Certain gene clusters, such as those for two type III secretion systems, are present only in specific sub-clades, suggesting vertical inheritance. Almost all of the genes associated with multitrophic interactions map to genomic regions present in only a subset of the strains or unique to a specific strain. To explore the evolutionary origin of these genes, we mapped their distributions relative to the locations of mobile genetic elements and repetitive extragenic palindromic (REP) elements in each genome. The mobile genetic elements and many strain-specific genes fall into regions devoid of REP elements (i.e., REP deserts) and regions displaying atypical tri-nucleotide composition, possibly indicating relatively recent acquisition of these loci. Collectively, the results of this study highlight the enormous heterogeneity of the P. fluorescens group and the importance of the variable genome in tailoring

  4. Population Genomic Analysis Reveals Differential Evolutionary Histories and Patterns of Diversity across Subgenomes and Subpopulations of Brassica napus L.

    Science.gov (United States)

    Gazave, Elodie; Tassone, Erica E; Ilut, Daniel C; Wingerson, Megan; Datema, Erwin; Witsenboer, Hanneke M A; Davis, James B; Grant, David; Dyer, John M; Jenks, Matthew A; Brown, Jack; Gore, Michael A

    2016-01-01

    The allotetraploid species Brassica napus L. is a global crop of major economic importance, providing canola oil (seed) and vegetables for human consumption and fodder and meal for livestock feed. Characterizing the genetic diversity present in the extant germplasm pool of B. napus is fundamental to better conserve, manage and utilize the genetic resources of this species. We used sequence-based genotyping to identify and genotype 30,881 SNPs in a diversity panel of 782 B. napus accessions, representing samples of winter and spring growth habits originating from 33 countries across Europe, Asia, and America. We detected strong population structure broadly concordant with growth habit and geography, and identified three major genetic groups: spring (SP), winter Europe (WE), and winter Asia (WA). Subpopulation-specific polymorphism patterns suggest enriched genetic diversity within the WA group and a smaller effective breeding population for the SP group compared to WE. Interestingly, the two subgenomes of B. napus appear to have different geographic origins, with phylogenetic analysis placing WE and WA as basal clades for the other subpopulations in the C and A subgenomes, respectively. Finally, we identified 16 genomic regions where the patterns of diversity differed markedly from the genome-wide average, several of which are suggestive of genomic inversions. The results obtained in this study constitute a valuable resource for worldwide breeding efforts and the genetic dissection and prediction of complex B. napus traits.

  5. Genetic and genomic diversity studies of Acacia symbionts in Senegal reveal new species of Mesorhizobium with a putative geographical pattern.

    Directory of Open Access Journals (Sweden)

    Fatou Diouf

    Full Text Available Acacia senegal (L) Willd. and Acacia seyal Del. are highly nitrogen-fixing and moderately salt tolerant species. In this study we focused on the genetic and genomic diversity of Acacia mesorhizobia symbionts from diverse origins in Senegal and investigated possible correlations between the genetic diversity of the strains, their soil of origin, and their tolerance to salinity. We first performed a multi-locus sequence analysis on five marker gene fragments on a collection of 47 mesorhizobia strains of A. senegal and A. seyal from 8 localities. Most of the strains (60%) clustered with the M. plurifarium type strain ORS 1032T, while the others form four new clades (MSP1 to MSP4). We sequenced and assembled seven draft genomes: four in the M. plurifarium clade (ORS3356, ORS3365, STM8773 and ORS1032T), and one each in MSP1 (STM8789), MSP2 (ORS3359) and MSP3 (ORS3324). The average nucleotide identities between these genomes together with the MLSA analysis reveal three new species of Mesorhizobium. A great variability of salt tolerance was found among the strains with a lack of correlation between the genetic diversity of mesorhizobia, their salt tolerance and the soil sample characteristics. A putative geographical pattern of A. senegal symbionts between the dryland north part and the center of Senegal was found, reflecting adaptations to specific local conditions such as the water regime. However, the presence of salt does not seem to be an important structuring factor of Mesorhizobium species.

  6. Genetic and genomic diversity studies of Acacia symbionts in Senegal reveal new species of Mesorhizobium with a putative geographical pattern.

    Science.gov (United States)

    Diouf, Fatou; Diouf, Diegane; Klonowska, Agnieszka; Le Queré, Antoine; Bakhoum, Niokhor; Fall, Dioumacor; Neyra, Marc; Parrinello, Hugues; Diouf, Mayecor; Ndoye, Ibrahima; Moulin, Lionel

    2015-01-01

    Acacia senegal (L) Willd. and Acacia seyal Del. are highly nitrogen-fixing and moderately salt tolerant species. In this study we focused on the genetic and genomic diversity of Acacia mesorhizobia symbionts from diverse origins in Senegal and investigated possible correlations between the genetic diversity of the strains, their soil of origin, and their tolerance to salinity. We first performed a multi-locus sequence analysis on five marker gene fragments on a collection of 47 mesorhizobia strains of A. senegal and A. seyal from 8 localities. Most of the strains (60%) clustered with the M. plurifarium type strain ORS 1032T, while the others form four new clades (MSP1 to MSP4). We sequenced and assembled seven draft genomes: four in the M. plurifarium clade (ORS3356, ORS3365, STM8773 and ORS1032T), and one each in MSP1 (STM8789), MSP2 (ORS3359) and MSP3 (ORS3324). The average nucleotide identities between these genomes together with the MLSA analysis reveal three new species of Mesorhizobium. A great variability of salt tolerance was found among the strains with a lack of correlation between the genetic diversity of mesorhizobia, their salt tolerance and the soil sample characteristics. A putative geographical pattern of A. senegal symbionts between the dryland north part and the center of Senegal was found, reflecting adaptations to specific local conditions such as the water regime. However, the presence of salt does not seem to be an important structuring factor of Mesorhizobium species.

  7. The evolutionary history of Plasmodium vivax as inferred from mitochondrial genomes: parasite genetic diversity in the Americas.

    Science.gov (United States)

    Taylor, Jesse E; Pacheco, M Andreína; Bacon, David J; Beg, Mohammad A; Machado, Ricardo Luiz; Fairhurst, Rick M; Herrera, Socrates; Kim, Jung-Yeon; Menard, Didier; Póvoa, Marinete Marins; Villegas, Leopoldo; Mulyanto; Snounou, Georges; Cui, Liwang; Zeyrek, Fadile Yildiz; Escalante, Ananias A

    2013-09-01

    Plasmodium vivax is the most prevalent human malaria parasite in the Americas. Previous studies have contrasted the genetic diversity of parasite populations in the Americas with those in Asia and Oceania, concluding that New World populations exhibit low genetic diversity consistent with a recent introduction. Here we used an expanded sample of complete mitochondrial genome sequences to investigate the diversity of P. vivax in the Americas as well as in other continental populations. We show that the diversity of P. vivax in the Americas is comparable to that in Asia and Oceania, and we identify several divergent clades circulating in South America that may have resulted from independent introductions. In particular, we show that several haplotypes sampled in Venezuela and northeastern Brazil belong to a clade that diverged from the other P. vivax lineages at least 30,000 years ago, albeit not necessarily in the Americas. We propose that, unlike in Asia where human migration increases local genetic diversity, the combined effects of the geographical structure and the low incidence of vivax malaria in the Americas have resulted in patterns of low local but high regional genetic diversity. This could explain previous views that P. vivax in the Americas has low genetic diversity because these were based on studies carried out in limited areas. Further elucidation of the complex geographical pattern of P. vivax variation will be important both for diversity assessments of genes encoding candidate vaccine antigens and in the formulation of control and surveillance measures aimed at malaria elimination.

  8. Gender and Diversity in a Problem and Project Based Learning Environment

    DEFF Research Database (Denmark)

    Du, Xiangyun

    Problem and Project Based Learning (PBL) has been widely used as an educational philosophy and methodology in the construction of student-centered and contextualized learning environments. PBL is also regarded as an effective method in producing engineering graduates who can not only meet the needs...... on the learning experiences of engineering students in the PBL environment in Denmark. This book also attempts to question the issue of diversity in engineering education via the exploration of whether or in which ways the PBL environment is friendly to diverse groups of learners such as women....

  9. Diverse data supports the transition of filamentous fungal model organisms into the post-genomics era

    Energy Technology Data Exchange (ETDEWEB)

    McCluskey, Kevin; Baker, Scott E.

    2017-02-17

    Filamentous fungi have been important as model organisms since the beginning of modern biological inquiry and have benefitted from open data since the earliest genetic maps were shared. From early origins in simple Mendelian genetics of mating types, parasexual genetics of colony colour, and the foundational demonstration of the segregation of a nutritional requirement, the contribution of research systems utilising filamentous fungi has spanned the biochemical genetics era, through the molecular genetics era, and now are at the very foundation of diverse omics approaches to research and development. Fungal model organisms have come from most major taxonomic groups although Ascomycete filamentous fungi have seen the most major sustained effort. In addition to the published material about filamentous fungi, shared molecular tools have found application in every area of fungal biology. Similarly, shared data has contributed to the success of model systems. The scale of data supporting research with filamentous fungi has grown by 10 to 12 orders of magnitude. From genetic to molecular maps, expression databases, and finally genome resources, the open and collaborative nature of the research communities has assured that the rising tide of data has lifted all of the research systems together.

  10. Phylogenomic analyses reveal the diversity of laccase-coding genes in Fonsecaea genomes

    Science.gov (United States)

    Feng, Peiying; Weiss, Vinicius Almir; Vicente, Vania Aparecida; Stielow, J. Benjamin; de Hoog, Sybren

    2017-01-01

    The genus Fonsecaea comprises black yeast-like fungi of clinical relevance, including etiologic agents of chromoblastomycosis and cerebral phaeohyphomycosis. Presence of melanin and assimilation of monoaromatic hydrocarbons and alkylbenzenes have been proposed as virulence factors. Multicopper oxidase (MCO) is a family of enzymes including laccases, ferroxidases and ascorbate oxidases which are able to catalyze the oxidation of various aromatic organic compounds with the reduction of molecular oxygen to water. Additionally, laccases are required for the production of fungal melanins, a cell-wall black pigment recognized as a key polymer for pathogenicity and extremotolerance in black yeast-like fungi. Although the activity of laccase enzymes has previously been reported in many wood-rotting fungi, the diversity of laccase genes in Fonsecaea has not yet been assessed. In this study, we identified and characterized laccase-coding genes and determined their genomic location in five clinical and environmental Fonsecaea species. The identification of laccases sensu stricto will provide insights into carbon acquisition strategies as well as melanin production in Fonsecaea. PMID:28187150

  11. Integrating Diverse Types of Genomic Data to Identify Genes that Underlie Adverse Pregnancy Phenotypes.

    Directory of Open Access Journals (Sweden)

    Jibril Hirbo

    Full Text Available Progress in understanding complex genetic diseases has been bolstered by synthetic approaches that overlay diverse data types and analyses to identify functionally important genes. Pre-term birth (PTB), a major complication of pregnancy, is a leading cause of infant mortality worldwide. A major obstacle in addressing PTB is that the mechanisms controlling parturition and birth timing remain poorly understood. Integrative approaches that overlay datasets derived from comparative genomics with function-derived ones have potential to advance our understanding of the genetics of birth timing, and thus provide insights into the genes that may contribute to PTB. We intersected data from fast evolving coding and non-coding gene regions in the human and primate lineage with data from genes expressed in the placenta, from genes that show enriched expression only in the placenta, as well as from genes that are differentially expressed in four distinct PTB clinical subtypes. A large fraction of genes that are expressed in placenta, and differentially expressed in PTB clinical subtypes (23-34%) are fast evolving, and are associated with functions that include adhesion, neurodevelopmental and immune processes. Functional categories of genes that express fast evolution in coding regions differ from those linked to fast evolution in non-coding regions. Finally, there is a surprising lack of overlap between fast evolving genes that are differentially expressed in four PTB clinical subtypes. Integrative approaches, especially those that incorporate evolutionary perspectives, can be successful in identifying potential genetic contributions to complex genetic diseases, such as PTB.

  12. Homosexuality and the human genome project: private and public choices.

    Science.gov (United States)

    Gabard, D L

    1999-01-01

    Recent scientific research which offers evidence of genetic and biologic influence in homosexuality has created serious concerns. The intent of this article is to offer suggestions based in principles of bioethics in which perceived negative outcomes may be diminished and the positive qualities of the research enhanced. For a portion of the general population the concerns expressed in this article could be alleviated through public discussion and exposure to the findings and theories of the academic and scientific communities. For another portion of the population, however, additional safeguards against misuse of screening tests and somatic cell interventions may be advisable through efforts initiated by researchers themselves, general public policies, and additional medical policies. While these efforts are recommended as short term goals for the separate scientific and social paradigms of homosexuality, it is proposed that an equally important and related debate involves the subjects of disease, normality and the value of diversity. It is suggested that while it is imperative that the behavioral and biological sciences recognize the limitations of their separate approaches, the reductionist approach itself limits our understanding of what essentially are questions of attraction and relationships. In conclusion, homosexuality should be understood from the perspective of autonomy as every person's right to experience a full and meaningful life.

  13. Comparative genomics reveals high biological diversity and specific adaptations in the industrially and medically important fungal genus Aspergillus

    DEFF Research Database (Denmark)

    de Vries, Ronald P.; Riley, Robert; Wiebenga, Ad

    2017-01-01

    Background:  The fungal genus Aspergillus is of critical importance to humankind. Species include those with industrial applications, important pathogens of humans, animals and crops, a source of potent carcinogenic contaminants of food, and an important genetic model. The genome sequences of eight...... here, allows for the first time a genus-wide view of the biological diversity of the aspergilli and in many, but not all, cases linked genome differences to phenotype. Insights gained could be exploited for biotechnological and medical applications of fungi....

  14. Genome-wide evaluation of genetic diversity and linkage disequilibrium in winter and spring triticale (x Triticosecale Wittmack)

    Directory of Open Access Journals (Sweden)

    Alheit Katharina V

    2012-06-01

    Full Text Available Background: Recent advances in genotyping with high-density markers now enable genome-wide analyses in crops. A detailed characterisation of the population structure and linkage disequilibrium (LD) is essential for the application of genomic approaches and consequently for knowledge-based breeding. In this study we used the triticale-specific DArT array to analyze population structure, genetic diversity, and LD in a worldwide set of 161 winter and spring triticale lines. Results: The principal coordinate analysis revealed that the first principal coordinate divides the triticale population into two clusters according to their growth habit. The density distributions of the first ten principal coordinates revealed that several show a distribution indicative of population structure. In addition, we observed relatedness within growth habits which was higher among the spring types than among the winter types. The genome-wide analysis of polymorphic information content (PIC) showed that the PIC is variable among and along chromosomes and that especially the R genome of spring types possesses a reduced genetic diversity. We also found that several chromosomes showed regions of high genetic distance between the two growth habits, indicative of divergent selection. Regarding linkage disequilibrium, the A and B genomes showed a similar LD of 0.24 for closely linked markers and a decay within approximately 12 cM. LD in the R genome was lower with 0.19 and decayed within a shorter map distance of approximately 5 cM. The extent of LD was generally higher for the spring types compared to the winter types. In addition, we observed strong variability of LD along the chromosomes. Conclusions: Our results confirm that winter and spring growth habit are the major contributors to population structure in triticale, and that a family structure exists in both growth types. The specific patterns of genetic diversity observed within these types, such as the

  15. Genomic and resistance gene homolog diversity of the dominant tallgrass prairie species across the U.S. Great Plains precipitation gradient.

    Directory of Open Access Journals (Sweden)

    Matthew N Rouse

    Full Text Available BACKGROUND: Environmental variables such as moisture availability are often important in determining species prevalence and intraspecific diversity. The population genetic structure of dominant plant species in response to a cline of these variables has rarely been addressed. We evaluated the spatial genetic structure and diversity of Andropogon gerardii populations across the U.S. Great Plains precipitation gradient, ranging from approximately 48 cm/year to 105 cm/year. METHODOLOGY/PRINCIPAL FINDINGS: Genomic diversity was evaluated with AFLP markers and diversity of a disease resistance gene homolog was evaluated by PCR-amplification and digestion with restriction enzymes. We determined the degree of spatial genetic structure using Mantel tests. Genomic and resistance gene homolog diversity were evaluated across prairies using Shannon's index and by averaging haplotype dissimilarity. Trends in diversity across prairies were determined using linear regression of diversity on average precipitation for each prairie. We identified significant spatial genetic structure, with genomic similarity decreasing as a function of distance between samples. However, our data indicated that genome-wide diversity did not vary consistently across the precipitation gradient. In contrast, we found that disease resistance gene homolog diversity was positively correlated with precipitation. SIGNIFICANCE: Prairie remnants differ in the genetic resources they maintain. Selection and evolution in this disease resistance homolog are environmentally dependent. Overall, we found that, though this environmental gradient may not predict genomic diversity, individual traits such as disease resistance genes may vary significantly.
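
    The abstract above evaluates diversity with Shannon's index. As a rough illustration only, the index can be sketched as below; the function name and the haplotype labels are hypothetical and are not taken from the study, and the authors' actual computation on AFLP band data may differ.

```python
import math
from collections import Counter


def shannon_index(observations):
    """Shannon's index H = sum(p_i * ln(1 / p_i)) over the observed categories."""
    counts = Counter(observations)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one observation")
    # p * ln(1/p) is the same as -p * ln(p); categories with zero count never appear.
    return sum((n / total) * math.log(total / n) for n in counts.values())


# Hypothetical haplotype labels scored in two prairies (illustrative data only).
prairie_a = ["h1", "h1", "h1", "h1"]   # a single haplotype: no diversity
prairie_b = ["h1", "h2", "h3", "h4"]   # four equally common haplotypes

print(shannon_index(prairie_a))
print(shannon_index(prairie_b))
```

    In a workflow like the one described, one such value per prairie could then be regressed on average precipitation.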

  16. Genomic analysis of oceanic cyanobacterial myoviruses compared with T4-like myoviruses from diverse hosts and environments.

    Science.gov (United States)

    Sullivan, Matthew B; Huang, Katherine H; Ignacio-Espinoza, Julio C; Berlin, Aaron M; Kelly, Libusha; Weigele, Peter R; DeFrancesco, Alicia S; Kern, Suzanne E; Thompson, Luke R; Young, Sarah; Yandava, Chandri; Fu, Ross; Krastins, Bryan; Chase, Michael; Sarracino, David; Osburne, Marcia S; Henn, Matthew R; Chisholm, Sallie W

    2010-11-01

    T4-like myoviruses are ubiquitous, and their genes are among the most abundant documented in ocean systems. Here we compare 26 T4-like genomes, including 10 from non-cyanobacterial myoviruses, and 16 from marine cyanobacterial myoviruses (cyanophages) isolated on diverse Prochlorococcus or Synechococcus hosts. A core genome of 38 virion construction and DNA replication genes was observed in all 26 genomes, with 32 and 25 additional genes shared among the non-cyanophage and cyanophage subsets, respectively. These hierarchical cores are highly syntenic across the genomes, and sampled to saturation. The 25 cyanophage core genes include six previously described genes with putative functions (psbA, mazG, phoH, hsp20, hli03, cobS), a hypothetical protein with a potential phytanoyl-CoA dioxygenase domain, two virion structural genes, and 16 hypothetical genes. Beyond previously described cyanophage-encoded photosynthesis and phosphate stress genes, we observed core genes that may play a role in nitrogen metabolism during infection through modulation of 2-oxoglutarate. Patterns among non-core genes that may drive niche diversification revealed that phosphorus-related gene content reflects source waters rather than host strain used for isolation, and that carbon metabolism genes appear associated with putative mobile elements. As well, phages isolated on Synechococcus had higher genome-wide %G+C and often contained different gene subsets (e.g. petE, zwf, gnd, prnA, cpeT) than those isolated on Prochlorococcus. However, no clear diagnostic genes emerged to distinguish these phage groups, suggesting blurred boundaries possibly due to cross-infection. Finally, genome-wide comparisons of both diverse and closely related, co-isolated genomes provide a locus-to-locus variability metric that will prove valuable for interpreting metagenomic data sets.
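
    The notion of a core genome used in the abstract above (genes found in every genome of a set) amounts to a set intersection. The sketch below assumes gene content is already available as one set of gene names per genome; the genome labels and the gene assignments are invented for illustration, with only the gene names borrowed from the abstract.

```python
def core_genome(gene_sets):
    """Return the genes present in every genome of the mapping (set intersection)."""
    sets = [set(genes) for genes in gene_sets.values()]
    if not sets:
        return set()
    return set.intersection(*sets)


def accessory_genes(gene_sets):
    """Return, per genome, the genes that fall outside the shared core."""
    core = core_genome(gene_sets)
    return {name: set(genes) - core for name, genes in gene_sets.items()}


# Hypothetical gene content for three phage genomes (assignments are made up).
genomes = {
    "phage_A": {"psbA", "mazG", "phoH", "petE"},
    "phage_B": {"psbA", "mazG", "phoH", "zwf"},
    "phage_C": {"psbA", "mazG", "phoH", "gnd", "cpeT"},
}

print(sorted(core_genome(genomes)))
print({name: sorted(genes) for name, genes in accessory_genes(genomes).items()})
```

    Applying the same function to a subset of genomes (for example, cyanophages only) and subtracting the overall core would give the additional subset-specific core genes that the abstract calls hierarchical cores.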

  17. Getting the Word Out on the Human Genome Project: A Course for Physicians

    Energy Technology Data Exchange (ETDEWEB)

    Sara L. Tobin

    2004-09-29

    Our project, "Getting the Word Out on the Human Genome Project: A Course for Physicians," presented educational goals to convey the power and promise of the Human Genome Program to a variety of professional, educational, and public audiences. Our initial goal was to provide practicing physicians with a comprehensive multimedia tool to update their skills in the genomic era. We therefore created the multimedia courseware, "The New Genetics: Courseware for Physicians. Molecular Concepts, Applications, and Ramifications." However, as the project moved forward, several unanticipated audiences found the courseware to be useful for instruction and for self-education, so an additional edition of the courseware "The New Genetics: Medicine and the Human Genome. Molecular Concepts, Applications, and Ramifications" was published simultaneously with the physician version. At the time that both versions of the courseware were being completed, Stanford's Office of Technology Licensing opted not to commercialize the courseware and offered a license-back agreement if the authors founded a commercial business. The authors thus became closely involved in marketing and sales, and several thousand copies of the courseware have been sold. Surprisingly, the non-physician version has turned out to be more in demand, and this has led us in several new directions, most of which involve undergraduate education. These are discussed in detail in the Report.

  18. Genetic diversity and structure of elite cotton germplasm (Gossypium hirsutum L.) using genome-wide SNP data.

    Science.gov (United States)

    Ai, XianTao; Liang, YaJun; Wang, JunDuo; Zheng, JuYun; Gong, ZhaoLong; Guo, JiangPing; Li, XueYuan; Qu, YanYing

    2017-07-28

    Cotton (Gossypium spp.) is the most important natural textile fiber crop, and Gossypium hirsutum L. is responsible for 90% of the annual cotton crop in the world. Information on cotton genetic diversity and population structure is essential for new breeding lines. In this study, we analyzed population structure and genetic diversity of 288 elite Gossypium hirsutum cultivar accessions collected from around the world, and especially from China, using genome-wide single nucleotide polymorphism (SNP) markers. The average polymorphism information content (PIC) was 0.25, indicating a relatively low degree of genetic diversity. Population structure analysis revealed extensive admixture and identified three subgroups. Phylogenetic analysis supported the subgroups identified by STRUCTURE. The results from both population structure and phylogenetic analysis were, for the most part, in agreement with pedigree information. Analysis of molecular variance revealed a larger amount of variation was due to diversity within the groups. Establishment of genetic diversity and population structure from this study could be useful for genetic and genomic analysis and systematic utilization of the standing genetic variation in upland cotton.
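
    To make the PIC figure in the abstract above easier to interpret, here is a minimal sketch of the polymorphism information content using the commonly cited formula of Botstein et al. (1980), PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2. Whether the study used exactly this formula is an assumption, and the allele frequencies are invented. For a biallelic SNP the value peaks at 0.375 when both alleles have frequency 0.5.

```python
def pic(allele_freqs):
    """Polymorphism information content for one marker from its allele frequencies."""
    freqs = list(allele_freqs)
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in freqs)
    # Correction term over all unordered allele pairs (i < j).
    pair_term = sum(
        2 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return 1.0 - homozygosity - pair_term


def mean_pic(markers):
    """Average PIC across a collection of markers."""
    values = [pic(freqs) for freqs in markers]
    return sum(values) / len(values)


# Hypothetical biallelic SNP frequencies (illustrative only).
snps = [(0.5, 0.5), (0.9, 0.1), (1.0, 0.0)]
print([round(pic(f), 4) for f in snps])
print(round(mean_pic(snps), 4))
```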

  19. Genome size diversity in angiosperms and its influence on gene space.

    Science.gov (United States)

    Dodsworth, Steven; Leitch, Andrew R; Leitch, Ilia J

    2015-12-01

    Genome size varies c. 2400-fold in angiosperms (flowering plants), although the range of genome size is skewed towards small genomes, with a mean genome size of 1C=5.7Gb. One of the most crucial factors governing genome size in angiosperms is the relative amount and activity of repetitive elements. Recently, there have been new insights into how these repeats, previously discarded as 'junk' DNA, can have a significant impact on gene space (i.e. the part of the genome comprising all the genes and gene-related DNA). Here we review these new findings and explore in what ways genome size itself plays a role in influencing how repeats impact genome dynamics and gene space, including gene expression. Copyright © 2015 The Authors. Published by Elsevier Ltd. All rights reserved.

  20. The Pinus taeda genome is characterized by diverse and highly diverged repetitive sequences

    Directory of Open Access Journals (Sweden)

    Yandell Mark

    2010-07-01

    Full Text Available Background: In today's age of genomic discovery, no attempt has been made to comprehensively sequence a gymnosperm genome. The largest genus in the coniferous family Pinaceae is Pinus, whose 110-120 species have extremely large genomes (c. 20-40 Gb, 2N = 24). The size and complexity of these genomes have prompted much speculation as to the feasibility of completing a conifer genome sequence. Conifer genomes are reputed to be highly repetitive, but there is little information available on the nature and identity of repetitive units in gymnosperms. The pines have extensive genetic resources, with approximately 329000 ESTs from eleven species and genetic maps in eight species, including a dense genetic map of the twelve linkage groups in Pinus taeda. Results: We present here the Sanger sequence and annotation of ten P. taeda BAC clones and Genome Analyzer II whole genome shotgun (WGS) sequences representing 7.5% of the genome. Computational annotation of ten BACs predicts three putative protein-coding genes and at least fifteen likely pseudogenes in nearly one megabase of sequence. We found three conifer-specific LTR retroelements in the BACs, and tentatively identified at least 15 others based on evidence from the distantly related angiosperms. Alignment of WGS sequences to the BACs indicates that 80% of BAC sequences have similar copies (≥ 75% nucleotide identity) elsewhere in the genome, but only 23% have identical copies (99% identity). The three most common repetitive elements in the genome were identified and, when combined, represent less than 5% of the genome. Conclusions: This study indicates that the majority of repeats in the P. taeda genome are 'novel' and will therefore require additional BAC or genomic sequencing for accurate characterization. The pine genome contains a very large number of diverged and probably defunct repetitive elements. This study also provides new evidence that sequencing a pine genome using a WGS approach is

  1. Cultural diversity in Brazilian children’s literature: The project Literatura em Minha Casa in question

    Directory of Open Access Journals (Sweden)

    Flávia Ferreira de Paula

    2016-08-01

    Full Text Available This paper intends to search for representations of Brazilian cultural diversity in children’s literature of the Programa Nacional Biblioteca da Escola [National Program of School Library] (PNBE, in the editions of 2001, 2002, and 2003, years of the project Literatura em Minha Casa [Literature in My House], especially those addressed to fourth and fifth grades of Elementary School. The selection criteria of works claimed that the collections should “[…] present a small picture of the Brazilian culture […]” (Brasil, 2001; 2002; 2003, p. 12, understanding that culture as characterized by diversity. Therefore, the analysis was divided into two phases: the first dealt with ethnic plurality and the second with culture and regionalism. In general, the results showed that among 120 works analyzed, 15 had ethnic-racial diversity and 12 works presented aspects of regionalism and culture from different parts of Brazil.

  2. Estimating variation within the genes and inferring the phylogeny of 186 sequenced diverse Escherichia coli genomes

    DEFF Research Database (Denmark)

    Kaas, Rolf Sommer; Rundsten, Carsten Friis; Ussery, David

    2012-01-01

    more biologically relevant, especially considering that many of these genome sequences are draft quality. The E. coli pan-genome for this set of isolates contains 16,373 gene clusters. A core-gene tree, based on alignment and a pan-genome tree based on gene presence/absence, maps the relatedness...

  3. Comparative genomic analysis reveals a diverse repertoire of genes involved in prokaryote-eukaryote interactions within the Pseudovibrio genus.

    Directory of Open Access Journals (Sweden)

    Stefano Romano

    2016-03-01

    Full Text Available Strains of the Pseudovibrio genus have been detected worldwide, mainly as part of bacterial communities associated with marine invertebrates, particularly sponges. This recurrent association has been considered as an indication of a symbiotic relationship between these microbes and their host. Until recently, the availability of only two genomes, belonging to closely related strains, has limited the knowledge on the genomic and physiological features of the genus to a single phylogenetic lineage. Here we present 10 newly sequenced genomes of Pseudovibrio strains isolated from marine sponges from the west coast of Ireland, and including the other two publicly available genomes we performed an extensive comparative genomic analysis. Homogeneity was apparent in terms of both the orthologous genes and the metabolic features shared amongst the 12 strains. At the genomic level, a key physiological difference observed amongst the isolates was the presence only in strain P. axinellae AD2 of genes encoding proteins involved in assimilatory nitrate reduction, which was then proved experimentally. We then focused on studying those systems known to be involved in the interactions with eukaryotic and prokaryotic cells. This analysis revealed that the genus harbors a large diversity of toxin-like proteins, secretion systems and their potential effectors. Their distribution in the genus was not always consistent with the phylogenetic relationship of the strains. Finally, our analyses identified new genomic islands encoding potential toxin-immunity systems, previously unknown in the genus. Our analyses shed new light on the Pseudovibrio genus, indicating a large diversity of both metabolic features and systems for interacting with the host. The diversity in both distribution and abundance of these systems amongst the strains underlines how metabolically and phylogenetically similar bacteria may use different strategies to interact with the host and find a niche

  4. Comparative Genomic Analysis Reveals a Diverse Repertoire of Genes Involved in Prokaryote-Eukaryote Interactions within the Pseudovibrio Genus.

    Science.gov (United States)

    Romano, Stefano; Fernàndez-Guerra, Antonio; Reen, F Jerry; Glöckner, Frank O; Crowley, Susan P; O'Sullivan, Orla; Cotter, Paul D; Adams, Claire; Dobson, Alan D W; O'Gara, Fergal

    2016-01-01

    Strains of the Pseudovibrio genus have been detected worldwide, mainly as part of bacterial communities associated with marine invertebrates, particularly sponges. This recurrent association has been considered as an indication of a symbiotic relationship between these microbes and their host. Until recently, the availability of only two genomes, belonging to closely related strains, has limited the knowledge on the genomic and physiological features of the genus to a single phylogenetic lineage. Here we present 10 newly sequenced genomes of Pseudovibrio strains isolated from marine sponges from the west coast of Ireland, and including the other two publicly available genomes we performed an extensive comparative genomic analysis. Homogeneity was apparent in terms of both the orthologous genes and the metabolic features shared amongst the 12 strains. At the genomic level, a key physiological difference observed amongst the isolates was the presence only in strain P. axinellae AD2 of genes encoding proteins involved in assimilatory nitrate reduction, which was then proved experimentally. We then focused on studying those systems known to be involved in the interactions with eukaryotic and prokaryotic cells. This analysis revealed that the genus harbors a large diversity of toxin-like proteins, secretion systems and their potential effectors. Their distribution in the genus was not always consistent with the phylogenetic relationship of the strains. Finally, our analyses identified new genomic islands encoding potential toxin-immunity systems, previously unknown in the genus. Our analyses shed new light on the Pseudovibrio genus, indicating a large diversity of both metabolic features and systems for interacting with the host. The diversity in both distribution and abundance of these systems amongst the strains underlines how metabolically and phylogenetically similar bacteria may use different strategies to interact with the host and find a niche within its

  5. Assessment of climate change impact on water diversion strategies of Melamchi Water Supply Project in Nepal

    Science.gov (United States)

    Shrestha, Sangam; Shrestha, Manish; Babel, Mukand S.

    2017-04-01

    This paper analyzes the climate change impact on the water diversion plan of the Melamchi Water Supply Project (MWSP) in Nepal. The MWSP is an interbasin water transfer project aimed at diverting water from the Melamchi River of the Indrawati River basin to Kathmandu Valley for drinking water purposes. Future temperature and precipitation of the basin were predicted using the outputs of two regional climate models (RCMs) and two general circulation models (GCMs) under two representative concentration pathway (RCP) scenarios, which were then used as inputs to the Soil and Water Assessment Tool (SWAT) to predict the water availability and evaluate the water diversion strategies in the future. The average temperature of the basin is projected to increase by 2.35 to 4.25 °C under RCP 4.5 and RCP 8.5, respectively, by 2085s. The average precipitation in the basin is projected to increase by 6-18 % in the future. The annual water availability is projected to increase in the future; however, variability is observed in monthly water availability in the basin. The water supply and demand scenarios of Kathmandu Valley were also examined by considering the population increase, unaccounted-for water and water diversion from MWSP in the future. It is observed that even with the additional supply of water from MWSP and reduction of unaccounted-for water, the Kathmandu Valley will still be under water scarcity in the future. The findings of this study can be helpful to formulate water supply and demand management strategies in Kathmandu Valley in the context of climate change in the future.

  6. Assessment of climate change impact on water diversion strategies of Melamchi Water Supply Project in Nepal

    Science.gov (United States)

    Shrestha, Sangam; Shrestha, Manish; Babel, Mukand S.

    2015-12-01

    This paper analyzes the climate change impact on the water diversion plan of the Melamchi Water Supply Project (MWSP) in Nepal. The MWSP is an interbasin water transfer project aimed at diverting water from the Melamchi River of the Indrawati River basin to Kathmandu Valley for drinking water purposes. Future temperature and precipitation of the basin were predicted using the outputs of two regional climate models (RCMs) and two general circulation models (GCMs) under two representative concentration pathway (RCP) scenarios, which were then used as inputs to the Soil and Water Assessment Tool (SWAT) to predict the water availability and evaluate the water diversion strategies in the future. The average temperature of the basin is projected to increase by 2.35 to 4.25 °C under RCP 4.5 and RCP 8.5, respectively, by 2085s. The average precipitation in the basin is projected to increase by 6-18 % in the future. The annual water availability is projected to increase in the future; however, variability is observed in monthly water availability in the basin. The water supply and demand scenarios of Kathmandu Valley were also examined by considering the population increase, unaccounted-for water and water diversion from MWSP in the future. It is observed that even with the additional supply of water from MWSP and reduction of unaccounted-for water, the Kathmandu Valley will still be under water scarcity in the future. The findings of this study can be helpful to formulate water supply and demand management strategies in Kathmandu Valley in the context of climate change in the future.

  7. Comparative genomic analysis of 45 type strains of the genus Bifidobacterium: a snapshot of its genetic diversity and evolution.

    Directory of Open Access Journals (Sweden)

    Zhihong Sun

    Full Text Available Bifidobacteria are well known for their human health-promoting effects and are therefore widely applied in the food industry. Members of the Bifidobacterium genus were first identified from the human gastrointestinal tract and were then found to be widely distributed across various ecological niches. Although the genetic diversity of Bifidobacterium has been determined based on several marker genes or a few genomes, the global diversity and evolution scenario for the entire genus remain unresolved. The present study comparatively analyzed the genomes of 45 type strains. We built a robust genealogy for Bifidobacterium based on 402 core genes and defined its root according to the phylogeny of the tree of bacteria. Our results support that all human isolates are of younger lineages, and although species isolated from bees dominate the more ancient lineages, the bee was not necessarily the original host for bifidobacteria. Moreover, the species isolated from different hosts are enriched with specific gene sets, suggesting host-specific adaptation. Notably, bee-specific genes are strongly associated with respiratory metabolism and are potential in helping those bacteria adapt to the oxygen-rich gut environment in bees. This study provides a snapshot of the genetic diversity and evolution of Bifidobacterium, paving the way for future studies on the taxonomy and functional genomics of the genus.

  8. Exploring the diversity of Arcobacter butzleri from cattle in the UK using MLST and whole genome sequencing.

    Directory of Open Access Journals (Sweden)

    J Yvette Merga

    Full Text Available Arcobacter butzleri is considered to be an emerging human foodborne pathogen. The completion of an A. butzleri genome sequence along with microarray analysis of 13 isolates in 2007 revealed a surprising amount of diversity amongst A. butzleri isolates from humans, animals and food. In order to further investigate Arcobacter diversity, 792 faecal samples were collected from cattle on beef and dairy farms in the North West of England. Arcobacter was isolated from 42.5% of the samples and the diversity of the isolates was investigated using multilocus sequence typing. An A. butzleri whole genome sequence, obtained by 454 shotgun sequencing of an isolate from a clinically-healthy dairy cow, showed a number of differences when compared to the genome of a human-derived A. butzleri isolate. PCR-based prevalence assays for variable genes suggested some tentative evidence for source-related distributions. We also found evidence for phenotypic differences relating to growth capabilities between our representative human and cattle isolates. Our genotypic and phenotypic observations suggest that some level of niche adaptation may have occurred in A. butzleri.

  9. Exploring the diversity of Arcobacter butzleri from cattle in the UK using MLST and whole genome sequencing.

    Science.gov (United States)

    Merga, J Yvette; Williams, Nicola J; Miller, William G; Leatherbarrow, Andrew J H; Bennett, Malcolm; Hall, Neil; Ashelford, Kevin E; Winstanley, Craig

    2013-01-01

    Arcobacter butzleri is considered to be an emerging human foodborne pathogen. The completion of an A. butzleri genome sequence along with microarray analysis of 13 isolates in 2007 revealed a surprising amount of diversity amongst A. butzleri isolates from humans, animals and food. In order to further investigate Arcobacter diversity, 792 faecal samples were collected from cattle on beef and dairy farms in the North West of England. Arcobacter was isolated from 42.5% of the samples and the diversity of the isolates was investigated using multilocus sequence typing. An A. butzleri whole genome sequence, obtained by 454 shotgun sequencing of an isolate from a clinically-healthy dairy cow, showed a number of differences when compared to the genome of a human-derived A. butzleri isolate. PCR-based prevalence assays for variable genes suggested some tentative evidence for source-related distributions. We also found evidence for phenotypic differences relating to growth capabilities between our representative human and cattle isolates. Our genotypic and phenotypic observations suggest that some level of niche adaptation may have occurred in A. butzleri.

  10. Comparative genomic and functional analyses: unearthing the diversity and specificity of nematicidal factors in Pseudomonas putida strain 1A00316

    Science.gov (United States)

    Guo, Jing; Jing, Xueping; Peng, Wen-Lei; Nie, Qiyu; Zhai, Yile; Shao, Zongze; Zheng, Longyu; Cai, Minmin; Li, Guangyu; Zuo, Huaiyu; Zhang, Zhitao; Wang, Rui-Ru; Huang, Dian; Cheng, Wanli; Yu, Ziniu; Chen, Ling-Ling; Zhang, Jibin

    2016-01-01

    We isolated Pseudomonas putida (P. putida) strain 1A00316 from Antarctica. This bacterium has a high efficiency against Meloidogyne incognita (M. incognita) in vitro and under greenhouse conditions. The complete genome of P. putida 1A00316 was sequenced using PacBio single molecule real-time (SMRT) technology. A comparative genomic analysis of 16 Pseudomonas strains revealed that although P. putida 1A00316 belonged to P. putida, it was phenotypically more similar to nematicidal Pseudomonas fluorescens (P. fluorescens) strains. We characterized the diversity and specificity of nematicidal factors in P. putida 1A00316 with comparative genomics and functional analysis, and found that P. putida 1A00316 has diverse nematicidal factors including protein alkaline metalloproteinase AprA and two secondary metabolites, hydrogen cyanide and cyclo-(l-isoleucyl-l-proline). We show for the first time that cyclo-(l-isoleucyl-l-proline) exhibits nematicidal activity in P. putida. Interestingly, our study did not detect common nematicidal factors such as 2,4-diacetylphloroglucinol (2,4-DAPG) and pyrrolnitrin in P. putida 1A00316. The results of the present study reveal the diversity and specificity of nematicidal factors in P. putida strain 1A00316. PMID:27384076

  11. Genome sequences of siphoviruses infecting marine Synechococcus unveil a diverse cyanophage group and extensive phage-host genetic exchanges.

    Science.gov (United States)

    Huang, Sijun; Wang, Kui; Jiao, Nianzhi; Chen, Feng

    2012-02-01

    Investigating the interactions between marine cyanobacteria and their viruses (phages) is important for understanding the dynamics of the ocean's primary productivity. Genome sequencing of marine cyanophages has greatly advanced our understanding of their ecology and evolution. Among 24 reported genomes of cyanophages that infect marine picocyanobacteria, 17 are from cyanomyoviruses, six from cyanopodoviruses, and only one from a cyanosiphovirus (Prochlorococcus phage P-SS2). Here we present four complete genome sequences of siphoviruses (S-CBS1, S-CBS2, S-CBS3 and S-CBS4) that infect four different marine Synechococcus strains. Three distinct subtypes were recognized among the five known marine siphoviruses (including P-SS2) in terms of morphology, genome architecture, gene content and sequence similarity. Our study revealed that cyanosiphoviruses are genetically diverse with polyphyletic origin. No core genes were found across these five cyanosiphovirus genomes, in contrast to the many core genes that have been found in cyanomyovirus or cyanopodovirus genomes. Interestingly, genes encoding three structural proteins and a lysozyme of S-CBS1 and S-CBS3 showed homology to a prophage-like genetic element in two freshwater Synechococcus elongatus genomes. Re-annotation of the prophage-like genomic region suggests that S. elongatus may contain an intact prophage. Cyanosiphovirus genes involved in DNA metabolism and replication share high sequence homology with those in cyanobacteria, and further phylogenetic analysis based on these genes suggests that ancient and selective genetic exchanges occurred, possibly due to past prophage integration. Metagenomic analysis based on the Global Ocean Sampling database showed that cyanosiphoviruses are present in relatively low abundance in the ocean surface water compared to cyanomyoviruses and cyanopodoviruses.

  12. Genome sequence and genetic diversity of the common carp, Cyprinus carpio.

    Science.gov (United States)

    Xu, Peng; Zhang, Xiaofeng; Wang, Xumin; Li, Jiongtang; Liu, Guiming; Kuang, Youyi; Xu, Jian; Zheng, Xianhu; Ren, Lufeng; Wang, Guoliang; Zhang, Yan; Huo, Linhe; Zhao, Zixia; Cao, Dingchen; Lu, Cuiyun; Li, Chao; Zhou, Yi; Liu, Zhanjiang; Fan, Zhonghua; Shan, Guangle; Li, Xingang; Wu, Shuangxiu; Song, Lipu; Hou, Guangyuan; Jiang, Yanliang; Jeney, Zsigmond; Yu, Dan; Wang, Li; Shao, Changjun; Song, Lai; Sun, Jing; Ji, Peifeng; Wang, Jian; Li, Qiang; Xu, Liming; Sun, Fanyue; Feng, Jianxin; Wang, Chenghui; Wang, Shaolin; Wang, Baosen; Li, Yan; Zhu, Yaping; Xue, Wei; Zhao, Lan; Wang, Jintu; Gu, Ying; Lv, Weihua; Wu, Kejing; Xiao, Jingfa; Wu, Jiayan; Zhang, Zhang; Yu, Jun; Sun, Xiaowen

    2014-11-01

    The common carp, Cyprinus carpio, is one of the most important cyprinid species and globally accounts for 10% of freshwater aquaculture production. Here we present a draft genome of domesticated C. carpio (strain Songpu), whose current assembly contains 52,610 protein-coding genes and approximately 92.3% coverage of its paleotetraploidized genome (2n = 100). The latest round of whole-genome duplication has been estimated to have occurred approximately 8.2 million years ago. Genome resequencing of 33 representative individuals from worldwide populations demonstrates a single origin for C. carpio in 2 subspecies (C. carpio haematopterus and C. carpio carpio). Integrative genomic and transcriptomic analyses were used to identify loci potentially associated with traits including scaling patterns and skin color. In combination with the high-resolution genetic map, the draft genome paves the way for better molecular studies and improved genome-assisted breeding of C. carpio and other closely related species.

  13. High-resolution genetic map for understanding the effect of genome-wide recombination rate on nucleotide diversity in watermelon.

    Science.gov (United States)

    Reddy, Umesh K; Nimmakayala, Padma; Levi, Amnon; Abburi, Venkata Lakshmi; Saminathan, Thangasamy; Tomason, Yan R; Vajja, Gopinath; Reddy, Rishi; Abburi, Lavanya; Wehner, Todd C; Ronin, Yefim; Karol, Abraham

    2014-09-15

    We used genotyping by sequencing to identify a set of 10,480 single nucleotide polymorphism (SNP) markers for constructing a high-resolution genetic map of 1096 cM for watermelon. We assessed the genome-wide variation in recombination rate (GWRR) across the map and found an association between GWRR and genome-wide nucleotide diversity. Collinearity between the map and the genome-wide reference sequence for watermelon was studied to identify inconsistency and chromosome rearrangements. We assessed genome-wide nucleotide diversity, linkage disequilibrium (LD), and selective sweep for wild, semi-wild, and domesticated accessions of Citrullus lanatus var. lanatus to track signals of domestication. Principal component analysis combined with chromosome-wide phylogenetic study based on 1563 SNPs obtained after LD pruning with minor allele frequency of 0.05 resolved the differences between semi-wild and wild accessions as well as relationships among worldwide sweet watermelon. Population structure analysis revealed predominant ancestries for wild, semi-wild, and domesticated watermelons as well as admixture of various ancestries that were important for domestication. Sliding window analysis of Tajima's D across various chromosomes was used to resolve selective sweep. LD decay was estimated for various chromosomes. We identified a strong selective sweep on chromosome 3 consisting of important genes that might have had a role in sweet watermelon domestication. Copyright © 2014 Reddy et al.
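
    The sliding-window Tajima's D scan mentioned in the abstract above can be illustrated with a minimal sketch. It uses the standard published formula for Tajima's D; the function names, the 0/1 haplotype-string input format, and the window parameters are assumptions made for illustration and are not taken from the study.

```python
from math import sqrt


def tajimas_d(haplotypes):
    """Tajima's D for equal-length '0'/'1' haplotype strings (one string per chromosome).

    Assumes biallelic sites. Returns None when D is undefined
    (fewer than 4 chromosomes or no segregating sites).
    """
    n = len(haplotypes)
    derived_counts = [col.count("1") for col in zip(*haplotypes)]
    seg = [k for k in derived_counts if 0 < k < n]  # segregating sites only
    s = len(seg)
    if n < 4 or s == 0:
        return None
    # Mean number of pairwise differences, summed over sites.
    pi = sum(2 * k * (n - k) / (n * (n - 1)) for k in seg)
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n * n + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    return (pi - s / a1) / sqrt(e1 * s + e2 * s * (s - 1))


def sliding_tajimas_d(haplotypes, positions, window, step):
    """Tajima's D in windows [start, start + window) along one chromosome.

    positions[i] is the coordinate of column i of every haplotype string.
    Returns a list of (window_start, D or None).
    """
    results = []
    start, last = min(positions), max(positions)
    while start <= last:
        idx = [i for i, p in enumerate(positions) if start <= p < start + window]
        sub = ["".join(h[i] for i in idx) for h in haplotypes]
        results.append((start, tajimas_d(sub)))
        start += step
    return results
```

    In a scan of this kind, windows with strongly negative D (an excess of rare variants) are the usual candidates for a selective sweep; any threshold for calling a sweep would be a separate analysis choice.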

  14. Investigations into genome diversity of Haemophilus influenzae using whole genome sequencing of clinical isolates and laboratory transformants

    Directory of Open Access Journals (Sweden)

    Power Peter M

    2012-11-01

    Background: Haemophilus influenzae is an important human commensal pathogen associated with significant levels of disease. High-throughput DNA sequencing was used to investigate differences in genome content within this species. Results: Genomic DNA sequence was obtained from 85 strains of H. influenzae and from other related species, selected based on geographical site of isolation, disease association and documented genotypic and phenotypic differences. When compared by Mauve alignment these indicated groupings of H. influenzae that were consistent with previously published analyses; capsule expressing strains fell into two distinct groups and those of serotype b (Hib) were found in two closely positioned lineages. For 18 Hib strains representing both lineages we found many discrete regions (up to 40% of the total genome) displaying sequence variation when compared to a common reference strain. Evidence that this naturally occurring pattern of inter-strain variation in H. influenzae can be mediated by transformation was obtained through sequencing DNA obtained from a pool of 200 independent transformants of a recipient (strain Rd) using donor DNA from a heterologous Hib strain (Eagan). Conclusion: Much of the inter-strain variation in genome sequence in H. influenzae is likely the result of inter-strain exchanges of DNA, most plausibly through transformation.

  15. Genome-wide analysis of gene expression in primate taste buds reveals links to diverse processes.

    Directory of Open Access Journals (Sweden)

    Peter Hevezi

    Efforts to unravel the mechanisms underlying taste sensation (gustation) have largely focused on rodents. Here we present the first comprehensive characterization of gene expression in primate taste buds. Our findings reveal unique new insights into the biology of taste buds. We generated a taste bud gene expression database using laser capture microdissection (LCM)-procured fungiform (FG) and circumvallate (CV) taste buds from primates. We also used LCM to collect the top and bottom portions of CV taste buds. Affymetrix genome wide arrays were used to analyze gene expression in all samples. Known taste receptors are preferentially expressed in the top portion of taste buds. Genes associated with the cell cycle and stem cells are preferentially expressed in the bottom portion of taste buds, suggesting that precursor cells are located there. Several chemokines including CXCL14 and CXCL8 are among the highest expressed genes in taste buds, indicating that immune system related processes are active in taste buds. Several genes expressed specifically in endocrine glands including growth hormone releasing hormone and its receptor are also strongly expressed in taste buds, suggesting a link between metabolism and taste. Cell type-specific expression of transcription factors and signaling molecules involved in cell fate, including KIT, reveals the taste bud as an active site of cell regeneration, differentiation, and development. IKBKAP, a gene mutated in familial dysautonomia, a disease that results in loss of taste buds, is expressed in taste cells that communicate with afferent nerve fibers via synaptic transmission. This database highlights the power of LCM coupled with transcriptional profiling to dissect the molecular composition of normal tissues, represents the most comprehensive molecular analysis of primate taste buds to date, and provides a foundation for further studies in diverse aspects of taste biology.

  16. Citrus sinensis annotation project (CAP): a comprehensive database for sweet orange genome.

    Science.gov (United States)

    Wang, Jia; Chen, Dijun; Lei, Yang; Chang, Ji-Wei; Hao, Bao-Hai; Xing, Feng; Li, Sen; Xu, Qiang; Deng, Xiu-Xin; Chen, Ling-Ling

    2014-01-01

    Citrus is one of the most important and widely grown fruit crops, with global production ranking first among all the fruit crops in the world. Sweet orange accounts for more than half of Citrus production, both as fresh fruit and as processed juice. We have sequenced the draft genome of a double-haploid sweet orange (C. sinensis cv. Valencia), and constructed the Citrus sinensis annotation project (CAP) to store and visualize the sequenced genomic and transcriptome data. CAP provides GBrowse-based organization of sweet orange genomic data, which integrates ab initio gene prediction, EST, RNA-seq and RNA-paired end tag (RNA-PET) evidence-based gene annotation. Furthermore, we provide a user-friendly web interface to show the predicted protein-protein interactions (PPIs) and metabolic pathways in sweet orange. CAP provides comprehensive information beneficial to researchers working on sweet orange and other woody plants, and is freely available at http://citrus.hzau.edu.cn/.

  17. Diversity of Layer 5 Projection Neurons in the Mouse Motor Cortex

    Directory of Open Access Journals (Sweden)

    Manfred J Oswald

    2013-10-01

    In the primary motor cortex (M1), layer 5 projection neurons signal directly to distant motor structures to drive movement. Despite their pivotal position and acknowledged diversity these neurons are traditionally separated into broad commissural and corticofugal types, and until now no attempt has been made at resolving the basis for their diversity. We therefore probed the electrophysiological and morphological properties of retrogradely labelled M1 corticospinal (CSp), corticothalamic (CTh), and commissural projecting corticostriatal (CStr) and corticocortical (CC) neurons. An unsupervised cluster analysis established at least four phenotypes with additional differences between lumbar and cervical projecting CSp neurons. Distinguishing parameters included the action potential (AP) waveform, firing behaviour, the hyperpolarisation-activated sag potential, sublayer position, and soma and dendrite size. CTh neurons differed from CSp neurons in showing spike frequency acceleration and a greater sag potential. CStr neurons had the lowest AP amplitude and maximum rise rate of all neurons. Temperature influenced spike train behaviour in corticofugal neurons. At 26 °C CTh neurons fired bursts of APs more often than CSp neurons, but at 36 °C both groups fired regular APs. Our findings provide reliable phenotypic fingerprints to identify distinct M1 projection neuron classes as a tool to understand their unique contributions to motor function.

  18. Immigration, health and diversity management: Preliminary developments of a project in neighborhoods of Catalonia

    Directory of Open Access Journals (Sweden)

    Dan Rodríguez-García

    2007-09-01

    This article presents an ongoing research project on immigration, health, and socio-cultural diversity, and offers preliminary information on the theoretical and socio-demographic context of this investigation. The objective of the project, funded by the Department of Health of the Autonomous Government of Catalonia, Spain, is to analyse the socioeconomic and cultural factors involved in health and in access to the formal health system for a few major migrant communities and ethnic minorities living in high-priority neighbourhoods in Catalonia. The results of this project, which will come fundamentally from ethnographic research, aim to give suggestions for improving health conditions for the population and to provide professionals working in the public health care system with some conceptual and practical tools for improving intercultural communication between themselves and their patients, as well as for detecting, preventing, and resolving problems in everyday practice.

  19. Microbial iron management mechanisms in extremely acidic environments: comparative genomics evidence for diversity and versatility

    Directory of Open Access Journals (Sweden)

    Nieto Pamela A

    2008-11-01

    uptake systems could reflect their obligatory occupation of extremely low pH environments where high concentrations of soluble iron may always be available and where oxidized sulfur species might not compromise iron speciation dynamics. Presence of bacterioferritin in the Acidithiobacilli, polyphosphate accumulation functions and variants of FieF-like diffusion facilitators in both Acidithiobacilli and Leptospirilla, indicate that they may remove or store iron under conditions of variable availability. In addition, the Fe(II)-oxidizing capacity of both A. ferrooxidans and Leptospirilla could itself be a way to evade iron stress imposed by readily available Fe(II) ions at low pH. Fur regulatory sites have been predicted for a number of gene clusters including iron related and non-iron related functions in both the Acidithiobacilli and Leptospirilla, laying the foundation for the future discovery of iron regulated and iron-phosphate coordinated regulatory control circuits. Conclusion: In silico analyses of the genomes of acidophilic bacteria are beginning to tease apart the mechanisms that mediate iron uptake and homeostasis in low pH environments. Initial models pinpoint significant differences in abundance and diversity of iron management mechanisms between Leptospirilla and Acidithiobacilli, and begin to reveal how these two groups respond to iron cycling and iron fluctuations in naturally acidic environments and in industrial operations. Niche partitions and ecological successions between acidophilic microorganisms may be partially explained by these observed differences. Models derived from these analyses pave the way for improved hypothesis testing and well directed experimental investigation. In addition, aspects of these models should challenge investigators to evaluate alternative iron management strategies in non-acidophilic model organisms.

  20. The Human Genome Project and Mental Retardation: An Educational Program. Final Progress Report

    Energy Technology Data Exchange (ETDEWEB)

    Davis, Sharon

    1999-05-03

    The Arc, a national organization on mental retardation, conducted an educational program for members, many of whom have a family member with a genetic condition causing mental retardation. The project informed members about the Human Genome scientific efforts, conducted training regarding ethical, legal and social implications (ELSI), and involved members in issue discussions. Short reports and fact sheets on genetic and ELSI topics were disseminated to 2,200 of the Arc's leaders across the country and to other interested individuals. Materials produced by the project can be found on the Arc's web site, TheArc.org.

  1. Natural history of Bartonella-infecting rodents in light of new knowledge on genomics, diversity and evolution.

    Science.gov (United States)

    Buffet, Jean-Philippe; Kosoy, Michael; Vayssier-Taussat, Muriel

    2013-09-01

    Among the 33 confirmed Bartonella species to date, more than half are hosted by rodent species, and at least five of them have been involved in human illness causing diverse symptoms including fever, myocarditis, endocarditis, lymphadenitis and hepatitis. In almost all countries, wild rodents are infected by extremely diverse Bartonella strains with a high prevalence. In the present paper, in light of new knowledge on rodent-adapted Bartonella species genomics, we bring together knowledge gained in recent years to have an overview of the impact of rodent-adapted Bartonella infection on humans and to determine how diversity of Bartonella helps to understand their mechanisms of adaptation to rodents and the consequences on human health.

  2. Genome Structural Diversity among 31 Bordetella pertussis Isolates from Two Recent U.S. Whooping Cough Statewide Epidemics.

    Science.gov (United States)

    Bowden, Katherine E; Weigand, Michael R; Peng, Yanhui; Cassiday, Pamela K; Sammons, Scott; Knipe, Kristen; Rowe, Lori A; Loparev, Vladimir; Sheth, Mili; Weening, Keeley; Tondella, M Lucia; Williams, Margaret M

    2016-01-01

    During 2010 and 2012, California and Vermont, respectively, experienced statewide epidemics of pertussis with differences seen in the demographic affected, case clinical presentation, and molecular epidemiology of the circulating strains. To overcome limitations of the current molecular typing methods for pertussis, we utilized whole-genome sequencing to gain a broader understanding of how current circulating strains are causing large epidemics. Through the use of combined next-generation sequencing technologies, this study compared de novo, single-contig genome assemblies from 31 out of 33 Bordetella pertussis isolates collected during two separate pertussis statewide epidemics and 2 resequenced vaccine strains. Final genome architecture assemblies were verified with whole-genome optical mapping. Sixteen distinct genome rearrangement profiles were observed in epidemic isolate genomes, all of which were distinct from the genome structures of the two resequenced vaccine strains. These rearrangements appear to be mediated by repetitive sequence elements, such as high-copy-number mobile genetic elements and rRNA operons. Additionally, novel and previously identified single nucleotide polymorphisms were detected in 10 virulence-related genes in the epidemic isolates. Whole-genome variation analysis identified state-specific variants, and coding regions bearing nonsynonymous mutations were classified into functional annotated orthologous groups. Comprehensive studies on whole genomes are needed to understand the resurgence of pertussis and develop novel tools to better characterize the molecular epidemiology of evolving B. pertussis populations. IMPORTANCE Pertussis, or whooping cough, is the most poorly controlled vaccine-preventable bacterial disease in the United States, which has experienced a resurgence for more than a decade. Once viewed as a monomorphic pathogen, B. pertussis strains circulating during epidemics exhibit diversity visible on a genome structural

  3. Putatively novel serotypes and the potential for reduced vaccine effectiveness: capsular locus diversity revealed among 5405 pneumococcal genomes

    Science.gov (United States)

    van Tonder, Andries J.; Bray, James E.; Quirk, Sigríður J.; Haraldsson, Gunnsteinn; Jolley, Keith A.; Maiden, Martin C. J.; Hoffmann, Steen; Bentley, Stephen D.; Haraldsson, Ásgeir; Erlendsdóttir, Helga; Kristinsson, Karl G.; Brueggemann, Angela B.

    2017-01-01

    The pneumococcus is a leading global pathogen and a key virulence factor possessed by the majority of pneumococci is an antigenic polysaccharide capsule (‘serotype’), which is encoded by the capsular (cps) locus. Approximately 100 different serotypes are known, but the extent of sequence diversity within the cps loci of individual serotypes is not well understood. Investigating serotype-specific sequence variation is crucial to the design of sequence-based serotyping methodology, understanding pneumococcal conjugate vaccine (PCV) effectiveness and the design of future PCVs. The availability of large genome datasets makes it possible to assess population-level variation among pneumococcal serotypes and in this study 5405 pneumococcal genomes were used to investigate cps locus diversity among 49 different serotypes. Pneumococci had been recovered between 1916 and 2014 from people of all ages living in 51 countries. Serotypes were deduced bioinformatically, cps locus sequences were extracted and variation was assessed within the cps locus, in the context of pneumococcal genetic lineages. Overall, cps locus sequence diversity varied markedly: low to moderate diversity was revealed among serogroups/types 1, 3, 7, 9, 11 and 22; whereas serogroups/types 6, 19, 23, 14, 15, 18, 33 and 35 displayed high diversity. Putative novel and/or hybrid cps loci were identified among all serogroups/types apart from 1, 3 and 9. This study demonstrated that cps locus sequence diversity varied widely between serogroups/types. Investigation of the biochemical structure of the polysaccharide capsule of major variants, particularly PCV-related serotypes and those that appear to be novel or hybrids, is warranted. PMID:28133541

  4. Enhancing genome-wide copy number variation identification by high density array CGH using diverse resources of pig breeds.

    Directory of Open Access Journals (Sweden)

    Jiying Wang

    Copy number variations (CNVs) are important forms of genomic variation, and have attracted extensive attention in humans as well as domestic animals. In the study, using a custom-designed 2.1 M array comparative genomic hybridization (aCGH), genome-wide CNVs were identified among 12 individuals from diverse pig breeds, including one Asian wild population, six Chinese indigenous breeds and two modern commercial breeds (Yorkshire and Landrace), with one individual of the other modern commercial breed, Duroc, as the reference. A total of 1,344 CNV regions (CNVRs) were identified, covering 47.79 Mb (∼1.70%) of the pig genome. The length of these CNVRs ranged from 3.37 Kb to 1,319.0 Kb with a mean of 35.56 Kb and a median of 11.11 Kb. Compared with similar studies reported, most of the CNVRs (74.18%) were first identified in the present study. In order to confirm these CNVRs, 21 CNVRs were randomly chosen to be validated by quantitative real time PCR (qPCR), and a high rate (85.71%) of confirmation was obtained. Functional annotation of CNVRs suggested that the identified CNVRs have important functions, and may play an important role in phenotypic and production trait differences among various breeds. Our results are an essential complement to the CNV map in the pig genome, which will provide abundant genetic markers for association studies between various phenotypes and CNVs in pigs.
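
    The summary figures in the abstract above are mutually consistent, which a short arithmetic check makes explicit. This is only a sketch: the function name is hypothetical, 1 Mb is taken as 1,000 Kb, and the confirmed-CNVR count and the genome size are values implied by the reported percentages rather than numbers stated in the abstract.

```python
def summarise_cnvr(total_mb=47.79, n_cnvr=1344, genome_fraction=0.0170,
                   n_tested=21, confirmation_rate=0.8571):
    """Derive implied quantities from the figures reported in the abstract."""
    mean_kb = total_mb * 1000 / n_cnvr                # reported mean: 35.56 Kb
    n_confirmed = round(n_tested * confirmation_rate)  # implied by 85.71% of 21
    implied_genome_mb = total_mb / genome_fraction     # implied by ~1.70% coverage
    return {
        "mean_cnvr_kb": round(mean_kb, 2),
        "n_confirmed": n_confirmed,
        "confirmation_pct": round(100 * n_confirmed / n_tested, 2),
        "implied_genome_mb": round(implied_genome_mb),
    }


if __name__ == "__main__":
    for key, value in summarise_cnvr().items():
        print(f"{key}: {value}")
```

    The large gap between the mean (35.56 Kb) and the median (11.11 Kb) indicates a right-skewed length distribution, with a few very long CNVRs pulling the mean upward.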

  5. The projection of a test genome onto a reference population and applications to humans and archaic hominins.

    Science.gov (United States)

    Yang, Melinda A; Harris, Kelley; Slatkin, Montgomery

    2014-12-01

    We introduce a method for comparing a test genome with numerous genomes from a reference population. Sites in the test genome are given a weight, w, that depends on the allele frequency, x, in the reference population. The projection of the test genome onto the reference population is the average weight for each x, [Formula: see text]. The weight is assigned in such a way that, if the test genome is a random sample from the reference population, then [Formula: see text]. Using analytic theory, numerical analysis, and simulations, we show how the projection depends on the time of population splitting, the history of admixture, and changes in past population size. The projection is sensitive to small amounts of past admixture, the direction of admixture, and admixture from a population not sampled (a ghost population). We compute the projections of several human and two archaic genomes onto three reference populations from the 1000 Genomes Project (Europeans, Han Chinese, and Yoruba) and discuss the consistency of our analysis with previously published results for European and Yoruba demographic history. Including higher amounts of admixture between Europeans and Yoruba soon after their separation and low amounts of admixture more recently can resolve discrepancies between the projections and demographic inferences from some previous studies.
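
    The idea of averaging a per-site weight within each reference allele-frequency class can be sketched as follows. The formulas are elided in this record, so the weight used here, w = k / (2x) for a diploid test genome carrying k derived alleles at a site with reference derived-allele frequency x, is an assumption: it is one choice whose expected value is 1 when the test genome is a random sample from the reference population, and it is not necessarily the authors' definition. All names are hypothetical.

```python
from collections import defaultdict


def projection(test_derived_counts, ref_freqs, ploidy=2):
    """Average weight per reference derived-allele frequency x.

    test_derived_counts[i]: derived alleles carried by the test genome at site i.
    ref_freqs[i]: derived-allele frequency x at site i in the reference population.
    Assumed weight (illustrative only): w = k / (ploidy * x).
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for k, x in zip(test_derived_counts, ref_freqs):
        if not 0 < x <= 1:
            continue  # the weight is undefined where the reference lacks the allele
        sums[x] += k / (ploidy * x)
        counts[x] += 1
    return {x: sums[x] / counts[x] for x in sorted(sums)}
```

    Under this assumed weight, frequency classes where the average falls below 1 are those in which the test genome carries fewer derived alleles than a random member of the reference population would.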

  6. Evaluative profiling of arsenic sensing and regulatory systems in the human microbiome project genomes.

    Science.gov (United States)

    Isokpehi, Raphael D; Udensi, Udensi K; Simmons, Shaneka S; Hollman, Antoinesha L; Cain, Antia E; Olofinsae, Samson A; Hassan, Oluwabukola A; Kashim, Zainab A; Enejoh, Ojochenemi A; Fasesan, Deborah E; Nashiru, Oyekanmi

    2014-01-01

    The influence of environmental chemicals including arsenic, a type 1 carcinogen, on the composition and function of the human-associated microbiota is of significance in human health and disease. We have developed a suite of bioinformatics and visual analytics methods to evaluate the availability (presence or absence) and abundance of functional annotations in a microbial genome for seven Pfam protein families: As(III)-responsive transcriptional repressor (ArsR), anion-transporting ATPase (ArsA), arsenical pump membrane protein (ArsB), arsenate reductase (ArsC), arsenical resistance operon transacting repressor (ArsD), water/glycerol transport protein (aquaporins), and universal stress protein (USP). These genes encode functions for sensing and/or regulating arsenic content in the bacterial cell. The evaluative profiling strategy was applied to 3,274 genomes from which 62 genomes from 18 genera were identified to contain genes for the seven protein families. Our list included 12 genomes in the Human Microbiome Project (HMP) from the following genera: Citrobacter, Escherichia, Lactobacillus, Providencia, Rhodococcus, and Staphylococcus. Gene neighborhood analysis of the arsenic resistance operon in the genome of Bacteroides thetaiotaomicron VPI-5482, a human gut symbiont, revealed the adjacent arrangement of genes for arsenite binding/transfer (ArsD) and cytochrome c biosynthesis (DsbD_2). Visual analytics-facilitated evaluation of protein annotations in 367 genomes in the phylum Bacteroidetes identified multiple genomes in which genes for ArsD and DsbD_2 were adjacently arranged. Cytochrome c, produced by a posttranslational process, consists of heme-containing proteins important for cellular energy production and signaling. Further research is desired to elucidate arsenic resistance and arsenic-mediated cellular energy production in the Bacteroidetes.
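
    The presence/absence and abundance profiling described above can be pictured with a small sketch. The input layout (a mapping from genome name to per-family annotation counts), the family labels, and the function names are assumptions made for illustration; they do not reproduce the authors' tools or data.

```python
# Short labels for the seven protein families named in the abstract (labels are illustrative).
FAMILIES = ("ArsR", "ArsA", "ArsB", "ArsC", "ArsD", "Aquaporin", "USP")


def presence_profile(counts, families=FAMILIES):
    """Binary availability string, one character per family ('1' = at least one annotation)."""
    return "".join("1" if counts.get(f, 0) > 0 else "0" for f in families)


def genomes_with_all_families(annotation_counts, families=FAMILIES):
    """Names of genomes annotated with every family, sorted alphabetically."""
    return sorted(
        genome
        for genome, counts in annotation_counts.items()
        if all(counts.get(f, 0) > 0 for f in families)
    )


if __name__ == "__main__":
    # Hypothetical counts, not data from the study.
    example = {
        "genome_A": {"ArsR": 3, "ArsA": 1, "ArsB": 1, "ArsC": 2,
                     "ArsD": 1, "Aquaporin": 1, "USP": 6},
        "genome_B": {"ArsR": 2, "ArsC": 1, "USP": 4},
    }
    for name, counts in example.items():
        print(name, presence_profile(counts))
    print(genomes_with_all_families(example))
```

    A binary profile string makes it easy to group genomes that share the same combination of families, while the raw counts preserve the abundance information.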

  7. Diversity of the Ty-1 copia retrotransposon Tos17 in rice (Oryza sativa L.) and the AA genome of the Oryza genus

    OpenAIRE

    Petit, J.; Bourgeois, E; Stenger, W.; Bes, M.; Droc, G.; Meynard, D.; Courtois, B.; Ghesquière, Alain; Sabot, François; Panaud, O.; Guiderdoni, E.

    2009-01-01

    Retrotransposons are mobile genetic elements, ubiquitous in eukaryotic genomes, which have proven to be major genetic tools in determining phylogeny and structuring genetic diversity, notably in plants. We investigate here the diversity of the Ty1-copia retrotransposon Tos17 in the cultivated rice of Asian origin (Oryza sativa L.) and related AA genome species of the Oryza genus, to contribute to the understanding of the complex evolutionary history in this group of species through that of the eleme...

  8. A high-density Diversity Arrays Technology (DArT) microarray for genome-wide genotyping in Eucalyptus

    Directory of Open Access Journals (Sweden)

    Myburg Alexander A

    2010-06-01

    Background: A number of molecular marker technologies have allowed important advances in the understanding of the genetics and evolution of Eucalyptus, a genus that includes over 700 species, some of which are used worldwide in plantation forestry. Nevertheless, the average marker density achieved with current technologies remains at the level of a few hundred markers per population. Furthermore, the transferability of markers produced with most existing technology across species and pedigrees is usually very limited. High throughput, combined with wide genome coverage and high transferability are necessary to increase the resolution, speed and utility of molecular marker technology in eucalypts. We report the development of a high-density DArT genome profiling resource and demonstrate its potential for genome-wide diversity analysis and linkage mapping in several species of Eucalyptus. Findings: After testing several genome complexity reduction methods we identified the PstI/TaqI method as the most effective for Eucalyptus and developed 18 genomic libraries from PstI/TaqI representations of 64 different Eucalyptus species. A total of 23,808 cloned DNA fragments were screened and 13,300 (56%) were found to be polymorphic among 284 individuals. After a redundancy analysis, 6,528 markers were selected for the operational array and these were supplemented with 1,152 additional clones taken from a library made from the E. grandis tree whose genome has been sequenced. Performance validation for diversity studies revealed 4,752 polymorphic markers among 174 individuals. Additionally, 5,013 markers showed segregation when screened using six inter-specific mapping pedigrees, with an average of 2,211 polymorphic markers per pedigree and a minimum of 859 polymorphic markers that were shared between any two pedigrees. Conclusions: This operational DArT array will deliver 1,000-2,000 polymorphic markers for linkage mapping in most eucalypt pedigrees

  9. Data Management Challenges in a National Scientific Program of 55 Diverse Research Projects

    Science.gov (United States)

    De Bruin, T.

    2016-12-01

    In 2007-2015, the Dutch funding agency NWO funded the National Ocean and Coastal Research Program (in Dutch: ZKO). This program focused on `the scientific analysis of five societal challenges related to a sustainable use of the sea and coastal zones'. These five challenges were safety, economic yield, nature, spatial planning & development and water quality. The ZKO program was `set up to strengthen the cohesion and collaboration within Dutch marine research'. From the start of the program, data management was addressed, to allow data to be shared among the diverse research projects. The ZKO program was divided into 4 different themes (or regions): Carrying Capacity (Wadden Sea), Oceans, North Sea, and Transnational Wadden Sea Research. The `Carrying Capacity' theme was subdivided into 3 `research lines': Policy-relevant Research, Monitoring, and Hypothesis-driven Research. 56 projects were funded, ranging from studies on the governance of the Wadden Sea to expeditions studying trace elements in the Atlantic Ocean. One of the first projects to be funded was the data management project. Its objectives were to allow data exchange between projects, to archive all relevant data from all ZKO projects and to make the data and publications publicly available, following the ZKO Data Policy. This project was carried out by the NIOZ Data Management Group. It turned out that the research projects had hardly any interest in sharing data between projects and had good (?) arguments not to share data at all until the end of the projects. A data portal was built, to host and make available all ZKO data and publications. When it came to submitting the data to this portal, most projects obliged willingly, though they occasionally found it difficult to find time to do so. However, some projects refused to submit data to an open data portal, despite the rules set up by the funding agency and agreed by all. The take-home message of this presentation is that data sharing is a cultural and

  10. Potential Ecological Benefits of the Middle Route for the South-North Water Diversion Project

    Institute of Scientific and Technical Information of China (English)

    HanChu CHEN; DU Pengfei

    2008-01-01

    This paper presents a study of the middle route of the South-North Water Diversion Project. The middle route runs through the Northern China plain, where the water shortages are the most severe. There is not only a shortage of water for human usage, but also a shortage of ecological water. Although the current plan for the middle route is strictly focused on supplying water for residential and industrial use, the water can also potentially be used for ecological purposes. This paper evaluates the potential ecological benefits that can be brought to the fragile ecology in northern China by the middle route, in addition to the water supplied to residences and industry. The study describes ecological benefits of the middle route project, such as mitigation of groundwater extraction in the region and positive influences on the climate; the ecological uses of the middle route project itself, such as creating artificial niches along the channel and directly using the channel for ecological purposes; and the ecological uses of the water along the middle route, such as diversion of the water into river channels that have suffered from drought conditions for decades.

  11. The human genome project: Information management, access, and regulation. Technical progress report, 1 April--31 August 1993

    Energy Technology Data Exchange (ETDEWEB)

    McInerney, J.D.; Micikas, L.B.

    1993-09-10

    Efforts are described to prepare educational materials, both computer-based and conventional, for teaching interested high school and elementary students about aspects of the Human Genome Project.

  12. Blueprint for Sustainable Change in Diversity Management and Cultural Competence: Lessons From the National Center for Healthcare Leadership Diversity Demonstration Project.

    Science.gov (United States)

    Dreachslin, Janice L; Weech-Maldonado, Robert; Gail, Judith; Epané, Josué Patien; Wainio, Joyce Anne

    How can healthcare leaders build a sustainable infrastructure to leverage workforce diversity and deliver culturally and linguistically appropriate care to patients? To answer that question, two health systems participated in the National Center for Healthcare Leadership's diversity leadership demonstration project, November 2008 to December 2013. Each system provided one intervention hospital and one control hospital. The control hospital in each system participated in pre- and postassessments but received no preassessment feedback and no intervention support. Each intervention hospital's C-suite leadership and demonstration project manager worked with a diversity coach provided by the National Center for Healthcare Leadership to design and implement an action plan to improve diversity and cultural competence practices and build a sustainable infrastructure. Plans explored areas of strength and areas for improvement that were identified through preintervention assessments. The assessments focused on five competencies of strategic diversity management and culturally and linguistically appropriate care: diversity leadership, strategic human resource management, organizational climate, diversity climate, and patient cultural competence. This article describes each intervention hospital's success in action plan implementation and reports results of postintervention interviews with leadership to provide a blueprint for sustainable change.

  13. Ethical challenges and innovations in the dissemination of genomic data: the experience of the PERSPECTIVE project

    Directory of Open Access Journals (Sweden)

    Lévesque E

    2015-08-01

    Full Text Available Emmanuelle Lévesque,1 Bartha Maria Knoppers,1 Jacques Simard,2 1Department of Human Genetics, Centre for Genomics and Policy, McGill University, Montréal, 2Genomics Centre, CHU de Québec Research Center, Department of Molecular Medicine, Laval University, Québec City, QC, Canada Abstract: The importance of making genomic data available for future research is now widely recognized among the scientific community and policymakers. In this era of shared responsibility for data dissemination, improved patient care through research depends on the development of powerful and secure data-sharing systems. As part of the concerted effort to share research resources, the project entitled Personalized Risk Stratification for Prevention and Early Detection of Breast Cancer (PERSPECTIVE) makes effective data sharing, through the development of a data-sharing framework, one of its goals. The secondary uses of data from PERSPECTIVE for future research promise to enhance our knowledge of breast cancer etiologies without duplicating data-gathering efforts. Despite its benefit for research, we recognize the ethical challenges of data sharing on the local, national, and international levels. The effective management of ethical approvals for projects spanning jurisdictions, the return of results to research participants, and research incentives and recognition for data production are but a few pressing issues that need to be properly addressed. We discuss how we managed these issues and suggest how ongoing innovations might help to facilitate data sharing in future genomic research projects. Keywords: data sharing, research ethics, cancer

  14. Analysis of Anoxybacillus genomes from the aspects of lifestyle adaptations, prophage diversity, and carbohydrate metabolism.

    Directory of Open Access Journals (Sweden)

    Kian Mau Goh

    Full Text Available Species of Anoxybacillus are widespread in geothermal springs, manure, and milk-processing plants. The genus is composed of 22 species and two subspecies, but the relationship between its lifestyle and genome is little understood. In this study, two high-quality draft genomes were generated from Anoxybacillus spp. SK3-4 and DT3-1, isolated from Malaysian hot springs. De novo assembly and annotation were performed, followed by comparative genome analysis with the complete genome of Anoxybacillus flavithermus WK1 and two additional draft genomes, of A. flavithermus TNO-09.006 and A. kamchatkensis G10. The genomes of Anoxybacillus spp. are among the smaller of the family Bacillaceae. Despite having smaller genomes, their essential genes related to lifestyle adaptations at elevated temperature, extreme pH, and protection against ultraviolet are complete. Due to the presence of various competence proteins, Anoxybacillus spp. SK3-4 and DT3-1 are able to take up foreign DNA fragments, and some of these transferred genes are important for the survival of the cells. The analysis of intact putative prophage genomes shows that they are highly diversified. Based on the genome analysis using SEED, many of the annotated sequences are involved in carbohydrate metabolism. The presence of glycosyl hydrolases among the Anoxybacillus spp. was compared, and the potential applications of these unexplored enzymes are suggested here. This is the first study that compares Anoxybacillus genomes from the aspect of lifestyle adaptations, the capacity for horizontal gene transfer, and carbohydrate metabolism.

  15. Positive correlation between recombination rate and nucleotide diversity is shown under domestication selection in the chicken genome

    Institute of Scientific and Technical Information of China (English)

    FANG Lin; YE Jia; LI Ning; ZHANG Yong; LI SongGang; GANE KaShu WONG; WANG Jun

    2008-01-01

    Positive correlation between recombination rate and nucleotide diversity has been observed in a wide variety of eukaryotes on the megabase scale. On the basis of a genome-wide chicken genetic variation map generated by comparing three domestic breeds with their wild ancestor, and the positions of markers on the genetic linkage map, we found that SNP rates were similar for all chromosomes while the recombination rates increased in microchromosomes. In other words, no correlation exists at the whole-chromosome level. Nevertheless, when we scanned the genome by calculating the values of each characteristic within non-overlapping windows, instead of a single value for each chromosome, the nucleotide diversity was found to be significantly correlated with the recombination rate (r=0.27, P<0.0005). Furthermore, a significant association existed not only between these two features but also between all 6 pairwise combinations of nucleotide diversity, recombination rate, GC content and average gene length. This co-variation is very meaningful for studies of sequence evolution.
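
    The windowed comparison described in the abstract above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the window size, and the toy inputs are all hypothetical, and it assumes each measurement is a (position, value) pair on a single chromosome. It averages two per-site quantities within non-overlapping windows and then computes the Pearson correlation coefficient r across the windows that contain both.

```python
from math import sqrt


def window_means(positions, values, window_size):
    """Average `values` inside non-overlapping windows of `window_size` bp.

    Returns a dict mapping window index (position // window_size) to the mean.
    """
    sums, counts = {}, {}
    for pos, val in zip(positions, values):
        w = pos // window_size
        sums[w] = sums.get(w, 0.0) + val
        counts[w] = counts.get(w, 0) + 1
    return {w: sums[w] / counts[w] for w in sums}


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences.

    Assumes neither sequence is constant (otherwise the denominator is zero).
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def windowed_correlation(div_pos, div_val, rec_pos, rec_val, window_size):
    """Correlate two per-site quantities using windows present in both."""
    diversity = window_means(div_pos, div_val, window_size)
    recombination = window_means(rec_pos, rec_val, window_size)
    shared = sorted(set(diversity) & set(recombination))
    return pearson_r([diversity[w] for w in shared],
                     [recombination[w] for w in shared])


# Hypothetical toy data: three 100-bp windows with perfectly linear values.
r = windowed_correlation([10, 110, 210], [1.0, 2.0, 3.0],
                         [50, 150, 250], [2.0, 4.0, 6.0], 100)
print(r)
```

    Working per window rather than per chromosome gives many more data points, which is why a modest correlation such as the reported r=0.27 can still be statistically significant.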

  16. Genome-wide analysis of LTR-retrotransposon diversity and its impact on the evolution of the genus Helianthus (L.).

    Science.gov (United States)

    Mascagni, Flavia; Giordani, Tommaso; Ceccarelli, Marilena; Cavallini, Andrea; Natali, Lucia

    2017-08-18

    Genome divergence through mobile element activity and recombination is a continuous process that plays a key role in the evolution of species. Nevertheless, knowledge of retrotransposon-related variability among species belonging to the same genus is still limited. Considering the importance of the genus Helianthus, a model system for studying the ecological genetics of speciation and adaptation, we performed a comparative analysis of the repetitive genome fraction across ten species and one subspecies of sunflower, focusing on long terminal repeat retrotransposons at superfamily, lineage and sublineage levels. After determining the relative genome size of each species, genomic DNA was isolated and subjected to Illumina sequencing. Then, different assembly and clustering approaches allowed us to explore the repetitive component of all genomes. On average, repetitive DNA in Helianthus species represented more than 75% of the genome, being composed mostly of long terminal repeat retrotransposons. Also, the prevalence of the Gypsy over the Copia superfamily was observed and, among lineages, Chromovirus was by far the most represented. Although nearly all the same sublineages are present in all species, we found considerable variability in the abundance of diverse retrotransposon lineages and sublineages, especially between annual and perennial species. This large variability should indicate that different events of amplification or loss related to these elements occurred following species separation and should have been involved in species differentiation. Our data allowed us to infer the extent of interspecific repetitive DNA variation related to LTR-RE abundance, to investigate the relationship between changes in LTR-RE abundance and the evolution of the genus, and to determine the degree of coevolution of different LTR-RE lineages or sublineages between and within species. 
Moreover, the data suggested that LTR-RE abundance in a species was affected by the annual or perennial

  17. Final Independent External Peer Review Report for the Intake Diversion Dam Modification Lower Yellowstone Project, Montana Draft Supplement to the 26 April 2010 Environmental Assessment and Appendices

    Science.gov (United States)

    2013-02-08

    February 8, 2013. Final Independent External Peer Review Report for the Intake Diversion Dam Modification Lower Yellowstone Project, Montana Draft Supplement to the 26 April 2010 Environmental Assessment and Appendices (Intake Project IEPR Final IEPR Report).

  18. Comprehensive Survey of Genetic Diversity in Chloroplast Genomes and 45S nrDNAs within Panax ginseng Species

    Science.gov (United States)

    Kim, Kyunghee; Lee, Sang-Choon; Lee, Junki; Lee, Hyun Oh; Joh, Ho Jun; Kim, Nam-Hoon; Park, Hyun-Seung; Yang, Tae-Jin

    2015-01-01

    We report complete sequences of the chloroplast (cp) genome and 45S nuclear ribosomal DNA (45S nrDNA) for 11 Panax ginseng cultivars. We have obtained complete sequences of cp and 45S nrDNA, the representative barcoding target sequences for the cytoplasmic and nuclear genomes, respectively, based on low-coverage NGS sequence of each cultivar. The cp genome sizes ranged from 156,241 to 156,425 bp, and the major size variation was derived from differences in copy number of tandem repeats in the ycf1 gene and in the intergenic regions of rps16-trnUUG and rpl32-trnUAG. The complete 45S nrDNA unit sequences were 11,091 bp, representing a consensus single transcriptional unit with an intergenic spacer region. Comparative analysis of these sequences, as well as those previously reported for three Chinese accessions, identified very rare but unique polymorphisms in the cp genome within P. ginseng cultivars. There were 12 intra-species polymorphisms (six SNPs and six InDels) among 14 cultivars. We also identified five SNPs from the 45S nrDNA of 11 Korean ginseng cultivars. From the 17 unique informative polymorphic sites, we developed six reliable markers for analysis of ginseng diversity and cultivar authentication. PMID:26061692

  19. Fruits of human genome project and private venture, and their impact on life science.

    Science.gov (United States)

    Ikekawa, A; Ikekawa, S

    2001-12-01

    A small knowledge base was created by organizing articles on the Human Genome Project (HGP) and its related issues published in "Science" magazine between 1996 and 2000. This base revealed the stunning achievement of the HGP and a private venture and its impact on today's biology and life science. In the mid-1990s, they encouraged the development of advanced high-throughput automated DNA sequencers and technologies that can analyse all genes at once in a systematic fashion. Using these technologies, they completed the genome sequences of human and various other organisms. These fruits opened the door to comparative genomics, functional genomics, the interdisciplinary field between computer science and biology, and proteomics. They have caused a shift in biological investigation from studying single genes or proteins to studying all genes or proteins at once, and are causing revolutionary changes in traditional biology, drug discovery and therapy. They have expanded the range of potential drug targets and have facilitated a shift in drug discovery programs toward rational target-based strategies. They have spawned pharmacogenomics, which could give rise to a new generation of highly effective drugs that treat causes, not just symptoms. They should also cause a migration from traditional medications that are safe and effective for every member of the population to personalized medicine and personalized therapy.

  20. Patterns of cross-contamination in a multispecies population genomic project: detection, quantification, impact, and solutions.

    Science.gov (United States)

    Ballenghien, Marion; Faivre, Nicolas; Galtier, Nicolas

    2017-03-29

    Contamination is a well-known but often neglected problem in molecular biology. Here, we investigated the prevalence of cross-contamination among 446 samples from 116 distinct species of animals, which were processed in the same laboratory and subjected to subcontracted transcriptome sequencing. Using cytochrome oxidase 1 as a barcode, we identified a minimum of 782 events of between-species contamination, with approximately 80% of our samples being affected. An analysis of laboratory metadata revealed a strong effect of the sequencing center: nearly all the detected events of between-species contamination involved species that were sent the same day to the same company. We introduce new methods to address the amount of within-species, between-individual contamination, and to correct for this problem when calling genotypes from base read counts. We report evidence for pervasive within-species contamination in this data set, and show that classical population genomic statistics, such as synonymous diversity, the ratio of non-synonymous to synonymous diversity, inbreeding coefficient FIT, and Tajima's D, are sensitive to this problem to various extents. Control analyses suggest that our published results are probably robust to the problem of contamination. Recommendations on how to prevent or avoid contamination in large-scale population genomics/molecular ecology are provided based on this analysis.
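
    To make one of the statistics named in the abstract above concrete, the following Python sketch computes Tajima's D from a small alignment using the standard formula, which compares mean pairwise diversity with the number of segregating sites. It is a textbook illustration only, not the contamination-aware method the authors introduce; the function name is hypothetical, and it assumes aligned haploid sequences of equal length with no missing data.

```python
from itertools import combinations
from math import sqrt


def tajimas_d(sequences):
    """Tajima's D for aligned, equal-length haploid sequences.

    Returns None when there are no segregating sites (D is undefined).
    Requires at least 4 sequences, since the variance term is zero below that.
    """
    n = len(sequences)
    if n < 4:
        raise ValueError("need at least 4 sequences")
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("sequences must have equal length")

    # S: number of alignment columns with more than one allele.
    seg_sites = sum(1 for column in zip(*sequences) if len(set(column)) > 1)
    if seg_sites == 0:
        return None

    # pi: mean number of differences over all pairs of sequences.
    n_pairs = n * (n - 1) / 2
    total_diffs = sum(sum(a != b for a, b in zip(s, t))
                      for s, t in combinations(sequences, 2))
    pi = total_diffs / n_pairs

    # Constants that depend only on the sample size n.
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n * n + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)

    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (pi - seg_sites / a1) / sqrt(variance)


# Hypothetical toy alignment in which every variant is a singleton,
# which produces an excess of rare variants and a negative D.
singletons = ["TAAAA", "ATAAA", "AATAA", "AAATA", "AAAAT", "AAAAA"]
print(tajimas_d(singletons))
```

    Because D depends on the allele frequency spectrum, reads leaking from one individual into another can shift it, which is consistent with the sensitivity the abstract reports.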

  1. Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication.

    Science.gov (United States)

    Wu, G Albert; Prochnik, Simon; Jenkins, Jerry; Salse, Jerome; Hellsten, Uffe; Murat, Florent; Perrier, Xavier; Ruiz, Manuel; Scalabrin, Simone; Terol, Javier; Takita, Marco Aurélio; Labadie, Karine; Poulain, Julie; Couloux, Arnaud; Jabbari, Kamel; Cattonaro, Federica; Del Fabbro, Cristian; Pinosio, Sara; Zuccolo, Andrea; Chapman, Jarrod; Grimwood, Jane; Tadeo, Francisco R; Estornell, Leandro H; Muñoz-Sanz, Juan V; Ibanez, Victoria; Herrero-Ortega, Amparo; Aleza, Pablo; Pérez-Pérez, Julián; Ramón, Daniel; Brunel, Dominique; Luro, François; Chen, Chunxian; Farmerie, William G; Desany, Brian; Kodira, Chinnappa; Mohiuddin, Mohammed; Harkins, Tim; Fredrikson, Karin; Burns, Paul; Lomsadze, Alexandre; Borodovsky, Mark; Reforgiato, Giuseppe; Freitas-Astúa, Juliana; Quetier, Francis; Navarro, Luis; Roose, Mikeal; Wincker, Patrick; Schmutz, Jeremy; Morgante, Michele; Machado, Marcos Antonio; Talon, Manuel; Jaillon, Olivier; Ollitrault, Patrick; Gmitter, Frederick; Rokhsar, Daniel

    2014-07-01

    Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes--a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-orange genomes--and show that cultivated types derive from two progenitor species. Although cultivated pummelos represent selections from one progenitor species, Citrus maxima, cultivated mandarins are introgressions of C. maxima into the ancestral mandarin species Citrus reticulata. The most widely cultivated citrus, sweet orange, is the offspring of previously admixed individuals, but sour orange is an F1 hybrid of pure C. maxima and C. reticulata parents, thus implying that wild mandarins were part of the early breeding germplasm. A Chinese wild 'mandarin' diverges substantially from C. reticulata, thus suggesting the possibility of other unrecognized wild citrus species. Understanding citrus phylogeny through genome analysis clarifies taxonomic relationships and facilitates sequence-directed genetic improvement.

  2. Chromosomal copy number variation, selection and uneven rates of recombination reveal cryptic genome diversity linked to pathogenicity.

    Directory of Open Access Journals (Sweden)

    Rhys A Farrer

    Full Text Available Pathogenic fungi constitute a growing threat to both plant and animal species on a global scale. Despite a clonal mode of reproduction dominating the population genetic structure of many fungi, putatively asexual species are known to adapt rapidly when confronted by efforts to control their growth and transmission. However, the mechanisms by which adaptive diversity is generated across a clonal background are often poorly understood. We sequenced a global panel of the emergent amphibian pathogen, Batrachochytrium dendrobatidis (Bd), to high depth and characterized rapidly changing features of its genome that we believe hold the key to the worldwide success of this organism. Our analyses show three processes that contribute to the generation of de novo diversity. Firstly, we show that the majority of wild isolates manifest chromosomal copy number variation that changes over short timescales. Secondly, we show that cryptic recombination occurs within all lineages of Bd, leading to large regions of the genome being in linkage equilibrium, and is preferentially associated with classes of genes of known importance for virulence in other pathosystems. Finally, we show that these classes of genes are under directional selection, and that this has predominantly targeted the Global Panzootic Lineage (BdGPL). Our analyses show that Bd manifests an unusually dynamic genome that may have been shaped by its association with the amphibian host. The rates of variation that we document likely explain the high levels of phenotypic variability that have been reported for Bd, and suggest that the dynamic genome of this pathogen has contributed to its success across multiple biomes and host species.

  3. The Shanggongshan Tunnel Kunming Zhangjiuhe River Water Diversion and Water Supply Project

    Institute of Scientific and Technical Information of China (English)

    J. P. Kaegi; M. Bachmann; A. Colombi

    2004-01-01

    Kunming is the political and economic centre of Yunnan Province in the south-west of China and one of the most beautiful historical and cultural cities in China. It is also one of the 14 cities in China that are severely short of water. In order to solve the supply problem and to allow for future development of the local society and economy, the "Kunming Zhangjiuhe River Water Diversion and Water Supply Project" was implemented. The total investment for the project is about USD 476 million. The objective is to establish a water supply system with a capacity of 0.6 million tons of water per day. Major parts of the project are: [...] capacity by 0.442 billion m3 and an annual water supply of 0.245 billion m3; [...] tunnels, but also some siphons); [...]pacity of 0.4 million tons per day in the initial stage and 0.6 million tons per day once completed; [...] length of 93.43 km; [...]sons. Project completion is planned for the end of 2006.

  4. The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification.

    Science.gov (United States)

    Reddy, T B K; Thomas, Alex D; Stamatis, Dimitri; Bertsch, Jon; Isbandi, Michelle; Jansson, Jakob; Mallajosyula, Jyothi; Pagani, Ioanna; Lobos, Elizabeth A; Kyrpides, Nikos C

    2015-01-01

    The Genomes OnLine Database (GOLD; http://www.genomesonline.org) is a comprehensive online resource to catalog and monitor genetic studies worldwide. GOLD provides up-to-date status on complete and ongoing sequencing projects along with a broad array of curated metadata. Here we report version 5 (v.5) of the database. The newly designed database schema and web user interface supports several new features including the implementation of a four level (meta)genome project classification system and a simplified intuitive web interface to access reports and launch search tools. The database currently hosts information for about 19,200 studies, 56,000 Biosamples, 56,00