WorldWideScience

Sample records for underlying genomic variation

  1. Genome-wide association study identified CNP12587 region underlying height variation in Chinese females.

    Directory of Open Access Journals (Sweden)

    Yin-Ping Zhang

    Human height is a highly heritable trait considered an important factor for health. There has been limited success in identifying the genetic factors underlying height variation. We aim to identify sequence variants associated with adult height by a genome-wide association study of copy number variants (CNVs) in Chinese. Genome-wide CNV association analyses were conducted in 1,625 unrelated Chinese adults and in sex-specific subgroups for height variation. Height was measured with a stadiometer. The Affymetrix SNP6.0 genotyping platform was used to identify copy number polymorphisms (CNPs). We constructed a genomic map containing 1,009 CNPs in Chinese individuals and performed a genome-wide association study of CNPs with height. We detected 10 significant association signals for height (p<0.05) in the whole population, and 9 and 11 association signals for the Chinese female and male populations, respectively. A copy number polymorphism (CNP12587, chr18:54081842-54086942, p = 2.41 × 10^-4) was found to be significantly associated with height variation in Chinese females even after strict Bonferroni correction (p = 0.048). Confirmatory real-time PCR experiments lent further support for CNV validation. Compared to female subjects with two copies of the CNP, carriers of three copies had an average of 8.1% decrease in height. An important candidate gene, ubiquitin-protein ligase NEDD4-like (NEDD4L), was detected at this region, which plays important roles in bone metabolism by binding to bone formation regulators. Our findings suggest the important genetic variants underlying height variation in Chinese.
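The Bonferroni correction mentioned in this abstract can be sketched as follows. The abstract does not state how many tests entered the correction, so the test count below is only back-calculated from the two reported p-values and is an assumption, not a figure from the study. The function name `bonferroni` is illustrative.

```python
def bonferroni(p_raw: float, n_tests: int) -> float:
    """Bonferroni-adjusted p-value: raw p times the number of tests, capped at 1."""
    return min(1.0, p_raw * n_tests)


P_RAW = 2.41e-4      # reported uncorrected p-value for CNP12587 in females
P_CORRECTED = 0.048  # reported p-value after Bonferroni correction

# Assumption: infer the number of tests from the ratio of the two reported values.
implied_tests = round(P_CORRECTED / P_RAW)

# Re-apply the correction with the inferred test count as a consistency check.
adjusted = bonferroni(P_RAW, implied_tests)
print(implied_tests, round(adjusted, 3))
```

Under this reading the two reported values are consistent with a correction over roughly two hundred tests; the abstract does not say which set of tests was corrected for.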

  2. HGVA: the Human Genome Variation Archive

    OpenAIRE

    Lopez, Javier; Coll, Jacobo; Haimel, Matthias; Kandasamy, Swaathi; Tarraga, Joaquin; Furio-Tari, Pedro; Bari, Wasim; Bleda, Marta; Rueda, Antonio; Gräf, Stefan; Rendon, Augusto; Dopazo, Joaquin; Medina, Ignacio

    2017-01-01

    High-profile genomic variation projects like the 1000 Genomes project or the Exome Aggregation Consortium, are generating a wealth of human genomic variation knowledge which can be used as an essential reference for identifying disease-causing genotypes. However, accessing these data, contrasting the various studies and integrating those data in downstream analyses remains cumbersome. The Human Genome Variation Archive (HGVA) tackles these challenges and facilitates access to genomic...

  3. GFVO: the Genomic Feature and Variation Ontology

    KAUST Repository

    Baran, Joachim

    2015-05-05

    Falling costs in genomic laboratory experiments have led to a steady increase of genomic feature and variation data. Multiple genomic data formats exist for sharing these data, and whilst they are similar, they address slightly different data viewpoints and are consequently not fully compatible with each other. The fragmentation of data format specifications makes it hard to integrate and interpret data for further analysis with information from multiple data providers. As a solution, a new ontology is presented here for annotating and representing genomic feature and variation dataset contents. The Genomic Feature and Variation Ontology (GFVO) specifically addresses genomic data as it is regularly shared using the GFF3 (incl. FASTA), GTF, GVF and VCF file formats. GFVO simplifies data integration and enables linking of genomic annotations across datasets through common semantics of genomic types and relations. Availability and implementation. The latest stable release of the ontology is available via its base URI; previous and development versions are available at the ontology’s GitHub repository: https://github.com/BioInterchange/Ontologies; versions of the ontology are indexed through BioPortal (without external class-/property-equivalences due to BioPortal release 4.10 limitations); examples and reference documentation are provided on a separate web-page: http://www.biointerchange.org/ontologies.html. GFVO version 1.0.2 is licensed under the CC0 1.0 Universal license (https://creativecommons.org/publicdomain/zero/1.0) and therefore de facto within the public domain; the ontology can be appropriated without attribution for commercial and non-commercial use.

  4. Genome Variation Map: a data repository of genome variations in BIG Data Center

    OpenAIRE

    Song, Shuhui; Tian, Dongmei; Li, Cuiping; Tang, Bixia; Dong, Lili; Xiao, Jingfa; Bao, Yiming; Zhao, Wenming; He, Hang; Zhang, Zhang

    2017-01-01

    The Genome Variation Map (GVM; http://bigd.big.ac.cn/gvm/) is a public data repository of genome variations. As a core resource in the BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, GVM is dedicated to collecting, integrating and visualizing genome variations for a wide range of species, accepts submissions of different types of genome variations from all over the world and provides free open access to all publicly available data in support of worldwide research a...

  5. Structural genomic variations and Parkinson's disease.

    Science.gov (United States)

    Bandrés-Ciga, Sara; Ruz, Clara; Barrero, Francisco J; Escamilla-Sevilla, Francisco; Pelegrina, Javier; Vives, Francisco; Duran, Raquel

    2017-10-01

    Parkinson's disease (PD) is the second most common neurodegenerative disease, whose prevalence is projected to be between 8.7 and 9.3 million by 2030. Until about 20 years ago, PD was considered to be the textbook example of a "non-genetic" disorder. Nowadays, PD is generally considered a multifactorial disorder that arises from the combination and complex interaction of genes and environmental factors. To date, a total of 7 genes including SNCA, LRRK2, PARK2, DJ-1, PINK1, VPS35 and ATP13A2 have been shown unequivocally to cause Mendelian PD. Also, variants with incomplete penetrance in the genes LRRK2 and GBA are considered to be strong risk factors for PD worldwide. Although genetic studies have provided valuable insights into the pathogenic mechanisms underlying PD, the role of structural variation in PD has been understudied in comparison with other genomic variations. Structural genomic variations might substantially account for such genetic substrates yet to be discovered. The present review aims to provide an overview of the structural genomic variants implicated in the pathogenesis of PD.

  6. Genomic regions, cellular components and gene regulatory basis underlying pod length variations in cowpea (V. unguiculata L. Walp).

    Science.gov (United States)

    Xu, Pei; Wu, Xinyi; Muñoz-Amatriaín, María; Wang, Baogen; Wu, Xiaohua; Hu, Yaowen; Huynh, Bao-Lam; Close, Timothy J; Roberts, Philip A; Zhou, Wen; Lu, Zhongfu; Li, Guojing

    2017-05-01

    Cowpea (V. unguiculata L. Walp) is a climate resilient legume crop important for food security. Cultivated cowpea (V. unguiculata L) generally comprises the bushy, short-podded grain cowpea dominant in Africa and the climbing, long-podded vegetable cowpea popular in Asia. How selection has contributed to the diversification of the two types of cowpea remains largely unknown. In the current study, a novel genotyping assay for over 50 000 SNPs was employed to delineate genomic regions governing pod length. Major, minor and epistatic QTLs were identified through QTL mapping. Seventy-two SNPs associated with pod length were detected by genome-wide association studies (GWAS). Population stratification analysis revealed subdivision among a cowpea germplasm collection consisting of 299 accessions, which is consistent with pod length groups. Genomic scan for selective signals suggested that domestication of vegetable cowpea was accompanied by selection of multiple traits including pod length, while the further improvement process was featured by selection of pod length primarily. Pod growth kinetics assay demonstrated that more durable cell proliferation rather than cell elongation or enlargement was the main reason for longer pods. Transcriptomic analysis suggested the involvement of sugar, gibberellin and nutritional signalling in regulation of pod length. This study establishes the basis for map-based cloning of pod length genes in cowpea and for marker-assisted selection of this trait in breeding programmes. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

  7. From genomic variation to personalized medicine

    DEFF Research Database (Denmark)

    Wesolowska, Agata; Schmiegelow, Kjeld

    Genomic variation is the basis of interindividual differences in observable traits and disease susceptibility. Genetic studies are the driving force of personalized medicine, as many of the differences in treatment efficacy can be attributed to our genomic background. The rapid development...... a considerable amount of the phenotype variability, hence the major difficulty of interpretation lies in the complexity of molecular interactions. This PhD thesis describes the state-of-art of the functional human variation research (Chapter 1) and introduces childhood acute lymphoblastic leukaemia (ALL...... the thesis and includes some final remarks on the perspectives of genomic variation research and personalized medicine. In summary, this thesis demonstrates the feasibility of integrative analyses of genomic variations and introduces large-scale hypothesis-driven SNP exploration studies as an emerging...

  8. Genomics technologies to study structural variations in the grapevine genome

    Directory of Open Access Journals (Sweden)

    Cardone Maria Francesca

    2016-01-01

    Grapevine is one of the most important crop plants in the world. Recently there has been a great expansion of genomics resources for the grapevine genome, supporting increasing efforts in molecular breeding. Current cultivars display a great level of inter-specific differentiation that needs to be investigated to reach a comprehensive understanding of the genetic basis of phenotypic differences, and to find responsible genes selected by cross breeding programs. While there have been significant advances in resolving the pattern and nature of single nucleotide polymorphisms (SNPs) in plant genomes, few data are available on copy number variation (CNV). Furthermore, association between structural variations and phenotypes has been described in only a few cases. We combined high-throughput biotechnologies and bioinformatics tools to reveal the first inter-varietal atlas of structural variation (SV) for the grapevine genome. We sequenced and compared four table grape cultivars with the Pinot noir inbred line PN40024 genome as the reference. We found that roughly 8% of the grapevine genome is affected by genomic variations. Taking into account the phenotypic differences existing among the studied varieties, we compared SVs among them and the reference, and then performed an in-depth analysis of the gene content of polymorphic regions. This allowed us to identify genes showing differences in copy number as putative functional candidates for important traits in grapevine cultivation.

  9. HGVA: the Human Genome Variation Archive.

    Science.gov (United States)

    Lopez, Javier; Coll, Jacobo; Haimel, Matthias; Kandasamy, Swaathi; Tarraga, Joaquin; Furio-Tari, Pedro; Bari, Wasim; Bleda, Marta; Rueda, Antonio; Gräf, Stefan; Rendon, Augusto; Dopazo, Joaquin; Medina, Ignacio

    2017-07-03

    High-profile genomic variation projects like the 1000 Genomes project or the Exome Aggregation Consortium, are generating a wealth of human genomic variation knowledge which can be used as an essential reference for identifying disease-causing genotypes. However, accessing these data, contrasting the various studies and integrating those data in downstream analyses remains cumbersome. The Human Genome Variation Archive (HGVA) tackles these challenges and facilitates access to genomic data for key reference projects in a clean, fast and integrated fashion. HGVA provides an efficient and intuitive web-interface for easy data mining, a comprehensive RESTful API and client libraries in Python, Java and JavaScript for fast programmatic access to its knowledge base. HGVA calculates population frequencies for these projects and enriches their data with variant annotation provided by CellBase, a rich and fast annotation solution. HGVA serves as a proof-of-concept of the genome analysis developments being carried out by the University of Cambridge together with UK's 100 000 genomes project and the National Institute for Health Research BioResource Rare-Diseases, in particular, deploying open-source for Computational Biology (OpenCB) software platform for storing and analyzing massive genomic datasets. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Insights into structural variations and genome rearrangements in prokaryotic genomes.

    Science.gov (United States)

    Periwal, Vinita; Scaria, Vinod

    2015-01-01

    Structural variations (SVs) are genomic rearrangements that affect fairly large fragments of DNA. Most of the SVs such as inversions, deletions and translocations have been largely studied in context of genetic diseases in eukaryotes. However, recent studies demonstrate that genome rearrangements can also have profound impact on prokaryotic genomes, leading to altered cell phenotype. In contrast to single-nucleotide variations, SVs provide a much deeper insight into organization of bacterial genomes at a much better resolution. SVs can confer change in gene copy number, creation of new genes, altered gene expression and many other functional consequences. High-throughput technologies have now made it possible to explore SVs at a much refined resolution in bacterial genomes. Through this review, we aim to highlight the importance of the less explored field of SVs in prokaryotic genomes and their impact. We also discuss its potential applicability in the emerging fields of synthetic biology and genome engineering where targeted SVs could serve to create sophisticated and accurate genome editing. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  11. Genomic Sequence Variation Markup Language (GSVML).

    Science.gov (United States)

    Nakaya, Jun; Kimura, Michio; Hiroi, Kaei; Ido, Keisuke; Yang, Woosung; Tanaka, Hiroshi

    2010-02-01

    With the aim of making good use of internationally accumulated genomic sequence variation data, which is increasing rapidly due to the explosive amount of genomic research at present, the development of an interoperable data exchange format and its international standardization are necessary. Genomic Sequence Variation Markup Language (GSVML) will focus on genomic sequence variation data and human health applications, such as gene-based medicine or pharmacogenomics. We developed GSVML through eight steps, based on case analysis and domain investigations. By focusing on the design scope to human health applications and genomic sequence variation, we attempted to eliminate ambiguity and to ensure practicability. We intended to satisfy the requirements derived from the use case analysis of human-based clinical genomic applications. Based on database investigations, we attempted to minimize the redundancy of the data format, while maximizing the data covering range. We also attempted to ensure communication and interface ability with other Markup Languages, for exchange of omics data among various omics researchers or facilities. The interface ability with developing clinical standards, such as the Health Level Seven Genotype Information model, was analyzed. We developed the human health-oriented GSVML comprising variation data, direct annotation, and indirect annotation categories; the variation data category is required, while the direct and indirect annotation categories are optional. The annotation categories contain omics and clinical information, and have internal relationships. For designing, we examined 6 cases for three criteria as human health application and 15 data elements for three criteria as data formats for genomic sequence variation data exchange. The data format of five international SNP databases and six Markup Languages and the interface ability to the Health Level Seven Genotype Model in terms of 317 items were investigated. GSVML was developed as...

  12. Big Data Analysis of Human Genome Variations

    KAUST Repository

    Gojobori, Takashi

    2016-01-25

    Since the draft human genome sequence was first made public in 2000, genomic analyses have been intensively extended to the population level. The following three international projects are good examples of large-scale studies of human genome variations: 1) HapMap Data (1,417 individuals) (http://hapmap.ncbi.nlm.nih.gov/downloads/genotypes/2010-08_phaseII+III/forward/), 2) HGDP (Human Genome Diversity Project) Data (940 individuals) (http://www.hagsc.org/hgdp/files.html), 3) 1000 genomes Data (2,504 individuals) (http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/). If we can integrate all three data sets into a single volume of data, we should be able to conduct a more detailed analysis of human genome variations for a total of 4,861 individuals (= 1,417+940+2,504 individuals). In fact, we successfully integrated these three data sets by use of information on the reference human genome sequence, and we conducted the big data analysis. In particular, we constructed a phylogenetic tree of about 5,000 human individuals at the genome level. As a result, we were able to identify clusters of ethnic groups, with detectable admixture, that could not be identified by an analysis of each of the three data sets alone. Here, we report the outcome of this kind of big data analysis and discuss the evolutionary significance of human genomic variations. Note that the present study was conducted in collaboration with Katsuhiko Mineta and Kosuke Goto at KAUST.
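The combined cohort size quoted in this abstract can be checked with a few lines of arithmetic. This is a minimal sketch that treats the three projects as disjoint sets of individuals, as the abstract's own sum does; the dictionary keys are just labels chosen for illustration.

```python
# Cohort sizes as quoted in the abstract.
cohorts = {
    "HapMap": 1417,
    "HGDP": 940,
    "1000 Genomes": 2504,
}

# Simple sum: assumes no individual appears in more than one project.
total = sum(cohorts.values())

# Share of the combined data set contributed by each project.
for name, n in cohorts.items():
    print(f"{name}: {n} individuals ({n / total:.1%} of total)")
print(f"Total: {total}")
```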

  13. Copy number variation in the bovine genome

    DEFF Research Database (Denmark)

    Fadista, João; Thomsen, Bo; Holm, Lars-Erik

    2010-01-01

    to genetic variation in cattle. Results: We designed and used a set of NimbleGen CGH arrays that tile across the assayable portion of the cattle genome with approximately 6.3 million probes, at a median probe spacing of 301 bp. This study reports the highest resolution map of copy number variation...... in the cattle genome, with 304 CNV regions (CNVRs) being identified among the genomes of 20 bovine samples from 4 dairy and beef breeds. The CNVRs identified covered 0.68% (22 Mb) of the genome, and ranged in size from 1.7 to 2,031 kb (median size 16.7 kb). About 20% of the CNVs co-localized with segmental...... duplications, while 30% encompass genes, of which the majority is involved in environmental response. About 10% of the human orthologues of these genes are associated with human disease susceptibility and, hence, may have important phenotypic consequences. Conclusions: Together, this analysis provides a useful...

  14. Genome Variation Map: a data repository of genome variations in BIG Data Center.

    Science.gov (United States)

    Song, Shuhui; Tian, Dongmei; Li, Cuiping; Tang, Bixia; Dong, Lili; Xiao, Jingfa; Bao, Yiming; Zhao, Wenming; He, Hang; Zhang, Zhang

    2018-01-04

    The Genome Variation Map (GVM; http://bigd.big.ac.cn/gvm/) is a public data repository of genome variations. As a core resource in the BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, GVM is dedicated to collecting, integrating and visualizing genome variations for a wide range of species, accepts submissions of different types of genome variations from all over the world and provides free open access to all publicly available data in support of worldwide research activities. Unlike existing related databases, GVM features integration of a large number of genome variations for a broad diversity of species including human, cultivated plants and domesticated animals. Specifically, the current implementation of GVM not only houses a total of ∼4.9 billion variants for 19 species including chicken, dog, goat, human, poplar, rice and tomato, but also incorporates 8669 individual genotypes and 13 262 manually curated high-quality genotype-to-phenotype associations for non-human species. In addition, GVM provides friendly, intuitive web interfaces for data submission, browsing, search and visualization. Collectively, GVM serves as an important resource for archiving genomic variation data, helpful for better understanding population genetic diversity and deciphering complex mechanisms associated with different phenotypes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Genome Variation Map: a data repository of genome variations in BIG Data Center

    Science.gov (United States)

    Tian, Dongmei; Li, Cuiping; Tang, Bixia; Dong, Lili; Xiao, Jingfa; Bao, Yiming; Zhao, Wenming; He, Hang

    2018-01-01

    The Genome Variation Map (GVM; http://bigd.big.ac.cn/gvm/) is a public data repository of genome variations. As a core resource in the BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, GVM is dedicated to collecting, integrating and visualizing genome variations for a wide range of species, accepts submissions of different types of genome variations from all over the world and provides free open access to all publicly available data in support of worldwide research activities. Unlike existing related databases, GVM features integration of a large number of genome variations for a broad diversity of species including human, cultivated plants and domesticated animals. Specifically, the current implementation of GVM not only houses a total of ∼4.9 billion variants for 19 species including chicken, dog, goat, human, poplar, rice and tomato, but also incorporates 8669 individual genotypes and 13 262 manually curated high-quality genotype-to-phenotype associations for non-human species. In addition, GVM provides friendly, intuitive web interfaces for data submission, browsing, search and visualization. Collectively, GVM serves as an important resource for archiving genomic variation data, helpful for better understanding population genetic diversity and deciphering complex mechanisms associated with different phenotypes. PMID:29069473

  16. Structural genomic variation in ischemic stroke

    Science.gov (United States)

    Matarin, Mar; Simon-Sanchez, Javier; Fung, Hon-Chung; Scholz, Sonja; Gibbs, J. Raphael; Hernandez, Dena G.; Crews, Cynthia; Britton, Angela; Wavrant De Vrieze, Fabienne; Brott, Thomas G.; Brown, Robert D.; Worrall, Bradford B.; Silliman, Scott; Case, L. Douglas; Hardy, John A.; Rich, Stephen S.; Meschia, James F.; Singleton, Andrew B.

    2008-01-01

    Technological advances in molecular genetics allow rapid and sensitive identification of genomic copy number variants (CNVs). This, in turn, has sparked interest in the role such variation may play in disease. While a role for copy number mutations as a cause of Mendelian disorders is well established, it is unclear whether CNVs may affect risk for common complex disorders. We sought to investigate whether CNVs may modulate risk for ischemic stroke (IS) and to provide a catalog of CNVs in patients with this disorder by analyzing copy number metrics produced as a part of our previous genome-wide single-nucleotide polymorphism (SNP)-based association study of ischemic stroke in a North American white population. We examined CNVs in 263 patients with IS. Each identified CNV was compared with changes identified in 275 neurologically normal controls. Our analysis identified 247 CNVs, corresponding to 187 insertions (76%; 135 heterozygous; 25 homozygous duplications or triplications; 2 heterosomic) and 60 deletions (24%; 40 heterozygous deletions; 3 homozygous deletions; 14 heterosomic deletions). Most alterations (81%) were the same as, or overlapped with, previously reported CNVs. We report here the first genome-wide analysis of CNVs in IS patients. In summary, our study did not detect any common genomic structural variation unequivocally linked to IS, although we cannot exclude that smaller CNVs or CNVs in genomic regions poorly covered by this methodology may confer risk for IS. The application of genome-wide SNP arrays now facilitates the evaluation of structural changes through the entire genome as part of a genome-wide genetic association study. PMID:18288507
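The insertion and deletion percentages reported in this abstract follow directly from the counts. A minimal sketch of that arithmetic, using only the top-level figures quoted above (variable names are illustrative):

```python
# Top-level counts as quoted in the abstract.
total_cnvs = 247
insertions = 187
deletions = 60

# The two categories should account for every identified CNV.
assert insertions + deletions == total_cnvs

# Percentages of all CNVs, before rounding.
pct_insertions = 100 * insertions / total_cnvs
pct_deletions = 100 * deletions / total_cnvs

print(f"insertions: {pct_insertions:.1f}% (reported as {round(pct_insertions)}%)")
print(f"deletions:  {pct_deletions:.1f}% (reported as {round(pct_deletions)}%)")
```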

  17. Copy Number Variations in Tilapia Genomes.

    Science.gov (United States)

    Li, Bi Jun; Li, Hong Lian; Meng, Zining; Zhang, Yong; Lin, Haoran; Yue, Gen Hua; Xia, Jun Hong

    2017-02-01

    Discovering the nature and pattern of genome variation is fundamental in understanding phenotypic diversity among populations. Although several millions of single nucleotide polymorphisms (SNPs) have been discovered in tilapia, the genome-wide characterization of larger structural variants, such as copy number variation (CNV) regions, has not been carried out yet. We conducted a genome-wide scan for CNVs in 47 individuals from three tilapia populations. Based on 254 Gb of high-quality paired-end sequencing reads, we identified 4642 distinct high-confidence CNVs. These CNVs account for 1.9% (12.411 Mb) of the used Nile tilapia reference genome. A total of 1100 predicted CNVs were found overlapping with exon regions of protein genes. Further association analysis based on linear model regression found 85 CNVs ranging between 300 and 27,000 base pairs significantly associated with population types (R² > 0.9 and P > 0.001). Our study provides first insights into genome-wide CNVs in tilapia. These CNVs among and within tilapia populations may have functional effects on phenotypes and specific adaptation to particular environments.

  18. The African Genome Variation Project shapes medical genetics in Africa

    Science.gov (United States)

    Gurdasani, Deepti; Carstensen, Tommy; Tekola-Ayele, Fasil; Pagani, Luca; Tachmazidou, Ioanna; Hatzikotoulas, Konstantinos; Karthikeyan, Savita; Iles, Louise; Pollard, Martin O.; Choudhury, Ananyo; Ritchie, Graham R. S.; Xue, Yali; Asimit, Jennifer; Nsubuga, Rebecca N.; Young, Elizabeth H.; Pomilla, Cristina; Kivinen, Katja; Rockett, Kirk; Kamali, Anatoli; Doumatey, Ayo P.; Asiki, Gershim; Seeley, Janet; Sisay-Joof, Fatoumatta; Jallow, Muminatou; Tollman, Stephen; Mekonnen, Ephrem; Ekong, Rosemary; Oljira, Tamiru; Bradman, Neil; Bojang, Kalifa; Ramsay, Michele; Adeyemo, Adebowale; Bekele, Endashaw; Motala, Ayesha; Norris, Shane A.; Pirie, Fraser; Kaleebu, Pontiano; Kwiatkowski, Dominic; Tyler-Smith, Chris; Rotimi, Charles; Zeggini, Eleftheria; Sandhu, Manjinder S.

    2014-01-01

    Given the importance of Africa to studies of human origins and disease susceptibility, detailed characterisation of African genetic diversity is needed. The African Genome Variation Project (AGVP) provides a resource to help design, implement and interpret genomic studies in sub-Saharan Africa (SSA) and worldwide. The AGVP represents dense genotypes from 1,481 and whole genome sequences (WGS) from 320 individuals across SSA. Using this resource, we find novel evidence of complex, regionally distinct hunter-gatherer and Eurasian admixture across SSA. We identify new loci under selection, including for malaria and hypertension. We show that modern imputation panels can identify association signals at highly differentiated loci across populations in SSA. Using WGS, we show further improvement in imputation accuracy supporting efforts for large-scale sequencing of diverse African haplotypes. Finally, we present an efficient genotype array design capturing common genetic variation in Africa, showing for the first time that such designs are feasible. PMID:25470054

  19. The African Genome Variation Project shapes medical genetics in Africa.

    Science.gov (United States)

    Gurdasani, Deepti; Carstensen, Tommy; Tekola-Ayele, Fasil; Pagani, Luca; Tachmazidou, Ioanna; Hatzikotoulas, Konstantinos; Karthikeyan, Savita; Iles, Louise; Pollard, Martin O; Choudhury, Ananyo; Ritchie, Graham R S; Xue, Yali; Asimit, Jennifer; Nsubuga, Rebecca N; Young, Elizabeth H; Pomilla, Cristina; Kivinen, Katja; Rockett, Kirk; Kamali, Anatoli; Doumatey, Ayo P; Asiki, Gershim; Seeley, Janet; Sisay-Joof, Fatoumatta; Jallow, Muminatou; Tollman, Stephen; Mekonnen, Ephrem; Ekong, Rosemary; Oljira, Tamiru; Bradman, Neil; Bojang, Kalifa; Ramsay, Michele; Adeyemo, Adebowale; Bekele, Endashaw; Motala, Ayesha; Norris, Shane A; Pirie, Fraser; Kaleebu, Pontiano; Kwiatkowski, Dominic; Tyler-Smith, Chris; Rotimi, Charles; Zeggini, Eleftheria; Sandhu, Manjinder S

    2015-01-15

    Given the importance of Africa to studies of human origins and disease susceptibility, detailed characterization of African genetic diversity is needed. The African Genome Variation Project provides a resource with which to design, implement and interpret genomic studies in sub-Saharan Africa and worldwide. The African Genome Variation Project represents dense genotypes from 1,481 individuals and whole-genome sequences from 320 individuals across sub-Saharan Africa. Using this resource, we find novel evidence of complex, regionally distinct hunter-gatherer and Eurasian admixture across sub-Saharan Africa. We identify new loci under selection, including loci related to malaria susceptibility and hypertension. We show that modern imputation panels (sets of reference genotypes from which unobserved or missing genotypes in study sets can be inferred) can identify association signals at highly differentiated loci across populations in sub-Saharan Africa. Using whole-genome sequencing, we demonstrate further improvements in imputation accuracy, strengthening the case for large-scale sequencing efforts of diverse African haplotypes. Finally, we present an efficient genotype array design capturing common genetic variation in Africa.

  20. The African Genome Variation Project shapes medical genetics in Africa

    Science.gov (United States)

    Gurdasani, Deepti; Carstensen, Tommy; Tekola-Ayele, Fasil; Pagani, Luca; Tachmazidou, Ioanna; Hatzikotoulas, Konstantinos; Karthikeyan, Savita; Iles, Louise; Pollard, Martin O.; Choudhury, Ananyo; Ritchie, Graham R. S.; Xue, Yali; Asimit, Jennifer; Nsubuga, Rebecca N.; Young, Elizabeth H.; Pomilla, Cristina; Kivinen, Katja; Rockett, Kirk; Kamali, Anatoli; Doumatey, Ayo P.; Asiki, Gershim; Seeley, Janet; Sisay-Joof, Fatoumatta; Jallow, Muminatou; Tollman, Stephen; Mekonnen, Ephrem; Ekong, Rosemary; Oljira, Tamiru; Bradman, Neil; Bojang, Kalifa; Ramsay, Michele; Adeyemo, Adebowale; Bekele, Endashaw; Motala, Ayesha; Norris, Shane A.; Pirie, Fraser; Kaleebu, Pontiano; Kwiatkowski, Dominic; Tyler-Smith, Chris; Rotimi, Charles; Zeggini, Eleftheria; Sandhu, Manjinder S.

    2015-01-01

    Given the importance of Africa to studies of human origins and disease susceptibility, detailed characterization of African genetic diversity is needed. The African Genome Variation Project provides a resource with which to design, implement and interpret genomic studies in sub-Saharan Africa and worldwide. The African Genome Variation Project represents dense genotypes from 1,481 individuals and whole-genome sequences from 320 individuals across sub-Saharan Africa. Using this resource, we find novel evidence of complex, regionally distinct hunter-gatherer and Eurasian admixture across sub-Saharan Africa. We identify new loci under selection, including loci related to malaria susceptibility and hypertension. We show that modern imputation panels (sets of reference genotypes from which unobserved or missing genotypes in study sets can be inferred) can identify association signals at highly differentiated loci across populations in sub-Saharan Africa. Using whole-genome sequencing, we demonstrate further improvements in imputation accuracy, strengthening the case for large-scale sequencing efforts of diverse African haplotypes. Finally, we present an efficient genotype array design capturing common genetic variation in Africa.

  1. Regions of the bread wheat D genome associated with variation in key photosynthesis traits and shoot biomass under both well watered and water deficient conditions.

    Science.gov (United States)

    Osipova, Svetlana; Permyakov, Alexey; Permyakova, Marina; Pshenichnikova, Tatyana; Verkhoturov, Vasiliy; Rudikovsky, Alexandr; Rudikovskaya, Elena; Shishparenok, Alexandr; Doroshkov, Alexey; Börner, Andreas

    2016-05-01

    A quantitative trait locus (QTL) approach was taken to reveal the genetic basis in wheat of traits associated with photosynthesis during a period of exposure to water deficit stress. The performance, with respect to shoot biomass, gas exchange and chlorophyll fluorescence, leaf pigment content and the activity of various ascorbate-glutathione cycle enzymes and catalase, of a set of 80 wheat lines, each containing a single chromosomal segment introgressed from the bread wheat D genome progenitor Aegilops tauschii, was monitored in plants exposed to various water regimes. Four of the seven D genome chromosomes (1D, 2D, 5D, and 7D) carried clusters of both major (LOD >3.0) and minor (LOD between 2.0 and 3.0) QTL. A major QTL underlying the activity of glutathione reductase was located on chromosome 2D, and another, controlling the activity of ascorbate peroxidase, on chromosome 7D. A region of chromosome 2D defined by the microsatellite locus Xgwm539 and a second on chromosome 7D flanked by the marker loci Xgwm1242 and Xgwm44 harbored a number of QTL associated with the water deficit stress response.
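
    The major/minor distinction in this abstract is a simple threshold rule on the LOD score. A minimal sketch of that rule follows; the function name, the example scores, and the treatment of the boundary values (exactly 2.0 or 3.0 counted as minor) are assumptions for illustration, not details given in the abstract.

```python
def classify_qtl(lod: float) -> str:
    """Label a QTL by LOD score using the thresholds quoted in the abstract.

    major: LOD > 3.0
    minor: LOD between 2.0 and 3.0 (boundaries included here by assumption)
    """
    if lod > 3.0:
        return "major"
    if 2.0 <= lod <= 3.0:
        return "minor"
    return "below threshold"


# Hypothetical LOD scores, not values from the study.
example_scores = [3.5, 2.4, 1.9]
labels = [classify_qtl(s) for s in example_scores]
print(labels)
```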

  2. Causal Genetic Variation Underlying Metabolome Differences.

    Science.gov (United States)

    Swain-Lenz, Devjanee; Nikolskiy, Igor; Cheng, Jiye; Sudarsanam, Priya; Nayler, Darcy; Staller, Max V; Cohen, Barak A

    2017-08-01

    An ongoing challenge in biology is to predict the phenotypes of individuals from their genotypes. Genetic variants that cause disease often change an individual's total metabolite profile, or metabolome. In light of our extensive knowledge of metabolic pathways, genetic variants that alter the metabolome may help predict novel phenotypes. To link genetic variants to changes in the metabolome, we studied natural variation in the yeast Saccharomyces cerevisiae. We used an untargeted mass spectrometry method to identify dozens of metabolite Quantitative Trait Loci (mQTL), genomic regions containing genetic variation that control differences in metabolite levels between individuals. We mapped differences in urea cycle metabolites to genetic variation in specific genes known to regulate amino acid biosynthesis. Our functional assays reveal that genetic variation in two genes, AUA1 and ARG81, causes the differences in the abundance of several urea cycle metabolites. Based on knowledge of the urea cycle, we predicted and then validated a new phenotype: sensitivity to a particular class of amino acid isomers. Our results are a proof-of-concept that untargeted mass spectrometry can reveal links between natural genetic variants and metabolome diversity. The interpretability of our results demonstrates the promise of using genetic variants underlying natural differences in the metabolome to predict novel phenotypes from genotype. Copyright © 2017 by the Genetics Society of America.

  3. Genomic variation in Salmonella enterica core genes for epidemiological typing

    DEFF Research Database (Denmark)

    Leekitcharoenphon, Pimlapas; Lukjancenko, Oksana; Rundsten, Carsten Friis

    2012-01-01

    Background: Technological advances in high throughput genome sequencing are making whole genome sequencing (WGS) available as a routine tool for bacterial typing. Standardized procedures for identification of relevant genes and of variation are needed to enable comparison between studies and over...... genomes and evaluate their value as typing targets, comparing whole genome typing and traditional methods such as 16S and MLST. A consensus tree based on variation of core genes gives much better resolution than 16S and MLST; the pan-genome family tree is similar to the consensus tree, but with higher...... that there is a positive selection towards mutations leading to amino acid changes. Conclusions: Genomic variation within the core genome is useful for investigating molecular evolution and providing candidate genes for bacterial genome typing. Identification of genes with different degrees of variation is important...

  4. Genome size variation in the genus Avena.

    Science.gov (United States)

    Yan, Honghai; Martin, Sara L; Bekele, Wubishet A; Latta, Robert G; Diederichsen, Axel; Peng, Yuanying; Tinker, Nicholas A

    2016-03-01

    Genome size is an indicator of evolutionary distance and a metric for genome characterization. Here, we report accurate estimates of genome size in 99 accessions from 26 species of Avena. We demonstrate that the average genome size of C genome diploid species (2C = 10.26 pg) is 15% larger than that of A genome species (2C = 8.95 pg), and that this difference likely accounts for a progression of size among tetraploid species, where AB genome configuration had similar genome sizes (average 2C = 25.74 pg). Genome size was mostly consistent within species and in general agreement with current information about evolutionary distance among species. Results also suggest that most of the polyploid species in Avena have experienced genome downsizing in relation to their diploid progenitors. Genome size measurements could provide additional quality control for species identification in germplasm collections, especially in cases where diploid and polyploid species have similar morphology.
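
    The "15% larger" figure can be checked directly from the two 2C values quoted in the abstract. The short calculation below is only an arithmetic sketch; the function and variable names are illustrative.

```python
def percent_larger(value: float, reference: float) -> float:
    """Return how much larger `value` is than `reference`, in percent."""
    return (value - reference) / reference * 100.0


# 2C values (in pg) as quoted in the abstract.
c_genome_2c = 10.26  # average for C genome diploid species
a_genome_2c = 8.95   # average for A genome species

difference = percent_larger(c_genome_2c, a_genome_2c)
print(f"C genome is {difference:.1f}% larger than A genome")
```

    The result is a little under 15%, which matches the rounded figure reported.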

  5. Intrapopulation genome size variation in D. melanogaster reflects life history variation and plasticity.

    Directory of Open Access Journals (Sweden)

    Lisa L Ellis

    2014-07-01

    We determined female genome sizes using flow cytometry for 211 Drosophila melanogaster sequenced inbred strains from the Drosophila Genetic Reference Panel, and found significant conspecific and intrapopulation variation in genome size. We also compared several life history traits for 25 lines with large and 25 lines with small genomes in three thermal environments, and found that genome size as well as genome size by temperature interactions significantly correlated with survival to pupation and adulthood, time to pupation, female pupal mass, and female eclosion rates. Genome size accounted for up to 23% of the variation in developmental phenotypes, but the contribution of genome size to variation in life history traits was plastic and varied according to the thermal environment. Expression data implicate differences in metabolism that correspond to genome size variation. These results indicate that significant genome size variation exists within D. melanogaster and this variation may impact the evolutionary ecology of the species. Genome size variation accounts for a significant portion of life history variation in an environmentally dependent manner, suggesting that potential fitness effects associated with genome size variation also depend on environmental conditions.

  6. Intrapopulation Genome Size Variation in D. melanogaster Reflects Life History Variation and Plasticity

    Science.gov (United States)

    Ellis, Lisa L.; Huang, Wen; Quinn, Andrew M.; Ahuja, Astha; Alfrejd, Ben; Gomez, Francisco E.; Hjelmen, Carl E.; Moore, Kristi L.; Mackay, Trudy F. C.; Johnston, J. Spencer; Tarone, Aaron M.

    2014-01-01

    We determined female genome sizes using flow cytometry for 211 Drosophila melanogaster sequenced inbred strains from the Drosophila Genetic Reference Panel, and found significant conspecific and intrapopulation variation in genome size. We also compared several life history traits for 25 lines with large and 25 lines with small genomes in three thermal environments, and found that genome size as well as genome size by temperature interactions significantly correlated with survival to pupation and adulthood, time to pupation, female pupal mass, and female eclosion rates. Genome size accounted for up to 23% of the variation in developmental phenotypes, but the contribution of genome size to variation in life history traits was plastic and varied according to the thermal environment. Expression data implicate differences in metabolism that correspond to genome size variation. These results indicate that significant genome size variation exists within D. melanogaster and this variation may impact the evolutionary ecology of the species. Genome size variation accounts for a significant portion of life history variation in an environmentally dependent manner, suggesting that potential fitness effects associated with genome size variation also depend on environmental conditions. PMID:25057905

  7. Copy Number Variation in the Horse Genome

    Science.gov (United States)

    Ghosh, Sharmila; Qu, Zhipeng; Das, Pranab J.; Fang, Erica; Juras, Rytis; Cothran, E. Gus; McDonell, Sue; Kenney, Daniel G.; Lear, Teri L.; Adelson, David L.; Chowdhary, Bhanu P.; Raudsepp, Terje

    2014-01-01

    We constructed a 400K WG tiling oligoarray for the horse and applied it for the discovery of copy number variations (CNVs) in 38 normal horses of 16 diverse breeds, and the Przewalski horse. Probes on the array represented 18,763 autosomal and X-linked genes, and intergenic, sub-telomeric and chrY sequences. We identified 258 CNV regions (CNVRs) across all autosomes, chrX and chrUn, but not in chrY. CNVs comprised 1.3% of the horse genome with chr12 being most enriched. American Miniature horses had the highest and American Quarter Horses the lowest number of CNVs in relation to Thoroughbred reference. The Przewalski horse was similar to native ponies and draft breeds. The majority of CNVRs involved genes, while 20% were located in intergenic regions. Similar to previous studies in horses and other mammals, molecular functions of CNV-associated genes were predominantly in sensory perception, immunity and reproduction. The findings were integrated with previous studies to generate a composite genome-wide dataset of 1476 CNVRs. Of these, 301 CNVRs were shared between studies, while 1174 were novel and require further validation. Integrated data revealed that to date, 41 out of over 400 breeds of the domestic horse have been analyzed for CNVs, of which 11 new breeds were added in this study. Finally, the composite CNV dataset was applied in a pilot study for the discovery of CNVs in 6 horses with XY disorders of sexual development. A homozygous deletion involving AKR1C gene cluster in chr29 in two affected horses was considered possibly causative because of the known role of AKR1C genes in testicular androgen synthesis and sexual development. While the findings improve and integrate the knowledge of CNVs in horses, they also show that for effective discovery of variants of biomedical importance, more breeds and individuals need to be analyzed using comparable methodological approaches. PMID:25340504

  8. Copy number variation in the horse genome.

    Directory of Open Access Journals (Sweden)

    Sharmila Ghosh

    2014-10-01

    We constructed a 400K WG tiling oligoarray for the horse and applied it for the discovery of copy number variations (CNVs) in 38 normal horses of 16 diverse breeds, and the Przewalski horse. Probes on the array represented 18,763 autosomal and X-linked genes, and intergenic, sub-telomeric and chrY sequences. We identified 258 CNV regions (CNVRs) across all autosomes, chrX and chrUn, but not in chrY. CNVs comprised 1.3% of the horse genome with chr12 being most enriched. American Miniature horses had the highest and American Quarter Horses the lowest number of CNVs in relation to Thoroughbred reference. The Przewalski horse was similar to native ponies and draft breeds. The majority of CNVRs involved genes, while 20% were located in intergenic regions. Similar to previous studies in horses and other mammals, molecular functions of CNV-associated genes were predominantly in sensory perception, immunity and reproduction. The findings were integrated with previous studies to generate a composite genome-wide dataset of 1476 CNVRs. Of these, 301 CNVRs were shared between studies, while 1174 were novel and require further validation. Integrated data revealed that to date, 41 out of over 400 breeds of the domestic horse have been analyzed for CNVs, of which 11 new breeds were added in this study. Finally, the composite CNV dataset was applied in a pilot study for the discovery of CNVs in 6 horses with XY disorders of sexual development. A homozygous deletion involving AKR1C gene cluster in chr29 in two affected horses was considered possibly causative because of the known role of AKR1C genes in testicular androgen synthesis and sexual development. While the findings improve and integrate the knowledge of CNVs in horses, they also show that for effective discovery of variants of biomedical importance, more breeds and individuals need to be analyzed using comparable methodological approaches.

  9. Child Development and Structural Variation in the Human Genome

    Science.gov (United States)

    Zhang, Ying; Haraksingh, Rajini; Grubert, Fabian; Abyzov, Alexej; Gerstein, Mark; Weissman, Sherman; Urban, Alexander E.

    2013-01-01

    Structural variation of the human genome sequence is the insertion, deletion, or rearrangement of stretches of DNA sequence sized from around 1,000 to millions of base pairs. Over the past few years, structural variation has been shown to be far more common in human genomes than previously thought. Very little is currently known about the effects…

  10. Potential Value of Genomic Copy Number Variations in Schizophrenia

    Directory of Open Access Journals (Sweden)

    Chuanjun Zhuo

    2017-06-01

    Schizophrenia is a devastating neuropsychiatric disorder affecting approximately 1% of the global population, and the disease has imposed a considerable burden on families and society. Although the exact cause of schizophrenia remains unknown, several lines of scientific evidence have revealed that genetic variants are strongly correlated with the development and early onset of the disease. In fact, the heritability among patients suffering from schizophrenia is as high as 80%. Genomic copy number variations (CNVs) are one of the main forms of genomic variations, ubiquitously occurring in the human genome. An increasing number of studies have shown that CNVs account for population diversity and genetically related diseases, including schizophrenia. The last decade has witnessed rapid advances in the development of novel genomic technologies, which have led to the identification of schizophrenia-associated CNVs, insight into the roles of the affected genes in their intervals in schizophrenia, and successful manipulation of the target CNVs. In this review, we focus on the recent discoveries of important CNVs that are associated with schizophrenia and outline the potential values that the study of CNVs will bring to the areas of schizophrenia research, diagnosis, and therapy. Furthermore, with the help of the novel genetic tool known as the Clustered Regularly Interspaced Short Palindromic Repeats-associated nuclease 9 (CRISPR/Cas9) system, the pathogenic CNVs as genomic defects could be corrected. In conclusion, the recent novel findings of schizophrenia-associated CNVs offer an exciting opportunity for schizophrenia research to decipher the pathological mechanisms underlying the onset and development of schizophrenia as well as to provide potential clinical applications in genetic counseling, diagnosis, and therapy for this complex mental disease.

  11. Big Data Analysis of Human Genome Variations

    KAUST Repository

    Gojobori, Takashi

    2016-01-01

    Since the human genome draft sequence was first made public in 2000, genomic analyses have been intensively extended to the population level. The following three international projects are good examples for large-scale studies of human

  12. Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping

    DEFF Research Database (Denmark)

    Zhan, Bujie; Fadista, João; Thomsen, Bo

    2011-01-01

    Background Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. Results We report the integration of the whole genome...... of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation...

  13. Genome size, morphological and palynological variations, and ...

    African Journals Online (AJOL)

    The present study compares the morphological, palynologycal and genome size (C-value content) characteristics in the long-styled and short-styled plants in three Linum species, that is, ... The analysis of variance (ANOVA) test performed among the three Linum species showed a significant difference in 2C-value content.

  14. Genome Architecture and Its Roles in Human Copy Number Variation

    Directory of Open Access Journals (Sweden)

    Lu Chen

    2014-12-01

    Besides single-nucleotide variants in the human genome, large-scale genomic variants, such as copy number variations (CNVs), are being increasingly discovered as a genetic source of human diversity and the pathogenic factors of diseases. Recent experimental findings have shed light on the links between different genome architectures and CNV mutagenesis. In this review, we summarize various genomic features and discuss their contributions to CNV formation. Genomic repeats, including both low-copy and high-copy repeats, play important roles in CNV instability, which was initially known as DNA recombination events. Furthermore, it has been found that human genomic repeats can also induce DNA replication errors and consequently result in CNV mutations. Some recent studies showed that DNA replication timing, which reflects the high-order information of genomic organization, is involved in human CNV mutations. Our review highlights that genome architecture, from DNA sequence to high-order genomic organization, is an important molecular factor in CNV mutagenesis and human genomic instability.

  15. Salmon and steelhead genetics and genomics - Epigenetic and genomic variation in salmon and steelhead

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — Conduct analyses of epigenetic and genomic variation in Chinook salmon and steelhead to determine influence on phenotypic expression of life history traits. Genetic,...

  16. Bonobos fall within the genomic variation of chimpanzees.

    Directory of Open Access Journals (Sweden)

    Anne Fischer

    To gain insight into the patterns of genetic variation and evolutionary relationships within and between bonobos and chimpanzees, we sequenced 150,000 base pairs of nuclear DNA divided among 15 autosomal regions as well as the complete mitochondrial genomes from 20 bonobos and 58 chimpanzees. Except for western chimpanzees, we found poor genetic separation of chimpanzees based on sample locality. In contrast, bonobos consistently cluster together but fall as a group within the variation of chimpanzees for many of the regions. Thus, while chimpanzees retain genomic variation that predates bonobo-chimpanzee speciation, extensive lineage sorting has occurred within bonobos such that much of their genome traces its ancestry back to a single common ancestor that postdates their origin as a group separate from chimpanzees.

  17. Mapping copy number variation by population-scale genome sequencing

    DEFF Research Database (Denmark)

    Mills, Ryan E.; Walter, Klaudia; Stewart, Chip

    2011-01-01

    Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is......, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications...

  18. Genomic variation landscape of the human gut microbiome

    DEFF Research Database (Denmark)

    Schloissnig, Siegfried; Arumugam, Manimozhiyan; Sunagawa, Shinichi

    2013-01-01

    Whereas large-scale efforts have rapidly advanced the understanding and practical impact of human genomic variation, the practical impact of variation is largely unexplored in the human microbiome. We therefore developed a framework for metagenomic variation analysis and applied it to 252 faecal...... polymorphism rates of 0.11 was more variable between gut microbial species than across human hosts. Subjects sampled at varying time intervals exhibited individuality and temporal stability of SNP variation patterns, despite considerable composition changes of their gut microbiota. This indicates...

  19. The Organelle Genomes of Hassawi Rice (Oryza sativa L.) and Its Hybrid in Saudi Arabia: Genome Variation, Rearrangement, and Origins

    Science.gov (United States)

    Zhang, Tongwu; Hu, Songnian; Zhang, Guangyu; Pan, Linlin; Zhang, Xiaowei; Al-Mssallem, Ibrahim S.; Yu, Jun

    2012-01-01

    Hassawi rice (Oryza sativa L.) is a landrace adapted to the climate of Saudi Arabia, characterized by its strong resistance to soil salinity and drought. Using high quality sequencing reads extracted from raw data of a whole genome sequencing project, we assembled both chloroplast (cp) and mitochondrial (mt) genomes of the wild-type Hassawi rice (Hassawi-1) and its dwarf hybrid (Hassawi-2). We discovered 16 InDels (insertions and deletions) but no SNP (single nucleotide polymorphism) is present between the two Hassawi cp genomes. We identified 48 InDels and 26 SNPs in the two Hassawi mt genomes and a new type of sequence variation, termed reverse complementary variation (RCV) in the rice cp genomes. There are two and four RCVs identified in Hassawi-1 when compared to 93–11 (indica) and Nipponbare (japonica), respectively. Microsatellite sequence analysis showed there are more SSRs in the genic regions of both cp and mt genomes in the Hassawi rice than in the other rice varieties. There are also large repeats in the Hassawi mt genomes, with the longest length of 96,168 bp and 96,165 bp in Hassawi-1 and Hassawi-2, respectively. We believe that frequent DNA rearrangement in the Hassawi mt and cp genomes indicate ongoing dynamic processes to reach genetic stability under strong environmental pressures. Based on sequence variation analysis and the breeding history, we suggest that both Hassawi-1 and Hassawi-2 originated from the Indonesian variety Peta since genetic diversity between the two Hassawi cultivars is very low albeit an unknown historic origin of the wild-type Hassawi rice. PMID:22870184

  20. Genome-wide sequence variations among Mycobacterium avium subspecies paratuberculosis.

    Directory of Open Access Journals (Sweden)

    Chung-Yi Hsu

    2011-12-01

    Mycobacterium avium subspecies paratuberculosis (M. ap), the causative agent of Johne’s disease (JD), infects many farmed ruminants, wildlife animals and humans. To better understand the molecular pathogenesis of these infections, we analyzed the whole genome sequences of several M. ap and M. avium subspecies avium (M. avium) strains isolated from various hosts and environments. Using next-generation sequencing technology, we found that all 6 M. ap isolates showed a high percentage of homology (98%) to the reference genome sequence of M. ap K-10 isolated from cattle. However, 2 M. avium isolates (DT 78 and Env 77) showed significant sequence diversity from the reference strain M. avium 104. The genomes of M. avium isolates DT 78 and Env 77 exhibited only 87% and 40% homology, respectively, to the M. avium 104 reference genome. Within the M. ap isolates, genomic rearrangements (insertions/deletions, Indels) were not detected, and only unique single nucleotide polymorphisms (SNPs) were observed among the 6 M. ap strains. While most of the SNPs (~100) in M. ap genomes were non-synonymous, a total of ~6000 SNPs were detected among M. avium genomes, most of which were synonymous, suggesting a differential selective pressure between M. ap and M. avium isolates. In addition, SNP-based phylo-genomic analysis showed that isolates from goat and Oryx are closely related to the cattle (K-10) strain while the human isolate (M. ap 4B) is closely related to the environmental strains, indicating an environmental source of human infections. Overall, SNPs were the most common variations among M. ap isolates while SNPs in addition to Indels were prevalent among M. avium isolates. Genomic variations will be useful in designing host-specific markers for the analysis of mycobacterial evolution and for developing novel diagnostics directed against Johne’s disease in animals.

  1. Draft genome sequence of an elite Dura palm and whole-genome patterns of DNA variation in oil palm.

    Science.gov (United States)

    Jin, Jingjing; Lee, May; Bai, Bin; Sun, Yanwei; Qu, Jing; Rahmadsyah; Alfiko, Yuzer; Lim, Chin Huat; Suwanto, Antonius; Sugiharti, Maria; Wong, Limsoon; Ye, Jian; Chua, Nam-Hai; Yue, Gen Hua

    2016-12-01

    Oil palm is the world's leading source of vegetable oil and fat. Dura, Pisifera and Tenera are three forms of oil palm. The genome sequence of Pisifera is available whereas the Dura form has not been sequenced yet. We sequenced the genome of one elite Dura palm, and re-sequenced 17 palm genomes. The assembled genome sequence of the elite Dura tree contained 10,971 scaffolds and was 1.701 Gb in length, covering 94.49% of the oil palm genome. 36,105 genes were predicted. Re-sequencing of 17 additional palm trees identified 18.1 million SNPs. We found high genetic variation among palms from different geographical regions, but lower variation among Southeast Asian Dura and Pisifera palms. We mapped 10,000 SNPs on the linkage map of oil palm. In addition, high linkage disequilibrium (LD) was detected in the oil palms used in breeding populations of Southeast Asia, suggesting that LD mapping is likely to be practical in this important oil crop. Our data provide a valuable resource for accelerating genetic improvement and studying the mechanism underlying phenotypic variations of important oil palm traits. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
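
    The assembly length and coverage quoted above imply a total genome size, which the sketch below works out. It assumes that "covering 94.49%" means assembly length divided by estimated genome size; the abstract does not state how coverage was computed, so the result is only an implied figure.

```python
# Figures quoted in the abstract.
assembly_length_gb = 1.701   # assembled Dura genome, in Gb
coverage_fraction = 0.9449   # 94.49% of the oil palm genome

# Assumption: coverage = assembly length / total genome size.
implied_genome_size_gb = assembly_length_gb / coverage_fraction
print(f"Implied genome size: {implied_genome_size_gb:.2f} Gb")
```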

  2. A unifying model of genome evolution under parsimony.

    Science.gov (United States)

    Paten, Benedict; Zerbino, Daniel R; Hickey, Glenn; Haussler, David

    2014-06-19

    Parsimony and maximum likelihood methods of phylogenetic tree estimation and parsimony methods for genome rearrangements are central to the study of genome evolution yet to date they have largely been pursued in isolation. We present a data structure called a history graph that offers a practical basis for the analysis of genome evolution. It conceptually simplifies the study of parsimonious evolutionary histories by representing both substitutions and double cut and join (DCJ) rearrangements in the presence of duplications. The problem of constructing parsimonious history graphs thus subsumes related maximum parsimony problems in the fields of phylogenetic reconstruction and genome rearrangement. We show that tractable functions can be used to define upper and lower bounds on the minimum number of substitutions and DCJ rearrangements needed to explain any history graph. These bounds become tight for a special type of unambiguous history graph called an ancestral variation graph (AVG), which constrains in its combinatorial structure the number of operations required. We finally demonstrate that for a given history graph G, a finite set of AVGs describe all parsimonious interpretations of G, and this set can be explored with a few sampling moves. This theoretical study describes a model in which the inference of genome rearrangements and phylogeny can be unified under parsimony.

  3. GFVO: the Genomic Feature and Variation Ontology

    KAUST Repository

    Baran, Joachim; Durgahee, Bibi Sehnaaz Begum; Eilbeck, Karen; Antezana, Erick; Hoehndorf, Robert; Dumontier, Michel

    2015-01-01

    Availability and implementation. The latest stable release of the ontology is available via its base URI; previous and development versions are available at the ontology’s GitHub repository: https://github.com/BioInterchange/Ontologies; versions of the ontology are indexed through BioPortal (without external class-/property-equivalences due to BioPortal release 4.10 limitations); examples and reference documentation is provided on a separate web-page: http://www.biointerchange.org/ontologies.html. GFVO version 1.0.2 is licensed under the CC0 1.0 Universal license (https://creativecommons.org/publicdomain/zero/1.0) and therefore de facto within the public domain; the ontology can be appropriated without attribution for commercial and non-commercial use.

  4. Transformation of natural genetic variation into Haemophilus influenzae genomes.

    Directory of Open Access Journals (Sweden)

    Joshua Chang Mell

    2011-07-01

    Many bacteria are able to efficiently bind and take up double-stranded DNA fragments, and the resulting natural transformation shapes bacterial genomes, transmits antibiotic resistance, and allows escape from immune surveillance. The genomes of many competent pathogens show evidence of extensive historical recombination between lineages, but the actual recombination events have not been well characterized. We used DNA from a clinical isolate of Haemophilus influenzae to transform competent cells of a laboratory strain. To identify which of the ~40,000 polymorphic differences had recombined into the genomes of four transformed clones, their genomes and their donor and recipient parents were deep sequenced to high coverage. Each clone was found to contain ~1000 donor polymorphisms in 3-6 contiguous runs (8.1±4.5 kb in length) that collectively comprised ~1-3% of each transformed chromosome. Seven donor-specific insertions and deletions were also acquired as parts of larger donor segments, but the presence of other structural variation flanking 12 of 32 recombination breakpoints suggested that these often disrupt the progress of recombination events. This is the first genome-wide analysis of chromosomes directly transformed with DNA from a divergent genotype, connecting experimental studies of transformation with the high levels of natural genetic variation found in isolates of the same species.

  5. Gene copy number variation throughout the Plasmodium falciparum genome

    Directory of Open Access Journals (Sweden)

    Stewart Lindsay B

    2009-08-01

    Background: Gene copy number variation (CNV) is responsible for several important phenotypes of the malaria parasite Plasmodium falciparum, including drug resistance, loss of infected erythrocyte cytoadherence and alteration of receptor usage for erythrocyte invasion. Despite the known effects of CNV, little is known about its extent throughout the genome. Results: We performed a whole-genome survey of CNV genes in P. falciparum using comparative genome hybridisation of a diverse set of 16 laboratory culture-adapted isolates to a custom designed high density Affymetrix GeneChip array. Overall, 186 genes showed hybridisation signals consistent with deletion or amplification in one or more isolates. There is a strong association of CNV with gene length, genomic location, and low orthology to genes in other Plasmodium species. Sub-telomeric regions of all chromosomes are strongly associated with CNV genes independent from members of previously described multigene families. However, ~40% of CNV genes were located in more central regions of the chromosomes. Among the previously undescribed CNV genes, several that are of potential phenotypic relevance are identified. Conclusion: CNV represents a major form of genetic variation within the P. falciparum genome; the distribution of gene features indicates the involvement of highly non-random mutational and selective processes. Additional studies should be directed at examining CNV in natural parasite populations to extend conclusions to clinical settings.

  6. Genome size variation affects song attractiveness in grasshoppers: evidence for sexual selection against large genomes.

    Science.gov (United States)

    Schielzeth, Holger; Streitner, Corinna; Lampe, Ulrike; Franzke, Alexandra; Reinhold, Klaus

    2014-12-01

    Genome size is largely uncorrelated to organismal complexity and adaptive scenarios. Genetic drift as well as intragenomic conflict have been put forward to explain this observation. We here study the impact of genome size on sexual attractiveness in the bow-winged grasshopper Chorthippus biguttulus. Grasshoppers show particularly large variation in genome size due to the high prevalence of supernumerary chromosomes that are considered (mildly) selfish, as evidenced by non-Mendelian inheritance and fitness costs if present in high numbers. We ranked male grasshoppers by song characteristics that are known to affect female preferences in this species and scored genome sizes of attractive and unattractive individuals from the extremes of this distribution. We find that attractive singers have significantly smaller genomes, demonstrating that genome size is reflected in male courtship songs and that females prefer songs of males with small genomes. Such a genome size dependent mate preference effectively selects against selfish genetic elements that tend to increase genome size. The data therefore provide a novel example of how sexual selection can reinforce natural selection and can act as an agent in an intragenomic arms race. Furthermore, our findings indicate an underappreciated route of how choosy females could gain indirect benefits. © 2014 The Author(s). Evolution © 2014 The Society for the Study of Evolution.

  7. Somatic genomic variations in extra-embryonic tissues

    Energy Technology Data Exchange (ETDEWEB)

    Weier, Jingly F.; Ferlatte, Christy; Weier, Heinz-Ulli G.

    2010-05-21

    In the mature chorion, one of the membranes that exist during pregnancy between the developing fetus and mother, human placental cells form highly specialized tissues composed of mesenchyme and floating or anchoring villi. Using fluorescence in situ hybridization, we found that human invasive cytotrophoblasts isolated from anchoring villi or the uterine wall had gained individual chromosomes; however, chromosome losses were detected infrequently. With chromosomes gained in what appeared to be a chromosome-specific manner, more than half of the invasive cytotrophoblasts in normal pregnancies were found to be hyperdiploid. Interestingly, the rates of hyperdiploid cells depended not only on gestational age, but were strongly associated with the extraembryonic compartment at the fetal-maternal interface from which they were isolated. Since hyperdiploid cells showed drastically reduced DNA replication as measured by bromodeoxyuridine incorporation, we conclude that aneuploidy is a part of the normal process of placentation potentially limiting the proliferative capabilities of invasive cytotrophoblasts. Thus, under the special circumstances of human reproduction, somatic genomic variations may exert a beneficial, anti-neoplastic effect on the organism.

  8. Genome-wide variation in recombination rate in Eucalyptus.

    Science.gov (United States)

    Gion, Jean-Marc; Hudson, Corey J; Lesur, Isabelle; Vaillancourt, René E; Potts, Brad M; Freeman, Jules S

    2016-08-09

    Meiotic recombination is a fundamental evolutionary process. It not only generates diversity, but influences the efficacy of natural selection and genome evolution. There can be significant heterogeneity in recombination rates within and between species, however this variation is not well understood outside of a few model taxa, particularly in forest trees. Eucalypts are forest trees of global economic importance, and dominate many Australian ecosystems. We studied recombination rate in Eucalyptus globulus using genetic linkage maps constructed in 10 unrelated individuals, and markers anchored to the Eucalyptus reference genome. This experimental design provided the replication to study whether recombination rate varied between individuals and chromosomes, and allowed us to study the genomic attributes and population genetic parameters correlated with this variation. Recombination rate varied significantly between individuals (range = 2.71 to 3.51 centimorgans/megabase [cM/Mb]), but was not significantly influenced by sex or cross type (F1 vs. F2). Significant differences in recombination rate between chromosomes were also evident (range = 1.98 to 3.81 cM/Mb), beyond those which were due to variation in chromosome size. Variation in chromosomal recombination rate was significantly correlated with gene density (r = 0.94), GC content (r = 0.90), and the number of tandem duplicated genes (r = -0.72) per chromosome. Notably, chromosome level recombination rate was also negatively correlated with the average genetic diversity across six species from an independent set of samples (r = -0.75). The correlations with genomic attributes are consistent with findings in other taxa, however, the direction of the correlation between diversity and recombination rate is opposite to that commonly observed. We argue this is likely to reflect the interaction of selection and specific genome architecture of Eucalyptus. Interestingly, the differences amongst

  9. Genome-wide associations of gene expression variation in humans.

    Directory of Open Access Journals (Sweden)

    Barbara E Stranger

    2005-12-01

    Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs) with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis-) to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I) HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.

  11. Population genetic inference from personal genome data: impact of ancestry and admixture on human genomic variation.

    Science.gov (United States)

    Kidd, Jeffrey M; Gravel, Simon; Byrnes, Jake; Moreno-Estrada, Andres; Musharoff, Shaila; Bryc, Katarzyna; Degenhardt, Jeremiah D; Brisbin, Abra; Sheth, Vrunda; Chen, Rong; McLaughlin, Stephen F; Peckham, Heather E; Omberg, Larsson; Bormann Chung, Christina A; Stanley, Sarah; Pearlstein, Kevin; Levandowsky, Elizabeth; Acevedo-Acevedo, Suehelay; Auton, Adam; Keinan, Alon; Acuña-Alonzo, Victor; Barquera-Lozano, Rodrigo; Canizales-Quinteros, Samuel; Eng, Celeste; Burchard, Esteban G; Russell, Archie; Reynolds, Andy; Clark, Andrew G; Reese, Martin G; Lincoln, Stephen E; Butte, Atul J; De La Vega, Francisco M; Bustamante, Carlos D

    2012-10-05

    Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas: 70% of the European ancestry in today's African Americans dates back to European gene flow happening only 7-8 generations ago. Copyright © 2012 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  12. Discrepancy variation of dinucleotide microsatellite repeats in eukaryotic genomes

    Directory of Open Access Journals (Sweden)

    HUAN GAO

    2009-01-01

    Full Text Available To address whether there are differences of variation among repeat motif types and among taxonomic groups, we present here an analysis of variation and correlation of dinucleotide microsatellite repeats in eukaryotic genomes. Ten taxonomic groups were compared, those being primates, mammalia (excluding primates and rodentia), rodentia, birds, fish, amphibians and reptiles, insects, molluscs, plants and fungi, respectively. The data used in the analysis are from the literature published in the Journal of Molecular Ecology Notes. Analysis of variation reveals that there are no significant differences between AC and AG repeat motif types. Moreover, the number of alleles correlates positively with the copy number in both AG and AC repeats. Similar conclusions can be obtained from each taxonomic group. These results strongly suggest that the increase of SSR variation is almost linear with the increase of the copy number of each repeat motif. As well, the results suggest that the variability of SSR in the genomes of low-ranking species seems to be greater than that of high-ranking species, excluding primates and fungi.

  13. Genomic Variation in Natural Populations of Drosophila melanogaster

    Science.gov (United States)

    Langley, Charles H.; Stevens, Kristian; Cardeno, Charis; Lee, Yuh Chwen G.; Schrider, Daniel R.; Pool, John E.; Langley, Sasha A.; Suarez, Charlyn; Corbett-Detig, Russell B.; Kolaczkowski, Bryan; Fang, Shu; Nista, Phillip M.; Holloway, Alisha K.; Kern, Andrew D.; Dewey, Colin N.; Song, Yun S.; Hahn, Matthew W.; Begun, David J.

    2012-01-01

    This report of independent genome sequences of two natural populations of Drosophila melanogaster (37 from North America and 6 from Africa) provides unique insight into forces shaping genomic polymorphism and divergence. Evidence of interactions between natural selection and genetic linkage is abundant not only in centromere- and telomere-proximal regions, but also throughout the euchromatic arms. Linkage disequilibrium, which decays within 1 kbp, exhibits a strong bias toward coupling of the more frequent alleles and provides a high-resolution map of recombination rate. The juxtaposition of population genetics statistics in small genomic windows with gene structures and chromatin states yields a rich, high-resolution annotation, including the following: (1) 5′- and 3′-UTRs are enriched for regions of reduced polymorphism relative to lineage-specific divergence; (2) exons overlap with windows of excess relative polymorphism; (3) epigenetic marks associated with active transcription initiation sites overlap with regions of reduced relative polymorphism and relatively reduced estimates of the rate of recombination; (4) the rate of adaptive nonsynonymous fixation increases with the rate of crossing over per base pair; and (5) both duplications and deletions are enriched near origins of replication and their density correlates negatively with the rate of crossing over. Available demographic models of X and autosome descent cannot account for the increased divergence on the X and loss of diversity associated with the out-of-Africa migration. Comparison of the variation among these genomes to variation among genomes from D. simulans suggests that many targets of directional selection are shared between these species. PMID:22673804

  14. The international Genome sample resource (IGSR): A worldwide collection of genome variation incorporating the 1000 Genomes Project data.

    Science.gov (United States)

    Clarke, Laura; Fairley, Susan; Zheng-Bradley, Xiangqun; Streeter, Ian; Perry, Emily; Lowy, Ernesto; Tassé, Anne-Marie; Flicek, Paul

    2017-01-04

    The International Genome Sample Resource (IGSR; http://www.internationalgenome.org) expands in data type and population diversity the resources from the 1000 Genomes Project. IGSR represents the largest open collection of human variation data and provides easy access to these resources. IGSR was established in 2015 to maintain and extend the 1000 Genomes Project data, which has been widely used as a reference set of human variation and by researchers developing analysis methods. IGSR has mapped all of the 1000 Genomes sequence to the newest human reference (GRCh38), and will release updated variant calls to ensure maximal usefulness of the existing data. IGSR is collecting new structural variation data on the 1000 Genomes samples from long read sequencing and other technologies, and will collect relevant functional data into a single comprehensive resource. IGSR is extending coverage with new populations sequenced by collaborating groups. Here, we present the new data and analysis that IGSR has made available. We have also introduced a new data portal that increases discoverability of our data (previously only browseable through our FTP site) by focusing on particular samples, populations or data sets of interest. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Ultra Deep Sequencing of a Baculovirus Population Reveals Widespread Genomic Variations

    Directory of Open Access Journals (Sweden)

    Aurélien Chateigner

    2015-07-01

    Full Text Available Viruses rely on widespread genetic variation and large population size for adaptation. Large DNA virus populations are thought to harbor little variation though natural populations may be polymorphic. To measure the genetic variation present in a dsDNA virus population, we deep sequenced a natural strain of the baculovirus Autographa californica multiple nucleopolyhedrovirus. With 124,221X average genome coverage of our 133,926 bp long consensus, we could detect low frequency mutations (0.025%). K-means clustering was used to classify the mutations in four categories according to their frequency in the population. We found 60 high frequency non-synonymous mutations under balancing selection distributed in all functional classes. These mutants could alter viral adaptation dynamics, either through competitive or synergistic processes. Lastly, we developed a technique for the delimitation of large deletions in next generation sequencing data. We found that large deletions occur along the entire viral genome, with hotspots located in homologous repeat regions (hrs). Present in 25.4% of the genomes, these deletion mutants presumably require functional complementation to complete their infection cycle. They might thus have a large impact on the fitness of the baculovirus population. Altogether, we found a wide breadth of genomic variation in the baculovirus population, suggesting it has high adaptive potential.

  16. Regulatory hotspots in the malaria parasite genome dictate transcriptional variation.

    Directory of Open Access Journals (Sweden)

    Joseph M Gonzales

    2008-09-01

    Full Text Available The determinants of transcriptional regulation in malaria parasites remain elusive. The presence of a well-characterized gene expression cascade shared by different Plasmodium falciparum strains could imply that transcriptional regulation and its natural variation do not contribute significantly to the evolution of parasite drug resistance. To clarify the role of transcriptional variation as a source of strain-specific diversity in the most deadly malaria species and to find genetic loci that dictate variations in gene expression, we examined genome-wide expression level polymorphisms (ELPs) in a genetic cross between phenotypically distinct parasite clones. Significant variation in gene expression is observed through direct co-hybridizations of RNA from different P. falciparum clones. Nearly 18% of genes were regulated by a significant expression quantitative trait locus. The genetic determinants of most of these ELPs resided in hotspots that are physically distant from their targets. The most prominent regulatory locus, influencing 269 transcripts, coincided with a Chromosome 5 amplification event carrying the drug resistance gene, pfmdr1, and 13 other genes. Drug selection pressure in the Dd2 parental clone lineage led not only to a copy number change in the pfmdr1 gene but also to an increased copy number of putative neighboring regulatory factors that, in turn, broadly influence the transcriptional network. Previously unrecognized transcriptional variation, controlled by polymorphic regulatory genes and possibly master regulators within large copy number variants, contributes to sweeping phenotypic evolution in drug-resistant malaria parasites.

  17. Structural variation in two human genomes mapped at single-nucleotide resolution by whole genome de novo assembly

    DEFF Research Database (Denmark)

    Li, Yingrui; Zheng, Hancheng; Luo, Ruibang

    2011-01-01

    Here we use whole-genome de novo assembly of second-generation sequencing reads to map structural variation (SV) in an Asian genome and an African genome. Our approach identifies small- and intermediate-size homozygous variants (1-50 kb) including insertions, deletions, inversions and their precise...

  18. Theories of Population Variation in Genes and Genomes

    DEFF Research Database (Denmark)

    Christiansen, Freddy

    This textbook provides an authoritative introduction to both classical and coalescent approaches to population genetics. Written for graduate students and advanced undergraduates by one of the world’s leading authorities in the field, the book focuses on the theoretical background of population...... genetics, while emphasizing the close interplay between theory and empiricism. Traditional topics such as genetic and phenotypic variation, mutation, migration, and linkage are covered and advanced by contemporary coalescent theory, which describes the genealogy of genes in a population, ultimately...... connecting them to a single common ancestor. Effects of selection, particularly genomic effects, are discussed with reference to molecular genetic variation. The book is designed for students of population genetics, bioinformatics, evolutionary biology, molecular evolution, and theoretical biology—as well...

  19. Simultaneous Structural Variation Discovery in Multiple Paired-End Sequenced Genomes

    Science.gov (United States)

    Hormozdiari, Fereydoun; Hajirasouliha, Iman; McPherson, Andrew; Eichler, Evan E.; Sahinalp, S. Cenk

    Next generation sequencing technologies have been decreasing the costs and increasing the world-wide capacity for sequence production at an unprecedented rate, enabling the initiation of large scale projects aiming to sequence almost 2000 genomes [1]. Structural variation detection promises to be one of the key diagnostic tools for cancer and other diseases with genomic origin. In this paper, we study the problem of detecting structural variation events in two or more sequenced genomes through high throughput sequencing. We propose to move from the current model of (1) detecting genomic variations in single next generation sequenced (NGS) donor genomes independently, and (2) checking whether two or more donor genomes indeed agree or disagree on the variations (in this paper we name this framework Independent Structural Variation Discovery and Merging - ISV&M), to a new model in which we detect structural variation events among multiple genomes simultaneously.

  20. PolyTB: A genomic variation map for Mycobacterium tuberculosis

    KAUST Repository

    Coll, Francesc

    2014-02-15

    Tuberculosis (TB) caused by Mycobacterium tuberculosis (Mtb) is the second major cause of death from an infectious disease worldwide. Recent advances in DNA sequencing are leading to the ability to generate whole genome information in clinical isolates of M. tuberculosis complex (MTBC). The identification of informative genetic variants such as phylogenetic markers and those associated with drug resistance or virulence will help barcode Mtb in the context of epidemiological, diagnostic and clinical studies. Mtb genomic datasets are increasingly available as raw sequences, which are potentially difficult and computer intensive to process, and compare across studies. Here we have processed the raw sequence data (>1500 isolates, eight studies) to compile a catalogue of SNPs (n = 74,039, 63% non-synonymous, 51.1% in more than one isolate, i.e. non-private), small indels (n = 4810) and larger structural variants (n = 800). We have developed the PolyTB web-based tool (http://pathogenseq.lshtm.ac.uk/polytb) to visualise the resulting variation and important meta-data (e.g. in silico inferred strain-types, location) within geographical map and phylogenetic views. This resource will allow researchers to identify polymorphisms within candidate genes of interest, as well as examine the genomic diversity and distribution of strains. PolyTB source code is freely available to researchers wishing to develop similar tools for their pathogen of interest. 2014 Elsevier Ltd. All rights reserved.

  1. Effective Normalization for Copy Number Variation Detection from Whole Genome Sequencing

    NARCIS (Netherlands)

    Janevski, A.; Varadan, V.; Kamalakaran, S.; Banerjee, N.; Dimitrova, D.

    2012-01-01

    Background: Whole genome sequencing enables a high resolution view of the human genome and provides unique insights into genome structure at an unprecedented scale. There have been a number of tools to infer copy number variation in the genome. These tools, while validated, also include a number of

  2. Rare and common regulatory variation in population-scale sequenced human genomes.

    Directory of Open Access Journals (Sweden)

    Stephen B Montgomery

    2011-07-01

    Full Text Available Population-scale genome sequencing allows the characterization of functional effects of a broad spectrum of genetic variants underlying human phenotypic variation. Here, we investigate the influence of rare and common genetic variants on gene expression patterns, using variants identified from sequencing data from the 1000 genomes project in an African and European population sample and gene expression data from lymphoblastoid cell lines. We detect comparable numbers of expression quantitative trait loci (eQTLs) when compared to genotypes obtained from HapMap 3, but as many as 80% of the top expression quantitative trait variants (eQTVs) discovered from 1000 genomes data are novel. The properties of the newly discovered variants suggest that mapping common causal regulatory variants is challenging even with full resequencing data; however, we observe significant enrichment of regulatory effects in splice-site and nonsense variants. Using RNA sequencing data, we show that 46.2% of nonsynonymous variants are differentially expressed in at least one individual in our sample, creating widespread potential for interactions between functional protein-coding and regulatory variants. We also use allele-specific expression to identify putative rare causal regulatory variants. Furthermore, we demonstrate that outlier expression values can be due to rare variant effects, and we approximate the number of such effects harboured in an individual by effect size. Our results demonstrate that integration of genomic and RNA sequencing analyses allows for the joint assessment of genome sequence and genome function.

  3. Background selection as baseline for nucleotide variation across the Drosophila genome.

    Directory of Open Access Journals (Sweden)

    Josep M Comeron

    2014-06-01

    Full Text Available The constant removal of deleterious mutations by natural selection causes a reduction in neutral diversity and efficacy of selection at genetically linked sites (a process called Background Selection, BGS). Population genetic studies, however, often ignore BGS effects when investigating demographic events or the presence of other types of selection. To obtain a more realistic evolutionary expectation that incorporates the unavoidable consequences of deleterious mutations, we generated high-resolution landscapes of variation across the Drosophila melanogaster genome under a BGS scenario independent of polymorphism data. We find that BGS plays a significant role in shaping levels of variation across the entire genome, including long introns and intergenic regions distant from annotated genes. We also find that a very large percentage of the observed variation in diversity across autosomes can be explained by BGS alone, up to 70% across individual chromosome arms at 100-kb scale, thus indicating that BGS predictions can be used as baseline to infer additional types of selection and demographic events. This approach allows detecting several outlier regions with signal of recent adaptive events and selective sweeps. The use of a BGS baseline, however, is particularly appropriate to investigate the presence of balancing selection and our study exposes numerous genomic regions with the predicted signature of higher polymorphism than expected when a BGS context is taken into account. Importantly, we show that these conclusions are robust to the mutation and selection parameters of the BGS model. Finally, analyses of protein evolution together with previous comparisons of genetic maps between Drosophila species, suggest temporally variable recombination landscapes and, thus, local BGS effects that may differ between extant and past phases. Because genome-wide BGS and temporal changes in linkage effects can skew approaches to estimate demographic and

  4. Identification of genomic regions associated with phenotypic variation between dog breeds using selection mapping.

    Directory of Open Access Journals (Sweden)

    Amaury Vaysse

    2011-10-01

    Full Text Available The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. Here we perform a comprehensive analysis using multiple test statistics to identify regions under selection in 509 dogs from 46 diverse breeds using a newly developed high-density genotyping array consisting of >170,000 evenly spaced SNPs. We first identify 44 genomic regions exhibiting extreme differentiation across multiple breeds. Genetic variation in these regions correlates with variation in several phenotypic traits that vary between breeds, and we identify novel associations with both morphological and behavioral traits. We next scan the genome for signatures of selective sweeps in single breeds, characterized by long regions of reduced heterozygosity and fixation of extended haplotypes. These scans identify hundreds of regions, including 22 blocks of homozygosity longer than one megabase in certain breeds. Candidate selection loci are strongly enriched for developmental genes. We chose one highly differentiated region, associated with body size and ear morphology, and characterized it using high-throughput sequencing to provide a list of variants that may directly affect these traits. This study provides a catalogue of genomic regions showing extreme reduction in genetic variation or population differentiation in dogs, including many linked to phenotypic variation. The many blocks of reduced haplotype diversity observed across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease.

  5. Genomic and gene variation in Mycoplasma hominis strains

    DEFF Research Database (Denmark)

    Christiansen, Gunna; Andersen, H; Birkelund, Svend

    1987-01-01

    DNAs from 14 strains of Mycoplasma hominis isolated from various habitats, including strain PG21, were analyzed for genomic heterogeneity. DNA-DNA filter hybridization values were from 51 to 91%. Restriction endonuclease digestion patterns, analyzed by agarose gel electrophoresis, revealed...... no identity or cluster formation between strains. Variation within M. hominis rRNA genes was analyzed by Southern hybridization of EcoRI-cleaved DNA hybridized with a cloned fragment of the rRNA gene from the mycoplasma strain PG50. Five of the M. hominis strains showed identical hybridization patterns....... These hybridization patterns were compared with those of 12 other mycoplasma species, which showed a much more complex band pattern. Cloned nonribosomal RNA gene fragments of M. hominis PG21 DNA were analyzed, and the fragments were used to demonstrate heterogeneity among the strains. A monoclonal antibody against...

  6. Meiotic gene-conversion rate and tract length variation in the human genome.

    Science.gov (United States)

    Padhukasahasram, Badri; Rannala, Bruce

    2013-02-27

    Meiotic recombination occurs in the form of two different mechanisms called crossing-over and gene-conversion and both processes have an important role in shaping genetic variation in populations. Although variation in crossing-over rates has been studied extensively using sperm-typing experiments, pedigree studies and population genetic approaches, our knowledge of variation in gene-conversion parameters (i.e., rates and mean tract lengths) remains far from complete. To explore variability in population gene-conversion rates and its relationship to crossing-over rate variation patterns, we have developed and validated using coalescent simulations a comprehensive Bayesian full-likelihood method that can jointly infer crossing-over and gene-conversion rates as well as tract lengths from population genomic data under general variable rate models with recombination hotspots. Here, we apply this new method to SNP data from multiple human populations and attempt to characterize for the first time the fine-scale variation in gene-conversion parameters along the human genome. We find that the estimated ratio of gene-conversion to crossing-over rates varies considerably across genomic regions as well as between populations. However, there is a great degree of uncertainty associated with such estimates. We also find substantial evidence for variation in the mean conversion tract length. The estimated tract lengths did not show any negative relationship with the local heterozygosity levels in our analysis. European Journal of Human Genetics advance online publication, 27 February 2013; doi:10.1038/ejhg.2013.30.

  7. Genomic copy number variations in three Southeast Asian populations.

    Science.gov (United States)

    Ku, Chee-Seng; Pawitan, Yudi; Sim, Xueling; Ong, Rick T H; Seielstad, Mark; Lee, Edmund J D; Teo, Yik-Ying; Chia, Kee-Seng; Salim, Agus

    2010-07-01

    Research on the role of copy number variations (CNVs) in the genetic risk of diseases in Asian populations has been hampered by a relative lack of reference CNV maps for Asian populations outside the East Asians. In this article, we report the population characteristics of CNVs in Chinese, Malay, and Asian Indian populations in Singapore. Using the Illumina Human 1M Beadchip array, we identify 1,174 CNV loci in these populations that were corroborated by findings when the same samples were typed on the Affymetrix 6.0 platform. We identify 441 novel loci not previously reported in the Database of Genomic Variants (DGV). We observe a considerable number of loci that span all three populations and were previously unreported, as well as population-specific loci that are quite common in the respective populations. From this we observe the distribution of CNVs in the Asian Indian population to be considerably different from the Chinese and Malay populations. About half of the deletion loci and three-quarters of duplication loci overlap UCSC genes. Tens of loci show population differentiation and overlap with genes previously known to be associated with genetic risk of diseases. One of these loci is the CYP2A6 deletion, previously linked to reduced susceptibility to lung cancer. (c) 2010 Wiley-Liss, Inc.

  8. Identification of genomic indels and structural variations using split reads

    Directory of Open Access Journals (Sweden)

    Urban Alexander E

    2011-07-01

    Full Text Available Abstract Background: Recent studies have demonstrated the genetic significance of insertions, deletions, and other more complex structural variants (SVs) in the human population. With the development of the next-generation sequencing technologies, high-throughput surveys of SVs on the whole-genome level have become possible. Here we present split-read identification, calibrated (SRiC), a sequence-based method for SV detection. Results: We start by mapping each read to the reference genome in standard fashion using gapped alignment. Then to identify SVs, we score each of the many initial mappings with an assessment strategy designed to take into account both sequencing and alignment errors (e.g. scoring more highly events gapped in the center of a read). All current SV calling methods have multilevel biases in their identifications due to both experimental and computational limitations (e.g. calling more deletions than insertions). A key aspect of our approach is that we calibrate all our calls against synthetic data sets generated from simulations of high-throughput sequencing (with realistic error models). This allows us to calculate sensitivity and the positive predictive value under different parameter-value scenarios and for different classes of events (e.g. long deletions vs. short insertions). We run our calculations on representative data from the 1000 Genomes Project. Coupling the observed numbers of events on chromosome 1 with the calibrations gleaned from the simulations (for different length events) allows us to construct a relatively unbiased estimate for the total number of SVs in the human genome across a wide range of length scales. We estimate in particular that an individual genome contains ~670,000 indels/SVs. Conclusions: Compared with the existing read-depth and read-pair approaches for SV identification, our method can pinpoint the exact breakpoints of SV events, reveal the actual sequence content of insertions, and cover the whole
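
    The calibration idea in this record (scoring calls against simulated data to obtain sensitivity and positive predictive value) can be illustrated with a minimal Python sketch. It is not the SRiC implementation: the function names, the representation of events as exact-match tuples, and the count correction `observed * PPV / sensitivity` are assumptions made for illustration only.

```python
def calibrate(called, truth):
    """Return (sensitivity, ppv) for a set of calls against a simulated truth set.

    Events are hypothetical (chrom, start, end, kind) tuples and must match
    exactly to count as true positives (an illustrative simplification).
    """
    called, truth = set(called), set(truth)
    true_positives = len(called & truth)
    sensitivity = true_positives / len(truth) if truth else 0.0
    ppv = true_positives / len(called) if called else 0.0
    return sensitivity, ppv


def corrected_count(observed, sensitivity, ppv):
    """Estimate the real number of events from an observed call count.

    Assumed correction: observed * ppv removes expected false positives,
    and dividing by sensitivity adds back the expected missed events.
    """
    if sensitivity == 0:
        raise ValueError("sensitivity is zero; the count cannot be corrected")
    return observed * ppv / sensitivity


# Simulated truth: five deletions; the caller reports four events, three correct.
truth = [("chr1", 100 * i, 100 * i + 50, "DEL") for i in range(5)]
called = truth[:3] + [("chr1", 9000, 9050, "DEL")]
sens, ppv = calibrate(called, truth)
estimate = corrected_count(120, sens, ppv)
```

    In practice the calibration would be computed separately per event class and length range, as the abstract describes, rather than once for all calls.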

  9. Variation in heterozygosity predicts variation in human substitution rates between populations, individuals and genomic regions.

    Directory of Open Access Journals (Sweden)

    William Amos

    Full Text Available The "heterozygote instability" (HI) hypothesis suggests that gene conversion events focused on heterozygous sites during meiosis locally increase the mutation rate, but this hypothesis remains largely untested. As humans left Africa they lost variability, which, if HI operates, should have reduced the mutation rate in non-Africans. Relative substitution rates were quantified in diverse humans using aligned whole genome sequences from the 1000 Genomes Project. Substitution rate is consistently greater in Africans than in non-Africans, but only in diploid regions of the genome, consistent with a role for heterozygosity. Analysing the same data partitioned into a series of non-overlapping 2 Mb windows reveals a strong, non-linear correlation between the amount of heterozygosity lost "out of Africa" and the difference in substitution rate between Africans and non-Africans. Putative recent mutations, derived variants that occur only once among the 80 human chromosomes sampled, occur preferentially at the centre of 2 Kb windows that have elevated heterozygosity compared both with the same region in a closely related population and with an immediately adjacent region in the same population. More than half of all substitutions appear attributable to variation in heterozygosity. This observation provides strong support for HI with implications for many branches of evolutionary biology.
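
    As a rough illustration of the windowed analysis this record describes, the Python sketch below partitions genotyped sites into non-overlapping fixed-size windows and reports the fraction of heterozygous sites in each. The function name, the input format (position plus a heterozygosity flag for one individual on one chromosome) and the 0-based coordinates are assumptions for the example, not the authors' pipeline.

```python
from collections import defaultdict


def windowed_heterozygosity(sites, window_size=2_000_000):
    """Fraction of heterozygous sites per non-overlapping window.

    sites: iterable of (position, is_heterozygous) pairs, 0-based positions.
    Returns a dict mapping each window start to het_sites / total_sites.
    """
    counts = defaultdict(lambda: [0, 0])  # window start -> [het, total]
    for position, is_het in sites:
        start = (position // window_size) * window_size
        counts[start][1] += 1
        if is_het:
            counts[start][0] += 1
    return {start: het / total for start, (het, total) in sorted(counts.items())}


example_sites = [
    (100, True),
    (1_500_000, False),
    (2_000_001, True),
    (3_999_999, True),
    (4_000_000, False),
]
per_window = windowed_heterozygosity(example_sites)
```

    Comparing such per-window values between two populations would give the "heterozygosity lost" quantity that the abstract correlates with substitution-rate differences.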

  10. Genomic Evolution of Saccharomyces cerevisiae under Chinese Rice Wine Fermentation

    Science.gov (United States)

    Li, Yudong; Zhang, Weiping; Zheng, Daoqiong; Zhou, Zhan; Yu, Wenwen; Zhang, Lei; Feng, Lifang; Liang, Xinle; Guan, Wenjun; Zhou, Jingwen; Chen, Jian; Lin, Zhenguo

    2014-01-01

    Rice wine fermentation represents a unique environment for the evolution of the budding yeast, Saccharomyces cerevisiae. To understand how the selection pressure shaped the yeast genome and gene regulation, we determined the genome sequence and transcriptome of a S. cerevisiae strain YHJ7 isolated from Chinese rice wine (Huangjiu), a popular traditional alcoholic beverage in China. By comparing the genome of YHJ7 to the lab strain S288c, a Japanese sake strain K7, and a Chinese industrial bioethanol strain YJSH1, we identified many genomic sequence and structural variations in YHJ7, which are mainly located in subtelomeric regions, suggesting that these regions play an important role in genomic evolution between strains. In addition, our comparative transcriptome analysis between YHJ7 and S288c revealed a set of differentially expressed genes, including those involved in glucose transport (e.g., HXT2, HXT7) and oxidoreductase activity (e.g., AAD10, ADH7). Interestingly, many of these genomic and transcriptional variations are directly or indirectly associated with the adaptation of YHJ7 strain to its specific niches. Our molecular evolution analysis suggested that Japanese sake strains (K7/UC5) were derived from Chinese rice wine strains (YHJ7) at least approximately 2,300 years ago, providing the first molecular evidence elucidating the origin of Japanese sake strains. Our results depict interesting insights regarding the evolution of yeast during rice wine fermentation, and provided a valuable resource for genetic engineering to improve industrial wine-making strains. PMID:25212861

  11. Fundamental quadratic variational principle underlying general relativity

    International Nuclear Information System (INIS)

    Atkins, W.K.

    1983-01-01

    The fundamental result of Lanczos is used in a new type of quadratic variational principle whose field equations are the Einstein field equations together with the Yang-Mills type equations for the Riemann curvature. Additionally, a spin-2 theory of gravity for the special case of the Einstein vacuum is discussed

  12. Whole-genome sequence variation, population structure and demographic history of the Dutch population

    NARCIS (Netherlands)

    Francioli, Laurent C.; Menelaou, Androniki; Pulit, Sara L.; Van Dijk, Freerk; Palamara, Pier Francesco; Elbers, Clara C.; Neerincx, Pieter B. T.; Ye, Kai; Guryev, Victor; Kloosterman, Wigard P.; Deelen, Patrick; Abdellaoui, Abdel; Van Leeuwen, Elisabeth M.; Van Oven, Mannis; Vermaat, Martijn; Li, Mingkun; Laros, Jeroen F. J.; Karssen, Lennart C.; Kanterakis, Alexandros; Amin, Najaf; Hottenga, Jouke Jan; Lameijer, Eric-Wubbo; Kattenberg, Mathijs; Dijkstra, Martijn; Byelas, Heorhiy; Van Setten, Jessica; Van Schaik, Barbera D. C.; Bot, Jan; Nijman, Isaac J.; Renkens, Ivo; Marschall, Tobias; Schonhuth, Alexander; Hehir-Kwa, Jayne Y.; Handsaker, Robert E.; Polak, Paz; Sohail, Mashaal; Vuzman, Dana; Hormozdiari, Fereydoun; Van Enckevort, David; Mei, Hailiang; Koval, Vyacheslav; Moed, Matthijs H.; Van der Velde, K. Joeri; Rivadeneira, Fernando; Estrada, Karol; Medina-Gomez, Carolina; Isaacs, Aaron; Platteel, Mathieu; Swertz, Morris A.; Wijmenga, Cisca

    Whole-genome sequencing enables complete characterization of genetic variation, but geographic clustering of rare alleles demands many diverse populations be studied. Here we describe the Genome of the Netherlands (GoNL) Project, in which we sequenced the whole genomes of 250 Dutch parent-offspring

  13. Whole-genome sequence variation, population structure and demographic history of the Dutch population

    NARCIS (Netherlands)

    The Genome of the Netherlands Consortium; T. Marschall (Tobias); A. Schönhuth (Alexander)

    2014-01-01

    Whole-genome sequencing enables complete characterization of genetic variation, but geographic clustering of rare alleles demands many diverse populations be studied. Here we describe the Genome of the Netherlands (GoNL) Project, in which we sequenced the whole genomes of 250 Dutch

  14. No evidence that sex and transposable elements drive genome size variation in evening primroses.

    Science.gov (United States)

    Ågren, J Arvid; Greiner, Stephan; Johnson, Marc T J; Wright, Stephen I

    2015-04-01

    Genome size varies dramatically across species, but despite an abundance of attention there is little agreement on the relative contributions of selective and neutral processes in governing this variation. The rate of sex can potentially play an important role in genome size evolution because of its effect on the efficacy of selection and transmission of transposable elements (TEs). Here, we used a phylogenetic comparative approach and whole genome sequencing to investigate the contribution of sex and TE content to genome size variation in the evening primrose (Oenothera) genus. We determined genome size using flow cytometry for 30 species that vary in genetic system and find that variation in sexual/asexual reproduction cannot explain the almost twofold variation in genome size. Moreover, using whole genome sequences of three species of varying genome sizes and reproductive system, we found that genome size was not associated with TE abundance; instead the larger genomes had a higher abundance of simple sequence repeats. Although it has long been clear that sexual reproduction may affect various aspects of genome evolution in general and TE evolution in particular, it does not appear to have played a major role in genome size evolution in the evening primroses. © 2015 The Author(s).

  15. Karyotype diversity and genome size variation in Neotropical Maxillariinae orchids.

    Science.gov (United States)

    Moraes, A P; Koehler, S; Cabral, J S; Gomes, S S L; Viccini, L F; Barros, F; Felix, L P; Guerra, M; Forni-Martins, E R

    2017-03-01

    Orchidaceae is a widely distributed plant family with very diverse vegetative and floral morphology, and such variability is also reflected in their karyotypes. However, since only a low proportion of Orchidaceae has been analysed for chromosome data, greater diversity may remain to be unveiled. Here we analyse both genome size (GS) and karyotype in two subtribes recently included in the broadened Maxillariinae to detect how much chromosome and GS variation there is in these groups and to evaluate which genome rearrangements are involved in the species evolution. To do so, the GS (14 species), the karyotype - based on chromosome number, heterochromatic banding and 5S and 45S rDNA localisation (18 species) - was characterised and analysed along with published data using phylogenetic approaches. The GS presented a high phylogenetic correlation and it was related to morphological groups in Bifrenaria (larger plants - higher GS). The two largest GS found among genera were caused by different mechanisms: polyploidy in Bifrenaria tyrianthina and accumulation of repetitive DNA in Scuticaria hadwenii. The chromosome number variability was caused mainly through descending dysploidy, and x=20 was estimated as the base chromosome number. Combining GS and karyotype data with molecular phylogeny, our data provide a more complete scenario of the karyotype evolution in Maxillariinae orchids, allowing us to suggest, besides dysploidy, inversions and transposable elements as two mechanisms involved in the karyotype evolution. Such karyotype modifications could be associated with niche changes that occurred during species evolution. © 2016 German Botanical Society and The Royal Botanical Society of the Netherlands.

  16. Genetic variation architecture of mitochondrial genome reveals the differentiation in Korean landrace and weedy rice

    OpenAIRE

    Wei Tong; Qiang He; Yong-Jin Park

    2017-01-01

    Mitochondrial genome variations have been detected despite the overall conservation of this gene content, which has been valuable for plant population genetics and evolutionary studies. Here, we describe mitochondrial variation architecture and our performance of a phylogenetic dissection of Korean landrace and weedy rice. A total of 4,717 variations across the mitochondrial genome were identified adjunct with 10 wild rice. Genetic diversity assessment revealed that wild rice has higher nucle...

  17. A simple genetic architecture underlies morphological variation in dogs.

    Directory of Open Access Journals (Sweden)

    Adam R Boyko

    2010-08-01

    Full Text Available Domestic dogs exhibit tremendous phenotypic diversity, including a greater variation in body size than any other terrestrial mammal. Here, we generate a high density map of canine genetic variation by genotyping 915 dogs from 80 domestic dog breeds, 83 wild canids, and 10 outbred African shelter dogs across 60,968 single-nucleotide polymorphisms (SNPs). Coupling this genomic resource with external measurements from breed standards and individuals as well as skeletal measurements from museum specimens, we identify 51 regions of the dog genome associated with phenotypic variation among breeds in 57 traits. The complex traits include average breed body size and external body dimensions and cranial, dental, and long bone shape and size with and without allometric scaling. In contrast to the results from association mapping of quantitative traits in humans and domesticated plants, we find that across dog breeds, a small number of quantitative trait loci (≤3) explain the majority of phenotypic variation for most of the traits we studied. In addition, many genomic regions show signatures of recent selection, with most of the highly differentiated regions being associated with breed-defining traits such as body size, coat characteristics, and ear floppiness. Our results demonstrate the efficacy of mapping multiple traits in the domestic dog using a database of genotyped individuals and highlight the important role human-directed selection has played in altering the genetic architecture of key traits in this important species.

  18. Sequence variation of the feline immunodeficiency virus genome and its clinical relevance.

    Science.gov (United States)

    Stickney, A L; Dunowska, M; Cave, N J

    2013-06-08

    The ongoing evolution of feline immunodeficiency virus (FIV) has resulted in the existence of a diverse continuum of viruses. FIV isolates differ with regards to their mutation and replication rates, plasma viral loads, cell tropism and the ability to induce apoptosis. Clinical disease in FIV-infected cats is also inconsistent. Genomic sequence variation of FIV is likely to be responsible for some of the variation in viral behaviour. The specific genetic sequences that influence these key viral properties remain to be determined. With knowledge of the specific key determinants of pathogenicity, there is the potential for veterinarians in the future to apply this information for prognostic purposes. Genomic sequence variation of FIV also presents an obstacle to effective vaccine development. Most challenge studies demonstrate acceptable efficacy of a dual-subtype FIV vaccine (Fel-O-Vax FIV) against FIV infection under experimental settings; however, vaccine efficacy in the field still remains to be proven. It is important that we discover the key determinants of immunity induced by this vaccine; such data would complement vaccine field efficacy studies and provide the basis to make informed recommendations on its use.

  19. CpG islands undermethylation in human genomic regions under selective pressure.

    Directory of Open Access Journals (Sweden)

    Sergio Cocozza

    Full Text Available DNA methylation at CpG islands (CGIs) is one of the most intensively studied epigenetic mechanisms. It is fundamental for cellular differentiation and control of transcriptional potential. DNA methylation is involved also in several processes that are central to evolutionary biology, including phenotypic plasticity and evolvability. In this study, we explored the relationship between CpG islands methylation and signatures of selective pressure in Homo sapiens, using a computational biology approach. By analyzing methylation data of 25 cell lines from the Encyclopedia of DNA Elements (ENCODE) Consortium, we compared the DNA methylation of CpG islands in genomic regions under selective pressure with the methylation of CpG islands in the remaining part of the genome. To define genomic regions under selective pressure, we used three different methods, each oriented to provide distinct information about selective events. Independently of the method and of the cell type used, we found evidence of undermethylation of CGIs in human genomic regions under selective pressure. Additionally, by analyzing SNP frequency in CpG islands, we demonstrated that CpG islands in regions under selective pressure show lower genetic variation. Our findings suggest that the CpG islands in regions under selective pressure seem to be somehow more "protected" from methylation when compared with other regions of the genome.

  20. A genome-wide analysis of putative functional and exonic variation associated with extremely high intelligence.

    Science.gov (United States)

    Spain, S L; Pedroso, I; Kadeva, N; Miller, M B; Iacono, W G; McGue, M; Stergiakouli, E; Davey Smith, G; Putallaz, M; Lubinski, D; Meaburn, E L; Plomin, R; Simpson, M A

    2016-08-01

    Although individual differences in intelligence (general cognitive ability) are highly heritable, molecular genetic analyses to date have had limited success in identifying specific loci responsible for its heritability. This study is the first to investigate exome variation in individuals of extremely high intelligence. Under the quantitative genetic model, sampling from the high extreme of the distribution should provide increased power to detect associations. We therefore performed a case-control association analysis with 1409 individuals drawn from the top 0.0003 (IQ >170) of the population distribution of intelligence and 3253 unselected population-based controls. Our analysis focused on putative functional exonic variants assayed on the Illumina HumanExome BeadChip. We did not observe any individual protein-altering variants that are reproducibly associated with extremely high intelligence and within the entire distribution of intelligence. Moreover, no significant associations were found for multiple rare alleles within individual genes. However, analyses using genome-wide similarity between unrelated individuals (genome-wide complex trait analysis) indicate that the genotyped functional protein-altering variation yields a heritability estimate of 17.4% (s.e. 1.7%) based on a liability model. In addition, investigation of nominally significant associations revealed fewer rare alleles associated with extremely high intelligence than would be expected under the null hypothesis. This observation is consistent with the hypothesis that rare functional alleles are more frequently detrimental than beneficial to intelligence.

  1. Genomic Features That Predict Allelic Imbalance in Humans Suggest Patterns of Constraint on Gene Expression Variation

    Science.gov (United States)

    Fédrigo, Olivier; Haygood, Ralph; Mukherjee, Sayan; Wray, Gregory A.

    2009-01-01

    Variation in gene expression is an important contributor to phenotypic diversity within and between species. Although this variation often has a genetic component, identification of the genetic variants driving this relationship remains challenging. In particular, measurements of gene expression usually do not reveal whether the genetic basis for any observed variation lies in cis or in trans to the gene, a distinction that has direct relevance to the physical location of the underlying genetic variant, and which may also impact its evolutionary trajectory. Allelic imbalance measurements identify cis-acting genetic effects by assaying the relative contribution of the two alleles of a cis-regulatory region to gene expression within individuals. Identification of patterns that predict commonly imbalanced genes could therefore serve as a useful tool and also shed light on the evolution of cis-regulatory variation itself. Here, we show that sequence motifs, polymorphism levels, and divergence levels around a gene can be used to predict commonly imbalanced genes in a human data set. Reduction of this feature set to four factors revealed that only one factor significantly differentiated between commonly imbalanced and nonimbalanced genes. We demonstrate that these results are consistent between the original data set and a second published data set in humans obtained using different technical and statistical methods. Finally, we show that variation in the single allelic imbalance-associated factor is partially explained by the density of genes in the region of a target gene (allelic imbalance is less probable for genes in gene-dense regions), and, to a lesser extent, the evenness of expression of the gene across tissues and the magnitude of negative selection on putative regulatory regions of the gene. These results suggest that the genomic distribution of functional cis-regulatory variants in the human genome is nonrandom, perhaps due to local differences in evolutionary

  2. Genomic evolution of Saccharomyces cerevisiae under Chinese rice wine fermentation.

    Science.gov (United States)

    Li, Yudong; Zhang, Weiping; Zheng, Daoqiong; Zhou, Zhan; Yu, Wenwen; Zhang, Lei; Feng, Lifang; Liang, Xinle; Guan, Wenjun; Zhou, Jingwen; Chen, Jian; Lin, Zhenguo

    2014-09-10

    Rice wine fermentation represents a unique environment for the evolution of the budding yeast, Saccharomyces cerevisiae. To understand how the selection pressure shaped the yeast genome and gene regulation, we determined the genome sequence and transcriptome of a S. cerevisiae strain YHJ7 isolated from Chinese rice wine (Huangjiu), a popular traditional alcoholic beverage in China. By comparing the genome of YHJ7 to the lab strain S288c, a Japanese sake strain K7, and a Chinese industrial bioethanol strain YJSH1, we identified many genomic sequence and structural variations in YHJ7, which are mainly located in subtelomeric regions, suggesting that these regions play an important role in genomic evolution between strains. In addition, our comparative transcriptome analysis between YHJ7 and S288c revealed a set of differentially expressed genes, including those involved in glucose transport (e.g., HXT2, HXT7) and oxidoreductase activity (e.g., AAD10, ADH7). Interestingly, many of these genomic and transcriptional variations are directly or indirectly associated with the adaptation of YHJ7 strain to its specific niches. Our molecular evolution analysis suggested that Japanese sake strains (K7/UC5) were derived from Chinese rice wine strains (YHJ7) at least approximately 2,300 years ago, providing the first molecular evidence elucidating the origin of Japanese sake strains. Our results depict interesting insights regarding the evolution of yeast during rice wine fermentation, and provided a valuable resource for genetic engineering to improve industrial wine-making strains. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  3. Genomic regulation of natural variation in cortical and noncortical brain volume

    Directory of Open Access Journals (Sweden)

    Laughlin Rick E

    2006-02-01

    Full Text Available Abstract Background: The relative growth of the neocortex parallels the emergence of complex cognitive functions across species. To determine the regions of the mammalian genome responsible for natural variations in cortical volume, we conducted a complex trait analysis using 34 strains of recombinant inbred (RI) strains of mice (BXD), as well as their two parental strains (C57BL/6J and DBA/2J). We measured both neocortical volume and total brain volume in 155 coronally sectioned mouse brains that were Nissl stained and embedded in celloidin. After correction for shrinkage, the measured cortical and noncortical brain volumes were entered into a multiple regression analysis, which removed the effects of body size and age from the measurements. Marker regression and interval mapping were computed using WebQTL. Results: An ANOVA revealed that more than half of the variance of these regressed phenotypes is genetically determined. We then identified the regions of the genome regulating this heritability. We located genomic regions in which a linkage disequilibrium was present using WebQTL as both a mapping engine and genomic database. For neocortex, we found a genome-wide significant quantitative trait locus (QTL) on chromosome 11 (marker D11Mit19), as well as a suggestive QTL on chromosome 16 (marker D16Mit100). In contrast, for noncortex the effect of chromosome 11 was markedly reduced, and a significant QTL appeared on chromosome 19 (D19Mit22). Conclusion: This classic pattern of double dissociation argues strongly for different genetic factors regulating relative cortical size, as opposed to brain volume more generally. It is likely, however, that the effects of proximal chromosome 11 extend beyond the neocortex strictly defined. An analysis of single nucleotide polymorphisms in these regions indicated that ciliary neurotrophic factor (Cntf) is quite possibly the gene underlying the noncortical QTL. Evidence for a candidate gene modulating neocortical

  4. Assessing genome-wide copy number variation in the Han Chinese population.

    Science.gov (United States)

    Lu, Jianqi; Lou, Haiyi; Fu, Ruiqing; Lu, Dongsheng; Zhang, Feng; Wu, Zhendong; Zhang, Xi; Li, Changhua; Fang, Baijun; Pu, Fangfang; Wei, Jingning; Wei, Qian; Zhang, Chao; Wang, Xiaoji; Lu, Yan; Yan, Shi; Yang, Yajun; Jin, Li; Xu, Shuhua

    2017-10-01

    Copy number variation (CNV) is a valuable source of genetic diversity in the human genome and a well-recognised cause of various genetic diseases. However, CNVs have been considerably under-represented in population-based studies, particularly the Han Chinese which is the largest ethnic group in the world. To build a representative CNV map for the Han Chinese population. We conducted a genome-wide CNV study involving 451 male Han Chinese samples from 11 geographical regions encompassing 28 dialect groups, representing a less-biased panel compared with the currently available data. We detected CNVs by using 4.2M NimbleGen comparative genomic hybridisation array and whole-genome deep sequencing of 51 samples to optimise the filtering conditions in CNV discovery. A comprehensive Han Chinese CNV map was built based on a set of high-quality variants (positive predictive value >0.8, with sizes ranging from 369 bp to 4.16 Mb and a median of 5907 bp). The map consists of 4012 CNV regions (CNVRs), and more than half are novel to the 30 East Asian CNV Project and the 1000 Genomes Project Phase 3. We further identified 81 CNVRs specific to regional groups, which was indicative of the subpopulation structure within the Han Chinese population. Our data are complementary to public data sources, and the CNV map may facilitate in the identification of pathogenic CNVs and further biomedical research studies involving the Han Chinese population. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  5. Analysis of the genetic variation in Mycobacterium tuberculosis strains by multiple genome alignments

    Directory of Open Access Journals (Sweden)

    Morales Juan

    2008-11-01

    Full Text Available Abstract Background: The recent determination of the complete nucleotide sequence of several Mycobacterium tuberculosis (MTB) genomes allows the use of comparative genomics as a tool for dissecting the nature and consequence of genetic variability within this species. The multiple alignment of the genomes of clinical strains (CDC1551, F11, Haarlem and C), along with the genomes of laboratory strains (H37Rv and H37Ra), provides new insights on the mechanisms of adaptation of this bacterium to the human host. Findings: The genetic variation found in six M. tuberculosis strains does not involve significant genomic rearrangements. Most of the variation results from deletion and transposition events preferentially associated with insertion sequences and genes of the PE/PPE family but not with genes implicated in virulence. Using a Perl-based software islandsanalyser, which creates a representation of the genetic variation in the genome, we identified differences in the patterns of distribution and frequency of the polymorphisms across the genome. The identification of genes displaying strain-specific polymorphisms and the extrapolation of the number of strain-specific polymorphisms to an unlimited number of genomes indicates that the different strains contain a limited number of unique polymorphisms. Conclusion: The comparison of multiple genomes demonstrates that the M. tuberculosis genome is currently undergoing an active process of gene decay, analogous to the adaptation process of obligate bacterial symbionts. This observation opens new perspectives into the evolution and the understanding of the pathogenesis of this bacterium.

  6. Genomic and karyotypic variation in Drosophila parasitoids (Hymenoptera, Cynipoidea, Figitidae)

    Directory of Open Access Journals (Sweden)

    Vladimir Gokhman

    2011-08-01

    Full Text Available Drosophila melanogaster Meigen, 1830 has served as a model insect for over a century. Sequencing of the 11 additional Drosophila Fallen, 1823 species marks substantial progress in comparative genomics of this genus. By comparison, practically nothing is known about the genome size or genome sequences of parasitic wasps of Drosophila. Here, we present the first comparative analysis of genome size and karyotype structures of Drosophila parasitoids of the Leptopilina Förster, 1869 and Ganaspis Förster, 1869 species. The gametic genome size of Ganaspis xanthopoda (Ashmead, 1896) is larger than those of the three Leptopilina species studied. The genome sizes of all parasitic wasps studied here are also larger than those known for all Drosophila species. Surprisingly, genome sizes of these Drosophila parasitoids exceed the average value known for all previously studied Hymenoptera. The haploid chromosome number of both Leptopilina heterotoma (Thomson, 1862) and L. victoriae Nordlander, 1980 is ten. A chromosomal fusion appears to have produced a distinct karyotype for L. boulardi (Barbotin, Carton et Keiner-Pillault, 1979) (n = 9), whose genome size is smaller than that of wasps of the L. heterotoma clade. Like L. boulardi, the haploid chromosome number for G. xanthopoda is also nine. Our studies reveal a positive, but nonlinear, correlation between the genome size and total chromosome length in Drosophila parasitoids. These Drosophila parasitoids differ widely in their host range, and utilize different infection strategies to overcome host defense. Their comparative genomics, in relation to their exceptionally well-characterized hosts, will prove to be valuable for understanding the molecular basis of the host-parasite arms race and how such mechanisms shape the genetic structures of insect communities.

  7. Overview of the creative genome: effects of genome structure and sequence on the generation of variation and evolution.

    Science.gov (United States)

    Caporale, Lynn Helena

    2012-09-01

    This overview of a special issue of Annals of the New York Academy of Sciences discusses uneven distribution of distinct types of variation across the genome, the dependence of specific types of variation upon distinct classes of DNA sequences and/or the induction of specific proteins, the circumstances in which distinct variation-generating systems are activated, and the implications of this work for our understanding of evolution and of cancer. Also discussed is the value of non text-based computational methods for analyzing information carried by DNA, early insights into organizational frameworks that affect genome behavior, and implications of this work for comparative genomics. © 2012 New York Academy of Sciences.

  8. Are we Genomic Mosaics? Variations of the Genome of Somatic Cells can Contribute to Diversify our Phenotypes.

    Science.gov (United States)

    Astolfi, P A; Salamini, F; Sgaramella, V

    2010-09-01

    Theoretical and experimental evidence supports the hypothesis that the genomes and the epigenomes may be different in the somatic cells of complex organisms. In the genome, the differences range from single base substitutions to chromosome number; in the epigenome, they entail multiple postsynthetic modifications of the chromatin. Somatic genome variations (SGV) may accumulate during development in response both to genetic programs, which may differ from tissue to tissue, and to environmental stimuli, which are often undetected and generally irreproducible. SGV may jeopardize physiological cellular functions, but also create novel coding and regulatory sequences, to be exposed to intraorganismal Darwinian selection. Genomes acknowledged as comparatively poor in genes, such as humans', could thus increase their pristine informational endowment. A better understanding of SGV will contribute to basic issues such as the "nature vs nurture" dualism and the inheritance of acquired characters. On the applied side, they may explain the low yield of cloning via somatic cell nuclear transfer, provide clues to some of the problems associated with transdifferentiation, and interfere with individual DNA analysis. SGV may be unique in the different cell types and in the different developmental stages, and thus explain the several hundred gaps persisting in the human genomes "completed" so far. They may compound the variations associated with our epigenomes and make of each of us an "(epi)genomic" mosaic. An ensuing paradigm is the possibility that a single genome (the ephemeral one assembled at fertilization) has the capacity to generate several different brains in response to different environments.

  9. Estimating variation within the genes and inferring the phylogeny of 186 sequenced diverse Escherichia coli genomes

    DEFF Research Database (Denmark)

    Kaas, Rolf Sommer; Rundsten, Carsten Friis; Ussery, David

    2012-01-01

    Background Escherichia coli exists in commensal and pathogenic forms. By measuring the variation of individual genes across more than a hundred sequenced genomes, gene variation can be studied in detail, including the number of mutations found for any given gene. This knowledge will be useful...... for creating better phylogenies, for determination of molecular clocks and for improved typing techniques. Results We find 3,051 gene clusters/families present in at least 95% of the genomes and 1,702 gene clusters present in 100% of the genomes. The former 'soft core' of about 3,000 gene families is perhaps...... more biologically relevant, especially considering that many of these genome sequences are draft quality. The E. coli pan-genome for this set of isolates contains 16,373 gene clusters. A core-gene tree, based on alignment and a pan-genome tree based on gene presence/absence, maps the relatedness...
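    The distinction drawn above between a "soft core" (clusters present in at least 95% of genomes), a strict core (present in 100%) and the pan-genome can be illustrated with a minimal sketch. The input layout (a dict from genome name to a set of gene-cluster identifiers), the function name and the toy data are assumptions made for illustration; they are not taken from the study.

```python
from typing import Dict, Set, Tuple


def summarize_pan_genome(
    presence: Dict[str, Set[str]], soft_threshold: float = 0.95
) -> Tuple[Set[str], Set[str], Set[str]]:
    """Return (pan_genome, soft_core, strict_core) as sets of cluster ids.

    presence maps a genome name to the gene clusters found in that genome
    (hypothetical layout, chosen for this sketch).
    """
    n_genomes = len(presence)
    counts: Dict[str, int] = {}
    for clusters in presence.values():
        for cluster in clusters:
            counts[cluster] = counts.get(cluster, 0) + 1

    pan_genome = set(counts)  # every cluster seen in any genome
    strict_core = {c for c, k in counts.items() if k == n_genomes}
    soft_core = {c for c, k in counts.items() if k / n_genomes >= soft_threshold}
    return pan_genome, soft_core, strict_core


# Toy data: 20 genomes; geneA in all, geneB missing from one, geneC in half.
toy = {f"genome{i}": {"geneA"} for i in range(20)}
for i in range(19):
    toy[f"genome{i}"].add("geneB")
for i in range(10):
    toy[f"genome{i}"].add("geneC")

pan, soft, strict = summarize_pan_genome(toy)
print(len(pan), len(soft), len(strict))
```

    A soft threshold tolerates a cluster that is missing from a few genomes, which is one reason it may suit draft-quality assemblies where a gene can be absent from the sequence without being absent from the organism.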

  10. Functional conservation of nucleosome formation selectively biases presumably neutral molecular variation in yeast genomes.

    Science.gov (United States)

    Babbitt, Gregory A; Cotter, C R

    2011-01-01

    One prominent pattern of mutational frequency, long appreciated in comparative genomics, is the bias of purine/pyrimidine conserving substitutions (transitions) over purine/pyrimidine altering substitutions (transversions). Traditionally, this transitional bias has been thought to be driven by the underlying rates of DNA mutation and/or repair. However, recent sequencing studies of mutation accumulation lines in model organisms demonstrate that substitutions generally do not accumulate at rates that would indicate a transitional bias. These observations have called into question a very basic assumption of molecular evolution; that naturally occurring patterns of molecular variation in noncoding regions accurately reflect the underlying processes of randomly accumulating neutral mutation in nuclear genomes. Here, in Saccharomyces yeasts, we report a very strong inverse association (r = -0.951, P < 0.004) between the genome-wide frequency of substitutions and their average energetic effect on nucleosome formation, as predicted by a structurally based energy model of DNA deformation around the nucleosome core. We find that transitions occurring at sites positioned nearest the nucleosome surface, which are believed to function most importantly in nucleosome formation, alter the deformation energy of DNA to the nucleosome core by only a fraction of the energy changes typical of most transversions. When we examined the same substitutions set against random background sequences as well as an existing study reporting substitutions arising in mutation accumulation lines of Saccharomyces cerevisiae, we failed to find a similar relationship. These results support the idea that natural selection acting to functionally conserve chromatin organization may contribute significantly to genome-wide transitional bias, even in noncoding regions. Because nucleosome core structure is highly conserved across eukaryotes, our observations may also help to further explain locally elevated
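    The transition/transversion distinction used above can be made concrete with a short sketch. It only classifies single-base substitutions and computes a transition-to-transversion ratio; the function names and the toy substitution list are illustrative assumptions, and nothing here reproduces the energy model described in the abstract.

```python
from typing import List, Tuple

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref: str, alt: str) -> str:
    """Label a single-base change as a 'transition' or a 'transversion'."""
    ref, alt = ref.upper(), alt.upper()
    if ref == alt or not {ref, alt} <= (PURINES | PYRIMIDINES):
        raise ValueError(f"not a valid DNA substitution: {ref}>{alt}")
    # Transition: purine<->purine or pyrimidine<->pyrimidine (class is conserved).
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


def ti_tv_ratio(substitutions: List[Tuple[str, str]]) -> float:
    """Ratio of transitions to transversions in a list of (ref, alt) pairs."""
    ti = sum(1 for r, a in substitutions if classify_substitution(r, a) == "transition")
    tv = len(substitutions) - ti
    if tv == 0:
        raise ValueError("no transversions observed; ratio is undefined")
    return ti / tv


# Hypothetical observations: two transitions and one transversion.
observed = [("A", "G"), ("C", "T"), ("A", "T")]
print(ti_tv_ratio(observed))
```

    Of the twelve possible directed single-base changes, four are transitions and eight are transversions, so a ratio well above 0.5 indicates a transitional bias relative to uniformly random substitution.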

  11. Genome-Wide Association Study Reveals Natural Variations Contributing to Drought Resistance in Crops

    Directory of Open Access Journals (Sweden)

    Hongwei Wang

    2017-06-01

    Full Text Available Crops are often cultivated in regions where they will face environmental adversities, resulting in substantial yield loss which can ultimately lead to food and societal problems. Thus, significant efforts have been made to breed stress tolerant cultivars in an attempt to minimize these problems and to produce more stability with respect to crop yields across broad geographies. Since stress tolerance is a complex and multi-genic trait, advancements with classical breeding approaches have been challenging. On the other hand, molecular breeding, which is based on transgenics, marker-assisted selection and genome editing technologies, holds great promise to enable farmers to better cope with these challenges. However, identification of the key genetic components underlying the trait is critical and will serve as the foundation for future crop genetic improvement. Recently, genome-wide association studies have made significant contributions to facilitate the discovery of natural variation contributing to stress tolerance in crops. From these studies, the identified loci can serve as targets for genomic selection or editing to enable the molecular design of new cultivars. Here, we summarize research progress on this issue and focus on the genetic basis of drought tolerance as revealed by genome-wide association studies and quantitative trait loci mapping. Although many favorable loci have been identified, elucidation of their molecular mechanisms contributing to increased stress tolerance still remains a challenge. Thus, continuous efforts are still required to functionally dissect this complex trait through comprehensive approaches, such as systems biology studies. It is expected that proper application of the acquired knowledge will enable the development of stress tolerant cultivars, allowing agricultural production to become more sustainable under dynamic environmental conditions.

  12. Transposable element distribution, abundance and role in genome size variation in the genus Oryza.

    Science.gov (United States)

    Zuccolo, Andrea; Sebastian, Aswathy; Talag, Jayson; Yu, Yeisoo; Kim, HyeRan; Collura, Kristi; Kudrna, Dave; Wing, Rod A

    2007-08-29

    The genus Oryza is composed of 10 distinct genome types, 6 diploid and 4 polyploid, and includes the world's most important food crop - rice (Oryza sativa [AA]). Genome size variation in the Oryza is more than 3-fold and ranges from 357 Mbp in Oryza glaberrima [AA] to 1283 Mbp in the polyploid Oryza ridleyi [HHJJ]. Because repetitive elements are known to play a significant role in genome size variation, we constructed random sheared small insert genomic libraries from 12 representative Oryza species and conducted a comprehensive study of the repetitive element composition, distribution and phylogeny in this genus. Particular attention was paid to the role played by the most important classes of transposable elements (Long Terminal Repeat retrotransposons, Long Interspersed Nuclear Elements, helitrons, DNA transposable elements) in shaping these genomes and in contributing to genome size variation. We identified the elements primarily responsible for the most striking genome size variation in Oryza. We demonstrated how Long Terminal Repeat retrotransposons belonging to the same families have proliferated to very different extents in various species. We also showed that the pool of Long Terminal Repeat retrotransposons is substantially conserved and ubiquitous throughout the Oryza and so its origin is ancient and its existence predates the speciation events that originated the genus. Finally we described the peculiar behavior of repeats in the species Oryza coarctata [HHKK] whose placement in the Oryza genus is controversial. Long Terminal Repeat retrotransposons are the major component of the Oryza genomes analyzed and, along with polyploidization, are the most important contributors to the genome size variation across the Oryza genus. Two families of Ty3-gypsy elements (RIRE2 and Atlantys) account for a significant portion of the genome size variations present in the Oryza genus.

  13. Defining the diverse spectrum of inversions, complex structural variation, and chromothripsis in the morbid human genome

    NARCIS (Netherlands)

    Collins, Ryan L; Brand, Harrison; Redin, Claire E.; Hanscom, Carrie; Antolik, Caroline; Stone, Matthew R; Glessner, Joseph T.; Mason, Tamara; Pregno, Giulia; Dorrani, Naghmeh; Mandrile, Giorgia; Giachino, Daniela; Perrin, Danielle; Walsh, Cole; Cipicchio, Michelle; Costello, Maura; Stortchevoi, Alexei; An, Joon Yong; Currall, Benjamin B; Seabra, Catarina M; Ragavendran, Ashok; Margolin, Lauren; Martinez-Agosto, Julian A.; Lucente, Diane; Levy, Brynn; Sanders, Jan-Stephan; Wapner, Ronald J.; Quintero-Rivera, Fabiola; Kloosterman, Wigard; Talkowski, Michael E.

    2017-01-01

    Background: Structural variation (SV) influences genome organization and contributes to human disease. However, the complete mutational spectrum of SV has not been routinely captured in disease association studies. Results: We sequenced 689 participants with autism spectrum disorder (ASD) and other

  14. Host genome variations and risk of infections during induction treatment for childhood acute lymphoblastic leukaemia

    DEFF Research Database (Denmark)

    Lund, Bendik; Wesolowska-Andersen, Agata; Lausen, Birgitte

    2014-01-01

    Objectives: To investigate association of host genomic variation and risk of infections during treatment for childhood acute lymphoblastic leukaemia (ALL). Methods: We explored association of 34 000 single-nucleotide polymorphisms (SNPs) related primarily to pharmacogenomics and immune function...

  15. DESCARTES' RULE OF SIGNS AND THE IDENTIFIABILITY OF POPULATION DEMOGRAPHIC MODELS FROM GENOMIC VARIATION DATA.

    Science.gov (United States)

    Bhaskar, Anand; Song, Yun S

    2014-01-01

    The sample frequency spectrum (SFS) is a widely-used summary statistic of genomic variation in a sample of homologous DNA sequences. It provides a highly efficient dimensional reduction of large-scale population genomic data and its mathematical dependence on the underlying population demography is well understood, thus enabling the development of efficient inference algorithms. However, it has been recently shown that very different population demographies can actually generate the same SFS for arbitrarily large sample sizes. Although in principle this nonidentifiability issue poses a thorny challenge to statistical inference, the population size functions involved in the counterexamples are arguably not so biologically realistic. Here, we revisit this problem and examine the identifiability of demographic models under the restriction that the population sizes are piecewise-defined where each piece belongs to some family of biologically-motivated functions. Under this assumption, we prove that the expected SFS of a sample uniquely determines the underlying demographic model, provided that the sample is sufficiently large. We obtain a general bound on the sample size sufficient for identifiability; the bound depends on the number of pieces in the demographic model and also on the type of population size function in each piece. In the cases of piecewise-constant, piecewise-exponential and piecewise-generalized-exponential models, which are often assumed in population genomic inferences, we provide explicit formulas for the bounds as simple functions of the number of pieces. Lastly, we obtain analogous results for the "folded" SFS, which is often used when there is ambiguity as to which allelic type is ancestral. Our results are proved using a generalization of Descartes' rule of signs for polynomials to the Laplace transform of piecewise continuous functions.
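    As a concrete illustration of the summary statistic discussed above, the following minimal sketch computes an unfolded SFS from a small 0/1 haplotype matrix and then folds it. The matrix layout (rows are sampled sequences, columns are sites, 1 marks the derived allele), the function names and the toy data are assumptions made for this example; the identifiability results themselves are not reproduced here.

```python
from typing import List, Sequence


def sample_frequency_spectrum(haplotypes: Sequence[Sequence[int]]) -> List[int]:
    """Unfolded SFS [xi_1, ..., xi_{n-1}] for n sampled sequences.

    xi_i counts the sites at which exactly i of the n sequences carry the
    derived allele (coded 1); sites with 0 or n derived copies are skipped.
    """
    n = len(haplotypes)
    sfs = [0] * (n - 1)
    for site in zip(*haplotypes):  # iterate over columns (sites)
        derived = sum(site)
        if 0 < derived < n:
            sfs[derived - 1] += 1
    return sfs


def fold(sfs: List[int]) -> List[int]:
    """Folded SFS: entry i pools derived counts i and n - i (minor-allele counts)."""
    n = len(sfs) + 1
    folded = []
    for i in range(1, n // 2 + 1):
        j = n - i
        folded.append(sfs[i - 1] if i == j else sfs[i - 1] + sfs[j - 1])
    return folded


# Toy sample: 4 sequences, 5 sites (the last two sites are not segregating).
toy = [
    [1, 0, 1, 0, 1],
    [0, 0, 1, 0, 1],
    [0, 1, 1, 0, 1],
    [0, 0, 0, 0, 1],
]
unfolded = sample_frequency_spectrum(toy)
print(unfolded, fold(unfolded))
```

    Folding discards the ancestral/derived labelling, which is why it is the usual choice when the ancestral allele cannot be identified reliably.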

  16. DESCARTES’ RULE OF SIGNS AND THE IDENTIFIABILITY OF POPULATION DEMOGRAPHIC MODELS FROM GENOMIC VARIATION DATA

    Science.gov (United States)

    Bhaskar, Anand; Song, Yun S.

    2016-01-01

    The sample frequency spectrum (SFS) is a widely-used summary statistic of genomic variation in a sample of homologous DNA sequences. It provides a highly efficient dimensional reduction of large-scale population genomic data and its mathematical dependence on the underlying population demography is well understood, thus enabling the development of efficient inference algorithms. However, it has been recently shown that very different population demographies can actually generate the same SFS for arbitrarily large sample sizes. Although in principle this nonidentifiability issue poses a thorny challenge to statistical inference, the population size functions involved in the counterexamples are arguably not so biologically realistic. Here, we revisit this problem and examine the identifiability of demographic models under the restriction that the population sizes are piecewise-defined where each piece belongs to some family of biologically-motivated functions. Under this assumption, we prove that the expected SFS of a sample uniquely determines the underlying demographic model, provided that the sample is sufficiently large. We obtain a general bound on the sample size sufficient for identifiability; the bound depends on the number of pieces in the demographic model and also on the type of population size function in each piece. In the cases of piecewise-constant, piecewise-exponential and piecewise-generalized-exponential models, which are often assumed in population genomic inferences, we provide explicit formulas for the bounds as simple functions of the number of pieces. Lastly, we obtain analogous results for the “folded” SFS, which is often used when there is ambiguity as to which allelic type is ancestral. Our results are proved using a generalization of Descartes’ rule of signs for polynomials to the Laplace transform of piecewise continuous functions. PMID:28018011

  17. New Regions of the Human Genome Linked to Skin Color Variation in Some African Populations

    Science.gov (United States)

    In the first study of its kind, an international team of genomics researchers has identified new regions of the human genome that are associated with skin color variation in some African populations, opening new avenues for research on skin diseases and cancer in all populations.

  18. An integrated map of genetic variation from 1,092 human genomes

    DEFF Research Database (Denmark)

    Abecasis, Goncalo R.; Auton, Adam; Brooks, Lisa D.

    2012-01-01

    By characterizing the geographic and functional spectrum of human genetic variation, the 1000 Genomes Project aims to build a resource to help to understand the genetic contribution to disease. Here we describe the genomes of 1,092 individuals from 14 populations, constructed using a combination ...

  19. Copy number variation identification and analysis of the chicken genome using a 60K SNP BeadChip.

    Science.gov (United States)

    Rao, Y S; Li, J; Zhang, R; Lin, X R; Xu, J G; Xie, L; Xu, Z Q; Wang, L; Gan, J K; Xie, X J; He, J; Zhang, X Q

    2016-08-01

    Copy number variation (CNV) is an important source of genetic variation in organisms and a main factor that affects phenotypic variation. A comprehensive study of chicken CNV can provide valuable information on genetic diversity and facilitate future analyses of associations between CNV and economically important traits in chickens. In the present study, an F2 full-sib chicken population (554 individuals), established from a cross between Xinghua and White Recessive Rock chickens, was used to explore CNV in the chicken genome. Genotyping was performed using a chicken 60K SNP BeadChip. A total of 1,875 CNV were detected with the PennCNV algorithm, and the average number of CNV was 3.42 per individual. The CNV were distributed across 383 independent CNV regions (CNVR) and covered 41 megabases (3.97%) of the chicken genome. Seven CNVR in 108 individuals were validated by quantitative real-time PCR, and 81 of these individuals (75%) also were detected with the PennCNV algorithm. In total, 274 CNVR (71.54%) identified in the current study were previously reported. Of these, 147 (38.38%) were reported in at least 2 studies. Additionally, 109 of the CNVR (28.46%) discovered here are novel. A total of 709 genes within or overlapping with the CNVR was retrieved. Out of the 2,742 quantitative trait loci (QTL) collected in the chicken QTL database, 43 QTL had confidence intervals overlapping with the CNVR, and 32 CNVR encompassed one or more functional genes. The functional genes located in the CNVR are likely to be the QTG that are associated with underlying economic traits. This study considerably expands our insight into the structural variation in the genome of chickens and provides an important resource for genomic variation, especially for genomic structural variation related to economic traits in chickens. © 2016 Poultry Science Association Inc.
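    The step from individual CNV calls to CNV regions (CNVR) can be sketched as an interval merge. The abstract does not state the merging rule that was used, so the rule below (CNVs on the same chromosome that overlap by at least one base are joined into one region) is an assumption, as are the tuple layout, the function name and the toy coordinates.

```python
from typing import List, Tuple

# (chromosome, start, end) with inclusive coordinates; hypothetical layout.
CNV = Tuple[str, int, int]


def merge_into_cnvrs(cnvs: List[CNV]) -> List[CNV]:
    """Merge overlapping CNV calls on the same chromosome into CNV regions."""
    regions: List[CNV] = []
    for chrom, start, end in sorted(cnvs):
        if regions and regions[-1][0] == chrom and start <= regions[-1][2]:
            # Overlaps the current region: extend its right edge if needed.
            prev_chrom, prev_start, prev_end = regions[-1]
            regions[-1] = (prev_chrom, prev_start, max(prev_end, end))
        else:
            regions.append((chrom, start, end))
    return regions


def covered_bases(regions: List[CNV]) -> int:
    """Total number of bases covered by a list of non-overlapping regions."""
    return sum(end - start + 1 for _, start, end in regions)


# Toy calls pooled across individuals.
calls = [
    ("chr1", 150, 300),
    ("chr1", 100, 200),
    ("chr1", 400, 500),
    ("chr2", 100, 200),
]
cnvrs = merge_into_cnvrs(calls)
print(cnvrs, covered_bases(cnvrs))
```

    Dividing the covered bases by the genome length would give a coverage fraction of the kind reported in the abstract; other studies define regions with a minimum reciprocal overlap instead, which would yield more and smaller regions.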

  20. Genome-size Variation in Switchgrass (Panicum virgatum): Flow Cytometry and Cytology Reveal Rampant Aneuploidy

    Directory of Open Access Journals (Sweden)

    Denise E. Costich

    2010-11-01

    Full Text Available Switchgrass (Panicum virgatum L.), a native perennial dominant of the prairies of North America, has been targeted as a model herbaceous species for biofeedstock development. A flow-cytometric survey of a core set of 11 primarily upland polyploid switchgrass accessions indicated that there was considerable variation in genome size within each accession, particularly at the octoploid (2n = 8x = 72 chromosome) ploidy level. Highly variable chromosome counts in mitotic cell preparations indicated that aneuploidy was more common in octoploids (86.3%) than tetraploids (23.2%). Furthermore, the incidence of hyper- versus hypoaneuploidy is equivalent in tetraploids. This is clearly not the case in octoploids, where close to 90% of the aneuploid counts are lower than the euploid number. Cytogenetic investigation using fluorescent in situ hybridization (FISH) revealed an unexpected degree of variation in chromosome structure underlying the apparent genomic instability of this species. These results indicate that rapid advances in the breeding of polyploid biofuel feedstocks, based on the molecular-genetic dissection of biomass characteristics and yield, will be predicated on the continual improvement of our understanding of the cytogenetics of these species.

  1. Trait variation and genetic diversity in a banana genomic selection training population.

    Directory of Open Access Journals (Sweden)

    Moses Nyine

    Full Text Available Banana (Musa spp.) is an important crop in the African Great Lakes region in terms of income and food security, with the highest per capita consumption worldwide. Pests, diseases and climate change hamper sustainable production of bananas. New breeding tools with increased crossbreeding efficiency are being investigated to breed for resistant, high yielding hybrids of East African Highland banana (EAHB). These include genomic selection (GS), which will benefit breeding through increased genetic gain per unit time. Understanding trait variation and the correlation among economically important traits is an essential first step in the development and selection of suitable GS models for banana. In this study, we tested the hypothesis that trait variations in bananas are not affected by cross combination, cycle, field management and their interaction with genotype. A training population created using EAHB breeding material and its progeny was phenotyped in two contrasting conditions. A high level of correlation among vegetative and yield related traits was observed. Therefore, genomic selection models could be developed for traits that are easily measured. It is likely that the predictive ability of traits that are difficult to phenotype will be similar to less difficult traits they are highly correlated with. Genotype response to cycle and field management practices varied greatly with respect to traits. Yield related traits accounted for 31-35% of principal component variation under low and high input field management conditions. Resistance to Black Sigatoka was stable across cycles but varied under different field management depending on the genotype. The best cross combination was 1201K-1xSH3217 based on selection response (R) of hybrids. Genotyping using simple sequence repeat (SSR) markers revealed that the training population was genetically diverse, reflecting a complex pedigree background, which was mostly influenced by the male parents.

  2. Trait variation and genetic diversity in a banana genomic selection training population.

    Science.gov (United States)

    Nyine, Moses; Uwimana, Brigitte; Swennen, Rony; Batte, Michael; Brown, Allan; Christelová, Pavla; Hřibová, Eva; Lorenzen, Jim; Doležel, Jaroslav

    2017-01-01

    Banana (Musa spp.) is an important crop in the African Great Lakes region in terms of income and food security, with the highest per capita consumption worldwide. Pests, diseases and climate change hamper sustainable production of bananas. New breeding tools with increased crossbreeding efficiency are being investigated to breed for resistant, high yielding hybrids of East African Highland banana (EAHB). These include genomic selection (GS), which will benefit breeding through increased genetic gain per unit time. Understanding trait variation and the correlation among economically important traits is an essential first step in the development and selection of suitable GS models for banana. In this study, we tested the hypothesis that trait variations in bananas are not affected by cross combination, cycle, field management and their interaction with genotype. A training population created using EAHB breeding material and its progeny was phenotyped in two contrasting conditions. A high level of correlation among vegetative and yield related traits was observed. Therefore, genomic selection models could be developed for traits that are easily measured. It is likely that the predictive ability of traits that are difficult to phenotype will be similar to less difficult traits they are highly correlated with. Genotype response to cycle and field management practices varied greatly with respect to traits. Yield related traits accounted for 31-35% of principal component variation under low and high input field management conditions. Resistance to Black Sigatoka was stable across cycles but varied under different field management depending on the genotype. The best cross combination was 1201K-1xSH3217 based on selection response (R) of hybrids. Genotyping using simple sequence repeat (SSR) markers revealed that the training population was genetically diverse, reflecting a complex pedigree background, which was mostly influenced by the male parents.

  3. Trait variation and genetic diversity in a banana genomic selection training population

    Science.gov (United States)

    Nyine, Moses; Uwimana, Brigitte; Swennen, Rony; Batte, Michael; Brown, Allan; Christelová, Pavla; Hřibová, Eva; Lorenzen, Jim

    2017-01-01

    Banana (Musa spp.) is an important crop in the African Great Lakes region in terms of income and food security, with the highest per capita consumption worldwide. Pests, diseases and climate change hamper sustainable production of bananas. New breeding tools with increased crossbreeding efficiency are being investigated to breed for resistant, high yielding hybrids of East African Highland banana (EAHB). These include genomic selection (GS), which will benefit breeding through increased genetic gain per unit time. Understanding trait variation and the correlation among economically important traits is an essential first step in the development and selection of suitable GS models for banana. In this study, we tested the hypothesis that trait variations in bananas are not affected by cross combination, cycle, field management and their interaction with genotype. A training population created using EAHB breeding material and its progeny was phenotyped in two contrasting conditions. A high level of correlation among vegetative and yield related traits was observed. Therefore, genomic selection models could be developed for traits that are easily measured. It is likely that the predictive ability of traits that are difficult to phenotype will be similar to less difficult traits they are highly correlated with. Genotype response to cycle and field management practices varied greatly with respect to traits. Yield related traits accounted for 31–35% of principal component variation under low and high input field management conditions. Resistance to Black Sigatoka was stable across cycles but varied under different field management depending on the genotype. The best cross combination was 1201K-1xSH3217 based on selection response (R) of hybrids. Genotyping using simple sequence repeat (SSR) markers revealed that the training population was genetically diverse, reflecting a complex pedigree background, which was mostly influenced by the male parents. PMID:28586365

  4. Alignment-free phylogeny of whole genomes using underlying subwords

    Directory of Open Access Journals (Sweden)

    Comin Matteo

    2012-12-01

    Full Text Available Background: With the progress of modern sequencing technologies a large number of complete genomes are now available. Traditionally the comparison of two related genomes is carried out by sequence alignment. There are cases where these techniques cannot be applied, for example if two genomes do not share the same set of genes, or if they are not alignable to each other due to low sequence similarity, rearrangements and inversions, or more specifically to their lengths when the organisms belong to different species. For these cases the comparison of complete genomes can be carried out only with ad hoc methods that are usually called alignment-free methods. Methods: In this paper we propose a distance function based on subword compositions called Underlying Approach (UA). We prove that the matching statistics, a popular concept in the field of string algorithms able to capture the statistics of common words between two sequences, can be derived from a small set of “independent” subwords, namely the irredundant common subwords. We define a distance-like measure based on these subwords, such that each region of genomes contributes only once, thus avoiding counting shared subwords multiple times. In a nutshell, this filter discards subwords occurring in regions covered by other more significant subwords. Results: The Underlying Approach (UA) builds a scoring function based on this set of patterns, called underlying. We prove that this set is by construction linear in the size of input, without overlaps, and can be efficiently constructed. Results show the validity of our method in the reconstruction of phylogenetic trees, where the Underlying Approach outperforms the current state of the art methods. Moreover, we show that the accuracy of UA is achieved with a very small number of subwords, which in some cases carry meaningful biological information. Availability: http://www.dei.unipd.it/~ciompin/main/underlying.html
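    The matching statistics mentioned above can be illustrated with a deliberately naive sketch: for each position of one sequence, it records the length of the longest substring starting there that also occurs somewhere in the other sequence. This is not the Underlying Approach itself; the irredundant-subword filter and the distance measure are not implemented, and the function name and toy sequences are assumptions for illustration only.

```python
from typing import List


def matching_statistics(s: str, t: str) -> List[int]:
    """ms[i] = length of the longest prefix of s[i:] that occurs in t.

    Naive version based on repeated substring search; efficient
    implementations typically rely on suffix trees or suffix arrays.
    """
    ms = []
    for i in range(len(s)):
        length = 0
        # Extend the match one character at a time while it still occurs in t.
        while i + length < len(s) and s[i:i + length + 1] in t:
            length += 1
        ms.append(length)
    return ms


# Toy sequences: "CGT" is the longest word shared by both.
ms = matching_statistics("ACGT", "CGTA")
print(ms)
```

    Long entries in the result point to long shared words, which is the kind of signal that alignment-free measures built on matching statistics summarise into a single score.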

  5. Bread wheat progenitors: Aegilops tauschii (DD genome) and Triticum dicoccoides (AABB genome) reveal differential antioxidative response under water stress.

    Science.gov (United States)

    Suneja, Yadhu; Gupta, Anil Kumar; Bains, Navtej Singh

    2017-01-01

    Antioxidant enzymes are known to play a significant role in scavenging reactive oxygen species and maintaining cellular homeostasis. Activity of four antioxidant enzymes viz., superoxide dismutase (SOD), catalase (CAT), ascorbate peroxidase (APX) and glutathione reductase (GR) was examined in the flag leaves of nine Aegilops tauschii and three Triticum dicoccoides accessions along with two bread wheat cultivars under irrigated and rain-fed conditions. These accessions were shortlisted from a larger set on the basis of field performance for a set of morpho-physiological traits. At anthesis, significant differences were observed in enzyme activities in two environments. A 45% elevation in average GR activity was observed under rain-fed conditions. Genotypic variation was evident within each environment as well as in terms of response to stress environment. Aegilops tauschii accession 3769 (86% increase in SOD, 41% in CAT, 72% in APX, 48% in GR activity) and acc. 14096 (37% increase in SOD, 32% CAT, 25% APX, 42% GR) showed up-regulation in the activity of all the four studied antioxidant enzymes. Aegilops tauschii accessions 9809, 14189 and 14113 also seemed to have strong induction mechanism as elevated activity of at least three enzymes was observed in them under rain-fed conditions. T. dicoccoides, on the other hand, maintained active antioxidative machinery under irrigated condition with relatively lower induction under stress. A significant positive correlation (r = 0.760) was identified between change in the activity of CAT and GR under stress. Changes in plant height, spike length and grain weight were recorded under stress and non-stress conditions on the basis of which a cumulative tolerance index was deduced and accessions were ranked for drought tolerance. Overall, Ae. tauschii accession 3769, 14096, 14113 (DD-genome) and T. dicoccoides accession 7054 (AABB-genome) may be used as donors to combine beneficial stress adaptive traits of all the three sub-genomes

  6. Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology

    DEFF Research Database (Denmark)

    Cao, Hongzhi; Hastie, Alex R.; Cao, Dandan

    2014-01-01

    mutations; however, none of the current detection methods are comprehensive, and currently available methodologies are incapable of providing sufficient resolution and unambiguous information across complex regions in the human genome. To address these challenges, we applied a high-throughput, cost......-effective genome mapping technology to comprehensively discover genome-wide SVs and characterize complex regions of the YH genome using long single molecules (>150 kb) in a global fashion. RESULTS: Utilizing nanochannel-based genome mapping technology, we obtained 708 insertions/deletions and 17 inversions larger...... fosmid data. Of the remaining 270 SVs, 260 are insertions and 213 overlap known SVs in the Database of Genomic Variants. Overall, 609 out of 666 (90%) variants were supported by experimental orthogonal methods or historical evidence in public databases. At the same time, genome mapping also provides...

  7. Detecting microsatellites within genomes: significant variation among algorithms

    Directory of Open Access Journals (Sweden)

    Rivals Eric

    2007-04-01

    Full Text Available Abstract Background Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker). Results Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameter settings, and the analysis was extended to three other genomes (Saccharomyces cerevisiae, Neurospora crassa and Drosophila melanogaster) spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp), regardless of motif. Conclusion Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should therefore be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions.
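
The point about user-defined parameters can be made concrete with a toy detector. The sketch below is an assumption-laden illustration, not the algorithm of TRF, Mreps, Sputnik, STAR or RepeatMasker: it finds only perfect tandem repeats with a regular expression, and the function name and the parameters `max_motif` and `min_length` are hypothetical. Because the scan is non-overlapping, a short repeat can in principle mask the start of a longer one, which is one more way in which detector design affects the result.

```python
import re


def find_perfect_microsatellites(seq, max_motif=6, min_length=12):
    """Return (start, motif, total_length) for perfect tandem repeats.

    max_motif  : longest repeat unit considered, in bp (hypothetical parameter)
    min_length : shortest total repeat length reported, in bp (hypothetical)
    """
    # Lazy motif quantifier: the shortest repeating unit is tried first,
    # then "\1+" requires at least one further copy of that unit.
    pattern = re.compile(r"([ACGT]{1,%d}?)\1+" % max_motif)
    hits = []
    for match in pattern.finditer(seq):
        total = len(match.group(0))
        if total >= min_length:
            hits.append((match.start(), match.group(1), total))
    return hits


if __name__ == "__main__":
    sequence = "GGT" + "AC" * 6 + "TTG"
    for threshold in (2, 12, 14):
        print(threshold, find_perfect_microsatellites(sequence, min_length=threshold))
```

Changing only `min_length` changes how many repeats are reported for the same sequence, which mirrors, in miniature, the sensitivity to parameter choice that the abstract describes.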

  8. Pan-Genome Analysis Links the Hereditary Variation of Leptospirillum ferriphilum With Its Evolutionary Adaptation

    Directory of Open Access Journals (Sweden)

    Xian Zhang

    2018-03-01

    Full Text Available Niche adaptation has long been recognized to drive intra-species differentiation and speciation, yet knowledge about its relatedness with hereditary variation of microbial genomes is relatively limited. Using Leptospirillum ferriphilum species as a case study, we present a detailed analysis of genomic features of five recognized strains. Genome-to-genome distance calculation preliminarily determined the roles of spatial distance and environmental heterogeneity that potentially contribute to intra-species variation within L. ferriphilum species at the genome level. Mathematical models were further constructed to extrapolate the expansion of L. ferriphilum genomes (an ‘open’ pan-genome), indicating the emergence of novel genes with newly sequenced genomes. The identification of diverse mobile genetic elements (MGEs) (such as transposases, integrases, and phage-associated genes) revealed the prevalence of horizontal gene transfer events, which is an important evolutionary mechanism that provides avenues for the recruitment of novel functionalities and further for the genetic divergence of microbial genomes. Comprehensive analysis also demonstrated that the genome reduction by gene loss in a broad sense might contribute to the observed diversification. We thus inferred a plausible explanation to address this observation: the community-dependent adaptation that potentially economizes the limiting resources of the entire community. Now that the introduction of new genes is accompanied by a parallel abandonment of some other ones, our results provide snapshots on the biological fitness cost of environmental adaptation within the L. ferriphilum genomes. In short, our genome-wide analyses bridge the relation between genetic variation of L. ferriphilum with its evolutionary adaptation.

  9. ChickVD: a sequence variation database for the chicken genome

    DEFF Research Database (Denmark)

    Wang, Jing; He, Ximiao; Ruan, Jue

    2005-01-01

    Working in parallel with the efforts to sequence the chicken (Gallus gallus) genome, the Beijing Genomics Institute led an international team of scientists from China, USA, UK, Sweden, The Netherlands and Germany to map extensive DNA sequence variation throughout the chicken genome by sampling DN...... on quantitative trait loci using data from collaborating institutions and public resources. Our data can be queried by search engine and homology-based BLAST searches. ChickVD is publicly accessible at http://chicken.genomics.org.cn. Publication date: 2005-Jan-1...

  10. Structural genomic variation as risk factor for idiopathic recurrent miscarriage

    DEFF Research Database (Denmark)

    Nagirnaja, Liina; Palta, Priit; Kasak, Laura

    2014-01-01

    Recurrent miscarriage (RM) is a multifactorial disorder with acknowledged genetic heritability that affects ∼3% of couples aiming at childbirth. As copy number variants (CNVs) have been shown to contribute to reproductive disease susceptibility, we aimed to describe genome-wide profile of CNVs an...

  11. Population genomics of Pacific lamprey: adaptive variation in a highly dispersive species.

    Science.gov (United States)

    Hess, Jon E; Campbell, Nathan R; Close, David A; Docker, Margaret F; Narum, Shawn R

    2013-06-01

    Unlike most anadromous fishes that have evolved strict homing behaviour, Pacific lamprey (Entosphenus tridentatus) seem to lack philopatry as evidenced by minimal population structure across the species range. Yet unexplained findings of within-region population genetic heterogeneity coupled with the morphological and behavioural diversity described for the species suggest that adaptive genetic variation underlying fitness traits may be responsible. We employed restriction site-associated DNA sequencing to genotype 4439 quality filtered single nucleotide polymorphism (SNP) loci for 518 individuals collected across a broad geographical area including British Columbia, Washington, Oregon and California. A subset of putatively neutral markers (N = 4068) identified a significant amount of variation among three broad populations: northern British Columbia, Columbia River/southern coast and 'dwarf' adults (F(CT) = 0.02, P ≪ 0.001). Additionally, 162 SNPs were identified as adaptive through outlier tests, and inclusion of these markers revealed a signal of adaptive variation related to geography and life history. The majority of the 162 adaptive SNPs were not independent and formed four groups of linked loci. Analyses with matsam software found that 42 of these outlier SNPs were significantly associated with geography, run timing and dwarf life history, and 27 of these 42 SNPs aligned with known genes or highly conserved genomic regions using the genome browser available for sea lamprey. This study provides both neutral and adaptive context for observed genetic divergence among collections and thus reconciles previous findings of population genetic heterogeneity within a species that displays extensive gene flow. © 2012 John Wiley & Sons Ltd.

  12. Genetic and epigenetic variation in 5S ribosomal RNA genes reveals genome dynamics in Arabidopsis thaliana.

    Science.gov (United States)

    Simon, Lauriane; Rabanal, Fernando A; Dubos, Tristan; Oliver, Cecilia; Lauber, Damien; Poulet, Axel; Vogt, Alexander; Mandlbauer, Ariane; Le Goff, Samuel; Sommer, Andreas; Duborjal, Hervé; Tatout, Christophe; Probst, Aline V

    2018-04-06

    Organized in tandem repeat arrays in most eukaryotes and transcribed by RNA polymerase III, expression of 5S rRNA genes is under epigenetic control. To unveil mechanisms of transcriptional regulation, we obtained here in depth sequence information on 5S rRNA genes from the Arabidopsis thaliana genome and identified differential enrichment in epigenetic marks between the three 5S rDNA loci situated on chromosomes 3, 4 and 5. We reveal the chromosome 5 locus as the major source of an atypical, long 5S rRNA transcript characteristic of an open chromatin structure. 5S rRNA genes from this locus translocated in the Landsberg erecta ecotype as shown by linkage mapping and chromosome-specific FISH analysis. These variations in 5S rDNA locus organization cause changes in the spatial arrangement of chromosomes in the nucleus. Furthermore, 5S rRNA gene arrangements are highly dynamic with alterations in chromosomal positions through translocations in certain mutants of the RNA-directed DNA methylation pathway and important copy number variations among ecotypes. Finally, variations in 5S rRNA gene sequence, chromatin organization and transcripts indicate differential usage of 5S rDNA loci in distinct ecotypes. We suggest that both the usage of existing and new 5S rDNA loci resulting from translocations may impact neighboring chromatin organization.

  13. Comprehensive genomic characterization of campylobacter genus reveals some underlying mechanisms for its genomic diversification.

    Directory of Open Access Journals (Sweden)

    Yizhuang Zhou

    Full Text Available Campylobacter species are phenotypically diverse in many aspects including host habitats and pathogenicities, which demands comprehensive characterization of the entire Campylobacter genus to study their underlying genetic diversification. Up to now, 34 Campylobacter strains have been sequenced and published in public databases, providing good opportunity to systematically analyze their genomic diversities. In this study, we first conducted genomic characterization, which includes genome-wide alignments, pan-genome analysis, and phylogenetic identification, to depict the genetic diversity of Campylobacter genus. Afterward, we improved the tetranucleotide usage pattern-based naïve Bayesian classifier to identify the abnormal composition fragments (ACFs, fragments with significantly different tetranucleotide frequency profiles from its genomic tetranucleotide frequency profiles), including horizontal gene transfers (HGTs), to explore the mechanisms for the genetic diversity of this organism. Finally, we analyzed the HGTs transferred via bacteriophage transductions. To our knowledge, this study is the first to use single nucleotide polymorphism information to construct reliable microevolution phylogeny of 21 Campylobacter jejuni strains. Combined with the phylogeny of all the collected Campylobacter species based on genome-wide core gene information, comprehensive phylogenetic inference of all 34 Campylobacter organisms was determined. It was found that C. jejuni harbors a high fraction of ACFs possibly through intraspecies recombination, whereas other Campylobacter members possess numerous ACFs possibly via intragenus recombination. Furthermore, some Campylobacter strains have undergone significant ancient viral integration during their evolution process. The improved method is a powerful tool for bacterial genomic analysis. Moreover, the findings would provide useful information for future research on Campylobacter genus.
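
The idea of comparing a fragment's tetranucleotide frequency profile with that of its genome can be sketched as follows. This is only an illustration of the profile comparison, not the improved naïve Bayesian classifier described in the abstract: the function names and the simple absolute-difference distance are assumptions chosen for brevity.

```python
from collections import Counter
from itertools import product

# All 256 possible tetranucleotides over the DNA alphabet.
KMERS = ["".join(p) for p in product("ACGT", repeat=4)]


def tetra_profile(seq):
    """Relative frequency of every tetranucleotide in seq (overlapping windows)."""
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    total = sum(counts[k] for k in KMERS)  # ignores windows with non-ACGT symbols
    if total == 0:
        return {k: 0.0 for k in KMERS}
    return {k: counts[k] / total for k in KMERS}


def profile_distance(p, q):
    """Sum of absolute frequency differences (an illustrative choice of distance)."""
    return sum(abs(p[k] - q[k]) for k in KMERS)


if __name__ == "__main__":
    genome = "ACGTACGTACGTACGTACGT"
    fragment = "AAAAAAAAAAAA"
    print(profile_distance(tetra_profile(genome), tetra_profile(fragment)))
```

A fragment whose profile lies far from the genome-wide profile would be a candidate for atypical composition; any threshold for "far" would have to be calibrated and is not given in the abstract.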

  14. Genome-wide patterns of copy number variation in the diversified chicken genomes using next-generation sequencing.

    Science.gov (United States)

    Yi, Guoqiang; Qu, Lujiang; Liu, Jianfeng; Yan, Yiyuan; Xu, Guiyun; Yang, Ning

    2014-11-07

    Copy number variation (CNV) is important and widespread in the genome, and is a major cause of disease and phenotypic diversity. Herein, we performed a genome-wide CNV analysis in 12 diversified chicken genomes based on whole genome sequencing. A total of 8,840 CNV regions (CNVRs) covering 98.2 Mb and representing 9.4% of the chicken genome were identified, ranging in size from 1.1 to 268.8 kb with an average of 11.1 kb. Sequencing-based predictions were confirmed at a high validation rate by two independent approaches, including array comparative genomic hybridization (aCGH) and quantitative PCR (qPCR). The Pearson's correlation coefficients between sequencing and aCGH results ranged from 0.435 to 0.755, and qPCR experiments revealed a positive validation rate of 91.71% and a false negative rate of 22.43%. In total, 2,214 (25.0%) predicted CNVRs span 2,216 (36.4%) RefSeq genes associated with specific biological functions. Besides two previously reported copy number variable genes EDN3 and PRLR, we also found some promising genes with potential in phenotypic variation. Two genes, FZD6 and LIMS1, related to disease susceptibility/resistance are covered by CNVRs. The highly duplicated SOCS2 may lead to higher bone mineral density. Entire or partial duplication of some genes like POPDC3 may have great economic importance in poultry breeding. Our results based on extensive genetic diversity provide a more refined chicken CNV map and genome-wide gene copy number estimates, and warrant future CNV association studies for important traits in chickens.

  15. Variation in Payment Rates under Medicare's Inpatient Prospective Payment System.

    Science.gov (United States)

    Krinsky, Sam; Ryan, Andrew M; Mijanovich, Tod; Blustein, Jan

    2017-04-01

    To measure variation in payment rates under Medicare's Inpatient Prospective Payment System (IPPS) and identify the main payment adjustments that drive variation. Medicare cost reports for all Medicare-certified hospitals, 1987-2013, and Dartmouth Atlas geographic files. We measure the Medicare payment rate as a hospital's total acute inpatient Medicare Part A payment, divided by the standard IPPS payment for its geographic area. We assess variation using several measures, both within local markets and nationally. We perform a factor decomposition to identify the share of variation attributable to specific adjustments. We also describe the characteristics of hospitals receiving different payment rates and evaluate changes in the magnitude of the main adjustments over time. Data downloaded from the Centers for Medicare and Medicaid Services, the National Bureau of Economic Research, and the Dartmouth Atlas. In 2013, Medicare paid for acute inpatient discharges at a rate 31 percent above the IPPS base. For the top 10 percent of discharges, the mean rate was double the IPPS base. Variations were driven by adjustments for medical education and care to low-income populations. The magnitude of variation has increased over time. Adjustments are a large and growing share of Medicare hospital payments, and they create significant variation in payment rates. © Health Research and Educational Trust.

  16. PolyTB: A genomic variation map for Mycobacterium tuberculosis

    KAUST Repository

    Coll, Francesc; Preston, Mark; Guerra-Assunção, José Afonso; Hill-Cawthorn, Grant; Harris, David; Perdigão, João; Viveiros, Miguel; Portugal, Isabel; Drobniewski, Francis; Gagneux, Sebastien; Glynn, Judith R.; Pain, Arnab; Parkhill, Julian; McNerney, Ruth; Martin, Nigel; Clark, Taane G.

    2014-01-01

    ://pathogenseq.lshtm.ac.uk/polytb) to visualise the resulting variation and important meta-data (e.g. in silico inferred strain-types, location) within geographical map and phylogenetic views. This resource will allow researchers to identify polymorphisms within candidate genes of interest

  17. Deciphering molecular circuits from genetic variation underlying transcriptional responsiveness to stimuli.

    Science.gov (United States)

    Gat-Viks, Irit; Chevrier, Nicolas; Wilentzik, Roni; Eisenhaure, Thomas; Raychowdhury, Raktima; Steuerman, Yael; Shalek, Alex K; Hacohen, Nir; Amit, Ido; Regev, Aviv

    2013-04-01

    Individual genetic variation affects gene responsiveness to stimuli, often by influencing complex molecular circuits. Here we combine genomic and intermediate-scale transcriptional profiling with computational methods to identify variants that affect the responsiveness of genes to stimuli (responsiveness quantitative trait loci or reQTLs) and to position these variants in molecular circuit diagrams. We apply this approach to study variation in transcriptional responsiveness to pathogen components in dendritic cells from recombinant inbred mouse strains. We identify reQTLs that correlate with particular stimuli and position them in known pathways. For example, in response to a virus-like stimulus, a trans-acting variant responds as an activator of the antiviral response; using RNA interference, we identify Rgs16 as the likely causal gene. Our approach charts an experimental and analytic path to decipher the mechanisms underlying genetic variation in circuits that control responses to stimuli.

  18. Human-specific HERV-K insertion causes genomic variations in the human genome.

    Directory of Open Access Journals (Sweden)

    Wonseok Shin

    Full Text Available Human endogenous retrovirus (HERV) sequences account for about 8% of the human genome. Through comparative genomics and literature mining, we identified a total of 29 human-specific HERV-K insertions. We characterized them focusing on their structure and flanking sequence. The results showed that four of the human-specific HERV-K insertions deleted human genomic sequences via non-classical insertion mechanisms. Interestingly, two of the human-specific HERV-K insertion loci contained two HERV-K internals and three LTR elements, a pattern which could be explained by LTR-LTR ectopic recombination or template switching. In addition, we conducted a polymorphic test and observed that twelve out of the 29 elements are polymorphic in the human population. In conclusion, human-specific HERV-K elements have inserted into the human genome since the divergence of human and chimpanzee, causing human genomic changes. Thus, we believe that human-specific HERV-K activity has contributed to the genomic divergence between humans and chimpanzees, as well as within the human population.

  19. Sequencing the CHO DXB11 genome reveals regional variations in genomic stability and haploidy

    DEFF Research Database (Denmark)

    Kaas, Christian Schrøder; Kristensen, Claus; Betenbaugh, Michael J.

    2015-01-01

    Background: The DHFR negative CHO DXB11 cell line (also known as DUX-B11 and DUKX) was historically the first CHO cell line to be used for large scale production of heterologous proteins and is still used for production of a number of complex proteins.  Results: Here we present the genomic sequence...... of the CHO DXB11 genome sequenced to a depth of 33x. Overall a significant genomic drift was seen favoring GC -> AT point mutations in line with the chemical mutagenesis strategy used for generation of the cell line. The sequencing depth for each gene in the genome revealed distinct peaks at sequencing...... in eight additional analyzed CHO genomes (15-20% haploidy) but not in the genome of the Chinese hamster. The dhfr gene is confirmed to be haploid in CHO DXB11; transcriptionally active and the remaining allele contains a G410C point mutation causing a Thr137Arg missense mutation. We find similar to 2...

  20. Genetic Variation in the Nuclear and Organellar Genomes Modulates Stochastic Variation in the Metabolome, Growth, and Defense

    Science.gov (United States)

    Joseph, Bindu; Corwin, Jason A.; Kliebenstein, Daniel J.

    2015-01-01

    Recent studies are starting to show that genetic control over stochastic variation is a key evolutionary solution of single celled organisms in the face of unpredictable environments. This has been expanded to show that genetic variation can alter stochastic variation in transcriptional processes within multi-cellular eukaryotes. However, little is known about how genetic diversity can control stochastic variation within more non-cell autonomous phenotypes. Using an Arabidopsis reciprocal RIL population, we showed that there is significant genetic diversity influencing stochastic variation in the plant metabolome, defense chemistry, and growth. This genetic diversity included loci specific for the stochastic variation of each phenotypic class that did not affect the other phenotypic classes or the average phenotype. This suggests that the organism's networks are established so that noise can exist in one phenotypic level like metabolism and not permeate up or down to different phenotypic levels. Further, the genomic variation within the plastid and mitochondria also had significant effects on the stochastic variation of all phenotypic classes. The genetic influence over stochastic variation within the metabolome was highly metabolite specific, with neighboring metabolites in the same metabolic pathway frequently showing different levels of noise. As expected from bet-hedging theory, there was more genetic diversity and a wider range of stochastic variation for defense chemistry than found for primary metabolism. Thus, it is possible to begin dissecting the stochastic variation of whole organismal phenotypes in multi-cellular organisms. Further, there are loci that modulate stochastic variation at different phenotypic levels. Finding the identity of these genes will be key to developing complete models linking genotype to phenotype. PMID:25569687

  1. A refined model of the genomic basis for phenotypic variation in vertebrate hemostasis.

    Science.gov (United States)

    Ribeiro, Ângela M; Zepeda-Mendoza, M Lisandra; Bertelsen, Mads F; Kristensen, Annemarie T; Jarvis, Erich D; Gilbert, M Thomas P; da Fonseca, Rute R

    2015-06-30

    Hemostasis is a defense mechanism that enhances an organism's survival by minimizing blood loss upon vascular injury. In vertebrates, hemostasis has been evolving with the cardio-vascular and hemodynamic systems over the last 450 million years. Birds and mammals have very similar vascular and hemodynamic systems, thus the mechanism that blocks ruptures in the vasculature is expected to be the same. However, the speed of the process varies across vertebrates, and is particularly slow for birds. Understanding the differences in the hemostasis pathway between birds and mammals, and placing them in perspective to other vertebrates may provide clues to the genetic contribution to variation in blood clotting phenotype in vertebrates. We compiled genomic data corresponding to key elements involved in hemostasis across vertebrates to investigate its genetic basis and understand how it affects fitness. We found that: i) fewer genes are involved in hemostasis in birds compared to mammals; and ii) the largest differences concern platelet membrane receptors and components from the kallikrein-kinin system. We propose that lack of the cytoplasmic domain of the GPIb receptor subunit alpha could be a strong contributor to the prolonged bleeding phenotype in birds. Combined analysis of laboratory assessments of avian hemostasis with the first avian phylogeny based on genomic-scale data revealed that differences in hemostasis within birds are not explained by phylogenetic relationships, but more so by genetic variation underlying components of the hemostatic process, suggestive of natural selection. This work adds to our understanding of the evolution of hemostasis in vertebrates. The overlap with the inflammation, complement and renin-angiotensin (blood pressure regulation) pathways is a potential driver of rapid molecular evolution in the hemostasis network. Comparisons between avian species and mammals allowed us to hypothesize that the observed mammalian innovations might have

  2. A variational analysis for large deflection of skew plates under ...

    African Journals Online (AJOL)

    In the present paper, the static behaviour of thin isotropic skew plates under uniformly distributed load is analyzed with the geometric nonlinearity of the model properly handled. A variational method based on total potential energy has been implemented through assumed displacement field. The computational work has ...

  3. A genome-wide association study demonstrates significant genetic variation for fracture risk in Thoroughbred racehorses

    Science.gov (United States)

    2014-01-01

    Background Thoroughbred racehorses are subject to non-traumatic distal limb bone fractures that occur during racing and exercise. Susceptibility to fracture may be due to underlying disturbances in bone metabolism which have a genetic cause. Fracture risk has been shown to be heritable in several species but this study is the first genetic analysis of fracture risk in the horse. Results Fracture cases (n = 269) were horses that sustained catastrophic distal limb fractures while racing on UK racecourses, necessitating euthanasia. Control horses (n = 253) were over 4 years of age, were racing during the same time period as the cases, and had no history of fracture at the time the study was carried out. The horses sampled were bred for both flat and National Hunt (NH) jump racing. 43,417 SNPs were employed to perform a genome-wide association analysis and to estimate the proportion of genetic variance attributable to the SNPs on each chromosome using restricted maximum likelihood (REML). Significant genetic variation associated with fracture risk was found on chromosomes 9, 18, 22 and 31. Three SNPs on chromosome 18 (62.05 Mb – 62.15 Mb) and one SNP on chromosome 1 (14.17 Mb) reached genome-wide significance (p fracture than cases, p = 1 × 10-4), while a second haplotype increases fracture risk (cases at 3.39 times higher risk of fracture than controls, p = 0.042). Conclusions Fracture risk in the Thoroughbred horse is a complex condition with an underlying genetic basis. Multiple genomic regions contribute to susceptibility to fracture risk. This suggests there is the potential to develop SNP-based estimators for genetic risk of fracture in the Thoroughbred racehorse, using methods pioneered in livestock genetics such as genomic selection. This information would be useful to racehorse breeders and owners, enabling them to reduce the risk of injury in their horses. PMID:24559379

  4. Extensive gene content variation in the Brachypodium distachyon pan-genome correlates with population structure.

    Science.gov (United States)

    Gordon, Sean P; Contreras-Moreira, Bruno; Woods, Daniel P; Des Marais, David L; Burgess, Diane; Shu, Shengqiang; Stritt, Christoph; Roulin, Anne C; Schackwitz, Wendy; Tyler, Ludmila; Martin, Joel; Lipzen, Anna; Dochy, Niklas; Phillips, Jeremy; Barry, Kerrie; Geuten, Koen; Budak, Hikmet; Juenger, Thomas E; Amasino, Richard; Caicedo, Ana L; Goodstein, David; Davidson, Patrick; Mur, Luis A J; Figueroa, Melania; Freeling, Michael; Catalan, Pilar; Vogel, John P

    2017-12-19

    While prokaryotic pan-genomes have been shown to contain many more genes than any individual organism, the prevalence and functional significance of differentially present genes in eukaryotes remains poorly understood. Whole-genome de novo assembly and annotation of 54 lines of the grass Brachypodium distachyon yield a pan-genome containing nearly twice the number of genes found in any individual genome. Genes present in all lines are enriched for essential biological functions, while genes present in only some lines are enriched for conditionally beneficial functions (e.g., defense and development), display faster evolutionary rates, lie closer to transposable elements and are less likely to be syntenic with orthologous genes in other grasses. Our data suggest that differentially present genes contribute substantially to phenotypic variation within a eukaryote species, these genes have a major influence in population genetics, and transposable elements play a key role in pan-genome evolution.

  5. Within-Host Variations of Human Papillomavirus Reveal APOBEC-Signature Mutagenesis in the Viral Genome.

    Science.gov (United States)

    Hirose, Yusuke; Onuki, Mamiko; Tenjimbayashi, Yuri; Mori, Seiichiro; Ishii, Yoshiyuki; Takeuchi, Takamasa; Tasaka, Nobutaka; Satoh, Toyomi; Morisada, Tohru; Iwata, Takashi; Miyamoto, Shingo; Matsumoto, Koji; Sekizawa, Akihiko; Kukimoto, Iwao

    2018-03-28

    Persistent infection with oncogenic human papillomaviruses (HPVs) causes cervical cancer, accompanied by the accumulation of somatic mutations in the host genome. There are concomitant genetic changes in the HPV genome during viral infection; however, their relevance to cervical carcinogenesis is poorly understood. Here we explored within-host genetic diversity of HPV by performing deep sequencing analyses of viral whole-genome sequences in clinical specimens. The whole genomes of HPV types 16, 52 and 58 were amplified by type-specific PCR from total cellular DNA of cervical exfoliated cells collected from patients with cervical intraepithelial neoplasia (CIN) and invasive cervical cancer (ICC), and were deep-sequenced. After constructing a reference viral genome sequence for each specimen, nucleotide positions showing changes with > 0.5% frequencies compared to the reference sequence were determined for individual samples. In total, 1,052 positions of nucleotide variations were detected in HPV genomes from 151 samples (CIN1, n = 56; CIN2/3, n = 68; ICC, n = 27), with varying numbers per sample. Overall, C-to-T and C-to-A substitutions were the dominant changes observed across all histological grades. While C-to-T transitions were predominantly detected in CIN1, their prevalence was decreased in CIN2/3 and fell below that of C-to-A transversions in ICC. Analysis of the trinucleotide context encompassing substituted bases revealed that TpCpN, a preferred target sequence for cellular APOBEC cytosine deaminases, was a primary site for C-to-T substitutions in the HPV genome. These results strongly imply that the APOBEC proteins are drivers of HPV genome mutation, particularly in CIN1 lesions. IMPORTANCE HPVs exhibit surprisingly high levels of genetic diversity, including a large repertoire of minor genomic variants in each viral genotype. Here, by conducting deep sequencing analyses, we show for the first time a comprehensive snapshot of the "within
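
The trinucleotide-context analysis described in this abstract can be sketched in a few lines. This is a hypothetical illustration, not the authors' pipeline: the function names are invented, positions are 0-based indices into a reference string, and only substitutions of a C on the given strand are handled (a G-to-A change, which corresponds to C-to-T on the opposite strand, would need reverse-complement handling that is omitted here).

```python
from collections import Counter


def trinucleotide_context(reference, pos):
    """Return the base at pos together with its 5' and 3' neighbours."""
    if pos < 1 or pos + 1 >= len(reference):
        raise ValueError("position needs a neighbour on both sides")
    return reference[pos - 1:pos + 2]


def is_tpcpn(context):
    """True when the substituted C is preceded by T (the TpCpN motif)."""
    return len(context) == 3 and context[:2] == "TC"


def tally_c_to_t_contexts(reference, positions):
    """Count trinucleotide contexts for C-to-T substitutions at the given positions."""
    contexts = Counter()
    for pos in positions:
        if reference[pos] != "C":
            raise ValueError("expected a reference C at position %d" % pos)
        contexts[trinucleotide_context(reference, pos)] += 1
    return contexts


if __name__ == "__main__":
    reference = "ATCGATCAGCT"
    tally = tally_c_to_t_contexts(reference, [2, 6, 9])
    tpcpn = sum(n for ctx, n in tally.items() if is_tpcpn(ctx))
    print(tally, tpcpn / sum(tally.values()))
```

An excess of TpCpN contexts among C-to-T changes, relative to how often TpCpN occurs in the reference, is the kind of pattern the abstract attributes to APOBEC activity.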

  6. Genomic structural variation contributes to phenotypic change of industrial bioethanol yeast Saccharomyces cerevisiae.

    Science.gov (United States)

    Zhang, Ke; Zhang, Li-Jie; Fang, Ya-Hong; Jin, Xin-Na; Qi, Lei; Wu, Xue-Chang; Zheng, Dao-Qiong

    2016-03-01

    Genomic structural variation (GSV) is a ubiquitous phenomenon observed in the genomes of Saccharomyces cerevisiae strains with different genetic backgrounds; however, the physiological and phenotypic effects of GSV are not well understood. Here, we first revealed the genetic characteristics of a widely used industrial S. cerevisiae strain, ZTW1, by whole genome sequencing. ZTW1 was identified as an aneuploid strain, and large-scale GSV was observed in the ZTW1 genome compared with the genome of a diploid strain, YJS329. These GSV events led to copy number variations (CNVs) in many chromosomal segments as well as one whole chromosome in the ZTW1 genome. Changes in the DNA dosage of certain functional genes directly affected their expression levels and the resultant ZTW1 phenotypes. Moreover, CNVs of large chromosomal regions triggered an aneuploidy stress in ZTW1. This stress decreased the proliferation ability and tolerance of ZTW1 to various stresses, while aneuploidy response stress may also provide some benefits to the fermentation performance of the yeast, including increased fermentation rates and decreased byproduct generation. This work reveals genomic characteristics of the bioethanol S. cerevisiae strain ZTW1 and suggests that GSV is an important kind of mutation that changes the traits of industrial S. cerevisiae strains. © FEMS 2016. All rights reserved.

  7. Genome size variation and incidence of polyploidy in Scrophulariaceae sensu lato from the Iberian Peninsula.

    Science.gov (United States)

    Castro, Mariana; Castro, Sílvia; Loureiro, João

    2012-01-01

    In the last decade, genomic studies using DNA markers have strongly influenced the current phylogeny of angiosperms. Genome size and ploidy level have contributed to this discussion, being considered important characters in biosystematics, ecology and population biology. Despite the recent increase in studies related to genome size evolution and polyploidy incidence, only a few are available for Scrophulariaceae. In this context, we assessed the value of genome size, mostly as a taxonomic marker, and the role of polyploidy as a process of genesis and maintenance of plant diversity in Scrophulariaceae sensu lato in the Iberian Peninsula. Large-scale analyses of genome size and ploidy-level variation across the Iberian Peninsula were performed using flow cytometry. One hundred and sixty-two populations of 59 distinct taxa were analysed. A bibliographic review on chromosome counts was also performed. Of the 59 sampled taxa, 51 represent first estimates of genome size. The majority of the Scrophulariaceae species presented very small to small genome sizes (2C ≤ 7.0 pg). Furthermore, in most of the analysed genera it was possible to use this character to separate several taxa, regardless of whether these genera were homoploid or heteroploid. Also, some genome-related phenomena were detected, such as intraspecific variation of genome size in some genera and the possible occurrence of dysploidy in Verbascum spp. With respect to polyploidy, despite a few new DNA ploidy levels having been detected in Veronica, no multiple cytotypes have been found in any taxa. This work contributes important basic scientific knowledge on genome size and polyploid incidence in the Scrophulariaceae, providing background information and opening several perspectives for future studies.

  8. Insights into the genome structure and copy-number variation of Eimeria tenella

    Directory of Open Access Journals (Sweden)

    Lim Lik-Sin

    2012-08-01

    Full Text Available Background: Eimeria is a genus of parasites in the same phylum (Apicomplexa) as human parasites such as Toxoplasma, Cryptosporidium and the malaria parasite Plasmodium. As an apicomplexan whose life-cycle involves a single host, Eimeria is a convenient model for understanding this group of organisms. Although the genomes of the Apicomplexa are diverse, that of Eimeria is unique in being composed of large alternating blocks of sequence with very different characteristics, an arrangement seen in no other organism. This arrangement has impeded efforts to fully sequence the genome of Eimeria, which remains the last of the major apicomplexans to be fully analyzed. In order to increase the value of the genome sequence data and aid in the effort to gain a better understanding of the Eimeria tenella genome, we constructed a whole genome map for the parasite. Results: A total of 1245 contigs representing 70.0% of the whole genome assembly sequences (Wellcome Trust Sanger Institute) were selected and subjected to marker selection. Subsequently, 2482 HAPPY markers were developed and typed. Of these, 795 were considered as usable markers, and utilized in the construction of a HAPPY map. Markers developed from chromosomally-assigned genes were then integrated into the HAPPY map and this aided the assignment of a number of linkage groups to their respective chromosomes. BAC-end sequences and contigs from whole genome sequencing were also integrated to improve and validate the HAPPY map. This resulted in an integrated HAPPY map consisting of 60 linkage groups that covers approximately half of the estimated 60 Mb genome. Further analysis suggests that the segmental organization first seen in Chromosome 1 is present throughout the genome, with repeat-poor (P) regions alternating with repeat-rich (R) regions. Evidence of copy-number variation between strains was also uncovered. Conclusions: This paper describes the application of a whole genome mapping

  9. Genomic variation in recently collected maize landraces from Mexico

    Directory of Open Access Journals (Sweden)

    María Clara Arteaga

    2016-03-01

    Full Text Available The present dataset comprises 36,931 SNPs genotyped in 46 maize landraces native to Mexico as well as the teosinte subspecies Zea mays ssp. parviglumis and ssp. mexicana. These landraces were collected directly from farmers mostly between 2006 and 2010. We accompany these data with a short description of the variation within each landrace, as well as maps, principal component analyses and neighbor joining trees showing the distribution of the genetic diversity relative to landrace, geographical features and maize biogeography. High levels of genetic variation were detected for the maize landraces (HE = 0.234 to 0.318; mean 0.311), while slightly lower levels were detected in Zea m. mexicana and Zea m. parviglumis (HE = 0.262 and 0.234, respectively). The distribution of genetic variation was better explained by environmental variables given by the interaction of altitude and latitude than by landrace identity. This dataset is a follow up product of the Global Native Maize Project, an initiative to update the data on Mexican maize landraces and their wild relatives, and to generate information that is necessary for implementing the Mexican Biosafety Law. Keywords: Maize, Teosinte, Maize SNP50K BeadChip, Mexican landraces, Proyecto Global de Maíces Nativos

  10. Genomic variation in recently collected maize landraces from Mexico

    Science.gov (United States)

    Arteaga, María Clara; Moreno-Letelier, Alejandra; Mastretta-Yanes, Alicia; Vázquez-Lobo, Alejandra; Breña-Ochoa, Alejandra; Moreno-Estrada, Andrés; Eguiarte, Luis E.; Piñero, Daniel

    2015-01-01

    The present dataset comprises 36,931 SNPs genotyped in 46 maize landraces native to Mexico as well as the teosinte subspecies Zea mays ssp. parviglumis and ssp. mexicana. These landraces were collected directly from farmers mostly between 2006 and 2010. We accompany these data with a short description of the variation within each landrace, as well as maps, principal component analyses and neighbor joining trees showing the distribution of the genetic diversity relative to landrace, geographical features and maize biogeography. High levels of genetic variation were detected for the maize landraces (HE = 0.234 to 0.318; mean 0.311), while slightly lower levels were detected in Zea m. mexicana and Zea m. parviglumis (HE = 0.262 and 0.234, respectively). The distribution of genetic variation was better explained by environmental variables given by the interaction of altitude and latitude than by landrace identity. This dataset is a follow up product of the Global Native Maize Project, an initiative to update the data on Mexican maize landraces and their wild relatives, and to generate information that is necessary for implementing the Mexican Biosafety Law. PMID:26981357
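
    The expected heterozygosity (HE) values quoted in this record can be illustrated with a small sketch. It assumes the standard per-locus definition HE = 1 - sum(p_i^2), averaged over loci; the abstract does not say which estimator the authors used, and the allele frequencies below are hypothetical, not taken from the dataset.

```python
def expected_heterozygosity(allele_freqs):
    """Per-locus expected heterozygosity: H_E = 1 - sum(p_i^2).

    allele_freqs: allele frequencies at one locus; they must sum to 1.
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    return 1.0 - sum(p * p for p in allele_freqs)


def mean_expected_heterozygosity(loci):
    """Average the per-locus H_E over a non-empty list of loci."""
    if not loci:
        raise ValueError("at least one locus is required")
    return sum(expected_heterozygosity(freqs) for freqs in loci) / len(loci)


# Hypothetical biallelic SNP frequencies (illustration only).
toy_loci = [(0.8, 0.2), (0.5, 0.5), (0.9, 0.1)]
print(round(mean_expected_heterozygosity(toy_loci), 3))
```

    For a biallelic SNP the formula reduces to 2p(1 - p), so the maximum per-locus value is 0.5 at p = 0.5.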

  11. Comparison of variations detection between whole-genome amplification methods used in single-cell resequencing

    DEFF Research Database (Denmark)

    Hou, Yong; Wu, Kui; Shi, Xulian

    2015-01-01

    methods, focusing particularly on variations detection. Low-coverage whole-genome sequencing revealed that DOP-PCR had the highest duplication ratio, but an even read distribution and the best reproducibility and accuracy for detection of copy-number variations (CNVs). However, MDA had significantly...... performance using SCRS amplified by different WGA methods. It will guide researchers to determine which WGA method is best suited to individual experimental needs at single-cell level....

  12. Identifying tagging SNPs for African specific genetic variation from the African Diaspora Genome.

    Science.gov (United States)

    Johnston, Henry Richard; Hu, Yi-Juan; Gao, Jingjing; O'Connor, Timothy D; Abecasis, Gonçalo R; Wojcik, Genevieve L; Gignoux, Christopher R; Gourraud, Pierre-Antoine; Lizee, Antoine; Hansen, Mark; Genuario, Rob; Bullis, Dave; Lawley, Cindy; Kenny, Eimear E; Bustamante, Carlos; Beaty, Terri H; Mathias, Rasika A; Barnes, Kathleen C; Qin, Zhaohui S

    2017-04-21

    A primary goal of The Consortium on Asthma among African-ancestry Populations in the Americas (CAAPA) is to develop an 'African Diaspora Power Chip' (ADPC), a genotyping array consisting of tagging SNPs, useful in comprehensively identifying African specific genetic variation. This array is designed based on the novel variation identified in 642 CAAPA samples of African ancestry with high coverage whole genome sequence data (~30× depth). This novel variation extends the pattern of variation catalogued in the 1000 Genomes and Exome Sequencing Projects to a spectrum of populations representing the wide range of West African genomic diversity. These individuals from CAAPA also comprise a large swath of the African Diaspora population and incorporate historical genetic diversity covering nearly the entire Atlantic coast of the Americas. Here we show the results of designing and producing such a microchip array. This novel array covers African specific variation far better than other commercially available arrays, and will enable better GWAS analyses for researchers with individuals of African descent in their study populations. A recent study cataloging variation in continental African populations suggests this type of African-specific genotyping array is both necessary and valuable for facilitating large-scale GWAS in populations of African ancestry.

  13. Relationship between Deleterious Variation, Genomic Autozygosity, and Disease Risk: Insights from The 1000 Genomes Project.

    Science.gov (United States)

    Pemberton, Trevor J; Szpiech, Zachary A

    2018-04-05

    Genomic regions of autozygosity (ROAs) represent segments of individual genomes that are homozygous for haplotypes inherited identical-by-descent (IBD) from a common ancestor. ROAs are nonuniformly distributed across the genome, and increased ROA levels are a reported risk factor for numerous complex diseases. Previously, we hypothesized that long ROAs are enriched for deleterious homozygotes as a result of young haplotypes with recent deleterious mutations (relatively untouched by purifying selection) being paired IBD as a consequence of recent parental relatedness, a pattern supported by ROA and whole-exome sequence data on 27 individuals. Here, we significantly bolster support for our hypothesis and expand upon our original analyses using ROA and whole-genome sequence data on 2,436 individuals from The 1000 Genomes Project. Considering CADD deleteriousness scores, we reaffirm our previous observation that long ROAs are enriched for damaging homozygotes worldwide. We show that strongly damaging homozygotes experience greater enrichment than weaker damaging homozygotes, while overall enrichment varies appreciably among populations. Mendelian disease genes and those encoding FDA-approved drug targets have significantly increased rates of gain in damaging homozygotes with increasing ROA coverage relative to all other genes. In genes implicated in eight complex phenotypes for which ROA levels have been identified as a risk factor, rates of gain in damaging homozygotes vary across phenotypes and populations but frequently differ significantly from non-disease genes. These findings highlight the potential confounding effects of population background in the assessment of associations between ROA levels and complex disease risk, which might underlie reported inconsistencies in ROA-phenotype associations. Copyright © 2018 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  14. Genomic variation among populations of threatened coral: Acropora cervicornis.

    Science.gov (United States)

    Drury, C; Dale, K E; Panlilio, J M; Miller, S V; Lirman, D; Larson, E A; Bartels, E; Crawford, D L; Oleksiak, M F

    2016-04-13

    Acropora cervicornis, a threatened, keystone reef-building coral has undergone severe declines (>90 %) throughout the Caribbean. These declines could reduce genetic variation and thus hamper the species' ability to adapt. Active restoration strategies are a common conservation approach to mitigate species' declines and require genetic data on surviving populations to efficiently respond to declines while maintaining the genetic diversity needed to adapt to changing conditions. To evaluate active restoration strategies for the staghorn coral, the genetic diversity of A. cervicornis within and among populations was assessed in 77 individuals collected from 68 locations along the Florida Reef Tract (FRT) and in the Dominican Republic. Genotyping by Sequencing (GBS) identified 4,764 single nucleotide polymorphisms (SNPs). Pairwise nucleotide differences (π) within a population are large (~37 %) and similar to π across all individuals. This high level of genetic diversity along the FRT is similar to the diversity within a small, isolated reef. Much of the genetic diversity (>90 %) exists within a population, yet GBS analysis shows significant variation along the FRT, including 300 SNPs with significant FST values and significant divergence relative to distance. There are also significant differences in SNP allele frequencies over small spatial scales, exemplified by the large FST values among corals collected within Miami-Dade county. Large standing diversity was found within each population even after recent declines in abundance, including significant, potentially adaptive divergence over short distances. The data here inform conservation and management actions by uncovering population structure and high levels of diversity maintained within coral collections among sites previously shown to have little genetic divergence. More broadly, this approach demonstrates the power of GBS to resolve differences among individuals and identify subtle genetic structure

  15. Phylogeny, rate variation, and genome size evolution of Pelargonium (Geraniaceae).

    Science.gov (United States)

    Weng, Mao-Lun; Ruhlman, Tracey A; Gibby, Mary; Jansen, Robert K

    2012-09-01

    The phylogeny of 58 Pelargonium species was estimated using five plastid markers (rbcL, matK, ndhF, rpoC1, trnL-F) and one mitochondrial gene (nad5). The results confirmed the monophyly of three major clades and four subclades within Pelargonium but also indicate the need to revise some sectional classifications. This phylogeny was used to examine karyotype evolution in the genus: plotting chromosome sizes, numbers and 2C-values indicates that genome size is significantly correlated with chromosome size but not number. Accelerated rates of nucleotide substitution have been previously detected in both plastid and mitochondrial genes in Pelargonium, but sparse taxon sampling did not enable identification of the phylogenetic distribution of these elevated rates. Using the multigene phylogeny as a constraint, we investigated lineage- and locus-specific heterogeneity of substitution rates in Pelargonium for an expanded number of taxa and demonstrated that both plastid and mitochondrial genes have had accelerated substitution rates but with markedly disparate patterns. In the plastid, the exons of rpoC1 have significantly accelerated substitution rates compared to its intron and the acceleration was mainly due to nonsynonymous substitutions. In contrast, the mitochondrial gene, nad5, experienced substantial acceleration of synonymous substitution rates in three internal branches of Pelargonium, but this acceleration ceased in all terminal branches. Several lineages also have dN/dS ratios significantly greater than one for rpoC1, indicating that positive selection is acting on this gene, whereas the accelerated synonymous substitutions in the mitochondrial gene are the result of elevated mutation rates. Published by Elsevier Inc.

  16. Natural selection affects multiple aspects of genetic variation at putatively neutral sites across the human genome

    DEFF Research Database (Denmark)

    Lohmueller, Kirk E; Albrechtsen, Anders; Li, Yingrui

    2011-01-01

    A major question in evolutionary biology is how natural selection has shaped patterns of genetic variation across the human genome. Previous work has documented a reduction in genetic diversity in regions of the genome with low recombination rates. However, it is unclear whether other summaries...... these questions by analyzing three different genome-wide resequencing datasets from European individuals. We document several significant correlations between different genomic features. In particular, we find that average minor allele frequency and diversity are reduced in regions of low recombination...... and that human diversity, human-chimp divergence, and average minor allele frequency are reduced near genes. Population genetic simulations show that either positive natural selection acting on favorable mutations or negative natural selection acting against deleterious mutations can explain these correlations...

  17. Extreme Recombination Frequencies Shape Genome Variation and Evolution in the Honeybee, Apis mellifera

    Science.gov (United States)

    Wallberg, Andreas; Glémin, Sylvain; Webster, Matthew T.

    2015-01-01

    Meiotic recombination is a fundamental cellular process, with important consequences for evolution and genome integrity. However, we know little about how recombination rates vary across the genomes of most species and the molecular and evolutionary determinants of this variation. The honeybee, Apis mellifera, has extremely high rates of meiotic recombination, although the evolutionary causes and consequences of this are unclear. Here we use patterns of linkage disequilibrium in whole genome resequencing data from 30 diploid honeybees to construct a fine-scale map of rates of crossing over in the genome. We find that, in contrast to vertebrate genomes, the recombination landscape is not strongly punctate. Crossover rates strongly correlate with levels of genetic variation, but not divergence, which indicates a pervasive impact of selection on the genome. Germ-line methylated genes have reduced crossover rate, which could indicate a role of methylation in suppressing recombination. Controlling for the effects of methylation, we do not infer a strong association between gene expression patterns and recombination. The site frequency spectrum is strongly skewed from neutral expectations in honeybees: rare variants are dominated by AT-biased mutations, whereas GC-biased mutations are found at higher frequencies, indicative of a major influence of GC-biased gene conversion (gBGC), which we infer to generate an allele fixation bias 5 – 50 times the genomic average estimated in humans. We uncover further evidence that this repair bias specifically affects transitions and favours fixation of CpG sites. Recombination, via gBGC, therefore appears to have profound consequences on genome evolution in honeybees and interferes with the process of natural selection. These findings have important implications for our understanding of the forces driving molecular evolution. PMID:25902173

  18. Genome size variation among and within Camellia species by using flow cytometric analysis.

    Directory of Open Access Journals (Sweden)

    Hui Huang

    Full Text Available BACKGROUND: The genus Camellia, belonging to the family Theaceae, is an economically important group of flowering plants. Frequent interspecific hybridization together with polyploidization has made its members taxonomically "difficult taxa". The DNA content is often used to measure genome size variation and has largely advanced our understanding of plant evolution and genome variation. The goals of this study were to investigate patterns of interspecific and intraspecific variation of DNA contents and further explore genome size evolution in a phylogenetic context of the genus. METHODOLOGY/PRINCIPAL FINDINGS: The DNA amount in the genus was determined by using propidium iodide flow cytometry analysis for a total of 139 individual plants representing almost all sections of the two subgenera, Camellia and Thea. An improved WPB buffer proved suitable for the Camellia species; it counteracted the negative effects of secondary metabolites and generated high-quality results with low coefficient of variation values (CV <5%). Our results showed trivial effects of different tissues (flowers, leaves and buds) as well as cytosolic compounds on the estimation of DNA amount. The DNA content of C. sinensis var. assamica was estimated to be 1C = 3.01 pg by flow cytometric analysis, which is equal to a genome size of about 2940 Mb. CONCLUSION: Intraspecific and interspecific variations were observed in the genus Camellia, and as expected, the latter was larger than the former. Our study suggests a directional trend of increasing genome size in the genus Camellia, probably owing to frequent polyploidization events.
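
    The conversion from 1C = 3.01 pg to roughly 2940 Mb can be reproduced with a short sketch. It assumes the commonly used factor of 1 pg DNA ≈ 978 Mb; the abstract does not state which factor the authors applied, so the constant below is an assumption.

```python
# Assumed conversion factor: 1 pg of DNA corresponds to about 978 Mb.
# The abstract does not name the factor actually used by the authors.
PG_TO_MB = 978.0


def pg_to_mb(dna_content_pg):
    """Convert a DNA content in picograms to a genome size in megabases."""
    if dna_content_pg < 0:
        raise ValueError("DNA content cannot be negative")
    return dna_content_pg * PG_TO_MB


size_mb = pg_to_mb(3.01)      # 1C value reported for C. sinensis var. assamica
print(round(size_mb, -1))     # rounded to the nearest 10 Mb
```

    With this factor the 1C value works out to about 2944 Mb, consistent with the "about 2940 Mb" quoted in the abstract.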

  19. A haplotype map of genomic variations and genome-wide association studies of agronomic traits in foxtail millet (Setaria italica).

    Science.gov (United States)

    Jia, Guanqing; Huang, Xuehui; Zhi, Hui; Zhao, Yan; Zhao, Qiang; Li, Wenjun; Chai, Yang; Yang, Lifang; Liu, Kunyan; Lu, Hengyun; Zhu, Chuanrang; Lu, Yiqi; Zhou, Congcong; Fan, Danlin; Weng, Qijun; Guo, Yunli; Huang, Tao; Zhang, Lei; Lu, Tingting; Feng, Qi; Hao, Hangfei; Liu, Hongkuan; Lu, Ping; Zhang, Ning; Li, Yuhui; Guo, Erhu; Wang, Shujun; Wang, Suying; Liu, Jinrong; Zhang, Wenfei; Chen, Guoqiu; Zhang, Baojin; Li, Wei; Wang, Yongfang; Li, Haiquan; Zhao, Baohua; Li, Jiayang; Diao, Xianmin; Han, Bin

    2013-08-01

    Foxtail millet (Setaria italica) is an important grain crop that is grown in arid regions. Here we sequenced 916 diverse foxtail millet varieties, identified 2.58 million SNPs and used 0.8 million common SNPs to construct a haplotype map of the foxtail millet genome. We classified the foxtail millet varieties into two divergent groups that are strongly correlated with early and late flowering times. We phenotyped the 916 varieties under five different environments and identified 512 loci associated with 47 agronomic traits by genome-wide association studies. We performed a de novo assembly of deeply sequenced genomes of a Setaria viridis accession (the wild progenitor of S. italica) and an S. italica variety and identified complex interspecies and intraspecies variants. We also identified 36 selective sweeps that seem to have occurred during modern breeding. This study provides fundamental resources for genetics research and genetic improvement in foxtail millet.

  20. Genic intolerance to functional variation and the interpretation of personal genomes.

    Directory of Open Access Journals (Sweden)

    Slavé Petrovski

    Full Text Available A central challenge in interpreting personal genomes is determining which mutations most likely influence disease. Although progress has been made in scoring the functional impact of individual mutations, the characteristics of the genes in which those mutations are found remain largely unexplored. For example, genes known to carry few common functional variants in healthy individuals may be judged more likely to cause certain kinds of disease than genes known to carry many such variants. Until now, however, it has not been possible to develop a quantitative assessment of how well genes tolerate functional genetic variation on a genome-wide scale. Here we describe an effort that uses sequence data from 6503 whole exome sequences made available by the NHLBI Exome Sequencing Project (ESP). Specifically, we develop an intolerance scoring system that assesses whether genes have relatively more or less functional genetic variation than expected based on the apparently neutral variation found in the gene. To illustrate the utility of this intolerance score, we show that genes responsible for Mendelian diseases are significantly more intolerant to functional genetic variation than genes that do not cause any known disease, but with striking variation in intolerance among genes causing different classes of genetic disease. We conclude by showing that use of an intolerance ranking system can aid in interpreting personal genomes and identifying pathogenic mutations.

  1. Variation, Evolution, and Correlation Analysis of C+G Content and Genome or Chromosome Size in Different Kingdoms and Phyla

    Science.gov (United States)

    Li, Xiu-Qing; Du, Donglei

    2014-01-01

    C+G content (GC content or G+C content) is known to be correlated with genome/chromosome size in bacteria but the relationship for other kingdoms remains unclear. This study analyzed genome size, chromosome size, and base composition in most of the available sequenced genomes in various kingdoms. Genome size tends to increase during evolution in plants and animals, and the same is likely true for bacteria. The genomic C+G contents were found to vary greatly in microorganisms but were quite similar within each animal or plant subkingdom. In animals and plants, the C+G contents are ranked as follows: monocot plants>mammals>non-mammalian animals>dicot plants. The variation in C+G content between chromosomes within species is greater in animals than in plants. The correlation between average chromosome C+G content and chromosome length was found to be positive in Proteobacteria, Actinobacteria (but not in other analyzed bacterial phyla), Ascomycota fungi, and likely also in some plants; negative in some animals, insignificant in two protist phyla, and likely very weak in Archaea. Clearly, correlations between C+G content and chromosome size can be positive, negative, or not significant depending on the kingdoms/groups or species. Different phyla or species exhibit different patterns of correlation between chromosome-size and C+G content. Most chromosomes within a species have a similar pattern of variation in C+G content but outliers are common. The data presented in this study suggest that the C+G content is under genetic control by both trans- and cis- factors and that the correlation between C+G content and chromosome length can be positive, negative, or not significant in different phyla. PMID:24551092
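
    The kind of analysis described in this record, relating per-chromosome C+G content to chromosome length, can be sketched as follows. This is a minimal illustration assuming a plain Pearson correlation; the abstract does not specify the statistic used, and the toy sequences are hypothetical.

```python
from math import sqrt


def gc_content(seq):
    """Fraction of G and C among unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return (seq.count("G") + seq.count("C")) / counted


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length samples."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two samples of equal length >= 2")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return sxy / sqrt(sxx * syy)


# Hypothetical toy "chromosomes" (illustration only).
toy_chromosomes = {"chrA": "ATGCGC", "chrB": "ATATGCAT", "chrC": "ATATATATGC"}
lengths = [len(s) for s in toy_chromosomes.values()]
gc_values = [gc_content(s) for s in toy_chromosomes.values()]
r = pearson_r(lengths, gc_values)
print(f"r = {r:.3f}")
```

    A negative r in this toy set corresponds to the "negative" pattern the abstract reports for some animals; real analyses would also need a significance test.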

  2. In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

    Science.gov (United States)

    Macas, Jiří; Novák, Petr; Pellicer, Jaume; Čížková, Jana; Koblížková, Andrea; Neumann, Pavel; Fuková, Iva; Doležel, Jaroslav; Kelly, Laura J; Leitch, Ilia J

    2015-01-01

    The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.

  3. In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

    Directory of Open Access Journals (Sweden)

    Jiří Macas

    Full Text Available The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.

  4. Genomic Diversity Using Copy Number Variations in Worldwide Chicken Populations

    Directory of Open Access Journals (Sweden)

    Erica Gorla

    2018-06-01

    Full Text Available Recently, many studies in livestock have focused on the identification of Copy Number Variants (CNVs) using high-density Single Nucleotide Polymorphism (SNP) arrays, but few have focused on studying chicken ecotypes coming from many locations. CNVs are polymorphisms, which may influence phenotype and are an important source of genetic variation in populations. The aim of this study was to explore the genetic difference and structure, using a high density SNP chip in 936 individuals from seven different countries (Brazil, Italy, Egypt, Mexico, Rwanda, Sri Lanka and Uganda). The DNA was genotyped with the Affymetrix Axiom® 600k Chicken Genotyping Array and processed with stringent quality controls to obtain 559,201 SNPs in 915 individuals. The Log R Ratio (LRR) and the B Allele Frequency of SNPs were used to perform the CNV calling with PennCNV software based on a Hidden Markov Model analysis, and the LRR was used to perform CNV detection with SVS Golden Helix software. After filtering, a total of 19,027 CNVs were detected with the SVS software, while 9,065 CNVs were identified with the PennCNV software. The CNVs were summarized in 7,001 Copy Number Variant Regions (CNVRs) and 4,414 CNVRs, using the software BedTool. The consensus analysis across the CNVRs allowed the identification of 2,820 consensus CNVR, of which 1,721 were gain, 637 loss and 462 complex, for a total length of 53 Mb corresponding to 5% of the GalGal5 chicken autosomes. Only the consensus CNV regions obtained from both detections were considered for further analysis. The intersection analysis performed between the chicken gene database (Gallus_gallus-5.0) and the 1,927 consensus CNVRs allowed the identification (within or partial overlap) of a total of 2,354 unique genes with an official gene ID. The CNVRs identified here represent the first comprehensive mapping in several worldwide populations, using a high-density SNP chip.

  5. A Perfect Match Genomic Landscape Provides a Unified Framework for the Precise Detection of Variation in Natural and Synthetic Haploid Genomes.

    Science.gov (United States)

    Palacios-Flores, Kim; García-Sotelo, Jair; Castillo, Alejandra; Uribe, Carina; Aguilar, Luis; Morales, Lucía; Gómez-Romero, Laura; Reyes, José; Garciarubio, Alejandro; Boege, Margareta; Dávila, Guillermo

    2018-04-01

    We present a conceptually simple, sensitive, precise, and essentially nonstatistical solution for the analysis of genome variation in haploid organisms. The generation of a Perfect Match Genomic Landscape (PMGL), which computes intergenome identity with single nucleotide resolution, reveals signatures of variation wherever a query genome differs from a reference genome. Such signatures encode the precise location of different types of variants, including single nucleotide variants, deletions, insertions, and amplifications, effectively introducing the concept of a general signature of variation. The precise nature of variants is then resolved through the generation of targeted alignments between specific sets of sequence reads and known regions of the reference genome. Thus, the perfect match logic decouples the identification of the location of variants from the characterization of their nature, providing a unified framework for the detection of genome variation. We assessed the performance of the PMGL strategy via simulation experiments. We determined the variation profiles of natural genomes and of a synthetic chromosome, both in the context of haploid yeast strains. Our approach uncovered variants that have previously escaped detection. Moreover, our strategy is ideally suited for further refining high-quality reference genomes. The source codes for the automated PMGL pipeline have been deposited in a public repository. Copyright © 2018 by the Genetics Society of America.
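
    The perfect-match idea described above can be illustrated with a toy sketch. This is not the authors' PMGL pipeline: the function names, the k-mer length and the single-read input are assumptions chosen for illustration. Each reference position is scored by how many times the k-mer starting there occurs exactly in the reads, and a run of zeros marks a signature of variation.

```python
from collections import Counter

def kmer_counts(reads, k):
    """Count every k-mer that occurs in the reads (hypothetical helper)."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts

def perfect_match_landscape(reference, reads, k):
    """For each reference position, count exact matches of its k-mer in the reads."""
    counts = kmer_counts(reads, k)
    return [counts[reference[i:i + k]] for i in range(len(reference) - k + 1)]

def zero_runs(landscape):
    """Return (start, end) index pairs of consecutive zero-coverage positions."""
    runs, start = [], None
    for i, depth in enumerate(landscape):
        if depth == 0 and start is None:
            start = i
        elif depth != 0 and start is not None:
            runs.append((start, i - 1))
            start = None
    if start is not None:
        runs.append((start, len(landscape) - 1))
    return runs

# Toy data: the query differs from the reference by one substitution at index 10.
reference = "ACGTTGCAAGGCTTACGGAT"
query_reads = ["ACGTTGCAAGTCTTACGGAT"]
landscape = perfect_match_landscape(reference, query_reads, k=5)
signatures = zero_runs(landscape)
```

    In this toy case the zero run spans the k reference k-mers that overlap the substituted base, so the last index of the run coincides with the variant position. Resolving the nature of the variant, which the abstract assigns to targeted alignments, is not attempted here.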

  6. High-confidence assessment of functional impact of human mitochondrial non-synonymous genome variations by APOGEE.

    Directory of Open Access Journals (Sweden)

    Stefano Castellana

    2017-06-01

    Full Text Available 24,189 are all the possible non-synonymous amino acid changes potentially affecting the human mitochondrial DNA. Only a tiny subset was functionally evaluated with certainty so far, while the pathogenicity of the vast majority was only assessed in-silico by software predictors. Since these tools proved to be rather incongruent, we have designed and implemented APOGEE, a machine-learning algorithm that outperforms all existing prediction methods in estimating the harmfulness of mitochondrial non-synonymous genome variations. We provide a detailed description of the underlying algorithm, of the selected and manually curated training and test sets of variants, as well as of its classification ability.

  7. Introgression of a Block of Genome Under Infinitesimal Selection.

    Science.gov (United States)

    Sachdeva, Himani; Barton, Nicholas H

    2018-06-12

    Adaptive introgression is common in nature and can be driven by selection acting on multiple, linked genes. We explore the effects of polygenic selection on introgression under the infinitesimal model with linkage. This model assumes that the introgressing block has an effectively infinite number of loci, each with an infinitesimal effect on the trait under selection. The block is assumed to introgress under directional selection within a native population that is genetically homogeneous. We use individual-based simulations and a branching process approximation to compute various statistics of the introgressing block, and explore how these depend on parameters such as the map length and initial trait value associated with the introgressing block, the genetic variability along the block, and the strength of selection. Our results show that the introgression dynamics of a block under infinitesimal selection are qualitatively different from the dynamics of neutral introgression. We also find that in the long run, surviving descendant blocks are likely to have intermediate lengths, and clarify how their length is shaped by the interplay between linkage and infinitesimal selection. Our results suggest that it may be difficult to distinguish the long-term introgression of a block of genome with a single strongly selected locus from the introgression of a block with multiple, tightly linked and weakly selected loci. Copyright © 2018, Genetics.

  8. Limits of variation, specific infectivity, and genome packaging of massively recoded poliovirus genomes.

    Science.gov (United States)

    Song, Yutong; Gorbatsevych, Oleksandr; Liu, Ying; Mugavero, JoAnn; Shen, Sam H; Ward, Charles B; Asare, Emmanuel; Jiang, Ping; Paul, Aniko V; Mueller, Steffen; Wimmer, Eckard

    2017-10-10

    Computer design and chemical synthesis generated viable variants of poliovirus type 1 (PV1), whose ORF (6,189 nucleotides) carried up to 1,297 "Max" mutations (excess of overrepresented synonymous codon pairs) or up to 2,104 "SD" mutations (randomly scrambled synonymous codons). "Min" variants (excess of underrepresented synonymous codon pairs) are nonviable except for P2 Min, a variant temperature-sensitive at 33 and 39.5 °C. Compared with WT PV1, P2 Min displayed a vastly reduced specific infectivity (si) (WT, 1 PFU/118 particles vs. P2 Min, 1 PFU/35,000 particles), a phenotype that will be discussed broadly. Si of haploid PV presents cellular infectivity of a single genotype. We performed a comprehensive analysis of sequence and structures of the PV genome to determine if evolutionary conserved cis-acting packaging signal(s) were preserved after recoding. We showed that conserved synonymous sites and/or local secondary structures that might play a role in determining packaging specificity do not survive codon pair recoding. This makes it unlikely that numerous "cryptic, sequence-degenerate, dispersed RNA packaging signals mapping along the entire viral genome" [Patel N, et al. (2017) Nat Microbiol 2:17098] play the critical role in poliovirus packaging specificity. Considering all available evidence, we propose a two-step assembly strategy for +ssRNA viruses: step I, acquisition of packaging specificity, either (a) by specific recognition between capsid protein(s) and replication proteins (poliovirus), or (b) by the high affinity interaction of a single RNA packaging signal (PS) with capsid protein(s) (most +ssRNA viruses so far studied); step II, cocondensation of genome/capsid precursors in which an array of hairpin structures plays a role in virion formation.

  9. Parameter choice in Banach space regularization under variational inequalities

    International Nuclear Information System (INIS)

    Hofmann, Bernd; Mathé, Peter

    2012-01-01

    The authors study parameter choice strategies for the Tikhonov regularization of nonlinear ill-posed problems in Banach spaces. The effectiveness of any parameter choice for obtaining convergence rates depends on the interplay of the solution smoothness and the nonlinearity structure, and it can be expressed concisely in terms of variational inequalities. Such inequalities are link conditions between the penalty term, the norm misfit and the corresponding error measure. The parameter choices under consideration include an a priori choice, the discrepancy principle as well as the Lepskii principle. For the convenience of the reader, the authors review in an appendix a few instances where the validity of a variational inequality can be established. (paper)
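
    As a rough illustration of one of the parameter choices named above, the sketch below applies the discrepancy principle to Tikhonov regularization. It assumes a much simpler setting than the paper's: a linear, diagonal operator with a quadratic penalty, not a nonlinear problem in Banach spaces. The function names, the geometric grid of parameters and the constant tau are illustrative assumptions.

```python
import math

def tikhonov_diagonal(singular_values, data, alpha):
    """Minimizer of sum_i (s_i*x_i - y_i)^2 + alpha*sum_i x_i^2 for a diagonal operator."""
    return [s * y / (s * s + alpha) for s, y in zip(singular_values, data)]

def residual_norm(singular_values, data, x):
    """Euclidean norm of the misfit A x - y for the diagonal operator."""
    return math.sqrt(sum((s * xi - y) ** 2
                         for s, xi, y in zip(singular_values, x, data)))

def discrepancy_alpha(singular_values, data, delta,
                      tau=1.5, alpha0=1.0, q=0.5, max_steps=200):
    """Walk down the grid alpha0 * q**k and stop at the first alpha
    whose misfit is at most tau * delta (discrepancy principle)."""
    alpha = alpha0
    for _ in range(max_steps):
        x = tikhonov_diagonal(singular_values, data, alpha)
        if residual_norm(singular_values, data, x) <= tau * delta:
            return alpha, x
        alpha *= q
    raise RuntimeError("no alpha on the grid satisfied the discrepancy principle")

# Toy data: exact data [1.0, 0.5, 0.1] perturbed by noise of size delta.
singular_values = [1.0, 0.5, 0.1]
noisy_data = [1.0, 0.5, 0.11]
delta = 0.01
alpha, x_alpha = discrepancy_alpha(singular_values, noisy_data, delta)
```

    The misfit grows monotonically with alpha in this diagonal setting, so the first grid value that meets the tolerance is also the largest one that does.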

  10. A genome-wide, fine-scale map of natural pigmentation variation in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Héloïse Bastide

    2013-06-01

    Full Text Available Various approaches can be applied to uncover the genetic basis of natural phenotypic variation, each with their specific strengths and limitations. Here, we use a replicated genome-wide association approach (Pool-GWAS) to fine-scale map genomic regions contributing to natural variation in female abdominal pigmentation in Drosophila melanogaster, a trait that is highly variable in natural populations and highly heritable in the laboratory. We examined abdominal pigmentation phenotypes in approximately 8000 female European D. melanogaster, isolating 1000 individuals with extreme phenotypes. We then used whole-genome Illumina sequencing to identify single nucleotide polymorphisms (SNPs) segregating in our sample, and tested these for associations with pigmentation by contrasting allele frequencies between replicate pools of light and dark individuals. We identify two small regions near the pigmentation genes tan and bric-à-brac 1, both corresponding to known cis-regulatory regions, which contain SNPs showing significant associations with pigmentation variation. While the Pool-GWAS approach suffers some limitations, its cost advantage facilitates replication and it can be applied to any non-model system with an available reference genome.

  11. A genome-wide, fine-scale map of natural pigmentation variation in Drosophila melanogaster.

    Science.gov (United States)

    Bastide, Héloïse; Betancourt, Andrea; Nolte, Viola; Tobler, Raymond; Stöbe, Petra; Futschik, Andreas; Schlötterer, Christian

    2013-06-01

    Various approaches can be applied to uncover the genetic basis of natural phenotypic variation, each with their specific strengths and limitations. Here, we use a replicated genome-wide association approach (Pool-GWAS) to fine-scale map genomic regions contributing to natural variation in female abdominal pigmentation in Drosophila melanogaster, a trait that is highly variable in natural populations and highly heritable in the laboratory. We examined abdominal pigmentation phenotypes in approximately 8000 female European D. melanogaster, isolating 1000 individuals with extreme phenotypes. We then used whole-genome Illumina sequencing to identify single nucleotide polymorphisms (SNPs) segregating in our sample, and tested these for associations with pigmentation by contrasting allele frequencies between replicate pools of light and dark individuals. We identify two small regions near the pigmentation genes tan and bric-à-brac 1, both corresponding to known cis-regulatory regions, which contain SNPs showing significant associations with pigmentation variation. While the Pool-GWAS approach suffers some limitations, its cost advantage facilitates replication and it can be applied to any non-model system with an available reference genome.
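
    The contrast of allele frequencies between replicate pools can be sketched as follows. The abstract does not state which test statistic was used, so this sketch reports only the mean frequency difference across replicates. The function names and read counts are hypothetical.

```python
def allele_frequency(alt_reads, depth):
    """Fraction of reads at a SNP that carry the alternative allele."""
    if depth <= 0:
        raise ValueError("read depth must be positive")
    return alt_reads / depth

def mean_frequency_difference(light_pools, dark_pools):
    """Average (dark - light) allele-frequency difference over paired replicates.

    Each argument is a list of (alt_reads, depth) tuples, one per replicate pool.
    """
    if len(light_pools) != len(dark_pools):
        raise ValueError("replicates must be paired")
    diffs = [allele_frequency(*dark) - allele_frequency(*light)
             for light, dark in zip(light_pools, dark_pools)]
    return sum(diffs) / len(diffs)

# Hypothetical read counts at one SNP in two replicate pool pairs.
light = [(10, 100), (20, 100)]
dark = [(60, 100), (70, 100)]
effect = mean_frequency_difference(light, dark)  # roughly 0.5
```

    A consistent difference across replicates is what makes a SNP a candidate. A real analysis would add a significance test that accounts for sampling of both individuals and reads.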

  12. Natural selection affects multiple aspects of genetic variation at putatively neutral sites across the human genome.

    Science.gov (United States)

    Lohmueller, Kirk E; Albrechtsen, Anders; Li, Yingrui; Kim, Su Yeon; Korneliussen, Thorfinn; Vinckenbosch, Nicolas; Tian, Geng; Huerta-Sanchez, Emilia; Feder, Alison F; Grarup, Niels; Jørgensen, Torben; Jiang, Tao; Witte, Daniel R; Sandbæk, Annelli; Hellmann, Ines; Lauritzen, Torsten; Hansen, Torben; Pedersen, Oluf; Wang, Jun; Nielsen, Rasmus

    2011-10-01

    A major question in evolutionary biology is how natural selection has shaped patterns of genetic variation across the human genome. Previous work has documented a reduction in genetic diversity in regions of the genome with low recombination rates. However, it is unclear whether other summaries of genetic variation, like allele frequencies, are also correlated with recombination rate and whether these correlations can be explained solely by negative selection against deleterious mutations or whether positive selection acting on favorable alleles is also required. Here we attempt to address these questions by analyzing three different genome-wide resequencing datasets from European individuals. We document several significant correlations between different genomic features. In particular, we find that average minor allele frequency and diversity are reduced in regions of low recombination and that human diversity, human-chimp divergence, and average minor allele frequency are reduced near genes. Population genetic simulations show that either positive natural selection acting on favorable mutations or negative natural selection acting against deleterious mutations can explain these correlations. However, models with strong positive selection on nonsynonymous mutations and little negative selection predict a stronger negative correlation between neutral diversity and nonsynonymous divergence than observed in the actual data, supporting the importance of negative, rather than positive, selection throughout the genome. Further, we show that the widespread presence of weakly deleterious alleles, rather than a small number of strongly positively selected mutations, is responsible for the correlation between neutral genetic diversity and recombination rate. This work suggests that natural selection has affected multiple aspects of linked neutral variation throughout the human genome and that positive selection is not required to explain these observations.

  13. VarB Plus: An Integrated Tool for Visualization of Genome Variation Datasets

    KAUST Repository

    Hidayah, Lailatul

    2012-07-01

    Research on genomic sequences has been improving significantly as more advanced technology for sequencing has been developed. This opens enormous opportunities for sequence analysis. Various analytical tools have been built for purposes such as sequence assembly, read alignments, genome browsing, comparative genomics, and visualization. From the visualization perspective, there is an increasing trend towards use of large-scale computation. However, more than power is required to produce an informative image. This is a challenge that we address by providing several ways of representing biological data in order to advance the inference endeavors of biologists. This thesis focuses on visualization of variations found in genomic sequences. We develop several visualization functions and embed them in an existing variation visualization tool as extensions. The tool we improved is named VarB, hence the nomenclature for our enhancement is VarB Plus. To the best of our knowledge, besides VarB, there is no tool that provides the capability of dynamic visualization of genome variation datasets as well as statistical analysis. Dynamic visualization allows users to toggle different parameters on and off and see the results on the fly. The statistical analysis includes Fixation Index, Relative Variant Density, and Tajima’s D. Hence we focused our efforts on this tool. The scope of our work includes plots of per-base genome coverage, Principal Coordinate Analysis (PCoA), integration with a read alignment viewer named LookSeq, and visualization of geo-biological data. In addition to description of embedded functionalities, significance, and limitations, future improvements are discussed. The result is four extensions embedded successfully in the original tool, which is built on the Qt framework in C++. Hence it is portable to numerous platforms. Our extensions have shown acceptable execution time in a beta testing with various high-volume published datasets, as well as positive

  14. Genome-wide transcriptional reprogramming under drought stress

    KAUST Repository

    Chen, Hao

    2012-01-01

    Soil water deficit is one of the major factors limiting plant productivity. Plants cope with this adverse environmental condition by coordinating the up- or downregulation of an array of stress responsive genes. Reprogramming the expression of these genes leads to rebalanced development and growth that are in concert with the reduced water availability and that ultimately confer enhanced stress tolerance. Currently, several techniques have been employed to monitor genome-wide transcriptional reprogramming under drought stress. The results from these high throughput studies indicate that drought stress-induced transcriptional reprogramming is dynamic, has temporal and spatial specificity, and is coupled with the circadian clock and phytohormone signaling pathways. © 2012 Springer-Verlag Berlin Heidelberg. All rights are reserved.

  15. Resolution of Disease Phenotypes Resulting from Multilocus Genomic Variation.

    Science.gov (United States)

    Posey, Jennifer E; Harel, Tamar; Liu, Pengfei; Rosenfeld, Jill A; James, Regis A; Coban Akdemir, Zeynep H; Walkiewicz, Magdalena; Bi, Weimin; Xiao, Rui; Ding, Yan; Xia, Fan; Beaudet, Arthur L; Muzny, Donna M; Gibbs, Richard A; Boerwinkle, Eric; Eng, Christine M; Sutton, V Reid; Shaw, Chad A; Plon, Sharon E; Yang, Yaping; Lupski, James R

    2017-01-05

    Whole-exome sequencing can provide insight into the relationship between observed clinical phenotypes and underlying genotypes. We conducted a retrospective analysis of data from a series of 7374 consecutive unrelated patients who had been referred to a clinical diagnostic laboratory for whole-exome sequencing; our goal was to determine the frequency and clinical characteristics of patients for whom more than one molecular diagnosis was reported. The phenotypic similarity between molecularly diagnosed pairs of diseases was calculated with the use of terms from the Human Phenotype Ontology. A molecular diagnosis was rendered for 2076 of 7374 patients (28.2%); among these patients, 101 (4.9%) had diagnoses that involved two or more disease loci. We also analyzed parental samples, when available, and found that de novo variants accounted for 67.8% (61 of 90) of pathogenic variants in autosomal dominant disease genes and 51.7% (15 of 29) of pathogenic variants in X-linked disease genes; both variants were de novo in 44.7% (17 of 38) of patients with two monoallelic variants. Causal copy-number variants were found in 12 patients (11.9%) with multiple diagnoses. Phenotypic similarity scores were significantly lower among patients in whom the phenotype resulted from two distinct mendelian disorders that affected different organ systems (50 patients) than among patients with disorders that had overlapping phenotypic features (30 patients) (median score, 0.21 vs. 0.36; P=1.77×10^-7). In our study, we found multiple molecular diagnoses in 4.9% of cases in which whole-exome sequencing was informative. Our results show that structured clinical ontologies can be used to determine the degree of overlap between two mendelian diseases in the same patient; the diseases can be distinct or overlapping. Distinct disease phenotypes affect different organ systems, whereas overlapping disease phenotypes are more likely to be caused by two genes encoding proteins that interact within

  16. Genomic analysis of natural selection and phenotypic variation in high-altitude Mongolians.

    Directory of Open Access Journals (Sweden)

    Jinchuan Xing

    Full Text Available Deedu (DU) Mongolians, who migrated from the Mongolian steppes to the Qinghai-Tibetan Plateau approximately 500 years ago, are challenged by environmental conditions similar to native Tibetan highlanders. Identification of adaptive genetic factors in this population could provide insight into coordinated physiological responses to this environment. Here we examine genomic and phenotypic variation in this unique population and present the first complete analysis of a Mongolian whole-genome sequence. High-density SNP array data demonstrate that DU Mongolians share genetic ancestry with other Mongolian as well as Tibetan populations, specifically in genomic regions related to adaptation to high altitude. Several selection candidate genes identified in DU Mongolians are shared with other Asian groups (e.g., EDAR), neighboring Tibetan populations (including high-altitude candidates EPAS1, PKLR, and CYP2E1), as well as genes previously hypothesized to be associated with metabolic adaptation (e.g., PPARG). Hemoglobin concentration, a trait associated with high-altitude adaptation in Tibetans, is at an intermediate level in DU Mongolians compared to Tibetans and Han Chinese at comparable altitude. Whole-genome sequence from a DU Mongolian (Tianjiao1) shows that about 2% of the genomic variants, including more than 300 protein-coding changes, are specific to this individual. Our analyses of DU Mongolians and the first Mongolian genome provide valuable insight into genetic adaptation to extreme environments.

  17. Training set optimization under population structure in genomic selection.

    Science.gov (United States)

    Isidro, Julio; Jannink, Jean-Luc; Akdemir, Deniz; Poland, Jesse; Heslot, Nicolas; Sorrells, Mark E

    2015-01-01

    Population structure must be evaluated before optimization of the training set population. Maximizing the phenotypic variance captured by the training set is important for optimal performance. The optimization of the training set (TRS) in genomic selection has received much interest in both animal and plant breeding, because it is critical to the accuracy of the prediction models. In this study, five different TRS sampling algorithms, stratified sampling, mean of the coefficient of determination (CDmean), mean of predictor error variance (PEVmean), stratified CDmean (StratCDmean) and random sampling, were evaluated for prediction accuracy in the presence of different levels of population structure. In the presence of population structure, the most phenotypic variation captured by a sampling method in the TRS is desirable. The wheat dataset showed mild population structure, and CDmean and stratified CDmean methods showed the highest accuracies for all the traits except for test weight and heading date. The rice dataset had strong population structure and the approach based on stratified sampling showed the highest accuracies for all traits. In general, CDmean minimized the relationship between genotypes in the TRS, maximizing the relationship between TRS and the test set. This makes it suitable as an optimization criterion for long-term selection. Our results indicated that the best selection criterion used to optimize the TRS seems to depend on the interaction of trait architecture and population structure.
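
    Of the five sampling algorithms listed above, stratified sampling is the simplest to sketch. The abstract does not say how strata were defined or sized, so the version below assumes known subpopulation labels and proportional allocation. The function name and the toy labels are hypothetical.

```python
import random
from collections import defaultdict

def stratified_sample(labels, n_train, seed=0):
    """Pick training-set indices with each subpopulation represented
    in proportion to its size (proportional allocation, an assumption).

    Rounding means the result can differ slightly from n_train
    when the proportions are not whole numbers.
    """
    rng = random.Random(seed)
    groups = defaultdict(list)
    for index, label in enumerate(labels):
        groups[label].append(index)
    chosen = []
    for label in sorted(groups):
        members = groups[label]
        quota = round(n_train * len(members) / len(labels))
        chosen.extend(rng.sample(members, min(quota, len(members))))
    return sorted(chosen)

# Toy population: three subpopulations of 60, 30 and 10 genotypes.
labels = ["A"] * 60 + ["B"] * 30 + ["C"] * 10
train = stratified_sample(labels, 20)
```

    Criteria such as CDmean and PEVmean need a genomic relationship matrix and a mixed model, so they are outside the scope of this sketch.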

  18. Effects of Sublethal Fungicides on Mutation Rates and Genomic Variation in Fungal Plant Pathogen, Sclerotinia sclerotiorum.

    Science.gov (United States)

    Amaradasa, B Sajeewa; Everhart, Sydney E

    2016-01-01

    when repeated, only one isolate had higher EC50 while most isolates showed no difference. These results support the hypothesis that sublethal fungicide stress increases mutation rates in a largely clonal plant pathogen under in vitro conditions. Collectively, this work will aid our understanding of how non-lethal fungicide exposure may affect genomic variation, which may be an important mechanism of novel trait emergence, adaptation, and evolution for clonal organisms.

  19. Effects of Sublethal Fungicides on Mutation Rates and Genomic Variation in Fungal Plant Pathogen, Sclerotinia sclerotiorum.

    Directory of Open Access Journals (Sweden)

    B Sajeewa Amaradasa

    experiment, and when repeated, only one isolate had higher EC50 while most isolates showed no difference. These results support the hypothesis that sublethal fungicide stress increases mutation rates in a largely clonal plant pathogen under in vitro conditions. Collectively, this work will aid our understanding of how non-lethal fungicide exposure may affect genomic variation, which may be an important mechanism of novel trait emergence, adaptation, and evolution for clonal organisms.

  20. Effects of Sublethal Fungicides on Mutation Rates and Genomic Variation in Fungal Plant Pathogen, Sclerotinia sclerotiorum

    Science.gov (United States)

    Amaradasa, B. Sajeewa

    2016-01-01

    , and when repeated, only one isolate had higher EC50 while most isolates showed no difference. These results support the hypothesis that sublethal fungicide stress increases mutation rates in a largely clonal plant pathogen under in vitro conditions. Collectively, this work will aid our understanding of how non-lethal fungicide exposure may affect genomic variation, which may be an important mechanism of novel trait emergence, adaptation, and evolution for clonal organisms. PMID:27959950

  1. Genome-wide detection of copy number variations among diverse horse breeds by array CGH.

    Directory of Open Access Journals (Sweden)

    Wei Wang

    Full Text Available Recent studies have found that copy number variations (CNVs) are widespread in human and animal genomes. CNVs are a significant source of genetic variation, and have been shown to be associated with phenotypic diversity. However, the effect of CNVs on genetic variation in horses is not well understood. In the present study, CNVs in mares of 6 different breeds, Mongolia horse, Abaga horse, Hequ horse and Kazakh horse (all plateau breeds) and Debao pony and Thoroughbred, were determined using aCGH. In total, seven hundred CNVs were identified ranging in size from 6.1 Kb to 0.57 Mb across all autosomes, with an average size of 43.08 Kb and a median size of 15.11 Kb. By merging overlapping CNVs, we found a total of three hundred and fifty-three CNV regions (CNVRs). The length of the CNVRs ranged from 6.1 Kb to 1.45 Mb with average and median sizes of 38.49 Kb and 13.1 Kb. Collectively, 13.59 Mb of copy number variation was identified among the horses investigated and accounted for approximately 0.61% of the horse genome sequence. Five hundred and eighteen annotated genes were affected by CNVs, which corresponded to about 2.26% of all horse genes. Through gene ontology (GO) and genetic pathway analysis and comparison of CNV genes among different breeds, we found evidence that CNVs involving 7 genes may be related to the adaptation of these plateau horses to a severe environment. This study is the first report of copy number variations in Chinese horses, which indicates that CNVs are ubiquitous in the horse genome and influence many biological processes of the horse. These results will be helpful not only in mapping whole-genome CNVs in the horse, but also for further research on the adaptation of plateau horses to the severe high-altitude environment.
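
    The step of merging overlapping CNVs into CNV regions can be sketched as a standard interval merge. This is a generic illustration, not the authors' code. The function name, the tuple layout and the coordinates are hypothetical, and intervals are treated as closed, so CNVs that share a boundary base are merged.

```python
def merge_cnvs(cnvs):
    """Merge overlapping (chrom, start, end) CNV calls into CNV regions."""
    regions = []
    for chrom, start, end in sorted(cnvs):
        last = regions[-1] if regions else None
        if last is not None and last[0] == chrom and start <= last[2]:
            # Overlaps the current region on the same chromosome: extend it.
            last[2] = max(last[2], end)
        else:
            regions.append([chrom, start, end])
    return [tuple(region) for region in regions]

# Hypothetical CNV calls pooled across individuals.
calls = [
    ("chr1", 150, 300),
    ("chr1", 100, 200),
    ("chr1", 400, 500),
    ("chr2", 100, 200),
]
cnvrs = merge_cnvs(calls)
total_length = sum(end - start for _, start, end in cnvrs)
```

    Summing the merged lengths gives the kind of total used to report the fraction of the genome covered by CNVRs.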

  2. Nucleotide diversity maps reveal variation in diversity among wheat genomes and chromosomes

    Directory of Open Access Journals (Sweden)

    McGuire Patrick E

    2010-12-01

    chromosomal regions. The net effect of these factors in T. aestivum is large variation in diversity among genomes and chromosomes, which impacts the development of SNP markers and their practical utility. Accumulation of new mutations in older polyploid species, such as wild emmer, results in increased diversity and its more uniform distribution across the genome.

  3. Variations and classification of toxic epitopes related to celiac disease among α-gliadin genes from four Aegilops genomes.

    Science.gov (United States)

    Li, Jie; Wang, Shunli; Li, Shanshan; Ge, Pei; Li, Xiaohui; Ma, Wujun; Zeller, F J; Hsam, Sai L K; Yan, Yueming

    2012-07-01

    The α-gliadins are associated with human celiac disease. A total of 23 noninterrupted full open reading frame α-gliadin genes and 19 pseudogenes were cloned and sequenced from C, M, N, and U genomes of four diploid Aegilops species. Sequence comparison of α-gliadin genes from Aegilops and Triticum species demonstrated an existence of extensive allelic variations in Gli-2 loci of the four Aegilops genomes. Specific structural features were found including the compositions and variations of two polyglutamine domains (QI and QII) and four T cell stimulatory toxic epitopes. The mean numbers of glutamine residues in the QI domain in C and N genomes and the QII domain in C, N, and U genomes were much higher than those in Triticum genomes, and the QI domain in C and N genomes and the QII domain in C, M, N, and U genomes displayed greater length variations. Interestingly, the types and numbers of four T cell stimulatory toxic epitopes in α-gliadins from the four Aegilops genomes were significantly less than those from Triticum A, B, D, and their progenitor genomes. Relationships between the structural variations of the two polyglutamine domains and the distributions of four T cell stimulatory toxic epitopes were found, resulting in the α-gliadin genes from the Aegilops and Triticum genomes to be classified into three groups.

  4. Distribution and diversity of cytotypes in Dianthus broteri as evidenced by genome size variations.

    Science.gov (United States)

    Balao, Francisco; Casimiro-Soriguer, Ramón; Talavera, María; Herrera, Javier; Talavera, Salvador

    2009-10-01

    Studying the spatial distribution of cytotypes and genome size in plants can provide valuable information about the evolution of polyploid complexes. Here, the spatial distribution of cytological races and the amount of DNA in Dianthus broteri, an Iberian carnation with several ploidy levels, is investigated. Sample chromosome counts and flow cytometry (using propidium iodide) were used to determine overall genome size (2C value) and ploidy level in 244 individuals of 25 populations. Both fresh and dried samples were investigated. Differences in 2C and 1Cx values among ploidy levels within biogeographical provinces were tested using ANOVA. Geographical correlations of genome size were also explored. Extensive variation in chromosome numbers (2n = 2x = 30, 2n = 4x = 60, 2n = 6x = 90 and 2n = 12x = 180) was detected, and the dodecaploid cytotype is reported for the first time in this genus. As regards cytotype distribution, six populations were diploid, 11 were tetraploid, three were hexaploid and five were dodecaploid. Except for one diploid population containing some triploid plants (2n = 45), the remaining populations showed a single cytotype. Diploids appeared in two disjunct areas (south-east and south-west), and so did tetraploids (although with a considerably wider geographic range). Dehydrated leaf samples provided reliable measurements of DNA content. Genome size varied significantly among some cytotypes, and also extensively within diploid (up to 1.17-fold) and tetraploid (1.22-fold) populations. Nevertheless, variations were not straightforwardly congruent with ecology and geographical distribution. Dianthus broteri shows the highest diversity of cytotypes known to date in the genus Dianthus. Moreover, some cytotypes present remarkable internal genome size variation. The evolution of the complex is discussed in terms of autopolyploidy, with primary and secondary contact zones.

  5. Genome-Wide Fine-Scale Recombination Rate Variation in Drosophila melanogaster

    Science.gov (United States)

    Song, Yun S.

    2012-01-01

    Estimating fine-scale recombination maps of Drosophila from population genomic data is a challenging problem, in particular because of the high background recombination rate. In this paper, a new computational method is developed to address this challenge. Through an extensive simulation study, it is demonstrated that the method allows more accurate inference, and exhibits greater robustness to the effects of natural selection and noise, compared to a well-used previous method developed for studying fine-scale recombination rate variation in the human genome. As an application, a genome-wide analysis of genetic variation data is performed for two Drosophila melanogaster populations, one from North America (Raleigh, USA) and the other from Africa (Gikongoro, Rwanda). It is shown that fine-scale recombination rate variation is widespread throughout the D. melanogaster genome, across all chromosomes and in both populations. At the fine-scale, a conservative, systematic search for evidence of recombination hotspots suggests the existence of a handful of putative hotspots each with at least a tenfold increase in intensity over the background rate. A wavelet analysis is carried out to compare the estimated recombination maps in the two populations and to quantify the extent to which recombination rates are conserved. In general, similarity is observed at very broad scales, but substantial differences are seen at fine scales. The average recombination rate of the X chromosome appears to be higher than that of the autosomes in both populations, and this pattern is much more pronounced in the African population than the North American population. The correlation between various genomic features—including recombination rates, diversity, divergence, GC content, gene content, and sequence quality—is examined using the wavelet analysis, and it is shown that the most notable difference between D. melanogaster and humans is in the correlation between recombination and

  6. Chromosome Numbers and Genome Size Variation in Indian Species of Curcuma (Zingiberaceae)

    Science.gov (United States)

    Leong-Škorničková, Jana; Šída, Otakar; Jarolímová, Vlasta; Sabu, Mamyil; Fér, Tomáš; Trávníček, Pavel; Suda, Jan

    2007-01-01

    Background and Aims Genome size and chromosome numbers are important cytological characters that significantly influence various organismal traits. However, geographical representation of these data is seriously unbalanced, with tropical and subtropical regions being largely neglected. In the present study, an investigation was made of chromosomal and genome size variation in the majority of Curcuma species from the Indian subcontinent, and an assessment was made of the value of these data for taxonomic purposes. Methods Genome size of 161 homogeneously cultivated plant samples classified into 51 taxonomic entities was determined by propidium iodide flow cytometry. Chromosome numbers were counted in actively growing root tips using conventional rapid squash techniques. Key Results Six different chromosome counts (2n = 22, 42, 63, >70, 77 and 105) were found, the last two representing new generic records. The 2C-values varied from 1·66 pg in C. vamana to 4·76 pg in C. oligantha, representing a 2·87-fold range. Three groups of taxa with significantly different homoploid genome sizes (Cx-values) and distinct geographical distribution were identified. Five species exhibited intraspecific variation in nuclear DNA content, reaching up to 15·1 % in cultivated C. longa. Chromosome counts and genome sizes of three Curcuma-like species (Hitchenia caulina, Kaempferia scaposa and Paracautleya bhatii) corresponded well with typical hexaploid (2n = 6x = 42) Curcuma spp. Conclusions The basic chromosome number in the majority of Indian taxa (belonging to subgenus Curcuma) is x = 7; published counts correspond to 6x, 9x, 11x, 12x and 15x ploidy levels. Only a few species-specific C-values were found, but karyological and/or flow cytometric data may support taxonomic decisions in some species alliances with morphological similarities. Close evolutionary relationships among some cytotypes are suggested based on the similarity in homoploid genome sizes and geographical grouping

  7. [Analysis of genomic copy number variations in two sisters with primary amenorrhea and hyperandrogenism].

    Science.gov (United States)

    Zhang, Yanliang; Xu, Qiuyue; Cai, Xuemei; Li, Yixun; Song, Guibo; Wang, Juan; Zhang, Rongchen; Dai, Yong; Duan, Yong

    2015-12-01

    To analyze genomic copy number variations (CNVs) in two sisters with primary amenorrhea and hyperandrogenism. G-banding was performed for karyotype analysis. The whole genomes of the two sisters were scanned and analyzed by array-based comparative genomic hybridization (array-CGH). The results were confirmed with real-time quantitative PCR (RT-qPCR). No abnormality was found by conventional G-banded chromosome analysis. Array-CGH identified 11 identical CNVs in the sisters; however, these overlapped with CNVs reported by the Database of Genomic Variants (http://projects.tcag.ca/variation/) and are therefore likely to be benign. In addition, a ~8.44 Mb 9p11.1-p13.1 duplication (38,561,587-47,002,387 bp, hg18) and a ~80.9 kb 4q13.2 deletion (70,183,990-70,264,889 bp, hg18) were detected in the elder and younger sister, respectively. The relationship between these CNVs and primary amenorrhea and hyperandrogenism remains uncertain. RT-qPCR results were in accordance with array-CGH. Two CNVs were detected in the two sisters by array-CGH, and further studies are needed to clarify their correlation with primary amenorrhea and hyperandrogenism.
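
    The sizes quoted for the two CNVs (roughly 8.44 Mb and 80.9 kb) can be checked against the hg18 coordinates given in the abstract. The following is a minimal sketch of that arithmetic; the helper name span_bp is hypothetical, and the span is taken simply as end minus start, which is an assumption about how the authors measured it.

```python
def span_bp(start: int, end: int) -> int:
    """Length in base pairs between two coordinates (assumed here to be end - start)."""
    return end - start


# Coordinates copied from the abstract (hg18).
duplication_bp = span_bp(38_561_587, 47_002_387)  # 9p11.1-p13.1 duplication
deletion_bp = span_bp(70_183_990, 70_264_889)     # 4q13.2 deletion

print(f"duplication: {duplication_bp / 1e6:.2f} Mb")  # span expressed in megabases
print(f"deletion: {deletion_bp / 1e3:.1f} kb")        # span expressed in kilobases
```

    Under that assumption the spans come out at 8,440,800 bp and 80,899 bp, in line with the sizes reported in the record.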

  8. Comparative population genomics of latitudinal variation in Drosophila simulans and Drosophila melanogaster.

    Science.gov (United States)

    Machado, Heather E; Bergland, Alan O; O'Brien, Katherine R; Behrman, Emily L; Schmidt, Paul S; Petrov, Dmitri A

    2016-02-01

    Examples of clinal variation in phenotypes and genotypes across latitudinal transects have served as important models for understanding how spatially varying selection and demographic forces shape variation within species. Here, we examine the selective and demographic contributions to latitudinal variation through the largest comparative genomic study to date of Drosophila simulans and Drosophila melanogaster, with genomic sequence data from 382 individual fruit flies, collected across a spatial transect of 19 degrees latitude and at multiple time points over 2 years. Consistent with phenotypic studies, we find less clinal variation in D. simulans than D. melanogaster, particularly for the autosomes. Moreover, we find that clinally varying loci in D. simulans are less stable over multiple years than comparable clines in D. melanogaster. D. simulans shows a significantly weaker pattern of isolation by distance than D. melanogaster and we find evidence for a stronger contribution of migration to D. simulans population genetic structure. While population bottlenecks and migration can plausibly explain the differences in stability of clinal variation between the two species, we also observe a significant enrichment of shared clinal genes, suggesting that the selective forces associated with climate are acting on the same genes and phenotypes in D. simulans and D. melanogaster. © 2015 John Wiley & Sons Ltd.

  9. A genomic overview of short genetic variations in a basal chordate, Ciona intestinalis

    Directory of Open Access Journals (Sweden)

    Satou Yutaka

    2012-05-01

    Full Text Available Abstract Background Although the Ciona intestinalis genome contains many allelic polymorphisms, there is only limited data analyzed systematically. Establishing a dense map of genetic variations in C. intestinalis is necessary not only for linkage analysis, but also for other experimental biology including molecular developmental and evolutionary studies, because animals from natural populations are typically used for experiments. Results Here, we identified over three million candidate short genomic variations within a 110 Mb euchromatin region among five C. intestinalis individuals. The average nucleotide diversity was approximately 1.1%. Genetic variations were found at a similar density in intergenic and gene regions. Non-synonymous and nonsense nucleotide substitutions were found in 12,493 and 1,214 genes accounting for 81.9% and 8.0% of the entire gene set, respectively, and over 60% of genes in the single animal encode non-identical proteins between maternal and paternal alleles. Conclusions Our results provide a framework for studying evolution of the animal genome, as well as a useful resource for a wide range of C. intestinalis researchers.

  10. Comparative analysis of complete chloroplast genome sequence and inversion variation in Lasthenia burkei (Madieae, Asteraceae).

    Science.gov (United States)

    Walker, Joseph F; Zanis, Michael J; Emery, Nancy C

    2014-04-01

    Complete chloroplast genome studies can help resolve relationships among large, complex plant lineages such as Asteraceae. We present the first whole plastome from the Madieae tribe and compare its sequence variation to other chloroplast genomes in Asteraceae. We used high throughput sequencing to obtain the Lasthenia burkei chloroplast genome. We compared sequence structure and rates of molecular evolution in the small single copy (SSC), large single copy (LSC), and inverted repeat (IR) regions to those for eight Asteraceae accessions and one Solanaceae accession. The chloroplast sequence of L. burkei is 150 746 bp and contains 81 unique protein coding genes and 4 coding ribosomal RNA sequences. We identified three major inversions in the L. burkei chloroplast, all of which have been found in other Asteraceae lineages, and a previously unreported inversion in Lactuca sativa. Regions flanking inversions contained tRNA sequences, but did not have particularly high G + C content. Substitution rates varied among the SSC, LSC, and IR regions, and rates of evolution within each region varied among species. Some observed differences in rates of molecular evolution may be explained by the relative proportion of coding to noncoding sequence within regions. Rates of molecular evolution vary substantially within and among chloroplast genomes, and major inversion events may be promoted by the presence of tRNAs. Collectively, these results provide insight into different mechanisms that may promote intramolecular recombination and the inversion of large genomic regions in the plastome.

  11. Genome variations associated with viral susceptibility and calcification in Emiliania huxleyi.

    Science.gov (United States)

    Kegel, Jessica U; John, Uwe; Valentin, Klaus; Frickenhaus, Stephan

    2013-01-01

    Emiliania huxleyi, a key player in the global carbon cycle is one of the best studied coccolithophores with respect to biogeochemical cycles, climatology, and host-virus interactions. Strains of E. huxleyi show phenotypic plasticity regarding growth behaviour, light-response, calcification, acidification, and virus susceptibility. This phenomenon is likely a consequence of genomic differences, or transcriptomic responses, to environmental conditions or threats such as viral infections. We used an E. huxleyi genome microarray based on the sequenced strain CCMP1516 (reference strain) to perform comparative genomic hybridizations (CGH) of 16 E. huxleyi strains of different geographic origin. We investigated the genomic diversity and plasticity and focused on the identification of genes related to virus susceptibility and coccolith production (calcification). Among the tested 31940 gene models a core genome of 14628 genes was identified by hybridization among 16 E. huxleyi strains. 224 probes were characterized as specific for the reference strain CCMP1516. Compared to the sequenced E. huxleyi strain CCMP1516 variation in gene content of up to 30 percent among strains was observed. Comparison of core and non-core transcripts sets in terms of annotated functions reveals a broad, almost equal functional coverage over all KOG-categories of both transcript sets within the whole annotated genome. Within the variable (non-core) genome we identified genes associated with virus susceptibility and calcification. Genes associated with virus susceptibility include a Bax inhibitor-1 protein, three LRR receptor-like protein kinases, and mitogen-activated protein kinase. Our list of transcripts associated with coccolith production will stimulate further research, e.g. by genetic manipulation. In particular, the V-type proton ATPase 16 kDa proteolipid subunit is proposed to be a plausible target gene for further calcification studies.

  12. Patterns of Genome-Wide Variation in Glossina fuscipes fuscipes Tsetse Flies from Uganda

    Directory of Open Access Journals (Sweden)

    Andrea Gloria-Soria

    2016-06-01

    Full Text Available The tsetse fly Glossina fuscipes fuscipes (Gff) is the insect vector of the two forms of Human African Trypanosomiasis (HAT) that exist in Uganda. Understanding Gff population dynamics and the underlying genetics of epidemiologically relevant phenotypes is key to reducing disease transmission. Using ddRAD sequence technology, complemented with whole-genome sequencing, we developed a panel of ∼73,000 single-nucleotide polymorphisms (SNPs) distributed across the Gff genome that can be used for population genomics and to perform genome-wide-association studies. We used these markers to estimate genomic patterns of linkage disequilibrium (LD) in Gff, and used the information, in combination with outlier-locus detection tests, to identify candidate regions of the genome under selection. LD in individual populations decays to half of its maximum value (r2max/2) between 1359 and 2429 bp. The overall LD estimated for the species reaches r2max/2 at 708 bp, an order of magnitude slower than in Drosophila. Using 53 infected (Trypanosoma spp.) and uninfected flies from four genetically distinct Ugandan populations adapted to different environmental conditions, we were able to identify SNPs associated with the infection status of the fly and local environmental adaptation. The extent of LD in Gff likely facilitated the detection of loci under selection, despite the small sample size. Furthermore, it is probable that LD in the regions identified is much higher than the average genomic LD due to strong selection. Our results show that even modest sample sizes can reveal significant genetic associations in this species, which has implications for future studies given the difficulties of collecting field specimens with contrasting phenotypes for association analysis.
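
    The r2max/2 statistic in this record is the physical distance at which LD has fallen to half of its maximum value. The abstract does not say how the authors estimated it, so the following is only a minimal sketch of one simple approach: linear interpolation over binned mean r2 values. The function name and the example numbers are hypothetical and are not data from the study.

```python
def half_decay_distance(distances, r2_values):
    """Distance at which r2 first drops to half of its maximum.

    distances: increasing distances in bp; r2_values: mean r2 at each distance.
    Uses linear interpolation between the two bracketing points.
    Returns None if r2 never reaches half of its maximum.
    """
    r2_max = max(r2_values)
    if r2_max <= 0:
        return None
    target = r2_max / 2
    start = r2_values.index(r2_max)
    for i in range(start + 1, len(r2_values)):
        if r2_values[i] <= target:
            d0, d1 = distances[i - 1], distances[i]
            y0, y1 = r2_values[i - 1], r2_values[i]
            fraction = (y0 - target) / (y0 - y1)
            return d0 + fraction * (d1 - d0)
    return None


# Made-up binned LD curve for illustration only.
example_distances = [0, 500, 1000, 2000, 4000]
example_r2 = [0.40, 0.30, 0.22, 0.18, 0.10]
print(half_decay_distance(example_distances, example_r2))
```

    Published analyses often fit a decay curve to pairwise r2 values rather than interpolating between bins, so this sketch illustrates only what the statistic means.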

  13. Single-Nucleotide Variations in Cardiac Arrhythmias: Prospects for Genomics and Proteomics Based Biomarker Discovery and Diagnostics

    Directory of Open Access Journals (Sweden)

    Ayman Abunimer

    2014-03-01

    Full Text Available Cardiovascular diseases are a large contributor to causes of early death in developed countries. Some of these conditions, such as sudden cardiac death and atrial fibrillation, stem from arrhythmias, a spectrum of conditions with abnormal electrical activity in the heart. Genome-wide association studies can identify single nucleotide variations (SNVs) that may predispose individuals to developing acquired forms of arrhythmias. Through manual curation of published genome-wide association studies, we have collected a comprehensive list of 75 SNVs associated with cardiac arrhythmias. Ten of the SNVs result in amino acid changes and can be used in proteomic-based detection methods. In an effort to identify additional non-synonymous mutations that affect the proteome, we analyzed the post-translational modification S-nitrosylation, which is known to affect cardiac arrhythmias. We identified loss of seven known S-nitrosylation sites due to non-synonymous single nucleotide variations (nsSNVs). For predicted nitrosylation sites we found 1429 proteins where the sites are modified due to nsSNV. Analysis of the predicted S-nitrosylation dataset for over- or under-representation (compared to the complete human proteome) of pathways and functional elements shows significant statistical over-representation of the blood coagulation pathway. Gene Ontology (GO) analysis displays statistically over-represented terms related to muscle contraction, receptor activity, motor activity, cytoskeleton components, and microtubule activity. Through the genomic and proteomic context of SNVs and S-nitrosylation sites presented in this study, researchers can look for variation that can predispose individuals to cardiac arrhythmias. Such attempts to elucidate mechanisms of arrhythmia thereby add yet another useful parameter in predicting susceptibility for cardiac diseases.

  14. GENOMICS SYMPOSIUM: Using genomic approaches to uncover sources of variation in age at puberty and reproductive longevity in sows

    Science.gov (United States)

    Genetic variants associated with traits such as age at puberty and litter size could provide insight into the underlying genetic sources of variation impacting sow reproductive longevity and productivity. Genomewide characterization and gene expression profiling were used using gilts from the Univer...

  15. Genome-wide variation and demographic history of small cats with a focus on Felis species

    Directory of Open Access Journals (Sweden)

    Anubhab Khan

    2017-10-01

    Full Text Available The majority of the 38 known cat species are classified as small, and they inhabit five of the seven continents. They survive in a vast range of habitats, yet 12 of the 18 threatened felids are small cats. However, there has not been enough progress in the field of small cat research, as small cats generally get overshadowed by the charismatic big cats. Here we attempt to create a resource for small cat research, especially for the genus Felis, which has six species, of which two are classified as vulnerable by IUCN and at least one more is at risk. We collected tissue samples of four Felis chaus (Jungle cat) from central India and used available whole genome sequences of nine individuals from four other Felis species, two individuals of Prionailurus bengalensis and an Otocolobus manul. These whole genome sequences were filtered and aligned with the already published domestic cat (Felis catus) genome assembly. Felids are closely related species and reads from all species in our study aligned with the domestic cat genome with a rate of at least 93%. We estimated the existing genomic variation by calculating heterozygous SNP encounter rate. So far, it seems that all wild cats have more genetic variation than Felis catus. This can be attributed to the inbreeding in these cats. Among the wild cats, Felis silvestris seems to have the highest level of genetic variation. To understand the reasons behind the distribution of genetic variation in small cats, we estimated the demographic histories of each of the species using PSMC. This method can only detect demographic changes more than 1000 generations ago. We observe that roughly all species share a parallel history in terms of population increase. The most interesting and important feature might be that all wild small cat population sizes increased exponentially around twenty thousand years ago, as opposed to domestic cat and big cats, which declined around this time. Another interesting feature of

  16. Genomic value prediction for quantitative traits under the epistatic model

    Directory of Open Access Journals (Sweden)

    Xu Shizhong

    2011-01-01

    Full Text Available Abstract Background Most quantitative traits are controlled by multiple quantitative trait loci (QTL). The contribution of each locus may be negligible but the collective contribution of all loci is usually significant. Genome selection that uses markers of the entire genome to predict the genomic values of individual plants or animals can be more efficient than selection on phenotypic values and pedigree information alone for genetic improvement. When a quantitative trait is contributed by epistatic effects, using all markers (main effects) and marker pairs (epistatic effects) to predict the genomic values of plants can achieve the maximum efficiency for genetic improvement. Results In this study, we created 126 recombinant inbred lines of soybean and genotyped 80 markers across the genome. We applied the genome selection technique to predict the genomic value of somatic embryo number (a quantitative trait) for each line. Cross validation analysis showed that the squared correlation coefficient between the observed and predicted embryo numbers was 0.33 when only main (additive) effects were used for prediction. When the interaction (epistatic) effects were also included in the model, the squared correlation coefficient reached 0.78. Conclusions This study provided an excellent example for the application of genome selection to plant breeding.
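
    The accuracy measure quoted in this record is the squared correlation coefficient between observed phenotypes and the values predicted for held-out lines. The following is a minimal sketch of that statistic alone, assuming the predictions have already been produced by some cross-validation scheme; the function name and the toy numbers are hypothetical and unrelated to the soybean data.

```python
def squared_correlation(observed, predicted):
    """Squared Pearson correlation between observed and predicted values."""
    n = len(observed)
    mean_obs = sum(observed) / n
    mean_pred = sum(predicted) / n
    # Sum of cross-products and sums of squares around the means.
    cov = sum((o - mean_obs) * (p - mean_pred) for o, p in zip(observed, predicted))
    ss_obs = sum((o - mean_obs) ** 2 for o in observed)
    ss_pred = sum((p - mean_pred) ** 2 for p in predicted)
    return cov * cov / (ss_obs * ss_pred)


# Toy values for illustration only.
toy_observed = [1.0, 2.0, 3.0]
toy_predicted = [1.0, 3.0, 2.0]
print(squared_correlation(toy_observed, toy_predicted))
```

    A value near 1 means the held-out predictions track the observations closely; the record reports 0.33 for the additive model and 0.78 once epistatic effects are included.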

  17. Epigenetic Variation in Monozygotic Twins: A Genome-Wide Analysis of DNA Methylation in Buccal Cells

    Directory of Open Access Journals (Sweden)

    Jenny van Dongen

    2014-05-01

    Full Text Available DNA methylation is one of the most extensively studied epigenetic marks in humans. Yet, it is largely unknown what causes variation in DNA methylation between individuals. The comparison of DNA methylation profiles of monozygotic (MZ) twins offers a unique experimental design to examine the extent to which such variation is related to individual-specific environmental influences and stochastic events or to familial factors (DNA sequence and shared environment). We measured genome-wide DNA methylation in buccal samples from ten MZ pairs (age 8–19) using the Illumina 450k array and examined twin correlations for methylation level at 420,921 CpGs after QC. After selecting CpGs showing the most variation in the methylation level between subjects, the mean genome-wide correlation (rho) was 0.54. The correlation was higher, on average, for CpGs within CpG islands (CGIs), compared to CGI shores, shelves and non-CGI regions, particularly at hypomethylated CpGs. This finding suggests that individual-specific environmental and stochastic influences account for more variation in DNA methylation in CpG-poor regions. Our findings also indicate that it is worthwhile to examine heritable and shared environmental influences on buccal DNA methylation in larger studies that also include dizygotic twins.

  18. Common genetic variation and susceptibility to partial epilepsies: a genome-wide association study.

    Science.gov (United States)

    Kasperaviciūte, Dalia; Catarino, Claudia B; Heinzen, Erin L; Depondt, Chantal; Cavalleri, Gianpiero L; Caboclo, Luis O; Tate, Sarah K; Jamnadas-Khoda, Jenny; Chinthapalli, Krishna; Clayton, Lisa M S; Shianna, Kevin V; Radtke, Rodney A; Mikati, Mohamad A; Gallentine, William B; Husain, Aatif M; Alhusaini, Saud; Leppert, David; Middleton, Lefkos T; Gibson, Rachel A; Johnson, Michael R; Matthews, Paul M; Hosford, David; Heuser, Kjell; Amos, Leslie; Ortega, Marcos; Zumsteg, Dominik; Wieser, Heinz-Gregor; Steinhoff, Bernhard J; Krämer, Günter; Hansen, Jörg; Dorn, Thomas; Kantanen, Anne-Mari; Gjerstad, Leif; Peuralinna, Terhi; Hernandez, Dena G; Eriksson, Kai J; Kälviäinen, Reetta K; Doherty, Colin P; Wood, Nicholas W; Pandolfo, Massimo; Duncan, John S; Sander, Josemir W; Delanty, Norman; Goldstein, David B; Sisodiya, Sanjay M

    2010-07-01

    Partial epilepsies have a substantial heritability. However, the actual genetic causes are largely unknown. In contrast to many other common diseases for which genetic association-studies have successfully revealed common variants associated with disease risk, the role of common variation in partial epilepsies has not yet been explored in a well-powered study. We undertook a genome-wide association-study to identify common variants which influence risk for epilepsy shared amongst partial epilepsy syndromes, in 3445 patients and 6935 controls of European ancestry. We did not identify any genome-wide significant association. A few single nucleotide polymorphisms may warrant further investigation. We exclude common genetic variants with effect sizes above a modest 1.3 odds ratio for a single variant as contributors to genetic susceptibility shared across the partial epilepsies. We show that, at best, common genetic variation can only have a modest role in predisposition to the partial epilepsies when considered across syndromes in Europeans. The genetic architecture of the partial epilepsies is likely to be very complex, reflecting genotypic and phenotypic heterogeneity. Larger meta-analyses are required to identify variants of smaller effect sizes (odds ratio<1.3) or syndrome-specific variants. Further, our results suggest research efforts should also be directed towards identifying the multiple rare variants likely to account for at least part of the heritability of the partial epilepsies. Data emerging from genome-wide association-studies will be valuable during the next serious challenge of interpreting all the genetic variation emerging from whole-genome sequencing studies.

  19. A map of human genome variation from population-scale sequencing.

    Science.gov (United States)

    Abecasis, Gonçalo R; Altshuler, David; Auton, Adam; Brooks, Lisa D; Durbin, Richard M; Gibbs, Richard A; Hurles, Matt E; McVean, Gil A

    2010-10-28

    The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother-father-child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10(-8) per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research.

  20. An initial comparative map of copy number variations in the goat (Capra hircus) genome

    Directory of Open Access Journals (Sweden)

    Casadio Rita

    2010-11-01

    Full Text Available Abstract Background The goat (Capra hircus) represents one of the most important farm animal species. It is reared in all continents with an estimated world population of about 800 million animals. Despite its importance, studies on the goat genome are still in their infancy compared to those in other farm animal species. Comparative mapping between cattle and goat showed only a few rearrangements in agreement with the similarity of chromosome banding. We carried out a cross species cattle-goat array comparative genome hybridization (aCGH) experiment in order to identify copy number variations (CNVs) in the goat genome, analysing animals of different breeds (Saanen, Camosciata delle Alpi, Girgentana, and Murciano-Granadina) using a tiling oligonucleotide array with ~385,000 probes designed on the bovine genome. Results We identified a total of 161 CNVs (an average of 17.9 CNVs per goat), with the largest number in the Saanen breed and the lowest in the Camosciata delle Alpi goat. By aggregating overlapping CNVs identified in different animals we determined CNV regions (CNVRs): on the whole, we identified 127 CNVRs covering about 11.47 Mb of the virtual goat genome referred to the bovine genome (0.435% of the latter genome). These 127 CNVRs included 86 losses and 41 gains and ranged from about 24 kb to about 1.07 Mb with a mean and median equal to 90,292 bp and 49,530 bp, respectively. To evaluate whether the identified goat CNVRs overlap with those reported in the cattle genome, we compared our results with those obtained in four independent cattle experiments. Overlapping between goat and cattle CNVRs was highly significant (P Conclusions We describe a first map of goat CNVRs. This provides information on a comparative basis with the cattle genome by identifying putative recurrent interspecies CNVs between these two ruminant species. Several goat CNVs affect genes with important biological functions. Further studies are needed to evaluate the

  1. Extensive variation in the density and distribution of DNA polymorphism in sorghum genomes.

    Directory of Open Access Journals (Sweden)

    Joseph Evans

    Full Text Available Sorghum genotypes currently used for grain production in the United States were developed from African landraces that were imported starting in the mid-to-late 19th century. Farmers and plant breeders selected genotypes for grain production with reduced plant height, early flowering, increased grain yield, adaptation to drought, and improved resistance to lodging, diseases and pests. DNA polymorphisms that distinguish three historically important grain sorghum genotypes, BTx623, BTx642 and Tx7000, were characterized by genome sequencing, genotyping by sequencing, genetic mapping, and pedigree-based haplotype analysis. The distribution and density of DNA polymorphisms in the sequenced genomes varied widely, in part because the lines were derived through breeding and selection from diverse Kafir, Durra, and Caudatum race accessions. Genomic DNA spanning dw1 (SBI-09) and dw3 (SBI-07) had identical haplotypes due to selection for reduced height. Lower SNP density in genes located in pericentromeric regions compared with genes located in euchromatic regions is consistent with background selection in these regions of low recombination. SNP density was higher in euchromatic DNA and varied >100-fold in contiguous intervals that spanned up to 300 Kbp. The localized variation in DNA polymorphism density occurred throughout euchromatic regions where recombination is elevated; however, polymorphism density was not correlated with gene density or DNA methylation. Overall, sorghum chromosomes contain distal euchromatic regions characterized by extensive, localized variation in DNA polymorphism density, and large pericentromeric regions of low gene density, diversity, and recombination.

  2. PGen: large-scale genomic variations analysis workflow and browser in SoyKB.

    Science.gov (United States)

    Liu, Yang; Khan, Saad M; Wang, Juexin; Rynge, Mats; Zhang, Yuanxun; Zeng, Shuai; Chen, Shiyuan; Maldonado Dos Santos, Joao V; Valliyodan, Babu; Calyam, Prasad P; Merchant, Nirav; Nguyen, Henry T; Xu, Dong; Joshi, Trupti

    2016-10-06

    With the advances in next-generation sequencing (NGS) technology and significant reductions in sequencing costs, it is now possible to sequence large collections of germplasm in crops for detecting genome-scale genetic variations and to apply the knowledge towards improvements in traits. To efficiently facilitate large-scale NGS resequencing data analysis of genomic variations, we have developed "PGen", an integrated and optimized workflow using the Extreme Science and Engineering Discovery Environment (XSEDE) high-performance computing (HPC) virtual system, iPlant cloud data storage resources and Pegasus workflow management system (Pegasus-WMS). The workflow allows users to identify single nucleotide polymorphisms (SNPs) and insertion-deletions (indels), perform SNP annotations and conduct copy number variation analyses on multiple resequencing datasets in a user-friendly and seamless way. We have developed both a Linux version in GitHub ( https://github.com/pegasus-isi/PGen-GenomicVariations-Workflow ) and a web-based implementation of the PGen workflow integrated within the Soybean Knowledge Base (SoyKB), ( http://soykb.org/Pegasus/index.php ). Using PGen, we identified 10,218,140 single-nucleotide polymorphisms (SNPs) and 1,398,982 indels from analysis of 106 soybean lines sequenced at 15X coverage. 297,245 non-synonymous SNPs and 3330 copy number variation (CNV) regions were identified from this analysis. SNPs identified using PGen from additional soybean resequencing projects adding to 500+ soybean germplasm lines in total have been integrated. These SNPs are being utilized for trait improvement using genotype to phenotype prediction approaches developed in-house. In order to browse and access NGS data easily, we have also developed an NGS resequencing data browser ( http://soykb.org/NGS_Resequence/NGS_index.php ) within SoyKB to provide easy access to SNP and downstream analysis results for soybean researchers. 
PGen workflow has been optimized for the most

  3. Molecular Mechanisms Underlying Genomic Instability in Brca-Deficient Cells

    Science.gov (United States)

    2014-11-01

    increased by hydroxyurea, ATR inhibition, deregulated c-Myc expression and by PARPi treatment of BRCA1 deficient cells. This work was recently published...Genome Stability." 6: May 27, 2013-Collaborative Research Center 655 from Cells to Tissues seminar series at the Max-Planck-Institute in Dresden, Germany ...Eisenach, Germany -“Genome Stability during DNA Replication” 8: May 3, 2013- Chemical and Systems Biology Department Seminar Series at Stanford

  4. Illumina based whole mitochondrial genome of Junonia iphita reveals minor intraspecific variation

    Directory of Open Access Journals (Sweden)

    Catherine Vanlalruati

    2015-12-01

    Full Text Available In the present study, the near complete mitochondrial genome (mitogenome) of Junonia iphita (Lepidoptera: Nymphalidae: Nymphalinae) was determined to be 14,892 bp. The gene order and orientation are identical to those in other butterfly species. The phylogenetic tree constructed from the whole mitogenomes using the 13 protein coding genes (PCGs) defines the genetic relatedness of the two J. iphita species collected from two different regions. All the Junonia species clustered together, and were further subdivided into clade one consisting of J. almana and J. orithya and clade two comprising the two J. iphita which were collected from Indo and Indochinese subregions separated by a river barrier. Comparison between the two J. iphita sequences revealed minor variations and Single Nucleotide Polymorphisms were identified at 51 sites amounting to 0.4% of the entire mitochondrial genome.

  5. Copy number variation is a fundamental aspect of the placental genome.

    Directory of Open Access Journals (Sweden)

    Roberta L Hannibal

    2014-05-01

    Full Text Available Discovery of lineage-specific somatic copy number variation (CNV) in mammals has led to debate over whether CNVs are mutations that propagate disease or whether they are a normal, and even essential, aspect of cell biology. We show that 1,000 N polyploid trophoblast giant cells (TGCs) of the mouse placenta contain 47 regions, totaling 138 Megabases, where genomic copies are underrepresented (UR). UR domains originate from a subset of late-replicating heterochromatic regions containing gene deserts and genes involved in cell adhesion and neurogenesis. While lineage-specific CNVs have been identified in mammalian cells, classically in the immune system where V(D)J recombination occurs, we demonstrate that CNVs form during gestation in the placenta by an underreplication mechanism, not by recombination nor deletion. Our results reveal that large scale CNVs are a normal feature of the mammalian placental genome, which are regulated systematically during embryogenesis and are propagated by a mechanism of underreplication.

  6. Whole genome re-sequencing reveals genome-wide variations among parental lines of 16 mapping populations in chickpea (Cicer arietinum L.).

    Science.gov (United States)

    Thudi, Mahendar; Khan, Aamir W; Kumar, Vinay; Gaur, Pooran M; Katta, Krishnamohan; Garg, Vanika; Roorkiwal, Manish; Samineni, Srinivasan; Varshney, Rajeev K

    2016-01-27

    Chickpea (Cicer arietinum L.) is the second most important grain legume cultivated by resource poor farmers in South Asia and Sub-Saharan Africa. In order to harness the untapped genetic potential available for chickpea improvement, we re-sequenced 35 chickpea genotypes representing parental lines of 16 mapping populations segregating for abiotic (drought, heat, salinity), biotic stresses (Fusarium wilt, Ascochyta blight, Botrytis grey mould, Helicoverpa armigera) and nutritionally important (protein content) traits using whole genome re-sequencing approach. A total of 192.19 Gb of data, comprising 973.13 million reads, was generated on 35 genotypes of chickpea, with an average sequencing depth of ~10 X for each line. On average, 92.18 % of reads from each genotype were aligned to the chickpea reference genome with 82.17 % coverage. A total of 2,058,566 unique single nucleotide polymorphisms (SNPs) and 292,588 Indels were detected while comparing with the reference chickpea genome. The highest number of SNPs was identified on the Ca4 pseudomolecule. In addition, copy number variations (CNVs) such as gene deletions and duplications were identified across the chickpea parental genotypes, which were minimum in PI 489777 (1 gene deletion) and maximum in JG 74 (1,497). A total of 164,856 line specific variations (144,888 SNPs and 19,968 Indels) with the highest percentage were identified in coding regions in ICC 1496 (21 %) followed by ICCV 97105 (12 %). Of 539 miscellaneous variations, 339, 138 and 62 were inter-chromosomal variations (CTX), intra-chromosomal variations (ITX) and inversions (INV) respectively. Genome-wide SNPs, Indels, CNVs, PAVs, and miscellaneous variations identified in different mapping populations are a valuable resource in genetic research and helpful in locating genes/genomic segments responsible for economically important traits. Further, the genome-wide variations identified in the present study can be used for developing high density SNP arrays for

  7. Genome Wide Distributions and Functional Characterization of Copy Number Variations between Chinese and Western Pigs.

    Directory of Open Access Journals (Sweden)

    Hongyang Wang

    Full Text Available Copy number variations (CNVs) refer to large insertions, deletions and duplications in the genomic structure ranging from one thousand to several million bases in size. Since the development of next generation sequencing technology, several methods have been well built for detection of copy number variations with high credibility and accuracy. Evidence has shown that CNV occurring in gene region could lead to phenotypic changes due to the alteration in gene structure and dosage. However, it still remains unexplored whether CNVs underlie the phenotypic differences between Chinese and Western domestic pigs. Based on the read-depth methods, we investigated copy number variations using 49 individuals derived from both Chinese and Western pig breeds. A total of 3,131 copy number variation regions (CNVRs) were identified with an average size of 13.4 Kb in all individuals during domestication, harboring 1,363 genes. Among them, 129 and 147 CNVRs were Chinese and Western pig specific, respectively. Gene functional enrichments revealed that these CNVRs contribute to strong disease resistance and high prolificacy in Chinese domestic pigs, but strong muscle tissue development in Western domestic pigs. This finding is strongly consistent with the morphologic characteristics of Chinese and Western pigs, indicating that these group-specific CNVRs might have been preserved by artificial selection for the favored phenotypes during independent domestication of Chinese and Western pigs. In this study, we built high-resolution CNV maps in several domestic pig breeds and discovered the group specific CNVs by comparing Chinese and Western pigs, which could provide new insight into genomic variations during pigs' independent domestication, and facilitate further functional studies of CNV-associated genes.

  8. Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing.

    Science.gov (United States)

    Aflitos, Saulo; Schijlen, Elio; de Jong, Hans; de Ridder, Dick; Smit, Sandra; Finkers, Richard; Wang, Jun; Zhang, Gengyun; Li, Ning; Mao, Likai; Bakker, Freek; Dirks, Rob; Breit, Timo; Gravendeel, Barbara; Huits, Henk; Struss, Darush; Swanson-Wagner, Ruth; van Leeuwen, Hans; van Ham, Roeland C H J; Fito, Laia; Guignier, Laëtitia; Sevilla, Myrna; Ellul, Philippe; Ganko, Eric; Kapur, Arvind; Reclus, Emannuel; de Geus, Bernard; van de Geest, Henri; Te Lintel Hekkert, Bas; van Haarst, Jan; Smits, Lars; Koops, Andries; Sanchez-Perez, Gabino; van Heusden, Adriaan W; Visser, Richard; Quan, Zhiwu; Min, Jiumeng; Liao, Li; Wang, Xiaoli; Wang, Guangbiao; Yue, Zhen; Yang, Xinhua; Xu, Na; Schranz, Eric; Smets, Erik; Vos, Rutger; Rauwerda, Johan; Ursem, Remco; Schuit, Cees; Kerns, Mike; van den Berg, Jan; Vriezen, Wim; Janssen, Antoine; Datema, Erwin; Jahrman, Torben; Moquet, Frederic; Bonnet, Julien; Peters, Sander

    2014-10-01

    We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new reference genomes were reconstructed to support our comparative genome analyses. Comparative sequence alignment revealed group-, species- and accession-specific polymorphisms, explaining characteristic fruit traits and growth habits in the various cultivars. Using gene models from the annotated Heinz 1706 reference genome, we observed differences in the ratio between non-synonymous and synonymous SNPs (dN/dS) in fruit diversification and plant growth genes compared to a random set of genes, indicating positive selection and differences in selection pressure between crop accessions and wild species. In wild species, the number of single-nucleotide polymorphisms (SNPs) exceeds 10 million, i.e. 20-fold higher than found in most of the crop accessions, indicating dramatic genetic erosion of crop and heirloom tomatoes. In addition, the highest levels of heterozygosity were found for allogamous self-incompatible wild species, while facultative and autogamous self-compatible species display a lower heterozygosity level. Using whole-genome SNP information for maximum-likelihood analysis, we achieved complete tree resolution, whereas maximum-likelihood trees based on SNPs from ten fruit and growth genes show incomplete resolution for the crop accessions, partly due to the effect of heterozygous SNPs. Finally, results suggest that phylogenetic relationships are correlated with habitat, indicating the occurrence of geographical races within these groups, which is of practical importance for Solanum genome evolution studies. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.

  9. Genomic variation in CYP3A4: type, frequencies and potential implications for pharmacogenetic understanding.

    OpenAIRE

    Creemer, O.

    2012-01-01

    The human cytochrome P450 3A subfamily metabolises endogenous substances and approximately half of all currently available drugs. There is marked inter-individual variation in hepatic expression of the major adult isoform, CYP3A4; the genetic component of this variability is estimated at 60-90% and, as yet, remains largely uncharacterised. Elucidation of genetic factors determining CYP3A4 activity would permit personalised dose-adjustment in therapies with CYP3A4 drug substrates. CYP3A4 genom...

  10. SV2: accurate structural variation genotyping and de novo mutation detection from whole genomes.

    Science.gov (United States)

    Antaki, Danny; Brandler, William M; Sebat, Jonathan

    2018-05-15

    Structural variation (SV) detection from short-read whole genome sequencing is error prone, presenting significant challenges for population or family-based studies of disease. Here, we describe SV2, a machine-learning algorithm for genotyping deletions and duplications from paired-end sequencing data. SV2 can rapidly integrate variant calls from multiple structural variant discovery algorithms into a unified call set with high genotyping accuracy and capability to detect de novo mutations. SV2 is freely available on GitHub (https://github.com/dantaki/SV2). Contact: jsebat@ucsd.edu. Supplementary data are available at Bioinformatics online.

  11. Genomic dissection of variation in clutch size and egg mass in a wild great tit (Parus major) population.

    Science.gov (United States)

    Santure, Anna W; De Cauwer, Isabelle; Robinson, Matthew R; Poissant, Jocelyn; Sheldon, Ben C; Slate, Jon

    2013-08-01

    Clutch size and egg mass are life history traits that have been extensively studied in wild bird populations, as life history theory predicts a negative trade-off between them, either at the phenotypic or at the genetic level. Here, we analyse the genomic architecture of these heritable traits in a wild great tit (Parus major) population, using three marker-based approaches - chromosome partitioning, quantitative trait locus (QTL) mapping and a genome-wide association study (GWAS). The variance explained by each great tit chromosome scales with predicted chromosome size, no location in the genome contains genome-wide significant QTL, and no individual SNPs are associated with a large proportion of phenotypic variation, all of which may suggest that variation in both traits is due to many loci of small effect, located across the genome. There is no evidence that any regions of the genome contribute significantly to both traits, which combined with a small, nonsignificant negative genetic covariance between the traits, suggests the absence of genetic constraints on the independent evolution of these traits. Our findings support the hypothesis that variation in life history traits in natural populations is likely to be determined by many loci of small effect spread throughout the genome, which are subject to continued input of variation by mutation and migration, although we cannot exclude the possibility of an additional input of major effect genes influencing either trait. © 2013 John Wiley & Sons Ltd.

  12. Orion: Detecting regions of the human non-coding genome that are intolerant to variation using population genetics.

    Science.gov (United States)

    Gussow, Ayal B; Copeland, Brett R; Dhindsa, Ryan S; Wang, Quanli; Petrovski, Slavé; Majoros, William H; Allen, Andrew S; Goldstein, David B

    2017-01-01

    There is broad agreement that genetic mutations occurring outside of the protein-coding regions play a key role in human disease. Despite this consensus, we are not yet capable of discerning which portions of non-coding sequence are important in the context of human disease. Here, we present Orion, an approach that detects regions of the non-coding genome that are depleted of variation, suggesting that the regions are intolerant of mutations and subject to purifying selection in the human lineage. We show that Orion is highly correlated with known intolerant regions as well as regions that harbor putatively pathogenic variation. This approach provides a mechanism to identify pathogenic variation in the human non-coding genome and will have immediate utility in the diagnostic interpretation of patient genomes and in large case control studies using whole-genome sequences.

  13. Genomic and chromatin signals underlying transcription start-site selection

    DEFF Research Database (Denmark)

    Valen, Eivind; Sandelin, Albin Gustav

    2011-01-01

    A central question in cellular biology is how the cell regulates transcription and discerns when and where to initiate it. Locating transcription start sites (TSSs), the signals that specify them, and ultimately elucidating the mechanisms of regulated initiation has therefore been a recurrent theme....... In recent years substantial progress has been made towards this goal, spurred by the possibility of applying genome-wide, sequencing-based analysis. We now have a large collection of high-resolution datasets identifying locations of TSSs, protein-DNA interactions, and chromatin features over whole genomes...

  14. Natural variation in rosette size under salt stress conditions corresponds to developmental differences between Arabidopsis accessions and allelic variation in the LRR-KISS gene

    KAUST Repository

    Julkowska, Magdalena

    2016-02-11

    Natural variation among Arabidopsis accessions is an important genetic resource to identify mechanisms underlying plant development and stress tolerance. To evaluate the natural variation in salinity stress tolerance, two large-scale experiments were performed on two populations consisting of 160 Arabidopsis accessions each. Multiple traits, including projected rosette area, and fresh and dry weight were collected as an estimate for salinity tolerance. Our results reveal a correlation between rosette size under salt stress conditions and developmental differences between the accessions grown in control conditions, suggesting that in general larger plants were more salt tolerant. This correlation was less pronounced when plants were grown under severe salt stress conditions. Subsequent genome wide association study (GWAS) revealed associations with novel candidate genes for salinity tolerance such as LRR-KISS (At4g08850), flowering locus KH-domain containing protein and a DUF1639-containing protein. Accessions with high LRR-KISS expression developed larger rosettes under salt stress conditions. Further characterization of allelic variation in candidate genes identified in this study will provide more insight into mechanisms of salt stress tolerance due to enhanced shoot growth.

  15. Complete chloroplast genomes from apomictic Taraxacum (Asteraceae): Identity and variation between three microspecies

    Science.gov (United States)

    Majeský, Ľuboš; Schwarzacher, Trude; Gornall, Richard; Heslop-Harrison, Pat

    2017-01-01

    Chloroplast DNA sequences show substantial variation between higher plant species, and less variation within species, so are typically excellent markers to investigate evolutionary, population and genetic relationships and phylogenies. We sequenced the plastomes of Taraxacum obtusifrons Markl. (O978); T. stridulum Trávniček ined. (S3); and T. amplum Markl. (A978), three apomictic triploid (2n = 3x = 24) dandelions from the T. officinale agg. We aimed to characterize the variation in plastomes, define relationships and correlations with the apomictic microspecies status, and refine placement of the microspecies in the evolutionary or phylogenetic context of the Asteraceae. The chloroplast genomes of accessions O978 and S3 were identical and 151,322 bp long (where the nuclear genes are known to show variation), while A978 was 151,349 bp long. All three genomes contained 135 unique genes, with an additional copy of the trnF-GGA gene in the LSC region and 20 duplicated genes in the IR region, along with short repeats, the typical major Inverted Repeats (IR1 and IR2, 24,431 bp long), and Large and Small Single Copy regions (LSC 83,889 bp and SSC 18,571 bp in O978). Between the two Taraxacum plastome types, we identified 28 SNPs. The distribution of polymorphisms suggests some parts of the Taraxacum plastome are evolving at a slower rate. There was a hemi-nested inversion in the LSC region that is common to Asteraceae, and an SSC inversion from ndhF to rps15 found only in some Asteraceae lineages. A comparative repeat analysis showed variation between Taraxacum and the phylogenetically close genus Lactuca, with many more direct repeats of 40 bp or more in Lactuca (1% larger plastome than Taraxacum). When individual genes and non-coding regions were used for Asteraceae phylogeny reconstruction, not all showed the same evolutionary scenario, suggesting care is needed for interpretation of relationships if a limited number of markers are used. Studying genotypic diversity in

  16. EVA: Exome Variation Analyzer, an efficient and versatile tool for filtering strategies in medical genomics

    Directory of Open Access Journals (Sweden)

    Coutant Sophie

    2012-09-01

    Full Text Available Abstract Background: Whole exome sequencing (WES) has become the strategy of choice to identify a coding allelic variant for a rare human monogenic disorder. This approach is a revolution in medical genetics history, impacting both fundamental research and diagnostic methods leading to personalized medicine. A plethora of efficient algorithms has been developed to ensure variant discovery. They generally lead to ~20,000 variations that have to be narrowed down to find the potential pathogenic allelic variant(s) and the affected gene(s). For this purpose, commonly adopted procedures involving various filtering strategies have emerged: exclusion of common variations, type of allelic variants, pathogenicity effect prediction, modes of inheritance and multiple individuals for exome comparison. To deal with the expansion of WES in individual medical genomics laboratories, new user-friendly and versatile software tools have to implement these filtering steps. Non-programmer biologists must be able to combine different filtering criteria autonomously and pursue a personal strategy depending on their assumptions and study design. Results: We describe EVA (Exome Variation Analyzer), a user-friendly web-interfaced software dedicated to the filtering strategies for medical WES. Thanks to different modules, EVA (i) integrates and stores annotated exome variation data as strictly confidential to the project owner, (ii) allows the main filters dealing with common variations, molecular types, inheritance mode and multiple samples to be combined, (iii) offers the browsing of annotated data and filtered results in various interactive tables, graphical visualizations and statistical charts, and (iv) finally offers export files and cross-links to useful external databases and software for further prioritization of the small subset of sorted candidate variations and genes. We report a demonstrative case study that allowed the identification of a new candidate gene

  17. Whole-genome sequencing reveals mutational landscape underlying phenotypic differences between two widespread Chinese cattle breeds

    OpenAIRE

    Xu, Yao; Jiang, Yu; Shi, Tao; Cai, Hanfang; Lan, Xianyong; Zhao, Xin; Plath, Martin; Chen, Hong

    2017-01-01

    Whole-genome sequencing provides a powerful tool to obtain more genetic variability that could produce a range of benefits for cattle breeding industry. Nanyang (Bos indicus) and Qinchuan (Bos taurus) are two important Chinese indigenous cattle breeds with distinct phenotypes. To identify the genetic characteristics responsible for variation in phenotypes between the two breeds, in the present study, we for the first time sequenced the genomes of four Nanyang and four Qinchuan cattle with 10 ...

  18. Metabolome-genome-wide association study dissects genetic architecture for generating natural variation in rice secondary metabolism

    Science.gov (United States)

    Matsuda, Fumio; Nakabayashi, Ryo; Yang, Zhigang; Okazaki, Yozo; Yonemaru, Jun-ichi; Ebana, Kaworu; Yano, Masahiro; Saito, Kazuki

    2015-01-01

    Plants produce structurally diverse secondary (specialized) metabolites to increase their fitness for survival under adverse environments. Several bioactive compounds for new drugs have been identified through screening of plant extracts. In this study, genome-wide association studies (GWAS) were conducted to investigate the genetic architecture behind the natural variation of rice secondary metabolites. GWAS using the metabolome data of 175 rice accessions successfully identified 323 associations among 143 single nucleotide polymorphisms (SNPs) and 89 metabolites. The data analysis highlighted that levels of many metabolites are tightly associated with a small number of strong quantitative trait loci (QTLs). The tight association may be a mechanism generating strains with distinct metabolic composition through the crossing of two different strains. The results indicate that one plant species produces more diverse phytochemicals than previously expected, and plants still contain many useful compounds for human applications. PMID:25267402

  19. Analyzing the genomic variation of microbial cell factories in the era of “New Biotechnology”

    DEFF Research Database (Denmark)

    Herrgard, Markus; Panagiotou, Gianni

    2012-01-01

    The application of genome-scale technologies, both experimental and in silico, to industrial biotechnology has allowed improving the conversion of biomass-derived feedstocks to chemicals, materials and fuels through microbial fermentation. In particular, due to rapidly decreasing costs and its...... technologies for finding the underlying molecular mechanisms for (a) improved carbon source utilization, (b) increased product formation, and (c) stress tolerance. We also discuss the strengths and weaknesses of different strategies for mapping industrially relevant genotype-to-phenotype links including...

  20. Landscape genomic prediction for restoration of a Eucalyptus foundation species under climate change.

    Science.gov (United States)

    Supple, Megan Ann; Bragg, Jason G; Broadhurst, Linda M; Nicotra, Adrienne B; Byrne, Margaret; Andrew, Rose L; Widdup, Abigail; Aitken, Nicola C; Borevitz, Justin O

    2018-04-24

    As species face rapid environmental change, we can build resilient populations through restoration projects that incorporate predicted future climates into seed sourcing decisions. Eucalyptus melliodora is a foundation species of a critically endangered community in Australia that is a target for restoration. We examined genomic and phenotypic variation to make empirically based recommendations for seed sourcing. We examined isolation by distance and isolation by environment, determining high levels of gene flow extending for 500 km and correlations with climate and soil variables. Growth experiments revealed extensive phenotypic variation both within and among sampling sites, but no site-specific differentiation in phenotypic plasticity. Model predictions suggest that seed can be sourced broadly across the landscape, providing ample diversity for adaptation to environmental change. Application of our landscape genomic model to E. melliodora restoration projects can identify genomic variation suitable for predicted future climates, thereby increasing the long term probability of successful restoration. © 2018, Supple et al.

  1. A genome wide survey of SNP variation reveals the genetic structure of sheep breeds.

    Directory of Open Access Journals (Sweden)

    James W Kijas

    Full Text Available The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identifying the first genome-wide set of SNP for sheep, we report on levels of genetic variability both within and between a diverse sample of ovine populations. Then, using cluster analysis and the partitioning of genetic variation, we demonstrate sheep are characterised by weak phylogeographic structure, overlapping genetic similarity and generally low differentiation which is consistent with their short evolutionary history. The degree of population substructure was, however, sufficient to cluster individuals based on geographic origin and known breed history. Specifically, African and Asian populations clustered separately from breeds of European origin sampled from Australia, New Zealand, Europe and North America. Furthermore, we demonstrate the presence of stratification within some, but not all, ovine breeds. The results emphasize that careful documentation of genetic structure will be an essential prerequisite when mapping the genetic basis of complex traits. Furthermore, the identification of a subset of SNP able to assign individuals into broad groupings demonstrates even a small panel of markers may be suitable for applications such as traceability.

  2. A Bayesian method and its variational approximation for prediction of genomic breeding values in multiple traits

    Directory of Open Access Journals (Sweden)

    Hayashi Takeshi

    2013-01-01

    Full Text Available Abstract Background: Genomic selection is an effective tool for animal and plant breeding, allowing effective individual selection without phenotypic records through the prediction of genomic breeding value (GBV). To date, genomic selection has focused on a single trait. However, actual breeding often targets multiple correlated traits, and, therefore, joint analysis taking into consideration the correlation between traits, which might result in more accurate GBV prediction than analyzing each trait separately, is suitable for multi-trait genomic selection. This would require an extension of the prediction model for single-trait GBV to multi-trait case. As the computational burden of multi-trait analysis is even higher than that of single-trait analysis, an effective computational method for constructing a multi-trait prediction model is also needed. Results: We described a Bayesian regression model incorporating variable selection for jointly predicting GBVs of multiple traits and devised both an MCMC iteration and variational approximation for Bayesian estimation of parameters in this multi-trait model. The proposed Bayesian procedures with MCMC iteration and variational approximation were referred to as MCBayes and varBayes, respectively. Using simulated datasets of SNP genotypes and phenotypes for three traits with high and low heritabilities, we compared the accuracy in predicting GBVs between multi-trait and single-trait analyses as well as between MCBayes and varBayes. The results showed that, compared to single-trait analysis, multi-trait analysis enabled much more accurate GBV prediction for low-heritability traits correlated with high-heritability traits, by utilizing the correlation structure between traits, while the prediction accuracy for uncorrelated low-heritability traits was comparable or less with multi-trait analysis in comparison with single-trait analysis depending on the setting for prior probability that a SNP has zero

  3. The analysis of APOL1 genetic variation and haplotype diversity provided by 1000 Genomes project.

    Science.gov (United States)

    Peng, Ting; Wang, Li; Li, Guisen

    2017-08-11

    APOL1 gene variants have been shown to be associated with an increased risk of multiple kinds of diseases, particularly in African Americans, but not in Caucasians and Asians. In this study, we explored the single nucleotide polymorphism (SNP) and haplotype diversity of the APOL1 gene in different races provided by the 1000 Genomes Project. Variants of the APOL1 gene in the 1000 Genomes Project were obtained, and SNPs located in the regulatory region or coding region were selected for genetic variation analysis. A total of 2504 individuals from 26 populations were classified into four groups: Africa, Europe, Asia and Admixed populations. Tag SNPs were selected to evaluate the haplotype diversities in the four populations with HaploStats software. The APOL1 gene was surrounded by some of the most polymorphic genes in the human genome, and variation of the APOL1 gene was common, with up to 613 SNPs (reported by the 1000 Genomes Project), 99 of them (16.2%) with MAF ≥ 1%. There were 79 SNPs in the URR and 92 SNPs in the 3'UTR. A total of 12 SNPs in the URR and 24 SNPs in the 3'UTR were considered common variants with MAF ≥ 1%. It is worth noting that URR-1 presented lower frequencies in European populations, while the other three haplotypes showed the opposite pattern; the 3'UTR presents several high-frequency variation sites in a short segment, and the differences in its haplotypes among populations were significant (P < 0.01): UTR-1 and UTR-5 presented much higher frequencies in the African population, while UTR-2, UTR-3 and UTR-4 were much lower. Analysis of the APOL1 coding region showed that the two higher-frequency SNPs of G1 actually pull down the haplotype H-1 frequency when all populations are pooled together, and the diversity among the four populations is widened by the two G1 mutations (P1 = 3.33E-4 vs P2 = 3.61E-30). The distributions of APOL1 gene variants and haplotypes were significantly different among the different populations, in either regulatory or coding regions. It could provide

  4. Genome-wide recombination dynamics are associated with phenotypic variation in maize.

    Science.gov (United States)

    Pan, Qingchun; Li, Lin; Yang, Xiaohong; Tong, Hao; Xu, Shutu; Li, Zhigang; Li, Weiya; Muehlbauer, Gary J; Li, Jiansheng; Yan, Jianbing

    2016-05-01

    Meiotic recombination is a major driver of genetic diversity, species evolution, and agricultural improvement. Thus, an understanding of the genetic recombination landscape across the maize (Zea mays) genome will provide insight and tools for further study of maize evolution and improvement. Here, we used c. 50 000 single nucleotide polymorphisms to precisely map recombination events in 12 artificial maize segregating populations. We observed substantial variation in the recombination frequency and distribution along the ten maize chromosomes among the 12 populations and identified 143 recombination hot regions. Recombination breakpoints were partitioned into intragenic and intergenic events. Interestingly, an increase in the number of genes containing recombination events was accompanied by a decrease in the number of recombination events per gene. This kept the overall number of intragenic recombination events nearly invariable in a given population, suggesting that the recombination variation observed among populations was largely attributed to intergenic recombination. However, significant associations between intragenic recombination events and variation in gene expression and agronomic traits were observed, suggesting potential roles for intragenic recombination in plant phenotypic diversity. Our results provide a comprehensive view of the maize recombination landscape, and show an association between recombination, gene expression and phenotypic variation, which may enhance crop genetic improvement. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  5. Empirical analysis of skin friction under variations of temperature

    International Nuclear Information System (INIS)

    Parra Alvarez, A. R. de la; Groot Viana, M. de

    2014-01-01

    In geotechnical soil characterization, the strength parameters cohesion (c) and internal friction angle (Φ) have traditionally been measured without taking temperature into account, although temperature is a very important issue in energy geostructures. The present document analyzes the variation of these parameters at the soil-concrete interface at different temperatures. A traditional shear strength test with a forced plane of failure was used. Several tests were carried out to determine the variation of skin friction with temperature in granular and cohesive soils. (Author)

  6. Genomic variation and its impact on gene expression in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Andreas Massouras

    Full Text Available Understanding the relationship between genetic and phenotypic variation is one of the great outstanding challenges in biology. To meet this challenge, comprehensive genomic variation maps of human as well as of model organism populations are required. Here, we present a nucleotide resolution catalog of single-nucleotide, multi-nucleotide, and structural variants in 39 Drosophila melanogaster Genetic Reference Panel inbred lines. Using an integrative, local assembly-based approach for variant discovery, we identify more than 3.6 million distinct variants, among which were more than 800,000 unique insertions, deletions (indels), and complex variants (1 to 6,000 bp). While the SNP density is higher near other variants, we find that variants themselves are not mutagenic, nor are regions with high variant density particularly mutation-prone. Rather, our data suggest that the elevated SNP density around variants is mainly due to population-level processes. We also provide insights into the regulatory architecture of gene expression variation in adult flies by mapping cis-expression quantitative trait loci (cis-eQTLs) for more than 2,000 genes. Indels comprise around 10% of all cis-eQTLs and show larger effects than SNP cis-eQTLs. In addition, we identified two-fold more gene associations in males as compared to females and found that most cis-eQTLs are sex-specific, revealing a partial decoupling of the genomic architecture between the sexes as well as the importance of genetic factors in mediating sex-biased gene expression. Finally, we performed RNA-seq-based allelic expression imbalance analyses in the offspring of crosses between sequenced lines, which revealed that the majority of strong cis-eQTLs can be validated in heterozygous individuals.

  7. DNA variation of the mammalian major histocompatibility complex reflects genomic diversity and population history

    International Nuclear Information System (INIS)

    Yuhki, Naoya; O'Brien, S.J.

    1990-01-01

    The major histocompatibility complex (MHC) is a multigene complex of tightly linked homologous genes that encode cell surface antigens that play a key role in immune regulation and response to foreign antigens. In most species, MHC gene products display extreme antigenic polymorphism, and their variability has been interpreted to reflect an adaptive strategy for accommodating rapidly evolving infectious agents that periodically afflict natural populations. Determination of the extent of MHC variation has been limited to populations in which skin grafting is feasible or for which serological reagents have been developed. The authors present here a quantitative analysis of restriction fragment length polymorphism of MHC class I genes in several mammalian species (cats, rodents, humans) known to have very different levels of genetic diversity based on functional MHC assays and on allozyme surveys. When homologous class I probes were employed, a notable concordance was observed between the extent of MHC restriction fragment variation and functional MHC variation detected by skin grafts or genome-wide diversity estimated by allozyme screens. These results confirm the genetically depauperate character of the African cheetah, Acinonyx jubatus, and the Asiatic lion, Panthera leo persica; further, they support the use of class I MHC molecular reagents in estimating the extent and character of genetic diversity in natural populations

  8. DNA variation of the mammalian major histocompatibility complex reflects genomic diversity and population history

    Energy Technology Data Exchange (ETDEWEB)

    Yuhki, Naoya; O'Brien, S.J. (National Cancer Institute, Frederick, MD (USA))

    1990-01-01

    The major histocompatibility complex (MHC) is a multigene complex of tightly linked homologous genes that encode cell surface antigens that play a key role in immune regulation and response to foreign antigens. In most species, MHC gene products display extreme antigenic polymorphism, and their variability has been interpreted to reflect an adaptive strategy for accommodating rapidly evolving infectious agents that periodically afflict natural populations. Determination of the extent of MHC variation has been limited to populations in which skin grafting is feasible or for which serological reagents have been developed. The authors present here a quantitative analysis of restriction fragment length polymorphism of MHC class I genes in several mammalian species (cats, rodents, humans) known to have very different levels of genetic diversity based on functional MHC assays and on allozyme surveys. When homologous class I probes were employed, a notable concordance was observed between the extent of MHC restriction fragment variation and functional MHC variation detected by skin grafts or genome-wide diversity estimated by allozyme screens. These results confirm the genetically depauperate character of the African cheetah, Acinonyx jubatus, and the Asiatic lion, Panthera leo persica; further, they support the use of class I MHC molecular reagents in estimating the extent and character of genetic diversity in natural populations.

  9. Identification of genomic copy number variations associated with specific clinical features of head and neck cancer.

    Science.gov (United States)

    Zagradišnik, Boris; Krgović, Danijela; Herodež, Špela Stangler; Zagorac, Andreja; Ćižmarević, Bogdan; Vokač, Nadja Kokalj

    2018-01-01

    Copy number variations (CNVs) of large genomic regions are an important mechanism implicated in the development of head and neck cancer; however, for most changes their exact role is not well understood. The aim of this study was to find possible associations between gains/losses of genomic regions and clinically distinct subgroups of head and neck cancer patients. Array comparative genomic hybridization (aCGH) analysis was performed on DNA samples from 64 patients with cancer of the oral cavity, oropharynx or hypopharynx. Overlapping genomic regions created from gains and losses were used for statistical analysis. The following regions were overrepresented: in tumors with stage I or II, a gain of 2.98 Mb on 6p21.2-p11 and a gain of 7.4 Mb on 8q11.1-q11.23; in tumors with grade I histology, a gain of 1.1 Mb on 8q24.13, a loss of a large part of the p arm of chromosome 3, a loss of 1.24 Mb on 6q14.3, and a loss of the terminal 32 Mb region of 8p23.3; in cases with affected lymph nodes, a gain of 0.75 Mb on 3q24 and a gain of 0.9 Mb on 3q26.32-q26.33; in cases with unaffected lymph nodes, a gain of 1.1 Mb on 8q23.3; in patients not treated with surgery, a gain of 12.2 Mb on 7q21.3-q22.3 and a gain of 0.33 Mb on 20q11.22. Our study identified several genomic regions of interest which appear to be associated with various clinically distinct subgroups of head and neck cancer. They represent a potentially important source of biomarkers useful for the clinical management of head and neck cancer. In particular, the PIK3CA and AGTR1 genes could be singled out to predict lymph node involvement.

  10. Genetic variation and plasticity of Plantago coronopus under saline conditions

    NARCIS (Netherlands)

    Smekens, Marret; Van Tienderen, P.H.

    2001-01-01

    Phenotypic plasticity may allow organisms to cope with variation in the environmental conditions they encounter in their natural habitats. Salt adaptation appears to be an excellent example of such a plastic response. Many plant species accumulate organic solutes in response to saline conditions.

  11. 3D Facial Landmarking under Expression, Pose, and Occlusion Variations

    NARCIS (Netherlands)

    H. Dibeklioğlu; A.A. Salah (Albert Ali); L. Akarun

    2008-01-01

    Automatic localization of 3D facial features is important for face recognition, tracking, modeling and expression analysis. Methods developed for 2D images were shown to have problems working across databases acquired with different illumination conditions. Expression variations, pose

  12. Spiral phyllotaxis underlies constrained variation in Anemone (Ranunculaceae) tepal arrangement.

    Science.gov (United States)

    Kitazawa, Miho S; Fujimoto, Koichi

    2018-05-01

    Stabilization and variation of floral structures are indispensable for plant reproduction and evolution; however, the developmental mechanism regulating their structural robustness is largely unknown. To investigate this mechanism, we examined positional arrangement (aestivation) of excessively produced perianth organs (tepals) of six- and seven-tepaled (lobed) flowers in six Anemone species (Ranunculaceae). We found that the tepal arrangement that occurred in nature varied intraspecifically between spiral and whorled arrangements. Moreover, among the studied species, variation was commonly limited to three types, including whorls, despite five geometrically possible arrangements in six-tepaled flowers and two types among six possibilities in seven-tepaled flowers. A spiral arrangement, on the other hand, was unique to five-tepaled flowers. A spiral phyllotaxis model with stochasticity on initiating excessive primordia accounted for these limited variations in arrangement in cases when the divergence angle between preexisting primordia was less than 144°. Moreover, interspecific differences in the frequency of the observed arrangements were explained by the change of model parameters that represent meristematic growth and differential organ growth. These findings suggest that the phyllotaxis parameters are responsible not only for intraspecific stability but also for interspecific differences in floral structure. The decrease in the number of arrangements from six-tepaled to seven-tepaled Anemone flowers demonstrates that stabilization occurs as development proceeds to increase the component (organ) number, in contrast to the intuition that the variation will be larger due to the increasing number of possible states (arrangements).

  13. Genomic Analysis of Hepatitis B Virus Reveals Antigen State and Genotype as Sources of Evolutionary Rate Variation

    Science.gov (United States)

    Harrison, Abby; Lemey, Philippe; Hurles, Matthew; Moyes, Chris; Horn, Susanne; Pryor, Jan; Malani, Joji; Supuri, Mathias; Masta, Andrew; Teriboriki, Burentau; Toatu, Tebuka; Penny, David; Rambaut, Andrew; Shapiro, Beth

    2011-01-01

    Hepatitis B virus (HBV) genomes are small, semi-double-stranded DNA circular genomes that contain alternating overlapping reading frames and replicate through an RNA intermediary phase. This complex biology has presented a challenge to estimating an evolutionary rate for HBV, leading to difficulties resolving the evolutionary and epidemiological history of the virus. Here, we re-examine rates of HBV evolution using a novel data set of 112 within-host, transmission history (pedigree) and among-host genomes isolated over 20 years from the indigenous peoples of the South Pacific, combined with 313 previously published HBV genomes. We employ Bayesian phylogenetic approaches to examine several potential causes and consequences of evolutionary rate variation in HBV. Our results reveal rate variation both between genotypes and across the genome, as well as strikingly slower rates when genomes are sampled in the Hepatitis B e antigen positive state, compared to the e antigen negative state. This Hepatitis B e antigen rate variation was found to be largely attributable to changes during the course of infection in the preCore and Core genes and their regulatory elements. PMID:21765983

  14. Genetic Architecture of Natural Variation Underlying Adult Foraging Behavior That Is Essential for Survival of Drosophila melanogaster.

    Science.gov (United States)

    Lee, Yuh Chwen G; Yang, Qian; Chi, Wanhao; Turkson, Susie A; Du, Wei A; Kemkemer, Claus; Zeng, Zhao-Bang; Long, Manyuan; Zhuang, Xiaoxi

    2017-05-01

    Foraging behavior is critical for the fitness of individuals. However, the genetic basis of variation in foraging behavior and the evolutionary forces underlying such natural variation have rarely been investigated. We developed a systematic approach to assay the variation in survival rate in a foraging environment for adult flies derived from a wild Drosophila melanogaster population. Despite being such an essential trait, there is substantial variation of foraging behavior among D. melanogaster strains. Importantly, we provided the first evaluation of the potential caveats of using inbred Drosophila strains to perform genome-wide association studies on life-history traits, and concluded that inbreeding depression is unlikely to be a major contributor to the observed large variation in adult foraging behavior. We found that adult foraging behavior has a strong genetic component and, unlike larval foraging behavior, depends on multiple loci. Identified candidate genes are enriched in those with high expression in adult heads and, as demonstrated by expression knockdown assays, are involved in maintaining normal functions of the nervous system. Our study not only identified candidate genes for foraging behavior that is relevant to individual fitness, but also shed light on the initial stage underlying the evolution of the behavior. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  15. Plasticity of the Leishmania genome leading to gene copy number variations and drug resistance [version 1; referees: 5 approved]

    Directory of Open Access Journals (Sweden)

    Marie-Claude N. Laffitte

    2016-09-01

    Full Text Available Leishmania has a plastic genome, and drug pressure can select for gene copy number variation (CNV). CNVs can apply either to whole chromosomes, leading to aneuploidy, or to specific genomic regions. For the latter, the amplification of chromosomal regions occurs at the level of homologous direct or inverted repeated sequences leading to extrachromosomal circular or linear amplified DNAs. This ability of Leishmania to respond to drug pressure by CNVs has led to the development of genomic screens such as Cos-Seq, which has the potential of expediting the discovery of drug targets for novel promising drug candidates.

  16. Variation in the OC locus of Acinetobacter baumannii genomes predicts extensive structural diversity in the lipooligosaccharide.

    Directory of Open Access Journals (Sweden)

    Johanna J Kenyon

    Full Text Available Lipooligosaccharide (LOS) is a complex surface structure that is linked to many pathogenic properties of Acinetobacter baumannii. In A. baumannii, the genes responsible for the synthesis of the outer core (OC) component of the LOS are located between ilvE and aspS. The content of the OC locus is usually variable within a species, and examination of 6 complete and 227 draft A. baumannii genome sequences available in GenBank non-redundant and Whole Genome Shotgun databases revealed nine distinct new types, OCL4-OCL12, in addition to the three known ones. The twelve gene clusters fell into two distinct groups, designated Group A and Group B, based on similarities in the genes present. OCL6 (Group B) was unique in that it included genes for the synthesis of L-Rhamnosep. Genetic exchange of the different configurations between strains has occurred as some OC forms were found in several different sequence types (STs). OCL1 (Group A) was the most widely distributed being present in 18 STs, and OCL6 was found in 16 STs. Variation within clones was also observed, with more than one OC locus type found in the two globally disseminated clones, GC1 and GC2, that include the majority of multiply antibiotic resistant isolates. OCL1 was the most abundant gene cluster in both GC1 and GC2 genomes but GC1 isolates also carried OCL2, OCL3 or OCL5, and OCL3 was also present in GC2. As replacement of the OC locus in the major global clones indicates the presence of sub-lineages, a PCR typing scheme was developed to rapidly distinguish Group A and Group B types, and to distinguish the specific forms found in GC1 and GC2 isolates.

  17. Genome-wide copy number variation (CNV) in patients with autoimmune Addison's disease

    Directory of Open Access Journals (Sweden)

    Brønstad Ingeborg

    2011-08-01

    Full Text Available Abstract Background Addison's disease (AD) is caused by an autoimmune destruction of the adrenal cortex. The pathogenesis is multi-factorial, involving genetic components and hitherto unknown environmental factors. The aim of the present study was to investigate if gene dosage in the form of copy number variation (CNV) could add to the repertoire of genetic susceptibility to autoimmune AD. Methods A genome-wide study using the Affymetrix GeneChip® Genome-Wide Human SNP Array 6.0 was conducted in 26 patients with AD. CNVs in selected genes were further investigated in a larger material of patients with autoimmune AD (n = 352) and healthy controls (n = 353) by duplex Taqman real-time polymerase chain reaction assays. Results We found that low copy number of UGT2B28 was significantly more frequent in AD patients compared to controls; conversely high copy number of ADAM3A was associated with AD. Conclusions We have identified two novel CNV associations to ADAM3A and UGT2B28 in AD. The mechanism by which this susceptibility is conferred is at present unclear, but may involve steroid inactivation (UGT2B28) and T cell maturation (ADAM3A). Characterization of these proteins may unravel novel information on the pathogenesis of autoimmunity.

  18. Distinct Contributions of Replication and Transcription to Mutation Rate Variation of Human Genomes

    KAUST Repository

    Cui, Peng; Ding, Feng; Lin, Qiang; Zhang, Lingfang; Li, Ang; Zhang, Zhang; Hu, Songnian; Yu, Jun

    2012-01-01

    Here, we evaluate the contribution of two major biological processes—DNA replication and transcription—to mutation rate variation in human genomes. Based on analysis of the public human tissue transcriptomics data, high-resolution replicating map of Hela cells and dbSNP data, we present significant correlations between expression breadth, replication time in local regions and SNP density. SNP density of tissue-specific (TS) genes is significantly higher than that of housekeeping (HK) genes. TS genes tend to locate in late-replicating genomic regions and genes in such regions have a higher SNP density compared to those in early-replication regions. In addition, SNP density is found to be positively correlated with expression level among HK genes. We conclude that the process of DNA replication generates stronger mutational pressure than transcription-associated biological processes do, resulting in an increase of mutation rate in TS genes while having weaker effects on HK genes. In contrast, transcription-associated processes are mainly responsible for the accumulation of mutations in highly-expressed HK genes.

  19. Genome-wide copy number variation (CNV) in patients with autoimmune Addison's disease

    Science.gov (United States)

    2011-01-01

    Background Addison's disease (AD) is caused by an autoimmune destruction of the adrenal cortex. The pathogenesis is multi-factorial, involving genetic components and hitherto unknown environmental factors. The aim of the present study was to investigate if gene dosage in the form of copy number variation (CNV) could add to the repertoire of genetic susceptibility to autoimmune AD. Methods A genome-wide study using the Affymetrix GeneChip® Genome-Wide Human SNP Array 6.0 was conducted in 26 patients with AD. CNVs in selected genes were further investigated in a larger material of patients with autoimmune AD (n = 352) and healthy controls (n = 353) by duplex Taqman real-time polymerase chain reaction assays. Results We found that low copy number of UGT2B28 was significantly more frequent in AD patients compared to controls; conversely high copy number of ADAM3A was associated with AD. Conclusions We have identified two novel CNV associations to ADAM3A and UGT2B28 in AD. The mechanism by which this susceptibility is conferred is at present unclear, but may involve steroid inactivation (UGT2B28) and T cell maturation (ADAM3A). Characterization of these proteins may unravel novel information on the pathogenesis of autoimmunity. PMID:21851588

  20. A genome-wide association study of copy number variations with umbilical hernia in swine.

    Science.gov (United States)

    Long, Yi; Su, Ying; Ai, Huashui; Zhang, Zhiyan; Yang, Bin; Ruan, Guorong; Xiao, Shijun; Liao, Xinjun; Ren, Jun; Huang, Lusheng; Ding, Nengshui

    2016-06-01

    Umbilical hernia (UH) is one of the most common congenital defects in pigs, leading to considerable economic loss and serious animal welfare problems. To test whether copy number variations (CNVs) contribute to pig UH, we performed a case-control genome-wide CNV association study on 905 pigs from the Duroc, Landrace and Yorkshire breeds using the Porcine SNP60 BeadChip and penncnv algorithm. We first constructed a genomic map comprising 6193 CNVs that pertain to 737 CNV regions. Then, we identified eight CNVs significantly associated with the risk for UH in the three pig breeds. Six of seven significantly associated CNVs were validated using quantitative real-time PCR. Notably, a rare CNV (CNV14:13030843-13059455) encompassing the NUGGC gene was strongly associated with UH (permutation-corrected P = 0.0015) in Duroc pigs. This CNV occurred exclusively in seven Duroc UH-affected individuals. SNPs surrounding the CNV did not show association signals, indicating that rare CNVs may play an important role in complex pig diseases such as UH. The NUGGC gene has been implicated in human omphalocele and inguinal hernia. Our finding supports that CNVs, including the NUGGC CNV, contribute to the pathogenesis of pig UH. © 2016 Stichting International Foundation for Animal Genetics.

  1. Analysis of Genetic Variation across the Encapsidated Genome of Microplitis demolitor Bracovirus in Parasitoid Wasps.

    Directory of Open Access Journals (Sweden)

    Gaelen R Burke

    Full Text Available Insect parasitoids must complete part of their life cycle within or on another insect, ultimately resulting in the death of the host insect. One group of parasitoid wasps, the 'microgastroid complex' (Hymenoptera: Braconidae), engage in an association with beneficial symbiotic viruses that are essential for successful parasitism of hosts. These viruses, known as Bracoviruses, persist in an integrated form in the wasp genome, and activate to replicate in wasp ovaries during development to ultimately be delivered into host insects during parasitism. The lethal nature of host-parasitoid interactions, combined with the involvement of viruses in mediating these interactions, has led to the hypothesis that Bracoviruses are engaged in an arms race with hosts, resulting in recurrent adaptation in viral (and host) genes. Deep sequencing was employed to characterize sequence variation across the encapsidated Bracovirus genome within laboratory and field populations of the parasitoid wasp species Microplitis demolitor. Contrary to expectations, there was a paucity of evidence for positive directional selection among virulence genes, which generally exhibited signatures of purifying selection. These data suggest that the dynamics of host-parasite interactions may not result in recurrent rounds of adaptation, and that adaptation may be more variable in time than previously expected.

  2. Structural variation discovery in the cancer genome using next generation sequencing: Computational solutions and perspectives

    Science.gov (United States)

    Liu, Biao; Conroy, Jeffrey M.; Morrison, Carl D.; Odunsi, Adekunle O.; Qin, Maochun; Wei, Lei; Trump, Donald L.; Johnson, Candace S.; Liu, Song; Wang, Jianmin

    2015-01-01

    Somatic Structural Variations (SVs) are a complex collection of chromosomal mutations that could directly contribute to carcinogenesis. Next Generation Sequencing (NGS) technology has emerged as the primary means of interrogating the SVs of the cancer genome in recent investigations. Sophisticated computational methods are required to accurately identify the SV events and delineate their breakpoints from the massive amounts of reads generated by a NGS experiment. In this review, we provide an overview of current analytic tools used for SV detection in NGS-based cancer studies. We summarize the features of common SV groups and the primary types of NGS signatures that can be used in SV detection methods. We discuss the principles and key similarities and differences of existing computational programs and comment on unresolved issues related to this research field. The aim of this article is to provide a practical guide of relevant concepts, computational methods, software tools and important factors for analyzing and interpreting NGS data for the detection of SVs in the cancer genome. PMID:25849937

  3. Distinct Contributions of Replication and Transcription to Mutation Rate Variation of Human Genomes

    KAUST Repository

    Cui, Peng

    2012-03-23

    Here, we evaluate the contribution of two major biological processes—DNA replication and transcription—to mutation rate variation in human genomes. Based on analysis of the public human tissue transcriptomics data, high-resolution replicating map of Hela cells and dbSNP data, we present significant correlations between expression breadth, replication time in local regions and SNP density. SNP density of tissue-specific (TS) genes is significantly higher than that of housekeeping (HK) genes. TS genes tend to locate in late-replicating genomic regions and genes in such regions have a higher SNP density compared to those in early-replication regions. In addition, SNP density is found to be positively correlated with expression level among HK genes. We conclude that the process of DNA replication generates stronger mutational pressure than transcription-associated biological processes do, resulting in an increase of mutation rate in TS genes while having weaker effects on HK genes. In contrast, transcription-associated processes are mainly responsible for the accumulation of mutations in highly-expressed HK genes.

  4. Approximating the edit distance for genomes with duplicate genes under DCJ, insertion and deletion

    Directory of Open Access Journals (Sweden)

    Shao Mingfu

    2012-12-01

    Full Text Available Abstract Computing the edit distance between two genomes under certain operations is a basic problem in the study of genome evolution. The double-cut-and-join (DCJ) model has formed the basis for most algorithmic research on rearrangements over the last few years. The edit distance under the DCJ model can be easily computed for genomes without duplicate genes. In this paper, we study the edit distance for genomes with duplicate genes under a model that includes DCJ operations, insertions and deletions. We prove that computing the edit distance is equivalent to finding the optimal cycle decomposition of the corresponding adjacency graph, and give an approximation algorithm with an approximation ratio of 1.5 + ε.
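
    As an illustration of the easy case mentioned in this abstract (genomes without duplicate genes), the sketch below counts cycles of the adjacency graph and applies the standard formula d = N - C for genomes made only of circular chromosomes with identical gene content. It is not the paper's approximation algorithm and does not handle duplicates, insertions or deletions; the function names and the signed-integer encoding of genes are assumptions made for this example.

```python
def extremities(gene):
    """Return (left, right) extremities of a signed gene as read along the chromosome."""
    g = abs(gene)
    # A forward gene is read tail -> head, a reversed gene head -> tail.
    return ((g, "t"), (g, "h")) if gene > 0 else ((g, "h"), (g, "t"))


def adjacencies(genome):
    """List adjacencies (pairs of extremities) of a genome of circular chromosomes."""
    pairs = []
    for chrom in genome:
        n = len(chrom)
        for i in range(n):
            _, right = extremities(chrom[i])
            left, _ = extremities(chrom[(i + 1) % n])  # wrap around: circular
            pairs.append((right, left))
    return pairs


def dcj_distance_circular(a, b):
    """DCJ distance d = N - C for circular genomes without duplicate genes."""
    genes_a = sorted(abs(g) for chrom in a for g in chrom)
    genes_b = sorted(abs(g) for chrom in b for g in chrom)
    if genes_a != genes_b or len(set(genes_a)) != len(genes_a):
        raise ValueError("genomes must share the same genes, each occurring once")

    # Union-find over extremities: each connected component of the union of
    # both genomes' adjacencies corresponds to one cycle of the adjacency graph.
    parent = {(g, end): (g, end) for g in genes_a for end in ("t", "h")}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for u, v in adjacencies(a) + adjacencies(b):
        parent[find(u)] = find(v)

    cycles = len({find(x) for x in parent})
    return len(genes_a) - cycles


if __name__ == "__main__":
    # One inversion of gene 2 separates these two circular genomes.
    print(dcj_distance_circular([[1, 2, 3]], [[1, -2, 3]]))
```

    Linear chromosomes would additionally require counting odd paths (d = N - (C + I/2)), which this sketch deliberately leaves out.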

  5. Detecting single DNA copy number variations in complex genomes using one nanogram of starting DNA and BAC-array CGH.

    Science.gov (United States)

    Guillaud-Bataille, Marine; Valent, Alexander; Soularue, Pascal; Perot, Christine; Inda, Maria Mar; Receveur, Aline; Smaïli, Sadek; Roest Crollius, Hugues; Bénard, Jean; Bernheim, Alain; Gidrol, Xavier; Danglot, Gisèle

    2004-07-29

    Comparative genomic hybridization to bacterial artificial chromosome (BAC)-arrays (array-CGH) is a highly efficient technique, allowing the simultaneous measurement of genomic DNA copy number at hundreds or thousands of loci, and the reliable detection of local one-copy-level variations. We report a genome-wide amplification method allowing the same measurement sensitivity, using 1 ng of starting genomic DNA, instead of the classical 1 µg usually necessary. Using a discrete series of DNA fragments, we defined the parameters adapted to the most faithful ligation-mediated PCR amplification and the limits of the technique. The optimized protocol allows a 3000-fold DNA amplification, retaining the quantitative characteristics of the initial genome. Validation of the amplification procedure, using DNA from 10 tumour cell lines hybridized to BAC-arrays of 1500 spots, showed almost perfectly superimposed ratios for the non-amplified and amplified DNAs. Correlation coefficients of 0.96 and 0.99 were observed for regions of low-copy-level variations and all regions, respectively (including in vivo amplified oncogenes). Finally, labelling DNA using two nucleotides bearing the same fluorophore led to a significant increase in reproducibility and to the correct detection of one-copy gain or loss in >90% of the analysed data, even for pseudotriploid tumour genomes.
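
    The validation described in this abstract compares array ratios from non-amplified and amplified DNA through correlation coefficients. The sketch below shows one way such a comparison could be computed, assuming a Pearson coefficient on per-spot log2 ratios (the abstract does not state which coefficient was used). The function names and the ratio values are hypothetical and purely illustrative; they are not data from the study.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def amplified_mass_ng(start_ng, fold):
    """DNA mass after amplification, e.g. 1 ng at 3000-fold gives 3000 ng (3 µg)."""
    return start_ng * fold


if __name__ == "__main__":
    # Hypothetical per-spot log2 ratios (made up for illustration only).
    non_amplified = [0.00, 0.58, -1.00, 0.05, 1.90, -0.02]
    amplified = [0.03, 0.55, -0.92, 0.01, 1.75, 0.04]
    print(f"r = {pearson(non_amplified, amplified):.3f}")
    print(f"yield = {amplified_mass_ng(1.0, 3000) / 1000:.1f} µg")
```

    A coefficient close to 1 indicates that the amplified sample reproduces the copy-number profile of the non-amplified one.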

  6. Natural Selection and Recombination Rate Variation Shape Nucleotide Polymorphism Across the Genomes of Three Related Populus Species.

    Science.gov (United States)

    Wang, Jing; Street, Nathaniel R; Scofield, Douglas G; Ingvarsson, Pär K

    2016-03-01

    A central aim of evolutionary genomics is to identify the relative roles that various evolutionary forces have played in generating and shaping genetic variation within and among species. Here we use whole-genome resequencing data to characterize and compare genome-wide patterns of nucleotide polymorphism, site frequency spectrum, and population-scaled recombination rates in three species of Populus: Populus tremula, P. tremuloides, and P. trichocarpa. We find that P. tremuloides has the highest level of genome-wide variation, skewed allele frequencies, and population-scaled recombination rates, whereas P. trichocarpa harbors the lowest. Our findings highlight multiple lines of evidence suggesting that natural selection, due to both purifying and positive selection, has widely shaped patterns of nucleotide polymorphism at linked neutral sites in all three species. Differences in effective population sizes and rates of recombination largely explain the disparate magnitudes and signatures of linked selection that we observe among species. The present work provides the first phylogenetic comparative study on a genome-wide scale in forest trees. This information will also improve our ability to understand how various evolutionary forces have interacted to influence genome evolution among related species. Copyright © 2016 by the Genetics Society of America.

  7. Analysis of the genome-wide variations among multiple strains of the plant pathogenic bacterium Xylella fastidiosa

    Directory of Open Access Journals (Sweden)

    Walker M Andrew

    2006-09-01

    Full Text Available Background: The Gram-negative, xylem-limited phytopathogenic bacterium Xylella fastidiosa is responsible for causing economically important diseases in grapevine, citrus and many other plant species. Despite its economic impact, relatively little is known about the genomic variations among strains isolated from different hosts and their influence on the population genetics of this pathogen. With the availability of genome sequence information for four strains, it is now possible to perform genome-wide analyses to identify and categorize such DNA variations and to understand their influence on strain functional divergence. Results: There are 1,579 genes and 194 non-coding homologous sequences present in the genomes of all four strains, representing a 76.2% conservation of the sequenced genome. About 60% of the X. fastidiosa unique sequences exist as tandem gene clusters of 6 or more genes. Multiple alignments identified 12,754 SNPs and 14,449 INDELs in the 1528 common genes and 20,779 SNPs and 10,075 INDELs in the 194 non-coding sequences. The average SNP frequency was 1.08 × 10^-2 per base pair of DNA and the average INDEL frequency was 2.06 × 10^-2 per base pair of DNA. On average, 60.33% of the SNPs were synonymous while 39.67% were non-synonymous. The mutation frequency, primarily in the form of external INDELs, was the main type of sequence variation. The relative similarity between the strains was discussed according to the INDEL and SNP differences. The numbers of genes unique to each strain were 60 (9a5c), 54 (Dixon), 83 (Ann1) and 9 (Temecula-1). A sub-set of the strain specific genes showed significant differences in terms of their codon usage and GC composition from the native genes, suggesting their xenologous origin. Tandem repeat analysis of the genomic sequences of the four strains identified associations of repeat sequences with hypothetical and phage related functions. Conclusion: INDELs and strain specific genes

  8. Overlap in genomic variation associated with milk fat composition in Holstein Friesian and Dutch native dual-purpose breeds

    NARCIS (Netherlands)

    Maurice - Van Eijndhoven, M.H.T.; Bovenhuis, H.; Veerkamp, R.F.; Calus, M.P.L.

    2015-01-01

    The aim of this study was to identify if genomic variations associated with fatty acid (FA) composition are similar between the Holstein-Friesian (HF) and native dual-purpose breeds used in the Dutch dairy industry. Phenotypic and genotypic information were available for the breeds Meuse-Rhine-Yssel

  9. Genome-Wide Mapping of Structural Variations Reveals a Copy Number Variant That Determines Reproductive Morphology in Cucumber

    NARCIS (Netherlands)

    Zhang, Z.; Mao, L.; Chen, Junshi; Bu, F.; Li, G.; Sun, J.; Li, S.; Sun, H.; Jiao, C.; Blakely, R.; Pan, J.; Cai, R.; Luo, R.; Peer, Van de Y.; Jacobsen, E.; Fei, Z.; Huang, S.

    2015-01-01

    Structural variations (SVs) represent a major source of genetic diversity. However, the functional impact and formation mechanisms of SVs in plant genomes remain largely unexplored. Here, we report a nucleotide-resolution SV map of cucumber (Cucumis sativus) that comprises 26,788 SVs based on deep

  10. Genome-wide recombination rate variation in a recombination map of cotton.

    Science.gov (United States)

    Shen, Chao; Li, Ximei; Zhang, Ruiting; Lin, Zhongxu

    2017-01-01

    Recombination is crucial for genetic evolution, which not only provides new allele combinations but also influences the biological evolution and efficacy of natural selection. However, recombination variation is not well understood outside of the complex species' genomes, and it is particularly unclear in Gossypium. Cotton is the most important natural fibre crop and the second largest oil-seed crop. Here, we found that genetic and physical map distances did not have a simple linear relationship. Recombination rates were unevenly distributed throughout the cotton genome, showing marked changes along the chromosome lengths, and recombination was completely suppressed in the centromeric regions. Recombination rates varied significantly between the A-subgenome (At) (range = 1.60 to 3.26 centimorgan/megabase [cM/Mb]) and the D-subgenome (Dt) (range = 2.17 to 4.97 cM/Mb), which explained why the genetic maps of At and Dt are similar but the physical map of Dt is only half that of At. The translocation regions between A02 and A03 and between A04 and A05, and the inversion regions on A10, D10, A07 and D07 indicated relatively high recombination rates in the distal regions of the chromosomes. Recombination rates were positively correlated with the densities of genes and markers and with the distance from the centromere, and negatively correlated with transposable elements (TEs). The gene ontology (GO) categories showed that genes in high recombination regions tend to respond to environmental stimuli, and genes in low recombination regions are related to mitosis and meiosis, which suggested that they may provide the primary driving force in adaptive evolution and assure the stability of the basic cell cycle in a rapidly changing environment. Global knowledge of recombination rates will facilitate genetics and breeding in cotton.
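
    The rates quoted above are in cM/Mb, that is, genetic distance divided by physical distance. The following is a minimal sketch of that arithmetic for adjacent marker pairs, not the authors' pipeline; the function name and the marker coordinates are hypothetical and chosen only for illustration.

```python
def interval_recombination_rates(markers):
    """Return the recombination rate (cM/Mb) between each pair of adjacent markers.

    `markers` is a list of (physical_position_mb, genetic_position_cm) tuples
    sorted by physical position. The layout is an assumption for this sketch.
    """
    rates = []
    for (mb_a, cm_a), (mb_b, cm_b) in zip(markers, markers[1:]):
        physical_span = mb_b - mb_a
        if physical_span <= 0:
            raise ValueError("markers must be sorted by increasing physical position")
        # Rate = genetic distance / physical distance for this interval.
        rates.append((cm_b - cm_a) / physical_span)
    return rates


# Hypothetical chromosome: the middle interval gains no genetic distance,
# mimicking suppressed recombination around a centromere.
example_markers = [(0.0, 0.0), (2.0, 8.0), (10.0, 8.0), (12.0, 14.0)]
example_rates = interval_recombination_rates(example_markers)
print(example_rates)
```

    A flat stretch in the genetic map over a long physical span yields a rate of zero, which is why genetic and physical distances need not be linearly related.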

  11. Autism genome-wide copy number variation reveals ubiquitin and neuronal genes.

    Science.gov (United States)

    Glessner, Joseph T; Wang, Kai; Cai, Guiqing; Korvatska, Olena; Kim, Cecilia E; Wood, Shawn; Zhang, Haitao; Estes, Annette; Brune, Camille W; Bradfield, Jonathan P; Imielinski, Marcin; Frackelton, Edward C; Reichert, Jennifer; Crawford, Emily L; Munson, Jeffrey; Sleiman, Patrick M A; Chiavacci, Rosetta; Annaiah, Kiran; Thomas, Kelly; Hou, Cuiping; Glaberson, Wendy; Flory, James; Otieno, Frederick; Garris, Maria; Soorya, Latha; Klei, Lambertus; Piven, Joseph; Meyer, Kacie J; Anagnostou, Evdokia; Sakurai, Takeshi; Game, Rachel M; Rudd, Danielle S; Zurawiecki, Danielle; McDougle, Christopher J; Davis, Lea K; Miller, Judith; Posey, David J; Michaels, Shana; Kolevzon, Alexander; Silverman, Jeremy M; Bernier, Raphael; Levy, Susan E; Schultz, Robert T; Dawson, Geraldine; Owley, Thomas; McMahon, William M; Wassink, Thomas H; Sweeney, John A; Nurnberger, John I; Coon, Hilary; Sutcliffe, James S; Minshew, Nancy J; Grant, Struan F A; Bucan, Maja; Cook, Edwin H; Buxbaum, Joseph D; Devlin, Bernie; Schellenberg, Gerard D; Hakonarson, Hakon

    2009-05-28

    Autism spectrum disorders (ASDs) are childhood neurodevelopmental disorders with complex genetic origins. Previous studies focusing on candidate genes or genomic regions have identified several copy number variations (CNVs) that are associated with an increased risk of ASDs. Here we present the results from a whole-genome CNV study on a cohort of 859 ASD cases and 1,409 healthy children of European ancestry who were genotyped with approximately 550,000 single nucleotide polymorphism markers, in an attempt to comprehensively identify CNVs conferring susceptibility to ASDs. Positive findings were evaluated in an independent cohort of 1,336 ASD cases and 1,110 controls of European ancestry. Besides previously reported ASD candidate genes, such as NRXN1 (ref. 10) and CNTN4 (refs 11, 12), several new susceptibility genes encoding neuronal cell-adhesion molecules, including NLGN1 and ASTN2, were enriched with CNVs in ASD cases compared to controls (P = 9.5 × 10^-3). Furthermore, CNVs within or surrounding genes involved in the ubiquitin pathways, including UBE3A, PARK2, RFWD2 and FBXO40, were affected by CNVs not observed in controls (P = 3.3 × 10^-3). We also identified duplications 55 kilobases upstream of complementary DNA AK123120 (P = 3.6 × 10^-6). Although these variants may be individually rare, they target genes involved in neuronal cell-adhesion or ubiquitin degradation, indicating that these two important gene networks expressed within the central nervous system may contribute to the genetic susceptibility of ASD.

  12. Overlap in genomic variation associated with milk fat composition in Holstein Friesian and Dutch native dual-purpose breeds.

    Science.gov (United States)

    Maurice-Van Eijndhoven, M H T; Bovenhuis, H; Veerkamp, R F; Calus, M P L

    2015-09-01

    The aim of this study was to identify if genomic variations associated with fatty acid (FA) composition are similar between the Holstein-Friesian (HF) and native dual-purpose breeds used in the Dutch dairy industry. Phenotypic and genotypic information were available for the breeds Meuse-Rhine-Yssel (MRY), Dutch Friesian (DF), Groningen White Headed (GWH), and HF. First, the reliability of genomic breeding values of the native Dutch dual-purpose cattle breeds MRY, DF, and GWH was evaluated using single nucleotide polymorphism (SNP) effects estimated in HF, including all SNP or subsets with stronger associations in HF. Second, the genomic variation of the regions associated with FA composition in HF (regions on Bos taurus autosome 5, 14, and 26), were studied in the different breeds. Finally, similarities in genotype and allele frequencies between MRY, DF, GWH, and HF breeds were assessed for specific regions associated with FA composition. On average across the traits, the highest reliabilities of genomic prediction were estimated for GWH (0.158) and DF (0.116) when the 8 to 22 SNP with the strongest association in HF were included. With the same set of SNP, GEBV for MRY were the least reliable (0.022). This indicates that on average only 2 (MRY) to 16% (GWH) of the genomic variation in HF is shared with the native Dutch dual-purpose breeds. The comparison of predicted variances of different regions associated with milk and milk fat composition showed that breeds clearly differed in genomic variation within these regions. Finally, the correlations of allele frequencies between breeds across the 8 to 22 SNP with the strongest association in HF were around 0.8 between the Dutch native dual-purpose breeds, whereas the correlations between the native breeds and HF were clearly lower and around 0.5. There was no consistent relationship between the reliabilities of genomic prediction for a specific breed and the correlation between the allele frequencies of this breed

  13. Wind power variations under humid and arid meteorological conditions

    International Nuclear Information System (INIS)

    Şen, Zekâi

    2013-01-01

    Highlights: • The role of weather parameters in the wind energy calculation is indicated. • Meteorological variables are more significant in arid regions for wind power. • Air density variability can be taken into consideration. • Wind power is presented in terms of the wind speed, temperature and pressure. - Abstract: The classical wind power per rotor area per time is given as half the product of the air density and the third power of the wind velocity. This approach adopts the standard air density as a constant (1.23 kg/m^3), which ignores the dependence of density on air temperature and pressure. Weather conditions are not taken into consideration except for the variations in wind velocity. In general, an increase in pressure and a decrease in temperature cause an increase in wind power generation. The rate of increase in pressure has less effect on the wind power than the rate of change in temperature. This paper provides the wind power formulation based on three meteorological variables: the wind velocity, air temperature and air pressure. Furthermore, from the meteorology point of view, any change in the wind power is expressed as a function of partial changes in these meteorological variables. Additionally, weather conditions in humid and arid regions differ from each other, and it is interesting to see possible differences between the two regions. The application of the methodology is presented for two meteorology stations, Istanbul, Turkey, as representative of the humid regions and Al-Madinah Al-Monawwarah, Kingdom of Saudi Arabia, for the arid region, both on daily record bases for 2010. It is found that consideration of air temperature and pressure in the average wind power calculation gives about 1.3% decrease in Istanbul, whereas it is about 13.7% in Al-Madinah Al-Monawwarah. Hence, consideration of meteorological variables in wind power calculations becomes more significant in arid regions.
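
    The classical formula described above is P/A = (1/2) ρ v^3. A minimal sketch of how temperature and pressure could enter through the air density follows, assuming the dry-air ideal gas law ρ = p / (R T); this is an illustration, not necessarily the paper's formulation. The gas constant, the function names and the sample weather values are assumptions that do not come from the abstract, and humidity is ignored.

```python
# Specific gas constant of dry air in J/(kg*K). A standard physical constant,
# not a value quoted in the abstract.
R_DRY_AIR = 287.05

# Constant air density in kg/m^3 used by the classical formula.
STANDARD_DENSITY = 1.23


def air_density(pressure_pa, temperature_k):
    """Dry-air density (kg/m^3) from the ideal gas law, rho = p / (R * T)."""
    return pressure_pa / (R_DRY_AIR * temperature_k)


def wind_power_density(speed_ms, density=STANDARD_DENSITY):
    """Wind power per unit rotor area (W/m^2): 0.5 * rho * v**3."""
    return 0.5 * density * speed_ms ** 3


# Hypothetical hot, low-pressure day: 94,000 Pa and 40 degrees C (313.15 K).
hot_day_density = air_density(94_000.0, 313.15)
classical = wind_power_density(8.0)
adjusted = wind_power_density(8.0, hot_day_density)
print(f"density {hot_day_density:.3f} kg/m^3, "
      f"change {100 * (adjusted - classical) / classical:.1f}%")
```

    Because the power is linear in density, the relative change in power equals the relative change in density at a fixed wind speed, so warm, low-pressure conditions reduce the estimate directly.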

  14. Population-genetic nature of copy number variations in the human genome.

    Science.gov (United States)

    Kato, Mamoru; Kawaguchi, Takahisa; Ishikawa, Shumpei; Umeda, Takayoshi; Nakamichi, Reiichiro; Shapero, Michael H; Jones, Keith W; Nakamura, Yusuke; Aburatani, Hiroyuki; Tsunoda, Tatsuhiko

    2010-03-01

    Copy number variations (CNVs) are universal genetic variations, and their association with disease has been increasingly recognized. We designed high-density microarrays for CNVs, and detected 3000-4000 CNVs (4-6% of the genomic sequence) per population that included CNVs previously missed because of smaller sizes and residing in segmental duplications. The patterns of CNVs across individuals were surprisingly simple at the kilo-base scale, suggesting the applicability of a simple genetic analysis for these genetic loci. We utilized the probabilistic theory to determine integer copy numbers of CNVs and employed a recently developed phasing tool to estimate the population frequencies of integer copy number alleles and CNV-SNP haplotypes. The results showed a tendency toward a lower frequency of CNV alleles and that most of our CNVs were explained only by zero-, one- and two-copy alleles. Using the estimated population frequencies, we found several CNV regions with exceptionally high population differentiation. Investigation of CNV-SNP linkage disequilibrium (LD) for 500-900 bi- and multi-allelic CNVs per population revealed that previous conflicting reports on bi-allelic LD were unexpectedly consistent and explained by an LD increase correlated with deletion-allele frequencies. Typically, the bi-allelic LD was lower than SNP-SNP LD, whereas the multi-allelic LD was somewhat stronger than the bi-allelic LD. After further investigation of tag SNPs for CNVs, we conclude that the customary tagging strategy for disease association studies can be applicable for common deletion CNVs, but direct interrogation is needed for other types of CNVs.

  15. Genetic variation in the Staphylococcus aureus 8325 strain lineage revealed by whole-genome sequencing.

    Directory of Open Access Journals (Sweden)

    Kristoffer T Bæk

    Full Text Available Staphylococcus aureus strains of the 8325 lineage, especially 8325-4 and derivatives lacking prophage, have been used extensively for decades of research. We report herein the results of our deep sequence analysis of strain 8325-4. Assignment of sequence variants compared with the reference strain 8325 (NRS77/PS47) required correction of errors in the 8325 reference genome, and reassessment of variation previously attributed to chemical mutagenesis of the restriction-defective RN4220. Using an extensive strain pedigree analysis, we discovered that 8325-4 contains 16 single nucleotide polymorphisms (SNP) arising prior to the construction of RN4220. We identified 5 indels in 8325-4 compared with 8325. Three indels correspond to expected Φ11, 12, 13 excisions, one indel is explained by a sequence assembly artifact, and the final indel (Δ63bp in the spa-sarS intergenic region) is common to only a sub-lineage of 8325-4 strains including SH1000. This deletion was found to significantly decrease (75%) steady state sarS but not spa transcript levels in post-exponential phase. The sub-lineage 8325-4 was also found to harbor 4 additional SNPs. We also found large sequence variation between 8325, 8325-4 and RN4220 in a cluster of repetitive hypothetical proteins (SA0282 homologs) near the Ess secretion cluster. The overall 8325-4 SNP set results in 17 alterations within coding sequences. Remarkably, we discovered that all tested strains of the 8325-4 lineage lack phenol soluble modulin α3 (PSMα3), a virulence determinant implicated in neutrophil chemotaxis, biofilm architecture and surface spreading. Collectively, our results clarify and define the 8325-4 pedigree and reveal clear evidence that mutations existing throughout all branches of this lineage, including the widely used RN6390 and SH1000 strains, could conceivably impact virulence regulation.

  16. De novo Genome Assembly and Single Nucleotide Variations for Soybean Mosaic Virus Using Soybean Seed Transcriptome Data

    Directory of Open Access Journals (Sweden)

    Yeonhwa Jo

    2017-10-01

    Full Text Available Soybean is the most important legume crop in the world. Several diseases in soybean lead to serious yield losses in major soybean-producing countries. Moreover, soybean can be infected by diverse viruses. Recently, we carried out a large-scale screening to identify viruses infecting soybean using available soybean transcriptome data. Of the screened transcriptomes, a soybean transcriptome for soybean seed development analysis contains several virus-associated sequences. In this study, we identified five viruses, including soybean mosaic virus (SMV), infecting soybean by de novo transcriptome assembly followed by blast search. We assembled a nearly complete consensus genome sequence of SMV China using transcriptome data. Based on phylogenetic analysis, the consensus genome sequence of SMV China was closely related to SMV isolates from South Korea. We examined single nucleotide variations (SNVs) for SMVs in the soybean seed transcriptome, revealing 780 SNVs, which were evenly distributed on the SMV genome. Four SNVs, C-U, U-C, A-G, and G-A, were frequently identified. This result demonstrated the quasispecies variation of the SMV genome. Taken together, this study carried out bioinformatics analyses to identify viruses using soybean transcriptome data. In addition, we demonstrated the application of soybean transcriptome data for virus genome assembly and SNV analysis.

  17. Patterns of genomic variation in the poplar rust fungus Melampsora larici-populina identify pathogenesis-related factors

    Directory of Open Access Journals (Sweden)

    Antoine Persoons

    2014-09-01

    Full Text Available Melampsora larici-populina is a fungal pathogen responsible for foliar rust disease on poplar trees, which causes damage to forest plantations worldwide, particularly in Northern Europe. The reference genome of the isolate 98AG31 was previously sequenced using a whole genome shotgun strategy, revealing a large genome of 101 megabases containing 16,399 predicted genes, which included secreted protein genes representing poplar rust candidate effectors. In the present study, the genomes of 15 isolates collected over the past 20 years throughout the French territory, representing distinct virulence profiles, were characterized by massively parallel sequencing to assess genetic variation in the poplar rust fungus. Comparison to the reference genome revealed striking structural variations. Analysis of coverage and sequencing depth identified large missing regions between isolates related to the mating type loci. More than 611,824 single-nucleotide polymorphism (SNP) positions were uncovered overall, indicating a remarkable level of polymorphism. Based on the accumulation of non-synonymous substitutions in coding sequences and the relative frequencies of synonymous and non-synonymous polymorphisms (i.e. PN/PS), we identify candidate genes that may be involved in fungal pathogenesis. Correlation between non-synonymous SNPs in genes encoding secreted proteins and pathotypes of the studied isolates revealed candidate genes potentially related to virulences 1, 6 and 8 of the poplar rust fungus.

  18. SOILS AGROCHEMICAL PROPERTIES VARIATION UNDER MEDICINAL HERBS ECOLOGICAL CROPS

    Directory of Open Access Journals (Sweden)

    Mihaela Lungu

    2012-12-01

    Full Text Available Research was carried out on medicinal herbs within a national project financed by CNCSIS through the Partnership Program. Ecological and conventional technologies were applied. The project aimed to implement a standardization system for the vegetal raw materials which can be used in the cosmetic industry. Sage, basil, and savory were the subjects of the experiments, at Jucu, Cluj County, Ungureni – Butimanu, Dâmboviţa County, and Secuieni, Neamţ County. The dominant soils in these areas are Fluvisols and Haplic Chernozems in the Jucu area, Chromic Luvisol in the Ungureni – Butimanu area, and Calcic Chernozem in the Secuieni area. The agrochemical analysis of the soils from the experimental fields showed that soil fertility properties were conserved under both ecological and conventional growing technologies.

  19. Circadian pathway genetic variation and cancer risk: evidence from genome-wide association studies.

    Science.gov (United States)

    Mocellin, Simone; Tropea, Saveria; Benna, Clara; Rossi, Carlo Riccardo

    2018-02-19

    Dysfunction of the circadian clock and single polymorphisms of some circadian genes have been linked to cancer susceptibility, although data are scarce and findings inconsistent. We aimed to investigate the association between circadian pathway genetic variation and risk of developing common cancers based on the findings of genome-wide association studies (GWASs). Single nucleotide polymorphisms (SNPs) of 17 circadian genes reported by three GWAS meta-analyses dedicated to breast (Discovery, Biology, and Risk of Inherited Variants in Breast Cancer (DRIVE) Consortium; cases, n = 15,748; controls, n = 18,084), prostate (Elucidating Loci Involved in Prostate Cancer Susceptibility (ELLIPSE) Consortium; cases, n = 14,160; controls, n = 12,724) and lung carcinoma (Transdisciplinary Research In Cancer of the Lung (TRICL) Consortium; cases, n = 12,160; controls, n = 16,838) in patients of European ancestry were utilized to perform pathway analysis by means of the adaptive rank truncated product (ARTP) method. Data were also available for the following subgroups: estrogen receptor negative breast cancer, aggressive prostate cancer, squamous lung carcinoma and lung adenocarcinoma. We found a highly significant statistical association between circadian pathway genetic variation and the risk of breast (pathway P value = 1.9 × 10^-6; top gene RORA, gene P value = 0.0003), prostate (pathway P value = 4.1 × 10^-6; top gene ARNTL, gene P value = 0.0002) and lung cancer (pathway P value = 6.9 × 10^-7; top gene RORA, gene P value = 2.0 × 10^-6), as well as all their subgroups. Out of 17 genes investigated, 15 were found to be significantly associated with the risk of cancer: four genes were shared by all three malignancies (ARNTL, CLOCK, RORA and RORB), two by breast and lung cancer (CRY1 and CRY2) and three by prostate and lung cancer (NPAS2, NR1D1 and PER3), whereas four genes were specific for lung cancer

  20. Efficient inference of population size histories and locus-specific mutation rates from large-sample genomic variation data.

    Science.gov (United States)

    Bhaskar, Anand; Wang, Y X Rachel; Song, Yun S

    2015-02-01

    With the recent increase in study sample sizes in human genetics, there has been growing interest in inferring historical population demography from genomic variation data. Here, we present an efficient inference method that can scale up to very large samples, with tens or hundreds of thousands of individuals. Specifically, by utilizing analytic results on the expected frequency spectrum under the coalescent and by leveraging the technique of automatic differentiation, which allows us to compute gradients exactly, we develop a very efficient algorithm to infer piecewise-exponential models of the historical effective population size from the distribution of sample allele frequencies. Our method is orders of magnitude faster than previous demographic inference methods based on the frequency spectrum. In addition to inferring demography, our method can also accurately estimate locus-specific mutation rates. We perform extensive validation of our method on simulated data and show that it can accurately infer multiple recent epochs of rapid exponential growth, a signal that is difficult to pick up with small sample sizes. Lastly, we use our method to analyze data from recent sequencing studies, including a large-sample exome-sequencing data set of tens of thousands of individuals assayed at a few hundred genic regions. © 2015 Bhaskar et al.; Published by Cold Spring Harbor Laboratory Press.
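
    The method above builds on analytic results for the expected frequency spectrum under the coalescent. As a point of reference only, the sketch below computes the textbook constant-population-size case, where the expected number of sites with i derived copies in a sample of n sequences is θ/i. It is a simplified baseline, not the authors' piecewise-exponential model, and it does not use automatic differentiation; the function names are hypothetical.

```python
def expected_sfs(sample_size, theta=1.0):
    """Expected site frequency spectrum for a constant-size population.

    Entry i-1 holds the expected number of segregating sites at which the
    derived allele appears i times, for i = 1 .. sample_size - 1 (theta / i).
    """
    if sample_size < 2:
        raise ValueError("need at least two sampled sequences")
    return [theta / i for i in range(1, sample_size)]


def normalized_sfs(sample_size):
    """Expected proportion of segregating sites in each frequency class."""
    spectrum = expected_sfs(sample_size)
    total = sum(spectrum)
    return [count / total for count in spectrum]


# With 5 sampled sequences, singletons are the largest expected class.
proportions = normalized_sfs(5)
print([round(p, 3) for p in proportions])
```

    Demographic inference of this kind compares an observed spectrum against such expectations; for example, an excess of rare variants relative to the constant-size proportions is the classic signature of recent population growth.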

  1. IW-Scoring: an Integrative Weighted Scoring framework for annotating and prioritizing genetic variations in the noncoding genome.

    Science.gov (United States)

    Wang, Jun; Dayem Ullah, Abu Z; Chelala, Claude

    2018-01-30

    The vast majority of germline and somatic variations occur in the noncoding part of the genome, only a small fraction of which are believed to be functional. From the tens of thousands of noncoding variations detectable in each genome, identifying and prioritizing driver candidates with putative functional significance is challenging. To address this, we implemented IW-Scoring, a new Integrative Weighted Scoring model to annotate and prioritise functionally relevant noncoding variations. We evaluate 11 scoring methods, and apply an unsupervised spectral approach for subsequent selective integration into two linear weighted functional scoring schemas for known and novel variations. IW-Scoring produces stable high-quality performance as the best predictors for three independent data sets. We demonstrate the robustness of IW-Scoring in identifying recurrent functional mutations in the TERT promoter, as well as disease SNPs in proximity to consensus motifs and with gene regulatory effects. Using follicular lymphoma as a paradigmatic cancer model, we apply IW-Scoring to locate 11 recurrently mutated noncoding regions in 14 follicular lymphoma genomes, and validate 9 of these regions in an extension cohort, including the promoter and enhancer regions of PAX5. Overall, IW-Scoring demonstrates greater versatility in identifying trait- and disease-associated noncoding variants. Scores from IW-Scoring as well as other methods are freely available from http://www.snp-nexus.org/IW-Scoring/. © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Genomic Circuitry Underlying Immunological Response to Pediatric Acute Respiratory Infection.

    Science.gov (United States)

    Henrickson, Sarah E; Manne, Sasikanth; Dolfi, Douglas V; Mansfield, Kathleen D; Parkhouse, Kaela; Mistry, Rakesh D; Alpern, Elizabeth R; Hensley, Scott E; Sullivan, Kathleen E; Coffin, Susan E; Wherry, E John

    2018-01-09

    Acute respiratory tract viral infections (ARTIs) cause significant morbidity and mortality. CD8 T cells are fundamental to host responses, but transcriptional alterations underlying anti-viral mechanisms and links to clinical characteristics remain unclear. CD8 T cell transcriptional circuitry in acutely ill pediatric patients with influenza-like illness was distinct for different viral pathogens. Although changes included expected upregulation of interferon-stimulated genes (ISGs), transcriptional downregulation was prominent upon exposure to innate immune signals in early IFV infection. Network analysis linked changes to severity of infection, asthma, sex, and age. An influenza pediatric signature (IPS) distinguished acute influenza from other ARTIs and outperformed other influenza prediction gene lists. The IPS allowed a deeper investigation of the connection between transcriptional alterations and clinical characteristics of acute illness, including age-based differences in circuits connecting the STAT1/2 pathway to ISGs. A CD8 T cell-focused systems immunology approach in pediatrics identified age-based alterations in ARTI host response pathways. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  3. aCNViewer: Comprehensive genome-wide visualization of absolute copy number and copy neutral variations.

    Directory of Open Access Journals (Sweden)

    Victor Renault

    Full Text Available Copy number variations (CNV) include net gains or losses of part or whole chromosomal regions. They differ from copy neutral loss of heterozygosity (cn-LOH) events which do not induce any net change in the copy number and are often associated with uniparental disomy. These phenomena have long been reported to be associated with diseases and particularly in cancer. Losses/gains of genomic regions are often correlated with lower/higher gene expression. On the other hand, loss of heterozygosity (LOH) and cn-LOH are common events in cancer and may be associated with the loss of a functional tumor suppressor gene. Therefore, identifying recurrent CNV and cn-LOH events can be important as they may highlight common biological components and give insights into the development or mechanisms of a disease. However, no currently available tools allow a comprehensive whole-genome visualization of recurrent CNVs and cn-LOH in groups of samples providing absolute quantification of the aberrations leading to the loss of potentially important information. To overcome these limitations, we developed aCNViewer (Absolute CNV Viewer), a visualization tool for absolute CNVs and cn-LOH across a group of samples. aCNViewer proposes three graphical representations: dendrograms, bi-dimensional heatmaps showing chromosomal regions sharing similar abnormality patterns, and quantitative stacked histograms facilitating the identification of recurrent absolute CNVs and cn-LOH. We illustrated aCNViewer using publicly available hepatocellular carcinomas (HCCs) Affymetrix SNP Array data (Fig 1A). Regions 1q and 8q present a similar percentage of total gains but significantly different copy number gain categories (p-value of 0.0103 with a Fisher exact test), validated by another cohort of HCCs (p-value of 5.6e-7) (Fig 2B). aCNViewer is implemented in python and R and is available with a GNU GPLv3 license on GitHub https://github.com/FJD-CEPH/aCNViewer and Docker https://hub.docker.com/r/fjdceph/acnviewer/. Contact: aCNViewer@cephb.fr.

  4. aCNViewer: Comprehensive genome-wide visualization of absolute copy number and copy neutral variations.

    Science.gov (United States)

    Renault, Victor; Tost, Jörg; Pichon, Fabien; Wang-Renault, Shu-Fang; Letouzé, Eric; Imbeaud, Sandrine; Zucman-Rossi, Jessica; Deleuze, Jean-François; How-Kit, Alexandre

    2017-01-01

    Copy number variations (CNV) include net gains or losses of part or whole chromosomal regions. They differ from copy neutral loss of heterozygosity (cn-LOH) events, which do not induce any net change in the copy number and are often associated with uniparental disomy. These phenomena have long been reported to be associated with diseases, particularly cancer. Losses/gains of genomic regions are often correlated with lower/higher gene expression. On the other hand, loss of heterozygosity (LOH) and cn-LOH are common events in cancer and may be associated with the loss of a functional tumor suppressor gene. Therefore, identifying recurrent CNV and cn-LOH events can be important as they may highlight common biological components and give insights into the development or mechanisms of a disease. However, no currently available tools allow a comprehensive whole-genome visualization of recurrent CNVs and cn-LOH in groups of samples providing absolute quantification of the aberrations, leading to the loss of potentially important information. To overcome these limitations, we developed aCNViewer (Absolute CNV Viewer), a visualization tool for absolute CNVs and cn-LOH across a group of samples. aCNViewer proposes three graphical representations: dendrograms, bi-dimensional heatmaps showing chromosomal regions sharing similar abnormality patterns, and quantitative stacked histograms facilitating the identification of recurrent absolute CNVs and cn-LOH. We illustrated aCNViewer using publicly available hepatocellular carcinomas (HCCs) Affymetrix SNP Array data (Fig 1A). Regions 1q and 8q present a similar percentage of total gains but significantly different copy number gain categories (p-value of 0.0103 with a Fisher exact test), validated by another cohort of HCCs (p-value of 5.6e-7) (Fig 2B). aCNViewer is implemented in Python and R and is available with a GNU GPLv3 license on GitHub https://github.com/FJD-CEPH/aCNViewer and Docker https

  5. Performance Evaluation of Type-3 PLLs Under Wide Variation in Input Voltage and Frequency

    DEFF Research Database (Denmark)

    Aravind, C. K.; Rani, B.Indu; Chakkarapani, M.

    2017-01-01

    This paper presents a detailed analysis of Type-3 PLL under wide variation in input voltage and frequency. Using small signal modeling, the performance of both single loop and dual loop type-3 PLL for variation in input voltage and frequency is studied. The analysis shows that for the same bandwi...... verified by implementing in ALTERA cyclone II FPGA board....

  6. Genome Sequencing Reveals Loci under Artificial Selection that Underlie Disease Phenotypes in the Laboratory Rat

    NARCIS (Netherlands)

    Atanur, Santosh S.; Diaz, Ana Garcia; Maratou, Klio; Sarkis, Allison; Rotival, Maxime; Game, Laurence; Tschannen, Michael R.; Kaisaki, Pamela J.; Otto, Georg W.; Ma, Man Chun John; Keane, Thomas M.; Hummel, Oliver; Saar, Kathrin; Chen, Wei; Guryev, Victor; Gopalakrishnan, Kathirvel; Garrett, Michael R.; Joe, Bina; Citterio, Lorena; Bianchi, Giuseppe; McBride, Martin; Dominiczak, Anna; Adams, David J.; Serikawa, Tadao; Flicek, Paul; Cuppen, Edwin; Hubner, Norbert; Petretto, Enrico; Gauguier, Dominique; Kwitek, Anne; Jacob, Howard; Aitman, Timothy J.

    2013-01-01

    Large numbers of inbred laboratory rat strains have been developed for a range of complex disease phenotypes. To gain insights into the evolutionary pressures underlying selection for these phenotypes, we sequenced the genomes of 27 rat strains, including 11 models of hypertension, diabetes, and

  7. Novel genetic loci underlying human intracranial volume identified through genome-wide association

    NARCIS (Netherlands)

    Adams, Hieab H H; Hibar, Derrek P; Chouraki, Vincent; Stein, Jason L; Nyquist, Paul A; Rentería, Miguel E; Trompet, Stella; Arias-Vasquez, Alejandro; Seshadri, Sudha; Desrivières, Sylvane; Beecham, Ashley H; Jahanshad, Neda; Wittfeld, Katharina; Van der Lee, Sven J; Abramovic, Lucija; Alhusaini, Saud; Amin, Najaf; Andersson, Micael; Arfanakis, Konstantinos; Aribisala, Benjamin S; Armstrong, Nicola J; Athanasiu, Lavinia; Axelsson, Tomas; Beiser, Alexa; Bernard, Manon; Bis, Joshua C; Blanken, Laura M E; Blanton, Susan H; Bohlken, Marc M; Boks, Marco P; Bralten, Janita; Brickman, Adam M; Carmichael, Owen; Chakravarty, M Mallar; Chauhan, Ganesh; Chen, Qiang; Ching, Christopher R K; Cuellar-Partida, Gabriel; Braber, Anouk Den; Doan, Nhat Trung; Ehrlich, Stefan; Filippi, Irina; Ge, Tian; Giddaluru, Sudheer; Goldman, Aaron L; Gottesman, Rebecca F; Greven, Corina U; Grimm, Oliver; Griswold, Michael E; Guadalupe, Tulio; Hass, Johanna; Haukvik, Unn K; Hilal, Saima; Hofer, Edith; Hoehn, David; Holmes, Avram J; Hoogman, Martine; Janowitz, Deborah; Jia, Tianye; Kasperaviciute, Dalia; Kim, Sungeun; Klein, Marieke; Kraemer, Bernd; Lee, Phil H; Liao, Jiemin; Liewald, David C M; Lopez, Lorna M; Luciano, Michelle; Macare, Christine; Marquand, Andre; Matarin, Mar; Mather, Karen A; Mattheisen, Manuel; Mazoyer, Bernard; McKay, David R; McWhirter, Rebekah; Milaneschi, Yuri; Mirza-Schreiber, Nazanin; Muetzel, Ryan L; Maniega, Susana Muñoz; Nho, Kwangsik; Nugent, Allison C; Loohuis, Loes M Olde; Oosterlaan, Jaap; Papmeyer, Martina; Pappa, Irene; Pirpamer, Lukas; Pudas, Sara; Pütz, Benno; Rajan, Kumar B; Ramasamy, Adaikalavan; Richards, Jennifer S; Risacher, Shannon L; Roiz-Santiañez, Roberto; Rommelse, Nanda; Rose, Emma J; Royle, Natalie A; Rundek, Tatjana; Sämann, Philipp G; Satizabal, Claudia L; Schmaal, Lianne; Schork, Andrew J; Shen, Li; Shin, Jean; Shumskaya, Elena; Smith, Albert V; Sprooten, Emma; Strike, Lachlan T; Teumer, Alexander; Thomson, Russell; Tordesillas-Gutierrez, 
Diana; Toro, Roberto; Trabzuni, Daniah; Vaidya, Dhananjay; Van der Grond, Jeroen; Van der Meer, Dennis; Van Donkelaar, Marjolein M J; Van Eijk, Kristel R; Van Erp, Theo G M; Van Rooij, Daan; Walton, Esther; Westlye, Lars T; Whelan, Christopher D; Windham, Beverly G; Winkler, Anderson M; Woldehawariat, Girma; Wolf, Christiane; Wolfers, Thomas; Xu, Bing; Yanek, Lisa R; Yang, Jingyun; Zijdenbos, Alex; Zwiers, Marcel P; Agartz, Ingrid; Aggarwal, Neelum T; Almasy, Laura; Ames, David; Amouyel, Philippe; Andreassen, Ole A; Arepalli, Sampath; Assareh, Amelia A; Barral, Sandra; Bastin, Mark E; Becker, Diane M; Becker, James T; Bennett, David A; Blangero, John; van Bokhoven, Hans; Boomsma, Dorret I; Brodaty, Henry; Brouwer, Rachel M; Brunner, Han G; Buckner, Randy L; Buitelaar, Jan K; Bulayeva, Kazima B; Cahn, Wiepke; Calhoun, Vince D; Cannon, Dara M; Cavalleri, Gianpiero L; Chen, Christopher; Cheng, Ching-Yu; Cichon, Sven; Cookson, Mark R; Corvin, Aiden; Crespo-Facorro, Benedicto; Curran, Joanne E; Czisch, Michael; Dale, Anders M; Davies, Gareth E; De Geus, Eco J C; De Jager, Philip L; de Zubicaray, Greig I; Delanty, Norman; Depondt, Chantal; DeStefano, Anita L; Dillman, Allissa; Djurovic, Srdjan; Donohoe, Gary; Drevets, Wayne C; Duggirala, Ravi; Dyer, Thomas D; Erk, Susanne; Espeseth, Thomas; Evans, Denis A; Fedko, Iryna O; Fernández, Guillén; Ferrucci, Luigi; Fisher, Simon E; Fleischman, Debra A; Ford, Ian; Foroud, Tatiana M; Fox, Peter T; Francks, Clyde; Fukunaga, Masaki; Gibbs, J Raphael; Glahn, David C; Gollub, Randy L; Göring, Harald H H; Grabe, Hans J; Green, Robert C; Gruber, Oliver; Gudnason, Vilmundur; Guelfi, Sebastian; Hansell, Narelle K; Hardy, John; Hartman, Catharina A; Hashimoto, Ryota; Hegenscheid, Katrin; Heinz, Andreas; Le Hellard, Stephanie; Hernandez, Dena G; Heslenfeld, Dirk J; Ho, Beng-Choon; Hoekstra, Pieter J; Hoffmann, Wolfgang; Hofman, Albert; Holsboer, Florian; Homuth, Georg; Hosten, Norbert; Hottenga, Jouke-Jan; Pol, Hilleke E Hulshoff; Ikeda, 
Masashi; Ikram, M Kamran; Jack, Clifford R; Jenkinson, Mark; Johnson, Robert; Jönsson, Erik G; Jukema, J Wouter; Kahn, René S; Kanai, Ryota; Kloszewska, Iwona; Knopman, David S; Kochunov, Peter; Kwok, John B; Lawrie, Stephen M; Lemaître, Hervé; Liu, Xinmin; Longo, Dan L; Longstreth, W T; Lopez, Oscar L; Lovestone, Simon; Martinez, Oliver; Martinot, Jean-Luc; Mattay, Venkata S; McDonald, Colm; McIntosh, Andrew M; McMahon, Katie L; McMahon, Francis J; Mecocci, Patrizia; Melle, Ingrid; Meyer-Lindenberg, Andreas; Mohnke, Sebastian; Montgomery, Grant W; Morris, Derek W; Mosley, Thomas H; Mühleisen, Thomas W; Müller-Myhsok, Bertram; Nalls, Michael A; Nauck, Matthias; Nichols, Thomas E; Niessen, Wiro J; Nöthen, Markus M; Nyberg, Lars; Ohi, Kazutaka; Olvera, Rene L; Ophoff, Roel A; Pandolfo, Massimo; Paus, Tomas; Pausova, Zdenka; Penninx, Brenda W J H; Pike, G Bruce; Potkin, Steven G; Psaty, Bruce M; Reppermund, Simone; Rietschel, Marcella; Roffman, Joshua L; Romanczuk-Seiferth, Nina; Rotter, Jerome I; Ryten, Mina; Sacco, Ralph L; Sachdev, Perminder S; Saykin, Andrew J; Schmidt, Reinhold; Schofield, Peter R; Sigurdsson, Sigurdur; Simmons, Andy; Singleton, Andrew; Sisodiya, Sanjay M; Smith, Colin; Smoller, Jordan W; Soininen, Hilkka; Srikanth, Velandai; Steen, Vidar M; Stott, David J; Sussmann, Jessika E; Thalamuthu, Anbupalam; Tiemeier, Henning; Toga, Arthur W; Traynor, Bryan J; Troncoso, Juan; Turner, Jessica A; Tzourio, Christophe; Uitterlinden, Andre G; Hernández, Maria C Valdés; Van der Brug, Marcel; Van der Lugt, Aad; Van der Wee, Nic J A; Van Duijn, Cornelia M; Van Haren, Neeltje E M; Van T Ent, Dennis; Van Tol, Marie-Jose; Vardarajan, Badri N; Veltman, Dick J; Vernooij, Meike W; Völzke, Henry; Walter, Henrik; Wardlaw, Joanna M; Wassink, Thomas H; Weale, Michael E; Weinberger, Daniel R; Weiner, Michael W; Wen, Wei; Westman, Eric; White, Tonya; Wong, Tien Y; Wright, Clinton B; Zielke, H Ronald; Zonderman, Alan B; Deary, Ian J; DeCarli, Charles; Schmidt, Helena; 
Martin, Nicholas G; De Craen, Anton J M; Wright, Margaret J; Launer, Lenore J; Schumann, Gunter; Fornage, Myriam; Franke, Barbara; Debette, Stéphanie; Medland, Sarah E; Ikram, M Arfan; Thompson, Paul M

    2016-01-01

    Intracranial volume reflects the maximally attained brain size during development, and remains stable with loss of tissue in late life. It is highly heritable, but the underlying genes remain largely undetermined. In a genome-wide association study of 32,438 adults, we discovered five previously

  8. Whole-genome copy number variation analysis in anophthalmia and microphthalmia.

    Science.gov (United States)

    Schilter, K F; Reis, L M; Schneider, A; Bardakjian, T M; Abdul-Rahman, O; Kozel, B A; Zimmerman, H H; Broeckel, U; Semina, E V

    2013-11-01

    Anophthalmia/microphthalmia (A/M) represent severe developmental ocular malformations. Currently, mutations in known genes explain less than 40% of A/M cases. We performed whole-genome copy number variation analysis in 60 patients affected with isolated or syndromic A/M. Pathogenic deletions of 3q26 (SOX2) were identified in four independent patients with syndromic microphthalmia. Other variants of interest included regions with a known role in human disease (likely pathogenic) as well as novel rearrangements (uncertain significance). A 2.2-Mb duplication of 3q29 in a patient with non-syndromic anophthalmia and an 877-kb duplication of 11p13 (PAX6) and a 1.4-Mb deletion of 17q11.2 (NF1) in two independent probands with syndromic microphthalmia and other ocular defects were identified; while ocular anomalies have been previously associated with 3q29 duplications, PAX6 duplications, and NF1 mutations in some cases, the ocular phenotypes observed here are more severe than previously reported. Three novel regions of possible interest included a 2q14.2 duplication which cosegregated with microphthalmia/microcornea and congenital cataracts in one family, and 2q21 and 15q26 duplications in two additional cases; each of these regions contains genes that are active during vertebrate ocular development. Overall, this study identified causative copy number mutations and regions with a possible role in ocular disease in 17% of A/M cases. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  9. A genome-wide investigation of copy number variation in patients with sporadic brain arteriovenous malformation.

    Directory of Open Access Journals (Sweden)

    Nasrine Bendjilali

    Full Text Available Brain arteriovenous malformations (BAVM) are clusters of abnormal blood vessels, with shunting of blood from the arterial to venous circulation and a high risk of rupture and intracranial hemorrhage. Most BAVMs are sporadic, but also occur in patients with Hereditary Hemorrhagic Telangiectasia, a Mendelian disorder caused by mutations in genes in the transforming growth factor beta (TGFβ) signaling pathway. To investigate whether copy number variations (CNVs) contribute to risk of sporadic BAVM, we performed a genome-wide association study in 371 sporadic BAVM cases and 563 healthy controls, all Caucasian. Cases and controls were genotyped using the Affymetrix 6.0 array. CNVs were called using the PennCNV and Birdsuite algorithms and analyzed via segment-based and gene-based approaches. Common and rare CNVs were evaluated for association with BAVM. A CNV region on 1p36.13, containing the neuroblastoma breakpoint family, member 1 gene (NBPF1), was significantly enriched with duplications in BAVM cases compared to controls (P = 2.2×10^-9); NBPF1 was also significantly associated with BAVM in gene-based analysis using both PennCNV and Birdsuite. We experimentally validated the 1p36.13 duplication; however, the association did not replicate in an independent cohort of 184 sporadic BAVM cases and 182 controls (OR = 0.81, P = 0.8). Rare CNV analysis did not identify genes significantly associated with BAVM. We did not identify common CNVs associated with sporadic BAVM that replicated in an independent cohort. Replication in larger cohorts is required to elucidate the possible role of common or rare CNVs in BAVM pathogenesis.

  10. Whole Genome Re-Sequencing and Characterization of Powdery Mildew Disease-Associated Allelic Variation in Melon.

    Directory of Open Access Journals (Sweden)

    Sathishkumar Natarajan

    Full Text Available Powdery mildew is one of the most common fungal diseases in the world. This disease frequently affects melon (Cucumis melo L.) and other Cucurbitaceous family crops in both open field and greenhouse cultivation. One of the goals of genomics is to identify the polymorphic loci responsible for variation in phenotypic traits. In this study, powdery mildew disease assessment scores were calculated for four melon accessions, 'SCNU1154', 'Edisto47', 'MR-1', and 'PMR5'. To investigate the genetic variation of these accessions, whole genome re-sequencing using the Illumina HiSeq 2000 platform was performed. A total of 754,759,704 quality-filtered reads were generated, with an average of 82.64% coverage relative to the reference genome. Comparisons of the sequences for the melon accessions revealed around 7.4 million single nucleotide polymorphisms (SNPs), 1.9 million InDels, and 182,398 putative structural variations (SVs). Functional enrichment analysis of detected variations classified them into biological process, cellular component and molecular function categories. Further, a disease-associated QTL map was constructed for 390 SNPs and 45 InDels identified as related to defense-response genes. Among them 112 SNPs and 12 InDels were observed in powdery mildew responsive chromosomes. Accordingly, this whole genome re-sequencing study identified SNPs and InDels associated with defense genes that will serve as candidate polymorphisms in the search for sources of resistance against powdery mildew disease and could accelerate marker-assisted breeding in melon.

  11. Whole Genome Re-Sequencing and Characterization of Powdery Mildew Disease-Associated Allelic Variation in Melon.

    Science.gov (United States)

    Natarajan, Sathishkumar; Kim, Hoy-Taek; Thamilarasan, Senthil Kumar; Veerappan, Karpagam; Park, Jong-In; Nou, Ill-Sup

    2016-01-01

    Powdery mildew is one of the most common fungal diseases in the world. This disease frequently affects melon (Cucumis melo L.) and other Cucurbitaceous family crops in both open field and greenhouse cultivation. One of the goals of genomics is to identify the polymorphic loci responsible for variation in phenotypic traits. In this study, powdery mildew disease assessment scores were calculated for four melon accessions, 'SCNU1154', 'Edisto47', 'MR-1', and 'PMR5'. To investigate the genetic variation of these accessions, whole genome re-sequencing using the Illumina HiSeq 2000 platform was performed. A total of 754,759,704 quality-filtered reads were generated, with an average of 82.64% coverage relative to the reference genome. Comparisons of the sequences for the melon accessions revealed around 7.4 million single nucleotide polymorphisms (SNPs), 1.9 million InDels, and 182,398 putative structural variations (SVs). Functional enrichment analysis of detected variations classified them into biological process, cellular component and molecular function categories. Further, a disease-associated QTL map was constructed for 390 SNPs and 45 InDels identified as related to defense-response genes. Among them 112 SNPs and 12 InDels were observed in powdery mildew responsive chromosomes. Accordingly, this whole genome re-sequencing study identified SNPs and InDels associated with defense genes that will serve as candidate polymorphisms in the search for sources of resistance against powdery mildew disease and could accelerate marker-assisted breeding in melon.

  12. Melanic variation underlies aposematic color variation in two hymenopteran mimicry systems.

    Directory of Open Access Journals (Sweden)

    Heather M Hines

    Full Text Available The stinging hymenopteran velvet ants (Mutillidae) and bumble bees (Apidae: Bombus spp.) have both undergone extensive diversification in aposematic color patterns, including yellow-red hues and contrasting dark-light body coloration, as a result of Müllerian mimicry. Understanding the genetic and developmental mechanisms underlying shifts in these mimetic colors requires characterization of their pigmentation. In this study, a combination of solubility, spectrophotometry, and melanin degradation analyses is applied to several color forms and species of these lineages to determine that orange-red colors in both lineages are composed primarily of dopamine-derived pheomelanins. Until a few recent studies, pheomelanins were thought not to occur in insects. These results support their potential to occur across insects and particularly among the Hymenoptera. Shifts between black and orange-red colors, such as between mimetic color forms of the bumble bee Bombus melanopygus, are inferred to involve modification of the ratios of dark eumelanins to red pheomelanins, thus implicating the melanin pathway in mimetic diversification. This discovery highlights the need to focus on how pheomelanins are synthesized in the insect melanin pathway and the potential for new pigments to be found even in some of our most well-known insect systems.

  13. Natural variation in rosette size under salt stress conditions corresponds to developmental differences between Arabidopsis accessions and allelic variation in the LRR-KISS gene

    KAUST Repository

    Julkowska, Magdalena; Klei, Karlijn; Fokkens, Like; Haring, Michel A.; Schranz, M. Eric; Testerink, Christa

    2016-01-01

    Natural variation among Arabidopsis accessions is an important genetic resource to identify mechanisms underlying plant development and stress tolerance. To evaluate the natural variation in salinity stress tolerance, two large-scale experiments

  14. Defining the diverse spectrum of inversions, complex structural variation, and chromothripsis in the morbid human genome.

    Science.gov (United States)

    Collins, Ryan L; Brand, Harrison; Redin, Claire E; Hanscom, Carrie; Antolik, Caroline; Stone, Matthew R; Glessner, Joseph T; Mason, Tamara; Pregno, Giulia; Dorrani, Naghmeh; Mandrile, Giorgia; Giachino, Daniela; Perrin, Danielle; Walsh, Cole; Cipicchio, Michelle; Costello, Maura; Stortchevoi, Alexei; An, Joon-Yong; Currall, Benjamin B; Seabra, Catarina M; Ragavendran, Ashok; Margolin, Lauren; Martinez-Agosto, Julian A; Lucente, Diane; Levy, Brynn; Sanders, Stephan J; Wapner, Ronald J; Quintero-Rivera, Fabiola; Kloosterman, Wigard; Talkowski, Michael E

    2017-03-06

    Structural variation (SV) influences genome organization and contributes to human disease. However, the complete mutational spectrum of SV has not been routinely captured in disease association studies. We sequenced 689 participants with autism spectrum disorder (ASD) and other developmental abnormalities to construct a genome-wide map of large SV. Using long-insert jumping libraries at 105X mean physical coverage and linked-read whole-genome sequencing from 10X Genomics, we document seven major SV classes at ~5 kb SV resolution. Our results encompass 11,735 distinct large SV sites, 38.1% of which are novel and 16.8% of which are balanced or complex. We characterize 16 recurrent subclasses of complex SV (cxSV), revealing that: (1) cxSV are larger and rarer than canonical SV; (2) each genome harbors 14 large cxSV on average; (3) 84.4% of large cxSVs involve inversion; and (4) most large cxSV (93.8%) have not been delineated in previous studies. Rare SVs are more likely to disrupt coding and regulatory non-coding loci, particularly when truncating constrained and disease-associated genes. We also identify multiple cases of catastrophic chromosomal rearrangements known as chromoanagenesis, including somatic chromoanasynthesis, and extreme balanced germline chromothripsis events involving up to 65 breakpoints and 60.6 Mb across four chromosomes, further defining rare categories of extreme cxSV. These data provide a foundational map of large SV in the morbid human genome and demonstrate a previously underappreciated abundance and diversity of cxSV that should be considered in genomic studies of human disease.

  15. Defining the role of common variation in the genomic and biological architecture of adult human height

    NARCIS (Netherlands)

    A.R. Wood (Andrew); T. Esko (Tõnu); J. Yang (Jian); S. Vedantam (Sailaja); T.H. Pers (Tune); S. Gustafsson (Stefan); A.Y. Chu (Audrey Y); K. Estrada Gil (Karol); J. Luan; Z. Kutalik; N. Amin (Najaf); M.L. Buchkovich (Martin); D.C. Croteau-Chonka (Damien); F.R. Day (Felix); Y. Duan (Yanan); M. Fall (Magnus); R.S.N. Fehrmann (Rudolf); T. Ferreira (Teresa); A.U. Jackson (Anne); J. Karjalainen (Juha); K.S. Lo (Ken Sin); A. Locke (Adam); R. Mägi (Reedik); E. Mihailov (Evelin); E. Porcu (Eleonora); J.C. Randall (Joshua); A. Scherag (Andre); A.A.E. Vinkhuyzen (Anna A.); H.J. Westra (Harm-Jan); T.W. Winkler (Thomas W.); T. Workalemahu (Tsegaselassie); J.H. Zhao (Jing Hua); D. Absher (Devin); E. Albrecht (Eva); J. Baron (Jeffrey); M. Beekman (Marian); A. Demirkan (Ayşe); G.B. Ehret (Georg); B. Feenstra; M.F. Feitosa (Mary Furlan); K. Fischer (Krista); R.M. Fraser (Ross); A. Goel (Anuj); J. Gong (Jian); A.E. Justice (Anne); S. Kanoni (Stavroula); M.E. Kleber (Marcus); K. Kristiansson (Kati); U. Lim (Unhee); V. Lotay (Vaneet); J.C. Lui (Julian C); M. Mangino (Massimo); I.M. Leach (Irene Mateo); M.C. Medina-Gomez (Carolina); M.A. Nalls (Michael); A.S. Dimas (Antigone); C. Palmer (Cameron); D. Pasko (Dorota); S. Pechlivanis (Sonali); I. Prokopenko (Inga); J.S. Ried (Janina); S. Ripke (Stephan); D. Shungin (Dmitry); A. Stancáková (Alena); R.J. Strawbridge (Rona); Y.J. Sung (Yun Ju); T. Tanaka (Toshiko); A. Teumer (Alexander); S. Trompet (Stella); S.W. Van Der Laan (Sander W.); J. van Setten (Jessica); J.V. van Vliet-Ostaptchouk (Jana); Z. Wang (Zhaoming); L. Yengo (Loic); W. Zhang (Weihua); U. Afzal (Uzma); J. Ärnlöv (Johan); G.M. Arscott (Gillian M.); S. Bandinelli (Stefania); A. Barrett (Angela); C. Bellis (Claire); A.J. Bennett (Amanda); C. Berne (Christian); M. Blüher (Matthias); J.L. Bolton (Jennifer); Y. Böttcher (Yvonne); H.A. Boyd; M. Bruinenberg (M.); B.M. Buckley (Brendan M.); S. Buyske (Steven); I.H. Caspersen (Ida H.); P.S. Chines (Peter); R. Clarke (Robert); S. 
Claudi-Boehm (Simone); M.N. Cooper (Matthew); E.W. Daw (E Warwick); P.A. De Jong (Pim A); J. Deelen (Joris); G. Delgado; J.C. Denny (Josh C); R.A.M. Dhonukshe-Rutten (Rosalie); M. Dimitriou (Maria); A.S.F. Doney (Alex); M. Dörr (Marcus); N. Eklund (Niina); E. Eury (Elodie); L. Folkersen (Lasse); M. Garcia (Melissa); F. Geller (Frank); V. Giedraitis (Vilmantas); A. Go (Attie); H. Grallert (Harald); T.B. Grammer (Tanja B); J. Gräßler (Jürgen); H. Grönberg (Henrik); L.C.P.G.M. de Groot (Lisette); C.J. Groves (Christopher J.); J. Haessler (Jeff); P. Hall (Per); T. Haller (Toomas); G. Hallmans (Göran); M. Hannemann (Mario); C.A. Hartman (Catharina); M. Hassinen (Maija); C. Hayward (Caroline); N.L. Heard-Costa (Nancy); Q. Helmer (Quinta); G. Hemani; A.K. Henders (Anjali); H.L. Hillege (Hans); M.A. Hlatky (Mark); W. Hoffmann (Wolfgang); P. Hoffmann (Per); O.L. Holmen (Oddgeir); J.J. Houwing-Duistermaat (Jeanine); T. Illig (Thomas); A. Isaacs (Aaron); A.L. James (Alan); J. Jeff (Janina); B. Johansen (Berit); A. Johansson (Åsa); G.J. Jolley (Jason); T. Juliusdottir (Thorhildur); M.J. Junttila (Juhani); M.M.L. Kho (Marcia); L. Kinnunen (Leena); N. Klopp (Norman); T. Kocher; W. Kratzer (Wolfgang); P. Lichtner (Peter); L. Lind (Lars); J. Lindström (Jaana); S. Lobbens (Stéphane); M. Lorentzon (Mattias); Y. Lu (Yingchang); V. Lyssenko (Valeriya); P.K. Magnusson (Patrik); A. Mahajan (Anubha); M. Maillard (Marc); W.L. McArdle (Wendy); C.A. McKenzie (Colin A.); S. McLachlan (Stela); P.J. McLaren (Paul J); C. Menni (Cristina); S. Merger (Sigrun); L. Milani (Lili); A. Moayyeri (Alireza); K.L. Monda (Keri); M.A. Morken (Mario); G. Müller (Gabriele); M. Müller-Nurasyid (Martina); A.W. Musk (Arthur); N. Narisu (Narisu); M. Nauck (Matthias); I.M. Nolte (Ilja M.); M.M. Nöthen (Markus); L. Oozageer (Laticia); S. Pilz (Stefan); N.W. Rayner (Nigel William); F. Renström (Frida); N.R. Robertson (Neil R.); L.M. Rose (Lynda M.); R. Roussel (Ronan); S. Sanna (Serena); H. Scharnagl (Hubert); S. 
Scholtens (Salome); F.R. Schumacher (Fredrick R); H. Schunkert (Heribert); R.A. Scott (Robert); J.S. Sehmi (Joban); T. Seufferlein (Thomas); J. Shi (Jianxin); K. Silventoinen (Karri); J.H. Smit (Johannes); G.D. Smith; J. Smolonska (Joanna); A. Stanton (Alice); K. Stirrups (Kathy); D.J. Stott (David J); H.M. Stringham (Heather); J. Sundstrom (Johan); M. Swertz (Morris); A.C. Syvanen; B. Tayo (Bamidele); G. Thorleifsson (Gudmar); J.P. Tyrer (Jonathan); S. Van Dijk (Suzanne); N.M. van Schoor (Natasja); N. van der Velde (Nathalie); D. van Heemst (Diana); F.V.A. Van Oort (Floor V A); S.H.H.M. Vermeulen (Sita); N. Verweij (Niek); J.M. Vonk (Judith M); L. Waite (Lindsay); M. Waldenberger (Melanie); R. Wennauer (Roman); L.R. Wilkens (Lynne R.); C. Willenborg (Christina); T. Wilsgaard (Tom); M.K. Wojczynski (Mary ); A. Wong (Andrew); A. Wright (Alan); Q. Zhang (Qunyuan); D. Arveiler (Dominique); S.J.L. Bakker (Stephan); J. Beilby (John); R.N. Bergman (Richard); S.M. Bergmann (Sven); R. Biffar; J. Blangero (John); D.I. Boomsma (Dorret); S.R. Bornstein (Stefan R.); P. Bovet (Pascal); P. Brambilla (Paolo); M.J. Brown (Morris); H. Campbell (Harry); M. Caulfield (Mark); A. Chakravarti (Aravinda); F.S. Collins (Francis); D.C. Crawford (Dana); L.A. Cupples (Adrienne); J. Danesh (John); U. de Faire (Ulf); H.M. den Ruijter (Hester ); R. Erbel (Raimund); J. Erdmann (Jeanette); J. Eriksson; M. Farrall (Martin); E. Ferrannini (Ele); J. Ferrieres (Jean); I. Ford; N.G. Forouhi (Nita); T. Forrester (Terrence); R.T. Gansevoort (Ron); P.V. Gejman (Pablo); C. Gieger (Christian); A. Golay (Alain); R.F. Gottesman (Rebecca); V. Gudnason (Vilmundur); U. Gyllensten (Ulf); D.W. Haas (David W); A.S. Hall (Alistair); T.B. Harris (Tamara); A.T. Hattersley (Andrew); A.C. Heath (Andrew C); C. Hengstenberg (Christian); A.A. Hicks (Andrew); L.A. Hindorff (Lucia A); A. Hingorani (Aroon); A. Hofman (Albert); G.K. Hovingh (Kees); S.E. Humphries (Steve E.); S.C. Hunt (Steven); E. Hypponen (Elina); K.B. 
Jacobs (Kevin); M.-R. Jarvelin (Marjo-Riitta); P. Jousilahti (Pekka); A. Jula (Antti); J. Kaprio (Jaakko); J.J.P. Kastelein (John); M.H. Kayser (Manfred); F. Kee (Frank); S. Keinanen-Kiukaanniemi (Sirkka); L.A.L.M. Kiemeney (Bart); J.S. Kooner (Jaspal S.); C. Kooperberg (Charles); S. Koskinen (Seppo); P. Kovacs (Peter); A. Kraja (Aldi); M. Kumari (Meena); J. Kuusisto (Johanna); T.A. Lakka (Timo); C. Langenberg (Claudia); L. Le Marchand (Loic); T. Lehtimäki (Terho); S. Lupoli (Sara); P.A. Madden; S. Männistö (Satu); P. Manunta (Paolo); A. Marette (Andre'); T.C. Matise (Tara C.); B. McKnight (Barbara); T. Meitinger (Thomas); F.L. Moll (Frans); G.W. Montgomery (Grant W.); A.D. Morris (Andrew); A.P. Morris (Andrew); J.C. Murray (Jeffrey); M. Nelis (Mari); C. Ohlsson (Claes); A.J. Oldehinkel (Albertine); K.K. Ong (Ken K.); W.H. Ouwehand (Willem); G. Pasterkamp (Gerard); A. Peters (Annette); P.P. Pramstaller (Peter Paul); J.F. Price (Jackie F.); L. Qi (Lu); O. Raitakari (Olli); T. Rankinen (Tuomo); D.C. Rao (Dabeeru C.); T.K. Rice (Treva K.); M.D. Ritchie (Marylyn D.); I. Rudan (Igor); V. Salomaa (Veikko); N.J. Samani (Nilesh); J. Saramies (Jouko); M.A. Sarzynski (Mark A.); P.E.H. Schwarz (Peter E. H.); S. Sebert (Sylvain); P. Sever (Peter); A.R. Shuldiner (Alan); J. Sinisalo (Juha); V. Steinthorsdottir (Valgerdur); R.P. Stolk; J.-C. Tardif (Jean-Claude); A. Tönjes (Anke); A. Tremblay (Angelo); E. Tremoli (Elena); J. Virtamo (Jarmo); M.-C. Vohl (Marie-Claude); P. Amouyel (Philippe); F.W. Asselbergs (Folkert W.); T.L. Assimes (Themistocles); M. Bochud (Murielle); B.O. Boehm (Bernhard); E.A. Boerwinkle (Eric); E.P. Bottinger (Erwin P.); C. Bouchard (Claude); S. Cauchi (Stéphane); J.C. Chambers (John C.); S.J. Chanock (Stephen); R.S. Cooper (Richard S.); P.I.W. de Bakker (Paul); G.V. Dedoussis (George); L. Ferrucci (Luigi); P.W. Franks; P. Froguel (Philippe); L. Groop (Leif); C.A. Haiman (Christopher); A. Hamsten (Anders); M.G. Hayes (M. Geoffrey); J. Hui (Jennie); D. 
Hunter (David); K. Hveem (Kristian); J.W. Jukema (Jan Wouter); R.C. Kaplan (Robert); M. Kivimaki (Mika); D. Kuh (Diana); M. Laakso (Markku); Y. Liu (YongMei); N.G. Martin (Nicholas); W. März (Winfried); M. Melbye (Mads); S. Moebus (Susanne); P. Munroe (Patricia); I. Njølstad (Inger); B.A. Oostra (Ben); C.N.A. Palmer (Colin); N.L. Pedersen (Nancy L.); M. Perola (Markus); L. Perusse (Louis); U. Peters (Ulrike); J.E. Powell (Joseph); C. Power (Christine); T. Quertermous (Thomas); R. Rauramaa (Rainer); E. Reinmaa (Eva); P.M. Ridker (Paul); F. Rivadeneira Ramirez (Fernando); J.I. Rotter (Jerome I.); T. Saaristo (Timo); D. Saleheen; D. Schlessinger (David); P.E. Slagboom (P Eline); H. Snieder (Harold); T.D. Spector (Timothy); K. Strauch (Konstantin); M. Stumvoll (Michael); J. Tuomilehto (Jaakko); M. Uusitupa (Matti); P. van der Harst (Pim); H. Völzke (Henry); M. Walker (Mark); N.J. Wareham (Nick); H. Watkins (Hugh); H.E. Wichmann (Heinz Erich); J.F. Wilson (James F); P. Zanen (Pieter); P. Deloukas (Panagiotis); I.M. Heid (Iris); C.M. Lindgren (Cecilia); K.L. Mohlke (Karen); E.K. Speliotes (Elizabeth); U. Thorsteinsdottir (Unnur); I.E. Barroso (Inês); C.S. Fox (Caroline S.); K.E. North (Kari); D.P. Strachan (David P.); J.S. Beckmann (Jacques); S.I. Berndt (Sonja); M. Boehnke (Michael); I.B. Borecki (Ingrid); M.I. McCarthy (Mark); A. Metspalu (Andres); J-A. Zwart (John-Anker); A.G. Uitterlinden (André); C.M. van Duijn (Cornelia); L. Franke (Lude); C.J. Willer (Cristen); A. Price (Alkes); G. Lettre (Guillaume); R.J.F. Loos (Ruth); M.N. Weedon (Michael); E. Ingelsson (Erik); J.R. O´Connell; G.R. Abecasis (Gonçalo); D.I. Chasman (Daniel); D. Anderson (Denise); M.E. Goddard (Michael); P.M. Visscher (Peter); J.N. Hirschhorn (Joel); T.M. Frayling (Timothy)

    2014-01-01

    Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated

  16. Defining the role of common variation in the genomic and biological architecture of adult human height

    NARCIS (Netherlands)

    Wood, Andrew R.; Esko, Tonu; Yang, Jian; Vedantam, Sailaja; Pers, Tune H.; Gustafsson, Stefan; Chu, Audrey Y.; Estrada, Karol; Luan, Jian'an; Kutalik, Zoltán; Amin, Najaf; Buchkovich, Martin L.; Croteau-Chonka, Damien C.; Day, Felix R.; Duan, Yanan; Fall, Tove; Fehrmann, Rudolf; Ferreira, Teresa; Jackson, Anne U.; Karjalainen, Juha; Lo, Ken Sin; Locke, Adam E.; Mägi, Reedik; Mihailov, Evelin; Porcu, Eleonora; Randall, Joshua C.; Scherag, André; Vinkhuyzen, Anna A. E.; Westra, Harm-Jan; Winkler, Thomas W.; Workalemahu, Tsegaselassie; Zhao, Jing Hua; Absher, Devin; Albrecht, Eva; Anderson, Denise; Baron, Jeffrey; Beekman, Marian; Demirkan, Ayse; Ehret, Georg B.; Feenstra, Bjarke; Feitosa, Mary F.; Fischer, Krista; Fraser, Ross M.; Goel, Anuj; Gong, Jian; Justice, Anne E.; Kanoni, Stavroula; Kleber, Marcus E.; Kristiansson, Kati; Lim, Unhee; Lotay, Vaneet; Lui, Julian C.; Mangino, Massimo; Mateo Leach, Irene; Medina-Gomez, Carolina; Nalls, Michael A.; Nyholt, Dale R.; Palmer, Cameron D.; Pasko, Dorota; Pechlivanis, Sonali; Prokopenko, Inga; Ried, Janina S.; Ripke, Stephan; Shungin, Dmitry; Stancáková, Alena; Strawbridge, Rona J.; Sung, Yun Ju; Tanaka, Toshiko; Teumer, Alexander; Trompet, Stella; van der Laan, Sander W.; van Setten, Jessica; van Vliet-Ostaptchouk, Jana V.; Wang, Zhaoming; Yengo, Loïc; Zhang, Weihua; Afzal, Uzma; Arnlöv, Johan; Arscott, Gillian M.; Bandinelli, Stefania; Barrett, Amy; Bellis, Claire; Bennett, Amanda J.; Berne, Christian; Blüher, Matthias; Bolton, Jennifer L.; Böttcher, Yvonne; Boyd, Heather A.; Bruinenberg, Marcel; Buckley, Brendan M.; Buyske, Steven; Caspersen, Ida H.; Chines, Peter S.; Clarke, Robert; Claudi-Boehm, Simone; Cooper, Matthew; Daw, E. Warwick; de Jong, Pim A.; Deelen, Joris; Delgado, Graciela; Denny, Josh C.; Dhonukshe-Rutten, Rosalie; Dimitriou, Maria; Doney, Alex S. 
F.; Dörr, Marcus; Eklund, Niina; Eury, Elodie; Folkersen, Lasse; Garcia, Melissa E.; Geller, Frank; Giedraitis, Vilmantas; Go, Alan S.; Grallert, Harald; Grammer, Tanja B.; Gräßler, Jürgen; Grönberg, Henrik; de Groot, Lisette C. P. G. M.; Groves, Christopher J.; Haessler, Jeffrey; Hall, Per; Haller, Toomas; Hallmans, Goran; Hannemann, Anke; Hartman, Catharina A.; Hassinen, Maija; Hayward, Caroline; Heard-Costa, Nancy L.; Helmer, Quinta; Hemani, Gibran; Henders, Anjali K.; Hillege, Hans L.; Hlatky, Mark A.; Hoffmann, Wolfgang; Hoffmann, Per; Holmen, Oddgeir; Houwing-Duistermaat, Jeanine J.; Illig, Thomas; Isaacs, Aaron; James, Alan L.; Jeff, Janina; Johansen, Berit; Johansson, Åsa; Jolley, Jennifer; Juliusdottir, Thorhildur; Junttila, Juhani; Kho, Abel N.; Kinnunen, Leena; Klopp, Norman; Kocher, Thomas; Kratzer, Wolfgang; Lichtner, Peter; Lind, Lars; Lindström, Jaana; Lobbens, Stéphane; Lorentzon, Mattias; Lu, Yingchang; Lyssenko, Valeriya; Magnusson, Patrik K. E.; Mahajan, Anubha; Maillard, Marc; McArdle, Wendy L.; McKenzie, Colin A.; McLachlan, Stela; McLaren, Paul J.; Menni, Cristina; Merger, Sigrun; Milani, Lili; Moayyeri, Alireza; Monda, Keri L.; Morken, Mario A.; Müller, Gabriele; Müller-Nurasyid, Martina; Musk, Arthur W.; Narisu, Narisu; Nauck, Matthias; Nolte, Ilja M.; Nöthen, Markus M.; Oozageer, Laticia; Pilz, Stefan; Rayner, Nigel W.; Renstrom, Frida; Robertson, Neil R.; Rose, Lynda M.; Roussel, Ronan; Sanna, Serena; Scharnagl, Hubert; Scholtens, Salome; Schumacher, Fredrick R.; Schunkert, Heribert; Scott, Robert A.; Sehmi, Joban; Seufferlein, Thomas; Shi, Jianxin; Silventoinen, Karri; Smit, Johannes H.; Smith, Albert Vernon; Smolonska, Joanna; Stanton, Alice V.; Stirrups, Kathleen; Stott, David J.; Stringham, Heather M.; Sundström, Johan; Swertz, Morris A.; Syvänen, Ann-Christine; Tayo, Bamidele O.; Thorleifsson, Gudmar; Tyrer, Jonathan P.; van Dijk, Suzanne; van Schoor, Natasja M.; van der Velde, Nathalie; van Heemst, Diana; van Oort, Floor V. 
A.; Vermeulen, Sita H.; Verweij, Niek; Vonk, Judith M.; Waite, Lindsay L.; Waldenberger, Melanie; Wennauer, Roman; Wilkens, Lynne R.; Willenborg, Christina; Wilsgaard, Tom; Wojczynski, Mary K.; Wong, Andrew; Wright, Alan F.; Zhang, Qunyuan; Arveiler, Dominique; Bakker, Stephan J. L.; Beilby, John; Bergman, Richard N.; Bergmann, Sven; Biffar, Reiner; Blangero, John; Boomsma, Dorret I.; Bornstein, Stefan R.; Bovet, Pascal; Brambilla, Paolo; Brown, Morris J.; Campbell, Harry; Caulfield, Mark J.; Chakravarti, Aravinda; Collins, Rory; Collins, Francis S.; Crawford, Dana C.; Cupples, L. Adrienne; Danesh, John; de Faire, Ulf; den Ruijter, Hester M.; Erbel, Raimund; Erdmann, Jeanette; Eriksson, Johan G.; Farrall, Martin; Ferrannini, Ele; Ferrières, Jean; Ford, Ian; Forouhi, Nita G.; Forrester, Terrence; Gansevoort, Ron T.; Gejman, Pablo V.; Gieger, Christian; Golay, Alain; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Haas, David W.; Hall, Alistair S.; Harris, Tamara B.; Hattersley, Andrew T.; Heath, Andrew C.; Hengstenberg, Christian; Hicks, Andrew A.; Hindorff, Lucia A.; Hingorani, Aroon D.; Hofman, Albert; Hovingh, G. Kees; Humphries, Steve E.; Hunt, Steven C.; Hypponen, Elina; Jacobs, Kevin B.; Jarvelin, Marjo-Riitta; Jousilahti, Pekka; Jula, Antti M.; Kaprio, Jaakko; Kastelein, John J. P.; Kayser, Manfred; Kee, Frank; Keinanen-Kiukaanniemi, Sirkka M.; Kiemeney, Lambertus A.; Kooner, Jaspal S.; Kooperberg, Charles; Koskinen, Seppo; Kovacs, Peter; Kraja, Aldi T.; Kumari, Meena; Kuusisto, Johanna; Lakka, Timo A.; Langenberg, Claudia; Le Marchand, Loic; Lehtimäki, Terho; Lupoli, Sara; Madden, Pamela A. 
F.; Männistö, Satu; Manunta, Paolo; Marette, André; Matise, Tara C.; McKnight, Barbara; Meitinger, Thomas; Moll, Frans L.; Montgomery, Grant W.; Morris, Andrew D.; Morris, Andrew P.; Murray, Jeffrey C.; Nelis, Mari; Ohlsson, Claes; Oldehinkel, Albertine J.; Ong, Ken K.; Ouwehand, Willem H.; Pasterkamp, Gerard; Peters, Annette; Pramstaller, Peter P.; Price, Jackie F.; Qi, Lu; Raitakari, Olli T.; Rankinen, Tuomo; Rao, D. C.; Rice, Treva K.; Ritchie, Marylyn; Rudan, Igor; Salomaa, Veikko; Samani, Nilesh J.; Saramies, Jouko; Sarzynski, Mark A.; Schwarz, Peter E. H.; Sebert, Sylvain; Sever, Peter; Shuldiner, Alan R.; Sinisalo, Juha; Steinthorsdottir, Valgerdur; Stolk, Ronald P.; Tardif, Jean-Claude; Tönjes, Anke; Tremblay, Angelo; Tremoli, Elena; Virtamo, Jarmo; Vohl, Marie-Claude; Amouyel, Philippe; Asselbergs, Folkert W.; Assimes, Themistocles L.; Bochud, Murielle; Boehm, Bernhard O.; Boerwinkle, Eric; Bottinger, Erwin P.; Bouchard, Claude; Cauchi, Stéphane; Chambers, John C.; Chanock, Stephen J.; Cooper, Richard S.; de Bakker, Paul I. W.; Dedoussis, George; Ferrucci, Luigi; Franks, Paul W.; Froguel, Philippe; Groop, Leif C.; Haiman, Christopher A.; Hamsten, Anders; Hayes, M. Geoffrey; Hui, Jennie; Hunter, David J.; Hveem, Kristian; Jukema, J. Wouter; Kaplan, Robert C.; Kivimaki, Mika; Kuh, Diana; Laakso, Markku; Liu, Yongmei; Martin, Nicholas G.; März, Winfried; Melbye, Mads; Moebus, Susanne; Munroe, Patricia B.; Njølstad, Inger; Oostra, Ben A.; Palmer, Colin N. A.; Pedersen, Nancy L.; Perola, Markus; Pérusse, Louis; Peters, Ulrike; Powell, Joseph E.; Power, Chris; Quertermous, Thomas; Rauramaa, Rainer; Reinmaa, Eva; Ridker, Paul M.; Rivadeneira, Fernando; Rotter, Jerome I.; Saaristo, Timo E.; Saleheen, Danish; Schlessinger, David; Slagboom, P. 
Eline; Snieder, Harold; Spector, Tim D.; Strauch, Konstantin; Stumvoll, Michael; Tuomilehto, Jaakko; Uusitupa, Matti; van der Harst, Pim; Völzke, Henry; Walker, Mark; Wareham, Nicholas J.; Watkins, Hugh; Wichmann, H.-Erich; Wilson, James F.; Zanen, Pieter; Deloukas, Panos; Heid, Iris M.; Lindgren, Cecilia M.; Mohlke, Karen L.; Speliotes, Elizabeth K.; Thorsteinsdottir, Unnur; Barroso, Inês; Fox, Caroline S.; North, Kari E.; Strachan, David P.; Beckmann, Jacques S.; Berndt, Sonja I.; Boehnke, Michael; Borecki, Ingrid B.; McCarthy, Mark I.; Metspalu, Andres; Stefansson, Kari; Uitterlinden, André G.; van Duijn, Cornelia M.; Franke, Lude; Willer, Cristen J.; Price, Alkes L.; Lettre, Guillaume; Loos, Ruth J. F.; Weedon, Michael N.; Ingelsson, Erik; O'Connell, Jeffrey R.; Abecasis, Goncalo R.; Chasman, Daniel I.; Goddard, Michael E.; Visscher, Peter M.; Hirschhorn, Joel N.; Frayling, Timothy M.; McCarty, Catherine A.; Starren, Justin; Peissig, Peggy; Berg, Richard; Rasmussen, Luke; Linneman, James; Miller, Aaron; Choudary, Vidhu; Chen, Lin; Waudby, Carol; Kitchner, Terrie; Reeser, Jonathan; Fost, Norman; Wilke, Russell A.; Chisholm, Rex L.; Avila, Pedro C.; Greenland, Philip; Hayes, M. 
Geoff; Kho, Abel; Kibbe, Warren A.; Lemke, Amy A.; Lowe, William L.; Smith, Maureen E.; Wolf, Wendy A.; Pacheco, Jennifer A.; Thompson, William K.; Humowiecki, Joel; Law, May; Chute, Christopher; Kullo, Iftikar; Koenig, Barbara; de Andrade, Mariza; Bielinski, Suzette; Pathak, Jyotishman; Savova, Guergana; Wu, Joel; Henriksen, Joan; Ding, Keyue; Hart, Lacey; Palbicki, Jeremy; Larson, Eric B.; Newton, Katherine; Ludman, Evette; Spangler, Leslie; Hart, Gene; Carrell, David; Jarvik, Gail; Crane, Paul; Burke, Wylie; Fullerton, Stephanie Malia; Trinidad, Susan Brown; Carlson, Chris; Hutchinson, Fred; McDavid, Andrew; Roden, Dan M.; Clayton, Ellen; Haines, Jonathan L.; Masys, Daniel R.; Churchill, Larry R.; Cornfield, Daniel; Crawford, Dana; Darbar, Dawood; Denny, Joshua C.; Malin, Bradley A.; Ritchie, Marylyn D.; Schildcrout, Jonathan S.; Xu, Hua; Ramirez, Andrea Havens; Basford, Melissa; Pulley, Jill; Alizadeh, Behrooz Z.; de Boer, Rudolf A.; Boezen, H. Marike; van der Klauw, Melanie M.; Navis, Gerjan; Ormel, Johan; Postma, Dirkje S.; Rosmalen, Judith G. M.; Slaets, Joris P.; Wolffenbuttel, Bruce H. 
R.; Wijmenga, Cisca; Kathiresan, Sekar; Voight, Benjamin F.; Purcell, Shaun; Musunuru, Kiran; Ardissino, Diego; Mannucci, Pier M.; Anand, Sonia; Engert, James C.; Reilly, Muredach P.; Rader, Daniel J.; Morgan, Thomas; Spertus, John A.; Stoll, Monika; Girelli, Domenico; McKeown, Pascal P.; Patterson, Chris C.; Siscovick, David S.; O'Donnell, Christopher J.; Elosua, Roberto; Peltonen, Leena; Schwartz, Stephen M.; Melander, Olle; Altshuler, David; Merlini, Pier Angelica; Berzuini, Carlo; Bernardinelli, Luisa; Peyvandi, Flora; Tubaro, Marco; Celli, Patrizia; Ferrario, Maurizio; Fetiveau, Raffaela; Marziliano, Nicola; Casari, Giorgio; Galli, Michele; Ribichini, Flavio; Rossi, Marco; Bernardi, Francesco; Zonzin, Pietro; Piazza, Alberto; Yee, Jean; Friedlander, Yechiel; Marrugat, Jaume; Lucas, Gavin; Subirana, Isaac; Sala, Joan; Ramos, Rafael; Meigs, James B.; Williams, Gordon; Nathan, David M.; MacRae, Calum A.; Havulinna, Aki S.; Berglund, Goran; Asselta, Rosanna; Duga, Stefano; Spreafico, Marta; Daly, Mark J.; Nemesh, James; Korn, Joshua M.; McCarroll, Steven A.; Surti, Aarti; Guiducci, Candace; Gianniny, Lauren; Mirel, Daniel; Parkin, Melissa; Burtt, Noel; Gabriel, Stacey B.; Thompson, John R.; Braund, Peter S.; Wright, Benjamin J.; Balmforth, Anthony J.; Ball, Stephen G.; Schunkert, I. 
Heribert; Linsel-Nitschke, Patrick; Lieb, Wolfgang; Ziegler, Andreas; König, Inke R.; Fischer, Marcus; Stark, Klaus; Grosshennig, Anika; Preuss, Michael; Schreiber, Stefan; Ouwehand, Willem; Scholz, Michael; Cambien, Francois; Goodall, Alison; Li, Mingyao; Chen, Zhen; Wilensky, Robert; Matthai, William; Qasim, Atif; Hakonarson, Hakon H.; Devaney, Joe; Burnett, Mary-Susan; Pichard, Augusto D.; Kent, Kenneth M.; Satler, Lowell; Lindsay, Joseph M.; Waksman, Ron; Knouff, Christopher W.; Waterworth, Dawn M.; Walker, Max C.; Mooser, Vincent; Epstein, Stephen E.; Scheffold, Thomas; Berger, Klaus; Huge, Andreas; Martinelli, Nicola; Olivieri, Oliviero; Corrocher, Roberto; Hólm, Hilma; Do, Ron; Xie, Changchun; Siscovick, David; Matise, Tara; Buyske, Steve; Higashio, Julia; Williams, Rasheeda; Nato, Andrew; Ambite, Jose Luis; Deelman, Ewa; Manolio, Teri; Hindorff, Lucia; Heiss, Gerardo; Taylor, Kira; Franceschini, Nora; Avery, Christy; Graff, Misa; Lin, Danyu; Quibrera, Miguel; Cochran, Barbara; Kao, Linda; Umans, Jason; Cole, Shelley; MacCluer, Jean; Person, Sharina; Pankow, James; Gross, Myron; Fornage, Myriam; Durda, Peter; Jenny, Nancy; Patsy, Bruce; Arnold, Alice; Buzkova, Petra; Haines, Jonathan; Murdock, Deborah; Glenn, Kim; Brown-Gentry, Kristin; Thornton-Wells, Tricia; Dumitrescu, Logan; Bush, William S.; Mitchell, Sabrina L.; Goodloe, Robert; Wilson, Sarah; Boston, Jonathan; Malinowski, Jennifer; Restrepo, Nicole; Oetjens, Matthew; Fowke, Jay; Zheng, Wei; Spencer, Kylee; Pendergrass, Sarah; Le Marchand, Loïc; Wilkens, Lynne; Park, Lani; Tiirikainen, Maarit; Kolonel, Laurence; Cheng, Iona; Wang, Hansong; Shohet, Ralph; Haiman, Christopher; Stram, Daniel; Henderson, Brian; Monroe, Kristine; Schumacher, Fredrick; Anderson, Garnet; Prentice, Ross; LaCroix, Andrea; Wu, Chunyuan; Carty, Cara; Rosse, Stephanie; Young, Alicia; Haessler, Jeff; Kocarnik, Jonathan; Lin, Yi; Jackson, Rebecca; Duggan, David; Kuller, Lew

    2014-01-01

    Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ∼2,000, ∼3,700

  17. Adaptive genomic divergence under high gene flow between freshwater and brackish-water ecotypes of prickly sculpin (Cottus asper) revealed by Pool-Seq.

    Science.gov (United States)

    Dennenmoser, Stefan; Vamosi, Steven M; Nolte, Arne W; Rogers, Sean M

    2017-01-01

    Understanding the genomic basis of adaptive divergence in the presence of gene flow remains a major challenge in evolutionary biology. In prickly sculpin (Cottus asper), an abundant euryhaline fish in northwestern North America, high genetic connectivity among brackish-water (estuarine) and freshwater (tributary) habitats of coastal rivers does not preclude the build-up of neutral genetic differentiation and emergence of different life history strategies. Because these two habitats present different osmotic niches, we predicted high genetic differentiation at known teleost candidate genes underlying salinity tolerance and osmoregulation. We applied whole-genome sequencing of pooled DNA samples (Pool-Seq) to explore adaptive divergence between two estuarine and two tributary habitats. Paired-end sequence reads were mapped against genomic contigs of European Cottus, and the gene content of candidate regions was explored based on comparisons with the threespine stickleback genome. Genes showing signals of repeated differentiation among brackish-water and freshwater habitats included functions such as ion transport and structural permeability in freshwater gills, which suggests that local adaptation to different osmotic niches might contribute to genomic divergence among habitats. Overall, the presence of both repeated and unique signatures of differentiation across many loci scattered throughout the genome is consistent with polygenic adaptation from standing genetic variation and locally variable selection pressures in the early stages of life history divergence. © 2016 John Wiley & Sons Ltd.

  18. Genome-wide characterization of genetic variants and putative regions under selection in meat and egg-type chicken lines.

    Science.gov (United States)

    Boschiero, Clarissa; Moreira, Gabriel Costa Monteiro; Gheyas, Almas Ara; Godoy, Thaís Fernanda; Gasparin, Gustavo; Mariani, Pilar Drummond Sampaio Corrêa; Paduan, Marcela; Cesar, Aline Silva Mello; Ledur, Mônica Corrêa; Coutinho, Luiz Lehmann

    2018-01-25

    Meat and egg-type chickens have been selected for several generations for different traits. Artificial and natural selection for different phenotypes can change the frequency of genetic variants, leaving particular genomic footprints throughout the genome. Thus, the aims of this study were to sequence 28 chickens from two Brazilian lines (meat and white egg-type) and use this information to characterize genome-wide genetic variations, identify putative regions under selection using the Fst method, and find putative pathways under selection. A total of 13.93 million SNPs and 1.36 million INDELs were identified, with more variants detected from the broiler (meat-type) line. Although most were located in non-coding regions, we identified 7255 intolerant non-synonymous SNPs, 512 stopgain/loss SNPs, 1381 frameshift and 1094 non-frameshift INDELs that may alter protein functions. Genes harboring intolerant non-synonymous SNPs affected metabolic pathways related mainly to reproduction and endocrine systems in the white-egg layer line, and lipid metabolism and metabolic diseases in the broiler line. Fst analysis in sliding windows, using SNPs and INDELs separately, identified over 300 putative regions of selection overlapping with more than 250 genes. For the first time in chicken, INDEL variants were considered for selection signature analysis, showing a high level of correlation in results between SNP and INDEL data. The putative regions of selection signatures revealed interesting candidate genes and pathways related to important phenotypic traits in chicken, such as lipid metabolism, growth, reproduction, and cardiac development. In this study, the Fst method was applied to identify high confidence putative regions under selection, providing novel insights into selection footprints that can help elucidate the functional mechanisms underlying different phenotypic traits relevant to meat and egg-type chicken lines.
In addition, we generated a large catalog of line-specific and common

  19. Whole-genome sequencing reveals mutational landscape underlying phenotypic differences between two widespread Chinese cattle breeds.

    Directory of Open Access Journals (Sweden)

    Yao Xu

    Full Text Available Whole-genome sequencing provides a powerful tool to obtain more genetic variability that could produce a range of benefits for the cattle breeding industry. Nanyang (Bos indicus) and Qinchuan (Bos taurus) are two important Chinese indigenous cattle breeds with distinct phenotypes. To identify the genetic characteristics responsible for variation in phenotypes between the two breeds, in the present study we sequenced, for the first time, the genomes of four Nanyang and four Qinchuan cattle at 10- to 12-fold depth, with on average 97.86% and 98.98% genome coverage, respectively. Comparison with the Bos_taurus_UMD_3.1 reference assembly yielded 9,010,096 SNPs for Nanyang and 6,965,062 for Qinchuan cattle, 51% and 29% of which were novel SNPs, respectively. A total of 154,934 and 115,032 small indels (1 to 3 bp) were found in the Nanyang and Qinchuan genomes, respectively. The SNP and indel distribution revealed that Nanyang showed high genetic diversity compared to Qinchuan cattle. Furthermore, a total of 2,907 putative cases of copy number variation (CNV) were identified by aligning the Nanyang to the Qinchuan genome, 783 of which (27%) encompassed the coding regions of 495 functional genes. The gene ontology (GO) analysis revealed that many CNV genes were enriched in the immune system and environment adaptability. Among several CNV genes related to lipid transport and fat metabolism, the Leptin receptor gene (LEPR), overlapping with CNV_1815, showed remarkably higher copy number in Qinchuan than Nanyang (log2(ratio) = -2.34988; P value = 1.53E-102). Further qPCR and association analysis showed that the copy number of the LEPR gene was positively correlated with transcriptional expression and phenotypic traits, suggesting the LEPR CNV may contribute to the higher fat deposition in muscles of Qinchuan cattle.
Our findings provide evidence that the distinct phenotypes of Nanyang and Qinchuan breeds may be due to the different genetic variations including SNPs

  20. Chromosomal Copy Number Variation in Saccharomyces pastorianus Is Evidence for Extensive Genome Dynamics in Industrial Lager Brewing Strains.

    Science.gov (United States)

    van den Broek, M; Bolat, I; Nijkamp, J F; Ramos, E; Luttik, M A H; Koopman, F; Geertman, J M; de Ridder, D; Pronk, J T; Daran, J-M

    2015-09-01

    Lager brewing strains of Saccharomyces pastorianus are natural interspecific hybrids originating from the spontaneous hybridization of Saccharomyces cerevisiae and Saccharomyces eubayanus. Over the past 500 years, S. pastorianus has been domesticated to become one of the most important industrial microorganisms. Production of lager-type beers requires a set of essential phenotypes, including the ability to ferment maltose and maltotriose at low temperature, the production of flavors and aromas, and the ability to flocculate. Understanding of the molecular basis of complex brewing-related phenotypic traits is a prerequisite for rational strain improvement. While genome sequences have been reported, the variability and dynamics of S. pastorianus genomes have not been investigated in detail. Here, using deep sequencing and chromosome copy number analysis, we showed that S. pastorianus strain CBS1483 exhibited extensive aneuploidy. This was confirmed by quantitative PCR and by flow cytometry. As a direct consequence of this aneuploidy, a massive number of sequence variants was identified, leading to at least 1,800 additional protein variants in S. pastorianus CBS1483. Analysis of eight additional S. pastorianus strains revealed that the previously defined group I strains showed comparable karyotypes, while group II strains showed large interstrain karyotypic variability. Comparison of three strains with nearly identical genome sequences revealed substantial chromosome copy number variation, which may contribute to strain-specific phenotypic traits. The observed variability of lager yeast genomes demonstrates that systematic linking of genotype to phenotype requires a three-dimensional genome analysis encompassing physical chromosomal structures, the copy number of individual chromosomes or chromosomal regions, and the allelic variation of copies of individual genes. Copyright © 2015, van den Broek et al.

  1. Spatial variation in the parasite communities and genomic structure of urban rats in New York City.

    Science.gov (United States)

    Angley, L P; Combs, M; Firth, C; Frye, M J; Lipkin, I; Richardson, J L; Munshi-South, J

    2018-02-01

    Brown rats (Rattus norvegicus) are a globally distributed pest. Urban habitats can support large infestations of rats, posing a potential risk to public health from the parasites and pathogens they carry. Despite the potential influence of rodent-borne zoonotic diseases on human health, it is unclear how urban habitats affect the structure and transmission dynamics of ectoparasite and microbial communities (all referred to as "parasites" hereafter) among rat colonies. In this study, we use ecological data on parasites and genomic sequencing of their rat hosts to examine associations between spatial proximity, genetic relatedness and the parasite communities associated with 133 rats at five sites in sections of New York City with persistent rat infestations. We build on previous work showing that rats in New York carry a wide variety of parasites and report that these communities differ significantly among sites, even across small geographical distances. Ectoparasite community similarity was positively associated with geographical proximity; however, there was no general association between distance and microbial communities of rats. Sites with greater overall parasite diversity also had rats with greater infection levels and parasite species richness. Parasite community similarity among sites was not linked to genetic relatedness of rats, suggesting that these communities are not associated with genetic similarity among host individuals or host dispersal among sites. Discriminant analysis identified site-specific associations of several parasite species, suggesting that the presence of some species within parasite communities may allow researchers to determine the sites of origin for newly sampled rats. The results of our study help clarify the roles that colony structure and geographical proximity play in determining the ecology of R. norvegicus as a significant urban reservoir of zoonotic diseases. 
Our study also highlights the spatial variation present in urban

  2. Positive and negative variations in capacitive images for given defects under varying experimental conditions

    Science.gov (United States)

    Li, Chen; Yin, Xiaokang; Li, Zhen; Li, Wei; Chen, Guoming

    2018-04-01

    The capacitive imaging (CI) technique is a novel electromagnetic NDE technique. The quasi-static electromagnetic field from the carefully designed electrode pair varies when the electrical properties of the sample change, which makes imaging possible. It is observed that for a given specimen, the targeted features appear as different variations in capacitive images under different experimental conditions. In some cases, even opposite variations occur, which confuses the interpretation of indications. It is therefore worthwhile to investigate the causes and effects of the negative variation phenomenon. In this work, the positive and negative variations were first explained from the measurement sensitivity distribution perspective. This was then followed by a detailed analysis using finite element models in COMSOL. A parametric experimental study on a glass fiber composite plate with artificial defects was then carried out to investigate how the experimental conditions affect the variation.

  3. Genetic basis for spontaneous hybrid genome doubling during allopolyploid speciation of common wheat shown by natural variation analyses of the paternal species.

    Directory of Open Access Journals (Sweden)

    Yoshihiro Matsuoka

    Full Text Available The complex process of allopolyploid speciation includes various mechanisms ranging from species crosses and hybrid genome doubling to genome alterations and the establishment of new allopolyploids as persisting natural entities. Currently, little is known about the genetic mechanisms that underlie hybrid genome doubling, despite the fact that natural allopolyploid formation is highly dependent on this phenomenon. We examined the genetic basis for the spontaneous genome doubling of triploid F1 hybrids between the direct ancestors of allohexaploid common wheat (Triticum aestivum L., AABBDD genome), namely Triticum turgidum L. (AABB genome) and Aegilops tauschii Coss. (DD genome). An Ae. tauschii intraspecific lineage that is closely related to the D genome of common wheat was identified by population-based analysis. Two representative accessions, one that produces a high-genome-doubling-frequency hybrid when crossed with a T. turgidum cultivar and the other that produces a low-genome-doubling-frequency hybrid with the same cultivar, were chosen from that lineage for further analyses. A series of investigations including fertility analysis, immunostaining, and quantitative trait locus (QTL) analysis showed that (1) production of functional unreduced gametes through nonreductional meiosis is an early step key to successful hybrid genome doubling, (2) first division restitution is one of the cytological mechanisms that cause meiotic nonreduction during the production of functional male unreduced gametes, and (3) six QTLs in the Ae. tauschii genome, most of which likely regulate nonreductional meiosis and its subsequent gamete production processes, are involved in hybrid genome doubling. Interlineage comparisons of Ae. tauschii's ability to cause hybrid genome doubling suggested an evolutionary model for the natural variation pattern of the trait in which non-deleterious mutations in six QTLs may have important roles.
The findings of this study demonstrated

  4. Genomic Heterogeneity of Methicillin Resistant Staphylococcus aureus Associated with Variation in Severity of Illness among Children with Acute Hematogenous Osteomyelitis.

    Directory of Open Access Journals (Sweden)

    Claudia Gaviria-Agudelo

    Full Text Available The association between severity of illness of children with osteomyelitis caused by Methicillin-resistant Staphylococcus aureus (MRSA) and genomic variation of the causative organism has not been previously investigated. The purpose of this study is to assess genomic heterogeneity among MRSA isolates from children with osteomyelitis who have diverse severity of illness. Children with osteomyelitis were prospectively studied between 2010 and 2011. Severity of illness of the affected children was determined from clinical and laboratory parameters. MRSA isolates were analyzed with next generation sequencing (NGS) and optical mapping. Sequence data were used for multi-locus sequence typing (MLST), phylogenetic analysis by maximum likelihood (PAML), and identification of virulence genes and single nucleotide polymorphisms (SNPs) relative to reference strains. The twelve children studied demonstrated severity of illness scores ranging from 0 (mild) to 9 (severe). All isolates were USA300, ST 8, SCCmec IVa MRSA by MLST. The isolates differed from reference strains by 2 insertions (40 Kb each) and 2 deletions (10 and 25 Kb) but had no rearrangements or copy number variations. There was a higher occurrence of virulence genes among study isolates when compared to the reference strains (p = 0.0124). There was an average of 11 nonsynonymous SNPs per strain. PAML demonstrated heterogeneity of study isolates from each other and from the reference strains. Genomic heterogeneity exists among MRSA isolates causing osteomyelitis among children in a single community. These variations may play a role in the pathogenesis of variation in clinical severity among these children.

  5. Genome-wide analysis of macrosatellite repeat copy number variation in worldwide populations: Evidence for differences and commonalities in size distributions and size restrictions

    NARCIS (Netherlands)

    M. Schaap (Michiel); R.J.L.F. Lemmers (Richard); R. Maassen (Roel); P.J. van der Vliet (Patrick); L.F. Hoogerheide (Lennart); H.K. van Dijk (Herman); N. Basturk (Nalan); P. de Knijff (Peter); S.M. van der Maarel (Silvère)

    2013-01-01

    Background: Macrosatellite repeats (MSRs), usually spanning hundreds of kilobases of genomic DNA, comprise a significant proportion of the human genome. Because of their highly polymorphic nature, MSRs represent an extreme example of copy number variation, but their structure and

  6. Genome-wide analysis of macrosatellite repeat copy number variation in worldwide populations: evidence for differences and commonalities in size distributions and size restrictions

    NARCIS (Netherlands)

    Schaap, M.; Lemmers, R.J.L.F.; Maassen, R.; van der Vliet, P.J.; Hoogerheide, L.F.; van Dijk, H.K.; Basturk, N.; de Knijff, P.; van der Maarel, S.M.

    2013-01-01

    Background: Macrosatellite repeats (MSRs), usually spanning hundreds of kilobases of genomic DNA, comprise a significant proportion of the human genome. Because of their highly polymorphic nature, MSRs represent an extreme example of copy number variation, but their structure and function is largely

  7. Genomes

    National Research Council Canada - National Science Library

    Brown, T. A. (Terence A.)

    2002-01-01

    ... of genome expression and replication processes, and transcriptomics and proteomics. This text is richly illustrated with clear, easy-to-follow, full color diagrams, which are downloadable from the book's website...

  8. Dynamics of chromosome number and genome size variation in a cytogenetically variable sedge (Carex scoparia var. scoparia, Cyperaceae).

    Science.gov (United States)

    Chung, Kyong-Sook; Weber, Jaime A; Hipp, Andrew L

    2011-01-01

    High intraspecific cytogenetic variation in the sedge genus Carex (Cyperaceae) is hypothesized to be due to the "diffuse" or non-localized centromeres, which facilitate chromosome fission and fusion. If chromosome number changes are dominated by fission and fusion, then chromosome evolution will result primarily in changes in the potential for recombination among populations. Chromosome duplications, on the other hand, entail consequent opportunities for divergent evolution of paralogs. In this study, we evaluate whether genome size and chromosome number covary within species. We used flow cytometry to estimate genome sizes in Carex scoparia var. scoparia, sampling 99 plants (23 populations) in the Chicago region, and we used meiotic chromosome observations to document chromosome numbers and chromosome pairing relations. Chromosome numbers range from 2n = 62 to 2n = 68, and nuclear DNA 1C content from 0.342 to 0.361 pg DNA. Regressions of DNA content on chromosome number are nonsignificant for data analyzed by individual or population, and a regression model that excludes slope is favored over a model in which chromosome number predicts genome size. Chromosome rearrangements within cytogenetically variable Carex species are more likely a consequence of fission and fusion than of duplication and deletion. Moreover, neither genome size nor chromosome number is spatially autocorrelated, which suggests the potential for rapid chromosome evolution by fission and fusion at a relatively fine geographic scale (<350 km). These findings have important implications for ecological restoration and speciation within the largest angiosperm genus of the temperate zone.

  9. Novel genetic loci underlying human intracranial volume identified through genome-wide association

    OpenAIRE

    Adams, Hieab HH; Hibar, Derrek P; Chouraki, Vincent; Stein, Jason L; Nyquist, Paul A; Rentería, Miguel E; Trompet, Stella; Arias-Vasquez, Alejandro; Seshadri, Sudha; Desrivières, Sylvane; Beecham, Ashley H; Jahanshad, Neda; Wittfeld, Katharina; Van der Lee, Sven J; Abramovic, Lucija

    2016-01-01

    Intracranial volume reflects the maximally attained brain size during development, and remains stable with loss of tissue in late life. It is highly heritable, but the underlying genes remain largely undetermined. In a genome-wide association study of 32,438 adults, we discovered five previously unknown loci for intracranial volume and confirmed two known signals. Four of the loci were also associated with adult human stature, but these remained associated with intracranial volume after adjus...

  10. Genomic Microdiversity of Bifidobacterium pseudocatenulatum Underlying Differential Strain-Level Responses to Dietary Carbohydrate Intervention

    Directory of Open Access Journals (Sweden)

    Guojun Wu

    2017-02-01

    Full Text Available The genomic basis of the response to dietary intervention of human gut beneficial bacteria remains elusive, which hinders precise manipulation of the microbiota for human health. After receiving a dietary intervention enriched with nondigestible carbohydrates for 105 days, a genetically obese child with Prader-Willi syndrome lost 18.4% of his body weight and showed significant improvement in his bioclinical parameters. We obtained five isolates (C1, C15, C55, C62, and C95) of one of the most abundantly promoted beneficial species, Bifidobacterium pseudocatenulatum, from a postintervention fecal sample. Intriguingly, these five B. pseudocatenulatum strains showed differential responses during the dietary intervention. Two strains were largely unaffected, while the other three were promoted to different extents by the changes in dietary carbohydrate resources. The differential responses of these strains were consistent with their functional clustering based on the COGs (Clusters of Orthologous Groups), including those involved with the ABC-type sugar transport systems, suggesting that the strain-specific genomic variations may have contributed to the niche adaption. Particularly, B. pseudocatenulatum C15, which had the most diverse types and highest gene copy numbers of carbohydrate-active enzymes targeting plant polysaccharides, had the highest abundance after the dietary intervention. These studies show the importance of understanding genomic diversity of specific members of the gut microbiota if precise nutrition approaches are to be realized.

  11. Comparative analysis of the domestic cat genome reveals genetic signatures underlying feline biology and domestication

    Science.gov (United States)

    Li, Gang; Gandolfi, Barbara; Khan, Razib; Aken, Bronwen L.; Searle, Steven M. J.; Minx, Patrick; Hillier, LaDeana W.; Koboldt, Daniel C.; Davis, Brian W.; Driscoll, Carlos A.; Barr, Christina S.; Blackistone, Kevin; Quilez, Javier; Lorente-Galdos, Belen; Marques-Bonet, Tomas; Alkan, Can; Thomas, Gregg W. C.; Hahn, Matthew W.; Menotti-Raymond, Marilyn; O’Brien, Stephen J.; Wilson, Richard K.; Lyons, Leslie A.; Murphy, William J.; Warren, Wesley C.

    2014-01-01

    Little is known about the genetic changes that distinguish domestic cat populations from their wild progenitors. Here we describe a high-quality domestic cat reference genome assembly and comparative inferences made with other cat breeds, wildcats, and other mammals. Based upon these comparisons, we identified positively selected genes enriched for genes involved in lipid metabolism that underpin adaptations to a hypercarnivorous diet. We also found positive selection signals within genes underlying sensory processes, especially those affecting vision and hearing in the carnivore lineage. We observed an evolutionary tradeoff between functional olfactory and vomeronasal receptor gene repertoires in the cat and dog genomes, with an expansion of the feline chemosensory system for detecting pheromones at the expense of odorant detection. Genomic regions harboring signatures of natural selection that distinguish domestic cats from their wild congeners are enriched in neural crest-related genes associated with behavior and reward in mouse models, as predicted by the domestication syndrome hypothesis. Our description of a previously unidentified allele for the gloving pigmentation pattern found in the Birman breed supports the hypothesis that cat breeds experienced strong selection on specific mutations drawn from random bred populations. Collectively, these findings provide insight into how the process of domestication altered the ancestral wildcat genome and build a resource for future disease mapping and phylogenomic studies across all members of the Felidae. PMID:25385592

  12. Comparative analysis of the domestic cat genome reveals genetic signatures underlying feline biology and domestication.

    Science.gov (United States)

    Montague, Michael J; Li, Gang; Gandolfi, Barbara; Khan, Razib; Aken, Bronwen L; Searle, Steven M J; Minx, Patrick; Hillier, LaDeana W; Koboldt, Daniel C; Davis, Brian W; Driscoll, Carlos A; Barr, Christina S; Blackistone, Kevin; Quilez, Javier; Lorente-Galdos, Belen; Marques-Bonet, Tomas; Alkan, Can; Thomas, Gregg W C; Hahn, Matthew W; Menotti-Raymond, Marilyn; O'Brien, Stephen J; Wilson, Richard K; Lyons, Leslie A; Murphy, William J; Warren, Wesley C

    2014-12-02

    Little is known about the genetic changes that distinguish domestic cat populations from their wild progenitors. Here we describe a high-quality domestic cat reference genome assembly and comparative inferences made with other cat breeds, wildcats, and other mammals. Based upon these comparisons, we identified positively selected genes enriched for genes involved in lipid metabolism that underpin adaptations to a hypercarnivorous diet. We also found positive selection signals within genes underlying sensory processes, especially those affecting vision and hearing in the carnivore lineage. We observed an evolutionary tradeoff between functional olfactory and vomeronasal receptor gene repertoires in the cat and dog genomes, with an expansion of the feline chemosensory system for detecting pheromones at the expense of odorant detection. Genomic regions harboring signatures of natural selection that distinguish domestic cats from their wild congeners are enriched in neural crest-related genes associated with behavior and reward in mouse models, as predicted by the domestication syndrome hypothesis. Our description of a previously unidentified allele for the gloving pigmentation pattern found in the Birman breed supports the hypothesis that cat breeds experienced strong selection on specific mutations drawn from random bred populations. Collectively, these findings provide insight into how the process of domestication altered the ancestral wildcat genome and build a resource for future disease mapping and phylogenomic studies across all members of the Felidae.

  13. The effect of subdivision on variation at multi-allelic loci under balancing selection

    DEFF Research Database (Denmark)

    Schierup, M H; Vekemans, X; Charlesworth, D

    2000-01-01

    Simulations are used to investigate the expected pattern of variation at loci under different forms of multi-allelic balancing selection in a finite island model of a subdivided population. The objective is to evaluate the effect of restricted migration among demes on the distribution of polymorp...

  14. Investigation of common, low-frequency and rare genome-wide variation in anorexia nervosa

    Science.gov (United States)

    Huckins, L M; Hatzikotoulas, K; Southam, L; Thornton, L M; Steinberg, J; Aguilera-McKay, F; Treasure, J; Schmidt, U; Gunasinghe, C; Romero, A; Curtis, C; Rhodes, D; Moens, J; Kalsi, G; Dempster, D; Leung, R; Keohane, A; Burghardt, R; Ehrlich, S; Hebebrand, J; Hinney, A; Ludolph, A; Walton, E; Deloukas, P; Hofman, A; Palotie, A; Palta, P; van Rooij, F J A; Stirrups, K; Adan, R; Boni, C; Cone, R; Dedoussis, G; van Furth, E; Gonidakis, F; Gorwood, P; Hudson, J; Kaprio, J; Kas, M; Keski-Rahonen, A; Kiezebrink, K; Knudsen, G-P; Slof-Op 't Landt, M C T; Maj, M; Monteleone, A M; Monteleone, P; Raevuori, A H; Reichborn-Kjennerud, T; Tozzi, F; Tsitsika, A; van Elburg, A; Adan, R A H; Alfredsson, L; Ando, T; Andreassen, O A; Aschauer, H; Baker, J H; Barrett, J C; Bencko, V; Bergen, A W; Berrettini, W H; Birgegard, A; Boni, C; Boraska Perica, V; Brandt, H; Breen, G; Bulik, C M; Carlberg, L; Cassina, M; Cichon, S; Clementi, M; Cohen-Woods, S; Coleman, J; Cone, R D; Courtet, P; Crawford, S; Crow, S; Crowley, J; Danner, U N; Davis, O S P; de Zwaan, M; Dedoussis, G; Degortes, D; DeSocio, J E; Dick, D M; Dikeos, D; Dina, C; Ding, B; Dmitrzak-Weglarz, M; Docampo, E; Duncan, L; Egberts, K; Ehrlich, S; Escaramís, G; Esko, T; Espeseth, T; Estivill, X; Favaro, A; Fernández-Aranda, F; Fichter, M M; Finan, C; Fischer, K; Floyd, J A B; Foretova, L; Forzan, M; Franklin, C S; Gallinger, S; Gambaro, G; Gaspar, H A; Giegling, I; Gonidakis, F; Gorwood, P; Gratacos, M; Guillaume, S; Guo, Y; Hakonarson, H; Halmi, K A; Hatzikotoulas, K; Hauser, J; Hebebrand, J; Helder, S; Herms, S; Herpertz-Dahlmann, B; Herzog, W; Hilliard, C E; Hinney, A; Hübel, C; Huckins, L M; Hudson, J I; Huemer, J; Inoko, H; Janout, V; Jiménez-Murcia, S; Johnson, C; Julià, A; Juréus, A; Kalsi, G; Kaminska, D; Kaplan, A S; Kaprio, J; Karhunen, L; Karwautz, A; Kas, M J H; Kaye, W; Kennedy, J L; Keski-Rahkonen, A; Kiezebrink, K; Klareskog, L; Klump, K L; Knudsen, G P S; Koeleman, B P C; Koubek, D; La Via, M C; Landén, M; Le Hellard, S; Levitan, R D; Li, D; Lichtenstein, P; Lilenfeld, L; Lissowska, J; Lundervold, A; Magistretti, P; Maj, M; Mannik, K; Marsal, S; Martin, N; Mattingsdal, M; McDevitt, S; McGuffin, P; Merl, E; Metspalu, A; Meulenbelt, I; Micali, N; Mitchell, J; Mitchell, K; Monteleone, P; Monteleone, A M; Mortensen, P; Munn-Chernoff, M A; Navratilova, M; Nilsson, I; Norring, C; Ntalla, I; Ophoff, R A; O'Toole, J K; Palotie, A; Pante, J; Papezova, H; Pinto, D; Rabionet, R; Raevuori, A; Rajewski, A; Ramoz, N; Rayner, N W; Reichborn-Kjennerud, T; Ripatti, S; Roberts, M; Rotondo, A; Rujescu, D; Rybakowski, F; Santonastaso, P; Scherag, A; Scherer, S W; Schmidt, U; Schork, N J; Schosser, A; Slachtova, L; Sladek, R; Slagboom, P E; Slof-Op 't Landt, M C T; Slopien, A; Soranzo, N; Southam, L; Steen, V M; Strengman, E; Strober, M; Sullivan, P F; Szatkiewicz, J P; Szeszenia-Dabrowska, N; Tachmazidou, I; Tenconi, E; Thornton, L M; Tortorella, A; Tozzi, F; Treasure, J; Tsitsika, A; Tziouvas, K; van Elburg, A A; van Furth, E F; Wagner, G; Walton, E; Watson, H; Wichmann, H-E; Widen, E; Woodside, D B; Yanovski, J; Yao, S; Yilmaz, Z; Zeggini, E; Zerwas, S; Zipfel, S; Collier, D A; Sullivan, P F; Breen, G; Bulik, C M; Zeggini, E

    2018-01-01

    Anorexia nervosa (AN) is a complex neuropsychiatric disorder presenting with dangerously low body weight, and a deep and persistent fear of gaining weight. To date, only one genome-wide significant locus associated with AN has been identified. We performed an exome-chip-based genome-wide association study (GWAS) in 2158 cases from nine populations of European origin and 15 485 ancestrally matched controls. Unlike previous studies, this GWAS also probed association in low-frequency and rare variants. Sixteen independent variants were taken forward for in silico and de novo replication (11 common and 5 rare). No findings reached genome-wide significance. Two notable common variants were identified: rs10791286, an intronic variant in OPCML (P=9.89 × 10^-6), and rs7700147, an intergenic variant (P=2.93 × 10^-5). No low-frequency variant associations were identified at genome-wide significance, although the study was well-powered to detect low-frequency variants with large effect sizes, suggesting that there may be no AN loci in this genomic search space with large effect sizes. PMID:29155802

  15. Population genomic analysis of strain variation in Leptospirillum group II bacteria involved in acid mine drainage formation.

    Science.gov (United States)

    Simmons, Sheri L; Dibartolo, Genevieve; Denef, Vincent J; Goltsman, Daniela S Aliaga; Thelen, Michael P; Banfield, Jillian F

    2008-07-22

    Deeply sampled community genomic (metagenomic) datasets enable comprehensive analysis of heterogeneity in natural microbial populations. In this study, we used sequence data obtained from the dominant member of a low-diversity natural chemoautotrophic microbial community to determine how coexisting closely related individuals differ from each other in terms of gene sequence and gene content, and to uncover evidence of evolutionary processes that occur over short timescales. DNA sequence obtained from an acid mine drainage biofilm was reconstructed, taking into account the effects of strain variation, to generate a nearly complete genome tiling path for a Leptospirillum group II species closely related to L. ferriphilum (sampling depth approximately 20x). The population is dominated by one sequence type, yet we detected evidence for relatively abundant variants (>99.5% sequence identity to the dominant type) at multiple loci, and a few rare variants. Blocks of other Leptospirillum group II types (approximately 94% sequence identity) have recombined into one or more variants. Variant blocks of both types are more numerous near the origin of replication. Heterogeneity in genetic potential within the population arises from localized variation in gene content, typically focused in integrated plasmid/phage-like regions. Some laterally transferred gene blocks encode physiologically important genes, including quorum-sensing genes of the LuxIR system. Overall, results suggest inter- and intrapopulation genetic exchange involving distinct parental genome types and implicate gain and loss of phage and plasmid genes in recent evolution of this Leptospirillum group II population. Population genetic analyses of single nucleotide polymorphisms indicate variation between closely related strains is not maintained by positive selection, suggesting that these regions do not represent adaptive differences between strains. Thus, the most likely explanation for the observed patterns of

  16. Population genomic analysis of strain variation in Leptospirillum group II bacteria involved in acid mine drainage formation.

    Directory of Open Access Journals (Sweden)

    Sheri L Simmons

    2008-07-01

    Full Text Available Deeply sampled community genomic (metagenomic) datasets enable comprehensive analysis of heterogeneity in natural microbial populations. In this study, we used sequence data obtained from the dominant member of a low-diversity natural chemoautotrophic microbial community to determine how coexisting closely related individuals differ from each other in terms of gene sequence and gene content, and to uncover evidence of evolutionary processes that occur over short timescales. DNA sequence obtained from an acid mine drainage biofilm was reconstructed, taking into account the effects of strain variation, to generate a nearly complete genome tiling path for a Leptospirillum group II species closely related to L. ferriphilum (sampling depth approximately 20x). The population is dominated by one sequence type, yet we detected evidence for relatively abundant variants (>99.5% sequence identity to the dominant type) at multiple loci, and a few rare variants. Blocks of other Leptospirillum group II types (approximately 94% sequence identity) have recombined into one or more variants. Variant blocks of both types are more numerous near the origin of replication. Heterogeneity in genetic potential within the population arises from localized variation in gene content, typically focused in integrated plasmid/phage-like regions. Some laterally transferred gene blocks encode physiologically important genes, including quorum-sensing genes of the LuxIR system. Overall, results suggest inter- and intrapopulation genetic exchange involving distinct parental genome types and implicate gain and loss of phage and plasmid genes in recent evolution of this Leptospirillum group II population. Population genetic analyses of single nucleotide polymorphisms indicate variation between closely related strains is not maintained by positive selection, suggesting that these regions do not represent adaptive differences between strains. Thus, the most likely explanation for the

  17. Population-Genomic Insights into Variation in Prevotella intermedia and Prevotella nigrescens Isolates and Its Association with Periodontal Disease

    Directory of Open Access Journals (Sweden)

    Yifei Zhang

    2017-09-01

    Full Text Available High-throughput sequencing has helped to reveal the close relationship between Prevotella and periodontal disease, but the roles of subspecies diversity and genomic variation within this genus in periodontal diseases still need to be investigated. We performed a comparative genome analysis of 48 Prevotella intermedia and Prevotella nigrescens isolates from the same cohort of subjects to identify the main drivers of their pathogenicity and adaptation to different environments. The comparisons were done between two species and between disease and health based on pooled sequences. The results showed that both P. intermedia and P. nigrescens have highly dynamic genomes and can take up various exogenous factors through horizontal gene transfer. The major differences between disease-derived and health-derived samples of P. intermedia and P. nigrescens were factors related to genome modification and recombination, indicating that the Prevotella isolates from disease sites may be more capable of genomic reconstruction. We also identified genetic elements specific to each sample, and found that disease groups had more unique virulence factors related to capsule and lipopolysaccharide synthesis, secretion systems, proteinases, and toxins, suggesting that strains from disease sites may have more specific virulence, particularly for P. intermedia. The differentially represented pathways between samples from disease and health were related to energy metabolism, carbohydrate and lipid metabolism, and amino acid metabolism, consistent with data from the whole subgingival microbiome in periodontal disease and health. Disease-derived samples had gained or lost several metabolic genes compared to healthy-derived samples, which could be linked with the difference in virulence performance between diseased and healthy sample groups. Our findings suggest that P. intermedia and P. nigrescens may serve as “crucial substances” in subgingival plaque, which may

  18. Population-Genomic Insights into Variation in Prevotella intermedia and Prevotella nigrescens Isolates and Its Association with Periodontal Disease.

    Science.gov (United States)

    Zhang, Yifei; Zhen, Min; Zhan, Yalin; Song, Yeqing; Zhang, Qian; Wang, Jinfeng

    2017-01-01

    High-throughput sequencing has helped to reveal the close relationship between Prevotella and periodontal disease, but the roles of subspecies diversity and genomic variation within this genus in periodontal diseases still need to be investigated. We performed a comparative genome analysis of 48 Prevotella intermedia and Prevotella nigrescens isolates from the same cohort of subjects to identify the main drivers of their pathogenicity and adaptation to different environments. The comparisons were done between two species and between disease and health based on pooled sequences. The results showed that both P. intermedia and P. nigrescens have highly dynamic genomes and can take up various exogenous factors through horizontal gene transfer. The major differences between disease-derived and health-derived samples of P. intermedia and P. nigrescens were factors related to genome modification and recombination, indicating that the Prevotella isolates from disease sites may be more capable of genomic reconstruction. We also identified genetic elements specific to each sample, and found that disease groups had more unique virulence factors related to capsule and lipopolysaccharide synthesis, secretion systems, proteinases, and toxins, suggesting that strains from disease sites may have more specific virulence, particularly for P. intermedia. The differentially represented pathways between samples from disease and health were related to energy metabolism, carbohydrate and lipid metabolism, and amino acid metabolism, consistent with data from the whole subgingival microbiome in periodontal disease and health. Disease-derived samples had gained or lost several metabolic genes compared to healthy-derived samples, which could be linked with the difference in virulence performance between diseased and healthy sample groups. Our findings suggest that P. intermedia and P. nigrescens may serve as "crucial substances" in subgingival plaque, which may reflect changes in

  19. Genomic regions under selection in crop-wild hybrids of lettuce: implications for crop breeding and environmental risk assessment

    NARCIS (Netherlands)

    Hartman, Y.

    2012-01-01

    The results of this thesis show that the probability of introgression of a putative transgene to wild relatives indeed depends strongly on the insertion location of the transgene. The study of genomic selection patterns can identify crop genomic regions under negative selection in multiple

  20. Reactive power control methods for improved reliability of wind power inverters under wind speed variations

    DEFF Research Database (Denmark)

    Ma, Ke; Liserre, Marco; Blaabjerg, Frede

    2012-01-01

    method to relieve the thermal cycling of power switching devices under severe wind speed variations, by circulating reactive power among the parallel power converters in a WTS or among the WTS's in a wind park. The amount of reactive power is adjusted to limit the junction temperature fluctuation...

  1. Variation of topical application to skin under good clinical practice (GCP)

    DEFF Research Database (Denmark)

    Vind-Kezunovic, Dina; Serup, Jørgen Vedelskov

    2016-01-01

    INTRODUCTION: Application of topical products by individuals is inherently variable and accurate dosing can be difficult to control. Variation of the dose used under optimal conditions in drug trials is unknown. METHODS: This trial was part of a double-blind, randomized, placebo-controlled good...

  2. Basic Density and Strength Properties Variations in Cordia Africana (Lam) Grown Under Agroforestry in Arumeru, Tanzania

    NARCIS (Netherlands)

    Mahonge, C.P.I.

    2007-01-01

    Variations in basic density and strength properties of Cordia africana (Lam.) grown under agroforestry in Arumeru district, Arusha, Tanzania were determined. The tree sampling procedure and data collection were based on standard methods (ISO 3129 of 1975). The main results indicated that basic density increased

  3. Ancestry variation and footprints of natural selection along the genome in Latin American populations.

    Science.gov (United States)

    Deng, Lian; Ruiz-Linares, Andrés; Xu, Shuhua; Wang, Sijia

    2016-02-18

    Latin American populations stem from the admixture of Europeans, Africans and Native Americans, which started over 400 years ago and lasted for several centuries. Extreme deviation from the genome-wide average in ancestry estimations at certain genomic locations could reflect recent natural selection. We evaluated the distribution of ancestry estimations using 678 genome-wide microsatellite markers in 249 individuals from 13 admixed populations across Latin America. We found significant deviations in ancestry estimations, including three locations more than 3.5 standard deviations from the genome-wide average: an excess of European ancestry at 1p36 and 14q32, and an excess of African ancestry at 6p22. Using simulations, we could show that at least the deviation at 6p22 was unlikely to result from genetic drift alone. By applying different linguistic groups as well as the most likely ancestral Native American populations as the ancestry, we showed that the choice of Native American ancestry could affect the local ancestry estimation. However, the signal at 6p22 consistently appeared in most of the analyses using various ancestral groups. This study provided important insights into recent natural selection in the context of the unique history of the New World and implications for disease mapping.

  4. Detection of genomic variation by selection of a 9 mb DNA region and high throughput sequencing.

    Directory of Open Access Journals (Sweden)

    Sergey I Nikolaev

    Full Text Available Detection of the rare polymorphisms and causative mutations of genetic diseases in a targeted genomic area has become a major goal in order to understand genomic and phenotypic variability. We have interrogated repeat-masked regions of 8.9 Mb on human chromosomes 21 (7.8 Mb) and 7 (1.1 Mb) from an individual from the International HapMap Project (NA12872). We have optimized a method of genomic selection for high throughput sequencing. Microarray-based selection and sequencing resulted in 260-fold enrichment, with 41% of reads mapping to the target region. 83% of SNPs in the targeted region had at least 4-fold sequence coverage and 54% at least 15-fold. When assaying HapMap SNPs in NA12872, our sequence genotypes are 91.3% concordant in regions with coverage ≥ 4-fold, and 97.9% concordant in regions with coverage ≥ 15-fold. About 81% of the SNPs recovered with both thresholds are listed in dbSNP. We observed that regions with low sequence coverage occur in close proximity to low-complexity DNA. Validation experiments using Sanger sequencing were performed for 46 SNPs with 15-20 fold coverage, with a confirmation rate of 96%, suggesting that DNA selection provides an accurate and cost-effective method for identifying rare genomic variants.

  5. Natural variation of histone modification and its impact on gene expression in the rat genome

    NARCIS (Netherlands)

    Rintisch, Carola; Heinig, Matthias; Bauerfeind, Anja; Schafer, Sebastian; Mieth, Christin; Patone, Giannino; Hummel, Oliver; Chen, Wei; Cook, Stuart; Cuppen, Edwin; Colomé-Tatché, Maria; Johannes, Frank; Jansen, Ritsert C; Neil, Helen; Werner, Michel; Pravenec, Michal; Vingron, Martin; Hubner, Norbert

    Histone modifications are epigenetic marks that play fundamental roles in many biological processes including the control of chromatin-mediated regulation of gene expression. Little is known about interindividual variability of histone modification levels across the genome and to what extent they

  6. Genome size and phenotypic variation of Nymphaea (Nymphaeaceae) species from Eastern Europe and temperate Asia

    Czech Academy of Sciences Publication Activity Database

    Dąbrowska, M. A.; Rola, K.; Volkova, P.; Suda, Jan; Zalewska-Gałosz, J.

    2015-01-01

    Vol. 84, No. 2 (2015), pp. 277-286 ISSN 0001-6977 R&D Projects: GA ČR GB14-36079G Institutional support: RVO:67985939 Keywords: flow cytometry * genome size * morphometrics Subject RIV: EF - Botanics Impact factor: 1.213, year: 2015

  7. Using an online genome resource to identify myostatin variation in U.S. sheep

    Science.gov (United States)

    We created a public, searchable DNA sequence resource for sheep that contained approximately 14x whole genome sequence of 96 rams. The animals represent 10 popular U.S. breeds and share minimal pedigree relationships, making the resource suitable for viewing gene variants in the user-friendly Integ...

  8. Systematic differences in the response of genetic variation to pedigree and genome-based selection methods

    NARCIS (Netherlands)

    Heidaritabar, M.; Vereijken, A.; Muir, W.M.; Meuwissen, T.H.E.; Cheng, H.; Megens, H.J.W.C.; Groenen, M.; Bastiaansen, J.W.M.

    2014-01-01

    Genomic selection (GS) is a DNA-based method of selecting for quantitative traits in animal and plant breeding, and offers a potentially superior alternative to traditional breeding methods that rely on pedigree and phenotype information. Using a 60 K SNP chip with markers spaced throughout the

  9. How genome size variation is linked with evolution within Chenopodium sensu lato

    Czech Academy of Sciences Publication Activity Database

    Mandák, Bohumil; Krak, Karol; Vít, Petr; Pavlíková, Zuzana; Lomonosova, M. N.; Habibi, Farzaneh; Lei, Wang; Jellen, E.N.; Douda, Jan

    2016-01-01

    Vol. 23, Dec 2016 (2016), pp. 18-32 ISSN 1433-8319 R&D Projects: GA ČR GA13-02290S Institutional support: RVO:67985939 Keywords: Chenopodium * genome size evolution * flow cytometry Subject RIV: EF - Botanics Impact factor: 3.123, year: 2016

  10. Genome-wide association mapping identifies the genetic basis of discrete and quantitative variation in sexual weaponry in a wild sheep population.

    Science.gov (United States)

    Johnston, Susan E; McEwan, John C; Pickering, Natalie K; Kijas, James W; Beraldi, Dario; Pilkington, Jill G; Pemberton, Josephine M; Slate, Jon

    2011-06-01

    Understanding the genetic architecture of phenotypic variation in natural populations is a fundamental goal of evolutionary genetics. Wild Soay sheep (Ovis aries) have an inherited polymorphism for horn morphology in both sexes, controlled by a single autosomal locus, Horns. The majority of males have large normal horns, but a small number have vestigial, deformed horns, known as scurs; females have either normal horns, scurs or no horns (polled). Given that scurred males and polled females have reduced fitness within each sex, it is counterintuitive that the polymorphism persists within the population. Therefore, identifying the genetic basis of horn type will provide a vital foundation for understanding why the different morphs are maintained in the face of natural selection. We conducted a genome-wide association study using ∼36000 single nucleotide polymorphisms (SNPs) and determined the main candidate for Horns as RXFP2, an autosomal gene with a known involvement in determining primary sex characters in humans and mice. Evidence from additional SNPs in and around RXFP2 supports a new model of horn-type inheritance in Soay sheep, and for the first time, sheep with the same horn phenotype but different underlying genotypes can be identified. In addition, RXFP2 was shown to be an additive quantitative trait locus (QTL) for horn size in normal-horned males, accounting for up to 76% of additive genetic variation in this trait. This finding contrasts markedly from genome-wide association studies of quantitative traits in humans and some model species, where it is often observed that mapped loci only explain a modest proportion of the overall genetic variation. © 2011 Blackwell Publishing Ltd.

  11. BIGSdb: Scalable analysis of bacterial genome variation at the population level

    Directory of Open Access Journals (Sweden)

    Maiden Martin CJ

    2010-12-01

    Full Text Available Abstract Background: The opportunities for bacterial population genomics that are being realised by the application of parallel nucleotide sequencing require novel bioinformatics platforms. These must be capable of the storage, retrieval, and analysis of linked phenotypic and genotypic information in an accessible, scalable and computationally efficient manner. Results: The Bacterial Isolate Genome Sequence Database (BIGSDB) is a scalable, open source, web-accessible database system that meets these needs, enabling phenotype and sequence data, which can range from a single sequence read to whole genome data, to be efficiently linked for a limitless number of bacterial specimens. The system builds on the widely used mlstdbNet software, developed for the storage and distribution of multilocus sequence typing (MLST) data, and incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences. These loci can be further organised into 'schemes' for isolate characterisation or for evolutionary or functional analyses. Isolates and loci can be indexed by multiple names and any number of alternative schemes can be accommodated, enabling cross-referencing of different studies and approaches. LIMS functionality of the software enables linkage to and organisation of laboratory samples. The data are easily linked to external databases and fine-grained authentication of access permits multiple users to participate in community annotation by setting up or contributing to different schemes within the database. Some of the applications of BIGSDB are illustrated with the genera Neisseria and Streptococcus. The BIGSDB source code and documentation are available at http://pubmlst.org/software/database/bigsdb/. Conclusions: Genomic data can be used to characterise bacterial isolates in many different ways but it can also be efficiently exploited for evolutionary or functional studies. BIGSDB

  12. Detecting loci under recent positive selection in dairy and beef cattle by combining different genome-wide scan methods.

    Directory of Open Access Journals (Sweden)

    Yuri Tani Utsunomiya

    Full Text Available As the methodologies available for the detection of positive selection from genomic data vary in terms of assumptions and execution, weak correlations are expected among them. However, if there is any given signal that is consistently supported across different methodologies, it is strong evidence that the locus has been under past selection. In this paper, a straightforward frequentist approach based on the Stouffer Method to combine P-values across different tests for evidence of recent positive selection in common variations, as well as strategies for extracting biological information from the detected signals, were described and applied to high density single nucleotide polymorphism (SNP) data generated from dairy and beef cattle (taurine and indicine). The ancestral Bovinae allele state of over 440,000 SNP is also reported. Using this combination of methods, highly significant (P < 3.17×10^-7) population-specific sweeps pointing to candidate genes and pathways that may be involved in beef and dairy production were identified. The most significant signal was found in the Cornichon homolog 3 gene (CNIH3) in Brown Swiss (P = 3.82×10^-12), and may be involved in the regulation of pre-ovulatory luteinizing hormone surge. Other putative pathways under selection are the glycolysis/gluconeogenesis, transcription machinery and chemokine/cytokine activity in Angus; calpain-calpastatin system and ribosome biogenesis in Brown Swiss; and gangliosides deposition in milk fat globules in Gyr. The composite method, combined with the strategies applied to retrieve functional information, may be a useful tool for surveying genome-wide selective sweeps and providing insights into the source of selection.
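
    The abstract above names the Stouffer Method for combining P-values from several tests. As a rough illustration only, and not the authors' pipeline, the textbook form of that method can be sketched as follows. The function name `stouffer_combine`, the optional `weights` argument, and the assumption that the inputs are one-sided P-values strictly between 0 and 1 are all choices made for this sketch.

```python
from math import sqrt
from statistics import NormalDist


def stouffer_combine(p_values, weights=None):
    """Combine one-sided p-values with Stouffer's Z-score method.

    Hypothetical helper for illustration. Each p-value must lie strictly
    between 0 and 1, because it is mapped through the inverse normal CDF.
    """
    if not p_values:
        raise ValueError("at least one p-value is required")
    if weights is None:
        weights = [1.0] * len(p_values)  # unweighted Stouffer
    if len(weights) != len(p_values):
        raise ValueError("weights and p_values must have the same length")

    norm = NormalDist()  # standard normal, mean 0 and sd 1
    # Convert each p-value to the Z-score with that upper-tail probability.
    z_scores = [norm.inv_cdf(1.0 - p) for p in p_values]
    # Weighted sum of Z-scores, rescaled so the result is again standard
    # normal under the null hypothesis (assuming independent tests).
    combined_z = sum(w * z for w, z in zip(weights, z_scores)) / sqrt(
        sum(w * w for w in weights)
    )
    combined_p = 1.0 - norm.cdf(combined_z)
    return combined_z, combined_p


if __name__ == "__main__":
    z, p = stouffer_combine([0.04, 0.10, 0.02])
    print(f"combined Z = {z:.3f}, combined P = {p:.5f}")
```

    The rescaling by the square root of the summed squared weights assumes the individual tests are independent; correlated selection scans, which the abstract itself anticipates, would need a correction that this sketch does not attempt.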

  13. Evolutionary and biotechnology implications of plastid genome variation in the inverted-repeat-lacking clade of legumes.

    Science.gov (United States)

    Sabir, Jamal; Schwarz, Erika; Ellison, Nicholas; Zhang, Jin; Baeshen, Nabih A; Mutwakil, Muhammed; Jansen, Robert; Ruhlman, Tracey

    2014-08-01

    Land plant plastid genomes (plastomes) provide a tractable model for evolutionary study in that they are relatively compact and gene dense. Among the groups that display an appropriate level of variation for structural features, the inverted-repeat-lacking clade (IRLC) of papilionoid legumes presents the potential to advance general understanding of the mechanisms of genomic evolution. Here, six complete plastome sequences are presented from economically important species of the IRLC, a lineage previously represented by only five completed plastomes. A number of characters are compared across the IRLC including gene retention and divergence, synteny, repeat structure and functional gene transfer to the nucleus. The loss of clpP intron 2 was identified in one newly sequenced member of IRLC, Glycyrrhiza glabra. Using deeply sequenced nuclear transcriptomes from two species helped clarify the nature of the functional transfer of accD to the nucleus in Trifolium, which likely occurred in the lineage leading to subgenus Trifolium. Legumes are second only to cereal crops in agricultural importance based on area harvested and total production. Genetic improvement via plastid transformation of IRLC crop species is an appealing proposition. Comparative analyses of intergenic spacer regions emphasize the need for complete genome sequences for developing transformation vectors for plastid genetic engineering of legume crops. © 2014 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.

  14. Single-Cell-Based Platform for Copy Number Variation Profiling through Digital Counting of Amplified Genomic DNA Fragments.

    Science.gov (United States)

    Li, Chunmei; Yu, Zhilong; Fu, Yusi; Pang, Yuhong; Huang, Yanyi

    2017-04-26

    We develop a novel single-cell-based platform through digital counting of amplified genomic DNA fragments, named multifraction amplification (mfA), to detect the copy number variations (CNVs) in a single cell. Amplification is required to acquire genomic information from a single cell, while introducing unavoidable bias. Unlike prevalent methods that directly infer CNV profiles from the pattern of sequencing depth, our mfA platform denatures and separates the DNA molecules from a single cell into multiple fractions of a reaction mix before amplification. By examining the sequencing result of each fraction for a specific fragment and applying a segment-merge maximum likelihood algorithm to the calculation of copy number, we digitize the sequencing-depth-based CNV identification and thus provide a method that is less sensitive to the amplification bias. In this paper, we demonstrate an mfA platform through multiple displacement amplification (MDA) chemistry. With the mfA platform, the noise of MDA is reduced; therefore, the resolution of single-cell CNV identification can be improved to 100 kb. We can also determine the genomic region free of allelic drop-out with the mfA platform, which is impossible for conventional single-cell amplification methods.

  15. Candidate genes revealed by a genome scan for mosquito resistance to a bacterial insecticide: sequence and gene expression variations

    Directory of Open Access Journals (Sweden)

    David Jean-Philippe

    2009-11-01

    Full Text Available Abstract Background: Genome scans are becoming an increasingly popular approach to study the genetic basis of adaptation and speciation, but on their own, they are often helpless at identifying the specific gene(s) or mutation(s) targeted by selection. This shortcoming is hopefully bound to disappear in the near future, thanks to the wealth of new genomic resources that are currently being developed for many species. In this article, we provide a foretaste of this exciting new era by conducting a genome scan in the mosquito Aedes aegypti with the aim to look for candidate genes involved in resistance to Bacillus thuringiensis subsp. israelensis (Bti) insecticidal toxins. Results: The genomes of a Bti-resistant and a Bti-susceptible strain were surveyed using about 500 MITE-based molecular markers, and the loci showing the highest inter-strain genetic differentiation were sequenced and mapped on the Aedes aegypti genome sequence. Several good candidate genes for Bti-resistance were identified in the vicinity of these highly differentiated markers. Two of them, coding for a cadherin and a leucine aminopeptidase, were further examined at the sequence and gene expression levels. In the resistant strain, the cadherin gene displayed patterns of nucleotide polymorphisms consistent with the action of positive selection (e.g. an excess of high compared to intermediate frequency mutations), as well as a significant under-expression compared to the susceptible strain. Conclusion: Both sequence and gene expression analyses agree to suggest a role for positive selection in the evolution of this cadherin gene in the resistant strain. However, it is unlikely that resistance to Bti is conferred by this gene alone, and further investigation will be needed to characterize other genes significantly associated with Bti resistance in Ae. aegypti. Beyond these results, this article illustrates how genome scans can build on the body of new genomic information (here, full

  16. Reactor thermal behaviors under kinetics parameters variations in fast reactivity insertion

    Energy Technology Data Exchange (ETDEWEB)

    Abou-El-Maaty, Talal [Reactors Department, Atomic Energy Authority, Cairo 13759 (Egypt)], E-mail: talal22969@yahoo.com; Abdelhady, Amr [Reactors Department, Atomic Energy Authority, Cairo 13759 (Egypt)

    2009-03-15

    This study considers how variations in some of the kinetics parameters affect reactivity insertion, in order to clarify the role that kinetics parameters play in prompt critical transients from the safety point of view. The kinetics parameter variations are limited to the effective delayed neutron fraction (β_eff) and the prompt neutron generation time (λ). The reactor thermal behaviors examined under these variations include the reactor power, maximum fuel temperature, maximum clad temperature, maximum coolant temperature and the mass flux variations at the hot channel. The analysis is done for a typical swimming pool, plate type research reactor with low enriched uranium. The scram system is disabled during the accident simulations. Calculations were done using the PARET code. The simulations show that the thermal behavior of the reactor (ETRR2) is considerably more sensitive to variation in the effective delayed neutron fraction than to variation in the prompt neutron generation time, and that the fast reactivity insertion in both cases causes flow expansion and contraction at the hot channel exit. The amplitude of the flow oscillation increases qualitatively as the effective delayed neutron fraction and the prompt neutron generation time decrease.
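
    To give a feel for why the two kinetics parameters in the abstract above matter in a fast reactivity insertion, here is a minimal point-kinetics sketch with a single delayed-neutron group and no thermal feedback. It is not the PARET code and does not model ETRR2; the function name, the one-group decay constant, the step size and all numerical values are assumptions chosen only for illustration.

```python
def point_kinetics_power(rho, beta_eff, gen_time,
                         decay_const=0.08, t_end=0.05, dt=1e-6):
    """Relative power after a step reactivity insertion (illustrative only).

    One-delayed-group point kinetics, integrated with explicit Euler:
        dn/dt = ((rho - beta_eff) / gen_time) * n + decay_const * c
        dc/dt = (beta_eff / gen_time) * n - decay_const * c
    rho        : inserted reactivity (absolute, not in dollars)
    beta_eff   : effective delayed neutron fraction
    gen_time   : prompt neutron generation time in seconds
    decay_const: assumed one-group precursor decay constant in 1/s
    """
    n = 1.0  # power normalised to its initial value
    # Precursor concentration in equilibrium with the initial power.
    c = beta_eff * n / (gen_time * decay_const)
    for _ in range(int(round(t_end / dt))):
        dn = ((rho - beta_eff) / gen_time) * n + decay_const * c
        dc = (beta_eff / gen_time) * n - decay_const * c
        n += dt * dn
        c += dt * dc
    return n


if __name__ == "__main__":
    # Hypothetical parameter sets: a base case, a lower delayed neutron
    # fraction, and a shorter prompt neutron generation time.
    for beta, gen in [(0.007, 1e-4), (0.006, 1e-4), (0.007, 5e-5)]:
        power = point_kinetics_power(rho=0.01, beta_eff=beta, gen_time=gen)
        print(f"beta_eff={beta}, gen_time={gen}: relative power {power:.2f}")
```

    Because the sketch omits fuel and coolant temperature feedback, it can only show the early power rise, not the temperature or flow behavior that the study reports.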

  17. Frequency of mononuclear diploid cardiomyocytes underlies natural variation in heart regeneration.

    Science.gov (United States)

    Patterson, Michaela; Barske, Lindsey; Van Handel, Ben; Rau, Christoph D; Gan, Peiheng; Sharma, Avneesh; Parikh, Shan; Denholtz, Matt; Huang, Ying; Yamaguchi, Yukiko; Shen, Hua; Allayee, Hooman; Crump, J Gage; Force, Thomas I; Lien, Ching-Ling; Makita, Takako; Lusis, Aldons J; Kumar, S Ram; Sucov, Henry M

    2017-09-01

    Adult mammalian cardiomyocyte regeneration after injury is thought to be minimal. Mononuclear diploid cardiomyocytes (MNDCMs), a relatively small subpopulation in the adult heart, may account for the observed degree of regeneration, but this has not been tested. We surveyed 120 inbred mouse strains and found that the frequency of adult mononuclear cardiomyocytes was surprisingly variable (>7-fold). Cardiomyocyte proliferation and heart functional recovery after coronary artery ligation both correlated with pre-injury MNDCM content. Using genome-wide association, we identified Tnni3k as one gene that influences variation in this composition and demonstrated that Tnni3k knockout resulted in elevated MNDCM content and increased cardiomyocyte proliferation after injury. Reciprocally, overexpression of Tnni3k in zebrafish promoted cardiomyocyte polyploidization and compromised heart regeneration. Our results corroborate the relevance of MNDCMs in heart regeneration. Moreover, they imply that intrinsic heart regeneration is not limited nor uniform in all individuals, but rather is a variable trait influenced by multiple genes.

  18. Genome-wide transcriptomic responses of the seagrasses Zostera marina and Nanozostera noltii under a simulated heatwave confirm functional types

    NARCIS (Netherlands)

    Franssen, Susanne U.; Gu, Jenny; Winters, Gidon; Huylmans, Ann-Kathrin; Wienpahl, Isabell; Sparwel, Maximiliane; Coyer, James; Olsen, Jeanine; Reusch, Thorsten; Bornberg-Bauer, Erich

    Genome-wide transcription analysis between related species occurring in overlapping ranges can provide insights into the molecular basis underlying different ecological niches. The co-occurring seagrass species, Zostera marina and Nanozostera noltii, are found in marine coastal environments

  19. Variation in Linked Selection and Recombination Drive Genomic Divergence during Allopatric Speciation of European and American Aspens.

    Science.gov (United States)

    Wang, Jing; Street, Nathaniel R; Scofield, Douglas G; Ingvarsson, Pär K

    2016-07-01

    Despite the global economic and ecological importance of forest trees, the genomic basis of differential adaptation and speciation in tree species is still poorly understood. Populus tremula and Populus tremuloides are two of the most widespread tree species in the Northern Hemisphere. Using whole-genome re-sequencing data of 24 P. tremula and 22 P. tremuloides individuals, we find that the two species diverged ∼2.2-3.1 million years ago, coinciding with the severing of the Bering land bridge and the onset of dramatic climatic oscillations during the Pleistocene. Both species have experienced substantial population expansions following long-term declines after species divergence. We detect widespread and heterogeneous genomic differentiation between species, and in accordance with the expectation of allopatric speciation, coalescent simulations suggest that neutral evolutionary processes can account for most of the observed patterns of genetic differentiation. However, there is an excess of regions exhibiting extreme differentiation relative to those expected under demographic simulations, which is indicative of the action of natural selection. Overall genetic differentiation is negatively associated with recombination rate in both species, providing strong support for a role of linked selection in generating the heterogeneous genomic landscape of differentiation between species. Finally, we identify a number of candidate regions and genes that may have been subject to positive and/or balancing selection during the speciation process. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  20. The missing indels: an estimate of indel variation in a human genome and analysis of factors that impede detection

    Science.gov (United States)

    Jiang, Yue; Turinsky, Andrei L.; Brudno, Michael

    2015-01-01

    With the development of High-Throughput Sequencing (HTS) thousands of human genomes have now been sequenced. Whenever different studies analyze the same genome they usually agree on the amount of single-nucleotide polymorphisms, but differ dramatically on the number of insertion and deletion variants (indels). Furthermore, there is evidence that indels are often severely under-reported. In this manuscript we derive the total number of indel variants in a human genome by combining data from different sequencing technologies, while assessing the indel detection accuracy. Our estimate of approximately 1 million indels in a Yoruban genome is much higher than the results reported in several recent HTS studies. We identify two key sources of difficulties in indel detection: the insufficient coverage, read length or alignment quality; and the presence of repeats, including short interspersed elements and homopolymers/dimers. We quantify the effect of these factors on indel detection. The quality of sequencing data plays a major role in improving indel detection by HTS methods. However, many indels exist in long homopolymers and repeats, where their detection is severely impeded. The true number of indel events is likely even higher than our current estimates, and new techniques and technologies will be required to detect them. PMID:26130710

  1. Utilizing intraspecific variation in phenotypic plasticity to bolster agricultural and forest productivity under climate change.

    Science.gov (United States)

    Aspinwall, Michael J; Loik, Michael E; Resco de Dios, Victor; Tjoelker, Mark G; Payton, Paxton R; Tissue, David T

    2015-09-01

    Climate change threatens the ability of agriculture and forestry to meet growing global demands for food, fibre and wood products. Information gathered from genotype-by-environment interactions (G × E), which demonstrate intraspecific variation in phenotypic plasticity (the ability of a genotype to alter its phenotype in response to environmental change), may prove important for bolstering agricultural and forest productivity under climate change. Nonetheless, very few studies have explicitly quantified genotype plasticity-productivity relationships in agriculture or forestry. Here, we conceptualize the importance of intraspecific variation in agricultural and forest species plasticity, and discuss the physiological and genetic factors contributing to intraspecific variation in phenotypic plasticity. Our discussion highlights the need for an integrated understanding of the mechanisms of G × E, more extensive assessments of genotypic responses to climate change under field conditions, and explicit testing of genotype plasticity-productivity relationships. Ultimately, further investigation of intraspecific variation in phenotypic plasticity in agriculture and forestry may prove important for identifying genotypes capable of increasing or sustaining productivity under more extreme climatic conditions. © 2014 John Wiley & Sons Ltd.

  2. Spatio-temporal variations of vegetation indicators in Eastern Siberia under global warming

    Science.gov (United States)

    Varlamova, Eugenia V.; Solovyev, Vladimir S.

    2017-11-01

    A study of spatio-temporal variations of NDVI (Normalized Difference Vegetation Index) and phenological parameters of Eastern Siberia vegetation cover under global warming was carried out on AVHRR/NOAA data (1982-2014). Trend maps of NDVI and annual variations of phenological parameters and NDVI are analyzed. A method based on stable transition of air temperature through +5°C was used to estimate the beginning, end and length of the growing season. Correlations between NDVI and phenological parameters, surface air temperature and precipitation are discussed.
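
    The abstract above does not say how a "stable transition" through +5°C is defined. One common reading, used here purely as an assumption, is the first day that begins a run of several consecutive days above the threshold, with the season ending on the first later day that begins an equally long run at or below it. The function name and the `run_length` parameter are hypothetical.

```python
def growing_season(daily_temps, threshold=5.0, run_length=5):
    """Estimate growing-season start, end and length from daily mean temperatures.

    Assumed definition (not taken from the source): the season starts on the
    first day of the first run of `run_length` consecutive days above
    `threshold`, and ends on the first day of the next such run at or below it.
    Returns (start_index, end_index, length_in_days), or None if the
    temperature never makes a stable upward transition.
    """
    def first_run(start, above):
        run = 0
        for i in range(start, len(daily_temps)):
            if above:
                hit = daily_temps[i] > threshold
            else:
                hit = daily_temps[i] <= threshold
            run = run + 1 if hit else 0
            if run == run_length:
                return i - run_length + 1  # index of the first day of the run
        return None

    start = first_run(0, above=True)
    if start is None:
        return None
    end = first_run(start + run_length, above=False)
    if end is None:
        end = len(daily_temps)  # season lasts to the end of the record
    return start, end, end - start


if __name__ == "__main__":
    # Synthetic series: cold, one isolated warm day, cold, warm season, cold.
    temps = [0.0] * 4 + [8.0] + [0.0] * 5 + [10.0] * 20 + [0.0] * 10
    print(growing_season(temps))
```

    Requiring a run of days rather than a single crossing keeps an isolated warm spell, like the one in the synthetic series, from being counted as the start of the season.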

  3. Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication

    Science.gov (United States)

    vonHoldt, Bridgett M.; Pollinger, John P.; Lohmueller, Kirk E.; Han, Eunjung; Parker, Heidi G.; Quignon, Pascale; Degenhardt, Jeremiah D.; Boyko, Adam R.; Earl, Dent A.; Auton, Adam; Reynolds, Andy; Bryc, Kasia; Brisbin, Abra; Knowles, James C.; Mosher, Dana S.; Spady, Tyrone C.; Elkahloun, Abdel; Geffen, Eli; Pilot, Malgorzata; Jedrzejewski, Wlodzimierz; Greco, Claudia; Randi, Ettore; Bannasch, Danika; Wilton, Alan; Shearman, Jeremy; Musiani, Marco; Cargill, Michelle; Jones, Paul G.; Qian, Zuwei; Huang, Wei; Ding, Zhao-Li; Zhang, Ya-ping; Bustamante, Carlos D.; Ostrander, Elaine A.; Novembre, John; Wayne, Robert K.

    2010-01-01

    Advances in genome technology have facilitated a new understanding of the historical and genetic processes crucial to rapid phenotypic evolution under domestication1,2. To understand the process of dog diversification better, we conducted an extensive genome-wide survey of more than 48,000 single nucleotide polymorphisms in dogs and their wild progenitor, the grey wolf. Here we show that dog breeds share a higher proportion of multi-locus haplotypes unique to grey wolves from the Middle East, indicating that they are a dominant source of genetic diversity for dogs rather than wolves from east Asia, as suggested by mitochondrial DNA sequence data3. Furthermore, we find a surprising correspondence between genetic and phenotypic/functional breed groupings but there are exceptions that suggest phenotypic diversification depended in part on the repeated crossing of individuals with novel phenotypes. Our results show that Middle Eastern wolves were a critical source of genome diversity, although interbreeding with local wolf populations clearly occurred elsewhere in the early history of specific lineages. More recently, the evolution of modern dog breeds seems to have been an iterative process that drew on a limited genetic toolkit to create remarkable phenotypic diversity. PMID:20237475

  4. Genomic interrogation of mechanism(s) underlying cellular responses to toxicants

    International Nuclear Information System (INIS)

    Amin, Rupesh P.; Hamadeh, Hisham K.; Bushel, Pierre R.; Bennett, Lee; Afshari, Cynthia A.; Paules, Richard S.

    2002-01-01

    Assessment of the impact of xenobiotic exposure on human health and disease progression is complex. Knowledge of mode(s) of action, including mechanism(s) contributing to toxicity and disease progression, is valuable for evaluating compounds. Toxicogenomics, the subdiscipline which merges genomics with toxicology, holds the promise of contributing significantly toward the goal of elucidating mechanism(s) by studying genome-wide effects of xenobiotics. Global gene expression profiling, revolutionized by microarray technology and a crucial aspect of a toxicogenomic study, allows measuring transcriptional modulation of thousands of genes following exposure to a xenobiotic. We use our results from previous studies on compounds representing two different classes of xenobiotics (barbiturate and peroxisome proliferator) to discuss the application of computational approaches for analyzing microarray data to elucidate mechanism(s) underlying cellular responses to toxicants. In particular, our laboratory demonstrated that chemical-specific patterns of gene expression can be revealed using cDNA microarrays. Transcript profiling provides discrimination between classes of toxicants, as well as genome-wide insight into mechanism(s) of toxicity and disease progression. Ultimately, the expectation is that novel approaches for predicting xenobiotic toxicity in humans will emerge from such information.

  5. Systematic differences in the response of genetic variation to pedigree and genome-based selection methods.

    Science.gov (United States)

    Heidaritabar, M; Vereijken, A; Muir, W M; Meuwissen, T; Cheng, H; Megens, H-J; Groenen, M A M; Bastiaansen, J W M

    2014-12-01

    Genomic selection (GS) is a DNA-based method of selecting for quantitative traits in animal and plant breeding, and offers a potentially superior alternative to traditional breeding methods that rely on pedigree and phenotype information. Using a 60 K SNP chip with markers spaced throughout the entire chicken genome, we compared the impact of GS and traditional BLUP (best linear unbiased prediction) selection methods applied side-by-side in three different lines of egg-laying chickens. Differences were demonstrated between methods, both at the level and genomic distribution of allele frequency changes. In all three lines, the average allele frequency changes were larger with GS, 0.056, 0.064 and 0.066, compared with BLUP, 0.044, 0.045 and 0.036 for lines B1, B2 and W1, respectively. With BLUP, 35 selected regions (empirical P selected regions were identified. Empirical thresholds for local allele frequency changes were determined from gene dropping, and differed considerably between GS (0.167-0.198) and BLUP (0.105-0.126). Between lines, the genomic regions with large changes in allele frequencies showed limited overlap. Our results show that GS applies selection pressure much more locally than BLUP, resulting in larger allele frequency changes. With these results, novel insights into the nature of selection on quantitative traits have been gained and important questions regarding the long-term impact of GS are raised. The rapid changes to a part of the genetic architecture, while another part may not be selected, at least in the short term, require careful consideration, especially when selection occurs before phenotypes are observed.

  6. Demographic history and biologically relevant genetic variation of Native Mexicans inferred from whole-genome sequencing

    OpenAIRE

    Romero-Hidalgo, Sandra; Ochoa-Leyva, Adrián; Garcíarrubio, Alejandro; Acuña-Alonzo, Victor; Antúnez-Argüelles, Erika; Balcazar-Quintero, Martha; Barquera-Lozano, Rodrigo; Carnevale, Alessandra; Cornejo-Granados, Fernanda; Fernández-López, Juan Carlos; García-Herrera, Rodrigo; García-Ortíz, Humberto; Granados-Silvestre, Ángeles; Granados, Julio; Guerrero-Romero, Fernando

    2017-01-01

    Understanding the genetic structure of Native American populations is important to clarify their diversity, demographic history, and to identify genetic factors relevant for biomedical traits. Here, we show a demographic history reconstruction from 12 Native American whole genomes belonging to six distinct ethnic groups representing the three main described genetic clusters of Mexico (Northern, Southern, and Maya). Effective population size estimates of all Native American groups remained bel...

  7. Genome size variation in Macaronesian Angiosperms: Forty Percent of Canarian Endemic Flora Completed

    Czech Academy of Sciences Publication Activity Database

    Suda, Jan; Kyncl, Tomáš; Jarolímová, Vlasta

    2005-01-01

    Vol. 252, No. 3-4 (2005), pp. 215-238 ISSN 0378-2697 R&D Projects: GA ČR(CZ) GA206/00/1445; GA ČR(CZ) GA206/04/0081; GA AV ČR(CZ) KSK6005114 Institutional research plan: CEZ:AV0Z60050516 Keywords: genome size * cytometry * Macaronesia Subject RIV: EF - Botanics Impact factor: 1.421, year: 2005

  8. Striking structural dynamism and nucleotide sequence variation of the transposon Galileo in the genome of Drosophila mojavensis.

    Science.gov (United States)

    Marzo, Mar; Bello, Xabier; Puig, Marta; Maside, Xulio; Ruiz, Alfredo

    2013-02-04

    Galileo is a transposable element responsible for the generation of three chromosomal inversions in natural populations of Drosophila buzzatii. Although the most characteristic feature of Galileo is the long internally-repetitive terminal inverted repeats (TIRs), which resemble the Drosophila Foldback element, its transposase-coding sequence has led to its classification as a member of the P-element superfamily (Class II, subclass 1, TIR order). Furthermore, Galileo has a wide distribution in the genus Drosophila, since it has been found in 6 of the 12 Drosophila sequenced genomes. Among these species, D. mojavensis, the one closest to D. buzzatii, presented the highest diversity in sequence and structure of Galileo elements. In the present work, we carried out a thorough search and annotation of all the Galileo copies present in the D. mojavensis sequenced genome. In our set of 170 Galileo copies we have detected 5 Galileo subfamilies (C, D, E, F, and X) with different structures ranging from nearly complete, to only 2 TIR or solo TIR copies. Finally, we have explored the structural and length variation of the Galileo copies that point out the relatively frequent rearrangements within and between Galileo elements. Different mechanisms responsible for these rearrangements are discussed. Although Galileo is a transposable element with an ancient history in the D. mojavensis genome, our data indicate a recent transpositional activity. Furthermore, the dynamism in sequence and structure, mainly affecting the TIRs, suggests an active exchange of sequences among the copies. This exchange could lead to new subfamilies of the transposon, which could be crucial for the long-term survival of the element in the genome.

  9. Spectrum of mitochondrial genomic variation and associated clinical presentation of prostate cancer in South African men.

    Science.gov (United States)

    McCrow, John P; Petersen, Desiree C; Louw, Melanie; Chan, Eva K F; Harmeyer, Katherine; Vecchiarelli, Stefano; Lyons, Ruth J; Bornman, M S Riana; Hayes, Vanessa M

    2016-03-01

    Prostate cancer incidence and mortality rates are significantly increased in African-American men, but limited studies have been performed within Sub-Saharan African populations. As mitochondria control energy metabolism and apoptosis we speculate that somatic mutations within mitochondrial genomes are candidate drivers of aggressive prostate carcinogenesis. We used matched blood and prostate tissue samples from 87 South African men (77 with African ancestry) to perform deep sequencing of complete mitochondrial genomes. Clinical presentation was biased toward aggressive disease (Gleason score >7, 64%), and compared with men without prostate cancer either with or without benign prostatic hyperplasia. We identified 144 somatic mtDNA single nucleotide variants (SNVs), of which 80 were observed in 39 men presenting with aggressive disease. Both the number and frequency of somatic mtDNA SNVs were associated with higher pathological stage. Besides doubling the total number of somatic PCa-associated mitochondrial genome mutations identified to date, we associate mutational load with aggressive prostate cancer status in men of African ancestry. © 2015 The Authors. The Prostate published by Wiley Periodicals, Inc.

  10. Kinetic theory approach to modeling of cellular repair mechanisms under genome stress.

    Directory of Open Access Journals (Sweden)

    Jinpeng Qi

    Full Text Available Under acute perturbations from outer environment, a normal cell can trigger cellular self-defense mechanism in response to genome stress. To investigate the kinetics of cellular self-repair process at single cell level further, a model of DNA damage generating and repair is proposed under acute Ion Radiation (IR) by using mathematical framework of kinetic theory of active particles (KTAP). Firstly, we focus on illustrating the profile of Cellular Repair System (CRS) instituted by two sub-populations, each of which is made up of the active particles with different discrete states. Then, we implement the mathematical framework of cellular self-repair mechanism, and illustrate the dynamic processes of Double Strand Breaks (DSBs) and Repair Protein (RP) generating, DSB-protein complexes (DSBCs) synthesizing, and toxins accumulating. Finally, we roughly analyze the capability of cellular self-repair mechanism, cellular activity of transferring DNA damage, and genome stability, especially the different fates of a certain cell before and after the time thresholds of IR perturbations that a cell can tolerate maximally under different IR perturbation circumstances.

  11. Kinetic theory approach to modeling of cellular repair mechanisms under genome stress.

    Science.gov (United States)

    Qi, Jinpeng; Ding, Yongsheng; Zhu, Ying; Wu, Yizhi

    2011-01-01

    Under acute perturbations from outer environment, a normal cell can trigger cellular self-defense mechanism in response to genome stress. To investigate the kinetics of cellular self-repair process at single cell level further, a model of DNA damage generating and repair is proposed under acute Ion Radiation (IR) by using mathematical framework of kinetic theory of active particles (KTAP). Firstly, we focus on illustrating the profile of Cellular Repair System (CRS) instituted by two sub-populations, each of which is made up of the active particles with different discrete states. Then, we implement the mathematical framework of cellular self-repair mechanism, and illustrate the dynamic processes of Double Strand Breaks (DSBs) and Repair Protein (RP) generating, DSB-protein complexes (DSBCs) synthesizing, and toxins accumulating. Finally, we roughly analyze the capability of cellular self-repair mechanism, cellular activity of transferring DNA damage, and genome stability, especially the different fates of a certain cell before and after the time thresholds of IR perturbations that a cell can tolerate maximally under different IR perturbation circumstances.

  12. Genomic instability after targeted irradiation of human lymphocytes: Evidence for inter-individual differences under bystander conditions

    International Nuclear Information System (INIS)

    Kadhim, Munira A.; Lee, Ryonfa; Moore, Stephen R.; Macdonald, Denise A.; Chapman, Kim L.; Patel, Gaurang; Prise, Kevin M.

    2010-01-01

    Environmental ²²²radon exposure is a human health concern, and many studies demonstrate that very low doses of high LET α-particle irradiation initiate deleterious genetic consequences in both irradiated and non-irradiated bystander cells. One consequence, radiation-induced genomic instability (RIGI), is a hallmark of tumorigenesis and is often assessed by measuring delayed chromosomal aberrations. We utilised a technique that facilitates transient immobilization of primary lymphocytes for targeted microbeam irradiation and have reported that environmentally relevant doses, e.g. a single ³He²⁺ particle traversal to a single cell, are sufficient to induce RIGI. Herein we sought to determine differences in radiation response in lymphocytes isolated from five healthy male donors. Primary lymphocytes were irradiated with a single particle per cell nucleus. We found evidence for inter-individual variation in radiation response (RIGI, measured as delayed chromosome aberrations). Although this was not highly significant, it was possibly masked by high levels of intra-individual variation. While there are many studies showing a link between genetic predisposition and RIGI, there are few studies linking genetic background with bystander effects in normal human lymphocytes. In an attempt to investigate inter-individual variation in the induction of bystander effects, primary lymphocytes were irradiated with a single particle under conditions where fractions of the population were traversed. We showed a marked genotype-dependent bystander response in one donor after exposure to 15% of the population. The findings may also be regarded as a radiation-induced genotype-dependent bystander effect triggering an instability phenotype.

  13. Genomic instability after targeted irradiation of human lymphocytes: Evidence for inter-individual differences under bystander conditions

    Energy Technology Data Exchange (ETDEWEB)

    Kadhim, Munira A., E-mail: mkadhim@brookes.ac.uk [School of Life Sciences, Oxford Brookes University, Oxford OX3 0BP (United Kingdom)]; Lee, Ryonfa [Biophysics, GSI Helmholtzzentrum fuer Schwerionenforschung GmbH, Planckstrasse 1, D-64291 Darmstadt (Germany)]; Moore, Stephen R.; Macdonald, Denise A. [Radiation and Genome Stability Unit, Medical Research Council, Harwell, Oxfordshire OX11 0RD (United Kingdom)]; Chapman, Kim L. [School of Life Sciences, Oxford Brookes University, Oxford OX3 0BP (United Kingdom)]; Patel, Gaurang; Prise, Kevin M. [Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast BT9 7BL (United Kingdom)]

    2010-06-01

    Environmental ²²²radon exposure is a human health concern, and many studies demonstrate that very low doses of high LET α-particle irradiation initiate deleterious genetic consequences in both irradiated and non-irradiated bystander cells. One consequence, radiation-induced genomic instability (RIGI), is a hallmark of tumorigenesis and is often assessed by measuring delayed chromosomal aberrations. We utilised a technique that facilitates transient immobilization of primary lymphocytes for targeted microbeam irradiation and have reported that environmentally relevant doses, e.g. a single ³He²⁺ particle traversal to a single cell, are sufficient to induce RIGI. Herein we sought to determine differences in radiation response in lymphocytes isolated from five healthy male donors. Primary lymphocytes were irradiated with a single particle per cell nucleus. We found evidence for inter-individual variation in radiation response (RIGI, measured as delayed chromosome aberrations). Although this was not highly significant, it was possibly masked by high levels of intra-individual variation. While there are many studies showing a link between genetic predisposition and RIGI, there are few studies linking genetic background with bystander effects in normal human lymphocytes. In an attempt to investigate inter-individual variation in the induction of bystander effects, primary lymphocytes were irradiated with a single particle under conditions where fractions of the population were traversed. We showed a marked genotype-dependent bystander response in one donor after exposure to 15% of the population. The findings may also be regarded as a radiation-induced genotype-dependent bystander effect triggering an instability phenotype.

  14. Genome-wide map of Apn1 binding sites under oxidative stress in Saccharomyces cerevisiae.

    Science.gov (United States)

    Morris, Lydia P; Conley, Andrew B; Degtyareva, Natalya; Jordan, I King; Doetsch, Paul W

    2017-11-01

    The DNA in cells is continuously exposed to reactive oxygen species resulting in toxic and mutagenic DNA damage. Although the repair of oxidative DNA damage occurs primarily through the base excision repair (BER) pathway, the nucleotide excision repair (NER) pathway processes some of the same lesions. In addition, damage tolerance mechanisms, such as recombination and translesion synthesis, enable cells to tolerate oxidative DNA damage, especially when BER and NER capacities are exceeded. Thus, disruption of BER alone or disruption of BER and NER in Saccharomyces cerevisiae leads to increased mutations as well as large-scale genomic rearrangements. Previous studies demonstrated that a particular region of chromosome II is susceptible to chronic oxidative stress-induced chromosomal rearrangements, suggesting the existence of DNA damage and/or DNA repair hotspots. Here we investigated the relationship between oxidative damage and genomic instability utilizing chromatin immunoprecipitation combined with DNA microarray technology to profile DNA repair sites along yeast chromosomes under different oxidative stress conditions. We targeted the major yeast AP endonuclease Apn1 as a representative BER protein. Our results indicate that Apn1 target sequences are enriched for cytosine and guanine nucleotides. We predict that BER protects these sites in the genome because guanines and cytosines are thought to be especially susceptible to oxidative attack, thereby preventing large-scale genome destabilization from chronic accumulation of DNA damage. Information from our studies should provide insight into how regional deployment of oxidative DNA damage management systems along chromosomes protects against large-scale rearrangements. Copyright © 2017 John Wiley & Sons, Ltd.

  15. Ectopic KIT copy number variation underlies impaired migration of primordial germ cells associated with gonadal hypoplasia in cattle (Bos taurus).

    Directory of Open Access Journals (Sweden)

    Heli Venhoranta

    Full Text Available Impaired migration of primordial germ cells during embryonic development causes hereditary gonadal hypoplasia in both sexes of Northern Finncattle and Swedish Mountain cattle. The affected gonads exhibit a lack of or, in rare cases, a reduced number of germ cells. Most affected animals present left-sided gonadal hypoplasia. However, right-sided and bilateral cases are also found. This type of gonadal hypoplasia prevails in animals with white coat colour. Previous studies indicated that gonadal hypoplasia is inherited in an autosomal recessive fashion with incomplete penetrance. In order to identify genetic regions underlying gonadal hypoplasia, a genome-wide association study (GWAS) and a copy number variation (CNV) analysis were performed with 94 animals, including 21 affected animals, using bovine 777,962 SNP arrays. The GWAS and CNV results revealed two significantly associated regions on bovine chromosomes (BTA) 29 and 6, respectively (P = 2.19 × 10⁻¹³ and P = 5.65 × 10⁻⁶). Subsequent cytogenetic and PCR analyses demonstrated that homozygosity of a ~500 kb chromosomal segment translocated from BTA6 to BTA29 (Cs29 allele) is the underlying genetic mechanism responsible for gonadal hypoplasia. The duplicated segment includes the KIT gene that is known to regulate the migration of germ cells and precursors of melanocytes. This duplication is also one of the two translocations associated with colour sidedness in various cattle breeds.

  16. Antagonism between phytohormone signalling underlies the variation in disease susceptibility of tomato plants under elevated CO2

    Science.gov (United States)

    Zhang, Shuai; Li, Xin; Sun, Zenghui; Shao, Shujun; Hu, Lingfei; Ye, Meng; Zhou, Yanhong; Xia, Xiaojian; Yu, Jingquan; Shi, Kai

    2015-01-01

    Increasing CO2 concentrations ([CO2]) have the potential to disrupt plant–pathogen interactions in natural and agricultural ecosystems, but the research in this area has often produced conflicting results. Variations in phytohormone salicylic acid (SA) and jasmonic acid (JA) signalling could be associated with variations in the responses of pathogens to plants grown under elevated [CO2]. In this study, interactions between tomato plants and three pathogens with different infection strategies were compared. Elevated [CO2] generally favoured SA biosynthesis and signalling but repressed the JA pathway. The exposure of plants to elevated [CO2] revealed a lower incidence and severity of disease caused by tobacco mosaic virus (TMV) and by Pseudomonas syringae, whereas plant susceptibility to necrotrophic Botrytis cinerea increased. The elevated [CO2]-induced and basal resistance to TMV and P. syringae were completely abolished in plants in which the SA signalling pathway nonexpressor of pathogenesis-related genes 1 (NPR1) had been silenced or in transgenic plants defective in SA biosynthesis. In contrast, under both ambient and elevated [CO2], the susceptibility to B. cinerea highly increased in plants in which the JA signalling pathway proteinase inhibitors (PI) gene had been silenced or in a mutant affected in JA biosynthesis. However, plants affected in SA signalling remained less susceptible to this disease. These findings highlight the modulated antagonistic relationship between SA and JA that contributes to the variation in disease susceptibility under elevated [CO2]. This information will be critical for investigating how elevated CO2 may affect plant defence and the dynamics between plants and pathogens in both agricultural and natural ecosystems. PMID:25657213

  17. False Operation of Static Random Access Memory Cells under Alternating Current Power Supply Voltage Variation

    Science.gov (United States)

    Sawada, Takuya; Takata, Hidehiro; Nii, Koji; Nagata, Makoto

    2013-04-01

    Static random access memory (SRAM) cores exhibit susceptibility against power supply voltage variation. False operation is investigated among SRAM cells under sinusoidal voltage variation on power lines introduced by direct RF power injection. A standard SRAM core of 16 kbyte in a 90 nm 1.5 V technology is diagnosed with built-in self test and on-die noise monitor techniques. The sensitivity of bit error rate is shown to be high against the frequency of injected voltage variation, while it is not greatly influenced by the difference in frequency and phase against SRAM clocking. It is also observed that the distribution of false bits is substantially random in a cell array.

  18. Genomewide variation in an introgression line of rice-Zizania revealed by whole-genome re-sequencing.

    Directory of Open Access Journals (Sweden)

    Zhen-Hui Wang

    Full Text Available BACKGROUND: Hybridization between genetically diverged organisms is known as an important avenue that drives plant genome evolution. The possible outcomes of hybridization would be the occurrences of genetic instabilities in the resultant hybrids. It remained under-investigated however whether pollination by alien pollens of a closely related but sexually "incompatible" species could evoke genomic changes and to what extent it may result in phenotypic novelties in the derived progenies. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we have re-sequenced the genomes of Oryza sativa ssp. japonica cv. Matsumae and one of its derived introgressant RZ35 that was obtained from an introgressive hybridization between Matsumae and Zizania latifolia Griseb. In general, 131 million 90 base pair (bp) paired-end reads were generated which covered 13.2 and 21.9 folds of the Matsumae and RZ35 genomes, respectively. Relative to Matsumae, a total of 41,724 homozygous single nucleotide polymorphisms (SNPs) and 17,839 homozygous insertions/deletions (indels) were identified in RZ35, of which 3,797 SNPs were nonsynonymous mutations. Furthermore, rampant mobilization of transposable elements (TEs) was found in the RZ35 genome. The results of pathogen inoculation revealed that RZ35 exhibited enhanced resistance to blast relative to Matsumae. Notably, one nonsynonymous mutation was found in the known blast resistance gene Pid3/Pi25 and real-time quantitative (q) RT-PCR analysis revealed constitutive up-regulation of its expression, suggesting both altered function and expression of Pid3/Pi25 may be responsible for the enhanced resistance to rice blast by RZ35. CONCLUSIONS/SIGNIFICANCE: Our results demonstrate that introgressive hybridization by Zizania has provoked genomewide, extensive genomic changes in the rice genome, and some of which have resulted in important phenotypic novelties. These findings suggest that introgressive hybridization by alien pollens of even a

  19. Analysis of genetic variation and potential applications in genome-scale metabolic modeling

    DEFF Research Database (Denmark)

    Cardoso, Joao; Andersen, Mikael Rørdam; Herrgard, Markus

    2015-01-01

    Genetic variation is the motor of evolution and allows organisms to overcome the environmental challenges they encounter. It can be both beneficial and harmful in the process of engineering cell factories for the production of proteins and chemicals. Throughout the history of biotechnology, there have been efforts to exploit genetic variation in our favor to create strains with favorable phenotypes. Genetic variation can either be present in natural populations or it can be artificially created by mutagenesis and selection or adaptive laboratory evolution. On the other hand, unintended genetic... ...scale and resolution by re-sequencing thousands of strains systematically. In this article, we review challenges in the integration and analysis of large-scale re-sequencing data, present an extensive overview of bioinformatics methods for predicting the effects of genetic variants on protein function...
  20. Switches in Genomic GC Content Drive Shifts of Optimal Codons under Sustained Selection on Synonymous Sites

    Science.gov (United States)

    Sun, Yu; Tamarit, Daniel

    2017-01-01

    The major codon preference model suggests that codons read by tRNAs in high concentrations are preferentially utilized in highly expressed genes. However, the identity of the optimal codons differs between species although the forces driving such changes are poorly understood. We suggest that these questions can be tackled by placing codon usage studies in a phylogenetic framework and that bacterial genomes with extreme nucleotide composition biases provide informative model systems. Switches in the background substitution biases from GC to AT have occurred in Gardnerella vaginalis (GC = 32%), and from AT to GC in Lactobacillus delbrueckii (GC = 62%) and Lactobacillus fermentum (GC = 63%). We show that despite the large effects on codon usage patterns by these switches, all three species evolve under selection on synonymous sites. In G. vaginalis, the dramatic codon frequency changes coincide with shifts of optimal codons. In contrast, the optimal codons have not shifted in the two Lactobacillus genomes despite an increased fraction of GC-ending codons. We suggest that all three species are in different phases of an on-going shift of optimal codons, and attribute the difference to a stronger background substitution bias and/or longer time since the switch in G. vaginalis. We show that comparative and correlative methods for optimal codon identification yield conflicting results for genomes in flux and discuss possible reasons for the mispredictions. We conclude that switches in the direction of the background substitution biases can drive major shifts in codon preference patterns even under sustained selection on synonymous codon sites. PMID:27540085
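
    The two quantities this abstract relies on, genomic GC content and the fraction of GC-ending codons, can be illustrated with a minimal Python sketch. The function names `gc_content` and `gc3_fraction` are hypothetical and are not taken from the paper; the sketch assumes plain DNA strings and a coding sequence already in reading frame 0, and it is not the authors' pipeline.

```python
from collections import Counter


def gc_content(seq: str) -> float:
    """Fraction of G and C among unambiguous bases (A, C, G, T)."""
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no unambiguous bases")
    return (counts["G"] + counts["C"]) / total


def gc3_fraction(cds: str) -> float:
    """Fraction of codons whose third position is G or C.

    Assumes the coding sequence starts in frame 0; trailing bases that do
    not complete a codon, and codons with ambiguous bases, are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    codons = [codon for codon in codons if set(codon) <= set("ACGT")]
    if not codons:
        raise ValueError("sequence contains no complete unambiguous codons")
    return sum(codon[2] in "GC" for codon in codons) / len(codons)


if __name__ == "__main__":
    toy_cds = "ATGGCCTTA"  # toy example: codons ATG, GCC, TTA
    print(f"GC content: {gc_content(toy_cds):.3f}")
    print(f"GC-ending codon fraction: {gc3_fraction(toy_cds):.3f}")
```

    Identifying optimal codons would additionally require expression data or a reference set of highly expressed genes, which this sketch does not attempt.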

  1. Genetic Architecture of Natural Variation in Rice Chlorophyll Content Revealed by a Genome-Wide Association Study.

    Science.gov (United States)

    Wang, Quanxiu; Xie, Weibo; Xing, Hongkun; Yan, Ju; Meng, Xiangzhou; Li, Xinglei; Fu, Xiangkui; Xu, Jiuyue; Lian, Xingming; Yu, Sibin; Xing, Yongzhong; Wang, Gongwei

    2015-06-01

    Chlorophyll content is one of the most important physiological traits as it is closely related to leaf photosynthesis and crop yield potential. So far, few genes have been reported to be involved in natural variation of chlorophyll content in rice (Oryza sativa) and the extent of variations explored is very limited. We conducted a genome-wide association study (GWAS) using a diverse worldwide collection of 529 O. sativa accessions. A total of 46 significant association loci were identified. Three F2 mapping populations with parents selected from the association panel were tested for validation of GWAS signals. We clearly demonstrated that Grain number, plant height, and heading date7 (Ghd7) was a major locus for natural variation of chlorophyll content at the heading stage by combining evidence from near-isogenic lines and transgenic plants. The enhanced expression of Ghd7 decreased the chlorophyll content, mainly through down-regulating the expression of genes involved in the biosynthesis of chlorophyll and chloroplast. In addition, Narrow leaf1 (NAL1) corresponded to one significant association region repeatedly detected over two years. We revealed a high degree of polymorphism in the 5' UTR and four non-synonymous SNPs in the coding region of NAL1, and observed diverse effects of the major haplotypes. The loci or candidate genes identified would help to fine-tune and optimize the antenna size of canopies in rice breeding. Copyright © 2015 The Author. Published by Elsevier Inc. All rights reserved.

  2. A novel variational Bayes multiple locus Z-statistic for genome-wide association studies with Bayesian model averaging

    Science.gov (United States)

    Logsdon, Benjamin A.; Carty, Cara L.; Reiner, Alexander P.; Dai, James Y.; Kooperberg, Charles

    2012-01-01

    Motivation: For many complex traits, including height, the majority of variants identified by genome-wide association studies (GWAS) have small effects, leaving a significant proportion of the heritable variation unexplained. Although many penalized multiple regression methodologies have been proposed to increase the power to detect associations for complex genetic architectures, they generally lack mechanisms for false-positive control and diagnostics for model over-fitting. Our methodology is the first penalized multiple regression approach that explicitly controls Type I error rates and provides model over-fitting diagnostics through a novel normally distributed statistic defined for every marker within the GWAS, based on results from a variational Bayes spike regression algorithm. Results: We compare the performance of our method to the lasso and single marker analysis on simulated data and demonstrate that our approach has superior performance in terms of power and Type I error control. In addition, using the Women's Health Initiative (WHI) SNP Health Association Resource (SHARe) GWAS of African-Americans, we show that our method has power to detect additional novel associations with body height. These findings replicate by reaching a stringent cutoff of marginal association in a larger cohort. Availability: An R-package, including an implementation of our variational Bayes spike regression (vBsr) algorithm, is available at http://kooperberg.fhcrc.org/soft.html. Contact: blogsdon@fhcrc.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22563072

  3. Identification of genotypic variation for nitrogen response in potato (Solanum tuberosum) under low nitrogen input circumstances

    OpenAIRE

    Tiemens-Hulscher, M.; Lammerts Van Bueren, E.; Struik, P.C.

    2009-01-01

    Nitrogen is an essential nutrient for crop growth. The demand for nitrogen in the potato crop is relatively high. However, in organic farming nitrogen input is rather limited, compared with conventional farming. In this research nine potato varieties were tested at three nitrogen levels. Genotypic variation for yield, leaf area index, period of maximum soil cover, sensitivity for N-shortage and nitrogen efficiency under low input circumstances was found. However, in these experiments varietie...

  4. Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology

    Science.gov (United States)

    Rossin, Elizabeth J.; Lage, Kasper; Raychaudhuri, Soumya; Xavier, Ramnik J.; Tatar, Diana; Benita, Yair

    2011-01-01

    Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein–protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in
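
    The idea of testing whether a network is "more densely connected than chance expectation" can be sketched in Python as a simple permutation test. This is an illustrative simplification, not the authors' method: it draws random gene sets uniformly from a background list, whereas the abstract only says that "multiple permutation approaches" were applied. The names `count_edges` and `connectivity_p_value` are hypothetical, and the interaction list is assumed to hold each unordered pair once.

```python
import random


def count_edges(genes, edges):
    """Count interactions whose two endpoints both lie in the gene set."""
    gene_set = set(genes)
    return sum(1 for a, b in edges if a in gene_set and b in gene_set)


def connectivity_p_value(candidates, all_genes, edges, n_perm=1000, seed=0):
    """Empirical p-value for the number of interactions among candidates.

    Null model (an assumption of this sketch): gene sets of the same size
    drawn uniformly at random from all_genes, ignoring node degree.
    """
    rng = random.Random(seed)
    observed = count_edges(candidates, edges)
    pool = sorted(set(all_genes))
    size = len(set(candidates))
    hits = 0
    for _ in range(n_perm):
        sample = rng.sample(pool, size)
        if count_edges(sample, edges) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return observed, (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Toy interaction network with made-up gene labels.
    toy_edges = [("A", "B"), ("B", "C"), ("A", "C"), ("D", "E")]
    toy_genes = ["A", "B", "C", "D", "E", "F", "G", "H"]
    obs, p = connectivity_p_value(["A", "B", "C"], toy_genes, toy_edges)
    print(f"observed edges = {obs}, empirical p = {p:.4f}")
```

    A uniform null tends to overstate significance when associated genes are well-studied hubs, so a degree-preserving permutation would be a more conservative choice in practice.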

  5. Epigenetic Variation in Monozygotic Twins: A Genome-Wide Analysis of DNA Methylation in Buccal Cells

    NARCIS (Netherlands)

    van Dongen, J.; Ehli, E.A.; Slieker, R.C.; Bartels, M.; Weber, Z.M.; Davies, G.E.; Slagboom, P.E.; Heijmans, B.T.; Boomsma, D.I.

    2014-01-01

    DNA methylation is one of the most extensively studied epigenetic marks in humans. Yet, it is largely unknown what causes variation in DNA methylation between individuals. The comparison of DNA methylation profiles of monozygotic (MZ) twins offers a unique experimental design to examine the extent

  6. Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing

    NARCIS (Netherlands)

    Aflitos, S.A.; Schijlen, E.G.W.M.; Jong, de J.H.S.G.M.; Ridder, de D.; Smit, S.; Finkers, H.J.; Bakker, F.T.; Geest, van de H.C.; Lintel Hekkert, te B.; Haarst, van J.C.; Smits, L.W.M.; Koops, A.J.; Sanchez-Perez, M.J.; Heusden, van A.W.; Visser, R.G.F.; Schranz, M.E.; Peters, S.A.

    2014-01-01

    We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative for the Lycopersicon, Arcanum, Eriopersicon, and Neolycopersicon groups which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new

  7. Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing

    NARCIS (Netherlands)

    Aflitos, S.; Schijlen, E.; de Jong, H.; de Ridder, D.; Smit, S.; Finkers, R.; Wang, J.; Zhang, G.; Li, N.; Mao, L.; Bakker, F.; Dirks, R.; Breit, T.; Gravendeel, B.; Huits, H.; Struss, D.; Swanson-Wagner, R.; van Leeuwen, H.; van Ham, R.C.H.J.; Fito, L.; Guignier, L.; Sevilla, M.; Ellul, P.; Ganko, E.; Kapur, A.; Reclus, E.; de Geus, B.; van de Geest, H.; te Lintel Hekkert, B.; van Haarst, J.; Smits, L.; Koops, A.; Sanchez-Perez, G.; van Heusden, A.W.; Visser, R.; Quan, Z.; Min, J.; Liao, L.; Wang, X.; Wang, G.; Yue, Z.; Yang, X.; Xu, N.; Schranz, E.; Smets, E.; Vos, R.; Rauwerda, J.; Ursem, R.; Schuit, C.; Kerns, M.; van den Berg, J.; Vriezen, W.; Janssen, A.; Datema, E.; Jahrman, T.; Moquet, F.; Bonnet, J.; Peters, S.

    2014-01-01

    We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new

  8. Defining the role of common variation in the genomic and biological architecture of adult human height.

    Science.gov (United States)

    Wood, Andrew R; Esko, Tonu; Yang, Jian; Vedantam, Sailaja; Pers, Tune H; Gustafsson, Stefan; Chu, Audrey Y; Estrada, Karol; Luan, Jian'an; Kutalik, Zoltán; Amin, Najaf; Buchkovich, Martin L; Croteau-Chonka, Damien C; Day, Felix R; Duan, Yanan; Fall, Tove; Fehrmann, Rudolf; Ferreira, Teresa; Jackson, Anne U; Karjalainen, Juha; Lo, Ken Sin; Locke, Adam E; Mägi, Reedik; Mihailov, Evelin; Porcu, Eleonora; Randall, Joshua C; Scherag, André; Vinkhuyzen, Anna A E; Westra, Harm-Jan; Winkler, Thomas W; Workalemahu, Tsegaselassie; Zhao, Jing Hua; Absher, Devin; Albrecht, Eva; Anderson, Denise; Baron, Jeffrey; Beekman, Marian; Demirkan, Ayse; Ehret, Georg B; Feenstra, Bjarke; Feitosa, Mary F; Fischer, Krista; Fraser, Ross M; Goel, Anuj; Gong, Jian; Justice, Anne E; Kanoni, Stavroula; Kleber, Marcus E; Kristiansson, Kati; Lim, Unhee; Lotay, Vaneet; Lui, Julian C; Mangino, Massimo; Mateo Leach, Irene; Medina-Gomez, Carolina; Nalls, Michael A; Nyholt, Dale R; Palmer, Cameron D; Pasko, Dorota; Pechlivanis, Sonali; Prokopenko, Inga; Ried, Janina S; Ripke, Stephan; Shungin, Dmitry; Stancáková, Alena; Strawbridge, Rona J; Sung, Yun Ju; Tanaka, Toshiko; Teumer, Alexander; Trompet, Stella; van der Laan, Sander W; van Setten, Jessica; Van Vliet-Ostaptchouk, Jana V; Wang, Zhaoming; Yengo, Loïc; Zhang, Weihua; Afzal, Uzma; Arnlöv, Johan; Arscott, Gillian M; Bandinelli, Stefania; Barrett, Amy; Bellis, Claire; Bennett, Amanda J; Berne, Christian; Blüher, Matthias; Bolton, Jennifer L; Böttcher, Yvonne; Boyd, Heather A; Bruinenberg, Marcel; Buckley, Brendan M; Buyske, Steven; Caspersen, Ida H; Chines, Peter S; Clarke, Robert; Claudi-Boehm, Simone; Cooper, Matthew; Daw, E Warwick; De Jong, Pim A; Deelen, Joris; Delgado, Graciela; Denny, Josh C; Dhonukshe-Rutten, Rosalie; Dimitriou, Maria; Doney, Alex S F; Dörr, Marcus; Eklund, Niina; Eury, Elodie; Folkersen, Lasse; Garcia, Melissa E; Geller, Frank; Giedraitis, Vilmantas; Go, Alan S; Grallert, Harald; Grammer, Tanja B; Gräßler, Jürgen; 
Grönberg, Henrik; de Groot, Lisette C P G M; Groves, Christopher J; Haessler, Jeffrey; Hall, Per; Haller, Toomas; Hallmans, Goran; Hannemann, Anke; Hartman, Catharina A; Hassinen, Maija; Hayward, Caroline; Heard-Costa, Nancy L; Helmer, Quinta; Hemani, Gibran; Henders, Anjali K; Hillege, Hans L; Hlatky, Mark A; Hoffmann, Wolfgang; Hoffmann, Per; Holmen, Oddgeir; Houwing-Duistermaat, Jeanine J; Illig, Thomas; Isaacs, Aaron; James, Alan L; Jeff, Janina; Johansen, Berit; Johansson, Åsa; Jolley, Jennifer; Juliusdottir, Thorhildur; Junttila, Juhani; Kho, Abel N; Kinnunen, Leena; Klopp, Norman; Kocher, Thomas; Kratzer, Wolfgang; Lichtner, Peter; Lind, Lars; Lindström, Jaana; Lobbens, Stéphane; Lorentzon, Mattias; Lu, Yingchang; Lyssenko, Valeriya; Magnusson, Patrik K E; Mahajan, Anubha; Maillard, Marc; McArdle, Wendy L; McKenzie, Colin A; McLachlan, Stela; McLaren, Paul J; Menni, Cristina; Merger, Sigrun; Milani, Lili; Moayyeri, Alireza; Monda, Keri L; Morken, Mario A; Müller, Gabriele; Müller-Nurasyid, Martina; Musk, Arthur W; Narisu, Narisu; Nauck, Matthias; Nolte, Ilja M; Nöthen, Markus M; Oozageer, Laticia; Pilz, Stefan; Rayner, Nigel W; Renstrom, Frida; Robertson, Neil R; Rose, Lynda M; Roussel, Ronan; Sanna, Serena; Scharnagl, Hubert; Scholtens, Salome; Schumacher, Fredrick R; Schunkert, Heribert; Scott, Robert A; Sehmi, Joban; Seufferlein, Thomas; Shi, Jianxin; Silventoinen, Karri; Smit, Johannes H; Smith, Albert Vernon; Smolonska, Joanna; Stanton, Alice V; Stirrups, Kathleen; Stott, David J; Stringham, Heather M; Sundström, Johan; Swertz, Morris A; Syvänen, Ann-Christine; Tayo, Bamidele O; Thorleifsson, Gudmar; Tyrer, Jonathan P; van Dijk, Suzanne; van Schoor, Natasja M; van der Velde, Nathalie; van Heemst, Diana; van Oort, Floor V A; Vermeulen, Sita H; Verweij, Niek; Vonk, Judith M; Waite, Lindsay L; Waldenberger, Melanie; Wennauer, Roman; Wilkens, Lynne R; Willenborg, Christina; Wilsgaard, Tom; Wojczynski, Mary K; Wong, Andrew; Wright, Alan F; Zhang, Qunyuan; 
Arveiler, Dominique; Bakker, Stephan J L; Beilby, John; Bergman, Richard N; Bergmann, Sven; Biffar, Reiner; Blangero, John; Boomsma, Dorret I; Bornstein, Stefan R; Bovet, Pascal; Brambilla, Paolo; Brown, Morris J; Campbell, Harry; Caulfield, Mark J; Chakravarti, Aravinda; Collins, Rory; Collins, Francis S; Crawford, Dana C; Cupples, L Adrienne; Danesh, John; de Faire, Ulf; den Ruijter, Hester M; Erbel, Raimund; Erdmann, Jeanette; Eriksson, Johan G; Farrall, Martin; Ferrannini, Ele; Ferrières, Jean; Ford, Ian; Forouhi, Nita G; Forrester, Terrence; Gansevoort, Ron T; Gejman, Pablo V; Gieger, Christian; Golay, Alain; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Haas, David W; Hall, Alistair S; Harris, Tamara B; Hattersley, Andrew T; Heath, Andrew C; Hengstenberg, Christian; Hicks, Andrew A; Hindorff, Lucia A; Hingorani, Aroon D; Hofman, Albert; Hovingh, G Kees; Humphries, Steve E; Hunt, Steven C; Hypponen, Elina; Jacobs, Kevin B; Jarvelin, Marjo-Riitta; Jousilahti, Pekka; Jula, Antti M; Kaprio, Jaakko; Kastelein, John J P; Kayser, Manfred; Kee, Frank; Keinanen-Kiukaanniemi, Sirkka M; Kiemeney, Lambertus A; Kooner, Jaspal S; Kooperberg, Charles; Koskinen, Seppo; Kovacs, Peter; Kraja, Aldi T; Kumari, Meena; Kuusisto, Johanna; Lakka, Timo A; Langenberg, Claudia; Le Marchand, Loic; Lehtimäki, Terho; Lupoli, Sara; Madden, Pamela A F; Männistö, Satu; Manunta, Paolo; Marette, André; Matise, Tara C; McKnight, Barbara; Meitinger, Thomas; Moll, Frans L; Montgomery, Grant W; Morris, Andrew D; Morris, Andrew P; Murray, Jeffrey C; Nelis, Mari; Ohlsson, Claes; Oldehinkel, Albertine J; Ong, Ken K; Ouwehand, Willem H; Pasterkamp, Gerard; Peters, Annette; Pramstaller, Peter P; Price, Jackie F; Qi, Lu; Raitakari, Olli T; Rankinen, Tuomo; Rao, D C; Rice, Treva K; Ritchie, Marylyn; Rudan, Igor; Salomaa, Veikko; Samani, Nilesh J; Saramies, Jouko; Sarzynski, Mark A; Schwarz, Peter E H; Sebert, Sylvain; Sever, Peter; Shuldiner, Alan R; Sinisalo, Juha; Steinthorsdottir, Valgerdur; 
Stolk, Ronald P; Tardif, Jean-Claude; Tönjes, Anke; Tremblay, Angelo; Tremoli, Elena; Virtamo, Jarmo; Vohl, Marie-Claude; Amouyel, Philippe; Asselbergs, Folkert W; Assimes, Themistocles L; Bochud, Murielle; Boehm, Bernhard O; Boerwinkle, Eric; Bottinger, Erwin P; Bouchard, Claude; Cauchi, Stéphane; Chambers, John C; Chanock, Stephen J; Cooper, Richard S; de Bakker, Paul I W; Dedoussis, George; Ferrucci, Luigi; Franks, Paul W; Froguel, Philippe; Groop, Leif C; Haiman, Christopher A; Hamsten, Anders; Hayes, M Geoffrey; Hui, Jennie; Hunter, David J; Hveem, Kristian; Jukema, J Wouter; Kaplan, Robert C; Kivimaki, Mika; Kuh, Diana; Laakso, Markku; Liu, Yongmei; Martin, Nicholas G; März, Winfried; Melbye, Mads; Moebus, Susanne; Munroe, Patricia B; Njølstad, Inger; Oostra, Ben A; Palmer, Colin N A; Pedersen, Nancy L; Perola, Markus; Pérusse, Louis; Peters, Ulrike; Powell, Joseph E; Power, Chris; Quertermous, Thomas; Rauramaa, Rainer; Reinmaa, Eva; Ridker, Paul M; Rivadeneira, Fernando; Rotter, Jerome I; Saaristo, Timo E; Saleheen, Danish; Schlessinger, David; Slagboom, P Eline; Snieder, Harold; Spector, Tim D; Strauch, Konstantin; Stumvoll, Michael; Tuomilehto, Jaakko; Uusitupa, Matti; van der Harst, Pim; Völzke, Henry; Walker, Mark; Wareham, Nicholas J; Watkins, Hugh; Wichmann, H-Erich; Wilson, James F; Zanen, Pieter; Deloukas, Panos; Heid, Iris M; Lindgren, Cecilia M; Mohlke, Karen L; Speliotes, Elizabeth K; Thorsteinsdottir, Unnur; Barroso, Inês; Fox, Caroline S; North, Kari E; Strachan, David P; Beckmann, Jacques S; Berndt, Sonja I; Boehnke, Michael; Borecki, Ingrid B; McCarthy, Mark I; Metspalu, Andres; Stefansson, Kari; Uitterlinden, André G; van Duijn, Cornelia M; Franke, Lude; Willer, Cristen J; Price, Alkes L; Lettre, Guillaume; Loos, Ruth J F; Weedon, Michael N; Ingelsson, Erik; O'Connell, Jeffrey R; Abecasis, Goncalo R; Chasman, Daniel I; Goddard, Michael E; Visscher, Peter M; Hirschhorn, Joel N; Frayling, Timothy M

    2014-11-01

    Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ∼2,000, ∼3,700 and ∼9,500 SNPs explained ∼21%, ∼24% and ∼29% of phenotypic variance. Furthermore, all common variants together captured 60% of heritability. The 697 variants clustered in 423 loci were enriched for genes, pathways and tissue types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/β-catenin and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants.

  9. Defining the role of common variation in the genomic and biological architecture of adult human height

    Science.gov (United States)

    Chu, Audrey Y; Estrada, Karol; Luan, Jian’an; Kutalik, Zoltán; Amin, Najaf; Buchkovich, Martin L; Croteau-Chonka, Damien C; Day, Felix R; Duan, Yanan; Fall, Tove; Fehrmann, Rudolf; Ferreira, Teresa; Jackson, Anne U; Karjalainen, Juha; Lo, Ken Sin; Locke, Adam E; Mägi, Reedik; Mihailov, Evelin; Porcu, Eleonora; Randall, Joshua C; Scherag, André; Vinkhuyzen, Anna AE; Westra, Harm-Jan; Winkler, Thomas W; Workalemahu, Tsegaselassie; Zhao, Jing Hua; Absher, Devin; Albrecht, Eva; Anderson, Denise; Baron, Jeffrey; Beekman, Marian; Demirkan, Ayse; Ehret, Georg B; Feenstra, Bjarke; Feitosa, Mary F; Fischer, Krista; Fraser, Ross M; Goel, Anuj; Gong, Jian; Justice, Anne E; Kanoni, Stavroula; Kleber, Marcus E; Kristiansson, Kati; Lim, Unhee; Lotay, Vaneet; Lui, Julian C; Mangino, Massimo; Leach, Irene Mateo; Medina-Gomez, Carolina; Nalls, Michael A; Nyholt, Dale R; Palmer, Cameron D; Pasko, Dorota; Pechlivanis, Sonali; Prokopenko, Inga; Ried, Janina S; Ripke, Stephan; Shungin, Dmitry; Stancáková, Alena; Strawbridge, Rona J; Sung, Yun Ju; Tanaka, Toshiko; Teumer, Alexander; Trompet, Stella; van der Laan, Sander W; van Setten, Jessica; Van Vliet-Ostaptchouk, Jana V; Wang, Zhaoming; Yengo, Loïc; Zhang, Weihua; Afzal, Uzma; Ärnlöv, Johan; Arscott, Gillian M; Bandinelli, Stefania; Barrett, Amy; Bellis, Claire; Bennett, Amanda J; Berne, Christian; Blüher, Matthias; Bolton, Jennifer L; Böttcher, Yvonne; Boyd, Heather A; Bruinenberg, Marcel; Buckley, Brendan M; Buyske, Steven; Caspersen, Ida H; Chines, Peter S; Clarke, Robert; Claudi-Boehm, Simone; Cooper, Matthew; Daw, E Warwick; De Jong, Pim A; Deelen, Joris; Delgado, Graciela; Denny, Josh C; Dhonukshe-Rutten, Rosalie; Dimitriou, Maria; Doney, Alex SF; Dörr, Marcus; Eklund, Niina; Eury, Elodie; Folkersen, Lasse; Garcia, Melissa E; Geller, Frank; Giedraitis, Vilmantas; Go, Alan S; Grallert, Harald; Grammer, Tanja B; Gräßler, Jürgen; Grönberg, Henrik; de Groot, Lisette C.P.G.M.; Groves, Christopher J; Haessler, Jeffrey; Hall, Per; 
Haller, Toomas; Hallmans, Goran; Hannemann, Anke; Hartman, Catharina A; Hassinen, Maija; Hayward, Caroline; Heard-Costa, Nancy L; Helmer, Quinta; Hemani, Gibran; Henders, Anjali K; Hillege, Hans L; Hlatky, Mark A; Hoffmann, Wolfgang; Hoffmann, Per; Holmen, Oddgeir; Houwing-Duistermaat, Jeanine J; Illig, Thomas; Isaacs, Aaron; James, Alan L; Jeff, Janina; Johansen, Berit; Johansson, Åsa; Jolley, Jennifer; Juliusdottir, Thorhildur; Junttila, Juhani; Kho, Abel N; Kinnunen, Leena; Klopp, Norman; Kocher, Thomas; Kratzer, Wolfgang; Lichtner, Peter; Lind, Lars; Lindström, Jaana; Lobbens, Stéphane; Lorentzon, Mattias; Lu, Yingchang; Lyssenko, Valeriya; Magnusson, Patrik KE; Mahajan, Anubha; Maillard, Marc; McArdle, Wendy L; McKenzie, Colin A; McLachlan, Stela; McLaren, Paul J; Menni, Cristina; Merger, Sigrun; Milani, Lili; Moayyeri, Alireza; Monda, Keri L; Morken, Mario A; Müller, Gabriele; Müller-Nurasyid, Martina; Musk, Arthur W; Narisu, Narisu; Nauck, Matthias; Nolte, Ilja M; Nöthen, Markus M; Oozageer, Laticia; Pilz, Stefan; Rayner, Nigel W; Renstrom, Frida; Robertson, Neil R; Rose, Lynda M; Roussel, Ronan; Sanna, Serena; Scharnagl, Hubert; Scholtens, Salome; Schumacher, Fredrick R; Schunkert, Heribert; Scott, Robert A; Sehmi, Joban; Seufferlein, Thomas; Shi, Jianxin; Silventoinen, Karri; Smit, Johannes H; Smith, Albert Vernon; Smolonska, Joanna; Stanton, Alice V; Stirrups, Kathleen; Stott, David J; Stringham, Heather M; Sundström, Johan; Swertz, Morris A; Syvänen, Ann-Christine; Tayo, Bamidele O; Thorleifsson, Gudmar; Tyrer, Jonathan P; van Dijk, Suzanne; van Schoor, Natasja M; van der Velde, Nathalie; van Heemst, Diana; van Oort, Floor VA; Vermeulen, Sita H; Verweij, Niek; Vonk, Judith M; Waite, Lindsay L; Waldenberger, Melanie; Wennauer, Roman; Wilkens, Lynne R; Willenborg, Christina; Wilsgaard, Tom; Wojczynski, Mary K; Wong, Andrew; Wright, Alan F; Zhang, Qunyuan; Arveiler, Dominique; Bakker, Stephan JL; Beilby, John; Bergman, Richard N; Bergmann, Sven; Biffar, 
Reiner; Blangero, John; Boomsma, Dorret I; Bornstein, Stefan R; Bovet, Pascal; Brambilla, Paolo; Brown, Morris J; Campbell, Harry; Caulfield, Mark J; Chakravarti, Aravinda; Collins, Rory; Collins, Francis S; Crawford, Dana C; Cupples, L Adrienne; Danesh, John; de Faire, Ulf; den Ruijter, Hester M; Erbel, Raimund; Erdmann, Jeanette; Eriksson, Johan G; Farrall, Martin; Ferrannini, Ele; Ferrières, Jean; Ford, Ian; Forouhi, Nita G; Forrester, Terrence; Gansevoort, Ron T; Gejman, Pablo V; Gieger, Christian; Golay, Alain; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Haas, David W; Hall, Alistair S; Harris, Tamara B; Hattersley, Andrew T; Heath, Andrew C; Hengstenberg, Christian; Hicks, Andrew A; Hindorff, Lucia A; Hingorani, Aroon D; Hofman, Albert; Hovingh, G Kees; Humphries, Steve E; Hunt, Steven C; Hypponen, Elina; Jacobs, Kevin B; Jarvelin, Marjo-Riitta; Jousilahti, Pekka; Jula, Antti M; Kaprio, Jaakko; Kastelein, John JP; Kayser, Manfred; Kee, Frank; Keinanen-Kiukaanniemi, Sirkka M; Kiemeney, Lambertus A; Kooner, Jaspal S; Kooperberg, Charles; Koskinen, Seppo; Kovacs, Peter; Kraja, Aldi T; Kumari, Meena; Kuusisto, Johanna; Lakka, Timo A; Langenberg, Claudia; Le Marchand, Loic; Lehtimäki, Terho; Lupoli, Sara; Madden, Pamela AF; Männistö, Satu; Manunta, Paolo; Marette, André; Matise, Tara C; McKnight, Barbara; Meitinger, Thomas; Moll, Frans L; Montgomery, Grant W; Morris, Andrew D; Morris, Andrew P; Murray, Jeffrey C; Nelis, Mari; Ohlsson, Claes; Oldehinkel, Albertine J; Ong, Ken K; Ouwehand, Willem H; Pasterkamp, Gerard; Peters, Annette; Pramstaller, Peter P; Price, Jackie F; Qi, Lu; Raitakari, Olli T; Rankinen, Tuomo; Rao, DC; Rice, Treva K; Ritchie, Marylyn; Rudan, Igor; Salomaa, Veikko; Samani, Nilesh J; Saramies, Jouko; Sarzynski, Mark A; Schwarz, Peter EH; Sebert, Sylvain; Sever, Peter; Shuldiner, Alan R; Sinisalo, Juha; Steinthorsdottir, Valgerdur; Stolk, Ronald P; Tardif, Jean-Claude; Tönjes, Anke; Tremblay, Angelo; Tremoli, Elena; Virtamo, Jarmo; 
Vohl, Marie-Claude; Amouyel, Philippe; Asselbergs, Folkert W; Assimes, Themistocles L; Bochud, Murielle; Boehm, Bernhard O; Boerwinkle, Eric; Bottinger, Erwin P; Bouchard, Claude; Cauchi, Stéphane; Chambers, John C; Chanock, Stephen J; Cooper, Richard S; de Bakker, Paul IW; Dedoussis, George; Ferrucci, Luigi; Franks, Paul W; Froguel, Philippe; Groop, Leif C; Haiman, Christopher A; Hamsten, Anders; Hayes, M Geoffrey; Hui, Jennie; Hunter, David J.; Hveem, Kristian; Jukema, J Wouter; Kaplan, Robert C; Kivimaki, Mika; Kuh, Diana; Laakso, Markku; Liu, Yongmei; Martin, Nicholas G; März, Winfried; Melbye, Mads; Moebus, Susanne; Munroe, Patricia B; Njølstad, Inger; Oostra, Ben A; Palmer, Colin NA; Pedersen, Nancy L; Perola, Markus; Pérusse, Louis; Peters, Ulrike; Powell, Joseph E; Power, Chris; Quertermous, Thomas; Rauramaa, Rainer; Reinmaa, Eva; Ridker, Paul M; Rivadeneira, Fernando; Rotter, Jerome I; Saaristo, Timo E; Saleheen, Danish; Schlessinger, David; Slagboom, P Eline; Snieder, Harold; Spector, Tim D; Strauch, Konstantin; Stumvoll, Michael; Tuomilehto, Jaakko; Uusitupa, Matti; van der Harst, Pim; Völzke, Henry; Walker, Mark; Wareham, Nicholas J; Watkins, Hugh; Wichmann, H-Erich; Wilson, James F; Zanen, Pieter; Deloukas, Panos; Heid, Iris M; Lindgren, Cecilia M; Mohlke, Karen L; Speliotes, Elizabeth K; Thorsteinsdottir, Unnur; Barroso, Inês; Fox, Caroline S; North, Kari E; Strachan, David P; Beckmann, Jacques S.; Berndt, Sonja I; Boehnke, Michael; Borecki, Ingrid B; McCarthy, Mark I; Metspalu, Andres; Stefansson, Kari; Uitterlinden, André G; van Duijn, Cornelia M; Franke, Lude; Willer, Cristen J; Price, Alkes L.; Lettre, Guillaume; Loos, Ruth JF; Weedon, Michael N; Ingelsson, Erik; O’Connell, Jeffrey R; Abecasis, Goncalo R; Chasman, Daniel I; Goddard, Michael E

    2014-01-01

    Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explain one-fifth of heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ~2,000, ~3,700 and ~9,500 SNPs explained ~21%, ~24% and ~29% of phenotypic variance. Furthermore, all common variants together captured the majority (60%) of heritability. The 697 variants clustered in 423 loci enriched for genes, pathways, and tissue-types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/beta-catenin, and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants. PMID:25282103

  10. Genomic selection in plant breeding.

    Science.gov (United States)

    Newell, Mark A; Jannink, Jean-Luc

    2014-01-01

    Genomic selection (GS) is a method to predict the genetic value of selection candidates based on the genomic estimated breeding value (GEBV) predicted from high-density markers positioned throughout the genome. Unlike marker-assisted selection, the GEBV is based on all markers including both minor and major marker effects. Thus, the GEBV may capture more of the genetic variation for the particular trait under selection.
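
    As an illustration of how a GEBV can draw on every marker at once, the sketch below fits one shrunken effect per marker with ridge regression and sums those effects for each selection candidate. Ridge regression is an assumption made here for illustration; the abstract does not say which statistical model is used, and the function names and the `shrinkage` parameter are hypothetical.

```python
import numpy as np

def ridge_marker_effects(genotypes, phenotypes, shrinkage):
    """Estimate one effect per marker by ridge regression.

    Ridge regression is an assumed stand-in for a genomic selection model;
    the source abstract does not name a specific method.
    genotypes: (n_individuals, n_markers) allele counts, e.g. 0/1/2.
    """
    X = np.asarray(genotypes, dtype=float)
    y = np.asarray(phenotypes, dtype=float)
    marker_means = X.mean(axis=0)
    Xc = X - marker_means            # centre each marker column
    yc = y - y.mean()                # centre the phenotype
    n_markers = X.shape[1]
    # Solve (Xc'Xc + shrinkage * I) effects = Xc'yc
    effects = np.linalg.solve(Xc.T @ Xc + shrinkage * np.eye(n_markers), Xc.T @ yc)
    return effects, marker_means, y.mean()

def gebv(candidate_genotypes, effects, marker_means):
    """Genomic estimated breeding value: sum of all marker effects per candidate."""
    Xc = np.asarray(candidate_genotypes, dtype=float) - marker_means
    return Xc @ effects
```

    Because no marker is dropped for failing a significance threshold, small-effect markers still contribute to the prediction, which is the contrast with marker-assisted selection drawn above. A larger `shrinkage` value pulls all effects toward zero.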

  11. Bayesian Nonparametric Hidden Markov Models with application to the analysis of copy-number-variation in mammalian genomes.

    Science.gov (United States)

    Yau, C; Papaspiliopoulos, O; Roberts, G O; Holmes, C

    2011-01-01

    We consider the development of Bayesian Nonparametric methods for product partition models such as Hidden Markov Models and change point models. Our approach uses a Mixture of Dirichlet Process (MDP) model for the unknown sampling distribution (likelihood) for the observations arising in each state and a computationally efficient data augmentation scheme to aid inference. The method uses novel MCMC methodology which combines recent retrospective sampling methods with the use of slice sampler variables. The methodology is computationally efficient, both in terms of MCMC mixing properties, and robustness to the length of the time series being investigated. Moreover, the method is easy to implement requiring little or no user-interaction. We apply our methodology to the analysis of genomic copy number variation.

  12. Rare Genome-Wide Copy Number Variation and Expression of Schizophrenia in 22q11.2 Deletion Syndrome.

    Science.gov (United States)

    Bassett, Anne S; Lowther, Chelsea; Merico, Daniele; Costain, Gregory; Chow, Eva W C; van Amelsvoort, Therese; McDonald-McGinn, Donna; Gur, Raquel E; Swillen, Ann; Van den Bree, Marianne; Murphy, Kieran; Gothelf, Doron; Bearden, Carrie E; Eliez, Stephan; Kates, Wendy; Philip, Nicole; Sashi, Vandana; Campbell, Linda; Vorstman, Jacob; Cubells, Joseph; Repetto, Gabriela M; Simon, Tony; Boot, Erik; Heung, Tracy; Evers, Rens; Vingerhoets, Claudia; van Duin, Esther; Zackai, Elaine; Vergaelen, Elfi; Devriendt, Koen; Vermeesch, Joris R; Owen, Michael; Murphy, Clodagh; Michaelovosky, Elena; Kushan, Leila; Schneider, Maude; Fremont, Wanda; Busa, Tiffany; Hooper, Stephen; McCabe, Kathryn; Duijff, Sasja; Isaev, Karin; Pellecchia, Giovanna; Wei, John; Gazzellone, Matthew J; Scherer, Stephen W; Emanuel, Beverly S; Guo, Tingwei; Morrow, Bernice E; Marshall, Christian R

    2017-11-01

    Chromosome 22q11.2 deletion syndrome (22q11.2DS) is associated with a more than 20-fold increased risk for developing schizophrenia. The aim of this study was to identify additional genetic factors (i.e., "second hits") that may contribute to schizophrenia expression. Through an international consortium, the authors obtained DNA samples from 329 psychiatrically phenotyped subjects with 22q11.2DS. Using a high-resolution microarray platform and established methods to assess copy number variation (CNV), the authors compared the genome-wide burden of rare autosomal CNV, outside of the 22q11.2 deletion region, between two groups: a schizophrenia group and those with no psychotic disorder at age ≥25 years. The authors assessed whether genes overlapped by rare CNVs were overrepresented in functional pathways relevant to schizophrenia. Rare CNVs overlapping one or more protein-coding genes revealed significant between-group differences. For rare exonic duplications, six of 19 gene sets tested were enriched in the schizophrenia group; genes associated with abnormal nervous system phenotypes remained significant in a stepwise logistic regression model and showed significant interactions with 22q11.2 deletion region genes in a connectivity analysis. For rare exonic deletions, the schizophrenia group had, on average, more genes overlapped. The additional rare CNVs implicated known (e.g., GRM7, 15q13.3, 16p12.2) and novel schizophrenia risk genes and loci. The results suggest that additional rare CNVs overlapping genes outside of the 22q11.2 deletion region contribute to schizophrenia risk in 22q11.2DS, supporting a multigenic hypothesis for schizophrenia. The findings have implications for understanding expression of psychotic illness and herald the importance of whole-genome sequencing to appreciate the overall genomic architecture of schizophrenia.

  13. Simultaneous inference of selection and population growth from patterns of variation in the human genome

    DEFF Research Database (Denmark)

    Williamson, Scott H.; Hernandez, Ryan; Fledel-Alon, Adi

    2005-01-01

    Natural selection and demographic forces can have similar effects on patterns of DNA polymorphism. Therefore, to infer selection from samples of DNA sequences, one must simultaneously account for demographic effects. Here we take a model-based approach to this problem by developing predictions for patterns of polymorphism in the presence of both population size change and natural selection. If data are available from different functional classes of variation, and a priori information suggests that mutations in one of those classes are selectively neutral, then the putatively neutral class can ... this method to a large polymorphism data set from 301 human genes and find (i) widespread negative selection acting on standing nonsynonymous variation, (ii) that the fitness effects of nonsynonymous mutations are well predicted by several measures of amino acid exchangeability, especially site-specific methods, and (iii) strong evidence for very recent population growth.

  14. Genomic and proteomic analysis of soybean heritable variations induced by space flight

    Institute of Scientific and Technical Information of China (English)

    HE Jie; GAO Yong; SUN Ye-qing

    2009-01-01

    To analyze the biological effects of the space environment, the genomic DNA diversity between the space-flown soybean line 194(4126), which shows good yield and good fruit quality induced by space flight, and the ground control soybean was studied by the amplified fragment length polymorphism (AFLP) method; the polymorphism of space flight soybean 194(4126) was 3.56%. Differences in protein expression in seeds and leaves between the two kinds of soybeans were analysed by two-dimensional electrophoresis, PDQuest software and MALDI-TOF mass spectrometry. Results show that the loss and decrease of protein expression in 194(4126) soybean are attributable to the space flight of seeds, and three special proteins, Dehydrin, MAT1 and ceQORH, are identified. It is concluded that the space environment changes the phenotype and genotype of soybeans due to the space flight of seeds.

  15. Direct linkage of mitochondrial genome variation to risk factors for type 2 diabetes in conplastic strains

    Czech Academy of Sciences Publication Activity Database

    Pravenec, Michal; Hyakukoku, M.; Houštěk, Josef; Zídek, Václav; Landa, Vladimír; Mlejnek, Petr; Mikšík, Ivan; Mothejzíková-Dudová, Kristýna; Pecina, Petr; Vrbacký, Marek; Drahota, Zdeněk; Vojtíšková, Alena; Mráček, Tomáš; Kazdová, L.; Oliyarnyk, O.; Wang, Ji.; Ho, Ch.; Qi, N.; Sugimoto, K.; Kurtz, T.

    2007-01-01

    Vol. 17, No. 9 (2007), pp. 1319-1326 ISSN 1088-9051 R&D Projects: GA MŠk(CZ) 1M0520; GA ČR(CZ) GA301/06/0028; GA ČR GA303/07/0781 Grant - others: GA UK(CZ) 24/2005; GA UK(CZ) 26/2005; National Institutes of Health(US) HL35018; National Institutes of Health(US) HL56028; National Institutes of Health(US) HL63709; EURATOOLS(XE) LSHG-CT-2005-019015 Institutional research plan: CEZ:AV0Z50110509 Source of funding: R - EC framework project Keywords: mitochondrial genome * conplastic strains * risk factors for type 2 diabetes Subject RIV: EB - Genetics; Molecular Biology Impact factor: 11.224, year: 2007

  16. Genomic and transcriptome profiling identified both human and HBV genetic variations and their interactions in Chinese hepatocellular carcinoma

    Directory of Open Access Journals (Sweden)

    Hua Dong

    2015-12-01

    Full Text Available Interaction between HBV and host genome integrations in hepatocellular carcinoma (HCC) development is a complex process and the mechanism is still unclear. Here we described in detail the quality controls and data mining of aCGH and transcriptome sequencing data on 50 HCC samples from Chinese patients, published by Dong et al. (2015) (GEO#: GSE65486). In addition to the HBV-MLL4 integration discovered, we also investigated the genetic aberrations of HBV and host genes as well as their genetic interactions. We reported human genome copy number changes and frequent transcriptome variations (e.g. TP53, CTNNB1 mutation, especially MLL family mutations) in this cohort of patients. For HBV genotype C, we identified a novel linkage disequilibrium region covering HBV replication regulatory elements, including basal core promoter, DR1, epsilon and poly-A regions, which is associated with HBV core antigen over-expression and almost exclusive to HBV-MLL4 integration.

  17. Variation in the complex carbohydrate biosynthesis loci of Acinetobacter baumannii genomes.

    Directory of Open Access Journals (Sweden)

    Johanna J Kenyon

    Full Text Available Extracellular polysaccharides are major immunogenic components of the bacterial cell envelope. However, little is known about their biosynthesis in the genus Acinetobacter, which includes A. baumannii, an important nosocomial pathogen. Whether Acinetobacter sp. produce a capsule or a lipopolysaccharide carrying an O antigen or both is not resolved. To explore these issues, genes involved in the synthesis of complex polysaccharides were located in 10 complete A. baumannii genome sequences, and the function of each of their products was predicted via comparison to enzymes with a known function. The absence of a gene encoding a WaaL ligase, required to link the carbohydrate polymer to the lipid A-core oligosaccharide (lipooligosaccharide) forming lipopolysaccharide, suggests that only a capsule is produced. Nine distinct arrangements of a large capsule biosynthesis locus, designated KL1 to KL9, were found in the genomes. Three forms of a second, smaller variable locus, likely to be required for synthesis of the outer core of the lipid A-core moiety, were designated OCL1 to OCL3 and also annotated. Each K locus includes genes for capsule export as well as genes for synthesis of activated sugar precursors, and for glycosyltransfer, glycan modification and oligosaccharide repeat-unit processing. The K loci all include the export genes at one end and genes for synthesis of common sugar precursors at the other, with a highly variable region that includes the remaining genes in between. Five different capsule loci, KL2, KL6, KL7, KL8 and KL9, were detected in multiply antibiotic resistant isolates belonging to global clone 2, and two other loci, KL1 and KL4, in global clone 1. This indicates that this region is being substituted repeatedly in multiply antibiotic resistant isolates from these clones.

  18. Rhinovirus genome variation during chronic upper and lower respiratory tract infections.

    Directory of Open Access Journals (Sweden)

    Caroline Tapparel

    Full Text Available Routine screening of lung transplant recipients and hospital patients for respiratory virus infections allowed the identification of human rhinovirus (HRV) in the upper and lower respiratory tracts, including in immunocompromised hosts chronically infected with the same strain over weeks or months. Phylogenetic analysis of 144 HRV-positive samples showed no apparent correlation between a given viral genotype or species and its ability to invade the lower respiratory tract or lead to protracted infection. By contrast, protracted infections were found almost exclusively in immunocompromised patients, thus suggesting that host factors, in particular the immune response, rather than the virus genotype modulate disease outcome. Complete genome sequencing of five chronic cases to study rhinovirus genome adaptation showed that the calculated mutation frequency was in the range observed during acute human infections. Analysis of mutation hot spot regions between specimens collected at different times or in different body sites revealed that non-synonymous changes were mostly concentrated in the viral capsid genes VP1, VP2 and VP3, independent of the HRV type. In an immunosuppressed lung transplant recipient infected with the same HRV strain for more than two years, both classical and ultra-deep sequencing of samples collected at different time points in the upper and lower respiratory tracts showed that these virus populations were phylogenetically indistinguishable over the course of infection, except for the last month. Specific signatures were found in the last two lower respiratory tract populations, including changes in the 5'UTR polypyrimidine tract and the VP2 immunogenic site 2. These results highlight for the first time the ability of a given rhinovirus to evolve in the course of a natural infection in immunocompromised patients and complement data obtained from previous experimental inoculation studies in immunocompetent volunteers.

  19. Analysis of Natural Variation in Bermudagrass (Cynodon dactylon) Reveals Physiological Responses Underlying Drought Tolerance

    Science.gov (United States)

    Cheng, Zhangmin; Ye, Tiantian; Chan, Zhulong

    2012-01-01

    Bermudagrass (Cynodon dactylon) is a widely used warm-season turfgrass and one of the most drought tolerant species. Dissecting the natural variation in drought tolerance and physiological responses will provide a powerful basis and novel insight for plant breeding. In the present study, we evaluated the natural variation of drought tolerance among nine bermudagrass varieties by measuring physiological responses after drought stress treatment through withholding water. Three groups differing in drought tolerance were identified, including two tolerant, five moderately tolerant and two susceptible varieties. Under drought stress conditions, the drought sensitive variety (Yukon) showed relatively higher water loss, more severe cell membrane damage (EL), and more accumulation of hydrogen peroxide (H2O2) and malondialdehyde (MDA), while the drought tolerant variety (Tifgreen) exhibited significantly higher antioxidant enzyme activities. Further results indicated that drought induced cell injury in different varieties (Yukon, SR9554 and Tifgreen) exhibited linear correlation with leaf water content (LWC), H2O2 content, MDA content and antioxidant enzyme activities. Additionally, Tifgreen plants had significantly higher levels of osmolytes (proline level and soluble sugars) when compared with Yukon and SR9554 under drought stress conditions. Taken together, our results indicated that natural variation of drought stress tolerance in bermudagrass varieties might be largely related to the induced changes of water status, osmolyte accumulation and antioxidant defense system. PMID:23285294

  20. Analysis of natural variation in bermudagrass (Cynodon dactylon) reveals physiological responses underlying drought tolerance.

    Science.gov (United States)

    Shi, Haitao; Wang, Yanping; Cheng, Zhangmin; Ye, Tiantian; Chan, Zhulong

    2012-01-01

    Bermudagrass (Cynodon dactylon) is a widely used warm-season turfgrass and one of the most drought tolerant species. Dissecting the natural variation in drought tolerance and physiological responses will provide a powerful basis and novel insight for plant breeding. In the present study, we evaluated the natural variation of drought tolerance among nine bermudagrass varieties by measuring physiological responses after drought stress treatment through withholding water. Three groups differing in drought tolerance were identified, including two tolerant, five moderately tolerant and two susceptible varieties. Under drought stress conditions, the drought sensitive variety (Yukon) showed relatively higher water loss, more severe cell membrane damage (EL), and more accumulation of hydrogen peroxide (H₂O₂) and malondialdehyde (MDA), while the drought tolerant variety (Tifgreen) exhibited significantly higher antioxidant enzyme activities. Further results indicated that drought induced cell injury in different varieties (Yukon, SR9554 and Tifgreen) exhibited linear correlation with leaf water content (LWC), H₂O₂ content, MDA content and antioxidant enzyme activities. Additionally, Tifgreen plants had significantly higher levels of osmolytes (proline level and soluble sugars) when compared with Yukon and SR9554 under drought stress conditions. Taken together, our results indicated that natural variation of drought stress tolerance in bermudagrass varieties might be largely related to the induced changes of water status, osmolyte accumulation and antioxidant defense system.

  1. Sardinians genetic background explained by runs of homozygosity and genomic regions under positive selection.

    Directory of Open Access Journals (Sweden)

    Cornelia Di Gaetano

    Full Text Available The peculiar position of Sardinia in the Mediterranean Sea has rendered its population an interesting biogeographical isolate. The aim of this study was to investigate the genetic population structure, as well as to estimate Runs of Homozygosity (RoHs) and regions under positive selection, using about 1.2 million single nucleotide polymorphisms genotyped in 1077 Sardinian individuals. Using four different methods--fixation index, inflation factor, principal component analysis and ancestry estimation--we were able to highlight, as expected for a genetic isolate, the high internal homogeneity of the island. Sardinians showed a higher percentage of genome covered by RoHs>0.5 Mb (F(RoH%0.5)) when compared to peninsular Italians, with the only exception of the area surrounding Alghero. We furthermore identified 9 genomic regions showing signs of positive selection, and we re-captured many previously inferred signals. Other regions harbor novel candidate genes for positive selection, like TMEM252, or regions containing long non-coding RNA. With the present study we confirmed the high genetic homogeneity of Sardinia, which may be explained by shared ancestry combined with the action of evolutionary forces.
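
    The share of the genome covered by runs of homozygosity longer than 0.5 Mb can be sketched as a simple ratio. This is a minimal illustration, assuming the RoH segments have already been called, do not overlap, and are given as (start, end) base-pair pairs; the function name, the segment format and the genome length argument are hypothetical and not taken from the study.

```python
def f_roh(roh_segments_bp, genome_length_bp, min_length_bp=500_000):
    """Fraction of the genome covered by RoHs longer than min_length_bp.

    roh_segments_bp: iterable of (start, end) positions in base pairs,
    assumed non-overlapping (an assumption of this sketch).
    """
    if genome_length_bp <= 0:
        raise ValueError("genome_length_bp must be positive")
    covered = 0
    for start, end in roh_segments_bp:
        length = end - start
        if length > min_length_bp:   # keep only runs above the threshold
            covered += length
    return covered / genome_length_bp
```

    Multiplying the result by 100 gives the percentage form quoted in the abstract; comparing the value between groups only makes sense when the same threshold and genome length are used for both.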

  2. Variations in cardiovascular disease under-diagnosis in England: national cross-sectional spatial analysis

    Directory of Open Access Journals (Sweden)

    Walford Hannah

    2011-03-01

    Full Text Available Background: There is under-diagnosis of cardiovascular disease (CVD) in the English population, despite financial incentives to encourage general practices to register new cases. We compared the modelled (expected) and diagnosed (observed) prevalence of three cardiovascular conditions, coronary heart disease (CHD), hypertension and stroke, at local level, their geographical variation, and population and healthcare predictors which might influence diagnosis. Methods: Cross-sectional observational study in all English local authorities (351) and general practices (8,372) comparing model-based expected prevalence with diagnosed prevalence on practice disease registers. Spatial analyses were used to identify geographic clusters and variation in regression relationships. Results: A total of 9,682,176 patients were on practice CHD, stroke and transient ischaemic attack, and hypertension registers. There was wide spatial variation in observed:expected prevalence ratios for all three diseases, with less than five per cent of expected cases diagnosed in some areas. London and the surrounding area showed statistically significant discrepancies in observed:expected prevalence ratios, with observed prevalence much lower than the epidemiological models predicted. The addition of general practitioner supply as a variable yielded stronger regression results for all three conditions. Conclusions: Despite almost universal access to free primary healthcare, there may be significant and highly variable under-diagnosis of CVD across England, which can be partially explained by persistent inequity in GP supply. Disease management studies should consider the possible impact of under-diagnosis on population health outcomes. Compared to classical regression modelling, spatial analytic techniques can provide additional information on risk factors for under-diagnosis, and can suggest where healthcare resources may be most needed.
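
    The observed:expected prevalence ratio used in this abstract is the number of diagnosed cases on practice registers divided by the number of cases a prevalence model predicts for the same area. The sketch below assumes that reading; the function names are hypothetical and the example areas and counts in the usage note are invented purely for illustration.

```python
def observed_expected_ratio(observed_cases, expected_cases):
    """Ratio of diagnosed (observed) to model-predicted (expected) cases."""
    if expected_cases <= 0:
        raise ValueError("expected_cases must be positive")
    return observed_cases / expected_cases

def flag_possible_under_diagnosis(areas, threshold=1.0):
    """Return the names of areas whose ratio falls below the threshold.

    areas: mapping of area name -> (observed_cases, expected_cases).
    """
    return sorted(
        name
        for name, (observed, expected) in areas.items()
        if observed_expected_ratio(observed, expected) < threshold
    )
```

    For instance, `flag_possible_under_diagnosis({"Area A": (450, 1000), "Area B": (980, 950)})` would single out "Area A", where fewer than half of the modelled cases appear on registers. A ratio below 1 is only a signal: it can reflect missed diagnoses or an over-predicting model.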

  3. A variational principle for computing nonequilibrium fluxes and potentials in genome-scale biochemical networks.

    Science.gov (United States)

    Fleming, R M T; Maes, C M; Saunders, M A; Ye, Y; Palsson, B Ø

    2012-01-07

    We derive a convex optimization problem on a steady-state nonequilibrium network of biochemical reactions, with the property that energy conservation and the second law of thermodynamics both hold at the problem solution. This suggests a new variational principle for biochemical networks that can be implemented in a computationally tractable manner. We derive the Lagrange dual of the optimization problem and use strong duality to demonstrate that a biochemical analogue of Tellegen's theorem holds at optimality. Each optimal flux is dependent on a free parameter that we relate to an elementary kinetic parameter when mass action kinetics is assumed. Copyright © 2011 Elsevier Ltd. All rights reserved.

  4. Heritable Variation for Sex Ratio under Environmental Sex Determination in the Common Snapping Turtle (Chelydra Serpentina)

    Science.gov (United States)

    Janzen, F. J.

    1992-01-01

    The magnitude of quantitative genetic variation for primary sex ratio was measured in families extracted from a natural population of the common snapping turtle (Chelydra serpentina), which possesses temperature-dependent sex determination (TSD). Eggs were incubated at three temperatures that produced mixed sex ratios. This experimental design provided estimates of the heritability of sex ratio in multiple environments and a test of the hypothesis that genotype X environment (G X E) interactions may be maintaining genetic variation for sex ratio in this population of C. serpentina. Substantial quantitative genetic variation for primary sex ratio was detected in all experimental treatments. These results in conjunction with the occurrence of TSD in this species provide support for three critical assumptions of Fisher's theory for the microevolution of sex ratio. There were statistically significant effects of family and incubation temperature on sex ratio, but no significant interaction was observed. Estimates of the genetic correlations of sex ratio across environments were highly positive and essentially indistinguishable from +1. These latter two findings suggest that G X E interaction is not the mechanism maintaining genetic variation for sex ratio in this system. Finally, although substantial heritable variation exists for primary sex ratio of C. serpentina under constant temperatures, estimates of the effective heritability of primary sex ratio in nature are approximately an order of magnitude smaller. Small effective heritability and a long generation time in C. serpentina imply that evolution of sex ratios would be slow even in response to strong selection by, among other potential agents, any rapid and/or substantial shifts in local temperatures, including those produced by changes in the global climate. PMID:1592234

  5. Aboriginal Australian mitochondrial genome variation - an increased understanding of population antiquity and diversity

    Science.gov (United States)

    Nagle, Nano; van Oven, Mannis; Wilcox, Stephen; van Holst Pellekaan, Sheila; Tyler-Smith, Chris; Xue, Yali; Ballantyne, Kaye N.; Wilcox, Leah; Papac, Luka; Cooke, Karen; van Oorschot, Roland A. H.; McAllister, Peter; Williams, Lesley; Kayser, Manfred; Mitchell, R. John; Adhikarla, Syama; Adler, Christina J.; Balanovska, Elena; Balanovsky, Oleg; Bertranpetit, Jaume; Clarke, Andrew C.; Comas, David; Cooper, Alan; der Sarkissian, Clio S. I.; Dulik, Matthew C.; Gaieski, Jill B.; Ganeshprasad, Arunkumar; Haak, Wolfgang; Haber, Marc; Hobbs, Angela; Javed, Asif; Jin, Li; Kaplan, Matthew E.; Li, Shilin; Martínez-Cruz, Begoña; Matisoo-Smith, Elizabeth A.; Melé, Marta; Merchant, Nirav C.; Owings, Amanda C.; Parida, Laxmi; Pitchappan, Ramasamy; Platt, Daniel E.; Quintana-Murci, Lluis; Renfrew, Colin; Royyuru, Ajay K.; Santhakumari, Arun Varatharajan; Santos, Fabrício R.; Schurr, Theodore G.; Soodyall, Himla; Soria Hernanz, David F.; Swamikrishnan, Pandikumar; Vilar, Miguel G.; Wells, R. Spencer; Zalloua, Pierre A.; Ziegle, Janet S.

    2017-03-01

    Aboriginal Australians represent one of the oldest continuous cultures outside Africa, with evidence indicating that their ancestors arrived in the ancient landmass of Sahul (present-day New Guinea and Australia) ~55 thousand years ago. Genetic studies, though limited, have demonstrated both the uniqueness and antiquity of Aboriginal Australian genomes. We have further resolved known Aboriginal Australian mitochondrial haplogroups and discovered novel indigenous lineages by sequencing the mitogenomes of 127 contemporary Aboriginal Australians. In particular, the more common haplogroups observed in our dataset included M42a, M42c, S, P5 and P12, followed by rarer haplogroups M15, M16, N13, O, P3, P6 and P8. We propose some major phylogenetic rearrangements, such as in haplogroup P where we delinked P4a and P4b and redefined them as P4 (New Guinean) and P11 (Australian), respectively. Haplogroup P2b was identified as a novel clade potentially restricted to Torres Strait Islanders. Nearly all Aboriginal Australian mitochondrial haplogroups detected appear to be ancient, with no evidence of later introgression during the Holocene. Our findings greatly increase knowledge about the geographic distribution and phylogenetic structure of mitochondrial lineages that have survived in contemporary descendants of Australia’s first settlers.

  6. Genome-Wide DNA Methylation Analysis and Epigenetic Variations Associated with Congenital Aortic Valve Stenosis (AVS).

    Directory of Open Access Journals (Sweden)

    Uppala Radhakrishna

    Full Text Available Congenital heart defect (CHD) is the most common cause of death from congenital anomaly. Among several candidate epigenetic mechanisms, DNA methylation may play an important role in the etiology of CHDs. We conducted a genome-wide DNA methylation analysis using an Illumina Infinium 450k human methylation assay in a cohort of 24 newborns who had aortic valve stenosis (AVS), with gestational-age matched controls. The study identified significantly-altered CpG methylation at 59 sites in 52 genes in AVS subjects as compared to controls (either hypermethylated or demethylated). Gene Ontology analysis identified biological processes and functions for these genes including positive regulation of receptor-mediated endocytosis. Consistent with prior clinical data, the molecular function categories as determined using DAVID identified low-density lipoprotein receptor binding, lipoprotein receptor binding and identical protein binding to be over-represented in the AVS group. A significant epigenetic change in the APOA5 and PCSK9 genes known to be involved in AVS was also observed. A large number of CpG methylation sites individually demonstrated good to excellent diagnostic accuracy for the prediction of AVS status, thus raising the possibility of molecular screening markers for this disorder. Using epigenetic analysis we were able to identify genes significantly involved in the pathogenesis of AVS.

  7. Genome-Wide DNA Methylation Analysis and Epigenetic Variations Associated with Congenital Aortic Valve Stenosis (AVS).

    Science.gov (United States)

    Radhakrishna, Uppala; Albayrak, Samet; Alpay-Savasan, Zeynep; Zeb, Amna; Turkoglu, Onur; Sobolewski, Paul; Bahado-Singh, Ray O

    2016-01-01

    Congenital heart defect (CHD) is the most common cause of death from congenital anomaly. Among several candidate epigenetic mechanisms, DNA methylation may play an important role in the etiology of CHDs. We conducted a genome-wide DNA methylation analysis using an Illumina Infinium 450k human methylation assay in a cohort of 24 newborns who had aortic valve stenosis (AVS), with gestational-age matched controls. The study identified significantly-altered CpG methylation at 59 sites in 52 genes in AVS subjects as compared to controls (either hypermethylated or demethylated). Gene Ontology analysis identified biological processes and functions for these genes including positive regulation of receptor-mediated endocytosis. Consistent with prior clinical data, the molecular function categories as determined using DAVID identified low-density lipoprotein receptor binding, lipoprotein receptor binding and identical protein binding to be over-represented in the AVS group. A significant epigenetic change in the APOA5 and PCSK9 genes known to be involved in AVS was also observed. A large number of CpG methylation sites individually demonstrated good to excellent diagnostic accuracy for the prediction of AVS status, thus raising the possibility of molecular screening markers for this disorder. Using epigenetic analysis we were able to identify genes significantly involved in the pathogenesis of AVS.

  8. [Variation trends of natural vegetation net primary productivity in China under climate change scenario].

    Science.gov (United States)

    Zhao, Dong-sheng; Wu, Shao-hong; Yin, Yun-he

    2011-04-01

    Based on the widely used Lund-Potsdam-Jena Dynamic Global Vegetation Model (LPJ) for climate change study, and according to the features of the natural environment in China, the operation mechanism of the model was adjusted, and the parameters were modified. With the modified LPJ model and taking 1961-1990 as the baseline period, the responses of natural vegetation net primary productivity (NPP) in China to climate change in 1991-2080 were simulated under the Special Report on Emissions Scenarios (SRES) B2 scenario. In 1961-1990, the total NPP of natural vegetation in China was about 3.06 Pg C a⁻¹; in 1961-2080, the total NPP showed a fluctuating decreasing trend, with an accelerating rate of decrease. Under the condition of slight precipitation change, the increase of mean air temperature would have a definite adverse impact on the NPP. Spatially, the NPP decreased from the southeast coast to the northwest inland, and this pattern would vary little under climate change. In eastern China with higher NPP, especially in Northeast China, the east of North China, and the Loess Plateau, the NPP would mainly have a decreasing trend; while in western China with lower NPP, especially in the Tibetan Plateau and Tarim Basin, the NPP would increase. With more intensive climate change, such a variation trend of NPP would be more obvious.

  9. Insights into mechanisms of bacterial antigenic variation derived from the complete genome sequence of Anaplasma marginale.

    Science.gov (United States)

    Palmer, Guy H; Futse, James E; Knowles, Donald P; Brayton, Kelly A

    2006-10-01

    Persistence of Anaplasma spp. in the animal reservoir host is required for efficient tick-borne transmission of these pathogens to animals and humans. Using A. marginale infection of its natural reservoir host as a model, persistent infection has been shown to reflect sequential cycles in which antigenic variants emerge, replicate, and are controlled by the immune system. Variation in the immunodominant outer-membrane protein MSP2 is generated by a process of gene conversion, in which unique hypervariable region sequences (HVRs) located in pseudogenes are recombined into a single operon-linked msp2 expression site. Although organisms expressing whole HVRs derived from pseudogenes emerge early in infection, long-term persistent infection is dependent on the generation of complex mosaics in which segments from different HVRs recombine into the expression site. The resulting combinatorial diversity generates the number of variants both predicted and shown to emerge during persistence.

  10. Variation in genome-wide levels of meiotic recombination is established at the onset of prophase in mammalian males.

    Directory of Open Access Journals (Sweden)

    Brian Baier

    2014-01-01

    Full Text Available Segregation of chromosomes during the first meiotic division relies on crossovers established during prophase. Although crossovers are strictly regulated so that at least one occurs per chromosome, individual variation in crossover levels is not uncommon. In an analysis of different inbred strains of male mice, we identified among-strain variation in the number of foci for the crossover-associated protein MLH1. We report studies of strains with "low" (CAST/EiJ), "medium" (C3H/HeJ), and "high" (C57BL/6J) genome-wide MLH1 values to define factors responsible for this variation. We utilized immunofluorescence to analyze the number and distribution of proteins that function at different stages in the recombination pathway: RAD51 and DMC1, strand invasion proteins acting shortly after double-strand break (DSB) formation, MSH4, part of the complex stabilizing double Holliday junctions, and the Bloom helicase BLM, thought to have anti-crossover activity. For each protein, we identified strain-specific differences that mirrored the results for MLH1; i.e., CAST/EiJ mice had the lowest values, C3H/HeJ mice intermediate values, and C57BL/6J mice the highest values. This indicates that differences in the numbers of DSBs (as identified by RAD51 and DMC1) are translated into differences in the number of crossovers, suggesting that variation in crossover levels is established by the time of DSB formation. However, DSBs per se are unlikely to be the primary determinant, since allelic variation for the DSB-inducing locus Spo11 resulted in differences in the numbers of DSBs but not the number of MLH1 foci. Instead, chromatin conformation appears to be a more important contributor, since analysis of synaptonemal complex length and DNA loop size also identified consistent strain-specific differences; i.e., crossover frequency increased with synaptonemal complex length and was inversely related to chromatin loop size. This indicates a relationship between recombination

  11. Sources of variation in under-5 mortality across sub-Saharan Africa: a spatial analysis.

    Science.gov (United States)

    Burke, Marshall; Heft-Neal, Sam; Bendavid, Eran

    2016-12-01

    Detailed spatial understanding of levels and trends in under-5 mortality is needed to improve the targeting of interventions to the areas of highest need, and to understand the sources of variation in mortality. To improve this understanding, we analysed local-level information on child mortality across sub-Saharan Africa between 1980-2010. We used data from 82 Demographic and Health Surveys in 28 sub-Saharan African countries, including the location and timing of 3·24 million childbirths and 393 685 deaths, to develop high-resolution spatial maps of under-5 mortality in the 1980s, 1990s, and 2000s. These estimates were at a resolution of 0·1 degree latitude by 0·1 degree longitude (roughly 10 km × 10 km). We then analysed this spatial information to distinguish within-country versus between-country sources of variation in mortality, to examine the extent to which declines in mortality have been accompanied by convergence in the distribution of mortality, and to study localised drivers of mortality differences, including temperature, malaria burden, and conflict. In our sample of sub-Saharan African countries from the 1980s to the 2000s, within-country differences in under-5 mortality accounted for 74-78% of overall variation in under-5 mortality across space and over time. Mortality differed significantly across only 8-15% of country borders, supporting the role of local, rather than national, factors in driving mortality patterns. We found that by the end of the study period, 23% of the eligible children in the study countries continued to live in mortality hotspots: areas where, if current trends continue, the Sustainable Development Goals mortality targets will not be met. In multivariate analysis, within-country mortality levels at each pixel were significantly related to local temperature, malaria burden, and recent history of conflict. Our findings suggest that sub-national determinants explain a greater portion of under-5 mortality than do country

  12. Northeast African genomic variation shaped by the continuity of indigenous groups and Eurasian migrations.

    Directory of Open Access Journals (Sweden)

    Nina Hollfelder

    2017-08-01

    Full Text Available Northeast Africa has a long history of human habitation, with fossil-finds from the earliest anatomically modern humans, and housing ancient civilizations. The region is also the gate-way out of Africa, as well as a portal for migration into Africa from Eurasia via the Middle East and the Arabian Peninsula. We investigate the population history of northeast Africa by genotyping ~3.9 million SNPs in 221 individuals from 18 populations sampled in Sudan and South Sudan and combine this data with published genome-wide data from surrounding areas. We find a strong genetic divide between the populations from the northeastern parts of the region (Nubians, central Arab populations, and the Beja) and populations towards the west and south (Nilotes, Darfur and Kordofan populations). This differentiation is mainly caused by a large Eurasian ancestry component of the northeast populations likely driven by migration of Middle Eastern groups followed by admixture that affected the local populations in a north-to-south succession of events. Genetic evidence points to an early admixture event in the Nubians, concurrent with historical contact between North Sudanese and Arab groups. We estimate the admixture in current-day Sudanese Arab populations to about 700 years ago, coinciding with the fall of Dongola in 1315/1316 AD, a wave of admixture that reached the Darfurian/Kordofanian populations some 400-200 years ago. In contrast to the northeastern populations, the current-day Nilotic populations from the south of the region display little or no admixture from Eurasian groups indicating long-term isolation and population continuity in these areas of northeast Africa.

  13. Northeast African genomic variation shaped by the continuity of indigenous groups and Eurasian migrations.

    Science.gov (United States)

    Hollfelder, Nina; Schlebusch, Carina M; Günther, Torsten; Babiker, Hiba; Hassan, Hisham Y; Jakobsson, Mattias

    2017-08-01

    Northeast Africa has a long history of human habitation, with fossil-finds from the earliest anatomically modern humans, and housing ancient civilizations. The region is also the gate-way out of Africa, as well as a portal for migration into Africa from Eurasia via the Middle East and the Arabian Peninsula. We investigate the population history of northeast Africa by genotyping ~3.9 million SNPs in 221 individuals from 18 populations sampled in Sudan and South Sudan and combine this data with published genome-wide data from surrounding areas. We find a strong genetic divide between the populations from the northeastern parts of the region (Nubians, central Arab populations, and the Beja) and populations towards the west and south (Nilotes, Darfur and Kordofan populations). This differentiation is mainly caused by a large Eurasian ancestry component of the northeast populations likely driven by migration of Middle Eastern groups followed by admixture that affected the local populations in a north-to-south succession of events. Genetic evidence points to an early admixture event in the Nubians, concurrent with historical contact between North Sudanese and Arab groups. We estimate the admixture in current-day Sudanese Arab populations to about 700 years ago, coinciding with the fall of Dongola in 1315/1316 AD, a wave of admixture that reached the Darfurian/Kordofanian populations some 400-200 years ago. In contrast to the northeastern populations, the current-day Nilotic populations from the south of the region display little or no admixture from Eurasian groups indicating long-term isolation and population continuity in these areas of northeast Africa.

  14. Diurnal variation in soil respiration under different land uses on Taihang Mountain, North China

    Science.gov (United States)

    Liu, Xiuping; Zhang, Wanjun; Zhang, Bin; Yang, Qihong; Chang, Jianguo; Hou, Ke

    2016-01-01

    The aim of this paper is to evaluate the diurnal variation in soil respiration under different land use types on Taihang Mountain, North China, and to understand its response to environmental factors (e.g., soil temperature and moisture) and forest management. Diurnal variations in soil respiration from plantations (Robinia pseudoacacia, Punica granatum, and Ziziphus jujuba), naturally regenerated forests (Vitex negundo var. heterophylla), grasslands (Bothriochloa ischaemum), and farmlands (winter wheat/summer maize) were measured using an LI-8100 automated soil CO2 flux system from May 2012 to April 2013. The results indicated that land use type had a significant effect on the diurnal variation of soil respiration. The diurnal soil respiration from farmlands was highest, followed by Z. jujuba, R. pseudoacacia, and P. granatum; lower soil CO2 efflux was found from B. ischaemum and V. negundo var. heterophylla. The diurnal soil respiration across different land use types was significantly affected by soil temperature and moisture, and their interaction. Precipitation-stimulated soil respiration increased more in soil with low water content and less in soil with high water content. The lower diurnal soil respiration from naturally regenerated forests suggests that naturally regenerated vegetation is the optimal vegetation type for reducing global warming.

  15. Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology

    DEFF Research Database (Denmark)

    Rossin, Elizabeth J.; Hansen, Kasper Lage; Raychaudhuri, Soumya

    2011-01-01

    Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed...... in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein-protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more...... that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non...

  16. Identification of genomic regions associated with phenotypic variation between dog breeds using selection mapping

    DEFF Research Database (Denmark)

    Vaysse, Amaury; Ratnakumar, Abhirami; Derrien, Thomas

    2011-01-01

    The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. Here we perform a comprehensive analysis using multiple test statistics to identify regions under selection in 509 dogs from 46 diverse br...

  17. Genome-scale neurogenetics: methodology and meaning.

    Science.gov (United States)

    McCarroll, Steven A; Feng, Guoping; Hyman, Steven E

    2014-06-01

    Genetic analysis is currently offering glimpses into molecular mechanisms underlying such neuropsychiatric disorders as schizophrenia, bipolar disorder and autism. After years of frustration, success in identifying disease-associated DNA sequence variation has followed from new genomic technologies, new genome data resources, and global collaborations that could achieve the scale necessary to find the genes underlying highly polygenic disorders. Here we describe early results from genome-scale studies of large numbers of subjects and the emerging significance of these results for neurobiology.

  18. Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale

    DEFF Research Database (Denmark)

    Liu, Siyang; Huang, Shujia; Rao, Junhua

    2015-01-01

    ......) as well as large deletions. However, these approaches consistently display a substantial bias against the recovery of complex structural variants and novel sequence in individual genomes and do not provide interpretation information such as the annotation of ancestral state and formation mechanism. We...... present a novel approach implemented in a single software package, AsmVar, to discover, genotype and characterize different forms of structural variation and novel sequence from population-scale de novo genome assemblies up to nucleotide resolution. Application of AsmVar to several human de novo genome...... assemblies captures a wide spectrum of structural variants and novel sequences present in the human population in high sensitivity and specificity. Our method provides a direct solution for investigating structural variants and novel sequences from de novo genome assemblies, facilitating the construction...

  19. Genomic structural variation-mediated allelic suppression causes hybrid male sterility in rice.

    Science.gov (United States)

    Shen, Rongxin; Wang, Lan; Liu, Xupeng; Wu, Jiang; Jin, Weiwei; Zhao, Xiucai; Xie, Xianrong; Zhu, Qinlong; Tang, Huiwu; Li, Qing; Chen, Letian; Liu, Yao-Guang

    2017-11-03

    Hybrids between divergent populations commonly show hybrid sterility; this reproductive barrier hinders hybrid breeding of the japonica and indica rice (Oryza sativa L.) subspecies. Here we show that structural changes and copy number variation at the Sc locus confer japonica-indica hybrid male sterility. The japonica allele, Sc-j, contains a pollen-essential gene encoding a DUF1618-domain protein; the indica allele, Sc-i, contains two or three tandem-duplicated ~28-kb segments, each carrying an Sc-j-homolog with a distinct promoter. In Sc-j/Sc-i hybrids, the high expression of Sc-i in sporophytic cells causes suppression of Sc-j expression in pollen and selective abortion of Sc-j-pollen, leading to transmission ratio distortion. Knocking out one or two of the three Sc-i copies by CRISPR/Cas9 rescues Sc-j expression and male fertility. Our results reveal gene dosage-dependent allelic suppression as a mechanism of hybrid incompatibility, and provide an effective approach to overcome the reproductive barrier for hybrid breeding.

  20. Genomic Structural Variations Affecting Virulence During Clonal Expansion of Pseudomonas syringae pv. actinidiae Biovar 3 in Europe.

    Science.gov (United States)

    Firrao, Giuseppe; Torelli, Emanuela; Polano, Cesare; Ferrante, Patrizia; Ferrini, Francesca; Martini, Marta; Marcelletti, Simone; Scortichini, Marco; Ermacora, Paolo

    2018-01-01

    Pseudomonas syringae pv. actinidiae (Psa) biovar 3 has caused pandemic bacterial canker of Actinidia chinensis and Actinidia deliciosa since 2008. In Europe, the disease spread rapidly in the kiwifruit cultivation areas from a single introduction. In this study, we investigated the genomic diversity of Psa biovar 3 strains during the primary clonal expansion in Europe using single molecule real-time (SMRT), Illumina and Sanger sequencing technologies. We recorded evidence of frequent mobilization and loss of transposon Tn6212, large chromosome inversions, and ectopic integration of IS sequences (remarkably ISPsy31, ISPsy36, and ISPsy37). While no phenotype change associated with Tn6212 mobilization could be detected, strains CRAFRU 12.29 and CRAFRU 12.50 did not elicit the hypersensitivity response (HR) on tobacco and eggplant leaves and were limited in their growth in kiwifruit leaves due to insertion of ISPsy31 and ISPsy36 in the hrpS and hrpR genes, respectively, interrupting the hrp cluster. Both strains had been isolated from symptomatic plants, suggesting coexistence of variant strains with reduced virulence together with virulent strains in mixed populations. The structural differences caused by rearrangements of self-genetic elements within European and New Zealand strains were comparable in number and type to those occurring among the European strains, in contrast with the significant difference in terms of nucleotide polymorphisms. We hypothesize a relaxation, during clonal expansion, of the selection limiting the accumulation of deleterious mutations associated with genome structural variation due to transposition of mobile elements. This consideration may be relevant when evaluating strategies to be adopted for epidemics management.

  1. Genomic and Phenotypic Variation in Morphogenetic Networks of Two Candida albicans Isolates Subtends Their Different Pathogenic Potential

    Directory of Open Access Journals (Sweden)

    Duccio Cavalieri

    2018-01-01

    Full Text Available The transition from commensalism to pathogenicity of Candida albicans reflects both the host inability to mount specific immune responses and the microorganism’s dimorphic switch efficiency. In this study, we used whole genome sequencing and microarray analysis to investigate the genomic determinants of the phenotypic changes observed in two C. albicans clinical isolates (YL1 and YQ2). In vitro experiments employing epithelial, microglial, and peripheral blood mononuclear cells were thus used to evaluate C. albicans isolates interaction with first line host defenses, measuring adhesion, susceptibility to phagocytosis, and induction of secretory responses. Moreover, a murine model of peritoneal infection was used to compare the in vivo pathogenic potential of the two isolates. Genome sequence and gene expression analysis of C. albicans YL1 and YQ2 showed significant changes in cellular pathways involved in environmental stress response, adhesion, filamentous growth, invasiveness, and dimorphic transition. This was in accordance with the observed marked phenotypic differences in biofilm production, dimorphic switch efficiency, cell adhesion, invasion, and survival to phagocyte-mediated host defenses. The mutations in key regulators of the hyphal growth pathway in the more virulent strain corresponded to an overall greater number of budding yeast cells released. Compared to YQ2, YL1 consistently showed enhanced pathogenic potential, since in vitro, it was less susceptible to ingestion by phagocytic cells and more efficient in invading epithelial cells, while in vivo YL1 was more effective than YQ2 in recruiting inflammatory cells, eliciting IL-1β response and eluding phagocytic cells. Overall, these results indicate an unexpected isolate-specific variation in pathways important for host invasion and colonization, showing how the genetic background of C. albicans may greatly affect its behavior both in vitro and in vivo. Based on this approach, we

  2. Whole-genome patterns of linkage disequilibrium across flycatcher populations clarify the causes and consequences of fine-scale recombination rate variation in birds.

    Science.gov (United States)

    Kawakami, Takeshi; Mugal, Carina F; Suh, Alexander; Nater, Alexander; Burri, Reto; Smeds, Linnéa; Ellegren, Hans

    2017-08-01

    Recombination rate is heterogeneous across the genome of various species and so are genetic diversity and differentiation as a consequence of linked selection. However, we still lack a clear picture of the underlying mechanisms for regulating recombination. Here we estimated fine-scale population recombination rate based on the patterns of linkage disequilibrium across the genomes of multiple populations of two closely related flycatcher species (Ficedula albicollis and F. hypoleuca). This revealed an overall conservation of the recombination landscape between these species at the scale of 200 kb, but we also identified differences in the local rate of recombination despite their recent divergence (recombination rate in a lineage-specific manner, indicating differences in the extent of linked selection between species. We detected 400-3,085 recombination hotspots per population. Location of hotspots was conserved between species, but the intensity of hotspot activity varied between species. Recombination hotspots were primarily associated with CpG islands (CGIs), regardless of whether CGIs were at promoter regions or away from genes. Recombination hotspots were also associated with specific transposable elements (TEs), but this association appears indirect due to shared preferences of the transposition machinery and the recombination machinery for accessible open chromatin regions. Our results suggest that CGIs are a major determinant of the localization of recombination hotspots, and we propose that both the distribution of TEs and fine-scale variation in recombination rate may be associated with the evolution of the epigenetic landscape. © 2017 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.

  3. Alignment-free microbial phylogenomics under scenarios of sequence divergence, genome rearrangement and lateral genetic transfer.

    Science.gov (United States)

    Bernard, Guillaume; Chan, Cheong Xin; Ragan, Mark A

    2016-07-01

    Alignment-free (AF) approaches have recently been highlighted as alternatives to methods based on multiple sequence alignment in phylogenetic inference. However, the sensitivity of AF methods to genome-scale evolutionary scenarios is little known. Here, using simulated microbial genome data we systematically assess the sensitivity of nine AF methods to three important evolutionary scenarios: sequence divergence, lateral genetic transfer (LGT) and genome rearrangement. Among these, AF methods are most sensitive to the extent of sequence divergence, less sensitive to low and moderate frequencies of LGT, and most robust against genome rearrangement. We describe the application of AF methods to three well-studied empirical genome datasets, and introduce a new application of the jackknife to assess node support. Our results demonstrate that AF phylogenomics is computationally scalable to multi-genome data and can generate biologically meaningful phylogenies and insights into microbial evolution.
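To make the idea of an alignment-free comparison concrete, here is a minimal sketch that scores two sequences by the Jaccard distance between their k-mer sets. This is a generic illustration only: the abstract does not list the nine AF methods it evaluates, and the function names and the default `k=3` are assumptions chosen for the example.

```python
def kmers(seq, k):
    """Set of all overlapping substrings of length k in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard_distance(a, b, k=3):
    """1 - |shared k-mers| / |all k-mers|; 0 for identical k-mer content."""
    ka, kb = kmers(a, k), kmers(b, k)
    union = ka | kb
    if not union:
        raise ValueError("sequences are shorter than k")
    return 1.0 - len(ka & kb) / len(union)


print(jaccard_distance("ACGTACGT", "ACGTACGT"))  # identical sequences
print(jaccard_distance("ACGT", "ACGA"))          # one substitution
```

Because the score depends only on which k-mers are present, moving a block of sequence changes only the k-mers spanning the breakpoints, which is consistent with the robustness to rearrangement reported in the abstract.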

  4. Genetic control of environmental variation of two quantitative traits of Drosophila melanogaster revealed by whole-genome sequencing

    DEFF Research Database (Denmark)

    Sørensen, Peter; de los Campos, Gustavo; Morgante, Fabio

    2015-01-01

    and others more volatile performance. Understanding the mechanisms responsible for environmental variability not only informs medical questions but is relevant in evolution and in agricultural science. In this work fully sequenced inbred lines of Drosophila melanogaster were analyzed to study the nature...... of genetic control of environmental variance for two quantitative traits: starvation resistance (SR) and startle response (SL). The evidence for genetic control of environmental variance is compelling for both traits. Sequence information is incorporated in random regression models to study the underlying...... genetic signals, which are shown to be different in the two traits. Genomic variance in sexual dimorphism was found for SR but not for SL. Indeed, the proportion of variance captured by sequence information and the contribution to this variance from four chromosome segments differ between sexes in SR...

  5. The distribution and impact of common copy-number variation in the genome of the domesticated apple, Malus x domestica Borkh.

    Science.gov (United States)

    Boocock, James; Chagné, David; Merriman, Tony R; Black, Michael A

    2015-10-23

    Copy number variation (CNV) is a common feature of eukaryotic genomes, and a growing body of evidence suggests that genes affected by CNV are enriched in processes that are associated with environmental responses. Here we use next generation sequence (NGS) data to detect copy-number variable regions (CNVRs) within the Malus x domestica genome, as well as to examine their distribution and impact. CNVRs were detected using NGS data derived from 30 accessions of M. x domestica analyzed using the read-depth method, as implemented in the CNVrd2 software. To improve the reliability of our results, we developed a quality control and analysis procedure that involved checking for organelle DNA, not repeat masking, and the determination of CNVR identity using a permutation testing procedure. Overall, we identified 876 CNVRs, which spanned 3.5% of the apple genome. To verify that detected CNVRs were not artifacts, we analyzed the B-allele frequencies (BAFs) within a single nucleotide polymorphism (SNP) array dataset derived from a screening of 185 individual apple accessions and found the CNVRs were enriched for SNPs having aberrant BAFs (P apple scab. We present the first analysis and catalogue of CNVRs in the M. x domestica genome. The enrichment of the CNVRs with R gene models and their overlap with gene loci of agricultural significance draw attention to a form of unexplored genetic variation in apple. This research will underpin further investigation of the role that CNV plays within the apple genome.

  6. Genome-wide copy number variation study associates metabotropic glutamate receptor gene networks with attention deficit hyperactivity disorder

    Science.gov (United States)

    Elia, Josephine; Glessner, Joseph T; Wang, Kai; Takahashi, Nagahide; Shtir, Corina J; Hadley, Dexter; Sleiman, Patrick M A; Zhang, Haitao; Kim, Cecilia E; Robison, Reid; Lyon, Gholson J; Flory, James H; Bradfield, Jonathan P; Imielinski, Marcin; Hou, Cuiping; Frackelton, Edward C; Chiavacci, Rosetta M; Sakurai, Takeshi; Rabin, Cara; Middleton, Frank A; Thomas, Kelly A; Garris, Maria; Mentch, Frank; Freitag, Christine M; Steinhausen, Hans-Christoph; Todorov, Alexandre A; Reif, Andreas; Rothenberger, Aribert; Franke, Barbara; Mick, Eric O; Roeyers, Herbert; Buitelaar, Jan; Lesch, Klaus-Peter; Banaschewski, Tobias; Ebstein, Richard P; Mulas, Fernando; Oades, Robert D; Sergeant, Joseph; Sonuga-Barke, Edmund; Renner, Tobias J; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Meyer, Jobst; Pálmason, Haukur; Seitz, Christiane; Loo, Sandra K; Smalley, Susan L; Biederman, Joseph; Kent, Lindsey; Asherson, Philip; Anney, Richard J L; Gaynor, J William; Shaw, Philip; Devoto, Marcella; White, Peter S; Grant, Struan F A; Buxbaum, Joseph D; Rapoport, Judith L; Williams, Nigel M; Nelson, Stanley F; Faraone, Stephen V; Hakonarson, Hakon

    2014-01-01

    Attention deficit hyperactivity disorder (ADHD) is a common, heritable neuropsychiatric disorder of unknown etiology. We performed a whole-genome copy number variation (CNV) study on 1,013 cases with ADHD and 4,105 healthy children of European ancestry using 550,000 SNPs. We evaluated statistically significant findings in multiple independent cohorts, with a total of 2,493 cases with ADHD and 9,222 controls of European ancestry, using matched platforms. CNVs affecting metabotropic glutamate receptor genes were enriched across all cohorts (P = 2.1 × 10^-9). We saw GRM5 (encoding glutamate receptor, metabotropic 5) deletions in ten cases and one control (P = 1.36 × 10^-6). We saw GRM7 deletions in six cases, and we saw GRM8 deletions in eight cases and no controls. GRM1 was duplicated in eight cases. We experimentally validated the observed variants using quantitative RT-PCR. A gene network analysis showed that genes interacting with the genes in the GRM family are enriched for CNVs in ~10% of the cases (P = 4.38 × 10^-10) after correction for occurrence in the controls. We identified rare recurrent CNVs affecting glutamatergic neurotransmission genes that were overrepresented in multiple ADHD cohorts. PMID:22138692

  7. An integrated linkage map reveals candidate genes underlying adaptive variation in Chinook salmon (Oncorhynchus tshawytscha)

    DEFF Research Database (Denmark)

    Mckinney, G. J.; Seeb, L. W.; Larson, W. A.

    2016-01-01

    Salmonids are an important cultural and ecological resource exhibiting near worldwide distribution between their native and introduced range. Previous research has generated linkage maps and genomic resources for several species as well as genome assemblies for two species. We first leveraged...

  8. [Spatiotemporal variations of natural wetland CH4 emissions over China under future climate change].

    Science.gov (United States)

    Liu, Jian-gong; Zhu, Qiu-an; Shen, Yan; Yang, Yan-zheng; Luo, Yun-peng; Peng, Chang-hui

    2015-11-01

    Based on a new process-based model, TRIPLEX-GHG, this paper analyzed the spatio-temporal variations of natural wetland CH4 emissions over China under different future climate change scenarios. With natural wetland distributions held fixed, CH4 emissions from natural wetland ecosystems over China would increase by 32.0%, 55.3% and 90.8% by the end of the 21st century under three representative concentration pathway (RCP) scenarios, RCP2.6, RCP4.5 and RCP8.5, respectively, compared with the current level. Southern China would have higher CH4 emissions than central and northern China. In addition, emission fluxes would be relatively low in western China and relatively high in eastern China. Spatially, the areas with relatively high CH4 emission fluxes would be concentrated in the middle-lower reaches of the Yangtze River, the Northeast and the coasts of the Pearl River. In the future, most natural wetlands would emit more CH4 under RCP4.5 and RCP8.5 than in 2005. However, under the RCP2.6 scenario, the increasing trend would be curbed and CH4 emissions (especially from the Qinghai-Tibet Plateau) would begin to decrease in the late 21st century.

  9. DNS of transcritical turbulent boundary layers at supercritical pressures under abrupt variations in thermodynamic properties

    Science.gov (United States)

    Kawai, Soshi

    2014-11-01

    In this talk, we first propose a robust, high-order accurate numerical strategy for simulating transcritical flows at supercritical pressures under abrupt variations in thermodynamic properties due to real-fluid effects. The method is based on introducing artificial density diffusion in a physically consistent manner in order to capture the steep variation of thermodynamic properties in transcritical conditions robustly, while solving a pressure evolution equation to achieve pressure equilibrium at the transcritical interfaces. We then discuss the direct numerical simulation (DNS) of transcritical heated turbulent boundary layers on a zero-pressure-gradient flat plate at supercritical pressures. To the best of my knowledge, the present DNS is the first DNS of a zero-pressure-gradient flat-plate transcritical turbulent boundary layer. The turbulent kinetic energy budget indicates that the compressibility effects (especially the pressure-dilatation correlation) are not negligible at the transcritical conditions even if the flow is subsonic. The unique interactions between the real-fluid effects and wall turbulence, and their turbulence statistics, which have never been seen in ideal-fluid turbulent boundary layers, are also discussed. This work was supported in part by Japan Society for the Promotion of Science (JSPS) Grant-in-Aid for Young Scientists (A) KAKENHI 26709066 and the JAXA International Top Young Fellowship Program.

  10. Study on the Variation of Groundwater Level under Time-varying Recharge

    Science.gov (United States)

    Wu, Ming-Chang; Hsieh, Ping-Cheng

    2017-04-01

    Slopes in suburban areas have become important targets of soil and water conservation work in recent years. The water table inside an aquifer is affected by rainfall, geology and topography, which change the groundwater discharge and water level. Currently, water table information is obtained from observation wells; however, because the equipment and well excavation are expensive, we instead develop a mathematical model to simulate the groundwater level variation. In this study, we discuss the groundwater level change in a sloping unconfined aquifer with an impermeable bottom under time-varying rainfall events. Referring to Child (1971), we employ the Boussinesq equation as the governing equation and, after linearizing it, apply the General Integral Transforms Method (GITM) to analyze the groundwater level. Our solution compares satisfactorily with those of Verhoest & Troch (2000) and Bansal & Das (2010). To sum up, we have presented an alternative approach to solving the linearized Boussinesq equation for the response of groundwater level in a sloping unconfined aquifer. The present analytical results combine the effects of bottom slope and time-varying recharge pattern on the water table fluctuations. Because measuring the groundwater level directly is limited and difficult, the model lets us predict or simulate in advance the variation of groundwater level caused by any rainfall event.
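For orientation, one commonly used form of the Boussinesq equation for a sloping unconfined aquifer, and its linearization, can be written as below. This is a sketch under stated assumptions, not the authors' exact formulation: the symbols (h for water table height above the impermeable bed, K for hydraulic conductivity, f for drainable porosity, θ for the bed slope angle, R(t) for the recharge rate, D for a constant representative saturated thickness) are chosen here for illustration, and the sign of the slope term depends on the direction in which x is measured.

```latex
% Nonlinear Boussinesq equation on a sloping bed (x measured along the bed)
\frac{\partial h}{\partial t}
  = \frac{K}{f}\left[\cos\theta\,\frac{\partial}{\partial x}\!\left(h\,\frac{\partial h}{\partial x}\right)
  + \sin\theta\,\frac{\partial h}{\partial x}\right] + \frac{R(t)}{f}

% Linearization: replace h by a constant thickness D in the diffusive term
\frac{\partial h}{\partial t}
  = \frac{K D \cos\theta}{f}\,\frac{\partial^{2} h}{\partial x^{2}}
  + \frac{K \sin\theta}{f}\,\frac{\partial h}{\partial x} + \frac{R(t)}{f}
```

The linearized equation is an advection-diffusion equation with a time-varying source, which is the type of problem to which integral transform methods apply.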

  11. Identification of balanced chromosomal rearrangements previously unknown among participants in the 1000 Genomes Project: implications for interpretation of structural variation in genomes and the future of clinical cytogenetics.

    Science.gov (United States)

    Dong, Zirui; Wang, Huilin; Chen, Haixiao; Jiang, Hui; Yuan, Jianying; Yang, Zhenjun; Wang, Wen-Jing; Xu, Fengping; Guo, Xiaosen; Cao, Ye; Zhu, Zhenzhen; Geng, Chunyu; Cheung, Wan Chee; Kwok, Yvonne K; Yang, Huanming; Leung, Tak Yeung; Morton, Cynthia C; Cheung, Sau Wai; Choy, Kwong Wai

    2017-11-02

    Purpose: Recent studies demonstrate that whole-genome sequencing enables detection of cryptic rearrangements in apparently balanced chromosomal rearrangements (also known as balanced chromosomal abnormalities, BCAs) previously identified by conventional cytogenetic methods. We aimed to assess our analytical tool for detecting BCAs in the 1000 Genomes Project without knowing which bands were affected. Methods: The 1000 Genomes Project provides an unprecedented integrated map of structural variants in phenotypically normal subjects, but there is no information on potential inclusion of subjects with apparent BCAs akin to those traditionally detected in diagnostic cytogenetics laboratories. We applied our analytical tool to 1,166 genomes from the 1000 Genomes Project with sufficient physical coverage (8.25-fold). Results: With this approach, we detected four reciprocal balanced translocations and four inversions, ranging in size from 57.9 kb to 13.3 Mb, all of which were confirmed by cytogenetic methods and polymerase chain reaction studies. One of these DNAs has a subtle translocation that is not readily identified by chromosome analysis because of the similarity of the banding patterns and size of exchanged segments, and another results in disruption of all transcripts of an OMIM gene. Conclusion: Our study demonstrates the extension of utilizing low-pass whole-genome sequencing for unbiased detection of BCAs including translocations and inversions previously unknown in the 1000 Genomes Project. GENETICS in MEDICINE advance online publication, 2 November 2017; doi:10.1038/gim.2017.170.

  12. Asymptotic properties of Pearson's rank-variate correlation coefficient under contaminated Gaussian model.

    Science.gov (United States)

    Ma, Rubao; Xu, Weichao; Zhang, Yun; Ye, Zhongfu

    2014-01-01

    This paper investigates the robustness properties of Pearson's rank-variate correlation coefficient (PRVCC) in scenarios where one channel is corrupted by impulsive noise and the other is free of impulsive noise. As shown in our previous work, these scenarios, which are frequently encountered in radar and/or sonar, can be well emulated by a particular bivariate contaminated Gaussian model (CGM). Under this CGM, we establish the asymptotic closed forms of the expectation and variance of PRVCC by means of the well-known Delta method. To gain a deeper understanding, we also compare PRVCC with two other classical correlation coefficients, i.e., Spearman's rho (SR) and Kendall's tau (KT), in terms of the root mean squared error (RMSE). Monte Carlo simulations not only verify our theoretical findings, but also reveal the advantage of PRVCC through an example of estimating the time delay in this particular impulsive noise environment.
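The comparison described above can be explored with a small simulation. The sketch below implements Spearman's rho and Kendall's tau from their textbook definitions and, as a labeled assumption, takes PRVCC to be the Pearson correlation between the raw values of the noise-free channel and the ranks of the noisy channel; the abstract itself does not spell out the definition. The `contaminated_pair` generator is a simplified, hypothetical stand-in for the paper's contaminated Gaussian model, and all parameter values are illustrative.

```python
import random


def ranks(values):
    """Ranks 1..n; assumes no ties (true almost surely for continuous data)."""
    order = sorted(range(len(values)), key=values.__getitem__)
    r = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        r[idx] = rank
    return r


def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def spearman_rho(x, y):
    return pearson(ranks(x), ranks(y))


def kendall_tau(x, y):
    """Tau-a: (concordant - discordant) pairs divided by n(n-1)/2."""
    n = len(x)
    s = 0
    for i in range(n):
        for j in range(i + 1, n):
            prod = (x[i] - x[j]) * (y[i] - y[j])
            s += (prod > 0) - (prod < 0)
    return 2 * s / (n * (n - 1))


def prvcc(clean, noisy):
    # Assumption: raw values of the noise-free channel vs. ranks of the noisy one.
    return pearson(clean, ranks(noisy))


def contaminated_pair(rho, eps, scale, rng):
    """Correlated Gaussian pair; with probability eps the second channel gets an impulse."""
    x = rng.gauss(0, 1)
    y = rho * x + (1 - rho ** 2) ** 0.5 * rng.gauss(0, 1)
    if rng.random() < eps:
        y += rng.gauss(0, scale)  # hypothetical impulsive-noise component
    return x, y


rng = random.Random(0)
x, y = zip(*(contaminated_pair(0.8, 0.05, 50.0, rng) for _ in range(500)))
print(pearson(x, y), prvcc(x, y), spearman_rho(x, y), kendall_tau(x, y))
```

Repeating the last three lines over many seeds and comparing each estimate's spread would give a rough Monte Carlo analogue of the RMSE comparison mentioned in the abstract.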

  13. Estimated allele substitution effects underlying genomic evaluation models depend on the scaling of allele counts

    NARCIS (Netherlands)

    Bouwman, Aniek C.; Hayes, Ben J.; Calus, Mario P.L.

    2017-01-01

    Background: Genomic evaluation is used to predict direct genomic values (DGV) for selection candidates in breeding programs, but also to estimate allele substitution effects (ASE) of single nucleotide polymorphisms (SNPs). Scaling of allele counts influences the estimated ASE, because scaling of

  14. Novel genetic loci underlying human intracranial volume identified through genome-wide association

    Science.gov (United States)

    Adams, Hieab HH; Hibar, Derrek P; Chouraki, Vincent; Stein, Jason L; Nyquist, Paul A; Rentería, Miguel E; Trompet, Stella; Arias-Vasquez, Alejandro; Seshadri, Sudha; Desrivières, Sylvane; Beecham, Ashley H; Jahanshad, Neda; Wittfeld, Katharina; Van der Lee, Sven J; Abramovic, Lucija; Alhusaini, Saud; Amin, Najaf; Andersson, Micael; Arfanakis, Konstantinos; Aribisala, Benjamin S; Armstrong, Nicola J; Athanasiu, Lavinia; Axelsson, Tomas; Beiser, Alexa; Bernard, Manon; Bis, Joshua C; Blanken, Laura ME; Blanton, Susan H; Bohlken, Marc M; Boks, Marco P; Bralten, Janita; Brickman, Adam M; Carmichael, Owen; Chakravarty, M Mallar; Chauhan, Ganesh; Chen, Qiang; Ching, Christopher RK; Cuellar-Partida, Gabriel; Den Braber, Anouk; Doan, Nhat Trung; Ehrlich, Stefan; Filippi, Irina; Ge, Tian; Giddaluru, Sudheer; Goldman, Aaron L; Gottesman, Rebecca F; Greven, Corina U; Grimm, Oliver; Griswold, Michael E; Guadalupe, Tulio; Hass, Johanna; Haukvik, Unn K; Hilal, Saima; Hofer, Edith; Hoehn, David; Holmes, Avram J; Hoogman, Martine; Janowitz, Deborah; Jia, Tianye; Kasperaviciute, Dalia; Kim, Sungeun; Klein, Marieke; Kraemer, Bernd; Lee, Phil H; Liao, Jiemin; Liewald, David CM; Lopez, Lorna M; Luciano, Michelle; Macare, Christine; Marquand, Andre; Matarin, Mar; Mather, Karen A; Mattheisen, Manuel; Mazoyer, Bernard; McKay, David R; McWhirter, Rebekah; Milaneschi, Yuri; Mirza-Schreiber, Nazanin; Muetzel, Ryan L; Maniega, Susana Muñoz; Nho, Kwangsik; Nugent, Allison C; Olde Loohuis, Loes M; Oosterlaan, Jaap; Papmeyer, Martina; Pappa, Irene; Pirpamer, Lukas; Pudas, Sara; Pütz, Benno; Rajan, Kumar B; Ramasamy, Adaikalavan; Richards, Jennifer S; Risacher, Shannon L; Roiz-Santiañez, Roberto; Rommelse, Nanda; Rose, Emma J; Royle, Natalie A; Rundek, Tatjana; Sämann, Philipp G; Satizabal, Claudia L; Schmaal, Lianne; Schork, Andrew J; Shen, Li; Shin, Jean; Shumskaya, Elena; Smith, Albert V; Sprooten, Emma; Strike, Lachlan T; Teumer, Alexander; Thomson, Russell; Tordesillas-Gutierrez, Diana; 
Toro, Roberto; Trabzuni, Daniah; Vaidya, Dhananjay; Van der Grond, Jeroen; Van der Meer, Dennis; Van Donkelaar, Marjolein MJ; Van Eijk, Kristel R; Van Erp, Theo GM; Van Rooij, Daan; Walton, Esther; Westlye, Lars T; Whelan, Christopher D; Windham, Beverly G; Winkler, Anderson M; Woldehawariat, Girma; Wolf, Christiane; Wolfers, Thomas; Xu, Bing; Yanek, Lisa R; Yang, Jingyun; Zijdenbos, Alex; Zwiers, Marcel P; Agartz, Ingrid; Aggarwal, Neelum T; Almasy, Laura; Ames, David; Amouyel, Philippe; Andreassen, Ole A; Arepalli, Sampath; Assareh, Amelia A; Barral, Sandra; Bastin, Mark E; Becker, Diane M; Becker, James T; Bennett, David A; Blangero, John; van Bokhoven, Hans; Boomsma, Dorret I; Brodaty, Henry; Brouwer, Rachel M; Brunner, Han G; Buckner, Randy L; Buitelaar, Jan K; Bulayeva, Kazima B; Cahn, Wiepke; Calhoun, Vince D; Cannon, Dara M; Cavalleri, Gianpiero L; Chen, Christopher; Cheng, Ching-Yu; Cichon, Sven; Cookson, Mark R; Corvin, Aiden; Crespo-Facorro, Benedicto; Curran, Joanne E; Czisch, Michael; Dale, Anders M; Davies, Gareth E; De Geus, Eco JC; De Jager, Philip L; de Zubicaray, Greig I; Delanty, Norman; Depondt, Chantal; DeStefano, Anita L; Dillman, Allissa; Djurovic, Srdjan; Donohoe, Gary; Drevets, Wayne C; Duggirala, Ravi; Dyer, Thomas D; Erk, Susanne; Espeseth, Thomas; Evans, Denis A; Fedko, Iryna O; Fernández, Guillén; Ferrucci, Luigi; Fisher, Simon E; Fleischman, Debra A; Ford, Ian; Foroud, Tatiana M; Fox, Peter T; Francks, Clyde; Fukunaga, Masaki; Gibbs, J Raphael; Glahn, David C; Gollub, Randy L; Göring, Harald HH; Grabe, Hans J; Green, Robert C; Gruber, Oliver; Gudnason, Vilmundur; Guelfi, Sebastian; Hansell, Narelle K; Hardy, John; Hartman, Catharina A; Hashimoto, Ryota; Hegenscheid, Katrin; Heinz, Andreas; Le Hellard, Stephanie; Hernandez, Dena G; Heslenfeld, Dirk J; Ho, Beng-Choon; Hoekstra, Pieter J; Hoffmann, Wolfgang; Hofman, Albert; Holsboer, Florian; Homuth, Georg; Hosten, Norbert; Hottenga, Jouke-Jan; Hulshoff Pol, Hilleke E; Ikeda, Masashi; 
Ikram, M Kamran; Jack, Clifford R; Jenkinson, Mark; Johnson, Robert; Jönsson, Erik G; Jukema, J Wouter; Kahn, René S; Kanai, Ryota; Kloszewska, Iwona; Knopman, David S; Kochunov, Peter; Kwok, John B; Lawrie, Stephen M; Lemaître, Hervé; Liu, Xinmin; Longo, Dan L; Longstreth, WT; Lopez, Oscar L; Lovestone, Simon; Martinez, Oliver; Martinot, Jean-Luc; Mattay, Venkata S; McDonald, Colm; McIntosh, Andrew M; McMahon, Katie L; McMahon, Francis J; Mecocci, Patrizia; Melle, Ingrid; Meyer-Lindenberg, Andreas; Mohnke, Sebastian; Montgomery, Grant W; Morris, Derek W; Mosley, Thomas H; Mühleisen, Thomas W; Müller-Myhsok, Bertram; Nalls, Michael A; Nauck, Matthias; Nichols, Thomas E; Niessen, Wiro J; Nöthen, Markus M; Nyberg, Lars; Ohi, Kazutaka; Olvera, Rene L; Ophoff, Roel A; Pandolfo, Massimo; Paus, Tomas; Pausova, Zdenka; Penninx, Brenda WJH; Pike, G Bruce; Potkin, Steven G; Psaty, Bruce M; Reppermund, Simone; Rietschel, Marcella; Roffman, Joshua L; Romanczuk-Seiferth, Nina; Rotter, Jerome I; Ryten, Mina; Sacco, Ralph L; Sachdev, Perminder S; Saykin, Andrew J; Schmidt, Reinhold; Schofield, Peter R; Sigurdsson, Sigurdur; Simmons, Andy; Singleton, Andrew; Sisodiya, Sanjay M; Smith, Colin; Smoller, Jordan W; Soininen, Hilkka; Srikanth, Velandai; Steen, Vidar M; Stott, David J; Sussmann, Jessika E; Thalamuthu, Anbupalam; Tiemeier, Henning; Toga, Arthur W; Traynor, Bryan J; Troncoso, Juan; Turner, Jessica A; Tzourio, Christophe; Uitterlinden, Andre G; Valdés Hernández, Maria C; Van der Brug, Marcel; Van der Lugt, Aad; Van der Wee, Nic JA; Van Duijn, Cornelia M; Van Haren, Neeltje EM; Van 't Ent, Dennis; Van Tol, Marie-Jose; Vardarajan, Badri N; Veltman, Dick J; Vernooij, Meike W; Völzke, Henry; Walter, Henrik; Wardlaw, Joanna M; Wassink, Thomas H; Weale, Michael E; Weinberger, Daniel R; Weiner, Michael W; Wen, Wei; Westman, Eric; White, Tonya; Wong, Tien Y; Wright, Clinton B; Zielke, H Ronald; Zonderman, Alan B; Deary, Ian J; DeCarli, Charles; Schmidt, Helena; Martin, Nicholas G; 
De Craen, Anton JM; Wright, Margaret J; Launer, Lenore J; Schumann, Gunter; Fornage, Myriam; Franke, Barbara; Debette, Stéphanie; Medland, Sarah E; Ikram, M Arfan; Thompson, Paul M

    2016-01-01

    Intracranial volume reflects the maximally attained brain size during development, and remains stable with loss of tissue in late life. It is highly heritable, but the underlying genes remain largely undetermined. In a genome-wide association study of 32,438 adults, we discovered five novel loci for intracranial volume and confirmed two known signals. Four of the loci are also associated with adult human stature, but these remained associated with intracranial volume after adjusting for height. We found a high genetic correlation with child head circumference (ρgenetic=0.748), which indicated a similar genetic background and allowed for the identification of four additional loci through meta-analysis (Ncombined = 37,345). Variants for intracranial volume were also related to childhood and adult cognitive function, Parkinson’s disease, and enriched near genes involved in growth pathways including PI3K–AKT signaling. These findings identify biological underpinnings of intracranial volume and provide genetic support for theories on brain reserve and brain overgrowth. PMID:27694991

  15. Uncovering the functional constraints underlying the genomic organization of the odorant-binding protein genes.

    Science.gov (United States)

    Librado, Pablo; Rozas, Julio

    2013-01-01

    Animal olfactory systems have a critical role for the survival and reproduction of individuals. In insects, the odorant-binding proteins (OBPs) are encoded by a moderately sized gene family, and mediate the first steps of the olfactory processing. Most OBPs are organized in clusters of a few paralogs, which are conserved over time. Currently, the biological mechanism explaining the close physical proximity among OBPs is not yet established. Here, we conducted a comprehensive study aiming to gain insights into the mechanisms underlying the OBP genomic organization. We found that the OBP clusters are embedded within large conserved arrangements. These organizations also include other non-OBP genes, which often encode proteins integral to plasma membrane. Moreover, the conservation degree of such large clusters is related to the following: 1) the promoter architecture of the confined genes, 2) a characteristic transcriptional environment, and 3) the chromatin conformation of the chromosomal region. Our results suggest that chromatin domains may restrict the location of OBP genes to regions having the appropriate transcriptional environment, leading to the OBP cluster structure. However, the appropriate transcriptional environment for OBP and the other neighbor genes is not dominated by reduced levels of expression noise. Indeed, the stochastic fluctuations in the OBP transcript abundance may have a critical role in the combinatorial nature of the olfactory coding process.

  16. Genome-wide association studies dissect the genetic networks underlying agronomical traits in soybean.

    Science.gov (United States)

    Fang, Chao; Ma, Yanming; Wu, Shiwen; Liu, Zhi; Wang, Zheng; Yang, Rui; Hu, Guanghui; Zhou, Zhengkui; Yu, Hong; Zhang, Min; Pan, Yi; Zhou, Guoan; Ren, Haixiang; Du, Weiguang; Yan, Hongrui; Wang, Yanping; Han, Dezhi; Shen, Yanting; Liu, Shulin; Liu, Tengfei; Zhang, Jixiang; Qin, Hao; Yuan, Jia; Yuan, Xiaohui; Kong, Fanjiang; Liu, Baohui; Li, Jiayang; Zhang, Zhiwu; Wang, Guodong; Zhu, Baoge; Tian, Zhixi

    2017-08-24

    Soybean (Glycine max [L.] Merr.) is one of the most important oil and protein crops. Ever-increasing soybean consumption necessitates the improvement of varieties for more efficient production. However, both correlations among different traits and genetic interactions among genes that affect a single trait pose a challenge to soybean breeding. To understand the genetic networks underlying phenotypic correlations, we collected 809 soybean accessions worldwide and phenotyped them for two years at three locations for 84 agronomic traits. Genome-wide association studies identified 245 significant genetic loci, among which 95 genetically interacted with other loci. We determined that 14 oil synthesis-related genes are responsible for fatty acid accumulation in soybean and function in line with an additive model. Network analyses demonstrated that 51 traits could be linked through the linkage disequilibrium of 115 associated loci and these links reflect phenotypic correlations. We revealed that 23 loci, including the known Dt1, E2, E1, Ln, Dt2, Fan, and Fap loci, as well as 16 undefined associated loci, have pleiotropic effects on different traits. This study provides insights into the genetic correlation among complex traits and will facilitate future soybean functional studies and breeding through molecular design.

  17. Genome-wide characterization of the WRKY gene family in radish (Raphanus sativus L.) reveals its critical functions under different abiotic stresses.

    Science.gov (United States)

    Karanja, Bernard Kinuthia; Fan, Lianxue; Xu, Liang; Wang, Yan; Zhu, Xianwen; Tang, Mingjia; Wang, Ronghua; Zhang, Fei; Muleke, Everlyne M'mbone; Liu, Liwang

    2017-11-01

    The radish WRKY gene family was identified genome-wide and found to play critical roles in response to multiple abiotic stresses. WRKY is among the largest transcription factor (TF) families and is associated with multiple biological activities for plant survival, including control of response mechanisms against abiotic stresses such as heat, salinity, and heavy metals. Radish is an important root vegetable crop, and therefore characterization and expression pattern investigation of WRKY transcription factors in radish is imperative. In the present study, 126 putative WRKY genes were retrieved from the radish genome database. Protein sequence and annotation scrutiny confirmed that RsWRKY proteins possessed highly conserved domains and a zinc finger motif. Based on phylogenetic analysis results, RsWRKY candidate genes were divided into three groups (Group I, II and III) comprising 31, 74, and 20 genes, respectively. Additionally, gene structure analysis revealed that intron-exon patterns of the WRKY genes are highly conserved in radish. Linkage map analysis indicated that RsWRKY genes were distributed with varying densities over nine linkage groups. Further, RT-qPCR analysis illustrated the significant variation of 36 RsWRKY genes under one or more abiotic stress treatments, implying that they might be stress-responsive genes. In total, 126 WRKY TFs were identified from the R. sativus genome, of which 35 showed abiotic stress-induced expression patterns. These results provide a genome-wide characterization of RsWRKY TFs and a baseline for further functional dissection and molecular evolution investigation, specifically for improving abiotic stress resistance with the ultimate goal of increasing the yield and quality of radish.

  18. Genomic prediction in contrast to a genome-wide association study in explaining heritable variation of complex growth traits in breeding populations of Eucalyptus.

    Science.gov (United States)

    Müller, Bárbara S F; Neves, Leandro G; de Almeida Filho, Janeo E; Resende, Márcio F R; Muñoz, Patricio R; Dos Santos, Paulo E T; Filho, Estefano Paludzyszyn; Kirst, Matias; Grattapaglia, Dario

    2017-07-11

    The advent of high-throughput genotyping technologies coupled to genomic prediction methods established a new paradigm to integrate genomics and breeding. We carried out whole-genome prediction and contrasted it to a genome-wide association study (GWAS) for growth traits in breeding populations of Eucalyptus benthamii (n =505) and Eucalyptus pellita (n =732). Both species are of increasing commercial interest for the development of germplasm adapted to environmental stresses. Predictive ability reached 0.16 in E. benthamii and 0.44 in E. pellita for diameter growth. Predictive abilities using either Genomic BLUP or different Bayesian methods were similar, suggesting that growth adequately fits the infinitesimal model. Genomic prediction models using ~5000-10,000 SNPs provided predictive abilities equivalent to using all 13,787 and 19,506 SNPs genotyped in the E. benthamii and E. pellita populations, respectively. No difference was detected in predictive ability when different sets of SNPs were utilized, based on position (equidistantly genome-wide, inside genes, linkage disequilibrium pruned or on single chromosomes), as long as the total number of SNPs used was above ~5000. Predictive abilities obtained by removing relatedness between training and validation sets fell near zero for E. benthamii and were halved for E. pellita. These results corroborate the current view that relatedness is the main driver of genomic prediction, although some short-range historical linkage disequilibrium (LD) was likely captured for E. pellita. A GWAS identified only one significant association for volume growth in E. pellita, illustrating the fact that while genome-wide regression is able to account for large proportions of the heritability, very little or none of it is captured into significant associations using GWAS in breeding populations of the size evaluated in this study. 
This study provides further experimental data supporting positive prospects of using genome-wide data to

  19. Evaluation of potential novel variations and their interactions related to bipolar disorders: analysis of genome-wide association study data.

    Science.gov (United States)

    Acikel, Cengizhan; Aydin Son, Yesim; Celik, Cemil; Gul, Husamettin

    2016-01-01

    Multifactor dimensionality reduction (MDR) is a nonparametric approach that can be used to detect relevant interactions between single-nucleotide polymorphisms (SNPs). The aim of this study was to build the best genomic model based on SNP associations and to identify candidate polymorphisms that are the underlying molecular basis of the bipolar disorders. This study was performed on Whole-Genome Association Study of Bipolar Disorder (dbGaP [database of Genotypes and Phenotypes] study accession number: phs000017.v3.p1) data. After preprocessing of the genotyping data, three classification-based data mining methods (ie, random forest, naïve Bayes, and k-nearest neighbor) were performed. Additionally, as a nonparametric, model-free approach, the MDR method was used to evaluate the SNP profiles. The validity of these methods was evaluated using true classification rate, recall (sensitivity), precision (positive predictive value), and F-measure. Random forests, naïve Bayes, and k-nearest neighbors identified 16, 13, and ten candidate SNPs, respectively. Surprisingly, the top six SNPs were reported by all three methods. Random forests and k-nearest neighbors were more successful than naïve Bayes, with recall values >0.95. On the other hand, MDR generated a model with comparable predictive performance based on five SNPs. Although different SNP profiles were identified in MDR compared to the classification-based models, all models mapped SNPs to the DOCK10 gene. Three classification-based data mining approaches, random forests, naïve Bayes, and k-nearest neighbors, have prioritized similar SNP profiles as predictors of bipolar disorders, in contrast to MDR, which has found different SNPs through analysis of two-way and three-way interactions. The reduced number of associated SNPs discovered by MDR, without loss in the classification performance, would facilitate validation studies and decision support models, and would reduce the cost to develop predictive and
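The evaluation measures named in this abstract (true classification rate, recall, precision, and F-measure) follow directly from the confusion-matrix counts. A minimal sketch, with a hypothetical function name and toy labels rather than any data from the study:

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Accuracy, recall, precision and F-measure for a binary labeling."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    accuracy = (tp + tn) / len(pairs)                 # true classification rate
    recall = tp / (tp + fn) if tp + fn else 0.0       # sensitivity
    precision = tp / (tp + fp) if tp + fp else 0.0    # positive predictive value
    f_measure = (2 * precision * recall / (precision + recall)
                 if precision + recall else 0.0)
    return {"accuracy": accuracy, "recall": recall,
            "precision": precision, "f_measure": f_measure}


# Toy labels (hypothetical): 1 = case, 0 = control.
metrics = classification_metrics([1, 1, 1, 0, 0], [1, 1, 0, 1, 0])
print(metrics)
```

Returning 0.0 when a denominator is zero is a convention adopted for this sketch; other tools may report such cases as undefined.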

  20. PSP: rapid identification of orthologous coding genes under positive selection across multiple closely related prokaryotic genomes.

    Science.gov (United States)

    Su, Fei; Ou, Hong-Yu; Tao, Fei; Tang, Hongzhi; Xu, Ping

    2013-12-27

    With genomic sequences of many closely related bacterial strains made available by deep sequencing, it is now possible to investigate trends in prokaryotic microevolution. Positive selection is a sub-process of microevolution, in which a particular mutation is favored, causing the allele frequency to continuously shift in one direction. Wide scanning of prokaryotic genomes has shown that positive selection at the molecular level is much more frequent than expected. Genes with significant positive selection may play key roles in bacterial adaptation to different environmental pressures. However, selection pressure analyses are computationally intensive and awkward to configure. Here we describe an open access web server, designated PSP (Positive Selection analysis for Prokaryotic genomes), for performing evolutionary analysis on orthologous coding genes, specially designed for rapid comparison of dozens of closely related prokaryotic genomes. Remarkably, PSP facilitates functional exploration at multiple levels by assignments and enrichments of KO, GO or COG terms. To illustrate this user-friendly tool, we analyzed Escherichia coli and Bacillus cereus genomes and found that several genes, which play key roles in human infection and antibiotic resistance, show significant evidence of positive selection. PSP is freely available to all users without any login requirement at: http://db-mml.sjtu.edu.cn/PSP/. PSP ultimately allows researchers to do genome-scale analysis for evolutionary selection across multiple prokaryotic genomes rapidly and easily, and identify the genes undergoing positive selection, which may play key roles in host-pathogen interactions and/or environmental adaptation.

  1. The variation of root exudates from the hyperaccumulator Sedum alfredii under cadmium stress: metabonomics analysis.

    Directory of Open Access Journals (Sweden)

    Qing Luo

    Hydroponic experiments were conducted to investigate the variation of root exudates from the hyperaccumulator Sedum alfredii under cadmium (Cd) stress. After pre-culture, S. alfredii was cultured for 4 days in nutrient solution spiked with CdCl2 at concentrations of 0, 5, 10, 40, and 400 µM Cd. The root exudates were collected and analyzed by GC-MS, and 62 compounds were identified. Orthogonal partial least-squares discriminant analysis (OPLS-DA) showed that there was a distinct difference among the root exudates from the different Cd treatments, and 20 compounds responsible for this difference were identified. Trends in the relative content of these 20 compounds under the different Cd treatments were analyzed. These results indicated that trehalose, erythritol, naphthalene, d-pinitol and n-octacosane might be closely related to Cd stabilization; phosphoric acid, tetradecanoic acid, oxalic acid, threonic acid and glycine could be attributed to Cd mobilization; and mannitol, oleic acid, 3-hydroxybutanoic acid, fructose, octacosanol and ribitol could help the plant cope well with Cd stress.

  2. Conserved Genetic Architecture Underlying Individual Recombination Rate Variation in a Wild Population of Soay Sheep (Ovis aries).

    Science.gov (United States)

    Johnston, Susan E; Bérénos, Camillo; Slate, Jon; Pemberton, Josephine M

    2016-05-01

    Meiotic recombination breaks down linkage disequilibrium (LD) and forms new haplotypes, meaning that it is an important driver of diversity in eukaryotic genomes. Understanding the causes of variation in recombination rate is important in interpreting and predicting evolutionary phenomena and in understanding the potential of a population to respond to selection. However, despite attention in model systems, there remains little data on how recombination rate varies at the individual level in natural populations. Here we used extensive pedigree and high-density SNP information in a wild population of Soay sheep (Ovis aries) to investigate the genetic architecture of individual autosomal recombination rates. Individual rates were high relative to other mammal systems and were higher in males than in females (autosomal map lengths of 3748 and 2860 cM, respectively). The heritability of autosomal recombination rate was low but significant in both sexes (h² = 0.16 and 0.12 in females and males, respectively). In females, 46.7% of the heritable variation was explained by a subtelomeric region on chromosome 6; a genome-wide association study showed the strongest associations at locus RNF212, with further associations observed at a nearby ∼374-kb region of complete LD containing three additional candidate loci, CPLX1, GAK, and PCGF3. A second region on chromosome 7 containing REC8 and RNF212B explained 26.2% of the heritable variation in recombination rate in both sexes. Comparative analyses with 40 other sheep breeds showed that haplotypes associated with recombination rates are both old and globally distributed. Both regions have been implicated in rate variation in mice, cattle, and humans, suggesting a common genetic architecture of recombination rate variation in mammals. Copyright © 2016 by the Genetics Society of America.

  3. Implementing genomics and pharmacogenomics in the clinic: The National Human Genome Research Institute's genomic medicine portfolio.

    Science.gov (United States)

    Manolio, Teri A

    2016-10-01

    Increasing knowledge about the influence of genetic variation on human health and growing availability of reliable, cost-effective genetic testing have spurred the implementation of genomic medicine in the clinic. As defined by the National Human Genome Research Institute (NHGRI), genomic medicine uses an individual's genetic information in his or her clinical care, and has begun to be applied effectively in areas such as cancer genomics, pharmacogenomics, and rare and undiagnosed diseases. In 2011 NHGRI published its strategic vision for the future of genomic research, including an ambitious research agenda to facilitate and promote the implementation of genomic medicine. To realize this agenda, NHGRI is consulting and facilitating collaborations with the external research community through a series of "Genomic Medicine Meetings," under the guidance and leadership of the National Advisory Council on Human Genome Research. These meetings have identified and begun to address significant obstacles to implementation, such as lack of evidence of efficacy, limited availability of genomics expertise and testing, lack of standards, and difficulties in integrating genomic results into electronic medical records. The six research and dissemination initiatives comprising NHGRI's genomic research portfolio are designed to speed the evaluation and incorporation, where appropriate, of genomic technologies and findings into routine clinical care. Actual adoption of successful approaches in clinical care will depend upon the willingness, interest, and energy of professional societies, practitioners, patients, and payers to promote their responsible use and share their experiences in doing so. Published by Elsevier Ireland Ltd.

  4. Self-Regulation Principles Underlying Risk Perception and Decision Making within the Context of Genomic Testing

    Science.gov (United States)

    Cameron, Linda D.; Biesecker, Barbara Bowles; Peters, Ellen; Taber, Jennifer M.; Klein, William M. P.

    2017-01-01

    Advances in theory and research on self-regulation and decision-making processes have yielded important insights into how cognitive, emotional, and social processes shape risk perceptions and risk-related decisions. We examine how self-regulation theory can be applied to inform our understanding of decision-making processes within the context of genomic testing, a clinical arena in which individuals face complex risk information and potentially life-altering decisions. After presenting key principles of self-regulation, we present a genomic testing case example to illustrate how principles related to risk representations, approach and avoidance motivations, emotion regulation, defensive responses, temporal construals, and capacities such as numeric abilities can shape decisions and psychological responses during the genomic testing process. We conclude with implications for using self-regulation theory to advance science within genomic testing and opportunities for how this research can inform further developments in self-regulation theory. PMID:29225669

  5. Self-Regulation Principles Underlying Risk Perception and Decision Making within the Context of Genomic Testing.

    Science.gov (United States)

    Cameron, Linda D; Biesecker, Barbara Bowles; Peters, Ellen; Taber, Jennifer M; Klein, William M P

    2017-05-01

    Advances in theory and research on self-regulation and decision-making processes have yielded important insights into how cognitive, emotional, and social processes shape risk perceptions and risk-related decisions. We examine how self-regulation theory can be applied to inform our understanding of decision-making processes within the context of genomic testing, a clinical arena in which individuals face complex risk information and potentially life-altering decisions. After presenting key principles of self-regulation, we present a genomic testing case example to illustrate how principles related to risk representations, approach and avoidance motivations, emotion regulation, defensive responses, temporal construals, and capacities such as numeric abilities can shape decisions and psychological responses during the genomic testing process. We conclude with implications for using self-regulation theory to advance science within genomic testing and opportunities for how this research can inform further developments in self-regulation theory.

  6. Genetic variation of temperature-regulated curd induction in cauliflower: elucidation of floral transition by genome-wide association mapping and gene expression analysis

    Science.gov (United States)

    Matschegewski, Claudia; Zetzsche, Holger; Hasan, Yaser; Leibeguth, Lena; Briggs, William; Ordon, Frank; Uptmoor, Ralf

    2015-01-01

    Cauliflower (Brassica oleracea var. botrytis) is a vernalization-responsive crop. High ambient temperatures delay harvest time. Elucidating the genetic regulation of floral transition is of great interest for precise harvest scheduling and for ensuring a stable market supply. This study aims at the genetic dissection of temperature-dependent curd induction in cauliflower by genome-wide association studies and gene expression analysis. To assess temperature-dependent curd induction, two greenhouse trials under distinct temperature regimes were conducted on a diversity panel consisting of 111 cauliflower commercial parent lines, genotyped with 14,385 SNPs. Broad phenotypic variation and high heritability (0.93) were observed for temperature-related curd induction within the cauliflower population. GWA mapping identified a total of 18 QTL localized on chromosomes O1, O2, O3, O4, O6, O8, and O9 for curding time under two distinct temperature regimes. Among those, several QTL are localized within regions of promising candidate flowering genes. Inference of population structure and genetic relatedness within the diversity set identified three main genetic clusters. Linkage disequilibrium (LD) patterns indicated a global LD extent of r² = 0.06 and a maximum physical distance of 400 kb for genetic linkage. Transcriptional profiling of the flowering genes FLOWERING LOCUS C (BoFLC) and VERNALIZATION 2 (BoVRN2) was performed, showing increased expression levels of BoVRN2 in genotypes with faster curding. However, the functional relevance of BoVRN2 and BoFLC2 could not be supported consistently, which suggests that these genes probably act facultatively and/or points to BoVRN2/BoFLC-independent mechanisms in temperature-regulated floral transition in cauliflower. Genetic insights into temperature-regulated curd induction can underpin genetically informed phenology models and benefit molecular breeding strategies toward the development of thermo-tolerant cultivars. PMID:26442034

  7. Transcriptome sequencing and whole genome expression profiling of chrysanthemum under dehydration stress

    Science.gov (United States)

    2013-01-01

    Background: Chrysanthemum is one of the most important ornamental crops in the world and drought stress seriously limits its production and distribution. In order to generate a functional genomics resource and obtain a deeper understanding of the molecular mechanisms regarding chrysanthemum responses to dehydration stress, we performed large-scale transcriptome sequencing of chrysanthemum plants under dehydration stress using the Illumina sequencing technology. Results: Two cDNA libraries constructed from mRNAs of control and dehydration-treated seedlings were sequenced by Illumina technology. A total of more than 100 million reads were generated and de novo assembled into 98,180 unique transcripts which were further extensively annotated by comparing their sequences to different protein databases. Biochemical pathways were predicted from these transcript sequences. Furthermore, we performed gene expression profiling analysis upon dehydration treatment in chrysanthemum and identified 8,558 dehydration-responsive unique transcripts, including 307 transcription factors and 229 protein kinases and many well-known stress responsive genes. Gene ontology (GO) term enrichment and biochemical pathway analyses showed that dehydration stress caused changes in hormone response, secondary and amino acid metabolism, and light and photoperiod response. These findings suggest that drought tolerance of chrysanthemum plants may be related to the regulation of hormone biosynthesis and signaling, reduction of oxidative damage, stabilization of cell proteins and structures, and maintenance of energy and carbon supply. Conclusions: Our transcriptome sequences can provide a valuable resource for chrysanthemum breeding and research and novel insights into chrysanthemum responses to dehydration stress and offer candidate genes or markers that can be used to guide future studies attempting to breed drought tolerant chrysanthemum cultivars. PMID:24074255

  8. Demographically-Based Evaluation of Genomic Regions under Selection in Domestic Dogs.

    Directory of Open Access Journals (Sweden)

    Adam H Freedman

    2016-03-01

    Controlling for background demographic effects is important for accurately identifying loci that have recently undergone positive selection. To date, the effects of demography have not yet been explicitly considered when identifying loci under selection during dog domestication. To investigate positive selection on the dog lineage early in domestication, we examined patterns of polymorphism in six canid genomes that were previously used to infer a demographic model of dog domestication. Using an inferred demographic model, we computed false discovery rates (FDR) and identified 349 outlier regions consistent with positive selection at a low FDR. The signals in the top 100 regions were frequently centered on candidate genes related to brain function and behavior, including LHFPL3, CADM2, GRIK3, SH3GL2, MBP, PDE7B, NTAN1, and GLRA1. These regions contained significant enrichments in behavioral ontology categories. The third top hit, CCRN4L, plays a major role in lipid metabolism, a link supported by additional metabolism-related candidates revealed in our scan, including SCP2D1 and PDXC1. Comparing our method to an empirical outlier approach that does not directly account for demography, we found only modest overlaps between the two methods, with 60% of empirical outliers having no overlap with our demography-based outlier detection approach. Demography-aware approaches have lower rates of false discovery. Our top candidates for selection, in addition to expanding the set of neurobehavioral candidate genes, include genes related to lipid metabolism, suggesting a dietary target of selection that was important during the period when proto-dogs hunted and fed alongside hunter-gatherers.
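    The method comparison reported in this abstract (the share of empirical outlier regions with no overlap in the demography-based set) amounts to an interval-overlap count. The sketch below shows one way such a fraction could be computed; it is not the authors' pipeline. The function name, the (chromosome, start, end) tuple layout, the half-open coordinate convention, and the example regions are all assumptions made for illustration.

```python
def fraction_without_overlap(query_regions, reference_regions):
    """Fraction of query regions that overlap no reference region.

    Each region is a hypothetical (chromosome, start, end) tuple using
    half-open coordinates [start, end). A simple all-pairs scan is used,
    which is adequate for a few hundred regions.
    """
    if not query_regions:
        raise ValueError("query_regions must not be empty")

    def overlaps(a, b):
        # Same chromosome and intersecting half-open intervals.
        return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]

    unmatched = sum(
        1
        for q in query_regions
        if not any(overlaps(q, r) for r in reference_regions)
    )
    return unmatched / len(query_regions)


# Made-up regions, chosen so that 3 of the 5 query regions have no overlap.
empirical = [
    ("chr1", 0, 100),
    ("chr1", 200, 300),
    ("chr2", 0, 50),
    ("chr3", 10, 20),
    ("chr3", 500, 600),
]
demography_based = [("chr1", 50, 150), ("chr3", 15, 18)]
share = fraction_without_overlap(empirical, demography_based)
```

    For genome-scale region sets, a sorted sweep or an interval tree would replace the all-pairs scan; the quadratic version is kept here for readability.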

  9. Estimating the total number of susceptibility variants underlying complex diseases from genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Hon-Cheong So

    2010-11-01

    Recently, genome-wide association studies (GWAS) have identified numerous susceptibility variants for complex diseases. In this study we proposed several approaches to estimate the total number of variants underlying these diseases. We assume that the variance explained by genetic markers (Vg) follows an exponential distribution, which is justified by previous studies on theories of adaptation. Our aim is to fit the observed distribution of Vg from GWAS to its theoretical distribution. The number of variants is obtained as the heritability divided by the estimated mean of the exponential distribution. In practice, due to limited sample sizes, there is insufficient power to detect variants with small effects. Therefore, power was taken into account in the fitting. Besides considering the most significant variants, we also tried relaxing the significance threshold, allowing more markers to be fitted. The effects of false positive variants were removed by considering the local false discovery rates. In addition, we developed an alternative approach that directly fits the z-statistics from GWAS to their theoretical distribution. In all cases, the "winner's curse" effect was corrected analytically. Confidence intervals were also derived. Simulations were performed to compare and verify the performance of different estimators (which incorporate various means of winner's curse correction) and the coverage of the proposed analytic confidence intervals. Our methodology only requires summary statistics and is able to handle both binary and continuous traits. Finally, we applied the methods to a few real disease examples (lipid traits, type 2 diabetes and Crohn's disease) and estimated that hundreds to nearly a thousand variants underlie these traits.
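    The core relation stated in this abstract, the number of variants as heritability divided by the mean of the exponential distribution of Vg, can be illustrated with a deliberately naive sketch. It uses the sample mean of observed Vg values as the estimate of the exponential mean and omits the power weighting, local false discovery rate filtering, and winner's curse correction that the abstract describes, so it should not be read as the authors' estimator. The function name and the input values are hypothetical.

```python
def naive_variant_count(heritability, vg_values):
    """Naive estimate: heritability / mean variance explained per variant.

    vg_values is a hypothetical list of per-variant variance-explained
    figures. For an exponential distribution, the maximum-likelihood
    estimate of the mean is the sample mean, which is what is used here.
    No correction is made for limited power or for the winner's curse, so
    with real GWAS hits the mean is biased upward and the count downward.
    """
    if not vg_values:
        raise ValueError("vg_values must not be empty")
    if any(v <= 0 for v in vg_values):
        raise ValueError("variance explained must be positive")
    mean_vg = sum(vg_values) / len(vg_values)
    return heritability / mean_vg


# Made-up inputs: heritability of 0.5 and a mean Vg of 0.002,
# giving an estimate of roughly 250 variants.
estimate = naive_variant_count(0.5, [0.001, 0.002, 0.003])
```

    The sketch makes the direction of the bias visible: detected variants tend to have larger Vg than undetected ones, which is why the abstract's fitting procedure accounts for power.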

  10. Evolution of genomic diversity and sex at extreme environments: Fungal life under hypersaline Dead Sea stress

    Science.gov (United States)

    Kis-Papo, Tamar; Kirzhner, Valery; Wasser, Solomon P.; Nevo, Eviatar

    2003-01-01

    We have found that genomic diversity is generally positively correlated with abiotic and biotic stress levels (1–3). However, beyond a high-threshold level of stress, the diversity declines to a few adapted genotypes. The Dead Sea is the harshest planetary hypersaline environment (340 g·liter–1 total dissolved salts, ≈10 times sea water). Hence, the Dead Sea is an excellent natural laboratory for testing the “rise and fall” pattern of genetic diversity with stress proposed in this article. Here, we examined genomic diversity of the ascomycete fungus Aspergillus versicolor from saline, nonsaline, and hypersaline Dead Sea environments. We screened the coding and noncoding genomes of A. versicolor isolates by using >600 AFLP (amplified fragment length polymorphism) markers (equal to loci). Genomic diversity was positively correlated with stress, culminating in the Dead Sea surface but dropped drastically in 50- to 280-m-deep seawater. The genomic diversity pattern paralleled the pattern of sexual reproduction of fungal species across the same southward gradient of increasing stress in Israel. This parallel may suggest that diversity and sex are intertwined intimately according to the rise and fall pattern and adaptively selected by natural selection in fungal genome evolution. Future large-scale verification in micromycetes will define further the trajectories of diversity and sex in the rise and fall pattern. PMID:14645702

  11. Structural variation and rates of genome evolution in the grass family seen through comparison of sequences of genomes greatly differing in size.

    Science.gov (United States)

    Dvorak, Jan; Wang, Le; Zhu, Tingting; Jorgensen, Chad M; Deal, Karin R; Dai, Xiongtao; Dawson, Matthew W; Müller, Hans-Georg; Luo, Ming-Cheng; Ramasamy, Ramesh K; Dehghani, Hamid; Gu, Yong Q; Gill, Bikram S; Distelfeld, Assaf; Devos, Katrien M; Qi, Peng; You, Frank M; Gulick, Patrick J; McGuire, Patrick E

    2018-05-16

    Homology was searched with genes annotated in the Aegilops tauschii pseudomolecules against genes annotated in the pseudomolecules of tetraploid wild emmer wheat, Brachypodium distachyon, sorghum, and rice. Similar searches were initiated with genes annotated in the rice pseudomolecules. Matrices of colinear genes and rearrangements in their order were constructed. Optical Bionano genome maps were constructed and used to validate rearrangements unique to the wild emmer and Ae. tauschii genomes. Most common rearrangements were short paracentric inversions and short intrachromosomal translocations. Intrachromosomal translocations outnumbered segmental intrachromosomal duplications. The densities of paracentric inversion lengths were approximated by exponential distributions in all six genomes. Densities of colinear genes along the Ae. tauschii chromosomes were highly correlated with meiotic recombination rates but those of rearrangements were not, suggesting different causes of the erosion of gene colinearity and evolution of major chromosome rearrangements. Frequent rearrangements sharing breakpoints suggested that chromosomes have been rearranged recurrently at some sites. The distal 4 Mb of the short arms of rice chromosomes Os11 and Os12 and corresponding regions in the sorghum, B. distachyon, and Triticeae genomes contain clusters of interstitial translocations including from 1 to 7 colinear genes. The rates of acquisition of major rearrangements were greater in the wild emmer wheat and Ae. tauschii genomes than in the lineage preceding their divergence or in the B. distachyon, rice, and sorghum lineages. It is suggested that synergy between large quantities of dynamic transposable elements and annual growth habit caused the fast evolution of the Triticeae genomes. This article is protected by copyright. All rights reserved.

  12. Population-genomic variation within RNA viruses of the Western honey bee, Apis mellifera, inferred from deep sequencing.

    Science.gov (United States)

    Cornman, Robert Scott; Boncristiani, Humberto; Dainat, Benjamin; Chen, Yanping; vanEngelsdorp, Dennis; Weaver, Daniel; Evans, Jay D

    2013-03-07

    Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li's D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens.

  13. Analysis of Genome-Wide Copy Number Variations in Chinese Indigenous and Western Pig Breeds by 60 K SNP Genotyping Arrays

    Science.gov (United States)

    Sun, Yaqi; Wang, Hongyang; Wang, Chao; Yu, Shaobo; Liu, Jing; Zhang, Yu; Fan, Bin; Li, Kui; Liu, Bang

    2014-01-01

    Copy number variations (CNVs) represent a substantial source of structural variants in mammals and contribute to both normal phenotypic variability and disease susceptibility. Although low-resolution CNV maps have been produced for many domestic animals, and several reports have been published on CNVs in the porcine genome, the differences between Chinese and western pigs remain to be elucidated. In this study, we used the Porcine SNP60 BeadChip and the PennCNV algorithm to perform genome-wide CNV detection in 302 individuals from six Chinese indigenous breeds (Tongcheng, Laiwu, Luchuan, Bama, Wuzhishan and Ningxiang pigs), three western breeds (Yorkshire, Landrace and Duroc) and one hybrid (Tongcheng×Duroc). A total of 348 CNV regions (CNVRs) across the genome were identified, covering 150.49 Mb of the pig genome or 6.14% of the autosomal genome sequence. Of these CNVRs, 213 were found only in the six Chinese indigenous breeds, and 60 only in the three western breeds. The characteristics of CNVs in four Chinese normal size breeds (Luchuan, Tongcheng and Laiwu pigs) and two minipig breeds (Bama and Wuzhishan pigs) were also analyzed in this study. Functional annotation suggested that these CNVRs possess a great variety of molecular functions and may play important roles in phenotypic and production trait differences between Chinese and western breeds. Our results are an important complement to the CNV map of the pig genome, provide new information about the diversity of Chinese and western pig breeds, and facilitate further research on porcine genome CNVs. PMID:25198154

  14. Peroxisomal monodehydroascorbate reductase. Genomic clone characterization and functional analysis under environmental stress conditions.

    Science.gov (United States)

    Leterrier, Marina; Corpas, Francisco J; Barroso, Juan B; Sandalio, Luisa M; del Río, Luis A

    2005-08-01

    In plant cells, ascorbate is a major antioxidant that is involved in the ascorbate-glutathione cycle. Monodehydroascorbate reductase (MDAR) is the enzymatic component of this cycle involved in the regeneration of reduced ascorbate. The identification of the intron-exon organization and the promoter region of the pea (Pisum sativum) MDAR 1 gene was achieved in pea leaves using the method of walking polymerase chain reaction on genomic DNA. The nuclear gene of MDAR 1 comprises nine exons and eight introns, giving a total length of 3,770 bp. The sequence of 544 bp upstream of the initiation codon, which contains the promoter and 5' untranslated region, and 190 bp downstream of the stop codon were also determined. The presence of different regulatory motifs in the promoter region of the gene might indicate distinct responses to various conditions. The expression analysis in different plant organs by northern blots showed that fruits had the highest level of MDAR. Confocal laser scanning microscopy analysis of pea leaves transformed with Agrobacterium tumefaciens having the binary vectors pGD, which contain the autofluorescent proteins enhanced green fluorescent protein and enhanced yellow fluorescent protein with the full-length cDNA for MDAR 1 and catalase, indicated that the MDAR 1 encoded the peroxisomal isoform. The functional analysis of MDAR by activity and protein expression was studied in pea plants grown under eight stress conditions, including continuous light, high light intensity, continuous dark, mechanical wounding, low and high temperature, cadmium, and the herbicide 2,4-dichlorophenoxyacetic acid. This functional analysis is representative of all the MDAR isoforms present in the different cell compartments. Results obtained showed a significant induction by high light intensity and cadmium. On the other hand, expression studies, performed by semiquantitative reverse transcription-polymerase chain reaction, demonstrated differential expression patterns of

  15. Rice sHsp genes: genomic organization and expression profiling under stress and development

    Directory of Open Access Journals (Sweden)

    Grover Anil

    2009-08-01

    Background: Heat shock proteins (Hsps) constitute an important component in the heat shock response of all living systems. Among the various plant Hsps (i.e. Hsp100, Hsp90, Hsp70 and Hsp20), Hsp20 or small Hsps (sHsps) are expressed in maximal amounts under high temperature stress. The characteristic feature of the sHsps is the presence of an α-crystallin domain (ACD) at the C-terminus. sHsps cooperate with Hsp100/Hsp70 and co-chaperones in an ATP-dependent manner in preventing aggregation of cellular proteins and in their subsequent refolding. A database search was performed to investigate the sHsp gene family across the rice genome sequence, followed by comprehensive expression analysis of these genes. Results: We identified 40 α-crystallin domain containing genes in rice. Phylogenetic analysis showed that 23 out of these 40 genes constitute sHsps. The additional 17 genes containing ACD clustered with Acd proteins of Arabidopsis. Detailed scrutiny of 23 sHsp sequences enabled us to categorize these proteins in a revised scheme of classification consisting of 16 cytoplasmic/nuclear, 2 ER, 3 mitochondrial, 1 plastid and 1 peroxisomal genes. In the new classification proposed herein, the nucleo-cytoplasmic class of sHsps, with 9 subfamilies, is more complex in rice than in Arabidopsis. Strikingly, 17 of 23 rice sHsp genes were noted to be intronless. Expression analysis based on microarray and RT-PCR showed that 19 sHsp genes were upregulated by high temperature stress. Besides heat stress, expression of sHsp genes was up- or downregulated by other abiotic and biotic stresses. In addition to stress regulation, various sHsp genes were differentially upregulated at different developmental stages of the rice plant. The majority of sHsp genes were expressed in seed. Conclusion: We identified twenty-three sHsp genes and seventeen Acd genes in rice. Three nucleocytoplasmic sHsp genes were found only in monocots. Analysis of expression profiling of sHsp genes revealed

  16. Genome-wide transcriptome analysis of soybean primary root under varying water-deficit conditions.

    Science.gov (United States)

    Song, Li; Prince, Silvas; Valliyodan, Babu; Joshi, Trupti; Maldonado dos Santos, Joao V; Wang, Jiaojiao; Lin, Li; Wan, Jinrong; Wang, Yongqin; Xu, Dong; Nguyen, Henry T

    2016-01-15

    Soybean is a major crop that provides an important source of protein and oil to humans and animals, but its production can be dramatically decreased by the occurrence of drought stress. Soybeans can survive drought stress if there is a robust and deep root system at the early vegetative growth stage. However, little is known about the genome-wide molecular mechanisms contributing to soybean root system architecture. This study was performed to gain knowledge on transcriptome changes and related molecular mechanisms contributing to soybean root development under water-limited conditions. The soybean Williams 82 genotype was subjected to very mild stress (VMS), mild stress (MS) and severe stress (SS) conditions, as well as recovery from the severe stress after re-watering (SR). In total, 6,609 genes in the roots showed differential expression patterns in response to different water-deficit stress levels. Genes involved in hormone (Auxin/Ethylene), carbohydrate, and cell wall-related metabolism (XTH/lipid/flavonoids/lignin) pathways were differentially regulated in the soybean root system. Several transcription factors (TFs) regulating root growth and responses under varying water-deficit conditions were identified and the expression patterns of six TFs were found to be common across the stress levels. Further analysis at the whole-plant level revealed tissue-specific or water-deficit-level-specific regulation of transcription factors. Analysis of the over-represented motifs of different gene groups revealed several new cis-elements associated with different levels of water deficit. The expression patterns of 18 genes were confirmed by the quantitative reverse transcription polymerase chain reaction method, demonstrating the accuracy and effectiveness of RNA-Seq. The primary root specific transcriptome in soybean can enable a better understanding of the root response to water deficit conditions. The genes detected in root tissues that were associated with

  17. Genome size variation in the pine fusiform rust pathogen Cronartium quercuum f.sp. fusiforme as determined by flow cytometry

    Science.gov (United States)

    Claire L Anderson; Thomas L Kubisiak; C Dana Nelson; Jason A Smith; John M Davis

    2010-01-01

    The genome size of the pine fusiform rust pathogen Cronartium quercuum f.sp. fusiforme (Cqf) was determined by flow cytometric analysis of propidium iodide-stained, intact haploid pycniospores with haploid spores of two genetically well characterized fungal species, Sclerotinia sclerotiorum and Puccinia graminis f.sp. tritici, as size standards. The Cqf haploid genome...

  18. Genome-Wide Identification and Expression Analysis of WRKY Transcription Factors under Multiple Stresses in Brassica napus.

    Science.gov (United States)

    He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei

    2016-01-01

    WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. oleracea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to tolerance to various stresses. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to all three stresses simultaneously, which suggests that these three WRKY genes may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related QTL regions

  19. Genome-Wide Identification and Expression Analysis of WRKY Transcription Factors under Multiple Stresses in Brassica napus.

    Directory of Open Access Journals (Sweden)

    Yajun He

    WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. oleracea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to tolerance to various stresses. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to all three stresses simultaneously, which suggests that these three WRKY genes may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress related

  20. [Soil moisture variation under different water and fertilization managements in apple orchard of Weibei dryland, China].

    Science.gov (United States)

    Zhao, Zhi Yuan; Zheng, Wei; Liu, Jie; Ma, Peng Yi; Li, Zi Yan; Zhai, Bing Nian; Wang, Zhao Hui

    2018-04-01

    To evaluate the variations of soil moisture under different water and fertilizer treatments in an apple orchard in the Weibei dryland, a field experiment was carried out in 2013-2016 at Tianjiawa Village, Baishui County, Shaanxi Province. There were three treatments, i.e., the farmers' traditional model (only addition of NPK chemical fertilizer, FM), the extension model (swine manure and NPK chemical fertilizer combined with black plastic film in the tree row space, EM), and the optimized model (swine manure and NPK chemical fertilizer combined with black plastic film in the tree row space and planting rape in the inter-row of apple trees, OM). The results showed that the OM treatment significantly increased soil water storage capacity in the 0-200 cm soil layer. Water content of the 0-100 cm soil layer was increased by 5.6% and 15.3% in the dry season compared with the FM and EM treatments, respectively. Moreover, the soil water relative deficit index of OM was lower than that of EM in the 200-300 cm soil layer. The rainfall infiltration in the dry year could reach 300 cm depth under OM. Meanwhile, OM stabilized soil water content and efficiently alleviated the desiccation in the deep soil layer. Compared with FM and EM, the 4-year average yield of OM was increased by 36.6% and 22.5%, respectively. In summary, OM could increase water use efficiency through increasing the contents of available soil water and improving the soil water condition in shallow and deep layers, which helps alleviate the soil water deficit in the deep layer and increases yield.

  1. Growth dynamics variation of different larch provenances under the mountain conditions in Poland

    Energy Technology Data Exchange (ETDEWEB)

    Kulej, M. [Univ. of Agriculture, Cracow (Poland). Section of Seed Production and Selection

    1995-12-31

    The results of 25-year investigations based on measurements and statistical analysis concerning the growth dynamics variation of larch provenances from the entire area of Poland are reported in this paper. This is the first larch provenance experiment in Poland under mountain conditions. The results obtained showed a significant variability among the provenances tested as regards the basic growth characters (height, d.b.h., growth index) at the age of 5, 8, 11, 15, 20 and 25 years. The larch from Klodzko and Proszkow turned out to be the best in respect of growth during the entire 25-year period. Decidedly bad were provenances from Marcule, Grojec, Rawa Mazowiecka and Kroscienko. We cannot forecast the future growth of larch when trees are 5 years old since such prognosis may carry an error. However, on the basis of the results obtained it may be concluded that when trees are about 8 years old the stabilization of the position of individual provenances as regards growth takes place. The height growth curves for the individual provenances during the 25-year period (with exception of the provenance from Marcule) fall within the interval ±0.5S from the compensated curve for the entire population studied. All larch provenances in the experiment had reached the height growth culmination. The greatest differentiation in respect of this character occurred in case of the provenances from Sudetes. 27 refs, 4 figs, 8 tabs

  2. Plasticity in variation of xylem and phloem cell characteristics of Norway spruce under different local conditions

    Directory of Open Access Journals (Sweden)

    Jozica Gricar

    2015-09-01

    There is limited information on intra-annual plasticity of secondary tissues of tree species growing under different environmental conditions. To increase the knowledge about the plasticity of secondary growth, which allows trees to adapt to specific local climatic regimes, we examined climate–radial growth relationships of Norway spruce (Picea abies (L.) H. Karst.) from three contrasting locations in the temperate climatic zone by analyzing tree-ring widths for the period 1932–2010, and cell characteristics in xylem and phloem increments formed in the years 2009–2011. Variation in the structure of xylem and phloem increments clearly shows that plasticity in seasonal dynamics of cambial cell production and cell differentiation exists on xylem and phloem sides. Anatomical characteristics of xylem and phloem cells are predominantly site-specific characteristics, because they varied among sites but were fairly uniform among years in trees from the same site. Xylem and phloem tissues formed in the first part of the growing season seemed to be more stable in structure, indicating their priority over latewood and late phloem for tree performance. Long-term climate and radial growth analyses revealed that growth was in general less dependent on precipitation than on temperature; however, growth sensitivity to local conditions differed among the sites. Only partial dependence of radial growth of spruce on climatic factors on the selected sites confirms its strategy to adapt the structure of wood and phloem increments to function optimally in local conditions.

  3. Growth dynamics variation of different larch provenances under the mountain conditions in Poland

    Energy Technology Data Exchange (ETDEWEB)

    Kulej, M [Univ. of Agriculture, Cracow (Poland). Section of Seed Production and Selection

    1996-12-31

    The results of 25-year investigations based on measurements and statistical analysis concerning the growth dynamics variation of larch provenances from the entire area of Poland are reported in this paper. This is the first larch provenance experiment in Poland under mountain conditions. The results obtained showed a significant variability among the provenances tested as regards the basic growth characters (height, d.b.h., growth index) at the age of 5, 8, 11, 15, 20 and 25 years. The larch from Klodzko and Proszkow turned out to be the best in respect of growth during the entire 25-year period. Decidedly bad were provenances from Marcule, Grojec, Rawa Mazowiecka and Kroscienko. We cannot forecast the future growth of larch when trees are 5 years old since such prognosis may carry an error. However, on the basis of the results obtained it may be concluded that when trees are about 8 years old the stabilization of the position of individual provenances as regards growth takes place. The height growth curves for the individual provenances during the 25-year period (with exception of the provenance from Marcule) fall within the interval ±0.5S from the compensated curve for the entire population studied. All larch provenances in the experiment had reached the height growth culmination. The greatest differentiation in respect of this character occurred in case of the provenances from Sudetes. 27 refs, 4 figs, 8 tabs

  4. Pseudomonas aeruginosa Genome Evolution in Patients and under the Hospital Environment

    Directory of Open Access Journals (Sweden)

    Céline Lucchetti-Miganeh

    2014-04-01

    Pseudomonas aeruginosa is a Gram-negative environmental species and an opportunistic microorganism, establishing itself in vulnerable patients, such as those with cystic fibrosis (CF) or those hospitalized in intensive care units (ICU). It has become a major cause of nosocomial infections worldwide and a serious threat to Public Health because of overuse and misuse of antibiotics that have selected highly resistant strains against which very few therapeutic options exist. Herein is illustrated the intraclonal evolution of the genome of sequential isolates collected in a single CF patient from the early phase of pulmonary colonization to the fatal outcome. We also examined at the whole genome scale a pair of genotypically-related strains made of a drug susceptible, environmental isolate recovered from an ICU sink and of its multidrug resistant counterpart found to infect an ICU patient. Multiple genetic changes accumulated in the CF isolates over the disease time course including SNPs, deletion events and reduction of whole genome size. The strain isolated from the ICU patient displayed an increase in the genome size of 4.8% with major genetic rearrangements as compared to the initial environmental strain. The annotated genomes are given in free access in an interactive web application WallGene designed to facilitate large-scale comparative analysis and thus allowing investigators to explore homologies and syntenies between P. aeruginosa strains, here PAO1 and the five clinical strains described.

  5. Analysis of hydrologic variation under climate change environment in southern Taiwan

    Science.gov (United States)

    Chen, Yung-Chau; Chen, Yu-Chin; Chen, Wen-Fu

    2014-05-01

    Impact assessment and adaptation are important issues in responding to climate change. We need to know the effects of climate change on hydrologic characteristics before estimating the impacts on, and making adaptation strategies for, the area of concern. The wet and dry seasons of southern Taiwan are pronounced. The average annual rainfall is about 2,100 mm in southern Taiwan. Most of the rainfall occurs in the wet season and is caused by cyclones (typhoons) or thunderstorms. It implies that both quantity and intensity of rainfall are large in the wet season, while they are small in the dry season. Correspondingly, the possibility of flood in the wet season and drought in the dry season is high. This means significant hydrologic variations may cause disasters. The purpose of this study is to analyze hydrologic variation due to recent climate changes in southern Taiwan, and to provide decision makers with information for understanding possible impacts and making adaptation strategies. Before typhoon Morakot hit Taiwan, southern Taiwan was suffering from aridity. As usual, people were expecting that the rainfall accompanying typhoons would resolve the drought in this area. However, a huge amount of rain fell within a short period of time and became a major disaster in this area. The rainfall was an over-200-year event, a record breaker. The data used in this research are based on the records of the Taiwan Central Weather Bureau at the Chiayi, Tainan, Kaohsiung, and Hengchun stations. The trends of temperature, amount of rainfall, and number of rainy days are examined. Both the Mann-Kendall trend test and linear regression are used to examine the trends. The results show that annual mean temperatures at Chiayi, Tainan, Kaohsiung, and Hengchun have risen 0.5~0.9°C during past decades under the impact of global warming. The amount of annual rainfall does not show a statistically significant trend. However, the number of annual rainy
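
    The Mann-Kendall trend test named in this abstract can be sketched in a few lines. The block below is a minimal illustration under stated assumptions, not the authors' implementation: it uses the standard normal approximation with a tie correction, the function name mann_kendall is hypothetical, and the temperature series is made up for demonstration rather than taken from the Central Weather Bureau records.

```python
import math
from collections import Counter


def mann_kendall(values):
    """Two-sided Mann-Kendall trend test (normal approximation).

    Returns (S, Z, p). S is the sum of signs of all pairwise later-minus-earlier
    differences; Z is the continuity-corrected standard score; p is two-sided.
    """
    n = len(values)
    if n < 3:
        raise ValueError("need at least 3 observations")

    # S statistic: +1 for every increasing pair, -1 for every decreasing pair.
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            diff = values[j] - values[i]
            s += (diff > 0) - (diff < 0)

    # Variance of S, corrected for groups of tied values.
    var_s = n * (n - 1) * (2 * n + 5)
    for t in Counter(values).values():
        if t > 1:
            var_s -= t * (t - 1) * (2 * t + 5)
    var_s /= 18.0

    # Continuity correction; Z is 0 when S is 0 (also covers a constant series).
    if s > 0:
        z = (s - 1) / math.sqrt(var_s)
    elif s < 0:
        z = (s + 1) / math.sqrt(var_s)
    else:
        z = 0.0

    # Two-sided p-value from the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2))
    return s, z, p


if __name__ == "__main__":
    # Made-up annual mean temperatures (degrees C), for illustration only.
    temps = [23.1, 23.0, 23.4, 23.3, 23.6, 23.5, 23.8, 23.9, 23.7, 24.0]
    s, z, p = mann_kendall(temps)
    print(f"S={s}, Z={z:.2f}, p={p:.4f}")
```

    A small p-value together with a positive S would indicate a monotonic upward trend; the test makes no assumption that the trend is linear, which is why it is often paired with linear regression as in the abstract.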

  6. Genome-wide DNA methylation alterations of Alternanthera philoxeroides in natural and manipulated habitats: implications for epigenetic regulation of rapid responses to environmental fluctuation and phenotypic variation.

    Science.gov (United States)

    Gao, Lexuan; Geng, Yupeng; Li, Bo; Chen, Jiakuan; Yang, Ji

    2010-11-01

    Alternanthera philoxeroides (alligator weed) is an invasive weed that can colonize both aquatic and terrestrial habitats. Individuals growing in different habitats exhibit extensive phenotypic variation but little genetic differentiation in its introduced range. The mechanisms underpinning the wide range of phenotypic variation and rapid adaptation to novel and changing environments remain uncharacterized. In this study, we examined the epigenetic variation and its correlation with phenotypic variation in plants exposed to natural and manipulated environmental variability. Genome-wide methylation profiling using methylation-sensitive amplified fragment length polymorphism (MSAP) revealed considerable DNA methylation polymorphisms within and between natural populations. Plants of different source populations not only underwent significant morphological changes in common garden environments, but also underwent a genome-wide epigenetic reprogramming in response to different treatments. Methylation alterations associated with response to different water availability were detected in 78.2% (169/216) of common garden induced polymorphic sites, demonstrating the environmental sensitivity and flexibility of the epigenetic regulatory system. These data provide evidence of the correlation between epigenetic reprogramming and the reversible phenotypic response of alligator weed to particular environmental factors. © 2010 Blackwell Publishing Ltd.

  7. The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes

    Science.gov (United States)

    Liu, Shengyi; Liu, Yumei; Yang, Xinhua; Tong, Chaobo; Edwards, David; Parkin, Isobel A. P.; Zhao, Meixia; Ma, Jianxin; Yu, Jingyin; Huang, Shunmou; Wang, Xiyin; Wang, Junyi; Lu, Kun; Fang, Zhiyuan; Bancroft, Ian; Yang, Tae-Jin; Hu, Qiong; Wang, Xinfa; Yue, Zhen; Li, Haojie; Yang, Linfeng; Wu, Jian; Zhou, Qing; Wang, Wanxin; King, Graham J; Pires, J. Chris; Lu, Changxin; Wu, Zhangyan; Sampath, Perumal; Wang, Zhuo; Guo, Hui; Pan, Shengkai; Yang, Limei; Min, Jiumeng; Zhang, Dong; Jin, Dianchuan; Li, Wanshun; Belcram, Harry; Tu, Jinxing; Guan, Mei; Qi, Cunkou; Du, Dezhi; Li, Jiana; Jiang, Liangcai; Batley, Jacqueline; Sharpe, Andrew G; Park, Beom-Seok; Ruperao, Pradeep; Cheng, Feng; Waminal, Nomar Espinosa; Huang, Yin; Dong, Caihua; Wang, Li; Li, Jingping; Hu, Zhiyong; Zhuang, Mu; Huang, Yi; Huang, Junyan; Shi, Jiaqin; Mei, Desheng; Liu, Jing; Lee, Tae-Ho; Wang, Jinpeng; Jin, Huizhe; Li, Zaiyun; Li, Xun; Zhang, Jiefu; Xiao, Lu; Zhou, Yongming; Liu, Zhongsong; Liu, Xuequn; Qin, Rui; Tang, Xu; Liu, Wenbin; Wang, Yupeng; Zhang, Yangyong; Lee, Jonghoon; Kim, Hyun Hee; Denoeud, France; Xu, Xun; Liang, Xinming; Hua, Wei; Wang, Xiaowu; Wang, Jun; Chalhoub, Boulos; Paterson, Andrew H

    2014-01-01

    Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear. Brassica is an ideal model to increase knowledge of polyploid evolution. Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes. Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B. oleracea. This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus. PMID:24852848

  8. Extensive structural variations between mitochondrial genomes of CMS and normal peppers (Capsicum annuum L.) revealed by complete nucleotide sequencing.

    Science.gov (United States)

    Jo, Yeong Deuk; Choi, Yoomi; Kim, Dong-Hwan; Kim, Byung-Dong; Kang, Byoung-Cheorl

    2014-07-04

    Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearrangements caused by recombination. However, the mitochondrial genome structure and the DNA rearrangements that may be related to CMS have not been characterized in Capsicum spp. We obtained the complete mitochondrial genome sequences of the pepper CMS line FS4401 (507,452 bp) and the fertile line Jeju (511,530 bp). Comparative analysis of the mitochondrial genomes of pepper and tobacco, both members of the Solanaceae, revealed extensive DNA rearrangements and poor conservation in non-coding DNA. Between the pepper lines, FS4401 and Jeju mitochondrial DNAs contained the same complement of protein coding genes except for one additional copy of an atp6 gene (ψatp6-2) in FS4401. In terms of genome structure, we found eighteen syntenic blocks in the two mitochondrial genomes, which have been rearranged in each genome. By contrast, sequences between syntenic blocks, which were specific to each line, accounted for 30,380 and 17,847 bp in FS4401 and Jeju, respectively. The previously-reported CMS candidate genes, orf507 and ψatp6-2, were located on the edges of the largest sequence segments that were specific to FS4401. In this region, a large number of small sequence segments that were absent from, or found at different locations in, the Jeju mitochondrial genome were combined together. The incorporation of repeats and overlapping of connected sequence segments by a few nucleotides implied that extensive rearrangements by homologous recombination might be involved in evolution of this region. Further analysis using mtDNA pairs from other plant species revealed common features of DNA regions around CMS-associated genes. Although a large portion of sequence context was

  9. Genome-wide association study identified genetic variations and candidate genes for plant architecture component traits in Chinese upland cotton.

    Science.gov (United States)

    Su, Junji; Li, Libei; Zhang, Chi; Wang, Caixiang; Gu, Lijiao; Wang, Hantao; Wei, Hengling; Liu, Qibao; Huang, Long; Yu, Shuxun

    2018-06-01

    Thirty significant associations between 22 SNPs and five plant architecture component traits in Chinese upland cotton were identified via GWAS. Four peak SNP loci located on chromosome D03 were simultaneously associated with more plant architecture component traits. A candidate gene, Gh_D03G0922, might be responsible for plant height in upland cotton. A compact plant architecture is increasingly required for mechanized harvesting processes in China. Therefore, cotton plant architecture is an important trait, and its components, such as plant height, fruit branch length and fruit branch angle, affect the suitability of a cultivar for mechanized harvesting. To determine the genetic basis of cotton plant architecture, a genome-wide association study (GWAS) was performed using a panel composed of 355 accessions and 93,250 single nucleotide polymorphisms (SNPs) identified using the specific-locus amplified fragment sequencing method. Thirty significant associations between 22 SNPs and five plant architecture component traits were identified via GWAS. Most importantly, four peak SNP loci located on chromosome D03 were simultaneously associated with more plant architecture component traits, and these SNPs were harbored in one linkage disequilibrium block. Furthermore, 21 candidate genes for plant architecture were predicted in a 0.95-Mb region including the four peak SNPs. One of these genes (Gh_D03G0922) was near the significant SNP D03_31584163 (8.40 kb), and its Arabidopsis homologs contain MADS-box domains that might be involved in plant growth and development. qRT-PCR showed that the expression of Gh_D03G0922 was upregulated in the apical buds and young leaves of the short and compact cotton varieties, and virus-induced gene silencing (VIGS) proved that the silenced plants exhibited increased PH. These results indicate that Gh_D03G0922 is likely the candidate gene for PH in cotton. The genetic variations and candidate genes identified in this study lay a foundation

  10. Common and Rare Coding Genetic Variation Underlying the Electrocardiographic PR Interval

    DEFF Research Database (Denmark)

    Lin, Honghuang; van Setten, Jessica; Smith, Albert V

    2018-01-01

    BACKGROUND: Electrical conduction from the cardiac sinoatrial node to the ventricles is critical for normal heart function. Genome-wide association studies have identified more than a dozen common genetic loci that are associated with PR interval. However, it is unclear whether rare and low-frequ...

  11. Interannual variations in needle and sapwood traits of Pinus edulis branches under an experimental drought.

    Science.gov (United States)

    Guérin, Marceau; Martin-Benito, Dario; von Arx, Georg; Andreu-Hayles, Laia; Griffin, Kevin L; Hamdan, Rayann; McDowell, Nate G; Muscarella, Robert; Pockman, William; Gentine, Pierre

    2018-02-01

    In the southwestern USA, recent large-scale die-offs of conifers raise the question of their resilience and mortality under droughts. To date, little is known about the interannual structural response to droughts. We hypothesized that piñon pines (Pinus edulis) respond to drought by reducing the drop of leaf water potential in branches from year to year through needle morphological adjustments. We tested our hypothesis using a 7-year experiment in central New Mexico with three watering treatments (irrigated, normal, and rain exclusion). We analyzed how variation in "evaporative structure" (needle length, stomatal diameter, stomatal density, stomatal conductance) responded to watering treatment and interannual climate variability. We further analyzed annual functional adjustments by comparing yearly addition of needle area (LA) with yearly addition of sapwood area (SA) and distance to tip (d), defining the yearly ratios SA:LA and SA:LA/d. Needle length (l) increased with increasing winter and monsoon water supply, and showed more interannual variability when the soil was drier. Stomatal density increased with dryness, while stomatal diameter was reduced. As a result, anatomical maximal stomatal conductance was relatively invariant across treatments. SA:LA and SA:LA/d showed significant differences across treatments and contrary to our expectation were lower with reduced water input. Within average precipitation ranges, the response of these ratios to soil moisture was similar across treatments. However, when extreme soil drought was combined with high VPD, needle length, SA:LA and SA:LA/d became highly nonlinear, emphasizing the existence of a response threshold of combined high VPD and dry soil conditions. In new branch tissues, the response of annual functional ratios to water stress was immediate (same year) and does not attempt to reduce the drop of water potential. We suggest that unfavorable evaporative structural response to drought is compensated
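
    The two yearly ratios defined in this abstract, SA:LA and SA:LA/d, are simple quotients. The sketch below shows only the arithmetic: the function name functional_ratios, the units, and the example numbers are assumptions for illustration and are not taken from the study.

```python
def functional_ratios(sapwood_area, needle_area, distance_to_tip):
    """Return (SA:LA, SA:LA/d) for one branch-year.

    sapwood_area    -- yearly addition of sapwood area (SA), assumed in cm^2
    needle_area     -- yearly addition of needle area (LA), assumed in cm^2
    distance_to_tip -- distance to the branch tip (d), assumed in m
    """
    if needle_area <= 0 or distance_to_tip <= 0:
        raise ValueError("needle_area and distance_to_tip must be positive")
    sa_la = sapwood_area / needle_area
    sa_la_per_d = sa_la / distance_to_tip
    return sa_la, sa_la_per_d


if __name__ == "__main__":
    # Made-up values for a single branch-year, for illustration only.
    sa_la, sa_la_per_d = functional_ratios(2.0, 100.0, 0.5)
    print(f"SA:LA = {sa_la:.3f}, SA:LA/d = {sa_la_per_d:.3f}")
```

    Computing both ratios per year and per treatment gives the series whose response to soil moisture the abstract describes.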

  12. Seasonal variations in TSH serum levels in athyreotic patients under L-thyroxine replacement monotherapy.

    Science.gov (United States)

    Gullo, Damiano; Latina, Adele; Frasca, Francesco; Squatrito, Sebastiano; Belfiore, Antonino; Vigneri, Riccardo

    2017-08-01

    Whether serum TSH undergoes seasonal fluctuations in euthyroid and hypothyroid residents of temperate climates is controversial. Monthly TSH and thyroid hormone levels were cross-sectionally analysed in a large cohort of euthyroid subjects (n=11 806) and L-thyroxine (L-T4)-treated athyreotic patients (n=3 934). Moreover, in a small group (n=119) of athyreotic patients treated with an unchanged dosage of L-T4 monotherapy, hormones were measured both in the coldest and in the hottest seasons of the same year (longitudinal study). No seasonal hormone change was observed in the euthyroid subjects except for a small FT3 increase in winter (+2.9%, P<.001). In contrast, the L-T4-treated athyreotic patients had significantly higher serum TSH values in the cold season when the FT4 values were significantly lower. The differences were more notable in the longitudinal series (TSH, 0.80 vs. 0.20 mU/L and FT4, 16.3 vs. 17.8 pmol/L in December-March vs. June-September, respectively). In these patients also serum FT3 values significantly decreased in winter (in the longitudinal series, 3.80 in winter vs 4.07 pmol/L in summer). Regression analysis showed that in athyreotic subjects, a greater FT4 change is required to obtain a TSH change similar to that of euthyroid controls and that this effect is more pronounced in the summer. Athyreotic patients undergoing L-T4 monotherapy have abnormal seasonal variations in TSH. These changes are secondary to the FT4 and FT3 serum decreases in winter, which occur in spite of the constant treatment. The underlying mechanisms are unclear, but in some cases, these changes may be clinically relevant. © 2017 John Wiley & Sons Ltd.

  13. Spatiotemporal Variations of Extreme Precipitation under a Changing Climate in the Three Gorges Reservoir Area (TGRA)

    Directory of Open Access Journals (Sweden)

    Mingquan Lü

    2018-01-01

    The Three Gorges Dam (TGD) is one of the largest hydroelectric projects in the world. Monitoring the spatiotemporal distribution of extreme precipitation offers valuable information for adaptation and mitigation strategies and reservoir management schemes. This study examined variations in extreme precipitation over the Three Gorges Reservoir area (TGRA) in China to investigate the potential role of climate warming and Three Gorges Reservoir (TGR). The trends in extreme precipitation over the TGRA were investigated using the iterative-based Mann–Kendall (MK) test and Sen’s slope estimator, based on weather station daily data series and TRMM (Tropical Rainfall Measuring Mission) data series. The mean and density distribution of extreme precipitation indices between pre-dam and post-dam, pre-1985 and post-1985, and near and distant reservoir area were assessed by the Mann–Whitney test and the Kolmogorov–Smirnov test. The ratio of extreme precipitation to non-extreme precipitation became larger. The precipitation was characterized by increases in heavy precipitation as well as decreases in light and moderate rain. Comparing extreme precipitation indices between pre-1985 (cooling) and post-1985 (warming) indicated extreme precipitation has changed to become heavier. Under climate warming, the precipitation amount corresponding to more than the 95th percentile increased at the rate of 6.48%/°C. Results from comparing extreme precipitation for the pre- and post-dam, near reservoir area (NRA) and away from the reservoir area (ARA) imply an insignificant role of the TGR on rainfall extremes over the TGRA. Moreover, the impoundment of TGR did not exert detectable impacts on the surface relative humidity (RH) and water vapor pressure (WP).
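
    Sen's slope estimator, mentioned in this abstract alongside the Mann-Kendall test, is the median of all pairwise slopes in a series. The following is a minimal sketch of that definition, not the authors' iterative procedure; the function name sens_slope and the example values are hypothetical.

```python
from statistics import median


def sens_slope(times, values):
    """Sen's slope: the median of (v_j - v_i) / (t_j - t_i) over all pairs i < j.

    Pairs that share the same time stamp are skipped to avoid division by zero.
    """
    if len(times) != len(values):
        raise ValueError("times and values must have the same length")
    slopes = [
        (values[j] - values[i]) / (times[j] - times[i])
        for i in range(len(times))
        for j in range(i + 1, len(times))
        if times[j] != times[i]
    ]
    if not slopes:
        raise ValueError("need at least two observations at distinct times")
    return median(slopes)


if __name__ == "__main__":
    # Made-up yearly index with one outlier, for illustration only.
    years = [0, 1, 2, 3, 4]
    index = [1, 3, 5, 100, 9]
    print(sens_slope(years, index))
```

    Because it takes a median rather than a least-squares fit, the estimate is much less sensitive to a single extreme year, which is a common reason for using it with precipitation extremes.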

  14. Genomics tools available for unravelling mechanisms underlying agronomical traits in strawberry with more to come

    Science.gov (United States)

    In the last few years, high-throughput genomics promised to bridge the gap between plant physiology and plant sciences. In addition, high-throughput genotyping technologies facilitate marker-based selection for better performing genotypes. In strawberry, Fragaria vesca was the first reference sequen...

  15. Epigenetic dysregulation underlies radiation-induced transgenerational genome instability in vivo

    International Nuclear Information System (INIS)

    Koturbash, Igor; Baker, Mike; Loree, Jonathan; Kutanzi, Kristy; Hudson, Darryl; Pogribny, Igor; Sedelnikova, Olga; Bonner, William; Kovalchuk, Olga

    2006-01-01

    Purpose: Although modern cancer radiation therapy has led to increased patient survival rates, the risk of radiation treatment-related complications is becoming a growing problem. Among various complications, radiation also poses a threat to the progeny of exposed parents. It causes transgenerational genome instability that is linked to transgenerational carcinogenesis. Although the occurrence of transgenerational genome instability, which manifests as elevated delayed and nontargeted mutation, has been well documented, the mechanisms by which it arises remain obscure. We hypothesized that epigenetic alterations may play a pivotal role in the molecular etiology of transgenerational genome instability. Methods and Materials: We studied the levels of cytosine DNA methylation in somatic tissues of unexposed offspring upon maternal, paternal, or combined parental exposure. Results: We observed a significant loss of global cytosine DNA methylation in the thymus tissue of the offspring upon combined parental exposure. The loss of DNA methylation was paralleled by a significant decrease in the levels of maintenance (DNMT1) and de novo methyltransferases DNMT3a and 3b and methyl-CpG-binding protein MeCP2. Along with profound changes in DNA methylation, we noted a significant accumulation of DNA strand breaks in thymus, which is a radiation carcinogenesis target organ. Conclusions: The observed changes were indicative of a profound epigenetic dysregulation in the offspring, which in turn could lead to genome destabilization and possibly could serve as a precursor for transgenerational carcinogenesis. Future studies are clearly needed to address the cellular and carcinogenic repercussions of those changes.

  16. Metabolic and genomic analysis elucidates strain-level variation in Microbacterium spp. isolated from chromate contaminated sediment

    Data.gov (United States)

    U.S. Environmental Protection Agency — The data is in the form of genomic sequences deposited in a public database, growth curves, and bioinformatic analysis of sequences. This dataset is associated with...

  17. Combining high-throughput phenotyping and genome-wide association studies to reveal natural genetic variation in rice

    OpenAIRE

    Yang, Wanneng; Guo, Zilong; Huang, Chenglong; Duan, Lingfeng; Chen, Guoxing; Jiang, Ni; Fang, Wei; Feng, Hui; Xie, Weibo; Lian, Xingming; Wang, Gongwei; Luo, Qingming; Zhang, Qifa; Liu, Qian; Xiong, Lizhong

    2014-01-01

    Even as the study of plant genomics rapidly develops through the use of high-throughput sequencing techniques, traditional plant phenotyping lags far behind. Here we develop a high-throughput rice phenotyping facility (HRPF) to monitor 13 traditional agronomic traits and 2 newly defined traits during the rice growth period. Using genome-wide association studies (GWAS) of the 15 traits, we identify 141 associated loci, 25 of which contain known genes such as the Green Revolution semi-dwarf gen...

  18. Genomic Variation and Evolution of Vibrio parahaemolyticus ST36 over the Course of a Transcontinental Epidemic Expansion

    Directory of Open Access Journals (Sweden)

    Jaime Martinez-Urtaza

    2017-11-01

    Full Text Available Vibrio parahaemolyticus is the leading cause of seafood-related infections with illnesses undergoing a geographic expansion. In this process of expansion, the most fundamental change has been the transition from infections caused by local strains to the surge of pandemic clonal types. Pandemic clone sequence type 3 (ST3) was the only example of transcontinental spreading until 2012, when ST36 was detected outside the region where it is endemic in the U.S. Pacific Northwest causing infections along the U.S. northeast coast and Spain. Here, we used genome-wide analyses to reconstruct the evolutionary history of the V. parahaemolyticus ST36 clone over the course of its geographic expansion during the previous 25 years. The origin of this lineage was estimated to be in ~1985. By 1995, a new variant emerged in the region and quickly replaced the old clone, which has not been detected since 2000. The new Pacific Northwest (PNW) lineage was responsible for the first cases associated with this clone outside the Pacific Northwest region. After several introductions into the northeast coast, the new PNW clone differentiated into a highly dynamic group that continues to cause illness on the northeast coast of the United States. Surprisingly, the strains detected in Europe in 2012 diverged from this ancestral group around 2000 and have conserved genetic features present only in the old PNW lineage. Recombination was identified as the major driver of diversification, with some preliminary observations suggesting a trend toward a more specialized lifestyle, which may represent a critical element in the expansion of epidemics under scenarios of coastal warming.

  19. Evolutionary origin of Rosaceae-specific active non-autonomous hAT elements and their contribution to gene regulation and genomic structural variation.

    Science.gov (United States)

    Wang, Lu; Peng, Qian; Zhao, Jianbo; Ren, Fei; Zhou, Hui; Wang, Wei; Liao, Liao; Owiti, Albert; Jiang, Quan; Han, Yuepeng

    2016-05-01

    Transposable elements account for approximately 30% of the Prunus genome; however, their evolutionary origin and functionality remain largely unclear. In this study, we identified a hAT transposon family, termed Moshan, in Prunus. The Moshan elements consist of three types, aMoshan, tMoshan, and mMoshan. The aMoshan and tMoshan types contain intact or truncated transposase genes, respectively, while the mMoshan type is a miniature inverted-repeat transposable element (MITE). The Moshan transposons are unique to Rosaceae, and the copy numbers of different Moshan types are significantly correlated. Sequence homology analysis reveals that the mMoshan MITEs are direct deletion derivatives of the tMoshan progenitors, and one kind of mMoshan containing a MuDR-derived fragment was amplified predominantly in the peach genome. The mMoshan sequences contain cis-regulatory elements that can enhance gene expression up to 100-fold. The mMoshan MITEs can serve as potential sources of micro and long noncoding RNAs. Whole-genome re-sequencing analysis indicates that mMoshan elements are highly active, and an insertion into the S-haplotype-specific F-box gene was reported to cause the breakdown of self-incompatibility in sour cherry. Taken together, all these results suggest that the mMoshan elements play important roles in regulating gene expression and driving genomic structural variation in Prunus.

  20. Salix transect of Europe: variation in ploidy and genome size in willow-associated common nettle, Urtica dioica L. sens. lat., from Greece to arctic Norway.

    Science.gov (United States)

    Cronk, Quentin; Hidalgo, Oriane; Pellicer, Jaume; Percy, Diana; Leitch, Ilia J

    2016-01-01

    The common stinging nettle, Urtica dioica L. sensu lato, is an invertebrate "superhost", its clonal patches maintaining large populations of insects and molluscs. It is extremely widespread in Europe and highly variable, and two ploidy levels (diploid and tetraploid) are known. However, geographical patterns in cytotype variation require further study. We assembled a collection of nettles in conjunction with a transect of Europe from the Aegean to Arctic Norway (primarily conducted to examine the diversity of Salix and Salix-associated insects). Using flow cytometry to measure genome size, our sample of 29 plants reveals 5 diploids and 24 tetraploids. Two diploids were found in SE Europe (Bulgaria and Romania) and three diploids in S. Finland. More detailed cytotype surveys in these regions are suggested. The tetraploid genome size (2C value) varied between accessions from 2.36 to 2.59 pg. The diploids varied from 1.31 to 1.35 pg per 2C nucleus, equivalent to a haploid genome size of c. 650 Mbp. Within the tetraploids, we find that the most northerly samples (from N. Finland and arctic Norway) have a generally higher genome size. This is possibly indicative of a distinct population in this region.
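
    The "c. 650 Mbp" figure can be checked with a short Python sketch. It assumes the conversion 1 pg of DNA = 978 Mbp, a factor widely used in the genome-size literature; the abstract does not say which factor the authors applied. The function name is hypothetical.

```python
# Assumed conversion factor: 1 pg of DNA corresponds to about 978 Mbp.
MBP_PER_PG = 978.0


def one_c_mbp(two_c_pg: float) -> float:
    """Convert a 2C DNA amount (pg) to a 1C genome size (Mbp)."""
    # The 2C value is the full somatic nucleus, so 1C is half of it.
    return two_c_pg / 2.0 * MBP_PER_PG


if __name__ == "__main__":
    # Diploid 2C range reported in the abstract: 1.31 to 1.35 pg.
    for two_c in (1.31, 1.35):
        print(f"2C = {two_c} pg -> 1C = {one_c_mbp(two_c):.0f} Mbp")
```

    Under this assumption the reported diploid range brackets 650 Mbp, which agrees with the rounded value in the abstract.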

  1. Processes Underlying Rabies Virus Incursions across US-Canada Border as Revealed by Whole-Genome Phylogeography.

    Science.gov (United States)

    Trewby, Hannah; Nadin-Davis, Susan A; Real, Leslie A; Biek, Roman

    2017-09-01

    Disease control programs aim to constrain and reduce the spread of infection. Human disease interventions such as wildlife vaccination play a major role in determining the limits of a pathogen's spatial distribution. Over the past few decades, a raccoon-specific variant of rabies virus (RRV) has invaded large areas of eastern North America. Although expansion into Canada has been largely prevented through vaccination along the US border, several outbreaks have occurred in Canada. Applying phylogeographic approaches to 289 RRV whole-genome sequences derived from isolates collected in Canada and adjacent US states, we examined the processes underlying these outbreaks. RRV incursions were attributable predominantly to systematic virus leakage of local strains across areas along the border where vaccination has been conducted but also to single stochastic events such as long-distance translocations. These results demonstrate the utility of phylogeographic analysis of pathogen genomes for understanding transboundary outbreaks.

  2. The Spatiotemporal Variations of Runoff in the Yangtze River Basin under Climate Change

    OpenAIRE

    Xiao, Ziwei; Shi, Peng; Jiang, Peng; Hu, Jianwei; Qu, Simin; Chen, Xingyu; Chen, Yingbing; Dai, Yunqiu; Wang, Jianjin

    2018-01-01

    A better understanding of runoff variations contributes to better utilization of water resources and water conservancy planning. In this paper, we analyzed the runoff changes in the Yangtze River Basin (YRB) including the spatiotemporal characteristics of intra-annual variation, the trend, the mutation point, and the period of annual runoff using various statistical methods. We also investigated how changes in precipitation and temperature could affect runoff. We found that the i...

  3. Developmental plasticity in Tenebrio molitor (Coleoptera: Tenebrionidae): Analysis of Instar Variation in Number and Development Time under Different Diets

    Science.gov (United States)

    The variation in instar number and the pattern of sequential instar development time of Tenebrio molitor L. (Coleoptera: Tenebrionidae) were studied under 4 different diet regimes. Addition of dietary supplements consisting of dry potato or a mix of dry potato and dry egg whites significantly reduced...

  4. MDA and GSH-Px activity in transition dairy cows under seasonal variations and their relationship with reproductive performance

    Directory of Open Access Journals (Sweden)

    Colakoglu Hatice Esra

    2017-12-01

    Full Text Available Introduction: The purpose of the current study was to evaluate blood glutathione peroxidase (GSH-Px) and malondialdehyde (MDA) levels under seasonal variations in dairy cows during the transition period, and to assess the relationship between chosen reproductive parameters, GSH-Px, and MDA.

  5. Variation in relative growth rate and growth traits in wild and cultivated Capsicum accessions grown under different temperatures

    NARCIS (Netherlands)

    Swart, de E.A.M.; Marcelis, L.F.M.; Voorrips, R.E.

    2006-01-01

    Differences in environmental conditions are known to influence plant growth and growth-related traits. The aim of this study was to identify the variation in relative growth rate (RGR), and its underlying physiological and morphological traits, in a group of ten wild and cultivated Capsicum

  6. Novel approach to evaluate the dynamic variation of wind drift and evaporation losses under moving irrigation systems

    Science.gov (United States)

    Sayed-Hossein Sadeghi; Troy R. Peters; Mohammad Z. Amini; Sparkle L. Malone; Hank W. Loescher

    2015-01-01

    The increased need for water and food security requires the development of new approaches to save water through irrigation management strategies, particularly for center pivot irrigation. To do so entails monitoring of the dynamic variation in wind drift and evaporation losses (WDELs) of irrigation systems under different weather conditions and for relatively long time...

  7. Genetic Variation Underlying Traumatic Brain injury (TBI) and Late Onset Alzheimer’s Disease (LOAD)

    Science.gov (United States)

    2017-10-01

    Stable trajectory, Decliners were more likely women, older, less educated, from non-White ancestry population and APOE-ε4 carriers. The highest annual...with slightly higher rates for women compared to males (rates= 4.0 versus 3.8) and the highest rates achieved by subjects with a Caribbean-Hispanic... Single nucleotide polymorp