WorldWideScience

Sample records for density snp genotyping

  1. Development and characterization of a high density SNP genotyping assay for cattle.

    Directory of Open Access Journals (Sweden)

    Lakshmi K Matukumalli

    The success of genome-wide association (GWA) studies for the detection of sequence variation affecting complex traits in human has spurred interest in the use of large-scale high-density single nucleotide polymorphism (SNP) genotyping for the identification of quantitative trait loci (QTL) and for marker-assisted selection in model and agricultural species. A cost-effective and efficient approach for the development of a custom genotyping assay interrogating 54,001 SNP loci to support GWA applications in cattle is described. A novel algorithm for achieving a compressed inter-marker interval distribution proved remarkably successful, with median interval of 37 kb and maximum predicted gap of <350 kb. The assay was tested on a panel of 576 animals from 21 cattle breeds and six outgroup species and revealed that from 39,765 to 46,492 SNP are polymorphic within individual breeds (average minor allele frequency (MAF) ranging from 0.24 to 0.27). The assay also identified 79 putative copy number variants in cattle. Utility for GWA was demonstrated by localizing known variation for coat color and the presence/absence of horns to their correct genomic locations. The combination of SNP selection and the novel spacing algorithm allows an efficient approach for the development of high-density genotyping platforms in species having full or even moderate quality draft sequence. Aspects of the approach can be exploited in species which lack an available genome sequence. The BovineSNP50 assay described here is commercially available from Illumina and provides a robust platform for mapping disease genes and QTL in cattle.
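The spacing figures quoted in this abstract (median interval, maximum gap) are summary statistics of the distances between neighbouring markers. The sketch below shows one way such a summary could be computed for a single chromosome. It is not the authors' spacing algorithm; the function name `interval_summary` and the marker positions are hypothetical and chosen only for illustration.

```python
from statistics import median


def interval_summary(positions_bp):
    """Return (median interval, largest gap) in bp for one chromosome's markers."""
    ordered = sorted(positions_bp)
    if len(ordered) < 2:
        raise ValueError("need at least two markers")
    # Distance between each marker and its right-hand neighbour.
    intervals = [b - a for a, b in zip(ordered, ordered[1:])]
    return median(intervals), max(intervals)


# Hypothetical marker positions in base pairs (not taken from the assay).
example_positions = [10_000, 47_000, 85_000, 120_000, 400_000]
median_gap, max_gap = interval_summary(example_positions)
print(median_gap, max_gap)
```

A single large gap barely moves the median, which is why a report of this kind usually gives both numbers.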

  2. SNP Discovery and Development of a High-Density Genotyping Array for Sunflower

    Science.gov (United States)

    Bachlava, Eleni; Taylor, Christopher A.; Tang, Shunxue; Bowers, John E.; Mandel, Jennifer R.; Burke, John M.; Knapp, Steven J.

    2012-01-01

    Recent advances in next-generation DNA sequencing technologies have made possible the development of high-throughput SNP genotyping platforms that allow for the simultaneous interrogation of thousands of single-nucleotide polymorphisms (SNPs). Such resources have the potential to facilitate the rapid development of high-density genetic maps, and to enable genome-wide association studies as well as molecular breeding approaches in a variety of taxa. Herein, we describe the development of a SNP genotyping resource for use in sunflower (Helianthus annuus L.). This work involved the development of a reference transcriptome assembly for sunflower, the discovery of thousands of high quality SNPs based on the generation and analysis of ca. 6 Gb of transcriptome re-sequencing data derived from multiple genotypes, the selection of 10,640 SNPs for inclusion in the genotyping array, and the use of the resulting array to screen a diverse panel of sunflower accessions as well as related wild species. The results of this work revealed a high frequency of polymorphic SNPs and relatively high level of cross-species transferability. Indeed, greater than 95% of successful SNP assays revealed polymorphism, and more than 90% of these assays could be successfully transferred to related wild species. Analysis of the polymorphism data revealed patterns of genetic differentiation that were largely congruent with the evolutionary history of sunflower, though the large number of markers allowed for finer resolution than has previously been possible. PMID:22238659

  3. Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar)

    Science.gov (United States)

    2014-01-01

    Background Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. Results SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. Conclusions This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. 
This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in

  4. Evaluation of Bovine High-Density SNP Genotyping Array in Indigenous Dairy Cattle Breeds.

    Science.gov (United States)

    Dash, S; Singh, A; Bhatia, A K; Jayakumar, S; Sharma, A; Singh, S; Ganguly, I; Dixit, S P

    2018-04-03

    In total, 52 samples of Sahiwal (19), Tharparkar (17), and Gir (16) cattle were genotyped using the BovineHD SNP chip to analyze minor allele frequency (MAF), genetic diversity, and linkage disequilibrium among these cattle. The common SNPs of the BovineHD and 54K SNP chips were also extracted and evaluated for their performance. Only 40%-50% of the SNPs on these arrays were found informative for genetic analysis in these cattle breeds. The overall mean of MAF for SNPs of the BovineHD SNP chip was 0.248 ± 0.006, 0.241 ± 0.007, and 0.242 ± 0.009 in Sahiwal, Tharparkar and Gir, respectively, while that for the 54K SNPs was on the lower side. The average Reynolds' genetic distance between breeds ranged from 0.042 to 0.055 based on the BovineHD BeadChip, and from 0.052 to 0.084 based on the 54K SNP chip. The estimates of genetic diversity based on the HD and 54K chips were almost the same and, hence, the lower-density chip seems to be good enough to decipher genetic diversity of these cattle breeds. The linkage disequilibrium started decaying (r² < 0.2) at 140 kb inter-marker distance and, hence, a 20K low-density customized SNP array derived from the HD chip could be designed for genomic selection in these cattle; otherwise, the 54K BeadChip as such will be useful.
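The two quantities this abstract relies on, MAF and the LD measure r², can be illustrated with a small sketch. It assumes genotypes coded as 0/1/2 copies of one allele and uses the squared correlation of genotype dosages as an approximation to r² for unphased data. The abstract does not state how the study computed either value, and the function names and genotype vectors are hypothetical.

```python
def minor_allele_frequency(genotypes):
    """MAF for one SNP from genotypes coded as 0, 1 or 2 allele copies."""
    allele_freq = sum(genotypes) / (2 * len(genotypes))
    return min(allele_freq, 1 - allele_freq)


def genotype_r2(snp_x, snp_y):
    """Squared Pearson correlation between two SNPs' genotype dosages."""
    n = len(snp_x)
    mean_x = sum(snp_x) / n
    mean_y = sum(snp_y) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(snp_x, snp_y))
    var_x = sum((x - mean_x) ** 2 for x in snp_x)
    var_y = sum((y - mean_y) ** 2 for y in snp_y)
    if var_x == 0 or var_y == 0:
        return float("nan")  # a monomorphic SNP carries no LD information
    return cov * cov / (var_x * var_y)


# Hypothetical genotypes for six animals at three SNPs.
snp_a = [0, 1, 2, 1, 0, 2]
snp_b = [0, 1, 2, 1, 0, 2]  # identical to snp_a, so in complete LD with it
snp_c = [0, 0, 1, 0, 0, 1]

maf_c = minor_allele_frequency(snp_c)
r2_ab = genotype_r2(snp_a, snp_b)
print(maf_c, r2_ab)
```

An LD decay curve of the kind summarised above would come from computing r² for many SNP pairs and averaging within bins of inter-marker distance.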

  5. Improving accuracy of genomic prediction in Brangus cattle by adding animals with imputed low-density SNP genotypes.

    Science.gov (United States)

    Lopes, F B; Wu, X-L; Li, H; Xu, J; Perkins, T; Genho, J; Ferretti, R; Tait, R G; Bauck, S; Rosa, G J M

    2018-02-01

    Reliable genomic prediction of breeding values for quantitative traits requires the availability of a sufficient number of animals with genotypes and phenotypes in the training set. As of 31 October 2016, there were 3,797 Brangus animals with genotypes and phenotypes. These Brangus animals were genotyped using different commercial SNP chips. Of them, the largest group consisted of 1,535 animals genotyped by the GGP-LDV4 SNP chip. The remaining 2,262 genotypes were imputed to the SNP content of the GGP-LDV4 chip, so that the number of animals available for training the genomic prediction models was more than doubled. The present study showed that the pooling of animals with either original or imputed 40K SNP genotypes substantially increased genomic prediction accuracies on the ten traits. By supplementing imputed genotypes, the relative gains in genomic prediction accuracies on estimated breeding values (EBV) were from 12.60% to 31.27%, and the relative gain in genomic prediction accuracies on de-regressed EBV was slightly smaller (i.e. 0.87%-18.75%). The present study also compared the performance of five genomic prediction models and two cross-validation methods. The five genomic models predicted EBV and de-regressed EBV of the ten traits similarly well. Of the two cross-validation methods, leave-one-out cross-validation maximized the number of animals at the stage of training for genomic prediction. Genomic prediction accuracy (GPA) on the ten quantitative traits was validated in 1,106 newly genotyped Brangus animals based on the SNP effects estimated in the previous set of 3,797 Brangus animals, and they were slightly lower than GPA in the original data. The present study was the first to leverage currently available genotype and phenotype resources in order to harness genomic prediction in Brangus beef cattle. © 2018 Blackwell Verlag GmbH.
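Two pieces of arithmetic sit behind this abstract: a relative gain in accuracy, and the training-set size that a cross-validation scheme leaves available. The sketch below illustrates both. The accuracy values are hypothetical, and the 5-fold scheme is an assumed comparison point, since the abstract does not name the second cross-validation method. Only the animal count of 3,797 comes from the abstract.

```python
def relative_gain(baseline_accuracy, pooled_accuracy):
    """Relative gain of the pooled model over the baseline, as a fraction."""
    return (pooled_accuracy - baseline_accuracy) / baseline_accuracy


def smallest_training_set(n_animals, n_folds):
    """Training animals left when the largest fold is held out.

    Setting n_folds equal to n_animals gives leave-one-out cross-validation.
    """
    largest_fold = -(-n_animals // n_folds)  # ceiling division
    return n_animals - largest_fold


n_animals = 3797  # animals with genotypes and phenotypes (from the abstract)
loo_training = smallest_training_set(n_animals, n_animals)
five_fold_training = smallest_training_set(n_animals, 5)  # assumed scheme

# Hypothetical accuracies, chosen only to show the calculation.
gain = relative_gain(0.40, 0.50)
print(loo_training, five_fold_training, f"{gain:.1%}")
```

Leave-one-out keeps all but one animal in training on every round, which is the sense in which it maximizes the training set.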

  6. Forensic SNP genotyping with SNaPshot

    DEFF Research Database (Denmark)

    Fondevila, M; Børsting, C; Phillips, C

    2017-01-01

    This review explores the key factors that influence the optimization, routine use, and profile interpretation of the SNaPshot single-base extension (SBE) system applied to forensic single-nucleotide polymorphism (SNP) genotyping. Despite being a mainly complementary DNA genotyping technique to routine STR profiling, use of SNaPshot is an important part of the development of SNP sets for a wide range of forensic applications with these markers, from genotyping highly degraded DNA with very short amplicons to the introduction of SNPs to ascertain the ancestry and physical characteristics of an unidentified contact trace donor. However, this technology, as resourceful as it is, displays several features that depart from the usual STR genotyping far enough to demand a certain degree of expertise from the forensic analyst before tackling the complex casework on which SNaPshot application provides...

  7. SNP-RFLPing 2: an updated and integrated PCR-RFLP tool for SNP genotyping

    Directory of Open Access Journals (Sweden)

    Chang Hsueh-Wei

    2010-04-01

    Background PCR-restriction fragment length polymorphism (RFLP) assay is a cost-effective method for SNP genotyping and mutation detection, but the manual mining for restriction enzyme sites is challenging and cumbersome. Three years after we constructed SNP-RFLPing, a freely accessible database and analysis tool for restriction enzyme mining of SNPs, significant improvements over the 2006 version have been made and incorporated into the latest version, SNP-RFLPing 2. Results The primary aim of SNP-RFLPing 2 is to provide comprehensive PCR-RFLP information with multiple functionality about SNPs, such as SNP retrieval to multiple species, different polymorphism types (bi-allelic, tri-allelic, tetra-allelic or indels), gene-centric searching, HapMap tagSNPs, gene ontology-based searching, miRNAs, and SNP500Cancer. The RFLP restriction enzymes and the corresponding PCR primers for the natural and mutagenic types of each SNP are simultaneously analyzed. All the RFLP restriction enzyme prices are also provided to aid selection. Furthermore, the previously encountered updating problems for most SNP related databases are resolved by an on-line retrieval system. Conclusions The user interfaces for functional SNP analyses have been substantially improved and integrated. SNP-RFLPing 2 offers a new and user-friendly interface for RFLP genotyping that can be used in association studies and is freely available at http://bio.kuas.edu.tw/snp-rflping2.

  8. SNP-RFLPing 2: an updated and integrated PCR-RFLP tool for SNP genotyping.

    Science.gov (United States)

    Chang, Hsueh-Wei; Cheng, Yu-Huei; Chuang, Li-Yeh; Yang, Cheng-Hong

    2010-04-08

    PCR-restriction fragment length polymorphism (RFLP) assay is a cost-effective method for SNP genotyping and mutation detection, but the manual mining for restriction enzyme sites is challenging and cumbersome. Three years after we constructed SNP-RFLPing, a freely accessible database and analysis tool for restriction enzyme mining of SNPs, significant improvements over the 2006 version have been made and incorporated into the latest version, SNP-RFLPing 2. The primary aim of SNP-RFLPing 2 is to provide comprehensive PCR-RFLP information with multiple functionality about SNPs, such as SNP retrieval to multiple species, different polymorphism types (bi-allelic, tri-allelic, tetra-allelic or indels), gene-centric searching, HapMap tagSNPs, gene ontology-based searching, miRNAs, and SNP500Cancer. The RFLP restriction enzymes and the corresponding PCR primers for the natural and mutagenic types of each SNP are simultaneously analyzed. All the RFLP restriction enzyme prices are also provided to aid selection. Furthermore, the previously encountered updating problems for most SNP related databases are resolved by an on-line retrieval system. The user interfaces for functional SNP analyses have been substantially improved and integrated. SNP-RFLPing 2 offers a new and user-friendly interface for RFLP genotyping that can be used in association studies and is freely available at http://bio.kuas.edu.tw/snp-rflping2.
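The "restriction enzyme mining" described here rests on a simple idea: an enzyme can genotype a SNP when its recognition site is present with one allele and absent with the other. The sketch below illustrates that check for natural (non-mutagenic) sites only. It is not the SNP-RFLPing 2 implementation; the function names, the four-enzyme table and the flanking sequences are assumptions made for illustration. The recognition sequences themselves are the standard ones for these enzymes, and all four are palindromic, so scanning one strand is sufficient.

```python
# Recognition sequences of four common restriction enzymes (all palindromic).
ENZYME_SITES = {
    "AluI": "AGCT",
    "HaeIII": "GGCC",
    "MspI": "CCGG",
    "EcoRI": "GAATTC",
}


def count_sites(sequence, site):
    """Count occurrences of a recognition site, allowing overlaps."""
    width = len(site)
    return sum(
        1
        for start in range(len(sequence) - width + 1)
        if sequence[start:start + width] == site
    )


def enzymes_distinguishing_alleles(left_flank, alleles, right_flank):
    """Map enzyme name to per-allele site counts where the counts differ."""
    informative = {}
    for name, site in ENZYME_SITES.items():
        counts = tuple(
            count_sites(left_flank + allele + right_flank, site)
            for allele in alleles
        )
        if len(set(counts)) > 1:
            informative[name] = counts
    return informative


# Hypothetical G/A SNP with made-up flanking sequence.
result = enzymes_distinguishing_alleles("TTGACC", ("G", "A"), "GTTCAA")
print(result)  # → {'MspI': (1, 0)}
```

In this example the G allele completes a CCGG site, so MspI would cut the amplicon carrying G and leave the A amplicon intact.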

  9. Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples.

    Science.gov (United States)

    Henshall, John M; Dierens, Leanne; Sellars, Melony J

    2014-09-02

    While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the major livestock species. Although low-cost low-density assays may not have the accuracy of the high-density assays widely used in human and livestock species, we show that when combined with statistical analysis approaches that use quantitative instead of discrete genotypes, their utility may be improved. The data used in this study are from a 63-SNP marker Sequenom® iPLEX Platinum panel for the Black Tiger shrimp, for which high-density SNP assays are not currently available. For quantitative genotypes that could be estimated, in 5% of cases the most likely genotype for an individual at a SNP had a probability of less than 0.99. Matrix formulations of maximum likelihood equations for parentage assignment were developed for the quantitative genotypes and also for discrete genotypes perturbed by an assumed error term. Assignment rates that were based on maximum likelihood with quantitative genotypes were similar to those based on maximum likelihood with perturbed genotypes but, for more than 50% of cases, the two methods resulted in individuals being assigned to different families. Treating genotypes as quantitative values allows the same analysis framework to be used for pooled samples of DNA from multiple individuals. Resulting correlations between allele frequency estimates from pooled DNA and individual samples were consistently greater than 0.90, and as high as 0.97 for some pools. Estimates of family contributions to the pools based on quantitative genotypes in pooled DNA had a correlation of 0.85 with estimates of contributions from DNA-derived pedigree. Even with low numbers of SNPs of variable quality, parentage testing and family assignment from pooled samples are

  10. Design of a High Density SNP Genotyping Assay in the Pig Using SNPs Identified and Characterized by Next Generation Sequencing Technology

    Science.gov (United States)

    Ramos, Antonio M.; Crooijmans, Richard P. M. A.; Affara, Nabeel A.; Amaral, Andreia J.; Archibald, Alan L.; Beever, Jonathan E.; Bendixen, Christian; Churcher, Carol; Clark, Richard; Dehais, Patrick; Hansen, Mark S.; Hedegaard, Jakob; Hu, Zhi-Liang; Kerstens, Hindrik H.; Law, Andy S.; Megens, Hendrik-Jan; Milan, Denis; Nonneman, Danny J.; Rohrer, Gary A.; Rothschild, Max F.; Smith, Tim P. L.; Schnabel, Robert D.; Van Tassell, Curt P.; Taylor, Jeremy F.; Wiedmann, Ralph T.; Schook, Lawrence B.; Groenen, Martien A. M.

    2009-01-01

    Background The dissection of complex traits of economic importance to the pig industry requires the availability of a significant number of genetic markers, such as single nucleotide polymorphisms (SNPs). This study was conducted to discover several hundreds of thousands of porcine SNPs using next generation sequencing technologies and use these SNPs, as well as others from different public sources, to design a high-density SNP genotyping assay. Methodology/Principal Findings A total of 19 reduced representation libraries derived from four swine breeds (Duroc, Landrace, Large White, Pietrain) and a Wild Boar population and three restriction enzymes (AluI, HaeIII and MspI) were sequenced using Illumina's Genome Analyzer (GA). The SNP discovery effort resulted in the de novo identification of over 372K SNPs. More than 549K SNPs were used to design the Illumina Porcine 60K+SNP iSelect Beadchip, now commercially available as the PorcineSNP60. A total of 64,232 SNPs were included on the Beadchip. Results from genotyping the 158 individuals used for sequencing showed a high overall SNP call rate (97.5%). Of the 62,621 loci that could be reliably scored, 58,994 were polymorphic yielding a SNP conversion success rate of 94%. The average minor allele frequency (MAF) for all scorable SNPs was 0.274. Conclusions/Significance Overall, the results of this study indicate the utility of using next generation sequencing technologies to identify large numbers of reliable SNPs. In addition, the validation of the PorcineSNP60 Beadchip demonstrated that the assay is an excellent tool that will likely be used in a variety of future studies in pigs. PMID:19654876
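The 94% conversion success rate follows directly from the two locus counts given in this abstract. The short check below makes the arithmetic explicit. The variable names are illustrative, and both counts are taken from the abstract.

```python
scorable_loci = 62_621     # loci that could be reliably scored
polymorphic_loci = 58_994  # scorable loci that were polymorphic

# Conversion success rate: share of scorable loci that are polymorphic.
conversion_rate = polymorphic_loci / scorable_loci
print(f"{conversion_rate:.1%}")  # → 94.2%
```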

  11. Ultra-low-density genotype panels for breed assignment of Angus and Hereford cattle.

    Science.gov (United States)

    Judge, M M; Kelleher, M M; Kearney, J F; Sleator, R D; Berry, D P

    2017-06-01

    Angus and Hereford beef is marketed internationally for apparent superior meat quality attributes; DNA-based breed authenticity could be a useful instrument to ensure consumer confidence in premium meat products. The objective of this study was to develop an ultra-low-density genotype panel to accurately quantify the Angus and Hereford breed proportion in biological samples. Medium-density genotypes (13 306 single nucleotide polymorphisms (SNPs)) were available on 54 703 commercial and 4042 purebred animals. The breed proportion of the commercial animals was generated from the medium-density genotypes and this estimate was regarded as the gold-standard breed composition. Ten genotype panels (100 to 1000 SNPs) were developed from the medium-density genotypes; five methods were used to identify the most informative SNPs and these included the Delta statistic, the fixation (Fst) statistic and an index of both. Breed assignment analyses were undertaken for each breed, panel density and SNP selection method separately with a programme to infer population structure using the entire 13 306 SNP panel (representing the gold-standard measure). Breed assignment was undertaken for all commercial animals (n=54 703), animals deemed to contain some proportion of Angus based on pedigree (n=5740) and animals deemed to contain some proportion of Hereford based on pedigree (n=5187). The predicted breed proportion of all animals from the lower density panels was then compared with the gold-standard breed prediction. Panel density, SNP selection method and breed all had a significant effect on the correlation of predicted and actual breed proportion. Regardless of breed, the Index method of SNP selection numerically (but not significantly) outperformed all other selection methods in accuracy (i.e. correlation and root mean square of prediction) when panel density was ⩾300 SNPs. The correlation between actual and predicted breed proportion increased as panel density increased. Using

  12. Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications.

    Directory of Open Access Journals (Sweden)

    Xiao-Lin Wu

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus

  13. SNP high-throughput screening in grapevine using the SNPlex™ genotyping system

    Directory of Open Access Journals (Sweden)

    Velasco Riccardo

    2008-01-01

    Background Until recently, only a small number of low- and mid-throughput methods have been used for single nucleotide polymorphism (SNP) discovery and genotyping in grapevine (Vitis vinifera L.). However, following completion of the sequence of the highly heterozygous genome of Pinot Noir, it has been possible to identify millions of electronic SNPs (eSNPs), thus providing a valuable source for high-throughput genotyping methods. Results Herein we report the first application of the SNPlex™ genotyping system in grapevine aiming at the anchoring of a eukaryotic genome. This approach combines robust SNP detection with automated assay readout and data analysis. 813 candidate eSNPs were developed from non-repetitive contigs of the assembled genome of Pinot Noir and tested in 90 progeny of Syrah × Pinot Noir cross. 563 new SNP-based markers were obtained and mapped. The efficiency rate of 69% was enhanced to 80% when multiple displacement amplification (MDA) methods were used for preparation of genomic DNA for the SNPlex assay. Conclusion Unlike other SNP genotyping methods used to investigate thousands of SNPs in a few genotypes, or a few SNPs in around a thousand genotypes, the SNPlex genotyping system represents a good compromise to investigate several hundred SNPs in a hundred or more samples simultaneously. Therefore, the use of the SNPlex assay, coupled with whole genome amplification (WGA), is a good solution for future applications in well-equipped laboratories.

  14. HRM and SNaPshot as alternative forensic SNP genotyping methods.

    Science.gov (United States)

    Mehta, Bhavik; Daniel, Runa; McNevin, Dennis

    2017-09-01

    Single nucleotide polymorphisms (SNPs) have been widely used in forensics for prediction of identity, biogeographical ancestry (BGA) and externally visible characteristics (EVCs). Single base extension (SBE) assays, most notably SNaPshot® (Thermo Fisher Scientific), are commonly used for forensic SNP genotyping as they can be employed on standard instrumentation in forensic laboratories (e.g. capillary electrophoresis). High resolution melt (HRM) analysis is an alternative method and is a simple, fast, single tube assay for low throughput SNP typing. This study compares HRM and SNaPshot®. HRM produced reproducible and concordant genotypes at 500 pg, however, difficulties were encountered when genotyping SNPs with high GC content in flanking regions and differentiating variants of symmetrical SNPs. SNaPshot® was reproducible at 100 pg and is less dependent on SNP choice. HRM has a shorter processing time in comparison to SNaPshot®, avoids post PCR contamination risk and has potential as a screening tool for many forensic applications.

  15. Dynamic variable selection in SNP genotype autocalling from APEX microarray data

    Directory of Open Access Journals (Sweden)

    Zamar Ruben H

    2006-11-01

    Background Single nucleotide polymorphisms (SNPs) are DNA sequence variations, occurring when a single nucleotide – adenine (A), thymine (T), cytosine (C) or guanine (G) – is altered. Arguably, SNPs account for more than 90% of human genetic variation. Our laboratory has developed a highly redundant SNP genotyping assay consisting of multiple probes with signals from multiple channels for a single SNP, based on arrayed primer extension (APEX). This mini-sequencing method is a powerful combination of a highly parallel microarray with distinctive Sanger-based dideoxy terminator sequencing chemistry. Using this microarray platform, our current genotype calling system (known as SNP Chart) is capable of calling single SNP genotypes by manual inspection of the APEX data, which is time-consuming and exposed to user subjectivity bias. Results Using a set of 32 Coriell DNA samples plus three negative PCR controls as a training data set, we have developed a fully-automated genotyping algorithm based on simple linear discriminant analysis (LDA) using dynamic variable selection. The algorithm combines separate analyses based on the multiple probe sets to give a final posterior probability for each candidate genotype. We have tested our algorithm on a completely independent data set of 270 DNA samples, with validated genotypes, from patients admitted to the intensive care unit (ICU) of St. Paul's Hospital (plus one negative PCR control sample). Our method achieves a concordance rate of 98.9% with a 99.6% call rate for a set of 96 SNPs. By adjusting the threshold value for the final posterior probability of the called genotype, the call rate reduces to 94.9% with a higher concordance rate of 99.6%. We also reversed the two independent data sets in their training and testing roles, achieving a concordance rate up to 99.8%. Conclusion The strength of this APEX chemistry-based platform is its unique redundancy having multiple probes for a single SNP. Our

  16. High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species

    Science.gov (United States)

    2011-01-01

    Background High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and inter specific breeding applications in less domesticated and less funded plant genera. Results We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of Eucalyptus from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for E. grandis. A systematic assessment of in silico SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous in silico constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species. SNP reliability was high across nine Eucalyptus species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased. Conclusions This study indicates that the GGGT performs well both within and across species of Eucalyptus notwithstanding its nucleotide diversity ≥2%. 
The development of a much larger array of informative SNPs across

  17. Vitis phylogenomics: hybridization intensities from a SNP array outperform genotype calls.

    Directory of Open Access Journals (Sweden)

    Allison J Miller

    Understanding relationships among species is a fundamental goal of evolutionary biology. Single nucleotide polymorphisms (SNPs) identified through next generation sequencing and related technologies enable phylogeny reconstruction by providing unprecedented numbers of characters for analysis. One approach to SNP-based phylogeny reconstruction is to identify SNPs in a subset of individuals, and then to compile SNPs on an array that can be used to genotype additional samples at hundreds or thousands of sites simultaneously. Although powerful and efficient, this method is subject to ascertainment bias because applying variation discovered in a representative subset to a larger sample favors identification of SNPs with high minor allele frequencies and introduces bias against rare alleles. Here, we demonstrate that the use of hybridization intensity data, rather than genotype calls, reduces the effects of ascertainment bias. Whereas traditional SNP calls assess known variants based on diversity housed in the discovery panel, hybridization intensity data survey variation in the broader sample pool, regardless of whether those variants are present in the initial SNP discovery process. We apply SNP genotype and hybridization intensity data derived from the Vitis9kSNP array developed for grape to show the effects of ascertainment bias and to reconstruct evolutionary relationships among Vitis species. We demonstrate that phylogenies constructed using hybridization intensities suffer less from the distorting effects of ascertainment bias, and are thus more accurate than phylogenies based on genotype calls. Moreover, we reconstruct the phylogeny of the genus Vitis using hybridization data, show that North American subgenus Vitis species are monophyletic, and resolve several previously poorly known relationships among North American species. This study builds on earlier work that applied the Vitis9kSNP array to evolutionary questions within Vitis vinifera

  18. Development and Applications of a High Throughput Genotyping Tool for Polyploid Crops: Single Nucleotide Polymorphism (SNP) Array

    Directory of Open Access Journals (Sweden)

    Qian You

    2018-02-01

    Full Text Available Polyploid species play significant roles in agriculture and food production. Many crop species are polyploid, such as potato, wheat, strawberry, and sugarcane. Genotyping has been a daunting task for genetic studies of polyploid crops, which lag far behind the diploid crop species. The single nucleotide polymorphism (SNP) array is considered to be one of the high-throughput, relatively cost-efficient and automated genotyping approaches. However, there are significant challenges for SNP identification in complex, polyploid genomes, which have seriously slowed SNP discovery and array development in polyploid species. Ploidy is a significant factor impacting SNP qualities and validation rates of SNP markers in SNP arrays, which have been proven to be a very important tool for genetic studies and molecular breeding. In this review, we (1) discussed the pros and cons of SNP array in general for high throughput genotyping, (2) presented the challenges of and solutions to SNP calling in polyploid species, (3) summarized the SNP selection criteria and considerations of SNP array design for polyploid species, (4) illustrated SNP array applications in several different polyploid crop species, then (5) discussed challenges, available software, and their accuracy comparisons for genotype calling based on SNP array data in polyploids, and finally (6) provided a series of SNP array design and genotype calling recommendations. This review presents a complete overview of SNP array development and applications in polyploid crops, which will benefit the research in molecular breeding and genetics of crops with complex genomes.

  19. Imputation of microsatellite alleles from dense SNP genotypes for parental verification

    Directory of Open Access Journals (Sweden)

    Matthew McClure

    2012-08-01

    Full Text Available Microsatellite (MS) markers have recently been used for parental verification and are still the international standard despite higher cost, error rate, and turnaround time compared with Single Nucleotide Polymorphisms (SNP)-based assays. Despite domestic and international interest from producers and research communities, no viable means currently exist to verify parentage for an individual unless all familial connections were analyzed using the same DNA marker type (MS or SNP). A simple and cost-effective method was devised to impute MS alleles from SNP haplotypes within breeds. For some MS, imputation results may allow inference across breeds. A total of 347 dairy cattle representing 4 dairy breeds (Brown Swiss, Guernsey, Holstein, and Jersey) were used to generate reference haplotypes. This approach has been verified (>98% accurate) for imputing the International Society of Animal Genetics (ISAG) recommended panel of 12 MS for cattle parentage verification across a validation set of 1,307 dairy animals. Implementation of this method will allow producers and breed associations to transition to SNP-based parentage verification utilizing MS genotypes from historical data on parents where SNP genotypes are missing. This approach may be applicable to additional cattle breeds and other species that wish to migrate from MS- to SNP-based parental verification.

  20. High-density SNP genotyping of tomato (Solanum lycopersicum L.) reveals patterns of genetic variation due to breeding.

    Directory of Open Access Journals (Sweden)

    Sung-Chur Sim

    Full Text Available The effects of selection on genome variation were investigated and visualized in tomato using a high-density single nucleotide polymorphism (SNP) array. 7,720 SNPs were genotyped on a collection of 426 tomato accessions (410 inbreds and 16 hybrids) and over 97% of the markers were polymorphic in the entire collection. Principal component analysis (PCA) and pairwise estimates of F(st) supported that the inbred accessions represented seven sub-populations including processing, large-fruited fresh market, large-fruited vintage, cultivated cherry, landrace, wild cherry, and S. pimpinellifolium. Further divisions were found within both the contemporary processing and fresh market sub-populations. These sub-populations showed higher levels of genetic diversity relative to the vintage sub-population. The array provided a large number of polymorphic SNP markers across each sub-population, ranging from 3,159 in the vintage accessions to 6,234 in the cultivated cherry accessions. Visualization of minor allele frequency revealed regions of the genome that distinguished three representative sub-populations of cultivated tomato (processing, fresh market, and vintage), particularly on chromosomes 2, 4, 5, 6, and 11. The PCA loadings and F(st) outlier analysis between these three sub-populations identified a large number of candidate loci under positive selection on chromosomes 4, 5, and 11. The extent of linkage disequilibrium (LD) was examined within each chromosome for these sub-populations. LD decay varied between chromosomes and sub-populations, with large differences reflective of breeding history. For example, on chromosome 11, decay occurred over 0.8 cM for processing accessions and over 19.7 cM for fresh market accessions. The observed SNP variation and LD decay suggest that different patterns of genetic variation in cultivated tomato are due to introgression from wild species and selection for market specialization.
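
    As a rough illustration of two quantities reported in this abstract (minor allele frequency and the number of polymorphic markers per sub-population), the sketch below computes both from genotype calls. It is a minimal sketch, not the authors' pipeline: the 0/1/2 genotype coding, the use of None for missing calls, and the function names minor_allele_frequency and count_polymorphic are assumptions made for the example.

```python
def minor_allele_frequency(genotypes):
    """Return the minor allele frequency (MAF) for one biallelic SNP.

    genotypes: iterable of calls coded as the number of B alleles
    (0 = AA, 1 = AB, 2 = BB). None marks a missing call and is skipped.
    Returns None when no sample has a call.
    """
    called = [g for g in genotypes if g is not None]
    if not called:
        return None
    # Each diploid sample contributes two alleles.
    b_freq = sum(called) / (2 * len(called))
    return min(b_freq, 1 - b_freq)


def count_polymorphic(snp_matrix):
    """Count SNPs with MAF > 0 in one group of accessions.

    snp_matrix: list of per-SNP genotype lists (hypothetical layout).
    """
    count = 0
    for genotypes in snp_matrix:
        maf = minor_allele_frequency(genotypes)
        if maf is not None and maf > 0:
            count += 1
    return count
```

    Running count_polymorphic separately on the genotypes of each sub-population would give per-group counts of the kind quoted in the abstract, under the coding assumed here.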

  1. Large SNP arrays for genotyping in crop plants

    Indian Academy of Sciences (India)

    Genotyping with large numbers of molecular markers is now an indispensable tool within plant genetics and breeding. Especially through the identification of large numbers of single nucleotide polymorphism (SNP) markers using the novel high-throughput sequencing technologies, it is now possible to reliably identify many ...

  2. Reducing Bias of Allele Frequency Estimates by Modeling SNP Genotype Data with Informative Missingness

    Directory of Open Access Journals (Sweden)

    Wan-Yu Lin

    2012-06-01

    Full Text Available The presence of missing single-nucleotide polymorphism (SNP) genotypes is common in genetic data. For studies with low-density SNPs, the most commonly used approach to deal with genotype missingness is to simply remove the observations with missing genotypes from the analyses. This naïve method is straightforward but is appropriate only when the missingness is random. However, a given assay often has a different capability in genotyping heterozygotes and homozygotes, causing the phenomenon of ‘differential dropout’ in the sense that the missing rates of heterozygotes and homozygotes are different. In practice, differential dropout among genotypes exists in even carefully designed studies, such as the data from the HapMap project and the Wellcome Trust Case Control Consortium. In this study, we propose a statistical method to model the differential dropout among different genotypes. Compared with the naïve method, our method provides more accurate allele frequency estimates when the differential dropout is present. To demonstrate its practical use, we further apply our method to the HapMap data and a scleroderma data set.
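
    To make the bias described above concrete, the sketch below computes what the naïve method would be expected to report when heterozygotes and homozygotes go missing at different rates. This only illustrates the problem and is not the statistical model proposed in the paper; the function name, its parameters, and the example counts are hypothetical.

```python
def naive_allele_frequency(n_aa, n_ab, n_bb, miss_hom, miss_het):
    """Expected naive estimate of the B-allele frequency.

    n_aa, n_ab, n_bb: true genotype counts in the sample.
    miss_hom, miss_het: dropout (missing) rates for homozygous and
    heterozygous genotypes, each between 0 and 1.
    The naive method discards missing calls and counts alleles in the rest.
    """
    obs_aa = n_aa * (1 - miss_hom)
    obs_ab = n_ab * (1 - miss_het)
    obs_bb = n_bb * (1 - miss_hom)
    total = obs_aa + obs_ab + obs_bb
    return (obs_ab + 2 * obs_bb) / (2 * total)


# Hypothetical sample of 1,000 individuals with a true B-allele frequency of 0.2.
unbiased = naive_allele_frequency(640, 320, 40, miss_hom=0.05, miss_het=0.05)
biased = naive_allele_frequency(640, 320, 40, miss_hom=0.02, miss_het=0.10)
```

    With equal dropout rates the naïve estimate stays at 0.2; with heterozygotes dropping out more often (10% versus 2% in this made-up example) it falls to roughly 0.192, which is the kind of distortion a model of differential dropout is meant to correct.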

  3. Typing of 48 autosomal SNPs and amelogenin with GenPlex SNP genotyping system in forensic genetics

    DEFF Research Database (Denmark)

    Tomas Mas, Carmen; Stangegaard, Michael; Børsting, Claus

    2008-01-01

    , Somalia and Greenland were investigated with GenPlex using a Biomek 3000 (Beckman Coulter) robot. The results were compared to results obtained with an ISO 17025 accredited SNP typing assay based on single base extension (SBE). With the GenPlex SNP genotyping system, full SNP profiles were obtained in 97.6% of the investigations. Perfect concordance was obtained in duplicate investigations and the SNP genotypes obtained with the GenPlex system were concordant with those of the accredited SBE based SNP typing system except for one result in rs901398 in one of 286 individuals most likely due to a mutation 6 bp downstream...

  4. Genotyping-By-Sequencing for Plant Genetic Diversity Analysis: A Lab Guide for SNP Genotyping

    Directory of Open Access Journals (Sweden)

    Gregory W. Peterson

    2014-10-01

    Full Text Available Genotyping-by-sequencing (GBS) has recently emerged as a promising genomic approach for exploring plant genetic diversity on a genome-wide scale. However, many uncertainties and challenges remain in the application of GBS, particularly in non-model species. Here, we present a GBS protocol we developed and use for plant genetic diversity analysis. It uses two restriction enzymes to reduce genome complexity, applies Illumina multiplexing indexes for barcoding and has a custom bioinformatics pipeline for genotyping. This genetic diversity-focused GBS (gd-GBS) protocol can serve as an easy-to-follow lab guide to assist a researcher through every step of a GBS application with five main components: sample preparation, library assembly, sequencing, SNP calling and diversity analysis. Specifically, in this presentation, we provide a brief overview of the GBS approach, describe the gd-GBS procedures, illustrate it with an application to analyze genetic diversity in 20 flax (Linum usitatissimum L.) accessions and discuss related issues in GBS application. Following these lab bench procedures and using the custom bioinformatics pipeline, one could generate genome-wide SNP genotype data for a conventional genetic diversity analysis of a non-model plant species.

  5. High-throughput SNP genotyping in Cucurbita pepo for map construction and quantitative trait loci mapping.

    Science.gov (United States)

    Esteras, Cristina; Gómez, Pedro; Monforte, Antonio J; Blanca, José; Vicente-Dólera, Nelly; Roig, Cristina; Nuez, Fernando; Picó, Belén

    2012-02-22

    Cucurbita pepo is a member of the Cucurbitaceae family, the second-most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species. The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL). We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations. Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research, especially considering that most of these markers are located in

  6. Design of a bovine low-density SNP array optimized for imputation.

    Directory of Open Access Journals (Sweden)

    Didier Boichard

    Full Text Available The Illumina BovineLD BeadChip was designed to support imputation to higher density genotypes in dairy and beef breeds by including single-nucleotide polymorphisms (SNPs) that had a high minor allele frequency as well as uniform spacing across the genome except at the ends of the chromosome where densities were increased. The chip also includes SNPs on the Y chromosome and mitochondrial DNA loci that are useful for determining subspecies classification and certain paternal and maternal breed lineages. The total number of SNPs was 6,909. Accuracy of imputation to Illumina BovineSNP50 genotypes using the BovineLD chip was over 97% for most dairy and beef populations. The BovineLD imputations were about 3 percentage points more accurate than those from the Illumina GoldenGate Bovine3K BeadChip across multiple populations. The improvement was greatest when neither parent was genotyped. The minor allele frequencies were similar across taurine beef and dairy breeds as was the proportion of SNPs that were polymorphic. The new BovineLD chip should facilitate low-cost genomic selection in taurine beef and dairy cattle.

  7. SNP Data Quality Control in a National Beef and Dairy Cattle System and Highly Accurate SNP Based Parentage Verification and Identification

    Directory of Open Access Journals (Sweden)

    Matthew C. McClure

    2018-03-01

    Full Text Available A major use of genetic data is parentage verification and identification as inaccurate pedigrees negatively affect genetic gain. Since 2012 the international standard for single nucleotide polymorphism (SNP) verification in Bos taurus cattle has been the ISAG SNP panels. While these ISAG panels provide an increased level of parentage accuracy over microsatellite markers (MS), they can validate the wrong parent at ≤1% misconcordance rate levels, indicating that more SNP are needed if a more accurate pedigree is required. With rapidly increasing numbers of cattle being genotyped in Ireland that represent 61 B. taurus breeds from a wide range of farm types: beef/dairy, AI/pedigree/commercial, purebred/crossbred, and large to small herd size, the Irish Cattle Breeding Federation (ICBF) analyzed different SNP densities to determine that at a minimum ≥500 SNP are needed to consistently predict only one set of parents at a ≤1% misconcordance rate. For parentage validation and prediction ICBF uses 800 SNP (ICBF800) selected based on SNP clustering quality, ISAG200 inclusion, call rate (CR), and minor allele frequency (MAF) in the Irish cattle population. Large datasets require sample and SNP quality control (QC). Most publications only deal with SNP QC via CR, MAF, parent-progeny conflicts, and Hardy-Weinberg deviation, but not sample QC. We report here parentage, SNP QC, and genomic sample QC pipelines to deal with the unique challenges of >1 million genotypes from a national herd such as SNP genotype errors from mis-tagging of animals, lab errors, farm errors, and multiple other issues that can arise. We divide the pipeline into two parts: a Genotype QC and an Animal QC pipeline. The Genotype QC identifies samples with low call rate, missing or mixed genotype classes (no BB genotype or ABTG alleles present), and low genotype frequencies. The Animal QC handles situations where the genotype might not belong to the listed individual by identifying: >1 non

  8. [Restriction endonuclease digest - melting curve analysis: a new SNP genotyping and its application in traditional Chinese medicine authentication].

    Science.gov (United States)

    Jiang, Chao; Huang, Lu-Qi; Yuan, Yuan; Chen, Min; Hou, Jing-Yi; Wu, Zhi-Gang; Lin, Shu-Fang

    2014-04-01

    Single nucleotide polymorphisms (SNPs) are important molecular markers in traditional Chinese medicine (TCM) research and are widely used in TCM authentication. The present study created a new genotyping method by combining restriction endonuclease digestion with melting curve analysis, which is a stable, rapid and easy-to-perform SNP genotyping method. The new method was used to genotype two chloroplast SNPs located inside or outside the endonuclease recognition site. The results showed that, when a 14 bp GC-clamp (cggcgggagggcgg) was attached to the 5' end of the primer and a suitable endonuclease was selected to digest the amplification products, the melting curves of Lonicera japonica and Atractylodes macrocephala both showed double peaks, whereas those of the adulterants Shan-yin-hua and A. lancea showed single peaks. The results indicated that the method had good stability and reproducibility for distinguishing authentic medicines from their adulterants. It is a potential SNP genotyping method and was named restriction endonuclease digest - melting curve analysis.

  9. Construction and evaluation of a high-density SNP array for the Pacific oyster (Crassostrea gigas).

    Directory of Open Access Journals (Sweden)

    Haigang Qi

    Full Text Available Single nucleotide polymorphisms (SNPs) are widely used in genetics and genomics research. The Pacific oyster (Crassostrea gigas) is an economically and ecologically important marine bivalve, and it possesses one of the highest levels of genomic DNA variation among animal species. Pacific oyster SNPs have been extensively investigated; however, the mechanisms by which these SNPs may be used in a high-throughput, transferable, and economical manner remain to be elucidated. Here, we constructed an oyster 190K SNP array using Affymetrix Axiom genotyping technology. We designed 190,420 SNPs on the chip; these SNPs were selected from 54 million SNPs identified through re-sequencing of 472 Pacific oysters collected in China, Japan, Korea, and Canada. Our genotyping results indicated that 133,984 (70.4%) SNPs were polymorphic and successfully converted on the chip. The SNPs were distributed evenly throughout the oyster genome, located in 3,595 scaffolds with a length of ~509.4 million; the average interval spacing was 4,210 bp. In addition, 111,158 SNPs were distributed in 21,050 coding genes, with an average of 5.3 SNPs per gene. In comparison with genotypes obtained through re-sequencing, ~69% of the converted SNPs had a concordance rate of >0.971; the mean concordance rate was 0.966. Evaluation based on genotypes of full-sib family individuals revealed that the average genotyping accuracy rate was 0.975. Carrying 133 K polymorphic SNPs, our oyster 190K SNP array is the first commercially available high-density SNP chip for mollusks, with the highest throughput. It represents a valuable tool for oyster genome-wide association studies, fine linkage mapping, and population genetics.
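
    The concordance rate mentioned above compares array genotypes with genotypes from re-sequencing for the same individuals. The abstract does not say how the rate was computed, so the following is only one plausible reading: the fraction of individuals with matching calls at a SNP, ignoring individuals with a missing call on either platform. The function name and the string genotype coding are hypothetical.

```python
def concordance_rate(array_calls, reseq_calls):
    """Fraction of individuals whose two genotype calls agree at one SNP.

    array_calls, reseq_calls: parallel lists of genotype calls for the same
    individuals (for example "AA", "AB", "BB"). None marks a missing call.
    Pairs with a missing call on either side are excluded.
    Returns None when no pair can be compared.
    """
    pairs = [
        (a, r)
        for a, r in zip(array_calls, reseq_calls)
        if a is not None and r is not None
    ]
    if not pairs:
        return None
    matches = sum(1 for a, r in pairs if a == r)
    return matches / len(pairs)
```

    Averaging this per-SNP value over all converted SNPs would give a mean concordance of the kind reported, assuming the definition used here matches the study's.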

  10. SNP genotyping technologies

    DEFF Research Database (Denmark)

    Studer, Bruno; Kölliker, Roland

    2013-01-01

    In the recent years, single nucleotide polymorphism (SNP) markers have emerged as the marker technology of choice for plant genetics and breeding applications. Besides the efficient technologies available for SNP discovery even in complex genomes, one of the main reasons for this is the availabil...

  11. SNP calling using genotype model selection on high-throughput sequencing data

    KAUST Repository

    You, Na; Murillo, Gabriel; Su, Xiaoquan; Zeng, Xiaowei; Xu, Jian; Ning, Kang; Zhang, ShouDong; Zhu, Jian-Kang; Cui, Xinping

    2012-01-01

    calling SNPs. Thus, errors not involved in base-calling or alignment, such as those in genomic sample preparation, are not accounted for. Results: A novel method of consensus and SNP calling, Genotype Model Selection (GeMS), is given which accounts

  12. Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array.

    Science.gov (United States)

    Antanaviciute, Laima; Fernández-Fernández, Felicidad; Jansen, Johannes; Banchi, Elisa; Evans, Katherine M; Viola, Roberto; Velasco, Riccardo; Dunwell, Jim M; Troggio, Michela; Sargent, Daniel J

    2012-05-25

    A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the 'Golden Delicious' genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the 'Golden Delicious' pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been

  13. SNP calling using genotype model selection on high-throughput sequencing data

    KAUST Repository

    You, Na

    2012-01-16

    Motivation: A review of the available single nucleotide polymorphism (SNP) calling procedures for Illumina high-throughput sequencing (HTS) platform data reveals that most rely mainly on base-calling and mapping qualities as sources of error when calling SNPs. Thus, errors not involved in base-calling or alignment, such as those in genomic sample preparation, are not accounted for. Results: A novel method of consensus and SNP calling, Genotype Model Selection (GeMS), is given which accounts for the errors that occur during the preparation of the genomic sample. Simulations and real data analyses indicate that GeMS has the best performance balance of sensitivity and positive predictive value among the tested SNP callers. © The Author 2012. Published by Oxford University Press. All rights reserved.

  14. A SNP Genotyping Array for Hexaploid Oat

    Directory of Open Access Journals (Sweden)

    Nicholas A. Tinker

    2014-11-01

    Full Text Available Recognizing a need in cultivated hexaploid oat (Avena sativa L.) for a reliable set of reference single nucleotide polymorphisms (SNPs), we have developed a 6000 (6K) BeadChip design containing 257 Infinium I and 5486 Infinium II designs corresponding to 5743 SNPs. Of those, 4975 SNPs yielded successful assays after array manufacturing. These SNPs were discovered based on a variety of bioinformatics pipelines in complementary DNA (cDNA) and genomic DNA originating from 20 or more diverse oat cultivars. The array was validated in 1100 samples from six recombinant inbred line (RIL) mapping populations and sets of diverse oat cultivars and breeding lines, and provided approximately 3500 discernible Mendelian polymorphisms. Here, we present an annotation of these SNPs, including methods of discovery, gene identification and orthology, population-genetic characteristics, and tentative positions on an oat consensus map. We also evaluate a new cluster-based method of calling SNPs. The SNP design sequences are made publicly available, and the full SNP genotyping platform is available for commercial purchase from an independent third party.

  15. Improved technique that allows the performance of large-scale SNP genotyping on DNA immobilized by FTA technology.

    Science.gov (United States)

    He, Hongbin; Argiro, Laurent; Dessein, Helia; Chevillard, Christophe

    2007-01-01

    FTA technology is a novel method designed to simplify the collection, shipment, archiving and purification of nucleic acids from a wide variety of biological sources. However, the number of punches that can normally be obtained from a single specimen card is often insufficient for testing the large numbers of loci required to identify genetic factors that control human susceptibility or resistance to multifactorial diseases. In this study, we propose an improved technique to perform large-scale SNP genotyping. We applied a whole genome amplification method to amplify DNA from buccal cell samples stabilized using FTA technology. The results show that using the improved technique it is possible to perform up to 15,000 genotypes from one buccal cell sample. Furthermore, the procedure is simple. We consider this improved technique to be a promising method for performing large-scale SNP genotyping because the FTA technology simplifies the collection, shipment, archiving and purification of DNA, while whole genome amplification of FTA card bound DNA produces sufficient material for the determination of thousands of SNP genotypes.

  16. Transcriptomic SNP discovery for custom genotyping arrays: impacts of sequence data, SNP calling method and genotyping technology on the probability of validation success.

    Science.gov (United States)

    Humble, Emily; Thorne, Michael A S; Forcada, Jaume; Hoffman, Joseph I

    2016-08-26

    Single nucleotide polymorphism (SNP) discovery is an important goal of many studies. However, the number of 'putative' SNPs discovered from a sequence resource may not provide a reliable indication of the number that will successfully validate with a given genotyping technology. For this it may be necessary to account for factors such as the method used for SNP discovery and the type of sequence data from which it originates, suitability of the SNP flanking sequences for probe design, and genomic context. To explore the relative importance of these and other factors, we used Illumina sequencing to augment an existing Roche 454 transcriptome assembly for the Antarctic fur seal (Arctocephalus gazella). We then mapped the raw Illumina reads to the new hybrid transcriptome using BWA and BOWTIE2 before calling SNPs with GATK. The resulting markers were pooled with two existing sets of SNPs called from the original 454 assembly using NEWBLER and SWAP454. Finally, we explored the extent to which SNPs discovered using these four methods overlapped and predicted the corresponding validation outcomes for both Illumina Infinium iSelect HD and Affymetrix Axiom arrays. Collating markers across all discovery methods resulted in a global list of 34,718 SNPs. However, concordance between the methods was surprisingly poor, with only 51.0 % of SNPs being discovered by more than one method and 13.5 % being called from both the 454 and Illumina datasets. Using a predictive modeling approach, we could also show that SNPs called from the Illumina data were on average more likely to successfully validate, as were SNPs called by more than one method. Above and beyond this pattern, predicted validation outcomes were also consistently better for Affymetrix Axiom arrays. Our results suggest that focusing on SNPs called by more than one method could potentially improve validation outcomes. They also highlight possible differences between alternative genotyping technologies that could be

  17. UPD detection using homozygosity profiling with a SNP genotyping microarray.

    Science.gov (United States)

    Papenhausen, Peter; Schwartz, Stuart; Risheg, Hiba; Keitges, Elisabeth; Gadi, Inder; Burnside, Rachel D; Jaswaney, Vikram; Pappas, John; Pasion, Romela; Friedman, Kenneth; Tepperberg, James

    2011-04-01

    Single nucleotide polymorphism (SNP) based chromosome microarrays provide both a high-density whole genome analysis of copy number and genotype. In the past 21 months we have analyzed over 13,000 samples primarily referred for developmental delay using the Affymetrix SNP/CN 6.0 version array platform. In addition to copy number, we have focused on the relative distribution of allele homozygosity (HZ) throughout the genome to confirm a strong association of uniparental disomy (UPD) with regions of isoallelism found in most confirmed cases of UPD. We sought to determine whether a long contiguous stretch of HZ (LCSH) greater than a threshold value found only in a single chromosome would correlate with UPD of that chromosome. Nine confirmed UPD cases were retrospectively analyzed with the array in the study, each showing the anticipated LCSH with the smallest 13.5 Mb in length. This length is well above the average longest run of HZ in a set of control patients and was then set as the prospective threshold for reporting possible UPD correlation. Ninety-two cases qualified at that threshold, 46 of those had molecular UPD testing and 29 were positive. Including retrospective cases, 16 showed complete HZ across the chromosome, consistent with total isoUPD. The average size LCSH in the 19 cases that were not completely HZ was 46.3 Mb with a range of 13.5-127.8 Mb. Three patients showed only segmental UPD. Both the size and location of the LCSH are relevant to correlation with UPD. Further studies will continue to delineate an optimal threshold for LCSH/UPD correlation. Copyright © 2011 Wiley-Liss, Inc.
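
    The screening rule described above (a long contiguous stretch of homozygosity on a single chromosome, at or beyond a 13.5 Mb threshold) can be sketched as follows. This is a simplified illustration and not the laboratory's software: it assumes sorted SNP positions, treats any heterozygous call as ending a run (real analyses usually tolerate occasional miscalls), and uses "at or above" for the threshold comparison. All function and variable names are hypothetical.

```python
LCSH_THRESHOLD_BP = 13_500_000  # 13.5 Mb, the reporting threshold given in the abstract


def longest_homozygous_stretch(positions, is_homozygous):
    """Length in bp of the longest run of consecutive homozygous SNP calls.

    positions: SNP coordinates on one chromosome, sorted ascending (bp).
    is_homozygous: parallel list of booleans (True for AA or BB calls).
    """
    best = 0
    run_start = None
    for pos, hom in zip(positions, is_homozygous):
        if hom:
            if run_start is None:
                run_start = pos  # a new run begins at this SNP
            best = max(best, pos - run_start)
        else:
            run_start = None  # a heterozygous call ends the current run
    return best


def flag_possible_upd(lcsh_by_chromosome, threshold=LCSH_THRESHOLD_BP):
    """Return the chromosome name if exactly one chromosome reaches the threshold.

    lcsh_by_chromosome: dict mapping chromosome name to its longest
    homozygous stretch in bp. Returns None if zero or several qualify.
    """
    hits = [c for c, length in lcsh_by_chromosome.items() if length >= threshold]
    return hits[0] if len(hits) == 1 else None
```

    Requiring that only one chromosome qualifies mirrors the abstract's condition that the stretch be "found only in a single chromosome"; several long stretches across chromosomes would point to other explanations than UPD, and a flagged sample would still need molecular UPD testing for confirmation.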

  18. Combinations of SNP genotypes from the Wellcome Trust Case Control Study of bipolar patients

    DEFF Research Database (Denmark)

    Mellerup, Erling; Jørgensen, Martin Balslev; Dam, Henrik

    2018-01-01

    Objectives: Combinations of genetic variants are the basis for polygenic disorders. We examined combinations of SNP genotypes taken from the 446 729 SNPs in The Wellcome Trust Case Control Study of bipolar patients. Methods: Parallel computing by graphics processing units, cloud computing, and data...

  19. Publishing SNP genotypes of human embryonic stem cell lines: policy statement of the International Stem Cell Forum Ethics Working Party.

    Science.gov (United States)

    Knoppers, Bartha M; Isasi, Rosario; Benvenisty, Nissim; Kim, Ock-Joo; Lomax, Geoffrey; Morris, Clive; Murray, Thomas H; Lee, Eng Hin; Perry, Margery; Richardson, Genevra; Sipp, Douglas; Tanner, Klaus; Wahlström, Jan; de Wert, Guido; Zeng, Fanyi

    2011-09-01

    Novel methods and associated tools permitting individual identification in publicly accessible SNP databases have become a debatable issue. There is growing concern that current technical and ethical safeguards to protect the identities of donors could be insufficient. In the context of human embryonic stem cell research, there are no studies focusing on the probability that an hESC line donor could be identified by analyzing published SNP profiles and associated genotypic and phenotypic information. We present the International Stem Cell Forum (ISCF) Ethics Working Party's Policy Statement on "Publishing SNP Genotypes of Human Embryonic Stem Cell Lines (hESC)". The Statement prospectively addresses issues surrounding the publication of genotypic data and associated annotations of hESC lines in open access databases. It proposes a balanced approach between the goals of open science and data sharing with the respect for fundamental bioethical principles (autonomy, privacy, beneficence, justice and research merit and integrity).

  20. Use of Sequenom sample ID Plus® SNP genotyping in identification of FFPE tumor samples.

    Directory of Open Access Journals (Sweden)

    Jessica K Miller

    Full Text Available Short tandem repeat (STR) analysis, such as the AmpFlSTR® Identifiler® Plus kit, is a standard, PCR-based human genotyping method used in the field of forensics. Misidentification of cell line and tissue DNA can be costly if not detected early; therefore it is necessary to have quality control measures such as STR profiling in place. A major issue in large-scale research studies involving archival formalin-fixed paraffin embedded (FFPE) tissues is that varying levels of DNA degradation can result in failure to correctly identify samples using STR genotyping. PCR amplification of STRs of several hundred base pairs is not always possible when DNA is degraded. The Sample ID Plus® panel from Sequenom allows for human DNA identification and authentication using SNP genotyping. In comparison to lengthy STR amplicons, this multiplexing PCR assay requires amplification of only 76-139 base pairs, and utilizes 47 SNPs to discriminate between individual samples. In this study, we evaluated both STR and SNP genotyping methods of sample identification, with a focus on paired FFPE tumor/normal DNA samples intended for next-generation sequencing (NGS). The ability to successfully validate the identity of FFPE samples can enable cost savings by reducing rework.

  1. Biomek®-3000 and GenPlex SNP Genotyping in Forensic Genetics

    DEFF Research Database (Denmark)

    Stangegaard, Michael; Tomas, Carmen; Hansen, Anders J.

    2008-01-01

    Single nucleotide polymorphism genotyping provides a supplement for conventional short tandem repeats-based kits currently used for human identification. GenPlex (Applied Biosystems (AB), Foster City, CA) is an SNP-genotyping kit based on a multiplex of 48 informative, autosomal SNPs from...... the SNPforID Consortium. Our objective was to set up, implement, and validate a small and affordable automated liquid-handling robot for forensic casework samples (buccal swabs on FTA-paper and Qiagen purified blood). The reaction scheme consisted of numerous steps and was cumbersome to perform consistently...... manually. Automation was accomplished with a Biomek-3000 (Beckman Coulter) laboratory-automated workstation using five in-house-developed methods. All methods allowed the user to select the number of subsequent injections to the capillary electrophoresis instrument (ABI 3130xl, AB) enabling processing...

  2. Development of high-throughput SNP-based genotyping in Acacia auriculiformis x A. mangium hybrids using short-read transcriptome data

    Directory of Open Access Journals (Sweden)

    Wong Melissa ML

    2012-12-01

    Full Text Available Background: Next Generation Sequencing has provided comprehensive, affordable and high-throughput DNA sequences for Single Nucleotide Polymorphism (SNP) discovery in Acacia auriculiformis and Acacia mangium. Like other non-model species, SNP detection and genotyping in Acacia are challenging due to lack of genome sequences. The main objective of this study is to develop the first high-throughput SNP genotyping assay for linkage map construction of A. auriculiformis x A. mangium hybrids. Results: We identified a total of 37,786 putative SNPs by aligning short read transcriptome data from four parents of two Acacia hybrid mapping populations using Bowtie against 7,839 de novo transcriptome contigs. Given a set of 10 validated SNPs from two lignin genes, our in silico SNP detection approach is highly accurate (100%) compared to the traditional in vitro approach (44%). Further validation of 96 SNPs using Illumina GoldenGate Assay gave an overall assay success rate of 89.6% and conversion rate of 37.5%. We explored possible factors lowering assay success rate by predicting exon-intron boundaries and paralogous genes of Acacia contigs using Medicago truncatula genome as reference. This assessment revealed that presence of exon-intron boundary is the main cause (50%) of assay failure. Subsequent SNP filtering and improved assay design resulted in assay success and conversion rates of 92.4% and 57.4%, respectively, based on genotyping of 768 SNPs. Analysis of clustering patterns revealed that 27.6% of the assays were not reproducible and flanking sequence might play a role in determining cluster compression. In addition, we identified a total of 258 and 319 polymorphic SNPs in A. auriculiformis and A. mangium natural germplasms, respectively. Conclusion: We have successfully discovered a large number of SNP markers in A. auriculiformis x A. mangium hybrids using next generation transcriptome sequencing. By using a reference genome from the most closely

  3. Increasing Genome Sampling and Improving SNP Genotyping for Genotyping-by-Sequencing with New Combinations of Restriction Enzymes.

    Science.gov (United States)

    Fu, Yong-Bi; Peterson, Gregory W; Dong, Yibo

    2016-04-07

    Genotyping-by-sequencing (GBS) has emerged as a useful genomic approach for exploring genome-wide genetic variation. However, GBS commonly samples a genome unevenly and can generate a substantial amount of missing data. These technical features would limit the power of various GBS-based genetic and genomic analyses. Here we present software called IgCoverage for in silico evaluation of genomic coverage through GBS with an individual or pair of restriction enzymes on one sequenced genome, and report a new set of 21 restriction enzyme combinations that can be applied to enhance GBS applications. These enzyme combinations were developed through an application of IgCoverage on 22 plant, animal, and fungus species with sequenced genomes, and some of them were empirically evaluated with different runs of Illumina MiSeq sequencing in 12 plant species. The in silico analysis of 22 organisms revealed up to eight times more genome coverage for the new combinations, which pair four- or five-cutter restriction enzymes, than for the commonly used enzyme combination PstI + MspI. The empirical evaluation of the new enzyme combination (HinfI + HpyCH4IV) in 12 plant species showed 1.7-6 times more genome coverage than PstI + MspI, and 2.3 times more genome coverage in dicots than monocots. Also, the SNP genotyping in 12 Arabidopsis and 12 rice plants revealed that HinfI + HpyCH4IV generated 7 and 1.3 times more SNPs (with 0-16.7% missing observations) than PstI + MspI, respectively. These findings demonstrate that these novel enzyme combinations can be utilized to increase genome sampling and improve SNP genotyping in various GBS applications. Copyright © 2016 Fu et al.
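
    The idea of evaluating genome coverage in silico for a pair of restriction enzymes can be illustrated with a minimal Python sketch. This is not the IgCoverage software: the function names, the fragment size window and the rule of keeping only fragments flanked by one site of each enzyme are assumptions made for illustration. Only the recognition sequences of the four enzymes named in the abstract are taken as given, and the exact cut position inside each site is ignored.

```python
import re

# Recognition sequences of the enzymes named in the abstract.
# N in the HinfI site (GANTC) is written as a character class.
SITES = {
    "PstI": "CTGCAG",
    "MspI": "CCGG",
    "HinfI": "GA[ACGT]TC",
    "HpyCH4IV": "ACGT",
}


def site_positions(seq, enzyme):
    """Start positions of every recognition site (overlaps included)."""
    pattern = re.compile("(?=(%s))" % SITES[enzyme])
    return [m.start() for m in pattern.finditer(seq.upper())]


def double_digest_fragments(seq, enzyme_a, enzyme_b, min_len=100, max_len=500):
    """Fragments between adjacent sites of different enzymes within a size window.

    The size window and the 'one site of each enzyme' rule are
    illustrative assumptions, not parameters taken from the paper.
    """
    cuts = sorted(
        [(p, enzyme_a) for p in site_positions(seq, enzyme_a)]
        + [(p, enzyme_b) for p in site_positions(seq, enzyme_b)]
    )
    fragments = []
    for (start, e1), (end, e2) in zip(cuts, cuts[1:]):
        if e1 != e2 and min_len <= end - start <= max_len:
            fragments.append((start, end))
    return fragments


def coverage_fraction(seq, fragments):
    """Fraction of the sequence that falls inside the retained fragments."""
    return sum(end - start for start, end in fragments) / len(seq)


# Toy sequence (hypothetical): one PstI site followed by two MspI sites.
toy = "CTGCAG" + "A" * 10 + "CCGG" + "A" * 10 + "CCGG"
toy_fragments = double_digest_fragments(toy, "PstI", "MspI", min_len=10, max_len=100)
print(toy_fragments, coverage_fraction(toy, toy_fragments))
```

    Running the same functions with two enzyme pairs on one genome sequence and comparing the resulting fractions gives a rough analogue of the comparison reported in the abstract.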

  4. Construction of an SNP-based high-density linkage map for flax (Linum usitatissimum L.) using specific length amplified fragment sequencing (SLAF-seq) technology.

    Science.gov (United States)

    Yi, Liuxi; Gao, Fengyun; Siqin, Bateer; Zhou, Yu; Li, Qiang; Zhao, Xiaoqing; Jia, Xiaoyun; Zhang, Hui

    2017-01-01

    Flax is an important crop for oil and fiber; however, no high-density genetic maps have been reported for this species. Specific length amplified fragment sequencing (SLAF-seq) is a high-resolution strategy for large scale de novo discovery and genotyping of single nucleotide polymorphisms. In this study, SLAF-seq was employed to develop SNP markers in an F2 population to construct a high-density genetic map for flax. In total, 196.29 million paired-end reads were obtained. The average sequencing depth was 25.08 in the male parent, 32.17 in the female parent, and 9.64 in each F2 progeny. In total, 389,288 polymorphic SLAFs were detected, from which 260,380 polymorphic SNPs were developed. After filtering, 4,638 SNPs were found suitable for genetic map construction. The final genetic map included 4,145 SNP markers on 15 linkage groups and was 2,632.94 cM in length, with an average distance of 0.64 cM between adjacent markers. To our knowledge, this map is the densest SNP-based genetic map for flax. The SNP markers and genetic map reported here will serve as a foundation for the fine mapping of quantitative trait loci (QTLs), map-based gene cloning and marker assisted selection (MAS) for flax.
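
    The reported average marker spacing can be checked from the figures in the abstract. The short Python sketch below assumes that spacing is the map length divided by the number of intervals between adjacent markers (markers minus linkage groups); the abstract does not state how the average was computed, and dividing by the marker count instead gives the same value at two decimals.

```python
# Figures taken from the abstract.
map_length_cm = 2632.94
n_markers = 4145
n_linkage_groups = 15

# Assumption: each linkage group with k markers has k - 1 intervals.
n_intervals = n_markers - n_linkage_groups

avg_per_interval = map_length_cm / n_intervals
avg_per_marker = map_length_cm / n_markers

print(round(avg_per_interval, 2), round(avg_per_marker, 2))
```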

  5. A procedure for the detection of linkage with high density SNP arrays in a large pedigree with colorectal cancer

    International Nuclear Information System (INIS)

    Middeldorp, Anneke; Wijnen, Juul T; Wezel, Tom van; Jagmohan-Changur, Shantie; Helmer, Quinta; Klift, Heleen M van der; Tops, Carli MJ; Vasen, Hans FA; Devilee, Peter; Morreau, Hans; Houwing-Duistermaat, Jeanine J

    2007-01-01

    The apparent dominant model of colorectal cancer (CRC) inheritance in several large families, without mutations in known CRC susceptibility genes, suggests the presence of so far unidentified genes with strong or moderate effect on the development of CRC. Linkage analysis could lead to identification of susceptibility genes in such families. In comparison to classical linkage analysis with multi-allelic markers, single nucleotide polymorphism (SNP) arrays have increased information content and can be processed with higher throughput. Therefore, SNP arrays can be excellent tools for linkage analysis. However, the vast number of SNPs on the SNP arrays, combined with large informative pedigrees (e.g. >35–40 bits), presents us with a computational complexity that is challenging for existing statistical packages or even exceeds their capacity. We therefore set up a procedure for linkage analysis in large pedigrees and validated the method by genotyping using SNP arrays of a colorectal cancer family with a known MLH1 germ line mutation. Quality control of the genotype data was performed in Alohomora, Mega2 and SimWalk2, with removal of uninformative SNPs, Mendelian inconsistencies and Mendelian consistent errors, respectively. Linkage disequilibrium was measured by SNPLINK and Merlin. Parametric linkage analysis using two flanking markers was performed using MENDEL. For multipoint parametric linkage analysis and haplotype analysis, SimWalk2 was used. On chromosome 3, in the MLH1-region, a LOD score of 1.9 was found by parametric linkage analysis using two flanking markers. On chromosome 11 a small region with LOD 1.1 was also detected. Upon linkage disequilibrium removal, multipoint linkage analysis yielded a LOD score of 2.1 in the MLH1 region, whereas the LOD score dropped to negative values in the region on chromosome 11. Subsequent haplotype analysis in the MLH1 region perfectly matched the mutation status of the family members. We developed a workflow for linkage
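
    For readers unfamiliar with LOD scores, the quantity reported above is the base-10 logarithm of the likelihood ratio between linkage at recombination fraction theta and no linkage (theta = 0.5). The Python sketch below covers only the simplest textbook case, phase-known and fully informative meioses at a single marker; it is not the multipoint pedigree likelihood computed by MENDEL or SimWalk2, and the function name and example counts are hypothetical.

```python
import math


def two_point_lod(recombinants, meioses, theta):
    """LOD score for phase-known, fully informative meioses.

    LOD(theta) = log10( theta^r * (1 - theta)^(n - r) / 0.5^n )
    """
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must be in (0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must be between 0 and meioses")
    non_recombinants = meioses - recombinants
    log_l_theta = (recombinants * math.log10(theta)
                   + non_recombinants * math.log10(1.0 - theta))
    log_l_null = meioses * math.log10(0.5)
    return log_l_theta - log_l_null


# Hypothetical example: 1 recombinant among 10 informative meioses.
print(round(two_point_lod(1, 10, 0.1), 2))
```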

  6. Multi-generational imputation of single nucleotide polymorphism marker genotypes and accuracy of genomic selection.

    Science.gov (United States)

    Toghiani, S; Aggrey, S E; Rekaya, R

    2016-07-01

    Availability of high-density single nucleotide polymorphism (SNP) genotyping platforms provided unprecedented opportunities to enhance breeding programmes in livestock, poultry and plant species, and to better understand the genetic basis of complex traits. Using this genomic information, genomic breeding values (GEBVs), which are more accurate than conventional breeding values, can be estimated. The superiority of genomic selection is possible only when high-density SNP panels are used to track genes and QTLs affecting the trait. Unfortunately, even with the continuous decrease in genotyping costs, only a small fraction of the population has been genotyped with these high-density panels. It is often the case that a larger portion of the population is genotyped with low-density and low-cost SNP panels and then imputed to a higher density. Accuracy of SNP genotype imputation tends to be high when minimum requirements are met. Nevertheless, a certain rate of genotype imputation errors is unavoidable. Thus, it is reasonable to assume that the accuracy of GEBVs will be affected by imputation errors, especially their cumulative effects over time. To evaluate the impact of multi-generational selection on the accuracy of SNP genotype imputation and the reliability of resulting GEBVs, a simulation was carried out under varying updating of the reference population, distance between the reference and testing sets, and the approach used for the estimation of GEBVs. Using fixed reference populations, imputation accuracy decayed by about 0.5% per generation. In fact, after 25 generations, the accuracy was only 7% lower than the first generation. When the reference population was updated by either 1% or 5% of the top animals in the previous generations, decay of imputation accuracy was substantially reduced. These results indicate that low-density panels are useful, especially when the generational interval between reference and testing population is small. As the generational interval

  7. Complex nature of SNP genotype effects on gene expression in primary human leucocytes

    Directory of Open Access Journals (Sweden)

    Dinesen Lotte C

    2009-01-01

    Full Text Available Background: Genome wide association studies have been hugely successful in identifying disease risk variants, yet most variants do not lead to coding changes and how variants influence biological function is usually unknown. Methods: We correlated gene expression and genetic variation in untouched primary leucocytes (n = 110) from individuals with celiac disease, a common condition with multiple risk variants identified. We compared our observations with an EBV-transformed HapMap B cell line dataset (n = 90), and performed a meta-analysis to increase power to detect non-tissue specific effects. Results: In celiac peripheral blood, 2,315 SNP variants influenced gene expression at 765 different transcripts (cis expression quantitative trait loci, eQTLs). 135 of the detected SNP-probe effects (reflecting 51 unique probes) were also detected in a HapMap B cell line published dataset, all with effects in the same allelic direction. Overall gene expression differences within the two datasets predominantly explain the limited overlap in observed cis-eQTLs. Celiac associated risk variants from two regions, containing genes IL18RAP and CCR3, showed significant cis genotype-expression correlations in the peripheral blood but not in the B cell line datasets. We identified 14 genes where a SNP affected the expression of different probes within the same gene, but in opposite allelic directions. By incorporating genetic variation in co-expression analyses, functional relationships between genes can be more significantly detected. Conclusion: The complex nature of genotypic effects in human populations makes the use of a relevant tissue, large datasets, and analysis of different exons essential to enable the identification of the function for many genetic risk variants in common diseases.

  8. [SNP-19 genotypic variants of CAPN10 gene and its relation to diabetes mellitus type 2 in a population of Ciudad Juarez, Mexico].

    Science.gov (United States)

    Loya Méndez, Yolanda; Reyes Leal, Gilberto; Sánchez González, Adriana; Portillo Reyes, Verónica; Reyes Ruvalcaba, David; Bojórquez Rangel, Guillermo

    2014-09-28

    Diabetes mellitus (DM) type 2 is a common pathology with multifactorial etiology, whose exact genetic bases remain unknown. Some studies suggest that single nucleotide polymorphisms (SNPs) in the CAPN10 gene (locus 2q37.3) could be associated with the development of this disease, including the insertion/deletion polymorphism SNP-19 (2R→3R). The present study determined the association between SNP-19 and the risk of developing DM type 2 in the population of Ciudad Juarez. For this study 107 participants were selected: 43 type 2 diabetics (cases) and 64 non-diabetics with no family history of DM type 2 in first-degree relatives (controls). Anthropometric measurements were performed, as well as biochemical profiles of lipids, lipoproteins and serum glucose. Genotyping of SNP-19 was performed using DNA from peripheral blood lymphocytes, polymerase chain reaction (PCR), and electrophoretic analysis in agarose gels. Once the genotypic and allelic frequencies were obtained, the Hardy-Weinberg equilibrium test (GenAlEx 6.4) was also performed. The chi-square (χ²) analysis identified genotypic differences between cases and controls, with a higher frequency of the homozygous 3R genotype of SNP-19 in the case group (0.418) than in the control group (0.265). An association was also observed between genotype 2R/3R and elevated weight, body mass index, and waist and hip circumferences, but only in the diabetic group (P=< 0.05). The findings in this study suggest that SNP-19 in CAPN10 may participate in the development of DM type 2 in the studied population. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
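
    The Hardy-Weinberg equilibrium test mentioned above was run in GenAlEx 6.4 by the authors. As a rough illustration of what such a test computes for a biallelic marker like SNP-19 (alleles 2R and 3R), the Python sketch below derives allele frequencies from genotype counts and compares observed with expected counts using a chi-square statistic with one degree of freedom. The genotype counts are hypothetical and are not data from the study.

```python
def hwe_chi_square(n_aa, n_ab, n_bb):
    """Allele frequency of A and chi-square statistic (1 d.f.) for HWE.

    n_aa, n_ab, n_bb are observed counts of the three genotypes.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p
    observed = (n_aa, n_ab, n_bb)
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    return p, chi2


# Hypothetical counts for genotypes 2R/2R, 2R/3R, 3R/3R.
freq_2r, chi2 = hwe_chi_square(30, 40, 30)

# 3.841 is the chi-square critical value for 1 d.f. at alpha = 0.05.
print(freq_2r, chi2, chi2 > 3.841)
```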

  9. Association between SNP and haplotypes in PPARGC1 and adiponectin genes and bone mineral density in Chinese nuclear families

    Institute of Scientific and Technical Information of China (English)

    Zhen-lin ZHANG; Jin-wei HE; Yue-juan QIN; Yun-qiu HU; Miao LI; Yu-juan LIU; Hao ZHANG; Wei-wei HU

    2007-01-01

    Aim: To assess the contribution of single nucleotide polymorphisms (SNP) and haplotypes in the peroxisome proliferator-activated receptor-γ co-activator-1 (PPARGC1) and adiponectin genes to normal bone mineral density (BMD) variation in healthy Chinese women and men. Methods: We performed population-based (ANOVA) and family-based (quantitative trait locus transmission disequilibrium test) association studies of PPARGC1 and adiponectin genes. SNP in the 2 genes were genotyped. BMD was measured using dual-energy X-ray absorptiometry in the lumbar spine and hip in 401 nuclear families with a total of 1260 subjects, including 458 premenopausal women, 20-40 years of age; 401 postmenopausal women (mothers), 43-74 years of age; and 401 men (fathers), 49-76 years of age. Results: Significant within-family association was found between the Thr394Thr polymorphism in the PPARGC1 gene and peak BMD in the femoral neck (P=0.026). Subsequent permutations were in agreement with this significant within-family association result (P=0.016), but Thr394Thr SNP only accounted for 0.7% of the variation in femoral neck peak BMD. However, no significant within-family association was detected between each SNP in the adiponectin gene and peak BMD. Although no significant association was found between BMD and SNP in the PPARGC1 and adiponectin genes in both men and postmenopausal women, haplotype 2 (T-T) in the adiponectin gene was associated with lumbar spine BMD in postmenopausal women (P=0.019). Conclusion: Our findings suggest that Thr394Thr SNP in the PPARGC1 gene was associated with peak BMD in the femoral neck in Chinese women. Confirmation of our results is needed in other populations and with more functional markers within and flanking the PPARGC1 or adiponectin genes region.

  10. V-MitoSNP: visualization of human mitochondrial SNPs

    Directory of Open Access Journals (Sweden)

    Tsui Ke-Hung

    2006-08-01

    Full Text Available Background: Mitochondrial single nucleotide polymorphisms (mtSNPs) constitute important data when trying to shed some light on human diseases and cancers. Unfortunately, providing relevant mtSNP genotyping information in mtDNA databases in a neatly organized and transparent visual manner still remains a challenge. Amongst the many methods reported for SNP genotyping, determining the restriction fragment length polymorphisms (RFLPs) is still one of the most convenient and cost-saving methods. In this study, we prepared the visualization of the mtDNA genome in a way which integrates the RFLP genotyping information with mitochondria related cancers and diseases in a user-friendly, intuitive and interactive manner. The inherent problem associated with mtDNA sequences in BLAST of the NCBI database was also solved. Description: V-MitoSNP provides complete mtSNP information for four different kinds of inputs: (1) color-coded visual input by selecting genes of interest on the genome graph, (2) keyword search by locus, disease and mtSNP rs# ID, (3) visualized input of nucleotide range by clicking the selected region of the mtDNA sequence, and (4) sequences mtBLAST. The V-MitoSNP output provides 500 bp (base pairs) flanking sequences for each SNP coupled with the RFLP enzyme and the corresponding natural or mismatched primer sets. The output format enables users to see the SNP genotype pattern of the RFLP by virtual electrophoresis of each mtSNP. The rate of successful design of enzymes and primers for RFLPs in all mtSNPs was 99.1%. The RFLP information was validated by actual agarose electrophoresis and showed successful results for all mtSNPs tested. The mtBLAST function in V-MitoSNP provides the gene information within the input sequence rather than providing the complete mitochondrial chromosome as in the NCBI BLAST database. All mtSNPs with rs number entries in NCBI are integrated in the corresponding SNP in V-MitoSNP. Conclusion: V-MitoSNP is a web

  11. Development of maizeSNP3072, a high-throughput compatible SNP array, for DNA fingerprinting identification of Chinese maize varieties.

    Science.gov (United States)

    Tian, Hong-Li; Wang, Feng-Ge; Zhao, Jiu-Ran; Yi, Hong-Mei; Wang, Lu; Wang, Rui; Yang, Yang; Song, Wei

    2015-01-01

    Single nucleotide polymorphisms (SNPs) are abundant and evenly distributed throughout the maize (Zea mays L.) genome. SNPs have several advantages over simple sequence repeats, such as ease of data comparison and integration, high-throughput processing of loci, and identification of associated phenotypes. SNPs are thus ideal for DNA fingerprinting, genetic diversity analysis, and marker-assisted breeding. Here, we developed a high-throughput and compatible SNP array, maizeSNP3072, containing 3072 SNPs developed from the maizeSNP50 array. To improve genotyping efficiency, a high-quality cluster file, maizeSNP3072_GT.egt, was constructed. All 3072 SNP loci were localized within different genes, where they were distributed in exons (43%), promoters (21%), 3' untranslated regions (UTRs; 22%), 5' UTRs (9%), and introns (5%). The average genotyping failure rate using these SNPs was only 6%, or 3% using the cluster file to call genotypes. The genotype consistency of repeat sample analysis on Illumina GoldenGate versus Infinium platforms exceeded 96.4%. The minor allele frequency (MAF) of the SNPs averaged 0.37 based on data from 309 inbred lines. The 3072 SNPs were highly effective for distinguishing among 276 examined hybrids. Comparative analysis using Chinese varieties revealed that the 3072SNP array showed a better marker success rate and higher average MAF values, evaluation scores, and variety-distinguishing efficiency than the maizeSNP50K array. The maizeSNP3072 array thus can be successfully used in DNA fingerprinting identification of Chinese maize varieties and shows potential as a useful tool for germplasm resource evaluation and molecular marker-assisted breeding.
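
    Two of the summary statistics quoted above, the genotyping failure rate and the minor allele frequency (MAF), are simple to compute per SNP from a column of genotype calls. The Python sketch below is a minimal illustration under stated assumptions: calls are two-character strings, "--" marks a failed call, and the locus is biallelic. The function names and the example calls are hypothetical and do not reflect the array's actual output format.

```python
from collections import Counter

MISSING = "--"  # assumed code for a failed genotype call


def failure_rate(calls):
    """Fraction of samples with no genotype call at this SNP."""
    return sum(1 for c in calls if c == MISSING) / len(calls)


def minor_allele_frequency(calls):
    """MAF at a biallelic SNP, ignoring failed calls."""
    alleles = Counter()
    for call in calls:
        if call != MISSING:
            alleles.update(call)  # counts each of the two allele characters
    total = sum(alleles.values())
    if total == 0 or len(alleles) < 2:
        return 0.0  # monomorphic or no data
    return min(alleles.values()) / total


# Hypothetical calls for one SNP across six samples.
calls = ["AA", "AG", "GG", "AA", "--", "AG"]
print(failure_rate(calls), minor_allele_frequency(calls))
```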

  12. SNP genotyping by DNA photoligation: application to SNP detection of genes from food crops

    Energy Technology Data Exchange (ETDEWEB)

    Yoshimura, Yoshinaga; Ohtake, Tomoko; Okada, Hajime; Fujimoto, Kenzo [School of Materials Science, Japan Advanced Institute of Science and Technology, 1-1 Asahidai, Nomi, Ishikawa 923-1292 (Japan); Ami, Takehiro [Innovation Plaza Ishikawa, Japan Science and Technology Agency, 2-13 Asahidai, Nomi, Ishikawa 923-1211 (Japan); Tsukaguchi, Tadashi, E-mail: kenzo@jaist.ac.j [Faculty of Bioresources and Environmental Sciences, Ishikawa Prefectural University, 1-308 Suematsu, Nonoichi, Ishikawa 921-8836 (Japan)

    2009-06-15

    We describe a simple and inexpensive single-nucleotide polymorphism (SNP) typing method, using DNA photoligation with 5-carboxyvinyl-2'-deoxyuridine and two fluorophores. This SNP-typing method facilitates qualitative determination of genes from indica and japonica rice, and showed a high degree of single nucleotide specificity up to 10 000. This method can be used in the SNP typing of actual genomic DNA samples from food crops.

  13. SNP genotyping by DNA photoligation: application to SNP detection of genes from food crops

    Directory of Open Access Journals (Sweden)

    Yoshinaga Yoshimura, Tomoko Ohtake, Hajime Okada, Takehiro Ami, Tadashi Tsukaguchi and Kenzo Fujimoto

    2009-01-01

    Full Text Available We describe a simple and inexpensive single-nucleotide polymorphism (SNP) typing method, using DNA photoligation with 5-carboxyvinyl-2'-deoxyuridine and two fluorophores. This SNP-typing method facilitates qualitative determination of genes from indica and japonica rice, and showed a high degree of single nucleotide specificity up to 10 000. This method can be used in the SNP typing of actual genomic DNA samples from food crops.

  14. SNP_tools: A compact tool package for analysis and conversion of genotype data for MS-Excel.

    Science.gov (United States)

    Chen, Bowang; Wilkening, Stefan; Drechsel, Marion; Hemminki, Kari

    2009-10-23

    Single nucleotide polymorphism (SNP) genotyping is a major activity in biomedical research. Scientists prefer to have easy access to the results, which may require conversions between data formats. First-hand SNP data are often entered or saved in the MS-Excel format, but this software lacks genetic and epidemiological functions. A general tool for basic genetic and epidemiological analysis and data conversion in MS-Excel is needed. The SNP_tools package is prepared as an add-in for MS-Excel. The code is written in Visual Basic for Applications, embedded in the Microsoft Office package. This add-in is an easy-to-use tool for users with basic computer knowledge (and requirements for basic statistical analysis). Our implementation for Microsoft Excel 2000-2007 in Microsoft Windows 2000, XP, Vista and Windows 7 beta can handle files in different formats and convert them into other formats. It is free software.

  15. A 34K SNP genotyping array for Populus trichocarpa: design, application to the study of natural populations and transferability to other Populus species.

    Science.gov (United States)

    Geraldes, A; Difazio, S P; Slavov, G T; Ranjan, P; Muchero, W; Hannemann, J; Gunter, L E; Wymore, A M; Grassa, C J; Farzaneh, N; Porth, I; McKown, A D; Skyba, O; Li, E; Fujita, M; Klápště, J; Martin, J; Schackwitz, W; Pennacchio, C; Rokhsar, D; Friedmann, M C; Wasteneys, G O; Guy, R D; El-Kassaby, Y A; Mansfield, S D; Cronk, Q C B; Ehlting, J; Douglas, C J; Tuskan, G A

    2013-03-01

    Genetic mapping of quantitative traits requires genotypic data for large numbers of markers in many individuals. For such studies, the use of large single nucleotide polymorphism (SNP) genotyping arrays still offers the most cost-effective solution. Herein we report on the design and performance of a SNP genotyping array for Populus trichocarpa (black cottonwood). This genotyping array was designed with SNPs pre-ascertained in 34 wild accessions covering most of the species latitudinal range. We adopted a candidate gene approach to the array design that resulted in the selection of 34 131 SNPs, the majority of which are located in, or within 2 kb of, 3543 candidate genes. A subset of the SNPs on the array (539) was selected based on patterns of variation among the SNP discovery accessions. We show that more than 95% of the loci produce high quality genotypes and that the genotyping error rate for these is likely below 2%. We demonstrate that even among small numbers of samples (n = 10) from local populations over 84% of loci are polymorphic. We also tested the applicability of the array to other species in the genus and found that the number of polymorphic loci decreases rapidly with genetic distance, with the largest numbers detected in other species in section Tacamahaca. Finally, we provide evidence for the utility of the array to address evolutionary questions such as intraspecific studies of genetic differentiation, species assignment and the detection of natural hybrids. © 2013 Blackwell Publishing Ltd.

  16. Design and characterization of a 52K SNP chip for goats.

    Directory of Open Access Journals (Sweden)

    Gwenola Tosser-Klopp

    Full Text Available The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50-60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF) suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed): Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc) and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes), sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years.

  17. High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster).

    Science.gov (United States)

    Plomion, C; Bartholomé, J; Lesur, I; Boury, C; Rodríguez-Quilón, I; Lagraulet, H; Ehrenmann, F; Bouffier, L; Gion, J M; Grivet, D; de Miguel, M; de María, N; Cervera, M T; Bagnoli, F; Isik, F; Vendramin, G G; González-Martínez, S C

    2016-03-01

    Maritime pine provides essential ecosystem services in the south-western Mediterranean basin, where it covers around 4 million ha. Its scattered distribution over a range of environmental conditions makes it an ideal forest tree species for studies of local adaptation and evolutionary responses to climatic change. Highly multiplexed single nucleotide polymorphism (SNP) genotyping arrays are increasingly used to study genetic variation in living organisms and for practical applications in plant and animal breeding and genetic resource conservation. We developed a 9k Illumina Infinium SNP array and genotyped maritime pine trees from (i) a three-generation inbred (F2) pedigree, (ii) the French breeding population and (iii) natural populations from Portugal and the French Atlantic coast. A large proportion of the exploitable SNPs (2052/8410, i.e. 24.4%) segregated in the mapping population and could be mapped, providing the densest ever gene-based linkage map for this species. Based on 5016 SNPs, natural and breeding populations from the French gene pool exhibited similar level of genetic diversity. Population genetics and structure analyses based on 3981 SNP markers common to the Portuguese and French gene pools revealed high levels of differentiation, leading to the identification of a set of highly differentiated SNPs that could be used for seed provenance certification. Finally, we discuss how the validated SNPs could facilitate the identification of ecologically and economically relevant genes in this species, improving our understanding of the demography and selective forces shaping its natural genetic diversity, and providing support for new breeding strategies. © 2015 John Wiley & Sons Ltd.

  18. A large-scale chromosome-specific SNP discovery guideline.

    Science.gov (United States)

    Akpinar, Bala Ani; Lucas, Stuart; Budak, Hikmet

    2017-01-01

    Single-nucleotide polymorphisms (SNPs) are the most prevalent type of variation in genomes and are increasingly being used as molecular markers in diversity analyses, mapping and cloning of genes, and germplasm characterization. However, only a few studies have reported large-scale SNP discovery in Aegilops tauschii, restricting their potential use as markers for the low-polymorphic D genome. Here, we report 68,592 SNPs found on the gene-related sequences of the 5D chromosome of Ae. tauschii genotype MvGB589 using genomic and transcriptomic sequences from seven Ae. tauschii accessions, including AL8/78, the only genotype for which a draft genome sequence is available at present. We also suggest a workflow to compare SNP positions in homologous regions on the 5D chromosome of Triticum aestivum, bread wheat, to mark single nucleotide variations between these closely related species. Overall, the identified SNPs define a density of 4.49 SNPs per kilobase, among the highest reported for the genic regions of Ae. tauschii so far. To our knowledge, this study also presents the first chromosome-specific SNP catalog in Ae. tauschii that should facilitate the association of these SNPs with morphological traits on chromosome 5D to be ultimately targeted for wheat improvement.

  19. Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species.

    Directory of Open Access Journals (Sweden)

    Brant K Peterson

    Full Text Available The ability to efficiently and accurately determine genotypes is a keystone technology in modern genetics, crucial to studies ranging from clinical diagnostics, to genotype-phenotype association, to reconstruction of ancestry and the detection of selection. To date, high capacity, low cost genotyping has been largely achieved via "SNP chip" microarray-based platforms which require substantial prior knowledge of both genome sequence and variability, and once designed are suitable only for those targeted variable nucleotide sites. This method introduces substantial ascertainment bias and inherently precludes detection of rare or population-specific variants, a major source of information for both population history and genotype-phenotype association. Recent developments in reduced-representation genome sequencing experiments on massively parallel sequencers (commonly referred to as RAD-tag or RADseq) have brought direct sequencing to the problem of population genotyping, but increased cost and procedural and analytical complexity have limited their widespread adoption. Here, we describe a complete laboratory protocol, including a custom combinatorial indexing method, and accompanying software tools to facilitate genotyping across large numbers (hundreds or more) of individuals for a range of markers (hundreds to hundreds of thousands). Our method requires no prior genomic knowledge and achieves per-site and per-individual costs below those of current SNP chip technology, while requiring similar hands-on time investment, comparable amounts of input DNA, and downstream analysis times on the order of hours. Finally, we provide empirical results from the application of this method to both genotyping in a laboratory cross and in wild populations. Because of its flexibility, this modified RADseq approach promises to be applicable to a diversity of biological questions in a wide range of organisms.
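
    The double-digest idea in the abstract above can be illustrated with a small in silico sketch. It is not the authors' software: the function name, the size window, and the choice of recognition sequences (GAATTC for EcoRI and CCGG for MspI, used here only as an example pair) are assumptions, and the cut position is simplified to the start of each recognition site.

```python
import re


def double_digest(seq, site_a, site_b, min_len, max_len):
    """Return (start, end) pairs for fragments that lie between one site_a
    and one site_b with no other site in between, within a size window.

    Simplification (assumption): the cut is placed at the first base of the
    recognition site rather than at the enzyme's true cut offset.
    """
    # Collect every recognition-site start, labelled by enzyme.
    # A lookahead is used so overlapping occurrences are not skipped.
    cuts = [(m.start(), "A") for m in re.finditer(f"(?={re.escape(site_a)})", seq)]
    cuts += [(m.start(), "B") for m in re.finditer(f"(?={re.escape(site_b)})", seq)]
    cuts.sort()

    fragments = []
    # Adjacent cuts in sorted order bound a fragment with no internal site.
    for (pos1, enz1), (pos2, enz2) in zip(cuts, cuts[1:]):
        length = pos2 - pos1
        # Keep only fragments with one end from each enzyme (size selection applied).
        if enz1 != enz2 and min_len <= length <= max_len:
            fragments.append((pos1, pos2))
    return fragments


# Toy sequence (hypothetical): site A, 10 bases, site B, 30 bases, site B, 5 bases, site A.
toy = "TT" + "GAATTC" + "A" * 10 + "CCGG" + "A" * 30 + "CCGG" + "A" * 5 + "GAATTC" + "TT"
selected = double_digest(toy, "GAATTC", "CCGG", min_len=10, max_len=40)
```

    In this toy case only the first fragment passes: the middle fragment has the same enzyme at both ends, and the last one falls below the size window. Narrowing or widening the window is what changes the number of loci sampled.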

  20. SNP design from 454 sequencing of Podosphaera plantaginis transcriptome reveals a genetically diverse pathogen metapopulation with high levels of mixed-genotype infection.

    Directory of Open Access Journals (Sweden)

    Charlotte Tollenaere

    Full Text Available Molecular tools may greatly improve our understanding of pathogen evolution and epidemiology but technical constraints have hindered the development of genetic resources for parasites compared to free-living organisms. This study aims at developing molecular tools for Podosphaera plantaginis, an obligate fungal pathogen of Plantago lanceolata. This interaction has been intensively studied in the Åland archipelago of Finland with epidemiological data collected from over 4,000 host populations annually since 2001. A cDNA library of a pooled sample of fungal conidia was sequenced on the 454 GS-FLX platform. Over 549,411 reads were obtained and annotated into 45,245 contigs. Annotation data was acquired for 65.2% of the assembled sequences. The transcriptome assembly was screened for SNP loci, as well as for functionally important genes (mating-type genes and potential effector proteins). A genotyping assay of 27 SNP loci was designed and tested on 380 infected leaf samples from 80 populations within the Åland archipelago. With this panel we identified 85 multilocus genotypes (MLG) with uneven frequencies across the pathogen metapopulation. Approximately half of the sampled populations contain polymorphism. Our genotyping protocol revealed mixed-genotype infection within a single host leaf to be common. Mixed infection has been proposed as one of the main drivers of pathogen evolution, and hence may be an important process in this pathosystem. The developed SNP panel offers exciting research perspectives for future studies in this well-characterized pathosystem. Also, the transcriptome provides an invaluable novel genomic resource for powdery mildews, which cause significant yield losses on commercially important crops annually. Furthermore, the features that render genetic studies in this system a challenge are shared with the majority of obligate parasitic species, and hence our results provide methodological insights from SNP calling to field

  1. Supplementing High-Density SNP Microarrays for Additional Coverage of Disease-Related Genes: Addiction as a Paradigm

    Energy Technology Data Exchange (ETDEWEB)

    Saccone, Scott F [Washington University, St. Louis]; Chesler, Elissa J [ORNL]; Bierut, Laura J [Washington University, St. Louis]; Kalivas, Peter J [Medical College of South Carolina, Charleston]; Lerman, Caryn [University of Pennsylvania]; Saccone, Nancy L [Washington University, St. Louis]; Uhl, George R [Johns Hopkins University]; Li, Chuan-Yun [Peking University]; Philip, Vivek M [ORNL]; Edenberg, Howard [Indiana University]; Sherry, Steven [National Center for Biotechnology Information]; Feolo, Michael [National Center for Biotechnology Information]; Moyzis, Robert K [Johns Hopkins University]; Rutter, Joni L [National Institute of Drug Abuse]

    2009-01-01

    Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well represented by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions.
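
    The LD tagging step mentioned above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes phased haplotypes coded 0/1, uses the standard pairwise r² statistic, and takes 0.8 as the tagging threshold, which is a common convention but is not stated in the abstract. All names are hypothetical.

```python
def r_squared(hap_a, hap_b):
    """Pairwise r^2 between two biallelic SNPs from phased 0/1 haplotypes."""
    n = len(hap_a)
    p_a = sum(hap_a) / n                      # frequency of allele 1 at SNP A
    p_b = sum(hap_b) / n                      # frequency of allele 1 at SNP B
    p_ab = sum(1 for a, b in zip(hap_a, hap_b) if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                      # linkage disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    # A monomorphic SNP carries no LD information; report 0 instead of dividing by zero.
    return 0.0 if denom == 0 else d * d / denom


def is_tagged(target_hap, array_haps, threshold=0.8):
    """True if any SNP on the array reaches r^2 >= threshold with the target SNP."""
    return any(r_squared(target_hap, h) >= threshold for h in array_haps)


# Toy haplotypes (hypothetical): four chromosomes per SNP.
candidate = [0, 0, 1, 1]
array_snp_linked = [0, 0, 1, 1]     # perfectly correlated with the candidate
array_snp_unlinked = [0, 1, 0, 1]   # statistically independent of the candidate
```

    A candidate SNP for which `is_tagged` returns False against every array SNP is the kind of variant such a supplementary database would flag as not covered by the commercial platform.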

  2. Development of admixture mapping panels for African Americans from commercial high-density SNP arrays

    Directory of Open Access Journals (Sweden)

    Dunston Georgia M

    2010-07-01

    Full Text Available Background: Admixture mapping is a powerful approach for identifying genetic variants involved in human disease that exploits the unique genomic structure in recently admixed populations. To use existing published panels of ancestry-informative markers (AIMs) for admixture mapping, markers have to be genotyped de novo for each admixed study sample and samples representing the ancestral parental populations. The increased availability of dense marker data on commercial chips has made it feasible to develop panels wherein the markers need not be predetermined. Results: We developed two panels of AIMs (~2,000 markers each) based on the Affymetrix Genome-Wide Human SNP Array 6.0 for admixture mapping with African American samples. These two AIM panels had good map power that was higher than that of a denser panel of ~20,000 random markers as well as other published panels of AIMs. As a test case, we applied the panels in an admixture mapping study of hypertension in African Americans in the Washington, D.C. metropolitan area. Conclusions: Developing marker panels for admixture mapping from existing genome-wide genotype data offers two major advantages: (1) no de novo genotyping needs to be done, thereby saving costs, and (2) markers can be filtered for various quality measures and replacement markers (to minimize gaps) can be selected at no additional cost. Panels of carefully selected AIMs have two major advantages over panels of random markers: (1) the map power from sparser panels of AIMs is higher than that of ~10-fold denser panels of random markers, and (2) clusters can be labeled based on information from the parental populations. With current technology, chip-based genome-wide genotyping is less expensive than genotyping ~20,000 random markers. The major advantage of using random markers is the absence of ascertainment effects resulting from the process of selecting markers. The ability to develop marker panels informative for ancestry from
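
    Selecting ancestry-informative markers from existing chip data can be sketched as a ranking problem. The abstract does not say which informativeness measure was used; the sketch below assumes the simplest common choice, the absolute allele-frequency difference between the two parental populations, and leaves out the quality filtering and gap minimization that the abstract also mentions. Marker names and frequencies are hypothetical.

```python
def rank_aims(freq_pop1, freq_pop2, top_n):
    """Rank markers by absolute allele-frequency difference between two
    parental populations and return the top_n marker IDs.

    freq_pop1, freq_pop2: dicts mapping marker ID -> frequency of the same
    reference allele in each parental population.
    """
    # Only markers typed in both parental populations can be compared.
    shared = freq_pop1.keys() & freq_pop2.keys()
    delta = {m: abs(freq_pop1[m] - freq_pop2[m]) for m in shared}
    # Sort by decreasing difference; break ties by marker ID for reproducibility.
    return sorted(delta, key=lambda m: (-delta[m], m))[:top_n]


# Toy frequencies (hypothetical); m4 is absent from the second population.
parental_1 = {"m1": 0.9, "m2": 0.5, "m3": 0.1, "m4": 0.2}
parental_2 = {"m1": 0.1, "m2": 0.45, "m3": 0.7}
panel = rank_aims(parental_1, parental_2, top_n=2)
```

    Because the input is genome-wide chip data that already exist, a marker dropped for quality reasons can be replaced by the next-ranked marker in the same region without any new genotyping, which is the cost advantage the abstract describes.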

  3. Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries

    Directory of Open Access Journals (Sweden)

    Kumar Santosh

    2012-12-01

    Full Text Available Background: Flax (Linum usitatissimum L.) is a significant fibre and oilseed crop. Current flax molecular markers, including isozymes, RAPDs, AFLPs and SSRs are of limited use in the construction of high density linkage maps and for association mapping applications due to factors such as low reproducibility, intense labour requirements and/or limited numbers. We report here on the use of a reduced representation library strategy combined with next generation Illumina sequencing for rapid and large scale discovery of SNPs in eight flax genotypes. SNP discovery was performed through in silico analysis of the sequencing data against the whole genome shotgun sequence assembly of flax genotype CDC Bethune. Genotyping-by-sequencing of an F6-derived recombinant inbred line population provided validation of the SNPs. Results: Reduced representation libraries of eight flax genotypes were sequenced on the Illumina sequencing platform resulting in sequence coverage ranging from 4.33 to 15.64X (genome equivalents). Depending on the relatedness of the genotypes and the number and length of the reads, between 78% and 93% of the reads mapped onto the CDC Bethune whole genome shotgun sequence assembly. A total of 55,465 SNPs were discovered with the largest number of SNPs belonging to the genotypes with the highest mapping coverage percentage. Approximately 84% of the SNPs discovered were identified in a single genotype, 13% were shared between any two genotypes and the remaining 3% in three or more. Nearly a quarter of the SNPs were found in genic regions. A total of 4,706 out of 4,863 SNPs discovered in Macbeth were validated using genotyping-by-sequencing of 96 F6 individuals from a recombinant inbred line population derived from a cross between CDC Bethune and Macbeth, corresponding to a validation rate of 96.8%. Conclusions: Next generation sequencing of reduced representation libraries was successfully implemented for genome-wide SNP discovery from

  4. Low cost, low tech SNP genotyping tools for resource-limited areas: Plague in Madagascar as a model.

    Directory of Open Access Journals (Sweden)

    Cedar L Mitchell

    2017-12-01

    Full Text Available Genetic analysis of pathogenic organisms is a useful tool for linking human cases together and/or to potential environmental sources. The resulting data can also provide information on evolutionary patterns within a targeted species and phenotypic traits. However, the instruments often used to generate genotyping data, such as single nucleotide polymorphisms (SNPs), can be expensive and sometimes require advanced technologies to implement. This places many genotyping tools out of reach for laboratories that do not specialize in genetic studies and/or lack the requisite financial and technological resources. To address this issue, we developed a low cost and low tech genotyping system, termed agarose-MAMA, which combines traditional PCR and agarose gel electrophoresis to target phylogenetically informative SNPs. To demonstrate the utility of this approach for generating genotype data in a resource-constrained area (Madagascar), we designed an agarose-MAMA system targeting previously characterized SNPs within Yersinia pestis, the causative agent of plague. We then used this system to genetically type pathogenic strains of Y. pestis in a Malagasy laboratory not specialized in genetic studies, the Institut Pasteur de Madagascar (IPM). We conducted rigorous assay performance validations to assess potential variation introduced by differing research facilities, reagents, and personnel and found no difference in SNP genotyping results. These agarose-MAMA PCR assays are currently employed as an investigative tool at IPM, providing Malagasy researchers a means to improve the value of their plague epidemiological investigations by linking outbreaks to potential sources through genetic characterization of isolates and to improve understanding of disease ecology that may contribute to a long-term control effort. The success of our study demonstrates that the SNP-based genotyping capacity of laboratories in developing countries can be expanded with manageable

  5. Low cost, low tech SNP genotyping tools for resource-limited areas: Plague in Madagascar as a model.

    Science.gov (United States)

    Mitchell, Cedar L; Andrianaivoarimanana, Voahangy; Colman, Rebecca E; Busch, Joseph; Hornstra-O'Neill, Heidie; Keim, Paul S; Wagner, David M; Rajerison, Minoarisoa; Birdsell, Dawn N

    2017-12-01

    Genetic analysis of pathogenic organisms is a useful tool for linking human cases together and/or to potential environmental sources. The resulting data can also provide information on evolutionary patterns within a targeted species and phenotypic traits. However, the instruments often used to generate genotyping data, such as single nucleotide polymorphisms (SNPs), can be expensive and sometimes require advanced technologies to implement. This places many genotyping tools out of reach for laboratories that do not specialize in genetic studies and/or lack the requisite financial and technological resources. To address this issue, we developed a low cost and low tech genotyping system, termed agarose-MAMA, which combines traditional PCR and agarose gel electrophoresis to target phylogenetically informative SNPs. To demonstrate the utility of this approach for generating genotype data in a resource-constrained area (Madagascar), we designed an agarose-MAMA system targeting previously characterized SNPs within Yersinia pestis, the causative agent of plague. We then used this system to genetically type pathogenic strains of Y. pestis in a Malagasy laboratory not specialized in genetic studies, the Institut Pasteur de Madagascar (IPM). We conducted rigorous assay performance validations to assess potential variation introduced by differing research facilities, reagents, and personnel and found no difference in SNP genotyping results. These agarose-MAMA PCR assays are currently employed as an investigative tool at IPM, providing Malagasy researchers a means to improve the value of their plague epidemiological investigations by linking outbreaks to potential sources through genetic characterization of isolates and to improve understanding of disease ecology that may contribute to a long-term control effort. The success of our study demonstrates that the SNP-based genotyping capacity of laboratories in developing countries can be expanded with manageable financial cost for

  6. TNF-alpha 308 SNP Rs3091256 GG Genotype is Strongly Associated with Fibrosis in Patients with Chronic Hepatitis C

    Directory of Open Access Journals (Sweden)

    Özgür GÜNAL

    2017-12-01

    Full Text Available Objective: We aimed to review the influence of host genetic factors on the clinical course, treatment response as well as fibrosis progression in patients with viral hepatitis C genotype 1. Materials and Methods: Ninety-five patients with chronic hepatitis C virus (HCV) infection and 97 controls were enrolled. The patients received pegylated interferon (Peg-IFN) + ribavirin therapy for 48 weeks and were followed up for the next 48 weeks. Aspartate aminotransferase/platelet ratio index (APRI) was used to detect liver fibrosis. DNA specimens were extracted from the peripheral blood mononuclear cells and the tumor necrosis factor-alpha (TNF-α) 308 rs3091256 was genotyped by the polymerase chain reaction-restriction fragment length polymorphism method. Results: All patients included in the study were infected with HCV genotype 1. Of the 95 HCV-positive patients, spontaneous viral clearance was observed in 25.5%, rapid viral response in 44.2%, early viral response in 91.8%, and sustained viral response in 73.3%. Allele and genotype frequencies did not differ significantly between patients and controls. There was no significant difference in virologic response either. However, the TNF-α-308 single nucleotide polymorphism (SNP) rs3091256 GG genotype was strongly associated with fibrosis and alanine aminotransferase (ALT) levels (p=0.006 and p=0.017, respectively). Conclusion: TNF-α-308 polymorphisms may reveal different results among countries. Patients having SNP rs3091256 GG are prone to have higher ALT levels and fibrosis score but have better treatment outcome.

  7. SNP-PHAGE – High throughput SNP discovery pipeline

    Directory of Open Access Journals (Sweden)

    Cregan Perry B

    2006-10-01

    Full Text Available Background: Single nucleotide polymorphisms (SNPs) as defined here are single base sequence changes or short insertion/deletions between or within individuals of a given species. As a result of their abundance and the availability of high throughput analysis technologies, SNP markers have begun to replace other traditional markers such as restriction fragment length polymorphisms (RFLPs), amplified fragment length polymorphisms (AFLPs) and simple sequence repeats (SSRs) or microsatellite markers for fine mapping and association studies in several species. For SNP discovery from chromatogram data, several bioinformatics programs have to be combined to generate an analysis pipeline. Results have to be stored in a relational database to facilitate interrogation through queries or to generate data for further analyses such as determination of linkage disequilibrium and identification of common haplotypes. Although these tasks are routinely performed by several groups, an integrated open source SNP discovery pipeline that can be easily adapted by new groups interested in SNP marker development is currently unavailable. Results: We developed SNP-PHAGE (SNP discovery Pipeline with additional features for identification of common haplotypes within a sequence tagged site (Haplotype Analysis) and GenBank (-dbSNP) submissions). This tool was applied for analyzing sequence traces from diverse soybean genotypes to discover over 10,000 SNPs. This package was developed on a UNIX/Linux platform, written in Perl and uses a MySQL database. Scripts to generate a user-friendly web interface are also provided with common queries for preliminary data analysis. A machine learning tool developed by this group for increasing the efficiency of SNP discovery is integrated as a part of this package as an optional feature. The SNP-PHAGE package is being made available open source at http://bfgl.anri.barc.usda.gov/ML/snp-phage/. Conclusion: SNP-PHAGE provides a bioinformatics

  8. Comparison of three boosting methods in parent-offspring trios for genotype imputation using simulation study

    Directory of Open Access Journals (Sweden)

    Abbas Mikhchi

    2016-01-01

    Full Text Available Background: Genotype imputation is an important process of predicting unknown genotypes, which uses a reference population with dense genotypes to predict missing genotypes for both human and animal genetic variations at a low cost. Machine learning methods, especially boosting methods, have been used in genetic studies to explore the underlying genetic profile of disease and build models capable of predicting missing values of a marker. Methods: In this study, strategies and factors affecting the imputation accuracy of parent-offspring trios were compared from a lower-density SNP panel (5 K) to a high-density (10 K) SNP panel using three different boosting methods, namely TotalBoost (TB), LogitBoost (LB) and AdaBoost (AB). The methods were applied to simulated data to impute the un-typed SNPs in parent-offspring trios. Four different datasets were simulated: G1 (100 trios with 5 K SNPs), G2 (100 trios with 10 K SNPs), G3 (500 trios with 5 K SNPs), and G4 (500 trios with 10 K SNPs). In all four datasets, all parents were genotyped completely and offspring were genotyped with a lower-density panel. Results: Comparison of the three methods showed that LB outperformed AB and TB for imputation accuracy. Computation time differed between methods; AB was the fastest algorithm. Higher SNP densities increased the accuracy of imputation. Larger numbers of trios (i.e. 500) improved the performance of LB and TB. Conclusions: The three methods perform well in terms of imputation accuracy, and the dense chip is recommended for imputation of parent-offspring trios.

  9. Reliable single chip genotyping with semi-parametric log-concave mixtures.

    Directory of Open Access Journals (Sweden)

    Ralph C A Rippe

    Full Text Available The common approach to SNP genotyping is to use (model-based) clustering per individual SNP, on a set of arrays. Genotyping all SNPs on a single array is much more attractive, in terms of flexibility, stability and applicability, when developing new chips. A new semi-parametric method, named SCALA, is proposed. It is based on a mixture model using semi-parametric log-concave densities. Instead of using the raw data, the mixture is fitted on a two-dimensional histogram, thereby making computation time almost independent of the number of SNPs. Furthermore, the algorithm is effective in low-MAF situations. Comparisons between SCALA and CRLMM on HapMap genotypes show very reliable calling of single arrays. Some heterozygous genotypes from HapMap are called homozygous by SCALA and to a lesser extent by CRLMM too. Furthermore, HapMap's NoCalls (NN) could be genotyped by SCALA, mostly with high probability. The software is available as R scripts from the website www.math.leidenuniv.nl/~rrippe.

  10. Differential growth of Mycobacterium leprae strains (SNP genotypes) in armadillos.

    Science.gov (United States)

    Sharma, Rahul; Singh, Pushpendra; Pena, Maria; Subramanian, Ramesh; Chouljenko, Vladmir; Kim, Joohyun; Kim, Nayong; Caskey, John; Baudena, Marie A; Adams, Linda B; Truman, Richard W

    2018-04-14

    Leprosy (Hansen's Disease) has occurred throughout human history, and persists today at a low prevalence in most populations. Caused by Mycobacterium leprae, the infection primarily involves the skin, mucosa and peripheral nerves. The susceptible host range for Mycobacterium leprae is quite narrow. Besides humans, nine-banded armadillos (Dasypus novemcinctus) and red squirrels (Sciurus vulgaris) are the only other natural hosts for M. leprae, but only armadillos recapitulate the disease as seen in humans. Armadillos across the Southern United States harbor a single predominant genotypic strain (SNP Type-3I) of M. leprae, which is also implicated in the zoonotic transmission of leprosy. We investigated whether the zoonotic strain (3I) has any notable growth advantages in armadillos over another genetically distant strain-type (SNP Type-4P) of M. leprae, and if M. leprae strains manifest any notably different pathology among armadillos. We co-infected armadillos (n = 6) with 2 × 10^9 highly viable M. leprae of both strains and assessed the relative growth and dissemination of each strain in the animals. We also analyzed 12 additional armadillos, 6 each individually infected with the same quantity of either strain. The infections were allowed to fulminate and the clinical manifestations of the disease were noted. Animals were humanely sacrificed at the terminal stage of infection and the number of bacilli per gram of liver, spleen and lymph node tissue was enumerated by Q-PCR assay. The growth of M. leprae strain 4P was significantly higher (P leprae strains within armadillos suggest there are notable pathological variations between M. leprae strain-types. Copyright © 2018. Published by Elsevier B.V.

  11. Discovery of novel variants in genotyping arrays improves genotype retention and reduces ascertainment bias

    Directory of Open Access Journals (Sweden)

    Didion John P

    2012-01-01

    Full Text Available Background: High-density genotyping arrays that measure hybridization of genomic DNA fragments to allele-specific oligonucleotide probes are widely used to genotype single nucleotide polymorphisms (SNPs) in genetic studies, including human genome-wide association studies. Hybridization intensities are converted to genotype calls by clustering algorithms that assign each sample to a genotype class at each SNP. Data for SNP probes that do not conform to the expected pattern of clustering are often discarded, contributing to ascertainment bias and resulting in lost information - as much as 50% in a recent genome-wide association study in dogs. Results: We identified atypical patterns of hybridization intensities that were highly reproducible and demonstrated that these patterns represent genetic variants that were not accounted for in the design of the array platform. We characterized variable intensity oligonucleotide (VINO) probes that display such patterns and are found in all hybridization-based genotyping platforms, including those developed for human, dog, cattle, and mouse. When recognized and properly interpreted, VINOs recovered a substantial fraction of discarded probes and counteracted SNP ascertainment bias. We developed software (MouseDivGeno) that identifies VINOs and improves the accuracy of genotype calling. MouseDivGeno produced highly concordant genotype calls when compared with other methods but it uniquely identified more than 786,000 VINOs in 351 mouse samples. We used whole-genome sequence from 14 mouse strains to confirm the presence of novel variants explaining 28,000 VINOs in those strains. We also identified VINOs in human HapMap 3 samples, many of which were specific to an African population. Incorporating VINOs in phylogenetic analyses substantially improved the accuracy of a Mus species tree and local haplotype assignment in laboratory mouse strains. Conclusion: The problems of ascertainment bias and missing

  12. Olive oil DNA fingerprinting by multiplex SNP genotyping on fluorescent microspheres.

    Science.gov (United States)

    Kalogianni, Despina P; Bazakos, Christos; Boutsika, Lemonia M; Targem, Mehdi Ben; Christopoulos, Theodore K; Kalaitzis, Panagiotis; Ioannou, Penelope C

    2015-04-01

    Olive oil cultivar verification is of primary importance for the competitiveness of the product and the protection of consumers and producers from fraudulence. Single-nucleotide polymorphisms (SNPs) have emerged as excellent DNA markers for authenticity testing. This paper reports the first multiplex SNP genotyping assay for olive oil cultivar identification that is performed on a suspension of fluorescence-encoded microspheres. Up to 100 sets of microspheres, with unique "fluorescence signatures", are available. Allele discrimination was accomplished by primer extension reaction. The reaction products were captured via hybridization on the microspheres and analyzed, within seconds, by a flow cytometer. The "fluorescence signature" of each microsphere is assigned to a specific allele, whereas the signal from a reporter fluorophore denotes the presence of the allele. As a model, a panel of three SNPs was chosen that enabled identification of five common Greek olive cultivars (Adramytini, Chondrolia Chalkidikis, Kalamon, Koroneiki, and Valanolia).

  13. A high-density SNP map for accurate mapping of seed fibre QTL in Brassica napus L.

    Directory of Open Access Journals (Sweden)

    Liezhao Liu

    Full Text Available A high density genetic linkage map for the complex allotetraploid crop species Brassica napus (oilseed rape) was constructed in a late-generation recombinant inbred line (RIL) population, using genome-wide single nucleotide polymorphism (SNP) markers assayed by the Brassica 60 K Infinium BeadChip Array. The linkage map contains 9164 SNP markers covering 1832.9 cM. 1232 bins account for 7648 of the markers. A subset of 2795 SNP markers, with an average distance of 0.66 cM between adjacent markers, was applied for QTL mapping of seed colour and the cell wall fiber components acid detergent lignin (ADL), cellulose and hemicellulose. After phenotypic analyses across four different environments, a total of 11 QTL were detected for seed colour and fiber traits. The high-density map considerably improved QTL resolution compared to the previous low-density maps. A previously identified major QTL with very high effects on seed colour and ADL was pinpointed to a narrow genome interval on chromosome A09, while a minor QTL explaining 8.1% to 14.1% of variation for ADL was detected on chromosome C05. Five and three QTL accounting for 4.7% to 21.9% and 7.3% to 16.9% of the phenotypic variation for cellulose and hemicellulose, respectively, were also detected. To our knowledge this is the first description of QTL for seed cellulose and hemicellulose in B. napus, representing interesting new targets for improving oil content. The high density SNP genetic map enables navigation from interesting B. napus QTL to Brassica genome sequences, giving useful new information for understanding the genetics of key seed quality traits in rapeseed.

  14. Identification of Mendelian inconsistencies between SNP and pedigree information of sibs

    Directory of Open Access Journals (Sweden)

    Calus Mario PL

    2011-10-01

    Full Text Available Abstract Background Using SNP genotypes to apply genomic selection in breeding programs is becoming common practice. Tools to edit and check the quality of genotype data are required. Checking for Mendelian inconsistencies makes it possible to identify animals for which pedigree information and genotype information are not in agreement. Methods Straightforward tests to detect Mendelian inconsistencies exist that count the number of opposing homozygous marker (e.g. SNP genotypes between parent and offspring (PAR-OFF. Here, we develop two tests to identify Mendelian inconsistencies between sibs. The first test counts SNP with opposing homozygous genotypes between sib pairs (SIBCOUNT. The second test compares pedigree and SNP-based relationships (SIBREL. All tests iteratively remove animals based on decreasing numbers of inconsistent parents and offspring or sibs. The PAR-OFF test, followed by either SIB test, was applied to a dataset comprising 2,078 genotyped cows and 211 genotyped sires. Theoretical expectations for distributions of test statistics of all three tests were calculated and compared to empirically derived values. Type I and II error rates were calculated after applying the tests to the edited data, while Mendelian inconsistencies were introduced by permuting pedigree against genotype data for various proportions of animals. Results Both SIB tests identified animal pairs for which pedigree and genomic relationships could be considered as inconsistent by visual inspection of a scatter plot of pairwise pedigree and SNP-based relationships. After removal of 235 animals with the PAR-OFF test, SIBCOUNT (SIBREL identified 18 (22 additional inconsistent animals. Seventeen animals were identified by both methods. The numbers of incorrectly deleted animals (Type I error, were equally low for both methods, while the numbers of incorrectly non-deleted animals (Type II error, were considerably higher for SIBREL compared to SIBCOUNT. 
Conclusions
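
    The parent-offspring check described in this record (counting opposing homozygous genotypes) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the 0/1/2 allele-count coding, the use of None for missing calls, and the function name are assumptions made for the example.

```python
def count_opposing_homozygotes(parent, offspring):
    """Count loci where parent and offspring are homozygous for different alleles.

    Genotypes are assumed to be coded as allele counts (0, 1, 2),
    with None marking a missing call. Loci with a missing call are skipped.
    """
    count = 0
    for p, o in zip(parent, offspring):
        if p is None or o is None:
            continue
        # Opposing homozygotes: one animal is 0 and the other is 2.
        if {p, o} == {0, 2}:
            count += 1
    return count


parent = [0, 2, 1, 2, None]
offspring = [2, 2, 0, 0, 0]
conflicts = count_opposing_homozygotes(parent, offspring)
```

    A true parent-offspring pair should show close to zero such loci, so a pair with a high count is a candidate pedigree error; the cut-off used to flag a pair is study-specific and is not given here.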

  15. Evaluation of the OvineSNP50 chip for use in four South African ...

    African Journals Online (AJOL)

    Relatively rapid and cost-effective genotyping using the OvineSNP50 chip holds great promise for the South African sheep industry and research partners. However, SNP ascertainment bias may influence inferences from the genotyping results of South African sheep breeds. Therefore, samples from Dorper, Namaqua ...

  16. fcGENE: a versatile tool for processing and transforming SNP datasets.

    Directory of Open Access Journals (Sweden)

    Nab Raj Roshyara

    Full Text Available Modern analysis of high-dimensional SNP data requires a number of biometrical and statistical methods such as pre-processing, analysis of population structure, association analysis and genotype imputation. Software used for these purposes often relies on specific and incompatible input and output data formats. Therefore extensive data management including multiple format conversions is necessary during analyses. In order to support fast and efficient management and bio-statistical quality control of high-dimensional SNP data, we developed the publicly available software fcGENE using the C++ object-oriented programming language. This software simplifies and automates the use of different existing analysis packages, especially during the workflow of genotype imputations and corresponding analyses. fcGENE transforms SNP data and imputation results into different formats required for a large variety of analysis packages such as PLINK, SNPTEST, HAPLOVIEW, EIGENSOFT, GenABEL and tools used for genotype imputation such as MaCH, IMPUTE, BEAGLE and others. Data management tasks like merging, splitting, and extracting SNP and pedigree information can be performed. fcGENE also supports a number of bio-statistical quality control processes and quality-based filtering processes at the SNP and sample level. The tool also generates templates of commands required to run specific software packages, especially those required for genotype imputation. We demonstrate the functionality of fcGENE by example workflows of SNP data analyses and provide a comprehensive manual of commands, options and applications. We have developed a user-friendly open-source software, fcGENE, which comprehensively supports SNP data management, quality control and analysis workflows. Download statistics and corresponding feedback indicate that the software is highly recognised and extensively applied by the scientific community.

  17. MDM2 gene SNP309 T/G and p53 gene SNP72 G/C do not influence diffuse large B-cell non-Hodgkin lymphoma onset or survival in central European Caucasians

    Directory of Open Access Journals (Sweden)

    Landt Olfert

    2008-04-01

    Full Text Available Abstract Background SNP309 T/G (rs2279744) causes higher levels of MDM2, the most important negative regulator of the p53 tumor suppressor. SNP72 G/C (rs1042522) gives rise to a p53 protein with a greatly reduced capacity to induce apoptosis. Both polymorphisms have been implicated in cancer. The SNP309 G-allele has recently been reported to accelerate diffuse large B-cell lymphoma (DLBCL) formation in pre-menopausal women and suggested to constitute a genetic basis for estrogen affecting human tumorigenesis. Here we asked whether SNP309 and SNP72 are associated with DLBCL in women and are correlated with age of onset, diagnosis, or patient's survival. Methods SNP309 and SNP72 were PCR-genotyped in a case-control study that included 512 controls and 311 patients diagnosed with aggressive NHL. Of these, 205 were diagnosed with DLBCL. Results The age of onset was similar in men and women. The control and patient groups showed similar SNP309 and SNP72 genotype frequencies. Importantly and in contrast to the previous findings, similar genotype frequencies were observed in female patients diagnosed by 51 years of age and those diagnosed later. Specifically, 3/20 female DLBCL patients diagnosed by 51 years of age were homozygous for SNP309 G and 2/20 DLBCL females in that age group were homozygous for SNP72 C. Neither SNP309 nor SNP72 had a significant influence on event-free and overall survival in multivariate analyses. Conclusion In contrast to the previous study on Ashkenazi Jewish Caucasians, DLBCL in pre-menopausal women of central European Caucasian ethnicity was not associated with SNP309 G. Neither SNP309 nor SNP72 seems to be correlated with age of onset, diagnosis, or survival of patients.

  18. SNP marker detection and genotyping in tilapia

    NARCIS (Netherlands)

    Bers, van N.E.M.; Crooijmans, R.P.M.A.; Groenen, M.A.M.; Dibbits, B.W.; Komen, J.

    2012-01-01

    We have generated a unique resource consisting of nearly 175 000 short contig sequences and 3569 SNP markers from the widely cultured GIFT (Genetically Improved Farmed Tilapia) strain of Nile tilapia (Oreochromis niloticus). In total, 384 SNPs were selected to monitor the wider applicability of the

  19. Short communication: relationship of call rate and accuracy of single nucleotide polymorphism genotypes in dairy cattle.

    Science.gov (United States)

    Cooper, T A; Wiggans, G R; VanRaden, P M

    2013-05-01

    Call rates on both a single nucleotide polymorphism (SNP) basis and an animal basis are used as measures of data quality and as screening tools for genomic studies and evaluations of dairy cattle. To investigate the relationship of SNP call rate and genotype accuracy for individual SNP, the correlation between percentages of missing genotypes and parent-progeny conflicts for each SNP was calculated for 103,313 Holsteins. Correlations ranged from 0.14 to 0.38 for the BovineSNP50 and BovineLD (Illumina Inc., San Diego, CA) and GeneSeek Genomic Profiler (Neogen Corp., Lincoln, NE) chips, with lower correlations for newer chips. For US genomic evaluations, genotypes are excluded for animals with a call rate of call rate for 220,175 Holstein, Jersey, and Brown Swiss genotypes was 99.6%. Animal genotypes with a call rate of ≤99% were examined from the US Department of Agriculture genotype database to determine how genotype call rate is related to accuracy of calls on an animal basis. Animal call rate was determined from SNP used in genomic evaluation and is the number of called autosomal and X-specific SNP genotypes divided by the number of SNP from that type of chip. To investigate the relationship of animal call rate and parentage validation, conflicts between a genotyped animal and its sire or dam were determined through a duo test (opposite homozygous SNP genotypes between sire and progeny; 1,374 animal genotypes) and a trio test (also including conflicts with dam and heterozygous SNP genotype for the animal when both parents are the same homozygote; 482 animal genotypes). When animal call rate was ≤ 80%, parentage validation was no longer reliable with the duo test. With the trio test, parentage validation was no longer reliable when animal call rate was ≤ 90%. To investigate how animal call rate was related to genotyping accuracy for animals with multiple genotypes, concordance between genotypes for 1,216 animals that had a genotype with a call rate of ≤ 99
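
    The animal call rate and the trio test described in this record can be illustrated with a short sketch. It assumes genotypes coded as allele counts (0, 1, 2) with None for an uncalled SNP; the function names are hypothetical and this is not the USDA evaluation code.

```python
def call_rate(genotypes):
    """Fraction of SNP on the chip that received a genotype call."""
    called = sum(g is not None for g in genotypes)
    return called / len(genotypes)


def trio_conflicts(animal, sire, dam):
    """Count SNP that conflict with the stated sire and dam.

    A SNP conflicts when the animal is the opposite homozygote of either
    parent, or when the animal is heterozygous although both parents are
    the same homozygote. SNP not called in the animal are skipped.
    """
    conflicts = 0
    for a, s, d in zip(animal, sire, dam):
        if a is None:
            continue
        opposite_sire = s is not None and {a, s} == {0, 2}
        opposite_dam = d is not None and {a, d} == {0, 2}
        impossible_het = a == 1 and s is not None and s == d and s in (0, 2)
        if opposite_sire or opposite_dam or impossible_het:
            conflicts += 1
    return conflicts


animal = [0, 1, 2, None, 1]
sire = [2, 0, 2, 0, 1]
dam = [0, 0, 1, 0, 2]
rate = call_rate(animal)
n_conflicts = trio_conflicts(animal, sire, dam)
```

    Dropping the dam argument and the heterozygote rule reduces this to the duo test, which uses only opposite homozygotes between sire and progeny.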

  20. Model SNP development for complex genomes based on hexaploid oat using high-throughput 454 sequencing technology

    Directory of Open Access Journals (Sweden)

    Chao Shiaoman

    2011-01-01

    Full Text Available Abstract Background Genetic markers are pivotal to modern genomics research; however, discovery and genotyping of molecular markers in oat has been hindered by the size and complexity of the genome, and by a scarcity of sequence data. The purpose of this study was to generate oat expressed sequence tag (EST) information, develop a bioinformatics pipeline for SNP discovery, and establish a method for rapid, cost-effective, and straightforward genotyping of SNP markers in complex polyploid genomes such as oat. Results Based on cDNA libraries of four cultivated oat genotypes, approximately 127,000 contigs were assembled from approximately one million Roche 454 sequence reads. Contigs were filtered through a novel bioinformatics pipeline to eliminate ambiguous polymorphism caused by subgenome homology, and 96 in silico SNPs were selected from 9,448 candidate loci for validation using high-resolution melting (HRM) analysis. Of these, 52 (54%) were polymorphic between parents of the Ogle1040 × TAM O-301 (OT) mapping population, with 48 segregating as single Mendelian loci, and 44 being placed on the existing OT linkage map. Ogle and TAM amplicons from 12 primers were sequenced for SNP validation, revealing complex polymorphism in seven amplicons but general sequence conservation within SNP loci. Whole-amplicon interrogation with HRM revealed insertions, deletions, and heterozygotes in secondary oat germplasm pools, generating multiple alleles at some primer targets. To validate marker utility, 36 SNP assays were used to evaluate the genetic diversity of 34 diverse oat genotypes. Dendrogram clusters corresponded generally to known genome composition and genetic ancestry. Conclusions The high-throughput SNP discovery pipeline presented here is a rapid and effective method for identification of polymorphic SNP alleles in the oat genome. The current-generation HRM system is a simple and highly-informative platform for SNP genotyping. 
These techniques provide

  1. A low-density SNP array for analyzing differential selection in freshwater and marine populations of threespine stickleback (Gasterosteus aculeatus)

    DEFF Research Database (Denmark)

    Ferchaud, Anne-Laure; Pedersen, Susanne H.; Bekkevold, Dorte

    2014-01-01

    for rapid and cost efficient analysis of genetic divergence between freshwater and marine sticklebacks, we generated a low-density SNP (Single Nucleotide Polymorphism) array encompassing markers of chromosome regions under putative directional selection, along with neutral markers for background. Results......: RAD (Restriction site Associated DNA) sequencing of sixty individuals representing two freshwater and one marine population led to the identification of 33,993 SNP markers. Ninety-six of these were chosen for the low-density SNP array, among which 70 represented SNPs under putatively directional...... selection in freshwater vs. marine environments, whereas 26 SNPs were assumed to be neutral. Annotation of these regions revealed several genes that are candidates for affecting stickleback phenotypic variation, some of which have been observed in previous studies whereas others are new. Conclusions: We...

  2. Identification of Mendelian inconsistencies between SNP and pedigree Information of Sibs

    NARCIS (Netherlands)

    Calus, M.P.L.; Mulder, H.A.; Bastiaansen, J.W.M.

    2011-01-01

    Background Using SNP genotypes to apply genomic selection in breeding programs is becoming common practice. Tools to edit and check the quality of genotype data are required. Checking for Mendelian inconsistencies makes it possible to identify animals for which pedigree information and genotype

  3. Extensive variation in the density and distribution of DNA polymorphism in sorghum genomes.

    Directory of Open Access Journals (Sweden)

    Joseph Evans

    Full Text Available Sorghum genotypes currently used for grain production in the United States were developed from African landraces that were imported starting in the mid-to-late 19th century. Farmers and plant breeders selected genotypes for grain production with reduced plant height, early flowering, increased grain yield, adaptation to drought, and improved resistance to lodging, diseases and pests. DNA polymorphisms that distinguish three historically important grain sorghum genotypes, BTx623, BTx642 and Tx7000, were characterized by genome sequencing, genotyping by sequencing, genetic mapping, and pedigree-based haplotype analysis. The distribution and density of DNA polymorphisms in the sequenced genomes varied widely, in part because the lines were derived through breeding and selection from diverse Kafir, Durra, and Caudatum race accessions. Genomic DNA spanning dw1 (SBI-09) and dw3 (SBI-07) had identical haplotypes due to selection for reduced height. Lower SNP density in genes located in pericentromeric regions compared with genes located in euchromatic regions is consistent with background selection in these regions of low recombination. SNP density was higher in euchromatic DNA and varied >100-fold in contiguous intervals that spanned up to 300 Kbp. The localized variation in DNA polymorphism density occurred throughout euchromatic regions where recombination is elevated; however, polymorphism density was not correlated with gene density or DNA methylation. Overall, sorghum chromosomes contain distal euchromatic regions characterized by extensive, localized variation in DNA polymorphism density, and large pericentromeric regions of low gene density, diversity, and recombination.
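
    Localized variation in SNP density, as discussed in this record, is usually summarised per fixed-size window along a chromosome. The sketch below is a generic illustration under assumed inputs (a list of SNP positions in bp and a window size chosen by the user); it is not the pipeline used in the study.

```python
from collections import Counter


def snp_density(positions, window_bp):
    """Return SNPs per kb for each non-empty window, keyed by window start (bp).

    Windows that contain no SNP are omitted from the result.
    """
    counts = Counter(pos // window_bp for pos in positions)
    return {w * window_bp: n * 1000 / window_bp for w, n in sorted(counts.items())}


positions = [10, 20, 1500, 2500, 2600, 2700]
density = snp_density(positions, window_bp=1000)
```

    Comparing the largest and smallest per-window values gives the kind of fold difference between intervals that the abstract reports.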

  4. Genomic Variants Revealed by Invariably Missing Genotypes in Nelore Cattle.

    Directory of Open Access Journals (Sweden)

    Joaquim Manoel da Silva

    Full Text Available High density genotyping panels have been used in a wide range of applications. From population genetics to genome-wide association studies, this technology still offers the lowest cost and the most consistent solution for generating SNP data. However, in spite of the application, part of the generated data is always discarded from final datasets based on quality control criteria used to remove unreliable markers. Some discarded data consists of markers that failed to generate genotypes, labeled as missing genotypes. A subset of missing genotypes that occur in the whole population under study may be caused by technical issues but can also be explained by the presence of genomic variations that are in the vicinity of the assayed SNP and that prevent genotyping probes from annealing. The latter case may contain relevant information because these missing genotypes might be used to identify population-specific genomic variants. In order to assess which case is more prevalent, we used Illumina HD Bovine chip genotypes from 1,709 Nelore (Bos indicus) samples. We found 3,200 missing genotypes among the whole population. NGS re-sequencing data from 8 sires were used to verify the presence of genomic variations within their flanking regions in 81.56% of these missing genotypes. Furthermore, we discovered 3,300 novel SNPs/Indels, 31% of which are located in genes that may affect traits of importance for the genetic improvement of cattle production.

  5. A 200K SNP chip reveals a novel Pacific salmon louse genotype linked to differential efficacy of emamectin benzoate.

    Science.gov (United States)

    Messmer, Amber M; Leong, Jong S; Rondeau, Eric B; Mueller, Anita; Despins, Cody A; Minkley, David R; Kent, Matthew P; Lien, Sigbjørn; Boyce, Brad; Morrison, Diane; Fast, Mark D; Norman, Joseph D; Danzmann, Roy G; Koop, Ben F

    2018-04-16

    Antiparasitic drugs such as emamectin benzoate (EMB) are relied upon to reduce the parasite load, particularly of the sea louse Lepeophtheirus salmonis, on farmed salmon. The decline in EMB treatment efficacy for this purpose is an important issue for salmon producers around the world, and particularly for those in the Atlantic Ocean where widespread EMB tolerance in sea lice is recognized as a significant problem. Salmon farms in the Northeast Pacific Ocean have not historically experienced the same issues with treatment efficacy, possibly due to the relatively large population of endemic salmonid hosts that serve to both redistribute surviving lice and dilute populations potentially under selection by introducing naïve lice to farms. Frequent migration of lice among farmed and wild hosts should limit the effect of farm-specific selection pressures on changes to the overall allele frequencies of sea lice in the Pacific Ocean. A previous study using microsatellites examined L. salmonis oncorhynchi from 10 Pacific locations from wild and farmed hosts and found no population structure. Recently however, a farm population of sea lice was detected where EMB bioassay exposure tolerance was abnormally elevated. In response, we have developed a Pacific louse draft genome that complements the previously-released Atlantic louse sequence. These genomes were combined with whole-genome re-sequencing data to design a highly sensitive 201,279 marker SNP array applicable for both subspecies (90,827 validated Pacific loci; 153,569 validated Atlantic loci). Notably, kmer spectrum analysis of the re-sequenced samples indicated that Pacific lice exhibit a large within-individual heterozygosity rate (average of 1 in every 72 bases) that is markedly higher than that of Atlantic individuals (1 in every 173 bases). The SNP chip was used to produce a high-density map for Atlantic sea louse linkage group 5 that was previously shown to be associated with EMB tolerance in Atlantic lice

  6. Comparing strategies for selection of low-density SNPs for imputation-mediated genomic prediction in U. S. Holsteins.

    Science.gov (United States)

    He, Jun; Xu, Jiaqi; Wu, Xiao-Lin; Bauck, Stewart; Lee, Jungjae; Morota, Gota; Kachman, Stephen D; Spangler, Matthew L

    2018-04-01

    SNP chips are commonly used for genotyping animals in genomic selection but strategies for selecting low-density (LD) SNPs for imputation-mediated genomic selection have not been addressed adequately. The main purpose of the present study was to compare the performance of eight LD (6K) SNP panels, each selected by a different strategy exploiting a combination of three major factors: evenly-spaced SNPs, increased minor allele frequencies, and SNP-trait associations either for single traits independently or for all the three traits jointly. The imputation accuracies from 6K to 80K SNP genotypes were between 96.2 and 98.2%. Genomic prediction accuracies obtained using imputed 80K genotypes were between 0.817 and 0.821 for daughter pregnancy rate, between 0.838 and 0.844 for fat yield, and between 0.850 and 0.863 for milk yield. The two SNP panels optimized on the three major factors had the highest genomic prediction accuracy (0.821-0.863), and these accuracies were very close to those obtained using observed 80K genotypes (0.825-0.868). Further exploration of the underlying relationships showed that genomic prediction accuracies did not respond linearly to imputation accuracies, but were significantly affected by genotype (imputation) errors of SNPs in association with the traits to be predicted. SNPs optimal for map coverage and MAF were favorable for obtaining accurate imputation of genotypes whereas trait-associated SNPs improved genomic prediction accuracies. Thus, optimal LD SNP panels were the ones that combined both strengths. The present results have practical implications on the design of LD SNP chips for imputation-enabled genomic prediction.
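
    Two of the selection factors named in this record, even spacing and high minor allele frequency (MAF), can be combined in a simple way: split the chromosome into equal bins and keep the highest-MAF SNP in each bin. The sketch below illustrates that idea only. It is an assumed simplification, not one of the eight strategies compared in the study, and the tuple layout and function name are hypothetical.

```python
def select_low_density(snps, n_bins, chrom_length):
    """Pick at most one SNP per equal-width bin, preferring the highest MAF.

    snps: iterable of (name, position_bp, maf) tuples for one chromosome.
    Returns the selected SNP names ordered by position bin.
    """
    bin_size = chrom_length / n_bins
    best = {}
    for name, pos, maf in snps:
        # Clamp so a SNP at the very end of the chromosome falls in the last bin.
        b = min(int(pos // bin_size), n_bins - 1)
        if b not in best or maf > best[b][2]:
            best[b] = (name, pos, maf)
    return [best[b][0] for b in sorted(best)]


snps = [("a", 10, 0.10), ("b", 40, 0.40), ("c", 60, 0.30),
        ("d", 90, 0.45), ("e", 95, 0.20)]
panel = select_low_density(snps, n_bins=2, chrom_length=100)
```

    Adding trait association, the third factor in the abstract, would mean replacing the MAF comparison with a score that also weights each SNP's association signal.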

  7. Identification of T1D susceptibility genes within the MHC region by combining protein interaction networks and SNP genotyping data

    DEFF Research Database (Denmark)

    Brorsson, C.; Hansen, Niclas Tue; Hansen, Kasper Lage

    2009-01-01

    genes. We have developed a novel method that combines single nucleotide polymorphism (SNP) genotyping data with protein-protein interaction (ppi) networks to identify disease-associated network modules enriched for proteins encoded from the MHC region. Approximately 2500 SNPs located in the 4 Mb MHC......To develop novel methods for identifying new genes that contribute to the risk of developing type 1 diabetes within the Major Histocompatibility Complex (MHC) region on chromosome 6, independently of the known linkage disequilibrium (LD) between human leucocyte antigen (HLA)-DRB1, -DQA1, -DQB1...... region were analysed in 1000 affected offspring trios generated by the Type 1 Diabetes Genetics Consortium (T1DGC). The most associated SNP in each gene was chosen and genes were mapped to ppi networks for identification of interaction partners. The association testing and resulting interacting protein...

  8. An integrated SNP mining and utilization (ISMU) pipeline for next generation sequencing data.

    Science.gov (United States)

    Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A V S K; Varshney, Rajeev K

    2014-01-01

    Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly require working knowledge of the command line interface, massive computational resources and expertise, which is a daunting task for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in the standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for members of the plant genetics and breeding community with no computational expertise who want to discover SNPs and utilize them in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. It has been developed in Java language and is available at http://hpc.icrisat.cgiar.org/ISMU as a standalone

  9. High-throughput genotyping of single nucleotide polymorphisms with rolling circle amplification

    Directory of Open Access Journals (Sweden)

    Sun Zhenyu

    2001-08-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs) are the foundation of powerful complex trait and pharmacogenomic analyses. The availability of large SNP databases, however, has emphasized a need for inexpensive SNP genotyping methods of commensurate simplicity, robustness, and scalability. We describe a solution-based, microtiter plate method for SNP genotyping of human genomic DNA. The method is based upon allele discrimination by ligation of open circle probes followed by rolling circle amplification of the signal using fluorescent primers. Only the probe with a 3' base complementary to the SNP is circularized by ligation. Results SNP scoring by ligation was optimized to a 100,000-fold discrimination against probe mismatched to the SNP. The assay was used to genotype 10 SNPs from a set of 192 genomic DNA samples in a high-throughput format. Assay directly from genomic DNA eliminates the need to preamplify the target as done for many other genotyping methods. The sensitivity of the assay was demonstrated by genotyping from 1 ng of genomic DNA. We demonstrate that the assay can detect a single molecule of the circularized probe. Conclusions Compatibility with homogeneous formats and the ability to assay small amounts of genomic DNA meet the exacting requirements of automated, high-throughput SNP scoring.

  10. Light whole genome sequence for SNP discovery across domestic cat breeds

    Directory of Open Access Journals (Sweden)

    Driscoll Carlos

    2010-06-01

    Full Text Available Abstract Background The domestic cat has offered enormous genomic potential in the veterinary description of over 250 hereditary disease models as well as the occurrence of several deadly feline viruses (feline leukemia virus, FeLV; feline coronavirus, FECV; feline immunodeficiency virus, FIV) that are homologues to human scourges (cancer, SARS, and AIDS, respectively). However, to realize this bio-medical potential, a high density single nucleotide polymorphism (SNP) map is required in order to accomplish disease and phenotype association discovery. Description To remedy this, we generated 3,178,297 paired fosmid-end Sanger sequence reads from seven cats, and combined these data with the publicly available 2X cat whole genome sequence. All sequence reads were assembled together to form a 3X whole genome assembly allowing the discovery of over three million SNPs. To reduce potential false positive SNPs due to the low coverage assembly, a low upper-limit was placed on sequence coverage and a high lower-limit on the quality of the discrepant bases at a potential variant site. In all, domestic cats of different breeds (female Abyssinian, female American shorthair, male Cornish Rex, female European Burmese, female Persian, female Siamese, a male Ragdoll) and a female African wildcat were sequenced lightly. We report a total of 964 k common SNPs suitable for a domestic cat SNP genotyping array and an additional 900 k SNPs detected between African wildcat and domestic cat breeds. An empirical sampling of 94 discovered SNPs was tested in the sequenced cats, resulting in a SNP validation rate of 99%. Conclusions These data provide a large collection of mapped feline SNPs across the cat genome that will allow for the development of SNP genotyping platforms for mapping feline diseases.

  11. Direct inference of SNP heterozygosity rates and resolution of LOH detection.

    Directory of Open Access Journals (Sweden)

    Xiaohong Li

    2007-11-01

    Full Text Available Single nucleotide polymorphisms (SNPs) have been increasingly utilized to investigate somatic genetic abnormalities in premalignancy and cancer. LOH is a common alteration observed during cancer development, and SNP assays have been used to identify LOH at specific chromosomal regions. The design of such studies requires consideration of the resolution for detecting LOH throughout the genome and identification of the number and location of SNPs required to detect genetic alterations in specific genomic regions. Our study evaluated SNP distribution patterns and used probability models, Monte Carlo simulation, and real human subject genotype data to investigate the relationships between the number of SNPs, SNP HET rates, and the sensitivity (resolution) for detecting LOH. We report that variances of SNP heterozygosity rate in dbSNP are high for a large proportion of SNPs. Two statistical methods proposed for directly inferring SNP heterozygosity rates require much smaller sample sizes (intermediate sizes) and are feasible for practical use in SNP selection or verification. Using HapMap data, we showed that a region of LOH greater than 200 kb can be reliably detected, with losses smaller than 50 kb having a substantially lower detection probability when using all SNPs currently in the HapMap database. Higher densities of SNPs may exist in certain local chromosomal regions that provide some opportunities for reliably detecting LOH of segment sizes smaller than 50 kb. These results suggest that the interpretation of the results from genome-wide scans for LOH using commercial arrays needs to consider the relationships among inter-SNP distance, detection probability, and sample size for a specific study. New experimental designs for LOH studies would also benefit from considering the power of detection and sample sizes required to accomplish the proposed aims.
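
    The link between SNP heterozygosity rates and LOH detection can be made concrete with a simple probability sketch. LOH is only observable at SNPs that are heterozygous in the subject's normal tissue, so a region is detectable only if it contains enough such informative SNPs. The code below assumes the SNPs are independent (it ignores linkage disequilibrium) and is an illustrative model, not the probability model used by the authors; the function name and the choice of k are hypothetical.

```python
def prob_at_least_k_informative(het_rates, k=1):
    """Probability that at least k SNPs in a region are heterozygous.

    het_rates: per-SNP heterozygosity rates for the SNPs inside the region.
    SNPs are treated as independent, which real data will violate to
    some degree because of linkage disequilibrium.
    """
    # dist[j] = probability that exactly j of the SNPs processed so far are heterozygous
    dist = [1.0]
    for h in het_rates:
        new = [0.0] * (len(dist) + 1)
        for j, p in enumerate(dist):
            new[j] += p * (1 - h)
            new[j + 1] += p * h
        dist = new
    return sum(dist[k:])


p_one = prob_at_least_k_informative([0.5, 0.5], k=1)
p_two = prob_at_least_k_informative([0.5, 0.5], k=2)
```

    Under this model a short region that holds few SNPs, or SNPs with low heterozygosity, has a low chance of containing enough informative markers, which matches the abstract's observation that small losses are harder to detect.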

  12. Development and application of a 20K SNP array in potato

    NARCIS (Netherlands)

    Vos, Peter

    2016-01-01

    In this thesis the results are described of investigations of various applications of genome wide SNP (single nucleotide polymorphism) markers. The set of SNP markers was identified by a GBS (genotyping by sequencing) strategy. The resulting dataset of 129,156 SNPs across 83 tetraploid varieties was

  13. A SNP-Based Molecular Barcode for Characterization of Common Wheat.

    Directory of Open Access Journals (Sweden)

    LiFeng Gao

    Full Text Available Wheat is grown as a staple crop worldwide. It is important to develop an effective genotyping tool for this cereal grain both to identify germplasm diversity and to protect the rights of breeders. Single-nucleotide polymorphism (SNP) genotyping provides a means for developing a practical, rapid, inexpensive and high-throughput assay. Here, we investigated SNPs as robust markers of genetic variation for typing wheat cultivars. We identified SNPs from an array of 9000 across a collection of 429 well-known wheat cultivars grown in China, of which 43 SNP markers with high minor allele frequency and variations discriminated the selected wheat varieties and their wild ancestors. This SNP-based barcode will allow for the rapid and precise identification of wheat germplasm resources and newly released varieties and will further assist in the wheat breeding program.
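
    A SNP barcode discriminates a set of cultivars when no two cultivars share the same genotype string across the chosen markers. The check below is a minimal sketch of that criterion under assumed inputs (a dict of cultivar name to genotype list, coded 0/1/2); it does not reproduce how the 43 markers were chosen in the study.

```python
def barcode_is_unique(genotypes, snp_indices):
    """Return True if the chosen SNPs give every cultivar a distinct barcode.

    genotypes: dict mapping cultivar name to a list of genotype codes.
    snp_indices: positions of the candidate barcode SNPs in those lists.
    """
    barcodes = [tuple(g[i] for i in snp_indices) for g in genotypes.values()]
    return len(set(barcodes)) == len(barcodes)


genotypes = {
    "A": [0, 2, 1],
    "B": [0, 2, 2],
    "C": [2, 2, 1],
}
one_snp = barcode_is_unique(genotypes, [0])
two_snps = barcode_is_unique(genotypes, [0, 2])
```

    In this toy data the first SNP alone cannot separate cultivars A and B, while adding the third SNP separates all three.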

  14. A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation.

    Science.gov (United States)

    Howe, Glenn T; Yu, Jianbin; Knaus, Brian; Cronn, Richard; Kolpak, Scott; Dolan, Peter; Lorenz, W Walter; Dean, Jeffrey F D

    2013-02-28

    Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array, which is more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to
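
    The validation figures in this record can be checked with a few lines of arithmetic. The sketch assumes that the "~200,000 true SNPs" estimate comes from scaling the putative SNP count by the observed validation rate, which is consistent with the numbers given but is not stated explicitly in the abstract.

```python
assayed = 8067        # SNPs placed on the Infinium array
validated = 5847      # SNPs called successfully and polymorphic
putative = 278_979    # unique putative SNPs in the database

validation_rate = validated / assayed
# Assumed extrapolation: apply the validation rate to all putative SNPs.
expected_true_snps = putative * validation_rate

print(f"validation rate: {validation_rate:.1%}")
print(f"expected true SNPs: {expected_true_snps:,.0f}")
```

    The rate rounds to the 72.5% reported, and the extrapolated count lands slightly above 200,000.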

  15. Dissection of Genetic Factors underlying Wheat Kernel Shape and Size in an Elite × Nonadapted Cross using a High Density SNP Linkage Map

    Directory of Open Access Journals (Sweden)

    Ajay Kumar

    2016-03-01

    Full Text Available Wheat kernel shape and size has been under selection since early domestication. Kernel morphology is a major consideration in wheat breeding, as it impacts grain yield and quality. A population of 160 recombinant inbred lines (RIL), developed using an elite (ND 705) and a nonadapted genotype (PI 414566), was extensively phenotyped in replicated field trials and genotyped using Infinium iSelect 90K assay to gain insight into the genetic architecture of kernel shape and size. A high density genetic map consisting of 10,172 single nucleotide polymorphism (SNP) markers, with an average marker density of 0.39 cM/marker, identified a total of 29 genomic regions associated with six grain shape and size traits; ∼80% of these regions were associated with multiple traits. The analyses showed that kernel length (KL) and width (KW) are genetically independent, while a large number (∼59%) of the quantitative trait loci (QTL) for kernel shape traits were in common with genomic regions associated with kernel size traits. The most significant QTL was identified on chromosome 4B, and could be an ortholog of major rice grain size and shape gene or . Major and stable loci also were identified on the homeologous regions of Group 5 chromosomes, and in the regions of (6A) and (7A) genes. Both parental genotypes contributed equivalent positive QTL alleles, suggesting that the nonadapted germplasm has a great potential for enhancing the gene pool for grain shape and size. This study provides new knowledge on the genetic dissection of kernel morphology, with a much higher resolution, which may aid further improvement in wheat yield and quality using genomic tools.

  16. Temperature Switch PCR (TSP): Robust assay design for reliable amplification and genotyping of SNPs

    Directory of Open Access Journals (Sweden)

    Mather Diane E

    2009-12-01

    Full Text Available Background: Many research and diagnostic applications rely upon the assay of individual single nucleotide polymorphisms (SNPs). Thus, methods to improve the speed and efficiency for single-marker SNP genotyping are highly desirable. Here, we describe the method of temperature-switch PCR (TSP), a biphasic four-primer PCR system with a universal primer design that permits amplification of the target locus in the first phase of thermal cycling before switching to the detection of the alleles. TSP can simplify assay design for a range of commonly used single-marker SNP genotyping methods, and reduce the requirement for individual assay optimization and operator expertise in the deployment of SNP assays. Results: We demonstrate the utility of TSP for the rapid construction of robust and convenient endpoint SNP genotyping assays based on allele-specific PCR and high resolution melt analysis by generating a total of 11,232 data points. The TSP assays were performed under standardised reaction conditions, requiring minimal optimization of individual assays. High genotyping accuracy was verified by 100% concordance of TSP genotypes in a blinded study with an independent genotyping method. Conclusion: Theoretically, TSP can be directly incorporated into the design of assays for most current single-marker SNP genotyping methods. TSP provides several technological advances for single-marker SNP genotyping including simplified assay design and development, increased assay specificity and genotyping accuracy, and opportunities for assay automation. By reducing the requirement for operator expertise, TSP provides opportunities to deploy a wider range of single-marker SNP genotyping methods in the laboratory. TSP has broad applications and can be deployed in any animal and plant species.

  17. Development and validation of a 20K single nucleotide polymorphism (SNP) whole genome genotyping array for apple (Malus × domestica Borkh).

    Science.gov (United States)

    Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

    2014-01-01

    High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.

  18. Development and validation of a 20K single nucleotide polymorphism (SNP) whole genome genotyping array for apple (Malus × domestica Borkh.)

    Directory of Open Access Journals (Sweden)

    Luca Bianco

    Full Text Available High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.

  19. Development and Validation of a 20K Single Nucleotide Polymorphism (SNP) Whole Genome Genotyping Array for Apple (Malus × domestica Borkh)

    Science.gov (United States)

    Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

    2014-01-01

    High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs. PMID:25303088

  20. Polygenic analysis of genome-wide SNP data identifies common variants on allergic rhinitis

    DEFF Research Database (Denmark)

    Mohammadnejad, Afsaneh; Brasch-Andersen, Charlotte; Haagerup, Annette

    Background: Allergic Rhinitis (AR) is a complex disorder that affects many people around the world. There is a high genetic contribution to the development of the AR, as twins and family studies have estimated heritability of more than 33%. Due to the complex nature of the disease, single SNP...... analysis has limited power in identifying the genetic variations for AR. We combined genome-wide association analysis (GWAS) with polygenic risk score (PRS) in exploring the genetic basis underlying the disease. Methods: We collected clinical data on 631 Danish subjects with AR cases consisting of 434...... sibling pairs and unrelated individuals and control subjects of 197 unrelated individuals. SNP genotyping was done by Affymetrix Genome-Wide Human SNP Array 5.0. SNP imputation was performed using "IMPUTE2". Using additive effect model, GWAS was conducted in discovery sample, the genotypes...

  1. A low-density SNP array for analyzing differential selection in freshwater and marine populations of threespine stickleback (Gasterosteus aculeatus).

    Science.gov (United States)

    Ferchaud, Anne-Laure; Pedersen, Susanne H; Bekkevold, Dorte; Jian, Jianbo; Niu, Yongchao; Hansen, Michael M

    2014-10-06

    The threespine stickleback (Gasterosteus aculeatus) has become an important model species for studying both contemporary and parallel evolution. In particular, differential adaptation to freshwater and marine environments has led to high differentiation between freshwater and marine stickleback populations at the phenotypic trait of lateral plate morphology and the underlying candidate gene Ectodysplasin (EDA). Many studies have focused on this trait and candidate gene, although other genes involved in marine-freshwater adaptation may be equally important. In order to develop a resource for rapid and cost-efficient analysis of genetic divergence between freshwater and marine sticklebacks, we generated a low-density SNP (Single Nucleotide Polymorphism) array encompassing markers of chromosome regions under putative directional selection, along with neutral markers for background. RAD (Restriction site Associated DNA) sequencing of sixty individuals representing two freshwater and one marine population led to the identification of 33,993 SNP markers. Ninety-six of these were chosen for the low-density SNP array, among which 70 represented SNPs under putatively directional selection in freshwater vs. marine environments, whereas 26 SNPs were assumed to be neutral. Annotation of these regions revealed several genes that are candidates for affecting stickleback phenotypic variation, some of which have been observed in previous studies whereas others are new. We have developed a cost-efficient low-density SNP array that allows for rapid screening of polymorphisms in threespine stickleback. The array provides a valuable tool for analyzing adaptive divergence between freshwater and marine stickleback populations beyond the well-established candidate gene Ectodysplasin (EDA).

  2. Genomewide high-density SNP linkage analysis of non-BRCA1/2 breast cancer families identifies various candidate regions and has greater power than microsatellite studies

    Directory of Open Access Journals (Sweden)

    Gonzalez-Neira Anna

    2007-08-01

    Full Text Available Background: The recent development of new high-throughput technologies for SNP genotyping has opened the possibility of taking a genome-wide linkage approach to the search for new candidate genes involved in hereditary diseases. The two major breast cancer susceptibility genes BRCA1 and BRCA2 are involved in 30% of hereditary breast cancer cases, but the discovery of additional breast cancer predisposition genes for the non-BRCA1/2 breast cancer families has so far been unsuccessful. Results: In order to evaluate the power improvement provided by using SNP markers in a real situation, we have performed a whole genome screen of 19 non-BRCA1/2 breast cancer families using 4720 genomewide SNPs with Illumina technology (Illumina's Linkage III Panel), with an average distance of 615 Kb/SNP. We identified six regions on chromosomes 2, 3, 4, 7, 11 and 14 as candidates to contain genes involved in breast cancer susceptibility, and additional fine mapping genotyping using microsatellite markers around linkage peaks confirmed five of them, excluding the region on chromosome 3. These results were consistent in analyses that excluded SNPs in high linkage disequilibrium. The results were compared with those obtained previously using a 10 cM microsatellite scan (STR-GWS), and we found lower or non-significant linkage signals with STR-GWS data compared to SNP data in all cases. Conclusion: Our results show the power increase that SNPs can supply in linkage studies.

  3. Effect of Tryptophan Hydroxylase-2 rs7305115 SNP on suicide attempts risk in major depression

    Directory of Open Access Journals (Sweden)

    Zhang Yuqi

    2010-08-01

    Full Text Available Background: Suicide and major depressive disorders (MDD) are strongly associated, and genetic factors are responsible for at least part of the variability in suicide risk. We investigated whether variation at the tryptophan hydroxylase-2 (TPH2) gene rs7305115 SNP may predispose to suicide attempts in MDD. Methods: We genotyped TPH2 gene rs7305115 SNP in 215 MDD patients with suicide and matched MDD patients without suicide. Differences in behavioral and personality traits according to genotypic variation were investigated by logistic regression analysis. Results: There were no significant differences between MDD patients with suicide and controls in genotypic (AG and GG) frequencies for rs7305115 SNP, but the distribution of AA genotype differed significantly (14.4% vs. 29.3%, p [...]). Conclusions: The study suggested that hopelessness, negative life events and family history of suicide were risk factors of attempted suicide in MDD while the TPH2 rs7305115A remained a significant protective predictor of suicide attempts.

  4. Construction of High Density Sweet Cherry (Prunus avium L.) Linkage Maps Using Microsatellite Markers and SNPs Detected by Genotyping-by-Sequencing (GBS).

    Directory of Open Access Journals (Sweden)

    Verónica Guajardo

    Full Text Available Linkage maps are valuable tools in genetic and genomic studies. For sweet cherry, linkage maps have been constructed using mainly microsatellite markers (SSRs) and, recently, using single nucleotide polymorphism markers (SNPs) from a cherry 6K SNP array. Genotyping-by-sequencing (GBS), a new methodology based on high-throughput sequencing, holds great promise for identification of high number of SNPs and construction of high density linkage maps. In this study, GBS was used to identify SNPs from an intra-specific sweet cherry cross. A total of 8,476 high quality SNPs were selected for mapping. The physical position for each SNP was determined using the peach genome, Peach v1.0, as reference, and a homogeneous distribution of markers along the eight peach scaffolds was obtained. On average, 65.6% of the SNPs were present in genic regions and 49.8% were located in exonic regions. In addition to the SNPs, a group of SSRs was also used for construction of linkage maps. Parental and consensus high density maps were constructed by genotyping 166 siblings from a 'Rainier' x 'Rivedel' (Ra x Ri) cross. Using Ra x Ri population, 462, 489 and 985 markers were mapped into eight linkage groups in 'Rainier', 'Rivedel' and the Ra x Ri map, respectively, with 80% of mapped SNPs located in genic regions. Obtained maps spanned 549.5, 582.6 and 731.3 cM for 'Rainier', 'Rivedel' and consensus maps, respectively, with an average distance of 1.2 cM between adjacent markers for both 'Rainier' and 'Rivedel' maps and of 0.7 cM for Ra x Ri map. High synteny and co-linearity was observed between obtained maps and with Peach v1.0. These new high density linkage maps provide valuable information on the sweet cherry genome, and serve as the basis for identification of QTLs and genes relevant for the breeding of the species.
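
    The reported average marker spacings can be checked against the map lengths and marker counts in this abstract. The sketch below assumes spacing is approximated as total map length divided by the number of mapped markers (a strict interval count would subtract one marker per linkage group); the dictionary and function names are illustrative. With the abstract's figures, the three values round to 1.2, 1.2 and 0.7 cM.

```python
# Map length (cM) and mapped marker counts as reported in the abstract.
MAPS = {
    "Rainier": (549.5, 462),
    "Rivedel": (582.6, 489),
    "Ra x Ri consensus": (731.3, 985),
}


def mean_spacing_cm(length_cm: float, n_markers: int) -> float:
    """Approximate mean spacing as map length divided by marker count."""
    return length_cm / n_markers


spacing = {name: mean_spacing_cm(*values) for name, values in MAPS.items()}
for name, value in spacing.items():
    print(f"{name}: {value:.2f} cM between adjacent markers")
```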

  5. Reliable Single Chip Genotyping with Semi-Parametric Log-Concave Mixtures

    NARCIS (Netherlands)

    R.C.A. Rippe (Ralph); J.J. Meulman (Jacqueline); P.H.C. Eilers (Paul)

    2012-01-01

    The common approach to SNP genotyping is to use (model-based) clustering per individual SNP, on a set of arrays. Genotyping all SNPs on a single array is much more attractive, in terms of flexibility, stability and applicability, when developing new chips. A new semi-parametric method,

  6. Dissection of Genetic Factors underlying Wheat Kernel Shape and Size in an Elite × Nonadapted Cross using a High Density SNP Linkage Map.

    Science.gov (United States)

    Kumar, Ajay; Mantovani, E E; Seetan, R; Soltani, A; Echeverry-Solarte, M; Jain, S; Simsek, S; Doehlert, D; Alamri, M S; Elias, E M; Kianian, S F; Mergoum, M

    2016-03-01

    Wheat kernel shape and size has been under selection since early domestication. Kernel morphology is a major consideration in wheat breeding, as it impacts grain yield and quality. A population of 160 recombinant inbred lines (RIL), developed using an elite (ND 705) and a nonadapted genotype (PI 414566), was extensively phenotyped in replicated field trials and genotyped using Infinium iSelect 90K assay to gain insight into the genetic architecture of kernel shape and size. A high density genetic map consisting of 10,172 single nucleotide polymorphism (SNP) markers, with an average marker density of 0.39 cM/marker, identified a total of 29 genomic regions associated with six grain shape and size traits; ∼80% of these regions were associated with multiple traits. The analyses showed that kernel length (KL) and width (KW) are genetically independent, while a large number (∼59%) of the quantitative trait loci (QTL) for kernel shape traits were in common with genomic regions associated with kernel size traits. The most significant QTL was identified on chromosome 4B, and could be an ortholog of major rice grain size and shape gene or . Major and stable loci also were identified on the homeologous regions of Group 5 chromosomes, and in the regions of (6A) and (7A) genes. Both parental genotypes contributed equivalent positive QTL alleles, suggesting that the nonadapted germplasm has a great potential for enhancing the gene pool for grain shape and size. This study provides new knowledge on the genetic dissection of kernel morphology, with a much higher resolution, which may aid further improvement in wheat yield and quality using genomic tools. Copyright © 2016 Crop Science Society of America.

  7. Maintenance of Chronic Fatigue Syndrome (CFS) in Young CFS Patients Is Associated with the 5-HTTLPR and SNP rs25531 A > G Genotype.

    Directory of Open Access Journals (Sweden)

    Benedicte Meyer

    Full Text Available Earlier studies have shown that genetic variability in the SLC6A4 gene encoding the serotonin transporter (5-HTT) may be important for the re-uptake of serotonin (5-HT) in the central nervous system. In the present study we investigated how the 5-HTT genotype, i.e. the short (S) versus long (L) 5-HTTLPR allele and the SNP rs25531 A > G, affect the physical and psychosocial functioning in patients with chronic fatigue syndrome (CFS). All 120 patients were recruited from The Department of Paediatrics at Oslo University Hospital, Norway, a national referral center for young CFS patients (12-18 years). Main outcomes were number of steps per day obtained by an accelerometer and disability scored by the Functional Disability Inventory (FDI). Patients with the 5-HTT SS or SLG genotype had a significantly lower number of steps per day than patients with the 5-HTT LALG, SLA or LALA genotype. Patients with the 5-HTT SS or SLG genotype also had a significantly higher FDI score than patients with the 5-HTT LALG, SLA or LALA genotype. Thus, CFS patients with the 5-HTT SS or SLG genotype had worse 30 weeks outcome than CFS patients with the 5-HTT LALG, SLA or LALA genotype. The present study suggests that the 5-HTT genotype may be a factor that contributes to maintenance of CFS.

  8. Involvement of Sodium Nitroprusside (SNP) in the Mechanism That Delays Stem Bending of Different Gerbera Cultivars

    Directory of Open Access Journals (Sweden)

    Aung H. Naing

    2017-11-01

    Full Text Available Longevity of cut flowers of many gerbera cultivars (Gerbera jamesonii) is typically short because of stem bending; hence, stem bending that occurs during the early vase life period is a major problem in gerbera. Here, we investigated the effects of sodium nitroprusside (SNP) on the delay of stem bending in the gerbera cultivars, Alliance, Rosalin, and Bintang, by examining relative fresh weight, bacterial density in the vase solution, transcriptional analysis of a lignin biosynthesis gene, antioxidant activity, and xylem blockage. All three gerbera cultivars responded to SNP by delaying stem bending, compared to the controls; however, the responses were dose- and cultivar-dependent. Among the treatments, SNP at 20 mg L⁻¹ was the best to delay stem bending in Alliance, while dosages of 10 and 5 mg L⁻¹ were the best for Rosalin and Bintang, respectively. However, stem bending in Alliance and Rosalin was faster than in Bintang, indicating a discrepancy influenced by genotype. According to our analysis of the role of SNP in the delay of stem bending, the results revealed that SNP treatment inhibited bacterial growth and xylem blockage, enhanced expression levels of a lignin biosynthesis gene, and maintained antioxidant activities. Therefore, it is suggested that the cause of stem bending is associated with the above-mentioned parameters and SNP is involved in the mechanism that delays stem bending in the different gerbera cultivars.

  9. Effect of Myostatin SNP on muscle fiber properties in male Thoroughbred horses during training period.

    Science.gov (United States)

    Miyata, Hirofumi; Itoh, Rika; Sato, Fumio; Takebe, Naoya; Hada, Tetsuro; Tozaki, Teruaki

    2017-10-20

    Variants of the Myostatin gene have been shown to have an influence on muscle hypertrophy phenotypes in a wide range of mammalian species. Recently, Thoroughbred horses with a C allele at the g.66493737C/T single-nucleotide polymorphism (SNP) have been reported to be suited to short-distance racing. In this study, we examined the effect of the Myostatin SNP on muscle fiber properties in young Thoroughbred horses during a training period. To investigate the effect of the Myostatin SNP on muscle fiber before training, the relative expression of several mRNAs was quantified in biopsy samples from the middle gluteal muscle of 27 untrained male Thoroughbred horses (1.5 years old) using real-time RT-PCR analysis. The remaining muscle samples were used for immunohistochemical analysis to determine the population and area of each fiber type. All measurements were re-evaluated in biopsy samples of the same horses after a 5-month period of conventional training. Although the expression of Myostatin mRNA decreased in all SNP genotypes, a significant decrease was found only in the C/C genotype after training. In contrast, expression of VEGFa, PGC1α, and SDHa mRNAs, which relate to the biogenesis of mitochondria and capillaries, was significantly higher (54-82%) in the T/T than in the C/C genotype after training. It is suggested that hypertrophy of muscle fiber is directly associated with a decrease in Myostatin mRNA expression in the C/C genotype, and that increased expression of VEGFa, PGC1α, and SDHa in the T/T genotype might be indirectly caused by the Myostatin SNP.

  10. Genomewide high-density SNP linkage analysis of non-BRCA1/2 breast cancer families identifies various candidate regions and has greater power than microsatellite studies

    NARCIS (Netherlands)

    A. González-Neira (Anna); J.M. Rosa-Rosa; A. Osorio (Ana); E. Gonzalez (Emilio); M.C. Southey (Melissa); O. Sinilnikova (Olga); H. Lynch (Henry); R.A. Oldenburg (Rogier); C.J. van Asperen (Christi); N. Hoogerbrugge (Nicoline); G. Pita (Guillermo); P. Devilee (Peter); D. Goldgar (David); J. Benítez (Javier)

    2007-01-01

    Background: The recent development of new high-throughput technologies for SNP genotyping has opened the possibility of taking a genome-wide linkage approach to the search for new candidate genes involved in hereditary diseases. The two major breast cancer susceptibility genes BRCA1 and

  11. Population structure of Atlantic Mackerel inferred from RAD-seq derived SNP markers: effects of sequence clustering parameters and hierarchical SNP selection

    KAUST Repository

    Rodríguez-Ezpeleta, Naiara; Bradbury, Ian R.; Mendibil, Iñaki; Álvarez, Paula; Cotano, Unai; Irigoien, Xabier

    2016-01-01

    : the maximum number of mismatches allowed to merge reads into a locus and the relatedness of the individuals used for genotype calling and SNP selection. Our study resolves the population structure of the Atlantic mackerel, but, most importantly, provides

  12. Development and validation of the Axiom(®) Apple480K SNP genotyping array.

    Science.gov (United States)

    Bianco, Luca; Cestaro, Alessandro; Linsmith, Gareth; Muranty, Hélène; Denancé, Caroline; Théron, Anthony; Poncet, Charles; Micheletti, Diego; Kerschbamer, Emanuela; Di Pierro, Erica A; Larger, Simone; Pindo, Massimo; Van de Weg, Eric; Davassi, Alessandro; Laurens, François; Velasco, Riccardo; Durel, Charles-Eric; Troggio, Michela

    2016-04-01

    Cultivated apple (Malus × domestica Borkh.) is one of the most important fruit crops in temperate regions, and has great economic and cultural value. The apple genome is highly heterozygous and has undergone a recent duplication which, combined with a rapid linkage disequilibrium decay, makes it difficult to perform genome-wide association (GWA) studies. Single nucleotide polymorphism arrays offer highly multiplexed assays at a relatively low cost per data point and can be a valid tool for the identification of the markers associated with traits of interest. Here, we describe the development and validation of a 487K SNP Affymetrix Axiom(®) genotyping array for apple and discuss its potential applications. The array has been built from the high-depth resequencing of 63 different cultivars covering most of the genetic diversity in cultivated apple. The SNPs were chosen by applying a focal points approach to enrich genic regions, but also to reach a uniform coverage of non-genic regions. A total of 1324 apple accessions, including the 92 progenies of two mapping populations, have been genotyped with the Axiom(®) Apple480K to assess the effectiveness of the array. A large majority of SNPs (359 994 or 74%) fell in the stringent class of poly high resolution polymorphisms. We also devised a filtering procedure to identify a subset of 275K very robust markers that can be safely used for germplasm surveys in apple. The Axiom(®) Apple480K has now been commercially released both for public and proprietary use and will likely be a reference tool for GWA studies in apple. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
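
    The 74% share quoted for the poly high resolution class can be reproduced as a simple proportion. The abstract gives the array size only as "487K", so the sketch below assumes exactly 487,000 SNPs as the denominator; that total and the function name are assumptions for illustration, not figures from the record.

```python
POLY_HIGH_RES_SNPS = 359_994  # from the abstract
ASSUMED_ARRAY_SNPS = 487_000  # assumption: "487K" read as exactly 487,000


def percent(part: int, whole: int) -> float:
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


share = percent(POLY_HIGH_RES_SNPS, ASSUMED_ARRAY_SNPS)
print(f"poly high resolution share: {share:.1f}%")
```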

  13. Utility of the pooling approach as applied to whole genome association scans with high-density Affymetrix microarrays

    Directory of Open Access Journals (Sweden)

    Gray Joanna

    2010-11-01

    Full Text Available Background: We report an attempt to extend the previously successful approach of combining SNP (single nucleotide polymorphism) microarrays and DNA pooling (SNP-MaP) employing high-density microarrays. Whereas earlier studies employed a range of Affymetrix SNP microarrays comprising from 10 K to 500 K SNPs, this most recent investigation used the 6.0 chip which displays 906,600 SNP probes and 946,000 probes for the interrogation of CNVs (copy number variations). The genotyping assay using the Affymetrix SNP 6.0 array is highly demanding on sample quality due to the small feature size, low redundancy, and lack of mismatch probes. Findings: In the first study published so far using this microarray on pooled DNA, we found that pooled cheek swab DNA could not accurately predict real allele frequencies of the samples that comprised the pools. In contrast, the allele frequency estimates using blood DNA pools were reasonable, although inferior compared to those obtained with previously employed Affymetrix microarrays. However, it might be possible to improve performance by developing improved analysis methods. Conclusions: Despite the decreasing costs of genome-wide individual genotyping, the pooling approach may have applications in very large-scale case-control association studies. In such cases, our study suggests that high-quality DNA preparations and lower density platforms should be preferred.

  14. Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography

    Science.gov (United States)

    Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi

    2013-01-01

    New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO-defined ‘elimination’ status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools was applied to a group of 201 multibacillary patients including six multi-case families from eleven departments. The locations (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR was developed for rapid and inexpensive strain typing of M. leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614(T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614(C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. PMID:23291420

  15. Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography.

    Science.gov (United States)

    Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi

    2013-03-01

    New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO-defined 'elimination' status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools was applied to a group of 201 multibacillary patients including six multi-case families from eleven departments. The locations (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR was developed for rapid and inexpensive strain typing of Mycobacterium leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614(T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614(C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. Copyright © 2012 Elsevier B.V. All rights reserved.

  16. Genome-wide SNP detection, validation, and development of an 8K SNP array for apple.

    Directory of Open Access Journals (Sweden)

    David Chagné

    Full Text Available As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of 'Golden Delicious', SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple.

  17. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    Science.gov (United States)

    Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718
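
    The figures quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers stated above; the implied sequence length is an inference from "one SNP to every 288 bp" and is not a figure given in the abstract, and the variable names are illustrative.

```python
# Figures quoted in the abstract above.
total_snps = 2_113_120   # SNPs detected during discovery
bp_per_snp = 288         # reported average spacing: one SNP per 288 bp

# Implied length of sequence surveyed (an inference, not a reported value).
implied_bp = total_snps * bp_per_snp
print(f"Implied sequence length: {implied_bp / 1e6:.1f} Mb")

# Share of the array SNPs that were polymorphic after evaluation.
array_snps = 7867
polymorphic_snps = 5554
polymorphic_fraction = polymorphic_snps / array_snps
print(f"Polymorphic fraction: {polymorphic_fraction:.1%}")
```

    Under these figures the discovery set corresponds to roughly 609 Mb of surveyed sequence, and about 71% of the array SNPs were polymorphic.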

  18. Genotyping of single spore isolates of a Pasteuria penetrans population occurring in Florida using SNP-based markers.

    Science.gov (United States)

    Joseph, S; Schmidt, L M; Danquah, W B; Timper, P; Mekete, T

    2017-02-01

    The aims were to generate single spore lines from a Florida population of Pasteuria penetrans, a bacterial parasite of root-knot nematode (RKN), and to examine the genotypic variation and virulence characteristics that exist within the population. Six single spore lines (SSP), 16SSP, 17SSP, 18SSP, 25SSP, 26SSP and 30SSP were generated. Genetic variability was evaluated by comparing single-nucleotide polymorphisms (SNPs) in six protein-coding genes and the 16S rRNA gene. An average of one SNP was observed for every 69 bp in the 16S rRNA, whereas no SNPs were observed in the protein-coding sequences. Hierarchical cluster analysis of 16S rRNA sequences placed the clones into three distinct clades. Bio-efficacy analysis revealed significant heterogeneity in the level of virulence and host specificity between the individual clones. The SNP markers developed for the 5' hypervariable region of the 16S rRNA gene may be useful in biotype differentiation within a population of P. penetrans. This study demonstrates an efficient method for generating single spore lines of P. penetrans and gives insight into the genetic heterogeneity and varying levels of virulence that exist within a population parasitizing a specific Meloidogyne sp. host. The results also suggest that the application of generalist spore lines in nematode management may achieve broad RKN control. © 2016 The Society for Applied Microbiology.

  19. MDM2 SNP309 and SNP285 Act as Negative Prognostic Markers for Non-small Cell Lung Cancer Adenocarcinoma Patients

    Science.gov (United States)

    Deben, Christophe; Op de Beeck, Ken; Van den Bossche, Jolien; Jacobs, Julie; Lardon, Filip; Wouters, An; Peeters, Marc; Van Camp, Guy; Rolfo, Christian; Deschoolmeester, Vanessa; Pauwels, Patrick

    2017-01-01

    Objectives: Two functional polymorphisms in the MDM2 promoter region, SNP309T>G and SNP285G>C, have been shown to impact MDM2 expression and cancer risk. Currently available data on the prognostic value of MDM2 SNP309 in non-small cell lung cancer (NSCLC) is contradictory and unavailable for SNP285. The goal of this study was to clarify the role of these MDM2 SNPs in the outcome of NSCLC patients. Materials and Methods: In this study we genotyped SNP309 and SNP285 in 98 NSCLC adenocarcinoma patients and determined MDM2 mRNA and protein levels. In addition, we assessed the prognostic value of these common SNPs on overall and progression free survival, taking into account the TP53 status of the tumor. Results and Conclusion: We found that the SNP285C allele, but not the SNP309G allele, was significantly associated with increased MDM2 mRNA expression levels (p = 0.025). However, we did not observe an association with MDM2 protein levels for SNP285. The SNP309G allele was significantly associated with the presence of wild type TP53 (p = 0.047) and showed a strong trend towards increased MDM2 protein levels (p = 0.068). In addition, patients harboring the SNP309G allele showed a worse overall survival, but only in the presence of wild type TP53. The SNP285C allele was significantly associated with an early age of diagnosis and metastasis. Additionally, the SNP285C allele acted as an independent predictor for worse progression free survival (HR = 3.97; 95% CI = 1.51 - 10.42; p = 0.005). Our data showed that both SNP309 (in the presence of wild type TP53) and SNP285 act as negative prognostic markers for NSCLC patients, implicating a prominent role for these variants in the outcome of these patients. PMID:28819417

  20. The polymorphisms of P53 codon 72 and MDM2 SNP309 and renal cell carcinoma risk in a low arsenic exposure area

    Energy Technology Data Exchange (ETDEWEB)

    Huang, Chao-Yuan [Graduate Institute of Clinical Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan (China); Department of Urology, National Taiwan University Hospital, College of Medicine National Taiwan University, Taipei, Taiwan (China); Su, Chien-Tien [Department of Family Medicine, Taipei Medical University Hospital, Taipei, Taiwan (China); Chu, Jan-Show [Graduate Institute of Clinical Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan (China); Department of Pathology, College of Medicine, Taipei Medical University, Taipei, Taiwan (China); Huang, Shu-Pin [Department of Urology, Kaohsiung Medical University Hospital, College of Medicine Kaohsiung Medical University, Kaohsiung, Taiwan (China); Pu, Yeong-Shiau [Department of Urology, National Taiwan University Hospital, College of Medicine National Taiwan University, Taipei, Taiwan (China); Yang, Hsiu-Yuan [School of Public Health, College of Public Health and Nutrition, Taipei Medical University, Taipei, Taiwan (China); Chung, Chi-Jung [Department of Medical Research, China Medical University Hospital, Taichung, Taiwan (China); Department of Health Risk Management, College of Public Health, China Medical University, Taichung, Taiwan (China); Wu, Chia-Chang [School of Public Health, College of Public Health and Nutrition, Taipei Medical University, Taipei, Taiwan (China); Department of Urology, Taipei Medical University-Shuang Ho Hospital, Taipei, Taiwan (China); Hsueh, Yu-Mei, E-mail: ymhsueh@tmu.edu.tw [School of Public Health, College of Public Health and Nutrition, Taipei Medical University, Taipei, Taiwan (China); Department of Public Health, School of Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan (China)

    2011-12-15

    Our recent study demonstrated the increased risk of renal cell carcinoma (RCC) associated with high urinary total arsenic levels among people living in a low arsenic exposure area. Genomic instability is important in arsenic carcinogenesis. This study evaluated the relationship between the polymorphisms of p53, p21, and MDM2, which play a role in gene stability, and the arsenic-related RCC risk. Here, we found that p53 Pro/Pro genotype and MDM2 SNP309 GG genotype significantly increased RCC risk compared to the p53 Arg/Arg genotype and MDM2 SNP309 TT genotype. RCC patients with the p53 Arg/Arg genotype had a significantly low percentage of inorganic arsenic, a low percentage of monomethylarsonic acid (MMA), and a high percentage of dimethylarsinic acid (DMA), which indicates efficient arsenic methylation capacity. Subjects with the p53 Arg/Pro + Pro/Pro genotype or MDM2 SNP309 TG + GG genotype, in conjunction with high urinary total arsenic (≥ 14.02 μg/L), had a significantly higher RCC risk than those with the p53 Arg/Arg or MDM2 SNP309 TT genotypes and low urinary total arsenic. Taken together, this is the first study to show that a variant genotype of p53 Arg72Pro or MDM2 SNP309 may modify the arsenic-related RCC risk even in a non-obvious arsenic exposure area. Highlights: ► Subjects with p53 Pro/Pro or MDM2 GG genotype had significantly increased RCC risk. ► A significant multiplicative joint effect of p53 and p21 on RCC risk. ► RCC patients with p53 Arg/Arg genotype had efficient arsenic methylation capacity. ► Joint effect of p53 or MDM2 genotype and high urinary total arsenic on RCC risk.

  1. Bovine exome sequence analysis and targeted SNP genotyping of recessive fertility defects BH1, HH2, and HH3 reveal a putative causative mutation in SMC2 for HH3.

    Science.gov (United States)

    McClure, Matthew C; Bickhart, Derek; Null, Dan; Vanraden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B; Van Tassell, Curtis P; Sonstegard, Tad S

    2014-01-01

    The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array.

  2. Bovine exome sequence analysis and targeted SNP genotyping of recessive fertility defects BH1, HH2, and HH3 reveal a putative causative mutation in SMC2 for HH3.

    Directory of Open Access Journals (Sweden)

    Matthew C McClure

    Full Text Available The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array.

  3. Detecting selection signatures between Duroc and Duroc synthetic pig populations using high-density SNP chip.

    Science.gov (United States)

    Edea, Z; Hong, J-K; Jung, J-H; Kim, D-W; Kim, Y-M; Kim, E-S; Shin, S S; Jung, Y C; Kim, K-S

    2017-08-01

    The development of high throughput genotyping techniques has facilitated the identification of selection signatures of pigs. The detection of genomic selection signals in a population subjected to differential selection pressures may provide insights into the genes associated with economically and biologically important traits. To identify genomic regions under selection, we genotyped 488 Duroc (D) pigs and 155 D × Korean native pigs (DKNPs) using the Porcine SNP70K BeadChip. By applying the FST and extended haplotype homozygosity (EHH-Rsb) methods, we detected genes under directional selection associated with growth/stature (DOCK7, PLCB4, HS2ST1, FBP2 and TG), carcass and meat quality (TG, COL14A1, FBXO5, NR3C1, SNX7, ARHGAP26 and DPYD), number of teats (LOC100153159 and LRRC1), pigmentation (MME) and ear morphology (SOX5), which are all mostly near or at fixation. These results could be a basis for investigating the underlying mutations associated with observed phenotypic variation. Validation using genome-wide association analysis would also facilitate the inclusion of some of these markers in genetic evaluation programs. © 2017 Stichting International Foundation for Animal Genetics.
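
    As a rough illustration of the fixation-index idea used in this record, the sketch below computes Wright's FST for a single biallelic SNP from the allele frequency in two populations. The abstract does not say which FST estimator the authors used, so the equal-weight heterozygosity form here, the function name `wright_fst` and the example frequencies are all assumptions made for illustration.

```python
def wright_fst(p1: float, p2: float) -> float:
    """Wright's FST for one biallelic SNP, given the frequency of the same
    allele in two populations (populations weighted equally)."""
    for p in (p1, p2):
        if not 0.0 <= p <= 1.0:
            raise ValueError("allele frequencies must lie in [0, 1]")
    p_bar = (p1 + p2) / 2
    # Expected heterozygosity in the pooled (total) population.
    h_total = 2 * p_bar * (1 - p_bar)
    # Mean expected heterozygosity within the two populations.
    h_within = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2
    if h_total == 0:
        # Locus is fixed for the same allele in both populations.
        return 0.0
    return (h_total - h_within) / h_total


# Hypothetical frequencies: strongly differentiated vs. identical populations.
print(wright_fst(0.9, 0.1))
print(wright_fst(0.3, 0.3))
```

    Values near 1 indicate a locus close to fixation for different alleles in the two populations, which is the kind of signal a selection scan looks for; values near 0 indicate similar allele frequencies.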

  4. Response of cactus pear genotypes to different crop densities

    Directory of Open Access Journals (Sweden)

    Lucas Aroaldo Dantas Cavalcante

    2014-09-01

    Full Text Available Considering the importance of cactus pear as an alimentary alternative for the herd of cattle of the Brazilian semiarid region and the effect of crop spacing among plants, this study aimed to evaluate the morphometry, yield and chemical-bromatological composition of cactus pear genotypes, under different cropping densities. The experimental design was completely randomized blocks, in a 3x4 factorial scheme. Treatments consisted of a combination of three cactus pear genotypes (Gigante, Redonda and Miúda) and four cropping densities (10,000 plants ha-1; 20,000 plants ha-1; 40,000 plants ha-1; and 80,000 plants ha-1), with three replications. The cactus pear genotypes reacted differently, regarding morphometry, yield and chemical-bromatological composition, and, regardless of the species, the denser planting increased yield (tons ha-1). The Miúda palm presented the highest dry matter yield and consequently the greater accumulation of total digestible nutrients, raw protein and water per hectare, as well as the highest in vitro dry matter digestibility.

  5. Genomic analyses of tropical beef cattle fertility based on genotyping pools of Brahman cows with unknown pedigree.

    Science.gov (United States)

    Reverter, A; Porto-Neto, L R; Fortes, M R S; McCulloch, R; Lyons, R E; Moore, S; Nicol, D; Henshall, J; Lehnert, S A

    2016-10-01

    We introduce an innovative approach to lowering the overall cost of obtaining genomic EBV (GEBV) and encourage their use in commercial extensive herds of Brahman beef cattle. In our approach, the DNA genotyping of cow herds from 2 independent properties was performed using a high-density bovine SNP chip on DNA from pooled blood samples, grouped according to the result of a pregnancy test following their first and second joining opportunities. For the DNA pooling strategy, 15 to 28 blood samples from the same phenotype and contemporary group were allocated to pools. Across the 2 properties, a total of 183 pools were created representing 4,164 cows. In addition, blood samples from 309 bulls from the same properties were also taken. After genotyping and quality control, 74,584 remaining SNP were used for analyses. Pools and individual DNA samples were related by means of a "hybrid" genomic relationship matrix. The pooled genotyping analysis of 2 large and independent commercial populations of tropical beef cattle was able to recover significant and plausible associations between SNP and pregnancy test outcome. We discuss 24 SNP with significant association ( < 1.0 × 10) and mapped within 40 kb of an annotated gene. We have established a method to estimate the GEBV in young herd bulls for a trait that is currently unable to be predicted at all. In summary, our novel approach allowed us to conduct genomic analyses of fertility in 2 large commercial Brahman herds managed under extensive pastoral conditions.

  6. Genome-wide SNP identification in multiple morphotypes of allohexaploid tall fescue (Festuca arundinacea Schreb)

    Directory of Open Access Journals (Sweden)

    Hand Melanie L

    2012-06-01

    Full Text Available Abstract Background: Single nucleotide polymorphisms (SNPs) provide essential tools for the advancement of research in plant genomics, and the development of SNP resources for many species has been accelerated by the capabilities of second-generation sequencing technologies. The current study aimed to develop and use a novel bioinformatic pipeline to generate a comprehensive collection of SNP markers within the agriculturally important pasture grass tall fescue; an outbreeding allopolyploid species displaying three distinct morphotypes: Continental, Mediterranean and rhizomatous. Results: A bioinformatic pipeline was developed that successfully identified SNPs within genotypes from distinct tall fescue morphotypes, following the sequencing of 414 polymerase chain reaction (PCR)-generated amplicons using 454 GS FLX technology. Equivalent amplicon sets were derived from representative genotypes of each morphotype, including six Continental, five Mediterranean and one rhizomatous. A total of 8,584 and 2,292 SNPs were identified with high confidence within the Continental and Mediterranean morphotypes respectively. The success of the bioinformatic approach was demonstrated through validation (at a rate of 70%) of a subset of 141 SNPs using both SNaPshot™ and GoldenGate™ assay chemistries. Furthermore, the quantitative genotyping capability of the GoldenGate™ assay revealed that approximately 30% of the putative SNPs were accessible to co-dominant scoring, despite the hexaploid genome structure. The sub-genome-specific origin of each SNP validated from Continental tall fescue was predicted using a phylogenetic approach based on comparison with orthologous sequences from predicted progenitor species. Conclusions: Using the appropriate bioinformatic approach, amplicon resequencing based on 454 GS FLX technology is an effective method for the identification of polymorphic SNPs within the genomes of Continental and Mediterranean tall fescue. The

  7. Two-temperature LATE-PCR endpoint genotyping

    Directory of Open Access Journals (Sweden)

    Reis Arthur H

    2006-12-01

    Full Text Available Abstract Background: In conventional PCR, total amplicon yield becomes independent of starting template number as amplification reaches plateau and varies significantly among replicate reactions. This paper describes a strategy for reconfiguring PCR so that the signal intensity of a single fluorescent detection probe after PCR thermal cycling reflects genomic composition. The resulting method corrects for product yield variations among replicate amplification reactions, permits resolution of homozygous and heterozygous genotypes based on endpoint fluorescence signal intensities, and readily identifies imbalanced allele ratios equivalent to those arising from gene/chromosomal duplications. Furthermore, the use of only a single colored probe for genotyping enhances the multiplex detection capacity of the assay. Results: Two-Temperature LATE-PCR endpoint genotyping combines Linear-After-The-Exponential (LATE)-PCR (an advanced form of asymmetric PCR that efficiently generates single-stranded DNA) and mismatch-tolerant probes capable of detecting allele-specific targets at high temperature and total single-stranded amplicons at a lower temperature in the same reaction. The method is demonstrated here for genotyping single-nucleotide alleles of the human HEXA gene responsible for Tay-Sachs disease and for genotyping SNP alleles near the human p53 tumor suppressor gene. In each case, the final probe signals were normalized against total single-stranded DNA generated in the same reaction. Normalization reduces the coefficient of variation among replicates from 17.22% to as little as 2.78% and permits endpoint genotyping with >99.7% accuracy. These assays are robust because they are consistent over a wide range of input DNA concentrations and give the same results regardless of how many cycles of linear amplification have elapsed. The method is also sufficiently powerful to distinguish between samples with a 1:1 ratio of two alleles from samples comprised of
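
    The normalization step described in this record (dividing the allele-specific probe signal by the total single-stranded DNA signal from the same reaction) can be sketched as follows. The replicate fluorescence values are invented purely for illustration and are not data from the paper; the function name `coefficient_of_variation` is likewise an assumption.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Sample coefficient of variation, expressed as a percentage."""
    return stdev(values) / mean(values) * 100


# Hypothetical endpoint fluorescence from four replicate reactions.
allele_signal = [100.0, 130.0, 80.0, 115.0]   # probe signal at the high temperature
total_signal = [200.0, 262.0, 158.0, 232.0]   # probe signal at the low temperature

# Normalize each allele-specific signal by the total single-stranded product
# measured in the same tube, which cancels tube-to-tube yield differences.
normalized = [a / t for a, t in zip(allele_signal, total_signal)]

cv_raw = coefficient_of_variation(allele_signal)
cv_normalized = coefficient_of_variation(normalized)
print(f"CV of raw signals:        {cv_raw:.1f}%")
print(f"CV of normalized signals: {cv_normalized:.1f}%")
```

    Because yield variation affects both readings in a tube in the same direction, the ratio is far more stable across replicates than the raw signal, which is the effect the abstract reports.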

  8. Physiological response of soybean genotypes to plant density

    NARCIS (Netherlands)

    Gan, Y; Stulen; van Keulen, H; Kuiper, PJC

    2002-01-01

    Response of soybean (Glycine max (L.) Merr.) to plant density has occupied a segment of agronomic research for most of the century. Genotype differences have been noted especially in response to planting date, lodging problems and water limitation. There is limited information on the physiological

  9. Genotyping of Brucella species using clade specific SNPs

    Directory of Open Access Journals (Sweden)

    Foster Jeffrey T

    2012-06-01

    Full Text Available Abstract Background: Brucellosis is a worldwide disease of mammals caused by Alphaproteobacteria in the genus Brucella. The genus is genetically monomorphic, requiring extensive genotyping to differentiate isolates. We utilized two different genotyping strategies to characterize isolates. First, we developed a microarray-based assay based on 1000 single nucleotide polymorphisms (SNPs) that were identified from whole genome comparisons of two B. abortus isolates, one B. melitensis, and one B. suis. We then genotyped a diverse collection of 85 Brucella strains at these SNP loci and generated a phylogenetic tree of relationships. Second, we developed a selective primer-extension assay system using capillary electrophoresis that targeted 17 high value SNPs across 8 major branches of the phylogeny and determined their genotypes in a large collection (n = 340) of diverse isolates. Results: Our 1000 SNP microarray readily distinguished B. abortus, B. melitensis, and B. suis, differentiating B. melitensis and B. suis into two clades each. Brucella abortus was divided into four major clades. Our capillary-based SNP genotyping confirmed all major branches from the microarray assay and assigned all samples to defined lineages. Isolates from these lineages and closely related isolates, among the most commonly encountered lineages worldwide, can now be quickly and easily identified and genetically characterized. Conclusions: We have identified clade-specific SNPs in Brucella that can be used for rapid assignment into major groups below the species level in the three main Brucella species. Our assays represent SNP genotyping approaches that can reliably determine the evolutionary relationships of bacterial isolates without the need for whole genome sequencing of all isolates.

  10. [Association Between SNP rs6007897 of CELSR1 and Acute Ischemic Stroke in Western China Han Population: a Case-control Study].

    Science.gov (United States)

    Qin, Feng-qin; Yu, Li-hua; Hu, Wen-ting; Guo, Jian; Chen, Ning; Guo, Jiang; Fang, Jing-huan; He, Li

    2015-07-01

    To investigate the relationship between single nucleotide polymorphism (SNP) rs6007897 of CELSR1 and acute ischemic stroke in Western China Han population. All subjects (759 acute ischemic stroke patients and 786 controls) were genotyped using ligation detection reaction (LDR). We compared SNP rs6007897 genotype and allele frequencies between the two groups. Two genotypes (AA, AG) of rs6007897 were found in both the stroke and control groups. Genotype and allele frequencies did not differ significantly between the two groups. After adjusting for risk factors, we found no significant association between rs6007897 and ischemic stroke (P = 0.797, odds ratio (OR) = 0.886, 95% confidence interval (CI) = 0.352-2.227). SNP rs6007897 of CELSR1 was not significantly associated with ischemic stroke in Western China Han population.
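
    For readers unfamiliar with the statistic reported in this record, the sketch below computes an unadjusted odds ratio with a Wald 95% confidence interval from a 2x2 table. The abstract reports a risk-factor-adjusted OR and does not give genotype counts, so the counts used here are hypothetical (chosen only to sum to the 759 cases and 786 controls) and the result is not expected to reproduce the published estimate.

```python
import math


def odds_ratio_wald_ci(a: int, b: int, c: int, d: int, z: float = 1.96):
    """Unadjusted odds ratio and Wald confidence interval for a 2x2 table.

    a: cases with the variant genotype      b: cases without it
    c: controls with the variant genotype   d: controls without it
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the Wald approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical AG vs. AA counts; NOT the study's data.
or_value, ci_low, ci_high = odds_ratio_wald_ci(a=10, b=749, c=12, d=774)
print(f"OR = {or_value:.3f}, 95% CI = {ci_low:.3f}-{ci_high:.3f}")
```

    A confidence interval that spans 1, as in the published result, means the data are compatible with no association.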

  11. Ascertainment biases in SNP chips affect measures of population divergence

    DEFF Research Database (Denmark)

    Albrechtsen, Anders; Nielsen, Finn Cilius; Nielsen, Rasmus

    2010-01-01

    Chip-based high-throughput genotyping has facilitated genome-wide studies of genetic diversity. Many studies have utilized these large data sets to make inferences about the demographic history of human populations using measures of genetic differentiation such as F(ST) or principal component...... on direct sequencing. In addition, we also analyze publicly available genome-wide data. We demonstrate that the ascertainment biases will distort measures of human diversity and possibly change conclusions drawn from these measures in sometimes unexpected ways. We also show that details of the genotyping...... analyses. However, the single nucleotide polymorphism (SNP) chip data suffer from ascertainment biases caused by the SNP discovery process in which a small number of individuals from selected populations are used as discovery panels. In this study, we investigate the effect of the ascertainment bias...

  12. Imputation Accuracy from Low to Moderate Density Single Nucleotide Polymorphism Chips in a Thai Multibreed Dairy Cattle Population

    Directory of Open Access Journals (Sweden)

    Danai Jattawa

    2016-04-01

    Full Text Available The objective of this study was to investigate the accuracy of imputation from low density (LDC) to moderate density SNP chips (MDC) in a Thai Holstein-Other multibreed dairy cattle population. Dairy cattle with complete pedigree information (n = 1,244) from 145 dairy farms were genotyped with GeneSeek GGP20K (n = 570), GGP26K (n = 540) and GGP80K (n = 134) chips. After checking for single nucleotide polymorphism (SNP) quality, 17,779 SNP markers in common between the GGP20K, GGP26K, and GGP80K were used to represent MDC. Animals were divided into two groups, a reference group (n = 912) and a test group (n = 332). The SNP markers chosen for the test group were those located in positions corresponding to GeneSeek GGP9K (n = 7,652). The LDC to MDC genotype imputation was carried out using three different software packages, namely Beagle 3.3 (population-based algorithm), FImpute 2.2 (combined family- and population-based algorithms) and Findhap 4 (combined family- and population-based algorithms). Imputation accuracies within and across chromosomes were calculated as ratios of correctly imputed SNP markers to overall imputed SNP markers. Imputation accuracy for the three software packages ranged from 76.79% to 93.94%. FImpute had higher imputation accuracy (93.94%) than Findhap (84.64%) and Beagle (76.79%). Imputation accuracies were similar and consistent across chromosomes for FImpute, but not for Findhap and Beagle. Most chromosomes that showed either high (73%) or low (80%) imputation accuracies were the same chromosomes that had above and below average linkage disequilibrium (LD; defined here as the correlation between pairs of adjacent SNP within chromosomes less than or equal to 1 Mb apart). Results indicated that FImpute was more suitable than Findhap and Beagle for genotype imputation in this Thai multibreed population. Perhaps additional increments in imputation accuracy could be achieved by increasing the completeness of pedigree information.
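
    The accuracy measure named in this abstract (correctly imputed SNP markers divided by all imputed SNP markers) can be illustrated with a minimal sketch. The function name, the 0/1/2 genotype coding and the example genotypes are assumptions made for illustration; they are not taken from the study or from Beagle, FImpute or Findhap.

```python
from typing import Sequence


def imputation_accuracy(true_genotypes: Sequence[int],
                        imputed_genotypes: Sequence[int]) -> float:
    """Fraction of imputed genotypes that match the true genotypes.

    Genotypes are assumed (for this sketch) to be coded as 0, 1 or 2
    copies of one allele, one entry per imputed SNP marker.
    """
    if len(true_genotypes) != len(imputed_genotypes):
        raise ValueError("genotype vectors must have the same length")
    if len(true_genotypes) == 0:
        raise ValueError("at least one imputed marker is required")
    # Count markers where the imputed call equals the observed call.
    correct = sum(t == i for t, i in zip(true_genotypes, imputed_genotypes))
    return correct / len(true_genotypes)


# Hypothetical example: 8 masked markers, 6 of them imputed correctly.
observed = [0, 1, 2, 1, 0, 2, 1, 1]
imputed = [0, 1, 2, 0, 0, 2, 1, 2]
print(imputation_accuracy(observed, imputed))  # 0.75
```

    Applying the same function to the markers of one chromosome at a time would give per-chromosome values of the kind the abstract compares across software packages.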

  13. OpenADAM: an open source genome-wide association data management system for Affymetrix SNP arrays

    Directory of Open Access Journals (Sweden)

    Sham P C

    2008-12-01

    Full Text Available Abstract Background Large scale genome-wide association studies have become popular since the introduction of high throughput genotyping platforms. Efficient management of the vast array of data generated poses many challenges. Description We have developed an open source web-based data management system for the large amount of genotype data generated from the Affymetrix GeneChip® Mapping Array and Affymetrix Genome-Wide Human SNP Array platforms. The database supports genotype calling using DM, BRLMM, BRLMM-P or Birdseed algorithms provided by the Affymetrix Power Tools. The genotype and corresponding pedigree data are stored in a relational database for efficient downstream data manipulation and analysis, such as calculation of allele and genotype frequencies, sample identity checking, and export of genotype data in various file formats for analysis using commonly-available software. A novel method for genotyping error estimation is implemented using linkage disequilibrium information from the HapMap project. All functionalities are accessible via a web-based user interface. Conclusion OpenADAM provides an open source database system for management of Affymetrix genome-wide association SNP data.

  14. Construction of a high-density genetic map for grape using next generation restriction-site associated DNA sequencing

    Directory of Open Access Journals (Sweden)

    Wang Nian

    2012-08-01

    Full Text Available Abstract Background Genetic mapping and QTL detection are powerful methodologies in plant improvement and breeding. Construction of a high-density and high-quality genetic map would be of great benefit in the production of superior grapes to meet human demand. High throughput and low cost of the recently developed next generation sequencing (NGS) technology have resulted in its wide application in genome research. Sequencing restriction-site associated DNA (RAD) might be an efficient strategy to simplify genotyping. Combining NGS with RAD has proven to be powerful for single nucleotide polymorphism (SNP) marker development. Results An F1 population of 100 individual plants was developed. In-silico digestion-site prediction was used to select an appropriate restriction enzyme for construction of a RAD sequencing library. Next generation RAD sequencing was applied to genotype the F1 population and its parents. Applying a cluster strategy for SNP modulation, a total of 1,814 high-quality SNP markers were developed: 1,121 of these were mapped to the female genetic map, 759 to the male map, and 1,646 to the integrated map. A comparison of the genetic maps to the published Vitis vinifera genome revealed both conservation and variations. Conclusions The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using our designed SNP markers. Detailed analysis revealed that this newly developed genetic map can be used for a variety of genome investigations, such as QTL detection, sequence assembly and genome comparison.

  15. SNP-SNP interactions in breast cancer susceptibility

    Directory of Open Access Journals (Sweden)

    Wang Yuanyuan

    2006-05-01

    Full Text Available Abstract Background Breast cancer predisposition genes identified to date (e.g., BRCA1 and BRCA2) are responsible for less than 5% of all breast cancer cases. Many studies have shown that the cancer risks associated with individual commonly occurring single nucleotide polymorphisms (SNPs) are incremental. However, polygenic models suggest that multiple commonly occurring low to modestly penetrant SNPs of cancer related genes might have a greater effect on a disease when considered in combination. Methods In an attempt to identify the breast cancer risk conferred by SNP interactions, we have studied 19 SNPs from genes involved in major cancer related pathways. All SNPs were genotyped by TaqMan 5'nuclease assay. The association between the case-control status and each individual SNP, measured by the odds ratio and its corresponding 95% confidence interval, was estimated using unconditional logistic regression models. At the second stage, two-way interactions were investigated using multivariate logistic models. The robustness of the interactions, which were observed among SNPs with stronger functional evidence, was assessed using a bootstrap approach, and correction for multiple testing based on the false discovery rate (FDR) principle. Results None of these SNPs contributed to breast cancer risk individually. However, we have demonstrated evidence for gene-gene (SNP-SNP) interaction among these SNPs, which were associated with increased breast cancer risk. Our study suggests cross talk between the SNPs of the DNA repair and immune system (XPD-[Lys751Gln] and IL10-[G(-1082)A]), cell cycle and estrogen metabolism (CCND1-[Pro241Pro] and COMT-[Met108/158Val]), cell cycle and DNA repair (BARD1-[Pro24Ser] and XPD-[Lys751Gln]), and within carcinogen metabolism (GSTP1-[Ile105Val] and COMT-[Met108/158Val]) pathways. Conclusion The importance of these pathways and their communication in breast cancer predisposition has been emphasized previously, but their

  16. SNP-SNP interactions in breast cancer susceptibility

    International Nuclear Information System (INIS)

    Onay, Venüs Ümmiye; Ozcelik, Hilmi; Briollais, Laurent; Knight, Julia A; Shi, Ellen; Wang, Yuanyuan; Wells, Sean; Li, Hong; Rajendram, Isaac; Andrulis, Irene L

    2006-01-01

    Breast cancer predisposition genes identified to date (e.g., BRCA1 and BRCA2) are responsible for less than 5% of all breast cancer cases. Many studies have shown that the cancer risks associated with individual commonly occurring single nucleotide polymorphisms (SNPs) are incremental. However, polygenic models suggest that multiple commonly occurring low to modestly penetrant SNPs of cancer related genes might have a greater effect on a disease when considered in combination. In an attempt to identify the breast cancer risk conferred by SNP interactions, we have studied 19 SNPs from genes involved in major cancer related pathways. All SNPs were genotyped by TaqMan 5'nuclease assay. The association between the case-control status and each individual SNP, measured by the odds ratio and its corresponding 95% confidence interval, was estimated using unconditional logistic regression models. At the second stage, two-way interactions were investigated using multivariate logistic models. The robustness of the interactions, which were observed among SNPs with stronger functional evidence, was assessed using a bootstrap approach, and correction for multiple testing based on the false discovery rate (FDR) principle. None of these SNPs contributed to breast cancer risk individually. However, we have demonstrated evidence for gene-gene (SNP-SNP) interaction among these SNPs, which were associated with increased breast cancer risk. Our study suggests cross talk between the SNPs of the DNA repair and immune system (XPD-[Lys751Gln] and IL10-[G(-1082)A]), cell cycle and estrogen metabolism (CCND1-[Pro241Pro] and COMT-[Met108/158Val]), cell cycle and DNA repair (BARD1-[Pro24Ser] and XPD-[Lys751Gln]), and within carcinogen metabolism (GSTP1-[Ile105Val] and COMT-[Met108/158Val]) pathways. The importance of these pathways and their communication in breast cancer predisposition has been emphasized previously, but their biological interactions through SNPs have not been described
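
    The abstract mentions correction for multiple testing based on the false discovery rate (FDR) principle without naming a procedure. As a hedged illustration only, the sketch below uses the Benjamini-Hochberg step-up adjustment, which is one common FDR method and is an assumption here, not necessarily what the authors applied. The p-values are hypothetical.

```python
from typing import List, Sequence


def benjamini_hochberg(p_values: Sequence[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical p-values for four SNP-SNP interaction tests.
raw = [0.01, 0.04, 0.03, 0.005]
for p, q in zip(raw, benjamini_hochberg(raw)):
    print(f"raw p = {p:.3f}, BH-adjusted = {q:.3f}")
```

    An interaction would then be reported when its adjusted value falls below the chosen FDR threshold.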

  17. Effects of DNA mass on multiple displacement whole genome amplification and genotyping performance

    Directory of Open Access Journals (Sweden)

    Haque Kashif A

    2005-09-01

    Full Text Available Abstract Background Whole genome amplification (WGA) promises to eliminate practical molecular genetic analysis limitations associated with genomic DNA (gDNA) quantity. We evaluated the performance of multiple displacement amplification (MDA) WGA using gDNA extracted from lymphoblastoid cell lines (N = 27) with a range of starting gDNA input of 1–200 ng into the WGA reaction. Yield and composition analysis of whole genome amplified DNA (wgaDNA) was performed using three DNA quantification methods (OD, PicoGreen® and RT-PCR). Two panels of N = 15 STR (using the AmpFlSTR® Identifiler® panel) and N = 49 SNP (TaqMan®) genotyping assays were performed on each gDNA and wgaDNA sample in duplicate. gDNA and wgaDNA masses of 1, 4 and 20 ng were used in the SNP assays to evaluate the effects of DNA mass on SNP genotyping assay performance. A total of N = 6,880 STR and N = 56,448 SNP genotype attempts provided adequate power to detect differences in STR and SNP genotyping performance between gDNA and wgaDNA, and among wgaDNA produced from a range of gDNA template inputs. Results The proportion of double-stranded wgaDNA and human-specific PCR amplifiable wgaDNA increased with increased gDNA input into the WGA reaction. Increased amounts of gDNA input into the WGA reaction improved wgaDNA genotyping performance. Genotype completion or genotype concordance rates of wgaDNA produced from all gDNA input levels were observed to be reduced compared to gDNA, although the reduction was not always statistically significant. Reduced wgaDNA genotyping performance was primarily due to the increased variance of allelic amplification, resulting in loss of heterozygosity or increased undetermined genotypes. MDA WGA produces wgaDNA from no template control samples; such samples exhibited substantial false-positive genotyping rates. Conclusion The amount of gDNA input into the MDA WGA reaction is a critical determinant of genotyping performance of wgaDNA. At least 10 ng of

  18. Identification of Laying-Related SNP Markers in Geese Using RAD Sequencing.

    Directory of Open Access Journals (Sweden)

    ShiGang Yu

    Full Text Available Laying performance is an important economic trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS) strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD) sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP) and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV) or low estimated breeding value (LEBV). A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, linear regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiple-gene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the

  19. The Brachyury Gly177Asp SNP Is not Associated with a Risk of Skull Base Chordoma in the Chinese Population

    Directory of Open Access Journals (Sweden)

    Zhen Wu

    2013-10-01

    Full Text Available A recent chordoma cancer genotyping study reveals that rs2305089, a single nucleotide polymorphism (SNP) located in the brachyury gene, a key gene in the development of the notochord, is significantly associated with chordoma risk. The brachyury gene is believed to be one of the key genes involved in the pathogenesis of chordoma, a rare primary bone tumor originating along the spinal column or at the base of the skull. The association between the brachyury Gly177Asp single nucleotide polymorphism (SNP) and the risk of skull base chordoma in Chinese populations is currently unknown. We investigated the genotype distribution of this SNP in 65 skull-base chordoma cases and 120 healthy subjects. Comparisons of the genotype distributions and allele frequencies did not reveal any significant difference between the groups. Our data suggest that the brachyury Gly177Asp SNP is not involved in the risk of skull-base chordoma, at least in the Chinese population.

  20. Inter-laboratory evaluation of the EUROFORGEN Global ancestry-informative SNP panel by massively parallel sequencing using the Ion PGM™.

    Science.gov (United States)

    Eduardoff, M; Gross, T E; Santos, C; de la Puente, M; Ballard, D; Strobl, C; Børsting, C; Morling, N; Fusco, L; Hussing, C; Egyed, B; Souto, L; Uacyisrael, J; Syndercombe Court, D; Carracedo, Á; Lareu, M V; Schneider, P M; Parson, W; Phillips, C

    2016-07-01

    The EUROFORGEN Global ancestry-informative SNP (AIM-SNPs) panel is a forensic multiplex of 128 markers designed to differentiate an individual's ancestry from amongst the five continental population groups of Africa, Europe, East Asia, Native America, and Oceania. A custom multiplex of AmpliSeq™ PCR primers was designed for the Global AIM-SNPs to perform massively parallel sequencing using the Ion PGM™ system. This study assessed individual SNP genotyping precision using the Ion PGM™, the forensic sensitivity of the multiplex using dilution series, degraded DNA plus simple mixtures, and the ancestry differentiation power of the final panel design, which required substitution of three original ancestry-informative SNPs with alternatives. Fourteen populations that had not been previously analyzed were genotyped using the custom multiplex and these studies allowed assessment of genotyping performance by comparison of data across five laboratories. Results indicate a low level of genotyping error can still occur from sequence misalignment caused by homopolymeric tracts close to the target SNP, despite careful scrutiny of candidate SNPs at the design stage. Such sequence misalignment required the exclusion of component SNP rs2080161 from the Global AIM-SNPs panel. However, the overall genotyping precision and sensitivity of this custom multiplex indicates the Ion PGM™ assay for the Global AIM-SNPs is highly suitable for forensic ancestry analysis with massively parallel sequencing. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  1. Influence of the MDM2 single nucleotide polymorphism SNP309 on tumour development in BRCA1 mutation carriers

    Directory of Open Access Journals (Sweden)

    Johnson Peter W

    2006-03-01

    Full Text Available Abstract Background The MDM2 gene encodes a negative regulator of the p53 tumour suppressor protein. A single nucleotide polymorphism (SNP) in the MDM2 promoter (a T to G exchange at nucleotide 309) has been reported to produce accelerated tumour formation in individuals with inherited p53 mutations. We have investigated the effect of the MDM2 SNP309 on clinical outcome in a cohort of patients with germline mutations of BRCA1. Methods Genomic DNA was obtained for 102 healthy controls and 116 patients with established pathogenic mutations of BRCA1 and Pyrosequencing technology™ was used to determine the genotype at the MDM2 SNP309 locus. Results The polymorphism was present in 52.9% of the controls (G/T in 37.3% and G/G in 15.6%) and 58.6% of the BRCA1 mutation carriers (47.4% G/T and 11.2% G/G). Incidence of malignancy in female BRCA1 carriers was not significantly higher in SNP309 carriers than in wildtype (T/T) individuals (72.7% vs. 75.6%, p = 1.00). Mean age of diagnosis of first breast cancer was 41.2 years in the SNP309 G/G genotype carriers, 38.6 years in those with the SNP309 G/T genotype and 39.0 years in wildtype subjects (p = 0.80). Conclusion We found no evidence that the MDM2 SNP309 accelerates tumour development in carriers of known pathogenic germline mutations of BRCA1.

  2. New Insights into the Geographic Distribution of Mycobacterium leprae SNP Genotypes Determined for Isolates from Leprosy Cases Diagnosed in Metropolitan France and French Territories.

    Science.gov (United States)

    Reibel, Florence; Chauffour, Aurélie; Brossier, Florence; Jarlier, Vincent; Cambau, Emmanuelle; Aubry, Alexandra

    2015-01-01

    Between 20 and 30 bacteriologically confirmed cases of leprosy are diagnosed each year at the French National Reference Center for mycobacteria. Patients are mainly immigrants from various endemic countries or living in French overseas territories. We aimed at expanding data regarding the geographical distribution of the SNP genotypes of the M. leprae isolates from these patients. Skin biopsies were obtained from 71 leprosy patients diagnosed between January 2009 and December 2013. Data regarding age, sex and place of birth and residence were also collected. Diagnosis of leprosy was confirmed by microscopic detection of acid-fast bacilli and/or amplification by PCR of the M. leprae-specific RLEP region. Single nucleotide polymorphisms (SNP), present in the M. leprae genome at positions 14 676, 1 642 875 and 2 935 685, were determined with an efficiency of 94% (67/71). Almost all patients were from countries other than France where leprosy is still prevalent (n = 31) or from French overseas territories (n = 36) where leprosy is not totally eradicated, while only a minority (n = 4) was born in metropolitan France but have lived in other countries. SNP type 1 was predominant (n = 33), followed by type 3 (n = 17), type 4 (n = 11) and type 2 (n = 6). SNP types were concordant with those previously reported as prevalent in the patients' countries of birth. SNP types found in patients born in countries other than France (Comoros, Haiti, Benin, Congo, Sri Lanka) and French overseas territories (French Polynesia, Mayotte and La Réunion) not covered by previous work correlated well with geographical location and history of human settlements. The phylogenic analysis of M. leprae strains isolated in France strongly suggests that French leprosy cases are caused by SNP types that are (a) concordant with the geographic origin or residence of the patients (non-French countries, French overseas territories, metropolitan France) or (b) more likely random in regions where diverse

  3. Refining QTL with high-density SNP genotyping and whole genome sequence in three cattle breeds

    DEFF Research Database (Denmark)

    Sahana, Goutam; Guldbrandtsen, Bernt; Lund, Mogens Sandø

    2012-01-01

    A genome-wide association study was carried out in Nordic Holsteins, Nordic Red and Jersey breeds for functional traits using the BovineHD Genotyping BeadChip (Illumina, San Diego, CA). The association analyses were carried out using both linear mixed model approach and a Bayesian variable selection...... method. Principal components were used to account for population structure. The QTL segregating in all three breeds were selected and a few of the most significant ones were followed in further analyses. The polymorphisms in the identified QTL regions were imputed using 90 whole genome sequences...

  4. Melt analysis of mismatch amplification mutation assays (Melt-MAMA): a functional study of a cost-effective SNP genotyping assay in bacterial models.

    Directory of Open Access Journals (Sweden)

    Dawn N Birdsell

    Full Text Available Single nucleotide polymorphisms (SNPs) are abundant in genomes of all species and are biologically informative markers extensively used across broad scientific disciplines. Newly identified SNP markers are publicly available at an ever-increasing rate due to advancements in sequencing technologies. Efficient, cost-effective SNP genotyping methods to screen sample populations are in great demand in well-equipped laboratories, but also in developing world situations. Dual Probe TaqMan assays are robust but can be cost-prohibitive and require specialized equipment. The Mismatch Amplification Mutation Assay, coupled with melt analysis (Melt-MAMA), is flexible, efficient and cost-effective. However, Melt-MAMA traditionally suffers from high rates of assay design failures and knowledge gaps on assay robustness and sensitivity. In this study, we identified strategies that improved the success of Melt-MAMA. We examined the performance of 185 Melt-MAMAs across eight different pathogens using various optimization parameters. We evaluated the effects of genome size and %GC content on assay development. When used collectively, specific strategies markedly improved the rate of successful assays at the first design attempt from ~50% to ~80%. We observed that Melt-MAMA accurately genotypes across a broad DNA range (~100 ng to ~0.1 pg). Genomic size and %GC content influence the rate of successful assay design in an independent manner. Finally, we demonstrated the versatility of these assays by the creation of a duplex Melt-MAMA real-time PCR (two SNPs) and conversion to a size-based genotyping system, which uses agarose gel electrophoresis. Melt-MAMA is comparable to Dual Probe TaqMan assays in terms of design success rate and accuracy. Although sensitivity is less robust than Dual Probe TaqMan assays, Melt-MAMA is superior in terms of cost-effectiveness, speed of development and versatility. We detail the parameters most important for the successful application of

  5. Association test based on SNP set: logistic kernel machine based test vs. principal component analysis.

    Directory of Open Access Journals (Sweden)

    Yang Zhao

    Full Text Available GWAS has greatly facilitated the discovery of risk SNPs associated with complex diseases. Traditional methods analyze SNPs individually and are limited by low power and reproducibility since correction for multiple comparisons is necessary. Several methods have been proposed based on grouping SNPs into SNP sets using biological knowledge and/or genomic features. In this article, we compare the linear kernel machine based test (LKM) and principal components analysis based approach (PCA) using simulated datasets under the scenarios of 0 to 3 causal SNPs, as well as simple and complex linkage disequilibrium (LD) structures of the simulated regions. Our simulation study demonstrates that both LKM and PCA can control the type I error at the significance level of 0.05. If the causal SNP is in strong LD with the genotyped SNPs, both the PCA with a small number of principal components (PCs) and the LKM with kernel of linear or identity-by-state function are valid tests. However, if the LD structure is complex, such as several LD blocks in the SNP set, or when the causal SNP is not in the LD block in which most of the genotyped SNPs reside, more PCs should be included to capture the information of the causal SNP. Simulation studies also demonstrate the ability of LKM and PCA to combine information from multiple causal SNPs and to provide increased power over individual SNP analysis. We also apply LKM and PCA to analyze two SNP sets extracted from an actual GWAS dataset on non-small cell lung cancer.

  6. Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array

    Energy Technology Data Exchange (ETDEWEB)

    Gardner, S; Jaing, C

    2012-03-27

    The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interim report, we described the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.

  7. Tri-allelic SNP markers enable analysis of mixed and degraded DNA samples.

    Science.gov (United States)

    Westen, Antoinette A; Matai, Anuska S; Laros, Jeroen F J; Meiland, Hugo C; Jasper, Mandy; de Leeuw, Wiljo J F; de Knijff, Peter; Sijen, Titia

    2009-09-01

    For the analysis of degraded DNA in disaster victim identification (DVI) and criminal investigations, single nucleotide polymorphisms (SNPs) have been recognized as promising markers mainly because they can be analyzed in short sized amplicons. Most SNPs are bi-allelic and are thereby ineffective to detect mixtures, which may lead to incorrect genotyping. We developed an algorithm to find non-binary (i.e. tri-allelic or tetra-allelic) SNPs in the NCBI dbSNP database. We selected 31 potential tri-allelic SNPs with a minor allele frequency of at least 10%. The tri-allelic nature was confirmed for 15 SNPs residing on 14 different chromosomes. Multiplex SNaPshot assays were developed, and the allele frequencies of 16 SNPs were determined among 153 Dutch and 111 Netherlands Antilles reference samples. Using these multiplex SNP assays, the presence of a mixture of two DNA samples in a ratio up to 1:8 could be recognized reliably. Furthermore, we compared the genotyping efficiency of the tri-allelic SNP markers and short tandem repeat (STR) markers by analyzing artificially degraded DNA and DNA from 30 approximately 500-year-old bone and molar samples. In both types of degraded DNA samples, the larger sized STR amplicons failed to amplify whereas the tri-allelic SNP markers still provided valuable information. In conclusion, tri-allelic SNP markers are suited for the analysis of degraded DNA and enable the detection of a second DNA source in a sample.
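
    The selection step described in this abstract (non-binary SNPs with a minor allele frequency of at least 10%) can be sketched as a simple filter. This is not the authors' algorithm and does not query dbSNP. It assumes each SNP is already available as a mapping from allele to population frequency, and it assumes the 10% threshold applies to the rarest of the three alleles. The SNP identifiers and frequencies are hypothetical.

```python
from typing import Dict


def is_informative_triallelic(allele_freqs: Dict[str, float],
                              min_maf: float = 0.10) -> bool:
    """True if exactly three alleles are observed and the rarest
    reaches min_maf (an assumed reading of the 10% criterion)."""
    observed = [f for f in allele_freqs.values() if f > 0]
    return len(observed) == 3 and min(observed) >= min_maf


# Hypothetical candidates; identifiers and frequencies are made up.
candidates = {
    "snp_a": {"A": 0.50, "C": 0.30, "T": 0.20},  # three common alleles
    "snp_b": {"A": 0.90, "C": 0.06, "T": 0.04},  # third allele too rare
    "snp_c": {"A": 0.60, "G": 0.40},             # ordinary bi-allelic SNP
}
selected = [name for name, freqs in candidates.items()
            if is_informative_triallelic(freqs)]
print(selected)  # ['snp_a']
```

    A marker with three reasonably common alleles is what makes a third allele peak, and therefore a second contributor, visible in a mixed sample.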

  8. SNP discovery in the transcriptome of white Pacific shrimp Litopenaeus vannamei by next generation sequencing.

    Directory of Open Access Journals (Sweden)

    Yang Yu

    Full Text Available The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies.
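
    Two summary statistics quoted in this abstract, the SNP frequency (one SNP per 476 bp, about 0.21%) and the transition to transversion ratio, follow from simple counting. The sketch below shows that arithmetic. The function names and the six example substitutions are assumptions for illustration and do not come from the study's pipeline.

```python
from typing import Iterable, Tuple

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def is_transition(ref: str, alt: str) -> bool:
    """A transition swaps purine for purine or pyrimidine for pyrimidine."""
    pair = {ref.upper(), alt.upper()}
    if len(pair) != 2 or not pair <= (PURINES | PYRIMIDINES):
        raise ValueError(f"not a valid single-base substitution: {ref}>{alt}")
    return pair <= PURINES or pair <= PYRIMIDINES


def ti_tv_ratio(snps: Iterable[Tuple[str, str]]) -> float:
    """Ratio of transitions to transversions over (ref, alt) pairs."""
    snps = list(snps)
    transitions = sum(is_transition(ref, alt) for ref, alt in snps)
    transversions = len(snps) - transitions
    if transversions == 0:
        raise ValueError("ratio undefined without transversions")
    return transitions / transversions


def snp_frequency_percent(bp_per_snp: float) -> float:
    """Convert 'one SNP per N bp' into a percentage of positions."""
    return 100.0 / bp_per_snp


# Hypothetical calls: four transitions and two transversions.
calls = [("A", "G"), ("C", "T"), ("G", "A"), ("T", "C"), ("A", "C"), ("G", "T")]
print(ti_tv_ratio(calls))                     # 2.0
print(round(snp_frequency_percent(476), 2))   # 0.21
```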

  9. Detecting imbalanced expression of SNP alleles by minisequencing on microarrays

    Directory of Open Access Journals (Sweden)

    Dahlgren Andreas

    2004-10-01

    Full Text Available Abstract Background Each of the human genes or transcriptional units is likely to contain single nucleotide polymorphisms that may give rise to sequence variation between individuals and tissues on the level of RNA. Based on recent studies, differential expression of the two alleles of heterozygous coding single nucleotide polymorphisms (SNPs) may be frequent for human genes. Methods with high accuracy to be used in a high throughput setting are needed for systematic surveys of expressed sequence variation. In this study we evaluated two formats of multiplexed, microarray based minisequencing for quantitative detection of imbalanced expression of SNP alleles. We used a panel of ten SNPs located in five genes known to be expressed in two endothelial cell lines as our model system. Results The accuracy and sensitivity of quantitative detection of allelic imbalance was assessed for each SNP by constructing regression lines using a dilution series of mixed samples from individuals of different genotype. Accurate quantification of SNP alleles by both assay formats was evidenced by R² values > 0.95 for the majority of the regression lines. According to a two sample t-test, we were able to distinguish 1–9% of a minority SNP allele from a homozygous genotype, with larger variation between SNPs than between assay formats. Six of the SNPs, heterozygous in either of the two cell lines, were genotyped in RNA extracted from the endothelial cells. The coefficient of variation between the fluorescent signals from five parallel reactions was similar for cDNA and genomic DNA. The fluorescence signal intensity ratios measured in the cDNA samples were compared to those in genomic DNA to determine the relative expression levels of the two alleles of each SNP. Four of the six SNPs tested displayed a higher than 1.4-fold difference in allelic ratios between cDNA and genomic DNA. The results were verified by allele-specific oligonucleotide hybridisation and

  10. SNP discovery in nonmodel organisms: strand bias and base-substitution errors reduce conversion rates.

    Science.gov (United States)

    Gonçalves da Silva, Anders; Barendse, William; Kijas, James W; Barris, Wes C; McWilliam, Sean; Bunch, Rowan J; McCullough, Russell; Harrison, Blair; Hoelzel, A Rus; England, Phillip R

    2015-07-01

    Single nucleotide polymorphisms (SNPs) have become the marker of choice for genetic studies in organisms of conservation, commercial or biological interest. Most SNP discovery projects in nonmodel organisms apply a strategy for identifying putative SNPs based on filtering rules that account for random sequencing errors. Here, we analyse data used to develop 4723 novel SNPs for the commercially important deep-sea fish, orange roughy (Hoplostethus atlanticus), to assess the impact of not accounting for systematic sequencing errors when filtering identified polymorphisms during SNP discovery. We used SAMtools to identify polymorphisms in a velvet assembly of genomic DNA sequence data from seven individuals. The resulting set of polymorphisms was filtered to minimize 'bycatch', that is, polymorphisms caused by sequencing or assembly error. An Illumina Infinium SNP chip was used to genotype a final set of 7714 polymorphisms across 1734 individuals. Five predictors were examined for their effect on the probability of obtaining an assayable SNP: depth of coverage, number of reads that support a variant, polymorphism type (e.g. A/C), strand-bias and Illumina SNP probe design score. Our results indicate that filtering out systematic sequencing errors could substantially improve the efficiency of SNP discovery. We show that BLASTX can be used as an efficient tool to identify single-copy genomic regions in the absence of a reference genome. The results have implications for research aiming to identify assayable SNPs and build SNP genotyping assays for nonmodel organisms. © 2014 John Wiley & Sons Ltd.
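
    Strand bias, one of the five predictors listed in the abstract above, is often quantified with a Fisher's exact test on the forward/reverse read counts of the reference and variant alleles. The abstract does not say how the authors measured it, so the sketch below is only an assumed, commonly used formulation; the function name and the read counts are hypothetical.

```python
from math import comb


def strand_bias_pvalue(ref_fwd: int, ref_rev: int, alt_fwd: int, alt_rev: int) -> float:
    """Two-sided Fisher's exact test on a 2x2 table of read counts.

    Rows are alleles (reference, variant); columns are strands (forward, reverse).
    A small p-value means the variant reads are distributed across strands
    differently from the reference reads.
    """
    row1 = ref_fwd + ref_rev
    row2 = alt_fwd + alt_rev
    col1 = ref_fwd + alt_fwd
    total = row1 + row2
    if total == 0:
        raise ValueError("no reads supplied")

    def prob(x: int) -> float:
        # Hypergeometric probability of x reference reads on the forward strand.
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    observed = prob(ref_fwd)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one.
    return sum(
        p for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= observed * (1 + 1e-9)
    )


# Hypothetical counts: a balanced site and a site whose variant reads
# all come from the forward strand.
print(strand_bias_pvalue(10, 10, 10, 10))
print(strand_bias_pvalue(20, 20, 20, 0))
```

    A candidate SNP whose p-value falls below a chosen threshold would be treated as a likely systematic error and filtered out before probe design.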

  11. A general SNP-based molecular barcode for Plasmodium falciparum identification and tracking

    Directory of Open Access Journals (Sweden)

    Rosen David

    2008-10-01

    Full Text Available Abstract Background Single nucleotide polymorphism (SNP) genotyping provides the means to develop a practical, rapid, inexpensive assay that will uniquely identify any Plasmodium falciparum parasite using a small amount of DNA. Such an assay could be used to distinguish recrudescence from re-infection in drug trials, to monitor the frequency and distribution of specific parasites in a patient population undergoing drug treatment or vaccine challenge, or for tracking samples and determining purity of isolates in the laboratory during culture adaptation and sub-cloning, as well as routine passage. Methods A panel of twenty-four SNP markers has been identified that exhibit a high minor allele frequency (average MAF > 35%), for which robust TaqMan genotyping assays were constructed. All SNPs were identified through whole genome sequencing and MAF was estimated through Affymetrix array-based genotyping of a worldwide collection of parasites. These assays create a "molecular barcode" to uniquely identify a parasite genome. Results Using 24 such markers no two parasites known to be of independent origin have yet been found to have the same allele signature. The TaqMan genotyping assays can be performed on a variety of samples including cultured parasites, frozen whole blood, or whole blood spotted onto filter paper with a success rate > 99%. Less than 5 ng of parasite DNA is needed to complete a panel of 24 markers. The ability of this SNP panel to detect and identify parasites was compared to the standard molecular methods, MSP-1 and MSP-2 typing. Conclusion This work provides a facile field-deployable genotyping tool that can be used without special skills with standard lab equipment, and at reasonable cost that will unambiguously identify and track P. falciparum parasites both from patient samples and in the laboratory.
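
    The "molecular barcode" idea described above can be sketched as a fixed-order string of allele calls, one character per marker. This is an illustrative sketch only: the marker names, the use of "N" for a failed call, and the comparison rule are assumptions, not details given in the abstract.

```python
def make_barcode(calls: dict, marker_order: list) -> str:
    """Concatenate allele calls in a fixed marker order; 'N' marks a missing call."""
    return "".join(calls.get(marker, "N") for marker in marker_order)


def barcodes_match(first: str, second: str) -> bool:
    """Return True if two barcodes agree at every position called in both."""
    if len(first) != len(second):
        raise ValueError("barcodes must cover the same markers")
    return all(a == b for a, b in zip(first, second) if "N" not in (a, b))


# Hypothetical four-marker panel (the published panel has 24 markers).
markers = ["snp01", "snp02", "snp03", "snp04"]
day0 = make_barcode({"snp01": "A", "snp02": "C", "snp03": "T", "snp04": "G"}, markers)
day28 = make_barcode({"snp01": "A", "snp02": "C", "snp04": "G"}, markers)
other = make_barcode({"snp01": "G", "snp02": "C", "snp03": "T", "snp04": "G"}, markers)

print(day0, day28, barcodes_match(day0, day28))  # identical where both were called
print(day0, other, barcodes_match(day0, other))  # differs at the first marker
```

    With 24 bi-allelic markers there are 2**24 = 16,777,216 possible barcodes, which is why a high average minor allele frequency makes chance identity between independent parasites unlikely.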

  12. Whole genome DNA copy number changes identified by high density oligonucleotide arrays

    Directory of Open Access Journals (Sweden)

    Huang Jing

    2004-05-01

    Full Text Available Abstract Changes in DNA copy number are one of the hallmarks of the genetic instability common to most human cancers. Previous micro-array-based methods have been used to identify chromosomal gains and losses; however, they are unable to genotype alleles at the level of single nucleotide polymorphisms (SNPs). Here we describe a novel algorithm that uses a recently developed high-density oligonucleotide array-based SNP genotyping method, whole genome sampling analysis (WGSA), to identify genome-wide chromosomal gains and losses at high resolution. WGSA simultaneously genotypes over 10,000 SNPs by allele-specific hybridisation to perfect match (PM) and mismatch (MM) probes synthesised on a single array. The copy number algorithm jointly uses PM intensity and discrimination ratios between paired PM and MM intensity values to identify and estimate genetic copy number changes. Values from an experimental sample are compared with SNP-specific distributions derived from a reference set containing over 100 normal individuals to gain statistical power. Genomic regions with statistically significant copy number changes can be identified using both single point analysis and contiguous point analysis of SNP intensities. We identified multiple regions of amplification and deletion using a panel of human breast cancer cell lines. We verified these results using an independent method based on quantitative polymerase chain reaction and found that our approach is both sensitive and specific and can tolerate samples which contain a mixture of both tumour and normal DNA. In addition, by using known allele frequencies from the reference set, statistically significant genomic intervals can be identified containing contiguous stretches of homozygous markers, potentially allowing the detection of regions undergoing loss of heterozygosity (LOH) without the need for a matched normal control sample. The coupling of LOH analysis, via SNP genotyping, with copy number

  13. Sex-specific association of rs16996148 SNP in the NCAN/CILP2/PBX4 and serum lipid levels in the Mulao and Han populations

    Directory of Open Access Journals (Sweden)

    Yan Ting-Ting

    2011-12-01

    Full Text Available Abstract Background The association of rs16996148 single nucleotide polymorphism (SNP) in NCAN/CILP2/PBX4 and serum lipid levels is inconsistent. Furthermore, little is known about the association of rs16996148 SNP and serum lipid levels in the Chinese population. We therefore aimed to detect the association of rs16996148 SNP and several environmental factors with serum lipid levels in the Guangxi Mulao and Han populations. Method A total of 712 subjects of Mulao nationality and 736 participants of Han nationality were randomly selected from our stratified randomized cluster samples. Genotyping of the rs16996148 SNP was performed by polymerase chain reaction and restriction fragment length polymorphism combined with gel electrophoresis, and then confirmed by direct sequencing. Results The levels of apolipoprotein (Apo) B were higher in Mulao than in Han (P P 0.05; respectively. The frequencies of GG, GT and TT genotypes were 76.0%, 22.5% and 1.5% in Mulao, and 81.2%, 17.4% and 1.4% in Han (P 0.05; respectively. There were no significant differences in the genotypic and allelic frequencies between males and females in both ethnic groups. The levels of HDL-C, ApoAI, and the ratio of ApoAI to ApoB in Mulao were different between the GG and GT/TT genotypes in males but not in females (P P P P P Conclusions The genotypic and allelic frequencies of rs16996148 SNP and the associations of the SNP and serum lipid levels are different in the Mulao and Han populations. Sex (male)-specific association of rs16996148 SNP in the NCAN/CILP2/PBX4 and serum lipid levels is also observed in both ethnic groups.
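
    The genotype frequencies reported above determine the allele frequencies by simple gene counting: each GG carrier contributes two G alleles and each GT carrier one. A minimal sketch of that arithmetic follows; the function name is an assumption, and the inputs are the percentages quoted in the abstract.

```python
def allele_frequencies(gg: float, gt: float, tt: float):
    """Return (G, T) allele frequencies from genotype counts or percentages."""
    total = gg + gt + tt
    if total <= 0:
        raise ValueError("genotype totals must be positive")
    g_freq = (gg + gt / 2) / total  # heterozygotes carry one G allele each
    return g_freq, 1 - g_freq


# Genotype percentages quoted in the abstract (GG, GT, TT).
mulao_g, mulao_t = allele_frequencies(76.0, 22.5, 1.5)
han_g, han_t = allele_frequencies(81.2, 17.4, 1.4)

print(f"Mulao: G = {mulao_g:.4f}, T = {mulao_t:.4f}")
print(f"Han:   G = {han_g:.4f}, T = {han_t:.4f}")
```

    For the Mulao sample this gives G = (76.0 + 22.5/2)/100 = 0.8725, and for the Han sample G = (81.2 + 17.4/2)/100 = 0.899.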

  14. New generation pharmacogenomic tools: a SNP linkage disequilibrium Map, validated SNP assay resource, and high-throughput instrumentation system for large-scale genetic studies.

    Science.gov (United States)

    De La Vega, Francisco M; Dailey, David; Ziegle, Janet; Williams, Julie; Madden, Dawn; Gilbert, Dennis A

    2002-06-01

    Since public and private efforts announced the first draft of the human genome last year, researchers have reported great numbers of single nucleotide polymorphisms (SNPs). We believe that the availability of well-mapped, quality SNP markers constitutes the gateway to a revolution in genetics and personalized medicine that will lead to better diagnosis and treatment of common complex disorders. A new generation of tools and public SNP resources for pharmacogenomic and genetic studies--specifically for candidate-gene, candidate-region, and whole-genome association studies--will form part of the new scientific landscape. This will only be possible through the greater accessibility of SNP resources and superior high-throughput instrumentation-assay systems that enable affordable, highly productive large-scale genetic studies. We are contributing to this effort by developing a high-quality linkage disequilibrium SNP marker map and an accompanying set of ready-to-use, validated SNP assays across every gene in the human genome. This effort incorporates both the public sequence and SNP data sources, and Celera Genomics' human genome assembly and enormous resource of physically mapped SNPs (approximately 4,000,000 unique records). This article discusses our approach and methodology for designing the map, choosing quality SNPs, designing and validating these assays, and obtaining population frequency of the polymorphisms. We also discuss an advanced, high-performance SNP assay chemistry (a new generation of the TaqMan probe-based, 5' nuclease assay) and a high-throughput instrumentation-software system for large-scale genotyping. We provide the new SNP map and validation information, validated SNP assays and reagents, and instrumentation systems as a novel resource for genetic discoveries.

  15. A SNP based high-density linkage map of Apis cerana reveals a high recombination rate similar to Apis mellifera.

    Directory of Open Access Journals (Sweden)

    Yuan Yuan Shi

    Full Text Available BACKGROUND: The Eastern honey bee, Apis cerana Fabricius, is distributed in southern and eastern Asia, from India and China to Korea and Japan and southeast to the Moluccas. Besides Apis mellifera, this species is also widely kept for honey production. Apis cerana is also a model organism for studying social behavior, caste determination, mating biology, sexual selection, and host-parasite interactions. Few resources are available for molecular research in this species, and a linkage map has never been constructed. A linkage map is a prerequisite for quantitative trait loci mapping and for analyzing genome structure. We used the Chinese honey bee, Apis cerana cerana, to construct the first linkage map in the Eastern honey bee. RESULTS: F2 workers (N = 103) were genotyped for 126,990 single nucleotide polymorphisms (SNPs). After filtering out low-quality SNPs and those not passing the Mendel test, we obtained 3,000 SNPs, of which 1,535 were informative and used to construct a linkage map. The preliminary map contains 19 linkage groups; we then mapped the 19 linkage groups to 16 chromosomes by comparing the markers to the genome of A. mellifera. The final map contains 16 linkage groups with a total of 1,535 markers. The total genetic distance is 3,942.7 centimorgans (cM), with the largest linkage group (180 loci) measuring 574.5 cM. Average marker interval for all markers across the 16 linkage groups is 2.6 cM. CONCLUSION: We constructed a high density linkage map for A. c. cerana with 1,535 markers. Because the map is based on SNP markers, it will enable easier and faster genotyping assays than randomly amplified polymorphic DNA or microsatellite based maps used in A. mellifera.
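
    The average marker interval quoted above follows from the map totals. The abstract does not state which denominator was used, so the sketch below assumes the usual convention that a linkage group with n markers contains n - 1 intervals; the function name is an assumption. Dividing by the raw marker count instead gives the same value after rounding to one decimal.

```python
def mean_marker_interval(total_cm: float, n_markers: int, n_groups: int) -> float:
    """Average spacing in cM, counting one interval fewer than markers per group."""
    n_intervals = n_markers - n_groups
    if n_intervals <= 0:
        raise ValueError("need more markers than linkage groups")
    return total_cm / n_intervals


# Totals reported in the abstract: 3,942.7 cM, 1,535 markers, 16 linkage groups.
interval = mean_marker_interval(3942.7, 1535, 16)
print(f"{interval:.1f} cM")  # 3942.7 / 1519 intervals

# Same calculation for the largest linkage group (180 loci, 574.5 cM).
print(f"{mean_marker_interval(574.5, 180, 1):.1f} cM")
```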

  16. SNP discovery in the bovine milk transcriptome using RNA-Seq technology.

    Science.gov (United States)

    Cánovas, Angela; Rincon, Gonzalo; Islas-Trejo, Alma; Wickramasinghe, Saumya; Medrano, Juan F

    2010-12-01

    High-throughput sequencing of RNA (RNA-Seq) was developed primarily to analyze global gene expression in different tissues. However, it also is an efficient way to discover coding SNPs. The objective of this study was to perform a SNP discovery analysis in the milk transcriptome using RNA-Seq. Seven milk samples from Holstein cows were analyzed by sequencing cDNAs using the Illumina Genome Analyzer system. We detected 19,175 genes expressed in milk samples corresponding to approximately 70% of the total number of genes analyzed. The SNP detection analysis revealed 100,734 SNPs in Holstein samples, and a large number of those corresponded to differences between the Holstein breed and the Hereford bovine genome assembly Btau4.0. The number of polymorphic SNPs within Holstein cows was 33,045. The accuracy of RNA-Seq SNP discovery was tested by comparing SNPs detected in a set of 42 candidate genes expressed in milk that had been resequenced earlier using Sanger sequencing technology. Seventy of 86 SNPs were detected using both RNA-Seq and Sanger sequencing technologies. The KASPar Genotyping System was used to validate unique SNPs found by RNA-Seq but not observed by Sanger technology. Our results confirm that analyzing the transcriptome using RNA-Seq technology is an efficient and cost-effective method to identify SNPs in transcribed regions. This study creates guidelines to maximize the accuracy of SNP discovery and prevention of false-positive SNP detection, and provides more than 33,000 SNPs located in coding regions of genes expressed during lactation that can be used to develop genotyping platforms to perform marker-trait association studies in Holstein cattle.

  17. Analysis of SNP rs16754 of WT1 gene in a series of de novo acute myeloid leukemia patients.

    Science.gov (United States)

    Luna, Irene; Such, Esperanza; Cervera, Jose; Barragán, Eva; Jiménez-Velasco, Antonio; Dolz, Sandra; Ibáñez, Mariam; Gómez-Seguí, Inés; López-Pavía, María; Llop, Marta; Fuster, Óscar; Oltra, Silvestre; Moscardó, Federico; Martínez-Cuadrón, David; Senent, M Leonor; Gascón, Adriana; Montesinos, Pau; Martín, Guillermo; Bolufer, Pascual; Sanz, Miguel A

    2012-12-01

    The single nucleotide polymorphism (SNP) rs16754 of the WT1 gene has been previously described as a possible prognostic marker in normal karyotype acute myeloid leukemia (AML) patients. Nevertheless, the findings in this field are not always reproducible in different series. One hundred and seventy-five adult de novo AML patients were screened with two different methods for the detection of SNP rs16754: high-resolution melting (HRM) and FRET hybridization probes. Direct sequencing was used to validate both techniques. The SNP was detected in 52 out of 175 patients (30 %), both by HRM and hybridization probes. Direct sequencing confirmed that every positive sample in the screening methods had a variation in the DNA sequence. Patients with the wild-type genotype (WT1(AA)) for the SNP rs16754 were significantly younger than those with the heterozygous WT1(AG) genotype. No other difference was observed for baseline characteristic or outcome between patients with or without the SNP. Both techniques are equally reliable and reproducible as screening methods for the detection of the SNP rs16754, allowing for the selection of those samples that will need to be sequenced. We were unable to confirm the suggested favorable outcome of SNP rs16754 in de novo AML.

  18. SNP markers retrieval for a non-model species: a practical approach

    Directory of Open Access Journals (Sweden)

    Shahin Arwa

    2012-01-01

    Full Text Available Abstract Background SNP (Single Nucleotide Polymorphism) markers are rapidly becoming the markers of choice for applications in breeding because of next generation sequencing technology developments. For SNP development by NGS technologies, correct assembly of the huge amounts of sequence data generated is essential. Little is known about assembler performance, especially for highly heterogeneous species with high genome complexity, or about the consequences of differences between assemblies for SNP retrieval. This study tested two assemblers (CAP3 and CLC) on 454 data from four lily genotypes and compared results with respect to SNP retrieval. Results CAP3 assembly resulted in higher numbers of contigs, lower numbers of reads per contig, and shorter average read lengths compared to CLC. Blast comparisons showed that CAP3 contigs were highly redundant. In contrast, CLC in rare cases combined paralogs in one contig. Redundant and chimeric contigs may lead to erroneous SNPs. Filtering for redundancy can be done by blasting selected SNP markers to the contigs and discarding all the SNP markers that show more than one blast hit. Results on chimeric contigs showed that only four out of 2,421 SNP markers were selected from chimeric contigs. Conclusion In practice, CLC performs better in assembling highly heterogeneous genome sequences compared to CAP3, and consequently SNP retrieval is more efficient. Additionally, a simple flow scheme is suggested for SNP marker retrieval that can be valid for all non-model species.
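
    The redundancy filter described above (discard every SNP marker with more than one blast hit) can be sketched as a small post-processing step. The sketch assumes tab-separated BLAST output whose first two columns are the query ID and the subject ID, as in BLAST+ tabular format; it interprets "more than one hit" as hits to more than one distinct contig, which is an assumption, and the marker and contig names are hypothetical.

```python
from collections import defaultdict


def single_hit_markers(blast_lines) -> list:
    """Return marker IDs that hit exactly one distinct contig."""
    contigs_per_marker = defaultdict(set)
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.split("\t")
        marker_id, contig_id = fields[0], fields[1]
        contigs_per_marker[marker_id].add(contig_id)
    return sorted(m for m, contigs in contigs_per_marker.items() if len(contigs) == 1)


# Hypothetical tabular hits: marker, contig, percent identity.
hits = [
    "snpA\tcontig_12\t99.1",
    "snpB\tcontig_07\t98.5",
    "snpB\tcontig_31\t97.9",  # second contig, so snpB is treated as redundant
    "snpC\tcontig_44\t100.0",
]
print(single_hit_markers(hits))
```

    Markers with no hit at all never appear in the output and would need to be handled separately.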

  19. Functional SNP associated with birth weight in independent populations identified with a permutation step added to GBLUP-GWAS

    Science.gov (United States)

    This study was conducted as an initial assessment of a newly available genotyping assay containing about 34,000 common SNP included on previous SNP chips, and 199,000 sequence variants predicted to affect gene function. Objectives were to identify functional variants associated with birth weight in...

  20. Survey of SSC12 regions affecting fatty acid composition of intramuscular fat using high density SNP data

    Directory of Open Access Journals (Sweden)

    María Muñoz

    2012-01-01

    Full Text Available Fatty acid composition is a critical aspect of pork because it affects sensorial and technological aspects of meat quality and it is relevant for human health. Previous studies identified significant QTLs in porcine chromosome 12 for fatty acid profile of backfat and intramuscular fat. In the present study, 374 SNPs mapped in SSC12 from the 60K Porcine SNP Beadchip were used. We have combined linkage and association analyses with expression data analysis in order to identify regions of SSC12 that could affect fatty acid composition of intramuscular fat in longissimus muscle. The QTL scan showed a region around the 60 cM position that significantly affects palmitic fatty acid and two related fatty acid indexes. The Iberian QTL allele increased the palmitic content (+2.6% of mean trait). This QTL does not match any of those reported in the previous study on fatty acid composition of backfat, suggesting different genetic control acting at both tissues. The SNP association analyses showed significant associations with linolenic and palmitic acids besides several indexes. Among the polymorphisms that affect palmitic fatty acid and match the QTL region at 60 cM, there were three that mapped in the Phosphatidylcholine Transfer Protein (PCTP) gene and one in the Acetyl-CoA Carboxylase α gene (ACACA). Interestingly, one of the PCTP SNPs also significantly affected unsaturated and double bond indexes and the ratio between polyunsaturated/monounsaturated fatty acids. Differential expression was assessed on longissimus muscle conditional on the genotype of the QTL and on the most significant SNPs, according to the results obtained in the former analyses. Results from the microarray expression analyses, validated by RT-qPCR, showed that PCTP expression levels significantly vary depending on the QTL as well as on the own PCTP genotype. The results obtained with the different approaches point out the PCTP gene as a powerful candidate underlying the QTL for

  1. Genotyping by Sequencing for SNP-Based Linkage Map Construction and QTL Analysis of Chilling Requirement and Bloom Date in Peach [Prunus persica (L.) Batsch].

    Directory of Open Access Journals (Sweden)

    Douglas Gary Bielenberg

    Full Text Available Low-cost, high throughput genotyping methods are crucial to marker discovery and marker-assisted breeding efforts, but have not been available for many 'specialty crops' such as fruit and nut trees. Here we apply the Genotyping-By-Sequencing (GBS) method developed for cereals to the discovery of single nucleotide polymorphisms (SNPs) in a peach F2 mapping population. Peach is a genetic and genomic model within the Rosaceae and will provide a template for the use of this method with other members of this family. Our F2 mapping population of 57 genotypes segregates for bloom time (BD) and chilling requirement (CR) and we have extensively phenotyped this population. The population derives from a selfed F1 progeny of a cross between 'Hakuho' (high CR) and 'UFGold' (low CR). We were able to successfully employ GBS and the TASSEL GBS pipeline without modification of the original methodology using the ApeKI restriction enzyme and multiplexing at an equivalent of 96 samples per Illumina HiSeq 2000 lane. We obtained hundreds of SNP markers which were then used to construct a genetic linkage map and identify quantitative trait loci (QTL) for BD and CR.

  2. GACT: a Genome build and Allele definition Conversion Tool for SNP imputation and meta-analysis in genetic association studies.

    Science.gov (United States)

    Sulovari, Arvis; Li, Dawei

    2014-07-19

    Genome-wide association studies (GWAS) have successfully identified genes associated with complex human diseases. Although much of the heritability remains unexplained, combining single nucleotide polymorphism (SNP) genotypes from multiple studies for meta-analysis will increase the statistical power to identify new disease-associated variants. Meta-analysis requires the same allele definition (nomenclature) and genome build among individual studies. Similarly, imputation, commonly used prior to meta-analysis, requires the same consistency. However, the genotypes from various GWAS are generated using different genotyping platforms, arrays or SNP-calling approaches, resulting in use of different genome builds and allele definitions. Incorrect assumptions of identical allele definition among combined GWAS lead to a large portion of discarded genotypes or incorrect association findings. There is no published tool that predicts and converts among all major allele definitions. In this study, we have developed a tool, GACT, which stands for Genome build and Allele definition Conversion Tool, that predicts and inter-converts between any of the common SNP allele definitions and between the major genome builds. In addition, we assessed several factors that may affect imputation quality, and our results indicated that inclusion of singletons in the reference had detrimental effects while ambiguous SNPs had no measurable effect. Unexpectedly, exclusion of genotypes with missing rate > 0.001 (40% of study SNPs) showed no significant decrease of imputation quality (even significantly higher when compared to the imputation with singletons in the reference), especially for rare SNPs. GACT is a new, powerful, and user-friendly tool with both command-line and interactive online versions that can accurately predict, and convert between any of the common allele definitions and between genome builds for genome-wide meta-analysis and imputation of genotypes from SNP-arrays or deep

  3. Compression and fast retrieval of SNP data.

    Science.gov (United States)

    Sambo, Francesco; Di Camillo, Barbara; Toffolo, Gianna; Cobelli, Claudio

    2014-11-01

    The increasing interest in rare genetic variants and epistatic genetic effects on complex phenotypic traits is currently pushing genome-wide association study design towards datasets of increasing size, both in the number of studied subjects and in the number of genotyped single nucleotide polymorphisms (SNPs). This, in turn, is leading to a compelling need for new methods for compression and fast retrieval of SNP data. We present a novel algorithm and file format for compressing and retrieving SNP data, specifically designed for large-scale association studies. Our algorithm is based on two main ideas: (i) compress linkage disequilibrium blocks in terms of differences with a reference SNP and (ii) compress reference SNPs exploiting information on their call rate and minor allele frequency. Tested on two SNP datasets and compared with several state-of-the-art software tools, our compression algorithm is shown to be competitive in terms of compression rate and to outperform all tools in terms of time to load compressed data. Our compression and decompression algorithms are implemented in a C++ library, are released under the GNU General Public License and are freely downloadable from http://www.dei.unipd.it/~sambofra/snpack.html. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
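
    Idea (i) in the abstract above, storing the SNPs of a linkage disequilibrium block as differences from a reference SNP, can be illustrated with a toy encoder. This is only a sketch of the general principle under stated assumptions (genotypes coded 0/1/2 per subject, one reference SNP per block); it is not the SNPack file format, and the function names are hypothetical.

```python
def encode_block(reference: list, others: list) -> list:
    """For each SNP, keep only (subject index, genotype) pairs that differ from the reference."""
    encoded = []
    for snp in others:
        if len(snp) != len(reference):
            raise ValueError("all SNPs in a block must cover the same subjects")
        encoded.append([(i, g) for i, (g, r) in enumerate(zip(snp, reference)) if g != r])
    return encoded


def decode_block(reference: list, encoded: list) -> list:
    """Rebuild every SNP by applying its stored differences to a copy of the reference."""
    decoded = []
    for diffs in encoded:
        snp = list(reference)
        for index, genotype in diffs:
            snp[index] = genotype
        decoded.append(snp)
    return decoded


# Hypothetical block: six subjects, genotypes coded as minor-allele counts.
reference_snp = [0, 1, 2, 0, 1, 0]
block = [
    [0, 1, 2, 0, 1, 0],  # identical to the reference, so nothing is stored
    [0, 1, 2, 0, 2, 0],  # one subject differs
]
compact = encode_block(reference_snp, block)
print(compact)
print(decode_block(reference_snp, compact) == block)
```

    The stronger the linkage disequilibrium within a block, the fewer differences need to be stored, which is where the compression comes from.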

  4. Design and Characterization of a 52K SNP Chip for Goats

    NARCIS (Netherlands)

    Tosser-klopp, G.; Bardou, P.; Bouchez, O.; Cabau, C.; Crooijmans, R.P.M.A.; Dong, Y.; Donnadieu-Tonon, C.; Eggen, A.; Heuven, H.C.M.; Jamli, S.; Jiken, A.J.; Klopp, C.; Lawley, C.T.; McEwen, J.; Martin, P.; Moreno, C.R.; Mulsant, P.; Nabihoudine, I.; Pailhoux, E.; Palhiere, I.; Rupp, R.; Sarry, J.; Sayre, B.L.; Tircazes, A.; Wang, J.; Wang, W.; Zhang, W.G.

    2014-01-01

    The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a

  5. Large SNP arrays for genotyping in crop plants

    Indian Academy of Sciences (India)

    2012-10-15

    Oct 15, 2012 ... in human has been paralleled by the simultaneous development of ... In crop plants, the development of large genotyping arrays started much ..... via deep resequencing of reduced representation libraries with the Illumina ...

  6. Bovine Exome Sequence Analysis and Targeted SNP Genotyping of Recessive Fertility Defects BH1, HH2, and HH3 Reveal a Putative Causative Mutation in SMC2 for HH3

    OpenAIRE

    McClure, Matthew C.; Bickhart, Derek; Null, Dan; VanRaden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B.; Van Tassell, Curtis P.; Sonstegard, Tad S.

    2014-01-01

    The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next gener...

  7. Rapid identification of tomato Sw-5 resistance-breaking isolates of Tomato spotted wilt virus using high resolution melting and TaqMan SNP Genotyping assays as allelic discrimination techniques.

    Directory of Open Access Journals (Sweden)

    Valentina di Rienzo

    Full Text Available In tomato, resistance to Tomato spotted wilt virus (TSWV) is conferred by the dominant gene, designated Sw-5. Virulent Sw-5 resistance breaking (SRB) mutants of TSWV have been reported on Sw-5 tomato cultivars. Two different PCR-based allelic discrimination techniques, namely Custom TaqMan™ SNP Genotyping and high-resolution melting (HRM) assays, were developed and compared for their ability to distinguish between avirulent (Sw-5 non-infecting, SNI) and SRB biotypes. TaqMan assays proved to be more sensitive (threshold of detection in a range of 50-70 TSWV RNA copies) and more reliable than HRM, assigning 25 TSWV isolates to their correct genotype with an accuracy of 100%. Moreover, the TaqMan SNP assays were further improved by developing a rapid and simple protocol that included crude leaf extraction for RNA template preparations. On the other hand, HRM assays showed higher levels of sensitivity than TaqMan when used to co-detect both biotypes in different artificial mixtures. These diagnostic assays helped to gain preliminary information on the epidemiology of TSWV isolates in open field conditions. In fact, the presented data suggest that SRB isolates are present as stable populations established year round, persisting on both winter (globe artichoke) and summer (tomato) crops, in the same cultivated areas of Southern Italy.

  8. QTL Mapping of Adult-Plant Resistance to Leaf Rust in the Wheat Cross Zhou 8425B/Chinese Spring Using High-Density SNP Markers

    Directory of Open Access Journals (Sweden)

    Peipei Zhang

    2017-05-01

    Full Text Available Wheat leaf rust is an important disease worldwide. Growing resistant cultivars is an effective means to control the disease. In the present study, 244 recombinant inbred lines from Zhou 8425B/Chinese Spring cross were phenotyped for leaf rust severities during the 2011–2012, 2012–2013, 2013–2014, and 2014–2015 cropping seasons at Baoding, Hebei province, and 2012–2013 and 2013–2014 cropping seasons in Zhoukou, Henan province. The population was genotyped using the high-density Illumina iSelect 90K SNP assay and SSR markers. Inclusive composite interval mapping identified eight QTL, designated as QLr.hebau-2AL, QLr.hebau-2BS, QLr.hebau-3A, QLr.hebau-3BS, QLr.hebau-4AL, QLr.hebau-4B, QLr.hebau-5BL, and QLr.hebau-7DS, respectively. QLr.hebau-2BS, QLr.hebau-3A, QLr.hebau-3BS, and QLr.hebau-5BL were derived from Zhou 8425B, whereas the other four were from Chinese Spring. Three stable QTL on chromosomes 2BS, 4B and 7DS explained 7.5–10.6%, 5.5–24.4%, and 11.2–20.9% of the phenotypic variance, respectively. QLr.hebau-2BS in Zhou 8425B might be the same as LrZH22 in Zhoumai 22; QLr.hebau-4B might be the residual resistance of Lr12, and QLr.hebau-7DS is Lr34. QLr.hebau-2AL, QLr.hebau-3BS, QLr.hebau-4AL, and QLr.hebau-5BL are likely to be novel QTL for leaf rust. These QTL and their closely linked SNP and SSR markers can be used for fine mapping, candidate gene discovery, and marker-assisted selection in wheat breeding.

  9. Genetic Polymorphism of MDM2 SNP309 in Patients with Helicobacter Pylori-Associated Gastritis.

    Science.gov (United States)

    Tongtawee, Taweesak; Dechsukhum, Chavaboon; Leeanansaksiri, Wilairat; Kaewpitoon, Soraya; Kaewpitoon, Natthawut; Loyd, Ryan A; Matrakool, Likit; Panpimanmas, Sukij

    2015-01-01

    Helicobacter pylori plays an important role in gastric cancer, which has a relatively low incidence in Thailand. MDM2 is a major negative regulator of p53, the key tumor suppressor involved in tumorigenesis of the majority of human cancers. Whether its expression might explain the relative lack of gastric cancer in Thailand was assessed here. This single-center study was conducted in the northeast region of Thailand. Gastric mucosa from 100 patients with Helicobacter pylori-associated gastritis was analyzed for MDM2 SNP309 using real-time PCR hybridization (light-cycler) probes. Among the 100 Helicobacter pylori-associated gastritis cases, the SNP309 T/T homozygous genotype was found in 78%, the SNP309 G/T heterozygous genotype in 19%, and the SNP309 G/G homozygous genotype in 3%. The results show SNP309 T/T and SNP309 G/T to be rather common in the Thai population. Our study indicates that the MDM2 SNP309 G/G homozygous genotype might be a risk factor for gastric cancer in Thailand and the fact that it is infrequent could explain to some extent the low incidence of gastric cancer in the Thai population.

  10. Comparison of three PCR-based assays for SNP genotyping in sugar beet

    Science.gov (United States)

    Background: PCR allelic discrimination technologies have broad applications in the detection of single nucleotide polymorphisms (SNPs) in genetics and genomics. The use of fluorescence-tagged probes is the leading method for targeted SNP detection, but assay costs and error rates could be improved t...

  11. An improved PSO algorithm for generating protective SNP barcodes in breast cancer.

    Directory of Open Access Journals (Sweden)

    Li-Yeh Chuang

    Full Text Available BACKGROUND: Possible single nucleotide polymorphism (SNP) interactions in breast cancer are usually not investigated in genome-wide association studies. Previously, we proposed a particle swarm optimization (PSO) method to compute these kinds of SNP interactions. However, this PSO is not guaranteed to find the best result in every implementation, especially when high-dimensional data is investigated for SNP-SNP interactions. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we propose the IPSO algorithm to improve the reliability of PSO for the identification of the best protective SNP barcodes (SNP combinations and genotypes with maximum difference between cases and controls) associated with breast cancer. SNP barcodes containing different numbers of SNPs were computed. The top five SNP barcode results are retained for computing the next SNP barcode with a one-SNP increase for each processing step. Based on the simulated data for 23 SNPs of six steroid hormone metabolism and signalling-related genes, the performance of our proposed IPSO algorithm is evaluated. Among 23 SNPs, 13 SNPs displayed significant odds ratio (OR) values (1.268 to 0.848; p<0.05) for breast cancer. Based on the IPSO algorithm, the joint effects in terms of SNP barcodes with two to seven SNPs show significantly decreasing OR values (0.84 to 0.57; p<0.05 to 0.001). Using the PSO algorithm, two to four SNPs show significantly decreasing OR values (0.84 to 0.77; p<0.05 to 0.001). Based on the results of 20 simulations, medians of the maximum differences for each SNP barcode generated by IPSO are higher than by PSO. The interquartile ranges of the boxplot, as well as the upper and lower hinges for each n-SNP barcode (n = 3∼10), are narrower in IPSO than in PSO, suggesting that IPSO is highly reliable for SNP barcode identification. CONCLUSIONS/SIGNIFICANCE: Overall, the proposed IPSO algorithm is robust in providing exact identification of the best protective SNP barcodes for breast cancer.

  12. Haplotype-Based Genotyping in Polyploids

    Directory of Open Access Journals (Sweden)

    Josh P. Clevenger

    2018-04-01

    Full Text Available Accurate identification of polymorphisms from sequence data is crucial to unlocking the potential of high throughput sequencing for genomics. Single nucleotide polymorphisms (SNPs) are difficult to accurately identify in polyploid crops due to the duplicative nature of polyploid genomes leading to low confidence in the true alignment of short reads. Implementing a haplotype-based method in contrasting subgenome-specific sequences leads to higher accuracy of SNP identification in polyploids. To test this method, a large-scale 48K SNP array (Axiom Arachis2) was developed for Arachis hypogaea (peanut), an allotetraploid, in which 1,674 haplotype-based SNPs were included. Results of the array show that 74% of the haplotype-based SNP markers could be validated, which is considerably higher than previous methods used for peanut. The haplotype method has been implemented in a standalone program, HAPLOSWEEP, which takes as input bam files and a vcf file and identifies haplotype-based markers. Haplotype discovery can be made within single reads or span paired reads, and can leverage long read technology by targeting any length of haplotype. Haplotype-based genotyping is applicable in all allopolyploid genomes and provides confidence in marker identification and in silico-based genotyping for polyploid genomics.

  13. Calmodulin-like protein 3 is an estrogen receptor alpha coregulator for gene expression and drug response in a SNP, estrogen, and SERM-dependent fashion.

    Science.gov (United States)

    Qin, Sisi; Ingle, James N; Liu, Mohan; Yu, Jia; Wickerham, D Lawrence; Kubo, Michiaki; Weinshilboum, Richard M; Wang, Liewei

    2017-08-18

    We previously performed a case-control genome-wide association study in women treated with selective estrogen receptor modulators (SERMs) for breast cancer prevention and identified single nucleotide polymorphisms (SNPs) in ZNF423 as potential biomarkers for response to SERM therapy. The ZNF423rs9940645 SNP, which is approximately 200 bp away from the estrogen response elements, resulted in the SNP, estrogen, and SERM-dependent regulation of ZNF423 expression and, "downstream", that of BRCA1. Electrophoretic mobility shift assay-mass spectrometry was performed to identify proteins binding to the ZNF423 SNP and coordinating with estrogen receptor alpha (ERα). Clustered, regularly interspaced short palindromic repeats (CRISPR)/Cas9 genome editing was applied to generate ZR75-1 breast cancer cells with different ZNF423 SNP genotypes. Both cultured cells and mouse xenograft models with different ZNF423 SNP genotypes were used to study the cellular responses to SERMs and poly(ADP-ribose) polymerase (PARP) inhibitors. We identified calmodulin-like protein 3 (CALML3) as a key sensor of this SNP and a coregulator of ERα, which contributes to differential gene transcription regulation in an estrogen and SERM-dependent fashion. Furthermore, using CRISPR/Cas9-engineered ZR75-1 breast cancer cells with different ZNF423 SNP genotypes, striking differences in cellular responses to SERMs and PARP inhibitors, alone or in combination, were observed not only in cells but also in a mouse xenograft model. Our results have demonstrated the mechanism by which the ZNF423 rs9940645 SNP might regulate gene expression and drug response as well as its potential role in achieving more highly individualized breast cancer therapy.

  14. Effects of ghrelin gene genotypes on the growth traits in Chinese cattle.

    Science.gov (United States)

    Zhang, Ai-ling; Zhang, Li; Zhang, Liang-zhi; Zhang, Cun-fang; Lan, Xian-yong; Zhang, Chun-lei; Chen, Hong

    2012-06-01

    Ghrelin is an important peptide that stimulates food intake and regulates energy balance of animals. Single nucleotide polymorphisms of ghrelin gene in three Chinese cattle populations were investigated through PCR-SSCP and DNA sequencing. Five over-lapped DNA fragments were analyzed and a total of three ones exhibited different genotypes. Three genotypes and four SNPs (-415 A > G, -414 T > C, -321 C > A, and -172 A > G) were found on the -544 to +35 bp region (G-1) of ghrelin gene. On the locus of -1037 to -509 bp (G-2), two genotypes and one SNP (-726 A > T) were discovered. And in the exon1, exon2, and intron1 (G-4 locus, (+4 to +427)), two genotypes and one SNP were detected (+205 C > T, located in intron1). Positions of the five SNPs in the 5′ regulatory region might be the transcription factor binding sites. The SNPs at -415 and -414 in the core binding sequence were found to cause the change of the site. Though the SNP at -172 did not change the binding site, it generated one new site at the same time. The frequencies of the genotypes varied differently in the three breeds. Results of ANOVA showed that G-1 was correlative to the ischium width (IW) of Nanyang cattle aged 18 months (p = 0.043). The least square analysis between genotypes at G-1 locus and growth traits in Nanyang cattle showed that the individuals (aged 18 months) with C genotype had greater IW than that of the other two genotypes. The C genotype might serve as one potential candidate genetic marker for cattle growth and development.

  15. SNPdetector: a software tool for sensitive and accurate SNP detection.

    Directory of Open Access Journals (Sweden)

    Jinghui Zhang

    2005-10-01

    Full Text Available Identification of single nucleotide polymorphisms (SNPs) and mutations is important for the discovery of genetic predisposition to complex diseases. PCR resequencing is the method of choice for de novo SNP discovery. However, manual curation of putative SNPs has been a major bottleneck in the application of this method to high-throughput screening. Therefore it is critical to develop a more sensitive and accurate computational method for automated SNP detection. We developed a software tool, SNPdetector, for automated identification of SNPs and mutations in fluorescence-based resequencing reads. SNPdetector was designed to model the process of human visual inspection and has a very low false positive and false negative rate. We demonstrate the superior performance of SNPdetector in SNP and mutation analysis by comparing its results with those derived by human inspection, PolyPhred (a popular SNP detection tool), and independent genotype assays in three large-scale investigations. The first study identified and validated inter- and intra-subspecies variations in 4,650 traces of 25 inbred mouse strains that belong to either the Mus musculus species or the M. spretus species. Unexpected heterozygosity in CAST/Ei strain was observed in two out of 1,167 mouse SNPs. The second study identified 11,241 candidate SNPs in five ENCODE regions of the human genome covering 2.5 Mb of genomic sequence. Approximately 50% of the candidate SNPs were selected for experimental genotyping; the validation rate exceeded 95%. The third study detected ENU-induced mutations (at 0.04% allele frequency) in 64,896 traces of 1,236 zebra fish. Our analysis of three large and diverse test datasets demonstrated that SNPdetector is an effective tool for genome-scale research and for large-sample clinical studies. SNPdetector runs on Unix/Linux platform and is available publicly (http://lpg.nci.nih.gov).

  16. Marcadores SNP: conceitos básicos, aplicações no manejo e no melhoramento animal e perspectivas para o futuro SNP markers: basic concepts, applications in animal breeding and management and perspectives for the future

    Directory of Open Access Journals (Sweden)

    Alexandre Rodrigues Caetano

    2009-07-01

    molecular markers to characterize genetic resources and generate tools for animal breeding and management date from the end of the 80s. In the last 20 years the technologies to generate molecular data went through several innovation cycles. The last wave of technological innovations represents a true revolution, bringing methods to identify and genotype SNP (Single Nucleotide Polymorphism) markers on a large scale. High density DNA chips were generated to genotype from tens of thousands to hundreds of thousands of SNPs in a single assay. Furthermore, other medium density technologies allow for the genotyping of tens to hundreds of markers, in high numbers of samples, with very high speed and automation. These new technologies allowed for the generation of new applications, such as the methods to genetically evaluate and select animals based on their Genomic Value (Genomic Estimated Breeding Value - GEBV). The statistical methods for genomic evaluation and selection are in full development, but the technology already became reality with the release of the first bull summary for the Holstein breed with GEBVs for milk production and quality traits in January 2009. In addition, these technologies brought new options for development of diagnostic tests for paternity testing, individual identification, traceability, etc. Also, these new technologies to genotype SNP markers facilitated the development of outsourcing companies to generate molecular data, allowing any group to conduct advanced experiments, always using the most advanced technologies, without the need for investment in equipment.

  17. Development of highly reliable in silico SNP resource and genotyping assay from exome capture and sequencing: an example from black spruce (Picea mariana).

    Science.gov (United States)

    Pavy, Nathalie; Gagnon, France; Deschênes, Astrid; Boyle, Brian; Beaulieu, Jean; Bousquet, Jean

    2016-03-01

    Picea mariana is a widely distributed boreal conifer across Canada and the subject of advanced breeding programmes for which population genomics and genomic selection approaches are being developed. Targeted sequencing was achieved after capturing P. mariana exome with probes designed from the sequenced transcriptome of Picea glauca, a distant relative. A high capture efficiency of 75.9% was reached although spruce has a complex and large genome including gene sequences interspersed by some long introns. The results confirmed the relevance of using probes from congeneric species to successfully perform interspecific exome capture in the genus Picea. A bioinformatics pipeline was developed including stringent criteria that helped detect a set of 97,075 highly reliable in silico SNPs. These SNPs were distributed across 14,909 genes. Part of an Infinium iSelect array was used to estimate the rate of true positives by validating 4267 of the predicted in silico SNPs by genotyping trees from P. mariana populations. The true positive rate was 96.2% for in silico SNPs, compared to a genotyping success rate of 96.7% for a set of 1115 P. mariana control SNPs recycled from previous genotyping arrays. These results indicate the high success rate of the genotyping array and the relevance of the selection criteria used to delineate the new P. mariana in silico SNP resource. Furthermore, in silico SNPs were generally of medium to high frequency in natural populations, thus providing high informative value for future population genomics applications. © 2015 John Wiley & Sons Ltd.

  18. Conclusive evidence for hexasomic inheritance in chrysanthemum based on analysis of a 183 k SNP array.

    Science.gov (United States)

    van Geest, Geert; Voorrips, Roeland E; Esselink, Danny; Post, Aike; Visser, Richard Gf; Arens, Paul

    2017-08-07

    Cultivated chrysanthemum is an outcrossing hexaploid (2n = 6× = 54) with a disputed mode of inheritance. In this paper, we present a single nucleotide polymorphism (SNP) selection pipeline that was used to design an Affymetrix Axiom array with 183 k SNPs from RNA sequencing data (1). With this array, we genotyped four bi-parental populations (with sizes of 405, 53, 76 and 37 offspring plants respectively), and a cultivar panel of 63 genotypes. Further, we present a method for dosage scoring in hexaploids from signal intensities of the array based on mixture models (2) and validation of selection steps in the SNP selection pipeline (3). The resulting genotypic data is used to draw conclusions on the mode of inheritance in chrysanthemum (4), and to make an inference on allelic expression bias (5). With use of the mixture model approach, we successfully called the dosage of 73,936 out of 183,130 SNPs (40.4%) that segregated in any of the bi-parental populations. To investigate the mode of inheritance, we analysed markers that segregated in the large bi-parental population (n = 405). Analysis of segregation of duplex x nulliplex SNPs resulted in evidence for genome-wide hexasomic inheritance. This evidence was substantiated by the absence of strong linkage between markers in repulsion, which indicated absence of full disomic inheritance. We present the success rate of SNP discovery out of RNA sequencing data as affected by different selection steps, among which SNP coverage over genotypes and use of different types of sequence read mapping software. Genomic dosage highly correlated with relative allele coverage from the RNA sequencing data, indicating that most alleles are expressed according to their genomic dosage. The large population, genotyped with a very large number of markers, is a unique framework for extensive genetic analyses in hexaploid chrysanthemum. As starting point, we show conclusive evidence for genome-wide hexasomic inheritance.
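
    The duplex x nulliplex argument in the abstract above can be illustrated with the textbook segregation expectation. The following is a minimal sketch, not the authors' analysis: it assumes random chromosome segregation with no double reduction, and the function name `gamete_dosage_distribution` is hypothetical. Because the nulliplex parent contributes no copies of the allele, the offspring dosage equals the dosage of the gamete from the duplex parent.

```python
from fractions import Fraction
from math import comb


def gamete_dosage_distribution(parent_dosage: int, ploidy: int = 6) -> dict:
    """Expected allele dosage in gametes under polysomic inheritance.

    Assumes (hypothetically) that a gamete receives ploidy/2 chromosomes
    drawn at random without replacement from the parent's homologues,
    with no double reduction. This is a hypergeometric distribution.
    """
    gamete_size = ploidy // 2
    total = comb(ploidy, gamete_size)
    max_dosage = min(parent_dosage, gamete_size)
    return {
        k: Fraction(
            comb(parent_dosage, k) * comb(ploidy - parent_dosage, gamete_size - k),
            total,
        )
        for k in range(max_dosage + 1)
    }


# Duplex parent (2 copies out of 6) crossed with a nulliplex parent.
duplex = gamete_dosage_distribution(2)
for dosage, probability in duplex.items():
    print(dosage, probability)
```

    Under these assumptions a duplex x nulliplex cross segregates 1:3:1 for offspring dosage 0, 1 and 2. Under strict disomic inheritance the same cross would instead give either only simplex offspring (both copies on one homologous pair) or a 1:2:1 ratio (copies on different pairs), which is why this marker class is informative about the mode of inheritance.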

  19. SNP detection for massively parallel whole-genome resequencing

    DEFF Research Database (Denmark)

    Li, Ruiqiang; Li, Yingrui; Fang, Xiaodong

    2009-01-01

    -genome or target region resequencing. Here, we have developed a consensus-calling and SNP-detection method for sequencing-by-synthesis Illumina Genome Analyzer technology. We designed this method by carefully considering the data quality, alignment, and experimental errors common to this technology. All...... of this information was integrated into a single quality score for each base under Bayesian theory to measure the accuracy of consensus calling. We tested this methodology using a large-scale human resequencing data set of 36x coverage and assembled a high-quality nonrepetitive consensus sequence for 92.......25% of the diploid autosomes and 88.07% of the haploid X chromosome. Comparison of the consensus sequence with Illumina human 1M BeadChip genotyped alleles from the same DNA sample showed that 98.6% of the 37,933 genotyped alleles on the X chromosome and 98% of 999,981 genotyped alleles on autosomes were covered...

  20. TPH2 -703G/T SNP may have important effect on susceptibility to suicidal behavior in major depression.

    Science.gov (United States)

    Yoon, Ho-Kyoung; Kim, Yong-Ku

    2009-04-30

    Serotonergic system-related genes can be good candidate genes for both major depressive disorder (MDD) and suicidal behavior. In this study, we aimed to investigate the association of serotonin 2A receptor gene -1438A/G SNP (HTR2A -1438A/G), tryptophan hydroxylase 2 gene -703G/T SNP (TPH2 -703G/T) and serotonin 1A receptor C-1019G (HTR1A C-1019G) with suicidal behavior. One hundred and eighty one suicidal depressed patients and 143 non-suicidal depressed patients who met DSM-IV criteria for major depressive disorder were recruited from patients who were admitted to Korea University Ansan Hospital. One hundred seventy six normal controls were healthy volunteers who were recruited by local advertisement. Patients and normal controls were genotyped for HTR2A -1438A/G, TPH2 -703G/T and 5-HT1A C-1019G. The suicidal depressed patients were evaluated by the lethality of individual suicide attempts using Weisman and Worden's risk-rescue rating (RRR) and the Lethality Suicide Attempt Rating Scale-updated (LSARS-II). In order to assess the severity of depressive symptoms of patients, Hamilton's Depression Rating Scale (HDRS) was administered. Genotype and allele frequencies were compared between groups by chi(2) statistics. Association of genotype of the candidate genes with the lethality of suicidal behavior was examined with ANOVA by comparing the mean scores of LSARS and RRR according to the genotype. There were statistically significant differences in the genotype distributions and allele frequencies of TPH2 -703G/T between the suicidal depressive group and the normal control group. The homozygous allele G (G/G genotype) frequency was significantly higher in suicidal depressed patients than in controls. However, no differences in either genotype distribution or in allele frequencies of HTR2A -1438A/G and HTR1A C-1019G were observed between the suicidal depressed patients, the non-suicidal depressed patients, and the normal controls. There were no differences in the

  1. The Development of Quality Control Genotyping Approaches: A Case Study Using Elite Maize Lines.

    Directory of Open Access Journals (Sweden)

    Jiafa Chen

    Full Text Available Quality control (QC) of germplasm identity and purity is a critical component of breeding and conservation activities. SNP genotyping technologies and increased availability of markers provide the opportunity to employ genotyping as a low-cost and robust component of this QC. In the public sector, available low-cost SNP QC genotyping methods have been developed from a very limited panel of 1,000 to 1,500 markers without broad selection of the most informative SNPs. Selection of optimal SNPs and definition of appropriate germplasm sampling in addition to platform section impact on logistical and resource-use considerations for breeding and conservation applications when mainstreaming QC. In order to address these issues, we evaluated the selection and use of SNPs for QC applications from large DArTSeq data sets generated from CIMMYT maize inbred lines (CMLs). Two QC genotyping strategies were developed: the first is a "rapid QC", employing a small number of SNPs to identify potential mislabeling of seed packages or plots; the second is a "broad QC", employing a larger number of SNPs, used to identify each germplasm entry and to measure heterogeneity. The optimal marker selection strategies combined the selection of markers with high minor allele frequency, sampling of clustered SNPs in proportion to marker cluster distance and selecting markers that maintain a uniform genomic distribution. The rapid and broad QC SNP panels selected using this approach were further validated using blind test assessments of related re-generation samples. The influence of sampling within each line was evaluated. Sampling 192 individuals would result in close to a 100% probability of detecting a 5% contamination in the entry, and approximately a 98% probability of detecting a 2% contamination of the line. These results provide a framework for the establishment of QC genotyping. A comparison of financial and time costs for use of these approaches across different
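
    The sampling figures quoted in the abstract above are consistent with a simple binomial argument. The sketch below is an illustration only, assuming each sampled individual is independently a contaminant with probability equal to the contamination rate; the abstract does not state which model the authors used, and the function name `detection_probability` is hypothetical.

```python
def detection_probability(contamination_rate: float, sample_size: int) -> float:
    """Probability that a random sample contains at least one contaminant.

    Assumes independent draws, so the chance of missing every contaminant
    is (1 - contamination_rate) ** sample_size.
    """
    return 1.0 - (1.0 - contamination_rate) ** sample_size


# Sample size taken from the abstract; rates are the two quoted scenarios.
p_five_percent = detection_probability(0.05, 192)
p_two_percent = detection_probability(0.02, 192)
print(f"5% contamination: {p_five_percent:.4f}")
print(f"2% contamination: {p_two_percent:.4f}")
```

    Under this assumption the detection probability for 5% contamination is above 99.99%, and for 2% contamination it is close to 98%, in line with the values reported.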

  2. Analysis of Single Nucleotide Polymorphism (SNP) rs22114085 Associated with Canine Atopic Dermatitis by PCR-RFLP Method

    Directory of Open Access Journals (Sweden)

    Martina Miluchová

    2012-05-01

    Full Text Available Canine atopic dermatitis (cAD) is a common inflammatory skin disease that is considered to be a naturally occurring, spontaneous model of human atopic dermatitis (eczema). The aim of the paper was to identify the SNP rs22114085 in different dog breeds. The material involved 52 dogs from 5 different breeds. Canine genomic DNA was isolated from saliva by a modified method using DNAzol® and linear polyacrylamide (LPA) carrier, and from blood using the commercial kit NucleospinBlood, and was used to determine rs22114085 SNP genotypes by the PCR-RFLP method. The PCR products were digested with DdeI restriction enzyme. The C allele was distributed in Czech Pointer, Chihuahua and German Wirehaired Pointer with an allele frequency ranging from 0.4545 to 1.00. In the population of Czech Pointer we detected all genotypes CC, CT and TT, with frequencies of 0.25, 0.5833 and 0.1667 in males and 0.2728, 0.3636 and 0.3636 in females, respectively. In German Wirehaired Pointer, the homozygous genotype CC was detected in males and the heterozygous genotype CT in females, each with a frequency of 1. In Chihuahua, the homozygous genotype CC and the heterozygous genotype CT were observed with frequencies of 0.3333 and 0.6667, respectively. In Golden Retriever and Pincher we detected genotype TT with a frequency of 1.
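
    The allele frequencies in the abstract above follow from the genotype frequencies by allele counting. The sketch below is illustrative; the function name `allele_frequencies` is hypothetical, and the inputs are the rounded genotype frequencies quoted in the abstract, so results can differ from the reported values in the last decimal place.

```python
def allele_frequencies(f_cc: float, f_ct: float, f_tt: float) -> tuple:
    """Return (freq_C, freq_T) from genotype frequencies.

    Each CC individual carries two C alleles and each CT individual one,
    so freq_C = f_CC + f_CT / 2 after normalising the genotype frequencies.
    """
    total = f_cc + f_ct + f_tt
    freq_c = (f_cc + 0.5 * f_ct) / total
    return freq_c, 1.0 - freq_c


# Genotype frequencies (CC, CT, TT) as quoted in the abstract.
czech_pointer_male = allele_frequencies(0.25, 0.5833, 0.1667)
czech_pointer_female = allele_frequencies(0.2728, 0.3636, 0.3636)
chihuahua = allele_frequencies(0.3333, 0.6667, 0.0)

for name, (freq_c, freq_t) in [
    ("Czech Pointer, male", czech_pointer_male),
    ("Czech Pointer, female", czech_pointer_female),
    ("Chihuahua", chihuahua),
]:
    print(f"{name}: C = {freq_c:.4f}, T = {freq_t:.4f}")
```

    The Czech Pointer females give a C allele frequency of about 0.4546, which matches the lower bound of the reported range (0.4545) up to rounding of the input frequencies.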

  3. Identification of SNP barcode biomarkers for genes associated with facial emotion perception using particle swarm optimization algorithm.

    Science.gov (United States)

    Chuang, Li-Yeh; Lane, Hsien-Yuan; Lin, Yu-Da; Lin, Ming-Teng; Yang, Cheng-Hong; Chang, Hsueh-Wei

    2014-01-01

    Facial emotion perception (FEP) can affect social function. We previously reported that parts of five tested single-nucleotide polymorphisms (SNPs) in the MET and AKT1 genes may individually affect FEP performance. However, the effects of SNP-SNP interactions on FEP performance remain unclear. This study compared patients with high and low FEP performances (n = 89 and 93, respectively). A particle swarm optimization (PSO) algorithm was used to identify the best SNP barcodes (i.e., the SNP combinations and genotypes that revealed the largest differences between the high and low FEP groups). The analyses of individual SNPs showed no significant differences between the high and low FEP groups. However, comparisons of multiple SNP-SNP interactions involving different combinations of two to five SNPs showed that the best PSO-generated SNP barcodes were significantly associated with high FEP score. The analyses of the joint effects of the best SNP barcodes for two to five interacting SNPs also showed that the best SNP barcodes had significantly higher odds ratios (2.119 to 3.138; P < 0.05) compared to other SNP barcodes. In conclusion, the proposed PSO algorithm effectively identifies the best SNP barcodes that have the strongest associations with FEP performance. This study also proposes a computational methodology for analyzing complex SNP-SNP interactions in social cognition domains such as recognition of facial emotion.

  4. Comparison of SSR and SNP markers in estimation of genetic diversity and population structure of Indian rice varieties.

    Science.gov (United States)

    Singh, Nivedita; Choudhury, Debjani Roy; Singh, Amit Kumar; Kumar, Sundeep; Srinivasan, Kalyani; Tyagi, R K; Singh, N K; Singh, Rakesh

    2013-01-01

    Simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers, the two most robust marker types for identifying rice varieties, were compared for assessment of genetic diversity and population structure. A total of 375 varieties of rice from various regions of India archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty-six genetic markers each of hypervariable SSR (HvSSR) and SNP, which were distributed across the 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers, with an average of 2.22 alleles per locus, whereas 72 alleles were amplified with SNP markers. Polymorphic information content (PIC) values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; utilizing an unrooted tree, all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA) indicated that maximum diversity was partitioned between and within individual level but not between populations. Principal coordinate analysis (PCoA) with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation, whereas in the case of SNP markers varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure was tested using K values from 1 to 20, but there was no clear population structure; therefore Ln(PD)-derived Δk was plotted against K to determine the number of populations. In the case of SSR, maximum Δk was at K=5, whereas in the case of SNP maximum Δk was found at K=15, suggesting that resolution of population was higher with SNP markers, but SSR were more efficient for diversity analysis.
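
    The PIC ranges in the abstract above can be read against the standard definition of polymorphic information content. The sketch below assumes the commonly used formula PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2; the abstract does not say which software or formula the authors used, and the function name `pic` is hypothetical.

```python
from itertools import combinations


def pic(allele_freqs: list) -> float:
    """Polymorphic information content for one locus.

    Assumes the standard formula:
    PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    where p_i are the allele frequencies at the locus (summing to 1).
    """
    homozygosity = sum(p * p for p in allele_freqs)
    correction = sum(
        2.0 * (p * p) * (q * q) for p, q in combinations(allele_freqs, 2)
    )
    return 1.0 - homozygosity - correction


# A biallelic marker is most informative when both alleles are equally common.
balanced_snp = pic([0.5, 0.5])
# A rare minor allele gives a much lower value.
skewed_snp = pic([0.95, 0.05])
print(f"balanced: {balanced_snp:.3f}, skewed: {skewed_snp:.3f}")
```

    With this formula a biallelic marker cannot exceed 0.375, which is consistent with the maximum of 0.37 reported for the SNP markers, while markers with more than two alleles, such as SSRs, can reach higher values.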

  5. EvoSNP-DB: A database of genetic diversity in East Asian populations.

    Science.gov (United States)

    Kim, Young Uk; Kim, Young Jin; Lee, Jong-Young; Park, Kiejung

    2013-08-01

    Genome-wide association studies (GWAS) have become popular as an approach for the identification of large numbers of phenotype-associated variants. However, differences in genetic architecture and environmental factors mean that the effect of variants can vary across populations. Understanding population genetic diversity is valuable for the investigation of possible population specific and independent effects of variants. EvoSNP-DB aims to provide information regarding genetic diversity among East Asian populations, including Chinese, Japanese, and Korean. Non-redundant SNPs (1.6 million) were genotyped in 54 Korean trios (162 samples) and were compared with 4 million SNPs from HapMap phase II populations. EvoSNP-DB provides two user interfaces for data query and visualization, and integrates scores of genetic diversity (Fst and VarLD) at the level of SNPs, genes, and chromosome regions. EvoSNP-DB is a web-based application that allows users to navigate and visualize measurements of population genetic differences in an interactive manner, and is available online at [http://biomi.cdc.go.kr/EvoSNP/].

  6. Effect of MDM2 SNP309 and p53 codon 72 polymorphisms on lung cancer risk and survival among non-smoking Chinese women in Singapore

    Directory of Open Access Journals (Sweden)

    Sabapathy Kanaga

    2010-03-01

    Full Text Available Background: Single nucleotide polymorphism (SNP) 309, resulting in a T or G allele in the promoter of MDM2, the negative regulator of p53, has been suggested to affect cancer predisposition and age of onset, primarily in females. However, findings have been inconsistent in various cancers, and ethnicity appears to be a critical factor influencing the effects of the SNP on cancer risk. An increasing trend has been observed in the prevalence of lung cancers in non-smokers, especially females, though the underlying genetic basis is unclear. Methods: We therefore examined the role of the SNPs in the p53 pathway (p53 codon 72 and MDM2 SNP309) on lung cancer risk and prognosis in a life-time non-smoking female Chinese population, in a hospital-based case-control study of 123 cases and 159 age-matched controls, by PCR analysis. Results: Our findings reveal that the risk of lung cancer among individuals with the MDM2 SNP309 TT genotype was 2.1 (95% CI 1.01-4.36) relative to the GG genotype, contrary to initial expectations that the GG genotype with elevated MDM2 levels would increase cancer risk. Those who had this genotype in combination with the p53 Pro allele had a risk of 2.5 (95% CI 1.2-5.0). There was, however, no effect of either polymorphism on age at diagnosis of lung cancer or on overall survival. Conclusions: The results thus demonstrate that the MDM2 SNP309 TT rather than the GG genotype is associated with increased risk of lung cancer in this population, suggesting that other mechanisms independent of increased MDM2 levels can influence cancer susceptibility.

  7. Application of LogitBoost Classifier for Traceability Using SNP Chip Data.

    Science.gov (United States)

    Kim, Kwondo; Seo, Minseok; Kang, Hyunsung; Cho, Seoae; Kim, Heebal; Seo, Kang-Seok

    2015-01-01

    Consumer attention to food safety has increased rapidly due to animal-related diseases; therefore, it is important to identify their places of origin (POO) for safety purposes. However, only a few studies have addressed this issue and focused on machine learning-based approaches. In the present study, classification analyses were performed using a customized SNP chip for POO prediction. To accomplish this, 4,122 pigs originating from 104 farms were genotyped using the SNP chip. Several factors were considered to establish the best prediction model based on these data. We also assessed the applicability of the suggested model using a kinship coefficient-filtering approach. Our results showed that the LogitBoost-based prediction model outperformed other classifiers in terms of classification performance under most conditions. Specifically, a greater level of accuracy was observed when a higher kinship-based cutoff was employed. These results demonstrated the applicability of a machine learning-based approach using SNP chip data for practical traceability.

  8. SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate

    Science.gov (United States)

    Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon

    2016-01-01

    Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.

  9. Functional characterization of the Thr946Ala SNP at the type 1 diabetes IFIH1 locus.

    Science.gov (United States)

    Zouk, Hana; Marchand, Luc; Li, Quan; Polychronakos, Constantin

    2014-02-01

    The Thr allele at the Thr946Ala non-synonymous single-nucleotide polymorphism (nsSNP) in the IFIH1 gene confers risk for type 1 diabetes (T1D). IFIH1 binds viral double-stranded RNA (dsRNA), inducing a type I interferon (IFN) response. Reports of this nsSNP's role in IFIH1 expression regulation have produced conflicting results and a study evaluating transfected Thr946Ala protein alleles in an artificial system overexpressing IFIH1 shows that the SNP does not affect IFIH1 function. In this study, we examine the effects of the Thr946Ala polymorphism on IFN-α response in a cell line that endogenously expresses physiological levels of IFIH1. Eleven lymphoblastoid cell lines (LCLs) homozygous for the major predisposing allele (Thr/Thr) and 6 LCLs homozygous for the minor protective allele (Ala/Ala) were electroporated with the viral dsRNA mimic, poly I:C, in three independent experiments. Media were collected 24 hours later and measured for IFN-α production by ELISA. Basal IFN response is minimal in mock-transfected cells from both genotypes and increases by about 8-fold in cells treated with poly I:C. LCLs with the Ala/Ala genotype have slightly higher IFN-α levels than their Thr/Thr counterparts but this did not reach statistical significance because of the large variability of the IFN response, due mostly to two high outliers (biological, not technical). A larger sample size would be needed to determine whether the Thr946Ala SNP affects the poly I:C-driven IFN-α response. Additionally, the possibility that this nsSNP recognizes viral dsRNA specificities cannot be ruled out. Thus, the mechanism of the observed association of this SNP with T1D remains to be determined.

  10. Identification, Characterization, and Mapping of a Novel SNP Associated with Body Color Transparency in Juvenile Red Sea Bream (Pagrus major).

    Science.gov (United States)

    Sawayama, Eitaro; Noguchi, Daiki; Nakayama, Kei; Takagi, Motohiro

    2018-03-23

    We previously reported a body color deformity in juvenile red sea bream, which shows transparency in the juvenile stage because of delayed chromatophore development compared with normal individuals, and this finding suggested a genetic cause based on parentage assessments. To conduct marker-assisted selection to eliminate broodstock inheriting the causative gene, developing DNA markers associated with the phenotype was needed. We first conducted SNP mining based on AFLP analysis using bulked-DNA from normal and transparent individuals. One SNP was identified from a transparent-specific AFLP fragment, which was significantly associated with transparent individuals. Two alleles (A/G) were observed in this locus, and the genotype G/G was dominantly observed in the transparent groups (97.1%) collected from several production lots produced from different broodstock populations. A few normal individuals inherited the G/G genotype (5.0%), but the A/A and A/G genotypes were dominantly observed in the normal groups. The homologous region of the SNP was searched using a medaka genome database, and intron 12 of the Nell2a gene (located on chromosome 6 of the medaka genome) was highly matched. We also mapped the red sea bream Nell2a gene on the previously developed linkage maps, and this gene was mapped on a male linkage group, LG4-M. The newly found SNP was useful in eliminating broodstock possessing the causative gene of the body color transparency observed in the juvenile stage of red sea bream.

  11. Development and evaluation of the first high-throughput SNP array for common carp (Cyprinus carpio).

    Science.gov (United States)

    Xu, Jian; Zhao, Zixia; Zhang, Xiaofeng; Zheng, Xianhu; Li, Jiongtang; Jiang, Yanliang; Kuang, Youyi; Zhang, Yan; Feng, Jianxin; Li, Chuangju; Yu, Juhua; Li, Qiang; Zhu, Yuanyuan; Liu, Yuanyuan; Xu, Peng; Sun, Xiaowen

    2014-04-24

    A large number of single nucleotide polymorphisms (SNPs) have been identified in common carp (Cyprinus carpio) but, as yet, no high-throughput genotyping platform is available for this species. C. carpio is an important aquaculture species that accounts for nearly 14% of freshwater aquaculture production worldwide. We have developed an array for C. carpio with 250,000 SNPs and evaluated its performance using samples from various strains of C. carpio. The SNPs used on the array were selected from two resources: the transcribed sequences from RNA-seq data of four strains of C. carpio, and the genome re-sequencing data of five strains of C. carpio. The 250,000 SNPs on the resulting array are distributed evenly across the reference C. carpio genome with an average spacing of 6.6 kb. To evaluate the SNP array, 1,072 C. carpio samples were collected and tested. Of the 250,000 SNPs on the array, 185,150 (74.06%) were found to be polymorphic sites. Genotyping accuracy was checked using genotyping data from a group of full-siblings and their parents, and over 99.8% of the qualified SNPs were found to be reliable. Analysis of the linkage disequilibrium on all samples and on three domestic C. carpio strains revealed that the latter had the longer haplotype blocks. We also evaluated our SNP array on 80 samples from eight species related to C. carpio, and found from 53,526 to 71,984 polymorphic SNPs. An identity by state analysis divided all the samples into three clusters; most of the C. carpio strains formed the largest cluster. The Carp SNP array described here is the first high-throughput genotyping platform for C. carpio. Our evaluation of this array indicates that it will be valuable for farmed carp and for genetic and population biology studies in C. carpio and related species.

  12. Integrating milk metabolite profile information for the prediction of traditional milk traits based on SNP information for Holstein cows.

    Directory of Open Access Journals (Sweden)

    Nina Melzer

    Full Text Available In this study the benefit of metabolome level analysis for the prediction of genetic value of three traditional milk traits was investigated. Our proposed approach consists of three steps: First, milk metabolite profiles are used to predict three traditional milk traits of 1,305 Holstein cows. Two regression methods, both enabling variable selection, are applied to identify important milk metabolites in this step. Second, the prediction of these important milk metabolites from single nucleotide polymorphisms (SNPs) enables the detection of SNPs with significant genetic effects. Finally, these SNPs are used to predict milk traits. The observed precision of predicted genetic values was compared to the results observed for the classical genotype-phenotype prediction using all SNPs or a reduced SNP subset (reduced classical approach). To enable a comparison between SNP subsets, a special invariable evaluation design was implemented. SNPs close to or within known quantitative trait loci (QTL) were determined. This enabled us to determine if detected important SNP subsets were enriched in these regions. The results show that our approach can lead to genetic value prediction, but requires less than 1% of the total amount of (40,317) SNPs. Significantly more important SNPs in known QTL regions were detected using our approach compared to the reduced classical approach. In conclusion, our approach allows a deeper insight into the associations between the different levels of the genotype-phenotype map (genotype-metabolome, metabolome-phenotype, genotype-phenotype).

  13. Pacifiplex: an ancestry-informative SNP panel centred on Australia and the Pacific region.

    Science.gov (United States)

    Santos, Carla; Phillips, Christopher; Fondevila, Manuel; Daniel, Runa; van Oorschot, Roland A H; Burchard, Esteban G; Schanfield, Moses S; Souto, Luis; Uacyisrael, Jolame; Via, Marc; Carracedo, Ángel; Lareu, Maria V

    2016-01-01

    The analysis of human population variation is an area of considerable interest in the forensic, medical genetics and anthropological fields. Several forensic single nucleotide polymorphism (SNP) assays provide ancestry-informative genotypes in sensitive tests designed to work with limited DNA samples, including a 34-SNP multiplex differentiating African, European and East Asian ancestries. Although assays capable of differentiating Oceanian ancestry at a global scale have become available, this study describes markers compiled specifically for differentiation of Oceanian populations. A sensitive multiplex assay, termed Pacifiplex, was developed and optimized in a small-scale test applicable to forensic analyses. The Pacifiplex assay comprises 29 ancestry-informative marker SNPs (AIM-SNPs) selected to complement the 34-plex test, that in a combined set distinguish Africans, Europeans, East Asians and Oceanians. Nine Pacific region study populations were genotyped with both SNP assays, then compared to four reference population groups from the HGDP-CEPH human diversity panel. STRUCTURE analyses estimated population cluster membership proportions that aligned with the patterns of variation suggested for each study population's currently inferred demographic histories. Aboriginal Taiwanese and Philippine samples indicated high East Asian ancestry components, Papua New Guinean and Aboriginal Australians samples were predominantly Oceanian, while other populations displayed cluster patterns explained by the distribution of divergence amongst Melanesians, Polynesians and Micronesians. Genotype data from Pacifiplex and 34-plex tests is particularly well suited to analysis of Australian Aboriginal populations and when combined with Y and mitochondrial DNA variation will provide a powerful set of markers for ancestry inference applied to modern Australian demographic profiles. 
On a broader geographic scale, Pacifiplex adds highly informative data for inferring the ancestry

  14. New Insights on Water Buffalo Genomic Diversity and Post-Domestication Migration Routes From Medium Density SNP Chip Data

    Directory of Open Access Journals (Sweden)

    Licia Colli

    2018-03-01

    Full Text Available The domestic water buffalo is native to the Asian continent but through historical migrations and recent importations, nowadays has a worldwide distribution. The two types of water buffalo, i.e., river and swamp, display distinct morphological and behavioral traits, different karyotypes and also have different purposes and geographical distributions. River buffaloes from Pakistan, Iran, Turkey, Egypt, Romania, Bulgaria, Italy, Mozambique, Brazil and Colombia, and swamp buffaloes from China, Thailand, Philippines, Indonesia and Brazil were genotyped with a species-specific medium-density 90K SNP panel. We estimated the levels of molecular diversity and described population structure, which revealed historical relationships between populations and migration events. Three distinct gene pools were identified in pure river as well as in pure swamp buffalo populations. Genomic admixture was seen in the Philippines and in Brazil, resulting from importations of animals for breed improvement. Our results were largely consistent with previous archeological, historical and molecular-based evidence for two independent domestication events for river- and swamp-type buffaloes, which occurred in the Indo-Pakistani region and close to the China/Indochina border, respectively. Based on a geographical analysis of the distribution of diversity, our evidence also indicated that the spread of the water buffalo out of the domestication centers followed two major divergent migration directions: river buffaloes migrated west from the Indian sub-continent while swamp buffaloes migrated from northern Indochina via an east-south-eastern route. These data suggest that the current distribution of water buffalo diversity has been shaped by the combined effects of multiple migration events that occurred at different stages of the post-domestication history of the species.

  15. Marker-assisted introgression of drought tolerance from wild ancestors into popular Indian rice varieties using a 7K Infinium SNP array

    Directory of Open Access Journals (Sweden)

    Ravindra Donde

    2017-10-01

    Full Text Available Recent advances in the area of genomics have led to the development of high throughput genotyping platforms that have immensely contributed to molecular breeding programs. Custom-designed single nucleotide polymorphism (SNP) arrays provide an efficient, cost effective, high throughput genotyping tool for QTL/gene mapping, variety identification, marker-assisted selection, etc. In the current study, two interspecific libraries of Chromosome Segment Substitution Lines (CSSLs) were evaluated under both drought and control conditions to identify lines with superior yield under drought. The CSSL libraries consisted of 48 BC4F3 lines derived from O. sativa cv. Curinga (tropical japonica) x O. rufipogon, and 32 BC4F3 lines derived from O. sativa cv. Curinga (tropical japonica) x O. meridionalis. The phenotypic screening of these 80 CSSLs led to the identification of three lines, MER-20, RUF-16, and RUF-44, that yielded well under drought stress. This line was backcrossed with a popular rice variety of India, Swarna-Sub1, to introgress wild chromosome segments responsible for reproductive stage drought tolerance. During backcrossing, tracking of wild introgressions and monitoring of recurrent parent genome recovery was facilitated by the use of the Cornell 6K and 7K Infinium rice SNP arrays. The 6K and 7K SNP arrays assayed 5275 SNPs and 7099 SNPs, respectively, distributed across the 12 chromosomes. In our populations of (MER-20 x Swarna-Sub1) BC2F1 lines, 1775 SNPs were polymorphic using the 6K array. The percentage of recurrent parent genome in these backcrossed lines ranged from 33-92% and the percentage of wild donor genome ranged from 8-67%. Using genotypic selection, 5% of plants were identified for further marker assisted backcrossing, based on the presence of the target donor (wild) segment and maximum recovery of recurrent parent background.
In the next generation, BC3F1 lines were genotyped using the 7K SNP array, which identified 2521 polymorphic SNPs

  16. SNP Arrays

    Directory of Open Access Journals (Sweden)

    Jari Louhelainen

    2016-10-01

    Full Text Available The papers published in this Special Issue “SNP arrays” (Single Nucleotide Polymorphism Arrays focus on several perspectives associated with arrays of this type. The range of papers vary from a case report to reviews, thereby targeting wider audiences working in this field. The research focus of SNP arrays is often human cancers but this Issue expands that focus to include areas such as rare conditions, animal breeding and bioinformatics tools. Given the limited scope, the spectrum of papers is nothing short of remarkable and even from a technical point of view these papers will contribute to the field at a general level. Three of the papers published in this Special Issue focus on the use of various SNP array approaches in the analysis of three different cancer types. Two of the papers concentrate on two very different rare conditions, applying the SNP arrays slightly differently. Finally, two other papers evaluate the use of the SNP arrays in the context of genetic analysis of livestock. The findings reported in these papers help to close gaps in the current literature and also to give guidelines for future applications of SNP arrays.

  17. Genome-wide SNP association-based localization of a dwarfism gene in Friesian dwarf horses

    NARCIS (Netherlands)

    Orr, J.L.; Back, W.; Gu, J.; Leegwater, P.H.; Govindarajan, P.; Conroy, J.; Ducro, B.J.; Arendonk, van J.A.M.

    2010-01-01

    The recent completion of the horse genome and commercial availability of an equine SNP genotyping array has facilitated the mapping of disease genes. We report putative localization of the gene responsible for dwarfism, a trait in Friesian horses that is thought to have a recessive mode of

  18. Imputation of genotypes from low density (50,000 markers) to high density (700,000 markers) of cows from research herds in Europe, North America, and Australasia using 2 reference populations

    DEFF Research Database (Denmark)

    Pryce, J E; Johnston, J; Hayes, B J

    2014-01-01

    detection in genome-wide association studies and the accuracy of genomic selection may increase when the low-density genotypes are imputed to higher density. Genotype data were available from 10 research herds: 5 from Europe [Denmark, Germany, Ireland, the Netherlands, and the United Kingdom (UK)], 2 from...... reference populations. Although it was not possible to use a combined reference population, which would probably result in the highest accuracies of imputation, differences arising from using 2 high-density reference populations on imputing 50,000-marker genotypes of 583 animals (from the UK) were...... information exploited. The UK animals were also included in the North American data set (n = 1,579) that was imputed to high density using a reference population of 2,018 bulls. After editing, 591,213 genotypes on 5,999 animals from 10 research herds remained. The correlation between imputed allele...

  19. Genome-wide SNP association-based localization of a dwarfism gene in Friesian dwarf horses.

    Science.gov (United States)

    Orr, N; Back, W; Gu, J; Leegwater, P; Govindarajan, P; Conroy, J; Ducro, B; Van Arendonk, J A M; MacHugh, D E; Ennis, S; Hill, E W; Brama, P A J

    2010-12-01

    The recent completion of the horse genome and commercial availability of an equine SNP genotyping array has facilitated the mapping of disease genes. We report putative localization of the gene responsible for dwarfism, a trait in Friesian horses that is thought to have a recessive mode of inheritance, to a 2-MB region of chromosome 14 using just 10 affected animals and 10 controls. We successfully genotyped 34,429 SNPs that were tested for association with dwarfism using chi-square tests. The most significant SNP in our study, BIEC2-239376 (P(2df) = 4.54 × 10⁻⁵, P(rec) = 7.74 × 10⁻⁶), is located close to a gene implicated in human dwarfism. Fine-mapping and resequencing analyses did not aid in further localization of the causative variant, and replication of our findings in independent sample sets will be necessary to confirm these results. © 2010 The Authors, Journal compilation © 2010 Stichting International Foundation for Animal Genetics.
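
The abstract above reports a 2-degree-of-freedom chi-square P value for each SNP. A minimal sketch of such a genotype test follows; it is not the authors' pipeline, the function name is hypothetical, and the genotype counts are invented purely for illustration (the abstract gives none). It relies on the fact that the chi-square survival function with exactly 2 degrees of freedom is exp(-x/2), so only the standard library is needed.

```python
import math

def genotype_chi_square(cases, controls):
    """Pearson chi-square on a 2 x 3 genotype table (AA, AB, BB counts).

    Returns (statistic, p_value). The p-value formula is valid only for
    2 degrees of freedom, i.e. when all three genotype columns are non-empty.
    """
    table = [cases, controls]
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    # Survival function of the chi-square distribution with 2 df.
    p_value = math.exp(-stat / 2)
    return stat, p_value

# Hypothetical counts for 10 affected animals and 10 controls (not from the study).
stat, p = genotype_chi_square(cases=[0, 1, 9], controls=[6, 4, 0])
print(f"chi2 = {stat:.2f}, P(2df) = {p:.2e}")
```

With samples this small, the chi-square approximation is rough, and an exact test may be preferable in practice.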

  20. Genome-Wide Association Study for Identification and Validation of Novel SNP Markers for Sr6 Stem Rust Resistance Gene in Bread Wheat.

    Science.gov (United States)

    Mourad, Amira M I; Sallam, Ahmed; Belamkar, Vikas; Wegulo, Stephen; Bowden, Robert; Jin, Yue; Mahdy, Ezzat; Bakheit, Bahy; El-Wafaa, Atif A; Poland, Jesse; Baenziger, Peter S

    2018-01-01

    Stem rust (caused by Puccinia graminis f. sp. tritici Erikss. & E. Henn.) is a major disease in wheat (Triticum aestivum L.). However, in recent years it has occurred rarely in Nebraska due to weather and the effective selection and gene pyramiding of resistance genes. To understand the genetic basis of stem rust resistance in Nebraska winter wheat, we applied a genome-wide association study (GWAS) to a set of 270 winter wheat genotypes (A-set). Genotyping was carried out using genotyping-by-sequencing and ∼35,000 high-quality SNPs were identified. The tested genotypes were evaluated for their resistance to the common stem rust race in Nebraska (QFCSC) in two replications. Marker-trait association identified 32 SNP markers, which were significantly (Bonferroni corrected P < 0.05) associated with the resistance on chromosome 2D. The chromosomal location of the significant SNPs (chromosome 2D) matched the location of the Sr6 gene, which was expected in these genotypes based on pedigree information. A highly significant linkage disequilibrium (LD, r²) was found between the significant SNPs and the specific SSR marker for the Sr6 gene (Xcfd43). This suggests that the significant SNP markers are tagging the Sr6 gene. Out of the 32 significant SNPs, eight SNPs were in six genes that are annotated as being linked to disease resistance in the IWGSC RefSeq v1.0. The 32 significant SNP markers were located in nine haplotype blocks. All the 32 significant SNPs were validated in a set of 60 different genotypes (V-set) using single marker analysis. SNP markers identified in this study can be used in marker-assisted selection, genomic selection, and to develop KASP (Kompetitive Allele Specific PCR) markers for the Sr6 gene. Novel SNPs for the Sr6 gene, an important stem rust resistance gene, were identified and validated in this study. These SNPs can be used to improve stem rust resistance in wheat.
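
To give a sense of what "Bonferroni corrected P < 0.05" implies at this marker density, the sketch below computes the per-SNP threshold. It assumes that all of the roughly 35,000 SNPs count as tests, which the abstract does not state explicitly; the function names are illustrative only.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under Bonferroni correction."""
    return alpha / n_tests

def is_significant(p_raw, n_tests, alpha=0.05):
    """True if the Bonferroni-adjusted p-value (capped at 1) is below alpha."""
    return min(1.0, p_raw * n_tests) < alpha

N_SNPS = 35_000  # approximate count taken from the abstract; an assumption here
threshold = bonferroni_threshold(0.05, N_SNPS)
print(f"per-SNP threshold ~ {threshold:.2e}")
```

Under that assumption, a SNP needs a raw P value of roughly 1.4 × 10⁻⁶ or smaller to pass the corrected 0.05 cutoff.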

  1. High-density single nucleotide polymorphism (SNP) array mapping in Brassica oleracea: identification of QTL associated with carotenoid variation in broccoli florets.

    Science.gov (United States)

    Brown, Allan F; Yousef, Gad G; Chebrolu, Kranthi K; Byrd, Robert W; Everhart, Koyt W; Thomas, Aswathy; Reid, Robert W; Parkin, Isobel A P; Sharpe, Andrew G; Oliver, Rebekah; Guzman, Ivette; Jackson, Eric W

    2014-09-01

    A high-resolution genetic linkage map of B. oleracea was developed from a B. napus SNP array. The work will facilitate genetic and evolutionary studies in Brassicaceae. A broccoli population, VI-158 × BNC, consisting of 150 F2:3 families was used to create a saturated Brassica oleracea (diploid: CC) linkage map using a recently developed rapeseed (Brassica napus) (tetraploid: AACC) Illumina Infinium single nucleotide polymorphism (SNP) array. The map consisted of 547 non-redundant SNP markers spanning 948.1 cM across nine chromosomes with an average interval size of 1.7 cM. As the SNPs are anchored to the genomic reference sequence of the rapid cycling B. oleracea TO1000, we were able to estimate that the map provides 96 % coverage of the diploid genome. Carotenoid analysis of 2 years data identified 3 QTLs on two chromosomes that are associated with up to half of the phenotypic variation associated with the accumulation of total or individual compounds. By searching the genome sequences of the two related diploid species (B. oleracea and B. rapa), we further identified putative carotenoid candidate genes in the region of these QTLs. This is the first description of the use of a B. napus SNP array to rapidly construct high-density genetic linkage maps of one of the constituent diploid species. The unambiguous nature of these markers with regard to genomic sequences provides evidence to the nature of genes underlying the QTL, and demonstrates the value and impact this resource will have on Brassica research.

  2. Genomic prediction in families of perennial ryegrass based on genotyping-by-sequencing

    DEFF Research Database (Denmark)

    Ashraf, Bilal

    In this thesis we investigate the potential for genomic prediction in perennial ryegrass using genotyping-by-sequencing (GBS) data. Association method based on family-based breeding systems was developed, genomic heritabilities, genomic prediction accuracies and effects of some key factors wer...... explored. Results show that low sequencing depth caused underestimation of allele substitution effects in GWAS and overestimation of genomic heritability in prediction studies. Other factors such as SNP marker density, population structure and size of training population influenced accuracy of genomic...... prediction. Overall, GBS allows for genomic prediction in breeding families of perennial ryegrass and holds good potential to expedite genetic gain and encourage the application of genomic prediction...

  3. Genotyping of Mycobacterium leprae present on Ziehl-Neelsen-stained microscopic slides and in skin biopsy samples from leprosy patients in different geographic regions of Brazil

    Directory of Open Access Journals (Sweden)

    Amanda Nogueira Brum Fontes

    2012-12-01

    Full Text Available We analysed 16 variable number tandem repeats (VNTR) and three single-nucleotide polymorphisms (SNP) in Mycobacterium leprae present on 115 Ziehl-Neelsen (Z-N)-stained slides and in 51 skin biopsy samples derived from leprosy patients from Ceará (n = 23), Pernambuco (n = 41), Rio de Janeiro (n = 22) and Rondônia (RO) (n = 78). All skin biopsies yielded SNP-based genotypes, while 48 of the samples (94.1%) yielded complete VNTR genotypes. We evaluated two procedures for extracting M. leprae DNA from Z-N-stained slides: the first including Chelex and the other combining proteinase and sodium dodecyl sulfate. Of the 76 samples processed using the first procedure, 30.2% were positive for 16 or 15 VNTRs, whereas of the 39 samples processed using the second procedure, 28.2% yielded genotypes defined by at least 10 VNTRs. Combined VNTR and SNP analysis revealed large variability in genotypes, but a high prevalence of SNP genotype 4 in the Northeast Region of Brazil. Our observation of two samples from RO with an identical genotype and seven groups with similar genotypes, including four derived from residents of the same state or region, suggests a tendency to form groups according to the origin of the isolates. This study demonstrates the existence of geographically related M. leprae genotypes and that Z-N-stained slides are an alternative source for M. leprae genotyping.

  4. Comparison of SSR and SNP markers in estimation of genetic diversity and population structure of Indian rice varieties.

    Directory of Open Access Journals (Sweden)

    Nivedita Singh

    Full Text Available Simple sequence repeat (SSR) and Single Nucleotide Polymorphic (SNP), the two most robust markers for identifying rice varieties, were compared for assessment of genetic diversity and population structure. A total of 375 varieties of rice from various regions of India archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty six genetic markers, each of hypervariable SSR (HvSSR) and SNP, which were distributed across 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers with an average of 2.22 alleles per locus whereas 72 alleles were amplified with SNP markers. Polymorphic information content (PIC) values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; utilizing an unrooted tree all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA) indicated that maximum diversity was partitioned between and within individual level but not between populations. Principal coordinate analysis (PCoA) with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation whereas, in the case of SNP markers, varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure was tested using K values from 1 to 20, but there was no clear population structure; therefore Ln(PD)-derived Δk was plotted against K to determine the number of populations. In the case of SSR, maximum Δk was at K=5 whereas, in the case of SNP, maximum Δk was found at K=15, suggesting that resolution of population was higher with SNP markers, but SSR were more efficient for diversity analysis.
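
The PIC ranges quoted above can be put in context with a short calculation. The sketch below assumes the widely used Botstein et al. (1980) definition of polymorphic information content; the abstract does not say which formula the authors applied, and the function name is illustrative. Under that definition a biallelic marker cannot exceed 0.375, which is consistent with the reported SNP maximum of 0.37.

```python
from itertools import combinations

def pic(freqs):
    """Polymorphic information content from allele frequencies (Botstein-style).

    PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    """
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross_term

# A biallelic SNP is most informative when both alleles are equally common.
print(pic([0.5, 0.5]))   # upper bound for two alleles
print(pic([0.9, 0.1]))   # a skewed SNP is far less informative
```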

  5. Fine-scaled human genetic structure revealed by SNP microarrays.

    Science.gov (United States)

    Xing, Jinchuan; Watkins, W Scott; Witherspoon, David J; Zhang, Yuhua; Guthery, Stephen L; Thara, Rangaswamy; Mowry, Bryan J; Bulayeva, Kazima; Weiss, Robert B; Jorde, Lynn B

    2009-05-01

    We report an analysis of more than 240,000 loci genotyped using the Affymetrix SNP microarray in 554 individuals from 27 worldwide populations in Africa, Asia, and Europe. To provide a more extensive and complete sampling of human genetic variation, we have included caste and tribal samples from two states in South India, Daghestanis from eastern Europe, and the Iban from Malaysia. Consistent with observations made by Charles Darwin, our results highlight shared variation among human populations and demonstrate that much genetic variation is geographically continuous. At the same time, principal components analyses reveal discernible genetic differentiation among almost all identified populations in our sample, and in most cases, individuals can be clearly assigned to defined populations on the basis of SNP genotypes. All individuals are accurately classified into continental groups using a model-based clustering algorithm, but between closely related populations, genetic and self-classifications conflict for some individuals. The 250K data permitted high-level resolution of genetic variation among Indian caste and tribal populations and between highland and lowland Daghestani populations. In particular, upper-caste individuals from Tamil Nadu and Andhra Pradesh form one defined group, lower-caste individuals from these two states form another, and the tribal Irula samples form a third. Our results emphasize the correlation of genetic and geographic distances and highlight other elements, including social factors that have contributed to population structure.

  6. Results based on 124 cases of breast cancer and 97 controls from Taiwan suggest that the single nucleotide polymorphism (SNP309) in the MDM2 gene promoter is associated with earlier onset and increased risk of breast cancer

    International Nuclear Information System (INIS)

    Sun, Ying-Fang; Leu, Jyh-Der; Chen, Su-Mei; Lin, I-Feng; Lee, Yi-Jang

    2009-01-01

    It has been suggested that the single nucleotide polymorphism 309 (SNP309, T -> G) in the promoter region of the MDM2 gene is important for tumor development; however, with regard to breast cancer, inconsistent associations have been reported worldwide. It is speculated that these conflicting results may have arisen from the different patient subgroups and ethnicities studied. For the first time, this study explores the effect of the MDM2 SNP309 genotype on Taiwanese breast cancer patients. Genomic DNA was obtained from the whole blood of 124 breast cancer patients and 97 cancer-free healthy women living in Taiwan. MDM2 SNP309 genotyping was carried out by restriction fragment length polymorphism (RFLP) assay. Multivariate logistic regression and the Kaplan-Meier method were used to analyze the risk association and the significance of age at diagnosis among different MDM2 SNP309 genotypes, respectively. Compared to the TT genotype, an increased risk association with breast cancer was apparent for the GG genotype (OR = 3.05, 95% CI = 1.04 to 8.95), and for the TG genotype (OR = 2.12, 95% CI = 0.90 to 5.00) after adjusting for age, cardiovascular disease/diabetes, oral contraceptive usage, and body mass index, which exhibits significant difference between cases and controls. Furthermore, the average ages at diagnosis for breast cancer patients were 53.6, 52 and 47 years for those harboring TT, TG and GG genotypes, respectively. A significant difference in median age of onset for breast cancer between GG and TT+TG genotypes was obtained by the log-rank test (p = 0.0067). Findings based on the current sample size suggest that the MDM2 SNP309 GG genotype may be associated with both the risk of breast cancer and an earlier age of onset in Taiwanese women.

  7. Analysis of single nucleotide polymorphism (SNP) RS23472497 associated with canine atopic dermatitis by ACRS-PCR method

    Directory of Open Access Journals (Sweden)

    Martina Miluchová

    2014-05-01

    Full Text Available The aim of the paper was to identify the SNP rs23472497 associated with canine atopic dermatitis (cAD). cAD is a common inflammatory skin disease that is considered to be a naturally occurring, spontaneous model of human atopic dermatitis (eczema). The material involved 60 dogs from 6 different breeds. Canine genomic DNA was isolated from saliva by a modified method using DNAzol® and linear polyacrylamide (LPA) carrier, and from blood using the commercial NucleospinBlood kit, and was used to determine rs23472497 SNP genotypes by the ACRS-PCR method. The PCR products were digested with the NlaIII restriction enzyme. In the Czech Pointer and Slovak Wirehaired Pointer populations we detected all three genotypes, AA, AG and GG, with frequencies of 0.0732, 0.5122 and 0.4146 for Czech Pointer and 0.1818, 0.5455 and 0.2727 for Slovak Wirehaired Pointer. In Border Collie, the heterozygous genotype AG and the homozygous genotype GG were observed, with frequencies of 0.6667 and 0.3333, respectively. In German Wirehaired Pointer, Australian Shepherd and American Staffordshire Terrier we detected only genotype AG, with a frequency of 1. The A allele frequency ranged from 0.3293 to 0.5, and the G allele frequency from 0.5 to 0.6707.
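
    The allele frequency ranges quoted in this abstract follow directly from the genotype frequencies: each allele's frequency is its homozygote frequency plus half the heterozygote frequency. The sketch below reproduces that arithmetic for the three breeds with more than one genotype; the function name `allele_freqs` is an illustrative choice, not something taken from the paper.

```python
def allele_freqs(f_aa, f_ag, f_gg):
    """Return (freq_A, freq_G) from the genotype frequencies of AA, AG and GG."""
    total = f_aa + f_ag + f_gg
    if abs(total - 1.0) > 1e-3:
        raise ValueError("genotype frequencies should sum to 1")
    # Each heterozygote carries one copy of each allele.
    freq_a = f_aa + f_ag / 2
    freq_g = f_gg + f_ag / 2
    return freq_a, freq_g


# Genotype frequencies as reported in the abstract.
czech = allele_freqs(0.0732, 0.5122, 0.4146)
slovak = allele_freqs(0.1818, 0.5455, 0.2727)
border_collie = allele_freqs(0.0, 0.6667, 0.3333)

for breed, (a, g) in [("Czech Pointer", czech),
                      ("Slovak Wirehaired Pointer", slovak),
                      ("Border Collie", border_collie)]:
    print(f"{breed}: A = {a:.4f}, G = {g:.4f}")
```

    For Czech Pointer this gives A = 0.3293 and G = 0.6707, the lower and upper ends of the reported ranges; breeds in which only AG was observed sit at 0.5 for both alleles.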

  8. Validation of Genotyping-By-Sequencing Analysis in Populations of Tetraploid Alfalfa by 454 Sequencing

    Science.gov (United States)

    Rocher, Solen; Jean, Martine; Castonguay, Yves; Belzile, François

    2015-01-01

    Genotyping-by-sequencing (GBS) is a relatively low-cost high throughput genotyping technology based on next generation sequencing and is applicable to orphan species with no reference genome. A combination of genome complexity reduction and multiplexing with DNA barcoding provides a simple and affordable way to resolve allelic variation between plant samples or populations. GBS was performed on ApeKI libraries using DNA from 48 genotypes each of two heterogeneous populations of tetraploid alfalfa (Medicago sativa spp. sativa): the synthetic cultivar Apica (ATF0) and a derived population (ATF5) obtained after five cycles of recurrent selection for superior tolerance to freezing (TF). Nearly 400 million reads were obtained from two lanes of an Illumina HiSeq 2000 sequencer and analyzed with the Universal Network-Enabled Analysis Kit (UNEAK) pipeline designed for species with no reference genome. Following the application of whole dataset-level filters, 11,694 single nucleotide polymorphism (SNP) loci were obtained. About 60% had a significant match on the Medicago truncatula syntenic genome. The accuracy of allelic ratios and genotype calls based on GBS data was directly assessed using 454 sequencing on a subset of SNP loci scored in eight plant samples. Sequencing depth in this study was not sufficient for accurate tetraploid allelic dosage, but reliable genotype calls based on diploid allelic dosage were obtained when using additional quality filtering. Principal Component Analysis of SNP loci in plant samples revealed that a small proportion (<5%) of the genetic variability assessed by GBS is able to differentiate ATF0 and ATF5. Our results confirm that analysis of GBS data using UNEAK is a reliable approach for genome-wide discovery of SNP loci in outcrossed polyploids. PMID:26115486

  9. Collagen Type I alpha1 (COL1A1) Gene Polymorphism and Bone Mineral Density in Postmenopausal Kazakh Women

    Directory of Open Access Journals (Sweden)

    Akbota Aitkulova

    2014-12-01

    Full Text Available Introduction: The single nucleotide polymorphism (SNP) at the collagen type I alpha 1 gene (COL1A1) rs1800012 has been widely studied and has shown an association with bone mineral density (BMD) and fractures. The minor-allele genotype TT of this SNP was found to be greatly overrepresented in individuals with fractures compared to controls, thus becoming a good predictor of increased fracture risk. The aim of this investigation was to evaluate a potential association between COL1A1 gene polymorphism and osteoporosis in Kazakh postmenopausal women. Methods: The study population included 103 postmenopausal women recruited from Pavlodar and Almaty clinics. BMD was measured using DEXA. Genomic DNA was extracted from peripheral venous blood of study participants with the Wizard® Genomic DNA Purification Kit (Promega, USA). Detection of the COL1A1 +1245G/T (Sp1) polymorphism was performed with the TaqMan® SNP Genotyping Assay by real-time PCR. Results: Densitometry results revealed 36 osteoporotic, 42 osteopenic, and 25 normal postmenopausal women. Data analysis of the 1245G>T polymorphism in the COL1A1 gene in the groups of women with osteopenia and osteoporosis revealed deviation from Hardy-Weinberg equilibrium. The mutant TT genotype was more prevalent than the heterozygous genotype GT in both groups. Distributions were 83% GG, 3% GT, and 14% TT in the group with osteopenia and 80% GG, 6% GT, and 14% TT in the group with osteoporosis. The distribution of genotype frequencies in the group of normal postmenopausal women was 76% GG, 16% GT, and 8% TT. Conclusion: These results suggest that the TT genotype of the COL1A1 +1245G/T (Sp1) polymorphism is associated with risk of postmenopausal osteoporosis in Kazakh women. Further studies involving a larger number of women are needed to clarify the relationship of this polymorphism with risk of osteoporosis.
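
    A deviation from Hardy-Weinberg equilibrium of the kind reported here is commonly checked with a one-degree-of-freedom chi-square goodness-of-fit test on genotype counts. The abstract does not say which test the authors used, so the following is only a minimal sketch of that standard approach. The counts (35, 1, 6) are hypothetical values chosen to approximate the reported 83% / 3% / 14% split among 42 osteopenic women; they are not data from the study.

```python
import math


def hwe_chi_square(n_gg, n_gt, n_tt):
    """Chi-square test (1 df) of Hardy-Weinberg proportions for a biallelic SNP."""
    n = n_gg + n_gt + n_tt
    p = (2 * n_gg + n_gt) / (2 * n)  # frequency of the G allele
    q = 1 - p                        # frequency of the T allele
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_gg, n_gt, n_tt)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical counts approximating the osteopenia group (assumption, see lead-in).
chi2, p_value = hwe_chi_square(35, 1, 6)
print(f"chi2 = {chi2:.2f}, p = {p_value:.2e}")
```

    With so few expected TT individuals, an exact test would usually be preferred in practice; the chi-square version is shown only because it makes the expected-versus-observed comparison explicit.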

  10. Phylogenetic relationships among the European and American bison and seven cattle breeds reconstructed using the Bovine SNP50 Illumina Genotyping BeadChip

    DEFF Research Database (Denmark)

    Pertoldi, Cino; Wójcik, Jan M; Kawalko, Agata

    2010-01-01

    Here we present the first attempt to use the BovineSNP50 Illumina genotyping BeadChip for genome-wide screening of European bison Bison bonasus bonasus (EB), two subspecies of American bison: the plains bison Bison bison bison (PB), the wood bison Bison bison athabascae (WB) and seven cattle Bos taurus breeds. Our aims were to (1) reconstruct their evolutionary relationships, (2) detect any genetic signature of past bottlenecks and to quantify the consequences of bottlenecks on the genetic distances amongst bison subspecies and cattle, and (3) detect loci under positive or stabilizing selection. A Bayesian clustering procedure (STRUCTURE) detected ten genetically distinct clusters, with separation among all seven cattle breeds and European and American bison, but no separation between plain...

  11. Whole-genome single-nucleotide polymorphism (SNP) marker discovery and association analysis with the eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content in Larimichthys crocea

    Directory of Open Access Journals (Sweden)

    Shijun Xiao

    2016-12-01

    Full Text Available Whole-genome single-nucleotide polymorphism (SNP) markers are valuable genetic resources for association and conservation studies. Genome-wide SNP development in many teleost species is still challenging because of genome complexity and the cost of re-sequencing. Genotyping-by-sequencing (GBS) provides an efficient reduced-representation method that lowers the cost of SNP detection; however, most recent GBS applications have been reported in plants. In this work, we applied an EcoRI-NlaIII based GBS protocol to a teleost, the large yellow croaker, an important commercial fish in China and East Asia, and report the first whole-genome SNP development for the species. In total, 69,845 high-quality SNP markers, evenly distributed along the genome, were detected in at least 80% of 500 individuals. Nearly 95% of randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. Association studies with muscle eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content discovered 39 significant SNP markers, accounting for up to ∼63% of the genetic variance explained by all markers. Functional genes involved in the fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, the PPT2 gene, previously identified in an association study of plasma n-3 and n-6 polyunsaturated fatty acid levels in humans, was re-discovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS can produce quality SNP markers in a cost-efficient manner in a teleost genome. The developed SNP markers and the EPA- and DHA-associated SNP loci provide invaluable resources for population structure, conservation genetics and genomic selection studies of large yellow croaker and other fish.

  12. Association of single nucleotide polymorphism at position 45 in adiponectin gene with plasma adiponectin level and insulin resistance in obesity

    International Nuclear Information System (INIS)

    Chen Xiaoyu; Li Xisheng; Lin Xiahong; Gao Hongzhi; Li Qiulan; Zha Jinshun

    2012-01-01

    Objective: To explore the association of the single nucleotide polymorphism at position 45 (SNP45) in the adiponectin gene with plasma adiponectin level and insulin resistance in obesity in the Quanzhou area of Fujian province. Methods: Two hundred and forty-eight patients with obesity and 225 normal control subjects were enrolled in this study. Fasting insulin (FINS) was measured by radioimmunoassay, and fasting plasma glucose (FPG), total cholesterol (TC), triglyceride (TG), high density lipoprotein-cholesterol (HDL-C) and low density lipoprotein-cholesterol (LDL-C) were measured with a BECKMAN DXC800 biochemistry analyzer. Body mass index (BMI), waist to hip ratio and homeostasis model assessment of insulin resistance (HOMA-IR) were calculated. Plasma adiponectin levels were determined by enzyme-linked immunosorbent assay. The adiponectin gene SNP45 was identified by PCR-restriction fragment length polymorphism. Results: (1) Frequencies of the GG+GT genotype in the obesity group and the normal control group were 61% and 44%, respectively (χ²=14.182, P<0.01), and G allele frequencies were 35% and 25% (χ²=10.708, P<0.01). (2) In the obesity group, subjects with the SNP45 GG+GT genotype had higher TG and LDL-C levels than those with the TT genotype (t=2.604, P<0.01; t=5.507, P<0.01), a lower adiponectin level (t=2.275, P<0.05), and a significantly lower HDL-C level (t=10.100, P<0.01). (3) In the normal control group, subjects with the SNP45 GG+GT genotype had significantly lower adiponectin, TG and TC levels than those with the TT genotype (t=2.510, P<0.05; t=2.922, P<0.01; t=3.272, P<0.01). (4) Logistic analysis showed that the SNP45 GG+GT genotype in the obesity group was associated with a decreased plasma adiponectin level (OR=0.810, 95% CI: 0.673-0.975, P<0.05) and with increased risk of HOMA-IR (OR=1.746, 95% CI: 1.060-2.875, P<0.05). The SNP45 GG+GT genotype in the normal control group was associated with increased risk of HOMA-IR (OR=3
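
    The derived indices named in the Methods (BMI, waist to hip ratio, HOMA-IR) are simple ratios of the measured quantities. The abstract does not give the formulas or units used, so the sketch below assumes the conventional definitions: HOMA-IR as fasting insulin (µU/mL) times fasting glucose (mmol/L) divided by 22.5, and BMI as weight (kg) over height (m) squared. The function names and the example inputs are illustrative only.

```python
def homa_ir(fasting_insulin_uU_ml, fasting_glucose_mmol_l):
    """Conventional HOMA-IR: insulin (µU/mL) x glucose (mmol/L) / 22.5."""
    return fasting_insulin_uU_ml * fasting_glucose_mmol_l / 22.5


def bmi(weight_kg, height_m):
    """Body mass index in kg/m^2."""
    return weight_kg / height_m ** 2


def waist_hip_ratio(waist_cm, hip_cm):
    """Waist to hip ratio (dimensionless; both inputs in the same unit)."""
    return waist_cm / hip_cm


# Made-up inputs, purely to show the arithmetic.
print(homa_ir(10.0, 4.5))
print(round(bmi(70.0, 1.75), 2))
print(round(waist_hip_ratio(90.0, 100.0), 2))
```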

  13. Association between ABCG1 polymorphism rs1893590 and high-density lipoprotein (HDL) in an asymptomatic Brazilian population.

    Science.gov (United States)

    Zago, V H S; Scherrer, D Z; Parra, E S; Panzoldo, N B; Alexandre, F; Nakandakare, E R; Quintão, E C R; de Faria, E C

    2015-03-01

    ATP binding cassette transporter G1 (ABCG1) promotes lipidation of nascent high-density lipoprotein (HDL) particles, acting as an intracellular transporter. SNP rs1893590 (c.-204A > C) of the ABCG1 gene has previously been studied and reported to have a functional effect on plasma HDL-C and lipoprotein lipase activity. This study aimed to investigate the relationships of SNP rs1893590 with plasma lipids and lipoproteins in a large Brazilian population. A total of 654 asymptomatic and normolipidemic volunteers of both genders were selected. Clinical and anthropometric data were taken and blood samples were drawn after 12 h fasting. Plasma lipids and lipoproteins, as well as HDL particle size and volume, were determined. Genomic DNA was isolated for SNP rs1893590 detection by TaqMan® OpenArray® Real-Time PCR Platform (Applied Biosystems). Mann-Whitney U, chi-square and two-way ANOVA tests were used. No significant differences were found in the comparison analyses between the allele groups for any of the studied parameters. However, significant interactions were observed between SNP and age on plasma HDL-C, where volunteers under 60 years with the AA genotype had increased HDL-C (p = 0.048). Similar results were observed in the group with body mass index (BMI) m(2), where volunteers with the AA genotype had higher HDL-C levels (p = 0.0034), plus an increased HDL particle size (p = 0.01). These findings indicate that SNP rs1893590 of ABCG1 has a significant impact on HDL-C under asymptomatic clinical conditions in an age- and BMI-dependent way.

  14. A high-density genetic map and QTL analysis of agronomic traits in foxtail millet [Setaria italica (L.) P. Beauv.] using RAD-seq.

    Science.gov (United States)

    Wang, Jun; Wang, Zhilan; Du, Xiaofen; Yang, Huiqing; Han, Fang; Han, Yuanhuai; Yuan, Feng; Zhang, Linyi; Peng, Shuzhong; Guo, Erhu

    2017-01-01

    Foxtail millet (Setaria italica), a very important grain crop in China, has become a new model plant for cereal crops and biofuel grasses. Although its reference genome sequence was released recently, knowledge of the quantitative trait loci (QTLs) controlling complex agronomic traits remains limited. The development of massively parallel genotyping methods and next-generation sequencing technologies provides an excellent opportunity for developing single-nucleotide polymorphisms (SNPs) for linkage map construction and QTL analysis of complex quantitative traits. In this study, a high-throughput and cost-effective RAD-seq approach was employed to generate a high-density genetic map for foxtail millet. A total of 2,668,587 SNP loci were detected according to the reference genome sequence; meanwhile, 9,968 SNP markers were used to genotype 124 F2 progenies derived from the cross between Hongmiaozhangu and Changnong35; a high-density genetic map spanning 1648.8 cM, with an average distance of 0.17 cM between adjacent markers, was constructed; 11 major QTLs for eight agronomic traits were identified; and five co-dominant DNA markers were developed. These findings will be of value for the identification of candidate genes and marker-assisted selection in foxtail millet.

  15. p.Q192R SNP of PON1 seems not to be Associated with Carotid Atherosclerosis Risk Factors in an Asymptomatic and Normolipidemic Brazilian Population Sample

    Directory of Open Access Journals (Sweden)

    Daniel Zanetti Scherrer

    2015-07-01

    Full Text Available Background: Evidence suggests that paraoxonase 1 (PON1) confers important antioxidant and anti-inflammatory properties when associated with high-density lipoprotein (HDL). Objective: To investigate the relationships between the p.Q192R SNP of PON1, biochemical parameters and carotid atherosclerosis in an asymptomatic, normolipidemic Brazilian population sample. Methods: We studied 584 volunteers (females n = 326, males n = 258; 19-75 years of age). Total genomic DNA was extracted and the SNP was detected on the TaqMan® SNP OpenArray® genotyping platform (Applied Biosystems, Foster City, CA). Plasma lipoproteins and apolipoproteins were determined and PON1 activity was measured using paraoxon as a substrate. High-resolution β-mode ultrasonography was used to measure cIMT and the presence of carotid atherosclerotic plaques in a subgroup of individuals (n = 317). Results: The presence of p.192Q was associated with a significant increase in PON1 activity (RR = 12.30 (11.38; RQ = 46.96 (22.35; QQ = 85.35 (24.83 μmol/min; p Conclusion: In low-risk individuals, the presence of the p.192Q variant of PON1 is associated with a beneficial plasma lipid profile but not with carotid atherosclerosis.

  16. An Improved Opposition-Based Learning Particle Swarm Optimization for the Detection of SNP-SNP Interactions

    Science.gov (United States)

    Shang, Junliang; Sun, Yan; Li, Shengjun; Liu, Jin-Xing; Zheng, Chun-Hou; Zhang, Junying

    2015-01-01

    SNP-SNP interactions have been receiving increasing attention in understanding the mechanisms underlying susceptibility to complex diseases. Although much work has been done on the detection of SNP-SNP interactions, algorithmic development is still ongoing. In this study, an improved opposition-based learning particle swarm optimization (IOBLPSO) is proposed for the detection of SNP-SNP interactions. The highlights of IOBLPSO are the introduction of three strategies, namely, opposition-based learning, dynamic inertia weight, and a postprocedure. Opposition-based learning not only enhances the global explorative ability, but also avoids premature convergence. Dynamic inertia weight allows particles to cover a wider search space when the considered SNP is likely to be a random one and to converge on promising regions of the search space when capturing a highly suspected SNP. The postprocedure is used to carry out a deep search in highly suspected SNP sets. Experiments with IOBLPSO were performed on both simulated data sets and a real data set of age-related macular degeneration, the results of which demonstrate that IOBLPSO is promising in detecting SNP-SNP interactions. IOBLPSO might be an alternative to existing methods for detecting SNP-SNP interactions. PMID:26236727
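
    The abstract names opposition-based learning without giving its update rule. In the general optimization literature, the "opposite" of a point x in the interval [a, b] is a + b - x, and a candidate is kept or replaced by its opposite depending on which scores better. The sketch below applies that generic definition to particles that encode a set of SNP indices. It is not the IOBLPSO algorithm itself: `obl_initialize`, the `fitness` callback and the toy scoring function are all hypothetical.

```python
import random


def opposite(position, lower, upper):
    """Generic opposition-based learning: mirror each coordinate within [lower, upper]."""
    return [lower + upper - x for x in position]


def obl_initialize(n_particles, k, n_snps, fitness, rng):
    """Draw random k-SNP particles and keep the fitter of each particle and its opposite."""
    swarm = []
    for _ in range(n_particles):
        candidate = rng.sample(range(n_snps), k)        # k distinct SNP indices
        mirrored = opposite(candidate, 0, n_snps - 1)   # still k distinct indices
        swarm.append(max((candidate, mirrored), key=fitness))
    return swarm


# Toy fitness (assumption): a real application would score the association
# between the SNP combination and the phenotype instead.
toy_fitness = sum
swarm = obl_initialize(n_particles=5, k=2, n_snps=100,
                       fitness=toy_fitness, rng=random.Random(1))
print(swarm)
```

    Evaluating both a point and its mirror image is what gives the method its wider initial coverage of the search space; the dynamic inertia weight and postprocedure described in the abstract are not modeled here.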

  17. SNP interaction pattern identifier (SIPI)

    DEFF Research Database (Denmark)

    Lin, Hui Yi; Chen, Dung Tsa; Huang, Po Yu

    2017-01-01

    Motivation: Testing SNP-SNP interactions is considered key to overcoming bottlenecks in genetic association studies. However, related statistical methods for testing SNP-SNP interactions are underdeveloped. Results: We propose the SNP Interaction Pattern Identifier (SIPI), which tests 45...

  18. Switchgrass genomic diversity, ploidy, and evolution: novel insights from a network-based SNP discovery protocol.

    Directory of Open Access Journals (Sweden)

    Fei Lu

    Full Text Available Switchgrass (Panicum virgatum L.) is a perennial grass that has been designated as an herbaceous model biofuel crop for the United States of America. To facilitate accelerated breeding programs of switchgrass, we developed both an association panel and linkage populations for genome-wide association study (GWAS) and genomic selection (GS). All of the 840 individuals were then genotyped using genotyping by sequencing (GBS), generating 350 GB of sequence in total. As a highly heterozygous polyploid (tetraploid and octoploid) species lacking a reference genome, switchgrass is highly intractable with earlier methodologies of single nucleotide polymorphism (SNP) discovery. To access the genetic diversity of species like switchgrass, we developed a SNP discovery pipeline based on a network approach called the Universal Network-Enabled Analysis Kit (UNEAK). Complexities that hinder single nucleotide polymorphism discovery, such as repeats, paralogs, and sequencing errors, are easily resolved with UNEAK. Here, 1.2 million putative SNPs were discovered in a diverse collection of primarily upland, northern-adapted switchgrass populations. Further analysis of this data set revealed the fundamentally diploid nature of tetraploid switchgrass. Taking advantage of the high conservation of genome structure between switchgrass and foxtail millet (Setaria italica (L.) P. Beauv.), two parent-specific, synteny-based, ultra high-density linkage maps containing a total of 88,217 SNPs were constructed. Also, our results showed clear patterns of isolation-by-distance and isolation-by-ploidy in natural populations of switchgrass. Phylogenetic analysis supported a general south-to-north migration path of switchgrass. In addition, this analysis suggested that upland tetraploid arose from upland octoploid. Altogether, this study provides unparalleled insights into the diversity, genomic complexity, population structure, phylogeny, phylogeography, ploidy, and evolutionary dynamics

  19. Applying SNP marker technology in the cacao breeding program at the Cocoa Research Institute of Ghana

    Science.gov (United States)

    In this investigation 45 parental cacao plants and five progeny derived from the parental stock studied were genotyped using six SNP markers to determine off-types or mislabeled clones and to authenticate crosses made in the Cocoa Research Institute of Ghana (CRIG) breeding program. Investigation wa...

  20. Identification and genotyping of feline infectious peritonitis-associated single nucleotide polymorphisms in the feline interferon-γ gene.

    Science.gov (United States)

    Hsieh, Li-En; Chueh, Ling-Ling

    2014-05-21

    Feline infectious peritonitis (FIP) is an immune-mediated, highly lethal disease caused by feline coronavirus (FCoV) infection. Currently, no protective vaccine or effective treatment for the disease is available. Studies have found that some cats survive the challenge of virulent FCoV isolates. Since cellular immunity is thought to be critical in preventing FIP and because diseased cats often show a significant decrease in interferon-γ (IFN-γ) production, we investigated whether single nucleotide polymorphisms (SNP) in the feline IFN-γ gene (fIFNG) are associated with the outcome of infection. A total of 82 asymptomatic and 63 FIP cats were analyzed, and 16 SNP were identified in intron 1 of fIFNG. Among these SNP, the fIFNG + 428 T allele was shown to be a FIP-resistant allele (p = 0.03), and the heterozygous genotypes 01C/T and +408C/T were found to be FIP-susceptible factors (p = 0.004). Furthermore, the fIFNG + 428 resistant allele also showed a clear correlation with the plasma level of IFN-γ in FIP cats. For the identification of these three FIP-related SNP, genotyping methods were established using amplification refractory mutation system PCR (ARMS-PCR) and restriction fragment length polymorphisms (RFLP), and the different genotypes could easily be identified without sequencing. The identification of additional FIP-related SNP will allow the selection of resistant cats and decrease the morbidity of the cat population to FIP.

  1. UGbS-Flex, a novel bioinformatics pipeline for imputation-free SNP discovery in polyploids without a reference genome: finger millet as a case study.

    Science.gov (United States)

    Qi, Peng; Gimode, Davis; Saha, Dipnarayan; Schröder, Stephan; Chakraborty, Debkanta; Wang, Xuewen; Dida, Mathews M; Malmberg, Russell L; Devos, Katrien M

    2018-06-15

    Research on orphan crops is often hindered by a lack of genomic resources. With the advent of affordable sequencing technologies, genotyping an entire genome or, for large-genome species, a representative fraction of the genome has become feasible for any crop. Nevertheless, most genotyping-by-sequencing (GBS) methods are geared towards obtaining large numbers of markers at low sequence depth, which excludes their application in heterozygous individuals. Furthermore, bioinformatics pipelines often lack the flexibility to deal with paired-end reads or to be applied in polyploid species. UGbS-Flex combines publicly available software with in-house python and perl scripts to efficiently call SNPs from genotyping-by-sequencing reads irrespective of the species' ploidy level, breeding system and availability of a reference genome. Noteworthy features of the UGbS-Flex pipeline are an ability to use paired-end reads as input, an effective approach to cluster reads across samples with enhanced outputs, and maximization of SNP calling. We demonstrate use of the pipeline for the identification of several thousand high-confidence SNPs with high representation across samples in an F3-derived F2 population in the allotetraploid finger millet. Robust high-density genetic maps were constructed using the time-tested mapping program MAPMAKER which we upgraded to run efficiently and in a semi-automated manner in a Windows Command Prompt Environment. We exploited comparative GBS with one of the diploid ancestors of finger millet to assign linkage groups to subgenomes and demonstrate the presence of chromosomal rearrangements. The paper combines GBS protocol modifications, a novel flexible GBS analysis pipeline, UGbS-Flex, recommendations to maximize SNP identification, updated genetic mapping software, and the first high-density maps of finger millet. The modules used in the UGbS-Flex pipeline and for genetic mapping were applied to finger millet, an allotetraploid selfing species

  2. RS-SNP: a random-set method for genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Mukherjee Sayan

    2011-03-01

    Full Text Available Abstract Background: The typical objective of genome-wide association (GWA) studies is to identify single-nucleotide polymorphisms (SNPs) and corresponding genes with the strongest evidence of association (the 'most-significant SNPs/genes' approach). Borrowing ideas from micro-array data analysis, we propose a new method, named RS-SNP, for detecting sets of genes enriched in SNPs moderately associated to the phenotype. RS-SNP assesses whether the number of significant SNPs, with p-value P ≤ α, belonging to a given SNP set is statistically significant. The rationale of the proposed method is that two kinds of null hypotheses are taken into account simultaneously. In the first null model the genotype and the phenotype are assumed to be independent random variables and the null distribution is the probability of the number of significant SNPs in the set being greater than observed by chance. The second null model assumes the number of significant SNPs in the set depends on the size of the set and not on the identity of the SNPs in it. Statistical significance is assessed using non-parametric permutation tests. Results: We applied RS-SNP to the Crohn's disease (CD) data set collected by the Wellcome Trust Case Control Consortium (WTCCC) and compared the results with GENGEN, an approach recently proposed in the literature. The enrichment analysis using RS-SNP and the set of pathways contained in the MSigDB C2 CP pathway collection highlighted 86 pathways rich in SNPs weakly associated to CD. Of these, 47 were also indicated to be significant by GENGEN. Similar results were obtained using the MSigDB C5 pathway collection. Many of the pathways found to be enriched by RS-SNP have a well-known connection to CD and often with inflammatory diseases. Conclusions: The proposed method is a valuable alternative to other techniques for enrichment analysis of SNP sets. It is well founded from a theoretical and statistical perspective. Moreover, the experimental comparison with GENGEN highlights that it is
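
    The second null model described above (the count of significant SNPs depends only on the size of the set, not on which SNPs it contains) can be illustrated with a simple resampling test: compare the observed count against counts in randomly drawn SNP sets of the same size. This is a sketch of that one idea under stated assumptions, not the RS-SNP implementation; the first null model, which requires permuting phenotypes, is not shown, and `random_set_pvalue` and its parameters are hypothetical names.

```python
import random


def random_set_pvalue(pvalues, snp_set, alpha=0.05, n_perm=10000, seed=0):
    """Empirical p-value for enrichment of significant SNPs in snp_set.

    pvalues: dict mapping SNP id -> single-SNP association p-value.
    snp_set: iterable of SNP ids forming the set (e.g. SNPs mapped to a pathway).
    """
    rng = random.Random(seed)
    members = set(snp_set)
    significant = {snp for snp, p in pvalues.items() if p <= alpha}
    observed = len(significant & members)

    all_snps = list(pvalues)
    hits = 0
    for _ in range(n_perm):
        # Random set of the same size, ignoring SNP identity.
        draw = rng.sample(all_snps, len(members))
        if sum(snp in significant for snp in draw) >= observed:
            hits += 1
    # Add-one correction keeps the estimate away from an impossible p = 0.
    return observed, (hits + 1) / (n_perm + 1)


# Toy data (assumption): 100 SNPs, the first 10 of which are "significant".
toy = {f"snp{i}": (0.01 if i < 10 else 0.5) for i in range(100)}
print(random_set_pvalue(toy, [f"snp{i}" for i in range(10)], n_perm=2000))
```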

  3. Application of high-resolution DNA melting for genotyping in lepidopteran non-model species: Ostrinia furnacalis (Crambidae).

    Directory of Open Access Journals (Sweden)

    FengBo Li

    Full Text Available Development of an ideal marker system facilitates a better understanding of the genetic diversity in lepidopteran non-model organisms, which have abundant species but relatively limited genomic resources. Single nucleotide polymorphisms (SNPs) discovered within single-copy genes have proved to be desirable markers, but SNP genotyping by current techniques remains laborious and expensive. High resolution melting (HRM) curve analysis represents a simple, rapid and inexpensive genotyping method that is primarily confined to clinical and diagnostic studies. In this study, we evaluated the potential of HRM analysis for SNP genotyping in the lepidopteran non-model species Ostrinia furnacalis (Crambidae). Small amplicon and unlabeled probe assays were developed for the SNPs, which were identified in 30 females of O. furnacalis from 3 different populations by direct sequencing. Both assays were then applied to genotype 90 unknown female DNA samples by prior mixing with known wild-type DNA. The genotyping results were compared with those obtained using bi-directional sequencing analysis. Our results demonstrated the efficiency and reliability of the HRM assays. HRM has the potential to provide simple, cost-effective genotyping assays and facilitates genotyping studies in any non-model lepidopteran species of interest.

  4. MMP-8 genotypes influence the inflammatory response in human endotoxemia.

    Science.gov (United States)

    Rella, Judith M; Jilma, Bernd; Fabry, Astrid; Kaynar, A Murat; Mayr, Florian B

    2014-04-01

    Clinical studies have reported associations between MMP-8 genotypes and clinical outcomes without exploring underlying mechanisms. This study aims to understand the influence of the rs1940475 SNP on downstream chemokine and cytokine response in human endotoxemia. Rs1940475 was genotyped in 44 healthy Caucasian males, who were challenged with an intravenous bolus of 2 ng/kg lipopolysaccharide (LPS). Plasma levels of tumor necrosis factor (TNF), interleukin (IL)-6, IL-8, and macrophage inflammatory protein (MIP)-1α were measured at baseline and 2, 4, 6, and 24 h after LPS infusion with high-sensitivity enzyme immunoassays. Peak TNF levels at 2 h after LPS infusion were significantly higher in subjects with AA genotype compared to subjects with AG or GG genotypes (185 pg/mL [IQR, 154-234] vs. 94 pg/mL [IQR, 65-125] vs. 107 pg/mL [IQR, 80-241], respectively; p = 0.03 between groups). Peak IL-6 levels were trend-wise higher in subjects with AA genotype compared to those with AG or GG genotypes (566 pg/mL [IQR, 294-644] vs. 278 pg/mL [IQR, 184-539] and 329 pg/mL [IQR, 240-492], respectively; p = 0.15 between groups). In contrast, peak MIP-1α at 2 h was highest in GG genotype carriers compared to those with AG or AA genotypes (602 pg/mL [IQR, 449-727] vs. 389 pg/mL [IQR, 375-490] and 510 pg/mL [425-813], respectively; p < 0.03 between groups). AA genotype carriers had highest peak TNF and IL-6 levels after LPS challenge, whereas peak MIP-1α levels were highest in GG carriers. This indicates that the rs1940475 SNP modifies the host response to inflammatory stimuli, which may in part explain previously shown associations with clinical outcomes.

  5. Development and Evaluation of a Barley 50k iSelect SNP Array

    Directory of Open Access Journals (Sweden)

    Micha M. Bayer

    2017-10-01

    Full Text Available High-throughput genotyping arrays continue to be an attractive, cost-effective alternative to sequencing based approaches. We have developed a new 50k Illumina Infinium iSelect genotyping array for barley, a cereal crop species of major international importance. The majority of SNPs on the array have been extracted from variants called in exome capture data of a wide range of European barley germplasm. We used the recently published barley pseudomolecule assembly to map the exome capture data, which allowed us to generate markers with accurate physical positions and detailed gene annotation. Markers from an existing and widely used barley 9k Infinium iSelect array were carried over onto the 50k chip for backward compatibility. The array design featured 49,267 SNP markers that converted into 44,040 working assays, of which 43,461 were scorable in GenomeStudio. Of the working assays, 6,251 are from the 9k iSelect platform. We validated the SNPs by comparing the genotype calls from the new array to legacy datasets. Rates of agreement averaged 98.1 and 93.9%, respectively, for the legacy 9k iSelect SNP set (Comadran et al., 2012) and the exome capture SNPs. To test the utility of the 50k chip for genetic mapping, we genotyped a segregating population derived from a Golden Promise × Morex cross (Liu et al., 2014) and mapped over 14,000 SNPs to genetic positions which showed a near exact correspondence to their known physical positions. Manual adjustment of the cluster files used by the interpreting software for genotype scoring improved results substantially, but migration of cluster files between sites led to a deterioration of results, suggesting that local adjustment of cluster files is required on a site-per-site basis. Information relating to the markers on the chip is available online at https://ics.hutton.ac.uk/50k.
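
    The marker counts in this abstract imply a few rates that the text does not state directly. The following is a minimal sketch of that arithmetic; the variable names and the `pct` helper are illustrative and do not come from the paper.

```python
# Counts as reported in the abstract for the barley 50k iSelect array.
designed = 49_267    # SNP markers in the array design
working = 44_040     # markers that converted into working assays
scorable = 43_461    # working assays scorable in GenomeStudio
legacy_9k = 6_251    # working assays carried over from the 9k array


def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage."""
    return 100.0 * part / whole


conversion_rate = pct(working, designed)   # designed markers that became working assays
scorable_rate = pct(scorable, working)     # working assays that could be scored
legacy_share = pct(legacy_9k, working)     # share of working assays from the 9k array

print(f"conversion: {conversion_rate:.1f}%")
print(f"scorable:   {scorable_rate:.1f}%")
print(f"9k legacy:  {legacy_share:.1f}%")
```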

  6. Genome-wide detection of CNVs in Chinese indigenous sheep with different types of tails using ovine high-density 600K SNP arrays

    OpenAIRE

    Zhu, Caiye; Fan, Hongying; Yuan, Zehu; Hu, Shijin; Ma, Xiaomeng; Xuan, Junli; Wang, Hongwei; Zhang, Li; Wei, Caihong; Zhang, Qin; Zhao, Fuping; Du, Lixin

    2016-01-01

    Chinese indigenous sheep can be classified into three types based on tail morphology: fat-tailed, fat-rumped, and thin-tailed sheep, of which the typical breeds are large-tailed Han sheep, Altay sheep, and Tibetan sheep, respectively. To unravel the genetic mechanisms underlying the phenotypic differences among Chinese indigenous sheep with tails of three different types, we used ovine high-density 600K SNP arrays to detect genome-wide copy number variation (CNV). In large-tailed Han sheep, A...

  7. High-throughput single nucleotide polymorphism genotyping using nanofluidic Dynamic Arrays

    Directory of Open Access Journals (Sweden)

    Crenshaw Andrew

    2009-01-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs) have emerged as the genetic marker of choice for mapping disease loci and candidate gene association studies, because of their high density and relatively even distribution in the human genome. There is a need for systems allowing medium multiplexing (tens to hundreds of SNPs) with high throughput, which can efficiently and cost-effectively generate genotypes for a very large sample set (thousands of individuals). Methods that are flexible, fast, accurate and cost-effective are urgently needed. This is also important for those who work on high throughput genotyping in non-model systems where off-the-shelf assays are not available and a flexible platform is needed. Results We demonstrate the use of a nanofluidic Integrated Fluidic Circuit (IFC)-based genotyping system for medium-throughput multiplexing known as the Dynamic Array, by genotyping 994 individual human DNA samples on 47 different SNP assays, using nanoliter volumes of reagents. Call rates of greater than 99.5% and call accuracies of greater than 99.8% were achieved in our study, which demonstrates that this is a formidable genotyping platform. The experimental setup is very simple, with a time-to-result for each sample of about 3 hours. Conclusion Our results demonstrate that the Dynamic Array is an excellent genotyping system for medium-throughput multiplexing (30-300 SNPs), which is simple to use and combines rapid throughput with excellent call rates, high concordance and low cost. The exceptional call rates and call accuracy obtained may be of particular interest to those working on validation and replication of genome-wide association (GWA) studies.

  8. Presence of Mycobacterium leprae genotype 4 in environmental waters in Northeast Brazil.

    Science.gov (United States)

    Holanda, Maísa Viana de; Marques, Livia Erika Carlos; Macedo, Maria Luisa Bezerra de; Pontes, Maria Araci de Andrade; Sabadia, José Antonio Beltrão; Kerr, Ligia Regina Franco Sansigolo; Almeida, Rosa Lívia Freitas; Frota, Cristiane Cunha

    2017-01-01

    This study quantified Mycobacterium leprae bacilli in environmental water samples from five municipalities in the State of Ceará by quantitative polymerase chain reaction (qPCR) and compared the identified genotypes with those obtained from leprosy patient biopsies. We collected five replicates from each of the 30 selected reservoirs and skin lesion biopsies from 25 new leprosy cases treated at a reference center in Fortaleza, Ceará from 2010 to 2013. The 16S rRNA gene region of M. leprae was amplified by qPCR and a standard curve was created with the pIDTBlue 16SrRNAMlep plasmid. The Juazeiro do Norte water samples and the biopsies were genotyped (single nucleotide polymorphism [SNP] 1 to 4) and the SNP 4 genotypes were subtyped. Of the 149 water samples analyzed, 54.4% were positive for M. leprae DNA. The M. leprae bacilli copy number ranged from 1.42 × 10^-1 to 1.44 × 10^+2. Most biopsies showed SNP type 4 (64%), while all samples from Juazeiro do Norte were SNP type 4, with subtype 4-N appearing at the highest frequency. We suggest that environmental waters containing M. leprae bacilli play an important role in disease transmission, which would account for PGL-1 seropositivity in individuals living in areas where there is no reported case, and for leprosy in individuals who report no previous contact with other cases. Therefore, further investigation is needed to clarify disease transmission in this region and to explore the role of the environment. We also suggest that surveillance for leprosy cases should be intensified in this area.

  9. Trps1 differentially modulates the bone mineral density between male and female mice and its polymorphism associates with BMD differently between women and men.

    Directory of Open Access Journals (Sweden)

    Lishi Wang

    Full Text Available The objective of our study was to identify genetic factors that regulate bone mineral density (BMD) in mice using well defined recombinant inbred strains. For this purpose we chose the BXD recombinant inbred (RI) strains derived from progeny of the C57BL/6J (B6) and DBA/2J (D2) progenitor strains. We sampled both male and female mice (∼4 each of 46 strains) at 3 months of age, measured their BMD, and conducted QTL mapping. The data were analyzed to identify candidate genes contained within the most significant quantitative trait locus (QTL). Evaluation of candidate genes included functional assessment, single nucleotide polymorphism (SNP) genotyping and direct sequencing. We established that there was a QTL for BMD in males on chromosome 15 with a larger impact than QTLs on all other chromosomes. The QTL on chromosome 15 was narrowed to a genomic region between 38 Mbp and 52 Mbp. By examining transcripts within this region, we found an important candidate gene: trichorhinophalangeal syndrome, type I (Trps1). SNP analysis identified a nonsynonymous SNP (rs32398060) in Trps1 that co-segregated with bone mineral density. Analysis of association between this SNP within TRPS1 and BMD in a human population confirmed its significance.

  10. Application of next-generation sequencing technology to study genetic diversity and identify unique SNP markers in bread wheat from Kazakhstan.

    Science.gov (United States)

    Shavrukov, Yuri; Suchecki, Radoslaw; Eliby, Serik; Abugalieva, Aigul; Kenebayev, Serik; Langridge, Peter

    2014-09-28

    New SNP marker platforms offer the opportunity to investigate the relationships between wheat cultivars from different regions and assess the mechanism and processes that have led to adaptation to particular production environments. Wheat breeding has a long history in Kazakhstan and the aim of this study was to explore the relationship between key varieties from Kazakhstan and germplasm from breeding programs for other regions. The study revealed 5,898 polymorphic markers amongst ten cultivars, of which 2,730 were mapped in the consensus genetic map. Mapped SNP markers were distributed almost equally across the A and B genomes, with between 279 and 484 markers assigned to each chromosome. Marker coverage was approximately 10-fold lower in the D genome. There were 863 SNP markers identified as unique to specific cultivars, and clusters of these markers (regions containing more than three closely mapped unique SNPs) showed specific patterns on the consensus genetic map for each cultivar. Significant intra-varietal genetic polymorphism was identified in three cultivars (Tzelinnaya 3C, Kazakhstanskaya rannespelaya and Kazakhstanskaya 15). Phylogenetic analysis based on inter-varietal polymorphism showed that the very old cultivar Erythrospermum 841 was the most genetically distinct from the other nine cultivars from Kazakhstan, falling in a clade together with the American cultivar Sonora and genotypes from Central and South Asia. The modern cultivar Kazakhstanskaya 19 also fell into a separate clade, together with the American cultivar Thatcher. The remaining eight cultivars shared a single sub-clade but were categorised into four clusters. The accumulated data for SNP marker polymorphisms amongst bread wheat genotypes from Kazakhstan may be used for studying genetic diversity in bread wheat, with potential application for marker-assisted selection and the preparation of a set of genotype-specific markers.

  11. Single Nucleotide Polymorphisms in Common Bean: Their Discovery and Genotyping Using a Multiplex Detection System

    Directory of Open Access Journals (Sweden)

    E. Gaitán-Solís

    2008-11-01

    Full Text Available Single nucleotide polymorphism (SNP) markers are by far the most common form of DNA polymorphism in a genome. The objectives of this study were to discover SNPs in common bean (Phaseolus vulgaris L.) by comparing sequences from coding and noncoding regions obtained from GenBank and genomic DNA, and to compare sequencing results with those obtained using single base extension (SBE) assays on the Luminex-100 system for use in high-throughput germplasm evaluation. We assessed the frequency of SNPs in 47 fragments of common bean DNA, using SBE as the evaluation methodology. We conducted a sequence analysis of 10 genotypes of cultivated and wild beans belonging to the Mesoamerican and Andean genetic pools of P. vulgaris. For the 10 genotypes evaluated, a total of 20,964 bp of sequence were analyzed in each genotype and compared, resulting in the discovery of 239 SNPs and 133 InDels, giving an average SNP frequency of one per 88 bp and an InDel frequency of one per 157 bp. This is the equivalent of a nucleotide diversity (θ) of 6.27 × 10. Comparisons with the SNP genotypes previously obtained by direct sequencing showed that the SBE assays on the Luminex-100 were accurate, with 2.5% being miscalled and 1% showing no signal. These results indicate that the Luminex-100 provides a high-throughput system that can be used to analyze SNPs in large samples of genotypes, both for assessing diversity and for mapping studies.
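
    The per-base-pair frequencies quoted in this abstract follow from dividing the sequence length by the number of variants found. A minimal sketch of that arithmetic is given below; the variable names are illustrative. The InDel figure of one per 157 bp appears to correspond to the quotient truncated rather than rounded, which is an observation about the numbers and not something the abstract states.

```python
# Figures as reported in the abstract.
sequence_bp = 20_964   # bp of sequence analyzed per genotype
snp_count = 239        # SNPs discovered
indel_count = 133      # InDels discovered

bp_per_snp = sequence_bp / snp_count       # about 87.7 bp per SNP
bp_per_indel = sequence_bp / indel_count   # about 157.6 bp per InDel

print(f"one SNP per {bp_per_snp:.1f} bp")
print(f"one InDel per {bp_per_indel:.1f} bp")
```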

  12. Finding the right coverage : The impact of coverage and sequence quality on single nucleotide polymorphism genotyping error rates

    NARCIS (Netherlands)

    Fountain, Emily D.; Pauli, Jonathan N.; Reid, Brendan N.; Palsboll, Per J.; Peery, M. Zachariah

    Restriction-enzyme-based sequencing methods enable the genotyping of thousands of single nucleotide polymorphism (SNP) loci in nonmodel organisms. However, in contrast to traditional genetic markers, genotyping error rates in SNPs derived from restriction-enzyme-based methods remain largely unknown.

  13. SNP based heritability estimation using a Bayesian approach

    DEFF Research Database (Denmark)

    Krag, Kristian; Janss, Luc; Mahdi Shariati, Mohammad

    2013-01-01

    . Differences in family structure were in general not found to influence the estimation of the heritability. For the sample sizes used in this study, a 10-fold increase of SNP density did not improve precision estimates compared with set-ups with a less dense distribution of SNPs. The methods used in this study...

  14. Characterization of a Wheat Breeders' Array suitable for high-throughput SNP genotyping of global accessions of hexaploid bread wheat (Triticum aestivum).

    Science.gov (United States)

    Allen, Alexandra M; Winfield, Mark O; Burridge, Amanda J; Downie, Rowena C; Benbow, Harriet R; Barker, Gary L A; Wilkinson, Paul A; Coghill, Jane; Waterfall, Christy; Davassi, Alessandro; Scopes, Geoff; Pirani, Ali; Webster, Teresa; Brew, Fiona; Bloor, Claire; Griffiths, Simon; Bentley, Alison R; Alda, Mark; Jack, Peter; Phillips, Andrew L; Edwards, Keith J

    2017-03-01

    Targeted selection and inbreeding have resulted in a lack of genetic diversity in elite hexaploid bread wheat accessions. Reduced diversity can be a limiting factor in the breeding of high yielding varieties and crucially can mean reduced resilience in the face of changing climate and resource pressures. Recent technological advances have enabled the development of molecular markers for use in the assessment and utilization of genetic diversity in hexaploid wheat. Starting with a large collection of 819 571 previously characterized wheat markers, here we describe the identification of 35 143 single nucleotide polymorphism-based markers, which are highly suited to the genotyping of elite hexaploid wheat accessions. To assess their suitability, the markers have been validated using a commercial high-density Affymetrix Axiom ® genotyping array (the Wheat Breeders' Array), in a high-throughput 384 microplate configuration, to characterize a diverse global collection of wheat accessions including landraces and elite lines derived from commercial breeding communities. We demonstrate that the Wheat Breeders' Array is also suitable for generating high-density genetic maps of previously uncharacterized populations and for characterizing novel genetic diversity produced by mutagenesis. To facilitate the use of the array by the wheat community, the markers, the associated sequence and the genotype information have been made available through the interactive web site 'CerealsDB'. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

  15. ChIP on SNP-chip for genome-wide analysis of human histone H4 hyperacetylation

    Directory of Open Access Journals (Sweden)

    Porter Christopher J

    2007-09-01

    Full Text Available Abstract Background SNP microarrays are designed to genotype Single Nucleotide Polymorphisms (SNPs). These microarrays report hybridization of DNA fragments and therefore can be used for the purpose of detecting genomic fragments. Results Here, we demonstrate that a SNP microarray can be effectively used in this way to perform chromatin immunoprecipitation (ChIP) on chip as an alternative to tiling microarrays. We illustrate this novel application by mapping whole genome histone H4 hyperacetylation in human myoblasts and myotubes. We detect clusters of hyperacetylated histone H4, often spanning up to 300 kilobases of genomic sequence. Using complementary genome-wide analyses of gene expression by DNA microarray we demonstrate that these clusters of hyperacetylated histone H4 tend to be associated with expressed genes. Conclusion The use of a SNP array for a ChIP-on-chip application (ChIP on SNP-chip) will be of great value to laboratories whose interest is the determination of general rules regarding the relationship of specific chromatin modifications to transcriptional status throughout the genome and the examination of asymmetric modification of chromatin at heterozygous loci.

  16. Gains in QTL detection using an ultra-high density SNP map based on population sequencing relative to traditional RFLP/SSR markers.

    Directory of Open Access Journals (Sweden)

    Huihui Yu

    Full Text Available Huge efforts have been invested in the last two decades to dissect the genetic bases of complex traits, including yields of many crop plants, through quantitative trait locus (QTL) analyses. However, almost all the studies were based on linkage maps constructed using low-throughput molecular markers, e.g. restriction fragment length polymorphisms (RFLPs) and simple sequence repeats (SSRs); these maps are thus mostly of low density and not able to provide precise and complete information about the numbers and locations of the genes or QTLs controlling the traits. In this study, we constructed an ultra-high density genetic map based on high quality single nucleotide polymorphisms (SNPs) from low-coverage sequences of a recombinant inbred line (RIL) population of rice, generated using new sequencing technology. The quality of the map was assessed by validating the positions of several cloned genes including GS3 and GW5/qSW5, two major QTLs for grain length and grain width respectively, and OsC1, a qualitative trait locus for pigmentation. In all cases the loci could be precisely resolved to the bins where the genes are located, indicating high quality and accuracy of the map. The SNP map was used to perform QTL analysis for yield and three yield-component traits (number of tillers per plant, number of grains per panicle and grain weight), using data from field trials conducted over years, in comparison to QTL mapping based on RFLPs/SSRs. The SNP map detected more QTLs, especially for grain weight, with precise map locations, demonstrating advantages in detecting power and resolution relative to the RFLP/SSR map. Thus this study provided an example of ultra-high density map construction using sequencing technology. Moreover, the results obtained are helpful for understanding the genetic bases of the yield traits and for fine mapping and cloning of QTLs.

  17. Single nucleotide polymorphism (SNP) detection on a magnetoresistive sensor

    DEFF Research Database (Denmark)

    Rizzi, Giovanni; Østerberg, Frederik Westergaard; Dufva, Martin

    2013-01-01

    We present a magnetoresistive sensor platform for hybridization assays and demonstrate its applicability on single nucleotide polymorphism (SNP) genotyping. The sensor relies on anisotropic magnetoresistance in a new geometry with a local negative reference and uses the magnetic field from...... the sensor bias current to magnetize magnetic beads in the vicinity of the sensor. The method allows for real-time measurements of the specific bead binding to the sensor surface during DNA hybridization and washing. Compared to other magnetic biosensing platforms, our approach eliminates the need...... for external electromagnets and thus allows for miniaturization of the sensor platform....

  18. In Vitro vs In Silico Detected SNPs for the Development of a Genotyping Array: What Can We Learn from a Non-Model Species?

    Science.gov (United States)

    Lepoittevin, Camille; Frigerio, Jean-Marc; Garnier-Géré, Pauline; Salin, Franck; Cervera, María-Teresa; Vornam, Barbara; Harvengt, Luc; Plomion, Christophe

    2010-01-01

    Background There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (∼23.8 Gb/C). Methodology/Principal Findings A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). Conclusions/Significance This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species

  19. In vitro vs in silico detected SNPs for the development of a genotyping array: what can we learn from a non-model species?

    Directory of Open Access Journals (Sweden)

    Camille Lepoittevin

    2010-06-01

    Full Text Available There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (approximately 23.8 Gb/C). A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome.

  20. Advanced statistical tools for SNP arrays : signal calibration, copy number estimation and single array genotyping

    NARCIS (Netherlands)

    Rippe, Ralph Christian Alexander

    2012-01-01

    Fluorescence bias in signals from individual SNP arrays can be calibrated using linear models. Given the data, the system of equations is very large, so a specialized symbolic algorithm was developed. These models are also used to illustrate that genomic waves do not exist, but are merely an

  1. Identification of field caught Anopheles gambiae s.s. and Anopheles arabiensis by TaqMan single nucleotide polymorphism genotyping

    Directory of Open Access Journals (Sweden)

    Bayoh Nabie M

    2007-02-01

    Full Text Available Abstract Background Identification of Anopheles gambiae s.s. and Anopheles arabiensis from field-collected Anopheles gambiae s.l. is often necessary in basic and applied research, and in operational control programmes. The currently accepted method involves use of standard polymerase chain reaction amplification of ribosomal DNA (rDNA) from the 3' 28S to 5' intergenic spacer region of the genome, and visual confirmation of amplicons of predicted size on agarose gels after electrophoresis. This report describes development and evaluation of an automated, quantitative PCR method based upon TaqMan™ single nucleotide polymorphism (SNP) genotyping. Methods Standard PCR, and TaqMan SNP genotyping with newly designed primers and fluorophore-labeled probes hybridizing to sequences of complementary rDNA specific for either An. gambiae s.s. or An. arabiensis, were conducted in three experiments involving field-collected An. gambiae s.l. from western Kenya, and defined laboratory strains. DNA extraction was from a single leg, sonicated for five minutes in buffer in wells of 96-well PCR plates. Results TaqMan SNP genotyping showed a reaction success rate, sensitivity, and species specificity comparable to that of standard PCR. In an extensive field study, only 29 of 3,041 (0.95%) were determined to be hybrids by TaqMan (i.e., having rDNA sequences from both species); however, all but one were An. arabiensis by standard PCR, suggesting an acceptably low (ca. 1%) error rate for TaqMan genotyping in mistakenly identifying species hybrids. Conclusion TaqMan SNP genotyping proved to be a sensitive and rapid method for identification of An. gambiae s.l. and An. arabiensis, with a high success rate, specific results, and congruence with the standard PCR method.

  2. [C677T-SNP of methylenetetrahydrofolate reductase gene and breast cancer in Mexican women].

    Science.gov (United States)

    Calderón-Garcidueñas, Ana Laura; Cerda-Flores, Ricardo Martín; Castruita-Ávila, Ana Lilia; González-Guerrero, Juan Francisco; Barrera-Saldaña, Hugo Alberto

    2017-01-01

    Low-penetrance susceptibility genes such as the 5,10-methylenetetrahydrofolate reductase gene (MTHFR) have been considered in the progression of breast cancer (BC). Cancer is a result of genetic, environmental and epigenetic interactions; therefore, these genes should be studied in their environmental context, because the results can vary between populations and even within the same country. The objective was to analyze the allelic and genotypic frequencies of the MTHFR C677T SNP in Mexican Mestizo patients with BC and controls from Northeastern Mexico. 243 patients and 118 healthy women were studied. The analysis of the polymorphism was performed with a DNA microarray. Once the frequency of the polymorphism was obtained, a Hardy-Weinberg equilibrium test was carried out for the genotypes. The chi-square test was used to compare the distribution of frequencies. The allele frequency in patients was C = 0.5406, T = 0.4594, and in controls C = 0.5678, T = 0.4322. The genotype distribution in BC patients was C/C = 29.9%, C/T = 48.3% and T/T = 21.8%. The distribution in controls was C/C = 31.4%, C/T = 50.8%, T/T = 17.8% (chi-square 0.77, p = 0.6801). Northeastern Mexican women in this study showed no association between the MTHFR C677T SNP and the risk of BC. It seems that the contribution of this polymorphism to BC in Mexico varies depending on various factors, both genetic and environmental.
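
    The Hardy-Weinberg and chi-square figures in this abstract can be checked with a few lines of arithmetic. The sketch below uses only the allele frequencies and the chi-square statistic reported above. It assumes the genotype comparison was a 2 × 3 table (two groups, three genotypes) with 2 degrees of freedom, which the abstract does not state; for 2 degrees of freedom the chi-square upper-tail probability reduces to exp(-x/2). Function names are illustrative.

```python
import math


def hwe_expected(p_c: float) -> dict:
    """Expected genotype proportions under Hardy-Weinberg equilibrium
    for a biallelic locus with C-allele frequency p_c."""
    q_t = 1.0 - p_c
    return {"CC": p_c * p_c, "CT": 2.0 * p_c * q_t, "TT": q_t * q_t}


def chi2_upper_tail_df2(x: float) -> float:
    """Upper-tail probability of a chi-square statistic with 2 degrees
    of freedom (closed form: exp(-x / 2))."""
    return math.exp(-x / 2.0)


patients_expected = hwe_expected(0.5406)   # reported C frequency, patients
controls_expected = hwe_expected(0.5678)   # reported C frequency, controls
p_value = chi2_upper_tail_df2(0.77)        # reported chi-square statistic

for label, expected in (("patients", patients_expected),
                        ("controls", controls_expected)):
    summary = ", ".join(f"{g} = {100 * v:.1f}%" for g, v in expected.items())
    print(f"{label}: {summary}")
print(f"p = {p_value:.4f}")
```

    Under these assumptions the expected proportions lie within about 1.5 percentage points of the observed genotype percentages in both groups, and the p-value from the rounded statistic agrees with the reported 0.6801 to three decimal places.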

  3. Genotyping three SNPs affecting warfarin drug response by isothermal real-time HDA assays.

    Science.gov (United States)

    Li, Ying; Jortani, Saeed A; Ramey-Hartung, Bronwyn; Hudson, Elizabeth; Lemieux, Bertrand; Kong, Huimin

    2011-01-14

    The response to the anticoagulant drug warfarin is greatly affected by genetic polymorphisms in the VKORC1 and CYP2C9 genes. Genotyping these polymorphisms has been shown to be important in shortening the trial-and-error process of finding the maintenance dose of warfarin, thus reducing the risk of adverse effects of the drug. We developed a real-time isothermal DNA amplification system for genotyping three single nucleotide polymorphisms (SNPs) that influence warfarin response. For each SNP, real-time isothermal Helicase Dependent Amplification (HDA) reactions were performed to amplify a DNA fragment containing the SNP. Amplicons were detected by fluorescently labeled allele specific probes during real-time HDA amplification. Fifty clinical samples were analyzed by the HDA-based method, generating a total of 150 results. Of these, 148 were consistent between the HDA-based assays and a reference method. The two samples with unresolved HDA-based test results were repeated and found to be consistent with the reference method. The HDA-based assays demonstrated a clinically acceptable performance for genotyping the VKORC1 -1639G>A SNP and two SNPs (430C>T and 1075A>C) for the CYP2C9 enzyme (CYP2C9*2 and CYP2C9*3), all of which are relevant in warfarin pharmacogenetics. Copyright © 2010 Elsevier B.V. All rights reserved.

  4. Genotyping of Single Nucleotide Polymorphisms in DNA Isolated from Serum Using Sequenom MassARRAY Technology.

    Directory of Open Access Journals (Sweden)

    Tess V Clendenen

    Full Text Available Large epidemiologic studies have the potential to make valuable contributions to the assessment of gene-environment interactions because they prospectively collected detailed exposure data. Some of these studies, however, have only serum or plasma samples as a low quantity source of DNA. We examined whether DNA isolated from serum can be used to reliably and accurately genotype single nucleotide polymorphisms (SNPs) using Sequenom multiplex SNP genotyping technology. We genotyped 81 SNPs using samples from 158 participants in the NYU Women's Health Study. Each participant had DNA from serum and at least one paired DNA sample isolated from a high quality source of DNA, i.e. clots and/or cell precipitates, for comparison. We observed that 60 of the 81 SNPs (74%) had high call frequencies (≥95%) using DNA from serum, only slightly lower than the 85% of SNPs with high call frequencies in DNA from clots or cell precipitates. Of the 57 SNPs with high call frequencies for serum, clot, and cell precipitate DNA, 54 (95%) had highly concordant (>98%) genotype calls across all three sample types. High purity was not a critical factor for successful genotyping. Our results suggest that this multiplex SNP genotyping method can be used reliably on DNA from serum in large-scale epidemiologic studies.
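
    Call frequency and concordance, the two metrics this abstract relies on, can be sketched as follows. The genotype lists are toy data invented for illustration, and the function names are hypothetical; only the summary counts (60 of 81 and 54 of 57) come from the abstract. The sketch assumes a missing call is represented as `None` and that concordance is computed over sample pairs in which both sources produced a call.

```python
def call_frequency(calls):
    """Fraction of samples with a non-missing genotype call."""
    return sum(c is not None for c in calls) / len(calls)


def concordance(calls_a, calls_b):
    """Fraction of matching genotypes among pairs where both were called."""
    pairs = [(a, b) for a, b in zip(calls_a, calls_b)
             if a is not None and b is not None]
    if not pairs:
        return float("nan")
    return sum(a == b for a, b in pairs) / len(pairs)


# Toy calls for one SNP in five participants (illustrative only).
serum_calls = ["AA", "AG", None, "GG", "AG"]
clot_calls = ["AA", "AG", "AG", "GG", "AA"]

serum_call_freq = call_frequency(serum_calls)             # 4 of 5 called
serum_vs_clot = concordance(serum_calls, clot_calls)      # 3 of 4 pairs agree

# Summary percentages reported in the abstract.
high_call_share = 100 * 60 / 81    # SNPs with call frequency >= 95% in serum
concordant_share = 100 * 54 / 57   # SNPs with > 98% concordant calls

print(serum_call_freq, serum_vs_clot)
print(f"{high_call_share:.0f}% {concordant_share:.0f}%")
```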

  5. TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing.

    Science.gov (United States)

    Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David

    2018-04-11

    Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. 
TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions.

  6. Inter-laboratory evaluation of the EUROFORGEN Global ancestry-informative SNP panel by massively parallel sequencing using the Ion PGM™

    DEFF Research Database (Denmark)

    Eduardoff, M; Gross, T E; Santos, C

    2016-01-01

    Seq™ PCR primers was designed for the Global AIM-SNPs to perform massively parallel sequencing using the Ion PGM™ system. This study assessed individual SNP genotyping precision using the Ion PGM™, the forensic sensitivity of the multiplex using dilution series, degraded DNA plus simple mixtures...

  7. Environmental Response and Genomic Regions Correlated with Rice Root Growth and Yield under Drought in the OryzaSNP Panel across Multiple Study Systems.

    Directory of Open Access Journals (Sweden)

    Len J Wade

    Full Text Available The rapid progress in rice genotyping must be matched by advances in phenotyping. A better understanding of genetic variation in rice for drought response, root traits, and practical methods for studying them are needed. In this study, the OryzaSNP set (20 diverse genotypes that have been genotyped for SNP markers) was phenotyped in a range of field and container studies to study the diversity of rice root growth and response to drought. Of the root traits measured across more than 20 root experiments, root dry weight showed the most stable genotypic performance across studies. The environment (E) component had the strongest effect on yield and root traits. We identified genomic regions correlated with root dry weight, percent deep roots, maximum root depth, and grain yield based on a correlation analysis with the phenotypes and aus, indica, or japonica introgression regions using the SNP data. Two genomic regions were identified as hot spots in which root traits and grain yield were co-located: on chromosome 1 (39.7-40.7 Mb) and on chromosome 8 (20.3-21.9 Mb). Across experiments, the soil type/growth medium showed more correlations with plant growth than the container dimensions. Although the correlations among studies and genetic co-location of root traits from a range of study systems point to their potential utility to represent responses in field studies, the best correlations were observed when the two setups had some similar properties. Due to the co-location of the identified genomic regions (from introgression block analysis) with QTL for a number of previously reported root and drought traits, these regions are good candidates for detailed characterization to contribute to understanding rice improvement for response to drought. This study also highlights the utility of characterizing a small set of 20 genotypes for root growth, drought response, and related genomic regions.

  8. Identification of a sex-linked SNP marker in the salmon louse (Lepeophtheirus salmonis) using RAD sequencing.

    Directory of Open Access Journals (Sweden)

    Stephen N Carmichael

    Full Text Available The salmon louse (Lepeophtheirus salmonis (Krøyer, 1837)) is a parasitic copepod that can, if untreated, cause considerable damage to Atlantic salmon (Salmo salar Linnaeus, 1758) and incurs significant costs to the Atlantic salmon mariculture industry. Salmon lice are gonochoristic and normally show sex ratios close to 1:1. While this observation suggests that sex determination in salmon lice is genetic, with only minor environmental influences, the mechanism of sex determination in the salmon louse is unknown. This paper describes the identification of a sex-linked Single Nucleotide Polymorphism (SNP) marker, providing the first evidence for a genetic mechanism of sex determination in the salmon louse. Restriction site-associated DNA sequencing (RAD-seq) was used to isolate SNP markers in a laboratory-maintained salmon louse strain. A total of 85 million raw Illumina 100 base paired-end reads produced 281,838 unique RAD-tags across 24 unrelated individuals. RAD marker Lsa101901 showed complete association with phenotypic sex for all individuals analysed, being heterozygous in females and homozygous in males. Using an allele-specific PCR assay for genotyping, this SNP association pattern was further confirmed for three unrelated salmon louse strains, displaying complete association with phenotypic sex in a total of 96 genotyped individuals. The marker Lsa101901 was located in the coding region of the prohibitin-2 gene, which showed a sex-dependent differential expression, with mRNA levels determined by RT-qPCR about 1.8-fold higher in adult female than adult male salmon lice. This study's observations of a novel sex-linked SNP marker are consistent with sex determination in the salmon louse being genetic and following a female heterozygous system. Marker Lsa101901 provides a tool to determine the genetic sex of salmon lice, and could be useful in the development of control strategies.

  9. Exhaustive Genome-Wide Search for SNP-SNP Interactions Across 10 Human Diseases

    Directory of Open Access Journals (Sweden)

    William Murk

    2016-07-01

    Full Text Available The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA) study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10⁻¹²). Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized.
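
    The scale of an exhaustive pairwise search can be checked with a little arithmetic. The sketch below counts the SNP pairs for roughly 300,000 SNPs and divides a family-wise error rate by that count. The 0.05 error rate and the Bonferroni correction are assumptions made here for illustration; the abstract does not state how its genome-wide threshold was derived, and the function names are hypothetical.

```python
from math import comb


def pairwise_tests(n_snps: int) -> int:
    """Number of unordered SNP pairs in an exhaustive pairwise scan."""
    return comb(n_snps, 2)


def bonferroni_threshold(n_snps: int, alpha: float = 0.05) -> float:
    """Per-test threshold under a Bonferroni correction (an assumption here,
    not a method stated in the abstract)."""
    return alpha / pairwise_tests(n_snps)


if __name__ == "__main__":
    n = 300_000  # approximate SNP count quoted in the abstract
    print(pairwise_tests(n))                 # 44999850000
    print(f"{bonferroni_threshold(n):.2e}")  # 1.11e-12
```

    Under these assumptions the per-test threshold comes out near 10⁻¹², the same order of magnitude as the significance level quoted in the abstract.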

  10. Resolving incomplete single nucleotide polymorphism tagging of HLA-DQ2.2 for coeliac disease genotyping using digital droplet PCR.

    Science.gov (United States)

    Hardy, M Y; Ontiveros, N; Varney, M D; Tye-Din, J A

    2018-04-01

    A hallmark of coeliac disease (CD) is the exceptionally strong genetic association with HLA-DQ2.5, DQ8, and DQ2.2. HLA typing provides information on CD risk important to both clinicians and researchers. A method that enables simple and fast detection of all CD risk genotypes is particularly desirable for the study of large populations. Single nucleotide polymorphism (SNP)-based HLA typing can detect the CD risk genotypes by detecting a combination of six SNPs but this approach can struggle to resolve HLA-DQ2.2, seen in 4% of European CD patients, because of the low resolution of one negatively predicting SNP. We sought to optimise SNP-based HLA typing by harnessing the additional resolution of digital droplet PCR to resolve HLA-DQ2.2. Here we test this two-step approach in an unselected sample of Mexican DNA and compare its accuracy to DNA typed using traditional exon detection. The addition of digital droplet PCR for samples requiring negative prediction of HLA-DQ2.2 enabled HLA-DQ2.2 to be accurately typed. This technique is a simple addition to a SNP-based typing strategy and enables comprehensive definition of all at-risk HLA genotypes in CD in a timely and cost-effective manner. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  11. Population structure of Atlantic Mackerel inferred from RAD-seq derived SNP markers: effects of sequence clustering parameters and hierarchical SNP selection

    KAUST Repository

    Rodríguez-Ezpeleta, Naiara

    2016-03-03

    Restriction-site associated DNA sequencing (RAD-seq) and related methods are revolutionizing the field of population genomics in non-model organisms as they allow generating an unprecedented number of single nucleotide polymorphisms (SNPs) even when no genomic information is available. Yet, RAD-seq data analyses rely on assumptions on nature and number of nucleotide variants present in a single locus, the choice of which may lead to an under- or overestimated number of SNPs and/or to incorrectly called genotypes. Using the Atlantic mackerel (Scomber scombrus L.) and a close relative, the Atlantic chub mackerel (Scomber colias), as case study, here we explore the sensitivity of population structure inferences to two crucial aspects in RAD-seq data analysis: the maximum number of mismatches allowed to merge reads into a locus and the relatedness of the individuals used for genotype calling and SNP selection. Our study resolves the population structure of the Atlantic mackerel, but, most importantly, provides insights into the effects of alternative RAD-seq data analysis strategies on population structure inferences that are directly applicable to other species.

  12. HLA-C -35kb expression SNP is associated with differential control of β-HPV infection in squamous cell carcinoma cases and controls.

    Directory of Open Access Journals (Sweden)

    Karin A Vineretsky

    Full Text Available A single nucleotide polymorphism (SNP) 35 kb upstream of the HLA-C gene is associated with HLA-C expression, and the high expressing genotype (CC) has been associated with HIV-I control. HLA-C is unique among the classical MHC class I molecules for its role in the control of viral infections and recognition of abnormal or missing self. This immunosurveillance is central to the pathogenesis of non-melanoma skin cancer (NMSC), and of squamous cell carcinoma (SCC) in particular. While sun exposure is a major risk factor for these cancers, cutaneous infections with genus β-HPV have been implicated in the development of SCC. We hypothesized that the high expression HLA-C genotype is associated with β-HPV infections. Therefore, we investigated the association between β-HPV serology and the -35 kb SNP (rs9264942) in a population-based case-control study of 510 SCC cases and 608 controls. Among controls, the high expression -35 kb SNP genotype (CC) reduced the likelihood of positive serology for multiple (≥2) β-HPV infections (OR = 0.49, 95% CI: 0.25-0.97), and β-HPV species 2 infection (OR = 0.43, 95% CI: 0.23-0.79). However, no association with β-HPV status was observed among SCC cases. Our findings suggest that underlying immunogenotype plays an important role in differential control of β-HPV in SCC cases and controls.

  13. CYP2D7 sequence variation interferes with TaqMan CYP2D6*15 and *35 genotyping

    Directory of Open Access Journals (Sweden)

    Amanda K Riffel

    2016-01-01

    Full Text Available TaqMan™ genotyping assays are widely used to genotype CYP2D6, which encodes a major drug metabolizing enzyme. Assay design for CYP2D6 can be challenging owing to the presence of two pseudogenes, CYP2D7 and CYP2D8, structural and copy number variation, and numerous single nucleotide polymorphisms (SNPs), some of which reflect the wild-type sequence of the CYP2D7 pseudogene. The aim of this study was to identify the mechanism causing false positive CYP2D6*15 calls and remediate those by redesigning and validating alternative TaqMan genotype assays. Among 13,866 DNA samples genotyped by the CompanionDx® lab on the OpenArray platform, 70 samples were identified as heterozygotes for 137Tins, the key SNP of CYP2D6*15. However, only 15 samples were confirmed when tested with the Luminex xTAG CYP2D6 Kit and sequencing of CYP2D6-specific long range (XL)-PCR products. Genotype and gene resequencing of CYP2D6 and CYP2D7-specific XL-PCR products revealed a CC>GT dinucleotide SNP in exon 1 of CYP2D7 that reverts the sequence to CYP2D6 and allows a TaqMan assay PCR primer to bind. Because CYP2D7 also carries a Tins, a false-positive mutation signal is generated. This CYP2D7 SNP was also responsible for generating false-positive signals for rs769258 (CYP2D6*35), which is also located in exon 1. Although alternative CYP2D6*15 and *35 assays resolved the issue, we discovered a novel CYP2D6*15 subvariant in one sample that carries additional SNPs preventing detection with the alternate assay. The frequency of CYP2D6*15 was 0.1% in this ethnically diverse U.S. population sample. In addition, we also discovered linkage between the CYP2D7 CC>GT dinucleotide SNP and the 77G>A (rs28371696) SNP of CYP2D6*43. The frequency of this tentatively functional allele was 0.2%. Taken together, these findings emphasize that regardless of how carefully genotyping assays are designed and evaluated before being commercially marketed, rare or unknown SNPs underneath primer and/or probe

  14. Development of a single nucleotide polymorphism barcode to genotype Plasmodium vivax infections.

    Directory of Open Access Journals (Sweden)

    Mary Lynn Baniecki

    2015-03-01

    Full Text Available Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25-40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections.

  15. Development of a Single Nucleotide Polymorphism Barcode to Genotype Plasmodium vivax Infections

    Science.gov (United States)

    Baniecki, Mary Lynn; Faust, Aubrey L.; Schaffner, Stephen F.; Park, Daniel J.; Galinsky, Kevin; Daniels, Rachel F.; Hamilton, Elizabeth; Ferreira, Marcelo U.; Karunaweera, Nadira D.; Serre, David; Zimmerman, Peter A.; Sá, Juliana M.; Wellems, Thomas E.; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E.; Volkman, Sarah K.; Wirth, Dyann F.; Sabeti, Pardis C.

    2015-01-01

    Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25–40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections. PMID:25781890
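
    The informativeness criterion mentioned in this abstract (average minor allele frequency > 0.1) can be illustrated with a small sketch. It assumes one haploid base call per SNP per sample and uses made-up 5-SNP barcodes rather than the 42-SNP panel; the sample names, data, and function names are hypothetical and are not taken from the study.

```python
from collections import Counter


def minor_allele_frequency(calls):
    """Frequency of the second most common allele among valid base calls.

    Calls other than A, C, G, T (for example 'N' for a failed assay) are ignored.
    """
    counts = Counter(c for c in calls if c in "ACGT")
    total = sum(counts.values())
    if total == 0 or len(counts) < 2:
        return 0.0  # monomorphic or uncalled site
    return sorted(counts.values(), reverse=True)[1] / total


def barcode_mafs(barcodes):
    """Per-position MAF across samples; barcodes maps sample name -> call string."""
    columns = zip(*barcodes.values())  # transpose samples x SNPs
    return [minor_allele_frequency(col) for col in columns]


if __name__ == "__main__":
    # Hypothetical toy barcodes, one character per SNP position.
    toy = {"S1": "ACGTA", "S2": "ACGTT", "S3": "GCGTA", "S4": "GCATA"}
    mafs = barcode_mafs(toy)
    print(mafs)                    # [0.5, 0.0, 0.25, 0.0, 0.25]
    print(sum(mafs) / len(mafs))   # 0.2
    print(len(set(toy.values())))  # 4 distinct barcodes
```

    In this toy panel two of the five positions are monomorphic and contribute nothing to discrimination, which is why barcode design favours SNPs that are polymorphic across the populations of interest.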

  16. SAQC: SNP Array Quality Control

    Directory of Open Access Journals (Sweden)

    Li Ling-Hui

    2011-04-01

    Full Text Available Background: Genome-wide single-nucleotide polymorphism (SNP) arrays containing hundreds of thousands of SNPs from the human genome have proven useful for studying important human genome questions. Data quality of SNP arrays plays a key role in the accuracy and precision of downstream data analyses. However, good indices for assessing data quality of SNP arrays have not yet been developed. Results: We developed new quality indices to measure the quality of SNP arrays and/or DNA samples and investigated their statistical properties. The indices quantify a departure of estimated individual-level allele frequencies (AFs) from expected frequencies via standardized distances. The proposed quality indices followed lognormal distributions in several large genomic studies that we empirically evaluated. AF reference data and quality index reference data for different SNP array platforms were established based on samples from various reference populations. Furthermore, a confidence interval method based on the underlying empirical distributions of quality indices was developed to identify poor-quality SNP arrays and/or DNA samples. Analyses of authentic biological data and simulated data show that this new method is sensitive and specific for the detection of poor-quality SNP arrays and/or DNA samples. Conclusions: This study introduces new quality indices, establishes references for AFs and quality indices, and develops a detection method for poor-quality SNP arrays and/or DNA samples. We have developed a new computer program that utilizes these methods called SNP Array Quality Control (SAQC). SAQC software is written in R and R-GUI and was developed as a user-friendly tool for the visualization and evaluation of data quality of genome-wide SNP arrays. The program is available online (http://www.stat.sinica.edu.tw/hsinchou/genetics/quality/SAQC.htm).

  17. High Density Linkage Map Construction and Mapping of Yield Trait QTLs in Maize (Zea mays) Using the Genotyping-by-Sequencing (GBS) Technology

    Science.gov (United States)

    Su, Chengfu; Wang, Wei; Gong, Shunliang; Zuo, Jinghui; Li, Shujiang; Xu, Shizhong

    2017-01-01

    Increasing grain yield is the ultimate goal for maize breeding. High resolution quantitative trait loci (QTL) mapping can help us understand the molecular basis of phenotypic variation of yield and thus facilitate marker assisted breeding. The aim of this study is to use genotyping-by-sequencing (GBS) for large-scale SNP discovery and simultaneous genotyping of all F2 individuals from a cross between two varieties of maize that are in clear contrast in yield and related traits. A set of 199 F2 progeny derived from the cross of varieties SG-5 and SG-7 were generated and genotyped by GBS. A total of 1,046,524,604 reads with an average of 5,258,918 reads per F2 individual were generated. This number of reads represents an approximately 0.36-fold coverage of the maize reference genome Zea_mays.AGPv3.29 for each F2 individual. A total of 68,882 raw SNPs were discovered in the F2 population, which, after stringent filtering, led to a total of 29,927 high quality SNPs. Comparative analysis using these physically mapped marker loci revealed a higher degree of synteny with the reference genome. The SNP genotype data were utilized to construct an intra-specific genetic linkage map of maize consisting of 3,305 bins on 10 linkage groups spanning 2,236.66 cM at an average distance of 0.68 cM between consecutive markers. From this map, we identified 28 QTLs associated with yield traits (100-kernel weight, ear length, ear diameter, cob diameter, kernel row number, corn grains per row, ear weight, and grain weight per plant) using the composite interval mapping (CIM) method and 29 QTLs using the least absolute shrinkage selection operator (LASSO) method. QTLs identified by the CIM method account for 6.4% to 19.7% of the phenotypic variation. 
Small intervals of three QTLs (qCGR-1, qKW-2, and qGWP-4) contain several genes, including one gene (GRMZM2G139872) encoding the F-box protein, three genes (GRMZM2G180811, GRMZM5G828139, and GRMZM5G873194) encoding the WD40-repeat protein, and

  18. Validated context-dependent associations of coronary heart disease risk with genotype variation in the chromosome 9p21 region

    DEFF Research Database (Denmark)

    Lusk, Christine M; Dyson, Greg; Clark, Andrew G

    2014-01-01

    identified by the CARDIoGRAMplusC4D Consortium study, of which ARIC was a part. We then tested each marker SNP genotype effect on prediction of CHD within sub-groups of the ARIC sample defined by traditional CHD risk factors by applying a novel multi-model strategy, PRIM. We observed that the effects of SNP...

  19. Effects of genotype and population density on growth performance, carcass characteristics, and cost-benefits of broiler chickens in north central Nigeria.

    Science.gov (United States)

    Yakubu, Abdulmojeed; Ayoade, John A; Dahiru, Yakubu M

    2010-04-01

    The influence of genotype and stocking densities on growth performance, carcass qualities, and cost-benefits of broilers were examined in a 28-day trial. Two hundred and seven 4-week-old birds each of Anak Titan and Arbor Acre hybrid broiler types were randomly assigned to three stocking density treatments of 8.3, 11.1, and 14.3 birds/m² in a 2 × 3 factorial arrangement. Final body weight, average weekly body weight and average weekly feed intake were affected (P Arbor Acres. However, average weekly body weight gain and feed conversion ratio were similar (P > 0.05) in both genetic groups. The effect of placement density on some growth parameters did not follow a linear trend. Arbor Acres had significantly (P 0.05) influenced by genotype and housing density. The economic analysis revealed that higher gross margin was recorded for Arbor Acres compared to Anak Titans (€2.76 versus €2.19; P 0.05) on profit margin. Genotype × stocking density interaction effect was significant for some of the carcass indices investigated. It is concluded that under sub-humid conditions of a tropical environment, the use of Arbor Acre genetic type as well as a placement density of 14.3 birds/m² appeared to be more profitable.

  20. New algorithm improves fine structure of the barley consensus SNP map

    Directory of Open Access Journals (Sweden)

    Endelman Jeffrey B

    2011-08-01

    Full Text Available Background: The need to integrate information from multiple linkage maps is a long-standing problem in genetics. One way to visualize the complex ordinal relationships is with a directed graph, where each vertex in the graph is a bin of markers. When there are no ordering conflicts between the linkage maps, the result is a directed acyclic graph, or DAG, which can then be linearized to produce a consensus map. Results: New algorithms for the simplification and linearization of consensus graphs have been implemented as a package for the R computing environment called DAGGER. The simplified consensus graphs produced by DAGGER exactly capture the ordinal relationships present in a series of linkage maps. Using either linear or quadratic programming, DAGGER generates a consensus map with minimum error relative to the linkage maps while remaining ordinally consistent with them. Both linearization methods produce consensus maps that are compressed relative to the mean of the linkage maps. After rescaling, however, the consensus maps had higher accuracy (and higher marker density) than the individual linkage maps in genetic simulations. When applied to four barley linkage maps genotyped at nearly 3000 SNP markers, DAGGER produced a consensus map with improved fine structure compared to the existing barley consensus SNP map. The root-mean-squared error between the linkage maps and the DAGGER map was 0.82 cM per marker interval compared to 2.28 cM for the existing consensus map. Examination of the barley hardness locus at the 5HS telomere, for which there is a physical map, confirmed that the DAGGER output was more accurate for fine structure analysis. Conclusions: The R package DAGGER is an effective, freely available resource for integrating the information from a set of consistent linkage maps.
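
    The directed-graph idea in this abstract can be sketched in a few lines. The example below merges the marker-bin orders of several linkage maps into one graph and derives an order consistent with all of them using a plain topological sort from the Python standard library. This only illustrates the ordinal step: it is not DAGGER's linear or quadratic programming linearization, it assigns no cM positions, and the bin names and the `consensus_order` function are hypothetical.

```python
from graphlib import CycleError, TopologicalSorter  # Python 3.9+


def consensus_order(maps):
    """Return one bin order consistent with every input map.

    maps: list of lists, each giving marker bins in map order.
    Raises graphlib.CycleError when two maps disagree on an ordering,
    i.e. when the merged graph is not a DAG.
    """
    predecessors = {}
    for order in maps:
        for bin_name in order:
            predecessors.setdefault(bin_name, set())
        # Each adjacent pair in a map contributes a directed edge before -> after.
        for before, after in zip(order, order[1:]):
            predecessors[after].add(before)
    return list(TopologicalSorter(predecessors).static_order())


if __name__ == "__main__":
    # Hypothetical bins from three maps that share some but not all markers.
    maps = [["m1", "m2", "m4"], ["m1", "m3", "m4"], ["m2", "m3"]]
    print(consensus_order(maps))  # ['m1', 'm2', 'm3', 'm4']

    try:
        consensus_order([["a", "b"], ["b", "a"]])
    except CycleError:
        print("ordering conflict between maps")
```

    A topological sort returns only one of possibly many valid orders when the maps leave some pairs unresolved, which is one reason a method that also estimates positions is needed for a usable consensus map.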

  1. Association between MDM2 SNP309 T>G polymorphism and the risk of bladder cancer: new data in a Chinese population and an updated meta-analysis

    Directory of Open Access Journals (Sweden)

    Xie LG

    2015-12-01

    Full Text Available Linguo Xie, Yan Sun, Tao Chen, Dawei Tian, Yujuan Li, Yu Zhang, Na Ding, Zhonghua Shen, Hao Xu, Xuewu Nian, Nan Sha, Ruifa Han, Hailong Hu, Changli Wu. Objective: Human murine double minute 2 protein (MDM2) is mainly a negative regulator of the p53 tumor suppressor pathway. We aimed to investigate the association between MDM2 SNP309 polymorphism and bladder cancer risk. Methods: A total of 535 bladder cancer patients and 649 healthy controls were recruited for our study. MDM2 SNP309 T>G polymorphism was genotyped by polymerase chain reaction-ligase detection reaction method. Logistic regression was used to analyze the relationship between the genotype and susceptibility of bladder cancer. Kaplan–Meier estimates and log-rank test were obtained to analyze the association between the genotype and risk of recurrence in nonmuscle-invasive bladder cancer patients. A multivariable Cox proportional hazards model was fitted to identify independent prognostic factors. To further investigate the association, we conducted a meta-analysis including six studies. Results: The frequency of the MDM2 SNP309 T>G polymorphism showed no significant difference between cases and controls (all P>0.05). In the stratification analysis, the results showed that G allele carriers were prone to have a significant decrease in risk of low-grade bladder cancer (adjusted odds ratio: 0.613, 95% confidence interval: 0.427–0.881), and G variant was associated with a significantly reduced risk of recurrence in nonmuscle-invasive bladder cancer patients with or without chemotherapy (P<0.05). The results of the meta-analysis showed that G allele and GG genotype of MDM2 SNP309 polymorphism were significantly associated with increased risk of bladder cancer in Caucasians (both P<0.05), and no association was observed in total populations and Asians (P>0.05). Conclusion: MDM2 SNP309 T>G polymorphism has no influence on bladder cancer risk in Asians, but

  2. Genome-wide linkage mapping of yield-related traits in three Chinese bread wheat populations using high-density SNP markers.

    Science.gov (United States)

    Li, Faji; Wen, Weie; He, Zhonghu; Liu, Jindong; Jin, Hui; Cao, Shuanghe; Geng, Hongwei; Yan, Jun; Zhang, Pingzhi; Wan, Yingxiu; Xia, Xianchun

    2018-06-01

    We identified 21 new and stable QTL, and 11 QTL clusters for yield-related traits in three bread wheat populations using the wheat 90 K SNP assay. Identification of quantitative trait loci (QTL) for yield-related traits and closely linked molecular markers is important in order to identify gene/QTL for marker-assisted selection (MAS) in wheat breeding. The objectives of the present study were to identify QTL for yield-related traits and dissect the relationships among different traits in three wheat recombinant inbred line (RIL) populations derived from crosses Doumai × Shi 4185 (D × S), Gaocheng 8901 × Zhoumai 16 (G × Z) and Linmai 2 × Zhong 892 (L × Z). Using the available high-density linkage maps previously constructed with the wheat 90 K iSelect single nucleotide polymorphism (SNP) array, 65, 46 and 53 QTL for 12 traits were identified in the three RIL populations, respectively. Among them, 34, 23 and 27 were likely to be new QTL. Eighteen common QTL were detected across two or three populations. Eleven QTL clusters harboring multiple QTL were detected in different populations, and the interval 15.5-32.3 cM around the Rht-B1 locus on chromosome 4BS harboring 20 QTL is an important region determining grain yield (GY). Thousand-kernel weight (TKW) is significantly affected by kernel width and plant height (PH), whereas flag leaf width can be used to select lines with large kernel number per spike. Eleven candidate genes were identified, including eight cloned genes for kernel, heading date (HD) and PH-related traits as well as predicted genes for TKW, spike length and HD. The closest SNP markers of stable QTL or QTL clusters can be used for MAS in wheat breeding using kompetitive allele-specific PCR or semi-thermal asymmetric reverse PCR assays for improvement of GY.

  3. Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality

    Directory of Open Access Journals (Sweden)

    Shahin S Ali

    2015-08-01

    Full Text Available Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed SNP markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNASeq-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri.

  4. Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality.

    Science.gov (United States)

    Ali, Shahin S; Shao, Jonathan; Strem, Mary D; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W; Bailey, Bryan A

    2015-01-01

    Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri.

  5. Prevalence of single nucleotide polymorphism among 27 diverse alfalfa genotypes as assessed by transcriptome sequencing

    Directory of Open Access Journals (Sweden)

    Li Xuehui

    2012-10-01

    Full Text Available Abstract. Background: Alfalfa, a perennial, outcrossing species, is a widely planted forage legume producing highly nutritious biomass. Currently, improvement of cultivated alfalfa mainly relies on recurrent phenotypic selection. Marker assisted breeding strategies can enhance alfalfa improvement efforts, particularly if many genome-wide markers are available. Transcriptome sequencing enables efficient high-throughput discovery of single nucleotide polymorphism (SNP) markers for a complex polyploid species. Result: The transcriptomes of 27 alfalfa genotypes, including elite breeding genotypes, parents of mapping populations, and unimproved wild genotypes, were sequenced using an Illumina Genome Analyzer IIx. De novo assembly of quality-filtered 72-bp reads generated 25,183 contigs with a total length of 26.8 Mbp and an average length of 1,065 bp, with an average read depth of 55.9-fold for each genotype. Overall, 21,954 (87.2%) of the 25,183 contigs represented 14,878 unique protein accessions. Gene ontology (GO) analysis suggested that a broad diversity of genes was represented in the resulting sequences. The realignment of individual reads to the contigs enabled the detection of 872,384 SNPs and 31,760 InDels. High resolution melting (HRM) analysis was used to validate 91% of 192 putative SNPs identified by sequencing. Both allelic variants at about 95% of SNP sites identified among five wild, unimproved genotypes are still present in cultivated alfalfa, and all four US breeding programs also contain a high proportion of these SNPs. Thus, little evidence exists among this dataset for loss of significant DNA sequence diversity from either domestication or breeding of alfalfa. Structure analysis indicated that individuals from the subspecies falcata, the diploid subspecies caerulea, and the tetraploid subspecies sativa (cultivated tetraploid alfalfa) were clearly separated. Conclusion: We used transcriptome sequencing to discover large numbers of SNPs

  6. Polymorphism rs2073618 of the TNFRSF11B (OPG) Gene and Bone Mineral Density in Mexican Women with Rheumatoid Arthritis

    Directory of Open Access Journals (Sweden)

    C. A. Nava-Valdivia

    2017-01-01

    Full Text Available Osteoporosis (OP) is highly prevalent in rheumatoid arthritis (RA) and is influenced by genetic factors. Single-nucleotide polymorphism (SNP) rs2073618 in the TNFRSF11B osteoprotegerin (OPG) gene has been related to postmenopausal OP although, to date, no information has been described concerning whether this polymorphism is implicated in abnormalities of bone mineral density (BMD) in RA. We evaluated, in a case-control study performed in Mexican-Mestizo women with RA, whether SNP rs2073618 in the TNFRSF11B gene is associated with a decrease in BMD. RA patients were classified as follows: (1) low BMD and (2) normal BMD. All patients were genotyped for the rs2073618 polymorphism by PCR-RFLP. The frequency of low BMD was 74.4%. Higher age was observed in RA with low BMD versus normal BMD (62 and 54 years, resp.; p<0.001). Worse functioning and lower BMI were observed in RA with low BMD (p=0.003 and p=0.002, resp.). We found similar genotype frequencies in RA with low BMD versus RA with normal BMD (GG genotype 71% versus 64.4%, GC 26% versus 33%, and CC 3% versus 2.2%, resp.; p=0.6). We concluded that in Mexican-Mestizo female patients with RA, the rs2073618 polymorphism of the TNFRSF11B gene is not associated with low BMD.

  7. The metabochip, a custom genotyping array for genetic studies of metabolic, cardiovascular, and anthropometric traits.

    Directory of Open Access Journals (Sweden)

    Benjamin F Voight

    Full Text Available Genome-wide association studies have identified hundreds of loci for type 2 diabetes, coronary artery disease and myocardial infarction, as well as for related traits such as body mass index, glucose and insulin levels, lipid levels, and blood pressure. These studies also have pointed to thousands of loci with promising but not yet compelling association evidence. To establish association at additional loci and to characterize the genome-wide significant loci by fine-mapping, we designed the "Metabochip," a custom genotyping array that assays nearly 200,000 SNP markers. Here, we describe the Metabochip and its component SNP sets, evaluate its performance in capturing variation across the allele-frequency spectrum, describe solutions to methodological challenges commonly encountered in its analysis, and evaluate its performance as a platform for genotype imputation. The metabochip achieves dramatic cost efficiencies compared to designing single-trait follow-up reagents, and provides the opportunity to compare results across a range of related traits. The metabochip and similar custom genotyping arrays offer a powerful and cost-effective approach to follow-up large-scale genotyping and sequencing studies and advance our understanding of the genetic basis of complex human diseases and traits.

  8. Performance of commercial platforms for rapid genotyping of polymorphisms affecting warfarin dose.

    Science.gov (United States)

    King, Cristi R; Porche-Sorbet, Rhonda M; Gage, Brian F; Ridker, Paul M; Renaud, Yannick; Phillips, Michael S; Eby, Charles

    2008-06-01

    Initiation of warfarin therapy is associated with bleeding owing to its narrow therapeutic window and unpredictable therapeutic dose. Pharmacogenetic-based dosing algorithms can improve accuracy of initial warfarin dosing but require rapid genotyping for cytochrome P-450 2C9 (CYP2C9) *2 and *3 single nucleotide polymorphisms (SNPs) and a vitamin K epoxide reductase (VKORC1) SNP. We evaluated 4 commercial systems: INFINITI analyzer (AutoGenomics, Carlsbad, CA), Invader assay (Third Wave Technologies, Madison, WI), Tag-It Mutation Detection assay (Luminex Molecular Diagnostics, formerly Tm Bioscience, Toronto, Canada), and Pyrosequencing (Biotage, Uppsala, Sweden). We genotyped 112 DNA samples and resolved any discrepancies with bidirectional sequencing. The INFINITI analyzer was 100% accurate for all SNPs and required 8 hours. Invader and Tag-It were 100% accurate for CYP2C9 SNPs, 99% accurate for VKORC1 -1639/3673 SNP, and required 3 hours and 8 hours, respectively. Pyrosequencing was 99% accurate for CYP2C9 *2, 100% accurate for CYP2C9 *3, and 100% accurate for VKORC1 and required 4 hours. Current commercial platforms provide accurate and rapid genotypes for pharmacogenetic dosing during initiation of warfarin therapy.

  9. High-resolution melting genotyping of Enterococcus faecium based on multilocus sequence typing derived single nucleotide polymorphisms.

    Directory of Open Access Journals (Sweden)

    Steven Y C Tong

    Full Text Available We have developed a single nucleotide polymorphism (SNP) nucleated high-resolution melting (HRM) technique to genotype Enterococcus faecium. Eight SNPs were derived from the E. faecium multilocus sequence typing (MLST) database and amplified fragments containing these SNPs were interrogated by HRM. We tested the HRM genotyping scheme on 85 E. faecium bloodstream isolates and compared the results with MLST, pulsed-field gel electrophoresis (PFGE) and an allele specific real-time PCR (AS kinetic PCR) SNP typing method. In silico analysis based on predicted HRM curves according to the G+C content of each fragment for all 567 sequence types (STs) in the MLST database together with empiric data from the 85 isolates demonstrated that HRM analysis resolves E. faecium into 231 "melting types" (MelTs) and provides a Simpson's Index of Diversity (D) of 0.991 with respect to MLST. This is a significant improvement on the AS kinetic PCR SNP typing scheme that resolves 61 SNP types with D of 0.95. The MelTs were concordant with the known ST of the isolates. For the 85 isolates, there were 13 PFGE patterns, 17 STs, 14 MelTs and eight SNP types. There was excellent concordance between PFGE, MLST and MelTs with Adjusted Rand Indices of PFGE to MelT 0.936 and ST to MelT 0.973. In conclusion, this HRM based method appears rapid and reproducible. The results are concordant with MLST and the MLST based population structure.
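
    The abstract above reports Simpson's Index of Diversity (D) for each typing scheme but does not say how it was computed. A minimal sketch, assuming the commonly used Hunter-Gaston form D = 1 - sum(n_j(n_j - 1)) / (N(N - 1)), where n_j is the number of isolates of type j and N is the total number of isolates; the function name and the example labels are hypothetical and are not data from the study.

```python
from collections import Counter


def simpson_diversity(type_labels):
    """Simpson's Index of Diversity (Hunter-Gaston form, an assumption here).

    type_labels holds one type label per isolate, e.g. a melting type or ST.
    """
    n = len(type_labels)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(type_labels)
    # Number of ordered isolate pairs that share the same type.
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical labels: four isolates, two of which share a type.
example_labels = ["A", "A", "B", "C"]
d_example = simpson_diversity(example_labels)
```

    D approaches 1 when nearly every isolate has its own type and equals 0 when all isolates share one type, which is why a scheme that resolves more types tends to score higher.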

  10. Distribution and molecular evolution of Bacillus anthracis genotypes in Namibia.

    Directory of Open Access Journals (Sweden)

    Wolfgang Beyer

    Full Text Available The recent development of genetic markers for Bacillus anthracis has made it possible to monitor the spread and distribution of this pathogen during and between anthrax outbreaks. In Namibia, anthrax outbreaks occur annually in the Etosha National Park (ENP) and on private game and livestock farms. We genotyped 384 B. anthracis isolates collected between 1983 and 2010 to identify the possible epidemiological correlations of anthrax outbreaks within and outside the ENP and to analyze genetic relationships between isolates from domestic and wild animals. The isolates came from 20 animal species and from the environment and were genotyped using a 31-marker multi-locus-VNTR-analysis (MLVA) and, in part, by twelve single nucleotide polymorphism (SNP) markers and four single nucleotide repeat (SNR) markers. A total of 37 genotypes (GT) were identified by MLVA, belonging to four SNP-groups. All GTs belonged to the A-branch in the cluster- and SNP-analyses. Thirteen GTs were found only outside the ENP, 18 only within the ENP and 6 both inside and outside. Genetic distances between isolates increased with increasing time between isolations. However, genetic distance between isolates at the beginning and end of the study period was relatively small, indicating that while the majority of GTs were only found sporadically, three genetically close GTs, accounting for more than four fifths of all the ENP isolates, appeared dominant throughout the study period. Genetic distances among isolates were significantly greater for isolates from different host species, but this effect was small, suggesting that while species-specific ecological factors may affect exposure processes, transmission cycles in different host species are still highly interrelated. The MLVA data were further used to establish a model of the probable evolution of GTs within the endemic region of the ENP. SNR-analysis was helpful in correlating an isolate with its source but did not elucidate

  11. Cell Line Controls for the Genotyping of a Spectrum of Human Single Nucleotide Polymorphisms in the Clinical Laboratory.

    Science.gov (United States)

    Kimbacher, Christine; Paar, Christian; Freystetter, Andrea; Berg, Joerg

    2018-05-01

    Genotyping for clinically important single nucleotide polymorphisms (SNPs) is performed by many clinical routine laboratories. To support testing, quality controls and reference materials are needed. Those may be derived from residual patient samples, left over samples of external quality assurance schemes, plasmid DNA or DNA from cell lines. DNAs from cell lines are commutable and available in large amounts. DNA from 38 cell lines were examined for suitability as controls in 11 SNP assays that are frequently used in a clinical routine laboratory: FV (1691G>A), FII (20210G>A), PAI-1 4G/5G polymorphism, MTHFR (677C>T, 1298A>C), HFE (H63D, S65C, C282Y), APOE (E2, E3, E4), LPH (-13910C>T), UGT1A1 (*28, *36, *37), TPMT (*2, *3A, *3B, *3C), VKORC1 (-1639G>A, 1173C>T), CYP2C9 (*2, *3, *5). Genotyping was performed by real-time PCR with melting curve analysis and confirmed by bi-directional sequencing. We find an almost complete spectrum of genotypic constellations within these 38 cell lines. About 12 cell lines appear sufficient as genotypic controls for the 11 SNP assays by covering almost all of the genotypes. However, hetero- and homozygous genotypes for FII and the alleles TPMT*2, UGT1A1*37 and CYP2C9*5 were not detected in any of the cell lines. DNA from most of the examined cell lines appear suitable as quality controls for these SNP assays in the laboratory routine, as to the implementation of those assays or to prepare samples for quality assurance schemes. Our study may serve as a pilot to further characterize these cell lines to arrive at the status of reference materials.

  12. Combining target enrichment with barcode multiplexing for high throughput SNP discovery

    Directory of Open Access Journals (Sweden)

    Lunke Sebastian

    2010-11-01

    Full Text Available Abstract. Background: The primary goal of genetic linkage analysis is to identify genes affecting a phenotypic trait. After localisation of the linkage region, efficient genetic dissection of the disease linked loci requires that functional variants are identified across the loci. These functional variations are difficult to detect due to the extent of genetic diversity and, to date, incomplete cataloguing of the large number of variants present both within and between populations. Massively parallel sequencing platforms offer unprecedented capacity for variant discovery; however, the number of samples analysed is still limited by cost per sample. Some progress has been made in reducing the cost of resequencing using either multiplexing methodologies or through the utilisation of targeted enrichment technologies which provide the ability to resequence genomic areas of interest rather than full genome sequencing. Results: We developed a method that combines current multiplexing methodologies with a solution-based target enrichment method to further reduce the cost of resequencing where region-specific sequencing is required. Our multiplex/enrichment strategy produced high quality data with nominal reduction of sequencing depth. We undertook a genotyping study and were successful in the discovery of novel SNP alleles in all samples at uniplex, duplex and pentaplex levels. Conclusion: Our work describes the successful combination of a targeted enrichment method and index barcode multiplexing to reduce costs, time and labour associated with processing large sample sets. Furthermore, we have shown that the sequencing depth obtained is adequate for credible SNP genotyping analysis at uniplex, duplex and pentaplex levels.

  13. Genotype-covariate interaction effects and the heritability of adult body mass index

    NARCIS (Netherlands)

    Robinson, Matthew R.; English, Geoffrey; Moser, Gerhard; Lloyd-Jones, Luke R; Triplett, Marcus A; Zhu, Zhihong; Nolte, Ilja M; van Vliet-Ostaptchouk, Jana V; Snieder, Harold; Esko, Tonu; Milani, Lili; Mägi, Reedik; Metspalu, Andres; Magnusson, Patrik K. E.; Pedersen, Nancy L.; Ingelsson, Erik; Johannesson, Magnus; Yang, Jian; Cesarini, David; Visscher, Peter M.

    Obesity is a worldwide epidemic, with major health and economic costs. Here we estimate heritability for body mass index (BMI) in 172,000 sibling pairs and 150,832 unrelated individuals and explore the contribution of genotype-covariate interaction effects at common SNP loci. We find evidence for

  14. Use of allele-specific FAIRE to determine functional regulatory polymorphism using large-scale genotyping arrays.

    Directory of Open Access Journals (Sweden)

    Andrew J P Smith

    Full Text Available Following the widespread use of genome-wide association studies (GWAS), focus is turning towards identification of causal variants rather than simply genetic markers of diseases and traits. As a step towards a high-throughput method to identify genome-wide, non-coding, functional regulatory variants, we describe the technique of allele-specific FAIRE, utilising large-scale genotyping technology (FAIRE-gen) to determine allelic effects on chromatin accessibility and regulatory potential. FAIRE-gen was explored using lymphoblastoid cells and the 50,000 SNP Illumina CVD BeadChip. The technique identified an allele-specific regulatory polymorphism within NR1H3 (coding for LXR-α), rs7120118, coinciding with a previously GWAS-identified SNP for HDL-C levels. This finding was confirmed using FAIRE-gen with the 200,000 SNP Illumina Metabochip and verified with the established method of TaqMan allelic discrimination. Examination of this SNP in two prospective Caucasian cohorts comprising 15,000 individuals confirmed the association with HDL-C levels (combined beta = 0.016; p = 0.0006), and analysis of gene expression identified an allelic association with LXR-α expression in heart tissue. Using increasingly comprehensive genotyping chips and distinct tissues for examination, FAIRE-gen has the potential to aid the identification of many causal SNPs associated with disease from GWAS.

  15. Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assays

    Directory of Open Access Journals (Sweden)

    Wagner Mark C

    2005-05-01

    Full Text Available Abstract. Background: Microbial forensics is important in tracking the source of a pathogen, whether the disease is a naturally occurring outbreak or part of a criminal investigation. Results: A method and SPR Opt (SNP and PCR-RFLP Optimization) software to perform a comprehensive, whole-genome analysis to forensically discriminate multiple sequences is presented. Tools for the optimization of forensic typing using Single Nucleotide Polymorphism (SNP) and PCR-Restriction Fragment Length Polymorphism (PCR-RFLP) analyses across multiple isolate sequences of a species are described. The PCR-RFLP analysis includes prediction and selection of optimal primers and restriction enzymes to enable maximum isolate discrimination based on sequence information. SPR Opt calculates all SNP or PCR-RFLP variations present in the sequences, groups them into haplotypes according to their co-segregation across those sequences, and performs combinatoric analyses to determine which sets of haplotypes provide maximal discrimination among all the input sequences. Those set combinations requiring that membership in the fewest haplotypes be queried (i.e. the fewest assays be performed) are found. These analyses highlight variable regions based on existing sequence data. These markers may be heterogeneous among unsequenced isolates as well, and thus may be useful for characterizing the relationships among unsequenced as well as sequenced isolates. The predictions are multi-locus. Analyses of mumps and SARS viruses are summarized. Phylogenetic trees created based on SNPs, PCR-RFLPs, and full genomes are compared for SARS virus, illustrating that purported phylogenies based only on SNP or PCR-RFLP variations do not match those based on multiple sequence alignment of the full genomes. Conclusion: This is the first software to optimize the selection of forensic markers to maximize information gained from the fewest assays, accepting whole or partial genome sequence data as input.
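
    The idea of discriminating all input sequences with the fewest assays can be illustrated with a small sketch. This is not the SPR Opt algorithm, which the abstract describes as a combinatoric analysis over haplotypes; it is a simple greedy heuristic swapped in for illustration, and the function names, marker names and allele calls are all hypothetical.

```python
from itertools import combinations


def count_split_pairs(alleles, pairs):
    """Count isolate pairs that a marker separates (different allele calls)."""
    return sum(1 for a, b in pairs if alleles[a] != alleles[b])


def greedy_marker_panel(profiles):
    """Pick markers greedily until every isolate pair is told apart.

    profiles maps marker name -> {isolate name: allele call}.
    Returns (chosen markers, isolate pairs that no marker can separate).
    """
    isolates = sorted(next(iter(profiles.values())))
    unresolved = set(combinations(isolates, 2))
    chosen = []
    while unresolved:
        # Ties go to the alphabetically first marker: max() keeps the first maximum.
        best = max(sorted(profiles),
                   key=lambda m: count_split_pairs(profiles[m], unresolved))
        if count_split_pairs(profiles[best], unresolved) == 0:
            break  # the remaining pairs are identical at every marker
        chosen.append(best)
        unresolved = {(a, b) for a, b in unresolved
                      if profiles[best][a] == profiles[best][b]}
    return chosen, unresolved


# Hypothetical allele calls for four isolates at three markers.
example_profiles = {
    "m1": {"s1": "A", "s2": "A", "s3": "G", "s4": "G"},
    "m2": {"s1": "C", "s2": "T", "s3": "C", "s4": "T"},
    "m3": {"s1": "A", "s2": "A", "s3": "A", "s4": "G"},
}
panel, unresolved_pairs = greedy_marker_panel(example_profiles)
```

    A greedy choice is fast but is not guaranteed to find the smallest possible panel, whereas an exhaustive combinatoric search can, at higher computational cost.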

  16. Affymetrix SNP array data for wild Dutch great tits (Parus major)

    NARCIS (Netherlands)

    Silva, Da Vinicius; Laine, Veronika N.; Bosse, M.; Oers, C.H.J.; Dibbits, B.W.; Visser, M.E.; Crooijmans, R.P.M.A.; Groenen, M.

    2018-01-01

    The great tit is a widely studied passerine bird species in ecology that, in the past decades, has provided important insights into speciation, phenology, behavior and microevolution. After completion of the great tit genome sequence, a customized high density 650k SNP array was developed enabling

  17. Accurate continuous geographic assignment from low- to high-density SNP data

    DEFF Research Database (Denmark)

    Guillot, Gilles; Jónsson, Hákon; Hinge, Antoine

    2016-01-01

    of georeferenced genotypes. Statistical inference under this model can be implemented within the theoretical framework of Integrated Nested Laplace Approximation (INLA), which represents one of the major recent breakthroughs in statistics, devoid of Monte Carlo simulations. We compare the performance of our method...... and SPA in a simulation framework. We highlight the accuracy and limits of continuous spatial assignment methods at various scales by analyzing genotype datasets from a diversity of species, including Florida scrub jay birds Aphelocoma coerulescens, Arabidopsis thaliana and humans, representing 41 to 197...

  18. Kernel machine SNP set analysis provides new insight into the association between obesity and polymorphisms located on the chromosomal 16q.12.2 region: Tehran Lipid and Glucose Study.

    Science.gov (United States)

    Javanrouh, Niloufar; Daneshpour, Maryam S; Soltanian, Ali Reza; Tapak, Leili

    2018-06-05

    Obesity is a serious health problem that leads to low quality of life and early mortality. For the purpose of prevention and gene therapy for such a worldwide disease, the genome-wide association study is a powerful tool for finding SNPs associated with increased risk of obesity. For association analysis, kernel machine regression is a generalized regression method that has the advantage of considering epistasis effects as well as the correlation between individuals due to unknown factors. In this study, information on the people who participated in the Tehran cardio-metabolic genetic study was used. They were genotyped for the chromosomal region, evaluating 986 variations located at 16q12.2 (build hg38). Kernel machine regression and single SNP analysis were used to assess the association between obesity and SNP genotype data. We found that the SNP sets associated with obesity were mostly in the FTO (P = 0.01), AIKTIP (P = 0.02) and MMP2 (P = 0.02) genes. Moreover, two SNPs, i.e., rs10521296 and rs11647470, showed significant association with obesity using kernel regression (P = 0.02). In conclusion, significant sets were randomly distributed throughout the region with more density around the FTO, AIKTIP and MMP2 genes. Furthermore, two intergenic SNPs showed significant association after using kernel machine regression. Therefore, more studies have to be conducted to assess their functionality or precise mechanism. Copyright © 2018 Elsevier B.V. All rights reserved.

  19. Genotyping-by-Sequencing and Its Exploitation for Forage and Cool-Season Grain Legume Breeding

    Science.gov (United States)

    Annicchiarico, Paolo; Nazzicari, Nelson; Wei, Yanling; Pecetti, Luciano; Brummer, Edward C.

    2017-01-01

    Genotyping-by-Sequencing (GBS) may drastically reduce genotyping costs compared with single nucleotide polymorphism (SNP) array platforms. However, it may require optimization for specific crops to maximize the number of available markers. Exploiting GBS-generated markers may require optimization, too (e.g., to cope with missing data). This study aimed (i) to compare elements of GBS protocols on legume species that differ for genome size, ploidy, and breeding system, and (ii) to show successful applications and challenges of GBS data on legume species. Preliminary work on alfalfa and Medicago truncatula suggested the greater interest of ApeKI over PstI:MspI DNA digestion. We compared KAPA and NEB Taq polymerases in combination with primer extensions that were progressively more selective on restriction sites, and found greater number of polymorphic SNP loci in pea, white lupin and diploid alfalfa when adopting KAPA with a non-selective primer. This protocol displayed a slight advantage also for tetraploid alfalfa (where SNP calling requires higher read depth). KAPA offered the further advantage of more uniform amplification than NEB over fragment sizes and GC contents. The number of GBS-generated polymorphic markers exceeded 6,500 in two tetraploid alfalfa reference populations and a world collection of lupin genotypes, and 2,000 in different sets of pea or lupin recombinant inbred lines. The predictive ability of GBS-based genomic selection was influenced by the genotype missing data threshold and imputation, as well as by the genomic selection model, with the best model depending on traits and data sets. We devised a simple method for comparing phenotypic vs. genomic selection in terms of predicted yield gain per year for same evaluation costs, whose application to preliminary data for alfalfa and pea in a hypothetical selection scenario for each crop indicated a distinct advantage of genomic selection. PMID:28536584

  20. Genotyping-by-Sequencing and Its Exploitation for Forage and Cool-Season Grain Legume Breeding

    Directory of Open Access Journals (Sweden)

    Paolo Annicchiarico

    2017-05-01

    Full Text Available Genotyping-by-Sequencing (GBS) may drastically reduce genotyping costs compared with single nucleotide polymorphism (SNP) array platforms. However, it may require optimization for specific crops to maximize the number of available markers. Exploiting GBS-generated markers may require optimization, too (e.g., to cope with missing data). This study aimed (i) to compare elements of GBS protocols on legume species that differ for genome size, ploidy, and breeding system, and (ii) to show successful applications and challenges of GBS data on legume species. Preliminary work on alfalfa and Medicago truncatula suggested the greater interest of ApeKI over PstI:MspI DNA digestion. We compared KAPA and NEB Taq polymerases in combination with primer extensions that were progressively more selective on restriction sites, and found greater number of polymorphic SNP loci in pea, white lupin and diploid alfalfa when adopting KAPA with a non-selective primer. This protocol displayed a slight advantage also for tetraploid alfalfa (where SNP calling requires higher read depth). KAPA offered the further advantage of more uniform amplification than NEB over fragment sizes and GC contents. The number of GBS-generated polymorphic markers exceeded 6,500 in two tetraploid alfalfa reference populations and a world collection of lupin genotypes, and 2,000 in different sets of pea or lupin recombinant inbred lines. The predictive ability of GBS-based genomic selection was influenced by the genotype missing data threshold and imputation, as well as by the genomic selection model, with the best model depending on traits and data sets. We devised a simple method for comparing phenotypic vs. genomic selection in terms of predicted yield gain per year for same evaluation costs, whose application to preliminary data for alfalfa and pea in a hypothetical selection scenario for each crop indicated a distinct advantage of genomic selection.

  1. Imputation across genotyping arrays for genome-wide association studies: assessment of bias and a correction strategy.

    Science.gov (United States)

    Johnson, Eric O; Hancock, Dana B; Levy, Joshua L; Gaddis, Nathan C; Saccone, Nancy L; Bierut, Laura J; Page, Grier P

    2013-05-01

    A great promise of publicly sharing genome-wide association data is the potential to create composite sets of controls. However, studies often use different genotyping arrays, and imputation to a common set of SNPs has shown substantial bias: a problem which has no broadly applicable solution. Based on the idea that using differing genotyped SNP sets as inputs creates differential imputation errors and thus bias in the composite set of controls, we examined the degree to which each of the following occurs: (1) imputation based on the union of genotyped SNPs (i.e., SNPs available on one or more arrays) results in bias, as evidenced by spurious associations (type 1 error) between imputed genotypes and arbitrarily assigned case/control status; (2) imputation based on the intersection of genotyped SNPs (i.e., SNPs available on all arrays) does not evidence such bias; and (3) imputation quality varies by the size of the intersection of genotyped SNP sets. Imputations were conducted in European Americans and African Americans with reference to HapMap phase II and III data. Imputation based on the union of genotyped SNPs across the Illumina 1M and 550v3 arrays showed spurious associations for 0.2 % of SNPs: ~2,000 false positives per million SNPs imputed. Biases remained problematic for very similar arrays (550v1 vs. 550v3) and were substantial for dissimilar arrays (Illumina 1M vs. Affymetrix 6.0). In all instances, imputing based on the intersection of genotyped SNPs (as few as 30 % of the total SNPs genotyped) eliminated such bias while still achieving good imputation quality.
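
    The contrast between imputing from the union and from the intersection of genotyped SNP sets can be made concrete with a short sketch. The array names and SNP identifiers below are hypothetical placeholders, not content of the real arrays, and the helper names are illustrative only; the last line simply reproduces the abstract's arithmetic that a 0.2 % spurious-association rate corresponds to about 2,000 false positives per million imputed SNPs.

```python
def imputation_inputs(array_snps):
    """Return (union, intersection) of the SNP sets genotyped on each array.

    array_snps maps array name -> set of SNP identifiers on that array.
    """
    snp_sets = list(array_snps.values())
    union = set().union(*snp_sets)               # SNPs on one or more arrays
    intersection = set.intersection(*snp_sets)   # SNPs on every array
    return union, intersection


def expected_false_positives(spurious_rate, n_imputed_snps):
    """Expected number of spurious associations at a given per-SNP rate."""
    return spurious_rate * n_imputed_snps


# Hypothetical SNP content of two arrays.
example_arrays = {
    "array_a": {"rs1", "rs2", "rs3", "rs4"},
    "array_b": {"rs2", "rs3", "rs5"},
}
union_snps, shared_snps = imputation_inputs(example_arrays)
shared_fraction = len(shared_snps) / len(union_snps)

# 0.2 % of one million imputed SNPs.
per_million = expected_false_positives(0.002, 1_000_000)
```

    Using only the shared SNPs as imputation input gives every sample the same starting information, which is the property the abstract credits with removing the bias.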

  2. Grouping and clustering of maize Lancaster germplasm inbreds according to the results of SNP-analysis

    Directory of Open Access Journals (Sweden)

    K. V. Derkach

    2017-08-01

    Full Text Available The objective of this article is the grouping and clustering of maize inbred lines based on the results of SNP-genotyping for the verification of a separate cluster of Lancaster germplasm inbred lines. As material for the study, we used 91 maize (Zea mays L.) inbred lines, including 31 Lancaster germplasm lines and 60 inbred lines of other germplasms (23 Iodent inbreds, 15 Reid inbreds, 7 Lacon inbreds, 12 Mix inbreds and 3 exotic inbreds). The majority of the given inbred lines are included in the Dnipro breeding programme. The SNP-genotyping of these inbred lines was conducted using the BDI-III panel of 384 SNP-markers developed by BioDiagnostics, Inc. (USA) on the basis of the Illumina VeraCode Bead Plate. The SNP-markers of this panel are biallelic and are located on all 10 maize chromosomes. Their range of conductivity was >0.6. The SNP-analysis was made in a fully automated mode on Illumina BeadStation equipment at BioDiagnostics, Inc. (USA). A principal component analysis was applied to group a general set of 91 inbreds according to allelic states of SNP-markers and to identify a cluster of Lancaster inbreds. The clustering and determining hierarchy in 31 Lancaster germplasm inbreds used quantitative cluster analysis. The share of monomorphic markers in the studied set of 91 inbred lines equaled 0.7%, and the share of dimorphic markers equaled 99.3%. Minor allele frequency (MAF) > 0.2 was observed for 80.6% of dimorphic markers, the average index of shift of gene diversity equaled 0.2984, PIC on average reached 0.3144. The index of gene diversity of markers varied from 0.1701 to 0.1901, pairwise genetic distances between inbred lines ranged from 0.0316 to 0.8000, the frequencies of major alleles of SNP-markers were within 0.5085–0.9821, and the frequencies of minor alleles were within 0.0179–0.4915. The average homozygosity of inbred lines was 98.8%. The principal component analysis of SNP-distances confirmed the isolation of the Lancaster
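
    The marker statistics quoted above (major and minor allele frequency, gene diversity, PIC) can be sketched for a single biallelic SNP. The abstract does not state which formulas were used, so this sketch assumes the standard definitions: gene diversity as 1 - (p^2 + q^2) and PIC as 1 - (p^2 + q^2) - 2 p^2 q^2, with p and q the major and minor allele frequencies. The function name and allele calls are hypothetical, and each inbred line is assumed to contribute one homozygous call.

```python
from collections import Counter


def biallelic_marker_stats(allele_calls):
    """Summary statistics for one biallelic SNP across inbred lines.

    allele_calls holds one allele per line (lines assumed homozygous).
    """
    counts = Counter(allele_calls)
    if not 1 <= len(counts) <= 2:
        raise ValueError("expected a monomorphic or biallelic marker")
    n = sum(counts.values())
    p = max(counts.values()) / n          # major allele frequency
    q = 1.0 - p                           # minor allele frequency (MAF)
    gene_diversity = 1.0 - (p * p + q * q)
    pic = gene_diversity - 2.0 * p * p * q * q
    return {"major": p, "maf": q, "gene_diversity": gene_diversity, "pic": pic}


# Hypothetical calls: three lines carry A, one carries G.
example_stats = biallelic_marker_stats(["A", "A", "A", "G"])
```

    Under these definitions a biallelic marker reaches its maximum PIC of 0.375 at equal allele frequencies, so an average PIC of 0.3144 would indicate markers that are, on average, fairly informative.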

  3. Whole genome resequencing of black Angus and Holstein cattle for SNP and CNV discovery

    Directory of Open Access Journals (Sweden)

    Stothard Paul

    2011-11-01

    Full Text Available Abstract Background: One of the goals of livestock genomics research is to identify the genetic differences responsible for variation in phenotypic traits, particularly those of economic importance. Characterizing the genetic variation in livestock species is an important step towards linking genes or genomic regions with phenotypes. The completion of the bovine genome sequence and recent advances in DNA sequencing technology allow for in-depth characterization of the genetic variations present in cattle. Here we describe the whole-genome resequencing of two Bos taurus bulls from distinct breeds for the purpose of identifying and annotating novel forms of genetic variation in cattle. Results: The genomes of a Black Angus bull and a Holstein bull were sequenced to 22-fold and 19-fold coverage, respectively, using the ABI SOLiD system. Comparisons of the sequences with the Btau4.0 reference assembly yielded 7 million single nucleotide polymorphisms (SNPs), 24% of which were identified in both animals. Of the total SNPs found in Holstein, Black Angus, and in both animals, 81%, 81%, and 75% respectively are novel. In-depth annotations of the data identified more than 16 thousand distinct non-synonymous SNPs (85% novel) between the two datasets. Alignments between the SNP-altered proteins and orthologues from numerous species indicate that many of the SNPs alter well-conserved amino acids. Several SNPs predicted to create or remove stop codons were also found. A comparison between the sequencing SNPs and genotyping results from the BovineHD high-density genotyping chip indicates a detection rate of 91% for homozygous SNPs and 81% for heterozygous SNPs. The false positive rate is estimated to be about 2% for both the Black Angus and Holstein SNP sets, based on follow-up genotyping of 422 and 427 SNPs, respectively. Comparisons of read depth between the two bulls along the reference assembly identified 790 putative copy-number variations (CNVs). Ten

  4. Development and implementation of a highly-multiplexed SNP array for genetic mapping in maritime pine and comparative mapping with loblolly pine

    Directory of Open Access Journals (Sweden)

    Garnier-Géré Pauline

    2011-07-01

    Full Text Available Abstract Background: Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.), the main conifer used for commercial plantation in southwestern Europe. Results: We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels) and from Sanger-derived Expressed Sequence Tags assembled into a unigene set (829 in silico SNPs/Indels). Offspring from three-generation outbred (G2) and inbred (F2) pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map made it possible to align the 12 linkage groups of both species. Conclusions: Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome. This SNP-array will be extended thanks to recent sequencing effort using

  5. SNPServer: a real-time SNP discovery tool.

    Science.gov (United States)

    Savage, David; Batley, Jacqueline; Erwin, Tim; Logan, Erica; Love, Christopher G; Lim, Geraldine A C; Mongin, Emmanuel; Barker, Gary; Spangenberg, German C; Edwards, David

    2005-07-01

    SNPServer is a real-time flexible tool for the discovery of SNPs (single nucleotide polymorphisms) within DNA sequence data. The program uses BLAST to identify related sequences and CAP3 to cluster and align these sequences. The alignments are passed to the SNP discovery software autoSNP, a program that detects SNPs and insertion/deletion polymorphisms (indels). Alternatively, lists of related sequences or pre-assembled sequences may be entered for SNP discovery. SNPServer and autoSNP use redundancy to differentiate between candidate SNPs and sequence errors. For each candidate SNP, two measures of confidence are calculated: the redundancy of the polymorphism at a SNP locus and the co-segregation of the candidate SNP with other SNPs in the alignment. SNPServer is available at http://hornbill.cspp.latrobe.edu.au/snpdiscovery.html.
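
    The redundancy idea described here can be sketched in a few lines: a column of an alignment is reported as a candidate SNP only when at least two different bases are each seen in several reads, so a base seen once is treated as a likely sequencing error. This is an illustration of the general principle only, not the autoSNP code; the function name, the `min_redundancy` parameter and the toy alignment are assumptions, and the co-segregation score is not modelled.

```python
from collections import Counter


def candidate_snps(alignment, min_redundancy=2):
    """Return (column, {base: count}) for columns with >= 2 well-supported bases.

    alignment: list of equal-length aligned sequences, '-' marking gaps.
    """
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")

    snps = []
    for col in range(length):
        # Count only real bases; gaps and ambiguity codes are ignored.
        counts = Counter(
            seq[col].upper() for seq in alignment if seq[col].upper() in "ACGT"
        )
        supported = {b: n for b, n in counts.items() if n >= min_redundancy}
        if len(supported) >= 2:
            snps.append((col, dict(sorted(supported.items()))))
    return snps


if __name__ == "__main__":
    reads = ["ACGTAC", "ACGTAC", "ACCTAC", "ACCTAT"]
    # Column 2 has G and C twice each; the lone T in column 5 is not reported.
    print(candidate_snps(reads))
```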

  6. Light response of sunflower and canola as affected by plant density, plant genotype and N fertilization.

    Science.gov (United States)

    Soleymani, A

    2017-08-01

    Crop response to light is an important parameter determining crop growth. Three field (split plot) experiments were conducted to investigate the effects of plant density, plant genotype and N fertilization on the light absorption and light extinction of sunflower (Helianthus annuus L.) and canola (Brassica napus L.). A detailed set of plant growth, light absorption, crop yield and oil-related parameters was determined. Light was measured at noon on sunny days with clear sky. In experiment I, although the plant density (PD) of 14 resulted in the highest rate of sunflower light absorption (31.37%) and light extinction (0.756), the highest grain yield and grain oil yield were obtained at PD12 (3639 and 1457.9 kg/ha, respectively), as well as by genotype SUP.A. In experiment II (canola), PD80 resulted in the highest rate of light absorption (13.13%), light extinction (0.63), grain yield (2189.4 kg/ha) and grain oil yield (556.54 kg/ha). This was also the case for Genotype H. In experiment III (canola), although N150 resulted in the highest rate of light absorption (10.74%) and light extinction (0.48), the highest grain yield (3413.6 kg/ha) and grain oil yield (891.86 kg/ha) were obtained at N100, as well as by Genotype H401. Results indicate how light properties, crop growth and yield of sunflower and canola can be affected by plant and environmental parameters, which are also of practical use to farmers. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. A High-Density SNP Genetic Linkage Map and QTL Analysis of Growth-Related Traits in a Hybrid Family of Oysters (Crassostrea gigas × Crassostrea angulata) Using Genotyping-by-Sequencing

    Directory of Open Access Journals (Sweden)

    Jinpeng Wang

    2016-05-01

    Full Text Available Oysters are among the most important species in global aquaculture. Crassostrea gigas, and its subspecies C. angulata, are the major cultured species. To determine the genetic basis of growth-related traits in oysters, we constructed a second-generation linkage map from 3367 single-nucleotide polymorphisms (SNPs) based on genotyping-by-sequencing, genotyped from a C. gigas × C. angulata hybrid family. These 3367 SNPs were distributed on 1695 markers, which were assigned to 10 linkage groups. The genetic linkage map had a total length of 1084.3 cM, with an average of 0.8 cM between markers; it thus represents the densest genetic map constructed for oysters to date. Twenty-seven quantitative trait loci (QTL) for five growth-related traits were detected. These QTL could explain 4.2–7.7% (mean = 5.4%) of the phenotypic variation. In total, 50.8% of phenotypic variance for shell width, 7.7% for mass weight, and 34.1% for soft tissue weight were explained. The detected QTL were distributed among eight linkage groups, and more than half (16) were concentrated within narrow regions in their respective linkage groups. Thirty-eight annotated genes were identified within the QTL regions, two of which are key genes for carbohydrate metabolism. Other genes were found to participate in assembly and regulation of the actin cytoskeleton, signal transduction, and regulation of cell differentiation and development. The newly developed high-density genetic map, and the QTL and candidate genes identified provide a valuable genetic resource and a basis for marker-assisted selection for C. gigas and C. angulata.

  8. Cathepsin D SNP associated with increased risk of variant Creutzfeldt-Jakob disease

    Directory of Open Access Journals (Sweden)

    Sanchez-Juan Pascual

    2008-04-01

    Full Text Available Abstract Background: Variant Creutzfeldt-Jakob disease (vCJD) originally resulted from the consumption of foodstuffs contaminated by bovine spongiform encephalopathy (BSE) material, with 163 confirmed cases in the UK to date. Many thousands are likely to have been exposed to dietary infection and so it is important (for surveillance, epidemic modelling, public health and understanding pathogenesis) to identify genetic factors that may affect individual susceptibility to infection. This study looked at a polymorphism in the cathepsin D gene (refSNP ID: rs17571) previously examined in Alzheimer's disease (AD). Methods: Blood samples taken from 110 vCJD patients were tested for the C-T base change, and genotype data were compared with published frequencies for a control population using multiple logistic regression. Results: There was a significant excess of the cathepsin D polymorphism TT genotype in the vCJD cohort compared to controls. The TT genotype was found to have a 9.75 fold increase in risk of vCJD compared to the CT genotype and a 10.92 fold increase compared to the CC genotype. Conclusion: This mutation event has been observed to alter the protease activity of the cathepsin D protein and has been linked to an increase in amyloid beta plaque formation in AD. vCJD neuropathology is characterised by the presence of amyloid plaques, formed from the prion protein, and therefore alterations in the amyloid processing activity of cathepsin D may affect the neuropathogenesis of this disease.

  9. Impact of QTL minor allele frequency on genomic evaluation using real genotype data and simulated phenotypes in Japanese Black cattle.

    Science.gov (United States)

    Uemoto, Yoshinobu; Sasaki, Shinji; Kojima, Takatoshi; Sugimoto, Yoshikazu; Watanabe, Toshio

    2015-11-19

    Genetic variance that is not captured by single nucleotide polymorphisms (SNPs) is due to imperfect linkage disequilibrium (LD) between SNPs and quantitative trait loci (QTLs), and the extent of LD between SNPs and QTLs depends on the difference in minor allele frequencies (MAF) between them. To evaluate the impact of MAF of QTLs on genomic evaluation, we performed a simulation study using real cattle genotype data. In total, 1368 Japanese Black cattle and 592,034 SNPs (Illumina BovineHD BeadChip) were used. We simulated phenotypes using real genotypes under different scenarios, varying the MAF categories, QTL heritability, number of QTLs, and distribution of QTL effect. After generating true breeding values and phenotypes, QTL heritability was estimated and the prediction accuracy of genomic estimated breeding value (GEBV) was assessed under different SNP densities, prediction models, and population sizes by a reference-test validation design. The extent of LD between SNPs and QTLs in this population was higher in the QTLs with high MAF than in those with low MAF. The effect of MAF of QTLs depended on the genetic architecture, evaluation strategy, and population size in genomic evaluation. In genetic architecture, genomic evaluation was affected by the MAF of QTLs combined with the QTL heritability and the distribution of QTL effect. The number of QTLs did not affect genomic evaluation when it exceeded 50. In the evaluation strategy, we showed that different SNP densities and prediction models affect the heritability estimation and genomic prediction and that this depends on the MAF of QTLs. In addition, accurate QTL heritability and GEBV were obtained using denser SNP information and a prediction model that accounted for SNPs with both low and high MAFs. In population size, a large sample size is needed to increase the accuracy of GEBV. The MAF of QTL had an impact on heritability estimation and prediction accuracy. Most genetic variance can be captured

  10. Interactions Between SNP Alleles at Multiple Loci and Variation in Skin Pigmentation in 122 Caucasians

    Directory of Open Access Journals (Sweden)

    Sumiko Anno

    2007-01-01

    Full Text Available This study was undertaken to clarify the molecular basis for human skin color variation and the environmental adaptability to ultraviolet irradiation, with the ultimate goal of predicting the impact of changes in future environments on human health risk. One hundred twenty-two Caucasians living in Toledo, Ohio participated. Back and cheek skin were assayed for melanin as a quantitative trait marker. Buccal cell samples were collected and used for DNA extraction. DNA was used for SNP genotyping using the Masscode™ system, which entails two-step PCR amplification and a platform chemistry which allows cleavable mass spectrometry tags. The results show that gene-gene interaction between SNP alleles at multiple loci (not necessarily on the same chromosome) contributes to inter-individual skin color variation while suggesting a high probability of linkage disequilibrium. Confirmation of these findings requires further study with other ethnic groups to analyze the associations between SNP alleles at multiple loci and human skin color variation. Our overarching goal is to use remote sensing data to clarify the interaction between atmospheric environments and SNP allelic frequency and investigate human adaptability to ultraviolet irradiation. Such information should greatly assist in the prediction of the health effects of future environmental changes such as ozone depletion and increased ultraviolet exposure. If such health effects are to some extent predictable, it might be possible to prepare for such changes in advance and thus reduce the extent of their impact.

  11. Clinical significance of SNP (rs2596542) in histocompatibility complex class I-related gene A promoter region among hepatitis C virus related hepatocellular carcinoma cases

    Directory of Open Access Journals (Sweden)

    Amal A. Mohamed

    2017-07-01

    Full Text Available The major histocompatibility complex class I-related gene A (MICA) is an antigen induced by stress and performs an integral role in immune responses as an anti-infectious and antitumor agent. This work was designed to investigate whether SNP rs2596542C/T in the MICA promoter region is predictive of liver cirrhosis (LC) and hepatocellular carcinoma (HCC). Forty-seven healthy controls and 94 HCV-infected patients, subdivided into 47 LC and 47 HCC subjects, were enrolled in this study. SNP association was studied using real time PCR and soluble serum MICA concentration was measured using ELISA. Results showed that heterozygous genotype rs2596542CT was significantly (P = 0.022) distributed between HCC and LC related CHC patients. The sMICA was significantly higher (P = 0.0001) among HCC and LC. No significant association (P = 0.56) between rs2596542CT genotypes and sMICA levels was observed. Studying SNP rs2596542C/T association with HCC and LC susceptibility revealed that statistically significant differences (P = 0.013, P = 0.027) were only observed between SNP rs2596542C/T and each of HCC and LC, respectively, versus healthy controls, indicating that the rs2596542C/T genetic variation is not a significant contributor to HCC development in LC patients. Moreover, the T allele was considered a risk factor for HCC and LC vulnerability in HCV patients (OR = 1.93 and 2.1, respectively), while the C allele contributes to decreasing HCC risk. Therefore, SNP (rs2596542C/T) in the MICA promoter region and sMICA levels might be potentially useful markers in the assessment of liver disease progression to LC and HCC.

  12. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.

    Science.gov (United States)

    Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos

    2015-08-01

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  13. Genomic evaluations with many more genotypes

    Directory of Open Access Journals (Sweden)

    Wiggans George R

    2011-03-01

    Full Text Available Abstract Background: Genomic evaluations in Holstein dairy cattle have quickly become more reliable over the last two years in many countries as more animals have been genotyped for 50,000 markers. Evaluations can also include animals genotyped with more or fewer markers using new tools such as the 777,000 or 2,900 marker chips recently introduced for cattle. Gains from more markers can be predicted using simulation, whereas strategies to use fewer markers have been compared using subsets of actual genotypes. The overall cost of selection is reduced by genotyping most animals at less than the highest density and imputing their missing genotypes using haplotypes. Algorithms to combine different densities need to be efficient because numbers of genotyped animals and markers may continue to grow quickly. Methods: Genotypes for 500,000 markers were simulated for the 33,414 Holsteins that had 50,000 marker genotypes in the North American database. Another 86,465 non-genotyped ancestors were included in the pedigree file, and linkage disequilibrium was generated directly in the base population. Mixed density datasets were created by keeping 50,000 (every tenth) of the markers for most animals. Missing genotypes were imputed using a combination of population haplotyping and pedigree haplotyping. Reliabilities of genomic evaluations using linear and nonlinear methods were compared. Results: Differing marker sets for a large population were combined with just a few hours of computation. About 95% of paternal alleles were determined correctly, and > 95% of missing genotypes were called correctly. Reliability of breeding values was already high (84.4%) with 50,000 simulated markers. The gain in reliability from increasing the number of markers to 500,000 was only 1.6%, but more than half of that gain resulted from genotyping just 1,406 young bulls at higher density. Linear genomic evaluations had reliabilities 1.5% lower than the nonlinear evaluations with 50
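
    The mixed-density design described in the Methods (keeping every tenth marker of a 500,000-marker genotype so that 50,000 remain observed) can be sketched as a simple masking step. This is a minimal illustration, not the authors' software: the function name, the use of `None` for a missing genotype and the choice of starting at marker 0 are all assumptions, and the imputation step that follows in the study is not shown.

```python
def mask_to_low_density(genotypes, keep_every=10, missing=None):
    """Keep every `keep_every`-th marker and mark all others as missing.

    genotypes: list of allele counts (0, 1 or 2) for one animal at high density.
    Returns a list of the same length so marker positions stay aligned.
    """
    return [
        g if i % keep_every == 0 else missing
        for i, g in enumerate(genotypes)
    ]


if __name__ == "__main__":
    high_density = [0, 1, 2, 1, 0] * 100_000  # 500,000 simulated markers
    low_density = mask_to_low_density(high_density)
    observed = sum(g is not None for g in low_density)
    print(len(low_density), observed)
```

    Keeping the masked list at full length means an imputation routine can fill the missing positions without re-indexing markers.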

  14. Rapid genotyping with DNA micro-arrays for high-density linkage mapping and QTL mapping in common buckwheat (Fagopyrum esculentum Moench)

    Science.gov (United States)

    Yabe, Shiori; Hara, Takashi; Ueno, Mariko; Enoki, Hiroyuki; Kimura, Tatsuro; Nishimura, Satoru; Yasui, Yasuo; Ohsawa, Ryo; Iwata, Hiroyoshi

    2014-01-01

    For genetic studies and genomics-assisted breeding, particularly of minor crops, a genotyping system that does not require a priori genomic information is preferable. Here, we demonstrated the potential of a novel array-based genotyping system for the rapid construction of high-density linkage map and quantitative trait loci (QTL) mapping. By using the system, we successfully constructed an accurate, high-density linkage map for common buckwheat (Fagopyrum esculentum Moench); the map was composed of 756 loci and included 8,884 markers. The number of linkage groups converged to eight, which is the basic number of chromosomes in common buckwheat. The sizes of the linkage groups of the P1 and P2 maps were 773.8 and 800.4 cM, respectively. The average interval between adjacent loci was 2.13 cM. The linkage map constructed here will be useful for the analysis of other common buckwheat populations. We also performed QTL mapping for main stem length and detected four QTL. It took 37 days to process 178 samples from DNA extraction to genotyping, indicating the system enables genotyping of genome-wide markers for a few hundred buckwheat plants before the plants mature. The novel system will be useful for genomics-assisted breeding in minor crops without a priori genomic information. PMID:25914583

  15. Revision of the SNPforID 34-plex forensic ancestry test: Assay enhancements, standard reference sample genotypes and extended population studies.

    Science.gov (United States)

    Fondevila, M; Phillips, C; Santos, C; Freire Aradas, A; Vallone, P M; Butler, J M; Lareu, M V; Carracedo, A

    2013-01-01

    A revision of an established 34 SNP forensic ancestry test has been made by swapping the under-performing rs727811 component SNP with the highly informative rs3827760 that shows a near-fixed East Asian specific allele. We collated SNP variability data for the revised SNP set in 66 reference populations from 1000 Genomes and HGDP-CEPH panels and used this as reference data to analyse four U.S. populations showing a range of admixture patterns. The U.S. Hispanics sample in particular displayed heterogeneous values of co-ancestry between European, Native American and African contributors, likely to reflect in part, the way this disparate group is defined using cultural as well as population genetic parameters. The genotyping of over 700 U.S. population samples also provided the opportunity to thoroughly gauge peak mobility variation and peak height ratios observed from routine use of the single base extension chemistry of the 34-plex test. Finally, the genotyping of the widely used DNA profiling Standard Reference Material samples plus other control DNAs completes the audit of the 34-plex assay to allow forensic practitioners to apply this test more readily in their own laboratories. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  16. Highly effective SNP-based association mapping and management of recessive defects in livestock

    DEFF Research Database (Denmark)

    Charlier, Carole; Coppieters, Wouter; Rollin, Frédéric

    2008-01-01

    The widespread use of elite sires by means of artificial insemination in livestock breeding leads to the frequent emergence of recessive genetic defects, which cause significant economic and animal welfare concerns. Here we show that the availability of genome-wide, high-density SNP panels, combi...

  17. dbSNP

    Data.gov (United States)

    U.S. Department of Health & Human Services — dbSNP is a database of single nucleotide polymorphisms (SNPs) and multiple small-scale variations that include insertions/deletions, microsatellites, and...

  18. CFSAN SNP Pipeline: an automated method for constructing SNP matrices from next-generation sequence data

    Directory of Open Access Journals (Sweden)

    Steve Davis

    2015-08-01

    Full Text Available The analysis of next-generation sequence (NGS) data is often a fragmented step-wise process. For example, multiple pieces of software are typically needed to map NGS reads, extract variant sites, and construct a DNA sequence matrix containing only single nucleotide polymorphisms (i.e., a SNP matrix) for a set of individuals. The management and chaining of these software pieces and their outputs can often be a cumbersome and difficult task. Here, we present CFSAN SNP Pipeline, which combines into a single package the mapping of NGS reads to a reference genome with Bowtie2, processing of those mapping (BAM) files using SAMtools, identification of variant sites using VarScan, and production of a SNP matrix using custom Python scripts. We also introduce a Python package (CFSAN SNP Mutator) that when given a reference genome will generate variants of known position against which we validate our pipeline. We created 1,000 simulated Salmonella enterica sp. enterica Serovar Agona genomes at 100× and 20× coverage, each containing 500 SNPs, 20 single-base insertions and 20 single-base deletions. For the 100× dataset, the CFSAN SNP Pipeline recovered 98.9% of the introduced SNPs and had a false positive rate of 1.04 × 10⁻⁶; for the 20× dataset 98.8% of SNPs were recovered and the false positive rate was 8.34 × 10⁻⁷. Based on these results, CFSAN SNP Pipeline is a robust and accurate tool and is among the first to combine into a single executable the myriad steps required to produce a SNP matrix from NGS data. Such a tool is useful to those working in an applied setting (e.g., food safety traceback investigations) as well as for those interested in evolutionary questions.
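
    The final step named in this abstract, turning per-sample variant calls into a SNP matrix, can be sketched as follows. This is not the CFSAN SNP Pipeline code, which runs Bowtie2, SAMtools and VarScan upstream; the function name, the 0-based positions and the input layout (one dictionary of position to alternate base per sample) are assumptions made for the illustration.

```python
def build_snp_matrix(reference, sample_calls):
    """Build a SNP matrix restricted to sites that vary in at least one sample.

    reference: reference sequence as a string.
    sample_calls: {sample_name: {position: alternate_base}}, 0-based positions.
    Returns (positions, {sample_name: string of bases at those positions}).
    """
    # A site enters the matrix if any sample carries a variant there.
    positions = sorted({pos for calls in sample_calls.values() for pos in calls})
    matrix = {}
    for sample, calls in sample_calls.items():
        # Samples without a call at a site are given the reference base.
        matrix[sample] = "".join(calls.get(pos, reference[pos]) for pos in positions)
    return positions, matrix


if __name__ == "__main__":
    ref = "ACGTACGT"
    calls = {"s1": {1: "T"}, "s2": {1: "T", 6: "A"}, "s3": {}}
    print(build_snp_matrix(ref, calls))
```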

  19. Accuracy of Genomic Evaluations of Juvenile Growth Rate in Common Carp (Cyprinus carpio) Using Genotyping by Sequencing

    Directory of Open Access Journals (Sweden)

    Christos Palaiokostas

    2018-03-01

    Full Text Available Cyprinids are the most important group of farmed fish globally in terms of production volume, with common carp (Cyprinus carpio) being one of the most valuable species of the group. The use of modern selective breeding methods in carp is at a formative stage, implying a large scope for genetic improvement of key production traits. In the current study, a population of 1,425 carp juveniles, originating from a partial factorial cross between 40 sires and 20 dams, was used for investigating the potential of genomic selection (GS) for juvenile growth, an exemplar polygenic production trait. RAD sequencing was used to identify and genotype SNP markers for subsequent parentage assignment, construction of a medium density genetic map (12,311 SNPs), genome-wide association study (GWAS), and testing of GS. A moderate heritability of 0.33 (s.e. 0.05) was estimated for body length of carp at 120 days (as a proxy of juvenile growth). No genome-wide significant QTL was identified using a single marker GWAS approach. Genomic prediction of breeding values outperformed pedigree-based prediction, resulting in 18% improvement in prediction accuracy. The impact of reduced SNP densities on prediction accuracy was tested by varying minor allele frequency (MAF) thresholds, with no drop in prediction accuracy until the MAF threshold is set <0.3 (2,744 SNPs). These results point to the potential for GS to improve economically important traits in common carp breeding programs.
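
    Reducing SNP density with a minor allele frequency threshold, as tested in this study, amounts to computing the MAF of each marker and keeping those at or above the cut-off. The sketch below assumes genotypes coded as counts of one allele (0, 1, 2) with `None` for missing calls; the function names and the toy data are hypothetical and the authors' actual filtering software is not stated in the abstract.

```python
def minor_allele_frequency(genotypes):
    """MAF of one SNP from allele counts (0, 1, 2); None marks a missing call."""
    called = [g for g in genotypes if g is not None]
    if not called:
        return 0.0
    freq = sum(called) / (2 * len(called))  # frequency of the counted allele
    return min(freq, 1.0 - freq)


def filter_by_maf(snp_genotypes, threshold):
    """Return the names of SNPs whose MAF is at least `threshold`."""
    return [
        name
        for name, genotypes in snp_genotypes.items()
        if minor_allele_frequency(genotypes) >= threshold
    ]


if __name__ == "__main__":
    toy = {
        "snp_a": [0, 1, 2, 1],   # MAF 0.5
        "snp_b": [0, 0, 0, 1],   # MAF 0.125
        "snp_c": [2, 2, 2, 2],   # monomorphic, MAF 0
    }
    print(filter_by_maf(toy, 0.1), filter_by_maf(toy, 0.3))
```

    Raising the threshold shrinks the marker set, which is how progressively sparser panels can be compared for prediction accuracy.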

  20. An Improved Consensus Linkage Map of Barley Based on Flow-Sorted Chromosomes and Single Nucleotide Polymorphism Markers

    Directory of Open Access Journals (Sweden)

    María Muñoz-Amatriaín

    2011-11-01

    Full Text Available Recent advances in high-throughput genotyping have made it easier to combine information from different mapping populations into consensus genetic maps, which provide increased marker density and genome coverage compared to individual maps. Previously, a single nucleotide polymorphism (SNP)-based genotyping platform was developed and used to genotype 373 individuals in four barley (Hordeum vulgare L.) mapping populations. This led to a 2943 SNP consensus genetic map with 975 unique positions. In this work, we add data from six additional populations and more individuals from one of the original populations to develop an improved consensus map from 1133 individuals. A stringent and systematic analysis of each of the 10 populations was performed to achieve uniformity. This involved reexamination of the four populations included in the previous map. As a consequence, we present a robust consensus genetic map that contains 2994 SNP loci mapped to 1163 unique positions. The map spans 1137.3 cM with an average density of one marker bin per 0.99 cM. A novel application of the genotyping platform for gene detection allowed the assignment of 2930 genes to flow-sorted chromosomes or arms, confirmed the position of 2545 SNP-mapped loci, added chromosome or arm allocations to an additional 370 SNP loci, and delineated pericentromeric regions for chromosomes 2H to 7H. Marker order has been improved and map resolution has been increased by almost 20%. These increased precision outcomes enable more optimized SNP selection for marker-assisted breeding and support association genetic analysis and map-based cloning. It will also improve the anchoring of DNA sequence scaffolds and the barley physical map to the genetic map.

  1. Integrated cryptosporidium assay to determine oocyst density, infectivity, and genotype for risk assessment of source and reuse water.

    Science.gov (United States)

    King, Brendon; Fanok, Stella; Phillips, Renae; Swaffer, Brooke; Monis, Paul

    2015-05-15

    Cryptosporidium continues to be problematic for the water industry, with risk assessments often indicating that treatment barriers may fail under extreme conditions. However, risk analyses have historically used oocyst densities and not considered either oocyst infectivity or species/genotype, which can result in an overestimation of risk if the oocysts are not human infective. We describe an integrated assay for determining oocyst density, infectivity, and genotype from a single-sample concentrate, an important advance that overcomes the need for processing multiple-grab samples or splitting sample concentrates for separate analyses. The assay incorporates an oocyst recovery control and is compatible with standard primary concentration techniques. Oocysts were purified from primary concentrates using immunomagnetic separation prior to processing by an infectivity assay. Plate-based cell culture was used to detect infectious foci, with a monolayer washing protocol developed to allow recovery and enumeration of oocysts. A simple DNA extraction protocol was developed to allow typing of any wells containing infectious Cryptosporidium. Water samples from a variety of source water and wastewater matrices, including a semirural catchment, wastewater, an aquifer recharge site, and storm water, were analyzed using the assay. Results demonstrate that the assay can reliably determine oocyst densities, infectivity, and genotype from single-grab samples for a variety of water matrices and emphasize the varying nature of Cryptosporidium risk extant throughout source waters and wastewaters. This assay should therefore enable a more comprehensive understanding of Cryptosporidium risk for different water sources, assisting in the selection of appropriate risk mitigation measures. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  2. Validation and discovery of genotype-phenotype associations in chronic diseases using linked data.

    Science.gov (United States)

    Pathak, Jyotishman; Kiefer, Richard; Freimuth, Robert; Chute, Christopher

    2012-01-01

    This study investigates federated SPARQL queries over Linked Open Data (LOD) in the Semantic Web to validate existing, and potentially discover new genotype-phenotype associations from public datasets. In particular, we report our preliminary findings for identifying such associations for commonly occurring chronic diseases using the Online Mendelian Inheritance in Man (OMIM) and Database for SNPs (dbSNP) within the LOD knowledgebase and compare them with Gene Wiki for coverage and completeness. Our results indicate that Semantic Web technologies can play an important role for in-silico identification of novel disease-gene-SNP associations, although additional verification is required before such information can be applied and used effectively.

  3. A Whole Genome Association Study on Meat Quality Traits Using High Density SNP Chips in a Cross between Korean Native Pig and Landrace

    Directory of Open Access Journals (Sweden)

    K.-T Lee

    2012-11-01

    Full Text Available A whole genome association (WGA) study was performed to detect significant polymorphisms for meat quality traits in an F2 cross population (N = 478) that was generated from Korean native pig sires and Landrace dams at the National Livestock Research Institute, Songwhan, Korea. The animals were genotyped using Illumina porcine 60k SNP beadchips, of which a set of 46,865 SNPs was available for the WGA analyses on ten carcass quality traits: live weight, crude protein, crude lipids, crude ash, water holding capacity, drip loss, shear force, CIE L, CIE a and CIE b. Phenotypes were regressed on additive and dominance effects for each SNP using a simple linear regression model, after adjusting for sex, sire and slaughter stage as fixed effects. With the significant SNPs for each trait (p<0.001), a stepwise regression procedure was applied to determine the best set of SNPs with the additive and/or dominance effects. A total of 106 SNPs, or quantitative trait loci (QTL), were detected, and about 32 to 66% of the total phenotypic variation was explained by the significant SNPs for each trait. The QTL were identified in most porcine chromosomes (SSCs), with the majority of the QTL detected in SSCs 1, 2, 12, 13, 14 and 16. Several QTL clusters were identified on SSCs 12, 16 and 17, and a cluster of QTL influencing crude protein, crude lipid, drip loss, shear force, CIE a and CIE b was located between 20 and 29 Mb of SSC12. A pleiotropic QTL for drip loss, CIE L and CIE b was also detected on SSC16. These QTL need to be validated in commercial pig populations for genetic improvement in meat quality via marker-assisted selection.
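
    The single-SNP regression on additive and dominance effects described above can be illustrated with a minimal sketch. It deliberately ignores the fixed effects (sex, sire, slaughter stage) that the study adjusted for, and the function name and phenotype values are hypothetical. With three genotype classes and no covariates, the model is saturated, so the least-squares estimates reduce to simple functions of the genotype means.

```python
from statistics import mean

def additive_dominance(phenotypes_by_genotype):
    """Estimate additive (a) and dominance (d) effects at one biallelic SNP.

    phenotypes_by_genotype maps "AA", "AB", "BB" to lists of phenotype values.
    With an intercept, an additive term (-1, 0, 1) and a dominance term
    (0, 1, 0), the model fits the three genotype means exactly, so the
    least-squares estimates follow directly from those means.
    """
    m_aa = mean(phenotypes_by_genotype["AA"])
    m_ab = mean(phenotypes_by_genotype["AB"])
    m_bb = mean(phenotypes_by_genotype["BB"])
    midpoint = (m_aa + m_bb) / 2
    a = (m_bb - m_aa) / 2   # half the difference between the two homozygotes
    d = m_ab - midpoint     # heterozygote deviation from the homozygote midpoint
    return a, d

# Hypothetical phenotype values, for illustration only (not data from the study)
toy = {"AA": [10.0, 12.0], "AB": [14.0, 16.0], "BB": [13.0, 15.0]}
a, d = additive_dominance(toy)
print(a, d)
```

    A real analysis would fit the covariates jointly, which requires a general linear model rather than this shortcut.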

  4. Genetic architecture of clinical mastitis traits in dairy cattle

    DEFF Research Database (Denmark)

    Sahana, Goutam; Guldbrandtsen, Bernt; Lund, Mogens Sandø

    2012-01-01

    investigate the genetic architecture of clinical mastitis and somatic cell score traits in dairy cattle using a high density (HD) SNP panel. Mastitis, an inflammation of the mammary gland most commonly caused by bacterial infection, is a frequent disease in dairy cattle. Clinical mastitis and somatic cell...... score from first three lactations were studied for association with SNP markers in 4,200 progeny-tested Nordic Holstein bulls. Single trait breeding values were used as phenotypes. All the individuals were genotyped with BovineSNP50 Beadchip. Part of this population was also genotyped with the Bovine...... mixed model analysis. After Bonferroni correction 12, 372 SNP exhibited genome-wide significant associations with mastitis related traits. A total 61 QTL regions on 22 chromosomes associated with mastitis related traits were identified. The SNP with highest effect explained 5.6% of the variance...

  5. The analysis of correlation between IL-1B gene expression and genotyping in multiple sclerosis patients.

    Science.gov (United States)

    Heidary, Masoumeh; Rakhshi, Nahid; Pahlevan Kakhki, Majid; Behmanesh, Mehrdad; Sanati, Mohammad Hossein; Sanadgol, Nima; Kamaladini, Hossein; Nikravesh, Abbas

    2014-08-15

    IL-1B is released by monocytes, astrocytes and brain endothelial cells and seems to be involved in inflammatory reactions of the central nervous system (CNS) in multiple sclerosis (MS). This study aims to evaluate the expression level of IL-1B mRNA in peripheral blood mononuclear cells (PBMCs), genotype the rs16944 SNP and determine the effect of this SNP on the expression level of IL-1B in MS patients. We found that the expression level of IL-1B in PBMCs was 3.336-fold higher in MS patients than in controls, but the rs16944 SNP in the promoter region of IL-1B did not affect the expression level of this gene, and there was no association of this SNP with MS in the examined population. Also, our data did not reveal any correlation between the normalized expression of the IL-1B gene and the age of participants, age of onset, or disease duration. Copyright © 2014 Elsevier B.V. All rights reserved.

  6. Physical mapping of QTL for tuber yield, starch content and starch yield in tetraploid potato (Solanum tuberosum L.) by means of genome wide genotyping by sequencing and the 8.3 K SolCAP SNP array.

    Science.gov (United States)

    Schönhals, Elske Maria; Ding, Jia; Ritter, Enrique; Paulo, Maria João; Cara, Nicolás; Tacke, Ekhard; Hofferbert, Hans-Reinhard; Lübeck, Jens; Strahwald, Josef; Gebhardt, Christiane

    2017-08-22

    Tuber yield and starch content of the cultivated potato are complex traits of decisive importance for breeding improved varieties. Natural variation of tuber yield and starch content depends on the environment and on multiple, mostly unknown genetic factors. Dissection and molecular identification of the genes and their natural allelic variants controlling these complex traits will lead to the development of diagnostic DNA-based markers, by which precision and efficiency of selection can be increased (precision breeding). Three case-control populations were assembled from tetraploid potato cultivars based on maximizing the differences between high and low tuber yield (TY), starch content (TSC) and starch yield (TSY, arithmetic product of TY and TSC). The case-control populations were genotyped by restriction-site associated DNA sequencing (RADseq) and the 8.3 k SolCAP SNP genotyping array. The allele frequencies of single nucleotide polymorphisms (SNPs) were compared between cases and controls. RADseq identified, depending on data filtering criteria, between 6664 and 450 genes with one or more differential SNPs for one, two or all three traits. Differential SNPs in 275 genes were detected using the SolCAP array. A genome wide association study using the SolCAP array on an independent, unselected population identified SNPs associated with tuber starch content in 117 genes. Physical mapping of the genes containing differential or associated SNPs, and comparisons between the two genome wide genotyping methods and two different populations identified genome segments on all twelve potato chromosomes harboring one or more quantitative trait loci (QTL) for TY, TSC and TSY. Several hundred genes control tuber yield and starch content in potato. They are unequally distributed on all potato chromosomes, forming clusters between 0.5-4 Mbp width. The largest fraction of these genes had unknown function, followed by genes with putative signalling and regulatory functions. 

  7. A study on association of SNP-43 polymorphism in Calpain-10 gene with type 2 diabetes mellitus in the population of Eastern Azerbaijan province

    Directory of Open Access Journals (Sweden)

    Fatemeh Bahreini

    2012-04-01

    Full Text Available Background: Genome-wide analyses of genetic predisposition to type 2 diabetes mellitus in different populations have yielded variable results. Calpain10, a gene that encodes a non-lysosomal cysteine protease, has recently been proposed as a type 2 diabetes susceptibility gene in the non-insulin-dependent diabetes mellitus NIDDM1 region. The aim of the study is to evaluate the relation between SNP43 and type 2 diabetes. Methods: A total of 102 diabetic subjects and 100 non-diabetic controls were enrolled in a case-control study in Eastern Azerbaijan Province. The polymerase chain reaction restriction fragment length polymorphism technique (PCR-RFLP) was applied. The χ² test and logistic regression were used to analyze the data. Results: The genotype distribution of the calpain10 gene in the control group was 11 (11%), 86 (86%) and 3 (3%) for the A/G, G/G and A/A genotypes, respectively. In the diabetic group the genotype distribution was 7 (6.9%), 95 (93.1%) and zero, respectively. The G allele frequency differed significantly between the case and control groups. Conclusion: Because the G allele is a risk factor for type 2 diabetes, SNP43 of the calpain10 gene was significantly associated with type 2 diabetes in Eastern Azerbaijan.
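
    The allele-frequency comparison can be retraced from the genotype counts reported in this abstract. The sketch below is an assumption about the form of the test: it uses a plain Pearson chi-square on a 2×2 allele table without continuity correction, whereas the abstract does not say exactly how the authors set up their test. The function names are illustrative.

```python
import math

def allele_counts(n_gg, n_ag, n_aa):
    """Return (G count, A count) from genotype counts at a biallelic SNP."""
    return 2 * n_gg + n_ag, 2 * n_aa + n_ag

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

def p_value_df1(chi2):
    """Upper-tail p-value of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2))

# Genotype counts as reported in the abstract
cases = allele_counts(n_gg=95, n_ag=7, n_aa=0)       # 102 diabetic subjects
controls = allele_counts(n_gg=86, n_ag=11, n_aa=3)   # 100 controls
chi2 = chi_square_2x2(*cases, *controls)
print(cases, controls, round(chi2, 2), round(p_value_df1(chi2), 3))
```

    Under these assumptions the G allele accounts for 197 of 204 alleles in cases and 183 of 200 in controls, and the resulting statistic falls below the conventional 0.05 threshold, which is consistent with the significant difference the abstract reports.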

  8. Identification of novel single nucleotide polymorphisms (SNPs) in deer (Odocoileus spp.) using the BovineSNP50 BeadChip.

    Directory of Open Access Journals (Sweden)

    Gwilym D Haynes

    Full Text Available Single nucleotide polymorphisms (SNPs) are growing in popularity as a genetic marker for investigating evolutionary processes. A panel of SNPs is often developed by comparing large quantities of DNA sequence data across multiple individuals to identify polymorphic sites. For non-model species, this is particularly difficult, as performing the necessary large-scale genomic sequencing often exceeds the resources available for the project. In this study, we trial the Bovine SNP50 BeadChip developed in cattle (Bos taurus) for identifying polymorphic SNPs in cervids Odocoileus hemionus (mule deer and black-tailed deer) and O. virginianus (white-tailed deer) in the Pacific Northwest. We found that 38.7% of loci could be genotyped, of which 5% (n = 1068) were polymorphic. Of these 1068 polymorphic SNPs, a mixture of putatively neutral loci (n = 878) and loci under selection (n = 190) were identified with the F(ST)-outlier method. A range of population genetic analyses were implemented using these SNPs and a panel of 10 microsatellite loci. The three types of deer could readily be distinguished with both the SNP and microsatellite datasets. This study demonstrates that commercially developed SNP chips are a viable means of SNP discovery for non-model organisms, even when used between very distantly related species (the Bovidae and Cervidae families diverged some 25.1-30.1 million years before present).

  9. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    NARCIS (Netherlands)

    Chagné, D.; Crowhurst, R.N.; Troggio, M.; Davey, M.W.; Gilmore, B.; Lawley, C.; Vanderzande, S.; Hellens, R.P.; Kumar, S.; Cestaro, A.; Velasco, R.; Main, D.; Rees, J.D.; Iezzoni, A.F.; Mockler, T.; Wilhelm, L.; Weg, van de W.E.; Gardiner, S.E.; Bassil, N.; Peace, C.

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide

  10. rSNPBase 3.0: an updated database of SNP-related regulatory elements, element-gene pairs and SNP-based gene regulatory networks.

    Science.gov (United States)

    Guo, Liyuan; Wang, Jing

    2018-01-04

    Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation databases rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of analysis scope from SNP-related regulatory elements to include regulatory element-target gene pairs (E-G pairs); it can therefore provide SNP-based gene regulatory networks. (ii) The web functions were modified according to data content, and a new network search module is provided in rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data queries for detailed information (related elements, element-gene pairs, and other extended annotations) on specific SNPs and for SNP-related graphic networks constructed from interacting transcription factors (TFs), miRNAs and genes. (iii) The types of regulatory elements were modified and enriched. To the best of our knowledge, the updated rSNPBase 3.0 is the first data tool that supports SNP functional analysis from a regulatory network perspective; it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. A functional SNP associated with atopic dermatitis controls cell type-specific methylation of the VSTM1 gene locus

    Directory of Open Access Journals (Sweden)

    Dilip Kumar

    2017-02-01

    Full Text Available Abstract Background: Expression quantitative trait loci (eQTL) databases represent a valuable resource to link disease-associated SNPs to specific candidate genes whose gene expression is significantly modulated by the SNP under investigation. We previously identified signal inhibitory receptor on leukocytes-1 (SIRL-1) as a powerful regulator of human innate immune cell function. While it is constitutively highly expressed on neutrophils, on monocytes the SIRL-1 surface expression varies strongly between individuals. The underlying mechanism of regulation, its genetic control and its potential clinical implications had not yet been explored. Methods: Whole blood eQTL data of a Chinese cohort were used to identify SNPs regulating the expression of VSTM1, the gene encoding SIRL-1. The genotype effect was validated by flow cytometry (cell surface expression) and correlated with electrophoretic mobility shift assay (EMSA), chromatin immunoprecipitation (ChIP) and bisulfite sequencing (C-methylation), and its functional impact was studied through the inhibition of reactive oxygen species (ROS). Results: We found a significant association of a single CpG-SNP, rs612529T/C, located in the promoter of VSTM1. Through flow cytometry analysis we confirmed that, primarily in monocytes, the protein level of SIRL-1 is strongly associated with the genotype of this SNP. In monocytes, the T allele of this SNP facilitates binding of the transcription factors YY1 and PU.1, of which the latter has recently been shown to act as a docking site for modifiers of DNA methylation. In line with this notion, rs612529T associates with a complete demethylation of the VSTM1 promoter, correlating with the allele-specific upregulation of SIRL-1 expression. In monocytes, this upregulation strongly impacts the IgA-induced production of ROS by these cells. Through targeted association analysis we found a significant Meta P value of 1.14 × 10⁻⁶ for rs612529 for association to atopic dermatitis (AD

  12. SNPpy--database management for SNP data from genome wide association studies.

    Directory of Open Access Journals (Sweden)

    Faheem Mitha

    Full Text Available BACKGROUND: We describe SNPpy, a hybrid script database system using the Python SQLAlchemy library coupled with the PostgreSQL database to manage genotype data from Genome-Wide Association Studies (GWAS). This system makes it possible to merge study data with HapMap data and merge across studies for meta-analyses, including data filtering based on the values of phenotype and Single-Nucleotide Polymorphism (SNP) data. SNPpy and its dependencies are open source software. RESULTS: The current version of SNPpy offers utility functions to import genotype and annotation data from two commercial platforms. We use these to import data from two GWAS studies and the HapMap Project. We then export these individual datasets to standard data format files that can be imported into statistical software for downstream analyses. CONCLUSIONS: By leveraging the power of relational databases, SNPpy offers integrated management and manipulation of genotype and phenotype data from GWAS studies. The analysis of these studies requires merging across GWAS datasets as well as patient and marker selection. To this end, SNPpy enables the user to filter the data and output the results as standardized GWAS file formats. It does low level and flexible data validation, including validation of patient data. SNPpy is a practical and extensible solution for investigators who seek to deploy central management of their GWAS data.
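
    The general idea of keeping study and reference-panel genotypes in one relational store, then filtering by phenotype, can be sketched as follows. This is not SNPpy's schema or API: the tables, columns, sample identifiers and the `calls_for` helper are all hypothetical, and the sketch swaps in Python's built-in `sqlite3` module for the SQLAlchemy and PostgreSQL stack named in the abstract so that it runs without external dependencies.

```python
import sqlite3

# In-memory database with a hypothetical two-table layout (not SNPpy's schema)
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE sample   (sample_id TEXT PRIMARY KEY, source TEXT, phenotype TEXT);
CREATE TABLE genotype (sample_id TEXT REFERENCES sample(sample_id),
                       snp_id TEXT, alleles TEXT);
""")

# Made-up records: two study samples and one reference-panel sample
conn.executemany("INSERT INTO sample VALUES (?, ?, ?)", [
    ("S1", "study", "case"), ("S2", "study", "control"), ("H1", "reference", None),
])
conn.executemany("INSERT INTO genotype VALUES (?, ?, ?)", [
    ("S1", "rs0001", "AG"), ("S2", "rs0001", "GG"), ("H1", "rs0001", "GG"),
    ("S1", "rs0002", "CC"), ("S2", "rs0002", "CT"),
])

def calls_for(conn, snp_id, phenotype=None):
    """Return (sample_id, source, alleles) rows for one SNP, optionally
    restricted to samples with the given phenotype."""
    sql = ("SELECT s.sample_id, s.source, g.alleles "
           "FROM genotype AS g JOIN sample AS s ON s.sample_id = g.sample_id "
           "WHERE g.snp_id = ?")
    params = [snp_id]
    if phenotype is not None:
        sql += " AND s.phenotype = ?"
        params.append(phenotype)
    sql += " ORDER BY s.sample_id"
    return conn.execute(sql, params).fetchall()

print(calls_for(conn, "rs0001"))           # study and reference samples together
print(calls_for(conn, "rs0001", "case"))   # phenotype-filtered subset
```

    Because study and reference samples share one genotype table, merging is a matter of querying across the `source` column rather than reconciling separate files.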

  13. Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies.

    Science.gov (United States)

    Gimode, Davis; Odeny, Damaris A; de Villiers, Etienne P; Wanyonyi, Solomon; Dida, Mathews M; Mneney, Emmarold E; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M

    2016-01-01

    Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. 
Our results also reveal an unexploited finger millet genetic resource that can be included in the regional

  14. Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies.

    Directory of Open Access Journals (Sweden)

    Davis Gimode

    Full Text Available Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet.
Our results also reveal an unexploited finger millet genetic resource that can be included

  15. Characterizing associations and SNP-environment interactions for GWAS-identified prostate cancer risk markers--results from BPC3.

    Directory of Open Access Journals (Sweden)

    Sara Lindstrom

    2011-02-01

    Full Text Available Genome-wide association studies (GWAS) have identified multiple single nucleotide polymorphisms (SNPs) associated with prostate cancer risk. However, whether these associations can be consistently replicated, vary with disease aggressiveness (tumor stage and grade) and/or interact with non-genetic potential risk factors or other SNPs is unknown. We therefore genotyped 39 SNPs from regions identified by several prostate cancer GWAS in 10,501 prostate cancer cases and 10,831 controls from the NCI Breast and Prostate Cancer Cohort Consortium (BPC3). We replicated 36 out of 39 SNPs (P-values ranging from 0.01 to 10⁻²⁸). Two SNPs located near KLK3 associated with PSA levels showed differential association with Gleason grade (rs2735839, P = 0.0001 and rs266849, P = 0.0004; case-only test), where the alleles associated with decreasing PSA levels were inversely associated with low-grade (as defined by Gleason grade < 8) tumors but positively associated with high-grade tumors. No other SNP showed differential associations according to disease stage or grade. We observed no effect modification by SNP for association with age at diagnosis, family history of prostate cancer, diabetes, BMI, height, smoking or alcohol intake. Moreover, we found no evidence of pair-wise SNP-SNP interactions. While these SNPs represent new independent risk factors for prostate cancer, we saw little evidence for effect modification by other SNPs or by the environmental factors examined.

  16. Partial preferential chromosome pairing is genotype dependent in tetraploid rose.

    Science.gov (United States)

    Bourke, Peter M; Arens, Paul; Voorrips, Roeland E; Esselink, G Danny; Koning-Boucoiran, Carole F S; Van't Westende, Wendy P C; Santos Leonardo, Tiago; Wissink, Patrick; Zheng, Chaozhi; van Geest, Geert; Visser, Richard G F; Krens, Frans A; Smulders, Marinus J M; Maliepaard, Chris

    2017-04-01

    It has long been recognised that polyploid species do not always neatly fall into the categories of auto- or allopolyploid, leading to the term 'segmental allopolyploid' to describe everything in between. The meiotic behaviour of such intermediate species is not fully understood, nor is there consensus as to how to model their inheritance patterns. In this study we used a tetraploid cut rose (Rosa hybrida) population, genotyped using the 68K WagRhSNP array, to construct an ultra-high-density linkage map of all homologous chromosomes using methods previously developed for autotetraploids. Using the predicted bivalent configurations in this population we quantified differences in pairing behaviour among and along homologous chromosomes, leading us to correct our estimates of recombination frequency to account for this behaviour. This resulted in the re-mapping of 25 695 SNP markers across all homologues of the seven rose chromosomes, tailored to the pairing behaviour of each chromosome in each parent. We confirmed the inferred differences in pairing behaviour among chromosomes by examining repulsion-phase linkage estimates, which also carry information about preferential pairing and recombination. Currently, the closest sequenced relative to rose is Fragaria vesca. Aligning the integrated ultra-dense rose map with the strawberry genome sequence provided a detailed picture of the synteny, confirming overall co-linearity but also revealing new genomic rearrangements. Our results suggest that pairing affinities may vary along chromosome arms, which broadens our current understanding of segmental allopolyploidy. © 2017 The Authors The Plant Journal published by John Wiley & Sons Ltd and Society for Experimental Biology.

  17. Efficient genome-wide genotyping strategies and data integration in crop plants.

    Science.gov (United States)

    Torkamaneh, Davoud; Boyle, Brian; Belzile, François

    2018-03-01

    Next-generation sequencing (NGS) has revolutionized plant and animal research by providing powerful genotyping methods. This review describes and discusses the advantages, challenges and, most importantly, solutions to facilitate data processing, the handling of missing data, and cross-platform data integration. Next-generation sequencing technologies provide powerful and flexible genotyping methods to plant breeders and researchers. These methods offer a wide range of applications from genome-wide analysis to routine screening with a high level of accuracy and reproducibility. Furthermore, they provide a straightforward workflow to identify, validate, and screen genetic variants in a short time and at low cost. NGS-based genotyping methods include whole-genome re-sequencing, SNP arrays, and reduced representation sequencing, which are widely applied in crops. The main challenge facing breeders and geneticists today is how to choose an appropriate genotyping method and how to integrate genotyping data sets obtained from various sources. Here, we review and discuss the advantages and challenges of several NGS methods for genome-wide genetic marker development and genotyping in crop plants. We also discuss how imputation methods can be used both to fill in missing data in genotypic data sets and to integrate data sets obtained using different genotyping tools. It is our hope that this synthetic view of genotyping methods will help geneticists and breeders to integrate these NGS-based methods in crop plant breeding and research.
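
    To make the notion of filling in missing genotype calls concrete, here is a deliberately naive sketch that replaces each missing call with the most frequent observed call at that marker. It is an assumption-laden baseline, not a method from the review: practical imputation tools exploit linkage disequilibrium and reference panels. The function name and the 0/1/2 genotype coding are illustrative choices.

```python
from collections import Counter

def impute_most_frequent(matrix):
    """Fill missing genotype calls with the most frequent observed call per marker.

    matrix is a list of rows (one per sample); each row holds genotype codes
    (0, 1, 2 = copies of the alternate allele) or None for a missing call.
    Returns a new matrix and leaves the input untouched.
    """
    n_markers = len(matrix[0])
    filled = [list(row) for row in matrix]
    for j in range(n_markers):
        observed = [row[j] for row in matrix if row[j] is not None]
        if not observed:
            continue  # no information at this marker; leave it missing
        most_common_call = Counter(observed).most_common(1)[0][0]
        for row in filled:
            if row[j] is None:
                row[j] = most_common_call
    return filled

# Toy genotype matrix: 3 samples x 3 markers, with one missing call per marker
toy_matrix = [
    [0, 1, None],
    [0, None, 2],
    [None, 1, 2],
]
print(impute_most_frequent(toy_matrix))
```

    A per-marker fill like this ignores correlations between neighbouring markers, which is precisely the information that dedicated imputation methods use.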

  18. Population genetic analysis of ascertained SNP data

    Directory of Open Access Journals (Sweden)

    Nielsen Rasmus

    2004-03-01

    Full Text Available Abstract The large single nucleotide polymorphism (SNP) typing projects have provided an invaluable data resource for human population geneticists. Almost all of the available SNP loci, however, have been identified through a SNP discovery protocol that will influence the allelic distributions in the sampled loci. Standard methods for population genetic analysis based on the available SNP data will, therefore, be biased. This paper discusses the effect of this ascertainment bias on allelic distributions and on methods for quantifying linkage disequilibrium and estimating demographic parameters. Several recently developed methods for correcting for the ascertainment bias will also be discussed.

  19. A novel synonymous SNP (A47A) of the TMEM95 gene is significantly associated with the reproductive traits related to testis in male piglets

    Directory of Open Access Journals (Sweden)

    L. Liu

    2017-07-01

    Full Text Available Transmembrane protein 95 (TMEM95) is located on the acrosomal membrane of the sperm head involved in the acrosome reaction; thus, it is regarded as affecting spermatogenesis and reproduction traits. The aim of this study was to explore the novel single nucleotide polymorphisms (SNPs) within the pig TMEM95 gene as well as to evaluate their associations with the testicular sizes in male Landrace (LD) and Large White (LW) breeds. After pool sequencing and bioinformatics analysis, only one novel coding SNP was found in exon 1, namely NC_010454.3: g.341T > C, resulting in a synonymous mutation (A47A). This SNP could be genotyped using the StuI polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) assay. The minor allelic frequencies (MAFs) were 0.259 and 0.480 in the LD and LW breeds. Their polymorphism information content (PIC) values were 0.310 and 0.375. The LW population was at the Hardy-Weinberg equilibrium (HWE) (p > 0.05), whereas the LD population was not (p < 0.05). Association analyses demonstrated that a significant relationship was found between this A47A polymorphism and testis weight at 40 days of age in the LW population (p = 0.047), and the heterozygote individuals showed lower testis weight than those with other genotypes. Moreover, this SNP was significantly associated with three testis measurement traits at 15 days of age in the LW population (p < 0.05); the individuals with genotypes TT and TC showed consistently superior testis measurement traits to those with genotype CC. These findings demonstrate that the A47A polymorphism had a significant effect on testis measurement traits, suggesting that the TMEM95 gene could be a candidate gene associated with reproductive traits. These results could contribute to breeding and genetics programs in the pig industry via DNA marker-assisted selection (MAS).
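
    The PIC values quoted in this abstract can be related to the minor allele frequencies with a short calculation. The sketch assumes the standard polymorphism information content formula, PIC = 1 − Σpᵢ² − ΣΣ 2pᵢ²pⱼ² (summed over allele pairs i < j); the abstract does not state which formula was used, although the reported figures are consistent with this one. The function names are illustrative.

```python
def pic(allele_freqs):
    """Polymorphism information content for one locus.

    Implements PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2,
    where allele_freqs are the allele frequencies and sum to 1.
    """
    freqs = list(allele_freqs)
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(
        2 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return 1 - homozygosity - cross_term

def pic_from_maf(maf):
    """PIC of a biallelic SNP given its minor allele frequency."""
    return pic([maf, 1 - maf])

# Minor allele frequencies reported in the abstract for the two breeds
for breed, maf in [("LD", 0.259), ("LW", 0.480)]:
    print(breed, round(pic_from_maf(maf), 3))
```

    For a biallelic marker the formula peaks at 0.375 when both alleles have frequency 0.5, which is why the LW value, with an MAF close to 0.5, sits at that ceiling.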

  20. Genetic relationships among Vietnamese local pigs investigated using genome-wide SNP markers.

    Science.gov (United States)

    Ishihara, S; Arakawa, A; Taniguchi, M; Luu, Q M; Pham, D L; Nguyen, B V; Mikawa, S; Kikuchi, K

    2018-02-01

    Vietnam is one of the most important countries for pig domestication, and a total of 26 local breeds have been reported. In the present study, genetic relationships among the various pig breeds were investigated using 90 samples collected from local pigs (15 breeds) in 15 distantly separated, distinct areas of the country and six samples from Landrace pigs in Hanoi as an out-group of a common Western breed. All samples were genotyped using the Illumina Porcine SNP60 v2 Genotyping BeadChip. We used 15 160-15 217 SNPs that showed a high degree of polymorphism in the Vietnamese breeds for identifying genetic relationships among the Vietnamese breeds. Principal components analysis showed that most pigs indigenous to Vietnam formed clusters correlated with their original geographic locations. Some Vietnamese breeds formed a cluster that was genetically related to the Western breed Landrace, suggesting the possibility of crossbreeding. These findings will be useful for the conservation and management of Vietnamese local pig breeds. © 2018 Stichting International Foundation for Animal Genetics.

  1. High density FTA plates serve as efficient long-term sample storage for HLA genotyping.

    Science.gov (United States)

    Lange, V; Arndt, K; Schwarzelt, C; Boehme, I; Giani, A S; Schmidt, A H; Ehninger, G; Wassmuth, R

    2014-02-01

    Storage of dried blood spots (DBS) on high-density FTA® plates could constitute an appealing alternative to frozen storage. However, it remains controversial whether DBS are suitable for high-resolution sequencing of human leukocyte antigen (HLA) alleles. Therefore, we extracted DNA from DBS that had been stored for up to 4 years, using six different methods. We identified those extraction methods that recovered sufficient high-quality DNA for reliable high-resolution HLA sequencing. Further, we confirmed that frozen whole blood samples that had been stored for several years can be transferred to filter paper without compromising HLA genotyping upon extraction. In conclusion, DNA derived from high-density FTA® plates is suitable for high-resolution HLA sequencing, provided that appropriate extraction protocols are employed. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  2. Report on ISFG SNP Panel Discussion

    DEFF Research Database (Denmark)

    Butler, John M.; Budowle, B.; Gill, P.

    2008-01-01

    Six scientists presented their views and experience with single nucleotide polymorphism (SNP) markers, multiplexes, and methods regarding their potential application in forensic identity and relationship testing. Benefits and limitations of SNPs were reviewed, as were different SNP marker...

  3. Fine scale mapping of the 17q22 breast cancer locus using dense SNPs, genotyped within the Collaborative Oncological Gene-Environment Study (COGs).

    Science.gov (United States)

    Darabi, Hatef; Beesley, Jonathan; Droit, Arnaud; Kar, Siddhartha; Nord, Silje; Moradi Marjaneh, Mahdi; Soucy, Penny; Michailidou, Kyriaki; Ghoussaini, Maya; Fues Wahl, Hanna; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Alonso, M Rosario; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W; Benitez, Javier; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Chang-Claude, Jenny; Choi, Ji-Yeob; Conroy, Don M; Couch, Fergus J; Cox, Angela; Cross, Simon S; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Easton, Douglas F; Fasching, Peter A; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Galle, Eva; García-Closas, Montserrat; Giles, Graham G; Goldberg, Mark S; González-Neira, Anna; Guénel, Pascal; Haiman, Christopher A; Hallberg, Emily; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kang, Daehee; Khan, Sofia; Kosma, Veli-Matti; Kriege, Mieke; Kristensen, Vessela; Lambrechts, Diether; Le Marchand, Loic; Lee, Soo Chin; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Matsuo, Keitaro; Mayes, Rebecca; McKay, James; Meindl, Alfons; Milne, Roger L; Muir, Kenneth; Neuhausen, Susan L; Nevanlinna, Heli; Olswold, Curtis; Orr, Nick; Peterlongo, Paolo; Pita, Guillermo; Pylkäs, Katri; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Southey, Melissa C; Stram, Daniel O; Surowy, Harald; Swerdlow, Anthony; Teo, Soo H; Tessier, Daniel C; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Vachon, Celine M; Vincent, Daniel; Winqvist, Robert; Wu, Anna H; Wu, Pei-Ei; Yip, Cheng Har; Zheng, Wei; Pharoah, Paul D P; Hall, Per; Edwards, Stacey L; Simard, Jacques; French, Juliet D; Chenevix-Trench, Georgia; Dunning, Alison M

    2016-09-07

    Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90-0.94; P = 8.96 × 10^-15)) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10^-9, r² = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10^-11, r² = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus.

  4. Genomic variation in myeloma: design, content, and initial application of the Bank On A Cure SNP Panel to detect associations with progression-free survival

    Directory of Open Access Journals (Sweden)

    Fang Gang

    2008-09-01

    Full Text Available Abstract Background We have engaged in an international program designated the Bank On A Cure, which has established DNA banks from multiple cooperative and institutional clinical trials, and a platform for examining the association of genetic variations with disease risk and outcomes in multiple myeloma. We describe the development and content of a novel custom SNP panel that contains 3404 SNPs in 983 genes, representing cellular functions and pathways that may influence disease severity at diagnosis, toxicity, progression or other treatment outcomes. A systematic search of national databases was used to identify non-synonymous coding SNPs and SNPs within transcriptional regulatory regions. To explore SNP associations with PFS we compared SNP profiles of short-term (less than 1 year, n = 70) versus long-term progression-free survivors (greater than 3 years, n = 73) in two phase III clinical trials. Results Quality controls were established, demonstrating an accurate and robust screening panel for genetic variations, and some initial racial comparisons of allelic variation were done. A variety of analytical approaches, including machine learning tools for data mining and recursive partitioning analyses, demonstrated predictive value of the SNP panel in survival. While the entire SNP panel showed genotype predictive association with PFS, some SNP subsets were identified within drug response, cellular signaling and cell cycle genes. Conclusion A targeted gene approach was undertaken to develop an SNP panel that can test for associations with clinical outcomes in myeloma. The initial analysis provided some predictive power, demonstrating that genetic variations in the myeloma patient population may influence PFS.

  5. Murine Double Minute 2 SNP T309G Polymorphism and Urinary Tract Cancer Risk: A Meta-Analysis.

    Science.gov (United States)

    Ding, Hui; Dai, Yu; Ning, Zhongyun; Fan, Ning; Wang, Zhiping; Li, Pei; Zhang, Liyuan; Tao, Yan; Wang, Hanzhang

    2016-03-01

    Urinary tract cancer is a common cause of cancer-related death. The etiology and pathogenesis of urinary tract cancer remain unclear, with genetic and epigenetic factors playing an important role. Studies of the polymorphism of murine double minute 2 (MDM2) have shown inconclusive trends in the risk of urinary tract cancer. To clarify this inconsistency, we conducted updated meta-analyses to evaluate the role of MDM2 T309G polymorphism in urinary tract cancer susceptibility. Data sources were Pubmed (1966-May 2015), Chinese biomedicine literature database (1978-May 2015), and hand searching of the reference lists of included studies: (1) research categories case-control study or a nested case-control study; (2) information evaluating the association between the MDM2 SNP309 and urinary tract cancer risk; (3) studies with sufficient data to perform a meta-analysis. It included the use of odds ratios (ORs) to assess the strength of the association, and 95% confidence intervals (CIs) give a sense of the precision of the estimate. We used I² for the assessment of between-study heterogeneity, and publication bias was assessed using the funnel plot and the Egger test. Statistical analyses were performed by Review Manager, version 5.0 and Stata 11.0. A total of 18 studies met the eligibility criteria and were included in our analyses. Overall, there was no statistical association between MDM2 SNP309 and prostate cancer risk for the allele contrast, the GG genotype, the recessive genetic model, the dominant genetic model, and prostate cancer risk in all subjects (OR = 0.96, 95% CI 0.87-1.05, P = 0.36; OR = 0.93, 95% CI 0.75-1.15, P = 0.50; OR = 1.00, 95% CI 0.87-1.15, P = 0.99; OR = 0.93, 95% CI 0.80-1.07, P = 0.30), and between MDM2 SNP309 and bladder cancer risk (the allele contrast: OR = 1.06, 95% CI 0.89-1.27, P = 0.50; the GG genotype: OR = 1.12, 95% CI 0.79-1.61, P = 0.52; the dominant genetic model: OR = 1.03, 95% CI 0
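
    The quantities named in this abstract (odds ratio, 95% confidence interval, I² heterogeneity) can be illustrated with a minimal sketch. It is not the authors' RevMan or Stata workflow: the function names, the Wald interval, and the fixed-effect inverse-variance weighting are assumptions chosen for illustration.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald 95% CI from a 2x2 table (all cells must be > 0).

    a = cases with the allele, b = cases without,
    c = controls with the allele, d = controls without.
    """
    or_ = (a * d) / (b * c)
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # SE of log(OR)
    lo = math.exp(math.log(or_) - z * se)
    hi = math.exp(math.log(or_) + z * se)
    return or_, lo, hi


def i_squared(log_ors, ses):
    """I^2 (percent) from per-study log odds ratios and their standard errors.

    Cochran's Q is computed with fixed-effect inverse-variance weights,
    then I^2 = max(0, (Q - df) / Q) * 100.
    """
    w = [1 / s ** 2 for s in ses]
    pooled = sum(wi * y for wi, y in zip(w, log_ors)) / sum(w)
    q = sum(wi * (y - pooled) ** 2 for wi, y in zip(w, log_ors))
    df = len(log_ors) - 1
    if q <= 0:
        return 0.0
    return max(0.0, (q - df) / q) * 100
```

    An interval that spans 1, as in every result quoted above, corresponds to no statistically significant association at the 5% level.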

  6. Imputation of single nucleotide polymorphism genotypes of Hereford cattle: reference panel size, family relationship and population structure

    Science.gov (United States)

    The objective of this study is to investigate single nucleotide polymorphism (SNP) genotypes imputation of Hereford cattle. Purebred Herefords were from two sources, Line 1 Hereford (N=240) and representatives of Industry Herefords (N=311). Using different reference panels of 62 and 494 males with 1...

  7. High-density genetic mapping identifies new susceptibility loci for rheumatoid arthritis

    NARCIS (Netherlands)

    Eyre, Steve; Bowes, John; Diogo, Dorothee; Lee, Annette; Barton, Anne; Martin, Paul; Zhernakova, Alexandra; Stahl, Eli; Viatte, Sebastien; McAllister, Kate; Amos, Christopher I.; Padyukov, Leonid; Toes, Rene E. M.; Huizinga, Tom W. J.; Wijmenga, Cisca; Trynka, Gosia; Franke, Lude; Westra, Harm-Jan; Alfredsson, Lars; Hu, Xinli; Sandor, Cynthia; de Bakker, Paul I. W.; Davila, Sonia; Khor, Chiea Chuen; Heng, Khai Koon; Andrews, Robert; Edkins, Sarah; Hunt, Sarah E.; Langford, Cordelia; Symmons, Deborah; Concannon, Pat; Onengut-Gumuscu, Suna; Rich, Stephen S.; Deloukas, Panos; Gonzalez-Gay, Miguel A.; Rodriguez-Rodriguez, Luis; Arlsetig, Lisbeth; Martin, Javier; Rantapaa-Dahlqvist, Solbritt; Plenge, Robert M.; Raychaudhuri, Soumya; Klareskog, Lars; Gregersen, Peter K.; Worthington, Jane

    2012-01-01

    Using the Immunochip custom SNP array, which was designed for dense genotyping of 186 loci identified through genome-wide association studies (GWAS), we analyzed 11,475 individuals with rheumatoid arthritis (cases) of European ancestry and 15,870 controls for 129,464 markers. We combined these data

  8. Comparison of SNP Variation and Distribution in Indigenous Ethiopian and Korean Cattle (Hanwoo) Populations

    Directory of Open Access Journals (Sweden)

    Zewdu Edea

    2012-09-01

    Full Text Available Although a large number of single nucleotide polymorphisms (SNPs) have been identified from the bovine genome-sequencing project, few of these have been validated at large in Bos indicus breeds. We have genotyped 192 animals, representing 5 cattle populations of Ethiopia, with the Illumina Bovine 8K SNP BeadChip. These include 1 Sanga (Danakil), 3 zebu (Borana, Arsi and Ambo), and 1 zebu × Sanga intermediate (Horro) breeds. The Hanwoo (Bos taurus) was included for comparison purposes. Analysis of 7,045 SNP markers revealed that the mean minor allele frequency (MAF) was 0.23, 0.22, 0.21, 0.21, 0.23, and 0.29 for Ambo, Arsi, Borana, Danakil, Horro, and Hanwoo, respectively. Significant differences of MAF were observed between the indigenous Ethiopian cattle populations and Hanwoo breed (p < 0.001). Across the Ethiopian cattle populations, a common variant MAF (≥0.10 and ≤0.5) accounted for an overall estimated 73.79% of the 7,045 SNPs. The Hanwoo displayed a higher proportion of common variant SNPs (90%). Investigation within Ethiopian cattle populations showed that on average, 16.64% of the markers were monomorphic, but in the Hanwoo breed, only 6% of the markers were monomorphic. Across the sampled Ethiopian cattle populations, the mean observed and expected heterozygosities were 0.314 and 0.313, respectively. The level of SNP variation identified in this particular study highlights that these markers can be potentially used for genetic studies in African cattle breeds.
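
    The per-marker statistics reported here (MAF, monomorphic and common-variant status, observed and expected heterozygosity) can be sketched as follows. The 0/1/2 genotype coding, the function name, and the returned dictionary are assumptions made for illustration; the study's actual software is not stated in the abstract.

```python
def snp_summary(genotypes):
    """Summarise one biallelic SNP across animals.

    genotypes: list of 0, 1 or 2 (copies of allele B per animal);
    None marks a missing call and is ignored.
    """
    called = [g for g in genotypes if g is not None]
    n = len(called)
    if n == 0:
        raise ValueError("no called genotypes for this SNP")
    p = sum(called) / (2 * n)          # frequency of allele B
    maf = min(p, 1 - p)                # minor allele frequency
    h_obs = sum(1 for g in called if g == 1) / n  # observed heterozygosity
    h_exp = 2 * p * (1 - p)            # expected under Hardy-Weinberg
    return {
        "maf": maf,
        "h_obs": h_obs,
        "h_exp": h_exp,
        "monomorphic": maf == 0.0,
        "common": maf >= 0.10,         # common-variant threshold used in the abstract
    }
```

    Averaging `maf`, `h_obs` and `h_exp` over all markers within one population would give population-level figures of the kind quoted above.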

  9. saSNP Approach for Scalable SNP Analyses of Multiple Bacterial or Viral Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Gardner, Shea [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Slezak, Tom [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

    2010-07-27

    With the flood of whole genome finished and draft microbial sequences, we need faster, more scalable bioinformatics tools for sequence comparison. An algorithm is described to find single nucleotide polymorphisms (SNPs) in whole genome data. It scales to hundreds of bacterial or viral genomes, and can be used for finished and/or draft genomes available as unassembled contigs. The method is fast to compute, finding SNPs and building a SNP phylogeny in seconds to hours. We use it to identify thousands of putative SNPs from all publicly available Filoviridae, Poxviridae, foot-and-mouth disease virus, Bacillus, and Escherichia coli genomes and plasmids. The SNP-based trees that result are consistent with known taxonomy and trees determined in other studies. The approach we describe can handle as input hundreds of gigabases of sequence in a single run. The algorithm is based on k-mer analysis using a suffix array, so we call it saSNP.
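
    As a rough illustration of k-mer-based SNP discovery, the sketch below reports positions where two sequences share identical k-mers on both sides of a differing base. It is not the saSNP algorithm: it uses a plain dictionary rather than a suffix array, compares only two sequences, and the function name and default k are hypothetical.

```python
def kmer_flanked_snps(seq_a, seq_b, k=4):
    """Return (pos_a, pos_b, base_a, base_b) for bases that differ
    while the k bases on each side are identical in both sequences."""
    # Index every (left k-mer, right k-mer) context found in seq_b.
    contexts = {}
    for j in range(k, len(seq_b) - k):
        key = (seq_b[j - k:j], seq_b[j + 1:j + 1 + k])
        contexts.setdefault(key, []).append(j)

    hits = []
    for i in range(k, len(seq_a) - k):
        key = (seq_a[i - k:i], seq_a[i + 1:i + 1 + k])
        for j in contexts.get(key, []):
            if seq_a[i] != seq_b[j]:
                hits.append((i, j, seq_a[i], seq_b[j]))
    return hits
```

    Because matching relies on flanking context rather than a global alignment, the same idea applies to unassembled contigs, which is the property the abstract emphasises.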

  10. Association of yield-related traits in founder genotypes and derivatives of common wheat (Triticum aestivum L.)

    Directory of Open Access Journals (Sweden)

    Jie Guo

    2018-02-01

    Full Text Available Abstract Background Yield improvement is an ever-important objective of wheat breeding. Studying and understanding the phenotypes and genotypes of yield-related traits has potential for genetic improvement of crops. Results The genotypes of 215 wheat cultivars including 11 founder parents and 106 derivatives were analyzed by the 9 K wheat SNP iSelect assay. A total of 4138 polymorphic single nucleotide polymorphism (SNP) loci were detected on 21 chromosomes, of which 3792 were mapped to single chromosome locations. All genotypes were phenotyped for six yield-related traits including plant height (PH), spike length (SL), spikelet number per spike (SNPS), kernel number per spike (KNPS), kernel weight per spike (KWPS), and thousand kernel weight (TKW) in six irrigated environments. Genome-wide association analysis detected 117 significant associations of 76 SNPs on 15 chromosomes with phenotypic explanation rates (R²) ranging from 2.03 to 12.76%. In comparing allelic variation between founder parents and their derivatives (106) and other cultivars (98) using the 76 associated SNPs, we found that the region 116.0–133.2 cM on chromosome 5A in founder parents and derivatives carried alleles positively influencing kernel weight per spike (KWPS), rarely found in other cultivars. Conclusion The identified favorable alleles could mark important chromosome regions in derivatives that were inherited from founder parents. Our results unravel the genetic basis of yield in founder genotypes, and provide tools for marker-assisted selection for yield improvement.

  11. GMFilter and SXTestPlate: software tools for improving the SNPlex™ genotyping system

    Directory of Open Access Journals (Sweden)

    Schreiber Stefan

    2009-03-01

    Full Text Available Abstract Background Genotyping of single-nucleotide polymorphisms (SNPs) is a fundamental technology in modern genetics. The SNPlex™ mid-throughput genotyping system (Applied Biosystems, Foster City, CA, USA) enables the multiplexed genotyping of up to 48 SNPs simultaneously in a single DNA sample. The high level of automation and the large amount of data produced in a high-throughput laboratory require advanced software tools for quality control and workflow management. Results We have developed two programs, which address two main aspects of quality control in a SNPlex™ genotyping environment: GMFilter improves the analysis of SNPlex™ plates by removing wells with a low overall signal intensity. It enables scientists to automatically process the raw data in a standardized way before analyzing a plate with the proprietary GeneMapper software from Applied Biosystems. SXTestPlate examines the genotype concordance of a SNPlex™ test plate, which was typed with a control SNP set. This program allows for regular quality control checks of a SNPlex™ genotyping platform. It is compatible with other genotyping methods as well. Conclusion GMFilter and SXTestPlate provide a valuable tool set for laboratories engaged in genotyping based on the SNPlex™ system. The programs enhance the analysis of SNPlex™ plates with the GeneMapper software and enable scientists to evaluate the performance of their genotyping platform.
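
    The two quality-control ideas described here, dropping low-signal wells and scoring genotype concordance against a control set, can be sketched in a few lines. This is not the GMFilter or SXTestPlate code: the data layout, function names, and the `min_signal` threshold are hypothetical.

```python
def filter_wells(signal_by_well, min_signal):
    """Keep only wells whose overall signal intensity reaches min_signal."""
    return {well: s for well, s in signal_by_well.items() if s >= min_signal}


def concordance(observed, expected):
    """Fraction of (sample, SNP) calls that match the control genotypes.

    Both arguments map (sample, snp) -> genotype string such as "AG";
    None marks a failed call. Allele order is ignored ("AG" == "GA").
    Returns None when no call is comparable.
    """
    shared = [
        key for key in observed
        if key in expected
        and observed[key] is not None
        and expected[key] is not None
    ]
    if not shared:
        return None
    matches = sum(
        1 for key in shared if sorted(observed[key]) == sorted(expected[key])
    )
    return matches / len(shared)
```

    Tracking the concordance value across repeated test plates is one way to notice drift in a genotyping platform over time.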

  12. The role of biochemical variations and genotype testing in determining the virological response of patients infected with hepatitis C virus

    Directory of Open Access Journals (Sweden)

    Abid Shoukat

    2018-01-01

    Full Text Available Background: In hepatitis C virus (HCV) infection, viral and IL28B genotype along with many clinical and biochemical factors can influence response rates to pegylated interferon plus ribavirin (Peg-IFN-a/R) therapy and progression to chronic hepatitis C (CHC). Aims: The present study was conducted to determine the effect of biochemical and risk factors on treatment outcome in CHC patients in relation to their viral and host genotype. Settings and Design: The present study was a prospective Peg-IFN efficacy study consisting of Peg-IFN-a/R therapy for 24–48 weeks including 250 HCV infected patients. Materials and Methods: Biochemical parameters were determined by Beckman Coulter AU680 automated analyzer. HCV and Interleukin 28B (IL28B) genotyping were carried out by polymerase chain reaction-restriction fragment length polymorphism and viral load was determined by quantitative real-time PCR. Results: Wild-type genotypes outnumbered the variant genotypes at the rs12979860, rs12980275, and rs8099917 SNPs of the IL28B gene. Sustained virological response (SVR) and viral genotype were significantly associated with age, hepatic steatosis, low-grade varices, and serum aspartate transaminase levels at the end of treatment (P < 0.05). In addition, SVR was significantly influenced by body mass index (BMI), insulin resistance, serum low-density lipoprotein, and ferritin levels (P < 0.05). Viral genotype 1 infected patients had higher serum cholesterol and triglyceride levels (P < 0.05). Conclusions: Although the IL28B sequence variation is the major factor that can influence response rates to antiviral therapy, viral and biochemical factors also have a definite role to play in the diagnosis, etiology, and treatment outcome in HCV-infected patients.

  13. Relationship Between Genotype Variants Follicle-stimulating Hormone Receptor Gene Polymorphisms (FSHR) and Morphology of Oocytes Prior to ICSI Procedures

    OpenAIRE

    Gashi, Zafer; Elezaj, Shkelzen; Zeqiraj, Afrim; Grabanica, Driton; Shabani, Isak; Gruda, Bujar; Gashi, Fitore

    2016-01-01

    Introduction: This study investigated association of Asn680Ser FSHR polymorphism with the ovarian response in 104 women of Albanian ethnic population enrolled in ICSI program. The reason of infertility in all cases has been identified as male factor. Methods: Analysis of the Asn680Ser polymorphism was performed using TaqMan® SNP Genotyping Assay. Clinical and endocrinologic parameters were analyzed based on the genotype, age, BMI, oocyte yield, number of transferred embryos and pregnancy rate...

  14. Multiplex target enrichment using DNA indexing for ultra-high throughput SNP detection.

    LENUS (Irish Health Repository)

    Kenny, Elaine M

    2011-02-01

    Screening large numbers of target regions in multiple DNA samples for sequence variation is an important application of next-generation sequencing but an efficient method to enrich the samples in parallel has yet to be reported. We describe an advanced method that combines DNA samples using indexes or barcodes prior to target enrichment to facilitate this type of experiment. Sequencing libraries for multiple individual DNA samples, each incorporating a unique 6-bp index, are combined in equal quantities, enriched using a single in-solution target enrichment assay and sequenced in a single reaction. Sequence reads are parsed based on the index, allowing sequence analysis of individual samples. We show that the use of indexed samples does not impact on the efficiency of the enrichment reaction. For three- and nine-indexed HapMap DNA samples, the method was found to be highly accurate for SNP identification. Even with sequence coverage as low as 8x, 99% of sequence SNP calls were concordant with known genotypes. Within a single experiment, this method can sequence the exonic regions of hundreds of genes in tens of samples for sequence and structural variation using as little as 1 μg of input DNA per sample.
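
    The step in which sequence reads are parsed by index can be sketched as below. This is a minimal illustration, assuming the 6-bp index sits at the start of each read and that only exact matches are assigned; the function name and the "unassigned" bin are hypothetical and the abstract does not describe the authors' parsing software.

```python
from collections import defaultdict


def demultiplex(reads, index_to_sample, index_len=6):
    """Group reads by their leading index and strip the index from each read.

    reads: iterable of read sequences (strings).
    index_to_sample: dict mapping index sequence -> sample name.
    Reads whose index is not in the dict go to the "unassigned" bin.
    """
    bins = defaultdict(list)
    for read in reads:
        index, insert = read[:index_len], read[index_len:]
        sample = index_to_sample.get(index, "unassigned")
        bins[sample].append(insert)
    return dict(bins)
```

    A production pipeline would usually also tolerate a sequencing error in the index, which this exact-match sketch does not attempt.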

  15. Diverse Genotypes of Yersinia pestis Caused Plague in Madagascar in 2007.

    Science.gov (United States)

    Riehm, Julia M; Projahn, Michaela; Vogler, Amy J; Rajerison, Minoaerisoa; Andersen, Genevieve; Hall, Carina M; Zimmermann, Thomas; Soanandrasana, Rahelinirina; Andrianaivoarimanana, Voahangy; Straubinger, Reinhard K; Nottingham, Roxanne; Keim, Paul; Wagner, David M; Scholz, Holger C

    2015-06-01

    Yersinia pestis is the causative agent of human plague and is endemic in various African, Asian and American countries. In Madagascar, the disease represents a significant public health problem with hundreds of human cases a year. Unfortunately, poor infrastructure makes outbreak investigations challenging. DNA was extracted directly from 93 clinical samples from patients with a clinical diagnosis of plague in Madagascar in 2007. The extracted DNAs were then genotyped using three molecular genotyping methods: single nucleotide polymorphism (SNP) typing, multi-locus variable-number tandem repeat analysis (MLVA), and Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) analysis. These methods provided increasing resolution, respectively. The results of these analyses revealed that, in 2007, ten molecular groups, two newly described here and eight previously identified, were responsible for causing human plague in geographically distinct areas of Madagascar. Plague in Madagascar is caused by numerous distinct types of Y. pestis. Genotyping method choice should be based upon the discriminatory power needed, expense, and available data for any desired comparisons. We conclude that genotyping should be a standard tool used in epidemiological investigations of plague outbreaks.

  16. p.Q192R SNP of PON1 seems not to be Associated with Carotid Atherosclerosis Risk Factors in an Asymptomatic and Normolipidemic Brazilian Population Sample

    Science.gov (United States)

    Scherrer, Daniel Zanetti; Zago, Vanessa Helena de Souza; Vieira, Isabela Calanca; Parra, Eliane Soler; Panzoldo, Natália Baratella; Alexandre, Fernanda; Secolin, Rodrigo; Baracat, Jamal; Quintão, Eder Carlos Rocha; de Faria, Eliana Cotta

    2015-01-01

    Background Evidence suggests that paraoxonase 1 (PON1) confers important antioxidant and anti-inflammatory properties when associated with high-density lipoprotein (HDL). Objective To investigate the relationships between p.Q192R SNP of PON1, biochemical parameters and carotid atherosclerosis in an asymptomatic, normolipidemic Brazilian population sample. Methods We studied 584 volunteers (females n = 326, males n = 258; 19-75 years of age). Total genomic DNA was extracted and SNP was detected in the TaqMan® SNP OpenArray® genotyping platform (Applied Biosystems, Foster City, CA). Plasma lipoproteins and apolipoproteins were determined and PON1 activity was measured using paraoxon as a substrate. High-resolution B-mode ultrasonography was used to measure cIMT and the presence of carotid atherosclerotic plaques in a subgroup of individuals (n = 317). Results The presence of p.192Q was associated with a significant increase in PON1 activity (RR = 12.30 (11.38); RQ = 46.96 (22.35); QQ = 85.35 (24.83) μmol/min; p < 0.0001), HDL-C (RR = 45 (37); RQ = 62 (39); QQ = 69 (29) mg/dL; p < 0.001) and apo A-I (RR = 140.76 ± 36.39; RQ = 147.62 ± 36.92; QQ = 147.49 ± 36.65 mg/dL; p = 0.019). Stepwise regression analysis revealed that heterozygous and p.192Q carriers influenced PON1 activity towards paraoxon by 58%. The univariate linear regression analysis demonstrated that p.Q192R SNP was not associated with mean cIMT; as a result, in the multiple regression analysis, no variables were selected with 5% significance. In logistic regression analysis, the studied parameters were not associated with the presence of carotid plaques. Conclusion In low-risk individuals, the presence of the p.192Q variant of PON1 is associated with a beneficial plasma lipid profile but not with carotid atherosclerosis. PMID:26039660

  17. Forensic assays of ricin: development of snp assays to generate precise genetic signatures for mixed genotypes found in ricin preparations

    Energy Technology Data Exchange (ETDEWEB)

    Jackson, Paul J. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Hill, Karen K. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States)

    2009-11-09

    The results outlined in this report provide the information needed to apply a SNP-based forensic analysis to diverse ricin preparations. The same methods could be useful in castor breeding programs that seek to reduce or eliminate ricin in oil-producing R. communis cultivars.

  18. SNP discovery and chromosome anchoring provide the first physically-anchored hexaploid oat map and reveal synteny with model species.

    Directory of Open Access Journals (Sweden)

    Rebekah E Oliver

    Full Text Available A physically anchored consensus map is foundational to modern genomics research; however, construction of such a map in oat (Avena sativa L., 2n = 6x = 42) has been hindered by the size and complexity of the genome, the scarcity of robust molecular markers, and the lack of aneuploid stocks. Resources developed in this study include a modified SNP discovery method for complex genomes, a diverse set of oat SNP markers, and a novel chromosome-deficient SNP anchoring strategy. These resources were applied to build the first complete, physically-anchored consensus map of hexaploid oat. Approximately 11,000 high-confidence in silico SNPs were discovered based on nine million inter-varietal sequence reads of genomic and cDNA origin. GoldenGate genotyping of 3,072 SNP assays yielded 1,311 robust markers, of which 985 were mapped in 390 recombinant-inbred lines from six bi-parental mapping populations ranging in size from 49 to 97 progeny. The consensus map included 985 SNPs and 68 previously-published markers, resolving 21 linkage groups with a total map distance of 1,838.8 cM. Consensus linkage groups were assigned to 21 chromosomes using SNP deletion analysis of chromosome-deficient monosomic hybrid stocks. Alignments with sequenced genomes of rice and Brachypodium provide evidence for extensive conservation of genomic regions, and renewed encouragement for orthology-based genomic discovery in this important hexaploid species. These results also provide a framework for high-resolution genetic analysis in oat, and a model for marker development and map construction in other species with complex genomes and limited resources.

  19. Single tube genotyping of sickle cell anaemia using PCR-based SNP analysis.

    Science.gov (United States)

    Waterfall, C M; Cobb, B D

    2001-12-01

    Allele-specific amplification (ASA) is a generally applicable technique for the detection of known single nucleotide polymorphisms (SNPs), deletions, insertions and other sequence variations. Conventionally, two reactions are required to determine the zygosity of DNA in a two-allele system, along with significant upstream optimisation to define the specific test conditions. Here, we combine single tube bi-directional ASA with a 'matrix-based' optimisation strategy, speeding up the whole process in a reduced reaction set. We use sickle cell anaemia as our model SNP system, a genetic disease that is currently screened using ASA methods. Discriminatory conditions were rapidly optimised enabling the unambiguous identification of DNA from homozygous sickle cell patients (HbS/S), heterozygous carriers (HbA/S) or normal DNA in a single tube. Simple downstream mathematical analyses based on product yield across the optimisation set allow an insight into the important aspects of priming competition and component interactions in this competitive PCR. This strategy can be applied to any polymorphism, defining specific conditions using a multifactorial approach. The inherent simplicity and low cost of this PCR-based method validates bi-directional ASA as an effective tool in future clinical screening and pharmacogenomic research where more expensive fluorescence-based approaches may not be desirable.

  20. Interim Report on SNP analysis and forensic microarray probe design for South American hemorrhagic fever viruses, tick-borne encephalitis virus, henipaviruses, Old World Arenaviruses, filoviruses, Crimean-Congo hemorrhagic fever viruses, Rift Valley fever

    Energy Technology Data Exchange (ETDEWEB)

    Jaing, C; Gardner, S

    2012-06-05

    The goal of this project is to develop forensic genotyping assays for select agent viruses, enhancing the current capabilities for the viral bioforensics and law enforcement community. We used a multipronged approach combining bioinformatics analysis, PCR-enriched samples, microarrays and TaqMan assays to develop high resolution and cost effective genotyping methods for strain level forensic discrimination of viruses. We have leveraged substantial experience and efficiency gained through year 1 on software development, SNP discovery, TaqMan signature design and phylogenetic signature mapping to scale up the development of forensics signatures in year 2. In this report, we have summarized the whole genome wide SNP analysis and microarray probe design for forensics characterization of South American hemorrhagic fever viruses, tick-borne encephalitis viruses and henipaviruses, Old World Arenaviruses, filoviruses, Crimean-Congo hemorrhagic fever virus, Rift Valley fever virus and Japanese encephalitis virus.

  1. Exploring germplasm diversity to understand the domestication process in Cicer spp. using SNP and DArT markers.

    Directory of Open Access Journals (Sweden)

    Manish Roorkiwal

    Full Text Available To estimate genetic diversity within and between 10 interfertile Cicer species (94 genotypes) from the primary, secondary and tertiary gene pool, we analysed 5,257 DArT markers and 651 KASPar SNP markers. Based on successful allele calling in the tertiary gene pool, 2,763 DArT and 624 SNP markers that are polymorphic between genotypes from the gene pools were analyzed further. STRUCTURE analyses were consistent with 3 cultivated populations, representing kabuli, desi and pea-shaped seed types, with substantial admixture among these groups, while two wild populations were observed using DArT markers. AMOVA was used to partition variance among hierarchical sets of landraces and wild species at both the geographical and species level, with 61% of the variation found between species, and 39% within species. Molecular variance among the wild species was high (39%) compared to the variation present in cultivated material (10%). Observed heterozygosity was higher in wild species than the cultivated species for each linkage group. Our results support the Fertile Crescent both as the center of domestication and diversification of chickpea. The collection used in the present study covers all the three regions of historical chickpea cultivation, with the highest diversity in the Fertile Crescent region. Shared alleles between different gene pools suggest the possibility of gene flow among these species or incomplete lineage sorting and could indicate complicated patterns of divergence and fusion of wild chickpea taxa in the past.

  2. High-throughput genotyping-by-sequencing facilitates molecular tagging of a novel rust resistance gene, R15, in sunflower (Helianthus annuus L.).

    Science.gov (United States)

    Ma, G J; Song, Q J; Markell, S G; Qi, L L

    2018-03-21

    A novel rust resistance gene, R15, derived from the cultivated sunflower HA-R8 was assigned to linkage group 8 of the sunflower genome using a genotyping-by-sequencing approach. SNP markers closely linked to R15 were identified, facilitating marker-assisted selection of resistance genes. The rust virulence gene is co-evolving with the resistance gene in sunflower, leading to the emergence of new physiologic pathotypes. This presents a continuous threat to the sunflower crop necessitating the development of resistant sunflower hybrids providing a more efficient, durable, and environmentally friendly host plant resistance. The inbred line HA-R8 carries a gene conferring resistance to all known races of the rust pathogen in North America and can be used as a broad-spectrum resistance resource. Based on phenotypic assessments of 140 F2 individuals derived from a cross of HA 89 with HA-R8, rust resistance in the population was found to be conferred by a single dominant gene (R15) originating from HA-R8. Genotypic analysis with the currently available SSR markers failed to find any association between rust resistance and any markers. Therefore, we used genotyping-by-sequencing (GBS) analysis to achieve better genomic coverage. The GBS data showed that R15 was located at the top end of linkage group (LG) 8. Saturation with 71 previously mapped SNP markers selected within this region further showed that it was located in a resistance gene cluster on LG8, and mapped to a 1.0-cM region between three co-segregating SNP markers SFW01920, SFW00128, and SFW05824 as well as the NSA_008457 SNP marker. These closely linked markers will facilitate marker-assisted selection and breeding in sunflower.

  3. Prediction of the optimum hybridization conditions of dot-blot-SNP analysis using estimated melting temperature of oligonucleotide probes.

    Science.gov (United States)

    Shiokai, Sachiko; Kitashiba, Hiroyasu; Nishio, Takeshi

    2010-08-01

    Although the dot-blot-SNP technique is a simple cost-saving technique suitable for genotyping of many plant individuals, optimization of hybridization and washing conditions for each SNP marker requires much time and labor. For prediction of the optimum hybridization conditions for each probe, we compared Tm values estimated from nucleotide sequences using the DINAMelt web server, measured Tm values, and hybridization conditions yielding allele-specific signals. The estimated Tm values were comparable to the measured Tm values with small differences of less than 3 °C for most of the probes. There were differences of approximately 14 °C between the specific signal detection conditions and estimated Tm values. Change of one level of SSC concentrations of 0.1, 0.2, 0.5, and 1.0x SSC corresponded to a difference of approximately 5 °C in optimum signal detection temperature. Increasing the sensitivity of signal detection by shortening the exposure time to X-ray film changed the optimum hybridization condition for specific signal detection. Addition of competitive oligonucleotides to the hybridization mixture increased the suitable hybridization conditions by 1.8. Based on these results, optimum hybridization conditions for newly produced dot-blot-SNP markers will become predictable.

  4. Polymorphisms in genes involved in the estrogen pathway and mammographic density

    Directory of Open Access Journals (Sweden)

    Dumas Isabelle

    2010-11-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs) in genes involved in the estrogen pathway appear to be associated with breast cancer risk and possibly with mammographic density (MD), but little is known of these associations among premenopausal women. This study examines the association of 11 polymorphisms in five estrogen-related genes (estrogen receptors alpha and beta (ERα, ERβ), 17β-hydroxysteroid dehydrogenase 1 (HSD17B1), catechol-O-methyltransferase (COMT), cytochrome P450 1B1 (CYP1B1)) with premenopausal MD. Effect modification of four estrogen-related factors (parity, age at menarche, hormonal derivatives use and body mass index (BMI)) on this relation is also assessed. Methods Polymorphisms were genotyped in 741 premenopausal Caucasian women whose MD was measured in absolute density (AD, cm²) and percent density using a computer-assisted method. Multivariate linear models were used to examine the associations (Ptrend) and interactions (Pi). Results None of the SNPs showed a statistically significant association with AD. However, each additional rare allele of rs1056836 CYP1B1 was associated with a reduction in AD among nulliparous women (Ptrend = 0.004), while no association was observed among parous women (Ptrend = 0.62; Pi = 0.02). An increase in the number of rare alleles of the HSD17B1 SNP (rs598126 and rs2010750) was associated with an increase in AD among women who never used hormonal derivatives (Ptrend = 0.06 and Ptrend = 0.04, respectively), but with a decrease in AD among past hormonal derivatives users (Ptrend = 0.04; Pi = 0.02 and Ptrend = 0.08; Pi = 0.01, respectively). Moreover, a negative association of rs598126 HSD17B1 SNP with AD was observed among women with higher BMI (>median) (Ptrend = 0.01; Pi = 0.02). A negative association between an increased number of rare alleles of COMT rs4680 SNP and AD was limited to women who never used hormonal derivatives (Ptrend = 0.02; Pi = 0.03) or with late age at menarche (>median

  5. Polymorphisms in genes involved in the estrogen pathway and mammographic density

    International Nuclear Information System (INIS)

    Dumas, Isabelle; Diorio, Caroline

    2010-01-01

    Single nucleotide polymorphisms (SNPs) in genes involved in the estrogen pathway appear to be associated with breast cancer risk and possibly with mammographic density (MD), but little is known of these associations among premenopausal women. This study examines the association of 11 polymorphisms in five estrogen-related genes (estrogen receptors alpha and beta (ERα, ERβ), 17β-hydroxysteroid dehydrogenase 1 (HSD17B1), catechol-O-methyltransferase (COMT), cytochrome P450 1B1 (CYP1B1)) with premenopausal MD. Effect modification of four estrogen-related factors (parity, age at menarche, hormonal derivatives use and body mass index (BMI)) on this relation is also assessed. Polymorphisms were genotyped in 741 premenopausal Caucasian women whose MD was measured in absolute density (AD, cm²) and percent density using a computer-assisted method. Multivariate linear models were used to examine the associations (Ptrend) and interactions (Pi). None of the SNPs showed a statistically significant association with AD. However, each additional rare allele of rs1056836 CYP1B1 was associated with a reduction in AD among nulliparous women (Ptrend = 0.004), while no association was observed among parous women (Ptrend = 0.62; Pi = 0.02). An increase in the number of rare alleles of the HSD17B1 SNP (rs598126 and rs2010750) was associated with an increase in AD among women who never used hormonal derivatives (Ptrend = 0.06 and Ptrend = 0.04, respectively), but with a decrease in AD among past hormonal derivatives users (Ptrend = 0.04; Pi = 0.02 and Ptrend = 0.08; Pi = 0.01, respectively). Moreover, a negative association of rs598126 HSD17B1 SNP with AD was observed among women with higher BMI (>median) (Ptrend = 0.01; Pi = 0.02). A negative association between an increased number of rare alleles of COMT rs4680 SNP and AD was limited to women who never used hormonal derivatives (Ptrend = 0.02; Pi = 0.03) or with late age at menarche (>median) (Ptrend = 0.03; Pi

  6. Joint effect of unlinked genotypes: application to type 2 diabetes in the EPIC-Potsdam case-cohort study.

    Science.gov (United States)

    Knüppel, Sven; Meidtner, Karina; Arregui, Maria; Holzhütter, Hermann-Georg; Boeing, Heiner

    2015-07-01

    Analyzing multiple single nucleotide polymorphisms (SNPs) is a promising approach to finding genetic effects beyond single-locus associations. We proposed the use of multilocus stepwise regression (MSR) to screen for allele combinations as a method to model joint effects, and compared the results with the often used genetic risk score (GRS), conventional stepwise selection, and the shrinkage method LASSO. In contrast to MSR, the GRS, conventional stepwise selection, and LASSO model each genotype by the risk allele doses. We reanalyzed 20 unlinked SNPs related to type 2 diabetes (T2D) in the EPIC-Potsdam case-cohort study (760 cases, 2193 noncases). No SNP-SNP interactions and no nonlinear effects were found. Two SNP combinations selected by MSR (Nagelkerke's R² = 0.050 and 0.048) included eight SNPs with mean allele combination frequency of 2%. GRS and stepwise selection selected nearly the same SNP combinations consisting of 12 and 13 SNPs (Nagelkerke's R² ranged from 0.020 to 0.029). LASSO showed similar results. The MSR method showed the best model fit measured by Nagelkerke's R², suggesting that further improvement may render this method a useful tool in genetic research. However, our comparison suggests that the GRS is a simple way to model genetic effects since it does not consider linkage, SNP-SNP interactions, or non-linear effects. © 2015 John Wiley & Sons Ltd/University College London.
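
    To make "model each genotype by the risk allele doses" concrete, the sketch below computes an unweighted genetic risk score as the sum of risk-allele counts over SNPs. It is an illustration of the general GRS idea only; whether the study weighted the alleles is not stated in the abstract, and the example doses are hypothetical.

```python
from typing import Sequence


def genetic_risk_score(risk_allele_doses: Sequence[int]) -> int:
    """Unweighted GRS: total number of risk alleles carried across all SNPs.

    Each entry is the dose at one SNP: 0, 1 or 2 copies of the risk allele.
    """
    if any(dose not in (0, 1, 2) for dose in risk_allele_doses):
        raise ValueError("each dose must be 0, 1 or 2")
    return sum(risk_allele_doses)


# Hypothetical individual genotyped at 20 unlinked SNPs (doses are made up).
doses = [0, 1, 2, 1, 0, 0, 1, 1, 2, 0, 1, 0, 0, 2, 1, 1, 0, 1, 0, 1]
score = genetic_risk_score(doses)  # ranges from 0 to 40 for 20 SNPs
```

    Because the score is a plain sum, every risk allele contributes equally and independently, which is why it cannot represent allele combinations of the kind MSR screens for.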

  7. Characterization of 68Zn uptake, translocation, and accumulation into developing grains and young leaves of high Zn-density rice genotype

    Science.gov (United States)

    Wu, Chun-yong; Feng, Ying; Shohag, Md. Jahidul Islam; Lu, Ling-li; Wei, Yan-yan; Gao, Chong; Yang, Xiao-e

    2011-01-01

    Zinc (Zn) is an essential micronutrient for humans, but Zn deficiency has become as serious as iron (Fe) and vitamin A deficiencies. Selection and breeding of high Zn-density crops is a suitable, cost-effective, and sustainable way to improve human health. However, the mechanism of high Zn density in rice grain is not fully understood, especially how Zn is transported from soil to grains. Hydroponics experiments were carried out to compare Zn uptake and distribution in two different Zn-density rice genotypes using a stable isotope technique. At the seedling stage, IR68144 showed higher 68Zn uptake and transport rate to the shoot in the short term, but no significant difference was observed between the two genotypes in the long term. Zn in xylem sap of IR68144 was consistently higher, and IR68144 exhibited a higher Zn absorption ratio than IR64 at sufficient (2.0 µmol/L) or surplus (8.0 µmol/L) Zn supply level. IR64 and IR68144 showed similar patterns of 68Zn accumulation in new leaves at the seedling stage and in developing grains at the ripening stage, whereas 68Zn in new leaves and grains of IR68144 was consistently higher. These results suggested that a rapid root-to-shoot translocation and enhanced xylem loading capacity may be the crucial processes for high Zn density in rice grains. PMID:21528496

  8. Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).

    Science.gov (United States)

    Koning-Boucoiran, Carole F S; Esselink, G Danny; Vukosavljev, Mirjana; van 't Westende, Wendy P C; Gitonga, Virginia W; Krens, Frans A; Voorrips, Roeland E; van de Weg, W Eric; Schulz, Dietmar; Debener, Thomas; Maliepaard, Chris; Arens, Paul; Smulders, Marinus J M

    2015-01-01

    In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this, RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these, 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular.

  9. Genome-wide joint meta-analysis of SNP and SNP-by-smoking interaction identifies novel loci for pulmonary function.

    Directory of Open Access Journals (Sweden)

    Dana B Hancock

    Full Text Available Genome-wide association studies have identified numerous genetic loci for spirometric measures of pulmonary function, forced expiratory volume in one second (FEV1), and its ratio to forced vital capacity (FEV1/FVC). Given that cigarette smoking adversely affects pulmonary function, we conducted genome-wide joint meta-analyses (JMA) of single nucleotide polymorphism (SNP) and SNP-by-smoking (ever-smoking or pack-years) associations on FEV1 and FEV1/FVC across 19 studies (total N = 50,047). We identified three novel loci not previously associated with pulmonary function. SNPs in or near DNER (smallest PJMA = 5.00×10⁻¹¹), HLA-DQB1 and HLA-DQA2 (smallest PJMA = 4.35×10⁻⁹), and KCNJ2 and SOX9 (smallest PJMA = 1.28×10⁻⁸) were associated with FEV1/FVC or FEV1 in meta-analysis models including SNP main effects, smoking main effects, and SNP-by-smoking (ever-smoking or pack-years) interaction. The HLA region has been widely implicated for autoimmune and lung phenotypes, unlike the other novel loci, which have not been widely implicated. We evaluated DNER, KCNJ2, and SOX9 and found them to be expressed in human lung tissue. DNER and SOX9 further showed evidence of differential expression in human airway epithelium in smokers compared to non-smokers. Our findings demonstrated that joint testing of SNP and SNP-by-environment interaction identified novel loci associated with complex traits that are missed when considering only the genetic main effects.

  10. Improved Ancestry Estimation for both Genotyping and Sequencing Data using Projection Procrustes Analysis and Genotype Imputation

    Science.gov (United States)

    Wang, Chaolong; Zhan, Xiaowei; Liang, Liming; Abecasis, Gonçalo R.; Lin, Xihong

    2015-01-01

    Accurate estimation of individual ancestry is important in genetic association studies, especially when a large number of samples are collected from multiple sources. However, existing approaches developed for genome-wide SNP data do not work well with modest amounts of genetic data, such as in targeted sequencing or exome chip genotyping experiments. We propose a statistical framework to estimate individual ancestry in a principal component ancestry map generated by a reference set of individuals. This framework extends and improves upon our previous method for estimating ancestry using low-coverage sequence reads (LASER 1.0) to analyze either genotyping or sequencing data. In particular, we introduce a projection Procrustes analysis approach that uses high-dimensional principal components to estimate ancestry in a low-dimensional reference space. Using extensive simulations and empirical data examples, we show that our new method (LASER 2.0), combined with genotype imputation on the reference individuals, can substantially outperform LASER 1.0 in estimating fine-scale genetic ancestry. Specifically, LASER 2.0 can accurately estimate fine-scale ancestry within Europe using either exome chip genotypes or targeted sequencing data with off-target coverage as low as 0.05×. Under the framework of LASER 2.0, we can estimate individual ancestry in a shared reference space for samples assayed at different loci or by different techniques. Therefore, our ancestry estimation method will accelerate discovery in disease association studies not only by helping model ancestry within individual studies but also by facilitating combined analysis of genetic data from multiple sources. PMID:26027497

  11. Casein SNP in Norwegian goats: additive and dominance effects on milk composition and quality

    Science.gov (United States)

    2011-01-01

    Background The four casein proteins in goat milk are encoded by four closely linked casein loci (CSN1S1, CSN2, CSN1S2 and CSN3) within 250 kb on caprine chromosome 6. A deletion in exon 12 of CSN1S1, so far reported only in Norwegian goats, has been found at high frequency (0.73). Such a high frequency is difficult to explain because the national breeding goal selects against the variant's effect. Methods In this study, 575 goats were genotyped for 38 Single Nucleotide Polymorphisms (SNP) located within the four casein genes. Milk production records of these goats were obtained from the Norwegian Dairy Goat Control. Test-day mixed models with additive and dominance fixed effects of single SNP were fitted in a model including polygenic effects. Results Significant additive effects of single SNP within CSN1S1 and CSN3 were found for fat % and protein %, milk yield and milk taste. The allele with the deletion showed additive and dominance effects on protein % and fat %, and overdominance effects on milk quantity (kg) and lactose %. At its current frequency, the observed dominance (overdominance) effects of the deletion allele reduced its substitution effect (and additive genetic variance available for selection) in the population substantially. Conclusions The selection pressure of conventional breeding on the allele with the deletion is limited due to the observed dominance (overdominance) effects. Inclusion of molecular information in the national breeding scheme will reduce the frequency of this deletion in the population. PMID:21864407
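
    The statement that dominance reduces the substitution effect at the current allele frequency follows from the standard quantitative-genetics relation alpha = a + d(q - p), where p is the frequency of the allele of interest and q = 1 - p. The sketch below shows the usual additive and dominance genotype codings and this relation. The values of a and d are hypothetical (the abstract does not report them); only the frequency 0.73 comes from the abstract.

```python
def code_genotype(n_copies: int) -> tuple[int, int]:
    """Return (additive, dominance) covariates for a biallelic genotype.

    additive  = number of copies of the allele of interest (0, 1 or 2)
    dominance = 1 for heterozygotes, 0 for either homozygote
    """
    if n_copies not in (0, 1, 2):
        raise ValueError("n_copies must be 0, 1 or 2")
    return n_copies, 1 if n_copies == 1 else 0


def substitution_effect(a: float, d: float, p: float) -> float:
    """Average effect of allele substitution: alpha = a + d * (q - p)."""
    q = 1.0 - p
    return a + d * (q - p)


# Hypothetical effect sizes (illustration only), frequency from the abstract.
alpha = substitution_effect(a=1.0, d=0.5, p=0.73)
```

    With p = 0.73 the term (q - p) is negative, so a dominance deviation with the same sign as the additive effect shrinks alpha, and with it the additive variance that selection can act on.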

  12. Genotypic difference in salinity tolerance in quinoa is determined by differential control of xylem Na+ loading and stomatal density

    DEFF Research Database (Denmark)

    Shabala, Sergey; Hariadi, Yuda; Jacobsen, Sven-Erik

    2013-01-01

    old seedlings. Six weeks after the treatment commenced, leaf sap Na and K content and osmolality, stomatal density, chlorophyll fluorescence characteristics, and xylem sap Na and K composition were measured. Responses to salinity differed greatly among the varieties. All cultivars had substantially...... increased K+ concentrations in the leaf sap, but the most tolerant cultivars had lower xylem Na+ content at the time of sampling. Most tolerant cultivars had lowest leaf sap osmolality. All varieties reduced stomata density when grown under saline conditions. All varieties clustered into two groups...... to the xylem, and reduced stomata density are important physiological traits contributing to genotypic differences in salinity tolerance in quinoa, a halophyte species from Chenopodium family....

  13. A customized pigmentation SNP array identifies a novel SNP associated with melanoma predisposition in the SLC45A2 gene.

    Directory of Open Access Journals (Sweden)

    Maider Ibarrola-Villava

    Full Text Available As the incidence of Malignant Melanoma (MM) reflects an interaction between skin colour and UV exposure, variations in genes implicated in pigmentation and tanning response to UV may be associated with susceptibility to MM. In this study, 363 SNPs in 65 gene regions belonging to the pigmentation pathway have been successfully genotyped using a SNP array. Five hundred and ninety MM cases and 507 controls were analyzed in a discovery phase I. Ten candidate SNPs based on a p-value threshold of 0.01 were identified. Two of them, rs35414 (SLC45A2) and rs2069398 (SILV/CDK2), were statistically significant after conservative Bonferroni correction. The best six SNPs were further tested in an independent Spanish series (624 MM cases and 789 controls). A novel SNP located on the SLC45A2 gene (rs35414) was found to be significantly associated with melanoma in both phase I and phase II (P<0.0001). None of the other five SNPs were replicated in this second phase of the study. However, three SNPs in TYR, SILV/CDK2 and ADAMTS20 genes (rs17793678, rs2069398 and rs1510521, respectively) had an overall p-value<0.05 when considering the whole DNA collection (1214 MM cases and 1296 controls). Both the SLC45A2 and the SILV/CDK2 variants behave as protective alleles, while the TYR and ADAMTS20 variants seem to function as risk alleles. Cumulative effects were detected when these four variants were considered together. Furthermore, individuals carrying two or more mutations in MC1R, a well-known low penetrance melanoma-predisposing gene, had a decreased MM risk if concurrently bearing the SLC45A2 protective variant. To our knowledge, this is the largest study on Spanish sporadic MM cases to date.

  14. SNP mining porcine ESTs with MAVIANT, a novel tool for SNP evaluation and annotation

    DEFF Research Database (Denmark)

    Panitz, Frank; Stengaard, Henrik; Hornshoj, Henrik

    2007-01-01

    MOTIVATION: Single nucleotide polymorphisms (SNPs) analysis is an important means to study genetic variation. A fast and cost-efficient approach to identify large numbers of novel candidates is the SNP mining of large scale sequencing projects. The increasing availability of sequence trace data...... manual annotation, which is immediately accessible and can be easily shared with external collaborators. RESULTS: Large-scale SNP mining of polymorphisms based on porcine EST sequences yielded more than 7900 candidate SNPs in coding regions (cSNPs), which were annotated relative to the human genome. Non...

  15. A high-density SNP genetic linkage map for the silver-lipped pearl oyster, Pinctada maxima: a valuable resource for gene localisation and marker-assisted selection.

    Science.gov (United States)

    Jones, David B; Jerry, Dean R; Khatkar, Mehar S; Raadsma, Herman W; Zenger, Kyall R

    2013-11-20

    The silver-lipped pearl oyster, Pinctada maxima, is an important tropical aquaculture species extensively farmed for the highly sought "South Sea" pearls. Traditional breeding programs have been initiated for this species in order to select for improved pearl quality, but many economic traits under selection are complex, polygenic and confounded with environmental factors, limiting the accuracy of selection. The incorporation of a marker-assisted selection (MAS) breeding approach would greatly benefit pearl breeding programs by allowing the direct selection of genes responsible for pearl quality. However, before MAS can be incorporated, substantial genomic resources such as genetic linkage maps need to be generated. The construction of a high-density genetic linkage map for P. maxima is not only essential for unravelling the genomic architecture of complex pearl quality traits, but also provides indispensable information on the genome structure of pearl oysters. A total of 1,189 informative genome-wide single nucleotide polymorphisms (SNPs) were incorporated into linkage map construction. The final linkage map consisted of 887 SNPs in 14 linkage groups, spans a total genetic distance of 831.7 centimorgans (cM), and covers an estimated 96% of the P. maxima genome. Assessment of sex-specific recombination across all linkage groups revealed limited overall heterochiasmy between the sexes (i.e. 1.15:1 F/M map length ratio). However, there were pronounced localised differences throughout the linkage groups, whereby male recombination was suppressed near the centromeres compared to female recombination, but inflated towards telomeric regions. Mean values of LD for adjacent SNP pairs suggest that a higher density of markers will be required for powerful genome-wide association studies. Finally, numerous nacre biomineralization genes were localised providing novel positional information for these genes. 
    This high-density SNP genetic map is the first comprehensive linkage

  16. Multiplex pyrosequencing assay using AdvISER-MH-PYRO algorithm: a case for rapid and cost-effective genotyping analysis of prostate cancer risk-associated SNPs.

    Science.gov (United States)

    Ambroise, Jérôme; Butoescu, Valentina; Robert, Annie; Tombal, Bertrand; Gala, Jean-Luc

    2015-06-25

    Single Nucleotide Polymorphisms (SNPs) identified in Genome Wide Association Studies (GWAS) have generally moderate association with related complex diseases. Accordingly, Multilocus Genetic Risk Scores (MGRSs) have been computed in previous studies in order to assess the cumulative association of multiple SNPs. When several SNPs have to be genotyped for each patient, using successive uniplex pyrosequencing reactions increases analytical reagent expenses and Turnaround Time (TAT). While a set of several pyrosequencing primers could theoretically be used to analyze multiplex amplicons, this would generate overlapping primer-specific pyro-signals that are visually uninterpretable. In the current study, two multiplex assays were developed consisting of a quadruplex (n=4) and a quintuplex (n=5) polymerase chain reaction (PCR) each followed by multiplex pyrosequencing analysis. The aim was to reliably but rapidly genotype a set of prostate cancer-related SNPs (n=9). The nucleotide dispensation order was selected using SENATOR software. Multiplex pyro-signals were analyzed using the new AdvISER-MH-PYRO software based on a sparse representation of the signal. Using uniplex assays as gold standard, the concordance between multiplex and uniplex assays was assessed on DNA extracted from patient blood samples (n = 10). All genotypes (n=90) generated with the quadruplex and the quintuplex pyrosequencing assays were perfectly (100 %) concordant with uniplex pyrosequencing. Using the multiplex genotyping approach to analyze a set of 90 patients reduced TAT by approximately 75 % (i.e., from 2025 to 470 min) while reducing reagent consumption and cost by approximately 70 % (i.e., from ~229 US$ /patient to ~64 US$ /patient). This combination of quadruplex and quintuplex pyrosequencing and PCR assays made it possible to reduce the amount of DNA required for multi-SNP analysis, and to lower the global TAT and costs of SNP genotyping while providing results as reliable as uniplex
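
    The rounded savings quoted above can be recomputed from the before and after figures given in the abstract. The helper name below is ours, introduced only for this check.

```python
def percent_reduction(before: float, after: float) -> float:
    """Relative reduction from `before` to `after`, in percent."""
    if before <= 0:
        raise ValueError("before must be positive")
    return 100.0 * (before - after) / before


# Figures taken from the abstract.
tat_saving = percent_reduction(before=2025, after=470)   # minutes, 90 patients
cost_saving = percent_reduction(before=229, after=64)    # US$ per patient
```

    Both values land slightly above the rounded figures in the abstract (about 77 % for TAT and about 72 % for cost), consistent with its "approximately 75 %" and "approximately 70 %".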

  17. Partitioning of copy-number genotypes in pedigrees

    Directory of Open Access Journals (Sweden)

    Andelfinger Gregor U

    2010-05-01

    Full Text Available Abstract Background Copy number variations (CNVs) and polymorphisms (CNPs) have only recently gained the genetic community's attention. Conservative estimates have shown that CNVs and CNPs might affect more than 10% of the genome and that they may be at least as important as single nucleotide polymorphisms in assessing human variability. Widely used tools for CNP analysis have been implemented in Birdsuite and PLINK for the purpose of conducting genetic association studies based on the unpartitioned total number of CNP copies provided by the intensities from Affymetrix's Genome-Wide Human SNP Array. Here, we are interested in partitioning copy number variations and polymorphisms in extended pedigrees for the purpose of linkage analysis on familial data. Results We have developed CNGen, new software for the partitioning of copy number polymorphism using the integrated genotypes from Birdsuite with the Affymetrix platform. The algorithm applied to familial trios or extended pedigrees can produce partitioned copy number genotypes with distinct parental alleles. We have validated the algorithm using simulations on a complex pedigree structure using frequencies calculated from a real dataset of 300 genotyped samples from 42 pedigrees segregating a congenital heart defect phenotype. Conclusions CNGen is the first published software for the partitioning of copy number genotypes in pedigrees, making possible the use of CNPs and CNVs for linkage analysis. It was implemented with the Python interpreter version 2.5.2. It was successfully tested on current Linux, Windows and Mac OS workstations.

  18. A genome-wide SNP-association study confirms a sequence variant (g.66493737C>T) in the equine myostatin (MSTN) gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses

    Directory of Open Access Journals (Sweden)

    Whiston Ronan

    2010-10-01

    Full Text Available Abstract Background Thoroughbred horses have been selected for traits contributing to speed and stamina for centuries. It is widely recognized that inherited variation in physical and physiological characteristics is responsible for variation in individual aptitude for race distance, and that muscle phenotypes in particular are important. Results A genome-wide SNP-association study for optimum racing distance was performed using the EquineSNP50 Bead Chip genotyping array in a cohort of n = 118 elite Thoroughbred racehorses divergent for race distance aptitude. In a cohort-based association test we evaluated genotypic variation at 40,977 SNPs between horses suited to short distance (≤ 8 f) and middle-long distance (> 8 f) races. The most significant SNP was located on chromosome 18: BIEC2-417495 ~690 kb from the gene encoding myostatin (MSTN) [Punadj. = 6.96 × 10⁻⁶]. Considering best race distance as a quantitative phenotype, a peak of association on chromosome 18 (chr18:65809482-67545806) comprising eight SNPs encompassing a 1.7 Mb region was observed. Again, similar to the cohort-based analysis, the most significant SNP was BIEC2-417495 (Punadj. = 1.61 × 10⁻⁹; PBonf. = 6.58 × 10⁻⁵). In a candidate gene study we have previously reported a SNP (g.66493737C>T) in MSTN associated with best race distance in Thoroughbreds; however, its functional and genome-wide relevance were uncertain. Additional re-sequencing in the flanking regions of the MSTN gene revealed four novel 3' UTR SNPs and a 227 bp SINE insertion polymorphism in the 5' UTR promoter sequence. Linkage disequilibrium was highest between g.66493737C>T and BIEC2-417495 (r² = 0.86). Conclusions Comparative association tests consistently demonstrated the g.66493737C>T SNP as the superior variant in the prediction of distance aptitude in racehorses (g.66493737C>T, P = 1.02 × 10⁻¹⁰; BIEC2-417495, Punadj. = 1.61 × 10⁻⁹). Functional investigations will be required to determine whether this

  19. Simplification of genotyping techniques of the ABO blood type experiment and exploration of population genetics.

    Science.gov (United States)

    Hu, Jian; Zhou, Yi-ren; Ding, Jia-lin; Wang, Zhi-yuan; Liu, Ling; Wang, Ye-kai; Lou, Hui-ling; Qiao, Shou-yi; Wu, Yan-hua

    2017-05-20

    The ABO blood type is one of the most common and widely used genetic traits in humans. Three glycosyltransferase-encoding gene alleles, I^A, I^B and i, produce three red blood cell surface antigens, by which the ABO blood type is classified. By using the ABO blood type experiment as an ideal case for genetics teaching, we can easily introduce to the students several genetic concepts, including multiple alleles, gene interaction, single nucleotide polymorphism (SNP) and gene evolution. Herein we have innovated and integrated our ABO blood type genetics experiments. First, in the section of Molecular Genetics, a new method of ABO blood genotyping was established: specific primers based on SNP sites were designed to distinguish three alleles through quantitative real-time PCR. Next, the experimental teaching method of Gene Evolution was innovated in the Population Genetics section: a gene-evolution software was developed to simulate the evolutionary tendency of the ABO genotype encoding alleles under diverse conditions. Our reform aims to extend the contents of genetics experiments, to provide additional teaching approaches, and ultimately to improve the learning efficiency of our students.
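
    The multiple-allele and population-genetics concepts mentioned above can be illustrated with a small sketch: mapping an ABO genotype to its blood type (A and B codominant, both dominant over i) and computing expected blood-type frequencies under Hardy-Weinberg proportions. This is a textbook illustration, not the teaching software described in the abstract; the allele labels "A", "B" and "O" and the example frequencies are our own choices.

```python
def abo_phenotype(allele1: str, allele2: str) -> str:
    """Blood type for a genotype; 'O' stands for the recessive i allele."""
    alleles = {allele1, allele2}
    if not alleles <= {"A", "B", "O"}:
        raise ValueError("alleles must be 'A', 'B' or 'O'")
    if alleles == {"A", "B"}:
        return "AB"          # codominance of I^A and I^B
    if "A" in alleles:
        return "A"           # AA or AO
    if "B" in alleles:
        return "B"           # BB or BO
    return "O"               # OO only


def expected_blood_types(p: float, q: float, r: float) -> dict[str, float]:
    """Hardy-Weinberg blood-type frequencies for allele frequencies p (A), q (B), r (O)."""
    if abs(p + q + r - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    return {
        "A": p * p + 2 * p * r,
        "B": q * q + 2 * q * r,
        "AB": 2 * p * q,
        "O": r * r,
    }


# Hypothetical allele frequencies, for illustration only.
frequencies = expected_blood_types(p=0.3, q=0.1, r=0.6)
```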

  20. Evaluation of inbreeding depression in Holstein cattle using whole-genome SNP markers and alternative measures of genomic inbreeding.

    Science.gov (United States)

    Bjelland, D W; Weigel, K A; Vukasinovic, N; Nkrumah, J D

    2013-07-01

    The effects of increased pedigree inbreeding in dairy cattle populations have been well documented and result in a negative impact on profitability. Recent advances in genotyping technology have allowed researchers to move beyond pedigree analysis and study inbreeding at a molecular level. In this study, 5,853 animals were genotyped for 54,001 single nucleotide polymorphisms (SNP); 2,913 cows had phenotypic records including a single lactation for milk yield (from either lactation 1, 2, 3, or 4), reproductive performance, and linear type conformation. After removing SNP with poor call rates, low minor allele frequencies, and departure from Hardy-Weinberg equilibrium, 33,025 SNP remained for analyses. Three measures of genomic inbreeding were evaluated: percent homozygosity (FPH), inbreeding calculated from runs of homozygosity (FROH), and inbreeding derived from a genomic relationship matrix (FGRM). Average FPH was 60.5±1.1%, average FROH was 3.8±2.1%, and average FGRM was 20.8±2.3%, where animals with larger values for each of the genomic inbreeding indices were considered more inbred. Decreases in total milk yield to 205d postpartum of 53, 20, and 47kg per 1% increase in FPH, FROH, and FGRM, respectively, were observed. Increases in days open per 1% increase in FPH (1.76 d), FROH (1.72 d), and FGRM (1.06 d) were also noted, as well as increases in maternal calving difficulty (0.09, 0.03, and 0.04 on a 5-point scale for FPH, FROH, and FGRM, respectively). Several linear type traits, such as strength (-0.40, -0.11, and -0.19), rear legs rear view (-0.35, -0.16, and -0.14), front teat placement (0.35, 0.25, 0.18), and teat length (-0.24, -0.14, and -0.13) were also affected by increases in FPH, FROH, and FGRM, respectively. Overall, increases in each measure of genomic inbreeding in this study were associated with negative effects on production and reproductive ability in dairy cows. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc
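
    Of the three genomic inbreeding measures, percent homozygosity is the simplest to state. The sketch below assumes it is the share of an animal's called SNP genotypes that are homozygous, with genotypes coded as the count of one allele (0, 1 or 2) and missing calls as None; the abstract does not spell out the exact definition or coding, so both are assumptions, and the genotype vector is made up.

```python
from typing import Optional, Sequence


def percent_homozygosity(genotypes: Sequence[Optional[int]]) -> float:
    """Percentage of called SNP genotypes that are homozygous (coded 0 or 2)."""
    called = [g for g in genotypes if g is not None]  # drop missing calls
    if not called:
        raise ValueError("no called genotypes")
    homozygous = sum(1 for g in called if g in (0, 2))
    return 100.0 * homozygous / len(called)


# Hypothetical animal with six SNPs, one of them uncalled.
f_ph = percent_homozygosity([0, 2, 1, 1, None, 2])
```

    A measure of this kind counts every homozygous genotype regardless of why it is homozygous, which may help explain why the reported average FPH (60.5 %) is far higher than the run-based FROH (3.8 %).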

  1. FGFR1OP tagSNP but not CCR6 polymorphisms are associated with Vogt-Koyanagi-Harada syndrome in Chinese Han.

    Directory of Open Access Journals (Sweden)

    Xianglong Yi

    Full Text Available BACKGROUND: Polymorphisms of the CC chemokine receptor 6 (CCR6) and the FGFR1OP tagSNP (a locus close to CCR6 at 6q27) have recently been reported to be associated with susceptibility to several immune-related diseases. This study was designed to determine the association of CCR6 and FGFR1OP tagSNPs with Vogt-Koyanagi-Harada (VKH) syndrome, an autoimmune disease directed against melanocytes, in two independent Chinese Han populations. METHODOLOGY/PRINCIPAL FINDINGS: A total of 601 VKH patients and 725 healthy controls from two Chinese Han populations were genotyped by the polymerase chain reaction-restriction fragment length polymorphism method. Hardy-Weinberg equilibrium was tested using the χ² test. Genotype frequencies were estimated by direct counting. Allele and genotype frequencies were compared between patients and controls using the χ² test. The frequency of the A allele of rs2301436 was significantly higher in both Cohort 1 and Cohort 2 than in their respective controls (P = 0.044; P = 0.049, respectively). The significance was lost after Bonferroni correction in both cohorts (Pc = 0.516; Pc = 0.392, respectively). The frequency of the A allele was significantly higher in the combined patient group than in all controls before and after Bonferroni correction (P = 0.005, Pc = 0.025). The genotype and allele frequencies of rs3093024, rs6902119, rs3093023 and rs968334 did not differ between patients with VKH and healthy controls, whether the cohorts were analysed separately or combined. Analysis according to extraocular clinical findings including headache, alopecia and poliosis, vitiligo and tinnitus did not show any association of the five polymorphisms with these parameters. CONCLUSION: These results suggest that the rs2301436 tagSNP of FGFR1OP is positively associated with susceptibility to VKH syndrome in the tested Chinese Han populations. No association was found for

  2. HIGH-THROUGHPUT IDENTIFICATION OF THE PREDOMINANT MALARIA PARASITE CLONE IN COMPLEX BLOOD STAGE INFECTIONS USING A MULTI-SNP MOLECULAR HAPLOTYPING ASSAY

    Science.gov (United States)

    COLE-TOBIAN, JENNIFER L.; ZIMMERMAN, PETER A.; KING, CHRISTOPHER L.

    2013-01-01

    Individuals living in malaria endemic areas are often infected with multiple parasite clones. Currently used single nucleotide polymorphism (SNP) genotyping methods for malaria parasites are cumbersome; furthermore, few methods currently exist that can rapidly determine the most abundant clone in these complex infections. Here we describe an oligonucleotide ligation assay (OLA) to distinguish SNPs in the Plasmodium vivax Duffy binding protein gene (Pvdbp) at 14 polymorphic residues simultaneously. Allele abundance is determined by the highest mean fluorescent intensity of each allele. Using mixtures of plasmids encoding known haplotypes of the Pvdbp, single clones of P. vivax parasites from infected Aotus monkeys, and well-defined mixed infections from field samples, we were able to identify the predominant Pvdbp genotype with > 93% accuracy when the dominant clone is twice as abundant as a lesser genotype and > 97% of the time if the ratio was 5:1 or greater. Thus, the OLA can accurately, reproducibly, and rapidly determine the predominant parasite haplotype in complex blood stage infections. PMID:17255222
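
    The allele-calling rule described above (the predominant allele at each residue is the one with the highest mean fluorescent intensity) can be sketched in Python as follows. The data layout, residue labels, and function name are hypothetical and are not taken from the published assay; real OLA data would also need background subtraction and thresholds that the abstract does not describe.

```python
from typing import Dict, List


def predominant_haplotype(
    mfi_by_residue: Dict[str, Dict[str, List[float]]]
) -> Dict[str, str]:
    """Pick, for each polymorphic residue, the allele with the highest mean
    fluorescent intensity (MFI) across replicate readings.

    mfi_by_residue maps a residue label to {allele: [replicate readings]}.
    """
    haplotype = {}
    for residue, alleles in mfi_by_residue.items():
        means = {allele: sum(vals) / len(vals) for allele, vals in alleles.items()}
        haplotype[residue] = max(means, key=means.get)
    return haplotype


# Hypothetical readings for two residues of a mixed infection.
example = {
    "residue_1": {"A": [1200.0, 1300.0], "G": [400.0, 500.0]},
    "residue_2": {"C": [300.0, 350.0], "T": [900.0, 1100.0]},
}
```

    Calling `predominant_haplotype(example)` selects allele A at the first residue and T at the second, because those alleles have the larger mean signal.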

  3. Allelopathic interference of alfalfa (Medicago sativa L.) genotypes to annual ryegrass (Lolium rigidum).

    Science.gov (United States)

    Zubair, Hasan Muhammad; Pratley, James E; Sandral, G A; Humphries, A

    2017-07-01

    Alfalfa (Medicago sativa L.) genotypes at varying densities were investigated for allelopathic impact using annual ryegrass (Lolium rigidum) as the target species in a laboratory bioassay. Three densities (15, 30, and 50 seedlings/beaker) and 40 alfalfa genotypes were evaluated by the equal compartment agar method (ECAM). Alfalfa genotypes displayed a range of allelopathic interference in ryegrass seedlings, reducing root length from 5 to 65%. The growth of ryegrass decreased in response to increasing density of alfalfa seedlings. At the lowest density, Q75 and Titan9 were the least allelopathic genotypes. An overall inhibition index was calculated to rank each alfalfa genotype. Reduction in seed germination of annual ryegrass occurred in the presence of several alfalfa genotypes including Force 10, Haymaster7 and SARDI Five. A comprehensive metabolomic analysis using Quadrupole Time of Flight (Q-TOF) was conducted to compare six alfalfa genotypes. Variation in chemical compounds was found between alfalfa root extracts and exudates and also between genotypes. Further individual compound assessments and quantitative study at greater chemical concentrations are needed to clarify the allelopathic activity. Considerable genetic variation exists among alfalfa genotypes for allelopathic activity, creating the opportunity for its use in weed suppression through selection.

  4. FunctSNP: an R package to link SNPs to functional knowledge and dbAutoMaker: a suite of Perl scripts to build SNP databases

    Directory of Open Access Journals (Sweden)

    Watson-Haigh Nathan S

    2010-06-01

    Full Text Available Background: Whole genome association studies using highly dense single nucleotide polymorphisms (SNPs) are a set of methods to identify DNA markers associated with variation in a particular complex trait of interest. One of the main outcomes from these studies is a subset of statistically significant SNPs. Finding the potential biological functions of such SNPs can be an important step towards further use in human and agricultural populations (e.g., for identifying genes related to susceptibility to complex diseases or genes playing key roles in development or performance). The current challenge is that the information holding the clues to SNP functions is distributed across many different databases. Efficient bioinformatics tools are therefore needed to seamlessly integrate up-to-date functional information on SNPs. Many web services have arisen to meet the challenge but most work only within the framework of human medical research. Although we acknowledge the importance of human research, we identify a need for SNP annotation tools for other organisms. Description: We introduce an R package called FunctSNP, which is the user interface to custom built species-specific databases. The local relational databases contain SNP data together with functional annotations extracted from online resources. FunctSNP provides a unified bioinformatics resource to link SNPs with functional knowledge (e.g., genes, pathways, ontologies). We also introduce dbAutoMaker, a suite of Perl scripts, which can be scheduled to run periodically to automatically create/update the customised SNP databases. We illustrate the use of FunctSNP with a livestock example, but the approach and software tools presented here can be applied also to human and other organisms. Conclusions: Finding the potential functional significance of SNPs is important when further using the outcomes from whole genome association studies. FunctSNP is unique in that it is the only R

  5. SNP Analysis and Whole Exome Sequencing: Their Application in the Analysis of a Consanguineous Pedigree Segregating Ataxia

    Directory of Open Access Journals (Sweden)

    Sarah L. Nickerson

    2015-10-01

    Full Text Available Autosomal recessive cerebellar ataxia encompasses a large and heterogeneous group of neurodegenerative disorders. We employed single nucleotide polymorphism (SNP) analysis and whole exome sequencing to investigate a consanguineous Maori pedigree segregating ataxia. We identified a novel mutation in exon 10 of the SACS gene: c.7962T>G p.(Tyr2654*), establishing the diagnosis of autosomal recessive spastic ataxia of Charlevoix-Saguenay (ARSACS). Our findings expand both the genetic and phenotypic spectrum of this rare disorder, and highlight the value of high-density SNP analysis and whole exome sequencing as powerful and cost-effective tools in the diagnosis of genetically heterogeneous disorders such as the hereditary ataxias.

  6. Determination of coefficient defining leaf area development in different genotypes, plant types and planting densities in peanut (Arachis hypogaea L.).

    Science.gov (United States)

    Halilou, Oumarou; Hissene, Halime Mahamat; Clavijo Michelangeli, José A; Hamidou, Falalou; Sinclair, Thomas R; Soltani, Afshin; Mahamane, Saadou; Vadez, Vincent

    2016-12-01

    Rapid leaf area development may be attractive under a number of cropping conditions to enhance the vigor of crop establishment and allow rapid canopy closure for maximizing light interception and shading of weed competitors. This study was undertaken to determine (1) if parameters describing leaf area development varied among ten peanut (Arachis hypogaea L.) genotypes grown in field and pot experiments, (2) if these parameters were affected by the planting density, and (3) if these parameters varied between Spanish and Virginia genotypes. Leaf area development was described in two steps: prediction of the main stem number of nodes based on phyllochron development, and prediction of plant leaf area based on main stem node number. There was no genetic variation in the phyllochron measured in the field. However, the phyllochron was much longer for plants grown in pots than for field-grown plants. These results indicated a negative aspect of growing peanut plants in the pots used in this experiment. In contrast to phyllochron, there was no difference in the relationship between plant leaf area and main stem node number between the pot and field experiments. However, there was genetic variation in both the pot and field experiments in the exponential coefficient (PLAPOW) of the power function used to describe leaf area development from node number. This genetic variation was confirmed in another experiment with a larger number of genotypes, although possible G × E interaction for the PLAPOW was found. Sowing density did not affect the power function relating leaf area to main stem node number. There was also no difference in the power function coefficient between Spanish and Virginia genotypes. SSM (Simple Simulation Model) reliably predicted leaf canopy development in groundnut. Indeed the leaf area showed a close agreement between predicted and observed values up to 60000 cm² m⁻². The slightly higher prediction in India and slightly lower prediction in

  7. High-throughput genotyping for species identification and diversity assessment in germplasm collections.

    Science.gov (United States)

    Mason, Annaliese S; Zhang, Jing; Tollenaere, Reece; Vasquez Teuber, Paula; Dalton-Morgan, Jessica; Hu, Liyong; Yan, Guijun; Edwards, David; Redden, Robert; Batley, Jacqueline

    2015-09-01

    Germplasm collections provide an extremely valuable resource for breeders and researchers. However, misclassification of accessions by species often hinders the effective use of these collections. We propose that use of high-throughput genotyping tools can provide a fast, efficient and cost-effective way of confirming species in germplasm collections, as well as providing valuable genetic diversity data. We genotyped 180 Brassicaceae samples sourced from the Australian Grains Genebank across the recently released Illumina Infinium Brassica 60K SNP array. Of these, 76 were provided on the basis of suspected misclassification and another 104 were sourced independently from the germplasm collection. Presence of the A- and C-genomes combined with principal components analysis clearly separated Brassica rapa, B. oleracea, B. napus, B. carinata and B. juncea samples into distinct species groups. Several lines were further validated using chromosome counts. Overall, 18% of samples (32/180) were misclassified on the basis of species. Within these 180 samples, 23/76 (30%) supplied on the basis of suspected misclassification were misclassified, and 9/104 (9%) of the samples randomly sourced from the Australian Grains Genebank were misclassified. Surprisingly, several individuals were also found to be the product of interspecific hybridization events. The SNP (single nucleotide polymorphism) array proved effective at confirming species, and provided useful information related to genetic diversity. As similar genomic resources become available for different crops, high-throughput molecular genotyping will offer an efficient and cost-effective method to screen germplasm collections worldwide, facilitating more effective use of these valuable resources by breeders and researchers. © 2015 John Wiley & Sons Ltd.

  8. QualitySNP: a pipeline for detecting single nucleotide polymorphisms and insertions/deletions in EST data from diploid and polyploid species

    Directory of Open Access Journals (Sweden)

    Voorrips Roeland E

    2006-10-01

    Full Text Available Background: Single nucleotide polymorphisms (SNPs) are important tools in studying complex genetic traits and genome evolution. Computational strategies for SNP discovery make use of the large number of sequences present in public databases (in most cases as expressed sequence tags (ESTs)) and are considered to be faster and more cost-effective than experimental procedures. A major challenge in computational SNP discovery is distinguishing allelic variation from sequence variation between paralogous sequences, in addition to recognizing sequencing errors. For the majority of the public EST sequences, trace or quality files are lacking, which makes detection of reliable SNPs even more difficult because it has to rely on sequence comparisons only. Results: We have developed a new algorithm to detect reliable SNPs and insertions/deletions (indels) in EST data, both with and without quality files. Implemented in a pipeline called QualitySNP, it uses three filters for the identification of reliable SNPs. Filter 1 screens for all potential SNPs and identifies variation between or within genotypes. Filter 2 is the core filter that uses a haplotype-based strategy to detect reliable SNPs. Clusters with potential paralogs as well as false SNPs caused by sequencing errors are identified. Filter 3 screens SNPs by calculating a confidence score, based upon sequence redundancy and quality. Non-synonymous SNPs are subsequently identified by detecting open reading frames of consensus sequences (contigs) with SNPs. The pipeline includes a data storage and retrieval system for haplotypes, SNPs and alignments. QualitySNP's versatility is demonstrated by the identification of SNPs in EST datasets from potato, chicken and humans. Conclusion: QualitySNP is an efficient tool for SNP detection, storage and retrieval in diploid as well as polyploid species. It is available for running on Linux or UNIX systems. The program, test data, and user manual are available at

  9. High density genetic mapping identifies new susceptibility loci for rheumatoid arthritis

    Science.gov (United States)

    Eyre, Steve; Bowes, John; Diogo, Dorothée; Lee, Annette; Barton, Anne; Martin, Paul; Zhernakova, Alexandra; Stahl, Eli; Viatte, Sebastien; McAllister, Kate; Amos, Christopher I.; Padyukov, Leonid; Toes, Rene E.M.; Huizinga, Tom W.J.; Wijmenga, Cisca; Trynka, Gosia; Franke, Lude; Westra, Harm-Jan; Alfredsson, Lars; Hu, Xinli; Sandor, Cynthia; de Bakker, Paul I.W.; Davila, Sonia; Khor, Chiea Chuen; Heng, Khai Koon; Andrews, Robert; Edkins, Sarah; Hunt, Sarah E; Langford, Cordelia; Symmons, Deborah; Concannon, Pat; Onengut-Gumuscu, Suna; Rich, Stephen S; Deloukas, Panos; Gonzalez-Gay, Miguel A.; Rodriguez-Rodriguez, Luis; Ärlsetig, Lisbeth; Martin, Javier; Rantapää-Dahlqvist, Solbritt; Plenge, Robert; Raychaudhuri, Soumya; Klareskog, Lars; Gregersen, Peter K; Worthington, Jane

    2012-01-01

    Summary: Using the Immunochip custom single nucleotide polymorphism (SNP) array, designed for dense genotyping of 186 genome-wide association study (GWAS) confirmed loci, we analysed 11,475 rheumatoid arthritis cases of European ancestry and 15,870 controls for 129,464 markers. The data were combined in meta-analysis with GWAS data from additional independent cases (n=2,363) and controls (n=17,872). We identified fourteen novel loci; nine were associated with rheumatoid arthritis overall and five specifically in anti-citrullinated peptide antibody positive disease, bringing the number of confirmed European ancestry rheumatoid arthritis loci to 46. We refined the peak of association to a single gene for 19 loci, identified secondary independent effects at six loci and association to low frequency variants (minor allele frequency < 0.05) at four loci. Bioinformatic analysis of the data generated strong hypotheses for the causal SNP at seven loci. This study illustrates the advantages of dense SNP mapping analysis to inform subsequent functional investigations. PMID:23143596

  10. SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.

    Science.gov (United States)

    Merelli, Ivan; Calabria, Andrea; Cozzi, Paolo; Viti, Federica; Mosca, Ettore; Milanesi, Luciano

    2013-01-01

    The capability of correlating specific genotypes with human diseases is a complex issue in spite of all advantages arisen from high-throughput technologies, such as Genome Wide Association Studies (GWAS). New tools for genetic variants interpretation and for Single Nucleotide Polymorphisms (SNPs) prioritization are actually needed. Given a list of the most relevant SNPs statistically associated to a specific pathology as result of a genotype study, a critical issue is the identification of genes that are effectively related to the disease by re-scoring the importance of the identified genetic variations. Vice versa, given a list of genes, it can be of great importance to predict which SNPs can be involved in the onset of a particular disease, in order to focus the research on their effects. We propose a new bioinformatics approach to support biological data mining in the analysis and interpretation of SNPs associated to pathologies. This system can be employed to design custom genotyping chips for disease-oriented studies and to re-score GWAS results. The proposed method relies (1) on the data integration of public resources using a gene-centric database design, (2) on the evaluation of a set of static biomolecular annotations, defined as features, and (3) on the SNP scoring function, which computes SNP scores using parameters and weights set by users. We employed a machine learning classifier to set default feature weights and an ontological annotation layer to enable the enrichment of the input gene set. We implemented our method as a web tool called SNPranker 2.0 (http://www.itb.cnr.it/snpranker), improving our first published release of this system. A user-friendly interface allows the input of a list of genes, SNPs or a biological process, and to customize the features set with relative weights. As result, SNPranker 2.0 returns a list of SNPs, localized within input and ontologically enriched genes, combined with their prioritization scores. Different

  11. Diversity in 113 cowpea [Vigna unguiculata (L) Walp] accessions assessed with 458 SNP markers.

    Science.gov (United States)

    Egbadzor, Kenneth F; Ofori, Kwadwo; Yeboah, Martin; Aboagye, Lawrence M; Opoku-Agyeman, Michael O; Danquah, Eric Y; Offei, Samuel K

    2014-01-01

    Single Nucleotide Polymorphism (SNP) markers were used in characterization of 113 cowpea accessions comprising 108 from Ghana and 5 from abroad. Leaf tissues from plants cultivated at the University of Ghana were genotyped at KBioscience in the United Kingdom. Data were generated for 477 SNPs, of which 458 revealed polymorphism. The results were used to analyze genetic dissimilarity among the accessions using DARwin 5 software. The markers discriminated among all of the cowpea accessions, and the dissimilarity values, which ranged from 0.006 to 0.63, were used for a factorial plot. Unexpectedly high levels of heterozygosity were observed in some of the accessions. Accessions known to be closely related clustered together in a dendrogram drawn with the WPGMA method. A maximum length sub-tree comprising 48 core accessions was constructed. The software package STRUCTURE was used to separate accessions into three groups, and the programme correctly identified varieties that were known hybrids. The hybrids were those accessions with numerous heterozygous loci. The STRUCTURE plot showed closely related accessions with similar genome patterns. The SNP markers were more efficient in discriminating among the cowpea germplasm than morphological, seed protein polymorphism and simple sequence repeat studies reported earlier on the same collection.
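
    The pairwise dissimilarity and heterozygosity ideas above can be sketched in Python. This is an allele-sharing dissimilarity for diploid SNP calls written for illustration; it is an assumption, not the exact index computed by the software used in the study, and the genotype strings and function names are hypothetical.

```python
from collections import Counter
from typing import Optional, Sequence


def allelic_dissimilarity(a: Sequence[Optional[str]],
                          b: Sequence[Optional[str]]) -> float:
    """1 minus the mean proportion of shared alleles over loci called in both.

    Genotypes are two-letter strings such as 'AA' or 'AG'; None is a missing call.
    """
    shared_total = 0.0
    loci = 0
    for ga, gb in zip(a, b):
        if ga is None or gb is None:
            continue  # skip loci missing in either accession
        shared = sum((Counter(ga) & Counter(gb)).values())  # 0, 1 or 2 alleles
        shared_total += shared / 2
        loci += 1
    if loci == 0:
        raise ValueError("no loci called in both accessions")
    return 1.0 - shared_total / loci


def heterozygosity(genotypes: Sequence[Optional[str]]) -> float:
    """Fraction of called loci that are heterozygous (useful for flagging hybrids)."""
    called = [g for g in genotypes if g is not None]
    if not called:
        raise ValueError("no called genotypes")
    return sum(1 for g in called if g[0] != g[1]) / len(called)
```

    A matrix of such pairwise values is the usual input for the clustering and factorial plots mentioned in the abstract.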

  12. Multi-Population Selective Genotyping to Identify Soybean [Glycine max (L.) Merr.] Seed Protein and Oil QTLs.

    Science.gov (United States)

    Phansak, Piyaporn; Soonsuwon, Watcharin; Hyten, David L; Song, Qijian; Cregan, Perry B; Graef, George L; Specht, James E

    2016-06-01

    Plant breeders continually generate ever-higher yielding cultivars, but also want to improve seed constituent value, which is mainly protein and oil, in soybean [Glycine max (L.) Merr.]. Identification of genetic loci governing those two traits would facilitate that effort. Though genome-wide association offers one such approach, selective genotyping of multiple biparental populations offers a complementary alternative, and was evaluated here, using 48 F2:3 populations (n = ∼224 plants) created by mating 48 high protein germplasm accessions to cultivars of similar maturity, but with normal seed protein content. All F2:3 progeny were phenotyped for seed protein and oil, but only 22 high and 22 low extreme progeny in each F2:3 phenotypic distribution were genotyped with a 1536-SNP chip (ca 450 bimorphic SNPs detected per mating). A significant quantitative trait locus (QTL) on one or more chromosomes was detected for protein in 35 (73%), and for oil in 25 (52%), of the 48 matings, and these QTL exhibited additive effects of ≥ 4 g kg⁻¹ and R² values of 0.07 or more. These results demonstrated that a multiple-population selective genotyping strategy, when focused on matings between parental phenotype extremes, can be used successfully to identify germplasm accessions possessing large-effect QTL alleles. Such accessions would be of interest to breeders to serve as parental donors of those alleles in cultivar development programs, though 17 of the 48 accessions were not unique in terms of SNP genotype, indicating that diversity among high protein accessions in the germplasm collection is less than what might ordinarily be assumed. Copyright © 2016 Phansak et al.
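
    The selective genotyping step described above (genotype only the 22 highest and 22 lowest progeny of each population) can be sketched in Python. The function name and the dictionary layout are hypothetical, and ties are broken by sort order, which the abstract does not address.

```python
from typing import Dict, List, Tuple


def select_extremes(phenotypes: Dict[str, float],
                    n_tail: int = 22) -> Tuple[List[str], List[str]]:
    """Return (low_ids, high_ids): the n_tail progeny with the lowest and the
    highest trait values, i.e. the individuals that would be sent for genotyping.
    """
    if len(phenotypes) < 2 * n_tail:
        raise ValueError("population too small for non-overlapping tails")
    ranked = sorted(phenotypes, key=phenotypes.get)  # ascending by trait value
    return ranked[:n_tail], ranked[-n_tail:]


# Synthetic population of 224 progeny with made-up seed protein values.
population = {f"plant_{i:03d}": 380.0 + 0.5 * i for i in range(224)}
low_tail, high_tail = select_extremes(population)
```

    Only the 44 selected plants per population are genotyped, which is what keeps the cost of screening 48 populations manageable.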

  13. Genome Wide Association Study of SNP-, Gene-, and Pathway-based Approaches to Identify Genes Influencing Susceptibility to Staphylococcus aureus Infections

    Directory of Open Access Journals (Sweden)

    Zhan Ye

    2014-05-01

    Full Text Available Background: We conducted a genome-wide association study (GWAS) to identify specific genetic variants that underlie susceptibility to disease caused by Staphylococcus aureus in humans. Methods: Cases (n=309) and controls (n=2,925) were genotyped at 508,921 single nucleotide polymorphisms (SNPs). Cases had at least one laboratory and clinician confirmed disease caused by S. aureus whereas controls did not. R-package (for SNP association), EIGENSOFT (to estimate and adjust for population stratification) and gene- (VEGAS) and pathway-based (DAVID, PANTHER, and Ingenuity Pathway Analysis) analyses were performed. Results: No SNP reached genome-wide significance. Four SNPs exceeded the p... Conclusion: We identified potential susceptibility genes for S. aureus diseases in this preliminary study but confirmation by other studies is needed. The observed associations could be relevant given the complexity of S. aureus as a pathogen and its ability to exploit multiple biological pathways to cause infections in humans.

  14. Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.)

    Directory of Open Access Journals (Sweden)

    Carole F S Koning-Boucoiran

    2015-04-01

    Full Text Available In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this, RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular.

  15. Genotype-Phenotype Associations of the CD-Associated Single Nucleotide Polymorphism within the Gene Locus Encoding Protein Tyrosine Phosphatase Non-Receptor Type 22 in Patients of the Swiss IBD Cohort.

    Directory of Open Access Journals (Sweden)

    Marianne R Spalinger

    Full Text Available Protein tyrosine phosphatase non-receptor type 22 (PTPN22) plays an important role in immune cell function and intestinal homeostasis. The single nucleotide polymorphism (SNP) rs2476601 within the PTPN22 gene locus results in aberrant function of PTPN22 protein and protects from Crohn's disease (CD). Here, we investigated associations of PTPN22 SNP rs2476601 in inflammatory bowel disease (IBD) patients in the Swiss IBD Cohort Study (SIBDCS). A total of 2,028 SIBDCS patients (1,173 CD and 855 ulcerative colitis (UC) patients) were included. The clinical characteristics were analysed for an association with the presence of the PTPN22 SNP rs2476601 genotypes 'homozygous variant' (AA), 'heterozygous' (GA) and 'homozygous wild-type' (GG). Thirteen patients (0.6%) were homozygous variant (AA) for the PTPN22 polymorphism, 269 (13.3%) heterozygous variant (GA) and 1,746 (86.1%) homozygous wild-type (GG). In CD, AA and GA genotypes were associated with less use of steroids and antibiotics, and reduced prevalence of vitamin D and calcium deficiency. In UC the AA and GA genotypes were associated with increased use of azathioprine and anti-TNF antibodies, but significantly fewer patients with the PTPN22 variant featured malabsorption syndrome (p = 0.026). Our study for the first time addressed how presence of SNP rs2476601 within the PTPN22 gene affects clinical characteristics in IBD patients. Several factors that correlate with more severe disease were found to be less common in CD patients carrying the A-allele, pointing towards a protective role for this variant in affected CD patients. In UC patients, however, we found the opposite trend, suggesting a disease-promoting effect of the A-allele.
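
    The genotype percentages reported above follow directly from the three genotype counts, and the same counts give the frequency of the A allele. A minimal Python sketch of that arithmetic follows; the function name is hypothetical, and the allele frequency is a derived value that the abstract itself does not report.

```python
from typing import Dict, Tuple


def genotype_summary(n_aa: int, n_ga: int, n_gg: int) -> Tuple[int, Dict[str, float], float]:
    """Return (total patients, genotype percentages, A-allele frequency).

    Each AA carrier contributes two A alleles and each GA carrier one,
    out of 2 * total alleles in a diploid sample.
    """
    total = n_aa + n_ga + n_gg
    percentages = {
        "AA": round(100.0 * n_aa / total, 1),
        "GA": round(100.0 * n_ga / total, 1),
        "GG": round(100.0 * n_gg / total, 1),
    }
    freq_a = (2 * n_aa + n_ga) / (2 * total)
    return total, percentages, freq_a


# Counts taken from the abstract: 13 AA, 269 GA, 1746 GG.
total, percentages, freq_a = genotype_summary(13, 269, 1746)
```

    With these counts the total is 2028 patients, the percentages round to 0.6, 13.3 and 86.1, and the A allele accounts for 295 of 4056 alleles (about 7.3%).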

  16. Reference-free SNP discovery for the Eurasian beaver from restriction site-associated DNA paired-end data.

    Science.gov (United States)

    Senn, Helen; Ogden, Rob; Cezard, Timothee; Gharbi, Karim; Iqbal, Zamin; Johnson, Eric; Kamps-Hughes, Nick; Rosell, Frank; McEwing, Ross

    2013-06-01

    In this study, we used restriction site-associated DNA (RAD) sequencing to discover SNP markers suitable for population genetic and parentage analysis with the aim of using them for monitoring the reintroduction of the Eurasian beaver (Castor fiber) to Scotland. In the absence of a reference genome for beaver, we built contigs and discovered SNPs within them using paired-end RAD data, so as to have sufficient flanking region around the SNPs to conduct marker design. To do this, we used a simple pipeline which catalogued the Read 1 data in Stacks and then used the assembler cortex_var to conduct de novo assembly and genotyping of multiple samples using the Read 2 data. The analysis of around 1.1 billion short reads of sequence data was reduced to a set of 2579 high-quality candidate SNP markers that were polymorphic in Norwegian and Bavarian beaver. Both laboratory validation of a subset of eight of the SNPs (1.3% error) and internal validation by confirming patterns of Mendelian inheritance in a family group (0.9% error) confirmed the success of this approach. © 2013 John Wiley & Sons Ltd.
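
    The internal validation mentioned above rests on checking that offspring genotypes are compatible with Mendelian inheritance from both parents. A minimal Python sketch of such a check for biallelic SNPs follows; the trio layout, genotype strings, and function names are assumptions for illustration and do not reproduce the authors' pipeline.

```python
from itertools import product
from typing import Iterable, Tuple


def mendelian_consistent(mother: str, father: str, offspring: str) -> bool:
    """True if the offspring genotype can be formed from one maternal and one
    paternal allele. Genotypes are unordered two-letter strings such as 'AG'.
    """
    possible = {"".join(sorted(m + f)) for m, f in product(mother, father)}
    return "".join(sorted(offspring)) in possible


def mendelian_error_rate(trios: Iterable[Tuple[str, str, str]]) -> float:
    """Fraction of (mother, father, offspring) genotype trios that violate
    Mendelian inheritance."""
    results = [mendelian_consistent(m, f, o) for m, f, o in trios]
    if not results:
        raise ValueError("no trios supplied")
    return results.count(False) / len(results)


# Hypothetical genotype calls at four SNPs for one family trio.
calls = [("AG", "AA", "AG"), ("AA", "AA", "AG"), ("AG", "AG", "GG"), ("CC", "TT", "CT")]
```

    Here only the second trio is impossible (two AA parents cannot produce an AG offspring), so one call in four is flagged as an error.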

  17. Genotyping-by-sequencing (GBS), an ultimate marker-assisted selection (MAS) tool to accelerate plant breeding

    OpenAIRE

    He, Jiangfeng; Zhao, Xiaoqing; Laroche, André; Lu, Zhen-Xiang; Liu, HongKui; Li, Ziqin

    2014-01-01

    Marker-assisted selection (MAS) refers to the use of molecular markers to assist phenotypic selections in crop improvement. Several types of molecular markers, such as single nucleotide polymorphism (SNP), have been identified and effectively used in plant breeding. The application of next-generation sequencing (NGS) technologies has led to remarkable advances in whole genome sequencing, which provides ultra-throughput sequences to revolutionize plant genotyping and breeding. To further broad...

  18. Molecular phylogeny and SNP variation of polar bears (Ursus maritimus), brown bears (U. arctos), and black bears (U. americanus) derived from genome sequences.

    Science.gov (United States)

    Cronin, Matthew A; Rincon, Gonzalo; Meredith, Robert W; MacNeil, Michael D; Islas-Trejo, Alma; Cánovas, Angela; Medrano, Juan F

    2014-01-01

    We assessed the relationships of polar bears (Ursus maritimus), brown bears (U. arctos), and black bears (U. americanus) with high throughput genomic sequencing data with an average coverage of 25× for each species. A total of 1.4 billion 100-bp paired-end reads were assembled using the polar bear and annotated giant panda (Ailuropoda melanoleuca) genome sequences as references. We identified 13.8 million single nucleotide polymorphisms (SNP) in the 3 species aligned to the polar bear genome. These data indicate that polar bears and brown bears share more SNP with each other than either does with black bears. Concatenation and coalescence-based analysis of consensus sequences of approximately 1 million base pairs of ultraconserved elements in the nuclear genome resulted in a phylogeny with black bears as the sister group to brown and polar bears, and all brown bears are in a separate clade from polar bears. Genotypes for 162 SNP loci of 336 bears from Alaska and Montana showed that the species are genetically differentiated and there is geographic population structure of brown and black bears but not polar bears.

  19. The rs3957357C>T SNP in GSTA1 Is Associated with a Higher Risk of Occurrence of Hepatocellular Carcinoma in European Individuals.

    Directory of Open Access Journals (Sweden)

    Hanane Akhdar

    Full Text Available Glutathione S-transferases (GSTs) detoxify toxic molecules by conjugation with reduced glutathione and regulate cell signaling. Single nucleotide polymorphisms (SNPs) of GST genes have been suggested to affect GST functions and thus to increase the risk of human hepatocellular carcinoma (HCC). As GSTA1 is expressed in hepatocytes and the rs3957357C>T (TT) SNP is known to downregulate GSTA1 mRNA expression, the aims of this study were: (i) to explore the relationship between the TT SNP in GSTA1 and the occurrence of HCC; (ii) to measure GSTA1 mRNA expression in HCCs. For that purpose, we genotyped non-tumor-tissue-derived DNA from 48 HCC patients and white-blood-cell-derived DNA from 37 healthy individuals by restriction fragment length polymorphism (RFLP). In addition, expression of GSTA1 mRNA was assessed by real-time PCR in 18 matching pairs of HCCs and non-tumor livers. Survival analysis was performed on an annotated microarray dataset containing 247 HCC patients (GSE14520). The GSTA1 TT genotype was more frequent in HCC than in non-HCC patients (27% versus 5%, respectively), suggesting that individuals carrying this genotype could be associated with a 2-fold higher risk of developing HCCs (odds ratio = 2.1; p = 0.02). Also, we found that GSTA1 mRNA expression was lower in HCCs than in non-tumor livers. HCCs expressing the highest GSTA1 mRNA levels were the smallest in size (R = -0.67; p = 0.007), expressed the highest levels of liver-enriched genes such as ALB (albumin, R = -0.67; p = 0.007) and COL18A1 (procollagen type XVIII, R = -0.50; p = 0.03) and showed the most favorable disease-free (OR = 0.54; p < 0.001) and overall (OR = 0.56; p = 0.006) outcomes. Moreover, GSTA1 was found within a 263-gene network involved in well-differentiated hepatocyte functions. In conclusion, HCCs are characterized by two GSTA1 features: the TT SNP and reduced GSTA1 gene expression in a context of hepatocyte de-differentiation.

  20. Assessing SNP-SNP interactions among DNA repair, modification and metabolism related pathway genes in breast cancer susceptibility.

    Directory of Open Access Journals (Sweden)

    Yadav Sapkota

    Full Text Available Genome-wide association studies (GWASs) have identified low-penetrance common variants (i.e., single nucleotide polymorphisms, SNPs) associated with breast cancer susceptibility. Although GWASs are primarily focused on single-locus effects, gene-gene interactions (i.e., epistasis) are also assumed to contribute to the genetic risks for complex diseases including breast cancer. While it has been hypothesized that moderately ranked (P value based) weak single-locus effects in GWASs could potentially harbor valuable information for evaluating epistasis, we lack systematic efforts to investigate SNPs showing consistent associations with weak statistical significance across independent discovery and replication stages. The objectives of this study were (i) to select SNPs showing single-locus effects with weak statistical significance for breast cancer in a GWAS and/or candidate-gene studies; (ii) to replicate these SNPs in an independent set of breast cancer cases and controls; and (iii) to explore their potential SNP-SNP interactions contributing to breast cancer susceptibility. A total of 17 SNPs related to DNA repair, modification and metabolism pathway genes were selected since these pathways offer a priori knowledge for potential epistatic interactions and an overall role in breast carcinogenesis. The study design included predominantly Caucasian women (2,795 cases and 4,505 controls) from Alberta, Canada. We observed two two-way SNP-SNP interactions (APEX1-rs1130409 and RPAP1-rs2297381; MLH1-rs1799977 and MDM2-rs769412) in logistic regression that conferred elevated risks for breast cancer (P(interaction) < 7.3 × 10^-3). Logic regression identified an interaction involving four SNPs (MBD2-rs4041245, MLH1-rs1799977, MDM2-rs769412, BRCA2-rs1799943) (P(permutation) = 2.4 × 10^-3). SNPs involved in SNP-SNP interactions also showed single-locus effects with weak statistical significance, while BRCA2-rs1799943 showed stronger statistical significance (P

  1. Simultaneous SNP identification and assessment of allele-specific bias from ChIP-seq data

    Directory of Open Access Journals (Sweden)

    Ni Yunyun

    2012-09-01

    Full Text Available Abstract. Background: Single nucleotide polymorphisms (SNPs) have been associated with many aspects of human development and disease, and many non-coding SNPs associated with disease risk are presumed to affect gene regulation. We have previously shown that SNPs within transcription factor binding sites can affect transcription factor binding in an allele-specific and heritable manner. However, such analysis has relied on prior whole-genome genotypes provided by large external projects such as HapMap and the 1000 Genomes Project. This requirement limits the study of allele-specific effects of SNPs in primary patient samples from diseases of interest, where complete genotypes are not readily available. Results: In this study, we show that we are able to identify SNPs de novo and accurately from ChIP-seq data generated in the ENCODE Project. Our de novo identified SNPs from ChIP-seq data are highly concordant with published genotypes. Independent experimental verification of more than 100 sites estimates our false discovery rate at less than 5%. Analysis of transcription factor binding at de novo identified SNPs revealed widespread heritable allele-specific binding, confirming previous observations. SNPs identified from ChIP-seq datasets were significantly enriched for disease-associated variants, and we identified dozens of allele-specific binding events in non-coding regions that could distinguish between disease and normal haplotypes. Conclusions: Our approach combines SNP discovery, genotyping and allele-specific analysis, but is selectively focused on functional regulatory elements occupied by transcription factors or epigenetic marks, and will therefore be valuable for identifying the functional regulatory consequences of non-coding SNPs in primary disease samples.

  2. Brain-derived neurotrophic factor Val66Met genotype modulates amygdala habituation.

    Science.gov (United States)

    Perez-Rodriguez, M Mercedes; New, Antonia S; Goldstein, Kim E; Rosell, Daniel; Yuan, Qiaoping; Zhou, Zhifeng; Hodgkinson, Colin; Goldman, David; Siever, Larry J; Hazlett, Erin A

    2017-05-30

    A deficit in amygdala habituation to repeated emotional stimuli may be an endophenotype of disorders characterized by emotion dysregulation, such as borderline personality disorder (BPD). Amygdala reactivity to emotional stimuli is genetically modulated by brain-derived neurotrophic factor (BDNF) variants. Whether amygdala habituation itself is also modulated by BDNF genotypes remains unknown. We used imaging-genetics to examine the effect of BDNF Val66Met genotypes on amygdala habituation to repeated emotional stimuli. We used functional magnetic resonance imaging (fMRI) in 57 subjects (19 BPD patients, 18 patients with schizotypal personality disorder [SPD] and 20 healthy controls [HC]) during a task involving viewing of unpleasant, neutral, and pleasant pictures, each presented twice to measure habituation. Amygdala responses across genotypes (Val66Met SNP Met allele-carriers vs. Non-Met carriers) and diagnoses (HC, BPD, SPD) were examined with ANOVA. The BDNF 66Met allele was significantly associated with a deficit in amygdala habituation, particularly for emotional pictures. The association of the 66Met allele with a deficit in habituation to unpleasant emotional pictures remained significant in the subsample of BPD patients. Using imaging-genetics, we found preliminary evidence that deficient amygdala habituation may be modulated by BDNF genotype. Copyright © 2017. Published by Elsevier B.V.

  3. When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.

    Science.gov (United States)

    Gardner, Shea N; Hall, Barry G

    2013-01-01

    Effective use of rapid and inexpensive whole genome sequencing for microbes requires fast, memory efficient bioinformatics tools for sequence comparison. The kSNP v2 software finds single nucleotide polymorphisms (SNPs) in whole genome data. kSNP v2 has numerous improvements over kSNP v1 including SNP gene annotation; better scaling for draft genomes available as assembled contigs or raw, unassembled reads; a tool to identify the optimal value of k; distribution of packages of executables for Linux and Mac OS X for ease of installation and use; and a detailed User Guide. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a single reference genome. Most target sets with hundreds of genomes complete in minutes to hours. SNP phylogenies are built by maximum likelihood, parsimony, and distance, based on all SNPs, only core SNPs, or SNPs present in some intermediate user-specified fraction of targets. The SNP-based trees that result are consistent with known taxonomy. kSNP v2 can handle many gigabases of sequence in a single run, and if one or more annotated genomes are included in the target set, SNPs are annotated with protein coding and other information (UTRs, etc.) from Genbank file(s). We demonstrate application of kSNP v2 on sets of viral and bacterial genomes, and discuss in detail analysis of a set of 68 finished E. coli and Shigella genomes and a set of the same genomes to which have been added 47 assemblies and four "raw read" genomes of O104:H4 strains from the recent European E. coli outbreak that resulted in both bloody diarrhea and hemolytic uremic syndrome (HUS), and caused at least 50 deaths.
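
    The alignment-free idea in this abstract can be illustrated with a toy sketch. It is not the kSNP v2 implementation: the function name `kmer_snps`, the default k, and the example sequences are assumptions made for illustration, and reverse complements, sequencing errors, and k selection are ignored. A candidate SNP is reported when an odd-length k-mer shares both flanks across genomes but differs at its centre base.

```python
from collections import defaultdict


def kmer_snps(genomes, k=5):
    """Toy alignment-free SNP finder (illustrative only, not kSNP itself).

    genomes: dict mapping a genome name to its sequence string.
    Returns {(left_flank, right_flank): {genome_name: centre_base}} for every
    flanking context whose centre base differs between genomes.
    """
    if k % 2 == 0:
        raise ValueError("k must be odd so that each k-mer has a centre base")
    half = k // 2
    # context -> genome -> set of centre bases observed with that context
    contexts = defaultdict(lambda: defaultdict(set))
    for name, seq in genomes.items():
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            key = (kmer[:half], kmer[half + 1:])
            contexts[key][name].add(kmer[half])

    snps = {}
    for key, per_genome in contexts.items():
        # Discard contexts that are ambiguous within any single genome.
        if any(len(bases) != 1 for bases in per_genome.values()):
            continue
        alleles = {name: next(iter(bases)) for name, bases in per_genome.items()}
        if len(set(alleles.values())) > 1:
            snps[key] = alleles
    return snps


# Hypothetical sequences differing at a single position (C versus T).
example = kmer_snps({"g1": "TTGACAGGCT", "g2": "TTGATAGGCT"}, k=5)
```

    Because only exact flank matches are compared, no multiple sequence alignment or reference genome is needed, which is the property the abstract emphasizes.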

  4. [Genotyping of ABO Blood Group in Partial Population of Yunnan Province by SNaPshot Technology].

    Science.gov (United States)

    Yu, S X; Zeng, F M; Jin, Y Z; Wan, H J; Zhai, D; Xing, Y M; Cheng, B W

    2017-06-01

    To detect the genotype of ABO blood group by SNaPshot technology. DNA was extracted from the peripheral blood samples with known blood groups (obtained by serology) of 107 unrelated individuals in Yunnan. Six SNP loci at the 261st, 297th, 681st, 703rd, 802nd, and 803rd nucleotide positions were detected by SNaPshot Multiplex kit, and relevant genetic parameters were calculated. In 107 blood samples, the allele frequencies of types A, B, O A, and O G were 0.3551, 0.1682, 0.2300 and 0.2476, respectively, while types A G and cis AB were not detected. The genotyping results of ABO blood group were consistent with those of serologic testing. SNaPshot technology can be adapted for genotyping of ABO blood group. Copyright© by the Editorial Department of Journal of Forensic Medicine
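
    As a hedged illustration of how allele frequencies such as those above are obtained by direct gene counting, the following sketch counts alleles over diploid genotypes. The function name `allele_frequencies` and the four example genotypes are hypothetical and are not data from the study.

```python
from collections import Counter


def allele_frequencies(genotypes):
    """Estimate allele frequencies by gene counting.

    genotypes: iterable of (allele1, allele2) pairs, one pair per individual.
    Returns {allele: frequency}, where frequencies sum to 1.
    """
    counts = Counter(allele for pair in genotypes for allele in pair)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no genotypes supplied")
    return {allele: n / total for allele, n in counts.items()}


# Hypothetical genotypes for four individuals (eight alleles in total).
freqs = allele_frequencies([("A", "O"), ("B", "O"), ("A", "A"), ("O", "O")])
```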

  5. Extending the scope of diagnostic chromosome analysis: detection of single gene defects using high-resolution SNP microarrays.

    Science.gov (United States)

    Bruno, Damien L; Stark, Zornitza; Amor, David J; Burgess, Trent; Butler, Kathy; Corrie, Sylvea; Francis, David; Ganesamoorthy, Devika; Hills, Louise; James, Paul A; O'Rielly, Darren; Oertel, Ralph; Savarirayan, Ravi; Prabhakara, Krishnamurthy; Salce, Nicholas; Slater, Howard R

    2011-12-01

    Microarray analysis has provided significant advances in the diagnosis of conditions resulting from submicroscopic chromosome abnormalities. It has been recommended that array testing should be a "first tier" test in the evaluation of individuals with intellectual disability, developmental delay, congenital anomalies, and autism. The availability of arrays with increasingly high probe coverage and resolution has increased the detection of decreasingly small copy number changes (CNCs) down to the intragenic or even exon level. Importantly, arrays that genotype SNPs also detect extended regions of homozygosity. We describe 14 examples of single gene disorders caused by intragenic changes from a consecutive set of 6,500 tests using high-resolution SNP microarrays. These cases illustrate the increased scope of cytogenetic testing beyond dominant chromosome rearrangements that typically contain many genes. Nine of the cases confirmed the clinical diagnosis, that is, followed a "phenotype to genotype" approach. Five were diagnosed by the laboratory analysis in the absence of a specific clinical diagnosis, that is, followed a "genotype to phenotype" approach. Two were clinically significant, incidental findings. The importance of astute clinical assessment and laboratory-clinician consultation is emphasized to optimize the value of microarrays in the diagnosis of disorders caused by single gene copy number and sequence mutations. © 2011 Wiley-Liss, Inc.

  6. Clonal diversity analysis using SNP microarray: a new prognostic tool for chronic lymphocytic leukemia.

    Science.gov (United States)

    Zhang, Linsheng; Znoyko, Iya; Costa, Luciano J; Conlin, Laura K; Daber, Robert D; Self, Sally E; Wolff, Daynna J

    2011-12-01

    Chronic lymphocytic leukemia (CLL) is a clinically heterogeneous disease. The methods currently used for monitoring CLL and determining conditions for treatment are limited in their ability to predict disease progression, patient survival, and response to therapy. Although clonal diversity and the acquisition of new chromosomal abnormalities during the disease course (clonal evolution) have been associated with disease progression, their prognostic potential has been underappreciated because cytogenetic and fluorescence in situ hybridization (FISH) studies have a restricted ability to detect genomic abnormalities and clonal evolution. We hypothesized that whole genome analysis using high resolution single nucleotide polymorphism (SNP) microarrays would be useful to detect diversity and infer clonal evolution to offer prognostic information. In this study, we used the Infinium Omni1 BeadChip (Illumina, San Diego, CA) array for the analysis of genetic variation and percent mosaicism in 25 non-selected CLL patients to explore the prognostic value of the assessment of clonal diversity in patients with CLL. We calculated the percentage of mosaicism for each abnormality by applying a mathematical algorithm to the genotype frequency data and by manual determination using the Simulated DNA Copy Number (SiDCoN) tool, which was developed from a computer model of mosaicism. At least one genetic abnormality was identified in each case, and the SNP data was 98% concordant with FISH results. Clonal diversity, defined as the presence of two or more genetic abnormalities with differing percentages of mosaicism, was observed in 12 patients (48%), and the diversity correlated with the disease stage. Clonal diversity was present in most cases of advanced disease (Rai stages III and IV) or those with previous treatment, whereas 9 of 13 patients without detected clonal diversity were asymptomatic or clinically stable. 
    In conclusion, SNP microarray studies with simultaneous evaluation

  7. The iSelect 9 K SNP analysis revealed polyploidization induced revolutionary changes and intense human selection causing strong haplotype blocks in wheat.

    Science.gov (United States)

    Hao, Chenyang; Wang, Yuquan; Chao, Shiaoman; Li, Tian; Liu, Hongxia; Wang, Lanfen; Zhang, Xueyong

    2017-01-30

    A Chinese wheat mini core collection was genotyped using the wheat 9 K iSelect SNP array. In total, 2420 and 2396 polymorphic SNPs were detected on the A and the B genome chromosomes, respectively, which formed 878 haplotype blocks. There were more blocks in the B genome, but the average block size was significantly (P polyploidization of wheat (both tetraploidization and hexaploidization) induced revolutionary changes in both the A and the B genomes, with a greater increase of gene diversity compared to their diploid ancestors. Modern breeding has dramatically increased diversity in the gene coding regions, though obvious blocks were formed on most of the chromosomes in both tetraploid and hexaploid wheats. Tag-SNP markers identified in this study can be used for marker assisted selection using haplotype blocks as a wheat breeding strategy. This strategy can also be employed to facilitate genome selection in other self-pollinating crop species.

  8. Genotype by environment interaction for adult body weights of shrimp Penaeus vannamei when grown at low and high densities

    Directory of Open Access Journals (Sweden)

    Famula Thomas R

    2008-09-01

    Full Text Available Abstract Shrimp is one of few marine species cultured worldwide for which several selective breeding programs are being conducted. One environmental factor that can affect the response to selection in breeding programs is the density at which the shrimp are cultured (low-medium-high). Phenotypic plasticity in the growth response to different densities might be accompanied by a significant genotype by environment interaction, evidenced by a change in heritabilities between environments and by a genetic correlation less than one for a unique trait between environments. Our goal was to understand whether different growth densities affect estimates of those genetic parameters for adult body weight (BW) in the Pacific white shrimp (Penaeus vannamei). BW heritabilities were significantly different between environments, with the largest at high density. These differences resulted from both an increased additive genetic variance and a decreased environmental variance when grown at high density. The genetic correlation between BWs at the two environmental conditions was significantly less than one. Whereas these results might be suggestive for carrying out shrimp selective breeding for BW under high density conditions, further understanding of genetic correlations between growth and reproductive traits within a given environment is necessary, as there are indications of reduced reproductive fitness for shrimp grown at high densities.
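
    The link the abstract draws between variance components and heritability can be sketched with the textbook definition of narrow-sense heritability, h^2 = V_A / (V_A + V_E). This is a simplification that ignores non-additive genetic variance, and the variance values below are hypothetical, not estimates from the study.

```python
def narrow_sense_heritability(additive_var, environmental_var):
    """Return h^2 = V_A / (V_A + V_E), ignoring non-additive genetic variance."""
    phenotypic_var = additive_var + environmental_var
    if phenotypic_var <= 0:
        raise ValueError("total phenotypic variance must be positive")
    return additive_var / phenotypic_var


# Hypothetical variance components (illustrative only).
low_density = narrow_sense_heritability(additive_var=2.0, environmental_var=8.0)
high_density = narrow_sense_heritability(additive_var=3.0, environmental_var=6.0)
```

    With these made-up values, a larger additive variance together with a smaller environmental variance raises h^2, which is the direction of change the abstract reports at high density.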

  9. Detection of superior genotype of fatty acid synthase in Korean native cattle by an environment-adjusted statistical model

    Directory of Open Access Journals (Sweden)

    Jea-Young Lee

    2017-06-01

    Full Text Available Objective: This study examines the genetic factors influencing the phenotypes (four economic traits: oleic acid [C18:1], monounsaturated fatty acids, carcass weight, and marbling score) of Hanwoo. Methods: To enhance the accuracy of the genetic analysis, the study proposes a new statistical model that excludes environmental factors. A statistically adjusted analysis of covariance model of environmental and genetic factors was developed, and estimated environmental effects (covariate effects of age and effects of calving farms) were excluded from the model. Results: The accuracy was compared before and after adjustment. The accuracy of the best single nucleotide polymorphism (SNP) in C18:1 increased from 60.16% to 74.26%, and that of the two-factor interaction increased from 58.69% to 87.19%. Also, superior SNPs and SNP interactions were identified using the multifactor dimensionality reduction method in Tables 1 to 4. Finally, high- and low-risk genotypes were compared based on their mean scores for each trait. Conclusion: The proposed method significantly improved the analysis accuracy and identified superior gene-gene interactions and genotypes for each of the four economic traits of Hanwoo.

  10. Sequential sentinel SNP Regional Association Plots (SSS-RAP): an approach for testing independence of SNP association signals using meta-analysis data.

    Science.gov (United States)

    Zheng, Jie; Gaunt, Tom R; Day, Ian N M

    2013-01-01

    Genome-Wide Association Studies (GWAS) frequently incorporate meta-analysis within their framework. However, conditional analysis of individual-level data, which is an established approach for fine mapping of causal sites, is often precluded where only group-level summary data are available for analysis. Here, we present a numerical and graphical approach, "sequential sentinel SNP regional association plot" (SSS-RAP), which estimates regression coefficients (beta) with their standard errors using the meta-analysis summary results directly. Under an additive model, typical for genes with small effect, the effect for a sentinel SNP can be transformed to the predicted effect for a possibly dependent SNP through a 2×2 2-SNP haplotypes table. The approach assumes Hardy-Weinberg equilibrium for test SNPs. SSS-RAP is available as a Web-tool (http://apps.biocompute.org.uk/sssrap/sssrap.cgi). To develop and illustrate SSS-RAP we analyzed lipid and ECG traits data from the British Women's Heart and Health Study (BWHHS), evaluated a meta-analysis for ECG trait and presented several simulations. We compared results with existing approaches such as model selection methods and conditional analysis. Generally findings were consistent. SSS-RAP represents a tool for testing independence of SNP association signals using meta-analysis data, and is also a convenient approach based on biological principles for fine mapping in group level summary data. © 2012 Blackwell Publishing Ltd/University College London.
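
    The transformation from a sentinel SNP effect to a predicted effect at a correlated SNP can be sketched with a standard linkage disequilibrium relationship. This is not claimed to be the exact SSS-RAP computation: it assumes a single causal sentinel SNP, an additive model, Hardy-Weinberg equilibrium, and a known 2×2 haplotype frequency table. The function name `predicted_beta` and the example frequencies are hypothetical. Under these assumptions the marginal effect at the second SNP is beta_A × D / (p_B (1 - p_B)), where D is the haplotype-based disequilibrium coefficient.

```python
def predicted_beta(beta_sentinel, hap):
    """Predict the marginal per-allele effect at SNP B from sentinel SNP A.

    hap: 2x2 haplotype frequencies [[h_AB, h_Ab], [h_aB, h_ab]] summing to 1,
         where A and B denote the effect alleles at the two SNPs.
    Assumes SNP A is the only causal variant (illustrative simplification).
    """
    (h_AB, h_Ab), (h_aB, h_ab) = hap
    if abs(h_AB + h_Ab + h_aB + h_ab - 1.0) > 1e-9:
        raise ValueError("haplotype frequencies must sum to 1")
    p_A = h_AB + h_Ab            # effect-allele frequency at the sentinel SNP
    p_B = h_AB + h_aB            # effect-allele frequency at the test SNP
    if not 0.0 < p_B < 1.0:
        raise ValueError("test SNP must be polymorphic")
    D = h_AB - p_A * p_B         # linkage disequilibrium coefficient
    return beta_sentinel * D / (p_B * (1.0 - p_B))


# Hypothetical tables: complete LD, then linkage equilibrium.
same_signal = predicted_beta(0.5, [[0.3, 0.0], [0.0, 0.7]])
independent = predicted_beta(0.5, [[0.06, 0.14], [0.24, 0.56]])
```

    In this sketch a test SNP in complete LD with the sentinel inherits the full effect, while a SNP in linkage equilibrium is predicted to show none. An observed effect that departs clearly from the prediction would then hint at an independent signal.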

  11. Effect of imputing markers from a low-density chip on the reliability of genomic breeding values in Holstein populations

    DEFF Research Database (Denmark)

    Dassonneville, R; Brøndum, Rasmus Froberg; Druet, T

    2011-01-01

    The purpose of this study was to investigate the imputation error and loss of reliability of direct genomic values (DGV) or genomically enhanced breeding values (GEBV) when using genotypes imputed from a 3,000-marker single nucleotide polymorphism (SNP) panel to a 50,000-marker SNP panel. Data...... of missing markers and prediction of breeding values were performed using 2 different reference populations in each country: either a national reference population or a combined EuroGenomics reference population. Validation for accuracy of imputation and genomic prediction was done based on national test...... with a national reference data set gave an absolute loss of 0.05 in mean reliability of GEBV in the French study, whereas a loss of 0.03 was obtained for reliability of DGV in the Nordic study. When genotypes were imputed using the EuroGenomics reference, a loss of 0.02 in mean reliability of GEBV was detected...

  12. Partitioned learning of deep Boltzmann machines for SNP data.

    Science.gov (United States)

    Hess, Moritz; Lenz, Stefan; Blätte, Tamara J; Bullinger, Lars; Binder, Harald

    2017-10-15

    Learning the joint distributions of measurements, and in particular identification of an appropriate low-dimensional manifold, has been found to be a powerful ingredient of deep learning approaches. Yet, such approaches have hardly been applied to single nucleotide polymorphism (SNP) data, probably due to the high number of features typically exceeding the number of studied individuals. After a brief overview of how deep Boltzmann machines (DBMs), a deep learning approach, can be adapted to SNP data in principle, we specifically present a way to alleviate the dimensionality problem by partitioned learning. We propose a sparse regression approach to coarsely screen the joint distribution of SNPs, followed by training several DBMs on SNP partitions that were identified by the screening. Aggregate features representing SNP patterns and the corresponding SNPs are extracted from the DBMs by a combination of statistical tests and sparse regression. In simulated case-control data, we show how this can uncover complex SNP patterns and augment results from univariate approaches, while maintaining type 1 error control. Time-to-event endpoints are considered in an application with acute myeloid leukemia patients, where SNP patterns are modeled after a pre-screening based on gene expression data. The proposed approach identified three SNPs that seem to jointly influence survival in a validation dataset. This indicates the added value of jointly investigating SNPs compared to standard univariate analyses and makes partitioned learning of DBMs an interesting complementary approach when analyzing SNP data. A Julia package is provided at 'http://github.com/binderh/BoltzmannMachines.jl'. binderh@imbi.uni-freiburg.de. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  13. The AA genotype of the regulatory BCL2 promoter polymorphism (-938C>A) is associated with a favorable outcome in lymph node negative invasive breast cancer patients.

    Science.gov (United States)

    Bachmann, Hagen S; Otterbach, Friedrich; Callies, Rainer; Nückel, Holger; Bau, Maja; Schmid, Kurt W; Siffert, Winfried; Kimmig, Rainer

    2007-10-01

    Expression of the antiapoptotic and antiproliferative protein Bcl-2 has been repeatedly shown to be associated with better clinical outcome in breast cancer. We recently showed a novel regulatory (-938C>A) single-nucleotide polymorphism (SNP) in the inhibitory P2 BCL2 gene promoter generating significantly different BCL2 promoter activities. Paraffin-embedded neoplastic and nonneoplastic tissues from 274 patients (161 still alive after a follow-up period of at least 80 months) with primary unilateral invasive breast carcinoma were investigated. Bcl-2 expression of tumor cells was shown by immunohistochemistry; nonneoplastic tissues were used for genotyping. Both the Bcl-2 expression and the (-938C>A) genotypes were correlated with the patients' survival. Kaplan-Meier curves revealed a significant association of the AA genotype with increased survival (P = 0.030) in lymph node-negative breast cancer patients, whereas no genotype effect could be observed in lymph node-positive cases. Ten-year survival rates were 88.6% for the AA genotype, 78.4% for the AC genotype, and 65.8% for the CC genotype. Multivariable Cox regression identified the BCL2 (-938CC) genotype as an independent prognostic factor for cancer-related death in lymph node-negative breast carcinoma patients (hazard ratio, 3.59; P = 0.032). Immunohistochemical Bcl-2 expression was significantly associated with the clinical outcome of lymph node-positive but not of lymph node-negative breast cancer patients. In lymph node-negative cases, the (-938C>A) SNP was both significantly related with the immunohistochemically determined level of Bcl-2 expression (P = 0.044) and the survival of patients with Bcl-2-expressing carcinomas (P = 0.006). These results suggest the (-938C>A) polymorphism as a survival prognosticator as well as indicator of a high-risk group within patients with lymph node-negative breast cancer.

  14. A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing

    Directory of Open Access Journals (Sweden)

    Guangtu Gao

    2018-04-01

    Full Text Available Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout (Oncorhynchus mykiss), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were doubled haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1), which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup, followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the doubled haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the 29 chromosomes of the rainbow trout genome and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-population polymorphism assessment revealed high level of polymorphism and

  15. Genome wide in silico SNP-tumor association analysis

    International Nuclear Information System (INIS)

    Qiu, Ping; Wang, Luquan; Kostich, Mitch; Ding, Wei; Simon, Jason S; Greene, Jonathan R

    2004-01-01

    Carcinogenesis occurs, at least in part, due to the accumulation of mutations in critical genes that control the mechanisms of cell proliferation, differentiation and death. Publicly accessible databases contain millions of expressed sequence tag (EST) and single nucleotide polymorphism (SNP) records, which have the potential to assist in the identification of SNPs overrepresented in tumor tissue. An in silico SNP-tumor association study was performed utilizing tissue library and SNP information available in NCBI's dbEST (release 092002) and dbSNP (build 106). A total of 4865 SNPs were identified which were present at higher allele frequencies in tumor compared to normal tissues. A subset of 327 (6.7%) SNPs induce amino acid changes to the protein coding sequences. This approach identified several SNPs which have been previously associated with carcinogenesis, as well as a number of SNPs that now warrant further investigation. This novel in silico approach can assist in prioritization of genes and SNPs in the effort to elucidate the genetic mechanisms underlying the development of cancer.

  16. Exploring meat quality of pig through the application of molecular genetics

    DEFF Research Database (Denmark)

    Zhang, Lu

    association mapping, expression study and genetic variation analysis. Many state-of-the-art technologies including high-density SNP chip genotyping, high-throughput re-sequencing and strand-specific RNA-Seq were employed. Numerous quantitative trait loci, candidate genes and a segmental duplication associated...

  17. RASSF1A and the rs2073498 Cancer Associated SNP

    International Nuclear Information System (INIS)

    Donninger, Howard; Barnoud, Thibaut; Nelson, Nick; Kassler, Suzanna; Clark, Jennifer; Cummins, Timothy D.; Powell, David W.; Nyante, Sarah; Millikan, Robert C.; Clark, Geoffrey J.

    2011-01-01

    RASSF1A is one of the most frequently inactivated tumor suppressors yet identified in human cancer. It is pro-apoptotic and appears to function as a scaffolding protein that interacts with a variety of other tumor suppressors to modulate their function. It can also complex with the Ras oncoprotein and may serve to integrate pro-growth and pro-death signaling pathways. A SNP has been identified that is present in approximately 29% of European populations [rs2073498, A(133)S]. Several studies have now presented evidence that this SNP is associated with an enhanced risk of developing breast cancer. We have used a proteomics based approach to identify multiple differences in the pattern of protein/protein interactions mediated by the wild type compared to the SNP variant protein. We have also identified a significant difference in biological activity between wild type and SNP variant protein. However, we have found only a very modest association of the SNP with breast cancer predisposition.

  18. A single-tube 27-plex SNP assay for estimating individual ancestry and admixture from three continents.

    Science.gov (United States)

    Wei, Yi-Liang; Wei, Li; Zhao, Lei; Sun, Qi-Fan; Jiang, Li; Zhang, Tao; Liu, Hai-Bo; Chen, Jian-Gang; Ye, Jian; Hu, Lan; Li, Cai-Xia

    2016-01-01

    A single-tube multiplex assay of a small set of ancestry-informative markers (AIMs) for effectively estimating individual ancestry and admixture is an ideal forensic tool to trace the population origin of an unknown DNA sample. We present a newly developed 27-plex single nucleotide polymorphism (SNP) panel with highly robust and balanced differential power to perfectly assign individuals to African, European, and East Asian ancestries. Evaluating 968 previously described intercontinental AIMs from three HapMap population genotyping datasets (Yoruban in Ibadan, Nigeria (YRI); Utah residents with Northern and Western European ancestry from the Centre d'Etude du Polymorphisme Humain (CEPH) collection (CEU); and Han Chinese in Beijing, China (CHB)), the best set of markers was selected on the basis of Hardy-Weinberg equilibrium (p > 0.00001), population-specific allele frequency (two of three δ values >0.5), according to linkage disequilibrium (r (2) ancestry of the 11 populations in the HapMap project. Then, we tested the 27-plex SNP assay with 1164 individuals from 17 additional populations. The results demonstrated that the SNP panel was successful for ancestry inference of individuals with African, European, and East Asian ancestry. Furthermore, the system performed well when inferring the admixture of Eurasians (EUR/EAS) after analyzing admixed populations from Xinjiang (Central Asian) as follows: Tajik (68:27), Uyghur (49:46), Kirgiz (40:57), and Kazak (36:60). For individual analyses, we interpreted each sample with a three-ancestry component percentage and a population match probability sequence. This multiplex assay is a convenient and cost-effective tool to assist in criminal investigations, as well as to correct for the effects of population stratification for case-control studies.
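
    The allele-frequency criterion quoted in the abstract (two of three δ values >0.5) can be sketched as follows, taking δ to be the absolute difference in the frequency of one allele between two populations. This is a minimal illustration and not the authors' pipeline: the function name `passes_delta_filter` and the example frequencies are hypothetical, and the Hardy-Weinberg and linkage disequilibrium filters are omitted.

```python
from itertools import combinations


def passes_delta_filter(freqs, threshold=0.5, min_pairs=2):
    """Check whether a marker is ancestry-informative by pairwise delta.

    freqs: frequency of the same allele in each population,
           e.g. {"YRI": 0.95, "CEU": 0.10, "CHB": 0.15}.
    Returns True when at least `min_pairs` pairwise absolute frequency
    differences exceed `threshold`.
    """
    deltas = [abs(freqs[a] - freqs[b]) for a, b in combinations(sorted(freqs), 2)]
    return sum(d > threshold for d in deltas) >= min_pairs


# Hypothetical allele frequencies for two candidate markers.
informative = passes_delta_filter({"YRI": 0.95, "CEU": 0.10, "CHB": 0.15})
uninformative = passes_delta_filter({"YRI": 0.50, "CEU": 0.40, "CHB": 0.45})
```

    A marker that passes with two large δ values separates one population from the other two, so a balanced panel would need markers of this kind for each of the three ancestries.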

  19. Genomic divergence of zebu and taurine cattle identified through high-density SNP genotyping

    Science.gov (United States)

    Natural selection has molded the evolution across all taxa. At an arguable date of around 330,000 years ago there were already at least two different types of cattle that became ancestors of nearly all modern cattle, the Bos primigenius taurus more adapted to temperate climates and the tropically ad...

  20. Genomic divergence of indicine and taurine cattle identified through high-density SNP genotyping

    Science.gov (United States)

    At an arguable date of around 330,000 years ago there were already at least two different types of cattle that became ancestors of nearly all modern cattle, the Bos primigenius taurus more adapted to temperate climates and the tropically adapted Bos primigenius indicus. Human selection exponentially...

  1. AHSG gene polymorphisms are associated with bone mineral density in Caucasian nuclear families

    International Nuclear Information System (INIS)

    Yang Yanjun; Wang Yanbo; Lei Shufeng; Long Jirong; Shen Hui; Zhao Lanjuan; Jiang Deke; Xiao Sumei; Chen Xiangding; Chen Yuan; Deng Hongwen

    2007-01-01

    Purpose. To investigate the role of the alpha2-HS glycoprotein (AHSG) gene in bone mineral density (BMD) variation. Methods. A total of 665 subjects from 157 Caucasian nuclear families were genotyped at the AHSG NlaIII and SacI sites. The association and linkage between the single SNP markers, and the haplotypes constructed from the two markers in this gene, and BMD at the spine and hip were determined using the quantitative transmission disequilibrium test (QTDT). Results. Significant within-family associations were obtained for spine BMD at both studied markers (P = 0.036 and 0.005 at the NlaIII and SacI sites, respectively). Significant total associations with spine BMD were also detected (P = 0.008 at the NlaIII locus and P = 0.004 at the SacI locus). Haplotype analyses confirmed the within-family and total associations. Conclusions. These data suggest that polymorphisms in the AHSG gene may affect BMD variation in the Caucasian population.

  2. Single Nucleotide Polymorphisms in Taste Receptor Genes Are Associated with Snacking Patterns of Preschool-Aged Children in the Guelph Family Health Study: A Pilot Study

    Directory of Open Access Journals (Sweden)

    Elie Chamoun

    2018-01-01

    Full Text Available Snacking is an integral component of eating habits in young children that is often overlooked in nutrition research. While snacking is a substantial source of calories in preschoolers’ diets, there is limited knowledge about the factors that drive snacking patterns. The genetics of taste may help to better understand the snacking patterns of children. The rs1761667 single nucleotide polymorphism (SNP) in the CD36 gene has been linked to fat taste sensitivity, the rs35874116 SNP in the TAS1R2 gene has been related to sweet taste preference, and the rs713598 SNP in the TAS2R38 gene has been associated with aversion to bitter, green leafy vegetables. This study seeks to determine the cross-sectional associations between three taste receptor SNPs and snacking patterns among preschoolers in the Guelph Family Health Study. Preschoolers’ snack quality, quantity, and frequency were assessed using three-day food records and saliva was collected for SNP genotyping (n = 47). Children with the TT genotype in TAS1R2 consumed snacks with significantly more calories from sugar, and these snacks were consumed mostly in the evening. Total energy density of snacks was highest in the CC and CG genotypes compared to the GG genotype in TAS2R38, and also greater in the AA genotype in CD36 compared to G allele carriers; however, this difference was not individually attributable to energy from fat, carbohydrates, sugar, or protein. Genetic variation in taste receptors may influence snacking patterns of preschoolers.

  3. Glycine and a glycine dehydrogenase (GLDC) SNP as citalopram/escitalopram response biomarkers in depression: pharmacometabolomics-informed pharmacogenomics.

    Science.gov (United States)

    Ji, Y; Hebbring, S; Zhu, H; Jenkins, G D; Biernacka, J; Snyder, K; Drews, M; Fiehn, O; Zeng, Z; Schaid, D; Mrazek, D A; Kaddurah-Daouk, R; Weinshilboum, R M

    2011-01-01

    Major depressive disorder (MDD) is a common psychiatric disease. Selective serotonin reuptake inhibitors (SSRIs) are an important class of drugs used in the treatment of MDD. However, many patients do not respond adequately to SSRI therapy. We used a pharmacometabolomics-informed pharmacogenomic research strategy to identify citalopram/escitalopram treatment outcome biomarkers. Metabolomic assay of plasma samples from 20 escitalopram remitters and 20 nonremitters showed that glycine was negatively associated with treatment outcome (P = 0.0054). This observation was pursued by genotyping tag single-nucleotide polymorphisms (SNPs) for genes encoding glycine synthesis and degradation enzymes, using 529 DNA samples from SSRI-treated MDD patients. The rs10975641 SNP in the glycine dehydrogenase (GLDC) gene was associated with treatment outcome phenotypes. Genotyping for rs10975641 was carried out in 1,245 MDD patients in the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) study, and its presence was significant (P = 0.02) in DNA taken from these patients. These results highlight a possible role for glycine in SSRI response and illustrate the use of pharmacometabolomics to "inform" pharmacogenomics.

  4. MiR-2964a-5p binding site SNP regulates ATM expression contributing to age-related cataract risk.

    Science.gov (United States)

    Rong, Han; Gu, Shanshan; Zhang, Guowei; Kang, Lihua; Yang, Mei; Zhang, Junfang; Shen, Xinyue; Guan, Huaijin

    2017-10-17

    This study explored the involvement of DNA repair genes in the pathogenesis of age-related cataract (ARC). We genotyped nine single nucleotide polymorphisms (SNPs) in genes responsible for DNA double-strand break (DSB) repair in 804 ARC cases and 804 controls in a cohort of eye diseases in a Chinese population and found that the ataxia telangiectasia mutated (ATM) gene-rs4585:G>T was significantly associated with ARC risk. An in vitro functional test found that miR-2964a-5p specifically down-regulated luciferase reporter expression and ATM expression in the cell lines transfected with the rs4585 T allele compared to the rs4585 G allele. The molecular assay on human tissue samples discovered that ATM expression was down-regulated in the majority of ARC tissues and correlated with ATM genotypes. In addition, the Comet assay of cellular DNA damage of peripheral lymphocytes indicated that individuals carrying the G allele (GG/GT) of ATM-rs4585 had fewer DNA breaks compared to individuals with the TT genotype. These findings suggested that the SNP rs4585 in ATM might affect ARC risk through modulating the regulatory affinity of miR-2964a-5p. Reduced DSB repair might be involved in ARC pathogenesis.

  5. Desenvolvimento de afilhos e componentes do rendimento em genótipos de trigo sob diferentes densidades de semeadura [Tiller development and yield components in wheat genotypes under different seeding densities]

    Directory of Open Access Journals (Sweden)

    Igor Pirez Valério

    2008-03-01

    Full Text Available The objective of this work was to determine the developmental pattern of tillers in wheat genotypes with contrasting tillering, under different seeding densities, and its effects on grain yield components. The experiment was performed in Capão do Leão County, Rio Grande do Sul State, Brazil, in 2006. Ten wheat cultivars, selected for their tillering ability, were tested in a split-plot design, where plots were formed by the genotype (cultivar) and subplots by the seeding densities: 50, 200, 350, 500 and 650 viable seeds per square meter. Tiller senescence was directly related to the tillering potential of the genotypes. Genotypes with high tillering ability showed a more pronounced senescence effect as seeding density increased. The best adjustment of yield components was obtained through an adequate seeding density, which should be chosen based on the tillering potential of the genotypes. In addition, the compensatory effect of genotypes with reduced tillering potential results from greater grain mass per spike, at the expense of the number of spikes per unit area.

  6. Genotyping single spore isolates of a Pasteuria penetrans population occurring in Florida using SNP-based markers

    Science.gov (United States)

    The aim of this study was to examine genotypic variation and virulence characteristics of a population of bacterial parasite of root-knot nematode (RKN), Pasteuria penetrans, isolated from Florida. Six single spore lines (ssp), 16ssp, 17ssp, 18ssp, 25ssp, 26ssp, and 30ssp were generated by infecting...

  7. SNP-based typing: a useful tool to study Bordetella pertussis populations.

    Directory of Open Access Journals (Sweden)

    Marjolein van Gent

    Full Text Available To monitor changes in Bordetella pertussis populations, mainly two typing methods are used: Pulsed-Field Gel Electrophoresis (PFGE) and Multiple-Locus Variable-Number Tandem Repeat Analysis (MLVA). In this study, a single nucleotide polymorphism (SNP) typing method, based on 87 SNPs, was developed and compared with PFGE and MLVA. The discriminatory indices of SNP typing, PFGE and MLVA were found to be 0.85, 0.95 and 0.83, respectively. Phylogenetic analysis, using SNP typing as Gold Standard, revealed false homoplasies in the PFGE and MLVA trees. Further, in contrast to the SNP-based tree, the PFGE- and MLVA-based trees did not reveal a positive correlation between root-to-tip distance and the isolation year of strains. Thus PFGE and MLVA do not allow an estimation of the relative age of the selected strains. In conclusion, SNP typing was found to be phylogenetically more informative than PFGE and more discriminative than MLVA. Further, in contrast to PFGE, it is readily standardized allowing interlaboratory comparisons. We applied SNP typing to study strains with a novel allele for the pertussis toxin promoter, ptxP3, which have a worldwide distribution and which have replaced the resident ptxP1 strains in the last 20 years. Previously, we showed that ptxP3 strains showed increased pertussis toxin expression and that their emergence was associated with increased notification in The Netherlands. SNP typing showed that the ptxP3 strains isolated in the Americas, Asia, Australia and Europe formed a monophyletic branch which recently diverged from ptxP1 strains. Two predominant ptxP3 SNP types were identified which spread worldwide. The widespread use of SNP typing will enhance our understanding of the evolution and global epidemiology of B. pertussis.

  8. SNP-Based Typing: A Useful Tool to Study Bordetella pertussis Populations

    Science.gov (United States)

    van der Heide, Han G. J.; Heuvelman, Kees J.; Kallonen, Teemu; He, Qiushui; Mertsola, Jussi; Advani, Abdolreza; Hallander, Hans O.; Janssens, Koen; Hermans, Peter W.; Mooi, Frits R.

    2011-01-01

    To monitor changes in Bordetella pertussis populations, mainly two typing methods are used; Pulsed-Field Gel Electrophoresis (PFGE) and Multiple-Locus Variable-Number Tandem Repeat Analysis (MLVA). In this study, a single nucleotide polymorphism (SNP) typing method, based on 87 SNPs, was developed and compared with PFGE and MLVA. The discriminatory indices of SNP typing, PFGE and MLVA were found to be 0.85, 0.95 and 0.83, respectively. Phylogenetic analysis, using SNP typing as Gold Standard, revealed false homoplasies in the PFGE and MLVA trees. Further, in contrast to the SNP-based tree, the PFGE- and MLVA-based trees did not reveal a positive correlation between root-to-tip distance and the isolation year of strains. Thus PFGE and MLVA do not allow an estimation of the relative age of the selected strains. In conclusion, SNP typing was found to be phylogenetically more informative than PFGE and more discriminative than MLVA. Further, in contrast to PFGE, it is readily standardized allowing interlaboratory comparisons. We applied SNP typing to study strains with a novel allele for the pertussis toxin promoter, ptxP3, which have a worldwide distribution and which have replaced the resident ptxP1 strains in the last 20 years. Previously, we showed that ptxP3 strains showed increased pertussis toxin expression and that their emergence was associated with increased notification in the Netherlands. SNP typing showed that the ptxP3 strains isolated in the Americas, Asia, Australia and Europe formed a monophyletic branch which recently diverged from ptxP1 strains. Two predominant ptxP3 SNP types were identified which spread worldwide. The widespread use of SNP typing will enhance our understanding of the evolution and global epidemiology of B. pertussis. PMID:21647370
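
    The discriminatory indices reported above (0.85, 0.95 and 0.83) are the kind of figure usually obtained with the Hunter-Gaston index, i.e. the probability that two isolates drawn at random without replacement belong to different types. Whether the authors used exactly this formulation is an assumption here; the sketch below only shows how such an index is computed, and the isolate labels are hypothetical.

```python
from collections import Counter

def discriminatory_index(types):
    """Hunter-Gaston index: chance that two isolates sampled without
    replacement have different types (0 = no discrimination, 1 = all unique)."""
    n = len(types)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(types).values()
    # Number of ordered pairs of isolates that share a type.
    same_type_pairs = sum(c * (c - 1) for c in counts)
    return 1.0 - same_type_pairs / (n * (n - 1))

# Hypothetical type assignments for ten isolates (invented for illustration).
isolates = ["T1", "T1", "T1", "T2", "T2", "T3", "T4", "T4", "T5", "T6"]
print(round(discriminatory_index(isolates), 3))
```

    A method that splits the same isolates into more, smaller groups yields a higher index, which is the sense in which one typing method is called more discriminative than another.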

  9. A SNP Harvester Analysis to Better Detect SNPs of CCDC158 Gene That Are Associated with Carcass Quality Traits in Hanwoo

    Directory of Open Access Journals (Sweden)

    Jea-Young Lee

    2013-06-01

    Full Text Available The purpose of this study was to investigate interaction effects of genes using a Harvester method. A sample of Korean cattle, Hanwoo (n = 476), sired by 50 Korean proven bulls, was chosen from the National Livestock Research Institute (NLRI) of Korea. The steers were born between the spring of 1998 and the autumn of 2002 and reared under a progeny-testing program at the Daekwanryeong and Namwon branches of NLRI. The steers were slaughtered at approximately 24 months of age and carcass quality traits were measured. A SNP Harvester method was applied with a support vector machine (SVM) to detect significant SNPs in the CCDC158 gene and interaction effects between the SNPs that were associated with average daily gains, cold carcass weight, longissimus dorsi muscle area, and marbling scores. The statistical significance of the major SNP combinations was evaluated with χ²-statistics. The genotype combinations of three SNPs, g.34425+102A>T (AA), g.4102636T>G (GT), and g.11614+19G>T (GG), had a greater effect than the rest of the SNP combinations, e.g. 0.82 vs. 0.75 kg, 343 vs. 314 kg, 80.4 vs. 74.7 cm², and 7.35 vs. 5.01, for the four respective traits (p<0.001). Also, the estimates were greater compared with the single SNPs analyzed (the greatest estimates were 0.76 kg, 320 kg, 75.5 cm², and 5.31, respectively). This result suggests that the SNP Harvester method is a good option when multiple SNPs and interaction effects are tested. The significant SNPs could be applied to improve meat quality of Hanwoo via marker-assisted selection.

  10. A Larger Chocolate Chip—Development of a 15K Theobroma cacao L. SNP Array to Create High-Density Linkage Maps

    Directory of Open Access Journals (Sweden)

    Donald Livingstone

    2017-12-01

    Full Text Available Cacao (Theobroma cacao L.) is an important cash crop in tropical regions around the world and has a rich agronomic history in South America. As a key component in the cosmetic and confectionery industries, millions of people worldwide use products made from cacao, ranging from shampoo to chocolate. An Illumina Infinity II array was created using 13,530 SNPs identified within a small diversity panel of cacao. Of these SNPs, 12,643 derive from variation within annotated cacao genes. The genotypes of 3,072 trees were obtained, including two mapping populations from Ecuador. High-density linkage maps for these two populations were generated and compared to the cacao genome assembly. Phenotypic data from these populations were combined with the linkage maps to identify the QTLs for yield and disease resistance.

  11. A Larger Chocolate Chip-Development of a 15K Theobroma cacao L. SNP Array to Create High-Density Linkage Maps.

    Science.gov (United States)

    Livingstone, Donald; Stack, Conrad; Mustiga, Guiliana M; Rodezno, Dayana C; Suarez, Carmen; Amores, Freddy; Feltus, Frank A; Mockaitis, Keithanne; Cornejo, Omar E; Motamayor, Juan C

    2017-01-01

    Cacao (Theobroma cacao L.) is an important cash crop in tropical regions around the world and has a rich agronomic history in South America. As a key component in the cosmetic and confectionery industries, millions of people worldwide use products made from cacao, ranging from shampoo to chocolate. An Illumina Infinity II array was created using 13,530 SNPs identified within a small diversity panel of cacao. Of these SNPs, 12,643 derive from variation within annotated cacao genes. The genotypes of 3,072 trees were obtained, including two mapping populations from Ecuador. High-density linkage maps for these two populations were generated and compared to the cacao genome assembly. Phenotypic data from these populations were combined with the linkage maps to identify the QTLs for yield and disease resistance.

  12. Family-based multi-SNP X chromosome analysis using parental information

    Directory of Open Access Journals (Sweden)

    Alison S. Wise

    2016-02-01

    Full Text Available We propose a method for association analysis of haplotypes on the X chromosome that offers both improved power and robustness to population stratification in studies of affected offspring and their parents if all three have been genotyped. The method makes use of assumed parental haplotype exchangeability, a weaker assumption than Hardy-Weinberg equilibrium. Parental haplotype exchangeability requires that in the source population, of the three X chromosome haplotypes carried by the two parents, each is equally likely to be carried by the father. We propose a pseudo-sibling approach that exploits that exchangeability assumption. Our method extends the single-SNP PIX-LRT method to multiple SNPs in a high linkage block. We describe methods for testing the parental haplotype exchangeability assumption and also for determining how apparent violations can be distinguished from true fetal effects or maternally-mediated effects. We show results of simulations that demonstrate nominal type I error rate and good power. The methods are then applied to dbGaP data on the birth defect oral cleft, using both Asian and Caucasian families with cleft.

  13. A SNP resource for studying North American moose [version 1; referees: 2 approved, 1 approved with reservations]

    Directory of Open Access Journals (Sweden)

    Theodore S. Kalbfleisch

    2018-01-01

    Full Text Available Background: Moose (Alces alces) colonized the North American continent from Asia less than 15,000 years ago, and spread across the boreal forest regions of Canada and the northern United States (US). Contemporary populations have low genetic diversity, due either to a low number of individuals in the original migration (founder effect), and/or subsequent population bottlenecks in North America. Genetic tests based on informative single nucleotide polymorphism (SNP) markers are helpful in forensic and wildlife conservation activities, but have been difficult to develop for moose, due to the lack of a reference genome assembly and whole genome sequence (WGS) data. Methods: WGS data were generated for four individual moose from the US states of Alaska, Idaho, Wyoming, and Vermont with minimum and average genome coverage depths of 14- and 19-fold, respectively. Cattle and sheep reference genomes were used for aligning sequence reads and identifying moose SNPs. Results: Approximately 11% and 9% of moose WGS reads aligned to cattle and sheep genomes, respectively. The reads clustered at genomic segments, where sequence identity between these species was greater than 95%. In these segments, average mapped read depth was approximately 19-fold. Sets of 46,005 and 36,934 high-confidence SNPs were identified from cattle and sheep comparisons, respectively, with 773 and 552 of those having minor allele frequency of 0.5 and conserved flanking sequences in all three species. Among the four moose, heterozygosity and allele sharing of SNP genotypes were consistent with decreasing levels of moose genetic diversity from west to east. A minimum set of 317 SNPs, informative across all four moose, was selected as a resource for future SNP assay design. Conclusions: All SNPs and associated information are available, without restriction, to support development of SNP-based tests for animal identification, parentage determination, and estimating
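
    As a worked illustration of the minor allele frequency of 0.5 mentioned in this record: four diploid animals carry eight allele copies at a SNP, so a frequency of 0.5 corresponds to four copies of each allele. The sketch below computes minor allele frequency and observed heterozygosity from per-animal genotypes; the genotype strings and function names are hypothetical and do not come from the moose dataset.

```python
from collections import Counter

def minor_allele_frequency(genotypes):
    """MAF for a biallelic SNP; each genotype is a 2-character string such as 'AG'."""
    alleles = Counter("".join(genotypes))
    if len(alleles) > 2:
        raise ValueError("expected a biallelic SNP")
    if len(alleles) < 2:
        return 0.0  # monomorphic in this sample
    return min(alleles.values()) / sum(alleles.values())

def observed_heterozygosity(genotypes):
    """Fraction of individuals carrying two different alleles."""
    return sum(g[0] != g[1] for g in genotypes) / len(genotypes)

# Hypothetical genotypes for four animals at one SNP (invented for illustration).
genotypes = ["AA", "AG", "AG", "GG"]
print(minor_allele_frequency(genotypes))   # 4 of the 8 allele copies are G
print(observed_heterozygosity(genotypes))  # 2 of the 4 animals are heterozygous
```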

  14. Learning gene networks under SNP perturbations using eQTL datasets.

    Directory of Open Access Journals (Sweden)

    Lingxue Zhang

    2014-02-01

    Full Text Available The standard approach for identifying gene networks is based on experimental perturbations of gene regulatory systems such as gene knock-out experiments, followed by a genome-wide profiling of differential gene expressions. However, this approach is significantly limited in that it is not possible to perturb more than one or two genes simultaneously to discover complex gene interactions or to distinguish between direct and indirect downstream regulations of the differentially-expressed genes. As an alternative, genetical genomics study has been proposed to treat naturally-occurring genetic variants as potential perturbants of gene regulatory system and to recover gene networks via analysis of population gene-expression and genotype data. Despite many advantages of genetical genomics data analysis, the computational challenge that the effects of multifactorial genetic perturbations should be decoded simultaneously from data has prevented a widespread application of genetical genomics analysis. In this article, we propose a statistical framework for learning gene networks that overcomes the limitations of experimental perturbation methods and addresses the challenges of genetical genomics analysis. We introduce a new statistical model, called a sparse conditional Gaussian graphical model, and describe an efficient learning algorithm that simultaneously decodes the perturbations of gene regulatory system by a large number of SNPs to identify a gene network along with expression quantitative trait loci (eQTLs) that perturb this network. While our statistical model captures direct genetic perturbations of gene network, by performing inference on the probabilistic graphical model, we obtain detailed characterizations of how the direct SNP perturbation effects propagate through the gene network to perturb other genes indirectly. We demonstrate our statistical method using HapMap-simulated and yeast eQTL datasets. In particular, the yeast gene network

  15. Genetic Architecture of Capitate Glandular Trichome Density in Florets of Domesticated Sunflower (Helianthus annuus L.).

    Science.gov (United States)

    Gao, Qing-Ming; Kane, Nolan C; Hulke, Brent S; Reinert, Stephan; Pogoda, Cloe S; Tittes, Silas; Prasifka, Jarrad R

    2017-01-01

    Capitate glandular trichomes (CGT), one type of glandular trichome, are most common in Asteraceae species. CGT can produce various secondary metabolites such as sesquiterpene lactones (STLs) and provide durable resistance to insect pests. In sunflower, CGT-based host resistance is effective to combat the specialist pest, sunflower moth. However, the genetic basis of CGT density is not well understood in sunflower. In this study, we identified two major QTL controlling CGT density in sunflower florets by using an F4 mapping population derived from the cross HA 300 × RHA 464 with a genetic linkage map constructed from genotyping-by-sequencing data and composed of 2121 SNP markers. One major QTL is located on chromosome 5, which explained 11.61% of the observed phenotypic variation, and the second QTL is located on chromosome 6, which explained 14.06% of the observed phenotypic variation. The QTL effects and the association between CGT density and QTL support interval were confirmed in a validation population which included 39 sunflower inbred lines with diverse genetic backgrounds. We also identified two strong candidate genes in the QTL support intervals, and the functions of their orthologs in other plant species suggested their potential roles in regulating capitate glandular trichome density in sunflower. Our results provide valuable information to the sunflower breeding community for developing host resistance to sunflower insect pests.

  16. Genome-wide linkage analysis of QTL for growth and body composition employing the PorcineSNP60 BeadChip

    Directory of Open Access Journals (Sweden)

    Fernández Ana I

    2012-05-01

    Full Text Available Background: The traditional strategy to map QTL is to use linkage analysis employing a limited number of markers. These analyses report wide QTL confidence intervals, making it very difficult to identify the gene and polymorphisms underlying the QTL effects. The arrival of genome-wide panels of SNPs makes available thousands of markers, increasing the information content and therefore the likelihood of detecting and fine mapping QTL regions. The aims of the current study are to confirm previous QTL regions for growth and body composition traits in different generations of an Iberian x Landrace intercross (IBMAP) and especially to identify new ones with narrow confidence intervals by employing the PorcineSNP60 BeadChip in linkage analyses. Results: Three generations (F3, Backcross 1 and Backcross 2) of the IBMAP and their related animals were genotyped with the PorcineSNP60 BeadChip. A total of 8,417 SNPs equidistantly distributed across autosomes were selected after filtering by quality, position and frequency to perform the QTL scan. The joint and separate analyses of the different IBMAP generations confirmed QTL regions previously identified in chromosomes 4 and 6 and revealed new ones, mainly for backfat thickness in chromosomes 4, 5, 11, 14 and 17 and shoulder weight in chromosomes 1, 2, 9 and 13, and many others at the chromosome-wide significance level. In addition, most of the detected QTLs displayed narrow confidence intervals, making the selection of positional candidate genes easier. Conclusions: The use of a higher density of markers has confirmed results obtained in previous QTL scans carried out with microsatellites. Moreover, several new QTL regions have now been identified in regions probably not covered by markers in previous scans; most of these QTLs displayed narrow confidence intervals. Finally, prominent putative biological and positional candidate genes underlying those QTL effects are listed based on recent porcine

  17. Performance and precision of double digestion RAD (ddRAD) genotyping in large multiplexed datasets of marine fish species

    DEFF Research Database (Denmark)

    Maroso, F.; Hillen, J E J; Pardo, B. G.

    2018-01-01

    ; (ii) the discrepancy between expected and observed tag length and coverage; (iii) the performances of reference based vs. de novo approaches; (iv) the sources of potential genotyping errors of the library preparation/bioinformatics protocol, by comparing technical replicates. Our results showed use...... a standardized protocol. A common bioinformatics pipeline based on STACKS was established, with and without the use of a reference genome. We performed analyses throughout the production and analysis of ddRAD data in order to explore (i) the loss of information due to heterogeneous raw read number across samples...... of downstream analysis carried out with ddRAD vs single SNP allele specific assay genotypes provided information about the levels of genotyping imprecision that can have a significant impact on allele frequency estimations and population assignment. The results and insights presented here will help to select...

  18. T.I.M.S: TaqMan Information Management System, tools to organize data flow in a genotyping laboratory

    Science.gov (United States)

    Monnier, Stéphanie; Cox, David G; Albion, Tim; Canzian, Federico

    2005-01-01

    Background: Single Nucleotide Polymorphism (SNP) genotyping is a major activity in biomedical research. The TaqMan technology is one of the most commonly used approaches. It produces large amounts of data that are difficult to process by hand. Laboratories not equipped with a Laboratory Information Management System (LIMS) need tools to organize the data flow. Results: We propose a package of Visual Basic programs focused on sample management and on the parsing of input and output TaqMan files. The code is written in Visual Basic, embedded in the Microsoft Office package, and it allows anyone to have access to those tools, without any programming skills and with basic computer requirements. Conclusion: We have created useful tools focused on management of TaqMan genotyping data, a critical issue in genotyping laboratories without a more sophisticated and expensive system, such as a LIMS. PMID:16221298

  19. Evaluation of the Ion Torrent™ HID SNP 169-plex

    DEFF Research Database (Denmark)

    Børsting, Claus; Fordyce, Sarah L; Olofsson, Jill Katharina

    2014-01-01

    The Ion Torrent™ HID SNP assay amplified 136 autosomal SNPs and 33 Y-chromosome markers in one PCR and the markers were subsequently typed using the Ion PGM™ second generation sequencing platform. A total of 51 of the autosomal SNPs were selected from the SNPforID panel that is routinely used...... in our ISO 17025 accredited laboratory. Concordance between the Ion Torrent™ HID SNP assay and the SNPforID assay was tested by typing 44 Iraqis twice with the Ion Torrent™ HID SNP assay. The same samples were previously typed with the SNPforID assay and the Y-chromosome haplogroups of the individuals...

  20. [Association analysis of SNP-63 and indel-19 variant in the calpain-10 gene with polycystic ovary syndrome in women of reproductive age].

    Science.gov (United States)

    Flores-Martínez, Silvia Esperanza; Castro-Martínez, Anna Gabriela; López-Quintero, Andrés; García-Zapién, Alejandra Guadalupe; Torres-Rodríguez, Ruth Noemí; Sánchez-Corona, José

    2015-01-01

    Polycystic ovary syndrome is a complex and heterogeneous disease involving both reproductive and metabolic problems. A genetic predisposition has been suggested in the etiology of this syndrome. The identification of the calpain-10 gene (CAPN10) as the first candidate gene for type 2 diabetes mellitus has focused interest on investigating its possible relation with polycystic ovary syndrome, because this syndrome is associated with hyperinsulinemia and insulin resistance, two metabolic abnormalities associated with type 2 diabetes mellitus. The aim was to investigate whether there is an association between SNP-63 and the indel-19 variant of the CAPN10 gene and polycystic ovary syndrome in women of reproductive age. This study included 101 women (55 with polycystic ovary syndrome and 46 without polycystic ovary syndrome). The indel-19 genetic variant was identified by electrophoresis of the PCR-amplified fragments, and SNP-63 by PCR-RFLP. The allele and genotype frequencies of the two variants did not differ significantly between women with polycystic ovary syndrome and the control group. Haplotype 21 (defined by the insertion allele of the indel-19 variant and the C allele of SNP-63) was found with higher frequency in both study groups, being more frequent in the polycystic ovary syndrome group; however, this difference was not statistically significant (p = 0.8353). The results suggest that SNP-63 and the indel-19 variant of the CAPN10 gene do not represent a risk factor for polycystic ovary syndrome in our patient group. Copyright © 2015. Published by Masson Doyma México S.A.

  1. New Insights into the Lake Chad Basin Population Structure Revealed by High-Throughput Genotyping of Mitochondrial DNA Coding SNPs

    Czech Academy of Sciences Publication Activity Database

    Cerezo, M.; Černý, Viktor; Carracedo, Á.; Salas, A.

    2011-01-01

    Vol. 6, No. 4 (2011), e18682 E-ISSN 1932-6203 R&D Projects: GA ČR GA206/08/1587 Institutional research plan: CEZ:AV0Z80020508 Keywords: population history * archaeogenetics * Lake Chad * SNP genotyping Subject RIV: AC - Archeology, Anthropology, Ethnology Impact factor: 4.092, year: 2011 http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0018682

  2. Integration of genomic resources to uncover pleiotropic regions associated with age at puberty and reproductive longevity in sows

    Science.gov (United States)

    Commercial and experimental genetic resources were used to investigate genetic pleiotropic factors that influence age at puberty, litter-size and reproductive longevity. The phenotypes were complemented by high-density genotyping and whole genome and RNA sequencing. The SNPs from Porcine SNP60 BeadA...

  3. A Fast Method for DEFB1-44C/G SNP Genotyping in Brazilian Patients with Periodontitis

    Directory of Open Access Journals (Sweden)

    Rafael Amorim Cavalcanti de Siqueira

    2014-01-01

    Full Text Available Aim: Defensins are cationic antimicrobial peptides expressed in epithelial cells. Such peptides exhibit antibacterial, antifungal and antiviral properties, and are a component of the innate immune response. It has been suggested that they have a protective role in the oral cavity. This study evaluated the DEFB1 polymorphism in diabetic patients with or without periodontitis in comparison to healthy controls. Material and Methods: We used the Hairpin-Shaped Primer (HP) assay to study the distribution of the -44 C/G SNP (rs1800972) in 119 human DNA samples obtained from diabetic patients and healthy control patients. Results: The results indicate that there are no differences in distribution between groups and that in diabetic periodontitis patients the homozygous mutant could be found more frequently. Conclusion: Further studies are necessary in order to investigate the role of DEFB1 polymorphisms in diabetic periodontitis patients and the influence of the peptide on periodontal pathogens.

  4. (SNP) markers for the Chinese black sleeper, Bostrychus sinensis

    African Journals Online (AJOL)

    We characterized 11 single nucleotide polymorphism (SNP) markers for the Chinese black sleeper, Bostrychus sinensis. These markers were isolated from a genomic library and tested in ten geographically distant individuals of B. sinensis. Polymorphisms of these SNP loci were assessed using a wild population including ...

  5. PARP-1 Variant Rs1136410 Confers Protection against Coronary Artery Disease in a Chinese Han Population: A Two-Stage Case-Control Study Involving 5643 Subjects

    Directory of Open Access Journals (Sweden)

    Xue-bin Wang

    2017-11-01

    Full Text Available Inhibition of poly(ADP-ribose) polymerase (PARP) may protect against coronary artery disease (CAD) in animal models, and rs1136410, a non-synonymous single nucleotide polymorphism (SNP) in PARP-1, has a potential impact on PARP activities in vitro. This two-stage case-control study, involving 2803 CAD patients and 2840 controls, aimed to investigate the associations of PARP-1 rs1136410 with CAD development, lipid levels, PARP activities, 8-hydroxy-2′-deoxyguanosine (8-OHdG), and interleukin (IL)-6 levels in a Chinese Han population. Assuming a recessive model, the variant genotype GG of SNP rs1136410 showed a significantly inverse association with CAD risk (adjusted odds ratio (OR) = 0.73, P < 0.001), left main coronary artery (LMCA) lesions (P = 0.003), vessel scores (P = 0.003), and modified Gensini scores (P < 0.001). There were significant correlations of SNP rs1136410 with higher levels of total cholesterol (TC) and lower levels of high-density lipoprotein cholesterol (HDL-c). In gene-environment interaction analyses, participants with the variant genotype GG, but without smoking habit, type 2 diabetes mellitus, and hyperlipidemia, had an 84% decreased risk of CAD (P < 0.001). The genotype-phenotype correlation analyses further supported the functional roles of SNP rs1136410 in decreasing PARP activities and 8-OHdG levels. Taken together, our data suggest that SNP rs1136410 may confer protection against CAD through modulation of PARP activities and gene-environment interactions in a Chinese Han population.

  6. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin.

    Directory of Open Access Journals (Sweden)

    Michela Troggio

    Full Text Available High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

  7. Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping

    DEFF Research Database (Denmark)

    Zhan, Bujie; Fadista, João; Thomsen, Bo

    2011-01-01

    Background Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. Results We report the integration of the whole genome...... of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation...

  8. Novel modeling of combinatorial miRNA targeting identifies SNP with potential role in bone density.

    Directory of Open Access Journals (Sweden)

    Claudia Coronnello

    Full Text Available MicroRNAs (miRNAs) are post-transcriptional regulators that bind to their target mRNAs through base complementarity. Predicting miRNA targets is a challenging task and various studies showed that existing algorithms suffer from a high number of false predictions and low to moderate overlap in their predictions. Until recently, very few algorithms considered the dynamic nature of the interactions, including the effect of less specific interactions, the miRNA expression level, and the effect of combinatorial miRNA binding. Addressing these issues can result in a more accurate miRNA:mRNA modeling with many applications, including efficient miRNA-related SNP evaluation. We present a novel thermodynamic model based on the Fermi-Dirac equation that incorporates miRNA expression in the prediction of target occupancy and we show that it improves the performance of two popular single miRNA target finders. Modeling combinatorial miRNA targeting is a natural extension of this model. Two other algorithms show improved prediction efficiency when combinatorial binding models were considered. ComiR (Combinatorial miRNA targeting), a novel algorithm we developed, incorporates the improved predictions of the four target finders into a single probabilistic score using ensemble learning. Combining target scores of multiple miRNAs using ComiR improves predictions over the naïve method for target combination. The ComiR scoring scheme can be used for identification of SNPs affecting miRNA binding. As proof of principle, ComiR identified rs17737058 as disruptive to the miR-488-5p:NCOA1 interaction, which we confirmed in vitro. We also found rs17737058 to be significantly associated with decreased bone mineral density (BMD) in two independent cohorts, indicating that the miR-488-5p/NCOA1 regulatory axis is likely critical in maintaining BMD in women. With increasing availability of comprehensive high-throughput datasets from patients ComiR is expected to become an essential
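The Fermi-Dirac idea in the record above can be illustrated with a minimal sketch. It is not ComiR's published parameterization: the mapping of miRNA expression to a chemical-potential term (`mu = RT * ln(expression)`), the independence assumption used to combine several miRNAs, and the function names `occupancy` and `combined_occupancy` are all assumptions made for illustration.

```python
import math

# Thermal energy in kcal/mol at about 37 °C (R = 1.987e-3 kcal/(mol*K), T = 310 K).
RT = 0.616


def occupancy(delta_g, expression):
    """Fermi-Dirac-style probability that a target site is occupied.

    delta_g    -- binding free energy in kcal/mol (more negative = stronger binding)
    expression -- relative miRNA abundance (assumed, arbitrary units, > 0)
    """
    if expression <= 0:
        return 0.0  # no miRNA present, so the site cannot be occupied
    mu = RT * math.log(expression)  # assumed chemical-potential term
    return 1.0 / (1.0 + math.exp((delta_g - mu) / RT))


def combined_occupancy(sites):
    """Probability that at least one site is occupied, assuming independent binding.

    sites -- iterable of (delta_g, expression) pairs, one per miRNA site
    """
    unbound = 1.0
    for delta_g, expression in sites:
        unbound *= 1.0 - occupancy(delta_g, expression)
    return 1.0 - unbound
```

In this sketch, stronger binding (more negative `delta_g`) or higher expression both raise the predicted occupancy, and adding a second miRNA site can only increase the combined value.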

  9. Effect of Co-segregating Markers on High-Density Genetic Maps and Prediction of Map Expansion Using Machine Learning Algorithms.

    Science.gov (United States)

    N'Diaye, Amidou; Haile, Jemanesh K; Fowler, D Brian; Ammar, Karim; Pozniak, Curtis J

    2017-01-01

    Advances in sequencing and genotyping methods have enabled cost-effective production of high throughput single nucleotide polymorphism (SNP) markers, making them the choice for linkage mapping. As a result, many laboratories have developed high-throughput SNP assays and built high-density genetic maps. However, the number of markers may, by orders of magnitude, exceed the resolution of recombination for a given population size so that only a minority of markers can accurately be ordered. Another issue attached to the so-called 'large p, small n' problem is that high-density genetic maps inevitably result in many markers clustering at the same position (co-segregating markers). While there are a number of related papers, none have addressed the impact of co-segregating markers on genetic maps. In the present study, we investigated the effects of co-segregating markers on high-density genetic map length and marker order using empirical data from two populations of wheat, Mohawk × Cocorit (durum wheat) and Norstar × Cappelle Desprez (bread wheat). The maps of both populations consisted of 85% co-segregating markers. Our study clearly showed that an excess of co-segregating markers can lead to map expansion, but has little effect on marker order. To estimate the inflation factor (IF), we generated a total of 24,473 linkage maps (8,203 maps for Mohawk × Cocorit and 16,270 maps for Norstar × Cappelle Desprez). Using seven machine learning algorithms, we were able to predict with an accuracy of 0.7 the map expansion due to the proportion of co-segregating markers. For example in Mohawk × Cocorit, with 10 and 80% co-segregating markers the length of the map inflated by 4.5 and 16.6%, respectively. Similarly, the map of Norstar × Cappelle Desprez expanded by 3.8 and 11.7% with 10 and 80% co-segregating markers. With the increasing number of markers on SNP-chips, the proportion of co-segregating markers in high-density maps will continue to increase making map expansion
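As a rough illustration of what "co-segregating markers" means in the record above, the sketch below bins markers whose genotype calls are identical across every line of a mapping population and reports the redundant fraction. The definitions used here (redundant markers = total markers minus distinct bins, inflation as percentage change against a reference map length) and all function names are assumptions, not the authors' procedure; missing calls are not handled.

```python
from collections import defaultdict


def cosegregation_bins(genotypes):
    """Group marker names that share exactly the same calls across all lines.

    genotypes -- dict mapping marker name to a sequence of calls, one per line
    """
    bins = defaultdict(list)
    for marker, calls in genotypes.items():
        bins[tuple(calls)].append(marker)
    return list(bins.values())


def cosegregating_fraction(genotypes):
    """Fraction of markers that are redundant copies of another marker's pattern."""
    bins = cosegregation_bins(genotypes)
    return (len(genotypes) - len(bins)) / len(genotypes)


def inflation_percent(map_length, reference_length):
    """Percentage by which a map is longer than a reference map (assumed definition)."""
    return 100.0 * (map_length - reference_length) / reference_length


# Hypothetical toy population: 4 markers scored on 5 lines (A/B parental alleles).
toy = {
    "m1": "AABBA",
    "m2": "AABBA",  # identical to m1, so it co-segregates
    "m3": "ABBBA",
    "m4": "AABBA",  # identical to m1 and m2
}
print(cosegregation_bins(toy))
print(cosegregating_fraction(toy))
```

Binning on exact identity is the simplest possible choice; real pipelines usually also have to decide how to treat missing or erroneous calls before two markers are declared co-segregating.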

  10. Effect of Co-segregating Markers on High-Density Genetic Maps and Prediction of Map Expansion Using Machine Learning Algorithms

    Directory of Open Access Journals (Sweden)

    Amidou N’Diaye

    2017-08-01

    Full Text Available Advances in sequencing and genotyping methods have enabled cost-effective production of high throughput single nucleotide polymorphism (SNP) markers, making them the choice for linkage mapping. As a result, many laboratories have developed high-throughput SNP assays and built high-density genetic maps. However, the number of markers may, by orders of magnitude, exceed the resolution of recombination for a given population size so that only a minority of markers can accurately be ordered. Another issue attached to the so-called ‘large p, small n’ problem is that high-density genetic maps inevitably result in many markers clustering at the same position (co-segregating markers). While there are a number of related papers, none have addressed the impact of co-segregating markers on genetic maps. In the present study, we investigated the effects of co-segregating markers on high-density genetic map length and marker order using empirical data from two populations of wheat, Mohawk × Cocorit (durum wheat) and Norstar × Cappelle Desprez (bread wheat). The maps of both populations consisted of 85% co-segregating markers. Our study clearly showed that an excess of co-segregating markers can lead to map expansion, but has little effect on marker order. To estimate the inflation factor (IF), we generated a total of 24,473 linkage maps (8,203 maps for Mohawk × Cocorit and 16,270 maps for Norstar × Cappelle Desprez). Using seven machine learning algorithms, we were able to predict with an accuracy of 0.7 the map expansion due to the proportion of co-segregating markers. For example in Mohawk × Cocorit, with 10 and 80% co-segregating markers the length of the map inflated by 4.5 and 16.6%, respectively. Similarly, the map of Norstar × Cappelle Desprez expanded by 3.8 and 11.7% with 10 and 80% co-segregating markers. With the increasing number of markers on SNP-chips, the proportion of co-segregating markers in high-density maps will continue to increase

  11. Genotyping-By-Sequencing (GBS) Detects Genetic Structure and Confirms Behavioral QTL in Tame and Aggressive Foxes (Vulpes vulpes).

    Science.gov (United States)

    Johnson, Jennifer L; Wittgenstein, Helena; Mitchell, Sharon E; Hyma, Katie E; Temnykh, Svetlana V; Kharlamova, Anastasiya V; Gulevich, Rimma G; Vladimirova, Anastasiya V; Fong, Hiu Wa Flora; Acland, Gregory M; Trut, Lyudmila N; Kukekova, Anna V

    2015-01-01

    The silver fox (Vulpes vulpes) offers a novel model for studying the genetics of social behavior and animal domestication. Selection of foxes, separately, for tame and for aggressive behavior has yielded two strains with markedly different, genetically determined, behavioral phenotypes. Tame strain foxes are eager to establish human contact while foxes from the aggressive strain are aggressive and difficult to handle. These strains have been maintained as separate outbred lines for over 40 generations but their genetic structure has not been previously investigated. We applied a genotyping-by-sequencing (GBS) approach to provide insights into the genetic composition of these fox populations. Sequence analysis of EcoT22I genomic libraries of tame and aggressive foxes identified 48,294 high quality SNPs. Population structure analysis revealed genetic divergence between the two strains and more diversity in the aggressive strain than in the tame one. Significant differences in allele frequency between the strains were identified for 68 SNPs. Three of these SNPs were located on fox chromosome 14 within an interval of a previously identified behavioral QTL, further supporting the importance of this region for behavior. The GBS SNP data confirmed that significant genetic diversity has been preserved in both fox populations despite many years of selective breeding. Analysis of SNP allele frequencies in the two populations identified several regions of genetic divergence between the tame and aggressive foxes, some of which may represent targets of selection for behavior. The GBS protocol used in this study significantly expanded genomic resources for the fox, and can be adapted for SNP discovery and genotyping in other canid species.

  12. A SNP panel and online tool for checking genotype concordance through comparing QR codes.

    Directory of Open Access Journals (Sweden)

    Yonghong Du

    Full Text Available In the current precision medicine era, more and more samples get genotyped and sequenced. Both researchers and commercial companies expend significant time and resources to reduce the error rate. However, it has been reported that there is a sample mix-up rate of between 0.1% and 1%, not to mention the possibly higher mix-up rate during the down-stream genetic reporting processes. Even on the low end of this estimate, this translates to a significant number of mislabeled samples, especially over the projected one billion people that will be sequenced within the next decade. Here, we first describe a method to identify a small set of single nucleotide polymorphisms (SNPs) that can uniquely identify a personal genome, which utilizes allele frequencies of five major continental populations reported in the 1000 Genomes Project and the ExAC Consortium. To make this panel more informative, we added four SNPs that are commonly used to predict ABO blood type, and another two SNPs that are capable of predicting sex. We then implement a web interface (http://qrcme.tech), nicknamed QRC (for QR code based Concordance check), which is capable of extracting the relevant ID SNPs from raw genetic data, coding their genotypes as a quick response (QR) code, and comparing QR codes to report the concordance of underlying genetic datasets. The resulting 80 fingerprinting SNPs represent a significant decrease in complexity and the number of markers used for genetic data labelling and tracking. Our method and web tool are easily accessible to both researchers and the general public who consider the accuracy of complex genetic data as a prerequisite towards precision medicine.

  13. A SNP panel and online tool for checking genotype concordance through comparing QR codes.

    Science.gov (United States)

    Du, Yonghong; Martin, Joshua S; McGee, John; Yang, Yuchen; Liu, Eric Yi; Sun, Yingrui; Geihs, Matthias; Kong, Xuejun; Zhou, Eric Lingfeng; Li, Yun; Huang, Jie

    2017-01-01

    In the current precision medicine era, more and more samples get genotyped and sequenced. Both researchers and commercial companies expend significant time and resources to reduce the error rate. However, it has been reported that there is a sample mix-up rate of between 0.1% and 1%, not to mention the possibly higher mix-up rate during the down-stream genetic reporting processes. Even on the low end of this estimate, this translates to a significant number of mislabeled samples, especially over the projected one billion people that will be sequenced within the next decade. Here, we first describe a method to identify a small set of single nucleotide polymorphisms (SNPs) that can uniquely identify a personal genome, which utilizes allele frequencies of five major continental populations reported in the 1000 Genomes Project and the ExAC Consortium. To make this panel more informative, we added four SNPs that are commonly used to predict ABO blood type, and another two SNPs that are capable of predicting sex. We then implement a web interface (http://qrcme.tech), nicknamed QRC (for QR code based Concordance check), which is capable of extracting the relevant ID SNPs from raw genetic data, coding their genotypes as a quick response (QR) code, and comparing QR codes to report the concordance of underlying genetic datasets. The resulting 80 fingerprinting SNPs represent a significant decrease in complexity and the number of markers used for genetic data labelling and tracking. Our method and web tool are easily accessible to both researchers and the general public who consider the accuracy of complex genetic data as a prerequisite towards precision medicine.
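The extract, encode and compare workflow described in the record above could look roughly like the following sketch. It does not reproduce the QRC tool: the marker identifiers in `PANEL` are hypothetical placeholders, the two-letters-per-SNP payload format is an assumption, and the payload is only the string that a QR library would render, not a QR image.

```python
# Hypothetical panel of fingerprinting markers (placeholder IDs, not the published 80 SNPs).
PANEL = ["snp_a", "snp_b", "snp_c", "snp_d"]


def encode(genotypes, panel=PANEL):
    """Build a fixed-order payload: two sorted allele letters per SNP, '--' if untyped."""
    parts = []
    for snp in panel:
        call = genotypes.get(snp)
        parts.append("".join(sorted(call)) if call else "--")
    return "".join(parts)


def concordance(payload_a, payload_b):
    """Fraction of SNPs typed in both payloads whose genotypes match.

    Returns None when the two payloads share no typed SNP.
    """
    if len(payload_a) != len(payload_b):
        raise ValueError("payloads were built from different panels")
    same = compared = 0
    for i in range(0, len(payload_a), 2):
        a, b = payload_a[i:i + 2], payload_b[i:i + 2]
        if "--" in (a, b):
            continue  # skip SNPs missing in either dataset
        compared += 1
        same += a == b
    return same / compared if compared else None


# Two hypothetical datasets that should come from the same person.
array_calls = {"snp_a": "AG", "snp_b": "CC", "snp_c": "TC", "snp_d": "GG"}
sequencing_calls = {"snp_a": "GA", "snp_b": "CC", "snp_c": "TT"}
print(concordance(encode(array_calls), encode(sequencing_calls)))
```

Sorting the two alleles before encoding makes "AG" and "GA" compare as equal, which matters when the two datasets come from pipelines that report alleles in different orders.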

  14. Genotype-Dependent Effect of Exogenous Nitric Oxide on Cd-induced Changes in Antioxidative Metabolism, Ultrastructure, and Photosynthetic Performance in Barley Seedlings (Hordeum vulgare)

    DEFF Research Database (Denmark)

    Chen, Fei; Wang, Fang; Sun, Hongyan

    2010-01-01

    A greenhouse hydroponic experiment was performed using Cd-sensitive (cv. Dong 17) and Cd-tolerant (Weisuobuzhi) barley seedlings to evaluate how different genotypes responded to cadmium (Cd) toxicity in the presence of sodium nitroprusside (SNP), a nitric oxide (NO) donor. Results showed that 5 μM Cd increased the accumulation of O2•-, H2O2, and malondialdehyde (MDA) but reduced plant height, chlorophyll content, net photosynthetic rate (Pn), and biomass, with a much more severe response in the Cd-sensitive genotype. Antioxidant enzyme activities increased significantly under Cd stress in the roots of the tolerant genotype, whereas in leaves of the sensitive genotype, superoxide dismutase (SOD) and ascorbate peroxidase (APX), especially cytosol ascorbate peroxidase (cAPX), decreased after 5-15 days of Cd exposure. Moreover, Cd induces NO synthesis by stimulating nitrate reductase and nitric oxide...

  15. Individual patient data meta-analysis shows no association between the SNP rs1800469 in TGFB and late radiotherapy toxicity

    International Nuclear Information System (INIS)

    Barnett, Gillian C.; Elliott, Rebecca M.; Alsner, Jan; Andreassen, Christian N.; Abdelhay, Osama; Burnet, Neil G.; Chang-Claude, Jenny; Coles, Charlotte E.; Gutiérrez-Enríquez, Sara; Fuentes-Raspall, Maria J.; Alonso-Muñoz, Maria C.; Kerns, Sarah; Raabe, Annette; Symonds, R. Paul; Seibold, Petra; Talbot, Chris J.; Wenz, Frederik; Wilkinson, Jennifer; Yarnold, John; Dunning, Alison M.

    2012-01-01

    Background and purpose: Reported associations between risk of radiation-induced normal tissue injury and single nucleotide polymorphisms (SNPs) in TGFB1, encoding the pro-fibrotic cytokine transforming growth factor-beta 1 (TGF-β1), remain controversial. To overcome publication bias, the international Radiogenomics Consortium collected and analysed individual patient level data from both published and unpublished studies. Materials and methods: TGFB1 SNP rs1800469 c.-1347T>C (previously known as C-509T) genotype, treatment-related data, and clinically-assessed fibrosis (measured at least 2 years after therapy) were available in 2782 participants from 11 cohorts. All received adjuvant breast radiotherapy. Associations between late fibrosis or overall toxicity, reported by STAT (Standardised Total Average Toxicity) score, and rs1800469 genotype were assessed. Results: No statistically significant associations between either fibrosis or overall toxicity and rs1800469 genotype were observed with univariate or multivariate regression analysis. The multivariate odds ratio (OR), obtained from meta-analysis, for an increase in late fibrosis grade with each additional rare allele of rs1800469 was 0.98 (95% Confidence Interval (CI) 0.85–1.11). This CI is sufficiently narrow to rule out any clinically relevant effect on toxicity risk in carriers vs. non-carriers with a high probability. Conclusion: This meta-analysis has not confirmed previous reports of association between fibrosis or overall toxicity and rs1800469 genotype in breast cancer patients. It has demonstrated successful collaboration within the Radiogenomics Consortium.
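The record above reports a pooled odds ratio with a 95% confidence interval. How such a pooled estimate can be formed from cohort-level results is sketched below using fixed-effect inverse-variance weighting on the log odds ratio scale. The abstract does not state which pooling model was used, so the method, the function names and the example inputs are all assumptions.

```python
import math

Z95 = 1.959964  # two-sided 95% normal quantile


def se_from_ci(lower, upper):
    """Standard error of a log odds ratio recovered from its 95% CI bounds."""
    return (math.log(upper) - math.log(lower)) / (2 * Z95)


def pool_fixed_effect(estimates):
    """Inverse-variance pooled odds ratio.

    estimates -- list of (odds_ratio, ci_lower, ci_upper), one per cohort
    Returns (pooled_or, ci_lower, ci_upper).
    """
    weighted_sum = total_weight = 0.0
    for odds_ratio, lower, upper in estimates:
        weight = 1.0 / se_from_ci(lower, upper) ** 2
        weighted_sum += weight * math.log(odds_ratio)
        total_weight += weight
    log_or = weighted_sum / total_weight
    se = math.sqrt(1.0 / total_weight)
    return (
        math.exp(log_or),
        math.exp(log_or - Z95 * se),
        math.exp(log_or + Z95 * se),
    )


# Hypothetical cohort-level results, for illustration only.
cohorts = [(0.95, 0.70, 1.29), (1.02, 0.80, 1.30), (0.97, 0.75, 1.25)]
print(pool_fixed_effect(cohorts))
```

Because each cohort is weighted by the inverse of its variance, the pooled interval is narrower than any single cohort's interval, which is how a combined analysis can rule out effects that individual studies cannot.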

  16. Evaluating Imputation Algorithms for Low-Depth Genotyping-By-Sequencing (GBS) Data.

    Directory of Open Access Journals (Sweden)

    Ariel W Chan

    Full Text Available Well-powered genomic studies require genome-wide marker coverage across many individuals. For non-model species with few genomic resources, high-throughput sequencing (HTS) methods, such as Genotyping-By-Sequencing (GBS), offer an inexpensive alternative to array-based genotyping. Although affordable, datasets derived from HTS methods suffer from sequencing error, alignment errors, and missing data, all of which introduce noise and uncertainty to variant discovery and genotype calling. Under such circumstances, meaningful analysis of the data is difficult. Our primary interest lies in the issue of how one can accurately infer or impute missing genotypes in HTS-derived datasets. Many of the existing genotype imputation algorithms and software packages were primarily developed by and optimized for the human genetics community, a field where a complete and accurate reference genome has been constructed and SNP arrays have, in large part, been the common genotyping platform. We set out to answer two questions: 1) can we use existing imputation methods developed by the human genetics community to impute missing genotypes in datasets derived from non-human species and 2) are these methods, which were developed and optimized to impute ascertained variants, amenable for imputation of missing genotypes at HTS-derived variants? We selected Beagle v.4, a widely used algorithm within the human genetics community with reportedly high accuracy, to serve as our imputation contender. We performed a series of cross-validation experiments, using GBS data collected from the species Manihot esculenta by the Next Generation (NEXTGEN) Cassava Breeding Project. NEXTGEN currently imputes missing genotypes in their datasets using a LASSO-penalized, linear regression method (denoted 'glmnet'). We selected glmnet to serve as a benchmark imputation method for this reason. We obtained estimates of imputation accuracy by masking a subset of observed genotypes, imputing, and

  17. Evaluating Imputation Algorithms for Low-Depth Genotyping-By-Sequencing (GBS) Data.

    Science.gov (United States)

    Chan, Ariel W; Hamblin, Martha T; Jannink, Jean-Luc

    2016-01-01

    Well-powered genomic studies require genome-wide marker coverage across many individuals. For non-model species with few genomic resources, high-throughput sequencing (HTS) methods, such as Genotyping-By-Sequencing (GBS), offer an inexpensive alternative to array-based genotyping. Although affordable, datasets derived from HTS methods suffer from sequencing error, alignment errors, and missing data, all of which introduce noise and uncertainty to variant discovery and genotype calling. Under such circumstances, meaningful analysis of the data is difficult. Our primary interest lies in the issue of how one can accurately infer or impute missing genotypes in HTS-derived datasets. Many of the existing genotype imputation algorithms and software packages were primarily developed by and optimized for the human genetics community, a field where a complete and accurate reference genome has been constructed and SNP arrays have, in large part, been the common genotyping platform. We set out to answer two questions: 1) can we use existing imputation methods developed by the human genetics community to impute missing genotypes in datasets derived from non-human species and 2) are these methods, which were developed and optimized to impute ascertained variants, amenable for imputation of missing genotypes at HTS-derived variants? We selected Beagle v.4, a widely used algorithm within the human genetics community with reportedly high accuracy, to serve as our imputation contender. We performed a series of cross-validation experiments, using GBS data collected from the species Manihot esculenta by the Next Generation (NEXTGEN) Cassava Breeding Project. NEXTGEN currently imputes missing genotypes in their datasets using a LASSO-penalized, linear regression method (denoted 'glmnet'). We selected glmnet to serve as a benchmark imputation method for this reason. We obtained estimates of imputation accuracy by masking a subset of observed genotypes, imputing, and calculating the
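The mask, impute and score procedure mentioned in the record above can be sketched as follows. The imputer here is a deliberately naive stand-in (the most frequent call per marker), not Beagle or glmnet, and the accuracy metric (proportion of masked calls recovered exactly) is an assumption, since the abstract is cut off before naming the metric. Genotypes are coded 0/1/2 with `None` for missing; rows are individuals and columns are markers.

```python
import random
from collections import Counter


def mask_genotypes(matrix, fraction, seed=0):
    """Hide a random subset of observed calls; return (masked copy, hidden positions)."""
    rng = random.Random(seed)
    observed = [
        (i, j)
        for i, row in enumerate(matrix)
        for j, call in enumerate(row)
        if call is not None
    ]
    hidden = rng.sample(observed, max(1, int(len(observed) * fraction)))
    masked = [list(row) for row in matrix]
    for i, j in hidden:
        masked[i][j] = None
    return masked, hidden


def impute_mode(matrix):
    """Stand-in imputer: fill each missing call with that marker's most frequent call."""
    filled = [list(row) for row in matrix]
    for j in range(len(matrix[0])):
        calls = [row[j] for row in matrix if row[j] is not None]
        mode = Counter(calls).most_common(1)[0][0] if calls else 0
        for row in filled:
            if row[j] is None:
                row[j] = mode
    return filled


def masked_accuracy(truth, imputed, hidden):
    """Proportion of hidden calls that the imputer recovered exactly."""
    hits = sum(truth[i][j] == imputed[i][j] for i, j in hidden)
    return hits / len(hidden)


# Hypothetical genotype matrix: 5 individuals x 3 markers.
truth = [[0, 2, 1], [0, 2, 1], [0, 1, 1], [1, 2, 1], [0, 2, 2]]
masked, hidden = mask_genotypes(truth, fraction=0.2)
print(masked_accuracy(truth, impute_mode(masked), hidden))
```

Swapping `impute_mode` for a real imputation method while keeping the masking and scoring steps fixed is what makes the comparison between methods fair: every method is judged on the same hidden calls.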

  18. Genome-wide association mapping including phenotypes from relatives without genotypes in a single-step (ssGWAS) for 6-week body weight in broiler chickens

    Directory of Open Access Journals (Sweden)

    Huiyu Wang

    2014-05-01

    Full Text Available The purpose of this study was to compare results obtained from various methodologies for genome-wide association studies, when applied to real data, in terms of number and commonality of regions identified and their genetic variance explained, computational speed, and possible pitfalls in interpretations of results. Methodologies include: two iteratively reweighted single-step genomic BLUP procedures (ssGWAS1 and ssGWAS2), a single-marker model (CGWAS), and BayesB. The ssGWAS methods utilize genomic breeding values (GEBVs) based on combined pedigree, genomic and phenotypic information, while CGWAS and BayesB only utilize phenotypes from genotyped animals or pseudo-phenotypes. In this study, ssGWAS was performed by converting GEBVs to SNP marker effects. Unequal marker variances were used to calculate weights that were incorporated into a new genomic relationship matrix. SNP weights were refined iteratively. The data were body weights at 6 weeks on 274,776 broiler chickens, of which 4553 were genotyped using a 60k SNP chip. Comparison of genomic regions was based on genetic variances explained by local SNP regions (20 SNPs). After 3 iterations, the noise in ssGWAS1 was greatly reduced and the results were similar to those of CGWAS, with 4 out of the top 10 regions in common. In contrast, for BayesB, the plot was dominated by a single region explaining 23.1% of the genetic variance. This same region was found by ssGWAS1 with the same rank, but the amount of genetic variation attributed to the region was only 3%. These findings emphasize the need for caution when comparing and interpreting results from various methods, and highlight that detected associations, and strength of association, strongly depend on methodologies and details of implementations. BayesB appears to overly shrink regions to zero, while overestimating the amount of genetic variation attributed to the remaining SNP effects. The real world is most likely a compromise between methods and remains to

  19. Development of a 44K SNP assay focussing on the analysis of a varroa-specific defence behaviour in honey bees (Apis mellifera carnica).

    Science.gov (United States)

    Spötter, A; Gupta, P; Nürnberg, G; Reinsch, N; Bienefeld, K

    2012-03-01

    Honey bees are exposed to a number of damaging pathogens and parasites. The most destructive among them, affecting mainly the brood, is Varroa destructor. A promising approach to prevent its spread is to breed for Varroa-tolerant honey bees. A trait that has been shown to provide significant resistance against the Varroa mite is hygienic behaviour, a behavioural response of honey bee workers to brood diseases in general. This study reports the development of a 44K SNP assay, specifically designed for the analysis of hygienic behaviour of individual worker bees (Apis mellifera carnica) directed against V. destructor. Initially, 70,000 SNPs chosen from a large set of SNPs published by the Honey Bee Genome Project were validated for their suitability in the analysis of the Varroa resistance trait 'uncapping of Varroa-infested brood'. This was achieved by genotyping of pooled DNA samples of trait bearers and two trait-negative controls using next-generation sequencing. Approximately 36,000 of these validated SNPs and another 8000 SNPs not validated in this study were selected for the construction of a SNP assay. This assay will be employed in following experiments to analyse individualized DNA samples in order to identify quantitative trait loci (QTL) involved in the control of the investigated trait and to evaluate and possibly confirm QTL found in other studies. However, this assay is not only suitable for studying Varroa tolerance; it is equally applicable to the analysis of any other trait in honey bees. In addition, because of its high density, this assay provides access to genomic selection with respect to several traits considered in honey bee breeding. It will become publicly available via AROS Applied Biotechnology AS, Aarhus, Denmark, before the end of the year 2011. © 2011 Blackwell Publishing Ltd.

  20. A set of 14 DIP-SNP markers to detect unbalanced DNA mixtures.

    Science.gov (United States)

    Liu, Zhizhen; Liu, Jinding; Wang, Jiaqi; Chen, Deqing; Liu, Zidong; Shi, Jie; Li, Zeqin; Li, Wenyan; Zhang, Gengqian; Du, Bing

    2018-03-04

    Unbalanced DNA mixtures remain a difficult problem in forensic practice. DIP-STRs are useful markers for detecting minor DNA, but they are not widespread in the human genome and have long amplicons. In this study, we proposed a novel type of genetic marker, termed DIP-SNP. DIP-SNP refers to the combination of an INDEL and a SNP within less than 300 bp of the human genome. The multiplex PCR and SNaPshot assay were established for 14 DIP-SNP markers in a Chinese Han population from Shanxi, China. This novel compound marker allows detection of the minor DNA contributor with sensitivity from 1:50 to 1:1000 in a DNA mixture of any gender with 1 ng-10 ng DNA template. Most of the DIP-SNP markers had a relatively high probability of informative alleles with an average I value of 0.33. In all, we proposed DIP-SNP as a novel kind of genetic marker for detection of the minor contributor in unbalanced DNA mixtures and established the detection method by combining the multiplex PCR and SNaPshot assay. DIP-SNP polymorphisms are promising markers for forensic or clinical mixture examination because they are shorter, widespread and more sensitive. Copyright © 2018 Elsevier Inc. All rights reserved.

  1. T.I.M.S: TaqMan Information Management System, tools to organize data flow in a genotyping laboratory

    Directory of Open Access Journals (Sweden)

    Albion Tim

    2005-10-01

    Full Text Available. Abstract. Background: Single Nucleotide Polymorphism (SNP) genotyping is a major activity in biomedical research. The TaqMan technology is one of the most commonly used approaches. It produces large amounts of data that are difficult to process by hand. Laboratories not equipped with a Laboratory Information Management System (LIMS) need tools to organize the data flow. Results: We propose a package of Visual Basic programs focused on sample management and on the parsing of input and output TaqMan files. The code is written in Visual Basic, embedded in the Microsoft Office package, and it allows anyone to have access to those tools, without any programming skills and with basic computer requirements. Conclusion: We have created useful tools focused on management of TaqMan genotyping data, a critical issue in genotyping laboratories without a more sophisticated and expensive system, such as a LIMS.

  2. A study of associations between early DHA status and fatty acid desaturase (FADS) SNP and developmental outcomes in children of obese mothers.

    Science.gov (United States)

    Andersen, Karina R; Harsløf, Laurine B S; Schnurr, Theresia M; Hansen, Torben; Hellgren, Lars I; Michaelsen, Kim F; Lauritzen, Lotte

    2017-01-01

    DHA from diet or endogenous synthesis has been proposed to affect infant development, however, results are inconclusive. In this study, we aim to verify previously observed fatty acid desaturase gene cluster (FADS) SNP-specific associations with erythrocyte DHA status in 9-month-old children and sex-specific association with developmental outcomes. The study was performed in 166 children (55 % boys) of obese mothers. Erythrocyte fatty acid composition was analysed in blood-samples obtained at 9 months of age, and developmental outcomes assessed by the Ages and Stages Questionnaire at 3 years. Erythrocyte DHA level ranged from 4·4 to 9·9 % of fatty acids, but did not show any association with FADS SNP or other potential determinants. Regression analysis showed associations between erythrocyte DHA and scores for personal-social skills (β 1·8 (95 % CI 0·3, 3·3), P=0·019) and problem solving (β 3·4 (95 % CI 1·2, 5·6), P=0·003). A tendency was observed for an association in opposite direction between minor alleles (G-variant) of rs1535 and rs174575 and personal-social skills (P=0·062 and 0·068, respectively), which became significant when the SNP were combined based on their previously observed effect on erythrocyte DHA at 9 months of age (β 2·6 (95 % CI 0·01, 5·1), P=0·011). Sex-SNP interaction was indicated for rs174575 genotype on fine motor scores (P=0·016), due to higher scores among minor allele carrying girls (P=0·043), whereas no effect was seen among boys. In conclusion, DHA-increasing FADS SNP and erythrocyte DHA status were consistently associated with improved personal-social skills in this small cohort of children of obese mothers irrespective of sex, but the sample was too small to verify potential sex-specific effects.

  3. Expression Level of the DREB2-Type Gene, Identified with Amplifluor SNP Markers, Correlates with Performance, and Tolerance to Dehydration in Bread Wheat Cultivars from Northern Kazakhstan

    Science.gov (United States)

    Shavrukov, Yuri; Zhumalin, Aibek; Serikbay, Dauren; Botayeva, Makpal; Otemisova, Ainur; Absattarova, Aiman; Sereda, Grigoriy; Sereda, Sergey; Shvidchenko, Vladimir; Turbekova, Arysgul; Jatayev, Satyvaldy; Lopato, Sergiy; Soole, Kathleen; Langridge, Peter

    2016-01-01

    A panel of 89 local commercial cultivars of bread wheat was tested in field trials in the dry conditions of Northern Kazakhstan. Two distinct groups of cultivars (six cultivars in each group), which had the highest and the lowest grain yield under drought were selected for further experiments. A dehydration test conducted on detached leaves indicated a strong association between rates of water loss in plants from the first group with highest grain yield production in the dry environment relative to the second group. Modern high-throughput Amplifluor Single Nucleotide Polymorphism (SNP) technology was applied to study allelic variations in a series of drought-responsive genes using 19 SNP markers. Genotyping of an SNP in the TaDREB5 (DREB2-type) gene using the Amplifluor SNP marker KATU48 revealed clear allele distribution across the entire panel of wheat accessions, and distinguished between the two groups of cultivars with high and low yield under drought. Significant differences in expression levels of TaDREB5 were revealed by qRT-PCR. Most wheat plants from the first group of cultivars with high grain yield showed slight up-regulation in the TaDREB5 transcript in dehydrated leaves. In contrast, expression of TaDREB5 in plants from the second group of cultivars with low grain yield was significantly down-regulated. It was found that SNPs did not alter the amino acid sequence of TaDREB5 protein. Thus, a possible explanation is that alternative splicing and up-stream regulation of TaDREB5 may be affected by SNP, but these hypotheses require additional analysis (and will be the focus of future studies). PMID:27917186

  4. Expression level of the DREB2-type gene, identified with Amplifluor SNP markers, correlates with performance and tolerance to dehydration in bread wheat cultivars from Northern Kazakhstan

    Directory of Open Access Journals (Sweden)

    Yuri Shavrukov

    2016-11-01

    Full Text Available A panel of 89 local commercial cultivars of bread wheat was tested in field trials in the dry conditions of Northern Kazakhstan. Two distinct groups of cultivars (six cultivars in each group), which had the highest and the lowest grain yield under drought were selected for further experiments. A dehydration test conducted on detached leaves indicated a strong association between rates of water loss in plants from the first group with highest grain yield production in the dry environment relative to the second group. Modern high-throughput Amplifluor SNP technology was applied to study allelic variations in a series of drought-responsive genes using 19 SNP markers. Genotyping of an SNP in the TaDREB5 (DREB2-type) gene using the Amplifluor SNP marker KATU48 revealed clear allele distribution across the entire panel of wheat accessions, and distinguished between the two groups of cultivars with high and low yield under drought. Significant differences in expression levels of TaDREB5 were revealed by qRT-PCR. Most wheat plants from the first group of cultivars with high grain yield showed strong up-regulation of TaDREB5 transcript in dehydrated leaves. In contrast, expression of TaDREB5 in plants from the second group of cultivars with low grain yield was significantly down-regulated. It was found that SNPs did not alter the amino acid sequence of TaDREB5 protein. Thus, a possible explanation is that alternative splicing and up-stream regulation of TaDREB5 may be affected by SNP, but these hypotheses require additional analysis (and will be the focus of future studies).

  5. Use of genotyping by sequencing data to develop a high-throughput and multifunctional SNP panel for conservation applications in Pacific lamprey.

    Science.gov (United States)

    Hess, Jon E; Campbell, Nathan R; Docker, Margaret F; Baker, Cyndi; Jackson, Aaron; Lampman, Ralph; McIlraith, Brian; Moser, Mary L; Statler, David P; Young, William P; Wildbill, Andrew J; Narum, Shawn R

    2015-01-01

    Next-generation sequencing data can be mined for highly informative single nucleotide polymorphisms (SNPs) to develop high-throughput genomic assays for nonmodel organisms. However, choosing a set of SNPs to address a variety of objectives can be difficult because SNPs are often not equally informative. We developed an optimal combination of 96 high-throughput SNP assays from a total of 4439 SNPs identified in a previous study of Pacific lamprey (Entosphenus tridentatus) and used them to address four disparate objectives: parentage analysis, species identification and characterization of neutral and adaptive variation. Nine of these SNPs are FST outliers, and five of these outliers are localized within genes and significantly associated with geography, run-timing and dwarf life history. Two of the 96 SNPs were diagnostic for two other lamprey species that were morphologically indistinguishable at early larval stages and were sympatric in the Pacific Northwest. The majority (85) of SNPs in the panel were highly informative for parentage analysis, that is, putatively neutral with high minor allele frequency across the species' range. Results from three case studies are presented to demonstrate the broad utility of this panel of SNP markers in this species. As Pacific lamprey populations are undergoing rapid decline, these SNPs provide an important resource to address critical uncertainties associated with the conservation and recovery of this imperiled species. © 2014 John Wiley & Sons Ltd.

  6. LS-SNP/PDB: annotated non-synonymous SNPs mapped to Protein Data Bank structures.

    Science.gov (United States)

    Ryan, Michael; Diekhans, Mark; Lien, Stephanie; Liu, Yun; Karchin, Rachel

    2009-06-01

    LS-SNP/PDB is a new WWW resource for genome-wide annotation of human non-synonymous (amino acid changing) SNPs. It serves high-quality protein graphics rendered with UCSF Chimera molecular visualization software. The system is kept up-to-date by an automated, high-throughput build pipeline that systematically maps human nsSNPs onto Protein Data Bank structures and annotates several biologically relevant features. LS-SNP/PDB is available at (http://ls-snp.icm.jhu.edu/ls-snp-pdb) and via links from protein data bank (PDB) biology and chemistry tabs, UCSC Genome Browser Gene Details and SNP Details pages and PharmGKB Gene Variants Downloads/Cross-References pages.

  7. Use of direct and iterative solvers for estimation of SNP effects in genome-wide selection

    Directory of Open Access Journals (Sweden)

    Eduardo da Cruz Gouveia Pimentel

    2010-01-01

    Full Text Available The aim of this study was to compare iterative and direct solvers for estimation of marker effects in genomic selection. One iterative and two direct methods were used: Gauss-Seidel with Residual Update, Cholesky Decomposition and Gentleman-Givens rotations. To represent different scenarios with respect to the number of markers and of genotyped animals, a simulated data set divided into 25 subsets was used. Number of markers ranged from 1,200 to 5,925 and number of animals ranged from 1,200 to 5,865. Methods were also applied to real data comprising 3081 individuals genotyped for 45181 SNPs. Results from simulated data showed that the iterative solver was substantially faster than direct methods for larger numbers of markers. Use of a direct solver may allow for computing (co)variances of SNP effects. When applied to real data, performance of the iterative method varied substantially, depending on the level of ill-conditioning of the coefficient matrix. From results with real data, Gentleman-Givens rotations would be the method of choice in this particular application as it provided an exact solution within a fairly reasonable time frame (less than two hours). It would indeed be the preferred method whenever computer resources allow its use.
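    To make the iterative approach named in the abstract concrete, the following is a minimal sketch of Gauss-Seidel with residual update for a ridge-type marker model, assuming the system being solved is (X'X + lam*I) g = X'y with genotypes coded 0/1/2. The function name `gsru`, the shrinkage parameter `lam` and the toy data in the usage note are illustrative assumptions and are not taken from the study.

```python
def gsru(X, y, lam, n_iter=200, tol=1e-12):
    """Solve (X'X + lam*I) g = X'y by Gauss-Seidel with residual update.

    X is a list of rows (animals) of marker codes, y the phenotypes.
    The residual vector e = y - X g is kept up to date after every
    single-marker update, so X'X is never formed explicitly.
    """
    n, m = len(X), len(X[0])
    cols = [[X[i][j] for i in range(n)] for j in range(m)]
    xtx = [sum(v * v for v in col) for col in cols]  # diagonal of X'X
    g = [0.0] * m
    e = list(y)  # with g = 0 the residuals equal the phenotypes
    for _ in range(n_iter):
        max_change = 0.0
        for j, col in enumerate(cols):
            # Right-hand side for marker j with its own effect added back.
            rhs = sum(c * r for c, r in zip(col, e)) + xtx[j] * g[j]
            new = rhs / (xtx[j] + lam)
            delta = new - g[j]
            if delta != 0.0:
                # Residual update: remove the change in marker j's contribution.
                e = [r - c * delta for c, r in zip(col, e)]
            g[j] = new
            max_change = max(max_change, abs(delta))
        if max_change < tol:
            break
    return g
```

    For example, `gsru([[1, 0], [0, 1], [1, 1], [2, 1]], [1.0, 2.0, 3.0, 4.0], lam=1.0)` returns two marker effects that satisfy the normal equations to within the tolerance. Each sweep costs on the order of (animals × markers) operations, which is one plausible reading of why the abstract reports the iterative solver as faster for larger marker sets.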

  8. Heap: a highly sensitive and accurate SNP detection tool for low-coverage high-throughput sequencing data

    KAUST Repository

    Kobayashi, Masaaki

    2017-04-20

    Recent availability of large-scale genomic resources enables us to conduct so-called genome-wide association studies (GWAS) and genomic prediction (GP) studies, particularly with next-generation sequencing (NGS) data. The effectiveness of GWAS and GP depends not only on their mathematical models, but also on the quality and quantity of variants employed in the analysis. In NGS single nucleotide polymorphism (SNP) calling, conventional tools ideally require more reads for higher SNP sensitivity and accuracy. In this study, we aimed to develop a tool, Heap, that enables robustly sensitive and accurate calling of SNPs, particularly with low-coverage NGS data, which must be aligned to the reference genome sequences in advance. To reduce false positive SNPs, Heap determines genotypes and calls SNPs at each site except for sites at either end of reads or sites containing a minor allele supported by only one read. Performance comparison with existing tools showed that Heap achieved the highest F-scores with low coverage (7X) restriction-site associated DNA sequencing reads of sorghum and rice individuals. This will facilitate cost-effective GWAS and GP studies in this NGS era. Code and documentation of Heap are freely available from https://github.com/meiji-bioinf/heap (29 March 2017, date last accessed) and our web site (http://bioinf.mind.meiji.ac.jp/lab/en/tools.html (29 March 2017, date last accessed)).
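    The two exclusion rules described in the abstract can be sketched as a per-site filter. This is not Heap's actual code or interface: the function `candidate_alleles`, the pileup tuple layout, and the parameters `end_trim` and `min_minor_reads` are hypothetical names chosen for illustration, and the number of read-end positions that Heap ignores is not stated in the abstract.

```python
from collections import Counter

def candidate_alleles(pileup, end_trim=1, min_minor_reads=2):
    """Return allele counts for one site, or None if the site is skipped.

    pileup: list of (base, offset_in_read, read_length) tuples, one per
    aligned read covering the site (hypothetical layout).
    end_trim: number of positions at each read end to ignore (assumption).
    min_minor_reads: minimum number of reads that must support the
    second most common allele (2 means "more than one read").
    """
    counts = Counter(
        base
        for base, offset, length in pileup
        if end_trim <= offset < length - end_trim  # drop read-end observations
    )
    if len(counts) < 2:
        return None  # only one allele left, nothing to call
    minor_support = counts.most_common()[1][1]
    if minor_support < min_minor_reads:
        return None  # minor allele rests on a single read
    return dict(counts)
```

    With four reads showing `A` and two showing `G` in read interiors, the site is kept; if the only `G` observations sit at a read end or come from a single read, the function returns `None`.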

  9. FTO genotype is associated with exercise training-induced changes in body composition

    Science.gov (United States)

    Rankinen, Tuomo; Rice, Treva; Teran-Garcia, Margarita; Rao, D.C.; Bouchard, Claude

    2010-01-01

    The fat mass and obesity associated (FTO) gene is the first obesity-susceptibility gene identified by genome-wide association scans and confirmed in several follow-up studies. Homozygotes for the risk allele (A/A) have 1.67 times greater risk of obesity than those who do not have the allele. However, it is not known if regular exercise-induced changes in body composition are influenced by the FTO genotype. The purpose of our study was to test if the FTO genotype is associated with exercise-induced changes in adiposity. Body composition was derived from underwater weighing before and after a 20-week endurance training program in 481 previously sedentary white subjects of the HERITAGE Family Study. FTO SNP rs8050136 was genotyped using Illumina GoldenGate assay. In the sedentary state, the A/A homozygotes were significantly heavier and fatter than the heterozygotes and the C/C homozygotes in men (p=0.004) but not in women (p=0.331; gene-by-sex interaction p=0.0053). The FTO genotype was associated with body fat responses to regular exercise (p<0.005; adjusted for age, sex, and baseline value of response trait): carriers of the C-allele showed three times greater fat mass and %body fat losses than the A/A homozygotes. The FTO genotype explained 2% of the variance in adiposity changes. Our data suggest that the FTO obesity-susceptibility genotype influences the body fat responses to regular exercise. Resistance to exercise-induced reduction in total adiposity may represent one mechanism by which the FTO A allele promotes overweight and obesity. PMID:19543202

  10. Assessing the associations of blood metabolites with osteoporosis: a Mendelian randomization study.

    Science.gov (United States)

    Liu, Li; Wen, Yan; Zhang, Lei; Xu, Peng; Liang, Xiao; Du, Yanan; Li, Ping; He, Awen; Fan, QianRui; Hao, Jingcan; Wang, Wenyu; Guo, Xiong; Shen, Hui; Tian, Qing; Zhang, Feng; Deng, Hong-Wen

    2018-03-01

    Osteoporosis is a metabolic bone disease. The impact of blood metabolites on the development of osteoporosis remains elusive. This study aimed to explore the relationship between blood metabolites and osteoporosis. We used 2,286 unrelated Caucasian subjects as discovery samples and 3,143 unrelated Caucasian subjects from the Framingham heart study (FHS) as replication samples. Bone mineral density (BMD) was measured using dual-energy X-ray absorptiometry. Genome-wide SNP genotyping was performed using Affymetrix Human SNP Array 6.0 (for discovery samples) and Affymetrix SNP 500K and 50K array (for FHS replication samples). The SNP sets significantly associated with blood metabolites were obtained from a published whole-genome sequencing study. For each subject, the genetic risk score (GRS) of each metabolite was calculated from the genotype data of metabolite-associated SNP sets. Pearson correlation analysis was conducted to evaluate the potential impact of blood metabolites on the variations of bone phenotypes. A total of 10,000 permutations were conducted to calculate the empirical P value and false discovery rate (FDR). A total of 481 blood metabolites were analyzed in this study. We identified multiple blood metabolites associated with hip BMD, such as 1,5-anhydroglucitol(1,5-AG) (Pdiscovery metabolites on the variations of BMD, and identified several candidate blood metabolites for osteoporosis.
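    The analysis chain in this abstract (genetic risk score, Pearson correlation, permutation-based empirical P value) can be sketched as follows. The abstract does not say how the score was weighted, so the weighted sum of allele dosages used here is an assumption, and the helper names `grs`, `pearson` and `permutation_p` are illustrative only.

```python
import random

def grs(dosages, weights):
    """Genetic risk score as a weighted sum of allele dosages (0, 1 or 2).

    The per-SNP weights (e.g. published effect sizes) are an assumption.
    """
    return sum(d * w for d, w in zip(dosages, weights))

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def permutation_p(x, y, n_perm=10000, seed=0):
    """Empirical two-sided P value obtained by shuffling y against x."""
    rng = random.Random(seed)
    observed = abs(pearson(x, y))
    y_perm = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(y_perm)
        if abs(pearson(x, y_perm)) >= observed:
            hits += 1
    # Add one to numerator and denominator so the estimate is never zero.
    return (hits + 1) / (n_perm + 1)
```

    In use, `x` would hold one GRS value per subject and `y` the matching BMD measurements; the default of 10,000 shuffles mirrors the number of permutations reported in the abstract.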

  11. BDNF rs6265 methylation and genotype interact on risk for schizophrenia.

    Science.gov (United States)

    Ursini, Gianluca; Cavalleri, Tommaso; Fazio, Leonardo; Angrisano, Tiziana; Iacovelli, Luisa; Porcelli, Annamaria; Maddalena, Giancarlo; Punzi, Giovanna; Mancini, Marina; Gelao, Barbara; Romano, Raffaella; Masellis, Rita; Calabrese, Francesca; Rampino, Antonio; Taurisano, Paolo; Di Giorgio, Annabella; Keller, Simona; Tarantini, Letizia; Sinibaldi, Lorenzo; Quarto, Tiziana; Popolizio, Teresa; Caforio, Grazia; Blasi, Giuseppe; Riva, Marco A; De Blasi, Antonio; Chiariotti, Lorenzo; Bollati, Valentina; Bertolino, Alessandro

    2016-01-01

    Epigenetic mechanisms can mediate gene-environment interactions relevant for complex disorders. The BDNF gene is crucial for development and brain plasticity, is sensitive to environmental stressors, such as hypoxia, and harbors the functional SNP rs6265 (Val(66)Met), which creates or abolishes a CpG dinucleotide for DNA methylation. We found that methylation at the BDNF rs6265 Val allele in peripheral blood of healthy subjects is associated with hypoxia-related early life events (hOCs) and intermediate phenotypes for schizophrenia in a distinctive manner, depending on rs6265 genotype: in ValVal individuals increased methylation is associated with exposure to hOCs and impaired working memory (WM) accuracy, while the opposite is true for ValMet subjects. Also, rs6265 methylation and hOCs interact in modulating WM-related prefrontal activity, another intermediate phenotype for schizophrenia, with an analogous opposite direction in the 2 genotypes. Consistently, rs6265 methylation has a different association with schizophrenia risk in ValVals and ValMets. The relationships of methylation with BDNF levels and of genotype with BHLHB2 binding likely contribute to these opposite effects of methylation. We conclude that BDNF rs6265 methylation interacts with genotype to bridge early environmental exposures to adult phenotypes, relevant for schizophrenia. The study of epigenetic changes in regions containing genetic variation relevant for human diseases may have beneficial implications for the understanding of how genes are actually translated into phenotypes.

  12. Evaluation of a candidate breast cancer associated SNP in ERCC4 as a risk modifier in BRCA1 and BRCA2 mutation carriers. Results from the Consortium of Investigators of Modifiers of BRCA1/BRCA2 (CIMBA)

    NARCIS (Netherlands)

    Osorio, A.; Milne, R. L.; Pita, G.; Peterlongo, P.; Heikkinen, T.; Simard, J.; Chenevix-Trench, G.; Spurdle, A. B.; Beesley, J.; Chen, X.; Healey, S.; Neuhausen, S. L.; Ding, Y. C.; Couch, F. J.; Wang, X.; Lindor, N.; Manoukian, S.; Barile, M.; Viel, A.; Tizzoni, L.; Szabo, C. I.; Foretova, L.; Zikan, M.; Claes, K.; Greene, M. H.; Mai, P.; Rennert, G.; Lejbkowicz, F.; Barnett-Griness, O.; Andrulis, I. L.; Ozcelik, H.; Weerasooriya, N.; Gerdes, A.-M.; Thomassen, M.; Cruger, D. G.; Caligo, M. A.; Friedman, E.; Kaufman, B.; Laitman, Y.; Cohen, S.; Kontorovich, T.; Gershoni-Baruch, R.; Dagan, E.; Jernström, H.; Askmalm, M. S.; Arver, B.; Malmer, B.; Domchek, S. M.; Nathanson, K. L.; Brunet, J.; Ramón Y Cajal, T.; Yannoukakos, D.; Hamann, U.; Hogervorst, F. B. L.; Verhoef, S.; Gómez García, E. B.; Wijnen, J. T.; van den Ouweland, A.; Easton, D. F.; Peock, S.; Cook, M.; Oliver, C. T.; Frost, D.; Luccarini, C.; Evans, D. G.; Lalloo, F.; Eeles, R.; Pichert, G.; Cook, J.; Hodgson, S.; Morrison, P. J.; Douglas, F.; Godwin, A. K.; Sinilnikova, O. M.; Barjhoux, L.; Stoppa-Lyonnet, D.; Moncoutier, V.; Giraud, S.; Cassini, C.; Olivier-Faivre, L.; Révillion, F.; Peyrat, J.-P.; Muller, D.; Fricker, J.-P.; Lynch, H. T.; John, E. M.; Buys, S.; Daly, M.; Hopper, J. L.; Terry, M. B.; Miron, A.; Yassin, Y.; Goldgar, D.; Singer, C. F.; Gschwantler-Kaulich, D.; Pfeiler, G.; Spiess, A.-C.; Hansen, Thomas V. O.; Johannsson, O. T.; Kirchhoff, T.; Offit, K.; Kosarin, K.; Piedmonte, M.; Rodriguez, G. C.; Wakeley, K.; Boggess, J. F.; Basil, J.; Schwartz, P. E.; Blank, S. V.; Toland, A. E.; Montagna, M.; Casella, C.; Imyanitov, E. N.; Allavena, A.; Schmutzler, R. K.; Versmold, B.; Engel, C.; Meindl, A.; Ditsch, N.; Arnold, N.; Niederacher, D.; Deissler, H.; Fiebig, B.; Varon-Mateeva, R.; Schaefer, D.; Froster, U. G.; Caldes, T.; de la Hoya, M.; McGuffog, L.; Antoniou, A. 
C.; Nevanlinna, H.; Radice, P.; Benítez, J.; Simard, Jacques; Durocher, Francine; Laframboise, Rachel; Plante, Marie; Bridge, Peter; Parboosingh, Jilian; Chiquette, Jocelyne; Lesperance, Bernard; Karlsson, Per; Nordling, Margareta; Bergman, Annika; Einbeigi, Zakaria; Stenmark-Askmalm, Marie; Liedgren, Sigrun; Borg, Ake; Loman, Niklas; Olsson, Hakan; Kristoffersson, Ulf; Jernstrom, Helena; Harbst, Katja; Henriksson, Karin; Lindblom, Annika; Arver, Brita; von Wachenfeldt, Anna; Liljegren, Annelie; Barbany-Bustinza, Gisela; Rantala, Johanna; Malmer, Beatrice; Stattin, Eva-Lena; Emanuelsson, Monica; Ehrencrona, Hans; Brandell, Richard Rosenquist; Dahl, Niklas; Hogervorst, Frans; Verhoef, Senno; Pijpe, Anouk; van 't Veer, Laura; van Leeuwen, Flora; Rookus, Matti; Collée, Margriet; van den Ouweland, Ans; Kriege, Mieke; Schutte, Mieke; Hooning, Maartje; Seynaeve, Caroline; Tollenaar, Rob; van Asperen, Christi; Wijnen, Juul; Vreeswijk, Maaike; Devilee, Peter; Hoogerbrugge, Nicoline; Ligtenberg, Marjolijn; Ausems, Margreet; van der Luijt, Rob; Aalfs, Cora; van Os, Theo; Meijers-Heijboer, Hanne; Gille, Hans; Gomez-Garcia, Encarna; Blok, Rien; Peock, Susan; Cook, Margaret; Oliver, Clare; Frost, Debra; Miedzybrodzka, Zosia; Gregory, Helen; Morrison, Patrick; Cole, Trevor; McKeown, Carole; Taylor, Amy; Donaldson, Alan; Paterson, Joan; Murray, Alexandra; Rogers, Mark; McCann, Emma; Kennedy, John; Barton, David; Porteous, Mary; Brewer, Carole; Kivuva, Emma; Searle, Anne; Goodman, Selina; Davidson, Rosemarie; Murday, Murday; Bradshaw, Nicola; Snadden, Lesley; Longmuir, Mark; Watt, Catherine; Izatt, Louise; Pichert, Gabriella; Langman, Caroline; Dorkins, Huw; Barwell, Julian; Chu, Carol; Bishop, Tim; Miller, Julie; Ellis, Ian; Evans, D. 
Gareth; Lalloo, Fiona; Holt, Felicity; Male, Alison; Robinson, Anne; Gardiner, Carol; Douglas, Fiona; Claber, Oonagh; Walker, Lisa; Durell, Sarah; Eeles, Ros; Shanley, Susan; Rahman, Nazneen; Houlston, Richard; Bancrof, Elizabeth; D'Mello, Lucia; Page, Elizabeth; Ardern-Jones, Audrey; Mitra, Anita; Wiggins, Jennifer; Castro, Elena; Cook, Jackie; Quarrell, Oliver; Bardsley, Cathryn; Hodgson, Shirley; Goff, Sheila; Brice, Glen; Winchester, Lizzie; Eccles, Diana; Lucassen, Anneke; Crawford, Gillian; Tyler, Emma; McBride, Donna; Sinilnikova, Olga; Barjhoux, Laure; Giraud, Sophie; Léone, Mélanie; Mazoyer, Sylvie; Stoppa-Lyonnet, Dominique; Gauthier-Villars, Marion; Houdayer, Claude; Moncoutier, Virginie; Belotti, Muriel; de Pauw, Antoine; Bressac-de-Paillerets, Brigitte; Remenieras, Audrey; Byrde, Véronique; Caron, Olivier; Lenoir, Gilbert; Bignon, Yves-Jean; Uhrhammer, Nancy; Lasset, Christine; Bonadona, Valérie; Hardouin, Agnès; Berthet, Pascaline; Bourdon, Violaine; Eisinger, François; Coulet, Florence; Colas, Chrystelle; Soubrier, Florent; Coupier, Isabelle; Peyrat, Jean-Philippe; Fournier, Joëlle; Révillion, Françoise; Vennin, Philippe; Adenis, Claude; Rouleau, Etienne; Lidereau, Rosette; Demange, Liliane; Nogues, Catherine; Muller, Danièle; Fricker, Jean-Pierre; Longy, Michel; Sevenet, Nicolas; Toulas, Christine; Guimbaud, Rosine; Gladieff, Laurence; Feillel, Viviane; Leroux, Dominique; Dreyfus, Hélène; Rebischung, Christine; Cassini, Cécile; Olivier-Faivre, Laurence; Prieur, Fabienne; Ferrer, Sandra Fert; Frénay, Marc; Lynch, Henry T.

    2009-01-01

    In this study we aimed to evaluate the role of a SNP in intron 1 of the ERCC4 gene (rs744154), previously reported to be associated with a reduced risk of breast cancer in the general population, as a breast cancer risk modifier in BRCA1 and BRCA2 mutation carriers. We have genotyped rs744154 in

  13. Association of the serotonin transporter-linked polymorphic region genotype with lower bone mineral density.

    Science.gov (United States)

    Lapid, M I; Kung, S; Frye, M A; Biernacka, J M; Geske, J R; Drake, M T; Jankowski, M D; Clarke, B L

    2017-08-22

    The serotonin transporter-linked polymorphic region (5-HTTLPR) of the serotonin transporter gene (SLC6A4) S allele is linked to pathogenesis of depression and slower response to selective serotonin reuptake inhibitors (SSRIs); depression and SSRIs are independently associated with bone loss. We aimed to determine whether 5-HTTLPR was associated with bone loss. This cross-sectional study included psychiatric patients with both 5-HTTLPR analysis and bone mineral density (BMD) assessment (hip and spine Z-scores if age <50 years and T-scores if ⩾50 years). BMD association with 5-HTTLPR was evaluated under models with additive allele effects and dominant S allele effects using linear regression models. Patients were stratified by age (<50 and ⩾50 years) and sex. Of 3016 patients with 5-HTTLPR genotyping, 239 had BMD assessments. Among the younger patients, the S allele was associated with lower Z-scores at the hip (P=0.002, dominant S allele effects; P=0.004, additive allele effects) and spine (P=0.0006, dominant S allele effects; P=0.01, additive allele effects). In sex-stratified analyses, the association of the S allele with lower BMD in the younger patients was also significant in the subset of women (P⩽0.003 for both hip and spine BMD under the additive allele effect model). In the small group of men younger than 50 years, the S allele was marginally associated with higher spine BMD (P=0.05). BMD T-scores were not associated with 5-HTTLPR genotypes in patients 50 years or older. The 5-HTTLPR variants may modify serotonin effects on bone with sex-specific effects.
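    The difference between the additive and dominant S allele models mentioned above comes down to how each genotype is coded before the linear regression. The sketch below shows one conventional coding and a single-predictor least-squares slope; the function names are illustrative, the study's models may have included further covariates, and the numbers in the usage note are toy values, not study data.

```python
def code_genotype(genotype, model):
    """Code a 5-HTTLPR genotype such as "LL", "LS" or "SS" as a predictor.

    additive: number of S alleles (0, 1 or 2)
    dominant: 1 if at least one S allele is present, else 0
    """
    s_count = genotype.count("S")
    if model == "additive":
        return s_count
    if model == "dominant":
        return 1 if s_count > 0 else 0
    raise ValueError("model must be 'additive' or 'dominant'")

def ols_slope(x, y):
    """Least-squares slope of y on a single predictor x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx
```

    For instance, coding `["LL", "LS", "SS"]` additively gives `[0, 1, 2]`, and regressing toy Z-scores `[1.0, 0.5, 0.0]` on that predictor gives a slope of -0.5 per S allele; under the dominant coding, `LS` and `SS` carriers are pooled into one group.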

  14. Maternal fatty acid desaturase genotype correlates with infant immune responses at 6 months

    DEFF Research Database (Denmark)

    Muc, Magdalena; Kreiner-Møller, Eskil; Larsen, Jeppe Madura

    2015-01-01

    Breast milk long-chain PUFA (LCPUFA) have been associated with changes in early life immune responses and may modulate T-cell function in infancy. We studied the effect of maternal fatty acid desaturase (FADS) genotype and breast milk LCPUFA levels on infants' blood T-cell profiles and ex vivo-produced cytokines after anti-CD3/CD28 stimulation of peripheral blood mononuclear cells in 6-month-old infants from the Copenhagen Prospective Study of Asthma in Childhood birth cohort. LCPUFA concentrations of breast milk were assessed at 4 weeks of age, and FADS SNP were determined in both mothers and infants (n... and cytotoxic T-cells and decreased T-helper cell counts. The minor FADS alleles were associated with lower breast milk AA and EPA, and infants of mothers carrying the minor allele of FADS SNP rs174556 had higher production of IL-10 (r -0.23; P=0.018), IL-17 (r -0.25; P=0.009) and IL-5 (r -0.21; P=0.038) from...

  15. The LPL S447X cSNP is associated with decreased blood pressure and plasma triglycerides, and reduced risk of coronary artery disease

    NARCIS (Netherlands)

    Clee, S. M.; Loubser, O.; Collins, J.; Kastelein, J. J.; Hayden, M. R.

    2001-01-01

    Linkage of the lipoprotein lipase (LPL) gene to blood pressure levels has been reported. The LPL S447X single nucleotide polymorphism (cSNP) has been associated with decreased triglycerides (TG), increased high density lipoprotein cholesterol, and a decreased risk of coronary artery disease (CAD),

  16. Identification of two novel mammographic density loci at 6Q25.1.

    Science.gov (United States)

    Brand, Judith S; Li, Jingmei; Humphreys, Keith; Karlsson, Robert; Eriksson, Mikael; Ivansson, Emma; Hall, Per; Czene, Kamila

    2015-06-03

    Mammographic density (MD) is a strong heritable and intermediate phenotype for breast cancer, but much of its genetic variation remains unexplained. We performed a large-scale genetic association study including 8,419 women of European ancestry to identify MD loci. Participants of three Swedish studies were genotyped on a custom Illumina iSelect genotyping array and percent and absolute mammographic density were ascertained using semiautomated and fully automated methods from film and digital mammograms. Linear regression analysis was used to test for SNP-MD associations, adjusting for age, body mass index, menopausal status and six principal components. Meta-analyses were performed by combining P values taking sample size, study-specific inflation factor and direction of effect into account. Genome-wide significant associations were observed for two previously identified loci: ZNF365 (rs10995194, P = 2.3 × 10(-8) for percent MD and P = 8.7 × 10(-9) for absolute MD) and AREG (rs10034692, P = 6.7 × 10(-9) for absolute MD). In addition, we found evidence of association for two variants at 6q25.1, both of which are known breast cancer susceptibility loci: rs9485370 in the TAB2 gene (P = 4.8 × 10(-9) for percent MD and P = 2.5 × 10(-8) for absolute MD) and rs60705924 in the CCDC170/ESR1 region (P = 2.2 × 10(-8) for absolute MD). Both regions have been implicated in estrogen receptor signaling with TAB2 being a potential regulator of tamoxifen response. We identified two novel MD loci at 6q25.1. These findings underscore the importance of 6q25.1 as a susceptibility region and provide more insight into the mechanisms through which MD influences breast cancer risk.
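    The abstract describes a meta-analysis that combines P values while accounting for sample size, study-specific inflation factor and direction of effect. One common scheme matching that description is a sample-size-weighted Z-score combination; the sketch below assumes that scheme, and the function name `combine` and the input tuple layout are illustrative, not taken from the study.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()

def combine(studies):
    """Combine per-study results into one Z-score and two-sided P value.

    studies: list of (p_value, sample_size, direction, lambda_gc) tuples,
    where direction is +1 or -1 and lambda_gc is the study's genomic
    inflation factor (values below 1 are treated as 1).
    """
    numerator = 0.0
    weight_sq_sum = 0.0
    for p, n, direction, lambda_gc in studies:
        # Two-sided P value to a signed Z-score.
        z = _STD_NORMAL.inv_cdf(1.0 - p / 2.0) * direction
        # Deflate by the study-specific inflation factor.
        z /= sqrt(max(lambda_gc, 1.0))
        w = sqrt(n)  # weight by the square root of the sample size
        numerator += w * z
        weight_sq_sum += w * w
    z_meta = numerator / sqrt(weight_sq_sum)
    p_meta = 2.0 * _STD_NORMAL.cdf(-abs(z_meta))
    return z_meta, p_meta
```

    Two equally sized studies with the same P value and the same direction yield a smaller combined P value, while opposite directions cancel to a Z-score near zero.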

  17. Apolipoprotein E Genotype in Very Preterm Neonates with Intrauterine Growth Restriction: An Analysis of the German Neonatal Network Cohort.

    Science.gov (United States)

    Norda, Stephen; Rausch, Tanja K; Orlikowsky, Thorsten; Hütten, Matthias; Schulz, Sören; Göpel, Wolfgang; Pecks, Ulrich

    2017-01-01

    Aim. Cord blood of intrauterine growth restricted (IUGR) neonates displays lipid changes towards atherosclerotic profiles. Apolipoprotein E (ApoE) and its isoforms (e2, e3, and e4) are involved in the regulation of lipid metabolism. Specifically, ApoE e4 has been associated with atherosclerotic diseases, while e2 has a favorable effect. We therefore hypothesized that ApoE e4 haplotype is frequently observed in IUGR neonates and contributes to impaired fetal growth and the association of IUGR with cardiovascular and metabolic diseases later in life. Methods. A cohort of 4885 preterm infants (≥22+0 and 10th birth weight percentile. Analysis of the single nucleotides rs429358 and rs7412, identifying the ApoE genotype, was carried out using TaqMan® SNP genotyping assays. The proportional odds model was used to assess data. Results. No association was found between genotype and birth weight percentiles in each of the subgroups. Conclusion. ApoE genotype and low birth weight depict two distinct risk factors for cardiovascular disease without being directly associated.

  18. Apolipoprotein E Genotype in Very Preterm Neonates with Intrauterine Growth Restriction: An Analysis of the German Neonatal Network Cohort

    Directory of Open Access Journals (Sweden)

    Stephen Norda

    2017-01-01

    Full Text Available Aim. Cord blood of intrauterine growth restricted (IUGR) neonates displays lipid changes towards atherosclerotic profiles. Apolipoprotein E (ApoE) and its isoforms (e2, e3, and e4) are involved in the regulation of lipid metabolism. Specifically, ApoE e4 has been associated with atherosclerotic diseases, while e2 has a favorable effect. We therefore hypothesized that ApoE e4 haplotype is frequently observed in IUGR neonates and contributes to impaired fetal growth and the association of IUGR with cardiovascular and metabolic diseases later in life. Methods. A cohort of 4885 preterm infants (≥22+0 and 10th birth weight percentile. Analysis of the single nucleotides rs429358 and rs7412, identifying the ApoE genotype, was carried out using TaqMan® SNP genotyping assays. The proportional odds model was used to assess data. Results. No association was found between genotype and birth weight percentiles in each of the subgroups. Conclusion. ApoE genotype and low birth weight depict two distinct risk factors for cardiovascular disease without being directly associated.

  19. Genome-Wide SNP Discovery and Analysis of Genetic Diversity in Farmed Sika Deer (Cervus nippon) in Northeast China Using Double-Digest Restriction Site-Associated DNA Sequencing

    Directory of Open Access Journals (Sweden)

    Hengxing Ba

    2017-09-01

    Full Text Available Sika deer are an economically valuable species owing to their use in traditional Chinese medicine, particularly their velvet antlers. Sika deer in northeast China are mostly farmed in enclosure. Therefore, genetic management of farmed sika deer would benefit from detailed knowledge of their genetic diversity. In this study, we generated over 1.45 billion high-quality paired-end reads (288 Gbp) across 42 unrelated individuals using double-digest restriction site-associated DNA sequencing (ddRAD-seq). A total of 96,188 (29.63%) putative biallelic SNP loci were identified with an average sequencing depth of 23×. Based on the analysis, we found that the majority of the loci had a deficit of heterozygotes (FIS > 0) and low values of Hobs, which could be due to inbreeding and Wahlund effects. We also developed a collection of high-quality SNP probes that will likely be useful in a variety of applications in genotyping for cervid species in the future.

  20. Genome-Wide SNP Discovery and Analysis of Genetic Diversity in Farmed Sika Deer (Cervus nippon) in Northeast China Using Double-Digest Restriction Site-Associated DNA Sequencing.

    Science.gov (United States)

    Ba, Hengxing; Jia, Boyin; Wang, Guiwu; Yang, Yifeng; Kedem, Gilead; Li, Chunyi

    2017-09-07

    Sika deer are an economically valuable species owing to their use in traditional Chinese medicine, particularly their velvet antlers. Sika deer in northeast China are mostly farmed in enclosure. Therefore, genetic management of farmed sika deer would benefit from detailed knowledge of their genetic diversity. In this study, we generated over 1.45 billion high-quality paired-end reads (288 Gbp) across 42 unrelated individuals using double-digest restriction site-associated DNA sequencing (ddRAD-seq). A total of 96,188 (29.63%) putative biallelic SNP loci were identified with an average sequencing depth of 23×. Based on the analysis, we found that the majority of the loci had a deficit of heterozygotes (FIS > 0) and low values of Hobs, which could be due to inbreeding and Wahlund effects. We also developed a collection of high-quality SNP probes that will likely be useful in a variety of applications in genotyping for cervid species in the future. Copyright © 2017 Ba et al.
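
    The heterozygote deficit reported above (FIS > 0 with low observed heterozygosity) can be illustrated for a single biallelic locus. The sketch below uses the textbook definition FIS = 1 - Hobs/Hexp with the simple, uncorrected expected heterozygosity 2pq; the authors may well have used a bias-corrected estimator, and the function name `locus_fis` and the genotype counts in the usage note are invented for illustration.

```python
def locus_fis(n_aa, n_ab, n_bb):
    """Return (Hobs, Hexp, FIS) for one biallelic locus from genotype counts.

    Assumption: Hexp is the plain Hardy-Weinberg expectation 2pq, without
    a small-sample correction.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotyped individuals at this locus")

    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    h_obs = n_ab / n                  # observed heterozygosity
    h_exp = 2 * p * (1 - p)           # expected heterozygosity under HWE

    if h_exp == 0:
        # Monomorphic locus: FIS is undefined.
        return h_obs, h_exp, None
    return h_obs, h_exp, 1 - h_obs / h_exp
```

    For example, hypothetical counts of 20 AA, 10 AB and 12 BB among 42 individuals give fewer heterozygotes than expected and therefore a positive FIS, whereas counts in exact 1:2:1 proportion give FIS = 0.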

  1. Imputing amino acid polymorphisms in human leukocyte antigens.

    Directory of Open Access Journals (Sweden)

    Xiaoming Jia

    Full Text Available DNA sequence variation within human leukocyte antigen (HLA) genes mediate susceptibility to a wide range of human diseases. The complex genetic structure of the major histocompatibility complex (MHC) makes it difficult, however, to collect genotyping data in large cohorts. Long-range linkage disequilibrium between HLA loci and SNP markers across the major histocompatibility complex (MHC) region offers an alternative approach through imputation to interrogate HLA variation in existing GWAS data sets. Here we describe a computational strategy, SNP2HLA, to impute classical alleles and amino acid polymorphisms at class I (HLA-A, -B, -C) and class II (-DPA1, -DPB1, -DQA1, -DQB1, and -DRB1) loci. To characterize performance of SNP2HLA, we constructed two European ancestry reference panels, one based on data collected in HapMap-CEPH pedigrees (90 individuals) and another based on data collected by the Type 1 Diabetes Genetics Consortium (T1DGC, 5,225 individuals). We imputed HLA alleles in an independent data set from the British 1958 Birth Cohort (N = 918) with gold standard four-digit HLA types and SNPs genotyped using the Affymetrix GeneChip 500 K and Illumina Immunochip microarrays. We demonstrate that the sample size of the reference panel, rather than SNP density of the genotyping platform, is critical to achieve high imputation accuracy. Using the larger T1DGC reference panel, the average accuracy at four-digit resolution is 94.7% using the low-density Affymetrix GeneChip 500 K, and 96.7% using the high-density Illumina Immunochip. For amino acid polymorphisms within HLA genes, we achieve 98.6% and 99.3% accuracy using the Affymetrix GeneChip 500 K and Illumina Immunochip, respectively. Finally, we demonstrate how imputation and association testing at amino acid resolution can facilitate fine-mapping of primary MHC association signals, giving a specific example from type 1 diabetes.

  2. Multiplexed SNP Typing of Ancient DNA Clarifies the Origin of Andaman mtDNA Haplogroups amongst South Asian Tribal Populations

    Science.gov (United States)

    Endicott, Phillip; Metspalu, Mait; Stringer, Chris; Macaulay, Vincent; Cooper, Alan; Sanchez, Juan J.

    2006-01-01

    The issue of errors in genetic data sets is of growing concern, particularly in population genetics where whole genome mtDNA sequence data is coming under increased scrutiny. Multiplexed PCR reactions, combined with SNP typing, are currently under-exploited in this context, but have the potential to genotype whole populations rapidly and accurately, significantly reducing the amount of errors appearing in published data sets. To show the sensitivity of this technique for screening mtDNA genomic sequence data, 20 historic samples of the enigmatic Andaman Islanders and 12 modern samples from three Indian tribal populations (Chenchu, Lambadi and Lodha) were genotyped for 20 coding region sites after provisional haplogroup assignment with control region sequences. The genotype data from the historic samples significantly revise the topologies for the Andaman M31 and M32 mtDNA lineages by rectifying conflicts in published data sets. The new Indian data extend the distribution of the M31a lineage to South Asia, challenging previous interpretations of mtDNA phylogeography. This genetic connection between the ancestors of the Andamanese and South Asian tribal groups ∼30 kya has important implications for the debate concerning migration routes and settlement patterns of humans leaving Africa during the late Pleistocene, and indicates the need for more detailed genotyping strategies. The methodology serves as a low-cost, high-throughput model for the production and authentication of data from modern or ancient DNA, and demonstrates the value of museum collections as important records of human genetic diversity. PMID:17218991

  3. Thermochemical and physical evaluation of poplar genotypes as short rotation forestry crops for energy use

    International Nuclear Information System (INIS)

    Monedero, Esperanza; Hernández, Juan José; Cañellas, Isabel; Otero, Jose María; Sixto, Hortensia

    2016-01-01

    Highlights: • The combustion properties of poplar genotypes harvested from three different sites were analyzed. • Both the genotype and the locations affect the expected combustion behavior. • Among the poplar genotypes, ‘Monviso’ and ‘Viriato’ are expected to have better combustion behavior. • No operating problems derived from fouling/corrosion are expected for any of the genotypes. - Abstract: Short rotation plantations of fast-growing species provide a promising way to produce heat and electricity from renewable sources. The thermo-chemical and physical properties of different genotypes of poplar in short rotation forestry crops grown at three locations with different climatic and edaphic characteristics as well as planting density, have been determined in order to characterize the most appropriate biomass in terms of energy potential. The planting density was 6666 or 13,333 trees/ha (depending on the location) in a rotation of three-four years and the analysis was carried out at the end of the first rotation. For all the genotypes, experimental tests to quantify the moisture content, particle size distribution, bulk density, heating value, ash content and composition as well as the volatile matter were performed. In addition, natural air drying of biomass (stem and branches) was studied in two locations with the aim of determining the humidity loss during raw storage. A significant effect of the genotype and the planting density on the biomass properties was observed. The results obtained indicate that ‘Monviso’ and ‘Viriato’ are the most suitable genotypes. No operational problems related to ash fouling and deposition in combustion devices are expected for any of the genotypes studied.

  4. snpTree - a web-server to identify and construct SNP trees from whole genome sequence data

    DEFF Research Database (Denmark)

    Leekitcharoenphon, Pimlapas; Kaas, Rolf Sommer; Thomsen, Martin Christen Frølund

    2012-01-01

    identify SNPs and construct phylogenetic trees from WGS as well as from assembled genomes or contigs. WGS data in fastq format are aligned to reference genomes by BWA while contigs in fasta format are processed by Nucmer. SNPs are concatenated based on position on reference genome and a tree is constructed...... to differentiate and classify isolates. One of the successfully and broadly used methods is analysis of single nucleotide polymorphisms (SNPs). Currently, there are different tools and methods to identify SNPs including various options and cut-off values. Furthermore, all current methods require bioinformatic...... skills. Thus, we lack a standard and simple automatic tool to determine SNPs and construct phylogenetic tree from WGS data. Results Here we introduce snpTree, a server for online-automatic SNPs analysis. This tool is composed of different SNPs analysis suites, perl and python scripts. snpTree can...

  5. Genotyping of 75 SNPs using arrays for individual identification in five population groups.

    Science.gov (United States)

    Hwa, Hsiao-Lin; Wu, Lawrence Shih Hsin; Lin, Chun-Yen; Huang, Tsun-Ying; Yin, Hsiang-I; Tseng, Li-Hui; Lee, James Chun-I

    2016-01-01

    Single nucleotide polymorphism (SNP) typing offers promise to forensic genetics. Various strategies and panels for analyzing SNP markers for individual identification have been published. However, the best panels with fewer identity SNPs for all major population groups are still under discussion. This study aimed to find more autosomal SNPs with high heterozygosity for individual identification among Asian populations. Ninety-six autosomal SNPs of 502 DNA samples from unrelated individuals of five population groups (208 Taiwanese Han, 83 Filipinos, 62 Thais, 69 Indonesians, and 80 individuals with European, Near Eastern, or South Asian ancestry) were analyzed using arrays in an initial screening, and 75 SNPs (group A, 46 newly selected SNPs; group B, 29 SNPs based on a previous SNP panel) were selected for further statistical analyses. Some SNPs with high heterozygosity from Asian populations were identified. The combined random match probability of the best 40 and 45 SNPs was between 3.16 × 10⁻¹⁷ and 7.75 × 10⁻¹⁷ and between 2.33 × 10⁻¹⁹ and 7.00 × 10⁻¹⁹, respectively, in all five populations. These loci offer comparable power to short tandem repeats (STRs) for routine forensic profiling. In this study, we demonstrated the population genetic characteristics and forensic parameters of 75 SNPs with high heterozygosity from five population groups. This SNP panel can provide valuable genotypic information and can be helpful in forensic casework for individual identification among these populations.
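
    The combined random match probability quoted above is conventionally obtained by multiplying per-locus match probabilities across independent markers. The sketch below shows that calculation for biallelic SNPs under two stated assumptions: Hardy-Weinberg genotype proportions at each SNP and no linkage disequilibrium between SNPs. The function names and the allele frequencies in the usage note are hypothetical and are not taken from the study.

```python
def snp_match_probability(p):
    """Probability that two unrelated individuals share a genotype at one SNP.

    p is the frequency of one allele; genotype frequencies are assumed to
    follow Hardy-Weinberg proportions (p^2, 2pq, q^2).
    """
    q = 1.0 - p
    genotype_freqs = (p * p, 2.0 * p * q, q * q)
    # Two individuals match if both carry the same genotype.
    return sum(g * g for g in genotype_freqs)


def combined_match_probability(allele_freqs):
    """Product of per-SNP match probabilities (assumes independent SNPs)."""
    combined = 1.0
    for p in allele_freqs:
        combined *= snp_match_probability(p)
    return combined
```

    A SNP with allele frequency 0.5 has a per-locus match probability of 0.375, so 40 such idealized, independent SNPs would give a combined value of roughly 9 × 10⁻¹⁸. This is why panels favour SNPs with high heterozygosity.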

  6. Genotyping by sequencing (GBS), an ultimate marker-assisted selection (MAS) tool to accelerate plant breeding

    Directory of Open Access Journals (Sweden)

    Jiangfeng He

    2014-09-01

    Full Text Available Marker-assisted selection (MAS) refers to the use of molecular markers to assist phenotypic selections in crop improvement. Several types of molecular markers, such as single nucleotide polymorphism (SNP), have been identified and effectively used in plant breeding. The application of next-generation sequencing (NGS) technologies has led to remarkable advances in whole genome sequencing, which provides ultra-throughput sequences to revolutionize plant genotyping and breeding. To further broaden NGS usages to large crop genomes such as maize and wheat, genotyping by sequencing (GBS) has been developed and applied in sequencing multiplexed samples that combine molecular marker discovery and genotyping. GBS is a novel application of NGS protocols for discovering and genotyping SNPs in crop genomes and populations. The GBS approach includes the digestion of genomic DNA with restriction enzymes followed by the ligation of barcode adapter, PCR amplification and sequencing of the amplified DNA pool on a single lane of flow cells. Bioinformatic pipelines are needed to analyze and interpret GBS datasets. As an ultimate MAS tool and a cost-effective technique, GBS has been successfully used in implementing genome-wide association study (GWAS), genomic diversity study, genetic linkage analysis, molecular marker discovery and genomic selection (GS) under a large scale of plant breeding programs.

  7. Genotyping-by-sequencing (GBS), an ultimate marker-assisted selection (MAS) tool to accelerate plant breeding.

    Science.gov (United States)

    He, Jiangfeng; Zhao, Xiaoqing; Laroche, André; Lu, Zhen-Xiang; Liu, HongKui; Li, Ziqin

    2014-01-01

    Marker-assisted selection (MAS) refers to the use of molecular markers to assist phenotypic selections in crop improvement. Several types of molecular markers, such as single nucleotide polymorphism (SNP), have been identified and effectively used in plant breeding. The application of next-generation sequencing (NGS) technologies has led to remarkable advances in whole genome sequencing, which provides ultra-throughput sequences to revolutionize plant genotyping and breeding. To further broaden NGS usages to large crop genomes such as maize and wheat, genotyping-by-sequencing (GBS) has been developed and applied in sequencing multiplexed samples that combine molecular marker discovery and genotyping. GBS is a novel application of NGS protocols for discovering and genotyping SNPs in crop genomes and populations. The GBS approach includes the digestion of genomic DNA with restriction enzymes followed by the ligation of barcode adapter, PCR amplification and sequencing of the amplified DNA pool on a single lane of flow cells. Bioinformatic pipelines are needed to analyze and interpret GBS datasets. As an ultimate MAS tool and a cost-effective technique, GBS has been successfully used in implementing genome-wide association study (GWAS), genomic diversity study, genetic linkage analysis, molecular marker discovery and genomic selection under a large scale of plant breeding programs.

  8. Cross-Sectional and Longitudinal Effects of CREB1 Genotypes on Individual Differences in Memory and Executive Function: Findings from the BLSA

    Directory of Open Access Journals (Sweden)

    Claudia Wolf

    2017-05-01

    Full Text Available Purpose: Previously, we have shown that the SNP rs10932201 genotype of the cyclic AMP responsive element binding protein 1 gene (CREB1) contributes to individual differences in executive and memory function at the neural system and behavioral levels in healthy, young adults. However, longitudinal effects of CREB1 genotypes on cognition have not yet been addressed. Furthermore, we were interested in replicating associations between CREB1 genotypes and human cognition in previous cross-sectional studies and explore whether APOE4 status might modify these relations. Materials and Methods: We investigated whether common, independent tag SNPs within CREB1 (rs2253206, rs10932201, rs6785) influence individual differences in age-related longitudinal change and level of executive function and memory performance independent of baseline age, sex, APOE4 status, and education. Our analysis included data from cognitively unimpaired older adults participating in the Baltimore Longitudinal Study of Aging. Eleven measures from six cognitive tests (sample sizes range 617–786) were analyzed using linear mixed effects and generalized estimating equations models. Mean baseline age ranged from 50 to 69 years and mean time of follow-up (interval) ranged from 8 to 22 years. Results: We found significant effects of all three CREB1 SNPs on performance level and/or longitudinal change in performance based on eight measures assessing semantic memory, episodic memory, or both executive function and semantic memory. SNP rs10932201 showed the most significant and largest effect (Cohen’s d = -0.70, p < 0.01) on age-related longitudinal decline of semantic memory. Additionally, we show interactions between all three CREB1 SNPs and APOE4 status on age-related longitudinal declines and levels of memory and executive function. Conclusion: Our results suggest that CREB1 genotypes independently and by interactions with APOE4 status contribute to individual differences in cognitive aging.

  9. SNP-SNP interaction analysis of NF-κB signaling pathway on breast cancer survival

    DEFF Research Database (Denmark)

    Jamshidi, Maral; Fagerholm, Rainer; Khan, Sofia

    2015-01-01

    of SNP pairs without and with an interaction term. We found two interacting pairs associating with prognosis: patients simultaneously homozygous for the rare alleles of rs5996080 and rs7973914 had worse survival (HRinteraction 6.98, 95% CI=3.3-14.4, P=1.42E-07), and patients carrying at least one rare...

  10. Assessment of inbreeding depression in Nellore cows (Bos indicus) through high-density SNP genotypes

    Science.gov (United States)

    Inbreeding has been incriminated as a cause of decrease in reproductive performance in cattle. This negative correlation is known as ‘inbreeding depression’, and evidence supporting this hypothesis was generated from association studies between reproductive traits and estimates of inbreeding coeffic...

  11. Genome-wide SNP scan of pooled DNA reveals nonsense mutation in FGF20 in the scaleless line of featherless chickens

    Directory of Open Access Journals (Sweden)

    Wells Kirsty L

    2012-06-01

    Full Text Available Background: Scaleless (sc/sc) chickens carry a single recessive mutation that causes a lack of almost all body feathers, as well as foot scales and spurs, due to a failure of skin patterning during embryogenesis. This spontaneous mutant line, first described in the 1950s, has been used extensively to explore the tissue interactions involved in ectodermal appendage formation in embryonic skin. Moreover, the trait is potentially useful in tropical agriculture due to the ability of featherless chickens to tolerate heat, which is at present a major constraint to efficient poultry meat production in hot climates. In the interests of enhancing our understanding of feather placode development, and to provide the poultry industry with a strategy to breed heat-tolerant meat-type chickens (broilers), we mapped and identified the sc mutation. Results: Through a cost-effective and labour-efficient SNP array mapping approach using DNA from sc/sc and sc/+ blood sample pools, we map the sc trait to chromosome 4 and show that a nonsense mutation in FGF20 is completely associated with the sc/sc phenotype. This mutation, common to all sc/sc individuals and absent from wild type, is predicted to lead to loss of a highly conserved region of the FGF20 protein important for FGF signalling. In situ hybridisation and quantitative RT-PCR studies reveal that FGF20 is epidermally expressed during the early stages of feather placode patterning. In addition, we describe a dCAPS genotyping assay based on the mutation, developed to facilitate discrimination between wild type and sc alleles. Conclusions: This work represents the first loss of function genetic evidence supporting a role for FGF ligand signalling in feather development, and suggests FGF20 as a novel central player in the development of vertebrate skin appendages, including hair follicles and exocrine glands.
In addition, this is to our knowledge the first report describing the use of the chicken SNP array to

  12. Interest in genomic SNP testing for prostate cancer risk: a pilot survey.

    Science.gov (United States)

    Hall, Michael J; Ruth, Karen J; Chen, David Yt; Gross, Laura M; Giri, Veda N

    2015-01-01

    Advancements in genomic testing have led to the identification of single nucleotide polymorphisms (SNPs) associated with prostate cancer. The clinical utility of SNP tests to evaluate prostate cancer risk is unclear. Studies have not examined predictors of interest in novel genomic SNP tests for prostate cancer risk in a diverse population. Consecutive participants in the Fox Chase Prostate Cancer Risk Assessment Program (PRAP) (n = 40) and unselected men from surgical urology clinics (n = 40) completed a one-time survey. Items examined interest in genomic SNP testing for prostate cancer risk, knowledge, impact of unsolicited findings, and psychosocial factors including health literacy. Knowledge of genomic SNP tests was low in both groups, but interest was higher among PRAP men (p testing in both groups. Multivariable modeling identified several predictors of higher interest in a genomic SNP test including higher perceived risk (p = 0.025), indicating zero reasons for not wanting testing (vs ≥1 reason) (p = 0.013), and higher health literacy (p = 0.016). Knowledge of genomic SNP testing was low in this sample, but higher among high-risk men. High-risk status may increase interest in novel genomic tests, while low literacy may lessen interest.

  13. Genotyping Rs2274625 Marker in NPHS2 Gene Associated with Nephrotic Syndrome in Isfahan Population

    Directory of Open Access Journals (Sweden)

    L Esmaili Chamgordani

    2015-12-01

    Full Text Available Introduction: Nephrotic syndrome (NS) is a genetic disease belonging to a heterogeneous group of glomerular disorders, which mainly occurs in children. Linkage analysis using single nucleotide polymorphisms (SNPs) is used as an indirect method in molecular diagnosis of the disease. A large number of SNP markers have been introduced in the NPHS2 gene in the available electronic databases. Method: In the present study, the genotype and informative status of the rs2274625 marker in the NPHS2 gene was investigated in 120 unrelated healthy individuals using the Tetra-primer ARMS PCR technique and newly designed primers. Allelic frequency and presence of Hardy-Weinberg Equilibrium (HWE) were estimated using the GenePop website. Furthermore, PowerMarker software was utilized in order to compute the index of polymorphism information content (PIC). Results: The study results indicated allele frequencies of 97% and 3% for the C and T alleles, respectively, for the rs2274625 marker within the Isfahan population. Moreover, the PIC for the rs2274625 marker was 0.5%, and HWE testing revealed that the study population was in equilibrium with regard to this marker. Conclusion: As the study findings indicated, rs2274625 could be introduced as an SNP marker in linkage analysis in order to molecularly trace NPHS2 gene mutations in molecular NS diagnosis in the Isfahan population as a representative sample of the Iranian population.

  14. Genome rearrangements detected by SNP microarrays in individuals with intellectual disability referred with possible Williams syndrome.

    Directory of Open Access Journals (Sweden)

    Ariel M Pani

    2010-08-01

    Full Text Available Intellectual disability (ID) affects 2-3% of the population and may occur with or without multiple congenital anomalies (MCA) or other medical conditions. Established genetic syndromes and visible chromosome abnormalities account for a substantial percentage of ID diagnoses, although for approximately 50% the molecular etiology is unknown. Individuals with features suggestive of various syndromes but lacking their associated genetic anomalies pose a formidable clinical challenge. With the advent of microarray techniques, submicroscopic genome alterations not associated with known syndromes are emerging as a significant cause of ID and MCA. High-density SNP microarrays were used to determine genome wide copy number in 42 individuals: 7 with confirmed alterations in the WS region but atypical clinical phenotypes, 31 with ID and/or MCA, and 4 controls. One individual from the first group had the most telomeric gene in the WS critical region deleted along with 2 Mb of flanking sequence. A second person had the classic WS deletion and a rearrangement on chromosome 5p within the Cri du Chat syndrome (OMIM:123450) region. Six individuals from the ID/MCA group had large rearrangements (3 deletions, 3 duplications), one of whom had a large inversion associated with a deletion that was not detected by the SNP arrays. Combining SNP microarray analyses and qPCR allowed us to clone and sequence 21 deletion breakpoints in individuals with atypical deletions in the WS region and/or ID or MCA. Comparison of these breakpoints to databases of genomic variation revealed that 52% occurred in regions harboring structural variants in the general population. For two probands the genomic alterations were flanked by segmental duplications, which frequently mediate recurrent genome rearrangements; these may represent new genomic disorders. While SNP arrays and related technologies can identify potentially pathogenic deletions and duplications, obtaining sequence information

  15. Nuclear Species-Diagnostic SNP Markers Mined from 454 Amplicon Sequencing Reveal Admixture Genomic Structure of Modern Citrus Varieties

    Science.gov (United States)

    Curk, Franck; Ancillo, Gema; Ollitrault, Frédérique; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Garcia-Lor, Andres; Navarro, Luis; Ollitrault, Patrick

    2015-01-01

    Most cultivated Citrus species originated from interspecific hybridisation between four ancestral taxa (C. reticulata, C. maxima, C. medica, and C. micrantha) with limited further interspecific recombination due to vegetative propagation. This evolution resulted in admixture genomes with frequent interspecific heterozygosity. Moreover, a major part of the phenotypic diversity of edible citrus results from the initial differentiation between these taxa. Deciphering the phylogenomic structure of citrus germplasm is therefore essential for an efficient utilization of citrus biodiversity in breeding schemes. The objective of this work was to develop a set of species-diagnostic single nucleotide polymorphism (SNP) markers for the four Citrus ancestral taxa covering the nine chromosomes, and to use these markers to infer the phylogenomic structure of secondary species and modern cultivars. Species-diagnostic SNPs were mined from 454 amplicon sequencing of 57 gene fragments from 26 genotypes of the four basic taxa. Of the 1,053 SNPs mined from 28,507 kb sequence, 273 were found to be highly diagnostic for a single basic taxon. Species-diagnostic SNP markers (105) were used to analyse the admixture structure of varieties and rootstocks. This revealed C. maxima introgressions in most of the old and in all recent selections of mandarins, and suggested that C. reticulata × C. maxima reticulation and introgression processes were important in edible mandarin domestication. The large range of phylogenomic constitutions between C. reticulata and C. maxima revealed in mandarins, tangelos, tangors, sweet oranges, sour oranges, grapefruits, and orangelos is favourable for genetic association studies based on phylogenomic structures of the germplasm. Inferred admixture structures were in agreement with previous hypotheses regarding the origin of several secondary species and also revealed the probable origin of several acid citrus varieties. The developed species-diagnostic SNP

  16. Construction of a high-density DArTseq SNP-based genetic map and identification of genomic regions with segregation distortion in a genetic population derived from a cross between feral and cultivated-type watermelon.

    Science.gov (United States)

    Ren, Runsheng; Ray, Rumiana; Li, Pingfang; Xu, Jinhua; Zhang, Man; Liu, Guang; Yao, Xiefeng; Kilian, Andrzej; Yang, Xingping

    2015-08-01

    Watermelon [Citrullus lanatus (Thunb.) Matsum. & Nakai] is an economically important vegetable crop grown extensively worldwide. To facilitate the identification of agronomically important traits and provide new information for genetic and genomic research on this species, a high-density genetic linkage map of watermelon was constructed using an F2 population derived from a cross between elite watermelon cultivar K3 and wild watermelon germplasm PI 189225. Based on a sliding window approach, a total of 1,161 bin markers representing 3,465 SNP markers were mapped onto 11 linkage groups corresponding to the chromosome pair number of watermelon. The total length of the genetic map is 1,099.2 cM, with an average distance between bins of 1.0 cM. The number of markers in each chromosome varies from 62 in chromosome 07 to 160 in chromosome 05. The length of individual chromosomes ranged between 61.8 cM for chromosome 07 and 140.2 cM for chromosome 05. A total of 616 SNP bin markers showed significant (P watermelon cultivar K3 allele and 103 were skewed toward PI 189225. The number of SNPs and InDels per Mb varied considerably across the segregation distorted regions (SDRs) on each chromosome, and a mixture of dense and sparse SNPs and InDel SDRs coexisted on some chromosomes suggesting that SDRs were randomly distributed throughout the genome. Recombination rates varied greatly among each chromosome, from 2.0 to 4.2 centimorgans per megabase (cM/Mb). An inconsistency was found between the genetic and physical positions on the map for a segment on chromosome 11. The high-density genetic map described in the present study will facilitate fine mapping of quantitative trait loci, the identification of candidate genes, map-based cloning, as well as marker-assisted selection (MAS) in watermelon breeding programs.
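
    Segregation distortion in an F2 population, as discussed above, is commonly assessed per marker with a chi-square goodness-of-fit test against the expected Mendelian ratio. The sketch below assumes codominant markers with an expected 1:2:1 ratio (parent A homozygote : heterozygote : parent B homozygote); the abstract does not state the test or threshold the authors used, and the function name `f2_segregation_chi2` is hypothetical.

```python
from math import exp


def f2_segregation_chi2(n_a, n_h, n_b):
    """Chi-square test of 1:2:1 segregation for one codominant F2 marker.

    n_a, n_h, n_b : counts of parent-A homozygotes, heterozygotes and
                    parent-B homozygotes.
    Returns (chi2, p_value) with 2 degrees of freedom.
    """
    n = n_a + n_h + n_b
    if n == 0:
        raise ValueError("no genotyped individuals for this marker")

    observed = (n_a, n_h, n_b)
    expected = (n / 4.0, n / 2.0, n / 4.0)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))

    # For 2 degrees of freedom the chi-square survival function
    # has the closed form exp(-chi2 / 2).
    p_value = exp(-chi2 / 2.0)
    return chi2, p_value
```

    A marker with hypothetical counts of 40, 50 and 10 among 100 F2 plants is skewed toward the parent A allele and yields a chi-square of 18, while counts of 25, 50 and 25 fit the 1:2:1 expectation exactly.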

  17. Re-Ranking Sequencing Variants in the Post-GWAS Era for Accurate Causal Variant Identification

    Science.gov (United States)

    Faye, Laura L.; Machiela, Mitchell J.; Kraft, Peter; Bull, Shelley B.; Sun, Lei

    2013-01-01

    Next generation sequencing has dramatically increased our ability to localize disease-causing variants by providing base-pair level information at costs increasingly feasible for the large sample sizes required to detect complex-trait associations. Yet, identification of causal variants within an established region of association remains a challenge. Counter-intuitively, certain factors that increase power to detect an associated region can decrease power to localize the causal variant. First, combining GWAS with imputation or low coverage sequencing to achieve the large sample sizes required for high power can have the unintended effect of producing differential genotyping error among SNPs. This tends to bias the relative evidence for association toward better genotyped SNPs. Second, re-use of GWAS data for fine-mapping exploits previous findings to ensure genome-wide significance in GWAS-associated regions. However, using GWAS findings to inform fine-mapping analysis can bias evidence away from the causal SNP toward the tag SNP and SNPs in high LD with the tag. Together these factors can reduce power to localize the causal SNP by more than half. Other strategies commonly employed to increase power to detect association, namely increasing sample size and using higher density genotyping arrays, can, in certain common scenarios, actually exacerbate these effects and further decrease power to localize causal variants. We develop a re-ranking procedure that accounts for these adverse effects and substantially improves the accuracy of causal SNP identification, often doubling the probability that the causal SNP is top-ranked. Application to the NCI BPC3 aggressive prostate cancer GWAS with imputation meta-analysis identified a new top SNP at 2 of 3 associated loci and several additional possible causal SNPs at these loci that may have otherwise been overlooked. This method is simple to implement using R scripts provided on the author's website. PMID:23950724

  18. High-throughput bacterial SNP typing identifies distinct clusters of Salmonella Typhi causing typhoid in Nepalese children

    LENUS (Irish Health Repository)

    Holt, Kathryn E

    2010-05-31

    Background: Salmonella Typhi (S. Typhi) causes typhoid fever, which remains an important public health issue in many developing countries. Kathmandu, the capital of Nepal, is an area of high incidence and the pediatric population appears to be at high risk of exposure and infection. Methods: We recently defined the population structure of S. Typhi, using new sequencing technologies to identify nearly 2,000 single nucleotide polymorphisms (SNPs) that can be used as unequivocal phylogenetic markers. Here we have used the GoldenGate (Illumina) platform to simultaneously type 1,500 of these SNPs in 62 S. Typhi isolates causing severe typhoid in children admitted to Patan Hospital in Kathmandu. Results: Eight distinct S. Typhi haplotypes were identified during the 20-month study period, with 68% of isolates belonging to a subclone of the previously defined H58 S. Typhi. This subclone was closely associated with resistance to nalidixic acid, with all isolates from this group demonstrating a resistant phenotype and harbouring the same resistance-associated SNP in GyrA (Phe83). A secondary clone, comprising 19% of isolates, was observed only during the second half of the study. Conclusions: Our data demonstrate the utility of SNP typing for monitoring bacterial populations over a defined period in a single endemic setting. We provide evidence for genotype introduction and define a nalidixic acid resistant subclone of S. Typhi, which appears to be the dominant cause of severe pediatric typhoid in Kathmandu during the study period.

  19. Combinations of genetic variants associated with bipolar disorder

    DEFF Research Database (Denmark)

    Mellerup, Erling; Andreassen, Ole A; Bennike, Bente

    2017-01-01

    The main objective of the study was to find genetic variants that in combination are significantly associated with bipolar disorder. In previous studies of bipolar disorder, combinations of three and four single nucleotide polymorphisms (SNP) genotypes taken from 803 SNPs were analyzed, and five...... clusters of combinations were found to be significantly associated with bipolar disorder. In the present study, combinations of ten SNP genotypes taken from the same 803 SNPs were analyzed, and one cluster of combinations was found to be significantly associated with bipolar disorder. Combinations from......, heterozygote or variant homozygote. In the combinations containing 10 SNP genotypes almost all the genotypes were the normal homozygote. Such a finding may indicate that accumulation in the genome of combinations containing few SNP genotypes may be a risk factor for bipolar disorder when those combinations...

  20. Density Stress has Minimal Impacts on the Barley or Maize Seedling Transcriptome

    Directory of Open Access Journals (Sweden)

    Summer St. Pierre

    2011-03-01

    Full Text Available High planting density affects the morphology and productivity of many crop species. Our objectives were to examine the phenotypic and transcriptomic changes that occur during plant density stress in barley (Hordeum vulgare L.) and maize (Zea mays L.) seedlings. In maize and barley seedlings, density stress impacted several morphological traits. Gene expression profiles were examined in four barley and five maize genotypes grown at low and high plant densities. Only 221 barley and 35 maize genes exhibited differential expression in response to plant density stress. The majority of the gene expression changes were observed in a subset of the genotypes and reflected minor changes in the level of expression, indicating that the plant density stress imposed in this study did not result in major changes in gene expression. Also, little overlap was observed within barley or maize genotypes in gene expression during density stress, indicating that genotypic differences play a major role in the response to density stress. While it is clear that gene expression differences are involved in morphological changes induced by high plant densities, it is likely that many of these gene expression differences are subtle and restricted to particular tissues and developmental time.

  1. Robust Demographic Inference from Genomic and SNP Data

    Science.gov (United States)

    Excoffier, Laurent; Dupanloup, Isabelle; Huerta-Sánchez, Emilia; Sousa, Vitor C.; Foll, Matthieu

    2013-01-01

    We introduce a flexible and robust simulation-based framework to infer demographic parameters from the site frequency spectrum (SFS) computed on large genomic datasets. We show that our composite-likelihood approach allows one to study evolutionary models of arbitrary complexity, which cannot be tackled by other current likelihood-based methods. For simple scenarios, our approach compares favorably in terms of accuracy and speed with ∂a∂i, the current reference in the field, while showing better convergence properties for complex models. We first apply our methodology to non-coding genomic SNP data from four human populations. To infer their demographic history, we compare neutral evolutionary models of increasing complexity, including unsampled populations. We further show the versatility of our framework by extending it to the inference of demographic parameters from SNP chips with known ascertainment, such as that recently released by Affymetrix to study human origins. Whereas previous ways of handling ascertained SNPs were either restricted to a single population or only allowed the inference of divergence time between a pair of populations, our framework can correctly infer parameters of more complex models including the divergence of several populations, bottlenecks and migration. We apply this approach to the reconstruction of African demography using two distinct ascertained human SNP panels studied under two evolutionary models. The two SNP panels lead to globally very similar estimates and confidence intervals, and suggest an ancient divergence (>110 Ky) between Yoruba and San populations. Our methodology appears well suited to the study of complex scenarios from large genomic data sets. PMID:24204310
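
    The site frequency spectrum on which this framework operates can be illustrated with a minimal Python sketch. The 0/1 haplotype encoding, the toy data and the function name `site_frequency_spectrum` are assumptions made for illustration; they are not taken from the authors' software.

```python
from collections import Counter


def site_frequency_spectrum(haplotypes):
    """Unfolded SFS: entry i counts the sites where exactly i haplotypes
    carry the derived allele.

    haplotypes: list of equal-length lists of 0 (ancestral) / 1 (derived).
    This encoding is an assumption of the sketch.
    """
    n_haplotypes = len(haplotypes)
    n_sites = len(haplotypes[0])
    # Derived-allele count at each site, tallied over sites.
    counts = Counter(sum(h[s] for h in haplotypes) for s in range(n_sites))
    return [counts.get(i, 0) for i in range(n_haplotypes + 1)]


# Toy data: 4 haplotypes, 5 sites (hypothetical values).
haps = [
    [0, 1, 0, 1, 1],
    [0, 1, 0, 0, 1],
    [0, 0, 1, 0, 1],
    [0, 0, 0, 0, 1],
]
sfs = site_frequency_spectrum(haps)
```

    Entries 0 and n of the returned list correspond to monomorphic sites; whether they are kept depends on the analysis.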

  2. The frequency of genotypes for the SNP Ser/Ser in the studied population of Albanian women is higher in the Balkan region

    Directory of Open Access Journals (Sweden)

    Zafer Gashi

    2016-08-01

    Full Text Available In women undergoing natural cycles, just one oocyte is usually selected for ovulation, yet routine clinical techniques that support the development of multiple follicles using additional gonadotrophins result in numerous ovulations. Several parameters have been postulated as predictors of ovarian response (inhibin B, 17-β-estradiol and anti-Müllerian hormone). Nevertheless, the FSH level on day 3 of the menstrual cycle remains the most widely used biomarker because of its low cost, although the genetic background of individuals seems to determine the response of patients to rFSH stimulation better than the stimulation design. Consequently, variants of FSHR were explored, as they may be involved in FSH receptor-mediated signal transduction and in the ovarian response of infertile women undergoing ovarian stimulation. In this study we examined, for the first time, the prevalence of the Asn680Ser genotype variants in a population of Albanian women from the Dukagjin region of Kosovo who took part in an IVF/ICSI program. The frequencies of the Asn680Ser genotype variants were as follows: Asn/Asn 22.1%, Asn/Ser 47.1%, and Ser/Ser 30.8% (Table 1). bE2 levels showed a slight but statistically significant difference between the three genotype variants (p = 0.0308). No difference was found between the genotype groups in terms of AFC, amount of FSH required for ovulation induction, stimulation length in days, number of dominant follicles, oocyte retrieval number or endometrial thickness (Table 2). BMI was significantly higher in the Ser/Ser group than in the Asn/Ser or the Asn/Asn group (p = 0.0010) (Table 2). In the study population of Albanian women from the Dukagjin region of Kosovo, the Ser/Ser genotype had a higher incidence than the Asn/Asn genotype. Our results in the Albanian population differ from published data for other ethnic groups in the Balkans.
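
    The reported genotype frequencies imply allele frequencies that can be worked out directly. The short Python sketch below does this and also lists the genotype proportions expected under Hardy-Weinberg equilibrium; the function name is hypothetical, and because the abstract gives no sample size, no significance test is attempted.

```python
def allele_frequencies(f_aa, f_ab, f_bb):
    """Allele frequencies from genotype frequencies (any common scale).

    Each heterozygote contributes half of its frequency to each allele.
    """
    total = f_aa + f_ab + f_bb
    p = (f_aa + f_ab / 2) / total
    return p, 1 - p


# Genotype percentages as reported in the abstract.
p_asn, q_ser = allele_frequencies(22.1, 47.1, 30.8)

# Proportions expected under Hardy-Weinberg equilibrium,
# in the order Asn/Asn, Asn/Ser, Ser/Ser.
expected = (p_asn ** 2, 2 * p_asn * q_ser, q_ser ** 2)
```

    Under these figures the Asn allele frequency is about 0.46 and the Ser allele frequency about 0.54.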

  3. Genome-wide SNP discovery in tetraploid alfalfa using 454 sequencing and high resolution melting analysis

    Directory of Open Access Journals (Sweden)

    Zhao Patrick X

    2011-07-01

    Full Text Available Background: Single nucleotide polymorphisms (SNPs) are the most common type of sequence variation among plants and are often functionally important. We describe the use of 454 technology and high resolution melting analysis (HRM) for high throughput SNP discovery in tetraploid alfalfa (Medicago sativa L.), a species with high economic value but limited genomic resources. Results: The alfalfa genotypes selected from M. sativa subsp. sativa var. 'Chilean' and M. sativa subsp. falcata var. 'Wisfal', which differ in water stress sensitivity, were used to prepare cDNA from tissue of clonally-propagated plants grown under either well-watered or water-stressed conditions, and then pooled for 454 sequencing. Based on 125.2 Mb of raw sequence, a total of 54,216 unique sequences were obtained including 24,144 tentative consensus (TC) sequences and 30,072 singletons, ranging from 100 bp to 6,662 bp in length, with an average length of 541 bp. We identified 40,661 candidate SNPs distributed throughout the genome. A sample of candidate SNPs was evaluated and validated using high resolution melting (HRM) analysis. A total of 3,491 TCs harboring 20,270 candidate SNPs were located on the M. truncatula (MT 3.5.1) chromosomes. Gene Ontology assignments indicate that sequences obtained cover a broad range of GO categories. Conclusions: We describe an efficient method to identify thousands of SNPs distributed throughout the alfalfa genome covering a broad range of GO categories. Validated SNPs represent valuable molecular marker resources that can be used to enhance marker density in linkage maps, identify potential factors involved in heterosis and genetic variation, and as tools for association mapping and genomic selection in alfalfa.

  4. Genotyping of Mycobacterium leprae for better understanding of leprosy transmission in Fortaleza, Northeastern Brazil.

    Science.gov (United States)

    Fontes, Amanda N B; Lima, Luana N G C; Mota, Rosa M S; Almeida, Rosa L F; Pontes, Maria A; Gonçalves, Heitor de S; Frota, Cristiane C; Vissa, Varalakshmi D; Brennan, Patrick J; Guimaraes, Ricardo J P S; Kendall, Carl; Kerr, Ligia R F S; Suffys, Philip N

    2017-12-01

    Leprosy is endemic in a large part of Brazil, with 28,761 new patients in 2015, the second largest number worldwide; incidence reaches 9/10,000 in highly endemic regions and 2.7/10,000 in the city of Fortaleza, Ceará, Northeast Brazil. For a better understanding of risk factors for leprosy transmission, we conducted an epidemiologic study supplemented by 17-locus VNTR and SNP 1-4 typing of Mycobacterium leprae in skin biopsy samples from new multibacillary (MB) patients diagnosed at a reference center in 2009 and 2010. Among the 1,519 new patients detected during the study period, 998 (65.7%) were MB and we performed DNA extraction and genotyping on 160 skin biopsy samples, resulting in 159 (16%) good multilocus VNTR types. Thirty-eight of these patients also provided VNTR types from M. leprae in nasal swabs. The SNP type was obtained for 157 patients and 87% were of type 4. Upon consideration of all VNTR markers, 156 different genotypes and three pairs with identical genotypes were observed; no epidemiologic relation could be observed between individuals in these pairs. Considerable variability in differentiating index (DI) was observed between the different markers, and the four with the highest DI [(AT)15, (TA)18, (AT)17 and (GAA)21] frequently demonstrated differences in copy number when comparing genotypes from both types of samples. Excluding these markers from analysis resulted in 83 genotypes, 20 of which included 96 of the patients (60.3%). These clusters were composed of two (n = 8), three (n = 6), four (n = 1), five (n = 2), six (n = 1), 19 (n = 1) and 23 (n = 1) individuals and suggest that recent transmission is contributing to the maintenance of leprosy in Fortaleza. When comparing epidemiological and clinical variables among patients within clustered or with unique M. leprae genotypes, a positive bacterial index in skin biopsies and knowledge of working with someone with the disease were significantly associated with clustering. A tendency to belong to a cluster was
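
    The cluster tally in this abstract can be checked with a few lines of Python. This is only a sanity check on the reported figures; it assumes that each of the two largest clusters (19 and 23 members) occurs once, which is the reading under which the sizes add up to the 20 clusters and 96 patients stated in the abstract.

```python
# Cluster size -> number of clusters of that size.
# Assumption: the 19- and 23-member clusters each occur exactly once.
cluster_counts = {2: 8, 3: 6, 4: 1, 5: 2, 6: 1, 19: 1, 23: 1}

# Number of clustered genotypes and number of patients they cover.
n_clusters = sum(cluster_counts.values())
n_patients = sum(size * count for size, count in cluster_counts.items())
```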

  5. Heterogeneous computing architecture for fast detection of SNP-SNP interactions.

    Science.gov (United States)

    Sluga, Davor; Curk, Tomaz; Zupan, Blaz; Lotric, Uros

    2014-06-25

    The extent of data in a typical genome-wide association study (GWAS) poses considerable computational challenges to software tools for gene-gene interaction discovery. Exhaustive evaluation of all interactions among hundreds of thousands to millions of single nucleotide polymorphisms (SNPs) may require weeks or even months of computation. Massively parallel hardware within a modern Graphic Processing Unit (GPU) and Many Integrated Core (MIC) coprocessors can shorten the run time considerably. While the utility of GPU-based implementations in bioinformatics has been well studied, MIC architecture has been introduced only recently and may provide a number of comparative advantages that have yet to be explored and tested. We have developed a heterogeneous, GPU and Intel MIC-accelerated software module for SNP-SNP interaction discovery to replace the previously single-threaded computational core in the interactive web-based data exploration program SNPsyn. We report on differences between these two modern massively parallel architectures and their software environments. Their use resulted in execution times an order of magnitude shorter than those of the single-threaded CPU implementation. GPU implementation on a single Nvidia Tesla K20 runs twice as fast as that for the MIC architecture-based Xeon Phi P5110 coprocessor, but also requires considerably more programming effort. General purpose GPUs are a mature platform with large amounts of computing power capable of tackling inherently parallel problems, but can prove demanding for the programmer. On the other hand, the new MIC architecture, albeit lacking in performance, reduces the programming effort and makes up for it with a more general architecture suitable for a wider range of problems.
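
    The exhaustive pairwise evaluation that makes this problem expensive can be sketched in plain, single-threaded Python. The scoring function below (an information-theoretic interaction gain) is an assumption chosen for illustration, since the abstract does not specify the score; all names and the toy data are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import log2


def entropy(values):
    """Shannon entropy (bits) of a list of hashable values."""
    n = len(values)
    return -sum(c / n * log2(c / n) for c in Counter(values).values())


def mutual_information(x, y):
    return entropy(x) + entropy(y) - entropy(list(zip(x, y)))


def interaction_gain(a, b, phenotype):
    """Information the SNP pair carries about the phenotype beyond
    what each SNP carries on its own (assumed score, see lead-in)."""
    joint = list(zip(a, b))
    return (mutual_information(joint, phenotype)
            - mutual_information(a, phenotype)
            - mutual_information(b, phenotype))


def score_all_pairs(genotypes, phenotype):
    """Exhaustive scan: n SNPs give n * (n - 1) / 2 pairs to score."""
    return {
        (i, j): interaction_gain(genotypes[i], genotypes[j], phenotype)
        for i, j in combinations(sorted(genotypes), 2)
    }


# Toy data (hypothetical): the phenotype is the XOR of snp1 and snp2,
# so neither SNP is informative alone but the pair is.
genotypes = {
    "snp1": [0, 0, 1, 1, 0, 0, 1, 1],
    "snp2": [0, 1, 0, 1, 0, 1, 0, 1],
    "snp3": [0, 0, 0, 0, 1, 1, 1, 1],
}
phenotype = [a ^ b for a, b in zip(genotypes["snp1"], genotypes["snp2"])]
scores = score_all_pairs(genotypes, phenotype)
best_pair = max(scores, key=scores.get)
```

    Each pair is scored independently of every other pair, which is what makes the scan a natural fit for the massively parallel hardware discussed in the abstract.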

  6. Interference of Homologous Sequences on the SNP Study of CYP2A13 Gene

    Directory of Open Access Journals (Sweden)

    Qinghua ZHOU

    2010-02-01

    Full Text Available Background and objective: It has been proven that cytochrome P450 enzyme 2A13 (CYP2A13) plays an important role in the association between single nucleotide polymorphisms (SNPs) and human diseases. Cytochrome P450 enzymes are a group of isoenzymes whose sequence homology may interfere with SNP studies. The aim of this study is to explore the interference of homologous sequences with the SNP study of CYP2A13. Methods: A TaqMan probe was applied to detect the distribution of the rs8192789 site in 573 subjects, and BLAST was used to analyze the amplified sequences. Partial sequences of CYP2A13 were amplified by PCR from 60 cases. The amplified sequences were TA cloned and sequenced. Results: At the rs8192789 locus in 573 cases, only 3 cases were TT, while the rest were CT heterozygotes, a result caused by homologous sequences. There were a large number of overlapping peaks in the identical sequences of the 60 cases, and the SNP at amino acid site 101 reported in the SNP database was not found. The cloned sequences were 247 bp and 235 bp fragments. Conclusion: Homologous sequences may interfere with SNP studies of CYP2A13, and some reported SNPs may not exist.

  7. Associations of Polymorphisms in MTHFR Gene with the Risk of Age-Related Cataract in Chinese Han Population: A Genotype-Phenotype Analysis.

    Directory of Open Access Journals (Sweden)

    Xue-bin Wang

    Full Text Available Homocysteine (Hcy) is a potential risk factor for age-related cataract (ARC). Methylenetetrahydrofolate reductase (MTHFR) is the key enzyme for Hcy metabolism, and variants of MTHFR may affect MTHFR enzyme activity. This study mainly evaluated the associations between variants in MTHFR gene, plasma MTHFR enzyme activity, total Hcy (tHcy) levels and ARC risk in Chinese population. Four single nucleotide polymorphisms (SNPs) in MTHFR gene were genotyped using the high-resolution melting (HRM) method in 502 ARC patients (mean age, 70.2 [SD, 9.0], 46.0% male) and 890 healthy controls (mean age, 67.1 [SD, 11.1], 47.6% male). The plasma MTHFR activity, folic acid (FA), vitamins B12 and B6 levels were detected by enzyme-linked immunosorbent assays (ELISA). The plasma tHcy levels were measured by an automated enzymatic assay. After the Bonferroni correction, the minor allele T of SNP rs1801133 showed a significant association with an increased risk of overall ARC (OR = 1.26, P = 0.003). Consistent association was also found between SNP rs1801133 and cortical ARC risk (OR = 1.44, P = 0.003). Haplotype analyses revealed an adverse effect of the haplotype "C-A-T-C" (alleles in order of SNPs rs3737967, rs1801131, rs1801133 and rs9651118) on ARC risk (OR = 1.55, P = 0.003). Moreover, in a joint analysis of SNPs rs9651118 and rs1801133, subjects with two unfavorable genotypes had a 1.76-fold increased risk of ARC compared with the reference group, and a statistically significant dose-response trend (Ptrend = 0.001) was also observed. Further, in healthy controls and patients with cortical ARC, the allele T of SNP rs1801133 and the increasing number of unfavorable genotypes were significantly correlated with decreased MTHFR activity as well as increased tHcy levels. However, there was no significant association between FA, vitamins B12, B6 levels and MTHFR variants.
Our data indicated that variants in MTHFR gene might individually and jointly influence susceptibility to ARC

  8. Plant genotypic diversity reduces the rate of consumer resource utilization.

    Science.gov (United States)

    McArt, Scott H; Thaler, Jennifer S

    2013-07-07

    While plant species diversity can reduce herbivore densities and herbivory, little is known regarding how plant genotypic diversity alters resource utilization by herbivores. Here, we show that an invasive folivore--the Japanese beetle (Popillia japonica)--increases 28 per cent in abundance, but consumes 24 per cent less foliage in genotypic polycultures compared with monocultures of the common evening primrose (Oenothera biennis). We found strong complementarity for reduced herbivore damage among plant genotypes growing in polycultures and a weak dominance effect of particularly resistant genotypes. Sequential feeding by P. japonica on different genotypes from polycultures resulted in reduced consumption compared with feeding on different plants of the same genotype from monocultures. Thus, diet mixing among plant genotypes reduced herbivore consumption efficiency. Despite positive complementarity driving an increase in fruit production in polycultures, we observed a trade-off between complementarity for increased plant productivity and resistance to herbivory, suggesting costs in the complementary use of resources by plant genotypes may manifest across trophic levels. These results elucidate mechanisms for how plant genotypic diversity simultaneously alters resource utilization by both producers and consumers, and show that population genotypic diversity can increase the resistance of a native plant to an invasive herbivore.

  9. Two combinatorial optimization problems for SNP discovery using base-specific cleavage and mass spectrometry.

    Science.gov (United States)

    Chen, Xin; Wu, Qiong; Sun, Ruimin; Zhang, Louxin

    2012-01-01

    The discovery of single-nucleotide polymorphisms (SNPs) has important implications in a variety of genetic studies on human diseases and biological functions. One valuable approach proposed for SNP discovery is based on base-specific cleavage and mass spectrometry. However, it is still very challenging to achieve the full potential of this SNP discovery approach. In this study, we formulate two new combinatorial optimization problems. While both problems are aimed at reconstructing the sample sequence that would attain the minimum number of SNPs, they search over different candidate sequence spaces. The first problem, denoted as SNP - MSP, limits its search to sequences whose in silico predicted mass spectra have all their signals contained in the measured mass spectra. In contrast, the second problem, denoted as SNP - MSQ, limits its search to sequences whose in silico predicted mass spectra instead contain all the signals of the measured mass spectra. We present an exact dynamic programming algorithm for solving the SNP - MSP problem and also show that the SNP - MSQ problem is NP-hard by a reduction from a restricted variation of the 3-partition problem. We believe that an efficient solution to either problem above could offer a seamless integration of information in four complementary base-specific cleavage reactions, thereby improving the capability of the underlying biotechnology for sensitive and accurate SNP discovery.
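
    The difference between the two search spaces can be made concrete with a small Python sketch. It is a simplified model rather than the authors' algorithm: cleavage is assumed to cut after every occurrence of the chosen base, and each fragment's base composition stands in for its mass signal (an assumption of this sketch, relying on the fact that fragments with the same composition have the same mass). All function names are hypothetical.

```python
from collections import Counter


def cleave(sequence, base):
    """Cut after every occurrence of `base`; return the fragments."""
    fragments, current = [], ""
    for nucleotide in sequence:
        current += nucleotide
        if nucleotide == base:
            fragments.append(current)
            current = ""
    if current:
        fragments.append(current)
    return fragments


def predicted_signals(sequence, base):
    """Stand-in for an in silico mass spectrum: the set of fragment
    base compositions (assumption: composition substitutes for mass)."""
    return {tuple(sorted(Counter(f).items())) for f in cleave(sequence, base)}


def fits_msp(candidate, measured, base):
    """SNP-MSP style condition: every predicted signal is measured."""
    return predicted_signals(candidate, base) <= measured


def fits_msq(candidate, measured, base):
    """SNP-MSQ style condition: every measured signal is predicted."""
    return predicted_signals(candidate, base) >= measured


# Toy data (hypothetical): the sample differs from the reference by one base.
reference = "ACGTACGT"
sample = "ACGTAAGT"
measured = predicted_signals(sample, "T")
```

    With these toy sequences the reference satisfies the containment used by SNP-MSP but not the one used by SNP-MSQ, while the true sample sequence satisfies both.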

  10. Snap: an integrated SNP annotation platform

    DEFF Research Database (Denmark)

    Li, Shengting; Ma, Lijia; Li, Heng

    2007-01-01

    Snap (Single Nucleotide Polymorphism Annotation Platform) is a server designed to comprehensively analyze single genes and relationships between genes based on SNPs in the human genome. The aim of the platform is to facilitate the study of SNP finding and analysis within the framework of medical...

  11. APOE genotype-function relationship: evidence of -491 A/T promoter polymorphism modifying transcription control but not type 2 diabetes risk.

    Directory of Open Access Journals (Sweden)

    Hua Geng

    Full Text Available BACKGROUND: The apolipoprotein E gene (APOE) coding polymorphism modifies the risks of Alzheimer's disease, type 2 diabetes, and coronary heart disease. Aside from the coding variants, single nucleotide polymorphism (SNP) of the APOE promoter has also been shown to modify the risk of Alzheimer's disease. METHODOLOGY/PRINCIPAL FINDINGS: In this study we investigate the genotype-function relationship of APOE promoter polymorphism at molecular level and at physiological level: i.e., in transcription control of the gene and in the risk of type 2 diabetes. In molecular studies, the effect of the APOE -491A/T (rs449647) polymorphism on gene transcription was assessed by dual-luciferase reporter gene assays. The -491 A to T substitution decreased the activity (p<0.05) of the cloned APOE promoter (-1017 to +406). Using the -501 to -481 nucleotide sequence of the APOE promoter as a 'bait' to screen the human brain cDNA library by yeast one-hybrid system yielded ATF4, an endoplasmic reticulum stress response gene, as one of the interacting factors. Electrophoretic-mobility-shift assays (EMSA) and chromatin immuno-precipitation (ChIP) analyses further substantiated the physical interaction between ATF4 and the APOE promoter. Over-expression of ATF4 stimulated APOE expression whereas siRNA against ATF4 suppressed the expression of the gene. However, interaction between APOE promoter and ATF4 was not -491A/T-specific. At physiological level, the genotype-function relationship of APOE promoter polymorphism was studied in type 2 diabetes. In 630 cases and 595 controls, three APOE promoter SNPs -491A/T, -219G/T (rs405509), and +113G/C (rs440446) were genotyped and tested for association with type 2 diabetes in Hong Kong Chinese. No SNP or haplotype association with type 2 diabetes was detected. CONCLUSIONS/SIGNIFICANCE: At molecular level, polymorphism -491A/T and ATF4 elicit independent control of APOE gene expression. At physiological level, no genotype

  12. Application of genotyping-by-sequencing on semiconductor sequencing platforms: a comparison of genetic and reference-based marker ordering in barley.

    Directory of Open Access Journals (Sweden)

    Martin Mascher

    Full Text Available The rapid development of next-generation sequencing platforms has enabled the use of sequencing for routine genotyping across a range of genetics studies and breeding applications. Genotyping-by-sequencing (GBS), a low-cost, reduced representation sequencing method, is becoming a common approach for whole-genome marker profiling in many species. With quickly developing sequencing technologies, adapting current GBS methodologies to new platforms will leverage these advancements for future studies. To test new semiconductor sequencing platforms for GBS, we genotyped a barley recombinant inbred line (RIL) population. Based on a previous GBS approach, we designed bar code and adapter sets for the Ion Torrent platforms. Four sets of 24-plex libraries were constructed consisting of 94 RILs and the two parents and sequenced on two Ion platforms. In parallel, a 96-plex library of the same RILs was sequenced on the Illumina HiSeq 2000. We applied two different computational pipelines to analyze sequencing data: the reference-independent TASSEL pipeline and a reference-based pipeline using SAMtools. Sequence contigs positioned on the integrated physical and genetic map were used for read mapping and variant calling. We found high agreement in genotype calls between the different platforms and high concordance between genetic and reference-based marker order. There was, however, a paucity of SNPs that were jointly discovered by the different pipelines, indicating a strong effect of alignment and filtering parameters on SNP discovery. We show the utility of the current barley genome assembly as a framework for developing very low-cost genetic maps, facilitating high resolution genetic mapping and negating the need for developing de novo genetic maps for future studies in barley.
Through demonstration of GBS on semiconductor sequencing platforms, we conclude that the GBS approach is amenable to a range of platforms and can easily be modified as new

  13. Alternative SNP detection platforms, HRM and biosensors, for varietal identification in Vitis vinifera L. using F3H and LDOX genes.

    Science.gov (United States)

    Gomes, Sónia; Castro, Cláudia; Barrias, Sara; Pereira, Leonor; Jorge, Pedro; Fernandes, José R; Martins-Lopes, Paula

    2018-04-11

    The wine sector requires quick and reliable methods for Vitis vinifera L. varietal identification. The number of V. vinifera varieties is estimated at about 5,000 worldwide. Single Nucleotide Polymorphisms (SNPs) represent the most basic and abundant form of genetic sequence variation, being adequate for varietal discrimination. The aim of this work was to develop DNA-based assays suitable to detect SNP variation in V. vinifera, allowing varietal discrimination. Genotyping by sequencing allowed the detection of eleven SNPs on two genes of the anthocyanin pathway, the flavanone 3-hydroxylase (F3H, EC: 1.14.11.9), and the leucoanthocyanidin dioxygenase (LDOX, EC 1.14.11.19; synonym anthocyanidin synthase, ANS) in twenty V. vinifera varieties. Three High Resolution Melting (HRM) assays were designed based on the sequencing information, discriminating five of the 20 varieties: Alicante Bouschet, Donzelinho Tinto, Merlot, Moscatel Galego and Tinta Roriz. Sanger sequencing of the HRM assay products confirmed the HRM profiles. Three probes, with different lengths and sequences, were used as bio-recognition elements in an optical biosensor platform based on a long period grating (LPG) fiber optic sensor. The label free platform detected a difference of a single SNP using genomic DNA samples. The two different platforms were successfully applied for grapevine varietal identification.

  14. Leaf Transcriptome Sequencing for Identifying Genic-SSR Markers and SNP Heterozygosity in Crossbred Mango Variety 'Amrapali' (Mangifera indica L.).

    Science.gov (United States)

    Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar

    2016-01-01

    Mango (Mangifera indica L.) is called the "king of fruits" due to its sweetness, richness of taste, diversity, large production volume and variety of end uses. Despite its huge economic importance, genomic resources in mango are scarce and the genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties 'Neelam', 'Dashehari' and their hybrid 'Amrapali' using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. In total, 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected, of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides a useful genomic resource for genetic improvement of mango.
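
    SSR identification of the kind described here is often done by scanning sequences for short tandem repeats. The Python sketch below is a minimal regex-based illustration, not the authors' pipeline: the motif length range (2 to 6 bp) and the minimum of four repeats are assumptions of this sketch, while the 20 bp cut-off follows the "type I" definition quoted in the abstract. The example sequence is made up.

```python
import re

# A 2-6 bp motif followed by at least 3 further copies of itself
# (motif range and repeat minimum are assumptions of this sketch).
SSR_PATTERN = re.compile(r"([ACGT]{2,6}?)\1{3,}")

TYPE_I_MIN_LENGTH = 20  # bp, per the "type I SSR (n >= 20 bp)" definition


def find_ssrs(sequence):
    """Return (start, motif, repeat_count, total_length) for each perfect SSR."""
    hits = []
    for match in SSR_PATTERN.finditer(sequence):
        motif = match.group(1)
        length = len(match.group(0))
        hits.append((match.start(), motif, length // len(motif), length))
    return hits


# Hypothetical transcript fragment with an (AG)11 and a (GAT)4 repeat.
transcript = "TTGC" + "AG" * 11 + "CCTA" + "GAT" * 4 + "TC"
ssrs = find_ssrs(transcript)
type_i = [hit for hit in ssrs if hit[3] >= TYPE_I_MIN_LENGTH]
```

    Only the 22 bp dinucleotide repeat passes the type I length threshold in this example; the 12 bp trinucleotide repeat is detected but falls below it.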

  15. Quantification of within-sample genetic heterogeneity from SNP-array data

    DEFF Research Database (Denmark)

    Martinez, Pierre; Kimberley, Christopher; Birkbak, Nicolai Juul

    2017-01-01

    Intra-tumour genetic heterogeneity (ITH) fosters drug resistance and is a critical hurdle to clinical treatment. ITH can be well-measured using multi-region sampling but this is costly and challenging to implement. There is therefore a need for tools to estimate ITH in individual samples, using...... standard genomic data such as SNP-arrays, that could be implemented routinely. We designed two novel scores S and R, respectively based on the Shannon diversity index and Ripley's L statistic of spatial homogeneity, to quantify ITH in single SNP-array samples. We created in-silico and in-vitro mixtures...... sequencing data but heterogeneity in the fraction of tumour cells present across samples hampered accurate quantification. The prognostic potential of both scores was moderate but significantly predictive of survival in several tumour types (corrected p = 0.03). Our work thus shows how individual SNP...
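
The score S mentioned above is described as being based on the Shannon diversity index, but its exact definition is elided in this record. As a reminder of the underlying quantity only, the following minimal sketch computes the standard Shannon index H = -Σ p_i ln p_i over a list of category labels. The function name and the idea of using per-segment state labels as input are hypothetical; this is not the authors' score.

```python
import math
from collections import Counter


def shannon_index(labels):
    """Standard Shannon diversity index (natural log) of a list of labels."""
    counts = Counter(labels)
    total = sum(counts.values())
    # Each category contributes -p * ln(p), where p is its relative frequency.
    return -sum((c / total) * math.log(c / total) for c in counts.values())


if __name__ == "__main__":
    # Hypothetical per-segment states from a single sample.
    homogeneous = ["gain"] * 8
    mixed = ["gain", "loss", "neutral", "gain", "loss", "neutral"]
    print(shannon_index(homogeneous), shannon_index(mixed))
```

A sample in which every segment shares one state gives an index of zero, and the index grows as the states become more evenly mixed.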

  16. Imputation of genotypes in Danish two-way crossbred pigs using low density panels

    DEFF Research Database (Denmark)

    Xiang, Tao; Christensen, Ole Fredslund; Legarra, Andres

    Genotype imputation is commonly used as an initial step of genomic selection. Studies on humans, plants and ruminants suggested many factors would affect the performance of imputation. However, studies rarely investigated pigs, especially crossbred pigs. In this study, different scenarios...... of imputation from 5K SNPs to 7K SNPs on Danish Landrace, Yorkshire, and crossbred Landrace-Yorkshire were compared. In conclusion, genotype imputation on crossbreds performs equally well as in purebreds, when parental breeds are used as the reference panel. When the size of reference is considerably large...... SNPs. This dataset will be analyzed for genomic selection in a future study...

  17. Genotyping by Sequencing in Almond: SNP Discovery, Linkage Mapping, and Marker Design

    Directory of Open Access Journals (Sweden)

    Shashi N. Goonetilleke

    2018-01-01

    In crop plant genetics, linkage maps provide the basis for the mapping of loci that affect important traits and for the selection of markers to be applied in crop improvement. In outcrossing species such as almond (Prunus dulcis (Mill.) D. A. Webb), application of a double pseudotestcross mapping approach to the F1 progeny of a biparental cross leads to the construction of a linkage map for each parent. Here, we report on the application of genotyping by sequencing to discover and map single nucleotide polymorphisms in the almond cultivars “Nonpareil” and “Lauranne.” Allele-specific marker assays were developed for 309 tag pairs. Application of these assays to 231 Nonpareil × Lauranne F1 progeny provided robust linkage maps for each parent. Analysis of phenotypic data for shell hardness demonstrated the utility of these maps for quantitative trait locus mapping. Comparison of these maps to the peach genome assembly confirmed high synteny and collinearity between the peach and almond genomes. The marker assays were applied to progeny from several other Nonpareil crosses, providing the basis for a composite linkage map of Nonpareil. Applications of the assays to a panel of almond clones and a panel of rootstocks used for almond production demonstrated the broad applicability of the markers and provided subsets of markers that could be used to discriminate among accessions. The sequence-based linkage maps and single nucleotide polymorphism assays presented here could be useful resources for the genetic analysis and genetic improvement of almond.

  18. A function accounting for training set size and marker density to model the average accuracy of genomic prediction.

    Science.gov (United States)

    Erbe, Malena; Gredler, Birgit; Seefried, Franz Reinhold; Bapst, Beat; Simianer, Henner

    2013-01-01

    Prediction of genomic breeding values is of major practical relevance in dairy cattle breeding. Deterministic equations have been suggested to predict the accuracy of genomic breeding values in a given design which are based on training set size, reliability of phenotypes, and the number of independent chromosome segments ([Formula: see text]). The aim of our study was to find a general deterministic equation for the average accuracy of genomic breeding values that also accounts for marker density and can be fitted empirically. Two data sets of 5'698 Holstein Friesian bulls genotyped with 50 K SNPs and 1'332 Brown Swiss bulls genotyped with 50 K SNPs and imputed to ∼600 K SNPs were available. Different k-fold (k = 2-10, 15, 20) cross-validation scenarios (50 replicates, random assignment) were performed using a genomic BLUP approach. A maximum likelihood approach was used to estimate the parameters of different prediction equations. The highest likelihood was obtained when using a modified form of the deterministic equation of Daetwyler et al. (2010), augmented by a weighting factor (w) based on the assumption that the maximum achievable accuracy is [Formula: see text]. The proportion of genetic variance captured by the complete SNP sets ([Formula: see text]) was 0.76 to 0.82 for Holstein Friesian and 0.72 to 0.75 for Brown Swiss. When modifying the number of SNPs, w was found to be proportional to the log of the marker density up to a limit which is population and trait specific and was found to be reached with ∼20'000 SNPs in the Brown Swiss population studied.
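
The formulas in the abstract above are lost to "[Formula: see text]" placeholders. As a hedged illustration, the deterministic equation of Daetwyler et al. is commonly written as r = sqrt(N·r² / (N·r² + Me)), where N is the training set size, r² the reliability of the phenotypes and Me the number of independent chromosome segments. The sketch below evaluates that form. Placing the weighting factor w as a multiplier is an assumption made here for illustration, since the record does not show the modified equation, and all parameter values in the usage example are hypothetical.

```python
import math


def expected_accuracy(n_train, reliability, m_e, w=1.0):
    """Expected accuracy r = w * sqrt(N*r2 / (N*r2 + Me)).

    n_train     -- training set size N
    reliability -- reliability of the phenotypes (r2), between 0 and 1
    m_e         -- number of independent chromosome segments Me
    w           -- weighting factor; multiplicative placement is an assumption
    """
    if n_train < 0 or m_e <= 0 or not 0.0 <= reliability <= 1.0:
        raise ValueError("invalid design parameters")
    signal = n_train * reliability
    return w * math.sqrt(signal / (signal + m_e))


if __name__ == "__main__":
    # Hypothetical design values, not estimates from the study.
    for n in (500, 1000, 5000):
        print(n, round(expected_accuracy(n, reliability=0.8, m_e=1000, w=0.9), 3))
```

Under this form the accuracy rises with training set size but can never exceed w, which is how a factor below 1 would cap the maximum achievable accuracy.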

  19. A function accounting for training set size and marker density to model the average accuracy of genomic prediction.

    Directory of Open Access Journals (Sweden)

    Malena Erbe

    Prediction of genomic breeding values is of major practical relevance in dairy cattle breeding. Deterministic equations have been suggested to predict the accuracy of genomic breeding values in a given design which are based on training set size, reliability of phenotypes, and the number of independent chromosome segments ([Formula: see text]). The aim of our study was to find a general deterministic equation for the average accuracy of genomic breeding values that also accounts for marker density and can be fitted empirically. Two data sets of 5'698 Holstein Friesian bulls genotyped with 50 K SNPs and 1'332 Brown Swiss bulls genotyped with 50 K SNPs and imputed to ∼600 K SNPs were available. Different k-fold (k = 2-10, 15, 20) cross-validation scenarios (50 replicates, random assignment) were performed using a genomic BLUP approach. A maximum likelihood approach was used to estimate the parameters of different prediction equations. The highest likelihood was obtained when using a modified form of the deterministic equation of Daetwyler et al. (2010), augmented by a weighting factor (w) based on the assumption that the maximum achievable accuracy is [Formula: see text]. The proportion of genetic variance captured by the complete SNP sets ([Formula: see text]) was 0.76 to 0.82 for Holstein Friesian and 0.72 to 0.75 for Brown Swiss. When modifying the number of SNPs, w was found to be proportional to the log of the marker density up to a limit which is population and trait specific and was found to be reached with ∼20'000 SNPs in the Brown Swiss population studied.

  20. MDM2 promoter SNP344T>A (rs1196333) status does not affect cancer risk.

    Directory of Open Access Journals (Sweden)

    Stian Knappskog

    The MDM2 proto-oncogene plays a key role in central cellular processes like growth control and apoptosis, and the gene locus is frequently amplified in sarcomas. Two polymorphisms located in the MDM2 promoter P2 have been shown to affect cancer risk. One of these polymorphisms (SNP309T>G; rs2279744) facilitates Sp1 transcription factor binding to the promoter and is associated with increased cancer risk. In contrast, SNP285G>C (rs117039649), located 24 bp upstream of rs2279744, and in complete linkage disequilibrium with the SNP309G allele, reduces Sp1 recruitment and lowers cancer risk. Thus, fine tuning of MDM2 expression has proven to be of significant importance with respect to tumorigenesis. We assessed the potential functional effects of a third MDM2 promoter P2 polymorphism (SNP344T>A; rs1196333) located on the SNP309T allele. While in silico analyses indicated SNP344A to modulate TFAP2A, SPIB and AP1 transcription factor binding, we found no effect of SNP344 status on MDM2 expression levels. Assessing the frequency of SNP344A in healthy Caucasians (n = 2,954) and patients suffering from ovarian (n = 1,927), breast (n = 1,271), endometrial (n = 895) or prostatic cancer (n = 641), we detected no significant difference in the distribution of this polymorphism between any of these cancer forms and healthy controls (6.1% in healthy controls, and 4.9%, 5.0%, 5.4% and 7.2% in the cancer groups, respectively). In conclusion, our findings provide no evidence indicating that SNP344A may affect MDM2 transcription or cancer risk.