WorldWideScience

Sample records for gene-based snp marker

  1. SNP marker detection and genotyping in tilapia

    NARCIS (Netherlands)

    Bers, van N.E.M.; Crooijmans, R.P.M.A.; Groenen, M.A.M.; Dibbits, B.W.; Komen, J.

    2012-01-01

    We have generated a unique resource consisting of nearly 175 000 short contig sequences and 3569 SNP markers from the widely cultured GIFT (Genetically Improved Farmed Tilapia) strain of Nile tilapia (Oreochromis niloticus). In total, 384 SNPs were selected to monitor the wider applicability of the

  2. (SNP) markers for the Chinese black sleeper, Bostrychus sinensis

    African Journals Online (AJOL)

    We characterized 11 single nucleotide ploymorphism (SNP) markers for the Chinese black sleeper, Bostrychus sinensis. These markers were isolated from a genomic library and tested in ten geographically distant individuals of B. sinensis. Polymorphisms of these SNP loci were assessed using a wild population including ...

  3. Development of a single nucleotide polymorphism (SNP) marker for ...

    African Journals Online (AJOL)

    The nature of the single nucleotide polymorphism (SNP) marker was validated by DNA sequencing of the parental PCR products. Using high resolution melt (HRM) profiles and normalised difference plots, we successfully differentiated the homozygous dominant (wild type), homozygous recessive (LPA) and heterozygous ...

  4. (SNP) markers for the Chinese black sleeper, Bostrychus sinensis

    African Journals Online (AJOL)

    ajl yemi

    2011-04-25

    Apr 25, 2011 ... Polynesia, north to Japan and south to Australia (Kottelat et al., 1993; Masuda ... developed the first set of SNP markers for Chinese black sleeper which can ... Then, the. 44 primer pairs were designed based on all the cloning.

  5. Gene-based single nucleotide polymorphism markers for genetic and association mapping in common bean.

    Science.gov (United States)

    Galeano, Carlos H; Cortés, Andrés J; Fernández, Andrea C; Soler, Álvaro; Franco-Herrera, Natalia; Makunde, Godwill; Vanderleyden, Jos; Blair, Matthew W

    2012-06-26

    In common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies. In this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings. In short, this study illustrates the power of intron-based markers for linkage and association mapping in

  6. SNP markers retrieval for a non-model species: a practical approach

    Directory of Open Access Journals (Sweden)

    Shahin Arwa

    2012-01-01

    Full Text Available Abstract Background SNP (Single Nucleotide Polymorphism markers are rapidly becoming the markers of choice for applications in breeding because of next generation sequencing technology developments. For SNP development by NGS technologies, correct assembly of the huge amounts of sequence data generated is essential. Little is known about assembler's performance, especially when dealing with highly heterogeneous species that show a high genome complexity and what the possible consequences are of differences in assemblies on SNP retrieval. This study tested two assemblers (CAP3 and CLC on 454 data from four lily genotypes and compared results with respect to SNP retrieval. Results CAP3 assembly resulted in higher numbers of contigs, lower numbers of reads per contig, and shorter average read lengths compared to CLC. Blast comparisons showed that CAP3 contigs were highly redundant. Contrastingly, CLC in rare cases combined paralogs in one contig. Redundant and chimeric contigs may lead to erroneous SNPs. Filtering for redundancy can be done by blasting selected SNP markers to the contigs and discarding all the SNP markers that show more than one blast hit. Results on chimeric contigs showed that only four out of 2,421 SNP markers were selected from chimeric contigs. Conclusion In practice, CLC performs better in assembling highly heterogeneous genome sequences compared to CAP3, and consequently SNP retrieval is more efficient. Additionally a simple flow scheme is suggested for SNP marker retrieval that can be valid for all non-model species.

  7. A gene-based SNP resource and linkage map for the copepod Tigriopus californicus

    Directory of Open Access Journals (Sweden)

    Foley Brad R

    2011-11-01

    Full Text Available Abstract Background As yet, few genomic resources have been developed in crustaceans. This lack is particularly evident in Copepoda, given the extraordinary numerical abundance, and taxonomic and ecological diversity of this group. Tigriopus californicus is ideally suited to serve as a genetic model copepod and has been the subject of extensive work in environmental stress and reproductive isolation. Accordingly, we set out to develop a broadly-useful panel of genetic markers and to construct a linkage map dense enough for quantitative trait locus detection in an interval mapping framework for T. californicus--a first for copepods. Results One hundred and ninety Single Nucleotide Polymorphisms (SNPs were used to genotype our mapping population of 250 F2 larvae. We were able to construct a linkage map with an average intermarker distance of 1.8 cM, and a maximum intermarker distance of 10.3 cM. All markers were assembled into linkage groups, and the 12 linkage groups corresponded to the 12 known chromosomes of T. californicus. We estimate a total genome size of 401.0 cM, and a total coverage of 73.7%. Seventy five percent of the mapped markers were detected in 9 additional populations of T. californicus. Of available model arthropod genomes, we were able to show more colocalized pairs of homologues between T. californicus and the honeybee Apis mellifera, than expected by chance, suggesting preserved macrosynteny between Hymenoptera and Copepoda. Conclusions Our study provides an abundance of linked markers spanning all chromosomes. Many of these markers are also found in multiple populations of T. californicus, and in two other species in the genus. The genomic resource we have developed will enable mapping throughout the geographical range of this species and in closely related species. This linkage map will facilitate genome sequencing, mapping and assembly in an ecologically and taxonomically interesting group for which genomic resources are

  8. A gene-based SNP resource and linkage map for the copepod Tigriopus californicus

    Science.gov (United States)

    2011-01-01

    Background As yet, few genomic resources have been developed in crustaceans. This lack is particularly evident in Copepoda, given the extraordinary numerical abundance, and taxonomic and ecological diversity of this group. Tigriopus californicus is ideally suited to serve as a genetic model copepod and has been the subject of extensive work in environmental stress and reproductive isolation. Accordingly, we set out to develop a broadly-useful panel of genetic markers and to construct a linkage map dense enough for quantitative trait locus detection in an interval mapping framework for T. californicus--a first for copepods. Results One hundred and ninety Single Nucleotide Polymorphisms (SNPs) were used to genotype our mapping population of 250 F2 larvae. We were able to construct a linkage map with an average intermarker distance of 1.8 cM, and a maximum intermarker distance of 10.3 cM. All markers were assembled into linkage groups, and the 12 linkage groups corresponded to the 12 known chromosomes of T. californicus. We estimate a total genome size of 401.0 cM, and a total coverage of 73.7%. Seventy five percent of the mapped markers were detected in 9 additional populations of T. californicus. Of available model arthropod genomes, we were able to show more colocalized pairs of homologues between T. californicus and the honeybee Apis mellifera, than expected by chance, suggesting preserved macrosynteny between Hymenoptera and Copepoda. Conclusions Our study provides an abundance of linked markers spanning all chromosomes. Many of these markers are also found in multiple populations of T. californicus, and in two other species in the genus. The genomic resource we have developed will enable mapping throughout the geographical range of this species and in closely related species. This linkage map will facilitate genome sequencing, mapping and assembly in an ecologically and taxonomically interesting group for which genomic resources are currently under development

  9. A set of 14 DIP-SNP markers to detect unbalanced DNA mixtures.

    Science.gov (United States)

    Liu, Zhizhen; Liu, Jinding; Wang, Jiaqi; Chen, Deqing; Liu, Zidong; Shi, Jie; Li, Zeqin; Li, Wenyan; Zhang, Gengqian; Du, Bing

    2018-03-04

    Unbalanced DNA mixture is still a difficult problem for forensic practice. DIP-STRs are useful markers for detection of minor DNA but they are not widespread in the human genome and having long amplicons. In this study, we proposed a novel type of genetic marker, termed DIP-SNP. DIP-SNP refers to the combination of INDEL and SNP in less than 300bp length of human genome. The multiplex PCR and SNaPshot assay were established for 14 DIP-SNP markers in a Chinese Han population from Shanxi, China. This novel compound marker allows detection of the minor DNA contributor with sensitivity from 1:50 to 1:1000 in a DNA mixture of any gender with 1 ng-10 ng DNA template. Most of the DIP-SNP markers had a relatively high probability of informative alleles with an average I value of 0.33. In all, we proposed DIP-SNP as a novel kind of genetic marker for detection of minor contributor from unbalanced DNA mixture and established the detection method by associating the multiplex PCR and SNaPshot assay. DIP-SNP polymorphisms are promising markers for forensic or clinical mixture examination because they are shorter, widespread and higher sensitive. Copyright © 2018 Elsevier Inc. All rights reserved.

  10. Tri-allelic SNP markers enable analysis of mixed and degraded DNA samples.

    Science.gov (United States)

    Westen, Antoinette A; Matai, Anuska S; Laros, Jeroen F J; Meiland, Hugo C; Jasper, Mandy; de Leeuw, Wiljo J F; de Knijff, Peter; Sijen, Titia

    2009-09-01

    For the analysis of degraded DNA in disaster victim identification (DVI) and criminal investigations, single nucleotide polymorphisms (SNPs) have been recognized as promising markers mainly because they can be analyzed in short sized amplicons. Most SNPs are bi-allelic and are thereby ineffective to detect mixtures, which may lead to incorrect genotyping. We developed an algorithm to find non-binary (i.e. tri-allelic or tetra-allelic) SNPs in the NCBI dbSNP database. We selected 31 potential tri-allelic SNPs with a minor allele frequency of at least 10%. The tri-allelic nature was confirmed for 15 SNPs residing on 14 different chromosomes. Multiplex SNaPshot assays were developed, and the allele frequencies of 16 SNPs were determined among 153 Dutch and 111 Netherlands Antilles reference samples. Using these multiplex SNP assays, the presence of a mixture of two DNA samples in a ratio up to 1:8 could be recognized reliably. Furthermore, we compared the genotyping efficiency of the tri-allelic SNP markers and short tandem repeat (STR) markers by analyzing artificially degraded DNA and DNA from 30 approximately 500-year-old bone and molar samples. In both types of degraded DNA samples, the larger sized STR amplicons failed to amplify whereas the tri-allelic SNP markers still provided valuable information. In conclusion, tri-allelic SNP markers are suited for the analysis of degraded DNA and enable the detection of a second DNA source in a sample.

  11. MDM2 SNP309 and SNP285 Act as Negative Prognostic Markers for Non-small Cell Lung Cancer Adenocarcinoma Patients

    Science.gov (United States)

    Deben, Christophe; Op de Beeck, Ken; Van den Bossche, Jolien; Jacobs, Julie; Lardon, Filip; Wouters, An; Peeters, Marc; Van Camp, Guy; Rolfo, Christian; Deschoolmeester, Vanessa; Pauwels, Patrick

    2017-01-01

    Objectives: Two functional polymorphisms in the MDM2 promoter region, SNP309T>G and SNP285G>C, have been shown to impact MDM2 expression and cancer risk. Currently available data on the prognostic value of MDM2 SNP309 in non-small cell lung cancer (NSCLC) is contradictory and unavailable for SNP285. The goal of this study was to clarify the role of these MDM2 SNPs in the outcome of NSCLC patients. Materials and Methods: In this study we genotyped SNP309 and SNP285 in 98 NSCLC adenocarcinoma patients and determined MDM2 mRNA and protein levels. In addition, we assessed the prognostic value of these common SNPs on overall and progression free survival, taking into account the TP53 status of the tumor. Results and Conclusion: We found that the SNP285C allele, but not the SNP309G allele, was significantly associated with increased MDM2 mRNA expression levels (p = 0.025). However, we did not observe an association with MDM2 protein levels for SNP285. The SNP309G allele was significantly associated with the presence of wild type TP53 (p = 0.047) and showed a strong trend towards increased MDM2 protein levels (p = 0.068). In addition, patients harboring the SNP309G allele showed a worse overall survival, but only in the presence of wild type TP53. The SNP285C allele was significantly associated with an early age of diagnosis and metastasis. Additionally, the SNP285C allele acted as an independent predictor for worse progression free survival (HR = 3.97; 95% CI = 1.51 - 10.42; p = 0.005). Our data showed that both SNP309 (in the presence of wild type TP53) and SNP285 act as negative prognostic markers for NSCLC patients, implicating a prominent role for these variants in the outcome of these patients. PMID:28819417

  12. Transcriptome sequencing of different narrow-leafed lupin tissue types provides a comprehensive uni-gene assembly and extensive gene-based molecular markers

    Science.gov (United States)

    Kamphuis, Lars G; Hane, James K; Nelson, Matthew N; Gao, Lingling; Atkins, Craig A; Singh, Karam B

    2015-01-01

    Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is an important grain legume crop that is valuable for sustainable farming and is becoming recognized as a human health food. NLL breeding is directed at improving grain production, disease resistance, drought tolerance and health benefits. However, genetic and genomic studies have been hindered by a lack of extensive genomic resources for the species. Here, the generation, de novo assembly and annotation of transcriptome datasets derived from five different NLL tissue types of the reference accession cv. Tanjil are described. The Tanjil transcriptome was compared to transcriptomes of an early domesticated cv. Unicrop, a wild accession P27255, as well as accession 83A:476, together being the founding parents of two recombinant inbred line (RIL) populations. In silico predictions for transcriptome-derived gene-based length and SNP polymorphic markers were conducted and corroborated using a survey assembly sequence for NLL cv. Tanjil. This yielded extensive indel and SNP polymorphic markers for the two RIL populations. A total of 335 transcriptome-derived markers and 66 BAC-end sequence-derived markers were evaluated, and 275 polymorphic markers were selected to genotype the reference NLL 83A:476 × P27255 RIL population. This significantly improved the completeness, marker density and quality of the reference NLL genetic map. PMID:25060816

  13. Identification of Laying-Related SNP Markers in Geese Using RAD Sequencing.

    Directory of Open Access Journals (Sweden)

    ShiGang Yu

    Full Text Available Laying performance is an important economical trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV or low estimated breeding value (LEBV. A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, liner regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiplegene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the

  14. Development of 101 Gene-based Single Nucleotide Polymorphism Markers in Sea Cucumber, Apostichopus japonicus

    Directory of Open Access Journals (Sweden)

    Wei Lu

    2012-06-01

    Full Text Available Single nucleotide polymorphisms (SNPs are currently the marker of choice in a variety of genetic studies. Using the high resolution melting (HRM genotyping approach, 101 gene-based SNP markers were developed for Apostichopus japonicus, a sea cucumber species with economic significance for the aquaculture industry in East Asian countries. HRM analysis revealed that all the loci showed polymorphisms when evaluated using 40 A. japonicus individuals collected from a natural population. The minor allele frequency ranged from 0.035 to 0.489. The observed and expected heterozygosities ranged from 0.050 to 0.833 and 0.073 to 0.907, respectively. Thirteen loci were found to depart significantly from Hardy–Weinberg equilibrium (HWE after Bonferroni corrections. Significant linkage disequilibrium (LD was detected in one pair of markers. These SNP markers are expected to be useful for future quantitative trait loci (QTL analysis, and to facilitate marker-assisted selection (MAS in A. japonicus.

  15. Applying SNP marker technology in the cacao breeding program at the Cocoa Research Institute of Ghana

    Science.gov (United States)

    In this investigation 45 parental cacao plants and five progeny derived from the parental stock studied were genotyped using six SNP markers to determine off-types or mislabeled clones and to authenticate crosses made in the Cocoa Research Institute of Ghana (CRIG) breeding program. Investigation wa...

  16. Use of SNP markers to conserve genome-wide genetic diversity in livestock

    NARCIS (Netherlands)

    Engelsma, K.A.

    2012-01-01

    Conservation of genetic diversity in livestock breeds is important since it is, both within and between breeds, under threat. The availability of large numbers of SNP markers has resulted in new opportunities to estimate genetic diversity in more detail, and to improve prioritization of animals

  17. Pea Marker Database (PMD) - A new online database combining known pea (Pisum sativum L.) gene-based markers.

    Science.gov (United States)

    Kulaeva, Olga A; Zhernakov, Aleksandr I; Afonin, Alexey M; Boikov, Sergei S; Sulima, Anton S; Tikhonovich, Igor A; Zhukov, Vladimir A

    2017-01-01

    Pea (Pisum sativum L.) is the oldest model object of plant genetics and one of the most agriculturally important legumes in the world. Since the pea genome has not been sequenced yet, identification of genes responsible for mutant phenotypes or desirable agricultural traits is usually performed via genetic mapping followed by candidate gene search. Such mapping is best carried out using gene-based molecular markers, as it opens the possibility for exploiting genome synteny between pea and its close relative Medicago truncatula Gaertn., possessing sequenced and annotated genome. In the last 5 years, a large number of pea gene-based molecular markers have been designed and mapped owing to the rapid evolution of "next-generation sequencing" technologies. However, the access to the complete set of markers designed worldwide is limited because the data are not uniformed and therefore hard to use. The Pea Marker Database was designed to combine the information about pea markers in a form of user-friendly and practical online tool. Version 1 (PMD1) comprises information about 2484 genic markers, including their locations in linkage groups, the sequences of corresponding pea transcripts and the names of related genes in M. truncatula. Version 2 (PMD2) is an updated version comprising 15944 pea markers in the same format with several advanced features. To test the performance of the PMD, fine mapping of pea symbiotic genes Sym13 and Sym27 in linkage groups VII and V, respectively, was carried out. The results of mapping allowed us to propose the Sen1 gene (a homologue of SEN1 gene of Lotus japonicus (Regel) K. Larsen) as the best candidate gene for Sym13, and to narrow the list of possible candidate genes for Sym27 to ten, thus proving PMD to be useful for pea gene mapping and cloning. All information contained in PMD1 and PMD2 is available at www.peamarker.arriam.ru.

  18. Identification and characterization of gene-based SSR markers in date palm (Phoenix dactylifera L.

    Directory of Open Access Journals (Sweden)

    Zhao Yongli

    2012-12-01

    Full Text Available Abstract Background Date palm (Phoenix dactylifera L. is an important tree in the Middle East and North Africa due to the nutritional value of its fruit. Molecular Breeding would accelerate genetic improvement of fruit tree through marker assisted selection. However, the lack of molecular markers in date palm restricts the application of molecular breeding. Results In this study, we analyzed 28,889 EST sequences from the date palm genome database to identify simple-sequence repeats (SSRs and to develop gene-based markers, i.e. expressed sequence tag-SSRs (EST-SSRs. We identified 4,609 ESTs as containing SSRs, among which, trinucleotide motifs (69.7% were the most common, followed by tetranucleotide (10.4% and dinucleotide motifs (9.6%. The motif AG (85.7% was most abundant in dinucleotides, while motifs AGG (26.8%, AAG (19.3%, and AGC (16.1% were most common among trinucleotides. A total of 4,967 primer pairs were designed for EST-SSR markers from the computational data. In a follow up laboratory study, we tested a sample of 20 random selected primer pairs for amplification and polymorphism detection using genomic DNA from date palm cultivars. Nearly one-third of these primer pairs detected DNA polymorphism to differentiate the twelve date palm cultivars used. Functional categorization of EST sequences containing SSRs revealed that 3,108 (67.4% of such ESTs had homology with known proteins. Conclusion Date palm EST sequences exhibits a good resource for developing gene-based markers. These genic markers identified in our study may provide a valuable genetic and genomic tool for further genetic research and varietal development in date palm, such as diversity study, QTL mapping, and molecular breeding.

  19. Report on the development of putative functional SSR and SNP markers in passion fruits.

    Science.gov (United States)

    da Costa, Zirlane Portugal; Munhoz, Carla de Freitas; Vieira, Maria Lucia Carneiro

    2017-09-06

    Passionflowers Passiflora edulis and Passiflora alata are diploid, outcrossing and understudied fruit bearing species. In Brazil, passion fruit cultivation began relatively recently and has earned the country an outstanding position as the world's top producer of passion fruit. The fruit's main economic value lies in the production of juice, an essential exotic ingredient in juice blends. Currently, crop improvement strategies, including those for underexploited tropical species, tend to incorporate molecular genetic approaches. In this study, we examined a set of P. edulis transcripts expressed in response to infection by Xanthomonas axonopodis, (the passion fruit's main bacterial pathogen that attacks the vines), aiming at the development of putative functional markers, i.e. SSRs (simple sequence repeats) and SNPs (single nucleotide polymorphisms). A total of 210 microsatellites were found in 998 sequences, and trinucleotide repeats were found to be the most frequent (31.4%). Of the sequences selected for designing primers, 80.9% could be used to develop SSR markers, and 60.6% SNP markers for P. alata. SNPs were all biallelic and found within 15 gene fragments of P. alata. Overall, gene fragments generated 10,003 bp. SNP frequency was estimated as one SNP every 294 bp. Polymorphism rates revealed by SSR and SNP loci were 29.4 and 53.6%, respectively. Passiflora edulis transcripts were useful for the development of putative functional markers for P. alata, suggesting a certain level of sequence conservation between these cultivated species. The markers developed herein could be used for genetic mapping purposes and also in diversity studies.

  20. Comparison of SSR and SNP markers in estimation of genetic diversity and population structure of Indian rice varieties.

    Directory of Open Access Journals (Sweden)

    Nivedita Singh

    Full Text Available Simple sequence repeat (SSR and Single Nucleotide Polymorphic (SNP, the two most robust markers for identifying rice varieties were compared for assessment of genetic diversity and population structure. Total 375 varieties of rice from various regions of India archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty six genetic markers, each of hypervariable SSR (HvSSR and SNP which were distributed across 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers with an average of 2.22 alleles per locus whereas, 72 alleles were amplified with SNP markers. Polymorphic information content (PIC values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; utilizing an unrooted tree all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA indicated that maximum diversity was partitioned between and within individual level but not between populations. Principal coordinate analysis (PCoA with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation whereas, in case of SNP markers varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure were tested using K values from 1 to 20, but there was no clear population structure, therefore Ln(PD derived Δk was plotted against the K to determine the number of populations. In case of SSR maximum Δk was at K=5 whereas, in case of SNP maximum Δk was found at K=15, suggesting that resolution of population was higher with SNP markers, but SSR were more efficient for diversity analysis.

  1. Comparison of SSR and SNP markers in estimation of genetic diversity and population structure of Indian rice varieties.

    Science.gov (United States)

    Singh, Nivedita; Choudhury, Debjani Roy; Singh, Amit Kumar; Kumar, Sundeep; Srinivasan, Kalyani; Tyagi, R K; Singh, N K; Singh, Rakesh

    2013-01-01

    Simple sequence repeat (SSR) and Single Nucleotide Polymorphic (SNP), the two most robust markers for identifying rice varieties were compared for assessment of genetic diversity and population structure. Total 375 varieties of rice from various regions of India archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty six genetic markers, each of hypervariable SSR (HvSSR) and SNP which were distributed across 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers with an average of 2.22 alleles per locus whereas, 72 alleles were amplified with SNP markers. Polymorphic information content (PIC) values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; utilizing an unrooted tree all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA) indicated that maximum diversity was partitioned between and within individual level but not between populations. Principal coordinate analysis (PCoA) with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation whereas, in case of SNP markers varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure were tested using K values from 1 to 20, but there was no clear population structure, therefore Ln(PD) derived Δk was plotted against the K to determine the number of populations. In case of SSR maximum Δk was at K=5 whereas, in case of SNP maximum Δk was found at K=15, suggesting that resolution of population was higher with SNP markers, but SSR were more efficient for diversity analysis.

  2. Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies.

    Science.gov (United States)

    Gimode, Davis; Odeny, Damaris A; de Villiers, Etienne P; Wanyonyi, Solomon; Dida, Mathews M; Mneney, Emmarold E; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M

    2016-01-01

    Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. Our results also reveal an unexploited finger millet genetic resource that can be included in the regional

  3. Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies.

    Directory of Open Access Journals (Sweden)

    Davis Gimode

    Full Text Available Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS technologies to develop both Simple Sequence Repeat (SSR and Single Nucleotide Polymorphism (SNP markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. Our results also reveal an unexploited finger millet genetic resource that can be included

  4. Using SNP markers to estimate additive, dominance and imprinting genetic variance

    DEFF Research Database (Denmark)

    Lopes, M S; Bastiaansen, J W M; Janss, Luc

    The contributions of additive, dominance and imprinting effects to the variance of number of teats (NT) were evaluated in two purebred pig populations using SNP markers. Three different random regression models were evaluated, accounting for the mean and: 1) additive effects (MA), 2) additive...... and dominance effects (MAD) and 3) additive, dominance and imprinting effects (MADI). Additive heritability estimates were 0.30, 0.28 and 0.27-0.28 in both lines using MA, MAD and MADI, respectively. Dominance heritability ranged from 0.06 to 0.08 using MAD and MADI. Imprinting heritability ranged from 0.......01 to 0.02. Dominance effects make an important contribution to the genetic variation of NT in the two lines evaluated. Imprinting effects appeared less important for NT than additive and dominance effects. The SNP random regression model presented and evaluated in this study is a feasible approach...

  5. Diversity in 113 cowpea [Vigna unguiculata (L) Walp] accessions assessed with 458 SNP markers.

    Science.gov (United States)

    Egbadzor, Kenneth F; Ofori, Kwadwo; Yeboah, Martin; Aboagye, Lawrence M; Opoku-Agyeman, Michael O; Danquah, Eric Y; Offei, Samuel K

    2014-01-01

    Single Nucleotide Polymorphism (SNP) markers were used in characterization of 113 cowpea accessions comprising of 108 from Ghana and 5 from abroad. Leaf tissues from plants cultivated at the University of Ghana were genotyped at KBioscience in the United Kingdom. Data was generated for 477 SNPs, out of which 458 revealed polymorphism. The results were used to analyze genetic dissimilarity among the accessions using Darwin 5 software. The markers discriminated among all of the cowpea accessions and the dissimilarity values which ranged from 0.006 to 0.63 were used for factorial plot. Unexpected high levels of heterozygosity were observed on some of the accessions. Accessions known to be closely related clustered together in a dendrogram drawn with WPGMA method. A maximum length sub-tree which comprised of 48 core accessions was constructed. The software package structure was used to separate accessions into three groups, and the programme correctly identified varieties that were known hybrids. The hybrids were those accessions with numerous heterozygous loci. The structure plot showed closely related accessions with similar genome patterns. The SNP markers were more efficient in discriminating among the cowpea germplasm than morphological, seed protein polymorphism and simple sequence repeat studies reported earlier on the same collection.

  6. Development of a set of SNP markers present in expressed genes of the apple.

    Science.gov (United States)

    Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S

    2008-11-01

    Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequenced tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR was amplified using two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5, map using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.

  7. Integration of gene-based markers in a pearl millet genetic map for identification of candidate genes underlying drought tolerance quantitative trait loci

    Directory of Open Access Journals (Sweden)

    Sehgal Deepmala

    2012-01-01

    Full Text Available Abstract Background Identification of genes underlying drought tolerance (DT quantitative trait loci (QTLs will facilitate understanding of molecular mechanisms of drought tolerance, and also will accelerate genetic improvement of pearl millet through marker-assisted selection. We report a map based on genes with assigned functional roles in plant adaptation to drought and other abiotic stresses and demonstrate its use in identifying candidate genes underlying a major DT-QTL. Results Seventy five single nucleotide polymorphism (SNP and conserved intron spanning primer (CISP markers were developed from available expressed sequence tags (ESTs using four genotypes, H 77/833-2, PRLT 2/89-33, ICMR 01029 and ICMR 01004, representing parents of two mapping populations. A total of 228 SNPs were obtained from 30.5 kb sequenced region resulting in a SNP frequency of 1/134 bp. The positions of major pearl millet linkage group (LG 2 DT-QTLs (reported from crosses H 77/833-2 × PRLT 2/89-33 and 841B × 863B were added to the present consensus function map which identified 18 genes, coding for PSI reaction center subunit III, PHYC, actin, alanine glyoxylate aminotransferase, uridylate kinase, acyl-CoA oxidase, dipeptidyl peptidase IV, MADS-box, serine/threonine protein kinase, ubiquitin conjugating enzyme, zinc finger C- × 8-C × 5-C × 3-H type, Hd3, acetyl CoA carboxylase, chlorophyll a/b binding protein, photolyase, protein phosphatase1 regulatory subunit SDS22 and two hypothetical proteins, co-mapping in this DT-QTL interval. Many of these candidate genes were found to have significant association with QTLs of grain yield, flowering time and leaf rolling under drought stress conditions. Conclusions We have exploited available pearl millet EST sequences to generate a mapped resource of seventy five new gene-based markers for pearl millet and demonstrated its use in identifying candidate genes underlying a major DT-QTL in this species. The reported gene-based

  8. Genetic Diversity in Jatropha curcas L. Assessed with SSR and SNP Markers

    Directory of Open Access Journals (Sweden)

    Juan M. Montes

    2014-08-01

    Full Text Available Jatropha curcas L. (jatropha is an undomesticated plant that has recently received great attention for its utilization in biofuel production, rehabilitation of wasteland, and rural development. Knowledge of genetic diversity and marker-trait associations is urgently needed for the design of breeding strategies. The main goal of this study was to assess the genetic structure and diversity in jatropha germplasm with co-dominant markers (Simple Sequence Repeats (SSR and Single Nucleotide Polymorphism (SNP in a diverse, worldwide, germplasm panel of 70 accessions. We found a high level of homozygosis in the germplasm that does not correspond to the purely outcrossing mating system assumed to be present in jatropha. We hypothesize that the prevalent mating system of jatropha comprise a high level of self-fertilization and that the outcrossing rate is low. Genetic diversity in accessions from Central America and Mexico was higher than in accession from Africa, Asia, and South America. We identified makers associated with the presence of phorbol esters. We think that the utilization of molecular markers in breeding of jatropha will significantly accelerate the development of improved cultivars.

  9. Identification of a sex-linked SNP marker in the salmon louse (Lepeophtheirus salmonis using RAD sequencing.

    Directory of Open Access Journals (Sweden)

    Stephen N Carmichael

    Full Text Available The salmon louse (Lepeophtheirus salmonis (Krøyer, 1837 is a parasitic copepod that can, if untreated, cause considerable damage to Atlantic salmon (Salmo salar Linnaeus, 1758 and incurs significant costs to the Atlantic salmon mariculture industry. Salmon lice are gonochoristic and normally show sex ratios close to 1:1. While this observation suggests that sex determination in salmon lice is genetic, with only minor environmental influences, the mechanism of sex determination in the salmon louse is unknown. This paper describes the identification of a sex-linked Single Nucleotide Polymorphism (SNP marker, providing the first evidence for a genetic mechanism of sex determination in the salmon louse. Restriction site-associated DNA sequencing (RAD-seq was used to isolate SNP markers in a laboratory-maintained salmon louse strain. A total of 85 million raw Illumina 100 base paired-end reads produced 281,838 unique RAD-tags across 24 unrelated individuals. RAD marker Lsa101901 showed complete association with phenotypic sex for all individuals analysed, being heterozygous in females and homozygous in males. Using an allele-specific PCR assay for genotyping, this SNP association pattern was further confirmed for three unrelated salmon louse strains, displaying complete association with phenotypic sex in a total of 96 genotyped individuals. The marker Lsa101901 was located in the coding region of the prohibitin-2 gene, which showed a sex-dependent differential expression, with mRNA levels determined by RT-qPCR about 1.8-fold higher in adult female than adult male salmon lice. This study's observations of a novel sex-linked SNP marker are consistent with sex determination in the salmon louse being genetic and following a female heterozygous system. Marker Lsa101901 provides a tool to determine the genetic sex of salmon lice, and could be useful in the development of control strategies.

  10. Population structure and genetic diversity characterization of a sunflower association mapping population using SSR and SNP markers.

    Science.gov (United States)

    Filippi, Carla V; Aguirre, Natalia; Rivas, Juan G; Zubrzycki, Jeremias; Puebla, Andrea; Cordes, Diego; Moreno, Maria V; Fusari, Corina M; Alvarez, Daniel; Heinz, Ruth A; Hopp, Horacio E; Paniego, Norma B; Lia, Veronica V

    2015-02-13

    Argentina has a long tradition of sunflower breeding, and its germplasm is a valuable genetic resource worldwide. However, knowledge of the genetic constitution and variability levels of the Argentinean germplasm is still scarce, rendering the global map of cultivated sunflower diversity incomplete. In this study, 42 microsatellite loci and 384 single nucleotide polymorphisms (SNPs) were used to characterize the first association mapping population used for quantitative trait loci mapping in sunflower, along with a selection of allied open-pollinated and composite populations from the germplasm bank of the National Institute of Agricultural Technology of Argentina. The ability of different kinds of markers to assess genetic diversity and population structure was also evaluated. The analysis of polymorphism in the set of sunflower accessions studied here showed that both the microsatellites and SNP markers were informative for germplasm characterization, although to different extents. In general, the estimates of genetic variability were moderate. The average genetic diversity, as quantified by the expected heterozygosity, was 0.52 for SSR loci and 0.29 for SNPs. Within SSR markers, those derived from non-coding regions were able to capture higher levels of diversity than EST-SSR. A significant correlation was found between SSR and SNP- based genetic distances among accessions. Bayesian and multivariate methods were used to infer population structure. Evidence for the existence of three different genetic groups was found consistently across data sets (i.e., SSR, SNP and SSR + SNP), with the maintainer/restorer status being the most prevalent characteristic associated with group delimitation. The present study constitutes the first report comparing the performance of SSR and SNP markers for population genetics analysis in cultivated sunflower. We show that the SSR and SNP panels examined here, either used separately or in conjunction, allowed consistent

  11. QTL underlying some agronomic traits in barley detected by SNP markers.

    Science.gov (United States)

    Wang, Jibin; Sun, Genlou; Ren, Xifeng; Li, Chengdao; Liu, Lipan; Wang, Qifei; Du, Binbin; Sun, Dongfa

    2016-07-07

    Increasing the yield of barley (Hordeum vulgare L.) is a main breeding goal in developing barley cultivars. A high density genetic linkage map containing 1894 SNP and 68 SSR markers covering 1375.8 cM was constructed and used for mapping quantitative traits. A late-generation double haploid population (DH) derived from the Huaai 11 × Huadamai 6 cross was used to identify QTLs and QTL × environment interactions for ten traits affecting grain yield including length of main spike (MSL), spikelet number on main spike (SMS), spikelet number per plant (SLP), grain number per plant (GP), grain weight per plant (GWP), grain number per spike (GS), thousand grain weight (TGW), grain weight per spike (GWS), spike density (SPD) and spike number per plant (SP). In single environment analysis using composite interval mapping (CIM), a total of 221 QTLs underlying the ten traits were detected in five consecutive years (2009-2013). The QTLs detected in each year were 50, 48, 41, 41 and 41 for the year 2009 to 2013. The QTLs associated with these traits were generally clustered on chromosome 2H, 4H and 7H. In multi-environment analysis, a total of 111 significant QTLs including 18 for MSL, 16 for SMS, 15 for SPD, 5 for SP, 4 for SLP, 14 for TGW, 5 for GP, 11 for GS, 8 for GWP, and 15 for GWS were detected in the five years. Most QTLs showed significant QTL × environment interactions (QEI), nine QTLs (qIMSL3-1, qIMSL4-1, qIMSL4-2, qIMSL6-1, qISMS7-1, qISPD2-7, qISPD7-1, qITGW3-1 and qIGWS4-3) were detected with minimal QEI effects and stable in different years. Among 111 QTLs,71 (63.40 %) QTLs were detected in both single and multiple environments. Three main QTL cluster regions associated with the 10 agronomic traits on chromosome 2H, 4H and 7H were detected. The QTLs for SMS, SLP, GP and GWP were located in the region near Vrs1 on chromosome 2H. The QTLs underlying SMS, SPD and SLP were clustered on chromosome 4H. On the terminal of chromosome 7H, there was a QTL

  12. Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication.

    Science.gov (United States)

    Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

    2016-06-04

    Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.

  13. Whole-genome single-nucleotide polymorphism (SNP marker discovery and association analysis with the eicosapentaenoic acid (EPA and docosahexaenoic acid (DHA content in Larimichthys crocea

    Directory of Open Access Journals (Sweden)

    Shijun Xiao

    2016-12-01

    Full Text Available Whole-genome single-nucleotide polymorphism (SNP markers are valuable genetic resources for the association and conservation studies. Genome-wide SNP development in many teleost species are still challenging because of the genome complexity and the cost of re-sequencing. Genotyping-By-Sequencing (GBS provided an efficient reduced representative method to squeeze cost for SNP detection; however, most of recent GBS applications were reported on plant organisms. In this work, we used an EcoRI-NlaIII based GBS protocol to teleost large yellow croaker, an important commercial fish in China and East-Asia, and reported the first whole-genome SNP development for the species. 69,845 high quality SNP markers that evenly distributed along genome were detected in at least 80% of 500 individuals. Nearly 95% randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. The association studies with the muscle eicosapentaenoic acid (EPA and docosahexaenoic acid (DHA content discovered 39 significant SNP markers, contributing as high up to ∼63% genetic variance that explained by all markers. Functional genes that involved in fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, PPT2 Gene, previously identified in the association study of the plasma n-3 and n-6 polyunsaturated fatty acid level in human, was re-discovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS could produce quality SNP markers in a cost-efficient manner in teleost genome. The developed SNP markers and the EPA and DHA associated SNP loci provided invaluable resources for the population structure, conservation genetics and genomic selection of large yellow croaker and other fish organisms.

  14. Single strand conformation polymorphism based SNP and Indel markers for genetic mapping and synteny analysis of common bean (Phaseolus vulgaris L.

    Directory of Open Access Journals (Sweden)

    Gómez Marcela

    2009-12-01

    Full Text Available Abstract Background Expressed sequence tags (ESTs are an important source of gene-based markers such as those based on insertion-deletions (Indels or single-nucleotide polymorphisms (SNPs. Several gel based methods have been reported for the detection of sequence variants, however they have not been widely exploited in common bean, an important legume crop of the developing world. The objectives of this project were to develop and map EST based markers using analysis of single strand conformation polymorphisms (SSCPs, to create a transcript map for common bean and to compare synteny of the common bean map with sequenced chromosomes of other legumes. Results A set of 418 EST based amplicons were evaluated for parental polymorphisms using the SSCP technique and 26% of these presented a clear conformational or size polymorphism between Andean and Mesoamerican genotypes. The amplicon based markers were then used for genetic mapping with segregation analysis performed in the DOR364 × G19833 recombinant inbred line (RIL population. A total of 118 new marker loci were placed into an integrated molecular map for common bean consisting of 288 markers. Of these, 218 were used for synteny analysis and 186 presented homology with segments of the soybean genome with an e-value lower than 7 × 10-12. The synteny analysis with soybean showed a mosaic pattern of syntenic blocks with most segments of any one common bean linkage group associated with two soybean chromosomes. The analysis with Medicago truncatula and Lotus japonicus presented fewer syntenic regions consistent with the more distant phylogenetic relationship between the galegoid and phaseoloid legumes. Conclusion The SSCP technique is a useful and inexpensive alternative to other SNP or Indel detection techniques for saturating the common bean genetic map with functional markers that may be useful in marker assisted selection. In addition, the genetic markers based on ESTs allowed the construction

  15. Single strand conformation polymorphism based SNP and Indel markers for genetic mapping and synteny analysis of common bean (Phaseolus vulgaris L.).

    Science.gov (United States)

    Galeano, Carlos H; Fernández, Andrea C; Gómez, Marcela; Blair, Matthew W

    2009-12-23

    Expressed sequence tags (ESTs) are an important source of gene-based markers such as those based on insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). Several gel based methods have been reported for the detection of sequence variants, however they have not been widely exploited in common bean, an important legume crop of the developing world. The objectives of this project were to develop and map EST based markers using analysis of single strand conformation polymorphisms (SSCPs), to create a transcript map for common bean and to compare synteny of the common bean map with sequenced chromosomes of other legumes. A set of 418 EST based amplicons were evaluated for parental polymorphisms using the SSCP technique and 26% of these presented a clear conformational or size polymorphism between Andean and Mesoamerican genotypes. The amplicon based markers were then used for genetic mapping with segregation analysis performed in the DOR364 x G19833 recombinant inbred line (RIL) population. A total of 118 new marker loci were placed into an integrated molecular map for common bean consisting of 288 markers. Of these, 218 were used for synteny analysis and 186 presented homology with segments of the soybean genome with an e-value lower than 7 x 10-12. The synteny analysis with soybean showed a mosaic pattern of syntenic blocks with most segments of any one common bean linkage group associated with two soybean chromosomes. The analysis with Medicago truncatula and Lotus japonicus presented fewer syntenic regions consistent with the more distant phylogenetic relationship between the galegoid and phaseoloid legumes. The SSCP technique is a useful and inexpensive alternative to other SNP or Indel detection techniques for saturating the common bean genetic map with functional markers that may be useful in marker assisted selection. In addition, the genetic markers based on ESTs allowed the construction of a transcript map and given their high conservation

  16. Exploring germplasm diversity to understand the domestication process in Cicer spp. using SNP and DArT markers.

    Directory of Open Access Journals (Sweden)

    Manish Roorkiwal

    Full Text Available To estimate genetic diversity within and between 10 interfertile Cicer species (94 genotypes from the primary, secondary and tertiary gene pool, we analysed 5,257 DArT markers and 651 KASPar SNP markers. Based on successful allele calling in the tertiary gene pool, 2,763 DArT and 624 SNP markers that are polymorphic between genotypes from the gene pools were analyzed further. STRUCTURE analyses were consistent with 3 cultivated populations, representing kabuli, desi and pea-shaped seed types, with substantial admixture among these groups, while two wild populations were observed using DArT markers. AMOVA was used to partition variance among hierarchical sets of landraces and wild species at both the geographical and species level, with 61% of the variation found between species, and 39% within species. Molecular variance among the wild species was high (39% compared to the variation present in cultivated material (10%. Observed heterozygosity was higher in wild species than the cultivated species for each linkage group. Our results support the Fertile Crescent both as the center of domestication and diversification of chickpea. The collection used in the present study covers all the three regions of historical chickpea cultivation, with the highest diversity in the Fertile Crescent region. Shared alleles between different gene pools suggest the possibility of gene flow among these species or incomplete lineage sorting and could indicate complicated patterns of divergence and fusion of wild chickpea taxa in the past.

  17. Nuclear Species-Diagnostic SNP Markers Mined from 454 Amplicon Sequencing Reveal Admixture Genomic Structure of Modern Citrus Varieties

    Science.gov (United States)

    Curk, Franck; Ancillo, Gema; Ollitrault, Frédérique; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Garcia-Lor, Andres; Navarro, Luis; Ollitrault, Patrick

    2015-01-01

    Most cultivated Citrus species originated from interspecific hybridisation between four ancestral taxa (C. reticulata, C. maxima, C. medica, and C. micrantha) with limited further interspecific recombination due to vegetative propagation. This evolution resulted in admixture genomes with frequent interspecific heterozygosity. Moreover, a major part of the phenotypic diversity of edible citrus results from the initial differentiation between these taxa. Deciphering the phylogenomic structure of citrus germplasm is therefore essential for an efficient utilization of citrus biodiversity in breeding schemes. The objective of this work was to develop a set of species-diagnostic single nucleotide polymorphism (SNP) markers for the four Citrus ancestral taxa covering the nine chromosomes, and to use these markers to infer the phylogenomic structure of secondary species and modern cultivars. Species-diagnostic SNPs were mined from 454 amplicon sequencing of 57 gene fragments from 26 genotypes of the four basic taxa. Of the 1,053 SNPs mined from 28,507 kb sequence, 273 were found to be highly diagnostic for a single basic taxon. Species-diagnostic SNP markers (105) were used to analyse the admixture structure of varieties and rootstocks. This revealed C. maxima introgressions in most of the old and in all recent selections of mandarins, and suggested that C. reticulata × C. maxima reticulation and introgression processes were important in edible mandarin domestication. The large range of phylogenomic constitutions between C. reticulata and C. maxima revealed in mandarins, tangelos, tangors, sweet oranges, sour oranges, grapefruits, and orangelos is favourable for genetic association studies based on phylogenomic structures of the germplasm. Inferred admixture structures were in agreement with previous hypotheses regarding the origin of several secondary species and also revealed the probable origin of several acid citrus varieties. The developed species-diagnostic SNP

  18. Identification and Validation of SNP Markers Linked to Dwarf Traits Using SLAF-Seq Technology in Lagerstroemia

    Science.gov (United States)

    Ju, Yiqian; Jiao, Yao; Feng, Lu; Pan, Huitang; Cheng, Tangren; Zhang, Qixiang

    2016-01-01

    The genetic control of plant architecture is a promising approach to breed desirable cultivars, particularly in ornamental flowers. In this study, the F1 population (142 seedlings) derived from Lagerstroemia fauriei (non-dwarf) × L. indica ‘Pocomoke’ (dwarf) was phenotyped for six traits (plant height (PH), internode length (IL), internode number, primary lateral branch height (PLBH), secondary lateral branch height and primary branch number), and the IL and PLBH traits were positively correlated with the PH trait and considered representative indexes of PH. Fifty non-dwarf and dwarf seedlings were pooled and subjected to a specific-locus amplified fragment sequencing (SLAF-seq) method, which screened 1221 polymorphic markers. A total of 3 markers segregating between bulks were validated in the F1 population, with the M16337 and M38412 markers highly correlated with the IL trait and the M25207 marker highly correlated with the PLBH trait. These markers provide a predictability of approximately 80% using a single marker (M25207) and a predictability of 90% using marker combinations (M16337 + M25207) in the F1 population, which revealed that the IL and the PLBH traits, especially the PLBH, were the decisive elements for PH in terms of molecular regulation. Further validation was performed in the BC1 population and a set of 28 Lagerstroemia stocks using allele-specific PCR (AS-PCR) technology, and the results showed the stability and reliability of the SNP markers and the co-determination of PH by multiple genes. Our findings provide an important theoretical and practical basis for the early prediction and indirect selection of PH using the IL and the PLBH, and the detected SNPs may be useful for marker-assisted selection (MAS) in crape myrtle. PMID:27404662

  19. Leaf Transcriptome Sequencing for Identifying Genic-SSR Markers and SNP Heterozygosity in Crossbred Mango Variety 'Amrapali' (Mangifera indica L.).

    Science.gov (United States)

    Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar

    2016-01-01

    Mango (Mangifera indica L.) is called "king of fruits" due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties 'Neelam', 'Dashehari' and their hybrid 'Amrapali' using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango.

  20. Population structure of Atlantic Mackerel inferred from RAD-seq derived SNP markers: effects of sequence clustering parameters and hierarchical SNP selection

    KAUST Repository

    Rodrí guez-Ezpeleta, Naiara; Bradbury, Ian R.; Mendibil, Iñ aki; Á lvarez, Paula; Cotano, Unai; Irigoien, Xabier

    2016-01-01

    : the maximum number of mismatches allowed to merge reads into a locus and the relatedness of the individuals used for genotype calling and SNP selection. Our study resolves the population structure of the Atlantic mackerel, but, most importantly, provides

  1. Genotyping of single spore isolates of a Pasteuria penetrans population occurring in Florida using SNP-based markers.

    Science.gov (United States)

    Joseph, S; Schmidt, L M; Danquah, W B; Timper, P; Mekete, T

    2017-02-01

    To generate single spore lines of a population of bacterial parasite of root-knot nematode (RKN), Pasteuria penetrans, isolated from Florida and examine genotypic variation and virulence characteristics exist within the population. Six single spore lines (SSP), 16SSP, 17SSP, 18SSP, 25SSP, 26SSP and 30SSP were generated. Genetic variability was evaluated by comparing single-nucleotide polymorphisms (SNPs) in six protein-coding genes and the 16S rRNA gene. An average of one SNP was observed for every 69 bp in the 16S rRNA, whereas no SNPs were observed in the protein-coding sequences. Hierarchical cluster analysis of 16S rRNA sequences placed the clones into three distinct clades. Bio-efficacy analysis revealed significant heterogeneity in the level virulence and host specificity between the individual clones. The SNP markers developed to the 5' hypervariable region of the 16S rRNA gene may be useful in biotype differentiation within a population of P. penetrans. This study demonstrates an efficient method for generating single spore lines of P. penetrans and gives a deep insight into genetic heterogeneity and varying level of virulence exists within a population parasitizing a specific Meloidogyne sp. host. The results also suggest that the application of generalist spore lines in nematode management may achieve broad RKN control. © 2016 The Society for Applied Microbiology.

  2. SNP genotyping technologies

    DEFF Research Database (Denmark)

    Studer, Bruno; Kölliker, Roland

    2013-01-01

    In the recent years, single nucleotide polymorphism (SNP) markers have emerged as the marker technology of choice for plant genetics and breeding applications. Besides the efficient technologies available for SNP discovery even in complex genomes, one of the main reasons for this is the availabil...

  3. Application of next-generation sequencing technology to study genetic diversity and identify unique SNP markers in bread wheat from Kazakhstan.

    Science.gov (United States)

    Shavrukov, Yuri; Suchecki, Radoslaw; Eliby, Serik; Abugalieva, Aigul; Kenebayev, Serik; Langridge, Peter

    2014-09-28

    New SNP marker platforms offer the opportunity to investigate the relationships between wheat cultivars from different regions and assess the mechanism and processes that have led to adaptation to particular production environments. Wheat breeding has a long history in Kazakhstan and the aim of this study was to explore the relationship between key varieties from Kazakhstan and germplasm from breeding programs for other regions. The study revealed 5,898 polymorphic markers amongst ten cultivars, of which 2,730 were mapped in the consensus genetic map. Mapped SNP markers were distributed almost equally across the A and B genomes, with between 279 and 484 markers assigned to each chromosome. Marker coverage was approximately 10-fold lower in the D genome. There were 863 SNP markers identified as unique to specific cultivars, and clusters of these markers (regions containing more than three closely mapped unique SNPs) showed specific patterns on the consensus genetic map for each cultivar. Significant intra-varietal genetic polymorphism was identified in three cultivars (Tzelinnaya 3C, Kazakhstanskaya rannespelaya and Kazakhstanskaya 15). Phylogenetic analysis based on inter-varietal polymorphism showed that the very old cultivar Erythrospermum 841 was the most genetically distinct from the other nine cultivars from Kazakhstan, falling in a clade together with the American cultivar Sonora and genotypes from Central and South Asia. The modern cultivar Kazakhstanskaya 19 also fell into a separate clade, together with the American cultivar Thatcher. The remaining eight cultivars shared a single sub-clade but were categorised into four clusters. The accumulated data for SNP marker polymorphisms amongst bread wheat genotypes from Kazakhstan may be used for studying genetic diversity in bread wheat, with potential application for marker-assisted selection and the preparation of a set of genotype-specific markers.

  4. Genotyping by Sequencing in Almond: SNP Discovery, Linkage Mapping, and Marker Design

    Directory of Open Access Journals (Sweden)

    Shashi N. Goonetilleke

    2018-01-01

    Full Text Available In crop plant genetics, linkage maps provide the basis for the mapping of loci that affect important traits and for the selection of markers to be applied in crop improvement. In outcrossing species such as almond (Prunus dulcis Mill. D. A. Webb, application of a double pseudotestcross mapping approach to the F1 progeny of a biparental cross leads to the construction of a linkage map for each parent. Here, we report on the application of genotyping by sequencing to discover and map single nucleotide polymorphisms in the almond cultivars “Nonpareil” and “Lauranne.” Allele-specific marker assays were developed for 309 tag pairs. Application of these assays to 231 Nonpareil × Lauranne F1 progeny provided robust linkage maps for each parent. Analysis of phenotypic data for shell hardness demonstrated the utility of these maps for quantitative trait locus mapping. Comparison of these maps to the peach genome assembly confirmed high synteny and collinearity between the peach and almond genomes. The marker assays were applied to progeny from several other Nonpareil crosses, providing the basis for a composite linkage map of Nonpareil. Applications of the assays to a panel of almond clones and a panel of rootstocks used for almond production demonstrated the broad applicability of the markers and provide subsets of markers that could be used to discriminate among accessions. The sequence-based linkage maps and single nucleotide polymorphism assays presented here could be useful resources for the genetic analysis and genetic improvement of almond.

  5. Genome-Wide Association Study for Identification and Validation of Novel SNP Markers for Sr6 Stem Rust Resistance Gene in Bread Wheat.

    Science.gov (United States)

    Mourad, Amira M I; Sallam, Ahmed; Belamkar, Vikas; Wegulo, Stephen; Bowden, Robert; Jin, Yue; Mahdy, Ezzat; Bakheit, Bahy; El-Wafaa, Atif A; Poland, Jesse; Baenziger, Peter S

    2018-01-01

    Stem rust (caused by Puccinia graminis f. sp. tritici Erikss. & E. Henn.), is a major disease in wheat ( Triticum aestivium L.). However, in recent years it occurs rarely in Nebraska due to weather and the effective selection and gene pyramiding of resistance genes. To understand the genetic basis of stem rust resistance in Nebraska winter wheat, we applied genome-wide association study (GWAS) on a set of 270 winter wheat genotypes (A-set). Genotyping was carried out using genotyping-by-sequencing and ∼35,000 high-quality SNPs were identified. The tested genotypes were evaluated for their resistance to the common stem rust race in Nebraska (QFCSC) in two replications. Marker-trait association identified 32 SNP markers, which were significantly (Bonferroni corrected P < 0.05) associated with the resistance on chromosome 2D. The chromosomal location of the significant SNPs (chromosome 2D) matched the location of Sr6 gene which was expected in these genotypes based on pedigree information. A highly significant linkage disequilibrium (LD, r 2 ) was found between the significant SNPs and the specific SSR marker for the Sr6 gene ( Xcfd43 ). This suggests the significant SNP markers are tagging Sr6 gene. Out of the 32 significant SNPs, eight SNPs were in six genes that are annotated as being linked to disease resistance in the IWGSC RefSeq v1.0. The 32 significant SNP markers were located in nine haplotype blocks. All the 32 significant SNPs were validated in a set of 60 different genotypes (V-set) using single marker analysis. SNP markers identified in this study can be used in marker-assisted selection, genomic selection, and to develop KASP (Kompetitive Allele Specific PCR) marker for the Sr6 gene. Novel SNPs for Sr6 gene, an important stem rust resistant gene, were identified and validated in this study. These SNPs can be used to improve stem rust resistance in wheat.

  6. Genetic relationships among Vietnamese local pigs investigated using genome-wide SNP markers.

    Science.gov (United States)

    Ishihara, S; Arakawa, A; Taniguchi, M; Luu, Q M; Pham, D L; Nguyen, B V; Mikawa, S; Kikuchi, K

    2018-02-01

    Vietnam is one of the most important countries for pig domestication, and a total of 26 local breeds have been reported. In the present study, genetic relationships among the various pig breeds were investigated using 90 samples collected from local pigs (15 breeds) in 15 distantly separated, distinct areas of the country and six samples from Landrace pigs in Hanoi as an out-group of a common Western breed. All samples were genotyped using the Illumina Porcine SNP60 v2 Genotyping BeadChip. We used 15 160-15 217 SNPs that showed a high degree of polymorphism in the Vietnamese breeds for identifying genetic relationships among the Vietnamese breeds. Principal components analysis showed that most pigs indigenous to Vietnam formed clusters correlated with their original geographic locations. Some Vietnamese breeds formed a cluster that was genetically related to the Western breed Landrace, suggesting the possibility of crossbreeding. These findings will be useful for the conservation and management of Vietnamese local pig breeds. © 2018 Stichting International Foundation for Animal Genetics.

  7. SNP discovery and marker development for disease resistance candidate genes in common carp (Cyprinus carpio)

    Science.gov (United States)

    Single nucleotide polymorphisms (SNPs) in immune response genes have been reported as markers of susceptibility to infectious diseases in human and livestock. A disease caused by cyprinid herpes virus 3 (CyHV-3) is highly contagious and virulent in common carp. With the aim to investigate the gene...

  8. Mapping of a major QTL for salt tolerance of mature field-grown maize plants based on SNP markers.

    Science.gov (United States)

    Luo, Meijie; Zhao, Yanxin; Zhang, Ruyang; Xing, Jinfeng; Duan, Minxiao; Li, Jingna; Wang, Naishun; Wang, Wenguang; Zhang, Shasha; Chen, Zhihui; Zhang, Huasheng; Shi, Zi; Song, Wei; Zhao, Jiuran

    2017-08-15

    Salt stress significantly restricts plant growth and production. Maize is an important food and economic crop but is also a salt sensitive crop. Identification of the genetic architecture controlling salt tolerance facilitates breeders to select salt tolerant lines. However, the critical quantitative trait loci (QTLs) responsible for the salt tolerance of field-grown maize plants are still unknown. To map the main genetic factors contributing to salt tolerance in mature maize, a double haploid population (240 individuals) and 1317 single nucleotide polymorphism (SNP) markers were employed to produce a genetic linkage map covering 1462.05 cM. Plant height of mature maize cultivated in the saline field (SPH) and plant height-based salt tolerance index (ratio of plant height between saline and control fields, PHI) were used to evaluate salt tolerance of mature maize plants. A major QTL for SPH was detected on Chromosome 1 with the LOD score of 22.4, which explained 31.2% of the phenotypic variation. In addition, the major QTL conditioning PHI was also mapped at the same position on Chromosome 1, and two candidate genes involving in ion homeostasis were identified within the confidence interval of this QTL. The detection of the major QTL in adult maize plant establishes the basis for the map-based cloning of genes associated with salt tolerance and provides a potential target for marker assisted selection in developing maize varieties with salt tolerance.

  9. A new panel of SNP markers for the individual identification of North American pumas

    Science.gov (United States)

    Fitak, Robert R.; Naidu, Ashwin; Thompson, Ron W.; Culver, Melanie

    2016-01-01

    Pumas Puma concolor are one of the most studied terrestrial carnivores because of their widespread distribution, substantial ecological impacts, and conflicts with humans. Over the past decade, managing pumas has involved extensive efforts including the use of genetic methods. Microsatellites have been the most commonly used genetic markers; however, technical artifacts and little overlap of frequently used loci render large-scale comparison of puma genetic data across studies challenging. Therefore, a panel of genetic markers that can produce consistent genotypes across studies without the need for extensive calibrations is essential for range-wide genetic management of puma populations. Here, we describe the development of PumaPlex, a high-throughput assay to genotype 25 single nucleotide polymorphisms in pumas. We validated PumaPlex in 748 North American pumas Puma concolor couguar, and demonstrated its ability to generate reproducible genotypes and accurately identify individuals. Furthermore, in a test using fecal deoxyribonucleic acid (DNA) samples, we found that PumaPlex produced significantly more genotypes with fewer errors than 12 microsatellite loci, 8 of which are commonly used. Our results demonstrate that PumaPlex is a valuable tool for the genetic monitoring and management of North American puma populations. Given the analytical simplicity, reproducibility, and high-throughput capability of single nucleotide polymorphisms, PumaPlex provides a standard panel of markers that promotes the comparison of genotypes across studies and independent of the genotyping technology used.

  10. Outlier SNP markers reveal fine-scale genetic structuring across European hake populations (Merluccius merluccius)

    DEFF Research Database (Denmark)

    Milano, I.; Babbucci, M.; Cariani, A.

    2014-01-01

    fishery. Analysis of 850 individuals from 19 locations across the entire distribution range showed evidence for several outlier loci, with significantly higher resolving power. While 299 putatively neutral SNPs confirmed the genetic break between basins (FCT = 0.016) and weak differentiation within basins...... even when neutral markers provide genetic homogeneity across populations. Here, 381 SNPs located in transcribed regions were used to assess largeand fine-scale population structure in the European hake (Merluccius merluccius), a widely distributed demersal species of high priority for the European...

  11. Report on ISFG SNP Panel Discussion

    DEFF Research Database (Denmark)

    Butler, John M.; Budowle, B.; Gill, P.

    2008-01-01

    Six scientists presented their views and experience with single nucleotide polymorphism (SNP) markers, multiplexes, and methods regarding their potential application in forensic identity and relationship testing. Benefits and limitations of SNPs were reviewed, as were different SNP marker...

  12. Expression Level of the DREB2-Type Gene, Identified with Amplifluor SNP Markers, Correlates with Performance, and Tolerance to Dehydration in Bread Wheat Cultivars from Northern Kazakhstan

    Science.gov (United States)

    Shavrukov, Yuri; Zhumalin, Aibek; Serikbay, Dauren; Botayeva, Makpal; Otemisova, Ainur; Absattarova, Aiman; Sereda, Grigoriy; Sereda, Sergey; Shvidchenko, Vladimir; Turbekova, Arysgul; Jatayev, Satyvaldy; Lopato, Sergiy; Soole, Kathleen; Langridge, Peter

    2016-01-01

    A panel of 89 local commercial cultivars of bread wheat was tested in field trials in the dry conditions of Northern Kazakhstan. Two distinct groups of cultivars (six cultivars in each group), which had the highest and the lowest grain yield under drought were selected for further experiments. A dehydration test conducted on detached leaves indicated a strong association between rates of water loss in plants from the first group with highest grain yield production in the dry environment relative to the second group. Modern high-throughput Amplifluor Single Nucleotide Polymorphism (SNP) technology was applied to study allelic variations in a series of drought-responsive genes using 19 SNP markers. Genotyping of an SNP in the TaDREB5 (DREB2-type) gene using the Amplifluor SNP marker KATU48 revealed clear allele distribution across the entire panel of wheat accessions, and distinguished between the two groups of cultivars with high and low yield under drought. Significant differences in expression levels of TaDREB5 were revealed by qRT-PCR. Most wheat plants from the first group of cultivars with high grain yield showed slight up-regulation in the TaDREB5 transcript in dehydrated leaves. In contrast, expression of TaDREB5 in plants from the second group of cultivars with low grain yield was significantly down-regulated. It was found that SNPs did not alter the amino acid sequence of TaDREB5 protein. Thus, a possible explanation is that alternative splicing and up-stream regulation of TaDREB5 may be affected by SNP, but these hypotheses require additional analysis (and will be the focus of future studies). PMID:27917186

  13. Expression level of the DREB2-type gene, identified with Amplifluor SNP markers, correlates with performance and tolerance to dehydration in bread wheat cultivars from Northern Kazakhstan

    Directory of Open Access Journals (Sweden)

    Yuri Shavrukov

    2016-11-01

    Full Text Available A panel of 89 local commercial cultivars of bread wheat was tested in field trials in the dry conditions of Northern Kazakhstan. Two distinct groups of cultivars (six cultivars in each group, which had the highest and the lowest grain yield under drought were selected for further experiments. A dehydration test conducted on detached leaves indicated a strong association between rates of water loss in plants from the first group with highest grain yield production in the dry environment relative to the second group. Modern high-throughput Amplifluor SNP technology was applied to study allelic variations in a series of drought-responsive genes using 19 SNP markers. Genotyping of an SNP in the TaDREB5 (DREB2-type gene using the Amplifluor SNP marker KATU48 revealed clear allele distribution across the entire panel of wheat accessions, and distinguished between the two groups of cultivars with high and low yield under drought. Significant differences in expression levels of TaDREB5 were revealed by qRT-PCR. Most wheat plants from the first group of cultivars with high grain yield showed strong up-regulation of TaDREB5 transcript in dehydrated leaves. In contrast, expression of TaDREB5 in plants from the second group of cultivars with low grain yield was significantly down-regulated. It was found that SNPs did not alter the amino acid sequence of TaDREB5 protein. Thus, a possible explanation is that alternative splicing and up-stream regulation of TaDREB5 may be affected by SNP, but these hypotheses require additional analysis (and will be the focus of future studies.

  14. SNP detection from de novo transcriptome sequencing in the bivalve Macoma balthica: marker development for evolutionary studies.

    Directory of Open Access Journals (Sweden)

    Eric Pante

    Full Text Available Hybrid zones are noteworthy systems for the study of environmental adaptation to fast-changing environments, as they constitute reservoirs of polymorphism and are key to the maintenance of biodiversity. They can move in relation to climate fluctuations, as temperature can affect both selection and migration, or remain trapped by environmental and physical barriers. There is therefore a very strong incentive to study the dynamics of hybrid zones subjected to climate variations. The infaunal bivalve Macoma balthica emerges as a noteworthy model species, as divergent lineages hybridize, and its native NE Atlantic range is currently contracting to the North. To investigate the dynamics and functioning of hybrid zones in M. balthica, we developed new molecular markers by sequencing the collective transcriptome of 30 individuals. Ten individuals were pooled for each of the three populations sampled at the margins of two hybrid zones. A single 454 run generated 277 Mb from which 17K SNPs were detected. SNP density averaged 1 polymorphic site every 14 to 19 bases, for mitochondrial and nuclear loci, respectively. An [Formula: see text] scan detected high genetic divergence among several hundred SNPs, some of them involved in energetic metabolism, cellular respiration and physiological stress. The high population differentiation, recorded for nuclear-encoded ATP synthase and NADH dehydrogenase as well as most mitochondrial loci, suggests cytonuclear genetic incompatibilities. Results from this study will help pave the way to a high-resolution study of hybrid zone dynamics in M. balthica, and the relative importance of endogenous and exogenous barriers to gene flow in this system.

  15. Generation and analysis of ESTs from the eastern oyster, Crassostrea virginica Gmelin and identification of microsatellite and SNP markers

    Directory of Open Access Journals (Sweden)

    Wallace Richard

    2007-06-01

    Full Text Available Abstract Background The eastern oyster, Crassostrea virginica (Gmelin 1791, is an economically important species cultured in many areas in North America. It is also ecologically important because of the impact of its filter feeding behaviour on water quality. Populations of C. virginica have been threatened by overfishing, habitat degradation, and diseases. Through genome research, strategies are being developed to reverse its population decline. However, large-scale expressed sequence tag (EST resources have been lacking for this species. Efficient generation of EST resources from this species has been hindered by a high redundancy of transcripts. The objectives of this study were to construct a normalized cDNA library for efficient EST analysis, to generate thousands of ESTs, and to analyze the ESTs for microsatellites and potential single nucleotide polymorphisms (SNPs. Results A normalized and subtracted C. virginica cDNA library was constructed from pooled RNA isolated from hemocytes, mantle, gill, gonad and digestive tract, muscle, and a whole juvenile oyster. A total of 6,528 clones were sequenced from this library generating 5,542 high-quality EST sequences. Cluster analysis indicated the presence of 635 contigs and 4,053 singletons, generating a total of 4,688 unique sequences. About 46% (2,174 of the unique ESTs had significant hits (E-value ≤ 1e-05 to the non-redundant protein database; 1,104 of which were annotated using Gene Ontology (GO terms. A total of 35 microsatellites were identified from the ESTs, with 18 having sufficient flanking sequences for primer design. A total of 6,533 putative SNPs were also identified using all existing and the newly generated EST resources of the eastern oysters. Conclusion A high quality normalized cDNA library was constructed. A total of 5,542 ESTs were generated representing 4,688 unique sequences. Putative microsatellite and SNP markers were identified. These genome resources provide the

  16. Population structure of Atlantic Mackerel inferred from RAD-seq derived SNP markers: effects of sequence clustering parameters and hierarchical SNP selection

    KAUST Repository

    Rodríguez-Ezpeleta, Naiara

    2016-03-03

    Restriction-site associated DNA sequencing (RAD-seq) and related methods are revolutionizing the field of population genomics in non-model organisms as they allow generating an unprecedented number of single nucleotide polymorphisms (SNPs) even when no genomic information is available. Yet, RAD-seq data analyses rely on assumptions on nature and number of nucleotide variants present in a single locus, the choice of which may lead to an under- or overestimated number of SNPs and/or to incorrectly called genotypes. Using the Atlantic mackerel (Scomber scombrus L.) and a close relative, the Atlantic chub mackerel (Scomber colias), as case study, here we explore the sensitivity of population structure inferences to two crucial aspects in RAD-seq data analysis: the maximum number of mismatches allowed to merge reads into a locus and the relatedness of the individuals used for genotype calling and SNP selection. Our study resolves the population structure of the Atlantic mackerel, but, most importantly, provides insights into the effects of alternative RAD-seq data analysis strategies on population structure inferences that are directly applicable to other species.

  17. A genetic linkage map with 178 SSR and 1 901 SNP markers constructed using a RIL population in wheat (Triticum aestivum L.)

    Institute of Scientific and Technical Information of China (English)

    ZHAI Hui-jie; FENG Zhi-yu; LIU Xin-ye; CHENG Xue-jiao; PENG Hui-ru; YAO Ying-yin; SUN Qi-xin; NI Zhong-fu

    2015-01-01

    The construction of high density genetic linkage map provides a powerful tool to detect and map quantitative trait loci (QTLs) controlling agronomically important traits. In this study, simple sequence repeat (SSR) markers and Illumina 9K iSelect single nucleotide polymorphism (SNP) genechip were employed to construct one genetic linkage map of common wheat (Triticum aestivum L.) using 191 recombinant inbred lines (RILs) derived from cross Yu 8679xJing 411. This map included 1 901 SNP loci and 178 SSR loci, covering 1 659.9 cM and 1 000 marker bins, with an average interval distance of 1.66 cM. A, B and D genomes covered 719.1,703.5 and 237.3 cM, with an average interval distance of 1.66, 1.45 and 2.9 cM, respectively. Notably, the genetic linkage map covered 20 chromosomes, with the exception of chromosome 5D. Bioinformatics analysis revealed that 1 754 (92.27%) of 1 901 mapped SNP loci could be aligned to 1 215 distinct wheat unigenes, among which 1 184 (97.4%) were located on one single chromosome, and the rest 31 (2.6%) were located on 2 to 3 chromosomes. By performing in silico comparison, 214 chromosome deletion bin-mapped expressed sequence tags (ESTs), 1 043 Brachypodium genes and 1 033 rice genes were further added onto the genetic linkage map. This map not only integrated genetic and physical maps, SSR and SNP loci, respectively, but also provided the information of Brachypodium and rice genes corresponding to 1 754 SNP loci. Therefore, it will be a useful tool for comparative genomics analysis, fine mapping of QTL/gene controlling agronomically important traits and marker-assisted selection breeding in wheat.

  18. Exploiting transcriptome data for the development and characterization of gene-based SSR markers related to cold tolerance in oil palm (Elaeis guineensis).

    Science.gov (United States)

    Xiao, Yong; Zhou, Lixia; Xia, Wei; Mason, Annaliese S; Yang, Yaodong; Ma, Zilong; Peng, Ming

    2014-12-19

    The oil palm (Elaeis guineensis, 2n = 32) has the highest oil yield of any crop species, as well as comprising the richest dietary source of provitamin A. For the tropical species, the best mean growth temperature is about 27°C, with a minimal growth temperature of 15°C. Hence, the plantation area is limited into the geographical ranges of 10°N to 10°S. Enhancing cold tolerance capability will increase the total cultivation area and subsequently oil productivity of this tropical species. Developing molecular markers related to cold tolerance would be helpful for molecular breeding of cold tolerant Elaeis guineensis. In total, 5791 gene-based SSRs were identified in 51,452 expressed sequences from Elaeis guineensis transcriptome data: approximately one SSR was detected per 10 expressed sequences. Of these 5791 gene-based SSRs, 916 were derived from expressed sequences up- or down-regulated at least two-fold in response to cold stress. A total of 182 polymorphic markers were developed and characterized from 442 primer pairs flanking these cold-responsive SSR repeats. The polymorphic information content (PIC) of these polymorphic SSR markers across 24 lines of Elaeis guineensis varied from 0.08 to 0.65 (mean = 0.31 ± 0.12). Using in-silico mapping, 137 (75.3%) of the 182 polymorphic SSR markers were located onto the 16 Elaeis guineensis chromosomes. Total coverage of 473 Mbp was achieved, with an average physical distance of 3.4 Mbp between adjacent markers (range 96 bp - 20.8 Mbp). Meanwhile, Comparative analysis of transcriptome under cold stress revealed that one ICE1 putative ortholog, five CBF putative orthologs, 19 NAC transcription factors and four cold-induced orhologs were up-regulated at least two fold in response to cold stress. Interestingly, 5' untranslated region of both Unigene21287 (ICE1) and CL2628.Contig1 (NAC) both contained an SSR markers. In the present study, a series of SSR markers were developed based on sequences

  19. Accuracy of Assignment of Atlantic Salmon (Salmo salar L.) to Rivers and Regions in Scotland and Northeast England Based on Single Nucleotide Polymorphism (SNP) Markers

    Science.gov (United States)

    Gilbey, John; Cauwelier, Eef; Coulson, Mark W.; Stradmeyer, Lee; Sampayo, James N.; Armstrong, Anja; Verspoor, Eric; Corrigan, Laura; Shelley, Jonathan; Middlemas, Stuart

    2016-01-01

    Understanding the habitat use patterns of migratory fish, such as Atlantic salmon (Salmo salar L.), and the natural and anthropogenic impacts on them, is aided by the ability to identify individuals to their stock of origin. Presented here are the results of an analysis of informative single nucleotide polymorphic (SNP) markers for detecting genetic structuring in Atlantic salmon in Scotland and NE England and their ability to allow accurate genetic stock identification. 3,787 fish from 147 sites covering 27 rivers were screened at 5,568 SNP markers. In order to identify a cost-effective subset of SNPs, they were ranked according to their ability to differentiate between fish from different rivers. A panel of 288 SNPs was used to examine both individual assignments and mixed stock fisheries and eighteen assignment units were defined. The results improved greatly on previously available methods and, for the first time, fish caught in the marine environment can be confidently assigned to geographically coherent units within Scotland and NE England, including individual rivers. As such, this SNP panel has the potential to aid understanding of the various influences acting upon Atlantic salmon on their marine migrations, be they natural environmental variations and/or anthropogenic impacts, such as mixed stock fisheries and interactions with marine power generation installations. PMID:27723810

  20. Characterizing the population structure and genetic diversity of maize breeding germplasm in Southwest China using genome-wide SNP markers.

    Science.gov (United States)

    Zhang, Xiao; Zhang, Hua; Li, Lujiang; Lan, Hai; Ren, Zhiyong; Liu, Dan; Wu, Ling; Liu, Hailan; Jaqueth, Jennifer; Li, Bailin; Pan, Guangtang; Gao, Shibin

    2016-08-31

    Maize breeding germplasm used in Southwest China has high complexity because of the diverse ecological features of this area. In this study, the population structure, genetic diversity, and linkage disequilibrium decay distance of 362 important inbred lines collected from the breeding program of Southwest China were characterized using the MaizeSNP50 BeadChip with 56,110 single nucleotide polymorphisms (SNPs). With respect to population structure, two (Tropical and Temperate), three (Tropical, Stiff Stalk and non-Stiff Stalk), four [Tropical, group A germplasm derived from modern U.S. hybrids (PA), group B germplasm derived from modern U.S. hybrids (PB) and Reid] and six (Tropical, PB, Reid, Iowa Stiff Stalk Synthetic, PA and North) subgroups were identified. With increasing K value, the Temperate group showed pronounced hierarchical structure with division into further subgroups. The Genetic Diversity of each group was also estimated, and the Tropical group was more diverse than the Temperate group. Seven low-genetic-diversity and one high-genetic-diversity regions were collectively identified in the Temperate, Tropical groups, and the entire panel. SNPs with significant variation in allele frequency between the Tropical and Temperate groups were also evaluated. Among them, a region located at 130 Mb on Chromosome 2 showed the highest genetic diversity, including both number of SNPs with significant variation and the ratio of significant SNPs to total SNPs. Linkage disequilibrium decay distance in the Temperate group was greater (2.5-3 Mb) than that in the entire panel (0.5-0.75 Mb) and the Tropical group (0.25-0.5 Mb). A large region at 30-120 Mb of Chromosome 7 was concluded to be a region conserved during the breeding process by comparison between S37, which was considered a representative tropical line in Southwest China, and its 30 most similar derived lines. For the panel covered most of widely used inbred lines in Southwest China, this work

  1. Annotated genetic linkage maps of Pinus pinaster Ait. from a Central Spain population using microsatellite and gene based markers.

    Science.gov (United States)

    de Miguel, Marina; de Maria, Nuria; Guevara, M Angeles; Diaz, Luis; Sáez-Laguna, Enrique; Sánchez-Gómez, David; Chancerel, Emilie; Aranda, Ismael; Collada, Carmen; Plomion, Christophe; Cabezas, José-Antonio; Cervera, María-Teresa

    2012-10-04

    Pinus pinaster Ait. is a major resin producing species in Spain. Genetic linkage mapping can facilitate marker-assisted selection (MAS) through the identification of Quantitative Trait Loci and selection of allelic variants of interest in breeding populations. In this study, we report annotated genetic linkage maps for two individuals (C14 and C15) belonging to a breeding program aiming to increase resin production. We use different types of DNA markers, including last-generation molecular markers. We obtained 13 and 14 linkage groups for C14 and C15 maps, respectively. A total of 211 and 215 markers were positioned on each map and estimated genome length was between 1,870 and 2,166 cM respectively, which represents near 65% of genome coverage. Comparative mapping with previously developed genetic linkage maps for P. pinaster based on about 60 common markers enabled aligning linkage groups to this reference map. The comparison of our annotated linkage maps and linkage maps reporting QTL information revealed 11 annotated SNPs in candidate genes that co-localized with previously reported QTLs for wood properties and water use efficiency. This study provides genetic linkage maps from a Spanish population that shows high levels of genetic divergence with French populations from which segregating progenies have been previously mapped. These genetic maps will be of interest to construct a reliable consensus linkage map for the species. The importance of developing functional genetic linkage maps is highlighted, especially when working with breeding populations for its future application in MAS for traits of interest.

  2. Annotated genetic linkage maps of Pinus pinaster Ait. from a Central Spain population using microsatellite and gene based markers

    Directory of Open Access Journals (Sweden)

    de Miguel Marina

    2012-10-01

    Full Text Available Abstract Background Pinus pinaster Ait. is a major resin producing species in Spain. Genetic linkage mapping can facilitate marker-assisted selection (MAS through the identification of Quantitative Trait Loci and selection of allelic variants of interest in breeding populations. In this study, we report annotated genetic linkage maps for two individuals (C14 and C15 belonging to a breeding program aiming to increase resin production. We use different types of DNA markers, including last-generation molecular markers. Results We obtained 13 and 14 linkage groups for C14 and C15 maps, respectively. A total of 211 and 215 markers were positioned on each map and estimated genome length was between 1,870 and 2,166 cM respectively, which represents near 65% of genome coverage. Comparative mapping with previously developed genetic linkage maps for P. pinaster based on about 60 common markers enabled aligning linkage groups to this reference map. The comparison of our annotated linkage maps and linkage maps reporting QTL information revealed 11 annotated SNPs in candidate genes that co-localized with previously reported QTLs for wood properties and water use efficiency. Conclusions This study provides genetic linkage maps from a Spanish population that shows high levels of genetic divergence with French populations from which segregating progenies have been previously mapped. These genetic maps will be of interest to construct a reliable consensus linkage map for the species. The importance of developing functional genetic linkage maps is highlighted, especially when working with breeding populations for its future application in MAS for traits of interest.

  3. Development and dissection of diagnostic SNP markers for the downy mildew resistance genes Pl Arg and Pl 8 and maker-assisted gene pyramiding in sunflower (Helianthus annuus L.).

    Science.gov (United States)

    Qi, L L; Talukder, Z I; Hulke, B S; Foley, M E

    2017-06-01

    Diagnostic DNA markers are an invaluable resource in breeding programs for successful introgression and pyramiding of disease resistance genes. Resistance to downy mildew (DM) disease in sunflower is mediated by Pl genes which are known to be effective against the causal fungus, Plasmopara halstedii. Two DM resistance genes, Pl Arg and Pl 8 , are highly effective against P. halstedii races in the USA, and have been previously mapped to the sunflower linkage groups (LGs) 1 and 13, respectively, using simple sequence repeat (SSR) markers. In this study, we developed high-density single nucleotide polymorphism (SNP) maps encompassing the Pl arg and Pl 8 genes and identified diagnostic SNP markers closely linked to these genes. The specificity of the diagnostic markers was validated in a highly diverse panel of 548 sunflower lines. Dissection of a large marker cluster co-segregated with Pl Arg revealed that the closest SNP markers NSA_007595 and NSA_001835 delimited Pl Arg to an interval of 2.83 Mb on the LG1 physical map. The SNP markers SFW01497 and SFW06597 delimited Pl 8 to an interval of 2.85 Mb on the LG13 physical map. We also developed sunflower lines with homozygous, three gene pyramids carrying Pl Arg , Pl 8 , and the sunflower rust resistance gene R 12 using the linked SNP markers from a segregating F 2 population of RHA 340 (carrying Pl 8 )/RHA 464 (carrying Pl Arg and R 12 ). The high-throughput diagnostic SNP markers developed in this study will facilitate marker-assisted selection breeding, and the pyramided sunflower lines will provide durable resistance to downy mildew and rust diseases.

  4. Characterizing associations and SNP-environment interactions for GWAS-identified prostate cancer risk markers--results from BPC3.

    Directory of Open Access Journals (Sweden)

    Sara Lindstrom

    2011-02-01

    Full Text Available Genome-wide association studies (GWAS have identified multiple single nucleotide polymorphisms (SNPs associated with prostate cancer risk. However, whether these associations can be consistently replicated, vary with disease aggressiveness (tumor stage and grade and/or interact with non-genetic potential risk factors or other SNPs is unknown. We therefore genotyped 39 SNPs from regions identified by several prostate cancer GWAS in 10,501 prostate cancer cases and 10,831 controls from the NCI Breast and Prostate Cancer Cohort Consortium (BPC3. We replicated 36 out of 39 SNPs (P-values ranging from 0.01 to 10⁻²⁸. Two SNPs located near KLK3 associated with PSA levels showed differential association with Gleason grade (rs2735839, P = 0.0001 and rs266849, P = 0.0004; case-only test, where the alleles associated with decreasing PSA levels were inversely associated with low-grade (as defined by Gleason grade < 8 tumors but positively associated with high-grade tumors. No other SNP showed differential associations according to disease stage or grade. We observed no effect modification by SNP for association with age at diagnosis, family history of prostate cancer, diabetes, BMI, height, smoking or alcohol intake. Moreover, we found no evidence of pair-wise SNP-SNP interactions. While these SNPs represent new independent risk factors for prostate cancer, we saw little evidence for effect modification by other SNPs or by the environmental factors examined.

  5. Development of COS-SNP and HRM markers for high-throughput and reliable haplotype-based detection of Lr14a in durum wheat (Triticum durum Desf.).

    Science.gov (United States)

    Terracciano, Irma; Maccaferri, Marco; Bassi, Filippo; Mantovani, Paola; Sanguineti, Maria C; Salvi, Silvio; Simková, Hana; Doležel, Jaroslav; Massi, Andrea; Ammar, Karim; Kolmer, James; Tuberosa, Roberto

    2013-04-01

    Leaf rust (Puccinia triticina Eriks. & Henn.) is a major disease affecting durum wheat production. The Lr14a-resistant gene present in the durum wheat cv. Creso and its derivative cv. Colosseo is one of the best characterized leaf-rust resistance sources deployed in durum wheat breeding. Lr14a has been mapped close to the simple sequence repeat markers gwm146, gwm344 and wmc10 in the distal portion of the chromosome arm 7BL, a gene-dense region. The objectives of this study were: (1) to enrich the Lr14a region with single nucleotide polymorphisms (SNPs) and high-resolution melting (HRM)-based markers developed from conserved ortholog set (COS) genes and from sequenced Diversity Array Technology (DArT(®)) markers; (2) to further investigate the gene content and colinearity of this region with the Brachypodium and rice genomes. Ten new COS-SNP and five HRM markers were mapped within an 8.0 cM interval spanning Lr14a. Two HRM markers pinpointed the locus in an interval of HRM designed for agarose gel electrophoresis/KASPar(®) assays and high-resolution melting analysis, respectively, as well as the double-marker combinations ubw14/ubw18, ubw14/ubw35 and wPt-4038-HRM-ubw35 will be useful for germplasm haplotyping and for molecular-assisted breeding.

  6. Marker-assisted introgression of drought tolerance from wild ancestors into popular Indian rice varieties using a 7K Infinium SNP array

    Directory of Open Access Journals (Sweden)

    Ravindra Donde

    2017-10-01

    Full Text Available Recent advances in the area of genomics have led to the development of high throughput genotyping platforms that have immensely contributed to molecular breeding programs. Custom-designed single nucleotide polymorphism (SNP arrays provide an efficient, cost effective, high throughput genotyping tool for QTL/gene mapping, variety identification, marker-assisted selection, etc. In the current study, two interspecific libraries of Chromosome Segment Substitution Lines (CSSLs were evaluated under both drought and control conditions to identify lines with superior yield under drought. The CSSL libraries consisted of 48 BC4F3 lines derived from O. sativa cv. Curinga (tropical japonica x O. rufipogon, and 32 BC4F3 lines derived from O. sativa cv. Curinga (tropical japonica x O. meridionalis. The phenotypic screening of these 80 CSSLs led to the identification of three lines, MER-20, RUF-16, and RUF-44, that yielded well under drought stress. This line was backcrossed with popular rice variety of India, Swarna-Sub1 to introgress wild chromosome segments responsible for reproductive stage drought tolerance. During backcrossing, tracking of wild introgressions and monitoring of recurrent parent genome recovery was facilitated by the use of the Cornell 6K and 7K Infinium rice SNP arrays. The 6K and 7K SNP arrays assayed 5275 SNPs and 7099 SNPs, respectively, distributed across the 12 chromosomes. In our populations of (MER-20X Swarna sub1 BC2F1 lines, 1775 SNPs were polymorphic using the 6K array. The percentage of recurrent parent genome in these backcrossed lines ranged from 33-92% and the percentage of wild donor genome ranged from 8-67%. Using genotypic selection, 5% of plants were identified for further marker assisted backcrossing, based on the presence of the target donor (wild segment and maximum recovery of recurrent parent background. In the next generation, BC3F1 lines were genotyped using the 7K SNP array, which identified 2521 polymorphic SNPs

  7. Development, genetic mapping and QTL association of cotton PHYA, PHYB, and HY5-specific CAPS and dCAPS markers

    Science.gov (United States)

    Among SNP markers that become increasingly valuable in molecular breeding of crop plants are the CAP and dCAP markers derived from the genes of interest. To date, the number of such gene-based markers is small in polyploid crop plants such as tetraploid cotton that has A and D subgenomes. The obje...

  8. Leaf Transcriptome Sequencing for Identifying Genic-SSR Markers and SNP Heterozygosity in Crossbred Mango Variety ‘Amrapali’ (Mangifera indica L.)

    Science.gov (United States)

    Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar

    2016-01-01

    Mango (Mangifera indica L.) is called “king of fruits” due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties ‘Neelam’, ‘Dashehari’ and their hybrid ‘Amrapali’ using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango. PMID:27736892

  9. Development of Gene-Based SSR Markers in Rice Bean (Vigna umbellata L. Based on Transcriptome Data.

    Directory of Open Access Journals (Sweden)

    Honglin Chen

    Full Text Available Rice bean (Vigna umbellata (Thunb. Ohwi & Ohashi is a warm season annual legume mainly grown in East Asia. Only scarce genomic resources are currently available for this legume crop species and no simple sequence repeat (SSR markers have been specifically developed for rice bean yet. In this study, approximately 26 million high quality cDNA sequence reads were obtained from rice bean using Illumina paired-end sequencing technology and assembled into 71,929 unigenes with an average length of 986 bp. Of these unigenes, 38,840 (33.2% showed significant similarity to proteins in the NCBI non-redundant protein and nucleotide sequence databases. Furthermore, 30,170 (76.3% could be classified into gene ontology categories, 25,451 (64.4% into Swiss-Prot categories and 21,982 (55.6% into KOG database categories (E-value < 1.0E-5. A total of 9,301 (23.5% were mapped onto 118 pathways using the Kyoto Encyclopedia of Genes and Genome (KEGG pathway database. A total of 3,011 genic SSRs were identified as potential molecular markers. AG/CT (30.3%, AAG/CTT (8.1% and AGAA/TTCT (20.0% are the three main repeat motifs. A total of 300 SSR loci were randomly selected for validation by using PCR amplification. Of these loci, 23 primer pairs were polymorphic among 32 rice bean accessions. A UPGMA dendrogram revealed three major clusters among 32 rice bean accessions. The large number of SSR-containing sequences and genic SSRs in this study will be valuable for the construction of high-resolution genetic linkage maps, association or comparative mapping and genetic analyses of various Vigna species.

  10. Gene-based SSR markers for common bean (Phaseolus vulgaris L.) derived from root and leaf tissue ESTs: an integration of the BMc series.

    Science.gov (United States)

    Blair, Matthew W; Hurtado, Natalia; Chavarro, Carolina M; Muñoz-Torres, Monica C; Giraldo, Martha C; Pedraza, Fabio; Tomkins, Jeff; Wing, Rod

    2011-03-22

    Granada and Mesoamerica subgroup 1 (black beans) both with regards to gene expression and as sources of markers. However, we found few differences between SSR type and frequency between the G19833 leaf and DOR364 root tissue-derived ESTs. Overall, our work adds to the analysis of microsatellite frequency evaluation for common bean and provides a new set of 120 BMc markers which combined with the 248 previously developed BMc markers brings the total in this series to 368 markers. Once we include BMd markers, which are derived from GenBank sequences, the current total of gene-based markers from our laboratory surpasses 500 markers. These markers are basic for studies of the transcriptome of common bean and can form anchor points for genetic mapping studies in the future.

  11. QTL Mapping of Adult-Plant Resistance to Leaf Rust in the Wheat Cross Zhou 8425B/Chinese Spring Using High-Density SNP Markers

    Directory of Open Access Journals (Sweden)

    Peipei Zhang

    2017-05-01

    Full Text Available Wheat leaf rust is an important disease worldwide. Growing resistant cultivars is an effective means to control the disease. In the present study, 244 recombinant inbred lines from Zhou 8425B/Chinese Spring cross were phenotyped for leaf rust severities during the 2011–2012, 2012–2013, 2013–2014, and 2014–2015 cropping seasons at Baoding, Hebei province, and 2012–2013 and 2013–2014 cropping seasons in Zhoukou, Henan province. The population was genotyped using the high-density Illumina iSelect 90K SNP assay and SSR markers. Inclusive composite interval mapping identified eight QTL, designated as QLr.hebau-2AL, QLr.hebau-2BS, QLr.hebau-3A, QLr.hebau-3BS, QLr.hebau-4AL, QLr.hebau-4B, QLr.hebau-5BL, and QLr.hebau-7DS, respectively. QLr.hebau-2BS, QLr.hebau-3A, QLr.hebau-3BS, and QLr.hebau-5BL were derived from Zhou 8425B, whereas the other four were from Chinese Spring. Three stable QTL on chromosomes 2BS, 4B and 7DS explained 7.5–10.6%, 5.5–24.4%, and 11.2–20.9% of the phenotypic variance, respectively. QLr.hebau-2BS in Zhou 8425B might be the same as LrZH22 in Zhoumai 22; QLr.hebau-4B might be the residual resistance of Lr12, and QLr.hebau-7DS is Lr34. QLr.hebau-2AL, QLr.hebau-3BS, QLr.hebau-4AL, and QLr.hebau-5BL are likely to be novel QTL for leaf rust. These QTL and their closely linked SNP and SSR markers can be used for fine mapping, candidate gene discovery, and marker-assisted selection in wheat breeding.

  12. Marcadores SNP: conceitos básicos, aplicações no manejo e no melhoramento animal e perspectivas para o futuro SNP markers: basic concepts, applications in animal breeding and management and perspectives for the future

    Directory of Open Access Journals (Sweden)

    Alexandre Rodrigues Caetano

    2009-07-01

    molecular markers to characterize genetic resources and generate tools for animal breeding and management date from the end of the 80s. In the last 20 years the technologies to generate molecular data went through several innovation cycles. The last wave of technological innovations represents a true revolution, bringing methods to identify and genotype SNP (Single Nucleotide Polymorphism markers in large scale. High density DNA chips were generated to genotype from tens of thousands to hundreds of thousands of SNPs in a single assay. Furthermore, other medium density technologies allow for the genotyping of tens to hundreds of makers, in high numbers of samples, with very high speed and automation. These new technologies allowed for the generation of new applications, such as the methods to genetically evaluate and select animals based on their Genomic Value (Genomic Estimated Breeding Value - GEBV. The statistical methods for genomic evaluation and selection are in full development, but the technology already became reality with the release of the first bull summary for the Holstein breed with GEBVs for milk production and quality traits in January 2009. In addition, these technologies brought new options for development of diagnostic tests for paternity testing, individual identification, traceability, etc. Also, these new technologies to genotype SNP markers facilitated the development of outsourcing companies to generate molecular data, allowing any group to conduct advanced experiments, always using the most advanced technologies, without the need of investments into equipment.

  13. Genome-wide linkage mapping of yield-related traits in three Chinese bread wheat populations using high-density SNP markers.

    Science.gov (United States)

    Li, Faji; Wen, Weie; He, Zhonghu; Liu, Jindong; Jin, Hui; Cao, Shuanghe; Geng, Hongwei; Yan, Jun; Zhang, Pingzhi; Wan, Yingxiu; Xia, Xianchun

    2018-06-01

    We identified 21 new and stable QTL, and 11 QTL clusters for yield-related traits in three bread wheat populations using the wheat 90 K SNP assay. Identification of quantitative trait loci (QTL) for yield-related traits and closely linked molecular markers is important in order to identify gene/QTL for marker-assisted selection (MAS) in wheat breeding. The objectives of the present study were to identify QTL for yield-related traits and dissect the relationships among different traits in three wheat recombinant inbred line (RIL) populations derived from crosses Doumai × Shi 4185 (D × S), Gaocheng 8901 × Zhoumai 16 (G × Z) and Linmai 2 × Zhong 892 (L × Z). Using the available high-density linkage maps previously constructed with the wheat 90 K iSelect single nucleotide polymorphism (SNP) array, 65, 46 and 53 QTL for 12 traits were identified in the three RIL populations, respectively. Among them, 34, 23 and 27 were likely to be new QTL. Eighteen common QTL were detected across two or three populations. Eleven QTL clusters harboring multiple QTL were detected in different populations, and the interval 15.5-32.3 cM around the Rht-B1 locus on chromosome 4BS harboring 20 QTL is an important region determining grain yield (GY). Thousand-kernel weight (TKW) is significantly affected by kernel width and plant height (PH), whereas flag leaf width can be used to select lines with large kernel number per spike. Eleven candidate genes were identified, including eight cloned genes for kernel, heading date (HD) and PH-related traits as well as predicted genes for TKW, spike length and HD. The closest SNP markers of stable QTL or QTL clusters can be used for MAS in wheat breeding using kompetitive allele-specific PCR or semi-thermal asymmetric reverse PCR assays for improvement of GY.

  14. Using SNP markers to dissect linkage disequilibrium at a major quantitative trait locus for resistance to the potato cyst nematode Globodera pallida on potato chromosome V.

    Science.gov (United States)

    Achenbach, Ute; Paulo, Joao; Ilarionova, Evgenyia; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Gebhardt, Christiane

    2009-02-01

    The damage caused by the parasitic root cyst nematode Globodera pallida is a major yield-limiting factor in potato cultivation . Breeding for resistance is facilitated by the PCR-based marker 'HC', which is diagnostic for an allele conferring high resistance against G. pallida pathotype Pa2/3 that has been introgressed from the wild potato species Solanum vernei into the Solanum tuberosum tetraploid breeding pool. The major quantitative trait locus (QTL) controlling this nematode resistance maps on potato chromosome V in a hot spot for resistance to various pathogens including nematodes and the oomycete Phytophthora infestans. An unstructured sample of 79 tetraploid, highly heterozygous varieties and breeding clones was selected based on presence (41 genotypes) or absence (38 genotypes) of the HC marker. Testing the clones for resistance to G. pallida confirmed the diagnostic power of the HC marker. The 79 individuals were genotyped for 100 single nucleotide polymorphisms (SNPs) at 10 loci distributed over 38 cM on chromosome V. Forty-five SNPs at six loci spanning 2 cM in the interval between markers GP21-GP179 were associated with resistance to G. pallida. Based on linkage disequilibrium (LD) between SNP markers, six LD groups comprising between 2 and 18 SNPs were identified. The LD groups indicated the existence of multiple alleles at a single resistance locus or at several, physically linked resistance loci. LD group C comprising 18 SNPs corresponded to the 'HC' marker. LD group E included 16 SNPs and showed an association peak, which positioned one nematode resistance locus physically close to the R1 gene family.

  15. SNP-PHAGE – High throughput SNP discovery pipeline

    Directory of Open Access Journals (Sweden)

    Cregan Perry B

    2006-10-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs as defined here are single base sequence changes or short insertion/deletions between or within individuals of a given species. As a result of their abundance and the availability of high throughput analysis technologies SNP markers have begun to replace other traditional markers such as restriction fragment length polymorphisms (RFLPs, amplified fragment length polymorphisms (AFLPs and simple sequence repeats (SSRs or microsatellite markers for fine mapping and association studies in several species. For SNP discovery from chromatogram data, several bioinformatics programs have to be combined to generate an analysis pipeline. Results have to be stored in a relational database to facilitate interrogation through queries or to generate data for further analyses such as determination of linkage disequilibrium and identification of common haplotypes. Although these tasks are routinely performed by several groups, an integrated open source SNP discovery pipeline that can be easily adapted by new groups interested in SNP marker development is currently unavailable. Results We developed SNP-PHAGE (SNP discovery Pipeline with additional features for identification of common haplotypes within a sequence tagged site (Haplotype Analysis and GenBank (-dbSNP submissions. This tool was applied for analyzing sequence traces from diverse soybean genotypes to discover over 10,000 SNPs. This package was developed on UNIX/Linux platform, written in Perl and uses a MySQL database. Scripts to generate a user-friendly web interface are also provided with common queries for preliminary data analysis. A machine learning tool developed by this group for increasing the efficiency of SNP discovery is integrated as a part of this package as an optional feature. The SNP-PHAGE package is being made available open source at http://bfgl.anri.barc.usda.gov/ML/snp-phage/. Conclusion SNP-PHAGE provides a bioinformatics

  16. Evaluation of inbreeding depression in Holstein cattle using whole-genome SNP markers and alternative measures of genomic inbreeding.

    Science.gov (United States)

    Bjelland, D W; Weigel, K A; Vukasinovic, N; Nkrumah, J D

    2013-07-01

    The effects of increased pedigree inbreeding in dairy cattle populations have been well documented and result in a negative impact on profitability. Recent advances in genotyping technology have allowed researchers to move beyond pedigree analysis and study inbreeding at a molecular level. In this study, 5,853 animals were genotyped for 54,001 single nucleotide polymorphisms (SNP); 2,913 cows had phenotypic records including a single lactation for milk yield (from either lactation 1, 2, 3, or 4), reproductive performance, and linear type conformation. After removing SNP with poor call rates, low minor allele frequencies, and departure from Hardy-Weinberg equilibrium, 33,025 SNP remained for analyses. Three measures of genomic inbreeding were evaluated: percent homozygosity (FPH), inbreeding calculated from runs of homozygosity (FROH), and inbreeding derived from a genomic relationship matrix (FGRM). Average FPH was 60.5±1.1%, average FROH was 3.8±2.1%, and average FGRM was 20.8±2.3%, where animals with larger values for each of the genomic inbreeding indices were considered more inbred. Decreases in total milk yield to 205d postpartum of 53, 20, and 47kg per 1% increase in FPH, FROH, and FGRM, respectively, were observed. Increases in days open per 1% increase in FPH (1.76 d), FROH (1.72 d), and FGRM (1.06 d) were also noted, as well as increases in maternal calving difficulty (0.09, 0.03, and 0.04 on a 5-point scale for FPH, FROH, and FGRM, respectively). Several linear type traits, such as strength (-0.40, -0.11, and -0.19), rear legs rear view (-0.35, -0.16, and -0.14), front teat placement (0.35, 0.25, 0.18), and teat length (-0.24, -0.14, and -0.13) were also affected by increases in FPH, FROH, and FGRM, respectively. Overall, increases in each measure of genomic inbreeding in this study were associated with negative effects on production and reproductive ability in dairy cows. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc

  17. Multiple SNP markers reveal fine-scale population and deep phylogeographic structure in European anchovy (Engraulis encrasicolus L.).

    KAUST Repository

    Zarraonaindia, Iratxe; Iriondo, Mikel; Albaina, Aitor; Pardo, Miguel Angel; Manzano, Carmen; Grant, W Stewart; Irigoien, Xabier; Estonba, Andone

    2012-01-01

    DNA SNPs define two deep phylogroups that reflect ancient dispersals and colonizations. These markers define two ecological groups. One major group of Iberian-Atlantic populations is associated with upwelling areas on narrow continental shelves and includes

  18. SNP Marker Integration and QTL Analysis of 12 Agronomic and Morphological Traits in F8 RILs of Pepper (Capsicum annuum L.)

    Science.gov (United States)

    Lu, Fu-Hao; Kwon, Soon-Wook; Yoon, Min-Young; Kim, Ki-Taek; Cho, Myeong-Cheoul; Yoon, Moo-Kyung; Park, Yong-Jin

    2012-01-01

    Red pepper, Capsicum annuum L., has been attracting geneticists’ and breeders’ attention as one of the important agronomic crops. This study was to integrate 41 SNP markers newly developed from comparative transcriptomes into a previous linkage map, and map 12 agronomic and morphological traits into the integrated map. A total of 39 markers found precise position and were assigned to 13 linkage groups (LGs) as well as the unassigned LGe, leading to total 458 molecular markers present in this genetic map. Linkage mapping was supported by the physical mapping to tomato and potato genomes using BLAST retrieving, revealing at least two-thirds of the markers mapped to the corresponding LGs. A sum of 23 quantitative trait loci from 11 traits was detected using the composite interval mapping algorithm. A consistent interval between a035_1 and a170_1 on LG5 was detected as a main-effect locus among the resistance QTLs to Phytophthora capsici at high-, intermediate- and low-level tests, and interactions between the QTLs for high-level resistance test were found. Considering the epistatic effect, those QTLs could explain up to 98.25% of the phenotype variations of resistance. Moreover, 17 QTLs for another eight traits were found to locate on LG3, 4, and 12 mostly with varying phenotypic contribution. Furthermore, the locus for corolla color was mapped to LG10 as a marker. The integrated map and the QTLs identified would be helpful for current genetics research and crop breeding, especially in the Solanaceae family. PMID:22684870

  19. SNP Arrays

    Directory of Open Access Journals (Sweden)

    Jari Louhelainen

    2016-10-01

    Full Text Available The papers published in this Special Issue “SNP arrays” (Single Nucleotide Polymorphism Arrays focus on several perspectives associated with arrays of this type. The range of papers vary from a case report to reviews, thereby targeting wider audiences working in this field. The research focus of SNP arrays is often human cancers but this Issue expands that focus to include areas such as rare conditions, animal breeding and bioinformatics tools. Given the limited scope, the spectrum of papers is nothing short of remarkable and even from a technical point of view these papers will contribute to the field at a general level. Three of the papers published in this Special Issue focus on the use of various SNP array approaches in the analysis of three different cancer types. Two of the papers concentrate on two very different rare conditions, applying the SNP arrays slightly differently. Finally, two other papers evaluate the use of the SNP arrays in the context of genetic analysis of livestock. The findings reported in these papers help to close gaps in the current literature and also to give guidelines for future applications of SNP arrays.

  20. Use of different marker pre-selection methods based on single SNP regression in the estimation of Genomic-EBVs

    Directory of Open Access Journals (Sweden)

    Corrado Dimauro

    2010-01-01

    Full Text Available Two methods of SNPs pre-selection based on single marker regression for the estimation of genomic breeding values (G-EBVs were compared using simulated data provided by the XII QTL-MAS workshop: i Bonferroni correction of the significance threshold and ii Permutation test to obtain the reference distribution of the null hypothesis and identify significant markers at P<0.01 and P<0.001 significance thresholds. From the set of markers significant at P<0.001, random subsets of 50% and 25% markers were extracted, to evaluate the effect of further reducing the number of significant SNPs on G-EBV predictions. The Bonferroni correction method allowed the identification of 595 significant SNPs that gave the best G-EBV accuracies in prediction generations (82.80%. The permutation methods gave slightly lower G-EBV accuracies even if a larger number of SNPs resulted significant (2,053 and 1,352 for 0.01 and 0.001 significance thresholds, respectively. Interestingly, halving or dividing by four the number of SNPs significant at P<0.001 resulted in an only slightly decrease of G-EBV accuracies. The genetic structure of the simulated population with few QTL carrying large effects, might have favoured the Bonferroni method.

  1. Candidate Gene Identification with SNP Marker-Based Fine Mapping of Anthracnose Resistance Gene Co-4 in Common Bean.

    Science.gov (United States)

    Burt, Andrew J; William, H Manilal; Perry, Gregory; Khanal, Raja; Pauls, K Peter; Kelly, James D; Navabi, Alireza

    2015-01-01

    Anthracnose, caused by Colletotrichum lindemuthianum, is an important fungal disease of common bean (Phaseolus vulgaris). Alleles at the Co-4 locus confer resistance to a number of races of C. lindemuthianum. A population of 94 F4:5 recombinant inbred lines of a cross between resistant black bean genotype B09197 and susceptible navy bean cultivar Nautica was used to identify markers associated with resistance in bean chromosome 8 (Pv08) where Co-4 is localized. Three SCAR markers with known linkage to Co-4 and a panel of single nucleotide markers were used for genotyping. A refined physical region on Pv08 with significant association with anthracnose resistance identified by markers was used in BLAST searches with the genomic sequence of common bean accession G19833. Thirty two unique annotated candidate genes were identified that spanned a physical region of 936.46 kb. A majority of the annotated genes identified had functional similarity to leucine rich repeats/receptor like kinase domains. Three annotated genes had similarity to 1, 3-β-glucanase domains. There were sequence similarities between some of the annotated genes found in the study and the genes associated with phosphoinositide-specific phosphilipases C associated with Co-x and the COK-4 loci found in previous studies. It is possible that the Co-4 locus is structured as a group of genes with functional domains dominated by protein tyrosine kinase along with leucine rich repeats/nucleotide binding site, phosphilipases C as well as β-glucanases.

  2. Prediction of heterosis using genome-wide SNP-marker data: application to egg production traits in white Leghorn crosses

    NARCIS (Netherlands)

    Amuzu-Aweh, E.N.; Bijma, P.; Kinghorn, B.P.; verreijken, A.; Arendonk, van J.A.M.; Bovenhuis, H.

    2013-01-01

    Prediction of heterosis has a long history with mixed success, partly due to low numbers of genetic markers and/or small data sets. We investigated the prediction of heterosis for egg number, egg weight and survival days in domestic white Leghorns, using ~400¿000 individuals from 47 crosses and

  3. GWAS and Genomic Prediction Based on Markers of SNP-CHIPS and Sequence Data in Cattle Populations

    DEFF Research Database (Denmark)

    Wu, Xiaoping

    This thesis investigated the methods and models for genome wide association study and genomic prediction. The main conclusions are: 1) The power of QTL detection can be increased by increasing marker densities, and the Bayesian variable selection model together with the analysis of the QTL intens...

  4. SNP discovery and development of genetic markers for mapping immune response genes in common carp (Cyprinus carpio)

    Science.gov (United States)

    Single nucleotide polymorphisms (SNPs) in immune response genes have been reported as markers for susceptibility to infectious diseases in human and livestock. A disease caused by cyprinid herpesvirus 3 (CyHV-3) is highly contagious and virulent in common carp (Cyprinus carpio). With the aim to de...

  5. Development of single nucleotide polymorphism (SNP) markers from the mango (Mangiferaindica) transcriptome for mapping and estimation of genetic diversity

    Science.gov (United States)

    The development of resources for genomic studies in Mangifera indica (mango) will allow marker-assisted selection and identification of genetically diverse germplasm, greatly aiding mango breeding programs. We report here a first step in developing such resources, our identification of thousands una...

  6. Candidate Gene Identification with SNP Marker-Based Fine Mapping of Anthracnose Resistance Gene Co-4 in Common Bean.

    Directory of Open Access Journals (Sweden)

    Andrew J Burt

    Full Text Available Anthracnose, caused by Colletotrichum lindemuthianum, is an important fungal disease of common bean (Phaseolus vulgaris. Alleles at the Co-4 locus confer resistance to a number of races of C. lindemuthianum. A population of 94 F4:5 recombinant inbred lines of a cross between resistant black bean genotype B09197 and susceptible navy bean cultivar Nautica was used to identify markers associated with resistance in bean chromosome 8 (Pv08 where Co-4 is localized. Three SCAR markers with known linkage to Co-4 and a panel of single nucleotide markers were used for genotyping. A refined physical region on Pv08 with significant association with anthracnose resistance identified by markers was used in BLAST searches with the genomic sequence of common bean accession G19833. Thirty two unique annotated candidate genes were identified that spanned a physical region of 936.46 kb. A majority of the annotated genes identified had functional similarity to leucine rich repeats/receptor like kinase domains. Three annotated genes had similarity to 1, 3-β-glucanase domains. There were sequence similarities between some of the annotated genes found in the study and the genes associated with phosphoinositide-specific phosphilipases C associated with Co-x and the COK-4 loci found in previous studies. It is possible that the Co-4 locus is structured as a group of genes with functional domains dominated by protein tyrosine kinase along with leucine rich repeats/nucleotide binding site, phosphilipases C as well as β-glucanases.

  7. Gains in QTL detection using an ultra-high density SNP map based on population sequencing relative to traditional RFLP/SSR markers.

    Directory of Open Access Journals (Sweden)

    Huihui Yu

    Full Text Available Huge efforts have been invested in the last two decades to dissect the genetic bases of complex traits including yields of many crop plants, through quantitative trait locus (QTL analyses. However, almost all the studies were based on linkage maps constructed using low-throughput molecular markers, e.g. restriction fragment length polymorphisms (RFLPs and simple sequence repeats (SSRs, thus are mostly of low density and not able to provide precise and complete information about the numbers and locations of the genes or QTLs controlling the traits. In this study, we constructed an ultra-high density genetic map based on high quality single nucleotide polymorphisms (SNPs from low-coverage sequences of a recombinant inbred line (RIL population of rice, generated using new sequencing technology. The quality of the map was assessed by validating the positions of several cloned genes including GS3 and GW5/qSW5, two major QTLs for grain length and grain width respectively, and OsC1, a qualitative trait locus for pigmentation. In all the cases the loci could be precisely resolved to the bins where the genes are located, indicating high quality and accuracy of the map. The SNP map was used to perform QTL analysis for yield and three yield-component traits, number of tillers per plant, number of grains per panicle and grain weight, using data from field trials conducted over years, in comparison to QTL mapping based on RFLPs/SSRs. The SNP map detected more QTLs especially for grain weight, with precise map locations, demonstrating advantages in detecting power and resolution relative to the RFLP/SSR map. Thus this study provided an example for ultra-high density map construction using sequencing technology. Moreover, the results obtained are helpful for understanding the genetic bases of the yield traits and for fine mapping and cloning of QTLs.

  8. A sweetpotato gene index established by de novo assembly of pyrosequencing and Sanger sequences and mining for gene-based microsatellite markers

    Directory of Open Access Journals (Sweden)

    Solis Julio

    2010-10-01

    Full Text Available Abstract Background Sweetpotato (Ipomoea batatas (L. Lam., a hexaploid outcrossing crop, is an important staple and food security crop in developing countries in Africa and Asia. The availability of genomic resources for sweetpotato is in striking contrast to its importance for human nutrition. Previously existing sequence data were restricted to around 22,000 expressed sequence tag (EST sequences and ~ 1,500 GenBank sequences. We have used 454 pyrosequencing to augment the available gene sequence information to enhance functional genomics and marker design for this plant species. Results Two quarter 454 pyrosequencing runs used two normalized cDNA collections from stems and leaves from drought-stressed sweetpotato clone Tanzania and yielded 524,209 reads, which were assembled together with 22,094 publically available expressed sequence tags into 31,685 sets of overlapping DNA segments and 34,733 unassembled sequences. Blastx comparisons with the UniRef100 database allowed annotation of 23,957 contigs and 15,342 singletons resulting in 24,657 putatively unique genes. Further, 27,119 sequences had no match to protein sequences of UniRef100database. On the basis of this gene index, we have identified 1,661 gene-based microsatellite sequences, of which 223 were selected for testing and 195 were successfully amplified in a test panel of 6 hexaploid (I. batatas and 2 diploid (I. trifida accessions. Conclusions The sweetpotato gene index is a useful source for functionally annotated sweetpotato gene sequences that contains three times more gene sequence information for sweetpotato than previous EST assemblies. A searchable version of the gene index, including a blastn function, is available at http://www.cipotato.org/sweetpotato_gene_index.

  9. Multiple SNP markers reveal fine-scale population and deep phylogeographic structure in European anchovy (Engraulis encrasicolus L.).

    KAUST Repository

    Zarraonaindia, Iratxe

    2012-07-30

    Geographic surveys of allozymes, microsatellites, nuclear DNA (nDNA) and mitochondrial DNA (mtDNA) have detected several genetic subdivisions among European anchovy populations. However, these studies have been limited in their power to detect some aspects of population structure by the use of a single or a few molecular markers, or by limited geographic sampling. We use a multi-marker approach, 47 nDNA and 15 mtDNA single nucleotide polymorphisms (SNPs), to analyze 626 European anchovies from the whole range of the species to resolve shallow and deep levels of population structure. Nuclear SNPs define 10 genetic entities within two larger genetically distinctive groups associated with oceanic variables and different life-history traits. MtDNA SNPs define two deep phylogroups that reflect ancient dispersals and colonizations. These markers define two ecological groups. One major group of Iberian-Atlantic populations is associated with upwelling areas on narrow continental shelves and includes populations spawning and overwintering in coastal areas. A second major group includes northern populations in the North East (NE) Atlantic (including the Bay of Biscay) and the Mediterranean and is associated with wide continental shelves with local larval retention currents. This group tends to spawn and overwinter in oceanic areas. These two groups encompass ten populations that differ from previously defined management stocks in the Alboran Sea, Iberian-Atlantic and Bay of Biscay regions. In addition, a new North Sea-English Channel stock is defined. SNPs indicate that some populations in the Bay of Biscay are genetically closer to North Western (NW) Mediterranean populations than to other populations in the NE Atlantic, likely due to colonizations of the Bay of Biscay and NW Mediterranean by migrants from a common ancestral population. Northern NE Atlantic populations were subsequently established by migrants from the Bay of Biscay. Populations along the Iberian

  10. Identification of Single Nucleotide Polymorphism (SNP in Mono Amine Oxidase A (MAO-A Gene as a genetic marker for aggressiveness in sheep

    Directory of Open Access Journals (Sweden)

    Eko Handiwirawan

    2012-12-01

    Full Text Available In the population, there are aggressive sheep in a small number which requires special management those specific animal house and routine management. The purpose of this study was to identify the variation of DNA marker SNP (single nucleotide polymorphism as a genetic marker for the aggressive trait in several of sheep breed. The identification of point mutations in exon 8 of MAO-A gene associated with aggressive behavior in sheep may be further useful to become of DNA markers for the aggressive trait in sheep. Five of sheep breed were used, i.e.: Barbados Black belly Cross sheep (BC, Composite Garut (KG, Local Garut (LG, Composite Sumatra (KS and St. Cross Croix (SC. Duration of ten behavior traits, blood serotonin concentrations and DNA sequence of exon 8 of MAO-A gene from the sheep aggressive and nonaggressive were observed. PROC GLM of SAS Ver. 9.0 program was used to analyze variable behavior and blood serotonin concentrations. DNA polymorphism in exon 8 of MAO-A gene was analyzed using the MEGA software Ver. 4.0. The results show that the percentage of the aggressive rams of each breed was less than 10 percent; except for the KS sheep is higher (23%. Based on the duration of behavior, aggressive sheep group was not significantly different with non aggressive sheep group, except duration of care giving and drinking behavior. It is known that concentration of blood serotonin in aggressive and non aggressive rams was not significantly different. The aggressive trait in sheep has a mechanism or a different cause like that occurs in mice and humans. In this study, aggressive behavior in sheep was not associated with a mutation in exon 8 of MAO-A gene.

  11. A high-density SNP genetic linkage map for the silver-lipped pearl oyster, Pinctada maxima: a valuable resource for gene localisation and marker-assisted selection.

    Science.gov (United States)

    Jones, David B; Jerry, Dean R; Khatkar, Mehar S; Raadsma, Herman W; Zenger, Kyall R

    2013-11-20

    The silver-lipped pearl oyster, Pinctada maxima, is an important tropical aquaculture species extensively farmed for the highly sought "South Sea" pearls. Traditional breeding programs have been initiated for this species in order to select for improved pearl quality, but many economic traits under selection are complex, polygenic and confounded with environmental factors, limiting the accuracy of selection. The incorporation of a marker-assisted selection (MAS) breeding approach would greatly benefit pearl breeding programs by allowing the direct selection of genes responsible for pearl quality. However, before MAS can be incorporated, substantial genomic resources such as genetic linkage maps need to be generated. The construction of a high-density genetic linkage map for P. maxima is not only essential for unravelling the genomic architecture of complex pearl quality traits, but also provides indispensable information on the genome structure of pearl oysters. A total of 1,189 informative genome-wide single nucleotide polymorphisms (SNPs) were incorporated into linkage map construction. The final linkage map consisted of 887 SNPs in 14 linkage groups, spans a total genetic distance of 831.7 centimorgans (cM), and covers an estimated 96% of the P. maxima genome. Assessment of sex-specific recombination across all linkage groups revealed limited overall heterochiasmy between the sexes (i.e. 1.15:1 F/M map length ratio). However, there were pronounced localised differences throughout the linkage groups, whereby male recombination was suppressed near the centromeres compared to female recombination, but inflated towards telomeric regions. Mean values of LD for adjacent SNP pairs suggest that a higher density of markers will be required for powerful genome-wide association studies. Finally, numerous nacre biomineralization genes were localised providing novel positional information for these genes. This high-density SNP genetic map is the first comprehensive linkage

  12. Differentiation of Populus species using chloroplast single nucleotide polymorphism (SNP) markers--essential for comprehensible and reliable poplar breeding.

    Science.gov (United States)

    Schroeder, H; Hoeltken, A M; Fladung, M

    2012-03-01

    Within the genus Populus several species belonging to different sections are cross-compatible. Hence, high numbers of interspecies hybrids occur naturally and, additionally, have been artificially produced in huge breeding programmes during the last 100 years. Therefore, determination of a single poplar species, used for the production of 'multi-species hybrids' is often difficult, and represents a great challenge for the use of molecular markers in species identification. Within this study, over 20 chloroplast regions, both intergenic spacers and coding regions, have been tested for their ability to differentiate different poplar species using 23 already published barcoding primer combinations and 17 newly designed primer combinations. About half of the published barcoding primers yielded amplification products, whereas the new primers designed on the basis of the total sequenced cpDNA genome of Populus trichocarpa Torr. & Gray yielded much higher amplification success. Intergenic spacers were found to be more variable than coding regions within the genus Populus. The highest discrimination power of Populus species was found in the combination of two intergenic spacers (trnG-psbK, psbK-psbl) and the coding region rpoC. In barcoding projects, the coding regions matK and rbcL are often recommended, but within the genus Populus they only show moderate variability and are not efficient in species discrimination. © 2011 German Botanical Society and The Royal Botanical Society of the Netherlands.

  13. Genetic diversity and population structure assessed by SSR and SNP markers in a large germplasm collection of grape

    Science.gov (United States)

    2013-01-01

    Background The economic importance of grapevine has driven significant efforts in genomics to accelerate the exploitation of Vitis resources for development of new cultivars. However, although a large number of clonally propagated accessions are maintained in grape germplasm collections worldwide, their use for crop improvement is limited by the scarcity of information on genetic diversity, population structure and proper phenotypic assessment. The identification of representative and manageable subset of accessions would facilitate access to the diversity available in large collections. A genome-wide germplasm characterization using molecular markers can offer reliable tools for adjusting the quality and representativeness of such core samples. Results We investigated patterns of molecular diversity at 22 common microsatellite loci and 384 single nucleotide polymorphisms (SNPs) in 2273 accessions of domesticated grapevine V. vinifera ssp. sativa, its wild relative V. vinifera ssp. sylvestris, interspecific hybrid cultivars and rootstocks. Despite the large number of putative duplicates and extensive clonal relationships among the accessions, we observed high level of genetic variation. In the total germplasm collection the average genetic diversity, as quantified by the expected heterozygosity, was higher for SSR loci (0.81) than for SNPs (0.34). The analysis of the genetic structure in the grape germplasm collection revealed several levels of stratification. The primary division was between accessions of V. vinifera and non-vinifera, followed by the distinction between wild and domesticated grapevine. Intra-specific subgroups were detected within cultivated grapevine representing different eco-geographic groups. The comparison of a phenological core collection and genetic core collections showed that the latter retained more genetic diversity, while maintaining a similar phenotypic variability. Conclusions The comprehensive molecular characterization of our grape

  14. Genic SNP markers and legume synteny reveal candidate genes underlying QTL for Macrophomina phaseolina resistance and maturity in cowpea [Vigna unguiculata (L Walp.

    Directory of Open Access Journals (Sweden)

    Ehlers Jeffrey D

    2011-01-01

    Full Text Available Abstract Background Macrophomina phaseolina is an emerging and devastating fungal pathogen that causes significant losses in crop production under high temperatures and drought stress. An increasing number of disease incidence reports highlight the wide prevalence of the pathogen around the world and its contribution toward crop yield suppression. In cowpea [Vigna unguiculata (L Walp.], limited sources of low-level host resistance have been identified, the genetic basis of which is unknown. In this study we report on the identification of strong sources of host resistance to M. phaseolina and the genetic mapping of putative resistance loci on a cowpea genetic map comprised of gene-derived single nucleotide polymorphisms (SNPs and amplified fragment length polymorphisms (AFLPs. Results Nine quantitative trait loci (QTLs, accounting for between 6.1 and 40.0% of the phenotypic variance (R2, were identified using plant mortality data taken over three years in field experiments and disease severity scores taken from two greenhouse experiments. Based on annotated genic SNPs as well as synteny with soybean (Glycine max and Medicago truncatula, candidate resistance genes were found within mapped QTL intervals. QTL Mac-2 explained the largest percent R2 and was identified in three field and one greenhouse experiments where the QTL peak co-located with a SNP marker derived from a pectin esterase inhibitor encoding gene. Maturity effects on the expression of resistance were indicated by the co-location of Mac-6 and Mac-7 QTLs with maturity-related senescence QTLs Mat-2 and Mat-1, respectively. Homologs of the ELF4 and FLK flowering genes were found in corresponding syntenic soybean regions. Only three Macrophomina resistance QTLs co-located with delayed drought-induced premature senescence QTLs previously mapped in the same population, suggesting that largely different genetic mechanisms mediate cowpea response to drought stress and Macrophomina infection

  15. Candidate SNP markers of aggressiveness-related complications and comorbidities of genetic diseases are predicted by a significant change in the affinity of TATA-binding protein for human gene promoters.

    Science.gov (United States)

    Chadaeva, Irina V; Ponomarenko, Mikhail P; Rasskazov, Dmitry A; Sharypova, Ekaterina B; Kashina, Elena V; Matveeva, Marina Yu; Arshinova, Tatjana V; Ponomarenko, Petr M; Arkova, Olga V; Bondar, Natalia P; Savinkova, Ludmila K; Kolchanov, Nikolay A

    2016-12-28

    Aggressiveness in humans is a hereditary behavioral trait that mobilizes all systems of the body-first of all, the nervous and endocrine systems, and then the respiratory, vascular, muscular, and others-e.g., for the defense of oneself, children, family, shelter, territory, and other possessions as well as personal interests. The level of aggressiveness of a person determines many other characteristics of quality of life and lifespan, acting as a stress factor. Aggressive behavior depends on many parameters such as age, gender, diseases and treatment, diet, and environmental conditions. Among them, genetic factors are believed to be the main parameters that are well-studied at the factual level, but in actuality, genome-wide studies of aggressive behavior appeared relatively recently. One of the biggest projects of the modern science-1000 Genomes-involves identification of single nucleotide polymorphisms (SNPs), i.e., differences of individual genomes from the reference genome. SNPs can be associated with hereditary diseases, their complications, comorbidities, and responses to stress or a drug. Clinical comparisons between cohorts of patients and healthy volunteers (as a control) allow for identifying SNPs whose allele frequencies significantly separate them from one another as markers of the above conditions. Computer-based preliminary analysis of millions of SNPs detected by the 1000 Genomes project can accelerate clinical search for SNP markers due to preliminary whole-genome search for the most meaningful candidate SNP markers and discarding of neutral and poorly substantiated SNPs. Here, we combine two computer-based search methods for SNPs (that alter gene expression) {i} Web service SNP_TATA_Comparator (DNA sequence analysis) and {ii} PubMed-based manual search for articles on aggressiveness using heuristic keywords. Near the known binding sites for TATA-binding protein (TBP) in human gene promoters, we found aggressiveness-related candidate SNP markers

  16. Snap: an integrated SNP annotation platform

    DEFF Research Database (Denmark)

    Li, Shengting; Ma, Lijia; Li, Heng

    2007-01-01

    Snap (Single Nucleotide Polymorphism Annotation Platform) is a server designed to comprehensively analyze single genes and relationships between genes basing on SNPs in the human genome. The aim of the platform is to facilitate the study of SNP finding and analysis within the framework of medical...

  17. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    NARCIS (Netherlands)

    Chagné, D.; Crowhurst, R.N.; Troggio, M.; Davey, M.W.; Gilmore, B.; Lawley, C.; Vanderzande, S.; Hellens, R.P.; Kumar, S.; Cestaro, A.; Velasco, R.; Main, D.; Rees, J.D.; Iezzoni, A.F.; Mockler, T.; Wilhelm, L.; Weg, van de W.E.; Gardiner, S.E.; Bassil, N.; Peace, C.

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide

  18. Empirical evaluation of DArT, SNP, and SSR marker-systems for genotyping, clustering, and assigning sugar beet hybrid varieties into populations

    NARCIS (Netherlands)

    Simko, I.; Eujayl, I.; Hintum, van T.J.L.

    2012-01-01

    Dominant and co-dominant molecular markers are routinely used in plant genetic research. In the present study we assessed the success-rate of three marker-systems for estimating genotypic diversity, clustering varieties into populations, and assigning a single variety into the expected population. A

  19. Empirical evaluation of DArT, SNP, and SSR marker-systems for genotyping, clustering, and assigning sugar beet hybrid varieties into populations

    Science.gov (United States)

    Dominant and co-dominant molecular markers are routinely used in plant genetic diversity research. In the present study we assessed the success-rate of three marker-systems for estimating genotypic diversity, clustering varieties into populations, and assigning a single variety into the expected pop...

  20. Using SNP markers to dissect linkage disequilibrium at a major quantitative trait locus for resistance to the potato cyst nematode Globodera pallida on potato chromosome V

    NARCIS (Netherlands)

    Achenbach, U.; Caldas Paulo, M.J.; Ilarionova, E.; Lübeck, J.; Strahwald, J.; Tacke, E.; Hofferbert, H.R.

    2009-01-01

    The damage caused by the parasitic root cyst nematode Globodera pallida is a major yield-limiting factor in potato cultivation . Breeding for resistance is facilitated by the PCR-based marker 'HC', which is diagnostic for an allele conferring high resistance against G. pallida pathotype Pa2/3 that

  1. Development of Molecular Markers Linked to Powdery Mildew Resistance Gene Pm4b by Combining SNP Discovery from Transcriptome Sequencing Data with Bulked Segregant Analysis (BSR-Seq) in Wheat.

    Science.gov (United States)

    Wu, Peipei; Xie, Jingzhong; Hu, Jinghuang; Qiu, Dan; Liu, Zhiyong; Li, Jingting; Li, Miaomiao; Zhang, Hongjun; Yang, Li; Liu, Hongwei; Zhou, Yang; Zhang, Zhongjun; Li, Hongjie

    2018-01-01

    Powdery mildew resistance gene Pm4b , originating from Triticum persicum , is effective against the prevalent Blumeria graminis f. sp. tritici ( Bgt ) isolates from certain regions of wheat production in China. The lack of tightly linked molecular markers with the target gene prevents the precise identification of Pm4b during the application of molecular marker-assisted selection (MAS). The strategy that combines the RNA-Seq technique and the bulked segregant analysis (BSR-Seq) was applied in an F 2:3 mapping population (237 families) derived from a pair of isogenic lines VPM1/7 ∗ Bainong 3217 F 4 (carrying Pm4b ) and Bainong 3217 to develop more closely linked molecular markers. RNA-Seq analysis of the two phenotypically contrasting RNA bulks prepared from the representative F 2:3 families generated 20,745,939 and 25,867,480 high-quality read pairs, and 82.8 and 80.2% of them were uniquely mapped to the wheat whole genome draft assembly for the resistant and susceptible RNA bulks, respectively. Variant calling identified 283,866 raw single nucleotide polymorphisms (SNPs) and InDels between the two bulks. The SNPs that were closely associated with the powdery mildew resistance were concentrated on chromosome 2AL. Among the 84 variants that were potentially associated with the disease resistance trait, 46 variants were enriched in an about 25 Mb region at the distal end of chromosome arm 2AL. Four Pm4b -linked SNP markers were developed from these variants. Based on the sequences of Chinese Spring where these polymorphic SNPs were located, 98 SSR primer pairs were designed to develop distal markers flanking the Pm4b gene. Three SSR markers, Xics13 , Xics43 , and Xics76 , were incorporated in the new genetic linkage map, which located Pm4b in a 3.0 cM genetic interval spanning a 6.7 Mb physical genomic region. This region had a collinear relationship with Brachypodium distachyon chromosome 5, rice chromosome 4, and sorghum chromosome 6. Seven genes associated with

  2. Development of Molecular Markers Linked to Powdery Mildew Resistance Gene Pm4b by Combining SNP Discovery from Transcriptome Sequencing Data with Bulked Segregant Analysis (BSR-Seq in Wheat

    Directory of Open Access Journals (Sweden)

    Peipei Wu

    2018-02-01

    Full Text Available Powdery mildew resistance gene Pm4b, originating from Triticum persicum, is effective against the prevalent Blumeria graminis f. sp. tritici (Bgt isolates from certain regions of wheat production in China. The lack of tightly linked molecular markers with the target gene prevents the precise identification of Pm4b during the application of molecular marker-assisted selection (MAS. The strategy that combines the RNA-Seq technique and the bulked segregant analysis (BSR-Seq was applied in an F2:3 mapping population (237 families derived from a pair of isogenic lines VPM1/7∗Bainong 3217 F4 (carrying Pm4b and Bainong 3217 to develop more closely linked molecular markers. RNA-Seq analysis of the two phenotypically contrasting RNA bulks prepared from the representative F2:3 families generated 20,745,939 and 25,867,480 high-quality read pairs, and 82.8 and 80.2% of them were uniquely mapped to the wheat whole genome draft assembly for the resistant and susceptible RNA bulks, respectively. Variant calling identified 283,866 raw single nucleotide polymorphisms (SNPs and InDels between the two bulks. The SNPs that were closely associated with the powdery mildew resistance were concentrated on chromosome 2AL. Among the 84 variants that were potentially associated with the disease resistance trait, 46 variants were enriched in an about 25 Mb region at the distal end of chromosome arm 2AL. Four Pm4b-linked SNP markers were developed from these variants. Based on the sequences of Chinese Spring where these polymorphic SNPs were located, 98 SSR primer pairs were designed to develop distal markers flanking the Pm4b gene. Three SSR markers, Xics13, Xics43, and Xics76, were incorporated in the new genetic linkage map, which located Pm4b in a 3.0 cM genetic interval spanning a 6.7 Mb physical genomic region. This region had a collinear relationship with Brachypodium distachyon chromosome 5, rice chromosome 4, and sorghum chromosome 6. Seven genes associated with

  3. Towards a molecular taxonomic key of the Aurantioideae subfamily using chloroplastic SNP diagnostic markers of the main clades genotyped by competitive allele-specific PCR.

    Science.gov (United States)

    Oueslati, Amel; Ollitrault, Frederique; Baraket, Ghada; Salhi-Hannachi, Amel; Navarro, Luis; Ollitrault, Patrick

    2016-08-18

    Chloroplast DNA is a primary source of molecular variations for phylogenetic analysis of photosynthetic eukaryotes. However, the sequencing and analysis of multiple chloroplastic regions is difficult to apply to large collections or large samples of natural populations. The objective of our work was to demonstrate that a molecular taxonomic key based on easy, scalable and low-cost genotyping method should be developed from a set of Single Nucleotide Polymorphisms (SNPs) diagnostic of well-established clades. It was applied to the Aurantioideae subfamily, the largest group of the Rutaceae family that includes the cultivated citrus species. The publicly available nucleotide sequences of eight plastid genomic regions were compared for 79 accessions of the Aurantioideae subfamily to search for SNPs revealing taxonomic differentiation at the inter-tribe, inter-subtribe, inter-genus and interspecific levels. Diagnostic SNPs (DSNPs) were found for 46 of the 54 clade levels analysed. Forty DSNPs were selected to develop KASPar markers and their taxonomic value was tested by genotyping 108 accessions of the Aurantioideae subfamily. Twenty-seven markers diagnostic of 24 clades were validated and they displayed a very high rate of transferability in the Aurantioideae subfamily (only 1.2 % of missing data on average). The UPGMA from the validated markers produced a cladistic organisation that was highly coherent with the previous phylogenetic analysis based on the sequence data of the eight plasmid regions. In particular, the monophyletic origin of the "true citrus" genera plus Oxanthera was validated. However, some clarification remains necessary regarding the organisation of the other wild species of the Citreae tribe. We validated the concept that with well-established clades, DSNPs can be selected and efficiently transformed into competitive allele-specific PCR markers (KASPar method) allowing cost-effective highly efficient cladistic analysis in large collections at

  4. SNP interaction pattern identifier (SIPI)

    DEFF Research Database (Denmark)

    Lin, Hui Yi; Chen, Dung Tsa; Huang, Po Yu

    2017-01-01

    Motivation: Testing SNP-SNP interactions is considered as a key for overcoming bottlenecks of genetic association studies. However, related statistical methods for testing SNP-SNP interactions are underdeveloped. Results: We propose the SNP Interaction Pattern Identifier (SIPI), which tests 45...

  5. Markers

    Science.gov (United States)

    Healthy Schools Network, Inc., 2011

    2011-01-01

    Dry erase whiteboards come with toxic dry erase markers and toxic cleaning products. Dry erase markers labeled "nontoxic" are not free of toxic chemicals and can cause health problems. Children are especially vulnerable to environmental health hazards; moreover, schools commonly have problems with indoor air pollution, as they are more densely…

  6. Combined use of a new SNP-based assay and multilocus SSR markers to assess genetic diversity of Xylella fastidiosa subsp. pauca infecting citrus and coffee plants.

    Science.gov (United States)

    Montes-Borrego, Miguel; Lopes, Joao R S; Jiménez-Díaz, Rafael M; Landa, Blanca B

    2015-03-01

    Two haplotypes of Xylella fastidiosa subsp. pauca (Xfp) that correlated with their host of origin were identified in a collection of 90 isolates infecting citrus and coffee plants in Brazil, based on a single-nucleotide polymorphism in the gyrB sequence. A new single-nucleotide primer extension (SNuPE) protocol was designed for rapid identification of Xfp according to the host source. The protocol proved to be robust for the prediction of the Xfp host source in blind tests using DNA from cultures of the bacterium, infected plants, and insect vectors allowed to feed on Xfp-infected citrus plants. AMOVA and STRUCTURE analyses of microsatellite data separated most Xfp populations on the basis of their host source, indicating that they were genetically distinct. The combined use of the SNaPshot protocol and three previously developed multilocus SSR markers showed that two haplotypes and distinct isolates of Xfp infect citrus and coffee in Brazil and that multiple, genetically different isolates can be present in a single orchard or infect a single tree. This combined approach will be very useful in studies of the epidemiology of Xfp-induced diseases, host specificity of bacterial genotypes, the occurrence of Xfp host jumping, vector feeding habits, etc., in economically important cultivated plants or weed host reservoirs of Xfp in Brazil and elsewhere. Copyright© by the Spanish Society for Microbiology and Institute for Catalan Studies.

  7. dbSNP

    Data.gov (United States)

    U.S. Department of Health & Human Services — dbSNP is a database of single nucleotide polymorphisms (SNPs) and multiple small-scale variations that include insertions/deletions, microsatellites, and...

  8. Rice genetic marker database: An identification of single nucleotide ...

    African Journals Online (AJOL)

    based genetic marker system to provide information about SNP and QTL markers in rice. The SNP marker database provides 7,227 SNP markers including location information on chromosomes by using genetic map. It allows users to access a ...

  9. Evaluation of the Ion Torrent™ HID SNP 169-plex

    DEFF Research Database (Denmark)

    Børsting, Claus; Fordyce, Sarah L; Olofsson, Jill Katharina

    2014-01-01

    The Ion Torrent™ HID SNP assay amplified 136 autosomal SNPs and 33 Y-chromosome markers in one PCR and the markers were subsequently typed using the Ion PGM™ second generation sequencing platform. A total of 51 of the autosomal SNPs were selected from the SNPforID panel that is routinely used...... in our ISO 17025 accredited laboratory. Concordance between the Ion Torrent™ HID SNP assay and the SNPforID assay was tested by typing 44 Iraqis twice with the Ion Torrent™ HID SNP assay. The same samples were previously typed with the SNPforID assay and the Y-chromosome haplogroups of the individuals...

  10. SNP Polymorphism Survey of the Parental Lines of ISRA Sorghum Breeding Program as Part of the Feed the Future

    Data.gov (United States)

    US Agency for International Development — Polymorphism of SNP Markers (single nucleotide polymorphisms) was assessed on 24 parental lines of the ISRA sorghum breeding program . About 1300 SNP have been used...

  11. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    Science.gov (United States)

    Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718

  12. Genome-wide SNP detection, validation, and development of an 8K SNP array for apple.

    Directory of Open Access Journals (Sweden)

    David Chagné

    Full Text Available As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of 'Golden Delicious', SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional, and genomic selection in apple.

  13. Development and application of a 20K SNP array in potato

    NARCIS (Netherlands)

    Vos, Peter

    2016-01-01

    In this thesis the results are described of investigations of various application of genome wide SNP (single nucleotide polymorphism) markers. The set of SNP markers was identified by GBS (genotyping by sequencing) strategy. The resulting dataset of 129,156 SNPs across 83 tetraploid varieties was

  14. Large SNP arrays for genotyping in crop plants

    Indian Academy of Sciences (India)

    Genotyping with large numbers of molecular markers is now an indispensable tool within plant genetics and breeding. Especially through the identification of large numbers of single nucleotide polymorphism (SNP) markers using the novel high-throughput sequencing technologies, it is now possible to reliably identify many ...

  15. High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster).

    Science.gov (United States)

    Plomion, C; Bartholomé, J; Lesur, I; Boury, C; Rodríguez-Quilón, I; Lagraulet, H; Ehrenmann, F; Bouffier, L; Gion, J M; Grivet, D; de Miguel, M; de María, N; Cervera, M T; Bagnoli, F; Isik, F; Vendramin, G G; González-Martínez, S C

    2016-03-01

    Maritime pine provides essential ecosystem services in the south-western Mediterranean basin, where it covers around 4 million ha. Its scattered distribution over a range of environmental conditions makes it an ideal forest tree species for studies of local adaptation and evolutionary responses to climatic change. Highly multiplexed single nucleotide polymorphism (SNP) genotyping arrays are increasingly used to study genetic variation in living organisms and for practical applications in plant and animal breeding and genetic resource conservation. We developed a 9k Illumina Infinium SNP array and genotyped maritime pine trees from (i) a three-generation inbred (F2) pedigree, (ii) the French breeding population and (iii) natural populations from Portugal and the French Atlantic coast. A large proportion of the exploitable SNPs (2052/8410, i.e. 24.4%) segregated in the mapping population and could be mapped, providing the densest ever gene-based linkage map for this species. Based on 5016 SNPs, natural and breeding populations from the French gene pool exhibited similar level of genetic diversity. Population genetics and structure analyses based on 3981 SNP markers common to the Portuguese and French gene pools revealed high levels of differentiation, leading to the identification of a set of highly differentiated SNPs that could be used for seed provenance certification. Finally, we discuss how the validated SNPs could facilitate the identification of ecologically and economically relevant genes in this species, improving our understanding of the demography and selective forces shaping its natural genetic diversity, and providing support for new breeding strategies. © 2015 John Wiley & Sons Ltd.

  16. Forensic SNP genotyping with SNaPshot

    DEFF Research Database (Denmark)

    Fondevila, M; Børsting, C; Phillips, C

    2017-01-01

    to routine STR profiling, use of SNaPshot is an important part of the development of SNP sets for a wide range of forensic applications with these markers, from genotyping highly degraded DNA with very short amplicons to the introduction of SNPs to ascertain the ancestry and physical characteristics......This review explores the key factors that influence the optimization, routine use, and profile interpretation of the SNaPshot single-base extension (SBE) system applied to forensic single-nucleotide polymorphism (SNP) genotyping. Despite being a mainly complimentary DNA genotyping technique...... of an unidentified contact trace donor. However, this technology, as resourceful as it is, displays several features that depart from the usual STR genotyping far enough to demand a certain degree of expertise from the forensic analyst before tackling the complex casework on which SNaPshot application provides...

  17. Development and Applications of a High Throughput Genotyping Tool for Polyploid Crops: Single Nucleotide Polymorphism (SNP Array

    Directory of Open Access Journals (Sweden)

    Qian You

    2018-02-01

    Full Text Available Polypoid species play significant roles in agriculture and food production. Many crop species are polyploid, such as potato, wheat, strawberry, and sugarcane. Genotyping has been a daunting task for genetic studies of polyploid crops, which lags far behind the diploid crop species. Single nucleotide polymorphism (SNP array is considered to be one of, high-throughput, relatively cost-efficient and automated genotyping approaches. However, there are significant challenges for SNP identification in complex, polyploid genomes, which has seriously slowed SNP discovery and array development in polyploid species. Ploidy is a significant factor impacting SNP qualities and validation rates of SNP markers in SNP arrays, which has been proven to be a very important tool for genetic studies and molecular breeding. In this review, we (1 discussed the pros and cons of SNP array in general for high throughput genotyping, (2 presented the challenges of and solutions to SNP calling in polyploid species, (3 summarized the SNP selection criteria and considerations of SNP array design for polyploid species, (4 illustrated SNP array applications in several different polyploid crop species, then (5 discussed challenges, available software, and their accuracy comparisons for genotype calling based on SNP array data in polyploids, and finally (6 provided a series of SNP array design and genotype calling recommendations. This review presents a complete overview of SNP array development and applications in polypoid crops, which will benefit the research in molecular breeding and genetics of crops with complex genomes.

  18. SAQC: SNP Array Quality Control

    Directory of Open Access Journals (Sweden)

    Li Ling-Hui

    2011-04-01

    Full Text Available Abstract Background Genome-wide single-nucleotide polymorphism (SNP arrays containing hundreds of thousands of SNPs from the human genome have proven useful for studying important human genome questions. Data quality of SNP arrays plays a key role in the accuracy and precision of downstream data analyses. However, good indices for assessing data quality of SNP arrays have not yet been developed. Results We developed new quality indices to measure the quality of SNP arrays and/or DNA samples and investigated their statistical properties. The indices quantify a departure of estimated individual-level allele frequencies (AFs from expected frequencies via standardized distances. The proposed quality indices followed lognormal distributions in several large genomic studies that we empirically evaluated. AF reference data and quality index reference data for different SNP array platforms were established based on samples from various reference populations. Furthermore, a confidence interval method based on the underlying empirical distributions of quality indices was developed to identify poor-quality SNP arrays and/or DNA samples. Analyses of authentic biological data and simulated data show that this new method is sensitive and specific for the detection of poor-quality SNP arrays and/or DNA samples. Conclusions This study introduces new quality indices, establishes references for AFs and quality indices, and develops a detection method for poor-quality SNP arrays and/or DNA samples. We have developed a new computer program that utilizes these methods called SNP Array Quality Control (SAQC. SAQC software is written in R and R-GUI and was developed as a user-friendly tool for the visualization and evaluation of data quality of genome-wide SNP arrays. The program is available online (http://www.stat.sinica.edu.tw/hsinchou/genetics/quality/SAQC.htm.

  19. SNP Discovery In Marine Fish Species By 454 Sequencing

    DEFF Research Database (Denmark)

    Panitz, Frank; Nielsen, Rasmus Ory; van Houdt, Jeroen K J

    2011-01-01

    Based on the 454 Next-Generation-Sequencing technology (Roche) a high throughput screening method was devised in order to generate novel genetic markers (SNPs). SNP discovery was performed for three target species of marine fish: hake (Merluccius merluccius), herring (Clupea harengus) and sole...

  20. Heritability, SNP- and gene-based analyses of cannabis use initiation and age at onset

    NARCIS (Netherlands)

    Minica, C.C.; Dolan, C.V.; Hottenga, J.J.; Pool, R.; Fedko, I.O.; Mbarek, H.; Huppertz, C.; Bartels, M.; Boomsma, D.I.; Vink, J.M.

    2015-01-01

    Prior searches for genetic variants (GVs) implicated in initiation of cannabis use have been limited to common single nucleotide polymorphisms (SNPs) typed in HapMap samples. Denser SNPs are now available with the completion of the 1000 Genomes and the Genome of the Netherlands projects. More

  1. Heritability, SNP- and Gene-Based Analyses of Cannabis Use Initiation and Age at Onset

    NARCIS (Netherlands)

    Minica, C.C.; Dolan, C.V.; Hottenga, J.J.; Pool, R.; Fedko, I.O.; Mbarek, H.; Huppertz, C.; Bartels, M.; Boomsma, D.I.; Vink, J.M.

    2015-01-01

    Prior searches for genetic variants (GVs) implicated in initiation of cannabis use have been limited to common single nucleotide polymorphisms (SNPs) typed in HapMap samples. Denser SNPs are now available with the completion of the 1000 Genomes and the Genome of the Netherlands projects. More

  2. A gene-based analysis of variants in the serum/glucocorticoid regulated kinase (SGK genes with blood pressure responses to sodium intake: the GenSalt Study.

    Directory of Open Access Journals (Sweden)

    Changwei Li

    Full Text Available Serum and glucocorticoid regulated kinase (SGK plays a critical role in the regulation of renal sodium transport. We examined the association between SGK genes and salt sensitivity of blood pressure (BP using single-marker and gene-based association analysis.A 7-day low-sodium (51.3 mmol sodium/day followed by a 7-day high-sodium intervention (307.8 mmol sodium/day was conducted among 1,906 Chinese participants. BP measurements were obtained at baseline and each intervention using a random-zero sphygmomanometer. Additive associations between each SNP and salt-sensitivity phenotypes were assessed using a mixed linear regression model to account for family dependencies. Gene-based analyses were conducted using the truncated p-value method. The Bonferroni-method was used to adjust for multiple testing in all analyses.In single-marker association analyses, SGK1 marker rs2758151 was significantly associated with diastolic BP (DBP response to high-sodium intervention (P = 0.0010. DBP responses (95% confidence interval to high-sodium intervention for genotypes C/C, C/T, and T/T were 2.04 (1.57 to 2.52, 1.79 (1.42 to 2.16, and 0.85 (0.30 to 1.41 mmHg, respectively. Similar trends were observed for SBP and MAP responses although not significant (P = 0.15 and 0.0026, respectively. In addition, gene-based analyses demonstrated significant associations between SGK1 and SBP, DBP and MAP responses to high sodium intervention (P = 0.0002, 0.0076, and 0.00001, respectively. Neither SGK2 nor SGK3 were associated with the salt-sensitivity phenotypes in single-maker or gene-based analyses.The current study identified association of the SGK1 gene and BP salt-sensitivity in the Han Chinese population. Further studies are warranted to identify causal SGK1 gene variants.

  3. New generation pharmacogenomic tools: a SNP linkage disequilibrium Map, validated SNP assay resource, and high-throughput instrumentation system for large-scale genetic studies.

    Science.gov (United States)

    De La Vega, Francisco M; Dailey, David; Ziegle, Janet; Williams, Julie; Madden, Dawn; Gilbert, Dennis A

    2002-06-01

    Since public and private efforts announced the first draft of the human genome last year, researchers have reported great numbers of single nucleotide polymorphisms (SNPs). We believe that the availability of well-mapped, quality SNP markers constitutes the gateway to a revolution in genetics and personalized medicine that will lead to better diagnosis and treatment of common complex disorders. A new generation of tools and public SNP resources for pharmacogenomic and genetic studies--specifically for candidate-gene, candidate-region, and whole-genome association studies--will form part of the new scientific landscape. This will only be possible through the greater accessibility of SNP resources and superior high-throughput instrumentation-assay systems that enable affordable, highly productive large-scale genetic studies. We are contributing to this effort by developing a high-quality linkage disequilibrium SNP marker map and an accompanying set of ready-to-use, validated SNP assays across every gene in the human genome. This effort incorporates both the public sequence and SNP data sources, and Celera Genomics' human genome assembly and enormous resource ofphysically mapped SNPs (approximately 4,000,000 unique records). This article discusses our approach and methodology for designing the map, choosing quality SNPs, designing and validating these assays, and obtaining population frequency ofthe polymorphisms. We also discuss an advanced, high-performance SNP assay chemisty--a new generation of the TaqMan probe-based, 5' nuclease assay-and high-throughput instrumentation-software system for large-scale genotyping. We provide the new SNP map and validation information, validated SNP assays and reagents, and instrumentation systems as a novel resource for genetic discoveries.

  4. Development of maizeSNP3072, a high-throughput compatible SNP array, for DNA fingerprinting identification of Chinese maize varieties.

    Science.gov (United States)

    Tian, Hong-Li; Wang, Feng-Ge; Zhao, Jiu-Ran; Yi, Hong-Mei; Wang, Lu; Wang, Rui; Yang, Yang; Song, Wei

    2015-01-01

    Single nucleotide polymorphisms (SNPs) are abundant and evenly distributed throughout the maize ( Zea mays L.) genome. SNPs have several advantages over simple sequence repeats, such as ease of data comparison and integration, high-throughput processing of loci, and identification of associated phenotypes. SNPs are thus ideal for DNA fingerprinting, genetic diversity analysis, and marker-assisted breeding. Here, we developed a high-throughput and compatible SNP array, maizeSNP3072, containing 3072 SNPs developed from the maizeSNP50 array. To improve genotyping efficiency, a high-quality cluster file, maizeSNP3072_GT.egt, was constructed. All 3072 SNP loci were localized within different genes, where they were distributed in exons (43 %), promoters (21 %), 3' untranslated regions (UTRs; 22 %), 5' UTRs (9 %), and introns (5 %). The average genotyping failure rate using these SNPs was only 6 %, or 3 % using the cluster file to call genotypes. The genotype consistency of repeat sample analysis on Illumina GoldenGate versus Infinium platforms exceeded 96.4 %. The minor allele frequency (MAF) of the SNPs averaged 0.37 based on data from 309 inbred lines. The 3072 SNPs were highly effective for distinguishing among 276 examined hybrids. Comparative analysis using Chinese varieties revealed that the 3072SNP array showed a better marker success rate and higher average MAF values, evaluation scores, and variety-distinguishing efficiency than the maizeSNP50K array. The maizeSNP3072 array thus can be successfully used in DNA fingerprinting identification of Chinese maize varieties and shows potential as a useful tool for germplasm resource evaluation and molecular marker-assisted breeding.

  5. Sunflower Hybrid Breeding: From Markers to Genomic Selection.

    Science.gov (United States)

    Dimitrijevic, Aleksandra; Horn, Renate

    2017-01-01

    In sunflower, molecular markers for simple traits as, e.g., fertility restoration, high oleic acid content, herbicide tolerance or resistances to Plasmopara halstedii, Puccinia helianthi , or Orobanche cumana have been successfully used in marker-assisted breeding programs for years. However, agronomically important complex quantitative traits like yield, heterosis, drought tolerance, oil content or selection for disease resistance, e.g., against Sclerotinia sclerotiorum have been challenging and will require genome-wide approaches. Plant genetic resources for sunflower are being collected and conserved worldwide that represent valuable resources to study complex traits. Sunflower association panels provide the basis for genome-wide association studies, overcoming disadvantages of biparental populations. Advances in technologies and the availability of the sunflower genome sequence made novel approaches on the whole genome level possible. Genotype-by-sequencing, and whole genome sequencing based on next generation sequencing technologies facilitated the production of large amounts of SNP markers for high density maps as well as SNP arrays and allowed genome-wide association studies and genomic selection in sunflower. Genome wide or candidate gene based association studies have been performed for traits like branching, flowering time, resistance to Sclerotinia head and stalk rot. First steps in genomic selection with regard to hybrid performance and hybrid oil content have shown that genomic selection can successfully address complex quantitative traits in sunflower and will help to speed up sunflower breeding programs in the future. To make sunflower more competitive toward other oil crops higher levels of resistance against pathogens and better yield performance are required. In addition, optimizing plant architecture toward a more complex growth type for higher plant densities has the potential to considerably increase yields per hectare. Integrative approaches

  6. Sunflower Hybrid Breeding: From Markers to Genomic Selection

    Science.gov (United States)

    Dimitrijevic, Aleksandra; Horn, Renate

    2018-01-01

    In sunflower, molecular markers for simple traits as, e.g., fertility restoration, high oleic acid content, herbicide tolerance or resistances to Plasmopara halstedii, Puccinia helianthi, or Orobanche cumana have been successfully used in marker-assisted breeding programs for years. However, agronomically important complex quantitative traits like yield, heterosis, drought tolerance, oil content or selection for disease resistance, e.g., against Sclerotinia sclerotiorum have been challenging and will require genome-wide approaches. Plant genetic resources for sunflower are being collected and conserved worldwide that represent valuable resources to study complex traits. Sunflower association panels provide the basis for genome-wide association studies, overcoming disadvantages of biparental populations. Advances in technologies and the availability of the sunflower genome sequence made novel approaches on the whole genome level possible. Genotype-by-sequencing, and whole genome sequencing based on next generation sequencing technologies facilitated the production of large amounts of SNP markers for high density maps as well as SNP arrays and allowed genome-wide association studies and genomic selection in sunflower. Genome wide or candidate gene based association studies have been performed for traits like branching, flowering time, resistance to Sclerotinia head and stalk rot. First steps in genomic selection with regard to hybrid performance and hybrid oil content have shown that genomic selection can successfully address complex quantitative traits in sunflower and will help to speed up sunflower breeding programs in the future. To make sunflower more competitive toward other oil crops higher levels of resistance against pathogens and better yield performance are required. In addition, optimizing plant architecture toward a more complex growth type for higher plant densities has the potential to considerably increase yields per hectare. Integrative approaches

  7. Sunflower Hybrid Breeding: From Markers to Genomic Selection

    Directory of Open Access Journals (Sweden)

    Aleksandra Dimitrijevic

    2018-01-01

    Full Text Available In sunflower, molecular markers for simple traits as, e.g., fertility restoration, high oleic acid content, herbicide tolerance or resistances to Plasmopara halstedii, Puccinia helianthi, or Orobanche cumana have been successfully used in marker-assisted breeding programs for years. However, agronomically important complex quantitative traits like yield, heterosis, drought tolerance, oil content or selection for disease resistance, e.g., against Sclerotinia sclerotiorum have been challenging and will require genome-wide approaches. Plant genetic resources for sunflower are being collected and conserved worldwide that represent valuable resources to study complex traits. Sunflower association panels provide the basis for genome-wide association studies, overcoming disadvantages of biparental populations. Advances in technologies and the availability of the sunflower genome sequence made novel approaches on the whole genome level possible. Genotype-by-sequencing, and whole genome sequencing based on next generation sequencing technologies facilitated the production of large amounts of SNP markers for high density maps as well as SNP arrays and allowed genome-wide association studies and genomic selection in sunflower. Genome wide or candidate gene based association studies have been performed for traits like branching, flowering time, resistance to Sclerotinia head and stalk rot. First steps in genomic selection with regard to hybrid performance and hybrid oil content have shown that genomic selection can successfully address complex quantitative traits in sunflower and will help to speed up sunflower breeding programs in the future. To make sunflower more competitive toward other oil crops higher levels of resistance against pathogens and better yield performance are required. In addition, optimizing plant architecture toward a more complex growth type for higher plant densities has the potential to considerably increase yields per hectare

  8. SNP Data Quality Control in a National Beef and Dairy Cattle System and Highly Accurate SNP Based Parentage Verification and Identification

    Directory of Open Access Journals (Sweden)

    Matthew C. McClure

    2018-03-01

    Full Text Available A major use of genetic data is parentage verification and identification as inaccurate pedigrees negatively affect genetic gain. Since 2012 the international standard for single nucleotide polymorphism (SNP verification in Bos taurus cattle has been the ISAG SNP panels. While these ISAG panels provide an increased level of parentage accuracy over microsatellite markers (MS, they can validate the wrong parent at ≤1% misconcordance rate levels, indicating that more SNP are needed if a more accurate pedigree is required. With rapidly increasing numbers of cattle being genotyped in Ireland that represent 61 B. taurus breeds from a wide range of farm types: beef/dairy, AI/pedigree/commercial, purebred/crossbred, and large to small herd size the Irish Cattle Breeding Federation (ICBF analyzed different SNP densities to determine that at a minimum ≥500 SNP are needed to consistently predict only one set of parents at a ≤1% misconcordance rate. For parentage validation and prediction ICBF uses 800 SNP (ICBF800 selected based on SNP clustering quality, ISAG200 inclusion, call rate (CR, and minor allele frequency (MAF in the Irish cattle population. Large datasets require sample and SNP quality control (QC. Most publications only deal with SNP QC via CR, MAF, parent-progeny conflicts, and Hardy-Weinberg deviation, but not sample QC. We report here parentage, SNP QC, and a genomic sample QC pipelines to deal with the unique challenges of >1 million genotypes from a national herd such as SNP genotype errors from mis-tagging of animals, lab errors, farm errors, and multiple other issues that can arise. We divide the pipeline into two parts: a Genotype QC and an Animal QC pipeline. The Genotype QC identifies samples with low call rate, missing or mixed genotype classes (no BB genotype or ABTG alleles present, and low genotype frequencies. The Animal QC handles situations where the genotype might not belong to the listed individual by identifying: >1 non

  9. A large-scale chromosome-specific SNP discovery guideline.

    Science.gov (United States)

    Akpinar, Bala Ani; Lucas, Stuart; Budak, Hikmet

    2017-01-01

    Single-nucleotide polymorphisms (SNPs) are the most prevalent type of variation in genomes that are increasingly being used as molecular markers in diversity analyses, mapping and cloning of genes, and germplasm characterization. However, only a few studies reported large-scale SNP discovery in Aegilops tauschii, restricting their potential use as markers for the low-polymorphic D genome. Here, we report 68,592 SNPs found on the gene-related sequences of the 5D chromosome of Ae. tauschii genotype MvGB589 using genomic and transcriptomic sequences from seven Ae. tauschii accessions, including AL8/78, the only genotype for which a draft genome sequence is available at present. We also suggest a workflow to compare SNP positions in homologous regions on the 5D chromosome of Triticum aestivum, bread wheat, to mark single nucleotide variations between these closely related species. Overall, the identified SNPs define a density of 4.49 SNPs per kilobyte, among the highest reported for the genic regions of Ae. tauschii so far. To our knowledge, this study also presents the first chromosome-specific SNP catalog in Ae. tauschii that should facilitate the association of these SNPs with morphological traits on chromosome 5D to be ultimately targeted for wheat improvement.

  10. A robust SNP barcode for typing Mycobacterium tuberculosis complex strains

    KAUST Repository

    Coll, Francesc

    2014-09-01

    Strain-specific genomic diversity in the Mycobacterium tuberculosis complex (MTBC) is an important factor in pathogenesis that may affect virulence, transmissibility, host response and emergence of drug resistance. Several systems have been proposed to classify MTBC strains into distinct lineages and families. Here, we investigate single-nucleotide polymorphisms (SNPs) as robust (stable) markers of genetic variation for phylogenetic analysis. We identify ∼92k SNP across a global collection of 1,601 genomes. The SNP-based phylogeny is consistent with the gold-standard regions of difference (RD) classification system. Of the ∼7k strain-specific SNPs identified, 62 markers are proposed to discriminate known circulating strains. This SNP-based barcode is the first to cover all main lineages, and classifies a greater number of sublineages than current alternatives. It may be used to classify clinical isolates to evaluate tools to control the disease, including therapeutics and vaccines whose effectiveness may vary by strain type. © 2014 Macmillan Publishers Limited.

  11. SNP-SNP interactions in breast cancer susceptibility

    International Nuclear Information System (INIS)

    Onay, Venüs Ümmiye; Ozcelik, Hilmi; Briollais, Laurent; Knight, Julia A; Shi, Ellen; Wang, Yuanyuan; Wells, Sean; Li, Hong; Rajendram, Isaac; Andrulis, Irene L

    2006-01-01

    Breast cancer predisposition genes identified to date (e.g., BRCA1 and BRCA2) are responsible for less than 5% of all breast cancer cases. Many studies have shown that the cancer risks associated with individual commonly occurring single nucleotide polymorphisms (SNPs) are incremental. However, polygenic models suggest that multiple commonly occurring low to modestly penetrant SNPs of cancer related genes might have a greater effect on a disease when considered in combination. In an attempt to identify the breast cancer risk conferred by SNP interactions, we have studied 19 SNPs from genes involved in major cancer related pathways. All SNPs were genotyped by TaqMan 5'nuclease assay. The association between the case-control status and each individual SNP, measured by the odds ratio and its corresponding 95% confidence interval, was estimated using unconditional logistic regression models. At the second stage, two-way interactions were investigated using multivariate logistic models. The robustness of the interactions, which were observed among SNPs with stronger functional evidence, was assessed using a bootstrap approach, and correction for multiple testing based on the false discovery rate (FDR) principle. None of these SNPs contributed to breast cancer risk individually. However, we have demonstrated evidence for gene-gene (SNP-SNP) interaction among these SNPs, which were associated with increased breast cancer risk. Our study suggests cross talk between the SNPs of the DNA repair and immune system (XPD-[Lys751Gln] and IL10-[G(-1082)A]), cell cycle and estrogen metabolism (CCND1-[Pro241Pro] and COMT-[Met108/158Val]), cell cycle and DNA repair (BARD1-[Pro24Ser] and XPD-[Lys751Gln]), and within carcinogen metabolism (GSTP1-[Ile105Val] and COMT-[Met108/158Val]) pathways. The importance of these pathways and their communication in breast cancer predisposition has been emphasized previously, but their biological interactions through SNPs have not been described

  12. SNP-SNP interactions in breast cancer susceptibility

    Directory of Open Access Journals (Sweden)

    Wang Yuanyuan

    2006-05-01

    Full Text Available Abstract Background Breast cancer predisposition genes identified to date (e.g., BRCA1 and BRCA2 are responsible for less than 5% of all breast cancer cases. Many studies have shown that the cancer risks associated with individual commonly occurring single nucleotide polymorphisms (SNPs are incremental. However, polygenic models suggest that multiple commonly occurring low to modestly penetrant SNPs of cancer related genes might have a greater effect on a disease when considered in combination. Methods In an attempt to identify the breast cancer risk conferred by SNP interactions, we have studied 19 SNPs from genes involved in major cancer related pathways. All SNPs were genotyped by TaqMan 5'nuclease assay. The association between the case-control status and each individual SNP, measured by the odds ratio and its corresponding 95% confidence interval, was estimated using unconditional logistic regression models. At the second stage, two-way interactions were investigated using multivariate logistic models. The robustness of the interactions, which were observed among SNPs with stronger functional evidence, was assessed using a bootstrap approach, and correction for multiple testing based on the false discovery rate (FDR principle. Results None of these SNPs contributed to breast cancer risk individually. However, we have demonstrated evidence for gene-gene (SNP-SNP interaction among these SNPs, which were associated with increased breast cancer risk. Our study suggests cross talk between the SNPs of the DNA repair and immune system (XPD-[Lys751Gln] and IL10-[G(-1082A], cell cycle and estrogen metabolism (CCND1-[Pro241Pro] and COMT-[Met108/158Val], cell cycle and DNA repair (BARD1-[Pro24Ser] and XPD-[Lys751Gln], and within carcinogen metabolism (GSTP1-[Ile105Val] and COMT-[Met108/158Val] pathways. Conclusion The importance of these pathways and their communication in breast cancer predisposition has been emphasized previously, but their

  13. Design and characterization of a 52K SNP chip for goats.

    Directory of Open Access Journals (Sweden)

    Gwenola Tosser-Klopp

    Full Text Available The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50-60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed: Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes, sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years.

  14. Imputation of microsatellite alleles from dense SNP genotypes for parental verification

    Directory of Open Access Journals (Sweden)

    Matthew eMcclure

    2012-08-01

    Full Text Available Microsatellite (MS markers have recently been used for parental verification and are still the international standard despite higher cost, error rate, and turnaround time compared with Single Nucleotide Polymorphisms (SNP-based assays. Despite domestic and international interest from producers and research communities, no viable means currently exist to verify parentage for an individual unless all familial connections were analyzed using the same DNA marker type (MS or SNP. A simple and cost-effective method was devised to impute MS alleles from SNP haplotypes within breeds. For some MS, imputation results may allow inference across breeds. A total of 347 dairy cattle representing 4 dairy breeds (Brown Swiss, Guernsey, Holstein, and Jersey were used to generate reference haplotypes. This approach has been verified (>98% accurate for imputing the International Society of Animal Genetics (ISAG recommended panel of 12 MS for cattle parentage verification across a validation set of 1,307 dairy animals.. Implementation of this method will allow producers and breed associations to transition to SNP-based parentage verification utilizing MS genotypes from historical data on parents where SNP genotypes are missing. This approach may be applicable to additional cattle breeds and other species that wish to migrate from MS- to SNP- based parental verification.

  15. Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array

    Energy Technology Data Exchange (ETDEWEB)

    Gardner, S; Jaing, C

    2012-03-27

    The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interim report, we described the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.

  16. Population genetic analysis of ascertained SNP data

    Directory of Open Access Journals (Sweden)

    Nielsen Rasmus

    2004-03-01

    Full Text Available Abstract The large single nucleotide polymorphism (SNP typing projects have provided an invaluable data resource for human population geneticists. Almost all of the available SNP loci, however, have been identified through a SNP discovery protocol that will influence the allelic distributions in the sampled loci. Standard methods for population genetic analysis based on the available SNP data will, therefore, be biased. This paper discusses the effect of this ascertainment bias on allelic distributions and on methods for quantifying linkage disequilibrium and estimating demographic parameters. Several recently developed methods for correcting for the ascertainment bias will also be discussed.

  17. Model SNP development for complex genomes based on hexaploid oat using high-throughput 454 sequencing technology

    Directory of Open Access Journals (Sweden)

    Chao Shiaoman

    2011-01-01

    Full Text Available Abstract Background Genetic markers are pivotal to modern genomics research; however, discovery and genotyping of molecular markers in oat has been hindered by the size and complexity of the genome, and by a scarcity of sequence data. The purpose of this study was to generate oat expressed sequence tag (EST information, develop a bioinformatics pipeline for SNP discovery, and establish a method for rapid, cost-effective, and straightforward genotyping of SNP markers in complex polyploid genomes such as oat. Results Based on cDNA libraries of four cultivated oat genotypes, approximately 127,000 contigs were assembled from approximately one million Roche 454 sequence reads. Contigs were filtered through a novel bioinformatics pipeline to eliminate ambiguous polymorphism caused by subgenome homology, and 96 in silico SNPs were selected from 9,448 candidate loci for validation using high-resolution melting (HRM analysis. Of these, 52 (54% were polymorphic between parents of the Ogle1040 × TAM O-301 (OT mapping population, with 48 segregating as single Mendelian loci, and 44 being placed on the existing OT linkage map. Ogle and TAM amplicons from 12 primers were sequenced for SNP validation, revealing complex polymorphism in seven amplicons but general sequence conservation within SNP loci. Whole-amplicon interrogation with HRM revealed insertions, deletions, and heterozygotes in secondary oat germplasm pools, generating multiple alleles at some primer targets. To validate marker utility, 36 SNP assays were used to evaluate the genetic diversity of 34 diverse oat genotypes. Dendrogram clusters corresponded generally to known genome composition and genetic ancestry. Conclusions The high-throughput SNP discovery pipeline presented here is a rapid and effective method for identification of polymorphic SNP alleles in the oat genome. The current-generation HRM system is a simple and highly-informative platform for SNP genotyping. These techniques provide

  18. FunctSNP: an R package to link SNPs to functional knowledge and dbAutoMaker: a suite of Perl scripts to build SNP databases

    Directory of Open Access Journals (Sweden)

    Watson-Haigh Nathan S

    2010-06-01

    Full Text Available Abstract Background Whole genome association studies using highly dense single nucleotide polymorphisms (SNPs are a set of methods to identify DNA markers associated with variation in a particular complex trait of interest. One of the main outcomes from these studies is a subset of statistically significant SNPs. Finding the potential biological functions of such SNPs can be an important step towards further use in human and agricultural populations (e.g., for identifying genes related to susceptibility to complex diseases or genes playing key roles in development or performance. The current challenge is that the information holding the clues to SNP functions is distributed across many different databases. Efficient bioinformatics tools are therefore needed to seamlessly integrate up-to-date functional information on SNPs. Many web services have arisen to meet the challenge but most work only within the framework of human medical research. Although we acknowledge the importance of human research, we identify there is a need for SNP annotation tools for other organisms. Description We introduce an R package called FunctSNP, which is the user interface to custom built species-specific databases. The local relational databases contain SNP data together with functional annotations extracted from online resources. FunctSNP provides a unified bioinformatics resource to link SNPs with functional knowledge (e.g., genes, pathways, ontologies. We also introduce dbAutoMaker, a suite of Perl scripts, which can be scheduled to run periodically to automatically create/update the customised SNP databases. We illustrate the use of FunctSNP with a livestock example, but the approach and software tools presented here can be applied also to human and other organisms. Conclusions Finding the potential functional significance of SNPs is important when further using the outcomes from whole genome association studies. FunctSNP is unique in that it is the only R

  19. Unraveling biocomplexity of Northeast Atlantic herring stocks using SNP markers

    DEFF Research Database (Denmark)

    Bekkevold, Dorte; Limborg, Morten; Helyar, Sarah

    2012-01-01

    Atlantic herring (Clupea harengus) exhibit biocomplexity, with widespread, geographically explicit populations that perform long‐range migration to common feeding and wintering areas, where they are exploited by fisheries. This means that exploited stocks do not describe discrete units, thereby c...... and spatial dynamics applicable to stock assessment methods, as well as presenting a traceability tool for certification of herring and herring products...

  20. (SNP) markers for the Chinese black sleeper, Bostrychus sinensis

    African Journals Online (AJOL)

    ajl yemi

    2011-04-25

    Apr 25, 2011 ... The Chinese black sleeper, Bostrychus sinensis. Lacepede 1801, occurs from the northern Indian Ocean coast, reaching east to the Pacific, Melanesia and ... specificity, spawns in burrow and shows behavior of guarding eggs. These characteristics not only influence their expansion capability, but make ...

  1. Characterization of fifteen SNP markers by mining EST in sea ...

    Indian Academy of Sciences (India)

    1Liaoning Key Lab of Marine Fishery Molecular Biology, Liaoning Ocean and Fisheries Science Research Institute,. Dalian 116023 ... 2009), and Atlantic salmon (Hayes et al. ..... in a self-incompatible and partially clonal forest tree species -.

  2. A SNP-Based Molecular Barcode for Characterization of Common Wheat.

    Directory of Open Access Journals (Sweden)

    LiFeng Gao

    Full Text Available Wheat is grown as a staple crop worldwide. It is important to develop an effective genotyping tool for this cereal grain both to identify germplasm diversity and to protect the rights of breeders. Single-nucleotide polymorphism (SNP genotyping provides a means for developing a practical, rapid, inexpensive and high-throughput assay. Here, we investigated SNPs as robust markers of genetic variation for typing wheat cultivars. We identified SNPs from an array of 9000 across a collection of 429 well-known wheat cultivars grown in China, of which 43 SNP markers with high minor allele frequency and variations discriminated the selected wheat varieties and their wild ancestors. This SNP-based barcode will allow for the rapid and precise identification of wheat germplasm resources and newly released varieties and will further assist in the wheat breeding program.

  3. [Restriction endonuclease digest - melting curve analysis: a new SNP genotyping and its application in traditional Chinese medicine authentication].

    Science.gov (United States)

    Jiang, Chao; Huang, Lu-Qi; Yuan, Yuan; Chen, Min; Hou, Jing-Yi; Wu, Zhi-Gang; Lin, Shu-Fang

    2014-04-01

    Single nucleotide polymorphisms (SNP) is an important molecular marker in traditional Chinese medicine research, and it is widely used in TCM authentication. The present study created a new genotyping method by combining restriction endonuclease digesting with melting curve analysis, which is a stable, rapid and easy doing SNP genotyping method. The new method analyzed SNP genotyping of two chloroplast SNP which was located in or out of the endonuclease recognition site, the results showed that when attaching a 14 bp GC-clamp (cggcgggagggcgg) to 5' end of the primer and selecting suited endonuclease to digest the amplification products, the melting curve of Lonicera japonica and Atractylodes macrocephala were all of double peaks and the adulterants Shan-yin-hua and A. lancea were of single peaks. The results indicated that the method had good stability and reproducibility for identifying authentic medicines from its adulterants. It is a potential SNP genotyping method and named restriction endonuclease digest - melting curve analysis.

  4. SNP-RFLPing 2: an updated and integrated PCR-RFLP tool for SNP genotyping

    Directory of Open Access Journals (Sweden)

    Chang Hsueh-Wei

    2010-04-01

    Full Text Available Abstract Background PCR-restriction fragment length polymorphism (RFLP assay is a cost-effective method for SNP genotyping and mutation detection, but the manual mining for restriction enzyme sites is challenging and cumbersome. Three years after we constructed SNP-RFLPing, a freely accessible database and analysis tool for restriction enzyme mining of SNPs, significant improvements over the 2006 version have been made and incorporated into the latest version, SNP-RFLPing 2. Results The primary aim of SNP-RFLPing 2 is to provide comprehensive PCR-RFLP information with multiple functionality about SNPs, such as SNP retrieval to multiple species, different polymorphism types (bi-allelic, tri-allelic, tetra-allelic or indels, gene-centric searching, HapMap tagSNPs, gene ontology-based searching, miRNAs, and SNP500Cancer. The RFLP restriction enzymes and the corresponding PCR primers for the natural and mutagenic types of each SNP are simultaneously analyzed. All the RFLP restriction enzyme prices are also provided to aid selection. Furthermore, the previously encountered updating problems for most SNP related databases are resolved by an on-line retrieval system. Conclusions The user interfaces for functional SNP analyses have been substantially improved and integrated. SNP-RFLPing 2 offers a new and user-friendly interface for RFLP genotyping that can be used in association studies and is freely available at http://bio.kuas.edu.tw/snp-rflping2.

  5. SNP-RFLPing 2: an updated and integrated PCR-RFLP tool for SNP genotyping.

    Science.gov (United States)

    Chang, Hsueh-Wei; Cheng, Yu-Huei; Chuang, Li-Yeh; Yang, Cheng-Hong

    2010-04-08

    PCR-restriction fragment length polymorphism (RFLP) assay is a cost-effective method for SNP genotyping and mutation detection, but the manual mining for restriction enzyme sites is challenging and cumbersome. Three years after we constructed SNP-RFLPing, a freely accessible database and analysis tool for restriction enzyme mining of SNPs, significant improvements over the 2006 version have been made and incorporated into the latest version, SNP-RFLPing 2. The primary aim of SNP-RFLPing 2 is to provide comprehensive PCR-RFLP information with multiple functionality about SNPs, such as SNP retrieval to multiple species, different polymorphism types (bi-allelic, tri-allelic, tetra-allelic or indels), gene-centric searching, HapMap tagSNPs, gene ontology-based searching, miRNAs, and SNP500Cancer. The RFLP restriction enzymes and the corresponding PCR primers for the natural and mutagenic types of each SNP are simultaneously analyzed. All the RFLP restriction enzyme prices are also provided to aid selection. Furthermore, the previously encountered updating problems for most SNP related databases are resolved by an on-line retrieval system. The user interfaces for functional SNP analyses have been substantially improved and integrated. SNP-RFLPing 2 offers a new and user-friendly interface for RFLP genotyping that can be used in association studies and is freely available at http://bio.kuas.edu.tw/snp-rflping2.

  6. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.

    Science.gov (United States)

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ~4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification pr...

  7. SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate

    Science.gov (United States)

    Gretchen H. Roffler; Stephen J. Amish; Seth Smith; Ted Cosart; Marty Kardos; Michael K. Schwartz; Gordon Luikart

    2016-01-01

    Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding...

  8. SNP discovery in nonmodel organisms: strand bias and base-substitution errors reduce conversion rates.

    Science.gov (United States)

    Gonçalves da Silva, Anders; Barendse, William; Kijas, James W; Barris, Wes C; McWilliam, Sean; Bunch, Rowan J; McCullough, Russell; Harrison, Blair; Hoelzel, A Rus; England, Phillip R

    2015-07-01

    Single nucleotide polymorphisms (SNPs) have become the marker of choice for genetic studies in organisms of conservation, commercial or biological interest. Most SNP discovery projects in nonmodel organisms apply a strategy for identifying putative SNPs based on filtering rules that account for random sequencing errors. Here, we analyse data used to develop 4723 novel SNPs for the commercially important deep-sea fish, orange roughy (Hoplostethus atlanticus), to assess the impact of not accounting for systematic sequencing errors when filtering identified polymorphisms when discovering SNPs. We used SAMtools to identify polymorphisms in a velvet assembly of genomic DNA sequence data from seven individuals. The resulting set of polymorphisms were filtered to minimize 'bycatch'-polymorphisms caused by sequencing or assembly error. An Illumina Infinium SNP chip was used to genotype a final set of 7714 polymorphisms across 1734 individuals. Five predictors were examined for their effect on the probability of obtaining an assayable SNP: depth of coverage, number of reads that support a variant, polymorphism type (e.g. A/C), strand-bias and Illumina SNP probe design score. Our results indicate that filtering out systematic sequencing errors could substantially improve the efficiency of SNP discovery. We show that BLASTX can be used as an efficient tool to identify single-copy genomic regions in the absence of a reference genome. The results have implications for research aiming to identify assayable SNPs and build SNP genotyping assays for nonmodel organisms. © 2014 John Wiley & Sons Ltd.

  9. SNP genotyping by DNA photoligation: application to SNP detection of genes from food crops

    Energy Technology Data Exchange (ETDEWEB)

    Yoshimura, Yoshinaga; Ohtake, Tomoko; Okada, Hajime; Fujimoto, Kenzo [School of Materials Science, Japan Advanced Institute of Science and Technology, 1-1 Asahidai, Nomi, Ishikawa 923-1292 (Japan); Ami, Takehiro [Innovation Plaza Ishikawa, Japan Science and Technology Agency, 2-13 Asahidai, Nomi, Ishikawa 923-1211 (Japan); Tsukaguchi, Tadashi, E-mail: kenzo@jaist.ac.j [Faculty of Bioresources and Environmental Sciences, Ishikawa Prefectural University, 1-308 Suematsu, Nonoichi, Ishikawa 921-8836 (Japan)

    2009-06-15

    We describe a simple and inexpensive single-nucleotide polymorphism (SNP) typing method, using DNA photoligation with 5-carboxyvinyl-2'-deoxyuridine and two fluorophores. This SNP-typing method facilitates qualitative determination of genes from indica and japonica rice, and showed a high degree of single nucleotide specificity up to 10 000. This method can be used in the SNP typing of actual genomic DNA samples from food crops.

  10. SNP genotyping by DNA photoligation: application to SNP detection of genes from food crops

    Directory of Open Access Journals (Sweden)

    Yoshinaga Yoshimura, Tomoko Ohtake, Hajime Okada, Takehiro Ami, Tadashi Tsukaguchi and Kenzo Fujimoto

    2009-01-01

    Full Text Available We describe a simple and inexpensive single-nucleotide polymorphism (SNP typing method, using DNA photoligation with 5-carboxyvinyl-2'-deoxyuridine and two fluorophores. This SNP-typing method facilitates qualitative determination of genes from indica and japonica rice, and showed a high degree of single nucleotide specificity up to 10 000. This method can be used in the SNP typing of actual genomic DNA samples from food crops.

  11. A Nonlinear Model for Gene-Based Gene-Environment Interaction

    Directory of Open Access Journals (Sweden)

    Jian Sa

    2016-06-01

    Full Text Available A vast amount of literature has confirmed the role of gene-environment (G×E interaction in the etiology of complex human diseases. Traditional methods are predominantly focused on the analysis of interaction between a single nucleotide polymorphism (SNP and an environmental variable. Given that genes are the functional units, it is crucial to understand how gene effects (rather than single SNP effects are influenced by an environmental variable to affect disease risk. Motivated by the increasing awareness of the power of gene-based association analysis over single variant based approach, in this work, we proposed a sparse principle component regression (sPCR model to understand the gene-based G×E interaction effect on complex disease. We first extracted the sparse principal components for SNPs in a gene, then the effect of each principal component was modeled by a varying-coefficient (VC model. The model can jointly model variants in a gene in which their effects are nonlinearly influenced by an environmental variable. In addition, the varying-coefficient sPCR (VC-sPCR model has nice interpretation property since the sparsity on the principal component loadings can tell the relative importance of the corresponding SNPs in each component. We applied our method to a human birth weight dataset in Thai population. We analyzed 12,005 genes across 22 chromosomes and found one significant interaction effect using the Bonferroni correction method and one suggestive interaction. The model performance was further evaluated through simulation studies. Our model provides a system approach to evaluate gene-based G×E interaction.

  12. Characterization of single nucleotide polymorphism markers for eelgrass (Zostera marina)

    NARCIS (Netherlands)

    Ferber, Steven; Reusch, Thorsten B. H.; Stam, Wytze T.; Olsen, Jeanine L.

    We characterized 37 single nucleotide polymorphism (SNP) makers for eelgrass Zostera marina. SNP markers were developed using existing EST (expressed sequence tag)-libraries to locate polymorphic loci and develop primers from the functional expressed genes that are deposited in The ZOSTERA database

  13. A low-density SNP array for analyzing differential selection in freshwater and marine populations of threespine stickleback (Gasterosteus aculeatus)

    DEFF Research Database (Denmark)

    Ferchaud, Anne-Laure; Pedersen, Susanne H.; Bekkevold, Dorte

    2014-01-01

    for rapid and cost efficient analysis of genetic divergence between freshwater and marine sticklebacks, we generated a low-density SNP (Single Nucleotide Polymorphism) array encompassing markers of chromosome regions under putative directional selection, along with neutral markers for background. Results......: RAD (Restriction site Associated DNA) sequencing of sixty individuals representing two freshwater and one marine population led to the identification of 33,993 SNP markers. Ninety-six of these were chosen for the low-density SNP array, among which 70 represented SNPs under putatively directional...... selection in freshwater vs. marine environments, whereas 26 SNPs were assumed to be neutral. Annotation of these regions revealed several genes that are candidates for affecting stickleback phenotypic variation, some of which have been observed in previous studies whereas others are new. Conclusions: We...

  14. Grouping and clustering of maize Lancaster germplasm inbreds according to the results of SNP-analysis

    Directory of Open Access Journals (Sweden)

    K. V. Derkach

    2017-08-01

    Full Text Available The objective of this article is the grouping and clustering of maize inbred lines based on the results of SNP-genotyping for the verification of a separate cluster of Lancaster germplasm inbred lines. As material for the study, we used 91 maize (Zea mays L. inbred lines, including 31 Lancaster germplasm lines and 60 inbred lines of other germplasms (23 Iodent inbreds, 15 Reid inbreds, 7 Lacon inbreds, 12 Mix inbreds and 3 exotic inbreds. The majority of the given inbred lines are included in the Dnipro breeding programme. The SNP-genotyping of these inbred lines was conducted using BDI-III panel of 384 SNP-markers developed by BioDiagnostics, Inc. (USA on the base of Illumina VeraCode Bead Plate. The SNP-markers of this panel are biallelic and are located on all 10 maize chromosomes. Their range of conductivity was >0.6. The SNP-analysis was made in completely automated regime on Illumina BeadStation equipment at BioDiagnostics, Inc. (USA. A principal component analysis was applied to group a general set of 91 inbreds according to allelic states of SNP-markers and to identify a cluster of Lancaster inbreds. The clustering and determining hierarchy in 31 Lancaster germplasm inbreds used quantitative cluster analysis. The share of monomorphic markers in the studied set of 91 inbred lines equaled 0.7%, and the share of dimorphic markers equaled 99.3%. Minor allele frequency (MAF > 0.2 was observed for 80.6% of dimorphic markers, the average index of shift of gene diversity equaled 0.2984, PIC on average reached 0.3144. The index of gene diversity of markers varied from 0.1701 to 0.1901, pairwise genetic distances between inbred lines ranged from 0.0316–0.8000, the frequencies of major alleles of SNP-markers were within 0.5085–0.9821, and the frequencies of minor alleles were within 0.0179–0.4915. The average homozygosity of inbred lines was 98.8%. The principal component analysis of SNP-distances confirmed the isolation of the Lancaster

  15. High-throughput SNP genotyping in Cucurbita pepo for map construction and quantitative trait loci mapping.

    Science.gov (United States)

    Esteras, Cristina; Gómez, Pedro; Monforte, Antonio J; Blanca, José; Vicente-Dólera, Nelly; Roig, Cristina; Nuez, Fernando; Picó, Belén

    2012-02-22

    Cucurbita pepo is a member of the Cucurbitaceae family, the second- most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species.The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL). We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations. Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research, especially considering that most of these markers are located in

  16. Analysis of SNP rs16754 of WT1 gene in a series of de novo acute myeloid leukemia patients.

    Science.gov (United States)

    Luna, Irene; Such, Esperanza; Cervera, Jose; Barragán, Eva; Jiménez-Velasco, Antonio; Dolz, Sandra; Ibáñez, Mariam; Gómez-Seguí, Inés; López-Pavía, María; Llop, Marta; Fuster, Óscar; Oltra, Silvestre; Moscardó, Federico; Martínez-Cuadrón, David; Senent, M Leonor; Gascón, Adriana; Montesinos, Pau; Martín, Guillermo; Bolufer, Pascual; Sanz, Miguel A

    2012-12-01

    The single nucleotide polymorphism (SNP) rs16754 of the WT1 gene has been previously described as a possible prognostic marker in normal karyotype acute myeloid leukemia (AML) patients. Nevertheless, the findings in this field are not always reproducible in different series. One hundred and seventy-five adult de novo AML patients were screened with two different methods for the detection of SNP rs16754: high-resolution melting (HRM) and FRET hybridization probes. Direct sequencing was used to validate both techniques. The SNP was detected in 52 out of 175 patients (30 %), both by HRM and hybridization probes. Direct sequencing confirmed that every positive sample in the screening methods had a variation in the DNA sequence. Patients with the wild-type genotype (WT1(AA)) for the SNP rs16754 were significantly younger than those with the heterozygous WT1(AG) genotype. No other difference was observed for baseline characteristic or outcome between patients with or without the SNP. Both techniques are equally reliable and reproducible as screening methods for the detection of the SNP rs16754, allowing for the selection of those samples that will need to be sequenced. We were unable to confirm the suggested favorable outcome of SNP rs16754 in de novo AML.

  17. Genetic markers and their application in livestock breeding in South ...

    African Journals Online (AJOL)

    The ultimate use of DNA markers would be to identify quantitative trait loci (QTL) in order to practice genotypic selection. This paper reviews DNA markers (RAPD, DFP, RFLP AFLP, minisatellites, microsatellites, SNP) and provides a brief overview of the current application of these markers in animal breeding.

  18. Rational design of gene-based vaccines.

    Science.gov (United States)

    Barouch, Dan H

    2006-01-01

    Vaccine development has traditionally been an empirical discipline. Classical vaccine strategies include the development of attenuated organisms, whole killed organisms, and protein subunits, followed by empirical optimization and iterative improvements. While these strategies have been remarkably successful for a wide variety of viruses and bacteria, these approaches have proven more limited for pathogens that require cellular immune responses for their control. In this review, current strategies to develop and optimize gene-based vaccines are described, with an emphasis on novel approaches to improve plasmid DNA vaccines and recombinant adenovirus vector-based vaccines. Copyright 2006 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.

  19. TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing.

    Science.gov (United States)

    Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David

    2018-04-11

    Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions

  20. SNP-VISTA: An Interactive SNPs Visualization Tool

    Energy Technology Data Exchange (ETDEWEB)

    Shah, Nameeta; Teplitsky, Michael V.; Pennacchio, Len A.; Hugenholtz, Philip; Hamann, Bernd; Dubchak, Inna L.

    2005-07-05

    Recent advances in sequencing technologies promise better diagnostics for many diseases as well as better understanding of evolution of microbial populations. Single Nucleotide Polymorphisms(SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it is possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease and then screen for causative mutations.In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples makes possible more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at http://genome.lbl.gov/vista/snpvista.

  1. SNP Discovery for mapping alien introgressions in wheat

    Science.gov (United States)

    2014-01-01

    Background Monitoring alien introgressions in crop plants is difficult due to the lack of genetic and molecular mapping information on the wild crop relatives. The tertiary gene pool of wheat is a very important source of genetic variability for wheat improvement against biotic and abiotic stresses. By exploring the 5Mg short arm (5MgS) of Aegilops geniculata, we can apply chromosome genomics for the discovery of SNP markers and their use for monitoring alien introgressions in wheat (Triticum aestivum L). Results The short arm of chromosome 5Mg of Ae. geniculata Roth (syn. Ae. ovata L.; 2n = 4x = 28, UgUgMgMg) was flow-sorted from a wheat line in which it is maintained as a telocentric chromosome. DNA of the sorted arm was amplified and sequenced using an Illumina Hiseq 2000 with ~45x coverage. The sequence data was used for SNP discovery against wheat homoeologous group-5 assemblies. A total of 2,178 unique, 5MgS-specific SNPs were discovered. Randomly selected samples of 59 5MgS-specific SNPs were tested (44 by KASPar assay and 15 by Sanger sequencing) and 84% were validated. Of the selected SNPs, 97% mapped to a chromosome 5Mg addition to wheat (the source of t5MgS), and 94% to 5Mg introgressed from a different accession of Ae. geniculata substituting for chromosome 5D of wheat. The validated SNPs also identified chromosome segments of 5MgS origin in a set of T5D-5Mg translocation lines; eight SNPs (25%) mapped to TA5601 [T5DL · 5DS-5MgS(0.75)] and three (8%) to TA5602 [T5DL · 5DS-5MgS (0.95)]. SNPs (gsnp_5ms83 and gsnp_5ms94), tagging chromosome T5DL · 5DS-5MgS(0.95) with the smallest introgression carrying resistance to leaf rust (Lr57) and stripe rust (Yr40), were validated in two released germplasm lines with Lr57 and Yr40 genes. Conclusion This approach should be widely applicable for the identification of species/genome-specific SNPs. The development of a large number of SNP markers will facilitate the precise introgression and

  2. SNP Discovery for mapping alien introgressions in wheat.

    Science.gov (United States)

    Tiwari, Vijay K; Wang, Shichen; Sehgal, Sunish; Vrána, Jan; Friebe, Bernd; Kubaláková, Marie; Chhuneja, Praveen; Doležel, Jaroslav; Akhunov, Eduard; Kalia, Bhanu; Sabir, Jamal; Gill, Bikram S

    2014-04-10

    Monitoring alien introgressions in crop plants is difficult due to the lack of genetic and molecular mapping information on the wild crop relatives. The tertiary gene pool of wheat is a very important source of genetic variability for wheat improvement against biotic and abiotic stresses. By exploring the 5Mg short arm (5MgS) of Aegilops geniculata, we can apply chromosome genomics for the discovery of SNP markers and their use for monitoring alien introgressions in wheat (Triticum aestivum L). The short arm of chromosome 5Mg of Ae. geniculata Roth (syn. Ae. ovata L.; 2n = 4x = 28, UgUgMgMg) was flow-sorted from a wheat line in which it is maintained as a telocentric chromosome. DNA of the sorted arm was amplified and sequenced using an Illumina Hiseq 2000 with ~45x coverage. The sequence data was used for SNP discovery against wheat homoeologous group-5 assemblies. A total of 2,178 unique, 5MgS-specific SNPs were discovered. Randomly selected samples of 59 5MgS-specific SNPs were tested (44 by KASPar assay and 15 by Sanger sequencing) and 84% were validated. Of the selected SNPs, 97% mapped to a chromosome 5Mg addition to wheat (the source of t5MgS), and 94% to 5Mg introgressed from a different accession of Ae. geniculata substituting for chromosome 5D of wheat. The validated SNPs also identified chromosome segments of 5MgS origin in a set of T5D-5Mg translocation lines; eight SNPs (25%) mapped to TA5601 [T5DL · 5DS-5MgS(0.75)] and three (8%) to TA5602 [T5DL · 5DS-5MgS (0.95)]. SNPs (gsnp_5ms83 and gsnp_5ms94), tagging chromosome T5DL · 5DS-5MgS(0.95) with the smallest introgression carrying resistance to leaf rust (Lr57) and stripe rust (Yr40), were validated in two released germplasm lines with Lr57 and Yr40 genes. This approach should be widely applicable for the identification of species/genome-specific SNPs. The development of a large number of SNP markers will facilitate the precise introgression and monitoring of alien segments in crop

  3. Parentage Reconstruction in Eucalyptus nitens Using SNPs and Microsatellite Markers: A Comparative Analysis of Marker Data Power and Robustness.

    Directory of Open Access Journals (Sweden)

    Emily J Telfer

    Full Text Available Pedigree reconstruction using molecular markers enables efficient management of inbreeding in open-pollinated breeding strategies, replacing expensive and time-consuming controlled pollination. This is particularly useful in preferentially outcrossed, insect pollinated Eucalypts known to suffer considerable inbreeding depression from related matings. A single nucleotide polymorphism (SNP marker panel consisting of 106 markers was selected for pedigree reconstruction from the recently developed high-density Eucalyptus Infinium SNP chip (EuCHIP60K. The performance of this SNP panel for pedigree reconstruction in open-pollinated progenies of two Eucalyptus nitens seed orchards was compared with that of two microsatellite panels with 13 and 16 markers respectively. The SNP marker panel out-performed one of the microsatellite panels in the resolution power to reconstruct pedigrees and out-performed both panels with respect to data quality. Parentage of all but one offspring in each clonal seed orchard was correctly matched to the expected seed parent using the SNP marker panel, whereas parentage assignment to less than a third of the expected seed parents were supported using the 13-microsatellite panel. The 16-microsatellite panel supported all but one of the recorded seed parents, one better than the SNP panel, although there was still a considerable level of missing and inconsistent data. SNP marker data was considerably superior to microsatellite data in accuracy, reproducibility and robustness. Although microsatellites and SNPs data provide equivalent resolution for pedigree reconstruction, microsatellite analysis requires more time and experience to deal with the uncertainties of allele calling and faces challenges for data transferability across labs and over time. While microsatellite analysis will continue to be useful for some breeding tasks due to the high information content, existing infrastructure and low operating costs, the multi

  4. Parentage Reconstruction in Eucalyptus nitens Using SNPs and Microsatellite Markers: A Comparative Analysis of Marker Data Power and Robustness.

    Science.gov (United States)

    Telfer, Emily J; Stovold, Grahame T; Li, Yongjun; Silva-Junior, Orzenil B; Grattapaglia, Dario G; Dungey, Heidi S

    2015-01-01

    Pedigree reconstruction using molecular markers enables efficient management of inbreeding in open-pollinated breeding strategies, replacing expensive and time-consuming controlled pollination. This is particularly useful in preferentially outcrossed, insect pollinated Eucalypts known to suffer considerable inbreeding depression from related matings. A single nucleotide polymorphism (SNP) marker panel consisting of 106 markers was selected for pedigree reconstruction from the recently developed high-density Eucalyptus Infinium SNP chip (EuCHIP60K). The performance of this SNP panel for pedigree reconstruction in open-pollinated progenies of two Eucalyptus nitens seed orchards was compared with that of two microsatellite panels with 13 and 16 markers respectively. The SNP marker panel out-performed one of the microsatellite panels in the resolution power to reconstruct pedigrees and out-performed both panels with respect to data quality. Parentage of all but one offspring in each clonal seed orchard was correctly matched to the expected seed parent using the SNP marker panel, whereas parentage assignment to less than a third of the expected seed parents were supported using the 13-microsatellite panel. The 16-microsatellite panel supported all but one of the recorded seed parents, one better than the SNP panel, although there was still a considerable level of missing and inconsistent data. SNP marker data was considerably superior to microsatellite data in accuracy, reproducibility and robustness. Although microsatellites and SNPs data provide equivalent resolution for pedigree reconstruction, microsatellite analysis requires more time and experience to deal with the uncertainties of allele calling and faces challenges for data transferability across labs and over time. While microsatellite analysis will continue to be useful for some breeding tasks due to the high information content, existing infrastructure and low operating costs, the multi-species SNP resource

  5. Compression and fast retrieval of SNP data.

    Science.gov (United States)

    Sambo, Francesco; Di Camillo, Barbara; Toffolo, Gianna; Cobelli, Claudio

    2014-11-01

    The increasing interest in rare genetic variants and epistatic genetic effects on complex phenotypic traits is currently pushing genome-wide association study design towards datasets of increasing size, both in the number of studied subjects and in the number of genotyped single nucleotide polymorphisms (SNPs). This, in turn, is leading to a compelling need for new methods for compression and fast retrieval of SNP data. We present a novel algorithm and file format for compressing and retrieving SNP data, specifically designed for large-scale association studies. Our algorithm is based on two main ideas: (i) compress linkage disequilibrium blocks in terms of differences with a reference SNP and (ii) compress reference SNPs exploiting information on their call rate and minor allele frequency. Tested on two SNP datasets and compared with several state-of-the-art software tools, our compression algorithm is shown to be competitive in terms of compression rate and to outperform all tools in terms of time to load compressed data. Our compression and decompression algorithms are implemented in a C++ library, are released under the GNU General Public License and are freely downloadable from http://www.dei.unipd.it/~sambofra/snpack.html. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  6. An integrated SNP mining and utilization (ISMU) pipeline for next generation sequencing data.

    Science.gov (United States)

    Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A V S K; Varshney, Rajeev K

    2014-01-01

    Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly requires working knowledge of command line interface, massive computational resources and expertise which is a daunting task for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for plant genetics and breeding community with no computational expertise in order to discover SNPs and utilize in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. It has been developed in Java language and is available at http://hpc.icrisat.cgiar.org/ISMU as a standalone

  7. Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography

    Science.gov (United States)

    Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi

    2013-01-01

    New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined ‘elimination’ status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools were applied for a group of 201 multibacillary patients including six multi-case families from eleven departments. The location (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR-was developed for rapid and inexpensive strain typing of M. leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614 (T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614 (C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. PMID:23291420

  8. Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography.

    Science.gov (United States)

    Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi

    2013-03-01

    New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined 'elimination' status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools were applied for a group of 201 multibacillary patients including six multi-case families from eleven departments. The location (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR-was developed for rapid and inexpensive strain typing of Mycobacterium leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614 (T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614 (C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. Copyright © 2012 Elsevier B.V. All rights reserved.

  9. An Improved Opposition-Based Learning Particle Swarm Optimization for the Detection of SNP-SNP Interactions

    Science.gov (United States)

    Shang, Junliang; Sun, Yan; Li, Shengjun; Liu, Jin-Xing; Zheng, Chun-Hou; Zhang, Junying

    2015-01-01

    SNP-SNP interactions have been receiving increasing attention in understanding the mechanism underlying susceptibility to complex diseases. Though many works have been done for the detection of SNP-SNP interactions, the algorithmic development is still ongoing. In this study, an improved opposition-based learning particle swarm optimization (IOBLPSO) is proposed for the detection of SNP-SNP interactions. Highlights of IOBLPSO are the introduction of three strategies, namely, opposition-based learning, dynamic inertia weight, and a postprocedure. Opposition-based learning not only enhances the global explorative ability, but also avoids premature convergence. Dynamic inertia weight allows particles to cover a wider search space when the considered SNP is likely to be a random one and converges on promising regions of the search space while capturing a highly suspected SNP. The postprocedure is used to carry out a deep search in highly suspected SNP sets. Experiments of IOBLPSO are performed on both simulation data sets and a real data set of age-related macular degeneration, results of which demonstrate that IOBLPSO is promising in detecting SNP-SNP interactions. IOBLPSO might be an alternative to existing methods for detecting SNP-SNP interactions. PMID:26236727

  10. Development and characterization of a high density SNP genotyping assay for cattle.

    Directory of Open Access Journals (Sweden)

    Lakshmi K Matukumalli

    Full Text Available The success of genome-wide association (GWA studies for the detection of sequence variation affecting complex traits in human has spurred interest in the use of large-scale high-density single nucleotide polymorphism (SNP genotyping for the identification of quantitative trait loci (QTL and for marker-assisted selection in model and agricultural species. A cost-effective and efficient approach for the development of a custom genotyping assay interrogating 54,001 SNP loci to support GWA applications in cattle is described. A novel algorithm for achieving a compressed inter-marker interval distribution proved remarkably successful, with median interval of 37 kb and maximum predicted gap of <350 kb. The assay was tested on a panel of 576 animals from 21 cattle breeds and six outgroup species and revealed that from 39,765 to 46,492 SNP are polymorphic within individual breeds (average minor allele frequency (MAF ranging from 0.24 to 0.27. The assay also identified 79 putative copy number variants in cattle. Utility for GWA was demonstrated by localizing known variation for coat color and the presence/absence of horns to their correct genomic locations. The combination of SNP selection and the novel spacing algorithm allows an efficient approach for the development of high-density genotyping platforms in species having full or even moderate quality draft sequence. Aspects of the approach can be exploited in species which lack an available genome sequence. The BovineSNP50 assay described here is commercially available from Illumina and provides a robust platform for mapping disease genes and QTL in cattle.

  11. SNP discovery and chromosome anchoring provide the first physically-anchored hexaploid oat map and reveal synteny with model species.

    Directory of Open Access Journals (Sweden)

    Rebekah E Oliver

    Full Text Available A physically anchored consensus map is foundational to modern genomics research; however, construction of such a map in oat (Avena sativa L., 2n = 6x = 42 has been hindered by the size and complexity of the genome, the scarcity of robust molecular markers, and the lack of aneuploid stocks. Resources developed in this study include a modified SNP discovery method for complex genomes, a diverse set of oat SNP markers, and a novel chromosome-deficient SNP anchoring strategy. These resources were applied to build the first complete, physically-anchored consensus map of hexaploid oat. Approximately 11,000 high-confidence in silico SNPs were discovered based on nine million inter-varietal sequence reads of genomic and cDNA origin. GoldenGate genotyping of 3,072 SNP assays yielded 1,311 robust markers, of which 985 were mapped in 390 recombinant-inbred lines from six bi-parental mapping populations ranging in size from 49 to 97 progeny. The consensus map included 985 SNPs and 68 previously-published markers, resolving 21 linkage groups with a total map distance of 1,838.8 cM. Consensus linkage groups were assigned to 21 chromosomes using SNP deletion analysis of chromosome-deficient monosomic hybrid stocks. Alignments with sequenced genomes of rice and Brachypodium provide evidence for extensive conservation of genomic regions, and renewed encouragement for orthology-based genomic discovery in this important hexaploid species. These results also provide a framework for high-resolution genetic analysis in oat, and a model for marker development and map construction in other species with complex genomes and limited resources.

  12. SNP high-throughput screening in grapevine using the SNPlex™ genotyping system

    Directory of Open Access Journals (Sweden)

    Velasco Riccardo

    2008-01-01

    Full Text Available Abstract Background Until recently, only a small number of low- and mid-throughput methods have been used for single nucleotide polymorphism (SNP discovery and genotyping in grapevine (Vitis vinifera L.. However, following completion of the sequence of the highly heterozygous genome of Pinot Noir, it has been possible to identify millions of electronic SNPs (eSNPs thus providing a valuable source for high-throughput genotyping methods. Results Herein we report the first application of the SNPlex™ genotyping system in grapevine aiming at the anchoring of an eukaryotic genome. This approach combines robust SNP detection with automated assay readout and data analysis. 813 candidate eSNPs were developed from non-repetitive contigs of the assembled genome of Pinot Noir and tested in 90 progeny of Syrah × Pinot Noir cross. 563 new SNP-based markers were obtained and mapped. The efficiency rate of 69% was enhanced to 80% when multiple displacement amplification (MDA methods were used for preparation of genomic DNA for the SNPlex assay. Conclusion Unlike other SNP genotyping methods used to investigate thousands of SNPs in a few genotypes, or a few SNPs in around a thousand genotypes, the SNPlex genotyping system represents a good compromise to investigate several hundred SNPs in a hundred or more samples simultaneously. Therefore, the use of the SNPlex assay, coupled with whole genome amplification (WGA, is a good solution for future applications in well-equipped laboratories.

  13. Interactions Between SNP Alleles at Multiple Loci and Variation in Skin Pigmentation in 122 Caucasians

    Directory of Open Access Journals (Sweden)

    Sumiko Anno

    2007-01-01

    Full Text Available This study was undertaken to clarify the molecular basis for human skin color variation and the environmental adaptability to ultraviolet irradiation, with the ultimate goal of predicting the impact of changes in future environments on human health risk. One hundred twenty-two Caucasians living in Toledo, Ohio participated. Back and cheek skin were assayed for melanin as a quantitative trait marker. Buccal cell samples were collected and used for DNA extraction. DNA was used for SNP genotyping using the Masscode™ system, which entails two-step PCR amplification and a platform chemistry which allows cleavable mass spectrometry tags. The results show gene-gene interaction between SNP alleles at multiple loci (not necessarily on the same chromosome contributes to inter-individual skin color variation while suggesting a high probability of linkage disequilibrium. Confirmation of these findings requires further study with other ethic groups to analyze the associations between SNP alleles at multiple loci and human skin color variation. Our overarching goal is to use remote sensing data to clarify the interaction between atmospheric environments and SNP allelic frequency and investigate human adaptability to ultraviolet irradiation. Such information should greatly assist in the prediction of the health effects of future environmental changes such as ozone depletion and increased ultraviolet exposure. If such health effects are to some extent predictable, it might be possible to prepare for such changes in advance and thus reduce the extent of their impact.

  14. Tag SNP selection via a genetic algorithm.

    Science.gov (United States)

    Mahdevar, Ghasem; Zahiri, Javad; Sadeghi, Mehdi; Nowzari-Dalini, Abbas; Ahrabian, Hayedeh

    2010-10-01

    Single Nucleotide Polymorphisms (SNPs) provide valuable information on human evolutionary history and may lead us to identify genetic variants responsible for human complex diseases. Unfortunately, molecular haplotyping methods are costly, laborious, and time consuming; therefore, algorithms for constructing full haplotype patterns from small available data through computational methods, Tag SNP selection problem, are convenient and attractive. This problem is proved to be an NP-hard problem, so heuristic methods may be useful. In this paper we present a heuristic method based on genetic algorithm to find reasonable solution within acceptable time. The algorithm was tested on a variety of simulated and experimental data. In comparison with the exact algorithm, based on brute force approach, results show that our method can obtain optimal solutions in almost all cases and runs much faster than exact algorithm when the number of SNP sites is large. Our software is available upon request to the corresponding author.

  15. CFSAN SNP Pipeline: an automated method for constructing SNP matrices from next-generation sequence data

    Directory of Open Access Journals (Sweden)

    Steve Davis

    2015-08-01

    Full Text Available The analysis of next-generation sequence (NGS data is often a fragmented step-wise process. For example, multiple pieces of software are typically needed to map NGS reads, extract variant sites, and construct a DNA sequence matrix containing only single nucleotide polymorphisms (i.e., a SNP matrix for a set of individuals. The management and chaining of these software pieces and their outputs can often be a cumbersome and difficult task. Here, we present CFSAN SNP Pipeline, which combines into a single package the mapping of NGS reads to a reference genome with Bowtie2, processing of those mapping (BAM files using SAMtools, identification of variant sites using VarScan, and production of a SNP matrix using custom Python scripts. We also introduce a Python package (CFSAN SNP Mutator that when given a reference genome will generate variants of known position against which we validate our pipeline. We created 1,000 simulated Salmonella enterica sp. enterica Serovar Agona genomes at 100× and 20× coverage, each containing 500 SNPs, 20 single-base insertions and 20 single-base deletions. For the 100× dataset, the CFSAN SNP Pipeline recovered 98.9% of the introduced SNPs and had a false positive rate of 1.04 × 10−6; for the 20× dataset 98.8% of SNPs were recovered and the false positive rate was 8.34 × 10−7. Based on these results, CFSAN SNP Pipeline is a robust and accurate tool that it is among the first to combine into a single executable the myriad steps required to produce a SNP matrix from NGS data. Such a tool is useful to those working in an applied setting (e.g., food safety traceback investigations as well as for those interested in evolutionary questions.

  16. Exhaustive Genome-Wide Search for SNP-SNP Interactions Across 10 Human Diseases

    Directory of Open Access Journals (Sweden)

    William Murk

    2016-07-01

    Full Text Available The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs with minor allele frequency (MAF ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10−12. Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized.

  17. Development of admixture mapping panels for African Americans from commercial high-density SNP arrays

    Directory of Open Access Journals (Sweden)

    Dunston Georgia M

    2010-07-01

    Full Text Available Abstract Background Admixture mapping is a powerful approach for identifying genetic variants involved in human disease that exploits the unique genomic structure in recently admixed populations. To use existing published panels of ancestry-informative markers (AIMs for admixture mapping, markers have to be genotyped de novo for each admixed study sample and samples representing the ancestral parental populations. The increased availability of dense marker data on commercial chips has made it feasible to develop panels wherein the markers need not be predetermined. Results We developed two panels of AIMs (~2,000 markers each based on the Affymetrix Genome-Wide Human SNP Array 6.0 for admixture mapping with African American samples. These two AIM panels had good map power that was higher than that of a denser panel of ~20,000 random markers as well as other published panels of AIMs. As a test case, we applied the panels in an admixture mapping study of hypertension in African Americans in the Washington, D.C. metropolitan area. Conclusions Developing marker panels for admixture mapping from existing genome-wide genotype data offers two major advantages: (1 no de novo genotyping needs to be done, thereby saving costs, and (2 markers can be filtered for various quality measures and replacement markers (to minimize gaps can be selected at no additional cost. Panels of carefully selected AIMs have two major advantages over panels of random markers: (1 the map power from sparser panels of AIMs is higher than that of ~10-fold denser panels of random markers, and (2 clusters can be labeled based on information from the parental populations. With current technology, chip-based genome-wide genotyping is less expensive than genotyping ~20,000 random markers. The major advantage of using random markers is the absence of ascertainment effects resulting from the process of selecting markers. The ability to develop marker panels informative for ancestry from

  18. SNPServer: a real-time SNP discovery tool.

    Science.gov (United States)

    Savage, David; Batley, Jacqueline; Erwin, Tim; Logan, Erica; Love, Christopher G; Lim, Geraldine A C; Mongin, Emmanuel; Barker, Gary; Spangenberg, German C; Edwards, David

    2005-07-01

    SNPServer is a real-time flexible tool for the discovery of SNPs (single nucleotide polymorphisms) within DNA sequence data. The program uses BLAST, to identify related sequences, and CAP3, to cluster and align these sequences. The alignments are parsed to the SNP discovery software autoSNP, a program that detects SNPs and insertion/deletion polymorphisms (indels). Alternatively, lists of related sequences or pre-assembled sequences may be entered for SNP discovery. SNPServer and autoSNP use redundancy to differentiate between candidate SNPs and sequence errors. For each candidate SNP, two measures of confidence are calculated, the redundancy of the polymorphism at a SNP locus and the co-segregation of the candidate SNP with other SNPs in the alignment. SNPServer is available at http://hornbill.cspp.latrobe.edu.au/snpdiscovery.html.

  19. Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array.

    Science.gov (United States)

    Antanaviciute, Laima; Fernández-Fernández, Felicidad; Jansen, Johannes; Banchi, Elisa; Evans, Katherine M; Viola, Roberto; Velasco, Riccardo; Dunwell, Jim M; Troggio, Michela; Sargent, Daniel J

    2012-05-25

    A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the 'Golden Delicious' genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the 'Golden Delicious' pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been

  20. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.

    Science.gov (United States)

    Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos

    2015-08-01

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  1. Gene-based testing of interactions in association studies of quantitative traits.

    Directory of Open Access Journals (Sweden)

    Li Ma

    Full Text Available Various methods have been developed for identifying gene-gene interactions in genome-wide association studies (GWAS. However, most methods focus on individual markers as the testing unit, and the large number of such tests drastically erodes statistical power. In this study, we propose novel interaction tests of quantitative traits that are gene-based and that confer advantage in both statistical power and biological interpretation. The framework of gene-based gene-gene interaction (GGG tests combine marker-based interaction tests between all pairs of markers in two genes to produce a gene-level test for interaction between the two. The tests are based on an analytical formula we derive for the correlation between marker-based interaction tests due to linkage disequilibrium. We propose four GGG tests that extend the following P value combining methods: minimum P value, extended Simes procedure, truncated tail strength, and truncated P value product. Extensive simulations point to correct type I error rates of all tests and show that the two truncated tests are more powerful than the other tests in cases of markers involved in the underlying interaction not being directly genotyped and in cases of multiple underlying interactions. We applied our tests to pairs of genes that exhibit a protein-protein interaction to test for gene-level interactions underlying lipid levels using genotype data from the Atherosclerosis Risk in Communities study. We identified five novel interactions that are not evident from marker-based interaction testing and successfully replicated one of these interactions, between SMAD3 and NEDD9, in an independent sample from the Multi-Ethnic Study of Atherosclerosis. We conclude that our GGG tests show improved power to identify gene-level interactions in existing, as well as emerging, association studies.

  2. saSNP Approach for Scalable SNP Analyses of Multiple Bacterial or Viral Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Gardner, Shea [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Slezak, Tom [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

    2010-07-27

    With the flood of whole genome finished and draft microbial sequences, we need faster, more scalable bioinformatics tools for sequence comparison. An algorithm is described to find single nucleotide polymorphisms (SNPs) in whole genome data. It scales to hundreds of bacterial or viral genomes, and can be used for finished and/or draft genomes available as unassembled contigs. The method is fast to compute, finding SNPs and building a SNP phylogeny in seconds to hours. We use it to identify thousands of putative SNPs from all publicly available Filoviridae, Poxviridae, foot-and-mouth disease virus, Bacillus, and Escherichia coli genomes and plasmids. The SNP-based trees that result are consistent with known taxonomy and trees determined in other studies. The approach we describe can handle as input hundreds of gigabases of sequence in a single run. The algorithm is based on k-mer analysis using a suffix array, so we call it saSNP.

  3. SNP mining porcine ESTs with MAVIANT, a novel tool for SNP evaluation and annotation

    DEFF Research Database (Denmark)

    Panitz, Frank; Stengaard, Henrik; Hornshoj, Henrik

    2007-01-01

    MOTIVATION: Single nucleotide polymorphisms (SNPs) analysis is an important means to study genetic variation. A fast and cost-efficient approach to identify large numbers of novel candidates is the SNP mining of large scale sequencing projects. The increasing availability of sequence trace data...... manual annotation, which is immediately accessible and can be easily shared with external collaborators. RESULTS: Large-scale SNP mining of polymorphisms bases on porcine EST sequences yielded more than 7900 candidate SNPs in coding regions (cSNPs), which were annotated relative to the human genome. Non...

  4. Development and characterization of 35 single nucleotide polymorphism markers for the brown alga Fucus vesiculosus

    NARCIS (Netherlands)

    Canovas, Fernando; Mota, Catarina; Ferreira-Costa, Joana; Serrao, Ester; Coyer, Jim; Olsen, Jeanine; Pearson, Gareth

    2011-01-01

    We characterized 35 single nucleotide polymorphism (SNP) markers for the brown alga Fucus vesiculosus. Based on existing Fucus Expressed Sequence Tag libraries for heat and desiccation-stressed tissue, SNPs were developed and confirmed by re-sequencing cDNA from a diverse panel of individuals. SNP

  5. SNP discovery in the bovine milk transcriptome using RNA-Seq technology.

    Science.gov (United States)

    Cánovas, Angela; Rincon, Gonzalo; Islas-Trejo, Alma; Wickramasinghe, Saumya; Medrano, Juan F

    2010-12-01

    High-throughput sequencing of RNA (RNA-Seq) was developed primarily to analyze global gene expression in different tissues. However, it also is an efficient way to discover coding SNPs. The objective of this study was to perform a SNP discovery analysis in the milk transcriptome using RNA-Seq. Seven milk samples from Holstein cows were analyzed by sequencing cDNAs using the Illumina Genome Analyzer system. We detected 19,175 genes expressed in milk samples corresponding to approximately 70% of the total number of genes analyzed. The SNP detection analysis revealed 100,734 SNPs in Holstein samples, and a large number of those corresponded to differences between the Holstein breed and the Hereford bovine genome assembly Btau4.0. The number of polymorphic SNPs within Holstein cows was 33,045. The accuracy of RNA-Seq SNP discovery was tested by comparing SNPs detected in a set of 42 candidate genes expressed in milk that had been resequenced earlier using Sanger sequencing technology. Seventy of 86 SNPs were detected using both RNA-Seq and Sanger sequencing technologies. The KASPar Genotyping System was used to validate unique SNPs found by RNA-Seq but not observed by Sanger technology. Our results confirm that analyzing the transcriptome using RNA-Seq technology is an efficient and cost-effective method to identify SNPs in transcribed regions. This study creates guidelines to maximize the accuracy of SNP discovery and prevention of false-positive SNP detection, and provides more than 33,000 SNPs located in coding regions of genes expressed during lactation that can be used to develop genotyping platforms to perform marker-trait association studies in Holstein cattle.

  6. SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate

    Science.gov (United States)

    Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon

    2016-01-01

    Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.

  7. Predicting the disease of Alzheimer with SNP biomarkers and clinical data using data mining classification approach: decision tree.

    Science.gov (United States)

    Erdoğan, Onur; Aydin Son, Yeşim

    2014-01-01

    Single Nucleotide Polymorphisms (SNPs) are the most common genomic variations where only a single nucleotide differs between individuals. Individual SNPs and SNP profiles associated with diseases can be utilized as biological markers. But there is a need to determine the SNP subsets and patients' clinical data which is informative for the diagnosis. Data mining approaches have the highest potential for extracting the knowledge from genomic datasets and selecting the representative SNPs as well as most effective and informative clinical features for the clinical diagnosis of the diseases. In this study, we have applied one of the widely used data mining classification methodology: "decision tree" for associating the SNP biomarkers and significant clinical data with the Alzheimer's disease (AD), which is the most common form of "dementia". Different tree construction parameters have been compared for the optimization, and the most accurate tree for predicting the AD is presented.

  8. Solar Radiation-Associated Adaptive SNP Genetic Differentiation in Wild Emmer Wheat, Triticum dicoccoides.

    Science.gov (United States)

    Ren, Jing; Chen, Liang; Jin, Xiaoli; Zhang, Miaomiao; You, Frank M; Wang, Jirui; Frenkel, Vladimir; Yin, Xuegui; Nevo, Eviatar; Sun, Dongfa; Luo, Ming-Cheng; Peng, Junhua

    2017-01-01

    Whole-genome scans with large number of genetic markers provide the opportunity to investigate local adaptation in natural populations and identify candidate genes under positive selection. In the present study, adaptation genetic differentiation associated with solar radiation was investigated using 695 polymorphic SNP markers in wild emmer wheat originated in a micro-site at Yehudiyya, Israel. The test involved two solar radiation niches: (1) sun, in-between trees; and (2) shade, under tree canopy, separated apart by a distance of 2-4 m. Analysis of molecular variance showed a small (0.53%) but significant portion of overall variation between the sun and shade micro-niches, indicating a non-ignorable genetic differentiation between sun and shade habitats. Fifty SNP markers showed a medium (0.05 ≤ F ST ≤ 0.15) or high genetic differentiation ( F ST > 0.15). A total of 21 outlier loci under positive selection were identified by using four different F ST -outlier testing algorithms. The markers and genome locations under positive selection are consistent with the known patterns of selection. These results suggested that genetic differentiation between sun and shade habitats is substantial, radiation-associated, and therefore ecologically determined. Hence, the results of this study reflected effects of natural selection through solar radiation on EST-related SNP genetic diversity, resulting presumably in different adaptive complexes at a micro-scale divergence. The present work highlights the evolutionary theory and application significance of solar radiation-driven natural selection in wheat improvement.

  9. Comparison of SNP Variation and Distribution in Indigenous Ethiopian and Korean Cattle (Hanwoo Populations

    Directory of Open Access Journals (Sweden)

    Zewdu Edea

    2012-09-01

    Full Text Available Although a large number of single nucleotide polymorphisms (SNPs have been identified from the bovine genome-sequencing project, few of these have been validated at large in Bos indicus breeds. We have genotyped 192 animals, representing 5 cattle populations of Ethiopia, with the Illumina Bovine 8K SNP BeadChip. These include 1 Sanga (Danakil, 3 zebu (Borana, Arsi and Ambo, and 1 zebu × Sanga intermediate (Horro breeds. The Hanwoo (Bos taurus was included for comparison purposes. Analysis of 7,045 SNP markers revealed that the mean minor allele frequency (MAF was 0.23, 0.22, 0.21, 0.21, 0.23, and 0.29 for Ambo, Arsi, Borana, Danakil, Horro, and Hanwoo, respectively. Significant differences of MAF were observed between the indigenous Ethiopian cattle populations and Hanwoo breed (p < 0.001. Across the Ethiopian cattle populations, a common variant MAF (≥0.10 and ≤0.5 accounted for an overall estimated 73.79% of the 7,045 SNPs. The Hanwoo displayed a higher proportion of common variant SNPs (90%. Investigation within Ethiopian cattle populations showed that on average, 16.64% of the markers were monomorphic, but in the Hanwoo breed, only 6% of the markers were monomorphic. Across the sampled Ethiopian cattle populations, the mean observed and expected heterozygosities were 0.314 and 0.313, respectively. The level of SNP variation identified in this particular study highlights that these markers can be potentially used for genetic studies in African cattle breeds.

  10. The clinical application of single-sperm-based SNP haplotyping for PGD of osteogenesis imperfecta.

    Science.gov (United States)

    Chen, Linjun; Diao, Zhenyu; Xu, Zhipeng; Zhou, Jianjun; Yan, Guijun; Sun, Haixiang

    2018-05-15

    Osteogenesis imperfecta (OI) is a genetically heterogeneous disorder, presenting either autosomal dominant, autosomal recessive or X-linked inheritance patterns. The majority of OI cases are autosomal dominant and are caused by heterozygous mutations in either the COL1A1 or COL1A2 gene. In these dominant disorders, allele dropout (ADO) can lead to misdiagnosis in preimplantation genetic diagnosis (PGD). Polymorphic markers linked to the mutated genes have been used to establish haplotypes for identifying ADO and ensuring the accuracy of PGD. However, the haplotype of male patients cannot be determined without data from affected relatives. Here, we developed a method for single-sperm-based single-nucleotide polymorphism (SNP) haplotyping via next-generation sequencing (NGS) for the PGD of OI. After NGS, 10 informative polymorphic SNP markers located upstream and downstream of the COL1A1 gene and its pathogenic mutation site were linked to individual alleles in a single sperm from an affected male. After haplotyping, a normal blastocyst was transferred to the uterus for a subsequent frozen embryo transfer cycle. The accuracy of PGD was confirmed by amniocentesis at 19 weeks of gestation. A healthy infant weighing 4,250 g was born via vaginal delivery at the 40th week of gestation. Single-sperm-based SNP haplotyping can be applied for PGD of any monogenic disorders or de novo mutations in males in whom the haplotype of paternal mutations cannot be determined due to a lack of affected relatives. ADO: allele dropout; DI: dentinogenesis imperfect; ESHRE: European Society of Human Reproduction and Embryology; FET: frozen embryo transfer; gDNA: genomic DNA; ICSI: intracytoplasmic sperm injection; IVF: in vitro fertilization; MDA: multiple displacement amplification; NGS: next-generation sequencing; OI: osteogenesis imperfect; PBS: phosphate buffer saline; PCR: polymerase chain reaction; PGD: preimplantation genetic diagnosis; SNP: single-nucleotide polymorphism; STR

  11. Genome-wide SNP identification in multiple morphotypes of allohexaploid tall fescue (Festuca arundinacea Schreb

    Directory of Open Access Journals (Sweden)

    Hand Melanie L

    2012-06-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs provide essential tools for the advancement of research in plant genomics, and the development of SNP resources for many species has been accelerated by the capabilities of second-generation sequencing technologies. The current study aimed to develop and use a novel bioinformatic pipeline to generate a comprehensive collection of SNP markers within the agriculturally important pasture grass tall fescue; an outbreeding allopolyploid species displaying three distinct morphotypes: Continental, Mediterranean and rhizomatous. Results A bioinformatic pipeline was developed that successfully identified SNPs within genotypes from distinct tall fescue morphotypes, following the sequencing of 414 polymerase chain reaction (PCR – generated amplicons using 454 GS FLX technology. Equivalent amplicon sets were derived from representative genotypes of each morphotype, including six Continental, five Mediterranean and one rhizomatous. A total of 8,584 and 2,292 SNPs were identified with high confidence within the Continental and Mediterranean morphotypes respectively. The success of the bioinformatic approach was demonstrated through validation (at a rate of 70% of a subset of 141 SNPs using both SNaPshot™ and GoldenGate™ assay chemistries. Furthermore, the quantitative genotyping capability of the GoldenGate™ assay revealed that approximately 30% of the putative SNPs were accessible to co-dominant scoring, despite the hexaploid genome structure. The sub-genome-specific origin of each SNP validated from Continental tall fescue was predicted using a phylogenetic approach based on comparison with orthologous sequences from predicted progenitor species. Conclusions Using the appropriate bioinformatic approach, amplicon resequencing based on 454 GS FLX technology is an effective method for the identification of polymorphic SNPs within the genomes of Continental and Mediterranean tall fescue. The

  12. SNP-SNP interaction analysis of NF-κB signaling pathway on breast cancer survival

    DEFF Research Database (Denmark)

    Jamshidi, Maral; Fagerholm, Rainer; Khan, Sofia

    2015-01-01

    of SNP pairs without and with an interaction term. We found two interacting pairs associating with prognosis: patients simultaneously homozygous for the rare alleles of rs5996080 and rs7973914 had worse survival (HRinteraction 6.98, 95% CI=3.3-14.4, P=1.42E-07), and patients carrying at least one rare...

  13. Transcriptomic SNP discovery for custom genotyping arrays: impacts of sequence data, SNP calling method and genotyping technology on the probability of validation success.

    Science.gov (United States)

    Humble, Emily; Thorne, Michael A S; Forcada, Jaume; Hoffman, Joseph I

    2016-08-26

    Single nucleotide polymorphism (SNP) discovery is an important goal of many studies. However, the number of 'putative' SNPs discovered from a sequence resource may not provide a reliable indication of the number that will successfully validate with a given genotyping technology. For this it may be necessary to account for factors such as the method used for SNP discovery and the type of sequence data from which it originates, suitability of the SNP flanking sequences for probe design, and genomic context. To explore the relative importance of these and other factors, we used Illumina sequencing to augment an existing Roche 454 transcriptome assembly for the Antarctic fur seal (Arctocephalus gazella). We then mapped the raw Illumina reads to the new hybrid transcriptome using BWA and BOWTIE2 before calling SNPs with GATK. The resulting markers were pooled with two existing sets of SNPs called from the original 454 assembly using NEWBLER and SWAP454. Finally, we explored the extent to which SNPs discovered using these four methods overlapped and predicted the corresponding validation outcomes for both Illumina Infinium iSelect HD and Affymetrix Axiom arrays. Collating markers across all discovery methods resulted in a global list of 34,718 SNPs. However, concordance between the methods was surprisingly poor, with only 51.0 % of SNPs being discovered by more than one method and 13.5 % being called from both the 454 and Illumina datasets. Using a predictive modeling approach, we could also show that SNPs called from the Illumina data were on average more likely to successfully validate, as were SNPs called by more than one method. Above and beyond this pattern, predicted validation outcomes were also consistently better for Affymetrix Axiom arrays. Our results suggest that focusing on SNPs called by more than one method could potentially improve validation outcomes. They also highlight possible differences between alternative genotyping technologies that could be

  14. A SNP Genotyping Array for Hexaploid Oat

    Directory of Open Access Journals (Sweden)

    Nicholas A. Tinker

    2014-11-01

    Full Text Available Recognizing a need in cultivated hexaploid oat ( L. for a reliable set of reference single nucleotide polymorphisms (SNPs, we have developed a 6000 (6K BeadChip design containing 257 Infinium I and 5486 Infinium II designs corresponding to 5743 SNPs. Of those, 4975 SNPs yielded successful assays after array manufacturing. These SNPs were discovered based on a variety of bioinformatics pipelines in complementary DNA (cDNA and genomic DNA originating from 20 or more diverse oat cultivars. The array was validated in 1100 samples from six recombinant inbred line (RIL mapping populations and sets of diverse oat cultivars and breeding lines, and provided approximately 3500 discernible Mendelian polymorphisms. Here, we present an annotation of these SNPs, including methods of discovery, gene identification and orthology, population-genetic characteristics, and tentative positions on an oat consensus map. We also evaluate a new cluster-based method of calling SNPs. The SNP design sequences are made publicly available, and the full SNP genotyping platform is available for commercial purchase from an independent third party.

  15. A high-density SNP map for accurate mapping of seed fibre QTL in Brassica napus L.

    Directory of Open Access Journals (Sweden)

    Liezhao Liu

    Full Text Available A high density genetic linkage map for the complex allotetraploid crop species Brassica napus (oilseed rape was constructed in a late-generation recombinant inbred line (RIL population, using genome-wide single nucleotide polymorphism (SNP markers assayed by the Brassica 60 K Infinium BeadChip Array. The linkage map contains 9164 SNP markers covering 1832.9 cM. 1232 bins account for 7648 of the markers. A subset of 2795 SNP markers, with an average distance of 0.66 cM between adjacent markers, was applied for QTL mapping of seed colour and the cell wall fiber components acid detergent lignin (ADL, cellulose and hemicellulose. After phenotypic analyses across four different environments a total of 11 QTL were detected for seed colour and fiber traits. The high-density map considerably improved QTL resolution compared to the previous low-density maps. A previously identified major QTL with very high effects on seed colour and ADL was pinpointed to a narrow genome interval on chromosome A09, while a minor QTL explaining 8.1% to 14.1% of variation for ADL was detected on chromosome C05. Five and three QTL accounting for 4.7% to 21.9% and 7.3% to 16.9% of the phenotypic variation for cellulose and hemicellulose, respectively, were also detected. To our knowledge this is the first description of QTL for seed cellulose and hemicellulose in B. napus, representing interesting new targets for improving oil content. The high density SNP genetic map enables navigation from interesting B. napus QTL to Brassica genome sequences, giving useful new information for understanding the genetics of key seed quality traits in rapeseed.

  16. V-MitoSNP: visualization of human mitochondrial SNPs

    Directory of Open Access Journals (Sweden)

    Tsui Ke-Hung

    2006-08-01

    Full Text Available Abstract Background Mitochondrial single nucleotide polymorphisms (mtSNPs constitute important data when trying to shed some light on human diseases and cancers. Unfortunately, providing relevant mtSNP genotyping information in mtDNA databases in a neatly organized and transparent visual manner still remains a challenge. Amongst the many methods reported for SNP genotyping, determining the restriction fragment length polymorphisms (RFLPs is still one of the most convenient and cost-saving methods. In this study, we prepared the visualization of the mtDNA genome in a way, which integrates the RFLP genotyping information with mitochondria related cancers and diseases in a user-friendly, intuitive and interactive manner. The inherent problem associated with mtDNA sequences in BLAST of the NCBI database was also solved. Description V-MitoSNP provides complete mtSNP information for four different kinds of inputs: (1 color-coded visual input by selecting genes of interest on the genome graph, (2 keyword search by locus, disease and mtSNP rs# ID, (3 visualized input of nucleotide range by clicking the selected region of the mtDNA sequence, and (4 sequences mtBLAST. The V-MitoSNP output provides 500 bp (base pairs flanking sequences for each SNP coupled with the RFLP enzyme and the corresponding natural or mismatched primer sets. The output format enables users to see the SNP genotype pattern of the RFLP by virtual electrophoresis of each mtSNP. The rate of successful design of enzymes and primers for RFLPs in all mtSNPs was 99.1%. The RFLP information was validated by actual agarose electrophoresis and showed successful results for all mtSNPs tested. The mtBLAST function in V-MitoSNP provides the gene information within the input sequence rather than providing the complete mitochondrial chromosome as in the NCBI BLAST database. All mtSNPs with rs number entries in NCBI are integrated in the corresponding SNP in V-MitoSNP. Conclusion V-MitoSNP is a web

  17. Construction of an SNP-based high-density linkage map for flax (Linum usitatissimum L.) using specific length amplified fragment sequencing (SLAF-seq) technology.

    Science.gov (United States)

    Yi, Liuxi; Gao, Fengyun; Siqin, Bateer; Zhou, Yu; Li, Qiang; Zhao, Xiaoqing; Jia, Xiaoyun; Zhang, Hui

    2017-01-01

    Flax is an important crop for oil and fiber, however, no high-density genetic maps have been reported for this species. Specific length amplified fragment sequencing (SLAF-seq) is a high-resolution strategy for large scale de novo discovery and genotyping of single nucleotide polymorphisms. In this study, SLAF-seq was employed to develop SNP markers in an F2 population to construct a high-density genetic map for flax. In total, 196.29 million paired-end reads were obtained. The average sequencing depth was 25.08 in male parent, 32.17 in the female parent, and 9.64 in each F2 progeny. In total, 389,288 polymorphic SLAFs were detected, from which 260,380 polymorphic SNPs were developed. After filtering, 4,638 SNPs were found suitable for genetic map construction. The final genetic map included 4,145 SNP markers on 15 linkage groups and was 2,632.94 cM in length, with an average distance of 0.64 cM between adjacent markers. To our knowledge, this map is the densest SNP-based genetic map for flax. The SNP markers and genetic map reported in here will serve as a foundation for the fine mapping of quantitative trait loci (QTLs), map-based gene cloning and marker assisted selection (MAS) for flax.

  18. Identification, Characterization, and Mapping of a Novel SNP Associated with Body Color Transparency in Juvenile Red Sea Bream (Pagrus major).

    Science.gov (United States)

    Sawayama, Eitaro; Noguchi, Daiki; Nakayama, Kei; Takagi, Motohiro

    2018-03-23

    We previously reported a body color deformity in juvenile red sea bream, which shows transparency in the juvenile stage because of delayed chromatophore development compared with normal individuals, and this finding suggested a genetic cause based on parentage assessments. To conduct marker-assisted selection to eliminate broodstock inheriting the causative gene, developing DNA markers associated with the phenotype was needed. We first conducted SNP mining based on AFLP analysis using bulked-DNA from normal and transparent individuals. One SNP was identified from a transparent-specific AFLP fragment, which significantly associated with transparent individuals. Two alleles (A/G) were observed in this locus, and the genotype G/G was dominantly observed in the transparent groups (97.1%) collected from several production lots produced from different broodstock populations. A few normal individuals inherited the G/G genotype (5.0%), but the A/A and A/G genotypes were dominantly observed in the normal groups. The homologs region of the SNP was searched using a medaka genome database, and intron 12 of the Nell2a gene (located on chromosome 6 of the medaka genome) was highly matched. We also mapped the red sea bream Nell2a gene on the previously developed linkage maps, and this gene was mapped on a male linkage group, LG4-M. The newly found SNP was useful in eliminating broodstock possessing the causative gene of the body color transparency observed in juvenile stage of red sea bream.

  19. Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality

    Directory of Open Access Journals (Sweden)

    Shahin S Ali

    2015-08-01

    Full Text Available Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed SNP markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNASeq-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups. Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri.

  20. Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality.

    Science.gov (United States)

    Ali, Shahin S; Shao, Jonathan; Strem, Mary D; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W; Bailey, Bryan A

    2015-01-01

    Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri.

  1. A general SNP-based molecular barcode for Plasmodium falciparum identification and tracking

    Directory of Open Access Journals (Sweden)

    Rosen David

    2008-10-01

    Full Text Available Abstract Background Single nucleotide polymorphism (SNP genotyping provides the means to develop a practical, rapid, inexpensive assay that will uniquely identify any Plasmodium falciparum parasite using a small amount of DNA. Such an assay could be used to distinguish recrudescence from re-infection in drug trials, to monitor the frequency and distribution of specific parasites in a patient population undergoing drug treatment or vaccine challenge, or for tracking samples and determining purity of isolates in the laboratory during culture adaptation and sub-cloning, as well as routine passage. Methods A panel of twenty-four SNP markers has been identified that exhibit a high minor allele frequency (average MAF > 35%, for which robust TaqMan genotyping assays were constructed. All SNPs were identified through whole genome sequencing and MAF was estimated through Affymetrix array-based genotyping of a worldwide collection of parasites. These assays create a "molecular barcode" to uniquely identify a parasite genome. Results Using 24 such markers no two parasites known to be of independent origin have yet been found to have the same allele signature. The TaqMan genotyping assays can be performed on a variety of samples including cultured parasites, frozen whole blood, or whole blood spotted onto filter paper with a success rate > 99%. Less than 5 ng of parasite DNA is needed to complete a panel of 24 markers. The ability of this SNP panel to detect and identify parasites was compared to the standard molecular methods, MSP-1 and MSP-2 typing. Conclusion This work provides a facile field-deployable genotyping tool that can be used without special skills with standard lab equipment, and at reasonable cost that will unambiguously identify and track P. falciparum parasites both from patient samples and in the laboratory.

  2. SNIT: SNP identification for strain typing

    Directory of Open Access Journals (Sweden)

    Reifman Jaques

    2011-09-01

    Full Text Available Abstract With ever-increasing numbers of microbial genomes being sequenced, efficient tools are needed to perform strain-level identification of any newly sequenced genome. Here, we present the SNP identification for strain typing (SNIT pipeline, a fast and accurate software system that compares a newly sequenced bacterial genome with other genomes of the same species to identify single nucleotide polymorphisms (SNPs and small insertions/deletions (indels. Based on this information, the pipeline analyzes the polymorphic loci present in all input genomes to identify the genome that has the fewest differences with the newly sequenced genome. Similarly, for each of the other genomes, SNIT identifies the input genome with the fewest differences. Results from five bacterial species show that the SNIT pipeline identifies the correct closest neighbor with 75% to 100% accuracy. The SNIT pipeline is available for download at http://www.bhsai.org/snit.html

  3. SNP Discovery and Development of a High-Density Genotyping Array for Sunflower

    Science.gov (United States)

    Bachlava, Eleni; Taylor, Christopher A.; Tang, Shunxue; Bowers, John E.; Mandel, Jennifer R.; Burke, John M.; Knapp, Steven J.

    2012-01-01

    Recent advances in next-generation DNA sequencing technologies have made possible the development of high-throughput SNP genotyping platforms that allow for the simultaneous interrogation of thousands of single-nucleotide polymorphisms (SNPs). Such resources have the potential to facilitate the rapid development of high-density genetic maps, and to enable genome-wide association studies as well as molecular breeding approaches in a variety of taxa. Herein, we describe the development of a SNP genotyping resource for use in sunflower (Helianthus annuus L.). This work involved the development of a reference transcriptome assembly for sunflower, the discovery of thousands of high quality SNPs based on the generation and analysis of ca. 6 Gb of transcriptome re-sequencing data derived from multiple genotypes, the selection of 10,640 SNPs for inclusion in the genotyping array, and the use of the resulting array to screen a diverse panel of sunflower accessions as well as related wild species. The results of this work revealed a high frequency of polymorphic SNPs and relatively high level of cross-species transferability. Indeed, greater than 95% of successful SNP assays revealed polymorphism, and more than 90% of these assays could be successfully transferred to related wild species. Analysis of the polymorphism data revealed patterns of genetic differentiation that were largely congruent with the evolutionary history of sunflower, though the large number of markers allowed for finer resolution than has previously been possible. PMID:22238659

  4. Identification of Mendelian inconsistencies between SNP and pedigree information of sibs

    Directory of Open Access Journals (Sweden)

    Calus Mario PL

    2011-10-01

    Full Text Available Abstract Background Using SNP genotypes to apply genomic selection in breeding programs is becoming common practice. Tools to edit and check the quality of genotype data are required. Checking for Mendelian inconsistencies makes it possible to identify animals for which pedigree information and genotype information are not in agreement. Methods Straightforward tests to detect Mendelian inconsistencies exist that count the number of opposing homozygous marker (e.g. SNP genotypes between parent and offspring (PAR-OFF. Here, we develop two tests to identify Mendelian inconsistencies between sibs. The first test counts SNP with opposing homozygous genotypes between sib pairs (SIBCOUNT. The second test compares pedigree and SNP-based relationships (SIBREL. All tests iteratively remove animals based on decreasing numbers of inconsistent parents and offspring or sibs. The PAR-OFF test, followed by either SIB test, was applied to a dataset comprising 2,078 genotyped cows and 211 genotyped sires. Theoretical expectations for distributions of test statistics of all three tests were calculated and compared to empirically derived values. Type I and II error rates were calculated after applying the tests to the edited data, while Mendelian inconsistencies were introduced by permuting pedigree against genotype data for various proportions of animals. Results Both SIB tests identified animal pairs for which pedigree and genomic relationships could be considered as inconsistent by visual inspection of a scatter plot of pairwise pedigree and SNP-based relationships. After removal of 235 animals with the PAR-OFF test, SIBCOUNT (SIBREL identified 18 (22 additional inconsistent animals. Seventeen animals were identified by both methods. The numbers of incorrectly deleted animals (Type I error, were equally low for both methods, while the numbers of incorrectly non-deleted animals (Type II error, were considerably higher for SIBREL compared to SIBCOUNT. Conclusions

  5. Evaluation of Bovine High-Density SNP Genotyping Array in Indigenous Dairy Cattle Breeds.

    Science.gov (United States)

    Dash, S; Singh, A; Bhatia, A K; Jayakumar, S; Sharma, A; Singh, S; Ganguly, I; Dixit, S P

    2018-04-03

    In total 52 samples of Sahiwal ( 19 ), Tharparkar ( 17 ), and Gir ( 16 ) were genotyped by using BovineHD SNP chip to analyze minor allele frequency (MAF), genetic diversity, and linkage disequilibrium among these cattle. The common SNPs of BovineHD and 54K SNP Chips were also extracted and evaluated for their performance. Only 40%-50% SNPs of these arrays was found informative for genetic analysis in these cattle breeds. The overall mean of MAF for SNPs of BovineHD SNPChip was 0.248 ± 0.006, 0.241 ± 0.007, and 0.242 ± 0.009 in Sahiwal, Tharparkar and Gir, respectively, while that for 54K SNPs was on lower side. The average Reynold's genetic distance between breeds ranged from 0.042 to 0.055 based on BovineHD Beadchip, and from 0.052 to 0.084 based on 54K SNP Chip. The estimates of genetic diversity based on HD and 54K chips were almost same and, hence, low density chip seems to be good enough to decipher genetic diversity of these cattle breeds. The linkage disequilibrium started decaying (r 2  < 0.2) at 140 kb inter-marker distance and, hence, a 20K low density customized SNP array from HD chip could be designed for genomic selection in these cattle else the 54K Bead Chip as such will be useful.

  6. Association between SNP and haplotypes in PPARGCl and adiponectin genes and bone mineral density in Chinese nuclear families

    Institute of Scientific and Technical Information of China (English)

    Zhen-lin ZHANG; Jin-wei HE; Yue-juan QIN; Yun-qiu HU; Miao LI; Yu-juan LIU; Hao ZHANG; Wei-wei HU

    2007-01-01

    Aim: To assess the contribution of single nucleotide polymorphisms (SNP) and haplotypes in the peroxisome proliferator-activated receptor-γ co-activator-1(PPARGC1) and adiponectin genes to normal bone mineral density (BMD) variation in healthy Chinese women and men. Methods: We performed population-based (ANOVA) and family-based (quantitative trait locus transmission disequi-librium test) association studies of PPARGC1 and adiponectin genes. SNP in the 2 genes were genotyped. BMD was measured using dual-energy X-ray absorptiometry in the lumbar spine and hip in 401 nuclear families with a total of1260 subjects, including 458 premenopausal women, 20-40 years of age; 401 post-menopausal women (mothers), 43-74 years of age; and 401 men (fathers), 49-76years of age. Results: Significant within-family association was found between the Thr394Thr polymorphism in the PPGAGC1 gene and peak BMD in the femoral neck (P=0.026). Subsequent permutations were in agreement with this significant within-family association result (P=0.016), but Thr394Thr SNP only accounted for0.7% of the variation in femoral neck peak BMD. However, no significant within-family association was detected between each SNP in the adiponect in gene and peak BMD. Although no significant association was found between BMD and SNP in the PPARGC1 and adiponectin genes in both men and postmenopausal women, haplotype 2 (T-T) in the adiponect in gene was associated with lumbar spine BMD in postmenopausal women (P=0.019). Conclusion: Our findings sug-gest that Thr394Thr SNP in the PPARGC1 gene was associated with peak BMD in the femoral neck in Chinese women. Confirmation of our results is needed in other populations and with more functional markers within and flanking the PPARGC1 or adiponectin genes region.

  7. [Genetic diversity analysis of Andrographis paniculata in China based on SRAP and SNP].

    Science.gov (United States)

    Chen, Rong; Wang, Xiao-Yun; Song, Yu-Ning; Zhu, Yun-feng; Wang, Peng-liang; Li, Min; Zhong, Guo-Yue

    2014-12-01

    In order to reveal genetic diversity of domestic Andrographis paniculata and its impact on quality, genetic backgrounds of 103 samples from 7 provinces in China were analyzed using SRAP marker and SNP marker. Genetic structures of the A. paniculata populations were estimated with Powermarker V 3.25 and Mega 6.0 software, and polymorphic SNPs were identified with CodonCode Aligner software. The results showed that the genetic distances of domestic A. paniculata germplasm ranged from 0. 01 to 0.09, and no polymorphic SNPs were discovered in coding sequence fragments of ent-copalyl diphosphate synthase. A. paniculata germplasm from various regions in China had poor genetic diversity. This phenomenon was closely related to strict self-fertilization and earlier introduction from the same origin. Therefore, genetic background had little impact on variable qualities of A. paniculata in domestic market. Mutation breeding, polyploid breeding and molecular breeding were proposed as promising strategies in germplasm innovation.

  8. A SNP and SSR Based Genetic Map of Asparagus Bean (Vigna. unguiculata ssp. sesquipedialis) and Comparison with the Broader Species

    Science.gov (United States)

    Xu, Pei; Wu, Xiaohua; Wang, Baogen; Liu, Yonghua; Ehlers, Jeffery D.; Close, Timothy J.; Roberts, Philip A.; Diop, Ndeye-Ndack; Qin, Dehui; Hu, Tingting; Lu, Zhongfu; Li, Guojing

    2011-01-01

    Asparagus bean (Vigna. unguiculata ssp. sesquipedialis) is a distinctive subspecies of cowpea [Vigna. unguiculata (L.) Walp.] that apparently originated in East Asia and is characterized by extremely long and thin pods and an aggressive climbing growth habit. The crop is widely cultivated throughout Asia for the production of immature pods known as ‘long beans’ or ‘asparagus beans’. While the genome of cowpea ssp. unguiculata has been characterized recently by high-density genetic mapping and partial sequencing, little is known about the genome of asparagus bean. We report here the first genetic map of asparagus bean based on SNP and SSR markers. The current map consists of 375 loci mapped onto 11 linkage groups (LGs), with 191 loci detected by SNP markers and 184 loci by SSR markers. The overall map length is 745 cM, with an average marker distance of 1.98 cM. There are four high marker-density blocks distributed on three LGs and three regions of segregation distortion (SDRs) identified on two other LGs, two of which co-locate in chromosomal regions syntenic to SDRs in soybean. Synteny between asparagus bean and the model legume Lotus. japonica was also established. This work provides the basis for mapping and functional analysis of genes/QTLs of particular interest in asparagus bean, as well as for comparative genomics study of cowpea at the subspecies level. PMID:21253606

  9. A SNP and SSR based genetic map of asparagus bean (Vigna. unguiculata ssp. sesquipedialis and comparison with the broader species.

    Directory of Open Access Journals (Sweden)

    Pei Xu

    Full Text Available Asparagus bean (Vigna. unguiculata ssp. sesquipedialis is a distinctive subspecies of cowpea [Vigna. unguiculata (L. Walp.] that apparently originated in East Asia and is characterized by extremely long and thin pods and an aggressive climbing growth habit. The crop is widely cultivated throughout Asia for the production of immature pods known as 'long beans' or 'asparagus beans'. While the genome of cowpea ssp. unguiculata has been characterized recently by high-density genetic mapping and partial sequencing, little is known about the genome of asparagus bean. We report here the first genetic map of asparagus bean based on SNP and SSR markers. The current map consists of 375 loci mapped onto 11 linkage groups (LGs, with 191 loci detected by SNP markers and 184 loci by SSR markers. The overall map length is 745 cM, with an average marker distance of 1.98 cM. There are four high marker-density blocks distributed on three LGs and three regions of segregation distortion (SDRs identified on two other LGs, two of which co-locate in chromosomal regions syntenic to SDRs in soybean. Synteny between asparagus bean and the model legume Lotus. japonica was also established. This work provides the basis for mapping and functional analysis of genes/QTLs of particular interest in asparagus bean, as well as for comparative genomics study of cowpea at the subspecies level.

  10. Development and Evaluation of a Barley 50k iSelect SNP Array

    Directory of Open Access Journals (Sweden)

    Micha M. Bayer

    2017-10-01

    Full Text Available High-throughput genotyping arrays continue to be an attractive, cost-effective alternative to sequencing based approaches. We have developed a new 50k Illumina Infinium iSelect genotyping array for barley, a cereal crop species of major international importance. The majority of SNPs on the array have been extracted from variants called in exome capture data of a wide range of European barley germplasm. We used the recently published barley pseudomolecule assembly to map the exome capture data, which allowed us to generate markers with accurate physical positions and detailed gene annotation. Markers from an existing and widely used barley 9k Infinium iSelect array were carried over onto the 50k chip for backward compatibility. The array design featured 49,267 SNP markers that converted into 44,040 working assays, of which 43,461 were scorable in GenomeStudio. Of the working assays, 6,251 are from the 9k iSelect platform. We validated the SNPs by comparing the genotype calls from the new array to legacy datasets. Rates of agreement averaged 98.1 and 93.9% respectively for the legacy 9k iSelect SNP set (Comadran et al., 2012 and the exome capture SNPs. To test the utility of the 50k chip for genetic mapping, we genotyped a segregating population derived from a Golden Promise × Morex cross (Liu et al., 2014 and mapped over 14,000 SNPs to genetic positions which showed a near exact correspondence to their known physical positions. Manual adjustment of the cluster files used by the interpreting software for genotype scoring improved results substantially, but migration of cluster files between sites led to a deterioration of results, suggesting that local adjustment of cluster files is required on a site-per-site basis. Information relating to the markers on the chip is available online at https://ics.hutton.ac.uk/50k.

  11. New algorithm improves fine structure of the barley consensus SNP map

    Directory of Open Access Journals (Sweden)

    Endelman Jeffrey B

    2011-08-01

    Full Text Available Abstract Background The need to integrate information from multiple linkage maps is a long-standing problem in genetics. One way to visualize the complex ordinal relationships is with a directed graph, where each vertex in the graph is a bin of markers. When there are no ordering conflicts between the linkage maps, the result is a directed acyclic graph, or DAG, which can then be linearized to produce a consensus map. Results New algorithms for the simplification and linearization of consensus graphs have been implemented as a package for the R computing environment called DAGGER. The simplified consensus graphs produced by DAGGER exactly capture the ordinal relationships present in a series of linkage maps. Using either linear or quadratic programming, DAGGER generates a consensus map with minimum error relative to the linkage maps while remaining ordinally consistent with them. Both linearization methods produce consensus maps that are compressed relative to the mean of the linkage maps. After rescaling, however, the consensus maps had higher accuracy (and higher marker density than the individual linkage maps in genetic simulations. When applied to four barley linkage maps genotyped at nearly 3000 SNP markers, DAGGER produced a consensus map with improved fine structure compared to the existing barley consensus SNP map. The root-mean-squared error between the linkage maps and the DAGGER map was 0.82 cM per marker interval compared to 2.28 cM for the existing consensus map. Examination of the barley hardness locus at the 5HS telomere, for which there is a physical map, confirmed that the DAGGER output was more accurate for fine structure analysis. Conclusions The R package DAGGER is an effective, freely available resource for integrating the information from a set of consistent linkage maps.

  12. A low-density SNP array for analyzing differential selection in freshwater and marine populations of threespine stickleback (Gasterosteus aculeatus).

    Science.gov (United States)

    Ferchaud, Anne-Laure; Pedersen, Susanne H; Bekkevold, Dorte; Jian, Jianbo; Niu, Yongchao; Hansen, Michael M

    2014-10-06

    The threespine stickleback (Gasterosteus aculeatus) has become an important model species for studying both contemporary and parallel evolution. In particular, differential adaptation to freshwater and marine environments has led to high differentiation between freshwater and marine stickleback populations at the phenotypic trait of lateral plate morphology and the underlying candidate gene Ectodysplacin (EDA). Many studies have focused on this trait and candidate gene, although other genes involved in marine-freshwater adaptation may be equally important. In order to develop a resource for rapid and cost efficient analysis of genetic divergence between freshwater and marine sticklebacks, we generated a low-density SNP (Single Nucleotide Polymorphism) array encompassing markers of chromosome regions under putative directional selection, along with neutral markers for background. RAD (Restriction site Associated DNA) sequencing of sixty individuals representing two freshwater and one marine population led to the identification of 33,993 SNP markers. Ninety-six of these were chosen for the low-density SNP array, among which 70 represented SNPs under putatively directional selection in freshwater vs. marine environments, whereas 26 SNPs were assumed to be neutral. Annotation of these regions revealed several genes that are candidates for affecting stickleback phenotypic variation, some of which have been observed in previous studies whereas others are new. We have developed a cost-efficient low-density SNP array that allows for rapid screening of polymorphisms in threespine stickleback. The array provides a valuable tool for analyzing adaptive divergence between freshwater and marine stickleback populations beyond the well-established candidate gene Ectodysplacin (EDA).

  13. Conclusive evidence for hexasomic inheritance in chrysanthemum based on analysis of a 183 k SNP array.

    Science.gov (United States)

    van Geest, Geert; Voorrips, Roeland E; Esselink, Danny; Post, Aike; Visser, Richard Gf; Arens, Paul

    2017-08-07

    Cultivated chrysanthemum is an outcrossing hexaploid (2n = 6× = 54) with a disputed mode of inheritance. In this paper, we present a single nucleotide polymorphism (SNP) selection pipeline that was used to design an Affymetrix Axiom array with 183 k SNPs from RNA sequencing data (1). With this array, we genotyped four bi-parental populations (with sizes of 405, 53, 76 and 37 offspring plants respectively), and a cultivar panel of 63 genotypes. Further, we present a method for dosage scoring in hexaploids from signal intensities of the array based on mixture models (2) and validation of selection steps in the SNP selection pipeline (3). The resulting genotypic data is used to draw conclusions on the mode of inheritance in chrysanthemum (4), and to make an inference on allelic expression bias (5). With use of the mixture model approach, we successfully called the dosage of 73,936 out of 183,130 SNPs (40.4%) that segregated in any of the bi-parental populations. To investigate the mode of inheritance, we analysed markers that segregated in the large bi-parental population (n = 405). Analysis of segregation of duplex x nulliplex SNPs resulted in evidence for genome-wide hexasomic inheritance. This evidence was substantiated by the absence of strong linkage between markers in repulsion, which indicated absence of full disomic inheritance. We present the success rate of SNP discovery out of RNA sequencing data as affected by different selection steps, among which SNP coverage over genotypes and use of different types of sequence read mapping software. Genomic dosage highly correlated with relative allele coverage from the RNA sequencing data, indicating that most alleles are expressed according to their genomic dosage. The large population, genotyped with a very large number of markers, is a unique framework for extensive genetic analyses in hexaploid chrysanthemum. As starting point, we show conclusive evidence for genome-wide hexasomic inheritance.

  14. SNP discovery and High Resolution Melting Analysis from massive transcriptome sequencing in the California red abalone Haliotis rufescens.

    Science.gov (United States)

    Valenzuela-Muñoz, Valentina; Araya-Garay, José Miguel; Gallardo-Escárate, Cristian

    2013-06-01

    The California red abalone, Haliotis rufescens that belongs to the Haliotidae family, is the largest species of abalone in the world that has sustained the major fishery and aquaculture production in the USA and Mexico. This native mollusk has not been evaluated or assigned a conservation category even though in the last few decades it was heavily exploited until it disappeared in some areas along the California coast. In Chile, the red abalone was introduced in the 1970s from California wild abalone stocks for the purposes of aquaculture. Considering the number of years that the red abalone has been cultivated in Chile crucial genetic information is scarce and critical issues remain unresolved. This study reports and validates novel single nucleotide polymorphisms (SNP) markers for the red abalone H. rufescens using cDNA pyrosequencing. A total of 622 high quality SNPs were identified in 146 sequences with an estimated frequency of 1 SNP each 1000bp. Forty-five SNPs markers with functional information for gene ontology were selected. Of these, 8 were polymorphic among the individuals screened: Heat shock protein 70 (HSP70), vitellogenin (VTG), lysin, alginate lyase enzyme (AL), Glucose-regulated protein 94 (GRP94), fructose-bisphosphate aldolase (FBA), sulfatase 1A precursor (S1AP) and ornithine decarboxylase antizyme (ODC). Two additional sequences were also identified with polymorphisms but no similarities with known proteins were achieved. To validate the putative SNP markers, High Resolution Melting Analysis (HRMA) was conducted in a wild and hatchery-bred population. Additionally, SNP cross-amplifications were tested in two further native abalone species, Haliotis fulgens and Haliotis corrugata. This study provides novel candidate genes that could be used to evaluate loss of genetic diversity due to hatchery selection or inbreeding effects. Copyright © 2013 Elsevier B.V. All rights reserved.

  15. Identification of novel single nucleotide polymorphisms (SNPs in deer (Odocoileus spp. using the BovineSNP50 BeadChip.

    Directory of Open Access Journals (Sweden)

    Gwilym D Haynes

    Full Text Available Single nucleotide polymorphisms (SNPs are growing in popularity as a genetic marker for investigating evolutionary processes. A panel of SNPs is often developed by comparing large quantities of DNA sequence data across multiple individuals to identify polymorphic sites. For non-model species, this is particularly difficult, as performing the necessary large-scale genomic sequencing often exceeds the resources available for the project. In this study, we trial the Bovine SNP50 BeadChip developed in cattle (Bos taurus for identifying polymorphic SNPs in cervids Odocoileus hemionus (mule deer and black-tailed deer and O. virginianus (white-tailed deer in the Pacific Northwest. We found that 38.7% of loci could be genotyped, of which 5% (n = 1068 were polymorphic. Of these 1068 polymorphic SNPs, a mixture of putatively neutral loci (n = 878 and loci under selection (n = 190 were identified with the F(ST-outlier method. A range of population genetic analyses were implemented using these SNPs and a panel of 10 microsatellite loci. The three types of deer could readily be distinguished with both the SNP and microsatellite datasets. This study demonstrates that commercially developed SNP chips are a viable means of SNP discovery for non-model organisms, even when used between very distantly related species (the Bovidae and Cervidae families diverged some 25.1-30.1 million years before present.

  16. Marker-assisted selection in poultry

    International Nuclear Information System (INIS)

    Koning, D.-J. de; Hocking, P.M.

    2007-01-01

    Among livestock species, chicken has the most extensive genomics toolbox available for detection of quantitative trait loci (QTL) and marker-assisted selection (MAS). The uptake of MAS is therefore not limited by technical resources but mostly by the priorities and financial constraints of the few remaining poultry breeding companies. With the cost of genotyping decreasing rapidly, an increase in the use of direct trait- single nucleotide polymorphism (SNP)-associations in MAS can be predicted. (author)

  17. An evaluation of the genetic-matched pair study design using genome-wide SNP data from the European population

    DEFF Research Database (Denmark)

    Lu, Timothy Tehua; Lao, Oscar; Nothnagel, Michael

    2009-01-01

    of cases (76.0%), the BOM of a given individual, based on the complete marker set, came from a different recruitment site than the individual itself. A second marker set, specifically selected for ancestry sensitivity using singular value decomposition, performed even more poorly and was no more capable......Genetic matching potentially provides a means to alleviate the effects of incomplete Mendelian randomization in population-based gene-disease association studies. We therefore evaluated the genetic-matched pair study design on the basis of genome-wide SNP data (309,790 markers; Affymetrix Gene......Chip Human Mapping 500K Array) from 2457 individuals, sampled at 23 different recruitment sites across Europe. Using pair-wise identity-by-state (IBS) as a matching criterion, we tried to derive a subset of markers that would allow identification of the best overall matching (BOM) partner for a given...

  18. A procedure for the detection of linkage with high density SNP arrays in a large pedigree with colorectal cancer

    International Nuclear Information System (INIS)

    Middeldorp, Anneke; Wijnen, Juul T; Wezel, Tom van; Jagmohan-Changur, Shantie; Helmer, Quinta; Klift, Heleen M van der; Tops, Carli MJ; Vasen, Hans FA; Devilee, Peter; Morreau, Hans; Houwing-Duistermaat, Jeanine J

    2007-01-01

    The apparent dominant model of colorectal cancer (CRC) inheritance in several large families, without mutations in known CRC susceptibility genes, suggests the presence of so far unidentified genes with strong or moderate effect on the development of CRC. Linkage analysis could lead to identification of susceptibility genes in such families. In comparison to classical linkage analysis with multi-allelic markers, single nucleotide polymorphism (SNP) arrays have increased information content and can be processed with higher throughput. Therefore, SNP arrays can be excellent tools for linkage analysis. However, the vast number of SNPs on the SNP arrays, combined with large informative pedigrees (e.g. >35–40 bits), presents us with a computational complexity that is challenging for existing statistical packages or even exceeds their capacity. We therefore setup a procedure for linkage analysis in large pedigrees and validated the method by genotyping using SNP arrays of a colorectal cancer family with a known MLH1 germ line mutation. Quality control of the genotype data was performed in Alohomora, Mega2 and SimWalk2, with removal of uninformative SNPs, Mendelian inconsistencies and Mendelian consistent errors, respectively. Linkage disequilibrium was measured by SNPLINK and Merlin. Parametric linkage analysis using two flanking markers was performed using MENDEL. For multipoint parametric linkage analysis and haplotype analysis, SimWalk2 was used. On chromosome 3, in the MLH1-region, a LOD score of 1.9 was found by parametric linkage analysis using two flanking markers. On chromosome 11 a small region with LOD 1.1 was also detected. Upon linkage disequilibrium removal, multipoint linkage analysis yielded a LOD score of 2.1 in the MLH1 region, whereas the LOD score dropped to negative values in the region on chromosome 11. Subsequent haplotype analysis in the MLH1 region perfectly matched the mutation status of the family members. We developed a workflow for linkage

  19. Searching for an Accurate Marker-Based Prediction of an Individual Quantitative Trait in Molecular Plant Breeding

    Science.gov (United States)

    Fu, Yong-Bi; Yang, Mo-Hua; Zeng, Fangqin; Biligetu, Bill

    2017-01-01

    Molecular plant breeding with the aid of molecular markers has played an important role in modern plant breeding over the last two decades. Many marker-based predictions for quantitative traits have been made to enhance parental selection, but the trait prediction accuracy remains generally low, even with the aid of dense, genome-wide SNP markers. To search for more accurate trait-specific prediction with informative SNP markers, we conducted a literature review on the prediction issues in molecular plant breeding and on the applicability of an RNA-Seq technique for developing function-associated specific trait (FAST) SNP markers. To understand whether and how FAST SNP markers could enhance trait prediction, we also performed a theoretical reasoning on the effectiveness of these markers in a trait-specific prediction, and verified the reasoning through computer simulation. To the end, the search yielded an alternative to regular genomic selection with FAST SNP markers that could be explored to achieve more accurate trait-specific prediction. Continuous search for better alternatives is encouraged to enhance marker-based predictions for an individual quantitative trait in molecular plant breeding. PMID:28729875

  20. Searching for an Accurate Marker-Based Prediction of an Individual Quantitative Trait in Molecular Plant Breeding

    Directory of Open Access Journals (Sweden)

    Yong-Bi Fu

    2017-07-01

    Full Text Available Molecular plant breeding with the aid of molecular markers has played an important role in modern plant breeding over the last two decades. Many marker-based predictions for quantitative traits have been made to enhance parental selection, but the trait prediction accuracy remains generally low, even with the aid of dense, genome-wide SNP markers. To search for more accurate trait-specific prediction with informative SNP markers, we conducted a literature review on the prediction issues in molecular plant breeding and on the applicability of an RNA-Seq technique for developing function-associated specific trait (FAST SNP markers. To understand whether and how FAST SNP markers could enhance trait prediction, we also performed a theoretical reasoning on the effectiveness of these markers in a trait-specific prediction, and verified the reasoning through computer simulation. To the end, the search yielded an alternative to regular genomic selection with FAST SNP markers that could be explored to achieve more accurate trait-specific prediction. Continuous search for better alternatives is encouraged to enhance marker-based predictions for an individual quantitative trait in molecular plant breeding.

  1. Searching for an Accurate Marker-Based Prediction of an Individual Quantitative Trait in Molecular Plant Breeding.

    Science.gov (United States)

    Fu, Yong-Bi; Yang, Mo-Hua; Zeng, Fangqin; Biligetu, Bill

    2017-01-01

    Molecular plant breeding with the aid of molecular markers has played an important role in modern plant breeding over the last two decades. Many marker-based predictions for quantitative traits have been made to enhance parental selection, but the trait prediction accuracy remains generally low, even with the aid of dense, genome-wide SNP markers. To search for more accurate trait-specific prediction with informative SNP markers, we conducted a literature review on the prediction issues in molecular plant breeding and on the applicability of an RNA-Seq technique for developing function-associated specific trait (FAST) SNP markers. To understand whether and how FAST SNP markers could enhance trait prediction, we also performed a theoretical reasoning on the effectiveness of these markers in a trait-specific prediction, and verified the reasoning through computer simulation. To the end, the search yielded an alternative to regular genomic selection with FAST SNP markers that could be explored to achieve more accurate trait-specific prediction. Continuous search for better alternatives is encouraged to enhance marker-based predictions for an individual quantitative trait in molecular plant breeding.

  2. (SNP) assay for population stratification test between eastern Asians

    African Journals Online (AJOL)

    Yomi

    2012-01-03

    Jan 3, 2012 ... program STRUCTURE 2.0, which uses a Markov chain Monte. Carlo (MCMC) algorithm to cluster individuals into different cryptic ... HapMap project. .... Evaluation of the 124-plex SNP typing microarray for forensic testing.

  3. A single nucleotide polymorphism (SNP) assay for population ...

    African Journals Online (AJOL)

    A single nucleotide polymorphism (SNP) assay for population stratification test ... phenotypes and unlinked candidate loci in case-control and cohort studies of ... Key words: Chinese, Japanese, population stratification, ancestry informative ...

  4. Comparative Analysis of Disease-Linked Single Nucleotide Polymorphic Markers from Brassica rapa for Their Applicability to Brassica oleracea

    Science.gov (United States)

    Cho, Young-Il; Ahn, Yul-Kyun; Tripathi, Swati; Kim, Jeong-Ho; Lee, Hye-Eun; Kim, Do-Sun

    2015-01-01

    Numerous studies using single nucleotide polymorphisms (SNPs) have been conducted in humans, and other animals, and in major crops, including rice, soybean, and Chinese cabbage. However, the number of SNP studies in cabbage is limited. In this present study, we evaluated whether 7,645 SNPs previously identified as molecular markers linked to disease resistance in the Brassica rapa genome could be applied to B. oleracea. In a BLAST analysis using the SNP sequences of B. rapa and B. oleracea genomic sequence data registered in the NCBI database, 256 genes for which SNPs had been identified in B. rapa were found in B. oleracea. These genes were classified into three functional groups: molecular function (64 genes), biological process (96 genes), and cellular component (96 genes). A total of 693 SNP markers, including 145 SNP markers [BRH—developed from the B. rapa genome for high-resolution melt (HRM) analysis], 425 SNP markers (BRP—based on the B. rapa genome that could be applied to B. oleracea), and 123 new SNP markers (BRS—derived from BRP and designed for HRM analysis), were investigated for their ability to amplify sequences from cabbage genomic DNA. In total, 425 of the SNP markers (BRP-based on B. rapa genome), selected from 7,645 SNPs, were successfully applied to B. oleracea. Using PCR, 108 of 145 BRH (74.5%), 415 of 425 BRP (97.6%), and 118 of 123 BRS (95.9%) showed amplification, suggesting that it is possible to apply SNP markers developed based on the B. rapa genome to B. oleracea. These results provide valuable information that can be utilized in cabbage genetics and breeding programs using molecular markers derived from other Brassica species. PMID:25790283

  5. Prediction of the optimum hybridization conditions of dot-blot-SNP analysis using estimated melting temperature of oligonucleotide probes.

    Science.gov (United States)

    Shiokai, Sachiko; Kitashiba, Hiroyasu; Nishio, Takeshi

    2010-08-01

    Although the dot-blot-SNP technique is a simple cost-saving technique suitable for genotyping of many plant individuals, optimization of hybridization and washing conditions for each SNP marker requires much time and labor. For prediction of the optimum hybridization conditions for each probe, we compared T (m) values estimated from nucleotide sequences using the DINAMelt web server, measured T (m) values, and hybridization conditions yielding allele-specific signals. The estimated T (m) values were comparable to the measured T (m) values with small differences of less than 3 degrees C for most of the probes. There were differences of approximately 14 degrees C between the specific signal detection conditions and estimated T (m) values. Change of one level of SSC concentrations of 0.1, 0.2, 0.5, and 1.0x SSC corresponded to a difference of approximately 5 degrees C in optimum signal detection temperature. Increasing the sensitivity of signal detection by shortening the exposure time to X-ray film changed the optimum hybridization condition for specific signal detection. Addition of competitive oligonucleotides to the hybridization mixture increased the suitable hybridization conditions by 1.8. Based on these results, optimum hybridization conditions for newly produced dot-blot-SNP markers will become predictable.

  6. Using AFLP markers and the Geneland program for the inference of population genetic structure

    DEFF Research Database (Denmark)

    Guillot, Gilles; Santos, Filipe

    2010-01-01

    the computer program Geneland designed to infer population structure has been adapted to deal with dominant markers; and (ii) we use Geneland for numerical comparison of dominant and codominant markers to perform clustering. AFLP markers lead to less accurate results than bi-allelic codominant markers...... such as single nucleotide polymorphisms (SNP) markers but this difference becomes negligible for data sets of common size (number of individuals n≥100, number of markers L≥200). The latest Geneland version (3.2.1) handling dominant markers is freely available as an R package with a fully clickable graphical...

  7. Fine-mapping additive and dominant SNP effects using group-LASSO and Fractional Resample Model Averaging

    Science.gov (United States)

    Sabourin, Jeremy; Nobel, Andrew B.; Valdar, William

    2014-01-01

    Genomewide association studies sometimes identify loci at which both the number and identities of the underlying causal variants are ambiguous. In such cases, statistical methods that model effects of multiple SNPs simultaneously can help disentangle the observed patterns of association and provide information about how those SNPs could be prioritized for follow-up studies. Current multi-SNP methods, however, tend to assume that SNP effects are well captured by additive genetics; yet when genetic dominance is present, this assumption translates to reduced power and faulty prioritizations. We describe a statistical procedure for prioritizing SNPs at GWAS loci that efficiently models both additive and dominance effects. Our method, LLARRMA-dawg, combines a group LASSO procedure for sparse modeling of multiple SNP effects with a resampling procedure based on fractional observation weights; it estimates for each SNP the robustness of association with the phenotype both to sampling variation and to competing explanations from other SNPs. In producing a SNP prioritization that best identifies underlying true signals, we show that: our method easily outperforms a single marker analysis; when additive-only signals are present, our joint model for additive and dominance is equivalent to or only slightly less powerful than modeling additive-only effects; and, when dominance signals are present, even in combination with substantial additive effects, our joint model is unequivocally more powerful than a model assuming additivity. We also describe how performance can be improved through calibrated randomized penalization, and discuss how dominance in ungenotyped SNPs can be incorporated through either heterozygote dosage or multiple imputation. PMID:25417853

  8. Use of direct and iterative solvers for estimation of SNP effects in genome-wide selection

    Directory of Open Access Journals (Sweden)

    Eduardo da Cruz Gouveia Pimentel

    2010-01-01

    Full Text Available The aim of this study was to compare iterative and direct solvers for estimation of marker effects in genomic selection. One iterative and two direct methods were used: Gauss-Seidel with Residual Update, Cholesky Decomposition and Gentleman-Givens rotations. For resembling different scenarios with respect to number of markers and of genotyped animals, a simulated data set divided into 25 subsets was used. Number of markers ranged from 1,200 to 5,925 and number of animals ranged from 1,200 to 5,865. Methods were also applied to real data comprising 3081 individuals genotyped for 45181 SNPs. Results from simulated data showed that the iterative solver was substantially faster than direct methods for larger numbers of markers. Use of a direct solver may allow for computing (covariances of SNP effects. When applied to real data, performance of the iterative method varied substantially, depending on the level of ill-conditioning of the coefficient matrix. From results with real data, Gentleman-Givens rotations would be the method of choice in this particular application as it provided an exact solution within a fairly reasonable time frame (less than two hours. It would indeed be the preferred method whenever computer resources allow its use.

  9. A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation.

    Science.gov (United States)

    Howe, Glenn T; Yu, Jianbin; Knaus, Brian; Cronn, Richard; Kolpak, Scott; Dolan, Peter; Lorenz, W Walter; Dean, Jeffrey F D

    2013-02-28

    Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array-more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to

  10. Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association

    Directory of Open Access Journals (Sweden)

    Kelsey E. Grinde

    2017-09-01

    Full Text Available To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of causal variant(s in those genes and estimation of their effect is crucial for planning replication studies and characterizing the genetic architecture of the locus. However, we illustrate that straightforward single-marker association statistics can suffer from substantial bias introduced by conditioning on gene-based test significance, due to the phenomenon often referred to as “winner's curse.” We illustrate the ramifications of this bias on variant effect size estimation and variant prioritization/ranking approaches, outline parameters of genetic architecture that affect this bias, and propose a bootstrap resampling method to correct for this bias. We find that our correction method significantly reduces the bias due to winner's curse (average two-fold decrease in bias, p < 2.2 × 10−6 and, consequently, substantially improves mean squared error and variant prioritization/ranking. The method is particularly helpful in adjustment for winner's curse effects when the initial gene-based test has low power and for relatively more common, non-causal variants. Adjustment for winner's curse is recommended for all post-hoc estimation and ranking of variants after a gene-based test. Further work is necessary to continue seeking ways to reduce bias and improve inference in post-hoc analysis of gene-based tests under a wide variety of genetic architectures.

  11. Partitioned learning of deep Boltzmann machines for SNP data.

    Science.gov (United States)

    Hess, Moritz; Lenz, Stefan; Blätte, Tamara J; Bullinger, Lars; Binder, Harald

    2017-10-15

    Learning the joint distributions of measurements, and in particular identification of an appropriate low-dimensional manifold, has been found to be a powerful ingredient of deep leaning approaches. Yet, such approaches have hardly been applied to single nucleotide polymorphism (SNP) data, probably due to the high number of features typically exceeding the number of studied individuals. After a brief overview of how deep Boltzmann machines (DBMs), a deep learning approach, can be adapted to SNP data in principle, we specifically present a way to alleviate the dimensionality problem by partitioned learning. We propose a sparse regression approach to coarsely screen the joint distribution of SNPs, followed by training several DBMs on SNP partitions that were identified by the screening. Aggregate features representing SNP patterns and the corresponding SNPs are extracted from the DBMs by a combination of statistical tests and sparse regression. In simulated case-control data, we show how this can uncover complex SNP patterns and augment results from univariate approaches, while maintaining type 1 error control. Time-to-event endpoints are considered in an application with acute myeloid leukemia patients, where SNP patterns are modeled after a pre-screening based on gene expression data. The proposed approach identified three SNPs that seem to jointly influence survival in a validation dataset. This indicates the added value of jointly investigating SNPs compared to standard univariate analyses and makes partitioned learning of DBMs an interesting complementary approach when analyzing SNP data. A Julia package is provided at 'http://github.com/binderh/BoltzmannMachines.jl'. binderh@imbi.uni-freiburg.de. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  12. MDM2 SNP309, gene-gene interaction, and tumor susceptibility: an updated meta-analysis

    Directory of Open Access Journals (Sweden)

    Wu Wei

    2011-05-01

    in the stratified analysis by p53 mutation status (GG vs TT: OR = 1.17, 95% CI = 0.75-1.82 and TG vs TT: OR = 1.09, 95% CI = 0.89-1.34 for positive p53 mutation status; GG vs TT: OR = 0.95, 95% CI = 0.72-1.25 and TG vs TT: OR = 1.06, 95% CI = 0.85-1.30 for negative p53 mutation status. Conclusions The analyses indicate that MDM2 SNP309 serves as a tumor susceptibility marker, and that there is an association between MDM2 SNP309 and p53 Arg72Pro regarding tumor susceptibility. Further studies that take into consideration environmental stresses and functional genetic variants in the p53-MDM2-related genes are warranted.

  13. Heterogeneous computing architecture for fast detection of SNP-SNP interactions.

    Science.gov (United States)

    Sluga, Davor; Curk, Tomaz; Zupan, Blaz; Lotric, Uros

    2014-06-25

    The extent of data in a typical genome-wide association study (GWAS) poses considerable computational challenges to software tools for gene-gene interaction discovery. Exhaustive evaluation of all interactions among hundreds of thousands to millions of single nucleotide polymorphisms (SNPs) may require weeks or even months of computation. Massively parallel hardware within a modern Graphic Processing Unit (GPU) and Many Integrated Core (MIC) coprocessors can shorten the run time considerably. While the utility of GPU-based implementations in bioinformatics has been well studied, MIC architecture has been introduced only recently and may provide a number of comparative advantages that have yet to be explored and tested. We have developed a heterogeneous, GPU and Intel MIC-accelerated software module for SNP-SNP interaction discovery to replace the previously single-threaded computational core in the interactive web-based data exploration program SNPsyn. We report on differences between these two modern massively parallel architectures and their software environments. Their utility resulted in an order of magnitude shorter execution times when compared to the single-threaded CPU implementation. GPU implementation on a single Nvidia Tesla K20 runs twice as fast as that for the MIC architecture-based Xeon Phi P5110 coprocessor, but also requires considerably more programming effort. General purpose GPUs are a mature platform with large amounts of computing power capable of tackling inherently parallel problems, but can prove demanding for the programmer. On the other hand the new MIC architecture, albeit lacking in performance reduces the programming effort and makes it up with a more general architecture suitable for a wider range of problems.

  14. Design of a High Density SNP Genotyping Assay in the Pig Using SNPs Identified and Characterized by Next Generation Sequencing Technology

    Science.gov (United States)

    Ramos, Antonio M.; Crooijmans, Richard P. M. A.; Affara, Nabeel A.; Amaral, Andreia J.; Archibald, Alan L.; Beever, Jonathan E.; Bendixen, Christian; Churcher, Carol; Clark, Richard; Dehais, Patrick; Hansen, Mark S.; Hedegaard, Jakob; Hu, Zhi-Liang; Kerstens, Hindrik H.; Law, Andy S.; Megens, Hendrik-Jan; Milan, Denis; Nonneman, Danny J.; Rohrer, Gary A.; Rothschild, Max F.; Smith, Tim P. L.; Schnabel, Robert D.; Van Tassell, Curt P.; Taylor, Jeremy F.; Wiedmann, Ralph T.; Schook, Lawrence B.; Groenen, Martien A. M.

    2009-01-01

    Background The dissection of complex traits of economic importance to the pig industry requires the availability of a significant number of genetic markers, such as single nucleotide polymorphisms (SNPs). This study was conducted to discover several hundreds of thousands of porcine SNPs using next generation sequencing technologies and use these SNPs, as well as others from different public sources, to design a high-density SNP genotyping assay. Methodology/Principal Findings A total of 19 reduced representation libraries derived from four swine breeds (Duroc, Landrace, Large White, Pietrain) and a Wild Boar population and three restriction enzymes (AluI, HaeIII and MspI) were sequenced using Illumina's Genome Analyzer (GA). The SNP discovery effort resulted in the de novo identification of over 372K SNPs. More than 549K SNPs were used to design the Illumina Porcine 60K+SNP iSelect Beadchip, now commercially available as the PorcineSNP60. A total of 64,232 SNPs were included on the Beadchip. Results from genotyping the 158 individuals used for sequencing showed a high overall SNP call rate (97.5%). Of the 62,621 loci that could be reliably scored, 58,994 were polymorphic yielding a SNP conversion success rate of 94%. The average minor allele frequency (MAF) for all scorable SNPs was 0.274. Conclusions/Significance Overall, the results of this study indicate the utility of using next generation sequencing technologies to identify large numbers of reliable SNPs. In addition, the validation of the PorcineSNP60 Beadchip demonstrated that the assay is an excellent tool that will likely be used in a variety of future studies in pigs. PMID:19654876

  15. Novel Quantitative Real-Time LCR for the Sensitive Detection of SNP Frequencies in Pooled DNA: Method Development, Evaluation and Application

    Science.gov (United States)

    Psifidi, Androniki; Dovas, Chrysostomos; Banos, Georgios

    2011-01-01

    Background Single nucleotide polymorphisms (SNP) have proven to be powerful genetic markers for genetic applications in medicine, life science and agriculture. A variety of methods exist for SNP detection but few can quantify SNP frequencies when the mutated DNA molecules correspond to a small fraction of the wild-type DNA. Furthermore, there is no generally accepted gold standard for SNP quantification, and, in general, currently applied methods give inconsistent results in selected cohorts. In the present study we sought to develop a novel method for accurate detection and quantification of SNP in DNA pooled samples. Methods The development and evaluation of a novel Ligase Chain Reaction (LCR) protocol that uses a DNA-specific fluorescent dye to allow quantitative real-time analysis is described. Different reaction components and thermocycling parameters affecting the efficiency and specificity of LCR were examined. Several protocols, including gap-LCR modifications, were evaluated using plasmid standard and genomic DNA pools. A protocol of choice was identified and applied for the quantification of a polymorphism at codon 136 of the ovine PRNP gene that is associated with susceptibility to a transmissible spongiform encephalopathy in sheep. Conclusions The real-time LCR protocol developed in the present study showed high sensitivity, accuracy, reproducibility and a wide dynamic range of SNP quantification in different DNA pools. The limits of detection and quantification of SNP frequencies were 0.085% and 0.35%, respectively. Significance The proposed real-time LCR protocol is applicable when sensitive detection and accurate quantification of low copy number mutations in DNA pools is needed. Examples include oncogenes and tumour suppressor genes, infectious diseases, pathogenic bacteria, fungal species, viral mutants, drug resistance resulting from point mutations, and genetically modified organisms in food. PMID:21283808

  16. Pacifiplex: an ancestry-informative SNP panel centred on Australia and the Pacific region.

    Science.gov (United States)

    Santos, Carla; Phillips, Christopher; Fondevila, Manuel; Daniel, Runa; van Oorschot, Roland A H; Burchard, Esteban G; Schanfield, Moses S; Souto, Luis; Uacyisrael, Jolame; Via, Marc; Carracedo, Ángel; Lareu, Maria V

    2016-01-01

    The analysis of human population variation is an area of considerable interest in the forensic, medical genetics and anthropological fields. Several forensic single nucleotide polymorphism (SNP) assays provide ancestry-informative genotypes in sensitive tests designed to work with limited DNA samples, including a 34-SNP multiplex differentiating African, European and East Asian ancestries. Although assays capable of differentiating Oceanian ancestry at a global scale have become available, this study describes markers compiled specifically for differentiation of Oceanian populations. A sensitive multiplex assay, termed Pacifiplex, was developed and optimized in a small-scale test applicable to forensic analyses. The Pacifiplex assay comprises 29 ancestry-informative marker SNPs (AIM-SNPs) selected to complement the 34-plex test, that in a combined set distinguish Africans, Europeans, East Asians and Oceanians. Nine Pacific region study populations were genotyped with both SNP assays, then compared to four reference population groups from the HGDP-CEPH human diversity panel. STRUCTURE analyses estimated population cluster membership proportions that aligned with the patterns of variation suggested for each study population's currently inferred demographic histories. Aboriginal Taiwanese and Philippine samples indicated high East Asian ancestry components, Papua New Guinean and Aboriginal Australians samples were predominantly Oceanian, while other populations displayed cluster patterns explained by the distribution of divergence amongst Melanesians, Polynesians and Micronesians. Genotype data from Pacifiplex and 34-plex tests is particularly well suited to analysis of Australian Aboriginal populations and when combined with Y and mitochondrial DNA variation will provide a powerful set of markers for ancestry inference applied to modern Australian demographic profiles. On a broader geographic scale, Pacifiplex adds highly informative data for inferring the ancestry

  17. RASSF1A and the rs2073498 Cancer Associated SNP

    International Nuclear Information System (INIS)

    Donninger, Howard; Barnoud, Thibaut; Nelson, Nick; Kassler, Suzanna; Clark, Jennifer; Cummins, Timothy D.; Powell, David W.; Nyante, Sarah; Millikan, Robert C.; Clark, Geoffrey J.

    2011-01-01

    RASSF1A is one of the most frequently inactivated tumor suppressors yet identified in human cancer. It is pro-apoptotic and appears to function as a scaffolding protein that interacts with a variety of other tumor suppressors to modulate their function. It can also complex with the Ras oncoprotein and may serve to integrate pro-growth and pro-death signaling pathways. A SNP has been identified that is present in approximately 29% of European populations [rs2073498, A(133)S]. Several studies have now presented evidence that this SNP is associated with an enhanced risk of developing breast cancer. We have used a proteomics based approach to identify multiple differences in the pattern of protein/protein interactions mediated by the wild type compared to the SNP variant protein. We have also identified a significant difference in biological activity between wild type and SNP variant protein. However, we have found only a very modest association of the SNP with breast cancer predisposition.

  18. Dog Y chromosomal DNA sequence: identification, sequencing and SNP discovery

    Directory of Open Access Journals (Sweden)

    Kirkness Ewen

    2006-10-01

    Full Text Available Abstract Background Population genetic studies of dogs have so far mainly been based on analysis of mitochondrial DNA, describing only the history of female dogs. To get a picture of the male history, as well as a second independent marker, there is a need for studies of biallelic Y-chromosome polymorphisms. However, there are no biallelic polymorphisms reported, and only 3200 bp of non-repetitive dog Y-chromosome sequence deposited in GenBank, necessitating the identification of dog Y chromosome sequence and the search for polymorphisms therein. The genome has been only partially sequenced for one male dog, disallowing mapping of the sequence into specific chromosomes. However, by comparing the male genome sequence to the complete female dog genome sequence, candidate Y-chromosome sequence may be identified by exclusion. Results The male dog genome sequence was analysed by Blast search against the human genome to identify sequences with a best match to the human Y chromosome and to the female dog genome to identify those absent in the female genome. Candidate sequences were then tested for male specificity by PCR of five male and five female dogs. 32 sequences from the male genome, with a total length of 24 kbp, were identified as male specific, based on a match to the human Y chromosome, absence in the female dog genome and male specific PCR results. 14437 bp were then sequenced for 10 male dogs originating from Europe, Southwest Asia, Siberia, East Asia, Africa and America. Nine haplotypes were found, which were defined by 14 substitutions. The genetic distance between the haplotypes indicates that they originate from at least five wolf haplotypes. There was no obvious trend in the geographic distribution of the haplotypes. Conclusion We have identified 24159 bp of dog Y-chromosome sequence to be used for population genetic studies. We sequenced 14437 bp in a worldwide collection of dogs, identifying 14 SNPs for future SNP analyses, and

  19. A genome wide survey of SNP variation reveals the genetic structure of sheep breeds.

    Directory of Open Access Journals (Sweden)

    James W Kijas

    Full Text Available The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identifying the first genome-wide set of SNP for sheep, we report on levels of genetic variability both within and between a diverse sample of ovine populations. Then, using cluster analysis and the partitioning of genetic variation, we demonstrate sheep are characterised by weak phylogeographic structure, overlapping genetic similarity and generally low differentiation which is consistent with their short evolutionary history. The degree of population substructure was, however, sufficient to cluster individuals based on geographic origin and known breed history. Specifically, African and Asian populations clustered separately from breeds of European origin sampled from Australia, New Zealand, Europe and North America. Furthermore, we demonstrate the presence of stratification within some, but not all, ovine breeds. The results emphasize that careful documentation of genetic structure will be an essential prerequisite when mapping the genetic basis of complex traits. Furthermore, the identification of a subset of SNP able to assign individuals into broad groupings demonstrates even a small panel of markers may be suitable for applications such as traceability.

  20. Genomewide high-density SNP linkage analysis of non-BRCA1/2 breast cancer families identifies various candidate regions and has greater power than microsatellite studies

    Directory of Open Access Journals (Sweden)

    Gonzalez-Neira Anna

    2007-08-01

    Full Text Available Abstract Background The recent development of new high-throughput technologies for SNP genotyping has opened the possibility of taking a genome-wide linkage approach to the search for new candidate genes involved in heredity diseases. The two major breast cancer susceptibility genes BRCA1 and BRCA2 are involved in 30% of hereditary breast cancer cases, but the discovery of additional breast cancer predisposition genes for the non-BRCA1/2 breast cancer families has so far been unsuccessful. Results In order to evaluate the power improvement provided by using SNP markers in a real situation, we have performed a whole genome screen of 19 non-BRCA1/2 breast cancer families using 4720 genomewide SNPs with Illumina technology (Illumina's Linkage III Panel, with an average distance of 615 Kb/SNP. We identified six regions on chromosomes 2, 3, 4, 7, 11 and 14 as candidates to contain genes involved in breast cancer susceptibility, and additional fine mapping genotyping using microsatellite markers around linkage peaks confirmed five of them, excluding the region on chromosome 3. These results were consistent in analyses that excluded SNPs in high linkage disequilibrium. The results were compared with those obtained previously using a 10 cM microsatellite scan (STR-GWS and we found lower or not significant linkage signals with STR-GWS data compared to SNP data in all cases. Conclusion Our results show the power increase that SNPs can supply in linkage studies.

  1. Tumors markers

    International Nuclear Information System (INIS)

    Yamaguchi-Mizumoto, N.H.

    1989-01-01

    In order to study blood and cell components alterations (named tumor markers) that may indicate the presence of a tumor, several methods are presented. Aspects as diagnostic, prognostic, therapeutic value and clinical evaluation are discussed. (M.A.C.)

  2. Olive oil DNA fingerprinting by multiplex SNP genotyping on fluorescent microspheres.

    Science.gov (United States)

    Kalogianni, Despina P; Bazakos, Christos; Boutsika, Lemonia M; Targem, Mehdi Ben; Christopoulos, Theodore K; Kalaitzis, Panagiotis; Ioannou, Penelope C

    2015-04-01

    Olive oil cultivar verification is of primary importance for the competitiveness of the product and the protection of consumers and producers from fraudulence. Single-nucleotide polymorphisms (SNPs) have emerged as excellent DNA markers for authenticity testing. This paper reports the first multiplex SNP genotyping assay for olive oil cultivar identification that is performed on a suspension of fluorescence-encoded microspheres. Up to 100 sets of microspheres, with unique "fluorescence signatures", are available. Allele discrimination was accomplished by primer extension reaction. The reaction products were captured via hybridization on the microspheres and analyzed, within seconds, by a flow cytometer. The "fluorescence signature" of each microsphere is assigned to a specific allele, whereas the signal from a reporter fluorophore denotes the presence of the allele. As a model, a panel of three SNPs was chosen that enabled identification of five common Greek olive cultivars (Adramytini, Chondrolia Chalkidikis, Kalamon, Koroneiki, and Valanolia).

  3. Reference-free SNP discovery for the Eurasian beaver from restriction site-associated DNA paired-end data.

    Science.gov (United States)

    Senn, Helen; Ogden, Rob; Cezard, Timothee; Gharbi, Karim; Iqbal, Zamin; Johnson, Eric; Kamps-Hughes, Nick; Rosell, Frank; McEwing, Ross

    2013-06-01

    In this study, we used restriction site-associated DNA (RAD) sequencing to discover SNP markers suitable for population genetic and parentage analysis with the aim of using them for monitoring the reintroduction of the Eurasian beaver (Castor fibre) to Scotland. In the absence of a reference genome for beaver, we built contigs and discovered SNPs within them using paired-end RAD data, so as to have sufficient flanking region around the SNPs to conduct marker design. To do this, we used a simple pipeline which catalogued the Read 1 data in stacks and then used the assembler cortex_var to conduct de novo assembly and genotyping of multiple samples using the Read 2 data. The analysis of around 1.1 billion short reads of sequence data was reduced to a set of 2579 high-quality candidate SNP markers that were polymorphic in Norwegian and Bavarian beaver. Both laboratory validation of a subset of eight of the SNPs (1.3% error) and internal validation by confirming patterns of Mendelian inheritance in a family group (0.9% error) confirmed the success of this approach. © 2013 John Wiley & Sons Ltd.

  4. Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction

    DEFF Research Database (Denmark)

    Brøndum, Rasmus Froberg; Su, Guosheng; Janss, Luc

    2015-01-01

    This study investigated the effect on the reliability of genomic prediction when a small number of significant variants from single marker analysis based on whole genome sequence data were added to the regular 54k single nucleotide polymorphism (SNP) array data. The extra markers were selected...... with the aim of augmenting the custom low-density Illumina BovineLD SNP chip (San Diego, CA) used in the Nordic countries. The single-marker analysis was done breed-wise on all 16 index traits included in the breeding goals for Nordic Holstein, Danish Jersey, and Nordic Red cattle plus the total merit index...... itself. Depending on the trait’s economic weight, 15, 10, or 5 quantitative trait loci (QTL) were selected per trait per breed and 3 to 5 markers were selected to tag each QTL. After removing duplicate markers (same marker selected for more than one trait or breed) and filtering for high pairwise linkage...

  5. EST-derived SNP discovery and selective pressure analysis in Pacific white shrimp ( Litopenaeus vannamei)

    Science.gov (United States)

    Liu, Chengzhang; Wang, Xia; Xiang, Jianhai; Li, Fuhua

    2012-09-01

    Pacific white shrimp has become a major aquaculture and fishery species worldwide. Although a large scale EST resource has been publicly available since 2008, the data have not yet been widely used for SNP discovery or transcriptome-wide assessment of selective pressure. In this study, a set of 155 411 expressed sequence tags (ESTs) from the NCBI database were computationally analyzed and 17 225 single nucleotide polymorphisms (SNPs) were predicted, including 9 546 transitions, 5 124 transversions and 2 481 indels. Among the 7 298 SNP substitutions located in functionally annotated contigs, 58.4% (4 262) are non-synonymous SNPs capable of introducing amino acid mutations. Two hundred and fifty nonsynonymous SNPs in genes associated with economic traits have been identified as candidates for markers in selective breeding. Diversity estimates among the synonymous nucleotides were on average 3.49 times greater than those in non-synonymous, suggesting negative selection. Distribution of non-synonymous to synonymous substitutions (Ka/Ks) ratio ranges from 0 to 4.01, (average 0.42, median 0.26), suggesting that the majority of the affected genes are under purifying selection. Enrichment analysis identified multiple gene ontology categories under positive or negative selection. Categories involved in innate immune response and male gamete generation are rich in positively selected genes, which is similar to reports in Drosophila and primates. This work is the first transcriptome-wide assessment of selective pressure in a Penaeid shrimp species. The functionally annotated SNPs provide a valuable resource of potential molecular markers for selective breeding.

  6. Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association

    Science.gov (United States)

    Grinde, Kelsey E.; Arbet, Jaron; Green, Alden; O'Connell, Michael; Valcarcel, Alessandra; Westra, Jason; Tintle, Nathan

    2017-01-01

    To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of causal variant(s) in those genes and estimation of their effect is crucial for planning replication studies and characterizing the genetic architecture of the locus. However, we illustrate that straightforward single-marker association statistics can suffer from substantial bias introduced by conditioning on gene-based test significance, due to the phenomenon often referred to as “winner's curse.” We illustrate the ramifications of this bias on variant effect size estimation and variant prioritization/ranking approaches, outline parameters of genetic architecture that affect this bias, and propose a bootstrap resampling method to correct for this bias. We find that our correction method significantly reduces the bias due to winner's curse (average two-fold decrease in bias, p bias and improve inference in post-hoc analysis of gene-based tests under a wide variety of genetic architectures. PMID:28959274

  7. The iSelect 9 K SNP analysis revealed polyploidization induced revolutionary changes and intense human selection causing strong haplotype blocks in wheat.

    Science.gov (United States)

    Hao, Chenyang; Wang, Yuquan; Chao, Shiaoman; Li, Tian; Liu, Hongxia; Wang, Lanfen; Zhang, Xueyong

    2017-01-30

    A Chinese wheat mini core collection was genotyped using the wheat 9 K iSelect SNP array. Total 2420 and 2396 polymorphic SNPs were detected on the A and the B genome chromosomes, which formed 878 haplotype blocks. There were more blocks in the B genome, but the average block size was significantly (P polyploidization of wheat (both tetraploidization and hexaploidization) induced revolutionary changes in both the A and the B genomes, with a greater increase of gene diversity compared to their diploid ancestors. Modern breeding has dramatically increased diversity in the gene coding regions, though obvious blocks were formed on most of the chromosomes in both tetraploid and hexaploid wheats. Tag-SNP markers identified in this study can be used for marker assisted selection using haplotype blocks as a wheat breeding strategy. This strategy can also be employed to facilitate genome selection in other self-pollinating crop species.

  8. Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries

    Directory of Open Access Journals (Sweden)

    Kumar Santosh

    2012-12-01

    Full Text Available Abstract Background Flax (Linum usitatissimum L. is a significant fibre and oilseed crop. Current flax molecular markers, including isozymes, RAPDs, AFLPs and SSRs are of limited use in the construction of high density linkage maps and for association mapping applications due to factors such as low reproducibility, intense labour requirements and/or limited numbers. We report here on the use of a reduced representation library strategy combined with next generation Illumina sequencing for rapid and large scale discovery of SNPs in eight flax genotypes. SNP discovery was performed through in silico analysis of the sequencing data against the whole genome shotgun sequence assembly of flax genotype CDC Bethune. Genotyping-by-sequencing of an F6-derived recombinant inbred line population provided validation of the SNPs. Results Reduced representation libraries of eight flax genotypes were sequenced on the Illumina sequencing platform resulting in sequence coverage ranging from 4.33 to 15.64X (genome equivalents. Depending on the relatedness of the genotypes and the number and length of the reads, between 78% and 93% of the reads mapped onto the CDC Bethune whole genome shotgun sequence assembly. A total of 55,465 SNPs were discovered with the largest number of SNPs belonging to the genotypes with the highest mapping coverage percentage. Approximately 84% of the SNPs discovered were identified in a single genotype, 13% were shared between any two genotypes and the remaining 3% in three or more. Nearly a quarter of the SNPs were found in genic regions. A total of 4,706 out of 4,863 SNPs discovered in Macbeth were validated using genotyping-by-sequencing of 96 F6 individuals from a recombinant inbred line population derived from a cross between CDC Bethune and Macbeth, corresponding to a validation rate of 96.8%. Conclusions Next generation sequencing of reduced representation libraries was successfully implemented for genome-wide SNP discovery from

  9. Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality

    OpenAIRE

    Ali, Shahin S.; Shao, Jonathan; Strem, Mary D.; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W.; Bailey, Bryan A.

    2015-01-01

    Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers fro...

  10. Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples.

    Science.gov (United States)

    Henshall, John M; Dierens, Leanne; Sellars, Melony J

    2014-09-02

    While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the major livestock species. Although low-cost low-density assays may not have the accuracy of the high-density assays widely used in human and livestock species, we show that when combined with statistical analysis approaches that use quantitative instead of discrete genotypes, their utility may be improved. The data used in this study are from a 63-SNP marker Sequenom® iPLEX Platinum panel for the Black Tiger shrimp, for which high-density SNP assays are not currently available. For quantitative genotypes that could be estimated, in 5% of cases the most likely genotype for an individual at a SNP had a probability of less than 0.99. Matrix formulations of maximum likelihood equations for parentage assignment were developed for the quantitative genotypes and also for discrete genotypes perturbed by an assumed error term. Assignment rates that were based on maximum likelihood with quantitative genotypes were similar to those based on maximum likelihood with perturbed genotypes but, for more than 50% of cases, the two methods resulted in individuals being assigned to different families. Treating genotypes as quantitative values allows the same analysis framework to be used for pooled samples of DNA from multiple individuals. Resulting correlations between allele frequency estimates from pooled DNA and individual samples were consistently greater than 0.90, and as high as 0.97 for some pools. Estimates of family contributions to the pools based on quantitative genotypes in pooled DNA had a correlation of 0.85 with estimates of contributions from DNA-derived pedigree. Even with low numbers of SNPs of variable quality, parentage testing and family assignment from pooled samples are

  11. Sodium nitroprusside (SNP) alleviates the oxidative stress induced ...

    African Journals Online (AJOL)

    Oxidative damage is often induced by abiotic stress, nitric oxide (NO) is considered as a functional molecule in modulating antioxidant metabolism of plants. In the present study, effects of sodium nitroprusside (SNP), a NO donor, on the phenotype, antioxidant capacity and chloroplast ultrastructure of cucumber leaves were ...

  12. Genomic scans for selective sweeps using SNP data

    DEFF Research Database (Denmark)

    Nielsen, Rasmus; Williamson, Scott; Kim, Yuseob

    2005-01-01

    of the selection coefficient. To illustrate the method, we apply our approach to data from the Seattle SNP project and to Chromosome 2 data from the HapMap project. In Chromosome 2, the most extreme signal is found in the lactase gene, which previously has been shown to be undergoing positive selection. Evidence...

  13. Application of high resolution SNP arrays in patients with congenital ...

    Indian Academy of Sciences (India)

    clinical experience in implementing whole-genome high-resolution SNP arrays to investigate 33 patients with syndromic and .... Online Mendelian Inheritance in Man database (OMIM, ..... of damaged mitochondria through either autophagy or mito- ..... malformations: associations with maternal and infant character- istics in a ...

  14. Phenylethynylpyrene excimer forming hybridization probes for fluorescence SNP detection

    DEFF Research Database (Denmark)

    Prokhorenko, Igor A.; Astakhova, Irina V.; Momynaliev, Kuvat T.

    2009-01-01

    Excimer formation is a unique feature of some fluorescent dyes (e.g., pyrene) which can be used for probing the proximity of biomolecules. Pyrene excimer fluorescence has previously been used for homogeneous detection of single nucleotide polymorphism (SNP) on DNA. 1-Phenylethynylpyrene (1-1-PEPy...

  15. Do you really know where this SNP goes?

    Science.gov (United States)

    The release of build 10.2 of the swine genome was a marked improvement over previous builds and has proven extremely useful. However, as most know, there are regions of the genome that this particular build does not accurately represent. For instance, nearly 25% of the 62,162 SNP on the Illumina Por...

  16. SNP based heritability estimation using a Bayesian approach

    DEFF Research Database (Denmark)

    Krag, Kristian; Janss, Luc; Mahdi Shariati, Mohammad

    2013-01-01

    . Differences in family structure were in general not found to influence the estimation of the heritability. For the sample sizes used in this study, a 10-fold increase of SNP density did not improve precision estimates compared with set-ups with a less dense distribution of SNPs. The methods used in this study...

  17. Genome wide in silico SNP-tumor association analysis

    International Nuclear Information System (INIS)

    Qiu, Ping; Wang, Luquan; Kostich, Mitch; Ding, Wei; Simon, Jason S; Greene, Jonathan R

    2004-01-01

    Carcinogenesis occurs, at least in part, due to the accumulation of mutations in critical genes that control the mechanisms of cell proliferation, differentiation and death. Publicly accessible databases contain millions of expressed sequence tag (EST) and single nucleotide polymorphism (SNP) records, which have the potential to assist in the identification of SNPs overrepresented in tumor tissue. An in silico SNP-tumor association study was performed utilizing tissue library and SNP information available in NCBI's dbEST (release 092002) and dbSNP (build 106). A total of 4865 SNPs were identified which were present at higher allele frequencies in tumor compared to normal tissues. A subset of 327 (6.7%) SNPs induce amino acid changes to the protein coding sequences. This approach identified several SNPs which have been previously associated with carcinogenesis, as well as a number of SNPs that now warrant further investigation This novel in silico approach can assist in prioritization of genes and SNPs in the effort to elucidate the genetic mechanisms underlying the development of cancer

  18. SNP typing on the NanoChip electronic microarray

    DEFF Research Database (Denmark)

    Børsting, Claus; Sanchez Sanchez, Juan Jose; Morling, Niels

    2005-01-01

    We describe a single nucleotide polymorphism (SNP) typing protocol developed for the NanoChip electronic microarray. The NanoChip array consists of 100 electrodes covered by a thin hydrogel layer containing streptavidin. An electric currency can be applied to one, several, or all electrodes...

  19. In silico characterization of functional SNP within the oestrogen ...

    Indian Academy of Sciences (India)

    MAHA REBAÕ

    (polyphen-2, SNAP), as well as by the ESEfinder program, and one nonsense nsSNP was found. For noncoding ... mon type of genetic variation in the human genome that are ...... polymorphisms in type 2 diabetes mellitus and in android type.

  20. In silico characterization of functional SNP within the oestrogen ...

    Indian Academy of Sciences (India)

    MAHA REBAÕ

    found that one SNP in 5 UTR may potentially change protein expression level, nine SNPs were found to affect miRNA binding site and 28 SNPs might affect ..... Riancho et al. 2010), breast cancer (Tapper et al. 2008; Ding et al. .... in postmenopausal women: associations with common estrogen receptor alpha polymorphic ...

  1. iLOCi: a SNP interaction prioritization technique for detecting epistasis in genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Piriyapongsa Jittima

    2012-12-01

    Full Text Available Abstract Background Genome-wide association studies (GWAS do not provide a full account of the heritability of genetic diseases since gene-gene interactions, also known as epistasis are not considered in single locus GWAS. To address this problem, a considerable number of methods have been developed for identifying disease-associated gene-gene interactions. However, these methods typically fail to identify interacting markers explaining more of the disease heritability over single locus GWAS, since many of the interactions significant for disease are obscured by uninformative marker interactions e.g., linkage disequilibrium (LD. Results In this study, we present a novel SNP interaction prioritization algorithm, named iLOCi (Interacting Loci. This algorithm accounts for marker dependencies separately in case and control groups. Disease-associated interactions are then prioritized according to a novel ranking score calculated from the difference in marker dependencies for every possible pair between case and control groups. The analysis of a typical GWAS dataset can be completed in less than a day on a standard workstation with parallel processing capability. The proposed framework was validated using simulated data and applied to real GWAS datasets using the Wellcome Trust Case Control Consortium (WTCCC data. The results from simulated data showed the ability of iLOCi to identify various types of gene-gene interactions, especially for high-order interaction. From the WTCCC data, we found that among the top ranked interacting SNP pairs, several mapped to genes previously known to be associated with disease, and interestingly, other previously unreported genes with biologically related roles. Conclusion iLOCi is a powerful tool for uncovering true disease interacting markers and thus can provide a more complete understanding of the genetic basis underlying complex disease. The program is available for download at http://www4a.biotec.or.th/GI/tools/iloci.

  2. Genetic diversity and structure in a collection of tulip cultivars assessed by SNP markers

    NARCIS (Netherlands)

    Tang, N.; Shahin, A.; Bijman, P.J.J.; Liu, J.; Tuyl, van J.M.; Arens, P.

    2013-01-01

    Although tulip is one of the most important bulbous crops worldwide, the genetic background of most cultivars is unclear at present. The purposes of this study are to investigate genetic diversity and to identify the genetic structure and relationships among tulip cultivars. A total of 236

  3. Development of a SNP marker for detection of the low phytic acid ...

    African Journals Online (AJOL)

    Sharmane

    2013-02-27

    Feb 27, 2013 ... (Douglas et al., 2000; Li et al., 2000; Spencer et al., ..... Adams CL, Hambidge M, Raboy V, Dorsch JA, Sian L, Westcott JL,. Krebs NF (2002). .... Park S-W, An S-J, Yang H-B, Kwon J-K, Kang B-C (2009). Optimization of high ...

  4. High resolution melting (HRM) analysis in sugar beet: identification of SNP markers associated to Fusarium resistance

    Science.gov (United States)

    Fusarium spp. cause severe damage in many agricultural crops including sugar beet. Sugar beet needs to be protected from these soil borne pathogens to guarantee an optimal sugar yield in the field. The genetic control is the key to overcoming this disease. Identification of single nucleotide polymor...

  5. Development and mapping of gene-tagged SNP markers in laccases of maize (Zea mays L.)

    DEFF Research Database (Denmark)

    Andersen, J R; Asp, T; Lu, Y C

    2009-01-01

    Laccases, EC 1.10.3.2 or p-diphenol : dioxygen oxidoreductases, have been proposed to be involved in the oxidative polymerization of monolignols into lignins in plants. While 17 laccases have been identified in Arabidopsis, only five (ZmLac1-5) have so far been identified in maize. By a bioinform...

  6. Developing Single Nucleotide Polymorphism (SNP) markers for the identification of pineapple (Ananas comosus) germplasm

    Science.gov (United States)

    Pineapple (Ananas comosus [L.] Merr.) is the third most important tropical fruit in the world after banana and mango and a major agricultural commodity in Hawaii. As a crop with vegetative propagation, genetic redundancy is a major challenge for efficient genebank management and in breeding. Using E...

  7. A SNP resource for studying North American moose [version 1; referees: 2 approved, 1 approved with reservations

    Directory of Open Access Journals (Sweden)

    Theodore S. Kalbfleisch

    2018-01-01

    Full Text Available Background: Moose (Alces alces colonized the North American continent from Asia less than 15,000 years ago, and spread across the boreal forest regions of Canada and the northern United States (US.  Contemporary populations have low genetic diversity, due either to low number of individuals in the original migration (founder effect, and/or subsequent population bottlenecks in North America.  Genetic tests based on informative single nucleotide polymorphism (SNP markers are helpful in forensic and wildlife conservation activities, but have been difficult to develop for moose, due to the lack of a reference genome assembly and whole genome sequence (WGS data. Methods:  WGS data were generated for four individual moose from the US states of Alaska, Idaho, Wyoming, and Vermont with minimum and average genome coverage depths of 14- and 19-fold, respectively.  Cattle and sheep reference genomes were used for aligning sequence reads and identifying moose SNPs. Results:  Approximately 11% and 9% of moose WGS reads aligned to cattle and sheep genomes, respectively.  The reads clustered at genomic segments, where sequence identity between these species was greater than 95%.  In these segments, average mapped read depth was approximately 19-fold.  Sets of 46,005 and 36,934 high-confidence SNPs were identified from cattle and sheep comparisons, respectively, with 773 and 552 of those having minor allele frequency of 0.5 and conserved flanking sequences in all three species.  Among the four moose, heterozygosity and allele sharing of SNP genotypes were consistent with decreasing levels of moose genetic diversity from west to east.  A minimum set of 317 SNPs, informative across all four moose, was selected as a resource for future SNP assay design. Conclusions:  All SNPs and associated information are available, without restriction, to support development of SNP-based tests for animal identification, parentage determination, and estimating

  8. Reliable allele detection using SNP-based PCR primers containing Locked Nucleic Acid: application in genetic mapping

    Directory of Open Access Journals (Sweden)

    Trognitz Friederike

    2007-02-01

    Full Text Available Abstract Background The diploid, Solanum caripense, a wild relative of potato and tomato, possesses valuable resistance to potato late blight and we are interested in the genetic base of this resistance. Due to extremely low levels of genetic variation within the S. caripense genome it proved impossible to generate a dense genetic map and to assign individual Solanum chromosomes through the use of conventional chromosome-specific SSR, RFLP, AFLP, as well as gene- or locus-specific markers. The ease of detection of DNA polymorphisms depends on both frequency and form of sequence variation. The narrow genetic background of close relatives and inbreds complicates the detection of persisting, reduced polymorphism and is a challenge to the development of reliable molecular markers. Nonetheless, monomorphic DNA fragments representing not directly usable conventional markers can contain considerable variation at the level of single nucleotide polymorphisms (SNPs. This can be used for the design of allele-specific molecular markers. The reproducible detection of allele-specific markers based on SNPs has been a technical challenge. Results We present a fast and cost-effective protocol for the detection of allele-specific SNPs by applying Sequence Polymorphism-Derived (SPD markers. These markers proved highly efficient for fingerprinting of individuals possessing a homogeneous genetic background. SPD markers are obtained from within non-informative, conventional molecular marker fragments that are screened for SNPs to design allele-specific PCR primers. The method makes use of primers containing a single, 3'-terminal Locked Nucleic Acid (LNA base. We demonstrate the applicability of the technique by successful genetic mapping of allele-specific SNP markers derived from monomorphic Conserved Ortholog Set II (COSII markers mapped to Solanum chromosomes, in S. caripense. By using SPD markers it was possible for the first time to map the S. caripense alleles

  9. Accurate determination of genetic identity for a single cacao bean, using molecular markers with a nanofluidic system, ensures cocoa authentication.

    Science.gov (United States)

    Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Bellato, Cláudia M; Motilal, Lambert; Zhang, Dapeng

    2014-01-15

    Cacao (Theobroma cacao L.), the source of cocoa, is an economically important tropical crop. One problem with the premium cacao market is contamination with off-types adulterating raw premium material. Accurate determination of the genetic identity of single cacao beans is essential for ensuring cocoa authentication. Using nanofluidic single nucleotide polymorphism (SNP) genotyping with 48 SNP markers, we generated SNP fingerprints for small quantities of DNA extracted from the seed coat of single cacao beans. On the basis of the SNP profiles, we identified an assumed adulterant variety, which was unambiguously distinguished from the authentic beans by multilocus matching. Assignment tests based on both Bayesian clustering analysis and allele frequency clearly separated all 30 authentic samples from the non-authentic samples. Distance-based principle coordinate analysis further supported these results. The nanofluidic SNP protocol, together with forensic statistical tools, is sufficiently robust to establish authentication and to verify gourmet cacao varieties. This method shows significant potential for practical application.

  10. (SSR) markers

    African Journals Online (AJOL)

    acer

    2013-06-26

    Jun 26, 2013 ... analysis was in general agreement with PCoA in discrimi- nating the cultivars. Conclusions. Estimation of morphological diversity may provide addi- tional information on the present finding. Nonetheless, the 29 SSR markers provided considerable genetic reso- lution and this genetic diversity analysis ...

  11. (SSR) markers

    African Journals Online (AJOL)

    SAM

    2014-07-30

    Jul 30, 2014 ... India and the country is currently the leading producer, consumer and exporter of ... registration with the competent authority for plant variety protection. Conventionally ... detection of duplicates, parental verification in crosses, gene tagging in .... allelic patterns as revealed by the current set of SSR markers.

  12. Comparing the predictive abilities of phenotypic and marker-assisted selection methods in a biparental lettuce population

    Science.gov (United States)

    Breeding and selection for the traits with polygenic inheritance is a challenging task that can be done by phenotypic selection, by marker-assisted selection or by genome wide selection. We tested predictive ability of four selection models in a biparental population genotyped with 95 SNP markers an...

  13. Dissection of Recombination Attributes for Multiple Maize Populations Using a Common SNP Assay

    Directory of Open Access Journals (Sweden)

    Haiying Guan

    2017-11-01

    Full Text Available Recombination is a vital characteristic for quantitative trait loci mapping and breeding to enhance the yield potential of maize. However, recombination characteristics in globally used segregating populations have never been evaluated at similar genetic marker densities. This study aimed to divulge the characteristics of recombination events, recombinant chromosomal segments, and recombination frequency for four dissimilar populations. These populations were doubled haploid (DH, recombination inbred line (RIL, intermated B73xMo17 (IBM, and multi-parent advanced generation inter-cross (MAGIC, using the Illumina MaizeSNP50 BeadChip to provide markers. Our results revealed that the average number of recombination events was 16, 41, 72, and 86 per line in DH, RIL, IBM, and MAGIC populations, respectively. Accordingly, the average length of recombinant chromosomal segments was 84.8, 47.3, 29.2, and 20.4 Mb in DH, RIL, IBM, and MAGIC populations, respectively. Furtherly, the recombination frequency varied in different genomic regions and population types [DH (0–12.7 cM/Mb, RIL (0–15.5 cM/Mb, IBM (0–24.1 cM/Mb, MAGIC (0–42.3 cM/Mb]. Utilizing different sub-sets of lines, the recombination bin number and size were analyzed in each population. Additionally, different sub-sets of markers and lines were employed to estimate the recombination bin number and size via formulas for relationship in these populations. The relationship between recombination events and recombination bin length was also examined. Our results contribute to determining the most suitable number of genetic markers, lines in each population, and population type for successful mapping and breeding.

  14. A single-tube 27-plex SNP assay for estimating individual ancestry and admixture from three continents.

    Science.gov (United States)

    Wei, Yi-Liang; Wei, Li; Zhao, Lei; Sun, Qi-Fan; Jiang, Li; Zhang, Tao; Liu, Hai-Bo; Chen, Jian-Gang; Ye, Jian; Hu, Lan; Li, Cai-Xia

    2016-01-01

    A single-tube multiplex assay of a small set of ancestry-informative markers (AIMs) for effectively estimating individual ancestry and admixture is an ideal forensic tool to trace the population origin of an unknown DNA sample. We present a newly developed 27-plex single nucleotide polymorphism (SNP) panel with highly robust and balanced differential power to perfectly assign individuals to African, European, and East Asian ancestries. Evaluating 968 previously described intercontinental AIMs from three HapMap population genotyping datasets (Yoruban in Ibadan, Nigeria (YRI); Utah residents with Northern and Western European ancestry from the Centre de'Etude du Polymorphism Humain (CEPH) collection (CEU); and Han Chinese in Beijing, China (CHB)), the best set of markers was selected on the basis of Hardy-Weinberg equilibrium (p > 0.00001), population-specific allele frequency (two of three δ values >0.5), according to linkage disequilibrium (r (2) ancestry of the 11 populations in the HapMap project. Then, we tested the 27-plex SNP assay with 1164 individuals from 17 additional populations. The results demonstrated that the SNP panel was successful for ancestry inference of individuals with African, European, and East Asian ancestry. Furthermore, the system performed well when inferring the admixture of Eurasians (EUR/EAS) after analyzing admixed populations from Xinjiang (Central Asian) as follows: Tajik (68:27), Uyghur (49:46), Kirgiz (40:57), and Kazak (36:60). For individual analyses, we interpreted each sample with a three-ancestry component percentage and a population match probability sequence. This multiplex assay is a convenient and cost-effective tool to assist in criminal investigations, as well as to correct for the effects of population stratification for case-control studies.

  15. High-density SNP genotyping of tomato (Solanum lycopersicum L. reveals patterns of genetic variation due to breeding.

    Directory of Open Access Journals (Sweden)

    Sung-Chur Sim

    Full Text Available The effects of selection on genome variation were investigated and visualized in tomato using a high-density single nucleotide polymorphism (SNP array. 7,720 SNPs were genotyped on a collection of 426 tomato accessions (410 inbreds and 16 hybrids and over 97% of the markers were polymorphic in the entire collection. Principal component analysis (PCA and pairwise estimates of F(st supported that the inbred accessions represented seven sub-populations including processing, large-fruited fresh market, large-fruited vintage, cultivated cherry, landrace, wild cherry, and S. pimpinellifolium. Further divisions were found within both the contemporary processing and fresh market sub-populations. These sub-populations showed higher levels of genetic diversity relative to the vintage sub-population. The array provided a large number of polymorphic SNP markers across each sub-population, ranging from 3,159 in the vintage accessions to 6,234 in the cultivated cherry accessions. Visualization of minor allele frequency revealed regions of the genome that distinguished three representative sub-populations of cultivated tomato (processing, fresh market, and vintage, particularly on chromosomes 2, 4, 5, 6, and 11. The PCA loadings and F(st outlier analysis between these three sub-populations identified a large number of candidate loci under positive selection on chromosomes 4, 5, and 11. The extent of linkage disequilibrium (LD was examined within each chromosome for these sub-populations. LD decay varied between chromosomes and sub-populations, with large differences reflective of breeding history. For example, on chromosome 11, decay occurred over 0.8 cM for processing accessions and over 19.7 cM for fresh market accessions. The observed SNP variation and LD decay suggest that different patterns of genetic variation in cultivated tomato are due to introgression from wild species and selection for market specialization.

  16. Canonical correlation analysis for gene-based pleiotropy discovery.

    Directory of Open Access Journals (Sweden)

    Jose A Seoane

    2014-10-01

    Full Text Available Genome-wide association studies have identified a wealth of genetic variants involved in complex traits and multifactorial diseases. There is now considerable interest in testing variants for association with multiple phenotypes (pleiotropy and for testing multiple variants for association with a single phenotype (gene-based association tests. Such approaches can increase statistical power by combining evidence for association over multiple phenotypes or genetic variants respectively. Canonical Correlation Analysis (CCA measures the correlation between two sets of multidimensional variables, and thus offers the potential to combine these two approaches. To apply CCA, we must restrict the number of attributes relative to the number of samples. Hence we consider modules of genetic variation that can comprise a gene, a pathway or another biologically relevant grouping, and/or a set of phenotypes. In order to do this, we use an attribute selection strategy based on a binary genetic algorithm. Applied to a UK-based prospective cohort study of 4286 women (the British Women's Heart and Health Study, we find improved statistical power in the detection of previously reported genetic associations, and identify a number of novel pleiotropic associations between genetic variants and phenotypes. New discoveries include gene-based association of NSF with triglyceride levels and several genes (ACSM3, ERI2, IL18RAP, IL23RAP and NRG1 with left ventricular hypertrophy phenotypes. In multiple-phenotype analyses we find association of NRG1 with left ventricular hypertrophy phenotypes, fibrinogen and urea and pleiotropic relationships of F7 and F10 with Factor VII, Factor IX and cholesterol levels.

  17. De novo SNP discovery in the Scandinavian brown bear (Ursus arctos.

    Directory of Open Access Journals (Sweden)

    Anita J Norman

    Full Text Available Information about relatedness between individuals in wild populations is advantageous when studying evolutionary, behavioural and ecological processes. Genomic data can be used to determine relatedness between individuals either when no prior knowledge exists or to confirm suspected relatedness. Here we present a set of 96 SNPs suitable for inferring relatedness for brown bears (Ursus arctos within Scandinavia. We sequenced reduced representation libraries from nine individuals throughout the geographic range. With consensus reads containing putative SNPs, we applied strict filtering criteria with the aim of finding only high-quality, highly-informative SNPs. We tested 150 putative SNPs of which 96% were validated on a panel of 68 individuals. Ninety-six of the validated SNPs with the highest minor allele frequency were selected. The final SNP panel includes four mitochondrial markers, two monomorphic Y-chromosome sex-determination markers, three X-chromosome SNPs and 87 autosomal SNPs. From our validation sample panel, we identified two previously known parent-offspring dyads with reasonable accuracy. This panel of SNPs is a promising tool for inferring relatedness in the brown bear population in Scandinavia.

  18. Marker lamps

    International Nuclear Information System (INIS)

    Watkins, D.V.

    1980-01-01

    A marker lamp is described which consists of a block of transparent plastics material encapsulated in which is a radioactive light source. These lights comprise a small sealed glass capsule, the hollow inside surface of which is coated with phosphor and which contains tritium or similar radioactive gas. The use of such lamps for identification marking of routes, for example roads, and for identification of underwater oil pipelines is envisaged. (U.K.)

  19. Inter-laboratory evaluation of the EUROFORGEN Global ancestry-informative SNP panel by massively parallel sequencing using the Ion PGM™.

    Science.gov (United States)

    Eduardoff, M; Gross, T E; Santos, C; de la Puente, M; Ballard, D; Strobl, C; Børsting, C; Morling, N; Fusco, L; Hussing, C; Egyed, B; Souto, L; Uacyisrael, J; Syndercombe Court, D; Carracedo, Á; Lareu, M V; Schneider, P M; Parson, W; Phillips, C; Parson, W; Phillips, C

    2016-07-01

    The EUROFORGEN Global ancestry-informative SNP (AIM-SNPs) panel is a forensic multiplex of 128 markers designed to differentiate an individual's ancestry from amongst the five continental population groups of Africa, Europe, East Asia, Native America, and Oceania. A custom multiplex of AmpliSeq™ PCR primers was designed for the Global AIM-SNPs to perform massively parallel sequencing using the Ion PGM™ system. This study assessed individual SNP genotyping precision using the Ion PGM™, the forensic sensitivity of the multiplex using dilution series, degraded DNA plus simple mixtures, and the ancestry differentiation power of the final panel design, which required substitution of three original ancestry-informative SNPs with alternatives. Fourteen populations that had not been previously analyzed were genotyped using the custom multiplex and these studies allowed assessment of genotyping performance by comparison of data across five laboratories. Results indicate a low level of genotyping error can still occur from sequence misalignment caused by homopolymeric tracts close to the target SNP, despite careful scrutiny of candidate SNPs at the design stage. Such sequence misalignment required the exclusion of component SNP rs2080161 from the Global AIM-SNPs panel. However, the overall genotyping precision and sensitivity of this custom multiplex indicates the Ion PGM™ assay for the Global AIM-SNPs is highly suitable for forensic ancestry analysis with massively parallel sequencing. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  20. Ascertainment biases in SNP chips affect measures of population divergence

    DEFF Research Database (Denmark)

    Albrechtsen, Anders; Nielsen, Finn Cilius; Nielsen, Rasmus

    2010-01-01

    Chip-based high-throughput genotyping has facilitated genome-wide studies of genetic diversity. Many studies have utilized these large data sets to make inferences about the demographic history of human populations using measures of genetic differentiation such as F(ST) or principal component...... on direct sequencing. In addition, we also analyze publicly available genome-wide data. We demonstrate that the ascertainment biases will distort measures of human diversity and possibly change conclusions drawn from these measures in some times unexpected ways. We also show that details of the genotyping...... analyses. However, the single nucleotide polymorphism (SNP) chip data suffer from ascertainment biases caused by the SNP discovery process in which a small number of individuals from selected populations are used as discovery panels. In this study, we investigate the effect of the ascertainment bias...

  1. Assessing SNP-SNP interactions among DNA repair, modification and metabolism related pathway genes in breast cancer susceptibility.

    Directory of Open Access Journals (Sweden)

    Yadav Sapkota

    Full Text Available Genome-wide association studies (GWASs have identified low-penetrance common variants (i.e., single nucleotide polymorphisms, SNPs associated with breast cancer susceptibility. Although GWASs are primarily focused on single-locus effects, gene-gene interactions (i.e., epistasis are also assumed to contribute to the genetic risks for complex diseases including breast cancer. While it has been hypothesized that moderately ranked (P value based weak single-locus effects in GWASs could potentially harbor valuable information for evaluating epistasis, we lack systematic efforts to investigate SNPs showing consistent associations with weak statistical significance across independent discovery and replication stages. The objectives of this study were i to select SNPs showing single-locus effects with weak statistical significance for breast cancer in a GWAS and/or candidate-gene studies; ii to replicate these SNPs in an independent set of breast cancer cases and controls; and iii to explore their potential SNP-SNP interactions contributing to breast cancer susceptibility. A total of 17 SNPs related to DNA repair, modification and metabolism pathway genes were selected since these pathways offer a priori knowledge for potential epistatic interactions and an overall role in breast carcinogenesis. The study design included predominantly Caucasian women (2,795 cases and 4,505 controls from Alberta, Canada. We observed two two-way SNP-SNP interactions (APEX1-rs1130409 and RPAP1-rs2297381; MLH1-rs1799977 and MDM2-rs769412 in logistic regression that conferred elevated risks for breast cancer (P(interaction<7.3 × 10(-3. Logic regression identified an interaction involving four SNPs (MBD2-rs4041245, MLH1-rs1799977, MDM2-rs769412, BRCA2-rs1799943 (P(permutation = 2.4 × 10(-3. SNPs involved in SNP-SNP interactions also showed single-locus effects with weak statistical significance, while BRCA2-rs1799943 showed stronger statistical significance (P

  2. Application of high resolution SNP arrays in patients with congenital ...

    Indian Academy of Sciences (India)

    TING-YING LEI

    lent oligonucleotide-based array-CGH to determine the exact breakpoints in 14 patients with partial deletions of chromo- some 13q21.1-qter. They were able to refine the smallest deletion region linked to cleft lip/palate (13q31.3–13q33.1). Except for the arrays that measure DNA copy number differ- ences only, SNP arrays, ...

  3. SNPdetector: a software tool for sensitive and accurate SNP detection.

    Directory of Open Access Journals (Sweden)

    Jinghui Zhang

    2005-10-01

    Full Text Available Identification of single nucleotide polymorphisms (SNPs and mutations is important for the discovery of genetic predisposition to complex diseases. PCR resequencing is the method of choice for de novo SNP discovery. However, manual curation of putative SNPs has been a major bottleneck in the application of this method to high-throughput screening. Therefore it is critical to develop a more sensitive and accurate computational method for automated SNP detection. We developed a software tool, SNPdetector, for automated identification of SNPs and mutations in fluorescence-based resequencing reads. SNPdetector was designed to model the process of human visual inspection and has a very low false positive and false negative rate. We demonstrate the superior performance of SNPdetector in SNP and mutation analysis by comparing its results with those derived by human inspection, PolyPhred (a popular SNP detection tool, and independent genotype assays in three large-scale investigations. The first study identified and validated inter- and intra-subspecies variations in 4,650 traces of 25 inbred mouse strains that belong to either the Mus musculus species or the M. spretus species. Unexpected heterozygosity in CAST/Ei strain was observed in two out of 1,167 mouse SNPs. The second study identified 11,241 candidate SNPs in five ENCODE regions of the human genome covering 2.5 Mb of genomic sequence. Approximately 50% of the candidate SNPs were selected for experimental genotyping; the validation rate exceeded 95%. The third study detected ENU-induced mutations (at 0.04% allele frequency in 64,896 traces of 1,236 zebra fish. Our analysis of three large and diverse test datasets demonstrated that SNPdetector is an effective tool for genome-scale research and for large-sample clinical studies. SNPdetector runs on Unix/Linux platform and is available publicly (http://lpg.nci.nih.gov.

  4. Robust Demographic Inference from Genomic and SNP Data

    Science.gov (United States)

    Excoffier, Laurent; Dupanloup, Isabelle; Huerta-Sánchez, Emilia; Sousa, Vitor C.; Foll, Matthieu

    2013-01-01

    We introduce a flexible and robust simulation-based framework to infer demographic parameters from the site frequency spectrum (SFS) computed on large genomic datasets. We show that our composite-likelihood approach allows one to study evolutionary models of arbitrary complexity, which cannot be tackled by other current likelihood-based methods. For simple scenarios, our approach compares favorably in terms of accuracy and speed with , the current reference in the field, while showing better convergence properties for complex models. We first apply our methodology to non-coding genomic SNP data from four human populations. To infer their demographic history, we compare neutral evolutionary models of increasing complexity, including unsampled populations. We further show the versatility of our framework by extending it to the inference of demographic parameters from SNP chips with known ascertainment, such as that recently released by Affymetrix to study human origins. Whereas previous ways of handling ascertained SNPs were either restricted to a single population or only allowed the inference of divergence time between a pair of populations, our framework can correctly infer parameters of more complex models including the divergence of several populations, bottlenecks and migration. We apply this approach to the reconstruction of African demography using two distinct ascertained human SNP panels studied under two evolutionary models. The two SNP panels lead to globally very similar estimates and confidence intervals, and suggest an ancient divergence (>110 Ky) between Yoruba and San populations. Our methodology appears well suited to the study of complex scenarios from large genomic data sets. PMID:24204310

  5. SNP-based linkage mapping for validation of QTLs for resistance to ascochyta blight in lentil

    Directory of Open Access Journals (Sweden)

    Shimna Sudheesh

    2016-11-01

    Full Text Available Lentil (Lens culinaris Medik. is a self-pollinating, diploid, annual, cool-season, food legume crop that is cultivated throughout the world. Ascochyta blight (AB, caused by Ascochyta lentis Vassilievsky, is an economically important and widespread disease of lentil. Development of cultivars with high levels of durable resistance provides an environmentally acceptable and economically feasible method for AB control. A detailed understanding of the genetic basis of AB resistance is hence highly desirable, in order to obtain insight into the number and influence of resistance genes. Genetic linkage maps based on single nucleotide polymorphisms (SNP and simple sequence repeat (SSR markers have been developed from three recombinant inbred line (RIL populations. The IH x NF map contained 460 loci across 1461.6 cM, while the IH x DIG map contained 329 loci across 1302.5 cM and the third map, NF x DIG contained 330 loci across 1914.1 cM. Data from these maps were combined with a map from a previously published study through use of bridging markers to generate a consensus linkage map containing 689 loci distributed across 7 linkage groups (LGs, with a cumulative length of 2429.61 cM at an average density of one marker per 3.5 cM. Trait dissection of AB resistance was performed for the RIL populations, identifying totals of two and three quantitative trait loci (QTLs explaining 52% and 69% of phenotypic variation for resistance to infection in the IH x DIG and IH x NF populations, respectively. Presence of common markers in the vicinity of the AB_IH1- and AB_IH2.1/AB_IH2.2-containing regions on both maps supports the inference that a common genomic region is responsible for conferring resistance and is associated with the resistant parent, Indianhead. The third QTL was derived from Northfield. Evaluation of markers associated with AB resistance across a diverse lentil germplasm panel revealed that the identity of alleles associated with AB_IH1 predicted

  6. snpTree - a web-server to identify and construct SNP trees from whole genome sequence data

    DEFF Research Database (Denmark)

    Leekitcharoenphon, Pimlapas; Kaas, Rolf Sommer; Thomsen, Martin Christen Frølund

    2012-01-01

    identify SNPs and construct phylogenetic trees from WGS as well as from assembled genomes or contigs. WGS data in fastq format are aligned to reference genomes by BWA while contigs in fasta format are processed by Nucmer. SNPs are concatenated based on position on reference genome and a tree is constructed...... to differentiate and classify isolates. One of the successfully and broadly used methods is analysis of single nucletide polymorphisms (SNPs). Currently, there are different tools and methods to identify SNPs including various options and cut-off values. Furthermore, all current methods require bioinformatic...... skills. Thus, we lack a standard and simple automatic tool to determine SNPs and construct phylogenetic tree from WGS data. Results Here we introduce snpTree, a server for online-automatic SNPs analysis. This tool is composed of different SNPs analysis suites, perl and python scripts. snpTree can...

  7. Detecting imbalanced expression of SNP alleles by minisequencing on microarrays

    Directory of Open Access Journals (Sweden)

    Dahlgren Andreas

    2004-10-01

    Full Text Available Abstract Background Each of the human genes or transcriptional units is likely to contain single nucleotide polymorphisms that may give rise to sequence variation between individuals and tissues on the level of RNA. Based on recent studies, differential expression of the two alleles of heterozygous coding single nucleotide polymorphisms (SNPs may be frequent for human genes. Methods with high accuracy to be used in a high throughput setting are needed for systematic surveys of expressed sequence variation. In this study we evaluated two formats of multiplexed, microarray based minisequencing for quantitative detection of imbalanced expression of SNP alleles. We used a panel of ten SNPs located in five genes known to be expressed in two endothelial cell lines as our model system. Results The accuracy and sensitivity of quantitative detection of allelic imbalance was assessed for each SNP by constructing regression lines using a dilution series of mixed samples from individuals of different genotype. Accurate quantification of SNP alleles by both assay formats was evidenced for by R2 values > 0.95 for the majority of the regression lines. According to a two sample t-test, we were able to distinguish 1–9% of a minority SNP allele from a homozygous genotype, with larger variation between SNPs than between assay formats. Six of the SNPs, heterozygous in either of the two cell lines, were genotyped in RNA extracted from the endothelial cells. The coefficient of variation between the fluorescent signals from five parallel reactions was similar for cDNA and genomic DNA. The fluorescence signal intensity ratios measured in the cDNA samples were compared to those in genomic DNA to determine the relative expression levels of the two alleles of each SNP. Four of the six SNPs tested displayed a higher than 1.4-fold difference in allelic ratios between cDNA and genomic DNA. The results were verified by allele-specific oligonucleotide hybridisation and

  8. Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assays

    Directory of Open Access Journals (Sweden)

    Wagner Mark C

    2005-05-01

    Full Text Available Abstract Background Microbial forensics is important in tracking the source of a pathogen, whether the disease is a naturally occurring outbreak or part of a criminal investigation. Results A method and SPR Opt (SNP and PCR-RFLP Optimization software to perform a comprehensive, whole-genome analysis to forensically discriminate multiple sequences is presented. Tools for the optimization of forensic typing using Single Nucleotide Polymorphism (SNP and PCR-Restriction Fragment Length Polymorphism (PCR-RFLP analyses across multiple isolate sequences of a species are described. The PCR-RFLP analysis includes prediction and selection of optimal primers and restriction enzymes to enable maximum isolate discrimination based on sequence information. SPR Opt calculates all SNP or PCR-RFLP variations present in the sequences, groups them into haplotypes according to their co-segregation across those sequences, and performs combinatoric analyses to determine which sets of haplotypes provide maximal discrimination among all the input sequences. Those set combinations requiring that membership in the fewest haplotypes be queried (i.e. the fewest assays be performed are found. These analyses highlight variable regions based on existing sequence data. These markers may be heterogeneous among unsequenced isolates as well, and thus may be useful for characterizing the relationships among unsequenced as well as sequenced isolates. The predictions are multi-locus. Analyses of mumps and SARS viruses are summarized. Phylogenetic trees created based on SNPs, PCR-RFLPs, and full genomes are compared for SARS virus, illustrating that purported phylogenies based only on SNP or PCR-RFLP variations do not match those based on multiple sequence alignment of the full genomes. Conclusion This is the first software to optimize the selection of forensic markers to maximize information gained from the fewest assays, accepting whole or partial genome sequence data as input. As

  9. Development and validation of a 20K single nucleotide polymorphism (SNP) whole genome genotyping array for apple (Malus × domestica Borkh).

    Science.gov (United States)

    Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

    2014-01-01

    High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.

  10. Development and validation of a 20K single nucleotide polymorphism (SNP whole genome genotyping array for apple (Malus × domestica Borkh.

    Directory of Open Access Journals (Sweden)

    Luca Bianco

    Full Text Available High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus. A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs. Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.

  11. Development and Validation of a 20K Single Nucleotide Polymorphism (SNP) Whole Genome Genotyping Array for Apple (Malus × domestica Borkh)

    Science.gov (United States)

    Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

    2014-01-01

    High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs. PMID:25303088

  12. Genotyping Rs2274625 Marker in NPHS2 Gene Associated with Nephrotic Syndrome in Isfahan Population

    Directory of Open Access Journals (Sweden)

    L Esmaili Chamgordani

    2015-12-01

    Full Text Available Introduction: Nephrotic syndrome (NS is a genetic disease belonging to a heterogeneous group of glomerular disorders, which mainly occurs within the children. Linkage analysis using single nucleotide polymorphisms (SNP is used as an indirect method in molecular diagnosis of the disease. A large number of SNP markers have been introduced in NPHS2gene in the available electronic databases. Method: In the present study, the genotype and informative status of rs2274625 marker in NPHS2 genewas investigated in 120 unrelated healthy individuals using Tetra-primer ARMS PCR technique and newly designed primers. Allelic frequency and presence of Hardy Weinberg Equilibrium (HWE was estimated using GenePop website. Furthermore, PowerMarker software was utilized in order to compute the index of polymorphism information content (PIC. Results: The study results indicated allele frequency of 97% and 3% for C and T alleles, respectively, in regard with rs2274625 marker within Isfahan population. Moreover, the PIC for the rs2274625 marker was 0.5%, and HWE revealed the equilibruim of the study population in regard with the related marker. Conclusion: As the study findings indicated, rs2274625 could be introduced as an SNP marker in the linkage analysis in order to molecularly trace NPHS2 gene mutations in molecular NS diagnosis in Isfahan population as a representative sample of the Iranian population.

  13. Genotyping-by-sequencing (GBS), an ultimate marker-assisted selection (MAS) tool to accelerate plant breeding

    OpenAIRE

    He, Jiangfeng; Zhao, Xiaoqing; Laroche, André; Lu, Zhen-Xiang; Liu, HongKui; Li, Ziqin

    2014-01-01

    Marker-assisted selection (MAS) refers to the use of molecular markers to assist phenotypic selections in crop improvement. Several types of molecular markers, such as single nucleotide polymorphism (SNP), have been identified and effectively used in plant breeding. The application of next-generation sequencing (NGS) technologies has led to remarkable advances in whole genome sequencing, which provides ultra-throughput sequences to revolutionize plant genotyping and breeding. To further broad...

  14. Whole-genome SNP association in the horse: identification of a deletion in myosin Va responsible for Lavender Foal Syndrome.

    Directory of Open Access Journals (Sweden)

    Samantha A Brooks

    2010-04-01

    Full Text Available Lavender Foal Syndrome (LFS is a lethal inherited disease of horses with a suspected autosomal recessive mode of inheritance. LFS has been primarily diagnosed in a subgroup of the Arabian breed, the Egyptian Arabian horse. The condition is characterized by multiple neurological abnormalities and a dilute coat color. Candidate genes based on comparative phenotypes in mice and humans include the ras-associated protein RAB27a (RAB27A and myosin Va (MYO5A. Here we report mapping of the locus responsible for LFS using a small set of 36 horses segregating for LFS. These horses were genotyped using a newly available single nucleotide polymorphism (SNP chip containing 56,402 discriminatory elements. The whole genome scan identified an associated region containing these two functional candidate genes. Exon sequencing of the MYO5A gene from an affected foal revealed a single base deletion in exon 30 that changes the reading frame and introduces a premature stop codon. A PCR-based Restriction Fragment Length Polymorphism (PCR-RFLP assay was designed and used to investigate the frequency of the mutant gene. All affected horses tested were homozygous for this mutation. Heterozygous carriers were detected in high frequency in families segregating for this trait, and the frequency of carriers in unrelated Egyptian Arabians was 10.3%. The mapping and discovery of the LFS mutation represents the first successful use of whole-genome SNP scanning in the horse for any trait. The RFLP assay can be used to assist breeders in avoiding carrier-to-carrier matings and thus in preventing the birth of affected foals.

  15. Rapid detection of SNP (c.309T>G in the MDM2 gene by the Duplex SmartAmp method.

    Directory of Open Access Journals (Sweden)

    Yasuaki Enokida

    Full Text Available BACKGROUND: Genetic polymorphisms in the human MDM2 gene are suggested to be a tumor susceptibility marker and a prognostic factor for cancer. It has been reported that a single nucleotide polymorphism (SNP c.309T>G in the MDM2 gene attenuates the tumor suppressor activity of p53 and accelerates tumor formation in humans. METHODOLOGY: In this study, to detect the SNP c.309T>G in the MDM2 gene, we have developed a new SNP detection method, named "Duplex SmartAmp," which enabled us to simultaneously detect both 309T and 309G alleles in one tube. To develop this new method, we introduced new primers i.e., nBP and oBPs, as well as two different fluorescent dyes that separately detect those genetic polymorphisms. RESULTS AND CONCLUSIONS: By the Duplex SmartAmp method, the genetic polymorphisms of the MDM2 gene were detected directly from a small amount of genomic DNA or blood samples. We used 96 genomic DNA and 24 blood samples to validate the Duplex SmartAmp by comparison with results of the conventional PCR-RFLP method; consequently, the Duplex SmartAmp results agreed totally with those of the PCR-RFLP method. Thus, the new SNP detection method is considered useful for detecting the SNP c.309T>G in the MDM2 gene so as to judge cancer susceptibility against some cellular stress in the clinical setting, and also to handle a large number of samples and enable rapid clinical diagnosis.

  16. Clinical significance of SNP (rs2596542 in histocompatibility complex class I-related gene A promoter region among hepatitis C virus related hepatocellular carcinoma cases

    Directory of Open Access Journals (Sweden)

    Amal A. Mohamed

    2017-07-01

    Full Text Available The major histocompatibility complex class I-related gene A (MICA is an antigen induced by stress and performs an integral role in immune responses as an anti-infectious and antitumor agent. This work was designed to investigate whether (SNP rs2596542C/T in MICA promoter region is predictive of liver cirrhosis (LC and hepatocellular carcinoma (HCC or not. Forty-seven healthy controls and 94 HCV-infected patients, subdivided into 47 LC and 47 HCC subjects were enrolled in this study. SNP association was studied using real time PCR and soluble serum MICA concentration was measured using ELISA. Results showed that heterozygous genotype rs2596542CT was significantly (P = 0.022 distributed between HCC and LC related CHC patients. The sMICA was significantly higher (P = 0.0001 among HCC and LC. No significant association (P = 0.56 between rs2596542CT genotypes and sMICA levels was observed. Studying SNP rs2596542C/T association with HCC and LC susceptibility revealed that statistical significant differences (P = 0.013, P = 0.027 were only observed between SNP rs2596542C/T and each of HCC and LC, respectively, versus healthy controls, indicating that the rs2596542C/T genetic variation is not a significant contributor to HCC development in LC patients. Moreover, the T allele was considered a risk factor for HCC and LC vulnerability in HCV patients (OR = 1.93 and 2.1, respectively, while the C allele contributes to decreasing HCC risk. Therefore, SNP (rs2596542C/T in MICA promoter region and sMICA levels might be potential useful markers in the assessment of liver disease progression to LC and HCC.

  17. rSNPBase 3.0: an updated database of SNP-related regulatory elements, element-gene pairs and SNP-based gene regulatory networks.

    Science.gov (United States)

    Guo, Liyuan; Wang, Jing

    2018-01-04

    Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation database rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of analysis scope from SNP-related regulatory elements to include regulatory element-target gene pairs (E-G pairs), therefore it can provide SNP-based gene regulatory networks. (ii) Web function was modified according to data content and a new network search module is provided in the rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data query for detailed information (related-elements, element-gene pairs, and other extended annotations) on specific SNPs and SNP-related graphic networks constructed by interacting transcription factors (TFs), miRNAs and genes. (3) The type of regulatory elements was modified and enriched. To our best knowledge, the updated rSNPBase 3.0 is the first data tool supports SNP functional analysis from a regulatory network prospective, it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Identification of molecular markers associated with Verticillium wilt resistance in alfalfa (Medicago sativa L.) using high-resolution melting.

    Science.gov (United States)

    Zhang, Tiejun; Yu, Long-Xi; McCord, Per; Miller, David; Bhamidimarri, Suresh; Johnson, David; Monteros, Maria J; Ho, Julie; Reisen, Peter; Samac, Deborah A

    2014-01-01

    Verticillium wilt, caused by the soilborne fungus, Verticillium alfalfae, is one of the most serious diseases of alfalfa (Medicago sativa L.) worldwide. To identify loci associated with resistance to Verticillium wilt, a bulk segregant analysis was conducted in susceptible or resistant pools constructed from 13 synthetic alfalfa populations, followed by association mapping in two F1 populations consisted of 352 individuals. Simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers were used for genotyping. Phenotyping was done by manual inoculation of the pathogen to replicated cloned plants of each individual and disease severity was scored using a standard scale. Marker-trait association was analyzed by TASSEL. Seventeen SNP markers significantly associated with Verticillium wilt resistance were identified and they were located on chromosomes 1, 2, 4, 7 and 8. SNP markers identified on chromosomes 2, 4 and 7 co-locate with regions of Verticillium wilt resistance loci reported in M. truncatula. Additional markers identified on chromosomes 1 and 8 located the regions where no Verticillium resistance locus has been reported. This study highlights the value of SNP genotyping by high resolution melting to identify the disease resistance loci in tetraploid alfalfa. With further validation, the markers identified in this study could be used for improving resistance to Verticillium wilt in alfalfa breeding programs.

  19. Identification of molecular markers associated with Verticillium wilt resistance in alfalfa (Medicago sativa L. using high-resolution melting.

    Directory of Open Access Journals (Sweden)

    Tiejun Zhang

    Full Text Available Verticillium wilt, caused by the soilborne fungus, Verticillium alfalfae, is one of the most serious diseases of alfalfa (Medicago sativa L. worldwide. To identify loci associated with resistance to Verticillium wilt, a bulk segregant analysis was conducted in susceptible or resistant pools constructed from 13 synthetic alfalfa populations, followed by association mapping in two F1 populations consisted of 352 individuals. Simple sequence repeat (SSR and single nucleotide polymorphism (SNP markers were used for genotyping. Phenotyping was done by manual inoculation of the pathogen to replicated cloned plants of each individual and disease severity was scored using a standard scale. Marker-trait association was analyzed by TASSEL. Seventeen SNP markers significantly associated with Verticillium wilt resistance were identified and they were located on chromosomes 1, 2, 4, 7 and 8. SNP markers identified on chromosomes 2, 4 and 7 co-locate with regions of Verticillium wilt resistance loci reported in M. truncatula. Additional markers identified on chromosomes 1 and 8 located the regions where no Verticillium resistance locus has been reported. This study highlights the value of SNP genotyping by high resolution melting to identify the disease resistance loci in tetraploid alfalfa. With further validation, the markers identified in this study could be used for improving resistance to Verticillium wilt in alfalfa breeding programs.

  20. High-density single nucleotide polymorphism (SNP) array mapping in Brassica oleracea: identification of QTL associated with carotenoid variation in broccoli florets.

    Science.gov (United States)

    Brown, Allan F; Yousef, Gad G; Chebrolu, Kranthi K; Byrd, Robert W; Everhart, Koyt W; Thomas, Aswathy; Reid, Robert W; Parkin, Isobel A P; Sharpe, Andrew G; Oliver, Rebekah; Guzman, Ivette; Jackson, Eric W

    2014-09-01

    A high-resolution genetic linkage map of B. oleracea was developed from a B. napus SNP array. The work will facilitate genetic and evolutionary studies in Brassicaceae. A broccoli population, VI-158 × BNC, consisting of 150 F2:3 families was used to create a saturated Brassica oleracea (diploid: CC) linkage map using a recently developed rapeseed (Brassica napus) (tetraploid: AACC) Illumina Infinium single nucleotide polymorphism (SNP) array. The map consisted of 547 non-redundant SNP markers spanning 948.1 cM across nine chromosomes with an average interval size of 1.7 cM. As the SNPs are anchored to the genomic reference sequence of the rapid cycling B. oleracea TO1000, we were able to estimate that the map provides 96 % coverage of the diploid genome. Carotenoid analysis of 2 years data identified 3 QTLs on two chromosomes that are associated with up to half of the phenotypic variation associated with the accumulation of total or individual compounds. By searching the genome sequences of the two related diploid species (B. oleracea and B. rapa), we further identified putative carotenoid candidate genes in the region of these QTLs. This is the first description of the use of a B. napus SNP array to rapidly construct high-density genetic linkage maps of one of the constituent diploid species. The unambiguous nature of these markers with regard to genomic sequences provides evidence to the nature of genes underlying the QTL, and demonstrates the value and impact this resource will have on Brassica research.

  1. Single nucleotide polymorphism (SNP) detection on a magnetoresistive sensor

    DEFF Research Database (Denmark)

    Rizzi, Giovanni; Østerberg, Frederik Westergaard; Dufva, Martin

    2013-01-01

    We present a magnetoresistive sensor platform for hybridization assays and demonstrate its applicability on single nucleotide polymorphism (SNP) genotyping. The sensor relies on anisotropic magnetoresistance in a new geometry with a local negative reference and uses the magnetic field from...... the sensor bias current to magnetize magnetic beads in the vicinity of the sensor. The method allows for real-time measurements of the specific bead binding to the sensor surface during DNA hybridization and washing. Compared to other magnetic biosensing platforms, our approach eliminates the need...... for external electromagnets and thus allows for miniaturization of the sensor platform....

  2. SNP and haplotype mapping for genetic analysis in the rat

    Czech Academy of Sciences Publication Activity Database

    Saar, K.; Beck, A.; Bihoreau, M. T.; Birney, E.; Brocklebank, D.; Chen, Y.; Cuppen, E.; Demonchy, S.; Dopazo, J.; Flicek, P.; Foglio, M.; Fujiyama, A.; Gut, I. G.; Gauguier, D.; Guigo, R.; Guryev, V.; Heinig, M.; Hummel, O.; Jahn, N.; Klages, S.; Křen, Vladimír; Kube, M.; Kuhl, H.; Kuramoto, T.; Pravenec, Michal

    2008-01-01

    Roč. 40, č. 5 (2008), s. 560-566 ISSN 1061-4036 R&D Projects: GA MŠk(CZ) 1P05ME791; GA MŠk(CZ) 1M0520; GA MŠk(CZ) ME08006 Grant - others:HHMI(US) 55005624; -(XE) LSHG-CT-2005-019015 Institutional research plan: CEZ:AV0Z50110509 Source of funding: N - neverejné zdroje ; R - rámcový projekt EK Keywords : SNP * rat * complete map Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 30.259, year: 2008

  3. Preliminary Study on the Single Nucleotide Polymorphism (SNP of XRCC1 Gene Identificationto Improve the Outcomes of Radiotherapy for Cervical Cancer

    Directory of Open Access Journals (Sweden)

    Devita Tetriana

    2015-09-01

    Full Text Available Cervical cancer is the most fatal disease among Indonesian women. In recognition of the substantial variation in the intrinsic response of individuals to radiation, an effort had been done to identify the genetic markers, primarily Single Nucleotide polymorphisms (SNPs, which are associated with responsiveness of cancer cells to radiation therapy. One of these SNPs is X-ray repair cross-complementing protein 1 (XRCC1 that is one of the most important genes in deoxyribonucleic acid (DNA repair pathways. Meta-analysis in the determination of the association of XRCC1 polymorphisms with cervical cancer revealed the potential role of XRCC1 polymorphisms in predicting cell response to radiotherapy.Our preliminary study with real-time polymerase chain reaction (RT-PCR showed that radiotherapy affected the XRCC1 gene analyzed in blood of cervical cancer patient. Other published study found three SNPs of XRCC1 (Arg194Trp, Arg280His, and Arg399Gln that cause amino acid substitutions. Arg194Trp is only SNPs that associated with high risk of cervical cancer but not others. Additionally, structure and function of this protein can be altered by functional SNPs, which may lead to the susceptibility of individuals to cancers. Anotherstudy found G399A polymorphisms. We concluded that SNP of this DNA repair genes have been found to be good predictors of efficacy of radiotherapy.Kanker serviks adalah penyakit yang paling fatal pada perempuan di Indonesia. Untuk memahami variasi substansial respon intrinsik individual terhadap radiasi, suatu usaha telah dilakukan untuk mengidentifikasi petanda genetik, terutama Single Nucleotide polymorphism (SNP, yang berkaitan dengan responsel kanker terhadap terapi radiasi. Satu dari SNP tersebut adalah X-ray repair cross-complementing protein 1 (XRCC1 yang merupakan satu dari gen paling penting dalam lajur perbaikan asam deoksiribonukleat (DNA. Meta-analysis dalam penentuan hubungan polimorfisme XRCC1 dengan kanker serviks

  4. Wheat in the Mediterranean revisited--tetraploid wheat landraces assessed with elite bread wheat Single Nucleotide Polymorphism markers.

    Science.gov (United States)

    Oliveira, Hugo R; Hagenblad, Jenny; Leino, Matti W; Leigh, Fiona J; Lister, Diane L; Penã-Chocarro, Leonor; Jones, Martin K

    2014-05-08

    Single Nucleotide Polymorphism (SNP) panels recently developed for the assessment of genetic diversity in wheat are primarily based on elite varieties, mostly those of bread wheat. The usefulness of such SNP panels for studying wheat evolution and domestication has not yet been fully explored and ascertainment bias issues can potentially affect their applicability when studying landraces and tetraploid ancestors of bread wheat. We here evaluate whether population structure and evolutionary history can be assessed in tetraploid landrace wheats using SNP markers previously developed for the analysis of elite cultivars of hexaploid wheat. We genotyped more than 100 tetraploid wheat landraces and wild emmer wheat accessions, some of which had previously been screened with SSR markers, for an existing SNP panel and obtained publically available genotypes for the same SNPs for hexaploid wheat varieties and landraces. Results showed that quantification of genetic diversity can be affected by ascertainment bias but that the effects of ascertainment bias can at least partly be alleviated by merging SNPs to haplotypes. Analyses of population structure and genetic differentiation show strong subdivision between the tetraploid wheat subspecies, except for durum and rivet that are not separable. A more detailed population structure of durum landraces could be obtained than with SSR markers. The results also suggest an emmer, rather than durum, ancestry of bread wheat and with gene flow from wild emmer. SNP markers developed for elite cultivars show great potential for inferring population structure and can address evolutionary questions in landrace wheat. Issues of marker genome specificity and mapping need, however, to be addressed. Ascertainment bias does not seem to interfere with the ability of a SNP marker system developed for elite bread wheat accessions to detect population structure in other types of wheat.

  5. A gene-based radiation hybrid map of the gilthead sea bream Sparus aurata refines and exploits conserved synteny with Tetraodon nigroviridis

    Directory of Open Access Journals (Sweden)

    Tsalavouta Matina

    2007-02-01

    Full Text Available Abstract Background Comparative teleost studies are of great interest since they are important in aquaculture and in evolutionary issues. Comparing genomes of fully sequenced model fish species with those of farmed fish species through comparative mapping offers shortcuts for quantitative trait loci (QTL detections and for studying genome evolution through the identification of regions of conserved synteny in teleosts. Here a comparative mapping study is presented by radiation hybrid (RH mapping genes of the gilthead sea bream Sparus aurata, a non-model teleost fish of commercial and evolutionary interest, as it represents the worldwide distributed species-rich family of Sparidae. Results An additional 74 microsatellite markers and 428 gene-based markers appropriate for comparative mapping studies were mapped on the existing RH map of Sparus aurata. The anchoring of the RH map to the genetic linkage map resulted in 24 groups matching the karyotype of Sparus aurata. Homologous sequences to Tetraodon were identified for 301 of the gene-based markers positioned on the RH map of Sparus aurata. Comparison between Sparus aurata RH groups and Tetraodon chromosomes (karyotype of Tetraodon consists of 21 chromosomes in this study reveals an unambiguous one-to-one relationship suggesting that three Tetraodon chromosomes correspond to six Sparus aurata radiation hybrid groups. The exploitation of this conserved synteny relationship is furthermore demonstrated by in silico mapping of gilthead sea bream expressed sequence tags (EST that give a significant similarity hit to Tetraodon. Conclusion The addition of primarily gene-based markers increased substantially the density of the existing RH map and facilitated comparative analysis. The anchoring of this gene-based radiation hybrid map to the genome maps of model species broadened the pool of candidate genes that mainly control growth, disease resistance, sex determination and reversal, reproduction as well

  6. Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.

    Directory of Open Access Journals (Sweden)

    Carole F S Koning-Boucoiran

    2015-04-01

    Full Text Available In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array.Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L. genome. Of these 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular.

  7. Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).

    Science.gov (United States)

    Koning-Boucoiran, Carole F S; Esselink, G Danny; Vukosavljev, Mirjana; van 't Westende, Wendy P C; Gitonga, Virginia W; Krens, Frans A; Voorrips, Roeland E; van de Weg, W Eric; Schulz, Dietmar; Debener, Thomas; Maliepaard, Chris; Arens, Paul; Smulders, Marinus J M

    2015-01-01

    In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular.

  8. FAO/IAEA international symposium on applications of gene-based technologies for improving animal production and health in developing countries. Book of extended synopses

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2003-07-01

    Genetic engineering is at the forefront of much biological research - basic, adaptive and applied or near market. Manipulation of genes to bring about the expression of a specific product, or to produce a characteristic or trait, offers exciting possibilities within both the plant and the animal kingdom. The opportunities, in terms of improving livestock productivity or reducing losses from disease, lie in a number of areas. In almost all areas of this research, isotopic markers are extensively used and are in most cases essential for achieving the levels of sensitivity required for genetic characterization and manipulation. Genetic engineering has the potential to solve many problems relating to animal productivity and health. At present the focus is on the problems that face livestock producers in the developed world. If the full benefit of this technology is to be realized globally, the problems confronting livestock farmers in developing countries will have to be considered. The characterization and application of methods in these regions has to be managed and exploited. It is hoped that this Symposium will stimulate the international exchange of information and ideas that contribute to greater accessibility and enhanced use of gene based technologies in animal agriculture in developing countries. OBJECTIVES: To create an interactive environment to discuss the role and future potential of gene based technologies for improving animal production and health; To identify constraints in the use of gene based technologies in developing countries and to determine how to use these technologies in a simple, practical way; To identify and prioritize specific research needs; To explore the possibility of international co-ordination in the area of gene based technologies in animal agriculture; To examine ethical, technological, policy and environmental issues and the role of nuclear techniques in the further development and application of gene based technologies with

  9. FAO/IAEA international symposium on applications of gene-based technologies for improving animal production and health in developing countries. Book of extended synopses

    International Nuclear Information System (INIS)

    2003-01-01

    Genetic engineering is at the forefront of much biological research - basic, adaptive and applied or near market. Manipulation of genes to bring about the expression of a specific product, or to produce a characteristic or trait, offers exciting possibilities within both the plant and the animal kingdom. The opportunities, in terms of improving livestock productivity or reducing losses from disease, lie in a number of areas. In almost all areas of this research, isotopic markers are extensively used and are in most cases essential for achieving the levels of sensitivity required for genetic characterization and manipulation. Genetic engineering has the potential to solve many problems relating to animal productivity and health. At present the focus is on the problems that face livestock producers in the developed world. If the full benefit of this technology is to be realized globally, the problems confronting livestock farmers in developing countries will have to be considered. The characterization and application of methods in these regions has to be managed and exploited. It is hoped that this Symposium will stimulate the international exchange of information and ideas that contribute to greater accessibility and enhanced use of gene based technologies in animal agriculture in developing countries. OBJECTIVES: To create an interactive environment to discuss the role and future potential of gene based technologies for improving animal production and health; To identify constraints in the use of gene based technologies in developing countries and to determine how to use these technologies in a simple, practical way; To identify and prioritize specific research needs; To explore the possibility of international co-ordination in the area of gene based technologies in animal agriculture; To examine ethical, technological, policy and environmental issues and the role of nuclear techniques in the further development and application of gene based technologies with

  10. A SNP based high-density linkage map of Apis cerana reveals a high recombination rate similar to Apis mellifera.

    Directory of Open Access Journals (Sweden)

    Yuan Yuan Shi

    Full Text Available BACKGROUND: The Eastern honey bee, Apis cerana Fabricius, is distributed in southern and eastern Asia, from India and China to Korea and Japan and southeast to the Moluccas. This species is also widely kept for honey production besides Apis mellifera. Apis cerana is also a model organism for studying social behavior, caste determination, mating biology, sexual selection, and host-parasite interactions. Few resources are available for molecular research in this species, and a linkage map was never constructed. A linkage map is a prerequisite for quantitative trait loci mapping and for analyzing genome structure. We used the Chinese honey bee, Apis cerana cerana to construct the first linkage map in the Eastern honey bee. RESULTS: F2 workers (N = 103 were genotyped for 126,990 single nucleotide polymorphisms (SNPs. After filtering low quality and those not passing the Mendel test, we obtained 3,000 SNPs, 1,535 of these were informative and used to construct a linkage map. The preliminary map contains 19 linkage groups, we then mapped the 19 linkage groups to 16 chromosomes by comparing the markers to the genome of A. mellfiera. The final map contains 16 linkage groups with a total of 1,535 markers. The total genetic distance is 3,942.7 centimorgans (cM with the largest linkage group (180 loci measuring 574.5 cM. Average marker interval for all markers across the 16 linkage groups is 2.6 cM. CONCLUSION: We constructed a high density linkage map for A. c. cerana with 1,535 markers. Because the map is based on SNP markers, it will enable easier and faster genotyping assays than randomly amplified polymorphic DNA or microsatellite based maps used in A. mellifera.

  11. Fine-scaled human genetic structure revealed by SNP microarrays.

    Science.gov (United States)

    Xing, Jinchuan; Watkins, W Scott; Witherspoon, David J; Zhang, Yuhua; Guthery, Stephen L; Thara, Rangaswamy; Mowry, Bryan J; Bulayeva, Kazima; Weiss, Robert B; Jorde, Lynn B

    2009-05-01

    We report an analysis of more than 240,000 loci genotyped using the Affymetrix SNP microarray in 554 individuals from 27 worldwide populations in Africa, Asia, and Europe. To provide a more extensive and complete sampling of human genetic variation, we have included caste and tribal samples from two states in South India, Daghestanis from eastern Europe, and the Iban from Malaysia. Consistent with observations made by Charles Darwin, our results highlight shared variation among human populations and demonstrate that much genetic variation is geographically continuous. At the same time, principal components analyses reveal discernible genetic differentiation among almost all identified populations in our sample, and in most cases, individuals can be clearly assigned to defined populations on the basis of SNP genotypes. All individuals are accurately classified into continental groups using a model-based clustering algorithm, but between closely related populations, genetic and self-classifications conflict for some individuals. The 250K data permitted high-level resolution of genetic variation among Indian caste and tribal populations and between highland and lowland Daghestani populations. In particular, upper-caste individuals from Tamil Nadu and Andhra Pradesh form one defined group, lower-caste individuals from these two states form another, and the tribal Irula samples form a third. Our results emphasize the correlation of genetic and geographic distances and highlight other elements, including social factors that have contributed to population structure.

  12. A SNP uncoupling Mina expression from the TGFβ signaling pathway.

    Science.gov (United States)

    Lian, Shang L; Mihi, Belgacem; Koyanagi, Madoka; Nakayama, Toshinori; Bix, Mark

    2018-03-01

    Mina is a JmjC family 2-oxoglutarate oxygenase with pleiotropic roles in cell proliferation, cancer, T cell differentiation, pulmonary inflammation, and intestinal parasite expulsion. Although Mina expression varies according to cell-type, developmental stage and activation state, its transcriptional regulation is poorly understood. Across inbred mouse strains, Mina protein level exhibits a bimodal distribution, correlating with inheritance of a biallelic haplotype block comprising 21 promoter/intron 1-region SNPs. We previously showed that heritable differences in Mina protein level are transcriptionally regulated. Accordingly, we decided to test the hypothesis that at least one of the promoter/intron 1-region SNPs perturbs a Mina cis-regulatory element (CRE). Here, we have comprehensively scanned for CREs across a Mina locus-spanning 26-kilobase genomic interval. We discovered 8 potential CREs and functionally validated 4 of these, the strongest of which (E2), residing in intron 1, contained a SNP whose BALB/c-but not C57Bl/6 allele-abolished both Smad3 binding and transforming growth factor beta (TGFβ) responsiveness. Our results demonstrate the TGFβ signaling pathway plays a critical role in regulating Mina expression and SNP rs4191790 controls heritable variation in Mina expression level, raising important questions regarding the evolution of an allele that uncouples Mina expression from the TGFβ signaling pathway. © 2017 The Authors. Immunity, Inflammation and Disease Published by John Wiley & Sons Ltd.

  13. Psoriasis prediction from genome-wide SNP profiles

    Directory of Open Access Journals (Sweden)

    Fang Xiangzhong

    2011-01-01

    Full Text Available Abstract Background With the availability of large-scale genome-wide association study (GWAS data, choosing an optimal set of SNPs for disease susceptibility prediction is a challenging task. This study aimed to use single nucleotide polymorphisms (SNPs to predict psoriasis from searching GWAS data. Methods Totally we had 2,798 samples and 451,724 SNPs. Process for searching a set of SNPs to predict susceptibility for psoriasis consisted of two steps. The first one was to search top 1,000 SNPs with high accuracy for prediction of psoriasis from GWAS dataset. The second one was to search for an optimal SNP subset for predicting psoriasis. The sequential information bottleneck (sIB method was compared with classical linear discriminant analysis(LDA for classification performance. Results The best test harmonic mean of sensitivity and specificity for predicting psoriasis by sIB was 0.674(95% CI: 0.650-0.698, while only 0.520(95% CI: 0.472-0.524 was reported for predicting disease by LDA. Our results indicate that the new classifier sIB performs better than LDA in the study. Conclusions The fact that a small set of SNPs can predict disease status with average accuracy of 68% makes it possible to use SNP data for psoriasis prediction.

  14. High-throughput bacterial SNP typing identifies distinct clusters of Salmonella Typhi causing typhoid in Nepalese children

    LENUS (Irish Health Repository)

    Holt, Kathryn E

    2010-05-31

    Abstract Background Salmonella Typhi (S. Typhi) causes typhoid fever, which remains an important public health issue in many developing countries. Kathmandu, the capital of Nepal, is an area of high incidence and the pediatric population appears to be at high risk of exposure and infection. Methods We recently defined the population structure of S. Typhi, using new sequencing technologies to identify nearly 2,000 single nucleotide polymorphisms (SNPs) that can be used as unequivocal phylogenetic markers. Here we have used the GoldenGate (Illumina) platform to simultaneously type 1,500 of these SNPs in 62 S. Typhi isolates causing severe typhoid in children admitted to Patan Hospital in Kathmandu. Results Eight distinct S. Typhi haplotypes were identified during the 20-month study period, with 68% of isolates belonging to a subclone of the previously defined H58 S. Typhi. This subclone was closely associated with resistance to nalidixic acid, with all isolates from this group demonstrating a resistant phenotype and harbouring the same resistance-associated SNP in GyrA (Phe83). A secondary clone, comprising 19% of isolates, was observed only during the second half of the study. Conclusions Our data demonstrate the utility of SNP typing for monitoring bacterial populations over a defined period in a single endemic setting. We provide evidence for genotype introduction and define a nalidixic acid resistant subclone of S. Typhi, which appears to be the dominant cause of severe pediatric typhoid in Kathmandu during the study period.

  15. SNPpy--database management for SNP data from genome wide association studies.

    Directory of Open Access Journals (Sweden)

    Faheem Mitha

    Full Text Available BACKGROUND: We describe SNPpy, a hybrid script database system using the Python SQLAlchemy library coupled with the PostgreSQL database to manage genotype data from Genome-Wide Association Studies (GWAS. This system makes it possible to merge study data with HapMap data and merge across studies for meta-analyses, including data filtering based on the values of phenotype and Single-Nucleotide Polymorphism (SNP data. SNPpy and its dependencies are open source software. RESULTS: The current version of SNPpy offers utility functions to import genotype and annotation data from two commercial platforms. We use these to import data from two GWAS studies and the HapMap Project. We then export these individual datasets to standard data format files that can be imported into statistical software for downstream analyses. CONCLUSIONS: By leveraging the power of relational databases, SNPpy offers integrated management and manipulation of genotype and phenotype data from GWAS studies. The analysis of these studies requires merging across GWAS datasets as well as patient and marker selection. To this end, SNPpy enables the user to filter the data and output the results as standardized GWAS file formats. It does low level and flexible data validation, including validation of patient data. SNPpy is a practical and extensible solution for investigators who seek to deploy central management of their GWAS data.

  16. dartr: An r package to facilitate analysis of SNP data generated from reduced representation genome sequencing.

    Science.gov (United States)

    Gruber, Bernd; Unmack, Peter J; Berry, Oliver F; Georges, Arthur

    2018-05-01

    Although vast technological advances have been made and genetic software packages are growing in number, it is not a trivial task to analyse SNP data. We announce a new r package, dartr, enabling the analysis of single nucleotide polymorphism data for population genomic and phylogenomic applications. dartr provides user-friendly functions for data quality control and marker selection, and permits rigorous evaluations of conformation to Hardy-Weinberg equilibrium, gametic-phase disequilibrium and neutrality. The package reports standard descriptive statistics, permits exploration of patterns in the data through principal components analysis and conducts standard F-statistics, as well as basic phylogenetic analyses, population assignment, isolation by distance and exports data to a variety of commonly used downstream applications (e.g., newhybrids, faststructure and phylogeny applications) outside of the r environment. The package serves two main purposes: first, a user-friendly approach to lower the hurdle to analyse such data-therefore, the package comes with a detailed tutorial targeted to the r beginner to allow data analysis without requiring deep knowledge of r. Second, we use a single, well-established format-genlight from the adegenet package-as input for all our functions to avoid data reformatting. By strictly using the genlight format, we hope to facilitate this format as the de facto standard of future software developments and hence reduce the format jungle of genetic data sets. The dartr package is available via the r CRAN network and GitHub. © 2017 John Wiley & Sons Ltd.

  17. Detecting selection signatures between Duroc and Duroc synthetic pig populations using high-density SNP chip.

    Science.gov (United States)

    Edea, Z; Hong, J-K; Jung, J-H; Kim, D-W; Kim, Y-M; Kim, E-S; Shin, S S; Jung, Y C; Kim, K-S

    2017-08-01

    The development of high throughput genotyping techniques has facilitated the identification of selection signatures of pigs. The detection of genomic selection signals in a population subjected to differential selection pressures may provide insights into the genes associated with economically and biologically important traits. To identify genomic regions under selection, we genotyped 488 Duroc (D) pigs and 155 D × Korean native pigs (DKNPs) using the Porcine SNP70K BeadChip. By applying the F ST and extended haplotype homozygosity (EHH-Rsb) methods, we detected genes under directional selection associated with growth/stature (DOCK7, PLCB4, HS2ST1, FBP2 and TG), carcass and meat quality (TG, COL14A1, FBXO5, NR3C1, SNX7, ARHGAP26 and DPYD), number of teats (LOC100153159 and LRRC1), pigmentation (MME) and ear morphology (SOX5), which are all mostly near or at fixation. These results could be a basis for investigating the underlying mutations associated with observed phenotypic variation. Validation using genome-wide association analysis would also facilitate the inclusion of some of these markers in genetic evaluation programs. © 2017 Stichting International Foundation for Animal Genetics.

  18. Genome-wide SNP discovery in tetraploid alfalfa using 454 sequencing and high resolution melting analysis

    Directory of Open Access Journals (Sweden)

    Zhao Patrick X

    2011-07-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs are the most common type of sequence variation among plants and are often functionally important. We describe the use of 454 technology and high resolution melting analysis (HRM for high throughput SNP discovery in tetraploid alfalfa (Medicago sativa L., a species with high economic value but limited genomic resources. Results The alfalfa genotypes selected from M. sativa subsp. sativa var. 'Chilean' and M. sativa subsp. falcata var. 'Wisfal', which differ in water stress sensitivity, were used to prepare cDNA from tissue of clonally-propagated plants grown under either well-watered or water-stressed conditions, and then pooled for 454 sequencing. Based on 125.2 Mb of raw sequence, a total of 54,216 unique sequences were obtained including 24,144 tentative consensus (TCs sequences and 30,072 singletons, ranging from 100 bp to 6,662 bp in length, with an average length of 541 bp. We identified 40,661 candidate SNPs distributed throughout the genome. A sample of candidate SNPs were evaluated and validated using high resolution melting (HRM analysis. A total of 3,491 TCs harboring 20,270 candidate SNPs were located on the M. truncatula (MT 3.5.1 chromosomes. Gene Ontology assignments indicate that sequences obtained cover a broad range of GO categories. Conclusions We describe an efficient method to identify thousands of SNPs distributed throughout the alfalfa genome covering a broad range of GO categories. Validated SNPs represent valuable molecular marker resources that can be used to enhance marker density in linkage maps, identify potential factors involved in heterosis and genetic variation, and as tools for association mapping and genomic selection in alfalfa.

  19. Development and validation of the Axiom(®) Apple480K SNP genotyping array.

    Science.gov (United States)

    Bianco, Luca; Cestaro, Alessandro; Linsmith, Gareth; Muranty, Hélène; Denancé, Caroline; Théron, Anthony; Poncet, Charles; Micheletti, Diego; Kerschbamer, Emanuela; Di Pierro, Erica A; Larger, Simone; Pindo, Massimo; Van de Weg, Eric; Davassi, Alessandro; Laurens, François; Velasco, Riccardo; Durel, Charles-Eric; Troggio, Michela

    2016-04-01

    Cultivated apple (Malus × domestica Borkh.) is one of the most important fruit crops in temperate regions, and has great economic and cultural value. The apple genome is highly heterozygous and has undergone a recent duplication which, combined with a rapid linkage disequilibrium decay, makes it difficult to perform genome-wide association (GWA) studies. Single nucleotide polymorphism arrays offer highly multiplexed assays at a relatively low cost per data point and can be a valid tool for the identification of the markers associated with traits of interest. Here, we describe the development and validation of a 487K SNP Affymetrix Axiom(®) genotyping array for apple and discuss its potential applications. The array has been built from the high-depth resequencing of 63 different cultivars covering most of the genetic diversity in cultivated apple. The SNPs were chosen by applying a focal points approach to enrich genic regions, but also to reach a uniform coverage of non-genic regions. A total of 1324 apple accessions, including the 92 progenies of two mapping populations, have been genotyped with the Axiom(®) Apple480K to assess the effectiveness of the array. A large majority of SNPs (359 994 or 74%) fell in the stringent class of poly high resolution polymorphisms. We also devised a filtering procedure to identify a subset of 275K very robust markers that can be safely used for germplasm surveys in apple. The Axiom(®) Apple480K has now been commercially released both for public and proprietary use and will likely be a reference tool for GWA studies in apple. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  20. Effects of Single Nucleotide Polymorphism Marker Density on Haplotype Block Partition

    Directory of Open Access Journals (Sweden)

    Sun Ah Kim

    2016-12-01

    Full Text Available Many researchers have found that one of the most important characteristics of the structure of linkage disequilibrium is that the human genome can be divided into non-overlapping block partitions in which only a small number of haplotypes are observed. The location and distribution of haplotype blocks can be seen as a population property influenced by population genetic events such as selection, mutation, recombination and population structure. In this study, we investigate the effects of the density of markers relative to the full set of all polymorphisms in the region on the results of haplotype partitioning for five popular haplotype block partition methods: three methods in Haploview (confidence interval, four gamete test, and solid spine, MIG++ implemented in PLINK 1.9 and S-MIG++. We used several experimental datasets obtained by sampling subsets of single nucleotide polymorphism (SNP markers of chromosome 22 region in the 1000 Genomes Project data and also the HapMap phase 3 data to compare the results of haplotype block partitions by five methods. With decreasing sampling ratio down to 20% of the original SNP markers, the total number of haplotype blocks decreases and the length of haplotype blocks increases for all algorithms. When we examined the marker-independence of the haplotype block locations constructed from the datasets of different density, the results using below 50% of the entire SNP markers were very different from the results using the entire SNP markers. We conclude that the haplotype block construction results should be used and interpreted carefully depending on the selection of markers and the purpose of the study.

  1. An improved PSO algorithm for generating protective SNP barcodes in breast cancer.

    Directory of Open Access Journals (Sweden)

    Li-Yeh Chuang

    Full Text Available BACKGROUND: Possible single nucleotide polymorphism (SNP interactions in breast cancer are usually not investigated in genome-wide association studies. Previously, we proposed a particle swarm optimization (PSO method to compute these kinds of SNP interactions. However, this PSO does not guarantee to find the best result in every implement, especially when high-dimensional data is investigated for SNP-SNP interactions. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we propose IPSO algorithm to improve the reliability of PSO for the identification of the best protective SNP barcodes (SNP combinations and genotypes with maximum difference between cases and controls associated with breast cancer. SNP barcodes containing different numbers of SNPs were computed. The top five SNP barcode results are retained for computing the next SNP barcode with a one-SNP-increase for each processing step. Based on the simulated data for 23 SNPs of six steroid hormone metabolisms and signalling-related genes, the performance of our proposed IPSO algorithm is evaluated. Among 23 SNPs, 13 SNPs displayed significant odds ratio (OR values (1.268 to 0.848; p<0.05 for breast cancer. Based on IPSO algorithm, the jointed effect in terms of SNP barcodes with two to seven SNPs show significantly decreasing OR values (0.84 to 0.57; p<0.05 to 0.001. Using PSO algorithm, two to four SNPs show significantly decreasing OR values (0.84 to 0.77; p<0.05 to 0.001. Based on the results of 20 simulations, medians of the maximum differences for each SNP barcode generated by IPSO are higher than by PSO. The interquartile ranges of the boxplot, as well as the upper and lower hinges for each n-SNP barcode (n = 3∼10 are more narrow in IPSO than in PSO, suggesting that IPSO is highly reliable for SNP barcode identification. CONCLUSIONS/SIGNIFICANCE: Overall, the proposed IPSO algorithm is robust to provide exact identification of the best protective SNP barcodes for breast cancer.

  2. Kazusa Marker DataBase: a database for genomics, genetics, and molecular breeding in plants

    Science.gov (United States)

    Shirasawa, Kenta; Isobe, Sachiko; Tabata, Satoshi; Hirakawa, Hideki

    2014-01-01

    In order to provide useful genomic information for agronomical plants, we have established a database, the Kazusa Marker DataBase (http://marker.kazusa.or.jp). This database includes information on DNA markers, e.g., SSR and SNP markers, genetic linkage maps, and physical maps, that were developed at the Kazusa DNA Research Institute. Keyword searches for the markers, sequence data used for marker development, and experimental conditions are also available through this database. Currently, 10 plant species have been targeted: tomato (Solanum lycopersicum), pepper (Capsicum annuum), strawberry (Fragaria × ananassa), radish (Raphanus sativus), Lotus japonicus, soybean (Glycine max), peanut (Arachis hypogaea), red clover (Trifolium pratense), white clover (Trifolium repens), and eucalyptus (Eucalyptus camaldulensis). In addition, the number of plant species registered in this database will be increased as our research progresses. The Kazusa Marker DataBase will be a useful tool for both basic and applied sciences, such as genomics, genetics, and molecular breeding in crops. PMID:25320561

  3. Genetic dissection of powdery mildew resistance in interspecific half-sib grapevine families using SNP-based maps.

    Science.gov (United States)

    Teh, Soon Li; Fresnedo-Ramírez, Jonathan; Clark, Matthew D; Gadoury, David M; Sun, Qi; Cadle-Davidson, Lance; Luby, James J

    2017-01-01

    Quantitative trait locus (QTL) identification in perennial fruit crops is impeded largely by their lengthy generation time, resulting in costly and labor-intensive maintenance of breeding programs. In a grapevine (genus Vitis ) breeding program, although experimental families are typically unreplicated, the genetic backgrounds may contain similar progenitors previously selected due to their contribution of favorable alleles. In this study, we investigated the utility of joint QTL identification provided by analyzing half-sib families. The genetic control of powdery mildew was studied using two half-sib F 1 families, namely GE0711/1009 (MN1264 × MN1214; N  = 147) and GE1025 (MN1264 × MN1246; N  = 125) with multiple species in their ancestry. Maternal genetic maps consisting of 1077 and 1641 single nucleotide polymorphism (SNP) markers, respectively, were constructed using a pseudo-testcross strategy. Ratings of field resistance to powdery mildew were obtained based on whole-plant evaluation of disease severity. This 2-year analysis uncovered two QTLs that were validated on a consensus map in these half-sib families with improved precision relative to the parental maps. Examination of haplotype combinations based on the two QTL regions identified strong association of haplotypes inherited from 'Seyval blanc', through MN1264, with powdery mildew resistance. This investigation also encompassed the use of microsatellite markers to establish a correlation between 206-bp (UDV-015b) and 357-bp (VViv67) fragment sizes with resistance-carrying haplotypes. Our work is one of the first reports in grapevine demonstrating the use of SNP-based maps and haplotypes for QTL identification and tagging of powdery mildew resistance in half-sib families.

  4. SNP detection for massively parallel whole-genome resequencing

    DEFF Research Database (Denmark)

    Li, Ruiqiang; Li, Yingrui; Fang, Xiaodong

    2009-01-01

    -genome or target region resequencing. Here, we have developed a consensus-calling and SNP-detection method for sequencing-by-synthesis Illumina Genome Analyzer technology. We designed this method by carefully considering the data quality, alignment, and experimental errors common to this technology. All...... of this information was integrated into a single quality score for each base under Bayesian theory to measure the accuracy of consensus calling. We tested this methodology using a large-scale human resequencing data set of 36x coverage and assembled a high-quality nonrepetitive consensus sequence for 92.......25% of the diploid autosomes and 88.07% of the haploid X chromosome. Comparison of the consensus sequence with Illumina human 1M BeadChip genotyped alleles from the same DNA sample showed that 98.6% of the 37,933 genotyped alleles on the X chromosome and 98% of 999,981 genotyped alleles on autosomes were covered...

  5. Honey bee-inspired algorithms for SNP haplotype reconstruction problem

    Science.gov (United States)

    PourkamaliAnaraki, Maryam; Sadeghi, Mehdi

    2016-03-01

    Reconstructing haplotypes from SNP fragments is an important problem in computational biology. There have been a lot of interests in this field because haplotypes have been shown to contain promising data for disease association research. It is proved that haplotype reconstruction in Minimum Error Correction model is an NP-hard problem. Therefore, several methods such as clustering techniques, evolutionary algorithms, neural networks and swarm intelligence approaches have been proposed in order to solve this problem in appropriate time. In this paper, we have focused on various evolutionary clustering techniques and try to find an efficient technique for solving haplotype reconstruction problem. It can be referred from our experiments that the clustering methods relying on the behaviour of honey bee colony in nature, specifically bees algorithm and artificial bee colony methods, are expected to result in more efficient solutions. An application program of the methods is available at the following link. http://www.bioinf.cs.ipm.ir/software/haprs/

  6. Grouping preprocess for haplotype inference from SNP and CNV data

    International Nuclear Information System (INIS)

    Shindo, Hiroyuki; Chigira, Hiroshi; Nagaoka, Tomoyo; Inoue, Masato; Kamatani, Naoyuki

    2009-01-01

    The method of statistical haplotype inference is an indispensable technique in the field of medical science. The authors previously reported Hardy-Weinberg equilibrium-based haplotype inference that could manage single nucleotide polymorphism (SNP) data. We recently extended the method to cover copy number variation (CNV) data. Haplotype inference from mixed data is important because SNPs and CNVs are occasionally in linkage disequilibrium. The idea underlying the proposed method is simple, but the algorithm for it needs to be quite elaborate to reduce the calculation cost. Consequently, we have focused on the details on the algorithm in this study. Although the main advantage of the method is accuracy, in that it does not use any approximation, its main disadvantage is still the calculation cost, which is sometimes intractable for large data sets with missing values.

  7. Grouping preprocess for haplotype inference from SNP and CNV data

    Energy Technology Data Exchange (ETDEWEB)

    Shindo, Hiroyuki; Chigira, Hiroshi; Nagaoka, Tomoyo; Inoue, Masato [Department of Electrical Engineering and Bioscience, School of Advanced Science and Engineering, Waseda University, 3-4-1, Okubo, Shinjuku-ku, Tokyo 169-8555 (Japan); Kamatani, Naoyuki, E-mail: masato.inoue@eb.waseda.ac.j [Institute of Rheumatology, Tokyo Women' s Medical University, 10-22, Kawada-cho, Shinjuku-ku, Tokyo 162-0054 (Japan)

    2009-12-01

    The method of statistical haplotype inference is an indispensable technique in the field of medical science. The authors previously reported Hardy-Weinberg equilibrium-based haplotype inference that could manage single nucleotide polymorphism (SNP) data. We recently extended the method to cover copy number variation (CNV) data. Haplotype inference from mixed data is important because SNPs and CNVs are occasionally in linkage disequilibrium. The idea underlying the proposed method is simple, but the algorithm for it needs to be quite elaborate to reduce the calculation cost. Consequently, we have focused on the details on the algorithm in this study. Although the main advantage of the method is accuracy, in that it does not use any approximation, its main disadvantage is still the calculation cost, which is sometimes intractable for large data sets with missing values.

  8. UPD detection using homozygosity profiling with a SNP genotyping microarray.

    Science.gov (United States)

    Papenhausen, Peter; Schwartz, Stuart; Risheg, Hiba; Keitges, Elisabeth; Gadi, Inder; Burnside, Rachel D; Jaswaney, Vikram; Pappas, John; Pasion, Romela; Friedman, Kenneth; Tepperberg, James

    2011-04-01

    Single nucleotide polymorphism (SNP) based chromosome microarrays provide both a high-density whole genome analysis of copy number and genotype. In the past 21 months we have analyzed over 13,000 samples primarily referred for developmental delay using the Affymetrix SNP/CN 6.0 version array platform. In addition to copy number, we have focused on the relative distribution of allele homozygosity (HZ) throughout the genome to confirm a strong association of uniparental disomy (UPD) with regions of isoallelism found in most confirmed cases of UPD. We sought to determine whether a long contiguous stretch of HZ (LCSH) greater than a threshold value found only in a single chromosome would correlate with UPD of that chromosome. Nine confirmed UPD cases were retrospectively analyzed with the array in the study, each showing the anticipated LCSH with the smallest 13.5 Mb in length. This length is well above the average longest run of HZ in a set of control patients and was then set as the prospective threshold for reporting possible UPD correlation. Ninety-two cases qualified at that threshold, 46 of those had molecular UPD testing and 29 were positive. Including retrospective cases, 16 showed complete HZ across the chromosome, consistent with total isoUPD. The average size LCSH in the 19 cases that were not completely HZ was 46.3 Mb with a range of 13.5-127.8 Mb. Three patients showed only segmental UPD. Both the size and location of the LCSH are relevant to correlation with UPD. Further studies will continue to delineate an optimal threshold for LCSH/UPD correlation. Copyright © 2011 Wiley-Liss, Inc.

  9. Differential growth of Mycobacterium leprae strains (SNP genotypes) in armadillos.

    Science.gov (United States)

    Sharma, Rahul; Singh, Pushpendra; Pena, Maria; Subramanian, Ramesh; Chouljenko, Vladmir; Kim, Joohyun; Kim, Nayong; Caskey, John; Baudena, Marie A; Adams, Linda B; Truman, Richard W

    2018-04-14

    Leprosy (Hansen's Disease) has occurred throughout human history, and persists today at a low prevalence in most populations. Caused by Mycobacterium leprae, the infection primarily involves the skin, mucosa and peripheral nerves. The susceptible host range for Mycobacterium leprae is quite narrow. Besides humans, nine banded armadillos (Dasypus novemcinctus) and red squirrels (Sciurus vulgaris) are the only other natural hosts for M. leprae, but only armadillos recapitulate the disease as seen in humans. Armadillos across the Southern United States harbor a single predominant genotypic strain (SNP Type-3I) of M. leprae, which is also implicated in the zoonotic transmission of leprosy. We investigated, whether the zoonotic strain (3I) has any notable growth advantages in armadillos over another genetically distant strain-type (SNP Type-4P) of M. leprae, and if M. leprae strains manifest any notably different pathology among armadillos. We co-infected armadillos (n = 6) with 2 × 10 9 highly viable M. leprae of both strains and assessed the relative growth and dissemination of each strain in the animals. We also analyzed 12 additional armadillos, 6 each individually infected with the same quantity of either strain. The infections were allowed to fulminate and the clinical manifestations of the disease were noted. Animals were humanely sacrificed at the terminal stage of infection and the number of bacilli per gram of liver, spleen and lymph node tissue were enumerated by Q-PCR assay. The growth of M. leprae strain 4P was significantly higher (P leprae strains within armadillos suggest there are notable pathological variations between M. leprae strain-types. Copyright © 2018. Published by Elsevier B.V.

  10. Application of multi-SNP approaches Bayesian LASSO and AUC-RF to detect main effects of inflammatory-gene variants associated with bladder cancer risk.

    Directory of Open Access Journals (Sweden)

    Evangelina López de Maturana

    Full Text Available The relationship between inflammation and cancer is well established in several tumor types, including bladder cancer. We performed an association study between 886 inflammatory-gene variants and bladder cancer risk in 1,047 cases and 988 controls from the Spanish Bladder Cancer (SBC/EPICURO Study. A preliminary exploration with the widely used univariate logistic regression approach did not identify any significant SNP after correcting for multiple testing. We further applied two more comprehensive methods to capture the complexity of bladder cancer genetic susceptibility: Bayesian Threshold LASSO (BTL, a regularized regression method, and AUC-Random Forest, a machine-learning algorithm. Both approaches explore the joint effect of markers. BTL analysis identified a signature of 37 SNPs in 34 genes showing an association with bladder cancer. AUC-RF detected an optimal predictive subset of 56 SNPs. 13 SNPs were identified by both methods in the total population. Using resources from the Texas Bladder Cancer study we were able to replicate 30% of the SNPs assessed. The associations between inflammatory SNPs and bladder cancer were reexamined among non-smokers to eliminate the effect of tobacco, one of the strongest and most prevalent environmental risk factor for this tumor. A 9 SNP-signature was detected by BTL. Here we report, for the first time, a set of SNP in inflammatory genes jointly associated with bladder cancer risk. These results highlight the importance of the complex structure of genetic susceptibility associated with cancer risk.

  11. A 34K SNP genotyping array for Populus trichocarpa: design, application to the study of natural populations and transferability to other Populus species.

    Science.gov (United States)

    Geraldes, A; Difazio, S P; Slavov, G T; Ranjan, P; Muchero, W; Hannemann, J; Gunter, L E; Wymore, A M; Grassa, C J; Farzaneh, N; Porth, I; McKown, A D; Skyba, O; Li, E; Fujita, M; Klápště, J; Martin, J; Schackwitz, W; Pennacchio, C; Rokhsar, D; Friedmann, M C; Wasteneys, G O; Guy, R D; El-Kassaby, Y A; Mansfield, S D; Cronk, Q C B; Ehlting, J; Douglas, C J; Tuskan, G A

    2013-03-01

    Genetic mapping of quantitative traits requires genotypic data for large numbers of markers in many individuals. For such studies, the use of large single nucleotide polymorphism (SNP) genotyping arrays still offers the most cost-effective solution. Herein we report on the design and performance of a SNP genotyping array for Populus trichocarpa (black cottonwood). This genotyping array was designed with SNPs pre-ascertained in 34 wild accessions covering most of the species latitudinal range. We adopted a candidate gene approach to the array design that resulted in the selection of 34 131 SNPs, the majority of which are located in, or within 2 kb of, 3543 candidate genes. A subset of the SNPs on the array (539) was selected based on patterns of variation among the SNP discovery accessions. We show that more than 95% of the loci produce high quality genotypes and that the genotyping error rate for these is likely below 2%. We demonstrate that even among small numbers of samples (n = 10) from local populations over 84% of loci are polymorphic. We also tested the applicability of the array to other species in the genus and found that the number of polymorphic loci decreases rapidly with genetic distance, with the largest numbers detected in other species in section Tacamahaca. Finally, we provide evidence for the utility of the array to address evolutionary questions such as intraspecific studies of genetic differentiation, species assignment and the detection of natural hybrids. © 2013 Blackwell Publishing Ltd.

  12. A 48-plex autosomal SNP GenPlex™ assay for human individualization and relationship testing

    DEFF Research Database (Denmark)

    Tomas Mas, Carmen; Børsting, Claus; Morling, Niels

    2012-01-01

    SNPs are being increasingly used by forensic laboratories. Different platforms have been developed for SNP typing. We describe the GenPlex™ HID system protocol, a new SNP-typing platform developed by Applied Biosystems where 48 of the 52 SNPforID SNPs and amelogenin are included. The GenPlex™ HID...

  13. Performance of the SNPforID 52 SNP-plex assay in paternity testing

    DEFF Research Database (Denmark)

    Børsting, Claus; Sanchez, Juan Jose; Hansen, Hanna E

    2008-01-01

    (VNTRs). The typical PIs based on 15 STRs or seven VNTRs were 5-50 times higher than the typical PIs based on 52 SNPs. Six mutations in tandem repeats were detected among the randomly selected trios. In contrast, there was not found any mutations in the SNP loci. The results showed that the 52 SNP...

  14. Evaluation of the OvineSNP50 chip for use in four South African ...

    African Journals Online (AJOL)

    Relatively rapid and cost-effective genotyping using the OvineSNP50 chip holds great promise for the South African sheep industry and research partners. However, SNP ascertainment bias may influence inferences from the genotyping results of South African sheep breeds. Therefore, samples from Dorper, Namaqua ...

  15. Molecular markers: a potential resource for ginger genetic diversity studies.

    Science.gov (United States)

    Ismail, Nor Asiah; Rafii, M Y; Mahmud, T M M; Hanafi, M M; Miah, Gous

    2016-12-01

    Ginger is an economically important and valuable plant around the world. Ginger is used as a food, spice, condiment, medicine and ornament. There is available information on biochemical aspects of ginger, but few studies have been reported on its molecular aspects. The main objective of this review is to accumulate the available molecular marker information and its application in diverse ginger studies. This review article was prepared by combing material from published articles and our own research. Molecular markers allow the identification and characterization of plant genotypes through direct access to hereditary material. In crop species, molecular markers are applied in different aspects and are useful in breeding programs. In ginger, molecular markers are commonly used to identify genetic variation and classify the relatedness among varieties, accessions, and species. Consequently, it provides important input in determining resourceful management strategies for ginger improvement programs. Alternatively, a molecular marker could function as a harmonizing tool for documenting species. This review highlights the application of molecular markers (isozyme, RAPD, AFLP, SSR, ISSR and others such as RFLP, SCAR, NBS and SNP) in genetic diversity studies of ginger species. Some insights on the advantages of the markers are discussed. The detection of genetic variation among promising cultivars of ginger has significance for ginger improvement programs. This update of recent literature will help researchers and students select the appropriate molecular markers for ginger-related research.

  16. Identification of SNP barcode biomarkers for genes associated with facial emotion perception using particle swarm optimization algorithm.

    Science.gov (United States)

    Chuang, Li-Yeh; Lane, Hsien-Yuan; Lin, Yu-Da; Lin, Ming-Teng; Yang, Cheng-Hong; Chang, Hsueh-Wei

    2014-01-01

    Facial emotion perception (FEP) can affect social function. We previously reported that parts of five tested single-nucleotide polymorphisms (SNPs) in the MET and AKT1 genes may individually affect FEP performance. However, the effects of SNP-SNP interactions on FEP performance remain unclear. This study compared patients with high and low FEP performances (n = 89 and 93, respectively). A particle swarm optimization (PSO) algorithm was used to identify the best SNP barcodes (i.e., the SNP combinations and genotypes that revealed the largest differences between the high and low FEP groups). The analyses of individual SNPs showed no significant differences between the high and low FEP groups. However, comparisons of multiple SNP-SNP interactions involving different combinations of two to five SNPs showed that the best PSO-generated SNP barcodes were significantly associated with high FEP score. The analyses of the joint effects of the best SNP barcodes for two to five interacting SNPs also showed that the best SNP barcodes had significantly higher odds ratios (2.119 to 3.138; P < 0.05) compared to other SNP barcodes. In conclusion, the proposed PSO algorithm effectively identifies the best SNP barcodes that have the strongest associations with FEP performance. This study also proposes a computational methodology for analyzing complex SNP-SNP interactions in social cognition domains such as recognition of facial emotion.

  17. LS-SNP/PDB: annotated non-synonymous SNPs mapped to Protein Data Bank structures.

    Science.gov (United States)

    Ryan, Michael; Diekhans, Mark; Lien, Stephanie; Liu, Yun; Karchin, Rachel

    2009-06-01

    LS-SNP/PDB is a new WWW resource for genome-wide annotation of human non-synonymous (amino acid changing) SNPs. It serves high-quality protein graphics rendered with UCSF Chimera molecular visualization software. The system is kept up-to-date by an automated, high-throughput build pipeline that systematically maps human nsSNPs onto Protein Data Bank structures and annotates several biologically relevant features. LS-SNP/PDB is available at (http://ls-snp.icm.jhu.edu/ls-snp-pdb) and via links from protein data bank (PDB) biology and chemistry tabs, UCSC Genome Browser Gene Details and SNP Details pages and PharmGKB Gene Variants Downloads/Cross-References pages.

  18. Development of high-throughput SNP-based genotyping in Acacia auriculiformis x A. mangium hybrids using short-read transcriptome data

    Directory of Open Access Journals (Sweden)

    Wong Melissa ML

    2012-12-01

    Full Text Available Abstract Background Next Generation Sequencing has provided comprehensive, affordable and high-throughput DNA sequences for Single Nucleotide Polymorphism (SNP discovery in Acacia auriculiformis and Acacia mangium. Like other non-model species, SNP detection and genotyping in Acacia are challenging due to lack of genome sequences. The main objective of this study is to develop the first high-throughput SNP genotyping assay for linkage map construction of A. auriculiformis x A. mangium hybrids. Results We identified a total of 37,786 putative SNPs by aligning short read transcriptome data from four parents of two Acacia hybrid mapping populations using Bowtie against 7,839 de novo transcriptome contigs. Given a set of 10 validated SNPs from two lignin genes, our in silico SNP detection approach is highly accurate (100% compared to the traditional in vitro approach (44%. Further validation of 96 SNPs using Illumina GoldenGate Assay gave an overall assay success rate of 89.6% and conversion rate of 37.5%. We explored possible factors lowering assay success rate by predicting exon-intron boundaries and paralogous genes of Acacia contigs using Medicago truncatula genome as reference. This assessment revealed that presence of exon-intron boundary is the main cause (50% of assay failure. Subsequent SNPs filtering and improved assay design resulted in assay success and conversion rate of 92.4% and 57.4%, respectively based on 768 SNPs genotyping. Analysis of clustering patterns revealed that 27.6% of the assays were not reproducible and flanking sequence might play a role in determining cluster compression. In addition, we identified a total of 258 and 319 polymorphic SNPs in A. auriculiformis and A. mangium natural germplasms, respectively. Conclusion We have successfully discovered a large number of SNP markers in A. auriculiformis x A. mangium hybrids using next generation transcriptome sequencing. By using a reference genome from the most closely

  19. A customized pigmentation SNP array identifies a novel SNP associated with melanoma predisposition in the SLC45A2 gene.

    Directory of Open Access Journals (Sweden)

    Maider Ibarrola-Villava

    Full Text Available As the incidence of Malignant Melanoma (MM reflects an interaction between skin colour and UV exposure, variations in genes implicated in pigmentation and tanning response to UV may be associated with susceptibility to MM. In this study, 363 SNPs in 65 gene regions belonging to the pigmentation pathway have been successfully genotyped using a SNP array. Five hundred and ninety MM cases and 507 controls were analyzed in a discovery phase I. Ten candidate SNPs based on a p-value threshold of 0.01 were identified. Two of them, rs35414 (SLC45A2 and rs2069398 (SILV/CKD2, were statistically significant after conservative Bonferroni correction. The best six SNPs were further tested in an independent Spanish series (624 MM cases and 789 controls. A novel SNP located on the SLC45A2 gene (rs35414 was found to be significantly associated with melanoma in both phase I and phase II (P<0.0001. None of the other five SNPs were replicated in this second phase of the study. However, three SNPs in TYR, SILV/CDK2 and ADAMTS20 genes (rs17793678, rs2069398 and rs1510521 respectively had an overall p-value<0.05 when considering the whole DNA collection (1214 MM cases and 1296 controls. Both the SLC45A2 and the SILV/CDK2 variants behave as protective alleles, while the TYR and ADAMTS20 variants seem to function as risk alleles. Cumulative effects were detected when these four variants were considered together. Furthermore, individuals carrying two or more mutations in MC1R, a well-known low penetrance melanoma-predisposing gene, had a decreased MM risk if concurrently bearing the SLC45A2 protective variant. To our knowledge, this is the largest study on Spanish sporadic MM cases to date.

  20. Maximization of Markers Linked in Coupling for Tetraploid Potatoes via Monoparental Haploids

    Directory of Open Access Journals (Sweden)

    Annette M. Bartkiewicz

    2018-05-01

    Full Text Available Haploid potato populations derived from a single tetraploid donor constitute an efficient strategy to analyze markers segregating from a single donor genotype. Analysis of marker segregation in populations derived from crosses between polysomic tetraploids is complicated by a maximum of eight segregating alleles, multiple dosages of the markers and problems related to linkage analysis of marker segregation in repulsion. Here, we present data on two monoparental haploid populations generated by prickle pollination of two tetraploid cultivars with Solanum phureja and genotyped with the 12.8 k SolCAP single nucleotide polymorphism (SNP array. We show that in a population of monoparental haploids, the number of biallelic SNP markers segregating in linkage to loci from the tetraploid donor genotype is much larger than in putative crosses of this genotype to a diverse selection of 125 tetraploid cultivars. Although this strategy is more laborious than conventional breeding, the generation of haploid progeny for efficient marker analysis is straightforward if morphological markers and flow cytometry are utilized to select true haploid progeny. The level of introgressed fragments from S. phureja, the haploid inducer, is very low, supporting its suitability for genetic analysis. Mapping with single-dose markers allowed the analysis of quantitative trait loci (QTL for four phenotypic traits.

  1. MDM2 gene SNP309 T/G and p53 gene SNP72 G/C do not influence diffuse large B-cell non-Hodgkin lymphoma onset or survival in central European Caucasians

    Directory of Open Access Journals (Sweden)

    Landt Olfert

    2008-04-01

    Full Text Available Abstract Background SNP309 T/G (rs2279744 causes higher levels of MDM2, the most important negative regulator of the p53 tumor suppressor. SNP72 G/C (rs1042522 gives rise to a p53 protein with a greatly reduced capacity to induce apoptosis. Both polymorphisms have been implicated in cancer. The SNP309 G-allele has recently been reported to accelerate diffuse large B-cell lymphoma (DLBCL formation in pre-menopausal women and suggested to constitute a genetic basis for estrogen affecting human tumorigenesis. Here we asked whether SNP309 and SNP72 are associated with DLBCL in women and are correlated with age of onset, diagnosis, or patient's survival. Methods SNP309 and SNP72 were PCR-genotyped in a case-control study that included 512 controls and 311 patients diagnosed with aggressive NHL. Of these, 205 were diagnosed with DLBCL. Results The age of onset was similar in men and women. The control and patients group showed similar SNP309 and SNP72 genotype frequencies. Importantly and in contrast to the previous findings, similar genotype frequencies were observed in female patients diagnosed by 51 years of age and those diagnosed later. Specifically, 3/20 female DLBCL patients diagnosed by 51 years of age were homozygous for SNP309 G and 2/20 DLBCL females in that age group were homozygous for SNP72 C. Neither SNP309 nor SNP72 had a significant influence on event-free and overall survival in multivariate analyses. Conclusion In contrast to the previous study on Ashkenazi Jewish Caucasians, DLBCL in pre-menopausal women of central European Caucasian ethnicity was not associated with SNP309 G. Neither SNP309 nor SNP72 seem to be correlated with age of onset, diagnosis, or survival of patients.

  2. A SNP Harvester Analysis to Better Detect SNPs of CCDC158 Gene That Are Associated with Carcass Quality Traits in Hanwoo

    Directory of Open Access Journals (Sweden)

    Jea-Young Lee

    2013-06-01

    Full Text Available The purpose of this study was to investigate interaction effects of genes using a Harvester method. A sample of Korean cattle, Hanwoo (n = 476 was chosen from the National Livestock Research Institute of Korea that were sired by 50 Korean proven bulls. The steers were born between the spring of 1998 and the autumn of 2002 and reared under a progeny-testing program at the Daekwanryeong and Namwon branches of NLRI. The steers were slaughtered at approximately 24 months of age and carcass quality traits were measured. A SNP Harvester method was applied with a support vector machine (SVM to detect significant SNPs in the CCDC158 gene and interaction effects between the SNPs that were associated with average daily gains, cold carcass weight, longissimus dorsi muscle area, and marbling scores. The statistical significance of the major SNP combinations was evaluated with x2-statistics. The genotype combinations of three SNPs, g.34425+102 A>T(AA, g.4102636T>G(GT, and g.11614+19G>T(GG had a greater effect than the rest of SNP combinations, e.g. 0.82 vs. 0.75 kg, 343 vs. 314 kg, 80.4 vs 74.7 cm2, and 7.35 vs. 5.01, for the four respective traits (p<0.001. Also, the estimates were greater compared with single SNPs analyzed (the greatest estimates were 0.76 kg, 320 kg, 75.5 cm2, and 5.31, respectively. This result suggests that the SNP Harvester method is a good option when multiple SNPs and interaction effects are tested. The significant SNPs could be applied to improve meat quality of Hanwoo via marker-assisted selection.

  3. Heap: a highly sensitive and accurate SNP detection tool for low-coverage high-throughput sequencing data

    KAUST Repository

    Kobayashi, Masaaki; Ohyanagi, Hajime; Takanashi, Hideki; Asano, Satomi; Kudo, Toru; Kajiya-Kanegae, Hiromi; Nagano, Atsushi J.; Tainaka, Hitoshi; Tokunaga, Tsuyoshi; Sazuka, Takashi; Iwata, Hiroyoshi; Tsutsumi, Nobuhiro; Yano, Kentaro

    2017-01-01

    and GP depends on not only their mathematical models, but the quality and quantity of variants employed in the analysis. In NGS single nucleotide polymorphism (SNP) calling, conventional tools ideally require more reads for higher SNP sensitivity

  4. Forensic ancestry analysis with two capillary electrophoresis ancestry informative marker (AIM) panels

    DEFF Research Database (Denmark)

    Santos, C; Fondevila, M; Ballard, D

    2015-01-01

    that analyzes the genotype data alongside calculation of Bayes likelihood ratios. Exercise results indicated consistent genotyping performance from both tests, reaching a particularly high level of reliability for the Indel test. SNP genotyping gave 93.5% concordance (compared to the organizing laboratory...... relationship between input DNA and signal strength as each marker is detected with a single dye, so mixed DNA is more reliably detected. We report the results of a collaborative inter-laboratory exercise of 19 participants (15 from the EDNAP European DNA Profiling group) that assessed a 34-plex SNP test using...... the correct ancestry to the other samples using Snipper, with the exception of one laboratory with SNP miscalls that incorrectly assigned ancestry of two samples and did not obtain informative likelihood ratios for a third. Therefore, successful ancestry assignments were achieved by participants in 92 of 95...

  5. An Improved Consensus Linkage Map of Barley Based on Flow-Sorted Chromosomes and Single Nucleotide Polymorphism Markers

    Directory of Open Access Journals (Sweden)

    María Muñoz-Amatriaín

    2011-11-01

    Full Text Available Recent advances in high-throughput genotyping have made it easier to combine information from different mapping populations into consensus genetic maps, which provide increased marker density and genome coverage compared to individual maps. Previously, a single nucleotide polymorphism (SNP-based genotyping platform was developed and used to genotype 373 individuals in four barley ( L. mapping populations. This led to a 2943 SNP consensus genetic map with 975 unique positions. In this work, we add data from six additional populations and more individuals from one of the original populations to develop an improved consensus map from 1133 individuals. A stringent and systematic analysis of each of the 10 populations was performed to achieve uniformity. This involved reexamination of the four populations included in the previous map. As a consequence, we present a robust consensus genetic map that contains 2994 SNP loci mapped to 1163 unique positions. The map spans 1137.3 cM with an average density of one marker bin per 0.99 cM. A novel application of the genotyping platform for gene detection allowed the assignment of 2930 genes to flow-sorted chromosomes or arms, confirmed the position of 2545 SNP-mapped loci, added chromosome or arm allocations to an additional 370 SNP loci, and delineated pericentromeric regions for chromosomes 2H to 7H. Marker order has been improved and map resolution has been increased by almost 20%. These increased precision outcomes enable more optimized SNP selection for marker-assisted breeding and support association genetic analysis and map-based cloning. It will also improve the anchoring of DNA sequence scaffolds and the barley physical map to the genetic map.

  6. Alkali-developable silicone-based negative photoresist (SNP) for deep UV, electron beam, and X-ray lithographies

    International Nuclear Information System (INIS)

    Ban, Hiroshi; Tanaka, Akinobu; Kawai, Yoshio; Deguchi, Kimiyoshi

    1989-01-01

    A new silicone-based negative photoresist (SNP) developable with alkaline aqueous solutions is prepared. SNP composed of acetylated phenylsilsesquioxane oligomer and azidopyrene is applied to deep UV, electron beam (EB), and X-ray lithographies. SNP slightly swells in alkaline developers, thus exhibiting exceptionally high resolution characteristics for a negative resist. The resistance of SNP to oxygen reactive ion etching is approximately 30 times greater than that of conventional novolac resists. (author)

  7. Reduced SNP panels for genetic identification and introgression analysis in the dark honey bee (Apis mellifera mellifera.

    Directory of Open Access Journals (Sweden)

    Irene Muñoz

    Full Text Available Beekeeping activities, especially queen trading, have shaped the distribution of honey bee (Apis mellifera subspecies in Europe, and have resulted in extensive introductions of two eastern European C-lineage subspecies (A. m. ligustica and A. m. carnica into the native range of the M-lineage A. m. mellifera subspecies in Western Europe. As a consequence, replacement and gene flow between native and commercial populations have occurred at varying levels across western European populations. Genetic identification and introgression analysis using molecular markers is an important tool for management and conservation of honey bee subspecies. Previous studies have monitored introgression by using microsatellite, PCR-RFLP markers and most recently, high density assays using single nucleotide polymorphism (SNP markers. While the latter are almost prohibitively expensive, the information gained to date can be exploited to create a reduced panel containing the most ancestry-informative markers (AIMs for those purposes with very little loss of information. The objective of this study was to design reduced panels of AIMs to verify the origin of A. m. mellifera individuals and to provide accurate estimates of the level of C-lineage introgression into their genome. The discriminant power of the SNPs using a variety of metrics and approaches including the Weir & Cockerham's FST, an FST-based outlier test, Delta, informativeness (In, and PCA was evaluated. This study shows that reduced AIMs panels assign individuals to the correct origin and calculates the admixture level with a high degree of accuracy. These panels provide an essential tool in Europe for genetic stock identification and estimation of admixture levels which can assist management strategies and monitor honey bee conservation programs.

  8. Two combinatorial optimization problems for SNP discovery using base-specific cleavage and mass spectrometry.

    Science.gov (United States)

    Chen, Xin; Wu, Qiong; Sun, Ruimin; Zhang, Louxin

    2012-01-01

    The discovery of single-nucleotide polymorphisms (SNPs) has important implications in a variety of genetic studies on human diseases and biological functions. One valuable approach proposed for SNP discovery is based on base-specific cleavage and mass spectrometry. However, it is still very challenging to achieve the full potential of this SNP discovery approach. In this study, we formulate two new combinatorial optimization problems. While both problems are aimed at reconstructing the sample sequence that would attain the minimum number of SNPs, they search over different candidate sequence spaces. The first problem, denoted as SNP - MSP, limits its search to sequences whose in silico predicted mass spectra have all their signals contained in the measured mass spectra. In contrast, the second problem, denoted as SNP - MSQ, limits its search to sequences whose in silico predicted mass spectra instead contain all the signals of the measured mass spectra. We present an exact dynamic programming algorithm for solving the SNP - MSP problem and also show that the SNP - MSQ problem is NP-hard by a reduction from a restricted variation of the 3-partition problem. We believe that an efficient solution to either problem above could offer a seamless integration of information in four complementary base-specific cleavage reactions, thereby improving the capability of the underlying biotechnology for sensitive and accurate SNP discovery.

  9. Interference of Homologous Sequences on the SNP Study of CYP2A13 Gene

    Directory of Open Access Journals (Sweden)

    Qinghua ZHOU

    2010-02-01

    Full Text Available Background and objective It has been proven that cytochrome P450 enzyme 2A13 (CYP2A13 played an important role in the association between single nucleotide polymorphisms (SNP and human diseases. Cytochrome P450 enzymes are a group of isoenzymes, whose sequence homology may interfere with the study for SNP. The aim of this study is to explore the interference on the SNP study of CYP2A13 caused by homologous sequences. Methods Taqman probe was applied to detect distribution of rs8192789 sites in 573 subjects, and BLAST method was used to analyze the amplified sequences. Partial sequences of CYP2A13 were emplified by PCR from 60 cases. The emplified sequences were TA cloned and sequenced. Results For rs8192789 loci in 573 cases, only 3 cases were TT, while the rest were CT heterozygotes, which was caused by homologous sequences. There are a large number of overlapping peaks in identical sequences of 60 cases, and the SNP of 101 amino acid site reported in the SNP database is not found. The cloned sequences are 247 bp, 235 bp fragments. Conclusion The homologous sequences may interfere the study for SNP of CYP2A13, and some SNP may not exist.

  10. Genetic Polymorphism of MDM2 SNP309 in Patients with Helicobacter Pylori-Associated Gastritis.

    Science.gov (United States)

    Tongtawee, Taweesak; Dechsukhum, Chavaboon; Leeanansaksiri, Wilairat; Kaewpitoon, Soraya; Kaewpitoon, Natthawut; Loyd, Ryan A; Matrakool, Likit; Panpimanmas, Sukij

    2015-01-01

    Helicobacter pylori plays an important role in gastric cancer, which has a relatively low inciduence in Thailand. MDM2 is a major negative regulator of p53, the key tumor suppressor involved in tumorigenesis of the majority of human cancers. Whether its expression might explain the relative lack of gastric cancer in Thailand was assessed here. This single-center study was conducted in the northeast region of Thailand. Gastric mucosa from 100 patients with Helicobacter pylori associated gastritis was analyzed for MDM2 SNP309 using real-time PCR hybridization (light-cycler) probes. In the total 100 Helicobacter pylori associated gastritis cases the incidence of SNP 309 T/T homozygous was 78 % with SNP309 G/T heterozygous found in 19% and SNP309 G/G homozygous in 3%. The result show SNP 309 T/T and SNP 309 G/T to be rather common in the Thai population. Our study indicates that the MDM2 SNP309 G/G homozygous genotype might be a risk factor for gastric cancer in Thailand and the fact that it is infrequent could explain to some extent the low incidence of gastric cancer in the Thai population.

  11. GenomeRunner web server: regulatory similarity and differences define the functional impact of SNP sets.

    Science.gov (United States)

    Dozmorov, Mikhail G; Cara, Lukas R; Giles, Cory B; Wren, Jonathan D

    2016-08-01

    The growing amount of regulatory data from the ENCODE, Roadmap Epigenomics and other consortia provides a wealth of opportunities to investigate the functional impact of single nucleotide polymorphisms (SNPs). Yet, given the large number of regulatory datasets, researchers are posed with a challenge of how to efficiently utilize them to interpret the functional impact of SNP sets. We developed the GenomeRunner web server to automate systematic statistical analysis of SNP sets within a regulatory context. Besides defining the functional impact of SNP sets, GenomeRunner implements novel regulatory similarity/differential analyses, and cell type-specific regulatory enrichment analysis. Validated against literature- and disease ontology-based approaches, analysis of 39 disease/trait-associated SNP sets demonstrated that the functional impact of SNP sets corresponds to known disease relationships. We identified a group of autoimmune diseases with SNPs distinctly enriched in the enhancers of T helper cell subpopulations, and demonstrated relevant cell type-specificity of the functional impact of other SNP sets. In summary, we show how systematic analysis of genomic data within a regulatory context can help interpreting the functional impact of SNP sets. GenomeRunner web server is freely available at http://www.integrativegenomics.org/ mikhail.dozmorov@gmail.com Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  12. Sequential sentinel SNP Regional Association Plots (SSS-RAP): an approach for testing independence of SNP association signals using meta-analysis data.

    Science.gov (United States)

    Zheng, Jie; Gaunt, Tom R; Day, Ian N M

    2013-01-01

    Genome-Wide Association Studies (GWAS) frequently incorporate meta-analysis within their framework. However, conditional analysis of individual-level data, which is an established approach for fine mapping of causal sites, is often precluded where only group-level summary data are available for analysis. Here, we present a numerical and graphical approach, "sequential sentinel SNP regional association plot" (SSS-RAP), which estimates regression coefficients (beta) with their standard errors using the meta-analysis summary results directly. Under an additive model, typical for genes with small effect, the effect for a sentinel SNP can be transformed to the predicted effect for a possibly dependent SNP through a 2×2 2-SNP haplotypes table. The approach assumes Hardy-Weinberg equilibrium for test SNPs. SSS-RAP is available as a Web-tool (http://apps.biocompute.org.uk/sssrap/sssrap.cgi). To develop and illustrate SSS-RAP we analyzed lipid and ECG traits data from the British Women's Heart and Health Study (BWHHS), evaluated a meta-analysis for ECG trait and presented several simulations. We compared results with existing approaches such as model selection methods and conditional analysis. Generally findings were consistent. SSS-RAP represents a tool for testing independence of SNP association signals using meta-analysis data, and is also a convenient approach based on biological principles for fine mapping in group level summary data. © 2012 Blackwell Publishing Ltd/University College London.

  13. MDM2 promoter SNP344T>A (rs1196333 status does not affect cancer risk.

    Directory of Open Access Journals (Sweden)

    Stian Knappskog

    Full Text Available The MDM2 proto-oncogene plays a key role in central cellular processes like growth control and apoptosis, and the gene locus is frequently amplified in sarcomas. Two polymorphisms located in the MDM2 promoter P2 have been shown to affect cancer risk. One of these polymorphisms (SNP309T>G; rs2279744 facilitates Sp1 transcription factor binding to the promoter and is associated with increased cancer risk. In contrast, SNP285G>C (rs117039649, located 24 bp upstream of rs2279744, and in complete linkage disequilibrium with the SNP309G allele, reduces Sp1 recruitment and lowers cancer risk. Thus, fine tuning of MDM2 expression has proven to be of significant importance with respect to tumorigenesis. We assessed the potential functional effects of a third MDM2 promoter P2 polymorphism (SNP344T>A; rs1196333 located on the SNP309T allele. While in silico analyses indicated SNP344A to modulate TFAP2A, SPIB and AP1 transcription factor binding, we found no effect of SNP344 status on MDM2 expression levels. Assessing the frequency of SNP344A in healthy Caucasians (n = 2,954 and patients suffering from ovarian (n = 1,927, breast (n = 1,271, endometrial (n = 895 or prostatic cancer (n = 641, we detected no significant difference in the distribution of this polymorphism between any of these cancer forms and healthy controls (6.1% in healthy controls, and 4.9%, 5.0%, 5.4% and 7.2% in the cancer groups, respectively. In conclusion, our findings provide no evidence indicating that SNP344A may affect MDM2 transcription or cancer risk.

  14. SNP-based typing: a useful tool to study Bordetella pertussis populations.

    Directory of Open Access Journals (Sweden)

    Marjolein van Gent

    Full Text Available To monitor changes in Bordetella pertussis populations, mainly two typing methods are used; Pulsed-Field Gel Electrophoresis (PFGE and Multiple-Locus Variable-Number Tandem Repeat Analysis (MLVA. In this study, a single nucleotide polymorphism (SNP typing method, based on 87 SNPs, was developed and compared with PFGE and MLVA. The discriminatory indices of SNP typing, PFGE and MLVA were found to be 0.85, 0.95 and 0.83, respectively. Phylogenetic analysis, using SNP typing as Gold Standard, revealed false homoplasies in the PFGE and MLVA trees. Further, in contrast to the SNP-based tree, the PFGE- and MLVA-based trees did not reveal a positive correlation between root-to-tip distance and the isolation year of strains. Thus PFGE and MLVA do not allow an estimation of the relative age of the selected strains. In conclusion, SNP typing was found to be phylogenetically more informative than PFGE and more discriminative than MLVA. Further, in contrast to PFGE, it is readily standardized allowing interlaboratory comparisons. We applied SNP typing to study strains with a novel allele for the pertussis toxin promoter, ptxP3, which have a worldwide distribution and which have replaced the resident ptxP1 strains in the last 20 years. Previously, we showed that ptxP3 strains showed increased pertussis toxin expression and that their emergence was associated with increased notification in The Netherlands. SNP typing showed that the ptxP3 strains isolated in the Americas, Asia, Australia and Europe formed a monophyletic branch which recently diverged from ptxP1 strains. Two predominant ptxP3 SNP types were identified which spread worldwide. The widespread use of SNP typing will enhance our understanding of the evolution and global epidemiology of B. pertussis.

  15. SNP-Based Typing: A Useful Tool to Study Bordetella pertussis Populations

    Science.gov (United States)

    van der Heide, Han G. J.; Heuvelman, Kees J.; Kallonen, Teemu; He, Qiushui; Mertsola, Jussi; Advani, Abdolreza; Hallander, Hans O.; Janssens, Koen; Hermans, Peter W.; Mooi, Frits R.

    2011-01-01

    To monitor changes in Bordetella pertussis populations, mainly two typing methods are used; Pulsed-Field Gel Electrophoresis (PFGE) and Multiple-Locus Variable-Number Tandem Repeat Analysis (MLVA). In this study, a single nucleotide polymorphism (SNP) typing method, based on 87 SNPs, was developed and compared with PFGE and MLVA. The discriminatory indices of SNP typing, PFGE and MLVA were found to be 0.85, 0.95 and 0.83, respectively. Phylogenetic analysis, using SNP typing as Gold Standard, revealed false homoplasies in the PFGE and MLVA trees. Further, in contrast to the SNP-based tree, the PFGE- and MLVA-based trees did not reveal a positive correlation between root-to-tip distance and the isolation year of strains. Thus PFGE and MLVA do not allow an estimation of the relative age of the selected strains. In conclusion, SNP typing was found to be phylogenetically more informative than PFGE and more discriminative than MLVA. Further, in contrast to PFGE, it is readily standardized allowing interlaboratory comparisons. We applied SNP typing to study strains with a novel allele for the pertussis toxin promoter, ptxP3, which have a worldwide distribution and which have replaced the resident ptxP1 strains in the last 20 years. Previously, we showed that ptxP3 strains showed increased pertussis toxin expression and that their emergence was associated with increased notification in the Netherlands. SNP typing showed that the ptxP3 strains isolated in the Americas, Asia, Australia and Europe formed a monophyletic branch which recently diverged from ptxP1 strains. Two predominant ptxP3 SNP types were identified which spread worldwide. The widespread use of SNP typing will enhance our understanding of the evolution and global epidemiology of B. pertussis. PMID:21647370

  16. Drop-out probabilities of IrisPlex SNP alleles

    DEFF Research Database (Denmark)

    Andersen, Jeppe Dyrberg; Tvedebrink, Torben; Mogensen, Helle Smidt

    2013-01-01

    In certain crime cases, information about a perpetrator's phenotype, including eye colour, may be a valuable tool if no DNA profile of any suspect or individual in the DNA database matches the DNA profile found at the crime scene. Often, the available DNA material is sparse and allelic drop-out...... of true alleles is possible. As part of the validation of the IrisPlex assay in our ISO17025 accredited, forensic genetic laboratory, we estimated the probability of drop-out of specific SNP alleles using 29 and 30 PCR cycles and 25, 50 and 100 Single Base Extension (SBE) cycles. We observed no drop-out...... when the amount of DNA was greater than 125 pg for 29 cycles of PCR and greater than 62 pg for 30 cycles of PCR. With the use of a logistic regression model, we estimated the allele specific probability of drop-out in heterozygote systems based on the signal strength of the observed allele...

  17. In-silico single nucleotide polymorphisms (SNP) mining of Sorghum ...

    African Journals Online (AJOL)

    Single nucleotide polymorphisms (SNPs) may be considered the ultimate genetic markers as they represent the finest resolution of a DNA sequence (a single nucleotide), and are generally abundant in populations with a low mutation rate. SNPs are important tools in studying complex genetic traits and genome evolution.

  18. Introgression Browser: High throughput whole-genome SNP visualization

    NARCIS (Netherlands)

    Aflitos, S.A.; Sanchez Perez, G.F.; Ridder, de D.; Fransz, P.; Schranz, M.E.; Jong, de J.H.S.G.M.; Peters, S.A.

    2015-01-01

    Breeding by introgressive hybridization is a pivotal strategy to broaden the genetic basis of crops. Usually, the desired traits are monitored in consecutive crossing generations by marker-assisted selection, but their analyses fail in chromosome regions where crossover recombinants are rare or not

  19. Introgression browser: high-throughput whole-genome SNP visualization

    NARCIS (Netherlands)

    Alves Aflitos, S.; Sanchez-Perez, G.; de Ridder, D.; Fransz, P.; Schranz, M.E.; de Jong, H.; Peters, S.A.

    2015-01-01

    Breeding by introgressive hybridization is a pivotal strategy to broaden the genetic basis of crops. Usually, the desired traits are monitored in consecutive crossing generations by marker-assisted selection, but their analyses fail in chromosome regions where crossover recombinants are rare or not

  20. Genome-wide linkage analysis of QTL for growth and body composition employing the PorcineSNP60 BeadChip

    Directory of Open Access Journals (Sweden)

    Fernández Ana I

    2012-05-01

    Full Text Available Abstract Background The traditional strategy to map QTL is to use linkage analysis employing a limited number of markers. These analyses report wide QTL confidence intervals, making very difficult to identify the gene and polymorphisms underlying the QTL effects. The arrival of genome-wide panels of SNPs makes available thousands of markers increasing the information content and therefore the likelihood of detecting and fine mapping QTL regions. The aims of the current study are to confirm previous QTL regions for growth and body composition traits in different generations of an Iberian x Landrace intercross (IBMAP and especially identify new ones with narrow confidence intervals by employing the PorcineSNP60 BeadChip in linkage analyses. Results Three generations (F3, Backcross 1 and Backcross 2 of the IBMAP and their related animals were genotyped with PorcineSNP60 BeadChip. A total of 8,417 SNPs equidistantly distributed across autosomes were selected after filtering by quality, position and frequency to perform the QTL scan. The joint and separate analyses of the different IBMAP generations allowed confirming QTL regions previously identified in chromosomes 4 and 6 as well as new ones mainly for backfat thickness in chromosomes 4, 5, 11, 14 and 17 and shoulder weight in chromosomes 1, 2, 9 and 13; and many other to the chromosome-wide signification level. In addition, most of the detected QTLs displayed narrow confidence intervals, making easier the selection of positional candidate genes. Conclusions The use of higher density of markers has allowed to confirm results obtained in previous QTL scans carried out with microsatellites. Moreover several new QTL regions have been now identified in regions probably not covered by markers in previous scans, most of these QTLs displayed narrow confidence intervals. Finally, prominent putative biological and positional candidate genes underlying those QTL effects are listed based on recent porcine

  1. Transcriptome analysis and SNP development can resolve population differentiation of Streblospio benedicti, a developmentally dimorphic marine annelid.

    Directory of Open Access Journals (Sweden)

    Christina Zakas

    Full Text Available Next-generation sequencing technology is now frequently being used to develop genomic tools for non-model organisms, which are generally important for advancing studies of evolutionary ecology. One such species, the marine annelid Streblospio benedicti, is an ideal system to study the evolutionary consequences of larval life history mode because the species displays a rare offspring dimorphism termed poecilogony, where females can produce either many small offspring or a few large ones. To further develop S. benedicti as a model system for studies of life history evolution, we apply 454 sequencing to characterize the transcriptome for embryos, larvae, and juveniles of this species, for which no genomic resources are currently available. Here we performed a de novo alignment of 336,715 reads generated by a quarter GS-FLX (Roche 454 run, which produced 7,222 contigs. We developed a novel approach for evaluating the site frequency spectrum across the transcriptome to identify potential signatures of selection. We also developed 84 novel single nucleotide polymorphism (SNP markers for this species that are used to distinguish coastal populations of S. benedicti. We validated the SNPs by genotyping individuals of different developmental modes using the BeadXPress Golden Gate assay (Illumina. This allowed us to evaluate markers that may be associated with life-history mode.

  2. Typing of 48 autosomal SNPs and amelogenin with GenPlex SNP genotyping system in forensic genetics

    DEFF Research Database (Denmark)

    Tomas Mas, Carmen; Stangegaard, Michael; Børsting, Claus

    2008-01-01

    , Somalia and Greenland were investigated with GenPlex using a Biomek 3000 (Beckman Coulter) robot. The results were compared to results obtained with an ISO 17025 accredited SNP typing assay based on single base extension (SBE). With the GenPlex SNP genotyping system, full SNP profiles were obtained in 97.......6% of the investigations. Perfect concordance was obtained in duplicate investigations and the SNP genotypes obtained with the GenPlex system were concordant with those of the accredited SBE based SNP typing system except for one result in rs901398 in one of 286 individuals most likely due to a mutation 6 bp downstream...

  3. Genome-wide joint meta-analysis of SNP and SNP-by-smoking interaction identifies novel loci for pulmonary function.

    Directory of Open Access Journals (Sweden)

    Dana B Hancock

    Full Text Available Genome-wide association studies have identified numerous genetic loci for spirometic measures of pulmonary function, forced expiratory volume in one second (FEV(1, and its ratio to forced vital capacity (FEV(1/FVC. Given that cigarette smoking adversely affects pulmonary function, we conducted genome-wide joint meta-analyses (JMA of single nucleotide polymorphism (SNP and SNP-by-smoking (ever-smoking or pack-years associations on FEV(1 and FEV(1/FVC across 19 studies (total N = 50,047. We identified three novel loci not previously associated with pulmonary function. SNPs in or near DNER (smallest P(JMA = 5.00×10(-11, HLA-DQB1 and HLA-DQA2 (smallest P(JMA = 4.35×10(-9, and KCNJ2 and SOX9 (smallest P(JMA = 1.28×10(-8 were associated with FEV(1/FVC or FEV(1 in meta-analysis models including SNP main effects, smoking main effects, and SNP-by-smoking (ever-smoking or pack-years interaction. The HLA region has been widely implicated for autoimmune and lung phenotypes, unlike the other novel loci, which have not been widely implicated. We evaluated DNER, KCNJ2, and SOX9 and found them to be expressed in human lung tissue. DNER and SOX9 further showed evidence of differential expression in human airway epithelium in smokers compared to non-smokers. Our findings demonstrated that joint testing of SNP and SNP-by-environment interaction identified novel loci associated with complex traits that are missed when considering only the genetic main effects.

  4. Forensic genetic SNP typing of low-template DNA and highly degraded DNA from crime case samples.

    Science.gov (United States)

    Børsting, Claus; Mogensen, Helle Smidt; Morling, Niels

    2013-05-01

    Heterozygote imbalances leading to allele drop-outs and disproportionally large stutters leading to allele drop-ins are known stochastic phenomena related to STR typing of low-template DNA (LtDNA). The large stutters and the many drop-ins in typical STR stutter positions are artifacts from the PCR amplification of tandem repeats. These artifacts may be avoided by typing bi-allelic markers instead of STRs. In this work, the SNPforID multiplex assay was used to type LtDNA. A sensitized SNP typing protocol was introduced, that increased signal strengths without increasing noise and without affecting the heterozygote balance. Allele drop-ins were only observed in experiments with 25 pg of DNA and not in experiments with 50 and 100 pg of DNA. The allele drop-in rate in the 25 pg experiments was 0.06% or 100 times lower than what was previously reported for STR typing of LtDNA. A composite model and two different consensus models were used to interpret the SNP data. Correct profiles with 42-49 SNPs were generated from the 50 and 100 pg experiments, whereas a few incorrect genotypes were included in the generated profiles from the 25 pg experiments. With the strict consensus model, between 35 and 48 SNPs were correctly typed in the 25 pg experiments and only one allele drop-out (error rate: 0.07%) was observed in the consensus profiles. A total of 28 crime case samples were selected for typing with the sensitized SNPforID protocol. The samples were previously typed with old STR kits during the crime case investigation and only partial profiles (0-6 STRs) were obtained. Eleven of the samples could not be quantified with the Quantifiler™ Human DNA Quantification kit because of partial or complete inhibition of the PCR. For eight of these samples, SNP typing was only possible when the buffer and DNA polymerase used in the original protocol was replaced with the AmpFℓSTR(®) SEfiler Plus™ Master Mix, which was developed specifically for challenging forensic samples. All

  5. Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species.

    Directory of Open Access Journals (Sweden)

    Brant K Peterson

    Full Text Available The ability to efficiently and accurately determine genotypes is a keystone technology in modern genetics, crucial to studies ranging from clinical diagnostics, to genotype-phenotype association, to reconstruction of ancestry and the detection of selection. To date, high capacity, low cost genotyping has been largely achieved via "SNP chip" microarray-based platforms which require substantial prior knowledge of both genome sequence and variability, and once designed are suitable only for those targeted variable nucleotide sites. This method introduces substantial ascertainment bias and inherently precludes detection of rare or population-specific variants, a major source of information for both population history and genotype-phenotype association. Recent developments in reduced-representation genome sequencing experiments on massively parallel sequencers (commonly referred to as RAD-tag or RADseq have brought direct sequencing to the problem of population genotyping, but increased cost and procedural and analytical complexity have limited their widespread adoption. Here, we describe a complete laboratory protocol, including a custom combinatorial indexing method, and accompanying software tools to facilitate genotyping across large numbers (hundreds or more of individuals for a range of markers (hundreds to hundreds of thousands. Our method requires no prior genomic knowledge and achieves per-site and per-individual costs below that of current SNP chip technology, while requiring similar hands-on time investment, comparable amounts of input DNA, and downstream analysis times on the order of hours. Finally, we provide empirical results from the application of this method to both genotyping in a laboratory cross and in wild populations. Because of its flexibility, this modified RADseq approach promises to be applicable to a diversity of biological questions in a wide range of organisms.

  6. Mapping Late Leaf Spot Resistance in Peanut (Arachis hypogaea Using QTL-seq Reveals Markers for Marker-Assisted Selection

    Directory of Open Access Journals (Sweden)

    Josh Clevenger

    2018-02-01

    Full Text Available Late leaf spot (LLS; Cercosporidium personatum is a major fungal disease of cultivated peanut (Arachis hypogaea. A recombinant inbred line population segregating for quantitative field resistance was used to identify quantitative trait loci (QTL using QTL-seq. High rates of false positive SNP calls using established methods in this allotetraploid crop obscured significant QTLs. To resolve this problem, robust parental SNPs were first identified using polyploid-specific SNP identification pipelines, leading to discovery of significant QTLs for LLS resistance. These QTLs were confirmed over 4 years of field data. Selection with markers linked to these QTLs resulted in a significant increase in resistance, showing that these markers can be immediately applied in breeding programs. This study demonstrates that QTL-seq can be used to rapidly identify QTLs controlling highly quantitative traits in polyploid crops with complex genomes. Markers identified can then be deployed in breeding programs, increasing the efficiency of selection using molecular tools.Key Message: Field resistance to late leaf spot is a quantitative trait controlled by many QTLs. Using polyploid-specific methods, QTL-seq is faster and more cost effective than QTL mapping.

  7. Polygenic analysis of genome-wide SNP data identifies common variants on allergic rhinitis

    DEFF Research Database (Denmark)

    Mohammadnejad, Afsaneh; Brasch-Andersen, Charlotte; Haagerup, Annette

    Background: Allergic Rhinitis (AR) is a complex disorder that affects many people around the world. There is a high genetic contribution to the development of the AR, as twins and family studies have estimated heritability of more than 33%. Due to the complex nature of the disease, single SNP...... analysis has limited power in identifying the genetic variations for AR. We combined genome-wide association analysis (GWAS) with polygenic risk score (PRS) in exploring the genetic basis underlying the disease. Methods: We collected clinical data on 631 Danish subjects with AR cases consisting of 434...... sibling pairs and unrelated individuals and control subjects of 197 unrelated individuals. SNP genotyping was done by Affymetrix Genome-Wide Human SNP Array 5.0. SNP imputation was performed using "IMPUTE2". Using additive effect model, GWAS was conducted in discovery sample, the genotypes...

  8. Association test based on SNP set: logistic kernel machine based test vs. principal component analysis.

    Directory of Open Access Journals (Sweden)

    Yang Zhao

    Full Text Available GWAS has facilitated greatly the discovery of risk SNPs associated with complex diseases. Traditional methods analyze SNP individually and are limited by low power and reproducibility since correction for multiple comparisons is necessary. Several methods have been proposed based on grouping SNPs into SNP sets using biological knowledge and/or genomic features. In this article, we compare the linear kernel machine based test (LKM and principal components analysis based approach (PCA using simulated datasets under the scenarios of 0 to 3 causal SNPs, as well as simple and complex linkage disequilibrium (LD structures of the simulated regions. Our simulation study demonstrates that both LKM and PCA can control the type I error at the significance level of 0.05. If the causal SNP is in strong LD with the genotyped SNPs, both the PCA with a small number of principal components (PCs and the LKM with kernel of linear or identical-by-state function are valid tests. However, if the LD structure is complex, such as several LD blocks in the SNP set, or when the causal SNP is not in the LD block in which most of the genotyped SNPs reside, more PCs should be included to capture the information of the causal SNP. Simulation studies also demonstrate the ability of LKM and PCA to combine information from multiple causal SNPs and to provide increased power over individual SNP analysis. We also apply LKM and PCA to analyze two SNP sets extracted from an actual GWAS dataset on non-small cell lung cancer.

  9. Direct inference of SNP heterozygosity rates and resolution of LOH detection.

    Directory of Open Access Journals (Sweden)

    Xiaohong Li

    2007-11-01

    Full Text Available Single nucleotide polymorphisms (SNPs have been increasingly utilized to investigate somatic genetic abnormalities in premalignancy and cancer. LOH is a common alteration observed during cancer development, and SNP assays have been used to identify LOH at specific chromosomal regions. The design of such studies requires consideration of the resolution for detecting LOH throughout the genome and identification of the number and location of SNPs required to detect genetic alterations in specific genomic regions. Our study evaluated SNP distribution patterns and used probability models, Monte Carlo simulation, and real human subject genotype data to investigate the relationships between the number of SNPs, SNP HET rates, and the sensitivity (resolution for detecting LOH. We report that variances of SNP heterozygosity rate in dbSNP are high for a large proportion of SNPs. Two statistical methods proposed for directly inferring SNP heterozygosity rates require much smaller sample sizes (intermediate sizes and are feasible for practical use in SNP selection or verification. Using HapMap data, we showed that a region of LOH greater than 200 kb can be reliably detected, with losses smaller than 50 kb having a substantially lower detection probability when using all SNPs currently in the HapMap database. Higher densities of SNPs may exist in certain local chromosomal regions that provide some opportunities for reliably detecting LOH of segment sizes smaller than 50 kb. These results suggest that the interpretation of the results from genome-wide scans for LOH using commercial arrays need to consider the relationships among inter-SNP distance, detection probability, and sample size for a specific study. New experimental designs for LOH studies would also benefit from considering the power of detection and sample sizes required to accomplish the proposed aims.

  10. A Commentary on Pitfalls of Predicting Complex Traits From SNP's

    DEFF Research Database (Denmark)

    de los Campos, Gustavo; Sorensen, Daniel

    2013-01-01

    As stated by Wray and co-authors1, knowing the proportion of variance of a trait that is explained by regression on markers in the population (h2M) is relevant because, in principle, h2M represents the maximum prediction accuracy (R2TST) that is achievable in testing (TST) data if marker effects...... of h2M (Ref. 5), conseqeuenty, it is not obvious that R2TST can achieve values equal to the finite sample estimate of h2G-BLUP. In a recent article5, we studied the R2TST of G-BLUP and its relationship with h2G-BLUP. We show analytically that mis-specification of the training–testing (TRN–TST) genomic...

  11. Tipping the Proteome with Gene-Based Vaccines: Weighing in on the Role of Nano materials

    International Nuclear Information System (INIS)

    Flores, K.J.; Craig, M.; Smith, J.J.; DeLong, R.K.; Wanekaya, A.; Dong, L.

    2012-01-01

    Since the first generation of DNA vaccines was introduced in 1988, remarkable improvements have been made to improve their efficacy and immunogenicity. Although human clinical trials have shown that delivery of DNA vaccines is well tolerated and safe, the potency of these vaccines in humans is somewhat less than optimal. The development of a gene-based vaccine that was effective enough to be approved for clinical use in humans would be one of, if not the most important, advance in vaccines to date. This paper highlights the literature relating to gene-based vaccines, specifically DNA vaccines, and suggests possible approaches to boost their performance. In addition, we explore the idea that combining RNA and nano materials may hold the key to successful gene-based vaccines for prevention and treatment of disease

  12. Development and implementation of a highly-multiplexed SNP array for genetic mapping in maritime pine and comparative mapping with loblolly pine

    Directory of Open Access Journals (Sweden)

    Garnier-Géré Pauline

    2011-07-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait., the main conifer used for commercial plantation in southwestern Europe. Results We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels and from Sanger-derived Expressed Sequenced Tags assembled into a unigene set (829 in silico SNPs/Indels. Offspring from three-generation outbred (G2 and inbred (F2 pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map, made it possible to align the 12 linkage groups of both species. Conclusions Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome. This SNP-array will be extended thanks to recent sequencing effort using

  13. Association of an SNP in a novel DREB2-like gene SiDREB2 with stress tolerance in foxtail millet [Setaria italica (L.)].

    Science.gov (United States)

    Lata, Charu; Bhutty, Sarita; Bahadur, Ranjit Prasad; Majee, Manoj; Prasad, Manoj

    2011-06-01

    The DREB genes code for important plant transcription factors involved in the abiotic stress response and signal transduction. Characterization of DREB genes and development of functional markers for effective alleles is important for marker-assisted selection in foxtail millet. Here the characterization of a cDNA (SiDREB2) encoding a putative dehydration-responsive element-binding protein 2 from foxtail millet and the development of an allele-specific marker (ASM) for dehydration tolerance is reported. A cDNA clone (GenBank accession no. GT090998) coding for a putative DREB2 protein was isolated as a differentially expressed gene from a 6 h dehydration stress SSH library. A 5' RACE (rapid amplification of cDNA ends) was carried out to obtain the full-length cDNA, and sequence analysis showed that SiDREB2 encoded a polypeptide of 234 amino acids with a predicted mol. wt of 25.72 kDa and a theoretical pI of 5.14. A theoretical model of the tertiary structure shows that it has a highly conserved GCC-box-binding N-terminal domain, and an acidic C-terminus that acts as an activation domain for transcription. Based on its similarity to AP2 domains, SiDREB2 was classified into the A-2 subgroup of the DREB subfamily. Quantitative real-time PCR analysis showed significant up-regulation of SiDREB2 by dehydration (polyethylene glycol) and salinity (NaCl), while its expression was less affected by other stresses. A synonymous single nucleotide polymorphism (SNP) associated with dehydration tolerance was detected at the 558th base pair (an A/G transition) in the SiDREB2 gene in a core set of 45 foxtail millet accessions used. Based on the identified SNP, three primers were designed to develop an ASM for dehydration tolerance. The ASM produced a 261 bp fragment in all the tolerant accessions and produced no amplification in the sensitive accessions. The use of this ASM might be faster, cheaper, and more reproducible than other SNP genotyping methods, and thus will enable

  14. Electrochemical Li Topotactic Reaction in Layered SnP3 for Superior Li-Ion Batteries

    Science.gov (United States)

    Park, Jae-Wan; Park, Cheol-Min

    2016-10-01

    The development of new anode materials having high electrochemical performances and interesting reaction mechanisms is highly required to satisfy the need for long-lasting mobile electronic devices and electric vehicles. Here, we report a layer crystalline structured SnP3 and its unique electrochemical behaviors with Li. The SnP3 was simply synthesized through modification of Sn crystallography by combination with P and its potential as an anode material for LIBs was investigated. During Li insertion reaction, the SnP3 anode showed an interesting two-step electrochemical reaction mechanism comprised of a topotactic transition (0.7-2.0 V) and a conversion (0.0-2.0 V) reaction. When the SnP3-based composite electrode was tested within the topotactic reaction region (0.7-2.0 V) between SnP3 and LixSnP3 (x ≤ 4), it showed excellent electrochemical properties, such as a high volumetric capacity (1st discharge/charge capacity was 840/663 mA h cm-3) with a high initial coulombic efficiency, stable cycle behavior (636 mA h cm-3 over 100 cycles), and fast rate capability (550 mA h cm-3 at 3C). This layered SnP3 anode will be applicable to a new anode material for rechargeable LIBs.

  15. Involvement of Sodium Nitroprusside (SNP in the Mechanism That Delays Stem Bending of Different Gerbera Cultivars

    Directory of Open Access Journals (Sweden)

    Aung H. Naing

    2017-11-01

    Full Text Available Longevity of cut flowers of many gerbera cultivars (Gerbera jamesonii is typically short because of stem bending; hence, stem bending that occurs during the early vase life period is a major problem in gerbera. Here, we investigated the effects of sodium nitroprusside (SNP on the delay of stem bending in the gerbera cultivars, Alliance, Rosalin, and Bintang, by examining relative fresh weight, bacterial density in the vase solution, transcriptional analysis of a lignin biosynthesis gene, antioxidant activity, and xylem blockage. All three gerbera cultivars responded to SNP by delaying stem bending, compared to the controls; however, the responses were dose- and cultivar-dependent. Among the treatments, SNP at 20 mg L-1 was the best to delay stem bending in Alliance, while dosages of 10 and 5 mg L-1 were the best for Rosalin and Bintang, respectively. However, stem bending in Alliance and Rosalin was faster than in Bintang, indicating a discrepancy influenced by genotype. According to our analysis of the role of SNP in the delay of stem bending, the results revealed that SNP treatment inhibited bacterial growth and xylem blockage, enhanced expression levels of a lignin biosynthesis gene, and maintained antioxidant activities. Therefore, it is suggested that the cause of stem bending is associated with the above-mentioned parameters and SNP is involved in the mechanism that delays stem bending in the different gerbera cultivars.

  16. Interest in genomic SNP testing for prostate cancer risk: a pilot survey.

    Science.gov (United States)

    Hall, Michael J; Ruth, Karen J; Chen, David Yt; Gross, Laura M; Giri, Veda N

    2015-01-01

    Advancements in genomic testing have led to the identification of single nucleotide polymorphisms (SNPs) associated with prostate cancer. The clinical utility of SNP tests to evaluate prostate cancer risk is unclear. Studies have not examined predictors of interest in novel genomic SNP tests for prostate cancer risk in a diverse population. Consecutive participants in the Fox Chase Prostate Cancer Risk Assessment Program (PRAP) (n = 40) and unselected men from surgical urology clinics (n = 40) completed a one-time survey. Items examined interest in genomic SNP testing for prostate cancer risk, knowledge, impact of unsolicited findings, and psychosocial factors including health literacy. Knowledge of genomic SNP tests was low in both groups, but interest was higher among PRAP men (p testing in both groups. Multivariable modeling identified several predictors of higher interest in a genomic SNP test including higher perceived risk (p = 0.025), indicating zero reasons for not wanting testing (vs ≥1 reason) (p = 0.013), and higher health literacy (p = 0.016). Knowledge of genomic SNP testing was low in this sample, but higher among high-risk men. High-risk status may increase interest in novel genomic tests, while low literacy may lessen interest.

  17. Genetic Markers Analyses and Bioinformatic Approaches to Distinguish Between Olive Tree (Olea europaea L.) Cultivars.

    Science.gov (United States)

    Ben Ayed, Rayda; Ben Hassen, Hanen; Ennouri, Karim; Rebai, Ahmed

    2016-12-01

    The genetic diversity of 22 olive tree cultivars (Olea europaea L.) sampled from different Mediterranean countries was assessed using 5 SNP markers (FAD2.1; FAD2.3; CALC; SOD and ANTHO3) located in four different genes. The genotyping analysis of the 22 cultivars with 5 SNP loci revealed 11 alleles (average 2.2 per allele). The dendrogram based on cultivar genotypes revealed three clusters consistent with the cultivars classification. Besides, the results obtained with the five SNPs were compared to those obtained with the SSR markers using bioinformatic analyses and by computing a cophenetic correlation coefficient, indicating the usefulness of the UPGMA method for clustering plant genotypes. Based on principal coordinate analysis using a similarity matrix, the first two coordinates, revealed 54.94 % of the total variance. This work provides a more comprehensive explanation of the diversity available in Tunisia olive cultivars, and an important contribution for olive breeding and olive oil authenticity.

  18. Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications.

    Directory of Open Access Journals (Sweden)

    Xiao-Lin Wu

    Full Text Available Low-density (LD single nucleotide polymorphism (SNP arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD or high-density (HD SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE or haplotype-averaged Shannon entropy (HASE and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus

  19. Effect of imputing markers from a low-density chip on the reliability of genomic breeding values in Holstein populations

    DEFF Research Database (Denmark)

    Dassonneville, R; Brøndum, Rasmus Froberg; Druet, T

    2011-01-01

    The purpose of this study was to investigate the imputation error and loss of reliability of direct genomic values (DGV) or genomically enhanced breeding values (GEBV) when using genotypes imputed from a 3,000-marker single nucleotide polymorphism (SNP) panel to a 50,000-marker SNP panel. Data...... of missing markers and prediction of breeding values were performed using 2 different reference populations in each country: either a national reference population or a combined EuroGenomics reference population. Validation for accuracy of imputation and genomic prediction was done based on national test...... with a national reference data set gave an absolute loss of 0.05 in mean reliability of GEBV in the French study, whereas a loss of 0.03 was obtained for reliability of DGV in the Nordic study. When genotypes were imputed using the EuroGenomics reference, a loss of 0.02 in mean reliability of GEBV was detected...

  20. Population genomic structure and linkage disequilibrium analysis of South African goat breeds using genome-wide SNP data.

    Science.gov (United States)

    Mdladla, K; Dzomba, E F; Huson, H J; Muchadeyi, F C

    2016-08-01

    The sustainability of goat farming in marginal areas of southern Africa depends on local breeds that are adapted to specific agro-ecological conditions. Unimproved non-descript goats are the main genetic resources used for the development of commercial meat-type breeds of South Africa. Little is known about genetic diversity and the genetics of adaptation of these indigenous goat populations. This study investigated the genetic diversity, population structure and breed relations, linkage disequilibrium, effective population size and persistence of gametic phase in goat populations of South Africa. Three locally developed meat-type breeds of the Boer (n = 33), Savanna (n = 31), Kalahari Red (n = 40), a feral breed of Tankwa (n = 25) and unimproved non-descript village ecotypes (n = 110) from four goat-producing provinces of the Eastern Cape, KwaZulu-Natal, Limpopo and North West were assessed using the Illumina Goat 50K SNP Bead Chip assay. The proportion of SNPs with minor allele frequencies >0.05 ranged from 84.22% in the Tankwa to 97.58% in the Xhosa ecotype, with a mean of 0.32 ± 0.13 across populations. Principal components analysis, admixture and pairwise FST identified Tankwa as a genetically distinct population and supported clustering of the populations according to their historical origins. Genome-wide FST identified 101 markers potentially under positive selection in the Tankwa. Average linkage disequilibrium was highest in the Tankwa (r(2)  = 0.25 ± 0.26) and lowest in the village ecotypes (r(2) range = 0.09 ± 0.12 to 0.11 ± 0.14). We observed an effective population size of 100 kb with the exception of those in Savanna and Tswana populations. This study highlights the high level of genetic diversity in South African indigenous goats as well as the utility of the genome-wide SNP marker panels in genetic studies of these populations. © 2016 Stichting International Foundation for Animal Genetics.

  1. On the impact of relatedness on SNP association analysis.

    Science.gov (United States)

    Gross, Arnd; Tönjes, Anke; Scholz, Markus

    2017-12-06

    When testing for SNP (single nucleotide polymorphism) associations in related individuals, observations are not independent. Simple linear regression assuming independent normally distributed residuals results in an increased type I error and the power of the test is also affected in a more complicate manner. Inflation of type I error is often successfully corrected by genomic control. However, this reduces the power of the test when relatedness is of concern. In the present paper, we derive explicit formulae to investigate how heritability and strength of relatedness contribute to variance inflation of the effect estimate of the linear model. Further, we study the consequences of variance inflation on hypothesis testing and compare the results with those of genomic control correction. We apply the developed theory to the publicly available HapMap trio data (N=129), the Sorbs (a self-contained population with N=977 characterised by a cryptic relatedness structure) and synthetic family studies with different sample sizes (ranging from N=129 to N=999) and different degrees of relatedness. We derive explicit and easily to apply approximation formulae to estimate the impact of relatedness on the variance of the effect estimate of the linear regression model. Variance inflation increases with increasing heritability. Relatedness structure also impacts the degree of variance inflation as shown for example family structures. Variance inflation is smallest for HapMap trios, followed by a synthetic family study corresponding to the trio data but with larger sample size than HapMap. Next strongest inflation is observed for the Sorbs, and finally, for a synthetic family study with a more extreme relatedness structure but with similar sample size as the Sorbs. Type I error increases rapidly with increasing inflation. However, for smaller significance levels, power increases with increasing inflation while the opposite holds for larger significance levels. When genomic control

  2. Assessment of Cultivar Distinctness in Alfalfa: A Comparison of Genotyping-by-Sequencing, Simple-Sequence Repeat Marker, and Morphophysiological Observations

    Directory of Open Access Journals (Sweden)

    Paolo Annicchiarico

    2016-07-01

    Full Text Available Cultivar registration agencies typically require morphophysiological trait-based distinctness of candidate cultivars. This requirement is difficult to achieve for cultivars of major perennial forages because of their genetic structure and ever-increasing number of registered material, leading to possible rejection of agronomically valuable cultivars. This study aimed to explore the value of molecular markers applied to replicated bulked plants (three bulks of 100 independent plants each per cultivar to assess alfalfa ( L. subsp. cultivar distinctness. We compared genotyping-by-sequencing information based on 2902 polymorphic single-nucleotide polymorphism (SNP markers (>30 reads per DNA sample with morphophysiological information based on 11 traits and with simple-sequence repeat (SSR marker information from 41 polymorphic markers for their ability to distinguish 11 alfalfa landraces representative of the germplasm from northern Italy. Three molecular criteria, one based on cultivar differences for individual SSR bands and two based on overall SNP marker variation assessed either by statistically significant cultivar differences on principal component axes or discriminant analysis, distinctly outperformed the morphophysiological criterion. Combining the morphophysiological criterion with either molecular marker method increased discrimination among cultivars, since morphophysiological diversity was unrelated to SSR marker-based diversity ( = 0.04 and poorly related to SNP marker-based diversity ( = 0.23, < 0.15. The criterion based on statistically significant SNP allele frequency differences was less discriminating than morphophysiological variation. Marker-based distinctness, which can be assessed at low cost and without interactions with testing conditions, could validly substitute for (or complement morphophysiological distinctness in alfalfa cultivar registration schemes. It also has interest in sui generis registration systems aimed at

  3. Genetic identity, ancestry and parentage in farmer selections of cacao from Aceh, Indonesia revealed by single nucleotide polymorphism (SNP) markers

    Science.gov (United States)

    Cacao (Theobroma cacao L.) is the source of cocoa powder and butter used for chocolate and this species originated in the rainforests of South America. Indonesia is the 3rd largest cacao producer in the world with an annual cacao output of 0.55 million tons. Knowledge of on-farm genetic diversity is...

  4. Detecting genotypic variation among the single spore isolates of Pasteuria penetrans population occuring in Florida using SNP-based markers

    Science.gov (United States)

    Pasteuria penetrans is a naturally occurring soil-borne endospore-forming bacterium, which functions as a castrating parasite of plant-parasitic nematodes belonging to the genus Meloidogyne. Pasteuria penetrans is established as an effective biological control agent for control and management o...

  5. Genotyping single spore isolates of a Pasteuria penetrans population occurring in Florida using SNP-based markers

    Science.gov (United States)

    The aim of this study was to examine genotypic variation and virulence characteristics of a population of bacterial parasite of root-knot nematode (RKN), Pasteuria penetrans, isolated from Florida. Six single spore lines (ssp), 16ssp, 17ssp, 18ssp, 25ssp, 26ssp, and 30ssp were generated by infecting...

  6. Non-mendelian inheritance of SNP markers reveals extensive chromosomal translocations in dioecious hops (humulus lupulus L.)

    Science.gov (United States)

    Hop (Humulus lupulus) is a high-climbing, herbaceous perennial, dioecious vine, and has a long history of use as flavoring and stability agent in beer as well as nutraceutical medicine, bio-fuel fermentations and animal fodder. However, the modes of genetic inheritance and genetic diversity are poor...

  7. Varietal identification of tea (Camellia sinensis [L.] Kuntze) using nanofluidic array of Single Nucleotide Polymorphism (SNP) markers

    Science.gov (United States)

    Apart from water, tea is the world’s most widely consumed beverage. Tea is produced in more than 50 countries with an annual production of approximately 4.7 million tons. The market segment for specialty tea has been expanding rapidly owing to increased demand, resulting in higher revenues and profi...

  8. Association study of phenology, yield and quality related traits in table grapes using SSR and SNP markers

    OpenAIRE

    Zarouri, Belkacem

    2016-01-01

    The advent of cheaper high throughput genotyping technologies and the availability of large germplasm collections encouraged the extension of Genome-Wide Association Studies (GWAS) to crop plants. However, to date these strategies have not yet been tested in grapevine (Vitis vinífera L.). Taking advantage of the availability of a large grapevine germplasm collection maintained at the germplasm bank of El Encín (Alcalá de Henares, Madrid, Spain) and the relatively affordable genotyping tools, ...

  9. fcGENE: a versatile tool for processing and transforming SNP datasets.

    Directory of Open Access Journals (Sweden)

    Nab Raj Roshyara

    Full Text Available Modern analysis of high-dimensional SNP data requires a number of biometrical and statistical methods such as pre-processing, analysis of population structure, association analysis and genotype imputation. Software used for these purposes often rely on specific and incompatible input and output data formats. Therefore extensive data management including multiple format conversions is necessary during analyses.In order to support fast and efficient management and bio-statistical quality control of high-dimensional SNP data, we developed the publically available software fcGENE using C++ object-oriented programming language. This software simplifies and automates the use of different existing analysis packages, especially during the workflow of genotype imputations and corresponding analyses.fcGENE transforms SNP data and imputation results into different formats required for a large variety of analysis packages such as PLINK, SNPTEST, HAPLOVIEW, EIGENSOFT, GenABEL and tools used for genotype imputation such as MaCH, IMPUTE, BEAGLE and others. Data Management tasks like merging, splitting, extracting SNP and pedigree information can be performed. fcGENE also supports a number of bio-statistical quality control processes and quality based filtering processes at SNP- and sample-wise level. The tool also generates templates of commands required to run specific software packages, especially those required for genotype imputation. We demonstrate the functionality of fcGENE by example workflows of SNP data analyses and provide a comprehensive manual of commands, options and applications.We have developed a user-friendly open-source software fcGENE, which comprehensively supports SNP data management, quality control and analysis workflows. Download statistics and corresponding feedbacks indicate that software is highly recognised and extensively applied by the scientific community.

  10. DNA-based genetic markers for Rapid Cycling Brassica rapa (Fast Plants type designed for the teaching laboratory.

    Directory of Open Access Journals (Sweden)

    Eryn E. Slankster

    2012-06-01

    Full Text Available We have developed DNA-based genetic markers for rapid-cycling Brassica rapa (RCBr, also known as Fast Plants. Although markers for Brassica rapa already exist, ours were intentionally designed for use in a teaching laboratory environment. The qualities we selected for were robust amplification in PCR, polymorphism in RCBr strains, and alleles that can be easily resolved in simple agarose slab gels. We have developed two single nucleotide polymorphism (SNP based markers and 14 variable number tandem repeat (VNTR-type markers spread over four chromosomes. The DNA sequences of these markers represent variation in a wide range of genomic features. Among the VNTR-type markers, there are examples of variation in a nongenic region, variation within an intron, and variation in the coding sequence of a gene. Among the SNP-based markers there are examples of polymorphism in intronic DNA and synonymous substitution in a coding sequence. Thus these markers can serve laboratory exercises in both transmission genetics and molecular biology.

  11. A genome-wide SNP-association study confirms a sequence variant (g.66493737C>T in the equine myostatin (MSTN gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses

    Directory of Open Access Journals (Sweden)

    Whiston Ronan

    2010-10-01

    polymorphism affects putative transcription-factor binding and gives rise to variation in gene and protein expression. Nonetheless, this study demonstrates that the g.66493737C>T SNP provides the most powerful genetic marker for prediction of race distance aptitude in Thoroughbreds.

  12. Environmental Response and Genomic Regions Correlated with Rice Root Growth and Yield under Drought in the OryzaSNP Panel across Multiple Study Systems.

    Directory of Open Access Journals (Sweden)

    Len J Wade

    Full Text Available The rapid progress in rice genotyping must be matched by advances in phenotyping. A better understanding of genetic variation in rice for drought response, root traits, and practical methods for studying them are needed. In this study, the OryzaSNP set (20 diverse genotypes that have been genotyped for SNP markers was phenotyped in a range of field and container studies to study the diversity of rice root growth and response to drought. Of the root traits measured across more than 20 root experiments, root dry weight showed the most stable genotypic performance across studies. The environment (E component had the strongest effect on yield and root traits. We identified genomic regions correlated with root dry weight, percent deep roots, maximum root depth, and grain yield based on a correlation analysis with the phenotypes and aus, indica, or japonica introgression regions using the SNP data. Two genomic regions were identified as hot spots in which root traits and grain yield were co-located; on chromosome 1 (39.7-40.7 Mb and on chromosome 8 (20.3-21.9 Mb. Across experiments, the soil type/ growth medium showed more correlations with plant growth than the container dimensions. Although the correlations among studies and genetic co-location of root traits from a range of study systems points to their potential utility to represent responses in field studies, the best correlations were observed when the two setups had some similar properties. Due to the co-location of the identified genomic regions (from introgression block analysis with QTL for a number of previously reported root and drought traits, these regions are good candidates for detailed characterization to contribute to understanding rice improvement for response to drought. This study also highlights the utility of characterizing a small set of 20 genotypes for root growth, drought response, and related genomic regions.

  13. Mining and Analysis of SNP in Response to Salinity Stress in Upland Cotton (Gossypium hirsutum L.).

    Science.gov (United States)

    Wang, Xiaoge; Lu, Xuke; Wang, Junjuan; Wang, Delong; Yin, Zujun; Fan, Weili; Wang, Shuai; Ye, Wuwei

    2016-01-01

    Salinity stress is a major abiotic factor that affects crop output, and as a pioneer crop in saline and alkaline land, salt tolerance study of cotton is particularly important. In our experiment, four salt-tolerance varieties with different salt tolerance indexes including CRI35 (65.04%), Kanghuanwei164 (56.19%), Zhong9807 (55.20%) and CRI44 (50.50%), as well as four salt-sensitive cotton varieties including Hengmian3 (48.21%), GK50 (40.20%), Xinyan96-48 (34.90%), ZhongS9612 (24.80%) were used as the materials. These materials were divided into salt-tolerant group (ST) and salt-sensitive group (SS). Illumina Cotton SNP 70K Chip was used to detect SNP in different cotton varieties. SNPv (SNP variation of the same seedling pre- and after- salt stress) in different varieties were screened; polymorphic SNP and SNPr (SNP related to salt tolerance) were obtained. Annotation and analysis of these SNPs showed that (1) the induction efficiency of salinity stress on SNPv of cotton materials with different salt tolerance index was different, in which the induction efficiency on salt-sensitive materials was significantly higher than that on salt-tolerant materials. The induction of salt stress on SNPv was obviously biased. (2) SNPv induced by salt stress may be related to the methylation changes under salt stress. (3) SNPr may influence salt tolerance of plants by affecting the expression of salt-tolerance related genes.

  14. The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies

    Science.gov (United States)

    Barnett, Ian; Mukherjee, Rajarshi; Lin, Xihong

    2017-01-01

    It is of substantial interest to study the effects of genes, genetic pathways, and networks on the risk of complex diseases. These genetic constructs each contain multiple SNPs, which are often correlated and function jointly, and might be large in number. However, only a sparse subset of SNPs in a genetic construct is generally associated with the disease of interest. In this article, we propose the generalized higher criticism (GHC) to test for the association between an SNP set and a disease outcome. The higher criticism is a test traditionally used in high-dimensional signal detection settings when marginal test statistics are independent and the number of parameters is very large. However, these assumptions do not always hold in genetic association studies, due to linkage disequilibrium among SNPs and the finite number of SNPs in an SNP set in each genetic construct. The proposed GHC overcomes the limitations of the higher criticism by allowing for arbitrary correlation structures among the SNPs in an SNP-set, while performing accurate analytic p-value calculations for any finite number of SNPs in the SNP-set. We obtain the detection boundary of the GHC test. We compared empirically using simulations the power of the GHC method with existing SNP-set tests over a range of genetic regions with varied correlation structures and signal sparsity. We apply the proposed methods to analyze the CGEM breast cancer genome-wide association study. Supplementary materials for this article are available online. PMID:28736464

  15. [Relationship between genetic polymorphisms of 3 SNP loci in 5-HTT gene and paranoid schizophrenia].

    Science.gov (United States)

    Xuan, Jin-Feng; Ding, Mei; Pang, Hao; Xing, Jia-Xin; Sun, Yi-Hua; Yao, Jun; Zhao, Yi; Li, Chun-Mei; Wang, Bao-Jie

    2012-12-01

    To investigate the population genetic data of 3 SNP loci (rs25533, rs34388196 and rs1042173) of 5-hydroxytryptamine transporter (5-HTT) gene and the association with paranoid schizophrenia. Three SNP loci of 5-HTT gene were examined in 132 paranoid schizophrenia patients and 150 unrelated healthy individuals of Northern Chinese Han population by PCR-RFLP technique. The Hardy-Weinberg equilibrium test was performed using the chi-square test and the data of haplotype frequency and population genetics parameters were statistically analyzed. Among these three SNP loci, four haplotypes were obtained. There were no statistically significant differences between the patient group and the control group (P > 0.05). The DP values of the 3 SNP loci were 0.276, 0.502 and 0.502. The PIC of them were 0.151, 0.281 and 0.281. The PE of them were 0.014, 0.072 and 0.072. The three SNP loci and four haplotypes of 5-HTT gene have no association with paranoid schizophrenia, while the polymorphism still have high potential application in forensic practice.

  16. Underestimated effect sizes in GWAS: fundamental limitations of single SNP analysis for dichotomous phenotypes.

    Directory of Open Access Journals (Sweden)

    Sven Stringer

    Full Text Available Complex diseases are often highly heritable. However, for many complex traits only a small proportion of the heritability can be explained by observed genetic variants in traditional genome-wide association (GWA studies. Moreover, for some of those traits few significant SNPs have been identified. Single SNP association methods test for association at a single SNP, ignoring the effect of other SNPs. We show using a simple multi-locus odds model of complex disease that moderate to large effect sizes of causal variants may be estimated as relatively small effect sizes in single SNP association testing. This underestimation effect is most severe for diseases influenced by numerous risk variants. We relate the underestimation effect to the concept of non-collapsibility found in the statistics literature. As described, continuous phenotypes generated with linear genetic models are not affected by this underestimation effect. Since many GWA studies apply single SNP analysis to dichotomous phenotypes, previously reported results potentially underestimate true effect sizes, thereby impeding identification of true effect SNPs. Therefore, when a multi-locus model of disease risk is assumed, a multi SNP analysis may be more appropriate.

  17. Leveraging ethnic group incidence variation to investigate genetic susceptibility to glioma: A novel candidate SNP approach

    Directory of Open Access Journals (Sweden)

    Daniel Ian Jacobs

    2012-10-01

    Full Text Available Objectives: Using a novel candidate SNP approach, we aimed to identify a possible genetic basis for the higher glioma incidence in Whites relative to East Asians and African-Americans. Methods: We hypothesized that genetic regions containing SNPs with extreme differences in allele frequencies across ethnicities are most likely to harbor susceptibility variants. We used International HapMap Project data to identify 3,961 candidate SNPs with the largest allele frequency differences in Whites compared to East Asians and Africans and tested these SNPs for association with glioma risk in a set of White cases and controls. Top SNPs identified in the discovery dataset were tested for association with glioma in five independent replication datasets. Results: No SNP achieved statistical significance in either the discovery or replication datasets after accounting for multiple testing. However, the most strongly associated SNP, rs879471, was found to be in linkage disequilibrium with a previously identified risk SNP, rs6010620, in RTEL1. We estimate rs6010620 to account for a glioma incidence rate ratio of 1.34 for Whites relative to East Asians. Conclusions: We explored genetic susceptibility to glioma using a novel candidate SNP method which may be applicable to other diseases with appropriate epidemiologic patterns.

  18. Short Tree, Long Tree, Right Tree, Wrong Tree: New Acquisition Bias Corrections for Inferring SNP Phylogenies.

    Science.gov (United States)

    Leaché, Adam D; Banbury, Barbara L; Felsenstein, Joseph; de Oca, Adrián Nieto-Montes; Stamatakis, Alexandros

    2015-11-01

    Single nucleotide polymorphisms (SNPs) are useful markers for phylogenetic studies owing in part to their ubiquity throughout the genome and ease of collection. Restriction site associated DNA sequencing (RADseq) methods are becoming increasingly popular for SNP data collection, but an assessment of the best practises for using these data in phylogenetics is lacking. We use computer simulations, and new double digest RADseq (ddRADseq) data for the lizard family Phrynosomatidae, to investigate the accuracy of RAD loci for phylogenetic inference. We compare the two primary ways RAD loci are used during phylogenetic analysis, including the analysis of full sequences (i.e., SNPs together with invariant sites), or the analysis of SNPs on their own after excluding invariant sites. We find that using full sequences rather than just SNPs is preferable from the perspectives of branch length and topological accuracy, but not of computational time. We introduce two new acquisition bias corrections for dealing with alignments composed exclusively of SNPs, a conditional likelihood method and a reconstituted DNA approach. The conditional likelihood method conditions on the presence of variable characters only (the number of invariant sites that are unsampled but known to exist is not considered), while the reconstituted DNA approach requires the user to specify the exact number of unsampled invariant sites prior to the analysis. Under simulation, branch length biases increase with the amount of missing data for both acquisition bias correction methods, but branch length accuracy is much improved in the reconstituted DNA approach compared to the conditional likelihood approach. Phylogenetic analyses of the empirical data using concatenation or a coalescent-based species tree approach provide strong support for many of the accepted relationships among phrynosomatid lizards, suggesting that RAD loci contain useful phylogenetic signal across a range of divergence times despite the

  19. Whole genome SNP discovery and analysis of genetic diversity in Turkey (Meleagris gallopavo)

    Science.gov (United States)

    2012-01-01

    Background The turkey (Meleagris gallopavo) is an important agricultural species and the second largest contributor to the world’s poultry meat production. Genetic improvement is attributed largely to selective breeding programs that rely on highly heritable phenotypic traits, such as body size and breast muscle development. Commercial breeding with small effective population sizes and epistasis can result in loss of genetic diversity, which in turn can lead to reduced individual fitness and reduced response to selection. The presence of genomic diversity in domestic livestock species therefore, is of great importance and a prerequisite for rapid and accurate genetic improvement of selected breeds in various environments, as well as to facilitate rapid adaptation to potential changes in breeding goals. Genomic selection requires a large number of genetic markers such as e.g. single nucleotide polymorphisms (SNPs) the most abundant source of genetic variation within the genome. Results Alignment of next generation sequencing data of 32 individual turkeys from different populations was used for the discovery of 5.49 million SNPs, which subsequently were used for the analysis of genetic diversity among the different populations. All of the commercial lines branched from a single node relative to the heritage varieties and the South Mexican turkey population. Heterozygosity of all individuals from the different turkey populations ranged from 0.17-2.73 SNPs/Kb, while heterozygosity of populations ranged from 0.73-1.64 SNPs/Kb. The average frequency of heterozygous SNPs in individual turkeys was 1.07 SNPs/Kb. Five genomic regions with very low nucleotide variation were identified in domestic turkeys that showed state of fixation towards alleles different than wild alleles. Conclusion The turkey genome is much less diverse with a relatively low frequency of heterozygous SNPs as compared to other livestock species like chicken and pig. The whole genome SNP discovery

  20. Whole genome SNP discovery and analysis of genetic diversity in Turkey (Meleagris gallopavo

    Directory of Open Access Journals (Sweden)

    Aslam Muhammad L

    2012-08-01

    Full Text Available Abstract Background The turkey (Meleagris gallopavo is an important agricultural species and the second largest contributor to the world’s poultry meat production. Genetic improvement is attributed largely to selective breeding programs that rely on highly heritable phenotypic traits, such as body size and breast muscle development. Commercial breeding with small effective population sizes and epistasis can result in loss of genetic diversity, which in turn can lead to reduced individual fitness and reduced response to selection. The presence of genomic diversity in domestic livestock species therefore, is of great importance and a prerequisite for rapid and accurate genetic improvement of selected breeds in various environments, as well as to facilitate rapid adaptation to potential changes in breeding goals. Genomic selection requires a large number of genetic markers such as e.g. single nucleotide polymorphisms (SNPs the most abundant source of genetic variation within the genome. Results Alignment of next generation sequencing data of 32 individual turkeys from different populations was used for the discovery of 5.49 million SNPs, which subsequently were used for the analysis of genetic diversity among the different populations. All of the commercial lines branched from a single node relative to the heritage varieties and the South Mexican turkey population. Heterozygosity of all individuals from the different turkey populations ranged from 0.17-2.73 SNPs/Kb, while heterozygosity of populations ranged from 0.73-1.64 SNPs/Kb. The average frequency of heterozygous SNPs in individual turkeys was 1.07 SNPs/Kb. Five genomic regions with very low nucleotide variation were identified in domestic turkeys that showed state of fixation towards alleles different than wild alleles. Conclusion The turkey genome is much less diverse with a relatively low frequency of heterozygous SNPs as compared to other livestock species like chicken and pig. The

  1. Tantalum markers in radiography

    International Nuclear Information System (INIS)

    Aronson, A.S.; Jonsson, N.; Alberius, P.

    1985-01-01

    The biocompatibility of two types of radiopaque tantalum markers was evaluated histologically. Reactions to pin markers (99.9% purity) and spherical markers (95.2% purity) were investigated after 3-6 weeks in rabbits and 5-48 weeks in children with abnormal growth. Both marker types were firmly attached to bone trabeculae; this was most pronounced in rabbit bone, and no adverse macroscopic reactions were observed. Microscopically, no reactions or only slight fibrosis of bone tissue were detected, while soft tissues only demonstrated a minor inflammatory reaction. Nevertheless, the need for careful preparation and execution of marker implantations is stressed, and particularly avoidance iof the use of emery in sharpening of cannulae. The bioinertness of tantalum was reconfirmed as was its suitability for use as skeletal and soft tissue radiographic markers. (orig.)

  2. Developing a SNP panel for forensic identification of individuals

    DEFF Research Database (Denmark)

    Kidd, KK; Pakstis, AJ; Speed, WC

    2006-01-01

    of genetic variation from the world's major geographical regions. Those with little allele frequency variation on the seven populations are then screened on a total of 40 populations ( approximately 2100 individuals) and the most promising retained. The preliminary panel of 19 SNPs, from an initial selection......, because allele frequencies can vary greatly among populations, the population genetics of match probabilities is a critical issue. Some SNPs, however, show little allele frequency variation among populations while remaining highly informative. We describe here both an efficient strategy for identifying...... and characterizing such SNPs, and test that strategy on a broad representation of world populations. Markers with high heterozygosity and little frequency variation among African American, European American, and East Asian populations are selected for additional screening on seven populations that provide a sampling...

  3. Developing Exon-Primed Intron-Crossing (EPIC) markers for population genetic studies in three Aedes disease vectors.

    Science.gov (United States)

    White, Vanessa Linley; Endersby, Nancy Margaret; Chan, Janice; Hoffmann, Ary Anthony; Weeks, Andrew Raymond

    2015-03-01

    Aedes aegypti, Aedes notoscriptus, and Aedes albopictus are important vectors of many arboviruses implicated in human disease such as dengue fever. Genetic markers applied across vector species can provide important information on population structure, gene flow, insecticide resistance, and taxonomy, however, robust microsatellite markers have proven difficult to develop in these species and mosquitoes generally. Here we consider the utility and transferability of 15 Ribosome protein (Rp) Exon-Primed Intron-Crossing (EPIC) markers for population genetic studies in these 3 Aedes species. Rp EPIC markers designed for Ae. aegypti also successfully amplified populations of the sister species, Ae. albopictus, as well as the distantly related species, Ae. notoscriptus. High SNP and good indel diversity in sequenced alleles plus support for amplification of the same regions across populations and species were additional benefits of these markers. These findings point to the general value of EPIC markers in mosquito population studies. © 2014 Institute of Zoology, Chinese Academy of Sciences.

  4. Use of genotyping by sequencing data to develop a high-throughput and multifunctional SNP panel for conservation applications in Pacific lamprey.

    Science.gov (United States)

    Hess, Jon E; Campbell, Nathan R; Docker, Margaret F; Baker, Cyndi; Jackson, Aaron; Lampman, Ralph; McIlraith, Brian; Moser, Mary L; Statler, David P; Young, William P; Wildbill, Andrew J; Narum, Shawn R

    2015-01-01

    Next-generation sequencing data can be mined for highly informative single nucleotide polymorphisms (SNPs) to develop high-throughput genomic assays for nonmodel organisms. However, choosing a set of SNPs to address a variety of objectives can be difficult because SNPs are often not equally informative. We developed an optimal combination of 96 high-throughput SNP assays from a total of 4439 SNPs identified in a previous study of Pacific lamprey (Entosphenus tridentatus) and used them to address four disparate objectives: parentage analysis, species identification and characterization of neutral and adaptive variation. Nine of these SNPs are FST outliers, and five of these outliers are localized within genes and significantly associated with geography, run-timing and dwarf life history. Two of the 96 SNPs were diagnostic for two other lamprey species that were morphologically indistinguishable at early larval stages and were sympatric in the Pacific Northwest. The majority (85) of SNPs in the panel were highly informative for parentage analysis, that is, putatively neutral with high minor allele frequency across the species' range. Results from three case studies are presented to demonstrate the broad utility of this panel of SNP markers in this species. As Pacific lamprey populations are undergoing rapid decline, these SNPs provide an important resource to address critical uncertainties associated with the conservation and recovery of this imperiled species. © 2014 John Wiley & Sons Ltd.

  5. [Genetic Variability and Structure of SNP Haplotypes in the DMPK Gene in Yakuts and Other Ethnic Groups of Northern Eurasia in Relation to Myotonic Dystrophy].

    Science.gov (United States)

    Swarovskaya, M G; Stepanova, S K; Marussin, A V; Sukhomyasova, A L; Maximova, N R; Stepanov, V A

    2015-06-01

    The genetic variability of the DMPK locus has been studied in relation to six SNP markers (rs2070736, rs572634, rs1799894, rs527221, rs915915, and rs10415988) in Yakuts with myotonic dystrophy (MD) in the Yakut population and in populations of northern Eurasia. Significant differences were observed in the allele frequencies between patients and a population sample of Yakuts for three SNP loci (rs915915, rs1799894, and rs10415988) associated with a high chance of disease manifestation. The odds ratios (OR) of MD development in representatives of the Yakut population for these three loci were 2.59 (95% CI, p = 0,004), 4.99 (95% CI, p = 0.000), and 3.15 (95% CI, p = 0.01), respectively. Haplotype TTTCTC, which is associated with MD, and haplotype GTCCTT, which was observed only in Yakut MD patients (never in MD patients of non-Yakut origin), were revealed. A low level of variability in the locus of DMRK gene in Yakuts (H(e) = 0.283) compared with other examined populations was noted. An analysis of pairwise genetic relationships between populations revealed their significant differentiation for all the examined loci. In addition, a low level of differentiation in territorial groups of Yakut populations (F(ST) = 0.79%), which was related to the high subdivision of the northern Eurasian population (F(ST) = 11.83%), was observed.

  6. A gene-based linkage map for Bicyclus anynana butterflies allows for a comprehensive analysis of synteny with the lepidopteran reference genome.

    Directory of Open Access Journals (Sweden)

    Patrícia Beldade

    2009-02-01

    Full Text Available Lepidopterans (butterflies and moths are a rich and diverse order of insects, which, despite their economic impact and unusual biological properties, are relatively underrepresented in terms of genomic resources. The genome of the silkworm Bombyx mori has been fully sequenced, but comparative lepidopteran genomics has been hampered by the scarcity of information for other species. This is especially striking for butterflies, even though they have diverse and derived phenotypes (such as color vision and wing color patterns and are considered prime models for the evolutionary and developmental analysis of ecologically relevant, complex traits. We focus on Bicyclus anynana butterflies, a laboratory system for studying the diversification of novelties and serially repeated traits. With a panel of 12 small families and a biphasic mapping approach, we first assigned 508 expressed genes to segregation groups and then ordered 297 of them within individual linkage groups. We also coarsely mapped seven color pattern loci. This is the richest gene-based map available for any butterfly species and allowed for a broad-coverage analysis of synteny with the lepidopteran reference genome. Based on 462 pairs of mapped orthologous markers in Bi. anynana and Bo. mori, we observed strong conservation of gene assignment to chromosomes, but also evidence for numerous large- and small-scale chromosomal rearrangements. With gene collections growing for a variety of target organisms, the ability to place those genes in their proper genomic context is paramount. Methods to map expressed genes and to compare maps with relevant model systems are crucial to extend genomic-level analysis outside classical model species. Maps with gene-based markers are useful for comparative genomics and to resolve mapped genomic regions to a tractable number of candidate genes, especially if there is synteny with related model species. This is discussed in relation to the identification of

  7. Forensic genetic SNP typing of low-template DNA and highly degraded DNA from crime case samples

    DEFF Research Database (Denmark)

    Børsting, Claus; Mogensen, Helle Smidt; Morling, Niels

    2013-01-01

    the heterozygote balance. Allele drop-ins were only observed in experiments with 25 pg of DNA and not in experiments with 50 and 100 pg of DNA. The allele drop-in rate in the 25 pg experiments was 0.06% or 100 times lower than what was previously reported for STR typing of LtDNA. A composite model and two......Heterozygote imbalances leading to allele drop-outs and disproportionally large stutters leading to allele drop-ins are known stochastic phenomena related to STR typing of low-template DNA (LtDNA). The large stutters and the many drop-ins in typical STR stutter positions are artifacts from the PCR...... amplification of tandem repeats. These artifacts may be avoided by typing bi-allelic markers instead of STRs. In this work, the SNPforID multiplex assay was used to type LtDNA. A sensitized SNP typing protocol was introduced, that increased signal strengths without increasing noise and without affecting...

  8. Genetic diversity and structure of elite cotton germplasm (Gossypium hirsutum L.) using genome-wide SNP data.

    Science.gov (United States)

    Ai, XianTao; Liang, YaJun; Wang, JunDuo; Zheng, JuYun; Gong, ZhaoLong; Guo, JiangPing; Li, XueYuan; Qu, YanYing

    2017-10-01

    Cotton (Gossypium spp.) is the most important natural textile fiber crop, and Gossypium hirsutum L. is responsible for 90% of the annual cotton crop in the world. Information on cotton genetic diversity and population structure is essential for new breeding lines. In this study, we analyzed population structure and genetic diversity of 288 elite Gossypium hirsutum cultivar accessions collected from around the world, and especially from China, using genome-wide single nucleotide polymorphisms (SNP) markers. The average polymorphsim information content (PIC) was 0.25, indicating a relatively low degree of genetic diversity. Population structure analysis revealed extensive admixture and identified three subgroups. Phylogenetic analysis supported the subgroups identified by STRUCTURE. The results from both population structure and phylogenetic analysis were, for the most part, in agreement with pedigree information. Analysis of molecular variance revealed a larger amount of variation was due to diversity within the groups. Establishment of genetic diversity and population structure from this study could be useful for genetic and genomic analysis and systematic utilization of the standing genetic variation in upland cotton.

  9. Markers and mapping revisited: finding your gene.

    Science.gov (United States)

    Jones, Neil; Ougham, Helen; Thomas, Howard; Pasakinskiene, Izolda

    2009-01-01

    This paper is an update of our earlier review (Jones et al., 1997, Markers and mapping: we are all geneticists now. New Phytologist 137: 165-177), which dealt with the genetics of mapping, in terms of recombination as the basis of the procedure, and covered some of the first generation of markers, including restriction fragment length polymorphisms (RFLPs), random amplified polymorphic DNA (RAPDs), simple sequence repeats (SSRs) and quantitative trait loci (QTLs). In the intervening decade there have been numerous developments in marker science with many new systems becoming available, which are herein described: cleavage amplification polymorphism (CAP), sequence-specific amplification polymorphism (S-SAP), inter-simple sequence repeat (ISSR), sequence tagged site (STS), sequence characterized amplification region (SCAR), selective amplification of microsatellite polymorphic loci (SAMPL), single nucleotide polymorphism (SNP), expressed sequence tag (EST), sequence-related amplified polymorphism (SRAP), target region amplification polymorphism (TRAP), microarrays, diversity arrays technology (DArT), single-strand conformation polymorphism (SSCP), denaturing gradient gel electrophoresis (DGGE), temperature gradient gel electrophoresis (TGGE) and methylation-sensitive PCR. In addition there has been an explosion of knowledge and databases in the area of genomics and bioinformatics. The number of flowering plant ESTs is c. 19 million and counting, with all the opportunity that this provides for gene-hunting, while the survey of bioinformatics and computer resources points to a rapid growth point for future activities in unravelling and applying the burst of new information on plant genomes. A case study is presented on tracking down a specific gene (stay-green (SGR), a post-transcriptional senescence regulator) using the full suite of mapping tools and comparative mapping resources. We end with a brief speculation on how genome analysis may progress into the future of

  10. Identification of genetic markers linked to anthracnose resistance in sorghum using association analysis.

    Science.gov (United States)

    Upadhyaya, Hari D; Wang, Yi-Hong; Sharma, Rajan; Sharma, Shivali

    2013-06-01

    Anthracnose in sorghum caused by Colletotrichum sublineolum is one of the most destructive diseases affecting sorghum production under warm and humid conditions. Markers and genes linked to resistance to the disease are important for plant breeding. Using 14,739 SNP markers, we have mapped eight loci linked to resistance in sorghum through association analysis of a sorghum mini-core collection consisting of 242 diverse accessions evaluated for anthracnose resistance for 2 years in the field. The mini-core was representative of the International Crops Research Institute for the Semi-Arid Tropics' world-wide sorghum landrace collection. Eight marker loci were associated with anthracnose resistance in both years. Except locus 8, disease resistance-related genes were found in all loci based on their physical distance from linked SNP markers. These include two NB-ARC class of R genes on chromosome 10 that were partially homologous to the rice blast resistance gene Pib, two hypersensitive response-related genes: autophagy-related protein 3 on chromosome 1 and 4 harpin-induced 1 (Hin1) homologs on chromosome 8, a RAV transcription factor that is also part of R gene pathway, an oxysterol-binding protein that functions in the non-specific host resistance, and homologs of menthone:neomenthol reductase (MNR) that catalyzes a menthone reduction to produce the antimicrobial neomenthol. These genes and markers may be developed into molecular tools for genetic improvement of anthracnose resistance in sorghum.

  11. Quantification of within-sample genetic heterogeneity from SNP-array data

    DEFF Research Database (Denmark)

    Martinez, Pierre; Kimberley, Christopher; Birkbak, Nicolai Juul

    2017-01-01

    Intra-tumour genetic heterogeneity (ITH) fosters drug resistance and is a critical hurdle to clinical treatment. ITH can be well-measured using multi-region sampling but this is costly and challenging to implement. There is therefore a need for tools to estimate ITH in individual samples, using...... standard genomic data such as SNP-arrays, that could be implemented routinely. We designed two novel scores S and R, respectively based on the Shannon diversity index and Ripley's L statistic of spatial homogeneity, to quantify ITH in single SNP-array samples. We created in-silico and in-vitro mixtures...... sequencing data but heterogeneity in the fraction of tumour cells present across samples hampered accurate quantification. The prognostic potential of both scores was moderate but significantly predictive of survival in several tumour types (corrected p = 0.03). Our work thus shows how individual SNP...

  12. SNP calling using genotype model selection on high-throughput sequencing data

    KAUST Repository

    You, Na

    2012-01-16

    Motivation: A review of the available single nucleotide polymorphism (SNP) calling procedures for Illumina high-throughput sequencing (HTS) platform data reveals that most rely mainly on base-calling and mapping qualities as sources of error when calling SNPs. Thus, errors not involved in base-calling or alignment, such as those in genomic sample preparation, are not accounted for.Results: A novel method of consensus and SNP calling, Genotype Model Selection (GeMS), is given which accounts for the errors that occur during the preparation of the genomic sample. Simulations and real data analyses indicate that GeMS has the best performance balance of sensitivity and positive predictive value among the tested SNP callers. © The Author 2012. Published by Oxford University Press. All rights reserved.

  13. Eight new genomes and synthetic controls increase the accessibility of rapid melt-MAMA SNP typing of Coxiella burnetii.

    Directory of Open Access Journals (Sweden)

    Edvin Karlsson

    Full Text Available The case rate of Q fever in Europe has increased dramatically in recent years, mainly because of an epidemic in the Netherlands in 2009. Consequently, there is a need for more extensive genetic characterization of the disease agent Coxiella burnetii in order to better understand the epidemiology and spread of this disease. Genome reference data are essential for this purpose, but only thirteen genome sequences are currently available. Current methods for typing C. burnetii are criticized for having problems in comparing results across laboratories, require the use of genomic control DNA, and/or rely on markers in highly variable regions. We developed in this work a method for single nucleotide polymorphism (SNP typing of C. burnetii isolates and tissue samples based on new assays targeting ten phylogenetically stable synonymous canonical SNPs (canSNPs. These canSNPs represent previously known phylogenetic branches and were here identified from sequence comparisons of twenty-one C. burnetii genomes, eight of which were sequenced in this work. Importantly, synthetic control templates were developed, to make the method useful to laboratories lacking genomic control DNA. An analysis of twenty-one C. burnetii genomes confirmed that the species exhibits high sequence identity. Most of its SNPs (7,493/7,559 shared by >1 genome follow a clonal inheritance pattern and are therefore stable phylogenetic typing markers. The assays were validated using twenty-six genetically diverse C. burnetii isolates and three tissue samples from small ruminants infected during the epidemic in the Netherlands. Each sample was assigned to a clade. Synthetic controls (vector and PCR amplified gave identical results compared to the corresponding genomic controls and are viable alternatives to genomic DNA. The results from the described method indicate that it could be useful for cheap and rapid disease source tracking at non-specialized laboratories, which requires accurate

  14. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin.

    Directory of Open Access Journals (Sweden)

    Michela Troggio

    Full Text Available High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432, but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

  15. Gene-based meta-analysis of genome-wide association studies implicates new loci involved in obesity

    DEFF Research Database (Denmark)

    Hägg, Sara; Ganna, Andrea; Van Der Laan, Sander W

    2015-01-01

    ) approach to assign variants to genes and to calculate gene-based P-values based on simulations. The VEGAS method was applied to each cohort separately before a gene-based meta-analysis was performed. In Stage 1, two known (FTO and TMEM18) and six novel (PEX2, MTFR2, SSFA2, IARS2, CEP295 and TXNDC12) loci...

  16. On marker-based parentage verification via non-linear optimization.

    Science.gov (United States)

    Boerner, Vinzent

    2017-06-15

    Parentage verification by molecular markers is mainly based on short tandem repeat markers. Single nucleotide polymorphisms (SNPs) as bi-allelic markers have become the markers of choice for genotyping projects. Thus, the subsequent step is to use SNP genotypes for parentage verification as well. Recent developments of algorithms such as evaluating opposing homozygous SNP genotypes have drawbacks, for example the inability of rejecting all animals of a sample of potential parents. This paper describes an algorithm for parentage verification by constrained regression which overcomes the latter limitation and proves to be very fast and accurate even when the number of SNPs is as low as 50. The algorithm was tested on a sample of 14,816 animals with 50, 100 and 500 SNP genotypes randomly selected from 40k genotypes. The samples of putative parents of these animals contained either five random animals, or four random animals and the true sire. Parentage assignment was performed by ranking of regression coefficients, or by setting a minimum threshold for regression coefficients. The assignment quality was evaluated by the power of assignment (P[Formula: see text]) and the power of exclusion (P[Formula: see text]). If the sample of putative parents contained the true sire and parentage was assigned by coefficient ranking, P[Formula: see text] and P[Formula: see text] were both higher than 0.99 for the 500 and 100 SNP genotypes, and higher than 0.98 for the 50 SNP genotypes. When parentage was assigned by a coefficient threshold, P[Formula: see text] was higher than 0.99 regardless of the number of SNPs, but P[Formula: see text] decreased from 0.99 (500 SNPs) to 0.97 (100 SNPs) and 0.92 (50 SNPs). If the sample of putative parents did not contain the true sire and parentage was rejected using a coefficient threshold, the algorithm achieved a P[Formula: see text] of 1 (500 SNPs), 0.99 (100 SNPs) and 0.97 (50 SNPs). The algorithm described here is easy to implement

  17. Vitis phylogenomics: hybridization intensities from a SNP array outperform genotype calls.

    Directory of Open Access Journals (Sweden)

    Allison J Miller

    Full Text Available Understanding relationships among species is a fundamental goal of evolutionary biology. Single nucleotide polymorphisms (SNPs identified through next generation sequencing and related technologies enable phylogeny reconstruction by providing unprecedented numbers of characters for analysis. One approach to SNP-based phylogeny reconstruction is to identify SNPs in a subset of individuals, and then to compile SNPs on an array that can be used to genotype additional samples at hundreds or thousands of sites simultaneously. Although powerful and efficient, this method is subject to ascertainment bias because applying variation discovered in a representative subset to a larger sample favors identification of SNPs with high minor allele frequencies and introduces bias against rare alleles. Here, we demonstrate that the use of hybridization intensity data, rather than genotype calls, reduces the effects of ascertainment bias. Whereas traditional SNP calls assess known variants based on diversity housed in the discovery panel, hybridization intensity data survey variation in the broader sample pool, regardless of whether those variants are present in the initial SNP discovery process. We apply SNP genotype and hybridization intensity data derived from the Vitis9kSNP array developed for grape to show the effects of ascertainment bias and to reconstruct evolutionary relationships among Vitis species. We demonstrate that phylogenies constructed using hybridization intensities suffer less from the distorting effects of ascertainment bias, and are thus more accurate than phylogenies based on genotype calls. Moreover, we reconstruct the phylogeny of the genus Vitis using hybridization data, show that North American subgenus Vitis species are monophyletic, and resolve several previously poorly known relationships among North American species. This study builds on earlier work that applied the Vitis9kSNP array to evolutionary questions within Vitis vinifera

  18. Functional characterization of the Thr946Ala SNP at the type 1 diabetes IFIH1 locus.

    Science.gov (United States)

    Zouk, Hana; Marchand, Luc; Li, Quan; Polychronakos, Constantin

    2014-02-01

    The Thr allele at the Thr946Ala non-synonymous single-nucleotide polymorphism (nsSNP) in the IFIH1 gene confers risk for type 1 diabetes (T1D). IFIH1 binds viral double-stranded RNA (dsRNA), inducing a type I interferon (IFN) response. Reports of this nsSNP's role in IFIH1 expression regulation have produced conflicting results and a study evaluating transfected Thr946Ala protein alleles in an artificial system overexpressing IFIH1 shows that the SNP does not affect IFH1 function. In this study, we examine the effects of the Thr946Ala polymorphism on IFN-α response in a cell line that endogenously expresses physiological levels of IFIH1. Eleven lymphoblastoid cell lines (LCLs) homozygous for the major predisposing allele (Thr/Thr) and 6 LCLs homozygous for the minor protective allele (Ala/Ala) were electroporated with the viral dsRNA mimic, poly I:C, in three independent experiments. Media were collected 24 hours later and measured for IFN-α production by ELISA. Basal IFN response is minimal in mock-transfected cells from both genotypes and increases by about 8-fold in cells treated with poly I:C. LCLs with the Ala/Ala genotype have slightly higher IFN-α levels than their Thr/Thr counterparts but this did not reach statistical significance because of the large variability of the IFN response, due mostly to two high outliers (biological, not technical). A larger sample size would be needed to determine whether the Thr946Ala SNP affects the poly I:C-driven IFN-α response. Additionally, the possibility that this nsSNP recognizes viral dsRNA specificities cannot be ruled out. Thus, the mechanism of the observed association of this SNP with T1D remains to be determined.

  19. Environmental Application of Reporter-Genes Based Biosensors for Chemical Contamination Screening

    Directory of Open Access Journals (Sweden)

    Matejczyk Marzena

    2014-12-01

    Full Text Available The paper presents results of research concerning possibilities of applications of reporter-genes based microorganisms, including the selective presentation of defects and advantages of different new scientific achievements of methodical solutions in genetic system constructions of biosensing elements for environmental research. The most robust and popular genetic fusion and new trends in reporter genes technology – such as LacZ (β-galactosidase, xylE (catechol 2,3-dioxygenase, gfp (green fluorescent proteins and its mutated forms, lux (prokaryotic luciferase, luc (eukaryotic luciferase, phoA (alkaline phosphatase, gusA and gurA (β-glucuronidase, antibiotics and heavy metals resistance are described. Reporter-genes based biosensors with use of genetically modified bacteria and yeast successfully work for genotoxicity, bioavailability and oxidative stress assessment for detection and monitoring of toxic compounds in drinking water and different environmental samples, surface water, soil, sediments.

  20. A 50 SNP-multiplex mass spectrometry assay for human identification

    DEFF Research Database (Denmark)

    Wächter, Andrea; Mengel-From, Jonas; Børsting, Claus

    2008-01-01

    We developed a 50 SNP-multiplex assay for detection on a MALDI-TOF MS platform based on the SNPs in the 52 SNP-multiplex assay recently developed by the SNPforID Consortium. After PCR amplification, the products were purified on Qiagen columns and used as templates in one single base extension (SBE...... primers were extended with biotin labelled ddNTPs and purified on avidin beads ensuring that only the extended SBE primers were isolated and spotted on the MALDI-TOF anchor target. Detection of the 50 extended primers from the SBE reaction was performed in a mass range between 3000 and 10,000 m/z...

  1. Radiopaque anastomosis marker

    International Nuclear Information System (INIS)

    Elliott, D.P.; Halseth, W.L.

    1977-01-01

    This invention relates to split ring markers fabricated in whole or in part from a radiopaque material, usually metal, having the terminal ends thereof and a medial portion formed to define eyelets by means of which said marker can be sutured to the tissue at the site of an anastomosis to provide a visual indication of its location when examined fluoroscopically

  2. Genotyping by Sequencing for SNP-Based Linkage Map Construction and QTL Analysis of Chilling Requirement and Bloom Date in Peach [Prunus persica (L. Batsch].

    Directory of Open Access Journals (Sweden)

    Douglas Gary Bielenberg

    Full Text Available Low-cost, high throughput genotyping methods are crucial to marker discovery and marker-assisted breeding efforts, but have not been available for many 'specialty crops' such as fruit and nut trees. Here we apply the Genotyping-By-Sequencing (GBS method developed for cereals to the discovery of single nucleotide polymorphisms (SNPs in a peach F2 mapping population. Peach is a genetic and genomic model within the Rosaceae and will provide a template for the use of this method with other members of this family. Our F2 mapping population of 57 genotypes segregates for bloom time (BD and chilling requirement (CR and we have extensively phenotyped this population. The population derives from a selfed F1 progeny of a cross between 'Hakuho' (high CR and 'UFGold' (low CR. We were able to successfully employ GBS and the TASSEL GBS pipeline without modification of the original methodology using the ApeKI restriction enzyme and multiplexing at an equivalent of 96 samples per Illumina HiSeq 2000 lane. We obtained hundreds of SNP markers which were then used to construct a genetic linkage map and identify quantitative trait loci (QTL for BD and CR.

  3. SNP-Seek II: A resource for allele mining and analysis of big genomic data in Oryza sativa

    Directory of Open Access Journals (Sweden)

    Locedie Mansueto

    2016-11-01

    In this paper, we discuss the datasets stored in SNP-Seek, architecture of the database and web application, interoperability methodologies in place, and discuss a few use cases demonstrating the utility of SNP-Seek for diversity analysis and molecular breeding.

  4. Functional SNP associated with birth weight in independent populations identified with a permutation step added to GBLUP-GWAS

    Science.gov (United States)

    This study was conducted as an initial assessment of a newly available genotyping assay containing about 34,000 common SNP included on previous SNP chips, and 199,000 sequence variants predicted to affect gene function. Objectives were to identify functional variants associated with birth weight in...

  5. Birth Characteristics and Childhood Leukemia Risk: Correlations With Genetic Markers.

    Science.gov (United States)

    Kennedy, Amy E; Kamdar, Kala Y; Lupo, Philip J; Okcu, Mehmet F; Scheurer, Michael E; Dorak, Mehmet T

    2015-07-01

    Birth characteristics such as birth order, birth weight, birth defects, and Down syndrome showed some of the first risk associations with childhood leukemia. Examinations of correlations between birth characteristics and leukemia risk markers have been limited to birth weight-related genetic polymorphisms. We integrated information on nongenetic and genetic markers by evaluating the relationship of birth characteristics, genetic markers for childhood acute lymphoblastic leukemia (ALL) susceptibility, and ALL risk together. The multiethnic study consisted of cases with childhood ALL (n=161) and healthy controls (n=261). Birth characteristic data were collected through questionnaires, and genotyping was achieved by TaqMan SNP Genotyping Assays. We observed risk associations for birth weight over 4000 g (odds ratios [OR]=1.93; 95% confidence interval [CI], 1.16-3.19), birth length (OR=1.18 per inch; 95% CI, 1.01-1.38), and with gestational age (OR=1.10 per week; 95% CI, 1.00-1.21). Only the HFE tag single-nucleotide polymorphism (SNP) rs9366637 showed an inverse correlation with a birth characteristic, gestational age, with a gene-dosage effect (P=0.005), and in interaction with a transferrin receptor rs3817672 genotype (Pinteraction=0.05). This correlation translated into a strong association for rs9366637 with preterm birth (OR=5.0; 95% CI, 1.19-20.9). Our study provides evidence for the involvement of prenatal events in the development of childhood ALL. The inverse correlation of rs9366637 with gestational age has implications on the design of HFE association studies in birth weight and childhood conditions using full-term newborns as controls.

  6. A case of false mother included with 46 autosomal STR markers.

    Science.gov (United States)

    Li, Li; Lin, Yuan; Liu, Yan; Zhu, Ruxin; Zhao, Zhenmin; Que, Tingzhi

    2015-01-01

    For solving a maternity case, 19 autosomal short tandem repeats (STRs) were amplified using the AmpFℓSTR(®) Sinofiler(TM) kit and PowerPlex(®) 16 System. Additional 27 autosomal STR loci were analyzed using two domestic kits AGCU 21+1 and STRtyper-10G. The combined maternity index (CMI) was calculated to be 3.3 × 10(13), but the putative mother denied that she had given birth to the child. In order to reach an accurate conclusion, further testing of 20 X-chromosomal short tandem repeats (X-STRs), 40 single nucleotide polymorphism (SNP) loci, and mitochondrial DNA (mtDNA) was carried out. The putative mother and the boy shared at least one allele at all 46 tested autosomal STR loci. But, according to the profile data of 20 X-STR and 40 SNP markers, different genotypes at 13 X-STR loci and five SNP loci excluded maternity. Mitochondrial profiles also clearly excluded the mother as a parent of the son because they have multiple differences. It was finally found that the putative mother is the sister of the biological father. Different kinds of genetic markers needfully supplement the use of autosomal STR loci in case where the putative parent is suspected to be related to the true parent.

  7. SNP calling using genotype model selection on high-throughput sequencing data

    KAUST Repository

    You, Na; Murillo, Gabriel; Su, Xiaoquan; Zeng, Xiaowei; Xu, Jian; Ning, Kang; Zhang, ShouDong; Zhu, Jian-Kang; Cui, Xinping

    2012-01-01

    calling SNPs. Thus, errors not involved in base-calling or alignment, such as those in genomic sample preparation, are not accounted for.Results: A novel method of consensus and SNP calling, Genotype Model Selection (GeMS), is given which accounts

  8. The impact of SNP fingerprinting and parentage analysis on the effectiveness of variety recommendations in cacao

    Science.gov (United States)

    Evidence for the impact of mislabeling and/or pollen contamination on consistency of field performance has been lacking to reinforce the need for strict adherence to quality control protocols in cacao seed garden and germplasm plot management. The present study used SNP fingerprinting at 64 loci to ...

  9. EvoSNP-DB: A database of genetic diversity in East Asian populations.

    Science.gov (United States)

    Kim, Young Uk; Kim, Young Jin; Lee, Jong-Young; Park, Kiejung

    2013-08-01

    Genome-wide association studies (GWAS) have become popular as an approach for the identification of large numbers of phenotype-associated variants. However, differences in genetic architecture and environmental factors mean that the effect of variants can vary across populations. Understanding population genetic diversity is valuable for the investigation of possible population specific and independent effects of variants. EvoSNP-DB aims to provide information regarding genetic diversity among East Asian populations, including Chinese, Japanese, and Korean. Non-redundant SNPs (1.6 million) were genotyped in 54 Korean trios (162 samples) and were compared with 4 million SNPs from HapMap phase II populations. EvoSNP-DB provides two user interfaces for data query and visualization, and integrates scores of genetic diversity (Fst and VarLD) at the level of SNPs, genes, and chromosome regions. EvoSNP-DB is a web-based application that allows users to navigate and visualize measurements of population genetic differences in an interactive manner, and is available online at [http://biomi.cdc.go.kr/EvoSNP/].

  10. Usefulness of the SNP microarray technology to identify rare mutations in the case of perinatal death

    DEFF Research Database (Denmark)

    Hoeffding, L. K.; Kock, K. F.; Johnsen, Iben Birgit Gade

    2015-01-01

    The single nucleotide polymorphism (SNP) microarray technology has emerged as a powerful tool to screen the whole genome for sub-microscopic duplications and deletions that are not detectable by traditional cytogenetic analysis. Case: We report a case of a female twin born at 27th week of gestation...

  11. An abbreviated SNP panel for ancestry assignment of honeybees (Apis mellifera)

    Science.gov (United States)

    This paper examines whether an abbreviated panel of 37 single nucleotide polymorphisms (SNPs) has the same power as a larger and more expensive panel of 95 SNPs to assign ancestry of honeybees (Apis mellifera) to three ancestral lineages. We selected 37 SNPs from the original 95 SNP panel using alle...

  12. New tools and methods for direct programmatic access to the dbSNP relational database.

    Science.gov (United States)

    Saccone, Scott F; Quan, Jiaxi; Mehta, Gaurang; Bolze, Raphael; Thomas, Prasanth; Deelman, Ewa; Tischfield, Jay A; Rice, John P

    2011-01-01

    Genome-wide association studies often incorporate information from public biological databases in order to provide a biological reference for interpreting the results. The dbSNP database is an extensive source of information on single nucleotide polymorphisms (SNPs) for many different organisms, including humans. We have developed free software that will download and install a local MySQL implementation of the dbSNP relational database for a specified organism. We have also designed a system for classifying dbSNP tables in terms of common tasks we wish to accomplish using the database. For each task we have designed a small set of custom tables that facilitate task-related queries and provide entity-relationship diagrams for each task composed from the relevant dbSNP tables. In order to expose these concepts and methods to a wider audience we have developed web tools for querying the database and browsing documentation on the tables and columns to clarify the relevant relational structure. All web tools and software are freely available to the public at http://cgsmd.isi.edu/dbsnpq. Resources such as these for programmatically querying biological databases are essential for viably integrating biological information into genetic association experiments on a genome-wide scale.

  13. Comparison of three PCR-based assays for SNP genotyping in sugar beet

    Science.gov (United States)

    Background: PCR allelic discrimination technologies have broad applications in the detection of single nucleotide polymorphisms (SNPs) in genetics and genomics. The use of fluorescence-tagged probes is the leading method for targeted SNP detection, but assay costs and error rates could be improved t...

  14. Genome-wide SNP association-based localization of a dwarfism gene in Friesian dwarf horses

    NARCIS (Netherlands)

    Orr, J.L.; Back, W.; Gu, J.; Leegwater, P.H.; Govindarajan, P.; Conroy, J.; Ducro, B.J.; Arendonk, van J.A.M.

    2010-01-01

    The recent completion of the horse genome and commercial availability of an equine SNP genotyping array has facilitated the mapping of disease genes. We report putative localization of the gene responsible for dwarfism, a trait in Friesian horses that is thought to have a recessive mode of

  15. Design and Characterization of a 52K SNP Chip for Goats

    NARCIS (Netherlands)

    Tosser-klopp, G.; Bardou, P.; Bouchez, O.; Cabau, C.; Crooijmans, R.P.M.A.; Dong, Y.; Donnadieu-Tonon, C.; Eggen, A.; Heuven, H.C.M.; Jamli, S.; Jiken, A.J.; Klopp, C.; Lawley, C.T.; McEwen, J.; Martin, P.; Moreno, C.R.; Mulsant, P.; Nabihoudine, I.; Pailhoux, E.; Palhiere, I.; Rupp, R.; Sarry, J.; Sayre, B.L.; Tircazes, A.; Wang, J.; Wang, W.; Zhang, W.G.

    2014-01-01

    The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a

  16. A SNP-centric database for the investigation of the human genome

    Directory of Open Access Journals (Sweden)

    Kohane Isaac S

    2004-03-01

    Full Text Available Abstract Background Single Nucleotide Polymorphisms (SNPs are an increasingly important tool for genetic and biomedical research. Although current genomic databases contain information on several million SNPs and are growing at a very fast rate, the true value of a SNP in this context is a function of the quality of the annotations that characterize it. Retrieving and analyzing such data for a large number of SNPs often represents a major bottleneck in the design of large-scale association studies. Description SNPper is a web-based application designed to facilitate the retrieval and use of human SNPs for high-throughput research purposes. It provides a rich local database generated by combining SNP data with the Human Genome sequence and with several other data sources, and offers the user a variety of querying, visualization and data export tools. In this paper we describe the structure and organization of the SNPper database, we review the available data export and visualization options, and we describe how the architecture of SNPper and its specialized data structures support high-volume SNP analysis. Conclusions The rich annotation database and the powerful data manipulation and presentation facilities it offers make SNPper a very useful online resource for SNP research. Its success proves the great need for integrated and interoperable resources in the field of computational biology, and shows how such systems may play a critical role in supporting the large-scale computational analysis of our genome.

  17. SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data.

    Science.gov (United States)

    Lee, Tae-Ho; Guo, Hui; Wang, Xiyin; Kim, Changsoo; Paterson, Andrew H

    2014-02-26

    Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data. We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline. Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis.

  18. Combinations of SNP genotypes from the Wellcome Trust Case Control Study of bipolar patients

    DEFF Research Database (Denmark)

    Mellerup, Erling; Jørgensen, Martin Balslev; Dam, Henrik

    2018-01-01

    Objectives: Combinations of genetic variants are the basis for polygenic disorders. We examined combinations of SNP genotypes taken from the 446 729 SNPs in The Wellcome Trust Case Control Study of bipolar patients. Methods: Parallel computing by graphics processing units, cloud computing, and data...

  19. The Association of FTO SNP rs9939609 with Weight Gain at University

    NARCIS (Netherlands)

    Meisel, S.F.; Beeken, R.J.; Jaarsveld, C.H.M. van; Wardle, J.

    2015-01-01

    AIM: We tested the hypothesis that the obesity-associated FTO SNP rs9939609 would be associated with clinically significant weight gain (>/= 5% of initial body weight) in the first year of university; a time identified as high risk for weight gain. METHODS: We collected anthropometric data from

  20. Affymetrix SNP array data for wild Dutch great tits (Parus major)

    NARCIS (Netherlands)

    Silva, Da Vinicius; Laine, Veronika N.; Bosse, M.; Oers, C.H.J.; Dibbits, B.W.; Visser, M.E.; Crooijmans, R.P.M.A.; Groenen, M.

    2018-01-01

    The great tit is a widely studied passerine bird species in ecology that, in the past decades, has provided important insights into speciation, phenology, behavior and microevolution. After completion of the great tit genome sequence, a customized high density 650k SNP array was developed enabling

  1. Advanced statistical tools for SNP arrays : signal calibration, copy number estimation and single array genotyping

    NARCIS (Netherlands)

    Rippe, Ralph Christian Alexander

    2012-01-01

    Fluorescence bias in in signals from individual SNP arrays can be calibrated using linear models. Given the data, the system of equations is very large, so a specialized symbolic algorithm was developed. These models are also used to illustrate that genomic waves do not exist, but are merely an

  2. Experience from large scale use of the EuroGenomics custom SNP chip in cattle

    DEFF Research Database (Denmark)

    Boichard, Didier A; Boussaha, Mekki; Capitan, Aurélien

    2018-01-01

    This article presents the strategy to evaluate candidate mutations underlying QTL or responsible for genetic defects, based upon the design and large-scale use of the Eurogenomics custom SNP chip set up for bovine genomic selection. Some variants under study originated from mapping genetic defect...

  3. Longevity and plasticity of CFTR provide an argument for noncanonical SNP organization in hominid DNA.

    Directory of Open Access Journals (Sweden)

    Aubrey E Hill

    Full Text Available Like many other ancient genes, the cystic fibrosis transmembrane conductance regulator (CFTR has survived for hundreds of millions of years. In this report, we consider whether such prodigious longevity of an individual gene--as opposed to an entire genome or species--should be considered surprising in the face of eons of relentless DNA replication errors, mutagenesis, and other causes of sequence polymorphism. The conventions that modern human SNP patterns result either from purifying selection or random (neutral drift were not well supported, since extant models account rather poorly for the known plasticity and function (or the established SNP distributions found in a multitude of genes such as CFTR. Instead, our analysis can be taken as a polemic indicating that SNPs in CFTR and many other mammalian genes may have been generated--and continue to accrue--in a fundamentally more organized manner than would otherwise have been expected. The resulting viewpoint contradicts earlier claims of 'directional' or 'intelligent design-type' SNP formation, and has important implications regarding the pace of DNA adaptation, the genesis of conserved non-coding DNA, and the extent to which eukaryotic SNP formation should be viewed as adaptive.

  4. Highly effective SNP-based association mapping and management of recessive defects in livestock

    DEFF Research Database (Denmark)

    Charlier, Carole; Coppieters, Wouter; Rollin, Frédéric

    2008-01-01

    The widespread use of elite sires by means of artificial insemination in livestock breeding leads to the frequent emergence of recessive genetic defects, which cause significant economic and animal welfare concerns. Here we show that the availability of genome-wide, high-density SNP panels, combi...

  5. A novel approach to analyzing fMRI and SNP data via parallel independent component analysis

    Science.gov (United States)

    Liu, Jingyu; Pearlson, Godfrey; Calhoun, Vince; Windemuth, Andreas

    2007-03-01

    There is current interest in understanding genetic influences on brain function in both the healthy and the disordered brain. Parallel independent component analysis, a new method for analyzing multimodal data, is proposed in this paper and applied to functional magnetic resonance imaging (fMRI) and a single nucleotide polymorphism (SNP) array. The method aims to identify the independent components of each modality and the relationship between the two modalities. We analyzed 92 participants, including 29 schizophrenia (SZ) patients, 13 unaffected SZ relatives, and 50 healthy controls. We found a correlation of 0.79 between one fMRI component and one SNP component. The fMRI component consists of activations in cingulate gyrus, multiple frontal gyri, and superior temporal gyrus. The related SNP component is contributed to significantly by 9 SNPs located in sets of genes, including those coding for apolipoprotein A-I, and C-III, malate dehydrogenase 1 and the gamma-aminobutyric acid alpha-2 receptor. A significant difference in the presences of this SNP component is found between the SZ group (SZ patients and their relatives) and the control group. In summary, we constructed a framework to identify the interactions between brain functional and genetic information; our findings provide new insight into understanding genetic influences on brain function in a common mental disorder.

  6. Assessing the Clinical Utility of SNP Microarray for Prader-Willi Syndrome due to Uniparental Disomy.

    Science.gov (United States)

    Santoro, Stephanie L; Hashimoto, Sayaka; McKinney, Aimee; Mihalic Mosher, Theresa; Pyatt, Robert; Reshmi, Shalini C; Astbury, Caroline; Hickey, Scott E

    2017-01-01

    Maternal uniparental disomy (UPD) 15 is one of the molecular causes of Prader-Willi syndrome (PWS), a multisystem disorder which presents with neonatal hypotonia and feeding difficulty. Current diagnostic algorithms differ regarding the use of SNP microarray to detect PWS. We retrospectively examined the frequency with which SNP microarray could identify regions of homozygosity (ROH) in patients with PWS. We determined that 7/12 (58%) patients with previously confirmed PWS by methylation analysis and microsatellite-positive UPD studies had ROH (>10 Mb) by SNP microarray. Additional assessment of 5,000 clinical microarrays, performed from 2013 to present, determined that only a single case of ROH for chromosome 15 was not caused by an imprinting disorder or identity by descent. We observed that ROH for chromosome 15 is rarely incidental and strongly associated with hypotonic infants having features of PWS. Although UPD microsatellite studies remain essential to definitively establish the presence of UPD, SNP microarray has important utility in the timely diagnostic algorithm for PWS. © 2017 S. Karger AG, Basel.

  7. Prediction of a deletion copy number variant by a dense SNP panel

    NARCIS (Netherlands)

    Kadri, N.K.; Koks, P.D.; Meuwissen, T.H.E.

    2012-01-01

    Background: A newly recognized type of genetic variation, Copy Number Variation (CNV), is detected in mammalian genomes, e.g. the cattle genome. This form of variation can potentially cause phenotypic variation. Our objective was to determine whether dense SNP (single nucleotide polymorphisms)

  8. Identification of Mendelian inconsistencies between SNP and pedigree Information of Sibs

    NARCIS (Netherlands)

    Calus, M.P.L.; Mulder, H.A.; Bastiaansen, J.W.M.

    2011-01-01

    Background Using SNP genotypes to apply genomic selection in breeding programs is becoming common practice. Tools to edit and check the quality of genotype data are required. Checking for Mendelian inconsistencies makes it possible to identify animals for which pedigree information and genotype

  9. Construction of a high-density DArTseq SNP-based genetic map and identification of genomic regions with segregation distortion in a genetic population derived from a cross between feral and cultivated-type watermelon.

    Science.gov (United States)

    Ren, Runsheng; Ray, Rumiana; Li, Pingfang; Xu, Jinhua; Zhang, Man; Liu, Guang; Yao, Xiefeng; Kilian, Andrzej; Yang, Xingping

    2015-08-01

    Watermelon [Citrullus lanatus (Thunb.) Matsum. & Nakai] is an economically important vegetable crop grown extensively worldwide. To facilitate the identification of agronomically important traits and provide new information for genetic and genomic research on this species, a high-density genetic linkage map of watermelon was constructed using an F2 population derived from a cross between elite watermelon cultivar K3 and wild watermelon germplasm PI 189225. Based on a sliding window approach, a total of 1,161 bin markers representing 3,465 SNP markers were mapped onto 11 linkage groups corresponding to the chromosome pair number of watermelon. The total length of the genetic map is 1,099.2 cM, with an average distance between bins of 1.0 cM. The number of markers in each chromosome varies from 62 in chromosome 07 to 160 in chromosome 05. The length of individual chromosomes ranged between 61.8 cM for chromosome 07 and 140.2 cM for chromosome 05. A total of 616 SNP bin markers showed significant (P watermelon cultivar K3 allele and 103 were skewed toward PI 189225. The number of SNPs and InDels per Mb varied considerably across the segregation distorted regions (SDRs) on each chromosome, and a mixture of dense and sparse SNPs and InDel SDRs coexisted on some chromosomes suggesting that SDRs were randomly distributed throughout the genome. Recombination rates varied greatly among each chromosome, from 2.0 to 4.2 centimorgans per megabase (cM/Mb). An inconsistency was found between the genetic and physical positions on the map for a segment on chromosome 11. The high-density genetic map described in the present study will facilitate fine mapping of quantitative trait loci, the identification of candidate genes, map-based cloning, as well as marker-assisted selection (MAS) in watermelon breeding programs.

  10. RS-SNP: a random-set method for genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Mukherjee Sayan

    2011-03-01

    Full Text Available Abstract Background The typical objective of Genome-wide association (GWA studies is to identify single-nucleotide polymorphisms (SNPs and corresponding genes with the strongest evidence of association (the 'most-significant SNPs/genes' approach. Borrowing ideas from micro-array data analysis, we propose a new method, named RS-SNP, for detecting sets of genes enriched in SNPs moderately associated to the phenotype. RS-SNP assesses whether the number of significant SNPs, with p-value P ≤ α, belonging to a given SNP set is statistically significant. The rationale of proposed method is that two kinds of null hypotheses are taken into account simultaneously. In the first null model the genotype and the phenotype are assumed to be independent random variables and the null distribution is the probability of the number of significant SNPs in greater than observed by chance. The second null model assumes the number of significant SNPs in depends on the size of and not on the identity of the SNPs in . Statistical significance is assessed using non-parametric permutation tests. Results We applied RS-SNP to the Crohn's disease (CD data set collected by the Wellcome Trust Case Control Consortium (WTCCC and compared the results with GENGEN, an approach recently proposed in literature. The enrichment analysis using RS-SNP and the set of pathways contained in the MSigDB C2 CP pathway collection highlighted 86 pathways rich in SNPs weakly associated to CD. Of these, 47 were also indicated to be significant by GENGEN. Similar results were obtained using the MSigDB C5 pathway collection. Many of the pathways found to be enriched by RS-SNP have a well-known connection to CD and often with inflammatory diseases. Conclusions The proposed method is a valuable alternative to other techniques for enrichment analysis of SNP sets. It is well founded from a theoretical and statistical perspective. Moreover, the experimental comparison with GENGEN highlights that it is

  11. SNP discovery in the transcriptome of white Pacific shrimp Litopenaeus vannamei by next generation sequencing.

    Directory of Open Access Journals (Sweden)

    Yang Yu

    Full Text Available The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies.

  12. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff

    Science.gov (United States)

    Cingolani, Pablo; Platts, Adrian; Wang, Le Lily; Coon, Melissa; Nguyen, Tung; Wang, Luan; Land, Susan J.; Lu, Xiangyi; Ruden, Douglas M.

    2012-01-01

    We describe a new computer program, SnpEff, for rapidly categorizing the effects of variants in genome sequences. Once a genome is sequenced, SnpEff annotates variants based on their genomic locations and predicts coding effects. Annotated genomic locations include intronic, untranslated region, upstream, downstream, splice site, or intergenic regions. Coding effects such as synonymous or non-synonymous amino acid replacement, start codon gains or losses, stop codon gains or losses, or frame shifts can be predicted. Here the use of SnpEff is illustrated by annotating ~356,660 candidate SNPs in ~117 Mb unique sequences, representing a substitution rate of ~1/305 nucleotides, between the Drosophila melanogaster w1118; iso-2; iso-3 strain and the reference y1; cn1 bw1 sp1 strain. We show that ~15,842 SNPs are synonymous and ~4,467 SNPs are non-synonymous (N/S ~0.28). The remaining SNPs are in other categories, such as stop codon gains (38 SNPs), stop codon losses (8 SNPs), and start codon gains (297 SNPs) in the 5′UTR. We found, as expected, that the SNP frequency is proportional to the recombination frequency (i.e., highest in the middle of chromosome arms). We also found that start-gain or stop-lost SNPs in Drosophila melanogaster often result in additions of N-terminal or C-terminal amino acids that are conserved in other Drosophila species. It appears that the 5′ and 3′ UTRs are reservoirs for genetic variations that changes the termini of proteins during evolution of the Drosophila genus. As genome sequencing is becoming inexpensive and routine, SnpEff enables rapid analyses of whole-genome sequencing data to be performed by an individual laboratory. PMID:22728672

  13. Effect of Myostatin SNP on muscle fiber properties in male Thoroughbred horses during training period.

    Science.gov (United States)

    Miyata, Hirofumi; Itoh, Rika; Sato, Fumio; Takebe, Naoya; Hada, Tetsuro; Tozaki, Teruaki

    2017-10-20

    Variants of the Myostatin gene have been shown to have an influence on muscle hypertrophy phenotypes in a wide range of mammalian species. Recently, a Thoroughbred horse with a C-Allele at the g.66493737C/T single-nucleotide polymorphism (SNP) has been reported to be suited to short-distance racing. In this study, we examined the effect of the Myostatin SNP on muscle fiber properties in young Thoroughbred horses during a training period. To investigate the effect of the Myostatin SNP on muscle fiber before training, several mRNA expressions were relatively quantified in biopsy samples from the middle gluteal muscle of 27 untrained male Thoroughbred horses (1.5 years old) using real-time RT-PCR analysis. The remaining muscle samples were used for immunohistochemical analysis to determine the population and area of each fiber type. All measurements were revaluated in biopsy samples of the same horses after a 5-month period of conventional training. Although the expressions of Myostatin mRNA decreased in all SNP genotypes, a significant decrease was found in only the C/C genotype after training. While, expression of VEGFa, PGC1α, and SDHa mRNAs, which relate to the biogenesis of mitochondria and capillaries, was significantly higher (54-82%) in the T/T than the C/C genotypes after training. It is suggested that hypertrophy of muscle fiber is directly associated with a decrease in Myostatin mRNA expression in the C/C genotype, and that increased expressions of VEGFa, PGC1α, and SDHa in the T/T genotype might be indirectly caused by the Myostatin SNP.

  14. Multiple marker abundance profiling

    DEFF Research Database (Denmark)

    Hooper, Cornelia M.; Stevens, Tim J.; Saukkonen, Anna

    2017-01-01

    proteins and the scoring accuracy of lower-abundance proteins in Arabidopsis. NPAS was combined with subcellular protein localization data, facilitating quantitative estimations of organelle abundance during routine experimental procedures. A suite of targeted proteomics markers for subcellular compartment...

  15. (DArT) markers

    Indian Academy of Sciences (India)

    2EH Graham Centre for Agricultural Innovation (NSW Department of Industry and Investment and Charles Sturt. University), P. O. Box 588 Wagga Wagga, NSW 2650, Australia. 3Guangxi .... and obtain marker statistics. The exact order of the ...

  16. VT Roadside Historic Markers

    Data.gov (United States)

    Vermont Center for Geographic Information — Roadside Historic Site Marker program has proven an effective way to commemorate Vermont’s many people, events, and places of regional, statewide, or national...

  17. SUPLEMENTASI Lactobacillus acidophilus SNP-2 PADA TAPE DAN PENGARUHNYA PADA RELAWAN [Supplementation of Lactocbacillus acidophilus SNP-2 Into Tape and its Effect to the Volunteer

    Directory of Open Access Journals (Sweden)

    Endang S Rahayu1

    2004-08-01

    Full Text Available Functional food is defined as any potentially healthful food or food ingredient that may provide a health benefit beyond the traditional nutrients it contains. Many researches have been conducted on the health benefit of probiotic (life bacterial cells, one of the ingredient of functional foods. One of the potential bacteria used for probiotic agent and also involved in traditional fermented foods are lactic acid bacteria (LAB. Previous research showed that Lactobacillus acidophilus SNP-2 isolated from faecal material of healthy infant is resistant to acid and bile salt, and has an antagonistic effect against several enteric bacterial pathogens. The objective of this research was to study the effect of L. acidophilus SNP-2 as probiotic agent to the health benefits. These bacteria were supplemented into tape ketan (fermented sticky rice, the indigenous Indonesian fermented food. Tape ketan was chosen as the carrier of probiotic biomass based on the high population of LAB in this product, i.e., 1.3 x 108 CFU/g. Addition of L. acidophilus SNP-2 biomass prior to fermentation of tape ketan resulted in a higher total of LAB cells, i.e. 2.1 x 109 CFU/g compared to the amount of 1.5 x 108 CFU/g when the addition was done after fermentation. Consumption of tape ketan containing probiotic agent by the volunteers increased the population of lactobacilli (from 1.7x107 CFU/g to 9.9x107 CFU/g and decreased the population of enterobacteriacea (from 5.4x109 CFU/g to 4.4x108 in their faecal material. This phenomenon revealed that probiotic agent was able to colonize and inhibit the growth of enterobacteriaceae in the gastrointestinal tract. The result implied that tape ketan can be used as a carrier for probiotic agent and it can be categorized as functional food

  18. KMgene: a unified R package for gene-based association analysis for complex traits.

    Science.gov (United States)

    Yan, Qi; Fang, Zhou; Chen, Wei; Stegle, Oliver

    2018-02-09

    In this report, we introduce an R package KMgene for performing gene-based association tests for familial, multivariate or longitudinal traits using kernel machine (KM) regression under a generalized linear mixed model (GLMM) framework. Extensive simulations were performed to evaluate the validity of the approaches implemented in KMgene. http://cran.r-project.org/web/packages/KMgene. qi.yan@chp.edu or wei.chen@chp.edu. Supplementary data are available at Bioinformatics online. © The Author(s) 2018. Published by Oxford University Press.

  19. Applications of gene-based technologies for improving animal production and health in developing countries

    International Nuclear Information System (INIS)

    Makkar, H.P.S.; Viljoen, G.J.

    2005-01-01

    This book provides a compilation of peer-reviewed scientific contributions from authoritative researchers attending an international symposium convened by the Animal Production and Health Sub-programme of the Animal Production and Health (APH), Joint FAO/IAEA Programme in cooperation with the Animal Production and Health Division of the FAO. These Proceedings contain invaluable information on the role and future potential of gene-based technologies for improving animal production and health, possible applications and constraints in the use of this technology in developing countries and their specific research needs

  20. Effect of Co-segregating Markers on High-Density Genetic Maps and Prediction of Map Expansion Using Machine Learning Algorithms.

    Science.gov (United States)

    N'Diaye, Amidou; Haile, Jemanesh K; Fowler, D Brian; Ammar, Karim; Pozniak, Curtis J

    2017-01-01

    Advances in sequencing and genotyping methods have enable cost-effective production of high throughput single nucleotide polymorphism (SNP) markers, making them the choice for linkage mapping. As a result, many laboratories have developed high-throughput SNP assays and built high-density genetic maps. However, the number of markers may, by orders of magnitude, exceed the resolution of recombination for a given population size so that only a minority of markers can accurately be ordered. Another issue attached to the so-called 'large p, small n' problem is that high-density genetic maps inevitably result in many markers clustering at the same position (co-segregating markers). While there are a number of related papers, none have addressed the impact of co-segregating markers on genetic maps. In the present study, we investigated the effects of co-segregating markers on high-density genetic map length and marker order using empirical data from two populations of wheat, Mohawk × Cocorit (durum wheat) and Norstar × Cappelle Desprez (bread wheat). The maps of both populations consisted of 85% co-segregating markers. Our study clearly showed that excess of co-segregating markers can lead to map expansion, but has little effect on markers order. To estimate the inflation factor (IF), we generated a total of 24,473 linkage maps (8,203 maps for Mohawk × Cocorit and 16,270 maps for Norstar × Cappelle Desprez). Using seven machine learning algorithms, we were able to predict with an accuracy of 0.7 the map expansion due to the proportion of co-segregating markers. For example in Mohawk × Cocorit, with 10 and 80% co-segregating markers the length of the map inflated by 4.5 and 16.6%, respectively. Similarly, the map of Norstar × Cappelle Desprez expanded by 3.8 and 11.7% with 10 and 80% co-segregating markers. With the increasing number of markers on SNP-chips, the proportion of co-segregating markers in high-density maps will continue to increase making map expansion

  1. Effect of Co-segregating Markers on High-Density Genetic Maps and Prediction of Map Expansion Using Machine Learning Algorithms

    Directory of Open Access Journals (Sweden)

    Amidou N’Diaye

    2017-08-01

    Full Text Available Advances in sequencing and genotyping methods have enable cost-effective production of high throughput single nucleotide polymorphism (SNP markers, making them the choice for linkage mapping. As a result, many laboratories have developed high-throughput SNP assays and built high-density genetic maps. However, the number of markers may, by orders of magnitude, exceed the resolution of recombination for a given population size so that only a minority of markers can accurately be ordered. Another issue attached to the so-called ‘large p, small n’ problem is that high-density genetic maps inevitably result in many markers clustering at the same position (co-segregating markers. While there are a number of related papers, none have addressed the impact of co-segregating markers on genetic maps. In the present study, we investigated the effects of co-segregating markers on high-density genetic map length and marker order using empirical data from two populations of wheat, Mohawk × Cocorit (durum wheat and Norstar × Cappelle Desprez (bread wheat. The maps of both populations consisted of 85% co-segregating markers. Our study clearly showed that excess of co-segregating markers can lead to map expansion, but has little effect on markers order. To estimate the inflation factor (IF, we generated a total of 24,473 linkage maps (8,203 maps for Mohawk × Cocorit and 16,270 maps for Norstar × Cappelle Desprez. Using seven machine learning algorithms, we were able to predict with an accuracy of 0.7 the map expansion due to the proportion of co-segregating markers. For example in Mohawk × Cocorit, with 10 and 80% co-segregating markers the length of the map inflated by 4.5 and 16.6%, respectively. Similarly, the map of Norstar × Cappelle Desprez expanded by 3.8 and 11.7% with 10 and 80% co-segregating markers. With the increasing number of markers on SNP-chips, the proportion of co-segregating markers in high-density maps will continue to increase

  2. When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.

    Science.gov (United States)

    Gardner, Shea N; Hall, Barry G

    2013-01-01

    Effective use of rapid and inexpensive whole genome sequencing for microbes requires fast, memory efficient bioinformatics tools for sequence comparison. The kSNP v2 software finds single nucleotide polymorphisms (SNPs) in whole genome data. kSNP v2 has numerous improvements over kSNP v1 including SNP gene annotation; better scaling for draft genomes available as assembled contigs or raw, unassembled reads; a tool to identify the optimal value of k; distribution of packages of executables for Linux and Mac OS X for ease of installation and user-friendly use; and a detailed User Guide. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a single reference genome. Most target sets with hundreds of genomes complete in minutes to hours. SNP phylogenies are built by maximum likelihood, parsimony, and distance, based on all SNPs, only core SNPs, or SNPs present in some intermediate user-specified fraction of targets. The SNP-based trees that result are consistent with known taxonomy. kSNP v2 can handle many gigabases of sequence in a single run, and if one or more annotated genomes are included in the target set, SNPs are annotated with protein coding and other information (UTRs, etc.) from Genbank file(s). We demonstrate application of kSNP v2 on sets of viral and bacterial genomes, and discuss in detail analysis of a set of 68 finished E. coli and Shigella genomes and a set of the same genomes to which have been added 47 assemblies and four "raw read" genomes of H104:H4 strains from the recent European E. coli outbreak that resulted in both bloody diarrhea and hemolytic uremic syndrome (HUS), and caused at least 50 deaths.

  3. Identification of mitochondrial DNA sequence variation and development of single nucleotide polymorphic markers for CMS-D8 in cotton.

    Science.gov (United States)

    Suzuki, Hideaki; Yu, Jiwen; Wang, Fei; Zhang, Jinfa

    2013-06-01

    Cytoplasmic male sterility (CMS), which is a maternally inherited trait and controlled by novel chimeric genes in the mitochondrial genome, plays a pivotal role in the production of hybrid seed. In cotton, no PCR-based marker has been developed to discriminate CMS-D8 (from Gossypium trilobum) from its normal Upland cotton (AD1, Gossypium hirsutum) cytoplasm. The objective of the current study was to develop PCR-based single nucleotide polymorphic (SNP) markers from mitochondrial genes for the CMS-D8 cytoplasm. DNA sequence variation in mitochondrial genes involved in the oxidative phosphorylation chain including ATP synthase subunit 1, 4, 6, 8 and 9, and cytochrome c oxidase 1, 2 and 3 subunits were identified by comparing CMS-D8, its isogenic maintainer and restorer lines on the same nuclear genetic background. An allelic specific PCR (AS-PCR) was utilized for SNP typing by incorporating artificial mismatched nucleotides into the third or fourth base from the 3' terminus in both the specific and nonspecific primers. The result indicated that the method modifying allele-specific primers was successful in obtaining eight SNP markers out of eight SNPs using eight primer pairs to discriminate two alleles between AD1 and CMS-D8 cytoplasms. Two of the SNPs for atp1 and cox1 could also be used in combination to discriminate between CMS-D8 and CMS-D2 cytoplasms. Additionally, a PCR-based marker from a nine nucleotide insertion-deletion (InDel) sequence (AATTGTTTT) at the 59-67 bp positions from the start codon of atp6, which is present in the CMS and restorer lines with the D8 cytoplasm but absent in the maintainer line with the AD1 cytoplasm, was also developed. A SNP marker for two nucleotide substitutions (AA in AD1 cytoplasm to CT in CMS-D8 cytoplasm) in the intron (1,506 bp) of cox2 gene was also developed. These PCR-based SNP markers should be useful in discriminating CMS-D8 and AD1 cytoplasms, or those with CMS-D2 cytoplasm as a rapid, simple, inexpensive, and

  4. Criteria of GenCall score to edit marker data and methods to handle missing markers have an influence on accuracy of genomic predictions

    DEFF Research Database (Denmark)

    Edriss, Vahid; Guldbrandtsen, Bernt; Lund, Mogens Sandø

    2013-01-01

    The aim of this study was to investigate the effect of different strategies for handling low-quality or missing data on prediction accuracy for direct genomic values of protein yield, mastitis and fertility using a Bayesian variable model and a GBLUP model in the Danish Jersey population. The data...... contained 1071 Jersey bulls that were genotyped with the Illumina Bovine 50K chip. After preliminary editing, 39227 SNP remained in the dataset. Four methods to handle missing genotypes were: 1) BEAGLE: missing markers were imputed using Beagle 3.3 software, 2) COMMON: missing genotypes at a locus were...

  5. Dynamic variable selection in SNP genotype autocalling from APEX microarray data

    Directory of Open Access Journals (Sweden)

    Zamar Ruben H

    2006-11-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs are DNA sequence variations, occurring when a single nucleotide – adenine (A, thymine (T, cytosine (C or guanine (G – is altered. Arguably, SNPs account for more than 90% of human genetic variation. Our laboratory has developed a highly redundant SNP genotyping assay consisting of multiple probes with signals from multiple channels for a single SNP, based on arrayed primer extension (APEX. This mini-sequencing method is a powerful combination of a highly parallel microarray with distinctive Sanger-based dideoxy terminator sequencing chemistry. Using this microarray platform, our current genotype calling system (known as SNP Chart is capable of calling single SNP genotypes by manual inspection of the APEX data, which is time-consuming and exposed to user subjectivity bias. Results Using a set of 32 Coriell DNA samples plus three negative PCR controls as a training data set, we have developed a fully-automated genotyping algorithm based on simple linear discriminant analysis (LDA using dynamic variable selection. The algorithm combines separate analyses based on the multiple probe sets to give a final posterior probability for each candidate genotype. We have tested our algorithm on a completely independent data set of 270 DNA samples, with validated genotypes, from patients admitted to the intensive care unit (ICU of St. Paul's Hospital (plus one negative PCR control sample. Our method achieves a concordance rate of 98.9% with a 99.6% call rate for a set of 96 SNPs. By adjusting the threshold value for the final posterior probability of the called genotype, the call rate reduces to 94.9% with a higher concordance rate of 99.6%. We also reversed the two independent data sets in their training and testing roles, achieving a concordance rate up to 99.8%. Conclusion The strength of this APEX chemistry-based platform is its unique redundancy having multiple probes for a single SNP. Our

  6. SWPhylo - A Novel Tool for Phylogenomic Inferences by Comparison of Oligonucleotide Patterns and Integration of Genome-Based and Gene-Based Phylogenetic Trees.

    Science.gov (United States)

    Yu, Xiaoyu; Reva, Oleg N

    2018-01-01

    Modern phylogenetic studies may benefit from the analysis of complete genome sequences of various microorganisms. Evolutionary inferences based on genome-scale analysis are believed to be more accurate than the gene-based alternative. However, the computational complexity of current phylogenomic procedures, inappropriateness of standard phylogenetic tools to process genome-wide data, and lack of reliable substitution models which correlates with alignment-free phylogenomic approaches deter microbiologists from using these opportunities. For example, the super-matrix and super-tree approaches of phylogenomics use multiple integrated genomic loci or individual gene-based trees to infer an overall consensus tree. However, these approaches potentially multiply errors of gene annotation and sequence alignment not mentioning the computational complexity and laboriousness of the methods. In this article, we demonstrate that the annotation- and alignment-free comparison of genome-wide tetranucleotide frequencies, termed oligonucleotide usage patterns (OUPs), allowed a fast and reliable inference of phylogenetic trees. These were congruent to the corresponding whole genome super-matrix trees in terms of tree topology when compared with other known approaches including 16S ribosomal RNA and GyrA protein sequence comparison, complete genome-based MAUVE, and CVTree methods. A Web-based program to perform the alignment-free OUP-based phylogenomic inferences was implemented at http://swphylo.bi.up.ac.za/. Applicability of the tool was tested on different taxa from subspecies to intergeneric levels. Distinguishing between closely related taxonomic units may be enforced by providing the program with alignments of marker protein sequences, eg, GyrA.

  7. SWPhylo – A Novel Tool for Phylogenomic Inferences by Comparison of Oligonucleotide Patterns and Integration of Genome-Based and Gene-Based Phylogenetic Trees

    Science.gov (United States)

    Yu, Xiaoyu; Reva, Oleg N

    2018-01-01

    Modern phylogenetic studies may benefit from the analysis of complete genome sequences of various microorganisms. Evolutionary inferences based on genome-scale analysis are believed to be more accurate than the gene-based alternative. However, the computational complexity of current phylogenomic procedures, inappropriateness of standard phylogenetic tools to process genome-wide data, and lack of reliable substitution models which correlates with alignment-free phylogenomic approaches deter microbiologists from using these opportunities. For example, the super-matrix and super-tree approaches of phylogenomics use multiple integrated genomic loci or individual gene-based trees to infer an overall consensus tree. However, these approaches potentially multiply errors of gene annotation and sequence alignment not mentioning the computational complexity and laboriousness of the methods. In this article, we demonstrate that the annotation- and alignment-free comparison of genome-wide tetranucleotide frequencies, termed oligonucleotide usage patterns (OUPs), allowed a fast and reliable inference of phylogenetic trees. These were congruent to the corresponding whole genome super-matrix trees in terms of tree topology when compared with other known approaches including 16S ribosomal RNA and GyrA protein sequence comparison, complete genome-based MAUVE, and CVTree methods. A Web-based program to perform the alignment-free OUP-based phylogenomic inferences was implemented at http://swphylo.bi.up.ac.za/. Applicability of the tool was tested on different taxa from subspecies to intergeneric levels. Distinguishing between closely related taxonomic units may be enforced by providing the program with alignments of marker protein sequences, eg, GyrA. PMID:29511354

  8. Weighted functional linear regression models for gene-based association analysis.

    Science.gov (United States)

    Belonogova, Nadezhda M; Svishcheva, Gulnara R; Wilson, James F; Campbell, Harry; Axenovich, Tatiana I

    2018-01-01

    Functional linear regression models are effectively used in gene-based association analysis of complex traits. These models combine information about individual genetic variants, taking into account their positions and reducing the influence of noise and/or observation errors. To increase the power of methods, where several differently informative components are combined, weights are introduced to give the advantage to more informative components. Allele-specific weights have been introduced to collapsing and kernel-based approaches to gene-based association analysis. Here we have for the first time introduced weights to functional linear regression models adapted for both independent and family samples. Using data simulated on the basis of GAW17 genotypes and weights defined by allele frequencies via the beta distribution, we demonstrated that type I errors correspond to declared values and that increasing the weights of causal variants allows the power of functional linear models to be increased. We applied the new method to real data on blood pressure from the ORCADES sample. Five of the six known genes with P models. Moreover, we found an association between diastolic blood pressure and the VMP1 gene (P = 8.18×10-6), when we used a weighted functional model. For this gene, the unweighted functional and weighted kernel-based models had P = 0.004 and 0.006, respectively. The new method has been implemented in the program package FREGAT, which is freely available at https://cran.r-project.org/web/packages/FREGAT/index.html.

  9. Molecular markers in glioma.

    Science.gov (United States)

    Ludwig, Kirsten; Kornblum, Harley I

    2017-09-01

    Gliomas are the most malignant and aggressive form of brain tumors, and account for the majority of brain cancer related deaths. Malignant gliomas, including glioblastoma are treated with radiation and temozolomide, with only a minor benefit in survival time. A number of advances have been made in understanding glioma biology, including the discovery of cancer stem cells, termed glioma stem cells (GSC). Some of these advances include the delineation of molecular heterogeneity both between tumors from different patients as well as within tumors from the same patient. Such research highlights the importance of identifying and validating molecular markers in glioma. This review, intended as a practical resource for both clinical and basic investigators, summarizes some of the more well-known molecular markers (MGMT, 1p/19q, IDH, EGFR, p53, PI3K, Rb, and RAF), discusses how they are identified, and what, if any, clinical relevance they may have, in addition to discussing some of the specific biology for these markers. Additionally, we discuss identification methods for studying putative GSC's (CD133, CD15, A2B5, nestin, ALDH1, proteasome activity, ABC transporters, and label-retention). While much research has been done on these markers, there is still a significant amount that we do not yet understand, which may account for some conflicting reports in the literature. Furthermore, it is unlikely that the investigator will be able to utilize one single marker to prospectively identify and isolate GSC from all, or possibly, any gliomas.

  10. Tumour markers in urology

    International Nuclear Information System (INIS)

    Schmid, L.; Fornara, P.; Fabricius, P.G.

    1988-01-01

    The same applies essentially also for the bladder carcinomas: There is no reliable marker for these cancers which would be useful for clinical purposes. TPA has proven to be too non-specific in malignoma-detection and therefore hardly facilitates clinical decision-making in individual cases. The CEA is not sensitive enough to be recommendable for routine application. However, in advanced stages a CEA examination may be useful if applied within the scope of therapeutic efforts made to evaluate efficacy. In cases of carcinomas of the prostate the sour prostate-specific phosphatase (SPP) and, more recently, especially the prostate-specific antigen (PSA) have proven in follow-up and therapy monitoring, whereby the PSA is superior to the SPP. Nevertheless, both these markers should be employed in therapy monitoring because differences in behaviour will be observed when the desired treatment effect is only achieved in one of the two markers producing tumour cell clonuses. Both markers, but especially the PSA, are quite reliably in agreement with the result of the introduced chemo-/hormone therapy, whereby an increase may be a sure indicator of relapse several months previous to clinical symptoms, imaging procedures, so-called routine laboratory results and subjective complaints. However, none of the 2 markers is appropriate for the purposes of screening or early diagnosis of carcinomas of the prostate. (orig.) [de

  11. Avirulence (AVR) Gene-Based Diagnosis Complements Existing Pathogen Surveillance Tools for Effective Deployment of Resistance (R) Genes Against Rice Blast Disease.

    Science.gov (United States)

    Selisana, S M; Yanoria, M J; Quime, B; Chaipanya, C; Lu, G; Opulencia, R; Wang, G-L; Mitchell, T; Correll, J; Talbot, N J; Leung, H; Zhou, B

    2017-06-01

    Avirulence (AVR) genes in Magnaporthe oryzae, the fungal pathogen that causes the devastating rice blast disease, have been documented to be major targets subject to mutations to avoid recognition by resistance (R) genes. In this study, an AVR-gene-based diagnosis tool for determining the virulence spectrum of a rice blast pathogen population was developed and validated. A set of 77 single-spore field isolates was subjected to pathotype analysis using differential lines, each containing a single R gene, and classified into 20 virulent pathotypes, except for 4 isolates that lost pathogenicity. In all, 10 differential lines showed low frequency (95%), inferring the effectiveness of R genes present in the respective differential lines. In addition, the haplotypes of seven AVR genes were determined by polymerase chain reaction amplification and sequencing, if applicable. The calculated frequency of different AVR genes displayed significant variations in the population. AVRPiz-t and AVR-Pii were detected in 100 and 84.9% of the isolates, respectively. Five AVR genes such as AVR-Pik-D (20.5%) and AVR-Pik-E (1.4%), AVRPiz-t (2.7%), AVR-Pita (0%), AVR-Pia (0%), and AVR1-CO39 (0%) displayed low or even zero frequency. The frequency of AVR genes correlated almost perfectly with the resistance frequency of the cognate R genes in differential lines, except for International Rice Research Institute-bred blast-resistant lines IRBLzt-T, IRBLta-K1, and IRBLkp-K60. Both genetic analysis and molecular marker validation revealed an additional R gene, most likely Pi19 or its allele, in these three differential lines. This can explain the spuriously higher resistance frequency of each target R gene based on conventional pathotyping. This study demonstrates that AVR-gene-based diagnosis provides a precise, R-gene-specific, and differential line-free assessment method that can be used for determining the virulence spectrum of a rice blast pathogen population and for predicting the

  12. A comparison of genomic selection models across time in interior spruce (Picea engelmannii × glauca) using unordered SNP imputation methods.

    Science.gov (United States)

    Ratcliffe, B; El-Dien, O G; Klápště, J; Porth, I; Chen, C; Jaquish, B; El-Kassaby, Y A

    2015-12-01

    Genomic selection (GS) potentially offers an unparalleled advantage over traditional pedigree-based selection (TS) methods by reducing the time commitment required to carry out a single cycle of tree improvement. This quality is particularly appealing to tree breeders, where lengthy improvement cycles are the norm. We explored the prospect of implementing GS for interior spruce (Picea engelmannii × glauca) utilizing a genotyped population of 769 trees belonging to 25 open-pollinated families. A series of repeated tree height measurements through ages 3-40 years permitted the testing of GS methods temporally. The genotyping-by-sequencing (GBS) platform was used for single nucleotide polymorphism (SNP) discovery in conjunction with three unordered imputation methods applied to a data set with 60% missing information. Further, three diverse GS models were evaluated based on predictive accuracy (PA), and their marker effects. Moderate levels of PA (0.31-0.55) were observed and were of sufficient capacity to deliver improved selection response over TS. Additionally, PA varied substantially through time accordingly with spatial competition among trees. As expected, temporal PA was well correlated with age-age genetic correlation (r=0.99), and decreased substantially with increasing difference in age between the training and validation populations (0.04-0.47). Moreover, our imputation comparisons indicate that k-nearest neighbor and singular value decomposition yielded a greater number of SNPs and gave higher predictive accuracies than imputing with the mean. Furthermore, the ridge regression (rrBLUP) and BayesCπ (BCπ) models both yielded equal, and better PA than the generalized ridge regression heteroscedastic effect model for the traits evaluated.

  13. Genome-wide SNP scan of pooled DNA reveals nonsense mutation in FGF20 in the scaleless line of featherless chickens

    Directory of Open Access Journals (Sweden)

    Wells Kirsty L

    2012-06-01

    map genes based on genotyping of DNA samples from pooled whole blood. The identification of the sc mutation has important implications for the future breeding of this potentially useful trait for the poultry industry, and our genotyping assay can facilitate its rapid introgression into production lines.

  14. Using SNP array to identify aneuploidy and segmental imbalance in translocation carriers

    D