WorldWideScience

Sample records for genome-wide snp markers

  1. Use of SNP markers to conserve genome-wide genetic diversity in livestock

    NARCIS (Netherlands)

    Engelsma, K.A.

    2012-01-01

    Conservation of genetic diversity in livestock breeds is important since it is, both within and between breeds, under threat. The availability of large numbers of SNP markers has resulted in new opportunities to estimate genetic diversity in more detail, and to improve prioritization of animals

  2. Genetic relationships among Vietnamese local pigs investigated using genome-wide SNP markers.

    Science.gov (United States)

    Ishihara, S; Arakawa, A; Taniguchi, M; Luu, Q M; Pham, D L; Nguyen, B V; Mikawa, S; Kikuchi, K

    2018-02-01

    Vietnam is one of the most important countries for pig domestication, and a total of 26 local breeds have been reported. In the present study, genetic relationships among the various pig breeds were investigated using 90 samples collected from local pigs (15 breeds) in 15 distantly separated, distinct areas of the country and six samples from Landrace pigs in Hanoi as an out-group of a common Western breed. All samples were genotyped using the Illumina Porcine SNP60 v2 Genotyping BeadChip. We used 15 160-15 217 SNPs that showed a high degree of polymorphism in the Vietnamese breeds for identifying genetic relationships among the Vietnamese breeds. Principal components analysis showed that most pigs indigenous to Vietnam formed clusters correlated with their original geographic locations. Some Vietnamese breeds formed a cluster that was genetically related to the Western breed Landrace, suggesting the possibility of crossbreeding. These findings will be useful for the conservation and management of Vietnamese local pig breeds. © 2018 Stichting International Foundation for Animal Genetics.

  3. Characterizing the population structure and genetic diversity of maize breeding germplasm in Southwest China using genome-wide SNP markers.

    Science.gov (United States)

    Zhang, Xiao; Zhang, Hua; Li, Lujiang; Lan, Hai; Ren, Zhiyong; Liu, Dan; Wu, Ling; Liu, Hailan; Jaqueth, Jennifer; Li, Bailin; Pan, Guangtang; Gao, Shibin

    2016-08-31

    representatively not only illustrates the foundation and evolution trend of maize breeding resource as a theoretical reference for the improvement of heterosis, but also provides plenty of information for genetic researches such as genome-wide association study and marker-assisted selection in the future.

  4. Genome-Wide Association Study for Identification and Validation of Novel SNP Markers for Sr6 Stem Rust Resistance Gene in Bread Wheat.

    Science.gov (United States)

    Mourad, Amira M I; Sallam, Ahmed; Belamkar, Vikas; Wegulo, Stephen; Bowden, Robert; Jin, Yue; Mahdy, Ezzat; Bakheit, Bahy; El-Wafaa, Atif A; Poland, Jesse; Baenziger, Peter S

    2018-01-01

    Stem rust (caused by Puccinia graminis f. sp. tritici Erikss. & E. Henn.), is a major disease in wheat ( Triticum aestivium L.). However, in recent years it occurs rarely in Nebraska due to weather and the effective selection and gene pyramiding of resistance genes. To understand the genetic basis of stem rust resistance in Nebraska winter wheat, we applied genome-wide association study (GWAS) on a set of 270 winter wheat genotypes (A-set). Genotyping was carried out using genotyping-by-sequencing and ∼35,000 high-quality SNPs were identified. The tested genotypes were evaluated for their resistance to the common stem rust race in Nebraska (QFCSC) in two replications. Marker-trait association identified 32 SNP markers, which were significantly (Bonferroni corrected P < 0.05) associated with the resistance on chromosome 2D. The chromosomal location of the significant SNPs (chromosome 2D) matched the location of Sr6 gene which was expected in these genotypes based on pedigree information. A highly significant linkage disequilibrium (LD, r 2 ) was found between the significant SNPs and the specific SSR marker for the Sr6 gene ( Xcfd43 ). This suggests the significant SNP markers are tagging Sr6 gene. Out of the 32 significant SNPs, eight SNPs were in six genes that are annotated as being linked to disease resistance in the IWGSC RefSeq v1.0. The 32 significant SNP markers were located in nine haplotype blocks. All the 32 significant SNPs were validated in a set of 60 different genotypes (V-set) using single marker analysis. SNP markers identified in this study can be used in marker-assisted selection, genomic selection, and to develop KASP (Kompetitive Allele Specific PCR) marker for the Sr6 gene. Novel SNPs for Sr6 gene, an important stem rust resistant gene, were identified and validated in this study. These SNPs can be used to improve stem rust resistance in wheat.

  5. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    NARCIS (Netherlands)

    Chagné, D.; Crowhurst, R.N.; Troggio, M.; Davey, M.W.; Gilmore, B.; Lawley, C.; Vanderzande, S.; Hellens, R.P.; Kumar, S.; Cestaro, A.; Velasco, R.; Main, D.; Rees, J.D.; Iezzoni, A.F.; Mockler, T.; Wilhelm, L.; Weg, van de W.E.; Gardiner, S.E.; Bassil, N.; Peace, C.

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide

  6. Prediction of heterosis using genome-wide SNP-marker data: application to egg production traits in white Leghorn crosses

    NARCIS (Netherlands)

    Amuzu-Aweh, E.N.; Bijma, P.; Kinghorn, B.P.; verreijken, A.; Arendonk, van J.A.M.; Bovenhuis, H.

    2013-01-01

    Prediction of heterosis has a long history with mixed success, partly due to low numbers of genetic markers and/or small data sets. We investigated the prediction of heterosis for egg number, egg weight and survival days in domestic white Leghorns, using ~400¿000 individuals from 47 crosses and

  7. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    Science.gov (United States)

    Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718

  8. Genome-wide SNP detection, validation, and development of an 8K SNP array for apple.

    Directory of Open Access Journals (Sweden)

    David Chagné

    Full Text Available As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of 'Golden Delicious', SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional, and genomic selection in apple.

  9. Genome-wide linkage mapping of yield-related traits in three Chinese bread wheat populations using high-density SNP markers.

    Science.gov (United States)

    Li, Faji; Wen, Weie; He, Zhonghu; Liu, Jindong; Jin, Hui; Cao, Shuanghe; Geng, Hongwei; Yan, Jun; Zhang, Pingzhi; Wan, Yingxiu; Xia, Xianchun

    2018-06-01

    We identified 21 new and stable QTL, and 11 QTL clusters for yield-related traits in three bread wheat populations using the wheat 90 K SNP assay. Identification of quantitative trait loci (QTL) for yield-related traits and closely linked molecular markers is important in order to identify gene/QTL for marker-assisted selection (MAS) in wheat breeding. The objectives of the present study were to identify QTL for yield-related traits and dissect the relationships among different traits in three wheat recombinant inbred line (RIL) populations derived from crosses Doumai × Shi 4185 (D × S), Gaocheng 8901 × Zhoumai 16 (G × Z) and Linmai 2 × Zhong 892 (L × Z). Using the available high-density linkage maps previously constructed with the wheat 90 K iSelect single nucleotide polymorphism (SNP) array, 65, 46 and 53 QTL for 12 traits were identified in the three RIL populations, respectively. Among them, 34, 23 and 27 were likely to be new QTL. Eighteen common QTL were detected across two or three populations. Eleven QTL clusters harboring multiple QTL were detected in different populations, and the interval 15.5-32.3 cM around the Rht-B1 locus on chromosome 4BS harboring 20 QTL is an important region determining grain yield (GY). Thousand-kernel weight (TKW) is significantly affected by kernel width and plant height (PH), whereas flag leaf width can be used to select lines with large kernel number per spike. Eleven candidate genes were identified, including eight cloned genes for kernel, heading date (HD) and PH-related traits as well as predicted genes for TKW, spike length and HD. The closest SNP markers of stable QTL or QTL clusters can be used for MAS in wheat breeding using kompetitive allele-specific PCR or semi-thermal asymmetric reverse PCR assays for improvement of GY.

  10. Psoriasis prediction from genome-wide SNP profiles

    Directory of Open Access Journals (Sweden)

    Fang Xiangzhong

    2011-01-01

    Full Text Available Abstract Background With the availability of large-scale genome-wide association study (GWAS data, choosing an optimal set of SNPs for disease susceptibility prediction is a challenging task. This study aimed to use single nucleotide polymorphisms (SNPs to predict psoriasis from searching GWAS data. Methods Totally we had 2,798 samples and 451,724 SNPs. Process for searching a set of SNPs to predict susceptibility for psoriasis consisted of two steps. The first one was to search top 1,000 SNPs with high accuracy for prediction of psoriasis from GWAS dataset. The second one was to search for an optimal SNP subset for predicting psoriasis. The sequential information bottleneck (sIB method was compared with classical linear discriminant analysis(LDA for classification performance. Results The best test harmonic mean of sensitivity and specificity for predicting psoriasis by sIB was 0.674(95% CI: 0.650-0.698, while only 0.520(95% CI: 0.472-0.524 was reported for predicting disease by LDA. Our results indicate that the new classifier sIB performs better than LDA in the study. Conclusions The fact that a small set of SNPs can predict disease status with average accuracy of 68% makes it possible to use SNP data for psoriasis prediction.

  11. Exhaustive Genome-Wide Search for SNP-SNP Interactions Across 10 Human Diseases

    Directory of Open Access Journals (Sweden)

    William Murk

    2016-07-01

    Full Text Available The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs with minor allele frequency (MAF ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10−12. Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized.

  12. A genome wide survey of SNP variation reveals the genetic structure of sheep breeds.

    Directory of Open Access Journals (Sweden)

    James W Kijas

    Full Text Available The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identifying the first genome-wide set of SNP for sheep, we report on levels of genetic variability both within and between a diverse sample of ovine populations. Then, using cluster analysis and the partitioning of genetic variation, we demonstrate sheep are characterised by weak phylogeographic structure, overlapping genetic similarity and generally low differentiation which is consistent with their short evolutionary history. The degree of population substructure was, however, sufficient to cluster individuals based on geographic origin and known breed history. Specifically, African and Asian populations clustered separately from breeds of European origin sampled from Australia, New Zealand, Europe and North America. Furthermore, we demonstrate the presence of stratification within some, but not all, ovine breeds. The results emphasize that careful documentation of genetic structure will be an essential prerequisite when mapping the genetic basis of complex traits. Furthermore, the identification of a subset of SNP able to assign individuals into broad groupings demonstrates even a small panel of markers may be suitable for applications such as traceability.

  13. Genome wide in silico SNP-tumor association analysis

    International Nuclear Information System (INIS)

    Qiu, Ping; Wang, Luquan; Kostich, Mitch; Ding, Wei; Simon, Jason S; Greene, Jonathan R

    2004-01-01

    Carcinogenesis occurs, at least in part, due to the accumulation of mutations in critical genes that control the mechanisms of cell proliferation, differentiation and death. Publicly accessible databases contain millions of expressed sequence tag (EST) and single nucleotide polymorphism (SNP) records, which have the potential to assist in the identification of SNPs overrepresented in tumor tissue. An in silico SNP-tumor association study was performed utilizing tissue library and SNP information available in NCBI's dbEST (release 092002) and dbSNP (build 106). A total of 4865 SNPs were identified which were present at higher allele frequencies in tumor compared to normal tissues. A subset of 327 (6.7%) SNPs induce amino acid changes to the protein coding sequences. This approach identified several SNPs which have been previously associated with carcinogenesis, as well as a number of SNPs that now warrant further investigation This novel in silico approach can assist in prioritization of genes and SNPs in the effort to elucidate the genetic mechanisms underlying the development of cancer

  14. Genome-wide SNP identification in multiple morphotypes of allohexaploid tall fescue (Festuca arundinacea Schreb

    Directory of Open Access Journals (Sweden)

    Hand Melanie L

    2012-06-01

    GoldenGate™ assay is capable of high-throughput co-dominant SNP allele detection, and minimises the problems associated with SNP genotyping in a polyploid by effectively reducing the complexity to a diploid system. This SNP collection may now be refined and used in applications such as cultivar identification, genetic linkage map construction, genome-wide association studies and genomic selection in tall fescue. The bioinformatic pipeline described here represents an effective general method for SNP discovery within outbreeding allopolyploid species.

  15. Polygenic analysis of genome-wide SNP data identifies common variants on allergic rhinitis

    DEFF Research Database (Denmark)

    Mohammadnejad, Afsaneh; Brasch-Andersen, Charlotte; Haagerup, Annette

    Background: Allergic Rhinitis (AR) is a complex disorder that affects many people around the world. There is a high genetic contribution to the development of the AR, as twins and family studies have estimated heritability of more than 33%. Due to the complex nature of the disease, single SNP...... analysis has limited power in identifying the genetic variations for AR. We combined genome-wide association analysis (GWAS) with polygenic risk score (PRS) in exploring the genetic basis underlying the disease. Methods: We collected clinical data on 631 Danish subjects with AR cases consisting of 434...... sibling pairs and unrelated individuals and control subjects of 197 unrelated individuals. SNP genotyping was done by Affymetrix Genome-Wide Human SNP Array 5.0. SNP imputation was performed using "IMPUTE2". Using additive effect model, GWAS was conducted in discovery sample, the genotypes...

  16. SNPpy--database management for SNP data from genome wide association studies.

    Directory of Open Access Journals (Sweden)

    Faheem Mitha

    Full Text Available BACKGROUND: We describe SNPpy, a hybrid script database system using the Python SQLAlchemy library coupled with the PostgreSQL database to manage genotype data from Genome-Wide Association Studies (GWAS. This system makes it possible to merge study data with HapMap data and merge across studies for meta-analyses, including data filtering based on the values of phenotype and Single-Nucleotide Polymorphism (SNP data. SNPpy and its dependencies are open source software. RESULTS: The current version of SNPpy offers utility functions to import genotype and annotation data from two commercial platforms. We use these to import data from two GWAS studies and the HapMap Project. We then export these individual datasets to standard data format files that can be imported into statistical software for downstream analyses. CONCLUSIONS: By leveraging the power of relational databases, SNPpy offers integrated management and manipulation of genotype and phenotype data from GWAS studies. The analysis of these studies requires merging across GWAS datasets as well as patient and marker selection. To this end, SNPpy enables the user to filter the data and output the results as standardized GWAS file formats. It does low level and flexible data validation, including validation of patient data. SNPpy is a practical and extensible solution for investigators who seek to deploy central management of their GWAS data.

  17. Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries

    Directory of Open Access Journals (Sweden)

    Kumar Santosh

    2012-12-01

    Full Text Available Abstract Background Flax (Linum usitatissimum L. is a significant fibre and oilseed crop. Current flax molecular markers, including isozymes, RAPDs, AFLPs and SSRs are of limited use in the construction of high density linkage maps and for association mapping applications due to factors such as low reproducibility, intense labour requirements and/or limited numbers. We report here on the use of a reduced representation library strategy combined with next generation Illumina sequencing for rapid and large scale discovery of SNPs in eight flax genotypes. SNP discovery was performed through in silico analysis of the sequencing data against the whole genome shotgun sequence assembly of flax genotype CDC Bethune. Genotyping-by-sequencing of an F6-derived recombinant inbred line population provided validation of the SNPs. Results Reduced representation libraries of eight flax genotypes were sequenced on the Illumina sequencing platform resulting in sequence coverage ranging from 4.33 to 15.64X (genome equivalents. Depending on the relatedness of the genotypes and the number and length of the reads, between 78% and 93% of the reads mapped onto the CDC Bethune whole genome shotgun sequence assembly. A total of 55,465 SNPs were discovered with the largest number of SNPs belonging to the genotypes with the highest mapping coverage percentage. Approximately 84% of the SNPs discovered were identified in a single genotype, 13% were shared between any two genotypes and the remaining 3% in three or more. Nearly a quarter of the SNPs were found in genic regions. A total of 4,706 out of 4,863 SNPs discovered in Macbeth were validated using genotyping-by-sequencing of 96 F6 individuals from a recombinant inbred line population derived from a cross between CDC Bethune and Macbeth, corresponding to a validation rate of 96.8%. Conclusions Next generation sequencing of reduced representation libraries was successfully implemented for genome-wide SNP discovery from

  18. An evaluation of the genetic-matched pair study design using genome-wide SNP data from the European population

    DEFF Research Database (Denmark)

    Lu, Timothy Tehua; Lao, Oscar; Nothnagel, Michael

    2009-01-01

    of cases (76.0%), the BOM of a given individual, based on the complete marker set, came from a different recruitment site than the individual itself. A second marker set, specifically selected for ancestry sensitivity using singular value decomposition, performed even more poorly and was no more capable......Genetic matching potentially provides a means to alleviate the effects of incomplete Mendelian randomization in population-based gene-disease association studies. We therefore evaluated the genetic-matched pair study design on the basis of genome-wide SNP data (309,790 markers; Affymetrix Gene......Chip Human Mapping 500K Array) from 2457 individuals, sampled at 23 different recruitment sites across Europe. Using pair-wise identity-by-state (IBS) as a matching criterion, we tried to derive a subset of markers that would allow identification of the best overall matching (BOM) partner for a given...

  19. RS-SNP: a random-set method for genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Mukherjee Sayan

    2011-03-01

    Full Text Available Abstract Background The typical objective of Genome-wide association (GWA studies is to identify single-nucleotide polymorphisms (SNPs and corresponding genes with the strongest evidence of association (the 'most-significant SNPs/genes' approach. Borrowing ideas from micro-array data analysis, we propose a new method, named RS-SNP, for detecting sets of genes enriched in SNPs moderately associated to the phenotype. RS-SNP assesses whether the number of significant SNPs, with p-value P ≤ α, belonging to a given SNP set is statistically significant. The rationale of proposed method is that two kinds of null hypotheses are taken into account simultaneously. In the first null model the genotype and the phenotype are assumed to be independent random variables and the null distribution is the probability of the number of significant SNPs in greater than observed by chance. The second null model assumes the number of significant SNPs in depends on the size of and not on the identity of the SNPs in . Statistical significance is assessed using non-parametric permutation tests. Results We applied RS-SNP to the Crohn's disease (CD data set collected by the Wellcome Trust Case Control Consortium (WTCCC and compared the results with GENGEN, an approach recently proposed in literature. The enrichment analysis using RS-SNP and the set of pathways contained in the MSigDB C2 CP pathway collection highlighted 86 pathways rich in SNPs weakly associated to CD. Of these, 47 were also indicated to be significant by GENGEN. Similar results were obtained using the MSigDB C5 pathway collection. Many of the pathways found to be enriched by RS-SNP have a well-known connection to CD and often with inflammatory diseases. Conclusions The proposed method is a valuable alternative to other techniques for enrichment analysis of SNP sets. It is well founded from a theoretical and statistical perspective. Moreover, the experimental comparison with GENGEN highlights that it is

  20. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis

    Science.gov (United States)

    2011-09-01

    SNP Array v2. A ‘proof-of-concept’ advanced data mining algorithm for unsupervised analysis of genome-wide association study (GWAS) dataset was... Opal F AUS Yes U141 Peggs F AUS Yes U142 Taxi F AUS Yes U143 Riso MI MAL Yes U144 Szarik MI GSD Yes U145 Astor MI MAL Yes U146 Roy MC MAL Yes... mining of genetic studies in general, and especially GWAS. As a proof-of-concept, a classification analysis of the WG SNP typing dataset of a

  1. iLOCi: a SNP interaction prioritization technique for detecting epistasis in genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Piriyapongsa Jittima

    2012-12-01

    Full Text Available Abstract Background Genome-wide association studies (GWAS do not provide a full account of the heritability of genetic diseases since gene-gene interactions, also known as epistasis are not considered in single locus GWAS. To address this problem, a considerable number of methods have been developed for identifying disease-associated gene-gene interactions. However, these methods typically fail to identify interacting markers explaining more of the disease heritability over single locus GWAS, since many of the interactions significant for disease are obscured by uninformative marker interactions e.g., linkage disequilibrium (LD. Results In this study, we present a novel SNP interaction prioritization algorithm, named iLOCi (Interacting Loci. This algorithm accounts for marker dependencies separately in case and control groups. Disease-associated interactions are then prioritized according to a novel ranking score calculated from the difference in marker dependencies for every possible pair between case and control groups. The analysis of a typical GWAS dataset can be completed in less than a day on a standard workstation with parallel processing capability. The proposed framework was validated using simulated data and applied to real GWAS datasets using the Wellcome Trust Case Control Consortium (WTCCC data. The results from simulated data showed the ability of iLOCi to identify various types of gene-gene interactions, especially for high-order interaction. From the WTCCC data, we found that among the top ranked interacting SNP pairs, several mapped to genes previously known to be associated with disease, and interestingly, other previously unreported genes with biologically related roles. Conclusion iLOCi is a powerful tool for uncovering true disease interacting markers and thus can provide a more complete understanding of the genetic basis underlying complex disease. The program is available for download at http://www4a.biotec.or.th/GI/tools/iloci.

  2. Population genomic structure and linkage disequilibrium analysis of South African goat breeds using genome-wide SNP data.

    Science.gov (United States)

    Mdladla, K; Dzomba, E F; Huson, H J; Muchadeyi, F C

    2016-08-01

    The sustainability of goat farming in marginal areas of southern Africa depends on local breeds that are adapted to specific agro-ecological conditions. Unimproved non-descript goats are the main genetic resources used for the development of commercial meat-type breeds of South Africa. Little is known about genetic diversity and the genetics of adaptation of these indigenous goat populations. This study investigated the genetic diversity, population structure and breed relations, linkage disequilibrium, effective population size and persistence of gametic phase in goat populations of South Africa. Three locally developed meat-type breeds of the Boer (n = 33), Savanna (n = 31), Kalahari Red (n = 40), a feral breed of Tankwa (n = 25) and unimproved non-descript village ecotypes (n = 110) from four goat-producing provinces of the Eastern Cape, KwaZulu-Natal, Limpopo and North West were assessed using the Illumina Goat 50K SNP Bead Chip assay. The proportion of SNPs with minor allele frequencies >0.05 ranged from 84.22% in the Tankwa to 97.58% in the Xhosa ecotype, with a mean of 0.32 ± 0.13 across populations. Principal components analysis, admixture and pairwise FST identified Tankwa as a genetically distinct population and supported clustering of the populations according to their historical origins. Genome-wide FST identified 101 markers potentially under positive selection in the Tankwa. Average linkage disequilibrium was highest in the Tankwa (r(2)  = 0.25 ± 0.26) and lowest in the village ecotypes (r(2) range = 0.09 ± 0.12 to 0.11 ± 0.14). We observed an effective population size of 100 kb with the exception of those in Savanna and Tswana populations. This study highlights the high level of genetic diversity in South African indigenous goats as well as the utility of the genome-wide SNP marker panels in genetic studies of these populations. © 2016 Stichting International Foundation for Animal Genetics.

  3. Chapter 10: Mining genome-wide genetic markers.

    Directory of Open Access Journals (Sweden)

    Xiang Zhang

    Full Text Available Genome-wide association study (GWAS aims to discover genetic factors underlying phenotypic traits. The large number of genetic factors poses both computational and statistical challenges. Various computational approaches have been developed for large scale GWAS. In this chapter, we will discuss several widely used computational approaches in GWAS. The following topics will be covered: (1 An introduction to the background of GWAS. (2 The existing computational approaches that are widely used in GWAS. This will cover single-locus, epistasis detection, and machine learning methods that have been recently developed in biology, statistic, and computer science communities. This part will be the main focus of this chapter. (3 The limitations of current approaches and future directions.

  4. Genome-wide SNP data unveils the globalization of domesticated pigs.

    Science.gov (United States)

    Yang, Bin; Cui, Leilei; Perez-Enciso, Miguel; Traspov, Aleksei; Crooijmans, Richard P M A; Zinovieva, Natalia; Schook, Lawrence B; Archibald, Alan; Gatphayak, Kesinee; Knorr, Christophe; Triantafyllidis, Alex; Alexandri, Panoraia; Semiadi, Gono; Hanotte, Olivier; Dias, Deodália; Dovč, Peter; Uimari, Pekka; Iacolina, Laura; Scandura, Massimo; Groenen, Martien A M; Huang, Lusheng; Megens, Hendrik-Jan

    2017-09-21

    Pigs were domesticated independently in Eastern and Western Eurasia early during the agricultural revolution, and have since been transported and traded across the globe. Here, we present a worldwide survey on 60K genome-wide single nucleotide polymorphism (SNP) data for 2093 pigs, including 1839 domestic pigs representing 122 local and commercial breeds, 215 wild boars, and 39 out-group suids, from Asia, Europe, America, Oceania and Africa. The aim of this study was to infer global patterns in pig domestication and diversity related to demography, migration, and selection. A deep phylogeographic division reflects the dichotomy between early domestication centers. In the core Eastern and Western domestication regions, Chinese pigs show differentiation between breeds due to geographic isolation, whereas this is less pronounced in European pigs. The inferred European origin of pigs in the Americas, Africa, and Australia reflects European expansion during the sixteenth to nineteenth centuries. Human-mediated introgression, which is due, in particular, to importing Chinese pigs into the UK during the eighteenth and nineteenth centuries, played an important role in the formation of modern pig breeds. Inbreeding levels vary markedly between populations, from almost no runs of homozygosity (ROH) in a number of Asian wild boar populations, to up to 20% of the genome covered by ROH in a number of Southern European breeds. Commercial populations show moderate ROH statistics. For domesticated pigs and wild boars in Asia and Europe, we identified highly differentiated loci that include candidate genes related to muscle and body development, central nervous system, reproduction, and energy balance, which are putatively under artificial selection. Key events related to domestication, dispersal, and mixing of pigs from different regions are reflected in the 60K SNP data, including the globalization that has recently become full circle since Chinese pig breeders in the past

  5. SNP-based pathway enrichment analysis for genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Potkin Steven G

    2011-04-01

    Full Text Available Abstract Background Recently we have witnessed a surge of interest in using genome-wide association studies (GWAS to discover the genetic basis of complex diseases. Many genetic variations, mostly in the form of single nucleotide polymorphisms (SNPs, have been identified in a wide spectrum of diseases, including diabetes, cancer, and psychiatric diseases. A common theme arising from these studies is that the genetic variations discovered by GWAS can only explain a small fraction of the genetic risks associated with the complex diseases. New strategies and statistical approaches are needed to address this lack of explanation. One such approach is the pathway analysis, which considers the genetic variations underlying a biological pathway, rather than separately as in the traditional GWAS studies. A critical challenge in the pathway analysis is how to combine evidences of association over multiple SNPs within a gene and multiple genes within a pathway. Most current methods choose the most significant SNP from each gene as a representative, ignoring the joint action of multiple SNPs within a gene. This approach leads to preferential identification of genes with a greater number of SNPs. Results We describe a SNP-based pathway enrichment method for GWAS studies. The method consists of the following two main steps: 1 for a given pathway, using an adaptive truncated product statistic to identify all representative (potentially more than one SNPs of each gene, calculating the average number of representative SNPs for the genes, then re-selecting the representative SNPs of genes in the pathway based on this number; and 2 ranking all selected SNPs by the significance of their statistical association with a trait of interest, and testing if the set of SNPs from a particular pathway is significantly enriched with high ranks using a weighted Kolmogorov-Smirnov test. We applied our method to two large genetically distinct GWAS data sets of schizophrenia, one

  6. Genetic diversity and structure of elite cotton germplasm (Gossypium hirsutum L.) using genome-wide SNP data.

    Science.gov (United States)

    Ai, XianTao; Liang, YaJun; Wang, JunDuo; Zheng, JuYun; Gong, ZhaoLong; Guo, JiangPing; Li, XueYuan; Qu, YanYing

    2017-10-01

    Cotton (Gossypium spp.) is the most important natural textile fiber crop, and Gossypium hirsutum L. is responsible for 90% of the annual cotton crop in the world. Information on cotton genetic diversity and population structure is essential for new breeding lines. In this study, we analyzed population structure and genetic diversity of 288 elite Gossypium hirsutum cultivar accessions collected from around the world, and especially from China, using genome-wide single nucleotide polymorphisms (SNP) markers. The average polymorphsim information content (PIC) was 0.25, indicating a relatively low degree of genetic diversity. Population structure analysis revealed extensive admixture and identified three subgroups. Phylogenetic analysis supported the subgroups identified by STRUCTURE. The results from both population structure and phylogenetic analysis were, for the most part, in agreement with pedigree information. Analysis of molecular variance revealed a larger amount of variation was due to diversity within the groups. Establishment of genetic diversity and population structure from this study could be useful for genetic and genomic analysis and systematic utilization of the standing genetic variation in upland cotton.

  7. Use of direct and iterative solvers for estimation of SNP effects in genome-wide selection

    Directory of Open Access Journals (Sweden)

    Eduardo da Cruz Gouveia Pimentel

    2010-01-01

    Full Text Available The aim of this study was to compare iterative and direct solvers for estimation of marker effects in genomic selection. One iterative and two direct methods were used: Gauss-Seidel with Residual Update, Cholesky Decomposition and Gentleman-Givens rotations. For resembling different scenarios with respect to number of markers and of genotyped animals, a simulated data set divided into 25 subsets was used. Number of markers ranged from 1,200 to 5,925 and number of animals ranged from 1,200 to 5,865. Methods were also applied to real data comprising 3081 individuals genotyped for 45181 SNPs. Results from simulated data showed that the iterative solver was substantially faster than direct methods for larger numbers of markers. Use of a direct solver may allow for computing (covariances of SNP effects. When applied to real data, performance of the iterative method varied substantially, depending on the level of ill-conditioning of the coefficient matrix. From results with real data, Gentleman-Givens rotations would be the method of choice in this particular application as it provided an exact solution within a fairly reasonable time frame (less than two hours. It would indeed be the preferred method whenever computer resources allow its use.

  8. OpenADAM: an open source genome-wide association data management system for Affymetrix SNP arrays

    Directory of Open Access Journals (Sweden)

    Sham P C

    2008-12-01

    Full Text Available Abstract Background Large scale genome-wide association studies have become popular since the introduction of high throughput genotyping platforms. Efficient management of the vast array of data generated poses many challenges. Description We have developed an open source web-based data management system for the large amount of genotype data generated from the Affymetrix GeneChip® Mapping Array and Affymetrix Genome-Wide Human SNP Array platforms. The database supports genotype calling using DM, BRLMM, BRLMM-P or Birdseed algorithms provided by the Affymetrix Power Tools. The genotype and corresponding pedigree data are stored in a relational database for efficient downstream data manipulation and analysis, such as calculation of allele and genotype frequencies, sample identity checking, and export of genotype data in various file formats for analysis using commonly-available software. A novel method for genotyping error estimation is implemented using linkage disequilibrium information from the HapMap project. All functionalities are accessible via a web-based user interface. Conclusion OpenADAM provides an open source database system for management of Affymetrix genome-wide association SNP data.

  9. Genome-wide SNP discovery in tetraploid alfalfa using 454 sequencing and high resolution melting analysis

    Directory of Open Access Journals (Sweden)

    Zhao Patrick X

    2011-07-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs are the most common type of sequence variation among plants and are often functionally important. We describe the use of 454 technology and high resolution melting analysis (HRM for high throughput SNP discovery in tetraploid alfalfa (Medicago sativa L., a species with high economic value but limited genomic resources. Results The alfalfa genotypes selected from M. sativa subsp. sativa var. 'Chilean' and M. sativa subsp. falcata var. 'Wisfal', which differ in water stress sensitivity, were used to prepare cDNA from tissue of clonally-propagated plants grown under either well-watered or water-stressed conditions, and then pooled for 454 sequencing. Based on 125.2 Mb of raw sequence, a total of 54,216 unique sequences were obtained including 24,144 tentative consensus (TCs sequences and 30,072 singletons, ranging from 100 bp to 6,662 bp in length, with an average length of 541 bp. We identified 40,661 candidate SNPs distributed throughout the genome. A sample of candidate SNPs were evaluated and validated using high resolution melting (HRM analysis. A total of 3,491 TCs harboring 20,270 candidate SNPs were located on the M. truncatula (MT 3.5.1 chromosomes. Gene Ontology assignments indicate that sequences obtained cover a broad range of GO categories. Conclusions We describe an efficient method to identify thousands of SNPs distributed throughout the alfalfa genome covering a broad range of GO categories. Validated SNPs represent valuable molecular marker resources that can be used to enhance marker density in linkage maps, identify potential factors involved in heterosis and genetic variation, and as tools for association mapping and genomic selection in alfalfa.

  10. Genome-wide joint meta-analysis of SNP and SNP-by-smoking interaction identifies novel loci for pulmonary function.

    Directory of Open Access Journals (Sweden)

    Dana B Hancock

    Full Text Available Genome-wide association studies have identified numerous genetic loci for spirometic measures of pulmonary function, forced expiratory volume in one second (FEV(1, and its ratio to forced vital capacity (FEV(1/FVC. Given that cigarette smoking adversely affects pulmonary function, we conducted genome-wide joint meta-analyses (JMA of single nucleotide polymorphism (SNP and SNP-by-smoking (ever-smoking or pack-years associations on FEV(1 and FEV(1/FVC across 19 studies (total N = 50,047. We identified three novel loci not previously associated with pulmonary function. SNPs in or near DNER (smallest P(JMA = 5.00×10(-11, HLA-DQB1 and HLA-DQA2 (smallest P(JMA = 4.35×10(-9, and KCNJ2 and SOX9 (smallest P(JMA = 1.28×10(-8 were associated with FEV(1/FVC or FEV(1 in meta-analysis models including SNP main effects, smoking main effects, and SNP-by-smoking (ever-smoking or pack-years interaction. The HLA region has been widely implicated for autoimmune and lung phenotypes, unlike the other novel loci, which have not been widely implicated. We evaluated DNER, KCNJ2, and SOX9 and found them to be expressed in human lung tissue. DNER and SOX9 further showed evidence of differential expression in human airway epithelium in smokers compared to non-smokers. Our findings demonstrated that joint testing of SNP and SNP-by-environment interaction identified novel loci associated with complex traits that are missed when considering only the genetic main effects.

  11. ChIP on SNP-chip for genome-wide analysis of human histone H4 hyperacetylation

    Directory of Open Access Journals (Sweden)

    Porter Christopher J

    2007-09-01

    Full Text Available Abstract Background SNP microarrays are designed to genotype Single Nucleotide Polymorphisms (SNPs. These microarrays report hybridization of DNA fragments and therefore can be used for the purpose of detecting genomic fragments. Results Here, we demonstrate that a SNP microarray can be effectively used in this way to perform chromatin immunoprecipitation (ChIP on chip as an alternative to tiling microarrays. We illustrate this novel application by mapping whole genome histone H4 hyperacetylation in human myoblasts and myotubes. We detect clusters of hyperacetylated histone H4, often spanning across up to 300 kilobases of genomic sequence. Using complementary genome-wide analyses of gene expression by DNA microarray we demonstrate that these clusters of hyperacetylated histone H4 tend to be associated with expressed genes. Conclusion The use of a SNP array for a ChIP-on-chip application (ChIP on SNP-chip will be of great value to laboratories whose interest is the determination of general rules regarding the relationship of specific chromatin modifications to transcriptional status throughout the genome and to examine the asymmetric modification of chromatin at heterozygous loci.

  12. Genome-wide SNP association-based localization of a dwarfism gene in Friesian dwarf horses

    NARCIS (Netherlands)

    Orr, J.L.; Back, W.; Gu, J.; Leegwater, P.H.; Govindarajan, P.; Conroy, J.; Ducro, B.J.; Arendonk, van J.A.M.

    2010-01-01

    The recent completion of the horse genome and commercial availability of an equine SNP genotyping array has facilitated the mapping of disease genes. We report putative localization of the gene responsible for dwarfism, a trait in Friesian horses that is thought to have a recessive mode of

  13. Novel approach for deriving genome wide SNP analysis data from archived blood spots

    Science.gov (United States)

    2012-01-01

    Background The ability to transport and store DNA at room temperature in low volumes has the advantage of optimising cost, time and storage space. Blood spots on adapted filter papers are popular for this, with FTA (Flinders Technology Associates) Whatman™TM technology being one of the most recent. Plant material, plasmids, viral particles, bacteria and animal blood have been stored and transported successfully using this technology, however the method of porcine DNA extraction from FTA Whatman™TM cards is a relatively new approach, allowing nucleic acids to be ready for downstream applications such as PCR, whole genome amplification, sequencing and subsequent application to single nucleotide polymorphism microarrays has hitherto been under-explored. Findings DNA was extracted from FTA Whatman™TM cards (following adaptations of the manufacturer’s instructions), whole genome amplified and subsequently analysed to validate the integrity of the DNA for downstream SNP analysis. DNA was successfully extracted from 288/288 samples and amplified by WGA. Allele dropout post WGA, was observed in less than 2% of samples and there was no clear evidence of amplification bias nor contamination. Acceptable call rates on porcine SNP chips were also achieved using DNA extracted and amplified in this way. Conclusions DNA extracted from FTA Whatman cards is of a high enough quality and quantity following whole genomic amplification to perform meaningful SNP chip studies. PMID:22974252

  14. Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication

    Science.gov (United States)

    vonHoldt, Bridgett M.; Pollinger, John P.; Lohmueller, Kirk E.; Han, Eunjung; Parker, Heidi G.; Quignon, Pascale; Degenhardt, Jeremiah D.; Boyko, Adam R.; Earl, Dent A.; Auton, Adam; Reynolds, Andy; Bryc, Kasia; Brisbin, Abra; Knowles, James C.; Mosher, Dana S.; Spady, Tyrone C.; Elkahloun, Abdel; Geffen, Eli; Pilot, Malgorzata; Jedrzejewski, Wlodzimierz; Greco, Claudia; Randi, Ettore; Bannasch, Danika; Wilton, Alan; Shearman, Jeremy; Musiani, Marco; Cargill, Michelle; Jones, Paul G.; Qian, Zuwei; Huang, Wei; Ding, Zhao-Li; Zhang, Ya-ping; Bustamante, Carlos D.; Ostrander, Elaine A.; Novembre, John; Wayne, Robert K.

    2010-01-01

    Advances in genome technology have facilitated a new understanding of the historical and genetic processes crucial to rapid phenotypic evolution under domestication1,2. To understand the process of dog diversification better, we conducted an extensive genome-wide survey of more than 48,000 single nucleotide polymorphisms in dogs and their wild progenitor, the grey wolf. Here we show that dog breeds share a higher proportion of multi-locus haplotypes unique to grey wolves from the Middle East, indicating that they are a dominant source of genetic diversity for dogs rather than wolves from east Asia, as suggested by mitochondrial DNA sequence data3. Furthermore, we find a surprising correspondence between genetic and phenotypic/functional breed groupings but there are exceptions that suggest phenotypic diversification depended in part on the repeated crossing of individuals with novel phenotypes. Our results show that Middle Eastern wolves were a critical source of genome diversity, although interbreeding with local wolf populations clearly occurred elsewhere in the early history of specific lineages. More recently, the evolution of modern dog breeds seems to have been an iterative process that drew on a limited genetic toolkit to create remarkable phenotypic diversity. PMID:20237475

  15. Genome-wide microsatellite characterization and marker development in the sequenced Brassica crop species.

    Science.gov (United States)

    Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong

    2014-02-01

    Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species.

  16. Genome-wide SNP association-based localization of a dwarfism gene in Friesian dwarf horses.

    Science.gov (United States)

    Orr, N; Back, W; Gu, J; Leegwater, P; Govindarajan, P; Conroy, J; Ducro, B; Van Arendonk, J A M; MacHugh, D E; Ennis, S; Hill, E W; Brama, P A J

    2010-12-01

    The recent completion of the horse genome and commercial availability of an equine SNP genotyping array has facilitated the mapping of disease genes. We report putative localization of the gene responsible for dwarfism, a trait in Friesian horses that is thought to have a recessive mode of inheritance, to a 2-MB region of chromosome 14 using just 10 affected animals and 10 controls. We successfully genotyped 34,429 SNPs that were tested for association with dwarfism using chi-square tests. The most significant SNP in our study, BIEC2-239376 (P(2df)=4.54 × 10(-5), P(rec)=7.74 × 10(-6)), is located close to a gene implicated in human dwarfism. Fine-mapping and resequencing analyses did not aid in further localization of the causative variant, and replication of our findings in independent sample sets will be necessary to confirm these results. © 2010 The Authors, Journal compilation © 2010 Stichting International Foundation for Animal Genetics.

  17. Genome-wide linkage analysis of QTL for growth and body composition employing the PorcineSNP60 BeadChip

    Directory of Open Access Journals (Sweden)

    Fernández Ana I

    2012-05-01

    Full Text Available Abstract Background The traditional strategy to map QTL is to use linkage analysis employing a limited number of markers. These analyses report wide QTL confidence intervals, making very difficult to identify the gene and polymorphisms underlying the QTL effects. The arrival of genome-wide panels of SNPs makes available thousands of markers increasing the information content and therefore the likelihood of detecting and fine mapping QTL regions. The aims of the current study are to confirm previous QTL regions for growth and body composition traits in different generations of an Iberian x Landrace intercross (IBMAP and especially identify new ones with narrow confidence intervals by employing the PorcineSNP60 BeadChip in linkage analyses. Results Three generations (F3, Backcross 1 and Backcross 2 of the IBMAP and their related animals were genotyped with PorcineSNP60 BeadChip. A total of 8,417 SNPs equidistantly distributed across autosomes were selected after filtering by quality, position and frequency to perform the QTL scan. The joint and separate analyses of the different IBMAP generations allowed confirming QTL regions previously identified in chromosomes 4 and 6 as well as new ones mainly for backfat thickness in chromosomes 4, 5, 11, 14 and 17 and shoulder weight in chromosomes 1, 2, 9 and 13; and many other to the chromosome-wide signification level. In addition, most of the detected QTLs displayed narrow confidence intervals, making easier the selection of positional candidate genes. Conclusions The use of higher density of markers has allowed to confirm results obtained in previous QTL scans carried out with microsatellites. Moreover several new QTL regions have been now identified in regions probably not covered by markers in previous scans, most of these QTLs displayed narrow confidence intervals. Finally, prominent putative biological and positional candidate genes underlying those QTL effects are listed based on recent porcine

  18. Using rice genome-wide association studies to identify DNA markers for marker-assisted selection

    Science.gov (United States)

    Rice association mapping panels are collections of rice (Oryza sativa L.) accessions developed for genome-wide association (GWA) studies. One of these panels, the Rice Diversity Panel 1 (RDP1) was phenotyped by various research groups for several traits of interest, and more recently, genotyped with...

  19. RAD-seq derived genome-wide nuclear markers resolve the phylogeny of tunas

    KAUST Repository

    Díaz-Arce, Natalia

    2016-06-07

    Although species from the genus Thunnus include some of the most commercially important and most severely overexploited fishes, the phylogeny of this genus is still unresolved, hampering evolutionary and traceability studies that could help improve conservation and management strategies for these species. Previous attempts based on mitochondrial and nuclear markers were unsuccessful in inferring a congruent and reliable phylogeny, probably due to mitochondrial introgression events and lack of enough phylogenetically informative markers. Here we infer the first genome-wide nuclear marker-based phylogeny of tunas using restriction site associated DNA sequencing (RAD-seq) data. Our results, derived from phylogenomic inferences obtained from 128 nucleotide matrices constructed using alternative data assembly procedures, support a single Thunnus evolutionary history that challenges previous assumptions based on morphological and molecular data.

  20. Admixture patterns and genetic differentiation in negrito groups from West Malaysia estimated from genome-wide SNP data.

    Science.gov (United States)

    Jinam, Timothy A; Phipps, Maude E; Saitou, Naruya

    2013-01-01

    Southeast Asia houses various culturally and linguistically diverse ethnic groups. In Malaysia, where the Malay, Chinese, and Indian ethnic groups form the majority, there exist minority groups such as the "negritos" who are believed to be descendants of the earliest settlers of Southeast Asia. Here we report patterns of genetic substructure and admixture in two Malaysian negrito populations (Jehai and Kensiu), using ~50,000 genome-wide single-nucleotide polymorphism (SNP) data. We found traces of recent admixture in both the negrito populations, particularly in the Jehai, with the Malay through principal component analysis and STRUCTURE analysis software, which suggested that the admixture was as recent as one generation ago. We also identified significantly differentiated nonsynonymous SNPs and haplotype blocks related to intracellular transport, metabolic processes, and detection of stimulus. These results highlight the different levels of admixture experienced by the two Malaysian negritos. Delineating admixture and differentiated genomic regions should be of importance in designing and interpretation of molecular anthropology and disease association studies. Copyright © 2013 Wayne State University Press, Detroit, Michigan 48201-1309.

  1. Gaussian covariance graph models accounting for correlated marker effects in genome-wide prediction.

    Science.gov (United States)

    Martínez, C A; Khare, K; Rahman, S; Elzo, M A

    2017-10-01

    Several statistical models used in genome-wide prediction assume uncorrelated marker allele substitution effects, but it is known that these effects may be correlated. In statistics, graphical models have been identified as a useful tool for covariance estimation in high-dimensional problems and it is an area that has recently experienced a great expansion. In Gaussian covariance graph models (GCovGM), the joint distribution of a set of random variables is assumed to be Gaussian and the pattern of zeros of the covariance matrix is encoded in terms of an undirected graph G. In this study, methods adapting the theory of GCovGM to genome-wide prediction were developed (Bayes GCov, Bayes GCov-KR and Bayes GCov-H). In simulated data sets, improvements in correlation between phenotypes and predicted breeding values and accuracies of predicted breeding values were found. Our models account for correlation of marker effects and permit to accommodate general structures as opposed to models proposed in previous studies, which consider spatial correlation only. In addition, they allow incorporation of biological information in the prediction process through its use when constructing graph G, and their extension to the multi-allelic loci case is straightforward. © 2017 Blackwell Verlag GmbH.

  2. Genome-wide characterization of microsatelittes and marker development in the carcinogenic liver fluke Clonorchis sinensis

    Science.gov (United States)

    Nguyen, Thao T.B.; Arimatsu, Yuji; Hong, Sung-Jong; Brindley, Paul J.; Blair, David; Laha, Thewarach; Sripa, Banchob

    2015-01-01

    Clonorchis sinensis is an important carcinogenic human liver fluke endemic in East and Southeast Asia. There are several conventional molecular markers have been used for identification and genetic diversity, however, no information about microsatellites of this liver fluke published so far. We here report microsatellite characterization and marker development for genetic diversity study in C. sinensis using genome-wide bioinformatics approach. Based on our search criteria, a total of 256,990 microsatellites (≥ 12 base pairs) were identified from genome database of C. sinensis with hexa-nucleotide motif being the most abundant (51%) followed by penta-nucleotide (18.3%) and tri-nucleotide (12.7%). The tetra-nucleotide, di-nucleotide and mononucleotide motifs accounted for 9.75 %, 7.63% and 0.14%, respectively. The total length of all microsatellites accounts for 0. 72 % of 547 Mb of the whole genome size and the frequency of microsatellites were found to be one microsatellite in every 2.13 kb of DNA. For the di-, tri, and tetra-nucleotide, the repeat numbers redundant are six (28%), four (45%) and three (76%), respectively. The ATC repeat is the most abundant microsatellites followed by AT, AAT and AC, respectively. Within 40 microsatellite loci developed, 24 microsatellite markers showed potential to differentiate between C. sinensis and O. viverrini. Seven out of 24 loci showed heterozygous with observed heterozygosity ranged from 0.467 to 1. Four-primer sets could amplify both C. sinensis and O. viverrini DNA with different sizes. This study provides basic information of C. sinensis microsatellites and the genome-wide markers developed may be a useful tool for genetic study of C. sinensis. PMID:25782682

  3. Genome-wide characterization of microsatellites and marker development in the carcinogenic liver fluke Clonorchis sinensis.

    Science.gov (United States)

    Nguyen, Thao T B; Arimatsu, Yuji; Hong, Sung-Jong; Brindley, Paul J; Blair, David; Laha, Thewarach; Sripa, Banchob

    2015-06-01

    Clonorchis sinensis is an important carcinogenic human liver fluke endemic in East and Southeast Asia. There are several conventional molecular markers that have been used for identification and genetic diversity; however, no information about microsatellites of this liver fluke is published so far. We here report microsatellite characterization and marker development for a genetic diversity study in C. sinensis, using a genome-wide bioinformatics approach. Based on our search criteria, a total of 256,990 microsatellites (≥12 base pairs) were identified from a genome database of C. sinensis, with hexanucleotide motif being the most abundant (51%) followed by pentanucleotide (18.3%) and trinucleotide (12.7%). The tetranucleotide, dinucleotide, and mononucleotide motifs accounted for 9.75, 7.63, and 0.14%, respectively. The total length of all microsatellites accounts for 0. 72% of 547 Mb of the whole genome size, and the frequency of microsatellites was found to be one microsatellite in every 2.13 kb of DNA. For the di-, tri-, and tetranucleotide, the repeat numbers redundant are six (28%), four (45%), and three (76%), respectively. The ATC repeat is the most abundant microsatellites followed by AT, AAT, and AC, respectively. Within 40 microsatellite loci developed, 24 microsatellite markers showed potential to differentiate between C. sinensis and Opisthorchis viverrini. Seven out of 24 loci showed to be heterozygous with observed heterozygosity that ranged from 0.467 to 1. Four primer sets could amplify both C. sinensis and O. viverrini DNA with different sizes. This study provides basic information of C. sinensis microsatellites, and the genome-wide markers developed may be a useful tool for the genetic study of C. sinensis.

  4. Analysis of genetic diversity in Brown Swiss, Jersey and Holstein populations using genome-wide single nucleotide polymorphism markers

    Directory of Open Access Journals (Sweden)

    Melka Melkaye G

    2012-03-01

    Full Text Available Abstract Background Studies of genetic diversity are essential in understanding the extent of differentiation between breeds, and in designing successful diversity conservation strategies. The objective of this study was to evaluate the level of genetic diversity within and between North American Brown Swiss (BS, n = 900, Jersey (JE, n = 2,922 and Holstein (HO, n = 3,535 cattle, using genotyped bulls. GENEPOP and FSTAT software were used to evaluate the level of genetic diversity within each breed and between each pair of the three breeds based on genome-wide SNP markers (n = 50,972. Results Hardy-Weinberg equilibrium (HWE exact test within breeds showed a significant deviation from equilibrium within each population (P st indicated that the combination of BS and HO in an ideally amalgamated population had higher genetic diversity than the other pairs of breeds. Conclusion Results suggest that the three bull populations have substantially different gene pools. BS and HO show the largest gene differentiation and jointly the highest total expected gene diversity compared to when JE is considered. If the loss of genetic diversity within breeds worsens in the future, the use of crossbreeding might be an option to recover genetic diversity, especially for the breeds with small population size.

  5. Genome Wide Association Study of SNP-, Gene-, and Pathway-based Approaches to Identify Genes Influencing Susceptibility to Staphylococcus aureus Infections

    Directory of Open Access Journals (Sweden)

    Zhan eYe

    2014-05-01

    Full Text Available Background: We conducted a genome-wide association study (GWAS to identify specific genetic variants that underlie susceptibility to disease caused by Staphylococcus aureus in humans. Methods: Cases (n=309 and controls (n=2,925 were genotyped at 508,921 single nucleotide polymorphisms (SNPs. Cases had at least one laboratory and clinician confirmed disease caused by S. aureus whereas controls did not. R-package (for SNP association, EIGENSOFT (to estimate and adjust for population stratification and gene- (VEGAS and pathway-based (DAVID, PANTHER, and Ingenuity Pathway Analysis analyses were performed.Results: No SNP reached genome-wide significance. Four SNPs exceeded the pConclusion: We identified potential susceptibility genes for S. aureus diseases in this preliminary study but confirmation by other studies is needed. The observed associations could be relevant given the complexity of S. aureus as a pathogen and its ability to exploit multiple biological pathways to cause infections in humans.

  6. Accuracy of genome-wide imputation of untyped markers and impacts on statistical power for association studies

    Directory of Open Access Journals (Sweden)

    McElwee Joshua

    2009-06-01

    Full Text Available Abstract Background Although high-throughput genotyping arrays have made whole-genome association studies (WGAS feasible, only a small proportion of SNPs in the human genome are actually surveyed in such studies. In addition, various SNP arrays assay different sets of SNPs, which leads to challenges in comparing results and merging data for meta-analyses. Genome-wide imputation of untyped markers allows us to address these issues in a direct fashion. Methods 384 Caucasian American liver donors were genotyped using Illumina 650Y (Ilmn650Y arrays, from which we also derived genotypes from the Ilmn317K array. On these data, we compared two imputation methods: MACH and BEAGLE. We imputed 2.5 million HapMap Release22 SNPs, and conducted GWAS on ~40,000 liver mRNA expression traits (eQTL analysis. In addition, 200 Caucasian American and 200 African American subjects were genotyped using the Affymetrix 500 K array plus a custom 164 K fill-in chip. We then imputed the HapMap SNPs and quantified the accuracy by randomly masking observed SNPs. Results MACH and BEAGLE perform similarly with respect to imputation accuracy. The Ilmn650Y results in excellent imputation performance, and it outperforms Affx500K or Ilmn317K sets. For Caucasian Americans, 90% of the HapMap SNPs were imputed at 98% accuracy. As expected, imputation of poorly tagged SNPs (untyped SNPs in weak LD with typed markers was not as successful. It was more challenging to impute genotypes in the African American population, given (1 shorter LD blocks and (2 admixture with Caucasian populations in this population. To address issue (2, we pooled HapMap CEU and YRI data as an imputation reference set, which greatly improved overall performance. The approximate 40,000 phenotypes scored in these populations provide a path to determine empirically how the power to detect associations is affected by the imputation procedures. That is, at a fixed false discovery rate, the number of cis

  7. Next-Generation Sequencing Approaches in Genome-Wide Discovery of Single Nucleotide Polymorphism Markers Associated with Pungency and Disease Resistance in Pepper.

    Science.gov (United States)

    Manivannan, Abinaya; Kim, Jin-Hee; Yang, Eun-Young; Ahn, Yul-Kyun; Lee, Eun-Su; Choi, Sena; Kim, Do-Sun

    2018-01-01

    Pepper is an economically important horticultural plant that has been widely used for its pungency and spicy taste in worldwide cuisines. Therefore, the domestication of pepper has been carried out since antiquity. Owing to meet the growing demand for pepper with high quality, organoleptic property, nutraceutical contents, and disease tolerance, genomics assisted breeding techniques can be incorporated to develop novel pepper varieties with desired traits. The application of next-generation sequencing (NGS) approaches has reformed the plant breeding technology especially in the area of molecular marker assisted breeding. The availability of genomic information aids in the deeper understanding of several molecular mechanisms behind the vital physiological processes. In addition, the NGS methods facilitate the genome-wide discovery of DNA based markers linked to key genes involved in important biological phenomenon. Among the molecular markers, single nucleotide polymorphism (SNP) indulges various benefits in comparison with other existing DNA based markers. The present review concentrates on the impact of NGS approaches in the discovery of useful SNP markers associated with pungency and disease resistance in pepper. The information provided in the current endeavor can be utilized for the betterment of pepper breeding in future.

  8. Next-Generation Sequencing Approaches in Genome-Wide Discovery of Single Nucleotide Polymorphism Markers Associated with Pungency and Disease Resistance in Pepper

    Directory of Open Access Journals (Sweden)

    Abinaya Manivannan

    2018-01-01

    Full Text Available Pepper is an economically important horticultural plant that has been widely used for its pungency and spicy taste in worldwide cuisines. Therefore, the domestication of pepper has been carried out since antiquity. Owing to meet the growing demand for pepper with high quality, organoleptic property, nutraceutical contents, and disease tolerance, genomics assisted breeding techniques can be incorporated to develop novel pepper varieties with desired traits. The application of next-generation sequencing (NGS approaches has reformed the plant breeding technology especially in the area of molecular marker assisted breeding. The availability of genomic information aids in the deeper understanding of several molecular mechanisms behind the vital physiological processes. In addition, the NGS methods facilitate the genome-wide discovery of DNA based markers linked to key genes involved in important biological phenomenon. Among the molecular markers, single nucleotide polymorphism (SNP indulges various benefits in comparison with other existing DNA based markers. The present review concentrates on the impact of NGS approaches in the discovery of useful SNP markers associated with pungency and disease resistance in pepper. The information provided in the current endeavor can be utilized for the betterment of pepper breeding in future.

  9. Meta-analysis of Genome Wide Association Studies Identifies Genetic Markers of Late Toxicity Following Radiotherapy for Prostate Cancer

    Directory of Open Access Journals (Sweden)

    Sarah L. Kerns

    2016-08-01

    Full Text Available Nearly 50% of cancer patients undergo radiotherapy. Late radiotherapy toxicity affects quality-of-life in long-term cancer survivors and risk of side-effects in a minority limits doses prescribed to the majority of patients. Development of a test predicting risk of toxicity could benefit many cancer patients. We aimed to meta-analyze individual level data from four genome-wide association studies from prostate cancer radiotherapy cohorts including 1564 men to identify genetic markers of toxicity. Prospectively assessed two-year toxicity endpoints (urinary frequency, decreased urine stream, rectal bleeding, overall toxicity and single nucleotide polymorphism (SNP associations were tested using multivariable regression, adjusting for clinical and patient-related risk factors. A fixed-effects meta-analysis identified two SNPs: rs17599026 on 5q31.2 with urinary frequency (odds ratio [OR] 3.12, 95% confidence interval [CI] 2.08–4.69, p-value 4.16 × 10−8 and rs7720298 on 5p15.2 with decreased urine stream (OR 2.71, 95% CI 1.90–3.86, p-value = 3.21 × 10−8. These SNPs lie within genes that are expressed in tissues adversely affected by pelvic radiotherapy including bladder, kidney, rectum and small intestine. The results show that heterogeneous radiotherapy cohorts can be combined to identify new moderate-penetrance genetic variants associated with radiotherapy toxicity. The work provides a basis for larger collaborative efforts to identify enough variants for a future test involving polygenic risk profiling.

  10. Genome-wide indel markers shared by diverse Asian rice cultivars compared to Japanese rice cultivar ?Koshihikari?

    OpenAIRE

    Yonemaru, Jun-ichi; Choi, Sun Hee; Sakai, Hiroaki; Ando, Tsuyu; Shomura, Ayahiko; Yano, Masahiro; Wu, Jianzhong; Fukuoka, Shuichi

    2015-01-01

    Insertion-deletion (indel) polymorphisms, such as simple sequence repeats, have been widely used as DNA markers to identify QTLs and genes and to facilitate rice breeding. Recently, next-generation sequencing has produced deep sequences that allow genome-wide detection of indels. These polymorphisms can potentially be used to develop high-accuracy polymerase chain reaction (PCR)-based markers. Here, re-sequencing of 5 indica, 2 aus, and 3 tropical japonica cultivars and Japanese elite cultiva...

  11. Genome-wide detection of CNVs in Chinese indigenous sheep with different types of tails using ovine high-density 600K SNP arrays

    OpenAIRE

    Zhu, Caiye; Fan, Hongying; Yuan, Zehu; Hu, Shijin; Ma, Xiaomeng; Xuan, Junli; Wang, Hongwei; Zhang, Li; Wei, Caihong; Zhang, Qin; Zhao, Fuping; Du, Lixin

    2016-01-01

    Chinese indigenous sheep can be classified into three types based on tail morphology: fat-tailed, fat-rumped, and thin-tailed sheep, of which the typical breeds are large-tailed Han sheep, Altay sheep, and Tibetan sheep, respectively. To unravel the genetic mechanisms underlying the phenotypic differences among Chinese indigenous sheep with tails of three different types, we used ovine high-density 600K SNP arrays to detect genome-wide copy number variation (CNV). In large-tailed Han sheep, A...

  12. Estimating additive and non-additive genetic variances and predicting genetic merits using genome-wide dense single nucleotide polymorphism markers.

    Directory of Open Access Journals (Sweden)

    Guosheng Su

    Full Text Available Non-additive genetic variation is usually ignored when genome-wide markers are used to study the genetic architecture and genomic prediction of complex traits in human, wild life, model organisms or farm animals. However, non-additive genetic effects may have an important contribution to total genetic variation of complex traits. This study presented a genomic BLUP model including additive and non-additive genetic effects, in which additive and non-additive genetic relation matrices were constructed from information of genome-wide dense single nucleotide polymorphism (SNP markers. In addition, this study for the first time proposed a method to construct dominance relationship matrix using SNP markers and demonstrated it in detail. The proposed model was implemented to investigate the amounts of additive genetic, dominance and epistatic variations, and assessed the accuracy and unbiasedness of genomic predictions for daily gain in pigs. In the analysis of daily gain, four linear models were used: 1 a simple additive genetic model (MA, 2 a model including both additive and additive by additive epistatic genetic effects (MAE, 3 a model including both additive and dominance genetic effects (MAD, and 4 a full model including all three genetic components (MAED. Estimates of narrow-sense heritability were 0.397, 0.373, 0.379 and 0.357 for models MA, MAE, MAD and MAED, respectively. Estimated dominance variance and additive by additive epistatic variance accounted for 5.6% and 9.5% of the total phenotypic variance, respectively. Based on model MAED, the estimate of broad-sense heritability was 0.506. Reliabilities of genomic predicted breeding values for the animals without performance records were 28.5%, 28.8%, 29.2% and 29.5% for models MA, MAE, MAD and MAED, respectively. In addition, models including non-additive genetic effects improved unbiasedness of genomic predictions.

  13. Mosaic maternal uniparental disomy of chromosome 15 in Prader-Willi syndrome: utility of genome-wide SNP array.

    Science.gov (United States)

    Izumi, Kosuke; Santani, Avni B; Deardorff, Matthew A; Feret, Holly A; Tischler, Tanya; Thiel, Brian D; Mulchandani, Surabhi; Stolle, Catherine A; Spinner, Nancy B; Zackai, Elaine H; Conlin, Laura K

    2013-01-01

    Prader-Willi syndrome is caused by the loss of paternal gene expression on 15q11.2-q13.2, and one of the mechanisms resulting in Prader-Willi syndrome phenotype is maternal uniparental disomy of chromosome 15. Various mechanisms including trisomy rescue, monosomy rescue, and post fertilization errors can lead to uniparental disomy, and its mechanism can be inferred from the pattern of uniparental hetero and isodisomy. Detection of a mosaic cell line provides a unique opportunity to understand the mechanism of uniparental disomy; however, mosaic uniparental disomy is a rare finding in patients with Prader-Willi syndrome. We report on two infants with Prader-Willi syndrome caused by mosaic maternal uniparental disomy 15. Patient 1 has mosaic uniparental isodisomy of the entire chromosome 15, and Patient 2 has mosaic uniparental mixed iso/heterodisomy 15. Genome-wide single-nucleotide polymorphism array was able to demonstrate the presence of chromosomally normal cell line in the Patient 1 and trisomic cell line in Patient 2, and provide the evidence that post-fertilization error and trisomy rescue as a mechanism of uniparental disomy in each case, respectively. Given its ability of detecting small percent mosaicism as well as its capability of identifying the loss of heterozygosity of chromosomal regions, genome-wide single-nucleotide polymorphism array should be utilized as an adjunct to the standard methylation analysis in the evaluation of Prader-Willi syndrome. Copyright © 2012 Wiley Periodicals, Inc.

  14. A genome-wide SNP-association study confirms a sequence variant (g.66493737C>T in the equine myostatin (MSTN gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses

    Directory of Open Access Journals (Sweden)

    Whiston Ronan

    2010-10-01

    Full Text Available Abstract Background Thoroughbred horses have been selected for traits contributing to speed and stamina for centuries. It is widely recognized that inherited variation in physical and physiological characteristics is responsible for variation in individual aptitude for race distance, and that muscle phenotypes in particular are important. Results A genome-wide SNP-association study for optimum racing distance was performed using the EquineSNP50 Bead Chip genotyping array in a cohort of n = 118 elite Thoroughbred racehorses divergent for race distance aptitude. In a cohort-based association test we evaluated genotypic variation at 40,977 SNPs between horses suited to short distance (≤ 8 f and middle-long distance (> 8 f races. The most significant SNP was located on chromosome 18: BIEC2-417495 ~690 kb from the gene encoding myostatin (MSTN [Punadj. = 6.96 × 10-6]. Considering best race distance as a quantitative phenotype, a peak of association on chromosome 18 (chr18:65809482-67545806 comprising eight SNPs encompassing a 1.7 Mb region was observed. Again, similar to the cohort-based analysis, the most significant SNP was BIEC2-417495 (Punadj. = 1.61 × 10-9; PBonf. = 6.58 × 10-5. In a candidate gene study we have previously reported a SNP (g.66493737C>T in MSTN associated with best race distance in Thoroughbreds; however, its functional and genome-wide relevance were uncertain. Additional re-sequencing in the flanking regions of the MSTN gene revealed four novel 3' UTR SNPs and a 227 bp SINE insertion polymorphism in the 5' UTR promoter sequence. Linkage disequilibrium was highest between g.66493737C>T and BIEC2-417495 (r2 = 0.86. Conclusions Comparative association tests consistently demonstrated the g.66493737C>T SNP as the superior variant in the prediction of distance aptitude in racehorses (g.66493737C>T, P = 1.02 × 10-10; BIEC2-417495, Punadj. = 1.61 × 10-9. Functional investigations will be required to determine whether this

  15. Genome-Wide SNP Discovery, Genotyping and Their Preliminary Applications for Population Genetic Inference in Spotted Sea Bass (Lateolabrax maculatus.

    Directory of Open Access Journals (Sweden)

    Juan Wang

    Full Text Available Next-generation sequencing and the collection of genome-wide single-nucleotide polymorphisms (SNPs allow identifying fine-scale population genetic structure and genomic regions under selection. The spotted sea bass (Lateolabrax maculatus is a non-model species of ecological and commercial importance and widely distributed in northwestern Pacific. A total of 22 648 SNPs was discovered across the genome of L. maculatus by paired-end sequencing of restriction-site associated DNA (RAD-PE for 30 individuals from two populations. The nucleotide diversity (π for each population was 0.0028±0.0001 in Dandong and 0.0018±0.0001 in Beihai, respectively. Shallow but significant genetic differentiation was detected between the two populations analyzed by using both the whole data set (FST = 0.0550, P < 0.001 and the putatively neutral SNPs (FST = 0.0347, P < 0.001. However, the two populations were highly differentiated based on the putatively adaptive SNPs (FST = 0.6929, P < 0.001. Moreover, a total of 356 SNPs representing 298 unique loci were detected as outliers putatively under divergent selection by FST-based outlier tests as implemented in BAYESCAN and LOSITAN. Functional annotation of the contigs containing putatively adaptive SNPs yielded hits for 22 of 55 (40% significant BLASTX matches. Candidate genes for local selection constituted a wide array of functions, including binding, catalytic and metabolic activities, etc. The analyses with the SNPs developed in the present study highlighted the importance of genome-wide genetic variation for inference of population structure and local adaptation in L. maculatus.

  16. SNP marker detection and genotyping in tilapia

    NARCIS (Netherlands)

    Bers, van N.E.M.; Crooijmans, R.P.M.A.; Groenen, M.A.M.; Dibbits, B.W.; Komen, J.

    2012-01-01

    We have generated a unique resource consisting of nearly 175 000 short contig sequences and 3569 SNP markers from the widely cultured GIFT (Genetically Improved Farmed Tilapia) strain of Nile tilapia (Oreochromis niloticus). In total, 384 SNPs were selected to monitor the wider applicability of the

  17. Genome-wide distribution and organization of microsatellites in plants: an insight into marker development in Brachypodium.

    Directory of Open Access Journals (Sweden)

    Humira Sonah

    Full Text Available Plant genomes are complex and contain large amounts of repetitive DNA including microsatellites that are distributed across entire genomes. Whole genome sequences of several monocot and dicot plants that are available in the public domain provide an opportunity to study the origin, distribution and evolution of microsatellites, and also facilitate the development of new molecular markers. In the present investigation, a genome-wide analysis of microsatellite distribution in monocots (Brachypodium, sorghum and rice and dicots (Arabidopsis, Medicago and Populus was performed. A total of 797,863 simple sequence repeats (SSRs were identified in the whole genome sequences of six plant species. Characterization of these SSRs revealed that mono-nucleotide repeats were the most abundant repeats, and that the frequency of repeats decreased with increase in motif length both in monocots and dicots. However, the frequency of SSRs was higher in dicots than in monocots both for nuclear and chloroplast genomes. Interestingly, GC-rich repeats were the dominant repeats only in monocots, with the majority of them being present in the coding region. These coding GC-rich repeats were found to be involved in different biological processes, predominantly binding activities. In addition, a set of 22,879 SSR markers that were validated by e-PCR were developed and mapped on different chromosomes in Brachypodium for the first time, with a frequency of 101 SSR markers per Mb. Experimental validation of 55 markers showed successful amplification of 80% SSR markers in 16 Brachypodium accessions. An online database 'BraMi' (Brachypodium microsatellite markers of these genome-wide SSR markers was developed and made available in the public domain. The observed differential patterns of SSR marker distribution would be useful for studying microsatellite evolution in a monocot-dicot system. SSR markers developed in this study would be helpful for genomic studies in Brachypodium

  18. High-throughput development of genome-wide locus-specific informative SSR markers in wheat

    Science.gov (United States)

    Although simple sequence repeat (SSR) markers are not new, they are still useful and often used markers in molecular mapping and marker-assisted breeding, particularly in developing countries. However, locus-specific SSR markers could be more useful and informative in wheat breeding and genetic stud...

  19. DOMINO: development of informative molecular markers for phylogenetic and genome-wide population genetic studies in non-model organisms.

    Science.gov (United States)

    Frías-López, Cristina; Sánchez-Herrero, José F; Guirao-Rico, Sara; Mora, Elisa; Arnedo, Miquel A; Sánchez-Gracia, Alejandro; Rozas, Julio

    2016-12-15

    The development of molecular markers is one of the most important challenges in phylogenetic and genome wide population genetics studies, especially in studies with non-model organisms. A highly promising approach for obtaining suitable markers is the utilization of genomic partitioning strategies for the simultaneous discovery and genotyping of a large number of markers. Unfortunately, not all markers obtained from these strategies provide enough information for solving multiple evolutionary questions at a reasonable taxonomic resolution. We have developed Development Of Molecular markers In Non-model Organisms (DOMINO), a bioinformatics tool for informative marker development from both next generation sequencing (NGS) data and pre-computed sequence alignments. The application implements popular NGS tools with new utilities in a highly versatile pipeline specifically designed to discover or select personalized markers at different levels of taxonomic resolution. These markers can be directly used to study the taxa surveyed for their design, utilized for further downstream PCR amplification in a broader set taxonomic scope, or exploited as suitable templates to bait design for target DNA enrichment techniques. We conducted an exhaustive evaluation of the performance of DOMINO via computer simulations and illustrate its utility to find informative markers in an empirical dataset. DOMINO is freely available from www.ub.edu/softevol/domino CONTACT: elsanchez@ub.edu or jrozas@ub.eduSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  20. Genome-wide SNP identification, linkage map construction and QTL mapping for seed mineral concentrations and contents in pea (Pisum sativum L.).

    Science.gov (United States)

    Ma, Yu; Coyne, Clarice J; Grusak, Michael A; Mazourek, Michael; Cheng, Peng; Main, Dorrie; McGee, Rebecca J

    2017-02-13

    Marker-assisted breeding is now routinely used in major crops to facilitate more efficient cultivar improvement. This has been significantly enabled by the use of next-generation sequencing technology to identify loci and markers associated with traits of interest. While rich in a range of nutritional components, such as protein, mineral nutrients, carbohydrates and several vitamins, pea (Pisum sativum L.), one of the oldest domesticated crops in the world, remains behind many other crops in the availability of genomic and genetic resources. To further improve mineral nutrient levels in pea seeds requires the development of genome-wide tools. The objectives of this research were to develop these tools by: identifying genome-wide single nucleotide polymorphisms (SNPs) using genotyping by sequencing (GBS); constructing a high-density linkage map and comparative maps with other legumes, and identifying quantitative trait loci (QTL) for levels of boron, calcium, iron, potassium, magnesium, manganese, molybdenum, phosphorous, sulfur, and zinc in the seed, as well as for seed weight. In this study, 1609 high quality SNPs were found to be polymorphic between 'Kiflica' and 'Aragorn', two parents of an F 6 -derived recombinant inbred line (RIL) population. Mapping 1683 markers including 75 previously published markers and 1608 SNPs developed from the present study generated a linkage map of size 1310.1 cM. Comparative mapping with other legumes demonstrated that the highest level of synteny was observed between pea and the genome of Medicago truncatula. QTL analysis of the RIL population across two locations revealed at least one QTL for each of the mineral nutrient traits. In total, 46 seed mineral concentration QTLs, 37 seed mineral content QTLs, and 6 seed weight QTLs were discovered. The QTLs explained from 2.4% to 43.3% of the phenotypic variance. The genome-wide SNPs and the genetic linkage map developed in this study permitted QTL identification for pea seed mineral

  1. Hierarchical modeling of genome-wide Short Tandem Repeat (STR) markers infers native American prehistory.

    Science.gov (United States)

    Lewis, Cecil M

    2010-02-01

    This study examines a genome-wide dataset of 678 Short Tandem Repeat loci characterized in 444 individuals representing 29 Native American populations as well as the Tundra Netsi and Yakut populations from Siberia. Using these data, the study tests four current hypotheses regarding the hierarchical distribution of neutral genetic variation in native South American populations: (1) the western region of South America harbors more variation than the eastern region of South America, (2) Central American and western South American populations cluster exclusively, (3) populations speaking the Chibchan-Paezan and Equatorial-Tucanoan language stock emerge as a group within an otherwise South American clade, (4) Chibchan-Paezan populations in Central America emerge together at the tips of the Chibchan-Paezan cluster. This study finds that hierarchical models with the best fit place Central American populations, and populations speaking the Chibchan-Paezan language stock, at a basal position or separated from the South American group, which is more consistent with a serial founder effect into South America than that previously described. Western (Andean) South America is found to harbor similar levels of variation as eastern (Equatorial-Tucanoan and Ge-Pano-Carib) South America, which is inconsistent with an initial west coast migration into South America. Moreover, in all relevant models, the estimates of genetic diversity within geographic regions suggest a major bottleneck or founder effect occurring within the North American subcontinent, before the peopling of Central and South America. 2009 Wiley-Liss, Inc.

  2. Two non-synonymous markers in PTPN21, identified by genome-wide association study data-mining and replication, are associated with schizophrenia.

    LENUS (Irish Health Repository)

    Chen, Jingchun

    2011-09-01

    We conducted data-mining analyses of genome wide association (GWA) studies of the CATIE and MGS-GAIN datasets, and found 13 markers in the two physically linked genes, PTPN21 and EML5, showing nominally significant association with schizophrenia. Linkage disequilibrium (LD) analysis indicated that all 7 markers from PTPN21 shared high LD (r(2)>0.8), including rs2274736 and rs2401751, the two non-synonymous markers with the most significant association signals (rs2401751, P=1.10 × 10(-3) and rs2274736, P=1.21 × 10(-3)). In a meta-analysis of all 13 replication datasets with a total of 13,940 subjects, we found that the two non-synonymous markers are significantly associated with schizophrenia (rs2274736, OR=0.92, 95% CI: 0.86-0.97, P=5.45 × 10(-3) and rs2401751, OR=0.92, 95% CI: 0.86-0.97, P=5.29 × 10(-3)). One SNP (rs7147796) in EML5 is also significantly associated with the disease (OR=1.08, 95% CI: 1.02-1.14, P=6.43 × 10(-3)). These 3 markers remain significant after Bonferroni correction. Furthermore, haplotype conditioned analyses indicated that the association signals observed between rs2274736\\/rs2401751 and rs7147796 are statistically independent. Given the results that 2 non-synonymous markers in PTPN21 are associated with schizophrenia, further investigation of this locus is warranted.

  3. RAD-seq derived genome-wide nuclear markers resolve the phylogeny of tunas

    KAUST Repository

    Dí az-Arce, Natalia; Arrizabalaga, Haritz; Murua, Hilario; Irigoien, Xabier; Rodrí guez-Ezpeleta, Naiara

    2016-01-01

    conservation and management strategies for these species. Previous attempts based on mitochondrial and nuclear markers were unsuccessful in inferring a congruent and reliable phylogeny, probably due to mitochondrial introgression events and lack of enough

  4. Development of Genome-Wide SSR Markers from Angelica gigas Nakai Using Next Generation Sequencing.

    Science.gov (United States)

    Gil, Jinsu; Um, Yurry; Kim, Serim; Kim, Ok Tae; Koo, Sung Cheol; Reddy, Chinreddy Subramanyam; Kim, Seong-Cheol; Hong, Chang Pyo; Park, Sin-Gi; Kim, Ho Bang; Lee, Dong Hoon; Jeong, Byung-Hoon; Chung, Jong-Wook; Lee, Yi

    2017-09-21

    Angelica gigas Nakai is an important medicinal herb, widely utilized in Asian countries especially in Korea, Japan, and China. Although it is a vital medicinal herb, the lack of sequencing data and efficient molecular markers has limited the application of a genetic approach for horticultural improvements. Simple sequence repeats (SSRs) are universally accepted molecular markers for population structure study. In this study, we found over 130,000 SSRs, ranging from di- to deca-nucleotide motifs, using the genome sequence of Manchu variety (MV) of A. gigas, derived from next generation sequencing (NGS). From the putative SSR regions identified, a total of 16,496 primer sets were successfully designed. Among them, we selected 848 SSR markers that showed polymorphism from in silico analysis and contained tri- to hexa-nucleotide motifs. We tested 36 SSR primer sets for polymorphism in 16 A. gigas accessions. The average polymorphism information content (PIC) was 0.69; the average observed heterozygosity ( H O ) values, and the expected heterozygosity ( H E ) values were 0.53 and 0.73, respectively. These newly developed SSR markers would be useful tools for molecular genetics, genotype identification, genetic mapping, molecular breeding, and studying species relationships of the Angelica genus.

  5. Genome-wide ENU mutagenesis in combination with high density SNP analysis and exome sequencing provides rapid identification of novel mouse models of developmental disease.

    Directory of Open Access Journals (Sweden)

    Georgina Caruana

    Full Text Available Mice harbouring gene mutations that cause phenotypic abnormalities during organogenesis are invaluable tools for linking gene function to normal development and human disorders. To generate mouse models harbouring novel alleles that are involved in organogenesis we conducted a phenotype-driven, genome-wide mutagenesis screen in mice using the mutagen N-ethyl-N-nitrosourea (ENU.ENU was injected into male C57BL/6 mice and the mutations transmitted through the germ-line. ENU-induced mutations were bred to homozygosity and G3 embryos screened at embryonic day (E 13.5 and E18.5 for abnormalities in limb and craniofacial structures, skin, blood, vasculature, lungs, gut, kidneys, ureters and gonads. From 52 pedigrees screened 15 were detected with anomalies in one or more of the structures/organs screened. Using single nucleotide polymorphism (SNP-based linkage analysis in conjunction with candidate gene or next-generation sequencing (NGS we identified novel recessive alleles for Fras1, Ift140 and Lig1.In this study we have generated mouse models in which the anomalies closely mimic those seen in human disorders. The association between novel mutant alleles and phenotypes will lead to a better understanding of gene function in normal development and establish how their dysfunction causes human anomalies and disease.

  6. The history of human populations in the Japanese Archipelago inferred from genome-wide SNP data with a special reference to the Ainu and the Ryukyuan populations.

    Science.gov (United States)

    Jinam, Timothy; Nishida, Nao; Hirai, Momoki; Kawamura, Shoji; Oota, Hiroki; Umetsu, Kazuo; Kimura, Ryosuke; Ohashi, Jun; Tajima, Atsushi; Yamamoto, Toshimichi; Tanabe, Hideyuki; Mano, Shuhei; Suto, Yumiko; Kaname, Tadashi; Naritomi, Kenji; Yanagi, Kumiko; Niikawa, Norio; Omoto, Keiichi; Tokunaga, Katsushi; Saitou, Naruya

    2012-12-01

    The Japanese Archipelago stretches over 4000 km from north to south, and is the homeland of the three human populations; the Ainu, the Mainland Japanese and the Ryukyuan. The archeological evidence of human residence on this Archipelago goes back to >30 000 years, and various migration routes and root populations have been proposed. Here, we determined close to one million single-nucleotide polymorphisms (SNPs) for the Ainu and the Ryukyuan, and compared these with existing data sets. This is the first report of these genome-wide SNP data. Major findings are: (1) Recent admixture with the Mainland Japanese was observed for more than one third of the Ainu individuals from principal component analysis and frappe analyses; (2) The Ainu population seems to have experienced admixture with another population, and a combination of two types of admixtures is the unique characteristics of this population; (3) The Ainu and the Ryukyuan are tightly clustered with 100% bootstrap probability followed by the Mainland Japanese in the phylogenetic trees of East Eurasian populations. These results clearly support the dual structure model on the Japanese Archipelago populations, though the origins of the Jomon and the Yayoi people still remain to be solved.

  7. Development of a panel of genome-wide ancestry informative markers to study admixture throughout the Americas.

    Directory of Open Access Journals (Sweden)

    Joshua Mark Galanter

    Full Text Available Most individuals throughout the Americas are admixed descendants of Native American, European, and African ancestors. Complex historical factors have resulted in varying proportions of ancestral contributions between individuals within and among ethnic groups. We developed a panel of 446 ancestry informative markers (AIMs optimized to estimate ancestral proportions in individuals and populations throughout Latin America. We used genome-wide data from 953 individuals from diverse African, European, and Native American populations to select AIMs optimized for each of the three main continental populations that form the basis of modern Latin American populations. We selected markers on the basis of locus-specific branch length to be informative, well distributed throughout the genome, capable of being genotyped on widely available commercial platforms, and applicable throughout the Americas by minimizing within-continent heterogeneity. We then validated the panel in samples from four admixed populations by comparing ancestry estimates based on the AIMs panel to estimates based on genome-wide association study (GWAS data. The panel provided balanced discriminatory power among the three ancestral populations and accurate estimates of individual ancestry proportions (R² > 0.9 for ancestral components with significant between-subject variance. Finally, we genotyped samples from 18 populations from Latin America using the AIMs panel and estimated variability in ancestry within and between these populations. This panel and its reference genotype information will be useful resources to explore population history of admixture in Latin America and to correct for the potential effects of population stratification in admixed samples in the region.

  8. Genome-Wide Study of Percent Emphysema on Computed Tomography in the General Population. The Multi-Ethnic Study of Atherosclerosis Lung/SNP Health Association Resource Study

    Science.gov (United States)

    Manichaikul, Ani; Hoffman, Eric A.; Smolonska, Joanna; Gao, Wei; Cho, Michael H.; Baumhauer, Heather; Budoff, Matthew; Austin, John H. M.; Washko, George R.; Carr, J. Jeffrey; Kaufman, Joel D.; Pottinger, Tess; Powell, Charles A.; Wijmenga, Cisca; Zanen, Pieter; Groen, Harry J. M.; Postma, Dirkje S.; Wanner, Adam; Rouhani, Farshid N.; Brantly, Mark L.; Powell, Rhea; Smith, Benjamin M.; Rabinowitz, Dan; Raffel, Leslie J.; Hinckley Stukovsky, Karen D.; Crapo, James D.; Beaty, Terri H.; Hokanson, John E.; Silverman, Edwin K.; Dupuis, Josée; O’Connor, George T.; Boezen, H. Marike; Rich, Stephen S.

    2014-01-01

    Rationale: Pulmonary emphysema overlaps partially with spirometrically defined chronic obstructive pulmonary disease and is heritable, with moderately high familial clustering. Objectives: To complete a genome-wide association study (GWAS) for the percentage of emphysema-like lung on computed tomography in the Multi-Ethnic Study of Atherosclerosis (MESA) Lung/SNP Health Association Resource (SHARe) Study, a large, population-based cohort in the United States. Methods: We determined percent emphysema and upper-lower lobe ratio in emphysema defined by lung regions less than −950 HU on cardiac scans. Genetic analyses were reported combined across four race/ethnic groups: non-Hispanic white (n = 2,587), African American (n = 2,510), Hispanic (n = 2,113), and Chinese (n = 704) and stratified by race and ethnicity. Measurements and Main Results: Among 7,914 participants, we identified regions at genome-wide significance for percent emphysema in or near SNRPF (rs7957346; P = 2.2 × 10−8) and PPT2 (rs10947233; P = 3.2 × 10−8), both of which replicated in an additional 6,023 individuals of European ancestry. Both single-nucleotide polymorphisms were previously implicated as genes influencing lung function, and analyses including lung function revealed independent associations for percent emphysema. Among Hispanics, we identified a genetic locus for upper-lower lobe ratio near the α-mannosidase–related gene MAN2B1 (rs10411619; P = 1.1 × 10−9; minor allele frequency [MAF], 4.4%). Among Chinese, we identified single-nucleotide polymorphisms associated with upper-lower lobe ratio near DHX15 (rs7698250; P = 1.8 × 10−10; MAF, 2.7%) and MGAT5B (rs7221059; P = 2.7 × 10−8; MAF, 2.6%), which acts on α-linked mannose. Among African Americans, a locus near a third α-mannosidase–related gene, MAN1C1 (rs12130495; P = 9.9 × 10−6; MAF, 13.3%) was associated with percent emphysema. Conclusions: Our results suggest that some genes previously identified as

  9. Physical mapping of QTL for tuber yield, starch content and starch yield in tetraploid potato (Solanum tuberosum L.) by means of genome wide genotyping by sequencing and the 8.3 K SolCAP SNP array.

    Science.gov (United States)

    Schönhals, Elske Maria; Ding, Jia; Ritter, Enrique; Paulo, Maria João; Cara, Nicolás; Tacke, Ekhard; Hofferbert, Hans-Reinhard; Lübeck, Jens; Strahwald, Josef; Gebhardt, Christiane

    2017-08-22

    Tuber yield and starch content of the cultivated potato are complex traits of decisive importance for breeding improved varieties. Natural variation of tuber yield and starch content depends on the environment and on multiple, mostly unknown genetic factors. Dissection and molecular identification of the genes and their natural allelic variants controlling these complex traits will lead to the development of diagnostic DNA-based markers, by which precision and efficiency of selection can be increased (precision breeding). Three case-control populations were assembled from tetraploid potato cultivars based on maximizing the differences between high and low tuber yield (TY), starch content (TSC) and starch yield (TSY, arithmetic product of TY and TSC). The case-control populations were genotyped by restriction-site associated DNA sequencing (RADseq) and the 8.3 k SolCAP SNP genotyping array. The allele frequencies of single nucleotide polymorphisms (SNPs) were compared between cases and controls. RADseq identified, depending on data filtering criteria, between 6664 and 450 genes with one or more differential SNPs for one, two or all three traits. Differential SNPs in 275 genes were detected using the SolCAP array. A genome wide association study using the SolCAP array on an independent, unselected population identified SNPs associated with tuber starch content in 117 genes. Physical mapping of the genes containing differential or associated SNPs, and comparisons between the two genome wide genotyping methods and two different populations identified genome segments on all twelve potato chromosomes harboring one or more quantitative trait loci (QTL) for TY, TSC and TSY. Several hundred genes control tuber yield and starch content in potato. They are unequally distributed on all potato chromosomes, forming clusters between 0.5-4 Mbp width. The largest fraction of these genes had unknown function, followed by genes with putative signalling and regulatory functions. The

  10. Adiponectin Concentrations: A Genome-wide Association Study

    OpenAIRE

    Jee, Sun Ha; Sull, Jae Woong; Lee, Jong-Eun; Shin, Chol; Park, Jongkeun; Kimm, Heejin; Cho, Eun-Young; Shin, Eun-Soon; Yun, Ji Eun; Park, Ji Wan; Kim, Sang Yeun; Lee, Sun Ju; Jee, Eun Jung; Baik, Inkyung; Kao, Linda

    2010-01-01

    Adiponectin is associated with obesity and insulin resistance. To date, there has been no genome-wide association study (GWAS) of adiponectin levels in Asians. Here we present a GWAS of a cohort of Korean volunteers. A total of 4,001 subjects were genotyped by using a genome-wide marker panel in a two-stage design (979 subjects initially and 3,022 in a second stage). Another 2,304 subjects were used for follow-up replication studies with selected markers. In the discovery phase, the top SNP a...

  11. Genetic relatedness of indigenous ethnic groups in northern Borneo to neighboring populations from Southeast Asia, as inferred from genome-wide SNP data.

    Science.gov (United States)

    Yew, Chee Wei; Hoque, Mohd Zahirul; Pugh-Kitingan, Jacqueline; Minsong, Alexander; Voo, Christopher Lok Yung; Ransangan, Julian; Lau, Sophia Tiek Ying; Wang, Xu; Saw, Woei Yuh; Ong, Rick Twee-Hee; Teo, Yik-Ying; Xu, Shuhua; Hoh, Boon-Peng; Phipps, Maude E; Kumar, S Vijay

    2018-07-01

    The region of northern Borneo is home to the current state of Sabah, Malaysia. It is located closest to the southern Philippine islands and may have served as a viaduct for ancient human migration onto or off of Borneo Island. In this study, five indigenous ethnic groups from Sabah were subjected to genome-wide SNP genotyping. These individuals represent the "North Borneo"-speaking group of the great Austronesian family. They have traditionally resided in the inland region of Sabah. The dataset was merged with public datasets, and the genetic relatedness of these groups to neighboring populations from the islands of Southeast Asia, mainland Southeast Asia and southern China was inferred. Genetic structure analysis revealed that these groups formed a genetic cluster that was independent of the clusters of neighboring populations. Additionally, these groups exhibited near-absolute proportions of a genetic component that is also common among Austronesians from Taiwan and the Philippines. They showed no genetic admixture with Austro-Melanesian populations. Furthermore, phylogenetic analysis showed that they are closely related to non-Austro-Melansian Filipinos as well as to Taiwan natives but are distantly related to populations from mainland Southeast Asia. Relatively lower heterozygosity and higher pairwise genetic differentiation index (F ST ) values than those of nearby populations indicate that these groups might have experienced genetic drift in the past, resulting in their differentiation from other Austronesians. Subsequent formal testing suggested that these populations have received no gene flow from neighboring populations. Taken together, these results imply that the indigenous ethnic groups of northern Borneo shared a common ancestor with Taiwan natives and non-Austro-Melanesian Filipinos and then isolated themselves on the inland of Sabah. This isolation presumably led to no admixture with other populations, and these individuals therefore underwent

  12. Analysis of Genome-Wide Copy Number Variations in Chinese Indigenous and Western Pig Breeds by 60 K SNP Genotyping Arrays

    Science.gov (United States)

    Sun, Yaqi; Wang, Hongyang; Wang, Chao; Yu, Shaobo; Liu, Jing; Zhang, Yu; Fan, Bin; Li, Kui; Liu, Bang

    2014-01-01

    Copy number variations (CNVs) represent a substantial source of structural variants in mammals and contribute to both normal phenotypic variability and disease susceptibility. Although low-resolution CNV maps are produced in many domestic animals, and several reports have been published about the CNVs of porcine genome, the differences between Chinese and western pigs still remain to be elucidated. In this study, we used Porcine SNP60 BeadChip and PennCNV algorithm to perform a genome-wide CNV detection in 302 individuals from six Chinese indigenous breeds (Tongcheng, Laiwu, Luchuan, Bama, Wuzhishan and Ningxiang pigs), three western breeds (Yorkshire, Landrace and Duroc) and one hybrid (Tongcheng×Duroc). A total of 348 CNV Regions (CNVRs) across genome were identified, covering 150.49 Mb of the pig genome or 6.14% of the autosomal genome sequence. In these CNVRs, 213 CNVRs were found to exist only in the six Chinese indigenous breeds, and 60 CNVRs only in the three western breeds. The characters of CNVs in four Chinese normal size breeds (Luchuan, Tongcheng and Laiwu pigs) and two minipig breeds (Bama and Wuzhishan pigs) were also analyzed in this study. Functional annotation suggested that these CNVRs possess a great variety of molecular function and may play important roles in phenotypic and production traits between Chinese and western breeds. Our results are important complementary to the CNV map in pig genome, which provide new information about the diversity of Chinese and western pig breeds, and facilitate further research on porcine genome CNVs. PMID:25198154

  13. The genetic aetiology of cannabis use initiation: A meta-analysis of genome-wide association studies and a SNP-based heritability estimation

    NARCIS (Netherlands)

    Verweij, K.J.H.; Vinkhuyzen, A.A.E.; Benyamin, B.; Lynskey, M.T.; Quaye, L.; Agrawal, A.; Gordon, S.D.; Montgomery, G.W.; Madden, P.A.F.; Heath, A.C.; Spector, T.D.; Martin, N.G.; Medland, S.E.

    2013-01-01

    While initiation of cannabis use is around 40% heritable, not much is known about the underlying genetic aetiology. Here, we meta-analysed two genome-wide association studies of initiation of cannabis use with >10000 individuals. None of the genetic variants reached genome-wide significance. We also

  14. The genetic etiology of cannabis use initiation: a meta-analysis of genome-wide association studies, and a SNP-based heritability estimation.

    NARCIS (Netherlands)

    Verweij, K.J.H.; Vinkhuyzen, A.A.E.; Benyamin, B.; Lynskey, M.T.; Quaye, L.; Agrawal, A.; Gordon, S.D.; Montgomery, G.W.; Madden, P.A.F.; Heath, A.C.; Spector, T.D.; Martin, N.G.; Medland, S.E.

    2013-01-01

    While initiation of cannabis use is around 40% heritable, not much is known about the underlying genetic aetiology. Here, we meta-analysed two genome-wide association studies of initiation of cannabis use with > 10 000 individuals. None of the genetic variants reached genome-wide significance. We

  15. Genome-Wide DNA Copy Number Analysis of Acute Lymphoblastic Leukemia Identifies New Genetic Markers Associated with Clinical Outcome.

    Directory of Open Access Journals (Sweden)

    Maribel Forero-Castro

    Full Text Available Identifying additional genetic alterations associated with poor prognosis in acute lymphoblastic leukemia (ALL is still a challenge.To characterize the presence of additional DNA copy number alterations (CNAs in children and adults with ALL by whole-genome oligonucleotide array (aCGH analysis, and to identify their associations with clinical features and outcome. Array-CGH was carried out in 265 newly diagnosed ALLs (142 children and 123 adults. The NimbleGen CGH 12x135K array (Roche was used to analyze genetic gains and losses. CNAs were analyzed with GISTIC and aCGHweb software. Clinical and biological variables were analyzed. Three of the patients showed chromothripsis (cth6, cth14q and cth15q. CNAs were associated with age, phenotype, genetic subtype and overall survival (OS. In the whole cohort of children, the losses on 14q32.33 (p = 0.019 and 15q13.2 (p = 0.04 were related to shorter OS. In the group of children without good- or poor-risk cytogenetics, the gain on 1p36.11 was a prognostic marker independently associated with shorter OS. In adults, the gains on 19q13.2 (p = 0.001 and Xp21.1 (p = 0.029, and the loss of 17p (p = 0.014 were independent markers of poor prognosis with respect to OS. In summary, CNAs are frequent in ALL and are associated with clinical parameters and survival. Genome-wide DNA copy number analysis allows the identification of genetic markers that predict clinical outcome, suggesting that detection of these genetic lesions will be useful in the management of patients newly diagnosed with ALL.

  16. (SNP) markers for the Chinese black sleeper, Bostrychus sinensis

    African Journals Online (AJOL)

    We characterized 11 single nucleotide ploymorphism (SNP) markers for the Chinese black sleeper, Bostrychus sinensis. These markers were isolated from a genomic library and tested in ten geographically distant individuals of B. sinensis. Polymorphisms of these SNP loci were assessed using a wild population including ...

  17. Genome-wide study of percent emphysema on computed tomography in the general population. The Multi-Ethnic Study of Atherosclerosis Lung/SNP Health Association Resource Study

    NARCIS (Netherlands)

    Manichaikul, Ani; Hoffman, Eric A.; Smolonska, Joanna; Gao, Wei; Cho, Michael H.; Baumhauer, Heather; Budoff, Matthew; Austin, John H. M.; Washko, George R.; Carr, J. Jeffrey; Kaufman, Joel D.; Pottinger, Tess; Powell, Charles A.; Wijmenga, Cisca; Zanen, Pieter; Groen, Harry J.M.; Postma, Dirkje S.; Wanner, Adam; Rouhani, Farshid N.; Brantly, Mark L.; Powell, Rhea; Smith, Benjamin M.; Rabinowitz, Dan; Raffel, Leslie J.; Stukovsky, Karen D. Hinckley; Crapo, James D.; Beaty, Terri H.; Hokanson, John E.; Silverman, Edwin K.; Dupuis, Josee; O'Connor, George T.; Boezen, Hendrika; Rich, Stephen S.; Barr, R. Graham

    2014-01-01

    Rationale: Pulmonary emphysema overlaps partially with spirometrically defined chronic obstructive pulmonary disease and is heritable, with moderately high familial clustering. Objectives: To complete a genome-wide association study (GWAS) for the percentage of emphysema-like lung on computed

  18. Population structure revealed by different marker types (SSR or DArT) has an impact on the results of genome-wide association mapping in European barley cultivars

    NARCIS (Netherlands)

    Matthies, I.E.; Hintum, van T.J.L.; Weise, S.; Röder, M.S.

    2012-01-01

    Diversity arrays technology (DArT) and simple sequence repeat (SSR) markers were applied to investigate population structure, extent of linkage disequilibrium and genetic diversity (kinship) on a genome-wide level in European barley (Hordeum vulgare L.) cultivars. A set of 183 varieties could be

  19. Genome-wide association study, genomic prediction and marker-assisted selection for seed weight in soybean (Glycine max).

    Science.gov (United States)

    Zhang, Jiaoping; Song, Qijian; Cregan, Perry B; Jiang, Guo-Liang

    2016-01-01

    Twenty-two loci for soybean SW and candidate genes conditioning seed development were identified; and prediction accuracies of GS and MAS were estimated through cross-validation and validation with unrelated populations. Soybean (Glycine max) is a major crop for plant protein and oil production, and seed weight (SW) is important for yield and quality in food/vegetable uses of soybean. However, our knowledge of genes controlling SW remains limited. To better understand the molecular mechanism underlying the trait and explore marker-based breeding approaches, we conducted a genome-wide association study in a population of 309 soybean germplasm accessions using 31,045 single nucleotide polymorphisms (SNPs), and estimated the prediction accuracy of genomic selection (GS) and marker-assisted selection (MAS) for SW. Twenty-two loci of minor effect associated with SW were identified, including hotspots on Gm04 and Gm19. The mixed model containing these loci explained 83.4% of phenotypic variation. Candidate genes with Arabidopsis orthologs conditioning SW were also proposed. The prediction accuracies of GS and MAS by cross-validation were 0.75-0.87 and 0.62-0.75, respectively, depending on the number of SNPs used and the size of training population. GS also outperformed MAS when the validation was performed using unrelated panels across a wide range of maturities, with an average prediction accuracy of 0.74 versus 0.53. This study convincingly demonstrated that soybean SW is controlled by numerous minor-effect loci. It greatly enhances our understanding of the genetic basis of SW in soybean and facilitates the identification of genes controlling the trait. It also suggests that GS holds promise for accelerating soybean breeding progress. The results are helpful for genetic improvement and genomic prediction of yield in soybean.

  20. Developing a novel panel of genome-wide ancestry informative markers for bio-geographical ancestry estimates.

    Science.gov (United States)

    Jia, Jing; Wei, Yi-Liang; Qin, Cui-Jiao; Hu, Lan; Wan, Li-Hua; Li, Cai-Xia

    2014-01-01

    Inferring the ancestral origin of DNA samples can be helpful in correcting population stratification in disease association studies or guiding crime investigations. Populations throughout the world vary in appearance features and biological characteristics. Based on this idea, we performed a genome-wide scan for SNPs within genes that are related to physical and biological traits. Using the HapMap database, we screened 52 genes and their flanking regions. Thirty-five SNPs that displayed highly contrasting allele frequencies (F(st)>0.3, linkage disequilibrium r(2)0.001) among Africans, Europeans, and East Asians were selected and validated. A multiplexed assay was developed to genotype these 35 SNPs in 357 individuals from 10 populations worldwide. This panel provided accurate estimates of individual ancestry proportions with balanced discriminatory power among the three continental ancestries: Africans, Europeans, and East Asians. It also proved very effective in evaluating admixed populations living in joint regions of continents (e.g., Uyghurs and Indians) and discriminating some subpopulations within each of the three continents. Structure analysis was performed to establish and evaluate the panel of ancestry-informative markers, and the components of each population were also described to indicate the structural composition. The 21 population structures in our study are consistent with geographic patterns, and individuals were properly assigned to their original ancestral populations with proportion analyses and random match probability calculations. Thus, the panel and its population information will be useful resources to minimize the effects of population stratification in association analyses and to assign the most likely origin of an unknown DNA contributor in forensic investigations. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  1. Fine mapping of breast cancer genome-wide association studies loci in women of African ancestry identifies novel susceptibility markers.

    Science.gov (United States)

    Zheng, Yonglan; Ogundiran, Temidayo O; Falusi, Adeyinka G; Nathanson, Katherine L; John, Esther M; Hennis, Anselm J M; Ambs, Stefan; Domchek, Susan M; Rebbeck, Timothy R; Simon, Michael S; Nemesure, Barbara; Wu, Suh-Yuh; Leske, Maria Cristina; Odetunde, Abayomi; Niu, Qun; Zhang, Jing; Afolabi, Chibuzor; Gamazon, Eric R; Cox, Nancy J; Olopade, Christopher O; Olopade, Olufunmilayo I; Huo, Dezheng

    2013-07-01

    Numerous single nucleotide polymorphisms (SNPs) associated with breast cancer susceptibility have been identified by genome-wide association studies (GWAS). However, these SNPs were primarily discovered and validated in women of European and Asian ancestry. Because linkage disequilibrium is ancestry-dependent and heterogeneous among racial/ethnic populations, we evaluated common genetic variants at 22 GWAS-identified breast cancer susceptibility loci in a pooled sample of 1502 breast cancer cases and 1378 controls of African ancestry. None of the 22 GWAS index SNPs could be validated, challenging the direct generalizability of breast cancer risk variants identified in Caucasians or Asians to other populations. Novel breast cancer risk variants for women of African ancestry were identified in regions including 5p12 (odds ratio [OR] = 1.40, 95% confidence interval [CI] = 1.11-1.76; P = 0.004), 5q11.2 (OR = 1.22, 95% CI = 1.09-1.36; P = 0.00053) and 10p15.1 (OR = 1.22, 95% CI = 1.08-1.38; P = 0.0015). We also found positive association signals in three regions (6q25.1, 10q26.13 and 16q12.1-q12.2) previously confirmed by fine mapping in women of African ancestry. In addition, polygenic model indicated that eight best markers in this study, compared with 22 GWAS-identified SNPs, could better predict breast cancer risk in women of African ancestry (per-allele OR = 1.21, 95% CI = 1.16-1.27; P = 9.7 × 10(-16)). Our results demonstrate that fine mapping is a powerful approach to better characterize the breast cancer risk alleles in diverse populations. Future studies and new GWAS in women of African ancestry hold promise to discover additional variants for breast cancer susceptibility with clinical implications throughout the African diaspora.

  2. A custom correlation coefficient (CCC) approach for fast identification of multi-SNP association patterns in genome-wide SNPs data.

    Science.gov (United States)

    Climer, Sharlee; Yang, Wei; de las Fuentes, Lisa; Dávila-Román, Victor G; Gu, C Charles

    2014-11-01

    Complex diseases are often associated with sets of multiple interacting genetic factors and possibly with unique sets of the genetic factors in different groups of individuals (genetic heterogeneity). We introduce a novel concept of custom correlation coefficient (CCC) between single nucleotide polymorphisms (SNPs) that address genetic heterogeneity by measuring subset correlations autonomously. It is used to develop a 3-step process to identify candidate multi-SNP patterns: (1) pairwise (SNP-SNP) correlations are computed using CCC; (2) clusters of so-correlated SNPs identified; and (3) frequencies of these clusters in disease cases and controls compared to identify disease-associated multi-SNP patterns. This method identified 42 candidate multi-SNP associations with hypertensive heart disease (HHD), among which one cluster of 22 SNPs (six genes) included 13 in SLC8A1 (aka NCX1, an essential component of cardiac excitation-contraction coupling) and another of 32 SNPs had 29 from a different segment of SLC8A1. While allele frequencies show little difference between cases and controls, the cluster of 22 associated alleles were found in 20% of controls but no cases and the other in 3% of controls but 20% of cases. These suggest that both protective and risk effects on HHD could be exerted by combinations of variants in different regions of SLC8A1, modified by variants from other genes. The results demonstrate that this new correlation metric identifies disease-associated multi-SNP patterns overlooked by commonly used correlation measures. Furthermore, computation time using CCC is a small fraction of that required by other methods, thereby enabling the analyses of large GWAS datasets. © 2014 WILEY PERIODICALS, INC.

  3. Genome-wide generation and use of informative intron-spanning and intron-length polymorphism markers for high-throughput genetic analysis in rice

    Science.gov (United States)

    Badoni, Saurabh; Das, Sweta; Sayal, Yogesh K.; Gopalakrishnan, S.; Singh, Ashok K.; Rao, Atmakuri R.; Agarwal, Pinky; Parida, Swarup K.; Tyagi, Akhilesh K.

    2016-01-01

    We developed genome-wide 84634 ISM (intron-spanning marker) and 16510 InDel-fragment length polymorphism-based ILP (intron-length polymorphism) markers from genes physically mapped on 12 rice chromosomes. These genic markers revealed much higher amplification-efficiency (80%) and polymorphic-potential (66%) among rice accessions even by a cost-effective agarose gel-based assay. A wider level of functional molecular diversity (17–79%) and well-defined precise admixed genetic structure was assayed by 3052 genome-wide markers in a structured population of indica, japonica, aromatic and wild rice. Six major grain weight QTLs (11.9–21.6% phenotypic variation explained) were mapped on five rice chromosomes of a high-density (inter-marker distance: 0.98 cM) genetic linkage map (IR 64 x Sonasal) anchored with 2785 known/candidate gene-derived ISM and ILP markers. The designing of multiple ISM and ILP markers (2 to 4 markers/gene) in an individual gene will broaden the user-preference to select suitable primer combination for efficient assaying of functional allelic variation/diversity and realistic estimation of differential gene expression profiles among rice accessions. The genomic information generated in our study is made publicly accessible through a user-friendly web-resource, “Oryza ISM-ILP marker” database. The known/candidate gene-derived ISM and ILP markers can be enormously deployed to identify functionally relevant trait-associated molecular tags by optimal-resource expenses, leading towards genomics-assisted crop improvement in rice. PMID:27032371

  4. Genome-wide association mapping for female fertility traits in Danish and Swedish Holstein cattle

    DEFF Research Database (Denmark)

    Sahana, G; Guldbrandtsen, B; Bendixen, C

    2010-01-01

    A genome-wide association study was conducted using a mixed model analysis for QTL for fertility traits in Danish and Swedish Holstein cattle. The analysis incorporated 2,531 progeny tested bulls, and a total of 36 387 SNP markers on 29 bovine autosomes were used. Eleven fertility traits were ana...

  5. Genome-wide SNP data and morphology support the distinction of two new species of Kovarikia Soleglad, Fet & Graham, 2014 endemic to California (Scorpiones, Vaejovidae

    Directory of Open Access Journals (Sweden)

    Robert W. Bryson Jr.

    2018-02-01

    Full Text Available Morphologically conserved taxa such as scorpions represent a challenge to delimit. We recently discovered populations of scorpions in the genus Kovarikia Soleglad, Fet & Graham, 2014 on two isolated mountain ranges in southern California. We generated genome-wide single nucleotide polymorphism data and used Bayes factors species delimitation to compare alternative species delimitation scenarios which variously placed scorpions from the two localities with geographically adjacent species or into separate lineages. We also estimated a time-calibrated phylogeny of Kovarikia and examined and compared the morphology of preserved specimens from across its distribution. Genetic results strongly support the distinction of two new lineages, which we describe and name here. Morphology among the species of Kovarikia was relatively conserved, despite deep genetic divergences, consistent with recent studies of stenotopic scorpions with limited vagility. Phylogeographic structure discovered in several previously described species also suggests additional cryptic species are probably present in the genus.

  6. Genome-wide SNP data and morphology support the distinction of two new species of Kovarikia Soleglad, Fet & Graham, 2014 endemic to California (Scorpiones, Vaejovidae)

    Science.gov (United States)

    Bryson, Robert W.; Wood, Dustin A.; Graham, Matthew R.; Soleglad, Michael E.; McCormack, John E.

    2018-01-01

    Morphologically conserved taxa such as scorpions represent a challenge to delimit. We recently discovered populations of scorpions in the genus Kovarikia Soleglad, Fet & Graham, 2014 on two isolated mountain ranges in southern California. We generated genome-wide single nucleotide polymorphism data and used Bayes factors species delimitation to compare alternative species delimitation scenarios which variously placed scorpions from the two localities with geographically adjacent species or into separate lineages. We also estimated a time-calibrated phylogeny of Kovarikia and examined and compared the morphology of preserved specimens from across its distribution. Genetic results strongly support the distinction of two new lineages, which we describe and name here. Morphology among the species of Kovarikia was relatively conserved, despite deep genetic divergences, consistent with recent studies of stenotopic scorpions with limited vagility. Phylogeographic structure discovered in several previously described species also suggests additional cryptic species are probably present in the genus.

  7. Genome-wide development and deployment of informative intron-spanning and intron-length polymorphism markers for genomics-assisted breeding applications in chickpea.

    Science.gov (United States)

    Srivastava, Rishi; Bajaj, Deepak; Sayal, Yogesh K; Meher, Prabina K; Upadhyaya, Hari D; Kumar, Rajendra; Tripathi, Shailesh; Bharadwaj, Chellapilla; Rao, Atmakuri R; Parida, Swarup K

    2016-11-01

    The discovery and large-scale genotyping of informative gene-based markers is essential for rapid delineation of genes/QTLs governing stress tolerance and yield component traits in order to drive genetic enhancement in chickpea. A genome-wide 119169 and 110491 ISM (intron-spanning markers) from 23129 desi and 20386 kabuli protein-coding genes and 7454 in silico InDel (insertion-deletion) (1-45-bp)-based ILP (intron-length polymorphism) markers from 3283 genes were developed that were structurally and functionally annotated on eight chromosomes and unanchored scaffolds of chickpea. A much higher amplification efficiency (83%) and intra-specific polymorphic potential (86%) detected by these markers than that of other sequence-based genetic markers among desi and kabuli chickpea accessions was apparent even by a cost-effective agarose gel-based assay. The genome-wide physically mapped 1718 ILP markers assayed a wider level of functional genetic diversity (19-81%) and well-defined phylogenetics among domesticated chickpea accessions. The gene-derived 1424 ILP markers were anchored on a high-density (inter-marker distance: 0.65cM) desi intra-specific genetic linkage map/functional transcript map (ICC 4958×ICC 2263) of chickpea. This reference genetic map identified six major genomic regions harbouring six robust QTLs mapped on five chromosomes, which explained 11-23% seed weight trait variation (7.6-10.5 LOD) in chickpea. The integration of high-resolution QTL mapping with differential expression profiling detected six including one potential serine carboxypeptidase gene with ILP markers (linked tightly to the major seed weight QTLs) exhibiting seed-specific expression as well as pronounced up-regulation especially in seeds of high (ICC 4958) as compared to low (ICC 2263) seed weight mapping parental accessions. The marker information generated in the present study was made publicly accessible through a user-friendly web-resource, "Chickpea ISM-ILP Marker Database

  8. Development of a single nucleotide polymorphism (SNP) marker for ...

    African Journals Online (AJOL)

    The nature of the single nucleotide polymorphism (SNP) marker was validated by DNA sequencing of the parental PCR products. Using high resolution melt (HRM) profiles and normalised difference plots, we successfully differentiated the homozygous dominant (wild type), homozygous recessive (LPA) and heterozygous ...

  9. (SNP) markers for the Chinese black sleeper, Bostrychus sinensis

    African Journals Online (AJOL)

    ajl yemi

    2011-04-25

    Apr 25, 2011 ... Polynesia, north to Japan and south to Australia (Kottelat et al., 1993; Masuda ... developed the first set of SNP markers for Chinese black sleeper which can ... Then, the. 44 primer pairs were designed based on all the cloning.

  10. Three gangliogliomas: results of GTG-banding, SKY, genome-wide high resolution SNP-array, gene expression and review of the literature.

    Science.gov (United States)

    Xu, Li-Xin; Holland, Heidrun; Kirsten, Holger; Ahnert, Peter; Krupp, Wolfgang; Bauer, Manfred; Schober, Ralf; Mueller, Wolf; Fritzsch, Dominik; Meixensberger, Jürgen; Koschny, Ronald

    2015-04-01

    According to the World Health Organization gangliogliomas are classified as well-differentiated and slowly growing neuroepithelial tumors, composed of neoplastic mature ganglion and glial cells. It is the most frequent tumor entity observed in patients with long-term epilepsy. Comprehensive cytogenetic and molecular cytogenetic data including high-resolution genomic profiling (single nucleotide polymorphism (SNP)-array) of gangliogliomas are scarce but necessary for a better oncological understanding of this tumor entity. For a detailed characterization at the single cell and cell population levels, we analyzed genomic alterations of three gangliogliomas using trypsin-Giemsa banding (GTG-banding) and by spectral karyotyping (SKY) in combination with SNP-array and gene expression array experiments. By GTG and SKY, we could confirm frequently detected chromosomal aberrations (losses within chromosomes 10, 13 and 22; gains within chromosomes 5, 7, 8 and 12), and identify so far unknown genetic aberrations like the unbalanced non-reciprocal translocation t(1;18)(q21;q21). Interestingly, we report on the second so far detected ganglioglioma with ring chromosome 1. Analyses of SNP-array data from two of the tumors and respective germline DNA (peripheral blood) identified few small gains and losses and a number of copy-neutral regions with loss of heterozygosity (LOH) in germline and in tumor tissue. In comparison to germline DNA, tumor tissues did not show substantial regions with significant loss or gain or with newly developed LOH. Gene expression analyses of tumor-specific genes revealed similarities in the profile of the analyzed samples regarding different relevant pathways. Taken together, we describe overlapping but also distinct and novel genetic aberrations of three gangliogliomas. © 2014 Japanese Society of Neuropathology.

  11. Adiponectin Concentrations: A Genome-wide Association Study

    Science.gov (United States)

    Jee, Sun Ha; Sull, Jae Woong; Lee, Jong-Eun; Shin, Chol; Park, Jongkeun; Kimm, Heejin; Cho, Eun-Young; Shin, Eun-Soon; Yun, Ji Eun; Park, Ji Wan; Kim, Sang Yeun; Lee, Sun Ju; Jee, Eun Jung; Baik, Inkyung; Kao, Linda; Yoon, Sungjoo Kim; Jang, Yangsoo; Beaty, Terri H.

    2010-01-01

    Adiponectin is associated with obesity and insulin resistance. To date, there has been no genome-wide association study (GWAS) of adiponectin levels in Asians. Here we present a GWAS of a cohort of Korean volunteers. A total of 4,001 subjects were genotyped by using a genome-wide marker panel in a two-stage design (979 subjects initially and 3,022 in a second stage). Another 2,304 subjects were used for follow-up replication studies with selected markers. In the discovery phase, the top SNP associated with mean log adiponectin was rs3865188 in CDH13 on chromosome 16 (p = 1.69 × 10−15 in the initial sample, p = 6.58 × 10−39 in the second genome-wide sample, and p = 2.12 × 10−32 in the replication sample). The meta-analysis p value for rs3865188 in all 6,305 individuals was 2.82 × 10−83. The association of rs3865188 with high-molecular-weight adiponectin (p = 7.36 × 10−58) was even stronger in the third sample. A reporter assay that evaluated the effects of a CDH13 promoter SNP in complete linkage disequilibrium with rs3865188 revealed that the major allele increased expression 2.2-fold. This study clearly shows that genetic variants in CDH13 influence adiponectin levels in Korean adults. PMID:20887962

  12. Genetic susceptibility markers for a breast-colorectal cancer phenotype: Exploratory results from genome-wide association studies

    Science.gov (United States)

    Joon, Aron; Brewster, Abenaa M.; Chen, Wei V.; Eng, Cathy; Shete, Sanjay; Casey, Graham; Schumacher, Fredrick; Lin, Yi; Harrison, Tabitha A.; White, Emily; Ahsan, Habibul; Andrulis, Irene L.; Whittemore, Alice S.; Ko Win, Aung; Schmidt, Daniel F.; Kapuscinski, Miroslaw K.; Ochs-Balcom, Heather M.; Gallinger, Steven; Jenkins, Mark A.; Newcomb, Polly A.; Lindor, Noralane M.; Peters, Ulrike; Amos, Christopher I.; Lynch, Patrick M.

    2018-01-01

    Background Clustering of breast and colorectal cancer has been observed within some families and cannot be explained by chance or known high-risk mutations in major susceptibility genes. Potential shared genetic susceptibility between breast and colorectal cancer, not explained by high-penetrance genes, has been postulated. We hypothesized that yet undiscovered genetic variants predispose to a breast-colorectal cancer phenotype. Methods To identify variants associated with a breast-colorectal cancer phenotype, we analyzed genome-wide association study (GWAS) data from cases and controls that met the following criteria: cases (n = 985) were women with breast cancer who had one or more first- or second-degree relatives with colorectal cancer, men/women with colorectal cancer who had one or more first- or second-degree relatives with breast cancer, and women diagnosed with both breast and colorectal cancer. Controls (n = 1769), were unrelated, breast and colorectal cancer-free, and age- and sex- frequency-matched to cases. After imputation, 6,220,060 variants were analyzed using the discovery set and variants associated with the breast-colorectal cancer phenotype at Pcolorectal cancer phenotype in the discovery and replication data (most significant; rs7430339, Pdiscovery = 1.2E-04; rs7429100, Preplication = 2.8E-03). In meta-analysis of the discovery and replication data, the most significant association remained at rs7429100 (P = 1.84E-06). Conclusion The results of this exploratory analysis did not find clear evidence for a susceptibility locus with a pleiotropic effect on hereditary breast and colorectal cancer risk, although the suggestive association of genetic variation in the region of ROBO1, a potential tumor suppressor gene, merits further investigation. PMID:29698419

  13. Genome-wide SNP identification, linkage map construction and QTL mapping for seed mineral concentrations and contents in pea (Pisum sativum L.)

    OpenAIRE

    Ma, Yu; Coyne, Clarice J; Grusak, Michael A; Mazourek, Michael; Cheng, Peng; Main, Dorrie; McGee, Rebecca J

    2017-01-01

    Background Marker-assisted breeding is now routinely used in major crops to facilitate more efficient cultivar improvement. This has been significantly enabled by the use of next-generation sequencing technology to identify loci and markers associated with traits of interest. While rich in a range of nutritional components, such as protein, mineral nutrients, carbohydrates and several vitamins, pea (Pisum sativum L.), one of the oldest domesticated crops in the world, remains behind many othe...

  14. Using case-control designs for genome-wide screening for associations between genetic markers and disease susceptibility loci.

    Science.gov (United States)

    Yang, Q; Khoury, M J; Atkinson, M; Sun, F; Cheng, R; Flanders, W D

    1999-01-01

    We used a case-control design to scan the genome for any associations between genetic markers and disease susceptibility loci using the first two replicates of the Mycenaean population from the GAW11 (Problem 2) data. Using a case-control approach, we constructed a series of 2-by-3 tables for each allele of every marker on all six chromosomes. Odds ratios (ORs) and 95% confidence intervals (95% CI) were estimated for all alleles of every marker. We selected the one allele for which the estimated OR had the minimum p-value to plot in the graph. Among these selected ORs, we calculated 95% CI for those that had a p-value Mycenaean population, the case-control design identified allele number 1 of marker 24 on chromosome 1 to be associated with a disease susceptibility gene, OR = 2.10 (95% CI 1.66-2.62). Our approach failed to show any other significant association between case-control status and genetic markers. Stratified analysis on the environmental risk factor (E1) provided no further evidence of significant association other than allele 1 of marker 24 on chromosome 1. These data indicate the absence of linkage disequilibrium for markers flanking loci A, B, and C. Finally, we examined the effect of gene x environment (G x E) interaction for the identified allele. Our results provided no evidence of G x E interaction, but suggested that the environmental exposure alone was a risk factor for the disease.

  15. Genome-wide analysis of SSR and ILP markers in trees: diversity profiling, alternate distribution, and applications in duplication.

    Science.gov (United States)

    Xia, Xinyao; Luan, Lin Lin; Qin, Guanghua; Yu, Li Fang; Wang, Zhi Wei; Dong, Wan Chen; Song, Yumin; Qiao, Yuling; Zhang, Xian Sheng; Sang, Ya Lin; Yang, Long

    2017-12-20

    Molecular markers are efficient tools for breeding and genetic studies. However, despite their ecological and economic importance, their development and application have long been hampered. In this study, we identified 524,170 simple sequence repeat (SSR), 267,636 intron length polymorphism (ILP), and 11,872 potential intron polymorphism (PIP) markers from 16 tree species based on recently available genome sequences. Larger motifs, including hexamers and heptamers, accounted for most of the seven different types of SSR loci. Within these loci, A/T bases comprised a significantly larger proportion of sequence than G/C. SSR and ILP markers exhibited an alternative distribution pattern. Most SSRs were monomorphic markers, and the proportions of polymorphic markers were positively correlated with genome size. By verifying with all 16 tree species, 54 SSR, 418 ILP, and four PIP universal markers were obtained, and their efficiency was examined by PCR. A combination of five SSR and six ILP markers were used for the phylogenetic analysis of 30 willow samples, revealing a positive correlation between genetic diversity and geographic distance. We also found that SSRs can be used as tools for duplication analysis. Our findings provide important foundations for the development of breeding and genetic studies in tree species.

  16. Identification of novel genetic markers associated with clinical phenotypes of systemic sclerosis through a genome-wide association strategy.

    Directory of Open Access Journals (Sweden)

    Olga Gorlova

    2011-07-01

    Full Text Available The aim of this study was to determine, through a genome-wide association study (GWAS, the genetic components contributing to different clinical sub-phenotypes of systemic sclerosis (SSc. We considered limited (lcSSc and diffuse (dcSSc cutaneous involvement, and the relationships with presence of the SSc-specific auto-antibodies, anti-centromere (ACA, and anti-topoisomerase I (ATA. Four GWAS cohorts, comprising 2,296 SSc patients and 5,171 healthy controls, were meta-analyzed looking for associations in the selected subgroups. Eighteen polymorphisms were further tested in nine independent cohorts comprising an additional 3,175 SSc patients and 4,971 controls. Conditional analysis for associated SNPs in the HLA region was performed to explore their independent association in antibody subgroups. Overall analysis showed that non-HLA polymorphism rs11642873 in IRF8 gene to be associated at GWAS level with lcSSc (P = 2.32×10(-12, OR = 0.75. Also, rs12540874 in GRB10 gene (P = 1.27 × 10(-6, OR = 1.15 and rs11047102 in SOX5 gene (P = 1.39×10(-7, OR = 1.36 showed a suggestive association with lcSSc and ACA subgroups respectively. In the HLA region, we observed highly associated allelic combinations in the HLA-DQB1 locus with ACA (P = 1.79×10(-61, OR = 2.48, in the HLA-DPA1/B1 loci with ATA (P = 4.57×10(-76, OR = 8.84, and in NOTCH4 with ACA P = 8.84×10(-21, OR = 0.55 and ATA (P = 1.14×10(-8, OR = 0.54. We have identified three new non-HLA genes (IRF8, GRB10, and SOX5 associated with SSc clinical and auto-antibody subgroups. Within the HLA region, HLA-DQB1, HLA-DPA1/B1, and NOTCH4 associations with SSc are likely confined to specific auto-antibodies. These data emphasize the differential genetic components of subphenotypes of SSc.

  17. Genome-wide SNP scan of pooled DNA reveals nonsense mutation in FGF20 in the scaleless line of featherless chickens

    Directory of Open Access Journals (Sweden)

    Wells Kirsty L

    2012-06-01

    Full Text Available Abstract Background Scaleless (sc/sc chickens carry a single recessive mutation that causes a lack of almost all body feathers, as well as foot scales and spurs, due to a failure of skin patterning during embryogenesis. This spontaneous mutant line, first described in the 1950s, has been used extensively to explore the tissue interactions involved in ectodermal appendage formation in embryonic skin. Moreover, the trait is potentially useful in tropical agriculture due to the ability of featherless chickens to tolerate heat, which is at present a major constraint to efficient poultry meat production in hot climates. In the interests of enhancing our understanding of feather placode development, and to provide the poultry industry with a strategy to breed heat-tolerant meat-type chickens (broilers, we mapped and identified the sc mutation. Results Through a cost-effective and labour-efficient SNP array mapping approach using DNA from sc/sc and sc/+ blood sample pools, we map the sc trait to chromosome 4 and show that a nonsense mutation in FGF20 is completely associated with the sc/sc phenotype. This mutation, common to all sc/sc individuals and absent from wild type, is predicted to lead to loss of a highly conserved region of the FGF20 protein important for FGF signalling. In situ hybridisation and quantitative RT-PCR studies reveal that FGF20 is epidermally expressed during the early stages of feather placode patterning. In addition, we describe a dCAPS genotyping assay based on the mutation, developed to facilitate discrimination between wild type and sc alleles. Conclusions This work represents the first loss of function genetic evidence supporting a role for FGF ligand signalling in feather development, and suggests FGF20 as a novel central player in the development of vertebrate skin appendages, including hair follicles and exocrine glands. In addition, this is to our knowledge the first report describing the use of the chicken SNP array to

  18. Genome-wide development and use of microsatellite markers for large-scale genotyping applications in foxtail millet [Setaria italica (L.)].

    Science.gov (United States)

    Pandey, Garima; Misra, Gopal; Kumari, Kajal; Gupta, Sarika; Parida, Swarup Kumar; Chattopadhyay, Debasis; Prasad, Manoj

    2013-04-01

    The availability of well-validated informative co-dominant microsatellite markers and saturated genetic linkage map has been limited in foxtail millet (Setaria italica L.). In view of this, we conducted a genome-wide analysis and identified 28 342 microsatellite repeat-motifs spanning 405.3 Mb of foxtail millet genome. The trinucleotide repeats (∼48%) was prevalent when compared with dinucleotide repeats (∼46%). Of the 28 342 microsatellites, 21 294 (∼75%) primer pairs were successfully designed, and a total of 15 573 markers were physically mapped on 9 chromosomes of foxtail millet. About 159 markers were validated successfully in 8 accessions of Setaria sp. with ∼67% polymorphic potential. The high percentage (89.3%) of cross-genera transferability across millet and non-millet species with higher transferability percentage in bioenergy grasses (∼79%, Switchgrass and ∼93%, Pearl millet) signifies their importance in studying the bioenergy grasses. In silico comparative mapping of 15 573 foxtail millet microsatellite markers against the mapping data of sorghum (16.9%), maize (14.5%) and rice (6.4%) indicated syntenic relationships among the chromosomes of foxtail millet and target species. The results, thus, demonstrate the immense applicability of developed microsatellite markers in germplasm characterization, phylogenetics, construction of genetic linkage map for gene/quantitative trait loci discovery, comparative mapping in foxtail millet, including other millets and bioenergy grass species.

  19. Genome-Wide Analysis of Microsatellite Markers Based on Sequenced Database in Chinese Spring Wheat (Triticum aestivum L..

    Directory of Open Access Journals (Sweden)

    Bin Han

    Full Text Available Microsatellites or simple sequence repeats (SSRs are distributed across both prokaryotic and eukaryotic genomes and have been widely used for genetic studies and molecular marker-assisted breeding in crops. Though an ordered draft sequence of hexaploid bread wheat have been announced, the researches about systemic analysis of SSRs for wheat still have not been reported so far. In the present study, we identified 364,347 SSRs from among 10,603,760 sequences of the Chinese spring wheat (CSW genome, which were present at a density of 36.68 SSR/Mb. In total, we detected 488 types of motifs ranging from di- to hexanucleotides, among which dinucleotide repeats dominated, accounting for approximately 42.52% of the genome. The density of tri- to hexanucleotide repeats was 24.97%, 4.62%, 3.25% and 24.65%, respectively. AG/CT, AAG/CTT, AGAT/ATCT, AAAAG/CTTTT and AAAATT/AATTTT were the most frequent repeats among di- to hexanucleotide repeats. Among the 21 chromosomes of CSW, the density of repeats was highest on chromosome 2D and lowest on chromosome 3A. The proportions of di-, tri-, tetra-, penta- and hexanucleotide repeats on each chromosome, and even on the whole genome, were almost identical. In addition, 295,267 SSR markers were successfully developed from the 21 chromosomes of CSW, which cover the entire genome at a density of 29.73 per Mb. All of the SSR markers were validated by reverse electronic-Polymerase Chain Reaction (re-PCR; 70,564 (23.9% were found to be monomorphic and 224,703 (76.1% were found to be polymorphic. A total of 45 monomorphic markers were selected randomly for validation purposes; 24 (53.3% amplified one locus, 8 (17.8% amplified multiple identical loci, and 13 (28.9% did not amplify any fragments from the genomic DNA of CSW. Then a dendrogram was generated based on the 24 monomorphic SSR markers among 20 wheat cultivars and three species of its diploid ancestors showing that monomorphic SSR markers represented a promising

  20. Development and Integration of Genome-Wide Polymorphic Microsatellite Markers onto a Reference Linkage Map for Constructing a High-Density Genetic Map of Chickpea.

    Directory of Open Access Journals (Sweden)

    Yash Paul Khajuria

    Full Text Available The identification of informative in silico polymorphic genomic and genic microsatellite markers by comparing the genome and transcriptome sequences of crop genotypes is a rapid, cost-effective and non-laborious approach for large-scale marker validation and genotyping applications, including construction of high-density genetic maps. We designed 1494 markers, including 1016 genomic and 478 transcript-derived microsatellite markers showing in-silico fragment length polymorphism between two parental genotypes (Cicer arietinum ICC4958 and C. reticulatum PI489777 of an inter-specific reference mapping population. High amplification efficiency (87%, experimental validation success rate (81% and polymorphic potential (55% of these microsatellite markers suggest their effective use in various applications of chickpea genetics and breeding. Intra-specific polymorphic potential (48% detected by microsatellite markers in 22 desi and kabuli chickpea genotypes was lower than inter-specific polymorphic potential (59%. An advanced, high-density, integrated and inter-specific chickpea genetic map (ICC4958 x PI489777 having 1697 map positions spanning 1061.16 cM with an average inter-marker distance of 0.625 cM was constructed by assigning 634 novel informative transcript-derived and genomic microsatellite markers on eight linkage groups (LGs of our prior documented, 1063 marker-based genetic map. The constructed genome map identified 88, including four major (7-23 cM longest high-resolution genomic regions on LGs 3, 5 and 8, where the maximum number of novel genomic and genic microsatellite markers were specifically clustered within 1 cM genetic distance. It was for the first time in chickpea that in silico FLP analysis at genome-wide level was carried out and such a large number of microsatellite markers were identified, experimentally validated and further used in genetic mapping. To best of our knowledge, in the presently constructed genetic map, we mapped

  1. Genome-wide conserved non-coding microsatellite (CNMS) marker-based integrative genetical genomics for quantitative dissection of seed weight in chickpea.

    Science.gov (United States)

    Bajaj, Deepak; Saxena, Maneesha S; Kujur, Alice; Das, Shouvik; Badoni, Saurabh; Tripathi, Shailesh; Upadhyaya, Hari D; Gowda, C L L; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K; Parida, Swarup K

    2015-03-01

    Phylogenetic footprinting identified 666 genome-wide paralogous and orthologous CNMS (conserved non-coding microsatellite) markers from 5'-untranslated and regulatory regions (URRs) of 603 protein-coding chickpea genes. The (CT)n and (GA)n CNMS carrying CTRMCAMV35S and GAGA8BKN3 regulatory elements, respectively, are abundant in the chickpea genome. The mapped genic CNMS markers with robust amplification efficiencies (94.7%) detected higher intraspecific polymorphic potential (37.6%) among genotypes, implying their immense utility in chickpea breeding and genetic analyses. Seventeen differentially expressed CNMS marker-associated genes showing strong preferential and seed tissue/developmental stage-specific expression in contrasting genotypes were selected to narrow down the gene targets underlying seed weight quantitative trait loci (QTLs)/eQTLs (expression QTLs) through integrative genetical genomics. The integration of transcript profiling with seed weight QTL/eQTL mapping, molecular haplotyping, and association analyses identified potential molecular tags (GAGA8BKN3 and RAV1AAT regulatory elements and alleles/haplotypes) in the LOB-domain-containing protein- and KANADI protein-encoding transcription factor genes controlling the cis-regulated expression for seed weight in the chickpea. This emphasizes the potential of CNMS marker-based integrative genetical genomics for the quantitative genetic dissection of complex seed weight in chickpea. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  2. Whole-genome single-nucleotide polymorphism (SNP marker discovery and association analysis with the eicosapentaenoic acid (EPA and docosahexaenoic acid (DHA content in Larimichthys crocea

    Directory of Open Access Journals (Sweden)

    Shijun Xiao

    2016-12-01

    Full Text Available Whole-genome single-nucleotide polymorphism (SNP markers are valuable genetic resources for the association and conservation studies. Genome-wide SNP development in many teleost species are still challenging because of the genome complexity and the cost of re-sequencing. Genotyping-By-Sequencing (GBS provided an efficient reduced representative method to squeeze cost for SNP detection; however, most of recent GBS applications were reported on plant organisms. In this work, we used an EcoRI-NlaIII based GBS protocol to teleost large yellow croaker, an important commercial fish in China and East-Asia, and reported the first whole-genome SNP development for the species. 69,845 high quality SNP markers that evenly distributed along genome were detected in at least 80% of 500 individuals. Nearly 95% randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. The association studies with the muscle eicosapentaenoic acid (EPA and docosahexaenoic acid (DHA content discovered 39 significant SNP markers, contributing as high up to ∼63% genetic variance that explained by all markers. Functional genes that involved in fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, PPT2 Gene, previously identified in the association study of the plasma n-3 and n-6 polyunsaturated fatty acid level in human, was re-discovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS could produce quality SNP markers in a cost-efficient manner in teleost genome. The developed SNP markers and the EPA and DHA associated SNP loci provided invaluable resources for the population structure, conservation genetics and genomic selection of large yellow croaker and other fish organisms.

  3. A Genome Wide Comparison to Identify Markers to Differentiate the Sex of Larval Stages of Schistosoma haematobium, Schistosoma bovis and their Respective Hybrids.

    Directory of Open Access Journals (Sweden)

    Julien Kincaid-Smith

    2016-11-01

    Full Text Available For scientists working on gonochoric organisms, determining sex can be crucial for many biological questions and experimental studies, such as crossbreeding, but it can also be a challenging task, particularly when no sexual dimorphism is visible or cannot be directly observed. In metazoan parasites of the genus Schistosoma responsible for schistosomiasis, sex is genetically determined in the zygote with a female heterogametic ZW/ZZ system. Adult flukes have a pronounced sexual dimorphism, whereas the sexes of the larval stages are morphologically indistinguishable but can be distinguished uniquely by using molecular methods. Therefore, reliable methods are needed to identify the sex of larvae individuals. Here, we present an endpoint PCR-based assay using female-specific sequences identified using a genome-wide comparative analysis between males and females. This work allowed us to identify sex-markers for Schistosoma haematobium and Schistosoma bovis but also the hybrid between both species that has recently emerged in Corsica (France. Five molecular sex-markers were identified and are female-specific in S. haematobium and the hybrid parasite, whereas three of them are also female-specific in S. bovis. These molecular markers will be useful to conduct studies, such as experimental crosses on these disease-causing blood flukes, which are still largely neglected but no longer restricted to tropical areas.

  4. Genome-wide SNP identification by high-throughput sequencing and selective mapping allows sequence assembly positioning using a framework genetic linkage map

    Directory of Open Access Journals (Sweden)

    Xu Xiangming

    2010-12-01

    Full Text Available Abstract Background Determining the position and order of contigs and scaffolds from a genome assembly within an organism's genome remains a technical challenge in a majority of sequencing projects. In order to exploit contemporary technologies for DNA sequencing, we developed a strategy for whole genome single nucleotide polymorphism sequencing allowing the positioning of sequence contigs onto a linkage map using the bin mapping method. Results The strategy was tested on a draft genome of the fungal pathogen Venturia inaequalis, the causal agent of apple scab, and further validated using sequence contigs derived from the diploid plant genome Fragaria vesca. Using our novel method we were able to anchor 70% and 92% of sequences assemblies for V. inaequalis and F. vesca, respectively, to genetic linkage maps. Conclusions We demonstrated the utility of this approach by accurately determining the bin map positions of the majority of the large sequence contigs from each genome sequence and validated our method by mapping single sequence repeat markers derived from sequence contigs on a full mapping population.

  5. Genome-wide survey and analysis of microsatellites in giant panda (Ailuropoda melanoleuca), with a focus on the applications of a novel microsatellite marker system.

    Science.gov (United States)

    Huang, Jie; Li, Yu-Zhi; Du, Lian-Ming; Yang, Bo; Shen, Fu-Jun; Zhang, He-Min; Zhang, Zhi-He; Zhang, Xiu-Yue; Yue, Bi-Song

    2015-02-07

    The giant panda (Ailuropoda melanoleuca) is a critically endangered species endemic to China. Microsatellites have been preferred as the most popular molecular markers and proven effective in estimating population size, paternity test, genetic diversity for the critically endangered species. The availability of the giant panda complete genome sequences provided the opportunity to carry out genome-wide scans for all types of microsatellites markers, which now opens the way for the analysis and development of microsatellites in giant panda. By screening the whole genome sequence of giant panda in silico mining, we identified microsatellites in the genome of giant panda and analyzed their frequency and distribution in different genomic regions. Based on our search criteria, a repertoire of 855,058 SSRs was detected, with mono-nucleotides being the most abundant. SSRs were found in all genomic regions and were more abundant in non-coding regions than coding regions. A total of 160 primer pairs were designed to screen for polymorphic microsatellites using the selected tetranucleotide microsatellite sequences. The 51 novel polymorphic tetranucleotide microsatellite loci were discovered based on genotyping blood DNA from 22 captive giant pandas in this study. Finally, a total of 15 markers, which showed good polymorphism, stability, and repetition in faecal samples, were used to establish the novel microsatellite marker system for giant panda. Meanwhile, a genotyping database for Chengdu captive giant pandas (n = 57) were set up using this standardized system. What's more, a universal individual identification method was established and the genetic diversity were analysed in this study as the applications of this marker system. The microsatellite abundance and diversity were characterized in giant panda genomes. A total of 154,677 tetranucleotide microsatellites were identified and 15 of them were discovered as the polymorphic and stable loci. The individual

  6. Development of Highly Informative Genome-Wide Single Sequence Repeat Markers for Breeding Applications in Sesame and Construction of a Web Resource: SisatBase

    Directory of Open Access Journals (Sweden)

    Komivi Dossa

    2017-08-01

    Full Text Available The sequencing of the full nuclear genome of sesame (Sesamum indicum L. provides the platform for functional analyses of genome components and their application in breeding programs. Although the importance of microsatellites markers or simple sequence repeats (SSR in crop genotyping, genetics, and breeding applications is well established, only a little information exist concerning SSRs at the whole genome level in sesame. In addition, SSRs represent a suitable marker type for sesame molecular breeding in developing countries where it is mainly grown. In this study, we identified 138,194 genome-wide SSRs of which 76.5% were physically mapped onto the 13 pseudo-chromosomes. Among these SSRs, up to three primers pairs were supplied for 101,930 SSRs and used to in silico amplify the reference genome together with two newly sequenced sesame accessions. A total of 79,957 SSRs (78% were polymorphic between the three genomes thereby suggesting their promising use in different genomics-assisted breeding applications. From these polymorphic SSRs, 23 were selected and validated to have high polymorphic potential in 48 sesame accessions from different growing areas of Africa. Furthermore, we have developed an online user-friendly database, SisatBase (http://www.sesame-bioinfo.org/SisatBase/, which provides free access to SSRs data as well as an integrated platform for functional analyses. Altogether, the reference SSR and SisatBase would serve as useful resources for genetic assessment, genomic studies, and breeding advancement in sesame, especially in developing countries.

  7. High density, genome-wide markers and intra-specific replication yield an unprecedented phylogenetic reconstruction of a globally significant, speciose lineage of Eucalyptus.

    Science.gov (United States)

    Jones, Rebecca C; Nicolle, Dean; Steane, Dorothy A; Vaillancourt, René E; Potts, Brad M

    2016-12-01

    We used genome-wide markers and an unprecedented scale of sampling to construct a phylogeny for a globally significant Eucalyptus lineage that has been impacted by hybridisation, recent radiation and morphological convergence. Our approach, using 3109 DArT markers distributed throughout the genome and 540 samples covering 185 terminal taxa in sections Maidenaria, Exsertaria, Latoangulatae and related smaller sections, with multiple geographically widespread samples per terminal taxon, produced a phylogeny that largely matched the morphological treatment of sections, though sections Exsertaria and Latoangulatae were polyphyletic. At lower levels there were numerous inconsistencies between the morphological treatment and the molecular phylogeny, and taxa within the three main sections were generally not monophyletic at the series (at least 62% polyphyly) or species (at least 52% polyphyly) level. Some of the discrepancies appear to be the result of morphological convergence or misclassifications, and we propose some taxonomic reassessments to address this. However, many inconsistencies appear to be the products of incomplete speciation and/or hybridisation. Our analysis represents a significant advance on previous phylogenies of these important eucalypt sections (which have mainly used single samples to represent each species), thus providing a robust phylogenetic framework for evolutionary and ecological studies. Copyright © 2016 Elsevier Inc. All rights reserved.

  8. Joint genome-wide prediction in several populations accounting for randomness of genotypes: A hierarchical Bayes approach. I: Multivariate Gaussian priors for marker effects and derivation of the joint probability mass function of genotypes.

    Science.gov (United States)

    Martínez, Carlos Alberto; Khare, Kshitij; Banerjee, Arunava; Elzo, Mauricio A

    2017-03-21

    It is important to consider heterogeneity of marker effects and allelic frequencies in across population genome-wide prediction studies. Moreover, all regression models used in genome-wide prediction overlook randomness of genotypes. In this study, a family of hierarchical Bayesian models to perform across population genome-wide prediction modeling genotypes as random variables and allowing population-specific effects for each marker was developed. Models shared a common structure and differed in the priors used and the assumption about residual variances (homogeneous or heterogeneous). Randomness of genotypes was accounted for by deriving the joint probability mass function of marker genotypes conditional on allelic frequencies and pedigree information. As a consequence, these models incorporated kinship and genotypic information that not only permitted to account for heterogeneity of allelic frequencies, but also to include individuals with missing genotypes at some or all loci without the need for previous imputation. This was possible because the non-observed fraction of the design matrix was treated as an unknown model parameter. For each model, a simpler version ignoring population structure, but still accounting for randomness of genotypes was proposed. Implementation of these models and computation of some criteria for model comparison were illustrated using two simulated datasets. Theoretical and computational issues along with possible applications, extensions and refinements were discussed. Some features of the models developed in this study make them promising for genome-wide prediction, the use of information contained in the probability distribution of genotypes is perhaps the most appealing. Further studies to assess the performance of the models proposed here and also to compare them with conventional models used in genome-wide prediction are needed. Copyright © 2017 Elsevier Ltd. All rights reserved.

  9. SNP markers retrieval for a non-model species: a practical approach

    Directory of Open Access Journals (Sweden)

    Shahin Arwa

    2012-01-01

    Full Text Available Abstract Background SNP (Single Nucleotide Polymorphism markers are rapidly becoming the markers of choice for applications in breeding because of next generation sequencing technology developments. For SNP development by NGS technologies, correct assembly of the huge amounts of sequence data generated is essential. Little is known about assembler's performance, especially when dealing with highly heterogeneous species that show a high genome complexity and what the possible consequences are of differences in assemblies on SNP retrieval. This study tested two assemblers (CAP3 and CLC on 454 data from four lily genotypes and compared results with respect to SNP retrieval. Results CAP3 assembly resulted in higher numbers of contigs, lower numbers of reads per contig, and shorter average read lengths compared to CLC. Blast comparisons showed that CAP3 contigs were highly redundant. Contrastingly, CLC in rare cases combined paralogs in one contig. Redundant and chimeric contigs may lead to erroneous SNPs. Filtering for redundancy can be done by blasting selected SNP markers to the contigs and discarding all the SNP markers that show more than one blast hit. Results on chimeric contigs showed that only four out of 2,421 SNP markers were selected from chimeric contigs. Conclusion In practice, CLC performs better in assembling highly heterogeneous genome sequences compared to CAP3, and consequently SNP retrieval is more efficient. Additionally a simple flow scheme is suggested for SNP marker retrieval that can be valid for all non-model species.

  10. Using the Pareto principle in genome-wide breeding value estimation.

    Science.gov (United States)

    Yu, Xijiang; Meuwissen, Theo H E

    2011-11-01

    Genome-wide breeding value (GWEBV) estimation methods can be classified based on the prior distribution assumptions of marker effects. Genome-wide BLUP methods assume a normal prior distribution for all markers with a constant variance, and are computationally fast. In Bayesian methods, more flexible prior distributions of SNP effects are applied that allow for very large SNP effects although most are small or even zero, but these prior distributions are often also computationally demanding as they rely on Monte Carlo Markov chain sampling. In this study, we adopted the Pareto principle to weight available marker loci, i.e., we consider that x% of the loci explain (100 - x)% of the total genetic variance. Assuming this principle, it is also possible to define the variances of the prior distribution of the 'big' and 'small' SNP. The relatively few large SNP explain a large proportion of the genetic variance and the majority of the SNP show small effects and explain a minor proportion of the genetic variance. We name this method MixP, where the prior distribution is a mixture of two normal distributions, i.e. one with a big variance and one with a small variance. Simulation results, using a real Norwegian Red cattle pedigree, show that MixP is at least as accurate as the other methods in all studied cases. This method also reduces the hyper-parameters of the prior distribution from 2 (proportion and variance of SNP with big effects) to 1 (proportion of SNP with big effects), assuming the overall genetic variance is known. The mixture of normal distribution prior made it possible to solve the equations iteratively, which greatly reduced computation loads by two orders of magnitude. In the era of marker density reaching million(s) and whole-genome sequence data, MixP provides a computationally feasible Bayesian method of analysis.

  11. Development and application of a 20K SNP array in potato

    NARCIS (Netherlands)

    Vos, Peter

    2016-01-01

    In this thesis the results are described of investigations of various application of genome wide SNP (single nucleotide polymorphism) markers. The set of SNP markers was identified by GBS (genotyping by sequencing) strategy. The resulting dataset of 129,156 SNPs across 83 tetraploid varieties was

  12. Genome-wide scan for visceral leishmaniasis in mixed-breed dogs identifies candidate genes involved in T helper cells and macrophage signaling

    Science.gov (United States)

    We conducted a genome-wide scan for visceral leishmaniasis in mixed-breed dogs from a highly endemic area in Brazil using 149,648 single nucleotide polymorphism (SNP) markers genotyped in 20 cases and 28 controls. Using a mixed model approach, we found two candidate loci on canine autosomes 1 and 2....

  13. A set of 14 DIP-SNP markers to detect unbalanced DNA mixtures.

    Science.gov (United States)

    Liu, Zhizhen; Liu, Jinding; Wang, Jiaqi; Chen, Deqing; Liu, Zidong; Shi, Jie; Li, Zeqin; Li, Wenyan; Zhang, Gengqian; Du, Bing

    2018-03-04

    Unbalanced DNA mixture is still a difficult problem for forensic practice. DIP-STRs are useful markers for detection of minor DNA but they are not widespread in the human genome and having long amplicons. In this study, we proposed a novel type of genetic marker, termed DIP-SNP. DIP-SNP refers to the combination of INDEL and SNP in less than 300bp length of human genome. The multiplex PCR and SNaPshot assay were established for 14 DIP-SNP markers in a Chinese Han population from Shanxi, China. This novel compound marker allows detection of the minor DNA contributor with sensitivity from 1:50 to 1:1000 in a DNA mixture of any gender with 1 ng-10 ng DNA template. Most of the DIP-SNP markers had a relatively high probability of informative alleles with an average I value of 0.33. In all, we proposed DIP-SNP as a novel kind of genetic marker for detection of minor contributor from unbalanced DNA mixture and established the detection method by associating the multiplex PCR and SNaPshot assay. DIP-SNP polymorphisms are promising markers for forensic or clinical mixture examination because they are shorter, widespread and higher sensitive. Copyright © 2018 Elsevier Inc. All rights reserved.

  14. Genome-Wide Linkage and Association Analysis Identifies Major Gene Loci for Guttural Pouch Tympany in Arabian and German Warmblood Horses

    Science.gov (United States)

    Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar

    2012-01-01

    Equine guttural pouch tympany (GPT) is a hereditary condition affecting foals in their first months of life. Complex segregation analyses in Arabian and German warmblood horses showed the involvement of a major gene as very likely. Genome-wide linkage and association analyses including a high density marker set of single nucleotide polymorphisms (SNPs) were performed to map the genomic region harbouring the potential major gene for GPT. A total of 85 Arabian and 373 German warmblood horses were genotyped on the Illumina equine SNP50 beadchip. Non-parametric multipoint linkage analyses showed genome-wide significance on horse chromosomes (ECA) 3 for German warmblood at 16–26 Mb and 34–55 Mb and for Arabian on ECA15 at 64–65 Mb. Genome-wide association analyses confirmed the linked regions for both breeds. In Arabian, genome-wide association was detected at 64 Mb within the region with the highest linkage peak on ECA15. For German warmblood, signals for genome-wide association were close to the peak region of linkage at 52 Mb on ECA3. The odds ratio for the SNP with the highest genome-wide association was 0.12 for the Arabian. In conclusion, the refinement of the regions with the Illumina equine SNP50 beadchip is an important step to unravel the responsible mutations for GPT. PMID:22848553

  15. A Genome-Wide Breast Cancer Scan in African Americans

    Science.gov (United States)

    2010-06-01

    SNPs from the African American breast cancer scan to COGs , a European collaborative study which is has designed a SNP array with that will be genotyped...Award Number: W81XWH-08-1-0383 TITLE: A Genome-wide Breast Cancer Scan in African Americans PRINCIPAL INVESTIGATOR: Christopher A...SUBTITLE A Genome-wide Breast Cancer Scan in African Americans 5a. CONTRACT NUMBER 5b. GRANT NUMBER W81XWH-08-1-0383 5c. PROGRAM

  16. GLIDERS - A web-based search engine for genome-wide linkage disequilibrium between HapMap SNPs

    Directory of Open Access Journals (Sweden)

    Broxholme John

    2009-10-01

    Full Text Available Abstract Background A number of tools for the examination of linkage disequilibrium (LD patterns between nearby alleles exist, but none are available for quickly and easily investigating LD at longer ranges (>500 kb. We have developed a web-based query tool (GLIDERS: Genome-wide LInkage DisEquilibrium Repository and Search engine that enables the retrieval of pairwise associations with r2 ≥ 0.3 across the human genome for any SNP genotyped within HapMap phase 2 and 3, regardless of distance between the markers. Description GLIDERS is an easy to use web tool that only requires the user to enter rs numbers of SNPs they want to retrieve genome-wide LD for (both nearby and long-range. The intuitive web interface handles both manual entry of SNP IDs as well as allowing users to upload files of SNP IDs. The user can limit the resulting inter SNP associations with easy to use menu options. These include MAF limit (5-45%, distance limits between SNPs (minimum and maximum, r2 (0.3 to 1, HapMap population sample (CEU, YRI and JPT+CHB combined and HapMap build/release. All resulting genome-wide inter-SNP associations are displayed on a single output page, which has a link to a downloadable tab delimited text file. Conclusion GLIDERS is a quick and easy way to retrieve genome-wide inter-SNP associations and to explore LD patterns for any number of SNPs of interest. GLIDERS can be useful in identifying SNPs with long-range LD. This can highlight mis-mapping or other potential association signal localisation problems.

  17. Strong trans-Pacific break and local conservation units in the Galapagos shark (Carcharhinus galapagensis) revealed by genome-wide cytonuclear markers.

    Science.gov (United States)

    Pazmiño, Diana A; Maes, Gregory E; Green, Madeline E; Simpfendorfer, Colin A; Hoyos-Padilla, E Mauricio; Duffy, Clinton J A; Meyer, Carl G; Kerwath, Sven E; Salinas-de-León, Pelayo; van Herwerden, Lynne

    2018-05-01

    The application of genome-wide cytonuclear molecular data to identify management and adaptive units at various spatio-temporal levels is particularly important for overharvested large predatory organisms, often characterized by smaller, localized populations. Despite being "near threatened", current understanding of habitat use and population structure of Carcharhinus galapagensis is limited to specific areas within its distribution. We evaluated population structure and connectivity across the Pacific Ocean using genome-wide single-nucleotide polymorphisms (~7200 SNPs) and mitochondrial control region sequences (945 bp) for 229 individuals. Neutral SNPs defined at least two genetically discrete geographic groups: an East Tropical Pacific (Mexico, east and west Galapagos Islands), and another central-west Pacific (Lord Howe Island, Middleton Reef, Norfolk Island, Elizabeth Reef, Kermadec, Hawaii and Southern Africa). More fine-grade population structure was suggested using outlier SNPs: west Pacific, Hawaii, Mexico, and Galapagos. Consistently, mtDNA pairwise Φ ST defined three regional stocks: east, central and west Pacific. Compared to neutral SNPs (F ST  = 0.023-0.035), mtDNA exhibited more divergence (Φ ST  = 0.258-0.539) and high overall genetic diversity (h = 0.794 ± 0.014; π = 0.004 ± 0.000), consistent with the longstanding eastern Pacific barrier between the east and central-west Pacific. Hawaiian and Southern African populations group within the west Pacific cluster. Effective population sizes were moderate/high for east/west populations (738 and 3421, respectively). Insights into the biology, connectivity, genetic diversity, and population demographics informs for improved conservation of this species, by delineating three to four conservation units across their Pacific distribution. Implementing such conservation management may be challenging, but is necessary to achieve long-term population resilience at basin and

  18. Tri-allelic SNP markers enable analysis of mixed and degraded DNA samples.

    Science.gov (United States)

    Westen, Antoinette A; Matai, Anuska S; Laros, Jeroen F J; Meiland, Hugo C; Jasper, Mandy; de Leeuw, Wiljo J F; de Knijff, Peter; Sijen, Titia

    2009-09-01

    For the analysis of degraded DNA in disaster victim identification (DVI) and criminal investigations, single nucleotide polymorphisms (SNPs) have been recognized as promising markers mainly because they can be analyzed in short sized amplicons. Most SNPs are bi-allelic and are thereby ineffective to detect mixtures, which may lead to incorrect genotyping. We developed an algorithm to find non-binary (i.e. tri-allelic or tetra-allelic) SNPs in the NCBI dbSNP database. We selected 31 potential tri-allelic SNPs with a minor allele frequency of at least 10%. The tri-allelic nature was confirmed for 15 SNPs residing on 14 different chromosomes. Multiplex SNaPshot assays were developed, and the allele frequencies of 16 SNPs were determined among 153 Dutch and 111 Netherlands Antilles reference samples. Using these multiplex SNP assays, the presence of a mixture of two DNA samples in a ratio up to 1:8 could be recognized reliably. Furthermore, we compared the genotyping efficiency of the tri-allelic SNP markers and short tandem repeat (STR) markers by analyzing artificially degraded DNA and DNA from 30 approximately 500-year-old bone and molar samples. In both types of degraded DNA samples, the larger sized STR amplicons failed to amplify whereas the tri-allelic SNP markers still provided valuable information. In conclusion, tri-allelic SNP markers are suited for the analysis of degraded DNA and enable the detection of a second DNA source in a sample.

  19. A genome-wide RNAi screen identifies novel targets of neratinib resistance leading to identification of potential drug resistant genetic markers.

    Science.gov (United States)

    Seyhan, Attila A; Varadarajan, Usha; Choe, Sung; Liu, Wei; Ryan, Terence E

    2012-04-01

    Neratinib (HKI-272) is a small molecule tyrosine kinase inhibitor of the ErbB receptor family currently in Phase III clinical trials. Despite its efficacy, the mechanism of potential cellular resistance to neratinib and genes involved with it remains unknown. We have used a pool-based lentiviral genome-wide functional RNAi screen combined with a lethal dose of neratinib to discover chemoresistant interactions with neratinib. Our screen has identified a collection of genes whose inhibition by RNAi led to neratinib resistance including genes involved in oncogenesis (e.g. RAB33A, RAB6A and BCL2L14), transcription factors (e.g. FOXP4, TFEC, ZNF), cellular ion transport (e.g. CLIC3, TRAPPC2P1, P2RX2), protein ubiquitination (e.g. UBL5), cell cycle (e.g. CCNF), and genes known to interact with breast cancer-associated genes (e.g. CCNF, FOXP4, TFEC, several ZNF factors, GNA13, IGFBP1, PMEPA1, SOX5, RAB33A, RAB6A, FXR1, DDO, TFEC, OLFM2). The identification of novel mediators of cellular resistance to neratinib could lead to the identification of new or neoadjuvant drug targets. Their use as patient or treatment selection biomarkers could make the application of anti-ErbB therapeutics more clinically effective.

  20. MDM2 SNP309 and SNP285 Act as Negative Prognostic Markers for Non-small Cell Lung Cancer Adenocarcinoma Patients

    Science.gov (United States)

    Deben, Christophe; Op de Beeck, Ken; Van den Bossche, Jolien; Jacobs, Julie; Lardon, Filip; Wouters, An; Peeters, Marc; Van Camp, Guy; Rolfo, Christian; Deschoolmeester, Vanessa; Pauwels, Patrick

    2017-01-01

    Objectives: Two functional polymorphisms in the MDM2 promoter region, SNP309T>G and SNP285G>C, have been shown to impact MDM2 expression and cancer risk. Currently available data on the prognostic value of MDM2 SNP309 in non-small cell lung cancer (NSCLC) is contradictory and unavailable for SNP285. The goal of this study was to clarify the role of these MDM2 SNPs in the outcome of NSCLC patients. Materials and Methods: In this study we genotyped SNP309 and SNP285 in 98 NSCLC adenocarcinoma patients and determined MDM2 mRNA and protein levels. In addition, we assessed the prognostic value of these common SNPs on overall and progression free survival, taking into account the TP53 status of the tumor. Results and Conclusion: We found that the SNP285C allele, but not the SNP309G allele, was significantly associated with increased MDM2 mRNA expression levels (p = 0.025). However, we did not observe an association with MDM2 protein levels for SNP285. The SNP309G allele was significantly associated with the presence of wild type TP53 (p = 0.047) and showed a strong trend towards increased MDM2 protein levels (p = 0.068). In addition, patients harboring the SNP309G allele showed a worse overall survival, but only in the presence of wild type TP53. The SNP285C allele was significantly associated with an early age of diagnosis and metastasis. Additionally, the SNP285C allele acted as an independent predictor for worse progression free survival (HR = 3.97; 95% CI = 1.51 - 10.42; p = 0.005). Our data showed that both SNP309 (in the presence of wild type TP53) and SNP285 act as negative prognostic markers for NSCLC patients, implicating a prominent role for these variants in the outcome of these patients. PMID:28819417

  1. Genome-Wide Development of MicroRNA-Based SSR Markers in Medicago truncatula with Their Transferability Analysis and Utilization in Related Legume Species.

    Science.gov (United States)

    Min, Xueyang; Zhang, Zhengshe; Liu, Yisong; Wei, Xingyi; Liu, Zhipeng; Wang, Yanrong; Liu, Wenxian

    2017-11-18

    Microsatellite (simple sequence repeats, SSRs) marker is one of the most widely used markers in marker-assisted breeding. As one type of functional markers, MicroRNA-based SSR (miRNA-SSR) markers have been exploited mainly in animals, but the development and characterization of miRNA-SSR markers in plants are still limited. In the present study, miRNA-SSR markers for Medicago truncatula ( M. truncatula ) were developed and their cross-species transferability in six leguminous species was evaluated. A total of 169 primer pairs were successfully designed from 130 M. truncatula miRNA genes, the majority of which were mononucleotide repeats (70.41%), followed by dinucleotide repeats (14.20%), compound repeats (11.24%) and trinucleotide repeats (4.14%). Functional classification of SSR-containing miRNA genes showed that all targets could be grouped into three Gene Ontology (GO) categories: 17 in biological process, 11 in molecular function, and 14 in cellular component. The miRNA-SSR markers showed high transferability in other six leguminous species, ranged from 74.56% to 90.53%. Furthermore, 25 Mt - miRNA-SSR markers were used to evaluate polymorphisms in 20 alfalfa accessions, and the polymorphism information content (PIC) values ranged from 0.39 to 0.89 with an average of 0.71, the allele number per marker varied from 3 to 18 with an average of 7.88, indicating a high level of informativeness. The present study is the first time developed and characterized of M. truncatula miRNA-SSRs and demonstrated their utility in transferability, these novel markers will be valuable for genetic diversity analysis, marker-assisted selection and genotyping in leguminous species.

  2. Genome-Wide Association Study for Susceptibility to and Recoverability From Mastitis in Danish Holstein Cows

    Directory of Open Access Journals (Sweden)

    B. G. Welderufael

    2018-04-01

    Full Text Available Because mastitis is very frequent and unavoidable, adding recovery information into the analysis for genetic evaluation of mastitis is of great interest from economical and animal welfare point of view. Here we have performed genome-wide association studies (GWAS to identify associated single nucleotide polymorphisms (SNPs and investigate the genetic background not only for susceptibility to – but also for recoverability from mastitis. Somatic cell count records from 993 Danish Holstein cows genotyped for a total of 39378 autosomal SNP markers were used for the association analysis. Single SNP regression analysis was performed using the statistical software package DMU. Substitution effect of each SNP was tested with a t-test and a genome-wide significance level of P-value < 10-4 was used to declare significant SNP-trait association. A number of significant SNP variants were identified for both traits. Many of the SNP variants associated either with susceptibility to – or recoverability from mastitis were located in or very near to genes that have been reported for their role in the immune system. Genes involved in lymphocyte developments (e.g., MAST3 and STAB2 and genes involved in macrophage recruitment and regulation of inflammations (PDGFD and PTX3 were suggested as possible causal genes for susceptibility to – and recoverability from mastitis, respectively. However, this is the first GWAS study for recoverability from mastitis and our results need to be validated. The findings in the current study are, therefore, a starting point for further investigations in identifying causal genetic variants or chromosomal regions for both susceptibility to – and recoverability from mastitis.

  3. Genome-wide estimates of coancestry and inbreeding in a closed herd of ancient Iberian pigs.

    Directory of Open Access Journals (Sweden)

    María Saura

    Full Text Available Maintaining genetic variation and controlling the increase in inbreeding are crucial requirements in animal conservation programs. The most widely accepted strategy for achieving these objectives is to maximize the effective population size by minimizing the global coancestry obtained from a particular pedigree. However, for most natural or captive populations genealogical information is absent. In this situation, microsatellites have been traditionally the markers of choice to characterize genetic variation, and several estimators of genealogical coefficients have been developed using marker data, with unsatisfactory results. The development of high-throughput genotyping techniques states the necessity of reviewing the paradigm that genealogical coancestry is the best parameter for measuring genetic diversity. In this study, the Illumina PorcineSNP60 BeadChip was used to obtain genome-wide estimates of rates of coancestry and inbreeding and effective population size for an ancient strain of Iberian pigs that is now in serious danger of extinction and for which very accurate genealogical information is available (the Guadyerbas strain. Genome-wide estimates were compared with those obtained from microsatellite and from pedigree data. Estimates of coancestry and inbreeding computed from the SNP chip were strongly correlated with genealogical estimates and these correlations were substantially higher than those between microsatellite and genealogical coefficients. Also, molecular coancestry computed from SNP information was a better predictor of genealogical coancestry than coancestry computed from microsatellites. Rates of change in coancestry and inbreeding and effective population size estimated from molecular data were very similar to those estimated from genealogical data. However, estimates of effective population size obtained from changes in coancestry or inbreeding differed. Our results indicate that genome-wide information represents a

  4. Genome-Wide Association Study for Susceptibility to and Recoverability From Mastitis in Danish Holstein Cows

    Science.gov (United States)

    Welderufael, B. G.; Løvendahl, Peter; de Koning, Dirk-Jan; Janss, Lucas L. G.; Fikse, W. F.

    2018-01-01

    Because mastitis is very frequent and unavoidable, adding recovery information into the analysis for genetic evaluation of mastitis is of great interest from economical and animal welfare point of view. Here we have performed genome-wide association studies (GWAS) to identify associated single nucleotide polymorphisms (SNPs) and investigate the genetic background not only for susceptibility to – but also for recoverability from mastitis. Somatic cell count records from 993 Danish Holstein cows genotyped for a total of 39378 autosomal SNP markers were used for the association analysis. Single SNP regression analysis was performed using the statistical software package DMU. Substitution effect of each SNP was tested with a t-test and a genome-wide significance level of P-value mastitis were located in or very near to genes that have been reported for their role in the immune system. Genes involved in lymphocyte developments (e.g., MAST3 and STAB2) and genes involved in macrophage recruitment and regulation of inflammations (PDGFD and PTX3) were suggested as possible causal genes for susceptibility to – and recoverability from mastitis, respectively. However, this is the first GWAS study for recoverability from mastitis and our results need to be validated. The findings in the current study are, therefore, a starting point for further investigations in identifying causal genetic variants or chromosomal regions for both susceptibility to – and recoverability from mastitis. PMID:29755506

  5. Identification of Laying-Related SNP Markers in Geese Using RAD Sequencing.

    Directory of Open Access Journals (Sweden)

    ShiGang Yu

    Full Text Available Laying performance is an important economical trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV or low estimated breeding value (LEBV. A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, liner regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiplegene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the

  6. Genome-Wide Discovery of Microsatellite Markers from Diploid Progenitor Species, Arachis duranensis and A. ipaensis, and Their Application in Cultivated Peanut (A. hypogaea

    Directory of Open Access Journals (Sweden)

    Chuanzhi Zhao

    2017-07-01

    Full Text Available Despite several efforts in the last decade toward development of simple sequence repeat (SSR markers in peanut, there is still a need for more markers for conducting different genetic and breeding studies. With the effort of the International Peanut Genome Initiative, the availability of reference genome for both the diploid progenitors of cultivated peanut allowed us to identify 135,529 and 199,957 SSRs from the A (Arachis duranensis and B genomes (Arachis ipaensis, respectively. Genome sequence analysis showed uneven distribution of the SSR motifs across genomes with variation in parameters such as SSR type, repeat number, and SSR length. Using the flanking sequences of identified SSRs, primers were designed for 51,354 and 60,893 SSRs with densities of 49 and 45 SSRs per Mb in A. duranensis and A. ipaensis, respectively. In silico PCR analysis of these SSR markers showed high transferability between wild and cultivated Arachis species. Two physical maps were developed for the A genome and the B genome using these SSR markers, and two reported disease resistance quantitative trait loci (QTLs, qF2TSWV5 for tomato spotted wilt virus (TSWV and qF2LS6 for leaf spot (LS, were mapped in the 8.135 Mb region of chromosome A04 of A. duranensis. From this genomic region, 719 novel SSR markers were developed, which provide the possibility for fine mapping of these QTLs. In addition, this region also harbors 652 genes and 49 of these are defense related genes, including two NB-ARC genes, three LRR receptor-like genes and three WRKY transcription factors. These disease resistance related genes could contribute to resistance to viral (such as TSWV and fungal (such as LS diseases in peanut. In summary, this study not only provides a large number of molecular markers for potential use in peanut genetic map development and QTL mapping but also for map-based gene cloning and molecular breeding.

  7. Methodological Considerations in Estimation of Phenotype Heritability Using Genome-Wide SNP Data, Illustrated by an Analysis of the Heritability of Height in a Large Sample of African Ancestry Adults.

    Directory of Open Access Journals (Sweden)

    Fang Chen

    Full Text Available Height has an extremely polygenic pattern of inheritance. Genome-wide association studies (GWAS have revealed hundreds of common variants that are associated with human height at genome-wide levels of significance. However, only a small fraction of phenotypic variation can be explained by the aggregate of these common variants. In a large study of African-American men and women (n = 14,419, we genotyped and analyzed 966,578 autosomal SNPs across the entire genome using a linear mixed model variance components approach implemented in the program GCTA (Yang et al Nat Genet 2010, and estimated an additive heritability of 44.7% (se: 3.7% for this phenotype in a sample of evidently unrelated individuals. While this estimated value is similar to that given by Yang et al in their analyses, we remain concerned about two related issues: (1 whether in the complete absence of hidden relatedness, variance components methods have adequate power to estimate heritability when a very large number of SNPs are used in the analysis; and (2 whether estimation of heritability may be biased, in real studies, by low levels of residual hidden relatedness. We addressed the first question in a semi-analytic fashion by directly simulating the distribution of the score statistic for a test of zero heritability with and without low levels of relatedness. The second question was addressed by a very careful comparison of the behavior of estimated heritability for both observed (self-reported height and simulated phenotypes compared to imputation R2 as a function of the number of SNPs used in the analysis. These simulations help to address the important question about whether today's GWAS SNPs will remain useful for imputing causal variants that are discovered using very large sample sizes in future studies of height, or whether the causal variants themselves will need to be genotyped de novo in order to build a prediction model that ultimately captures a large fraction of the

  8. Characterization of genome-wide microsatellite markers in rabbitfishes, an important resource for artisanal fisheries in the Indo-West Pacific.

    Science.gov (United States)

    Kiper, Ilkser Erdem; Bloomer, Paulette; Borsa, Philippe; Hoareau, Thierry Bernard

    2018-02-01

    Rabbitfishes are reef-associated fishes that support local fisheries throughout the Indo-West Pacific region. Sound management of the resource requires the development of molecular tools for appropriate stock delimitation of the different species in the family. Microsatellite markers were developed for the cordonnier, Siganus sutor, and their potential for cross-amplification was investigated in 12 congeneric species. A library of 792 repeat-containing sequences was built. Nineteen sets of newly developed primers, and 14 universal finfish microsatellites were tested in S. sutor. Amplification success of the 19 Siganus-specific markers ranged from 32 to 79% in the 12 other Siganus species, slightly decreasing when the genetic distance of the target species to S. sutor increased. Seventeen of these markers were polymorphic in S. sutor and were further assayed in S. luridus, S. rivulatus, and S. spinus, of which respectively 9, 10 and 8 were polymorphic. Statistical power analysis and an analysis of molecular variance showed that subtle genetic differentiation can be detected using these markers, highlighting their utility for the study of genetic diversity and population genetic structure in rabbitfishes.

  9. Development of highly polymorphic simple sequence repeat markers using genome-wide microsatellite variant analysis in Foxtail millet [Setaria italica (L.) P. Beauv].

    Science.gov (United States)

    Zhang, Shuo; Tang, Chanjuan; Zhao, Qiang; Li, Jing; Yang, Lifang; Qie, Lufeng; Fan, Xingke; Li, Lin; Zhang, Ning; Zhao, Meicheng; Liu, Xiaotong; Chai, Yang; Zhang, Xue; Wang, Hailong; Li, Yingtao; Li, Wen; Zhi, Hui; Jia, Guanqing; Diao, Xianmin

    2014-01-28

    Foxtail millet (Setaria italica (L.) Beauv.) is an important gramineous grain-food and forage crop. It is grown worldwide for human and livestock consumption. Its small genome and diploid nature have led to foxtail millet fast becoming a novel model for investigating plant architecture, drought tolerance and C4 photosynthesis of grain and bioenergy crops. Therefore, cost-effective, reliable and highly polymorphic molecular markers covering the entire genome are required for diversity, mapping and functional genomics studies in this model species. A total of 5,020 highly repetitive microsatellite motifs were isolated from the released genome of the genotype 'Yugu1' by sequence scanning. Based on sequence comparison between S. italica and S. viridis, a set of 788 SSR primer pairs were designed. Of these primers, 733 produced reproducible amplicons and were polymorphic among 28 Setaria genotypes selected from diverse geographical locations. The number of alleles detected by these SSR markers ranged from 2 to 16, with an average polymorphism information content of 0.67. The result obtained by neighbor-joining cluster analysis of 28 Setaria genotypes, based on Nei's genetic distance of the SSR data, showed that these SSR markers are highly polymorphic and effective. A large set of highly polymorphic SSR markers were successfully and efficiently developed based on genomic sequence comparison between different genotypes of the genus Setaria. The large number of new SSR markers and their placement on the physical map represent a valuable resource for studying diversity, constructing genetic maps, functional gene mapping, QTL exploration and molecular breeding in foxtail millet and its closely related species.

  10. Genome-wide association study for Identification and validation of novel SNP markers for Sr6 stem rust resistance gene in bread wheat

    Science.gov (United States)

    Stem rust (caused by Puccinia graminis f. sp. tritici Erikss. & E. Henn.), is a major disease in wheat (Triticum aestivium L.). However, in recent years it occurs rarely in Nebraska due to weather and the effective selection and gene pyramiding of resistance genes. To understand the genetic basis of...

  11. Genome-wide single nucleotide polymorphisms (SNPs) for a model invasive ascidian Botryllus schlosseri.

    Science.gov (United States)

    Gao, Yangchun; Li, Shiguo; Zhan, Aibin

    2018-04-01

    Invasive species cause huge damages to ecology, environment and economy globally. The comprehensive understanding of invasion mechanisms, particularly genetic bases of micro-evolutionary processes responsible for invasion success, is essential for reducing potential damages caused by invasive species. The golden star tunicate, Botryllus schlosseri, has become a model species in invasion biology, mainly owing to its high invasiveness nature and small well-sequenced genome. However, the genome-wide genetic markers have not been well developed in this highly invasive species, thus limiting the comprehensive understanding of genetic mechanisms of invasion success. Using restriction site-associated DNA (RAD) tag sequencing, here we developed a high-quality resource of 14,119 out of 158,821 SNPs for B. schlosseri. These SNPs were relatively evenly distributed at each chromosome. SNP annotations showed that the majority of SNPs (63.20%) were located at intergenic regions, and 21.51% and 14.58% were located at introns and exons, respectively. In addition, the potential use of the developed SNPs for population genomics studies was primarily assessed, such as the estimate of observed heterozygosity (H O ), expected heterozygosity (H E ), nucleotide diversity (π), Wright's inbreeding coefficient (F IS ) and effective population size (Ne). Our developed SNP resource would provide future studies the genome-wide genetic markers for genetic and genomic investigations, such as genetic bases of micro-evolutionary processes responsible for invasion success.

  12. Genome-wide association study for host response to bovine leukemia virus in Holstein cows.

    Science.gov (United States)

    Brym, P; Bojarojć-Nosowicz, B; Oleński, K; Hering, D M; Ruść, A; Kaczmarczyk, E; Kamiński, S

    2016-07-01

    The mechanisms of leukemogenesis induced by bovine leukemia virus (BLV) and the processes underlying the phenomenon of differential host response to BLV infection still remain poorly understood. The aim of the study was to screen the entire cattle genome to identify markers and candidate genes that might be involved in host response to bovine leukemia virus infection. A genome-wide association study was performed using Holstein cows naturally infected by BLV. A data set included 43 cows (BLV positive) and 30 cows (BLV negative) genotyped for 54,609 SNP markers (Illumina Bovine SNP50 BeadChip). The BLV status of cows was determined by serum ELISA, nested-PCR and hematological counts. Linear Regression Analysis with a False Discovery Rate and kinship matrix (computed on the autosomal SNPs) was calculated to find out which SNP markers significantly differentiate BLV-positive and BLV-negative cows. Nine markers reached genome-wide significance. The most significant SNPs were located on chromosomes 23 (rs41583098), 3 (rs109405425, rs110785500) and 8 (rs43564499) in close vicinity of a patatin-like phospholipase domain containing 1 (PNPLA1); adaptor-related protein complex 4, beta 1 subunit (AP4B1); tripartite motif-containing 45 (TRIM45) and cell division cycle associated 2 (CDCA2) genes, respectively. Furthermore, a list of 41 candidate genes was composed based on their proximity to significant markers (within a distance of ca. 1 Mb) and functional involvement in processes potentially underlying BLV-induced pathogenesis. In conclusion, it was demonstrated that host response to BLV infection involves nine sub-regions of the cattle genome (represented by 9 SNP markers), containing many genes which, based on the literature, could be involved to enzootic bovine leukemia progression. New group of promising candidate genes associated with the host response to BLV infection were identified and could therefore be a target for future studies. The functions of candidate genes

  13. Applying SNP marker technology in the cacao breeding program at the Cocoa Research Institute of Ghana

    Science.gov (United States)

    In this investigation 45 parental cacao plants and five progeny derived from the parental stock studied were genotyped using six SNP markers to determine off-types or mislabeled clones and to authenticate crosses made in the Cocoa Research Institute of Ghana (CRIG) breeding program. Investigation wa...

  14. Genome-Wide SNP Discovery and Analysis of Genetic Diversity in Farmed Sika Deer (Cervus nippon in Northeast China Using Double-Digest Restriction Site-Associated DNA Sequencing

    Directory of Open Access Journals (Sweden)

    Hengxing Ba

    2017-09-01

    Full Text Available Sika deer are an economically valuable species owing to their use in traditional Chinese medicine, particularly their velvet antlers. Sika deer in northeast China are mostly farmed in enclosure. Therefore, genetic management of farmed sika deer would benefit from detailed knowledge of their genetic diversity. In this study, we generated over 1.45 billion high-quality paired-end reads (288 Gbp across 42 unrelated individuals using double-digest restriction site-associated DNA sequencing (ddRAD-seq. A total of 96,188 (29.63% putative biallelic SNP loci were identified with an average sequencing depth of 23×. Based on the analysis, we found that the majority of the loci had a deficit of heterozygotes (FIS >0 and low values of Hobs, which could be due to inbreeding and Wahlund effects. We also developed a collection of high-quality SNP probes that will likely be useful in a variety of applications in genotyping for cervid species in the future.

  15. Genome-Wide SNP Discovery and Analysis of Genetic Diversity in Farmed Sika Deer (Cervus nippon) in Northeast China Using Double-Digest Restriction Site-Associated DNA Sequencing.

    Science.gov (United States)

    Ba, Hengxing; Jia, Boyin; Wang, Guiwu; Yang, Yifeng; Kedem, Gilead; Li, Chunyi

    2017-09-07

    Sika deer are an economically valuable species owing to their use in traditional Chinese medicine, particularly their velvet antlers. Sika deer in northeast China are mostly farmed in enclosure. Therefore, genetic management of farmed sika deer would benefit from detailed knowledge of their genetic diversity. In this study, we generated over 1.45 billion high-quality paired-end reads (288 Gbp) across 42 unrelated individuals using double-digest restriction site-associated DNA sequencing (ddRAD-seq). A total of 96,188 (29.63%) putative biallelic SNP loci were identified with an average sequencing depth of 23×. Based on the analysis, we found that the majority of the loci had a deficit of heterozygotes (F IS >0) and low values of H obs , which could be due to inbreeding and Wahlund effects. We also developed a collection of high-quality SNP probes that will likely be useful in a variety of applications in genotyping for cervid species in the future. Copyright © 2017 Ba et al.

  16. Agronomic and seed quality traits dissected by genome-wide association mapping in Brassica napus

    Directory of Open Access Journals (Sweden)

    Niklas eKörber

    2016-03-01

    Full Text Available In Brassica napus breeding, traits related to commercial success are of highest importance for plant breeders. However, such traits can only be assessed in an advanced developmental stage. % as well as require high experimental effort due to their quantitative inheritance and the importance of genotype*environment interaction. Molecular markers genetically linked to such traits have the potential to accelerate the breeding process of B. napus by marker-assisted selection. Therefore, the objectives of this study were to identify (i genome regions associated with the examined agronomic and seed quality traits, (ii the interrelationship of population structure and the detected associations, and (iii candidate genes for the revealed associations. The diversity set used in this study consisted of 405 Brassica napus inbred lines which were genotyped using a 6K single nucleotide polymorphism (SNP array and phenotyped for agronomic and seed quality traits in field trials. In a genome-wide association study, we detected a total of 112 associations between SNPs and the seed quality traits as well as 46 SNP-trait associations for the agronomic traits with a P-value 100 and a sequence identity of > 70 % to A. thaliana or B. rapa could be found for the agronomic SNP-trait associations and 187 hits of potential candidate genes for the seed quality SNP-trait associations.

  17. GWAS and Genomic Prediction Based on Markers of SNP-CHIPS and Sequence Data in Cattle Populations

    DEFF Research Database (Denmark)

    Wu, Xiaoping

    This thesis investigated the methods and models for genome wide association study and genomic prediction. The main conclusions are: 1) The power of QTL detection can be increased by increasing marker densities, and the Bayesian variable selection model together with the analysis of the QTL intens...

  18. Prediction of malting quality traits in barley based on genome-wide marker data to assess the potential of genomic selection.

    Science.gov (United States)

    Schmidt, Malthe; Kollers, Sonja; Maasberg-Prelle, Anja; Großer, Jörg; Schinkel, Burkhard; Tomerius, Alexandra; Graner, Andreas; Korzun, Viktor

    2016-02-01

    Genomic prediction of malting quality traits in barley shows the potential of applying genomic selection to improve selection for malting quality and speed up the breeding process. Genomic selection has been applied to various plant species, mostly for yield or yield-related traits such as grain dry matter yield or thousand kernel weight, and improvement of resistances against diseases. Quality traits have not been the main scope of analysis for genomic selection, but have rather been addressed by marker-assisted selection. In this study, the potential to apply genomic selection to twelve malting quality traits in two commercial breeding programs of spring and winter barley (Hordeum vulgare L.) was assessed. Phenotypic means were calculated combining multilocational field trial data from 3 or 4 years, depending on the trait investigated. Three to five locations were available in each of these years. Heritabilities for malting traits ranged between 0.50 and 0.98. Predictive abilities (PA), as derived from cross validation, ranged between 0.14 to 0.58 for spring barley and 0.40-0.80 for winter barley. Small training sets were shown to be sufficient to obtain useful PAs, possibly due to the narrow genetic base in this breeding material. Deployment of genomic selection in malting barley breeding clearly has the potential to reduce cost intensive phenotyping for quality traits, increase selection intensity and to shorten breeding cycles.

  19. A genome-wide transcriptome map of pistachio (Pistacia vera L.) provides novel insights into salinity-related genes and marker discovery.

    Science.gov (United States)

    Moazzzam Jazi, Maryam; Seyedi, Seyed Mahdi; Ebrahimie, Esmaeil; Ebrahimi, Mansour; De Moro, Gianluca; Botanga, Christopher

    2017-08-17

    of NCED3 and SOS1 genes were observed between salt-sensitive and salt-tolerant cultivars. This study, as the first report on the whole transcriptome survey of P. vera, provides important resources and paves the way for functional and comparative genomic studies on this major tree to discover the salinity tolerance-related markers and stress response mechanisms for breeding of new pistachio cultivars with more salinity tolerance.

  20. Report on the development of putative functional SSR and SNP markers in passion fruits.

    Science.gov (United States)

    da Costa, Zirlane Portugal; Munhoz, Carla de Freitas; Vieira, Maria Lucia Carneiro

    2017-09-06

    Passionflowers Passiflora edulis and Passiflora alata are diploid, outcrossing and understudied fruit bearing species. In Brazil, passion fruit cultivation began relatively recently and has earned the country an outstanding position as the world's top producer of passion fruit. The fruit's main economic value lies in the production of juice, an essential exotic ingredient in juice blends. Currently, crop improvement strategies, including those for underexploited tropical species, tend to incorporate molecular genetic approaches. In this study, we examined a set of P. edulis transcripts expressed in response to infection by Xanthomonas axonopodis, (the passion fruit's main bacterial pathogen that attacks the vines), aiming at the development of putative functional markers, i.e. SSRs (simple sequence repeats) and SNPs (single nucleotide polymorphisms). A total of 210 microsatellites were found in 998 sequences, and trinucleotide repeats were found to be the most frequent (31.4%). Of the sequences selected for designing primers, 80.9% could be used to develop SSR markers, and 60.6% SNP markers for P. alata. SNPs were all biallelic and found within 15 gene fragments of P. alata. Overall, gene fragments generated 10,003 bp. SNP frequency was estimated as one SNP every 294 bp. Polymorphism rates revealed by SSR and SNP loci were 29.4 and 53.6%, respectively. Passiflora edulis transcripts were useful for the development of putative functional markers for P. alata, suggesting a certain level of sequence conservation between these cultivated species. The markers developed herein could be used for genetic mapping purposes and also in diversity studies.

  1. High-resolution genetic map for understanding the effect of genome-wide recombination rate on nucleotide diversity in watermelon.

    Science.gov (United States)

    Reddy, Umesh K; Nimmakayala, Padma; Levi, Amnon; Abburi, Venkata Lakshmi; Saminathan, Thangasamy; Tomason, Yan R; Vajja, Gopinath; Reddy, Rishi; Abburi, Lavanya; Wehner, Todd C; Ronin, Yefim; Karol, Abraham

    2014-09-15

    We used genotyping by sequencing to identify a set of 10,480 single nucleotide polymorphism (SNP) markers for constructing a high-resolution genetic map of 1096 cM for watermelon. We assessed the genome-wide variation in recombination rate (GWRR) across the map and found an association between GWRR and genome-wide nucleotide diversity. Collinearity between the map and the genome-wide reference sequence for watermelon was studied to identify inconsistency and chromosome rearrangements. We assessed genome-wide nucleotide diversity, linkage disequilibrium (LD), and selective sweep for wild, semi-wild, and domesticated accessions of Citrullus lanatus var. lanatus to track signals of domestication. Principal component analysis combined with chromosome-wide phylogenetic study based on 1563 SNPs obtained after LD pruning with minor allele frequency of 0.05 resolved the differences between semi-wild and wild accessions as well as relationships among worldwide sweet watermelon. Population structure analysis revealed predominant ancestries for wild, semi-wild, and domesticated watermelons as well as admixture of various ancestries that were important for domestication. Sliding window analysis of Tajima's D across various chromosomes was used to resolve selective sweep. LD decay was estimated for various chromosomes. We identified a strong selective sweep on chromosome 3 consisting of important genes that might have had a role in sweet watermelon domestication. Copyright © 2014 Reddy et al.

  2. Genome-wide study of association and interaction with maternal cytomegalovirus infection suggests new schizophrenia loci.

    Science.gov (United States)

    Børglum, A D; Demontis, D; Grove, J; Pallesen, J; Hollegaard, M V; Pedersen, C B; Hedemand, A; Mattheisen, M; Uitterlinden, A; Nyegaard, M; Ørntoft, T; Wiuf, C; Didriksen, M; Nordentoft, M; Nöthen, M M; Rietschel, M; Ophoff, R A; Cichon, S; Yolken, R H; Hougaard, D M; Mortensen, P B; Mors, O

    2014-03-01

    Genetic and environmental components as well as their interaction contribute to the risk of schizophrenia, making it highly relevant to include environmental factors in genetic studies of schizophrenia. This study comprises genome-wide association (GWA) and follow-up analyses of all individuals born in Denmark since 1981 and diagnosed with schizophrenia as well as controls from the same birth cohort. Furthermore, we present the first genome-wide interaction survey of single nucleotide polymorphisms (SNPs) and maternal cytomegalovirus (CMV) infection. The GWA analysis included 888 cases and 882 controls, and the follow-up investigation of the top GWA results was performed in independent Danish (1396 cases and 1803 controls) and German-Dutch (1169 cases, 3714 controls) samples. The SNPs most strongly associated in the single-marker analysis of the combined Danish samples were rs4757144 in ARNTL (P=3.78 × 10(-6)) and rs8057927 in CDH13 (P=1.39 × 10(-5)). Both genes have previously been linked to schizophrenia or other psychiatric disorders. The strongest associated SNP in the combined analysis, including Danish and German-Dutch samples, was rs12922317 in RUNDC2A (P=9.04 × 10(-7)). A region-based analysis summarizing independent signals in segments of 100 kb identified a new region-based genome-wide significant locus overlapping the gene ZEB1 (P=7.0 × 10(-7)). This signal was replicated in the follow-up analysis (P=2.3 × 10(-2)). Significant interaction with maternal CMV infection was found for rs7902091 (P(SNP × CMV)=7.3 × 10(-7)) in CTNNA3, a gene not previously implicated in schizophrenia, stressing the importance of including environmental factors in genetic studies.

  3. Comparison of SSR and SNP markers in estimation of genetic diversity and population structure of Indian rice varieties.

    Directory of Open Access Journals (Sweden)

    Nivedita Singh

    Full Text Available Simple sequence repeat (SSR and Single Nucleotide Polymorphic (SNP, the two most robust markers for identifying rice varieties were compared for assessment of genetic diversity and population structure. Total 375 varieties of rice from various regions of India archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty six genetic markers, each of hypervariable SSR (HvSSR and SNP which were distributed across 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers with an average of 2.22 alleles per locus whereas, 72 alleles were amplified with SNP markers. Polymorphic information content (PIC values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; utilizing an unrooted tree all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA indicated that maximum diversity was partitioned between and within individual level but not between populations. Principal coordinate analysis (PCoA with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation whereas, in case of SNP markers varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure were tested using K values from 1 to 20, but there was no clear population structure, therefore Ln(PD derived Δk was plotted against the K to determine the number of populations. In case of SSR maximum Δk was at K=5 whereas, in case of SNP maximum Δk was found at K=15, suggesting that resolution of population was higher with SNP markers, but SSR were more efficient for diversity analysis.

  4. Comparison of SSR and SNP markers in estimation of genetic diversity and population structure of Indian rice varieties.

    Science.gov (United States)

    Singh, Nivedita; Choudhury, Debjani Roy; Singh, Amit Kumar; Kumar, Sundeep; Srinivasan, Kalyani; Tyagi, R K; Singh, N K; Singh, Rakesh

    2013-01-01

    Simple sequence repeat (SSR) and Single Nucleotide Polymorphic (SNP), the two most robust markers for identifying rice varieties were compared for assessment of genetic diversity and population structure. Total 375 varieties of rice from various regions of India archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty six genetic markers, each of hypervariable SSR (HvSSR) and SNP which were distributed across 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers with an average of 2.22 alleles per locus whereas, 72 alleles were amplified with SNP markers. Polymorphic information content (PIC) values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; utilizing an unrooted tree all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA) indicated that maximum diversity was partitioned between and within individual level but not between populations. Principal coordinate analysis (PCoA) with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation whereas, in case of SNP markers varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure were tested using K values from 1 to 20, but there was no clear population structure, therefore Ln(PD) derived Δk was plotted against the K to determine the number of populations. In case of SSR maximum Δk was at K=5 whereas, in case of SNP maximum Δk was found at K=15, suggesting that resolution of population was higher with SNP markers, but SSR were more efficient for diversity analysis.

  5. Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies.

    Science.gov (United States)

    Gimode, Davis; Odeny, Damaris A; de Villiers, Etienne P; Wanyonyi, Solomon; Dida, Mathews M; Mneney, Emmarold E; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M

    2016-01-01

    Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. Our results also reveal an unexploited finger millet genetic resource that can be included in the regional

  6. Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies.

    Directory of Open Access Journals (Sweden)

    Davis Gimode

    Full Text Available Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS technologies to develop both Simple Sequence Repeat (SSR and Single Nucleotide Polymorphism (SNP markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. Our results also reveal an unexploited finger millet genetic resource that can be included

  7. Genome-wide identification of breed-informative single-nucleotide ...

    African Journals Online (AJOL)

    This is because the SNPs on BovineSNP50 and GGP-80K assays were ascertained as being common in European taurine breeds. Lower MAF and SNP informativeness observed in this study limits the application of these assays in breed assignment, and could have other implications for genome-wide studies in South ...

  8. A novel statistic for genome-wide interaction analysis.

    Directory of Open Access Journals (Sweden)

    Xuesen Wu

    2010-09-01

    Full Text Available Although great progress in genome-wide association studies (GWAS has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked. The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001genome-wide interaction analysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search significant interaction between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  9. Using SNP markers to estimate additive, dominance and imprinting genetic variance

    DEFF Research Database (Denmark)

    Lopes, M S; Bastiaansen, J W M; Janss, Luc

    The contributions of additive, dominance and imprinting effects to the variance of number of teats (NT) were evaluated in two purebred pig populations using SNP markers. Three different random regression models were evaluated, accounting for the mean and: 1) additive effects (MA), 2) additive...... and dominance effects (MAD) and 3) additive, dominance and imprinting effects (MADI). Additive heritability estimates were 0.30, 0.28 and 0.27-0.28 in both lines using MA, MAD and MADI, respectively. Dominance heritability ranged from 0.06 to 0.08 using MAD and MADI. Imprinting heritability ranged from 0.......01 to 0.02. Dominance effects make an important contribution to the genetic variation of NT in the two lines evaluated. Imprinting effects appeared less important for NT than additive and dominance effects. The SNP random regression model presented and evaluated in this study is a feasible approach...

  10. Identification of Promising Mutants Associated with Egg Production Traits Revealed by Genome-Wide Association Study.

    Directory of Open Access Journals (Sweden)

    Jingwei Yuan

    Full Text Available Egg number (EN, egg laying rate (LR and age at first egg (AFE are important production traits related to egg production in poultry industry. To better understand the knowledge of genetic architecture of dynamic EN during the whole laying cycle and provide the precise positions of associated variants for EN, LR and AFE, laying records from 21 to 72 weeks of age were collected individually for 1,534 F2 hens produced by reciprocal crosses between White Leghorn and Dongxiang Blue-shelled chicken, and their genotypes were assayed by chicken 600 K Affymetrix high density genotyping arrays. Subsequently, pedigree and SNP-based genetic parameters were estimated and a genome-wide association study (GWAS was conducted on EN, LR and AFE. The heritability estimates were similar between pedigree and SNP-based estimates varying from 0.17 to 0.36. In the GWA analysis, we identified nine genome-wide significant loci associated with EN of the laying periods from 21 to 26 weeks, 27 to 36 weeks and 37 to 72 weeks. Analysis of GTF2A1 and CLSPN suggested that they influenced the function of ovary and uterus, and may be considered as relevant candidates. The identified SNP rs314448799 for accumulative EN from 21 to 40 weeks on chromosome 5 created phenotypic differences of 6.86 eggs between two homozygous genotypes, which could be potentially applied to the molecular breeding for EN selection. Moreover, our finding showed that LR was a moderate polygenic trait. The suggestive significant region on chromosome 16 for AFE suggested the relationship between sex maturity and immune in the current population. The present study comprehensively evaluates the role of genetic variants in the development of egg laying. The findings will be helpful to investigation of causative genes function and future marker-assisted selection and genomic selection in chickens.

  11. A high-density SNP genetic linkage map for the silver-lipped pearl oyster, Pinctada maxima: a valuable resource for gene localisation and marker-assisted selection.

    Science.gov (United States)

    Jones, David B; Jerry, Dean R; Khatkar, Mehar S; Raadsma, Herman W; Zenger, Kyall R

    2013-11-20

    The silver-lipped pearl oyster, Pinctada maxima, is an important tropical aquaculture species extensively farmed for the highly sought "South Sea" pearls. Traditional breeding programs have been initiated for this species in order to select for improved pearl quality, but many economic traits under selection are complex, polygenic and confounded with environmental factors, limiting the accuracy of selection. The incorporation of a marker-assisted selection (MAS) breeding approach would greatly benefit pearl breeding programs by allowing the direct selection of genes responsible for pearl quality. However, before MAS can be incorporated, substantial genomic resources such as genetic linkage maps need to be generated. The construction of a high-density genetic linkage map for P. maxima is not only essential for unravelling the genomic architecture of complex pearl quality traits, but also provides indispensable information on the genome structure of pearl oysters. A total of 1,189 informative genome-wide single nucleotide polymorphisms (SNPs) were incorporated into linkage map construction. The final linkage map consisted of 887 SNPs in 14 linkage groups, spans a total genetic distance of 831.7 centimorgans (cM), and covers an estimated 96% of the P. maxima genome. Assessment of sex-specific recombination across all linkage groups revealed limited overall heterochiasmy between the sexes (i.e. 1.15:1 F/M map length ratio). However, there were pronounced localised differences throughout the linkage groups, whereby male recombination was suppressed near the centromeres compared to female recombination, but inflated towards telomeric regions. Mean values of LD for adjacent SNP pairs suggest that a higher density of markers will be required for powerful genome-wide association studies. Finally, numerous nacre biomineralization genes were localised providing novel positional information for these genes. This high-density SNP genetic map is the first comprehensive linkage

  12. Characterizing associations and SNP-environment interactions for GWAS-identified prostate cancer risk markers--results from BPC3.

    Directory of Open Access Journals (Sweden)

    Sara Lindstrom

    2011-02-01

    Full Text Available Genome-wide association studies (GWAS have identified multiple single nucleotide polymorphisms (SNPs associated with prostate cancer risk. However, whether these associations can be consistently replicated, vary with disease aggressiveness (tumor stage and grade and/or interact with non-genetic potential risk factors or other SNPs is unknown. We therefore genotyped 39 SNPs from regions identified by several prostate cancer GWAS in 10,501 prostate cancer cases and 10,831 controls from the NCI Breast and Prostate Cancer Cohort Consortium (BPC3. We replicated 36 out of 39 SNPs (P-values ranging from 0.01 to 10⁻²⁸. Two SNPs located near KLK3 associated with PSA levels showed differential association with Gleason grade (rs2735839, P = 0.0001 and rs266849, P = 0.0004; case-only test, where the alleles associated with decreasing PSA levels were inversely associated with low-grade (as defined by Gleason grade < 8 tumors but positively associated with high-grade tumors. No other SNP showed differential associations according to disease stage or grade. We observed no effect modification by SNP for association with age at diagnosis, family history of prostate cancer, diabetes, BMI, height, smoking or alcohol intake. Moreover, we found no evidence of pair-wise SNP-SNP interactions. While these SNPs represent new independent risk factors for prostate cancer, we saw little evidence for effect modification by other SNPs or by the environmental factors examined.

  13. Genome-wide Association Study Identifies New Loci for Resistance to Leptosphaeria maculans in Canola

    Directory of Open Access Journals (Sweden)

    Harsh Raman

    2016-10-01

    Full Text Available Blackleg, caused by Leptosphaeria maculans, is a significant disease which affects the sustainable production of canola. This study reports a genome-wide association study based on 18,804 polymorphic SNPs to identify loci associated with qualitative and quantitative resistance to L. maculans. Genomic regions delimited with 503 significant SNP markers, that are associated with resistance evaluated using 12 single spore isolates and pathotypes from four canola stubble were identified. Several significant associations were detected at known disease resistance loci including in the vicinity of recently cloned Rlm2/LepR3 genes, and at new loci on chromosomes A01/C01, A02/C02, A03/C03, A05/C05, A06, A08, and A09. In addition, we validated statistically significant associations on A01, A07 and A10 in four genetic mapping populations, demonstrating that GWAS marker loci are indeed associated with resistance to L. maculans. One of the novel loci identified for the first time, Rlm12, conveys adult plant resistance and mapped within 13.2 kb from Arabidopsis R gene of TIR-NBS class. We showed that resistance loci are located in the vicinity of R genes of A. thaliana and B. napus on the sequenced genome of B. napus cv. Darmor-bzh. Significantly associated SNP markers provide a valuable tool to enrich germplasm for favorable alleles in order to improve the level of resistance to L. maculans in canola.

  14. Diversity in 113 cowpea [Vigna unguiculata (L) Walp] accessions assessed with 458 SNP markers.

    Science.gov (United States)

    Egbadzor, Kenneth F; Ofori, Kwadwo; Yeboah, Martin; Aboagye, Lawrence M; Opoku-Agyeman, Michael O; Danquah, Eric Y; Offei, Samuel K

    2014-01-01

    Single Nucleotide Polymorphism (SNP) markers were used in characterization of 113 cowpea accessions comprising of 108 from Ghana and 5 from abroad. Leaf tissues from plants cultivated at the University of Ghana were genotyped at KBioscience in the United Kingdom. Data was generated for 477 SNPs, out of which 458 revealed polymorphism. The results were used to analyze genetic dissimilarity among the accessions using Darwin 5 software. The markers discriminated among all of the cowpea accessions and the dissimilarity values which ranged from 0.006 to 0.63 were used for factorial plot. Unexpected high levels of heterozygosity were observed on some of the accessions. Accessions known to be closely related clustered together in a dendrogram drawn with WPGMA method. A maximum length sub-tree which comprised of 48 core accessions was constructed. The software package structure was used to separate accessions into three groups, and the programme correctly identified varieties that were known hybrids. The hybrids were those accessions with numerous heterozygous loci. The structure plot showed closely related accessions with similar genome patterns. The SNP markers were more efficient in discriminating among the cowpea germplasm than morphological, seed protein polymorphism and simple sequence repeat studies reported earlier on the same collection.

  15. A genome-wide association study of social genetic effects in Landrace pigs.

    Science.gov (United States)

    Hong, Joon Ki; Jeong, Yong Dae; Cho, Eun Seok; Choi, Tae Jeong; Kim, Yong Min; Cho, Kyu Ho; Lee, Jae Bong; Lim, Hyun Tae; Lee, Deuk Hwan

    2018-06-01

    The genetic effects of an individual on the phenotypes of its social partners, such as its pen mates, are known as social genetic effects. This study aims to identify the candidate genes for social (pen-mates') average daily gain (ADG) in pigs by using the genome-wide association approach. Social ADG (sADG) was the average ADG of unrelated pen-mates (strangers). We used the phenotype data (16,802 records) after correcting for batch (week), sex, pen, number of strangers (1 to 7 pigs) in the pen, full-sib rate (0% to 80%) within pen, and age at the end of the test. A total of 1,041 pigs from Landrace breeds were genotyped using the Illumina PorcineSNP60 v2 BeadChip panel, which comprised 61,565 single nucleotide polymorphism (SNP) markers. After quality control, 909 individuals and 39,837 markers remained for sADG in genome-wide association study. We detected five new SNPs, all on chromosome 6, which have not been associated with social ADG or other growth traits to date. One SNP was inside the prostaglandin F2α receptor ( PTGFR ) gene, another SNP was located 22 kb upstream of gene interferon-induced protein 44 ( IFI44 ), and the last three SNPs were between 161 kb and 191 kb upstream of the EGF latrophilin and seven transmembrane domain-containing protein 1 ( ELTD1 ) gene. PTGFR, IFI44, and ELTD1 were never associated with social interaction and social genetic effects in any of the previous studies. The identification of several genomic regions, and candidate genes associated with social genetic effects reported here, could contribute to a better understanding of the genetic basis of interaction traits for ADG. In conclusion, we suggest that the PTGFR, IFI44, and ELTD1 may be used as a molecular marker for sADG, although their functional effect was not defined yet. Thus, it will be of interest to execute association studies in those genes.

  16. Development of a set of SNP markers present in expressed genes of the apple.

    Science.gov (United States)

    Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S

    2008-11-01

    Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequenced tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR was amplified using two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5, map using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.

  17. Genome wide selection in Citrus breeding.

    Science.gov (United States)

    Gois, I B; Borém, A; Cristofani-Yaly, M; de Resende, M D V; Azevedo, C F; Bastianel, M; Novelli, V M; Machado, M A

    2016-10-17

    Genome wide selection (GWS) is essential for the genetic improvement of perennial species such as Citrus because of its ability to increase gain per unit time and to enable the efficient selection of characteristics with low heritability. This study assessed GWS efficiency in a population of Citrus and compared it with selection based on phenotypic data. A total of 180 individual trees from a cross between Pera sweet orange (Citrus sinensis Osbeck) and Murcott tangor (Citrus sinensis Osbeck x Citrus reticulata Blanco) were evaluated for 10 characteristics related to fruit quality. The hybrids were genotyped using 5287 DArT_seq TM (diversity arrays technology) molecular markers and their effects on phenotypes were predicted using the random regression - best linear unbiased predictor (rr-BLUP) method. The predictive ability, prediction bias, and accuracy of GWS were estimated to verify its effectiveness for phenotype prediction. The proportion of genetic variance explained by the markers was also computed. The heritability of the traits, as determined by markers, was 16-28%. The predictive ability of these markers ranged from 0.53 to 0.64, and the regression coefficients between predicted and observed phenotypes were close to unity. Over 35% of the genetic variance was accounted for by the markers. Accuracy estimates with GWS were lower than those obtained by phenotypic analysis; however, GWS was superior in terms of genetic gain per unit time. Thus, GWS may be useful for Citrus breeding as it can predict phenotypes early and accurately, and reduce the length of the selection cycle. This study demonstrates the feasibility of genomic selection in Citrus.

  18. Efficient genome-wide genotyping strategies and data integration in crop plants.

    Science.gov (United States)

    Torkamaneh, Davoud; Boyle, Brian; Belzile, François

    2018-03-01

    Next-generation sequencing (NGS) has revolutionized plant and animal research by providing powerful genotyping methods. This review describes and discusses the advantages, challenges and, most importantly, solutions to facilitate data processing, the handling of missing data, and cross-platform data integration. Next-generation sequencing technologies provide powerful and flexible genotyping methods to plant breeders and researchers. These methods offer a wide range of applications from genome-wide analysis to routine screening with a high level of accuracy and reproducibility. Furthermore, they provide a straightforward workflow to identify, validate, and screen genetic variants in a short time with a low cost. NGS-based genotyping methods include whole-genome re-sequencing, SNP arrays, and reduced representation sequencing, which are widely applied in crops. The main challenges facing breeders and geneticists today is how to choose an appropriate genotyping method and how to integrate genotyping data sets obtained from various sources. Here, we review and discuss the advantages and challenges of several NGS methods for genome-wide genetic marker development and genotyping in crop plants. We also discuss how imputation methods can be used to both fill in missing data in genotypic data sets and to integrate data sets obtained using different genotyping tools. It is our hope that this synthetic view of genotyping methods will help geneticists and breeders to integrate these NGS-based methods in crop plant breeding and research.

  19. Genome-wide association study of Stayability and Heifer Pregnancy in Red Angus cattle.

    Science.gov (United States)

    Speidel, S E; Buckley, B A; Boldt, R J; Enns, R M; Lee, J; Spangler, M L; Thomas, M G

    2018-04-03

    Reproductive performance is the most important component of cattle production from the standpoint of economic sustainability of commercial beef enterprises. Heifer Pregnancy (HPG) and Stayability (STAY) genetic predictions are 2 selection tools published by the Red Angus Association of America (RAAA) to assist with improvements in reproductive performance. Given the importance of HPG and STAY to the profitability of commercial beef enterprises, the objective of this study was to identify QTL associated with both HPG and STAY in Red Angus cattle. A genome-wide association study (GWAS) was performed using deregressed HPG and STAY EBV, calculated using a single-trait animal model and a 3-generation pedigree with data from the Spring 2015 RAAA National Cattle Evaluation. Each individual animal possessed 74,659 SNP genotypes. Individual animals with a deregressed EBV reliability > 0.05 were merged with the genotype file and marker quality control was performed. Criteria for sifting genotypes consisted of removing those markers where any of the following were found: average call rate less than 0.85, minor allele frequency 0.99). These criteria resulted in 2,664 animals with 62,807 SNP available for GWAS. Association studies were performed using a Bayes Cπ model in the BOLT software package. Marker significance was calculated as the posterior probability of inclusion (PPI), or the number of instances a specific marker was sampled divided by the total number of samples retained from the Markov chain Monte Carlo chains. Nine markers, with a PPI ≥ 3% were identified as QTL associated with HPG on BTA 1, 11, 13, 23, and 29. Twelve markers, with a PPI ≥ 75% were identified as QTL associated with STAY on BTA 6, 8, 9, 12, 15, 18, 22, and 23.

  20. Genome-Wide Association Mapping of Barley Yellow Dwarf Virus Tolerance in Spring Oat (Avena sativa L..

    Directory of Open Access Journals (Sweden)

    Bradley J Foresman

    Full Text Available Barley yellow dwarf viruses (BYDVs are responsible for the disease barley yellow dwarf (BYD and affect many cereals including oat (Avena sativa L.. Until recently, the molecular marker technology in oat has not allowed for many marker-trait association studies to determine the genetic mechanisms for tolerance. A genome-wide association study (GWAS was performed on 428 spring oat lines using a recently developed high-density oat single nucleotide polymorphism (SNP array as well as a SNP-based consensus map. Marker-trait associations were performed using a Q-K mixed model approach to control for population structure and relatedness. Six significant SNP-trait associations representing two QTL were found on chromosomes 3C (Mrg17 and 18D (Mrg04. This is the first report of BYDV tolerance QTL on chromosome 3C (Mrg17 and 18D (Mrg04. Haplotypes using the two QTL were evaluated and distinct classes for tolerance were identified based on the number of favorable alleles. A large number of lines carrying both favorable alleles were observed in the panel.

  1. Genome-Wide Analysis of Seed Acid Detergent Lignin (ADL) and Hull Content in Rapeseed (Brassica napus L.)

    Science.gov (United States)

    Wei, Lijuan; Qu, Cunmin; Xu, Xinfu; Lu, Kun; Qian, Wei; Li, Jiana; Li, Maoteng; Liu, Liezhao

    2015-01-01

    A stable yellow-seeded variety is the breeding goal for obtaining the ideal rapeseed (Brassica napus L.) plant, and the amount of acid detergent lignin (ADL) in the seeds and the hull content (HC) are often used as yellow-seeded rapeseed screening indices. In this study, a genome-wide association analysis of 520 accessions was performed using the Q + K model with a total of 31,839 single-nucleotide polymorphism (SNP) sites. As a result, three significant associations on the B. napus chromosomes A05, A09, and C05 were detected for seed ADL content. The peak SNPs were within 9.27, 14.22, and 20.86 kb of the key genes BnaA.PAL4, BnaA.CAD2/BnaA.CAD3, and BnaC.CCR1, respectively. Further analyses were performed on the major locus of A05, which was also detected in the seed HC examination. A comparison of our genome-wide association study (GWAS) results and previous linkage mappings revealed a common chromosomal region on A09, which indicates that GWAS can be used as a powerful complementary strategy for dissecting complex traits in B. napus. Genomic selection (GS) utilizing the significant SNP markers based on the GWAS results exhibited increased predictive ability, indicating that the predictive ability of a given model can be substantially improved by using GWAS and GS. PMID:26673885

  2. Integrative genome-wide expression profiling identifies three distinct molecular subgroups of renal cell carcinoma with different patient outcome

    Directory of Open Access Journals (Sweden)

    Beleut Manfred

    2012-07-01

    Full Text Available Abstract Background Renal cell carcinoma (RCC is characterized by a number of diverse molecular aberrations that differ among individuals. Recent approaches to molecularly classify RCC were based on clinical, pathological as well as on single molecular parameters. As a consequence, gene expression patterns reflecting the sum of genetic aberrations in individual tumors may not have been recognized. In an attempt to uncover such molecular features in RCC, we used a novel, unbiased and integrative approach. Methods We integrated gene expression data from 97 primary RCC of different pathologic parameters, 15 RCC metastases as well as 34 cancer cell lines for two-way nonsupervised hierarchical clustering using gene groups suggested by the PANTHER Classification System. We depicted the genomic landscape of the resulted tumor groups by means of Single Nuclear Polymorphism (SNP technology. Finally, the achieved results were immunohistochemically analyzed using a tissue microarray (TMA composed of 254 RCC. Results We found robust, genome wide expression signatures, which split RCC into three distinct molecular subgroups. These groups remained stable even if randomly selected gene sets were clustered. Notably, the pattern obtained from RCC cell lines was clearly distinguishable from that of primary tumors. SNP array analysis demonstrated differing frequencies of chromosomal copy number alterations among RCC subgroups. TMA analysis with group-specific markers showed a prognostic significance of the different groups. Conclusion We propose the existence of characteristic and histologically independent genome-wide expression outputs in RCC with potential biological and clinical relevance.

  3. Genetic Variance Partitioning and Genome-Wide Prediction with Allele Dosage Information in Autotetraploid Potato.

    Science.gov (United States)

    Endelman, Jeffrey B; Carley, Cari A Schmitz; Bethke, Paul C; Coombs, Joseph J; Clough, Mark E; da Silva, Washington L; De Jong, Walter S; Douches, David S; Frederick, Curtis M; Haynes, Kathleen G; Holm, David G; Miller, J Creighton; Muñoz, Patricio R; Navarro, Felix M; Novy, Richard G; Palta, Jiwan P; Porter, Gregory A; Rak, Kyle T; Sathuvalli, Vidyasagar R; Thompson, Asunta L; Yencho, G Craig

    2018-05-01

    As one of the world's most important food crops, the potato ( Solanum tuberosum L.) has spurred innovation in autotetraploid genetics, including in the use of SNP arrays to determine allele dosage at thousands of markers. By combining genotype and pedigree information with phenotype data for economically important traits, the objectives of this study were to (1) partition the genetic variance into additive vs. nonadditive components, and (2) determine the accuracy of genome-wide prediction. Between 2012 and 2017, a training population of 571 clones was evaluated for total yield, specific gravity, and chip fry color. Genomic covariance matrices for additive ( G ), digenic dominant ( D ), and additive × additive epistatic ( G # G ) effects were calculated using 3895 markers, and the numerator relationship matrix ( A ) was calculated from a 13-generation pedigree. Based on model fit and prediction accuracy, mixed model analysis with G was superior to A for yield and fry color but not specific gravity. The amount of additive genetic variance captured by markers was 20% of the total genetic variance for specific gravity, compared to 45% for yield and fry color. Within the training population, including nonadditive effects improved accuracy and/or bias for all three traits when predicting total genotypic value. When six F 1 populations were used for validation, prediction accuracy ranged from 0.06 to 0.63 and was consistently lower (0.13 on average) without allele dosage information. We conclude that genome-wide prediction is feasible in potato and that it will improve selection for breeding value given the substantial amount of nonadditive genetic variance in elite germplasm. Copyright © 2018 by the Genetics Society of America.

  4. Genome-Wide Association Study for Autism Spectrum Disorder in Taiwanese Han Population.

    Directory of Open Access Journals (Sweden)

    Po-Hsiu Kuo

    Full Text Available Autism spectrum disorder (ASD is a neurodevelopmental disorder with strong genetic components. Several recent genome-wide association (GWA studies in Caucasian samples have reported a number of gene regions and loci correlated with the risk of ASD--albeit with very little consensus across studies.A two-stage GWA study was employed to identify common genetic variants for ASD in the Taiwanese Han population. The discovery stage included 315 patients with ASD and 1,115 healthy controls, using the Affymetrix SNP array 6.0 platform for genotyping. Several gene regions were then selected for fine-mapping and top markers were examined in extended samples. Single marker, haplotype, gene-based, and pathway analyses were conducted for associations.Seven SNPs had p-values ranging from 3.4~9.9*10-6, but none reached the genome-wide significant level. Five of them were mapped to three known genes (OR2M4, STYK1, and MNT with significant empirical gene-based p-values in OR2M4 (p = 3.4*10(-5 and MNT (p = 0.0008. Results of the fine-mapping study showed single-marker associations in the GLIS1 (rs12082358 and rs12080993 and NAALADL2 (rs3914502 and rs2222447 genes, and gene-based associations for the OR2M3-OR2T5 (olfactory receptor genes, p = 0.02, and GLIPR1/KRR1 gene regions (p = 0.015. Pathway analyses revealed important pathways for ASD, such as olfactory and G protein-coupled receptors signaling pathways.We reported Taiwanese Han specific susceptibility genes and variants for ASD. However, further replication in other Asian populations is warranted to validate our findings. Investigation in the biological functions of our reported genetic variants might also allow for better understanding on the underlying pathogenesis of autism.

  5. Genome-wide association study of pre-harvest sprouting resistance in Chinese wheat founder parents

    Directory of Open Access Journals (Sweden)

    Yu Lin

    2017-07-01

    Full Text Available Abstract Pre-harvest sprouting (PHS is a major abiotic factor affecting grain weight and quality, and is caused by an early break in seed dormancy. Association mapping (AM is used to detect correlations between phenotypes and genotypes based on linkage disequilibrium (LD in wheat breeding programs. We evaluated seed dormancy in 80 Chinese wheat founder parents in five environments and performed a genome-wide association study using 6,057 markers, including 93 simple sequence repeat (SSR, 1,472 diversity array technology (DArT, and 4,492 single nucleotide polymorphism (SNP markers. The general linear model (GLM and the mixed linear model (MLM were used in this study, and two significant markers (tPt-7980 and wPt-6457 were identified. Both markers were located on Chromosome 1B, with wPt-6457 having been identified in a previously reported chromosomal position. The significantly associated loci contain essential information for cloning genes related to resistance to PHS and can be used in wheat breeding programs.

  6. Genetic Diversity in Jatropha curcas L. Assessed with SSR and SNP Markers

    Directory of Open Access Journals (Sweden)

    Juan M. Montes

    2014-08-01

    Full Text Available Jatropha curcas L. (jatropha is an undomesticated plant that has recently received great attention for its utilization in biofuel production, rehabilitation of wasteland, and rural development. Knowledge of genetic diversity and marker-trait associations is urgently needed for the design of breeding strategies. The main goal of this study was to assess the genetic structure and diversity in jatropha germplasm with co-dominant markers (Simple Sequence Repeats (SSR and Single Nucleotide Polymorphism (SNP in a diverse, worldwide, germplasm panel of 70 accessions. We found a high level of homozygosis in the germplasm that does not correspond to the purely outcrossing mating system assumed to be present in jatropha. We hypothesize that the prevalent mating system of jatropha comprise a high level of self-fertilization and that the outcrossing rate is low. Genetic diversity in accessions from Central America and Mexico was higher than in accession from Africa, Asia, and South America. We identified makers associated with the presence of phorbol esters. We think that the utilization of molecular markers in breeding of jatropha will significantly accelerate the development of improved cultivars.

  7. Targeting 160 candidate genes for blood pressure regulation with a genome-wide genotyping array.

    Directory of Open Access Journals (Sweden)

    Siim Sõber

    2009-06-01

    Full Text Available The outcome of Genome-Wide Association Studies (GWAS has challenged the field of blood pressure (BP genetics as previous candidate genes have not been among the top loci in these scans. We used Affymetrix 500K genotyping data of KORA S3 cohort (n = 1,644; Southern-Germany to address (i SNP coverage in 160 BP candidate genes; (ii the evidence for associations with BP traits in genome-wide and replication data, and haplotype analysis. In total, 160 gene regions (genic region+/-10 kb covered 2,411 SNPs across 11.4 Mb. Marker densities in genes varied from 0 (n = 11 to 0.6 SNPs/kb. On average 52.5% of the HAPMAP SNPs per gene were captured. No evidence for association with BP was obtained for 1,449 tested SNPs. Considerable associations (P50% of HAPMAP SNPs were tagged. In general, genes with higher marker density (>0.2 SNPs/kb revealed a better chance to reach close to significance associations. Although, none of the detected P-values remained significant after Bonferroni correction (P<0.05/2319, P<2.15 x 10(-5, the strength of some detected associations was close to this level: rs10889553 (LEPR and systolic BP (SBP (P = 4.5 x 10(-5 as well as rs10954174 (LEP and diastolic BP (DBP (P = 5.20 x 10(-5. In total, 12 markers in 7 genes (ADRA2A, LEP, LEPR, PTGER3, SLC2A1, SLC4A2, SLC8A1 revealed considerable association (P<10(-3 either with SBP, DBP, and/or hypertension (HYP. None of these were confirmed in replication samples (KORA S4, HYPEST, BRIGHT. However, supportive evidence for the association of rs10889553 (LEPR and rs11195419 (ADRA2A with BP was obtained in meta-analysis across samples stratified either by body mass index, smoking or alcohol consumption. Haplotype analysis highlighted LEPR and PTGER3. In conclusion, the lack of associations in BP candidate genes may be attributed to inadequate marker coverage on the genome-wide arrays, small phenotypic effects of the loci and/or complex interaction with life-style and metabolic parameters.

  8. Genetically contextual effects of smoking on genome wide DNA methylation.

    Science.gov (United States)

    Dogan, Meeshanthini V; Beach, Steven R H; Philibert, Robert A

    2017-09-01

    Smoking is the leading cause of death in the United States. It exerts its effects by increasing susceptibility to a variety of complex disorders among those who smoke, and if pregnant, to their unborn children. In prior efforts to understand the epigenetic mechanisms through which this increased vulnerability is conveyed, a number of investigators have conducted genome wide methylation analyses. Unfortunately, secondary to methodological limitations, these studies were unable to examine methylation in gene regions with significant amounts of genetic variation. Using genome wide genetic and epigenetic data from the Framingham Heart Study, we re-examined the relationship of smoking status to genome wide methylation status. When only methylation status is considered, smoking was significantly associated with differential methylation in 310 genes that map to a variety of biological process and cellular differentiation pathways. However, when SNP effects on the magnitude of smoking associated methylation changes are also considered, cis and trans-interaction effects were noted at a total of 266 and 4353 genes with no marked enrichment for any biological pathways. Furthermore, the SNP variation participating in the significant interaction effects is enriched for loci previously associated with complex medical illnesses. The enlarged scope of the methylome shown to be affected by smoking may better explicate the mediational pathways linking smoking with a myriad of smoking related complex syndromes. Additionally, these results strongly suggest that combined epigenetic and genetic data analyses may be critical for a more complete understanding of the relationship between environmental variables, such as smoking, and pathophysiological outcomes. © 2017 Wiley Periodicals, Inc.

  9. A genome-wide identification of chromosomal regions determining nitrogen use efficiency components in wheat (Triticum aestivum L.).

    Science.gov (United States)

    Cormier, Fabien; Le Gouis, Jacques; Dubreuil, Pierre; Lafarge, Stéphane; Praud, Sébastien

    2014-12-01

    This study identified 333 genomic regions associated to 28 traits related to nitrogen use efficiency in European winter wheat using genome-wide association in a 214-varieties panel experimented in eight environments. Improving nitrogen use efficiency is a key factor to sustainably ensure global production increase. However, while high-throughput screening methods remain at a developmental stage, genetic progress may be mainly driven by marker-assisted selection. The objective of this study was to identify chromosomal regions associated with nitrogen use efficiency-related traits in bread wheat (Triticum aestivum L.) using a genome-wide association approach. Two hundred and fourteen European elite varieties were characterised for 28 traits related to nitrogen use efficiency in eight environments in which two different nitrogen fertilisation levels were tested. The genome-wide association study was carried out using 23,603 SNP with a mixed model for taking into account parentage relationships among varieties. We identified 1,010 significantly associated SNP which defined 333 chromosomal regions associated with at least one trait and found colocalisations for 39 % of these chromosomal regions. A method based on linkage disequilibrium to define the associated region was suggested and discussed with reference to false positive rate. Through a network approach, colocalisations were analysed and highlighted the impact of genomic regions controlling nitrogen status at flowering, precocity, and nitrogen utilisation on global agronomic performance. We were able to explain 40 ± 10 % of the total genetic variation. Numerous colocalisations with previously published genomic regions were observed with such candidate genes as Ppd-D1, Rht-D1, NADH-Gogat, and GSe. We highlighted selection pressure on yield and nitrogen utilisation discussing allele frequencies in associated regions.

  10. Evidence for gene-environment interaction in a genome wide study of isolated, non-syndromic cleft palate

    Science.gov (United States)

    Beaty, Terri H.; Ruczinski, Ingo; Murray, Jeffrey C.; Marazita, Mary L.; Munger, Ronald G.; Hetmanski, Jacqueline B.; Murray, Tanda; Redett, Richard J.; Fallin, M. Daniele; Liang, Kung Yee; Wu, Tao; Patel, Poorav J.; Jin, Sheng C.; Zhang, Tian Xiao; Schwender, Holger; Wu-Chou, Yah Huei; Chen, Philip K; Chong, Samuel S; Cheah, Felicia; Yeow, Vincent; Ye, Xiaoqian; Wang, Hong; Huang, Shangzhi; Jabs, Ethylin W.; Shi, Bing; Wilcox, Allen J.; Lie, Rolv T.; Jee, Sun Ha; Christensen, Kaare; Doheny, Kimberley F.; Pugh, Elizabeth W.; Ling, Hua; Scott, Alan F.

    2011-01-01

    Non-syndromic cleft palate (CP) is a common birth defect with a complex and heterogeneous etiology involving both genetic and environmental risk factors. We conducted a genome wide association study (GWAS) using 550 case-parent trios, ascertained through a CP case collected in an international consortium. Family based association tests of single nucleotide polymorphisms (SNP) and three common maternal exposures (maternal smoking, alcohol consumption and multivitamin supplementation) were used in a combined 2 df test for gene (G) and gene-environment (G×E) interaction simultaneously, plus a separate 1 df test for G×E interaction alone. Conditional logistic regression models were used to estimate effects on risk to exposed and unexposed children. While no SNP achieved genome wide significance when considered alone, markers in several genes attained or approached genome wide significance when G×E interaction was included. Among these, MLLT3 and SMC2 on chromosome 9 showed multiple SNPs resulting in increased risk if the mother consumed alcohol during the peri-conceptual period (3 months prior to conception through the first trimester). TBK1 on chr. 12 and ZNF236 on chr. 18 showed multiple SNPs associated with higher risk of CP in the presence of maternal smoking. Additional evidence of reduced risk due to G×E interaction in the presence of multivitamin supplementation was observed for SNPs in BAALC on chr. 8. These results emphasize the need to consider G×E interaction when searching for genes influencing risk to complex and heterogeneous disorders, such as non-syndromic CP. PMID:21618603

  11. Genome-Wide Association Study of Piglet Uniformity and Farrowing Interval

    Directory of Open Access Journals (Sweden)

    Yuan Wang

    2017-11-01

    Full Text Available Piglet uniformity (PU and farrowing interval (FI are important reproductive traits related to production and economic profits in the pig industry. However, the genetic architecture of the longitudinal trends of reproductive traits still remains elusive. Herein, we performed a genome-wide association study (GWAS to detect potential genetic variation and candidate genes underlying the phenotypic records at different parities for PU and FI in a population of 884 Large White pigs. In total, 12 significant SNPs were detected on SSC1, 3, 4, 9, and 14, which collectively explained 1–1.79% of the phenotypic variance for PU from parity 1 to 4, and 2.58–4.11% for FI at different stages. Of these, seven SNPs were located within 16 QTL regions related to swine reproductive traits. One QTL region was associated with birth body weight (related to PU and contained the peak SNP MARC0040730, and another was associated with plasma FSH concentration (related to FI and contained the SNP MARC0031325. Finally, some positional candidate genes for PU and FI were identified because of their roles in prenatal skeletal muscle development, fetal energy substrate, pre-implantation, and the expression of mammary gland epithelium. Identification of novel variants and candidate genes will greatly advance our understanding of the genetic mechanisms of PU and FI, and suggest a specific opportunity for improving marker assisted selection or genomic selection in pigs.

  12. Genome-Wide Association Analysis of Age-Dependent Egg Weights in Chickens

    Directory of Open Access Journals (Sweden)

    Zhuang Liu

    2018-04-01

    Full Text Available Egg weight (EW is an economically-important trait and displays a consecutive increase with the hen's age. Because extremely large eggs cause a range of problems in the poultry industry, we performed a genome-wide association study (GWAS in order to identify genomic variations that are associated with EW. We utilized the Affymetrix 600 K high density SNP array in a population of 1,078 hens at seven time points from day at first egg to 80 weeks age (EW80. Results reveal that a 90 Kb genomic region (169.42 Mb ~ 169.51 Mb in GGA1 is significantly associated with EW36 and is also potentially associated with egg weight at 28, 56, and 66 week of age. The leading SNP could account for 3.66% of the phenotypic variation, while two promising genes (DLEU7 and MIR15A can be mapped to this narrow significant region and may affect EW in a pleiotropic manner. In addition, one gene (CECR2 on GGA1 and two genes (MEIS1 and SPRED2 on GGA3, which involved in the processes of embryogenesis and organogenesis, were also considered to be candidates related to first egg weight (FEW and EW56, respectively. Findings in our study could provide worthy theoretical basis to generate eggs of ideal size based on marker assisted breeding selection.

  13. Genome-wide association study of rust traits in orchardgrass using SLAF-seq technology.

    Science.gov (United States)

    Zeng, Bing; Yan, Haidong; Liu, Xinchun; Zang, Wenjing; Zhang, Ailing; Zhou, Sifan; Huang, Linkai; Liu, Jinping

    2017-01-01

    While orchardgrass ( Dactylis glomerata L.) is a well-known perennial forage species, rust diseases cause serious reductions in the yield and quality of orchardgrass; however, genetic mechanisms of rust resistance are not well understood in orchardgrass. In this study, a genome-wide association study (GWAS) was performed using specific-locus amplified fragment sequencing (SLAF-seq) technology in orchardgrass. A total of 2,334,889 SLAF tags were generated to produce 2,309,777 SNPs. ADMIXTURE analysis revealed unstructured subpopulations for 33 accessions, indicating that this orchardgrass population could be used for association analysis. Linkage disequilibrium (LD) analysis revealed an average r 2 of 0.4 across all SNP pairs, indicating a high extent of LD in these samples. Through GWAS, a total of 4,604 SNPs were found to be significantly ( P  rust trait. The bulk analysis discovered a number of 5,211 SNPs related to rust trait. Two candidate genes, including cytochrome P450, and prolamin were implicated in disease resistance through prediction of functional genes surrounding each high-quality SNP ( P  rust traits based on GWAS analysis and bulk analysis. The large number of SNPs associated with rust traits and these two candidate genes may provide the basis for further research on rust resistance mechanisms and marker-assisted selection (MAS) for rust-resistant lineages.

  14. Characterization of Insect Resistance Loci in the USDA Soybean Germplasm Collection Using Genome-Wide Association Studies

    Directory of Open Access Journals (Sweden)

    Hao-Xun Chang

    2017-05-01

    Full Text Available Management of insects that cause economic damage to yields of soybean mainly rely on insecticide applications. Sources of resistance in soybean plant introductions (PIs to different insect pests have been reported, and some of these sources, like for the soybean aphid (SBA, have been used to develop resistant soybean cultivars. With the availability of SoySNP50K and the statistical power of genome-wide association studies, we integrated phenotypic data for beet armyworm, Mexican bean beetle (MBB, potato leafhopper (PLH, SBA, soybean looper (SBL, velvetbean caterpillar (VBC, and chewing damage caused by unspecified insects for a comprehensive understanding of insect resistance in the United States Department of Agriculture Soybean Germplasm Collection. We identified significant single nucleotide (SNP polymorphic markers for MBB, PLH, SBL, and VBC, and we highlighted several leucine-rich repeat-containing genes and myeloblastosis transcription factors within the high linkage disequilibrium region surrounding significant SNP markers. Specifically for soybean resistance to PLH, we found the PLH locus is close but distinct to a locus for soybean pubescence density on chromosome 12. The results provide genetic support that pubescence density may not directly link to PLH resistance. This study offers a novel insight of soybean resistance to four insect pests and reviews resistance mapping studies for major soybean insects.

  15. Genome-wide association and biological pathway analysis for milk-fat composition in Danish Holstein and Danish Jersey cattle

    DEFF Research Database (Denmark)

    Buitenhuis, Bart; Janss, Luc L G; Poulsen, Nina Aagaard

    2014-01-01

    provide new possibilities to change the milk fat composition by selective breeding. In this study a genome wide association scan (GWAS) in the DH and DJ was performed for a detailed milk fatty acid (FA) profile using the HD bovine SNP array and subsequently a biological pathway analysis based on the SNP...

  16. Detection of gene-environment interaction in pedigree data using genome-wide genotypes

    NARCIS (Netherlands)

    Nivard, Michel G.; Middeldorp, Christel M.; Lubke, Gitta; Hottenga, Jouke-Jan; Abdellaoui, Abdel; Boomsma, Dorret I.; Dolan, Conor V.

    2016-01-01

    Heritability may be estimated using phenotypic data collected in relatives or in distantly related individuals using genome-wide single nucleotide polymorphism (SNP) data. We combined these approaches by re-parameterizing the model proposed by Zaitlen et al and extended this model to include

  17. Genome-Wide Association Uncovers Shared Genetic Effects Among Personality Traits and Mood States

    NARCIS (Netherlands)

    Luciano, Michelle; Huffman, Jennifer E.; Arias-Vásquez, Alejandro; Vinkhuyzen, Anna A. E.; Middeldorp, Christel M.; Giegling, Ina; Payton, Antony; Davies, Gail; Zgaga, Lina; Janzing, Joost; Ke, Xiayi; Galesloot, Tessel; Hartmann, Annette M.; Ollier, William; Tenesa, Albert; Hayward, Caroline; Verhagen, Maaike; Montgomery, Grant W.; Hottenga, Jouke-Jan; Konte, Bettina; Starr, John M.; Vitart, Veronique; Vos, Pieter E.; Madden, Pamela A. F.; Willemsen, Gonneke; Konnerth, Heike; Horan, Michael A.; Porteous, David J.; Campbell, Harry; Vermeulen, Sita H.; Heath, Andrew C.; Wright, Alan; Polasek, Ozren; Kovacevic, Sanja B.; Hastie, Nicholas D.; Franke, Barbara; Boomsma, Dorret I.; Martin, Nicholas G.; Rujescu, Dan; Wilson, James F.; Buitelaar, Jan; Pendleton, Neil; Rudan, Igor; Deary, Ian J.

    2012-01-01

    Measures of personality and psychological distress are correlated and exhibit genetic covariance. We conducted univariate genome-wide SNP (similar to 2.5 million) and gene-based association analyses of these traits and examined the overlap in results across traits, including a prediction analysis of

  18. Genome-wide association study for milking speed in French Holstein cows

    DEFF Research Database (Denmark)

    Marete, Andrew Gitahi; Sahana, Goutam; Fritz, Sebastian

    2018-01-01

    Using a combination of data from the BovineSNP50 BeadChip SNP array (Illumina, San Diego, CA) and a EuroGenomics (Amsterdam, the Netherlands) custom single nucleotide polymorphism (SNP) chip with SNP pre-selected from whole genome sequence data, we carried out an association study of milking speed...... associated with milking speed. As clinical mastitis and somatic cell score have an unfavorable genetic correlation with milking speed, we tested whether the most significant SNP on these 22 chromosomes associated with milking speed were also associated with clinical mastitis or somatic cell score. Nine...... hundred seventy-one genome-wide significant SNP were associated with milking speed. Of these, 86 were associated with clinical mastitis and 198 with somatic cell score. The most significant association signals for milking speed were observed on chromosomes 7, 8, 10, 14, and 18. The most significant signal...

  19. Evaluation of multiple approaches to identify genome-wide polymorphisms in closely related genotypes of sweet cherry (Prunus avium L.

    Directory of Open Access Journals (Sweden)

    Seanna Hewitt

    Full Text Available Identification of genetic polymorphisms and subsequent development of molecular markers is important for marker assisted breeding of superior cultivars of economically important species. Sweet cherry (Prunus avium L. is an economically important non-climacteric tree fruit crop in the Rosaceae family and has undergone a genetic bottleneck due to breeding, resulting in limited genetic diversity in the germplasm that is utilized for breeding new cultivars. Therefore, it is critical to recognize the best platforms for identifying genome-wide polymorphisms that can help identify, and consequently preserve, the diversity in a genetically constrained species. For the identification of polymorphisms in five closely related genotypes of sweet cherry, a gel-based approach (TRAP, reduced representation sequencing (TRAPseq, a 6k cherry SNParray, and whole genome sequencing (WGS approaches were evaluated in the identification of genome-wide polymorphisms in sweet cherry cultivars. All platforms facilitated detection of polymorphisms among the genotypes with variable efficiency. In assessing multiple SNP detection platforms, this study has demonstrated that a combination of appropriate approaches is necessary for efficient polymorphism identification, especially between closely related cultivars of a species. The information generated in this study provides a valuable resource for future genetic and genomic studies in sweet cherry, and the insights gained from the evaluation of multiple approaches can be utilized for other closely related species with limited genetic diversity in the breeding germplasm. Keywords: Polymorphisms, Prunus avium, Next-generation sequencing, Target region amplification polymorphism (TRAP, Genetic diversity, SNParray, Reduced representation sequencing, Whole genome sequencing (WGS

  20. Genome-Wide Association Study Reveals Novel Genes Associated with Culm Cellulose Content in Bread Wheat (Triticum aestivum, L.

    Directory of Open Access Journals (Sweden)

    Simerjeet Kaur

    2017-11-01

    Full Text Available Plant cell wall formation is a complex, coordinated and developmentally regulated process. Cellulose is the most dominant constituent of plant cell walls. Because of its paracrystalline structure, cellulose is the main determinant of mechanical strength of plant tissues. As the most abundant polysaccharide on earth, it is also the focus of cellulosic biofuel industry. To reduce culm lodging in wheat and for improved ethanol production, delineation of the variation for stem cellulose content could prove useful. We present results on the analysis of the stem cellulose content of 288 diverse wheat accessions and its genome-wide association study (GWAS. Cellulose concentration ranged from 35 to 52% (w/w. Cellulose content was normally distributed in the accessions around a mean and median of 45% (w/w. Genome-wide marker-trait association study using 21,073 SNPs helped identify nine SNPs that were associated (p < 1E-05 with cellulose content. Four strongly associated (p < 8.17E-05 SNP markers were linked to wheat unigenes, which included β-tubulin, Auxin-induced protein 5NG4, and a putative transmembrane protein of unknown function. These genes may be directly or indirectly involved in the formation of cellulose in wheat culms. GWAS results from this study have the potential for genetic manipulation of cellulose content in bread wheat and other small grain cereals to enhance culm strength and improve biofuel production.

  1. Identification of a sex-linked SNP marker in the salmon louse (Lepeophtheirus salmonis using RAD sequencing.

    Directory of Open Access Journals (Sweden)

    Stephen N Carmichael

    Full Text Available The salmon louse (Lepeophtheirus salmonis (Krøyer, 1837 is a parasitic copepod that can, if untreated, cause considerable damage to Atlantic salmon (Salmo salar Linnaeus, 1758 and incurs significant costs to the Atlantic salmon mariculture industry. Salmon lice are gonochoristic and normally show sex ratios close to 1:1. While this observation suggests that sex determination in salmon lice is genetic, with only minor environmental influences, the mechanism of sex determination in the salmon louse is unknown. This paper describes the identification of a sex-linked Single Nucleotide Polymorphism (SNP marker, providing the first evidence for a genetic mechanism of sex determination in the salmon louse. Restriction site-associated DNA sequencing (RAD-seq was used to isolate SNP markers in a laboratory-maintained salmon louse strain. A total of 85 million raw Illumina 100 base paired-end reads produced 281,838 unique RAD-tags across 24 unrelated individuals. RAD marker Lsa101901 showed complete association with phenotypic sex for all individuals analysed, being heterozygous in females and homozygous in males. Using an allele-specific PCR assay for genotyping, this SNP association pattern was further confirmed for three unrelated salmon louse strains, displaying complete association with phenotypic sex in a total of 96 genotyped individuals. The marker Lsa101901 was located in the coding region of the prohibitin-2 gene, which showed a sex-dependent differential expression, with mRNA levels determined by RT-qPCR about 1.8-fold higher in adult female than adult male salmon lice. This study's observations of a novel sex-linked SNP marker are consistent with sex determination in the salmon louse being genetic and following a female heterozygous system. Marker Lsa101901 provides a tool to determine the genetic sex of salmon lice, and could be useful in the development of control strategies.

  2. Genome-wide association study of pathological gambling.

    Science.gov (United States)

    Lang, M; Leménager, T; Streit, F; Fauth-Bühler, M; Frank, J; Juraeva, D; Witt, S H; Degenhardt, F; Hofmann, A; Heilmann-Heimbach, S; Kiefer, F; Brors, B; Grabe, H-J; John, U; Bischof, A; Bischof, G; Völker, U; Homuth, G; Beutel, M; Lind, P A; Medland, S E; Slutske, W S; Martin, N G; Völzke, H; Nöthen, M M; Meyer, C; Rumpf, H-J; Wurst, F M; Rietschel, M; Mann, K F

    2016-08-01

    Pathological gambling is a behavioural addiction with negative economic, social, and psychological consequences. Identification of contributing genes and pathways may improve understanding of aetiology and facilitate therapy and prevention. Here, we report the first genome-wide association study of pathological gambling. Our aims were to identify pathways involved in pathological gambling, and examine whether there is a genetic overlap between pathological gambling and alcohol dependence. Four hundred and forty-five individuals with a diagnosis of pathological gambling according to the Diagnostic and Statistical Manual of Mental Disorders were recruited in Germany, and 986 controls were drawn from a German general population sample. A genome-wide association study of pathological gambling comprising single marker, gene-based, and pathway analyses, was performed. Polygenic risk scores were generated using data from a German genome-wide association study of alcohol dependence. No genome-wide significant association with pathological gambling was found for single markers or genes. Pathways for Huntington's disease (P-value=6.63×10(-3)); 5'-adenosine monophosphate-activated protein kinase signalling (P-value=9.57×10(-3)); and apoptosis (P-value=1.75×10(-2)) were significant. Polygenic risk score analysis of the alcohol dependence dataset yielded a one-sided nominal significant P-value in subjects with pathological gambling, irrespective of comorbid alcohol dependence status. The present results accord with previous quantitative formal genetic studies which showed genetic overlap between non-substance- and substance-related addictions. Furthermore, pathway analysis suggests shared pathology between Huntington's disease and pathological gambling. This finding is consistent with previous imaging studies. Copyright © 2016 Elsevier Masson SAS. All rights reserved.

  3. Genome-Wide Association Studies of Anthracnose and Angular Leaf Spot Resistance in Common Bean (Phaseolus vulgaris L..

    Directory of Open Access Journals (Sweden)

    Juliana Morini Küpper Cardoso Perseguini

    Full Text Available The common bean (Phaseolus vulgaris L. is the world's most important legume for human consumption. Anthracnose (ANT; Colletotrichum lindemuthianum and angular leaf spot (ALS; Pseudocercospora griseola are complex diseases that cause major yield losses in common bean. Depending on the cultivar and environmental conditions, anthracnose and angular leaf spot infections can reduce crop yield drastically. This study aimed to estimate linkage disequilibrium levels and identify quantitative resistance loci (QRL controlling resistance to both ANT and ALS diseases of 180 accessions of common bean using genome-wide association analysis. A randomized complete block design with four replicates was performed for the ANT and ALS experiments, with four plants per genotype in each replicate. Association mapping analyses were performed for ANT and ALS using a mixed linear model approach implemented in TASSEL. A total of 17 and 11 significant statistically associations involving SSRs were detected for ANT and ALS resistance loci, respectively. Using SNPs, 21 and 17 significant statistically associations were obtained for ANT and angular ALS, respectively, providing more associations with this marker. The SSR-IAC167 and PvM95 markers, both located on chromosome Pv03, and the SNP scaffold00021_89379, were associated with both diseases. The other markers were distributed across the entire common bean genome, with chromosomes Pv03 and Pv08 showing the greatest number of loci associated with ANT resistance. The chromosome Pv04 was the most saturated one, with six markers associated with ALS resistance. The telomeric region of this chromosome showed four markers located between approximately 2.5 Mb and 4.4 Mb. Our results demonstrate the great potential of genome-wide association studies to identify QRLs related to ANT and ALS in common bean. The results indicate a quantitative and complex inheritance pattern for both diseases in common bean. Our findings will

  4. Genome-Wide Association Studies of Anthracnose and Angular Leaf Spot Resistance in Common Bean (Phaseolus vulgaris L.).

    Science.gov (United States)

    Perseguini, Juliana Morini Küpper Cardoso; Oblessuc, Paula Rodrigues; Rosa, João Ricardo Bachega Feijó; Gomes, Kleber Alves; Chiorato, Alisson Fernando; Carbonell, Sérgio Augusto Morais; Garcia, Antonio Augusto Franco; Vianello, Rosana Pereira; Benchimol-Reis, Luciana Lasry

    2016-01-01

    The common bean (Phaseolus vulgaris L.) is the world's most important legume for human consumption. Anthracnose (ANT; Colletotrichum lindemuthianum) and angular leaf spot (ALS; Pseudocercospora griseola) are complex diseases that cause major yield losses in common bean. Depending on the cultivar and environmental conditions, anthracnose and angular leaf spot infections can reduce crop yield drastically. This study aimed to estimate linkage disequilibrium levels and identify quantitative resistance loci (QRL) controlling resistance to both ANT and ALS diseases of 180 accessions of common bean using genome-wide association analysis. A randomized complete block design with four replicates was performed for the ANT and ALS experiments, with four plants per genotype in each replicate. Association mapping analyses were performed for ANT and ALS using a mixed linear model approach implemented in TASSEL. A total of 17 and 11 significant statistically associations involving SSRs were detected for ANT and ALS resistance loci, respectively. Using SNPs, 21 and 17 significant statistically associations were obtained for ANT and angular ALS, respectively, providing more associations with this marker. The SSR-IAC167 and PvM95 markers, both located on chromosome Pv03, and the SNP scaffold00021_89379, were associated with both diseases. The other markers were distributed across the entire common bean genome, with chromosomes Pv03 and Pv08 showing the greatest number of loci associated with ANT resistance. The chromosome Pv04 was the most saturated one, with six markers associated with ALS resistance. The telomeric region of this chromosome showed four markers located between approximately 2.5 Mb and 4.4 Mb. Our results demonstrate the great potential of genome-wide association studies to identify QRLs related to ANT and ALS in common bean. The results indicate a quantitative and complex inheritance pattern for both diseases in common bean. Our findings will contribute to more

  5. Candidate SNP markers of aggressiveness-related complications and comorbidities of genetic diseases are predicted by a significant change in the affinity of TATA-binding protein for human gene promoters.

    Science.gov (United States)

    Chadaeva, Irina V; Ponomarenko, Mikhail P; Rasskazov, Dmitry A; Sharypova, Ekaterina B; Kashina, Elena V; Matveeva, Marina Yu; Arshinova, Tatjana V; Ponomarenko, Petr M; Arkova, Olga V; Bondar, Natalia P; Savinkova, Ludmila K; Kolchanov, Nikolay A

    2016-12-28

    Aggressiveness in humans is a hereditary behavioral trait that mobilizes all systems of the body-first of all, the nervous and endocrine systems, and then the respiratory, vascular, muscular, and others-e.g., for the defense of oneself, children, family, shelter, territory, and other possessions as well as personal interests. The level of aggressiveness of a person determines many other characteristics of quality of life and lifespan, acting as a stress factor. Aggressive behavior depends on many parameters such as age, gender, diseases and treatment, diet, and environmental conditions. Among them, genetic factors are believed to be the main parameters that are well-studied at the factual level, but in actuality, genome-wide studies of aggressive behavior appeared relatively recently. One of the biggest projects of the modern science-1000 Genomes-involves identification of single nucleotide polymorphisms (SNPs), i.e., differences of individual genomes from the reference genome. SNPs can be associated with hereditary diseases, their complications, comorbidities, and responses to stress or a drug. Clinical comparisons between cohorts of patients and healthy volunteers (as a control) allow for identifying SNPs whose allele frequencies significantly separate them from one another as markers of the above conditions. Computer-based preliminary analysis of millions of SNPs detected by the 1000 Genomes project can accelerate clinical search for SNP markers due to preliminary whole-genome search for the most meaningful candidate SNP markers and discarding of neutral and poorly substantiated SNPs. Here, we combine two computer-based search methods for SNPs (that alter gene expression) {i} Web service SNP_TATA_Comparator (DNA sequence analysis) and {ii} PubMed-based manual search for articles on aggressiveness using heuristic keywords. Near the known binding sites for TATA-binding protein (TBP) in human gene promoters, we found aggressiveness-related candidate SNP markers

  6. Genome-Wide Association Study of Major Agronomic Traits Related to Domestication in Peanut

    Directory of Open Access Journals (Sweden)

    Xingguo Zhang

    2017-09-01

    Full Text Available Peanut (Arachis hypogaea consists of two subspecies, hypogaea and fastigiata, and has been cultivated worldwide for hundreds of years. Here, 158 peanut accessions were selected to dissect the molecular footprint of agronomic traits related to domestication using specific-locus amplified fragment sequencing (SLAF-seq method. Then, a total of 17,338 high-quality single nucleotide polymorphisms (SNPs in the whole peanut genome were revealed. Eleven agronomic traits in 158 peanut accessions were subsequently analyzed using genome-wide association studies (GWAS. Candidate genes responsible for corresponding traits were then analyzed in genomic regions surrounding the peak SNPs, and 1,429 genes were found within 200 kb windows centerd on GWAS-identified peak SNPs related to domestication. Highly differentiated genomic regions were observed between hypogaea and fastigiata accessions using FST values and sequence diversity (π ratios. Among the 1,429 genes, 662 were located on chromosome A3, suggesting the presence of major selective sweeps caused by artificial selection during long domestication. These findings provide a promising insight into the complicated genetic architecture of domestication-related traits in peanut, and reveal whole-genome SNP markers of beneficial candidate genes for marker-assisted selection (MAS in future breeding programs.

  7. Genome-Wide Association of Stem Water Soluble Carbohydrates in Bread Wheat.

    Science.gov (United States)

    Dong, Yan; Liu, Jindong; Zhang, Yan; Geng, Hongwei; Rasheed, Awais; Xiao, Yonggui; Cao, Shuanghe; Fu, Luping; Yan, Jun; Wen, Weie; Zhang, Yong; Jing, Ruilian; Xia, Xianchun; He, Zhonghu

    2016-01-01

    Water soluble carbohydrates (WSC) in stems play an important role in buffering grain yield in wheat against biotic and abiotic stresses; however, knowledge of genes controlling WSC is very limited. We conducted a genome-wide association study (GWAS) using a high-density 90K SNP array to better understand the genetic basis underlying WSC, and to explore marker-based breeding approaches. WSC was evaluated in an association panel comprising 166 Chinese bread wheat cultivars planted in four environments. Fifty two marker-trait associations (MTAs) distributed across 23 loci were identified for phenotypic best linear unbiased estimates (BLUEs), and 11 MTAs were identified in two or more environments. Liner regression showed a clear dependence of WSC BLUE scores on numbers of favorable (increasing WSC content) and unfavorable alleles (decreasing WSC), indicating that genotypes with higher numbers of favorable or lower numbers of unfavorable alleles had higher WSC content. In silico analysis of flanking sequences of trait-associated SNPs revealed eight candidate genes related to WSC content grouped into two categories based on the type of encoding proteins, namely, defense response proteins and proteins triggered by environmental stresses. The identified SNPs and candidate genes related to WSC provide opportunities for breeding higher WSC wheat cultivars.

  8. Genome-Wide Association of Stem Water Soluble Carbohydrates in Bread Wheat.

    Directory of Open Access Journals (Sweden)

    Yan Dong

    Full Text Available Water soluble carbohydrates (WSC in stems play an important role in buffering grain yield in wheat against biotic and abiotic stresses; however, knowledge of genes controlling WSC is very limited. We conducted a genome-wide association study (GWAS using a high-density 90K SNP array to better understand the genetic basis underlying WSC, and to explore marker-based breeding approaches. WSC was evaluated in an association panel comprising 166 Chinese bread wheat cultivars planted in four environments. Fifty two marker-trait associations (MTAs distributed across 23 loci were identified for phenotypic best linear unbiased estimates (BLUEs, and 11 MTAs were identified in two or more environments. Liner regression showed a clear dependence of WSC BLUE scores on numbers of favorable (increasing WSC content and unfavorable alleles (decreasing WSC, indicating that genotypes with higher numbers of favorable or lower numbers of unfavorable alleles had higher WSC content. In silico analysis of flanking sequences of trait-associated SNPs revealed eight candidate genes related to WSC content grouped into two categories based on the type of encoding proteins, namely, defense response proteins and proteins triggered by environmental stresses. The identified SNPs and candidate genes related to WSC provide opportunities for breeding higher WSC wheat cultivars.

  9. Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies.

    Directory of Open Access Journals (Sweden)

    Clive J Hoggart

    2008-07-01

    Full Text Available Testing one SNP at a time does not fully realise the potential of genome-wide association studies to identify multiple causal variants, which is a plausible scenario for many complex diseases. We show that simultaneous analysis of the entire set of SNPs from a genome-wide study to identify the subset that best predicts disease outcome is now feasible, thanks to developments in stochastic search methods. We used a Bayesian-inspired penalised maximum likelihood approach in which every SNP can be considered for additive, dominant, and recessive contributions to disease risk. Posterior mode estimates were obtained for regression coefficients that were each assigned a prior with a sharp mode at zero. A non-zero coefficient estimate was interpreted as corresponding to a significant SNP. We investigated two prior distributions and show that the normal-exponential-gamma prior leads to improved SNP selection in comparison with single-SNP tests. We also derived an explicit approximation for type-I error that avoids the need to use permutation procedures. As well as genome-wide analyses, our method is well-suited to fine mapping with very dense SNP sets obtained from re-sequencing and/or imputation. It can accommodate quantitative as well as case-control phenotypes, covariate adjustment, and can be extended to search for interactions. Here, we demonstrate the power and empirical type-I error of our approach using simulated case-control data sets of up to 500 K SNPs, a real genome-wide data set of 300 K SNPs, and a sequence-based dataset, each of which can be analysed in a few hours on a desktop workstation.

  10. A genome-wide association study of aging.

    Science.gov (United States)

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W; Garcia, Melissa E; Kaplan, Robert C; Kumari, Meena; Lunetta, Kathryn L; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J; Biffar, Reiner; Buchman, Aron S; Boerwinkle, Eric; Couper, David; De Jager, Philip L; Evans, Denis A; Harris, Tamara B; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J; Lohman, Kurt K; Lutsey, Pamela L; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M; Reiman, Eric M; Rotter, Jerome I; Seshadri, Sudha; Shardell, Michelle D; Smith, Albert V; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M Carola; Bandinelli, Stefania; Baumeister, Sebastian E; Bennett, David A; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M; Newman, Anne B; Tiemeier, Henning; Franceschini, Nora

    2011-11-01

    Human longevity and healthy aging show moderate heritability (20%-50%). We conducted a meta-analysis of genome-wide association studies from 9 studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for 2 outcomes: (1) all-cause mortality, and (2) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10(-8)). We found 14 independent SNPs that predicted risk of death, and 8 SNPs that predicted event-free survival (p < 10(-5)). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer's disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity. Copyright © 2011 Elsevier Inc. All rights reserved.

  11. Genome-wide comparative analysis of four Indian Drosophila species.

    Science.gov (United States)

    Mohanty, Sujata; Khanna, Radhika

    2017-12-01

    Comparative analysis of multiple genomes of closely or distantly related Drosophila species undoubtedly creates excitement among evolutionary biologists in exploring the genomic changes with an ecology and evolutionary perspective. We present herewith the de novo assembled whole genome sequences of four Drosophila species, D. bipectinata, D. takahashii, D. biarmipes and D. nasuta of Indian origin using Next Generation Sequencing technology on an Illumina platform along with their detailed assembly statistics. The comparative genomics analysis, e.g. gene predictions and annotations, functional and orthogroup analysis of coding sequences and genome wide SNP distribution were performed. The whole genome of Zaprionus indianus of Indian origin published earlier by us and the genome sequences of previously sequenced 12 Drosophila species available in the NCBI database were included in the analysis. The present work is a part of our ongoing genomics project of Indian Drosophila species.

  12. Susceptibility to Chronic Mucus Hypersecretion, a Genome Wide Association Study

    DEFF Research Database (Denmark)

    Dijkstra, Akkelies E; Smolonska, Joanna; van den Berge, Maarten

    2014-01-01

    by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years). Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism...... (SNP). RESULTS: A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10(-6), OR = 1.17), located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1) on chromosome 3. The risk allele (G) was associated with higher mRNA expression...... of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA) study of CMH in Caucasian populations. METHODS: GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed...

  13. A Genome-wide Combinatorial Strategy Dissects Complex Genetic Architecture of Seed Coat Color in Chickpea.

    Science.gov (United States)

    Bajaj, Deepak; Das, Shouvik; Upadhyaya, Hari D; Ranjan, Rajeev; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C L Laxmipathi; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K; Parida, Swarup K

    2015-01-01

    The study identified 9045 high-quality SNPs employing both genome-wide GBS- and candidate gene-based SNP genotyping assays in 172, including 93 cultivated (desi and kabuli) and 79 wild chickpea accessions. The GWAS in a structured population of 93 sequenced accessions detected 15 major genomic loci exhibiting significant association with seed coat color. Five seed color-associated major genomic loci underlying robust QTLs mapped on a high-density intra-specific genetic linkage map were validated by QTL mapping. The integration of association and QTL mapping with gene haplotype-specific LD mapping and transcript profiling identified novel allelic variants (non-synonymous SNPs) and haplotypes in a MATE secondary transporter gene regulating light/yellow brown and beige seed coat color differentiation in chickpea. The down-regulation and decreased transcript expression of beige seed coat color-associated MATE gene haplotype was correlated with reduced proanthocyanidins accumulation in the mature seed coats of beige than light/yellow brown seed colored desi and kabuli accessions for their coloration/pigmentation. This seed color-regulating MATE gene revealed strong purifying selection pressure primarily in LB/YB seed colored desi and wild Cicer reticulatum accessions compared with the BE seed colored kabuli accessions. The functionally relevant molecular tags identified have potential to decipher the complex transcriptional regulatory gene function of seed coat coloration and for understanding the selective sweep-based seed color trait evolutionary pattern in cultivated and wild accessions during chickpea domestication. The genome-wide integrated approach employed will expedite marker-assisted genetic enhancement for developing cultivars with desirable seed coat color types in chickpea.

  14. Genomic selection: genome-wide prediction in plant improvement.

    Science.gov (United States)

    Desta, Zeratsion Abera; Ortiz, Rodomiro

    2014-09-01

    Association analysis is used to measure relations between markers and quantitative trait loci (QTL). Their estimation ignores genes with small effects that trigger underpinning quantitative traits. By contrast, genome-wide selection estimates marker effects across the whole genome on the target population based on a prediction model developed in the training population (TP). Whole-genome prediction models estimate all marker effects in all loci and capture small QTL effects. Here, we review several genomic selection (GS) models with respect to both the prediction accuracy and genetic gain from selection. Phenotypic selection or marker-assisted breeding protocols can be replaced by selection, based on whole-genome predictions in which phenotyping updates the model to build up the prediction accuracy. Copyright © 2014 Elsevier Ltd. All rights reserved.

  15. Genome-wide association for heifer reproduction and calf performance traits in beef cattle.

    Science.gov (United States)

    Akanno, Everestus C; Plastow, Graham; Fitzsimmons, Carolyn; Miller, Stephen P; Baron, Vern; Ominski, Kimberly; Basarab, John A

    2015-12-01

    The aim of this study was to identify SNP markers that associate with variation in beef heifer reproduction and performance of their calves. A genome-wide association study was performed by means of the generalized quasi-likelihood score (GQLS) method using heifer genotypes from the BovineSNP50 BeadChip and estimated breeding values for pre-breeding body weight (PBW), pregnancy rate (PR), calving difficulty (CD), age at first calving (AFC), calf birth weight (BWT), calf weaning weight (WWT), and calf pre-weaning average daily gain (ADG). Data consisted of 785 replacement heifers from three Canadian research herds, namely Brandon Research Centre, Brandon, Manitoba, University of Alberta Roy Berg Kinsella Ranch, Kinsella, Alberta, and Lacombe Research Centre, Lacombe, Alberta. After applying a false discovery rate correction at a 5% significance level, a total of 4, 3, 3, 9, 6, 2, and 1 SNPs were significantly associated with PBW, PR, CD, AFC, BWT, WWT, and ADG, respectively. These SNPs were located on chromosomes 1, 5-7, 9, 13-16, 19-21, 24, 25, and 27-29. Chromosomes 1, 5, and 24 had SNPs with pleiotropic effects. New significant SNPs that impact functional traits were detected, many of which have not been previously reported. The results of this study support quantitative genetic studies related to the inheritance of these traits, and provides new knowledge regarding beef cattle quantitative trait loci effects. The identification of these SNPs provides a starting point to identify genes affecting heifer reproduction traits and performance of their calves (BWT, WWT, and ADG). They also contribute to a better understanding of the biology underlying these traits and will be potentially useful in marker- and genome-assisted selection and management.

  16. Genome-Wide Association Study of Metabolic Traits Reveals Novel Gene-Metabolite-Disease Links

    Science.gov (United States)

    Nicholls, Andrew W.; Salek, Reza M.; Marques-Vidal, Pedro; Morya, Edgard; Sameshima, Koichi; Montoliu, Ivan; Da Silva, Laeticia; Collino, Sebastiano; Martin, François-Pierre; Rezzi, Serge; Steinbeck, Christoph; Waterworth, Dawn M.; Waeber, Gérard; Vollenweider, Peter; Beckmann, Jacques S.; Le Coutre, Johannes; Mooser, Vincent; Bergmann, Sven; Genick, Ulrich K.; Kutalik, Zoltán

    2014-01-01

    Metabolic traits are molecular phenotypes that can drive clinical phenotypes and may predict disease progression. Here, we report results from a metabolome- and genome-wide association study on 1H-NMR urine metabolic profiles. The study was conducted within an untargeted approach, employing a novel method for compound identification. From our discovery cohort of 835 Caucasian individuals who participated in the CoLaus study, we identified 139 suggestively significant (P<5×10−8) and independent associations between single nucleotide polymorphisms (SNP) and metabolome features. Fifty-six of these associations replicated in the TasteSensomics cohort, comprising 601 individuals from São Paulo of vastly diverse ethnic background. They correspond to eleven gene-metabolite associations, six of which had been previously identified in the urine metabolome and three in the serum metabolome. Our key novel findings are the associations of two SNPs with NMR spectral signatures pointing to fucose (rs492602, P = 6.9×10−44) and lysine (rs8101881, P = 1.2×10−33), respectively. Fine-mapping of the first locus pinpointed the FUT2 gene, which encodes a fucosyltransferase enzyme and has previously been associated with Crohn's disease. This implicates fucose as a potential prognostic disease marker, for which there is already published evidence from a mouse model. The second SNP lies within the SLC7A9 gene, rare mutations of which have been linked to severe kidney damage. The replication of previous associations and our new discoveries demonstrate the potential of untargeted metabolomics GWAS to robustly identify molecular disease markers. PMID:24586186

  17. Genome-Wide Association Study of Calcium Accumulation in Grains of European Wheat Cultivars

    Directory of Open Access Journals (Sweden)

    Dalia Z. Alomari

    2017-10-01

    Full Text Available Mineral concentrations in cereals are important for human health, especially for people who depend mainly on consuming cereal diet. In this study, we carried out a genome-wide association study (GWAS of calcium concentrations in wheat (Triticum aestivum L. grains using a European wheat diversity panel of 353 varieties [339 winter wheat (WW plus 14 of spring wheat (SW] and phenotypic data based on two field seasons. High genotyping densities of single-nucleotide polymorphism (SNP markers were obtained from the application of the 90k iSELECT ILLUMINA chip and a 35k Affymetrix chip. Inductively coupled plasma optical emission spectrometry (ICP-OES was used to measure the calcium concentrations of the wheat grains. Best linear unbiased estimates (BLUEs for calcium were calculated across the seasons and ranged from 288.20 to 647.50 among the varieties (μg g-1 DW with a mean equaling 438.102 (μg g-1 DW, and the heritability was 0.73. A total of 485 SNP marker–trait associations (MTAs were detected in data obtained from grains cultivated in both of the two seasons and BLUE values by considering associations with a -log10 (P-value ≥3.0. Among these SNP markers, we detected 276 markers with a positive allele effect and 209 markers with a negative allele effect. These MTAs were found on all chromosomes except chromosomes 3D, 4B, and 4D. The most significant association was located on chromosome 5A (114.5 cM and was linked to a gene encoding cation/sugar symporter activity as a potential candidate gene. Additionally, a number of candidate genes for the uptake or transport of calcium were located near significantly associated SNPs. This analysis highlights a number of genomic regions and candidate genes for further analysis as well as the challenges faced when mapping environmentally variable traits in genetically highly diverse variety panels. The research demonstrates the feasibility of the GWAS approach for illuminating the genetic architecture of

  18. Meta-analysis of Genome-Wide Association Studies for Extraversion

    DEFF Research Database (Denmark)

    van den Berg, Stéphanie M; de Moor, Marleen H M; Verweij, K. J. H.

    2016-01-01

    small sample sizes of those studies. Here, we report on a large meta-analysis of GWA studies for extraversion in 63,030 subjects in 29 cohorts. Extraversion item data from multiple personality inventories were harmonized across inventories and cohorts. No genome-wide significant associations were found...... at the single nucleotide polymorphism (SNP) level but there was one significant hit at the gene level for a long non-coding RNA site (LOC101928162). Genome-wide complex trait analysis in two large cohorts showed that the additive variance explained by common SNPs was not significantly different from zero...

  19. Population structure and genetic diversity characterization of a sunflower association mapping population using SSR and SNP markers.

    Science.gov (United States)

    Filippi, Carla V; Aguirre, Natalia; Rivas, Juan G; Zubrzycki, Jeremias; Puebla, Andrea; Cordes, Diego; Moreno, Maria V; Fusari, Corina M; Alvarez, Daniel; Heinz, Ruth A; Hopp, Horacio E; Paniego, Norma B; Lia, Veronica V

    2015-02-13

    Argentina has a long tradition of sunflower breeding, and its germplasm is a valuable genetic resource worldwide. However, knowledge of the genetic constitution and variability levels of the Argentinean germplasm is still scarce, rendering the global map of cultivated sunflower diversity incomplete. In this study, 42 microsatellite loci and 384 single nucleotide polymorphisms (SNPs) were used to characterize the first association mapping population used for quantitative trait loci mapping in sunflower, along with a selection of allied open-pollinated and composite populations from the germplasm bank of the National Institute of Agricultural Technology of Argentina. The ability of different kinds of markers to assess genetic diversity and population structure was also evaluated. The analysis of polymorphism in the set of sunflower accessions studied here showed that both the microsatellites and SNP markers were informative for germplasm characterization, although to different extents. In general, the estimates of genetic variability were moderate. The average genetic diversity, as quantified by the expected heterozygosity, was 0.52 for SSR loci and 0.29 for SNPs. Within SSR markers, those derived from non-coding regions were able to capture higher levels of diversity than EST-SSR. A significant correlation was found between SSR and SNP- based genetic distances among accessions. Bayesian and multivariate methods were used to infer population structure. Evidence for the existence of three different genetic groups was found consistently across data sets (i.e., SSR, SNP and SSR + SNP), with the maintainer/restorer status being the most prevalent characteristic associated with group delimitation. The present study constitutes the first report comparing the performance of SSR and SNP markers for population genetics analysis in cultivated sunflower. We show that the SSR and SNP panels examined here, either used separately or in conjunction, allowed consistent

  20. QTL underlying some agronomic traits in barley detected by SNP markers.

    Science.gov (United States)

    Wang, Jibin; Sun, Genlou; Ren, Xifeng; Li, Chengdao; Liu, Lipan; Wang, Qifei; Du, Binbin; Sun, Dongfa

    2016-07-07

    Increasing the yield of barley (Hordeum vulgare L.) is a main breeding goal in developing barley cultivars. A high density genetic linkage map containing 1894 SNP and 68 SSR markers covering 1375.8 cM was constructed and used for mapping quantitative traits. A late-generation double haploid population (DH) derived from the Huaai 11 × Huadamai 6 cross was used to identify QTLs and QTL × environment interactions for ten traits affecting grain yield including length of main spike (MSL), spikelet number on main spike (SMS), spikelet number per plant (SLP), grain number per plant (GP), grain weight per plant (GWP), grain number per spike (GS), thousand grain weight (TGW), grain weight per spike (GWS), spike density (SPD) and spike number per plant (SP). In single environment analysis using composite interval mapping (CIM), a total of 221 QTLs underlying the ten traits were detected in five consecutive years (2009-2013). The QTLs detected in each year were 50, 48, 41, 41 and 41 for the year 2009 to 2013. The QTLs associated with these traits were generally clustered on chromosome 2H, 4H and 7H. In multi-environment analysis, a total of 111 significant QTLs including 18 for MSL, 16 for SMS, 15 for SPD, 5 for SP, 4 for SLP, 14 for TGW, 5 for GP, 11 for GS, 8 for GWP, and 15 for GWS were detected in the five years. Most QTLs showed significant QTL × environment interactions (QEI), nine QTLs (qIMSL3-1, qIMSL4-1, qIMSL4-2, qIMSL6-1, qISMS7-1, qISPD2-7, qISPD7-1, qITGW3-1 and qIGWS4-3) were detected with minimal QEI effects and stable in different years. Among 111 QTLs,71 (63.40 %) QTLs were detected in both single and multiple environments. Three main QTL cluster regions associated with the 10 agronomic traits on chromosome 2H, 4H and 7H were detected. The QTLs for SMS, SLP, GP and GWP were located in the region near Vrs1 on chromosome 2H. The QTLs underlying SMS, SPD and SLP were clustered on chromosome 4H. On the terminal of chromosome 7H, there was a QTL

  1. Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication.

    Science.gov (United States)

    Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

    2016-06-04

    Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.

  2. Comparing the predictive abilities of phenotypic and marker-assisted selection methods in a biparental lettuce population

    Science.gov (United States)

    Breeding and selection for the traits with polygenic inheritance is a challenging task that can be done by phenotypic selection, by marker-assisted selection or by genome wide selection. We tested predictive ability of four selection models in a biparental population genotyped with 95 SNP markers an...

  3. Genome-wide association study to identify candidate loci and genes for Mn toxicity tolerance in rice.

    Directory of Open Access Journals (Sweden)

    Asis Shrestha

    Full Text Available Manganese (Mn is an essential micro-nutrient for plants, but flooded rice fields can accumulate high levels of Mn2+ leading to Mn toxicity. Here, we present a genome-wide association study (GWAS to identify candidate loci conferring Mn toxicity tolerance in rice (Oryza sativa L.. A diversity panel of 288 genotypes was grown in hydroponic solutions in a greenhouse under optimal and toxic Mn concentrations. We applied a Mn toxicity treatment (5 ppm Mn2+, 3 weeks at twelve days after transplanting. Mn toxicity caused moderate damage in rice in terms of biomass loss and symptom formation despite extremely high shoot Mn concentrations ranging from 2.4 to 17.4 mg g-1. The tropical japonica subpopulation was more sensitive to Mn toxicity than other subpopulations. Leaf damage symptoms were significantly correlated with Mn uptake into shoots. Association mapping was conducted for seven traits using 416741 single nucleotide polymorphism (SNP markers using a mixed linear model, and detected six significant associations for the traits shoot manganese concentration and relative shoot length. Candidate regions contained genes coding for a heavy metal transporter, peroxidase precursor and Mn2+ ion binding proteins. The significant marker SNP-2.22465867 caused an amino acid change in a gene (LOC_Os02g37170 with unknown function. This study demonstrated significant natural variation in rice for Mn toxicity tolerance and the possibility of using GWAS to unravel genetic factors responsible for such complex traits.

  4. Genome-Wide Association Mapping and Genomic Selection for Alfalfa (Medicago sativa) Forage Quality Traits.

    Science.gov (United States)

    Biazzi, Elisa; Nazzicari, Nelson; Pecetti, Luciano; Brummer, E Charles; Palmonari, Alberto; Tava, Aldo; Annicchiarico, Paolo

    2017-01-01

    Genetic progress for forage quality has been poor in alfalfa (Medicago sativa L.), the most-grown forage legume worldwide. This study aimed at exploring opportunities for marker-assisted selection (MAS) and genomic selection of forage quality traits based on breeding values of parent plants. Some 154 genotypes from a broadly-based reference population were genotyped by genotyping-by-sequencing (GBS), and phenotyped for leaf-to-stem ratio, leaf and stem contents of protein, neutral detergent fiber (NDF) and acid detergent lignin (ADL), and leaf and stem NDF digestibility after 24 hours (NDFD), of their dense-planted half-sib progenies in three growing conditions (summer harvest, full irrigation; summer harvest, suspended irrigation; autumn harvest). Trait-marker analyses were performed on progeny values averaged over conditions, owing to modest germplasm × condition interaction. Genomic selection exploited 11,450 polymorphic SNP markers, whereas a subset of 8,494 M. truncatula-aligned markers were used for a genome-wide association study (GWAS). GWAS confirmed the polygenic control of quality traits and, in agreement with phenotypic correlations, indicated substantially different genetic control of a given trait in stems and leaves. It detected several SNPs in different annotated genes that were highly linked to stem protein content. Also, it identified a small genomic region on chromosome 8 with high concentration of annotated genes associated with leaf ADL, including one gene probably involved in the lignin pathway. Three genomic selection models, i.e., Ridge-regression BLUP, Bayes B and Bayesian Lasso, displayed similar prediction accuracy, whereas SVR-lin was less accurate. Accuracy values were moderate (0.3-0.4) for stem NDFD and leaf protein content, modest for leaf ADL and NDFD, and low to very low for the other traits. Along with previous results for the same germplasm set, this study indicates that GBS data can be exploited to improve both quality traits

  5. Exploring germplasm diversity to understand the domestication process in Cicer spp. using SNP and DArT markers.

    Directory of Open Access Journals (Sweden)

    Manish Roorkiwal

    Full Text Available To estimate genetic diversity within and between 10 interfertile Cicer species (94 genotypes from the primary, secondary and tertiary gene pool, we analysed 5,257 DArT markers and 651 KASPar SNP markers. Based on successful allele calling in the tertiary gene pool, 2,763 DArT and 624 SNP markers that are polymorphic between genotypes from the gene pools were analyzed further. STRUCTURE analyses were consistent with 3 cultivated populations, representing kabuli, desi and pea-shaped seed types, with substantial admixture among these groups, while two wild populations were observed using DArT markers. AMOVA was used to partition variance among hierarchical sets of landraces and wild species at both the geographical and species level, with 61% of the variation found between species, and 39% within species. Molecular variance among the wild species was high (39% compared to the variation present in cultivated material (10%. Observed heterozygosity was higher in wild species than the cultivated species for each linkage group. Our results support the Fertile Crescent both as the center of domestication and diversification of chickpea. The collection used in the present study covers all the three regions of historical chickpea cultivation, with the highest diversity in the Fertile Crescent region. Shared alleles between different gene pools suggest the possibility of gene flow among these species or incomplete lineage sorting and could indicate complicated patterns of divergence and fusion of wild chickpea taxa in the past.

  6. Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

    Directory of Open Access Journals (Sweden)

    Huaiyong Luo

    Full Text Available The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers were established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.

  7. Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

    Science.gov (United States)

    Luo, Huaiyong; Wang, Xiaojie; Zhan, Gangming; Wei, Guorong; Zhou, Xinli; Zhao, Jing; Huang, Lili; Kang, Zhensheng

    2015-01-01

    The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs) are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers were established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.

  8. Nuclear Species-Diagnostic SNP Markers Mined from 454 Amplicon Sequencing Reveal Admixture Genomic Structure of Modern Citrus Varieties

    Science.gov (United States)

    Curk, Franck; Ancillo, Gema; Ollitrault, Frédérique; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Garcia-Lor, Andres; Navarro, Luis; Ollitrault, Patrick

    2015-01-01

    Most cultivated Citrus species originated from interspecific hybridisation between four ancestral taxa (C. reticulata, C. maxima, C. medica, and C. micrantha) with limited further interspecific recombination due to vegetative propagation. This evolution resulted in admixture genomes with frequent interspecific heterozygosity. Moreover, a major part of the phenotypic diversity of edible citrus results from the initial differentiation between these taxa. Deciphering the phylogenomic structure of citrus germplasm is therefore essential for an efficient utilization of citrus biodiversity in breeding schemes. The objective of this work was to develop a set of species-diagnostic single nucleotide polymorphism (SNP) markers for the four Citrus ancestral taxa covering the nine chromosomes, and to use these markers to infer the phylogenomic structure of secondary species and modern cultivars. Species-diagnostic SNPs were mined from 454 amplicon sequencing of 57 gene fragments from 26 genotypes of the four basic taxa. Of the 1,053 SNPs mined from 28,507 kb sequence, 273 were found to be highly diagnostic for a single basic taxon. Species-diagnostic SNP markers (105) were used to analyse the admixture structure of varieties and rootstocks. This revealed C. maxima introgressions in most of the old and in all recent selections of mandarins, and suggested that C. reticulata × C. maxima reticulation and introgression processes were important in edible mandarin domestication. The large range of phylogenomic constitutions between C. reticulata and C. maxima revealed in mandarins, tangelos, tangors, sweet oranges, sour oranges, grapefruits, and orangelos is favourable for genetic association studies based on phylogenomic structures of the germplasm. Inferred admixture structures were in agreement with previous hypotheses regarding the origin of several secondary species and also revealed the probable origin of several acid citrus varieties. The developed species-diagnostic SNP

  9. A genome-wide scan for common alleles affecting risk for autism

    Science.gov (United States)

    Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R.; Correia, Catarina; Abrahams, Brett S.; Sykes, Nuala; Pagnamenta, Alistair T.; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J.; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F.; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R.; Casallo, Guillermo; Casey, Jillian; Chu, Su H.; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L.; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A.; Folstein, Susan E.; Fombonne, Eric; Freitag, Christine M.; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T.; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J.; Hakonarson, Hakon; Heron, Elizabeth A.; Hill, Matthew; Holt, Richard; Howe, Jennifer L.; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M.; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M.; Lamb, Janine A.; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L.; Lionel, Anath C.; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C.; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R.; McConachie, Helen; McDougle, Christopher J.; McGrath, Jane; McMahon, William M.; Melhem, Nadine M.; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J.; Mirza, Ghazala K.; Munson, Jeff; Nelson, Stanley F.; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R.; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L.; Bierut, Laura J.; Rice, John P.; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C.; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P.; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B.; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H.; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L.; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D.; Cantor, Rita M.; Cook, Edwin H.; Coon, Hilary; Cuccaro, Michael L.; Gallagher, Louise; Geschwind, Daniel H.; Gill, Michael; Haines, Jonathan L.; Miller, Judith; Monaco, Anthony P.; Nurnberger, John I.; Paterson, Andrew D.; Pericak-Vance, Margaret A.; Schellenberg, Gerard D.; Scherer, Stephen W.; Sutcliffe, James S.; Szatmari, Peter; Vicente, Astrid M.; Vieland, Veronica J.; Wijsman, Ellen M.; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim

    2010-01-01

    Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10−8. When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10−8 threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C. PMID:20663923

  10. A genome-wide scan for common alleles affecting risk for autism.

    LENUS (Irish Health Repository)

    Anney, Richard

    2010-10-15

    Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10(-8). When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner\\'s curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10(-8) threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C.

  11. A genome-wide scan for common alleles affecting risk for autism.

    Science.gov (United States)

    Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R; Correia, Catarina; Abrahams, Brett S; Sykes, Nuala; Pagnamenta, Alistair T; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R; Casallo, Guillermo; Casey, Jillian; Chu, Su H; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A; Folstein, Susan E; Fombonne, Eric; Freitag, Christine M; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J; Hakonarson, Hakon; Heron, Elizabeth A; Hill, Matthew; Holt, Richard; Howe, Jennifer L; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M; Lamb, Janine A; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L; Lionel, Anath C; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R; McConachie, Helen; McDougle, Christopher J; McGrath, Jane; McMahon, William M; Melhem, Nadine M; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J; Mirza, Ghazala K; Munson, Jeff; Nelson, Stanley F; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L; Bierut, Laura J; Rice, John P; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D; Cantor, Rita M; Cook, Edwin H; Coon, Hilary; Cuccaro, Michael L; Gallagher, Louise; Geschwind, Daniel H; Gill, Michael; Haines, Jonathan L; Miller, Judith; Monaco, Anthony P; Nurnberger, John I; Paterson, Andrew D; Pericak-Vance, Margaret A; Schellenberg, Gerard D; Scherer, Stephen W; Sutcliffe, James S; Szatmari, Peter; Vicente, Astrid M; Vieland, Veronica J; Wijsman, Ellen M; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim

    2010-10-15

    Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10(-8). When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10(-8) threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C.

  12. Genome-wide survey of single-nucleotide polymorphisms reveals fine-scale population structure and signs of selection in the threatened Caribbean elkhorn coral, Acropora palmata

    Directory of Open Access Journals (Sweden)

    Meghann K. Devlin-Durante

    2017-11-01

    Full Text Available The advent of next-generation sequencing tools has made it possible to conduct fine-scale surveys of population differentiation and genome-wide scans for signatures of selection in non-model organisms. Such surveys are of particular importance in sharply declining coral species, since knowledge of population boundaries and signs of local adaptation can inform restoration and conservation efforts. Here, we use genome-wide surveys of single-nucleotide polymorphisms in the threatened Caribbean elkhorn coral, Acropora palmata, to reveal fine-scale population structure and infer the major barrier to gene flow that separates the eastern and western Caribbean populations between the Bahamas and Puerto Rico. The exact location of this break had been subject to discussion because two previous studies based on microsatellite data had come to differing conclusions. We investigate this contradiction by analyzing an extended set of 11 microsatellite markers including the five previously employed and discovered that one of the original microsatellite loci is apparently under selection. Exclusion of this locus reconciles the results from the SNP and the microsatellite datasets. Scans for outlier loci in the SNP data detected 13 candidate loci under positive selection, however there was no correlation between available environmental parameters and genetic distance. Together, these results suggest that reef restoration efforts should use local sources and utilize existing functional variation among geographic regions in ex situ crossing experiments to improve stress resistance of this species.

  13. Genome-wide survey of single-nucleotide polymorphisms reveals fine-scale population structure and signs of selection in the threatened Caribbean elkhorn coral, Acropora palmata.

    Science.gov (United States)

    Devlin-Durante, Meghann K; Baums, Iliana B

    2017-01-01

    The advent of next-generation sequencing tools has made it possible to conduct fine-scale surveys of population differentiation and genome-wide scans for signatures of selection in non-model organisms. Such surveys are of particular importance in sharply declining coral species, since knowledge of population boundaries and signs of local adaptation can inform restoration and conservation efforts. Here, we use genome-wide surveys of single-nucleotide polymorphisms in the threatened Caribbean elkhorn coral, Acropora palmata , to reveal fine-scale population structure and infer the major barrier to gene flow that separates the eastern and western Caribbean populations between the Bahamas and Puerto Rico. The exact location of this break had been subject to discussion because two previous studies based on microsatellite data had come to differing conclusions. We investigate this contradiction by analyzing an extended set of 11 microsatellite markers including the five previously employed and discovered that one of the original microsatellite loci is apparently under selection. Exclusion of this locus reconciles the results from the SNP and the microsatellite datasets. Scans for outlier loci in the SNP data detected 13 candidate loci under positive selection, however there was no correlation between available environmental parameters and genetic distance. Together, these results suggest that reef restoration efforts should use local sources and utilize existing functional variation among geographic regions in ex situ crossing experiments to improve stress resistance of this species.

  14. A genome-wide association scan in pig identifies novel regions associated with feed efficiency trait

    DEFF Research Database (Denmark)

    Sahana, Goutam; Kadlecová, Veronika; Hornshøj, Henrik

    2013-01-01

    Feed conversion ratio (FCR) is an economically important trait in pigs and feed accounts for a significant proportion of the costs involved in pig production. In this study we used a high density SNP chip panel, Porcine SNP60 BeadChip, to identify association between FCR and SNP markers and to st...

  15. Genome-Wide Association Mapping of Leaf Rust Response in a Durum Wheat Worldwide Germplasm Collection.

    Science.gov (United States)

    Aoun, Meriem; Breiland, Matthew; Kathryn Turner, M; Loladze, Alexander; Chao, Shiaoman; Xu, Steven S; Ammar, Karim; Anderson, James A; Kolmer, James A; Acevedo, Maricelis

    2016-11-01

    Leaf rust (caused by Erikss. []) is increasingly impacting durum wheat ( L. var. ) production with the recent appearance of races with virulence to widely grown cultivars in many durum producing areas worldwide. A highly virulent race on durum wheat was recently detected in Kansas. This race may spread to the northern Great Plains, where most of the US durum wheat is produced. The objective of this study was to identify sources of resistance to several races from the United States and Mexico at seedling stage in the greenhouse and at adult stage in field experiments. Genome-wide association study (GWAS) was used to identify single-nucleotide polymorphism (SNP) markers associated with leaf rust response in a worldwide durum wheat collection of 496 accessions. Thirteen accessions were resistant across all experiments. Association mapping revealed 88 significant SNPs associated with leaf rust response. Of these, 33 SNPs were located on chromosomes 2A and 2B, and 55 SNPs were distributed across all other chromosomes except for 1B and 7B. Twenty markers were associated with leaf rust response at seedling stage, while 68 markers were associated with leaf rust response at adult plant stage. The current study identified a total of 14 previously uncharacterized loci associated with leaf rust response in durum wheat. The discovery of these loci through association mapping (AM) is a significant step in identifying useful sources of resistance that can be used to broaden the relatively narrow leaf rust resistance spectrum in durum wheat germplasm. Copyright © 2016 Crop Science Society of America.

  16. Genome-Wide Association Mapping for Identification of Quantitative Trait Loci for Rectal Temperature during Heat Stress in Holstein Cattle

    Science.gov (United States)

    Dikmen, Serdal; Cole, John B.; Null, Daniel J.; Hansen, Peter J.

    2013-01-01

    Heat stress compromises production, fertility, and health of dairy cattle. One mitigation strategy is to select individuals that are genetically resistant to heat stress. Most of the negative effects of heat stress on animal performance are a consequence of either physiological adaptations to regulate body temperature or adverse consequences of failure to regulate body temperature. Thus, selection for regulation of body temperature during heat stress could increase thermotolerance. The objective was to perform a genome-wide association study (GWAS) for rectal temperature (RT) during heat stress in lactating Holstein cows and identify SNPs associated with genes that have large effects on RT. Records on afternoon RT where the temperature-humidity index was ≥78.2 were obtained from 4,447 cows sired by 220 bulls, resulting in 1,440 useable genotypes from the Illumina BovineSNP50 BeadChip with 39,759 SNP. For GWAS, 2, 3, 4, 5, and 10 adjacent SNP were averaged to identify consensus genomic regions associated with RT. The largest proportion of SNP variance (0.07 to 0.44%) was explained by markers flanking the region between 28,877,547 and 28,907,154 bp on Bos taurus autosome (BTA) 24. That region is flanked by U1 (28,822,883 to 28,823,043) and NCAD (28,992,666 to 29,241,119). In addition, the SNP at 58,500,249 bp on BTA 16 explained 0.08% and 0.11% of the SNP variance for 2- and 3-SNP analyses, respectively. That contig includes SNORA19, RFWD2 and SCARNA3. Other SNPs associated with RT were located on BTA 16 (close to CEP170 and PLD5), BTA 5 (near SLCO1C1 and PDE3A), BTA 4 (near KBTBD2 and LSM5), and BTA 26 (located in GOT1, a gene implicated in protection from cellular stress). In conclusion, there are QTL for RT in heat-stressed dairy cattle. These SNPs could prove useful in genetic selection and for identification of genes involved in physiological responses to heat stress. PMID:23935954

  17. A genome-wide association study of attempted suicide

    Science.gov (United States)

    Willour, Virginia L.; Seifuddin, Fayaz; Mahon, Pamela B.; Jancic, Dubravka; Pirooznia, Mehdi; Steele, Jo; Schweizer, Barbara; Goes, Fernando S.; Mondimore, Francis M.; MacKinnon, Dean F.; Perlis, Roy H.; Lee, Phil Hyoun; Huang, Jie; Kelsoe, John R.; Shilling, Paul D.; Rietschel, Marcella; Nöthen, Markus; Cichon, Sven; Gurling, Hugh; Purcell, Shaun; Smoller, Jordan W.; Craddock, Nicholas; DePaulo, J. Raymond; Schulze, Thomas G.; McMahon, Francis J.; Zandi, Peter P.; Potash, James B.

    2011-01-01

    The heritable component to attempted and completed suicide is partly related to psychiatric disorders and also partly independent of them. While attempted suicide linkage regions have been identified on 2p11–12 and 6q25–26, there are likely many more such loci, the discovery of which will require a much higher resolution approach, such as the genome-wide association study (GWAS). With this in mind, we conducted an attempted suicide GWAS that compared the single nucleotide polymorphism (SNP) genotypes of 1,201 bipolar (BP) subjects with a history of suicide attempts to the genotypes of 1,497 BP subjects without a history of suicide attempts. 2,507 SNPs with evidence for association at p<0.001 were identified. These associated SNPs were subsequently tested for association in a large and independent BP sample set. None of these SNPs were significantly associated in the replication sample after correcting for multiple testing, but the combined analysis of the two sample sets produced an association signal on 2p25 (rs300774) at the threshold of genome-wide significance (p= 5.07 × 10−8). The associated SNPs on 2p25 fall in a large linkage disequilibrium block containing the ACP1 gene, a gene whose expression is significantly elevated in BP subjects who have completed suicide. Furthermore, the ACP1 protein is a tyrosine phosphatase that influences Wnt signaling, a pathway regulated by lithium, making ACP1 a functional candidate for involvement in the phenotype. Larger GWAS sample sets will be required to confirm the signal on 2p25 and to identify additional genetic risk factors increasing susceptibility for attempted suicide. PMID:21423239

  18. Rare CNVs in Suicide Attempt include Schizophrenia-Associated Loci and Neurodevelopmental Genes: A Pilot Genome-Wide and Family-Based Study.

    Directory of Open Access Journals (Sweden)

    Marcus Sokolowski

    Full Text Available Suicidal behavior (SB has a complex etiology involving genes and environment. One of the genetic components in SB could be copy number variations (CNVs, as CNVs are implicated in neurodevelopmental disorders. However, a recently published genome-wide and case-control study did not observe any significant role of CNVs in SB. Here we complemented these initial observations by instead using a family-based trio-sample that is robust to control biases, having severe suicide attempt (SA in offspring as main outcome (n = 660 trios. We first tested for CNV associations on the genome-wide Illumina 1M SNP-array by using FBAT-CNV methodology, which allows for evaluating CNVs without reliance on CNV calling algorithms, analogous to a common SNP-based GWAS. We observed association of certain T-cell receptor markers, but this likely reflected inter-individual variation in somatic rearrangements rather than association with SA outcome. Next, we used the PennCNV software to call 385 putative rare (100 kb CNVs, observed in n = 225 SA offspring. Nine SA offspring had rare CNV calls in a set of previously schizophrenia-associated loci, indicating the importance of such CNVs in certain SA subjects. Several additional, very large (>1MB sized CNV calls in 15 other SA offspring also spanned pathogenic regions or other neural genes of interest. Overall, 45 SA had CNVs enriched for 65 medically relevant genes previously shown to be affected by CNVs, which were characterized by a neurodevelopmental biology. A neurodevelopmental implication was partly congruent with our previous SNP-based GWAS, but follow-up analysis here indicated that carriers of rare CNVs had a decreased burden of common SNP risk-alleles compared to non-carriers. In conclusion, while CNVs did not show genome-wide association by the FBAT-CNV methodology, our preliminary observations indicate rare pathogenic CNVs affecting neurodevelopmental functions in a subset of SA, who were distinct from SA having

  19. Genome-wide Association Study to Identify Quantitative Trait Loci for Meat and Carcass Quality Traits in Berkshire

    Science.gov (United States)

    Iqbal, Asif; Kim, You-Sam; Kang, Jun-Mo; Lee, Yun-Mi; Rai, Rajani; Jung, Jong-Hyun; Oh, Dong-Yup; Nam, Ki-Chang; Lee, Hak-Kyo; Kim, Jong-Joo

    2015-01-01

    Meat and carcass quality attributes are of crucial importance influencing consumer preference and profitability in the pork industry. A set of 400 Berkshire pigs were collected from Dasan breeding farm, Namwon, Chonbuk province, Korea that were born between 2012 and 2013. To perform genome wide association studies (GWAS), eleven meat and carcass quality traits were considered, including carcass weight, backfat thickness, pH value after 24 hours (pH24), Commission Internationale de l’Eclairage lightness in meat color (CIE L), redness in meat color (CIE a), yellowness in meat color (CIE b), filtering, drip loss, heat loss, shear force and marbling score. All of the 400 animals were genotyped with the Porcine 62K SNP BeadChips (Illumina Inc., USA). A SAS general linear model procedure (SAS version 9.2) was used to pre-adjust the animal phenotypes before GWAS with sire and sex effects as fixed effects and slaughter age as a covariate. After fitting the fixed and covariate factors in the model, the residuals of the phenotype regressed on additive effects of each single nucleotide polymorphism (SNP) under a linear regression model (PLINK version 1.07). The significant SNPs after permutation testing at a chromosome-wise level were subjected to stepwise regression analysis to determine the best set of SNP markers. A total of 55 significant (p<0.05) SNPs or quantitative trait loci (QTL) were detected on various chromosomes. The QTLs explained from 5.06% to 8.28% of the total phenotypic variation of the traits. Some QTLs with pleiotropic effect were also identified. A pair of significant QTL for pH24 was also found to affect both CIE L and drip loss percentage. The significant QTL after characterization of the functional candidate genes on the QTL or around the QTL region may be effectively and efficiently used in marker assisted selection to achieve enhanced genetic improvement of the trait considered. PMID:26580276

  20. Genome-wide Association Study to Identify Quantitative Trait Loci for Meat and Carcass Quality Traits in Berkshire

    Directory of Open Access Journals (Sweden)

    Asif Iqbal

    2015-11-01

    Full Text Available Meat and carcass quality attributes are of crucial importance influencing consumer preference and profitability in the pork industry. A set of 400 Berkshire pigs were collected from Dasan breeding farm, Namwon, Chonbuk province, Korea that were born between 2012 and 2013. To perform genome wide association studies (GWAS, eleven meat and carcass quality traits were considered, including carcass weight, backfat thickness, pH value after 24 hours (pH24, Commission Internationale de l’Eclairage lightness in meat color (CIE L, redness in meat color (CIE a, yellowness in meat color (CIE b, filtering, drip loss, heat loss, shear force and marbling score. All of the 400 animals were genotyped with the Porcine 62K SNP BeadChips (Illumina Inc., USA. A SAS general linear model procedure (SAS version 9.2 was used to pre-adjust the animal phenotypes before GWAS with sire and sex effects as fixed effects and slaughter age as a covariate. After fitting the fixed and covariate factors in the model, the residuals of the phenotype regressed on additive effects of each single nucleotide polymorphism (SNP under a linear regression model (PLINK version 1.07. The significant SNPs after permutation testing at a chromosome-wise level were subjected to stepwise regression analysis to determine the best set of SNP markers. A total of 55 significant (p<0.05 SNPs or quantitative trait loci (QTL were detected on various chromosomes. The QTLs explained from 5.06% to 8.28% of the total phenotypic variation of the traits. Some QTLs with pleiotropic effect were also identified. A pair of significant QTL for pH24 was also found to affect both CIE L and drip loss percentage. The significant QTL after characterization of the functional candidate genes on the QTL or around the QTL region may be effectively and efficiently used in marker assisted selection to achieve enhanced genetic improvement of the trait considered.

  1. FGWAS: Functional genome wide association analysis.

    Science.gov (United States)

    Huang, Chao; Thompson, Paul; Wang, Yalin; Yu, Yang; Zhang, Jingwen; Kong, Dehan; Colen, Rivka R; Knickmeyer, Rebecca C; Zhu, Hongtu

    2017-10-01

    Functional phenotypes (e.g., subcortical surface representation), which commonly arise in imaging genetic studies, have been used to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. However, existing statistical methods largely ignore the functional features (e.g., functional smoothness and correlation). The aim of this paper is to develop a functional genome-wide association analysis (FGWAS) framework to efficiently carry out whole-genome analyses of functional phenotypes. FGWAS consists of three components: a multivariate varying coefficient model, a global sure independence screening procedure, and a test procedure. Compared with the standard multivariate regression model, the multivariate varying coefficient model explicitly models the functional features of functional phenotypes through the integration of smooth coefficient functions and functional principal component analysis. Statistically, compared with existing methods for genome-wide association studies (GWAS), FGWAS can substantially boost the detection power for discovering important genetic variants influencing brain structure and function. Simulation studies show that FGWAS outperforms existing GWAS methods for searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. We have successfully applied FGWAS to large-scale analysis of data from the Alzheimer's Disease Neuroimaging Initiative for 708 subjects, 30,000 vertices on the left and right hippocampal surfaces, and 501,584 SNPs. Copyright © 2017 Elsevier Inc. All rights reserved.

  2. A Genome Wide Association Study on Age at First Calving Using High Density Single Nucleotide Polymorphism Chips in Hanwoo (

    Directory of Open Access Journals (Sweden)

    K.-E. Hyeong

    2014-10-01

    Full Text Available Age at first calving is an important trait for achieving earlier reproductive performance. To detect quantitative trait loci (QTL for reproductive traits, a genome wide association study was conducted on the 96 Hanwoo cows that were born between 2008 and 2010 from 13 sires in a local farm (Juk-Am Hanwoo farm, Suncheon, Korea and genotyped with the Illumina 50K bovine single nucleotide polymorphism (SNP chips. Phenotypes were regressed on additive and dominance effects for each SNP using a simple linear regression model after the effects of birth-year-month and polygenes were considered. A forward regression procedure was applied to determine the best set of SNPs for age at first calving. A total of 15 QTL were detected at the comparison-wise 0.001 level. Two QTL with strong statistical evidence were found at 128.9 Mb and 111.1 Mb on bovine chromosomes (BTA 2 and 7, respectively, each of which accounted for 22% of the phenotypic variance. Also, five significant SNPs were detected on BTAs 10, 16, 20, 26, and 29. Multiple QTL were found on BTAs 1, 2, 7, and 14. The significant QTLs may be applied via marker assisted selection to increase rate of genetic gain for the trait, after validation tests in other Hanwoo cow populations.

  3. Identification and Validation of SNP Markers Linked to Dwarf Traits Using SLAF-Seq Technology in Lagerstroemia

    Science.gov (United States)

    Ju, Yiqian; Jiao, Yao; Feng, Lu; Pan, Huitang; Cheng, Tangren; Zhang, Qixiang

    2016-01-01

    The genetic control of plant architecture is a promising approach to breed desirable cultivars, particularly in ornamental flowers. In this study, the F1 population (142 seedlings) derived from Lagerstroemia fauriei (non-dwarf) × L. indica ‘Pocomoke’ (dwarf) was phenotyped for six traits (plant height (PH), internode length (IL), internode number, primary lateral branch height (PLBH), secondary lateral branch height and primary branch number), and the IL and PLBH traits were positively correlated with the PH trait and considered representative indexes of PH. Fifty non-dwarf and dwarf seedlings were pooled and subjected to a specific-locus amplified fragment sequencing (SLAF-seq) method, which screened 1221 polymorphic markers. A total of 3 markers segregating between bulks were validated in the F1 population, with the M16337 and M38412 markers highly correlated with the IL trait and the M25207 marker highly correlated with the PLBH trait. These markers provide a predictability of approximately 80% using a single marker (M25207) and a predictability of 90% using marker combinations (M16337 + M25207) in the F1 population, which revealed that the IL and the PLBH traits, especially the PLBH, were the decisive elements for PH in terms of molecular regulation. Further validation was performed in the BC1 population and a set of 28 Lagerstroemia stocks using allele-specific PCR (AS-PCR) technology, and the results showed the stability and reliability of the SNP markers and the co-determination of PH by multiple genes. Our findings provide an important theoretical and practical basis for the early prediction and indirect selection of PH using the IL and the PLBH, and the detected SNPs may be useful for marker-assisted selection (MAS) in crape myrtle. PMID:27404662

  4. Genome-wide association studies of obesity and metabolic syndrome.

    Science.gov (United States)

    Fall, Tove; Ingelsson, Erik

    2014-01-25

    Until just a few years ago, the genetic determinants of obesity and metabolic syndrome were largely unknown, with the exception of a few forms of monogenic extreme obesity. Since genome-wide association studies (GWAS) became available, large advances have been made. The first single nucleotide polymorphism robustly associated with increased body mass index (BMI) was in 2007 mapped to a gene with for the time unknown function. This gene, now known as fat mass and obesity associated (FTO) has been repeatedly replicated in several ethnicities and is affecting obesity by regulating appetite. Since the first report from a GWAS of obesity, an increasing number of markers have been shown to be associated with BMI, other measures of obesity or fat distribution and metabolic syndrome. This systematic review of obesity GWAS will summarize genome-wide significant findings for obesity and metabolic syndrome and briefly give a few suggestions of what is to be expected in the next few years. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  5. Genome-wide association study of Tourette Syndrome

    Science.gov (United States)

    Scharf, Jeremiah M.; Yu, Dongmei; Mathews, Carol A.; Neale, Benjamin M.; Stewart, S. Evelyn; Fagerness, Jesen A; Evans, Patrick; Gamazon, Eric; Edlund, Christopher K.; Service, Susan; Tikhomirov, Anna; Osiecki, Lisa; Illmann, Cornelia; Pluzhnikov, Anna; Konkashbaev, Anuar; Davis, Lea K; Han, Buhm; Crane, Jacquelyn; Moorjani, Priya; Crenshaw, Andrew T.; Parkin, Melissa A.; Reus, Victor I.; Lowe, Thomas L.; Rangel-Lugo, Martha; Chouinard, Sylvain; Dion, Yves; Girard, Simon; Cath, Danielle C; Smit, Jan H; King, Robert A.; Fernandez, Thomas; Leckman, James F.; Kidd, Kenneth K.; Kidd, Judith R.; Pakstis, Andrew J.; State, Matthew; Herrera, Luis Diego; Romero, Roxana; Fournier, Eduardo; Sandor, Paul; Barr, Cathy L; Phan, Nam; Gross-Tsur, Varda; Benarroch, Fortu; Pollak, Yehuda; Budman, Cathy L.; Bruun, Ruth D.; Erenberg, Gerald; Naarden, Allan L; Lee, Paul C; Weiss, Nicholas; Kremeyer, Barbara; Berrío, Gabriel Bedoya; Campbell, Desmond; Silgado, Julio C. Cardona; Ochoa, William Cornejo; Restrepo, Sandra C. Mesa; Muller, Heike; Duarte, Ana V. Valencia; Lyon, Gholson J; Leppert, Mark; Morgan, Jubel; Weiss, Robert; Grados, Marco A.; Anderson, Kelley; Davarya, Sarah; Singer, Harvey; Walkup, John; Jankovic, Joseph; Tischfield, Jay A.; Heiman, Gary A.; Gilbert, Donald L.; Hoekstra, Pieter J.; Robertson, Mary M.; Kurlan, Roger; Liu, Chunyu; Gibbs, J. Raphael; Singleton, Andrew; Hardy, John; Strengman, Eric; Ophoff, Roel; Wagner, Michael; Moessner, Rainald; Mirel, Daniel B.; Posthuma, Danielle; Sabatti, Chiara; Eskin, Eleazar; Conti, David V.; Knowles, James A.; Ruiz-Linares, Andres; Rouleau, Guy A.; Purcell, Shaun; Heutink, Peter; Oostra, Ben A.; McMahon, William; Freimer, Nelson; Cox, Nancy J.; Pauls, David L.

    2012-01-01

    Tourette Syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel, and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (p<5 × 10−8); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (p=1.85 × 10−6). A secondary analysis including an additional 211 cases and 285 controls from two closely-related Latin-American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (p=3.6 × 10−7 for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924

  6. Leaf Transcriptome Sequencing for Identifying Genic-SSR Markers and SNP Heterozygosity in Crossbred Mango Variety 'Amrapali' (Mangifera indica L.).

    Science.gov (United States)

    Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar

    2016-01-01

    Mango (Mangifera indica L.) is called "king of fruits" due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties 'Neelam', 'Dashehari' and their hybrid 'Amrapali' using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango.

  7. A Novel QTL for Powdery Mildew Resistance in Nordic Spring Barley (Hordeum vulgare L. ssp. vulgare) Revealed by Genome-Wide Association Study.

    Science.gov (United States)

    Bengtsson, Therése; Åhman, Inger; Manninen, Outi; Reitan, Lars; Christerson, Therese; Due Jensen, Jens; Krusell, Lene; Jahoor, Ahmed; Orabi, Jihad

    2017-01-01

    The powdery mildew fungus, Blumeria graminis f. sp. hordei is a worldwide threat to barley ( Hordeum vulgare L. ssp. vulgare ) production. One way to control the disease is by the development and deployment of resistant cultivars. A genome-wide association study was performed in a Nordic spring barley panel consisting of 169 genotypes, to identify marker-trait associations significant for powdery mildew. Powdery mildew was scored during three years (2012-2014) in four different locations within the Nordic region. There were strong correlations between data from all locations and years. In total four QTLs were identified, one located on chromosome 4H in the same region as the previously identified mlo locus and three on chromosome 6H. Out of these three QTLs identified on chromosome 6H, two are in the same region as previously reported QTLs for powdery mildew resistance, whereas one QTL appears to be novel. The top NCBI BLASTn hit of the SNP markers within the novel QTL predicted the responsible gene to be the 26S proteasome regulatory subunit, RPN1, which is required for innate immunity and powdery mildew-induced cell death in Arabidopsis . The results from this study have revealed SNP marker candidates that can be exploited for use in marker-assisted selection and stacking of genes for powdery mildew resistance in barley.

  8. Genome-wide Analysis of Gene Regulation

    DEFF Research Database (Denmark)

    Chen, Yun

    to protein: through epigenetic modifications, transcription regulators or post-transcriptional controls. The following papers concern several layers of gene regulation with questions answered by different HTS approaches. Genome-wide screening of epigenetic changes by ChIP-seq allowed us to study both spatial...... and temporal alterations of histone modifications (Papers I and II). Coupling the data with machine learning approaches, we established a prediction framework to assess the most informative histone marks as well as their most influential nucleosome positions in predicting the promoter usages. (Papers I...... they regulated or if the sites had global elevated usage rates by multiple TFs. Using RNA-seq, 5’end-seq in combination with depletion of 5’exonuclease as well as nonsensemediated decay (NMD) factors, we systematically analyzed NMD substrates as well as their degradation intermediates in human cells (Paper V...

  9. Genome-wide association study of schizophrenia in Japanese population.

    Directory of Open Access Journals (Sweden)

    Kazuo Yamada

    Full Text Available Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions. The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila-like 2] gene located on 9p21.3 (p = 0.00087. In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026. The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology.

  10. Susceptibility to chronic mucus hypersecretion, a genome wide association study.

    Directory of Open Access Journals (Sweden)

    Akkelies E Dijkstra

    Full Text Available Chronic mucus hypersecretion (CMH is associated with an increased frequency of respiratory infections, excess lung function decline, and increased hospitalisation and mortality rates in the general population. It is associated with smoking, but it is unknown why only a minority of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA study of CMH in Caucasian populations.GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years. Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism (SNP.A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10(-6, OR = 1.17, located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1 on chromosome 3. The risk allele (G was associated with higher mRNA expression of SATB1 (4.3×10(-9 in lung tissue. Presence of CMH was associated with increased SATB1 mRNA expression in bronchial biopsies from COPD patients. SATB1 expression was induced during differentiation of primary human bronchial epithelial cells in culture.Our findings, that SNP rs6577641 is associated with CMH in multiple cohorts and is a cis-eQTL for SATB1, together with our additional observation that SATB1 expression increases during epithelial differentiation provide suggestive evidence that SATB1 is a gene that affects CMH.

  11. Significant Locus and Metabolic Genetic Correlations Revealed in Genome-Wide Association Study of Anorexia Nervosa.

    Science.gov (United States)

    Duncan, Laramie; Yilmaz, Zeynep; Gaspar, Helena; Walters, Raymond; Goldstein, Jackie; Anttila, Verneri; Bulik-Sullivan, Brendan; Ripke, Stephan; Thornton, Laura; Hinney, Anke; Daly, Mark; Sullivan, Patrick F; Zeggini, Eleftheria; Breen, Gerome; Bulik, Cynthia M

    2017-09-01

    The authors conducted a genome-wide association study of anorexia nervosa and calculated genetic correlations with a series of psychiatric, educational, and metabolic phenotypes. Following uniform quality control and imputation procedures using the 1000 Genomes Project (phase 3) in 12 case-control cohorts comprising 3,495 anorexia nervosa cases and 10,982 controls, the authors performed standard association analysis followed by a meta-analysis across cohorts. Linkage disequilibrium score regression was used to calculate genome-wide common variant heritability (single-nucleotide polymorphism [SNP]-based heritability [h 2 SNP ]), partitioned heritability, and genetic correlations (r g ) between anorexia nervosa and 159 other phenotypes. Results were obtained for 10,641,224 SNPs and insertion-deletion variants with minor allele frequencies >1% and imputation quality scores >0.6. The h 2 SNP of anorexia nervosa was 0.20 (SE=0.02), suggesting that a substantial fraction of the twin-based heritability arises from common genetic variation. The authors identified one genome-wide significant locus on chromosome 12 (rs4622308) in a region harboring a previously reported type 1 diabetes and autoimmune disorder locus. Significant positive genetic correlations were observed between anorexia nervosa and schizophrenia, neuroticism, educational attainment, and high-density lipoprotein cholesterol, and significant negative genetic correlations were observed between anorexia nervosa and body mass index, insulin, glucose, and lipid phenotypes. Anorexia nervosa is a complex heritable phenotype for which this study has uncovered the first genome-wide significant locus. Anorexia nervosa also has large and significant genetic correlations with both psychiatric phenotypes and metabolic traits. The study results encourage a reconceptualization of this frequently lethal disorder as one with both psychiatric and metabolic etiology.

  12. Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array

    Energy Technology Data Exchange (ETDEWEB)

    Gardner, S; Jaing, C

    2012-03-27

    The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interim report, we described the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.

  13. A Genome-Wide Association Study Primer for Clinicians

    Directory of Open Access Journals (Sweden)

    Tzu-Hao Wang

    2009-06-01

    Full Text Available Genome-wide association studies (GWAS use high-throughput genotyping technology to relate hundreds of thousands of genetic markers (genotypes to clinical conditions and measurable traits (phenotypes. This review is intended to serve as an introduction to GWAS for clinicians, to allow them to better appreciate the value and limitations of GWAS for genotype-disease association studies. The input of clinicians is vital for GWAS, since disease heterogeneity is frequently a confounding factor that can only really be solved by clinicians. For diseases that are difficult to diagnose, clinicians should ensure that the cases do indeed have the disease; for common diseases, clinicians should ensure that the controls are truly disease-free.

  14. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    Energy Technology Data Exchange (ETDEWEB)

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMcahon, Katherine D.; Mamlstrom, Rex R.

    2014-05-12

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ecotype model? of diversification, but not previously observed in natural populations.

  15. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    Energy Technology Data Exchange (ETDEWEB)

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMahon, Katherine D.; Malmstrom, Rex R.

    2014-06-18

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.

  16. Population structure of Atlantic Mackerel inferred from RAD-seq derived SNP markers: effects of sequence clustering parameters and hierarchical SNP selection

    KAUST Repository

    Rodrí guez-Ezpeleta, Naiara; Bradbury, Ian R.; Mendibil, Iñ aki; Á lvarez, Paula; Cotano, Unai; Irigoien, Xabier

    2016-01-01

    : the maximum number of mismatches allowed to merge reads into a locus and the relatedness of the individuals used for genotype calling and SNP selection. Our study resolves the population structure of the Atlantic mackerel, but, most importantly, provides

  17. Integrating genome-wide genetic variations and monocyte expression data reveals trans-regulated gene modules in humans.

    Directory of Open Access Journals (Sweden)

    Maxime Rotival

    2011-12-01

    Full Text Available One major expectation from the transcriptome in humans is to characterize the biological basis of associations identified by genome-wide association studies. So far, few cis expression quantitative trait loci (eQTLs have been reliably related to disease susceptibility. Trans-regulating mechanisms may play a more prominent role in disease susceptibility. We analyzed 12,808 genes detected in at least 5% of circulating monocyte samples from a population-based sample of 1,490 European unrelated subjects. We applied a method of extraction of expression patterns-independent component analysis-to identify sets of co-regulated genes. These patterns were then related to 675,350 SNPs to identify major trans-acting regulators. We detected three genomic regions significantly associated with co-regulated gene modules. Association of these loci with multiple expression traits was replicated in Cardiogenics, an independent study in which expression profiles of monocytes were available in 758 subjects. The locus 12q13 (lead SNP rs11171739, previously identified as a type 1 diabetes locus, was associated with a pattern including two cis eQTLs, RPS26 and SUOX, and 5 trans eQTLs, one of which (MADCAM1 is a potential candidate for mediating T1D susceptibility. The locus 12q24 (lead SNP rs653178, which has demonstrated extensive disease pleiotropy, including type 1 diabetes, hypertension, and celiac disease, was associated to a pattern strongly correlating to blood pressure level. The strongest trans eQTL in this pattern was CRIP1, a known marker of cellular proliferation in cancer. The locus 12q15 (lead SNP rs11177644 was associated with a pattern driven by two cis eQTLs, LYZ and YEATS4, and including 34 trans eQTLs, several of them tumor-related genes. This study shows that a method exploiting the structure of co-expressions among genes can help identify genomic regions involved in trans regulation of sets of genes and can provide clues for understanding the

  18. Genome-wide scans for delineation of candidate genes regulating seed-protein content in chickpea

    Directory of Open Access Journals (Sweden)

    Hari Deo eUpadhyaya

    2016-03-01

    Full Text Available Identification of potential genes/alleles governing complex seed-protein content (SPC trait is essential in marker-assisted breeding for quality trait improvement of chickpea. Henceforth, the present study utilized an integrated genomics-assisted breeding strategy encompassing trait association analysis, selective genotyping in traditional bi-parental mapping population and differential expression profiling for the first-time to understand the complex genetic architecture of quantitative SPC trait in chickpea. For GWAS (genome-wide association study, high-throughput genotyping information of 16376 genome-based SNPs (single nucleotide polymorphism discovered from a structured population of 336 sequenced desi and kabuli accessions [with 150-200 kb LD (linkage disequilibrium decay] was utilized. This led to identification of seven most effective genomic loci (genes associated [10 to 20% with 41% combined PVE (phenotypic variation explained] with SPC trait in chickpea. Regardless of the diverse desi and kabuli genetic backgrounds, a comparable level of association potential of the identified seven genomic loci with SPC trait was observed. Five SPC-associated genes were validated successfully in parental accessions and homozygous individuals of an intra-specific desi RIL (recombinant inbred line mapping population (ICC 12299 x ICC 4958 by selective genotyping. The seed-specific expression, including differential up-regulation (> 4-fold of six SPC-associated genes particularly in accessions, parents and homozygous individuals of the aforementioned mapping population with high level of contrasting seed-protein content (21-22% was evident. Collectively, the integrated genomic approach delineated diverse naturally occurring novel functional SNP allelic variants in six potential candidate genes regulating SPC trait in chickpea. Of these, a non-synonymous SNP allele-carrying zinc finger transcription factor gene exhibiting strong association with SPC trait

  19. Genotyping of single spore isolates of a Pasteuria penetrans population occurring in Florida using SNP-based markers.

    Science.gov (United States)

    Joseph, S; Schmidt, L M; Danquah, W B; Timper, P; Mekete, T

    2017-02-01

    To generate single spore lines of a population of bacterial parasite of root-knot nematode (RKN), Pasteuria penetrans, isolated from Florida and examine genotypic variation and virulence characteristics exist within the population. Six single spore lines (SSP), 16SSP, 17SSP, 18SSP, 25SSP, 26SSP and 30SSP were generated. Genetic variability was evaluated by comparing single-nucleotide polymorphisms (SNPs) in six protein-coding genes and the 16S rRNA gene. An average of one SNP was observed for every 69 bp in the 16S rRNA, whereas no SNPs were observed in the protein-coding sequences. Hierarchical cluster analysis of 16S rRNA sequences placed the clones into three distinct clades. Bio-efficacy analysis revealed significant heterogeneity in the level virulence and host specificity between the individual clones. The SNP markers developed to the 5' hypervariable region of the 16S rRNA gene may be useful in biotype differentiation within a population of P. penetrans. This study demonstrates an efficient method for generating single spore lines of P. penetrans and gives a deep insight into genetic heterogeneity and varying level of virulence exists within a population parasitizing a specific Meloidogyne sp. host. The results also suggest that the application of generalist spore lines in nematode management may achieve broad RKN control. © 2016 The Society for Applied Microbiology.

  20. SNP genotyping technologies

    DEFF Research Database (Denmark)

    Studer, Bruno; Kölliker, Roland

    2013-01-01

    In the recent years, single nucleotide polymorphism (SNP) markers have emerged as the marker technology of choice for plant genetics and breeding applications. Besides the efficient technologies available for SNP discovery even in complex genomes, one of the main reasons for this is the availabil...

  1. Application of next-generation sequencing technology to study genetic diversity and identify unique SNP markers in bread wheat from Kazakhstan.

    Science.gov (United States)

    Shavrukov, Yuri; Suchecki, Radoslaw; Eliby, Serik; Abugalieva, Aigul; Kenebayev, Serik; Langridge, Peter

    2014-09-28

    New SNP marker platforms offer the opportunity to investigate the relationships between wheat cultivars from different regions and assess the mechanism and processes that have led to adaptation to particular production environments. Wheat breeding has a long history in Kazakhstan and the aim of this study was to explore the relationship between key varieties from Kazakhstan and germplasm from breeding programs for other regions. The study revealed 5,898 polymorphic markers amongst ten cultivars, of which 2,730 were mapped in the consensus genetic map. Mapped SNP markers were distributed almost equally across the A and B genomes, with between 279 and 484 markers assigned to each chromosome. Marker coverage was approximately 10-fold lower in the D genome. There were 863 SNP markers identified as unique to specific cultivars, and clusters of these markers (regions containing more than three closely mapped unique SNPs) showed specific patterns on the consensus genetic map for each cultivar. Significant intra-varietal genetic polymorphism was identified in three cultivars (Tzelinnaya 3C, Kazakhstanskaya rannespelaya and Kazakhstanskaya 15). Phylogenetic analysis based on inter-varietal polymorphism showed that the very old cultivar Erythrospermum 841 was the most genetically distinct from the other nine cultivars from Kazakhstan, falling in a clade together with the American cultivar Sonora and genotypes from Central and South Asia. The modern cultivar Kazakhstanskaya 19 also fell into a separate clade, together with the American cultivar Thatcher. The remaining eight cultivars shared a single sub-clade but were categorised into four clusters. The accumulated data for SNP marker polymorphisms amongst bread wheat genotypes from Kazakhstan may be used for studying genetic diversity in bread wheat, with potential application for marker-assisted selection and the preparation of a set of genotype-specific markers.

  2. An Open Access Database of Genome-wide Association Results

    Directory of Open Access Journals (Sweden)

    Johnson Andrew D

    2009-01-01

    Full Text Available Abstract Background The number of genome-wide association studies (GWAS is growing rapidly leading to the discovery and replication of many new disease loci. Combining results from multiple GWAS datasets may potentially strengthen previous conclusions and suggest new disease loci, pathways or pleiotropic genes. However, no database or centralized resource currently exists that contains anywhere near the full scope of GWAS results. Methods We collected available results from 118 GWAS articles into a database of 56,411 significant SNP-phenotype associations and accompanying information, making this database freely available here. In doing so, we met and describe here a number of challenges to creating an open access database of GWAS results. Through preliminary analyses and characterization of available GWAS, we demonstrate the potential to gain new insights by querying a database across GWAS. Results Using a genomic bin-based density analysis to search for highly associated regions of the genome, positive control loci (e.g., MHC loci were detected with high sensitivity. Likewise, an analysis of highly repeated SNPs across GWAS identified replicated loci (e.g., APOE, LPL. At the same time we identified novel, highly suggestive loci for a variety of traits that did not meet genome-wide significant thresholds in prior analyses, in some cases with strong support from the primary medical genetics literature (SLC16A7, CSMD1, OAS1, suggesting these genes merit further study. Additional adjustment for linkage disequilibrium within most regions with a high density of GWAS associations did not materially alter our findings. Having a centralized database with standardized gene annotation also allowed us to examine the representation of functional gene categories (gene ontologies containing one or more associations among top GWAS results. Genes relating to cell adhesion functions were highly over-represented among significant associations (p -14, a finding

  3. Genome-wide scan of gastrointestinal nematode resistance in closed Angus population selected for minimized influence of MHC.

    Science.gov (United States)

    Kim, Eui-Soo; Sonstegard, Tad S; da Silva, Marcos V G B; Gasbarre, Louis C; Van Tassell, Curtis P

    2015-01-01

    Genetic markers associated with parasite indicator traits are ideal targets for study of marker assisted selection aimed at controlling infections that reduce herd use of anthelminthics. For this study, we collected gastrointestinal (GI) nematode fecal egg count (FEC) data from post-weaning animals of an Angus resource population challenged to a 26 week natural exposure on pasture. In all, data from 487 animals was collected over a 16 year period between 1992 and 2007, most of which were selected for a specific DRB1 allele to reduce the influence of potential allelic variant effects of the MHC locus. A genome-wide association study (GWAS) based on BovineSNP50 genotypes revealed six genomic regions located on bovine Chromosomes 3, 5, 8, 15 and 27; which were significantly associated (-log10 p=4.3) with Box-Cox transformed mean FEC (BC-MFEC). DAVID analysis of the genes within the significant genomic regions suggested a correlation between our results and annotation for genes involved in inflammatory response to infection. Furthermore, ROH and selection signature analyses provided strong evidence that the genomic regions associated BC-MFEC have not been affected by local autozygosity or recent experimental selection. These findings provide useful information for parasite resistance prediction for young grazing cattle and suggest new candidate gene targets for development of disease-modifying therapies or future studies of host response to GI parasite infection.

  4. A novel variational Bayes multiple locus Z-statistic for genome-wide association studies with Bayesian model averaging

    Science.gov (United States)

    Logsdon, Benjamin A.; Carty, Cara L.; Reiner, Alexander P.; Dai, James Y.; Kooperberg, Charles

    2012-01-01

    Motivation: For many complex traits, including height, the majority of variants identified by genome-wide association studies (GWAS) have small effects, leaving a significant proportion of the heritable variation unexplained. Although many penalized multiple regression methodologies have been proposed to increase the power to detect associations for complex genetic architectures, they generally lack mechanisms for false-positive control and diagnostics for model over-fitting. Our methodology is the first penalized multiple regression approach that explicitly controls Type I error rates and provide model over-fitting diagnostics through a novel normally distributed statistic defined for every marker within the GWAS, based on results from a variational Bayes spike regression algorithm. Results: We compare the performance of our method to the lasso and single marker analysis on simulated data and demonstrate that our approach has superior performance in terms of power and Type I error control. In addition, using the Women's Health Initiative (WHI) SNP Health Association Resource (SHARe) GWAS of African-Americans, we show that our method has power to detect additional novel associations with body height. These findings replicate by reaching a stringent cutoff of marginal association in a larger cohort. Availability: An R-package, including an implementation of our variational Bayes spike regression (vBsr) algorithm, is available at http://kooperberg.fhcrc.org/soft.html. Contact: blogsdon@fhcrc.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22563072

  5. Genome-wide scan of gastrointestinal nematode resistance in closed Angus population selected for minimized influence of MHC.

    Directory of Open Access Journals (Sweden)

    Eui-Soo Kim

    Full Text Available Genetic markers associated with parasite indicator traits are ideal targets for study of marker assisted selection aimed at controlling infections that reduce herd use of anthelminthics. For this study, we collected gastrointestinal (GI nematode fecal egg count (FEC data from post-weaning animals of an Angus resource population challenged to a 26 week natural exposure on pasture. In all, data from 487 animals was collected over a 16 year period between 1992 and 2007, most of which were selected for a specific DRB1 allele to reduce the influence of potential allelic variant effects of the MHC locus. A genome-wide association study (GWAS based on BovineSNP50 genotypes revealed six genomic regions located on bovine Chromosomes 3, 5, 8, 15 and 27; which were significantly associated (-log10 p=4.3 with Box-Cox transformed mean FEC (BC-MFEC. DAVID analysis of the genes within the significant genomic regions suggested a correlation between our results and annotation for genes involved in inflammatory response to infection. Furthermore, ROH and selection signature analyses provided strong evidence that the genomic regions associated BC-MFEC have not been affected by local autozygosity or recent experimental selection. These findings provide useful information for parasite resistance prediction for young grazing cattle and suggest new candidate gene targets for development of disease-modifying therapies or future studies of host response to GI parasite infection.

  6. Genome-wide association studies in asthma: progress and pitfalls

    Directory of Open Access Journals (Sweden)

    March ME

    2015-01-01

    Full Text Available Michael E March,1 Patrick MA Sleiman,1,2 Hakon Hakonarson1,2 1Center for Applied Genomics, Children's Hospital of Philadelphia Research Institute, 2Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Abstract: Genetic studies of asthma have revealed that there is considerable heritability to the phenotype. An extensive history of candidate-gene studies has identified a long list of genes associated with immune function that are potentially involved in asthma pathogenesis. However, many of the results of candidate-gene studies have failed to be replicated, leaving in question the true impact of the implicated biological pathways on asthma. With the advent of genome-wide association studies, geneticists are able to examine the association of hundreds of thousands of genetic markers with a phenotype, allowing the hypothesis-free identification of variants associated with disease. Many such studies examining asthma or related phenotypes have been published, and several themes have begun to emerge regarding the biological pathways underpinning asthma. The results of many genome-wide association studies have currently not been replicated, and the large sample sizes required for this experimental strategy invoke difficulties with sample stratification and phenotypic heterogeneity. Recently, large collaborative groups of researchers have formed consortia focused on asthma, with the goals of sharing material and data and standardizing diagnosis and experimental methods. Additionally, research has begun to focus on genetic variants that affect the response to asthma medications and on the biology that generates the heterogeneity in the asthma phenotype. As this work progresses, it will move asthma patients closer to more specific, personalized medicine. Keywords: asthma, genetics, GWAS, pharmacogenetics, biomarkers

  7. Genome-Wide Association Study in Obsessive-Compulsive Disorder: Results from the OCGAS

    Science.gov (United States)

    Mattheisen, Manuel; Samuels, Jack F.; Wang, Ying; Greenberg, Benjamin D.; Fyer, Abby J.; McCracken, James T.; Geller, Daniel A.; Murphy, Dennis L.; Knowles, James A.; Grados, Marco A.; Riddle, Mark A.; Rasmussen, Steven A.; McLaughlin, Nicole C.; Nurmi, Erica; Askland, Kathleen D.; Qin, Hai-De; Cullen, Bernadette A.; Piacentini, John; Pauls, David L.; Bienvenu, O. Joseph; Stewart, S. Evelyn; Liang, Kung-Yee; Goes, Fernando S.; Maher, Brion; Pulver, Ann E.; Shugart, Yin-Yao; Valle, David; Lange, Cristoph; Nestadt, Gerald

    2014-01-01

    Obsessive-compulsive disorder (OCD) is a psychiatric condition characterized by intrusive thoughts and urges and repetitive, intentional behaviors that cause significant distress and impair functioning. The OCD Collaborative Genetics Association Study (OCGAS) is comprised of comprehensively assessed OCD patients, with an early age of OCD onset. After application of a stringent quality control protocol, a total of 1 065 families (containing 1 406 patients with OCD), combined with population-based samples (resulting in a total sample of 5 061 individuals), were studied. An integrative analyses pipeline was utilized, involving association testing at SNP- and gene-levels (via a hybrid approach that allowed for combined analyses of the family- and population-based data). The smallest P-value was observed for a marker on chromosome 9 (near PTPRD, P=4.13×10−7). Pre-synaptic PTPRD promotes the differentiation of glutamatergic synapses and interacts with SLITRK3. Together, both proteins selectively regulate the development of inhibitory GABAergic synapses. Although no SNPs were identified as associated with OCD at genome-wide significance level, follow-up analyses of GWAS signals from a previously published OCD study identified significant enrichment (P=0.0176). Secondary analyses of high confidence interaction partners of DLGAP1 and GRIK2 (both showing evidence for association in our follow-up and the original GWAS study) revealed a trend of association (P=0.075) for a set of genes such as NEUROD6, SV2A, GRIA4, SLC1A2, and PTPRD. Analyses at the gene-level revealed association of IQCK and C16orf88 (both P<1×10−6, experiment-wide significant), as well as OFCC1 (P=6.29×10−5). The suggestive findings in this study await replication in larger samples. PMID:24821223

  8. Genome-Wide Association Study for Nine Plant Architecture Traits in Sorghum

    Directory of Open Access Journals (Sweden)

    Jing Zhao

    2016-07-01

    Full Text Available Sorghum [ (L Moench], an important grain and forage crop, is receiving significant attention as a lignocellulosic feedstock because of its water-use efficiency and high biomass yield potential. Because of the advancement of genotyping and sequencing technologies, genome-wide association study (GWAS has become a routinely used method to investigate the genetic mechanisms underlying natural phenotypic variation. In this study, we performed a GWAS for nine grain and biomass-related plant architecture traits to determine their overall genetic architecture and the specific association of allelic variants in gibberellin (GA biosynthesis and signaling genes with these phenotypes. A total of 101 single-nucleotide polymorphism (SNP representative regions were associated with at least one of the nine traits, and two of the significant markers correspond to GA candidate genes, ( and (, affecting plant height and seed number, respectively. The resolution of a previously reported quantitative trait loci (QTL for leaf angle on chromosome 7 was increased to a 1.67 Mb region containing seven candidate genes with good prospects for further investigation. This study provides new knowledge of the association of GA genes with plant architecture traits and the genomic regions controlling variation in leaf angle, stem circumference, internode number, tiller number, seed number, panicle exsertion, and panicle length. The GA gene affecting seed number variation ( and the genomic region on chromosome 7 associated with variation in leaf angle are also important outcomes of this study and represent the foundation of future validation studies needed to apply this knowledge in breeding programs.

  9. Genome-wide association study identifies candidate genes for starch content regulation in maize kernels

    Directory of Open Access Journals (Sweden)

    Na Liu

    2016-07-01

    Full Text Available Kernel starch content is an important trait in maize (Zea mays L. as it accounts for 65% to 75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60% to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001, among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437 is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops.

  10. Genotyping by Sequencing in Almond: SNP Discovery, Linkage Mapping, and Marker Design

    Directory of Open Access Journals (Sweden)

    Shashi N. Goonetilleke

    2018-01-01

    Full Text Available In crop plant genetics, linkage maps provide the basis for the mapping of loci that affect important traits and for the selection of markers to be applied in crop improvement. In outcrossing species such as almond (Prunus dulcis Mill. D. A. Webb, application of a double pseudotestcross mapping approach to the F1 progeny of a biparental cross leads to the construction of a linkage map for each parent. Here, we report on the application of genotyping by sequencing to discover and map single nucleotide polymorphisms in the almond cultivars “Nonpareil” and “Lauranne.” Allele-specific marker assays were developed for 309 tag pairs. Application of these assays to 231 Nonpareil × Lauranne F1 progeny provided robust linkage maps for each parent. Analysis of phenotypic data for shell hardness demonstrated the utility of these maps for quantitative trait locus mapping. Comparison of these maps to the peach genome assembly confirmed high synteny and collinearity between the peach and almond genomes. The marker assays were applied to progeny from several other Nonpareil crosses, providing the basis for a composite linkage map of Nonpareil. Applications of the assays to a panel of almond clones and a panel of rootstocks used for almond production demonstrated the broad applicability of the markers and provide subsets of markers that could be used to discriminate among accessions. The sequence-based linkage maps and single nucleotide polymorphism assays presented here could be useful resources for the genetic analysis and genetic improvement of almond.

  11. Genome-wide study of association and interaction with maternal cytomegalovirus infection suggests new schizophrenia loci

    DEFF Research Database (Denmark)

    Børglum, A D; Demontis, D; Grove, J

    2014-01-01

    Genetic and environmental components as well as their interaction contribute to the risk of schizophrenia, making it highly relevant to include environmental factors in genetic studies of schizophrenia. This study comprises genome-wide association (GWA) and follow-up analyses of all individuals...... born in Denmark since 1981 and diagnosed with schizophrenia as well as controls from the same birth cohort. Furthermore, we present the first genome-wide interaction survey of single nucleotide polymorphisms (SNPs) and maternal cytomegalovirus (CMV) infection. The GWA analysis included 888 cases...... was found for rs7902091 (P(SNP × CMV)=7.3 × 10(-7)) in CTNNA3, a gene not previously implicated in schizophrenia, stressing the importance of including environmental factors in genetic studies....

  12. Genome-wide analysis identifies 12 loci influencing human reproductive behavior

    Science.gov (United States)

    Barban, Nicola; Jansen, Rick; de Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J.; Tropf, Felix C.; Shen, Xia; Wilson, James F.; Chasman, Daniel I.; Nolte, Ilja M.; Tragante, Vinicius; van der Laan, Sander W.; Perry, John R. B.; Kong, Augustine; Ahluwalia, Tarunveer; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F.; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E; Turman, Constance; Concas, Maria Pina; Cordell, Heather J.; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F.; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J.; Gieger, Christian; Gunderson, Erica P.; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K.; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A.; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F.; McMahon, George; Meddens, S. Fleur W.; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A.; Monnereau, Claire; van der Most, Peter J.; Myhre, Ronny; Nalls, Mike A.; Nutile, Teresa; Panagiota, Kalafati Ioanna; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B.; Rich-Edwards, Janet; Rietveld, Cornelius A.; Robino, Antonietta; Rose, Lynda M.; Rueedi, Rico; Ryan, Kathy; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A.; Stolk, Lisette; Streeten, Elizabeth; Tonjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V.; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I.; Buring, Julie E.; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R.; Cucca, Francesco; Daniela, Toniolo; Davey-Smith, George; Deary, Ian J.; Dedoussis, George; Deloukas, Panos; van Duijn, Cornelia M.; de Geus, Eco JC.; Eriksson, Johan G.; Evans, Denis A.; Faul, Jessica D.; Felicita, Sala Cinzia; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J.F.; de Haan, Hugoline G.; Haerting, Johannes; Harris, Tamara B.; Heath, Andrew C.; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G; Hopper, John; Hypponen, Elina; Jacobsson, Bo; Jaddoe, Vincent W. V.; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L.R.; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William; Lai, Sandra; Lehtimäki, Terho; Liewald, David C; Lindgren, Cecilia; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; McQuillan, Ruth; Medland, Sarah E.; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Michela, Traglia; Milani, Lili; Mitchell, Paul; Montgomery, Grant W.; Mook-Kanamori, Dennis; de Mutsert, Renée; Nohr, Ellen A; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K.; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda WJH; Perola, Markus; Peyser, Patricia A.; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J.; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M.; Ring, Susan M.; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D.; Starr, John M.; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A.; Thorsteinsdottir, Unnur; Thurik, A. Roy; Timpson, Nicholas J.; Tönjes, Anke; Tung, Joyce Y.; Uitterlinden, André G.; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G.; Wang, Jie Jin; Wareham, Nicholas J.; Weir, David R.; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F.; Zondervan, Krina T.; Stefansson, Kari; Krueger, Robert F.; Lee, James J.; Benjamin, Daniel J.; Cesarini, David; Koellinger, Philipp D.; den Hoed, Marcel; Snieder, Harold; Mills, Melinda C.

    2017-01-01

    The genetic architecture of human reproductive behavior – age at first birth (AFB) and number of children ever born (NEB) – has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified and the underlying mechanisms of AFB and NEB are poorly understood. We report the largest genome-wide association study to date of both sexes including 251,151 individuals for AFB and 343,072 for NEB. We identified 12 independent loci that are significantly associated with AFB and/or NEB in a SNP-based genome-wide association study, and four additional loci in a gene-based effort. These loci harbor genes that are likely to play a role – either directly or by affecting non-local gene expression – in human reproduction and infertility, thereby increasing our understanding of these complex traits. PMID:27798627

  13. Genome wide association studies for body conformation traits in the Chinese Holstein cattle population

    DEFF Research Database (Denmark)

    Wu, Xiaoping; Fang, Ming; Liu, Lin

    2013-01-01

    .Results: The Illumina BovineSNP50 BeadChip was used to identify single nucleotide polymorphisms (SNPs) that are associated with body conformation traits. A least absolute shrinkage and selection operator (LASSO) was applied to detect multiple SNPs simultaneously for 29 body conformation traits with 1,314 Chinese...... Holstein cattle and 52,166 SNPs. Totally, 59 genome-wide significant SNPs associated with 26 conformation traits were detected by genome-wide association analysis; five SNPs were within previously reported QTL regions (Animal Quantitative Trait Loci (QTL) database) and 11 were very close to the reported...... SNPs. Twenty-two SNPs were located within annotated gene regions, while the remainder were 0.6-826 kb away from known genes. Some of the genes had clear biological functions related to conformation traits. By combining information about the previously reported QTL regions and the biological functions...

  14. Genome-wide single-generation signatures of local selection in the panmictic European eel

    DEFF Research Database (Denmark)

    Pujolar, J. M.; Jacobsen, M. W.; Als, Thomas Damm

    2014-01-01

    Next-generation sequencing and the collection of genome-wide data allow identifying adaptive variation and footprints of directional selection. Using a large SNP data set from 259 RAD-sequenced European eel individuals (glass eels) from eight locations between 34 and 64oN, we examined the patterns...... of genome-wide genetic diversity across locations. We tested for local selection by searching for increased population differentiation using FST-based outlier tests and by testing for significant associations between allele frequencies and environmental variables. The overall low genetic differentiation...... with single-generation signatures of spatially varying selection acting on glass eels. After screening 50 354 SNPs, a total of 754 potentially locally selected SNPs were identified. Candidate genes for local selection constituted a wide array of functions, including calcium signalling, neuroactive ligand...

  15. Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L.) Using SLAF-seq.

    Science.gov (United States)

    Xie, Dongwei; Dai, Zhigang; Yang, Zemao; Sun, Jian; Zhao, Debao; Yang, Xue; Zhang, Liguo; Tang, Qing; Su, Jianguang

    2017-01-01

    Flax ( Linum usitatissimum L.) is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq) was employed to perform a genome-wide association study (GWAS) for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP) loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM) and a mixed linear model (MLM) as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.

  16. Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L. Using SLAF-seq

    Directory of Open Access Journals (Sweden)

    Dongwei Xie

    2018-01-01

    Full Text Available Flax (Linum usitatissimum L. is an important cash crop, and its agronomic traits directly affect yield and quality. Molecular studies on flax remain inadequate because relatively few flax genes have been associated with agronomic traits or have been identified as having potential applications. To identify markers and candidate genes that can potentially be used for genetic improvement of crucial agronomic traits, we examined 224 specimens of core flax germplasm; specifically, phenotypic data for key traits, including plant height, technical length, number of branches, number of fruits, and 1000-grain weight were investigated under three environmental conditions before specific-locus amplified fragment sequencing (SLAF-seq was employed to perform a genome-wide association study (GWAS for these five agronomic traits. Subsequently, the results were used to screen single nucleotide polymorphism (SNP loci and candidate genes that exhibited a significant correlation with the important agronomic traits. Our analyses identified a total of 42 SNP loci that showed significant correlations with the five important agronomic flax traits. Next, candidate genes were screened in the 10 kb zone of each of the 42 SNP loci. These SNP loci were then analyzed by a more stringent screening via co-identification using both a general linear model (GLM and a mixed linear model (MLM as well as co-occurrences in at least two of the three environments, whereby 15 final candidate genes were obtained. Based on these results, we determined that UGT and PL are candidate genes for plant height, GRAS and XTH are candidate genes for the number of branches, Contig1437 and LU0019C12 are candidate genes for the number of fruits, and PHO1 is a candidate gene for the 1000-seed weight. We propose that the identified SNP loci and corresponding candidate genes might serve as a biological basis for improving crucial agronomic flax traits.

  17. Genetic diversity and population structure assessed by SSR and SNP markers in a large germplasm collection of grape

    Science.gov (United States)

    2013-01-01

    Background The economic importance of grapevine has driven significant efforts in genomics to accelerate the exploitation of Vitis resources for development of new cultivars. However, although a large number of clonally propagated accessions are maintained in grape germplasm collections worldwide, their use for crop improvement is limited by the scarcity of information on genetic diversity, population structure and proper phenotypic assessment. The identification of representative and manageable subset of accessions would facilitate access to the diversity available in large collections. A genome-wide germplasm characterization using molecular markers can offer reliable tools for adjusting the quality and representativeness of such core samples. Results We investigated patterns of molecular diversity at 22 common microsatellite loci and 384 single nucleotide polymorphisms (SNPs) in 2273 accessions of domesticated grapevine V. vinifera ssp. sativa, its wild relative V. vinifera ssp. sylvestris, interspecific hybrid cultivars and rootstocks. Despite the large number of putative duplicates and extensive clonal relationships among the accessions, we observed high level of genetic variation. In the total germplasm collection the average genetic diversity, as quantified by the expected heterozygosity, was higher for SSR loci (0.81) than for SNPs (0.34). The analysis of the genetic structure in the grape germplasm collection revealed several levels of stratification. The primary division was between accessions of V. vinifera and non-vinifera, followed by the distinction between wild and domesticated grapevine. Intra-specific subgroups were detected within cultivated grapevine representing different eco-geographic groups. The comparison of a phenological core collection and genetic core collections showed that the latter retained more genetic diversity, while maintaining a similar phenotypic variability. Conclusions The comprehensive molecular characterization of our grape

  18. SNP discovery and marker development for disease resistance candidate genes in common carp (Cyprinus carpio)

    Science.gov (United States)

    Single nucleotide polymorphisms (SNPs) in immune response genes have been reported as markers of susceptibility to infectious diseases in human and livestock. A disease caused by cyprinid herpes virus 3 (CyHV-3) is highly contagious and virulent in common carp. With the aim to investigate the gene...

  19. Genome-wide association scan in HIV-1-infected individuals identifying variants influencing disease course.

    Directory of Open Access Journals (Sweden)

    Daniëlle van Manen

    Full Text Available BACKGROUND: AIDS develops typically after 7-11 years of untreated HIV-1 infection, with extremes of very rapid disease progression (15 years. To reveal additional host genetic factors that may impact on the clinical course of HIV-1 infection, we designed a genome-wide association study (GWAS in 404 participants of the Amsterdam Cohort Studies on HIV-1 infection and AIDS. METHODS: The association of SNP genotypes with the clinical course of HIV-1 infection was tested in Cox regression survival analyses using AIDS-diagnosis and AIDS-related death as endpoints. RESULTS: Multiple, not previously identified SNPs, were identified to be strongly associated with disease progression after HIV-1 infection, albeit not genome-wide significant. However, three independent SNPs in the top ten associations between SNP genotypes and time between seroconversion and AIDS-diagnosis, and one from the top ten associations between SNP genotypes and time between seroconversion and AIDS-related death, had P-values smaller than 0.05 in the French Genomics of Resistance to Immunodeficiency Virus cohort on disease progression. CONCLUSIONS: Our study emphasizes that the use of different phenotypes in GWAS may be useful to unravel the full spectrum of host genetic factors that may be associated with the clinical course of HIV-1 infection.

  20. Genome-Wide Association Scan in HIV-1-Infected Individuals Identifying Variants Influencing Disease Course

    Science.gov (United States)

    van Manen, Daniëlle; Delaneau, Olivier; Kootstra, Neeltje A.; Boeser-Nunnink, Brigitte D.; Limou, Sophie; Bol, Sebastiaan M.; Burger, Judith A.; Zwinderman, Aeilko H.; Moerland, Perry D.; van 't Slot, Ruben; Zagury, Jean-François; van 't Wout, Angélique B.; Schuitemaker, Hanneke

    2011-01-01

    Background AIDS develops typically after 7–11 years of untreated HIV-1 infection, with extremes of very rapid disease progression (15 years). To reveal additional host genetic factors that may impact on the clinical course of HIV-1 infection, we designed a genome-wide association study (GWAS) in 404 participants of the Amsterdam Cohort Studies on HIV-1 infection and AIDS. Methods The association of SNP genotypes with the clinical course of HIV-1 infection was tested in Cox regression survival analyses using AIDS-diagnosis and AIDS-related death as endpoints. Results Multiple, not previously identified SNPs, were identified to be strongly associated with disease progression after HIV-1 infection, albeit not genome-wide significant. However, three independent SNPs in the top ten associations between SNP genotypes and time between seroconversion and AIDS-diagnosis, and one from the top ten associations between SNP genotypes and time between seroconversion and AIDS-related death, had P-values smaller than 0.05 in the French Genomics of Resistance to Immunodeficiency Virus cohort on disease progression. Conclusions Our study emphasizes that the use of different phenotypes in GWAS may be useful to unravel the full spectrum of host genetic factors that may be associated with the clinical course of HIV-1 infection. PMID:21811574

  1. Mapping of a major QTL for salt tolerance of mature field-grown maize plants based on SNP markers.

    Science.gov (United States)

    Luo, Meijie; Zhao, Yanxin; Zhang, Ruyang; Xing, Jinfeng; Duan, Minxiao; Li, Jingna; Wang, Naishun; Wang, Wenguang; Zhang, Shasha; Chen, Zhihui; Zhang, Huasheng; Shi, Zi; Song, Wei; Zhao, Jiuran

    2017-08-15

    Salt stress significantly restricts plant growth and production. Maize is an important food and economic crop but is also a salt sensitive crop. Identification of the genetic architecture controlling salt tolerance facilitates breeders to select salt tolerant lines. However, the critical quantitative trait loci (QTLs) responsible for the salt tolerance of field-grown maize plants are still unknown. To map the main genetic factors contributing to salt tolerance in mature maize, a double haploid population (240 individuals) and 1317 single nucleotide polymorphism (SNP) markers were employed to produce a genetic linkage map covering 1462.05 cM. Plant height of mature maize cultivated in the saline field (SPH) and plant height-based salt tolerance index (ratio of plant height between saline and control fields, PHI) were used to evaluate salt tolerance of mature maize plants. A major QTL for SPH was detected on Chromosome 1 with the LOD score of 22.4, which explained 31.2% of the phenotypic variation. In addition, the major QTL conditioning PHI was also mapped at the same position on Chromosome 1, and two candidate genes involving in ion homeostasis were identified within the confidence interval of this QTL. The detection of the major QTL in adult maize plant establishes the basis for the map-based cloning of genes associated with salt tolerance and provides a potential target for marker assisted selection in developing maize varieties with salt tolerance.

  2. Genome-wide selection signatures in Pinzgau cattle

    Directory of Open Access Journals (Sweden)

    Radovan Kasarda

    2015-08-01

    Full Text Available The aim of this study was to identify the evidence of recent selection based on estimation of the integrated Haplotype Score (iHS, population differentiation index (FST and characterize affected regions near QTL associated with traits under strong selection in Pinzgau cattle. In total 21 Austrian and 19 Slovak purebreed bulls genotyped with Illumina bovineHD and  bovineSNP50 BeadChip were used to identify genomic regions under selection. Only autosomal loci with call rate higher than 90%, minor allele frequency higher than 0.01 and Hardy-Weinberg equlibrium limit of 0.001 were included in the subsequent analyses of selection sweeps presence. The final dataset was consisted from 30538 SNPs with 81.86 kb average adjacent SNPs spacing. The iHS score were averaged into non-overlapping 500 kb segments across the genome. The FST values were also plotted against genome position based on sliding windows approach and averaged over 8 consecutive SNPs. Based on integrated Haplotype Score evaluation only 7 regions with iHS score higher than 1.7 was found. The average iHS score observed for each adjacent syntenic regions indicated slight effect of recent selection in analysed group of Pinzgau bulls. The level of genetic differentiation between Austrian and Slovak bulls estimated based on FST index was low. Only 24% of FST values calculated for each SNP was greather than 0.01. By using sliding windows approach was found that 5% of analysed windows had higher value than 0.01. Our results indicated use of similar selection scheme in breeding programs of Slovak and Austrian Pinzgau bulls. The evidence for genome-wide association between signatures of selection and regions affecting complex traits such as milk production was insignificant, because the loci in segments identified as affected by selection were very distant from each other. Identification of genomic regions that may be under pressure of selection for phenotypic traits to better understanding of the

  3. A new panel of SNP markers for the individual identification of North American pumas

    Science.gov (United States)

    Fitak, Robert R.; Naidu, Ashwin; Thompson, Ron W.; Culver, Melanie

    2016-01-01

    Pumas Puma concolor are one of the most studied terrestrial carnivores because of their widespread distribution, substantial ecological impacts, and conflicts with humans. Over the past decade, managing pumas has involved extensive efforts including the use of genetic methods. Microsatellites have been the most commonly used genetic markers; however, technical artifacts and little overlap of frequently used loci render large-scale comparison of puma genetic data across studies challenging. Therefore, a panel of genetic markers that can produce consistent genotypes across studies without the need for extensive calibrations is essential for range-wide genetic management of puma populations. Here, we describe the development of PumaPlex, a high-throughput assay to genotype 25 single nucleotide polymorphisms in pumas. We validated PumaPlex in 748 North American pumas Puma concolor couguar, and demonstrated its ability to generate reproducible genotypes and accurately identify individuals. Furthermore, in a test using fecal deoxyribonucleic acid (DNA) samples, we found that PumaPlex produced significantly more genotypes with fewer errors than 12 microsatellite loci, 8 of which are commonly used. Our results demonstrate that PumaPlex is a valuable tool for the genetic monitoring and management of North American puma populations. Given the analytical simplicity, reproducibility, and high-throughput capability of single nucleotide polymorphisms, PumaPlex provides a standard panel of markers that promotes the comparison of genotypes across studies and independent of the genotyping technology used.

  4. Development of admixture mapping panels for African Americans from commercial high-density SNP arrays

    Directory of Open Access Journals (Sweden)

    Dunston Georgia M

    2010-07-01

    Full Text Available Abstract Background Admixture mapping is a powerful approach for identifying genetic variants involved in human disease that exploits the unique genomic structure in recently admixed populations. To use existing published panels of ancestry-informative markers (AIMs for admixture mapping, markers have to be genotyped de novo for each admixed study sample and samples representing the ancestral parental populations. The increased availability of dense marker data on commercial chips has made it feasible to develop panels wherein the markers need not be predetermined. Results We developed two panels of AIMs (~2,000 markers each based on the Affymetrix Genome-Wide Human SNP Array 6.0 for admixture mapping with African American samples. These two AIM panels had good map power that was higher than that of a denser panel of ~20,000 random markers as well as other published panels of AIMs. As a test case, we applied the panels in an admixture mapping study of hypertension in African Americans in the Washington, D.C. metropolitan area. Conclusions Developing marker panels for admixture mapping from existing genome-wide genotype data offers two major advantages: (1 no de novo genotyping needs to be done, thereby saving costs, and (2 markers can be filtered for various quality measures and replacement markers (to minimize gaps can be selected at no additional cost. Panels of carefully selected AIMs have two major advantages over panels of random markers: (1 the map power from sparser panels of AIMs is higher than that of ~10-fold denser panels of random markers, and (2 clusters can be labeled based on information from the parental populations. With current technology, chip-based genome-wide genotyping is less expensive than genotyping ~20,000 random markers. The major advantage of using random markers is the absence of ascertainment effects resulting from the process of selecting markers. The ability to develop marker panels informative for ancestry from

  5. Genome-wide association mapping for stripe rust (Puccinia striiformis F. sp. tritici) in US Pacific Northwest winter wheat (Triticum aestivum L.).

    Science.gov (United States)

    Naruoka, Y; Garland-Campbell, K A; Carter, A H

    2015-06-01

    Potential novel and known QTL for race-specific all-stage and adult plant resistance to stripe rust were identified by genome-wide association mapping in the US PNW winter wheat accessions. Stripe rust (Puccinia striiformis F. sp. tritici; also known as yellow rust) is a globally devastating disease of wheat (Triticum aestivum L.) and a major threat to wheat production in the US Pacific Northwest (PNW), therefore both adult plant and all-stage resistance have been introduced into the winter wheat breeding programs in the PNW. The goal of this study was to identify quantitative trait loci (QTL) and molecular markers for these resistances through genome-wide association (GWAS) mapping in winter wheat accessions adapted to the PNW. Stripe rust response for adult plants was evaluated in naturally occurring epidemics in a total of nine environments in Washington State, USA. Seedling response was evaluated with three races under artificial inoculation in the greenhouse. The panel was genotyped with the 9K Illumina Wheat single nucleotide polymorphism (SNP) array and additional markers linked to previously reported genes and QTL for stripe rust resistance. The population was grouped into three sub-populations. Markers linked to Yr17 and previously reported QTL for stripe rust resistance were identified on chromosomes 1B, 2A, and 2B. Potentially novel QTL associated with race-specific seedling response were identified on chromosomes 1B and 1D. Potentially novel QTL associated with adult plant response were located on chromosomes 2A, 2B, 3B, 4A, and 4B. Stripe rust was reduced when multiple alleles for resistance were present. The resistant allele frequencies were different among sub-populations in the panel. This information provides breeders with germplasm and closely linked markers for stripe rust resistance to facilitate the transfer of multiple loci for durable stripe rust resistance into wheat breeding lines and cultivars.

  6. Rapid scoring of genes in microbial pan-genome-wide association studies with Scoary.

    Science.gov (United States)

    Brynildsrud, Ola; Bohlin, Jon; Scheffer, Lonneke; Eldholm, Vegard

    2016-11-25

    Genome-wide association studies (GWAS) have become indispensable in human medicine and genomics, but very few have been carried out on bacteria. Here we introduce Scoary, an ultra-fast, easy-to-use, and widely applicable software tool that scores the components of the pan-genome for associations to observed phenotypic traits while accounting for population stratification, with minimal assumptions about evolutionary processes. We call our approach pan-GWAS to distinguish it from traditional, single nucleotide polymorphism (SNP)-based GWAS. Scoary is implemented in Python and is available under an open source GPLv3 license at https://github.com/AdmiralenOla/Scoary .

  7. Prediction of Cacao (Theobroma cacao) Resistance to Moniliophthora spp. Diseases via Genome-Wide Association Analysis and Genomic Selection.

    Science.gov (United States)

    McElroy, Michel S; Navarro, Alberto J R; Mustiga, Guiliana; Stack, Conrad; Gezan, Salvador; Peña, Geover; Sarabia, Widem; Saquicela, Diego; Sotomayor, Ignacio; Douglas, Gavin M; Migicovsky, Zoë; Amores, Freddy; Tarqui, Omar; Myles, Sean; Motamayor, Juan C

    2018-01-01

    Cacao ( Theobroma cacao ) is a globally important crop, and its yield is severely restricted by disease. Two of the most damaging diseases, witches' broom disease (WBD) and frosty pod rot disease (FPRD), are caused by a pair of related fungi: Moniliophthora perniciosa and Moniliophthora roreri , respectively. Resistant cultivars are the most effective long-term strategy to address Moniliophthora diseases, but efficiently generating resistant and productive new cultivars will require robust methods for screening germplasm before field testing. Marker-assisted selection (MAS) and genomic selection (GS) provide two potential avenues for predicting the performance of new genotypes, potentially increasing the selection gain per unit time. To test the effectiveness of these two approaches, we performed a genome-wide association study (GWAS) and GS on three related populations of cacao in Ecuador genotyped with a 15K single nucleotide polymorphism (SNP) microarray for three measures of WBD infection (vegetative broom, cushion broom, and chirimoya pod), one of FPRD (monilia pod) and two productivity traits (total fresh weight of pods and % healthy pods produced). GWAS yielded several SNPs associated with disease resistance in each population, but none were significantly correlated with the same trait in other populations. Genomic selection, using one population as a training set to estimate the phenotypes of the remaining two (composed of different families), varied among traits, from a mean prediction accuracy of 0.46 (vegetative broom) to 0.15 (monilia pod), and varied between training populations. Simulations demonstrated that selecting seedlings using GWAS markers alone generates no improvement over selecting at random, but that GS improves the selection process significantly. Our results suggest that the GWAS markers discovered here are not sufficiently predictive across diverse germplasm to be useful for MAS, but that using all markers in a GS framework holds

  8. Prediction of Cacao (Theobroma cacao Resistance to Moniliophthora spp. Diseases via Genome-Wide Association Analysis and Genomic Selection

    Directory of Open Access Journals (Sweden)

    Michel S. McElroy

    2018-03-01

    Full Text Available Cacao (Theobroma cacao is a globally important crop, and its yield is severely restricted by disease. Two of the most damaging diseases, witches’ broom disease (WBD and frosty pod rot disease (FPRD, are caused by a pair of related fungi: Moniliophthora perniciosa and Moniliophthora roreri, respectively. Resistant cultivars are the most effective long-term strategy to address Moniliophthora diseases, but efficiently generating resistant and productive new cultivars will require robust methods for screening germplasm before field testing. Marker-assisted selection (MAS and genomic selection (GS provide two potential avenues for predicting the performance of new genotypes, potentially increasing the selection gain per unit time. To test the effectiveness of these two approaches, we performed a genome-wide association study (GWAS and GS on three related populations of cacao in Ecuador genotyped with a 15K single nucleotide polymorphism (SNP microarray for three measures of WBD infection (vegetative broom, cushion broom, and chirimoya pod, one of FPRD (monilia pod and two productivity traits (total fresh weight of pods and % healthy pods produced. GWAS yielded several SNPs associated with disease resistance in each population, but none were significantly correlated with the same trait in other populations. Genomic selection, using one population as a training set to estimate the phenotypes of the remaining two (composed of different families, varied among traits, from a mean prediction accuracy of 0.46 (vegetative broom to 0.15 (monilia pod, and varied between training populations. Simulations demonstrated that selecting seedlings using GWAS markers alone generates no improvement over selecting at random, but that GS improves the selection process significantly. Our results suggest that the GWAS markers discovered here are not sufficiently predictive across diverse germplasm to be useful for MAS, but that using all markers in a GS framework holds

  9. Outlier SNP markers reveal fine-scale genetic structuring across European hake populations (Merluccius merluccius)

    DEFF Research Database (Denmark)

    Milano, I.; Babbucci, M.; Cariani, A.

    2014-01-01

    fishery. Analysis of 850 individuals from 19 locations across the entire distribution range showed evidence for several outlier loci, with significantly higher resolving power. While 299 putatively neutral SNPs confirmed the genetic break between basins (FCT = 0.016) and weak differentiation within basins...... even when neutral markers provide genetic homogeneity across populations. Here, 381 SNPs located in transcribed regions were used to assess largeand fine-scale population structure in the European hake (Merluccius merluccius), a widely distributed demersal species of high priority for the European...

  10. Genome-Wide Mutagenesis in Borrelia burgdorferi.

    Science.gov (United States)

    Lin, Tao; Gao, Lihui

    2018-01-01

    population of mutants with different tags, after recovered from different tissues of infected mice and ticks, mutants from output pool and input pool are detected using high-throughput, semi-quantitative Luminex ® FLEXMAP™ or next-generation sequencing (Tn-seq) technologies. Thus far, we have created a high-density, sequence-defined transposon library of over 6600 STM mutants for the efficient genome-wide investigation of genes and gene products required for wild-type pathogenesis, host-pathogen interactions, in vitro growth, in vivo survival, physiology, morphology, chemotaxis, motility, structure, metabolism, gene regulation, plasmid maintenance and replication, etc. The insertion sites of 4480 transposon mutants have been determined. About 800 predicted protein-encoding genes in the genome were disrupted in the STM transposon library. The infectivity and some functions of 800 mutants in 500 genes have been determined. Analysis of these transposon mutants has yielded valuable information regarding the genes and gene products important in the pathogenesis and biology of B. burgdorferi and its tick vectors.

  11. Genome-wide association study for claw disorders and trimming status in dairy cattle.

    Science.gov (United States)

    van der Spek, D; van Arendonk, J A M; Bovenhuis, H

    2015-02-01

    Performing a genome-wide association study (GWAS) might add to a better understanding of the development of claw disorders and the need for trimming. Therefore, the aim of the current study was to perform a GWAS on claw disorders and trimming status and to validate the results for claw disorders based on an independent data set. Data consisted of 20,474 cows with phenotypes for claw disorders and 50,238 cows with phenotypes for trimming status. Recorded claw disorders used in the current study were double sole (DS), interdigital hyperplasia (IH), sole hemorrhage (SH), sole ulcer (SU), white line separation (WLS), a combination of infectious claw disorders consisting of (inter-)digital dermatitis and heel erosion, and a combination of laminitis-related claw disorders (DS, SH, SU, and WLS). Of the cows with phenotypes for claw disorders, 1,771 cows were genotyped and these cow data were used for the GWAS on claw disorders. A SNP was considered significant when the false discovery rate≤0.05 and suggestive when the false discovery rate≤0.20. An independent data set of 185 genotyped bulls having at least 5 daughters with phenotypes (6,824 daughters in total) for claw disorders was used to validate significant and suggestive SNP detected based on the cow data. To analyze the trait "trimming status" (i.e., the need for claw trimming), a data set with 327 genotyped bulls having at least 5 daughters with phenotypes (18,525 daughters in total) was used. Based on the cow data, in total 10 significant and 45 suggestive SNP were detected for claw disorders. The 10 significant SNP were associated with SU, and mainly located on BTA8. The suggestive SNP were associated with DS, IH, SU, and laminitis-related claw disorders. Three of the suggestive SNP were validated in the data set of 185 bulls, and were located on BTA13, BTA14, and BTA17. For infectious claw disorders, SH, and WLS, no significant or suggestive SNP associations were detected. For trimming status, 1 significant

  12. A Genome-Wide Association Study in Large White and Landrace Pig Populations for Number Piglets Born Alive

    Science.gov (United States)

    Bergfelder-Drüing, Sarah; Grosse-Brinkhaus, Christine; Lind, Bianca; Erbe, Malena; Schellander, Karl; Simianer, Henner; Tholen, Ernst

    2015-01-01

    The number of piglets born alive (NBA) per litter is one of the most important traits in pig breeding due to its influence on production efficiency. It is difficult to improve NBA because the heritability of the trait is low and it is governed by a high number of loci with low to moderate effects. To clarify the biological and genetic background of NBA, genome-wide association studies (GWAS) were performed using 4,012 Large White and Landrace pigs from herdbook and commercial breeding companies in Germany (3), Austria (1) and Switzerland (1). The animals were genotyped with the Illumina PorcineSNP60 BeadChip. Because of population stratifications within and between breeds, clusters were formed using the genetic distances between the populations. Five clusters for each breed were formed and analysed by GWAS approaches. In total, 17 different significant markers affecting NBA were found in regions with known effects on female reproduction. No overlapping significant chromosome areas or QTL between Large White and Landrace breed were detected. PMID:25781935

  13. A genome-wide association study in large white and landrace pig populations for number piglets born alive.

    Directory of Open Access Journals (Sweden)

    Sarah Bergfelder-Drüing

    Full Text Available The number of piglets born alive (NBA per litter is one of the most important traits in pig breeding due to its influence on production efficiency. It is difficult to improve NBA because the heritability of the trait is low and it is governed by a high number of loci with low to moderate effects. To clarify the biological and genetic background of NBA, genome-wide association studies (GWAS were performed using 4,012 Large White and Landrace pigs from herdbook and commercial breeding companies in Germany (3, Austria (1 and Switzerland (1. The animals were genotyped with the Illumina PorcineSNP60 BeadChip. Because of population stratifications within and between breeds, clusters were formed using the genetic distances between the populations. Five clusters for each breed were formed and analysed by GWAS approaches. In total, 17 different significant markers affecting NBA were found in regions with known effects on female reproduction. No overlapping significant chromosome areas or QTL between Large White and Landrace breed were detected.

  14. Genome-wide Association Study for Warner-Bratzler Shear Force and Sensory Traits in Hanwoo (Korean Cattle

    Directory of Open Access Journals (Sweden)

    C. G. Dang

    2014-09-01

    Full Text Available Significant SNPs associated with Warner-Bratzler (WB shear force and sensory traits were confirmed for Hanwoo beef (Korean cattle. A Bonferroni-corrected genome-wide significant association (p<1.3×10−6 was detected with only one single nucleotide polymorphism (SNP on chromosome 5 for WB shear force. A slightly higher number of SNPs was significantly (p<0.001 associated with WB shear force than with other sensory traits. Further, 50, 25, 29, and 34 SNPs were significantly associated with WB shear force, tenderness, juiciness, and flavor likeness, respectively. The SNPs between p = 0.001 and p = 0.0001 thresholds explained 3% to 9% of the phenotypic variance, while the most significant SNPs accounted for 7% to 12% of the phenotypic variance. In conclusion, because WB shear force and sensory evaluation were moderately affected by a few loci and minimally affected by other loci, further studies are required by using a large sample size and high marker density.

  15. Genome-Wide Association Mapping for Resistance to Leaf and Stripe Rust in Winter-Habit Hexaploid Wheat Landraces.

    Directory of Open Access Journals (Sweden)

    Albert Kertho

    Full Text Available Leaf rust, caused by Puccinia triticina (Pt, and stripe rust, caused by P. striiformis f. sp. tritici (Pst, are destructive foliar diseases of wheat worldwide. Breeding for disease resistance is the preferred strategy of managing both diseases. The continued emergence of new races of Pt and Pst requires a constant search for new sources of resistance. Here we report a genome-wide association analysis of 567 winter wheat (Triticum aestivum landrace accessions using the Infinium iSelect 9K wheat SNP array to identify loci associated with seedling resistance to five races of Pt (MDCL, MFPS, THBL, TDBG, and TBDJ and one race of Pst (PSTv-37 frequently found in the Northern Great Plains of the United States. Mixed linear models identified 65 and eight significant markers associated with leaf rust and stripe rust, respectively. Further, we identified 31 and three QTL associated with resistance to Pt and Pst, respectively. Eleven QTL, identified on chromosomes 3A, 4A, 5A, and 6D, are previously unknown for leaf rust resistance in T. aestivum.

  16. A Genome-Wide Association Study Reveals Genes Associated with Fusarium Ear Rot Resistance in a Maize Core Diversity Panel

    Science.gov (United States)

    Zila, Charles T.; Samayoa, L. Fernando; Santiago, Rogelio; Butrón, Ana; Holland, James B.

    2013-01-01

    Fusarium ear rot is a common disease of maize that affects food and feed quality globally. Resistance to the disease is highly quantitative, and maize breeders have difficulty incorporating polygenic resistance alleles from unadapted donor sources into elite breeding populations without having a negative impact on agronomic performance. Identification of specific allele variants contributing to improved resistance may be useful to breeders by allowing selection of resistance alleles in coupling phase linkage with favorable agronomic characteristics. We report the results of a genome-wide association study to detect allele variants associated with increased resistance to Fusarium ear rot in a maize core diversity panel of 267 inbred lines evaluated in two sets of environments. We performed association tests with 47,445 single-nucleotide polymorphisms (SNPs) while controlling for background genomic relationships with a mixed model and identified three marker loci significantly associated with disease resistance in at least one subset of environments. Each associated SNP locus had relatively small additive effects on disease resistance (±1.1% on a 0–100% scale), but nevertheless were associated with 3 to 12% of the genotypic variation within or across environment subsets. Two of three identified SNPs colocalized with genes that have been implicated with programmed cell death. An analysis of associated allele frequencies within the major maize subpopulations revealed enrichment for resistance alleles in the tropical/subtropical and popcorn subpopulations compared with other temperate breeding pools. PMID:24048647

  17. SAQC: SNP Array Quality Control

    Directory of Open Access Journals (Sweden)

    Li Ling-Hui

    2011-04-01

    Full Text Available Abstract Background Genome-wide single-nucleotide polymorphism (SNP arrays containing hundreds of thousands of SNPs from the human genome have proven useful for studying important human genome questions. Data quality of SNP arrays plays a key role in the accuracy and precision of downstream data analyses. However, good indices for assessing data quality of SNP arrays have not yet been developed. Results We developed new quality indices to measure the quality of SNP arrays and/or DNA samples and investigated their statistical properties. The indices quantify a departure of estimated individual-level allele frequencies (AFs from expected frequencies via standardized distances. The proposed quality indices followed lognormal distributions in several large genomic studies that we empirically evaluated. AF reference data and quality index reference data for different SNP array platforms were established based on samples from various reference populations. Furthermore, a confidence interval method based on the underlying empirical distributions of quality indices was developed to identify poor-quality SNP arrays and/or DNA samples. Analyses of authentic biological data and simulated data show that this new method is sensitive and specific for the detection of poor-quality SNP arrays and/or DNA samples. Conclusions This study introduces new quality indices, establishes references for AFs and quality indices, and develops a detection method for poor-quality SNP arrays and/or DNA samples. We have developed a new computer program that utilizes these methods called SNP Array Quality Control (SAQC. SAQC software is written in R and R-GUI and was developed as a user-friendly tool for the visualization and evaluation of data quality of genome-wide SNP arrays. The program is available online (http://www.stat.sinica.edu.tw/hsinchou/genetics/quality/SAQC.htm.

  18. A genome-wide association study identifies rs2000999 as a strong genetic determinant of circulating haptoglobin levels.

    Directory of Open Access Journals (Sweden)

    Philippe Froguel

    Full Text Available Haptoglobin is an acute phase inflammatory marker. Its main function is to bind hemoglobin released from erythrocytes to aid its elimination, and thereby haptoglobin prevents the generation of reactive oxygen species in the blood. Haptoglobin levels have been repeatedly associated with a variety of inflammation-linked infectious and non-infectious diseases, including malaria, tuberculosis, human immunodeficiency virus, hepatitis C, diabetes, carotid atherosclerosis, and acute myocardial infarction. However, a comprehensive genetic assessment of the inter-individual variability of circulating haptoglobin levels has not been conducted so far.We used a genome-wide association study initially conducted in 631 French children followed by a replication in three additional European sample sets and we identified a common single nucleotide polymorphism (SNP, rs2000999 located in the Haptoglobin gene (HP as a strong genetic predictor of circulating Haptoglobin levels (P(overall = 8.1 × 10(-59, explaining 45.4% of its genetic variability (11.8% of Hp global variance. The functional relevance of rs2000999 was further demonstrated by its specific association with HP mRNA levels (β = 0.23 ± 0.08, P = 0.007. Finally, SNP rs2000999 was associated with decreased total and low-density lipoprotein cholesterol in 8,789 European children (P(total cholesterol = 0.002 and P(LDL = 0.0008.Given the central position of haptoglobin in many inflammation-related metabolic pathways, the relevance of rs2000999 genotyping when evaluating haptoglobin concentration should be further investigated in order to improve its diagnostic/therapeutic and/or prevention impact.

  19. Use of modern tomato breeding germplasm for deciphering the genetic control of agronomical traits by Genome Wide Association study.

    Science.gov (United States)

    Bauchet, Guillaume; Grenier, Stéphane; Samson, Nicolas; Bonnet, Julien; Grivet, Laurent; Causse, Mathilde

    2017-05-01

    A panel of 300 tomato accessions including breeding materials was built and characterized with >11,000 SNP. A population structure in six subgroups was identified. Strong heterogeneity in linkage disequilibrium and recombination landscape among groups and chromosomes was shown. GWAS identified several associations for fruit weight, earliness and plant growth. Genome-wide association studies (GWAS) have become a method of choice in quantitative trait dissection. First limited to highly polymorphic and outcrossing species, it is now applied in horticultural crops, notably in tomato. Until now GWAS in tomato has been performed on panels of heirloom and wild accessions. Using modern breeding materials would be of direct interest for breeding purpose. To implement GWAS on a large panel of 300 tomato accessions including 168 breeding lines, this study assessed the genetic diversity and linkage disequilibrium decay and revealed the population structure and performed GWA experiment. Genetic diversity and population structure analyses were based on molecular markers (>11,000 SNP) covering the whole genome. Six genetic subgroups were revealed and associated to traits of agronomical interest, such as fruit weight and disease resistance. Estimates of linkage disequilibrium highlighted the heterogeneity of its decay among genetic subgroups. Haplotype definition allowed a fine characterization of the groups and their recombination landscape revealing the patterns of admixture along the genome. Selection footprints showed results in congruence with introgressions. Taken together, all these elements refined our knowledge of the genetic material included in this panel and allowed the identification of several associations for fruit weight, plant growth and earliness, deciphering the genetic architecture of these complex traits and identifying several new loci useful for tomato breeding.

  20. Genome-wide association mapping of resistance to eyespot disease (Pseudocercosporella herpotrichoides) in European winter wheat (Triticum aestivum L.) and fine-mapping of Pch1.

    Science.gov (United States)

    Zanke, Christine D; Rodemann, Bernd; Ling, Jie; Muqaddasi, Quddoos H; Plieske, Jörg; Polley, Andreas; Kollers, Sonja; Ebmeyer, Erhard; Korzun, Viktor; Argillier, Odile; Stiewe, Gunther; Zschäckel, Thomas; Ganal, Martin W; Röder, Marion S

    2017-03-01

    Genotypes with recombination events in the Triticum ventricosum introgression on chromosome 7D allowed to fine-map resistance gene Pch1, the main source of eyespot resistance in European winter wheat cultivars. Eyespot (also called Strawbreaker) is a common and serious fungal disease of winter wheat caused by the necrotrophic fungi Oculimacula yallundae and Oculimacula acuformis (former name Pseudocercosporella herpotrichoides). A genome-wide association study (GWAS) for eyespot was performed with 732 microsatellite markers (SSR) and 7761 mapped SNP markers derived from the 90 K iSELECT wheat array using a panel of 168 European winter wheat varieties as well as three spring wheat varieties and phenotypic evaluation of eyespot in field tests in three environments. Best linear unbiased estimations (BLUEs) were calculated across all trials and ranged from 1.20 (most resistant) to 5.73 (most susceptible) with an average value of 4.24 and a heritability of H 2  = 0.91. A total of 108 SSR and 235 SNP marker-trait associations (MTAs) were identified by considering associations with a -log 10 (P value) ≥3.0. Significant MTAs for eyespot-score BLUEs were found on chromosomes 1D, 2A, 2D, 3D, 5A, 5D, 6A, 7A and 7D for the SSR markers and chromosomes 1B, 2A, 2B, 2D, 3B and 7D for the SNP markers. For 18 varieties (10.5%), a highly resistant phenotype was detected that was linked to the presence of the resistance gene Pch1 on chromosome 7D. The identification of genotypes with recombination events in the introgressed genomic segment from Triticum ventricosum harboring the Pch1 resistance gene on chromosome 7DL allowed the fine-mapping of this gene using additional SNP markers and a potential candidate gene Traes_7DL_973A33763 coding for a CC-NBS-LRR class protein was identified.

  1. Genome-wide association study reveals greater polygenic loading for schizophrenia in cases with a family history of illness

    DEFF Research Database (Denmark)

    Bigdeli, Tim B.; Ripke, Stephan; Bacanu, Silviu-Alin

    2016-01-01

    Genome-wide association studies (GWAS) of schizophrenia have yielded more than 100 common susceptibility variants, and strongly support a substantial polygenic contribution of a large number of small allelic effects. It has been hypothesized that familial schizophrenia is largely a consequence...... of inherited rather than environmental factors. We investigated the extent to which familiality of schizophrenia is associated with enrichment for common risk variants detectable in a large GWAS. We analyzed single nucleotide polymorphism (SNP) data for cases reporting a family history of psychotic illness (N...... history subgroup. Comparison of genome-wide polygenic risk scores based on GWAS summary statistics indicated a significant enrichment for SNP effects among family history positive compared to family history negative cases (Nagelkerke's R2=0.0021; P=0.00331; P-value threshold

  2. Genome-wide meta-analyses identify multiple loci associated with smoking behavior.

    LENUS (Irish Health Repository)

    2010-05-01

    Consistent but indirect evidence has implicated genetic factors in smoking behavior. We report meta-analyses of several smoking phenotypes within cohorts of the Tobacco and Genetics Consortium (n = 74,053). We also partnered with the European Network of Genetic and Genomic Epidemiology (ENGAGE) and Oxford-GlaxoSmithKline (Ox-GSK) consortia to follow up the 15 most significant regions (n > 140,000). We identified three loci associated with number of cigarettes smoked per day. The strongest association was a synonymous 15q25 SNP in the nicotinic receptor gene CHRNA3 (rs1051730[A], beta = 1.03, standard error (s.e.) = 0.053, P = 2.8 x 10(-73)). Two 10q25 SNPs (rs1329650[G], beta = 0.367, s.e. = 0.059, P = 5.7 x 10(-10); and rs1028936[A], beta = 0.446, s.e. = 0.074, P = 1.3 x 10(-9)) and one 9q13 SNP in EGLN2 (rs3733829[G], beta = 0.333, s.e. = 0.058, P = 1.0 x 10(-8)) also exceeded genome-wide significance for cigarettes per day. For smoking initiation, eight SNPs exceeded genome-wide significance, with the strongest association at a nonsynonymous SNP in BDNF on chromosome 11 (rs6265[C], odds ratio (OR) = 1.06, 95% confidence interval (Cl) 1.04-1.08, P = 1.8 x 10(-8)). One SNP located near DBH on chromosome 9 (rs3025343[G], OR = 1.12, 95% Cl 1.08-1.18, P = 3.6 x 10(-8)) was significantly associated with smoking cessation.

  3. Genome-wide association mapping of partial resistance to Phytophthora sojae in soybean plant introductions from the Republic of Korea.

    Science.gov (United States)

    Schneider, Rhiannon; Rolling, William; Song, Qijian; Cregan, Perry; Dorrance, Anne E; McHale, Leah K

    2016-08-11

    Phytophthora root and stem rot is one of the most yield-limiting diseases of soybean [Glycine max (L.) Merr], caused by the oomycete Phytophthora sojae. Partial resistance is controlled by several genes and, compared to single gene (Rps gene) resistance to P. sojae, places less selection pressure on P. sojae populations. Thus, partial resistance provides a more durable resistance against the pathogen. In previous work, plant introductions (PIs) originating from the Republic of Korea (S. Korea) have shown to be excellent sources for high levels of partial resistance against P. sojae. Resistance to two highly virulent P. sojae isolates was assessed in 1395 PIs from S. Korea via a greenhouse layer test. Lines exhibiting possible Rps gene immunity or rot due to other pathogens were removed and the remaining 800 lines were used to identify regions of quantitative resistance using genome-wide association mapping. Sixteen SNP markers on chromosomes 3, 13 and 19 were significantly associated with partial resistance to P. sojae and were grouped into seven quantitative trait loci (QTL) by linkage disequilibrium blocks. Two QTL on chromosome 3 and three QTL on chromosome 19 represent possible novel loci for partial resistance to P. sojae. While candidate genes at QTL varied in their predicted functions, the coincidence of QTLs 3-2 and 13-1 on chromosomes 3 and 13, respectively, with Rps genes and resistance gene analogs provided support for the hypothesized mechanism of partial resistance involving weak R-genes. QTL contributing to partial resistance towards P. sojae in soybean germplasm originating from S. Korea were identified. The QTL identified in this study coincide with previously reported QTL, Rps genes, as well as novel loci for partial resistance. Molecular markers associated with these QTL can be used in the marker-assisted introgression of these alleles into elite cultivars. Annotations of genes within QTL allow hypotheses on the possible mechanisms of partial

  4. From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.

    Directory of Open Access Journals (Sweden)

    Zhi Wei

    2009-10-01

    Full Text Available Genome-wide association studies (GWAS have been fruitful in identifying disease susceptibility loci for common and complex diseases. A remaining question is whether we can quantify individual disease risk based on genotype data, in order to facilitate personalized prevention and treatment for complex diseases. Previous studies have typically failed to achieve satisfactory performance, primarily due to the use of only a limited number of confirmed susceptibility loci. Here we propose that sophisticated machine-learning approaches with a large ensemble of markers may improve the performance of disease risk assessment. We applied a Support Vector Machine (SVM algorithm on a GWAS dataset generated on the Affymetrix genotyping platform for type 1 diabetes (T1D and optimized a risk assessment model with hundreds of markers. We subsequently tested this model on an independent Illumina-genotyped dataset with imputed genotypes (1,008 cases and 1,000 controls, as well as a separate Affymetrix-genotyped dataset (1,529 cases and 1,458 controls, resulting in area under ROC curve (AUC of approximately 0.84 in both datasets. In contrast, poor performance was achieved when limited to dozens of known susceptibility loci in the SVM model or logistic regression model. Our study suggests that improved disease risk assessment can be achieved by using algorithms that take into account interactions between a large ensemble of markers. We are optimistic that genotype-based disease risk assessment may be feasible for diseases where a notable proportion of the risk has already been captured by SNP arrays.

  5. A genome-wide association analysis of a broad psychosis phenotype identifies three loci for further investigation

    OpenAIRE

    Psychosis Endophenotypes International Consortium; Wellcome Trust Case-Control Consortium; Bramon, E.; Pirinen, M.; Strange, A.; Lin, K.; Freeman, C.; Bellenguez, C.; Su, Z.; Band, G.; Pearson, R.; Vukcevic, D.; Langford, C.; Deloukas, P.; Hunt, S.

    2014-01-01

    BACKGROUND: Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories. METHODS: 1239 cases with schizophrenia, schizoaffective disorder, or psychotic bipolar disorder; 857 of their unaffected relatives, and 2739 healthy controls were genotyped with the Affymetrix 6.0 single nucleotide polymorphism (SNP) array. Analyses of 69...

  6. A Genome-wide Association Analysis of a Broad Psychosis Phenotype Identifies Three Loci for Further Investigation

    OpenAIRE

    Tosato, Sarah; Myin-germeys, Inez; Barroso, Ines; Bender, Stephan; Giegling, Ina; Arranz, Maria J.; Donnelly, Peter; Bellenguez, Celine; Brown, Matthew A.; Lawrie, Stephen; Kalaydjieva, Luba; Vukcevic, Damjan; Kahn, Rene S.; Dronov, Serge; Walshe, Muriel

    2014-01-01

    Background: Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories.Methods: 1239 cases with schizophrenia, schizoaffective disorder, or psychotic bipolar disorder; 857 of their unaffected relatives, and 2739 healthy controls were genotyped with the Affymetrix 6.0 single nucleotide polymorphism (SNP) array. Analyses of 695,19...

  7. Genome-Wide Association Study Identifies Loci for Salt Tolerance during Germination in Autotetraploid Alfalfa (Medicago sativa L.) Using Genotyping-by-Sequencing

    Science.gov (United States)

    Yu, Long-Xi; Liu, Xinchun; Boge, William; Liu, Xiang-Ping

    2016-01-01

    Salinity is one of major abiotic stresses limiting alfalfa (Medicago sativa L.) production in the arid and semi-arid regions in US and other counties. In this study, we used a diverse panel of alfalfa accessions previously described by Zhang et al. (2015) to identify molecular markers associated with salt tolerance during germination using genome-wide association study (GWAS) and genotyping-by-sequencing (GBS). Phenotyping was done by germinating alfalfa seeds under different levels of salt stress. Phenotypic data of adjusted germination rates and SNP markers generated by GBS were used for marker-trait association. Thirty six markers were significantly associated with salt tolerance in at least one level of salt treatments. Alignment of sequence tags to the Medicago truncatula genome revealed genetic locations of the markers on all chromosomes except chromosome 3. Most significant markers were found on chromosomes 1, 2, and 4. BLAST search using the flanking sequences of significant markers identified 14 putative candidate genes linked to 23 significant markers. Most of them were repeatedly identified in two or three salt treatments. Several loci identified in the present study had similar genetic locations to the reported QTL associated with salt tolerance in M. truncatula. A locus identified on chromosome 6 by this study overlapped with that by drought in our previous study. To our knowledge, this is the first report on mapping loci associated with salt tolerance during germination in autotetraploid alfalfa. Further investigation on these loci and their linked genes would provide insight into understanding molecular mechanisms by which salt and drought stresses affect alfalfa growth. Functional markers closely linked to the resistance loci would be useful for MAS to improve alfalfa cultivars with enhanced resistance to drought and salt stresses. PMID:27446182

  8. Genome-Wide Association of Copy Number Polymorphisms and Kidney Function.

    Directory of Open Access Journals (Sweden)

    Man Li

    Full Text Available Genome-wide association studies (GWAS using single nucleotide polymorphisms (SNPs have identified more than 50 loci associated with estimated glomerular filtration rate (eGFR, a measure of kidney function. However, significant SNPs account for a small proportion of eGFR variability. Other forms of genetic variation have not been comprehensively evaluated for association with eGFR. In this study, we assess whether changes in germline DNA copy number are associated with GFR estimated from serum creatinine, eGFRcrea. We used hidden Markov models (HMMs to identify copy number polymorphic regions (CNPs from high-throughput SNP arrays for 2,514 African (AA and 8,645 European ancestry (EA participants in the Atherosclerosis Risk in Communities (ARIC study. Separately for the EA and AA cohorts, we used Bayesian Gaussian mixture models to estimate copy number at regions identified by the HMM or previously reported in the HapMap Project. We identified 312 and 464 autosomal CNPs among individuals of EA and AA, respectively. Multivariate models adjusted for SNP-derived covariates of population structure identified one CNP in the EA cohort near genome-wide statistical significance (Bonferroni-adjusted p = 0.067 located on chromosome 5 (876-880kb. Overall, our findings suggest a limited role of CNPs in explaining eGFR variability.

  9. Genome-wide SNPs lead to strong signals of geographic structure and relatedness patterns in the major arbovirus vector, Aedes aegypti.

    Science.gov (United States)

    Rašić, Gordana; Filipović, Igor; Weeks, Andrew R; Hoffmann, Ary A

    2014-04-11

    Genetic markers are widely used to understand the biology and population dynamics of disease vectors, but often markers are limited in the resolution they provide. In particular, the delineation of population structure, fine scale movement and patterns of relatedness are often obscured unless numerous markers are available. To address this issue in the major arbovirus vector, the yellow fever mosquito (Aedes aegypti), we used double digest Restriction-site Associated DNA (ddRAD) sequencing for the discovery of genome-wide single nucleotide polymorphisms (SNPs). We aimed to characterize the new SNP set and to test the resolution against previously described microsatellite markers in detecting broad and fine-scale genetic patterns in Ae. aegypti. We developed bioinformatics tools that support the customization of restriction enzyme-based protocols for SNP discovery. We showed that our approach for RAD library construction achieves unbiased genome representation that reflects true evolutionary processes. In Ae. aegypti samples from three continents we identified more than 18,000 putative SNPs. They were widely distributed across the three Ae. aegypti chromosomes, with 47.9% found in intergenic regions and 17.8% in exons of over 2,300 genes. Pattern of their imputed effects in ORFs and UTRs were consistent with those found in a recent transcriptome study. We demonstrated that individual mosquitoes from Indonesia, Australia, Vietnam and Brazil can be assigned with a very high degree of confidence to their region of origin using a large SNP panel. We also showed that familial relatedness of samples from a 0.4 km2 area could be confidently established with a subset of SNPs. Using a cost-effective customized RAD sequencing approach supported by our bioinformatics tools, we characterized over 18,000 SNPs in field samples of the dengue fever mosquito Ae. aegypti. The variants were annotated and positioned onto the three Ae. aegypti chromosomes. The new SNP set provided much

  10. Genome-wide Association Study of Personality Traits in the Long Life Family Study

    Directory of Open Access Journals (Sweden)

    Harold T Bae

    2013-05-01

    Full Text Available Personality traits have been shown to be associated with longevity and healthy aging. In order to discover novel genetic modifiers associated with personality traits as related with longevity, we performed a genome-wide association study (GWAS on personality factors assessed by NEO-FFI in individuals enrolled in the Long Life Family Study (LLFS, a study of 583 families (N up to 4595 with clustering for longevity in the United States and Denmark. Three SNPs, in almost perfect LD, associated with agreeableness reached genome-wide significance (p<10-8 and replicated in an additional sample of 1279 LLFS subjects, although one (rs9650241 failed to replicate and the other two were not available in two independent replication cohorts, the Baltimore Longitudinal Study of Aging and the New England Centenarian Study. Based on 10,000,000 permutations, the empirical p-value of 2X10-7 was observed for the genome-wide significant SNPs. Seventeen SNPs that reached marginal statistical significance in the two previous GWASs (p-value < 10-4 and 10-5, were also marginally significantly associated in this study (p-value < 0.05, although none of the associations passed the Bonferroni correction. In addition, we tested age-by-SNP interactions and found some significant associations. Since scores of personality traits in LLFS subjects change in the oldest ages, and genetic factors outweigh environmental factors to achieve extreme ages, these age-by-SNP interactions could be a proxy for complex gene-gene interactions affecting personality traits and longevity.

  11. Genome-wide association study of susceptibility loci for breast cancer in Sardinian population.

    Science.gov (United States)

    Palomba, Grazia; Loi, Angela; Porcu, Eleonora; Cossu, Antonio; Zara, Ilenia; Budroni, Mario; Dei, Mariano; Lai, Sandra; Mulas, Antonella; Olmeo, Nina; Ionta, Maria Teresa; Atzori, Francesco; Cuccuru, Gianmauro; Pitzalis, Maristella; Zoledziewska, Magdalena; Olla, Nazario; Lovicu, Mario; Pisano, Marina; Abecasis, Gonçalo R; Uda, Manuela; Tanda, Francesco; Michailidou, Kyriaki; Easton, Douglas F; Chanock, Stephen J; Hoover, Robert N; Hunter, David J; Schlessinger, David; Sanna, Serena; Crisponi, Laura; Palmieri, Giuseppe

    2015-05-10

    Despite progress in identifying genes associated with breast cancer, many more risk loci exist. Genome-wide association analyses in genetically-homogeneous populations, such as that of Sardinia (Italy), could represent an additional approach to detect low penetrance alleles. We performed a genome-wide association study comparing 1431 Sardinian patients with non-familial, BRCA1/2-mutation-negative breast cancer to 2171 healthy Sardinian blood donors. DNA was genotyped using GeneChip Human Mapping 500 K Arrays or Genome-Wide Human SNP Arrays 6.0. To increase genomic coverage, genotypes of additional SNPs were imputed using data from HapMap Phase II. After quality control filtering of genotype data, 1367 cases (9 men) and 1658 controls (1156 men) were analyzed on a total of 2,067,645 SNPs. Overall, 33 genomic regions (67 candidate SNPs) were associated with breast cancer risk at the p <  0(-6) level. Twenty of these regions contained defined genes, including one already associated with breast cancer risk: TOX3. With a lower threshold for preliminary significance to p < 10(-5), we identified 11 additional SNPs in FGFR2, a well-established breast cancer-associated gene. Ten candidate SNPs were selected, excluding those already associated with breast cancer, for technical validation as well as replication in 1668 samples from the same population. Only SNP rs345299, located in intron 1 of VAV3, remained suggestively associated (p-value, 1.16 x 10(-5)), but it did not associate with breast cancer risk in pooled data from two large, mixed-population cohorts. This study indicated the role of TOX3 and FGFR2 as breast cancer susceptibility genes in BRCA1/2-wild-type breast cancer patients from Sardinian population.

  12. Genome-wide association study of susceptibility loci for breast cancer in Sardinian population

    International Nuclear Information System (INIS)

    Palomba, Grazia; Loi, Angela; Porcu, Eleonora; Cossu, Antonio; Zara, Ilenia

    2015-01-01

    Despite progress in identifying genes associated with breast cancer, many more risk loci exist. Genome-wide association analyses in genetically-homogeneous populations, such as that of Sardinia (Italy), could represent an additional approach to detect low penetrance alleles. We performed a genome-wide association study comparing 1431 Sardinian patients with non-familial, BRCA1/2-mutation-negative breast cancer to 2171 healthy Sardinian blood donors. DNA was genotyped using GeneChip Human Mapping 500 K Arrays or Genome-Wide Human SNP Arrays 6.0. To increase genomic coverage, genotypes of additional SNPs were imputed using data from HapMap Phase II. After quality control filtering of genotype data, 1367 cases (9 men) and 1658 controls (1156 men) were analyzed on a total of 2,067,645 SNPs. Overall, 33 genomic regions (67 candidate SNPs) were associated with breast cancer risk at the p < 10 −6 level. Twenty of these regions contained defined genes, including one already associated with breast cancer risk: TOX3. With a lower threshold for preliminary significance to p < 10 −5 , we identified 11 additional SNPs in FGFR2, a well-established breast cancer-associated gene. Ten candidate SNPs were selected, excluding those already associated with breast cancer, for technical validation as well as replication in 1668 samples from the same population. Only SNP rs345299, located in intron 1 of VAV3, remained suggestively associated (p-value, 1.16x10 −5 ), but it did not associate with breast cancer risk in pooled data from two large, mixed-population cohorts. This study indicated the role of TOX3 and FGFR2 as breast cancer susceptibility genes in BRCA1/2-wild-type breast cancer patients from Sardinian population. The online version of this article (doi:10.1186/s12885-015-1392-9) contains supplementary material, which is available to authorized users

  13. Genome-wide search for gene-gene interactions in colorectal cancer.

    Directory of Open Access Journals (Sweden)

    Shuo Jiao

    Full Text Available Genome-wide association studies (GWAS have successfully identified a number of single-nucleotide polymorphisms (SNPs associated with colorectal cancer (CRC risk. However, these susceptibility loci known today explain only a small fraction of the genetic risk. Gene-gene interaction (GxG is considered to be one source of the missing heritability. To address this, we performed a genome-wide search for pair-wise GxG associated with CRC risk using 8,380 cases and 10,558 controls in the discovery phase and 2,527 cases and 2,658 controls in the replication phase. We developed a simple, but powerful method for testing interaction, which we term the Average Risk Due to Interaction (ARDI. With this method, we conducted a genome-wide search to identify SNPs showing evidence for GxG with previously identified CRC susceptibility loci from 14 independent regions. We also conducted a genome-wide search for GxG using the marginal association screening and examining interaction among SNPs that pass the screening threshold (p<10(-4. For the known locus rs10795668 (10p14, we found an interacting SNP rs367615 (5q21 with replication p = 0.01 and combined p = 4.19×10(-8. Among the top marginal SNPs after LD pruning (n = 163, we identified an interaction between rs1571218 (20p12.3 and rs10879357 (12q21.1 (nominal combined p = 2.51×10(-6; Bonferroni adjusted p = 0.03. Our study represents the first comprehensive search for GxG in CRC, and our results may provide new insight into the genetic etiology of CRC.

  14. Genome-wide association mapping including phenotypes from relatives without genotypes in a single-step (ssGWAS for 6-week body weight in broiler chickens

    Directory of Open Access Journals (Sweden)

    Huiyu eWang

    2014-05-01

    Full Text Available The purpose of this study was to compare results obtained from various methodologies for genome-wide association studies, when applied to real data, in terms of number and commonality of regions identified and their genetic variance explained, computational speed, and possible pitfalls in interpretations of results. Methodologies include: two iteratively reweighted single-step genomic BLUP procedures (ssGWAS1 and ssGWAS2, a single-marker model (CGWAS, and BayesB. The ssGWAS methods utilize genomic breeding values (GEBVs based on combined pedigree, genomic and phenotypic information, while CGWAS and BayesB only utilize phenotypes from genotyped animals or pseudo-phenotypes. In this study, ssGWAS was performed by converting GEBVs to SNP marker effects. Unequal variances for markers were incorporated for calculating weights into a new genomic relationship matrix. SNP weights were refined iteratively. The data was body weight at 6 weeks on 274,776 broiler chickens, of which 4553 were genotyped using a 60k SNP chip. Comparison of genomic regions was based on genetic variances explained by local SNP regions (20 SNPs. After 3 iterations, the noise was greatly reduced of ssGWAS1 and results are similar to that of CGWAS, with 4 out of the top 10 regions in common. In contrast, for BayesB, the plot was dominated by a single region explaining 23.1% of the genetic variance. This same region was found by ssGWAS1 with the same rank, but the amount of genetic variation attributed to the region was only 3%. These finding emphasize the need for caution when comparing and interpreting results from various methods, and highlight that detected associations, and strength of association, strongly depends on methodologies and details of implementations. BayesB appears to overly shrink regions to zero, while overestimating the amount of genetic variation attributed to the remaining SNP effects. The real world is most likely a compromise between methods and remains to

  15. Genome-wide association study of multiplex schizophrenia pedigrees

    DEFF Research Database (Denmark)

    Levinson, Douglas F; Shi, Jianxin; Wang, Kai

    2012-01-01

    The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs).......The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs)....

  16. Genome-wide association study of clinical dimensions of schizophrenia

    DEFF Research Database (Denmark)

    Fanous, Ayman H; Zhou, Baiyu; Aggen, Steven H

    2012-01-01

    Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia.......Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia....

  17. Combining Genome-Wide Information with a Functional Structural Plant Model to Simulate 1-Year-Old Apple Tree Architecture.

    Science.gov (United States)

    Migault, Vincent; Pallas, Benoît; Costes, Evelyne

    2016-01-01

    In crops, optimizing target traits in breeding programs can be fostered by selecting appropriate combinations of architectural traits which determine light interception and carbon acquisition. In apple tree, architectural traits were observed to be under genetic control. However, architectural traits also result from many organogenetic and morphological processes interacting with the environment. The present study aimed at combining a FSPM built for apple tree, MAppleT, with genetic determinisms of architectural traits, previously described in a bi-parental population. We focused on parameters related to organogenesis (phyllochron and immediate branching) and morphogenesis processes (internode length and leaf area) during the first year of tree growth. Two independent datasets collected in 2004 and 2007 on 116 genotypes, issued from a 'Starkrimson' × 'Granny Smith' cross, were used. The phyllochron was estimated as a function of thermal time and sylleptic branching was modeled subsequently depending on phyllochron. From a genetic map built with SNPs, marker effects were estimated on four MAppleT parameters with rrBLUP, using 2007 data. These effects were then considered in MAppleT to simulate tree development in the two climatic conditions. The genome wide prediction model gave consistent estimations of parameter values with correlation coefficients between observed values and estimated values from SNP markers ranging from 0.79 to 0.96. However, the accuracy of the prediction model following cross validation schemas was lower. Three integrative traits (the number of leaves, trunk length, and number of sylleptic laterals) were considered for validating MAppleT simulations. In 2007 climatic conditions, simulated values were close to observations, highlighting the correct simulation of genetic variability. However, in 2004 conditions which were not used for model calibration, the simulations differed from observations. This study demonstrates the possibility of

  18. GWAMA: software for genome-wide association meta-analysis

    Directory of Open Access Journals (Sweden)

    Mägi Reedik

    2010-05-01

    Full Text Available Abstract Background Despite the recent success of genome-wide association studies in identifying novel loci contributing effects to complex human traits, such as type 2 diabetes and obesity, much of the genetic component of variation in these phenotypes remains unexplained. One way to improving power to detect further novel loci is through meta-analysis of studies from the same population, increasing the sample size over any individual study. Although statistical software analysis packages incorporate routines for meta-analysis, they are ill equipped to meet the challenges of the scale and complexity of data generated in genome-wide association studies. Results We have developed flexible, open-source software for the meta-analysis of genome-wide association studies. The software incorporates a variety of error trapping facilities, and provides a range of meta-analysis summary statistics. The software is distributed with scripts that allow simple formatting of files containing the results of each association study and generate graphical summaries of genome-wide meta-analysis results. Conclusions The GWAMA (Genome-Wide Association Meta-Analysis software has been developed to perform meta-analysis of summary statistics generated from genome-wide association studies of dichotomous phenotypes or quantitative traits. Software with source files, documentation and example data files are freely available online at http://www.well.ox.ac.uk/GWAMA.

  19. Association study of phenology, yield and quality related traits in table grapes using SSR and SNP markers

    OpenAIRE

    Zarouri, Belkacem

    2016-01-01

    The advent of cheaper high throughput genotyping technologies and the availability of large germplasm collections encouraged the extension of Genome-Wide Association Studies (GWAS) to crop plants. However, to date these strategies have not yet been tested in grapevine (Vitis vinífera L.). Taking advantage of the availability of a large grapevine germplasm collection maintained at the germplasm bank of El Encín (Alcalá de Henares, Madrid, Spain) and the relatively affordable genotyping tools, ...

  20. Report on ISFG SNP Panel Discussion

    DEFF Research Database (Denmark)

    Butler, John M.; Budowle, B.; Gill, P.

    2008-01-01

    Six scientists presented their views and experience with single nucleotide polymorphism (SNP) markers, multiplexes, and methods regarding their potential application in forensic identity and relationship testing. Benefits and limitations of SNPs were reviewed, as were different SNP marker...

  1. Genome-Wide Gene-Environment Study Identifies Glutamate Receptor Gene GRIN2A as a Parkinson's Disease Modifier Gene via Interaction with Coffee

    Science.gov (United States)

    Hamza, Taye H.; Chen, Honglei; Hill-Burns, Erin M.; Rhodes, Shannon L.; Montimurro, Jennifer; Kay, Denise M.; Tenesa, Albert; Kusel, Victoria I.; Sheehan, Patricia; Eaaswarkhanth, Muthukrishnan; Yearout, Dora; Samii, Ali; Roberts, John W.; Agarwal, Pinky; Bordelon, Yvette; Park, Yikyung; Wang, Liyong; Gao, Jianjun; Vance, Jeffery M.; Kendler, Kenneth S.; Bacanu, Silviu-Alin; Scott, William K.; Ritz, Beate; Nutt, John; Factor, Stewart A.; Zabetian, Cyrus P.; Payami, Haydeh

    2011-01-01

    Our aim was to identify genes that influence the inverse association of coffee with the risk of developing Parkinson's disease (PD). We used genome-wide genotype data and lifetime caffeinated-coffee-consumption data on 1,458 persons with PD and 931 without PD from the NeuroGenetics Research Consortium (NGRC), and we performed a genome-wide association and interaction study (GWAIS), testing each SNP's main-effect plus its interaction with coffee, adjusting for sex, age, and two principal components. We then stratified subjects as heavy or light coffee-drinkers and performed genome-wide association study (GWAS) in each group. We replicated the most significant SNP. Finally, we imputed the NGRC dataset, increasing genomic coverage to examine the region of interest in detail. The primary analyses (GWAIS, GWAS, Replication) were performed using genotyped data. In GWAIS, the most significant signal came from rs4998386 and the neighboring SNPs in GRIN2A. GRIN2A encodes an NMDA-glutamate-receptor subunit and regulates excitatory neurotransmission in the brain. Achieving P2df = 10−6, GRIN2A surpassed all known PD susceptibility genes in significance in the GWAIS. In stratified GWAS, the GRIN2A signal was present in heavy coffee-drinkers (OR = 0.43; P = 6×10−7) but not in light coffee-drinkers. The a priori Replication hypothesis that “Among heavy coffee-drinkers, rs4998386_T carriers have lower PD risk than rs4998386_CC carriers” was confirmed: ORReplication = 0.59, PReplication = 10−3; ORPooled = 0.51, PPooled = 7×10−8. Compared to light coffee-drinkers with rs4998386_CC genotype, heavy coffee-drinkers with rs4998386_CC genotype had 18% lower risk (P = 3×10−3), whereas heavy coffee-drinkers with rs4998386_TC genotype had 59% lower risk (P = 6×10−13). Imputation revealed a block of SNPs that achieved P2dfcoffee-drinkers. This study is proof of concept that inclusion of environmental factors can help identify genes that

  2. Genome-wide gene-environment study identifies glutamate receptor gene GRIN2A as a Parkinson's disease modifier gene via interaction with coffee.

    Science.gov (United States)

    Hamza, Taye H; Chen, Honglei; Hill-Burns, Erin M; Rhodes, Shannon L; Montimurro, Jennifer; Kay, Denise M; Tenesa, Albert; Kusel, Victoria I; Sheehan, Patricia; Eaaswarkhanth, Muthukrishnan; Yearout, Dora; Samii, Ali; Roberts, John W; Agarwal, Pinky; Bordelon, Yvette; Park, Yikyung; Wang, Liyong; Gao, Jianjun; Vance, Jeffery M; Kendler, Kenneth S; Bacanu, Silviu-Alin; Scott, William K; Ritz, Beate; Nutt, John; Factor, Stewart A; Zabetian, Cyrus P; Payami, Haydeh

    2011-08-01

    Our aim was to identify genes that influence the inverse association of coffee with the risk of developing Parkinson's disease (PD). We used genome-wide genotype data and lifetime caffeinated-coffee-consumption data on 1,458 persons with PD and 931 without PD from the NeuroGenetics Research Consortium (NGRC), and we performed a genome-wide association and interaction study (GWAIS), testing each SNP's main-effect plus its interaction with coffee, adjusting for sex, age, and two principal components. We then stratified subjects as heavy or light coffee-drinkers and performed genome-wide association study (GWAS) in each group. We replicated the most significant SNP. Finally, we imputed the NGRC dataset, increasing genomic coverage to examine the region of interest in detail. The primary analyses (GWAIS, GWAS, Replication) were performed using genotyped data. In GWAIS, the most significant signal came from rs4998386 and the neighboring SNPs in GRIN2A. GRIN2A encodes an NMDA-glutamate-receptor subunit and regulates excitatory neurotransmission in the brain. Achieving P(2df) = 10(-6), GRIN2A surpassed all known PD susceptibility genes in significance in the GWAIS. In stratified GWAS, the GRIN2A signal was present in heavy coffee-drinkers (OR = 0.43; P = 6×10(-7)) but not in light coffee-drinkers. The a priori Replication hypothesis that "Among heavy coffee-drinkers, rs4998386_T carriers have lower PD risk than rs4998386_CC carriers" was confirmed: OR(Replication) = 0.59, P(Replication) = 10(-3); OR(Pooled) = 0.51, P(Pooled) = 7×10(-8). Compared to light coffee-drinkers with rs4998386_CC genotype, heavy coffee-drinkers with rs4998386_CC genotype had 18% lower risk (P = 3×10(-3)), whereas heavy coffee-drinkers with rs4998386_TC genotype had 59% lower risk (P = 6×10(-13)). Imputation revealed a block of SNPs that achieved P(2df)coffee-drinkers. This study is proof of concept that inclusion of environmental factors can help identify

  3. Genome-wide gene-environment study identifies glutamate receptor gene GRIN2A as a Parkinson's disease modifier gene via interaction with coffee.

    Directory of Open Access Journals (Sweden)

    Taye H Hamza

    2011-08-01

    Full Text Available Our aim was to identify genes that influence the inverse association of coffee with the risk of developing Parkinson's disease (PD. We used genome-wide genotype data and lifetime caffeinated-coffee-consumption data on 1,458 persons with PD and 931 without PD from the NeuroGenetics Research Consortium (NGRC, and we performed a genome-wide association and interaction study (GWAIS, testing each SNP's main-effect plus its interaction with coffee, adjusting for sex, age, and two principal components. We then stratified subjects as heavy or light coffee-drinkers and performed genome-wide association study (GWAS in each group. We replicated the most significant SNP. Finally, we imputed the NGRC dataset, increasing genomic coverage to examine the region of interest in detail. The primary analyses (GWAIS, GWAS, Replication were performed using genotyped data. In GWAIS, the most significant signal came from rs4998386 and the neighboring SNPs in GRIN2A. GRIN2A encodes an NMDA-glutamate-receptor subunit and regulates excitatory neurotransmission in the brain. Achieving P(2df = 10(-6, GRIN2A surpassed all known PD susceptibility genes in significance in the GWAIS. In stratified GWAS, the GRIN2A signal was present in heavy coffee-drinkers (OR = 0.43; P = 6×10(-7 but not in light coffee-drinkers. The a priori Replication hypothesis that "Among heavy coffee-drinkers, rs4998386_T carriers have lower PD risk than rs4998386_CC carriers" was confirmed: OR(Replication = 0.59, P(Replication = 10(-3; OR(Pooled = 0.51, P(Pooled = 7×10(-8. Compared to light coffee-drinkers with rs4998386_CC genotype, heavy coffee-drinkers with rs4998386_CC genotype had 18% lower risk (P = 3×10(-3, whereas heavy coffee-drinkers with rs4998386_TC genotype had 59% lower risk (P = 6×10(-13. Imputation revealed a block of SNPs that achieved P(2df<5×10(-8 in GWAIS, and OR = 0.41, P = 3×10(-8 in heavy coffee-drinkers. This study is proof of

  4. Expression Level of the DREB2-Type Gene, Identified with Amplifluor SNP Markers, Correlates with Performance, and Tolerance to Dehydration in Bread Wheat Cultivars from Northern Kazakhstan

    Science.gov (United States)

    Shavrukov, Yuri; Zhumalin, Aibek; Serikbay, Dauren; Botayeva, Makpal; Otemisova, Ainur; Absattarova, Aiman; Sereda, Grigoriy; Sereda, Sergey; Shvidchenko, Vladimir; Turbekova, Arysgul; Jatayev, Satyvaldy; Lopato, Sergiy; Soole, Kathleen; Langridge, Peter

    2016-01-01

    A panel of 89 local commercial cultivars of bread wheat was tested in field trials in the dry conditions of Northern Kazakhstan. Two distinct groups of cultivars (six cultivars in each group), which had the highest and the lowest grain yield under drought were selected for further experiments. A dehydration test conducted on detached leaves indicated a strong association between rates of water loss in plants from the first group with highest grain yield production in the dry environment relative to the second group. Modern high-throughput Amplifluor Single Nucleotide Polymorphism (SNP) technology was applied to study allelic variations in a series of drought-responsive genes using 19 SNP markers. Genotyping of an SNP in the TaDREB5 (DREB2-type) gene using the Amplifluor SNP marker KATU48 revealed clear allele distribution across the entire panel of wheat accessions, and distinguished between the two groups of cultivars with high and low yield under drought. Significant differences in expression levels of TaDREB5 were revealed by qRT-PCR. Most wheat plants from the first group of cultivars with high grain yield showed slight up-regulation in the TaDREB5 transcript in dehydrated leaves. In contrast, expression of TaDREB5 in plants from the second group of cultivars with low grain yield was significantly down-regulated. It was found that SNPs did not alter the amino acid sequence of TaDREB5 protein. Thus, a possible explanation is that alternative splicing and up-stream regulation of TaDREB5 may be affected by SNP, but these hypotheses require additional analysis (and will be the focus of future studies). PMID:27917186

  5. Expression level of the DREB2-type gene, identified with Amplifluor SNP markers, correlates with performance and tolerance to dehydration in bread wheat cultivars from Northern Kazakhstan

    Directory of Open Access Journals (Sweden)

    Yuri Shavrukov

    2016-11-01

    Full Text Available A panel of 89 local commercial cultivars of bread wheat was tested in field trials in the dry conditions of Northern Kazakhstan. Two distinct groups of cultivars (six cultivars in each group, which had the highest and the lowest grain yield under drought were selected for further experiments. A dehydration test conducted on detached leaves indicated a strong association between rates of water loss in plants from the first group with highest grain yield production in the dry environment relative to the second group. Modern high-throughput Amplifluor SNP technology was applied to study allelic variations in a series of drought-responsive genes using 19 SNP markers. Genotyping of an SNP in the TaDREB5 (DREB2-type gene using the Amplifluor SNP marker KATU48 revealed clear allele distribution across the entire panel of wheat accessions, and distinguished between the two groups of cultivars with high and low yield under drought. Significant differences in expression levels of TaDREB5 were revealed by qRT-PCR. Most wheat plants from the first group of cultivars with high grain yield showed strong up-regulation of TaDREB5 transcript in dehydrated leaves. In contrast, expression of TaDREB5 in plants from the second group of cultivars with low grain yield was significantly down-regulated. It was found that SNPs did not alter the amino acid sequence of TaDREB5 protein. Thus, a possible explanation is that alternative splicing and up-stream regulation of TaDREB5 may be affected by SNP, but these hypotheses require additional analysis (and will be the focus of future studies.

  6. SNP detection from de novo transcriptome sequencing in the bivalve Macoma balthica: marker development for evolutionary studies.

    Directory of Open Access Journals (Sweden)

    Eric Pante

    Full Text Available Hybrid zones are noteworthy systems for the study of environmental adaptation to fast-changing environments, as they constitute reservoirs of polymorphism and are key to the maintenance of biodiversity. They can move in relation to climate fluctuations, as temperature can affect both selection and migration, or remain trapped by environmental and physical barriers. There is therefore a very strong incentive to study the dynamics of hybrid zones subjected to climate variations. The infaunal bivalve Macoma balthica emerges as a noteworthy model species, as divergent lineages hybridize, and its native NE Atlantic range is currently contracting to the North. To investigate the dynamics and functioning of hybrid zones in M. balthica, we developed new molecular markers by sequencing the collective transcriptome of 30 individuals. Ten individuals were pooled for each of the three populations sampled at the margins of two hybrid zones. A single 454 run generated 277 Mb from which 17K SNPs were detected. SNP density averaged 1 polymorphic site every 14 to 19 bases, for mitochondrial and nuclear loci, respectively. An [Formula: see text] scan detected high genetic divergence among several hundred SNPs, some of them involved in energetic metabolism, cellular respiration and physiological stress. The high population differentiation, recorded for nuclear-encoded ATP synthase and NADH dehydrogenase as well as most mitochondrial loci, suggests cytonuclear genetic incompatibilities. Results from this study will help pave the way to a high-resolution study of hybrid zone dynamics in M. balthica, and the relative importance of endogenous and exogenous barriers to gene flow in this system.

  7. Design and characterization of a 52K SNP chip for goats.

    Directory of Open Access Journals (Sweden)

    Gwenola Tosser-Klopp

    Full Text Available The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50-60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed: Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes, sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years.

  8. Generation and analysis of ESTs from the eastern oyster, Crassostrea virginica Gmelin and identification of microsatellite and SNP markers

    Directory of Open Access Journals (Sweden)

    Wallace Richard

    2007-06-01

    Full Text Available Abstract Background The eastern oyster, Crassostrea virginica (Gmelin 1791, is an economically important species cultured in many areas in North America. It is also ecologically important because of the impact of its filter feeding behaviour on water quality. Populations of C. virginica have been threatened by overfishing, habitat degradation, and diseases. Through genome research, strategies are being developed to reverse its population decline. However, large-scale expressed sequence tag (EST resources have been lacking for this species. Efficient generation of EST resources from this species has been hindered by a high redundancy of transcripts. The objectives of this study were to construct a normalized cDNA library for efficient EST analysis, to generate thousands of ESTs, and to analyze the ESTs for microsatellites and potential single nucleotide polymorphisms (SNPs. Results A normalized and subtracted C. virginica cDNA library was constructed from pooled RNA isolated from hemocytes, mantle, gill, gonad and digestive tract, muscle, and a whole juvenile oyster. A total of 6,528 clones were sequenced from this library generating 5,542 high-quality EST sequences. Cluster analysis indicated the presence of 635 contigs and 4,053 singletons, generating a total of 4,688 unique sequences. About 46% (2,174 of the unique ESTs had significant hits (E-value ≤ 1e-05 to the non-redundant protein database; 1,104 of which were annotated using Gene Ontology (GO terms. A total of 35 microsatellites were identified from the ESTs, with 18 having sufficient flanking sequences for primer design. A total of 6,533 putative SNPs were also identified using all existing and the newly generated EST resources of the eastern oysters. Conclusion A high quality normalized cDNA library was constructed. A total of 5,542 ESTs were generated representing 4,688 unique sequences. Putative microsatellite and SNP markers were identified. These genome resources provide the

  9. A genome-wide association study of cleft lip with and without cleft palate identifies risk variants near MAFB and ABCA4

    DEFF Research Database (Denmark)

    Beaty, Terri H; Murray, Jeffrey C; Marazita, Mary L

    2010-01-01

    Case-parent trios were used in a genome-wide association study of cleft lip with and without cleft palate. SNPs near two genes not previously associated with cleft lip with and without cleft palate (MAFB, most significant SNP rs13041247, with odds ratio (OR) per minor allele = 0.704, 95% CI 0.635...

  10. Population structure of Atlantic Mackerel inferred from RAD-seq derived SNP markers: effects of sequence clustering parameters and hierarchical SNP selection

    KAUST Repository

    Rodríguez-Ezpeleta, Naiara

    2016-03-03

    Restriction-site associated DNA sequencing (RAD-seq) and related methods are revolutionizing the field of population genomics in non-model organisms as they allow generating an unprecedented number of single nucleotide polymorphisms (SNPs) even when no genomic information is available. Yet, RAD-seq data analyses rely on assumptions on nature and number of nucleotide variants present in a single locus, the choice of which may lead to an under- or overestimated number of SNPs and/or to incorrectly called genotypes. Using the Atlantic mackerel (Scomber scombrus L.) and a close relative, the Atlantic chub mackerel (Scomber colias), as case study, here we explore the sensitivity of population structure inferences to two crucial aspects in RAD-seq data analysis: the maximum number of mismatches allowed to merge reads into a locus and the relatedness of the individuals used for genotype calling and SNP selection. Our study resolves the population structure of the Atlantic mackerel, but, most importantly, provides insights into the effects of alternative RAD-seq data analysis strategies on population structure inferences that are directly applicable to other species.

  11. A genetic linkage map with 178 SSR and 1 901 SNP markers constructed using a RIL population in wheat (Triticum aestivum L.)

    Institute of Scientific and Technical Information of China (English)

    ZHAI Hui-jie; FENG Zhi-yu; LIU Xin-ye; CHENG Xue-jiao; PENG Hui-ru; YAO Ying-yin; SUN Qi-xin; NI Zhong-fu

    2015-01-01

    The construction of high density genetic linkage map provides a powerful tool to detect and map quantitative trait loci (QTLs) controlling agronomically important traits. In this study, simple sequence repeat (SSR) markers and Illumina 9K iSelect single nucleotide polymorphism (SNP) genechip were employed to construct one genetic linkage map of common wheat (Triticum aestivum L.) using 191 recombinant inbred lines (RILs) derived from cross Yu 8679xJing 411. This map included 1 901 SNP loci and 178 SSR loci, covering 1 659.9 cM and 1 000 marker bins, with an average interval distance of 1.66 cM. A, B and D genomes covered 719.1,703.5 and 237.3 cM, with an average interval distance of 1.66, 1.45 and 2.9 cM, respectively. Notably, the genetic linkage map covered 20 chromosomes, with the exception of chromosome 5D. Bioinformatics analysis revealed that 1 754 (92.27%) of 1 901 mapped SNP loci could be aligned to 1 215 distinct wheat unigenes, among which 1 184 (97.4%) were located on one single chromosome, and the rest 31 (2.6%) were located on 2 to 3 chromosomes. By performing in silico comparison, 214 chromosome deletion bin-mapped expressed sequence tags (ESTs), 1 043 Brachypodium genes and 1 033 rice genes were further added onto the genetic linkage map. This map not only integrated genetic and physical maps, SSR and SNP loci, respectively, but also provided the information of Brachypodium and rice genes corresponding to 1 754 SNP loci. Therefore, it will be a useful tool for comparative genomics analysis, fine mapping of QTL/gene controlling agronomically important traits and marker-assisted selection breeding in wheat.

  12. Genome-Wide Association Mapping of Flowering and Ripening Periods in Apple

    Directory of Open Access Journals (Sweden)

    Jorge Urrestarazu

    2017-11-01

    Full Text Available Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed model (MLMM, which handles the putatively confounding effect of significant SNPs elsewhere on the genome. Genomic regions were further investigated to reveal candidate genes responsible for the phenotypic variation. At the whole population level, GWAS retained two SNPs as cofactors on chromosome 9 for flowering period, and six for ripening period (four on chromosome 3, one on chromosome 10 and one on chromosome 16 which, together accounted for 8.9 and 17.2% of the phenotypic variance, respectively. For both traits, SNPs in weak linkage disequilibrium were detected nearby, thus suggesting the existence of allelic heterogeneity. The geographic origins and relationships of apple cultivars accounted for large parts of the phenotypic variation. Variation in genotypic frequency of the SNPs associated with the two traits was connected to the geographic origin of the genotypes (grouped as North+East, West and South Europe, and indicated differential selection in different growing environments. Genes encoding transcription factors containing either NAC or MADS domains were identified as major candidates within the small confidence intervals computed for the associated genomic regions. A strong microsynteny between apple and peach was revealed in all the four confidence interval regions. This study shows how association genetics can unravel the genetic control of important horticultural traits in apple, as well as reduce the confidence intervals of the associated

  13. Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Rekaya R

    2016-11-01

    Full Text Available Romdhane Rekaya,1–3 Shannon Smith,4 El Hamidi Hay,5 Nourhene Farhat,6 Samuel E Aggrey3,7 1Department of Animal and Dairy Science, College of Agricultural and Environmental Sciences, 2Department of Statistics, Franklin College of Arts and Sciences, 3Institute of Bioinformatics, The University of Georgia, Athens, GA, 4Zoetis, Kalamazoo, MI, 5United States Department of Agriculture, Agricultural Research Service, Beltsville, MD, 6Carolinas HealthCare System Blue Ridge, Morganton, NC, 7Department of Poultry Science, College of Agricultural and Environmental Sciences, University of Georgia, Athens, GA, USA Abstract: Errors in the binary status of some response traits are frequent in human, animal, and plant applications. These error rates tend to differ between cases and controls because diagnostic and screening tests have different sensitivity and specificity. This increases the inaccuracies of classifying individuals into correct groups, giving rise to both false-positive and false-negative cases. The analysis of these noisy binary responses due to misclassification will undoubtedly reduce the statistical power of genome-wide association studies (GWAS. A threshold model that accommodates varying diagnostic errors between cases and controls was investigated. A simulation study was carried out where several binary data sets (case–control were generated with varying effects for the most influential single nucleotide polymorphisms (SNPs and different diagnostic error rate for cases and controls. Each simulated data set consisted of 2000 individuals. Ignoring misclassification resulted in biased estimates of true influential SNP effects and inflated estimates for true noninfluential markers. A substantial reduction in bias and increase in accuracy ranging from 12% to 32% was observed when the misclassification procedure was invoked. In fact, the majority of influential SNPs that were not identified using the noisy data were captured using the

  14. Preliminary genome-wide association study of bipolar disorder in the Japanese population.

    Science.gov (United States)

    Hattori, Eiji; Toyota, Tomoko; Ishitsuka, Yuichi; Iwayama, Yoshimi; Yamada, Kazuo; Ujike, Hiroshi; Morita, Yukitaka; Kodama, Masafumi; Nakata, Kenji; Minabe, Yoshio; Nakamura, Kazuhiko; Iwata, Yasuhide; Takei, Nori; Mori, Norio; Naitoh, Hiroshi; Yamanouchi, Yoshio; Iwata, Nakao; Ozaki, Norio; Kato, Tadafumi; Nishikawa, Toru; Kashiwa, Atsushi; Suzuki, Mika; Shioe, Kunihiko; Shinohara, Manabu; Hirano, Masami; Nanko, Shinichiro; Akahane, Akihisa; Ueno, Mikako; Kaneko, Naoshi; Watanabe, Yuichiro; Someya, Toshiyuki; Hashimoto, Kenji; Iyo, Masaomi; Itokawa, Masanari; Arai, Makoto; Nankai, Masahiro; Inada, Toshiya; Yoshida, Sumiko; Kunugi, Hiroshi; Nakamura, Michiko; Iijima, Yoshimi; Okazaki, Yuji; Higuchi, Teruhiko; Yoshikawa, Takeo

    2009-12-05

    Recent progress in genotyping technology and the development of public databases has enabled large-scale genome-wide association tests with diseases. We performed a two-stage genome-wide association study (GWAS) of bipolar disorder (BD) in Japanese cohorts. First we used Affymetrix 100K GeneChip arrays in the analysis of 107 cases with bipolar I disorder and 107 controls, and selected markers that were nominally significant (P < 0.01) in at least one of the three models (1,577 markers in total). In the follow-up stage, we analyzed these markers using an Illumina platform (1,526 markers; 51 markers were not designable for the platform) and an independent sample set, which consisted of 395 cases (bipolar I + II) and 409 controls. We also assessed the population stratification of current samples using principal components analysis. After the two-stage analysis, 89 markers remained nominally significant (allelic P < 0.05) with the same allele being consistently over-represented in both the first and the follow-up stages. However, none of these were significant after correction for multiple-testing by false discovery rates. Sample stratification was virtually negligible. Collectively, this is the first GWAS of BD in the Japanese population. But given the small sample size and the limited genomic coverage, these results should be taken as preliminary. 2009 Wiley-Liss, Inc.

  15. Uncovering the hidden risk architecture of the schizophrenias: confirmation in three independent genome-wide association studies.

    Science.gov (United States)

    Arnedo, Javier; Svrakic, Dragan M; Del Val, Coral; Romero-Zaliz, Rocío; Hernández-Cuervo, Helena; Fanous, Ayman H; Pato, Michele T; Pato, Carlos N; de Erausquin, Gabriel A; Cloninger, C Robert; Zwir, Igor

    2015-02-01

    The authors sought to demonstrate that schizophrenia is a heterogeneous group of heritable disorders caused by different genotypic networks that cause distinct clinical syndromes. In a large genome-wide association study of cases with schizophrenia and controls, the authors first identified sets of interacting single-nucleotide polymorphisms (SNPs) that cluster within particular individuals (SNP sets) regardless of clinical status. Second, they examined the risk of schizophrenia for each SNP set and tested replicability in two independent samples. Third, they identified genotypic networks composed of SNP sets sharing SNPs or subjects. Fourth, they identified sets of distinct clinical features that cluster in particular cases (phenotypic sets or clinical syndromes) without regard for their genetic background. Fifth, they tested whether SNP sets were associated with distinct phenotypic sets in a replicable manner across the three studies. The authors identified 42 SNP sets associated with a 70% or greater risk of schizophrenia, and confirmed 34 (81%) or more with similar high risk of schizophrenia in two independent samples. Seventeen networks of SNP sets did not share any SNP or subject. These disjoint genotypic networks were associated with distinct gene products and clinical syndromes (i.e., the schizophrenias) varying in symptoms and severity. Associations between genotypic networks and clinical syndromes were complex, showing multifinality and equifinality. The interactive networks explained the risk of schizophrenia more than the average effects of all SNPs (24%). Schizophrenia is a group of heritable disorders caused by a moderate number of separate genotypic networks associated with several distinct clinical syndromes.

  16. A Genome-Wide Association Study Identifies Risk Loci to Equine Recurrent Uveitis in German Warmblood Horses

    Science.gov (United States)

    Kulbrock, Maike; Lehner, Stefanie; Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar

    2013-01-01

    Equine recurrent uveitis (ERU) is a common eye disease affecting up to 3–15% of the horse population. A genome-wide association study (GWAS) using the Illumina equine SNP50 bead chip was performed to identify loci conferring risk to ERU. The sample included a total of 144 German warmblood horses. A GWAS showed a significant single nucleotide polymorphism (SNP) on horse chromosome (ECA) 20 at 49.3 Mb, with IL-17A and IL-17F being the closest genes. This locus explained a fraction of 23% of the phenotypic variance for ERU. A GWAS taking into account the severity of ERU, revealed a SNP on ECA18 nearby to the crystalline gene cluster CRYGA-CRYGF. For both genomic regions on ECA18 and 20, significantly associated haplotypes containing the genome-wide significant SNPs could be demonstrated. In conclusion, our results are indicative for a genetic component regulating the possible critical role of IL-17A and IL-17F in the pathogenesis of ERU. The associated SNP on ECA18 may be indicative for cataract formation in the course of ERU. PMID:23977091

  17. Accuracy of Assignment of Atlantic Salmon (Salmo salar L.) to Rivers and Regions in Scotland and Northeast England Based on Single Nucleotide Polymorphism (SNP) Markers

    Science.gov (United States)

    Gilbey, John; Cauwelier, Eef; Coulson, Mark W.; Stradmeyer, Lee; Sampayo, James N.; Armstrong, Anja; Verspoor, Eric; Corrigan, Laura; Shelley, Jonathan; Middlemas, Stuart

    2016-01-01

    Understanding the habitat use patterns of migratory fish, such as Atlantic salmon (Salmo salar L.), and the natural and anthropogenic impacts on them, is aided by the ability to identify individuals to their stock of origin. Presented here are the results of an analysis of informative single nucleotide polymorphic (SNP) markers for detecting genetic structuring in Atlantic salmon in Scotland and NE England and their ability to allow accurate genetic stock identification. 3,787 fish from 147 sites covering 27 rivers were screened at 5,568 SNP markers. In order to identify a cost-effective subset of SNPs, they were ranked according to their ability to differentiate between fish from different rivers. A panel of 288 SNPs was used to examine both individual assignments and mixed stock fisheries and eighteen assignment units were defined. The results improved greatly on previously available methods and, for the first time, fish caught in the marine environment can be confidently assigned to geographically coherent units within Scotland and NE England, including individual rivers. As such, this SNP panel has the potential to aid understanding of the various influences acting upon Atlantic salmon on their marine migrations, be they natural environmental variations and/or anthropogenic impacts, such as mixed stock fisheries and interactions with marine power generation installations. PMID:27723810

  18. A Genome-Wide Association Study Suggests Novel Loci Associated with a Schizophrenia-Related Brain-Based Phenotype.

    Directory of Open Access Journals (Sweden)

    Johanna Hass

    Full Text Available Patients with schizophrenia and their siblings typically show subtle changes of brain structures, such as a reduction of hippocampal volume. Hippocampal volume is heritable, may explain a variety of cognitive symptoms of schizophrenia and is thus considered an intermediate phenotype for this mental illness. The aim of our analyses was to identify single-nucleotide polymorphisms (SNP related to hippocampal volume without making prior assumptions about possible candidate genes. In this study, we combined genetics, imaging and neuropsychological data obtained from the Mind Clinical Imaging Consortium study of schizophrenia (n = 328. A total of 743,591 SNPs were tested for association with hippocampal volume in a genome-wide association study. Gene expression profiles of human hippocampal tissue were investigated for gene regions of significantly associated SNPs. None of the genetic markers reached genome-wide significance. However, six highly correlated SNPs (rs4808611, rs35686037, rs12982178, rs1042178, rs10406920, rs8170 on chromosome 19p13.11, located within or in close proximity to the genes NR2F6, USHBP1, and BABAM1, as well as four SNPs in three other genomic regions (chromosome 1, 2 and 10 had p-values between 6.75×10(-6 and 8.3×10(-7. Using existing data of a very recently published GWAS of hippocampal volume and additional data of a multicentre study in a large cohort of adolescents of European ancestry, we found supporting evidence for our results. Furthermore, allelic differences in rs4808611 and rs8170 were highly associated with differential mRNA expression in the cis-acting region. Associations with memory functioning indicate a possible functional importance of the identified risk variants. Our findings provide new insights into the genetic architecture of a brain structure closely linked to schizophrenia. In silico replication, mRNA expression and cognitive data provide additional support for the relevance of our findings

  19. Genome-wide association of lipid-lowering response to statins in combined study populations.

    Directory of Open Access Journals (Sweden)

    Mathew J Barber

    2010-03-01

    Full Text Available Statins effectively lower total and plasma LDL-cholesterol, but the magnitude of decrease varies among individuals. To identify single nucleotide polymorphisms (SNPs contributing to this variation, we performed a combined analysis of genome-wide association (GWA results from three trials of statin efficacy.Bayesian and standard frequentist association analyses were performed on untreated and statin-mediated changes in LDL-cholesterol, total cholesterol, HDL-cholesterol, and triglyceride on a total of 3932 subjects using data from three studies: Cholesterol and Pharmacogenetics (40 mg/day simvastatin, 6 weeks, Pravastatin/Inflammation CRP Evaluation (40 mg/day pravastatin, 24 weeks, and Treating to New Targets (10 mg/day atorvastatin, 8 weeks. Genotype imputation was used to maximize genomic coverage and to combine information across studies. Phenotypes were normalized within each study to account for systematic differences among studies, and fixed-effects combined analysis of the combined sample were performed to detect consistent effects across studies. Two SNP associations were assessed as having posterior probability greater than 50%, indicating that they were more likely than not to be genuinely associated with statin-mediated lipid response. SNP rs8014194, located within the CLMN gene on chromosome 14, was strongly associated with statin-mediated change in total cholesterol with an 84% probability by Bayesian analysis, and a p-value exceeding conventional levels of genome-wide significance by frequentist analysis (P = 1.8 x 10(-8. This SNP was less significantly associated with change in LDL-cholesterol (posterior probability = 0.16, P = 4.0 x 10(-6. Bayesian analysis also assigned a 51% probability that rs4420638, located in APOC1 and near APOE, was associated with change in LDL-cholesterol.Using combined GWA analysis from three clinical trials involving nearly 4,000 individuals treated with simvastatin, pravastatin, or atorvastatin, we

  20. Genome-wide association mapping reveals a rich genetic architecture of stripe rust resistance loci in emmer wheat (Triticum turgidum ssp. dicoccum).

    Science.gov (United States)

    Liu, Weizhen; Maccaferri, Marco; Chen, Xianming; Laghetti, Gaetano; Pignone, Domenico; Pumphrey, Michael; Tuberosa, Roberto

    2017-11-01

    SNP-based genome scanning in worldwide domesticated emmer germplasm showed high genetic diversity, rapid linkage disequilibrium decay and 51 loci for stripe rust resistance, a large proportion of which were novel. Cultivated emmer wheat (Triticum turgidum ssp. dicoccum), one of the oldest domesticated crops in the world, is a potentially rich reservoir of variation for improvement of resistance/tolerance to biotic and abiotic stresses in wheat. Resistance to stripe rust (Puccinia striiformis f. sp. tritici) in emmer wheat has been under-investigated. Here, we employed genome-wide association (GWAS) mapping with a mixed linear model to dissect effective stripe rust resistance loci in a worldwide collection of 176 cultivated emmer wheat accessions. Adult plants were tested in six environments and seedlings were evaluated with five races from the United States and one from Italy under greenhouse conditions. Five accessions were resistant across all experiments. The panel was genotyped with the wheat 90,000 Illumina iSelect single nucleotide polymorphism (SNP) array and 5106 polymorphic SNP markers with mapped positions were obtained. A high level of genetic diversity and fast linkage disequilibrium decay were observed. In total, we identified 14 loci associated with field resistance in multiple environments. Thirty-seven loci were significantly associated with all-stage (seedling) resistance and six of them were effective against multiple races. Of the 51 total loci, 29 were mapped distantly from previously reported stripe rust resistance genes or quantitative trait loci and represent newly discovered resistance loci. Our results suggest that GWAS is an effective method for characterizing genes in cultivated emmer wheat and confirm that emmer wheat is a rich source of stripe rust resistance loci that can be used for wheat improvement.

  1. Detection of gene–environment interaction in pedigree data using genome-wide genotypes

    Science.gov (United States)

    Nivard, Michel G; Middeldorp, Christel M; Lubke, Gitta; Hottenga, Jouke-Jan; Abdellaoui, Abdel; Boomsma, Dorret I; Dolan, Conor V

    2016-01-01

    Heritability may be estimated using phenotypic data collected in relatives or in distantly related individuals using genome-wide single nucleotide polymorphism (SNP) data. We combined these approaches by re-parameterizing the model proposed by Zaitlen et al and extended this model to include moderation of (total and SNP-based) genetic and environmental variance components by a measured moderator. By means of data simulation, we demonstrated that the type 1 error rates of the proposed test are correct and parameter estimates are accurate. As an application, we considered the moderation by age or year of birth of variance components associated with body mass index (BMI), height, attention problems (AP), and symptoms of anxiety and depression. The genetic variance of BMI was found to increase with age, but the environmental variance displayed a greater increase with age, resulting in a proportional decrease of the heritability of BMI. Environmental variance of height increased with year of birth. The environmental variance of AP increased with age. These results illustrate the assessment of moderation of environmental and genetic effects, when estimating heritability from combined SNP and family data. The assessment of moderation of genetic and environmental variance will enhance our understanding of the genetic architecture of complex traits. PMID:27436263

  2. Genome-wide association study identifies a maternal copy-number deletion in PSG11 enriched among preeclampsia patients

    Directory of Open Access Journals (Sweden)

    Zhao Linlu

    2012-06-01

    Full Text Available Abstract Background Specific genetic contributions for preeclampsia (PE are currently unknown. This genome-wide association study (GWAS aims to identify maternal single nucleotide polymorphisms (SNPs and copy-number variants (CNVs involved in the etiology of PE. Methods A genome-wide scan was performed on 177 PE cases (diagnosed according to National Heart, Lung and Blood Institute guidelines and 116 normotensive controls. White female study subjects from Iowa were genotyped on Affymetrix SNP 6.0 microarrays. CNV calls made using a combination of four detection algorithms (Birdseye, Canary, PennCNV, and QuantiSNP were merged using CNVision and screened with stringent prioritization criteria. Due to limited DNA quantities and the deleterious nature of copy-number deletions, it was decided a priori that only deletions would be selected for assay on the entire case-control dataset using quantitative real-time PCR. Results The top four SNP candidates had an allelic or genotypic p-value between 10-5 and 10-6, however, none surpassed the Bonferroni-corrected significance threshold. Three recurrent rare deletions meeting prioritization criteria detected in multiple cases were selected for targeted genotyping. A locus of particular interest was found showing an enrichment of case deletions in 19q13.31 (5/169 cases and 1/114 controls, which encompasses the PSG11 gene contiguous to a highly plastic genomic region. All algorithm calls for these regions were assay confirmed. Conclusions CNVs may confer risk for PE and represent interesting regions that warrant further investigation. Top SNP candidates identified from the GWAS, although not genome-wide significant, may be useful to inform future studies in PE genetics.

  3. Joint analysis of three genome-wide association studies of esophageal squamous cell carcinoma in Chinese populations

    Science.gov (United States)

    Zhan, Qimin; Hu, Zhibin; He, Zhonghu; Jia, Weihua; Zhou, Yifeng; Yu, Kai; Shu, Xiao-Ou; Yuan, Jian-Min; Zheng, Wei; Zhao, Xue-Ke; Gao, She-Gan; Yuan, Zhi-Qing; Zhou, Fu-You; Fan, Zong-Min; Cui, Ji-Li; Lin, Hong-Li; Han, Xue-Na; Li, Bei; Chen, Xi; Dawsey, Sanford M.; Liao, Linda; Lee, Maxwell P.; Ding, Ti; Qiao, You-Lin; Liu, Zhihua; Liu, Yu; Yu, Dianke; Chang, Jiang; Wei, Lixuan; Gao, Yu-Tang; Koh, Woon-Puay; Xiang, Yong-Bing; Tang, Ze-Zhong; Fan, Jin-Hu; Han, Jing-Jing; Zhou, Sheng-Li; Zhang, Peng; Zhang, Dong-Yun; Yuan, Yuan; Huang, Ying; Liu, Chunling; Zhai, Kan; Qiao, Yan; Jin, Guangfu; Guo, Chuanhai; Fu, Jianhua; Miao, Xiaoping; Lu, Changdong; Yang, Haijun; Wang, Chaoyu; Wheeler, William A.; Gail, Mitchell; Yeager, Meredith; Yuenger, Jeff; Guo, Er-Tao; Li, Ai-Li; Zhang, Wei; Li, Xue-Min; Sun, Liang-Dan; Ma, Bao-Gen; Li, Yan; Tang, Sa; Peng, Xiu-Qing; Liu, Jing; Hutchinson, Amy; Jacobs, Kevin; Giffen, Carol; Burdette, Laurie; Fraumeni, Joseph F.; Shen, Hongbing; Ke, Yang; Zeng, Yixin; Wu, Tangchun; Kraft, Peter; Chung, Charles C.; Tucker, Margaret A.; Hou, Zhi-Chao; Liu, Ya-Li; Hu, Yan-Long; Liu, Yu; Wang, Li; Yuan, Guo; Chen, Li-Sha; Liu, Xiao; Ma, Teng; Meng, Hui; Sun, Li; Li, Xin-Min; Li, Xiu-Min; Ku, Jian-Wei; Zhou, Ying-Fa; Yang, Liu-Qin; Wang, Zhou; Li, Yin; Qige, Qirenwang; Yang, Wen-Jun; Lei, Guang-Yan; Chen, Long-Qi; Li, En-Min; Yuan, Ling; Yue, Wen-Bin; Wang, Ran; Wang, Lu-Wen; Fan, Xue-Ping; Zhu, Fang-Heng; Zhao, Wei-Xing; Mao, Yi-Min; Zhang, Mei; Xing, Guo-Lan; Li, Ji-Lin; Han, Min; Ren, Jing-Li; Liu, Bin; Ren, Shu-Wei; Kong, Qing-Peng; Li, Feng; Sheyhidin, Ilyar; Wei, Wu; Zhang, Yan-Rui; Feng, Chang-Wei; Wang, Jin; Yang, Yu-Hua; Hao, Hong-Zhang; Bao, Qi-De; Liu, Bao-Chi; Wu, Ai-Qun; Xie, Dong; Yang, Wan-Cai; Wang, Liang; Zhao, Xiao-Hang; Chen, Shu-Qing; Hong, Jun-Yan; Zhang, Xue-Jun; Freedman, Neal D; Goldstein, Alisa M.; Lin, Dongxin; Taylor, Philip R.; Wang, Li-Dong; Chanock, Stephen J.

    2014-01-01

    We conducted a joint (pooled) analysis of three genome-wide association studies (GWAS) 1-3 of esophageal squamous cell carcinoma (ESCC) in ethnic Chinese (5,337 ESCC cases and 5,787 controls) with 9,654 ESCC cases and 10,058 controls for follow-up. In a logistic regression model adjusted for age, sex, study, and two eigenvectors, two new loci achieved genome-wide significance, marked by rs7447927 at 5q31.2 (per-allele odds ratio (OR) = 0.85, 95% CI 0.82-0.88; P=7.72x10−20) and rs1642764 at 17p13.1 (per-allele OR= 0.88, 95% CI 0.85-0.91; P=3.10x10−13). rs7447927 is a synonymous single nucleotide polymorphism (SNP) in TMEM173 and rs1642764 is an intronic SNP in ATP1B2, near TP53. Furthermore, a locus in the HLA class II region at 6p21.32 (rs35597309) achieved genome-wide significance in the two populations at highest risk for ESSC (OR=1.33, 95% CI 1.22-1.46; P=1.99x10−10). Our joint analysis identified new ESCC susceptibility loci overall as well as a new locus unique to the ESCC high risk Taihang Mountain region. PMID:25129146

  4. Concept and design of a genome-wide association genotyping array tailored for transplantation-specific studies

    DEFF Research Database (Denmark)

    Li, Yun R.; van Setten, Jessica; Verma, Shefali S.

    2015-01-01

    genome-wide genotyping array, the 'TxArray', comprising approximately 782,000 markers with tailored content for deeper capture of variants across HLA, KIR, pharmacogenomic, and metabolic loci important in transplantation. To test concordance and genotyping quality, we genotyped 85 HapMap samples...

  5. Demography-adjusted tests of neutrality based on genome-wide SNP data

    KAUST Repository

    Rafajlović, Marina

    2014-08-01

    Tests of the neutral evolution hypothesis are usually built on the standard model which assumes that mutations are neutral and the population size remains constant over time. However, it is unclear how such tests are affected if the last assumption is dropped. Here, we extend the unifying framework for tests based on the site frequency spectrum, introduced by Achaz and Ferretti, to populations of varying size. Key ingredients are the first two moments of the site frequency spectrum. We show how these moments can be computed analytically if a population has experienced two instantaneous size changes in the past. We apply our method to data from ten human populations gathered in the 1000 genomes project, estimate their demographies and define demography-adjusted versions of Tajima\\'s D, Fay & Wu\\'s H, and Zeng\\'s E. Our results show that demography-adjusted test statistics facilitate the direct comparison between populations and that most of the differences among populations seen in the original unadjusted tests can be explained by their underlying demographies. Upon carrying out whole-genome screens for deviations from neutrality, we identify candidate regions of recent positive selection. We provide track files with values of the adjusted and unadjusted tests for upload to the UCSC genome browser. © 2014 Elsevier Inc.

  6. Genome-wide SNP genotyping resolves signatures of selection and tetrasomic recombination in peanut

    Science.gov (United States)

    Peanut (Arachis hypogaea; 2n=4x=40) is a nutritious food and a good source of vitamins, minerals, and healthy fats. Expansion of genetic and genomic resources for genetic enhancement of cultivated peanut has gained momentum from the sequenced genomes of the diploid ancestors of cultivated peanut. ...

  7. Demography-adjusted tests of neutrality based on genome-wide SNP data

    KAUST Repository

    Rafajlović, Marina; Klassmann, Alexander; Eriksson, Anders; Wiehe, Thomas H E; Mehlig, Bernhard

    2014-01-01

    Tests of the neutral evolution hypothesis are usually built on the standard model which assumes that mutations are neutral and the population size remains constant over time. However, it is unclear how such tests are affected if the last assumption

  8. Genome-wide association study identified CNP12587 region underlying height variation in Chinese females.

    Directory of Open Access Journals (Sweden)

    Yin-Ping Zhang

    Full Text Available Human height is a highly heritable trait considered as an important factor for health. There has been limited success in identifying the genetic factors underlying height variation. We aim to identify sequence variants associated with adult height by a genome-wide association study of copy number variants (CNVs in Chinese.Genome-wide CNV association analyses were conducted in 1,625 unrelated Chinese adults and sex specific subgroup for height variation, respectively. Height was measured with a stadiometer. Affymetrix SNP6.0 genotyping platform was used to identify copy number polymorphisms (CNPs. We constructed a genomic map containing 1,009 CNPs in Chinese individuals and performed a genome-wide association study of CNPs with height.We detected 10 significant association signals for height (p<0.05 in the whole population, 9 and 11 association signals for Chinese female and male population, respectively. A copy number polymorphism (CNP12587, chr18:54081842-54086942, p = 2.41 × 10(-4 was found to be significantly associated with height variation in Chinese females even after strict Bonferroni correction (p = 0.048. Confirmatory real time PCR experiments lent further support for CNV validation. Compared to female subjects with two copies of the CNP, carriers of three copies had an average of 8.1% decrease in height. An important candidate gene, ubiquitin-protein ligase NEDD4-like (NEDD4L, was detected at this region, which plays important roles in bone metabolism by binding to bone formation regulators.Our findings suggest the important genetic variants underlying height variation in Chinese.

  9. a potential source of spurious associations in genome-wide ...

    Indian Academy of Sciences (India)

    2010-04-01

    Apr 1, 2010 ... Genome-wide association studies (GWAS) examine the entire human genome with the goal of identifying genetic variants. (usually single nucleotide polymorphisms (SNPs)) that are associated with phenotypic traits such as disease status and drug response. The discordance of significantly associated ...

  10. Genome-wide association study identifies five new schizophrenia loci

    NARCIS (Netherlands)

    Ripke, S.; Sanders, A. R.; Kendler, K. S.; Levinson, D. F.; Sklar, P.; Holmans, P. A.; Lin, D. Y.; Duan, J.; Ophoff, R. A.; Andreassen, O. A.; Scolnick, E.; Cichon, S.; St Clair, D.; Corvin, A.; Gurling, H.; Werge, T.; Rujescu, D.; Blackwood, D. H.; Pato, C. N.; Malhotra, A. K.; Purcell, S.; Dudbridge, F.; Neale, B. M.; Rossin, L.; Visscher, P. M.; Posthuma, D.; Ruderfer, D. M.; Fanous, A.; Stefansson, H.; Steinberg, S.; Mowry, B. J.; Golimbet, V.; de Hert, M.; Jonsson, E. G.; Bitter, I.; Pietilainen, O. P.; Collier, D. A.; Tosato, S.; Agartz, I.; Albus, M.; Alexander, M.; Amdur, R. L.; Amin, F.; Bass, N.; Bergen, S. E.; Black, D. W.; Borglum, A. D.; Brown, M. A.; Bruggeman, R.; Buccola, N. G.; Byerley, W. F.; Cahn, W.; Cantor, R. M.; Carr, V. J.; Catts, S. V.; Choudhury, K.; Cloninger, C. R.; Cormican, P.; Craddock, N.; Danoy, P. A.; Datta, S.; de Haan, L.; Demontis, D.; Dikeos, D.; Djurovic, S.; Donnely, P.; Donohoe, G.; Duong, L.; Dwyer, S.; Fink-Jensen, A.; Freedman, R.; Freimer, N. B.; Friedl, M.; Georgieva, L.; Giegling, I.; Gill, M.; Glenthoj, B.; Godard, S.; Hamshere, M.; Hansen, M.; Hartmann, A. M.; Henskens, F. A.; Hougaard, D. M.; Hultman, C. M.; Ingason, A.; Jablensky, A. V.; Jakobsen, K. D.; Jay, M.; Jurgens, G.; Kahn, R. S.; Keller, M. C.; Kenis, G.; Kenny, E.; Kim, Y.; Kirov, G. K.; Konnerth, H.; Konte, B.; Krabbendam, L.; Krasucki, R.; Lasseter, V. K.; Laurent, C.; Lawrence, J.; Lencz, T.; Lerer, F. B.; Liang, K. Y.; Lichtenstein, P.; Lieberman, J. A.; Linszen, D. H.; Lonnqvist, J.; Loughland, C. M.; Maclean, A. W.; Maher, B. S.; Maier, W.; Mallet, J.; Malloy, P.; Mattheisen, M.; Mattingsdal, M.; McGhee, K. A.; McGrath, J. J.; McIntosh, A.; McLean, D. E.; McQuillin, A.; Melle, I.; Michie, P. T.; Milanova, V.; Morris, D. W.; Mors, O.; Mortensen, P. B.; Moskvina, V.; Muglia, P.; Myin-Germeys, I.; Nertney, D. A.; Nestadt, G.; Nielsen, J.; Nikolov, I.; Nordentoft, M.; Norton, N.; Nothen, M. M.; O'Dushlaine, C. T.; Olincy, A.; Olsen, L.; O'Neill, F. A.; Orntoft, T. F.; Owen, M. J.; Pantelis, C.; Papadimitriou, G.; Pato, M. T.; Peltonen, L.; Petursson, H.; Pickard, B.; Pimm, J.; Pulver, A. E.; Puri, V.; Quested, D.; Quinn, E. M.; Rasmussen, H. B.; Rethelyi, J. M.; Ribble, R.; Rietschel, M.; Riley, B. P.; Ruggeri, M.; Schall, U.; Schulze, T. G.; Schwab, S. G.; Scott, R. J.; Shi, J.; Sigurdsson, E.; Silvermann, J. M.; Spencer, C. C.; Stefansson, K.; Strange, A.; Strengman, E.; Stroup, T. S.; Suvisaari, J.; Terenius, L.; Thirumalai, S.; Thygesen, J. H.; Timm, S.; Toncheva, D.; van den Oord, E.; van Os, J.; van Winkel, R.; Veldink, J.; Walsh, D.; Wang, A. G.; Wiersma, D.; Wildenauer, D. B.; Williams, H. J.; Williams, N. M.; Wormley, B.; Zammit, S.; Sullivan, P. F.; O'Donovan, M. C.; Daly, M. J.; Gejman, P. V.

    2011-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded

  11. Genome-wide linkage analysis for human longevity

    DEFF Research Database (Denmark)

    Beekman, Marian; Blanché, Hélène; Perola, Markus

    2013-01-01

    Clear evidence exists for heritability of human longevity, and much interest is focused on identifying genes associated with longer lives. To identify such longevity alleles, we performed the largest genome-wide linkage scan thus far reported. Linkage analyses included 2118 nonagenarian Caucasian...

  12. Genome-wide association study of Tourette's syndrome

    NARCIS (Netherlands)

    Scharf, J. M.; Yu, D.; Mathews, C. A.; Neale, B. M.; Stewart, S. E.; Fagerness, J. A.; Evans, P.; Gamazon, E.; Edlund, C. K.; Service, S. K.; Tikhomirov, A.; Osiecki, L.; Illmann, C.; Pluzhnikov, A.; Konkashbaev, A.; Davis, L. K.; Han, B.; Crane, J.; Moorjani, P.; Crenshaw, A. T.; Parkin, M. A.; Reus, V. I.; Lowe, T. L.; Rangel-Lugo, M.; Chouinard, S.; Dion, Y.; Girard, S.; Cath, D. C.; Smit, J. H.; King, R. A.; Fernandez, T. V.; Leckman, J. F.; Kidd, K. K.; Kidd, J. R.; Pakstis, A. J.; State, M. W.; Herrera, L. D.; Romero, R.; Fournier, E.; Sandor, P.; Barr, C. L.; Phan, N.; Gross-Tsur, V.; Benarroch, F.; Pollak, Y.; Budman, C. L.; Bruun, R. D.; Erenberg, G.; Naarden, A. L.; Hoekstra, P. J.

    2013-01-01

    Tourette's syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association

  13. Genome-wide association study identifies five new schizophrenia loci.

    LENUS (Irish Health Repository)

    Ripke, Stephan

    2011-10-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10(-11)) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10(-9)), ANK3 (rs10994359, P = 2.5 × 10(-8)) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10(-9)).

  14. Genome-wide association studies (GWAS) of adiposity

    DEFF Research Database (Denmark)

    Oskari Kilpeläinen, Tuomas; Ingelsson, Erik

    2016-01-01

    Adiposity is strongly heritable and one of the leading risk factors for type 2 diabetes, cardiovascular disease, cancer, and premature death. In the past 8 years, genome-wide association studies (GWAS) have greatly increased our understanding of the genes and biological pathways that regulate...

  15. Genome-wide copy number variation (CNV in patients with autoimmune Addison's disease

    Directory of Open Access Journals (Sweden)

    Brønstad Ingeborg

    2011-08-01

    Full Text Available Abstract Background Addison's disease (AD is caused by an autoimmune destruction of the adrenal cortex. The pathogenesis is multi-factorial, involving genetic components and hitherto unknown environmental factors. The aim of the present study was to investigate if gene dosage in the form of copy number variation (CNV could add to the repertoire of genetic susceptibility to autoimmune AD. Methods A genome-wide study using the Affymetrix GeneChip® Genome-Wide Human SNP Array 6.0 was conducted in 26 patients with AD. CNVs in selected genes were further investigated in a larger material of patients with autoimmune AD (n = 352 and healthy controls (n = 353 by duplex Taqman real-time polymerase chain reaction assays. Results We found that low copy number of UGT2B28 was significantly more frequent in AD patients compared to controls; conversely high copy number of ADAM3A was associated with AD. Conclusions We have identified two novel CNV associations to ADAM3A and UGT2B28 in AD. The mechanism by which this susceptibility is conferred is at present unclear, but may involve steroid inactivation (UGT2B28 and T cell maturation (ADAM3A. Characterization of these proteins may unravel novel information on the pathogenesis of autoimmunity.

  16. Genome-wide copy number variation (CNV) in patients with autoimmune Addison's disease

    Science.gov (United States)

    2011-01-01

    Background Addison's disease (AD) is caused by an autoimmune destruction of the adrenal cortex. The pathogenesis is multi-factorial, involving genetic components and hitherto unknown environmental factors. The aim of the present study was to investigate if gene dosage in the form of copy number variation (CNV) could add to the repertoire of genetic susceptibility to autoimmune AD. Methods A genome-wide study using the Affymetrix GeneChip® Genome-Wide Human SNP Array 6.0 was conducted in 26 patients with AD. CNVs in selected genes were further investigated in a larger material of patients with autoimmune AD (n = 352) and healthy controls (n = 353) by duplex Taqman real-time polymerase chain reaction assays. Results We found that low copy number of UGT2B28 was significantly more frequent in AD patients compared to controls; conversely high copy number of ADAM3A was associated with AD. Conclusions We have identified two novel CNV associations to ADAM3A and UGT2B28 in AD. The mechanism by which this susceptibility is conferred is at present unclear, but may involve steroid inactivation (UGT2B28) and T cell maturation (ADAM3A). Characterization of these proteins may unravel novel information on the pathogenesis of autoimmunity. PMID:21851588

  17. Genome-wide significant locus for Research Diagnostic Criteria Schizoaffective Disorder Bipolar type.

    Science.gov (United States)

    Green, Elaine K; Di Florio, Arianna; Forty, Liz; Gordon-Smith, Katherine; Grozeva, Detelina; Fraser, Christine; Richards, Alexander L; Moran, Jennifer L; Purcell, Shaun; Sklar, Pamela; Kirov, George; Owen, Michael J; O'Donovan, Michael C; Craddock, Nick; Jones, Lisa; Jones, Ian R

    2017-12-01

    Studies have suggested that Research Diagnostic Criteria for Schizoaffective Disorder Bipolar type (RDC-SABP) might identify a more genetically homogenous subgroup of bipolar disorder. Aiming to identify loci associated with RDC-SABP, we have performed a replication study using independent RDC-SABP cases (n = 144) and controls (n = 6,559), focusing on the 10 loci that reached a p-value bipolar disorder sample. Combining the WTCCC and replication datasets by meta-analysis (combined RDC-SABP, n = 423, controls, n = 9,494), we observed genome-wide significant association at one SNP, rs2352974, located within the intron of the gene TRAIP on chromosome 3p21.31 (p-value, 4.37 × 10 -8 ). This locus did not reach genome-wide significance in bipolar disorder or schizophrenia large Psychiatric Genomic Consortium datasets, suggesting that it may represent a relatively specific genetic risk for the bipolar subtype of schizoaffective disorder. © 2017 Wiley Periodicals, Inc.

  18. Genome-wide association study of the four-constitution medicine.

    Science.gov (United States)

    Yin, Chang Shik; Park, Hi Joon; Chung, Joo-Ho; Lee, Hye-Jung; Lee, Byung-Cheol

    2009-12-01

    Four-constitution medicine (FCM), also known as Sasang constitutional medicine, and the heritage of the long history of individualized acupuncture medicine tradition, is one of the holistic and traditional systems of constitution to appraise and categorize individual differences into four major types. This study first reports a genome-wide association study on FCM, to explore the genetic basis of FCM and facilitate the integration of FCM with conventional individual differences research. Healthy individuals of the Korean population were classified into the four constitutional types (FCTs). A total of 353,202 single nucleotide polymorphisms (SNPs) were typed using whole genome amplified samples, and six-way comparison of FCM types provided lists of significantly differential SNPs. In one-to-one FCT comparisons, 15,944 SNPs were significantly differential, and 5 SNPs were commonly significant in all of the three comparisons. In one-to-two FCT comparisons, 22,616 SNPs were significantly differential, and 20 SNPs were commonly significant in all of the three comparison groups. This study presents the association between genome-wide SNP profiles and the categorization of the FCM, and it could further provide a starting point of genome-based identification and research of the constitutions of FCM.

  19. Genome-Wide Association Study Identifies Four Loci Associated with Eruption of Permanent Teeth

    Science.gov (United States)

    Zhang, Hao; Shaffer, John R.; Hansen, Thomas; Esserlind, Ann-Louise; Boyd, Heather A.; Nohr, Ellen A.; Timpson, Nicholas J.; Fatemifar, Ghazaleh; Paternoster, Lavinia; Evans, David M.; Weyant, Robert J.; Levy, Steven M.; Lathrop, Mark; Smith, George Davey; Murray, Jeffrey C.; Olesen, Jes; Werge, Thomas; Marazita, Mary L.; Sørensen, Thorkild I. A.; Melbye, Mads

    2011-01-01

    The sequence and timing of permanent tooth eruption is thought to be highly heritable and can have important implications for the risk of malocclusion, crowding, and periodontal disease. We conducted a genome-wide association study of number of permanent teeth erupted between age 6 and 14 years, analyzed as age-adjusted standard deviation score averaged over multiple time points, based on childhood records for 5,104 women from the Danish National Birth Cohort. Four loci showed association at Peruption and were also known to influence height and breast cancer, respectively. The two other loci pointed to genomic regions without any previous significant genome-wide association study results. The intronic SNP rs7924176 in ADK could be linked to gene expression in monocytes. The combined effect of the four genetic variants was most pronounced between age 10 and 12 years, where children with 6 to 8 delayed tooth eruption alleles had on average 3.5 (95% confidence interval: 2.9–4.1) fewer permanent teeth than children with 0 or 1 of these alleles. PMID:21931568

  20. Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes

    Directory of Open Access Journals (Sweden)

    Sungkyoung Choi

    2016-12-01

    Full Text Available The success of genome-wide association studies (GWASs has enabled us to improve risk assessment and provide novel genetic variants for diagnosis, prevention, and treatment. However, most variants discovered by GWASs have been reported to have very small effect sizes on complex human diseases, which has been a big hurdle in building risk prediction models. Recently, many statistical approaches based on penalized regression have been developed to solve the “large p and small n” problem. In this report, we evaluated the performance of several statistical methods for predicting a binary trait: stepwise logistic regression (SLR, least absolute shrinkage and selection operator (LASSO, and Elastic-Net (EN. We first built a prediction model by combining variable selection and prediction methods for type 2 diabetes using Affymetrix Genome-Wide Human SNP Array 5.0 from the Korean Association Resource project. We assessed the risk prediction performance using area under the receiver operating characteristic curve (AUC for the internal and external validation datasets. In the internal validation, SLR-LASSO and SLR-EN tended to yield more accurate predictions than other combinations. During the external validation, the SLR-SLR and SLR-EN combinations achieved the highest AUC of 0.726. We propose these combinations as a potentially powerful risk prediction model for type 2 diabetes.

  1. Genome-wide association study of handedness excludes simple genetic models

    Science.gov (United States)

    Armour, J AL; Davison, A; McManus, I C

    2014-01-01

    Handedness is a human behavioural phenotype that appears to be congenital, and is often assumed to be inherited, but for which the developmental origin and underlying causation(s) have been elusive. Models of the genetic basis of variation in handedness have been proposed that fit different features of the observed resemblance between relatives, but none has been decisively tested or a corresponding causative locus identified. In this study, we applied data from well-characterised individuals studied at the London Twin Research Unit. Analysis of genome-wide SNP data from 3940 twins failed to identify any locus associated with handedness at a genome-wide level of significance. The most straightforward interpretation of our analyses is that they exclude the simplest formulations of the ‘right-shift' model of Annett and the ‘dextral/chance' model of McManus, although more complex modifications of those models are still compatible with our observations. For polygenic effects, our study is inadequately powered to reliably detect alleles with effect sizes corresponding to an odds ratio of 1.2, but should have good power to detect effects at an odds ratio of 2 or more. PMID:24065183

  2. Genome-wide quantitative trait loci mapping of the human cerebrospinal fluid proteome.

    Science.gov (United States)

    Sasayama, Daimei; Hattori, Kotaro; Ogawa, Shintaro; Yokota, Yuuki; Matsumura, Ryo; Teraishi, Toshiya; Hori, Hiroaki; Ota, Miho; Yoshida, Sumiko; Kunugi, Hiroshi

    2017-01-01

    Cerebrospinal fluid (CSF) is virtually the only one accessible source of proteins derived from the central nervous system (CNS) of living humans and possibly reflects the pathophysiology of a variety of neuropsychiatric diseases. However, little is known regarding the genetic basis of variation in protein levels of human CSF. We examined CSF levels of 1,126 proteins in 133 subjects and performed a genome-wide association analysis of 514,227 single nucleotide polymorphisms (SNPs) to detect protein quantitative trait loci (pQTLs). To be conservative, Spearman's correlation was used to identify an association between genotypes of SNPs and protein levels. A total of 421 cis and 25 trans SNP-protein pairs were significantly correlated at a false discovery rate (FDR) of less than 0.01 (nominal P genome-wide association studies. The present findings suggest that genetic variations play an important role in the regulation of protein expression in the CNS. The obtained database may serve as a valuable resource to understand the genetic bases for CNS protein expression pattern in humans. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  3. Development and dissection of diagnostic SNP markers for the downy mildew resistance genes Pl Arg and Pl 8 and maker-assisted gene pyramiding in sunflower (Helianthus annuus L.).

    Science.gov (United States)

    Qi, L L; Talukder, Z I; Hulke, B S; Foley, M E

    2017-06-01

    Diagnostic DNA markers are an invaluable resource in breeding programs for successful introgression and pyramiding of disease resistance genes. Resistance to downy mildew (DM) disease in sunflower is mediated by Pl genes which are known to be effective against the causal fungus, Plasmopara halstedii. Two DM resistance genes, Pl Arg and Pl 8 , are highly effective against P. halstedii races in the USA, and have been previously mapped to the sunflower linkage groups (LGs) 1 and 13, respectively, using simple sequence repeat (SSR) markers. In this study, we developed high-density single nucleotide polymorphism (SNP) maps encompassing the Pl arg and Pl 8 genes and identified diagnostic SNP markers closely linked to these genes. The specificity of the diagnostic markers was validated in a highly diverse panel of 548 sunflower lines. Dissection of a large marker cluster co-segregated with Pl Arg revealed that the closest SNP markers NSA_007595 and NSA_001835 delimited Pl Arg to an interval of 2.83 Mb on the LG1 physical map. The SNP markers SFW01497 and SFW06597 delimited Pl 8 to an interval of 2.85 Mb on the LG13 physical map. We also developed sunflower lines with homozygous, three gene pyramids carrying Pl Arg , Pl 8 , and the sunflower rust resistance gene R 12 using the linked SNP markers from a segregating F 2 population of RHA 340 (carrying Pl 8 )/RHA 464 (carrying Pl Arg and R 12 ). The high-throughput diagnostic SNP markers developed in this study will facilitate marker-assisted selection breeding, and the pyramided sunflower lines will provide durable resistance to downy mildew and rust diseases.

  4. Genome-Wide Association Study Singles Out SCD and LEPR as the Two Main Loci Influencing Intramuscular Fat Content and Fatty Acid Composition in Duroc Pigs.

    Directory of Open Access Journals (Sweden)

    Roger Ros-Freixedes

    Full Text Available Intramuscular fat (IMF content and fatty acid composition affect the organoleptic quality and nutritional value of pork. A genome-wide association study was performed on 138 Duroc pigs genotyped with a 60k SNP chip to detect biologically relevant genomic variants influencing fat content and composition. Despite the limited sample size, the genome-wide association study was powerful enough to detect the association between fatty acid composition and a known haplotypic variant in SCD (SSC14 and to reveal an association of IMF and fatty acid composition in the LEPR region (SSC6. The association of LEPR was later validated with an independent set of 853 pigs using a candidate quantitative trait nucleotide. The SCD gene is responsible for the biosynthesis of oleic acid (C18:1 from stearic acid. This locus affected the stearic to oleic desaturation index (C18:1/C18:0, C18:1, and saturated (SFA and monounsaturated (MUFA fatty acids content. These effects were consistently detected in gluteus medius, longissimus dorsi, and subcutaneous fat. The association of LEPR with fatty acid composition was detected only in muscle and was, at least in part, a consequence of its effect on IMF content, with increased IMF resulting in more SFA, less polyunsaturated fatty acids (PUFA, and greater SFA/PUFA ratio. Marker substitution effects estimated with a subset of 65 animals were used to predict the genomic estimated breeding values of 70 animals born 7 years later. Although predictions with the whole SNP chip information were in relatively high correlation with observed SFA, MUFA, and C18:1/C18:0 (0.48-0.60, IMF content and composition were in general better predicted by using only SNPs at the SCD and LEPR loci, in which case the correlation between predicted and observed values was in the range of 0.36 to 0.54 for all traits. Results indicate that markers in the SCD and LEPR genes can be useful to select for optimum fatty acid profiles of pork.

  5. Evidence for gene-environment interaction in a genome wide study of nonsyndromic cleft palate

    DEFF Research Database (Denmark)

    Beaty, Terri H; Ruczinski, Ingo; Murray, Jeffrey C

    2011-01-01

    Nonsyndromic cleft palate (CP) is a common birth defect with a complex and heterogeneous etiology involving both genetic and environmental risk factors. We conducted a genome-wide association study (GWAS) using 550 case-parent trios, ascertained through a CP case collected in an international...... consortium. Family-based association tests of single nucleotide polymorphisms (SNP) and three common maternal exposures (maternal smoking, alcohol consumption, and multivitamin supplementation) were used in a combined 2 df test for gene (G) and gene-environment (G × E) interaction simultaneously, plus...... multiple SNPs associated with higher risk of CP in the presence of maternal smoking. Additional evidence of reduced risk due to G × E interaction in the presence of multivitamin supplementation was observed for SNPs in BAALC on chr. 8. These results emphasize the need to consider G × E interaction when...

  6. Genome-wide re-sequencing of multidrug-resistant Mycobacterium leprae Airaku-3.

    Science.gov (United States)

    Singh, P; Benjak, A; Carat, S; Kai, M; Busso, P; Avanzi, C; Paniz-Mondolfi, A; Peter, C; Harshman, K; Rougemont, J; Matsuoka, M; Cole, S T

    2014-10-01

    Genotyping and molecular characterization of drug resistance mechanisms in Mycobacterium leprae enables disease transmission and drug resistance trends to be monitored. In the present study, we performed genome-wide analysis of Airaku-3, a multidrug-resistant strain with an unknown mechanism of resistance to rifampicin. We identified 12 unique non-synonymous single-nucleotide polymorphisms (SNPs) including two in the transporter-encoding ctpC and ctpI genes. In addition, two SNPs were found that improve the resolution of SNP-based genotyping, particularly for Venezuelan and South East Asian strains of M. leprae. © 2014 The Authors Clinical Microbiology and Infection © 2014 European Society of Clinical Microbiology and Infectious Diseases.

  7. Genome-wide association study meta-analysis of European and Asian-ancestry samples identifies three novel loci associated with bipolar disorder.

    Science.gov (United States)

    Chen, D T; Jiang, X; Akula, N; Shugart, Y Y; Wendland, J R; Steele, C J M; Kassem, L; Park, J-H; Chatterjee, N; Jamain, S; Cheng, A; Leboyer, M; Muglia, P; Schulze, T G; Cichon, S; Nöthen, M M; Rietschel, M; McMahon, F J; Farmer, A; McGuffin, P; Craig, I; Lewis, C; Hosang, G; Cohen-Woods, S; Vincent, J B; Kennedy, J L; Strauss, J

    2013-02-01

    Meta-analyses of bipolar disorder (BD) genome-wide association studies (GWAS) have identified several genome-wide significant signals in European-ancestry samples, but so far account for little of the inherited risk. We performed a meta-analysis of ∼750,000 high-quality genetic markers on a combined sample of ∼14,000 subjects of European and Asian-ancestry (phase I). The most significant findings were further tested in an extended sample of ∼17,700 cases and controls (phase II). The results suggest novel association findings near the genes TRANK1 (LBA1), LMAN2L and PTGFR. In phase I, the most significant single nucleotide polymorphism (SNP), rs9834970 near TRANK1, was significant at the P=2.4 × 10(-11) level, with no heterogeneity. Supportive evidence for prior association findings near ANK3 and a locus on chromosome 3p21.1 was also observed. The phase II results were similar, although the heterogeneity test became significant for several SNPs. On the basis of these results and other established risk loci, we used the method developed by Park et al. to estimate the number, and the effect size distribution, of BD risk loci that could still be found by GWAS methods. We estimate that >63,000 case-control samples would be needed to identify the ∼105 BD risk loci discoverable by GWAS, and that these will together explain <6% of the inherited risk. These results support previous GWAS findings and identify three new candidate genes for BD. Further studies are needed to replicate these findings and may potentially lead to identification of functional variants. Sample size will remain a limiting factor in the discovery of common alleles associated with BD.

  8. Genome-Wide Meta-Analysis of Longitudinal Alcohol Consumption Across Youth and Early Adulthood.

    Science.gov (United States)

    Adkins, Daniel E; Clark, Shaunna L; Copeland, William E; Kennedy, Martin; Conway, Kevin; Angold, Adrian; Maes, Hermine; Liu, Youfang; Kumar, Gaurav; Erkanli, Alaattin; Patkar, Ashwin A; Silberg, Judy; Brown, Tyson H; Fergusson, David M; Horwood, L John; Eaves, Lindon; van den Oord, Edwin J C G; Sullivan, Patrick F; Costello, E J

    2015-08-01

    The public health burden of alcohol is unevenly distributed across the life course, with levels of use, abuse, and dependence increasing across adolescence and peaking in early adulthood. Here, we leverage this temporal patterning to search for common genetic variants predicting developmental trajectories of alcohol consumption. Comparable psychiatric evaluations measuring alcohol consumption were collected in three longitudinal community samples (N=2,126, obs=12,166). Consumption-repeated measurements spanning adolescence and early adulthood were analyzed using linear mixed models, estimating individual consumption trajectories, which were then tested for association with Illumina 660W-Quad genotype data (866,099 SNPs after imputation and QC). Association results were combined across samples using standard meta-analysis methods. Four meta-analysis associations satisfied our pre-determined genome-wide significance criterion (FDR<0.1) and six others met our 'suggestive' criterion (FDR<0.2). Genome-wide significant associations were highly biological plausible, including associations within GABA transporter 1, SLC6A1 (solute carrier family 6, member 1), and exonic hits in LOC100129340 (mitofusin-1-like). Pathway analyses elaborated single marker results, indicating significant enriched associations to intuitive biological mechanisms, including neurotransmission, xenobiotic pharmacodynamics, and nuclear hormone receptors (NHR). These findings underscore the value of combining longitudinal behavioral data and genome-wide genotype information in order to study developmental patterns and improve statistical power in genomic studies.

  9. Shared and unique components of human population structure and genome-wide signals of positive selection in South Asia.

    Science.gov (United States)

    Metspalu, Mait; Romero, Irene Gallego; Yunusbayev, Bayazit; Chaubey, Gyaneshwer; Mallick, Chandana Basu; Hudjashov, Georgi; Nelis, Mari; Mägi, Reedik; Metspalu, Ene; Remm, Maido; Pitchappan, Ramasamy; Singh, Lalji; Thangaraj, Kumarasamy; Villems, Richard; Kivisild, Toomas

    2011-12-09

    South Asia harbors one of the highest levels genetic diversity in Eurasia, which could be interpreted as a result of its long-term large effective population size and of admixture during its complex demographic history. In contrast to Pakistani populations, populations of Indian origin have been underrepresented in previous genomic scans of positive selection and population structure. Here we report data for more than 600,000 SNP markers genotyped in 142 samples from 30 ethnic groups in India. Combining our results with other available genome-wide data, we show that Indian populations are characterized by two major ancestry components, one of which is spread at comparable frequency and haplotype diversity in populations of South and West Asia and the Caucasus. The second component is more restricted to South Asia and accounts for more than 50% of the ancestry in Indian populations. Haplotype diversity associated with these South Asian ancestry components is significantly higher than that of the components dominating the West Eurasian ancestry palette. Modeling of the observed haplotype diversities suggests that both Indian ancestry components are older than the purported Indo-Aryan invasion 3,500 YBP. Consistent with the results of pairwise genetic distances among world regions, Indians share more ancestry signals with West than with East Eurasians. However, compared to Pakistani populations, a higher proportion of their genes show regionally specific signals of high haplotype homozygosity. Among such candidates of positive selection in India are MSTN and DOK5, both of which have potential implications in lipid metabolism and the etiology of type 2 diabetes. Copyright © 2011 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  10. Analysis of Genome-Wide Association Studies with Multiple Outcomes Using Penalization

    Science.gov (United States)

    Liu, Jin; Huang, Jian; Ma, Shuangge

    2012-01-01

    Genome-wide association studies have been extensively conducted, searching for markers for biologically meaningful outcomes and phenotypes. Penalization methods have been adopted in the analysis of the joint effects of a large number of SNPs (single nucleotide polymorphisms) and marker identification. This study is partly motivated by the analysis of heterogeneous stock mice dataset, in which multiple correlated phenotypes and a large number of SNPs are available. Existing penalization methods designed to analyze a single response variable cannot accommodate the correlation among multiple response variables. With multiple response variables sharing the same set of markers, joint modeling is first employed to accommodate the correlation. The group Lasso approach is adopted to select markers associated with all the outcome variables. An efficient computational algorithm is developed. Simulation study and analysis of the heterogeneous stock mice dataset show that the proposed method can outperform existing penalization methods. PMID:23272092

  11. High-Resolution Genome-Wide Linkage Mapping Identifies Susceptibility Loci for BMI in the Chinese Population

    DEFF Research Database (Denmark)

    Zhang, Dong Feng; Pang, Zengchang; Li, Shuxia

    2012-01-01

    The genetic loci affecting the commonly used BMI have been intensively investigated using linkage approaches in multiple populations. This study aims at performing the first genome-wide linkage scan on BMI in the Chinese population in mainland China with hypothesis that heterogeneity in genetic...... linkage could exist in different ethnic populations. BMI was measured from 126 dizygotic twins in Qingdao municipality who were genotyped using high-resolution Affymetrix Genome-Wide Human SNP arrays containing about 1 million single-nucleotide polymorphisms (SNPs). Nonparametric linkage analysis...... in western countries. Multiple loci showing suggestive linkage were found on chromosome 1 (lod score 2.38 at 242 cM), chromosome 8 (2.48 at 95 cM), and chromosome 14 (2.2 at 89.4 cM). The strong linkage identified in the Chinese subjects that is consistent with that found in populations of European origin...

  12. Genome-wide association analysis reveals putative Alzheimer's disease susceptibility loci in addition to APOE.

    Science.gov (United States)

    Bertram, Lars; Lange, Christoph; Mullin, Kristina; Parkinson, Michele; Hsiao, Monica; Hogan, Meghan F; Schjeide, Brit M M; Hooli, Basavaraj; Divito, Jason; Ionita, Iuliana; Jiang, Hongyu; Laird, Nan; Moscarillo, Thomas; Ohlsen, Kari L; Elliott, Kathryn; Wang, Xin; Hu-Lince, Diane; Ryder, Marie; Murphy, Amy; Wagner, Steven L; Blacker, Deborah; Becker, K David; Tanzi, Rudolph E

    2008-11-01

    Alzheimer's disease (AD) is a genetically complex and heterogeneous disorder. To date four genes have been established to either cause early-onset autosomal-dominant AD (APP, PSEN1, and PSEN2(1-4)) or to increase susceptibility for late-onset AD (APOE5). However, the heritability of late-onset AD is as high as 80%, (6) and much of the phenotypic variance remains unexplained to date. We performed a genome-wide association (GWA) analysis using 484,522 single-nucleotide polymorphisms (SNPs) on a large (1,376 samples from 410 families) sample of AD families of self-reported European descent. We identified five SNPs showing either significant or marginally significant genome-wide association with a multivariate phenotype combining affection status and onset age. One of these signals (p = 5.7 x 10(-14)) was elicited by SNP rs4420638 and probably reflects APOE-epsilon4, which maps 11 kb proximal (r2 = 0.78). The other four signals were tested in three additional independent AD family samples composed of nearly 2700 individuals from almost 900 families. Two of these SNPs showed significant association in the replication samples (combined p values 0.007 and 0.00002). The SNP (rs11159647, on chromosome 14q31) with the strongest association signal also showed evidence of association with the same allele in GWA data generated in an independent sample of approximately 1,400 AD cases and controls (p = 0.04). Although the precise identity of the underlying locus(i) remains elusive, our study provides compelling evidence for the existence of at least one previously undescribed AD gene that, like APOE-epsilon4, primarily acts as a modifier of onset age.

  13. Genome-Wide Associations Related to Hepatic Histology in Nonalcoholic Fatty Liver Disease in Hispanic Boys.

    Science.gov (United States)

    Wattacheril, Julia; Lavine, Joel E; Chalasani, Naga P; Guo, Xiuqing; Kwon, Soonil; Schwimmer, Jeffrey; Molleston, Jean P; Loomba, Rohit; Brunt, Elizabeth M; Chen, Yii-Der Ida; Goodarzi, Mark O; Taylor, Kent D; Yates, Katherine P; Tonascia, James; Rotter, Jerome I

    2017-11-01

    To identify genetic loci associated with features of histologic severity of nonalcoholic fatty liver disease in a cohort of Hispanic boys. There were 234 eligible Hispanic boys age 2-17 years with clinical, laboratory, and histologic data enrolled in the Nonalcoholic Steatohepatitis Clinical Research Network included in the analysis of 624 297 single nucleotide polymorphisms (SNPs). After the elimination of 4 outliers and 22 boys with cryptic relatedness, association analyses were performed on 208 DNA samples with corresponding liver histology. Logistic regression analyses were carried out for qualitative traits and linear regression analyses were applied for quantitative traits. The median age and body mass index z-score were 12.0 years (IQR, 11.0-14.0) and 2.4 (IQR, 2.1-2.6), respectively. The nonalcoholic fatty liver disease activity score (scores 1-4 vs 5-8) was associated with SNP rs11166927 on chromosome 8 in the TRAPPC9 region (P = 8.7 -07 ). Fibrosis stage was associated with SNP rs6128907 on chromosome 20, near actin related protein 5 homolog (p = 9.9 -07 ). In comparing our results in Hispanic boys with those of previously reported SNPs in adult nonalcoholic steatohepatitis, 2 of 26 susceptibility loci were associated with nonalcoholic fatty liver disease activity score and 2 were associated with fibrosis stage. In this discovery genome-wide association study, we found significant novel gene effects on histologic traits associated with nonalcoholic fatty liver disease activity score and fibrosis that are distinct from those previously recognized by adult nonalcoholic fatty liver disease genome-wide association studies. Copyright © 2017 Elsevier Inc. All rights reserved.

  14. Genome-wide analysis of adolescent psychotic-like experiences shows genetic overlap with psychiatric disorders.

    Science.gov (United States)

    Pain, Oliver; Dudbridge, Frank; Cardno, Alastair G; Freeman, Daniel; Lu, Yi; Lundstrom, Sebastian; Lichtenstein, Paul; Ronald, Angelica

    2018-03-31

    This study aimed to test for overlap in genetic influences between psychotic-like experience traits shown by adolescents in the community, and clinically-recognized psychiatric disorders in adulthood, specifically schizophrenia, bipolar disorder, and major depression. The full spectra of psychotic-like experience domains, both in terms of their severity and type (positive, cognitive, and negative), were assessed using self- and parent-ratings in three European community samples aged 15-19 years (Final N incl. siblings = 6,297-10,098). A mega-genome-wide association study (mega-GWAS) for each psychotic-like experience domain was performed. Single nucleotide polymorphism (SNP)-heritability of each psychotic-like experience domain was estimated using genomic-relatedness-based restricted maximum-likelihood (GREML) and linkage disequilibrium- (LD-) score regression. Genetic overlap between specific psychotic-like experience domains and schizophrenia, bipolar disorder, and major depression was assessed using polygenic risk score (PRS) and LD-score regression. GREML returned SNP-heritability estimates of 3-9% for psychotic-like experience trait domains, with higher estimates for less skewed traits (Anhedonia, Cognitive Disorganization) than for more skewed traits (Paranoia and Hallucinations, Parent-rated Negative Symptoms). Mega-GWAS analysis identified one genome-wide significant association for Anhedonia within IDO2 but which did not replicate in an independent sample. PRS analysis revealed that the schizophrenia PRS significantly predicted all adolescent psychotic-like experience trait domains (Paranoia and Hallucinations only in non-zero scorers). The major depression PRS significantly predicted Anhedonia and Parent-rated Negative Symptoms in adolescence. Psychotic-like experiences during adolescence in the community show additive genetic effects and partly share genetic influences with clinically-recognized psychiatric disorders, specifically schizophrenia and

  15. A genome-wide association study implicates the APOE locus in nonpathological cognitive ageing.

    Science.gov (United States)

    Davies, G; Harris, S E; Reynolds, C A; Payton, A; Knight, H M; Liewald, D C; Lopez, L M; Luciano, M; Gow, A J; Corley, J; Henderson, R; Murray, C; Pattie, A; Fox, H C; Redmond, P; Lutz, M W; Chiba-Falek, O; Linnertz, C; Saith, S; Haggarty, P; McNeill, G; Ke, X; Ollier, W; Horan, M; Roses, A D; Ponting, C P; Porteous, D J; Tenesa, A; Pickles, A; Starr, J M; Whalley, L J; Pedersen, N L; Pendleton, N; Visscher, P M; Deary, I J

    2014-01-01

    Cognitive decline is a feared aspect of growing old. It is a major contributor to lower quality of life and loss of independence in old age. We investigated the genetic contribution to individual differences in nonpathological cognitive ageing in five cohorts of older adults. We undertook a genome-wide association analysis using 549 692 single-nucleotide polymorphisms (SNPs) in 3511 unrelated adults in the Cognitive Ageing Genetics in England and Scotland (CAGES) project. These individuals have detailed longitudinal cognitive data from which phenotypes measuring each individual's cognitive changes were constructed. One SNP--rs2075650, located in TOMM40 (translocase of the outer mitochondrial membrane 40 homolog)--had a genome-wide significant association with cognitive ageing (P=2.5 × 10(-8)). This result was replicated in a meta-analysis of three independent Swedish cohorts (P=2.41 × 10(-6)). An Apolipoprotein E (APOE) haplotype (adjacent to TOMM40), previously associated with cognitive ageing, had a significant effect on cognitive ageing in the CAGES sample (P=2.18 × 10(-8); females, P=1.66 × 10(-11); males, P=0.01). Fine SNP mapping of the TOMM40/APOE region identified both APOE (rs429358; P=3.66 × 10(-11)) and TOMM40 (rs11556505; P=2.45 × 10(-8)) as loci that were associated with cognitive ageing. Imputation and conditional analyses in the discovery and replication cohorts strongly suggest that this effect is due to APOE (rs429358). Functional genomic analysis indicated that SNPs in the TOMM40/APOE region have a functional, regulatory non-protein-coding effect. The APOE region is significantly associated with nonpathological cognitive ageing. The identity and mechanism of one or multiple causal variants remain unclear.

  16. Development of COS-SNP and HRM markers for high-throughput and reliable haplotype-based detection of Lr14a in durum wheat (Triticum durum Desf.).

    Science.gov (United States)

    Terracciano, Irma; Maccaferri, Marco; Bassi, Filippo; Mantovani, Paola; Sanguineti, Maria C; Salvi, Silvio; Simková, Hana; Doležel, Jaroslav; Massi, Andrea; Ammar, Karim; Kolmer, James; Tuberosa, Roberto

    2013-04-01

    Leaf rust (Puccinia triticina Eriks. & Henn.) is a major disease affecting durum wheat production. The Lr14a-resistant gene present in the durum wheat cv. Creso and its derivative cv. Colosseo is one of the best characterized leaf-rust resistance sources deployed in durum wheat breeding. Lr14a has been mapped close to the simple sequence repeat markers gwm146, gwm344 and wmc10 in the distal portion of the chromosome arm 7BL, a gene-dense region. The objectives of this study were: (1) to enrich the Lr14a region with single nucleotide polymorphisms (SNPs) and high-resolution melting (HRM)-based markers developed from conserved ortholog set (COS) genes and from sequenced Diversity Array Technology (DArT(®)) markers; (2) to further investigate the gene content and colinearity of this region with the Brachypodium and rice genomes. Ten new COS-SNP and five HRM markers were mapped within an 8.0 cM interval spanning Lr14a. Two HRM markers pinpointed the locus in an interval of HRM designed for agarose gel electrophoresis/KASPar(®) assays and high-resolution melting analysis, respectively, as well as the double-marker combinations ubw14/ubw18, ubw14/ubw35 and wPt-4038-HRM-ubw35 will be useful for germplasm haplotyping and for molecular-assisted breeding.

  17. Marker-assisted introgression of drought tolerance from wild ancestors into popular Indian rice varieties using a 7K Infinium SNP array

    Directory of Open Access Journals (Sweden)

    Ravindra Donde

    2017-10-01

    Full Text Available Recent advances in the area of genomics have led to the development of high throughput genotyping platforms that have immensely contributed to molecular breeding programs. Custom-designed single nucleotide polymorphism (SNP arrays provide an efficient, cost effective, high throughput genotyping tool for QTL/gene mapping, variety identification, marker-assisted selection, etc. In the current study, two interspecific libraries of Chromosome Segment Substitution Lines (CSSLs were evaluated under both drought and control conditions to identify lines with superior yield under drought. The CSSL libraries consisted of 48 BC4F3 lines derived from O. sativa cv. Curinga (tropical japonica x O. rufipogon, and 32 BC4F3 lines derived from O. sativa cv. Curinga (tropical japonica x O. meridionalis. The phenotypic screening of these 80 CSSLs led to the identification of three lines, MER-20, RUF-16, and RUF-44, that yielded well under drought stress. This line was backcrossed with popular rice variety of India, Swarna-Sub1 to introgress wild chromosome segments responsible for reproductive stage drought tolerance. During backcrossing, tracking of wild introgressions and monitoring of recurrent parent genome recovery was facilitated by the use of the Cornell 6K and 7K Infinium rice SNP arrays. The 6K and 7K SNP arrays assayed 5275 SNPs and 7099 SNPs, respectively, distributed across the 12 chromosomes. In our populations of (MER-20X Swarna sub1 BC2F1 lines, 1775 SNPs were polymorphic using the 6K array. The percentage of recurrent parent genome in these backcrossed lines ranged from 33-92% and the percentage of wild donor genome ranged from 8-67%. Using genotypic selection, 5% of plants were identified for further marker assisted backcrossing, based on the presence of the target donor (wild segment and maximum recovery of recurrent parent background. In the next generation, BC3F1 lines were genotyped using the 7K SNP array, which identified 2521 polymorphic SNPs

  18. Genome-wide gene-environment study identifies glutamate receptor gene GRIN2A as a Parkinson's disease modifier gene via interaction with coffee.

    OpenAIRE

    Taye H Hamza; Honglei Chen; Erin M Hill-Burns; Shannon L Rhodes; Jennifer Montimurro; Denise M Kay; Albert Tenesa; Victoria I Kusel; Patricia Sheehan; Muthukrishnan Eaaswarkhanth; Dora Yearout; Ali Samii; John W Roberts; Pinky Agarwal; Yvette Bordelon

    2011-01-01

    Our aim was to identify genes that influence the inverse association of coffee with the risk of developing Parkinson's disease (PD). We used genome-wide genotype data and lifetime caffeinated-coffee-consumption data on 1,458 persons with PD and 931 without PD from the NeuroGenetics Research Consortium (NGRC), and we performed a genome-wide association and interaction study (GWAIS), testing each SNP's main-effect plus its interaction with coffee, adjusting for sex, age, and two principal compo...

  19. A genome-wide association study of resistance to HIV infection in highly exposed uninfected individuals with hemophilia A

    Science.gov (United States)

    Lane, Jérôme; McLaren, Paul J.; Dorrell, Lucy; Shianna, Kevin V.; Stemke, Amanda; Pelak, Kimberly; Moore, Stephen; Oldenburg, Johannes; Alvarez-Roman, Maria Teresa; Angelillo-Scherrer, Anne; Boehlen, Francoise; Bolton-Maggs, Paula H.B.; Brand, Brigit; Brown, Deborah; Chiang, Elaine; Cid-Haro, Ana Rosa; Clotet, Bonaventura; Collins, Peter; Colombo, Sara; Dalmau, Judith; Fogarty, Patrick; Giangrande, Paul; Gringeri, Alessandro; Iyer, Rathi; Katsarou, Olga; Kempton, Christine; Kuriakose, Philip; Lin, Judith; Makris, Mike; Manco-Johnson, Marilyn; Tsakiris, Dimitrios A.; Martinez-Picado, Javier; Mauser-Bunschoten, Evelien; Neff, Anne; Oka, Shinichi; Oyesiku, Lara; Parra, Rafael; Peter-Salonen, Kristiina; Powell, Jerry; Recht, Michael; Shapiro, Amy; Stine, Kimo; Talks, Katherine; Telenti, Amalio; Wilde, Jonathan; Yee, Thynn Thynn; Wolinsky, Steven M.; Martinson, Jeremy; Hussain, Shehnaz K.; Bream, Jay H.; Jacobson, Lisa P.; Carrington, Mary; Goedert, James J.; Haynes, Barton F.; McMichael, Andrew J.; Goldstein, David B.; Fellay, Jacques

    2013-01-01

    Human genetic variation contributes to differences in susceptibility to HIV-1 infection. To search for novel host resistance factors, we performed a genome-wide association study (GWAS) in hemophilia patients highly exposed to potentially contaminated factor VIII infusions. Individuals with hemophilia A and a documented history of factor VIII infusions before the introduction of viral inactivation procedures (1979–1984) were recruited from 36 hemophilia treatment centers (HTCs), and their genome-wide genetic variants were compared with those from matched HIV-infected individuals. Homozygous carriers of known CCR5 resistance mutations were excluded. Single nucleotide polymorphisms (SNPs) and inferred copy number variants (CNVs) were tested using logistic regression. In addition, we performed a pathway enrichment analysis, a heritability analysis, and a search for epistatic interactions with CCR5 Δ32 heterozygosity. A total of 560 HIV-uninfected cases were recruited: 36 (6.4%) were homozygous for CCR5 Δ32 or m303. After quality control and SNP imputation, we tested 1 081 435 SNPs and 3686 CNVs for association with HIV-1 serostatus in 431 cases and 765 HIV-infected controls. No SNP or CNV reached genome-wide significance. The additional analyses did not reveal any strong genetic effect. Highly exposed, yet uninfected hemophiliacs form an ideal study group to investigate host resistance factors. Using a genome-wide approach, we did not detect any significant associations between SNPs and HIV-1 susceptibility, indicating that common genetic variants of major effect are unlikely to explain the observed resistance phenotype in this population. PMID:23372042

  20. DELISHUS: an efficient and exact algorithm for genome-wide detection of deletion polymorphism in autism

    Science.gov (United States)

    Aguiar, Derek; Halldórsson, Bjarni V.; Morrow, Eric M.; Istrail, Sorin

    2012-01-01

    Motivation: The understanding of the genetic determinants of complex disease is undergoing a paradigm shift. Genetic heterogeneity of rare mutations with deleterious effects is more commonly being viewed as a major component of disease. Autism is an excellent example where research is active in identifying matches between the phenotypic and genomic heterogeneities. A considerable portion of autism appears to be correlated with copy number variation, which is not directly probed by single nucleotide polymorphism (SNP) array or sequencing technologies. Identifying the genetic heterogeneity of small deletions remains a major unresolved computational problem partly due to the inability of algorithms to detect them. Results: In this article, we present an algorithmic framework, which we term DELISHUS, that implements three exact algorithms for inferring regions of hemizygosity containing genomic deletions of all sizes and frequencies in SNP genotype data. We implement an efficient backtracking algorithm—that processes a 1 billion entry genome-wide association study SNP matrix in a few minutes—to compute all inherited deletions in a dataset. We further extend our model to give an efficient algorithm for detecting de novo deletions. Finally, given a set of called deletions, we also give a polynomial time algorithm for computing the critical regions of recurrent deletions. DELISHUS achieves significantly lower false-positive rates and higher power than previously published algorithms partly because it considers all individuals in the sample simultaneously. DELISHUS may be applied to SNP array or sequencing data to identify the deletion spectrum for family-based association studies. Availability: DELISHUS is available at http://www.brown.edu/Research/Istrail_Lab/. Contact: Eric_Morrow@brown.edu and Sorin_Istrail@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22689755

  1. Prediction of disease and phenotype associations from genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Stephanie N Lewis

    Full Text Available Genome wide association studies (GWAS have proven useful as a method for identifying genetic variations associated with diseases. In this study, we analyzed GWAS data for 61 diseases and phenotypes to elucidate common associations based on single nucleotide polymorphisms (SNP. The study was an expansion on a previous study on identifying disease associations via data from a single GWAS on seven diseases.Adjustments to the originally reported study included expansion of the SNP dataset using Linkage Disequilibrium (LD and refinement of the four levels of analysis to encompass SNP, SNP block, gene, and pathway level comparisons. A pair-wise comparison between diseases and phenotypes was performed at each level and the Jaccard similarity index was used to measure the degree of association between two diseases/phenotypes. Disease relatedness networks (DRNs were used to visualize our results. We saw predominant relatedness between Multiple Sclerosis, type 1 diabetes, and rheumatoid arthritis for the first three levels of analysis. Expected relatedness was also seen between lipid- and blood-related traits.The predominant associations between Multiple Sclerosis, type 1 diabetes, and rheumatoid arthritis can be validated by clinical studies. The diseases have been proposed to share a systemic inflammation phenotype that can result in progression of additional diseases in patients with one of these three diseases. We also noticed unexpected relationships between metabolic and neurological diseases at the pathway comparison level. The less significant relationships found between diseases require a more detailed literature review to determine validity of the predictions. The results from this study serve as a first step towards a better understanding of seemingly unrelated diseases and phenotypes with similar symptoms or modes of treatment.

  2. Parentage Reconstruction in Eucalyptus nitens Using SNPs and Microsatellite Markers: A Comparative Analysis of Marker Data Power and Robustness.

    Directory of Open Access Journals (Sweden)

    Emily J Telfer

    -species SNP resource available with the EuCHIP60k, opens a whole new array of opportunities for high-throughput, genome-wide or targeted genotyping in species of Eucalyptus.

  3. Parentage Reconstruction in Eucalyptus nitens Using SNPs and Microsatellite Markers: A Comparative Analysis of Marker Data Power and Robustness.

    Science.gov (United States)

    Telfer, Emily J; Stovold, Grahame T; Li, Yongjun; Silva-Junior, Orzenil B; Grattapaglia, Dario G; Dungey, Heidi S

    2015-01-01

    Pedigree reconstruction using molecular markers enables efficient management of inbreeding in open-pollinated breeding strategies, replacing expensive and time-consuming controlled pollination. This is particularly useful in preferentially outcrossed, insect pollinated Eucalypts known to suffer considerable inbreeding depression from related matings. A single nucleotide polymorphism (SNP) marker panel consisting of 106 markers was selected for pedigree reconstruction from the recently developed high-density Eucalyptus Infinium SNP chip (EuCHIP60K). The performance of this SNP panel for pedigree reconstruction in open-pollinated progenies of two Eucalyptus nitens seed orchards was compared with that of two microsatellite panels with 13 and 16 markers respectively. The SNP marker panel out-performed one of the microsatellite panels in the resolution power to reconstruct pedigrees and out-performed both panels with respect to data quality. Parentage of all but one offspring in each clonal seed orchard was correctly matched to the expected seed parent using the SNP marker panel, whereas parentage assignment to less than a third of the expected seed parents were supported using the 13-microsatellite panel. The 16-microsatellite panel supported all but one of the recorded seed parents, one better than the SNP panel, although there was still a considerable level of missing and inconsistent data. SNP marker data was considerably superior to microsatellite data in accuracy, reproducibility and robustness. Although microsatellites and SNPs data provide equivalent resolution for pedigree reconstruction, microsatellite analysis requires more time and experience to deal with the uncertainties of allele calling and faces challenges for data transferability across labs and over time. While microsatellite analysis will continue to be useful for some breeding tasks due to the high information content, existing infrastructure and low operating costs, the multi-species SNP resource

  4. Non-replication study of a genome-wide association study for hypertension and blood pressure in African Americans

    Directory of Open Access Journals (Sweden)

    Kidambi Srividya

    2012-04-01

    Full Text Available Abstract Background A recent genome wide association study in 1017 African Americans identified several single nucleotide polymorphisms that reached genome-wide significance for systolic blood pressure. We attempted to replicate these findings in an independent sample of 2474 unrelated African Americans in the Milwaukee metropolitan area; 53% were women and 47% were hypertensives. Methods We evaluated sixteen top associated SNPs from the above genome wide association study for hypertension as a binary trait or blood pressure as a continuous trait. In addition, we evaluated eight single nucleotide polymorphisms located in two genes (STK-39 and CDH-13 found to be associated with systolic and diastolic blood pressures by other genome wide association studies in European and Amish populations. TaqMan MGB-based chemistry with fluorescent probes was used for genotyping. We had an adequate sample size (80% power to detect an effect size of 1.2-2.0 for all the single nucleotide polymorphisms for hypertension as a binary trait, and 1% variance in blood pressure as a continuous trait. Quantitative trait analyses were performed both by excluding and also by including subjects on anti-hypertensive therapy (after adjustments were made for anti-hypertensive medications. Results For all 24 SNPs, no statistically significant differences were noted in the minor allele frequencies between cases and controls. One SNP (rs2146204 showed borderline association (p = 0.006 with hypertension status using recessive model and systolic blood pressure (p = 0.02, but was not significant after adjusting for multiple comparisons. In quantitative trait analyses, among normotensives only, rs12748299 was associated with SBP (p = 0.002. In addition, several nominally significant associations were noted with SBP and DBP among normotensives but none were statistically significant. Conclusions This study highlights the importance of replication to confirm the validity of genome wide

  5. A genome-wide association search for type 2 diabetes genes in African Americans.

    Directory of Open Access Journals (Sweden)

    Nicholette D Palmer

    Full Text Available African Americans are disproportionately affected by type 2 diabetes (T2DM yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide Association Study (GWAS using the Affymetrix 6.0 array in 965 African-American cases with T2DM and end-stage renal disease (T2DM-ESRD and 1029 population-based controls. The most significant SNPs (n = 550 independent loci were genotyped in a replication cohort and 122 SNPs (n = 98 independent loci were further tested through genotyping three additional validation cohorts followed by meta-analysis in all five cohorts totaling 3,132 cases and 3,317 controls. Twelve SNPs had evidence of association in the GWAS (P<0.0071, were directionally consistent in the Replication cohort and were associated with T2DM in subjects without nephropathy (P<0.05. Meta-analysis in all cases and controls revealed a single SNP reaching genome-wide significance (P<2.5×10(-8. SNP rs7560163 (P = 7.0×10(-9, OR (95% CI = 0.75 (0.67-0.84 is located intergenically between RND3 and RBM43. Four additional loci (rs7542900, rs4659485, rs2722769 and rs7107217 were associated with T2DM (P<0.05 and reached more nominal levels of significance (P<2.5×10(-5 in the overall analysis and may represent novel loci that contribute to T2DM. We have identified novel T2DM-susceptibility variants in the African-American population. Notably, T2DM risk was associated with the major allele and implies an interesting genetic architecture in this population. These results suggest that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations.

  6. Investigation of Maternal Genotype Effects in Autism by Genome-Wide Association

    Science.gov (United States)

    Yuan, Han; Dougherty, Joseph D.

    2014-01-01

    Lay Abstract Autism spectrum disorders (ASDs) are pervasive developmental disorders which have both a genetic and environmental component. One source of the environmental component is the in utero (prenatal) environment. The maternal genome can potentially contribute to the risk of autism in children by altering this prenatal environment. In this study, the possibility of maternal genotype effects was explored by looking for common variants (single nucleotide polymorphisms, or SNPs) in the maternal genome associated with increased risk of autism in children. We performed a case/control genome-wide association study (GWAS) using mothers of probands as cases and either fathers of probands or normal females as controls, using two collections of families with autism. We did not identify any SNP that reached significance and thus a common variant of large effect is unlikely. However, there was evidence for the possibility of a large number of alleles each carrying a small effect. This suggested that if there is a contribution to autism risk through common-variant maternal genetic effects, it may be the result of multiple loci of small effects. We did not investigate rare variants in this study. Scientific Abstract Like most psychiatric disorders, autism spectrum disorders have both a genetic and an environmental component. While previous studies have clearly demonstrated the contribution of in utero (prenatal) environment on autism risk, most of them focused on transient environmental factors. Based on a recent sibling study, we hypothesized that environmental factors could also come from the maternal genome, which would result in persistent effects across siblings. In this study, the possibility of maternal genotype effects was examined by looking for common variants (single nucleotide polymorphisms, or SNPs) in the maternal genome associated with increased risk of autism in children. A case/control genome-wide association study (GWAS) was performed using mothers of

  7. Genetic determinants of cardiovascular events among women with migraine: a genome-wide association study.

    Directory of Open Access Journals (Sweden)

    Markus Schürks

    Full Text Available Migraine is associated with an increased risk for cardiovascular disease (CVD. Both migraine and CVD are highly heritable. However, the genetic liability for CVD among migraineurs is unclear.We performed a genome-wide association study for incident CVD events during 12 years of follow-up among 5,122 migraineurs participating in the population-based Women's Genome Health Study. Migraine was self-reported and CVD events were confirmed after medical records review. We calculated odds ratios (OR and 95% confidence intervals (CI and considered a genome-wide p-value <5×10(-8 as significant.Among the 5,122 women with migraine 164 incident CVD events occurred during follow-up. No SNP was associated with major CVD, ischemic stroke, myocardial infarction, or CVD death at the genome-wide level; however, five SNPs showed association with p<5×10(-6. Among migraineurs with aura rs7698623 in MEPE (OR = 6.37; 95% CI 3.15-12.90; p = 2.7×10(-7 and rs4975709 in IRX4 (OR = 5.06; 95% CI 2.66-9.62; p = 7.7×10(-7 appeared to be associated with ischemic stroke, rs2143678 located close to MDF1 with major CVD (OR = 3.05; 95% CI 1.98-4.69; p = 4.3×10(-7, and the intergenic rs1406961 with CVD death (OR = 12.33; 95% CI 4.62-32.87; p = 5.2×10(-7. Further, rs1047964 in BACE1 appeared to be associated with CVD death among women with any migraine (OR = 4.67; 95% CI 2.53-8.62; p = 8.0×10(-7.Our results provide some suggestion for an association of five SNPs with CVD events among women with migraine; none of the results was genome-wide significant. Four associations appeared among migraineurs with aura, two of those with ischemic stroke. Although our population is among the largest with migraine and incident CVD information, these results must be treated with caution, given the limited number of CVD events among women with migraine and the low minor allele frequencies for three of the SNPs. Our results await independent replication

  8. A mega-analysis of genome-wide association studies for major depressive disorder.

    Science.gov (United States)

    Ripke, Stephan; Wray, Naomi R; Lewis, Cathryn M; Hamilton, Steven P; Weissman, Myrna M; Breen, Gerome; Byrne, Enda M; Blackwood, Douglas H R; Boomsma, Dorret I; Cichon, Sven; Heath, Andrew C; Holsboer, Florian; Lucae, Susanne; Madden, Pamela A F; Martin, Nicholas G; McGuffin, Peter; Muglia, Pierandrea; Noethen, Markus M; Penninx, Brenda P; Pergadia, Michele L; Potash, James B; Rietschel, Marcella; Lin, Danyu; Müller-Myhsok, Bertram; Shi, Jianxin; Steinberg, Stacy; Grabe, Hans J; Lichtenstein, Paul; Magnusson, Patrik; Perlis, Roy H; Preisig, Martin; Smoller, Jordan W; Stefansson, Kari; Uher, Rudolf; Kutalik, Zoltan; Tansey, Katherine E; Teumer, Alexander; Viktorin, Alexander; Barnes, Michael R; Bettecken, Thomas; Binder, Elisabeth B; Breuer, René; Castro, Victor M; Churchill, Susanne E; Coryell, William H; Craddock, Nick; Craig, Ian W; Czamara, Darina; De Geus, Eco J; Degenhardt, Franziska; Farmer, Anne E; Fava, Maurizio; Frank, Josef; Gainer, Vivian S; Gallagher, Patience J; Gordon, Scott D; Goryachev, Sergey; Gross, Magdalena; Guipponi, Michel; Henders, Anjali K; Herms, Stefan; Hickie, Ian B; Hoefels, Susanne; Hoogendijk, Witte; Hottenga, Jouke Jan; Iosifescu, Dan V; Ising, Marcus; Jones, Ian; Jones, Lisa; Jung-Ying, Tzeng; Knowles, James A; Kohane, Isaac S; Kohli, Martin A; Korszun, Ania; Landen, Mikael; Lawson, William B; Lewis, Glyn; Macintyre, Donald; Maier, Wolfgang; Mattheisen, Manuel; McGrath, Patrick J; McIntosh, Andrew; McLean, Alan; Middeldorp, Christel M; Middleton, Lefkos; Montgomery, Grant M; Murphy, Shawn N; Nauck, Matthias; Nolen, Willem A; Nyholt, Dale R; O'Donovan, Michael; Oskarsson, Högni; Pedersen, Nancy; Scheftner, William A; Schulz, Andrea; Schulze, Thomas G; Shyn, Stanley I; Sigurdsson, Engilbert; Slager, Susan L; Smit, Johannes H; Stefansson, Hreinn; Steffens, Michael; Thorgeirsson, Thorgeir; Tozzi, Federica; Treutlein, Jens; Uhr, Manfred; van den Oord, Edwin J C G; Van Grootheest, Gerard; Völzke, Henry; Weilburg, Jeffrey B; Willemsen, Gonneke; Zitman, Frans G; Neale, Benjamin; Daly, Mark; Levinson, Douglas F; Sullivan, Patrick F

    2013-04-01

    Prior genome-wide association studies (GWAS) of major depressive disorder (MDD) have met with limited success. We sought to increase statistical power to detect disease loci by conducting a GWAS mega-analysis for MDD. In the MDD discovery phase, we analyzed more than 1.2 million autosomal and X chromosome single-nucleotide polymorphisms (SNPs) in 18 759 independent and unrelated subjects of recent European ancestry (9240 MDD cases and 9519 controls). In the MDD replication phase, we evaluated 554 SNPs in independent samples (6783 MDD cases and 50 695 controls). We also conducted a cross-disorder meta-analysis using 819 autosomal SNPs with P<0.0001 for either MDD or the Psychiatric GWAS Consortium bipolar disorder (BIP) mega-analysis (9238 MDD cases/8039 controls and 6998 BIP cases/7775 controls). No SNPs achieved genome-wide significance in the MDD discovery phase, the MDD replication phase or in pre-planned secondary analyses (by sex, recurrent MDD, recurrent early-onset MDD, age of onset, pre-pubertal onset MDD or typical-like MDD from a latent class analyses of the MDD criteria). In the MDD-bipolar cross-disorder analysis, 15 SNPs exceeded genome-wide significance (P<5 × 10(-8)), and all were in a 248 kb interval of high LD on 3p21.1 (chr3:52 425 083-53 822 102, minimum P=5.9 × 10(-9) at rs2535629). Although this is the largest genome-wide analysis of MDD yet conducted, its high prevalence means that the sample is still underpowered to detect genetic effects typical for complex traits. Therefore, we were unable to identify robust and replicable findings. We discuss what this means for genetic research for MDD. The 3p21.1 MDD-BIP finding should be interpreted with caution as the most significant SNP did not replicate in MDD samples, and genotyping in independent samples will be needed to resolve its status.

  9. Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.

    Science.gov (United States)

    Nguyen, Thanh-Tung; Huang, Joshua; Wu, Qingyao; Nguyen, Thuy; Li, Mark

    2015-01-01

    Single-nucleotide polymorphisms (SNPs) selection and identification are the most important tasks in Genome-wide association data analysis. The problem is difficult because genome-wide association data is very high dimensional and a large portion of SNPs in the data is irrelevant to the disease. Advanced machine learning methods have been successfully used in Genome-wide association studies (GWAS) for identification of genetic variants that have relatively big effects in some common, complex diseases. Among them, the most successful one is Random Forests (RF). Despite of performing well in terms of prediction accuracy in some data sets with moderate size, RF still suffers from working in GWAS for selecting informative SNPs and building accurate prediction models. In this paper, we propose to use a new two-stage quality-based sampling method in random forests, named ts-RF, for SNP subspace selection for GWAS. The method first applies p-value assessment to find a cut-off point that separates informative and irrelevant SNPs in two groups. The informative SNPs group is further divided into two sub-groups: highly informative and weak informative SNPs. When sampling the SNP subspace for building trees for the forest, only those SNPs from the two sub-groups are taken into account. The feature subspaces always contain highly informative SNPs when used to split a node at a tree. This approach enables one to generate more accurate trees with a lower prediction error, meanwhile possibly avoiding overfitting. It allows one to detect interactions of multiple SNPs with the diseases, and to reduce the dimensionality and the amount of Genome-wide association data needed for learning the RF model. Extensive experiments on two genome-wide SNP data sets (Parkinson case-control data comprised of 408,803 SNPs and Alzheimer case-control data comprised of 380,157 SNPs) and 10 gene data sets have demonstrated that the proposed model significantly reduced prediction errors and outperformed

  10. Genome-wide distribution of genetic diversity and linkage disequilibrium in a mass-selected population of maritime pine

    Science.gov (United States)

    2014-01-01

    Background The accessibility of high-throughput genotyping technologies has contributed greatly to the development of genomic resources in non-model organisms. High-density genotyping arrays have only recently been developed for some economically important species such as conifers. The potential for using genomic technologies in association mapping and breeding depends largely on the genome wide patterns of diversity and linkage disequilibrium in current breeding populations. This study aims to deepen our knowledge regarding these issues in maritime pine, the first species used for reforestation in south western Europe. Results Using a new map merging algorithm, we first established a 1,712 cM composite linkage map (comprising 1,838 SNP markers in 12 linkage groups) by bringing together three already available genetic maps. Using rigorous statistical testing based on kernel density estimation and resampling we identified cold and hot spots of recombination. In parallel, 186 unrelated trees of a mass-selected population were genotyped using a 12k-SNP array. A total of 2,600 informative SNPs allowed to describe historical recombination, genetic diversity and genetic structure of this recently domesticated breeding pool that forms the basis of much of the current and future breeding of this species. We observe very low levels of population genetic structure and find no evidence that artificial selection has caused a reduction in genetic diversity. By combining these two pieces of information, we provided the map position of 1,671 SNPs corresponding to 1,192 different loci. This made it possible to analyze the spatial pattern of genetic diversity (H e ) and long distance linkage disequilibrium (LD) along the chromosomes. We found no particular pattern in the empirical variogram of H e across the 12 linkage groups and, as expected for an outcrossing species with large effective population size, we observed an almost complete lack of long distance LD. Conclusions These

  11. Protein Interaction-Based Genome-Wide Analysis of Incident Coronary Heart Disease

    DEFF Research Database (Denmark)

    Jensen, Majken Karoline; Pers, Tune Hannes; Dworzynski, Piotr

    2011-01-01

    in genes associated with risk of coronary heart disease (CHD). Methods and Results-Genome-wide association analyses of approximately approximate to 700 000 single-nucleotide polymorphisms in 899 incident CHD cases and 1823 age-and sex-matched controls within the Nurses' Health and the Health Professionals...... complex. Conclusions-The integration of a GWA study with PPI data successfully identifies a set of candidate susceptibility genes for incident CHD that would have been missed in single-marker GWA analysis. (Circ Cardiovasc Genet. 2011; 4:549-556.)...

  12. A genome-wide association study of corneal astigmatism: The CREAM Consortium

    OpenAIRE

    Shah, Rupal L.; Li, Qing; Zhao, Wanting; Tedja, Milly S.; Tideman, J. Willem L.; Khawaja, Anthony P.; Fan, Qiao; Yazar, Seyhan; Williams, Katie M.; Verhoeven, Virginie J.M.; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J.

    2018-01-01

    Purpose To identify genes and genetic markers associated with corneal astigmatism. Methods: A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohorts we...

  13. A genome-wide association study of corneal astigmatism: The CREAM Consortium

    OpenAIRE

    Shah, Rupal L.; Li, Qing; Zhao, Wanting; Tedja, Milly S.; Tideman, J. Willem L.; Khawaja, Anthony P.; Fan, Qiao; Yazar, Seyhan; Williams, Katie M.; Verhoeven, Virginie J.M.; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J.

    2018-01-01

    Purpose To identify genes and genetic markers associated with corneal astigmatism. Methods A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohorts wer...

  14. A genome-wide association study of corneal astigmatism: The CREAM Consortium.

    OpenAIRE

    Shah, Rupal L; Li, Qing; Zhao, Wanting; Tedja, Milly S; Tideman, J Willem L; Khawaja, Anthony P; Fan, Qiao; Yazar, Seyhan; Williams, Katie M; Verhoeven, Virginie J M; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J

    2018-01-01

    Purpose To identify genes and genetic markers associated with corneal astigmatism. Methods A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry...

  15. A genome-wide association study of corneal astigmatism : The CREAM Consortium

    OpenAIRE

    Shah, Rupal L.; Li, Qing; Zhao, Wanting; Tedja, Milly S.; Tideman, J. Willem L.; Khawaja, Anthony P.; Fan, Qiao; Yazar, Seyhan; Williams, Katie M.; Verhoeven, Virginie J.M.; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J.

    2018-01-01

    Purpose: To identify genes and genetic markers associated with corneal astigmatism. Methods: A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohor...

  16. A genome-wide association study of corneal astigmatism:The CREAM consortium

    OpenAIRE

    Shah, Rupal L.; Li, Qing; Zhao, Wanting; Tedja, Milly S.; Tideman, J. Willem L.; Khawaja, Anthony P.; Fan, Qiao; Yazar, Seyhan; Williams, Katie M.; Verhoeven, Virginie J.M.; Xie, Jing; Wang, Ya Xing; Hess, Moritz; Nickels, Stefan; Lackner, Karl J.

    2018-01-01

    Purpose: To identify genes and genetic markers associated with corneal astigmatism. Methods: A meta-analysis of genome-wide association studies (GWASs) of corneal astigmatism undertaken for 14 European ancestry (n=22,250) and 8 Asian ancestry (n=9,120) cohorts was performed by the Consortium for Refractive Error and Myopia. Cases were defined as having >0.75 diopters of corneal astigmatism. Subsequent gene-based and gene-set analyses of the meta-analyzed results of European ancestry cohort...

  17. Prospecting sugarcane resistance to Sugarcane yellow leaf virus by genome-wide association.

    Science.gov (United States)

    Debibakas, S; Rocher, S; Garsmeur, O; Toubi, L; Roques, D; D'Hont, A; Hoarau, J-Y; Daugrois, J H

    2014-08-01

    Using GWAS approaches, we detected independent resistant markers in sugarcane towards a vectored virus disease. Based on comparative genomics, several candidate genes potentially involved in virus/aphid/plant interactions were pinpointed. Yellow leaf of sugarcane is an emerging viral disease whose causal agent is a Polerovirus, the Sugarcane yellow leaf virus (SCYLV) transmitted by aphids. To identify quantitative trait loci controlling resistance to yellow leaf which are of direct relevance for breeding, we undertook a genome-wide association study (GWAS) on a sugarcane cultivar panel (n = 189) representative of current breeding germplasm. This panel was fingerprinted with 3,949 polymorphic markers (DArT and AFLP). The panel was phenotyped for SCYLV infection in leaves and stalks in two trials for two crop cycles, under natural disease pressure prevalent in Guadeloupe. Mixed linear models including co-factors representing population structure fixed effects and pairwise kinship random effects provided an efficient control of the risk of inflated type-I error at a genome-wide level. Six independent markers were significantly detected in association with SCYLV resistance phenotype. These markers explained individually between 9 and 14 % of the disease variation of the cultivar panel. Their frequency in the panel was relatively low (8-20 %). Among them, two markers were detected repeatedly across the GWAS exercises based on the different disease resistance parameters. These two markers could be blasted on Sorghum bicolor genome and candidate genes potentially involved in plant-aphid or plant-virus interactions were localized in the vicinity of sorghum homologs of sugarcane markers. Our results illustrate the potential of GWAS approaches to prospect among sugarcane germplasm for accessions likely bearing resistance alleles of significant effect useful in breeding programs.

  18. Leaf Transcriptome Sequencing for Identifying Genic-SSR Markers and SNP Heterozygosity in Crossbred Mango Variety ‘Amrapali’ (Mangifera indica L.)

    Science.gov (United States)

    Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar

    2016-01-01

    Mango (Mangifera indica L.) is called “king of fruits” due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties ‘Neelam’, ‘Dashehari’ and their hybrid ‘Amrapali’ using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango. PMID:27736892

  19. Genome wide analysis of drug-induced torsades de pointes: lack of common variants with large effect sizes.

    Directory of Open Access Journals (Sweden)

    Elijah R Behr

    Full Text Available Marked prolongation of the QT interval on the electrocardiogram associated with the polymorphic ventricular tachycardia Torsades de Pointes is a serious adverse event during treatment with antiarrhythmic drugs and other culprit medications, and is a common cause for drug relabeling and withdrawal. Although clinical risk factors have been identified, the syndrome remains unpredictable in an individual patient. Here we used genome-wide association analysis to search for common predisposing genetic variants. Cases of drug-induced Torsades de Pointes (diTdP, treatment tolerant controls, and general population controls were ascertained across multiple sites using common definitions, and genotyped on the Illumina 610k or 1M-Duo BeadChips. Principal Components Analysis was used to select 216 Northwestern European diTdP cases and 771 ancestry-matched controls, including treatment-tolerant and general population subjects. With these sample sizes, there is 80% power to detect a variant at genome-wide significance with minor allele frequency of 10% and conferring an odds ratio of ≥2.7. Tests of association were carried out for each single nucleotide polymorphism (SNP by logistic regression adjusting for gender and population structure. No SNP reached genome wide-significance; the variant with the lowest P value was rs2276314, a non-synonymous coding variant in C18orf21 (p  =  3×10(-7, odds ratio = 2, 95% confidence intervals: 1.5-2.6. The haplotype formed by rs2276314 and a second SNP, rs767531, was significantly more frequent in controls than cases (p  =  3×10(-9. Expanding the number of controls and a gene-based analysis did not yield significant associations. This study argues that common genomic variants do not contribute importantly to risk for drug-induced Torsades de Pointes across multiple drugs.

  20. Clinical implication of genome-wide profiling in diffuse large B-cell lymphoma and other subtypes of B-cell lymphoma

    DEFF Research Database (Denmark)

    Iqbal, Javeed; Joshi, Shantaram; Patel, Kavita N

    2007-01-01

    of Lymphoid Neoplasms (REAL) and World Health Organization (WHO) classifications. These classification methods were based on histological, immunophenotypic and cytogenetic markers and widely accepted by pathologists and oncologists worldwide. During last several decades, great progress has been made...... technology. The genome-wide transcriptional measurement, also called gene expression profile (GEP) can accurately define the biological phenotype of the tumor. In this review, important discoveries made by genome-wide GEP in understanding the biology of lymphoma and additionally the diagnostic and prognostic...

  1. Genome-wide association studies and resting heart rate

    DEFF Research Database (Denmark)

    Oskari Kilpeläinen, Tuomas

    2016-01-01

    Genome-wide association studies (GWASs) have revolutionized the search for genetic variants regulating resting heart rate. In the last 10 years, GWASs have led to the identification of at least 21 novel heart rate loci. These discoveries have provided valuable insights into the mechanisms...... and pathways that regulate heart rate and link heart rate to cardiovascular morbidity and mortality. GWASs capture majority of genetic variation in a population sample by utilizing high-throughput genotyping chips measuring genotypes for up to several millions of SNPs across the genome in thousands...... of individuals. This allows the identification of the strongest heart rate associated signals at genome-wide level. While GWASs provide robust statistical evidence of the association of a given genetic locus with heart rate, they are only the starting point for detailed follow-up studies to locate the causal...

  2. Detection of Hereditary 1,25-Hydroxyvitamin D-Resistant Rickets Caused by Uniparental Disomy of Chromosome 12 Using Genome-Wide Single Nucleotide Polymorphism Array.

    Directory of Open Access Journals (Sweden)

    Mayuko Tamura

    Full Text Available Hereditary 1,25-dihydroxyvitamin D-resistant rickets (HVDRR is an autosomal recessive disease caused by biallelic mutations in the vitamin D receptor (VDR gene. No patients have been reported with uniparental disomy (UPD.Using genome-wide single nucleotide polymorphism (SNP array to confirm whether HVDRR was caused by UPD of chromosome 12.A 2-year-old girl with alopecia and short stature and without any family history of consanguinity was diagnosed with HVDRR by typical laboratory data findings and clinical features of rickets. Sequence analysis of VDR was performed, and the origin of the homozygous mutation was investigated by target SNP sequencing, short tandem repeat analysis, and genome-wide SNP array.The patient had a homozygous p.Arg73Ter nonsense mutation. Her mother was heterozygous for the mutation, but her father was negative. We excluded gross deletion of the father's allele or paternal discordance. Genome-wide SNP array of the family (the patient and her parents showed complete maternal isodisomy of chromosome 12. She was successfully treated with high-dose oral calcium.This is the first report of HVDRR caused by UPD, and the third case of complete UPD of chromosome 12, in the published literature. Genome-wide SNP array was useful for detecting isodisomy and the parental origin of the allele. Comprehensive examination of the homozygous state is essential for accurate genetic counseling of recurrence risk and appropriate monitoring for other chromosome 12 related disorders. Furthermore, oral calcium therapy was effective as an initial treatment for rickets in this instance.

  3. Genome-Wide Detection and Analysis of Multifunctional Genes

    Science.gov (United States)

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-01-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  4. Genome-wide DNA polymorphism analyses using VariScan

    Directory of Open Access Journals (Sweden)

    Vilella Albert J

    2006-09-01

    Full Text Available Abstract Background DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. Results We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i exhaustive population-genetic analyses including those based on the coalescent theory; ii analysis adapted to the shallow data generated by the high-throughput genome projects; iii use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v visualization of the results integrated with current genome annotations in commonly available genome browsers. Conclusion VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data.

  5. Genome-Wide Association Mapping of Seed Coat Color in Brassica napus.

    Science.gov (United States)

    Wang, Jia; Xian, Xiaohua; Xu, Xinfu; Qu, Cunmin; Lu, Kun; Li, Jiana; Liu, Liezhao

    2017-07-05

    Seed coat color is an extremely important breeding characteristic of Brassica napus. To elucidate the factors affecting the genetic architecture of seed coat color, a genome-wide association study (GWAS) of seed coat color was conducted with a diversity panel comprising 520 B. napus cultivars and inbred lines. In total, 22 single-nucleotide polymorphisms (SNPs) distributed on 7 chromosomes were found to be associated with seed coat color. The most significant SNPs were found in 2014 near Bn-scaff_15763_1-p233999, only 43.42 kb away from BnaC06g17050D, which is orthologous to Arabidopsis thaliana TRANSPARENT TESTA 12 (TT12), an important gene involved in the transportation of proanthocyanidin precursors into the vacuole. Two of eight repeatedly detected SNPs can be identified and digested by restriction enzymes. Candidate gene mining revealed that the relevant regions of significant SNP loci on the A09 and C08 chromosomes are highly homologous. Moreover, a comparison of the GWAS results to those of previous quantitative trait locus (QTL) studies showed that 11 SNPs were located in the confidence intervals of the QTLs identified in previous studies based on linkage analyses or association mapping. Our results provide insights into the genetic basis of seed coat color in B. napus, and the beneficial allele, SNP information, and candidate genes should be useful for selecting yellow seeds in B. napus breeding.

  6. Genome-wide scan in Hispanics highlights candidate loci for brain white matter hyperintensities.

    Science.gov (United States)

    Beecham, Ashley; Dong, Chuanhui; Wright, Clinton B; Dueker, Nicole; Brickman, Adam M; Wang, Liyong; DeCarli, Charles; Blanton, Susan H; Rundek, Tatjana; Mayeux, Richard; Sacco, Ralph L

    2017-10-01

    To investigate genetic variants influencing white matter hyperintensities (WMHs) in the understudied Hispanic population. Using 6.8 million single nucleotide polymorphisms (SNPs), we conducted a genome-wide association study (GWAS) to identify SNPs associated with WMH volume (WMHV) in 922 Hispanics who underwent brain MRI as a cross-section of 2 community-based cohorts in the Northern Manhattan Study and the Washington Heights-Inwood Columbia Aging Project. Multiple linear modeling with PLINK was performed to examine the additive genetic effects on ln(WMHV) after controlling for age, sex, total intracranial volume, and principal components of ancestry. Gene-based tests of association were performed using VEGAS. Replication was performed in independent samples of Europeans, African Americans, and Asians. From the SNP analysis, a total of 17 independent SNPs in 7 genes had suggestive evidence of association with WMHV in Hispanics ( p < 1 × 10 -5 ) and 5 genes from the gene-based analysis with p < 1 × 10 -3 . One SNP (rs9957475 in GATA6 ) and 1 gene ( UBE2C ) demonstrated evidence of association ( p < 0.05) in the African American sample. Four SNPs with p < 1 × 10 -5 were shown to affect binding of SPI1 using RegulomeDB. This GWAS of 2 community-based Hispanic cohorts revealed several novel WMH-associated genetic variants. Further replication is needed in independent Hispanic samples to validate these suggestive associations, and fine mapping is needed to pinpoint causal variants.

  7. Imputation and quality control steps for combining multiple genome-wide datasets

    Directory of Open Access Journals (Sweden)

    Shefali S Verma

    2014-12-01

    Full Text Available The electronic MEdical Records and GEnomics (eMERGE network brings together DNA biobanks linked to electronic health records (EHRs from multiple institutions. Approximately 52,000 DNA samples from distinct individuals have been genotyped using genome-wide SNP arrays across the nine sites of the network. The eMERGE Coordinating Center and the Genomics Workgroup developed a pipeline to impute and merge genomic data across the different SNP arrays to maximize sample size and power to detect associations with a variety of clinical endpoints. The 1000 Genomes cosmopolitan reference panel was used for imputation. Imputation results were evaluated using the following metrics: accuracy of imputation, allelic R2 (estimated correlation between the imputed and true genotypes, and the relationship between allelic R2 and minor allele frequency. Computation time and memory resources required by two different software packages (BEAGLE and IMPUTE2 were also evaluated. A number of challenges were encountered due to the complexity of using two different imputation software packages, multiple ancestral populations, and many different genotyping platforms. We present lessons learned and describe the pipeline implemented here to impute and merge genomic data sets. The eMERGE imputed dataset will serve as a valuable resource for discovery, leveraging the clinical data that can be mined from the EHR.

  8. Assessment of heterogeneity between European Populations: a Baltic and Danish replication case-control study of SNPs from a recent European ulcerative colitis genome wide association study

    DEFF Research Database (Denmark)

    Andersen, Vibeke; Ernst, Anja; Sventoraityte, Jurgita

    2011-01-01

    the combined Baltic, Danish, and Norwegian panel versus the combined German, British, Belgian, and Greek panel (rs7520292 (P = 0.001), rs12518307 (P = 0.007), and rs2395609 (TCP11) (P = 0.01), respectively). No SNP reached genome-wide significance in the combined analyses of all the panels. Conclusions......Background: Differences in the genetic architecture of inflammatory bowel disease between different European countries and ethnicities have previously been reported. In the present study, we wanted to assess the role of 11 newly identified UC risk variants, derived from a recent European UC genome...... wide association study (GWAS) (Franke et al., 2010), for 1) association with UC in the Nordic countries, 2) for population heterogeneity between the Nordic countries and the rest of Europe, and, 3) eventually, to drive some of the previous findings towards overall genome-wide significance. Methods...

  9. Genome-wide association study identifies genetic loci associated with iron deficiency.

    Directory of Open Access Journals (Sweden)

    Christine E McLaren

    2011-03-01

    Full Text Available The existence of multiple inherited disorders of iron metabolism in man, rodents and other vertebrates suggests genetic contributions to iron deficiency. To identify new genomic locations associated with iron deficiency, a genome-wide association study (GWAS was performed using DNA collected from white men aged≥25 y and women≥50 y in the Hemochromatosis and Iron Overload Screening (HEIRS Study with serum ferritin (SF≤12 µg/L (cases and iron replete controls (SF>100 µg/L in men, SF>50 µg/L in women. Regression analysis was used to examine the association between case-control status (336 cases, 343 controls and quantitative serum iron measures and 331,060 single nucleotide polymorphism (SNP genotypes, with replication analyses performed in a sample of 71 cases and 161 controls from a population of white male and female veterans screened at a US Veterans Affairs (VA medical center. Five SNPs identified in the GWAS met genome-wide statistical significance for association with at least one iron measure, rs2698530 on chr. 2p14; rs3811647 on chr. 3q22, a known SNP in the transferrin (TF gene region; rs1800562 on chr. 6p22, the C282Y mutation in the HFE gene; rs7787204 on chr. 7p21; and rs987710 on chr. 22q11 (GWAS observed P<1.51×10(-7 for all. An association between total iron binding capacity and SNP rs3811647 in the TF gene (GWAS observed P=7.0×10(-9, corrected P=0.012 was replicated within the VA samples (observed P=0.012. Associations with the C282Y mutation in the HFE gene also were replicated. The joint analysis of the HEIRS and VA samples revealed strong associations between rs2698530 on chr. 2p14 and iron status outcomes. These results confirm a previously-described TF polymorphism and implicate one potential new locus as a target for gene identification.

  10. Genome wide linkage disequilibrium in Chinese asparagus bean (Vigna. unguiculata ssp. sesquipedialis) germplasm: implications for domestication history and genome wide association studies.

    Science.gov (United States)

    Xu, P; Wu, X; Wang, B; Luo, J; Liu, Y; Ehlers, J D; Close, T J; Roberts, P A; Lu, Z; Wang, S; Li, G

    2012-07-01

    Association mapping of important traits of crop plants relies on first understanding the extent and patterns of linkage disequilibrium (LD) in the particular germplasm being investigated. We characterize here the genetic diversity, population structure and genome wide LD patterns in a set of asparagus bean (Vigna. unguiculata ssp. sesquipedialis) germplasm from China. A diverse collection of 99 asparagus bean and normal cowpea accessions were genotyped with 1127 expressed sequence tag-derived single nucleotide polymorphism markers (SNPs). The proportion of polymorphic SNPs across the collection was relatively low (39%), with an average number of SNPs per locus of 1.33. Bayesian population structure analysis indicated two subdivisions within the collection sampled that generally represented the 'standard vegetable' type (subgroup SV) and the 'non-standard vegetable' type (subgroup NSV), respectively. Level of LD (r(2)) was higher and extent of LD persisted longer in subgroup SV than in subgroup NSV, whereas LD decayed rapidly (0-2 cM) in both subgroups. LD decay distance varied among chromosomes, with the longest (≈ 5 cM) five times longer than the shortest (≈ 1 cM). Partitioning of LD variance into within- and between-subgroup components coupled with comparative LD decay analysis suggested that linkage group 5, 7 and 10 may have undergone the most intensive epistatic selection toward traits favorable for vegetable use. This work provides a first population genetic insight into domestication history of asparagus bean and demonstrates the feasibility of mapping complex traits by genome wide association study in asparagus bean using a currently available cowpea SNPs marker platform.

  11. QTL Mapping of Adult-Plant Resistance to Leaf Rust in the Wheat Cross Zhou 8425B/Chinese Spring Using High-Density SNP Markers

    Directory of Open Access Journals (Sweden)

    Peipei Zhang

    2017-05-01

    Full Text Available Wheat leaf rust is an important disease worldwide. Growing resistant cultivars is an effective means to control the disease. In the present study, 244 recombinant inbred lines from Zhou 8425B/Chinese Spring cross were phenotyped for leaf rust severities during the 2011–2012, 2012–2013, 2013–2014, and 2014–2015 cropping seasons at Baoding, Hebei province, and 2012–2013 and 2013–2014 cropping seasons in Zhoukou, Henan province. The population was genotyped using the high-density Illumina iSelect 90K SNP assay and SSR markers. Inclusive composite interval mapping identified eight QTL, designated as QLr.hebau-2AL, QLr.hebau-2BS, QLr.hebau-3A, QLr.hebau-3BS, QLr.hebau-4AL, QLr.hebau-4B, QLr.hebau-5BL, and QLr.hebau-7DS, respectively. QLr.hebau-2BS, QLr.hebau-3A, QLr.hebau-3BS, and QLr.hebau-5BL were derived from Zhou 8425B, whereas the other four were from Chinese Spring. Three stable QTL on chromosomes 2BS, 4B and 7DS explained 7.5–10.6%, 5.5–24.4%, and 11.2–20.9% of the phenotypic variance, respectively. QLr.hebau-2BS in Zhou 8425B might be the same as LrZH22 in Zhoumai 22; QLr.hebau-4B might be the residual resistance of Lr12, and QLr.hebau-7DS is Lr34. QLr.hebau-2AL, QLr.hebau-3BS, QLr.hebau-4AL, and QLr.hebau-5BL are likely to be novel QTL for leaf rust. These QTL and their closely linked SNP and SSR markers can be used for fine mapping, candidate gene discovery, and marker-assisted selection in wheat breeding.

  12. Genome-wide SNPs reveal the drivers of gene flow in an urban population of the Asian Tiger Mosquito, Aedes albopictus.

    Science.gov (United States)

    Schmidt, Thomas L; Rašić, Gordana; Zhang, Dongjing; Zheng, Xiaoying; Xi, Zhiyong; Hoffmann, Ary A

    2017-10-01

    Aedes albopictus is a highly invasive disease vector with an expanding worldwide distribution. Genetic assays using low to medium resolution markers have found little evidence of spatial genetic structure even at broad geographic scales, suggesting frequent passive movement along human transportation networks. Here we analysed genetic structure of Aedes albopictus collected from 12 sample sites in Guangzhou, China, using thousands of genome-wide single nucleotide polymorphisms (SNPs). We found evidence for passive gene flow, with distance from shipping terminals being the strongest predictor of genetic distance among mosquitoes. As further evidence of passive dispersal, we found multiple pairs of full-siblings distributed between two sample sites 3.7 km apart. After accounting for geographical variability, we also found evidence for isolation by distance, previously undetectable in Ae. albopictus. These findings demonstrate how large SNP datasets and spatially-explicit hypothesis testing can be used to decipher processes at finer geographic scales than formerly possible. Our approach can be used to help predict new invasion pathways of Ae. albopictus and to refine strategies for vector control that involve the transformation or suppression of mosquito populations.

  13. Marcadores SNP: conceitos básicos, aplicações no manejo e no melhoramento animal e perspectivas para o futuro SNP markers: basic concepts, applications in animal breeding and management and perspectives for the future

    Directory of Open Access Journals (Sweden)

    Alexandre Rodrigues Caetano

    2009-07-01

    molecular markers to characterize genetic resources and generate tools for animal breeding and management date from the end of the 80s. In the last 20 years the technologies to generate molecular data went through several innovation cycles. The last wave of technological innovations represents a true revolution, bringing methods to identify and genotype SNP (Single Nucleotide Polymorphism markers in large scale. High density DNA chips were generated to genotype from tens of thousands to hundreds of thousands of SNPs in a single assay. Furthermore, other medium density technologies allow for the genotyping of tens to hundreds of makers, in high numbers of samples, with very high speed and automation. These new technologies allowed for the generation of new applications, such as the methods to genetically evaluate and select animals based on their Genomic Value (Genomic Estimated Breeding Value - GEBV. The statistical methods for genomic evaluation and selection are in full development, but the technology already became reality with the release of the first bull summary for the Holstein breed with GEBVs for milk production and quality traits in January 2009. In addition, these technologies brought new options for development of diagnostic tests for paternity testing, individual identification, traceability, etc. Also, these new technologies to genotype SNP markers facilitated the development of outsourcing companies to generate molecular data, allowing any group to conduct advanced experiments, always using the most advanced technologies, without the need of investments into equipment.

  14. Genome-Wide Association Mapping of Crown Rust Resistance in Oat Elite Germplasm.

    Science.gov (United States)

    Klos, Kathy Esvelt; Yimer, Belayneh A; Babiker, Ebrahiem M; Beattie, Aaron D; Bonman, J Michael; Carson, Martin L; Chong, James; Harrison, Stephen A; Ibrahim, Amir M H; Kolb, Frederic L; McCartney, Curt A; McMullen, Michael; Fetch, Jennifer Mitchell; Mohammadi, Mohsen; Murphy, J Paul; Tinker, Nicholas A

    2017-07-01

    Oat crown rust, caused by f. sp. , is a major constraint to oat ( L.) production in many parts of the world. In this first comprehensive multienvironment genome-wide association map of oat crown rust, we used 2972 single-nucleotide polymorphisms (SNPs) genotyped on 631 oat lines for association mapping of quantitative trait loci (QTL). Seedling reaction to crown rust in these lines was assessed as infection type (IT) with each of 10 crown rust isolates. Adult plant reaction was assessed in the field in a total of 10 location-years as percentage severity (SV) and as infection reaction (IR) in a 0-to-1 scale. Overall, 29 SNPs on 12 linkage groups were predictive of crown rust reaction in at least one experiment at a genome-wide level of statistical significance. The QTL identified here include those in regions previously shown to be linked with seedling resistance genes , , , , , and and also with adult-plant resistance and adaptation-related QTL. In addition, QTL on linkage groups Mrg03, Mrg08, and Mrg23 were identified in regions not previously associated with crown rust resistance. Evaluation of marker genotypes in a set of crown rust differential lines supported as the identity of . The SNPs with rare alleles associated with lower disease scores may be suitable for use in marker-assisted selection of oat lines for crown rust resistance. Copyright © 2017 Crop Science Society of America.

  15. Genome-wide association study of smoking behaviors in COPD patients

    Science.gov (United States)

    Siedlinski, Mateusz; Cho, Michael H.; Bakke, Per; Gulsvik, Amund; Lomas, David A.; Anderson, Wayne; Kong, Xiangyang; Rennard, Stephen I.; Beaty, Terri H.; Hokanson, John E.; Crapo, James D.; Silverman, Edwin K.

    2012-01-01

    Background Cigarette smoking is a major risk factor for COPD and COPD severity. Previous genome-wide association studies (GWAS) have identified numerous single nucleotide polymorphisms (SNPs) associated with the number of cigarettes smoked per day (CPD) and a Dopamine Beta-Hydroxylase (DBH) locus associated with smoking cessation in multiple populations. Objective To identify SNPs associated with lifetime average and current CPD, age at smoking initiation, and smoking cessation in COPD subjects. Methods GWAS were conducted in 4 independent cohorts encompassing 3,441 ever-smoking COPD subjects (GOLD stage II or higher). Untyped SNPs were imputed using HapMap (phase II) panel. Results from all cohorts were meta-analyzed. Results Several SNPs near the HLA region on chromosome 6p21 and in an intergenic region on chromosome 2q21 showed associations with age at smoking initiation, both with the lowest p=2×10−7. No SNPs were associated with lifetime average CPD, current CPD or smoking cessation with p<10−6. Nominally significant associations with candidate SNPs within alpha-nicotinic acetylcholine receptors 3/5 (CHRNA3/CHRNA5; e.g. p=0.00011 for SNP rs1051730) and Cytochrome P450 2A6 (CYP2A6; e.g. p=2.78×10−5 for a nonsynonymous SNP rs1801272) regions were observed for lifetime average CPD, however only CYP2A6 showed evidence of significant association with current CPD. A candidate SNP (rs3025343) in the DBH was significantly (p=0.015) associated with smoking cessation. Conclusion We identified two candidate regions associated with age at smoking initiation in COPD subjects. Associations of CHRNA3/CHRNA5 and CYP2A6 loci with CPD and DBH with smoking cessation are also likely of importance in the smoking behaviors of COPD patients. PMID:21685187

  16. Searching for an Accurate Marker-Based Prediction of an Individual Quantitative Trait in Molecular Plant Breeding

    Science.gov (United States)

    Fu, Yong-Bi; Yang, Mo-Hua; Zeng, Fangqin; Biligetu, Bill

    2017-01-01

    Molecular plant breeding with the aid of molecular markers has played an important role in modern plant breeding over the last two decades. Many marker-based predictions for quantitative traits have been made to enhance parental selection, but the trait prediction accuracy remains generally low, even with the aid of dense, genome-wide SNP markers. To search for more accurate trait-specific prediction with informative SNP markers, we conducted a literature review on the prediction issues in molecular plant breeding and on the applicability of an RNA-Seq technique for developing function-associated specific trait (FAST) SNP markers. To understand whether and how FAST SNP markers could enhance trait prediction, we also performed a theoretical reasoning on the effectiveness of these markers in a trait-specific prediction, and verified the reasoning through computer simulation. To the end, the search yielded an alternative to regular genomic selection with FAST SNP markers that could be explored to achieve more accurate trait-specific prediction. Continuous search for better alternatives is encouraged to enhance marker-based predictions for an individual quantitative trait in molecular plant breeding. PMID:28729875

  17. Searching for an Accurate Marker-Based Prediction of an Individual Quantitative Trait in Molecular Plant Breeding

    Directory of Open Access Journals (Sweden)

    Yong-Bi Fu

    2017-07-01

    Full Text Available Molecular plant breeding with the aid of molecular markers has played an important role in modern plant breeding over the last two decades. Many marker-based predictions for quantitative traits have been made to enhance parental selection, but the trait prediction accuracy remains generally low, even with the aid of dense, genome-wide SNP markers. To search for more accurate trait-specific prediction with informative SNP markers, we conducted a literature review on the prediction issues in molecular plant breeding and on the applicability of an RNA-Seq technique for developing function-associated specific trait (FAST SNP markers. To understand whether and how FAST SNP markers could enhance trait prediction, we also performed a theoretical reasoning on the effectiveness of these markers in a trait-specific prediction, and verified the reasoning through computer simulation. To the end, the search yielded an alternative to regular genomic selection with FAST SNP markers that could be explored to achieve more accurate trait-specific prediction. Continuous search for better alternatives is encouraged to enhance marker-based predictions for an individual quantitative trait in molecular plant breeding.

  18. Searching for an Accurate Marker-Based Prediction of an Individual Quantitative Trait in Molecular Plant Breeding.

    Science.gov (United States)

    Fu, Yong-Bi; Yang, Mo-Hua; Zeng, Fangqin; Biligetu, Bill

    2017-01-01

    Molecular plant breeding with the aid of molecular markers has played an important role in modern plant breeding over the last two decades. Many marker-based predictions for quantitative traits have been made to enhance parental selection, but the trait prediction accuracy remains generally low, even with the aid of dense, genome-wide SNP markers. To search for more accurate trait-specific prediction with informative SNP markers, we conducted a literature review on the prediction issues in molecular plant breeding and on the applicability of an RNA-Seq technique for developing function-associated specific trait (FAST) SNP markers. To understand whether and how FAST SNP markers could enhance trait prediction, we also performed a theoretical reasoning on the effectiveness of these markers in a trait-specific prediction, and verified the reasoning through computer simulation. To the end, the search yielded an alternative to regular genomic selection with FAST SNP markers that could be explored to achieve more accurate trait-specific prediction. Continuous search for better alternatives is encouraged to enhance marker-based predictions for an individual quantitative trait in molecular plant breeding.

  19. Genome-wide association study identifies major loci for carcass weight on BTA14 in Hanwoo (Korean cattle.

    Directory of Open Access Journals (Sweden)

    Seung Hwan Lee

    Full Text Available This genome-wide association study (GWAS was conducted to identify major loci that are significantly associated with carcass weight, and their effects, in order to provide increased understanding of the genetic architecture of carcass weight in Hanwoo. This genome-wide association study identified one major chromosome region ranging from 23 Mb to 25 Mb on chromosome 14 as being associated with carcass weight in Hanwoo. Significant Bonferroni-corrected genome-wide associations (P<1.52×10(-6 were detected for 6 Single Nucleotide Polymorphic (SNP loci for carcass weight on chromosome 14. The most significant SNP was BTB-01280026 (P = 4.02×10(-11, located in the 25 Mb region on Bos taurus autosome 14 (BTA14. The other 5 significant SNPs were Hapmap27934-BTC-065223 (P = 4.04×10(-11 in 25.2 Mb, BTB-01143580 (P = 6.35×10(-11 in 24.3 Mb, Hapmap30932-BTC-011225 (P = 5.92×10(-10 in 24.8 Mb, Hapmap27112-BTC-063342 (P = 5.18×10(-9 in 25.4 Mb, and Hapmap24414-BTC-073009 (P = 7.38×10(-8 in 25.4 Mb, all on BTA 14. One SNP (BTB-01143580; P = 6.35×10(-11 lies independently from the other 5 SNPs. The 5 SNPs that lie together showed a large Linkage disequilibrium (LD block (block size of 553 kb with LD coefficients ranging from 0.53 to 0.89 within the block. The most significant SNPs accounted for 6.73% to 10.55% of additive genetic variance, which is quite a large proportion of the total additive genetic variance. The most significant SNP (BTB-01280026; P = 4.02×10(-11 had 16.96 kg of allele substitution effect, and the second most significant SNP (Hapmap27934-BTC-065223; P = 4.04×10(-11 had 18.06 kg of effect on carcass weight, which correspond to 44% and 47%, respectively, of the phenotypic standard deviation for carcass weight in Hanwoo cattle. Our results demonstrated that carcass weight was affected by a major Quantitative Trait Locus (QTL with a large effect and by many SNPs with small effects that are normally

  20. Using SNP markers to dissect linkage disequilibrium at a major quantitative trait locus for resistance to the potato cyst nematode Globodera pallida on potato chromosome V.

    Science.gov (United States)

    Achenbach, Ute; Paulo, Joao; Ilarionova, Evgenyia; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Gebhardt, Christiane

    2009-02-01

    The damage caused by the parasitic root cyst nematode Globodera pallida is a major yield-limiting factor in potato cultivation . Breeding for resistance is facilitated by the PCR-based marker 'HC', which is diagnostic for an allele conferring high resistance against G. pallida pathotype Pa2/3 that has been introgressed from the wild potato species Solanum vernei into the Solanum tuberosum tetraploid breeding pool. The major quantitative trait locus (QTL) controlling this nematode resistance maps on potato chromosome V in a hot spot for resistance to various pathogens including nematodes and the oomycete Phytophthora infestans. An unstructured sample of 79 tetraploid, highly heterozygous varieties and breeding clones was selected based on presence (41 genotypes) or absence (38 genotypes) of the HC marker. Testing the clones for resistance to G. pallida confirmed the diagnostic power of the HC marker. The 79 individuals were genotyped for 100 single nucleotide polymorphisms (SNPs) at 10 loci distributed over 38 cM on chromosome V. Forty-five SNPs at six loci spanning 2 cM in the interval between markers GP21-GP179 were associated with resistance to G. pallida. Based on linkage disequilibrium (LD) between SNP markers, six LD groups comprising between 2 and 18 SNPs were identified. The LD groups indicated the existence of multiple alleles at a single resistance locus or at several, physically linked resistance loci. LD group C comprising 18 SNPs corresponded to the 'HC' marker. LD group E included 16 SNPs and showed an association peak, which positioned one nematode resistance locus physically close to the R1 gene family.

  1. Genome-wide association study to identify common variants associated with brachial circumference: a meta-analysis of 14 cohorts.

    Directory of Open Access Journals (Sweden)

    Vesna Boraska

    Full Text Available Brachial circumference (BC, also known as upper arm or mid arm circumference, can be used as an indicator of muscle mass and fat tissue, which are distributed differently in men and women. Analysis of anthropometric measures of peripheral fat distribution such as BC could help in understanding the complex pathophysiology behind overweight and obesity. The purpose of this study is to identify genetic variants associated with BC through a large-scale genome-wide association scan (GWAS meta-analysis. We used fixed-effects meta-analysis to synthesise summary results across 14 GWAS discovery and 4 replication cohorts comprising overall 22,376 individuals (12,031 women and 10,345 men of European ancestry. Individual analyses were carried out for men, women, and combined across sexes using linear regression and an additive genetic model: adjusted for age and adjusted for age and BMI. We prioritised signals for follow-up in two-stages. We did not detect any signals reaching genome-wide significance. The FTO rs9939609 SNP showed nominal evidence for association (p<0.05 in the age-adjusted strata for men and across both sexes. In this first GWAS meta-analysis for BC to date, we have not identified any genome-wide significant signals and do not observe robust association of previously established obesity loci with BC. Large-scale collaborations will be necessary to achieve higher power to detect loci underlying BC.

  2. Sunflower Hybrid Breeding: From Markers to Genomic Selection.

    Science.gov (United States)

    Dimitrijevic, Aleksandra; Horn, Renate

    2017-01-01

    In sunflower, molecular markers for simple traits as, e.g., fertility restoration, high oleic acid content, herbicide tolerance or resistances to Plasmopara halstedii, Puccinia helianthi , or Orobanche cumana have been successfully used in marker-assisted breeding programs for years. However, agronomically important complex quantitative traits like yield, heterosis, drought tolerance, oil content or selection for disease resistance, e.g., against Sclerotinia sclerotiorum have been challenging and will require genome-wide approaches. Plant genetic resources for sunflower are being collected and conserved worldwide that represent valuable resources to study complex traits. Sunflower association panels provide the basis for genome-wide association studies, overcoming disadvantages of biparental populations. Advances in technologies and the availability of the sunflower genome sequence made novel approaches on the whole genome level possible. Genotype-by-sequencing, and whole genome sequencing based on next generation sequencing technologies facilitated the production of large amounts of SNP markers for high density maps as well as SNP arrays and allowed genome-wide association studies and genomic selection in sunflower. Genome wide or candidate gene based association studies have been performed for traits like branching, flowering time, resistance to Sclerotinia head and stalk rot. First steps in genomic selection with regard to hybrid performance and hybrid oil content have shown that genomic selection can successfully address complex quantitative traits in sunflower and will help to speed up sunflower breeding programs in the future. To make sunflower more competitive toward other oil crops higher levels of resistance against pathogens and better yield performance are required. In addition, optimizing plant architecture toward a more complex growth type for higher plant densities has the potential to considerably increase yields per hectare. Integrative approaches

  3. Sunflower Hybrid Breeding: From Markers to Genomic Selection

    Science.gov (United States)

    Dimitrijevic, Aleksandra; Horn, Renate

    2018-01-01

    In sunflower, molecular markers for simple traits as, e.g., fertility restoration, high oleic acid content, herbicide tolerance or resistances to Plasmopara halstedii, Puccinia helianthi, or Orobanche cumana have been successfully used in marker-assisted breeding programs for years. However, agronomically important complex quantitative traits like yield, heterosis, drought tolerance, oil content or selection for disease resistance, e.g., against Sclerotinia sclerotiorum have been challenging and will require genome-wide approaches. Plant genetic resources for sunflower are being collected and conserved worldwide that represent valuable resources to study complex traits. Sunflower association panels provide the basis for genome-wide association studies, overcoming disadvantages of biparental populations. Advances in technologies and the availability of the sunflower genome sequence made novel approaches on the whole genome level possible. Genotype-by-sequencing, and whole genome sequencing based on next generation sequencing technologies facilitated the production of large amounts of SNP markers for high density maps as well as SNP arrays and allowed genome-wide association studies and genomic selection in sunflower. Genome wide or candidate gene based association studies have been performed for traits like branching, flowering time, resistance to Sclerotinia head and stalk rot. First steps in genomic selection with regard to hybrid performance and hybrid oil content have shown that genomic selection can successfully address complex quantitative traits in sunflower and will help to speed up sunflower breeding programs in the future. To make sunflower more competitive toward other oil crops higher levels of resistance against pathogens and better yield performance are required. In addition, optimizing plant architecture toward a more complex growth type for higher plant densities has the potential to considerably increase yields per hectare. Integrative approaches

  4. Sunflower Hybrid Breeding: From Markers to Genomic Selection

    Directory of Open Access Journals (Sweden)

    Aleksandra Dimitrijevic

    2018-01-01

    Full Text Available In sunflower, molecular markers for simple traits as, e.g., fertility restoration, high oleic acid content, herbicide tolerance or resistances to Plasmopara halstedii, Puccinia helianthi, or Orobanche cumana have been successfully used in marker-assisted breeding programs for years. However, agronomically important complex quantitative traits like yield, heterosis, drought tolerance, oil content or selection for disease resistance, e.g., against Sclerotinia sclerotiorum have been challenging and will require genome-wide approaches. Plant genetic resources for sunflower are being collected and conserved worldwide that represent valuable resources to study complex traits. Sunflower association panels provide the basis for genome-wide association studies, overcoming disadvantages of biparental populations. Advances in technologies and the availability of the sunflower genome sequence made novel approaches on the whole genome level possible. Genotype-by-sequencing, and whole genome sequencing based on next generation sequencing technologies facilitated the production of large amounts of SNP markers for high density maps as well as SNP arrays and allowed genome-wide association studies and genomic selection in sunflower. Genome wide or candidate gene based association studies have been performed for traits like branching, flowering time, resistance to Sclerotinia head and stalk rot. First steps in genomic selection with regard to hybrid performance and hybrid oil content have shown that genomic selection can successfully address complex quantitative traits in sunflower and will help to speed up sunflower breeding programs in the future. To make sunflower more competitive toward other oil crops higher levels of resistance against pathogens and better yield performance are required. In addition, optimizing plant architecture toward a more complex growth type for higher plant densities has the potential to considerably increase yields per hectare

  5. Genome-wide association study of autistic-like traits in a general population study of young adults

    Directory of Open Access Journals (Sweden)

    Rachel Maree Jones

    2013-10-01

    Full Text Available Research has proposed that autistic-like traits in the general population lie on a continuum, with clinical Autism Spectrum Disorder (ASD representing the extreme end of this distribution. Inherent in this proposal is that biological mechanisms associated with clinical ASD may also underpin variation in autistic-like traits within the general population. A genome-wide association study using 2,462,046 single nucleotide polymorphisms (SNPs was undertaken for ASD in 965 individuals from the Western Australian Pregnancy Cohort (Raine Study. No SNP associations reached genome-wide significance (p < 5.0 x 10-8. However, investigations into nominal observed SNP associations (p < 1.0 x 10-5 add support to two positional candidate genes previously implicated in ASD aetiology, PRKCB1 and CBLN1.The rs198198 SNP (p = 9.587 x 10-6, is located within an intron of the protein kinase C, beta 1 (PRKCB1 gene on chromosome 16p11. The PRKCB1 gene has been previously reported in linkage and association studies for ASD, and its mRNA expression has been shown to be significantly down regulated in ASD cases compared with controls. The rs16946931 SNP (p = 1.78 x 10-6 is located in a region flanking the Cerebellin 1 (CBLN1 gene on chromosome 16q12.1. The CBLN1 gene is involved with synaptogenesis and is part of a gene family previously implicated in ASD. This GWA study is only the second to examine SNPs associated with autistic-like traits in the general population, and provides evidence to support roles for the PRKCB1 and CBLN1 genes in risk of clinical ASD.

  6. Genome-wide association study of young-onset hypertension in the Han Chinese population of Taiwan.

    Directory of Open Access Journals (Sweden)

    Hsin-Chou Yang

    Full Text Available Young-onset hypertension has a stronger genetic component than late-onset counterpart; thus, the identification of genes related to its susceptibility is a critical issue for the prevention and management of this disease. We carried out a two-stage association scan to map young-onset hypertension susceptibility genes. The first-stage analysis, a genome-wide association study, analyzed 175 matched case-control pairs; the second-stage analysis, a confirmatory association study, verified the results at the first stage based on a total of 1,008 patients and 1,008 controls. Single-locus association tests, multilocus association tests and pair-wise gene-gene interaction tests were performed to identify young-onset hypertension susceptibility genes. After considering stringent adjustments of multiple testing, gene annotation and single-nucleotide polymorphism (SNP quality, four SNPs from two SNP triplets with strong association signals (-log(10(p>7 and 13 SNPs from 8 interactive SNP pairs with strong interactive signals (-log(10(p>8 were carefully re-examined. The confirmatory study verified the association for a SNP quartet 219 kb and 495 kb downstream of LOC344371 (a hypothetical gene and RASGRP3 on chromosome 2p22.3, respectively. The latter has been implicated in the abnormal vascular responsiveness to endothelin-1 and angiotensin II in diabetic-hypertensive rats. Intrinsic synergy involving IMPG1 on chromosome 6q14.2-q15 was also verified. IMPG1 encodes interphotoreceptor matrix proteoglycan 1 which has cation binding capacity. The genes are novel hypertension targets identified in this first genome-wide hypertension association study of the Han Chinese population.

  7. Genome-wide association study identifies TF as a significant modifier gene of iron metabolism in HFE hemochromatosis.

    Science.gov (United States)

    de Tayrac, Marie; Roth, Marie-Paule; Jouanolle, Anne-Marie; Coppin, Hélène; le Gac, Gérald; Piperno, Alberto; Férec, Claude; Pelucchi, Sara; Scotet, Virginie; Bardou-Jacquet, Edouard; Ropert, Martine; Bouvet, Régis; Génin, Emmanuelle; Mosser, Jean; Deugnier, Yves

    2015-03-01

    Hereditary hemochromatosis (HH) is the most common form of genetic iron loading disease. It is mainly related to the homozygous C282Y/C282Y mutation in the HFE gene that is, however, a necessary but not a sufficient condition to develop clinical and even biochemical HH. This suggests that modifier genes are likely involved in the expressivity of the disease. Our aim was to identify such modifier genes. We performed a genome-wide association study (GWAS) using DNA collected from 474 unrelated C282Y homozygotes. Associations were examined for both quantitative iron burden indices and clinical outcomes with 534,213 single nucleotide polymorphisms (SNP) genotypes, with replication analyses in an independent sample of 748 C282Y homozygotes from four different European centres. One SNP met genome-wide statistical significance for association with transferrin concentration (rs3811647, GWAS p value of 7×10(-9) and replication p value of 5×10(-13)). This SNP, located within intron 11 of the TF gene, had a pleiotropic effect on serum iron (GWAS p value of 4.9×10(-6) and replication p value of 3.2×10(-6)). Both serum transferrin and iron levels were associated with serum ferritin levels, amount of iron removed and global clinical stage (pHFE-associated HH (HFE-HH) patients, identified the rs3811647 polymorphism in the TF gene as the only SNP significantly associated with iron metabolism through serum transferrin and iron levels. Because these two outcomes were clearly associated with the biochemical and clinical expression of the disease, an indirect link between the rs3811647 polymorphism and the phenotypic presentation of HFE-HH is likely. Copyright © 2014 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.

  8. SNP-PHAGE – High throughput SNP discovery pipeline

    Directory of Open Access Journals (Sweden)

    Cregan Perry B

    2006-10-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs as defined here are single base sequence changes or short insertion/deletions between or within individuals of a given species. As a result of their abundance and the availability of high throughput analysis technologies SNP markers have begun to replace other traditional markers such as restriction fragment length polymorphisms (RFLPs, amplified fragment length polymorphisms (AFLPs and simple sequence repeats (SSRs or microsatellite markers for fine mapping and association studies in several species. For SNP discovery from chromatogram data, several bioinformatics programs have to be combined to generate an analysis pipeline. Results have to be stored in a relational database to facilitate interrogation through queries or to generate data for further analyses such as determination of linkage disequilibrium and identification of common haplotypes. Although these tasks are routinely performed by several groups, an integrated open source SNP discovery pipeline that can be easily adapted by new groups interested in SNP marker development is currently unavailable. Results We developed SNP-PHAGE (SNP discovery Pipeline with additional features for identification of common haplotypes within a sequence tagged site (Haplotype Analysis and GenBank (-dbSNP submissions. This tool was applied for analyzing sequence traces from diverse soybean genotypes to discover over 10,000 SNPs. This package was developed on UNIX/Linux platform, written in Perl and uses a MySQL database. Scripts to generate a user-friendly web interface are also provided with common queries for preliminary data analysis. A machine learning tool developed by this group for increasing the efficiency of SNP discovery is integrated as a part of this package as an optional feature. The SNP-PHAGE package is being made available open source at http://bfgl.anri.barc.usda.gov/ML/snp-phage/. Conclusion SNP-PHAGE provides a bioinformatics

  9. Conclusive evidence for hexasomic inheritance in chrysanthemum based on analysis of a 183 k SNP array.

    Science.gov (United States)

    van Geest, Geert; Voorrips, Roeland E; Esselink, Danny; Post, Aike; Visser, Richard Gf; Arens, Paul

    2017-08-07

    Cultivated chrysanthemum is an outcrossing hexaploid (2n = 6× = 54) with a disputed mode of inheritance. In this paper, we present a single nucleotide polymorphism (SNP) selection pipeline that was used to design an Affymetrix Axiom array with 183 k SNPs from RNA sequencing data (1). With this array, we genotyped four bi-parental populations (with sizes of 405, 53, 76 and 37 offspring plants respectively), and a cultivar panel of 63 genotypes. Further, we present a method for dosage scoring in hexaploids from signal intensities of the array based on mixture models (2) and validation of selection steps in the SNP selection pipeline (3). The resulting genotypic data is used to draw conclusions on the mode of inheritance in chrysanthemum (4), and to make an inference on allelic expression bias (5). With use of the mixture model approach, we successfully called the dosage of 73,936 out of 183,130 SNPs (40.4%) that segregated in any of the bi-parental populations. To investigate the mode of inheritance, we analysed markers that segregated in the large bi-parental population (n = 405). Analysis of segregation of duplex x nulliplex SNPs resulted in evidence for genome-wide hexasomic inheritance. This evidence was substantiated by the absence of strong linkage between markers in repulsion, which indicated absence of full disomic inheritance. We present the success rate of SNP discovery out of RNA sequencing data as affected by different selection steps, among which SNP coverage over genotypes and use of different types of sequence read mapping software. Genomic dosage highly correlated with relative allele coverage from the RNA sequencing data, indicating that most alleles are expressed according to their genomic dosage. The large population, genotyped with a very large number of markers, is a unique framework for extensive genetic analyses in hexaploid chrysanthemum. As starting point, we show conclusive evidence for genome-wide hexasomic inheritance.

  10. Evaluation of inbreeding depression in Holstein cattle using whole-genome SNP markers and alternative measures of genomic inbreeding.

    Science.gov (United States)

    Bjelland, D W; Weigel, K A; Vukasinovic, N; Nkrumah, J D

    2013-07-01

    The effects of increased pedigree inbreeding in dairy cattle populations have been well documented and result in a negative impact on profitability. Recent advances in genotyping technology have allowed researchers to move beyond pedigree analysis and study inbreeding at a molecular level. In this study, 5,853 animals were genotyped for 54,001 single nucleotide polymorphisms (SNP); 2,913 cows had phenotypic records including a single lactation for milk yield (from either lactation 1, 2, 3, or 4), reproductive performance, and linear type conformation. After removing SNP with poor call rates, low minor allele frequencies, and departure from Hardy-Weinberg equilibrium, 33,025 SNP remained for analyses. Three measures of genomic inbreeding were evaluated: percent homozygosity (FPH), inbreeding calculated from runs of homozygosity (FROH), and inbreeding derived from a genomic relationship matrix (FGRM). Average FPH was 60.5±1.1%, average FROH was 3.8±2.1%, and average FGRM was 20.8±2.3%, where animals with larger values for each of the genomic inbreeding indices were considered more inbred. Decreases in total milk yield to 205d postpartum of 53, 20, and 47kg per 1% increase in FPH, FROH, and FGRM, respectively, were observed. Increases in days open per 1% increase in FPH (1.76 d), FROH (1.72 d), and FGRM (1.06 d) were also noted, as well as increases in maternal calving difficulty (0.09, 0.03, and 0.04 on a 5-point scale for FPH, FROH, and FGRM, respectively). Several linear type traits, such as strength (-0.40, -0.11, and -0.19), rear legs rear view (-0.35, -0.16, and -0.14), front teat placement (0.35, 0.25, 0.18), and teat length (-0.24, -0.14, and -0.13) were also affected by increases in FPH, FROH, and FGRM, respectively. Overall, increases in each measure of genomic inbreeding in this study were associated with negative effects on production and reproductive ability in dairy cows. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc

  11. Genome-wide association study of antisocial personality disorder.

    Science.gov (United States)

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-09-06

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53-3.14), P=1.9 × 10(-5)). Two polymorphisms at 6p21.2 LINC00951-LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37-1.85), P=1.6 × 10(-9)) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder.

  12. Genome-Wide Approaches to Drosophila Heart Development

    Directory of Open Access Journals (Sweden)

    Manfred Frasch

    2016-05-01

    Full Text Available The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level.

  13. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.

    Science.gov (United States)

    Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G

    2000-12-15

    The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.

  14. Microbial genome-wide association studies: lessons from human GWAS.

    Science.gov (United States)

    Power, Robert A; Parkhill, Julian; de Oliveira, Tulio

    2017-01-01

    The reduced costs of sequencing have led to whole-genome sequences for a large number of microorganisms, enabling the application of microbial genome-wide association studies (GWAS). Given the successes of human GWAS in understanding disease aetiology and identifying potential drug targets, microbial GWAS are likely to further advance our understanding of infectious diseases. These advances include insights into pressing global health problems, such as antibiotic resistance and disease transmission. In this Review, we outline the methodologies of GWAS, the current state of the field of microbial GWAS, and how lessons from human GWAS can direct the future of the field.

  15. Genome-wide detection of selection and other evolutionary forces

    DEFF Research Database (Denmark)

    Xu, Zhuofei; Zhou, Rui

    2015-01-01

    As is well known, pathogenic microbes evolve rapidly to escape from the host immune system and antibiotics. Genetic variations among microbial populations occur frequently during the long-term pathogen–host evolutionary arms race, and individual mutation beneficial for the fitness can be fixed...... to scan genome-wide alignments for evidence of positive Darwinian selection, recombination, and other evolutionary forces operating on the coding regions. In this chapter, we describe an integrative analysis pipeline and its application to tracking featured evolutionary trajectories on the genome...

  16. Multiple SNP markers reveal fine-scale population and deep phylogeographic structure in European anchovy (Engraulis encrasicolus L.).

    KAUST Repository

    Zarraonaindia, Iratxe; Iriondo, Mikel; Albaina, Aitor; Pardo, Miguel Angel; Manzano, Carmen; Grant, W Stewart; Irigoien, Xabier; Estonba, Andone

    2012-01-01

    DNA SNPs define two deep phylogroups that reflect ancient dispersals and colonizations. These markers define two ecological groups. One major group of Iberian-Atlantic populations is associated with upwelling areas on narrow continental shelves and includes

  17. Genome-Wide Gene Set Analysis for Identification of Pathways Associated with Alcohol Dependence

    Science.gov (United States)

    Biernacka, Joanna M.; Geske, Jennifer; Jenkins, Gregory D.; Colby, Colin; Rider, David N.; Karpyak, Victor M.; Choi, Doo-Sup; Fridley, Brooke L.

    2013-01-01

    It is believed that multiple genetic variants with small individual effects contribute to the risk of alcohol dependence. Such polygenic effects are difficult to detect in genome-wide association studies that test for association of the phenotype with each single nucleotide polymorphism (SNP) individually. To overcome this challenge, gene set analysis (GSA) methods that jointly test for the effects of pre-defined groups of genes have been proposed. Rather than testing for association between the phenotype and individual SNPs, these analyses evaluate the global evidence of association with a set of related genes enabling the identification of cellular or molecular pathways or biological processes that play a role in development of the disease. It is hoped that by aggregating the evidence of association for all available SNPs in a group of related genes, these approaches will have enhanced power to detect genetic associations with complex traits. We performed GSA using data from a genome-wide study of 1165 alcohol dependent cases and 1379 controls from the Study of Addiction: Genetics and Environment (SAGE), for all 200 pathways listed in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Results demonstrated a potential role of the “Synthesis and Degradation of Ketone Bodies” pathway. Our results also support the potential involvement of the “Neuroactive Ligand Receptor Interaction” pathway, which has previously been implicated in addictive disorders. These findings demonstrate the utility of GSA in the study of complex disease, and suggest specific directions for further research into the genetic architecture of alcohol dependence. PMID:22717047

  18. Genome-wide linkage scan for colorectal cancer susceptibility genes supports linkage to chromosome 3q

    Directory of Open Access Journals (Sweden)

    Velculescu Victor E

    2008-04-01

    Full Text Available Abstract Background Colorectal cancer is one of the most common causes of cancer-related mortality. The disease is clinically and genetically heterogeneous though a strong hereditary component has been identified. However, only a small proportion of the inherited susceptibility can be ascribed to dominant syndromes, such as Hereditary Non-Polyposis Colorectal Cancer (HNPCC or Familial Adenomatous Polyposis (FAP. In an attempt to identify novel colorectal cancer predisposing genes, we have performed a genome-wide linkage analysis in 30 Swedish non-FAP/non-HNPCC families with a strong family history of colorectal cancer. Methods Statistical analysis was performed using multipoint parametric and nonparametric linkage. Results Parametric analysis under the assumption of locus homogeneity excluded any common susceptibility regions harbouring a predisposing gene for colorectal cancer. However, several loci on chromosomes 2q, 3q, 6q, and 7q with suggestive linkage were detected in the parametric analysis under the assumption of locus heterogeneity as well as in the nonparametric analysis. Among these loci, the locus on chromosome 3q21.1-q26.2 was the most consistent finding providing positive results in both parametric and nonparametric analyses Heterogeneity LOD score (HLOD = 1.90, alpha = 0.45, Non-Parametric LOD score (NPL = 2.1. Conclusion The strongest evidence of linkage was seen for the region on chromosome 3. Interestingly, the same region has recently been reported as the most significant finding in a genome-wide analysis performed with SNP arrays; thus our results independently support the finding on chromosome 3q.

  19. A genome-wide association study reveals variants in ARL15 that influence adiponectin levels.

    Directory of Open Access Journals (Sweden)

    J Brent Richards

    2009-12-01

    Full Text Available The adipocyte-derived protein adiponectin is highly heritable and inversely associated with risk of type 2 diabetes mellitus (T2D and coronary heart disease (CHD. We meta-analyzed 3 genome-wide association studies for circulating adiponectin levels (n = 8,531 and sought validation of the lead single nucleotide polymorphisms (SNPs in 5 additional cohorts (n = 6,202. Five SNPs were genome-wide significant in their relationship with adiponectin (P< or =5x10(-8. We then tested whether these 5 SNPs were associated with risk of T2D and CHD using a Bonferroni-corrected threshold of P< or =0.011 to declare statistical significance for these disease associations. SNPs at the adiponectin-encoding ADIPOQ locus demonstrated the strongest associations with adiponectin levels (P-combined = 9.2x10(-19 for lead SNP, rs266717, n = 14,733. A novel variant in the ARL15 (ADP-ribosylation factor-like 15 gene was associated with lower circulating levels of adiponectin (rs4311394-G, P-combined = 2.9x10(-8, n = 14,733. This same risk allele at ARL15 was also associated with a higher risk of CHD (odds ratio [OR] = 1.12, P = 8.5x10(-6, n = 22,421 more nominally, an increased risk of T2D (OR = 1.11, P = 3.2x10(-3, n = 10,128, and several metabolic traits. Expression studies in humans indicated that ARL15 is well-expressed in skeletal muscle. These findings identify a novel protein, ARL15, which influences circulating adiponectin levels and may impact upon CHD risk.

  20. Genome-wide Association Study Implicates PARD3B-based AIDS Restriction

    Science.gov (United States)

    Nelson, George W.; Lautenberger, James A.; Chinn, Leslie; McIntosh, Carl; Johnson, Randall C.; Sezgin, Efe; Kessing, Bailey; Malasky, Michael; Hendrickson, Sher L.; Pontius, Joan; Tang, Minzhong; An, Ping; Winkler, Cheryl A.; Limou, Sophie; Le Clerc, Sigrid; Delaneau, Olivier; Zagury, Jean-François; Schuitemaker, Hanneke; van Manen, Daniëlle; Bream, Jay H.; Gomperts, Edward D.; Buchbinder, Susan; Goedert, James J.; Kirk, Gregory D.; O'Brien, Stephen J.

    2011-01-01

    Background. Host genetic variation influences human immunodeficiency virus (HIV) infection and progression to AIDS. Here we used clinically well-characterized subjects from 5 pretreatment HIV/AIDS cohorts for a genome-wide association study to identify gene associations with rate of AIDS progression. Methods.  European American HIV seroconverters (n = 755) were interrogated for single-nucleotide polymorphisms (SNPs) (n = 700,022) associated with progression to AIDS 1987 (Cox proportional hazards regression analysis, co-dominant model). Results.  Association with slower progression was observed for SNPs in the gene PARD3B. One of these, rs11884476, reached genome-wide significance (relative hazard = 0.3; P =3. 370 × 10−9) after statistical correction for 700,022 SNPs and contributes 4.52% of the overall variance in AIDS progression in this study. Nine of the top-ranked SNPs define a PARD3B haplotype that also displays significant association with progression to AIDS (hazard ratio, 0.3; P = 3.220 × 10−8). One of these SNPs, rs10185378, is a predicted exonic splicing enhancer; significant alteration in the expression profile of PARD3B splicing transcripts was observed in B cell lines with alternate rs10185378 genotypes. This SNP was typed in European cohorts of rapid progressors and was found to be protective for AIDS 1993 definition (odds ratio, 0.43, P = .025). Conclusions. These observations suggest a potential unsuspected pathway of host genetic influence on the dynamics of AIDS progression. PMID:21502085

  1. Genome-wide association analysis of autoantibody positivity in type 1 diabetes cases.

    Directory of Open Access Journals (Sweden)

    Vincent Plagnol

    2011-08-01

    Full Text Available The genetic basis of autoantibody production is largely unknown outside of associations located in the major histocompatibility complex (MHC human leukocyte antigen (HLA region. The aim of this study is the discovery of new genetic associations with autoantibody positivity using genome-wide association scan single nucleotide polymorphism (SNP data in type 1 diabetes (T1D patients with autoantibody measurements. We measured two anti-islet autoantibodies, glutamate decarboxylase (GADA, n = 2,506, insulinoma-associated antigen 2 (IA-2A, n = 2,498, antibodies to the autoimmune thyroid (Graves' disease (AITD autoantigen thyroid peroxidase (TPOA, n = 8,300, and antibodies against gastric parietal cells (PCA, n = 4,328 that are associated with autoimmune gastritis. Two loci passed a stringent genome-wide significance level (p<10(-10: 1q23/FCRL3 with IA-2A and 9q34/ABO with PCA. Eleven of 52 non-MHC T1D loci showed evidence of association with at least one autoantibody at a false discovery rate of 16%: 16p11/IL27-IA-2A, 2q24/IFIH1-IA-2A and PCA, 2q32/STAT4-TPOA, 10p15/IL2RA-GADA, 6q15/BACH2-TPOA, 21q22/UBASH3A-TPOA, 1p13/PTPN22-TPOA, 2q33/CTLA4-TPOA, 4q27/IL2/TPOA, 15q14/RASGRP1/TPOA, and 12q24/SH2B3-GADA and TPOA. Analysis of the TPOA-associated loci in 2,477 cases with Graves' disease identified two new AITD loci (BACH2 and UBASH3A.

  2. Genome-wide association study identifies shared risk loci common to two malignancies in golden retrievers.

    Directory of Open Access Journals (Sweden)

    Noriko Tonomura

    2015-02-01

    Full Text Available Dogs, with their breed-determined limited genetic background, are great models of human disease including cancer. Canine B-cell lymphoma and hemangiosarcoma are both malignancies of the hematologic system that are clinically and histologically similar to human B-cell non-Hodgkin lymphoma and angiosarcoma, respectively. Golden retrievers in the US show significantly elevated lifetime risk for both B-cell lymphoma (6% and hemangiosarcoma (20%. We conducted genome-wide association studies for hemangiosarcoma and B-cell lymphoma, identifying two shared predisposing loci. The two associated loci are located on chromosome 5, and together contribute ~20% of the risk of developing these cancers. Genome-wide p-values for the top SNP of each locus are 4.6×10-7 and 2.7×10-6, respectively. Whole genome resequencing of nine cases and controls followed by genotyping and detailed analysis identified three shared and one B-cell lymphoma specific risk haplotypes within the two loci, but no coding changes were associated with the risk haplotypes. Gene expression analysis of B-cell lymphoma tumors revealed that carrying the risk haplotypes at the first locus is associated with down-regulation of several nearby genes including the proximal gene TRPC6, a transient receptor Ca2+-channel involved in T-cell activation, among other functions. The shared risk haplotype in the second locus overlaps the vesicle transport and release gene STX8. Carrying the shared risk haplotype is associated with gene expression changes of 100 genes enriched for pathways involved in immune cell activation. Thus, the predisposing germ-line mutations in B-cell lymphoma and hemangiosarcoma appear to be regulatory, and affect pathways involved in T-cell mediated immune response in the tumor. This suggests that the interaction between the immune system and malignant cells plays a common role in the tumorigenesis of these relatively different cancers.

  3. Genome-wide association study of retinopathy in individuals without diabetes.

    Directory of Open Access Journals (Sweden)

    Richard A Jensen

    Full Text Available Mild retinopathy (microaneurysms or dot-blot hemorrhages is observed in persons without diabetes or hypertension and may reflect microvascular disease in other organs. We conducted a genome-wide association study (GWAS of mild retinopathy in persons without diabetes.A working group agreed on phenotype harmonization, covariate selection and analytic plans for within-cohort GWAS. An inverse-variance weighted fixed effects meta-analysis was performed with GWAS results from six cohorts of 19,411 Caucasians. The primary analysis included individuals without diabetes and secondary analyses were stratified by hypertension status. We also singled out the results from single nucleotide polymorphisms (SNPs previously shown to be associated with diabetes and hypertension, the two most common causes of retinopathy.No SNPs reached genome-wide significance in the primary analysis or the secondary analysis of participants with hypertension. SNP, rs12155400, in the histone deacetylase 9 gene (HDAC9 on chromosome 7, was associated with retinopathy in analysis of participants without hypertension, -1.3±0.23 (beta ± standard error, p = 6.6×10(-9. Evidence suggests this was a false positive finding. The minor allele frequency was low (∼2%, the quality of the imputation was moderate (r(2 ∼0.7, and no other common variants in the HDAC9 gene were associated with the outcome. SNPs found to be associated with diabetes and hypertension in other GWAS were not associated with retinopathy in persons without diabetes or in subgroups with or without hypertension.This GWAS of retinopathy in individuals without diabetes showed little evidence of genetic associations. Further studies are needed to identify genes associated with these signs in order to help unravel novel pathways and determinants of microvascular diseases.

  4. Genome-wide haplotype analysis of cis expression quantitative trait loci in monocytes.

    Directory of Open Access Journals (Sweden)

    Sophie Garnier

    Full Text Available In order to assess whether gene expression variability could be influenced by several SNPs acting in cis, either through additive or more complex haplotype effects, a systematic genome-wide search for cis haplotype expression quantitative trait loci (eQTL was conducted in a sample of 758 individuals, part of the Cardiogenics Transcriptomic Study, for which genome-wide monocyte expression and GWAS data were available. 19,805 RNA probes were assessed for cis haplotypic regulation through investigation of ~2,1 × 10(9 haplotypic combinations. 2,650 probes demonstrated haplotypic p-values >10(4-fold smaller than the best single SNP p-value. Replication of significant haplotype effects were tested for 412 probes for which SNPs (or proxies that defined the detected haplotypes were available in the Gutenberg Health Study composed of 1,374 individuals. At the Bonferroni correction level of 1.2 × 10(-4 (~0.05/412, 193 haplotypic signals replicated. 1000 G imputation was then conducted, and 105 haplotypic signals still remained more informative than imputed SNPs. In-depth analysis of these 105 cis eQTL revealed that at 76 loci genetic associations were compatible with additive effects of several SNPs, while for the 29 remaining regions data could be compatible with a more complex haplotypic pattern. As 24 of the 105 cis eQTL have previously been reported to be disease-associated loci, this work highlights the need for conducting haplotype-based and 1000 G imputed cis eQTL analysis before commencing functional studies at disease-associated loci.

  5. Genome-wide meta-analysis of common variant differences between men and women

    Science.gov (United States)

    Boraska, Vesna; Jerončić, Ana; Colonna, Vincenza; Southam, Lorraine; Nyholt, Dale R.; William Rayner, Nigel; Perry, John R.B.; Toniolo, Daniela; Albrecht, Eva; Ang, Wei; Bandinelli, Stefania; Barbalic, Maja; Barroso, Inês; Beckmann, Jacques S.; Biffar, Reiner; Boomsma, Dorret; Campbell, Harry; Corre, Tanguy; Erdmann, Jeanette; Esko, Tõnu; Fischer, Krista; Franceschini, Nora; Frayling, Timothy M.; Girotto, Giorgia; Gonzalez, Juan R.; Harris, Tamara B.; Heath, Andrew C.; Heid, Iris M.; Hoffmann, Wolfgang; Hofman, Albert; Horikoshi, Momoko; Hua Zhao, Jing; Jackson, Anne U.; Hottenga, Jouke-Jan; Jula, Antti; Kähönen, Mika; Khaw, Kay-Tee; Kiemeney, Lambertus A.; Klopp, Norman; Kutalik, Zoltán; Lagou, Vasiliki; Launer, Lenore J.; Lehtimäki, Terho; Lemire, Mathieu; Lokki, Marja-Liisa; Loley, Christina; Luan, Jian'an; Mangino, Massimo; Mateo Leach, Irene; Medland, Sarah E.; Mihailov, Evelin; Montgomery, Grant W.; Navis, Gerjan; Newnham, John; Nieminen, Markku S.; Palotie, Aarno; Panoutsopoulou, Kalliope; Peters, Annette; Pirastu, Nicola; Polašek, Ozren; Rehnström, Karola; Ripatti, Samuli; Ritchie, Graham R.S.; Rivadeneira, Fernando; Robino, Antonietta; Samani, Nilesh J.; Shin, So-Youn; Sinisalo, Juha; Smit, Johannes H.; Soranzo, Nicole; Stolk, Lisette; Swinkels, Dorine W.; Tanaka, Toshiko; Teumer, Alexander; Tönjes, Anke; Traglia, Michela; Tuomilehto, Jaakko; Valsesia, Armand; van Gilst, Wiek H.; van Meurs, Joyce B.J.; Smith, Albert Vernon; Viikari, Jorma; Vink, Jacqueline M.; Waeber, Gerard; Warrington, Nicole M.; Widen, Elisabeth; Willemsen, Gonneke; Wright, Alan F.; Zanke, Brent W.; Zgaga, Lina; Boehnke, Michael; d'Adamo, Adamo Pio; de Geus, Eco; Demerath, Ellen W.; den Heijer, Martin; Eriksson, Johan G.; Ferrucci, Luigi; Gieger, Christian; Gudnason, Vilmundur; Hayward, Caroline; Hengstenberg, Christian; Hudson, Thomas J.; Järvelin, Marjo-Riitta; Kogevinas, Manolis; Loos, Ruth J.F.; Martin, Nicholas G.; Metspalu, Andres; Pennell, Craig E.; Penninx, Brenda W.; Perola, Markus; Raitakari, Olli; Salomaa, Veikko; Schreiber, Stefan; Schunkert, Heribert; Spector, Tim D.; Stumvoll, Michael; Uitterlinden, André G.; Ulivi, Sheila; van der Harst, Pim; Vollenweider, Peter; Völzke, Henry; Wareham, Nicholas J.; Wichmann, H.-Erich; Wilson, James F.; Rudan, Igor; Xue, Yali; Zeggini, Eleftheria

    2012-01-01

    The male-to-female sex ratio at birth is constant across world populations with an average of 1.06 (106 male to 100 female live births) for populations of European descent. The sex ratio is considered to be affected by numerous biological and environmental factors and to have a heritable component. The aim of this study was to investigate the presence of common allele modest effects at autosomal and chromosome X variants that could explain the observed sex ratio at birth. We conducted a large-scale genome-wide association scan (GWAS) meta-analysis across 51 studies, comprising overall 114 863 individuals (61 094 women and 53 769 men) of European ancestry and 2 623 828 common (minor allele frequency >0.05) single-nucleotide polymorphisms (SNPs). Allele frequencies were compared between men and women for directly-typed and imputed variants within each study. Forward-time simulations for unlinked, neutral, autosomal, common loci were performed under the demographic model for European populations with a fixed sex ratio and a random mating scheme to assess the probability of detecting significant allele frequency differences. We do not detect any genome-wide significant (P < 5 × 10−8) common SNP differences between men and women in this well-powered meta-analysis. The simulated data provided results entirely consistent with these findings. This large-scale investigation across ∼115 000 individuals shows no detectable contribution from common genetic variants to the observed skew in the sex ratio. The absence of sex-specific differences is useful in guiding genetic association study design, for example when using mixed controls for sex-biased traits. PMID:22843499

  6. Gene Set Analyses of Genome-Wide Association Studies on 49 Quantitative Traits Measured in a Single Genetic Epidemiology Dataset

    Directory of Open Access Journals (Sweden)

    Jihye Kim

    2013-09-01

    Full Text Available Gene set analysis is a powerful tool for interpreting a genome-wide association study result and is gaining popularity these days. Comparison of the gene sets obtained for a variety of traits measured from a single genetic epidemiology dataset may give insights into the biological mechanisms underlying these traits. Based on the previously published single nucleotide polymorphism (SNP genotype data on 8,842 individuals enrolled in the Korea Association Resource project, we performed a series of systematic genome-wide association analyses for 49 quantitative traits of basic epidemiological, anthropometric, or blood chemistry parameters. Each analysis result was subjected to subsequent gene set analyses based on Gene Ontology (GO terms using gene set analysis software, GSA-SNP, identifying a set of GO terms significantly associated to each trait (pcorr < 0.05. Pairwise comparison of the traits in terms of the semantic similarity in their GO sets revealed surprising cases where phenotypically uncorrelated traits showed high similarity in terms of biological pathways. For example, the pH level was related to 7 other traits that showed low phenotypic correlations with it. A literature survey implies that these traits may be regulated partly by common pathways that involve neuronal or nerve systems.

  7. SNP Marker Integration and QTL Analysis of 12 Agronomic and Morphological Traits in F8 RILs of Pepper (Capsicum annuum L.)

    Science.gov (United States)

    Lu, Fu-Hao; Kwon, Soon-Wook; Yoon, Min-Young; Kim, Ki-Taek; Cho, Myeong-Cheoul; Yoon, Moo-Kyung; Park, Yong-Jin

    2012-01-01

    Red pepper, Capsicum annuum L., has been attracting geneticists’ and breeders’ attention as one of the important agronomic crops. This study was to integrate 41 SNP markers newly developed from comparative transcriptomes into a previous linkage map, and map 12 agronomic and morphological traits into the integrated map. A total of 39 markers found precise position and were assigned to 13 linkage groups (LGs) as well as the unassigned LGe, leading to total 458 molecular markers present in this genetic map. Linkage mapping was supported by the physical mapping to tomato and potato genomes using BLAST retrieving, revealing at least two-thirds of the markers mapped to the corresponding LGs. A sum of 23 quantitative trait loci from 11 traits was detected using the composite interval mapping algorithm. A consistent interval between a035_1 and a170_1 on LG5 was detected as a main-effect locus among the resistance QTLs to Phytophthora capsici at high-, intermediate- and low-level tests, and interactions between the QTLs for high-level resistance test were found. Considering the epistatic effect, those QTLs could explain up to 98.25% of the phenotype variations of resistance. Moreover, 17 QTLs for another eight traits were found to locate on LG3, 4, and 12 mostly with varying phenotypic contribution. Furthermore, the locus for corolla color was mapped to LG10 as a marker. The integrated map and the QTLs identified would be helpful for current genetics research and crop breeding, especially in the Solanaceae family. PMID:22684870

  8. A genome-wide association study of serum uric acid in African Americans

    Directory of Open Access Journals (Sweden)

    Gerry Norman P

    2011-02-01

    Full Text Available Abstract Background Uric acid is the primary byproduct of purine metabolism. Hyperuricemia is associated with body mass index (BMI, sex, and multiple complex diseases including gout, hypertension (HTN, renal disease, and type 2 diabetes (T2D. Multiple genome-wide association studies (GWAS in individuals of European ancestry (EA have reported associations between serum uric acid levels (SUAL and specific genomic loci. The purposes of this study were: 1 to replicate major signals reported in EA populations; and 2 to use the weak LD pattern in African ancestry population to better localize (fine-map reported loci and 3 to explore the identification of novel findings cognizant of the moderate sample size. Methods African American (AA participants (n = 1,017 from the Howard University Family Study were included in this study. Genotyping was performed using the Affymetrix® Genome-wide Human SNP Array 6.0. Imputation was performed using MACH and the HapMap reference panels for CEU and YRI. A total of 2,400,542 single nucleotide polymorphisms (SNPs were assessed for association with serum uric acid under the additive genetic model with adjustment for age, sex, BMI, glomerular filtration rate, HTN, T2D, and the top two principal components identified in the assessment of admixture and population stratification. Results Four variants in the gene SLC2A9 achieved genome-wide significance for association with SUAL (p-values ranging from 8.88 × 10-9 to 1.38 × 10-9. Fine-mapping of the SLC2A9 signals identified a 263 kb interval of linkage disequilibrium in the HapMap CEU sample. This interval was reduced to 37 kb in our AA and the HapMap YRI samples. Conclusions The most strongly associated locus for SUAL in EA populations was also the most strongly associated locus in this AA sample. This finding provides evidence for the role of SLC2A9 in uric acid metabolism across human populations. Additionally, our findings demonstrate the utility of following-up EA

  9. Genome-wide association study of PR interval in Hispanics/Latinos identifies novel locus at ID2.

    Science.gov (United States)

    Seyerle, Amanda A; Lin, Henry J; Gogarten, Stephanie M; Stilp, Adrienne; Méndez Giráldez, Raul; Soliman, Elsayed; Baldassari, Antoine; Graff, Mariaelisa; Heckbert, Susan; Kerr, Kathleen F; Kooperberg, Charles; Rodriguez, Carlos; Guo, Xiuqing; Yao, Jie; Sotoodehnia, Nona; Taylor, Kent D; Whitsel, Eric A; Rotter, Jerome I; Laurie, Cathy C; Avery, Christy L

    2017-11-10

    PR interval (PR) is a heritable electrocardiographic measure of atrial and atrioventricular nodal conduction. Changes in PR duration may be associated with atrial fibrillation, heart failure and all-cause mortality. Hispanic/Latino populations have high burdens of cardiovascular morbidity and mortality, are highly admixed and represent exceptional opportunities for novel locus identification. However, they remain chronically understudied. We present the first genome-wide association study (GWAS) of PR in 14 756 participants of Hispanic/Latino ancestry from three studies. Study-specific summary results of the association between 1000 Genomes Phase 1 imputed single-nucleotide polymorphisms (SNPs) and PR assumed an additive genetic model and were adjusted for global ancestry, study centre/region and clinical covariates. Results were combined using fixed-effects, inverse variance weighted meta-analysis. Sequential conditional analyses were used to identify independent signals. Replication of novel loci was performed in populations of Asian, African and European descent. ENCODE and RoadMap data were used to annotate results. We identified a novel genome-wide association (PPR at ID2 (rs6730558), which replicated in Asian and European populations (PPR loci to Hispanics/Latinos. Bioinformatics annotation provided evidence for regulatory function in cardiac tissue. Further, for six loci that generalised, the Hispanic/Latino index SNP was genome-wide significant and identical to (or in high linkage disequilibrium with) the previously identified GWAS lead SNP. Our results suggest that genetic determinants of PR are consistent across race/ethnicity, but extending studies to admixed populations can identify novel associations, underscoring the importance of conducting genetic studies in diverse populations. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise

  10. Genome-Wide Association Study Reveals Greater Polygenic Loading for Schizophrenia in Cases With a Family History of Illness

    Science.gov (United States)

    Bigdeli, Tim B.; Ripke, Stephan; Bacanu, Silviu-Alin; Lee, Sang Hong; Wray, Naomi R.; Gejman, Pablo V.; Rietschel, Marcella; Cichon, Sven; St Clair, David; Corvin, Aiden; Kirov, George; McQuillin, Andrew; Gurling, Hugh; Rujescu, Dan; Andreassen, Ole A.; Werge, Thomas; Blackwood, Douglas H.R.; Pato, Carlos N.; Pato, Michele T.; Malhotra, Anil K.; O’Donovan, Michael C.; Kendler, Kenneth S.; Fanous, Ayman H.

    2018-01-01

    Genome-wide association studies (GWAS) of schizophrenia have yielded more than 100 common susceptibility variants, and strongly support a substantial polygenic contribution of a large number of small allelic effects. It has been hypothesized that familial schizophrenia is largely a consequence of inherited rather than environmental factors. We investigated the extent to which familiality of schizophrenia is associated with enrichment for common risk variants detectable in a large GWAS. We analyzed single nucleotide polymorphism (SNP) data for cases reporting a family history of psychotic illness (N = 978), cases reporting no such family history (N = 4,503), and unscreened controls (N = 8,285) from the Psychiatric Genomics Consortium (PGC1) study of schizophrenia. We used a multinomial logistic regression approach with model-fitting to detect allelic effects specific to either family history subgroup. We also considered a polygenic model, in which we tested whether family history positive subjects carried more schizophrenia risk alleles than family history negative subjects, on average. Several individual SNPs attained suggestive but not genome-wide significant association with either family history subgroup. Comparison of genome-wide polygenic risk scores based on GWAS summary statistics indicated a significant enrichment for SNP effects among family history positive compared to family history negative cases (Nagelkerke’s R2 = 0.0021; P = 0.00331; P-value threshold history positive compared to family history negative cases (0.32 and 0.22, respectively; P = 0.031).We found suggestive evidence of allelic effects detectable in large GWAS of schizophrenia that might be specific to particular family history subgroups. However, consideration of a polygenic risk score indicated a significant enrichment among family history positive cases for common allelic effects