Gao, Yangchun; Li, Shiguo; Zhan, Aibin
Invasive species cause huge damages to ecology, environment and economy globally. The comprehensive understanding of invasion mechanisms, particularly genetic bases of micro-evolutionary processes responsible for invasion success, is essential for reducing potential damages caused by invasive species. The golden star tunicate, Botryllus schlosseri, has become a model species in invasion biology, mainly owing to its high invasiveness nature and small well-sequenced genome. However, the genome-wide genetic markers have not been well developed in this highly invasive species, thus limiting the comprehensive understanding of genetic mechanisms of invasion success. Using restriction site-associated DNA (RAD) tag sequencing, here we developed a high-quality resource of 14,119 out of 158,821 SNPs for B. schlosseri. These SNPs were relatively evenly distributed at each chromosome. SNP annotations showed that the majority of SNPs (63.20%) were located at intergenic regions, and 21.51% and 14.58% were located at introns and exons, respectively. In addition, the potential use of the developed SNPs for population genomics studies was primarily assessed, such as the estimate of observed heterozygosity (H O ), expected heterozygosity (H E ), nucleotide diversity (π), Wright's inbreeding coefficient (F IS ) and effective population size (Ne). Our developed SNP resource would provide future studies the genome-wide genetic markers for genetic and genomic investigations, such as genetic bases of micro-evolutionary processes responsible for invasion success.
This is because the SNPs on BovineSNP50 and GGP-80K assays were ascertained as being common in European taurine breeds. Lower MAF and SNP informativeness observed in this study limits the application of these assays in breed assignment, and could have other implications for genome-wide studies in South ...
Murray, Lee; Mobegi, Victor A; Duffy, Craig W; Assefa, Samuel A; Kwiatkowski, Dominic P; Laman, Eugene; Loua, Kovana M; Conway, David J
In regions where malaria is endemic, individuals are often infected with multiple distinct parasite genotypes, a situation that may impact on evolution of parasite virulence and drug resistance. Most approaches to studying genotypic diversity have involved analysis of a modest number of polymorphic loci, although whole genome sequencing enables a broader characterisation of samples. PCR-based microsatellite typing of a panel of ten loci was performed on Plasmodium falciparum in 95 clinical isolates from a highly endemic area in the Republic of Guinea, to characterize within-isolate genetic diversity. Separately, single nucleotide polymorphism (SNP) data from genome-wide short-read sequences of the same samples were used to derive within-isolate fixation indices (F ws), an inverse measure of diversity within each isolate compared to overall local genetic diversity. The latter indices were compared with the microsatellite results, and also with indices derived by randomly sampling modest numbers of SNPs. As expected, the number of microsatellite loci with more than one allele in each isolate was highly significantly inversely correlated with the genome-wide F ws fixation index (r = -0.88, P 10 % had high correlation (r > 0.90) with the index derived using all SNPs. Different types of data give highly correlated indices of within-infection diversity, although PCR-based analysis detects low-level minority genotypes not apparent in bulk sequence analysis. When whole-genome data are not obtainable, quantitative assay of ten or more SNPs can yield a reasonably accurate estimate of the within-infection fixation index (F ws).
Full Text Available Age at first calving is an important trait for achieving earlier reproductive performance. To detect quantitative trait loci (QTL for reproductive traits, a genome wide association study was conducted on the 96 Hanwoo cows that were born between 2008 and 2010 from 13 sires in a local farm (Juk-Am Hanwoo farm, Suncheon, Korea and genotyped with the Illumina 50K bovine single nucleotide polymorphism (SNP chips. Phenotypes were regressed on additive and dominance effects for each SNP using a simple linear regression model after the effects of birth-year-month and polygenes were considered. A forward regression procedure was applied to determine the best set of SNPs for age at first calving. A total of 15 QTL were detected at the comparison-wise 0.001 level. Two QTL with strong statistical evidence were found at 128.9 Mb and 111.1 Mb on bovine chromosomes (BTA 2 and 7, respectively, each of which accounted for 22% of the phenotypic variance. Also, five significant SNPs were detected on BTAs 10, 16, 20, 26, and 29. Multiple QTL were found on BTAs 1, 2, 7, and 14. The significant QTLs may be applied via marker assisted selection to increase rate of genetic gain for the trait, after validation tests in other Hanwoo cow populations.
Goudey, Benjamin; Abedini, Mani; Hopper, John L; Inouye, Michael; Makalic, Enes; Schmidt, Daniel F; Wagner, John; Zhou, Zeyu; Zobel, Justin; Reumann, Matthias
Genome-wide association studies (GWAS) are a common approach for systematic discovery of single nucleotide polymorphisms (SNPs) which are associated with a given disease. Univariate analysis approaches commonly employed may miss important SNP associations that only appear through multivariate analysis in complex diseases. However, multivariate SNP analysis is currently limited by its inherent computational complexity. In this work, we present a computational framework that harnesses supercomputers. Based on our results, we estimate a three-way interaction analysis on 1.1 million SNP GWAS data requiring over 5.8 years on the full "Avoca" IBM Blue Gene/Q installation at the Victorian Life Sciences Computation Initiative. This is hundreds of times faster than estimates for other CPU based methods and four times faster than runtimes estimated for GPU methods, indicating how the improvement in the level of hardware applied to interaction analysis may alter the types of analysis that can be performed. Furthermore, the same analysis would take under 3 months on the currently largest IBM Blue Gene/Q supercomputer "Sequoia" at the Lawrence Livermore National Laboratory assuming linear scaling is maintained as our results suggest. Given that the implementation used in this study can be further optimised, this runtime means it is becoming feasible to carry out exhaustive analysis of higher order interaction studies on large modern GWAS.
Li, M-H; Tiirikka, T; Kantanen, J
In sheep, coat colour (and pattern) is one of the important traits of great biological, economic and social importance. However, the genetics of sheep coat colour has not yet been fully clarified. We conducted a genome-wide association study of sheep coat colours by genotyping 47 303 single-nucleotide polymorphisms (SNPs) in the Finnsheep population in Finland. We identified 35 SNPs associated with all the coat colours studied, which cover genomic regions encompassing three kno...
Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to show the distribution of these 2 important incompatible cultivated pepper species. Estimated mean nucleotide...
Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis
SNP Array v2. A ‘proof-of-concept’ advanced data mining algorithm for unsupervised analysis of genome-wide association study (GWAS) dataset was... Opal F AUS Yes U141 Peggs F AUS Yes U142 Taxi F AUS Yes U143 Riso MI MAL Yes U144 Szarik MI GSD Yes U145 Astor MI MAL Yes U146 Roy MC MAL Yes... mining of genetic studies in general, and especially GWAS. As a proof-of-concept, a classification analysis of the WG SNP typing dataset of a
Full Text Available Hereditary 1,25-dihydroxyvitamin D-resistant rickets (HVDRR is an autosomal recessive disease caused by biallelic mutations in the vitamin D receptor (VDR gene. No patients have been reported with uniparental disomy (UPD.Using genome-wide single nucleotide polymorphism (SNP array to confirm whether HVDRR was caused by UPD of chromosome 12.A 2-year-old girl with alopecia and short stature and without any family history of consanguinity was diagnosed with HVDRR by typical laboratory data findings and clinical features of rickets. Sequence analysis of VDR was performed, and the origin of the homozygous mutation was investigated by target SNP sequencing, short tandem repeat analysis, and genome-wide SNP array.The patient had a homozygous p.Arg73Ter nonsense mutation. Her mother was heterozygous for the mutation, but her father was negative. We excluded gross deletion of the father's allele or paternal discordance. Genome-wide SNP array of the family (the patient and her parents showed complete maternal isodisomy of chromosome 12. She was successfully treated with high-dose oral calcium.This is the first report of HVDRR caused by UPD, and the third case of complete UPD of chromosome 12, in the published literature. Genome-wide SNP array was useful for detecting isodisomy and the parental origin of the allele. Comprehensive examination of the homozygous state is essential for accurate genetic counseling of recurrence risk and appropriate monitoring for other chromosome 12 related disorders. Furthermore, oral calcium therapy was effective as an initial treatment for rickets in this instance.
Chavez-Galarza, Julio; Johnston, J. Spencer; Azevedo, João; Muñoz, Irene; De la Rúa, Pilar; Patton, John C.; Pinto, M. Alice
Dissecting genome-wide (expansions, contractions, admixture) from genome-specific effects (selection) is a goal of central importance in evolutionary biology because it leads to more robust inferences of demographic history and to identification of adaptive divergence. The publication of the honey bee genome and the development of high-density SNPs genotyping, provide us with powerful tools, allowing us to identify signatures of selection in the honey bee genome. These signatur...
van Binsbergen, R; Veerkamp, R F; Calus, M P L
The correlated responses between traits may differ depending on the makeup of genetic covariances, and may differ from the predictions of polygenic covariances. Therefore, the objective of the present study was to investigate the makeup of the genetic covariances between the well-studied traits: milk yield, fat yield, protein yield, and their percentages in more detail. Phenotypic records of 1,737 heifers of research farms in 4 different countries were used after homogenizing and adjusting for management effects. All cows had a genotype for 37,590 single nucleotide polymorphisms (SNP). A bayesian stochastic search variable selection model was used to estimate the SNP effects for each trait. About 0.5 to 1.0% of the SNP had a significant effect on 1 or more traits; however, the SNP without a significant effect explained most of the genetic variances and covariances of the traits. Single nucleotide polymorphism correlations differed from the polygenic correlations, but only 10 regions were found with an effect on multiple traits; in 1 of these regions the DGAT1 gene was previously reported with an effect on multiple traits. This region explained up to 41% of the variances of 4 traits and explained a major part of the correlation between fat yield and fat percentage and contributes to asymmetry in correlated response between fat yield and fat percentage. Overall, for the traits in this study, the infinitesimal model is expected to be sufficient for the estimation of the variances and covariances. Copyright © 2012 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Full Text Available Abstract Background Clostridium beijerinckii is a prominent solvent-producing microbe that has great potential for biofuel and chemical industries. Although transcriptional analysis is essential to understand gene functions and regulation and thus elucidate proper strategies for further strain improvement, limited information is available on the genome-wide transcriptional analysis for C. beijerinckii. Results The genome-wide transcriptional dynamics of C. beijerinckii NCIMB 8052 over a batch fermentation process was investigated using high-throughput RNA-Seq technology. The gene expression profiles indicated that the glycolysis genes were highly expressed throughout the fermentation, with comparatively more active expression during acidogenesis phase. The expression of acid formation genes was down-regulated at the onset of solvent formation, in accordance with the metabolic pathway shift from acidogenesis to solventogenesis. The acetone formation gene (adc, as a part of the sol operon, exhibited highly-coordinated expression with the other sol genes. Out of the > 20 genes encoding alcohol dehydrogenase in C. beijerinckii, Cbei_1722 and Cbei_2181 were highly up-regulated at the onset of solventogenesis, corresponding to their key roles in primary alcohol production. Most sporulation genes in C. beijerinckii 8052 demonstrated similar temporal expression patterns to those observed in B. subtilis and C. acetobutylicum, while sporulation sigma factor genes sigE and sigG exhibited accelerated and stronger expression in C. beijerinckii 8052, which is consistent with the more rapid forespore and endspore development in this strain. Global expression patterns for specific gene functional classes were examined using self-organizing map analysis. The genes associated with specific functional classes demonstrated global expression profiles corresponding to the cell physiological variation and metabolic pathway switch. Conclusions The results from this
Sundqvist, J; Xu, H; Vodolazkaia, A; Fassbender, A; Kyama, C; Bokor, A; Gemzell-Danielsson, K; D'Hooghe, T M; Falconer, H
Is it possible to replicate the previously identified genetic association of four single-nucleotide polymorphisms (SNPs), rs12700667, rs7798431, rs1250248 and rs7521902, with endometriosis in a Caucasian population? A borderline association was observed for rs1250248 and endometriosis (P = 0.049). However, we could not replicate the other previously identified endometriosis-associated SNPs (rs12700667, rs7798431 and rs7521902) in the same population. Endometriosis is considered a complex disease, influenced by several genetic and environmental factors, as well as interactions between them. Previous studies have found genetic associations with endometriosis for SNPs at the 7p15 and 2q35 loci in a Caucasian population. Allele frequencies of SNPs were investigated in patients with endometriosis and controls. Blood samples and peritoneal biopsies were taken from a Caucasian female population consisting of 1129 patients with endometriosis and 831 controls. DNA was extracted for genotyping. The study was performed at a University hospital and research laboratories. A weak association with endometriosis (all stages) was observed for rs1250248 (P = 0.049). No significant associations were observed for the SNPs rs12700667, rs7798431 and rs7521902. A non-significant trend towards the association of rs1250248 with moderate/severe endometriosis was observed (odds ratio 1.18, 95% confidence interval 0.97-1.44). The inability to confirm all previous findings may result from differences between populations and type II errors. Our result demonstrates the difficulty of identifying common genetic variants in complex diseases. This study was supported by grants from the Karolinska Institutet and Stockholm City County/Karolinska Institutet (ALF), Stockholm, Sweden, Swedish Medical Research Council (K2007-54X-14212-06-3, K2010-54X-14212-09-3), Stockholm, Sweden, Leuven University Research Council (Onderzoeksraad KU Leuven), the Leuven University Hospitals Clinical Research Foundation
Meghann K. Devlin-Durante
Full Text Available The advent of next-generation sequencing tools has made it possible to conduct fine-scale surveys of population differentiation and genome-wide scans for signatures of selection in non-model organisms. Such surveys are of particular importance in sharply declining coral species, since knowledge of population boundaries and signs of local adaptation can inform restoration and conservation efforts. Here, we use genome-wide surveys of single-nucleotide polymorphisms in the threatened Caribbean elkhorn coral, Acropora palmata, to reveal fine-scale population structure and infer the major barrier to gene flow that separates the eastern and western Caribbean populations between the Bahamas and Puerto Rico. The exact location of this break had been subject to discussion because two previous studies based on microsatellite data had come to differing conclusions. We investigate this contradiction by analyzing an extended set of 11 microsatellite markers including the five previously employed and discovered that one of the original microsatellite loci is apparently under selection. Exclusion of this locus reconciles the results from the SNP and the microsatellite datasets. Scans for outlier loci in the SNP data detected 13 candidate loci under positive selection, however there was no correlation between available environmental parameters and genetic distance. Together, these results suggest that reef restoration efforts should use local sources and utilize existing functional variation among geographic regions in ex situ crossing experiments to improve stress resistance of this species.
Devlin-Durante, Meghann K; Baums, Iliana B
The advent of next-generation sequencing tools has made it possible to conduct fine-scale surveys of population differentiation and genome-wide scans for signatures of selection in non-model organisms. Such surveys are of particular importance in sharply declining coral species, since knowledge of population boundaries and signs of local adaptation can inform restoration and conservation efforts. Here, we use genome-wide surveys of single-nucleotide polymorphisms in the threatened Caribbean elkhorn coral, Acropora palmata , to reveal fine-scale population structure and infer the major barrier to gene flow that separates the eastern and western Caribbean populations between the Bahamas and Puerto Rico. The exact location of this break had been subject to discussion because two previous studies based on microsatellite data had come to differing conclusions. We investigate this contradiction by analyzing an extended set of 11 microsatellite markers including the five previously employed and discovered that one of the original microsatellite loci is apparently under selection. Exclusion of this locus reconciles the results from the SNP and the microsatellite datasets. Scans for outlier loci in the SNP data detected 13 candidate loci under positive selection, however there was no correlation between available environmental parameters and genetic distance. Together, these results suggest that reef restoration efforts should use local sources and utilize existing functional variation among geographic regions in ex situ crossing experiments to improve stress resistance of this species.
Melka Melkaye G
Full Text Available Abstract Background Studies of genetic diversity are essential in understanding the extent of differentiation between breeds, and in designing successful diversity conservation strategies. The objective of this study was to evaluate the level of genetic diversity within and between North American Brown Swiss (BS, n = 900, Jersey (JE, n = 2,922 and Holstein (HO, n = 3,535 cattle, using genotyped bulls. GENEPOP and FSTAT software were used to evaluate the level of genetic diversity within each breed and between each pair of the three breeds based on genome-wide SNP markers (n = 50,972. Results Hardy-Weinberg equilibrium (HWE exact test within breeds showed a significant deviation from equilibrium within each population (P st indicated that the combination of BS and HO in an ideally amalgamated population had higher genetic diversity than the other pairs of breeds. Conclusion Results suggest that the three bull populations have substantially different gene pools. BS and HO show the largest gene differentiation and jointly the highest total expected gene diversity compared to when JE is considered. If the loss of genetic diversity within breeds worsens in the future, the use of crossbreeding might be an option to recover genetic diversity, especially for the breeds with small population size.
Caicedo, Ana L; Williamson, Scott H; Hernandez, Ryan D
Domesticated Asian rice (Oryza sativa) is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments......, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs) in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models...... to explain contemporary patterns of polymorphisms in rice, including a (i) selectively neutral population bottleneck model, (ii) bottleneck plus migration model, (iii) multiple selective sweeps model, and (iv) bottleneck plus selective sweeps model. We find that a simple bottleneck model, which has been...
Full Text Available Non-additive genetic variation is usually ignored when genome-wide markers are used to study the genetic architecture and genomic prediction of complex traits in human, wild life, model organisms or farm animals. However, non-additive genetic effects may have an important contribution to total genetic variation of complex traits. This study presented a genomic BLUP model including additive and non-additive genetic effects, in which additive and non-additive genetic relation matrices were constructed from information of genome-wide dense single nucleotide polymorphism (SNP markers. In addition, this study for the first time proposed a method to construct dominance relationship matrix using SNP markers and demonstrated it in detail. The proposed model was implemented to investigate the amounts of additive genetic, dominance and epistatic variations, and assessed the accuracy and unbiasedness of genomic predictions for daily gain in pigs. In the analysis of daily gain, four linear models were used: 1 a simple additive genetic model (MA, 2 a model including both additive and additive by additive epistatic genetic effects (MAE, 3 a model including both additive and dominance genetic effects (MAD, and 4 a full model including all three genetic components (MAED. Estimates of narrow-sense heritability were 0.397, 0.373, 0.379 and 0.357 for models MA, MAE, MAD and MAED, respectively. Estimated dominance variance and additive by additive epistatic variance accounted for 5.6% and 9.5% of the total phenotypic variance, respectively. Based on model MAED, the estimate of broad-sense heritability was 0.506. Reliabilities of genomic predicted breeding values for the animals without performance records were 28.5%, 28.8%, 29.2% and 29.5% for models MA, MAE, MAD and MAED, respectively. In addition, models including non-additive genetic effects improved unbiasedness of genomic predictions.
Kerns, Sarah L.; Ostrer, Harry; Stock, Richard; Li, William; Moore, Julian; Pearlman, Alexander; Campbell, Christopher; Shao Yongzhao; Stone, Nelson; Kusnetz, Lynda; Rosenstein, Barry S.
Purpose: To identify single nucleotide polymorphisms (SNPs) associated with erectile dysfunction (ED) among African-American prostate cancer patients treated with external beam radiation therapy. Methods and Materials: A cohort of African-American prostate cancer patients treated with external beam radiation therapy was observed for the development of ED by use of the five-item Sexual Health Inventory for Men (SHIM) questionnaire. Final analysis included 27 cases (post-treatment SHIM score ≤7) and 52 control subjects (post-treatment SHIM score ≥16). A genome-wide association study was performed using approximately 909,000 SNPs genotyped on Affymetrix 6.0 arrays (Affymetrix, Santa Clara, CA). Results: We identified SNP rs2268363, located in the follicle-stimulating hormone receptor (FSHR) gene, as significantly associated with ED after correcting for multiple comparisons (unadjusted p = 5.46 x 10 -8 , Bonferroni p = 0.028). We identified four additional SNPs that tended toward a significant association with an unadjusted p value -6 . Inference of population substructure showed that cases had a higher proportion of African ancestry than control subjects (77% vs. 60%, p = 0.005). A multivariate logistic regression model that incorporated estimated ancestry and four of the top-ranked SNPs was a more accurate classifier of ED than a model that included only clinical variables. Conclusions: To our knowledge, this is the first genome-wide association study to identify SNPs associated with adverse effects resulting from radiotherapy. It is important to note that the SNP that proved to be significantly associated with ED is located within a gene whose encoded product plays a role in male gonad development and function. Another key finding of this project is that the four SNPs most strongly associated with ED were specific to persons of African ancestry and would therefore not have been identified had a cohort of European ancestry been screened. This study demonstrates
Manivannan, Abinaya; Kim, Jin-Hee; Yang, Eun-Young; Ahn, Yul-Kyun; Lee, Eun-Su; Choi, Sena; Kim, Do-Sun
Pepper is an economically important horticultural plant that has been widely used for its pungency and spicy taste in worldwide cuisines. Therefore, the domestication of pepper has been carried out since antiquity. Owing to meet the growing demand for pepper with high quality, organoleptic property, nutraceutical contents, and disease tolerance, genomics assisted breeding techniques can be incorporated to develop novel pepper varieties with desired traits. The application of next-generation sequencing (NGS) approaches has reformed the plant breeding technology especially in the area of molecular marker assisted breeding. The availability of genomic information aids in the deeper understanding of several molecular mechanisms behind the vital physiological processes. In addition, the NGS methods facilitate the genome-wide discovery of DNA based markers linked to key genes involved in important biological phenomenon. Among the molecular markers, single nucleotide polymorphism (SNP) indulges various benefits in comparison with other existing DNA based markers. The present review concentrates on the impact of NGS approaches in the discovery of useful SNP markers associated with pungency and disease resistance in pepper. The information provided in the current endeavor can be utilized for the betterment of pepper breeding in future.
Full Text Available Pepper is an economically important horticultural plant that has been widely used for its pungency and spicy taste in worldwide cuisines. Therefore, the domestication of pepper has been carried out since antiquity. Owing to meet the growing demand for pepper with high quality, organoleptic property, nutraceutical contents, and disease tolerance, genomics assisted breeding techniques can be incorporated to develop novel pepper varieties with desired traits. The application of next-generation sequencing (NGS approaches has reformed the plant breeding technology especially in the area of molecular marker assisted breeding. The availability of genomic information aids in the deeper understanding of several molecular mechanisms behind the vital physiological processes. In addition, the NGS methods facilitate the genome-wide discovery of DNA based markers linked to key genes involved in important biological phenomenon. Among the molecular markers, single nucleotide polymorphism (SNP indulges various benefits in comparison with other existing DNA based markers. The present review concentrates on the impact of NGS approaches in the discovery of useful SNP markers associated with pungency and disease resistance in pepper. The information provided in the current endeavor can be utilized for the betterment of pepper breeding in future.
Full Text Available Massively parallel sequencing platforms have allowed for the rapid discovery of single nucleotide polymorphisms (SNPs among related genotypes within a species. We describe the creation of reduced representation libraries (RRLs using an initial digestion of nuclear genomic DNA with a methylation-sensitive restriction endonuclease followed by a secondary digestion with the 4bp-restriction endonuclease This strategy allows for the enrichment of hypomethylated genomic DNA, which has been shown to be rich in genic sequences, and the digestion with serves to increase the number of common loci resequenced between individuals. Deep resequencing of these RRLs performed with the Illumina Genome Analyzer led to the identification of 2618 SNPs in rice and 1682 SNPs in soybean for two representative genotypes in each of the species. A subset of these SNPs was validated via Sanger sequencing, exhibiting validation rates of 96.4 and 97.0%, in rice ( and soybean (, respectively. Comparative analysis of the read distribution relative to annotated genes in the reference genome assemblies indicated that the RRL strategy was primarily sampling within genic regions for both species. The massively parallel sequencing of methylation-sensitive RRLs for genome-wide SNP discovery can be applied across a wide range of plant species having sufficient reference genomic sequence.
Yamamoto, Toshio; Nagasaki, Hideki; Yonemaru, Jun-ichi; Ebana, Kaworu; Nakajima, Maiko; Shibaya, Taeko; Yano, Masahiro
To create useful gene combinations in crop breeding, it is necessary to clarify the dynamics of the genome composition created by breeding practices. A large quantity of single-nucleotide polymorphism (SNP) data is required to permit discrimination of chromosome segments among modern cultivars, which are genetically related. Here, we used a high-throughput sequencer to conduct whole-genome sequencing of an elite Japanese rice cultivar, Koshihikari, which is closely related to Nipponbare, whose genome sequencing has been completed. Then we designed a high-throughput typing array based on the SNP information by comparison of the two sequences. Finally, we applied this array to analyze historical representative rice cultivars to understand the dynamics of their genome composition. The total 5.89-Gb sequence for Koshihikari, equivalent to 15.7 x the entire rice genome, was mapped using the Pseudomolecules 4.0 database for Nipponbare. The resultant Koshihikari genome sequence corresponded to 80.1% of the Nipponbare sequence and led to the identification of 67,051 SNPs. A high-throughput typing array consisting of 1917 SNP sites distributed throughout the genome was designed to genotype 151 representative Japanese cultivars that have been grown during the past 150 years. We could identify the ancestral origin of the pedigree haplotypes in 60.9% of the Koshihikari genome and 18 consensus haplotype blocks which are inherited from traditional landraces to current improved varieties. Moreover, it was predicted that modern breeding practices have generally decreased genetic diversity Detection of genome-wide SNPs by both high-throughput sequencer and typing array made it possible to evaluate genomic composition of genetically related rice varieties. With the aid of their pedigree information, we clarified the dynamics of chromosome recombination during the historical rice breeding process. We also found several genomic regions decreasing genetic diversity which might be
Li, M-H; Tiirikka, T; Kantanen, J
In sheep, coat colour (and pattern) is one of the important traits of great biological, economic and social importance. However, the genetics of sheep coat colour has not yet been fully clarified. We conducted a genome-wide association study of sheep coat colours by genotyping 47 303 single-nucleotide polymorphisms (SNPs) in the Finnsheep population in Finland. We identified 35 SNPs associated with all the coat colours studied, which cover genomic regions encompassing three known pigmentation genes (TYRP1, ASIP and MITF) in sheep. Eighteen of these associations were confirmed in further tests between white versus non-white individuals, but none of the 35 associations were significant in the analysis of only non-white colours. Across the tests, the s66432.1 in ASIP showed significant association (P=4.2 × 10(-11) for all the colours; P=2.3 × 10(-11) for white versus non-white colours) with the variation in coat colours and strong linkage disequilibrium with other significant variants surrounding the ASIP gene. The signals detected around the ASIP gene were explained by differences in white versus non-white alleles. Further, a genome scan for selection for white coat pigmentation identified a strong and striking selection signal spanning ASIP. Our study identified the main candidate gene for the coat colour variation between white and non-white as ASIP, an autosomal gene that has been directly implicated in the pathway regulating melanogenesis. Together with ASIP, the two other newly identified genes (TYRP1 and MITF) in the Finnsheep, bordering associated SNPs, represent a new resource for enriching sheep coat-colour genetics and breeding.
Pujolar, J.M.; Jacobsen, M.W.; Frydenberg, J.
Reduced representation genome sequencing such as restriction-site-associated DNA (RAD) sequencing is finding increased use to identify and genotype large numbers of single-nucleotide polymorphisms (SNPs) in model and nonmodel species. We generated a unique resource of novel SNP markers for the Eu...... 425 loci and 376 918 associated SNPs provides a valuable tool for future population genetics and genomics studies and allows for targeting specific genes and particularly interesting regions of the eel genome...
Full Text Available Abstract Background We conducted a genome-wide association study (GWAS and validation study for left ventricular (LV mass in the Family Blood Pressure Program – HyperGEN population. LV mass is a sensitive predictor of cardiovascular mortality and morbidity in all genders, races, and ages. Polymorphisms of candidate genes in diverse pathways have been associated with LV mass. However, subsequent studies have often failed to replicate these associations. Genome-wide association studies have unprecedented power to identify potential genes with modest effects on left LV mass. We describe here a GWAS for LV mass in Caucasians using the Affymetrix GeneChip Human Mapping 100 k Set. Cases (N = 101 and controls (N = 101 were selected from extreme tails of the LV mass index distribution from 906 individuals in the HyperGEN study. Eleven of 12 promising (Q Results Despite the relatively small sample, we identified 12 promising SNPs in the GWAS. Eleven SNPs were successfully genotyped in the validation study of 704 Caucasians and 1467 African Americans; 5 SNPs on chromosomes 5, 12, and 20 were significantly (P ≤ 0.05 associated with LV mass after correction for multiple testing. One SNP (rs756529 is intragenic within KCNB1, which is dephosphorylated by calcineurin, a previously reported candidate gene for LV hypertrophy within this population. Conclusion These findings suggest KCNB1 may be involved in the development of LV hypertrophy in humans.
Zhao, Zhenqing; Gu, Honghui; Sheng, Xiaoguang; Yu, Huifang; Wang, Jiansheng; Huang, Long; Wang, Dan
Molecular markers and genetic maps play an important role in plant genomics and breeding studies. Cauliflower is an important and distinctive vegetable; however, very few molecular resources have been reported for this species. In this study, a novel, specific-locus amplified fragment (SLAF) sequencing strategy was employed for large-scale single nucleotide polymorphism (SNP) discovery and high-density genetic map construction in a double-haploid, segregating population of cauliflower. A total of 12.47 Gb raw data containing 77.92 M pair-end reads were obtained after processing and 6815 polymorphic SLAFs between the two parents were detected. The average sequencing depths reached 52.66-fold for the female parent and 49.35-fold for the male parent. Subsequently, these polymorphic SLAFs were used to genotype the population and further filtered based on several criteria to construct a genetic linkage map of cauliflower. Finally, 1776 high-quality SLAF markers, including 2741 SNPs, constituted the linkage map with average data integrity of 95.68%. The final map spanned a total genetic length of 890.01 cM with an average marker interval of 0.50 cM, and covered 364.9 Mb of the reference genome. The markers and genetic map developed in this study could provide an important foundation not only for comparative genomics studies within Brassica oleracea species but also for quantitative trait loci identification and molecular breeding of cauliflower. PMID:27047515
Sahana, G; Guldbrandtsen, B; Thomsen, B; Holm, L-E; Panitz, F; Brøndum, R F; Bendixen, C; Lund, M S
Mastitis is a mammary disease that frequently affects dairy cattle. Despite considerable research on the development of effective prevention and treatment strategies, mastitis continues to be a significant issue in bovine veterinary medicine. To identify major genes that affect mastitis in dairy cattle, 6 chromosomal regions on Bos taurus autosome (BTA) 6, 13, 16, 19, and 20 were selected from a genome scan for 9 mastitis phenotypes using imputed high-density single nucleotide polymorphism arrays. Association analyses using sequence-level variants for the 6 targeted regions were carried out to map causal variants using whole-genome sequence data from 3 breeds. The quantitative trait loci (QTL) discovery population comprised 4,992 progeny-tested Holstein bulls, and QTL were confirmed in 4,442 Nordic Red and 1,126 Jersey cattle. The targeted regions were imputed to the sequence level. The highest association signal for clinical mastitis was observed on BTA 6 at 88.97 Mb in Holstein cattle and was confirmed in Nordic Red cattle. The peak association region on BTA 6 contained 2 genes: vitamin D-binding protein precursor (GC) and neuropeptide FF receptor 2 (NPFFR2), which, based on known biological functions, are good candidates for affecting mastitis. However, strong linkage disequilibrium in this region prevented conclusive determination of the causal gene. A different QTL on BTA 6 located at 88.32 Mb in Holstein cattle affected mastitis. In addition, QTL on BTA 13 and 19 were confirmed to segregate in Nordic Red cattle and QTL on BTA 16 and 20 were confirmed in Jersey cattle. Although several candidate genes were identified in these targeted regions, it was not possible to identify a gene or polymorphism as the causal factor for any of these regions. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Sebastiaan M Bol
Full Text Available HIV-1 infected macrophages play an important role in rendering resting T cells permissive for infection, in spreading HIV-1 to T cells, and in the pathogenesis of AIDS dementia. During highly active anti-retroviral treatment (HAART, macrophages keep producing virus because tissue penetration of antiretrovirals is suboptimal and the efficacy of some is reduced. Thus, to cure HIV-1 infection with antiretrovirals we will also need to efficiently inhibit viral replication in macrophages. The majority of the current drugs block the action of viral enzymes, whereas there is an abundance of yet unidentified host factors that could be targeted. We here present results from a genome-wide association study identifying novel genetic polymorphisms that affect in vitro HIV-1 replication in macrophages.Monocyte-derived macrophages from 393 blood donors were infected with HIV-1 and viral replication was determined using Gag p24 antigen levels. Genomic DNA from individuals with macrophages that had relatively low (n = 96 or high (n = 96 p24 production was used for SNP genotyping with the Illumina 610 Quad beadchip. A total of 494,656 SNPs that passed quality control were tested for association with HIV-1 replication in macrophages, using linear regression. We found a strong association between in vitro HIV-1 replication in monocyte-derived macrophages and SNP rs12483205 in DYRK1A (p = 2.16 × 10(-5. While the association was not genome-wide significant (p<1 × 10(-7, we could replicate this association using monocyte-derived macrophages from an independent group of 31 individuals (p = 0.0034. Combined analysis of the initial and replication cohort increased the strength of the association (p = 4.84 × 10(-6. In addition, we found this SNP to be associated with HIV-1 disease progression in vivo in two independent cohort studies (p = 0.035 and p = 0.0048.These findings suggest that the kinase DYRK1A is involved in the replication of HIV-1, in vitro in macrophages
Bol, Sebastiaan M.; Moerland, Perry D.; Limou, Sophie; van Remmerden, Yvonne; Coulonges, Cédric; van Manen, Daniëlle; Herbeck, Joshua T.; Fellay, Jacques; Sieberer, Margit; Sietzema, Jantine G.; van 't Slot, Ruben; Martinson, Jeremy; Zagury, Jean-François; Schuitemaker, Hanneke; van 't Wout, Angélique B.
Background HIV-1 infected macrophages play an important role in rendering resting T cells permissive for infection, in spreading HIV-1 to T cells, and in the pathogenesis of AIDS dementia. During highly active anti-retroviral treatment (HAART), macrophages keep producing virus because tissue penetration of antiretrovirals is suboptimal and the efficacy of some is reduced. Thus, to cure HIV-1 infection with antiretrovirals we will also need to efficiently inhibit viral replication in macrophages. The majority of the current drugs block the action of viral enzymes, whereas there is an abundance of yet unidentified host factors that could be targeted. We here present results from a genome-wide association study identifying novel genetic polymorphisms that affect in vitro HIV-1 replication in macrophages. Methodology/Principal Findings Monocyte-derived macrophages from 393 blood donors were infected with HIV-1 and viral replication was determined using Gag p24 antigen levels. Genomic DNA from individuals with macrophages that had relatively low (n = 96) or high (n = 96) p24 production was used for SNP genotyping with the Illumina 610 Quad beadchip. A total of 494,656 SNPs that passed quality control were tested for association with HIV-1 replication in macrophages, using linear regression. We found a strong association between in vitro HIV-1 replication in monocyte-derived macrophages and SNP rs12483205 in DYRK1A (p = 2.16×10−5). While the association was not genome-wide significant (p<1×10−7), we could replicate this association using monocyte-derived macrophages from an independent group of 31 individuals (p = 0.0034). Combined analysis of the initial and replication cohort increased the strength of the association (p = 4.84×10−6). In addition, we found this SNP to be associated with HIV-1 disease progression in vivo in two independent cohort studies (p = 0.035 and p = 0.0048). Conclusions/Significance These findings suggest that
Reddy, Umesh K; Nimmakayala, Padma; Levi, Amnon; Abburi, Venkata Lakshmi; Saminathan, Thangasamy; Tomason, Yan R; Vajja, Gopinath; Reddy, Rishi; Abburi, Lavanya; Wehner, Todd C; Ronin, Yefim; Karol, Abraham
We used genotyping by sequencing to identify a set of 10,480 single nucleotide polymorphism (SNP) markers for constructing a high-resolution genetic map of 1096 cM for watermelon. We assessed the genome-wide variation in recombination rate (GWRR) across the map and found an association between GWRR and genome-wide nucleotide diversity. Collinearity between the map and the genome-wide reference sequence for watermelon was studied to identify inconsistency and chromosome rearrangements. We assessed genome-wide nucleotide diversity, linkage disequilibrium (LD), and selective sweep for wild, semi-wild, and domesticated accessions of Citrullus lanatus var. lanatus to track signals of domestication. Principal component analysis combined with chromosome-wide phylogenetic study based on 1563 SNPs obtained after LD pruning with minor allele frequency of 0.05 resolved the differences between semi-wild and wild accessions as well as relationships among worldwide sweet watermelon. Population structure analysis revealed predominant ancestries for wild, semi-wild, and domesticated watermelons as well as admixture of various ancestries that were important for domestication. Sliding window analysis of Tajima's D across various chromosomes was used to resolve selective sweep. LD decay was estimated for various chromosomes. We identified a strong selective sweep on chromosome 3 consisting of important genes that might have had a role in sweet watermelon domestication. Copyright © 2014 Reddy et al.
Sep 25, 2012 ... Codeine, Tramadol, Acetaminophen. CYP2C9. Celecoxib .... Pharmacogenet- ics of acute azathioprine toxicity: relationship to thiopurine ... Martinez C, Cueto R,. Garcia-Martin E. Pharmacogenomics in drug induced liver.
Pujolar, J. M.; Jacobsen, M. W.; Als, Thomas Damm
Next-generation sequencing and the collection of genome-wide data allow identifying adaptive variation and footprints of directional selection. Using a large SNP data set from 259 RAD-sequenced European eel individuals (glass eels) from eight locations between 34 and 64oN, we examined the patterns...... of genome-wide genetic diversity across locations. We tested for local selection by searching for increased population differentiation using FST-based outlier tests and by testing for significant associations between allele frequencies and environmental variables. The overall low genetic differentiation...... with single-generation signatures of spatially varying selection acting on glass eels. After screening 50 354 SNPs, a total of 754 potentially locally selected SNPs were identified. Candidate genes for local selection constituted a wide array of functions, including calcium signalling, neuroactive ligand...
Full Text Available Nucleotide-binding site (NBS disease resistance genes play an important role in defending plants from a variety of pathogens and insect pests. Many R-genes have been identified in various plant species. However, little is known about the NBS-encoding genes in Brachypodium distachyon. In this study, using computational analysis of the B. distachyon genome, we identified 126 regular NBS-encoding genes and characterized them on the bases of structural diversity, conserved protein motifs, chromosomal locations, gene duplications, promoter region, and phylogenetic relationships. EST hits and full-length cDNA sequences (from Brachypodium database of 126 R-like candidates supported their existence. Based on the occurrence of conserved protein motifs such as coiled-coil (CC, NBS, leucine-rich repeat (LRR, these regular NBS-LRR genes were classified into four subgroups: CC-NBS-LRR, NBS-LRR, CC-NBS, and X-NBS. Further expression analysis of the regular NBS-encoding genes in Brachypodium database revealed that these genes are expressed in a wide range of libraries, including those constructed from various developmental stages, tissue types, and drought challenged or nonchallenged tissue.
Full Text Available Gene set analysis is a powerful tool for interpreting a genome-wide association study result and is gaining popularity these days. Comparison of the gene sets obtained for a variety of traits measured from a single genetic epidemiology dataset may give insights into the biological mechanisms underlying these traits. Based on the previously published single nucleotide polymorphism (SNP genotype data on 8,842 individuals enrolled in the Korea Association Resource project, we performed a series of systematic genome-wide association analyses for 49 quantitative traits of basic epidemiological, anthropometric, or blood chemistry parameters. Each analysis result was subjected to subsequent gene set analyses based on Gene Ontology (GO terms using gene set analysis software, GSA-SNP, identifying a set of GO terms significantly associated to each trait (pcorr < 0.05. Pairwise comparison of the traits in terms of the semantic similarity in their GO sets revealed surprising cases where phenotypically uncorrelated traits showed high similarity in terms of biological pathways. For example, the pH level was related to 7 other traits that showed low phenotypic correlations with it. A literature survey implies that these traits may be regulated partly by common pathways that involve neuronal or nerve systems.
Marques, Catarina A; Dickens, Nicholas J; Paape, Daniel; Campbell, Samantha J; McCulloch, Richard
DNA replication initiates on defined genome sites, termed origins. Origin usage appears to follow common rules in the eukaryotic organisms examined to date: all chromosomes are replicated from multiple origins, which display variations in firing efficiency and are selected from a larger pool of potential origins. To ask if these features of DNA replication are true of all eukaryotes, we describe genome-wide origin mapping in the parasite Leishmania. Origin mapping in Leishmania suggests a striking divergence in origin usage relative to characterized eukaryotes, since each chromosome appears to be replicated from a single origin. By comparing two species of Leishmania, we find evidence that such origin singularity is maintained in the face of chromosome fusion or fission events during evolution. Mapping Leishmania origins suggests that all origins fire with equal efficiency, and that the genomic sites occupied by origins differ from related non-origins sites. Finally, we provide evidence that origin location in Leishmania displays striking conservation with Trypanosoma brucei, despite the latter parasite replicating its chromosomes from multiple, variable strength origins. The demonstration of chromosome replication for a single origin in Leishmania, a microbial eukaryote, has implications for the evolution of origin multiplicity and associated controls, and may explain the pervasive aneuploidy that characterizes Leishmania chromosome architecture.
Levinson, Douglas F; Shi, Jianxin; Wang, Kai
The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs).......The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs)....
Børsting, Claus; Pereira, Vania; Andersen, Jeppe Dyrberg
Single nucleotide polymorphisms (SNPs) are the most frequent DNA sequence variations in the genome. They have been studied extensively in the last decade with various purposes in mind. In this chapter, we will discuss the advantages and disadvantages of using SNPs for human identification...... of SNPs. This will allow acquisition of more information from the sample materials and open up for new possibilities as well as new challenges....
Shankaranarayanan, P.; Mendoza-Parra, M.A.; Gool, van W.; Trindade, L.M.; Gronemeyer, H.
Linear amplification of DNA (LinDA) by T7 polymerase is a versatile and robust method for generating sufficient amounts of DNA for genome-wide studies with minute amounts of cells. LinDA can be coupled to a great number of global profiling technologies. Indeed, chromatin immunoprecipitation coupled
Hjortland, Geir Olav; Fodstad, Oystein; Smeland, Sigbjorn; Hovig, Eivind; Meza-Zepeda, Leonardo A; Beiske, Klaus; Ree, Anne H; Tveito, Siri; Hoifodt, Hanne; Bohler, Per J; Hole, Knut H; Myklebost, Ola
Metastatic progression due to development or enrichment of therapy-resistant tumor cells is eventually lethal. Molecular characterization of such chemotherapy resistant tumor cell clones may identify markers responsible for malignant progression and potential targets for new treatment. Here, in a case of stage IV adenocarcinoma of the gastroesophageal junction, we report the successful genome wide analysis using array comparative genomic hybridization (CGH) of DNA from only fourteen tumor cells using a bead-based single cell selection method from a bone metastasis progressing during chemotherapy. In a case of metastatic adenocarcinoma of the gastroesophageal junction, the progression of bone metastasis was observed during a chemotherapy regimen of epirubicin, oxaliplatin and capecitabine, whereas lung-, liver and lymph node metastases as well as the primary tumor were regressing. A bone marrow aspirate sampled at the site of progressing metastasis in the right iliac bone was performed, and single cell molecular analysis using array-CGH of Epithelial Specific Antigen (ESA)-positive metastatic cells, and revealed two distinct regions of amplification, 12p12.1 and 17q12-q21.2 amplicons, containing the KRAS (12p) and ERBB2 (HER2/NEU) (17q) oncogenes. Further intrapatient tumor heterogeneity of these highlighted gene copy number changes was analyzed by fluorescence in situ hybridization (FISH) in all available primary and metastatic tumor biopsies, and ErbB2 protein expression was investigated by immunohistochemistry. ERBB2 was heterogeneously amplified by FISH analysis in the primary tumor, as well as liver and bone metastasis, but homogenously amplified in biopsy specimens from a progressing bone metastasis after three initial cycles of chemotherapy, indicating a possible enrichment of erbB2 positive tumor cells in the progressing bone marrow metastasis during chemotherapy. A similar amplification profile was detected for wild-type KRAS, although more heterogeneously
Full Text Available Plants have evolved an elaborate innate immune system against invading pathogens. Within this system, intracellular nucleotide-binding leucine-rich repeat (NLR immune receptors are known play critical roles in effector-triggered immunity (ETI plant defense. We performed genome-wide identification and classification of NLR-coding sequences from the genomes of pepper, tomato, and potato using fixed criteria. We then compared genomic duplication and evolution features. We identified intact 267, 443, and 755 NLR-encoding genes in tomato, potato, and pepper genomes, respectively. Phylogenetic analyses and classification of Solanaceae NLRs revealed that the majority of NLR super family members fell into 14 subgroups, including a TIR-NLR (TNL subgroup and 13 non-TNL subgroups. Specific subgroups have expanded in each genome, with the expansion in pepper showing subgroup-specific physical clusters. Comparative analysis of duplications showed distinct duplication patterns within pepper and among Solanaceae plants suggesting subgroup- or species-specific gene duplication events after speciation, resulting in divergent evolution. Taken together, genome-wide analyses of NLR family members provide insights into their evolutionary history in Solanaceae. These findings also provide important foundational knowledge for understanding NLR evolution and will empower broader characterization of disease resistance genes to be used for crop breeding.
Grønlund, Hugo Ahlm; Moen, Birgitte; Hoorfar, Jeffrey
A major challenge with single-nucleotide polymorphism (SNP) fingerprinting of bacteria and higher organisms is the combination of genome-wide screenings with the potential of multiplexing and accurate SNP detection. Single-nucleotide extension by the minisequencing principle represents a technolo...
Full Text Available The establishment of DNA methylation patterns in oocytes is a highly dynamic process marking gene-regulatory events during fertilization, embryonic development, and adulthood. However, after epigenetic reprogramming in primordial germ cells, how and when DNA methylation is re-established in developing human oocytes remains to be characterized. Here, using single-cell whole-genome bisulfite sequencing, we describe DNA methylation patterns in three different maturation stages of human oocytes. We found that while broad-scale patterns of CpG methylation have been largely established by the immature germinal vesicle stage, localized changes continue into later development. Non-CpG methylation, on the other hand, undergoes a large-scale, generalized remodeling through the final stage of maturation, with the net overall result being the accumulation of methylation as oocytes mature. The role of the genome-wide, non-CpG methylation remodeling in the final stage of oocyte maturation deserves further investigation.
Apr 1, 2010 ... Genome-wide association studies (GWAS) examine the entire human genome with the goal of identifying genetic variants. (usually single nucleotide polymorphisms (SNPs)) that are associated with phenotypic traits such as disease status and drug response. The discordance of significantly associated ...
Full Text Available Cyclic nucleotide gated channels (CNGCs play multifaceted roles in plants, particularly with respect to signaling processes associated with abiotic stress signaling and during host-pathogen interactions. Despite key roles during plant survival and response to environment, little is known about the activity and function of CNGC family in common wheat (Triticum aestivum L., a key stable food around the globe. In this study, we performed a genome-wide identification of CNGC family in wheat and identified a total 47 TaCNGCs in wheat, classifying these genes into four major groups (I–IV with two sub-groups (IVa and IVb. Sequence analysis revealed the presence of several conserved motifs, including a phosphate binding cassette (PBC and a “hinge” region, both of which have been hypothesized to be critical for the function of wheat CNGCs. During wheat infection with Pst, the transcript levels of TaCNGC14 and TaCNGC16, both members of group IVb, showed significant induction during a compatible interaction, while a reduction in gene expression was observed in incompatible interactions. In addition, TaCNGC14 and TaCNGC16 mRNA accumulation was significantly influenced by exogenously applied hormones, including abscisic acid (ABA, methyl jasmonate (MeJA, and salicylic acid (SA, suggesting a role in hormone signaling and/or perception. Silencing of TaCNGC14 and TaCNGC16 limited Pst growth and increased wheat resistance against Pst. The results presented herein contribute to our understanding of the wheat CNGC gene family and the mechanism of TaCNGCs signaling during wheat-Pst interaction.
Full Text Available Tumour cellularity, the relative proportion of tumour and normal cells in a sample, affects the sensitivity of mutation detection, copy number analysis, cancer gene expression and methylation profiling. Tumour cellularity is traditionally estimated by pathological review of sectioned specimens; however this method is both subjective and prone to error due to heterogeneity within lesions and cellularity differences between the sample viewed during pathological review and tissue used for research purposes. In this paper we describe a statistical model to estimate tumour cellularity from SNP array profiles of paired tumour and normal samples using shifts in SNP allele frequency at regions of loss of heterozygosity (LOH in the tumour. We also provide qpure, a software implementation of the method. Our experiments showed that there is a medium correlation 0.42 ([Formula: see text]-value=0.0001 between tumor cellularity estimated by qpure and pathology review. Interestingly there is a high correlation 0.87 ([Formula: see text]-value [Formula: see text] 2.2e-16 between cellularity estimates by qpure and deep Ion Torrent sequencing of known somatic KRAS mutations; and a weaker correlation 0.32 ([Formula: see text]-value=0.004 between IonTorrent sequencing and pathology review. This suggests that qpure may be a more accurate predictor of tumour cellularity than pathology review. qpure can be downloaded from https://sourceforge.net/projects/qpure/.
Avhashoni AA. Zwane
Sep 20, 2016 ... ... high temperatures and low-quality grass and for their resistance to ... growth rate, early marketability, grazing performance and good ... development of the Bonsmara breed (Scholtz, 2010). ..... transferability to water buffalo.
Bernatsky, Sasha; Velásquez García, Héctor A; Spinelli, John; Gaffney, Patrick; Smedby, Karin E; Ramsey-Goldman, Rosalind; Wang, Sophia S.; Adami, Hans-Olov; Albanes, Demetrius; Angelucci, Emanuele; Ansell, Stephen M.; Asmann, Yan W.; Becker, Nikolaus; Benavente, Yolanda; Berndt, Sonja I.; Bertrand, Kimberly A.; Birmann, Brenda M.; Boeing, Heiner; Boffetta, Paolo; Bracci, Paige M.; Brennan, Paul; Brooks-Wilson, Angela R.; Cerhan, James R.; Chanock, Stephen J.; Clavel, Jacqueline; Conde, Lucia; Cotenbader, Karen H; Cox, David G; Cozen, Wendy; Crouch, Simon; De Roos, Anneclaire J.; De Sanjose, Silvia; Di Lollo, Simonetta; Diver, W. Ryan; Dogan, Ahmet; Foretova, Lenka; Ghesquières, Hervé; Giles, Graham G.; Glimelius, Bengt; Habermann, Thomas M.; Haioun, Corinne; Hartge, Patricia; Hjalgrim, Henrik; Holford, Theodore R.; Holly, Elizabeth A.; Jackson, Rebecca D.; Kaaks, Rudolph; Kane, Eleanor; Kelly, Rachel S.; Klein, Robert J.; Kraft, Peter; Kricker, Anne; Lan, Qing; Lawrence, Charles; Liebow, Mark; Lightfoot, Tracy; Link, Brian K.; Maynadie, Marc; McKay, James; Melbye, Mads; Molina, Thierry Jo; Monnereau, Alain; Morton, Lindsay M.; Nieters, Alexandra; North, Kari E.; Novak, Anne J.; Offit, Kenneth; Purdue, Mark P.; Rais, Marco; Riby, Jacques; Roman, Eve; Rothman, Nathaniel; Salles, Gilles; Severi, Gianluca; Severson, Richard K.; Skibola, Christine F.; Slager, Susan L.; Smith, Alex; Smith, Martyn T.; Southey, Melissa C.; Staines, Anthony; Teras, Lauren R.; Thompson, Carrie A.; Tilly, Hervé; Tinker, Lesley F.; Tjonneland, Anne; Turner, Jenny; Vajdic, Claire M.; Vermeulen, Roel C H; Vijai, Joseph; Vineis, Paolo; Virtamo, Jarmo; Wang, Zhaoming; Weinstein, Stephanie; Witzig, Thomas E.; Zelenetz, Andrew; Zeleniuch-Jacquotte, Anne; Zhang, Yawei; Zheng, Tongzhang; Zucca, Mariagrazia; Clarke, Ann E
Objective: Determinants of the increased risk of diffuse large B-cell lymphoma (DLBCL) in SLE are unclear. Using data from a recent lymphoma genome-wide association study (GWAS), we assessed whether certain lupus-related single nucleotide polymorphisms (SNPs) were also associated with DLBCL.
Deelen, Joris; Beekman, Marian; Uh, Hae-Won
By studying the loci which contribute to human longevity, we aim to identify mechanisms that contribute to healthy aging. To identify such loci, we performed a genome-wide association study (GWAS) comparing 403 unrelated nonagenarians from long-living families included in the Leiden Longevity Stu...
Davila Olivas, Nelson H.; Kruijer, Willem; Gort, Gerrit; Wijnen, Cris L.; Loon, van Joop J.A.; Dicke, Marcel
Plants are commonly exposed to abiotic and biotic stresses. We used 350 Arabidopsis thaliana accessions grown under controlled conditions. We employed genome-wide association analysis to investigate the genetic architecture and underlying loci involved in genetic variation in resistance to: two
Liang, Jingjing; Le, Thu H.; Edwards, Digna R. Velez; Tayo, Bamidele O.; Gaulton, Kyle J.; Smith, Jennifer A.; Lu, Yingchang; Jensen, Richard A.; Chen, Guanjie; Yanek, Lisa R.; Schwander, Karen; Tajuddin, Salman M.; Sofer, Tamar; Kim, Wonji; Kayima, James
© 2017 Public Library of Science. All Rights Reserved. Hypertension is a leading cause of global disease, mortality, and disability. While individuals of African descent suffer a disproportionate burden of hypertension and its complications, they have been underrepresented in genetic studies. To identify novel susceptibility loci for blood pressure and hypertension in people of African ancestry, we performed both single and multiple-trait genome-wide association analyses. We analyzed 21 genom...
Full Text Available Hypertension is a leading cause of global disease, mortality, and disability. While individuals of African descent suffer a disproportionate burden of hypertension and its complications, they have been underrepresented in genetic studies. To identify novel susceptibility loci for blood pressure and hypertension in people of African ancestry, we performed both single and multiple-trait genome-wide association analyses. We analyzed 21 genome-wide association studies comprised of 31,968 individuals of African ancestry, and validated our results with additional 54,395 individuals from multi-ethnic studies. These analyses identified nine loci with eleven independent variants which reached genome-wide significance (P < 1.25×10-8 for either systolic and diastolic blood pressure, hypertension, or for combined traits. Single-trait analyses identified two loci (TARID/TCF21 and LLPH/TMBIM4 and multiple-trait analyses identified one novel locus (FRMD3 for blood pressure. At these three loci, as well as at GRP20/CDH17, associated variants had alleles common only in African-ancestry populations. Functional annotation showed enrichment for genes expressed in immune and kidney cells, as well as in heart and vascular cells/tissues. Experiments driven by these findings and using angiotensin-II induced hypertension in mice showed altered kidney mRNA expression of six genes, suggesting their potential role in hypertension. Our study provides new evidence for genes related to hypertension susceptibility, and the need to study African-ancestry populations in order to identify biologic factors contributing to hypertension.
Curran, S.; Bolton, P.; Rozsnyai, K.; Chiocchetti, A.; Klauck, S.M.; Duketis, E.; Poustka, F.; Schlitt, S.; Freitag, C.M.; Lee, I. van der; Muglia, P.; Poot, M.; Staal, W.G.; Jonge, M.V. de; Ophoff, R.A.; Lewis, C.; Skuse, D.; Mandy, W.; Vassos, E.; Fossdal, R.; Magnusson, P.; Hreidarsson, S.; Saemundsen, E.; Stefansson, H.; Stefansson, K.; Collier, D.
The Autism Genome Project (AGP) Consortium recently reported genome-wide significant association between autism and an intronic single nucleotide polymorphism marker, rs4141463, within the MACROD2 gene. In the present study we attempted to replicate this finding using an independent case-control
Milne, Roger L; Benítez, Javier; Nevanlinna, Heli
BACKGROUND: A recent genome-wide association study identified single-nucleotide polymorphism (SNP) 2q35-rs13387042 as a marker of susceptibility to estrogen receptor (ER)-positive breast cancer. We attempted to confirm this association using the Breast Cancer Association Consortium. METHODS: 2q35...
Krintel, Sophine B; Palermo, Giuseppe; Johansen, Julia S
Recently, two genome-wide association studies identified single nucleotide polymorphisms (SNPs) significantly associated with the treatment response to tumor necrosis factor α (TNFα) inhibitors in patients with rheumatoid arthritis (RA). We aimed to replicate these results and identify SNPs and t...
Sharma, Ranu; Rawat, Vimal; Suresh, C G
The nucleotide binding site-leucine rich repeat (NBS-LRR) proteins play an important role in the defense mechanisms against pathogens. Using bioinformatics approach, we identified and annotated 104 NBS-LRR genes in chickpea. Phylogenetic analysis points to their diversification into two families namely TIR-NBS-LRR and non-TIR-NBS-LRR. Gene architecture revealed intron gain/loss events in this resistance gene family during their independent evolution into two families. Comparative genomics analysis elucidated its evolutionary relationship with other fabaceae species. Around 50% NBS-LRRs reside in macro-syntenic blocks underlining positional conservation along with sequence conservation of NBS-LRR genes in chickpea. Transcriptome sequencing data provided evidence for their transcription and tissue-specific expression. Four cis -regulatory elements namely WBOX, DRE, CBF, and GCC boxes, that commonly occur in resistance genes, were present in the promoter regions of these genes. Further, the findings will provide a strong background to use candidate disease resistance NBS-encoding genes and identify their specific roles in chickpea.
Khong, Jwu Jin; Burdon, Kathryn P; Lu, Yi; Laurie, Kate; Leonardos, Lefta; Baird, Paul N; Sahebjada, Srujana; Walsh, John P; Gajdatsy, Adam; Ebeling, Peter R; Hamblin, Peter Shane; Wong, Rosemary; Forehan, Simon P; Fourlanos, Spiros; Roberts, Anthony P; Doogue, Matthew; Selva, Dinesh; Montgomery, Grant W; Macgregor, Stuart; Craig, Jamie E
Graves' disease is an autoimmune thyroid disease of complex inheritance. Multiple genetic susceptibility loci are thought to be involved in Graves' disease and it is therefore likely that these can be identified by genome wide association studies. This study aimed to determine if a genome wide association study, using a pooling methodology, could detect genomic loci associated with Graves' disease. Nineteen of the top ranking single nucleotide polymorphisms including HLA-DQA1 and C6orf10, were clustered within the Major Histo-compatibility Complex region on chromosome 6p21, with rs1613056 reaching genome wide significance (p = 5 × 10 -8 ). Technical validation of top ranking non-Major Histo-compatablity complex single nucleotide polymorphisms with individual genotyping in the discovery cohort revealed four single nucleotide polymorphisms with p ≤ 10 -4 . Rs17676303 on chromosome 1q23.1, located upstream of FCRL3, showed evidence of association with Graves' disease across the discovery, replication and combined cohorts. A second single nucleotide polymorphism rs9644119 downstream of DPYSL2 showed some evidence of association supported by finding in the replication cohort that warrants further study. Pooled genome wide association study identified a genetic variant upstream of FCRL3 as a susceptibility locus for Graves' disease in addition to those identified in the Major Histo-compatibility Complex. A second locus downstream of DPYSL2 is potentially a novel genetic variant in Graves' disease that requires further confirmation.
Lu, Y.; Chen, X.; Beesley, J.; Johnatty, S.E.; Defazio, A.; Lambrechts, S.; Lambrechts, D.; Despierre, E.; Vergotes, I.; Chang-Claude, J.; Hein, R.; Nickels, S.; Wang-Gohrke, S.; Dork, T.; Durst, M.; Antonenkova, N.; Bogdanova, N.; Goodman, M.T.; Lurie, G.; Wilkens, L.R.; Carney, M.E.; Butzow, R.; Nevanlinna, H.; Heikkinen, T.; Leminen, A.; Kiemeney, L.A.L.M.; Massuger, L.F.A.G.; Altena, A.M. van; Aben, K.K.H.; Kjaer, S.K.; Hogdall, E.; Jensen, A.; Brooks-Wilson, A.; Le, N.; Cook, L.; Earp, M.; Kelemen, L.; Easton, D.; Pharoah, P.; Song, H.; Tyrer, J.; Ramus, S.; Menon, U.; Gentry-Maharaj, A.; Gayther, S.A.; Bandera, E.V.; Olson, S.H.; Orlow, I.; Rodriguez-Rodriguez, L.; MacGregor, S.; Chenevix-Trench, G.
Recent Genome-Wide Association Studies (GWAS) have identified four low-penetrance ovarian cancer susceptibility loci. We hypothesized that further moderate- or low-penetrance variants exist among the subset of single-nucleotide polymorphisms (SNPs) not well tagged by the genotyping arrays used in
Nivard, Michel G.; Middeldorp, Christel M.; Lubke, Gitta; Hottenga, Jouke-Jan; Abdellaoui, Abdel; Boomsma, Dorret I.; Dolan, Conor V.
Heritability may be estimated using phenotypic data collected in relatives or in distantly related individuals using genome-wide single nucleotide polymorphism (SNP) data. We combined these approaches by re-parameterizing the model proposed by Zaitlen et al and extended this model to include
Hara, Kazuo; Fujita, Hayato; Johnson, Todd A
Although over 60 loci for type 2 diabetes (T2D) have been identified, there still remains a large genetic component to be clarified. To explore unidentified loci for T2D, we performed a genome-wide association study (GWAS) of 6 209 637 single-nucleotide polymorphisms (SNPs), which were directly g...
Loo, Sandra K.; Shtir, Corina; Doyle, Alysa E.; Mick, Eric; McGough, James J.; McCracken, James; Biederman, Joseph; Smalley, Susan L.; Cantor, Rita M.; Faraone, Stephen V.; Nelson, Stanley F.
Objective: The purpose of the present study was to identify common genetic variants that are associated with human intelligence or general cognitive ability. Method: We performed a genome-wide association analysis with a dense set of 1 million single-nucleotide polymorphisms (SNPs) and quantitative intelligence scores within an ancestrally…
Ulianov, Sergey V; Tachibana-Konwalski, Kikue; Razin, Sergey V
Recent years have witnessed an explosion of the single-cell biochemical toolbox including chromosome conformation capture (3C)-based methods that provide novel insights into chromatin spatial organization in individual cells. The observations made with these techniques revealed that topologically associating domains emerge from cell population averages and do not exist as static structures in individual cells. Stochastic nature of the genome folding is likely to be biologically relevant and may reflect the ability of chromatin fibers to adopt a number of alternative configurations, some of which could be transiently stabilized and serve regulatory purposes. Single-cell Hi-C approaches provide an opportunity to analyze chromatin folding in rare cell types such as stem cells, tumor progenitors, oocytes, and totipotent cells, contributing to a deeper understanding of basic mechanisms in development and disease. Here, we review key findings of single-cell Hi-C and discuss possible biological reasons and consequences of the inferred dynamic chromatin spatial organization. © 2017 WILEY Periodicals, Inc.
Full Text Available Intramuscular fat (IMF content and fatty acid composition affect the organoleptic quality and nutritional value of pork. A genome-wide association study was performed on 138 Duroc pigs genotyped with a 60k SNP chip to detect biologically relevant genomic variants influencing fat content and composition. Despite the limited sample size, the genome-wide association study was powerful enough to detect the association between fatty acid composition and a known haplotypic variant in SCD (SSC14 and to reveal an association of IMF and fatty acid composition in the LEPR region (SSC6. The association of LEPR was later validated with an independent set of 853 pigs using a candidate quantitative trait nucleotide. The SCD gene is responsible for the biosynthesis of oleic acid (C18:1 from stearic acid. This locus affected the stearic to oleic desaturation index (C18:1/C18:0, C18:1, and saturated (SFA and monounsaturated (MUFA fatty acids content. These effects were consistently detected in gluteus medius, longissimus dorsi, and subcutaneous fat. The association of LEPR with fatty acid composition was detected only in muscle and was, at least in part, a consequence of its effect on IMF content, with increased IMF resulting in more SFA, less polyunsaturated fatty acids (PUFA, and greater SFA/PUFA ratio. Marker substitution effects estimated with a subset of 65 animals were used to predict the genomic estimated breeding values of 70 animals born 7 years later. Although predictions with the whole SNP chip information were in relatively high correlation with observed SFA, MUFA, and C18:1/C18:0 (0.48-0.60, IMF content and composition were in general better predicted by using only SNPs at the SCD and LEPR loci, in which case the correlation between predicted and observed values was in the range of 0.36 to 0.54 for all traits. Results indicate that markers in the SCD and LEPR genes can be useful to select for optimum fatty acid profiles of pork.
McManus, I C; Davison, Angus; Armour, John A L
Right- and left-handedness run in families, show greater concordance in monozygotic than dizygotic twins, and are well described by single-locus Mendelian models. Here we summarize a large genome-wide association study (GWAS) that finds no significant associations with handedness and is consistent with a meta-analysis of GWASs. The GWAS had 99% power to detect a single locus using the conventional criterion of P < 5 × 10(-8) for the single locus models of McManus and Annett. The strong conclusion is that handedness is not controlled by a single genetic locus. A consideration of the genetic architecture of height, primary ciliary dyskinesia, and intelligence suggests that handedness inheritance can be explained by a multilocus variant of the McManus DC model, classical effects on family and twins being barely distinguishable from the single locus model. Based on the ENGAGE meta-analysis of GWASs, we estimate at least 40 loci are involved in determining handedness. © 2013 New York Academy of Sciences.
Binsbergen, van R.; Veerkamp, R.F.; Calus, M.P.L.
The correlated responses between traits may differ depending on the makeup of genetic covariances, and may differ from the predictions of polygenic covariances Therefore, the objective of the present study was to investigate the makeup of the genetic covariances between the well-studied traits: milk
Tenghe, A.M.M.; Bouwman, A.C.; Berglund, B.; Strandberg, E.; Koning, de D.J.; Veerkamp, R.F.
Endocrine fertility traits, which are defined from progesterone concentration levels in milk, are interesting indicators of dairy cow fertility because they more directly reflect the cows own reproductive physiology than classical fertility traits, which are more biased by farm management
van den Berg, Stéphanie M; de Moor, Marleen H M; Verweij, K. J. H.
small sample sizes of those studies. Here, we report on a large meta-analysis of GWA studies for extraversion in 63,030 subjects in 29 cohorts. Extraversion item data from multiple personality inventories were harmonized across inventories and cohorts. No genome-wide significant associations were found...... at the single nucleotide polymorphism (SNP) level but there was one significant hit at the gene level for a long non-coding RNA site (LOC101928162). Genome-wide complex trait analysis in two large cohorts showed that the additive variance explained by common SNPs was not significantly different from zero...
Chad W MacPherson
Full Text Available Genome-wide transcriptional analysis in intestinal epithelial cells (IEC can aid in elucidating the impact of single versus multi-strain probiotic combinations on immunological and cellular mechanisms of action. In this study we used human expression microarray chips in an in vitro intestinal epithelial cell model to investigate the impact of three probiotic bacteria, Lactobacillus helveticus R0052 (Lh-R0052, Bifidobacterium longum subsp. infantis R0033 (Bl-R0033 and Bifidobacterium bifidum R0071 (Bb-R0071 individually and in combination, and of a surface-layer protein (SLP purified from Lh-R0052, on HT-29 cells' transcriptional profile to poly(I:C-induced inflammation. Hierarchical heat map clustering, Set Distiller and String analyses revealed that the effects of Lh-R0052 and Bb-R0071 diverged from those of Bl-R0033 and Lh-R0052-SLP. It was evident from the global analyses with respect to the immune, cellular and homeostasis related pathways that the co-challenge with probiotic combination (PC vastly differed in its effect from the single strains and Lh-R0052-SLP treatments. The multi-strain PC resulted in a greater reduction of modulated genes, found through functional connections between immune and cellular pathways. Cytokine and chemokine analyses based on specific outcomes from the TNF-α and NF-κB signaling pathways revealed single, multi-strain and Lh-R0052-SLP specific attenuation of the majority of proteins measured (TNF-α, IL-8, CXCL1, CXCL2 and CXCL10, indicating potentially different mechanisms. These findings indicate a synergistic effect of the bacterial combinations relative to the single strain and Lh-R0052-SLP treatments in resolving toll-like receptor 3 (TLR3-induced inflammation in IEC and maintaining cellular homeostasis, reinforcing the rationale for using multi-strain formulations as a probiotic.
Rembeck, Karolina; Alsiö, Asa; Christensen, Peer Brehm
Recently, several genome-wide association studies have revealed that single nucleotide polymorphisms (SNPs) in proximity to IL28B predict spontaneous clearance of HCV infection as well as outcome following peginterferon and ribavirin therapy among HCV genotype 1 infected patients. The present stu...
Tincher, Clayton; Long, Hongan; Behringer, Megan; Walker, Noah; Lynch, Michael
Mutations induced by pollutants may promote pathogen evolution, for example by accelerating mutations conferring antibiotic resistance. Generally, evaluating the genome-wide mutagenic effects of long-term sublethal pollutant exposure at single-nucleotide resolution is extremely difficult. To overcome this technical barrier, we use the mutation accumulation/whole-genome sequencing (MA/WGS) method as a mutagenicity test, to quantitatively evaluate genome-wide mutagenesis of Escherichia coli after long-term exposure to a wide gradient of the glyphosate-based herbicide (GBH) Roundup Concentrate Plus. The genome-wide mutation rate decreases as GBH concentration increases, suggesting that even long-term GBH exposure does not compromise the genome stability of bacteria. Copyright © 2017 Tincher et al.
Krapohl, E; Plomin, R
One of the best predictors of children's educational achievement is their family's socioeconomic status (SES), but the degree to which this association is genetically mediated remains unclear. For 3000 UK-representative unrelated children we found that genome-wide single-nucleotide polymorphisms could explain a third of the variance of scores on an age-16 UK national examination of educational achievement and half of the correlation between their scores and family SES. Moreover, genome-wide polygenic scores based on a previously published genome-wide association meta-analysis of total number of years in education accounted for ~3.0% variance in educational achievement and ~2.5% in family SES. This study provides the first molecular evidence for substantial genetic influence on differences in children's educational achievement and its association with family SES.
Full Text Available The purpose of this study was to compare results obtained from various methodologies for genome-wide association studies, when applied to real data, in terms of number and commonality of regions identified and their genetic variance explained, computational speed, and possible pitfalls in interpretations of results. Methodologies include: two iteratively reweighted single-step genomic BLUP procedures (ssGWAS1 and ssGWAS2, a single-marker model (CGWAS, and BayesB. The ssGWAS methods utilize genomic breeding values (GEBVs based on combined pedigree, genomic and phenotypic information, while CGWAS and BayesB only utilize phenotypes from genotyped animals or pseudo-phenotypes. In this study, ssGWAS was performed by converting GEBVs to SNP marker effects. Unequal variances for markers were incorporated for calculating weights into a new genomic relationship matrix. SNP weights were refined iteratively. The data was body weight at 6 weeks on 274,776 broiler chickens, of which 4553 were genotyped using a 60k SNP chip. Comparison of genomic regions was based on genetic variances explained by local SNP regions (20 SNPs. After 3 iterations, the noise was greatly reduced of ssGWAS1 and results are similar to that of CGWAS, with 4 out of the top 10 regions in common. In contrast, for BayesB, the plot was dominated by a single region explaining 23.1% of the genetic variance. This same region was found by ssGWAS1 with the same rank, but the amount of genetic variation attributed to the region was only 3%. These finding emphasize the need for caution when comparing and interpreting results from various methods, and highlight that detected associations, and strength of association, strongly depends on methodologies and details of implementations. BayesB appears to overly shrink regions to zero, while overestimating the amount of genetic variation attributed to the remaining SNP effects. The real world is most likely a compromise between methods and remains to
Jiao, Hong; Arner, Peter; Hoffstedt, Johan
Recent genome-wide association (GWA) analyses have identified common single nucleotide polymorphisms (SNPs) that are associated with obesity. However, the reported genetic variation in obesity explains only a minor fraction of the total genetic variation expected to be present in the population....... Thus many genetic variants controlling obesity remain to be identified. The aim of this study was to use GWA followed by multiple stepwise validations to identify additional genes associated with obesity....
Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J
The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53-3.14), P=1.9 × 10(-5)). Two polymorphisms at 6p21.2 LINC00951-LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37-1.85), P=1.6 × 10(-9)) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder.
Fall, Tove; Ingelsson, Erik
Until just a few years ago, the genetic determinants of obesity and metabolic syndrome were largely unknown, with the exception of a few forms of monogenic extreme obesity. Since genome-wide association studies (GWAS) became available, large advances have been made. The first single nucleotide polymorphism robustly associated with increased body mass index (BMI) was in 2007 mapped to a gene with for the time unknown function. This gene, now known as fat mass and obesity associated (FTO) has been repeatedly replicated in several ethnicities and is affecting obesity by regulating appetite. Since the first report from a GWAS of obesity, an increasing number of markers have been shown to be associated with BMI, other measures of obesity or fat distribution and metabolic syndrome. This systematic review of obesity GWAS will summarize genome-wide significant findings for obesity and metabolic syndrome and briefly give a few suggestions of what is to be expected in the next few years. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Walter, Stefan; Atzmon, Gil; Demerath, Ellen W; Garcia, Melissa E; Kaplan, Robert C; Kumari, Meena; Lunetta, Kathryn L; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J; Biffar, Reiner; Buchman, Aron S; Boerwinkle, Eric; Couper, David; De Jager, Philip L; Evans, Denis A; Harris, Tamara B; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J; Lohman, Kurt K; Lutsey, Pamela L; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M; Reiman, Eric M; Rotter, Jerome I; Seshadri, Sudha; Shardell, Michelle D; Smith, Albert V; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M Carola; Bandinelli, Stefania; Baumeister, Sebastian E; Bennett, David A; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M; Newman, Anne B; Tiemeier, Henning; Franceschini, Nora
Human longevity and healthy aging show moderate heritability (20%-50%). We conducted a meta-analysis of genome-wide association studies from 9 studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for 2 outcomes: (1) all-cause mortality, and (2) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10(-8)). We found 14 independent SNPs that predicted risk of death, and 8 SNPs that predicted event-free survival (p < 10(-5)). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer's disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity. Copyright © 2011 Elsevier Inc. All rights reserved.
M. Ilyas Kamboh
Full Text Available Background. The persistent presence of antiphospholipid antibodies (APA may lead to the development of primary or secondary antiphospholipid syndrome. Although the genetic basis of APA has been suggested, the identity of the underlying genes is largely unknown. In this study, we have performed a genome-wide association study (GWAS in an effort to identify susceptibility loci/genes for three main APA: anticardiolipin antibodies (ACL, lupus anticoagulant (LAC, and anti-β2 glycoprotein I antibodies (anti-β2GPI. Methods. DNA samples were genotyped using the Affymetrix 6.0 array containing 906,600 single-nucleotide polymorphisms (SNPs. Association of SNPs with the antibody status (positive/negative was tested using logistic regression under the additive model. Results. We have identified a number of suggestive novel loci with P
Chen, Gary K; Zheng, Tian; Witte, John S; Goode, Ellen L; Gao, Lei; Hu, Pingzhao; Suh, Young Ju; Suktitipat, Bhoom; Szymczak, Silke; Woo, Jung Hoon; Zhang, Wei
A number of issues arise when analyzing the large amount of data from high-throughput genotype and expression microarray experiments, including design and interpretation of genome-wide association studies of expression phenotypes. These issues were considered by contributions submitted to Group 1 of the Genetic Analysis Workshop 15 (GAW15), which focused on the association of quantitative expression data. These contributions evaluated diverse hypotheses, including those relevant to cancer and obesity research, and used various analytic techniques, many of which were derived from information theory. Several observations from these reports stand out. First, one needs to consider the genetic model of the trait of interest and carefully select which single nucleotide polymorphisms and individuals are included early in the design stage of a study. Second, by targeting specific pathways when analyzing genome-wide data, one can generate more interpretable results than agnostic approaches. Finally, for datasets with small sample sizes but a large number of features like the Genetic Analysis Workshop 15 dataset, machine learning approaches may be more practical than traditional parametric approaches. (c) 2007 Wiley-Liss, Inc.
Dijkstra, Akkelies E; Smolonska, Joanna; van den Berge, Maarten
by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years). Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism...... (SNP). RESULTS: A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10(-6), OR = 1.17), located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1) on chromosome 3. The risk allele (G) was associated with higher mRNA expression...... of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA) study of CMH in Caucasian populations. METHODS: GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed...
Gong, Jian; Hsu, Li; Harrison, Tabitha
Selenium is an essential trace element and circulating selenium concentrations have been associated with a wide range of diseases. Candidate gene studies suggest that circulating selenium concentrations may be impacted by genetic variation; however, no study has comprehensively investigated...... this hypothesis. Therefore, we conducted a two-stage genome-wide association study to identify genetic variants associated with serum selenium concentrations in 1203 European descents from two cohorts: the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening and the Women’s Health Initiative (WHI). We...... tested association between 2,474,333 single nucleotide polymorphisms (SNPs) and serum selenium concentrations using linear regression models. In the first stage (PLCO) 41 SNPs clustered in 15 regions had p
Nagano, Takashi; Lubling, Yaniv; Yaffe, Eitan; Wingett, Steven W; Dean, Wendy; Tanay, Amos; Fraser, Peter
Hi-C is a powerful method that provides pairwise information on genomic regions in spatial proximity in the nucleus. Hi-C requires millions of cells as input and, as genome organization varies from cell to cell, a limitation of Hi-C is that it only provides a population average of genome conformations. We developed single-cell Hi-C to create snapshots of thousands of chromatin interactions that occur simultaneously in a single cell. To adapt Hi-C to single-cell analysis, we modified the protocol to include in-nucleus ligation. This enables the isolation of single nuclei carrying Hi-C-ligated DNA into separate tubes, followed by reversal of cross-links, capture of biotinylated ligation junctions on streptavidin-coated magnetic beads and PCR amplification of single-cell Hi-C libraries. The entire laboratory protocol can be carried out in 1 week, and although we have demonstrated its use in mouse T helper (TH1) cells, it should be applicable to any cell type or species for which standard Hi-C has been successful. We also developed an analysis pipeline to filter noise and assess the quality of data sets in a few hours. Although the interactome maps produced by single-cell Hi-C are sparse, the data provide useful information to understand cellular variability in nuclear genome organization and chromosome structure. Standard wet and dry laboratory skills in molecular biology and computational analysis are required.
Edsgard, Stefan Daniel; Dalgaard, Marlene Danner; Weinhold, Nils
Testicular germ cell cancer (TGCC) is one of the most heritable forms of cancer. Previous genome-wide association studies have focused on single nucleotide polymorphisms, largely ignoring the influence of copy number variants (CNVs). Here we present a genome-wide study of CNV on a cohort of 212...... of rare CNVs related to cell migration (false-discovery rate = 0.021, 1.8% of cases and 1.1% of controls). Dysregulation during migration of primordial germ cells has previously been suspected to be a part of TGCC development and this set of multiple rare variants may thereby have a minor contribution...
Zhao, Huiying; Nyholt, Dale R.; Yang, Yuanhao; Wang, Jihua; Yang, Yuedong
Genome-wide association studies (GWAS) have successfully identified single variants associated with diseases. To increase the power of GWAS, gene-based and pathway-based tests are commonly employed to detect more risk factors. However, the gene- and pathway-based association tests may be biased towards genes or pathways containing a large number of single-nucleotide polymorphisms (SNPs) with small P-values caused by high linkage disequilibrium (LD) correlations. To address such bias, numerous...
Full Text Available Aleksandra Szczepankiewicz1,21Laboratory of Molecular and Cell Biology, 2Department of Psychiatric Genetics, Poznan University of Medical Sciences, Poznan, PolandAbstract: Bipolar disorder (BD is a complex disorder with a number of susceptibility genes and environmental risk factors involved in its pathogenesis. In recent years, huge progress has been made in molecular techniques for genetic studies, which have enabled identification of numerous genomic regions and genetic variants implicated in BD across populations. Despite the abundance of genetic findings, the results have often been inconsistent and not replicated for many candidate genes/single nucleotide polymorphisms (SNPs. Therefore, the aim of the review presented here is to summarize the most important data reported so far in candidate gene and genome-wide association studies. Taking into account the abundance of association data, this review focuses on the most extensively studied genes and polymorphisms reported so far for BD to present the most promising genomic regions/SNPs involved in BD. The review of association data reveals evidence for several genes (SLC6A4/5-HTT [serotonin transporter gene], BDNF [brain-derived neurotrophic factor], DAOA [D-amino acid oxidase activator], DTNBP1 [dysbindin], NRG1 [neuregulin 1], DISC1 [disrupted in schizophrenia 1] to be crucial candidates in BD, whereas numerous genome-wide association studies conducted in BD indicate polymorphisms in two genes (CACNA1C [calcium channel, voltage-dependent, L type, alpha 1C subunit], ANK3 [ankyrin 3] replicated for association with BD in most of these studies. Nevertheless, further studies focusing on interactions between multiple candidate genes/SNPs, as well as systems biology and pathway analyses are necessary to integrate and improve the way we analyze the currently available association data.Keywords: candidate gene, genome-wide association study, SLC6A4, BDNF, DAOA, DTNBP1, NRG1, DISC1
Zillikens, M Carola; Demissie, Serkalem; Hsu, Yi-Hsiang
Lean body mass, consisting mostly of skeletal muscle, is important for healthy aging. We performed a genome-wide association study for whole body (20 cohorts of European ancestry with n = 38,292) and appendicular (arms and legs) lean body mass (n = 28,330) measured using dual energy X-ray absorpt...... a meta-analysis of genome-wide association studies for whole body lean body mass and find five novel genetic loci to be significantly associated.......-ray absorptiometry or bioelectrical impedance analysis, adjusted for sex, age, height, and fat mass. Twenty-one single-nucleotide polymorphisms were significantly associated with lean body mass either genome wide (p
Ding, Yiliang; Tang, Yin; Kwok, Chun Kit; Zhang, Yu; Bevilacqua, Philip C; Assmann, Sarah M
RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.
Single nucleotide polymorphisms (SNPs) may be considered the ultimate genetic markers as they represent the finest resolution of a DNA sequence (a single nucleotide), and are generally abundant in populations with a low mutation rate. SNPs are important tools in studying complex genetic traits and genome evolution.
Full Text Available The sequencing of the full nuclear genome of sesame (Sesamum indicum L. provides the platform for functional analyses of genome components and their application in breeding programs. Although the importance of microsatellites markers or simple sequence repeats (SSR in crop genotyping, genetics, and breeding applications is well established, only a little information exist concerning SSRs at the whole genome level in sesame. In addition, SSRs represent a suitable marker type for sesame molecular breeding in developing countries where it is mainly grown. In this study, we identified 138,194 genome-wide SSRs of which 76.5% were physically mapped onto the 13 pseudo-chromosomes. Among these SSRs, up to three primers pairs were supplied for 101,930 SSRs and used to in silico amplify the reference genome together with two newly sequenced sesame accessions. A total of 79,957 SSRs (78% were polymorphic between the three genomes thereby suggesting their promising use in different genomics-assisted breeding applications. From these polymorphic SSRs, 23 were selected and validated to have high polymorphic potential in 48 sesame accessions from different growing areas of Africa. Furthermore, we have developed an online user-friendly database, SisatBase (http://www.sesame-bioinfo.org/SisatBase/, which provides free access to SSRs data as well as an integrated platform for functional analyses. Altogether, the reference SSR and SisatBase would serve as useful resources for genetic assessment, genomic studies, and breeding advancement in sesame, especially in developing countries.
Jensen, Majken Karoline; Pers, Tune Hannes; Dworzynski, Piotr
in genes associated with risk of coronary heart disease (CHD). Methods and Results-Genome-wide association analyses of approximately approximate to 700 000 single-nucleotide polymorphisms in 899 incident CHD cases and 1823 age-and sex-matched controls within the Nurses' Health and the Health Professionals...... complex. Conclusions-The integration of a GWA study with PPI data successfully identifies a set of candidate susceptibility genes for incident CHD that would have been missed in single-marker GWA analysis. (Circ Cardiovasc Genet. 2011; 4:549-556.)...
Full Text Available Abstract Background A recent genome wide association study in 1017 African Americans identified several single nucleotide polymorphisms that reached genome-wide significance for systolic blood pressure. We attempted to replicate these findings in an independent sample of 2474 unrelated African Americans in the Milwaukee metropolitan area; 53% were women and 47% were hypertensives. Methods We evaluated sixteen top associated SNPs from the above genome wide association study for hypertension as a binary trait or blood pressure as a continuous trait. In addition, we evaluated eight single nucleotide polymorphisms located in two genes (STK-39 and CDH-13 found to be associated with systolic and diastolic blood pressures by other genome wide association studies in European and Amish populations. TaqMan MGB-based chemistry with fluorescent probes was used for genotyping. We had an adequate sample size (80% power to detect an effect size of 1.2-2.0 for all the single nucleotide polymorphisms for hypertension as a binary trait, and 1% variance in blood pressure as a continuous trait. Quantitative trait analyses were performed both by excluding and also by including subjects on anti-hypertensive therapy (after adjustments were made for anti-hypertensive medications. Results For all 24 SNPs, no statistically significant differences were noted in the minor allele frequencies between cases and controls. One SNP (rs2146204 showed borderline association (p = 0.006 with hypertension status using recessive model and systolic blood pressure (p = 0.02, but was not significant after adjusting for multiple comparisons. In quantitative trait analyses, among normotensives only, rs12748299 was associated with SBP (p = 0.002. In addition, several nominally significant associations were noted with SBP and DBP among normotensives but none were statistically significant. Conclusions This study highlights the importance of replication to confirm the validity of genome wide
We conducted a genome-wide scan for visceral leishmaniasis in mixed-breed dogs from a highly endemic area in Brazil using 149,648 single nucleotide polymorphism (SNP) markers genotyped in 20 cases and 28 controls. Using a mixed model approach, we found two candidate loci on canine autosomes 1 and 2....
Vaez, Ahmad; Jansen, Rick; Prins, Bram P.; Hottenga, Jouke-Jan; de Geus, Eco J. C.; Boomsma, Dorret I.; Penninx, Brenda W. J. H.; Nolte, Ilja M.; Snieder, Harold; Alizadeh, Behrooz Z.
Background Genome-wide association studies (GWASs) have successfully identified several single nucleotide polymorphisms (SNPs) associated with serum levels of C-reactive protein (CRP). An important limitation of GWASs is that the identified variants merely flag the nearby genomic region and do not
Vaez, A.; Jansen, R.; Prins, B.P.; Hottenga, J.J.; de Geus, E.J.C.; Boomsma, D.I.; Penninx, B.W.J.H.; Nolte, I.M.; Snieder, H.; Alizadeh, BZ
Background - Genome-wide association studies (GWASs) have successfully identified several single nucleotide polymorphisms (SNPs) associated with serum levels of C-reactive protein (CRP). An important limitation of GWASs is that the identified variants merely flag the nearby genomic region and do not
Schork, A.J.; Thompson, W.K.; Pham, P.; Torkamani, A.; Roddey, J.C.; Sullivan, P.F.; Kelsoe, J.; O'Donovan, M.C.; Furberg, H.; Absher, D.; Agudo, A.; Almgren, P.; Ardissino, D.; Assimes, T.L.; Bandinelli, S.; Barzan, L.; Bencko, V.; Benhamou, S.; Benjamin, E.J.; Bernardinelli, L.; Bis, J.; Boehnke, M.; Boerwinkle, E.; Boomsma, D.I.; Brennan, P.; Canova, C.; Castellsagué, X.; Chanock, S.; Chasman, D.I.; Conway, D.I.; Dackor, J.; de Geus, E.J.C.; Duan, J.; Elosua, R.; Everett, B.; Fabianova, E.; Ferrucci, L.; Foretova, L.; Fortmann, S.P.; Franceschini, N.; Frayling, T.M.; Furberg, C.; Gejman, P.V.; Groop, L.; Gu, F.; Guralnik, J.; Hankinson, S.E.; Haritunians, T.; Healy, C.; Hofman, A.; Holcátová, I.; Hunter, D.J.; Hwang, S.J.; Ioannidis, J.P.A.; Iribarren, C.; Jackson, A.U.; Janout, V.; Kaprio, J.; Kim, Y.; Kjaerheim, K.; Knowles, J.W.; Kraft, P.; Ladenvall, C.; Lagiou, P.; Lanthrop, M.; Lerman, C.; Levinson, D.F.; Levy, D.; Li, M.D.; Lin, D.Y.; Lips, E.H.; Lissowska, J.; Lowry, R.B.; Lucas, G.; Macfarlane, T.V.; Maes, H.H.M.; Mannucci, P.M.; Mates, D.; Mauri, F.; McGovern, J.A.; McKay, J.D.; McKnight, B.; Melander, O.; Merlini, P.A.; Milaneschi, Y.; Mohlke, K.L.; O'Donnell, C.J.; Pare, G.; Penninx, B.W.J.H.; Perry, J.R.B.; Posthuma, D.; Preis, S.R.; Psaty, B.; Quertermous, T.; Ramachandran, V.S.; Richiardi, L.; Ridker, P.M.; Rose, J.; Rudnai, P.; Salomaa, V.; Sanders, A.R.; Schwartz, S.M.; Shi, J.; Smit, J.H.; Stringham, H.M.; Szeszenia-Dabrowska, N.; Tanaka, T.; Taylor, K.; Thacker, E.E.; Thornton, L.; Tiemeier, H.; Tuomilehto, J.; Uitterlinden, A.G.; van Duijn, C.M.; Vink, J.M.; Vogelzangs, N.; Voight, B.F.; Walter, S.; Willemsen, G.; Zaridze, D.; Znaor, A.; Akil, H.; Anjorin, A.; Backlund, L.; Badner, J.A.; Barchas, J.D.; Barrett, T.; Bass, N.; Bauer, M.; Bellivier, F.; Bergen, S.E.; Berrettini, W.; Blackwood, D.; Bloss, C.S.; Breen, G.; Breuer, R.; Bunner, W.E.; Burmeister, M.; Byerley, W. F.; Caesar, S.; Chambert, K.; Cichon, S.; St Clair, D.; Collier, D.A.; Corvin, A.; Coryell, W.H.; Craddock, N.; Craig, D.W.; Daly, M.; Day, R.; Degenhardt, F.; Djurovic, S.; Dudbridge, F.; Edenberg, H.J.; Elkin, A.; Etain, B.; Farmer, A.E.; Ferreira, M.A.; Ferrier, I.; Flickinger, M.; Foroud, T.; Frank, J.; Fraser, C.; Frisén, L.; Gershon, E.S.; Gill, M.; Gordon-Smith, K.; Green, E.K.; Greenwood, T.A.; Grozeva, D.; Guan, W.; Gurling, H.; Gustafsson, O.; Hamshere, M.L.; Hautzinger, M.; Herms, S.; Hipolito, M.; Holmans, P.A.; Hultman, C. M.; Jamain, S.; Jones, E.G.; Jones, I.; Jones, L.; Kandaswamy, R.; Kennedy, J.L.; Kirov, G. K.; Koller, D.L.; Kwan, P.; Landén, M.; Langstrom, N.; Lathrop, M.; Lawrence, J.; Lawson, W.B.; Leboyer, M.; Lee, P.H.; Li, J.; Lichtenstein, P.; Lin, D.; Liu, C.; Lohoff, F.W.; Lucae, S.; Mahon, P.B.; Maier, W.; Martin, N.G.; Mattheisen, M.; Matthews, K.; Mattingsdal, M.; McGhee, K.A.; McGuffin, P.; McInnis, M.G.; McIntosh, A.; McKinney, R.; McLean, A.W.; McMahon, F.J.; McQuillin, A.; Meier, S.; Melle, I.; Meng, F.; Mitchell, P.B.; Montgomery, G.W.; Moran, J.; Morken, G.; Morris, D.W.; Moskvina, V.; Muglia, P.; Mühleisen, T.W.; Muir, W.J.; Müller-Myhsok, B.; Myers, R.M.; Nievergelt, C.M.; Nikolov, I.; Nimgaonkar, V.L.; Nöthen, M.M.; Nurnberger, J.I.; Nwulia, E.A.; O'Dushlaine, C.; Osby, U.; Óskarsson, H.; Owen, M.J.; Petursson, H.; Pickard, B.S.; Porgeirsson, P.; Potash, J.B.; Propping, P.; Purcell, S.M.; Quinn, E.; Raychaudhuri, S.; Rice, J.; Rietschel, M.; Ruderfer, D.; Schalling, M.; Schatzberg, A.F.; Scheftner, W.A.; Schofield, P.R.; Schulze, T.G.; Schumacher, J.; Schwarz, M.M.; Scolnick, E.; Scott, L.J.; Shilling, P.D.; Sigurdsson, E.; Sklar, P.; Smith, E.N.; Stefansson, H.; Stefansson, K.; Steffens, M; Steinberg, S.; Strauss, J.; Strohmaier, J.; Szelinger, S.; Thompson, R.C.; Tozzi, F.; Treutlein, J.; Vincent, J.B.; Watson, S.J.; Wienker, T.F.; Williamson, R.; Witt, S.H.; Wright, A.; Xu, W.; Young, A.H.; Zandi, P.P.; Zhang, P.; Zöllner, S.; Agartz, I.; Albus, M.; Alexander, M.; Amdur, R. L.; Amin, F.; Bitter, I.; Black, D.W.; Børglum, A.D.; Brown, M.A.; Bruggeman, R.; Buccola, N.G.; Cahn, W.; Cantor, R.M.; Carr, V.J.; Catts, S. V.; Choudhury, K.; Cloninger, C. R.; Cormican, P.; Danoy, P. A.; Datta, S.; DeHert, M.; Demontis, D.; Dikeos, D.; Donnelly, P.; Donohoe, G.; Duong, L.; Dwyer, S.; Fanous, A.; Fink-Jensen, A.; Freedman, R.; Freimer, N.B.; Friedl, M.; Georgieva, L.; Giegling, I.; Glenthoj, B.; Godard, S.; Golimbet, V.; de Haan, L.; Hansen, M.; Hansen, T.; Hartmann, A.M.; Henskens, F. A.; Hougaard, D. M.; Ingason, A.; Jablensky, A. V.; Jakobsen, K.D.; Jay, M.; Jönsson, E.G.; Jürgens, G.; Kahn, R.S.; Keller, M.C.; Kendler, K.S.; Kenis, G.; Kenny, E.; Konnerth, H.; Konte, B.; Krabbendam, L.; Krasucki, R.; Lasseter, V. K.; Laurent, C.; Lencz, T.; Lerer, F. B.; Liang, K. Y.; Lieberman, J. A.; Linszen, D.H.; Lönnqvist, J.; Loughland, C. M.; Maclean, A. W.; Maher, B.S.; Malhotra, A.K.; Mallet, J.; Malloy, P.; McGrath, J. J.; McLean, D. E.; Michie, P. T.; Milanova, V.; Mors, O.; Mortensen, P.B.; Mowry, B. J.; Myin-Germeys, I.; Neale, B.; Nertney, D. A.; Nestadt, G.; Nielsen, J.; Nordentoft, M.; Norton, N.; O'Neill, F.; Olincy, A.; Olsen, L.; Ophoff, R.A.; Orntoft, T. F.; van Os, J.; Pantelis, C.; Papadimitriou, G.; Pato, C.N.; Peltonen, L.; Pickard, B.; Pietilainen, O.P.; Pimm, J.; Pulver, A. E.; Puri, V.; Quested, D.; Rasmussen, H.B.; Rethelyi, J.M.; Ribble, R.; Riley, B.P.; Rossin, L.; Ruggeri, M.; Rujescu, D.; Schall, U.; Schwab, S. G.; Scott, R.J.; Silverman, J.M.; Spencer, C. C.; Strange, A.; Strengman, E.; Stroup, T.S.; Suvisaari, J.; Terenius, L.; Thirumalai, S.; Timm, S.; Toncheva, D.; Tosato, S.; van den Oord, E.J.; Veldink, J.; Visscher, P.M.; Walsh, D.; Wang, A. G.; Werge, T.; Wiersma, D.; Wildenauer, D. B.; Williams, H.J.; Williams, N.M.; van Winkel, R.; Wormley, B.; Zammit, S.; Schork, N.J.; Andreassen, O.A.; Dale, A.M.
Recent results indicate that genome-wide association studies (GWAS) have the potential to explain much of the heritability of common complex phenotypes, but methods are lacking to reliably identify the remaining associated single nucleotide polymorphisms (SNPs). We applied stratified False Discovery
Schork, Andrew J.; Thompson, Wesley K.; Pham, Phillip; Torkamani, Ali; Roddey, J. Cooper; Sullivan, Patrick F.; Kelsoe, John R.; O'Donovan, Michael C.; Furberg, Helena; Schork, Nicholas J.; Andreassen, Ole A.; Dale, Anders M.; Absher, Devin; Agudo, Antonio; Almgren, Peter; Ardissino, Diego; Assimes, Themistocles L.; Bandinelli, Stephania; Barzan, Luigi; Bencko, Vladimir; Benhamou, Simone; Benjamin, Emelia J.; Bernardinelli, Luisa; Bis, Joshua; Boehnke, Michael; Boerwinkle, Eric; Boomsma, Dorret I.; Brennan, Paul; Canova, Cristina; Castellsagué, Xavier; Chanock, Stephen; Chasman, Daniel; Conway, David I.; Dackor, Jennifer; de Geus, Eco J. C.; Duan, Jubao; Elosua, Roberto; Everett, Brendan; Fabianova, Eleonora; Ferrucci, Luigi; Foretova, Lenka; Fortmann, Stephen P.; Franceschini, Nora; Frayling, Timothy; Furberg, Curt; Gejman, Pablo V.; Groop, Leif; Gu, Fangyi; de Haan, Lieuwe; Linszen, Don H.
Recent results indicate that genome-wide association studies (GWAS) have the potential to explain much of the heritability of common complex phenotypes, but methods are lacking to reliably identify the remaining associated single nucleotide polymorphisms (SNPs). We applied stratified False Discovery
Wang, Xianshu; Pankratz, V. Shane; Fredericksen, Zachary; Tarrell, Robert; Karaus, Mary; McGuffog, Lesley; Pharaoh, Paul D. P.; Ponder, Bruce A. J.; Dunning, Alison M.; Peock, Susan; Cook, Margaret; Oliver, Clare; Frost, Debra; Sinilnikova, Olga M.; Stoppa-Lyonnet, Dominique; Mazoyer, Sylvie; Houdayer, Claude; Hogervorst, Frans B. L.; Hooning, Maartje J.; Ligtenberg, Marjolijn J.; Spurdle, Amanda; Chenevix-Trench, Georgia; Schmutzler, Rita K.; Wappenschmidt, Barbara; Engel, Christoph; Meindl, Alfons; Domchek, Susan M.; Nathanson, Katherine L.; Rebbeck, Timothy R.; Singer, Christian F.; Gschwantler-Kaulich, Daphne; Dressler, Catherina; Fink, Anneliese; Szabo, Csilla I.; Zikan, Michal; Foretova, Lenka; Claes, Kathleen; Thomas, Gilles; Hoover, Robert N.; Hunter, David J.; Chanock, Stephen J.; Easton, Douglas F.; Antoniou, Antonis C.; Couch, Fergus J.; Gregory, Helen; Miedzybrodzka, Zosia; Morrison, Patrick; Cole, Trevor; McKeown, Carole; Taylor, Amy; Donaldson, Alan; Paterson, Joan; Murray, Alexandra; Rogers, Mark; McCann, Emma; Kennedy, John; Barton, David; Porteous, Mary; Brewer, Carole; Kivuva, Emma; Searle, Anne; Goodman, Selina; Davidson, Rosemarie; Murday, Victoria; Bradshaw, Nicola; Snadden, Lesley; Longmuir, Mark; Watt, Catherine; Izatt, Louise; Pichert, Gabriella; Langman, Caroline; Dorkins, Huw; Barwell, Julian; Chu, Carol; Bishop, Tim; Miller, Julie; Ellis, Ian; Evans, D. Gareth; Lalloo, Fiona; Holt, Felicity; Male, Alison; Robinson, Anne; Gardiner, Carol; Douglas, Fiona; Claber, Oonagh; Walker, Lisa; McLeod, Diane; Eeles, Ros; Shanley, Susan; Rahman, Nazneen; Houlston, Richard; Bancroft, Elizabeth; D'Mello, Lucia; Page, Elizabeth; Ardern-Jones, Audrey; Mitra, Anita; Cook, Jackie; Quarrell, Oliver; Bardsley, Cathryn; Hodgson, Shirley; Goff, Sheila; Brice, Glen; Winchester, Lizzie; Eccles, Diana; Lucassen, Anneke; Crawford, Gillian; Tyler, Emma; McBride, Donna; Bérard, Léon; Sinilnikova, Olga; Barjhoux, Laure; Giraud, Sophie; Léone, Mélanie; Gauthier-Villars, Marion; Moncoutier, Virginie; Belotti, Muriel; de Pauw, Antoine; Bressac-de-Paillerets, Brigitte; Remenieras, Audrey; Byrde, Véronique; Caron, Olivier; Lenoir, Gilbert; Bignon, Yves-Jean; Uhrhammer, Nancy; Lasset, Christine; Bonadona, Valérie; Hardouin, Agnès; Berthet, Pascaline; Sobol, Hagay; Bourdon, Violaine; Eisinger, Françoise; Coulet, Florence; Colas, Chrystelle; Soubrier, Florent; Coupier, Isabelle; Payrat, Jean-Philippe; Fournier, Joëlle; Révillion, Françoise; Vennin, Philippe; Adenis, Claude; Rouleau, Etienne; Lidereau, Rosette; Demange, Liliane; Nogues, Catherine; Muller, Danièle; Fricker, Jean-Pierre; Longy, Michel; Sevenet, Nicolas; Toulas, Christine; Guimbaud, Rosine; Gladieff, Laurence; Feillel, Viviane; Leroux, Dominique; Dreyfus, Hélèn; Rebischung, Christine; Cassini, Cécile; Olivier-Faivre, Laurence; Prieur, Fabienne; Ferrer, Sandra Fert; Frénay, Marc; Vénat-Bouvet, Laurence; Lynch, Henry T.; Hogervorst, Frans; Vernhoef, Senno; Pijpe, Anouk; van 't Veer, Laura; van Leeuwen, Flora; Rookus, Matti; Collée, Margriet; van den Ouweland, Ans; Kriege, Mieke; Schutte, Mieke; Hooning, Maartje; Seynaeve, Caroline; van Asperen, Christi; Wijnen, Juul; Vreeswijk, Maaike; Tollenaar, Rob; Devilee, Peter; Ligtenberg, Marjolijn; Hoogerbrugge, Nicoline; Ausems, Margreet; van der Luijt, Rob; Aalfs, Cora; van Os, Theo; Gille, Hans; Waisfisz, Quinten; Meijers-Heijboer, Hanne; Gomez-Garcia, Encarna; van Roozendaal, Kees; Blok, Marinus; Oosterwijk, Jan; van der Hout, Annemieke; Mourits, Marian; Vasen, Hans; Szabo, Csilla; Pohlreich, Petr; Kleibl, Zdenek; Machackova, Eva; Lukesova, Miroslava; de Leeneer, Kim; Poppe, Bruce; de Paepe, Anne
Recent studies have identified single nucleotide polymorphisms (SNPs) that significantly modify breast cancer risk in BRCA1 and BRCA2 mutation carriers. Since these risk modifiers were originally identified as genetic risk factors for breast cancer in genome-wide association studies (GWASs),
Børglum, A D; Demontis, D; Grove, J
Genetic and environmental components as well as their interaction contribute to the risk of schizophrenia, making it highly relevant to include environmental factors in genetic studies of schizophrenia. This study comprises genome-wide association (GWA) and follow-up analyses of all individuals...... born in Denmark since 1981 and diagnosed with schizophrenia as well as controls from the same birth cohort. Furthermore, we present the first genome-wide interaction survey of single nucleotide polymorphisms (SNPs) and maternal cytomegalovirus (CMV) infection. The GWA analysis included 888 cases...... was found for rs7902091 (P(SNP × CMV)=7.3 × 10(-7)) in CTNNA3, a gene not previously implicated in schizophrenia, stressing the importance of including environmental factors in genetic studies....
Wu, Xiaoping; Fang, Ming; Liu, Lin
.Results: The Illumina BovineSNP50 BeadChip was used to identify single nucleotide polymorphisms (SNPs) that are associated with body conformation traits. A least absolute shrinkage and selection operator (LASSO) was applied to detect multiple SNPs simultaneously for 29 body conformation traits with 1,314 Chinese...... Holstein cattle and 52,166 SNPs. Totally, 59 genome-wide significant SNPs associated with 26 conformation traits were detected by genome-wide association analysis; five SNPs were within previously reported QTL regions (Animal Quantitative Trait Loci (QTL) database) and 11 were very close to the reported...... SNPs. Twenty-two SNPs were located within annotated gene regions, while the remainder were 0.6-826 kb away from known genes. Some of the genes had clear biological functions related to conformation traits. By combining information about the previously reported QTL regions and the biological functions...
Lundby, Alicia; Rossin, Elizabeth J.; Steffensen, Annette B.
Genome-wide association studies (GWAS) have identified thousands of loci associated with complex traits, but it is challenging to pinpoint causal genes in these loci and to exploit subtle association signals. We used tissue-specific quantitative interaction proteomics to map a network of five genes...... involved in the Mendelian disorder long QT syndrome (LOTS). We integrated the LOTS network with GWAS loci from the corresponding common complex trait, QT-interval variation, to identify candidate genes that were subsequently confirmed in Xenopus laevis oocytes and zebrafish. We used the LOTS protein...... network to filter weak GWAS signals by identifying single-nucleotide polymorphisms (SNPs) in proximity to genes in the network supported by strong proteomic evidence. Three SNPs passing this filter reached genome-wide significance after replication genotyping. Overall, we present a general strategy...
Full Text Available Alien chromosome substitution (CS lines are treated as vital germplasms for breeding and genetic mapping. Previously, a whole set of nine Brassica rapa-oleracea monosonic alien addition lines (MAALs, C1-C9 was established in the background of natural B. napus genotype “Oro,” after the restituted B. rapa (RBR for Oro was realized. Herein, a monosomic substitution line with one alien C1 chromosome (Cs1 in the RBR complement was selected in the progenies of MAAL C1 and RBR, by the PCR amplification of specific gene markers and fluorescence in situ hybridization. Cs1 exhibited the whole plant morphology similar to RBR except for the defective stamens without fertile pollen grains, but it produced some seeds and progeny plants carrying the C1 chromosome at high rate besides those without the alien chromosome after pollinated by RBR. The viability of the substitution and its progeny for the RBR diploid further elucidated the functional compensation between the chromosome pairs with high homoeology. To reveal the impact of such aneuploidy on genome-wide gene expression, the transcriptomes of MAAL C1, Cs1 and euploid RBR were analyzed. Compared to RBR, Cs1 had sharply reduced gene expression level across chromosome A1, demonstrating the loss of one copy of A1 chromosome. Both additional chromosome C1 in MAAL and substitutional chromosome C1 in Cs1 caused not only cis-effect but also prevalent trans-effect differentially expressed genes. A dominant gene dosage effects prevailed among low expressed genes across chromosome A1 in Cs1, and moreover, dosage effects for some genes potentially contributed to the phenotype deviations. Our results provided novel insights into the transcriptomic perturbation and gene dosage effects on phenotype in CS related to one naturally evolved allopolyploid.
Psychosis Endophenotypes International Consortium; Wellcome Trust Case-Control Consortium; Bramon, E.; Pirinen, M.; Strange, A.; Lin, K.; Freeman, C.; Bellenguez, C.; Su, Z.; Band, G.; Pearson, R.; Vukcevic, D.; Langford, C.; Deloukas, P.; Hunt, S.
BACKGROUND: Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories. METHODS: 1239 cases with schizophrenia, schizoaffective disorder, or psychotic bipolar disorder; 857 of their unaffected relatives, and 2739 healthy controls were genotyped with the Affymetrix 6.0 single nucleotide polymorphism (SNP) array. Analyses of 69...
Tosato, Sarah; Myin-germeys, Inez; Barroso, Ines; Bender, Stephan; Giegling, Ina; Arranz, Maria J.; Donnelly, Peter; Bellenguez, Celine; Brown, Matthew A.; Lawrie, Stephen; Kalaydjieva, Luba; Vukcevic, Damjan; Kahn, Rene S.; Dronov, Serge; Walshe, Muriel
Background: Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories.Methods: 1239 cases with schizophrenia, schizoaffective disorder, or psychotic bipolar disorder; 857 of their unaffected relatives, and 2739 healthy controls were genotyped with the Affymetrix 6.0 single nucleotide polymorphism (SNP) array. Analyses of 695,19...
Willour, Virginia L.; Seifuddin, Fayaz; Mahon, Pamela B.; Jancic, Dubravka; Pirooznia, Mehdi; Steele, Jo; Schweizer, Barbara; Goes, Fernando S.; Mondimore, Francis M.; MacKinnon, Dean F.; Perlis, Roy H.; Lee, Phil Hyoun; Huang, Jie; Kelsoe, John R.; Shilling, Paul D.; Rietschel, Marcella; Nöthen, Markus; Cichon, Sven; Gurling, Hugh; Purcell, Shaun; Smoller, Jordan W.; Craddock, Nicholas; DePaulo, J. Raymond; Schulze, Thomas G.; McMahon, Francis J.; Zandi, Peter P.; Potash, James B.
The heritable component to attempted and completed suicide is partly related to psychiatric disorders and also partly independent of them. While attempted suicide linkage regions have been identified on 2p11–12 and 6q25–26, there are likely many more such loci, the discovery of which will require a much higher resolution approach, such as the genome-wide association study (GWAS). With this in mind, we conducted an attempted suicide GWAS that compared the single nucleotide polymorphism (SNP) genotypes of 1,201 bipolar (BP) subjects with a history of suicide attempts to the genotypes of 1,497 BP subjects without a history of suicide attempts. 2,507 SNPs with evidence for association at p<0.001 were identified. These associated SNPs were subsequently tested for association in a large and independent BP sample set. None of these SNPs were significantly associated in the replication sample after correcting for multiple testing, but the combined analysis of the two sample sets produced an association signal on 2p25 (rs300774) at the threshold of genome-wide significance (p= 5.07 × 10−8). The associated SNPs on 2p25 fall in a large linkage disequilibrium block containing the ACP1 gene, a gene whose expression is significantly elevated in BP subjects who have completed suicide. Furthermore, the ACP1 protein is a tyrosine phosphatase that influences Wnt signaling, a pathway regulated by lithium, making ACP1 a functional candidate for involvement in the phenotype. Larger GWAS sample sets will be required to confirm the signal on 2p25 and to identify additional genetic risk factors increasing susceptibility for attempted suicide. PMID:21423239
Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J
The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53–3.14), P=1.9 × 10-5). Two polymorphisms at 6p21.2 LINC00951–LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37–1.85), P=1.6 × 10−9) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967
Zillikens, M Carola; Demissie, Serkalem; Hsu, Yi-Hsiang; Yerges-Armstrong, Laura M; Chou, Wen-Chi; Stolk, Lisette; Livshits, Gregory; Broer, Linda; Johnson, Toby; Koller, Daniel L; Kutalik, Zoltán; Luan, Jian'an; Malkin, Ida; Ried, Janina S; Smith, Albert V; Thorleifsson, Gudmar; Vandenput, Liesbeth; Hua Zhao, Jing; Zhang, Weihua; Aghdassi, Ali; Åkesson, Kristina; Amin, Najaf; Baier, Leslie J; Barroso, Inês; Bennett, David A; Bertram, Lars; Biffar, Rainer; Bochud, Murielle; Boehnke, Michael; Borecki, Ingrid B; Buchman, Aron S; Byberg, Liisa; Campbell, Harry; Campos Obanda, Natalia; Cauley, Jane A; Cawthon, Peggy M; Cederberg, Henna; Chen, Zhao; Cho, Nam H; Jin Choi, Hyung; Claussnitzer, Melina; Collins, Francis; Cummings, Steven R; De Jager, Philip L; Demuth, Ilja; Dhonukshe-Rutten, Rosalie A M; Diatchenko, Luda; Eiriksdottir, Gudny; Enneman, Anke W; Erdos, Mike; Eriksson, Johan G; Eriksson, Joel; Estrada, Karol; Evans, Daniel S; Feitosa, Mary F; Fu, Mao; Garcia, Melissa; Gieger, Christian; Girke, Thomas; Glazer, Nicole L; Grallert, Harald; Grewal, Jagvir; Han, Bok-Ghee; Hanson, Robert L; Hayward, Caroline; Hofman, Albert; Hoffman, Eric P; Homuth, Georg; Hsueh, Wen-Chi; Hubal, Monica J; Hubbard, Alan; Huffman, Kim M; Husted, Lise B; Illig, Thomas; Ingelsson, Erik; Ittermann, Till; Jansson, John-Olov; Jordan, Joanne M; Jula, Antti; Karlsson, Magnus; Khaw, Kay-Tee; Kilpeläinen, Tuomas O; Klopp, Norman; Kloth, Jacqueline S L; Koistinen, Heikki A; Kraus, William E; Kritchevsky, Stephen; Kuulasmaa, Teemu; Kuusisto, Johanna; Laakso, Markku; Lahti, Jari; Lang, Thomas; Langdahl, Bente L; Launer, Lenore J; Lee, Jong-Young; Lerch, Markus M; Lewis, Joshua R; Lind, Lars; Lindgren, Cecilia; Liu, Yongmei; Liu, Tian; Liu, Youfang; Ljunggren, Östen; Lorentzon, Mattias; Luben, Robert N; Maixner, William; McGuigan, Fiona E; Medina-Gomez, Carolina; Meitinger, Thomas; Melhus, Håkan; Mellström, Dan; Melov, Simon; Michaëlsson, Karl; Mitchell, Braxton D; Morris, Andrew P; Mosekilde, Leif; Newman, Anne; Nielson, Carrie M; O'Connell, Jeffrey R; Oostra, Ben A; Orwoll, Eric S; Palotie, Aarno; Parker, Stephen C J; Peacock, Munro; Perola, Markus; Peters, Annette; Polasek, Ozren; Prince, Richard L; Räikkönen, Katri; Ralston, Stuart H; Ripatti, Samuli; Robbins, John A; Rotter, Jerome I; Rudan, Igor; Salomaa, Veikko; Satterfield, Suzanne; Schadt, Eric E; Schipf, Sabine; Scott, Laura; Sehmi, Joban; Shen, Jian; Soo Shin, Chan; Sigurdsson, Gunnar; Smith, Shad; Soranzo, Nicole; Stančáková, Alena; Steinhagen-Thiessen, Elisabeth; Streeten, Elizabeth A; Styrkarsdottir, Unnur; Swart, Karin M A; Tan, Sian-Tsung; Tarnopolsky, Mark A; Thompson, Patricia; Thomson, Cynthia A; Thorsteinsdottir, Unnur; Tikkanen, Emmi; Tranah, Gregory J; Tuomilehto, Jaakko; van Schoor, Natasja M; Verma, Arjun; Vollenweider, Peter; Völzke, Henry; Wactawski-Wende, Jean; Walker, Mark; Weedon, Michael N; Welch, Ryan; Wichmann, H-Erich; Widen, Elisabeth; Williams, Frances M K; Wilson, James F; Wright, Nicole C; Xie, Weijia; Yu, Lei; Zhou, Yanhua; Chambers, John C; Döring, Angela; van Duijn, Cornelia M; Econs, Michael J; Gudnason, Vilmundur; Kooner, Jaspal S; Psaty, Bruce M; Spector, Timothy D; Stefansson, Kari; Rivadeneira, Fernando; Uitterlinden, André G; Wareham, Nicholas J; Ossowski, Vicky; Waterworth, Dawn; Loos, Ruth J F; Karasik, David; Harris, Tamara B; Ohlsson, Claes; Kiel, Douglas P
Lean body mass, consisting mostly of skeletal muscle, is important for healthy aging. We performed a genome-wide association study for whole body (20 cohorts of European ancestry with n = 38,292) and appendicular (arms and legs) lean body mass (n = 28,330) measured using dual energy X-ray absorptiometry or bioelectrical impedance analysis, adjusted for sex, age, height, and fat mass. Twenty-one single-nucleotide polymorphisms were significantly associated with lean body mass either genome wide (p lean body mass and in 45,090 (42,360 of European ancestry) subjects from 25 cohorts for appendicular lean body mass was successful for five single-nucleotide polymorphisms in/near HSD17B11, VCAN, ADAMTSL3, IRS1, and FTO for total lean body mass and for three single-nucleotide polymorphisms in/near VCAN, ADAMTSL3, and IRS1 for appendicular lean body mass. Our findings provide new insight into the genetics of lean body mass.Lean body mass is a highly heritable trait and is associated with various health conditions. Here, Kiel and colleagues perform a meta-analysis of genome-wide association studies for whole body lean body mass and find five novel genetic loci to be significantly associated.
Brynildsrud, Ola; Bohlin, Jon; Scheffer, Lonneke; Eldholm, Vegard
Genome-wide association studies (GWAS) have become indispensable in human medicine and genomics, but very few have been carried out on bacteria. Here we introduce Scoary, an ultra-fast, easy-to-use, and widely applicable software tool that scores the components of the pan-genome for associations to observed phenotypic traits while accounting for population stratification, with minimal assumptions about evolutionary processes. We call our approach pan-GWAS to distinguish it from traditional, single nucleotide polymorphism (SNP)-based GWAS. Scoary is implemented in Python and is available under an open source GPLv3 license at https://github.com/AdmiralenOla/Scoary .
A single nucleotide polymorphism (SNP) assay for population stratification test ... phenotypes and unlinked candidate loci in case-control and cohort studies of ... Key words: Chinese, Japanese, population stratification, ancestry informative ...
Neumann, Alexander; Direk, Nese; Crawford, Andrew A; Mirza, Saira; Adams, Hieab; Bolton, Jennifer; Hayward, Caroline; Strachan, David P; Payne, Erin K; Smith, Jennifer A; Milaneschi, Yuri; Penninx, Brenda; Hottenga, Jouke J; de Geus, Eco; Oldehinkel, Albertine J; van der Most, Peter J; de Rijke, Yolanda; Walker, Brian R; Tiemeier, Henning
Cortisol is an important stress hormone affected by a variety of biological and environmental factors, such as the circadian rhythm, exercise and psychological stress. Cortisol is mostly measured using blood or saliva samples. A number of genetic variants have been found to contribute to cortisol levels with these methods. While the effects of several specific single genetic variants is known, the joint genome-wide contribution to cortisol levels is unclear. Our aim was to estimate the amount of cortisol variance explained by common single nucleotide polymorphisms, i.e. the SNP heritability, using a variety of cortisol measures, cohorts and analysis approaches. We analyzed morning plasma (n=5705) and saliva levels (n=1717), as well as diurnal saliva levels (n=1541), in the Rotterdam Study using genomic restricted maximum likelihood estimation. Additionally, linkage disequilibrium score regression was fitted on the results of genome-wide association studies (GWAS) performed by the CORNET consortium on morning plasma cortisol (n=12,597) and saliva cortisol (n=7703). No significant SNP heritability was detected for any cortisol measure, sample or analysis approach. Point estimates ranged from 0% to 9%. Morning plasma cortisol in the CORNET cohorts, the sample with the most power, had a 6% [95%CI: 0-13%] SNP heritability. The results consistently suggest a low SNP heritability of these acute and short-term measures of cortisol. The low SNP heritability may reflect the substantial environmental and, in particular, situational component of these cortisol measures. Future GWAS will require very large sample sizes. Alternatively, more long-term cortisol measures such as hair cortisol samples are needed to discover further genetic pathways regulating cortisol concentrations. Copyright © 2017 Elsevier Ltd. All rights reserved.
Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar
Equine guttural pouch tympany (GPT) is a hereditary condition affecting foals in their first months of life. Complex segregation analyses in Arabian and German warmblood horses showed the involvement of a major gene as very likely. Genome-wide linkage and association analyses including a high density marker set of single nucleotide polymorphisms (SNPs) were performed to map the genomic region harbouring the potential major gene for GPT. A total of 85 Arabian and 373 German warmblood horses were genotyped on the Illumina equine SNP50 beadchip. Non-parametric multipoint linkage analyses showed genome-wide significance on horse chromosomes (ECA) 3 for German warmblood at 16–26 Mb and 34–55 Mb and for Arabian on ECA15 at 64–65 Mb. Genome-wide association analyses confirmed the linked regions for both breeds. In Arabian, genome-wide association was detected at 64 Mb within the region with the highest linkage peak on ECA15. For German warmblood, signals for genome-wide association were close to the peak region of linkage at 52 Mb on ECA3. The odds ratio for the SNP with the highest genome-wide association was 0.12 for the Arabian. In conclusion, the refinement of the regions with the Illumina equine SNP50 beadchip is an important step to unravel the responsible mutations for GPT. PMID:22848553
Full Text Available Since the first report of a genome-wide association study (GWAS on human age-related macular degeneration, GWAS has successfully been used to discover genetic variants for a variety of complex human diseases and/or traits, and thousands of associated loci have been identified. However, the underlying mechanisms for these loci remain largely unknown. To make these GWAS findings more useful, it is necessary to perform in-depth data mining. The data analysis in the post-GWAS era will include the following aspects: fine-mapping of susceptibility regions to identify susceptibility genes for elucidating the biological mechanism of action; joint analysis of susceptibility genes in different diseases; integration of GWAS, transcriptome, and epigenetic data to analyze expression and methylation quantitative trait loci at the whole-genome level, and find single-nucleotide polymorphisms that influence gene expression and DNA methylation; genome-wide association analysis of disease-related DNA copy number variations. Applying these strategies and methods will serve to strengthen GWAS data to enhance the utility and significance of GWAS in improving understanding of the genetics of complex diseases or traits and translate these findings for clinical applications. Keywords: Genome-wide association study, Data mining, Integrative data analysis, Polymorphism, Copy number variation
Hamshere, M L; Walters, J T R; Smith, R
The Schizophrenia Psychiatric Genome-Wide Association Study Consortium (PGC) highlighted 81 single-nucleotide polymorphisms (SNPs) with moderate evidence for association to schizophrenia. After follow-up in independent samples, seven loci attained genome-wide significance (GWS), but multi-locus t...... interval (CI) 78-100%) of the original set of 78 SNPs represent true associations. We also provide strong evidence for overlap in genetic risk between schizophrenia and bipolar disorder.Molecular Psychiatry advance online publication, 22 May 2012; doi:10.1038/mp.2012.67....
Full Text Available The Roma people, living throughout Europe and West Asia, are a diverse population linked by the Romani language and culture. Previous linguistic and genetic studies have suggested that the Roma migrated into Europe from South Asia about 1,000-1,500 years ago. Genetic inferences about Roma history have mostly focused on the Y chromosome and mitochondrial DNA. To explore what additional information can be learned from genome-wide data, we analyzed data from six Roma groups that we genotyped at hundreds of thousands of single nucleotide polymorphisms (SNPs. We estimate that the Roma harbor about 80% West Eurasian ancestry-derived from a combination of European and South Asian sources-and that the date of admixture of South Asian and European ancestry was about 850 years before present. We provide evidence for Eastern Europe being a major source of European ancestry, and North-west India being a major source of the South Asian ancestry in the Roma. By computing allele sharing as a measure of linkage disequilibrium, we estimate that the migration of Roma out of the Indian subcontinent was accompanied by a severe founder event, which appears to have been followed by a major demographic expansion after the arrival in Europe.
Full Text Available Selenium is an essential trace element and circulating selenium concentrations have been associated with a wide range of diseases. Candidate gene studies suggest that circulating selenium concentrations may be impacted by genetic variation; however, no study has comprehensively investigated this hypothesis. Therefore, we conducted a two-stage genome-wide association study to identify genetic variants associated with serum selenium concentrations in 1203 European descents from two cohorts: the Prostate, Lung, Colorectal, and Ovarian (PLCO Cancer Screening and the Women’s Health Initiative (WHI. We tested association between 2,474,333 single nucleotide polymorphisms (SNPs and serum selenium concentrations using linear regression models. In the first stage (PLCO 41 SNPs clustered in 15 regions had p < 1 × 10−5. None of these 41 SNPs reached the significant threshold (p = 0.05/15 regions = 0.003 in the second stage (WHI. Three SNPs had p < 0.05 in the second stage (rs1395479 and rs1506807 in 4q34.3/AGA-NEIL3; and rs891684 in 17q24.3/SLC39A11 and had p between 2.62 × 10−7 and 4.04 × 10−7 in the combined analysis (PLCO + WHI. Additional studies are needed to replicate these findings. Identification of genetic variation that impacts selenium concentrations may contribute to a better understanding of which genes regulate circulating selenium concentrations.
Full Text Available Abstract Background With the availability of large-scale genome-wide association study (GWAS data, choosing an optimal set of SNPs for disease susceptibility prediction is a challenging task. This study aimed to use single nucleotide polymorphisms (SNPs to predict psoriasis from searching GWAS data. Methods Totally we had 2,798 samples and 451,724 SNPs. Process for searching a set of SNPs to predict susceptibility for psoriasis consisted of two steps. The first one was to search top 1,000 SNPs with high accuracy for prediction of psoriasis from GWAS dataset. The second one was to search for an optimal SNP subset for predicting psoriasis. The sequential information bottleneck (sIB method was compared with classical linear discriminant analysis(LDA for classification performance. Results The best test harmonic mean of sensitivity and specificity for predicting psoriasis by sIB was 0.674(95% CI: 0.650-0.698, while only 0.520(95% CI: 0.472-0.524 was reported for predicting disease by LDA. Our results indicate that the new classifier sIB performs better than LDA in the study. Conclusions The fact that a small set of SNPs can predict disease status with average accuracy of 68% makes it possible to use SNP data for psoriasis prediction.
Emily R Davenport
Full Text Available The bacterial composition of the human fecal microbiome is influenced by many lifestyle factors, notably diet. It is less clear, however, what role host genetics plays in dictating the composition of bacteria living in the gut. In this study, we examined the association of ~200K host genotypes with the relative abundance of fecal bacterial taxa in a founder population, the Hutterites, during two seasons (n = 91 summer, n = 93 winter, n = 57 individuals collected in both. These individuals live and eat communally, minimizing variation due to environmental exposures, including diet, which could potentially mask small genetic effects. Using a GWAS approach that takes into account the relatedness between subjects, we identified at least 8 bacterial taxa whose abundances were associated with single nucleotide polymorphisms in the host genome in each season (at genome-wide FDR of 20%. For example, we identified an association between a taxon known to affect obesity (genus Akkermansia and a variant near PLD1, a gene previously associated with body mass index. Moreover, we replicate a previously reported association from a quantitative trait locus (QTL mapping study of fecal microbiome abundance in mice (genus Lactococcus, rs3747113, P = 3.13 x 10-7. Finally, based on the significance distribution of the associated microbiome QTLs in our study with respect to chromatin accessibility profiles, we identified tissues in which host genetic variation may be acting to influence bacterial abundance in the gut.
Full Text Available Type 2 diabetes (T2D is one of the most frequent mortality causes in western countries, with rapidly increasing prevalence. Anti-diabetic drugs are the first therapeutic approach, although many patients develop drug resistance. Most drug responsiveness variability can be explained by genetic causes. Inter-individual variability is principally due to single nucleotide polymorphisms, and differential drug responsiveness has been correlated to alteration in genes involved in drug metabolism (CYP2C9 or insulin signaling (IRS1, ABCC8, KCNJ11 and PPARG. However, most genome-wide association studies did not provide clues about the contribution of DNA variations to impaired drug responsiveness. Thus, characterizing T2D drug responsiveness variants is needed to guide clinicians toward tailored therapeutic approaches. Here, we extensively investigated polymorphisms associated with altered drug response in T2D, predicting their effects in silico. Combining different computational approaches, we focused on the expression pattern of genes correlated to drug resistance and inferred evolutionary conservation of polymorphic residues, computationally predicting the biochemical properties of polymorphic proteins. Using RNA-Sequencing followed by targeted validation, we identified and experimentally confirmed that two nucleotide variations in the CAPN10 gene—currently annotated as intronic—fall within two new transcripts in this locus. Additionally, we found that a Single Nucleotide Polymorphism (SNP, currently reported as intergenic, maps to the intron of a new transcript, harboring CAPN10 and GPR35 genes, which undergoes non-sense mediated decay. Finally, we analyzed variants that fall into non-coding regulatory regions of yet underestimated functional significance, predicting that some of them can potentially affect gene expression and/or post-transcriptional regulation of mRNAs affecting the splicing.
Shiotani, Akiko; Murao, Takahisa; Fujita, Yoshihiko; Fujimura, Yoshinori; Sakakibara, Takashi; Nishio, Kazuto; Haruma, Ken
In our previous study, the SLCO1B1 521TT genotype and the SLCO1B1*1b haplotype were significantly associated with the risk of peptic ulcer in patients taking low-dose aspirin (LDA). The aim of the present study was to investigate pharmacogenomic profile of LDA-induced peptic ulcer and ulcer bleeding. Patients taking 100 mg of enteric-coated aspirin for cardiovascular diseases and with a peptic ulcer or ulcer bleeding and patients who also participated in endoscopic surveillance were studied. Genome-wide analysis of single nucleotide polymorphisms (SNPs) was performed using the Affymetrix DME Plus Premier Pack. SLCO1B1*1b haplotype and candidate genotypes of genes associated with ulcer bleeding or small bowel bleeding identified by genome-wide analysis were determined using TaqMan SNP Genotyping Assay kits, polymerase chain reaction-restriction fragment length polymorphism, and direct sequencing. Of 593 patients enrolled, 111 patients had a peptic ulcer and 45 had ulcer bleeding. The frequencies of the SLCO1B1*1b haplotype and CHST2 2082 T allele were significantly greater in patients with peptic ulcer and ulcer bleeding compared to the controls. After adjustment for significant factors, the SLCO1B1*1b haplotype was associated with peptic ulcer (OR 2.20, 95% CI 1.24-3.89) and CHST2 2082 T allele with ulcer bleeding (2.57, 1.07-6.17). The CHST2 2082 T allele as well as SLCO1B1*1b haplotype may identify patients at increased risk for aspirin-induced peptic ulcer or ulcer bleeding. © 2014 Journal of Gastroenterology and Hepatology Foundation and Wiley Publishing Asia Pty Ltd.
Duncan, Laramie; Yilmaz, Zeynep; Gaspar, Helena; Walters, Raymond; Goldstein, Jackie; Anttila, Verneri; Bulik-Sullivan, Brendan; Ripke, Stephan; Thornton, Laura; Hinney, Anke; Daly, Mark; Sullivan, Patrick F; Zeggini, Eleftheria; Breen, Gerome; Bulik, Cynthia M
The authors conducted a genome-wide association study of anorexia nervosa and calculated genetic correlations with a series of psychiatric, educational, and metabolic phenotypes. Following uniform quality control and imputation procedures using the 1000 Genomes Project (phase 3) in 12 case-control cohorts comprising 3,495 anorexia nervosa cases and 10,982 controls, the authors performed standard association analysis followed by a meta-analysis across cohorts. Linkage disequilibrium score regression was used to calculate genome-wide common variant heritability (single-nucleotide polymorphism [SNP]-based heritability [h 2 SNP ]), partitioned heritability, and genetic correlations (r g ) between anorexia nervosa and 159 other phenotypes. Results were obtained for 10,641,224 SNPs and insertion-deletion variants with minor allele frequencies >1% and imputation quality scores >0.6. The h 2 SNP of anorexia nervosa was 0.20 (SE=0.02), suggesting that a substantial fraction of the twin-based heritability arises from common genetic variation. The authors identified one genome-wide significant locus on chromosome 12 (rs4622308) in a region harboring a previously reported type 1 diabetes and autoimmune disorder locus. Significant positive genetic correlations were observed between anorexia nervosa and schizophrenia, neuroticism, educational attainment, and high-density lipoprotein cholesterol, and significant negative genetic correlations were observed between anorexia nervosa and body mass index, insulin, glucose, and lipid phenotypes. Anorexia nervosa is a complex heritable phenotype for which this study has uncovered the first genome-wide significant locus. Anorexia nervosa also has large and significant genetic correlations with both psychiatric phenotypes and metabolic traits. The study results encourage a reconceptualization of this frequently lethal disorder as one with both psychiatric and metabolic etiology.
Ji, Yuan; Schaid, Daniel J; Desta, Zeruesenay; Kubo, Michiaki; Batzler, Anthony J; Snyder, Karen; Mushiroda, Taisei; Kamatani, Naoyuki; Ogburn, Evan; Hall-Flavin, Daniel; Flockhart, David; Nakamura, Yusuke; Mrazek, David A; Weinshilboum, Richard M
Citalopram (CT) and escitalopram (S-CT) are among the most widely prescribed selective serotonin reuptake inhibitors used to treat major depressive disorder (MDD). We applied a genome-wide association study to identify genetic factors that contribute to variation in plasma concentrations of CT or S-CT and their metabolites in MDD patients treated with CT or S-CT. Our genome-wide association study was performed using samples from 435 MDD patients. Linear mixed models were used to account for within-subject correlations of longitudinal measures of plasma drug/metabolite concentrations (4 and 8 weeks after the initiation of drug therapy), and single-nucleotide polymorphisms (SNPs) were modelled as additive allelic effects. Genome-wide significant associations were observed for S-CT concentration with SNPs in or near the CYP2C19 gene on chromosome 10 (rs1074145, P = 4.1 × 10(-9) ) and with S-didesmethylcitalopram concentration for SNPs near the CYP2D6 locus on chromosome 22 (rs1065852, P = 2.0 × 10(-16) ), supporting the important role of these cytochrome P450 (CYP) enzymes in biotransformation of citalopram. After adjustment for the effect of CYP2C19 functional alleles, the analyses also identified novel loci that will require future replication and functional validation. In vitro and in vivo studies have suggested that the biotransformation of CT to monodesmethylcitalopram and didesmethylcitalopram is mediated by CYP isozymes. The results of our genome-wide association study performed in MDD patients treated with CT or S-CT have confirmed those observations but also identified novel genomic loci that might play a role in variation in plasma levels of CT or its metabolites during the treatment of MDD patients with these selective serotonin reuptake inhibitors. © 2014 The British Pharmacological Society.
Full Text Available Genome-wide association studies (GWAS have successfully identified a number of single-nucleotide polymorphisms (SNPs associated with colorectal cancer (CRC risk. However, these susceptibility loci known today explain only a small fraction of the genetic risk. Gene-gene interaction (GxG is considered to be one source of the missing heritability. To address this, we performed a genome-wide search for pair-wise GxG associated with CRC risk using 8,380 cases and 10,558 controls in the discovery phase and 2,527 cases and 2,658 controls in the replication phase. We developed a simple, but powerful method for testing interaction, which we term the Average Risk Due to Interaction (ARDI. With this method, we conducted a genome-wide search to identify SNPs showing evidence for GxG with previously identified CRC susceptibility loci from 14 independent regions. We also conducted a genome-wide search for GxG using the marginal association screening and examining interaction among SNPs that pass the screening threshold (p<10(-4. For the known locus rs10795668 (10p14, we found an interacting SNP rs367615 (5q21 with replication p = 0.01 and combined p = 4.19×10(-8. Among the top marginal SNPs after LD pruning (n = 163, we identified an interaction between rs1571218 (20p12.3 and rs10879357 (12q21.1 (nominal combined p = 2.51×10(-6; Bonferroni adjusted p = 0.03. Our study represents the first comprehensive search for GxG in CRC, and our results may provide new insight into the genetic etiology of CRC.
de Boer, Ynto S; van Gerven, Nicole M F; Zwiers, Antonie; Verwer, Bart J; van Hoek, Bart; van Erpecum, Karel J; Beuers, Ulrich; van Buuren, Henk R; Drenth, Joost P H; den Ouden, Jannie W; Verdonk, Robert C; Koek, Ger H; Brouwer, Johannes T; Guichelaar, Maureen M J; Vrolijk, Jan M; Kraal, Georg; Mulder, Chris J J; van Nieuwkerk, Carin M J; Fischer, Janett; Berg, Thomas; Stickel, Felix; Sarrazin, Christoph; Schramm, Christoph; Lohse, Ansgar W; Weiler-Normann, Christina; Lerch, Markus M; Nauck, Matthias; Völzke, Henry; Homuth, Georg; Bloemena, Elisabeth; Verspaget, Hein W; Kumar, Vinod; Zhernakova, Alexandra; Wijmenga, Cisca; Franke, Lude; Bouma, Gerd
Autoimmune hepatitis (AIH) is an uncommon autoimmune liver disease of unknown etiology. We used a genome-wide approach to identify genetic variants that predispose individuals to AIH. We performed a genome-wide association study of 649 adults in The Netherlands with AIH type 1 and 13,436 controls. Initial associations were further analyzed in an independent replication panel comprising 451 patients with AIH type 1 in Germany and 4103 controls. We also performed an association analysis in the discovery cohort using imputed genotypes of the major histocompatibility complex region. We associated AIH with a variant in the major histocompatibility complex region at rs2187668 (P = 1.5 × 10(-78)). Analysis of this variant in the discovery cohort identified HLA-DRB1*0301 (P = 5.3 × 10(-49)) as a primary susceptibility genotype and HLA-DRB1*0401 (P = 2.8 × 10(-18)) as a secondary susceptibility genotype. We also associated AIH with variants of SH2B3 (rs3184504, 12q24; P = 7.7 × 10(-8)) and CARD10 (rs6000782, 22q13.1; P = 3.0 × 10(-6)). In addition, strong inflation of association signal was found with single-nucleotide polymorphisms associated with other immune-mediated diseases, including primary sclerosing cholangitis and primary biliary cirrhosis, but not with single-nucleotide polymorphisms associated with other genetic traits. In a genome-wide association study, we associated AIH type 1 with variants in the major histocompatibility complex region, and identified variants of SH2B3and CARD10 as likely risk factors. These findings support a complex genetic basis for AIH pathogenesis and indicate that part of the genetic susceptibility overlaps with that for other immune-mediated liver diseases. Copyright © 2014 AGA Institute. Published by Elsevier Inc. All rights reserved.
Full Text Available In this study, 796 male Duroc pigs were used to identify genomic regions controlling growth traits. Three production traits were studied: food conversion ratio, days to 100 KG, and average daily gain, using a panel of 39,436 single nucleotide polymorphisms. In total, we detected 11 genome-wide and 162 chromosome-wide single nucleotide polymorphism trait associations. The Gene ontology analysis identified 14 candidate genes close to significant single nucleotide polymorphisms, with growth-related functions: six for days to 100 KG (WT1, FBXO3, DOCK7, PPP3CA, AGPAT9, and NKX6-1, seven for food conversion ratio (MAP2, TBX15, IVL, ARL15, CPS1, VWC2L, and VAV3, and one for average daily gain (COL27A1. Gene ontology analysis indicated that most of the candidate genes are involved in muscle, fat, bone or nervous system development, nutrient absorption, and metabolism, which are all either directly or indirectly related to growth traits in pigs. Additionally, we found four haplotype blocks composed of suggestive single nucleotide polymorphisms located in the growth trait-related quantitative trait loci and further narrowed down the ranges, the largest of which decreased by ~60 Mb. Hence, our results could be used to improve pig production traits by increasing the frequency of favorable alleles via artificial selection.
Mai, Duy Minh; Sahana, Goutam; Christiansen, Freddy
on BTA4, BTA5, BTA13, BTA20, and BTA29 were new QTL for fat index. We found 7 pleiotropic or very closely linked QTL. Most of the QTL were associated with polymorphisms within narrow regions and several may represent the effects of polymorphisms of genes: DGAT1, casein, ARFGAP3, CYP11B1, and CDC...
Henriques, Dora; Chavez-Galarza, Julio; Kryger, Per; Johnston, J. Spencer; De la Rúa, Pilar; Rufino, José; Dall'Olio, Raffaele; Garnery, Lionel; Pinto, M. Alice
The black honey bee, Apis mellifera mellifera L., is probably the honey bee subspecies more threatened by introgression from foreign subspecies, specially lineage C A. m. carnica and A. m. ligustica. In fact, in some areas of its distributional range, intensive beekeeping with foreign subspecies has driven A. m. mellifera populations to nearly replacement. While massive and repeated introductions may lead to loss of native genetic patrimony, a low level of gene flow can also be detrimental be...
Su, Guosheng; Christensen, Ole Fredslund; Ostersen, Tage
of genomic predictions for daily gain in pigs. In the analysis of daily gain, four linear models were used: 1) a simple additive genetic model (MA), 2) a model including both additive and additive by additive epistatic genetic effects (MAE), 3) a model including both additive and dominance genetic effects...
Hayden, Lystra P; Cho, Michael H; McDonald, Merry-Lynn N; Crapo, James D; Beaty, Terri H; Silverman, Edwin K; Hersh, Craig P
Previous studies have indicated that in adult smokers, a history of childhood pneumonia is associated with reduced lung function and chronic obstructive pulmonary disease. There have been few previous investigations using genome-wide association studies to investigate genetic predisposition to pneumonia. This study aims to identify the genetic variants associated with the development of pneumonia during childhood and over the course of the lifetime. Study subjects included current and former smokers with and without chronic obstructive pulmonary disease participating in the COPDGene Study. Pneumonia was defined by subject self-report, with childhood pneumonia categorized as having the first episode at pneumonia (843 cases, 9,091 control subjects) and lifetime pneumonia (3,766 cases, 5,659 control subjects) were performed separately in non-Hispanic whites and African Americans. Non-Hispanic white and African American populations were combined in the meta-analysis. Top genetic variants from childhood pneumonia were assessed in network analysis. No single-nucleotide polymorphisms reached genome-wide significance, although we identified potential regions of interest. In the childhood pneumonia analysis, this included variants in NGR1 (P = 6.3 × 10 -8 ), PAK6 (P = 3.3 × 10 -7 ), and near MATN1 (P = 2.8 × 10 -7 ). In the lifetime pneumonia analysis, this included variants in LOC339862 (P = 8.7 × 10 -7 ), RAPGEF2 (P = 8.4 × 10 -7 ), PHACTR1 (P = 6.1 × 10 -7 ), near PRR27 (P = 4.3 × 10 -7 ), and near MCPH1 (P = 2.7 × 10 -7 ). Network analysis of the genes associated with childhood pneumonia included top networks related to development, blood vessel morphogenesis, muscle contraction, WNT signaling, DNA damage, apoptosis, inflammation, and immune response (P ≤ 0.05). We have identified genes potentially associated with the risk of pneumonia. Further research will be required to confirm these
The nature of the single nucleotide polymorphism (SNP) marker was validated by DNA sequencing of the parental PCR products. Using high resolution melt (HRM) profiles and normalised difference plots, we successfully differentiated the homozygous dominant (wild type), homozygous recessive (LPA) and heterozygous ...
In order to reveal the single nucleotide polymorphisms (SNPs), genotypes and allelic frequencies of each mutation site of TLR7 gene in Chinese native duck breeds, SNPs of duck TLR7 gene were detected by DNA sequencing. The genotypes of 465 native ducks from eight key protected duck breeds were determined by ...
amplified millions to billions of times by means of a PCR before the PCR product ... Keywords. Single nucleotide polymorphism; real time PCR; DNA melting curve analysis. ... VAL158MET SNP and alcoholism and to test for interac- tions between the .... indicate a heterozygote sample (VAL/MET genotype). The curve with ...
Li, Yonghong; Shiffman, Dov; Oberbauer, Rainer
Single nucleotide polymorphisms (SNPs) are the most common type of genetic variants in the human genome. SNPs are known to modify susceptibility to complex diseases. We describe and discuss methods used to identify SNPs associated with disease in case-control studies. An outline on study population selection, sample collection and genotyping platforms is presented, complemented by SNP selection, data preprocessing and analysis.
Ferber, Steven; Reusch, Thorsten B. H.; Stam, Wytze T.; Olsen, Jeanine L.
We characterized 37 single nucleotide polymorphism (SNP) makers for eelgrass Zostera marina. SNP markers were developed using existing EST (expressed sequence tag)-libraries to locate polymorphic loci and develop primers from the functional expressed genes that are deposited in The ZOSTERA database
The present study was investigating the association between the single nucleotide polymorphism +276 G/T of the adiponectin gene with serum adiponectin level in patients with coronary artery disease (CAD). In this study 100 healthy controls and 100 Egyptian patients with coronary artery disease of both genders ...
Prolactin (PRL), a polypeptide hormone synthesized and secreted by the animal's anterior pituitary gland, plays an important role in the regulation of mammalian lactation and avian reproduction. Considering the significant association between single nucleotide polymorphisms (SNPs) in the 5'-flanking region of PRL and ...
Zhang, Dong Feng; Pang, Zengchang; Li, Shuxia
The genetic loci affecting the commonly used BMI have been intensively investigated using linkage approaches in multiple populations. This study aims at performing the first genome-wide linkage scan on BMI in the Chinese population in mainland China with hypothesis that heterogeneity in genetic...... linkage could exist in different ethnic populations. BMI was measured from 126 dizygotic twins in Qingdao municipality who were genotyped using high-resolution Affymetrix Genome-Wide Human SNP arrays containing about 1 million single-nucleotide polymorphisms (SNPs). Nonparametric linkage analysis...... in western countries. Multiple loci showing suggestive linkage were found on chromosome 1 (lod score 2.38 at 242 cM), chromosome 8 (2.48 at 95 cM), and chromosome 14 (2.2 at 89.4 cM). The strong linkage identified in the Chinese subjects that is consistent with that found in populations of European origin...
Bigdeli, Tim B.; Ripke, Stephan; Bacanu, Silviu-Alin
Genome-wide association studies (GWAS) of schizophrenia have yielded more than 100 common susceptibility variants, and strongly support a substantial polygenic contribution of a large number of small allelic effects. It has been hypothesized that familial schizophrenia is largely a consequence...... of inherited rather than environmental factors. We investigated the extent to which familiality of schizophrenia is associated with enrichment for common risk variants detectable in a large GWAS. We analyzed single nucleotide polymorphism (SNP) data for cases reporting a family history of psychotic illness (N...... history subgroup. Comparison of genome-wide polygenic risk scores based on GWAS summary statistics indicated a significant enrichment for SNP effects among family history positive compared to family history negative cases (Nagelkerke's R2=0.0021; P=0.00331; P-value threshold
Full Text Available Traditional genetic association studies are very difficult in bacteria, as the generally limited recombination leads to large linked haplotype blocks, confounding the identification of causative variants. Beta-lactam antibiotic resistance in Streptococcus pneumoniae arises readily as the bacteria can quickly incorporate DNA fragments encompassing variants that make the transformed strains resistant. However, the causative mutations themselves are embedded within larger recombined blocks, and previous studies have only analysed a limited number of isolates, leading to the description of "mosaic genes" as being responsible for resistance. By comparing a large number of genomes of beta-lactam susceptible and non-susceptible strains, the high frequency of recombination should break up these haplotype blocks and allow the use of genetic association approaches to identify individual causative variants. Here, we performed a genome-wide association study to identify single nucleotide polymorphisms (SNPs and indels that could confer beta-lactam non-susceptibility using 3,085 Thai and 616 USA pneumococcal isolates as independent datasets for the variant discovery. The large sample sizes allowed us to narrow the source of beta-lactam non-susceptibility from long recombinant fragments down to much smaller loci comprised of discrete or linked SNPs. While some loci appear to be universal resistance determinants, contributing equally to non-susceptibility for at least two classes of beta-lactam antibiotics, some play a larger role in resistance to particular antibiotics. All of the identified loci have a highly non-uniform distribution in the populations. They are enriched not only in vaccine-targeted, but also non-vaccine-targeted lineages, which may raise clinical concerns. Identification of single nucleotide polymorphisms underlying resistance will be essential for future use of genome sequencing to predict antibiotic sensitivity in clinical microbiology.
Full Text Available Detection of genetic diversity is important for characterisation of crop plant collections in order to detect the presence of valuable trait variation for use in breeding programs. A collection of faba bean (Vicia faba L. genotypes was evaluated for intra- and inter-population diversity using a set of 768 genome-wide distributed single nucleotide polymorphism (SNP markers, of which 657 obtained successful amplification and detected polymorphisms. Gene diversity and polymorphism information content (PIC values varied between 0.022–0.500 and 0.023–1.00, with averages of 0.363 and 0.287, respectively. The genetic structure of the germplasm collection was analysed and a neighbour-joining (NJ dendrogram was constructed. The faba bean accessions grouped into two major groups, with several additional smaller sub-groups, predominantly on the basis of geographical origin. These results were further supported by principal co-ordinate analysis (PCoA, deriving two major groupings which were differentiated on the basis of site of origin and pedigree relationships. In general, high levels of heterozygosity were observed, presumably due to the partially allogamous nature of the species. The results will facilitate targeted crossing strategies in future faba bean breeding programs in order to achieve genetic gain.
Full Text Available Using accumulating SNP (Single-Nucleotide Polymorphism data, we performed a genome-wide search for polypeptide hormone ligands showing changes in the mature regions to elucidate genotype/phenotype diversity among various human populations. Neuropeptide S (NPS, a brain peptide hormone highly conserved in vertebrates, has diverse physiological effects on anxiety, fear, hyperactivity, food intake, and sleeping time through its cognate receptor-NPSR. Here, we report a SNP rs4751440 (L(6-NPS causing non-synonymous substitution on the 6(th position (V to L of the NPS mature peptide region. L(6-NPS has a higher allele frequency in Europeans than other populations and probably originated from European ancestors ~25,000 yrs ago based on haplotype analysis and Approximate Bayesian Computation. Functional analyses indicate that L(6-NPS exhibits a significant lower bioactivity than the wild type NPS, with ~20-fold higher EC50 values in the stimulation of NPSR. Additional evolutionary and mutagenesis studies further demonstrate the importance of the valine residue in the 6(th position for NPS functions. Given the known physiological roles of NPS receptor in inflammatory bowel diseases, asthma pathogenesis, macrophage immune responses, and brain functions, our study provides the basis to elucidate NPS evolution and signaling diversity among human populations.
Full Text Available The regenerative abilities and the immunosuppressive properties of mesenchymal stromal cells (MSCs make them potentially the ideal cellular product of choice for treatment of autoimmune and other immune mediated disorders. Although the usefulness of MSCs for therapeutic applications is in early phases, their potential clinical use remains of great interest. Current clinical evidence of use of MSCs from both autologous and allogeneic sources to treat autoimmune disorders confers conflicting clinical benefit outcomes. These varied results may possibly be due to MSC use across wide range of autoimmune disorders with clinical heterogeneity or due to variability of the cellular product. In the light of recent genome wide association studies (GWAS, linking predisposition of autoimmune diseases to single nucleotide polymorphisms (SNPs in the susceptible genetic loci, the clinical relevance of MSCs possessing SNPs in the critical effector molecules of immunosuppression is largely undiscussed. It is of further interest in the allogeneic setting, where SNPs in the target pathway of MSC's intervention may also modulate clinical outcome. In the present review, we have discussed the known critical SNPs predisposing to disease susceptibility in various autoimmune diseases and their significance in the immunomodulatory properties of MSCs.
Christine E McLaren
Full Text Available The existence of multiple inherited disorders of iron metabolism suggests genetic contributions to iron deficiency. We previously performed a genome-wide association study of iron-related single nucleotide polymorphisms (SNPs using DNA from white men aged ≥ 25 y and women ≥ 50 y in the Hemochromatosis and Iron Overload Screening (HEIRS Study with serum ferritin (SF ≤ 12 µg/L (cases and controls (SF >100 µg/L in men, SF >50 µg/L in women. We report a follow-up study of white, African-American, Hispanic, and Asian HEIRS participants, analyzed for association between SNPs and eight iron-related outcomes. Three chromosomal regions showed association across multiple populations, including SNPs in the TF and TMPRSS6 genes, and on chromosome 18q21. A novel SNP rs1421312 in TMPRSS6 was associated with serum iron in whites (p = 3.7 × 10(-6 and replicated in African Americans (p = 0.0012.Twenty SNPs in the TF gene region were associated with total iron-binding capacity in whites (p<4.4 × 10(-5; six SNPs replicated in other ethnicities (p<0.01. SNP rs10904850 in the CUBN gene on 10p13 was associated with serum iron in African Americans (P = 1.0 × 10(-5. These results confirm known associations with iron measures and give unique evidence of their role in different ethnicities, suggesting origins in a common founder.
Li, Mulin Jun; Wang, Junwen
As high throughput methods, such as whole genome genotyping arrays, whole exome sequencing (WES) and whole genome sequencing (WGS), have detected huge amounts of genetic variants associated with human diseases, function annotation of these variants is an indispensable step in understanding disease etiology. Large-scale functional genomics projects, such as The ENCODE Project and Roadmap Epigenomics Project, provide genome-wide profiling of functional elements across different human cell types and tissues. With the urgent demands for identification of disease-causal variants, comprehensive and easy-to-use annotation tool is highly in demand. Here we review and discuss current progress and trend of the variant annotation field. Furthermore, we introduce a comprehensive web portal for annotating human genetic variants. We use gene-based features and the latest functional genomics datasets to annotate single nucleotide variation (SNVs) in human, at whole genome scale. We further apply several function prediction algorithms to annotate SNVs that might affect different biological processes, including transcriptional gene regulation, alternative splicing, post-transcriptional regulation, translation and post-translational modifications. The SNVrap web portal is freely available at http://jjwanglab.org/snvrap. Copyright © 2014 Elsevier Inc. All rights reserved.
Vilella Albert J
Full Text Available Abstract Background DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. Results We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i exhaustive population-genetic analyses including those based on the coalescent theory; ii analysis adapted to the shallow data generated by the high-throughput genome projects; iii use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v visualization of the results integrated with current genome annotations in commonly available genome browsers. Conclusion VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data.
Renton, Alan E.; Pliner, Hannah A.; Provenzano, Carlo; Evoli, Amelia; Ricciardi, Roberta; Nalls, Michael A.; Marangi, Giuseppe; Abramzon, Yevgeniya; Arepalli, Sampath; Chong, Sean; Hernandez, Dena G.; Johnson, Janel O.; Bartoccioni, Emanuela; Scuderi, Flavia; Maestri, Michelangelo; Raphael Gibbs, J.; Errichiello, Edoardo; Chiò, Adriano; Restagno, Gabriella; Sabatelli, Mario; Macek, Mark; Scholz, Sonja W.; Corse, Andrea; Chaudhry, Vinay; Benatar, Michael; Barohn, Richard J.; McVey, April; Pasnoor, Mamatha; Dimachkie, Mazen M.; Rowin, Julie; Kissel, John; Freimer, Miriam; Kaminski, Henry J.; Sanders, Donald B.; Lipscomb, Bernadette; Massey, Janice M.; Chopra, Manisha; Howard, James F.; Koopman, Wilma J.; Nicolle, Michael W.; Pascuzzi, Robert M.; Pestronk, Alan; Wulf, Charlie; Florence, Julaine; Blackmore, Derrick; Soloway, Aimee; Siddiqi, Zaeem; Muppidi, Srikanth; Wolfe, Gil; Richman, David; Mezei, Michelle M.; Jiwa, Theresa; Oger, Joel; Drachman, Daniel B.; Traynor, Bryan J.
IMPORTANCE Myasthenia gravis is a chronic, autoimmune, neuromuscular disease characterized by fluctuating weakness of voluntary muscle groups. Although genetic factors are known to play a role in this neuroimmunological condition, the genetic etiology underlying myasthenia gravis is not well understood. OBJECTIVE To identify genetic variants that alter susceptibility to myasthenia gravis, we performed a genome-wide association study. DESIGN, SETTING, AND PARTICIPANTS DNA was obtained from 1032 white individuals from North America diagnosed as having acetylcholine receptor antibody–positive myasthenia gravis and 1998 race/ethnicity-matched control individuals from January 2010 to January 2011. These samples were genotyped on Illumina OmniExpress single-nucleotide polymorphism arrays. An independent cohort of 423 Italian cases and 467 Italian control individuals were used for replication. MAIN OUTCOMES AND MEASURES We calculated P values for association between 8114394 genotyped and imputed variants across the genome and risk for developing myasthenia gravis using logistic regression modeling. A threshold P value of 5.0 × 10−8 was set for genome-wide significance after Bonferroni correction for multiple testing. RESULTS In the over all case-control cohort, we identified association signals at CTLA4 (rs231770; P = 3.98 × 10−8; odds ratio, 1.37; 95% CI, 1.25–1.49), HLA-DQA1 (rs9271871; P = 1.08 × 10−8; odds ratio, 2.31; 95% CI, 2.02 – 2.60), and TNFRSF11A (rs4263037; P = 1.60 × 10−9; odds ratio, 1.41; 95% CI, 1.29–1.53). These findings replicated for CTLA4 and HLA-DQA1 in an independent cohort of Italian cases and control individuals. Further analysis revealed distinct, but overlapping, disease-associated loci for early- and late-onset forms of myasthenia gravis. In the late-onset cases, we identified 2 association peaks: one was located in TNFRSF11A (rs4263037; P = 1.32 × 10−12; odds ratio, 1.56; 95% CI, 1.44–1.68) and the other was detected
Zheng, Xiaoying; Hoffmann, Ary; Xi, Zhiyong; Zhang, Dongjing; Rasic, Gordana; Schmidt, Thomas
Aedes albopictus is a highly invasive disease vector with an expanding worldwide distribution. Genetic assays using low to medium resolution markers have found little evidence of spatial genetic structure even at broad geographic scales, suggesting frequent passive movement along human transportation networks. Here we analysed genetic structure of Ae. albopictus collected from 12 sample sites in Guangzhou, China, using thousands of genome-wide single nucleotide polymorphisms (SNPs). We found ...
Strawbridge, Rona; Dupuis, Josée; Prokopenko, Inga; Barker, Adam; Ahlqvist, Emma; Rybin, Denis; Petrie, John; Bouatia-Naji, Nabila; Dimas, Antigone; Wheeler, Eleanor; Chen, Han; Voight, Benjamin; Taneera, Jalal; Kanoni, Stavroula; Peden, John
textabstractOBJECTIVE - Proinsulin is a precursor of mature insulin and C-peptide. Higher circulating proinsulin levels are associated with impaired b-cell function, raised glucose levels, insulin resistance, and type 2 diabetes (T2D). Studies of the insulin processing pathway could provide new insights about T2D pathophysiology. RESEARCH DESIGN AND METHODS - We have conducted a meta-analysis of genome-wide association tests of ;2.5 million genotyped or imputed single nucleotide polymorphisms...
Middeldorp, Christel M.; Hammerschlag, Anke R.; Ouwens, Klaasjan G.; Groen-Blokhuis, Maria M.; St. Pourcain, Beate; Greven, Corina U.; Pappa, Irene; Tiesler, Carla M.T.; Ang, Wei; Nolte, Ilja M.; Vilor-Tejedor, Natalia; Bacelis, Jonas; Ebejer, Jane L.; Zhao, Huiying; Davies, Gareth E.
ObjectiveTo elucidate the influence of common genetic variants on childhood attention-deficit/hyperactivity disorder (ADHD) symptoms, to identify genetic variants that explain its high heritability, and to investigate the genetic overlap of ADHD symptom scores with ADHD diagnosis.MethodWithin the EArly Genetics and Lifecourse Epidemiology (EAGLE) consortium, genome-wide single nucleotide polymorphisms (SNPs) and ADHD symptom scores were available for 17,666 children (< 13 years) from nine ...
Full Text Available Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions. The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila-like 2] gene located on 9p21.3 (p = 0.00087. In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026. The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology.
Akkelies E Dijkstra
Full Text Available Chronic mucus hypersecretion (CMH is associated with an increased frequency of respiratory infections, excess lung function decline, and increased hospitalisation and mortality rates in the general population. It is associated with smoking, but it is unknown why only a minority of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA study of CMH in Caucasian populations.GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years. Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism (SNP.A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10(-6, OR = 1.17, located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1 on chromosome 3. The risk allele (G was associated with higher mRNA expression of SATB1 (4.3×10(-9 in lung tissue. Presence of CMH was associated with increased SATB1 mRNA expression in bronchial biopsies from COPD patients. SATB1 expression was induced during differentiation of primary human bronchial epithelial cells in culture.Our findings, that SNP rs6577641 is associated with CMH in multiple cohorts and is a cis-eQTL for SATB1, together with our additional observation that SATB1 expression increases during epithelial differentiation provide suggestive evidence that SATB1 is a gene that affects CMH.
Barbara E Stranger
Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.
Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.
Liu, Jin; Huang, Jian; Ma, Shuangge
Genome-wide association studies have been extensively conducted, searching for markers for biologically meaningful outcomes and phenotypes. Penalization methods have been adopted in the analysis of the joint effects of a large number of SNPs (single nucleotide polymorphisms) and marker identification. This study is partly motivated by the analysis of heterogeneous stock mice dataset, in which multiple correlated phenotypes and a large number of SNPs are available. Existing penalization methods designed to analyze a single response variable cannot accommodate the correlation among multiple response variables. With multiple response variables sharing the same set of markers, joint modeling is first employed to accommodate the correlation. The group Lasso approach is adopted to select markers associated with all the outcome variables. An efficient computational algorithm is developed. Simulation study and analysis of the heterogeneous stock mice dataset show that the proposed method can outperform existing penalization methods. PMID:23272092
Full Text Available Cardiovascular diseases are a large contributor to causes of early death in developed countries. Some of these conditions, such as sudden cardiac death and atrial fibrillation, stem from arrhythmias—a spectrum of conditions with abnormal electrical activity in the heart. Genome-wide association studies can identify single nucleotide variations (SNVs that may predispose individuals to developing acquired forms of arrhythmias. Through manual curation of published genome-wide association studies, we have collected a comprehensive list of 75 SNVs associated with cardiac arrhythmias. Ten of the SNVs result in amino acid changes and can be used in proteomic-based detection methods. In an effort to identify additional non-synonymous mutations that affect the proteome, we analyzed the post-translational modification S-nitrosylation, which is known to affect cardiac arrhythmias. We identified loss of seven known S-nitrosylation sites due to non-synonymous single nucleotide variations (nsSNVs. For predicted nitrosylation sites we found 1429 proteins where the sites are modified due to nsSNV. Analysis of the predicted S-nitrosylation dataset for over- or under-representation (compared to the complete human proteome of pathways and functional elements shows significant statistical over-representation of the blood coagulation pathway. Gene Ontology (GO analysis displays statistically over-represented terms related to muscle contraction, receptor activity, motor activity, cystoskeleton components, and microtubule activity. Through the genomic and proteomic context of SNVs and S-nitrosylation sites presented in this study, researchers can look for variation that can predispose individuals to cardiac arrhythmias. Such attempts to elucidate mechanisms of arrhythmia thereby add yet another useful parameter in predicting susceptibility for cardiac diseases.
Kasperaviciūte, Dalia; Catarino, Claudia B; Heinzen, Erin L; Depondt, Chantal; Cavalleri, Gianpiero L; Caboclo, Luis O; Tate, Sarah K; Jamnadas-Khoda, Jenny; Chinthapalli, Krishna; Clayton, Lisa M S; Shianna, Kevin V; Radtke, Rodney A; Mikati, Mohamad A; Gallentine, William B; Husain, Aatif M; Alhusaini, Saud; Leppert, David; Middleton, Lefkos T; Gibson, Rachel A; Johnson, Michael R; Matthews, Paul M; Hosford, David; Heuser, Kjell; Amos, Leslie; Ortega, Marcos; Zumsteg, Dominik; Wieser, Heinz-Gregor; Steinhoff, Bernhard J; Krämer, Günter; Hansen, Jörg; Dorn, Thomas; Kantanen, Anne-Mari; Gjerstad, Leif; Peuralinna, Terhi; Hernandez, Dena G; Eriksson, Kai J; Kälviäinen, Reetta K; Doherty, Colin P; Wood, Nicholas W; Pandolfo, Massimo; Duncan, John S; Sander, Josemir W; Delanty, Norman; Goldstein, David B; Sisodiya, Sanjay M
Partial epilepsies have a substantial heritability. However, the actual genetic causes are largely unknown. In contrast to many other common diseases for which genetic association-studies have successfully revealed common variants associated with disease risk, the role of common variation in partial epilepsies has not yet been explored in a well-powered study. We undertook a genome-wide association-study to identify common variants which influence risk for epilepsy shared amongst partial epilepsy syndromes, in 3445 patients and 6935 controls of European ancestry. We did not identify any genome-wide significant association. A few single nucleotide polymorphisms may warrant further investigation. We exclude common genetic variants with effect sizes above a modest 1.3 odds ratio for a single variant as contributors to genetic susceptibility shared across the partial epilepsies. We show that, at best, common genetic variation can only have a modest role in predisposition to the partial epilepsies when considered across syndromes in Europeans. The genetic architecture of the partial epilepsies is likely to be very complex, reflecting genotypic and phenotypic heterogeneity. Larger meta-analyses are required to identify variants of smaller effect sizes (odds ratio<1.3) or syndrome-specific variants. Further, our results suggest research efforts should also be directed towards identifying the multiple rare variants likely to account for at least part of the heritability of the partial epilepsies. Data emerging from genome-wide association-studies will be valuable during the next serious challenge of interpreting all the genetic variation emerging from whole-genome sequencing studies.
Børglum, A D; Demontis, D; Grove, J; Pallesen, J; Hollegaard, M V; Pedersen, C B; Hedemand, A; Mattheisen, M; Uitterlinden, A; Nyegaard, M; Ørntoft, T; Wiuf, C; Didriksen, M; Nordentoft, M; Nöthen, M M; Rietschel, M; Ophoff, R A; Cichon, S; Yolken, R H; Hougaard, D M; Mortensen, P B; Mors, O
Genetic and environmental components as well as their interaction contribute to the risk of schizophrenia, making it highly relevant to include environmental factors in genetic studies of schizophrenia. This study comprises genome-wide association (GWA) and follow-up analyses of all individuals born in Denmark since 1981 and diagnosed with schizophrenia as well as controls from the same birth cohort. Furthermore, we present the first genome-wide interaction survey of single nucleotide polymorphisms (SNPs) and maternal cytomegalovirus (CMV) infection. The GWA analysis included 888 cases and 882 controls, and the follow-up investigation of the top GWA results was performed in independent Danish (1396 cases and 1803 controls) and German-Dutch (1169 cases, 3714 controls) samples. The SNPs most strongly associated in the single-marker analysis of the combined Danish samples were rs4757144 in ARNTL (P=3.78 × 10(-6)) and rs8057927 in CDH13 (P=1.39 × 10(-5)). Both genes have previously been linked to schizophrenia or other psychiatric disorders. The strongest associated SNP in the combined analysis, including Danish and German-Dutch samples, was rs12922317 in RUNDC2A (P=9.04 × 10(-7)). A region-based analysis summarizing independent signals in segments of 100 kb identified a new region-based genome-wide significant locus overlapping the gene ZEB1 (P=7.0 × 10(-7)). This signal was replicated in the follow-up analysis (P=2.3 × 10(-2)). Significant interaction with maternal CMV infection was found for rs7902091 (P(SNP × CMV)=7.3 × 10(-7)) in CTNNA3, a gene not previously implicated in schizophrenia, stressing the importance of including environmental factors in genetic studies.
Welderufael, B. G.; Løvendahl, Peter; de Koning, Dirk-Jan; Janss, Lucas L. G.; Fikse, W. F.
Because mastitis is very frequent and unavoidable, adding recovery information into the analysis for genetic evaluation of mastitis is of great interest from economical and animal welfare point of view. Here we have performed genome-wide association studies (GWAS) to identify associated single nucleotide polymorphisms (SNPs) and investigate the genetic background not only for susceptibility to – but also for recoverability from mastitis. Somatic cell count records from 993 Danish Holstein cows genotyped for a total of 39378 autosomal SNP markers were used for the association analysis. Single SNP regression analysis was performed using the statistical software package DMU. Substitution effect of each SNP was tested with a t-test and a genome-wide significance level of P-value mastitis were located in or very near to genes that have been reported for their role in the immune system. Genes involved in lymphocyte developments (e.g., MAST3 and STAB2) and genes involved in macrophage recruitment and regulation of inflammations (PDGFD and PTX3) were suggested as possible causal genes for susceptibility to – and recoverability from mastitis, respectively. However, this is the first GWAS study for recoverability from mastitis and our results need to be validated. The findings in the current study are, therefore, a starting point for further investigations in identifying causal genetic variants or chromosomal regions for both susceptibility to – and recoverability from mastitis. PMID:29755506
Full Text Available Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed "damaged DNA immunoprecipitation" (dDIP, uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage.
Adkins, D E; Clark, S L; Åberg, K; Hettema, J M; Bukszár, J; McClay, J L; Souza, R P; van den Oord, E J C G
Affecting about 1 in 12 Americans annually, depression is a leading cause of the global disease burden. While a range of effective antidepressants are now available, failure and relapse rates remain substantial, with intolerable side effect burden the most commonly cited reason for discontinuation. Thus, understanding individual differences in susceptibility to antidepressant therapy side effects will be essential to optimize depression treatment. Here we perform genome-wide association studies (GWAS) to identify genetic variation influencing susceptibility to citalopram-induced side effects. The analysis sample consisted of 1762 depression patients, successfully genotyped for 421K single-nucleotide polymorphisms (SNPs), from the Sequenced Treatment Alternatives to Relieve Depression (STAR(*)D) study. Outcomes included five indicators of citalopram side effects: general side effect burden, overall tolerability, sexual side effects, dizziness and vision/hearing side effects. Two SNPs met our genome-wide significance criterion (qeffects of citalopram on vision/hearing side effects (P=3.27 × 10(-8), q=0.026). The second genome-wide significant finding, representing a haplotype spanning ∼30 kb and eight genotyped SNPs in a gene desert on chromosome 13, was associated with general side effect burden (P=3.22 × 10(-7), q=0.096). Suggestive findings were also found for SNPs at LAMA1, AOX2P, EGFLAM, FHIT and RTP2. Although our findings require replication and functional validation, this study demonstrates the potential of GWAS to discover genes and pathways that potentially mediate adverse effects of antidepressant medications.
Li, Jingyun; Zhang, Yuan; Zhang, Luo
Allergic rhinitis and allergy are complex conditions, in which both genetic and environmental factors contribute to the pathogenesis. Genome-wide association studies (GWASs) employing common single-nucleotide polymorphisms have accelerated the search for novel and interesting genes, and also confirmed the role of some previously described genes which may be involved in the cause of allergic rhinitis and allergy. The aim of this review is to provide an overview of the genetic basis of allergic rhinitis and the associated allergic phenotypes, with particular focus on GWASs. The last decade has been marked by the publication of more than 20 GWASs of allergic rhinitis and the associated allergic phenotypes. Allergic diseases and traits have been shown to share a large number of genetic susceptibility loci, of which IL33/IL1RL1, IL-13-RAD50 and C11orf30/LRRC32 appear to be important for more than two allergic phenotypes. GWASs have further reflected the genetic heterogeneity underlying allergic phenotypes. Large-scale genome-wide association strategies are underway to discover new susceptibility variants for allergic rhinitis and allergic phenotypes. Characterization of the underlying genetics provides us with an insight into the potential targets for future studies and the corresponding interventions.
Okbay, Aysu; Beauchamp, Jonathan P.; Fontana, Mark A.; Lee, James J.; Pers, Tune H.; Rietveld, Cornelius A.; Turley, Patrick; Chen, Guo-Bo; Emilsson, Valur; Meddens, S. Fleur W.; Oskarsson, Sven; Pickrell, Joseph K.; Thom, Kevin; Timshel, Pascal; de Vlaming, Ronald; Abdellaoui, Abdel; Ahluwalia, Tarunveer S.; Bacelis, Jonas; Baumbach, Clemens; Bjornsdottir, Gyda; Brandsma, Johannes H.; Concas, Maria Pina; Derringer, Jaime; Furlotte, Nicholas A.; Galesloot, Tessel E.; Girotto, Giorgia; Gupta, Richa; Hall, Leanne M.; Harris, Sarah E.; Hofer, Edith; Horikoshi, Momoko; Huffman, Jennifer E.; Kaasik, Kadri; Kalafati, Ioanna P.; Karlsson, Robert; Kong, Augustine; Lahti, Jari; van der Lee, Sven J.; de Leeuw, Christiaan; Lind, Penelope A.; Lindgren, Karl-Oskar; Liu, Tian; Mangino, Massimo; Marten, Jonathan; Mihailov, Evelin; Miller, Michael B.; van der Most, Peter J.; Oldmeadow, Christopher; Payton, Antony; Pervjakova, Natalia; Peyrot, Wouter J.; Qian, Yong; Raitakari, Olli; Rueedi, Rico; Salvi, Erika; Schmidt, Börge; Schraut, Katharina E.; Shi, Jianxin; Smith, Albert V.; Poot, Raymond A.; Pourcain, Beate; Teumer, Alexander; Thorleifsson, Gudmar; Verweij, Niek; Vuckovic, Dragana; Wellmann, Juergen; Westra, Harm-Jan; Yang, Jingyun; Zhao, Wei; Zhu, Zhihong; Alizadeh, Behrooz Z.; Amin, Najaf; Bakshi, Andrew; Baumeister, Sebastian E.; Biino, Ginevra; Bønnelykke, Klaus; Boyle, Patricia A.; Campbell, Harry; Cappuccio, Francesco P.; Davies, Gail; De Neve, Jan-Emmanuel; Deloukas, Panos; Demuth, Ilja; Ding, Jun; Eibich, Peter; Eisele, Lewin; Eklund, Niina; Evans68, David M.; Faul, Jessica D.; Feitosa, Mary F.; Forstner, Andreas J.; Gandin, Ilaria; Gunnarsson, Bjarni; Halldórsson, Bjarni V.; Harris, Tamara B.; Heath, Andrew C.; Hocking, Lynne J.; Holliday, Elizabeth G.; Homuth, Georg; Horan, Michael A.; Hottenga, Jouke-Jan; de Jager, Philip L.; Joshi, Peter K.; Jugessur, Astanand; Kaakinen, Marika A.; Kähönen, Mika; Kanoni, Stavroula; Keltigangas-Järvinen, Liisa; Kiemeney, Lambertus A.L.M.; Kolcic, Ivana; Koskinen, Seppo; Kraja, Aldi T.; Kroh, Martin; Kutalik, Zoltan; Latvala, Antti; Launer, Lenore J.; Lebreton, Maël P.; Levinson, Douglas F.; Lichtenstein, Paul; Lichtner, Peter; Liewald, David C.M.; Loukola, Anu; Madden, Pamela A.; Mägi, Reedik; Mäki-Opas, Tomi; Marioni, Riccardo E.; Marques-Vidal, Pedro; Meddens, Gerardus A.; McMahon, George; Meisinger, Christa; Meitinger, Thomas; Milaneschi, Yusplitri; Milani, Lili; Montgomery, Grant W.; Myhre, Ronny; Nelson, Christopher P.; Nyholt, Dale R.; Ollier, William E.R.; Palotie, Aarno; Paternoster, Lavinia; Pedersen, Nancy L.; Petrovic, Katja E.; Porteous, David J.; Räikkönen, Katri; Ring, Susan M.; Robino, Antonietta; Rostapshova, Olga; Rudan, Igor; Rustichini, Aldo; Salomaa, Veikko; Sanders, Alan R.; Sarin, Antti-Pekka; Schmidt, Helena; Scott, Rodney J.; Smith, Blair H.; Smith, Jennifer A.; Staessen, Jan A.; Steinhagen-Thiessen, Elisabeth; Strauch, Konstantin; Terracciano, Antonio; Tobin, Martin D.; Ulivi, Sheila; Vaccargiu, Simona; Quaye, Lydia; van Rooij, Frank J.A.; Venturini, Cristina; Vinkhuyzen, Anna A.E.; Völker, Uwe; Völzke, Henry; Vonk, Judith M.; Vozzi, Diego; Waage, Johannes; Ware, Erin B.; Willemsen, Gonneke; Attia, John R.; Bennett, David A.; Berger, Klaus; Bertram, Lars; Bisgaard, Hans; Boomsma, Dorret I.; Borecki, Ingrid B.; Bultmann, Ute; Chabris, Christopher F.; Cucca, Francesco; Cusi, Daniele; Deary, Ian J.; Dedoussis, George V.; van Duijn, Cornelia M.; Eriksson, Johan G.; Franke, Barbara; Franke, Lude; Gasparini, Paolo; Gejman, Pablo V.; Gieger, Christian; Grabe, Hans-Jörgen; Gratten, Jacob; Groenen, Patrick J.F.; Gudnason, Vilmundur; van der Harst, Pim; Hayward, Caroline; Hinds, David A.; Hoffmann, Wolfgang; Hyppönen, Elina; Iacono, William G.; Jacobsson, Bo; Järvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Kaprio, Jaakko; Kardia, Sharon L.R.; Lehtimäki, Terho; Lehrer, Steven F.; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; Metspalu, Andres; Pendleton, Neil; Penninx, Brenda W.J.H.; Perola, Markus; Pirastu, Nicola; Pirastu, Mario; Polasek, Ozren; Posthuma, Danielle; Power, Christine; Province, Michael A.; Samani, Nilesh J.; Schlessinger, David; Schmidt, Reinhold; Sørensen, Thorkild I.A.; Spector, Tim D.; Stefansson, Kari; Thorsteinsdottir, Unnur; Thurik, A. Roy; Timpson, Nicholas J.; Tiemeier, Henning; Tung, Joyce Y.; Uitterlinden, André G.; Vitart, Veronique; Vollenweider, Peter; Weir, David R.; Wilson, James F.; Wright, Alan F.; Conley, Dalton C.; Krueger, Robert F.; Smith, George Davey; Hofman, Albert; Laibson, David I.; Medland, Sarah E.; Meyer, Michelle N.; Yang, Jian; Johannesson, Magnus; Visscher, Peter M.; Esko, Tõnu; Koellinger, Philipp D.; Cesarini, David; Benjamin, Daniel J.
Summary Educational attainment (EA) is strongly influenced by social and other environmental factors, but genetic factors are also estimated to account for at least 20% of the variation across individuals1. We report the results of a genome-wide association study (GWAS) for EA that extends our earlier discovery sample1,2 of 101,069 individuals to 293,723 individuals, and a replication in an independent sample of 111,349 individuals from the UK Biobank. We now identify 74 genome-wide significant loci associated with number of years of schooling completed. Single-nucleotide polymorphisms (SNPs) associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioral phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because EA is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric disease. PMID:27225129
Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMcahon, Katherine D.; Mamlstrom, Rex R.
Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ecotype model? of diversification, but not previously observed in natural populations.
Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMahon, Katherine D.; Malmstrom, Rex R.
Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.
Markt, Sarah C; Nuttall, Elizabeth; Turman, Constance; Sinnott, Jennifer; Rimm, Eric B; Ecsedy, Ethan; Unger, Robert H; Fall, Katja; Finn, Stephen; Jensen, Majken K; Rider, Jennifer R; Kraft, Peter; Mucci, Lorelei A
To determine the inherited factors associated with the ability to smell asparagus metabolites in urine. Genome wide association study. Nurses' Health Study and Health Professionals Follow-up Study cohorts. 6909 men and women of European-American descent with available genetic data from genome wide association studies. Participants were characterized as asparagus smellers if they strongly agreed with the prompt "after eating asparagus, you notice a strong characteristic odor in your urine," and anosmic if otherwise. We calculated per-allele estimates of asparagus anosmia for about nine million single nucleotide polymorphisms using logistic regression. P values asparagus anosmia, all in a region on chromosome 1 (1q44: 248139851-248595299) containing multiple genes in the olfactory receptor 2 (OR2) family. Conditional analyses revealed three independent markers associated with asparagus anosmia: rs13373863, rs71538191, and rs6689553. A large proportion of people have asparagus anosmia. Genetic variation near multiple olfactory receptor genes is associated with the ability of an individual to smell the metabolites of asparagus in urine. Future replication studies are necessary before considering targeted therapies to help anosmic people discover what they are missing. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Markt, Sarah C; Nuttall, Elizabeth; Turman, Constance; Sinnott, Jennifer; Rimm, Eric B; Ecsedy, Ethan; Unger, Robert H; Fall, Katja; Finn, Stephen; Jensen, Majken K; Rider, Jennifer R; Kraft, Peter
Objective To determine the inherited factors associated with the ability to smell asparagus metabolites in urine. Design Genome wide association study. Setting Nurses’ Health Study and Health Professionals Follow-up Study cohorts. Participants 6909 men and women of European-American descent with available genetic data from genome wide association studies. Main outcome measure Participants were characterized as asparagus smellers if they strongly agreed with the prompt “after eating asparagus, you notice a strong characteristic odor in your urine,” and anosmic if otherwise. We calculated per-allele estimates of asparagus anosmia for about nine million single nucleotide polymorphisms using logistic regression. P values asparagus anosmia, all in a region on chromosome 1 (1q44: 248139851-248595299) containing multiple genes in the olfactory receptor 2 (OR2) family. Conditional analyses revealed three independent markers associated with asparagus anosmia: rs13373863, rs71538191, and rs6689553. Conclusion A large proportion of people have asparagus anosmia. Genetic variation near multiple olfactory receptor genes is associated with the ability of an individual to smell the metabolites of asparagus in urine. Future replication studies are necessary before considering targeted therapies to help anosmic people discover what they are missing. PMID:27965198
Yin, Chang Shik; Park, Hi Joon; Chung, Joo-Ho; Lee, Hye-Jung; Lee, Byung-Cheol
Four-constitution medicine (FCM), also known as Sasang constitutional medicine, and the heritage of the long history of individualized acupuncture medicine tradition, is one of the holistic and traditional systems of constitution to appraise and categorize individual differences into four major types. This study first reports a genome-wide association study on FCM, to explore the genetic basis of FCM and facilitate the integration of FCM with conventional individual differences research. Healthy individuals of the Korean population were classified into the four constitutional types (FCTs). A total of 353,202 single nucleotide polymorphisms (SNPs) were typed using whole genome amplified samples, and six-way comparison of FCM types provided lists of significantly differential SNPs. In one-to-one FCT comparisons, 15,944 SNPs were significantly differential, and 5 SNPs were commonly significant in all of the three comparisons. In one-to-two FCT comparisons, 22,616 SNPs were significantly differential, and 20 SNPs were commonly significant in all of the three comparison groups. This study presents the association between genome-wide SNP profiles and the categorization of the FCM, and it could further provide a starting point of genome-based identification and research of the constitutions of FCM.
Full Text Available Genome-wide association studies (GWAS using single nucleotide polymorphisms (SNPs have identified more than 50 loci associated with estimated glomerular filtration rate (eGFR, a measure of kidney function. However, significant SNPs account for a small proportion of eGFR variability. Other forms of genetic variation have not been comprehensively evaluated for association with eGFR. In this study, we assess whether changes in germline DNA copy number are associated with GFR estimated from serum creatinine, eGFRcrea. We used hidden Markov models (HMMs to identify copy number polymorphic regions (CNPs from high-throughput SNP arrays for 2,514 African (AA and 8,645 European ancestry (EA participants in the Atherosclerosis Risk in Communities (ARIC study. Separately for the EA and AA cohorts, we used Bayesian Gaussian mixture models to estimate copy number at regions identified by the HMM or previously reported in the HapMap Project. We identified 312 and 464 autosomal CNPs among individuals of EA and AA, respectively. Multivariate models adjusted for SNP-derived covariates of population structure identified one CNP in the EA cohort near genome-wide statistical significance (Bonferroni-adjusted p = 0.067 located on chromosome 5 (876-880kb. Overall, our findings suggest a limited role of CNPs in explaining eGFR variability.
Klos, Kathy Esvelt; Yimer, Belayneh A; Babiker, Ebrahiem M; Beattie, Aaron D; Bonman, J Michael; Carson, Martin L; Chong, James; Harrison, Stephen A; Ibrahim, Amir M H; Kolb, Frederic L; McCartney, Curt A; McMullen, Michael; Fetch, Jennifer Mitchell; Mohammadi, Mohsen; Murphy, J Paul; Tinker, Nicholas A
Oat crown rust, caused by f. sp. , is a major constraint to oat ( L.) production in many parts of the world. In this first comprehensive multienvironment genome-wide association map of oat crown rust, we used 2972 single-nucleotide polymorphisms (SNPs) genotyped on 631 oat lines for association mapping of quantitative trait loci (QTL). Seedling reaction to crown rust in these lines was assessed as infection type (IT) with each of 10 crown rust isolates. Adult plant reaction was assessed in the field in a total of 10 location-years as percentage severity (SV) and as infection reaction (IR) in a 0-to-1 scale. Overall, 29 SNPs on 12 linkage groups were predictive of crown rust reaction in at least one experiment at a genome-wide level of statistical significance. The QTL identified here include those in regions previously shown to be linked with seedling resistance genes , , , , , and and also with adult-plant resistance and adaptation-related QTL. In addition, QTL on linkage groups Mrg03, Mrg08, and Mrg23 were identified in regions not previously associated with crown rust resistance. Evaluation of marker genotypes in a set of crown rust differential lines supported as the identity of . The SNPs with rare alleles associated with lower disease scores may be suitable for use in marker-assisted selection of oat lines for crown rust resistance. Copyright © 2017 Crop Science Society of America.
Sasayama, Daimei; Hattori, Kotaro; Ogawa, Shintaro; Yokota, Yuuki; Matsumura, Ryo; Teraishi, Toshiya; Hori, Hiroaki; Ota, Miho; Yoshida, Sumiko; Kunugi, Hiroshi
Cerebrospinal fluid (CSF) is virtually the only one accessible source of proteins derived from the central nervous system (CNS) of living humans and possibly reflects the pathophysiology of a variety of neuropsychiatric diseases. However, little is known regarding the genetic basis of variation in protein levels of human CSF. We examined CSF levels of 1,126 proteins in 133 subjects and performed a genome-wide association analysis of 514,227 single nucleotide polymorphisms (SNPs) to detect protein quantitative trait loci (pQTLs). To be conservative, Spearman's correlation was used to identify an association between genotypes of SNPs and protein levels. A total of 421 cis and 25 trans SNP-protein pairs were significantly correlated at a false discovery rate (FDR) of less than 0.01 (nominal P genome-wide association studies. The present findings suggest that genetic variations play an important role in the regulation of protein expression in the CNS. The obtained database may serve as a valuable resource to understand the genetic bases for CNS protein expression pattern in humans. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: firstname.lastname@example.org.
Beaty, Terri H; Ruczinski, Ingo; Murray, Jeffrey C
Nonsyndromic cleft palate (CP) is a common birth defect with a complex and heterogeneous etiology involving both genetic and environmental risk factors. We conducted a genome-wide association study (GWAS) using 550 case-parent trios, ascertained through a CP case collected in an international...... consortium. Family-based association tests of single nucleotide polymorphisms (SNP) and three common maternal exposures (maternal smoking, alcohol consumption, and multivitamin supplementation) were used in a combined 2 df test for gene (G) and gene-environment (G × E) interaction simultaneously, plus...... multiple SNPs associated with higher risk of CP in the presence of maternal smoking. Additional evidence of reduced risk due to G × E interaction in the presence of multivitamin supplementation was observed for SNPs in BAALC on chr. 8. These results emphasize the need to consider G × E interaction when...
Zhang, G X; Fan, Q C; Wang, J Y; Zhang, T; Xue, Q; Shi, H Q
To identify molecular markers and candidate genes associated with reproductive traits, a genome-wide analysis was performed in Jinghai Yellow Chickens to analyze body weight at first oviposition (BWF), age at first oviposition (AFE), weight of the egg at first oviposition (FEW), egg weight at the age of 300 days (EW300), number of eggs produced by 300 days of age (EN300), egg hatchability (HA) and multiple selection index for egg production (MSI). The results showed that seven single nucleotide polymorphisms (SNPs) were associated with reproductive traits (Preproductive traits were identified (Preproductive traits will greatly advance the understanding of the genetic basis and molecular mechanisms underlying reproductive traits and may have practical significance in breeding programs for the improvements of reproductive traits in the Jinghai Yellow Chicken. Copyright © 2015 Elsevier B.V. All rights reserved.
Kühnisch, Jan; Thiering, Elisabeth; Heitmüller, Daniela; Tiesler, Carla M T; Grallert, Harald; Heinrich-Weltzien, Roswitha; Hickel, Reinhard; Heinrich, Joachim
This genome-wide association study (GWAS) investigated the relationship between molar-incisor hypomineralization (MIH) and possible genetic loci. Clinical and genetic data from the 10-year follow-up of 668 children from the Munich GINI-plus and LISA-plus birth cohort studies were analyzed. The dental examinations included the diagnosis of MIH according to the criteria of the European Academy of Paediatric Dentistry (EAPD). Children with MIH were categorized as those with a minimum of one hypomineralized first permanent molar. A GWAS was implemented following a quality-control step and an additive genetic effect was assumed. A total of 2,013,491 single-nucleotide polymorphisms (SNPs) were available for analysis. Rs13058467, which is located near the SCUBE1 gene on chromosome 22 (p MIH when using a threshold of p value MIH.
Lu, Yi; Chen, Xiaoqing; Beesley, Jonathan
stage 1 GWAS rather than due to problems with the pooling approach. We conclude that there are unlikely to be any moderate or large effects on ovarian cancer risk untagged by less dense arrays. However, our study lacked power to make clear statements on the existence of hitherto untagged small......Recent Genome-Wide Association Studies (GWAS) have identified four low-penetrance ovarian cancer susceptibility loci. We hypothesized that further moderate- or low-penetrance variants exist among the subset of single-nucleotide polymorphisms (SNPs) not well tagged by the genotyping arrays used...... in the previous studies, which would account for some of the remaining risk. We therefore conducted a time- and cost-effective stage 1 GWAS on 342 invasive serous cases and 643 controls genotyped on pooled DNA using the high-density Illumina 1M-Duo array. We followed up 20 of the most significantly associated...
Although species from the genus Thunnus include some of the most commercially important and most severely overexploited fishes, the phylogeny of this genus is still unresolved, hampering evolutionary and traceability studies that could help improve conservation and management strategies for these species. Previous attempts based on mitochondrial and nuclear markers were unsuccessful in inferring a congruent and reliable phylogeny, probably due to mitochondrial introgression events and lack of enough phylogenetically informative markers. Here we infer the first genome-wide nuclear marker-based phylogeny of tunas using restriction site associated DNA sequencing (RAD-seq) data. Our results, derived from phylogenomic inferences obtained from 128 nucleotide matrices constructed using alternative data assembly procedures, support a single Thunnus evolutionary history that challenges previous assumptions based on morphological and molecular data.
Singh, P; Benjak, A; Carat, S; Kai, M; Busso, P; Avanzi, C; Paniz-Mondolfi, A; Peter, C; Harshman, K; Rougemont, J; Matsuoka, M; Cole, S T
Genotyping and molecular characterization of drug resistance mechanisms in Mycobacterium leprae enables disease transmission and drug resistance trends to be monitored. In the present study, we performed genome-wide analysis of Airaku-3, a multidrug-resistant strain with an unknown mechanism of resistance to rifampicin. We identified 12 unique non-synonymous single-nucleotide polymorphisms (SNPs) including two in the transporter-encoding ctpC and ctpI genes. In addition, two SNPs were found that improve the resolution of SNP-based genotyping, particularly for Venezuelan and South East Asian strains of M. leprae. © 2014 The Authors Clinical Microbiology and Infection © 2014 European Society of Clinical Microbiology and Infectious Diseases.
Marete, Andrew Gitahi; Sahana, Goutam; Fritz, Sebastian
Using a combination of data from the BovineSNP50 BeadChip SNP array (Illumina, San Diego, CA) and a EuroGenomics (Amsterdam, the Netherlands) custom single nucleotide polymorphism (SNP) chip with SNP pre-selected from whole genome sequence data, we carried out an association study of milking speed...... associated with milking speed. As clinical mastitis and somatic cell score have an unfavorable genetic correlation with milking speed, we tested whether the most significant SNP on these 22 chromosomes associated with milking speed were also associated with clinical mastitis or somatic cell score. Nine...... hundred seventy-one genome-wide significant SNP were associated with milking speed. Of these, 86 were associated with clinical mastitis and 198 with somatic cell score. The most significant association signals for milking speed were observed on chromosomes 7, 8, 10, 14, and 18. The most significant signal...
J. H. Ha
Full Text Available The purpose of this study was to characterize genetic architecture of behavior patterns in Sapsaree dogs. The breed population (n = 8,256 has been constructed since 1990 over 12 generations and managed at the Sapsaree Breeding Research Institute, Gyeongsan, Korea. Seven behavioral traits were investigated for 882 individuals. The traits were classified as a quantitative or a categorical group, and heritabilities (h2 and variance components were estimated under the Animal model using ASREML 2.0 software program. In general, the h2 estimates of the traits ranged between 0.00 and 0.16. Strong genetic (rG and phenotypic (rP correlations were observed between nerve stability, affability and adaptability, i.e. 0.9 to 0.94 and 0.46 to 0.68, respectively. To detect significant single nucleotide polymorphism (SNP for the behavioral traits, a total of 134 and 60 samples were genotyped using the Illumina 22K CanineSNP20 and 170K CanineHD bead chips, respectively. Two datasets comprising 60 (Sap60 and 183 (Sap183 samples were analyzed, respectively, of which the latter was based on the SNPs that were embedded on both the 22K and 170K chips. To perform genome-wide association analysis, each SNP was considered with the residuals of each phenotype that were adjusted for sex and year of birth as fixed effects. A least squares based single marker regression analysis was followed by a stepwise regression procedure for the significant SNPs (p<0.01, to determine a best set of SNPs for each trait. A total of 41 SNPs were detected with the Sap183 samples for the behavior traits. The significant SNPs need to be verified using other samples, so as to be utilized to improve behavior traits via marker-assisted selection in the Sapsaree population.
Evangelou, Evangelos; Fellay, Jacques; Colombo, Sara
infected with human immunodeficiency virus type 1 (HIV-1) to assess whether differences in type of population (622 seroconverters vs. 636 seroprevalent subjects) or the number of measurements available for defining the phenotype resulted in differences in the effect sizes of associations between single...... nucleotide polymorphisms and the phenotype, HIV-1 viral load at set point. The effect estimate for the top 100 single nucleotide polymorphisms was 0.092 (95% confidence interval: 0.074, 0.110) log(10) viral load (log(10) copies of HIV-1 per mL of blood) greater in seroconverters than in seroprevalent...... available, particularly among seroconverters and for variants that achieved genome-wide significance. Differences in phenotype definition and ascertainment may affect the estimated magnitude of genetic effects and should be considered in optimizing power for discovering new associations....
Lang, M; Leménager, T; Streit, F; Fauth-Bühler, M; Frank, J; Juraeva, D; Witt, S H; Degenhardt, F; Hofmann, A; Heilmann-Heimbach, S; Kiefer, F; Brors, B; Grabe, H-J; John, U; Bischof, A; Bischof, G; Völker, U; Homuth, G; Beutel, M; Lind, P A; Medland, S E; Slutske, W S; Martin, N G; Völzke, H; Nöthen, M M; Meyer, C; Rumpf, H-J; Wurst, F M; Rietschel, M; Mann, K F
Pathological gambling is a behavioural addiction with negative economic, social, and psychological consequences. Identification of contributing genes and pathways may improve understanding of aetiology and facilitate therapy and prevention. Here, we report the first genome-wide association study of pathological gambling. Our aims were to identify pathways involved in pathological gambling, and examine whether there is a genetic overlap between pathological gambling and alcohol dependence. Four hundred and forty-five individuals with a diagnosis of pathological gambling according to the Diagnostic and Statistical Manual of Mental Disorders were recruited in Germany, and 986 controls were drawn from a German general population sample. A genome-wide association study of pathological gambling comprising single marker, gene-based, and pathway analyses, was performed. Polygenic risk scores were generated using data from a German genome-wide association study of alcohol dependence. No genome-wide significant association with pathological gambling was found for single markers or genes. Pathways for Huntington's disease (P-value=6.63×10(-3)); 5'-adenosine monophosphate-activated protein kinase signalling (P-value=9.57×10(-3)); and apoptosis (P-value=1.75×10(-2)) were significant. Polygenic risk score analysis of the alcohol dependence dataset yielded a one-sided nominal significant P-value in subjects with pathological gambling, irrespective of comorbid alcohol dependence status. The present results accord with previous quantitative formal genetic studies which showed genetic overlap between non-substance- and substance-related addictions. Furthermore, pathway analysis suggests shared pathology between Huntington's disease and pathological gambling. This finding is consistent with previous imaging studies. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Drago, Francesca; Karpasitou, Katerina; Poli, Francesca
We have developed a high-throughput system for single nucleotide polymorphism (SNP) genotyping of alleles of diverse blood group systems exploiting Luminex technology. The method uses specific oligonucleotide probes coupled to a specific array of fluorescent microspheres and is designed for typing Jka/Jkb, Fya/Fyb, S/s, K/k, Kpa/Kpb, Jsa/Jsb, Coa/Cob and Lua/Lub alleles. Briefly, two multiplex PCR reactions (PCR I and PCR II) according to the laboratory specific needs are set up. PCR I amplif...
Rizzi, Giovanni; Østerberg, Frederik Westergaard; Dufva, Martin
We present a magnetoresistive sensor platform for hybridization assays and demonstrate its applicability on single nucleotide polymorphism (SNP) genotyping. The sensor relies on anisotropic magnetoresistance in a new geometry with a local negative reference and uses the magnetic field from...... the sensor bias current to magnetize magnetic beads in the vicinity of the sensor. The method allows for real-time measurements of the specific bead binding to the sensor surface during DNA hybridization and washing. Compared to other magnetic biosensing platforms, our approach eliminates the need...... for external electromagnets and thus allows for miniaturization of the sensor platform....
Zhan, Qimin; Hu, Zhibin; He, Zhonghu; Jia, Weihua; Zhou, Yifeng; Yu, Kai; Shu, Xiao-Ou; Yuan, Jian-Min; Zheng, Wei; Zhao, Xue-Ke; Gao, She-Gan; Yuan, Zhi-Qing; Zhou, Fu-You; Fan, Zong-Min; Cui, Ji-Li; Lin, Hong-Li; Han, Xue-Na; Li, Bei; Chen, Xi; Dawsey, Sanford M.; Liao, Linda; Lee, Maxwell P.; Ding, Ti; Qiao, You-Lin; Liu, Zhihua; Liu, Yu; Yu, Dianke; Chang, Jiang; Wei, Lixuan; Gao, Yu-Tang; Koh, Woon-Puay; Xiang, Yong-Bing; Tang, Ze-Zhong; Fan, Jin-Hu; Han, Jing-Jing; Zhou, Sheng-Li; Zhang, Peng; Zhang, Dong-Yun; Yuan, Yuan; Huang, Ying; Liu, Chunling; Zhai, Kan; Qiao, Yan; Jin, Guangfu; Guo, Chuanhai; Fu, Jianhua; Miao, Xiaoping; Lu, Changdong; Yang, Haijun; Wang, Chaoyu; Wheeler, William A.; Gail, Mitchell; Yeager, Meredith; Yuenger, Jeff; Guo, Er-Tao; Li, Ai-Li; Zhang, Wei; Li, Xue-Min; Sun, Liang-Dan; Ma, Bao-Gen; Li, Yan; Tang, Sa; Peng, Xiu-Qing; Liu, Jing; Hutchinson, Amy; Jacobs, Kevin; Giffen, Carol; Burdette, Laurie; Fraumeni, Joseph F.; Shen, Hongbing; Ke, Yang; Zeng, Yixin; Wu, Tangchun; Kraft, Peter; Chung, Charles C.; Tucker, Margaret A.; Hou, Zhi-Chao; Liu, Ya-Li; Hu, Yan-Long; Liu, Yu; Wang, Li; Yuan, Guo; Chen, Li-Sha; Liu, Xiao; Ma, Teng; Meng, Hui; Sun, Li; Li, Xin-Min; Li, Xiu-Min; Ku, Jian-Wei; Zhou, Ying-Fa; Yang, Liu-Qin; Wang, Zhou; Li, Yin; Qige, Qirenwang; Yang, Wen-Jun; Lei, Guang-Yan; Chen, Long-Qi; Li, En-Min; Yuan, Ling; Yue, Wen-Bin; Wang, Ran; Wang, Lu-Wen; Fan, Xue-Ping; Zhu, Fang-Heng; Zhao, Wei-Xing; Mao, Yi-Min; Zhang, Mei; Xing, Guo-Lan; Li, Ji-Lin; Han, Min; Ren, Jing-Li; Liu, Bin; Ren, Shu-Wei; Kong, Qing-Peng; Li, Feng; Sheyhidin, Ilyar; Wei, Wu; Zhang, Yan-Rui; Feng, Chang-Wei; Wang, Jin; Yang, Yu-Hua; Hao, Hong-Zhang; Bao, Qi-De; Liu, Bao-Chi; Wu, Ai-Qun; Xie, Dong; Yang, Wan-Cai; Wang, Liang; Zhao, Xiao-Hang; Chen, Shu-Qing; Hong, Jun-Yan; Zhang, Xue-Jun; Freedman, Neal D; Goldstein, Alisa M.; Lin, Dongxin; Taylor, Philip R.; Wang, Li-Dong; Chanock, Stephen J.
We conducted a joint (pooled) analysis of three genome-wide association studies (GWAS) 1-3 of esophageal squamous cell carcinoma (ESCC) in ethnic Chinese (5,337 ESCC cases and 5,787 controls) with 9,654 ESCC cases and 10,058 controls for follow-up. In a logistic regression model adjusted for age, sex, study, and two eigenvectors, two new loci achieved genome-wide significance, marked by rs7447927 at 5q31.2 (per-allele odds ratio (OR) = 0.85, 95% CI 0.82-0.88; P=7.72x10−20) and rs1642764 at 17p13.1 (per-allele OR= 0.88, 95% CI 0.85-0.91; P=3.10x10−13). rs7447927 is a synonymous single nucleotide polymorphism (SNP) in TMEM173 and rs1642764 is an intronic SNP in ATP1B2, near TP53. Furthermore, a locus in the HLA class II region at 6p21.32 (rs35597309) achieved genome-wide significance in the two populations at highest risk for ESSC (OR=1.33, 95% CI 1.22-1.46; P=1.99x10−10). Our joint analysis identified new ESCC susceptibility loci overall as well as a new locus unique to the ESCC high risk Taihang Mountain region. PMID:25129146
In high-dimensional studies such as genome-wide association studies, the correction for multiple testing in order to control total type I error results in decreased power to detect modest effects. We present a new analytical approach based on the higher criticism statistic that allows identification of the presence of modest effects. We apply our method to the genome-wide study of rheumatoid arthritis provided in the Genetic Analysis Workshop 16 Problem 1 data set. There is evidence for unknown bias in this study that could be explained by the presence of undetected modest effects. We compared the asymptotic and empirical thresholds for the higher criticism statistic. Using the asymptotic threshold we detected the presence of modest effects genome-wide. We also detected modest effects using 90th percentile of the empirical null distribution as a threshold; however, there is no such evidence when the 95th and 99th percentiles were used. While the higher criticism method suggests that there is some evidence for modest effects, interpreting individual single-nucleotide polymorphisms with significant higher criticism statistics is of undermined value. The goal of higher criticism is to alert the researcher that genetic effects remain to be discovered and to promote the use of more targeted and powerful studies to detect the remaining effects. PMID:20018032
Fragomeni, Breno O; Lourenco, Daniela A L; Masuda, Yutaka; Legarra, Andres; Misztal, Ignacy
Much effort is put into identifying causative quantitative trait nucleotides (QTN) in animal breeding, empowered by the availability of dense single nucleotide polymorphism (SNP) information. Genomic selection using traditional SNP information is easily implemented for any number of genotyped individuals using single-step genomic best linear unbiased predictor (ssGBLUP) with the algorithm for proven and young (APY). Our aim was to investigate whether ssGBLUP is useful for genomic prediction when some or all QTN are known. Simulations included 180,000 animals across 11 generations. Phenotypes were available for all animals in generations 6 to 10. Genotypes for 60,000 SNPs across 10 chromosomes were available for 29,000 individuals. The genetic variance was fully accounted for by 100 or 1000 biallelic QTN. Raw genomic relationship matrices (GRM) were computed from (a) unweighted SNPs, (b) unweighted SNPs and causative QTN, (c) SNPs and causative QTN weighted with results obtained with genome-wide association studies, (d) unweighted SNPs and causative QTN with simulated weights, (e) only unweighted causative QTN, (f-h) as in (b-d) but using only the top 10% causative QTN, and (i) using only causative QTN with simulated weight. Predictions were computed by pedigree-based BLUP (PBLUP) and ssGBLUP. Raw GRM were blended with 1 or 5% of the numerator relationship matrix, or 1% of the identity matrix. Inverses of GRM were obtained directly or with APY. Accuracy of breeding values for 5000 genotyped animals in the last generation with PBLUP was 0.32, and for ssGBLUP it increased to 0.49 with an unweighted GRM, 0.53 after adding unweighted QTN, 0.63 when QTN weights were estimated, and 0.89 when QTN weights were based on true effects known from the simulation. When the GRM was constructed from causative QTN only, accuracy was 0.95 and 0.99 with blending at 5 and 1%, respectively. Accuracies simulating 1000 QTN were generally lower, with a similar trend. Accuracies using the
Full Text Available Genetics is important for breeding and selection of horses but there is a lack of well-established horse-related browsers or databases. In order to better understand horses, more variants and other integrated information are needed. Thus, we construct a horse genomic variants database including expression and other information. Horse Single Nucleotide Polymorphism and Expression Database (HSDB (http://snugenome2.snu.ac.kr/HSDB provides the number of unexplored genomic variants still remaining to be identified in the horse genome including rare variants by using population genome sequences of eighteen horses and RNA-seq of four horses. The identified single nucleotide polymorphisms (SNPs were confirmed by comparing them with SNP chip data and variants of RNA-seq, which showed a concordance level of 99.02% and 96.6%, respectively. Moreover, the database provides the genomic variants with their corresponding transcriptional profiles from the same individuals to help understand the functional aspects of these variants. The database will contribute to genetic improvement and breeding strategies of Thoroughbreds.
Ripke, Stephan; Wray, Naomi R; Lewis, Cathryn M; Hamilton, Steven P; Weissman, Myrna M; Breen, Gerome; Byrne, Enda M; Blackwood, Douglas H R; Boomsma, Dorret I; Cichon, Sven; Heath, Andrew C; Holsboer, Florian; Lucae, Susanne; Madden, Pamela A F; Martin, Nicholas G; McGuffin, Peter; Muglia, Pierandrea; Noethen, Markus M; Penninx, Brenda P; Pergadia, Michele L; Potash, James B; Rietschel, Marcella; Lin, Danyu; Müller-Myhsok, Bertram; Shi, Jianxin; Steinberg, Stacy; Grabe, Hans J; Lichtenstein, Paul; Magnusson, Patrik; Perlis, Roy H; Preisig, Martin; Smoller, Jordan W; Stefansson, Kari; Uher, Rudolf; Kutalik, Zoltan; Tansey, Katherine E; Teumer, Alexander; Viktorin, Alexander; Barnes, Michael R; Bettecken, Thomas; Binder, Elisabeth B; Breuer, René; Castro, Victor M; Churchill, Susanne E; Coryell, William H; Craddock, Nick; Craig, Ian W; Czamara, Darina; De Geus, Eco J; Degenhardt, Franziska; Farmer, Anne E; Fava, Maurizio; Frank, Josef; Gainer, Vivian S; Gallagher, Patience J; Gordon, Scott D; Goryachev, Sergey; Gross, Magdalena; Guipponi, Michel; Henders, Anjali K; Herms, Stefan; Hickie, Ian B; Hoefels, Susanne; Hoogendijk, Witte; Hottenga, Jouke Jan; Iosifescu, Dan V; Ising, Marcus; Jones, Ian; Jones, Lisa; Jung-Ying, Tzeng; Knowles, James A; Kohane, Isaac S; Kohli, Martin A; Korszun, Ania; Landen, Mikael; Lawson, William B; Lewis, Glyn; Macintyre, Donald; Maier, Wolfgang; Mattheisen, Manuel; McGrath, Patrick J; McIntosh, Andrew; McLean, Alan; Middeldorp, Christel M; Middleton, Lefkos; Montgomery, Grant M; Murphy, Shawn N; Nauck, Matthias; Nolen, Willem A; Nyholt, Dale R; O'Donovan, Michael; Oskarsson, Högni; Pedersen, Nancy; Scheftner, William A; Schulz, Andrea; Schulze, Thomas G; Shyn, Stanley I; Sigurdsson, Engilbert; Slager, Susan L; Smit, Johannes H; Stefansson, Hreinn; Steffens, Michael; Thorgeirsson, Thorgeir; Tozzi, Federica; Treutlein, Jens; Uhr, Manfred; van den Oord, Edwin J C G; Van Grootheest, Gerard; Völzke, Henry; Weilburg, Jeffrey B; Willemsen, Gonneke; Zitman, Frans G; Neale, Benjamin; Daly, Mark; Levinson, Douglas F; Sullivan, Patrick F
Prior genome-wide association studies (GWAS) of major depressive disorder (MDD) have met with limited success. We sought to increase statistical power to detect disease loci by conducting a GWAS mega-analysis for MDD. In the MDD discovery phase, we analyzed more than 1.2 million autosomal and X chromosome single-nucleotide polymorphisms (SNPs) in 18 759 independent and unrelated subjects of recent European ancestry (9240 MDD cases and 9519 controls). In the MDD replication phase, we evaluated 554 SNPs in independent samples (6783 MDD cases and 50 695 controls). We also conducted a cross-disorder meta-analysis using 819 autosomal SNPs with P<0.0001 for either MDD or the Psychiatric GWAS Consortium bipolar disorder (BIP) mega-analysis (9238 MDD cases/8039 controls and 6998 BIP cases/7775 controls). No SNPs achieved genome-wide significance in the MDD discovery phase, the MDD replication phase or in pre-planned secondary analyses (by sex, recurrent MDD, recurrent early-onset MDD, age of onset, pre-pubertal onset MDD or typical-like MDD from a latent class analyses of the MDD criteria). In the MDD-bipolar cross-disorder analysis, 15 SNPs exceeded genome-wide significance (P<5 × 10(-8)), and all were in a 248 kb interval of high LD on 3p21.1 (chr3:52 425 083-53 822 102, minimum P=5.9 × 10(-9) at rs2535629). Although this is the largest genome-wide analysis of MDD yet conducted, its high prevalence means that the sample is still underpowered to detect genetic effects typical for complex traits. Therefore, we were unable to identify robust and replicable findings. We discuss what this means for genetic research for MDD. The 3p21.1 MDD-BIP finding should be interpreted with caution as the most significant SNP did not replicate in MDD samples, and genotyping in independent samples will be needed to resolve its status.
Yuan, Han; Dougherty, Joseph D.
Lay Abstract Autism spectrum disorders (ASDs) are pervasive developmental disorders which have both a genetic and environmental component. One source of the environmental component is the in utero (prenatal) environment. The maternal genome can potentially contribute to the risk of autism in children by altering this prenatal environment. In this study, the possibility of maternal genotype effects was explored by looking for common variants (single nucleotide polymorphisms, or SNPs) in the maternal genome associated with increased risk of autism in children. We performed a case/control genome-wide association study (GWAS) using mothers of probands as cases and either fathers of probands or normal females as controls, using two collections of families with autism. We did not identify any SNP that reached significance and thus a common variant of large effect is unlikely. However, there was evidence for the possibility of a large number of alleles each carrying a small effect. This suggested that if there is a contribution to autism risk through common-variant maternal genetic effects, it may be the result of multiple loci of small effects. We did not investigate rare variants in this study. Scientific Abstract Like most psychiatric disorders, autism spectrum disorders have both a genetic and an environmental component. While previous studies have clearly demonstrated the contribution of in utero (prenatal) environment on autism risk, most of them focused on transient environmental factors. Based on a recent sibling study, we hypothesized that environmental factors could also come from the maternal genome, which would result in persistent effects across siblings. In this study, the possibility of maternal genotype effects was examined by looking for common variants (single nucleotide polymorphisms, or SNPs) in the maternal genome associated with increased risk of autism in children. A case/control genome-wide association study (GWAS) was performed using mothers of
Bühler, Kora-Mareen; Giné, Elena; Echeverry-Alzate, Victor; Calleja-Conde, Javier; de Fonseca, Fernando Rodriguez; López-Moreno, Jose Antonio
Drug-related phenotypes are common complex and highly heritable traits. In the last few years, candidate gene (CGAS) and genome-wide association studies (GWAS) have identified a huge number of single nucleotide polymorphisms (SNPs) associated with drug use, abuse or dependence, mainly related to alcohol or nicotine. Nevertheless, few of these associations have been replicated in independent studies. The aim of this study was to provide a review of the SNPs that have been most significantly associated with alcohol-, nicotine-, cannabis- and cocaine-related phenotypes in humans between the years of 2000 and 2012. To this end, we selected CGAS, GWAS, family-based association and case-only studies published in peer-reviewed international scientific journals (using the PubMed/MEDLINE and Addiction GWAS Resource databases) in which a significant association was reported. A total of 371 studies fit the search criteria. We then filtered SNPs with at least one replication study and performed meta-analysis of the significance of the associations. SNPs in the alcohol metabolizing genes, in the cholinergic gene cluster CHRNA5-CHRNA3-CHRNB4, and in the DRD2 and ANNK1 genes, are, to date, the most replicated and significant gene variants associated with alcohol- and nicotine-related phenotypes. In the case of cannabis and cocaine, a far fewer number of studies and replications have been reported, indicating either a need for further investigation or that the genetics of cannabis/cocaine addiction are more elusive. This review brings a global state-of-the-art vision of the behavioral genetics of addiction and collaborates on formulation of new hypothesis to guide future work. © 2015 Society for the Study of Addiction.
Ayman A El-Menyar
Full Text Available Background: Based on several reports including genome-wide association studies, genetic variability has been linked with higher (nearly half susceptibility toward coronary artery disease (CAD. We aimed to evaluate the association of chromosome 9p21 single nucleotide polymorphisms (SNPs: rs2383207, rs10757278, and rs10757274 with the risk and severity of CAD among Arab population. Materials and Methods: A prospective observational case-control study was conducted between 2011 and 2012, in which 236 patients with CAD were recruited from the Heart Hospital in Qatar. Patients were categorized according to their coronary angiographic findings. Also, 152 healthy volunteers were studied to determine if SNPs are associated with risk of CAD. All subjects were genotyped for SNPs (rs2383207, rs2383206, rs10757274 and rs10757278 using allele-specific real-time polymerase chain reaction. Results: Patients with CAD had a mean age of 57 ± 10; of them 77% were males, 54% diabetics, and 25% had family history of CAD. All SNPs were in Hardy-Weinberg equilibrium except rs2383206, with call rate >97%. After adjusting for age, sex and body mass index, the carriers of GG genotype for rs2383207 have increased the risk of having CAD with odds ratio (OR of 1.52 (95% confidence interval [CI] = 1.01-2.961, P = 0.046. Also, rs2383207 contributed to CAD severity with adjusted OR 1.80 (95% CI = 1.04-3.12, P = 0.035 based on the dominant genetic model. The other SNPs (rs10757274 and rs10757278 showed no significant association with the risk of CAD or its severity. Conclusion: Among Arab population in Qatar, only G allele of rs2483207 SNP is significantly associated with risk of CAD and its severity.
Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs have emerged as the genetic marker of choice for mapping disease loci and candidate gene association studies, because of their high density and relatively even distribution in the human genomes. There is a need for systems allowing medium multiplexing (ten to hundreds of SNPs with high throughput, which can efficiently and cost-effectively generate genotypes for a very large sample set (thousands of individuals. Methods that are flexible, fast, accurate and cost-effective are urgently needed. This is also important for those who work on high throughput genotyping in non-model systems where off-the-shelf assays are not available and a flexible platform is needed. Results We demonstrate the use of a nanofluidic Integrated Fluidic Circuit (IFC - based genotyping system for medium-throughput multiplexing known as the Dynamic Array, by genotyping 994 individual human DNA samples on 47 different SNP assays, using nanoliter volumes of reagents. Call rates of greater than 99.5% and call accuracies of greater than 99.8% were achieved from our study, which demonstrates that this is a formidable genotyping platform. The experimental set up is very simple, with a time-to-result for each sample of about 3 hours. Conclusion Our results demonstrate that the Dynamic Array is an excellent genotyping system for medium-throughput multiplexing (30-300 SNPs, which is simple to use and combines rapid throughput with excellent call rates, high concordance and low cost. The exceptional call rates and call accuracy obtained may be of particular interest to those working on validation and replication of genome- wide- association (GWA studies.
Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs are the most common source of genetic variation in eukaryotic species and have become an important marker for genetic studies. The mosquito Anopheles funestus is one of the major malaria vectors in Africa and yet, prior to this study, no SNPs have been described for this species. Here we report a genome-wide set of SNP markers for use in genetic studies on this important human disease vector. Results DNA fragments from 50 genes were amplified and sequenced from 21 specimens of An. funestus. A third of specimens were field collected in Malawi, a third from a colony of Mozambican origin and a third form a colony of Angolan origin. A total of 494 SNPs including 303 within the coding regions of genes and 5 indels were identified. The physical positions of these SNPs in the genome are known. There were on average 7 SNPs per kilobase similar to that observed in An. gambiae and Drosophila melanogaster. Transitions outnumbered transversions, at a ratio of 2:1. The increased frequency of transition substitutions in coding regions is likely due to the structure of the genetic code and selective constraints. Synonymous sites within coding regions showed a higher polymorphism rate than non-coding introns or 3' and 5'flanking DNA with most of the substitutions in coding regions being observed at the 3rd codon position. A positive correlation in the level of polymorphism was observed between coding and non-coding regions within a gene. By genotyping a subset of 30 SNPs, we confirmed the validity of the SNPs identified during this study. Conclusion This set of SNP markers represents a useful tool for genetic studies in An. funestus, and will be useful in identifying candidate genes that affect diverse ranges of phenotypes that impact on vector control, such as resistance insecticide, mosquito behavior and vector competence.
Rabinowicz Pablo D
Full Text Available Abstract Background Castor bean (Ricinus communis is an agricultural crop and garden ornamental that is widely cultivated and has been introduced worldwide. Understanding population structure and the distribution of castor bean cultivars has been challenging because of limited genetic variability. We analyzed the population genetics of R. communis in a worldwide collection of plants from germplasm and from naturalized populations in Florida, U.S. To assess genetic diversity we conducted survey sequencing of the genomes of seven diverse cultivars and compared the data to a reference genome assembly of a widespread cultivar (Hale. We determined the population genetic structure of 676 samples using single nucleotide polymorphisms (SNPs at 48 loci. Results Bayesian clustering indicated five main groups worldwide and a repeated pattern of mixed genotypes in most countries. High levels of population differentiation occurred between most populations but this structure was not geographically based. Most molecular variance occurred within populations (74% followed by 22% among populations, and 4% among continents. Samples from naturalized populations in Florida indicated significant population structuring consistent with local demes. There was significant population differentiation for 56 of 78 comparisons in Florida (pairwise population ϕPT values, p Conclusion Low levels of genetic diversity and mixing of genotypes have led to minimal geographic structuring of castor bean populations worldwide. Relatively few lineages occur and these are widely distributed. Our approach of determining population genetic structure using SNPs from genome-wide comparisons constitutes a framework for high-throughput analyses of genetic diversity in plants, particularly in species with limited genetic diversity.
Xu, P; Wu, X; Wang, B; Luo, J; Liu, Y; Ehlers, J D; Close, T J; Roberts, P A; Lu, Z; Wang, S; Li, G
Association mapping of important traits of crop plants relies on first understanding the extent and patterns of linkage disequilibrium (LD) in the particular germplasm being investigated. We characterize here the genetic diversity, population structure and genome wide LD patterns in a set of asparagus bean (Vigna. unguiculata ssp. sesquipedialis) germplasm from China. A diverse collection of 99 asparagus bean and normal cowpea accessions were genotyped with 1127 expressed sequence tag-derived single nucleotide polymorphism markers (SNPs). The proportion of polymorphic SNPs across the collection was relatively low (39%), with an average number of SNPs per locus of 1.33. Bayesian population structure analysis indicated two subdivisions within the collection sampled that generally represented the 'standard vegetable' type (subgroup SV) and the 'non-standard vegetable' type (subgroup NSV), respectively. Level of LD (r(2)) was higher and extent of LD persisted longer in subgroup SV than in subgroup NSV, whereas LD decayed rapidly (0-2 cM) in both subgroups. LD decay distance varied among chromosomes, with the longest (≈ 5 cM) five times longer than the shortest (≈ 1 cM). Partitioning of LD variance into within- and between-subgroup components coupled with comparative LD decay analysis suggested that linkage group 5, 7 and 10 may have undergone the most intensive epistatic selection toward traits favorable for vegetable use. This work provides a first population genetic insight into domestication history of asparagus bean and demonstrates the feasibility of mapping complex traits by genome wide association study in asparagus bean using a currently available cowpea SNPs marker platform.
Full Text Available Oluwadamilare Falola,1 Victor Chukwudi Osamor,1,2 Marion Adebiyi,1,2 Ezekiel Adebiyi1,2 1Covenant University Bioinformatics Research (CUBRe, 2Department of Computer and Information Sciences, College of Science and Technology, Covenant University, Ota, Ogun State, Nigeria Background: Schizophrenia is a severe mental disorder affecting >21 million people worldwide. Some genetic studies reported that single nucleotide polymorphism (SNP involving variant rs1344706 from the ZNF804A gene in human beings is associated with the risk of schizophrenia in several populations. Similar results tend to conflict with other reports in literature, indicating that no true significant association exists between rs1344706 and schizophrenia. We seek to determine the level of association of this SNP with schizophrenia in the Asian population using more recent genome-wide association study (GWAS datasets. Methods: Applying a computational approach with inclusion of more recent GWAS datasets, we conducted a meta-analysis to examine the level of association of SNP rs1344706 and the risk of schizophrenia disorder among the Asian population constituting Chinese, Indonesians, Japanese, Kazakhs and Singaporeans. For a total of 21 genetic studies, including a total of 28,842 cases and 35,630 controls, regression analysis, publication bias, Cochran’s Q and I2 tests were performed. The DerSimonian and Laird random-effects model was used to assess the association of the genetic variant to schizophrenia. Leave-one-out sensitivity analysis was also conducted to determine the influence of each study on the final outcome of the association study. Results: Our summarized analysis for Asian population revealed a pooled odds ratio of 1.06, 95% confidence interval of 1.01–1.11 and two-tailed P-value of 0.0228. Our test for heterogeneity showed the presence of large heterogeneity (I2=53.44%, P =0.00207 and Egger’s regression test (P =0.8763 and Begg’s test (P =0
Izumi, Kosuke; Santani, Avni B; Deardorff, Matthew A; Feret, Holly A; Tischler, Tanya; Thiel, Brian D; Mulchandani, Surabhi; Stolle, Catherine A; Spinner, Nancy B; Zackai, Elaine H; Conlin, Laura K
Prader-Willi syndrome is caused by the loss of paternal gene expression on 15q11.2-q13.2, and one of the mechanisms resulting in Prader-Willi syndrome phenotype is maternal uniparental disomy of chromosome 15. Various mechanisms including trisomy rescue, monosomy rescue, and post fertilization errors can lead to uniparental disomy, and its mechanism can be inferred from the pattern of uniparental hetero and isodisomy. Detection of a mosaic cell line provides a unique opportunity to understand the mechanism of uniparental disomy; however, mosaic uniparental disomy is a rare finding in patients with Prader-Willi syndrome. We report on two infants with Prader-Willi syndrome caused by mosaic maternal uniparental disomy 15. Patient 1 has mosaic uniparental isodisomy of the entire chromosome 15, and Patient 2 has mosaic uniparental mixed iso/heterodisomy 15. Genome-wide single-nucleotide polymorphism array was able to demonstrate the presence of chromosomally normal cell line in the Patient 1 and trisomic cell line in Patient 2, and provide the evidence that post-fertilization error and trisomy rescue as a mechanism of uniparental disomy in each case, respectively. Given its ability of detecting small percent mosaicism as well as its capability of identifying the loss of heterozygosity of chromosomal regions, genome-wide single-nucleotide polymorphism array should be utilized as an adjunct to the standard methylation analysis in the evaluation of Prader-Willi syndrome. Copyright © 2012 Wiley Periodicals, Inc.
Lane, Jérôme; McLaren, Paul J.; Dorrell, Lucy; Shianna, Kevin V.; Stemke, Amanda; Pelak, Kimberly; Moore, Stephen; Oldenburg, Johannes; Alvarez-Roman, Maria Teresa; Angelillo-Scherrer, Anne; Boehlen, Francoise; Bolton-Maggs, Paula H.B.; Brand, Brigit; Brown, Deborah; Chiang, Elaine; Cid-Haro, Ana Rosa; Clotet, Bonaventura; Collins, Peter; Colombo, Sara; Dalmau, Judith; Fogarty, Patrick; Giangrande, Paul; Gringeri, Alessandro; Iyer, Rathi; Katsarou, Olga; Kempton, Christine; Kuriakose, Philip; Lin, Judith; Makris, Mike; Manco-Johnson, Marilyn; Tsakiris, Dimitrios A.; Martinez-Picado, Javier; Mauser-Bunschoten, Evelien; Neff, Anne; Oka, Shinichi; Oyesiku, Lara; Parra, Rafael; Peter-Salonen, Kristiina; Powell, Jerry; Recht, Michael; Shapiro, Amy; Stine, Kimo; Talks, Katherine; Telenti, Amalio; Wilde, Jonathan; Yee, Thynn Thynn; Wolinsky, Steven M.; Martinson, Jeremy; Hussain, Shehnaz K.; Bream, Jay H.; Jacobson, Lisa P.; Carrington, Mary; Goedert, James J.; Haynes, Barton F.; McMichael, Andrew J.; Goldstein, David B.; Fellay, Jacques
Human genetic variation contributes to differences in susceptibility to HIV-1 infection. To search for novel host resistance factors, we performed a genome-wide association study (GWAS) in hemophilia patients highly exposed to potentially contaminated factor VIII infusions. Individuals with hemophilia A and a documented history of factor VIII infusions before the introduction of viral inactivation procedures (1979–1984) were recruited from 36 hemophilia treatment centers (HTCs), and their genome-wide genetic variants were compared with those from matched HIV-infected individuals. Homozygous carriers of known CCR5 resistance mutations were excluded. Single nucleotide polymorphisms (SNPs) and inferred copy number variants (CNVs) were tested using logistic regression. In addition, we performed a pathway enrichment analysis, a heritability analysis, and a search for epistatic interactions with CCR5 Δ32 heterozygosity. A total of 560 HIV-uninfected cases were recruited: 36 (6.4%) were homozygous for CCR5 Δ32 or m303. After quality control and SNP imputation, we tested 1 081 435 SNPs and 3686 CNVs for association with HIV-1 serostatus in 431 cases and 765 HIV-infected controls. No SNP or CNV reached genome-wide significance. The additional analyses did not reveal any strong genetic effect. Highly exposed, yet uninfected hemophiliacs form an ideal study group to investigate host resistance factors. Using a genome-wide approach, we did not detect any significant associations between SNPs and HIV-1 susceptibility, indicating that common genetic variants of major effect are unlikely to explain the observed resistance phenotype in this population. PMID:23372042
Full Text Available Background: Single nucleotide polymorphism (SNPs are considered as one of the underlyingcauses of male infertility. Proper sperm chromatin packaging which involves replacement ofhistones with protamines has profound effect on male fertility. Over 20 SNPs have been reportedfor the protamine 1 and 2.Materials and Methods: The aim of this study was to evaluate the frequency of two previouslyreported SNPs using polymerase chain reaction (PCR-restriction fragment length polymorphism(RFLP approach in 35, 96 and 177 normal, oligozoospermic and azoospermic individuals. TheseSNPs are: 1. A base pair substitution (G at position 197 instead of T in protamine type 1 Openreading frame (ORF including untranslated region, which causes an Arg residue change to Serresidue in a highly conserved region. 2. cytidine nucleotide change to thymidine in position of 248of protamine type 2 ORF which caused a nonsense point mutation.Results: The two mentioned SNPs were not present in the studied population, thus concluding thatthese SNPs can not serves as molecular markers for male infertility diagnosis.Conclusion: The results of our study reveal that in a selected Iranian population, the SNP G197Tand C248T are completely absent and are not associated with male infertility and therefore theseSNPs may not represent a molecular marker for genetic diagnosis of male infertility.
Lin, Tao; Gao, Lihui
population of mutants with different tags, after recovered from different tissues of infected mice and ticks, mutants from output pool and input pool are detected using high-throughput, semi-quantitative Luminex ® FLEXMAP™ or next-generation sequencing (Tn-seq) technologies. Thus far, we have created a high-density, sequence-defined transposon library of over 6600 STM mutants for the efficient genome-wide investigation of genes and gene products required for wild-type pathogenesis, host-pathogen interactions, in vitro growth, in vivo survival, physiology, morphology, chemotaxis, motility, structure, metabolism, gene regulation, plasmid maintenance and replication, etc. The insertion sites of 4480 transposon mutants have been determined. About 800 predicted protein-encoding genes in the genome were disrupted in the STM transposon library. The infectivity and some functions of 800 mutants in 500 genes have been determined. Analysis of these transposon mutants has yielded valuable information regarding the genes and gene products important in the pathogenesis and biology of B. burgdorferi and its tick vectors.
Kim, Jin C.; Ha, Ye J.; Roh, Seon A.; Cho, Dong H.; Choi, Eun Y.; Kim, Tae W.; Kim, Jong H.; Kang, Tae W.; Kim, Seon Y.; Kim, Yong S.
Purpose: Studies aimed at predicting individual responsiveness to preoperative chemoradiation therapy (CRT) are urgently needed, especially considering the risks associated with poorly responsive patients. Methods and Materials: A 3-step strategy for the determination of CRT sensitivity is proposed based on (1) the screening of a human genome-wide single-nucleotide polymorphism (SNP) array in correlation with histopathologic tumor regression grade (TRG); (2) clinical association analysis of 113 patients treated with preoperative CRT; and (3) a cell-based functional assay for biological validation. Results: Genome-wide screening identified 9 SNPs associated with preoperative CRT responses. Positive responses (TRG 1-3) were obtained more frequently in patients carrying the reference allele (C) of the SNP CORO2A rs1985859 than in those with the substitution allele (T) (P=.01). Downregulation of CORO2A was significantly associated with reduced early apoptosis by 27% (P=.048) and 39% (P=.023) in RKO and COLO320DM colorectal cancer cells, respectively, as determined by flow cytometry. Reduced radiosensitivity was confirmed by colony-forming assays in the 2 colorectal cancer cells (P=.034 and .015, respectively). The SNP FAM101A rs7955740 was not associated with radiosensitivity in the clinical association analysis. However, downregulation of FAM101A significantly reduced early apoptosis by 29% in RKO cells (P=.047), and it enhanced colony formation in RKO cells (P=.001) and COLO320DM cells (P=.002). Conclusion: CRT-sensitive SNP markers were identified using a novel 3-step process. The candidate marker CORO2A rs1985859 and the putative marker FAM101A rs7955740 may be of value for the prediction of radiosensitivity to preoperative CRT, although further validation is needed in large cohorts
Drago, Francesca; Karpasitou, Katerina; Poli, Francesca
We have developed a high-throughput system for single nucleotide polymorphism (SNP) genotyping of alleles of diverse blood group systems exploiting Luminex technology. The method uses specific oligonucleotide probes coupled to a specific array of fluorescent microspheres and is designed for typing Jk(a)/Jk(b), Fy(a)/Fy(b), S/s, K/k, Kp(a)/Kp(b), Js(a)/Js(b), Co(a)/Co(b) and Lu(a)/Lu(b) alleles. Briefly, two multiplex PCR reactions (PCR I and PCR II) according to the laboratory specific needs are set up. PCR I amplifies the alleles tested routinely, namely Jk(a)/Jk(b), Fy(a)/Fy(b), S/s, and K/k. PCR II amplifies those alleles that are typed less frequently. Biotinylated PCR products are hybridized in a single multiplex assay with the corresponding probe mixture. After incubation with R-phycoerythrin-conjugated streptavidin, the emitted fluorescence is analyzed with Luminex 100. So far, we have typed more than 2,000 subjects, 493 of whom with multiplex assay, and there have been no discrepancies with the serology results other than null and/or weak phenotypes. The cost of consumables and reagents for typing a single biallelic pair per sample is less than EUR 3.-, not including DNA extraction costs. The capability to perform multiplexed reactions makes the method markedly suitable for mass screening of red blood cell alleles. This genotyping approach represents an important tool in transfusion medicine.
Huang, Chao; Thompson, Paul; Wang, Yalin; Yu, Yang; Zhang, Jingwen; Kong, Dehan; Colen, Rivka R; Knickmeyer, Rebecca C; Zhu, Hongtu
Functional phenotypes (e.g., subcortical surface representation), which commonly arise in imaging genetic studies, have been used to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. However, existing statistical methods largely ignore the functional features (e.g., functional smoothness and correlation). The aim of this paper is to develop a functional genome-wide association analysis (FGWAS) framework to efficiently carry out whole-genome analyses of functional phenotypes. FGWAS consists of three components: a multivariate varying coefficient model, a global sure independence screening procedure, and a test procedure. Compared with the standard multivariate regression model, the multivariate varying coefficient model explicitly models the functional features of functional phenotypes through the integration of smooth coefficient functions and functional principal component analysis. Statistically, compared with existing methods for genome-wide association studies (GWAS), FGWAS can substantially boost the detection power for discovering important genetic variants influencing brain structure and function. Simulation studies show that FGWAS outperforms existing GWAS methods for searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. We have successfully applied FGWAS to large-scale analysis of data from the Alzheimer's Disease Neuroimaging Initiative for 708 subjects, 30,000 vertices on the left and right hippocampal surfaces, and 501,584 SNPs. Copyright © 2017 Elsevier Inc. All rights reserved.
Full Text Available Abstract Background Insect bite hypersensitivity is a common allergic disease in horse populations worldwide. Insect bite hypersensitivity is affected by both environmental and genetic factors. However, little is known about genes contributing to the genetic variance associated with insect bite hypersensitivity. Therefore, the aim of our study was to identify and quantify genomic associations with insect bite hypersensitivity in Shetland pony mares and Icelandic horses in the Netherlands. Methods Data on 200 Shetland pony mares and 146 Icelandic horses were collected according to a matched case–control design. Cases and controls were matched on various factors (e.g. region, sire to minimize effects of population stratification. Breed-specific genome-wide association studies were performed using 70 k single nucleotide polymorphisms genotypes. Bayesian variable selection method Bayes-C with a threshold model implemented in GenSel software was applied. A 1 Mb non-overlapping window approach that accumulated contributions of adjacent single nucleotide polymorphisms was used to identify associated genomic regions. Results The percentage of variance explained by all single nucleotide polymorphisms was 13% in Shetland pony mares and 28% in Icelandic horses. The 20 non-overlapping windows explaining the largest percentages of genetic variance were found on nine chromosomes in Shetland pony mares and on 14 chromosomes in Icelandic horses. Overlap in identified associated genomic regions between breeds would suggest interesting candidate regions to follow-up on. Such regions common to both breeds (within 15 Mb were found on chromosomes 3, 7, 11, 20 and 23. Positional candidate genes within 2 Mb from the associated windows were identified on chromosome 20 in both breeds. Candidate genes are within the equine lymphocyte antigen class II region, which evokes an immune response by recognizing many foreign molecules. Conclusions The genome-wide association
Mao, Peng; Brown, Alexander J; Malc, Ewa P; Mieczkowski, Piotr A; Smerdon, Michael J; Roberts, Steven A; Wyrick, John J
DNA base damage is an important contributor to genome instability, but how the formation and repair of these lesions is affected by the genomic landscape and contributes to mutagenesis is unknown. Here, we describe genome-wide maps of DNA base damage, repair, and mutagenesis at single nucleotide resolution in yeast treated with the alkylating agent methyl methanesulfonate (MMS). Analysis of these maps revealed that base excision repair (BER) of alkylation damage is significantly modulated by chromatin, with faster repair in nucleosome-depleted regions, and slower repair and higher mutation density within strongly positioned nucleosomes. Both the translational and rotational settings of lesions within nucleosomes significantly influence BER efficiency; moreover, this effect is asymmetric relative to the nucleosome dyad axis and is regulated by histone modifications. Our data also indicate that MMS-induced mutations at adenine nucleotides are significantly enriched on the nontranscribed strand (NTS) of yeast genes, particularly in BER-deficient strains, due to higher damage formation on the NTS and transcription-coupled repair of the transcribed strand (TS). These findings reveal the influence of chromatin on repair and mutagenesis of base lesions on a genome-wide scale and suggest a novel mechanism for transcription-associated mutation asymmetry, which is frequently observed in human cancers. © 2017 Mao et al.; Published by Cold Spring Harbor Laboratory Press.
Ottolini, Christian S; Capalbo, Antonio; Newnham, Louise
We have developed a protocol for the generation of genome-wide maps (meiomaps) of recombination and chromosome segregation for the three products of human female meiosis: the first and second polar bodies (PB1 and PB2) and the corresponding oocyte. PB1 is biopsied and the oocyte is artificially......-nucleotide polymorphisms (SNPs) genome-wide by microarray. Informative maternal heterozygous SNPs are phased using a haploid PB2 or oocyte as a reference. A simple algorithm is then used to identify the maternal haplotypes for each chromosome, in all of the products of meiosis for each oocyte. This allows mapping...
B. G. Welderufael
Full Text Available Because mastitis is very frequent and unavoidable, adding recovery information into the analysis for genetic evaluation of mastitis is of great interest from economical and animal welfare point of view. Here we have performed genome-wide association studies (GWAS to identify associated single nucleotide polymorphisms (SNPs and investigate the genetic background not only for susceptibility to – but also for recoverability from mastitis. Somatic cell count records from 993 Danish Holstein cows genotyped for a total of 39378 autosomal SNP markers were used for the association analysis. Single SNP regression analysis was performed using the statistical software package DMU. Substitution effect of each SNP was tested with a t-test and a genome-wide significance level of P-value < 10-4 was used to declare significant SNP-trait association. A number of significant SNP variants were identified for both traits. Many of the SNP variants associated either with susceptibility to – or recoverability from mastitis were located in or very near to genes that have been reported for their role in the immune system. Genes involved in lymphocyte developments (e.g., MAST3 and STAB2 and genes involved in macrophage recruitment and regulation of inflammations (PDGFD and PTX3 were suggested as possible causal genes for susceptibility to – and recoverability from mastitis, respectively. However, this is the first GWAS study for recoverability from mastitis and our results need to be validated. The findings in the current study are, therefore, a starting point for further investigations in identifying causal genetic variants or chromosomal regions for both susceptibility to – and recoverability from mastitis.
Richard A Jensen
Full Text Available Mild retinopathy (microaneurysms or dot-blot hemorrhages is observed in persons without diabetes or hypertension and may reflect microvascular disease in other organs. We conducted a genome-wide association study (GWAS of mild retinopathy in persons without diabetes.A working group agreed on phenotype harmonization, covariate selection and analytic plans for within-cohort GWAS. An inverse-variance weighted fixed effects meta-analysis was performed with GWAS results from six cohorts of 19,411 Caucasians. The primary analysis included individuals without diabetes and secondary analyses were stratified by hypertension status. We also singled out the results from single nucleotide polymorphisms (SNPs previously shown to be associated with diabetes and hypertension, the two most common causes of retinopathy.No SNPs reached genome-wide significance in the primary analysis or the secondary analysis of participants with hypertension. SNP, rs12155400, in the histone deacetylase 9 gene (HDAC9 on chromosome 7, was associated with retinopathy in analysis of participants without hypertension, -1.3±0.23 (beta ± standard error, p = 6.6×10(-9. Evidence suggests this was a false positive finding. The minor allele frequency was low (∼2%, the quality of the imputation was moderate (r(2 ∼0.7, and no other common variants in the HDAC9 gene were associated with the outcome. SNPs found to be associated with diabetes and hypertension in other GWAS were not associated with retinopathy in persons without diabetes or in subgroups with or without hypertension.This GWAS of retinopathy in individuals without diabetes showed little evidence of genetic associations. Further studies are needed to identify genes associated with these signs in order to help unravel novel pathways and determinants of microvascular diseases.
Full Text Available Background: We conducted a genome-wide association study (GWAS to identify specific genetic variants that underlie susceptibility to disease caused by Staphylococcus aureus in humans. Methods: Cases (n=309 and controls (n=2,925 were genotyped at 508,921 single nucleotide polymorphisms (SNPs. Cases had at least one laboratory and clinician confirmed disease caused by S. aureus whereas controls did not. R-package (for SNP association, EIGENSOFT (to estimate and adjust for population stratification and gene- (VEGAS and pathway-based (DAVID, PANTHER, and Ingenuity Pathway Analysis analyses were performed.Results: No SNP reached genome-wide significance. Four SNPs exceeded the pConclusion: We identified potential susceptibility genes for S. aureus diseases in this preliminary study but confirmation by other studies is needed. The observed associations could be relevant given the complexity of S. aureus as a pathogen and its ability to exploit multiple biological pathways to cause infections in humans.
Nakajima, Masahiro; Takahashi, Atsushi; Kou, Ikuyo; Rodriguez-Fontenla, Cristina; Gomez-Reino, Juan J.; Furuichi, Tatsuya; Dai, Jin; Sudo, Akihiro; Uchida, Atsumasa; Fukui, Naoshi; Kubo, Michiaki; Kamatani, Naoyuki; Tsunoda, Tatsuhiko; Malizos, Konstantinos N.; Tsezou, Aspasia; Gonzalez, Antonio; Nakamura, Yusuke; Ikegawa, Shiro
Osteoarthritis (OA) is a common disease that has a definite genetic component. Only a few OA susceptibility genes that have definite functional evidence and replication of association have been reported, however. Through a genome-wide association study and a replication using a total of ∼4,800 Japanese subjects, we identified two single nucleotide polymorphisms (SNPs) (rs7775228 and rs10947262) associated with susceptibility to knee OA. The two SNPs were in a region containing HLA class II/III genes and their association reached genome-wide significance (combined P = 2.43×10−8 for rs7775228 and 6.73×10−8 for rs10947262). Our results suggest that immunologic mechanism is implicated in the etiology of OA. PMID:20305777
to protein: through epigenetic modifications, transcription regulators or post-transcriptional controls. The following papers concern several layers of gene regulation with questions answered by different HTS approaches. Genome-wide screening of epigenetic changes by ChIP-seq allowed us to study both spatial...... and temporal alterations of histone modifications (Papers I and II). Coupling the data with machine learning approaches, we established a prediction framework to assess the most informative histone marks as well as their most influential nucleosome positions in predicting the promoter usages. (Papers I...... they regulated or if the sites had global elevated usage rates by multiple TFs. Using RNA-seq, 5’end-seq in combination with depletion of 5’exonuclease as well as nonsensemediated decay (NMD) factors, we systematically analyzed NMD substrates as well as their degradation intermediates in human cells (Paper V...
P. J. Maughan
Full Text Available Quinoa ( Willd. is an important seed crop throughout the Andean region of South America. It is important as a regional food security crop for millions of impoverished rural inhabitants of the Andean Altiplano (high plains. Efforts to improve the crop have led to an increased focus on genetic research. We report the identification of 14,178 putative single nucleotide polymorphisms (SNPs using a genomic reduction protocol as well as the development of 511 functional SNP assays. The SNP assays are based on KASPar genotyping chemistry and were detected using the Fluidigm dynamic array platform. A diversity screen of 113 quinoa accessions showed that the minor allele frequency (MAF of the SNPs ranged from 0.02 to 0.50, with an average MAF of 0.28. Structure analysis of the quinoa diversity panel uncovered the two major subgroups corresponding to the Andean and coastal quinoa ecotypes. Linkage mapping of the SNPs in two recombinant inbred line populations produced an integrated linkage map consisting of 29 linkage groups with 20 large linkage groups, spanning 1404 cM with a marker density of 3.1 cM per SNP marker. The SNPs identified here represent important genomic tools needed in emerging plant breeding programs for advanced genetic analysis of agronomic traits in quinoa.
Atopic dermatitis (AD) is a common inflammatory skin disorder with a strong genetic component. Genome-wide association studies have been successful in the identification of common single nucleotide polymorphisms associated with AD, but their functional relevance has not been investigated yet. This work presents a comprehensive functional characterization of common and infrequent variants at the AD-associated C11orf30/LRRC32 locus. Analyses of cutaneous gene expression profiles in AD patients ...
Full Text Available Plasma fibrinogen is an acute phase protein playing an important role in the blood coagulation cascade having strong associations with smoking, alcohol consumption and body mass index (BMI. Genome-wide association studies (GWAS have identified a variety of gene regions associated with elevated plasma fibrinogen concentrations. However, little is yet known about how associations between environmental factors and fibrinogen might be modified by genetic variation. Therefore, we conducted large-scale meta-analyses of genome-wide interaction studies to identify possible interactions of genetic variants and smoking status, alcohol consumption or BMI on fibrinogen concentration. The present study included 80,607 subjects of European ancestry from 22 studies. Genome-wide interaction analyses were performed separately in each study for about 2.6 million single nucleotide polymorphisms (SNPs across the 22 autosomal chromosomes. For each SNP and risk factor, we performed a linear regression under an additive genetic model including an interaction term between SNP and risk factor. Interaction estimates were meta-analysed using a fixed-effects model. No genome-wide significant interaction with smoking status, alcohol consumption or BMI was observed in the meta-analyses. The most suggestive interaction was found for smoking and rs10519203, located in the LOC123688 region on chromosome 15, with a p value of 6.2 × 10(-8. This large genome-wide interaction study including 80,607 participants found no strong evidence of interaction between genetic variants and smoking status, alcohol consumption or BMI on fibrinogen concentrations. Further studies are needed to yield deeper insight in the interplay between environmental factors and gene variants on the regulation of fibrinogen concentrations.
Nguyen, Thanh-Tung; Huang, Joshua; Wu, Qingyao; Nguyen, Thuy; Li, Mark
Single-nucleotide polymorphisms (SNPs) selection and identification are the most important tasks in Genome-wide association data analysis. The problem is difficult because genome-wide association data is very high dimensional and a large portion of SNPs in the data is irrelevant to the disease. Advanced machine learning methods have been successfully used in Genome-wide association studies (GWAS) for identification of genetic variants that have relatively big effects in some common, complex diseases. Among them, the most successful one is Random Forests (RF). Despite of performing well in terms of prediction accuracy in some data sets with moderate size, RF still suffers from working in GWAS for selecting informative SNPs and building accurate prediction models. In this paper, we propose to use a new two-stage quality-based sampling method in random forests, named ts-RF, for SNP subspace selection for GWAS. The method first applies p-value assessment to find a cut-off point that separates informative and irrelevant SNPs in two groups. The informative SNPs group is further divided into two sub-groups: highly informative and weak informative SNPs. When sampling the SNP subspace for building trees for the forest, only those SNPs from the two sub-groups are taken into account. The feature subspaces always contain highly informative SNPs when used to split a node at a tree. This approach enables one to generate more accurate trees with a lower prediction error, meanwhile possibly avoiding overfitting. It allows one to detect interactions of multiple SNPs with the diseases, and to reduce the dimensionality and the amount of Genome-wide association data needed for learning the RF model. Extensive experiments on two genome-wide SNP data sets (Parkinson case-control data comprised of 408,803 SNPs and Alzheimer case-control data comprised of 380,157 SNPs) and 10 gene data sets have demonstrated that the proposed model significantly reduced prediction errors and outperformed
Biernacka, Joanna M.; Geske, Jennifer; Jenkins, Gregory D.; Colby, Colin; Rider, David N.; Karpyak, Victor M.; Choi, Doo-Sup; Fridley, Brooke L.
It is believed that multiple genetic variants with small individual effects contribute to the risk of alcohol dependence. Such polygenic effects are difficult to detect in genome-wide association studies that test for association of the phenotype with each single nucleotide polymorphism (SNP) individually. To overcome this challenge, gene set analysis (GSA) methods that jointly test for the effects of pre-defined groups of genes have been proposed. Rather than testing for association between the phenotype and individual SNPs, these analyses evaluate the global evidence of association with a set of related genes enabling the identification of cellular or molecular pathways or biological processes that play a role in development of the disease. It is hoped that by aggregating the evidence of association for all available SNPs in a group of related genes, these approaches will have enhanced power to detect genetic associations with complex traits. We performed GSA using data from a genome-wide study of 1165 alcohol dependent cases and 1379 controls from the Study of Addiction: Genetics and Environment (SAGE), for all 200 pathways listed in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Results demonstrated a potential role of the “Synthesis and Degradation of Ketone Bodies” pathway. Our results also support the potential involvement of the “Neuroactive Ligand Receptor Interaction” pathway, which has previously been implicated in addictive disorders. These findings demonstrate the utility of GSA in the study of complex disease, and suggest specific directions for further research into the genetic architecture of alcohol dependence. PMID:22717047
Bertram, Lars; Lange, Christoph; Mullin, Kristina; Parkinson, Michele; Hsiao, Monica; Hogan, Meghan F; Schjeide, Brit M M; Hooli, Basavaraj; Divito, Jason; Ionita, Iuliana; Jiang, Hongyu; Laird, Nan; Moscarillo, Thomas; Ohlsen, Kari L; Elliott, Kathryn; Wang, Xin; Hu-Lince, Diane; Ryder, Marie; Murphy, Amy; Wagner, Steven L; Blacker, Deborah; Becker, K David; Tanzi, Rudolph E
Alzheimer's disease (AD) is a genetically complex and heterogeneous disorder. To date four genes have been established to either cause early-onset autosomal-dominant AD (APP, PSEN1, and PSEN2(1-4)) or to increase susceptibility for late-onset AD (APOE5). However, the heritability of late-onset AD is as high as 80%, (6) and much of the phenotypic variance remains unexplained to date. We performed a genome-wide association (GWA) analysis using 484,522 single-nucleotide polymorphisms (SNPs) on a large (1,376 samples from 410 families) sample of AD families of self-reported European descent. We identified five SNPs showing either significant or marginally significant genome-wide association with a multivariate phenotype combining affection status and onset age. One of these signals (p = 5.7 x 10(-14)) was elicited by SNP rs4420638 and probably reflects APOE-epsilon4, which maps 11 kb proximal (r2 = 0.78). The other four signals were tested in three additional independent AD family samples composed of nearly 2700 individuals from almost 900 families. Two of these SNPs showed significant association in the replication samples (combined p values 0.007 and 0.00002). The SNP (rs11159647, on chromosome 14q31) with the strongest association signal also showed evidence of association with the same allele in GWA data generated in an independent sample of approximately 1,400 AD cases and controls (p = 0.04). Although the precise identity of the underlying locus(i) remains elusive, our study provides compelling evidence for the existence of at least one previously undescribed AD gene that, like APOE-epsilon4, primarily acts as a modifier of onset age.
J Brent Richards
Full Text Available The adipocyte-derived protein adiponectin is highly heritable and inversely associated with risk of type 2 diabetes mellitus (T2D and coronary heart disease (CHD. We meta-analyzed 3 genome-wide association studies for circulating adiponectin levels (n = 8,531 and sought validation of the lead single nucleotide polymorphisms (SNPs in 5 additional cohorts (n = 6,202. Five SNPs were genome-wide significant in their relationship with adiponectin (P< or =5x10(-8. We then tested whether these 5 SNPs were associated with risk of T2D and CHD using a Bonferroni-corrected threshold of P< or =0.011 to declare statistical significance for these disease associations. SNPs at the adiponectin-encoding ADIPOQ locus demonstrated the strongest associations with adiponectin levels (P-combined = 9.2x10(-19 for lead SNP, rs266717, n = 14,733. A novel variant in the ARL15 (ADP-ribosylation factor-like 15 gene was associated with lower circulating levels of adiponectin (rs4311394-G, P-combined = 2.9x10(-8, n = 14,733. This same risk allele at ARL15 was also associated with a higher risk of CHD (odds ratio [OR] = 1.12, P = 8.5x10(-6, n = 22,421 more nominally, an increased risk of T2D (OR = 1.11, P = 3.2x10(-3, n = 10,128, and several metabolic traits. Expression studies in humans indicated that ARL15 is well-expressed in skeletal muscle. These findings identify a novel protein, ARL15, which influences circulating adiponectin levels and may impact upon CHD risk.
Boueiz, Adel; Lutz, Sharon M; Cho, Michael H; Hersh, Craig P; Bowler, Russell P; Washko, George R; Halper-Stromberg, Eitan; Bakke, Per; Gulsvik, Amund; Laird, Nan M; Beaty, Terri H; Coxson, Harvey O; Crapo, James D; Silverman, Edwin K; Castaldi, Peter J; DeMeo, Dawn L
Emphysema has considerable variability in the severity and distribution of parenchymal destruction throughout the lungs. Upper lobe-predominant emphysema has emerged as an important predictor of response to lung volume reduction surgery. Yet, aside from alpha-1 antitrypsin deficiency, the genetic determinants of emphysema distribution remain largely unknown. To identify the genetic influences of emphysema distribution in non-alpha-1 antitrypsin-deficient smokers. A total of 11,532 subjects with complete genotype and computed tomography densitometry data in the COPDGene (Genetic Epidemiology of Chronic Obstructive Pulmonary Disease [COPD]; non-Hispanic white and African American), ECLIPSE (Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints), and GenKOLS (Genetics of Chronic Obstructive Lung Disease) studies were analyzed. Two computed tomography scan emphysema distribution measures (difference between upper-third and lower-third emphysema; ratio of upper-third to lower-third emphysema) were tested for genetic associations in all study subjects. Separate analyses in each study population were followed by a fixed effect metaanalysis. Single-nucleotide polymorphism-, gene-, and pathway-based approaches were used. In silico functional evaluation was also performed. We identified five loci associated with emphysema distribution at genome-wide significance. These loci included two previously reported associations with COPD susceptibility (4q31 near HHIP and 15q25 near CHRNA5) and three new associations near SOWAHB, TRAPPC9, and KIAA1462. Gene set analysis and in silico functional evaluation revealed pathways and cell types that may potentially contribute to the pathogenesis of emphysema distribution. This multicohort genome-wide association study identified new genomic loci associated with differential emphysematous destruction throughout the lungs. These findings may point to new biologic pathways on which to expand diagnostic and therapeutic
Wattacheril, Julia; Lavine, Joel E; Chalasani, Naga P; Guo, Xiuqing; Kwon, Soonil; Schwimmer, Jeffrey; Molleston, Jean P; Loomba, Rohit; Brunt, Elizabeth M; Chen, Yii-Der Ida; Goodarzi, Mark O; Taylor, Kent D; Yates, Katherine P; Tonascia, James; Rotter, Jerome I
To identify genetic loci associated with features of histologic severity of nonalcoholic fatty liver disease in a cohort of Hispanic boys. There were 234 eligible Hispanic boys age 2-17 years with clinical, laboratory, and histologic data enrolled in the Nonalcoholic Steatohepatitis Clinical Research Network included in the analysis of 624 297 single nucleotide polymorphisms (SNPs). After the elimination of 4 outliers and 22 boys with cryptic relatedness, association analyses were performed on 208 DNA samples with corresponding liver histology. Logistic regression analyses were carried out for qualitative traits and linear regression analyses were applied for quantitative traits. The median age and body mass index z-score were 12.0 years (IQR, 11.0-14.0) and 2.4 (IQR, 2.1-2.6), respectively. The nonalcoholic fatty liver disease activity score (scores 1-4 vs 5-8) was associated with SNP rs11166927 on chromosome 8 in the TRAPPC9 region (P = 8.7 -07 ). Fibrosis stage was associated with SNP rs6128907 on chromosome 20, near actin related protein 5 homolog (p = 9.9 -07 ). In comparing our results in Hispanic boys with those of previously reported SNPs in adult nonalcoholic steatohepatitis, 2 of 26 susceptibility loci were associated with nonalcoholic fatty liver disease activity score and 2 were associated with fibrosis stage. In this discovery genome-wide association study, we found significant novel gene effects on histologic traits associated with nonalcoholic fatty liver disease activity score and fibrosis that are distinct from those previously recognized by adult nonalcoholic fatty liver disease genome-wide association studies. Copyright © 2017 Elsevier Inc. All rights reserved.
Nelson, George W.; Lautenberger, James A.; Chinn, Leslie; McIntosh, Carl; Johnson, Randall C.; Sezgin, Efe; Kessing, Bailey; Malasky, Michael; Hendrickson, Sher L.; Pontius, Joan; Tang, Minzhong; An, Ping; Winkler, Cheryl A.; Limou, Sophie; Le Clerc, Sigrid; Delaneau, Olivier; Zagury, Jean-François; Schuitemaker, Hanneke; van Manen, Daniëlle; Bream, Jay H.; Gomperts, Edward D.; Buchbinder, Susan; Goedert, James J.; Kirk, Gregory D.; O'Brien, Stephen J.
Background. Host genetic variation influences human immunodeficiency virus (HIV) infection and progression to AIDS. Here we used clinically well-characterized subjects from 5 pretreatment HIV/AIDS cohorts for a genome-wide association study to identify gene associations with rate of AIDS progression. Methods. European American HIV seroconverters (n = 755) were interrogated for single-nucleotide polymorphisms (SNPs) (n = 700,022) associated with progression to AIDS 1987 (Cox proportional hazards regression analysis, co-dominant model). Results. Association with slower progression was observed for SNPs in the gene PARD3B. One of these, rs11884476, reached genome-wide significance (relative hazard = 0.3; P =3. 370 × 10−9) after statistical correction for 700,022 SNPs and contributes 4.52% of the overall variance in AIDS progression in this study. Nine of the top-ranked SNPs define a PARD3B haplotype that also displays significant association with progression to AIDS (hazard ratio, 0.3; P = 3.220 × 10−8). One of these SNPs, rs10185378, is a predicted exonic splicing enhancer; significant alteration in the expression profile of PARD3B splicing transcripts was observed in B cell lines with alternate rs10185378 genotypes. This SNP was typed in European cohorts of rapid progressors and was found to be protective for AIDS 1993 definition (odds ratio, 0.43, P = .025). Conclusions. These observations suggest a potential unsuspected pathway of host genetic influence on the dynamics of AIDS progression. PMID:21502085
Full Text Available The genetic basis of autoantibody production is largely unknown outside of associations located in the major histocompatibility complex (MHC human leukocyte antigen (HLA region. The aim of this study is the discovery of new genetic associations with autoantibody positivity using genome-wide association scan single nucleotide polymorphism (SNP data in type 1 diabetes (T1D patients with autoantibody measurements. We measured two anti-islet autoantibodies, glutamate decarboxylase (GADA, n = 2,506, insulinoma-associated antigen 2 (IA-2A, n = 2,498, antibodies to the autoimmune thyroid (Graves' disease (AITD autoantigen thyroid peroxidase (TPOA, n = 8,300, and antibodies against gastric parietal cells (PCA, n = 4,328 that are associated with autoimmune gastritis. Two loci passed a stringent genome-wide significance level (p<10(-10: 1q23/FCRL3 with IA-2A and 9q34/ABO with PCA. Eleven of 52 non-MHC T1D loci showed evidence of association with at least one autoantibody at a false discovery rate of 16%: 16p11/IL27-IA-2A, 2q24/IFIH1-IA-2A and PCA, 2q32/STAT4-TPOA, 10p15/IL2RA-GADA, 6q15/BACH2-TPOA, 21q22/UBASH3A-TPOA, 1p13/PTPN22-TPOA, 2q33/CTLA4-TPOA, 4q27/IL2/TPOA, 15q14/RASGRP1/TPOA, and 12q24/SH2B3-GADA and TPOA. Analysis of the TPOA-associated loci in 2,477 cases with Graves' disease identified two new AITD loci (BACH2 and UBASH3A.
Full Text Available The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs with minor allele frequency (MAF ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10−12. Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized.
Full Text Available BACKGROUND: The rs12807809 single-nucleotide polymorphism in NRGN is a genetic risk variant with genome-wide significance for schizophrenia. The frequency of the T allele of rs12807809 is higher in individuals with schizophrenia than in those without the disorder. Reduced immunoreactivity of NRGN, which is expressed exclusively in the brain, has been observed in Brodmann areas (BA 9 and 32 of the prefrontal cortex in postmortem brains from patients with schizophrenia compared with those in controls. METHODS: Genotype effects of rs12807809 were investigated on gray matter (GM and white matter (WM volumes using magnetic resonance imaging (MRI with a voxel-based morphometry (VBM technique in a sample of 99 Japanese patients with schizophrenia and 263 healthy controls. RESULTS: Although significant genotype-diagnosis interaction either on GM or WM volume was not observed, there was a trend of genotype-diagnosis interaction on GM volume in the left anterior cingulate cortex (ACC. Thus, the effects of NRGN genotype on GM volume of patients with schizophrenia and healthy controls were separately investigated. In patients with schizophrenia, carriers of the risk T allele had a smaller GM volume in the left ACC (BA32 than did carriers of the non-risk C allele. Significant genotype effect on other regions of the GM or WM was not observed for either the patients or controls. CONCLUSIONS: Our findings suggest that the genome-wide associated genetic risk variant in the NRGN gene may be related to a small GM volume in the ACC in the left hemisphere in patients with schizophrenia.
Erk, Susanne; Meyer-Lindenberg, Andreas; Schnell, Knut; Opitz von Boberfeld, Carola; Esslinger, Christine; Kirsch, Peter; Grimm, Oliver; Arnold, Claudia; Haddad, Leila; Witt, Stephanie H; Cichon, Sven; Nöthen, Markus M; Rietschel, Marcella; Walter, Henrik
The neural abnormalities underlying genetic risk for bipolar disorder, a severe, common, and highly heritable psychiatric condition, are largely unknown. An opportunity to define these mechanisms is provided by the recent discovery, through genome-wide association, of a single-nucleotide polymorphism (rs1006737) strongly associated with bipolar disorder within the CACNA1C gene, encoding the alpha subunit of the L-type voltage-dependent calcium channel Ca(v)1.2. To determine whether the genetic risk associated with rs1006737 is mediated through hippocampal function. Functional magnetic resonance imaging study. University hospital. A total of 110 healthy volunteers of both sexes and of German descent in the Hardy-Weinberg equilibrium for rs1006737. Blood oxygen level-dependent signal during an episodic memory task and behavioral and psychopathological measures. Using an intermediate phenotype approach, we show that healthy carriers of the CACNA1C risk variant exhibit a pronounced reduction of bilateral hippocampal activation during episodic memory recall and diminished functional coupling between left and right hippocampal regions. Furthermore, risk allele carriers exhibit activation deficits of the subgenual anterior cingulate cortex, a region repeatedly associated with affective disorders and the mediation of adaptive stress-related responses. The relevance of these findings for affective disorders is supported by significantly higher psychopathology scores for depression, anxiety, obsessive-compulsive thoughts, interpersonal sensitivity, and neuroticism in risk allele carriers, correlating negatively with the observed regional brain activation. Our data demonstrate that rs1006737 or genetic variants in linkage disequilibrium with it are functional in the human brain and provide a neurogenetic risk mechanism for bipolar disorder backed by genome-wide evidence.
Pain, Oliver; Dudbridge, Frank; Cardno, Alastair G; Freeman, Daniel; Lu, Yi; Lundstrom, Sebastian; Lichtenstein, Paul; Ronald, Angelica
This study aimed to test for overlap in genetic influences between psychotic-like experience traits shown by adolescents in the community, and clinically-recognized psychiatric disorders in adulthood, specifically schizophrenia, bipolar disorder, and major depression. The full spectra of psychotic-like experience domains, both in terms of their severity and type (positive, cognitive, and negative), were assessed using self- and parent-ratings in three European community samples aged 15-19 years (Final N incl. siblings = 6,297-10,098). A mega-genome-wide association study (mega-GWAS) for each psychotic-like experience domain was performed. Single nucleotide polymorphism (SNP)-heritability of each psychotic-like experience domain was estimated using genomic-relatedness-based restricted maximum-likelihood (GREML) and linkage disequilibrium- (LD-) score regression. Genetic overlap between specific psychotic-like experience domains and schizophrenia, bipolar disorder, and major depression was assessed using polygenic risk score (PRS) and LD-score regression. GREML returned SNP-heritability estimates of 3-9% for psychotic-like experience trait domains, with higher estimates for less skewed traits (Anhedonia, Cognitive Disorganization) than for more skewed traits (Paranoia and Hallucinations, Parent-rated Negative Symptoms). Mega-GWAS analysis identified one genome-wide significant association for Anhedonia within IDO2 but which did not replicate in an independent sample. PRS analysis revealed that the schizophrenia PRS significantly predicted all adolescent psychotic-like experience trait domains (Paranoia and Hallucinations only in non-zero scorers). The major depression PRS significantly predicted Anhedonia and Parent-rated Negative Symptoms in adolescence. Psychotic-like experiences during adolescence in the community show additive genetic effects and partly share genetic influences with clinically-recognized psychiatric disorders, specifically schizophrenia and
Davies, G; Harris, S E; Reynolds, C A; Payton, A; Knight, H M; Liewald, D C; Lopez, L M; Luciano, M; Gow, A J; Corley, J; Henderson, R; Murray, C; Pattie, A; Fox, H C; Redmond, P; Lutz, M W; Chiba-Falek, O; Linnertz, C; Saith, S; Haggarty, P; McNeill, G; Ke, X; Ollier, W; Horan, M; Roses, A D; Ponting, C P; Porteous, D J; Tenesa, A; Pickles, A; Starr, J M; Whalley, L J; Pedersen, N L; Pendleton, N; Visscher, P M; Deary, I J
Cognitive decline is a feared aspect of growing old. It is a major contributor to lower quality of life and loss of independence in old age. We investigated the genetic contribution to individual differences in nonpathological cognitive ageing in five cohorts of older adults. We undertook a genome-wide association analysis using 549 692 single-nucleotide polymorphisms (SNPs) in 3511 unrelated adults in the Cognitive Ageing Genetics in England and Scotland (CAGES) project. These individuals have detailed longitudinal cognitive data from which phenotypes measuring each individual's cognitive changes were constructed. One SNP--rs2075650, located in TOMM40 (translocase of the outer mitochondrial membrane 40 homolog)--had a genome-wide significant association with cognitive ageing (P=2.5 × 10(-8)). This result was replicated in a meta-analysis of three independent Swedish cohorts (P=2.41 × 10(-6)). An Apolipoprotein E (APOE) haplotype (adjacent to TOMM40), previously associated with cognitive ageing, had a significant effect on cognitive ageing in the CAGES sample (P=2.18 × 10(-8); females, P=1.66 × 10(-11); males, P=0.01). Fine SNP mapping of the TOMM40/APOE region identified both APOE (rs429358; P=3.66 × 10(-11)) and TOMM40 (rs11556505; P=2.45 × 10(-8)) as loci that were associated with cognitive ageing. Imputation and conditional analyses in the discovery and replication cohorts strongly suggest that this effect is due to APOE (rs429358). Functional genomic analysis indicated that SNPs in the TOMM40/APOE region have a functional, regulatory non-protein-coding effect. The APOE region is significantly associated with nonpathological cognitive ageing. The identity and mechanism of one or multiple causal variants remain unclear.
Boraska, Vesna; Jerončić, Ana; Colonna, Vincenza; Southam, Lorraine; Nyholt, Dale R.; William Rayner, Nigel; Perry, John R.B.; Toniolo, Daniela; Albrecht, Eva; Ang, Wei; Bandinelli, Stefania; Barbalic, Maja; Barroso, Inês; Beckmann, Jacques S.; Biffar, Reiner; Boomsma, Dorret; Campbell, Harry; Corre, Tanguy; Erdmann, Jeanette; Esko, Tõnu; Fischer, Krista; Franceschini, Nora; Frayling, Timothy M.; Girotto, Giorgia; Gonzalez, Juan R.; Harris, Tamara B.; Heath, Andrew C.; Heid, Iris M.; Hoffmann, Wolfgang; Hofman, Albert; Horikoshi, Momoko; Hua Zhao, Jing; Jackson, Anne U.; Hottenga, Jouke-Jan; Jula, Antti; Kähönen, Mika; Khaw, Kay-Tee; Kiemeney, Lambertus A.; Klopp, Norman; Kutalik, Zoltán; Lagou, Vasiliki; Launer, Lenore J.; Lehtimäki, Terho; Lemire, Mathieu; Lokki, Marja-Liisa; Loley, Christina; Luan, Jian'an; Mangino, Massimo; Mateo Leach, Irene; Medland, Sarah E.; Mihailov, Evelin; Montgomery, Grant W.; Navis, Gerjan; Newnham, John; Nieminen, Markku S.; Palotie, Aarno; Panoutsopoulou, Kalliope; Peters, Annette; Pirastu, Nicola; Polašek, Ozren; Rehnström, Karola; Ripatti, Samuli; Ritchie, Graham R.S.; Rivadeneira, Fernando; Robino, Antonietta; Samani, Nilesh J.; Shin, So-Youn; Sinisalo, Juha; Smit, Johannes H.; Soranzo, Nicole; Stolk, Lisette; Swinkels, Dorine W.; Tanaka, Toshiko; Teumer, Alexander; Tönjes, Anke; Traglia, Michela; Tuomilehto, Jaakko; Valsesia, Armand; van Gilst, Wiek H.; van Meurs, Joyce B.J.; Smith, Albert Vernon; Viikari, Jorma; Vink, Jacqueline M.; Waeber, Gerard; Warrington, Nicole M.; Widen, Elisabeth; Willemsen, Gonneke; Wright, Alan F.; Zanke, Brent W.; Zgaga, Lina; Boehnke, Michael; d'Adamo, Adamo Pio; de Geus, Eco; Demerath, Ellen W.; den Heijer, Martin; Eriksson, Johan G.; Ferrucci, Luigi; Gieger, Christian; Gudnason, Vilmundur; Hayward, Caroline; Hengstenberg, Christian; Hudson, Thomas J.; Järvelin, Marjo-Riitta; Kogevinas, Manolis; Loos, Ruth J.F.; Martin, Nicholas G.; Metspalu, Andres; Pennell, Craig E.; Penninx, Brenda W.; Perola, Markus; Raitakari, Olli; Salomaa, Veikko; Schreiber, Stefan; Schunkert, Heribert; Spector, Tim D.; Stumvoll, Michael; Uitterlinden, André G.; Ulivi, Sheila; van der Harst, Pim; Vollenweider, Peter; Völzke, Henry; Wareham, Nicholas J.; Wichmann, H.-Erich; Wilson, James F.; Rudan, Igor; Xue, Yali; Zeggini, Eleftheria
The male-to-female sex ratio at birth is constant across world populations with an average of 1.06 (106 male to 100 female live births) for populations of European descent. The sex ratio is considered to be affected by numerous biological and environmental factors and to have a heritable component. The aim of this study was to investigate the presence of common allele modest effects at autosomal and chromosome X variants that could explain the observed sex ratio at birth. We conducted a large-scale genome-wide association scan (GWAS) meta-analysis across 51 studies, comprising overall 114 863 individuals (61 094 women and 53 769 men) of European ancestry and 2 623 828 common (minor allele frequency >0.05) single-nucleotide polymorphisms (SNPs). Allele frequencies were compared between men and women for directly-typed and imputed variants within each study. Forward-time simulations for unlinked, neutral, autosomal, common loci were performed under the demographic model for European populations with a fixed sex ratio and a random mating scheme to assess the probability of detecting significant allele frequency differences. We do not detect any genome-wide significant (P < 5 × 10−8) common SNP differences between men and women in this well-powered meta-analysis. The simulated data provided results entirely consistent with these findings. This large-scale investigation across ∼115 000 individuals shows no detectable contribution from common genetic variants to the observed skew in the sex ratio. The absence of sex-specific differences is useful in guiding genetic association study design, for example when using mixed controls for sex-biased traits. PMID:22843499
Gerry Norman P
Full Text Available Abstract Background Uric acid is the primary byproduct of purine metabolism. Hyperuricemia is associated with body mass index (BMI, sex, and multiple complex diseases including gout, hypertension (HTN, renal disease, and type 2 diabetes (T2D. Multiple genome-wide association studies (GWAS in individuals of European ancestry (EA have reported associations between serum uric acid levels (SUAL and specific genomic loci. The purposes of this study were: 1 to replicate major signals reported in EA populations; and 2 to use the weak LD pattern in African ancestry population to better localize (fine-map reported loci and 3 to explore the identification of novel findings cognizant of the moderate sample size. Methods African American (AA participants (n = 1,017 from the Howard University Family Study were included in this study. Genotyping was performed using the Affymetrix® Genome-wide Human SNP Array 6.0. Imputation was performed using MACH and the HapMap reference panels for CEU and YRI. A total of 2,400,542 single nucleotide polymorphisms (SNPs were assessed for association with serum uric acid under the additive genetic model with adjustment for age, sex, BMI, glomerular filtration rate, HTN, T2D, and the top two principal components identified in the assessment of admixture and population stratification. Results Four variants in the gene SLC2A9 achieved genome-wide significance for association with SUAL (p-values ranging from 8.88 × 10-9 to 1.38 × 10-9. Fine-mapping of the SLC2A9 signals identified a 263 kb interval of linkage disequilibrium in the HapMap CEU sample. This interval was reduced to 37 kb in our AA and the HapMap YRI samples. Conclusions The most strongly associated locus for SUAL in EA populations was also the most strongly associated locus in this AA sample. This finding provides evidence for the role of SLC2A9 in uric acid metabolism across human populations. Additionally, our findings demonstrate the utility of following-up EA
Mathew J Barber
Full Text Available Statins effectively lower total and plasma LDL-cholesterol, but the magnitude of decrease varies among individuals. To identify single nucleotide polymorphisms (SNPs contributing to this variation, we performed a combined analysis of genome-wide association (GWA results from three trials of statin efficacy.Bayesian and standard frequentist association analyses were performed on untreated and statin-mediated changes in LDL-cholesterol, total cholesterol, HDL-cholesterol, and triglyceride on a total of 3932 subjects using data from three studies: Cholesterol and Pharmacogenetics (40 mg/day simvastatin, 6 weeks, Pravastatin/Inflammation CRP Evaluation (40 mg/day pravastatin, 24 weeks, and Treating to New Targets (10 mg/day atorvastatin, 8 weeks. Genotype imputation was used to maximize genomic coverage and to combine information across studies. Phenotypes were normalized within each study to account for systematic differences among studies, and fixed-effects combined analysis of the combined sample were performed to detect consistent effects across studies. Two SNP associations were assessed as having posterior probability greater than 50%, indicating that they were more likely than not to be genuinely associated with statin-mediated lipid response. SNP rs8014194, located within the CLMN gene on chromosome 14, was strongly associated with statin-mediated change in total cholesterol with an 84% probability by Bayesian analysis, and a p-value exceeding conventional levels of genome-wide significance by frequentist analysis (P = 1.8 x 10(-8. This SNP was less significantly associated with change in LDL-cholesterol (posterior probability = 0.16, P = 4.0 x 10(-6. Bayesian analysis also assigned a 51% probability that rs4420638, located in APOC1 and near APOE, was associated with change in LDL-cholesterol.Using combined GWA analysis from three clinical trials involving nearly 4,000 individuals treated with simvastatin, pravastatin, or atorvastatin, we
Peters, Ulrike; Jiao, Shuo; Schumacher, Fredrick R; Hutter, Carolyn M; Aragaki, Aaron K; Baron, John A; Berndt, Sonja I; Bézieau, Stéphane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J; Campbell, Peter T; Carlson, Christopher S; Casey, Graham; Chan, Andrew T; Chang-Claude, Jenny; Chanock, Stephen J; Chen, Lin S; Coetzee, Gerhard A; Coetzee, Simon G; Conti, David V; Curtis, Keith R; Duggan, David; Edwards, Todd; Fuchs, Charles S; Gallinger, Steven; Giovannucci, Edward L; Gogarten, Stephanie M; Gruber, Stephen B; Haile, Robert W; Harrison, Tabitha A; Hayes, Richard B; Henderson, Brian E; Hoffmeister, Michael; Hopper, John L; Hudson, Thomas J; Hunter, David J; Jackson, Rebecca D; Jee, Sun Ha; Jenkins, Mark A; Jia, Wei-Hua; Kolonel, Laurence N; Kooperberg, Charles; Küry, Sébastien; Lacroix, Andrea Z; Laurie, Cathy C; Laurie, Cecelia A; Le Marchand, Loic; Lemire, Mathieu; Levine, David; Lindor, Noralane M; Liu, Yan; Ma, Jing; Makar, Karen W; Matsuo, Keitaro; Newcomb, Polly A; Potter, John D; Prentice, Ross L; Qu, Conghui; Rohan, Thomas; Rosse, Stephanie A; Schoen, Robert E; Seminara, Daniela; Shrubsole, Martha; Shu, Xiao-Ou; Slattery, Martha L; Taverna, Darin; Thibodeau, Stephen N; Ulrich, Cornelia M; White, Emily; Xiang, Yongbing; Zanke, Brent W; Zeng, Yi-Xin; Zhang, Ben; Zheng, Wei; Hsu, Li
Heritable factors contribute to the development of colorectal cancer. Identifying the genetic loci associated with colorectal tumor formation could elucidate the mechanisms of pathogenesis. We conducted a genome-wide association study that included 14 studies, 12,696 cases of colorectal tumors (11,870 cancer, 826 adenoma), and 15,113 controls of European descent. The 10 most statistically significant, previously unreported findings were followed up in 6 studies; these included 3056 colorectal tumor cases (2098 cancer, 958 adenoma) and 6658 controls of European and Asian descent. Based on the combined analysis, we identified a locus that reached the conventional genome-wide significance level at less than 5.0 × 10(-8): an intergenic region on chromosome 2q32.3, close to nucleic acid binding protein 1 (most significant single nucleotide polymorphism: rs11903757; odds ratio [OR], 1.15 per risk allele; P = 3.7 × 10(-8)). We also found evidence for 3 additional loci with P values less than 5.0 × 10(-7): a locus within the laminin gamma 1 gene on chromosome 1q25.3 (rs10911251; OR, 1.10 per risk allele; P = 9.5 × 10(-8)), a locus within the cyclin D2 gene on chromosome 12p13.32 (rs3217810 per risk allele; OR, 0.84; P = 5.9 × 10(-8)), and a locus in the T-box 3 gene on chromosome 12q24.21 (rs59336; OR, 0.91 per risk allele; P = 3.7 × 10(-7)). In a large genome-wide association study, we associated polymorphisms close to nucleic acid binding protein 1 (which encodes a DNA-binding protein involved in DNA repair) with colorectal tumor risk. We also provided evidence for an association between colorectal tumor risk and polymorphisms in laminin gamma 1 (this is the second gene in the laminin family to be associated with colorectal cancers), cyclin D2 (which encodes for cyclin D2), and T-box 3 (which encodes a T-box transcription factor and is a target of Wnt signaling to β-catenin). The roles of these genes and their products in cancer pathogenesis warrant further
Anna C Need
Full Text Available We report a genome-wide assessment of single nucleotide polymorphisms (SNPs and copy number variants (CNVs in schizophrenia. We investigated SNPs using 871 patients and 863 controls, following up the top hits in four independent cohorts comprising 1,460 patients and 12,995 controls, all of European origin. We found no genome-wide significant associations, nor could we provide support for any previously reported candidate gene or genome-wide associations. We went on to examine CNVs using a subset of 1,013 cases and 1,084 controls of European ancestry, and a further set of 60 cases and 64 controls of African ancestry. We found that eight cases and zero controls carried deletions greater than 2 Mb, of which two, at 8p22 and 16p13.11-p12.4, are newly reported here. A further evaluation of 1,378 controls identified no deletions greater than 2 Mb, suggesting a high prior probability of disease involvement when such deletions are observed in cases. We also provide further evidence for some smaller, previously reported, schizophrenia-associated CNVs, such as those in NRXN1 and APBA2. We could not provide strong support for the hypothesis that schizophrenia patients have a significantly greater "load" of large (>100 kb, rare CNVs, nor could we find common CNVs that associate with schizophrenia. Finally, we did not provide support for the suggestion that schizophrenia-associated CNVs may preferentially disrupt genes in neurodevelopmental pathways. Collectively, these analyses provide the first integrated study of SNPs and CNVs in schizophrenia and support the emerging view that rare deleterious variants may be more important in schizophrenia predisposition than common polymorphisms. While our analyses do not suggest that implicated CNVs impinge on particular key pathways, we do support the contribution of specific genomic regions in schizophrenia, presumably due to recurrent mutation. On balance, these data suggest that very few schizophrenia patients
Full Text Available Summary: A number of mitochondrial diseases arise from single-nucleotide variant (SNV accumulation in multiple mitochondria. Here, we present a method for identification of variants present at the single-mitochondrion level in individual mouse and human neuronal cells, allowing for extremely high-resolution study of mitochondrial mutation dynamics. We identified extensive heteroplasmy between individual mitochondrion, along with three high-confidence variants in mouse and one in human that were present in multiple mitochondria across cells. The pattern of variation revealed by single-mitochondrion data shows surprisingly pervasive levels of heteroplasmy in inbred mice. Distribution of SNV loci suggests inheritance of variants across generations, resulting in Poisson jackpot lines with large SNV load. Comparison of human and mouse variants suggests that theÂ two species might employ distinct modes of somatic segregation. Single-mitochondrion resolution revealed mitochondria mutational dynamics that we hypothesize to affect risk probabilities for mutations reaching disease thresholds. : Morris etÂ al. use independent sequencing of multiple individual mitochondria from mouse and human brain cells to show high pervasiveness of mutations. The mutations are heteroplasmic within single mitochondria and within and between cells. These findings suggest mechanisms by which mutations accumulate over time, resulting in mitochondrial dysfunction and disease. Keywords: single mitochondrion, single cell, human neuron, mouse neuron, single-nucleotide variation
Full Text Available Abstract Background Alfalfa, a perennial, outcrossing species, is a widely planted forage legume producing highly nutritious biomass. Currently, improvement of cultivated alfalfa mainly relies on recurrent phenotypic selection. Marker assisted breeding strategies can enhance alfalfa improvement efforts, particularly if many genome-wide markers are available. Transcriptome sequencing enables efficient high-throughput discovery of single nucleotide polymorphism (SNP markers for a complex polyploid species. Result The transcriptomes of 27 alfalfa genotypes, including elite breeding genotypes, parents of mapping populations, and unimproved wild genotypes, were sequenced using an Illumina Genome Analyzer IIx. De novo assembly of quality-filtered 72-bp reads generated 25,183 contigs with a total length of 26.8 Mbp and an average length of 1,065 bp, with an average read depth of 55.9-fold for each genotype. Overall, 21,954 (87.2% of the 25,183 contigs represented 14,878 unique protein accessions. Gene ontology (GO analysis suggested that a broad diversity of genes was represented in the resulting sequences. The realignment of individual reads to the contigs enabled the detection of 872,384 SNPs and 31,760 InDels. High resolution melting (HRM analysis was used to validate 91% of 192 putative SNPs identified by sequencing. Both allelic variants at about 95% of SNP sites identified among five wild, unimproved genotypes are still present in cultivated alfalfa, and all four US breeding programs also contain a high proportion of these SNPs. Thus, little evidence exists among this dataset for loss of significant DNA sequence diversity from either domestication or breeding of alfalfa. Structure analysis indicated that individuals from the subspecies falcata, the diploid subspecies caerulea, and the tetraploid subspecies sativa (cultivated tetraploid alfalfa were clearly separated. Conclusion We used transcriptome sequencing to discover large numbers of SNPs
Full Text Available Actinobacillus pleuropneumoniae is the pathogen of porcine contagious pleuropneumoniae, a highly contagious respiratory disease of swine. Although the genome of A. pleuropneumoniae was sequenced several years ago, limited information is available on the genome-wide transcriptional analysis to accurately annotate the gene structures and regulatory elements. High-throughput RNA sequencing (RNA-seq has been applied to study the transcriptional landscape of bacteria, which can efficiently and accurately identify gene expression regions and unknown transcriptional units, especially small non-coding RNAs (sRNAs, UTRs and regulatory regions. The aim of this study is to comprehensively analyze the transcriptome of A. pleuropneumoniae by RNA-seq in order to improve the existing genome annotation and promote our understanding of A. pleuropneumoniae gene structures and RNA-based regulation. In this study, we utilized RNA-seq to construct a single nucleotide resolution transcriptome map of A. pleuropneumoniae. More than 3.8 million high-quality reads (average length ~90 bp from a cDNA library were generated and aligned to the reference genome. We identified 32 open reading frames encoding novel proteins that were mis-annotated in the previous genome annotations. The start sites for 35 genes based on the current genome annotation were corrected. Furthermore, 51 sRNAs in the A. pleuropneumoniae genome were discovered, of which 40 sRNAs were never reported in previous studies. The transcriptome map also enabled visualization of 5'- and 3'-UTR regions, in which contained 11 sRNAs. In addition, 351 operons covering 1230 genes throughout the whole genome were identified. The RNA-Seq based transcriptome map validated annotated genes and corrected annotations of open reading frames in the genome, and led to the identification of many functional elements (e.g. regions encoding novel proteins, non-coding sRNAs and operon structures. The transcriptional units
Elbaz, Alexis; Nelson, Lorene M; Payami, Haydeh; Ioannidis, John P A; Fiske, Brian K; Annesi, Grazia; Belin, Andrea Carmine; Factor, Stewart A; Ferrarese, Carlo; Hadjigeorgiou, Georgios M; Higgins, Donald S; Kawakami, Hideshi; Krüger, Rejko; Marder, Karen S; Mayeux, Richard P; Mellick, George D; Nutt, John G; Ritz, Beate; Samii, Ali; Tanner, Caroline M; Van Broeckhoven, Christine; Van Den Eeden, Stephen K; Wirdefeldt, Karin; Zabetian, Cyrus P; Dehem, Marie; Montimurro, Jennifer S; Southwick, Audrey; Myers, Richard M; Trikalinos, Thomas A
Summary Background A genome-wide association study identified 13 single-nucleotide polymorphisms (SNPs) significantly associated with Parkinson’s disease. Small-scale replication studies were largely non-confirmatory, but a meta-analysis that included data from the original study could not exclude all SNP associations, leaving relevance of several markers uncertain. Methods Investigators from three Michael J Fox Foundation for Parkinson’s Research-funded genetics consortia—comprising 14 teams—contributed DNA samples from 5526 patients with Parkinson’s disease and 6682 controls, which were genotyped for the 13 SNPs. Most (88%) participants were of white, non-Hispanic descent. We assessed log-additive genetic effects using fixed and random effects models stratified by team and ethnic origin, and tested for heterogeneity across strata. A meta-analysis was undertaken that incorporated data from the original genome-wide study as well as subsequent replication studies. Findings In fixed and random-effects models no associations with any of the 13 SNPs were identified (odds ratios 0·89 to 1·09). Heterogeneity between studies and between ethnic groups was low for all SNPs. Subgroup analyses by age at study entry, ethnic origin, sex, and family history did not show any consistent associations. In our meta-analysis, no SNP showed significant association (summary odds ratios 0·95 to 1.08); there was little heterogeneity except for SNP rs7520966. Interpretation Our results do not lend support to the finding that the 13 SNPs reported in the original genome-wide association study are genetic susceptibility factors for Parkinson’s disease. PMID:17052658
Full Text Available Abstract Background The advent of high throughput sequencing technology has enabled the 1000 Genomes Project Pilot 3 to generate complete sequence data for more than 906 genes and 8,140 exons representing 697 subjects. The 1000 Genomes database provides a critical opportunity for further interpreting disease associations with single nucleotide polymorphisms (SNPs discovered from genetic association studies. Currently, direct sequencing of candidate genes or regions on a large number of subjects remains both cost- and time-prohibitive. Results To accelerate the translation from discovery to functional studies, we propose an in silico gene sequencing method (ISS, which predicts phased sequences of intragenic regions, using SNPs. The key underlying idea of our method is to infer diploid sequences (a pair of phased sequences/alleles at every functional locus utilizing the deep sequencing data from the 1000 Genomes Project and SNP data from the HapMap Project, and to build prediction models using flanking SNPs. Using this method, we have developed a database of prediction models for 611 known genes. Sequence prediction accuracy for these genes is 96.26% on average (ranges 79%-100%. This database of prediction models can be enhanced and scaled up to include new genes as the 1000 Genomes Project sequences additional genes on additional individuals. Applying our predictive model for the KCNJ11 gene to the Wellcome Trust Case Control Consortium (WTCCC Type 2 diabetes cohort, we demonstrate how the prediction of phased sequences inferred from GWAS SNP genotype data can be used to facilitate interpretation and identify a probable functional mechanism such as protein changes. Conclusions Prior to the general availability of routine sequencing of all subjects, the ISS method proposed here provides a time- and cost-effective approach to broadening the characterization of disease associated SNPs and regions, and facilitating the prioritization of candidate
Zhang Li; Wang Lvhua; Yang Ming; Ji Wei; Zhao Lujun; Yang Weizhi; Zhou Zongmei; Ou Guangfei; Lin Dongxin
Objective: To evaluate the relationship between single nucleotide polymorphism(SNP) of candidate genes and radiation-induced esophagitis (RIE) in patients with lung cancer. Methods: Between Jan. 2004 and Aug. 2006, 170 patients with pathologically diagnosed lung cancer were enrolled in this study. The total target dose was 45-70 Gy (median 60 Gy). One hundred and thirty-two patients were treated with three-dimensional conformal radiotherapy(3DCRT) and 38 with two-dimensional radiotherapy(2DRT). Forty-one patients received radiotherapy alone, 78 received sequential chemoradiotherapy and 51 received concurrent chemoradiotherapy. Thirty-seven SNPs in 20 DNA repair genes were analyzed by using PCR- based restricted fragment length polymorphism (RFLP). These genes were apoptosis and inflammatory cytokine genes including ATM, ERCC1, XRCC3, XRCCI, XPD, XPC, XPG, NBS1, STK15, ZNF350, ADPRT, TP53, FAS, FASL, CYP2D6*4, CASPASE8, COX2,TGF-β, CD14 and ACE. The endpoint was grade ≥2 R I E. Results: Forty of the 170 patients developed grade ≥2 R I E, including 36 in grade 2 and 4 in grade 3. Univariate analysis revealed that radiation technique and concurrent chemoradiotherapy were statistically significant relatives to the incidence of R I E (P=0.032, 0.049), and both of them had the trend associating with the esophagitis (P=0.072, 0.094). An increased incidence of esophagitis was observed associating with the TGF-β 1 -509T and XPD 751Lys/Lys genotypes (χ 2 =5.65, P=0.017; χ 2 =3.84, P=0.048) in multivariate analysis. Conclusions: Genetic polymorphisms in TGF-β 1 gene and XPD gene have a significant association with radiation-induced esophagitis. (authors)
Choi, Jong Wook; Moon, Shinje; Jang, Eun Jung; Lee, Chang Hwa; Park, Joon-Sung
Increased glycemic exposure, even below the diagnostic criteria for diabetes mellitus, is crucial in the pathogenesis of diabetic microvascular complications represented by microalbuminuria. Nonetheless, there is limited evidence regarding which single nucleotide polymorphisms (SNPs) are associated with prediabetes and whether genetic predisposition to prediabetes is related to microalbuminuria, especially in the general population. Our objective was to answer these questions. We conducted a genomewide association study (GWAS) separately on two population-based cohorts, Ansung and Ansan, in the Korean Genome and Epidemiology Study (KoGES). The initial GWAS was carried out on the Ansung cohort, followed by a replication study on the Ansan cohort. A total of 5682 native Korean participants without a significant medical illness were classified into either control group (n = 3153) or prediabetic group (n = 2529). In the GWAS, we identified two susceptibility loci associated with prediabetes, one at 17p15.3-p15.1 in the GCK gene and another at 7p15.1 in YKT6. When variations in GCK and YKT6 were used as a model of prediabetes, this genetically determined prediabetes increased microalbuminuria. Multiple logistic regression analyses revealed that fasting glucose concentration in plasma and SNP rs2908289 in GCK were associated with microalbuminuria, and adjustment for age, gender, smoking history, systolic blood pressure, waist circumference, and serum triglyceride levels did not attenuate this association. Our results suggest that prediabetes and the associated SNPs may predispose to microalbuminuria before the diagnosis of diabetes mellitus. Further studies are needed to explore the details of the physiological and molecular mechanisms underlying this genetic association.
Choi, Jong Wook; Moon, Shinje; Jang, Eun Jung; Lee, Chang Hwa; Park, Joon-Sung
Increased glycemic exposure, even below the diagnostic criteria for diabetes mellitus, is crucial in the pathogenesis of diabetic microvascular complications represented by microalbuminuria. Nonetheless, there is limited evidence regarding which single nucleotide polymorphisms (SNPs) are associated with prediabetes and whether genetic predisposition to prediabetes is related to microalbuminuria, especially in the general population. Our objective was to answer these questions. We conducted a genomewide association study (GWAS) separately on two population-based cohorts, Ansung and Ansan, in the Korean Genome and Epidemiology Study (KoGES). The initial GWAS was carried out on the Ansung cohort, followed by a replication study on the Ansan cohort. A total of 5682 native Korean participants without a significant medical illness were classified into either control group (n = 3153) or prediabetic group (n = 2529). In the GWAS, we identified two susceptibility loci associated with prediabetes, one at 17p15.3-p15.1 in the GCK gene and another at 7p15.1 in YKT6. When variations in GCK and YKT6 were used as a model of prediabetes, this genetically determined prediabetes increased microalbuminuria. Multiple logistic regression analyses revealed that fasting glucose concentration in plasma and SNP rs2908289 in GCK were associated with microalbuminuria, and adjustment for age, gender, smoking history, systolic blood pressure, waist circumference, and serum triglyceride levels did not attenuate this association. Our results suggest that prediabetes and the associated SNPs may predispose to microalbuminuria before the diagnosis of diabetes mellitus. Further studies are needed to explore the details of the physiological and molecular mechanisms underlying this genetic association. PMID:28158221
Thomas, Laurent F.; Sætrom, Pål
Alternative polyadenylation (APA) can for example occur when a protein-coding gene has several polyadenylation (polyA) signals in its last exon, resulting in messenger RNAs (mRNAs) with different 3′ untranslated region (UTR) lengths. Different 3′UTR lengths can give different microRNA (miRNA) regulation such that shortened transcripts have increased expression. The APA process is part of human cells' natural regulatory processes, but APA also seems to play an important role in many human diseases. Although altered APA in disease can have many causes, we reasoned that mutations in DNA elements that are important for the polyA process, such as the polyA signal and the downstream GU-rich region, can be one important mechanism. To test this hypothesis, we identified single nucleotide polymorphisms (SNPs) that can create or disrupt APA signals (APA-SNPs). By using a data-integrative approach, we show that APA-SNPs can affect 3′UTR length, miRNA regulation, and mRNA expression—both between homozygote individuals and within heterozygote individuals. Furthermore, we show that a significant fraction of the alleles that cause APA are strongly and positively linked with alleles found by genome-wide studies to be associated with disease. Our results confirm that APA-SNPs can give altered gene regulation and that APA alleles that give shortened transcripts and increased gene expression can be important hereditary causes for disease. PMID:22915998
Curran, Sarah; Bolton, Patrick; Rozsnyai, Kinga; Chiocchetti, Andreas; Klauck, Sabine M; Duketis, Eftichia; Poustka, Fritz; Schlitt, Sabine; Freitag, Christine M; Lee, Irene; Muglia, Pierandrea; Poot, Martin; Staal, Wouter; de Jonge, Maretha V; Ophoff, Roel A; Lewis, Cathryn; Skuse, David; Mandy, Will; Vassos, Evangelos; Fossdal, Ragnheidur; Magnusson, Páll; Hreidarsson, Stefan; Saemundsen, Evald; Stefansson, Hreinn; Stefansson, Kari; Collier, David
The Autism Genome Project (AGP) Consortium recently reported genome-wide significant association between autism and an intronic single nucleotide polymorphism marker, rs4141463, within the MACROD2 gene. In the present study we attempted to replicate this finding using an independent case-control design of 1,170 cases with autism spectrum disorder (ASD) (874 of which fulfilled narrow criteria for Autism (A)) from five centers within Europe (UK, Germany, the Netherlands, Italy, and Iceland), and 35,307 controls. The combined sample size gave us a non-centrality parameter (NCP) of 11.9, with 93% power to detect allelic association of rs4141463 at an alpha of 0.05 with odds ratio of 0.84 (the best odds ratio estimate of the AGP Consortium data), and for the narrow diagnosis of autism, an NCP of 8.9 and power of 85%. Our case-control data were analyzed for association, stratified by each center, and the summary statistics were combined using the meta-analysis program, GWAMA. This resulted in an odds ratio (OR) of 1.03 (95% CI 0.944-1.133), with a P-value of 0.5 for ASD and OR of 0.99 (95% CI 0.88-1.11) with P-value = 0.85 for the Autism (A) sub-group. Therefore, this study does not provide support for the reported association between rs4141463 and autism. Copyright © 2011 Wiley-Liss, Inc.
Full Text Available We demonstrate that Au-cluster-decorated single-walled carbon nanotubes (SWNTs may be used to discriminate single nucleotide polymorphism (SNP. Nanoscale Au clusters were formed on the side walls of carbon nanotubes in a transistor geometry using electrochemical deposition. The effect of Au cluster decoration appeared as hole doping when electrical transport characteristics were examined. Thiolated single-stranded probe peptide nucleic acid (PNA was successfully immobilized on Au clusters decorating single-walled carbon nanotube field-effect transistors (SWNT-FETs, resulting in a conductance decrease that could be explained by a decrease in Au work function upon adsorption of thiolated PNA. Although a target single-stranded DNA (ssDNA with a single mismatch did not cause any change in electrical conductance, a clear decrease in conductance was observed with matched ssDNA, thereby showing the possibility of SNP (single nucleotide polymorphism detection using Au-cluster-decorated SWNT-FETs. However, a power to discriminate SNP target is lost in high ionic environment. We can conclude that observed SNP discrimination in low ionic environment is due to the hampered binding of SNP target on nanoscale surfaces in low ionic conditions.
Full Text Available Abstract Background Specific genetic contributions for preeclampsia (PE are currently unknown. This genome-wide association study (GWAS aims to identify maternal single nucleotide polymorphisms (SNPs and copy-number variants (CNVs involved in the etiology of PE. Methods A genome-wide scan was performed on 177 PE cases (diagnosed according to National Heart, Lung and Blood Institute guidelines and 116 normotensive controls. White female study subjects from Iowa were genotyped on Affymetrix SNP 6.0 microarrays. CNV calls made using a combination of four detection algorithms (Birdseye, Canary, PennCNV, and QuantiSNP were merged using CNVision and screened with stringent prioritization criteria. Due to limited DNA quantities and the deleterious nature of copy-number deletions, it was decided a priori that only deletions would be selected for assay on the entire case-control dataset using quantitative real-time PCR. Results The top four SNP candidates had an allelic or genotypic p-value between 10-5 and 10-6, however, none surpassed the Bonferroni-corrected significance threshold. Three recurrent rare deletions meeting prioritization criteria detected in multiple cases were selected for targeted genotyping. A locus of particular interest was found showing an enrichment of case deletions in 19q13.31 (5/169 cases and 1/114 controls, which encompasses the PSG11 gene contiguous to a highly plastic genomic region. All algorithm calls for these regions were assay confirmed. Conclusions CNVs may confer risk for PE and represent interesting regions that warrant further investigation. Top SNP candidates identified from the GWAS, although not genome-wide significant, may be useful to inform future studies in PE genetics.
Elijah R Behr
Full Text Available Marked prolongation of the QT interval on the electrocardiogram associated with the polymorphic ventricular tachycardia Torsades de Pointes is a serious adverse event during treatment with antiarrhythmic drugs and other culprit medications, and is a common cause for drug relabeling and withdrawal. Although clinical risk factors have been identified, the syndrome remains unpredictable in an individual patient. Here we used genome-wide association analysis to search for common predisposing genetic variants. Cases of drug-induced Torsades de Pointes (diTdP, treatment tolerant controls, and general population controls were ascertained across multiple sites using common definitions, and genotyped on the Illumina 610k or 1M-Duo BeadChips. Principal Components Analysis was used to select 216 Northwestern European diTdP cases and 771 ancestry-matched controls, including treatment-tolerant and general population subjects. With these sample sizes, there is 80% power to detect a variant at genome-wide significance with minor allele frequency of 10% and conferring an odds ratio of ≥2.7. Tests of association were carried out for each single nucleotide polymorphism (SNP by logistic regression adjusting for gender and population structure. No SNP reached genome wide-significance; the variant with the lowest P value was rs2276314, a non-synonymous coding variant in C18orf21 (p = 3×10(-7, odds ratio = 2, 95% confidence intervals: 1.5-2.6. The haplotype formed by rs2276314 and a second SNP, rs767531, was significantly more frequent in controls than cases (p = 3×10(-9. Expanding the number of controls and a gene-based analysis did not yield significant associations. This study argues that common genomic variants do not contribute importantly to risk for drug-induced Torsades de Pointes across multiple drugs.
Seyerle, Amanda A; Lin, Henry J; Gogarten, Stephanie M; Stilp, Adrienne; Méndez Giráldez, Raul; Soliman, Elsayed; Baldassari, Antoine; Graff, Mariaelisa; Heckbert, Susan; Kerr, Kathleen F; Kooperberg, Charles; Rodriguez, Carlos; Guo, Xiuqing; Yao, Jie; Sotoodehnia, Nona; Taylor, Kent D; Whitsel, Eric A; Rotter, Jerome I; Laurie, Cathy C; Avery, Christy L
PR interval (PR) is a heritable electrocardiographic measure of atrial and atrioventricular nodal conduction. Changes in PR duration may be associated with atrial fibrillation, heart failure and all-cause mortality. Hispanic/Latino populations have high burdens of cardiovascular morbidity and mortality, are highly admixed and represent exceptional opportunities for novel locus identification. However, they remain chronically understudied. We present the first genome-wide association study (GWAS) of PR in 14 756 participants of Hispanic/Latino ancestry from three studies. Study-specific summary results of the association between 1000 Genomes Phase 1 imputed single-nucleotide polymorphisms (SNPs) and PR assumed an additive genetic model and were adjusted for global ancestry, study centre/region and clinical covariates. Results were combined using fixed-effects, inverse variance weighted meta-analysis. Sequential conditional analyses were used to identify independent signals. Replication of novel loci was performed in populations of Asian, African and European descent. ENCODE and RoadMap data were used to annotate results. We identified a novel genome-wide association (PPR at ID2 (rs6730558), which replicated in Asian and European populations (PPR loci to Hispanics/Latinos. Bioinformatics annotation provided evidence for regulatory function in cardiac tissue. Further, for six loci that generalised, the Hispanic/Latino index SNP was genome-wide significant and identical to (or in high linkage disequilibrium with) the previously identified GWAS lead SNP. Our results suggest that genetic determinants of PR are consistent across race/ethnicity, but extending studies to admixed populations can identify novel associations, underscoring the importance of conducting genetic studies in diverse populations. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise
Bigdeli, Tim B.; Ripke, Stephan; Bacanu, Silviu-Alin; Lee, Sang Hong; Wray, Naomi R.; Gejman, Pablo V.; Rietschel, Marcella; Cichon, Sven; St Clair, David; Corvin, Aiden; Kirov, George; McQuillin, Andrew; Gurling, Hugh; Rujescu, Dan; Andreassen, Ole A.; Werge, Thomas; Blackwood, Douglas H.R.; Pato, Carlos N.; Pato, Michele T.; Malhotra, Anil K.; O’Donovan, Michael C.; Kendler, Kenneth S.; Fanous, Ayman H.
Genome-wide association studies (GWAS) of schizophrenia have yielded more than 100 common susceptibility variants, and strongly support a substantial polygenic contribution of a large number of small allelic effects. It has been hypothesized that familial schizophrenia is largely a consequence of inherited rather than environmental factors. We investigated the extent to which familiality of schizophrenia is associated with enrichment for common risk variants detectable in a large GWAS. We analyzed single nucleotide polymorphism (SNP) data for cases reporting a family history of psychotic illness (N = 978), cases reporting no such family history (N = 4,503), and unscreened controls (N = 8,285) from the Psychiatric Genomics Consortium (PGC1) study of schizophrenia. We used a multinomial logistic regression approach with model-fitting to detect allelic effects specific to either family history subgroup. We also considered a polygenic model, in which we tested whether family history positive subjects carried more schizophrenia risk alleles than family history negative subjects, on average. Several individual SNPs attained suggestive but not genome-wide significant association with either family history subgroup. Comparison of genome-wide polygenic risk scores based on GWAS summary statistics indicated a significant enrichment for SNP effects among family history positive compared to family history negative cases (Nagelkerke’s R2 = 0.0021; P = 0.00331; P-value threshold history positive compared to family history negative cases (0.32 and 0.22, respectively; P = 0.031).We found suggestive evidence of allelic effects detectable in large GWAS of schizophrenia that might be specific to particular family history subgroups. However, consideration of a polygenic risk score indicated a significant enrichment among family history positive cases for common allelic effects. Familial illness might, therefore, represent a more heritable form of schizophrenia, as suggested by
Beaty, Terri H.; Ruczinski, Ingo; Murray, Jeffrey C.; Marazita, Mary L.; Munger, Ronald G.; Hetmanski, Jacqueline B.; Murray, Tanda; Redett, Richard J.; Fallin, M. Daniele; Liang, Kung Yee; Wu, Tao; Patel, Poorav J.; Jin, Sheng C.; Zhang, Tian Xiao; Schwender, Holger; Wu-Chou, Yah Huei; Chen, Philip K; Chong, Samuel S; Cheah, Felicia; Yeow, Vincent; Ye, Xiaoqian; Wang, Hong; Huang, Shangzhi; Jabs, Ethylin W.; Shi, Bing; Wilcox, Allen J.; Lie, Rolv T.; Jee, Sun Ha; Christensen, Kaare; Doheny, Kimberley F.; Pugh, Elizabeth W.; Ling, Hua; Scott, Alan F.
Non-syndromic cleft palate (CP) is a common birth defect with a complex and heterogeneous etiology involving both genetic and environmental risk factors. We conducted a genome wide association study (GWAS) using 550 case-parent trios, ascertained through a CP case collected in an international consortium. Family based association tests of single nucleotide polymorphisms (SNP) and three common maternal exposures (maternal smoking, alcohol consumption and multivitamin supplementation) were used in a combined 2 df test for gene (G) and gene-environment (G×E) interaction simultaneously, plus a separate 1 df test for G×E interaction alone. Conditional logistic regression models were used to estimate effects on risk to exposed and unexposed children. While no SNP achieved genome wide significance when considered alone, markers in several genes attained or approached genome wide significance when G×E interaction was included. Among these, MLLT3 and SMC2 on chromosome 9 showed multiple SNPs resulting in increased risk if the mother consumed alcohol during the peri-conceptual period (3 months prior to conception through the first trimester). TBK1 on chr. 12 and ZNF236 on chr. 18 showed multiple SNPs associated with higher risk of CP in the presence of maternal smoking. Additional evidence of reduced risk due to G×E interaction in the presence of multivitamin supplementation was observed for SNPs in BAALC on chr. 8. These results emphasize the need to consider G×E interaction when searching for genes influencing risk to complex and heterogeneous disorders, such as non-syndromic CP. PMID:21618603
Chasman, Daniel I; Fuchsberger, Christian; Pattaro, Cristian; Teumer, Alexander; Böger, Carsten A; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; O'Seaghdha, Conall M; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D; Gierman, Hinco J; Feitosa, Mary F; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Lambert, Jean-Charles; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Coassin, Stefan; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; de Andrade, Mariza; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Meisinger, Christa; Gieger, Christian; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S; van Duijn, Cornelia M; Borecki, Ingrid B; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Parsa, Afshin; Bochud, Murielle; Heid, Iris M; Kao, W H Linda; Fox, Caroline S; Köttgen, Anna
In conducting genome-wide association studies (GWAS), analytical approaches leveraging biological information may further understanding of the pathophysiology of clinical traits. To discover novel associations with estimated glomerular filtration rate (eGFR), a measure of kidney function, we developed a strategy for integrating prior biological knowledge into the existing GWAS data for eGFR from the CKDGen Consortium. Our strategy focuses on single nucleotide polymorphism (SNPs) in genes that are connected by functional evidence, determined by literature mining and gene ontology (GO) hierarchies, to genes near previously validated eGFR associations. It then requires association thresholds consistent with multiple testing, and finally evaluates novel candidates by independent replication. Among the samples of European ancestry, we identified a genome-wide significant SNP in FBXL20 (P = 5.6 × 10(-9)) in meta-analysis of all available data, and additional SNPs at the INHBC, LRP2, PLEKHA1, SLC3A2 and SLC7A6 genes meeting multiple-testing corrected significance for replication and overall P-values of 4.5 × 10(-4)-2.2 × 10(-7). Neither the novel PLEKHA1 nor FBXL20 associations, both further supported by association with eGFR among African Americans and with transcript abundance, would have been implicated by eGFR candidate gene approaches. LRP2, encoding the megalin receptor, was identified through connection with the previously known eGFR gene DAB2 and extends understanding of the megalin system in kidney function. These findings highlight integration of existing genome-wide association data with independent biological knowledge to uncover novel candidate eGFR associations, including candidates lacking known connections to kidney-specific pathways. The strategy may also be applicable to other clinical phenotypes, although more testing will be needed to assess its potential for discovery in general.
Watson, Corey T; Roussos, Panos; Garg, Paras; Ho, Daniel J; Azam, Nidha; Katsel, Pavel L; Haroutunian, Vahram; Sharp, Andrew J
Alzheimer's disease affects ~13% of people in the United States 65 years and older, making it the most common neurodegenerative disorder. Recent work has identified roles for environmental, genetic, and epigenetic factors in Alzheimer's disease risk. We performed a genome-wide screen of DNA methylation using the Illumina Infinium HumanMethylation450 platform on bulk tissue samples from the superior temporal gyrus of patients with Alzheimer's disease and non-demented controls. We paired a sliding window approach with multivariate linear regression to characterize Alzheimer's disease-associated differentially methylated regions (DMRs). We identified 479 DMRs exhibiting a strong bias for hypermethylated changes, a subset of which were independently associated with aging. DMR intervals overlapped 475 RefSeq genes enriched for gene ontology categories with relevant roles in neuron function and development, as well as cellular metabolism, and included genes reported in Alzheimer's disease genome-wide and epigenome-wide association studies. DMRs were enriched for brain-specific histone signatures and for binding motifs of transcription factors with roles in the brain and Alzheimer's disease pathology. Notably, hypermethylated DMRs preferentially overlapped poised promoter regions, marked by H3K27me3 and H3K4me3, previously shown to co-localize with aging-associated hypermethylation. Finally, the integration of DMR-associated single nucleotide polymorphisms with Alzheimer's disease genome-wide association study risk loci and brain expression quantitative trait loci highlights multiple potential DMRs of interest for further functional analysis. We have characterized changes in DNA methylation in the superior temporal gyrus of patients with Alzheimer's disease, highlighting novel loci that facilitate better characterization of pathways and mechanisms underlying Alzheimer's disease pathogenesis, and improve our understanding of epigenetic signatures that may contribute to the
Elisabeth M van Leeuwen
Full Text Available Genome-wide association studies (GWAS have revealed 74 single nucleotide polymorphisms (SNPs associated with high-density lipoprotein cholesterol (HDL blood levels. This study is, to our knowledge, the first genome-wide interaction study (GWIS to identify SNP×SNP interactions associated with HDL levels. We performed a GWIS in the Rotterdam Study (RS cohort I (RS-I using the GLIDE tool which leverages the massively parallel computing power of Graphics Processing Units (GPUs to perform linear regression on all genome-wide pairs of SNPs. By performing a meta-analysis together with Rotterdam Study cohorts II and III (RS-II and RS-III, we were able to filter 181 interaction terms with a p-value<1 · 10-8 that replicated in the two independent cohorts. We were not able to replicate any of these interaction term in the AGES, ARIC, CHS, ERF, FHS and NFBC-66 cohorts (Ntotal = 30,011 when adjusting for multiple testing. Our GWIS resulted in the consistent finding of a possible interaction between rs774801 in ARMC8 (ENSG00000114098 and rs12442098 in SPATA8 (ENSG00000185594 being associated with HDL levels. However, p-values do not reach the preset Bonferroni correction of the p-values. Our study suggest that even for highly genetically determined traits such as HDL the sample sizes needed to detect SNP×SNP interactions are large and the 2-step filtering approaches do not yield a solution. Here we present our analysis plan and our reservations concerning GWIS.
Full Text Available Abstract Background Complementary single-nucleotide polymorphisms (SNPs may not be distributed equally between two DNA strands if the strands are functionally distinct, such as in transcribed genes. In introns, an excess of A↔G over the complementary C↔T substitutions had previously been found and attributed to transcription-coupled repair (TCR, demonstrating the valuable functional clues that can be obtained by studying such asymmetry. Here we studied asymmetry of human synonymous SNPs (sSNPs in the fourfold degenerate (FFD sites as compared to intronic SNPs (iSNPs. Results The identities of the ancestral bases and the direction of mutations were inferred from human-chimpanzee genomic alignment. After correction for background nucleotide composition, excess of A→G over the complementary T→C polymorphisms, which was observed previously and can be explained by TCR, was confirmed in FFD SNPs and iSNPs. However, when SNPs were separately examined according to whether they mapped to a CpG dinucleotide or not, an excess of C→T over G→A polymorphisms was found in non-CpG site FFD SNPs but was absent from iSNPs and CpG site FFD SNPs. Conclusion The genome-wide discrepancy of human FFD SNPs provides novel evidence for widespread selective pressure due to functional effects of sSNPs. The similar asymmetry pattern of FFD SNPs and iSNPs that map to a CpG can be explained by transcription-coupled mechanisms, including TCR and transcription-coupled mutation. Because of the hypermutability of CpG sites, more CpG site FFD SNPs are relatively younger and have confronted less selection effect than non-CpG FFD SNPs, which can explain the asymmetric discrepancy of CpG site FFD SNPs vs. non-CpG site FFD SNPs.
Gois, I B; Borém, A; Cristofani-Yaly, M; de Resende, M D V; Azevedo, C F; Bastianel, M; Novelli, V M; Machado, M A
Genome wide selection (GWS) is essential for the genetic improvement of perennial species such as Citrus because of its ability to increase gain per unit time and to enable the efficient selection of characteristics with low heritability. This study assessed GWS efficiency in a population of Citrus and compared it with selection based on phenotypic data. A total of 180 individual trees from a cross between Pera sweet orange (Citrus sinensis Osbeck) and Murcott tangor (Citrus sinensis Osbeck x Citrus reticulata Blanco) were evaluated for 10 characteristics related to fruit quality. The hybrids were genotyped using 5287 DArT_seq TM (diversity arrays technology) molecular markers and their effects on phenotypes were predicted using the random regression - best linear unbiased predictor (rr-BLUP) method. The predictive ability, prediction bias, and accuracy of GWS were estimated to verify its effectiveness for phenotype prediction. The proportion of genetic variance explained by the markers was also computed. The heritability of the traits, as determined by markers, was 16-28%. The predictive ability of these markers ranged from 0.53 to 0.64, and the regression coefficients between predicted and observed phenotypes were close to unity. Over 35% of the genetic variance was accounted for by the markers. Accuracy estimates with GWS were lower than those obtained by phenotypic analysis; however, GWS was superior in terms of genetic gain per unit time. Thus, GWS may be useful for Citrus breeding as it can predict phenotypes early and accurately, and reduce the length of the selection cycle. This study demonstrates the feasibility of genomic selection in Citrus.
Cho, Seoae; Kim, Haseong; Oh, Sohee; Kim, Kyunga; Park, Taesung
The current trend in genome-wide association studies is to identify regions where the true disease-causing genes may lie by evaluating thousands of single-nucleotide polymorphisms (SNPs) across the whole genome. However, many challenges exist in detecting disease-causing genes among the thousands of SNPs. Examples include multicollinearity and multiple testing issues, especially when a large number of correlated SNPs are simultaneously tested. Multicollinearity can often occur when predictor variables in a multiple regression model are highly correlated, and can cause imprecise estimation of association. In this study, we propose a simple stepwise procedure that identifies disease-causing SNPs simultaneously by employing elastic-net regularization, a variable selection method that allows one to address multicollinearity. At Step 1, the single-marker association analysis was conducted to screen SNPs. At Step 2, the multiple-marker association was scanned based on the elastic-net regularization. The proposed approach was applied to the rheumatoid arthritis (RA) case-control data set of Genetic Analysis Workshop 16. While the selected SNPs at the screening step are located mostly on chromosome 6, the elastic-net approach identified putative RA-related SNPs on other chromosomes in an increased proportion. For some of those putative RA-related SNPs, we identified the interactions with sex, a well known factor affecting RA susceptibility.
Assari, Raheleh; Aghighi, Yahya; Ziaee, Vahid; Sadr, Maryam; Rahmani, Farzaneh; Rezaei, Arezou; Sadr, Zeinab; Moradinejad, Mohammad Hassan; Raeeskarami, Seyed Reza; Rezaei, Nima
Kawasaki disease (KD) is a systemic vasculitis of children associated with cardiovascular sequelae. Proinflammatory cytokines play a major role in KD pathogenesis. However, their role is both influenced and modified by regulatory T-cells. IL-1 gene cluster, IL-6 and TNF-α polymorphisms have shown significant associations with some vasculitides. Herein we investigated their role in KD. Fifty-five patients with KD who were randomly selected from referrals to the main pediatric hospital were enrolled in this case-control study. Single nucleotide polymorphisms (SNPs) of the following genes were assessed in patients and 140 healthy subjects as control group: IL-1α at -889 (rs1800587), IL-1β at -511 (rs16944), IL-1β at +3962 (rs1143634), IL-1R at Pst-I 1970 (rs2234650), IL-1RN/A at Mspa-I 11100 (rs315952), TNF-α at -308 (rs1800629), TNF-α at -238, IL-6 at -174 (rs1800795) and IL-6 at +565. Twenty-one percent of the control group had A allele at TNF-α -238 while only 8% of KD patients had A allele at this position (P = 0.003, OR [95%CI] = 0.32 [0.14-0.71]). Consistently, TNF-α genotype GG at -238 had significant association with KD (OR [95% CI] = 4.31 [1.79-10.73]). Most controls carried the CG genotype at IL-6 -174 (n = 93 [66.9%]) while GG genotype was the most common genotype (n = 27 [49%]) among patients. Carriers of the GG haplotype at TNF-α (-308, -238) were significantly more prevalent among the KD group. No association was found between IL-1 gene cluster, allelic or haplotypic variants and KD. TNF-α GG genotype at -238 and GG haplotype at positions -308 and -238 were associated with KD in an Iranian population. © 2016 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.
Weidinger, Stephan; Willis-Owen, Saffron A G; Kamatani, Yoichiro; Baurecht, Hansjörg; Morar, Nilesh; Liang, Liming; Edser, Pauline; Street, Teresa; Rodriguez, Elke; O'Regan, Grainne M; Beattie, Paula; Fölster-Holst, Regina; Franke, Andre; Novak, Natalija; Fahy, Caoimhe M; Winge, Mårten C G; Kabesch, Michael; Illig, Thomas; Heath, Simon; Söderhäll, Cilla; Melén, Erik; Pershagen, Göran; Kere, Juha; Bradley, Maria; Lieden, Agne; Nordenskjold, Magnus; Harper, John I; McLean, W H Irwin; Brown, Sara J; Cookson, William O C; Lathrop, G Mark; Irvine, Alan D; Moffatt, Miriam F
Atopic dermatitis (AD) is the most common dermatological disease of childhood. Many children with AD have asthma and AD shares regions of genetic linkage with psoriasis, another chronic inflammatory skin disease. We present here a genome-wide association study (GWAS) of childhood-onset AD in 1563 European cases with known asthma status and 4054 European controls. Using Illumina genotyping followed by imputation, we generated 268 034 consensus genotypes and in excess of 2 million single nucleotide polymorphisms (SNPs) for analysis. Association signals were assessed for replication in a second panel of 2286 European cases and 3160 European controls. Four loci achieved genome-wide significance for AD and replicated consistently across all cohorts. These included the epidermal differentiation complex (EDC) on chromosome 1, the genomic region proximal to LRRC32 on chromosome 11, the RAD50/IL13 locus on chromosome 5 and the major histocompatibility complex (MHC) on chromosome 6; reflecting action of classical HLA alleles. We observed variation in the contribution towards co-morbid asthma for these regions of association. We further explored the genetic relationship between AD, asthma and psoriasis by examining previously identified susceptibility SNPs for these diseases. We found considerable overlap between AD and psoriasis together with variable coincidence between allergic rhinitis (AR) and asthma. Our results indicate that the pathogenesis of AD incorporates immune and epidermal barrier defects with combinations of specific and overlapping effects at individual loci.
Hanson, Robert L; Muller, Yunhua L; Kobes, Sayuko; Guo, Tingwei; Bian, Li; Ossowski, Victoria; Wiedrich, Kim; Sutherland, Jeffrey; Wiedrich, Christopher; Mahkee, Darin; Huang, Ke; Abdussamad, Maryam; Traurig, Michael; Weil, E Jennifer; Nelson, Robert G; Bennett, Peter H; Knowler, William C; Bogardus, Clifton; Baier, Leslie J
Most genetic variants associated with type 2 diabetes mellitus (T2DM) have been identified through genome-wide association studies (GWASs) in Europeans. The current study reports a GWAS for young-onset T2DM in American Indians. Participants were selected from a longitudinal study conducted in Pima Indians and included 278 cases with diabetes with onset before 25 years of age, 295 nondiabetic controls ≥45 years of age, and 267 siblings of cases or controls. Individuals were genotyped on a ∼1M single nucleotide polymorphism (SNP) array, resulting in 453,654 SNPs with minor allele frequency >0.05. SNPs were analyzed for association in cases and controls, and a family-based association test was conducted. Tag SNPs (n = 311) were selected for 499 SNPs associated with diabetes (P associated with T2DM (odds ratio = 1.29 per copy of the T allele; P = 6.6 × 10(-8), which represents genome-wide significance accounting for the number of effectively independent SNPs analyzed). Transfection studies in murine pancreatic β-cells suggested that DNER regulates expression of notch signaling pathway genes. These studies implicate DNER as a susceptibility gene for T2DM in American Indians.
Full Text Available Broadly neutralizing antibodies may protect against HIV-1 acquisition. In natural infection, only 10-30% of patients have cross-reactive neutralizing humoral immunity which may relate to viral and or host factors. To explore the role of host genetic markers in the formation of cross-reactive neutralizing activity (CrNA in HIV-1 infected individuals, we performed a genome-wide association study (GWAS, in participants of the Amsterdam Cohort Studies with known CrNA in their sera. Single-nucleotide polymorphisms (SNPs with the strongest P-values are located in the major histocompatibility complex (MHC region, close to MICA (P = 7.68 × 10(-7, HLA-B (P = 6.96 × 10(-6 and in the coding region of HCP5 (P = 1.34 × 10(-5. However, none of the signals reached genome-wide significance. Our findings underline the potential involvement of genes close or within the MHC region with the development of CrNA.
Euler, Zelda; van Gils, Marit J.; Boeser-Nunnink, Brigitte D.; Schuitemaker, Hanneke; van Manen, Daniëlle
Broadly neutralizing antibodies may protect against HIV-1 acquisition. In natural infection, only 10–30% of patients have cross-reactive neutralizing humoral immunity which may relate to viral and or host factors. To explore the role of host genetic markers in the formation of cross-reactive neutralizing activity (CrNA) in HIV-1 infected individuals, we performed a genome-wide association study (GWAS), in participants of the Amsterdam Cohort Studies with known CrNA in their sera. Single-nucleotide polymorphisms (SNPs) with the strongest P-values are located in the major histocompatibility complex (MHC) region, close to MICA (P = 7.68×10−7), HLA-B (P = 6.96×10−6) and in the coding region of HCP5 (P = 1.34×10−5). However, none of the signals reached genome-wide significance. Our findings underline the potential involvement of genes close or within the MHC region with the development of CrNA. PMID:23372753
Kulbrock, Maike; Lehner, Stefanie; Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar
Equine recurrent uveitis (ERU) is a common eye disease affecting up to 3–15% of the horse population. A genome-wide association study (GWAS) using the Illumina equine SNP50 bead chip was performed to identify loci conferring risk to ERU. The sample included a total of 144 German warmblood horses. A GWAS showed a significant single nucleotide polymorphism (SNP) on horse chromosome (ECA) 20 at 49.3 Mb, with IL-17A and IL-17F being the closest genes. This locus explained a fraction of 23% of the phenotypic variance for ERU. A GWAS taking into account the severity of ERU, revealed a SNP on ECA18 nearby to the crystalline gene cluster CRYGA-CRYGF. For both genomic regions on ECA18 and 20, significantly associated haplotypes containing the genome-wide significant SNPs could be demonstrated. In conclusion, our results are indicative for a genetic component regulating the possible critical role of IL-17A and IL-17F in the pathogenesis of ERU. The associated SNP on ECA18 may be indicative for cataract formation in the course of ERU. PMID:23977091
Wei, Lijuan; Qu, Cunmin; Xu, Xinfu; Lu, Kun; Qian, Wei; Li, Jiana; Li, Maoteng; Liu, Liezhao
A stable yellow-seeded variety is the breeding goal for obtaining the ideal rapeseed (Brassica napus L.) plant, and the amount of acid detergent lignin (ADL) in the seeds and the hull content (HC) are often used as yellow-seeded rapeseed screening indices. In this study, a genome-wide association analysis of 520 accessions was performed using the Q + K model with a total of 31,839 single-nucleotide polymorphism (SNP) sites. As a result, three significant associations on the B. napus chromosomes A05, A09, and C05 were detected for seed ADL content. The peak SNPs were within 9.27, 14.22, and 20.86 kb of the key genes BnaA.PAL4, BnaA.CAD2/BnaA.CAD3, and BnaC.CCR1, respectively. Further analyses were performed on the major locus of A05, which was also detected in the seed HC examination. A comparison of our genome-wide association study (GWAS) results and previous linkage mappings revealed a common chromosomal region on A09, which indicates that GWAS can be used as a powerful complementary strategy for dissecting complex traits in B. napus. Genomic selection (GS) utilizing the significant SNP markers based on the GWAS results exhibited increased predictive ability, indicating that the predictive ability of a given model can be substantially improved by using GWAS and GS. PMID:26673885
Tsai, Chia-Ti; Hsieh, Chia-Shan; Chang, Sheng-Nan; Chuang, Eric Y.; Ueng, Kwo-Chang; Tsai, Chin-Feng; Lin, Tsung-Hsien; Wu, Cho-Kai; Lee, Jen-Kuang; Lin, Lian-Yu; Wang, Yi-Chih; Yu, Chih-Chieh; Lai, Ling-Ping; Tseng, Chuen-Den; Hwang, Juey-Jen; Chiang, Fu-Tien; Lin, Jiunn-Lee
Atrial fibrillation (AF) is the most common sustained cardiac arrhythmia. Previous genome-wide association studies had identified single-nucleotide polymorphisms in several genomic regions to be associated with AF. In human genome, copy number variations (CNVs) are known to contribute to disease susceptibility. Using a genome-wide multistage approach to identify AF susceptibility CNVs, we here show a common 4,470-bp diallelic CNV in the first intron of potassium interacting channel 1 gene (KCNIP1) is strongly associated with AF in Taiwanese populations (odds ratio=2.27 for insertion allele; P=6.23 × 10−24). KCNIP1 insertion is associated with higher KCNIP1 mRNA expression. KCNIP1-encoded protein potassium interacting channel 1 (KCHIP1) is physically associated with potassium Kv channels and modulates atrial transient outward current in cardiac myocytes. Overexpression of KCNIP1 results in inducible AF in zebrafish. In conclusions, a common CNV in KCNIP1 gene is a genetic predictor of AF risk possibly pointing to a functional pathway. PMID:26831368
Gref, Anna; Merid, Simon K; Gruzieva, Olena; Ballereau, Stéphane; Becker, Allan; Bellander, Tom; Bergström, Anna; Bossé, Yohan; Bottai, Matteo; Chan-Yeung, Moira; Fuertes, Elaine; Ierodiakonou, Despo; Jiang, Ruiwei; Joly, Stéphane; Jones, Meaghan; Kobor, Michael S; Korek, Michal; Kozyrskyj, Anita L; Kumar, Ashish; Lemonnier, Nathanaël; MacIntyre, Elaina; Ménard, Camille; Nickle, David; Obeidat, Ma'en; Pellet, Johann; Standl, Marie; Sääf, Annika; Söderhäll, Cilla; Tiesler, Carla M T; van den Berge, Maarten; Vonk, Judith M; Vora, Hita; Xu, Cheng-Jian; Antó, Josep M; Auffray, Charles; Brauer, Michael; Bousquet, Jean; Brunekreef, Bert; Gauderman, W James; Heinrich, Joachim; Kere, Juha; Koppelman, Gerard H; Postma, Dirkje; Carlsten, Christopher; Pershagen, Göran; Melén, Erik
The evidence supporting an association between traffic-related air pollution exposure and incident childhood asthma is inconsistent and may depend on genetic factors. To identify gene-environment interaction effects on childhood asthma using genome-wide single-nucleotide polymorphism (SNP) data and air pollution exposure. Identified loci were further analyzed at epigenetic and transcriptomic levels. We used land use regression models to estimate individual air pollution exposure (represented by outdoor NO 2 levels) at the birth address and performed a genome-wide interaction study for doctors' diagnoses of asthma up to 8 years in three European birth cohorts (n = 1,534) with look-up for interaction in two separate North American cohorts, CHS (Children's Health Study) and CAPPS/SAGE (Canadian Asthma Primary Prevention Study/Study of Asthma, Genetics and Environment) (n = 1,602 and 186 subjects, respectively). We assessed expression quantitative trait locus effects in human lung specimens and blood, as well as associations among air pollution exposure, methylation, and transcriptomic patterns. In the European cohorts, 186 SNPs had an interaction P asthma development and provided supportive evidence for interaction with air pollution for ADCY2, B4GALT5, and DLG2.
Matsuda, Fumio; Nakabayashi, Ryo; Yang, Zhigang; Okazaki, Yozo; Yonemaru, Jun-ichi; Ebana, Kaworu; Yano, Masahiro; Saito, Kazuki
Plants produce structurally diverse secondary (specialized) metabolites to increase their fitness for survival under adverse environments. Several bioactive compounds for new drugs have been identified through screening of plant extracts. In this study, genome-wide association studies (GWAS) were conducted to investigate the genetic architecture behind the natural variation of rice secondary metabolites. GWAS using the metabolome data of 175 rice accessions successfully identified 323 associations among 143 single nucleotide polymorphisms (SNPs) and 89 metabolites. The data analysis highlighted that levels of many metabolites are tightly associated with a small number of strong quantitative trait loci (QTLs). The tight association may be a mechanism generating strains with distinct metabolic composition through the crossing of two different strains. The results indicate that one plant species produces more diverse phytochemicals than previously expected, and plants still contain many useful compounds for human applications. PMID:25267402
Croteau-Chonka, Damien C; Marvelle, Amanda F; Lange, Ethan M; Lee, Nanette R; Adair, Linda S; Lange, Leslie A; Mohlke, Karen L
Increased values of multiple adiposity-related anthropometric traits are important risk factors for many common complex diseases. We performed a genome-wide association (GWA) study for four quantitative traits related to body size and adiposity (BMI, weight, waist circumference, and height) in a cohort of 1,792 adult Filipino women from the Cebu Longitudinal Health and Nutrition Survey (CLHNS). This is the first GWA study of anthropometric traits in Filipinos, a population experiencing a rapid transition into a more obesogenic environment. In addition to identifying suggestive evidence of additional single-nucleotide polymorphism (SNP) association signals (P Filipinos and provide further insight into the effects of BDNF, FTO, and MC4R on BMI.
Galvan, Antonella; Falvella, Felicia S; Frullanti, Elisa; Spinola, Monica; Incarbone, Matteo; Nosotti, Mario; Santambrogio, Luigi; Conti, Barbara; Pastorino, Ugo; Gonzalez-Neira, Anna; Dragani, Tommaso A
We analyzed a series of young (median age = 52 years) non-smoker lung cancer patients and their unaffected siblings as controls, using a genome-wide 620 901 single-nucleotide polymorphism (SNP) array analysis and a case-control DNA pooling approach. We identified 82 putatively associated SNPs that were retested by individual genotyping followed by use of the sib transmission disequilibrium test, pointing to 36 SNPs associated with lung cancer risk in the discordant sibs series. Analysis of these 36 SNPs in a polygenic model characterized by additive and interchangeable effects of rare alleles revealed a highly statistically significant dosage-dependent association between risk allele carrier status and proportion of cancer cases. Replication of the same 36 SNPs in a population-based series confirmed the association with lung cancer for three SNPs, suggesting that phenocopies and genetic heterogeneity can play a major role in the complex genetics of lung cancer risk in the general population.
Full Text Available Genome-wide association study (GWAS aims to discover genetic factors underlying phenotypic traits. The large number of genetic factors poses both computational and statistical challenges. Various computational approaches have been developed for large scale GWAS. In this chapter, we will discuss several widely used computational approaches in GWAS. The following topics will be covered: (1 An introduction to the background of GWAS. (2 The existing computational approaches that are widely used in GWAS. This will cover single-locus, epistasis detection, and machine learning methods that have been recently developed in biology, statistic, and computer science communities. This part will be the main focus of this chapter. (3 The limitations of current approaches and future directions.
Mulder, H A; Crump, R E; Calus, M P L; Veerkamp, R F
In recent years, it has been shown that not only is the phenotype under genetic control, but also the environmental variance. Very little, however, is known about the genetic architecture of environmental variance. The main objective of this study was to unravel the genetic architecture of the mean and environmental variance of somatic cell score (SCS) by identifying genome-wide associations for mean and environmental variance of SCS in dairy cows and by quantifying the accuracy of genome-wide breeding values. Somatic cell score was used because previous research has shown that the environmental variance of SCS is partly under genetic control and reduction of the variance of SCS by selection is desirable. In this study, we used 37,590 single nucleotide polymorphism (SNP) genotypes and 46,353 test-day records of 1,642 cows at experimental research farms in 4 countries in Europe. We used a genomic relationship matrix in a double hierarchical generalized linear model to estimate genome-wide breeding values and genetic parameters. The estimated mean and environmental variance per cow was used in a Bayesian multi-locus model to identify SNP associated with either the mean or the environmental variance of SCS. Based on the obtained accuracy of genome-wide breeding values, 985 and 541 independent chromosome segments affecting the mean and environmental variance of SCS, respectively, were identified. Using a genomic relationship matrix increased the accuracy of breeding values relative to using a pedigree relationship matrix. In total, 43 SNP were significantly associated with either the mean (22) or the environmental variance of SCS (21). The SNP with the highest Bayes factor was on chromosome 9 (Hapmap31053-BTA-111664) explaining approximately 3% of the genetic variance of the environmental variance of SCS. Other significant SNP explained less than 1% of the genetic variance. It can be concluded that fewer genomic regions affect the environmental variance of SCS than the
Tabas-Madrid, Daniel; Méndez-Vigo, Belén; Arteaga, Noelia; Marcer, Arnald; Pascual-Montano, Alberto; Weigel, Detlef; Xavier Picó, F; Alonso-Blanco, Carlos
Current global change is fueling an interest to understand the genetic and molecular mechanisms of plant adaptation to climate. In particular, altered flowering time is a common strategy for escape from unfavourable climate temperature. In order to determine the genomic bases underlying flowering time adaptation to this climatic factor, we have systematically analysed a collection of 174 highly diverse Arabidopsis thaliana accessions from the Iberian Peninsula. Analyses of 1.88 million single nucleotide polymorphisms provide evidence for a spatially heterogeneous contribution of demographic and adaptive processes to geographic patterns of genetic variation. Mountains appear to be allele dispersal barriers, whereas the relationship between flowering time and temperature depended on the precise temperature range. Environmental genome-wide associations supported an overall genome adaptation to temperature, with 9.4% of the genes showing significant associations. Furthermore, phenotypic genome-wide associations provided a catalogue of candidate genes underlying flowering time variation. Finally, comparison of environmental and phenotypic genome-wide associations identified known (Twin Sister of FT, FRIGIDA-like 1, and Casein Kinase II Beta chain 1) and new (Epithiospecifer Modifier 1 and Voltage-Dependent Anion Channel 5) genes as candidates for adaptation to climate temperature by altered flowering time. Thus, this regional collection provides an excellent resource to address the spatial complexity of climate adaptation in annual plants. © 2018 John Wiley & Sons Ltd.
Afsari, Sepideh; Korshoj, Lee E; Abel, Gary R; Khan, Sajida; Chatterjee, Anushree; Nagpal, Prashant
Several nanoscale electronic methods have been proposed for high-throughput single-molecule nucleic acid sequence identification. While many studies display a large ensemble of measurements as "electronic fingerprints" with some promise for distinguishing the DNA and RNA nucleobases (adenine, guanine, cytosine, thymine, and uracil), important metrics such as accuracy and confidence of base calling fall well below the current genomic methods. Issues such as unreliable metal-molecule junction formation, variation of nucleotide conformations, insufficient differences between the molecular orbitals responsible for single-nucleotide conduction, and lack of rigorous base calling algorithms lead to overlapping nanoelectronic measurements and poor nucleotide discrimination, especially at low coverage on single molecules. Here, we demonstrate a technique for reproducible conductance measurements on conformation-constrained single nucleotides and an advanced algorithmic approach for distinguishing the nucleobases. Our quantum point contact single-nucleotide conductance sequencing (QPICS) method uses combed and electrostatically bound single DNA and RNA nucleotides on a self-assembled monolayer of cysteamine molecules. We demonstrate that by varying the applied bias and pH conditions, molecular conductance can be switched ON and OFF, leading to reversible nucleotide perturbation for electronic recognition (NPER). We utilize NPER as a method to achieve >99.7% accuracy for DNA and RNA base calling at low molecular coverage (∼12×) using unbiased single measurements on DNA/RNA nucleotides, which represents a significant advance compared to existing sequencing methods. These results demonstrate the potential for utilizing simple surface modifications and existing biochemical moieties in individual nucleobases for a reliable, direct, single-molecule, nanoelectronic DNA and RNA nucleotide identification method for sequencing.
Wang, Jia; Xian, Xiaohua; Xu, Xinfu; Qu, Cunmin; Lu, Kun; Li, Jiana; Liu, Liezhao
Seed coat color is an extremely important breeding characteristic of Brassica napus. To elucidate the factors affecting the genetic architecture of seed coat color, a genome-wide association study (GWAS) of seed coat color was conducted with a diversity panel comprising 520 B. napus cultivars and inbred lines. In total, 22 single-nucleotide polymorphisms (SNPs) distributed on 7 chromosomes were found to be associated with seed coat color. The most significant SNPs were found in 2014 near Bn-scaff_15763_1-p233999, only 43.42 kb away from BnaC06g17050D, which is orthologous to Arabidopsis thaliana TRANSPARENT TESTA 12 (TT12), an important gene involved in the transportation of proanthocyanidin precursors into the vacuole. Two of eight repeatedly detected SNPs can be identified and digested by restriction enzymes. Candidate gene mining revealed that the relevant regions of significant SNP loci on the A09 and C08 chromosomes are highly homologous. Moreover, a comparison of the GWAS results to those of previous quantitative trait locus (QTL) studies showed that 11 SNPs were located in the confidence intervals of the QTLs identified in previous studies based on linkage analyses or association mapping. Our results provide insights into the genetic basis of seed coat color in B. napus, and the beneficial allele, SNP information, and candidate genes should be useful for selecting yellow seeds in B. napus breeding.
Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R.; Correia, Catarina; Abrahams, Brett S.; Sykes, Nuala; Pagnamenta, Alistair T.; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J.; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F.; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R.; Casallo, Guillermo; Casey, Jillian; Chu, Su H.; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L.; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A.; Folstein, Susan E.; Fombonne, Eric; Freitag, Christine M.; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T.; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J.; Hakonarson, Hakon; Heron, Elizabeth A.; Hill, Matthew; Holt, Richard; Howe, Jennifer L.; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M.; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M.; Lamb, Janine A.; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L.; Lionel, Anath C.; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C.; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R.; McConachie, Helen; McDougle, Christopher J.; McGrath, Jane; McMahon, William M.; Melhem, Nadine M.; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J.; Mirza, Ghazala K.; Munson, Jeff; Nelson, Stanley F.; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R.; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L.; Bierut, Laura J.; Rice, John P.; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C.; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P.; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B.; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H.; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L.; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D.; Cantor, Rita M.; Cook, Edwin H.; Coon, Hilary; Cuccaro, Michael L.; Gallagher, Louise; Geschwind, Daniel H.; Gill, Michael; Haines, Jonathan L.; Miller, Judith; Monaco, Anthony P.; Nurnberger, John I.; Paterson, Andrew D.; Pericak-Vance, Margaret A.; Schellenberg, Gerard D.; Scherer, Stephen W.; Sutcliffe, James S.; Szatmari, Peter; Vicente, Astrid M.; Vieland, Veronica J.; Wijsman, Ellen M.; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim
Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10−8. When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10−8 threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C. PMID:20663923
Full Text Available Peanut (Arachis hypogaea consists of two subspecies, hypogaea and fastigiata, and has been cultivated worldwide for hundreds of years. Here, 158 peanut accessions were selected to dissect the molecular footprint of agronomic traits related to domestication using specific-locus amplified fragment sequencing (SLAF-seq method. Then, a total of 17,338 high-quality single nucleotide polymorphisms (SNPs in the whole peanut genome were revealed. Eleven agronomic traits in 158 peanut accessions were subsequently analyzed using genome-wide association studies (GWAS. Candidate genes responsible for corresponding traits were then analyzed in genomic regions surrounding the peak SNPs, and 1,429 genes were found within 200 kb windows centerd on GWAS-identified peak SNPs related to domestication. Highly differentiated genomic regions were observed between hypogaea and fastigiata accessions using FST values and sequence diversity (π ratios. Among the 1,429 genes, 662 were located on chromosome A3, suggesting the presence of major selective sweeps caused by artificial selection during long domestication. These findings provide a promising insight into the complicated genetic architecture of domestication-related traits in peanut, and reveal whole-genome SNP markers of beneficial candidate genes for marker-assisted selection (MAS in future breeding programs.
Beecham, Ashley; Dong, Chuanhui; Wright, Clinton B; Dueker, Nicole; Brickman, Adam M; Wang, Liyong; DeCarli, Charles; Blanton, Susan H; Rundek, Tatjana; Mayeux, Richard; Sacco, Ralph L
To investigate genetic variants influencing white matter hyperintensities (WMHs) in the understudied Hispanic population. Using 6.8 million single nucleotide polymorphisms (SNPs), we conducted a genome-wide association study (GWAS) to identify SNPs associated with WMH volume (WMHV) in 922 Hispanics who underwent brain MRI as a cross-section of 2 community-based cohorts in the Northern Manhattan Study and the Washington Heights-Inwood Columbia Aging Project. Multiple linear modeling with PLINK was performed to examine the additive genetic effects on ln(WMHV) after controlling for age, sex, total intracranial volume, and principal components of ancestry. Gene-based tests of association were performed using VEGAS. Replication was performed in independent samples of Europeans, African Americans, and Asians. From the SNP analysis, a total of 17 independent SNPs in 7 genes had suggestive evidence of association with WMHV in Hispanics ( p < 1 × 10 -5 ) and 5 genes from the gene-based analysis with p < 1 × 10 -3 . One SNP (rs9957475 in GATA6 ) and 1 gene ( UBE2C ) demonstrated evidence of association ( p < 0.05) in the African American sample. Four SNPs with p < 1 × 10 -5 were shown to affect binding of SPI1 using RegulomeDB. This GWAS of 2 community-based Hispanic cohorts revealed several novel WMH-associated genetic variants. Further replication is needed in independent Hispanic samples to validate these suggestive associations, and fine mapping is needed to pinpoint causal variants.
Easton, Douglas F.; Pooley, Karen A.; Dunning, Alison M.; Pharoah, Paul D. P.; Thompson, Deborah; Ballinger, Dennis G.; Struewing, Jeffery P.; Morrison, Jonathan; Field, Helen; Luben, Robert; Wareham, Nicholas; Ahmed, Shahana; Healey, Catherine S.; Bowman, Richard; Meyer, Kerstin B.; Haiman, Christopher A.; Kolonel, Laurence K.; Henderson, Brian E.; Marchand, Loic Le; Brennan, Paul; Sangrajrang, Suleeporn; Gaborieau, Valerie; Odefrey, Fabrice; Shen, Chen-Yang; Wu, Pei-Ei; Wang, Hui-Chun; Eccles, Diana; Evans, D. Gareth; Peto, Julian; Fletcher, Olivia; Johnson, Nichola; Seal, Sheila; Stratton, Michael R.; Rahman, Nazneen; Chenevix-Trench, Georgia; Bojesen, Stig E.; Nordestgaard, Børge G.; Axelsson, Christen K.; Garcia-Closas, Montserrat; Brinton, Louise; Chanock, Stephen; Lissowska, Jolanta; Peplonska, Beata; Nevanlinna, Heli; Fagerholm, Rainer; Eerola, Hannaleena; Kang, Daehee; Yoo, Keun-Young; Noh, Dong-Young; Ahn, Sei-Hyun; Hunter, David J.; Hankinson, Susan E.; Cox, David G.; Hall, Per; Wedren, Sara; Liu, Jianjun; Low, Yen-Ling; Bogdanova, Natalia; Schürmann, Peter; Dörk, Thilo; Tollenaar, Rob A. E. M.; Jacobi, Catharina E.; Devilee, Peter; Klijn, Jan G. M.; Sigurdson, Alice J.; Doody, Michele M.; Alexander, Bruce H.; Zhang, Jinghui; Cox, Angela; Brock, Ian W.; MacPherson, Gordon; Reed, Malcolm W. R.; Couch, Fergus J.; Goode, Ellen L.; Olson, Janet E.; Meijers-Heijboer, Hanne; van den Ouweland, Ans; Uitterlinden, André; Rivadeneira, Fernando; Milne, Roger L.; Ribas, Gloria; Gonzalez-Neira, Anna; Benitez, Javier; Hopper, John L.; McCredie, Margaret; Southey, Melissa; Giles, Graham G.; Schroen, Chris; Justenhoven, Christina; Brauch, Hiltrud; Hamann, Ute; Ko, Yon-Dschun; Spurdle, Amanda B.; Beesley, Jonathan; Chen, Xiaoqing; Mannermaa, Arto; Kosma, Veli-Matti; Kataja, Vesa; Hartikainen, Jaana; Day, Nicholas E.; Cox, David R.; Ponder, Bruce A. J.; Luccarini, Craig; Conroy, Don; Shah, Mitul; Munday, Hannah; Jordan, Clare; Perkins, Barbara; West, Judy; Redman, Karen; Driver, Kristy; Aghmesheh, Morteza; Amor, David; Andrews, Lesley; Antill, Yoland; Armes, Jane; Armitage, Shane; Arnold, Leanne; Balleine, Rosemary; Begley, Glenn; Beilby, John; Bennett, Ian; Bennett, Barbara; Berry, Geoffrey; Blackburn, Anneke; Brennan, Meagan; Brown, Melissa; Buckley, Michael; Burke, Jo; Butow, Phyllis; Byron, Keith; Callen, David; Campbell, Ian; Chenevix-Trench, Georgia; Clarke, Christine; Colley, Alison; Cotton, Dick; Cui, Jisheng; Culling, Bronwyn; Cummings, Margaret; Dawson, Sarah-Jane; Dixon, Joanne; Dobrovic, Alexander; Dudding, Tracy; Edkins, Ted; Eisenbruch, Maurice; Farshid, Gelareh; Fawcett, Susan; Field, Michael; Firgaira, Frank; Fleming, Jean; Forbes, John; Friedlander, Michael; Gaff, Clara; Gardner, Mac; Gattas, Mike; George, Peter; Giles, Graham; Gill, Grantley; Goldblatt, Jack; Greening, Sian; Grist, Scott; Haan, Eric; Harris, Marion; Hart, Stewart; Hayward, Nick; Hopper, John; Humphrey, Evelyn; Jenkins, Mark; Jones, Alison; Kefford, Rick; Kirk, Judy; Kollias, James; Kovalenko, Sergey; Lakhani, Sunil; Leary, Jennifer; Lim, Jacqueline; Lindeman, Geoff; Lipton, Lara; Lobb, Liz; Maclurcan, Mariette; Mann, Graham; Marsh, Deborah; McCredie, Margaret; McKay, Michael; McLachlan, Sue Anne; Meiser, Bettina; Milne, Roger; Mitchell, Gillian; Newman, Beth; O'Loughlin, Imelda; Osborne, Richard; Peters, Lester; Phillips, Kelly; Price, Melanie; Reeve, Jeanne; Reeve, Tony; Richards, Robert; Rinehart, Gina; Robinson, Bridget; Rudzki, Barney; Salisbury, Elizabeth; Sambrook, Joe; Saunders, Christobel; Scott, Clare; Scott, Elizabeth; Scott, Rodney; Seshadri, Ram; Shelling, Andrew; Southey, Melissa; Spurdle, Amanda; Suthers, Graeme; Taylor, Donna; Tennant, Christopher; Thorne, Heather; Townshend, Sharron; Tucker, Kathy; Tyler, Janet; Venter, Deon; Visvader, Jane; Walpole, Ian; Ward, Robin; Waring, Paul; Warner, Bev; Warren, Graham; Watson, Elizabeth; Williams, Rachael; Wilson, Judy; Winship, Ingrid; Young, Mary Ann; Bowtell, David; Green, Adele; deFazio, Anna; Chenevix-Trench, Georgia; Gertig, Dorota; Webb, Penny
Breast cancer exhibits familial aggregation, consistent with variation in genetic susceptibility to the disease. Known susceptibility genes account for less than 25% of the familial risk of breast cancer, and the residual genetic variance is likely to be due to variants conferring more moderate risks. To identify further susceptibility alleles, we conducted a two-stage genome-wide association study in 4,398 breast cancer cases and 4,316 controls, followed by a third stage in which 30 single nucleotide polymorphisms (SNPs) were tested for confirmation in 21,860 cases and 22,578 controls from 22 studies. We used 227,876 SNPs that were estimated to correlate with 77% of known common SNPs in Europeans at r2>0.5. SNPs in five novel independent loci exhibited strong and consistent evidence of association with breast cancer (P<10−7). Four of these contain plausible causative genes (FGFR2, TNRC9, MAP3K1 and LSP1). At the second stage, 1,792 SNPs were significant at the P<0.05 level compared with an estimated 1,343 that would be expected by chance, indicating that many additional common susceptibility alleles may be identifiable by this approach. PMID:17529967
Full Text Available Gastritis is a major disease that has the potential to grow as gastric cancer. Gastric cancer is a very common cancer, and it is related to a very high mortality rate in Korea. This disease is known to have various reasons, including infection with Helicobacter pylori, dietary habits, tobacco, and alcohol. The incidence rate of gastritis has reported to differ between age, population, and gender. However, unlike other factors, there has been no analysis based on gender. So, we examined the high risk factors of gastritis in each gender in the Korean population by focusing on sex. We performed an analysis of 120 clinical characteristics and genome-wide association studies (GWAS using 349,184 single-nucleotide polymorphisms from the results of Anseong and Ansan cohort study in the Korea Association Resource (KARE project. As the result, we could not prove a strong relation with these factors and gastritis or gastric ulcer in the GWAS. However, we confirmed several already-known risk factors and also found some differences of clinical characteristics in each gender using logistic regression. As a result of the logistic regression, a relation with hyperlipidemia, coronary artery disease, myocardial infarction, hyperlipidemia therapy, hypotensive or antihypotensive drug, diastolic blood pressure, and gastritis was seen in males; the results of this study suggest that vascular disease has a potential association with gastritis in males.
Jeffrey A Gross
Full Text Available Suicide and suicide attempts are complex behaviors that result from the interaction of different factors, including genetic variants that increase the predisposition to suicidal behaviors. Copy number variations (CNVs are deletions or duplications of a segment of DNA usually larger than one kilobase. These structural genetic changes, although quite rare, have been associated with genetic liability to mental disorders, such as autism, schizophrenia, and bipolar disorder. No genome-wide level studies have been published investigating the potential role of CNVs in suicidal behaviors. Based on single-nucleotide polymorphism array data, we followed the Penn-CNV standards to detect CNVs in 1,608 subjects, comprising 475 suicide and suicide attempt cases and 1,133 controls. Although the initial algorithms determined the presence of CNVs on chromosomes 6 and 12 in seven and eight cases, respectively, compared with none of the controls, visual inspection of the raw data did not support this finding. Furthermore we were unable to validate these findings by CNV-specific real-time polymerase chain reaction. Additionally, rare CNV burden analysis did not find an association between the frequency or length of rare CNVs and suicidal behavior in our sample population. Although our findings suggest CNVs do not play an important role in the etiology of suicidal behaviors, they are not inconsistent with the strong evidence from the literature suggesting that other genetic variants account for a portion of the total phenotypic variability in suicidal behavior.
Hamzić, Edin; Buitenhuis, Bart; Hérault, Frédéric; Hawken, Rachel; Abrahamsen, Mitchel S; Servin, Bertrand; Elsen, Jean-Michel; Pinard-van der Laan, Marie-Hélène; Bed'Hom, Bertrand
Coccidiosis is the most common and costly disease in the poultry industry and is caused by protozoans of the Eimeria genus. The current control of coccidiosis, based on the use of anticoccidial drugs and vaccination, faces serious obstacles such as drug resistance and the high costs for the development of efficient vaccines, respectively. Therefore, the current control programs must be expanded with complementary approaches such as the use of genetics to improve the host response to Eimeria infections. Recently, we have performed a large-scale challenge study on Cobb500 broilers using E. maxima for which we investigated variability among animals in response to the challenge. As a follow-up to this challenge study, we performed a genome-wide association study (GWAS) to identify genomic regions underlying variability of the measured traits in the response to Eimeria maxima in broilers. Furthermore, we conducted a post-GWAS functional analysis to increase our biological understanding of the underlying response to Eimeria maxima challenge. In total, we identified 22 single nucleotide polymorphisms (SNPs) with q value Eimeria maxima in broilers. Furthermore, the post-GWAS functional analysis indicates that biological pathways and networks involved in tissue proliferation and repair along with the primary innate immune response may play the most important role during the early stage of Eimeria maxima infection in broilers.
vonHoldt, Bridgett M.; Pollinger, John P.; Lohmueller, Kirk E.; Han, Eunjung; Parker, Heidi G.; Quignon, Pascale; Degenhardt, Jeremiah D.; Boyko, Adam R.; Earl, Dent A.; Auton, Adam; Reynolds, Andy; Bryc, Kasia; Brisbin, Abra; Knowles, James C.; Mosher, Dana S.; Spady, Tyrone C.; Elkahloun, Abdel; Geffen, Eli; Pilot, Malgorzata; Jedrzejewski, Wlodzimierz; Greco, Claudia; Randi, Ettore; Bannasch, Danika; Wilton, Alan; Shearman, Jeremy; Musiani, Marco; Cargill, Michelle; Jones, Paul G.; Qian, Zuwei; Huang, Wei; Ding, Zhao-Li; Zhang, Ya-ping; Bustamante, Carlos D.; Ostrander, Elaine A.; Novembre, John; Wayne, Robert K.
Advances in genome technology have facilitated a new understanding of the historical and genetic processes crucial to rapid phenotypic evolution under domestication1,2. To understand the process of dog diversification better, we conducted an extensive genome-wide survey of more than 48,000 single nucleotide polymorphisms in dogs and their wild progenitor, the grey wolf. Here we show that dog breeds share a higher proportion of multi-locus haplotypes unique to grey wolves from the Middle East, indicating that they are a dominant source of genetic diversity for dogs rather than wolves from east Asia, as suggested by mitochondrial DNA sequence data3. Furthermore, we find a surprising correspondence between genetic and phenotypic/functional breed groupings but there are exceptions that suggest phenotypic diversification depended in part on the repeated crossing of individuals with novel phenotypes. Our results show that Middle Eastern wolves were a critical source of genome diversity, although interbreeding with local wolf populations clearly occurred elsewhere in the early history of specific lineages. More recently, the evolution of modern dog breeds seems to have been an iterative process that drew on a limited genetic toolkit to create remarkable phenotypic diversity. PMID:20237475
Full Text Available Rice plants accumulate high concentrations of silicon. Silicon has been shown to be involved in plant growth, high yield, and mitigating biotic and abiotic stresses. However, it has been demonstrated that inorganic arsenic is taken up by rice through silicon transporters under anaerobic conditions, thus the ability to efficiently take up silicon may be considered either a positive or a negative trait in rice. Germanium is an analogue of silicon that produces brown lesions in shoots and leaves, and germanium toxicity has been used to identify mutants in silicon and arsenic transport. In this study, two different genetic mapping methods were performed to determine the loci involved in germanium sensitivity in rice. Genetic mapping in the biparental cross of Bala × Azucena (an F6 population and a genome wide association (GWA study with 350 accessions from the Rice Diversity Panel 1 were conducted using 15 μM of germanic acid. This identified a number of germanium sensitive loci: some co-localised with previously identified quantitative trait loci (QTL for tissue silicon or arsenic concentration, none co-localised with Lsi1 or Lsi6, while one single nucleotide polymorphism (SNP was detected within 200 kb of Lsi2 (these are genes known to transport silicon, whose identity was discovered using germanium toxicity. However, examining candidate genes that are within the genomic region of the loci detected above reveals genes homologous to both Lsi1 and Lsi2, as well as a number of other candidate genes, which are discussed.
Full Text Available In Brassica napus breeding, traits related to commercial success are of highest importance for plant breeders. However, such traits can only be assessed in an advanced developmental stage. % as well as require high experimental effort due to their quantitative inheritance and the importance of genotype*environment interaction. Molecular markers genetically linked to such traits have the potential to accelerate the breeding process of B. napus by marker-assisted selection. Therefore, the objectives of this study were to identify (i genome regions associated with the examined agronomic and seed quality traits, (ii the interrelationship of population structure and the detected associations, and (iii candidate genes for the revealed associations. The diversity set used in this study consisted of 405 Brassica napus inbred lines which were genotyped using a 6K single nucleotide polymorphism (SNP array and phenotyped for agronomic and seed quality traits in field trials. In a genome-wide association study, we detected a total of 112 associations between SNPs and the seed quality traits as well as 46 SNP-trait associations for the agronomic traits with a P-value 100 and a sequence identity of > 70 % to A. thaliana or B. rapa could be found for the agronomic SNP-trait associations and 187 hits of potential candidate genes for the seed quality SNP-trait associations.
Full Text Available BACKGROUND: We describe SNPpy, a hybrid script database system using the Python SQLAlchemy library coupled with the PostgreSQL database to manage genotype data from Genome-Wide Association Studies (GWAS. This system makes it possible to merge study data with HapMap data and merge across studies for meta-analyses, including data filtering based on the values of phenotype and Single-Nucleotide Polymorphism (SNP data. SNPpy and its dependencies are open source software. RESULTS: The current version of SNPpy offers utility functions to import genotype and annotation data from two commercial platforms. We use these to import data from two GWAS studies and the HapMap Project. We then export these individual datasets to standard data format files that can be imported into statistical software for downstream analyses. CONCLUSIONS: By leveraging the power of relational databases, SNPpy offers integrated management and manipulation of genotype and phenotype data from GWAS studies. The analysis of these studies requires merging across GWAS datasets as well as patient and marker selection. To this end, SNPpy enables the user to filter the data and output the results as standardized GWAS file formats. It does low level and flexible data validation, including validation of patient data. SNPpy is a practical and extensible solution for investigators who seek to deploy central management of their GWAS data.
Sharma, Swarkar; Gao, Xiaochong; Londono, Douglas; Devroy, Shonn E.; Mauldin, Kristen N.; Frankel, Jessica T.; Brandon, January M.; Zhang, Dongping; Li, Quan-Zhen; Dobbs, Matthew B.; Gurnett, Christina A.; Grant, Struan F.A.; Hakonarson, Hakon; Dormans, John P.; Herring, John A.; Gordon, Derek; Wise, Carol A.
Adolescent idiopathic scoliosis (AIS) is an unexplained and common spinal deformity seen in otherwise healthy children. Its pathophysiology is poorly understood despite intensive investigation. Although genetic underpinnings are clear, replicated susceptibility loci that could provide insight into etiology have not been forthcoming. To address these issues, we performed genome-wide association studies (GWAS) of ∼327 000 single nucleotide polymorphisms (SNPs) in 419 AIS families. We found strongest evidence of association with chromosome 3p26.3 SNPs in the proximity of the CHL1 gene (P protein related to Robo3. Mutations in the Robo3 protein cause horizontal gaze palsy with progressive scoliosis (HGPPS), a rare disease marked by severe scoliosis. Other top associations in our GWAS were with SNPs in the DSCAM gene encoding an axon guidance protein in the same structural class with Chl1 and Robo3. We additionally found AIS associations with loci in CNTNAP2, supporting a previous study linking this gene with AIS. Cntnap2 is also of functional interest, as it interacts directly with L1 and Robo class proteins and participates in axon pathfinding. Our results suggest the relevance of axon guidance pathways in AIS susceptibility, although these findings require further study, particularly given the apparent genetic heterogeneity in this disease. PMID:21216876
Nivard, Michel G; Middeldorp, Christel M; Lubke, Gitta; Hottenga, Jouke-Jan; Abdellaoui, Abdel; Boomsma, Dorret I; Dolan, Conor V
Heritability may be estimated using phenotypic data collected in relatives or in distantly related individuals using genome-wide single nucleotide polymorphism (SNP) data. We combined these approaches by re-parameterizing the model proposed by Zaitlen et al and extended this model to include moderation of (total and SNP-based) genetic and environmental variance components by a measured moderator. By means of data simulation, we demonstrated that the type 1 error rates of the proposed test are correct and parameter estimates are accurate. As an application, we considered the moderation by age or year of birth of variance components associated with body mass index (BMI), height, attention problems (AP), and symptoms of anxiety and depression. The genetic variance of BMI was found to increase with age, but the environmental variance displayed a greater increase with age, resulting in a proportional decrease of the heritability of BMI. Environmental variance of height increased with year of birth. The environmental variance of AP increased with age. These results illustrate the assessment of moderation of environmental and genetic effects, when estimating heritability from combined SNP and family data. The assessment of moderation of genetic and environmental variance will enhance our understanding of the genetic architecture of complex traits. PMID:27436263
Full Text Available Abstract Pre-harvest sprouting (PHS is a major abiotic factor affecting grain weight and quality, and is caused by an early break in seed dormancy. Association mapping (AM is used to detect correlations between phenotypes and genotypes based on linkage disequilibrium (LD in wheat breeding programs. We evaluated seed dormancy in 80 Chinese wheat founder parents in five environments and performed a genome-wide association study using 6,057 markers, including 93 simple sequence repeat (SSR, 1,472 diversity array technology (DArT, and 4,492 single nucleotide polymorphism (SNP markers. The general linear model (GLM and the mixed linear model (MLM were used in this study, and two significant markers (tPt-7980 and wPt-6457 were identified. Both markers were located on Chromosome 1B, with wPt-6457 having been identified in a previously reported chromosomal position. The significantly associated loci contain essential information for cloning genes related to resistance to PHS and can be used in wheat breeding programs.
The single-nucleotide polymorphism (SNP) rs10503253, located within the CUB and Sushi multiple domains-1 (CSMD1) gene on 8p23.2, was recently identified as genome-wide significant for schizophrenia (SZ), but is of unknown function. We investigated the neurocognitive effects of this CSMD1 variant in vivo in patients and healthy participants using behavioral and imaging measures of brain structure and function. We compared carriers and non-carriers of the risk \\'A\\' allele on measures of neuropsychological performance typically impaired in SZ (general cognitive ability, episodic and working memory and attentional control) in independent samples of Irish patients (n = 387) and controls (n = 171) and German patients (205) and controls (n = 533). Across these groups, the risk \\'A\\' allele at CSMD1 was associated with deleterious effects across a number of neurocognitive phenotypes. Specifically, the risk allele was associated with poorer performance on neuropsychological measures of general cognitive ability and memory function but not attentional control. These effects, while significant, were subtle, and varied between samples. Consistent with previous evidence suggesting that CSMD1 may be involved in brain mechanisms related to memory and learning, these data appear to reflect the deleterious effects of the identified \\'A\\' risk allele on neurocognitive function, possibly as part of the mechanism by which CSMD1 is associated with SZ risk.
Rose, Emma J
The single nucleotide polymorphism rs10503253 within the CUB and Sushi multiple domains-1 (CSMD1) gene on 8p23.2 has been identified as genome-wide significant for schizophrenia (SZ). This gene is of unknown function but has been implicated in multiple neurodevelopmental disorders that impact upon cognition, leading us to hypothesize that an effect on brain structure and function underlying cognitive processes may be part of the mechanism by which CMSD1 increases illness risk. To test this hypothesis, we investigated this CSMD1 variant in vivo in healthy participants in a magnetic resonance imaging (MRI) study comprised of both fMRI of spatial working memory (N = 50) and a voxel-based morphometry investigation of grey and white matter (WM) volume (N = 150). Analyses of these data indicated that the risk "A" allele was associated with comparatively reduced cortical activations in BA18, that is, middle occipital gyrus and cuneus; posterior brain regions that support maintenance processes during performance of a spatial working memory task. Conversely, there was an absence of significant structural differences in brain volume (i.e., grey or WM). In accordance with previous evidence, these data suggest that CSMD1 may mediate brain function related to cognitive processes (i.e., executive function); with the relatively deleterious effects of the identified "A" risk allele on brain activity possibly constituting part of the mechanism by which CSMD1 increases schizophrenia risk.
Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10(-8). When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner\\'s curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10(-8) threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C.
Hong, Chang Bum; Kim, Young Jin; Moon, Sanghoon; Shin, Young-Ah; Go, Min Jin; Kim, Dong-Joon; Lee, Jong-Young; Cho, Yoon Shin
Recent advances in high-throughput genotyping technologies have enabled us to conduct a genome-wide association study (GWAS) on a large cohort. However, analyzing millions of single nucleotide polymorphisms (SNPs) is still a difficult task for researchers conducting a GWAS. Several difficulties such as compatibilities and dependencies are often encountered by researchers using analytical tools, during the installation of software. This is a huge obstacle to any research institute without computing facilities and specialists. Therefore, a proper research environment is an urgent need for researchers working on GWAS. We developed BioSMACK to provide a research environment for GWAS that requires no configuration and is easy to use. BioSMACK is based on the Ubuntu Live CD that offers a complete Linux-based operating system environment without installation. Moreover, we provide users with a GWAS manual consisting of a series of guidelines for GWAS and useful examples. BioSMACK is freely available at http://ksnp.cdc. go.kr/biosmack.
Lee, Sungyoung; Kwon, Min-Seok; Park, Taesung
Most common complex traits, such as obesity, hypertension, diabetes, and cancers, are known to be associated with multiple genes, environmental factors, and their epistasis. Recently, the development of advanced genotyping technologies has allowed us to perform genome-wide association studies (GWASs). For detecting the effects of multiple genes on complex traits, many approaches have been proposed for GWASs. Multifactor dimensionality reduction (MDR) is one of the powerful and efficient methods for detecting high-order gene-gene (GxG) interactions. However, the biological interpretation of GxG interactions identified by MDR analysis is not easy. In order to aid the interpretation of MDR results, we propose a network graph analysis to elucidate the meaning of identified GxG interactions. The proposed network graph analysis consists of three steps. The first step is for performing GxG interaction analysis using MDR analysis. The second step is to draw the network graph using the MDR result. The third step is to provide biological evidence of the identified GxG interaction using external biological databases. The proposed method was applied to Korean Association Resource (KARE) data, containing 8838 individuals with 327,632 single-nucleotide polymorphisms, in order to perform GxG interaction analysis of body mass index (BMI). Our network graph analysis successfully showed that many identified GxG interactions have known biological evidence related to BMI. We expect that our network graph analysis will be helpful to interpret the biological meaning of GxG interactions.
Jarislav von Zitzewitz
Full Text Available Winterhardiness is a complex trait that involves low temperature tolerance (LTT, vernalization sensitivity, and photoperiod sensitivity. Quantitative trait loci (QTL for these traits were first identified using biparental mapping populations; candidate genes for all loci have since been identified and characterized. In this research we used a set of 148 accessions consisting of advanced breeding lines from the Oregon barley ( L. subsp breeding program and selected cultivars that were extensively phenotyped and genotyped with single nucleotide polymorphisms. Using these data for genome-wide association mapping we detected the same QTL and genes that have been systematically characterized using biparental populations over nearly two decades of intensive research. In this sample of germplasm, maximum LTT can be achieved with facultative growth habit, which can be predicted using a three-locus haplotype involving , , and . The and LTT QTL explained 25% of the phenotypic variation, offering the prospect that additional gains from selection can be achieved once favorable alleles are fixed at these loci.
Widmer, Christian; Lippert, Christoph; Weissbrod, Omer; Fusi, Nicolo; Kadie, Carl; Davidson, Robert; Listgarten, Jennifer; Heckerman, David
We examine improvements to the linear mixed model (LMM) that better correct for population structure and family relatedness in genome-wide association studies (GWAS). LMMs rely on the estimation of a genetic similarity matrix (GSM), which encodes the pairwise similarity between every two individuals in a cohort. These similarities are estimated from single nucleotide polymorphisms (SNPs) or other genetic variants. Traditionally, all available SNPs are used to estimate the GSM. In empirical studies across a wide range of synthetic and real data, we find that modifications to this approach improve GWAS performance as measured by type I error control and power. Specifically, when only population structure is present, a GSM constructed from SNPs that well predict the phenotype in combination with principal components as covariates controls type I error and yields more power than the traditional LMM. In any setting, with or without population structure or family relatedness, a GSM consisting of a mixture of two component GSMs, one constructed from all SNPs and another constructed from SNPs that well predict the phenotype again controls type I error and yields more power than the traditional LMM. Software implementing these improvements and the experimental comparisons are available at http://microsoft.com/science.
Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R; Correia, Catarina; Abrahams, Brett S; Sykes, Nuala; Pagnamenta, Alistair T; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R; Casallo, Guillermo; Casey, Jillian; Chu, Su H; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A; Folstein, Susan E; Fombonne, Eric; Freitag, Christine M; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J; Hakonarson, Hakon; Heron, Elizabeth A; Hill, Matthew; Holt, Richard; Howe, Jennifer L; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M; Lamb, Janine A; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L; Lionel, Anath C; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R; McConachie, Helen; McDougle, Christopher J; McGrath, Jane; McMahon, William M; Melhem, Nadine M; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J; Mirza, Ghazala K; Munson, Jeff; Nelson, Stanley F; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L; Bierut, Laura J; Rice, John P; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D; Cantor, Rita M; Cook, Edwin H; Coon, Hilary; Cuccaro, Michael L; Gallagher, Louise; Geschwind, Daniel H; Gill, Michael; Haines, Jonathan L; Miller, Judith; Monaco, Anthony P; Nurnberger, John I; Paterson, Andrew D; Pericak-Vance, Margaret A; Schellenberg, Gerard D; Scherer, Stephen W; Sutcliffe, James S; Szatmari, Peter; Vicente, Astrid M; Vieland, Veronica J; Wijsman, Ellen M; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim
Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10(-8). When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10(-8) threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C.
Full Text Available Various approaches can be applied to uncover the genetic basis of natural phenotypic variation, each with their specific strengths and limitations. Here, we use a replicated genome-wide association approach (Pool-GWAS to fine-scale map genomic regions contributing to natural variation in female abdominal pigmentation in Drosophila melanogaster, a trait that is highly variable in natural populations and highly heritable in the laboratory. We examined abdominal pigmentation phenotypes in approximately 8000 female European D. melanogaster, isolating 1000 individuals with extreme phenotypes. We then used whole-genome Illumina sequencing to identify single nucleotide polymorphisms (SNPs segregating in our sample, and tested these for associations with pigmentation by contrasting allele frequencies between replicate pools of light and dark individuals. We identify two small regions near the pigmentation genes tan and bric-à-brac 1, both corresponding to known cis-regulatory regions, which contain SNPs showing significant associations with pigmentation variation. While the Pool-GWAS approach suffers some limitations, its cost advantage facilitates replication and it can be applied to any non-model system with an available reference genome.
Bastide, Héloïse; Betancourt, Andrea; Nolte, Viola; Tobler, Raymond; Stöbe, Petra; Futschik, Andreas; Schlötterer, Christian
Various approaches can be applied to uncover the genetic basis of natural phenotypic variation, each with their specific strengths and limitations. Here, we use a replicated genome-wide association approach (Pool-GWAS) to fine-scale map genomic regions contributing to natural variation in female abdominal pigmentation in Drosophila melanogaster, a trait that is highly variable in natural populations and highly heritable in the laboratory. We examined abdominal pigmentation phenotypes in approximately 8000 female European D. melanogaster, isolating 1000 individuals with extreme phenotypes. We then used whole-genome Illumina sequencing to identify single nucleotide polymorphisms (SNPs) segregating in our sample, and tested these for associations with pigmentation by contrasting allele frequencies between replicate pools of light and dark individuals. We identify two small regions near the pigmentation genes tan and bric-à-brac 1, both corresponding to known cis-regulatory regions, which contain SNPs showing significant associations with pigmentation variation. While the Pool-GWAS approach suffers some limitations, its cost advantage facilitates replication and it can be applied to any non-model system with an available reference genome.
Schaid, Daniel J; Sinnwell, Jason P; Jenkins, Gregory D; McDonnell, Shannon K; Ingle, James N; Kubo, Michiaki; Goss, Paul E; Costantino, Joseph P; Wickerham, D Lawrence; Weinshilboum, Richard M
Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc "fixes." To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted P-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. © 2011 Wiley Periodicals, Inc.
Ahmad, Mahmoud Al; Panicker, Neena G.; Rizvi, Tahir A.; Mustafa, Farah
High speed sequential identification of the building blocks of DNA, (deoxyribonucleotides or nucleotides for short) without labeling or processing in long reads of DNA is the need of the hour. This can be accomplished through exploiting their unique electrical properties. In this study, the four different types of nucleotides that constitute a DNA molecule were suspended in a buffer followed by performing several types of electrical measurements. These electrical parameters were then used to quantify the suspended DNA nucleotides. Thus, we present a purely electrical counting scheme based on the semiconductor theory that allows one to determine the number of nucleotides in a solution by measuring their capacitance-voltage dependency. The nucleotide count was observed to be similar to the multiplication of the corresponding dopant concentration and debye volume after de-embedding the buffer contribution. The presented approach allows for a fast and label-free quantification of single and mixed nucleotides in a solution.
Sreekumar G Pillai
Full Text Available There is considerable variability in the susceptibility of smokers to develop chronic obstructive pulmonary disease (COPD. The only known genetic risk factor is severe deficiency of alpha(1-antitrypsin, which is present in 1-2% of individuals with COPD. We conducted a genome-wide association study (GWAS in a homogenous case-control cohort from Bergen, Norway (823 COPD cases and 810 smoking controls and evaluated the top 100 single nucleotide polymorphisms (SNPs in the family-based International COPD Genetics Network (ICGN; 1891 Caucasian individuals from 606 pedigrees study. The polymorphisms that showed replication were further evaluated in 389 subjects from the US National Emphysema Treatment Trial (NETT and 472 controls from the Normative Aging Study (NAS and then in a fourth cohort of 949 individuals from 127 extended pedigrees from the Boston Early-Onset COPD population. Logistic regression models with adjustments of covariates were used to analyze the case-control populations. Family-based association analyses were conducted for a diagnosis of COPD and lung function in the family populations. Two SNPs at the alpha-nicotinic acetylcholine receptor (CHRNA 3/5 locus were identified in the genome-wide association study. They showed unambiguous replication in the ICGN family-based analysis and in the NETT case-control analysis with combined p-values of 1.48 x 10(-10, (rs8034191 and 5.74 x 10(-10 (rs1051730. Furthermore, these SNPs were significantly associated with lung function in both the ICGN and Boston Early-Onset COPD populations. The C allele of the rs8034191 SNP was estimated to have a population attributable risk for COPD of 12.2%. The association of hedgehog interacting protein (HHIP locus on chromosome 4 was also consistently replicated, but did not reach genome-wide significance levels. Genome-wide significant association of the HHIP locus with lung function was identified in the Framingham Heart study (Wilk et al., companion article
Dehghan, Abbas; Köttgen, Anna; Yang, Qiong; Hwang, Shih-Jen; Kao, Wh Linda; Rivadeneira, Fernando; Boerwinkle, Eric; Levy, Daniel; Hofman, Albert; Astor, Brad C; Benjamin, Emelia J; van Duijn, Cornelia M; Witteman, Jacqueline C; Coresh, Josef; Fox, Caroline S
Hyperuricaemia, a highly heritable trait, is a key risk factor for gout. We aimed to identify novel genes associated with serum uric acid concentration and gout. Genome-wide association studies were done for serum uric acid in 7699 participants in the Framingham cohort and in 4148 participants in the Rotterdam cohort. Genome-wide significant single nucleotide polymorphisms (SNPs) were replicated in white (n=11 024) and black (n=3843) individuals who took part in the study of Atherosclerosis Risk in Communities (ARIC). The SNPs that reached genome-wide significant association with uric acid in either the Framingham cohort (pgout. The results obtained in white participants were combined using meta-analysis. Three loci in the Framingham cohort and two in the Rotterdam cohort showed genome-wide association with uric acid. Top SNPs in each locus were: missense rs16890979 in SLC2A9 (p=7.0 x 10(-168) and 2.9 x 10(-18) for white and black participants, respectively); missense rs2231142 in ABCG2 (p=2.5 x 10(-60) and 9.8 x 10(-4)), and rs1165205 in SLC17A3 (p=3.3 x 10(-26) and 0.33). All SNPs were direction-consistent with gout in white participants: rs16890979 (OR 0.59 per T allele, 95% CI 0.52-0.68, p=7.0 x 10(-14)), rs2231142 (1.74, 1.51-1.99, p=3.3 x 10(-15)), and rs1165205 (0.85, 0.77-0.94, p=0.002). In black participants of the ARIC study, rs2231142 was direction-consistent with gout (1.71, 1.06-2.77, p=0.028). An additive genetic risk score of high-risk alleles at the three loci showed graded associations with uric acid (272-351 mumol/L in the Framingham cohort, 269-386 mumol/L in the Rotterdam cohort, and 303-426 mumol/L in white participants of the ARIC study) and gout (frequency 2-13% in the Framingham cohort, 2-8% in the Rotterdam cohort, and 1-18% in white participants in the ARIC study). We identified three genetic loci associated with uric acid concentration and gout. A score based on genes with a putative role in renal urate handling showed a substantial risk
de Moor, Marleen H.M.; van den Berg, Stéphanie M.; Verweij, Karin J.H.; Krueger, Robert F.; Luciano, Michelle; Vasquez, Alejandro Arias; Matteson, Lindsay K.; Derringer, Jaime; Esko, Tõnu; Amin, Najaf; Gordon, Scott D.; Hansell, Narelle K.; Hart, Amy B.; Seppälä, Ilkka; Huffman, Jennifer E.; Konte, Bettina; Lahti, Jari; Lee, Minyoung; Miller, Mike; Nutile, Teresa; Tanaka, Toshiko; Teumer, Alexander; Viktorin, Alexander; Wedenoja, Juho; Abecasis, Goncalo R.; Adkins, Daniel E.; Agrawal, Arpana; Allik, Jüri; Appel, Katja; Bigdeli, Timothy B.; Busonero, Fabio; Campbell, Harry; Costa, Paul T.; Smith, George Davey; Davies, Gail; de Wit, Harriet; Ding, Jun; Engelhardt, Barbara E.; Eriksson, Johan G.; Fedko, Iryna O.; Ferrucci, Luigi; Franke, Barbara; Giegling, Ina; Grucza, Richard; Hartmann, Annette M.; Heath, Andrew C.; Heinonen, Kati; Henders, Anjali K.; Homuth, Georg; Hottenga, Jouke-Jan; Janzing, Joost; Jokela, Markus; Karlsson, Robert; Kemp, John P.; Kirkpatrick, Matthew G.; Latvala, Antti; Lehtimäki, Terho; Liewald, David C.; Madden, Pamela A.F.; Magri, Chiara; Magnusson, Patrik K.E.; Marten, Jonathan; Maschio, Andrea; Medland, Sarah E.; Mihailov, Evelin; Milaneschi, Yuri; Montgomery, Grant W.; Nauck, Matthias; Ouwens, Klaasjan G.; Palotie, Aarno; Pettersson, Erik; Polasek, Ozren; Qian, Yong; Pulkki-Råback, Laura; Raitakari, Olli T.; Realo, Anu; Rose, Richard J.; Ruggiero, Daniela; Schmidt, Carsten O.; Slutske, Wendy S.; Sorice, Rossella; Starr, John M.; Pourcain, Beate St; Sutin, Angelina R.; Timpson, Nicholas J.; Trochet, Holly; Vermeulen, Sita; Vuoksimaa, Eero; Widen, Elisabeth; Wouda, Jasper; Wright, Margaret J.; Zgaga, Lina; Scotland, Generation; Porteous, David; Minelli, Alessandra; Palmer, Abraham A.; Rujescu, Dan; Ciullo, Marina; Hayward, Caroline; Rudan, Igor; Metspalu, Andres; Kaprio, Jaakko; Deary, Ian J.; Räikkönen, Katri; Wilson, James F.; Keltikangas-Järvinen, Liisa; Bierut, Laura J.; Hettema, John M.; Grabe, Hans J.; van Duijn, Cornelia M.; Evans, David M.; Schlessinger, David; Pedersen, Nancy L.; Terracciano, Antonio; McGue, Matt; Penninx, Brenda W.J.H.; Martin, Nicholas G.; Boomsma, Dorret I.
Importance Neuroticism is a personality trait that is briefly defined by emotional instability. It is a robust genetic risk factor for Major Depressive Disorder (MDD) and other psychiatric disorders. Hence, neuroticism is an important phenotype for psychiatric genetics. The Genetics of Personality Consortium (GPC) has created a resource for genome-wide association analyses of personality traits in over 63,000 participants (including MDD cases). Objective To identify genetic variants associated with neuroticism by performing a meta-analysis of genome-wide association (GWA) results based on 1000Genomes imputation, to evaluate if common genetic variants as assessed by Single Nucleotide Polymorphisms (SNPs) explain variation in neuroticism by estimating SNP-based heritability, and to examine whether SNPs that predict neuroticism also predict MDD. Setting 30 cohorts with genome-wide genotype, personality and MDD data from the GPC. Participants The study included 63,661 participants from 29 discovery cohorts and 9,786 participants from a replication cohort. Participants came from Europe, the United States or Australia. Main outcome measure(s) Neuroticism scores harmonized across all cohorts by Item Response Theory (IRT) analysis, and clinically assessed MDD case-control status. Results A genome-wide significant SNP was found in the MAGI1 gene (rs35855737; P=9.26 × 10−9 in the discovery meta-analysis, and P=2.38 × 10−8 in the meta-analysis of all 30 cohorts). Common genetic variants explain 15% of the variance in neuroticism. Polygenic scores based on the meta-analysis of neuroticism in 27 of the discovery cohorts significantly predicted neuroticism in 2 independent cohorts. Importantly, polygenic scores also predicted MDD in these cohorts. Conclusions and relevance This study identifies a novel locus for neuroticism. The variant is located in a known gene that has been associated with bipolar disorder and schizophrenia in previous studies. In addition, the study
Perera, Minoli A; Cavallari, Larisa H; Limdi, Nita A; Gamazon, Eric R; Konkashbaev, Anuar; Daneshjou, Roxana; Pluzhnikov, Anna; Crawford, Dana C; Wang, Jelai; Liu, Nianjun; Tatonetti, Nicholas; Bourgeois, Stephane; Takahashi, Harumi; Bradford, Yukiko; Burkley, Benjamin M; Desnick, Robert J; Halperin, Jonathan L; Khalifa, Sherief I; Langaee, Taimour Y; Lubitz, Steven A; Nutescu, Edith A; Oetjens, Matthew; Shahin, Mohamed H; Patel, Shitalben R; Sagreiya, Hersh; Tector, Matthew; Weck, Karen E; Rieder, Mark J; Scott, Stuart A; Wu, Alan HB; Burmester, James K; Wadelius, Mia; Deloukas, Panos; Wagner, Michael J; Mushiroda, Taisei; Kubo, Michiaki; Roden, Dan M; Cox, Nancy J; Altman, Russ B; Klein, Teri E; Nakamura, Yusuke; Johnson, Julie A
Summary Background VKORC1 and CYP2C9 are important contributors to warfarin dose variability, but explain less variability for individuals of African descent than for those of European or Asian descent. We aimed to identify additional variants contributing to warfarin dose requirements in African Americans. Methods We did a genome-wide association study of discovery and replication cohorts. Samples from African-American adults (aged ≥18 years) who were taking a stable maintenance dose of warfarin were obtained at International Warfarin Pharmacogenetics Consortium (IWPC) sites and the University of Alabama at Birmingham (Birmingham, AL, USA). Patients enrolled at IWPC sites but who were not used for discovery made up the independent replication cohort. All participants were genotyped. We did a stepwise conditional analysis, conditioning first for VKORC1 −1639G→A, followed by the composite genotype of CYP2C9*2 and CYP2C9*3. We prespecified a genome-wide significance threshold of p<5×10−8 in the discovery cohort and p<0·0038 in the replication cohort. Findings The discovery cohort contained 533 participants and the replication cohort 432 participants. After the prespecified conditioning in the discovery cohort, we identified an association between a novel single nucleotide polymorphism in the CYP2C cluster on chromosome 10 (rs12777823) and warfarin dose requirement that reached genome-wide significance (p=1·51×10−8). This association was confirmed in the replication cohort (p=5·04×10−5); analysis of the two cohorts together produced a p value of 4·5×10−12. Individuals heterozygous for the rs12777823 A allele need a dose reduction of 6·92 mg/week and those homozygous 9·34 mg/week. Regression analysis showed that the inclusion of rs12777823 significantly improves warfarin dose variability explained by the IWPC dosing algorithm (21% relative improvement). Interpretation A novel CYP2C single nucleotide polymorphism exerts a clinically relevant
Full Text Available Next-generation sequencing and the collection of genome-wide single-nucleotide polymorphisms (SNPs allow identifying fine-scale population genetic structure and genomic regions under selection. The spotted sea bass (Lateolabrax maculatus is a non-model species of ecological and commercial importance and widely distributed in northwestern Pacific. A total of 22 648 SNPs was discovered across the genome of L. maculatus by paired-end sequencing of restriction-site associated DNA (RAD-PE for 30 individuals from two populations. The nucleotide diversity (π for each population was 0.0028±0.0001 in Dandong and 0.0018±0.0001 in Beihai, respectively. Shallow but significant genetic differentiation was detected between the two populations analyzed by using both the whole data set (FST = 0.0550, P < 0.001 and the putatively neutral SNPs (FST = 0.0347, P < 0.001. However, the two populations were highly differentiated based on the putatively adaptive SNPs (FST = 0.6929, P < 0.001. Moreover, a total of 356 SNPs representing 298 unique loci were detected as outliers putatively under divergent selection by FST-based outlier tests as implemented in BAYESCAN and LOSITAN. Functional annotation of the contigs containing putatively adaptive SNPs yielded hits for 22 of 55 (40% significant BLASTX matches. Candidate genes for local selection constituted a wide array of functions, including binding, catalytic and metabolic activities, etc. The analyses with the SNPs developed in the present study highlighted the importance of genome-wide genetic variation for inference of population structure and local adaptation in L. maculatus.
Power, Robert A; Tansey, Katherine E; Buttenschøn, Henriette Nørmølle; Cohen-Woods, Sarah; Bigdeli, Tim; Hall, Lynsey S; Kutalik, Zoltán; Lee, S Hong; Ripke, Stephan; Steinberg, Stacy; Teumer, Alexander; Viktorin, Alexander; Wray, Naomi R; Arolt, Volker; Baune, Bernard T; Boomsma, Dorret I; Børglum, Anders D; Byrne, Enda M; Castelao, Enrique; Craddock, Nick; Craig, Ian W; Dannlowski, Udo; Deary, Ian J; Degenhardt, Franziska; Forstner, Andreas J; Gordon, Scott D; Grabe, Hans J; Grove, Jakob; Hamilton, Steven P; Hayward, Caroline; Heath, Andrew C; Hocking, Lynne J; Homuth, Georg; Hottenga, Jouke J; Kloiber, Stefan; Krogh, Jesper; Landén, Mikael; Lang, Maren; Levinson, Douglas F; Lichtenstein, Paul; Lucae, Susanne; MacIntyre, Donald J; Madden, Pamela; Magnusson, Patrik K E; Martin, Nicholas G; McIntosh, Andrew M; Middeldorp, Christel M; Milaneschi, Yuri; Montgomery, Grant W; Mors, Ole; Müller-Myhsok, Bertram; Nyholt, Dale R; Oskarsson, Hogni; Owen, Michael J; Padmanabhan, Sandosh; Penninx, Brenda W J H; Pergadia, Michele L; Porteous, David J; Potash, James B; Preisig, Martin; Rivera, Margarita; Shi, Jianxin; Shyn, Stanley I; Sigurdsson, Engilbert; Smit, Johannes H; Smith, Blair H; Stefansson, Hreinn; Stefansson, Kari; Strohmaier, Jana; Sullivan, Patrick F; Thomson, Pippa; Thorgeirsson, Thorgeir E; Van der Auwera, Sandra; Weissman, Myrna M; Breen, Gerome; Lewis, Cathryn M
Major depressive disorder (MDD) is a disabling mood disorder, and despite a known heritable component, a large meta-analysis of genome-wide association studies revealed no replicable genetic risk variants. Given prior evidence of heterogeneity by age at onset in MDD, we tested whether genome-wide significant risk variants for MDD could be identified in cases subdivided by age at onset. Discovery case-control genome-wide association studies were performed where cases were stratified using increasing/decreasing age-at-onset cutoffs; significant single nucleotide polymorphisms were tested in nine independent replication samples, giving a total sample of 22,158 cases and 133,749 control subjects for subsetting. Polygenic score analysis was used to examine whether differences in shared genetic risk exists between earlier and adult-onset MDD with commonly comorbid disorders of schizophrenia, bipolar disorder, Alzheimer's disease, and coronary artery disease. We identified one replicated genome-wide significant locus associated with adult-onset (>27 years) MDD (rs7647854, odds ratio: 1.16, 95% confidence interval: 1.11-1.21, p = 5.2 × 10 -11 ). Using polygenic score analyses, we show that earlier-onset MDD is genetically more similar to schizophrenia and bipolar disorder than adult-onset MDD. We demonstrate that using additional phenotype data previously collected by genetic studies to tackle phenotypic heterogeneity in MDD can successfully lead to the discovery of genetic risk factor despite reduced sample size. Furthermore, our results suggest that the genetic susceptibility to MDD differs between adult- and earlier-onset MDD, with earlier-onset cases having a greater genetic overlap with schizophrenia and bipolar disorder. Copyright © 2016 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
García-Sanz, Ramón; Corchete, Luis Antonio; Alcoceba, Miguel; Chillon, María Carmen; Jiménez, Cristina; Prieto, Isabel; García-Álvarez, María; Puig, Noemi; Rapado, Immaculada; Barrio, Santiago; Oriol, Albert; Blanchard, María Jesús; de la Rubia, Javier; Martínez, Rafael; Lahuerta, Juan José; González Díaz, Marcos; Mateos, María Victoria; San Miguel, Jesús Fernando; Martínez-López, Joaquín; Sarasquete, María Eugenia
Bortezomib- and thalidomide-based therapies have significantly contributed to improved survival of multiple myeloma (MM) patients. However, treatment-induced peripheral neuropathy (TiPN) is a common adverse event associated with them. Risk factors for TiPN in MM patients include advanced age, prior neuropathy, and other drugs, but there are conflicting results about the role of genetics in predicting the risk of TiPN. Thus, we carried out a genome-wide association study based on more than 300 000 exome single nucleotide polymorphisms in 172 MM patients receiving therapy involving bortezomib and thalidomide. We compared patients developing and not developing TiPN under similar treatment conditions (GEM05MAS65, NCT00443235). The highest-ranking single nucleotide polymorphism was rs45443101, located in the PLCG2 gene, but no significant differences were found after multiple comparison correction (adjusted P = .1708). Prediction analyses, cytoband enrichment, and pathway analyses were also performed, but none yielded any significant findings. A copy number approach was also explored, but this gave no significant results either. In summary, our study did not find a consistent genetic component associated with TiPN under bortezomib and thalidomide therapies that could be used for prediction, which makes clinical judgment essential in the practical management of MM treatment. Copyright © 2016 John Wiley & Sons, Ltd.
Brütting, Christine; Emmer, Alexander; Kornhuber, Malte; Staege, Martin S
Although multiple sclerosis (MS) is one of the most common central nervous system diseases in young adults, little is known about its etiology. Several human endogenous retroviruses (ERVs) are considered to play a role in MS. We are interested in which ERVs can be identified in the vicinity of MS associated genetic marker to find potential initiators of MS. We analysed the chromosomal regions surrounding 58 single nucleotide polymorphisms (SNPs) that are associated with MS identified in one of the last major genome wide association studies. We scanned these regions for putative endogenous retrovirus sequences with large open reading frames (ORFs). We observed that more retrovirus-related putative ORFs exist in the relatively close vicinity of SNP marker indices in multiple sclerosis compared to control SNPs. We found very high homologies to HERV-K, HCML-ARV, XMRV, Galidia ERV, HERV-H/env62 and XMRV-like mouse endogenous retrovirus mERV-XL. The associated genes (CYP27B1, CD6, CD58, MPV17L2, IL12RB1, CXCR5, PTGER4, TAGAP, TYK2, ICAM3, CD86, GALC, GPR65 as well as the HLA DRB1*1501) are mainly involved in the immune system, but also in vitamin D regulation. The most frequently detected ERV sequences are related to the multiple sclerosis-associated retrovirus, the human immunodeficiency virus 1, HERV-K, and the Simian foamy virus. Our data shows that there is a relation between MS associated SNPs and the number of retroviral elements compared to control. Our data identifies new ERV sequences that have not been associated with MS, so far.
Park, Hae Jeong; Lee, Soojung; Ju, Eunji; Jones, Jayre A; Choi, Inyeong
Genome-wide association studies have identified the single nucleotide polymorphism (SNP) rs3278 in the human SLC4A7 gene as one of the marker loci for addiction vulnerability. This marker is located in an intron of the gene, and its genomic role has been unknown. In this study, we examined rs3278 and three adjacent SNPs prevalent in alcoholics for their effects on an alternative promoter that would lead to the production of the NH 2 -terminally truncated protein NBCn1ΔN450, missing the first 450 amino acids. Analysis of the transcription start site database and a promoter prediction algorithm identified a cluster of three promoters in intron 7 and two short CpG-rich sites in intron 6. The promoter closest to rs3278 showed strong transcription activity in luciferase reporter gene assays. Major-to-minor allele substitution at rs3278 resulted in increased transcription activity. Equivalent substitutions at adjacent rs3772723 (intron 7) and rs13077400 (exon 8) had negligible effect; however, the substitution at nonsynonymous rs3755652 (exon 8) increased the activity by more than twofold. The concomitant substitution at rs3278/rs3755652 produced an additive effect. The rs3755652 had more profound effects on the promoter than the upstream regulatory CpG sites. The amino acid change E326K caused by rs3755652 had negligible effect on transporter function. In HEK 293 cells, NBCn1ΔN450 was expressed in plasma membranes, but at significantly lower levels than the nontruncated NBCn1-E. The pH change mediated by NBCn1ΔN450 was also low. We conclude that rs3278 and rs3755652 stimulate an alternative transcription of the SLC4A7 gene, increasing the production of a defective transporter. Copyright © 2017 the American Physiological Society.
Hong, Joon Ki; Jeong, Yong Dae; Cho, Eun Seok; Choi, Tae Jeong; Kim, Yong Min; Cho, Kyu Ho; Lee, Jae Bong; Lim, Hyun Tae; Lee, Deuk Hwan
The genetic effects of an individual on the phenotypes of its social partners, such as its pen mates, are known as social genetic effects. This study aims to identify the candidate genes for social (pen-mates') average daily gain (ADG) in pigs by using the genome-wide association approach. Social ADG (sADG) was the average ADG of unrelated pen-mates (strangers). We used the phenotype data (16,802 records) after correcting for batch (week), sex, pen, number of strangers (1 to 7 pigs) in the pen, full-sib rate (0% to 80%) within pen, and age at the end of the test. A total of 1,041 pigs from Landrace breeds were genotyped using the Illumina PorcineSNP60 v2 BeadChip panel, which comprised 61,565 single nucleotide polymorphism (SNP) markers. After quality control, 909 individuals and 39,837 markers remained for sADG in genome-wide association study. We detected five new SNPs, all on chromosome 6, which have not been associated with social ADG or other growth traits to date. One SNP was inside the prostaglandin F2α receptor ( PTGFR ) gene, another SNP was located 22 kb upstream of gene interferon-induced protein 44 ( IFI44 ), and the last three SNPs were between 161 kb and 191 kb upstream of the EGF latrophilin and seven transmembrane domain-containing protein 1 ( ELTD1 ) gene. PTGFR, IFI44, and ELTD1 were never associated with social interaction and social genetic effects in any of the previous studies. The identification of several genomic regions, and candidate genes associated with social genetic effects reported here, could contribute to a better understanding of the genetic basis of interaction traits for ADG. In conclusion, we suggest that the PTGFR, IFI44, and ELTD1 may be used as a molecular marker for sADG, although their functional effect was not defined yet. Thus, it will be of interest to execute association studies in those genes.
Tielbeek, Jorim J; Johansson, Ada; Polderman, Tinca J C; Rautiainen, Marja-Riitta; Jansen, Philip; Taylor, Michelle; Tong, Xiaoran; Lu, Qing; Burt, Alexandra S; Tiemeier, Henning; Viding, Essi; Plomin, Robert; Martin, Nicholas G; Heath, Andrew C; Madden, Pamela A F; Montgomery, Grant; Beaver, Kevin M; Waldman, Irwin; Gelernter, Joel; Kranzler, Henry R; Farrer, Lindsay A; Perry, John R B; Munafò, Marcus; LoParo, Devon; Paunio, Tiina; Tiihonen, Jari; Mous, Sabine E; Pappa, Irene; de Leeuw, Christiaan; Watanabe, Kyoko; Hammerschlag, Anke R; Salvatore, Jessica E; Aliev, Fazil; Bigdeli, Tim B; Dick, Danielle; Faraone, Stephen V; Popma, Arne; Medland, Sarah E; Posthuma, Danielle
Antisocial behavior (ASB) places a large burden on perpetrators, survivors, and society. Twin studies indicate that half of the variation in this trait is genetic. Specific causal genetic variants have, however, not been identified. To estimate the single-nucleotide polymorphism-based heritability of ASB; to identify novel genetic risk variants, genes, or biological pathways; to test for pleiotropic associations with other psychiatric traits; and to reevaluate the candidate gene era data through the Broad Antisocial Behavior Consortium. Genome-wide association data from 5 large population-based cohorts and 3 target samples with genome-wide genotype and ASB data were used for meta-analysis from March 1, 2014, to May 1, 2016. All data sets used quantitative phenotypes, except for the Finnish Crime Study, which applied a case-control design (370 patients and 5850 control individuals). This study adopted relatively broad inclusion criteria to achieve a quantitative measure of ASB derived from multiple measures, maximizing the sample size over different age ranges. The discovery samples comprised 16 400 individuals, whereas the target samples consisted of 9381 individuals (all individuals were of European descent), including child and adult samples (mean age range, 6.7-56.1 years). Three promising loci with sex-discordant associations were found (8535 female individuals, chromosome 1: rs2764450, chromosome 11: rs11215217; 7772 male individuals, chromosome X, rs41456347). Polygenic risk score analyses showed prognostication of antisocial phenotypes in an independent Finnish Crime Study (2536 male individuals and 3684 female individuals) and shared genetic origin with conduct problems in a population-based sample (394 male individuals and 431 female individuals) but not with conduct disorder in a substance-dependent sample (950 male individuals and 1386 female individuals) (R2 = 0.0017 in the most optimal model, P = 0.03). Significant inverse genetic correlation
Ferrari, Raffaele; Hernandez, Dena G; Nalls, Michael A; Rohrer, Jonathan D; Ramasamy, Adaikalavan; Kwok, John B J; Dobson-Stone, Carol; Brooks, William S; Schofield, Peter R; Halliday, Glenda M; Hodges, John R; Piguet, Olivier; Bartley, Lauren; Thompson, Elizabeth; Haan, Eric; Hernández, Isabel; Ruiz, Agustín; Boada, Mercè; Borroni, Barbara; Padovani, Alessandro; Cruchaga, Carlos; Cairns, Nigel J; Benussi, Luisa; Binetti, Giuliano; Ghidoni, Roberta; Forloni, Gianluigi; Galimberti, Daniela; Fenoglio, Chiara; Serpente, Maria; Scarpini, Elio; Clarimón, Jordi; Lleó, Alberto; Blesa, Rafael; Waldö, Maria Landqvist; Nilsson, Karin; Nilsson, Christer; Mackenzie, Ian R A; Hsiung, Ging-Yuek R; Mann, David M A; Grafman, Jordan; Morris, Christopher M; Attems, Johannes; Griffiths, Timothy D; McKeith, Ian G; Thomas, Alan J; Pietrini, P; Huey, Edward D; Wassermann, Eric M; Baborie, Atik; Jaros, Evelyn; Tierney, Michael C; Pastor, Pau; Razquin, Cristina; Ortega-Cubero, Sara; Alonso, Elena; Perneczky, Robert; Diehl-Schmid, Janine; Alexopoulos, Panagiotis; Kurz, Alexander; Rainero, Innocenzo; Rubino, Elisa; Pinessi, Lorenzo; Rogaeva, Ekaterina; George-Hyslop, Peter St; Rossi, Giacomina; Tagliavini, Fabrizio; Giaccone, Giorgio; Rowe, James B; Schlachetzki, J C M; Uphill, James; Collinge, John; Mead, S; Danek, Adrian; Van Deerlin, Vivianna M; Grossman, Murray; Trojanowsk, John Q; van der Zee, Julie; Deschamps, William; Van Langenhove, Tim; Cruts, Marc; Van Broeckhoven, Christine; Cappa, Stefano F; Le Ber, Isabelle; Hannequin, Didier; Golfier, Véronique; Vercelletto, Martine; Brice, Alexis; Nacmias, Benedetta; Sorbi, Sandro; Bagnoli, Silvia; Piaceri, Irene; Nielsen, Jørgen E; Hjermind, Lena E; Riemenschneider, Matthias; Mayhaus, Manuel; Ibach, Bernd; Gasparoni, Gilles; Pichler, Sabrina; Gu, Wei; Rossor, Martin N; Fox, Nick C; Warren, Jason D; Spillantini, Maria Grazia; Morris, Huw R; Rizzu, Patrizia; Heutink, Peter; Snowden, Julie S; Rollinson, Sara; Richardson, Anna; Gerhard, Alexander; Bruni, Amalia C; Maletta, Raffaele; Frangipane, Francesca; Cupidi, Chiara; Bernardi, Livia; Anfossi, Maria; Gallo, Maura; Conidi, Maria Elena; Smirne, Nicoletta; Rademakers, Rosa; Baker, Matt; Dickson, Dennis W; Graff-Radford, Neill R; Petersen, Ronald C; Knopman, David; Josephs, Keith A; Boeve, Bradley F; Parisi, Joseph E; Seeley, William W; Miller, Bruce L; Karydas, Anna M; Rosen, Howard; van Swieten, John C; Dopper, Elise G P; Seelaar, Harro; Pijnenburg, Yolande AL; Scheltens, Philip; Logroscino, Giancarlo; Capozzo, Rosa; Novelli, Valeria; Puca, Annibale A; Franceschi, M; Postiglione, Alfredo; Milan, Graziella; Sorrentino, Paolo; Kristiansen, Mark; Chiang, Huei-Hsin; Graff, Caroline; Pasquier, Florence; Rollin, Adeline; Deramecourt, Vincent; Lebert, Florence; Kapogiannis, Dimitrios; Ferrucci, Luigi; Pickering-Brown, Stuart; Singleton, Andrew B; Hardy, John; Momeni, Parastoo
Summary Background Frontotemporal dementia (FTD) is a complex disorder characterised by a broad range of clinical manifestations, differential pathological signatures, and genetic variability. Mutations in three genes—MAPT, GRN, and C9orf72—have been associated with FTD. We sought to identify novel genetic risk loci associated with the disorder. Methods We did a two-stage genome-wide association study on clinical FTD, analysing samples from 3526 patients with FTD and 9402 healthy controls. All participants had European ancestry. In the discovery phase (samples from 2154 patients with FTD and 4308 controls), we did separate association analyses for each FTD subtype (behavioural variant FTD, semantic dementia, progressive non-fluent aphasia, and FTD overlapping with motor neuron disease [FTD-MND]), followed by a meta-analysis of the entire dataset. We carried forward replication of the novel suggestive loci in an independent sample series (samples from 1372 patients and 5094 controls) and then did joint phase and brain expression and methylation quantitative trait loci analyses for the associated (p<5 × 10−8) and suggestive single-nucleotide polymorphisms. Findings We identified novel associations exceeding the genome-wide significance threshold (p<5 × 10−8) that encompassed the HLA locus at 6p21.3 in the entire cohort. We also identified a potential novel locus at 11q14, encompassing RAB38/CTSC, for the behavioural FTD subtype. Analysis of expression and methylation quantitative trait loci data suggested that these loci might affect expression and methylation incis. Interpretation Our findings suggest that immune system processes (link to 6p21.3) and possibly lysosomal and autophagy pathways (link to 11q14) are potentially involved in FTD. Our findings need to be replicated to better define the association of the newly identified loci with disease and possibly to shed light on the pathomechanisms contributing to FTD. Funding The National Institute of
Christine E McLaren
Full Text Available The existence of multiple inherited disorders of iron metabolism in man, rodents and other vertebrates suggests genetic contributions to iron deficiency. To identify new genomic locations associated with iron deficiency, a genome-wide association study (GWAS was performed using DNA collected from white men aged≥25 y and women≥50 y in the Hemochromatosis and Iron Overload Screening (HEIRS Study with serum ferritin (SF≤12 µg/L (cases and iron replete controls (SF>100 µg/L in men, SF>50 µg/L in women. Regression analysis was used to examine the association between case-control status (336 cases, 343 controls and quantitative serum iron measures and 331,060 single nucleotide polymorphism (SNP genotypes, with replication analyses performed in a sample of 71 cases and 161 controls from a population of white male and female veterans screened at a US Veterans Affairs (VA medical center. Five SNPs identified in the GWAS met genome-wide statistical significance for association with at least one iron measure, rs2698530 on chr. 2p14; rs3811647 on chr. 3q22, a known SNP in the transferrin (TF gene region; rs1800562 on chr. 6p22, the C282Y mutation in the HFE gene; rs7787204 on chr. 7p21; and rs987710 on chr. 22q11 (GWAS observed P<1.51×10(-7 for all. An association between total iron binding capacity and SNP rs3811647 in the TF gene (GWAS observed P=7.0×10(-9, corrected P=0.012 was replicated within the VA samples (observed P=0.012. Associations with the C282Y mutation in the HFE gene also were replicated. The joint analysis of the HEIRS and VA samples revealed strong associations between rs2698530 on chr. 2p14 and iron status outcomes. These results confirm a previously-described TF polymorphism and implicate one potential new locus as a target for gene identification.
O'Brien, Katie M; Sandler, Dale P; Shi, Min; Harmon, Quaker E; Taylor, Jack A; Weinberg, Clarice R
Genetic factors likely influence individuals' concentrations of 25-hydroxyvitamin D [25(OH)D], a biomarker of vitamin D exposure previously linked to reduced risk of several chronic diseases. We conducted a genome-wide association study of serum 25(OH)D (assessed using liquid chromatography-tandem mass spectrometry) and 386,449 single nucleotide polymorphisms (SNPs). Our sample consisted of 1,829 participants randomly selected from the Sister Study, a cohort of women who had a sister with breast cancer but had never had breast cancer themselves. 19,741 SNPs were associated with 25(OH)D ( p < 0.05). We re-assessed these hits in an independent sample of 1,534 participants who later developed breast cancer. After pooling, 32 SNPs had genome-wide significant associations ( p < 5 × 10 -8 ). These were located in or near GC , the vitamin D binding protein, or CYP2R1 , a cytochrome P450 enzyme that hydroxylates vitamin D to form 25(OH)D. The top hit was rs4588, a missense GC polymorphism associated with a 3.5 ng/mL decrease in 25(OH)D per copy of the minor allele (95% confidence interval [CI]: -4.1, -3.0; p = 4.5 × 10 -38 ). The strongest SNP near CYP2R1 was rs12794714, a synonymous variant ( p = 3.8 × 10 -12 ; β = 1.8 ng/mL decrease in 25(OH)D per minor allele [CI: -2.2, -1.3]). Serum 25(OH)D concentrations from samples collected from some participants 3-10 years after baseline (811 cases, 780 non-cases) were also strongly associated with both loci. These findings augment our understanding of genetic influences on 25(OH)D and the possible role of vitamin D binding proteins and cytochrome P450 enzymes in determining measured levels. These results may help to identify individuals genetically predisposed to vitamin D insufficiency.
Danjou, Fabrice; Fozza, Claudio; Zoledziewska, Magdalena; Mulas, Antonella; Corda, Giovanna; Contini, Salvatore; Dore, Fausto; Galleu, Antonio; Di Tucci, Anna Angela; Caocci, Giovanni; Gaviano, Eleonora; Latte, Giancarlo; Gabbas, Attilio; Casula, Paolo; Delogu, Lucia Gemma; La Nasa, Giorgio; Angelucci, Emanuele; Cucca, Francesco; Longinotti, Maurizio
Because different findings suggest that an immune dysregulation plays a role in the pathogenesis of myelodysplastic syndrome (MDS), we analyzed a large cohort of patients from a homogeneous Sardinian population using ImmunoChip, a genotyping array exploring 147,954 single-nucleotide polymorphisms (SNPs) localized in genomic regions displaying some degree of association with immune-mediated diseases or pathways. The population studied included 133 cases and 3,894 controls, and a total of 153,978 autosomal markers and 971 non-autosomal markers were genotyped. After association analysis, only one variant passed the genome-wide significance threshold: rs71325459 (p = 1.16 × 10 -12 ), which is situated on chromosome 20. The variant is in high linkage disequilibrium with rs35640778, an untested missense variant situated in the RTEL1 gene, an interesting candidate that encodes for an ATP-dependent DNA helicase implicated in telomere-length regulation, DNA repair, and maintenance of genomic stability. The second most associated signal is composed of five variants that fall slightly below the genome-wide significance threshold but point out another interesting gene candidate. These SNPs, with p values between 2.53 × 10 -6 and 3.34 × 10 -6 , are situated in the methylene tetrahydrofolate reductase (MTHFR) gene. The most associated of these variants, rs1537514, presents an increased frequency of the derived C allele in cases, with 11.4% versus 4.4% in controls. MTHFR is the rate-limiting enzyme in the methyl cycle and genetic variations in this gene have been strongly associated with the risk of neoplastic diseases. The current understanding of the MDS biology, which is based on the hypothesis of the sequential development of multiple subclonal molecular lesions, fits very well with the demonstration of a possible role for RTEL1 and MTHFR gene polymorphisms, both of which are related to a variable risk of genomic instability. Copyright © 2016 ISEH - International
Fanous, Ayman H; Zhou, Baiyu; Aggen, Steven H
Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia.......Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia....
SNPs from the African American breast cancer scan to COGs , a European collaborative study which is has designed a SNP array with that will be genotyped...Award Number: W81XWH-08-1-0383 TITLE: A Genome-wide Breast Cancer Scan in African Americans PRINCIPAL INVESTIGATOR: Christopher A...SUBTITLE A Genome-wide Breast Cancer Scan in African Americans 5a. CONTRACT NUMBER 5b. GRANT NUMBER W81XWH-08-1-0383 5c. PROGRAM
Full Text Available Abstract Background Despite the recent success of genome-wide association studies in identifying novel loci contributing effects to complex human traits, such as type 2 diabetes and obesity, much of the genetic component of variation in these phenotypes remains unexplained. One way to improving power to detect further novel loci is through meta-analysis of studies from the same population, increasing the sample size over any individual study. Although statistical software analysis packages incorporate routines for meta-analysis, they are ill equipped to meet the challenges of the scale and complexity of data generated in genome-wide association studies. Results We have developed flexible, open-source software for the meta-analysis of genome-wide association studies. The software incorporates a variety of error trapping facilities, and provides a range of meta-analysis summary statistics. The software is distributed with scripts that allow simple formatting of files containing the results of each association study and generate graphical summaries of genome-wide meta-analysis results. Conclusions The GWAMA (Genome-Wide Association Meta-Analysis software has been developed to perform meta-analysis of summary statistics generated from genome-wide association studies of dichotomous phenotypes or quantitative traits. Software with source files, documentation and example data files are freely available online at http://www.well.ox.ac.uk/GWAMA.
Full Text Available Young-onset hypertension has a stronger genetic component than late-onset counterpart; thus, the identification of genes related to its susceptibility is a critical issue for the prevention and management of this disease. We carried out a two-stage association scan to map young-onset hypertension susceptibility genes. The first-stage analysis, a genome-wide association study, analyzed 175 matched case-control pairs; the second-stage analysis, a confirmatory association study, verified the results at the first stage based on a total of 1,008 patients and 1,008 controls. Single-locus association tests, multilocus association tests and pair-wise gene-gene interaction tests were performed to identify young-onset hypertension susceptibility genes. After considering stringent adjustments of multiple testing, gene annotation and single-nucleotide polymorphism (SNP quality, four SNPs from two SNP triplets with strong association signals (-log(10(p>7 and 13 SNPs from 8 interactive SNP pairs with strong interactive signals (-log(10(p>8 were carefully re-examined. The confirmatory study verified the association for a SNP quartet 219 kb and 495 kb downstream of LOC344371 (a hypothetical gene and RASGRP3 on chromosome 2p22.3, respectively. The latter has been implicated in the abnormal vascular responsiveness to endothelin-1 and angiotensin II in diabetic-hypertensive rats. Intrinsic synergy involving IMPG1 on chromosome 6q14.2-q15 was also verified. IMPG1 encodes interphotoreceptor matrix proteoglycan 1 which has cation binding capacity. The genes are novel hypertension targets identified in this first genome-wide hypertension association study of the Han Chinese population.
Marchal, Claire; Sasaki, Takayo; Vera, Daniel; Wilson, Korey; Sima, Jiao; Rivera-Mulia, Juan Carlos; Trevilla-García, Claudia; Nogues, Coralin; Nafie, Ebtesam; Gilbert, David M
This protocol is an extension to: Nat. Protoc. 6, 870-895 (2014); doi:10.1038/nprot.2011.328; published online 02 June 2011Cycling cells duplicate their DNA content during S phase, following a defined program called replication timing (RT). Early- and late-replicating regions differ in terms of mutation rates, transcriptional activity, chromatin marks and subnuclear position. Moreover, RT is regulated during development and is altered in diseases. Here, we describe E/L Repli-seq, an extension of our Repli-chip protocol. E/L Repli-seq is a rapid, robust and relatively inexpensive protocol for analyzing RT by next-generation sequencing (NGS), allowing genome-wide assessment of how cellular processes are linked to RT. Briefly, cells are pulse-labeled with BrdU, and early and late S-phase fractions are sorted by flow cytometry. Labeled nascent DNA is immunoprecipitated from both fractions and sequenced. Data processing leads to a single bedGraph file containing the ratio of nascent DNA from early versus late S-phase fractions. The results are comparable to those of Repli-chip, with the additional benefits of genome-wide sequence information and an increased dynamic range. We also provide computational pipelines for downstream analyses, for parsing phased genomes using single-nucleotide polymorphisms (SNPs) to analyze RT allelic asynchrony, and for direct comparison to Repli-chip data. This protocol can be performed in up to 3 d before sequencing, and requires basic cellular and molecular biology skills, as well as a basic understanding of Unix and R.
Stephanie N Lewis
Full Text Available Genome wide association studies (GWAS have proven useful as a method for identifying genetic variations associated with diseases. In this study, we analyzed GWAS data for 61 diseases and phenotypes to elucidate common associations based on single nucleotide polymorphisms (SNP. The study was an expansion on a previous study on identifying disease associations via data from a single GWAS on seven diseases.Adjustments to the originally reported study included expansion of the SNP dataset using Linkage Disequilibrium (LD and refinement of the four levels of analysis to encompass SNP, SNP block, gene, and pathway level comparisons. A pair-wise comparison between diseases and phenotypes was performed at each level and the Jaccard similarity index was used to measure the degree of association between two diseases/phenotypes. Disease relatedness networks (DRNs were used to visualize our results. We saw predominant relatedness between Multiple Sclerosis, type 1 diabetes, and rheumatoid arthritis for the first three levels of analysis. Expected relatedness was also seen between lipid- and blood-related traits.The predominant associations between Multiple Sclerosis, type 1 diabetes, and rheumatoid arthritis can be validated by clinical studies. The diseases have been proposed to share a systemic inflammation phenotype that can result in progression of additional diseases in patients with one of these three diseases. We also noticed unexpected relationships between metabolic and neurological diseases at the pathway comparison level. The less significant relationships found between diseases require a more detailed literature review to determine validity of the predictions. The results from this study serve as a first step towards a better understanding of seemingly unrelated diseases and phenotypes with similar symptoms or modes of treatment.
Michelle D Johnson
Full Text Available Epigenetic marks such as cytosine methylation are important determinants of cellular and whole-body phenotypes. However, the extent of, and reasons for inter-individual differences in cytosine methylation, and their association with phenotypic variation are poorly characterised. Here we present the first genome-wide study of cytosine methylation at single-nucleotide resolution in an animal model of human disease. We used whole-genome bisulfite sequencing in the spontaneously hypertensive rat (SHR, a model of cardiovascular disease, and the Brown Norway (BN control strain, to define the genetic architecture of cytosine methylation in the mammalian heart and to test for association between methylation and pathophysiological phenotypes. Analysis of 10.6 million CpG dinucleotides identified 77,088 CpGs that were differentially methylated between the strains. In F1 hybrids we found 38,152 CpGs showing allele-specific methylation and 145 regions with parent-of-origin effects on methylation. Cis-linkage explained almost 60% of inter-strain variation in methylation at a subset of loci tested for linkage in a panel of recombinant inbred (RI strains. Methylation analysis in isolated cardiomyocytes showed that in the majority of cases methylation differences in cardiomyocytes and non-cardiomyocytes were strain-dependent, confirming a strong genetic component for cytosine methylation. We observed preferential nucleotide usage associated with increased and decreased methylation that is remarkably conserved across species, suggesting a common mechanism for germline control of inter-individual variation in CpG methylation. In the RI strain panel, we found significant correlation of CpG methylation and levels of serum chromogranin B (CgB, a proposed biomarker of heart failure, which is evidence for a link between germline DNA sequence variation, CpG methylation differences and pathophysiological phenotypes in the SHR strain. Together, these results will
Nguyen, Thao T.B.; Arimatsu, Yuji; Hong, Sung-Jong; Brindley, Paul J.; Blair, David; Laha, Thewarach; Sripa, Banchob
Clonorchis sinensis is an important carcinogenic human liver fluke endemic in East and Southeast Asia. There are several conventional molecular markers have been used for identification and genetic diversity, however, no information about microsatellites of this liver fluke published so far. We here report microsatellite characterization and marker development for genetic diversity study in C. sinensis using genome-wide bioinformatics approach. Based on our search criteria, a total of 256,990 microsatellites (≥ 12 base pairs) were identified from genome database of C. sinensis with hexa-nucleotide motif being the most abundant (51%) followed by penta-nucleotide (18.3%) and tri-nucleotide (12.7%). The tetra-nucleotide, di-nucleotide and mononucleotide motifs accounted for 9.75 %, 7.63% and 0.14%, respectively. The total length of all microsatellites accounts for 0. 72 % of 547 Mb of the whole genome size and the frequency of microsatellites were found to be one microsatellite in every 2.13 kb of DNA. For the di-, tri, and tetra-nucleotide, the repeat numbers redundant are six (28%), four (45%) and three (76%), respectively. The ATC repeat is the most abundant microsatellites followed by AT, AAT and AC, respectively. Within 40 microsatellite loci developed, 24 microsatellite markers showed potential to differentiate between C. sinensis and O. viverrini. Seven out of 24 loci showed heterozygous with observed heterozygosity ranged from 0.467 to 1. Four-primer sets could amplify both C. sinensis and O. viverrini DNA with different sizes. This study provides basic information of C. sinensis microsatellites and the genome-wide markers developed may be a useful tool for genetic study of C. sinensis. PMID:25782682
Li, Zheng; Xia, Yi; Feng, Li-Na; Chen, Jie-Rong; Li, Hong-Min; Cui, Jing; Cai, Qing-Qing; Sim, Kar Seng; Nairismägi, Maarja-Liisa; Laurensia, Yurike; Meah, Wee Yang; Liu, Wen-Sheng; Guo, Yun-Miao; Chen, Li-Zhen; Feng, Qi-Sheng; Pang, Chi Pui; Chen, Li Jia; Chew, Soo Hong; Ebstein, Richard P; Foo, Jia Nee; Liu, Jianjun; Ha, Jeslin; Khoo, Lay Poh; Chin, Suk Teng; Zeng, Yi-Xin; Aung, Tin; Chowbay, Balram; Diong, Colin Phipps; Zhang, Fen; Liu, Yan-Hui; Tang, Tiffany; Tao, Miriam; Quek, Richard; Mohamad, Farid; Tan, Soo Yong; Teh, Bin Tean; Ng, Siok Bian; Chng, Wee Joo; Ong, Choon Kiat; Okada, Yukinori; Raychaudhuri, Soumya; Lim, Soon Thye; Tan, Wen; Peng, Rou-Jun; Khor, Chiea Chuen; Bei, Jin-Xin
Extranodal natural killer T-cell lymphoma (NKTCL), nasal type, is a rare and aggressive malignancy that occurs predominantly in Asian and Latin American populations. Although Epstein-Barr virus infection is a known risk factor, other risk factors and the pathogenesis of NKTCL are not well understood. We aimed to identify common genetic variants affecting individual risk of NKTCL. We did a genome-wide association study of 189 patients with extranodal NKTCL, nasal type (WHO classification criteria; cases) and 957 controls from Guangdong province, southern China. We validated our findings in four independent case-control series, including 75 cases from Guangdong province and 296 controls from Hong Kong, 65 cases and 983 controls from Guangdong province, 125 cases and 1110 controls from Beijing (northern China), and 60 cases and 2476 controls from Singapore. We used imputation and conditional logistic regression analyses to fine-map the associations. We also did a meta-analysis of the replication series and of the entire dataset. Associations exceeding the genome-wide significance threshold (p<5 × 10(-8)) were seen at 51 single-nucleotide polymorphisms (SNPs) mapping to the class II MHC region on chromosome 6, with rs9277378 (located in HLA-DPB1) having the strongest association with NKTCL susceptibility (p=4·21 × 10(-19), odds ratio [OR] 1·84 [95% CI 1·61-2·11] in meta-analysis of entire dataset). Imputation-based fine-mapping across the class II MHC region suggests that four aminoacid residues (Gly84-Gly85-Pro86-Met87) in near-complete linkage disequilibrium at the edge of the peptide-binding groove of HLA-DPB1 could account for most of the association between the rs9277378*A risk allele and NKTCL susceptibility (OR 2·38, p value for haplotype 2·32 × 10(-14)). This association is distinct from MHC associations with Epstein-Barr virus infection. To our knowledge, this is the first time that a genetic variant conferring an NKTCL risk is noted at
Ferrari, Raffaele; Hernandez, Dena G; Nalls, Michael A; Rohrer, Jonathan D; Ramasamy, Adaikalavan; Kwok, John B J; Dobson-Stone, Carol; Brooks, William S; Schofield, Peter R; Halliday, Glenda M; Hodges, John R; Piguet, Olivier; Bartley, Lauren; Thompson, Elizabeth; Haan, Eric; Hernández, Isabel; Ruiz, Agustín; Boada, Mercè; Borroni, Barbara; Padovani, Alessandro; Cruchaga, Carlos; Cairns, Nigel J; Benussi, Luisa; Binetti, Giuliano; Ghidoni, Roberta; Forloni, Gianluigi; Galimberti, Daniela; Fenoglio, Chiara; Serpente, Maria; Scarpini, Elio; Clarimón, Jordi; Lleó, Alberto; Blesa, Rafael; Waldö, Maria Landqvist; Nilsson, Karin; Nilsson, Christer; Mackenzie, Ian R A; Hsiung, Ging-Yuek R; Mann, David M A; Grafman, Jordan; Morris, Christopher M; Attems, Johannes; Griffiths, Timothy D; McKeith, Ian G; Thomas, Alan J; Pietrini, P; Huey, Edward D; Wassermann, Eric M; Baborie, Atik; Jaros, Evelyn; Tierney, Michael C; Pastor, Pau; Razquin, Cristina; Ortega-Cubero, Sara; Alonso, Elena; Perneczky, Robert; Diehl-Schmid, Janine; Alexopoulos, Panagiotis; Kurz, Alexander; Rainero, Innocenzo; Rubino, Elisa; Pinessi, Lorenzo; Rogaeva, Ekaterina; St George-Hyslop, Peter; Rossi, Giacomina; Tagliavini, Fabrizio; Giaccone, Giorgio; Rowe, James B; Schlachetzki, Johannes C M; Uphill, James; Collinge, John; Mead, Simon; Danek, Adrian; Van Deerlin, Vivianna M; Grossman, Murray; Trojanowski, John Q; van der Zee, Julie; Deschamps, William; Van Langenhove, Tim; Cruts, Marc; Van Broeckhoven, Christine; Cappa, Stefano F; Le Ber, Isabelle; Hannequin, Didier; Golfier, Véronique; Vercelletto, Martine; Brice, Alexis; Nacmias, Benedetta; Sorbi, Sandro; Bagnoli, Silvia; Piaceri, Irene; Nielsen, Jørgen E; Hjermind, Lena E; Riemenschneider, Matthias; Mayhaus, Manuel; Ibach, Bernd; Gasparoni, Gilles; Pichler, Sabrina; Gu, Wei; Rossor, Martin N; Fox, Nick C; Warren, Jason D; Spillantini, Maria Grazia; Morris, Huw R; Rizzu, Patrizia; Heutink, Peter; Snowden, Julie S; Rollinson, Sara; Richardson, Anna; Gerhard, Alexander; Bruni, Amalia C; Maletta, Raffaele; Frangipane, Francesca; Cupidi, Chiara; Bernardi, Livia; Anfossi, Maria; Gallo, Maura; Conidi, Maria Elena; Smirne, Nicoletta; Rademakers, Rosa; Baker, Matt; Dickson, Dennis W; Graff-Radford, Neill R; Petersen, Ronald C; Knopman, David; Josephs, Keith A; Boeve, Bradley F; Parisi, Joseph E; Seeley, William W; Miller, Bruce L; Karydas, Anna M; Rosen, Howard; van Swieten, John C; Dopper, Elise G P; Seelaar, Harro; Pijnenburg, Yolande A L; Scheltens, Philip; Logroscino, Giancarlo; Capozzo, Rosa; Novelli, Valeria; Puca, Annibale A; Franceschi, Massimo; Postiglione, Alfredo; Milan, Graziella; Sorrentino, Paolo; Kristiansen, Mark; Chiang, Huei-Hsin; Graff, Caroline; Pasquier, Florence; Rollin, Adeline; Deramecourt, Vincent; Lebert, Florence; Kapogiannis, Dimitrios; Ferrucci, Luigi; Pickering-Brown, Stuart; Singleton, Andrew B; Hardy, John; Momeni, Parastoo
Frontotemporal dementia (FTD) is a complex disorder characterised by a broad range of clinical manifestations, differential pathological signatures, and genetic variability. Mutations in three genes-MAPT, GRN, and C9orf72--have been associated with FTD. We sought to identify novel genetic risk loci associated with the disorder. We did a two-stage genome-wide association study on clinical FTD, analysing samples from 3526 patients with FTD and 9402 healthy controls. To reduce genetic heterogeneity, all participants were of European ancestry. In the discovery phase (samples from 2154 patients with FTD and 4308 controls), we did separate association analyses for each FTD subtype (behavioural variant FTD, semantic dementia, progressive non-fluent aphasia, and FTD overlapping with motor neuron disease [FTD-MND]), followed by a meta-analysis of the entire dataset. We carried forward replication of the novel suggestive loci in an independent sample series (samples from 1372 patients and 5094 controls) and then did joint phase and brain expression and methylation quantitative trait loci analyses for the associated (p<5 × 10(-8)) single-nucleotide polymorphisms. We identified novel associations exceeding the genome-wide significance threshold (p<5 × 10(-8)). Combined (joint) analyses of discovery and replication phases showed genome-wide significant association at 6p21.3, HLA locus (immune system), for rs9268877 (p=1·05 × 10(-8); odds ratio=1·204 [95% CI 1·11-1·30]), rs9268856 (p=5·51 × 10(-9); 0·809 [0·76-0·86]) and rs1980493 (p value=1·57 × 10(-8), 0·775 [0·69-0·86]) in the entire cohort. We also identified a potential novel locus at 11q14, encompassing RAB38/CTSC (the transcripts of which are related to lysosomal biology), for the behavioural FTD subtype for which joint analyses showed suggestive association for rs302668 (p=2·44 × 10(-7); 0·814 [0·71-0·92]). Analysis of expression and methylation quantitative trait loci data
Schwonbeck, Susanne; Krause-Griep, Andrea; Gajovic-Eichelmann, Nenad; Ehrentreich-Förster, Eva; Meinl, Walter; Glatt, Hansrüdi; Bier, Frank F
A method has been developed to determine SNPs on DNA chips by applying a flow-through bioscanner. As a practical application we demonstrated the fast and simple SNP analysis of 24 genotypes in an array of 96 spots with a single hybridisation and dissociation experiment. The main advantage of this methodical concept is the parallel and fast analysis without any need of enzymatic digestion. Additionally, the DNA chip format used is appropriate for parallel analysis up to 400 spots. The polymorphism in the gene of the human phenol sulfotransferase SULT1A1 was studied as a model SNP. Biotinylated PCR products containing the SNP (The SNP summary web site: ) (mutant) and those containing no mutation (wild-type) were brought onto the chips coated with NeutrAvidin using non-contact spotting. This was followed by an analysis which was carried out in a flow-through biochip scanner while constantly rinsing with buffer. After removing the non-biotinylated strand a fluorescent probe was hybridised, which is complementary to the wild-type sequence. If this probe binds to a mutant sequence, then one single base is not fully matching. Thereby, the mismatched hybrid (mutant) is less stable than the full-matched hybrid (wild-type). The final step after hybridisation on the chip involves rinsing with a buffer to start dissociation of the fluorescent probe from the immobilised DNA strand. The online measurement of the fluorescence intensity by the biochip scanner provides the possibility to follow the kinetics of the hybridisation and dissociation processes. According to the different stability of the full-match and the mismatch, either visual discrimination or kinetic analysis is possible to distinguish SNP-containing sequence from the wild-type sequence.
Morikawa, Takanori; Yokota, Kazumichi; Tanimoto, Sachie; Tsutsui, Makusu; Taniguchi, Masateru
Label-free detection of single-nucleotides was performed by fast tunneling current measurements in a polar solvent at 1 MHz sampling rate using SiO₂-protected Au nanoprobes. Short current spikes were observed, suggestive of trapping/detrapping of individual nucleotides between the nanoelectrodes. The fall and rise features of the electrical signatures indicated signal retardation by capacitance effects with a time constant of about 10 microseconds. The high temporal resolution revealed current fluctuations, reflecting the molecular conformation degrees of freedom in the electrode gap. The method presented in this work may enable direct characterizations of dynamic changes in single-molecule conformations in an electrode gap in liquid.
Rachel Maree Jones
Full Text Available Research has proposed that autistic-like traits in the general population lie on a continuum, with clinical Autism Spectrum Disorder (ASD representing the extreme end of this distribution. Inherent in this proposal is that biological mechanisms associated with clinical ASD may also underpin variation in autistic-like traits within the general population. A genome-wide association study using 2,462,046 single nucleotide polymorphisms (SNPs was undertaken for ASD in 965 individuals from the Western Australian Pregnancy Cohort (Raine Study. No SNP associations reached genome-wide significance (p < 5.0 x 10-8. However, investigations into nominal observed SNP associations (p < 1.0 x 10-5 add support to two positional candidate genes previously implicated in ASD aetiology, PRKCB1 and CBLN1.The rs198198 SNP (p = 9.587 x 10-6, is located within an intron of the protein kinase C, beta 1 (PRKCB1 gene on chromosome 16p11. The PRKCB1 gene has been previously reported in linkage and association studies for ASD, and its mRNA expression has been shown to be significantly down regulated in ASD cases compared with controls. The rs16946931 SNP (p = 1.78 x 10-6 is located in a region flanking the Cerebellin 1 (CBLN1 gene on chromosome 16q12.1. The CBLN1 gene is involved with synaptogenesis and is part of a gene family previously implicated in ASD. This GWA study is only the second to examine SNPs associated with autistic-like traits in the general population, and provides evidence to support roles for the PRKCB1 and CBLN1 genes in risk of clinical ASD.
Davies, G; Marioni, R E; Liewald, D C; Hill, W D; Hagenaars, S P; Harris, S E; Ritchie, S J; Luciano, M; Fawns-Ritchie, C; Lyall, D; Cullen, B; Cox, S R; Hayward, C; Porteous, D J; Evans, J; McIntosh, A M; Gallacher, J; Craddock, N; Pell, J P; Smith, D J; Gale, C R; Deary, I J
People's differences in cognitive functions are partly heritable and are associated with important life outcomes. Previous genome-wide association (GWA) studies of cognitive functions have found evidence for polygenic effects yet, to date, there are few replicated genetic associations. Here we use data from the UK Biobank sample to investigate the genetic contributions to variation in tests of three cognitive functions and in educational attainment. GWA analyses were performed for verbal–numerical reasoning (N=36 035), memory (N=112 067), reaction time (N=111 483) and for the attainment of a college or a university degree (N=111 114). We report genome-wide significant single-nucleotide polymorphism (SNP)-based associations in 20 genomic regions, and significant gene-based findings in 46 regions. These include findings in the ATXN2, CYP2DG, APBA1 and CADM2 genes. We report replication of these hits in published GWA studies of cognitive function, educational attainment and childhood intelligence. There is also replication, in UK Biobank, of SNP hits reported previously in GWA studies of educational attainment and cognitive function. GCTA-GREML analyses, using common SNPs (minor allele frequency>0.01), indicated significant SNP-based heritabilities of 31% (s.e.m.=1.8%) for verbal–numerical reasoning, 5% (s.e.m.=0.6%) for memory, 11% (s.e.m.=0.6%) for reaction time and 21% (s.e.m.=0.6%) for educational attainment. Polygenic score analyses indicate that up to 5% of the variance in cognitive test scores can be predicted in an independent cohort. The genomic regions identified include several novel loci, some of which have been associated with intracranial volume, neurodegeneration, Alzheimer's disease and schizophrenia. PMID:27046643
Lee, Myoungsook; Kwon, Dae Young; Kim, Myung-Sunny; Choi, Chong Ran; Park, Mi-Young; Kim, Ae-Jung
This is the first study to identify common genetic factors associated with the basal metabolic rate (BMR) and body mass index (BMI) in obese Korean women including overweight. This will be a basic study for future research of obese gene-BMR interaction. The experimental design was 2 by 2 with variables of BMR and BMI. A genome-wide association study (GWAS) of single nucleotide polymorphisms (SNPs) was conducted in the overweight and obesity (BMI > 23 kg/m(2)) compared to the normality, and in women with low BMR (BMR. A total of 140 SNPs reached formal genome-wide statistical significance in this study (P BMR (rs10786764; P = 8.0 × 10(-7), rs1040675; 2.3 × 10(-6)) and BMI (rs10786764; P = 2.5 × 10(-5), rs10786764; 6.57 × 10(-5)). The other genes related to BMI (HSD52, TMA16, MARCH1, NRG1, NRXN3, and STK4) yielded P BMR and BMI, including NRG3, OR8U8, BCL2L2-PABPN1, PABPN1, and SLC22A17 were identified in obese Korean women (P BMR- and BMI-related genes using GWAS. Although most of these newly established loci were not previously associated with obesity, they may provide new insights into body weight regulation. Our findings of five common genes associated with BMR and BMI in Koreans will serve as a reference for replication and validation of future studies on the metabolic rate.
Yu, Dongmei; Mathews, Carol A.; Scharf, Jeremiah M.; Neale, Benjamin M.; Davis, Lea K.; Gamazon, Eric R.; Derks, Eske M.; Evans, Patrick; Edlund, Christopher K.; Crane, Jacquelyn; Fagerness, Jesen A.; Osiecki, Lisa; Gallagher, Patience; Gerber, Gloria; Haddad, Stephen; Illmann, Cornelia; McGrath, Lauren M.; Mayerfeld, Catherine; Arepalli, Sampath; Barlassina, Cristina; Barr, Cathy L.; Bellodi, Laura; Benarroch, Fortu; Berrió, Gabriel Bedoya; Bienvenu, O. Joseph; Black, Donald; Bloch, Michael H.; Brentani, Helena; Bruun, Ruth D.; Budman, Cathy L.; Camarena, Beatriz; Campbell, Desmond D.; Cappi, Carolina; Cardona Silgado, Julio C.; Cavallini, Maria C.; Chavira, Denise A.; Chouinard, Sylvain; Cook, Edwin H.; Cookson, M. R.; Coric, Vladimir; Cullen, Bernadette; Cusi, Daniele; Delorme, Richard; Denys, Damiaan; Dion, Yves; Eapen, Valsama; Egberts, Karin; Falkai, Peter; Fernandez, Thomas; Fournier, Eduardo; Garrido, Helena; Geller, Daniel; Gilbert, Donald; Girard, Simon L.; Grabe, Hans J.; Grados, Marco A.; Greenberg, Benjamin D.; Gross-Tsur, Varda; Grünblatt, Edna; Hardy, John; Heiman, Gary A.; Hemmings, Sian M.J.; Herrera, Luis D.; Hezel, Dianne M.; Hoekstra, Pieter J.; Jankovic, Joseph; Kennedy, James L.; King, Robert A.; Konkashbaev, Anuar I.; Kremeyer, Barbara; Kurlan, Roger; Lanzagorta, Nuria; Leboyer, Marion; Leckman, James F.; Lennertz, Leonhard; Liu, Chunyu; Lochner, Christine; Lowe, Thomas L.; Lupoli, Sara; Macciardi, Fabio; Maier, Wolfgang; Manunta, Paolo; Marconi, Maurizio; McCracken, James T.; Mesa Restrepo, Sandra C.; Moessner, Rainald; Moorjani, Priya; Morgan, Jubel; Muller, Heike; Murphy, Dennis L.; Naarden, Allan L.; Ochoa, William Cornejo; Ophoff, Roel A.; Pakstis, Andrew J.; Pato, Michele T.; Pato, Carlos N.; Piacentini, John; Pittenger, Christopher; Pollak, Yehuda; Rauch, Scott L.; Renner, Tobias; Reus, Victor I.; Richter, Margaret A.; Riddle, Mark A.; Robertson, Mary M.; Romero, Roxana; Rosário, Maria C.; Rosenberg, David; Ruhrmann, Stephan; Sabatti, Chiara; Salvi, Erika; Sampaio, Aline S.; Samuels, Jack; Sandor, Paul; Service, Susan K.; Sheppard, Brooke; Singer, Harvey S.; Smit, Jan H.; Stein, Dan J.; Strengman, Eric; Tischfield, Jay A.; Turiel, Maurizio; Valencia Duarte, Ana V.; Vallada, Homero; Veenstra-VanderWeele, Jeremy; Walitza, Susanne; Walkup, John; Wang, Ying; Weale, Mike; Weiss, Robert; Wendland, Jens R.; Westenberg, Herman G.M.; Yao, Yin; Hounie, Ana G.; Miguel, Euripedes C.; Nicolini, Humberto; Wagner, Michael; Ruiz-Linares, Andres; Cath, Danielle C.; McMahon, William; Posthuma, Danielle; Oostra, Ben A.; Nestadt, Gerald; Rouleau, Guy A.; Purcell, Shaun; Jenike, Michael A.; Heutink, Peter; Hanna, Gregory L.; Conti, David V.; Arnold, Paul D.; Freimer, Nelson; Stewart, S. Evelyn; Knowles, James A.; Cox, Nancy J.; Pauls, David L.
Obsessive-compulsive disorder (OCD) and Tourette Syndrome (TS) are highly heritable neurodevelopmental disorders that are thought to share genetic risk factors. However, the identification of definitive susceptibility genes for these etiologically complex disorders remains elusive. Here, we report a combined genome-wide association study (GWAS) of TS and OCD in 2723 cases (1310 with OCD, 834 with TS, 579 with OCD plus TS/chronic tics (CT)), 5667 ancestry-matched controls, and 290 OCD parent-child trios. Although no individual single nucleotide polymorphisms (SNPs) achieved genome-wide significance, the GWAS signals were enriched for SNPs strongly associated with variations in brain gene expression levels, i.e. expression quantitative loci (eQTLs), suggesting the presence of true functional variants that contribute to risk of these disorders. Polygenic score analyses identified a significant polygenic component for OCD (p=2×10−4), predicting 3.2% of the phenotypic variance in an independent data set. In contrast, TS had a smaller, non-significant polygenic component, predicting only 0.6% of the phenotypic variance (p=0.06). No significant polygenic signal was detected across the two disorders, although the sample is likely underpowered to detect a modest shared signal. Furthermore, the OCD polygenic signal was significantly attenuated when cases with both OCD and TS/CT were included in the analysis (p=0.01). Previous work has shown that TS and OCD have some degree of shared genetic variation. However, the data from this study suggest that there are also distinct components to the genetic architectures of TS and OCD. Furthermore, OCD with co-occurring TS/CT may have different underlying genetic susceptibility compared to OCD alone. PMID:25158072
de Tayrac, Marie; Roth, Marie-Paule; Jouanolle, Anne-Marie; Coppin, Hélène; le Gac, Gérald; Piperno, Alberto; Férec, Claude; Pelucchi, Sara; Scotet, Virginie; Bardou-Jacquet, Edouard; Ropert, Martine; Bouvet, Régis; Génin, Emmanuelle; Mosser, Jean; Deugnier, Yves
Hereditary hemochromatosis (HH) is the most common form of genetic iron loading disease. It is mainly related to the homozygous C282Y/C282Y mutation in the HFE gene that is, however, a necessary but not a sufficient condition to develop clinical and even biochemical HH. This suggests that modifier genes are likely involved in the expressivity of the disease. Our aim was to identify such modifier genes. We performed a genome-wide association study (GWAS) using DNA collected from 474 unrelated C282Y homozygotes. Associations were examined for both quantitative iron burden indices and clinical outcomes with 534,213 single nucleotide polymorphisms (SNP) genotypes, with replication analyses in an independent sample of 748 C282Y homozygotes from four different European centres. One SNP met genome-wide statistical significance for association with transferrin concentration (rs3811647, GWAS p value of 7×10(-9) and replication p value of 5×10(-13)). This SNP, located within intron 11 of the TF gene, had a pleiotropic effect on serum iron (GWAS p value of 4.9×10(-6) and replication p value of 3.2×10(-6)). Both serum transferrin and iron levels were associated with serum ferritin levels, amount of iron removed and global clinical stage (pHFE-associated HH (HFE-HH) patients, identified the rs3811647 polymorphism in the TF gene as the only SNP significantly associated with iron metabolism through serum transferrin and iron levels. Because these two outcomes were clearly associated with the biochemical and clinical expression of the disease, an indirect link between the rs3811647 polymorphism and the phenotypic presentation of HFE-HH is likely. Copyright © 2014 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.
Friedrich, Juliane; Brand, Bodo; Ponsuksili, Siriluck; Graunke, Katharina L; Langbein, Jan; Knaust, Jacqueline; Kühn, Christa; Schwerin, Manfred
Behaviour traits of cattle have been reported to affect important production traits, such as meat quality and milk performance as well as reproduction and health. Genetic predisposition is, together with environmental stimuli, undoubtedly involved in the development of behaviour phenotypes. Underlying molecular mechanisms affecting behaviour in general and behaviour and productions traits in particular still have to be studied in detail. Therefore, we performed a genome-wide association study in an F2 Charolais × German Holstein cross-breed population to identify genetic variants that affect behaviour-related traits assessed in an open-field and novel-object test and analysed their putative impact on milk performance. Of 37,201 tested single nucleotide polymorphism (SNPs), four showed a genome-wide and 37 a chromosome-wide significant association with behaviour traits assessed in both tests. Nine of the SNPs that were associated with behaviour traits likewise showed a nominal significant association with milk performance traits. On chromosomes 14 and 29, six SNPs were identified to be associated with exploratory behaviour and inactivity during the novel-object test as well as with milk yield traits. Least squares means for behaviour and milk performance traits for these SNPs revealed that genotypes associated with higher inactivity and less exploratory behaviour promote higher milk yields. Whether these results are due to molecular mechanisms simultaneously affecting behaviour and milk performance or due to a behaviour predisposition, which causes indirect effects on milk performance by influencing individual reactivity, needs further investigation. © 2015 Stichting International Foundation for Animal Genetics.
Aguiar, Derek; Halldórsson, Bjarni V.; Morrow, Eric M.; Istrail, Sorin
Motivation: The understanding of the genetic determinants of complex disease is undergoing a paradigm shift. Genetic heterogeneity of rare mutations with deleterious effects is more commonly being viewed as a major component of disease. Autism is an excellent example where research is active in identifying matches between the phenotypic and genomic heterogeneities. A considerable portion of autism appears to be correlated with copy number variation, which is not directly probed by single nucleotide polymorphism (SNP) array or sequencing technologies. Identifying the genetic heterogeneity of small deletions remains a major unresolved computational problem partly due to the inability of algorithms to detect them. Results: In this article, we present an algorithmic framework, which we term DELISHUS, that implements three exact algorithms for inferring regions of hemizygosity containing genomic deletions of all sizes and frequencies in SNP genotype data. We implement an efficient backtracking algorithm—that processes a 1 billion entry genome-wide association study SNP matrix in a few minutes—to compute all inherited deletions in a dataset. We further extend our model to give an efficient algorithm for detecting de novo deletions. Finally, given a set of called deletions, we also give a polynomial time algorithm for computing the critical regions of recurrent deletions. DELISHUS achieves significantly lower false-positive rates and higher power than previously published algorithms partly because it considers all individuals in the sample simultaneously. DELISHUS may be applied to SNP array or sequencing data to identify the deletion spectrum for family-based association studies. Availability: DELISHUS is available at http://www.brown.edu/Research/Istrail_Lab/. Contact: Eric_Morrow@brown.edu and Sorin_Istrail@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22689755
Wen, Zixiang; Boyse, John F; Song, Qijian; Cregan, Perry B; Wang, Dechun
Crop improvement always involves selection of specific alleles at genes controlling traits of agronomic importance, likely resulting in detectable signatures of selection within the genome of modern soybean (Glycine max L. Merr.). The identification of these signatures of selection is meaningful from the perspective of evolutionary biology and for uncovering the genetic architecture of agronomic traits. To this end, two populations of soybean, consisting of 342 landraces and 1062 improved lines, were genotyped with the SoySNP50K Illumina BeadChip containing 52,041 single nucleotide polymorphisms (SNPs), and systematically phenotyped for 9 agronomic traits. A cross-population composite likelihood ratio (XP-CLR) method was used to screen the signals of selective sweeps. A total of 125 candidate selection regions were identified, many of which harbored genes potentially involved in crop improvement. To further investigate whether these candidate regions were in fact enriched for genes affected by selection, genome-wide association studies (GWAS) were conducted on 7 selection traits targeted in soybean breeding (grain yield, plant height, lodging, maturity date, seed coat color, seed protein and oil content) and 2 non-selection traits (pubescence and flower color). Major genomic regions associated with selection traits overlapped with candidate selection regions, whereas no overlap of this kind occurred for the non-selection traits, suggesting that the selection sweeps identified are associated with traits of agronomic importance. Multiple novel loci and refined map locations of known loci related to these traits were also identified. These findings illustrate that comparative genomic analyses, especially when combined with GWAS, are a promising approach to dissect the genetic architecture of complex traits.
Choi, Bong Hwan; Wijayananda, Hasini I; Lee, Soo Hyun; Lee, Doo Ho; Kim, Jong Seok; Oh, Seok Il; Park, Eung Woo; Lee, Cheul Koo; Lee, Seung Hwan
There are various hypotheses on dog domestication based on archeological and genetic studies. Although many studies have been conducted on the origin of dogs, the existing literature about the ancestry, diversity, and population structure of Korean dogs is sparse. Therefore, this study is focused on the origin, diversity and population structure of Korean dogs. The study sample comprised four major categories, including non-dogs (coyotes and wolves), ancient, modern and Korean dogs. Selected samples were genotyped using an Illumina CanineHD array containing 173,662 single nucleotide polymorphisms. The genome-wide data were filtered using quality control parameters in PLINK 1.9. Only autosomal chromosomes were used for further analysis. The negative off-diagonal variance of the genetic relationship matrix analysis depicted, the variability of samples in each population. FIS (inbreeding rate within a population) values indicated, a low level of inbreeding within populations, and the patterns were in concordance with the results of Nei's genetic distance analysis. The lowest FST (inbreeding rate between populations) values among Korean and Chinese breeds, using a phylogenetic tree, multi-dimensional scaling, and a TreeMix likelihood tree showed Korean breeds are highly related to Chinese breeds. The Korean breeds possessed a unique and large diversity of admixtures compared with other breeds. The highest and lowest effective population sizes were observed in Korean Jindo Black (485) and Korean Donggyeong White (109), respectively. The historical effective population size of all Korean dogs showed declining trend from the past to present. It is important to take immediate action to protect the Korean dog population while conserving their diversity. Furthermore, this study suggests that Korean dogs have unique diversity and are one of the basal lineages of East Asian dogs, originating from China.
Parchman, Thomas L; Gompert, Zachariah; Mudge, Joann; Schilkey, Faye D; Benkman, Craig W; Buerkle, C Alex
Pine cones that remain closed and retain seeds until fire causes the cones to open (cone serotiny) represent a key adaptive trait in a variety of pine species. In lodgepole pine, there is substantial geographical variation in serotiny across the Rocky Mountain region. This variation in serotiny has evolved as a result of geographically divergent selection, with consequences that extend to forest communities and ecosystems. An understanding of the genetic architecture of this trait is of interest owing to the wide-reaching ecological consequences of serotiny and also because of the repeated evolution of the trait across the genus. Here, we present and utilize an inexpensive and time-effective method for generating population genomic data. The method uses restriction enzymes and PCR amplification to generate a library of fragments that can be sequenced with a high level of multiplexing. We obtained data for more than 95,000 single nucleotide polymorphisms across 98 serotinous and nonserotinous lodgepole pines from three populations. We used a Bayesian generalized linear model (GLM) to test for an association between genotypic variation at these loci and serotiny. The probability of serotiny varied by genotype at 11 loci, and the association between genotype and serotiny at these loci was consistent in each of the three populations of pines. Genetic variation across these 11 loci explained 50% of the phenotypic variation in serotiny. Our results provide a first genome-wide association map of serotiny in pines and demonstrate an inexpensive and efficient method for generating population genomic data. © 2012 Blackwell Publishing Ltd.
Nicholls, Andrew W.; Salek, Reza M.; Marques-Vidal, Pedro; Morya, Edgard; Sameshima, Koichi; Montoliu, Ivan; Da Silva, Laeticia; Collino, Sebastiano; Martin, François-Pierre; Rezzi, Serge; Steinbeck, Christoph; Waterworth, Dawn M.; Waeber, Gérard; Vollenweider, Peter; Beckmann, Jacques S.; Le Coutre, Johannes; Mooser, Vincent; Bergmann, Sven; Genick, Ulrich K.; Kutalik, Zoltán
Metabolic traits are molecular phenotypes that can drive clinical phenotypes and may predict disease progression. Here, we report results from a metabolome- and genome-wide association study on 1H-NMR urine metabolic profiles. The study was conducted within an untargeted approach, employing a novel method for compound identification. From our discovery cohort of 835 Caucasian individuals who participated in the CoLaus study, we identified 139 suggestively significant (P<5×10−8) and independent associations between single nucleotide polymorphisms (SNP) and metabolome features. Fifty-six of these associations replicated in the TasteSensomics cohort, comprising 601 individuals from São Paulo of vastly diverse ethnic background. They correspond to eleven gene-metabolite associations, six of which had been previously identified in the urine metabolome and three in the serum metabolome. Our key novel findings are the associations of two SNPs with NMR spectral signatures pointing to fucose (rs492602, P = 6.9×10−44) and lysine (rs8101881, P = 1.2×10−33), respectively. Fine-mapping of the first locus pinpointed the FUT2 gene, which encodes a fucosyltransferase enzyme and has previously been associated with Crohn's disease. This implicates fucose as a potential prognostic disease marker, for which there is already published evidence from a mouse model. The second SNP lies within the SLC7A9 gene, rare mutations of which have been linked to severe kidney damage. The replication of previous associations and our new discoveries demonstrate the potential of untargeted metabolomics GWAS to robustly identify molecular disease markers. PMID:24586186
Ali Saleh Hassan
Full Text Available In barley endosperm arabinoxylan (AX is the second most abundant cell wall polysaccharide and in wheat it is the most abundant polysaccharide in the starchy endosperm walls of the grain. AX is one of the main contributors to grain dietary fibre content providing several health benefits including cholesterol and glucose lowering effects, and antioxidant activities. Due to its complex structural features, AX might also affect the downstream applications of barley grain in malting and brewing. Using a high pressure liquid chromatography (HPLC method we quantified AX amounts in mature grain in 128 spring 2-row barley accessions. Amounts ranged from ~ 5.2 μg/g to ~ 9 μg/g. We used this data for a Genome Wide Association Study (GWAS that revealed three significant quantitative trait loci (QTL associated with grain AX levels which passed a false discovery threshold (FDR and are located on two of the seven barley chromosomes. Regions underlying the QTLs were scanned for genes likely to be involved in AX biosynthesis or turnover, and strong candidates, including glycosyltransferases from the GT43 and GT61 families and glycoside hydrolases from the GH10 family, were identified. Phylogenetic trees of selected gene families were built based on protein translations and were used to examine the relationship of the barley candidate genes to those in other species. Our data reaffirms the roles of existing genes thought to contribute to AX content, and identifies novel QTL (and candidate genes associated with them potentially influencing the AX content of barley grain. One potential outcome of this work is the deployment of highly associated single nucleotide polymorphisms markers in breeding programs to guide the modification of AX abundance in barley grain.
Luo, Chenglong; Qu, Hao; Wang, Jie; Wang, Yan; Ma, Jie; Li, Chunyu; Yang, Chunfen; Hu, Xiaoxiang; Li, Ning; Shu, Dingming
Hyperpigmentation of the visceral peritoneum (HVP) has recently garnered much attention in the poultry industry because of the possible risk to the health of affected animals and the damage it causes to the appearance of commercial chicken carcasses. However, the heritable characters of HVP remain unclear. The objective of this study was to investigate the genetic parameters of HVP by genome-wide association study (GWAS) in chickens. HVP was found to be influenced by genetic factors, with a heritability score of 0.33. HVP had positive genetic correlations with growth and carcass traits, such as leg muscle weight (rg = 0.34), but had negative genetic correlations with immune traits, such as the antibody response to Newcastle disease virus (rg = -0.42). The GWAS for HVP using 39,833 single nucleotide polymorphisms indicated the genetic factors associated with HVP displayed an additive effect rather than a dominance effect. In addition, we determined that three genomic regions, involving the 50.5-54.0 Mb region of chicken (Gallus gallus) chromosome 1 (GGA1), the 58.5-60.5 Mb region of GGA1, and the 10.5-12.0 Mb region of GGA20, were strongly associated (P HVP in chickens. Variants in these regions explained >50% of additive genetic variance for HVP. This study also confirmed that expression of BMP7, which codes for a bone morphogenetic protein and is located in one of the candidate regions, was significantly higher in the visceral peritoneum of Huiyang Beard chickens with HVP than in that of chickens without pigmentation (P HVP is a quantitative trait with moderate heritability. Genomic variants resulting in HVP were identified on GGA1 and GGA20, and expression of the BMP7 gene appears to be upregulated in HVP-affected chickens. Findings from this study should be used as a basis for further functional validation of candidate genes involved in HVP.
Allison L Weber
Full Text Available Aerobic organisms are susceptible to damage by reactive oxygen species. Oxidative stress resistance is a quantitative trait with population variation attributable to the interplay between genetic and environmental factors. Drosophila melanogaster provides an ideal system to study the genetics of variation for resistance to oxidative stress.We used 167 wild-derived inbred lines of the Drosophila Genetic Reference Panel for a genome-wide association study of acute oxidative stress resistance to two oxidizing agents, paraquat and menadione sodium bisulfite. We found significant genetic variation for both stressors. Single nucleotide polymorphisms (SNPs associated with variation in oxidative stress resistance were often sex-specific and agent-dependent, with a small subset common for both sexes or treatments. Associated SNPs had moderately large effects, with an inverse relationship between effect size and allele frequency. Linear models with up to 12 SNPs explained 67-79% and 56-66% of the phenotypic variance for resistance to paraquat and menadione sodium bisulfite, respectively. Many genes implicated were novel with no known role in oxidative stress resistance. Bioinformatics analyses revealed a cellular network comprising DNA metabolism and neuronal development, consistent with targets of oxidative stress-inducing agents. We confirmed associations of seven candidate genes associated with natural variation in oxidative stress resistance through mutational analysis.We identified novel candidate genes associated with variation in resistance to oxidative stress that have context-dependent effects. These results form the basis for future translational studies to identify oxidative stress susceptibility/resistance genes that are evolutionary conserved and might play a role in human disease.
Litonjua Augusto A
Full Text Available Abstract Background Personalized health-care promises tailored health-care solutions to individual patients based on their genetic background and/or environmental exposure history. To date, disease prediction has been based on a few environmental factors and/or single nucleotide polymorphisms (SNPs, while complex diseases are usually affected by many genetic and environmental factors with each factor contributing a small portion to the outcome. We hypothesized that the use of random forests classifiers to select SNPs would result in an improved predictive model of asthma exacerbations. We tested this hypothesis in a population of childhood asthmatics. Methods In this study, using emergency room visits or hospitalizations as the definition of a severe asthma exacerbation, we first identified a list of top Genome Wide Association Study (GWAS SNPs ranked by Random Forests (RF importance score for the CAMP (Childhood Asthma Management Program population of 127 exacerbation cases and 290 non-exacerbation controls. We predict severe asthma exacerbations using the top 10 to 320 SNPs together with age, sex, pre-bronchodilator FEV1 percentage predicted, and treatment group. Results Testing in an independent set of the CAMP population shows that severe asthma exacerbations can be predicted with an Area Under the Curve (AUC = 0.66 with 160-320 SNPs in comparison to an AUC score of 0.57 with 10 SNPs. Using the clinical traits alone yielded AUC score of 0.54, suggesting the phenotype is affected by genetic as well as environmental factors. Conclusions Our study shows that a random forests algorithm can effectively extract and use the information contained in a small number of samples. Random forests, and other machine learning tools, can be used with GWAS studies to integrate large numbers of predictors simultaneously.
Ye Seul Bae
Full Text Available Osteoporosis is a medical condition of global concern, with increasing incidence in both sexes. Bone mineral density (BMD, a highly heritable trait, has been proven a useful diagnostic factor in predicting fracture. Because medical information is lacking about male osteoporotic genetics, we conducted a genome-wide association study of BMD in Korean men. With 1,176 participants, we analyzed 4,414,664 single nucleotide polymorphisms (SNPs after genomic imputation, and identified five SNPs and three loci correlated with bone density and strength. Multivariate linear regression models were applied to adjust for age and body mass index interference. Rs17124500 (p = 6.42 × 10-7, rs34594869 (p = 6.53 × 10-7 and rs17124504 (p = 6.53 × 10-7 in 14q31.3 and rs140155614 (p = 8.64 × 10-7 in 15q25.1 were significantly associated with lumbar spine BMD (LS-BMD, while rs111822233 (p = 6.35 × 10-7 was linked with the femur total BMD (FT-BMD. Additionally, we analyzed the relationship between BMD and five genes previously identified in Korean men. Rs61382873 (p = 0.0009 in LRP5, rs9567003 (p = 0.0033 in TNFSF11 and rs9935828 (p = 0.0248 in FOXL1 were observed for LS-BMD. Furthermore, rs33997547 (p = 0.0057 in ZBTB and rs1664496 (p = 0.0012 in MEF2C were found to influence FT-BMD and rs61769193 (p = 0.0114 in ZBTB to influence femur neck BMD. We identified five SNPs and three genomic regions, associated with BMD. The significance of our results lies in the discovery of new loci, while also affirming a previously significant locus, as potential osteoporotic factors in the Korean male population.
Frankel, Adam; Armour, Nicola; Nancarrow, Derek; Krause, Lutz; Hayward, Nicholas; Lampe, Guy; Smithers, B Mark; Barbour, Andrew
The incidence of esophageal adenocarcinoma (EAC) has been increasing rapidly for the past 3 decades in Western (Caucasian) populations. Curative treatment is based around esophagectomy, which has a major impact on quality of life. For those suitable for treatment with curative intent, 5-year survival is ∼30%. More accurate prognostic tools are therefore needed, and copy number aberrations (CNAs) may offer the ability to act as prospective biomarkers in this regard. We performed a genome-wide examination of CNAs in 54 samples of EAC using single-nucleotide polymorphism (SNP) arrays. Our aims were to describe frequent regions of CNA, to define driver CNAs, and to identify CNAs that correlated with survival. Regions of frequent amplification included oncogenes such as EGFR, MYC, KLF12, and ERBB2, while frequently deleted regions included tumor suppressor genes such as CDKN2A/B, PTPRD, FHIT, and SMAD4. The genomic identification of significant targets in cancer (GISTIC) algorithm identified 24 regions of gain and 28 regions of loss that were likely to contain driver changes. We discovered 61 genes in five regions that, when stratified by CNA type (gain or loss), correlated with a statistically significant difference in survival. Pathway analysis of the genes residing in both the GISTIC and prognostic regions showed they were significantly enriched for cancer-related networks. Finally, we discovered that copy-neutral loss of heterozygosity is a frequent mechanism of CNA in genes currently targetable by chemotherapy, potentially leading to under-reporting of cases suitable for such treatment. Copyright © 2014 Wiley Periodicals, Inc.
Full Text Available Sorghum [ (L Moench], an important grain and forage crop, is receiving significant attention as a lignocellulosic feedstock because of its water-use efficiency and high biomass yield potential. Because of the advancement of genotyping and sequencing technologies, genome-wide association study (GWAS has become a routinely used method to investigate the genetic mechanisms underlying natural phenotypic variation. In this study, we performed a GWAS for nine grain and biomass-related plant architecture traits to determine their overall genetic architecture and the specific association of allelic variants in gibberellin (GA biosynthesis and signaling genes with these phenotypes. A total of 101 single-nucleotide polymorphism (SNP representative regions were associated with at least one of the nine traits, and two of the significant markers correspond to GA candidate genes, ( and (, affecting plant height and seed number, respectively. The resolution of a previously reported quantitative trait loci (QTL for leaf angle on chromosome 7 was increased to a 1.67 Mb region containing seven candidate genes with good prospects for further investigation. This study provides new knowledge of the association of GA genes with plant architecture traits and the genomic regions controlling variation in leaf angle, stem circumference, internode number, tiller number, seed number, panicle exsertion, and panicle length. The GA gene affecting seed number variation ( and the genomic region on chromosome 7 associated with variation in leaf angle are also important outcomes of this study and represent the foundation of future validation studies needed to apply this knowledge in breeding programs.
Siedlinski, Mateusz; Cho, Michael H.; Bakke, Per; Gulsvik, Amund; Lomas, David A.; Anderson, Wayne; Kong, Xiangyang; Rennard, Stephen I.; Beaty, Terri H.; Hokanson, John E.; Crapo, James D.; Silverman, Edwin K.
Background Cigarette smoking is a major risk factor for COPD and COPD severity. Previous genome-wide association studies (GWAS) have identified numerous single nucleotide polymorphisms (SNPs) associated with the number of cigarettes smoked per day (CPD) and a Dopamine Beta-Hydroxylase (DBH) locus associated with smoking cessation in multiple populations. Objective To identify SNPs associated with lifetime average and current CPD, age at smoking initiation, and smoking cessation in COPD subjects. Methods GWAS were conducted in 4 independent cohorts encompassing 3,441 ever-smoking COPD subjects (GOLD stage II or higher). Untyped SNPs were imputed using HapMap (phase II) panel. Results from all cohorts were meta-analyzed. Results Several SNPs near the HLA region on chromosome 6p21 and in an intergenic region on chromosome 2q21 showed associations with age at smoking initiation, both with the lowest p=2×10−7. No SNPs were associated with lifetime average CPD, current CPD or smoking cessation with p<10−6. Nominally significant associations with candidate SNPs within alpha-nicotinic acetylcholine receptors 3/5 (CHRNA3/CHRNA5; e.g. p=0.00011 for SNP rs1051730) and Cytochrome P450 2A6 (CYP2A6; e.g. p=2.78×10−5 for a nonsynonymous SNP rs1801272) regions were observed for lifetime average CPD, however only CYP2A6 showed evidence of significant association with current CPD. A candidate SNP (rs3025343) in the DBH was significantly (p=0.015) associated with smoking cessation. Conclusion We identified two candidate regions associated with age at smoking initiation in COPD subjects. Associations of CHRNA3/CHRNA5 and CYP2A6 loci with CPD and DBH with smoking cessation are also likely of importance in the smoking behaviors of COPD patients. PMID:21685187
Full Text Available Most common complex traits, such as obesity, hypertension, diabetes, and cancers, are known to be associated with multiple genes, environmental factors, and their epistasis. Recently, the development of advanced genotyping technologies has allowed us to perform genome-wide association studies (GWASs. For detecting the effects of multiple genes on complex traits, many approaches have been proposed for GWASs. Multifactor dimensionality reduction (MDR is one of the powerful and efficient methods for detecting high-order gene-gene (GxG interactions. However, the biological interpretation of GxG interactions identified by MDR analysis is not easy. In order to aid the interpretation of MDR results, we propose a network graph analysis to elucidate the meaning of identified GxG interactions. The proposed network graph analysis consists of three steps. The first step is for performing GxG interaction analysis using MDR analysis. The second step is to draw the network graph using the MDR result. The third step is to provide biological evidence of the identified GxG interaction using external biological databases. The proposed method was applied to Korean Association Resource (KARE data, containing 8838 individuals with 327,632 single-nucleotide polymorphisms, in order to perform GxG interaction analysis of body mass index (BMI. Our network graph analysis successfully showed that many identified GxG interactions have known biological evidence related to BMI. We expect that our network graph analysis will be helpful to interpret the biological meaning of GxG interactions.
Jason P Wendler
Full Text Available Drug resistance remains a chief concern for malaria control. In order to determine the genetic markers of drug resistant parasites, we tested the genome-wide associations (GWA of sequence-based genotypes from 35 Kenyan P. falciparum parasites with the activities of 22 antimalarial drugs.Parasites isolated from children with acute febrile malaria were adapted to culture, and sensitivity was determined by in vitro growth in the presence of anti-malarial drugs. Parasites were genotyped using whole genome sequencing techniques. Associations between 6250 single nucleotide polymorphisms (SNPs and resistance to individual anti-malarial agents were determined, with false discovery rate adjustment for multiple hypothesis testing. We identified expected associations in the pfcrt region with chloroquine (CQ activity, and other novel loci associated with amodiaquine, quinazoline, and quinine activities. Signals for CQ and primaquine (PQ overlap in and around pfcrt, and interestingly the phenotypes are inversely related for these two drugs. We catalog the variation in dhfr, dhps, mdr1, nhe, and crt, including novel SNPs, and confirm the presence of a dhfr-164L quadruple mutant in coastal Kenya. Mutations implicated in sulfadoxine-pyrimethamine resistance are at or near fixation in this sample set.Sequence-based GWA studies are powerful tools for phenotypic association tests. Using this approach on falciparum parasites from coastal Kenya we identified known and previously unreported genes associated with phenotypic resistance to anti-malarial drugs, and observe in high-resolution haplotype visualizations a possible signature of an inverse selective relationship between CQ and PQ.
Full Text Available Abstract Perifosine belongs to the class of alkylphospholipid analogues, which act primarily at the cell membrane, thereby targeting signal transduction pathways. In phase I/II clinical trials, perifosine has induced tumour regression and caused disease stabilisation in a variety of tumour types. The genetic determinants responsible for its cytotoxicity have not been comprehensively studied, however. We performed a genome-wide analysis to identify genes whose expression levels or genotypic variation were correlated with the cytotoxicity of perifosine, using public databases on the US National Cancer Institute (NCI-60 human cancer cell lines. For demonstrating drug specificity, the NCI Standard Agent Database (including 171 drugs acting through a variety of mechanisms was used as a control. We identified agents with similar cytotoxicity profiles to that of perifosine in compounds used in the NCI drug screen. Furthermore, Gene Ontology and pathway analyses were carried out on genes more likely to be perifosine specific. The results suggested that genes correlated with perifosine cytotoxicity are connected by certain known pathways that lead to the mitogen-activated protein kinase signalling pathway and apoptosis. Biological processes such as 'response to stress', 'inflammatory response' and 'ubiquitin cycle' were enriched among these genes. Three single nucleotide polymorphisms (SNPs located in CACNA2DI and EXOC4 were found to be correlated with perifosine cytotoxicity. Our results provided a manageable list of genes whose expression levels or genotypic variation were strongly correlated with the cytotoxcity of perifosine. These genes could be targets for further studies using candidate-gene approaches. The results also provided insights into the pharmacodynamics of perifosine.
Full Text Available Male genital morphology of animals with internal fertilization and promiscuous mating systems have been one of the most diverse and rapidly evolving morphological traits. The male genital morphology in general is known to have low phenotypic and genetic variations, but the genetic basis of the male genital variation remains unclear. Drosophila melanogaster and its closely related species are morphologically very similar, but the shapes of the posterior lobe, a cuticular projection on the male genital arch are distinct from each other, representing a model system for studying the genetic basis of male genital morphology. In this study, we used highly inbred whole genome sequenced strains of D. melanogaster to perform genome wide association analysis on posterior lobe morphology. We quantified the outline shape of posterior lobes with Fourier coefficients obtained from elliptic Fourier analysis and performed principal component analysis, and posterior lobe size. The first and second principal components (PC1 and PC2 explained approximately 88% of the total variation of the posterior lobe shape. We then examined the association between the principal component scores and posterior lobe size and 1902142 single nucleotide polymorphisms (SNPs. As a result, we obtained 15, 14 and 15 SNPs for PC1, PC2 and posterior lobe size with P-values smaller than 10(-5. Based on the location of the SNPs, 13, 13 and six protein coding genes were identified as potential candidates for PC1, PC2 and posterior lobe size, respectively. In addition to the previous findings showing that the intraspecific posterior shape variation are regulated by multiple QTL with strong effects, the present study suggests that the intraspecific variation may be under polygenic regulation with a number of loci with small effects. Further studies are required for investigating whether these candidate genes are responsible for the intraspecific posterior lobe shape variation.
Sallam, Ahmad H; Tyagi, Priyanka; Brown-Guedira, Gina; Muehlbauer, Gary J; Hulse, Alex; Steffenson, Brian J
Stem rust was one of the most devastating diseases of barley in North America. Through the deployment of cultivars with the resistance gene Rpg1 , losses to stem rust have been minimal over the past 70 yr. However, there exist both domestic (QCCJB) and foreign (TTKSK aka isolate Ug99) pathotypes with virulence for this important gene. To identify new sources of stem rust resistance for barley, we evaluated the Wild Barley Diversity Collection (WBDC) (314 ecogeographically diverse accessions of Hordeum vulgare subsp. spontaneum ) for seedling resistance to four pathotypes (TTKSK, QCCJB, MCCFC, and HKHJC) of the wheat stem rust pathogen ( Puccinia graminis f. sp. tritici , Pgt ) and one isolate (92-MN-90) of the rye stem rust pathogen ( P. graminis f. sp. secalis , Pgs ). Based on a coefficient of infection, the frequency of resistance in the WBDC was low ranging from 0.6% with HKHJC to 19.4% with 92-MN-90. None of the accessions was resistant to all five cultures of P. graminis A genome-wide association study (GWAS) was conducted to map stem rust resistance loci using 50,842 single-nucleotide polymorphic markers generated by genotype-by-sequencing and ordered using the new barley reference genome assembly. After proper accounting for genetic relatedness and structure among accessions, 45 quantitative trait loci were identified for resistance to P. graminis across all seven barley chromosomes. Three novel loci associated with resistance to TTKSK, QCCJB, MCCFC, and 92-MN-90 were identified on chromosomes 5H and 7H, and two novel loci associated with resistance to HKHJC were identified on chromosomes 1H and 3H. These novel alleles will enhance the diversity of resistance available for cultivated barley. Copyright © 2017 Sallam et al.
Aoun, Meriem; Breiland, Matthew; Kathryn Turner, M; Loladze, Alexander; Chao, Shiaoman; Xu, Steven S; Ammar, Karim; Anderson, James A; Kolmer, James A; Acevedo, Maricelis
Leaf rust (caused by Erikss. ) is increasingly impacting durum wheat ( L. var. ) production with the recent appearance of races with virulence to widely grown cultivars in many durum producing areas worldwide. A highly virulent race on durum wheat was recently detected in Kansas. This race may spread to the northern Great Plains, where most of the US durum wheat is produced. The objective of this study was to identify sources of resistance to several races from the United States and Mexico at seedling stage in the greenhouse and at adult stage in field experiments. Genome-wide association study (GWAS) was used to identify single-nucleotide polymorphism (SNP) markers associated with leaf rust response in a worldwide durum wheat collection of 496 accessions. Thirteen accessions were resistant across all experiments. Association mapping revealed 88 significant SNPs associated with leaf rust response. Of these, 33 SNPs were located on chromosomes 2A and 2B, and 55 SNPs were distributed across all other chromosomes except for 1B and 7B. Twenty markers were associated with leaf rust response at seedling stage, while 68 markers were associated with leaf rust response at adult plant stage. The current study identified a total of 14 previously uncharacterized loci associated with leaf rust response in durum wheat. The discovery of these loci through association mapping (AM) is a significant step in identifying useful sources of resistance that can be used to broaden the relatively narrow leaf rust resistance spectrum in durum wheat germplasm. Copyright © 2016 Crop Science Society of America.
Kim, Lyoung Hyo; Park, Byung Lae; Cheong, Hyun Sub; Namgoong, Suhg; Kim, Ji On; Kim, Jeong-Hyun; Shin, Joong-Gon; Park, Chul Soo; Kim, Bong-Jo; Kim, Jae Won; Choi, Ihn-Geun; Hwang, Jaeuk; Shin, Hyoung Doo; Woo, Sung-Il
Schizophrenia is regarded as a multifactorial and polygenic brain disorder that is attributed to different combinations of genetic and environmental risk factors. Recently, several genome-wide association studies (GWASs) of schizophrenia have identified numerous risk factors, but the replication results remain controversial and ambiguous. To identify schizophrenia susceptibility loci in the Korean population, we performed a GWAS using the Illumina HumanOmni1-Quad V1.0 Microarray. We genotyped 1,140,419 single nucleotide polymorphisms (SNPs) in 350 Korea schizophrenia patients and 700 control subjects, and approximately 620,001 autosomal SNPs were passed our quality control. In the case-control analysis, the rs9607195 A>G on intergenic area 250 kb away from the ISX gene and the rs12738007 A>G on the intron of the MECR gene were the most strongly associated SNPs with the risk of schizophrenia (P = 6.2 × 10(-8) , OR = 0.50 and P = 3.7 × 10(-7) , OR = 2.39, respectively). In subsequent fine-mapping analysis, 6 SNPs of MECR were genotyped with 310 schizophrenia patients and 604 control subjects. The association of the MECR rs12738007, a top ranked-SNP in GWAS, was replicated (P = 1.5 × 10(-2) , OR = 1.53 in fine mapping analysis, P = 1.5 × 10(-6) , OR = 1.90 in combined analysis). The identification of putative schizophrenia susceptibility loci could provide new insights into genetic factors related with schizophrenia and clues for the development of diagnosis strategies. © 2015 Wiley Periodicals, Inc.
Bong Hwan Choi
Full Text Available There are various hypotheses on dog domestication based on archeological and genetic studies. Although many studies have been conducted on the origin of dogs, the existing literature about the ancestry, diversity, and population structure of Korean dogs is sparse. Therefore, this study is focused on the origin, diversity and population structure of Korean dogs. The study sample comprised four major categories, including non-dogs (coyotes and wolves, ancient, modern and Korean dogs. Selected samples were genotyped using an Illumina CanineHD array containing 173,662 single nucleotide polymorphisms. The genome-wide data were filtered using quality control parameters in PLINK 1.9. Only autosomal chromosomes were used for further analysis. The negative off-diagonal variance of the genetic relationship matrix analysis depicted, the variability of samples in each population. FIS (inbreeding rate within a population values indicated, a low level of inbreeding within populations, and the patterns were in concordance with the results of Nei's genetic distance analysis. The lowest FST (inbreeding rate between populations values among Korean and Chinese breeds, using a phylogenetic tree, multi-dimensional scaling, and a TreeMix likelihood tree showed Korean breeds are highly related to Chinese breeds. The Korean breeds possessed a unique and large diversity of admixtures compared with other breeds. The highest and lowest effective population sizes were observed in Korean Jindo Black (485 and Korean Donggyeong White (109, respectively. The historical effective population size of all Korean dogs showed declining trend from the past to present. It is important to take immediate action to protect the Korean dog population while conserving their diversity. Furthermore, this study suggests that Korean dogs have unique diversity and are one of the basal lineages of East Asian dogs, originating from China.
Full Text Available The tsetse fly Glossina fuscipes fuscipes (Gff is the insect vector of the two forms of Human African Trypanosomiasis (HAT that exist in Uganda. Understanding Gff population dynamics, and the underlying genetics of epidemiologically relevant phenotypes is key to reducing disease transmission. Using ddRAD sequence technology, complemented with whole-genome sequencing, we developed a panel of ∼73,000 single-nucleotide polymorphisms (SNPs distributed across the Gff genome that can be used for population genomics and to perform genome-wide-association studies. We used these markers to estimate genomic patterns of linkage disequilibrium (LD in Gff, and used the information, in combination with outlier-locus detection tests, to identify candidate regions of the genome under selection. LD in individual populations decays to half of its maximum value (r2max/2 between 1359 and 2429 bp. The overall LD estimated for the species reaches r2max/2 at 708 bp, an order of magnitude slower than in Drosophila. Using 53 infected (Trypanosoma spp. and uninfected flies from four genetically distinct Ugandan populations adapted to different environmental conditions, we were able to identify SNPs associated with the infection status of the fly and local environmental adaptation. The extent of LD in Gff likely facilitated the detection of loci under selection, despite the small sample size. Furthermore, it is probable that LD in the regions identified is much higher than the average genomic LD due to strong selection. Our results show that even modest sample sizes can reveal significant genetic associations in this species, which has implications for future studies given the difficulties of collecting field specimens with contrasting phenotypes for association analysis.
Shi, Yingyao; Gao, Lingling; Wu, Zhichao; Zhang, Xiaojing; Wang, Mingming; Zhang, Congshun; Zhang, Fan; Zhou, Yongli; Li, Zhikang
Improving the salt tolerance of direct-seeding rice at the seed germination stage is a major breeding goal in many Asian rice-growing countries, where seedlings must often establish in soils with a high salt content. Thus, it is important to understand the genetic mechanisms of salt tolerance in rice and to screen for germplasm with salt tolerance at the seed germination stage. Here, we investigated seven seed germination-related traits under control and salt-stress conditions and conducted a genome-wide association study based on the re-sequencing of 478 diverse rice accessions. The analysis used a mixed linear model and was based on 6,361,920 single nucleotide polymorphisms in 478 rice accessions grouped into whole, indica, and non-indica panels. Eleven loci containing 22 significant salt tolerance-associated single nucleotide polymorphisms were identified based on the stress-susceptibility indices (SSIs) of vigor index (VI) and mean germination time (MGT). From the SSI of VI, six major loci were identified, explaining 20.2% of the phenotypic variation. From the SSI of MGT, five major loci were detected, explaining 26.4% of the phenotypic variation. Of these, seven loci on chromosomes 1, 5, 6, 11, and 12 were close to six previously identified quantitative gene loci/genes related to tolerance to salinity or other abiotic stresses. The strongest association region for the SSI of MGT was identified in a ~ 13.3 kb interval (15450039-15,463,330) on chromosome 1, near salt-tolerance quantitative trait loci controlling the Na + : K + ratio, total Na + uptake, and total K + concentration. The strongest association region for the SSI of VI was detected in a ~ 164.2 kb interval (526662-690,854) on chromosome 2 harboring two nitrate transporter family genes (OsNRT2.1 and OsNRT2.2), which affect gene expression under salt stress. The haplotype analysis indicated that OsNRT2.2 was associated with subpopulation differentiation and its minor/rare tolerant haplotype was
Full Text Available Although great progress in genome-wide association studies (GWAS has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked. The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001
Seung Hwan Lee
Full Text Available This genome-wide association study (GWAS was conducted to identify major loci that are significantly associated with carcass weight, and their effects, in order to provide increased understanding of the genetic architecture of carcass weight in Hanwoo. This genome-wide association study identified one major chromosome region ranging from 23 Mb to 25 Mb on chromosome 14 as being associated with carcass weight in Hanwoo. Significant Bonferroni-corrected genome-wide associations (P<1.52×10(-6 were detected for 6 Single Nucleotide Polymorphic (SNP loci for carcass weight on chromosome 14. The most significant SNP was BTB-01280026 (P = 4.02×10(-11, located in the 25 Mb region on Bos taurus autosome 14 (BTA14. The other 5 significant SNPs were Hapmap27934-BTC-065223 (P = 4.04×10(-11 in 25.2 Mb, BTB-01143580 (P = 6.35×10(-11 in 24.3 Mb, Hapmap30932-BTC-011225 (P = 5.92×10(-10 in 24.8 Mb, Hapmap27112-BTC-063342 (P = 5.18×10(-9 in 25.4 Mb, and Hapmap24414-BTC-073009 (P = 7.38×10(-8 in 25.4 Mb, all on BTA 14. One SNP (BTB-01143580; P = 6.35×10(-11 lies independently from the other 5 SNPs. The 5 SNPs that lie together showed a large Linkage disequilibrium (LD block (block size of 553 kb with LD coefficients ranging from 0.53 to 0.89 within the block. The most significant SNPs accounted for 6.73% to 10.55% of additive genetic variance, which is quite a large proportion of the total additive genetic variance. The most significant SNP (BTB-01280026; P = 4.02×10(-11 had 16.96 kg of allele substitution effect, and the second most significant SNP (Hapmap27934-BTC-065223; P = 4.04×10(-11 had 18.06 kg of effect on carcass weight, which correspond to 44% and 47%, respectively, of the phenotypic standard deviation for carcass weight in Hanwoo cattle. Our results demonstrated that carcass weight was affected by a major Quantitative Trait Locus (QTL with a large effect and by many SNPs with small effects that are normally
Jee, Sun Ha; Sull, Jae Woong; Lee, Jong-Eun; Shin, Chol; Park, Jongkeun; Kimm, Heejin; Cho, Eun-Young; Shin, Eun-Soon; Yun, Ji Eun; Park, Ji Wan; Kim, Sang Yeun; Lee, Sun Ju; Jee, Eun Jung; Baik, Inkyung; Kao, Linda
Adiponectin is associated with obesity and insulin resistance. To date, there has been no genome-wide association study (GWAS) of adiponectin levels in Asians. Here we present a GWAS of a cohort of Korean volunteers. A total of 4,001 subjects were genotyped by using a genome-wide marker panel in a two-stage design (979 subjects initially and 3,022 in a second stage). Another 2,304 subjects were used for follow-up replication studies with selected markers. In the discovery phase, the top SNP a...
Full Text Available Once considered a single species, the whitefly, Bemisia tabaci, is a complex of numerous morphologically indistinguishable species. Within the last three decades, two of its members (MED and MEAM1 have become some of the world's most damaging agricultural pests invading countries across Europe, Africa, Asia and the Americas and affecting a vast range of agriculturally important food and fiber crops through both feeding-related damage and the transmission of numerous plant viruses. For some time now, researchers have relied on a single mitochondrial gene and/or a handful of nuclear markers to study this species complex. Here, we move beyond this by using 38,041 genome-wide Single Nucleotide Polymorphisms, and show that the two invasive members of the complex are closely related species with signatures of introgression with a third species (IO. Gene flow patterns were traced between contemporary invasive populations within MED and MEAM1 species and these were best explained by recent international trade. These findings have profound implications for delineating the B. tabaci species status and will impact quarantine measures and future management strategies of this global pest.
Background: Sirtuin-1 (SIRT-1), a protein has been found to protect the cells against oxidative stress due to its deacetylase activity. In this investigation, we aimed to study SIRT-1 gene rs2273773 C >T single nucleotide polymorphism and markers of serum protein oxidation (protein carbonyl and sulfhydryl groups) in ...
Yin, Jiaoyang; Vogel, Ulla; Gerdes, Lars Ulrik
The genetic susceptibility to basal cell carcinoma (BCC) among Danish psoriatic patients was investigated in association studies with 12 single nucleotide polymorphisms on chromosome 19q13.2-3. The results show a significant association between BCC and the A-allele of a polymorphism in ERCCI exon4...
Wiewel-Verschueren, Sophie; Mulder, Andre B.; Meijer, Karina; Mulder, Rene
In a previous study it was shown that lower factor XI (FXI) levels in women with heavy menstrual bleeding (HMB). Our aim was to determine the single-nucleotide variants (SNVs) in the F11 gene in women with HMB. In addition, an extensive literature search was performed to determine the clinical
Pareek, Chandra Shekhar; Błaszczyk, Paweł; Dziuba, Piotr
Background RNA-seq is a useful next-generation sequencing (NGS) technology that has been widely used to understand mammalian transcriptome architecture and function. In this study, a breed-specific RNA-seq experiment was utilized to detect putative single nucleotide polymorphisms (SNPs) in liver...
Jafari, Naghmeh; Broer, Linda; Hoppenbrouwers, Ilse A; van Duijn, Cornelia M; Hintzen, Rogier Q
Multiple sclerosis is a presumed autoimmune disease associated with genetic and environmental risk factors such as infectious mononucleosis. Recent research has shown infectious mononucleosis to be associated with a specific HLA class I polymorphism. Our aim was to test if the infectious mononucleosis-linked HLA class I single nucleotide polymorphism (rs6457110) is also associated with multiple sclerosis. Genotyping of the HLA-A single nucleotide polymorphism rs6457110 using TaqMan was performed in 591 multiple sclerosis cases and 600 controls. The association of multiple sclerosis with the HLA-A single nucleotide polymorphism was tested using logistic regression adjusted for age, sex and HLA-DRB1*1501. HLA-A minor allele (A) is associated with multiple sclerosis (OR = 0.68; p = 4.08 × 10( -5)). After stratification for HLA-DRB1*1501 risk allele (T) carrier we showed a significant OR of 0.70 (p = 0.003) for HLA-A. HLA class I single nucleotide polymorphism rs6457110 is associated with infectious mononucleosis and multiple sclerosis, independent of the major class II allele, supporting the hypothesis that shared genetics may contribute to the association between infectious mononucleosis and multiple sclerosis.
Canovas, Fernando; Mota, Catarina; Ferreira-Costa, Joana; Serrao, Ester; Coyer, Jim; Olsen, Jeanine; Pearson, Gareth
We characterized 35 single nucleotide polymorphism (SNP) markers for the brown alga Fucus vesiculosus. Based on existing Fucus Expressed Sequence Tag libraries for heat and desiccation-stressed tissue, SNPs were developed and confirmed by re-sequencing cDNA from a diverse panel of individuals. SNP
Udatha, D B R K Gupta; Rasmussen, Simon; Sicheritz-Pontén, Thomas
The non-synonymous SNPs, the so-called non-silent SNPs, which are single-nucleotide variations in the coding regions that give "birth" to amino acid mutations, are often involved in the modulation of protein function. Understanding the effect of individual amino acid mutations on a protein...
Xiao, Zhuo; Lie, Puchang; Fang, Zhiyuan; Yu, Luxin; Chen, Junhua; Liu, Jie; Ge, Chenchen; Zhou, Xuemeng; Zeng, Lingwen
A lateral flow biosensor for detection of single nucleotide polymorphism based on circular strand displacement reaction (CSDPR) has been developed. Taking advantage of high fidelity of T4 DNA ligase, signal amplification by CSDPR, and the optical properties of gold nanoparticles, this assay has reached a detection limit of 0.01 fM.
Catsburg, Arnold; van der Zwet, Wil C.; Morre, Servaas A.; Ouburg, Sander; Vandenbroucke-Grauls, Christina M. J. E.; Savelkoul, Paul H. M.
Reliable analysis of single nucleotide polymorphisms (SNPs) in DNA derived from samples containing low numbers of cells or from suboptimal sources can be difficult. A new procedure to characterize multiple SNPs in traces of DNA from plasma and old dried blood samples was developed. Six SNPs in the
Sjostedt, N.; Heuvel, J.J.M.W. van den; Koenderink, J.B.; Kidron, H.
PURPOSE: To study the function and expression of nine naturally occurring single-nucleotide polymorphisms (G406R, F431L, S441N, P480L, F489L, M515R, L525R, A528T and T542A) that are predicted to reside in the transmembrane regions of the ABC transporter ABCG2. METHODS: The transport activity of the
Mar 2, 2017 ... Abstract. Polycystic ovary syndrome (PCOS) is the most common and a complex female endocrine disorder, and is one of the leading cause of female infertility. Here, we aimed to investigate the association of single-nucleotide polymorphism of INS, INSR,. IRS1, IRS2, PPAR-G and CAPN10 gene in the ...
Ripke, S.; Sanders, A. R.; Kendler, K. S.; Levinson, D. F.; Sklar, P.; Holmans, P. A.; Lin, D. Y.; Duan, J.; Ophoff, R. A.; Andreassen, O. A.; Scolnick, E.; Cichon, S.; St Clair, D.; Corvin, A.; Gurling, H.; Werge, T.; Rujescu, D.; Blackwood, D. H.; Pato, C. N.; Malhotra, A. K.; Purcell, S.; Dudbridge, F.; Neale, B. M.; Rossin, L.; Visscher, P. M.; Posthuma, D.; Ruderfer, D. M.; Fanous, A.; Stefansson, H.; Steinberg, S.; Mowry, B. J.; Golimbet, V.; de Hert, M.; Jonsson, E. G.; Bitter, I.; Pietilainen, O. P.; Collier, D. A.; Tosato, S.; Agartz, I.; Albus, M.; Alexander, M.; Amdur, R. L.; Amin, F.; Bass, N.; Bergen, S. E.; Black, D. W.; Borglum, A. D.; Brown, M. A.; Bruggeman, R.; Buccola, N. G.; Byerley, W. F.; Cahn, W.; Cantor, R. M.; Carr, V. J.; Catts, S. V.; Choudhury, K.; Cloninger, C. R.; Cormican, P.; Craddock, N.; Danoy, P. A.; Datta, S.; de Haan, L.; Demontis, D.; Dikeos, D.; Djurovic, S.; Donnely, P.; Donohoe, G.; Duong, L.; Dwyer, S.; Fink-Jensen, A.; Freedman, R.; Freimer, N. B.; Friedl, M.; Georgieva, L.; Giegling, I.; Gill, M.; Glenthoj, B.; Godard, S.; Hamshere, M.; Hansen, M.; Hartmann, A. M.; Henskens, F. A.; Hougaard, D. M.; Hultman, C. M.; Ingason, A.; Jablensky, A. V.; Jakobsen, K. D.; Jay, M.; Jurgens, G.; Kahn, R. S.; Keller, M. C.; Kenis, G.; Kenny, E.; Kim, Y.; Kirov, G. K.; Konnerth, H.; Konte, B.; Krabbendam, L.; Krasucki, R.; Lasseter, V. K.; Laurent, C.; Lawrence, J.; Lencz, T.; Lerer, F. B.; Liang, K. Y.; Lichtenstein, P.; Lieberman, J. A.; Linszen, D. H.; Lonnqvist, J.; Loughland, C. M.; Maclean, A. W.; Maher, B. S.; Maier, W.; Mallet, J.; Malloy, P.; Mattheisen, M.; Mattingsdal, M.; McGhee, K. A.; McGrath, J. J.; McIntosh, A.; McLean, D. E.; McQuillin, A.; Melle, I.; Michie, P. T.; Milanova, V.; Morris, D. W.; Mors, O.; Mortensen, P. B.; Moskvina, V.; Muglia, P.; Myin-Germeys, I.; Nertney, D. A.; Nestadt, G.; Nielsen, J.; Nikolov, I.; Nordentoft, M.; Norton, N.; Nothen, M. M.; O'Dushlaine, C. T.; Olincy, A.; Olsen, L.; O'Neill, F. A.; Orntoft, T. F.; Owen, M. J.; Pantelis, C.; Papadimitriou, G.; Pato, M. T.; Peltonen, L.; Petursson, H.; Pickard, B.; Pimm, J.; Pulver, A. E.; Puri, V.; Quested, D.; Quinn, E. M.; Rasmussen, H. B.; Rethelyi, J. M.; Ribble, R.; Rietschel, M.; Riley, B. P.; Ruggeri, M.; Schall, U.; Schulze, T. G.; Schwab, S. G.; Scott, R. J.; Shi, J.; Sigurdsson, E.; Silvermann, J. M.; Spencer, C. C.; Stefansson, K.; Strange, A.; Strengman, E.; Stroup, T. S.; Suvisaari, J.; Terenius, L.; Thirumalai, S.; Thygesen, J. H.; Timm, S.; Toncheva, D.; van den Oord, E.; van Os, J.; van Winkel, R.; Veldink, J.; Walsh, D.; Wang, A. G.; Wiersma, D.; Wildenauer, D. B.; Williams, H. J.; Williams, N. M.; Wormley, B.; Zammit, S.; Sullivan, P. F.; O'Donovan, M. C.; Daly, M. J.; Gejman, P. V.
We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded
Beekman, Marian; Blanché, Hélène; Perola, Markus
Clear evidence exists for heritability of human longevity, and much interest is focused on identifying genes associated with longer lives. To identify such longevity alleles, we performed the largest genome-wide linkage scan thus far reported. Linkage analyses included 2118 nonagenarian Caucasian...
Scharf, J. M.; Yu, D.; Mathews, C. A.; Neale, B. M.; Stewart, S. E.; Fagerness, J. A.; Evans, P.; Gamazon, E.; Edlund, C. K.; Service, S. K.; Tikhomirov, A.; Osiecki, L.; Illmann, C.; Pluzhnikov, A.; Konkashbaev, A.; Davis, L. K.; Han, B.; Crane, J.; Moorjani, P.; Crenshaw, A. T.; Parkin, M. A.; Reus, V. I.; Lowe, T. L.; Rangel-Lugo, M.; Chouinard, S.; Dion, Y.; Girard, S.; Cath, D. C.; Smit, J. H.; King, R. A.; Fernandez, T. V.; Leckman, J. F.; Kidd, K. K.; Kidd, J. R.; Pakstis, A. J.; State, M. W.; Herrera, L. D.; Romero, R.; Fournier, E.; Sandor, P.; Barr, C. L.; Phan, N.; Gross-Tsur, V.; Benarroch, F.; Pollak, Y.; Budman, C. L.; Bruun, R. D.; Erenberg, G.; Naarden, A. L.; Hoekstra, P. J.
Tourette's syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association
We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10(-11)) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10(-9)), ANK3 (rs10994359, P = 2.5 × 10(-8)) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10(-9)).
Oskari Kilpeläinen, Tuomas; Ingelsson, Erik
Adiposity is strongly heritable and one of the leading risk factors for type 2 diabetes, cardiovascular disease, cancer, and premature death. In the past 8 years, genome-wide association studies (GWAS) have greatly increased our understanding of the genes and biological pathways that regulate...
Chaste, Pauline; Klei, Lambertus; Sanders, Stephan J; Hus, Vanessa; Murtha, Michael T; Lowe, Jennifer K; Willsey, A Jeremy; Moreno-De-Luca, Daniel; Yu, Timothy W; Fombonne, Eric; Geschwind, Daniel; Grice, Dorothy E; Ledbetter, David H; Mane, Shrikant M; Martin, Donna M; Morrow, Eric M; Walsh, Christopher A; Sutcliffe, James S; Lese Martin, Christa; Beaudet, Arthur L; Lord, Catherine; State, Matthew W; Cook, Edwin H; Devlin, Bernie
Phenotypic heterogeneity in autism has long been conjectured to be a major hindrance to the discovery of genetic risk factors, leading to numerous attempts to stratify children based on phenotype to increase power of discovery studies. This approach, however, is based on the hypothesis that phenotypic heterogeneity closely maps to genetic variation, which has not been tested. Our study examines the impact of subphenotyping of a well-characterized autism spectrum disorder (ASD) sample on genetic homogeneity and the ability to discover common genetic variants conferring liability to ASD. Genome-wide genotypic data of 2576 families from the Simons Simplex Collection were analyzed in the overall sample and phenotypic subgroups defined on the basis of diagnosis, IQ, and symptom profiles. We conducted a family-based association study, as well as estimating heritability and evaluating allele scores for each phenotypic subgroup. Association analyses revealed no genome-wide significant association signal. Subphenotyping did not increase power substantially. Moreover, allele scores built from the most associated single nucleotide polymorphisms, based on the odds ratio in the full sample, predicted case status in subsets of the sample equally well and heritability estimates were very similar for all subgroups. In genome-wide association analysis of the Simons Simplex Collection sample, reducing phenotypic heterogeneity had at most a modest impact on genetic homogeneity. Our results are based on a relatively small sample, one with greater homogeneity than the entire population; if they apply more broadly, they imply that analysis of subphenotypes is not a productive path forward for discovering genetic risk variants in ASD. Copyright © 2015 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
Davies, G; Armstrong, N; Bis, J C; Bressler, J; Chouraki, V; Giddaluru, S; Hofer, E; Ibrahim-Verbaas, C A; Kirin, M; Lahti, J; van der Lee, S J; Le Hellard, S; Liu, T; Marioni, R E; Oldmeadow, C; Postmus, I; Smith, A V; Smith, J A; Thalamuthu, A; Thomson, R; Vitart, V; Wang, J; Yu, L; Zgaga, L; Zhao, W; Boxall, R; Harris, S E; Hill, W D; Liewald, D C; Luciano, M; Adams, H; Ames, D; Amin, N; Amouyel, P; Assareh, A A; Au, R; Becker, J T; Beiser, A; Berr, C; Bertram, L; Boerwinkle, E; Buckley, B M; Campbell, H; Corley, J; De Jager, P L; Dufouil, C; Eriksson, J G; Espeseth, T; Faul, J D; Ford, I; Scotland, Generation; Gottesman, R F; Griswold, M E; Gudnason, V; Harris, T B; Heiss, G; Hofman, A; Holliday, E G; Huffman, J; Kardia, S L R; Kochan, N; Knopman, D S; Kwok, J B; Lambert, J-C; Lee, T; Li, G; Li, S-C; Loitfelder, M; Lopez, O L; Lundervold, A J; Lundqvist, A; Mather, K A; Mirza, S S; Nyberg, L; Oostra, B A; Palotie, A; Papenberg, G; Pattie, A; Petrovic, K; Polasek, O; Psaty, B M; Redmond, P; Reppermund, S; Rotter, J I; Schmidt, H; Schuur, M; Schofield, P W; Scott, R J; Steen, V M; Stott, D J; van Swieten, J C; Taylor, K D; Trollor, J; Trompet, S; Uitterlinden, A G; Weinstein, G; Widen, E; Windham, B G; Jukema, J W; Wright, A F; Wright, M J; Yang, Q; Amieva, H; Attia, J R; Bennett, D A; Brodaty, H; de Craen, A J M; Hayward, C; Ikram, M A; Lindenberger, U; Nilsson, L-G; Porteous, D J; Räikkönen, K; Reinvang, I; Rudan, I; Sachdev, P S; Schmidt, R; Schofield, P R; Srikanth, V; Starr, J M; Turner, S T; Weir, D R; Wilson, J F; van Duijn, C; Launer, L; Fitzpatrick, A L; Seshadri, S; Mosley, T H; Deary, I J
General cognitive function is substantially heritable across the human life course from adolescence to old age. We investigated the genetic contribution to variation in this important, health- and well-being-related trait in middle-aged and older adults. We conducted a meta-analysis of genome-wide association studies of 31 cohorts (N=53 949) in which the participants had undertaken multiple, diverse cognitive tests. A general cognitive function phenotype was tested for, and created in each cohort by principal component analysis. We report 13 genome-wide significant single-nucleotide polymorphism (SNP) associations in three genomic regions, 6q16.1, 14q12 and 19q13.32 (best SNP and closest gene, respectively: rs10457441, P=3.93 × 10−9, MIR2113; rs17522122, P=2.55 × 10−8, AKAP6; rs10119, P=5.67 × 10−9, APOE/TOMM40). We report one gene-based significant association with the HMGN1 gene located on chromosome 21 (P=1 × 10−6). These genes have previously been associated with neuropsychiatric phenotypes. Meta-analysis results are consistent with a polygenic model of inheritance. To estimate SNP-based heritability, the genome-wide complex trait analysis procedure was applied to two large cohorts, the Atherosclerosis Risk in Communities Study (N=6617) and the Health and Retirement Study (N=5976). The proportion of phenotypic variation accounted for by all genotyped common SNPs was 29% (s.e.=5%) and 28% (s.e.=7%), respectively. Using polygenic prediction analysis, ~1.2% of the variance in general cognitive function was predicted in the Generation Scotland cohort (N=5487; P=1.5 × 10−17). In hypothesis-driven tests, there was significant association between general cognitive function and four genes previously associated with Alzheimer's disease: TOMM40, APOE, ABCG1 and MEF2C. PMID:25644384
Full Text Available Thoroughbred, a relatively recent horse breed, is best known for its use in horse racing. Although myostatin (MSTN variants have been reported to be highly associated with horse racing performance, the trait is more likely to be polygenic in nature. The purpose of this study was to identify genetic variants strongly associated with racing performance by using estimated breeding value (EBV for race time as a phenotype. We conducted a two-stage genome-wide association study to search for genetic variants associated with the EBV. In the first stage of genome-wide association study, a relatively large number of markers (~54,000 single-nucleotide polymorphisms, SNPs were evaluated in a small number of samples (240 horses. In the second stage, a relatively small number of markers identified to have large effects (170 SNPs were evaluated in a much larger number of samples (1,156 horses. We also validated the SNPs related to MSTN known to have large effects on racing performance and found significant associations in the stage two analysis, but not in stage one. We identified 28 significant SNPs related to 17 genes. Among these, six genes have a function related to myogenesis and five genes are involved in muscle maintenance. To our knowledge, these genes are newly reported for the genetic association with racing performance of Thoroughbreds. It complements a recent horse genome-wide association studies of racing performance that identified other SNPs and genes as the most significant variants. These results will help to expand our knowledge of the polygenic nature of racing performance in Thoroughbreds.
Hsu, Yi-Hsiang; Liu, Youfang; Hannan, Marian T.; Maixner, William; Smith, Shad B.; Diatchenko, Luda; Golightly, Yvonne M.; Menz, Hylton B.; Kraus, Virginia B.; Doherty, Michael; Wilson, A.G.; Jordan, Joanne M.
Objective Hallux valgus (HV) affects ~36% of Caucasian adults. Although considered highly heritable, the underlying genetic determinants are unclear. We conducted the first genome-wide association study (GWAS) aimed to identify genetic variants associated with HV. Methods HV was assessed in 3 Caucasian cohorts (n=2,263, n=915, and n=1,231 participants, respectively). In each cohort, a GWAS was conducted using 2.5M imputed single nucleotide polymorphisms (SNPs). Mixed-effect regression with the additive genetic model adjusted for age, sex, weight and within-family correlations was used for both sex-specific and combined analyses. To combine GWAS results across cohorts, fixed-effect inverse-variance meta-analyses were used. Following meta-analyses, top-associated findings were also examined in an African American cohort (n=327). Results The proportion of HV variance explained by genome-wide genotyped SNPs was 50% in men and 48% in women. A higher proportion of genetic determinants of HV was sex-specific. The most significantly associated SNP in men was rs9675316 located on chr17q23-a24 near the AXIN2 gene (p=5.46×10−7); the most significantly associated SNP in women was rs7996797 located on chr13q14.1-q14.2 near the ESD gene (p=7.21×10−7). Genome-wide significant SNP-by-sex interaction was found for SNP rs1563374 located on chr11p15.1 near the MRGPRX3 gene (interaction p-value =4.1×10−9). The association signals diminished when combining men and women. Conclusion Findings suggest that the potential pathophysiological mechanisms of HV are complex and strongly underlined by sex-specific interactions. The identified genetic variants imply contribution of biological pathways observed in osteoarthritis as well as new pathways, influencing skeletal development and inflammation. PMID:26337638
Mariette, Stéphanie; Wong Jun Tai, Fabienne; Roch, Guillaume; Barre, Aurélien; Chague, Aurélie; Decroocq, Stéphane; Groppi, Alexis; Laizet, Yec'han; Lambert, Patrick; Tricon, David; Nikolski, Macha; Audergon, Jean-Marc; Abbott, Albert G; Decroocq, Véronique
In fruit tree species, many important traits have been characterized genetically by using single-family descent mapping in progenies segregating for the traits. However, most mapped loci have not been sufficiently resolved to the individual genes due to insufficient progeny sizes for high resolution mapping and the previous lack of whole-genome sequence resources of the study species. To address this problem for Plum Pox Virus (PPV) candidate resistance gene identification in Prunus species, we implemented a genome-wide association (GWA) approach in apricot. This study exploited the broad genetic diversity of the apricot (Prunus armeniaca) germplasm containing resistance to PPV, next-generation sequence-based genotyping, and the high-quality peach (Prunus persica) genome reference sequence for single nucleotide polymorphism (SNP) identification. The results of this GWA study validated previously reported PPV resistance quantitative trait loci (QTL) intervals, highlighted other potential resistance loci, and resolved each to a limited set of candidate genes for further study. This work substantiates the association genetics approach for resolution of QTL to candidate genes in apricot and suggests that this approach could simplify identification of other candidate genes for other marked trait intervals in this germplasm. © 2015 INRA, UMR 1332 BFP New Phytologist © 2015 New Phytologist Trust.
Zhao, Huiying; Nyholt, Dale R; Yang, Yuanhao; Wang, Jihua; Yang, Yuedong
Genome-wide association studies (GWAS) have successfully identified single variants associated with diseases. To increase the power of GWAS, gene-based and pathway-based tests are commonly employed to detect more risk factors. However, the gene- and pathway-based association tests may be biased towards genes or pathways containing a large number of single-nucleotide polymorphisms (SNPs) with small P-values caused by high linkage disequilibrium (LD) correlations. To address such bias, numerous pathway-based methods have been developed. Here we propose a novel method, DGAT-path, to divide all SNPs assigned to genes in each pathway into LD blocks, and to sum the chi-square statistics of LD blocks for assessing the significance of the pathway by permutation tests. The method was proven robust with the type I error rate >1.6 times lower than other methods. Meanwhile, the method displays a higher power and is not biased by the pathway size. The applications to the GWAS summary statistics for schizophrenia and breast cancer indicate that the detected top pathways contain more genes close to associated SNPs than other methods. As a result, the method identified 17 and 12 significant pathways containing 20 and 21 novel associated genes, respectively for two diseases. The method is available online by http://sparks-lab.org/server/DGAT-path .
Full Text Available In the yeast Saccharomyces cerevisiae and most other eukaryotes, mitotic recombination is important for the repair of double-stranded DNA breaks (DSBs. Mitotic recombination between homologous chromosomes can result in loss of heterozygosity (LOH. In this study, LOH events induced by ultraviolet (UV light are mapped throughout the genome to a resolution of about 1 kb using single-nucleotide polymorphism (SNP microarrays. UV doses that have little effect on the viability of diploid cells stimulate crossovers more than 1000-fold in wild-type cells. In addition, UV stimulates recombination in G1-synchronized cells about 10-fold more efficiently than in G2-synchronized cells. Importantly, at high doses of UV, most conversion events reflect the repair of two sister chromatids that are broken at approximately the same position whereas at low doses, most conversion events reflect the repair of a single broken chromatid. Genome-wide mapping of about 380 unselected crossovers, break-induced replication (BIR events, and gene conversions shows that UV-induced recombination events occur throughout the genome without pronounced hotspots, although the ribosomal RNA gene cluster has a significantly lower frequency of crossovers.
Mota, R R; Guimarães, S E F; Fortes, M R S; Hayes, B; Silva, F F; Verardo, L L; Kelly, M J; de Campos, C F; Guimarães, J D; Wenceslau, R R; Penitente-Filho, J M; Garcia, J F; Moore, S
We performed a genome-wide mapping for the age at first calving (AFC) with the goal of annotating candidate genes that regulate fertility in Nellore cattle. Phenotypic data from 762 cows and 777k SNP genotypes from 2,992 bulls and cows were used. Single nucleotide polymorphism (SNP) effects based on the single-step GBLUP methodology were blocked into adjacent windows of 1 Megabase (Mb) to explain the genetic variance. SNP windows explaining more than 0.40% of the AFC genetic variance were identified on chromosomes 2, 8, 9, 14, 16 and 17. From these windows, we identified 123 coding protein genes that were used to build gene networks. From the association study and derived gene networks, putative candidate genes (e.g., PAPPA, PREP, FER1L6, TPR, NMNAT1, ACAD10, PCMTD1, CRH, OPKR1, NPBWR1 and NCOA2) and transcription factors (TF) (STAT1, STAT3, RELA, E2F1 and EGR1) were strongly associated with female fertility (e.g., negative regulation of luteinizing hormone secretion, folliculogenesis and establishment of uterine receptivity). Evidence suggests that AFC inheritance is complex and controlled by multiple loci across the genome. As several windows explaining higher proportion of the genetic variance were identified on chromosome 14, further studies investigating the interaction across haplotypes to better understand the molecular architecture behind AFC in Nellore cattle should be undertaken. © 2017 Blackwell Verlag GmbH.
Black, W C; Gorrochotegui-Escalante, N; Duteau, N M
Most single nucleotide polymorphism (SNP) detection requires expensive equipment and reagents. The oligonucleotide ligation assay (OLA) is an inexpensive SNP assay that detects ligation between a biotinylated "allele-specific detector" and a 3' fluorescein-labeled "reporter" oligonucleotide. No ligation occurs unless the 3' detector nucleotide is complementary to the SNP nucleotide. The original OLA used chemical denaturation and neutralization. Heated OLA (HOLA) instead uses a thermal stable ligase and cycles of denaturing and hybridization for ligation and SNP detection. The cost per genotype is approximately US$1.25 with two-allele SNPs or approximately US$1.75 with three-allele SNPs. We illustrate the development of HOLA for SNP detection in the Early Trypsin and Abundant Trypsin loci in the mosquito Aedes aegypti (L.) and at the a-glycerophosphate dehydrogenase locus in the mosquito Anopheles gambiae s.s.
Full Text Available Abstract Background Animal identification is pivotal in governmental agricultural policy, enabling the management of subsidy payments, movement of livestock, test scheduling and control of disease. Advances in bovine genomics have made it possible to utilise inherent genetic variability to uniquely identify individual animals by DNA profiling, much as has been achieved with humans over the past 20 years. A DNA profiling test based on bi-allelic single nucleotide polymorphism (SNP markers would offer considerable advantages over current short tandem repeat (STR based industry standard tests, in that it would be easier to analyse and interpret. In this study, a panel of 51 genome-wide SNPs were genotyped across panels of semen DNA from 6 common breeds for the purposes of ascertaining allelic frequency. For SNPs on the same chromosome, the extent of linkage disequilbrium was determined from genotype data by Expectation Maximization (EM algorithm. Minimum probabilities of unique identification were determined for each breed panel. The usefulness of this SNP panel was ascertained by comparison to the current bovine STR Stockmarks II assay. A statistically representative random sampling of bovine animals from across Northern Ireland was assembled for the purposes of determining the population allele frequency for these STR loci and subsequently, the minimal probability of unique identification they conferred in sampled bovine animals from Northern Ireland. Results 6 SNPs exhibiting a minor allele frequency of less than 0.2 in more than 3 of the breed panels were excluded. 2 Further SNPs were found to reside in coding areas of the cattle genome and were excluded from the final panel. The remaining 43 SNPs exhibited genotype frequencies which were in Hardy Weinberg Equilibrium. SNPs on the same chromosome were observed to have no significant linkage disequilibrium/allelic association. Minimal probabilities of uniquely identifying individual animals from
Full Text Available Abstract Background Ancestry informative markers (AIMs are a type of genetic marker that is informative for tracing the ancestral ethnicity of individuals. Application of AIMs has gained substantial attention in population genetics, forensic sciences, and medical genetics. Single nucleotide polymorphisms (SNPs, the materials of AIMs, are useful for classifying individuals from distinct continental origins but cannot discriminate individuals with subtle genetic differences from closely related ancestral lineages. Proof-of-principle studies have shown that gene expression (GE also is a heritable human variation that exhibits differential intensity distributions among ethnic groups. GE supplies ethnic information supplemental to SNPs; this motivated us to integrate SNP and GE markers to construct AIM panels with a reduced number of required markers and provide high accuracy in ancestry inference. Few studies in the literature have considered GE in this aspect, and none have integrated SNP and GE markers to aid classification of samples from closely related ethnic populations. Results We integrated a forward variable selection procedure into flexible discriminant analysis to identify key SNP and/or GE markers with the highest cross-validation prediction accuracy. By analyzing genome-wide SNP and/or GE markers in 210 independent samples from four ethnic groups in the HapMap II Project, we found that average testing accuracies for a majority of classification analyses were quite high, except for SNP-only analyses that were performed to discern study samples containing individuals from two close Asian populations. The average testing accuracies ranged from 0.53 to 0.79 for SNP-only analyses and increased to around 0.90 when GE markers were integrated together with SNP markers for the classification of samples from closely related Asian populations. Compared to GE-only analyses, integrative analyses of SNP and GE markers showed comparable testing
Ren, Wen-Long; Wen, Yang-Jun; Dunwell, Jim M; Zhang, Yuan-Ming
Although nonparametric methods in genome-wide association studies (GWAS) are robust in quantitative trait nucleotide (QTN) detection, the absence of polygenic background control in single-marker association in genome-wide scans results in a high false positive rate. To overcome this issue, we proposed an integrated nonparametric method for multi-locus GWAS. First, a new model transformation was used to whiten the covariance matrix of polygenic matrix K and environmental noise. Using the transferred model, Kruskal-Wallis test along with least angle regression was then used to select all the markers that were potentially associated with the trait. Finally, all the selected markers were placed into multi-locus model, these effects were estimated by empirical Bayes, and all the nonzero effects were further identified by a likelihood ratio test for true QTN detection. This method, named pKWmEB, was validated by a series of Monte Carlo simulation studies. As a result, pKWmEB effectively controlled false positive rate, although a less stringent significance criterion was adopted. More importantly, pKWmEB retained the high power of Kruskal-Wallis test, and provided QTN effect estimates. To further validate pKWmEB, we re-analyzed four flowering time related traits in Arabidopsis thaliana, and detected some previously reported genes that were not identified by the other methods.
Volkov, Petr; Olsson, Anders H; Gillberg, Linn
Little is known about the extent to which interactions between genetics and epigenetics may affect the risk of complex metabolic diseases and/or their intermediary phenotypes. We performed a genome-wide DNA methylation quantitative trait locus (mQTL) analysis in human adipose tissue of 119 men, w...... and epigenetic variation in both cis and trans positions influencing gene expression in adipose tissue and in vivo (dys)metabolic traits associated with the development of obesity and diabetes.......Little is known about the extent to which interactions between genetics and epigenetics may affect the risk of complex metabolic diseases and/or their intermediary phenotypes. We performed a genome-wide DNA methylation quantitative trait locus (mQTL) analysis in human adipose tissue of 119 men......, where 592,794 single nucleotide polymorphisms (SNPs) were related to DNA methylation of 477,891 CpG sites, covering 99% of RefSeq genes. SNPs in significant mQTLs were further related to gene expression in adipose tissue and obesity related traits. We found 101,911 SNP-CpG pairs (mQTLs) in cis and 5...
Hu, Jiazhi; Meyers, Robin M; Dong, Junchao; Panchakshari, Rohit A; Alt, Frederick W; Frock, Richard L
Unbiased, high-throughput assays for detecting and quantifying DNA double-stranded breaks (DSBs) across the genome in mammalian cells will facilitate basic studies of the mechanisms that generate and repair endogenous DSBs. They will also enable more applied studies, such as those to evaluate the on- and off-target activities of engineered nucleases. Here we describe a linear amplification-mediated high-throughput genome-wide sequencing (LAM-HTGTS) method for the detection of genome-wide 'prey' DSBs via their translocation in cultured mammalian cells to a fixed 'bait' DSB. Bait-prey junctions are cloned directly from isolated genomic DNA using LAM-PCR and unidirectionally ligated to bridge adapters; subsequent PCR steps amplify the single-stranded DNA junction library in preparation for Illumina Miseq paired-end sequencing. A custom bioinformatics pipeline identifies prey sequences that contribute to junctions and maps them across the genome. LAM-HTGTS differs from related approaches because it detects a wide range of broken end structures with nucleotide-level resolution. Familiarity with nucleic acid methods and next-generation sequencing analysis is necessary for library generation and data interpretation. LAM-HTGTS assays are sensitive, reproducible, relatively inexpensive, scalable and straightforward to implement with a turnaround time of <1 week.
Cheng, Yu-Ching; Stanne, Tara M; Giese, Anne-Katrin; Ho, Weang Kee; Traylor, Matthew; Amouyel, Philippe; Holliday, Elizabeth G; Malik, Rainer; Xu, Huichun; Kittner, Steven J; Cole, John W; O'Connell, Jeffrey R; Danesh, John; Rasheed, Asif; Zhao, Wei; Engelter, Stefan; Grond-Ginsbach, Caspar; Kamatani, Yoichiro; Lathrop, Mark; Leys, Didier; Thijs, Vincent; Metso, Tiina M; Tatlisumak, Turgut; Pezzini, Alessandro; Parati, Eugenio A; Norrving, Bo; Bevan, Steve; Rothwell, Peter M; Sudlow, Cathie; Slowik, Agnieszka; Lindgren, Arne; Walters, Matthew R; Jannes, Jim; Shen, Jess; Crosslin, David; Doheny, Kimberly; Laurie, Cathy C; Kanse, Sandip M; Bis, Joshua C; Fornage, Myriam; Mosley, Thomas H; Hopewell, Jemma C; Strauch, Konstantin; Müller-Nurasyid, Martina; Gieger, Christian; Waldenberger, Melanie; Peters, Annette; Meisinger, Christine; Ikram, M Arfan; Longstreth, W T; Meschia, James F; Seshadri, Sudha; Sharma, Pankaj; Worrall, Bradford; Jern, Christina; Levi, Christopher; Dichgans, Martin; Boncoraglio, Giorgio B; Markus, Hugh S; Debette, Stephanie; Rolfs, Arndt; Saleheen, Danish; Mitchell, Braxton D
Although a genetic contribution to ischemic stroke is well recognized, only a handful of stroke loci have been identified by large-scale genetic association studies to date. Hypothesizing that genetic effects might be stronger for early- versus late-onset stroke, we conducted a 2-stage meta-analysis of genome-wide association studies, focusing on stroke cases with an age of onset genetic variants at loci with association Pstroke susceptibility locus at 10q25 reached genome-wide significance in the combined analysis of all samples from the discovery and follow-up stages (rs11196288; odds ratio =1.41; P=9.5×10(-9)). The associated locus is in an intergenic region between TCF7L2 and HABP2. In a further analysis in an independent sample, we found that 2 single nucleotide polymorphisms in high linkage disequilibrium with rs11196288 were significantly associated with total plasma factor VII-activating protease levels, a product of HABP2. HABP2, which encodes an extracellular serine protease involved in coagulation, fibrinolysis, and inflammatory pathways, may be a genetic susceptibility locus for early-onset stroke. © 2016 American Heart Association, Inc.
Full Text Available Most of the previously reported loci for total immunoglobulin E (IgE levels are related to Th2 cell-dependent pathways. We undertook a genome-wide association study (GWAS to identify genetic loci responsible for IgE regulation. A total of 479,940 single nucleotide polymorphisms (SNPs were tested for association with total serum IgE levels in 1180 Japanese adults. Fine-mapping with SNP imputation demonstrated 6 candidate regions: the PYHIN1/IFI16, MHC classes I and II, LEMD2, GRAMD1B, and chr13∶60576338 regions. Replication of these candidate loci in each region was assessed in 2 independent Japanese cohorts (n = 1110 and 1364, respectively. SNP rs3130941 in the HLA-C region was consistently associated with total IgE levels in 3 independent populations, and the meta-analysis yielded genome-wide significance (P = 1.07×10(-10. Using our GWAS results, we also assessed the reproducibility of previously reported gene associations with total IgE levels. Nine of 32 candidate genes identified by a literature search were associated with total IgE levels after correction for multiple testing. Our findings demonstrate that SNPs in the HLA-C region are strongly associated with total serum IgE levels in the Japanese population and that some of the previously reported genetic associations are replicated across ethnic groups.
Robert W. Bryson Jr.
Full Text Available Morphologically conserved taxa such as scorpions represent a challenge to delimit. We recently discovered populations of scorpions in the genus Kovarikia Soleglad, Fet & Graham, 2014 on two isolated mountain ranges in southern California. We generated genome-wide single nucleotide polymorphism data and used Bayes factors species delimitation to compare alternative species delimitation scenarios which variously placed scorpions from the two localities with geographically adjacent species or into separate lineages. We also estimated a time-calibrated phylogeny of Kovarikia and examined and compared the morphology of preserved specimens from across its distribution. Genetic results strongly support the distinction of two new lineages, which we describe and name here. Morphology among the species of Kovarikia was relatively conserved, despite deep genetic divergences, consistent with recent studies of stenotopic scorpions with limited vagility. Phylogeographic structure discovered in several previously described species also suggests additional cryptic species are probably present in the genus.
Schork, Andrew J; Thompson, Wesley K; Pham, Phillip
Recent results indicate that genome-wide association studies (GWAS) have the potential to explain much of the heritability of common complex phenotypes, but methods are lacking to reliably identify the remaining associated single nucleotide polymorphisms (SNPs). We applied stratified False...... Discovery Rate (sFDR) methods to leverage genic enrichment in GWAS summary statistics data to uncover new loci likely to replicate in independent samples. Specifically, we use linkage disequilibrium-weighted annotations for each SNP in combination with nominal p-values to estimate the True Discovery Rate...... in introns, and negative enrichment for intergenic SNPs. Stratified enrichment directly leads to increased TDR for a given p-value, mirrored by increased replication rates in independent samples. We show this in independent Crohn's disease GWAS, where we find a hundredfold variation in replication rate...
Bryson, Robert W.; Wood, Dustin A.; Graham, Matthew R.; Soleglad, Michael E.; McCormack, John E.
Morphologically conserved taxa such as scorpions represent a challenge to delimit. We recently discovered populations of scorpions in the genus Kovarikia Soleglad, Fet & Graham, 2014 on two isolated mountain ranges in southern California. We generated genome-wide single nucleotide polymorphism data and used Bayes factors species delimitation to compare alternative species delimitation scenarios which variously placed scorpions from the two localities with geographically adjacent species or into separate lineages. We also estimated a time-calibrated phylogeny of Kovarikia and examined and compared the morphology of preserved specimens from across its distribution. Genetic results strongly support the distinction of two new lineages, which we describe and name here. Morphology among the species of Kovarikia was relatively conserved, despite deep genetic divergences, consistent with recent studies of stenotopic scorpions with limited vagility. Phylogeographic structure discovered in several previously described species also suggests additional cryptic species are probably present in the genus.
Chen, Sherry Xi; Seelig, Georg
Even a single-nucleotide difference between the sequences of two otherwise identical biological nucleic acids can have dramatic functional consequences. Here, we use model-guided reaction pathway engineering to quantitatively improve the performance of selective hybridization probes in recognizing single nucleotide variants (SNVs). Specifically, we build a detection system that combines discrimination by competition with DNA strand displacement-based catalytic amplification. We show, both mathematically and experimentally, that the single nucleotide selectivity of such a system in binding to single-stranded DNA and RNA is quadratically better than discrimination due to competitive hybridization alone. As an additional benefit the integrated circuit inherits the property of amplification and provides at least 10-fold better sensitivity than standard hybridization probes. Moreover, we demonstrate how the detection mechanism can be tuned such that the detection reaction is agnostic to the position of the SNV within the target sequence. in contrast, prior strand displacement-based probes designed for kinetic discrimination are highly sensitive to position effects. We apply our system to reliably discriminate between different members of the let-7 microRNA family that differ in only a single base position. Our results demonstrate the power of systematic reaction network design to quantitatively improve biotechnology.
Nguyen H. Nguyen
Full Text Available The genetic resources available for the commercially important fish species Yellowtail kingfish (YTK (Seriola lalandi are relative sparse. To overcome this, we aimed (1 to develop a linkage map for this species, and (2 to identify markers/variants associated with economically important traits in kingfish (with an emphasis on body weight. Genetic and genomic analyses were conducted using 13,898 single nucleotide polymorphisms (SNPs generated from a new high-throughput genotyping by sequencing platform, Diversity Arrays Technology (DArTseqTM in a pedigreed population comprising 752 animals. The linkage analysis enabled to map about 4,000 markers to 24 linkage groups (LGs, with an average density of 3.4 SNPs per cM. The linkage map was integrated into a genome-wide association study (GWAS and identified six variants/SNPs associated with body weight (P < 5e-8 when a multi-locus mixed model was used. Two out of the six significant markers were mapped to LGs 17 and 23, and collectively they explained 5.8% of the total genetic variance. It is concluded that the newly developed linkage map and the significantly associated markers with body weight provide fundamental information to characterize genetic architecture of growth-related traits in this population of YTK S. lalandi.
Nguyen, Nguyen H; Rastas, Pasi M A; Premachandra, H K A; Knibb, Wayne
The genetic resources available for the commercially important fish species Yellowtail kingfish (YTK) ( Seriola lalandi) are relative sparse. To overcome this, we aimed (1) to develop a linkage map for this species, and (2) to identify markers/variants associated with economically important traits in kingfish (with an emphasis on body weight). Genetic and genomic analyses were conducted using 13,898 single nucleotide polymorphisms (SNPs) generated from a new high-throughput genotyping by sequencing platform, Diversity Arrays Technology (DArTseq TM ) in a pedigreed population comprising 752 animals. The linkage analysis enabled to map about 4,000 markers to 24 linkage groups (LGs), with an average density of 3.4 SNPs per cM. The linkage map was integrated into a genome-wide association study (GWAS) and identified six variants/SNPs associated with body weight ( P 5e -8 ) when a multi-locus mixed model was used. Two out of the six significant markers were mapped to LGs 17 and 23, and collectively they explained 5.8% of the total genetic variance. It is concluded that the newly developed linkage map and the significantly associated markers with body weight provide fundamental information to characterize genetic architecture of growth-related traits in this population of YTK S. lalandi .
Mohammadnejad, Afsaneh; Brasch-Andersen, Charlotte; Haagerup, Annette
Background: Allergic Rhinitis (AR) is a complex disorder that affects many people around the world. There is a high genetic contribution to the development of the AR, as twins and family studies have estimated heritability of more than 33%. Due to the complex nature of the disease, single SNP...... analysis has limited power in identifying the genetic variations for AR. We combined genome-wide association analysis (GWAS) with polygenic risk score (PRS) in exploring the genetic basis underlying the disease. Methods: We collected clinical data on 631 Danish subjects with AR cases consisting of 434...... sibling pairs and unrelated individuals and control subjects of 197 unrelated individuals. SNP genotyping was done by Affymetrix Genome-Wide Human SNP Array 5.0. SNP imputation was performed using "IMPUTE2". Using additive effect model, GWAS was conducted in discovery sample, the genotypes...
Nagao, Yumiko; Nishida, Nao; Toyo-Oka, Licht; Kawaguchi, Atsushi; Amoroso, Antonio; Carrozzo, Marco; Sata, Michio; Mizokami, Masashi; Tokunaga, Katsushi; Tanaka, Yasuhito
There is a close relationship between hepatitis C virus (HCV) infection and lichen planus, a chronic inflammatory mucocutaneous disease. We performed a genome-wide association study (GWAS) to identify genetic variants associated with HCV-related lichen planus. We conducted a GWAS of 261 patients with HCV infection treated at a tertiary medical center in Japan from October 2007 through January 2013; a total of 71 had lichen planus and 190 had normal oral mucosa. We validated our findings in a GWAS of 38 patients with HCV-associated lichen planus and 7 HCV-infected patients with normal oral mucosa treated at a medical center in Italy. Single-nucleotide polymorphisms in NRP2 (rs884000) and IGFBP4 (rs538399) were associated with risk of HCV-associated lichen planus (P lichen planus. The odds ratios for the minor alleles of rs884000, rs538399, and rs9461799 were 3.25 (95% confidence interval, 1.95-5.41), 0.40 (95% confidence interval, 0.25-0.63), and 2.15 (95% confidence interval, 1.41-3.28), respectively. In a GWAS of Japanese patients with HCV infection, we replicated associations between previously reported polymorphisms in HLA class II genes and risk for lichen planus. We also identified single-nucleotide polymorphisms in NRP2 and IGFBP4 loci that increase and reduce risk of lichen planus, respectively. These genetic variants might be used to identify patients with HCV infection who are at risk for lichen planus. Copyright © 2017 AGA Institute. Published by Elsevier Inc. All rights reserved.
Ng, MYM; Levinson, DF; Faraone, SV; Suarez, BK; DeLisi, LE; Arinami, T; Riley, B; Paunio, T; Pulver, AE; Irmansyah; Holmans, PA; Escamilla, M; Wildenauer, DB; Williams, NM; Laurent, C; Mowry, BJ; Brzustowicz, LM; Maziade, M; Sklar, P; Garver, DL; Abecasis, GR; Lerer, B; Fallin, MD; Gurling, HMD; Gejman, PV; Lindholm, E; Moises, HW; Byerley, W; Wijsman, EM; Forabosco, P; Tsuang, MT; Hwu, H-G; Okazaki, Y; Kendler, KS; Wormley, B; Fanous, A; Walsh, D; O’Neill, FA; Peltonen, L; Nestadt, G; Lasseter, VK; Liang, KY; Papadimitriou, GM; Dikeos, DG; Schwab, SG; Owen, MJ; O’Donovan, MC; Norton, N; Hare, E; Raventos, H; Nicolini, H; Albus, M; Maier, W; Nimgaonkar, VL; Terenius, L; Mallet, J; Jay, M; Godard, S; Nertney, D; Alexander, M; Crowe, RR; Silverman, JM; Bassett, AS; Roy, M-A; Mérette, C; Pato, CN; Pato, MT; Roos, J Louw; Kohn, Y; Amann-Zalcenstein, D; Kalsi, G; McQuillin, A; Curtis, D; Brynjolfson, J; Sigmundsson, T; Petursson, H; Sanders, AR; Duan, J; Jazin, E; Myles-Worsley, M; Karayiorgou, M; Lewis, CM
A genome scan meta-analysis (GSMA) was carried out on 32 independent genome-wide linkage scan analyses that included 3255 pedigrees with 7413 genotyped cases affected with schizophrenia (SCZ) or related disorders. The primary GSMA divided the autosomes into 120 bins, rank-ordered the bins within each study according to the most positive linkage result in each bin, summed these ranks (weighted for study size) for each bin across studies and determined the empirical probability of a given summed rank (PSR) by simulation. Suggestive evidence for linkage was observed in two single bins, on chromosomes 5q (142-168 Mb) and 2q (103-134 Mb). Genome-wide evidence for linkage was detected on chromosome 2q (119-152 Mb) when bin boundaries were shifted to the middle of the previous bins. The primary analysis met empirical criteria for ‘aggregate’ genome-wide significance, indicating that some or all of 10 bins are likely to contain loci linked to SCZ, including regions of chromosomes 1, 2q, 3q, 4q, 5q, 8p and 10q. In a secondary analysis of 22 studies of European-ancestry samples, suggestive evidence for linkage was observed on chromosome 8p (16-33 Mb). Although the newer genome-wide association methodology has greater power to detect weak associations to single common DNA sequence variants, linkage analysis can detect diverse genetic effects that segregate in families, including multiple rare variants within one locus or several weakly associated loci in the same region. Therefore, the regions supported by this meta-analysis deserve close attention in future studies. PMID:19349958
Beth M Carpenter
Full Text Available Helicobacter pylori is a significant human pathogen that has adapted to survive the many stresses found within the gastric environment. Superoxide Dismutase (SodB is an important factor that helps H. pylori combat oxidative stress. sodB was previously shown to be repressed by the Ferric Uptake Regulator (Fur in the absence of iron (apo-Fur regulation . Herein, we show that apo regulation is not fully conserved among all strains of H. pylori. apo-Fur dependent changes in sodB expression are not observed under iron deplete conditions in H. pylori strains G27, HPAG1, or J99. However, Fur regulation of pfr and amiE occurs as expected. Comparative analysis of the Fur coding sequence between G27 and 26695 revealed a single amino acid difference, which was not responsible for the altered sodB regulation. Comparison of the sodB promoters from G27 and 26695 also revealed a single nucleotide difference within the predicted Fur binding site. Alteration of this nucleotide in G27 to that of 26695 restored apo-Fur dependent sodB regulation, indicating that a single base difference is at least partially responsible for the difference in sodB regulation observed among these H. pylori strains. Fur binding studies revealed that alteration of this single nucleotide in G27 increased the affinity of Fur for the sodB promoter. Additionally, the single base change in G27 enabled the sodB promoter to bind to apo-Fur with affinities similar to the 26695 sodB promoter. Taken together these data indicate that this nucleotide residue is important for direct apo-Fur binding to the sodB promoter.
Yuan, Xiguo; Yu, Guoqiang; Hou, Xuchu; Shih, Ie-Ming; Clarke, Robert; Zhang, Junying; Hoffman, Eric P; Wang, Roger R; Zhang, Zhen; Wang, Yue
Somatic Copy Number Alterations (CNAs) in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC), a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1) exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2) performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3) iteratively detecting Significant Copy Number Aberrations (SCAs) and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS) on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma). When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC) or tumor suppressor genes (e.g., CDKN2A/B). Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes. Open-source and platform-independent SAIC software is
Full Text Available Abstract Background Somatic Copy Number Alterations (CNAs in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC, a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1 exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2 performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3 iteratively detecting Significant Copy Number Aberrations (SCAs and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. Results We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma. When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC or tumor suppressor genes (e.g., CDKN2A/B. Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. Conclusions Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes
Oskari Kilpeläinen, Tuomas
Genome-wide association studies (GWASs) have revolutionized the search for genetic variants regulating resting heart rate. In the last 10 years, GWASs have led to the identification of at least 21 novel heart rate loci. These discoveries have provided valuable insights into the mechanisms...... and pathways that regulate heart rate and link heart rate to cardiovascular morbidity and mortality. GWASs capture majority of genetic variation in a population sample by utilizing high-throughput genotyping chips measuring genotypes for up to several millions of SNPs across the genome in thousands...... of individuals. This allows the identification of the strongest heart rate associated signals at genome-wide level. While GWASs provide robust statistical evidence of the association of a given genetic locus with heart rate, they are only the starting point for detailed follow-up studies to locate the causal...
Full Text Available Prashanth Suravajhala,1 Alfredo Benso2 1Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark; 2Department of Control and Computer Engineering, Politecnico di Torino, Torino, Italy Abstract: Next-generation sequencing technology has provided resources to easily explore and identify candidate single-nucleotide polymorphisms (SNPs and variants. However, there remains a challenge in identifying and inferring the causal SNPs from sequence data. A problem with different methods that predict the effect of mutations is that they produce false positives. In this hypothesis, we provide an overview of methods known for identifying causal variants and discuss the challenges, fallacies, and prospects in discerning candidate SNPs. We then propose a three-point classification strategy, which could be an additional annotation method in identifying causalities. Keywords: clinical mastitis, single-nucleotide polymorphisms, variants, associations, diseases, linkage disequilibrium, GWAS
Khodakov, Dmitriy A; Khodakova, Anastasia S; Huang, David M; Linacre, Adrian; Ellis, Amanda V
Single nucleotide polymorphisms (SNPs) are a prime source of genetic diversity. Discriminating between different SNPs provides an enormous leap towards the better understanding of the uniqueness of biological systems. Here we report on a new approach for SNP discrimination using toehold-mediated DNA strand displacement. The distinctiveness of the approach is based on the combination of both 3- and 4-way branch migration mechanisms, which allows for reliable discrimination of SNPs within double-stranded DNA generated from real-life human mitochondrial DNA samples. Aside from the potential diagnostic value, the current study represents an additional way to control the strand displacement reaction rate without altering other reaction parameters and provides new insights into the influence of single nucleotide substitutions on 3- and 4-way branch migration efficiency and kinetics.
Jee, Sun Ha; Sull, Jae Woong; Lee, Jong-Eun; Shin, Chol; Park, Jongkeun; Kimm, Heejin; Cho, Eun-Young; Shin, Eun-Soon; Yun, Ji Eun; Park, Ji Wan; Kim, Sang Yeun; Lee, Sun Ju; Jee, Eun Jung; Baik, Inkyung; Kao, Linda; Yoon, Sungjoo Kim; Jang, Yangsoo; Beaty, Terri H.
Adiponectin is associated with obesity and insulin resistance. To date, there has been no genome-wide association study (GWAS) of adiponectin levels in Asians. Here we present a GWAS of a cohort of Korean volunteers. A total of 4,001 subjects were genotyped by using a genome-wide marker panel in a two-stage design (979 subjects initially and 3,022 in a second stage). Another 2,304 subjects were used for follow-up replication studies with selected markers. In the discovery phase, the top SNP associated with mean log adiponectin was rs3865188 in CDH13 on chromosome 16 (p = 1.69 × 10−15 in the initial sample, p = 6.58 × 10−39 in the second genome-wide sample, and p = 2.12 × 10−32 in the replication sample). The meta-analysis p value for rs3865188 in all 6,305 individuals was 2.82 × 10−83. The association of rs3865188 with high-molecular-weight adiponectin (p = 7.36 × 10−58) was even stronger in the third sample. A reporter assay that evaluated the effects of a CDH13 promoter SNP in complete linkage disequilibrium with rs3865188 revealed that the major allele increased expression 2.2-fold. This study clearly shows that genetic variants in CDH13 influence adiponectin levels in Korean adults. PMID:20887962
Pritykin, Yuri; Ghersi, Dario; Singh, Mona
Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655
Mocellin, Simone; Tropea, Saveria; Benna, Clara; Rossi, Carlo Riccardo
Dysfunction of the circadian clock and single polymorphisms of some circadian genes have been linked to cancer susceptibility, although data are scarce and findings inconsistent. We aimed to investigate the association between circadian pathway genetic variation and risk of developing common cancers based on the findings of genome-wide association studies (GWASs). Single nucleotide polymorphisms (SNPs) of 17 circadian genes reported by three GWAS meta-analyses dedicated to breast (Discovery, Biology, and Risk of Inherited Variants in Breast Cancer (DRIVE) Consortium; cases, n = 15,748; controls, n = 18,084), prostate (Elucidating Loci Involved in Prostate Cancer Susceptibility (ELLIPSE) Consortium; cases, n = 14,160; controls, n = 12,724) and lung carcinoma (Transdisciplinary Research In Cancer of the Lung (TRICL) Consortium; cases, n = 12,160; controls, n = 16,838) in patients of European ancestry were utilized to perform pathway analysis by means of the adaptive rank truncated product (ARTP) method. Data were also available for the following subgroups: estrogen receptor negative breast cancer, aggressive prostate cancer, squamous lung carcinoma and lung adenocarcinoma. We found a highly significant statistical association between circadian pathway genetic variation and the risk of breast (pathway P value = 1.9 × 10 -6 ; top gene RORA, gene P value = 0.0003), prostate (pathway P value = 4.1 × 10 -6 ; top gene ARNTL, gene P value = 0.0002) and lung cancer (pathway P value = 6.9 × 10 -7 ; top gene RORA, gene P value = 2.0 × 10 -6 ), as well as all their subgroups. Out of 17 genes investigated, 15 were found to be significantly associated with the risk of cancer: four genes were shared by all three malignancies (ARNTL, CLOCK, RORA and RORB), two by breast and lung cancer (CRY1 and CRY2) and three by prostate and lung cancer (NPAS2, NR1D1 and PER3), whereas four genes were specific for lung cancer
Turner, Adam W; Martinuk, Amy; Silva, Anada; Lau, Paulina; Nikpay, Majid; Eriksson, Per; Folkersen, Lasse; Perisic, Ljubica; Hedin, Ulf; Soubeyrand, Sebastien; McPherson, Ruth
A recent genome-wide association study meta-analysis identified an intronic single nucleotide polymorphism in SMAD3, rs56062135C>T, the minor allele (T) which associates with protection from coronary artery disease. Relevant to atherosclerosis, SMAD3 is a key contributor to transforming growth factor-β pathway signaling. Here, we seek to identify ≥1 causal coronary artery disease-associated single nucleotide polymorphisms at the SMAD3 locus and characterize mechanisms whereby the risk allele(s) contribute to coronary artery disease risk. By genetic and epigenetic fine mapping, we identified a candidate causal single nucleotide polymorphism rs17293632C>T (D', 0.97; r(2), 0.94 with rs56062135) in intron 1 of SMAD3 with predicted functional effects. We show that the sequence encompassing rs17293632 acts as a strong enhancer in human arterial smooth muscle cells. The common allele (C) preserves an activator protein (AP)-1 site and enhancer function, whereas the protective (T) allele disrupts the AP-1 site and significantly reduces enhancer activity (Pto the (C) allele. We show that rs17293632 is an expression quantitative trait locus for SMAD3 in blood and atherosclerotic plaque with reduced expression of SMAD3 in carriers of the protective allele. Finally, siRNA knockdown of SMAD3 in human arterial smooth muscle cells increases cell viability, consistent with an antiproliferative role. The coronary artery disease-associated rs17293632C>T single nucleotide polymorphism represents a novel functional cis-acting element at the SMAD3 locus. The protective (T) allele of rs17293632 disrupts a consensus AP-1 binding site in a SMAD3 intron 1 enhancer, reduces enhancer activity and SMAD3 expression, altering human arterial smooth muscle cell proliferation. © 2016 American Heart Association, Inc.
Treff, Nathan R; Su, Jing; Kasabwala, Natasha; Tao, Xin; Miller, Kathleen A; Scott, Richard T
This study sought to validate a novel, minimally invasive system for embryo tracking by single nucleotide polymorphism microarray-based DNA fingerprinting of the first polar body. First polar body-based assignments of which embryos implanted and were delivered after multiple ET were 100% consistent with previously validated embryo DNA fingerprinting-based assignments. Copyright 2010 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Khodakov, Dmitriy A.; Khodakova, Anastasia S.; Huang, David M.; Linacre, Adrian; Ellis, Amanda V.
Single nucleotide polymorphisms (SNPs) are a prime source of genetic diversity. Discriminating between different SNPs provides an enormous leap towards the better understanding of the uniqueness of biological systems. Here we report on a new approach for SNP discrimination using toehold-mediated DNA strand displacement. The distinctiveness of the approach is based on the combination of both 3- and 4-way branch migration mechanisms, which allows for reliable discrimination of SNPs within doubl...
Full Text Available Manganese (Mn is an essential micro-nutrient for plants, but flooded rice fields can accumulate high levels of Mn2+ leading to Mn toxicity. Here, we present a genome-wide association study (GWAS to identify candidate loci conferring Mn toxicity tolerance in rice (Oryza sativa L.. A diversity panel of 288 genotypes was grown in hydroponic solutions in a greenhouse under optimal and toxic Mn concentrations. We applied a Mn toxicity treatment (5 ppm Mn2+, 3 weeks at twelve days after transplanting. Mn toxicity caused moderate damage in rice in terms of biomass loss and symptom formation despite extremely high shoot Mn concentrations ranging from 2.4 to 17.4 mg g-1. The tropical japonica subpopulation was more sensitive to Mn toxicity than other subpopulations. Leaf damage symptoms were significantly correlated with Mn uptake into shoots. Association mapping was conducted for seven traits using 416741 single nucleotide polymorphism (SNP markers using a mixed linear model, and detected six significant associations for the traits shoot manganese concentration and relative shoot length. Candidate regions contained genes coding for a heavy metal transporter, peroxidase precursor and Mn2+ ion binding proteins. The significant marker SNP-2.22465867 caused an amino acid change in a gene (LOC_Os02g37170 with unknown function. This study demonstrated significant natural variation in rice for Mn toxicity tolerance and the possibility of using GWAS to unravel genetic factors responsible for such complex traits.
Jinam, Timothy A; Phipps, Maude E; Saitou, Naruya
Southeast Asia houses various culturally and linguistically diverse ethnic groups. In Malaysia, where the Malay, Chinese, and Indian ethnic groups form the majority, there exist minority groups such as the "negritos" who are believed to be descendants of the earliest settlers of Southeast Asia. Here we report patterns of genetic substructure and admixture in two Malaysian negrito populations (Jehai and Kensiu), using ~50,000 genome-wide single-nucleotide polymorphism (SNP) data. We found traces of recent admixture in both the negrito populations, particularly in the Jehai, with the Malay through principal component analysis and STRUCTURE analysis software, which suggested that the admixture was as recent as one generation ago. We also identified significantly differentiated nonsynonymous SNPs and haplotype blocks related to intracellular transport, metabolic processes, and detection of stimulus. These results highlight the different levels of admixture experienced by the two Malaysian negritos. Delineating admixture and differentiated genomic regions should be of importance in designing and interpretation of molecular anthropology and disease association studies. Copyright © 2013 Wayne State University Press, Detroit, Michigan 48201-1309.
Liu, Zhaohua; Ji, Zhibin; Wang, Guizhi; Chao, Tianle; Hou, Lei; Wang, Jianmin
Throughout a long period of adaptation and selection, sheep have thrived in a diverse range of ecological environments. Mongolian sheep is the common ancestor of the Chinese short fat-tailed sheep. Migration to different ecoregions leads to changes in selection pressures and results in microevolution. Mongolian sheep and its subspecies differ in a number of important traits, especially reproductive traits. Genome-wide intraspecific variation is required to dissect the genetic basis of these traits. This research resequenced 3 short fat-tailed sheep breeds with a 43.2-fold coverage of the sheep genome. We report more than 17 million single nucleotide polymorphisms and 2.9 million indels and identify 143 genomic regions with reduced pooled heterozygosity or increased genetic distance to each other breed that represent likely targets for selection during the migration. These regions harbor genes related to developmental processes, cellular processes, multicellular organismal processes, biological regulation, metabolic processes, reproduction, localization, growth and various components of the stress responses. Furthermore, we examined the haplotype diversity of 3 genomic regions involved in reproduction and found significant differences in TSHR and PRL gene regions among 8 sheep breeds. Our results provide useful genomic information for identifying genes or causal mutations associated with important economic traits in sheep and for understanding the genetic basis of adaptation to different ecological environments.
Ai, XianTao; Liang, YaJun; Wang, JunDuo; Zheng, JuYun; Gong, ZhaoLong; Guo, JiangPing; Li, XueYuan; Qu, YanYing
Cotton (Gossypium spp.) is the most important natural textile fiber crop, and Gossypium hirsutum L. is responsible for 90% of the annual cotton crop in the world. Information on cotton genetic diversity and population structure is essential for new breeding lines. In this study, we analyzed population structure and genetic diversity of 288 elite Gossypium hirsutum cultivar accessions collected from around the world, and especially from China, using genome-wide single nucleotide polymorphisms (SNP) markers. The average polymorphsim information content (PIC) was 0.25, indicating a relatively low degree of genetic diversity. Population structure analysis revealed extensive admixture and identified three subgroups. Phylogenetic analysis supported the subgroups identified by STRUCTURE. The results from both population structure and phylogenetic analysis were, for the most part, in agreement with pedigree information. Analysis of molecular variance revealed a larger amount of variation was due to diversity within the groups. Establishment of genetic diversity and population structure from this study could be useful for genetic and genomic analysis and systematic utilization of the standing genetic variation in upland cotton.
Marcio P. Arruda
Full Text Available Fusarium head blight (FHB is one of the most important wheat ( L. diseases worldwide, and host resistance displays complex genetic control. A genome-wide association study (GWAS was performed on 273 winter wheat breeding lines from the midwestern and eastern regions of the United States to identify chromosomal regions associated with FHB resistance. Genotyping-by-sequencing (GBS was used to identify 19,992 single-nucleotide polymorphisms (SNPs covering all 21 wheat chromosomes. Marker–trait associations were performed with different statistical models, the most appropriate being a compressed mixed linear model (cMLM controlling for relatedness and population structure. Ten significant SNP–trait associations were detected on chromosomes 4A, 6A, 7A, 1D, 4D, and 7D, and multiple SNPs were associated with on chromosome 3B. Although combination of favorable alleles of these SNPs resulted in lower levels of severity (SEV, incidence (INC, and deoxynivalenol concentration (DON, lines carrying multiple beneficial alleles were in very low frequency for most traits. These SNPs can now be used for creating new breeding lines with different combinations of favorable alleles. This is one of the first GWAS using genomic resources from the International Wheat Genome Sequencing Consortium (IWGSC.
Full Text Available Bipolar disorder is a common and severe mental illness with unsolved pathophysiology. A genome-wide association study (GWAS has been used to find a number of risk genes, but it is difficult for a GWAS to find genes indirectly associated with a disease. To find core hub genes, we introduce a network analysis after the GWAS was conducted. Six thousand four hundred fifty eight single nucleotide polymorphisms (SNPs with p < 0.01 were sifted out from Wellcome Trust Case Control Consortium (WTCCC dataset and mapped to 2045 genes, which are then compared with the protein–protein network. One hundred twelve genes with a degree >17 were chosen as hub genes from which five significant modules and four core hub genes (FBXL13, WDFY2, bFGF, and MTHFD1L were found. These core hub genes have not been reported to be directly associated with BD but may function by interacting with genes directly related to BD. Our method engenders new thoughts on finding genes indirectly associated with, but important for, complex diseases.
Frackelton Edward C
Full Text Available Abstract Background Human height is considered highly heritable and correlated with certain disorders, such as type 2 diabetes and cancer. Despite environmental influences, genetic factors are known to play an important role in stature determination. A number of genetic determinants of adult height have already been established through genome wide association studies. Methods To examine 51 single nucleotide polymorphisms (SNPs corresponding to the 46 previously reported genomic loci for height in 8,184 European American children with height measurements. We leveraged genotyping data from our ongoing GWA study of height variation in children in order to query the 51 SNPs in this pediatric cohort. Results Sixteen of these SNPs yielded at least nominally significant association to height, representing fifteen different loci including EFEMP1-PNPT1, GPR126, C6orf173, SPAG17, Histone class 1, HLA class III and GDF5-UQCC. Other loci revealed no evidence for association, including HMGA1 and HMGA2. For the 16 associated variants, the genotype score explained 1.64% of the total variation for height z-score. Conclusion Among 46 loci that have been reported to associate with adult height to date, at least 15 also contribute to the determination of height in childhood.
Full Text Available Management of insects that cause economic damage to yields of soybean mainly rely on insecticide applications. Sources of resistance in soybean plant introductions (PIs to different insect pests have been reported, and some of these sources, like for the soybean aphid (SBA, have been used to develop resistant soybean cultivars. With the availability of SoySNP50K and the statistical power of genome-wide association studies, we integrated phenotypic data for beet armyworm, Mexican bean beetle (MBB, potato leafhopper (PLH, SBA, soybean looper (SBL, velvetbean caterpillar (VBC, and chewing damage caused by unspecified insects for a comprehensive understanding of insect resistance in the United States Department of Agriculture Soybean Germplasm Collection. We identified significant single nucleotide (SNP polymorphic markers for MBB, PLH, SBL, and VBC, and we highlighted several leucine-rich repeat-containing genes and myeloblastosis transcription factors within the high linkage disequilibrium region surrounding significant SNP markers. Specifically for soybean resistance to PLH, we found the PLH locus is close but distinct to a locus for soybean pubescence density on chromosome 12. The results provide genetic support that pubescence density may not directly link to PLH resistance. This study offers a novel insight of soybean resistance to four insect pests and reviews resistance mapping studies for major soybean insects.
Arnedo, Javier; Svrakic, Dragan M; Del Val, Coral; Romero-Zaliz, Rocío; Hernández-Cuervo, Helena; Fanous, Ayman H; Pato, Michele T; Pato, Carlos N; de Erausquin, Gabriel A; Cloninger, C Robert; Zwir, Igor
The authors sought to demonstrate that schizophrenia is a heterogeneous group of heritable disorders caused by different genotypic networks that cause distinct clinical syndromes. In a large genome-wide association study of cases with schizophrenia and controls, the authors first identified sets of interacting single-nucleotide polymorphisms (SNPs) that cluster within particular individuals (SNP sets) regardless of clinical status. Second, they examined the risk of schizophrenia for each SNP set and tested replicability in two independent samples. Third, they identified genotypic networks composed of SNP sets sharing SNPs or subjects. Fourth, they identified sets of distinct clinical features that cluster in particular cases (phenotypic sets or clinical syndromes) without regard for their genetic background. Fifth, they tested whether SNP sets were associated with distinct phenotypic sets in a replicable manner across the three studies. The authors identified 42 SNP sets associated with a 70% or greater risk of schizophrenia, and confirmed 34 (81%) or more with similar high risk of schizophrenia in two independent samples. Seventeen networks of SNP sets did not share any SNP or subject. These disjoint genotypic networks were associated with distinct gene products and clinical syndromes (i.e., the schizophrenias) varying in symptoms and severity. Associations between genotypic networks and clinical syndromes were complex, showing multifinality and equifinality. The interactive networks explained the risk of schizophrenia more than the average effects of all SNPs (24%). Schizophrenia is a group of heritable disorders caused by a moderate number of separate genotypic networks associated with several distinct clinical syndromes.
Ben J Hayes
Full Text Available Continued production of food in areas predicted to be most affected by climate change, such as dairy farming regions of Australia, will be a major challenge in coming decades. Along with rising temperatures and water shortages, scarcity of inputs such as high energy feeds is predicted. With the motivation of selecting cattle adapted to these changing environments, we conducted a genome wide association study to detect DNA markers (single nucleotide polymorphisms associated with the sensitivity of milk production to environmental conditions. To do this we combined historical milk production and weather records with dense marker genotypes on dairy sires with many daughters milking across a wide range of production environments in Australia. Markers associated with sensitivity of milk production to feeding level and sensitivity of milk production to temperature humidity index on chromosome nine and twenty nine respectively were validated in two independent populations, one a different breed of cattle. As the extent of linkage disequilibrium across cattle breeds is limited, the underlying causative mutations have been mapped to a small genomic interval containing two promising candidate genes. The validated marker panels we have reported here will aid selection for high milk production under anticipated climate change scenarios, for example selection of sires whose daughters will be most productive at low levels of feeding.
C. G. Dang
Full Text Available Significant SNPs associated with Warner-Bratzler (WB shear force and sensory traits were confirmed for Hanwoo beef (Korean cattle. A Bonferroni-corrected genome-wide significant association (p<1.3×10−6 was detected with only one single nucleotide polymorphism (SNP on chromosome 5 for WB shear force. A slightly higher number of SNPs was significantly (p<0.001 associated with WB shear force than with other sensory traits. Further, 50, 25, 29, and 34 SNPs were significantly associated with WB shear force, tenderness, juiciness, and flavor likeness, respectively. The SNPs between p = 0.001 and p = 0.0001 thresholds explained 3% to 9% of the phenotypic variance, while the most significant SNPs accounted for 7% to 12% of the phenotypic variance. In conclusion, because WB shear force and sensory evaluation were moderately affected by a few loci and minimally affected by other loci, further studies are required by using a large sample size and high marker density.
Amanda A Fox
Full Text Available BACKGROUND: Postoperative ventricular dysfunction (VnD occurs in 9-20% of coronary artery bypass graft (CABG surgical patients and is associated with increased postoperative morbidity and mortality. Understanding genetic causes of postoperative VnD should enhance patient risk stratification and improve treatment and prevention strategies. We aimed to determine if genetic variants associate with occurrence of in-hospital VnD after CABG surgery. METHODS: A genome-wide association study identified single nucleotide polymorphisms (SNPs associated with postoperative VnD in male subjects of European ancestry undergoing isolated primary CABG surgery with cardiopulmonary bypass. VnD was defined as the need for ≥2 inotropes or mechanical ventricular support after CABG surgery. Validated SNPs were assessed further in two replication CABG cohorts and meta-analysis was performed. RESULTS: Over 100 SNPs were associated with VnD (P2.1 of developing in-hospital VnD after CABG surgery. However, three genetic loci identified by meta-analysis were more modestly associated with development of postoperative VnD. Studies of larger cohorts to assess these loci as well as to define other genetic mechanisms and related biology that link genetic variants to postoperative ventricular dysfunction are warranted.
Zila, Charles T.; Samayoa, L. Fernando; Santiago, Rogelio; Butrón, Ana; Holland, James B.
Fusarium ear rot is a common disease of maize that affects food and feed quality globally. Resistance to the disease is highly quantitative, and maize breeders have difficulty incorporating polygenic resistance alleles from unadapted donor sources into elite breeding populations without having a negative impact on agronomic performance. Identification of specific allele variants contributing to improved resistance may be useful to breeders by allowing selection of resistance alleles in coupling phase linkage with favorable agronomic characteristics. We report the results of a genome-wide association study to detect allele variants associated with increased resistance to Fusarium ear rot in a maize core diversity panel of 267 inbred lines evaluated in two sets of environments. We performed association tests with 47,445 single-nucleotide polymorphisms (SNPs) while controlling for background genomic relationships with a mixed model and identified three marker loci significantly associated with disease resistance in at least one subset of environments. Each associated SNP locus had relatively small additive effects on disease resistance (±1.1% on a 0–100% scale), but nevertheless were associated with 3 to 12% of the genotypic variation within or across environment subsets. Two of three identified SNPs colocalized with genes that have been implicated with programmed cell death. An analysis of associated allele frequencies within the major maize subpopulations revealed enrichment for resistance alleles in the tropical/subtropical and popcorn subpopulations compared with other temperate breeding pools. PMID:24048647
Porter Christopher J
Full Text Available Abstract Background SNP microarrays are designed to genotype Single Nucleotide Polymorphisms (SNPs. These microarrays report hybridization of DNA fragments and therefore can be used for the purpose of detecting genomic fragments. Results Here, we demonstrate that a SNP microarray can be effectively used in this way to perform chromatin immunoprecipitation (ChIP on chip as an alternative to tiling microarrays. We illustrate this novel application by mapping whole genome histone H4 hyperacetylation in human myoblasts and myotubes. We detect clusters of hyperacetylated histone H4, often spanning across up to 300 kilobases of genomic sequence. Using complementary genome-wide analyses of gene expression by DNA microarray we demonstrate that these clusters of hyperacetylated histone H4 tend to be associated with expressed genes. Conclusion The use of a SNP array for a ChIP-on-chip application (ChIP on SNP-chip will be of great value to laboratories whose interest is the determination of general rules regarding the relationship of specific chromatin modifications to transcriptional status throughout the genome and to examine the asymmetric modification of chromatin at heterozygous loci.
Bradley J Foresman
Full Text Available Barley yellow dwarf viruses (BYDVs are responsible for the disease barley yellow dwarf (BYD and affect many cereals including oat (Avena sativa L.. Until recently, the molecular marker technology in oat has not allowed for many marker-trait association studies to determine the genetic mechanisms for tolerance. A genome-wide association study (GWAS was performed on 428 spring oat lines using a recently developed high-density oat single nucleotide polymorphism (SNP array as well as a SNP-based consensus map. Marker-trait associations were performed using a Q-K mixed model approach to control for population structure and relatedness. Six significant SNP-trait associations representing two QTL were found on chromosomes 3C (Mrg17 and 18D (Mrg04. This is the first report of BYDV tolerance QTL on chromosome 3C (Mrg17 and 18D (Mrg04. Haplotypes using the two QTL were evaluated and distinct classes for tolerance were identified based on the number of favorable alleles. A large number of lines carrying both favorable alleles were observed in the panel.
Full Text Available Various attempts have been made to predict the individual disease risk based on genotype data from genome-wide association studies (GWAS. However, most studies only investigated one or two classification algorithms and feature encoding schemes. In this study, we applied seven different classification algorithms on GWAS case-control data sets for seven different diseases to create models for disease risk prediction. Further, we used three different encoding schemes for the genotypes of single nucleotide polymorphisms (SNPs and investigated their influence on the predictive performance of these models. Our study suggests that an additive encoding of the SNP data should be the preferred encoding scheme, as it proved to yield the best predictive performances for all algorithms and data sets. Furthermore, our results showed that the differences between most state-of-the-art classification algorithms are not statistically significant. Consequently, we recommend to prefer algorithms with simple models like the linear support vector machine (SVM as they allow for better subsequent interpretation without significant loss of accuracy.
Behar, Doron M; Metspalu, Mait; Baran, Yael; Kopelman, Naama M; Yunusbayev, Bayazit; Gladstein, Ariella; Tzur, Shay; Sahakyan, Hovhannes; Bahmanimehr, Ardeshir; Yepiskoposyan, Levon; Tambets, Kristina; Khusnutdinova, Elza K; Kushniarevich, Alena; Balanovsky, Oleg; Balanovsky, Elena; Kovacevic, Lejla; Marjanovic, Damir; Mihailov, Evelin; Kouvatsi, Anastasia; Triantaphyllidis, Costas; King, Roy J; Semino, Ornella; Torroni, Antonio; Hammer, Michael F; Metspalu, Ene; Skorecki, Karl; Rosset, Saharon; Halperin, Eran; Villems, Richard; Rosenberg, Noah A
The origin and history of the Ashkenazi Jewish population have long been of great interest, and advances in high-throughput genetic analysis have recently provided a new approach for investigating these topics. We and others have argued on the basis of genome-wide data that the Ashkenazi Jewish population derives its ancestry from a combination of sources tracing to both Europe and the Middle East. It has been claimed, however, through a reanalysis of some of our data, that a large part of the ancestry of the Ashkenazi population originates with the Khazars, a Turkic-speaking group that lived to the north of the Caucasus region ~1,000 years ago. Because the Khazar population has left no obvious modern descendants that could enable a clear test for a contribution to Ashkenazi Jewish ancestry, the Khazar hypothesis has been difficult to examine using genetics. Furthermore, because only limited genetic data have been available from the Caucasus region, and because these data have been concentrated in populations that are genetically close to populations from the Middle East, the attribution of any signal of Ashkenazi-Caucasus genetic similarity to Khazar ancestry rather than shared ancestral Middle Eastern ancestry has been problematic. Here, through integration of genotypes from newly collected samples with data from several of our past studies, we have assembled the largest data set available to date for assessment of Ashkenazi Jewish genetic origins. This data set contains genome-wide single-nucleotide polymorphisms in 1,774 samples from 106 Jewish and non-Jewish populations that span the possible regions of potential Ashkenazi ancestry: Europe, the Middle East, and the region historically associated with the Khazar Khaganate. The data set includes 261 samples from 15 populations from the Caucasus region and the region directly to its north, samples that have not previously been included alongside Ashkenazi Jewish samples in genomic studies. Employing a variety of
Bjoerheim, Jens; Abrahamsen, Torveig Weum; Kristensen, Annette Torgunrud; Gaudernack, Gustav; Ekstroem, Per O.
Melting gel techniques have proven to be amenable and powerful tools in point mutation and single nucleotide polymorphism (SNP) analysis. With the introduction of commercially available capillary electrophoresis instruments, a partly automated platform for denaturant capillary electrophoresis with potential for routine screening of selected target sequences has been established. The aim of this article is to demonstrate the use of automated constant denaturant capillary electrophoresis (ACDCE) in single nucleotide polymorphism analysis of various target sequences. Optimal analysis conditions for different single nucleotide polymorphisms on ACDCE are evaluated with the Poland algorithm. Laboratory procedures include only PCR and electrophoresis. For direct genotyping of individual SNPs, the samples are analyzed with an internal standard and the alleles are identified by co-migration of sample and standard peaks. In conclusion, SNPs suitable for melting gel analysis based on theoretical thermodynamics were separated by ACDCE under appropriate conditions. With this instrumentation (ABI 310 Genetic Analyzer), 48 samples could be analyzed without any intervention. Several institutions have capillary instrumentation in-house, thus making this SNP analysis method accessible to large groups of researchers without any need for instrument modification
Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A
The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.
Adkins, Daniel E; Clark, Shaunna L; Copeland, William E; Kennedy, Martin; Conway, Kevin; Angold, Adrian; Maes, Hermine; Liu, Youfang; Kumar, Gaurav; Erkanli, Alaattin; Patkar, Ashwin A; Silberg, Judy; Brown, Tyson H; Fergusson, David M; Horwood, L John; Eaves, Lindon; van den Oord, Edwin J C G; Sullivan, Patrick F; Costello, E J
The public health burden of alcohol is unevenly distributed across the life course, with levels of use, abuse, and dependence increasing across adolescence and peaking in early adulthood. Here, we leverage this temporal patterning to search for common genetic variants predicting developmental trajectories of alcohol consumption. Comparable psychiatric evaluations measuring alcohol consumption were collected in three longitudinal community samples (N=2,126, obs=12,166). Consumption-repeated measurements spanning adolescence and early adulthood were analyzed using linear mixed models, estimating individual consumption trajectories, which were then tested for association with Illumina 660W-Quad genotype data (866,099 SNPs after imputation and QC). Association results were combined across samples using standard meta-analysis methods. Four meta-analysis associations satisfied our pre-determined genome-wide significance criterion (FDR<0.1) and six others met our 'suggestive' criterion (FDR<0.2). Genome-wide significant associations were highly biological plausible, including associations within GABA transporter 1, SLC6A1 (solute carrier family 6, member 1), and exonic hits in LOC100129340 (mitofusin-1-like). Pathway analyses elaborated single marker results, indicating significant enriched associations to intuitive biological mechanisms, including neurotransmission, xenobiotic pharmacodynamics, and nuclear hormone receptors (NHR). These findings underscore the value of combining longitudinal behavioral data and genome-wide genotype information in order to study developmental patterns and improve statistical power in genomic studies.
DeVilbiss, Andrew W; Sanalkumar, Rajendran; Johnson, Kirby D; Keles, Sunduz; Bresnick, Emery H
Hematopoiesis is an exquisitely regulated process in which stem cells in the developing embryo and the adult generate progenitor cells that give rise to all blood lineages. Master regulatory transcription factors control hematopoiesis by integrating signals from the microenvironment and dynamically establishing and maintaining genetic networks. One of the most rudimentary aspects of cell type-specific transcription factor function, how they occupy a highly restricted cohort of cis-elements in chromatin, remains poorly understood. Transformative technologic advances involving the coupling of next-generation DNA sequencing technology with the chromatin immunoprecipitation assay (ChIP-seq) have enabled genome-wide mapping of factor occupancy patterns. However, formidable problems remain; notably, ChIP-seq analysis yields hundreds to thousands of chromatin sites occupied by a given transcription factor, and only a fraction of the sites appear to be endowed with critical, non-redundant function. It has become en vogue to map transcription factor occupancy patterns genome-wide, while using powerful statistical tools to establish correlations to inform biology and mechanisms. With the advent of revolutionary genome editing technologies, one can now reach beyond correlations to conduct definitive hypothesis testing. This review focuses on key discoveries that have emerged during the path from single loci to genome-wide analyses, specifically in the context of hematopoietic transcriptional mechanisms. Copyright © 2014 ISEH - International Society for Experimental Hematology. Published by Elsevier Inc. All rights reserved.
Clive J Hoggart
Full Text Available Testing one SNP at a time does not fully realise the potential of genome-wide association studies to identify multiple causal variants, which is a plausible scenario for many complex diseases. We show that simultaneous analysis of the entire set of SNPs from a genome-wide study to identify the subset that best predicts disease outcome is now feasible, thanks to developments in stochastic search methods. We used a Bayesian-inspired penalised maximum likelihood approach in which every SNP can be considered for additive, dominant, and recessive contributions to disease risk. Posterior mode estimates were obtained for regression coefficients that were each assigned a prior with a sharp mode at zero. A non-zero coefficient estimate was interpreted as corresponding to a significant SNP. We investigated two prior distributions and show that the normal-exponential-gamma prior leads to improved SNP selection in comparison with single-SNP tests. We also derived an explicit approximation for type-I error that avoids the need to use permutation procedures. As well as genome-wide analyses, our method is well-suited to fine mapping with very dense SNP sets obtained from re-sequencing and/or imputation. It can accommodate quantitative as well as case-control phenotypes, covariate adjustment, and can be extended to search for interactions. Here, we demonstrate the power and empirical type-I error of our approach using simulated case-control data sets of up to 500 K SNPs, a real genome-wide data set of 300 K SNPs, and a sequence-based dataset, each of which can be analysed in a few hours on a desktop workstation.
Herold, Christine; Hooli, Basavaraj V.; Mullin, Kristina; Liu, Tian; Roehr, Johannes T; Mattheisen, Manuel; Parrado, Antonio R.; Bertram, Lars; Lange, Christoph; Tanzi, Rudolph E.
The genetic basis of Alzheimer's disease (AD) is complex and heterogeneous. Over 200 highly penetrant pathogenic variants in the genes APP, PSEN1 and PSEN2 cause a subset of early-onset familial Alzheimer's disease (EOFAD). On the other hand, susceptibility to late-onset forms of AD (LOAD) is indisputably associated to the ε4 allele in the gene APOE, and more recently to variants in more than two-dozen additional genes identified in the large-scale genome-wide association studies (GWAS) and meta-analyses reports. Taken together however, although the heritability in AD is estimated to be as high as 80%, a large proportion of the underlying genetic factors still remain to be elucidated. In this study we performed a systematic family-based genome-wide association and meta-analysis on close to 15 million imputed variants from three large collections of AD families (~3,500 subjects from 1,070 families). Using a multivariate phenotype combining affection status and onset age, meta-analysis of the association results revealed three single nucleotide polymorphisms (SNPs) that achieved genome-wide significance for association with AD risk: rs7609954 in the gene PTPRG (P-value = 3.98·10−08), rs1347297 in the gene OSBPL6 (P-value = 4.53·10−08), and rs1513625 near PDCL3 (P-value = 4.28·10−08). In addition, rs72953347 in OSBPL6 (P-value = 6.36·10−07) and two SNPs in the gene CDKAL1 showed marginally significant association with LOAD (rs10456232, P-value: 4.76·10−07; rs62400067, P-value: 3.54·10−07). In summary, family-based GWAS meta-analysis of imputed SNPs revealed novel genomic variants in (or near) PTPRG, OSBPL6, and PDCL3 that influence risk for AD with genome-wide significance. PMID:26830138
Brandon L Pierce
Full Text Available Arsenic contamination of drinking water is a major public health issue in many countries, increasing risk for a wide array of diseases, including cancer. There is inter-individual variation in arsenic metabolism efficiency and susceptibility to arsenic toxicity; however, the basis of this variation is not well understood. Here, we have performed the first genome-wide association study (GWAS of arsenic-related metabolism and toxicity phenotypes to improve our understanding of the mechanisms by which arsenic affects health. Using data on urinary arsenic metabolite concentrations and approximately 300,000 genome-wide single nucleotide polymorphisms (SNPs for 1,313 arsenic-exposed Bangladeshi individuals, we identified genome-wide significant association signals (P<5×10(-8 for percentages of both monomethylarsonic acid (MMA and dimethylarsinic acid (DMA near the AS3MT gene (arsenite methyltransferase; 10q24.32, with five genetic variants showing independent associations. In a follow-up analysis of 1,085 individuals with arsenic-induced premalignant skin lesions (the classical sign of arsenic toxicity and 1,794 controls, we show that one of these five variants (rs9527 is also associated with skin lesion risk (P = 0.0005. Using a subset of individuals with prospectively measured arsenic (n = 769, we show that rs9527 interacts with arsenic to influence incident skin lesion risk (P = 0.01. Expression quantitative trait locus (eQTL analyses of genome-wide expression data from 950 individual's lymphocyte RNA suggest that several of our lead SNPs represent cis-eQTLs for AS3MT (P = 10(-12 and neighboring gene C10orf32 (P = 10(-44, which are involved in C10orf32-AS3MT read-through transcription. This is the largest and most comprehensive genomic investigation of arsenic metabolism and toxicity to date, the only GWAS of any arsenic-related trait, and the first study to implicate 10q24.32 variants in both arsenic metabolism and arsenical
Orozco, Gisela; Goh, Chee L; Al Olama, Ali Amin; Benlloch-Garcia, Sara; Govindasami, Koveela; Guy, Michelle; Muir, Kenneth R; Giles, Graham G; Severi, Gianluca; Neal, David E; Hamdy, Freddie C; Donovan, Jenny L; Kote-Jarai, Zsofia; Easton, Douglas F; Eyre, Steve; Eeles, Rosalind A
WHAT'S KNOWN ON THE SUBJECT? AND WHAT DOES THE STUDY ADD?: The link between inflammation and cancer has long been reported and inflammation is thought to play a role in the pathogenesis of many cancers, including prostate cancer (PrCa). Over the last 5 years, genome-wide association studies (GWAS) have reported numerous susceptibility loci that predispose individuals to many different traits. The present study aims to ascertain if there are common genetic risk profiles that might predispose individuals to both PrCa and the autoimmune inflammatory condition, rheumatoid arthritis. These results could have potential public heath impact in terms of screening and chemoprevention. To investigate if potential common pathways exist for the pathogenesis of autoimmune disease and prostate cancer (PrCa). To ascertain if the single nucleotide polymorphisms (SNPs) reported by genome-wide association studies (GWAS) as being associated with susceptibility to PrCa are also associated with susceptibility to the autoimmune disease rheumatoid arthritis (RA). The original Wellcome Trust Case Control Consortium (WTCCC) UK RA GWAS study was expanded to include a total of 3221 cases and 5272 controls. In all, 37 germline autosomal SNPs at genome-wide significance associated with PrCa risk were identified from a UK/Australian PrCa GWAS. Allele frequencies were compared for these 37 SNPs between RA cases and controls using a chi-squared trend test and corrected for multiple testing (Bonferroni). In all, 33 SNPs were able to be analysed in the RA dataset. Proxies could not be located for the SNPs in 3q26, 5p15 and for two SNPs in 17q12. After applying a Bonferroni correction for the number of SNPs tested, the SNP mapping to CCHCR1 (rs130067) retained statistically significant evidence for association (P = 6 × 10(-4) ; odds ratio [OR] = 1.15, 95% CI: 1.06-1.24); this has also been associated with psoriasis. However, further analyses showed that the association of this allele was due to
Full Text Available DNA methylation plays a central role in regulating many aspects of growth and development in mammals through regulating gene expression. The development of next generation sequencing technologies have paved the way for genome-wide, high resolution analysis of DNA methylation landscapes using methodology known as reduced representation bisulfite sequencing (RRBS. While RRBS has proven to be effective in understanding DNA methylation landscapes in humans, mice, and rats, to date, few studies have utilised this powerful method for investigating DNA methylation in agricultural animals. Here we describe the utilisation of RRBS to investigate DNA methylation in sheep Longissimus dorsi muscles. RRBS analysis of ∼1% of the genome from Longissimus dorsi muscles provided data of suitably high precision and accuracy for DNA methylation analysis, at all levels of resolution from genome-wide to individual nucleotides. Combining RRBS data with mRNAseq data allowed the sheep Longissimus dorsi muscle methylome to be compared with methylomes from other species. While some species differences were identified, many similarities were observed between DNA methylation patterns in sheep and other more commonly studied species. The RRBS data presented here highlights the complexity of epigenetic regulation of genes. However, the similarities observed across species are promising, in that knowledge gained from epigenetic studies in human and mice may be applied, with caution, to agricultural species. The ability to accurately measure DNA methylation in agricultural animals will contribute an additional layer of information to the genetic analyses currently being used to maximise production gains in these species.
Full Text Available Hepatitis C virus (HCV establishes a chronic infection in 70-80% of infected individuals. Many researchers have examined the effect of human leukocyte antigen (HLA on viral persistence because of its critical role in the immune response against exposure to HCV, but almost all studies have proven to be inconclusive. To identify genetic risk factors for chronic HCV infection, we analyzed 458,207 single nucleotide polymorphisms (SNPs in 481 chronic HCV patients and 2,963 controls in a Japanese cohort. Next, we performed a replication study with an independent panel of 4,358 cases and 1,114 controls. We further confirmed the association in 1,379 cases and 25,817 controls. In the GWAS phase, we found 17 SNPs that showed suggestive association (P < 1 × 10⁻⁵. After the first replication study, we found one intronic SNP in the HLA-DQ locus associated with chronic HCV infection, and when we combined the two studies, the association reached the level of genome-wide significance. In the second replication study, we again confirmed the association (P(combined = 3.59 × 10⁻¹⁶, odds ratio [OR] = 0.79. Subsequent analysis revealed another SNP, rs1130380, with a stronger association (OR=0.72. This nucleotide substitution causes an amino acid substitution (R55P in the HLA-DQB1 protein specific to the DQB1*03 allele, which is common worldwide. In addition, we confirmed an association with the previously reported IFNL3-IFNL4 locus and propose that the effect of DQB1*03 on HCV persistence might be affected by the IFNL4 polymorphism. Our findings suggest that a common amino acid substitution in HLA-DQB1 affects susceptibility to chronic infection with HCV in the Japanese population and may not be independent of the IFNL4 genotype.
Evangelou, Evangelos; Fellay, Jacques; Colombo, Sara
Discussion on improving the power of genome-wide association studies to identify candidate variants and genes is generally centered on issues of maximizing sample size; less attention is given to the role of phenotype definition and ascertainment. The authors used genome-wide data from patients...... infected with human immunodeficiency virus type 1 (HIV-1) to assess whether differences in type of population (622 seroconverters vs. 636 seroprevalent subjects) or the number of measurements available for defining the phenotype resulted in differences in the effect sizes of associations between single...... available, particularly among seroconverters and for variants that achieved genome-wide significance. Differences in phenotype definition and ascertainment may affect the estimated magnitude of genetic effects and should be considered in optimizing power for discovering new associations....
Zeng, Lingwen; Xiao, Zhuo
A lateral flow biosensor (LFB) is introduced for the detection of single nucleotide polymorphisms (SNPs). The assay is composed of two steps: circular strand displacement reaction and lateral flow biosensor detection. In step 1, the nucleotide at SNP site is recognized by T4 DNA ligase and the signal is amplified by strand displacement DNA polymerase, which can be accomplished at a constant temperature. In step 2, the reaction product of step 1 is detected by a lateral flow biosensor, which is a rapid and cost effective tool for nuclei acid detection. Comparing with conventional methods, it requires no complicated machines. It is suitable for the use of point of care diagnostics. Therefore, this simple, cost effective, robust, and promising LFB detection method of SNP has great potential for the detection of genetic diseases, personalized medicine, cancer related mutations, and drug-resistant mutations of infectious agents.
Hand Melanie L
Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs provide essential tools for the advancement of research in plant genomics, and the development of SNP resources for many species has been accelerated by the capabilities of second-generation sequencing technologies. The current study aimed to develop and use a novel bioinformatic pipeline to generate a comprehensive collection of SNP markers within the agriculturally important pasture grass tall fescue; an outbreeding allopolyploid species displaying three distinct morphotypes: Continental, Mediterranean and rhizomatous. Results A bioinformatic pipeline was developed that successfully identified SNPs within genotypes from distinct tall fescue morphotypes, following the sequencing of 414 polymerase chain reaction (PCR – generated amplicons using 454 GS FLX technology. Equivalent amplicon sets were derived from representative genotypes of each morphotype, including six Continental, five Mediterranean and one rhizomatous. A total of 8,584 and 2,292 SNPs were identified with high confidence within the Continental and Mediterranean morphotypes respectively. The success of the bioinformatic approach was demonstrated through validation (at a rate of 70% of a subset of 141 SNPs using both SNaPshot™ and GoldenGate™ assay chemistries. Furthermore, the quantitative genotyping capability of the GoldenGate™ assay revealed that approximately 30% of the putative SNPs were accessible to co-dominant scoring, despite the hexaploid genome structure. The sub-genome-specific origin of each SNP validated from Continental tall fescue was predicted using a phylogenetic approach based on comparison with orthologous sequences from predicted progenitor species. Conclusions Using the appropriate bioinformatic approach, amplicon resequencing based on 454 GS FLX technology is an effective method for the identification of polymorphic SNPs within the genomes of Continental and Mediterranean tall fescue. The
Tönjes, Anke; Scholz, Markus; Krüger, Jacqueline; Krause, Kerstin; Schleinitz, Dorit; Kirsten, Holger; Gebhardt, Claudia; Marzi, Carola; Grallert, Harald; Ladenvall, Claes; Heyne, Henrike; Laurila, Esa; Kriebel, Jennifer; Meisinger, Christa; Rathmann, Wolfgang; Gieger, Christian; Groop, Leif; Prokopenko, Inga; Isomaa, Bo; Beutner, Frank; Kratzsch, Jürgen; Fischer-Rosinsky, Antje; Pfeiffer, Andreas; Krohn, Knut; Spranger, Joachim; Thiery, Joachim; Blüher, Matthias; Stumvoll, Michael; Kovacs, Peter
Progranulin is a secreted protein with important functions in processes including immune and inflammatory response, metabolism and embryonic development. The present study aimed at identification of genetic factors determining progranulin concentrations. We conducted a genome-wide association meta-analysis for serum progranulin in three independent cohorts from Europe: Sorbs (N = 848) and KORA (N = 1628) from Germany and PPP-Botnia (N = 335) from Finland (total N = 2811). Single nucleotide polymorphisms (SNPs) associated with progranulin levels were replicated in two additional German cohorts: LIFE-Heart Study (Leipzig; N = 967) and Metabolic Syndrome Berlin Potsdam (Berlin cohort; N = 833). We measured mRNA expression of genes in peripheral blood mononuclear cells (PBMC) by micro-arrays and performed mRNA expression quantitative trait and expression-progranulin association studies to functionally substantiate identified loci. Finally, we conducted siRNA silencing experiments in vitro to validate potential candidate genes within the associated loci. Heritability of circulating progranulin levels was estimated at 31.8% and 26.1% in the Sorbs and LIFE-Heart cohort, respectively. SNPs at three loci reached study-wide significance (rs660240 in CELSR2-PSRC1-MYBPHL-SORT1, rs4747197 in CDH23-PSAP and rs5848 in GRN) explaining 19.4%/15.0% of the variance and 61%/57% of total heritability in the Sorbs/LIFE-Heart Study. The strongest evidence for association was at rs660240 (P = 5.75 × 10-50), which was also associated with mRNA expression of PSRC1 in PBMC (P = 1.51 ×