WorldWideScience

Sample records for actual genome-wide single-nucleotide

  1. Application of genome-wide single nucleotide polymorphism typing: simple association and beyond.

    Directory of Open Access Journals (Sweden)

    J Raphael Gibbs

    2006-10-01

    Full Text Available The International HapMap Project and the arrival of technologies that type more than 100,000 SNPs in a single experiment have made genome-wide single nucleotide polymorphism (GW-SNP assay a realistic endeavor. This has sparked considerable debate regarding the promise of GW-SNP typing to identify genetic association in disease. As has already been shown, this approach has the potential to localize common genetic variation underlying disease risk. The data provided from this technology also lends itself to several other lines of investigation; autozygosity mapping in consanguineous families and outbred populations, direct detection of structural variation, admixture analysis, and other population genetic approaches. In this review we will discuss the potential uses and practical application of GW-SNP typing including those above and beyond simple association testing.

  2. A comparison in association and linkage genome-wide scans for alcoholism susceptibility genes using single-nucleotide polymorphisms.

    Science.gov (United States)

    Chiu, Yen-Feng; Liu, Su-Yun; Tsai, Ya-Yu

    2005-12-30

    We conducted genome-wide linkage scans using both microsatellite and single-nucleotide polymorphism (SNP) markers. Regions showing the strongest evidence of linkage to alcoholism susceptibility genes were identified. Haplotype analyses using a sliding-window approach for SNPs in these regions were performed. In addition, we performed a genome-wide association scan using SNP data. SNPs in these regions with evidence of association (P alcoholism (the most significant SNP had a p-value of 0.030) as those identified from association genomic screening (the most significant SNP had a p-value of 2.0 x 10(-8)).

  3. Genome-wide analysis of neuroblastomas using high-density single nucleotide polymorphism arrays.

    Directory of Open Access Journals (Sweden)

    Rani E George

    Full Text Available BACKGROUND: Neuroblastomas are characterized by chromosomal alterations with biological and clinical significance. We analyzed paired blood and primary tumor samples from 22 children with high-risk neuroblastoma for loss of heterozygosity (LOH and DNA copy number change using the Affymetrix 10K single nucleotide polymorphism (SNP array. FINDINGS: Multiple areas of LOH and copy number gain were seen. The most commonly observed area of LOH was on chromosome arm 11q (15/22 samples; 68%. Chromosome 11q LOH was highly associated with occurrence of chromosome 3p LOH: 9 of the 15 samples with 11q LOH had concomitant 3p LOH (P = 0.016. Chromosome 1p LOH was seen in one-third of cases. LOH events on chromosomes 11q and 1p were generally accompanied by copy number loss, indicating hemizygous deletion within these regions. The one exception was on chromosome 11p, where LOH in all four cases was accompanied by normal copy number or diploidy, implying uniparental disomy. Gain of copy number was most frequently observed on chromosome arm 17q (21/22 samples; 95% and was associated with allelic imbalance in six samples. Amplification of MYCN was also noted, and also amplification of a second gene, ALK, in a single case. CONCLUSIONS: This analysis demonstrates the power of SNP arrays for high-resolution determination of LOH and DNA copy number change in neuroblastoma, a tumor in which specific allelic changes drive clinical outcome and selection of therapy.

  4. Genome-Wide Association Study between Single Nucleotide Polymorphisms and Flight Speed in Nellore Cattle

    Science.gov (United States)

    Valente, Tiago Silva; Baldi, Fernando; Sant’Anna, Aline Cristina; Albuquerque, Lucia Galvão; Paranhos da Costa, Mateus José Rodrigues

    2016-01-01

    Introduction Cattle temperament is an important factor that affects the profitability of beef cattle enterprises, due to its relationship with productivity traits, animal welfare and labor safety. Temperament is a complex phenotype often assessed by measuring a series of behavioral traits, which result from the effects of multiple environmental and genetic factors, and their interactions. The aims of this study were to perform a genome-wide association study and detect genomic regions, potential candidate genes and their biological mechanisms underlying temperament, measured by flight speed (FS) test in Nellore cattle. Materials and Methods The genome-wide association study (GWAS) was performed using a single-step procedure (ssGBLUP) which combined simultaneously all 16,600 phenotypes from genotyped and non-genotyped animals, full pedigree information of 162,645 animals and 1,384 genotyped animals in one step. The animals were genotyped with High Density Bovine SNP BeadChip which contains 777,962 SNP markers. After quality control (QC) a total of 455,374 SNPs remained. Results Heritability estimated for FS was 0.21 ± 0.02. Consecutive SNPs explaining 1% or more of the total additive genetic variance were considered as windows associated with FS. Nine candidate regions located on eight different Bos taurus chromosomes (BTA) (1 at 73 Mb, 2 at 65 Mb, 5 at 22 Mb and 119 Mb, 9 at 98 Mb, 11 at 67 Mb, 15 at 16 Mb, 17 at 63 Kb, and 26 at 47 Mb) were identified. The candidate genes identified in these regions were NCKAP5 (BTA2), PARK2 (BTA9), ANTXR1 (BTA11), GUCY1A2 (BTA15), CPE (BTA17) and DOCK1 (BTA26). Among these genes PARK2, GUCY1A2, CPE and DOCK1 are related to dopaminergic system, memory formation, biosynthesis of peptide hormone and neurotransmitter and brain development, respectively. Conclusions Our findings allowed us to identify nine genomic regions (SNP windows) associated with beef cattle temperament, measured by FS test. Within these windows, six promising

  5. Genome-Wide Association Study between Single Nucleotide Polymorphisms and Flight Speed in Nellore Cattle.

    Directory of Open Access Journals (Sweden)

    Tiago Silva Valente

    Full Text Available Cattle temperament is an important factor that affects the profitability of beef cattle enterprises, due to its relationship with productivity traits, animal welfare and labor safety. Temperament is a complex phenotype often assessed by measuring a series of behavioral traits, which result from the effects of multiple environmental and genetic factors, and their interactions. The aims of this study were to perform a genome-wide association study and detect genomic regions, potential candidate genes and their biological mechanisms underlying temperament, measured by flight speed (FS test in Nellore cattle.The genome-wide association study (GWAS was performed using a single-step procedure (ssGBLUP which combined simultaneously all 16,600 phenotypes from genotyped and non-genotyped animals, full pedigree information of 162,645 animals and 1,384 genotyped animals in one step. The animals were genotyped with High Density Bovine SNP BeadChip which contains 777,962 SNP markers. After quality control (QC a total of 455,374 SNPs remained.Heritability estimated for FS was 0.21 ± 0.02. Consecutive SNPs explaining 1% or more of the total additive genetic variance were considered as windows associated with FS. Nine candidate regions located on eight different Bos taurus chromosomes (BTA (1 at 73 Mb, 2 at 65 Mb, 5 at 22 Mb and 119 Mb, 9 at 98 Mb, 11 at 67 Mb, 15 at 16 Mb, 17 at 63 Kb, and 26 at 47 Mb were identified. The candidate genes identified in these regions were NCKAP5 (BTA2, PARK2 (BTA9, ANTXR1 (BTA11, GUCY1A2 (BTA15, CPE (BTA17 and DOCK1 (BTA26. Among these genes PARK2, GUCY1A2, CPE and DOCK1 are related to dopaminergic system, memory formation, biosynthesis of peptide hormone and neurotransmitter and brain development, respectively.Our findings allowed us to identify nine genomic regions (SNP windows associated with beef cattle temperament, measured by FS test. Within these windows, six promising candidate genes and their biological functions were

  6. Genome-wide analysis of single nucleotide polymorphisms uncovers population structure in Northern Europe.

    Directory of Open Access Journals (Sweden)

    Elina Salmela

    Full Text Available BACKGROUND: Genome-wide data provide a powerful tool for inferring patterns of genetic variation and structure of human populations. PRINCIPAL FINDINGS: In this study, we analysed almost 250,000 SNPs from a total of 945 samples from Eastern and Western Finland, Sweden, Northern Germany and Great Britain complemented with HapMap data. Small but statistically significant differences were observed between the European populations (F(ST = 0.0040, p<10(-4, also between Eastern and Western Finland (F(ST = 0.0032, p<10(-3. The latter indicated the existence of a relatively strong autosomal substructure within the country, similar to that observed earlier with smaller numbers of markers. The Germans and British were less differentiated than the Swedes, Western Finns and especially the Eastern Finns who also showed other signs of genetic drift. This is likely caused by the later founding of the northern populations, together with subsequent founder and bottleneck effects, and a smaller population size. Furthermore, our data suggest a small eastern contribution among the Finns, consistent with the historical and linguistic background of the population. SIGNIFICANCE: Our results warn against a priori assumptions of homogeneity among Finns and other seemingly isolated populations. Thus, in association studies in such populations, additional caution for population structure may be necessary. Our results illustrate that population history is often important for patterns of genetic variation, and that the analysis of hundreds of thousands of SNPs provides high resolution also for population genetics.

  7. Genome-wide association mapping for wood characteristics in Populus identifies an array of candidate single nucleotide polymorphisms.

    Science.gov (United States)

    Porth, Ilga; Klapšte, Jaroslav; Skyba, Oleksandr; Hannemann, Jan; McKown, Athena D; Guy, Robert D; DiFazio, Stephen P; Muchero, Wellington; Ranjan, Priya; Tuskan, Gerald A; Friedmann, Michael C; Ehlting, Juergen; Cronk, Quentin C B; El-Kassaby, Yousry A; Douglas, Carl J; Mansfield, Shawn D

    2013-11-01

    Establishing links between phenotypes and molecular variants is of central importance to accelerate genetic improvement of economically important plant species. Our work represents the first genome-wide association study to the inherently complex and currently poorly understood genetic architecture of industrially relevant wood traits. Here, we employed an Illumina Infinium 34K single nucleotide polymorphism (SNP) genotyping array that generated 29,233 high-quality SNPs in c. 3500 broad-based candidate genes within a population of 334 unrelated Populus trichocarpa individuals to establish genome-wide associations. The analysis revealed 141 significant SNPs (α ≤ 0.05) associated with 16 wood chemistry/ultrastructure traits, individually explaining 3-7% of the phenotypic variance. A large set of associations (41% of all hits) occurred in candidate genes preselected for their suggested a priori involvement with secondary growth. For example, an allelic variant in the FRA8 ortholog explained 21% of the total genetic variance in fiber length, when the trait's heritability estimate was considered. The remaining associations identified SNPs in genes not previously implicated in wood or secondary wall formation. Our findings provide unique insights into wood trait architecture and support efforts for population improvement based on desirable allelic variants.

  8. Phylogeography and adaptation genetics of stickleback from the Haida Gwaii archipelago revealed using genome-wide single nucleotide polymorphism genotyping.

    Science.gov (United States)

    Deagle, Bruce E; Jones, Felicity C; Absher, Devin M; Kingsley, David M; Reimchen, Thomas E

    2013-04-01

    Threespine stickleback populations are model systems for studying adaptive evolution and the underlying genetics. In lakes on the Haida Gwaii archipelago (off western Canada), stickleback have undergone a remarkable local radiation and show phenotypic diversity matching that seen throughout the species distribution. To provide a historical context for this radiation, we surveyed genetic variation at >1000 single nucleotide polymorphism (SNP) loci in stickleback from over 100 populations. SNPs included markers evenly distributed throughout genome and candidate SNPs tagging adaptive genomic regions. Based on evenly distributed SNPs, the phylogeographic pattern differs substantially from the disjunct pattern previously observed between two highly divergent mtDNA lineages. The SNP tree instead shows extensive within watershed population clustering and different watersheds separated by short branches deep in the tree. These data are consistent with separate colonizations of most watersheds, despite underlying genetic connections between some independent drainages. This supports previous suppositions that morphological diversity observed between watersheds has been shaped independently, with populations exhibiting complete loss of lateral plates and giant size each occurring in several distinct clades. Throughout the archipelago, we see repeated selection of SNPs tagging candidate freshwater adaptive variants at several genomic regions differentiated between marine-freshwater populations on a global scale (e.g. EDA, Na/K ATPase). In estuarine sites, both marine and freshwater allelic variants were commonly detected. We also found typically marine alleles present in a few freshwater lakes, especially those with completely plated morphology. These results provide a general model for postglacial colonization of freshwater habitat by sticklebacks and illustrate the tremendous potential of genome-wide SNP data sets hold for resolving patterns and processes underlying recent

  9. Genome-wide divergence and linkage disequilibrium analyses for Capsicum baccatum revealed by genome-anchored single nucleotide polymorphisms

    Science.gov (United States)

    Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to show the distribution of these 2 important incompatible cultivated pepper species. Estimated mean nucleotide...

  10. Word Reading Fluency: Role of Genome-Wide Single-Nucleotide Polymorphisms in Developmental Stability and Correlations with Print Exposure

    Science.gov (United States)

    Harlaar, Nicole; Trzaskowski, Maciej; Dale, Philip S.; Plomin, Robert

    2014-01-01

    The genetic effects on individual differences in reading development were examined using genome-wide complex trait analysis (GCTA) in a twin sample. In unrelated individuals (one twin per pair, n = 2,942), the GCTA-based heritability of reading fluency was ~20%-29% at ages 7 and 12. GCTA bivariate results showed that the phenotypic stability of…

  11. Estimating Additive and Non-Additive Genetic Variances and Predicting Genetic Merits Using Genome-Wide Dense Single Nucleotide Polymorphism Markers

    DEFF Research Database (Denmark)

    Su, Guosheng; Christensen, Ole Fredslund; Ostersen, Tage;

    2012-01-01

    Non-additive genetic variation is usually ignored when genome-wide markers are used to study the genetic architecture and genomic prediction of complex traits in human, wild life, model organisms or farm animals. However, non-additive genetic effects may have an important contribution to total...... genetic variation of complex traits. This study presented a genomic BLUP model including additive and non-additive genetic effects, in which additive and non-additive genetic relation matrices were constructed from information of genome-wide dense single nucleotide polymorphism (SNP) markers. In addition...... of genomic predictions for daily gain in pigs. In the analysis of daily gain, four linear models were used: 1) a simple additive genetic model (MA), 2) a model including both additive and additive by additive epistatic genetic effects (MAE), 3) a model including both additive and dominance genetic effects...

  12. Genome-wide analysis of single nucleotide polymorphisms in patients with atrophic age-related macular degeneration in oldest old Han Chinese.

    Science.gov (United States)

    Zhou, T Q; Guan, H J; Hu, J Y

    2015-12-21

    The aim of this study was to identify disease-associated loci in oldest old Han Chinese with atrophic age-related macular degeneration (AMD). This genome-wide association study (GWAS) only included oldest old (≥95 years old) subjects in Rugao County, China. Thirty atrophic AMD patients and 47 age-matched non-AMD controls were enrolled. The study subjects underwent a complete ophthalmic examination. Genomic DNA was extracted from peripheral blood samples. Single nucleotide polymorphisms (SNPs) were scanned by Genome-Wide Human Mapping SNP 6.0 Arrays and GeneChip Scanner 3000 7G. The results were read and analyzed by the Affymetrix Genotyping Console software. We filtered out the SNPs with a no-call rate ≥10%, MAF P old Han Chinese population. This finding may lead to new strategies for screening of atrophic AMD for Han Chinese.

  13. Genome-wide patterns of recombination, linkage disequilibrium and nucleotide diversity from pooled resequencing and single nucleotide polymorphism genotyping unlock the evolutionary history of Eucalyptus grandis.

    Science.gov (United States)

    Silva-Junior, Orzenil B; Grattapaglia, Dario

    2015-11-01

    We used high-density single nucleotide polymorphism (SNP) data and whole-genome pooled resequencing to examine the landscape of population recombination (ρ) and nucleotide diversity (ϴw ), assess the extent of linkage disequilibrium (r(2) ) and build the highest density linkage maps for Eucalyptus. At the genome-wide level, linkage disequilibrium (LD) decayed within c. 4-6 kb, slower than previously reported from candidate gene studies, but showing considerable variation from absence to complete LD up to 50 kb. A sharp decrease in the estimate of ρ was seen when going from short to genome-wide inter-SNP distances, highlighting the dependence of this parameter on the scale of observation adopted. Recombination was correlated with nucleotide diversity, gene density and distance from the centromere, with hotspots of recombination enriched for genes involved in chemical reactions and pathways of the normal metabolic processes. The high nucleotide diversity (ϴw = 0.022) of E. grandis revealed that mutation is more important than recombination in shaping its genomic diversity (ρ/ϴw = 0.645). Chromosome-wide ancestral recombination graphs allowed us to date the split of E. grandis (1.7-4.8 million yr ago) and identify a scenario for the recent demographic history of the species. Our results have considerable practical importance to Genome Wide Association Studies (GWAS), while indicating bright prospects for genomic prediction of complex phenotypes in eucalypt breeding.

  14. Genome-wide dynamic transcriptional profiling in clostridium beijerinckii NCIMB 8052 using single-nucleotide resolution RNA-Seq

    Directory of Open Access Journals (Sweden)

    Wang Yi

    2012-03-01

    Full Text Available Abstract Background Clostridium beijerinckii is a prominent solvent-producing microbe that has great potential for biofuel and chemical industries. Although transcriptional analysis is essential to understand gene functions and regulation and thus elucidate proper strategies for further strain improvement, limited information is available on the genome-wide transcriptional analysis for C. beijerinckii. Results The genome-wide transcriptional dynamics of C. beijerinckii NCIMB 8052 over a batch fermentation process was investigated using high-throughput RNA-Seq technology. The gene expression profiles indicated that the glycolysis genes were highly expressed throughout the fermentation, with comparatively more active expression during acidogenesis phase. The expression of acid formation genes was down-regulated at the onset of solvent formation, in accordance with the metabolic pathway shift from acidogenesis to solventogenesis. The acetone formation gene (adc, as a part of the sol operon, exhibited highly-coordinated expression with the other sol genes. Out of the > 20 genes encoding alcohol dehydrogenase in C. beijerinckii, Cbei_1722 and Cbei_2181 were highly up-regulated at the onset of solventogenesis, corresponding to their key roles in primary alcohol production. Most sporulation genes in C. beijerinckii 8052 demonstrated similar temporal expression patterns to those observed in B. subtilis and C. acetobutylicum, while sporulation sigma factor genes sigE and sigG exhibited accelerated and stronger expression in C. beijerinckii 8052, which is consistent with the more rapid forespore and endspore development in this strain. Global expression patterns for specific gene functional classes were examined using self-organizing map analysis. The genes associated with specific functional classes demonstrated global expression profiles corresponding to the cell physiological variation and metabolic pathway switch. Conclusions The results from this

  15. Detection of Hereditary 1,25-Hydroxyvitamin D-Resistant Rickets Caused by Uniparental Disomy of Chromosome 12 Using Genome-Wide Single Nucleotide Polymorphism Array

    Science.gov (United States)

    Tamura, Mayuko; Isojima, Tsuyoshi; Kawashima, Minae; Yoshida, Hideki; Yamamoto, Keiko; Kitaoka, Taichi; Namba, Noriyuki; Oka, Akira; Ozono, Keiichi; Tokunaga, Katsushi; Kitanaka, Sachiko

    2015-01-01

    Context Hereditary 1,25-dihydroxyvitamin D-resistant rickets (HVDRR) is an autosomal recessive disease caused by biallelic mutations in the vitamin D receptor (VDR) gene. No patients have been reported with uniparental disomy (UPD). Objective Using genome-wide single nucleotide polymorphism (SNP) array to confirm whether HVDRR was caused by UPD of chromosome 12. Materials and Methods A 2-year-old girl with alopecia and short stature and without any family history of consanguinity was diagnosed with HVDRR by typical laboratory data findings and clinical features of rickets. Sequence analysis of VDR was performed, and the origin of the homozygous mutation was investigated by target SNP sequencing, short tandem repeat analysis, and genome-wide SNP array. Results The patient had a homozygous p.Arg73Ter nonsense mutation. Her mother was heterozygous for the mutation, but her father was negative. We excluded gross deletion of the father’s allele or paternal discordance. Genome-wide SNP array of the family (the patient and her parents) showed complete maternal isodisomy of chromosome 12. She was successfully treated with high-dose oral calcium. Conclusions This is the first report of HVDRR caused by UPD, and the third case of complete UPD of chromosome 12, in the published literature. Genome-wide SNP array was useful for detecting isodisomy and the parental origin of the allele. Comprehensive examination of the homozygous state is essential for accurate genetic counseling of recurrence risk and appropriate monitoring for other chromosome 12 related disorders. Furthermore, oral calcium therapy was effective as an initial treatment for rickets in this instance. PMID:26153892

  16. Detection of Hereditary 1,25-Hydroxyvitamin D-Resistant Rickets Caused by Uniparental Disomy of Chromosome 12 Using Genome-Wide Single Nucleotide Polymorphism Array.

    Directory of Open Access Journals (Sweden)

    Mayuko Tamura

    Full Text Available Hereditary 1,25-dihydroxyvitamin D-resistant rickets (HVDRR is an autosomal recessive disease caused by biallelic mutations in the vitamin D receptor (VDR gene. No patients have been reported with uniparental disomy (UPD.Using genome-wide single nucleotide polymorphism (SNP array to confirm whether HVDRR was caused by UPD of chromosome 12.A 2-year-old girl with alopecia and short stature and without any family history of consanguinity was diagnosed with HVDRR by typical laboratory data findings and clinical features of rickets. Sequence analysis of VDR was performed, and the origin of the homozygous mutation was investigated by target SNP sequencing, short tandem repeat analysis, and genome-wide SNP array.The patient had a homozygous p.Arg73Ter nonsense mutation. Her mother was heterozygous for the mutation, but her father was negative. We excluded gross deletion of the father's allele or paternal discordance. Genome-wide SNP array of the family (the patient and her parents showed complete maternal isodisomy of chromosome 12. She was successfully treated with high-dose oral calcium.This is the first report of HVDRR caused by UPD, and the third case of complete UPD of chromosome 12, in the published literature. Genome-wide SNP array was useful for detecting isodisomy and the parental origin of the allele. Comprehensive examination of the homozygous state is essential for accurate genetic counseling of recurrence risk and appropriate monitoring for other chromosome 12 related disorders. Furthermore, oral calcium therapy was effective as an initial treatment for rickets in this instance.

  17. 鼠疫菌全基因组单核苷酸多态性研究进展%Genome-wide single nucleotide polymorphism of Yersinia pestis

    Institute of Scientific and Technical Information of China (English)

    王娜

    2011-01-01

    Single nucleotide polymorphisms (SNPs) mainly refer to the polymorphism of DNA sequence caused by a single nucleotide mutation, including the synonymous SNPs and non- synonymous SNPs. With the rapid development of sequencing technology, a large number of bacterial genome sequences are available. So, it's possible to identify potential SNPs sites by sequencing technology and bioinformatics methods. Also, SNPs, because of their own characteristics, have been widely used as a new molecular marker in bacterial genotyping, evolution and epidemiology research. In this paper, advances in the research on the genome-wide search of SNPs sites and analysis of the Yersinia pestis microevolution based on SNPs data are reviewed.%单核苷酸多态性(single nucleotide polymorphisms,SNPs)主要是指在基因组水平上由单个核苷酸的变异所引起的DNA序列多态性,包括同义SNPs(synonymous SNPs,sSNPs)和非同义SNPs(non-synonymous SNPs,nSNPs).随着测序技术的迅速发展,获得了大量细菌全基因组序列,使得通过测序技术及生物信息学方法寻找潜在的SNPs位点成为可能.并且,由于SNPs本身的特性,使其作为一种新的分子标记,在细菌分型与进化、流行病学调查研究中得到广泛应用.该文主要阐述基于全基因组寻找SNPs位点,并建立以SNPs数据为基础的鼠疫菌微进化研究分析的研究进展状况.

  18. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis

    Science.gov (United States)

    2011-09-01

    Almasy, L, Blangero, J. (2009) Human QTL linkage mapping. Genetica 136:333-340. Amos, CI. (2007) Successful design and conduct of genome-wide...quantitative trait loci. Genetica 136:237-243. Skol AD, Scott LJ, Abecasis GR, Boehnke M. (2006) Joint analysis is more efficient than replication

  19. A 2-Stage Genome-Wide Association Study to Identify Single Nucleotide Polymorphisms Associated With Development of Erectile Dysfunction Following Radiation Therapy for Prostate Cancer

    Energy Technology Data Exchange (ETDEWEB)

    Kerns, Sarah L. [Department of Radiation Oncology, Mount Sinai School of Medicine, New York, New York (United States); Departments of Pathology and Genetics, Albert Einstein College of Medicine, Bronx, New York (United States); Stock, Richard [Department of Radiation Oncology, Mount Sinai School of Medicine, New York, New York (United States); Stone, Nelson [Department of Radiation Oncology, Mount Sinai School of Medicine, New York, New York (United States); Department of Urology, Mount Sinai School of Medicine, New York, New York (United States); Buckstein, Michael [Department of Radiation Oncology, Mount Sinai School of Medicine, New York, New York (United States); Shao, Yongzhao [Division of Biostatistics, New York University School of Medicine, New York, New York (United States); Campbell, Christopher [Departments of Pathology and Genetics, Albert Einstein College of Medicine, Bronx, New York (United States); Rath, Lynda [Department of Radiation Oncology, Mount Sinai School of Medicine, New York, New York (United States); De Ruysscher, Dirk; Lammering, Guido [Department of Radiation Oncology, Maastricht University Medical Center, Maastricht (Netherlands); Hixson, Rosetta; Cesaretti, Jamie; Terk, Mitchell [Florida Radiation Oncology Group, Jacksonville, Florida (United States); Ostrer, Harry [Departments of Pathology and Genetics, Albert Einstein College of Medicine, Bronx, New York (United States); Rosenstein, Barry S., E-mail: barry.rosenstein@mssm.edu [Department of Radiation Oncology, Mount Sinai School of Medicine, New York, New York (United States); Department of Radiation Oncology, New York University School of Medicine, New York, New York (United States); Departments of Dermatology and Preventive Medicine, Mount Sinai School of Medicine, New York, New York (United States)

    2013-01-01

    Purpose: To identify single nucleotide polymorphisms (SNPs) associated with development of erectile dysfunction (ED) among prostate cancer patients treated with radiation therapy. Methods and Materials: A 2-stage genome-wide association study was performed. Patients were split randomly into a stage I discovery cohort (132 cases, 103 controls) and a stage II replication cohort (128 cases, 102 controls). The discovery cohort was genotyped using Affymetrix 6.0 genome-wide arrays. The 940 top ranking SNPs selected from the discovery cohort were genotyped in the replication cohort using Illumina iSelect custom SNP arrays. Results: Twelve SNPs identified in the discovery cohort and validated in the replication cohort were associated with development of ED following radiation therapy (Fisher combined P values 2.1 Multiplication-Sign 10{sup -5} to 6.2 Multiplication-Sign 10{sup -4}). Notably, these 12 SNPs lie in or near genes involved in erectile function or other normal cellular functions (adhesion and signaling) rather than DNA damage repair. In a multivariable model including nongenetic risk factors, the odds ratios for these SNPs ranged from 1.6 to 5.6 in the pooled cohort. There was a striking relationship between the cumulative number of SNP risk alleles an individual possessed and ED status (Sommers' D P value = 1.7 Multiplication-Sign 10{sup -29}). A 1-allele increase in cumulative SNP score increased the odds for developing ED by a factor of 2.2 (P value = 2.1 Multiplication-Sign 10{sup -19}). The cumulative SNP score model had a sensitivity of 84% and specificity of 75% for prediction of developing ED at the radiation therapy planning stage. Conclusions: This genome-wide association study identified a set of SNPs that are associated with development of ED following radiation therapy. These candidate genetic predictors warrant more definitive validation in an independent cohort.

  20. A genome-wide scan study identifies a single nucleotide substitution in ASIP associated with white versus non-white coat-colour variation in sheep (Ovis aries).

    Science.gov (United States)

    Li, M-H; Tiirikka, T; Kantanen, J

    2014-02-01

    In sheep, coat colour (and pattern) is one of the important traits of great biological, economic and social importance. However, the genetics of sheep coat colour has not yet been fully clarified. We conducted a genome-wide association study of sheep coat colours by genotyping 47 303 single-nucleotide polymorphisms (SNPs) in the Finnsheep population in Finland. We identified 35 SNPs associated with all the coat colours studied, which cover genomic regions encompassing three known pigmentation genes (TYRP1, ASIP and MITF) in sheep. Eighteen of these associations were confirmed in further tests between white versus non-white individuals, but none of the 35 associations were significant in the analysis of only non-white colours. Across the tests, the s66432.1 in ASIP showed significant association (P=4.2 × 10(-11) for all the colours; P=2.3 × 10(-11) for white versus non-white colours) with the variation in coat colours and strong linkage disequilibrium with other significant variants surrounding the ASIP gene. The signals detected around the ASIP gene were explained by differences in white versus non-white alleles. Further, a genome scan for selection for white coat pigmentation identified a strong and striking selection signal spanning ASIP. Our study identified the main candidate gene for the coat colour variation between white and non-white as ASIP, an autosomal gene that has been directly implicated in the pathway regulating melanogenesis. Together with ASIP, the two other newly identified genes (TYRP1 and MITF) in the Finnsheep, bordering associated SNPs, represent a new resource for enriching sheep coat-colour genetics and breeding.

  1. Genome-wide single nucleotide polymorphism-based assay for high-resolution epidemiological analysis of the methicillin-resistant Staphylococcus aureus hospital clone EMRSA-15.

    Science.gov (United States)

    Holmes, A; McAllister, G; McAdam, P R; Hsien Choi, S; Girvan, K; Robb, A; Edwards, G; Templeton, K; Fitzgerald, J R

    2014-02-01

    The EMRSA-15 clone is a major cause of nosocomial methicillin-resistant Staphylococcus aureus (MRSA) infections in the UK and elsewhere but existing typing methodologies have limited capacity to discriminate closely related strains, and are often poorly reproducible between laboratories. Here, we report the design, development and validation of a genome-wide single nucleotide polymorphism (SNP) typing method and compare it to established methods for typing of EMRSA-15. In order to identify discriminatory SNPs, the genomes of 17 EMRSA-15 strains, selected to represent the breadth of genotypic and phenotypic diversity of EMRSA-15 isolates in Scotland, were determined and phylogenetic reconstruction was carried out. In addition to 17 phylogenetically informative SNPs, five binary markers were included to form the basis of an EMRSA-15 genotyping assay. The SNP-based typing assay was as discriminatory as pulsed-field gel electrophoresis, and significantly more discriminatory than staphylococcal protein A (spa) typing for typing of a representative panel of diverse EMRSA-15 strains, isolates from two EMRSA-15 hospital outbreak investigations, and a panel of bacteraemia isolates obtained in healthcare facilities in the east of Scotland during a 12-month period. The assay is a rapid, and reproducible approach for epidemiological analysis of EMRSA-15 clinical isolates in Scotland. Unlike established methods the DNA sequence-based method is ideally suited for inter-laboratory comparison of identified genotypes, and its flexibility lends itself to supplementation with additional SNPs or markers for the identification of novel S. aureus strains in other regions of the world.

  2. A resource of genome-wide single-nucleotide polymorphisms generated by RAD tag sequencing in the critically endangered European eel

    DEFF Research Database (Denmark)

    Pujolar, J.M.; Jacobsen, M.W.; Frydenberg, J.;

    2013-01-01

    Reduced representation genome sequencing such as restriction-site-associated DNA (RAD) sequencing is finding increased use to identify and genotype large numbers of single-nucleotide polymorphisms (SNPs) in model and nonmodel species. We generated a unique resource of novel SNP markers for the Eu...... 425 loci and 376 918 associated SNPs provides a valuable tool for future population genetics and genomics studies and allows for targeting specific genes and particularly interesting regions of the eel genome......Reduced representation genome sequencing such as restriction-site-associated DNA (RAD) sequencing is finding increased use to identify and genotype large numbers of single-nucleotide polymorphisms (SNPs) in model and nonmodel species. We generated a unique resource of novel SNP markers...

  3. Deep sequencing revealed genome-wide single-nucleotide polymorphism and plasmid content of Erwinia amylovora strains isolated in Middle Atlas, Morocco.

    Science.gov (United States)

    Hannou, Najat; Mondy, Samuel; Planamente, Sara; Moumni, Mohieddine; Llop, Pablo; López, María; Manceau, Charles; Barny, Marie-Anne; Faure, Denis

    2013-10-01

    Erwinia amylovora causes economic losses that affect pear and apple production in Morocco. Here, we report comparative genomics of four Moroccan E. amylovora strains with the European strain CFBP1430 and North-American strain ATCC49946. Analysis of single nucleotide polymorphisms (SNPs) revealed genetic homogeneity of Moroccan's strains and their proximity to the European strain CFBP1430. Moreover, the collected sequences allowed the assembly of a 65 kpb plasmid, which is highly similar to the plasmid pEI70 harbored by several European E. amylovora isolates. This plasmid was found in 33% of the 40 E. amylovora strains collected from several host plants in 2009 and 2010 in Morocco.

  4. A genome-wide association study for milk production traits in Danish Jersey cattle using a 50K single nucleotide polymorphism chip.

    Science.gov (United States)

    Mai, M D; Sahana, G; Christiansen, F B; Guldbrandtsen, B

    2010-11-01

    Quantitative trait loci for milk production traits in Danish Jersey cattle were mapped by a genome-wide association analysis using a mixed model. The analysis incorporated 1,039 bulls and 33,090 SNP and resulted in 98 detected combinations of QTL and traits on 27 BTA. These QTL comprised 30 for milk index, 50 for fat index, and 18 for protein index. The evidence presents 33 genome-wide QTL on 14 BTA. Of these, 7 had effects on milk index, 21 on fat index, and 5 on protein index. Among the genome-wide QTL, 26 have been previously reported, 2 on BTA4 and BTA5 were new for milk index, and 5 on BTA4, BTA5, BTA13, BTA20, and BTA29 were new QTL for fat index. We found 7 pleiotropic or very closely linked QTL. Most of the QTL were associated with polymorphisms within narrow regions and several may represent the effects of polymorphisms of genes: DGAT1, casein, ARFGAP3, CYP11B1, and CDC-like kinase 4. By a chromosome-wide threshold, 65 additional QTL were detected. Many of them are likely to represent QTL. The results are interesting from a breeding perspective and contribute to the search for the genes causing the polymorphisms important for milk production traits.

  5. A genome-wide association study for milk production traits in Danish Jersey cattle using a 50K single nucleotide polymorphism chip

    DEFF Research Database (Denmark)

    Mai, Duy Minh; Sahana, Goutam; Christiansen, Freddy;

    2010-01-01

    for milk index, 50 for fat index, and 18 for protein index. The evidence presents 33 genome-wide QTL on 14 BTA. Of these, 7 had effects on milk index, 21 on fat index, and 5 on protein index. Among the genome-wide QTL, 26 have been previously reported, 2 on BTA4 and BTA5 were new for milk index, and 5......Quantitative trait loci for milk production traits in Danish Jersey cattle were mapped by a genome-wide association analysis using a mixed model. The analysis incorporated 1,039 bulls and 33,090 SNP and resulted in 98 detected combinations of QTL and traits on 27 BTA. These QTL comprised 30...... on BTA4, BTA5, BTA13, BTA20, and BTA29 were new QTL for fat index. We found 7 pleiotropic or very closely linked QTL. Most of the QTL were associated with polymorphisms within narrow regions and several may represent the effects of polymorphisms of genes: DGAT1, casein, ARFGAP3, CYP11B1, and CDC...

  6. Genome-wide association study identifies single-nucleotide polymorphism in KCNB1 associated with left ventricular mass in humans: The HyperGEN Study

    Directory of Open Access Journals (Sweden)

    Kraemer Rachel

    2009-05-01

    Full Text Available Abstract Background We conducted a genome-wide association study (GWAS and validation study for left ventricular (LV mass in the Family Blood Pressure Program – HyperGEN population. LV mass is a sensitive predictor of cardiovascular mortality and morbidity in all genders, races, and ages. Polymorphisms of candidate genes in diverse pathways have been associated with LV mass. However, subsequent studies have often failed to replicate these associations. Genome-wide association studies have unprecedented power to identify potential genes with modest effects on left LV mass. We describe here a GWAS for LV mass in Caucasians using the Affymetrix GeneChip Human Mapping 100 k Set. Cases (N = 101 and controls (N = 101 were selected from extreme tails of the LV mass index distribution from 906 individuals in the HyperGEN study. Eleven of 12 promising (Q Results Despite the relatively small sample, we identified 12 promising SNPs in the GWAS. Eleven SNPs were successfully genotyped in the validation study of 704 Caucasians and 1467 African Americans; 5 SNPs on chromosomes 5, 12, and 20 were significantly (P ≤ 0.05 associated with LV mass after correction for multiple testing. One SNP (rs756529 is intragenic within KCNB1, which is dephosphorylated by calcineurin, a previously reported candidate gene for LV hypertrophy within this population. Conclusion These findings suggest KCNB1 may be involved in the development of LV hypertrophy in humans.

  7. Genome-wide association study identifies single nucleotide polymorphism in DYRK1A associated with replication of HIV-1 in monocyte-derived macrophages.

    Directory of Open Access Journals (Sweden)

    Sebastiaan M Bol

    Full Text Available BACKGROUND: HIV-1 infected macrophages play an important role in rendering resting T cells permissive for infection, in spreading HIV-1 to T cells, and in the pathogenesis of AIDS dementia. During highly active anti-retroviral treatment (HAART, macrophages keep producing virus because tissue penetration of antiretrovirals is suboptimal and the efficacy of some is reduced. Thus, to cure HIV-1 infection with antiretrovirals we will also need to efficiently inhibit viral replication in macrophages. The majority of the current drugs block the action of viral enzymes, whereas there is an abundance of yet unidentified host factors that could be targeted. We here present results from a genome-wide association study identifying novel genetic polymorphisms that affect in vitro HIV-1 replication in macrophages. METHODOLOGY/PRINCIPAL FINDINGS: Monocyte-derived macrophages from 393 blood donors were infected with HIV-1 and viral replication was determined using Gag p24 antigen levels. Genomic DNA from individuals with macrophages that had relatively low (n = 96 or high (n = 96 p24 production was used for SNP genotyping with the Illumina 610 Quad beadchip. A total of 494,656 SNPs that passed quality control were tested for association with HIV-1 replication in macrophages, using linear regression. We found a strong association between in vitro HIV-1 replication in monocyte-derived macrophages and SNP rs12483205 in DYRK1A (p = 2.16 × 10(-5. While the association was not genome-wide significant (p<1 × 10(-7, we could replicate this association using monocyte-derived macrophages from an independent group of 31 individuals (p = 0.0034. Combined analysis of the initial and replication cohort increased the strength of the association (p = 4.84 × 10(-6. In addition, we found this SNP to be associated with HIV-1 disease progression in vivo in two independent cohort studies (p = 0.035 and p = 0.0048. CONCLUSIONS/SIGNIFICANCE: These findings suggest that the kinase

  8. Genome-Wide Single Nucleotide Polymorphism Discovery and the Construction of a High-Density Genetic Map for Melon (Cucumis melo L.) Using Genotyping-by-Sequencing.

    Science.gov (United States)

    Chang, Che-Wei; Wang, Yu-Hua; Tung, Chih-Wei

    2017-01-01

    Although genotyping-by-sequencing (GBS) enables the efficient and low-cost generation of large numbers of markers, the utility of resultant genotypes are limited, because they are enormously error-prone and contain high proportions of missing data. In this study, we generated single nucleotide polymorphism (SNP) markers for 109 recombinant inbred lines of melon (Cucumis melo L.) using the GBS approach and ordered them according to their physical position on the draft double haploid line DHL92 genome. Next, by investigating associations between these SNPs, we discovered that some segments on the physical map conflict with linkage relationships. Therefore, to filter out error-prone loci, 4,110 SNPs in which we have a high degree of confidence were selected as anchors to test independence with respect to unselected markers, and the resultant dataset was then analyzed using the Full-Sib Family Haplotype (FSFHap) algorithm in the software TASSEL 5.2. On the basis of this analysis, 22,933 loci that have an average rate of missing data of 0.281% were used to construct a genetic map, which spans 1,088.3 cM across 12 chromosomes and has a maximum spacing of 6.0 cM. Use of this high-quality linkage map enabled the identification of several quantitative trait loci (QTL) known to control traits in fruit and validated our approach. This study highlights the utility of GBS markers for the identification of trait-associated QTLs in melon and facilitates further investigation of genome structure.

  9. Genome-wide association study using high-density single nucleotide polymorphism arrays and whole-genome sequences for clinical mastitis traits in dairy cattle.

    Science.gov (United States)

    Sahana, G; Guldbrandtsen, B; Thomsen, B; Holm, L-E; Panitz, F; Brøndum, R F; Bendixen, C; Lund, M S

    2014-11-01

    Mastitis is a mammary disease that frequently affects dairy cattle. Despite considerable research on the development of effective prevention and treatment strategies, mastitis continues to be a significant issue in bovine veterinary medicine. To identify major genes that affect mastitis in dairy cattle, 6 chromosomal regions on Bos taurus autosome (BTA) 6, 13, 16, 19, and 20 were selected from a genome scan for 9 mastitis phenotypes using imputed high-density single nucleotide polymorphism arrays. Association analyses using sequence-level variants for the 6 targeted regions were carried out to map causal variants using whole-genome sequence data from 3 breeds. The quantitative trait loci (QTL) discovery population comprised 4,992 progeny-tested Holstein bulls, and QTL were confirmed in 4,442 Nordic Red and 1,126 Jersey cattle. The targeted regions were imputed to the sequence level. The highest association signal for clinical mastitis was observed on BTA 6 at 88.97 Mb in Holstein cattle and was confirmed in Nordic Red cattle. The peak association region on BTA 6 contained 2 genes: vitamin D-binding protein precursor (GC) and neuropeptide FF receptor 2 (NPFFR2), which, based on known biological functions, are good candidates for affecting mastitis. However, strong linkage disequilibrium in this region prevented conclusive determination of the causal gene. A different QTL on BTA 6 located at 88.32 Mb in Holstein cattle affected mastitis. In addition, QTL on BTA 13 and 19 were confirmed to segregate in Nordic Red cattle and QTL on BTA 16 and 20 were confirmed in Jersey cattle. Although several candidate genes were identified in these targeted regions, it was not possible to identify a gene or polymorphism as the causal factor for any of these regions.

  10. Genome-Wide Single Nucleotide Polymorphism Discovery and the Construction of a High-Density Genetic Map for Melon (Cucumis melo L.) Using Genotyping-by-Sequencing

    Science.gov (United States)

    Chang, Che-Wei; Wang, Yu-Hua; Tung, Chih-Wei

    2017-01-01

    Although genotyping-by-sequencing (GBS) enables the efficient and low-cost generation of large numbers of markers, the utility of resultant genotypes are limited, because they are enormously error-prone and contain high proportions of missing data. In this study, we generated single nucleotide polymorphism (SNP) markers for 109 recombinant inbred lines of melon (Cucumis melo L.) using the GBS approach and ordered them according to their physical position on the draft double haploid line DHL92 genome. Next, by investigating associations between these SNPs, we discovered that some segments on the physical map conflict with linkage relationships. Therefore, to filter out error-prone loci, 4,110 SNPs in which we have a high degree of confidence were selected as anchors to test independence with respect to unselected markers, and the resultant dataset was then analyzed using the Full-Sib Family Haplotype (FSFHap) algorithm in the software TASSEL 5.2. On the basis of this analysis, 22,933 loci that have an average rate of missing data of 0.281% were used to construct a genetic map, which spans 1,088.3 cM across 12 chromosomes and has a maximum spacing of 6.0 cM. Use of this high-quality linkage map enabled the identification of several quantitative trait loci (QTL) known to control traits in fruit and validated our approach. This study highlights the utility of GBS markers for the identification of trait-associated QTLs in melon and facilitates further investigation of genome structure. PMID:28220139

  11. Genome-wide analysis of human global and transcription-coupled excision repair of UV damage at single-nucleotide resolution.

    Science.gov (United States)

    Hu, Jinchuan; Adar, Sheera; Selby, Christopher P; Lieb, Jason D; Sancar, Aziz

    2015-05-01

    We developed a method for genome-wide mapping of DNA excision repair named XR-seq (excision repair sequencing). Human nucleotide excision repair generates two incisions surrounding the site of damage, creating an ∼30-mer. In XR-seq, this fragment is isolated and subjected to high-throughput sequencing. We used XR-seq to produce stranded, nucleotide-resolution maps of repair of two UV-induced DNA damages in human cells: cyclobutane pyrimidine dimers (CPDs) and (6-4) pyrimidine-pyrimidone photoproducts [(6-4)PPs]. In wild-type cells, CPD repair was highly associated with transcription, specifically with the template strand. Experiments in cells defective in either transcription-coupled excision repair or general excision repair isolated the contribution of each pathway to the overall repair pattern and showed that transcription-coupled repair of both photoproducts occurs exclusively on the template strand. XR-seq maps capture transcription-coupled repair at sites of divergent gene promoters and bidirectional enhancer RNA (eRNA) production at enhancers. XR-seq data also uncovered the repair characteristics and novel sequence preferences of CPDs and (6-4)PPs. XR-seq and the resulting repair maps will facilitate studies of the effects of genomic location, chromatin context, transcription, and replication on DNA repair in human cells.

  12. Genome-Wide Single-Nucleotide Polymorphisms in CMS and Restorer Lines Discovered by Genotyping Using Sequencing and Association with Marker-Combining Ability for 12 Yield-Related Traits in Oryza sativa L. subsp. Japonica

    Science.gov (United States)

    Zaid, Imdad U.; Tang, Weijie; Liu, Erbao; Khan, Sana U.; Wang, Hui; Mawuli, Edzesi W.; Hong, Delin

    2017-01-01

    Heterosis or hybrid vigor is closely related with general combing ability (GCA) of parents and special combining ability (SCA) of combinations. The evaluation of GCA and SCA facilitate selection of parents and combinations in heterosis breeding. In order to improve combining ability (CA) by molecular marker assist selection, it is necessary to identify marker loci associated with the CA. To identify the single nucleotide polymorphisms (SNP) loci associated with CA in the parental genomes of japonica rice, genome-wide discovered SNP loci were tested for association with the CA of 18 parents for 12 yield-related traits. In this study, 81 hybrids were created and evaluated to calculate the CA of 18 parents. The parents were sequenced by genotyping by sequencing (GBS) method for identification of genome-wide SNPs. The analysis of GBS indicated that the successful mapping of 9.86 × 106 short reads in the Nipponbare reference genome consists of 39,001 SNPs in parental genomes at 11,085 chromosomal positions. The discovered SNPs were non-randomly distributed within and among the 12 chromosomes of rice. Overall, 20.4% (8026) of the discovered SNPs were coding types, and 8.6% (3344) and 9.9% (3951) of the SNPs revealed synonymous and non-synonymous changes, which provide valuable knowledge about the underlying performance of the parents. Furthermore, the associations between SNPs and CA indicated that 362 SNP loci were significantly related to the CA of 12 parental traits. The identified SNP loci of CA in our study were distributed genome wide and caused a positive or negative effect on the CA of traits. For the yield-related traits, such as grain thickness, days to heading, panicle length, grain length and 1000-grain weight, a maximum number of positive SNP loci of CA were found in CMS A171 and in the restorers LC64 and LR27. On an individual basis, some of associated loci that resided on chromosomes 2, 5, 7, 9, and 11 recorded maximum positive values for the CA of traits

  13. Association study of nonsynonymous single nucleotide polymorphisms in schizophrenia

    DEFF Research Database (Denmark)

    Carrera, Noa; Arrojo, Manuel; Sanjuán, Julio

    2012-01-01

    Genome-wide association studies using several hundred thousand anonymous markers present limited statistical power. Alternatively, association studies restricted to common nonsynonymous single nucleotide polymorphisms (nsSNPs) have the advantage of strongly reducing the multiple testing problem, ...

  14. Single Nucleotide Polymorphism

    DEFF Research Database (Denmark)

    Børsting, Claus; Pereira, Vania; Andersen, Jeppe Dyrberg;

    2014-01-01

    Single nucleotide polymorphisms (SNPs) are the most frequent DNA sequence variations in the genome. They have been studied extensively in the last decade with various purposes in mind. In this chapter, we will discuss the advantages and disadvantages of using SNPs for human identification and bri...

  15. Genome-wide association study of multiplex schizophrenia pedigrees

    DEFF Research Database (Denmark)

    Levinson, Douglas F; Shi, Jianxin; Wang, Kai;

    2012-01-01

    The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs).......The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs)....

  16. Direct detection of single-nucleotide polymorphisms in bacterial DNA by SNPtrap

    DEFF Research Database (Denmark)

    Grønlund, Hugo Ahlm; Moen, Birgitte; Hoorfar, Jeffrey

    2011-01-01

    A major challenge with single-nucleotide polymorphism (SNP) fingerprinting of bacteria and higher organisms is the combination of genome-wide screenings with the potential of multiplexing and accurate SNP detection. Single-nucleotide extension by the minisequencing principle represents a technolo...

  17. Genome-Wide Association Analysis in Primary Sclerosing Cholangitis

    NARCIS (Netherlands)

    T.H. Karlsen; A. Franke; E. Melum; A.. Kaser; J.R. Hov; T. Balschun; B.A. Lie; A. Bergquist; C. Schramm; T.J. Weismüller; D. Gotthardt; C. Rust; E.E.R. Philipp; T. Fritz; L. Henckaerts; R. Weersma; P. Stokkers; C.Y. Ponsioen; C. Wijmenga; M. Sterneck; M. Nothnagel; J. Hampe; A. Teufel; H. Runz; P. Rosenstiel; A. Stiehl; S. Vermeire; U. Beuers; M. Manns; E. Schrumpf; K.M. Boberg; S. Schreiber

    2010-01-01

    BACKGROUND & AIMS: We aimed to characterize the genetic susceptibility to primary sclerosing cholangitis (PSC) by means of a genome-wide association analysis of single nucleotide polymorphism (SNP) markers. METHODS: A total of 443,816 SNPs on the Affymetrix SNP Array 5.0 (Affymetrix, Santa Clara, CA

  18. qpure: A tool to estimate tumor cellularity from genome-wide single-nucleotide polymorphism profiles.

    Directory of Open Access Journals (Sweden)

    Sarah Song

    Full Text Available Tumour cellularity, the relative proportion of tumour and normal cells in a sample, affects the sensitivity of mutation detection, copy number analysis, cancer gene expression and methylation profiling. Tumour cellularity is traditionally estimated by pathological review of sectioned specimens; however this method is both subjective and prone to error due to heterogeneity within lesions and cellularity differences between the sample viewed during pathological review and tissue used for research purposes. In this paper we describe a statistical model to estimate tumour cellularity from SNP array profiles of paired tumour and normal samples using shifts in SNP allele frequency at regions of loss of heterozygosity (LOH in the tumour. We also provide qpure, a software implementation of the method. Our experiments showed that there is a medium correlation 0.42 ([Formula: see text]-value=0.0001 between tumor cellularity estimated by qpure and pathology review. Interestingly there is a high correlation 0.87 ([Formula: see text]-value [Formula: see text] 2.2e-16 between cellularity estimates by qpure and deep Ion Torrent sequencing of known somatic KRAS mutations; and a weaker correlation 0.32 ([Formula: see text]-value=0.004 between IonTorrent sequencing and pathology review. This suggests that qpure may be a more accurate predictor of tumour cellularity than pathology review. qpure can be downloaded from https://sourceforge.net/projects/qpure/.

  19. qpure: A tool to estimate tumor cellularity from genome-wide single-nucleotide polymorphism profiles.

    Science.gov (United States)

    Song, Sarah; Nones, Katia; Miller, David; Harliwong, Ivon; Kassahn, Karin S; Pinese, Mark; Pajic, Marina; Gill, Anthony J; Johns, Amber L; Anderson, Matthew; Holmes, Oliver; Leonard, Conrad; Taylor, Darrin; Wood, Scott; Xu, Qinying; Newell, Felicity; Cowley, Mark J; Wu, Jianmin; Wilson, Peter; Fink, Lynn; Biankin, Andrew V; Waddell, Nic; Grimmond, Sean M; Pearson, John V

    2012-01-01

    Tumour cellularity, the relative proportion of tumour and normal cells in a sample, affects the sensitivity of mutation detection, copy number analysis, cancer gene expression and methylation profiling. Tumour cellularity is traditionally estimated by pathological review of sectioned specimens; however this method is both subjective and prone to error due to heterogeneity within lesions and cellularity differences between the sample viewed during pathological review and tissue used for research purposes. In this paper we describe a statistical model to estimate tumour cellularity from SNP array profiles of paired tumour and normal samples using shifts in SNP allele frequency at regions of loss of heterozygosity (LOH) in the tumour. We also provide qpure, a software implementation of the method. Our experiments showed that there is a medium correlation 0.42 ([Formula: see text]-value=0.0001) between tumor cellularity estimated by qpure and pathology review. Interestingly there is a high correlation 0.87 ([Formula: see text]-value [Formula: see text] 2.2e-16) between cellularity estimates by qpure and deep Ion Torrent sequencing of known somatic KRAS mutations; and a weaker correlation 0.32 ([Formula: see text]-value=0.004) between IonTorrent sequencing and pathology review. This suggests that qpure may be a more accurate predictor of tumour cellularity than pathology review. qpure can be downloaded from https://sourceforge.net/projects/qpure/.

  20. Profiling genome-wide DNA methylation.

    Science.gov (United States)

    Yong, Wai-Shin; Hsu, Fei-Man; Chen, Pao-Yang

    2016-01-01

    DNA methylation is an epigenetic modification that plays an important role in regulating gene expression and therefore a broad range of biological processes and diseases. DNA methylation is tissue-specific, dynamic, sequence-context-dependent and trans-generationally heritable, and these complex patterns of methylation highlight the significance of profiling DNA methylation to answer biological questions. In this review, we surveyed major methylation assays, along with comparisons and biological examples, to provide an overview of DNA methylation profiling techniques. The advances in microarray and sequencing technologies make genome-wide profiling possible at a single-nucleotide or even a single-cell resolution. These profiling approaches vary in many aspects, such as DNA input, resolution, genomic region coverage, and bioinformatics analysis, and selecting a feasible method requires knowledge of these methods. We first introduce the biological background of DNA methylation and its pattern in plants, animals and fungi. We present an overview of major experimental approaches to profiling genome-wide DNA methylation and hydroxymethylation and then extend to the single-cell methylome. To evaluate these methods, we outline their strengths and weaknesses and perform comparisons across the different platforms. Due to the increasing need to compute high-throughput epigenomic data, we interrogate the computational pipeline for bisulfite sequencing data and also discuss the concept of identifying differentially methylated regions (DMRs). This review summarizes the experimental and computational concepts for profiling genome-wide DNA methylation, followed by biological examples. Overall, this review provides researchers useful guidance for the selection of a profiling method suited to specific research questions.

  1. The challenges of genome-wide interaction studies: Lessons to learn from the analysis of HDL blood levels

    NARCIS (Netherlands)

    E.M. van Leeuwen (Elisa); F.A.S. Smouter (Françoise A.S.); T. Kam-Thong (Tony); N. Karbalai (Nazanin); G.D. Smith; T.B. Harris (Tamara); L.J. Launer (Lenore); C.M. Sitlani (Colleen); G. Li (Guo); J. Brody (Jennifer); J.C. Bis (Joshua); C.C. White (Charles); A. Jaiswal (Alok); B.A. Oostra (Ben); A. Hofman (Albert); F. Rivadeneira Ramirez (Fernando); A.G. Uitterlinden (André); E.A. Boerwinkle (Eric); C. Ballantyne (Christie); V. Gudnason (Vilmundur); B.M. Psaty (Bruce); L.A. Cupples (Adrienne); M.-R. Jarvelin (Marjo-Riitta); S. Ripatti (Samuli); A.J. Isaacs (Aaron); B. Müller-Myhsok (B.); L.C. Karssen (Lennart); C.M. van Duijn (Cock)

    2014-01-01

    textabstractGenome-wide association studies (GWAS) have revealed 74 single nucleotide polymorphisms (SNPs) associated with high-density lipoprotein cholesterol (HDL) blood levels. This study is, to our knowledge, the first genome-wide interaction study (GWIS) to identify SNP6SNP interactions associa

  2. A genome-wide association study identifies an osteoarthritis susceptibility locus on chromosome 7q22

    NARCIS (Netherlands)

    J.M. Kerkhof (Hanneke); R.J. Lories (Rik); I. Meulenbelt (Ingrid); I. Jonsdottir (Ingileif); A.M. Valdes (Ana Maria); P.P. Arp (Pascal); T. Ingvarsson (Torvaldur); M. Jhamai (Mila); H. Jonsson (Helgi); L. Stolk (Lisette); G. Thorleifsson (Gudmar); G. Zhai (Guangju); F. Zhang (Feng); Y. Zhu (Yicheng); R. van der Breggen (Ruud); M. Doherty (Michael); D. Felson; A. Gonzalez (Antonio); B.V. Halldorsson (Bjarni); D.J. Hart (Deborah); V.B. Hauksson (Valdimar); A. Hofman (Albert); J.P.A. Ioannidis (John); M. Kloppenburg (Margreet); N.E. Lane (Nancy); J. Loughlin (John); F.P. Luyten (Frank); M.C. Nevitt (Michael); N. Parimi (Neeta); H.A.P. Pols (Huib); F. Rivadeneira Ramirez (Fernando); E. Slagboom (Eline); U. Styrkarsdottir (Unnur); A. Tsezou (Aspasia); T. van de Putte (Tom); J. Zmuda (Joseph); T.D. Spector (Timothy); J-A. Zwart (John-Anker); A.G. Uitterlinden (André); J.B.J. van Meurs (Joyce); A.J. Carr (Andrew Jonathan)

    2010-01-01

    markdownabstract__Objective__ To identify novel genes involved in osteoarthritis (OA), by means of a genome-wide association study. Methods. We tested 500,510 single-nucleotide polymorphisms (SNPs) in 1,341 Dutch Caucasian OA cases and 3,496 Dutch Caucasian controls. SNPs associated with at least 2

  3. Genome-wide association and functional studies identify a role for IGFBP3 in hip osteoarthritis

    NARCIS (Netherlands)

    D.S. Evans (Daniel); F. Cailotto (Frederic); N. Parimi (Neeta); A.M. Valdes (Ana Maria); M.C. Castaño Betancourt (Martha); Y. Liu (Youfang); R.C. Kaplan (Robert); M. Bidlingmaier (Martin); R.S. Vasan (Ramachandran Srini); A. Teumer (Alexander); G.J. Tranah (Gregory); M.C. Nevitt (Michael); S. Cummings; E.S. Orwoll (Eric); E. Barrett-Connor (Elizabeth); J.B. Renner (Jordan); J.M. Jordan (Joanne); M. Doherty (Michael); S. Doherty (Sally); A.G. Uitterlinden (André); J.B.J. van Meurs (Joyce); T.D. Spector (Timothy); R.J. Lories (Rik); N.E. Lane

    2014-01-01

    textabstractObjectives To identify genetic associations with hip osteoarthritis (HOA), we performed a meta-analysis of genome-wide association studies (GWAS) of HOA. Methods The GWAS meta-analysis included approximately 2.5 million imputed HapMap single nucleotide polymorphisms (SNPs). HOA cases and

  4. Genome-Wide Association Study of Intelligence: Additive Effects of Novel Brain Expressed Genes

    Science.gov (United States)

    Loo, Sandra K.; Shtir, Corina; Doyle, Alysa E.; Mick, Eric; McGough, James J.; McCracken, James; Biederman, Joseph; Smalley, Susan L.; Cantor, Rita M.; Faraone, Stephen V.; Nelson, Stanley F.

    2012-01-01

    Objective: The purpose of the present study was to identify common genetic variants that are associated with human intelligence or general cognitive ability. Method: We performed a genome-wide association analysis with a dense set of 1 million single-nucleotide polymorphisms (SNPs) and quantitative intelligence scores within an ancestrally…

  5. Modelling the contribution of family history and variation in single nucleotide polymorphisms to risk of schizophrenia

    DEFF Research Database (Denmark)

    Agerbo, Esben; Mortensen, Preben Bo; Wiuf, Carsten

    2012-01-01

    Epidemiological studies indicate that having any family member with schizophrenia increases the risk of schizophrenia in the probands. However, genome-wide association studies (GWAS) have accounted for little of this variation. The aim of this study was to use a population-based sample to explore...... the influence of single-nucleotide polymorphisms (SNPs) on the excess schizophrenia risk in offspring of parents with a psychotic, bipolar affective or other psychiatric disorder....

  6. Genome-wide analysis correlates Ayurveda Prakriti.

    Science.gov (United States)

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K; Prasanna, B V; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S; Dedge, Amrish P; Bharadwaj, Ramachandra; Gangadharan, G G; Nair, Sreekumaran; Gopinath, Puthiya M; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-10-29

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as "Prakriti". To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10(-5)) were significantly different between Prakritis, without any confounding effect of stratification, after 10(6) permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India's traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine.

  7. Genome-wide patterns of nucleotide polymorphism in domesticated rice

    DEFF Research Database (Denmark)

    Caicedo, Ana L; Williamson, Scott H; Hernandez, Ryan D

    2007-01-01

    Domesticated Asian rice (Oryza sativa) is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments......, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs) in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models...... explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection...

  8. Genome-Wide Association Study of Polymorphisms Predisposing to Bronchiolitis

    Science.gov (United States)

    Pasanen, Anu; Karjalainen, Minna K.; Bont, Louis; Piippo-Savolainen, Eija; Ruotsalainen, Marja; Goksör, Emma; Kumawat, Kuldeep; Hodemaekers, Hennie; Nuolivirta, Kirsi; Jartti, Tuomas; Wennergren, Göran; Hallman, Mikko; Rämet, Mika; Korppi, Matti

    2017-01-01

    Bronchiolitis is a major cause of hospitalization among infants. Severe bronchiolitis is associated with later asthma, suggesting a common genetic predisposition. Genetic background of bronchiolitis is not well characterized. To identify polymorphisms associated with bronchiolitis, we conducted a genome-wide association study (GWAS) in which 5,300,000 single nucleotide polymorphisms (SNPs) were tested for association in a Finnish–Swedish population of 217 children hospitalized for bronchiolitis and 778 controls. The most promising SNPs (n = 77) were genotyped in a Dutch replication population of 416 cases and 432 controls. Finally, we used a set of 202 Finnish bronchiolitis cases to further investigate candidate SNPs. We did not detect genome-wide significant associations, but several suggestive association signals (p bronchiolitis. These preliminary findings require further validation in a larger sample size. PMID:28139761

  9. Ultrahigh-density linkage map for cultivated cucumber (Cucumis sativus L.) using a single-nucleotide polymorphism genotyping array

    Science.gov (United States)

    With the low cost of single nucleotide polymorphism (SNP) discovery, use of SNP markers for SNP array development is becoming more affordable. The SNP array is a very useful tool for high throughput genotyping and has a number of applications such as genome-wide association studies (GWAS). Since the...

  10. Investigation of single nucleotide polymorphisms and biological pathways associated with response to TNFα inhibitors in patients with rheumatoid arthritis

    DEFF Research Database (Denmark)

    Krintel, Sophine B; Palermo, Giuseppe; Johansen, Julia S;

    2012-01-01

    Recently, two genome-wide association studies identified single nucleotide polymorphisms (SNPs) significantly associated with the treatment response to tumor necrosis factor α (TNFα) inhibitors in patients with rheumatoid arthritis (RA). We aimed to replicate these results and identify SNPs...... and the possible biological pathways associated with the treatment response to TNFα inhibitors....

  11. A comprehensive analysis of genome-wide association studies to identify prostate cancer susceptibility loci for the Romanian population.

    Science.gov (United States)

    Rădăvoi, George Daniel; Pricop, Cătălin; Jinga, Viorel; Mateş, Dana; Rădoi, Viorica Elena; Jinga, Mariana; Ursu, Radu Ioan; Bratu, Ovidiu Gabriel; Mischianu, Dan Liviu Dorel; Iordache, Paul

    2016-01-01

    The aim of this study is to examine a large dataset of single nucleotide polymorphism known to be associated with prostate cancer from previous genome-wide association studies and create a dataset of single nucleotide polymorphisms that can be used in replication studies for the Romanian population. This study will define a list of markers showing a significant association with this phenotype. We propose the results of this study as a starting point for any Romanian genome-wide association studies researching the genetic susceptibility for prostate cancer.

  12. The Challenges of Genome-Wide Interaction Studies: Lessons to Learn from the Analysis of HDL Blood Levels

    OpenAIRE

    van Leeuwen, Elisabeth M.; Smouter, Françoise A. S.; Tony Kam-Thong; Nazanin Karbalai; Smith, Albert V.; Harris, Tamara B.; Launer, Lenore J.; Sitlani, Colleen M.; Guo Li; Brody, Jennifer A; Bis, Joshua C.; White, Charles C.; Alok Jaiswal; Oostra, Ben A.; Albert Hofman

    2014-01-01

    Genome-wide association studies (GWAS) have revealed 74 single nucleotide polymorphisms (SNPs) associated with high-density lipoprotein cholesterol (HDL) blood levels. This study is, to our knowledge, the first genome-wide interaction study (GWIS) to identify SNPxSNP interactions associated with HDL levels. We performed a GWIS in the Rotterdam Study (RS) cohort I (RS-I) using the GLIDE tool which leverages the massively parallel computing power of Graphics Processing Units (GPUs) to perform l...

  13. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Development of Advanced Classification Algorithm for Genome-Wide Single Nucleotide Polymorphism (SNP) Data Analysis

    Science.gov (United States)

    2011-04-01

    critical. 5. REFERENCES Almasy, L, Blangero, J. (2009) “Human QTL linkage mapping.” Genetica 136:333-340. Amos, CI. (2007) “Successful...quantitative trait loci.” Genetica 136:237-243. Ward, JH, Hook, ME. “A Hierarchical Grouping Procedure Applied to a Problem of Grouping Profiles

  14. European genome-wide association study identifies SLC14A1 as a new urinary bladder cancer susceptibility gene

    NARCIS (Netherlands)

    Rafnar, T.; Vermeulen, H.H.M.; Sulem, P.; Thorleifsson, G.; Aben, K.K.H.; Witjes, J.A.; Grotenhuis, A.J.; Verhaegh, G.W.C.T.; Hulsbergen- van de Kaa, C.A.; Besenbacher, S.; Gudbjartsson, D.; Stacey, S.N.; Gudmundsson, J.; Johannsdottir, H.; Bjarnason, H.; Zanon, C.; Helgadottir, H.; Jonasson, J.G.; Tryggvadottir, L.; Jonsson, E.; Geirsson, G.; Nikulasson, S.; Petursdottir, V.; Bishop, D.T.; Chung-Sak, S.; Choudhury, A.; Elliott, F.; Barrett, J.H.; Knowles, M.A.; Verdier, P. de; Ryk, C.; Lindblom, A.; Rudnai, P.; Gurzau, E.; Koppova, K.; Vineis, P.; Polidoro, S.; Guarrera, S.; Sacerdote, C.; Panadero, A.; Sanz-Velez, J.I.; Sanchez, M.; Valdivia, G.; Garcia-Prats, M.D.; Hengstler, J.G.; Selinski, S.; Gerullis, H.; Ovsiannikov, D.; Khezri, A.; Aminsharifi, A.; Malekzadeh, M.; Berg, L.H. van den; Ophoff, R.A.; Veldink, J.H.; Zeegers, M.P.; Kellen, E.; Fostinelli, J.; Andreoli, D.; Arici, C.; Porru, S.; Buntinx, F.; Ghaderi, A.; Golka, K.; Mayordomo, J.I.; Matullo, G.; Kumar, R.; Steineck, G.; Kiltie, A.E.; Kong, A.; Thorsteinsdottir, U.; Stefansson, K.; Kiemeney, L.A.L.M.

    2011-01-01

    Three genome-wide association studies in Europe and the USA have reported eight urinary bladder cancer (UBC) susceptibility loci. Using extended case and control series and 1000 Genomes imputations of 5 340 737 single-nucleotide polymorphisms (SNPs), we searched for additional loci in the European G

  15. Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering

    OpenAIRE

    Guo, Xuan; Meng, Yu; Yu, Ning; Pan, Yi

    2014-01-01

    Backgroud Taking the advan tage of high-throughput single nucleotide polymorphism (SNP) genotyping technology, large genome-wide association studies (GWASs) have been considered to hold promise for unravelling complex relationships between genotype and phenotype. At present, traditional single-locus-based methods are insufficient to detect interactions consisting of multiple-locus, which are broadly existing in complex traits. In addition, statistic tests for high order epistatic interactions...

  16. Genome wide association study identifies KCNMA1 contributing to human obesity

    DEFF Research Database (Denmark)

    Jiao, Hong; Arner, Peter; Hoffstedt, Johan;

    2011-01-01

    Recent genome-wide association (GWA) analyses have identified common single nucleotide polymorphisms (SNPs) that are associated with obesity. However, the reported genetic variation in obesity explains only a minor fraction of the total genetic variation expected to be present in the population....... Thus many genetic variants controlling obesity remain to be identified. The aim of this study was to use GWA followed by multiple stepwise validations to identify additional genes associated with obesity....

  17. Supervised Learning-Based tagSNP Selection for Genome-Wide Disease Classifications

    OpenAIRE

    Yang Mary Qu; Chen Zhongxue; Yang Jack; Liu Qingzhong; Sung Andrew H; Huang Xudong

    2008-01-01

    Abstract Background Comprehensive evaluation of common genetic variations through association of single nucleotide polymorphisms (SNPs) with complex human diseases on the genome-wide scale is an active area in human genome research. One of the fundamental questions in a SNP-disease association study is to find an optimal subset of SNPs with predicting power for disease status. To find that subset while reducing study burden in terms of time and costs, one can potentially reconcile information...

  18. Genome-wide association studies in pharmacogenomics of antidepressants.

    Science.gov (United States)

    Lin, Eugene; Lane, Hsien-Yuan

    2015-01-01

    Major depressive disorder (MDD) is one of the most common psychiatric disorders worldwide. Doctors must prescribe antidepressants based on educated guesses due to the fact that it is unmanageable to predict the effectiveness of any particular antidepressant in an individual patient. With the recent advent of scientific research, the genome-wide association study (GWAS) is extensively employed to analyze hundreds of thousands of single nucleotide polymorphisms by high-throughput genotyping technologies. In addition to the candidate-gene approach, the GWAS approach has recently been utilized to investigate the determinants of antidepressant response to therapy. In this study, we reviewed GWAS studies, their limitations and future directions with respect to the pharmacogenomics of antidepressants in MDD.

  19. Genome-wide association study of serum selenium concentrations

    DEFF Research Database (Denmark)

    Gong, Jian; Hsu, Li; Harrison, Tabitha

    2013-01-01

    Selenium is an essential trace element and circulating selenium concentrations have been associated with a wide range of diseases. Candidate gene studies suggest that circulating selenium concentrations may be impacted by genetic variation; however, no study has comprehensively investigated...... this hypothesis. Therefore, we conducted a two-stage genome-wide association study to identify genetic variants associated with serum selenium concentrations in 1203 European descents from two cohorts: the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening and the Women’s Health Initiative (WHI). We...... tested association between 2,474,333 single nucleotide polymorphisms (SNPs) and serum selenium concentrations using linear regression models. In the first stage (PLCO) 41 SNPs clustered in 15 regions had p

  20. Fine-Scale Estimation of Location of Birth from Genome-Wide Single-Nucleotide Polymorphism Data

    OpenAIRE

    Hoggart, C. J.; O Reilly, P. F.; Kaakinen, M.; Zhang, W.; Chambers, J.C.; Kooner, J.S.; Coin, L. J. M.; Jarvelin, M.-R.

    2011-01-01

    Systematic nonrandom mating in populations results in genetic stratification and is predominantly caused by geographic separation, providing the opportunity to infer individuals’ birthplace from genetic data. Such inference has been demonstrated for individuals’ country of birth, but here we use data from the Northern Finland Birth Cohort 1966 (NFBC1966) to investigate the characteristics of genetic structure within a population and subsequently develop a method for inferring location to a fi...

  1. Genome-wide patterns of nucleotide polymorphism in domesticated rice.

    Directory of Open Access Journals (Sweden)

    Ana L Caicedo

    2007-09-01

    Full Text Available Domesticated Asian rice (Oryza sativa is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models to explain contemporary patterns of polymorphisms in rice, including a (i selectively neutral population bottleneck model, (ii bottleneck plus migration model, (iii multiple selective sweeps model, and (iv bottleneck plus selective sweeps model. We find that a simple bottleneck model, which has been the dominant demographic model for domesticated species, cannot explain the derived nucleotide polymorphism site frequency spectrum in rice. Instead, a bottleneck model that incorporates selective sweeps, or a more complex demographic model that includes subdivision and gene flow, are more plausible explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection results only in a local signature of variation.

  2. A genome-wide association study of aging.

    Science.gov (United States)

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W; Garcia, Melissa E; Kaplan, Robert C; Kumari, Meena; Lunetta, Kathryn L; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J; Biffar, Reiner; Buchman, Aron S; Boerwinkle, Eric; Couper, David; De Jager, Philip L; Evans, Denis A; Harris, Tamara B; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J; Lohman, Kurt K; Lutsey, Pamela L; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M; Reiman, Eric M; Rotter, Jerome I; Seshadri, Sudha; Shardell, Michelle D; Smith, Albert V; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M Carola; Bandinelli, Stefania; Baumeister, Sebastian E; Bennett, David A; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M; Newman, Anne B; Tiemeier, Henning; Franceschini, Nora

    2011-11-01

    Human longevity and healthy aging show moderate heritability (20%-50%). We conducted a meta-analysis of genome-wide association studies from 9 studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for 2 outcomes: (1) all-cause mortality, and (2) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10(-8)). We found 14 independent SNPs that predicted risk of death, and 8 SNPs that predicted event-free survival (p < 10(-5)). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer's disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity.

  3. A genome-wide association study of female sexual dysfunction.

    Directory of Open Access Journals (Sweden)

    Andrea Burri

    Full Text Available BACKGROUND: Female sexual dysfunction (FSD is an important but controversial problem with serious negative impact on women's quality of life. Data from twin studies have shown a genetic contribution to the development and maintenance of FSD. METHODOLOGY/PRINCIPAL FINDINGS: We performed a genome-wide association study (GWAS on 2.5 million single-nucleotide polymorphisms (SNPs in 1,104 female twins (25-81 years of age in a population-based register and phenotypic data on lifelong sexual functioning. Although none reached conventional genome-wide level of significance (10 × -8, we found strongly suggestive associations with the phenotypic dimension of arousal (rs13202860, P = 1.2 × 10(-7; rs1876525, P = 1.2 × 10(-7; and rs13209281 P = 8.3 × 10(-7 on chromosome 6, around 500 kb upstream of the locus HTR1E (5-hydroxytryptamine receptor 1E locus, related to the serotonin brain pathways. We could not replicate previously reported candidate SNPs associated with FSD in the DRD4, 5HT2A and IL-1B loci. CONCLUSIONS/SIGNIFICANCE: We report the first GWAS of FSD symptoms in humans. This has pointed to several "risk alleles" and the implication of the serotonin and GABA pathways. Ultimately, understanding key mechanisms via this research may lead to new FSD treatments and inform clinical practice and developments in psychiatric nosology.

  4. Genome wide association studies for body conformation traits in the Chinese Holstein cattle population

    DEFF Research Database (Denmark)

    Wu, Xiaoping; Fang, Ming; Liu, Lin;

    2013-01-01

    Background: Genome-wide association study (GWAS) is a powerful tool for revealing the genetic basis of quantitative traits. However, studies using GWAS for conformation traits of cattle is comparatively less. This study aims to use GWAS to find the candidates genes for body conformation traits.......Results: The Illumina BovineSNP50 BeadChip was used to identify single nucleotide polymorphisms (SNPs) that are associated with body conformation traits. A least absolute shrinkage and selection operator (LASSO) was applied to detect multiple SNPs simultaneously for 29 body conformation traits with 1,314 Chinese...... Holstein cattle and 52,166 SNPs. Totally, 59 genome-wide significant SNPs associated with 26 conformation traits were detected by genome-wide association analysis; five SNPs were within previously reported QTL regions (Animal Quantitative Trait Loci (QTL) database) and 11 were very close to the reported...

  5. Genetic link between family socioeconomic status and children's educational achievement estimated from genome-wide SNPs.

    Science.gov (United States)

    Krapohl, E; Plomin, R

    2016-03-01

    One of the best predictors of children's educational achievement is their family's socioeconomic status (SES), but the degree to which this association is genetically mediated remains unclear. For 3000 UK-representative unrelated children we found that genome-wide single-nucleotide polymorphisms could explain a third of the variance of scores on an age-16 UK national examination of educational achievement and half of the correlation between their scores and family SES. Moreover, genome-wide polygenic scores based on a previously published genome-wide association meta-analysis of total number of years in education accounted for ~3.0% variance in educational achievement and ~2.5% in family SES. This study provides the first molecular evidence for substantial genetic influence on differences in children's educational achievement and its association with family SES.

  6. Single-nucleotide polymorphisms and DNA methylation markers associated with central obesity and regulation of body weight.

    Science.gov (United States)

    Goni, Leticia; Milagro, Fermín I; Cuervo, Marta; Martínez, J Alfredo

    2014-11-01

    Visceral fat is strongly associated with the development of specific obesity-related metabolic alterations. Genetic and epigenetic mechanisms seem to be involved in the development of obesity and visceral adiposity. The aims of this review are to identify the single-nucleotide polymorphisms related to central obesity and to summarize the main findings on DNA methylation and obesity. A search of the MEDLINE database was conducted to identify genome-wide association studies, meta-analyses of genome-wide association studies, and gene-diet interaction studies related to central obesity, and, in addition, studies that analyzed DNA methylation in relation to body weight regulation. A total of 8 genome-wide association studies and 9 meta-analyses of genome-wide association studies reported numerous single-nucleotide polymorphisms to be associated with central obesity. Ten studies analyzed gene-diet interactions and central obesity, while 2 epigenome-wide association studies analyzed DNA methylation patterns and obesity. Nine studies investigated the relationship between DNA methylation and weight loss, excess body weight, or adiposity outcomes. Given the development of new sequencing and omics technologies, significantly more knowledge on genomics and epigenomics of obesity and body fat distribution will emerge in the near future.

  7. The expected performance of single nucleotide polymorphism loci in paternity testing.

    Science.gov (United States)

    Ayres, Karen L

    2005-11-25

    We discuss the utility of single nucleotide polymorphism loci for full trio and mother-unavailable paternity testing cases, in the presence of population substructure and relatedness of putative and actual fathers. We focus primarily on the expected number of loci required to gain specified probabilities of mismatches, and report the expected proportion of paternity indices greater than three threshold values for these loci.

  8. Empirical Bayes analysis of single nucleotide polymorphisms

    Directory of Open Access Journals (Sweden)

    Ickstadt Katja

    2008-03-01

    Full Text Available Abstract Background An important goal of whole-genome studies concerned with single nucleotide polymorphisms (SNPs is the identification of SNPs associated with a covariate of interest such as the case-control status or the type of cancer. Since these studies often comprise the genotypes of hundreds of thousands of SNPs, methods are required that can cope with the corresponding multiple testing problem. For the analysis of gene expression data, approaches such as the empirical Bayes analysis of microarrays have been developed particularly for the detection of genes associated with the response. However, the empirical Bayes analysis of microarrays has only been suggested for binary responses when considering expression values, i.e. continuous predictors. Results In this paper, we propose a modification of this empirical Bayes analysis that can be used to analyze high-dimensional categorical SNP data. This approach along with a generalized version of the original empirical Bayes method are available in the R package siggenes version 1.10.0 and later that can be downloaded from http://www.bioconductor.org. Conclusion As applications to two subsets of the HapMap data show, the empirical Bayes analysis of microarrays cannot only be used to analyze continuous gene expression data, but also be applied to categorical SNP data, where the response is not restricted to be binary. In association studies in which typically several ten to a few hundred SNPs are considered, our approach can furthermore be employed to test interactions of SNPs. Moreover, the posterior probabilities resulting from the empirical Bayes analysis of (prespecified interactions/genotypes can also be used to quantify the importance of these interactions.

  9. Genome Wide Association Analysis Reveals New Production Trait Genes in a Male Duroc Population.

    Directory of Open Access Journals (Sweden)

    Kejun Wang

    Full Text Available In this study, 796 male Duroc pigs were used to identify genomic regions controlling growth traits. Three production traits were studied: food conversion ratio, days to 100 KG, and average daily gain, using a panel of 39,436 single nucleotide polymorphisms. In total, we detected 11 genome-wide and 162 chromosome-wide single nucleotide polymorphism trait associations. The Gene ontology analysis identified 14 candidate genes close to significant single nucleotide polymorphisms, with growth-related functions: six for days to 100 KG (WT1, FBXO3, DOCK7, PPP3CA, AGPAT9, and NKX6-1, seven for food conversion ratio (MAP2, TBX15, IVL, ARL15, CPS1, VWC2L, and VAV3, and one for average daily gain (COL27A1. Gene ontology analysis indicated that most of the candidate genes are involved in muscle, fat, bone or nervous system development, nutrient absorption, and metabolism, which are all either directly or indirectly related to growth traits in pigs. Additionally, we found four haplotype blocks composed of suggestive single nucleotide polymorphisms located in the growth trait-related quantitative trait loci and further narrowed down the ranges, the largest of which decreased by ~60 Mb. Hence, our results could be used to improve pig production traits by increasing the frequency of favorable alleles via artificial selection.

  10. A genome-wide association study of anorexia nervosa.

    Science.gov (United States)

    Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, A A; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M

    2014-10-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countries (15 sites) and 14 860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery data sets. Seventy-six (72 independent) single nucleotide polymorphisms were taken forward for in silico (two data sets) or de novo (13 data sets) replication genotyping in 2677 independent AN cases and 8629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication data sets comprised 5551 AN cases and 21 080 controls. AN subtype analyses (1606 AN restricting; 1445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01 × 10(-7)) in SOX2OT and rs17030795 (P=5.84 × 10(-6)) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76 × 10(-)(6)) between CUL3 and FAM124B and rs1886797 (P=8.05 × 10(-)(6)) near SPATA13. Comparing discovery with replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4 × 10(-6)), strongly suggesting that true findings exist but our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field.

  11. Genome-wide association and genomic selection in animal breeding.

    Science.gov (United States)

    Hayes, Ben; Goddard, Mike

    2010-11-01

    Results from genome-wide association studies in livestock, and humans, has lead to the conclusion that the effect of individual quantitative trait loci (QTL) on complex traits, such as yield, are likely to be small; therefore, a large number of QTL are necessary to explain genetic variation in these traits. Given this genetic architecture, gains from marker-assisted selection (MAS) programs using only a small number of DNA markers to trace a limited number of QTL is likely to be small. This has lead to the development of alternative technology for using the available dense single nucleotide polymorphism (SNP) information, called genomic selection. Genomic selection uses a genome-wide panel of dense markers so that all QTL are likely to be in linkage disequilibrium with at least one SNP. The genomic breeding values are predicted to be the sum of the effect of these SNPs across the entire genome. In dairy cattle breeding, the accuracy of genomic estimated breeding values (GEBV) that can be achieved and the fact that these are available early in life have lead to rapid adoption of the technology. Here, we discuss the design of experiments necessary to achieve accurate prediction of GEBV in future generations in terms of the number of markers necessary and the size of the reference population where marker effects are estimated. We also present a simple method for implementing genomic selection using a genomic relationship matrix. Future challenges discussed include using whole genome sequence data to improve the accuracy of genomic selection and management of inbreeding through genomic relationships.

  12. Functional Analysis of a Novel Genome-Wide Association Study Signal in SMAD3 That Confers Protection From Coronary Artery Disease

    DEFF Research Database (Denmark)

    Turner, Adam W.; Martinuk, Amy; Silva, Anada;

    2016-01-01

    OBJECTIVE—: A recent genome-wide association study meta-analysis identified an intronic single nucleotide polymorphism in SMAD3, rs56062135C>T, the minor allele (T) which associates with protection from coronary artery disease. Relevant to atherosclerosis, SMAD3 is a key contributor to transformi...

  13. Genetic integrity of the Dark European honey bee (Apis mellifera mellifera) from protected populations: a genome-wide assessment using SNPs and mtDNA sequence data

    DEFF Research Database (Denmark)

    Pinto, M Alice; Henriques, Dora; Chávez-Galarza, Julio

    2014-01-01

    to preserve the genetic integrity of A. m. mellifera, protected populations had a measurable component of their gene pool derived from commercial C-lineage honey bees. Here we used both sequence data from the tRNAleu-cox2 intergenic mtDNA region and a genome-wide scan, with over 1183 single nucleotide...

  14. Genome-wide scan for visceral leishmaniasis in mixed-breed dogs identifies candidate genes involved in T helper cells and macrophage signaling

    Science.gov (United States)

    We conducted a genome-wide scan for visceral leishmaniasis in mixed-breed dogs from a highly endemic area in Brazil using 149,648 single nucleotide polymorphism (SNP) markers genotyped in 20 cases and 28 controls. Using a mixed model approach, we found two candidate loci on canine autosomes 1 and 2....

  15. Common variants associated with breast cancer in genome-wide association studies are modifiers of breast cancer risk in BRCA1 and BRCA2 mutation carriers

    NARCIS (Netherlands)

    Wang, Xianshu; Pankratz, V. Shane; Fredericksen, Zachary; Tarrell, Robert; Karaus, Mary; McGuffog, Lesley; Pharaoh, Paul D. P.; Ponder, Bruce A. J.; Dunning, Alison M.; Peock, Susan; Cook, Margaret; Oliver, Clare; Frost, Debra; Sinilnikova, Olga M.; Stoppa-Lyonnet, Dominique; Mazoyer, Sylvie; Houdayer, Claude; Hogervorst, Frans B. L.; Hooning, Maartje J.; Ligtenberg, Marjolijn J.; Spurdle, Amanda; Chenevix-Trench, Georgia; Schmutzler, Rita K.; Wappenschmidt, Barbara; Engel, Christoph; Meindl, Alfons; Domchek, Susan M.; Nathanson, Katherine L.; Rebbeck, Timothy R.; Singer, Christian F.; Gschwantler-Kaulich, Daphne; Dressler, Catherina; Fink, Anneliese; Szabo, Csilla I.; Zikan, Michal; Foretova, Lenka; Claes, Kathleen; Thomas, Gilles; Hoover, Robert N.; Hunter, David J.; Chanock, Stephen J.; Easton, Douglas F.; Antoniou, Antonis C.; Couch, Fergus J.

    2010-01-01

    Recent studies have identified single nucleotide polymorphisms (SNPs) that significantly modify breast cancer risk in BRCA1 and BRCA2 mutation carriers. Since these risk modifiers were originally identified as genetic risk factors for breast cancer in genome-wide association studies (GWASs), additio

  16. Common variants associated with breast cancer in genome-wide association studies are modifiers of breast cancer risk in BRCA1 and BRCA2 mutation carriers.

    NARCIS (Netherlands)

    Wang, X.; Pankratz, V.S.; Fredericksen, Z.; Tarrell, R.; Karaus, M.; McGuffog, L.; Pharaoh, P.D.; Ponder, B.A.J.; Dunning, A.M.; Peock, S.; Cook, M.; Oliver, C.; Frost, D.; Sinilnikova, O.M.; Stoppa-Lyonnet, D.; Mazoyer, S.; Houdayer, C.; Hogervorst, F.B.L.; Hooning, M.J.; Ligtenberg, M.J.L.; Spurdle, A.; Chenevix-Trench, G.; Schmutzler, R.K.; Wappenschmidt, B.; Engel, C.; Meindl, A.; Domchek, S.M.; Nathanson, K.L.; Rebbeck, T.R.; Singer, C.F.; Gschwantler-Kaulich, D.; Dressler, C.; Fink, A.; Szabo, C.I.; Zikan, M.; Foretova, L.; Claes, K.; Thomas, G.; Hoover, R.N.; Hunter, D.J.; Chanock, S.J.; Easton, D.F.; Antoniou, A.C.; Couch, F.J.

    2010-01-01

    Recent studies have identified single nucleotide polymorphisms (SNPs) that significantly modify breast cancer risk in BRCA1 and BRCA2 mutation carriers. Since these risk modifiers were originally identified as genetic risk factors for breast cancer in genome-wide association studies (GWASs), additio

  17. Comparison of 6q25 Breast Cancer Hits from Asian and European Genome Wide Association Studies in the Breast Cancer Association Consortium (BCAC)

    NARCIS (Netherlands)

    Hein, Rebecca; Maranian, Melanie; Hopper, John L.; Kapuscinski, Miroslaw K.; Southey, Melissa C.; Park, Daniel J.; Schmidt, Marjanka K.; Broeks, Annegien; Hogervorst, Frans B. L.; Bueno-de-Mesquit, H. Bas; Muir, Kenneth R.; Lophatananon, Artitaya; Rattanamongkongul, Suthee; Puttawibul, Puttisak; Fasching, Peter A.; Hein, Alexander; Ekici, Arif B.; Beckmann, Matthias W.; Fletcher, Olivia; Johnson, Nichola; Silva, Isabel dos Santos; Peto, Julian; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Marmee, Frederick; Schneeweiss, Andreas; Sohn, Christof; Burwinkel, Barbara; Guenel, Pascal; Cordina-Duverger, Emilie; Menegaux, Florence; Truong, Therese; Bojesen, Stig E.; Nordestgaard, Borge G.; Flyger, Henrik; Milne, Roger L.; Arias Perez, Jose Ignacio; Pilar Zamora, M.; Benitez, Javier; Anton-Culver, Hoda; Ziogas, Argyrios; Bernstein, Leslie; Clarke, Christina A.; Brenner, Hermann; Mueller, Heiko; Arndt, Volker; Stegmaier, Christa; Rahman, Nazneen; Seal, Sheila; Turnbull, Clare; Renwick, Anthony; Meindl, Alfons; Schott, Sarah; Bartram, Claus R.; Schmutzler, Rita K.; Brauch, Hiltrud; Hamann, Ute; Ko, Yon-Dschun; Wang-Gohrke, Shan; Doerk, Thilo; Schuermann, Peter; Karstens, Johann H.; Hillemanns, Peter; Nevanlinna, Heli; Heikkinen, Tuomas; Aittomaki, Kristiina; Blomqvist, Carl; Bogdanova, Natalia V.; Zalutsky, Iosif V.; Antonenkova, Natalia N.; Bermisheva, Marina; Prokovieva, Darya; Farahtdinova, Albina; Khusnutdinova, Elza; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana; Chen, Xiaoqing; Beesley, Jonathan; Lambrechts, Diether; Zhao, Hui; Neven, Patrick; Wildiers, Hans; Nickels, Stefan; Flesch-Janys, Dieter; Radice, Paolo; Peterlongo, Paolo; Manoukian, Siranoush; Barile, Monica; Couch, Fergus J.; Olson, Janet E.; Wang, Xianshu; Fredericksen, Zachary; Giles, Graham G.; Baglietto, Laura; McLean, Catriona A.; Severi, Gianluca; Offit, Kenneth; Robson, Mark; Gaudet, Mia M.; Vijai, Joseph; Alnaes, Grethe Grenaker; Kristensen, Vessela; Borresen-Dale, Anne-Lise; John, Esther M.; Miron, Alexander; Winqvist, Robert; Pylkas, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Mulligan, Anna Marie; Figueroa, Jonine D.; Garcia-Closas, Montserrat; Lissowska, Jolanta; Sherman, Mark E.; Hooning, Maartje; Martens, John W. M.; Seynaeve, Caroline; Collee, Margriet; Hall, Per; Humpreys, Keith; Czene, Kamila; Liu, Jianjun; Cox, Angela; Brock, Ian W.; Cross, Simon S.; Reed, Malcolm W. R.; Ahmed, Shahana; Ghoussaini, Maya; Pharoah, Paul D. P.; Kang, Daehee; Yoo, Keun-Young; Noh, Dong-Young; Jakubowska, Anna; Jaworska, Katarzyna; Durda, Katarzyna; Zlowocka, Elzbieta; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Shen, Chen-Yang; Yu, Jyh-Cherng; Hsu, Huan-Ming; Hou, Ming-Feng; Orr, Nick; Schoemaker, Minouk; Ashworth, Alan; Swerdlow, Anthony; Trentham-Dietz, Amy; Newcomb, Polly A.; Titus, Linda; Egan, Kathleen M.; Chenevix-Trench, Georgia; Antoniou, Antonis C.; Humphreys, Manjeet K.; Morrison, Jonathan; Chang-Claude, Jenny; Easton, Douglas F.; Dunning, Alison M.

    2012-01-01

    The 6q25.1 locus was first identified via a genome-wide association study (GWAS) in Chinese women and marked by single nucleotide polymorphism (SNP) rs2046210, approximately 180 Kb upstream of ESR1. There have been conflicting reports about the association of this locus with breast cancer in Europea

  18. A genome-wide association study identifies a novel susceptibility locus for renal cell carcinoma on 12p11.23

    NARCIS (Netherlands)

    Wu, Xifeng; Scelo, Ghislaine; Purdue, Mark P.; Rothman, Nathaniel; Johansson, Mattias; Ye, Yuanqing; Wang, Zhaoming; Zelenika, Diana; Moore, Lee E.; Wood, Christopher G.; Prokhortchouk, Egor; Gaborieau, Valerie; Jacobs, Kevin B.; Chow, Wong-Ho; Toro, Jorge R.; Zaridze, David; Lin, Jie; Lubinski, Jan; Trubicka, Joanna; Szeszenia-Dabrowska, Neonilia; Lissowska, Jolanta; Rudnai, Peter; Fabianova, Eleonora; Mates, Dana; Jinga, Viorel; Bencko, Vladimir; Slamova, Alena; Holcatova, Ivana; Navratilova, Marie; Janout, Vladimir; Boffetta, Paolo; Colt, Joanne S.; Davis, Faith G.; Schwartz, Kendra L.; Banks, Rosamonde E.; Selby, Peter J.; Harnden, Patricia; Berg, Christine D.; Hsing, Ann W.; Grubb, Robert L.; Boeing, Heiner; Vineis, Paolo; Clavel-Chapelon, Francoise; Palli, Domenico; Tumino, Rosario; Krogh, Vittorio; Panico, Salvatore; Duell, Eric J.; Ramon Quiros, Jose; Sanchez, Maria-Jose; Navarro, Carmen; Ardanaz, Eva; Dorronsoro, Miren; Khaw, Kay-Tee; Allen, Naomi E.; Bueno-de-Mesquita, H. Bas; Peeters, Petra H. M.; Trichopoulos, Dimitrios; Linseisen, Jakob; Ljungberg, Borje; Overvad, Kim; Tjonneland, Anne; Romieu, Isabelle; Riboli, Elio; Stevens, Victoria L.; Thun, Michael J.; Diver, W. Ryan; Gapstur, Susan M.; Pharoah, Paul D.; Easton, Douglas F.; Albanes, Demetrius; Virtamo, Jarmo; Vatten, Lars; Hveem, Kristian; Fletcher, Tony; Koppova, Kvetoslava; Cussenot, Olivier; Cancel-Tassin, Geraldine; Benhamou, Simone; Hildebrandt, Michelle A.; Pu, Xia; Foglio, Mario; Lechner, Doris; Hutchinson, Amy; Yeager, Meredith; Fraumeni, Joseph F.; Lathrop, Mark; Skryabin, Konstantin G.; McKay, James D.; Gu, Jian; Brennan, Paul; Chanock, Stephen J.

    2012-01-01

    Renal cell carcinoma (RCC) is the most lethal urologic cancer. Only two common susceptibility loci for RCC have been confirmed to date. To identify additional RCC common susceptibility loci, we conducted an independent genome- wide association study (GWAS). We analyzed 533 191 single nucleotide poly

  19. Family-based Genome-wide Association Study of Frontal Theta Oscillations Identifies Potassium Channel Gene KCNJ6

    OpenAIRE

    Kang, Sun J.; Rangaswamy, Madhavi; Manz, Niklas; Wang, Jen-Chyong; Wetherill, Leah; Hinrichs, Tony; Almasy, Laura; Brooks, Andy; Chorlian, David B.; Dick, Danielle; Hesselbrock, Victor; Kramer, John; Kuperman, Sam; Nurnberger, John,; Rice, John

    2012-01-01

    Event-related oscillations (EROs) represent highly heritable neuroelectric correlates of cognitive processes that manifest deficits in alcoholics and in offspring at high risk to develop alcoholism. Theta ERO to targets in the visual oddball task has been shown to be an endophenotype for alcoholism. A family-based genome-wide association study was performed for the frontal theta ERO phenotype using 634583 autosomal single nucleotide polymorphisms (SNPs) genotyped in 1560 family members from 1...

  20. A genome-wide tree- and forest-based association analysis of comorbidity of alcoholism and smoking

    OpenAIRE

    Ye, Yuanqing; Zhong, Xiaoyun; Zhang, Heping

    2005-01-01

    Genetic mechanisms underlying alcoholism are complex. Understanding the etiology of alcohol dependence and its comorbid conditions such as smoking is important because of the significant health concerns. In this report, we describe a method based on classification trees and deterministic forests for association studies to perform a genome-wide joint association analysis of alcoholism and smoking. This approach is used to analyze the single-nucleotide polymorphism data from the Collaborative S...

  1. Genome-Wide Association Studies of the Human Gut Microbiota.

    Directory of Open Access Journals (Sweden)

    Emily R Davenport

    Full Text Available The bacterial composition of the human fecal microbiome is influenced by many lifestyle factors, notably diet. It is less clear, however, what role host genetics plays in dictating the composition of bacteria living in the gut. In this study, we examined the association of ~200K host genotypes with the relative abundance of fecal bacterial taxa in a founder population, the Hutterites, during two seasons (n = 91 summer, n = 93 winter, n = 57 individuals collected in both. These individuals live and eat communally, minimizing variation due to environmental exposures, including diet, which could potentially mask small genetic effects. Using a GWAS approach that takes into account the relatedness between subjects, we identified at least 8 bacterial taxa whose abundances were associated with single nucleotide polymorphisms in the host genome in each season (at genome-wide FDR of 20%. For example, we identified an association between a taxon known to affect obesity (genus Akkermansia and a variant near PLD1, a gene previously associated with body mass index. Moreover, we replicate a previously reported association from a quantitative trait locus (QTL mapping study of fecal microbiome abundance in mice (genus Lactococcus, rs3747113, P = 3.13 x 10-7. Finally, based on the significance distribution of the associated microbiome QTLs in our study with respect to chromatin accessibility profiles, we identified tissues in which host genetic variation may be acting to influence bacterial abundance in the gut.

  2. A genome-wide association study in multiple system atrophy

    Science.gov (United States)

    Sailer, Anna; Nalls, Michael A.; Schulte, Claudia; Federoff, Monica; Price, T. Ryan; Lees, Andrew; Ross, Owen A.; Dickson, Dennis W.; Mok, Kin; Mencacci, Niccolo E.; Schottlaender, Lucia; Chelban, Viorica; Ling, Helen; O'Sullivan, Sean S.; Wood, Nicholas W.; Traynor, Bryan J.; Ferrucci, Luigi; Federoff, Howard J.; Mhyre, Timothy R.; Morris, Huw R.; Deuschl, Günther; Quinn, Niall; Widner, Hakan; Albanese, Alberto; Infante, Jon; Bhatia, Kailash P.; Poewe, Werner; Oertel, Wolfgang; Höglinger, Günter U.; Wüllner, Ullrich; Goldwurm, Stefano; Pellecchia, Maria Teresa; Ferreira, Joaquim; Tolosa, Eduardo; Bloem, Bastiaan R.; Rascol, Olivier; Meissner, Wassilios G.; Hardy, John A.; Revesz, Tamas; Holton, Janice L.; Gasser, Thomas; Wenning, Gregor K.; Singleton, Andrew B.

    2016-01-01

    Objective: To identify genetic variants that play a role in the pathogenesis of multiple system atrophy (MSA), we undertook a genome-wide association study (GWAS). Methods: We performed a GWAS with >5 million genotyped and imputed single nucleotide polymorphisms (SNPs) in 918 patients with MSA of European ancestry and 3,864 controls. MSA cases were collected from North American and European centers, one third of which were neuropathologically confirmed. Results: We found no significant loci after stringent multiple testing correction. A number of regions emerged as potentially interesting for follow-up at p < 1 × 10−6, including SNPs in the genes FBXO47, ELOVL7, EDN1, and MAPT. Contrary to previous reports, we found no association of the genes SNCA and COQ2 with MSA. Conclusions: We present a GWAS in MSA. We have identified several potentially interesting gene loci, including the MAPT locus, whose significance will have to be evaluated in a larger sample set. Common genetic variation in SNCA and COQ2 does not seem to be associated with MSA. In the future, additional samples of well-characterized patients with MSA will need to be collected to perform a larger MSA GWAS, but this initial study forms the basis for these next steps. PMID:27629089

  3. Genome-wide association study of circulating retinol levels.

    Science.gov (United States)

    Mondul, Alison M; Yu, Kai; Wheeler, William; Zhang, Hong; Weinstein, Stephanie J; Major, Jacqueline M; Cornelis, Marilyn C; Männistö, Satu; Hazra, Aditi; Hsing, Ann W; Jacobs, Kevin B; Eliassen, Heather; Tanaka, Toshiko; Reding, Douglas J; Hendrickson, Sara; Ferrucci, Luigi; Virtamo, Jarmo; Hunter, David J; Chanock, Stephen J; Kraft, Peter; Albanes, Demetrius

    2011-12-01

    Retinol is one of the most biologically active forms of vitamin A and is hypothesized to influence a wide range of human diseases including asthma, cardiovascular disease, infectious diseases and cancer. We conducted a genome-wide association study of 5006 Caucasian individuals drawn from two cohorts of men: the Alpha-Tocopherol, Beta-Carotene Cancer Prevention (ATBC) Study and the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial. We identified two independent single-nucleotide polymorphisms associated with circulating retinol levels, which are located near the transthyretin (TTR) and retinol binding protein 4 (RBP4) genes which encode major carrier proteins of retinol: rs1667255 (P =2.30× 10(-17)) and rs10882272 (P =6.04× 10(-12)). We replicated the association with rs10882272 in RBP4 in independent samples from the Nurses' Health Study and the Invecchiare in Chianti Study (InCHIANTI) that included 3792 women and 504 men (P =9.49× 10(-5)), but found no association for retinol with rs1667255 in TTR among women, thus suggesting evidence for gender dimorphism (P-interaction=1.31× 10(-5)). Discovery of common genetic variants associated with serum retinol levels may provide further insight into the contribution of retinol and other vitamin A compounds to the development of cancer and other complex diseases.

  4. Psoriasis prediction from genome-wide SNP profiles

    Directory of Open Access Journals (Sweden)

    Fang Xiangzhong

    2011-01-01

    Full Text Available Abstract Background With the availability of large-scale genome-wide association study (GWAS data, choosing an optimal set of SNPs for disease susceptibility prediction is a challenging task. This study aimed to use single nucleotide polymorphisms (SNPs to predict psoriasis from searching GWAS data. Methods Totally we had 2,798 samples and 451,724 SNPs. Process for searching a set of SNPs to predict susceptibility for psoriasis consisted of two steps. The first one was to search top 1,000 SNPs with high accuracy for prediction of psoriasis from GWAS dataset. The second one was to search for an optimal SNP subset for predicting psoriasis. The sequential information bottleneck (sIB method was compared with classical linear discriminant analysis(LDA for classification performance. Results The best test harmonic mean of sensitivity and specificity for predicting psoriasis by sIB was 0.674(95% CI: 0.650-0.698, while only 0.520(95% CI: 0.472-0.524 was reported for predicting disease by LDA. Our results indicate that the new classifier sIB performs better than LDA in the study. Conclusions The fact that a small set of SNPs can predict disease status with average accuracy of 68% makes it possible to use SNP data for psoriasis prediction.

  5. Genome-Wide Association Study of Serum Selenium Concentrations

    Directory of Open Access Journals (Sweden)

    Ulrike Peters

    2013-05-01

    Full Text Available Selenium is an essential trace element and circulating selenium concentrations have been associated with a wide range of diseases. Candidate gene studies suggest that circulating selenium concentrations may be impacted by genetic variation; however, no study has comprehensively investigated this hypothesis. Therefore, we conducted a two-stage genome-wide association study to identify genetic variants associated with serum selenium concentrations in 1203 European descents from two cohorts: the Prostate, Lung, Colorectal, and Ovarian (PLCO Cancer Screening and the Women’s Health Initiative (WHI. We tested association between 2,474,333 single nucleotide polymorphisms (SNPs and serum selenium concentrations using linear regression models. In the first stage (PLCO 41 SNPs clustered in 15 regions had p < 1 × 10−5. None of these 41 SNPs reached the significant threshold (p = 0.05/15 regions = 0.003 in the second stage (WHI. Three SNPs had p < 0.05 in the second stage (rs1395479 and rs1506807 in 4q34.3/AGA-NEIL3; and rs891684 in 17q24.3/SLC39A11 and had p between 2.62 × 10−7 and 4.04 × 10−7 in the combined analysis (PLCO + WHI. Additional studies are needed to replicate these findings. Identification of genetic variation that impacts selenium concentrations may contribute to a better understanding of which genes regulate circulating selenium concentrations.

  6. Genome-wide association study and premature ovarian failure.

    Science.gov (United States)

    Christin-Maitre, S; Tachdjian, G

    2010-05-01

    Premature ovarian failure (POF) is defined as an amenorrhea for more than 4months, associated with elevated gonadotropins, usually higher than 20mIU/ml, occurring in a woman before the age of 40. Some candidate genes have been identified in the past 15years, such as FOXL2, FSHR, BMP15, GDF9, Xfra premutation. However, POF etiology remains unknown in more than 90% of cases. The first strategy to identify candidate gene, apart from studying genes involved in ovarian failure in animal models, relies on the study of X chromosome deletions and X;autosome translocations in patients. The second strategy is based on linkage analysis, the third one on Comparative Genomic Hybridization (CGH) array. The latest strategy relies on Genome-Wide Association Studies (GWAS). This technique consists in screening single nucleotide polymorphisms (SNPs) in patients and controls. So far, three studies have been performed and have identified different loci potentially linked to POF, such as PTHB1 and ADAMTS19. However, replications in independent cohorts need to be performed. GWAS studies on large cohorts of women with POF should find new candidate genes in the near future.

  7. Genome-wide association study of antisocial personality disorder

    Science.gov (United States)

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-01-01

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53–3.14), P=1.9 × 10-5). Two polymorphisms at 6p21.2 LINC00951–LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37–1.85), P=1.6 × 10−9) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967

  8. Non-replication study of a genome-wide association study for hypertension and blood pressure in African Americans

    Directory of Open Access Journals (Sweden)

    Kidambi Srividya

    2012-04-01

    Full Text Available Abstract Background A recent genome wide association study in 1017 African Americans identified several single nucleotide polymorphisms that reached genome-wide significance for systolic blood pressure. We attempted to replicate these findings in an independent sample of 2474 unrelated African Americans in the Milwaukee metropolitan area; 53% were women and 47% were hypertensives. Methods We evaluated sixteen top associated SNPs from the above genome wide association study for hypertension as a binary trait or blood pressure as a continuous trait. In addition, we evaluated eight single nucleotide polymorphisms located in two genes (STK-39 and CDH-13 found to be associated with systolic and diastolic blood pressures by other genome wide association studies in European and Amish populations. TaqMan MGB-based chemistry with fluorescent probes was used for genotyping. We had an adequate sample size (80% power to detect an effect size of 1.2-2.0 for all the single nucleotide polymorphisms for hypertension as a binary trait, and 1% variance in blood pressure as a continuous trait. Quantitative trait analyses were performed both by excluding and also by including subjects on anti-hypertensive therapy (after adjustments were made for anti-hypertensive medications. Results For all 24 SNPs, no statistically significant differences were noted in the minor allele frequencies between cases and controls. One SNP (rs2146204 showed borderline association (p = 0.006 with hypertension status using recessive model and systolic blood pressure (p = 0.02, but was not significant after adjusting for multiple comparisons. In quantitative trait analyses, among normotensives only, rs12748299 was associated with SBP (p = 0.002. In addition, several nominally significant associations were noted with SBP and DBP among normotensives but none were statistically significant. Conclusions This study highlights the importance of replication to confirm the validity of genome wide

  9. Evidence for single nucleotide polymorphisms and their association with bipolar disorder

    Directory of Open Access Journals (Sweden)

    Szczepankiewicz A

    2013-10-01

    Full Text Available Aleksandra Szczepankiewicz1,21Laboratory of Molecular and Cell Biology, 2Department of Psychiatric Genetics, Poznan University of Medical Sciences, Poznan, PolandAbstract: Bipolar disorder (BD is a complex disorder with a number of susceptibility genes and environmental risk factors involved in its pathogenesis. In recent years, huge progress has been made in molecular techniques for genetic studies, which have enabled identification of numerous genomic regions and genetic variants implicated in BD across populations. Despite the abundance of genetic findings, the results have often been inconsistent and not replicated for many candidate genes/single nucleotide polymorphisms (SNPs. Therefore, the aim of the review presented here is to summarize the most important data reported so far in candidate gene and genome-wide association studies. Taking into account the abundance of association data, this review focuses on the most extensively studied genes and polymorphisms reported so far for BD to present the most promising genomic regions/SNPs involved in BD. The review of association data reveals evidence for several genes (SLC6A4/5-HTT [serotonin transporter gene], BDNF [brain-derived neurotrophic factor], DAOA [D-amino acid oxidase activator], DTNBP1 [dysbindin], NRG1 [neuregulin 1], DISC1 [disrupted in schizophrenia 1] to be crucial candidates in BD, whereas numerous genome-wide association studies conducted in BD indicate polymorphisms in two genes (CACNA1C [calcium channel, voltage-dependent, L type, alpha 1C subunit], ANK3 [ankyrin 3] replicated for association with BD in most of these studies. Nevertheless, further studies focusing on interactions between multiple candidate genes/SNPs, as well as systems biology and pathway analyses are necessary to integrate and improve the way we analyze the currently available association data.Keywords: candidate gene, genome-wide association study, SLC6A4, BDNF, DAOA, DTNBP1, NRG1, DISC1

  10. Compositions and methods for detecting single nucleotide polymorphisms

    Energy Technology Data Exchange (ETDEWEB)

    Yeh, Hsin-Chih; Werner, James; Martinez, Jennifer S.

    2016-11-22

    Described herein are nucleic acid based probes and methods for discriminating and detecting single nucleotide variants in nucleic acid molecules (e.g., DNA). The methods include use of a pair of probes can be used to detect and identify polymorphisms, for example single nucleotide polymorphism in DNA. The pair of probes emit a different fluorescent wavelength of light depending on the association and alignment of the probes when hybridized to a target nucleic acid molecule. Each pair of probes is capable of discriminating at least two different nucleic acid molecules that differ by at least a single nucleotide difference. The methods can probes can be used, for example, for detection of DNA polymorphisms that are indicative of a particular disease or condition.

  11. Genome-wide meta-analyses of smoking behaviors in African Americans.

    Science.gov (United States)

    David, S P; Hamidovic, A; Chen, G K; Bergen, A W; Wessel, J; Kasberger, J L; Brown, W M; Petruzella, S; Thacker, E L; Kim, Y; Nalls, M A; Tranah, G J; Sung, Y J; Ambrosone, C B; Arnett, D; Bandera, E V; Becker, D M; Becker, L; Berndt, S I; Bernstein, L; Blot, W J; Broeckel, U; Buxbaum, S G; Caporaso, N; Casey, G; Chanock, S J; Deming, S L; Diver, W R; Eaton, C B; Evans, D S; Evans, M K; Fornage, M; Franceschini, N; Harris, T B; Henderson, B E; Hernandez, D G; Hitsman, B; Hu, J J; Hunt, S C; Ingles, S A; John, E M; Kittles, R; Kolb, S; Kolonel, L N; Le Marchand, L; Liu, Y; Lohman, K K; McKnight, B; Millikan, R C; Murphy, A; Neslund-Dudas, C; Nyante, S; Press, M; Psaty, B M; Rao, D C; Redline, S; Rodriguez-Gil, J L; Rybicki, B A; Signorello, L B; Singleton, A B; Smoller, J; Snively, B; Spring, B; Stanford, J L; Strom, S S; Swan, G E; Taylor, K D; Thun, M J; Wilson, A F; Witte, J S; Yamamura, Y; Yanek, L R; Yu, K; Zheng, W; Ziegler, R G; Zonderman, A B; Jorgenson, E; Haiman, C A; Furberg, H

    2012-05-22

    The identification and exploration of genetic loci that influence smoking behaviors have been conducted primarily in populations of the European ancestry. Here we report results of the first genome-wide association study meta-analysis of smoking behavior in African Americans in the Study of Tobacco in Minority Populations Genetics Consortium (n = 32,389). We identified one non-coding single-nucleotide polymorphism (SNP; rs2036527[A]) on chromosome 15q25.1 associated with smoking quantity (cigarettes per day), which exceeded genome-wide significance (β = 0.040, s.e. = 0.007, P = 1.84 × 10(-8)). This variant is present in the 5'-distal enhancer region of the CHRNA5 gene and defines the primary index signal reported in studies of the European ancestry. No other SNP reached genome-wide significance for smoking initiation (SI, ever vs never smoking), age of SI, or smoking cessation (SC, former vs current smoking). Informative associations that approached genome-wide significance included three modestly correlated variants, at 15q25.1 within PSMA4, CHRNA5 and CHRNA3 for smoking quantity, which are associated with a second signal previously reported in studies in European ancestry populations, and a signal represented by three SNPs in the SPOCK2 gene on chr10q22.1. The association at 15q25.1 confirms this region as an important susceptibility locus for smoking quantity in men and women of African ancestry. Larger studies will be needed to validate the suggestive loci that did not reach genome-wide significance and further elucidate the contribution of genetic variation to disparities in cigarette consumption, SC and smoking-attributable disease between African Americans and European Americans.

  12. Genome-Wide Association Study of Schizophrenia in Japanese Population

    Science.gov (United States)

    Yamada, Kazuo; Iwayama, Yoshimi; Hattori, Eiji; Iwamoto, Kazuya; Toyota, Tomoko; Ohnishi, Tetsuo; Ohba, Hisako; Maekawa, Motoko; Kato, Tadafumi; Yoshikawa, Takeo

    2011-01-01

    Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS) is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs) in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions). The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila)-like 2] gene located on 9p21.3 (p = 0.00087). In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals) of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026). The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology. PMID:21674006

  13. Genome-wide association study of schizophrenia in Japanese population.

    Directory of Open Access Journals (Sweden)

    Kazuo Yamada

    Full Text Available Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions. The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila-like 2] gene located on 9p21.3 (p = 0.00087. In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026. The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology.

  14. Susceptibility to chronic mucus hypersecretion, a genome wide association study.

    Directory of Open Access Journals (Sweden)

    Akkelies E Dijkstra

    Full Text Available BACKGROUND: Chronic mucus hypersecretion (CMH is associated with an increased frequency of respiratory infections, excess lung function decline, and increased hospitalisation and mortality rates in the general population. It is associated with smoking, but it is unknown why only a minority of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA study of CMH in Caucasian populations. METHODS: GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years. Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism (SNP. RESULTS: A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10(-6, OR = 1.17, located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1 on chromosome 3. The risk allele (G was associated with higher mRNA expression of SATB1 (4.3×10(-9 in lung tissue. Presence of CMH was associated with increased SATB1 mRNA expression in bronchial biopsies from COPD patients. SATB1 expression was induced during differentiation of primary human bronchial epithelial cells in culture. CONCLUSIONS: Our findings, that SNP rs6577641 is associated with CMH in multiple cohorts and is a cis-eQTL for SATB1, together with our additional observation that SATB1 expression increases during epithelial differentiation provide suggestive evidence that SATB1 is a gene that affects CMH.

  15. Mosaic paternal genome-wide uniparental isodisomy with down syndrome.

    Science.gov (United States)

    Darcy, Diana; Atwal, Paldeep Singh; Angell, Cathy; Gadi, Inder; Wallerstein, Robert

    2015-10-01

    We report on a 6-month-old girl with two apparent cell lines; one with trisomy 21, and the other with paternal genome-wide uniparental isodisomy (GWUPiD), identified using single nucleotide polymorphism (SNP) based microarray and microsatellite analysis of polymorphic loci. The patient has Beckwith-Wiedemann syndrome (BWS) due to paternal uniparental disomy (UPD) at chromosome location 11p15 (UPD 11p15), which was confirmed through methylation analysis. Hyperinsulinemic hypoglycemia is present, which is associated with paternal UPD 11p15.5; and she likely has medullary nephrocalcinosis, which is associated with paternal UPD 20, although this was not biochemically confirmed. Angelman syndrome (AS) analysis was negative but this testing is not completely informative; she has no specific features of AS. Clinical features of this patient include: dysmorphic features consistent with trisomy 21, tetralogy of Fallot, hemihypertrophy, swirled skin hyperpigmentation, hepatoblastoma, and Wilms tumor. Her karyotype is 47,XX,+21[19]/46,XX[4], and microarray results suggest that the cell line with trisomy 21 is biparentally inherited and represents 40-50% of the genomic material in the tested specimen. The difference in the level of cytogenetically detected mosaicism versus the level of mosaicism observed via microarray analysis is likely caused by differences in the test methodologies. While a handful of cases of mosaic paternal GWUPiD have been reported, this patient is the only reported case that also involves trisomy 21. Other GWUPiD patients have presented with features associated with multiple imprinted regions, as does our patient.

  16. Susceptibility to Chronic Mucus Hypersecretion, a Genome Wide Association Study

    Science.gov (United States)

    Dijkstra, Akkelies E.; Smolonska, Joanna; van den Berge, Maarten; Wijmenga, Ciska; Zanen, Pieter; Luinge, Marjan A.; Platteel, Mathieu; Lammers, Jan-Willem; Dahlback, Magnus; Tosh, Kerrie; Hiemstra, Pieter S.; Sterk, Peter J.; Spira, Avi; Vestbo, Jorgen; Nordestgaard, Borge G.; Benn, Marianne; Nielsen, Sune F.; Dahl, Morten; Verschuren, W. Monique; Picavet, H. Susan J.; Smit, Henriette A.; Owsijewitsch, Michael; Kauczor, Hans U.; de Koning, Harry J.; Nizankowska-Mogilnicka, Eva; Mejza, Filip; Nastalek, Pawel; van Diemen, Cleo C.; Cho, Michael H.; Silverman, Edwin K.; Crapo, James D.; Beaty, Terri H.; Lomas, David A.; Bakke, Per; Gulsvik, Amund; Bossé, Yohan; Obeidat, M. A.; Loth, Daan W.; Lahousse, Lies; Rivadeneira, Fernando; Uitterlinden, Andre G.; Hofman, Andre; Stricker, Bruno H.; Brusselle, Guy G.; van Duijn, Cornelia M.; Brouwer, Uilke; Koppelman, Gerard H.; Vonk, Judith M.; Nawijn, Martijn C.; Groen, Harry J. M.; Timens, Wim; Boezen, H. Marike; Postma, Dirkje S.

    2014-01-01

    Background Chronic mucus hypersecretion (CMH) is associated with an increased frequency of respiratory infections, excess lung function decline, and increased hospitalisation and mortality rates in the general population. It is associated with smoking, but it is unknown why only a minority of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA) study of CMH in Caucasian populations. Methods GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years). Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism (SNP). Results A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10−6, OR = 1.17), located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1) on chromosome 3. The risk allele (G) was associated with higher mRNA expression of SATB1 (4.3×10−9) in lung tissue. Presence of CMH was associated with increased SATB1 mRNA expression in bronchial biopsies from COPD patients. SATB1 expression was induced during differentiation of primary human bronchial epithelial cells in culture. Conclusions Our findings, that SNP rs6577641 is associated with CMH in multiple cohorts and is a cis-eQTL for SATB1, together with our additional observation that SATB1 expression increases during epithelial differentiation provide suggestive evidence that SATB1 is a gene that affects CMH. PMID:24714607

  17. Genome-Wide Associations of Gene Expression Variation in Humans.

    Directory of Open Access Journals (Sweden)

    2005-12-01

    Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.

  18. Genome-wide associations of gene expression variation in humans.

    Directory of Open Access Journals (Sweden)

    Barbara E Stranger

    2005-12-01

    Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.

  19. International Genome-Wide Association Study Consortium Identifies Novel Loci Associated With Blood Pressure in Children and Adolescents

    Science.gov (United States)

    Parmar, Priyakumari Ganesh; Taal, H. Rob; Timpson, Nicholas J.; Howe, Laura D.; Verwoert, Germaine; Aalto, Ville; Uitterlinden, Andre G.; Briollais, Laurent; Evans, Dave M.; Wright, Margie J.; Newnham, John P.; Whitfield, John B.; Lyytikäinen, Leo-Pekka; Rivadeneira, Fernando; Boomsma, Dorrett I.; Viikari, Jorma; Gillman, Matthew W.; St Pourcain, Beate; Hottenga, Jouke-Jan; Montgomery, Grant W.; Hofman, Albert; Kähönen, Mika; Martin, Nicholas G.; Tobin, Martin D.; Raitakari, Ollie; Vioque, Jesus; Jaddoe, Vincent W.V.; Jarvelin, Marjo-Riita; Beilin, Lawrence J.; Heinrich, Joachim; van Duijn, Cornelia M.; Pennell, Craig E.; Lawlor, Debbie A.; Palmer, Lyle J.

    2017-01-01

    Background Our aim was to identify genetic variants associated with blood pressure (BP) in childhood and adolescence. Methods and Results Genome-wide association study data from participating European ancestry cohorts of the Early Genetics and Lifecourse Epidemiology (EAGLE) Consortium was meta-analyzed across 3 epochs; prepuberty (4–7 years), puberty (8–12 years), and postpuberty (13–20 years). Two novel loci were identified as having genome-wide associations with systolic BP across specific age epochs: rs1563894 (ITGA11, located in active H3K27Ac mark and transcription factor chromatin immunoprecipitation and 5′-C-phosphate-G-3′ methylation site) during prepuberty (P=2.86×10–8) and rs872256 during puberty (P=8.67×10–9). Several single-nucleotide polymorphism clusters were also associated with childhood BP at P<5×10–3. Using a P value threshold of <5×10–3, we found some overlap in variants across the different age epochs within our study and between several single-nucleotide polymorphisms in any of the 3 epochs and adult BP-related single-nucleotide polymorphisms. Conclusions Our results suggest that genetic determinants of BP act from childhood, develop over the lifecourse, and show some evidence of age-specific effects. PMID:26969751

  20. A Laboratory Exercise for Genotyping Two Human Single Nucleotide Polymorphisms

    Science.gov (United States)

    Fernando, James; Carlson, Bradley; LeBard, Timothy; McCarthy, Michael; Umali, Finianne; Ashton, Bryce; Rose, Ferrill F., Jr.

    2016-01-01

    The dramatic decrease in the cost of sequencing a human genome is leading to an era in which a wide range of students will benefit from having an understanding of human genetic variation. Since over 90% of sequence variation between humans is in the form of single nucleotide polymorphisms (SNPs), a laboratory exercise has been devised in order to…

  1. Single nucleotide polymorphisms associated with rat expressed sequences

    NARCIS (Netherlands)

    Guryev, Victor; Berezikov, Eugene; Malik, Rainer; Plasterk, Ronald H A; Cuppen, Edwin

    2004-01-01

    Single nucleotide polymorphisms (SNPs) are the most common source of genetic variation in populations and are thus most likely to account for the majority of phenotypic and behavioral differences between individuals or strains. Although the rat is extensively studied for the latter, data on naturall

  2. Single Nucleotide Polymorphisms Predict Symptom Severity of Autism Spectrum Disorder

    Science.gov (United States)

    Jiao, Yun; Chen, Rong; Ke, Xiaoyan; Cheng, Lu; Chu, Kangkang; Lu, Zuhong; Herskovits, Edward H.

    2012-01-01

    Autism is widely believed to be a heterogeneous disorder; diagnosis is currently based solely on clinical criteria, although genetic, as well as environmental, influences are thought to be prominent factors in the etiology of most forms of autism. Our goal is to determine whether a predictive model based on single-nucleotide polymorphisms (SNPs)…

  3. Mining for Single Nucleotide Polymorphisms in Pig genome sequence data

    NARCIS (Netherlands)

    Kerstens, H.H.D.; Kollers, S.; Kommandath, A.; Rosario, del M.; Dibbits, B.W.; Kinders, S.M.; Crooijmans, R.P.M.A.; Groenen, M.A.M.

    2009-01-01

    Background - Single nucleotide polymorphisms (SNPs) are ideal genetic markers due to their high abundance and the highly automated way in which SNPs are detected and SNP assays are performed. The number of SNPs identified in the pig thus far is still limited. Results - A total of 4.8 million whole g

  4. Genome-wide association study for wool production traits in a Chinese Merino sheep population.

    Directory of Open Access Journals (Sweden)

    Zhipeng Wang

    Full Text Available Genome-wide association studies (GWAS provide a powerful approach for identifying quantitative trait loci without prior knowledge of location or function. To identify loci associated with wool production traits, we performed a genome-wide association study on a total of 765 Chinese Merino sheep (JunKen type genotyped with 50 K single nucleotide polymorphisms (SNPs. In the present study, five wool production traits were examined: fiber diameter, fiber diameter coefficient of variation, fineness dispersion, staple length and crimp. We detected 28 genome-wide significant SNPs for fiber diameter, fiber diameter coefficient of variation, fineness dispersion, and crimp trait in the Chinese Merino sheep. About 43% of the significant SNP markers were located within known or predicted genes, including YWHAZ, KRTCAP3, TSPEAR, PIK3R4, KIF16B, PTPN3, GPRC5A, DDX47, TCF9, TPTE2, EPHA5 and NBEA genes. Our results not only confirm the results of previous reports, but also provide a suite of novel SNP markers and candidate genes associated with wool traits. Our findings will be useful for exploring the genetic control of wool traits in sheep.

  5. Genome-wide association study for wool production traits in a Chinese Merino sheep population.

    Science.gov (United States)

    Wang, Zhipeng; Zhang, Hui; Yang, Hua; Wang, Shouzhi; Rong, Enguang; Pei, Wenyu; Li, Hui; Wang, Ning

    2014-01-01

    Genome-wide association studies (GWAS) provide a powerful approach for identifying quantitative trait loci without prior knowledge of location or function. To identify loci associated with wool production traits, we performed a genome-wide association study on a total of 765 Chinese Merino sheep (JunKen type) genotyped with 50 K single nucleotide polymorphisms (SNPs). In the present study, five wool production traits were examined: fiber diameter, fiber diameter coefficient of variation, fineness dispersion, staple length and crimp. We detected 28 genome-wide significant SNPs for fiber diameter, fiber diameter coefficient of variation, fineness dispersion, and crimp trait in the Chinese Merino sheep. About 43% of the significant SNP markers were located within known or predicted genes, including YWHAZ, KRTCAP3, TSPEAR, PIK3R4, KIF16B, PTPN3, GPRC5A, DDX47, TCF9, TPTE2, EPHA5 and NBEA genes. Our results not only confirm the results of previous reports, but also provide a suite of novel SNP markers and candidate genes associated with wool traits. Our findings will be useful for exploring the genetic control of wool traits in sheep.

  6. Genome-wide association study of hepatitis C virus- and cryoglobulin-related vasculitis.

    Science.gov (United States)

    Zignego, A L; Wojcik, G L; Cacoub, P; Visentini, M; Casato, M; Mangia, A; Latanich, R; Charles, E D; Gragnani, L; Terrier, B; Piazzola, V; Dustin, L B; Khakoo, S I; Busch, M P; Lauer, G M; Kim, A Y; Alric, L; Thomas, D L; Duggal, P

    2014-10-01

    The host genetic basis of mixed cryoglobulin vasculitis is not well understood and has not been studied in large cohorts. A genome-wide association study was conducted among 356 hepatitis C virus (HCV) RNA-positive individuals with cryoglobulin-related vasculitis and 447 ethnically matched, HCV RNA-positive controls. All cases had both serum cryoglobulins and a vasculitis syndrome. A total of 899 641 markers from the Illumina HumanOmni1-Quad chip were analyzed using logistic regression adjusted for sex, as well as genetically determined ancestry. Replication of select single-nucleotide polymorphisms (SNPs) was conducted using 91 cases and 180 controls, adjusting for sex and country of origin. The most significant associations were identified on chromosome 6 near the NOTCH4 and MHC class II genes. A genome-wide significant association was detected on chromosome 6 at SNP rs9461776 (odds ratio=2.16, P=1.16E-07) between HLA-DRB1 and DQA1: this association was further replicated in additional independent samples (meta-analysis P=7.1 × 10(-9)). A genome-wide significant association with cryoglobulin-related vasculitis was identified with SNPs near NOTCH4 and MHC Class II genes. The two regions are correlated and it is difficult to disentangle which gene is responsible for the association with mixed cryoglobulinemia vasculitis in this extended major histocompatibility complex region.

  7. A Pilot Genome-Wide Association Study Identifies Potential Metabolic Pathways Involved in Tinnitus

    Science.gov (United States)

    Gilles, Annick; Van Camp, Guy; Van de Heyning, Paul; Fransen, Erik

    2017-01-01

    Tinnitus, the perception of an auditory phantom sound in the form of ringing, buzzing, roaring, or hissing in the absence of an external sound source, is perceived by ~15% of the population and 2.5% experiences a severely bothersome tinnitus. The contribution of genes on the development of tinnitus is still under debate. The current manuscript reports a pilot Genome Wide Association Study (GWAS) into tinnitus, in a small cohort of 167 independent tinnitus subjects, and 749 non-tinnitus controls, who were collected as part of a cross-sectional study. After genotyping, imputation, and quality checking, the association between the tinnitus phenotype and 4,000,000 single-nucleotide polymorphisms (SNPs) was tested followed by gene set enrichment analysis. None of the SNPs reached the threshold for genome-wide significance (p tinnitus phenotype. Despite the lack of genome-wide significant SNPs, which is, at least in part, due to the limited sample size of the current study, evidence was found for a genetic involvement in tinnitus. Gene set enrichment analysis showed several metabolic pathways to be significantly enriched with SNPs having a low p-value in the GWAS. These pathways are involved in oxidative stress, endoplasmatic reticulum (ER) stress, and serotonin reception mediated signaling. These results are a promising basis for further research into the genetic basis of tinnitus, including GWAS with larger sample sizes and considering tinnitus subtypes for which a greater genetic contribution is more likely. PMID:28303087

  8. Genome-wide association study identifies multiple loci associated with bladder cancer risk

    Science.gov (United States)

    Figueroa, Jonine D.; Ye, Yuanqing; Siddiq, Afshan; Garcia-Closas, Montserrat; Chatterjee, Nilanjan; Prokunina-Olsson, Ludmila; Cortessis, Victoria K.; Kooperberg, Charles; Cussenot, Olivier; Benhamou, Simone; Prescott, Jennifer; Porru, Stefano; Dinney, Colin P.; Malats, Núria; Baris, Dalsu; Purdue, Mark; Jacobs, Eric J.; Albanes, Demetrius; Wang, Zhaoming; Deng, Xiang; Chung, Charles C.; Tang, Wei; Bas Bueno-de-Mesquita, H.; Trichopoulos, Dimitrios; Ljungberg, Börje; Clavel-Chapelon, Françoise; Weiderpass, Elisabete; Krogh, Vittorio; Dorronsoro, Miren; Travis, Ruth; Tjønneland, Anne; Brenan, Paul; Chang-Claude, Jenny; Riboli, Elio; Conti, David; Gago-Dominguez, Manuela; Stern, Mariana C.; Pike, Malcolm C.; Van Den Berg, David; Yuan, Jian-Min; Hohensee, Chancellor; Rodabough, Rebecca; Cancel-Tassin, Geraldine; Roupret, Morgan; Comperat, Eva; Chen, Constance; De Vivo, Immaculata; Giovannucci, Edward; Hunter, David J.; Kraft, Peter; Lindstrom, Sara; Carta, Angela; Pavanello, Sofia; Arici, Cecilia; Mastrangelo, Giuseppe; Kamat, Ashish M.; Lerner, Seth P.; Barton Grossman, H.; Lin, Jie; Gu, Jian; Pu, Xia; Hutchinson, Amy; Burdette, Laurie; Wheeler, William; Kogevinas, Manolis; Tardón, Adonina; Serra, Consol; Carrato, Alfredo; García-Closas, Reina; Lloreta, Josep; Schwenn, Molly; Karagas, Margaret R.; Johnson, Alison; Schned, Alan; Armenti, Karla R.; Hosain, G.M.; Andriole, Gerald; Grubb, Robert; Black, Amanda; Ryan Diver, W.; Gapstur, Susan M.; Weinstein, Stephanie J.; Virtamo, Jarmo; Haiman, Chris A.; Landi, Maria T.; Caporaso, Neil; Fraumeni, Joseph F.; Vineis, Paolo; Wu, Xifeng; Silverman, Debra T.; Chanock, Stephen; Rothman, Nathaniel

    2014-01-01

    Candidate gene and genome-wide association studies (GWAS) have identified 11 independent susceptibility loci associated with bladder cancer risk. To discover additional risk variants, we conducted a new GWAS of 2422 bladder cancer cases and 5751 controls, followed by a meta-analysis with two independently published bladder cancer GWAS, resulting in a combined analysis of 6911 cases and 11 814 controls of European descent. TaqMan genotyping of 13 promising single nucleotide polymorphisms with P < 1 × 10−5 was pursued in a follow-up set of 801 cases and 1307 controls. Two new loci achieved genome-wide statistical significance: rs10936599 on 3q26.2 (P = 4.53 × 10−9) and rs907611 on 11p15.5 (P = 4.11 × 10−8). Two notable loci were also identified that approached genome-wide statistical significance: rs6104690 on 20p12.2 (P = 7.13 × 10−7) and rs4510656 on 6p22.3 (P = 6.98 × 10−7); these require further studies for confirmation. In conclusion, our study has identified new susceptibility alleles for bladder cancer risk that require fine-mapping and laboratory investigation, which could further understanding into the biological underpinnings of bladder carcinogenesis. PMID:24163127

  9. Comprehensive identification of single nucleotide polymorphisms associated with beta-lactam resistance within pneumococcal mosaic genes.

    Directory of Open Access Journals (Sweden)

    Claire Chewapreecha

    2014-08-01

    Full Text Available Traditional genetic association studies are very difficult in bacteria, as the generally limited recombination leads to large linked haplotype blocks, confounding the identification of causative variants. Beta-lactam antibiotic resistance in Streptococcus pneumoniae arises readily as the bacteria can quickly incorporate DNA fragments encompassing variants that make the transformed strains resistant. However, the causative mutations themselves are embedded within larger recombined blocks, and previous studies have only analysed a limited number of isolates, leading to the description of "mosaic genes" as being responsible for resistance. By comparing a large number of genomes of beta-lactam susceptible and non-susceptible strains, the high frequency of recombination should break up these haplotype blocks and allow the use of genetic association approaches to identify individual causative variants. Here, we performed a genome-wide association study to identify single nucleotide polymorphisms (SNPs and indels that could confer beta-lactam non-susceptibility using 3,085 Thai and 616 USA pneumococcal isolates as independent datasets for the variant discovery. The large sample sizes allowed us to narrow the source of beta-lactam non-susceptibility from long recombinant fragments down to much smaller loci comprised of discrete or linked SNPs. While some loci appear to be universal resistance determinants, contributing equally to non-susceptibility for at least two classes of beta-lactam antibiotics, some play a larger role in resistance to particular antibiotics. All of the identified loci have a highly non-uniform distribution in the populations. They are enriched not only in vaccine-targeted, but also non-vaccine-targeted lineages, which may raise clinical concerns. Identification of single nucleotide polymorphisms underlying resistance will be essential for future use of genome sequencing to predict antibiotic sensitivity in clinical microbiology.

  10. Computational Analysis of Single Nucleotide Polymorphisms Associated with Altered Drug Responsiveness in Type 2 Diabetes

    Science.gov (United States)

    Costa, Valerio; Federico, Antonio; Pollastro, Carla; Ziviello, Carmela; Cataldi, Simona; Formisano, Pietro; Ciccodicola, Alfredo

    2016-01-01

    Type 2 diabetes (T2D) is one of the most frequent mortality causes in western countries, with rapidly increasing prevalence. Anti-diabetic drugs are the first therapeutic approach, although many patients develop drug resistance. Most drug responsiveness variability can be explained by genetic causes. Inter-individual variability is principally due to single nucleotide polymorphisms, and differential drug responsiveness has been correlated to alteration in genes involved in drug metabolism (CYP2C9) or insulin signaling (IRS1, ABCC8, KCNJ11 and PPARG). However, most genome-wide association studies did not provide clues about the contribution of DNA variations to impaired drug responsiveness. Thus, characterizing T2D drug responsiveness variants is needed to guide clinicians toward tailored therapeutic approaches. Here, we extensively investigated polymorphisms associated with altered drug response in T2D, predicting their effects in silico. Combining different computational approaches, we focused on the expression pattern of genes correlated to drug resistance and inferred evolutionary conservation of polymorphic residues, computationally predicting the biochemical properties of polymorphic proteins. Using RNA-Sequencing followed by targeted validation, we identified and experimentally confirmed that two nucleotide variations in the CAPN10 gene—currently annotated as intronic—fall within two new transcripts in this locus. Additionally, we found that a Single Nucleotide Polymorphism (SNP), currently reported as intergenic, maps to the intron of a new transcript, harboring CAPN10 and GPR35 genes, which undergoes non-sense mediated decay. Finally, we analyzed variants that fall into non-coding regulatory regions of yet underestimated functional significance, predicting that some of them can potentially affect gene expression and/or post-transcriptional regulation of mRNAs affecting the splicing. PMID:27347941

  11. Computational Analysis of Single Nucleotide Polymorphisms Associated with Altered Drug Responsiveness in Type 2 Diabetes

    Directory of Open Access Journals (Sweden)

    Valerio Costa

    2016-06-01

    Full Text Available Type 2 diabetes (T2D is one of the most frequent mortality causes in western countries, with rapidly increasing prevalence. Anti-diabetic drugs are the first therapeutic approach, although many patients develop drug resistance. Most drug responsiveness variability can be explained by genetic causes. Inter-individual variability is principally due to single nucleotide polymorphisms, and differential drug responsiveness has been correlated to alteration in genes involved in drug metabolism (CYP2C9 or insulin signaling (IRS1, ABCC8, KCNJ11 and PPARG. However, most genome-wide association studies did not provide clues about the contribution of DNA variations to impaired drug responsiveness. Thus, characterizing T2D drug responsiveness variants is needed to guide clinicians toward tailored therapeutic approaches. Here, we extensively investigated polymorphisms associated with altered drug response in T2D, predicting their effects in silico. Combining different computational approaches, we focused on the expression pattern of genes correlated to drug resistance and inferred evolutionary conservation of polymorphic residues, computationally predicting the biochemical properties of polymorphic proteins. Using RNA-Sequencing followed by targeted validation, we identified and experimentally confirmed that two nucleotide variations in the CAPN10 gene—currently annotated as intronic—fall within two new transcripts in this locus. Additionally, we found that a Single Nucleotide Polymorphism (SNP, currently reported as intergenic, maps to the intron of a new transcript, harboring CAPN10 and GPR35 genes, which undergoes non-sense mediated decay. Finally, we analyzed variants that fall into non-coding regulatory regions of yet underestimated functional significance, predicting that some of them can potentially affect gene expression and/or post-transcriptional regulation of mRNAs affecting the splicing.

  12. Genome-wide identification of enhancer elements.

    Science.gov (United States)

    Tulin, Sarah; Barsi, Julius C; Bocconcelli, Carlo; Smith, Joel

    2016-01-01

    We present a prospective genome-wide regulatory element database for the sea urchin embryo and the modified chromosome capture-related methodology used to create it. The method we developed is termed GRIP-seq for genome-wide regulatory element immunoprecipitation and combines features of chromosome conformation capture, chromatin immunoprecipitation, and paired-end next-generation sequencing with molecular steps that enrich for active cis-regulatory elements associated with basal transcriptional machinery. The first GRIP-seq database, available to the community, comes from S. purpuratus 24 hpf embryos and takes advantage of the extremely well-characterized cis-regulatory elements in this system for validation. In addition, using the GRIP-seq database, we identify and experimentally validate a novel, intronic cis-regulatory element at the onecut locus. We find GRIP-seq signal sensitively identifies active cis-regulatory elements with a high signal-to-noise ratio for both distal and intronic elements. This promising GRIP-seq protocol has the potential to address a rate-limiting step in resolving comprehensive, predictive network models in all systems.

  13. Genome-wide significant associations in schizophrenia to ITIH3/4, CACNA1C and SDCCAG8, and extensive replication of associations reported by the Schizophrenia PGC

    DEFF Research Database (Denmark)

    Hamshere, M L; Walters, J T R; Smith, R;

    2013-01-01

    The Schizophrenia Psychiatric Genome-Wide Association Study Consortium (PGC) highlighted 81 single-nucleotide polymorphisms (SNPs) with moderate evidence for association to schizophrenia. After follow-up in independent samples, seven loci attained genome-wide significance (GWS), but multi-locus t...... interval (CI) 78-100%) of the original set of 78 SNPs represent true associations. We also provide strong evidence for overlap in genetic risk between schizophrenia and bipolar disorder.Molecular Psychiatry advance online publication, 22 May 2012; doi:10.1038/mp.2012.67....

  14. Characterization of single-nucleotide variation in Indian-origin rhesus macaques (Macaca mulatta

    Directory of Open Access Journals (Sweden)

    Wheeler David A

    2011-06-01

    Full Text Available Abstract Background Rhesus macaques are the most widely utilized nonhuman primate model in biomedical research. Previous efforts have validated fewer than 900 single nucleotide polymorphisms (SNPs in this species, which limits opportunities for genetic studies related to health and disease. Extensive information about SNPs and other genetic variation in rhesus macaques would facilitate valuable genetic analyses, as well as provide markers for genome-wide linkage analysis and the genetic management of captive breeding colonies. Results We used the available rhesus macaque draft genome sequence, new sequence data from unrelated individuals and existing published sequence data to create a genome-wide SNP resource for Indian-origin rhesus monkeys. The original reference animal and two additional Indian-origin individuals were resequenced to low coverage using SOLiD™ sequencing. We then used three strategies to validate SNPs: comparison of potential SNPs found in the same individual using two different sequencing chemistries, and comparison of potential SNPs in different individuals identified with either the same or different sequencing chemistries. Our approach validated approximately 3 million SNPs distributed across the genome. Preliminary analysis of SNP annotations suggests that a substantial number of these macaque SNPs may have functional effects. More than 700 non-synonymous SNPs were scored by Polyphen-2 as either possibly or probably damaging to protein function and these variants now constitute potential models for studying functional genetic variation relevant to human physiology and disease. Conclusions Resequencing of a small number of animals identified greater than 3 million SNPs. This provides a significant new information resource for rhesus macaques, an important research animal. The data also suggests that overall genetic variation is high in this species. We identified many potentially damaging non-synonymous coding SNPs

  15. Genome-wide patterns of Arabidopsis gene expression in nature.

    Directory of Open Access Journals (Sweden)

    Christina L Richards

    Full Text Available Organisms in the wild are subject to multiple, fluctuating environmental factors, and it is in complex natural environments that genetic regulatory networks actually function and evolve. We assessed genome-wide gene expression patterns in the wild in two natural accessions of the model plant Arabidopsis thaliana and examined the nature of transcriptional variation throughout its life cycle and gene expression correlations with natural environmental fluctuations. We grew plants in a natural field environment and measured genome-wide time-series gene expression from the plant shoot every three days, spanning the seedling to reproductive stages. We find that 15,352 genes were expressed in the A. thaliana shoot in the field, and accession and flowering status (vegetative versus flowering were strong components of transcriptional variation in this plant. We identified between ∼110 and 190 time-varying gene expression clusters in the field, many of which were significantly overrepresented by genes regulated by abiotic and biotic environmental stresses. The two main principal components of vegetative shoot gene expression (PC(veg correlate to temperature and precipitation occurrence in the field. The largest PC(veg axes included thermoregulatory genes while the second major PC(veg was associated with precipitation and contained drought-responsive genes. By exposing A. thaliana to natural environments in an open field, we provide a framework for further understanding the genetic networks that are deployed in natural environments, and we connect plant molecular genetics in the laboratory to plant organismal ecology in the wild.

  16. Genome-wide search for gene-gene interactions in colorectal cancer.

    Directory of Open Access Journals (Sweden)

    Shuo Jiao

    Full Text Available Genome-wide association studies (GWAS have successfully identified a number of single-nucleotide polymorphisms (SNPs associated with colorectal cancer (CRC risk. However, these susceptibility loci known today explain only a small fraction of the genetic risk. Gene-gene interaction (GxG is considered to be one source of the missing heritability. To address this, we performed a genome-wide search for pair-wise GxG associated with CRC risk using 8,380 cases and 10,558 controls in the discovery phase and 2,527 cases and 2,658 controls in the replication phase. We developed a simple, but powerful method for testing interaction, which we term the Average Risk Due to Interaction (ARDI. With this method, we conducted a genome-wide search to identify SNPs showing evidence for GxG with previously identified CRC susceptibility loci from 14 independent regions. We also conducted a genome-wide search for GxG using the marginal association screening and examining interaction among SNPs that pass the screening threshold (p<10(-4. For the known locus rs10795668 (10p14, we found an interacting SNP rs367615 (5q21 with replication p = 0.01 and combined p = 4.19×10(-8. Among the top marginal SNPs after LD pruning (n = 163, we identified an interaction between rs1571218 (20p12.3 and rs10879357 (12q21.1 (nominal combined p = 2.51×10(-6; Bonferroni adjusted p = 0.03. Our study represents the first comprehensive search for GxG in CRC, and our results may provide new insight into the genetic etiology of CRC.

  17. Genome-wide Analysis of Gene Regulation

    DEFF Research Database (Denmark)

    Chen, Yun

    cells are capable of regulating their gene expression, so that each cell can only express a particular set of genes yielding limited numbers of proteins with specialized functions. Therefore a rigid control of differential gene expression is necessary for cellular diversity. On the other hand, aberrant...... gene regulation will disrupt the cell’s fundamental processes, which in turn can cause disease. Hence, understanding gene regulation is essential for deciphering the code of life. Along with the development of high throughput sequencing (HTS) technology and the subsequent large-scale data analysis......, genome-wide assays have increased our understanding of gene regulation significantly. This thesis describes the integration and analysis of HTS data across different important aspects of gene regulation. Gene expression can be regulated at different stages when the genetic information is passed from gene...

  18. A "candidate-interactome" aggregate analysis of genome-wide association data in multiple sclerosis

    DEFF Research Database (Denmark)

    Mechelli, Rosella; Umeton, Renato; Policano, Claudia;

    2013-01-01

    , may be representative of gene-environment interactions of potential, uncertain or unlikely relevance for multiple sclerosis pathogenesis: Epstein-Barr virus, human immunodeficiency virus, hepatitis B virus, hepatitis C virus, cytomegalovirus, HHV8-Kaposi sarcoma, H1N1-influenza, JC virus, human innate...... immunity interactome for type I interferon, autoimmune regulator, vitamin D receptor, aryl hydrocarbon receptor and a panel of proteins targeted by 70 innate immune-modulating viral open reading frames from 30 viral species. Interactomes were either obtained from the literature or were manually curated....... The P values of all single nucleotide polymorphism mapping to a given interactome were obtained from the last genome-wide association study of the International Multiple Sclerosis Genetics Consortium & the Wellcome Trust Case Control Consortium, 2. The interaction between genotype and Epstein Barr virus...

  19. Genome Wide Association Study (GWAS) between Attention Deficit Hyperactivity Disorder (ADHD) and Obsessive Compulsive Disorder (OCD)

    Science.gov (United States)

    Ritter, McKenzie L.; Guo, Wei; Samuels, Jack F.; Wang, Ying; Nestadt, Paul S.; Krasnow, Janice; Greenberg, Benjamin D.; Fyer, Abby J.; McCracken, James T.; Geller, Daniel A.; Murphy, Dennis L.; Knowles, James A.; Grados, Marco A.; Riddle, Mark A.; Rasmussen, Steven A.; McLaughlin, Nicole C.; Nurmi, Erika L.; Askland, Kathleen D.; Cullen, Bernadette; Piacentini, John; Pauls, David L.; Bienvenu, Joseph; Stewart, Evelyn; Goes, Fernando S.; Maher, Brion; Pulver, Ann E.; Mattheisen, Manuel; Qian, Ji; Nestadt, Gerald; Shugart, Yin Yao

    2017-01-01

    Objective: The aim of this study was to identify any potential genetic overlap between attention deficit hyperactivity disorder (ADHD) and obsessive compulsive disorder (OCD). We hypothesized that since these disorders share a sub-phenotype, they may share common risk alleles. In this manuscript, we report the overlap found between these two disorders. Methods: A meta-analysis was conducted between ADHD and OCD, and polygenic risk scores (PRS) were calculated for both disorders. In addition, a protein-protein analysis was completed in order to examine the interactions between proteins; p-values for the protein-protein interaction analysis was calculated using permutation. Conclusion: None of the single nucleotide polymorphisms (SNPs) reached genome wide significance and there was little evidence of genetic overlap between ADHD and OCD. PMID:28386217

  20. Accounting for ancestry: population substructure and genome-wide association studies.

    Science.gov (United States)

    Tian, Chao; Gregersen, Peter K; Seldin, Michael F

    2008-10-15

    Accounting for the genetic substructure of human populations has become a major practical issue for studying complex genetic disorders. Allele frequency differences among ethnic groups and subgroups and admixture between different ethnic groups can result in frequent false-positive results or reduced power in genetic studies. Here, we review the problems and progress in defining population differences and the application of statistical methods to improve association studies. It is now possible to take into account the confounding effects of population stratification using thousands of unselected genome-wide single-nucleotide polymorphisms or, alternatively, selected panels of ancestry informative markers. These methods do not require any demographic information and therefore can be widely applied to genotypes available from multiple sources. We further suggest that it will be important to explore results in homogeneous population subsets as we seek to define the extent to which genomic variation influences complex phenotypes.

  1. Genome-wide association study for ovarian cancer susceptibility using pooled DNA

    DEFF Research Database (Denmark)

    Lu, Yi; Chen, Xiaoqing; Beesley, Jonathan;

    2012-01-01

    Recent Genome-Wide Association Studies (GWAS) have identified four low-penetrance ovarian cancer susceptibility loci. We hypothesized that further moderate- or low-penetrance variants exist among the subset of single-nucleotide polymorphisms (SNPs) not well tagged by the genotyping arrays used...... in a much larger stage 2 set of 4,651 cases and 6,966 controls from the Ovarian Cancer Association Consortium. Given that most of the top 20 SNPs from pooling were validated in the same samples by individual genotyping, the lack of replication is likely to be due to the relatively small sample size in our...... stage 1 GWAS rather than due to problems with the pooling approach. We conclude that there are unlikely to be any moderate or large effects on ovarian cancer risk untagged by less dense arrays. However, our study lacked power to make clear statements on the existence of hitherto untagged small...

  2. Multi-platform genome-wide analysis of melanoma progression to brain metastasis

    Directory of Open Access Journals (Sweden)

    Diego M. Marzese

    2014-12-01

    Full Text Available Melanoma has a high tendency to metastasize to brain tissue. The understanding about the molecular alterations of early-stage melanoma progression to brain metastasis (MBM is very limited. Identifying MBM-specific genomic and epigenomic alterations is a key initial step in understanding its aggressive nature and identifying specific novel druggable targets. Here, we describe a multi-platform dataset generated with different stages of melanoma progression to MBM. This data includes genome-wide DNA methylation (Illumina HM450K BeadChip, gene expression (Affymetrix HuEx 1.0 ST array, single nucleotide polymorphisms (SNPs and copy number variation (CNV; Affymetrix SNP 6.0 array analyses of melanocyte cells (MNCs, primary melanoma tumors (PRMs, lymph node metastases (LNMs and MBMs. The analysis of this data has been reported in our recently published study (Marzese et al., 2014.

  3. Associations between single nucleotide polymorphisms in iron-related genes and iron status in multiethnic populations.

    Directory of Open Access Journals (Sweden)

    Christine E McLaren

    Full Text Available The existence of multiple inherited disorders of iron metabolism suggests genetic contributions to iron deficiency. We previously performed a genome-wide association study of iron-related single nucleotide polymorphisms (SNPs using DNA from white men aged ≥ 25 y and women ≥ 50 y in the Hemochromatosis and Iron Overload Screening (HEIRS Study with serum ferritin (SF ≤ 12 µg/L (cases and controls (SF >100 µg/L in men, SF >50 µg/L in women. We report a follow-up study of white, African-American, Hispanic, and Asian HEIRS participants, analyzed for association between SNPs and eight iron-related outcomes. Three chromosomal regions showed association across multiple populations, including SNPs in the TF and TMPRSS6 genes, and on chromosome 18q21. A novel SNP rs1421312 in TMPRSS6 was associated with serum iron in whites (p = 3.7 × 10(-6 and replicated in African Americans (p = 0.0012.Twenty SNPs in the TF gene region were associated with total iron-binding capacity in whites (p<4.4 × 10(-5; six SNPs replicated in other ethnicities (p<0.01. SNP rs10904850 in the CUBN gene on 10p13 was associated with serum iron in African Americans (P = 1.0 × 10(-5. These results confirm known associations with iron measures and give unique evidence of their role in different ethnicities, suggesting origins in a common founder.

  4. Validation of Single Nucleotide Polymorphisms Associated with Carcass Traits in a Commercial Hanwoo Population

    Science.gov (United States)

    Sudrajad, Pita; Sharma, Aditi; Dang, Chang Gwon; Kim, Jong Joo; Kim, Kwan Suk; Lee, Jun Heon; Kim, Sidong; Lee, Seung Hwan

    2016-01-01

    Four carcass traits, namely carcass weight (CW), eye muscle area (EMA), back fat thickness (BF), and marbling score (MS), are the main price decision parameters used for purchasing Hanwoo beef. The development of DNA markers for these carcass traits for use in a beef management system could result in substantial profit for beef producers in Korea. The objective of this study was to validate the association of highly significant single nucleotide polymorphisms (SNPs) identified in a previous genome-wide association study (GWAS) with the four carcass traits in a commercial Hanwoo population. We genotyped 83 SNPs distributed across all 29 autosomes in 867 steers from a Korean Hanwoo feedlot. Six SNPs, namely ARS-BFGL-NGS-22774 (Chr4, Pos:4889229), ARS-BFGL-NGS-100046 (Chr6, Pos:61917424), ARS-BFGL-NGS-39006 (Chr27, Pos:38059196), ARS-BFGL-NGS-18790 (Chr10, Pos:26489109), ARS-BFGL-NGS-43879 (Chr9, Pos:39964297), and BTB-00775794 (Chr20, Pos:20476265), were found to be associated with CW, EMA, BF, and MS. The ARS-BFGL-NGS-22774, BTB-00775794, and ARS-BFGL-NGS-39006 markers accounted for 1.80%, 1.72%, and 1.35% (p<0.01), respectively, of the phenotypic variance in the commercial Hanwoo population. Many genes located in close proximity to the significant SNPs identified in this study were previously reported to have roles in carcass traits. The results of this study could be useful for marker-assisted selection programs. PMID:26954199

  5. From Single Nucleotide Polymorphisms to Constant Immunosuppression: Mesenchymal Stem Cell Therapy for Autoimmune Diseases

    Directory of Open Access Journals (Sweden)

    Raghavan Chinnadurai

    2013-01-01

    Full Text Available The regenerative abilities and the immunosuppressive properties of mesenchymal stromal cells (MSCs make them potentially the ideal cellular product of choice for treatment of autoimmune and other immune mediated disorders. Although the usefulness of MSCs for therapeutic applications is in early phases, their potential clinical use remains of great interest. Current clinical evidence of use of MSCs from both autologous and allogeneic sources to treat autoimmune disorders confers conflicting clinical benefit outcomes. These varied results may possibly be due to MSC use across wide range of autoimmune disorders with clinical heterogeneity or due to variability of the cellular product. In the light of recent genome wide association studies (GWAS, linking predisposition of autoimmune diseases to single nucleotide polymorphisms (SNPs in the susceptible genetic loci, the clinical relevance of MSCs possessing SNPs in the critical effector molecules of immunosuppression is largely undiscussed. It is of further interest in the allogeneic setting, where SNPs in the target pathway of MSC's intervention may also modulate clinical outcome. In the present review, we have discussed the known critical SNPs predisposing to disease susceptibility in various autoimmune diseases and their significance in the immunomodulatory properties of MSCs.

  6. Assessment of Genetic Diversity in Faba Bean Based on Single Nucleotide Polymorphism

    Directory of Open Access Journals (Sweden)

    Sukhjiwan Kaur

    2014-01-01

    Full Text Available Detection of genetic diversity is important for characterisation of crop plant collections in order to detect the presence of valuable trait variation for use in breeding programs. A collection of faba bean (Vicia faba L. genotypes was evaluated for intra- and inter-population diversity using a set of 768 genome-wide distributed single nucleotide polymorphism (SNP markers, of which 657 obtained successful amplification and detected polymorphisms. Gene diversity and polymorphism information content (PIC values varied between 0.022–0.500 and 0.023–1.00, with averages of 0.363 and 0.287, respectively. The genetic structure of the germplasm collection was analysed and a neighbour-joining (NJ dendrogram was constructed. The faba bean accessions grouped into two major groups, with several additional smaller sub-groups, predominantly on the basis of geographical origin. These results were further supported by principal co-ordinate analysis (PCoA, deriving two major groupings which were differentiated on the basis of site of origin and pedigree relationships. In general, high levels of heterozygosity were observed, presumably due to the partially allogamous nature of the species. The results will facilitate targeted crossing strategies in future faba bean breeding programs in order to achieve genetic gain.

  7. A single-nucleotide polymorphism of human neuropeptide s gene originated from Europe shows decreased bioactivity.

    Directory of Open Access Journals (Sweden)

    Cheng Deng

    Full Text Available Using accumulating SNP (Single-Nucleotide Polymorphism data, we performed a genome-wide search for polypeptide hormone ligands showing changes in the mature regions to elucidate genotype/phenotype diversity among various human populations. Neuropeptide S (NPS, a brain peptide hormone highly conserved in vertebrates, has diverse physiological effects on anxiety, fear, hyperactivity, food intake, and sleeping time through its cognate receptor-NPSR. Here, we report a SNP rs4751440 (L(6-NPS causing non-synonymous substitution on the 6(th position (V to L of the NPS mature peptide region. L(6-NPS has a higher allele frequency in Europeans than other populations and probably originated from European ancestors ~25,000 yrs ago based on haplotype analysis and Approximate Bayesian Computation. Functional analyses indicate that L(6-NPS exhibits a significant lower bioactivity than the wild type NPS, with ~20-fold higher EC50 values in the stimulation of NPSR. Additional evolutionary and mutagenesis studies further demonstrate the importance of the valine residue in the 6(th position for NPS functions. Given the known physiological roles of NPS receptor in inflammatory bowel diseases, asthma pathogenesis, macrophage immune responses, and brain functions, our study provides the basis to elucidate NPS evolution and signaling diversity among human populations.

  8. High-Resolution Genome-Wide Linkage Mapping Identifies Susceptibility Loci for BMI in the Chinese Population

    DEFF Research Database (Denmark)

    Zhang, Dong Feng; Pang, Zengchang; Li, Shuxia;

    2012-01-01

    The genetic loci affecting the commonly used BMI have been intensively investigated using linkage approaches in multiple populations. This study aims at performing the first genome-wide linkage scan on BMI in the Chinese population in mainland China with hypothesis that heterogeneity in genetic...... linkage could exist in different ethnic populations. BMI was measured from 126 dizygotic twins in Qingdao municipality who were genotyped using high-resolution Affymetrix Genome-Wide Human SNP arrays containing about 1 million single-nucleotide polymorphisms (SNPs). Nonparametric linkage analysis...... was performed with Merlin software package for linkage analysis using variance components approach for quantitative trait loci mapping. We identified a strong linkage peak at the end of chromosome 7 (7q36 at 186 cM) with a lod score of 4.06 which overlaps with that reported by a large multicenter study...

  9. Genome-wide assessment of the association of rare and common copy number variations to testicular germ cell cancer

    DEFF Research Database (Denmark)

    Edsgard, Stefan Daniel; Dalgaard, Marlene Danner; Weinhold, Nils;

    2013-01-01

    Testicular germ cell cancer (TGCC) is one of the most heritable forms of cancer. Previous genome-wide association studies have focused on single nucleotide polymorphisms, largely ignoring the influence of copy number variants (CNVs). Here we present a genome-wide study of CNV on a cohort of 212...... cases and 437 controls from Denmark, which was genotyped at ∼1.8 million markers, half of which were non-polymorphic copy number markers. No association of common variants were found, whereas analysis of rare variants (present in less than 1% of the samples) initially indicated a single gene...... of rare CNVs related to cell migration (false-discovery rate = 0.021, 1.8% of cases and 1.1% of controls). Dysregulation during migration of primordial germ cells has previously been suspected to be a part of TGCC development and this set of multiple rare variants may thereby have a minor contribution...

  10. A Genome-wide Pleiotropy Scan for Prostate Cancer Risk

    Science.gov (United States)

    Panagiotou, Orestis A; Travis, Ruth C; Campa, Daniele; Berndt, Sonja I.; Lindstrom, Sara; Kraft, Peter; Schumacher, Fredrick R.; Siddiq, Afshan; Papatheodorou, Stefania I.; Stanford, Janet L.; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie J.; Diver, W. Ryan; Gapstur, Susan M.; Stevens, Victoria L.; Boeing, Heiner; Bueno-de-Mesquita, H. Bas; Gurrea, Aurelio Barricarte; Kaaks, Rudolf; Khaw, Kay-Tee; Krogh, Vittorio; Overvad, Kim; Riboli, Elio; Trichopoulos, Dimitrios; Giovannucci, Edward; Stampfer, Meir; Haiman, Christopher; Henderson, Brian; Le Marchand, Loic; Gaziano, J. Michael; Hunter, DavidJ.; Koutros, Stella; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.; Wacholder, Sholom; Key, Timothy J.; Tsilidis, Konstantinos K

    2014-01-01

    Background No single-nucleotide polymorphisms (SNPs) specific for aggressive prostate cancer have been identified in genome-wide association studies (GWAS). Objective To test if SNPs associated with other traits may also affect the risk of aggressive prostate cancer. Design, setting, and participants SNPs implicated in any phenotype other than prostate cancer (p ≤ 10−7) were identified through the catalog of published GWAS and tested in 2891 aggressive prostate cancer cases and 4592 controls from the Breast and Prostate Cancer Cohort Consortium (BPC3). The 40 most significant SNPs were followed up in 4872 aggressive prostate cancer cases and 24 534 controls from the Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) consortium. Outcome measurements and statistical analysis Odds ratios (ORs) and 95% confidence intervals (CIs) for aggressive prostate cancer were estimated. Results and limitations A total of 4666 SNPs were evaluated by the BPC3. Two signals were seen in regions already reported for prostate cancer risk. rs7014346 at 8q24.21 was marginally associated with aggressive prostate cancer in the BPC3 trial (p = 1.6 × 10-6), whereas after meta-analysis by PRACTICAL the summary OR was 1.21 (95%CI 1.16–1.27; p = 3.22 × 10−18). rs9900242 at 17q24.3 was also marginally associated with aggressive disease in the meta-analysis (OR 0.90, 95% CI 0.86–0.94; p = 2.5 × 10−6). Neither of these SNPs remained statistically significant when conditioning on correlated known prostate cancer SNPs. The meta-analysis by BPC3 and PRACTICAL identified a third promising signal, marked by rs16844874 at 2q34, independent of known prostate cancer loci (OR 1.12,95% CI 1.06–1.19; p = 4.67 × 10−5); it has been shown that SNPs correlated with this signal affect glycine concentrations. The main limitation is the heterogeneity in the definition of aggressive prostate cancer between BPC3 and PRACTICAL. Conclusions We did

  11. Risk of estrogen receptor-positive and -negative breast cancer and single-nucleotide polymorphism 2q35-rs13387042

    DEFF Research Database (Denmark)

    Milne, Roger L; Benítez, Javier; Nevanlinna, Heli

    2009-01-01

    BACKGROUND: A recent genome-wide association study identified single-nucleotide polymorphism (SNP) 2q35-rs13387042 as a marker of susceptibility to estrogen receptor (ER)-positive breast cancer. We attempted to confirm this association using the Breast Cancer Association Consortium. METHODS: 2q35......-rs13387042 SNP was genotyped for 31 510 women with invasive breast cancer, 1101 women with ductal carcinoma in situ, and 35 969 female control subjects from 25 studies. Odds ratios (ORs) were estimated by logistic regression, adjusted for study. Heterogeneity in odds ratios by each of age, ethnicity...... evidence of association between rs13387042 and breast cancer in white women of European origin (per-allele OR = 1.12, 95% confidence interval [CI] = 1.09 to 1.15; P(trend) = 1.0 x 10(-19)). The odds ratio was lower than that previously reported (P = .02) and did not vary by age or ethnicity (all P...

  12. A single nucleotide polymorphism of the TNRC9 gene associated with breast cancer risk in Chinese Han women.

    Science.gov (United States)

    Chen, F; Zhou, J; Xue, Y; Yang, S; Xiong, M; Li, Y; Liu, Q

    2014-01-01

    A single nucleotide polymorphism (SNP) in the TNRC9 gene was identified as a breast cancer susceptibility genetic variant in recent genome-wide association studies of women of European ancestry. We investigated whether TNRC9 polymorphisms are associated with risk of breast cancer in Chinese women of the Han nationality. We genotyped the SNPs rs3803662, rs1362548, rs1123428 in 870 women, including 388 breast cancer patients and 482 healthy controls, via the PCR-single strand conformation polymorphism procedure and by sequence detection. We found that the T allele and the TT genotype of the SNP rs38033662 is significantly associated with risk for breast cancer in Chinese Han women; however, no significant association was found for rs1362548 or rs1123428. We conclude that SNP rs3803662 is a putative risk factor for breast cancer in Chinese Han women.

  13. Single-Nucleotide Variations in Cardiac Arrhythmias: Prospects for Genomics and Proteomics Based Biomarker Discovery and Diagnostics

    Directory of Open Access Journals (Sweden)

    Ayman Abunimer

    2014-03-01

    Full Text Available Cardiovascular diseases are a large contributor to causes of early death in developed countries. Some of these conditions, such as sudden cardiac death and atrial fibrillation, stem from arrhythmias—a spectrum of conditions with abnormal electrical activity in the heart. Genome-wide association studies can identify single nucleotide variations (SNVs that may predispose individuals to developing acquired forms of arrhythmias. Through manual curation of published genome-wide association studies, we have collected a comprehensive list of 75 SNVs associated with cardiac arrhythmias. Ten of the SNVs result in amino acid changes and can be used in proteomic-based detection methods. In an effort to identify additional non-synonymous mutations that affect the proteome, we analyzed the post-translational modification S-nitrosylation, which is known to affect cardiac arrhythmias. We identified loss of seven known S-nitrosylation sites due to non-synonymous single nucleotide variations (nsSNVs. For predicted nitrosylation sites we found 1429 proteins where the sites are modified due to nsSNV. Analysis of the predicted S-nitrosylation dataset for over- or under-representation (compared to the complete human proteome of pathways and functional elements shows significant statistical over-representation of the blood coagulation pathway. Gene Ontology (GO analysis displays statistically over-represented terms related to muscle contraction, receptor activity, motor activity, cystoskeleton components, and microtubule activity. Through the genomic and proteomic context of SNVs and S-nitrosylation sites presented in this study, researchers can look for variation that can predispose individuals to cardiac arrhythmias. Such attempts to elucidate mechanisms of arrhythmia thereby add yet another useful parameter in predicting susceptibility for cardiac diseases.

  14. Pinched flow fractionation devices for detection of single nucleotide polymorphisms

    DEFF Research Database (Denmark)

    Larsen, Asger Vig; Poulsen, Lena; Birgens, Henrik

    2008-01-01

    We demonstrate a new and flexible micro fluidic based method for genotyping single nucleotide polymorphisms ( SNPs). The method relies on size separation of selectively hybridized polystyrene microspheres in a micro fluidic pinched flow fractionation (PFF) device. The micro fluidic PFF devices...... with 13 mu m deep channels were fabricated by thermal nanoimprint lithography ( NIL) in a thin film of cyclic-olefin copolymer (mr-I T85) on a silicon wafer substrate, and the channels were sealed by thermal polymer bonding. Streptavidin coated polystyrene microspheres with a mean diameter of 3.09 mu m...

  15. Single nucleotide polymorphism (SNP) detection on a magnetoresistive sensor

    DEFF Research Database (Denmark)

    Rizzi, Giovanni; Østerberg, Frederik Westergaard; Dufva, Martin

    2013-01-01

    We present a magnetoresistive sensor platform for hybridization assays and demonstrate its applicability on single nucleotide polymorphism (SNP) genotyping. The sensor relies on anisotropic magnetoresistance in a new geometry with a local negative reference and uses the magnetic field from...... the sensor bias current to magnetize magnetic beads in the vicinity of the sensor. The method allows for real-time measurements of the specific bead binding to the sensor surface during DNA hybridization and washing. Compared to other magnetic biosensing platforms, our approach eliminates the need...... for external electromagnets and thus allows for miniaturization of the sensor platform....

  16. Electroanalysis of single-nucleotide polymorphism by hairpin DNA architectures.

    Science.gov (United States)

    Abi, Alireza; Ferapontova, Elena E

    2013-04-01

    Genetic analysis of infectious and genetic diseases and cancer diagnostics require the development of efficient tools for fast and reliable analysis of single-nucleotide polymorphism (SNP) in targeted DNA and RNA sequences often responsible for signalling disease onset. Here, we highlight the main trends in the development of electrochemical genosensors for sensitive and selective detection of SNP that are based on hairpin DNA architectures exhibiting better SNP recognition properties compared with linear DNA probes. SNP detection by electrochemical hairpin DNA beacons is discussed, and comparative analysis of the existing SNP sensing strategies based on enzymatic and nanoparticle signal amplification schemes is presented.

  17. Impact of IL28B-Related Single Nucleotide Polymorphisms on Liver Histopathology in Chronic Hepatitis C Genotype 2 and 3

    DEFF Research Database (Denmark)

    Rembeck, Karolina; Alsiö, Asa; Christensen, Peer Brehm

    2012-01-01

    Recently, several genome-wide association studies have revealed that single nucleotide polymorphisms (SNPs) in proximity to IL28B predict spontaneous clearance of HCV infection as well as outcome following peginterferon and ribavirin therapy among HCV genotype 1 infected patients. The present study...... aimed to evaluate the impact of IL28B SNP variability on liver histology in the context of a phase III treatment trial (NORDynamIC) for treatment-naïve patients with chronic HCV genotype 2 or 3 infection, where pretreatment liver biopsies were mandatory....

  18. Genome-wide association study of insect bite hypersensitivity in two horse populations in the Netherlands

    Directory of Open Access Journals (Sweden)

    Schurink Anouk

    2012-10-01

    Full Text Available Abstract Background Insect bite hypersensitivity is a common allergic disease in horse populations worldwide. Insect bite hypersensitivity is affected by both environmental and genetic factors. However, little is known about genes contributing to the genetic variance associated with insect bite hypersensitivity. Therefore, the aim of our study was to identify and quantify genomic associations with insect bite hypersensitivity in Shetland pony mares and Icelandic horses in the Netherlands. Methods Data on 200 Shetland pony mares and 146 Icelandic horses were collected according to a matched case–control design. Cases and controls were matched on various factors (e.g. region, sire to minimize effects of population stratification. Breed-specific genome-wide association studies were performed using 70 k single nucleotide polymorphisms genotypes. Bayesian variable selection method Bayes-C with a threshold model implemented in GenSel software was applied. A 1 Mb non-overlapping window approach that accumulated contributions of adjacent single nucleotide polymorphisms was used to identify associated genomic regions. Results The percentage of variance explained by all single nucleotide polymorphisms was 13% in Shetland pony mares and 28% in Icelandic horses. The 20 non-overlapping windows explaining the largest percentages of genetic variance were found on nine chromosomes in Shetland pony mares and on 14 chromosomes in Icelandic horses. Overlap in identified associated genomic regions between breeds would suggest interesting candidate regions to follow-up on. Such regions common to both breeds (within 15 Mb were found on chromosomes 3, 7, 11, 20 and 23. Positional candidate genes within 2 Mb from the associated windows were identified on chromosome 20 in both breeds. Candidate genes are within the equine lymphocyte antigen class II region, which evokes an immune response by recognizing many foreign molecules. Conclusions The genome-wide association

  19. Genome-wide association study identifies candidate markers for bull fertility in Holstein dairy cattle.

    Science.gov (United States)

    Peñagaricano, F; Weigel, K A; Khatib, H

    2012-07-01

    The decline in the reproductive efficiency of dairy cattle has become a challenging problem worldwide. Female fertility is now taken into account in breeding goals while generally less attention is given to male fertility. The objective of this study was to perform a genome-wide association study in Holstein bulls to identify genetic variants significantly related to sire conception rate (SCR), a new phenotypic evaluation of bull fertility. The analysis included 1755 sires with SCR data and 38,650 single nucleotide polymorphisms (SNPs) spanning the entire bovine genome. Associations between SNPs and SCR were analyzed using a mixed linear model that included a random polygenic effect and SNP genotype either as a linear covariate or as a categorical variable. A multiple testing correction approach was used to account for the correlation between SNPs because of linkage disequilibrium. After genome-wide correction, eight SNPs showed significant association with SCR. Some of these SNPs are located close to or in the middle of genes with functions related to male fertility, such as the sperm acrosome reaction, chromatin remodeling during the spermatogenesis, and the meiotic process during male germ cell maturation. Some SNPs showed marked dominance effects, which provide more evidence for the relevance of non-additive effects in traits closely related to fitness such as fertility. The results could contribute to the identification of genes and pathways associated with male fertility in dairy cattle.

  20. Genome-wide scans using archived neonatal dried blood spot samples

    Directory of Open Access Journals (Sweden)

    Wiuf Carsten

    2009-07-01

    Full Text Available Abstract Background Identification of disease susceptible genes requires access to DNA from numerous well-characterised subjects. Archived residual dried blood spot samples from national newborn screening programs may provide DNA from entire populations and medical registries the corresponding clinical information. The amount of DNA available in these samples is however rarely sufficient for reliable genome-wide scans, and whole-genome amplification may thus be necessary. This study assess the quality of DNA obtained from different amplification protocols by evaluating fidelity and robustness of the genotyping of 610,000 single nucleotide polymorphisms, using the Illumina Infinium HD Human610-Quad BeadChip. Whole-genome amplified DNA from 24 neonatal dried blood spot samples stored between 15 to 25 years was tested, and high-quality genomic DNA from 8 of the same individuals was used as reference. Results Using 3.2 mm disks from dried blood spot samples the optimal DNA-extraction and amplification protocol resulted in call-rates between 99.15% – 99.73% (mean 99.56%, N = 16, and conflicts with reference DNA in only three per 10,000 genotype calls. Conclusion Whole-genome amplified DNA from archived neonatal dried blood spot samples can be used for reliable genome-wide scans and is a cost-efficient alternative to collecting new samples.

  1. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    Energy Technology Data Exchange (ETDEWEB)

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMahon, Katherine D.; Malmstrom, Rex R.

    2014-06-18

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.

  2. Genome-wide association study of insect bite hypersensitivity in Dutch Shetland pony mares.

    Science.gov (United States)

    Schurink, A; Ducro, B J; Bastiaansen, J W M; Frankena, K; van Arendonk, J A M

    2013-02-01

    Insect bite hypersensitivity (IBH) is the most common allergic disease present in horses worldwide. It has been shown that IBH is under genetic control, but the knowledge of associated genes is limited. We conducted a genome-wide association study to identify and quantify genomic regions contributing to IBH in the Dutch Shetland pony population. A total of 97 cases and 91 controls were selected and matched on withers height, coat colour and pedigree to minimise the population stratification. A blood sample was collected from participating Shetland pony mares, their IBH phenotype was scored and the owner filled in a questionnaire. A total of 40 021 single-nucleotide polymorphisms (SNPs) were fitted in a univariable logistic model fitting an additive effect. Analysis revealed no effects of population stratification. Significant associations with IBH were detected for 24 SNPs on 12 chromosomes [-log(10) (P-value) > 2.5]. Odds ratios of allele substitution effects of the unfavourable allele were between 1.94 and 5.95. The most significant SNP was found on chromosome 27, with an odds ratio of 2.31 and with an allele frequency of the unfavourable allele of 0.72 in cases and 0.53 in controls. Genome-wide association studies on additional horse populations are desired to validate the identified associations, to identify the genes involved in IBH and to develop genomic tools to decrease IBH prevalence.

  3. Genome-wide association study identifies 74 loci associated with educational attainment

    Science.gov (United States)

    Okbay, Aysu; Beauchamp, Jonathan P.; Fontana, Mark A.; Lee, James J.; Pers, Tune H.; Rietveld, Cornelius A.; Turley, Patrick; Chen, Guo-Bo; Emilsson, Valur; Meddens, S. Fleur W.; Oskarsson, Sven; Pickrell, Joseph K.; Thom, Kevin; Timshel, Pascal; de Vlaming, Ronald; Abdellaoui, Abdel; Ahluwalia, Tarunveer S.; Bacelis, Jonas; Baumbach, Clemens; Bjornsdottir, Gyda; Brandsma, Johannes H.; Concas, Maria Pina; Derringer, Jaime; Furlotte, Nicholas A.; Galesloot, Tessel E.; Girotto, Giorgia; Gupta, Richa; Hall, Leanne M.; Harris, Sarah E.; Hofer, Edith; Horikoshi, Momoko; Huffman, Jennifer E.; Kaasik, Kadri; Kalafati, Ioanna P.; Karlsson, Robert; Kong, Augustine; Lahti, Jari; van der Lee, Sven J.; de Leeuw, Christiaan; Lind, Penelope A.; Lindgren, Karl-Oskar; Liu, Tian; Mangino, Massimo; Marten, Jonathan; Mihailov, Evelin; Miller, Michael B.; van der Most, Peter J.; Oldmeadow, Christopher; Payton, Antony; Pervjakova, Natalia; Peyrot, Wouter J.; Qian, Yong; Raitakari, Olli; Rueedi, Rico; Salvi, Erika; Schmidt, Börge; Schraut, Katharina E.; Shi, Jianxin; Smith, Albert V.; Poot, Raymond A.; Pourcain, Beate; Teumer, Alexander; Thorleifsson, Gudmar; Verweij, Niek; Vuckovic, Dragana; Wellmann, Juergen; Westra, Harm-Jan; Yang, Jingyun; Zhao, Wei; Zhu, Zhihong; Alizadeh, Behrooz Z.; Amin, Najaf; Bakshi, Andrew; Baumeister, Sebastian E.; Biino, Ginevra; Bønnelykke, Klaus; Boyle, Patricia A.; Campbell, Harry; Cappuccio, Francesco P.; Davies, Gail; De Neve, Jan-Emmanuel; Deloukas, Panos; Demuth, Ilja; Ding, Jun; Eibich, Peter; Eisele, Lewin; Eklund, Niina; Evans68, David M.; Faul, Jessica D.; Feitosa, Mary F.; Forstner, Andreas J.; Gandin, Ilaria; Gunnarsson, Bjarni; Halldórsson, Bjarni V.; Harris, Tamara B.; Heath, Andrew C.; Hocking, Lynne J.; Holliday, Elizabeth G.; Homuth, Georg; Horan, Michael A.; Hottenga, Jouke-Jan; de Jager, Philip L.; Joshi, Peter K.; Jugessur, Astanand; Kaakinen, Marika A.; Kähönen, Mika; Kanoni, Stavroula; Keltigangas-Järvinen, Liisa; Kiemeney, Lambertus A.L.M.; Kolcic, Ivana; Koskinen, Seppo; Kraja, Aldi T.; Kroh, Martin; Kutalik, Zoltan; Latvala, Antti; Launer, Lenore J.; Lebreton, Maël P.; Levinson, Douglas F.; Lichtenstein, Paul; Lichtner, Peter; Liewald, David C.M.; Loukola, Anu; Madden, Pamela A.; Mägi, Reedik; Mäki-Opas, Tomi; Marioni, Riccardo E.; Marques-Vidal, Pedro; Meddens, Gerardus A.; McMahon, George; Meisinger, Christa; Meitinger, Thomas; Milaneschi, Yusplitri; Milani, Lili; Montgomery, Grant W.; Myhre, Ronny; Nelson, Christopher P.; Nyholt, Dale R.; Ollier, William E.R.; Palotie, Aarno; Paternoster, Lavinia; Pedersen, Nancy L.; Petrovic, Katja E.; Porteous, David J.; Räikkönen, Katri; Ring, Susan M.; Robino, Antonietta; Rostapshova, Olga; Rudan, Igor; Rustichini, Aldo; Salomaa, Veikko; Sanders, Alan R.; Sarin, Antti-Pekka; Schmidt, Helena; Scott, Rodney J.; Smith, Blair H.; Smith, Jennifer A.; Staessen, Jan A.; Steinhagen-Thiessen, Elisabeth; Strauch, Konstantin; Terracciano, Antonio; Tobin, Martin D.; Ulivi, Sheila; Vaccargiu, Simona; Quaye, Lydia; van Rooij, Frank J.A.; Venturini, Cristina; Vinkhuyzen, Anna A.E.; Völker, Uwe; Völzke, Henry; Vonk, Judith M.; Vozzi, Diego; Waage, Johannes; Ware, Erin B.; Willemsen, Gonneke; Attia, John R.; Bennett, David A.; Berger, Klaus; Bertram, Lars; Bisgaard, Hans; Boomsma, Dorret I.; Borecki, Ingrid B.; Bultmann, Ute; Chabris, Christopher F.; Cucca, Francesco; Cusi, Daniele; Deary, Ian J.; Dedoussis, George V.; van Duijn, Cornelia M.; Eriksson, Johan G.; Franke, Barbara; Franke, Lude; Gasparini, Paolo; Gejman, Pablo V.; Gieger, Christian; Grabe, Hans-Jörgen; Gratten, Jacob; Groenen, Patrick J.F.; Gudnason, Vilmundur; van der Harst, Pim; Hayward, Caroline; Hinds, David A.; Hoffmann, Wolfgang; Hyppönen, Elina; Iacono, William G.; Jacobsson, Bo; Järvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Kaprio, Jaakko; Kardia, Sharon L.R.; Lehtimäki, Terho; Lehrer, Steven F.; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; Metspalu, Andres; Pendleton, Neil; Penninx, Brenda W.J.H.; Perola, Markus; Pirastu, Nicola; Pirastu, Mario; Polasek, Ozren; Posthuma, Danielle; Power, Christine; Province, Michael A.; Samani, Nilesh J.; Schlessinger, David; Schmidt, Reinhold; Sørensen, Thorkild I.A.; Spector, Tim D.; Stefansson, Kari; Thorsteinsdottir, Unnur; Thurik, A. Roy; Timpson, Nicholas J.; Tiemeier, Henning; Tung, Joyce Y.; Uitterlinden, André G.; Vitart, Veronique; Vollenweider, Peter; Weir, David R.; Wilson, James F.; Wright, Alan F.; Conley, Dalton C.; Krueger, Robert F.; Smith, George Davey; Hofman, Albert; Laibson, David I.; Medland, Sarah E.; Meyer, Michelle N.; Yang, Jian; Johannesson, Magnus; Visscher, Peter M.; Esko, Tõnu; Koellinger, Philipp D.; Cesarini, David; Benjamin, Daniel J.

    2016-01-01

    Summary Educational attainment (EA) is strongly influenced by social and other environmental factors, but genetic factors are also estimated to account for at least 20% of the variation across individuals1. We report the results of a genome-wide association study (GWAS) for EA that extends our earlier discovery sample1,2 of 101,069 individuals to 293,723 individuals, and a replication in an independent sample of 111,349 individuals from the UK Biobank. We now identify 74 genome-wide significant loci associated with number of years of schooling completed. Single-nucleotide polymorphisms (SNPs) associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioral phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because EA is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric disease. PMID:27225129

  4. Genome-wide association study of the backfat thickness trait in two pig populations

    Directory of Open Access Journals (Sweden)

    Dandan ZHU,Xiaolei LIU,Rothschild MAX,Zhiwu ZHANG,Shuhong ZHAO,Bin FAN

    2014-06-01

    Full Text Available Backfat thickness is a good predictor of carcass lean content, an economically important trait, and a main breeding target in pig improvement. In this study, the candidate genes and genomic regions associated with the tenth rib backfat thickness trait were identified in two independent pig populations, using a genome-wide association study of porcine 60K SNP genotype data applying the compressed mixed linear model (CMLM statistical method. For each population, 30 most significant single-nucleotide polymorphisms (SNPs were selected and SNP annotation implemented using Sus scrofa Build 10.2. In the first population, 25 significant SNPs were distributed on seven chromosomes, and SNPs on SSC1 and SSC7 showed great significance for fat deposition. The most significant SNP (ALGA0006623 was located on SSC1, upstream of the MC4R gene. In the second population, 27 significant SNPs were recognized by annotation, and 12 SNPs on SSC12 were related to fat deposition. Two haplotype blocks, M1GA0016251-MARC0075799 and ALGA0065251-MARC0014203-M1GA0016298-ALGA0065308, were detected in significant regions where the PIPNC1 and GH1 genes were identified as contributing to fat metabolism. The results indicated that genetic mechanism regulating backfat thickness is complex, and that genome-wide associations can be affected by populations with different genetic backgrounds.

  5. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    Energy Technology Data Exchange (ETDEWEB)

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMcahon, Katherine D.; Mamlstrom, Rex R.

    2014-05-12

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ecotype model? of diversification, but not previously observed in natural populations.

  6. A genome-wide association study of total bilirubin and cholelithiasis risk in sickle cell anemia.

    Directory of Open Access Journals (Sweden)

    Jacqueline N Milton

    Full Text Available Serum bilirubin levels have been associated with polymorphisms in the UGT1A1 promoter in normal populations and in patients with hemolytic anemias, including sickle cell anemia. When hemolysis occurs circulating heme increases, leading to elevated bilirubin levels and an increased incidence of cholelithiasis. We performed the first genome-wide association study (GWAS of bilirubin levels and cholelithiasis risk in a discovery cohort of 1,117 sickle cell anemia patients. We found 15 single nucleotide polymorphisms (SNPs associated with total bilirubin levels at the genome-wide significance level (p value <5 × 10(-8. SNPs in UGT1A1, UGT1A3, UGT1A6, UGT1A8 and UGT1A10, different isoforms within the UGT1A locus, were identified (most significant rs887829, p = 9.08 × 10(-25. All of these associations were validated in 4 independent sets of sickle cell anemia patients. We tested the association of the 15 SNPs with cholelithiasis in the discovery cohort and found a significant association (most significant p value 1.15 × 10(-4. These results confirm that the UGT1A region is the major regulator of bilirubin metabolism in African Americans with sickle cell anemia, similar to what is observed in other ethnicities.

  7. Genome-wide association study of a quantitative disordered gambling trait.

    Science.gov (United States)

    Lind, Penelope A; Zhu, Gu; Montgomery, Grant W; Madden, Pamela A F; Heath, Andrew C; Martin, Nicholas G; Slutske, Wendy S

    2013-05-01

    Disordered gambling is a moderately heritable trait, but the underlying genetic basis is largely unknown. We performed a genome-wide association study (GWAS) for disordered gambling using a quantitative factor score in 1312 twins from 894 Australian families. Association was conducted for 2 381 914 single-nucleotide polymorphisms (SNPs) using the family-based association test in Merlin followed by gene and pathway enrichment analyses. Although no SNP reached genome-wide significance, six achieved P-values Secondary case-control analyses found two SNPs on chromosome 9 (rs1106076 and rs12305135 near VLDLR) and rs10812227 near FZD10 on chromosome 12 to be significantly associated with lifetime Diagnostic and Statistical Manual of Mental Disorders, fourth edition pathological gambling and South Oaks Gambling Screen classified probable pathological gambling status. Furthermore, several addiction-related pathways were enriched for SNPs associated with disordered gambling. Finally, gene-based analysis of 24 candidate genes for dopamine agonist-induced gambling in individuals with Parkinson's disease suggested an enrichment of SNPs associated with disordered gambling. We report the first GWAS of disordered gambling. While further replication is required, the identification of susceptibility loci and biological pathways will be important in characterizing the biological mechanisms that underpin disordered gambling.

  8. Multi-ethnic genome-wide association study identifies novel locus for type 2 diabetes susceptibility

    Science.gov (United States)

    Cook, James P; Morris, Andrew P

    2016-01-01

    Genome-wide association studies (GWAS) have traditionally been undertaken in homogeneous populations from the same ancestry group. However, with the increasing availability of GWAS in large-scale multi-ethnic cohorts, we have evaluated a framework for detecting association of genetic variants with complex traits, allowing for population structure, and developed a powerful test of heterogeneity in allelic effects between ancestry groups. We have applied the methodology to identify and characterise loci associated with susceptibility to type 2 diabetes (T2D) using GWAS data from the Resource for Genetic Epidemiology on Adult Health and Aging, a large multi-ethnic population-based cohort, created for investigating the genetic and environmental basis of age-related diseases. We identified a novel locus for T2D susceptibility at genome-wide significance (P<5 × 10−8) that maps to TOMM40-APOE, a region previously implicated in lipid metabolism and Alzheimer's disease. We have also confirmed previous reports that single-nucleotide polymorphisms at the TCF7L2 locus demonstrate the greatest extent of heterogeneity in allelic effects between ethnic groups, with the lowest risk observed in populations of East Asian ancestry. PMID:27189021

  9. CONAN: copy number variation analysis software for genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Wichmann Heinz-Erich

    2010-06-01

    Full Text Available Abstract Background Genome-wide association studies (GWAS based on single nucleotide polymorphisms (SNPs revolutionized our perception of the genetic regulation of complex traits and diseases. Copy number variations (CNVs promise to shed additional light on the genetic basis of monogenic as well as complex diseases and phenotypes. Indeed, the number of detected associations between CNVs and certain phenotypes are constantly increasing. However, while several software packages support the determination of CNVs from SNP chip data, the downstream statistical inference of CNV-phenotype associations is still subject to complicated and inefficient in-house solutions, thus strongly limiting the performance of GWAS based on CNVs. Results CONAN is a freely available client-server software solution which provides an intuitive graphical user interface for categorizing, analyzing and associating CNVs with phenotypes. Moreover, CONAN assists the evaluation process by visualizing detected associations via Manhattan plots in order to enable a rapid identification of genome-wide significant CNV regions. Various file formats including the information on CNVs in population samples are supported as input data. Conclusions CONAN facilitates the performance of GWAS based on CNVs and the visual analysis of calculated results. CONAN provides a rapid, valid and straightforward software solution to identify genetic variation underlying the 'missing' heritability for complex traits that remains unexplained by recent GWAS. The freely available software can be downloaded at http://genepi-conan.i-med.ac.at.

  10. Genome-wide association study of drought-related resistance traits in Aegilops tauschii

    Science.gov (United States)

    Qin, Peng; Lin, Yu; Hu, Yaodong; Liu, Kun; Mao, Shuangshuang; Li, Zhanyi; Wang, Jirui; Liu, Yaxi; Wei, Yuming; Zheng, Youliang

    2016-01-01

    Abstract The D-genome progenitor of wheat (Triticum aestivum), Aegilops tauschii, possesses numerous genes for resistance to abiotic stresses, including drought. Therefore, information on the genetic architecture of A. tauschii can aid the development of drought-resistant wheat varieties. Here, we evaluated 13 traits in 373 A. tauschii accessions grown under normal and polyethylene glycol-simulated drought stress conditions and performed a genome-wide association study using 7,185 single nucleotide polymorphism (SNP) markers. We identified 208 and 28 SNPs associated with all traits using the general linear model and mixed linear model, respectively, while both models detected 25 significant SNPs with genome-wide distribution. Public database searches revealed several candidate/flanking genes related to drought resistance that were grouped into three categories according to the type of encoded protein (enzyme, storage protein, and drought-induced protein). This study provided essential information for SNPs and genes related to drought resistance in A. tauschii and wheat, and represents a foundation for breeding drought-resistant wheat cultivars using marker-assisted selection. PMID:27560650

  11. Detecting Single-Nucleotide Substitutions Induced by Genome Editing.

    Science.gov (United States)

    Miyaoka, Yuichiro; Chan, Amanda H; Conklin, Bruce R

    2016-08-01

    The detection of genome editing is critical in evaluating genome-editing tools or conditions, but it is not an easy task to detect genome-editing events-especially single-nucleotide substitutions-without a surrogate marker. Here we introduce a procedure that significantly contributes to the advancement of genome-editing technologies. It uses droplet digital polymerase chain reaction (ddPCR) and allele-specific hydrolysis probes to detect single-nucleotide substitutions generated by genome editing (via homology-directed repair, or HDR). HDR events that introduce substitutions using donor DNA are generally infrequent, even with genome-editing tools, and the outcome is only one base pair difference in 3 billion base pairs of the human genome. This task is particularly difficult in induced pluripotent stem (iPS) cells, in which editing events can be very rare. Therefore, the technological advances described here have implications for therapeutic genome editing and experimental approaches to disease modeling with iPS cells.

  12. Genome wide selection in Citrus breeding.

    Science.gov (United States)

    Gois, I B; Borém, A; Cristofani-Yaly, M; de Resende, M D V; Azevedo, C F; Bastianel, M; Novelli, V M; Machado, M A

    2016-10-17

    Genome wide selection (GWS) is essential for the genetic improvement of perennial species such as Citrus because of its ability to increase gain per unit time and to enable the efficient selection of characteristics with low heritability. This study assessed GWS efficiency in a population of Citrus and compared it with selection based on phenotypic data. A total of 180 individual trees from a cross between Pera sweet orange (Citrus sinensis Osbeck) and Murcott tangor (Citrus sinensis Osbeck x Citrus reticulata Blanco) were evaluated for 10 characteristics related to fruit quality. The hybrids were genotyped using 5287 DArT_seq(TM) (diversity arrays technology) molecular markers and their effects on phenotypes were predicted using the random regression - best linear unbiased predictor (rr-BLUP) method. The predictive ability, prediction bias, and accuracy of GWS were estimated to verify its effectiveness for phenotype prediction. The proportion of genetic variance explained by the markers was also computed. The heritability of the traits, as determined by markers, was 16-28%. The predictive ability of these markers ranged from 0.53 to 0.64, and the regression coefficients between predicted and observed phenotypes were close to unity. Over 35% of the genetic variance was accounted for by the markers. Accuracy estimates with GWS were lower than those obtained by phenotypic analysis; however, GWS was superior in terms of genetic gain per unit time. Thus, GWS may be useful for Citrus breeding as it can predict phenotypes early and accurately, and reduce the length of the selection cycle. This study demonstrates the feasibility of genomic selection in Citrus.

  13. Mosaic maternal uniparental disomy of chromosome 15 in Prader-Willi syndrome: utility of genome-wide SNP array.

    Science.gov (United States)

    Izumi, Kosuke; Santani, Avni B; Deardorff, Matthew A; Feret, Holly A; Tischler, Tanya; Thiel, Brian D; Mulchandani, Surabhi; Stolle, Catherine A; Spinner, Nancy B; Zackai, Elaine H; Conlin, Laura K

    2013-01-01

    Prader-Willi syndrome is caused by the loss of paternal gene expression on 15q11.2-q13.2, and one of the mechanisms resulting in Prader-Willi syndrome phenotype is maternal uniparental disomy of chromosome 15. Various mechanisms including trisomy rescue, monosomy rescue, and post fertilization errors can lead to uniparental disomy, and its mechanism can be inferred from the pattern of uniparental hetero and isodisomy. Detection of a mosaic cell line provides a unique opportunity to understand the mechanism of uniparental disomy; however, mosaic uniparental disomy is a rare finding in patients with Prader-Willi syndrome. We report on two infants with Prader-Willi syndrome caused by mosaic maternal uniparental disomy 15. Patient 1 has mosaic uniparental isodisomy of the entire chromosome 15, and Patient 2 has mosaic uniparental mixed iso/heterodisomy 15. Genome-wide single-nucleotide polymorphism array was able to demonstrate the presence of chromosomally normal cell line in the Patient 1 and trisomic cell line in Patient 2, and provide the evidence that post-fertilization error and trisomy rescue as a mechanism of uniparental disomy in each case, respectively. Given its ability of detecting small percent mosaicism as well as its capability of identifying the loss of heterozygosity of chromosomal regions, genome-wide single-nucleotide polymorphism array should be utilized as an adjunct to the standard methylation analysis in the evaluation of Prader-Willi syndrome.

  14. Copy number variants and common disorders: filling the gaps and exploring complexity in genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Xavier Estivill

    2007-10-01

    Full Text Available Genome-wide association scans (GWASs using single nucleotide polymorphisms (SNPs have been completed successfully for several common disorders and have detected over 30 new associations. Considering the large sample sizes and genome-wide SNP coverage of the scans, one might have expected many of the common variants underpinning the genetic component of various disorders to have been identified by now. However, these studies have not evaluated the contribution of other forms of genetic variation, such as structural variation, mainly in the form of copy number variants (CNVs. Known CNVs account for over 15% of the assembled human genome sequence. Since CNVs are not easily tagged by SNPs, might have a wide range of copy number variability, and often fall in genomic regions not well covered by whole-genome arrays or not genotyped by the HapMap project, current GWASs have largely missed the contribution of CNVs to complex disorders. In fact, some CNVs have already been reported to show association with several complex disorders using candidate gene/region approaches, underpinning the importance of regions not investigated in current GWASs. This reveals the need for new generation arrays (some already in the market and the use of tailored approaches to explore the full dimension of genome variability beyond the single nucleotide scale.

  15. Genome-wide significant association between alcohol dependence and a variant in the ADH gene cluster.

    Science.gov (United States)

    Frank, Josef; Cichon, Sven; Treutlein, Jens; Ridinger, Monika; Mattheisen, Manuel; Hoffmann, Per; Herms, Stefan; Wodarz, Norbert; Soyka, Michael; Zill, Peter; Maier, Wolfgang; Mössner, Rainald; Gaebel, Wolfgang; Dahmen, Norbert; Scherbaum, Norbert; Schmäl, Christine; Steffens, Michael; Lucae, Susanne; Ising, Marcus; Müller-Myhsok, Bertram; Nöthen, Markus M; Mann, Karl; Kiefer, Falk; Rietschel, Marcella

    2012-01-01

    Alcohol dependence (AD) is an important contributory factor to the global burden of disease. The etiology of AD involves both environmental and genetic factors, and the disorder has a heritability of around 50%. The aim of the present study was to identify susceptibility genes for AD by performing a genome-wide association study (GWAS). The sample comprised 1333 male in-patients with severe AD according to the Diagnostic and Statistical Manual of Mental Disorders, 4th edition, and 2168 controls. These included 487 patients and 1358 controls from a previous GWAS study by our group. All individuals were of German descent. Single-marker tests and a polygenic score-based analysis to assess the combined contribution of multiple markers with small effects were performed. The single nucleotide polymorphism (SNP) rs1789891, which is located between the ADH1B and ADH1C genes, achieved genome-wide significance [P = 1.27E-8, odds ratio (OR) = 1.46]. Other markers from this region were also associated with AD, and conditional analyses indicated that these made a partially independent contribution. The SNP rs1789891 is in complete linkage disequilibrium with the functional Arg272Gln variant (P = 1.24E-7, OR = 1.31) of the ADH1C gene, which has been reported to modify the rate of ethanol oxidation to acetaldehyde in vitro. A polygenic score-based approach produced a significant result (P = 9.66E-9). This is the first GWAS of AD to provide genome-wide significant support for the role of the ADH gene cluster and to suggest a polygenic component to the etiology of AD. The latter result may indicate that many more AD susceptibility genes still await identification.

  16. Utilizing twins as controls for non-twin case-materials in genome wide association studies.

    Directory of Open Access Journals (Sweden)

    Andrea Ganna

    Full Text Available Twin registries around the globe have collected DNA samples from large numbers of monozygotic and dizygotic twins. The twin sample collections are frequently used as controls in disease-specific studies together with non-twins. This approach is unbiased under the hypothesis that twins and singletons are comparable in terms of allele frequencies; i.e. there are no genetic variants associated with being a twin per se. To test this hypothesis we performed a genome-wide association study comparing the allele frequency of 572,352 single nucleotide polymorphisms (SNPs in 1,413 monozygotic (MZ and 5,451 dizygotic (DZ twins with 3,720 healthy singletons. Twins and singletons have been genotyped using the same platform. SNPs showing association with being a twin at P-value < 1 × 10(-5 were selected for replication analysis in 1,492 twins (463 MZ and 1,029 DZ and 1,880 singletons from Finland. No SNPs reached genome-wide significance (P-value < 5 × 10(-8 in the main analysis combining MZ and DZ twins. In a secondary analysis including only DZ twins two SNPs (rs2033541 close to ADAMTSL1 and rs4149283 close to ABCA1 were genome-wide significant after meta-analysis with the Finnish population. The estimated proportion of variance on the liability scale explained by all SNPs was 0.08 (P-value=0.003 when MZ and DZ were considered together and smaller for MZ (0.06, P-value=0.10 compared to DZ (0.09, P-value=0.003 when analyzed separately. In conclusion, twins and singletons can be used in genetic studies together with general population samples without introducing large bias. Further research is needed to explore genetic variances associated with DZ twinning.

  17. Meta-analysis of genome-wide association studies of anxiety disorders

    Science.gov (United States)

    Otowa, Takeshi; Hek, Karin; Lee, Minyoung; Byrne, Enda M.; Mirza, Saira S.; Nivard, Michel G.; Bigdeli, Timothy; Aggen, Steven H.; Adkins, Daniel; Wolen, Aaron; Fanous, Ayman; Keller, Matthew C.; Castelao, Enrique; Kutalik, Zoltan; Van der Auwera, Sandra; Homuth, Georg; Nauck, Matthias; Teumer, Alexander; Milaneschi, Yuri; Hottenga, Jouke-Jan; Direk, Nese; Hofman, Albert; Uitterlinden, Andre; Mulder, Cornelis L.; Henders, Anjali K.; Medland, Sarah E.; Gordon, Scott; Heath, Andrew C.; Madden, Pamela A.F.; Pergadia, Michelle; van der Most, Peter J.; Nolte, Ilja M.; van Oort, Floor V.A.; Hartman, Catharina A.; Oldehinkel, Albertine J.; Preisig, Martin; Grabe, Hans Jörgen; Middeldorp, Christel M.; Penninx, Brenda WJH; Boomsma, Dorret; Martin, Nicholas G.; Montgomery, Grant; Maher, Brion S.; van den Oord, Edwin J.; Wray, Naomi R.; Tiemeier, Henning; Hettema, John M.

    2015-01-01

    Anxiety disorders, namely generalized anxiety disorder, panic disorder, and phobias, are common, etiologically complex conditions with a partially genetic basis. Despite differing on diagnostic definitions based upon clinical presentation, anxiety disorders likely represent various expressions of an underlying common diathesis of abnormal regulation of basic threat-response systems. We conducted genome-wide association analyses in nine samples of European ancestry from seven large, independent studies. To identify genetic variants contributing to genetic susceptibility shared across interview-generated DSM-based anxiety disorders, we applied two phenotypic approaches: (1) comparisons between categorical anxiety disorder cases and super-normal controls, and (2) quantitative phenotypic factor scores derived from a multivariate analysis combining information across the clinical phenotypes. We used logistic and linear regression, respectively, to analyze the association between these phenotypes and genome-wide single nucleotide polymorphisms. Meta-analysis for each phenotype combined results across the nine samples for over 18 000 unrelated individuals. Each meta-analysis identified a different genome-wide significant region, with the following markers showing the strongest association: for case-control contrasts, rs1709393 located in an uncharacterized non-coding RNA locus on chromosomal band 3q12.3 (P=1.65×10−8); for factor scores, rs1067327 within CAMKMT encoding the calmodulin-lysine N-methyltransferase on chromosomal band 2p21 (P=2.86×10−9). Independent replication and further exploration of these findings are needed to more fully understand the role of these variants in risk and expression of anxiety disorders. PMID:26754954

  18. Effect of Genome-Wide Genotyping and Reference Panels on Rare Variants Imputation

    Institute of Scientific and Technical Information of China (English)

    Hou-Feng Zheng; Martin Ladouceur; Celia M.T. Greenwood; J.Brent Richards

    2012-01-01

    Common variants explain little of the variance of most common disease,prompting large-scale sequencing studies to understand the contribution of rare variants to these diseases.Imputation of rare variants from genome-wide genotypic arrays offers a cost-efficient strategy to achieve necessary sample sizes required for adequate statistical power.To estimate the performance of imputation of rare variants,we imputed 153 individuals,each of whom was genotyped on 3 different genotype arrays including 317k,610k and 1 million single nucleotide polymorphisms (SNPs),to two different reference panels:HapMap2 and 1000 Genomes pilot March 2010 release (1KGpilot) by using IMPUTE version 2.We found that more than 94% and 84% of all SNPs yield acceptable accuracy (info > 0.4) in HapMap2 and 1KGpilot-based imputation,respectively.For rare variants (minor allele frequency (MAF) ≤5%),the proportion of well-imputed SNPs increased as the MAF increased from 0.3% to 5% across all 3 genome-wide association study (GWAS) datasets.The proportion of well-imputed SNPs was 69%,60% and 49% for SNPs with a MAF from 0.3% to 5% for 1M,610k and 317k,respectively.None of the very rare variants (MAF ≤ 0.3%) were well imputed.We conclude that the imputation accuracy of rare variants increases with higher density of genome-wide genotyping arrays when the size of the reference panel is small.Variants with lower MAF are more difficult to impute.These findings have important implications in the design and replication of large-scale sequencing studies.

  19. Genome-wide association study in German patients with attention deficit/hyperactivity disorder.

    Science.gov (United States)

    Hinney, Anke; Scherag, André; Jarick, Ivonne; Albayrak, Özgür; Pütter, Carolin; Pechlivanis, Sonali; Dauvermann, Maria R; Beck, Sebastian; Weber, Heike; Scherag, Susann; Nguyen, Trang T; Volckmar, Anna-Lena; Knoll, Nadja; Faraone, Stephen V; Neale, Benjamin M; Franke, Barbara; Cichon, Sven; Hoffmann, Per; Nöthen, Markus M; Schreiber, Stefan; Jöckel, Karl-Heinz; Wichmann, H-Erich; Freitag, Christine; Lempp, Thomas; Meyer, Jobst; Gilsbach, Susanne; Herpertz-Dahlmann, Beate; Sinzig, Judith; Lehmkuhl, Gerd; Renner, Tobias J; Warnke, Andreas; Romanos, Marcel; Lesch, Klaus-Peter; Reif, Andreas; Schimmelmann, Benno G; Hebebrand, Johannes

    2011-12-01

    The heritability of attention deficit hyperactivity disorder (ADHD) is approximately 0.8. Despite several larger scale attempts, genome-wide association studies (GWAS) have not led to the identification of significant results. We performed a GWAS based on 495 German young patients with ADHD (according to DSM-IV criteria; Human660W-Quadv1; Illumina, San Diego, CA) and on 1,300 population-based adult controls (HumanHap550v3; Illumina). Some genes neighboring the single nucleotide polymorphisms (SNPs) with the lowest P-values (best P-value: 8.38 × 10(-7)) have potential relevance for ADHD (e.g., glutamate receptor, metabotropic 5 gene, GRM5). After quality control, the 30 independent SNPs with the lowest P-values (P-values ≤ 7.57 × 10(-5) ) were chosen for confirmation. Genotyping of these SNPs in up to 320 independent German families comprising at least one child with ADHD revealed directionally consistent effect-size point estimates for 19 (10 not consistent) of the SNPs. In silico analyses of the 30 SNPs in the largest meta-analysis so far (2,064 trios, 896 cases, and 2,455 controls) revealed directionally consistent effect-size point estimates for 16 SNPs (11 not consistent). None of the combined analyses revealed a genome-wide significant result. SNPs in previously described autosomal candidate genes did not show significantly lower P-values compared to SNPs within random sets of genes of the same size. We did not find genome-wide significant results in a GWAS of German children with ADHD compared to controls. The second best SNP is located in an intron of GRM5, a gene located within a recently described region with an infrequent copy number variation in patients with ADHD.

  20. Genome-Wide Association Study Identifies Single Nucleotide Polymorphism in DYRK1A Associated with Replication of HIV-1 in Monocyte-Derived Macrophages

    NARCIS (Netherlands)

    Bol, S.M.; Moerland, P.D.; Limou, S.; van Remmerden, Y.; Coulonges, C.; Manen, D.; Herbeck, J.T.; Fellay, J.; Sieberer, M.; Sietzema, J.G.; van 't Slot, R.; Martinson, J.; Zagury, J.F.; Schuitemaker, H.; van 't Wout, A.B.

    2011-01-01

    Background: HIV-1 infected macrophages play an important role in rendering resting T cells permissive for infection, in spreading HIV-1 to T cells, and in the pathogenesis of AIDS dementia. During highly active anti-retroviral treatment (HAART), macrophages keep producing virus because tissue penetr

  1. A Multipurpose, High-Throughput Single-Nucleotide Polymorphism Chip for the Dengue and Yellow Fever Mosquito, Aedes aegypti.

    Science.gov (United States)

    Evans, Benjamin R; Gloria-Soria, Andrea; Hou, Lin; McBride, Carolyn; Bonizzoni, Mariangela; Zhao, Hongyu; Powell, Jeffrey R

    2015-02-26

    The dengue and yellow fever mosquito, Aedes aegypti, contributes significantly to global disease burden. Genetic study of Aedes aegypti is essential to understanding its evolutionary history, competence as a disease vector, and the effects and efficacy of vector control methods. The prevalence of repeats and transposable elements in the Aedes aegypti genome complicates marker development and makes genome-wide genetic study challenging. To overcome these challenges, we developed a high-throughput genotyping chip, Axiom_aegypti1. This chip screens for 50,000 single-nucleotide polymorphisms present in Aedes aegypti populations from around the world. The array currently used genotypes 96 samples simultaneously. To ensure that these markers satisfy assumptions commonly made in many genetic analyses, we tested for Mendelian inheritance and linkage disequilibrium in laboratory crosses and a wild population, respectively. We have validated more than 25,000 of these markers to date, and expect this number to increase with more sampling. We also present evidence of the chip's efficacy in distinguishing populations throughout the world. The markers on this chip are ideal for applications ranging from population genetics to genome-wide association studies. This tool makes rapid, cost-effective, and comparable genotype data attainable to diverse sets of Aedes aegypti researchers, from those interested in potential range shifts due to climate change to those characterizing the genetic underpinnings of its competence to transmit disease.

  2. Genome wide linkage disequilibrium in Chinese asparagus bean (Vigna. unguiculata ssp. sesquipedialis) germplasm: implications for domestication history and genome wide association studies.

    Science.gov (United States)

    Xu, P; Wu, X; Wang, B; Luo, J; Liu, Y; Ehlers, J D; Close, T J; Roberts, P A; Lu, Z; Wang, S; Li, G

    2012-07-01

    Association mapping of important traits of crop plants relies on first understanding the extent and patterns of linkage disequilibrium (LD) in the particular germplasm being investigated. We characterize here the genetic diversity, population structure and genome wide LD patterns in a set of asparagus bean (Vigna. unguiculata ssp. sesquipedialis) germplasm from China. A diverse collection of 99 asparagus bean and normal cowpea accessions were genotyped with 1127 expressed sequence tag-derived single nucleotide polymorphism markers (SNPs). The proportion of polymorphic SNPs across the collection was relatively low (39%), with an average number of SNPs per locus of 1.33. Bayesian population structure analysis indicated two subdivisions within the collection sampled that generally represented the 'standard vegetable' type (subgroup SV) and the 'non-standard vegetable' type (subgroup NSV), respectively. Level of LD (r(2)) was higher and extent of LD persisted longer in subgroup SV than in subgroup NSV, whereas LD decayed rapidly (0-2 cM) in both subgroups. LD decay distance varied among chromosomes, with the longest (≈ 5 cM) five times longer than the shortest (≈ 1 cM). Partitioning of LD variance into within- and between-subgroup components coupled with comparative LD decay analysis suggested that linkage group 5, 7 and 10 may have undergone the most intensive epistatic selection toward traits favorable for vegetable use. This work provides a first population genetic insight into domestication history of asparagus bean and demonstrates the feasibility of mapping complex traits by genome wide association study in asparagus bean using a currently available cowpea SNPs marker platform.

  3. ENGINES: exploring single nucleotide variation in entire human genomes

    Directory of Open Access Journals (Sweden)

    Salas Antonio

    2011-04-01

    Full Text Available Abstract Background Next generation ultra-sequencing technologies are starting to produce extensive quantities of data from entire human genome or exome sequences, and therefore new software is needed to present and analyse this vast amount of information. The 1000 Genomes project has recently released raw data for 629 complete genomes representing several human populations through their Phase I interim analysis and, although there are certain public tools available that allow exploration of these genomes, to date there is no tool that permits comprehensive population analysis of the variation catalogued by such data. Description We have developed a genetic variant site explorer able to retrieve data for Single Nucleotide Variation (SNVs, population by population, from entire genomes without compromising future scalability and agility. ENGINES (ENtire Genome INterface for Exploring SNVs uses data from the 1000 Genomes Phase I to demonstrate its capacity to handle large amounts of genetic variation (>7.3 billion genotypes and 28 million SNVs, as well as deriving summary statistics of interest for medical and population genetics applications. The whole dataset is pre-processed and summarized into a data mart accessible through a web interface. The query system allows the combination and comparison of each available population sample, while searching by rs-number list, chromosome region, or genes of interest. Frequency and FST filters are available to further refine queries, while results can be visually compared with other large-scale Single Nucleotide Polymorphism (SNP repositories such as HapMap or Perlegen. Conclusions ENGINES is capable of accessing large-scale variation data repositories in a fast and comprehensive manner. It allows quick browsing of whole genome variation, while providing statistical information for each variant site such as allele frequency, heterozygosity or FST values for genetic differentiation. Access to the data mart

  4. Identification and analysis of Single Nucleotide Polymorphisms (SNPs in the mosquito Anopheles funestus, malaria vector

    Directory of Open Access Journals (Sweden)

    Hemingway Janet

    2007-01-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs are the most common source of genetic variation in eukaryotic species and have become an important marker for genetic studies. The mosquito Anopheles funestus is one of the major malaria vectors in Africa and yet, prior to this study, no SNPs have been described for this species. Here we report a genome-wide set of SNP markers for use in genetic studies on this important human disease vector. Results DNA fragments from 50 genes were amplified and sequenced from 21 specimens of An. funestus. A third of specimens were field collected in Malawi, a third from a colony of Mozambican origin and a third form a colony of Angolan origin. A total of 494 SNPs including 303 within the coding regions of genes and 5 indels were identified. The physical positions of these SNPs in the genome are known. There were on average 7 SNPs per kilobase similar to that observed in An. gambiae and Drosophila melanogaster. Transitions outnumbered transversions, at a ratio of 2:1. The increased frequency of transition substitutions in coding regions is likely due to the structure of the genetic code and selective constraints. Synonymous sites within coding regions showed a higher polymorphism rate than non-coding introns or 3' and 5'flanking DNA with most of the substitutions in coding regions being observed at the 3rd codon position. A positive correlation in the level of polymorphism was observed between coding and non-coding regions within a gene. By genotyping a subset of 30 SNPs, we confirmed the validity of the SNPs identified during this study. Conclusion This set of SNP markers represents a useful tool for genetic studies in An. funestus, and will be useful in identifying candidate genes that affect diverse ranges of phenotypes that impact on vector control, such as resistance insecticide, mosquito behavior and vector competence.

  5. Single nucleotide polymorphisms for assessing genetic diversity in castor bean (Ricinus communis

    Directory of Open Access Journals (Sweden)

    Rabinowicz Pablo D

    2010-01-01

    Full Text Available Abstract Background Castor bean (Ricinus communis is an agricultural crop and garden ornamental that is widely cultivated and has been introduced worldwide. Understanding population structure and the distribution of castor bean cultivars has been challenging because of limited genetic variability. We analyzed the population genetics of R. communis in a worldwide collection of plants from germplasm and from naturalized populations in Florida, U.S. To assess genetic diversity we conducted survey sequencing of the genomes of seven diverse cultivars and compared the data to a reference genome assembly of a widespread cultivar (Hale. We determined the population genetic structure of 676 samples using single nucleotide polymorphisms (SNPs at 48 loci. Results Bayesian clustering indicated five main groups worldwide and a repeated pattern of mixed genotypes in most countries. High levels of population differentiation occurred between most populations but this structure was not geographically based. Most molecular variance occurred within populations (74% followed by 22% among populations, and 4% among continents. Samples from naturalized populations in Florida indicated significant population structuring consistent with local demes. There was significant population differentiation for 56 of 78 comparisons in Florida (pairwise population ϕPT values, p Conclusion Low levels of genetic diversity and mixing of genotypes have led to minimal geographic structuring of castor bean populations worldwide. Relatively few lineages occur and these are widely distributed. Our approach of determining population genetic structure using SNPs from genome-wide comparisons constitutes a framework for high-throughput analyses of genetic diversity in plants, particularly in species with limited genetic diversity.

  6. Common single nucleotide variants underlying drug addiction: more than a decade of research.

    Science.gov (United States)

    Bühler, Kora-Mareen; Giné, Elena; Echeverry-Alzate, Victor; Calleja-Conde, Javier; de Fonseca, Fernando Rodriguez; López-Moreno, Jose Antonio

    2015-09-01

    Drug-related phenotypes are common complex and highly heritable traits. In the last few years, candidate gene (CGAS) and genome-wide association studies (GWAS) have identified a huge number of single nucleotide polymorphisms (SNPs) associated with drug use, abuse or dependence, mainly related to alcohol or nicotine. Nevertheless, few of these associations have been replicated in independent studies. The aim of this study was to provide a review of the SNPs that have been most significantly associated with alcohol-, nicotine-, cannabis- and cocaine-related phenotypes in humans between the years of 2000 and 2012. To this end, we selected CGAS, GWAS, family-based association and case-only studies published in peer-reviewed international scientific journals (using the PubMed/MEDLINE and Addiction GWAS Resource databases) in which a significant association was reported. A total of 371 studies fit the search criteria. We then filtered SNPs with at least one replication study and performed meta-analysis of the significance of the associations. SNPs in the alcohol metabolizing genes, in the cholinergic gene cluster CHRNA5-CHRNA3-CHRNB4, and in the DRD2 and ANNK1 genes, are, to date, the most replicated and significant gene variants associated with alcohol- and nicotine-related phenotypes. In the case of cannabis and cocaine, a far fewer number of studies and replications have been reported, indicating either a need for further investigation or that the genetics of cannabis/cocaine addiction are more elusive. This review brings a global state-of-the-art vision of the behavioral genetics of addiction and collaborates on formulation of new hypothesis to guide future work.

  7. Genome-Wide Association Study Identifies Novel Loci Associated With Diisocyanate-Induced Occupational Asthma

    Science.gov (United States)

    Yucesoy, Berran; Kaufman, Kenneth M.; Lummus, Zana L.; Weirauch, Matthew T.; Zhang, Ge; Cartier, André; Boulet, Louis-Philippe; Sastre, Joaquin; Quirce, Santiago; Tarlo, Susan M.; Cruz, Maria-Jesus; Munoz, Xavier; Harley, John B.; Bernstein, David I.

    2015-01-01

    Diisocyanates, reactive chemicals used to produce polyurethane products, are the most common causes of occupational asthma. The aim of this study is to identify susceptibility gene variants that could contribute to the pathogenesis of diisocyanate asthma (DA) using a Genome-Wide Association Study (GWAS) approach. Genome-wide single nucleotide polymorphism (SNP) genotyping was performed in 74 diisocyanate-exposed workers with DA and 824 healthy controls using Omni-2.5 and Omni-5 SNP microarrays. We identified 11 SNPs that exceeded genome-wide significance; the strongest association was for the rs12913832 SNP located on chromosome 15, which has been mapped to the HERC2 gene (p = 6.94 × 10−14). Strong associations were also found for SNPs near the ODZ3 and CDH17 genes on chromosomes 4 and 8 (rs908084, p = 8.59 × 10−9 and rs2514805, p = 1.22 × 10−8, respectively). We also prioritized 38 SNPs with suggestive genome-wide significance (p < 1 × 10−6). Among them, 17 SNPs map to the PITPNC1, ACMSD, ZBTB16, ODZ3, and CDH17 gene loci. Functional genomics data indicate that 2 of the suggestive SNPs (rs2446823 and rs2446824) are located within putative binding sites for the CCAAT/Enhancer Binding Protein (CEBP) and Hepatocyte Nuclear Factor 4, Alpha transcription factors (TFs), respectively. This study identified SNPs mapping to the HERC2, CDH17, and ODZ3 genes as potential susceptibility loci for DA. Pathway analysis indicated that these genes are associated with antigen processing and presentation, and other immune pathways. Overlap of 2 suggestive SNPs with likely TF binding sites suggests possible roles in disruption of gene regulation. These results provide new insights into the genetic architecture of DA and serve as a basis for future functional and mechanistic studies. PMID:25918132

  8. The challenges of genome-wide interaction studies: lessons to learn from the analysis of HDL blood levels.

    Science.gov (United States)

    van Leeuwen, Elisabeth M; Smouter, Françoise A S; Kam-Thong, Tony; Karbalai, Nazanin; Smith, Albert V; Harris, Tamara B; Launer, Lenore J; Sitlani, Colleen M; Li, Guo; Brody, Jennifer A; Bis, Joshua C; White, Charles C; Jaiswal, Alok; Oostra, Ben A; Hofman, Albert; Rivadeneira, Fernando; Uitterlinden, Andre G; Boerwinkle, Eric; Ballantyne, Christie M; Gudnason, Vilmundur; Psaty, Bruce M; Cupples, L Adrienne; Järvelin, Marjo-Riitta; Ripatti, Samuli; Isaacs, Aaron; Müller-Myhsok, Bertram; Karssen, Lennart C; van Duijn, Cornelia M

    2014-01-01

    Genome-wide association studies (GWAS) have revealed 74 single nucleotide polymorphisms (SNPs) associated with high-density lipoprotein cholesterol (HDL) blood levels. This study is, to our knowledge, the first genome-wide interaction study (GWIS) to identify SNP×SNP interactions associated with HDL levels. We performed a GWIS in the Rotterdam Study (RS) cohort I (RS-I) using the GLIDE tool which leverages the massively parallel computing power of Graphics Processing Units (GPUs) to perform linear regression on all genome-wide pairs of SNPs. By performing a meta-analysis together with Rotterdam Study cohorts II and III (RS-II and RS-III), we were able to filter 181 interaction terms with a p-valueSPATA8 (ENSG00000185594) being associated with HDL levels. However, p-values do not reach the preset Bonferroni correction of the p-values. Our study suggest that even for highly genetically determined traits such as HDL the sample sizes needed to detect SNP×SNP interactions are large and the 2-step filtering approaches do not yield a solution. Here we present our analysis plan and our reservations concerning GWIS.

  9. Effects of environment, genetics and data analysis pitfalls in an esophageal cancer genome-wide association study.

    Directory of Open Access Journals (Sweden)

    Alexander Statnikov

    Full Text Available BACKGROUND: The development of new high-throughput genotyping technologies has allowed fast evaluation of single nucleotide polymorphisms (SNPs on a genome-wide scale. Several recent genome-wide association studies employing these technologies suggest that panels of SNPs can be a useful tool for predicting cancer susceptibility and discovery of potentially important new disease loci. METHODOLOGY/PRINCIPAL FINDINGS: In the present paper we undertake a careful examination of the relative significance of genetics, environmental factors, and biases of the data analysis protocol that was used in a previously published genome-wide association study. That prior study reported a nearly perfect discrimination of esophageal cancer patients and healthy controls on the basis of only genetic information. On the other hand, our results strongly suggest that SNPs in this dataset are not statistically linked to the phenotype, while several environmental factors and especially family history of esophageal cancer (a proxy to both environmental and genetic factors have only a modest association with the disease. CONCLUSIONS/SIGNIFICANCE: The main component of the previously claimed strong discriminatory signal is due to several data analysis pitfalls that in combination led to the strongly optimistic results. Such pitfalls are preventable and should be avoided in future studies since they create misleading conclusions and generate many false leads for subsequent research.

  10. Genome-wide mapping of copy number variation in humans: comparative analysis of high resolution array platforms.

    Directory of Open Access Journals (Sweden)

    Rajini R Haraksingh

    Full Text Available Accurate and efficient genome-wide detection of copy number variants (CNVs is essential for understanding human genomic variation, genome-wide CNV association type studies, cytogenetics research and diagnostics, and independent validation of CNVs identified from sequencing based technologies. Numerous, array-based platforms for CNV detection exist utilizing array Comparative Genome Hybridization (aCGH, Single Nucleotide Polymorphism (SNP genotyping or both. We have quantitatively assessed the abilities of twelve leading genome-wide CNV detection platforms to accurately detect Gold Standard sets of CNVs in the genome of HapMap CEU sample NA12878, and found significant differences in performance. The technologies analyzed were the NimbleGen 4.2 M, 2.1 M and 3×720 K Whole Genome and CNV focused arrays, the Agilent 1×1 M CGH and High Resolution and 2×400 K CNV and SNP+CGH arrays, the Illumina Human Omni1Quad array and the Affymetrix SNP 6.0 array. The Gold Standards used were a 1000 Genomes Project sequencing-based set of 3997 validated CNVs and an ultra high-resolution aCGH-based set of 756 validated CNVs. We found that sensitivity, total number, size range and breakpoint resolution of CNV calls were highest for CNV focused arrays. Our results are important for cost effective CNV detection and validation for both basic and clinical applications.

  11. High-speed droplet-allele-specific polymerase chain reaction for genotyping of single nucleotide polymorphisms.

    Science.gov (United States)

    Matsuda, Kazuyuki; Honda, Takayuki

    2015-01-01

    Single nucleotide alternations such as single nucleotide polymorphisms (SNPs) or single nucleotide mutations are useful genetic markers for molecular diagnosis, prognosis, drug response, and predisposition to diseases. Rapid identification of SNPs or mutations is clinically important, especially for determining drug responses and selection of molecular-targeted therapy. Here, we describe a rapid genotyping assay based on the allele-specific polymerase chain reaction (AS-PCR) by using our droplet-PCR machine (droplet-AS-PCR).

  12. Genome-wide survey of artificial mutations induced by ethyl methanesulfonate and gamma rays in tomato.

    Science.gov (United States)

    Shirasawa, Kenta; Hirakawa, Hideki; Nunome, Tsukasa; Tabata, Satoshi; Isobe, Sachiko

    2016-01-01

    Genome-wide mutations induced by ethyl methanesulfonate (EMS) and gamma irradiation in the tomato Micro-Tom genome were identified by a whole-genome shotgun sequencing analysis to estimate the spectrum and distribution of whole-genome DNA mutations and the frequency of deleterious mutations. A total of ~370 Gb of paired-end reads for four EMS-induced mutants and three gamma-ray-irradiated lines as well as a wild-type line were obtained by next-generation sequencing technology. Using bioinformatics analyses, we identified 5920 induced single nucleotide variations and insertion/deletion (indel) mutations. The predominant mutations in the EMS mutants were C/G to T/A transitions, while in the gamma-ray mutants, C/G to T/A transitions, A/T to T/A transversions, A/T to G/C transitions and deletion mutations were equally common. Biases in the base composition flanking mutations differed between the mutagenesis types. Regarding the effects of the mutations on gene function, >90% of the mutations were located in intergenic regions, and only 0.2% were deleterious. In addition, we detected 1,140,687 spontaneous single nucleotide polymorphisms and indel polymorphisms in wild-type Micro-Tom lines. We also found copy number variation, deletions and insertions of chromosomal segments in both the mutant and wild-type lines. The results provide helpful information not only for mutation research, but also for mutant screening methodology with reverse-genetic approaches.

  13. ADH single nucleotide polymorphism associations with alcohol metabolism in vivo

    Science.gov (United States)

    Birley, Andrew J.; James, Michael R.; Dickson, Peter A.; Montgomery, Grant W.; Heath, Andrew C.; Martin, Nicholas G.; Whitfield, John B.

    2009-01-01

    We have previously found that variation in alcohol metabolism in Europeans is linked to the chromosome 4q region containing the ADH gene family. We have now typed 103 single nucleotide polymorphisms (SNPs) across this region to test for allelic associations with variation in blood and breath alcohol concentrations after an alcohol challenge. In vivo alcohol metabolism was modelled with three parameters that identified the absorption and rise of alcohol concentration following ingestion, and the rate of elimination. Alleles of ADH7 SNPs were associated with the early stages of alcohol metabolism, with additional effects in the ADH1A, ADH1B and ADH4 regions. Rate of elimination was associated with SNPs in the intragenic region between ADH7 and ADH1C, and across ADH1C and ADH1B. SNPs affecting alcohol metabolism did not correspond to those reported to affect alcohol dependence or alcohol-related disease. The combined SNP associations with early- and late-stage metabolism only account for approximately 20% of the total genetic variance linked to the ADH region, and most of the variance for in vivo alcohol metabolism linked to this region is yet to be explained. PMID:19193628

  14. Bulk segregant analysis using single nucleotide polymorphism microarrays.

    Directory of Open Access Journals (Sweden)

    Anthony Becker

    Full Text Available Bulk segregant analysis (BSA using microarrays, and extreme array mapping (XAM have recently been used to rapidly identify genomic regions associated with phenotypes in multiple species. These experiments, however, require the identification of single feature polymorphisms (SFP between the cross parents for each new combination of genotypes, which raises the cost of experiments. The availability of the genomic polymorphism data in Arabidopsis thaliana, coupled with the efficient designs of Single Nucleotide Polymorphism (SNP genotyping arrays removes the requirement for SFP detection and lowers the per array cost, thereby lowering the overall cost per experiment. To demonstrate that these approaches would be functional on SNP arrays and determine confidence intervals, we analyzed hybridizations of natural accessions to the Arabidopsis ATSNPTILE array and simulated BSA or XAM given a variety of gene models, populations, and bulk selection parameters. Our results show a striking degree of correlation between the genotyping output of both methods, which suggests that the benefit of SFP genotyping in context of BSA can be had with the cheaper, more efficient SNP arrays. As a final proof of concept, we hybridized the DNA from bulks of an F2 mapping population of a Sulfur and Selenium ionomics mutant to both the Arabidopsis ATTILE1R and ATSNPTILE arrays, which produced almost identical results. We have produced R scripts that prompt the user for the required parameters and perform the BSA analysis using the ATSNPTILE1 array and have provided them as supplemental data files.

  15. Single Nucleotide Polymorphism Analysis of Protamine Genes in Infertile Men

    Directory of Open Access Journals (Sweden)

    Ahamad Salamian

    2008-01-01

    Full Text Available Background: Single nucleotide polymorphism (SNPs are considered as one of the underlyingcauses of male infertility. Proper sperm chromatin packaging which involves replacement ofhistones with protamines has profound effect on male fertility. Over 20 SNPs have been reportedfor the protamine 1 and 2.Materials and Methods: The aim of this study was to evaluate the frequency of two previouslyreported SNPs using polymerase chain reaction (PCR-restriction fragment length polymorphism(RFLP approach in 35, 96 and 177 normal, oligozoospermic and azoospermic individuals. TheseSNPs are: 1. A base pair substitution (G at position 197 instead of T in protamine type 1 Openreading frame (ORF including untranslated region, which causes an Arg residue change to Serresidue in a highly conserved region. 2. cytidine nucleotide change to thymidine in position of 248of protamine type 2 ORF which caused a nonsense point mutation.Results: The two mentioned SNPs were not present in the studied population, thus concluding thatthese SNPs can not serves as molecular markers for male infertility diagnosis.Conclusion: The results of our study reveal that in a selected Iranian population, the SNP G197Tand C248T are completely absent and are not associated with male infertility and therefore theseSNPs may not represent a molecular marker for genetic diagnosis of male infertility.

  16. Single nucleotide polymorphism-based validation of exonic splicing enhancers.

    Directory of Open Access Journals (Sweden)

    William G Fairbrother

    2004-09-01

    Full Text Available Because deleterious alleles arising from mutation are filtered by natural selection, mutations that create such alleles will be underrepresented in the set of common genetic variation existing in a population at any given time. Here, we describe an approach based on this idea called VERIFY (variant elimination reinforces functionality, which can be used to assess the extent of natural selection acting on an oligonucleotide motif or set of motifs predicted to have biological activity. As an application of this approach, we analyzed a set of 238 hexanucleotides previously predicted to have exonic splicing enhancer (ESE activity in human exons using the relative enhancer and silencer classification by unanimous enrichment (RESCUE-ESE method. Aligning the single nucleotide polymorphisms (SNPs from the public human SNP database to the chimpanzee genome allowed inference of the direction of the mutations that created present-day SNPs. Analyzing the set of SNPs that overlap RESCUE-ESE hexamers, we conclude that nearly one-fifth of the mutations that disrupt predicted ESEs have been eliminated by natural selection (odds ratio = 0.82 +/- 0.05. This selection is strongest for the predicted ESEs that are located near splice sites. Our results demonstrate a novel approach for quantifying the extent of natural selection acting on candidate functional motifs and also suggest certain features of mutations/SNPs, such as proximity to the splice site and disruption or alteration of predicted ESEs, that should be useful in identifying variants that might cause a biological phenotype.

  17. Single nucleotide polymorphisms and linkage disequilibrium in sunflower.

    Science.gov (United States)

    Kolkman, Judith M; Berry, Simon T; Leon, Alberto J; Slabaugh, Mary B; Tang, Shunxue; Gao, Wenxiang; Shintani, David K; Burke, John M; Knapp, Steven J

    2007-09-01

    Genetic diversity in modern sunflower (Helianthus annuus L.) cultivars (elite oilseed inbred lines) has been shaped by domestication and breeding bottlenecks and wild and exotic allele introgression(-)the former narrowing and the latter broadening genetic diversity. To assess single nucleotide polymorphism (SNP) frequencies, nucleotide diversity, and linkage disequilibrium (LD) in modern cultivars, alleles were resequenced from 81 genic loci distributed throughout the sunflower genome. DNA polymorphisms were abundant; 1078 SNPs (1/45.7 bp) and 178 insertions-deletions (INDELs) (1/277.0 bp) were identified in 49.4 kbp of DNA/genotype. SNPs were twofold more frequent in noncoding (1/32.1 bp) than coding (1/62.8 bp) sequences. Nucleotide diversity was only slightly lower in inbred lines ( = 0.0094) than wild populations ( = 0.0128). Mean haplotype diversity was 0.74. When extraploted across the genome ( approximately 3500 Mbp), sunflower was predicted to harbor at least 76.4 million common SNPs among modern cultivar alleles. LD decayed more slowly in inbred lines than wild populations (mean LD declined to 0.32 by 5.5 kbp in the former, the maximum physical distance surveyed), a difference attributed to domestication and breeding bottlenecks. SNP frequencies and LD decay are sufficient in modern sunflower cultivars for very high-density genetic mapping and high-resolution association mapping.

  18. Factor VII activating protease. Single nucleotide polymorphisms light the way.

    Science.gov (United States)

    Kanse, S M; Etscheid, M

    2011-08-01

    Factor VII activating protease (FSAP) is a circulating serine protease with high homology to fibrinolytic enzymes. A role in the regulation of coagulation and fibrinolysis is suspected based on in vitro studies demonstrating activation of FVII or pro-urokinase plasminogen activator (uPA). However, considering the paucity of any studies in animal models or any correlative studies in humans the role of FSAP in haemostasis remains unclear. In relation to vascular remodeling processes or inflammation it has been convincingly shown that FSAP interacts with growth factors as well as protease activated receptors (PAR). Against this sparse background there are a plethora of studies which have investigated the linkage of single nucleotide polymorphisms (SNP) in the FSAP gene (HABP2) to various diseases. The G534E SNP of FSAP is associated with a low proteolytic activity due to an amino acid exchange in the protease domain. This and other SNPs have been linked to carotid stenosis, stroke as well as thrombosis in the elderly and plaque calcification. These SNP analyses indicate an important role for FSAP in the regulation of the haemostasis system as well as fibroproliferative inflammatory processes.

  19. Genome-Wide Scan Reveals Mutation Associated with Melanoma

    Science.gov (United States)

    ... historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A team of international researchers supported by the ... when they divide and grow uncontrollably, develop into melanoma. Also, MITF activity is known to be amplified ...

  20. Genome-wide association study of clinical dimensions of schizophrenia

    DEFF Research Database (Denmark)

    Fanous, Ayman H; Zhou, Baiyu; Aggen, Steven H;

    2012-01-01

    Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia....

  1. Cancer genetic association studies in the genome-wide age

    OpenAIRE

    Savage, Sharon A

    2008-01-01

    Genome-wide association studies of hundreds of thousands of SNPs have led to a deluge of studies of genetic variation in cancer and other common diseases. Large case–control and cohort studies have identified novel SNPs as markers of cancer risk. Genome-wide association study SNP data have also advanced understanding of population-specific genetic variation. While studies of risk profiles, combinations of SNPs that may increase cancer risk, are not yet clinically applicable, future, large-sca...

  2. Genome-wide polymorphisms show unexpected targets of natural selection

    OpenAIRE

    Pespeni, Melissa H.; Garfield, David A.; Manier, Mollie K; Palumbi, Stephen R.

    2011-01-01

    Natural selection can act on all the expressed genes of an individual, leaving signatures of genetic differentiation or diversity at many loci across the genome. New power to assay these genome-wide effects of selection comes from associating multi-locus patterns of polymorphism with gene expression and function. Here, we performed one of the first genome-wide surveys in a marine species, comparing purple sea urchins, Strongylocentrotus purpuratus, from two distant locations along the species...

  3. Genome-wide association study of retinopathy in individuals without diabetes.

    Directory of Open Access Journals (Sweden)

    Richard A Jensen

    Full Text Available BACKGROUND: Mild retinopathy (microaneurysms or dot-blot hemorrhages is observed in persons without diabetes or hypertension and may reflect microvascular disease in other organs. We conducted a genome-wide association study (GWAS of mild retinopathy in persons without diabetes. METHODS: A working group agreed on phenotype harmonization, covariate selection and analytic plans for within-cohort GWAS. An inverse-variance weighted fixed effects meta-analysis was performed with GWAS results from six cohorts of 19,411 Caucasians. The primary analysis included individuals without diabetes and secondary analyses were stratified by hypertension status. We also singled out the results from single nucleotide polymorphisms (SNPs previously shown to be associated with diabetes and hypertension, the two most common causes of retinopathy. RESULTS: No SNPs reached genome-wide significance in the primary analysis or the secondary analysis of participants with hypertension. SNP, rs12155400, in the histone deacetylase 9 gene (HDAC9 on chromosome 7, was associated with retinopathy in analysis of participants without hypertension, -1.3±0.23 (beta ± standard error, p = 6.6×10(-9. Evidence suggests this was a false positive finding. The minor allele frequency was low (∼2%, the quality of the imputation was moderate (r(2 ∼0.7, and no other common variants in the HDAC9 gene were associated with the outcome. SNPs found to be associated with diabetes and hypertension in other GWAS were not associated with retinopathy in persons without diabetes or in subgroups with or without hypertension. CONCLUSIONS: This GWAS of retinopathy in individuals without diabetes showed little evidence of genetic associations. Further studies are needed to identify genes associated with these signs in order to help unravel novel pathways and determinants of microvascular diseases.

  4. The association of genome-wide significant spirometric loci with chronic obstructive pulmonary disease susceptibility.

    Science.gov (United States)

    Castaldi, Peter J; Cho, Michael H; Litonjua, Augusto A; Bakke, Per; Gulsvik, Amund; Lomas, David A; Anderson, Wayne; Beaty, Terri H; Hokanson, John E; Crapo, James D; Laird, Nan; Silverman, Edwin K

    2011-12-01

    Two recent metaanalyses of genome-wide association studies conducted by the CHARGE and SpiroMeta consortia identified novel loci yielding evidence of association at or near genome-wide significance (GWS) with FEV(1) and FEV(1)/FVC. We hypothesized that a subset of these markers would also be associated with chronic obstructive pulmonary disease (COPD) susceptibility. Thirty-two single-nucleotide polymorphisms (SNPs) in or near 17 genes in 11 previously identified GWS spirometric genomic regions were tested for association with COPD status in four COPD case-control study samples (NETT/NAS, the Norway case-control study, ECLIPSE, and the first 1,000 subjects in COPDGene; total sample size, 3,456 cases and 1,906 controls). In addition to testing the 32 spirometric GWS SNPs, we tested a dense panel of imputed HapMap2 SNP markers from the 17 genes located near the 32 GWS SNPs and in a set of 21 well studied COPD candidate genes. Of the previously identified GWS spirometric genomic regions, three loci harbored SNPs associated with COPD susceptibility at a 5% false discovery rate: the 4q24 locus including FLJ20184/INTS12/GSTCD/NPNT, the 6p21 locus including AGER and PPT2, and the 5q33 locus including ADAM19. In conclusion, markers previously associated at or near GWS with spirometric measures were tested for association with COPD status in data from four COPD case-control studies, and three loci showed evidence of association with COPD susceptibility at a 5% false discovery rate.

  5. Exhaustive Genome-Wide Search for SNP-SNP Interactions Across 10 Human Diseases.

    Science.gov (United States)

    Murk, William; DeWan, Andrew T

    2016-01-01

    The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA) study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10(-12)). Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized.

  6. Genome-wide estimates of coancestry, inbreeding and effective population size in the Spanish Holstein population.

    Directory of Open Access Journals (Sweden)

    Silvia Teresa Rodríguez-Ramilo

    Full Text Available Estimates of effective population size in the Holstein cattle breed have usually been low despite the large number of animals that constitute this breed. Effective population size is inversely related to the rates at which coancestry and inbreeding increase and these rates have been high as a consequence of intense and accurate selection. Traditionally, coancestry and inbreeding coefficients have been calculated from pedigree data. However, the development of genome-wide single nucleotide polymorphisms has increased the interest of calculating these coefficients from molecular data in order to improve their accuracy. In this study, genomic estimates of coancestry, inbreeding and effective population size were obtained in the Spanish Holstein population and then compared with pedigree-based estimates. A total of 11,135 animals genotyped with the Illumina BovineSNP50 BeadChip were available for the study. After applying filtering criteria, the final genomic dataset included 36,693 autosomal SNPs and 10,569 animals. Pedigree data from those genotyped animals included 31,203 animals. These individuals represented only the last five generations in order to homogenise the amount of pedigree information across animals. Genomic estimates of coancestry and inbreeding were obtained from identity by descent segments (coancestry or runs of homozygosity (inbreeding. The results indicate that the percentage of variance of pedigree-based coancestry estimates explained by genomic coancestry estimates was higher than that for inbreeding. Estimates of effective population size obtained from genome-wide and pedigree information were consistent and ranged from about 66 to 79. These low values emphasize the need of controlling the rate of increase of coancestry and inbreeding in Holstein selection programmes.

  7. Genome-wide association study of coronary and aortic calcification in lung cancer screening CT

    Science.gov (United States)

    de Vos, Bob D.; van Setten, Jessica; de Jong, Pim A.; Mali, Willem P.; Oudkerk, Matthijs; Viergever, Max A.; Išgum, Ivana

    2016-03-01

    Arterial calcification has been related to cardiovascular disease (CVD) and osteoporosis. However, little is known about the role of genetics and exact pathways leading to arterial calcification and its relation to bone density changes indicating osteoporosis. In this study, we conducted a genome-wide association study of arterial calcification burden, followed by a look-up of known single nucleotide polymorphisms (SNPs) for coronary artery disease (CAD) and myocardial infarction (MI), and bone mineral density (BMD) to test for a shared genetic basis between the traits. The study included a subcohort of the Dutch-Belgian lung cancer screening trial comprised of 2,561 participants. Participants underwent baseline CT screening in one of two hospitals participating in the trial. Low-dose chest CT images were acquired without contrast enhancement and without ECG-synchronization. In these images coronary and aortic calcifications were identified automatically. Subsequently, the detected calcifications were quantified using coronary artery calcium Agatston and volume scores. Genotype data was available for these participants. A genome-wide association study was conducted on 10,220,814 SNPs using a linear regression model. To reduce multiple testing burden, known CAD/MI and BMD SNPs were specifically tested (45 SNPs from the CARDIoGRAMplusC4D consortium and 60 SNPS from the GEFOS consortium). No novel significant SNPs were found. Significant enrichment for CAD/MI SNPs was observed in testing Agatston and coronary artery calcium volume scores. Moreover, a significant enrichment of BMD SNPs was shown in aortic calcium volume scores. This may indicate genetic relation of BMD SNPs and arterial calcification burden.

  8. A genome-wide association study reveals variants in ARL15 that influence adiponectin levels.

    Directory of Open Access Journals (Sweden)

    J Brent Richards

    2009-12-01

    Full Text Available The adipocyte-derived protein adiponectin is highly heritable and inversely associated with risk of type 2 diabetes mellitus (T2D and coronary heart disease (CHD. We meta-analyzed 3 genome-wide association studies for circulating adiponectin levels (n = 8,531 and sought validation of the lead single nucleotide polymorphisms (SNPs in 5 additional cohorts (n = 6,202. Five SNPs were genome-wide significant in their relationship with adiponectin (P< or =5x10(-8. We then tested whether these 5 SNPs were associated with risk of T2D and CHD using a Bonferroni-corrected threshold of P< or =0.011 to declare statistical significance for these disease associations. SNPs at the adiponectin-encoding ADIPOQ locus demonstrated the strongest associations with adiponectin levels (P-combined = 9.2x10(-19 for lead SNP, rs266717, n = 14,733. A novel variant in the ARL15 (ADP-ribosylation factor-like 15 gene was associated with lower circulating levels of adiponectin (rs4311394-G, P-combined = 2.9x10(-8, n = 14,733. This same risk allele at ARL15 was also associated with a higher risk of CHD (odds ratio [OR] = 1.12, P = 8.5x10(-6, n = 22,421 more nominally, an increased risk of T2D (OR = 1.11, P = 3.2x10(-3, n = 10,128, and several metabolic traits. Expression studies in humans indicated that ARL15 is well-expressed in skeletal muscle. These findings identify a novel protein, ARL15, which influences circulating adiponectin levels and may impact upon CHD risk.

  9. Genome-wide estimates of coancestry, inbreeding and effective population size in the Spanish Holstein population.

    Science.gov (United States)

    Rodríguez-Ramilo, Silvia Teresa; Fernández, Jesús; Toro, Miguel Angel; Hernández, Delfino; Villanueva, Beatriz

    2015-01-01

    Estimates of effective population size in the Holstein cattle breed have usually been low despite the large number of animals that constitute this breed. Effective population size is inversely related to the rates at which coancestry and inbreeding increase and these rates have been high as a consequence of intense and accurate selection. Traditionally, coancestry and inbreeding coefficients have been calculated from pedigree data. However, the development of genome-wide single nucleotide polymorphisms has increased the interest of calculating these coefficients from molecular data in order to improve their accuracy. In this study, genomic estimates of coancestry, inbreeding and effective population size were obtained in the Spanish Holstein population and then compared with pedigree-based estimates. A total of 11,135 animals genotyped with the Illumina BovineSNP50 BeadChip were available for the study. After applying filtering criteria, the final genomic dataset included 36,693 autosomal SNPs and 10,569 animals. Pedigree data from those genotyped animals included 31,203 animals. These individuals represented only the last five generations in order to homogenise the amount of pedigree information across animals. Genomic estimates of coancestry and inbreeding were obtained from identity by descent segments (coancestry) or runs of homozygosity (inbreeding). The results indicate that the percentage of variance of pedigree-based coancestry estimates explained by genomic coancestry estimates was higher than that for inbreeding. Estimates of effective population size obtained from genome-wide and pedigree information were consistent and ranged from about 66 to 79. These low values emphasize the need of controlling the rate of increase of coancestry and inbreeding in Holstein selection programmes.

  10. Exhaustive Genome-Wide Search for SNP-SNP Interactions Across 10 Human Diseases

    Directory of Open Access Journals (Sweden)

    William Murk

    2016-07-01

    Full Text Available The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs with minor allele frequency (MAF ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10−12. Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized.

  11. Identification of a novel susceptibility locus for juvenile idiopathic arthritis by genome-wide association analysis

    Science.gov (United States)

    Hinks, Anne; Barton, Anne; Shephard, Neil; Eyre, Steve; Bowes, John; Cargill, Michele; Wang, Eric; Ke, Xiayi; Kennedy, Giulia C; John, Sally; Worthington, Jane; Thomson, Wendy

    2009-01-01

    Objective Juvenile idiopathic arthritis (JIA) is a chronic rheumatic disease of childhood. Two well-established genetic factors known to contribute to JIA susceptibility, HLA and PTPN22, account for less than half of the genetic susceptibility to disease; therefore, additional genetic factors have yet to be identified. The purpose of this study was to perform a systematic search of the genome to identify novel susceptibility loci for JIA. Methods A genome-wide association study using Affymetrix GeneChip 100K arrays was performed in a discovery cohort (279 cases and 184 controls). Single-nucleotide polymorphisms (SNPs) showing the most significant differences between cases and controls were then genotyped in a validation sample of cases (n = 321) and controls, combined with control data from the 1958 UK birth cohort (n = 2,024). In one region in which association was confirmed, fine-mapping was performed (654 cases and 1,847 controls). Results Of the 112 SNPs that were significantly associated with JIA in the discovery cohort, 6 SNPs were associated with JIA in the independent validation cohort. The most strongly associated SNP mapped to the HLA region, while the second strongest association was with a SNP within the VTCN1 gene. Fine-mapping of that gene was performed, and 10 SNPs were found to be associated with JIA. Conclusion This study is the first to successfully apply a SNP-based genome-wide association approach to the investigation of JIA. The replicated association with markers in the VTCN1 gene defined an additional susceptibility locus for JIA and implicates a novel pathway in the pathogenesis of this chronic disease of childhood. PMID:19116933

  12. Genome-wide association analysis of autoantibody positivity in type 1 diabetes cases.

    Directory of Open Access Journals (Sweden)

    Vincent Plagnol

    2011-08-01

    Full Text Available The genetic basis of autoantibody production is largely unknown outside of associations located in the major histocompatibility complex (MHC human leukocyte antigen (HLA region. The aim of this study is the discovery of new genetic associations with autoantibody positivity using genome-wide association scan single nucleotide polymorphism (SNP data in type 1 diabetes (T1D patients with autoantibody measurements. We measured two anti-islet autoantibodies, glutamate decarboxylase (GADA, n = 2,506, insulinoma-associated antigen 2 (IA-2A, n = 2,498, antibodies to the autoimmune thyroid (Graves' disease (AITD autoantigen thyroid peroxidase (TPOA, n = 8,300, and antibodies against gastric parietal cells (PCA, n = 4,328 that are associated with autoimmune gastritis. Two loci passed a stringent genome-wide significance level (p<10(-10: 1q23/FCRL3 with IA-2A and 9q34/ABO with PCA. Eleven of 52 non-MHC T1D loci showed evidence of association with at least one autoantibody at a false discovery rate of 16%: 16p11/IL27-IA-2A, 2q24/IFIH1-IA-2A and PCA, 2q32/STAT4-TPOA, 10p15/IL2RA-GADA, 6q15/BACH2-TPOA, 21q22/UBASH3A-TPOA, 1p13/PTPN22-TPOA, 2q33/CTLA4-TPOA, 4q27/IL2/TPOA, 15q14/RASGRP1/TPOA, and 12q24/SH2B3-GADA and TPOA. Analysis of the TPOA-associated loci in 2,477 cases with Graves' disease identified two new AITD loci (BACH2 and UBASH3A.

  13. Impact of the genome wide supported NRGN gene on anterior cingulate morphology in schizophrenia.

    Directory of Open Access Journals (Sweden)

    Kazutaka Ohi

    Full Text Available BACKGROUND: The rs12807809 single-nucleotide polymorphism in NRGN is a genetic risk variant with genome-wide significance for schizophrenia. The frequency of the T allele of rs12807809 is higher in individuals with schizophrenia than in those without the disorder. Reduced immunoreactivity of NRGN, which is expressed exclusively in the brain, has been observed in Brodmann areas (BA 9 and 32 of the prefrontal cortex in postmortem brains from patients with schizophrenia compared with those in controls. METHODS: Genotype effects of rs12807809 were investigated on gray matter (GM and white matter (WM volumes using magnetic resonance imaging (MRI with a voxel-based morphometry (VBM technique in a sample of 99 Japanese patients with schizophrenia and 263 healthy controls. RESULTS: Although significant genotype-diagnosis interaction either on GM or WM volume was not observed, there was a trend of genotype-diagnosis interaction on GM volume in the left anterior cingulate cortex (ACC. Thus, the effects of NRGN genotype on GM volume of patients with schizophrenia and healthy controls were separately investigated. In patients with schizophrenia, carriers of the risk T allele had a smaller GM volume in the left ACC (BA32 than did carriers of the non-risk C allele. Significant genotype effect on other regions of the GM or WM was not observed for either the patients or controls. CONCLUSIONS: Our findings suggest that the genome-wide associated genetic risk variant in the NRGN gene may be related to a small GM volume in the ACC in the left hemisphere in patients with schizophrenia.

  14. Genome-wide interaction study of smoking and bladder cancer risk

    Science.gov (United States)

    Figueroa, Jonine D.; Han, Summer S.; Garcia-Closas, Montserrat; Baris, Dalsu; Jacobs, Eric J.; Kogevinas, Manolis; Schwenn, Molly; Malats, Nuria; Johnson, Alison; Purdue, Mark P.; Caporaso, Neil; Landi, Maria Teresa; Prokunina-Olsson, Ludmila; Wang, Zhaoming; Hutchinson, Amy; Burdette, Laurie; Wheeler, William; Vineis, Paolo; Siddiq, Afshan; Cortessis, Victoria K.; Kooperberg, Charles; Cussenot, Olivier; Benhamou, Simone; Prescott, Jennifer; Porru, Stefano; Bueno-de-Mesquita, H.Bas; Trichopoulos, Dimitrios; Ljungberg, Börje; Clavel-Chapelon, Françoise; Weiderpass, Elisabete; Krogh, Vittorio; Dorronsoro, Miren; Travis, Ruth; Tjønneland, Anne; Brenan, Paul; Chang-Claude, Jenny; Riboli, Elio; Conti, David; Gago-Dominguez, Manuela; Stern, Mariana C.; Pike, Malcolm C.; Van Den Berg, David; Yuan, Jian-Min; Hohensee, Chancellor; Rodabough, Rebecca; Cancel-Tassin, Geraldine; Roupret, Morgan; Comperat, Eva; Chen, Constance; De Vivo, Immaculata; Giovannucci, Edward; Hunter, David J.; Kraft, Peter; Lindstrom, Sara; Carta, Angela; Pavanello, Sofia; Arici, Cecilia; Mastrangelo, Giuseppe; Karagas, Margaret R.; Schned, Alan; Armenti, Karla R.; Hosain, G.M.Monawar; Haiman, Chris A.; Fraumeni, Joseph F.; Chanock, Stephen J.; Chatterjee, Nilanjan; Rothman, Nathaniel; Silverman, Debra T.

    2014-01-01

    Bladder cancer is a complex disease with known environmental and genetic risk factors. We performed a genome-wide interaction study (GWAS) of smoking and bladder cancer risk based on primary scan data from 3002 cases and 4411 controls from the National Cancer Institute Bladder Cancer GWAS. Alternative methods were used to evaluate both additive and multiplicative interactions between individual single nucleotide polymorphisms (SNPs) and smoking exposure. SNPs with interaction P values < 5 × 10− 5 were evaluated further in an independent dataset of 2422 bladder cancer cases and 5751 controls. We identified 10 SNPs that showed association in a consistent manner with the initial dataset and in the combined dataset, providing evidence of interaction with tobacco use. Further, two of these novel SNPs showed strong evidence of association with bladder cancer in tobacco use subgroups that approached genome-wide significance. Specifically, rs1711973 (FOXF2) on 6p25.3 was a susceptibility SNP for never smokers [combined odds ratio (OR) = 1.34, 95% confidence interval (CI) = 1.20–1.50, P value = 5.18 × 10− 7]; and rs12216499 (RSPH3-TAGAP-EZR) on 6q25.3 was a susceptibility SNP for ever smokers (combined OR = 0.75, 95% CI = 0.67–0.84, P value = 6.35 × 10− 7). In our analysis of smoking and bladder cancer, the tests for multiplicative interaction seemed to more commonly identify susceptibility loci with associations in never smokers, whereas the additive interaction analysis identified more loci with associations among smokers—including the known smoking and NAT2 acetylation interaction. Our findings provide additional evidence of gene–environment interactions for tobacco and bladder cancer. PMID:24662972

  15. Discovering Genome-Wide Tag SNPs Based on the Mutual Information of the Variants

    Science.gov (United States)

    Elmas, Abdulkadir; Ou Yang, Tai-Hsien; Wang, Xiaodong

    2016-01-01

    Exploring linkage disequilibrium (LD) patterns among the single nucleotide polymorphism (SNP) sites can improve the accuracy and cost-effectiveness of genomic association studies, whereby representative (tag) SNPs are identified to sufficiently represent the genomic diversity in populations. There has been considerable amount of effort in developing efficient algorithms to select tag SNPs from the growing large-scale data sets. Methods using the classical pairwise-LD and multi-locus LD measures have been proposed that aim to reduce the computational complexity and to increase the accuracy, respectively. The present work solves the tag SNP selection problem by efficiently balancing the computational complexity and accuracy, and improves the coverage in genomic diversity in a cost-effective manner. The employed algorithm makes use of mutual information to explore the multi-locus association between SNPs and can handle different data types and conditions. Experiments with benchmark HapMap data sets show comparable or better performance against the state-of-the-art algorithms. In particular, as a novel application, the genome-wide SNP tagging is performed in the 1000 Genomes Project data sets, and produced a well-annotated database of tagging variants that capture the common genotype diversity in 2,504 samples from 26 human populations. Compared to conventional methods, the algorithm requires as input only the genotype (or haplotype) sequences, can scale up to genome-wide analyses, and produces accurate solutions with more information-rich output, providing an improved platform for researchers towards the subsequent association studies. PMID:27992465

  16. Sequencing genes in silico using single nucleotide polymorphisms

    Directory of Open Access Journals (Sweden)

    Zhang Xinyi

    2012-01-01

    Full Text Available Abstract Background The advent of high throughput sequencing technology has enabled the 1000 Genomes Project Pilot 3 to generate complete sequence data for more than 906 genes and 8,140 exons representing 697 subjects. The 1000 Genomes database provides a critical opportunity for further interpreting disease associations with single nucleotide polymorphisms (SNPs discovered from genetic association studies. Currently, direct sequencing of candidate genes or regions on a large number of subjects remains both cost- and time-prohibitive. Results To accelerate the translation from discovery to functional studies, we propose an in silico gene sequencing method (ISS, which predicts phased sequences of intragenic regions, using SNPs. The key underlying idea of our method is to infer diploid sequences (a pair of phased sequences/alleles at every functional locus utilizing the deep sequencing data from the 1000 Genomes Project and SNP data from the HapMap Project, and to build prediction models using flanking SNPs. Using this method, we have developed a database of prediction models for 611 known genes. Sequence prediction accuracy for these genes is 96.26% on average (ranges 79%-100%. This database of prediction models can be enhanced and scaled up to include new genes as the 1000 Genomes Project sequences additional genes on additional individuals. Applying our predictive model for the KCNJ11 gene to the Wellcome Trust Case Control Consortium (WTCCC Type 2 diabetes cohort, we demonstrate how the prediction of phased sequences inferred from GWAS SNP genotype data can be used to facilitate interpretation and identify a probable functional mechanism such as protein changes. Conclusions Prior to the general availability of routine sequencing of all subjects, the ISS method proposed here provides a time- and cost-effective approach to broadening the characterization of disease associated SNPs and regions, and facilitating the prioritization of candidate

  17. Network analysis of single nucleotide polymorphisms in asthma

    Science.gov (United States)

    Renkonen, Jutta; Joenväärä, Sakari; Parviainen, Ville; Mattila, Pirkko; Renkonen, Risto

    2010-01-01

    Background: Asthma is a chronic inflammatory disease of the airways with a complex genetic background. In this study, we carried out a meta-analysis of single nucleotide polymorphisms (SNPs) thought to be associated with asthma. Methods: The literature (PubMed) was searched for SNPs within genes relevant in asthma. The SNP-modified genes were converted to corresponding proteins, and their protein–protein interactions were searched from six different databases. This interaction network was analyzed using annotated vocabularies (ontologies), such as the Gene Ontology and Nature pathway interaction databases. Results: In total, 127 genes with SNPs related to asthma were found in the literature. The corresponding proteins were then entered into a large protein–protein interaction network with the help of various databases. Ninety-six SNP-related proteins had more than one interacting protein each, and a network containing 309 proteins and 644 connections was generated. This network was significantly enriched with a gene ontology entitled “protein binding” and several of its daughter categories, including receptor binding and cytokine binding, when compared with the background human proteome. In the detailed analysis, the chemokine network, including eight proteins and 13 toll-like receptors, were shown to interact with each other. Of great interest are the nonsynonymous SNPs which code for an alternative amino acid sequence of proteins and, of the toll-like receptor network, TLR1, TLR4, TLR5, TLR6, TLR10, IL4R, and IL13 are among these. Conclusions: Protein binding, toll-like receptors, and chemokines dominated in the asthma-related protein interaction network. Systems level analysis of allergy-related mutations can provide new insights into the pathogenetic mechanisms of disease. PMID:21437052

  18. Association of prediabetes-associated single nucleotide polymorphisms with microalbuminuria

    Science.gov (United States)

    Choi, Jong Wook; Moon, Shinje; Jang, Eun Jung; Lee, Chang Hwa; Park, Joon-Sung

    2017-01-01

    Increased glycemic exposure, even below the diagnostic criteria for diabetes mellitus, is crucial in the pathogenesis of diabetic microvascular complications represented by microalbuminuria. Nonetheless, there is limited evidence regarding which single nucleotide polymorphisms (SNPs) are associated with prediabetes and whether genetic predisposition to prediabetes is related to microalbuminuria, especially in the general population. Our objective was to answer these questions. We conducted a genomewide association study (GWAS) separately on two population-based cohorts, Ansung and Ansan, in the Korean Genome and Epidemiology Study (KoGES). The initial GWAS was carried out on the Ansung cohort, followed by a replication study on the Ansan cohort. A total of 5682 native Korean participants without a significant medical illness were classified into either control group (n = 3153) or prediabetic group (n = 2529). In the GWAS, we identified two susceptibility loci associated with prediabetes, one at 17p15.3-p15.1 in the GCK gene and another at 7p15.1 in YKT6. When variations in GCK and YKT6 were used as a model of prediabetes, this genetically determined prediabetes increased microalbuminuria. Multiple logistic regression analyses revealed that fasting glucose concentration in plasma and SNP rs2908289 in GCK were associated with microalbuminuria, and adjustment for age, gender, smoking history, systolic blood pressure, waist circumference, and serum triglyceride levels did not attenuate this association. Our results suggest that prediabetes and the associated SNPs may predispose to microalbuminuria before the diagnosis of diabetes mellitus. Further studies are needed to explore the details of the physiological and molecular mechanisms underlying this genetic association. PMID:28158221

  19. Evaluation of genome-wide power of genetic association studies based on empirical data from the HapMap project.

    Science.gov (United States)

    Nannya, Yasuhito; Taura, Kenjiro; Kurokawa, Mineo; Chiba, Shigeru; Ogawa, Seishi

    2007-10-15

    With recent advances in high-throughput single nucleotide polymorphism (SNP) typing technologies, genome-wide association studies have become a realistic approach to identify the causative genes that are responsible for common diseases of complex genetic traits. In this strategy, a trade-off between the increased genome coverage and a chance of finding SNPs incidentally showing a large statistics becomes serious due to extreme multiple-hypothesis testing. We investigated the extent to which this trade-off limits the genome-wide power with this approach by simulating a large number of case-control panels based on the empirical data from the HapMap Project. In our simulations, statistical costs of multiple hypothesis testing were evaluated by empirically calculating distributions of the maximum value of the chi(2) statistics for a series of marker sets having increasing numbers of SNPs, which were used to determine a genome-wide threshold in the following power simulations. With a practical study size, the cost of multiple testing largely offsets the potential benefits from increased genome coverage given modest genetic effects and/or low frequencies of causal alleles. In most realistic scenarios, increasing genome coverage becomes less influential on the power, while sample size is the predominant determinant of the feasibility of genome-wide association tests. Increasing genome coverage without corresponding increase in sample size will only consume resources without little gain in power. For common causal alleles with relatively large effect sizes [genotype relative risk > or =1.7], we can expect satisfactory power with currently available large-scale genotyping platforms using realistic sample size ( approximately 1000 per arm).

  20. A genome-wide association and gene-environment interaction study for serum triglycerides levels in a healthy Chinese male population.

    Science.gov (United States)

    Tan, Aihua; Sun, Jielin; Xia, Ning; Qin, Xue; Hu, Yanling; Zhang, Shijun; Tao, Sha; Gao, Yong; Yang, Xiaobo; Zhang, Haiying; Kim, Seong-Tae; Peng, Tao; Lin, Xiaoling; Li, Li; Mo, Linjian; Liang, Zhengjia; Shi, Deyi; Huang, Zhang; Huang, Xianghua; Liu, Ming; Ding, Qiang; Trent, Jeffrey M; Zheng, S Lilly; Mo, Zengnan; Xu, Jianfeng

    2012-04-01

    Triglyceride (TG) is a complex phenotype influenced by both genetic and environmental factors. Recent genome-wide association studies (GWAS) have identified genes or loci affecting lipid levels; however, such studies in Chinese populations are limited. A two-stage GWAS were conducted to identify genetic variants that were associated with TG in a Chinese population of 3495 men. Gene-environment interactions on serum TG levels were further investigated for the seven single nucleotide polymorphisms (SNPs) that were studied in both stages. Two previously reported SNPs (rs651821 in APOA5, rs328 in LPL) were replicated in the second stage, and the combined P-values were 9.19 × 10(-26) and 1.41 × 10(-9) for rs651821 and rs328, respectively. More importantly, a significant interaction between aldehyde dehydrogenase 2 (ALDH2) rs671 and alcohol consumption on serum TG levels were observed (P = 3.34 × 10(-5)). Rs671 was significantly associated with serum TG levels in drinkers (P = 1.90 × 10(-10)), while no association was observed in non-drinkers (P > 0.05). For drinkers, men carrying the AA/AG genotype have significantly lower serum TG levels, compared with men carrying the GG genotype. For men with the GG genotype, the serum TG levels increased with the quantity of alcohol intake (P = 1.28 × 10(-8) for trend test). We identified a novel, significant interaction effect between alcohol consumption and the ALDH2 rs671 polymorphism on TG levels, which suggests that the effect of alcohol intake on TG occurs in a two-faceted manner. Just one drink can increase TG level in susceptible individuals who carry the GG genotype, while individuals carrying AA/AG genotypes may actually benefit from moderate drinking.

  1. Novel Single-Nucleotide Polymorphism Markers Predictive of Pathologic Response to Preoperative Chemoradiation Therapy in Rectal Cancer Patients

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Jin C., E-mail: jckim@amc.seoul.kr [Department of Surgery, University of Ulsan College of Medicine, Seoul (Korea, Republic of); Institute of Innovative Cancer Research and Asan Institute for Life Sciences, Asan Medical Center, Seoul (Korea, Republic of); Ha, Ye J.; Roh, Seon A. [Department of Surgery, University of Ulsan College of Medicine, Seoul (Korea, Republic of); Institute of Innovative Cancer Research and Asan Institute for Life Sciences, Asan Medical Center, Seoul (Korea, Republic of); Cho, Dong H. [Institute of Innovative Cancer Research and Asan Institute for Life Sciences, Asan Medical Center, Seoul (Korea, Republic of); Graduate School of East-West Medical Science, Kyung Hee University, Gyeoggi-do (Korea, Republic of); Choi, Eun Y. [Department of Surgery, University of Ulsan College of Medicine, Seoul (Korea, Republic of); Institute of Innovative Cancer Research and Asan Institute for Life Sciences, Asan Medical Center, Seoul (Korea, Republic of); Kim, Tae W. [Institute of Innovative Cancer Research and Asan Institute for Life Sciences, Asan Medical Center, Seoul (Korea, Republic of); Department of Internal Medicine, University of Ulsan College of Medicine, Seoul (Korea, Republic of); Kim, Jong H. [Department of Radiation Oncology, University of Ulsan College of Medicine, Seoul (Korea, Republic of); Kang, Tae W. [Medical Genomics Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon (Korea, Republic of); Kim, Seon Y. [Institute of Innovative Cancer Research and Asan Institute for Life Sciences, Asan Medical Center, Seoul (Korea, Republic of); Medical Genomics Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon (Korea, Republic of); Kim, Yong S., E-mail: yongsung@kribb.re.kr [Institute of Innovative Cancer Research and Asan Institute for Life Sciences, Asan Medical Center, Seoul (Korea, Republic of); Medical Genomics Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon (Korea, Republic of)

    2013-06-01

    Purpose: Studies aimed at predicting individual responsiveness to preoperative chemoradiation therapy (CRT) are urgently needed, especially considering the risks associated with poorly responsive patients. Methods and Materials: A 3-step strategy for the determination of CRT sensitivity is proposed based on (1) the screening of a human genome-wide single-nucleotide polymorphism (SNP) array in correlation with histopathologic tumor regression grade (TRG); (2) clinical association analysis of 113 patients treated with preoperative CRT; and (3) a cell-based functional assay for biological validation. Results: Genome-wide screening identified 9 SNPs associated with preoperative CRT responses. Positive responses (TRG 1-3) were obtained more frequently in patients carrying the reference allele (C) of the SNP CORO2A rs1985859 than in those with the substitution allele (T) (P=.01). Downregulation of CORO2A was significantly associated with reduced early apoptosis by 27% (P=.048) and 39% (P=.023) in RKO and COLO320DM colorectal cancer cells, respectively, as determined by flow cytometry. Reduced radiosensitivity was confirmed by colony-forming assays in the 2 colorectal cancer cells (P=.034 and .015, respectively). The SNP FAM101A rs7955740 was not associated with radiosensitivity in the clinical association analysis. However, downregulation of FAM101A significantly reduced early apoptosis by 29% in RKO cells (P=.047), and it enhanced colony formation in RKO cells (P=.001) and COLO320DM cells (P=.002). Conclusion: CRT-sensitive SNP markers were identified using a novel 3-step process. The candidate marker CORO2A rs1985859 and the putative marker FAM101A rs7955740 may be of value for the prediction of radiosensitivity to preoperative CRT, although further validation is needed in large cohorts.

  2. Genome-wide association study identifies novel breast cancer susceptibility loci

    Science.gov (United States)

    Easton, Douglas F.; Pooley, Karen A.; Dunning, Alison M.; Pharoah, Paul D. P.; Thompson, Deborah; Ballinger, Dennis G.; Struewing, Jeffery P.; Morrison, Jonathan; Field, Helen; Luben, Robert; Wareham, Nicholas; Ahmed, Shahana; Healey, Catherine S.; Bowman, Richard; Meyer, Kerstin B.; Haiman, Christopher A.; Kolonel, Laurence K.; Henderson, Brian E.; Marchand, Loic Le; Brennan, Paul; Sangrajrang, Suleeporn; Gaborieau, Valerie; Odefrey, Fabrice; Shen, Chen-Yang; Wu, Pei-Ei; Wang, Hui-Chun; Eccles, Diana; Evans, D. Gareth; Peto, Julian; Fletcher, Olivia; Johnson, Nichola; Seal, Sheila; Stratton, Michael R.; Rahman, Nazneen; Chenevix-Trench, Georgia; Bojesen, Stig E.; Nordestgaard, Børge G.; Axelsson, Christen K.; Garcia-Closas, Montserrat; Brinton, Louise; Chanock, Stephen; Lissowska, Jolanta; Peplonska, Beata; Nevanlinna, Heli; Fagerholm, Rainer; Eerola, Hannaleena; Kang, Daehee; Yoo, Keun-Young; Noh, Dong-Young; Ahn, Sei-Hyun; Hunter, David J.; Hankinson, Susan E.; Cox, David G.; Hall, Per; Wedren, Sara; Liu, Jianjun; Low, Yen-Ling; Bogdanova, Natalia; Schürmann, Peter; Dörk, Thilo; Tollenaar, Rob A. E. M.; Jacobi, Catharina E.; Devilee, Peter; Klijn, Jan G. M.; Sigurdson, Alice J.; Doody, Michele M.; Alexander, Bruce H.; Zhang, Jinghui; Cox, Angela; Brock, Ian W.; MacPherson, Gordon; Reed, Malcolm W. R.; Couch, Fergus J.; Goode, Ellen L.; Olson, Janet E.; Meijers-Heijboer, Hanne; van den Ouweland, Ans; Uitterlinden, André; Rivadeneira, Fernando; Milne, Roger L.; Ribas, Gloria; Gonzalez-Neira, Anna; Benitez, Javier; Hopper, John L.; McCredie, Margaret; Southey, Melissa; Giles, Graham G.; Schroen, Chris; Justenhoven, Christina; Brauch, Hiltrud; Hamann, Ute; Ko, Yon-Dschun; Spurdle, Amanda B.; Beesley, Jonathan; Chen, Xiaoqing; Mannermaa, Arto; Kosma, Veli-Matti; Kataja, Vesa; Hartikainen, Jaana; Day, Nicholas E.; Cox, David R.; Ponder, Bruce A. J.; Luccarini, Craig; Conroy, Don; Shah, Mitul; Munday, Hannah; Jordan, Clare; Perkins, Barbara; West, Judy; Redman, Karen; Driver, Kristy; Aghmesheh, Morteza; Amor, David; Andrews, Lesley; Antill, Yoland; Armes, Jane; Armitage, Shane; Arnold, Leanne; Balleine, Rosemary; Begley, Glenn; Beilby, John; Bennett, Ian; Bennett, Barbara; Berry, Geoffrey; Blackburn, Anneke; Brennan, Meagan; Brown, Melissa; Buckley, Michael; Burke, Jo; Butow, Phyllis; Byron, Keith; Callen, David; Campbell, Ian; Chenevix-Trench, Georgia; Clarke, Christine; Colley, Alison; Cotton, Dick; Cui, Jisheng; Culling, Bronwyn; Cummings, Margaret; Dawson, Sarah-Jane; Dixon, Joanne; Dobrovic, Alexander; Dudding, Tracy; Edkins, Ted; Eisenbruch, Maurice; Farshid, Gelareh; Fawcett, Susan; Field, Michael; Firgaira, Frank; Fleming, Jean; Forbes, John; Friedlander, Michael; Gaff, Clara; Gardner, Mac; Gattas, Mike; George, Peter; Giles, Graham; Gill, Grantley; Goldblatt, Jack; Greening, Sian; Grist, Scott; Haan, Eric; Harris, Marion; Hart, Stewart; Hayward, Nick; Hopper, John; Humphrey, Evelyn; Jenkins, Mark; Jones, Alison; Kefford, Rick; Kirk, Judy; Kollias, James; Kovalenko, Sergey; Lakhani, Sunil; Leary, Jennifer; Lim, Jacqueline; Lindeman, Geoff; Lipton, Lara; Lobb, Liz; Maclurcan, Mariette; Mann, Graham; Marsh, Deborah; McCredie, Margaret; McKay, Michael; McLachlan, Sue Anne; Meiser, Bettina; Milne, Roger; Mitchell, Gillian; Newman, Beth; O'Loughlin, Imelda; Osborne, Richard; Peters, Lester; Phillips, Kelly; Price, Melanie; Reeve, Jeanne; Reeve, Tony; Richards, Robert; Rinehart, Gina; Robinson, Bridget; Rudzki, Barney; Salisbury, Elizabeth; Sambrook, Joe; Saunders, Christobel; Scott, Clare; Scott, Elizabeth; Scott, Rodney; Seshadri, Ram; Shelling, Andrew; Southey, Melissa; Spurdle, Amanda; Suthers, Graeme; Taylor, Donna; Tennant, Christopher; Thorne, Heather; Townshend, Sharron; Tucker, Kathy; Tyler, Janet; Venter, Deon; Visvader, Jane; Walpole, Ian; Ward, Robin; Waring, Paul; Warner, Bev; Warren, Graham; Watson, Elizabeth; Williams, Rachael; Wilson, Judy; Winship, Ingrid; Young, Mary Ann; Bowtell, David; Green, Adele; deFazio, Anna; Chenevix-Trench, Georgia; Gertig, Dorota; Webb, Penny

    2009-01-01

    Breast cancer exhibits familial aggregation, consistent with variation in genetic susceptibility to the disease. Known susceptibility genes account for less than 25% of the familial risk of breast cancer, and the residual genetic variance is likely to be due to variants conferring more moderate risks. To identify further susceptibility alleles, we conducted a two-stage genome-wide association study in 4,398 breast cancer cases and 4,316 controls, followed by a third stage in which 30 single nucleotide polymorphisms (SNPs) were tested for confirmation in 21,860 cases and 22,578 controls from 22 studies. We used 227,876 SNPs that were estimated to correlate with 77% of known common SNPs in Europeans at r2>0.5. SNPs in five novel independent loci exhibited strong and consistent evidence of association with breast cancer (P<10−7). Four of these contain plausible causative genes (FGFR2, TNRC9, MAP3K1 and LSP1). At the second stage, 1,792 SNPs were significant at the P<0.05 level compared with an estimated 1,343 that would be expected by chance, indicating that many additional common susceptibility alleles may be identifiable by this approach. PMID:17529967

  3. The mouse QTL map helps interpret human genome-wide association studies for HDL cholesterol.

    Science.gov (United States)

    Leduc, Magalie S; Lyons, Malcolm; Darvishi, Katayoon; Walsh, Kenneth; Sheehan, Susan; Amend, Sarah; Cox, Allison; Orho-Melander, Marju; Kathiresan, Sekar; Paigen, Beverly; Korstanje, Ron

    2011-06-01

    Genome-wide association (GWA) studies represent a powerful strategy for identifying susceptibility genes for complex diseases in human populations but results must be confirmed and replicated. Because of the close homology between mouse and human genomes, the mouse can be used to add evidence to genes suggested by human studies. We used the mouse quantitative trait loci (QTL) map to interpret results from a GWA study for genes associated with plasma HDL cholesterol levels. We first positioned single nucleotide polymorphisms (SNPs) from a human GWA study on the genomic map for mouse HDL QTL. We then used mouse bioinformatics, sequencing, and expression studies to add evidence for one well-known HDL gene (Abca1) and three newly identified genes (Galnt2, Wwox, and Cdh13), thus supporting the results of the human study. For GWA peaks that occur in human haplotype blocks with multiple genes, we examined the homologous regions in the mouse to prioritize the genes using expression, sequencing, and bioinformatics from the mouse model, showing that some genes were unlikely candidates and adding evidence for candidate genes Mvk and Mmab in one haplotype block and Fads1 and Fads2 in the second haplotype block. Our study highlights the value of mouse genetics for evaluating genes found in human GWA studies.

  4. A genome-wide scan for common alleles affecting risk for autism.

    Science.gov (United States)

    Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R; Correia, Catarina; Abrahams, Brett S; Sykes, Nuala; Pagnamenta, Alistair T; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R; Casallo, Guillermo; Casey, Jillian; Chu, Su H; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A; Folstein, Susan E; Fombonne, Eric; Freitag, Christine M; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J; Hakonarson, Hakon; Heron, Elizabeth A; Hill, Matthew; Holt, Richard; Howe, Jennifer L; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M; Lamb, Janine A; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L; Lionel, Anath C; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R; McConachie, Helen; McDougle, Christopher J; McGrath, Jane; McMahon, William M; Melhem, Nadine M; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J; Mirza, Ghazala K; Munson, Jeff; Nelson, Stanley F; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L; Bierut, Laura J; Rice, John P; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D; Cantor, Rita M; Cook, Edwin H; Coon, Hilary; Cuccaro, Michael L; Gallagher, Louise; Geschwind, Daniel H; Gill, Michael; Haines, Jonathan L; Miller, Judith; Monaco, Anthony P; Nurnberger, John I; Paterson, Andrew D; Pericak-Vance, Margaret A; Schellenberg, Gerard D; Scherer, Stephen W; Sutcliffe, James S; Szatmari, Peter; Vicente, Astrid M; Vieland, Veronica J; Wijsman, Ellen M; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim

    2010-10-15

    Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10(-8). When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10(-8) threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C.

  5. Multi-locus Test and Correction for Confounding Effects in Genome-Wide Association Studies.

    Science.gov (United States)

    Chen, Donglai; Liu, Chuanhai; Xie, Jun

    2016-11-01

    Genome-wide association studies (GWAS) examine a large number of genetic variants, e. g., single nucleotide polymorphisms (SNP), and associate them with a disease of interest. Traditional statistical methods for GWASs can produce spurious associations, due to limited information from individual SNPs and confounding effects. This paper develops two statistical methods to enhance data analysis of GWASs. The first is a multiple-SNP association test, which is a weighted chi-square test derived for big contingency tables. The test assesses combinatorial effects of multiple SNPs and improves conventional methods of single SNP analysis. The second is a method that corrects for confounding effects, which may come from population stratification as well as other ambiguous (unknown) factors. The proposed method identifies a latent confounding factor, using a profile of whole genome SNPs, and eliminates confounding effects through matching or stratified statistical analysis. Simulations and a GWAS of rheumatoid arthritis demonstrate that the proposed methods dramatically remove the number of significant tests, or false positives, and outperforms other available methods.

  6. BioSMACK: a linux live CD for genome-wide association analyses.

    Science.gov (United States)

    Hong, Chang Bum; Kim, Young Jin; Moon, Sanghoon; Shin, Young-Ah; Go, Min Jin; Kim, Dong-Joon; Lee, Jong-Young; Cho, Yoon Shin

    2012-01-01

    Recent advances in high-throughput genotyping technologies have enabled us to conduct a genome-wide association study (GWAS) on a large cohort. However, analyzing millions of single nucleotide polymorphisms (SNPs) is still a difficult task for researchers conducting a GWAS. Several difficulties such as compatibilities and dependencies are often encountered by researchers using analytical tools, during the installation of software. This is a huge obstacle to any research institute without computing facilities and specialists. Therefore, a proper research environment is an urgent need for researchers working on GWAS. We developed BioSMACK to provide a research environment for GWAS that requires no configuration and is easy to use. BioSMACK is based on the Ubuntu Live CD that offers a complete Linux-based operating system environment without installation. Moreover, we provide users with a GWAS manual consisting of a series of guidelines for GWAS and useful examples. BioSMACK is freely available at http://ksnp.cdc. go.kr/biosmack.

  7. Incorporating group correlations in genome-wide association studies using smoothed group Lasso.

    Science.gov (United States)

    Liu, Jin; Huang, Jian; Ma, Shuangge; Wang, Kai

    2013-04-01

    In genome-wide association studies, penalization is an important approach for identifying genetic markers associated with disease. Motivated by the fact that there exists natural grouping structure in single nucleotide polymorphisms and, more importantly, such groups are correlated, we propose a new penalization method for group variable selection which can properly accommodate the correlation between adjacent groups. This method is based on a combination of the group Lasso penalty and a quadratic penalty on the difference of regression coefficients of adjacent groups. The new method is referred to as smoothed group Lasso (SGL). It encourages group sparsity and smoothes regression coefficients for adjacent groups. Canonical correlations are applied to the weights between groups in the quadratic difference penalty. We first derive a GCD algorithm for computing the solution path with linear regression model. The SGL method is further extended to logistic regression for binary response. With the assistance of the majorize-minimization algorithm, the SGL penalized logistic regression turns out to be an iteratively penalized least-square problem. We also suggest conducting principal component analysis to reduce the dimensionality within groups. Simulation studies are used to evaluate the finite sample performance. Comparison with group Lasso shows that SGL is more effective in selecting true positives. Two datasets are analyzed using the SGL method.

  8. A genome-wide copy number variant study of suicidal behavior.

    Directory of Open Access Journals (Sweden)

    Jeffrey A Gross

    Full Text Available Suicide and suicide attempts are complex behaviors that result from the interaction of different factors, including genetic variants that increase the predisposition to suicidal behaviors. Copy number variations (CNVs are deletions or duplications of a segment of DNA usually larger than one kilobase. These structural genetic changes, although quite rare, have been associated with genetic liability to mental disorders, such as autism, schizophrenia, and bipolar disorder. No genome-wide level studies have been published investigating the potential role of CNVs in suicidal behaviors. Based on single-nucleotide polymorphism array data, we followed the Penn-CNV standards to detect CNVs in 1,608 subjects, comprising 475 suicide and suicide attempt cases and 1,133 controls. Although the initial algorithms determined the presence of CNVs on chromosomes 6 and 12 in seven and eight cases, respectively, compared with none of the controls, visual inspection of the raw data did not support this finding. Furthermore we were unable to validate these findings by CNV-specific real-time polymerase chain reaction. Additionally, rare CNV burden analysis did not find an association between the frequency or length of rare CNVs and suicidal behavior in our sample population. Although our findings suggest CNVs do not play an important role in the etiology of suicidal behaviors, they are not inconsistent with the strong evidence from the literature suggesting that other genetic variants account for a portion of the total phenotypic variability in suicidal behavior.

  9. Agronomic and seed quality traits dissected by genome-wide association mapping in Brassica napus

    Directory of Open Access Journals (Sweden)

    Niklas eKörber

    2016-03-01

    Full Text Available In Brassica napus breeding, traits related to commercial success are of highest importance for plant breeders. However, such traits can only be assessed in an advanced developmental stage. % as well as require high experimental effort due to their quantitative inheritance and the importance of genotype*environment interaction. Molecular markers genetically linked to such traits have the potential to accelerate the breeding process of B. napus by marker-assisted selection. Therefore, the objectives of this study were to identify (i genome regions associated with the examined agronomic and seed quality traits, (ii the interrelationship of population structure and the detected associations, and (iii candidate genes for the revealed associations. The diversity set used in this study consisted of 405 Brassica napus inbred lines which were genotyped using a 6K single nucleotide polymorphism (SNP array and phenotyped for agronomic and seed quality traits in field trials. In a genome-wide association study, we detected a total of 112 associations between SNPs and the seed quality traits as well as 46 SNP-trait associations for the agronomic traits with a P-value 100 and a sequence identity of > 70 % to A. thaliana or B. rapa could be found for the agronomic SNP-trait associations and 187 hits of potential candidate genes for the seed quality SNP-trait associations.

  10. Elastic-net regularization approaches for genome-wide association studies of rheumatoid arthritis.

    Science.gov (United States)

    Cho, Seoae; Kim, Haseong; Oh, Sohee; Kim, Kyunga; Park, Taesung

    2009-12-15

    The current trend in genome-wide association studies is to identify regions where the true disease-causing genes may lie by evaluating thousands of single-nucleotide polymorphisms (SNPs) across the whole genome. However, many challenges exist in detecting disease-causing genes among the thousands of SNPs. Examples include multicollinearity and multiple testing issues, especially when a large number of correlated SNPs are simultaneously tested. Multicollinearity can often occur when predictor variables in a multiple regression model are highly correlated, and can cause imprecise estimation of association. In this study, we propose a simple stepwise procedure that identifies disease-causing SNPs simultaneously by employing elastic-net regularization, a variable selection method that allows one to address multicollinearity. At Step 1, the single-marker association analysis was conducted to screen SNPs. At Step 2, the multiple-marker association was scanned based on the elastic-net regularization. The proposed approach was applied to the rheumatoid arthritis (RA) case-control data set of Genetic Analysis Workshop 16. While the selected SNPs at the screening step are located mostly on chromosome 6, the elastic-net approach identified putative RA-related SNPs on other chromosomes in an increased proportion. For some of those putative RA-related SNPs, we identified the interactions with sex, a well known factor affecting RA susceptibility.

  11. Genome-wide association study identifies new prostate cancer susceptibility loci

    Science.gov (United States)

    Schumacher, Fredrick R.; Berndt, Sonja I.; Siddiq, Afshan; Jacobs, Kevin B.; Wang, Zhaoming; Lindstrom, Sara; Stevens, Victoria L.; Chen, Constance; Mondul, Alison M.; Travis, Ruth C.; Stram, Daniel O.; Eeles, Rosalind A.; Easton, Douglas F.; Giles, Graham; Hopper, John L.; Neal, David E.; Hamdy, Freddie C.; Donovan, Jenny L.; Muir, Kenneth; Al Olama, Ali Amin; Kote-Jarai, Zsofia; Guy, Michelle; Severi, Gianluca; Grönberg, Henrik; Isaacs, William B.; Karlsson, Robert; Wiklund, Fredrik; Xu, Jianfeng; Allen, Naomi E.; Andriole, Gerald L.; Barricarte, Aurelio; Boeing, Heiner; Bas Bueno-de-Mesquita, H.; Crawford, E. David; Diver, W. Ryan; Gonzalez, Carlos A.; Gaziano, J. Michael; Giovannucci, Edward L.; Johansson, Mattias; Le Marchand, Loic; Ma, Jing; Sieri, Sabina; Stattin, Pär; Stampfer, Meir J.; Tjonneland, Anne; Vineis, Paolo; Virtamo, Jarmo; Vogel, Ulla; Weinstein, Stephanie J.; Yeager, Meredith; Thun, Michael J.; Kolonel, Laurence N.; Henderson, Brian E.; Albanes, Demetrius; Hayes, Richard B.; Spencer Feigelson, Heather; Riboli, Elio; Hunter, David J.; Chanock, Stephen J.; Haiman, Christopher A.; Kraft, Peter

    2011-01-01

    Prostate cancer (PrCa) is the most common non-skin cancer diagnosed among males in developed countries and the second leading cause of cancer mortality, yet little is known regarding its etiology and factors that influence clinical outcome. Genome-wide association studies (GWAS) of PrCa have identified at least 30 distinct loci associated with small differences in risk. We conducted a GWAS in 2782 advanced PrCa cases (Gleason grade ≥ 8 or tumor stage C/D) and 4458 controls with 571 243 single nucleotide polymorphisms (SNPs). Based on in silico replication of 4679 SNPs (Stage 1, P < 0.02) in two published GWAS with 7358 PrCa cases and 6732 controls, we identified a new susceptibility locus associated with overall PrCa risk at 2q37.3 (rs2292884, P= 4.3 × 10−8). We also confirmed a locus suggested by an earlier GWAS at 12q13 (rs902774, P= 8.6 × 10−9). The estimated per-allele odds ratios for these loci (1.14 for rs2292884 and 1.17 for rs902774) did not differ between advanced and non-advanced PrCa (case-only test for heterogeneity P= 0.72 and P= 0.61, respectively). Further studies will be needed to assess whether these or other loci are differentially associated with PrCa subtypes. PMID:21743057

  12. A genome-wide scan for common alleles affecting risk for autism.

    LENUS (Irish Health Repository)

    Anney, Richard

    2010-10-15

    Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10(-8). When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner\\'s curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10(-8) threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C.

  13. A genome-wide scan for common alleles affecting risk for autism

    Science.gov (United States)

    Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R.; Correia, Catarina; Abrahams, Brett S.; Sykes, Nuala; Pagnamenta, Alistair T.; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J.; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F.; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R.; Casallo, Guillermo; Casey, Jillian; Chu, Su H.; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L.; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A.; Folstein, Susan E.; Fombonne, Eric; Freitag, Christine M.; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T.; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J.; Hakonarson, Hakon; Heron, Elizabeth A.; Hill, Matthew; Holt, Richard; Howe, Jennifer L.; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M.; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M.; Lamb, Janine A.; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L.; Lionel, Anath C.; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C.; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R.; McConachie, Helen; McDougle, Christopher J.; McGrath, Jane; McMahon, William M.; Melhem, Nadine M.; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J.; Mirza, Ghazala K.; Munson, Jeff; Nelson, Stanley F.; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R.; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L.; Bierut, Laura J.; Rice, John P.; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C.; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P.; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B.; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H.; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L.; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D.; Cantor, Rita M.; Cook, Edwin H.; Coon, Hilary; Cuccaro, Michael L.; Gallagher, Louise; Geschwind, Daniel H.; Gill, Michael; Haines, Jonathan L.; Miller, Judith; Monaco, Anthony P.; Nurnberger, John I.; Paterson, Andrew D.; Pericak-Vance, Margaret A.; Schellenberg, Gerard D.; Scherer, Stephen W.; Sutcliffe, James S.; Szatmari, Peter; Vicente, Astrid M.; Vieland, Veronica J.; Wijsman, Ellen M.; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim

    2010-01-01

    Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10−8. When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10−8 threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C. PMID:20663923

  14. Neuropsychological effects of the CSMD1 genome-wide associated schizophrenia risk variant rs10503253.

    LENUS (Irish Health Repository)

    Donohoe, G

    2013-03-01

    The single-nucleotide polymorphism (SNP) rs10503253, located within the CUB and Sushi multiple domains-1 (CSMD1) gene on 8p23.2, was recently identified as genome-wide significant for schizophrenia (SZ), but is of unknown function. We investigated the neurocognitive effects of this CSMD1 variant in vivo in patients and healthy participants using behavioral and imaging measures of brain structure and function. We compared carriers and non-carriers of the risk \\'A\\' allele on measures of neuropsychological performance typically impaired in SZ (general cognitive ability, episodic and working memory and attentional control) in independent samples of Irish patients (n = 387) and controls (n = 171) and German patients (205) and controls (n = 533). Across these groups, the risk \\'A\\' allele at CSMD1 was associated with deleterious effects across a number of neurocognitive phenotypes. Specifically, the risk allele was associated with poorer performance on neuropsychological measures of general cognitive ability and memory function but not attentional control. These effects, while significant, were subtle, and varied between samples. Consistent with previous evidence suggesting that CSMD1 may be involved in brain mechanisms related to memory and learning, these data appear to reflect the deleterious effects of the identified \\'A\\' risk allele on neurocognitive function, possibly as part of the mechanism by which CSMD1 is associated with SZ risk.

  15. Neural effects of the CSMD1 genome-wide associated schizophrenia risk variant rs10503253.

    LENUS (Irish Health Repository)

    Rose, Emma J

    2013-09-01

    The single nucleotide polymorphism rs10503253 within the CUB and Sushi multiple domains-1 (CSMD1) gene on 8p23.2 has been identified as genome-wide significant for schizophrenia (SZ). This gene is of unknown function but has been implicated in multiple neurodevelopmental disorders that impact upon cognition, leading us to hypothesize that an effect on brain structure and function underlying cognitive processes may be part of the mechanism by which CMSD1 increases illness risk. To test this hypothesis, we investigated this CSMD1 variant in vivo in healthy participants in a magnetic resonance imaging (MRI) study comprised of both fMRI of spatial working memory (N = 50) and a voxel-based morphometry investigation of grey and white matter (WM) volume (N = 150). Analyses of these data indicated that the risk "A" allele was associated with comparatively reduced cortical activations in BA18, that is, middle occipital gyrus and cuneus; posterior brain regions that support maintenance processes during performance of a spatial working memory task. Conversely, there was an absence of significant structural differences in brain volume (i.e., grey or WM). In accordance with previous evidence, these data suggest that CSMD1 may mediate brain function related to cognitive processes (i.e., executive function); with the relatively deleterious effects of the identified "A" risk allele on brain activity possibly constituting part of the mechanism by which CSMD1 increases schizophrenia risk.

  16. Biallelic and Genome Wide Association Mapping of Germanium Tolerant Loci in Rice (Oryza sativa L..

    Directory of Open Access Journals (Sweden)

    Partha Talukdar

    Full Text Available Rice plants accumulate high concentrations of silicon. Silicon has been shown to be involved in plant growth, high yield, and mitigating biotic and abiotic stresses. However, it has been demonstrated that inorganic arsenic is taken up by rice through silicon transporters under anaerobic conditions, thus the ability to efficiently take up silicon may be considered either a positive or a negative trait in rice. Germanium is an analogue of silicon that produces brown lesions in shoots and leaves, and germanium toxicity has been used to identify mutants in silicon and arsenic transport. In this study, two different genetic mapping methods were performed to determine the loci involved in germanium sensitivity in rice. Genetic mapping in the biparental cross of Bala × Azucena (an F6 population and a genome wide association (GWA study with 350 accessions from the Rice Diversity Panel 1 were conducted using 15 μM of germanic acid. This identified a number of germanium sensitive loci: some co-localised with previously identified quantitative trait loci (QTL for tissue silicon or arsenic concentration, none co-localised with Lsi1 or Lsi6, while one single nucleotide polymorphism (SNP was detected within 200 kb of Lsi2 (these are genes known to transport silicon, whose identity was discovered using germanium toxicity. However, examining candidate genes that are within the genomic region of the loci detected above reveals genes homologous to both Lsi1 and Lsi2, as well as a number of other candidate genes, which are discussed.

  17. Pseudo-Seq: Genome-Wide Detection of Pseudouridine Modifications in RNA.

    Science.gov (United States)

    Carlile, Thomas M; Rojas-Duran, Maria F; Gilbert, Wendy V

    2015-01-01

    RNA molecules contain a variety of chemically diverse, posttranscriptionally modified bases. The most abundant modified base found in cellular RNAs, pseudouridine (Ψ), has recently been mapped to hundreds of sites in mRNAs, many of which are dynamically regulated. Though the pseudouridine landscape has been determined in only a few cell types and growth conditions, the enzymes responsible for mRNA pseudouridylation are universally conserved, suggesting many novel pseudouridylated sites remain to be discovered. Here, we present Pseudo-seq, a technique that allows the identification of sites of pseudouridylation genome-wide with single-nucleotide resolution. In this chapter, we provide a detailed description of Pseudo-seq. We include protocols for RNA isolation from Saccharomyces cerevisiae, Pseudo-seq library preparation, and data analysis, including descriptions of processing and mapping of sequencing reads, computational identification of sites of pseudouridylation, and assignment of sites to specific pseudouridine synthases. The approach presented here is readily adaptable to any cell or tissue type from which high-quality mRNA can be isolated. Identification of novel pseudouridylation sites is an important first step in elucidating the regulation and functions of these modifications.

  18. A genome-wide, fine-scale map of natural pigmentation variation in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Héloïse Bastide

    2013-06-01

    Full Text Available Various approaches can be applied to uncover the genetic basis of natural phenotypic variation, each with their specific strengths and limitations. Here, we use a replicated genome-wide association approach (Pool-GWAS to fine-scale map genomic regions contributing to natural variation in female abdominal pigmentation in Drosophila melanogaster, a trait that is highly variable in natural populations and highly heritable in the laboratory. We examined abdominal pigmentation phenotypes in approximately 8000 female European D. melanogaster, isolating 1000 individuals with extreme phenotypes. We then used whole-genome Illumina sequencing to identify single nucleotide polymorphisms (SNPs segregating in our sample, and tested these for associations with pigmentation by contrasting allele frequencies between replicate pools of light and dark individuals. We identify two small regions near the pigmentation genes tan and bric-à-brac 1, both corresponding to known cis-regulatory regions, which contain SNPs showing significant associations with pigmentation variation. While the Pool-GWAS approach suffers some limitations, its cost advantage facilitates replication and it can be applied to any non-model system with an available reference genome.

  19. Lessons from Genome-Wide Association Studies in Reproductive Medicine: Menopause.

    Science.gov (United States)

    Ruth, Katherine S; Murray, Anna

    2016-07-01

    In recent years, common genetic variants have been identified by genome-wide association studies (GWASs) that have led to the detection of 44 genetic loci associated with approximately 6% of common variation in age at natural menopause. In the latest GWAS, doubling the sample size to approximately 70,000 women more than doubled the number of signals identified, from 17 to 56. In addition, low-frequency coding variants (highlighting the importance of this pathway in determining oocyte reserve. In addition, GWAS demonstrates that the hypothalamic-pituitary axis is involved in menopause timing as well as puberty timing, showing the first genetic link between timing of the start and end of reproductive life. Genetic variants have been used to explore the causal relationships between menopause timing and breast cancer. These studies demonstrate that for a 1 year increase in menopause age, there is a 6% increase in breast cancer risk, a value approximately double the estimate from epidemiological studies. Prolonged exposure to estrogen during reproductive life is the likely mechanism, rather than a direct effect of DDR variants on cancer risk. Further work is needed to determine the mechanism for the effect of each variant identified by GWAS and more variants will undoubtedly be discovered as sample sizes increase, denser single nucleotide polymorphism arrays and reference genomes are used, and populations from diverse ethnic groups are studied.

  20. A genome-wide association analysis for susceptibility of pigs to enterotoxigenic Escherichia coli F41.

    Science.gov (United States)

    Ji, H Y; Yang, B; Zhang, Z Y; Ouyang, J; Yang, M; Zhang, X F; Zhang, W C; Su, Y; Zhao, K W; Xiao, S J; Yan, X M; Ren, J; Huang, L S

    2016-10-01

    Enterotoxigenic Escherichia coli (ETEC) is a type of pathogenic bacteria that cause diarrhea in piglets through colonizing pig small intestine epithelial cells by their surface fimbriae. Different fimbriae type of ETEC including F4, F18, K99 and F41 have been isolated from diarrheal pigs. In this study, we performed a genome-wide association study to map the loci associated with the susceptibility of pigs to ETEC F41 using 39454 single nucleotide polymorphisms (SNPs) in 667 F2 pigs from a White Duroc×Erhualian F2 cross. The most significant SNP (ALGA0022658, P=5.59×10-13) located at 6.95 Mb on chromosome 4. ALGA0022658 was in high linkage disequilibrium (r 2>0.5) with surrounding SNPs that span a 1.21 Mb interval. Within this 1.21 Mb region, we investigated ZFAT as a positional candidate gene. We re-sequenced cDNA of ZFAT in four pigs with different susceptibility phenotypes, and identified seven coding variants. We genotyped these seven variants in 287 unrelated pigs from 15 diverse breeds that were measured with ETEC F41 susceptibility phenotype. Five variants showed nominal significant association (P<0.05) with ETEC F41 susceptibility phenotype in International commercial pigs. This study provided refined region associated with susceptibility of pigs to ETEC F41 than that reported previously. Further works are needed to uncover the underlying causal mutation(s).

  1. Validating Genome-Wide Association Candidates Controlling Quantitative Variation in Nodulation1[OPEN

    Science.gov (United States)

    Tiffin, Peter; Guhlin, Joseph; Atkins, Paul; Baltes, Nicholas J.; Denny, Roxanne

    2017-01-01

    Genome-wide association (GWA) studies offer the opportunity to identify genes that contribute to naturally occurring variation in quantitative traits. However, GWA relies exclusively on statistical association, so functional validation is necessary to make strong claims about gene function. We used a combination of gene-disruption platforms (Tnt1 retrotransposons, hairpin RNA-interference constructs, and CRISPR/Cas9 nucleases) together with randomized, well-replicated experiments to evaluate the function of genes that an earlier GWA study in Medicago truncatula had identified as candidates contributing to variation in the symbiosis between legumes and rhizobia. We evaluated ten candidate genes found in six clusters of strongly associated single nucleotide polymorphisms, selected on the basis of their strength of statistical association, proximity to annotated gene models, and root or nodule expression. We found statistically significant effects on nodule production for three candidate genes, each validated in two independent mutants. Annotated functions of these three genes suggest their contributions to quantitative variation in nodule production occur through processes not previously connected to nodulation, including phosphorous supply and salicylic acid-related defense response. These results demonstrate the utility of GWA combined with reverse mutagenesis technologies to discover and validate genes contributing to naturally occurring variation in quantitative traits. The results highlight the potential for GWA to complement forward genetics in identifying the genetic basis of ecologically and economically important traits. PMID:28057894

  2. Genome-wide association study of systemic sclerosis identifies CD247 as a novel susceptibility locus

    Science.gov (United States)

    Radstake, Timothy R.D.J.; Gorlova, Olga; Rueda, Blanca; Martin, Jose-Ezequiel; Alizadeh, Behrooz Z.; Palomino-Morales, Rogelio; Coenen, Marieke J.; Vonk, Madelon C.; Voskuyl, Alexandre E.; Scheurwegh, Annemie J.; Broen, Jasper C.; van Riel, Piet L.C.M.; van ‘t Slot, Ruben; Italiaander, Annet; Ophoff, Roel A.; Riemekasten, Gabriela; Hunzelmann, Nico; Simeon, Carmen P.; Ortego-Centeno, Norberto; González-Gay, Miguel A.; González-Escribano, María F.; Airo, Paolo; van Laar, Jaap; Herrick, Ariane; Worthington, Jane; Hesselstrand, Roger; Smith, Vanessa; de Keyser, Filip; Houssiau, Fredric; Chee, Meng May; Madhok, R; Shiels, Paul; Westhovens, Rene; Kreuter, Alexander; Kiener, Hans; de Baere, Elfride; Witte, Torsten; Padykov, Leonid; Klareskog, Lars; Beretta, Lorenzo; Scorza, Rafaella; Lie, Benedicte A.; Hoffman-Vold, Anna-Maria; Carreira, P; Varga, J.; Hinchcliff, M.; Gregersen, Peter; Lee, Annette T.; Ying, Jun; Han, Younghun; Weng, Shih-Feng; Amos, Christopher I.; Wigley, Fredrick M.; Hummers, Laura; Nelson, J. Lee; Agarwal, Sandeep K.; Assassi, Shervin; Gourh, Pravitt; Tan, Filemon K.; Koeleman, Bobby P.C.; Arnett, Frank C; Martin, Javier; Mayes, Maureen D.

    2010-01-01

    Systemic sclerosis (SSc) is an autoimmune disease characterized by fibrosis of the skin and internal organs that leads to profound disability and premature death. To identify novel SSc susceptibility loci we conducted the first genome wide association study (GWAS) in a population of Caucasian ancestry including a total of 2296 SSc patients and 5171 controls. Analysis of 279,621 autosomal single nucleotide polymorphisms (SNPs) followed by replication testing in an independent case-control set of European ancestry (2,753 SSc patients / 4,569 controls) identified a new susceptibility locus for systemic sclerosis at CD247 (1q22-23; rs2056626, P = 2.09 × 10−7 in the discovery samples, P = 3.39 × 10−9 in the combined analysis). Additionally, we confirm and firmly establish the role of MHC (2.31 × 10−18), IRF5 (P =1.86 × 10−13) and STAT4 (P =3.37 × 10−9) gene regions as SSc genetic risk factors. PMID:20383147

  3. A multi-stage genome-wide association study of bladder cancer identifies multiple susceptibility loci

    Science.gov (United States)

    Rothman, Nathaniel; Garcia-Closas, Montserrat; Chatterjee, Nilanjan; Malats, Nuria; Wu, Xifeng; Figueroa, Jonine; Real, Francisco X; Van Den Berg, David; Matullo, Giuseppe; Baris, Dalsu; Thun, Michael; Kiemeney, Lambertus A; Vineis, Paolo; De Vivo, Immaculata; Albanes, Demetrius; Purdue, Mark P; Rafnar, Thorunn; Hildebrandt, Michelle A T; Kiltie, Anne E; Cussenot, Olivier; Golka, Klaus; Kumar, Rajiv; Taylor, Jack A; Mayordomo, Jose I; Jacobs, Kevin B; Kogevinas, Manolis; Hutchinson, Amy; Wang, Zhaoming; Fu, Yi-Ping; Prokunina-Olsson, Ludmila; Burdette, Laurie; Yeager, Meredith; Wheeler, William; Tardón, Adonina; Serra, Consol; Carrato, Alfredo; García-Closas, Reina; Lloreta, Josep; Johnson, Alison; Schwenn, Molly; Karagas, Margaret R; Schned, Alan; Andriole, Gerald; Grubb, Robert; Black, Amanda; Jacobs, Eric J; Diver, W Ryan; Gapstur, Susan M; Weinstein, Stephanie J; Virtamo, Jarmo; Cortessis, Victoria K; Gago-Dominguez, Manuela; Pike, Malcolm C; Stern, Mariana C; Yuan, Jian-Min; Hunter, David; McGrath, Monica; Dinney, Colin P; Czerniak, Bogdan; Chen, Meng; Yang, Hushan; Vermeulen, Sita H; Aben, Katja K; Witjes, J Alfred; Makkinje, Remco R; Sulem, Patrick; Besenbacher, Soren; Stefansson, Kari; Riboli, Elio; Brennan, Paul; Panico, Salvatore; Navarro, Carmen; Allen, Naomi E; Bueno-de-Mesquita, H Bas; Trichopoulos, Dimitrios; Caporaso, Neil; Landi, Maria Teresa; Canzian, Federico; Ljungberg, Borje; Tjonneland, Anne; Clavel-Chapelon, Francoise; Bishop, David T; Teo, Mark T W; Knowles, Margaret A; Guarrera, Simonetta; Polidoro, Silvia; Ricceri, Fulvio; Sacerdote, Carlotta; Allione, Alessandra; Cancel-Tassin, Geraldine; Selinski, Silvia; Hengstler, Jan G; Dietrich, Holger; Fletcher, Tony; Rudnai, Peter; Gurzau, Eugen; Koppova, Kvetoslava; Bolick, Sophia C E; Godfrey, Ashley; Xu, Zongli; Sanz-Velez, José I; García-Prats, María D; Sanchez, Manuel; Valdivia, Gabriel; Porru, Stefano; Benhamou, Simone; Hoover, Robert N; Fraumeni, Joseph F; Silverman, Debra T; Chanock, Stephen J

    2010-01-01

    We conducted a multi-stage, genome-wide association study (GWAS) of bladder cancer with a primary scan of 589,299 single nucleotide polymorphisms (SNPs) in 3,532 cases and 5,120 controls of European descent (5 studies) followed by a replication strategy, which included 8,381 cases and 48,275 controls (16 studies). In a combined analysis, we identified three new regions associated with bladder cancer on chromosomes 22q13.1, 19q12 and 2q37.1; rs1014971, (P=8×10−12) maps to a non-genic region of chromosome 22q13.1; rs8102137 (P=2×10−11) on 19q12 maps to CCNE1; and rs11892031 (P=1×10−7) maps to the UGT1A cluster on 2q37.1. We confirmed four previous GWAS associations on chromosomes 3q28, 4p16.3, 8q24.21 and 8q24.3, validated previous candidate associations for the GSTM1 deletion (P=4×10−11) and a tag SNP for NAT2 acetylation status (P=4×10−11), as well as demonstrated smoking interactions with both regions. Our findings on common variants associated with bladder cancer risk should provide new insights into mechanisms of carcinogenesis. PMID:20972438

  4. Genome-Wide Pathway Analysis Identifies Genetic Pathways Associated with Psoriasis.

    Science.gov (United States)

    Aterido, Adrià; Julià, Antonio; Ferrándiz, Carlos; Puig, Lluís; Fonseca, Eduardo; Fernández-López, Emilia; Dauden, Esteban; Sánchez-Carazo, José Luís; López-Estebaranz, José Luís; Moreno-Ramírez, David; Vanaclocha, Francisco; Herrera, Enrique; de la Cueva, Pablo; Dand, Nick; Palau, Núria; Alonso, Arnald; López-Lasanta, María; Tortosa, Raül; García-Montero, Andrés; Codó, Laia; Gelpí, Josep Lluís; Bertranpetit, Jaume; Absher, Devin; Capon, Francesca; Myers, Richard M; Barker, Jonathan N; Marsal, Sara

    2016-03-01

    Psoriasis is a chronic inflammatory disease with a complex genetic architecture. To date, the psoriasis heritability is only partially explained. However, there is increasing evidence that the missing heritability in psoriasis could be explained by multiple genetic variants of low effect size from common genetic pathways. The objective of this study was to identify new genetic variation associated with psoriasis risk at the pathway level. We genotyped 598,258 single nucleotide polymorphisms in a discovery cohort of 2,281 case-control individuals from Spain. We performed a genome-wide pathway analysis using 1,053 reference biological pathways. A total of 14 genetic pathways (PFDR ≤ 2.55 × 10(-2)) were found to be significantly associated with psoriasis risk. Using an independent validation cohort of 7,353 individuals from the UK, a total of 6 genetic pathways were significantly replicated (PFDR ≤ 3.46 × 10(-2)). We found genetic pathways that had not been previously associated with psoriasis risk such as retinol metabolism (Pcombined = 1.84 × 10(-4)), the transport of inorganic ions and amino acids (Pcombined = 1.57 × 10(-7)), and post-translational protein modification (Pcombined = 1.57 × 10(-7)). In the latter pathway, MGAT5 showed a strong network centrality, and its association with psoriasis risk was further validated in an additional case-control cohort of 3,429 individuals (P psoriasis susceptibility.

  5. SNPpy--database management for SNP data from genome wide association studies.

    Directory of Open Access Journals (Sweden)

    Faheem Mitha

    Full Text Available BACKGROUND: We describe SNPpy, a hybrid script database system using the Python SQLAlchemy library coupled with the PostgreSQL database to manage genotype data from Genome-Wide Association Studies (GWAS. This system makes it possible to merge study data with HapMap data and merge across studies for meta-analyses, including data filtering based on the values of phenotype and Single-Nucleotide Polymorphism (SNP data. SNPpy and its dependencies are open source software. RESULTS: The current version of SNPpy offers utility functions to import genotype and annotation data from two commercial platforms. We use these to import data from two GWAS studies and the HapMap Project. We then export these individual datasets to standard data format files that can be imported into statistical software for downstream analyses. CONCLUSIONS: By leveraging the power of relational databases, SNPpy offers integrated management and manipulation of genotype and phenotype data from GWAS studies. The analysis of these studies requires merging across GWAS datasets as well as patient and marker selection. To this end, SNPpy enables the user to filter the data and output the results as standardized GWAS file formats. It does low level and flexible data validation, including validation of patient data. SNPpy is a practical and extensible solution for investigators who seek to deploy central management of their GWAS data.

  6. Genome-wide association study of corticobasal degeneration identifies risk variants shared with progressive supranuclear palsy

    Science.gov (United States)

    Kouri, Naomi; Ross, Owen A.; Dombroski, Beth; Younkin, Curtis S.; Serie, Daniel J.; Soto-Ortolaza, Alexandra; Baker, Matthew; Finch, Ni Cole A.; Yoon, Hyejin; Kim, Jungsu; Fujioka, Shinsuke; McLean, Catriona A.; Ghetti, Bernardino; Spina, Salvatore; Cantwell, Laura B.; Farlow, Martin R.; Grafman, Jordan; Huey, Edward D.; Ryung Han, Mi; Beecher, Sherry; Geller, Evan T.; Kretzschmar, Hans A.; Roeber, Sigrun; Gearing, Marla; Juncos, Jorge L.; Vonsattel, Jean Paul G.; Van Deerlin, Vivianna M.; Grossman, Murray; Hurtig, Howard I.; Gross, Rachel G.; Arnold, Steven E.; Trojanowski, John Q.; Lee, Virginia M.; Wenning, Gregor K.; White, Charles L.; Höglinger, Günter U.; Müller, Ulrich; Devlin, Bernie; Golbe, Lawrence I.; Crook, Julia; Parisi, Joseph E.; Boeve, Bradley F.; Josephs, Keith A.; Wszolek, Zbigniew K.; Uitti, Ryan J.; Graff-Radford, Neill R.; Litvan, Irene; Younkin, Steven G.; Wang, Li-San; Ertekin-Taner, Nilüfer; Rademakers, Rosa; Hakonarsen, Hakon; Schellenberg, Gerard D.; Dickson, Dennis W.

    2015-01-01

    Corticobasal degeneration (CBD) is a neurodegenerative disorder affecting movement and cognition, definitively diagnosed only at autopsy. Here, we conduct a genome-wide association study (GWAS) in CBD cases (n=152) and 3,311 controls, and 67 CBD cases and 439 controls in a replication stage. Associations with meta-analysis were 17q21 at MAPT (P=1.42 × 10−12), 8p12 at lnc-KIF13B-1, a long non-coding RNA (rs643472; P=3.41 × 10−8), and 2p22 at SOS1 (rs963731; P=1.76 × 10−7). Testing for association of CBD with top progressive supranuclear palsy (PSP) GWAS single-nucleotide polymorphisms (SNPs) identified associations at MOBP (3p22; rs1768208; P=2.07 × 10−7) and MAPT H1c (17q21; rs242557; P=7.91 × 10−6). We previously reported SNP/transcript level associations with rs8070723/MAPT, rs242557/MAPT, and rs1768208/MOBP and herein identified association with rs963731/SOS1. We identify new CBD susceptibility loci and show that CBD and PSP share a genetic risk factor other than MAPT at 3p22 MOBP (myelin-associated oligodendrocyte basic protein). PMID:26077951

  7. Genome-wide profiling of genetic variation in Agrobacterium-transformed rice plants*#

    Science.gov (United States)

    Li, Wen-xu; Wu, San-ling; Liu, Yan-hua; Jin, Gu-lei; Zhao, Hai-jun; Fan, Long-jiang; Shu, Qing-yao

    2016-01-01

    Agrobacterium-mediated transformation has been widely used in producing transgenic plants, and was recently used to generate “transgene-clean” targeted genomic modifications coupled with the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas9) system. Although tremendous variation in morphological and agronomic traits, such as plant height, seed fertility, and grain size, was observed in transgenic plants, the underlying mechanisms are not yet well understood, and the types and frequency of genetic variation in transformed plants have not been fully disclosed. To reveal the genome-wide variation in transformed plants, we sequenced the genomes of five independent T0 rice plants using next-generation sequencing (NGS) techniques. Bioinformatics analyses followed by experimental validation revealed the following: (1) in addition to transfer-DNA (T-DNA) insertions, three transformed plants carried heritable plasmid backbone DNA of variable sizes (855–5216 bp) and in different configurations with the T-DNA insertions (linked or apart); (2) each transgenic plant contained an estimated 338–1774 independent genetic variations (single nucleotide variations (SNVs) or small insertion/deletions); and (3) 2–6 new Tos17 insertions were detected in each transformed plant, but no other transposable elements or bacterial genomic DNA. PMID:27921404

  8. Exploring genome-wide - dietary heme iron intake interactions and the risk of type 2 diabetes

    Directory of Open Access Journals (Sweden)

    Louis Robert Pasquale

    2013-01-01

    Full Text Available Aims/hypothesis: Genome-wide association studies have identified over 50 new genetic loci for type 2 diabetes (T2D. Several studies conclude that higher dietary heme iron intake increases the risk of T2D. Therefore we assessed whether the relation between genetic loci and type 2 diabetes is modified by dietary heme iron intake. Methods: We used Affymetrix Genome-Wide Human 6.0 array data (681,770 single nucleotide polymorphisms (SNPs and dietary information collected in the Health Professionals Follow-up Study (n=725 cases; n=1,273 controls and the Nurses’ Health Study (n=1,081 cases; n=1,692 controls. We assessed whether genome-wide SNPs or iron metabolism SNPs interacted with dietary heme iron intake in relation to T2D, testing for associations in each cohort separately and then meta-analyzing to pool the results. Finally, we created 1,000 synthetic pathways matched to an iron metabolism pathway on number of genes, and number of SNPs in each gene. We compared the iron metabolic pathway SNPs with these synthetic SNP assemblies in their relation to T2D to assess if the pathway as a whole interacts with dietary heme iron intake.Results: Using a genomic approach, we found no significant gene-environment interactions with dietary heme iron intake in relation to T2D (top SNP in pooled analysis: intergenic rs10980508; p=1.03E-06 > Bonferroni corrected p=7.33E-08. Furthermore, no SNP in the iron metabolic pathway significantly interacted with dietary heme iron intake (top SNP in pooled analysis: rs1805313; p=1.14E-03 > Bonferroni corrected p=2.10E-04. Finally, neither the main genetic effects (pooled empirical p by SNP=0.41, nor gene – dietary heme-iron interactions (pooled empirical p value for the interactions=0.72 were significant for the iron metabolic pathway as a whole. Conclusions: We found no significant interactions between dietary heme iron intake and common SNPs in relation to T2D.

  9. Genome-wide association studies of the PR interval in African Americans.

    Directory of Open Access Journals (Sweden)

    J Gustav Smith

    Full Text Available The PR interval on the electrocardiogram reflects atrial and atrioventricular nodal conduction time. The PR interval is heritable, provides important information about arrhythmia risk, and has been suggested to differ among human races. Genome-wide association (GWA studies have identified common genetic determinants of the PR interval in individuals of European and Asian ancestry, but there is a general paucity of GWA studies in individuals of African ancestry. We performed GWA studies in African American individuals from four cohorts (n = 6,247 to identify genetic variants associated with PR interval duration. Genotyping was performed using the Affymetrix 6.0 microarray. Imputation was performed for 2.8 million single nucleotide polymorphisms (SNPs using combined YRI and CEU HapMap phase II panels. We observed a strong signal (rs3922844 within the gene encoding the cardiac sodium channel (SCN5A with genome-wide significant association (p<2.5 x 10⁻⁸ in two of the four cohorts and in the meta-analysis. The signal explained 2% of PR interval variability in African Americans (beta  = 5.1 msec per minor allele, 95% CI  = 4.1-6.1, p = 3 x 10⁻²³. This SNP was also associated with PR interval (beta = 2.4 msec per minor allele, 95% CI = 1.8-3.0, p = 3 x 10⁻¹⁶ in individuals of European ancestry (n = 14,042, but with a smaller effect size (p for heterogeneity <0.001 and variability explained (0.5%. Further meta-analysis of the four cohorts identified genome-wide significant associations with SNPs in SCN10A (rs6798015, MEIS1 (rs10865355, and TBX5 (rs7312625 that were highly correlated with SNPs identified in European and Asian GWA studies. African ancestry was associated with increased PR duration (13.3 msec, p = 0.009 in one but not the other three cohorts. Our findings demonstrate the relevance of common variants to African Americans at four loci previously associated with PR interval in European and

  10. Genome-wide association of lipid-lowering response to statins in combined study populations.

    Directory of Open Access Journals (Sweden)

    Mathew J Barber

    Full Text Available BACKGROUND: Statins effectively lower total and plasma LDL-cholesterol, but the magnitude of decrease varies among individuals. To identify single nucleotide polymorphisms (SNPs contributing to this variation, we performed a combined analysis of genome-wide association (GWA results from three trials of statin efficacy. METHODS AND PRINCIPAL FINDINGS: Bayesian and standard frequentist association analyses were performed on untreated and statin-mediated changes in LDL-cholesterol, total cholesterol, HDL-cholesterol, and triglyceride on a total of 3932 subjects using data from three studies: Cholesterol and Pharmacogenetics (40 mg/day simvastatin, 6 weeks, Pravastatin/Inflammation CRP Evaluation (40 mg/day pravastatin, 24 weeks, and Treating to New Targets (10 mg/day atorvastatin, 8 weeks. Genotype imputation was used to maximize genomic coverage and to combine information across studies. Phenotypes were normalized within each study to account for systematic differences among studies, and fixed-effects combined analysis of the combined sample were performed to detect consistent effects across studies. Two SNP associations were assessed as having posterior probability greater than 50%, indicating that they were more likely than not to be genuinely associated with statin-mediated lipid response. SNP rs8014194, located within the CLMN gene on chromosome 14, was strongly associated with statin-mediated change in total cholesterol with an 84% probability by Bayesian analysis, and a p-value exceeding conventional levels of genome-wide significance by frequentist analysis (P = 1.8 x 10(-8. This SNP was less significantly associated with change in LDL-cholesterol (posterior probability = 0.16, P = 4.0 x 10(-6. Bayesian analysis also assigned a 51% probability that rs4420638, located in APOC1 and near APOE, was associated with change in LDL-cholesterol. CONCLUSIONS AND SIGNIFICANCE: Using combined GWA analysis from three clinical trials involving nearly 4

  11. Genome-wide association study of tick resistance in South African Nguni cattle.

    Science.gov (United States)

    Mapholi, N O; Maiwashe, A; Matika, O; Riggio, V; Bishop, S C; MacNeil, M D; Banga, C; Taylor, J F; Dzama, K

    2016-04-01

    Ticks and tick-borne diseases are among the main causes of economic loss in the South African cattle industry through high morbidity and mortality rates. Concerns of the general public regarding chemical residues may tarnish their perceptions of food safety and environmental health when the husbandry of cattle includes frequent use of acaricides to manage ticks. The primary objective of this study was to identify single nucleotide polymorphism (SNP) markers associated with host resistance to ticks in South African Nguni cattle. Tick count data were collected monthly from 586 Nguni cattle reared in four herds under natural grazing conditions over a period of two years. The counts were recorded for six species of ticks attached in eight anatomical locations on the animals and were summed by species and anatomical location. This gave rise to 63 measured phenotypes or traits, with results for 12 of these traits being reported here. Tick count (x) data were transformed using log10(x+1) and the resulting values were examined for normality. DNA was extracted from hair and blood samples and was genotyped using the Illumina BovineSNP50 assay. After quality control (call rate >90%, minor allele frequency >0.02), 40,436 SNPs were retained for analysis. Genetic parameters were estimated and association analysis for tick resistance was carried out using two approaches: a genome-wide association (GWA) analysis using the GenABEL package and a regional heritability mapping (RHM) analysis. The Bonferroni genome-wide (PAmblyomma hebraeum (the vector for Heartwater disease) being the dominant species. Heritability estimates (h(2)) from the fitted animal and sire models ranged from 0.02±0.00 to 0.17±0.04 for the transformed tick count data. Several genomic regions harbouring quantitative trait loci (QTL) were identified for different tick count traits by both the GWA and RHM approaches. Three genome-wide significant regions on chromosomes 7, 10 and 19 were identified for total tick

  12. A genome-wide association study of serum uric acid in African Americans

    Directory of Open Access Journals (Sweden)

    Gerry Norman P

    2011-02-01

    Full Text Available Abstract Background Uric acid is the primary byproduct of purine metabolism. Hyperuricemia is associated with body mass index (BMI, sex, and multiple complex diseases including gout, hypertension (HTN, renal disease, and type 2 diabetes (T2D. Multiple genome-wide association studies (GWAS in individuals of European ancestry (EA have reported associations between serum uric acid levels (SUAL and specific genomic loci. The purposes of this study were: 1 to replicate major signals reported in EA populations; and 2 to use the weak LD pattern in African ancestry population to better localize (fine-map reported loci and 3 to explore the identification of novel findings cognizant of the moderate sample size. Methods African American (AA participants (n = 1,017 from the Howard University Family Study were included in this study. Genotyping was performed using the Affymetrix® Genome-wide Human SNP Array 6.0. Imputation was performed using MACH and the HapMap reference panels for CEU and YRI. A total of 2,400,542 single nucleotide polymorphisms (SNPs were assessed for association with serum uric acid under the additive genetic model with adjustment for age, sex, BMI, glomerular filtration rate, HTN, T2D, and the top two principal components identified in the assessment of admixture and population stratification. Results Four variants in the gene SLC2A9 achieved genome-wide significance for association with SUAL (p-values ranging from 8.88 × 10-9 to 1.38 × 10-9. Fine-mapping of the SLC2A9 signals identified a 263 kb interval of linkage disequilibrium in the HapMap CEU sample. This interval was reduced to 37 kb in our AA and the HapMap YRI samples. Conclusions The most strongly associated locus for SUAL in EA populations was also the most strongly associated locus in this AA sample. This finding provides evidence for the role of SLC2A9 in uric acid metabolism across human populations. Additionally, our findings demonstrate the utility of following-up EA

  13. Myosin individualized: single nucleotide polymorphisms in energy transduction

    Directory of Open Access Journals (Sweden)

    Wieben Eric D

    2010-03-01

    Full Text Available Abstract Background Myosin performs ATP free energy transduction into mechanical work in the motor domain of the myosin heavy chain (MHC. Energy transduction is the definitive systemic feature of the myosin motor performed by coordinating in a time ordered sequence: ATP hydrolysis at the active site, actin affinity modulation at the actin binding site, and the lever-arm rotation of the power stroke. These functions are carried out by several conserved sub-domains within the motor domain. Single nucleotide polymorphisms (SNPs affect the MHC sequence of many isoforms expressed in striated muscle, smooth muscle, and non-muscle tissue. The purpose of this work is to provide a rationale for using SNPs as a functional genomics tool to investigate structurefunction relationships in myosin. In particular, to discover SNP distribution over the conserved sub-domains and surmise what it implies about sub-domain stability and criticality in the energy transduction mechanism. Results An automated routine identifying human nonsynonymous SNP amino acid missense substitutions for any MHC gene mined the NCBI SNP data base. The routine tested 22 MHC genes coding muscle and non-muscle isoforms and identified 89 missense mutation positions in the motor domain with 10 already implicated in heart disease and another 8 lacking sequence homology with a skeletal MHC isoform for which a crystallographic model is available. The remaining 71 SNP substitutions were found to be distributed over MHC with 22 falling outside identified functional sub-domains and 49 in or very near to myosin sub-domains assigned specific crucial functions in energy transduction. The latter includes the active site, the actin binding site, the rigid lever-arm, and regions facilitating their communication. Most MHC isoforms contained SNPs somewhere in the motor domain. Conclusions Several functional-crucial sub-domains are infiltrated by a large number of SNP substitution sites suggesting these

  14. Single nucleotide polymorphism discovery in elite north american potato germplasm

    Directory of Open Access Journals (Sweden)

    De Jong Walter S

    2011-06-01

    Full Text Available Abstract Background Current breeding approaches in potato rely almost entirely on phenotypic evaluations; molecular markers, with the exception of a few linked to disease resistance traits, are not widely used. Large-scale sequence datasets generated primarily through Sanger Expressed Sequence Tag projects are available from a limited number of potato cultivars and access to next generation sequencing technologies permits rapid generation of sequence data for additional cultivars. When coupled with the advent of high throughput genotyping methods, an opportunity now exists for potato breeders to incorporate considerably more genotypic data into their decision-making. Results To identify a large number of Single Nucleotide Polymorphisms (SNPs in elite potato germplasm, we sequenced normalized cDNA prepared from three commercial potato cultivars: 'Atlantic', 'Premier Russet' and 'Snowden'. For each cultivar, we generated 2 Gb of sequence which was assembled into a representative transcriptome of ~28-29 Mb for each cultivar. Using the Maq SNP filter that filters read depth, density, and quality, 575,340 SNPs were identified within these three cultivars. In parallel, 2,358 SNPs were identified within existing Sanger sequences for three additional cultivars, 'Bintje', 'Kennebec', and 'Shepody'. Using a stringent set of filters in conjunction with the potato reference genome, we identified 69,011 high confidence SNPs from these six cultivars for use in genotyping with the Infinium platform. Ninety-six of these SNPs were used with a BeadXpress assay to assess allelic diversity in a germplasm panel of 248 lines; 82 of the SNPs proved sufficiently informative for subsequent analyses. Within diverse North American germplasm, the chip processing market class was most distinct, clearly separated from all other market classes. The round white and russet market classes both include fresh market and processing cultivars. Nevertheless, the russet and round

  15. Single nucleotide polymorphism (SNP-strings: an alternative method for assessing genetic associations.

    Directory of Open Access Journals (Sweden)

    Douglas S Goodin

    Full Text Available BACKGROUND: Genome-wide association studies (GWAS identify disease-associations for single-nucleotide-polymorphisms (SNPs from scattered genomic-locations. However, SNPs frequently reside on several different SNP-haplotypes, only some of which may be disease-associated. This circumstance lowers the observed odds-ratio for disease-association. METHODOLOGY/PRINCIPAL FINDINGS: Here we develop a method to identify the two SNP-haplotypes, which combine to produce each person's SNP-genotype over specified chromosomal segments. Two multiple sclerosis (MS-associated genetic regions were modeled; DRB1 (a Class II molecule of the major histocompatibility complex and MMEL1 (an endopeptidase that degrades both neuropeptides and β-amyloid. For each locus, we considered sets of eleven adjacent SNPs, surrounding the putative disease-associated gene and spanning ∼200 kb of DNA. The SNP-information was converted into an ordered-set of eleven-numbers (subject-vectors based on whether a person had zero, one, or two copies of particular SNP-variant at each sequential SNP-location. SNP-strings were defined as those ordered-combinations of eleven-numbers (0 or 1, representing a haplotype, two of which combined to form the observed subject-vector. Subject-vectors were resolved using probabilistic methods. In both regions, only a small number of SNP-strings were present. We compared our method to the SHAPEIT-2 phasing-algorithm. When the SNP-information spanning 200 kb was used, SHAPEIT-2 was inaccurate. When the SHAPEIT-2 window was increased to 2,000 kb, the concordance between the two methods, in both of these eleven-SNP regions, was over 99%, suggesting that, in these regions, both methods were quite accurate. Nevertheless, correspondence was not uniformly high over the entire DNA-span but, rather, was characterized by alternating peaks and valleys of concordance. Moreover, in the valleys of poor-correspondence, SHAPEIT-2 was also inconsistent with itself

  16. A novel statistic for genome-wide interaction analysis.

    Science.gov (United States)

    Wu, Xuesen; Dong, Hua; Luo, Li; Zhu, Yun; Peng, Gang; Reveille, John D; Xiong, Momiao

    2010-09-23

    Although great progress in genome-wide association studies (GWAS) has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked). The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDRanalysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search significant interaction between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  17. Genome-wide Association Analysis of Ten Chilling Tolerance Indices at the Germination and Seedling Stages in Maize

    Institute of Scientific and Technical Information of China (English)

    Juan Huang; Jianhua Zhang; Wenzhen Li; Wei Hu; Lichao Duan; Yang Feng; Fazhan Qiu

    2013-01-01

    Maize seedlings are very sensitive to chilling,especially during the transition phase from heterotrophic to autotrophic growth.Genetic dissection of the genetic basis of chilling tolerance would provide useful information for genetic improvement of maize inbreds.In this study,genome-wide association analysis was conducted to explore the genetic architecture of maize chilling tolerance at the seed germination and seedling stages with an association panel of 125 inbreds.Ten tolerance indices (ratios of the performance of 10 germination rates and seedling growth-related traits under chilling stress and control conditions)were investigated to assess the ability of chilling tolerance of the inbreds,and a total of 43 single nucleotide polymorphisms associated with chilling tolerance were detected,with none of them being related to chilling tolerance at both the germination and seedling stages simultaneously.Correlation analysis also revealed that the genetic basis of chilling tolerance at the seed germination stage is generally different from that at the seedling stage.In addition,a total of 40 candidate genes involving 31 of the 43 single nucleotide polymorphisms were predicted,and were grouped into five categories according to their functions.The possible roles of these candidate genes in chilling tolerance were also discussed.

  18. Genome-wide association study identifies variants in the ABO locus associated with susceptibility to pancreatic cancer

    Science.gov (United States)

    Amundadottir, Laufey; Kraft, Peter; Stolzenberg-Solomon, Rachael Z.; Fuchs, Charles S.; Petersen, Gloria M.; Arslan, Alan A.; Bueno-de-Mesquita, H. Bas; Gross, Myron; Helzlsouer, Kathy; Jacobs, Eric J.; LaCroix, Andrea; Zheng, Wei; Albanes, Demetrius; Bamlet, William; Berg, Christine D.; Berrino, Franco; Bingham, Sheila; Buring, Julie E.; Bracci, Paige M.; Canzian, Federico; Clavel-Chapelon, Françoise; Clipp, Sandra; Cotterchio, Michelle; de Andrade, Mariza; Duell, Eric J.; Fox, John W.; Gallinger, Steven; Gaziano, J. Michael; Giovannucci, Edward L.; Goggins, Michael; González, Carlos A.; Hallmans, Göran; Hankinson, Susan E.; Hassan, Manal; Holly, Elizabeth A.; Hunter, David J.; Hutchinson, Amy; Jackson, Rebecca; Jacobs, Kevin B.; Jenab, Mazda; Kaaks, Rudolf; Klein, Alison P.; Kooperberg, Charles; Kurtz, Robert C.; Li, Donghui; Lynch, Shannon M.; Mandelson, Margaret; McWilliams, Robert R.; Mendelsohn, Julie B.; Michaud, Dominique S.; Olson, Sara H.; Overvad, Kim; Patel, Alpa V.; Peeters, Petra H.M.; Rajkovic, Aleksandar; Riboli, Elio; Risch, Harvey A.; Shu, Xiao-Ou; Thomas, Gilles; Tobias, Geoffrey S.; Trichopoulos, Dimitrios; Van Den Eeden, Stephen K.; Virtamo, Jarmo; Wactawski-Wende, Jean; Wolpin, Brian M.; Yu, Herbert; Yu, Kai; Zeleniuch-Jacquotte, Anne; Chanock, Stephen J.; Hartge, Patricia; Hoover, Robert N.

    2010-01-01

    We conducted a two-stage genome-wide association study (GWAS) of pancreatic cancer, a cancer with one of the poorest survival rates worldwide. Initially, we genotyped 558,542 single nucleotide polymorphisms in 1,896 incident cases and 1,939 controls drawn from twelve prospective cohorts plus one hospital-based case-control study. In a combined analysis adjusted for study, sex, ancestry and five principal components that included an additional 2,457 cases and 2,654 controls from eight case-control studies, we identified an association between a locus on 9q34 and pancreatic cancer marked by the single nucleotide polymorphism, rs505922 (combined P=5.37 × 10-8; multiplicative per-allele odds ratio (OR) 1.20; 95% CI 1.12-1.28). This SNP maps to the first intron of the ABO blood group gene. Our results are consistent with earlier epidemiologic evidence suggesting that people with blood group O may have a lower risk of pancreatic cancer than those with groups A or B. PMID:19648918

  19. Power analysis for genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Klein Robert J

    2007-08-01

    Full Text Available Abstract Background Genome-wide association studies are a promising new tool for deciphering the genetics of complex diseases. To choose the proper sample size and genotyping platform for such studies, power calculations that take into account genetic model, tag SNP selection, and the population of interest are required. Results The power of genome-wide association studies can be computed using a set of tag SNPs and a large number of genotyped SNPs in a representative population, such as available through the HapMap project. As expected, power increases with increasing sample size and effect size. Power also depends on the tag SNPs selected. In some cases, more power is obtained by genotyping more individuals at fewer SNPs than fewer individuals at more SNPs. Conclusion Genome-wide association studies should be designed thoughtfully, with the choice of genotyping platform and sample size being determined from careful power calculations.

  20. A novel statistic for genome-wide interaction analysis.

    Directory of Open Access Journals (Sweden)

    Xuesen Wu

    2010-09-01

    Full Text Available Although great progress in genome-wide association studies (GWAS has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked. The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001genome-wide interaction analysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search significant interaction between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  1. The common single-nucleotide polymorphism rs2681472 is associated with early-onset preeclampsia in Northern Han Chinese women.

    Science.gov (United States)

    Wan, Ji-Peng; Wang, Hong; Li, Chang-Zhong; Zhao, Han; You, Li; Shi, Dong-Hong; Sun, Xiu-Hua; Lv, Hong; Wang, Fei; Wen, Ze-Qing; Wang, Xie-Tong; Chen, Zi-Jiang

    2014-11-01

    Preeclampsia, characterized by hypertension and proteinuria, remains a leading cause of maternal morbidity and mortality. Recently, a genome-wide association study (GWAS) identified the single-nucleotide polymorphism, rs2681472, as a new hypertension susceptibility genetic variant. The purpose of this study was to evaluate the association between preeclampsia and rs268172 in a Northern Han Chinese population. We genotyped 1218 unrelated Northern Han Chinese women, including 515 patients with preeclampsia and 703 healthy controls. No significant differences were detected in the allele frequencies between patients and controls (P = .23). When patients were divided into early-onset and late-onset preeclampsia according to gestational age of disease onset, the allele frequencies significantly differed between controls and patients with early-onset preeclampsia (P = .02). Genotype frequencies also were significantly different between controls and patients early-onset preeclampsia when data were analyzed under additive (P = .03) and dominant (P = .009) models. We replicated this association in an independent Northern Han Chinese population and observed a significant difference in the allele frequencies between patients with early-onset preeclampsia and controls (P = .011). We report that rs2681472 is associated with early-onset preeclampsia in Northern Han Chinese women.

  2. Use of stochastic simulations to investigate the power and design of a whole genome association study using single nucleotide

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    This paper presents a quick, easy to implement and versatile way of using stochastic simulations to investigate the power and design of using single nucleotide polymorphism (SNP) arrays for genome-wide association studies in farm animals. It illustrates the methodology by discussing a small example where 6 experimental designs are considered to analyse the same resource consisting of 6006 animals with pedigree and phenotypic records: (1) genotyping the 30 most widely used sires in the population and all of their progeny (515 animals in total), (2) genotyping the 100 most widely used sires in the population and all of their progeny (1 102 animals in total), genotyping respectively (3) 515 and (4) 1 102 animals selected randomly or genotyping respectively (5) 515 and (6) 1 102 animals from the tails of the phenotypic distribution. Given the resource at hand, designs where the extreme animals are genotyped perform the best, followed by designs selecting animals at random. Designs where sires and their progeny are genotyped perform the worst, as even genotyping the 100 most widely used sires and their progeny is not as powerful of genotyping 515 extreme animals.

  3. Detailed analysis of association between common single nucleotide polymorphisms and subclinical atherosclerosis: The Multi-ethnic Study of Atherosclerosis.

    Science.gov (United States)

    Vargas, Jose D; Manichaikul, Ani; Wang, Xin-Qun; Rich, Stephen S; Rotter, Jerome I; Post, Wendy S; Polak, Joseph F; Budoff, Matthew J; Bluemke, David A

    2016-06-01

    Previously identified single nucleotide polymorphisms (SNPs) in genome wide association studies (GWAS) of cardiovascular disease (CVD) in participants of mostly European descent were tested for association with subclinical cardiovascular disease (sCVD), coronary artery calcium score (CAC) and carotid intima media thickness (CIMT) in the Multi-Ethnic Study of Atherosclerosis (MESA). The data in this data in brief article correspond to the article Common Genetic Variants and Subclinical Atherosclerosis: The Multi-Ethnic Study of Atherosclerosis [1]. This article includes the demographic information of the participants analyzed in the article as well as graphical displays and data tables of the association of the selected SNPs with CAC and of the meta-analysis across ethnicities of the association of CIMT-c (common carotid), CIMT-I (internal carotid), CAC-d (CAC as dichotomous variable with CAC>0) and CAC-c (CAC as continuous variable, the log of the raw CAC score plus one) and CVD. The data tables corresponding to the 9p21 fine mapping experiment as well as the power calculations referenced in the article are also included.

  4. SITDEM: A simulation tool for disease/endpoint models of association studies based on single nucleotide polymorphism genotypes

    Science.gov (United States)

    Oh, Jung Hun; Deasy, Joseph O.

    2016-01-01

    The association analysis between single nucleotide polymorphisms (SNPs) and disease or endpoint in genome-wide association studies (GWAS) has been considered as a powerful strategy for investigating genetic susceptibility and for identifying significant biomarkers. The statistical analysis approaches with simulated data have been widely used to review experimental designs and performance measurements. In recent years, a number of authors have proposed methods for the simulation of biological data in the genomic field. However, these methods use large-scale genomic data as a reference to simulate experiments, which may limit the use of the methods in the case where the data in specific studies are not available. Few methods use experimental results or observed parameters for simulation. The goal of this study is to develop a Web application called SITDEM to simulate disease/endpoint models in three different approaches based on only parameters observed in GWAS. In our simulation, a key task is to compute the probability of genotypes. Based on that, we randomly sample simulation data. Simulation results are shown as a function of p-value against odds ratio or relative risk of a SNP in dominant and recessive models. Our simulation results show the potential of SITDEM for simulating genotype data. SITDEM could be particularly useful for investigating the relationship among observed parameters for target SNPs and for estimating the number of variables (SNPs) required to result in significant p-values in multiple comparisons. The proposed simulation tool is freely available at http://www.snpmodel.com. PMID:24480173

  5. Genome-wide association studies and prediction of 17 traits related to phenology, biomass and cell wall composition in the energy grass Miscanthus sinensis.

    Science.gov (United States)

    Slavov, Gancho T; Nipper, Rick; Robson, Paul; Farrar, Kerrie; Allison, Gordon G; Bosch, Maurice; Clifton-Brown, John C; Donnison, Iain S; Jensen, Elaine

    2014-03-01

    • Increasing demands for food and energy require a step change in the effectiveness, speed and flexibility of crop breeding. Therefore, the aim of this study was to assess the potential of genome-wide association studies (GWASs) and genomic selection (i.e. phenotype prediction from a genome-wide set of markers) to guide fundamental plant science and to accelerate breeding in the energy grass Miscanthus. • We generated over 100,000 single-nucleotide variants (SNVs) by sequencing restriction site-associated DNA (RAD) tags in 138 Micanthus sinensis genotypes, and related SNVs to phenotypic data for 17 traits measured in a field trial. • Confounding by population structure and relatedness was severe in naïve GWAS analyses, but mixed-linear models robustly controlled for these effects and allowed us to detect multiple associations that reached genome-wide significance. Genome-wide prediction accuracies tended to be moderate to high (average of 0.57), but varied dramatically across traits. As expected, predictive abilities increased linearly with the size of the mapping population, but reached a plateau when the number of markers used for prediction exceeded 10,000-20,000, and tended to decline, but remain significant, when cross-validations were performed across subpopulations. • Our results suggest that the immediate implementation of genomic selection in Miscanthus breeding programs may be feasible.

  6. MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study

    Directory of Open Access Journals (Sweden)

    Xue Hong

    2009-01-01

    Full Text Available Abstract Background The interactions of multiple single nucleotide polymorphisms (SNPs are highly hypothesized to affect an individual's susceptibility to complex diseases. Although many works have been done to identify and quantify the importance of multi-SNP interactions, few of them could handle the genome wide data due to the combinatorial explosive search space and the difficulty to statistically evaluate the high-order interactions given limited samples. Results Three comparative experiments are designed to evaluate the performance of MegaSNPHunter. The first experiment uses synthetic data generated on the basis of epistasis models. The second one uses a genome wide study on Parkinson disease (data acquired by using Illumina HumanHap300 SNP chips. The third one chooses the rheumatoid arthritis study from Wellcome Trust Case Control Consortium (WTCCC using Affymetrix GeneChip 500K Mapping Array Set. MegaSNPHunter outperforms the best solution in this area and reports many potential interactions for the two real studies. Conclusion The experimental results on both synthetic data and two real data sets demonstrate that our proposed approach outperforms the best solution that is currently available in handling large-scale SNP data both in terms of speed and in terms of detection of potential interactions that were not identified before. To our knowledge, MegaSNPHunter is the first approach that is capable of identifying the disease-associated SNP interactions from WTCCC studies and is promising for practical disease prognosis.

  7. A genome-wide survey reveals a deletion polymorphism associated with resistance to gastrointestinal nematodes in Angus cattle.

    Science.gov (United States)

    Xu, Lingyang; Hou, Yali; Bickhart, Derek M; Song, Jiuzhou; Van Tassell, Curtis P; Sonstegard, Tad S; Liu, George E

    2014-06-01

    Gastrointestinal (GI) nematode infections are a worldwide threat to human health and animal production. In this study, we performed a genome-wide association study between copy number variations (CNVs) and resistance to GI nematodes in an Angus cattle population. Using a linear regression analysis, we identified one deletion CNV which reaches genome-wide significance after Bonferroni correction. With multiple mapped human olfactory receptor genes but no annotated bovine genes in the region, this significantly associated CNV displays high population frequencies (58.26 %) with a length of 104.8 kb on chr7. We further investigated the linkage disequilibrium (LD) relationships between this CNV and its nearby single nucleotide polymorphisms (SNPs) and genes. The underlining haplotype blocks contain immune-related genes such as ZNF496 and NLRP3. As this CNV co-segregates with linked SNPs and associated genes, we suspect that it could contribute to the detected variations in gene expression and thus differences in host parasite resistance.

  8. Bivariate genome-wide association study suggests that the DARC gene influences lean body mass and age at menarche.

    Science.gov (United States)

    Hai, Rong; Zhang, Lei; Pei, Yufang; Zhao, Lanjuan; Ran, Shu; Han, Yingying; Zhu, Xuezhen; Shen, Hui; Tian, Qing; Deng, Hongwen

    2012-06-01

    Lean body mass (LBM) and age at menarche (AAM) are two important complex traits for human health. The aim of this study was to identify pleiotropic genes for both traits using a powerful bivariate genome-wide association study (GWAS). Two studies, a discovery study and a replication study, were performed. In the discovery study, 909622 single nucleotide polymorphisms (SNPs) were genotyped in 801 unrelated female Han Chinese subjects using the Affymetrix human genome-wide SNP array 6.0 platform. Then, a bivariate GWAS was performed to identify the SNPs that may be important for LBM and AAM. In the replication study, significant findings from the discovery study were validated in 1692 unrelated Caucasian female subjects. One SNP rs3027009 that was bivariately associated with left arm lean mass and AAM in the discovery samples (P=7.26×10(-6)) and in the replication samples (P=0.005) was identified. The SNP is located at the upstream of DARC (Duffy antigen receptor for chemokines) gene, suggesting that DARC may play an important role in regulating the metabolisms of both LBM and AAM.

  9. A genome-wide association study of COPD identifies a susceptibility locus on chromosome 19q13.

    Science.gov (United States)

    Cho, Michael H; Castaldi, Peter J; Wan, Emily S; Siedlinski, Mateusz; Hersh, Craig P; Demeo, Dawn L; Himes, Blanca E; Sylvia, Jody S; Klanderman, Barbara J; Ziniti, John P; Lange, Christoph; Litonjua, Augusto A; Sparrow, David; Regan, Elizabeth A; Make, Barry J; Hokanson, John E; Murray, Tanda; Hetmanski, Jacqueline B; Pillai, Sreekumar G; Kong, Xiangyang; Anderson, Wayne H; Tal-Singer, Ruth; Lomas, David A; Coxson, Harvey O; Edwards, Lisa D; MacNee, William; Vestbo, Jørgen; Yates, Julie C; Agusti, Alvar; Calverley, Peter M A; Celli, Bartolome; Crim, Courtney; Rennard, Stephen; Wouters, Emiel; Bakke, Per; Gulsvik, Amund; Crapo, James D; Beaty, Terri H; Silverman, Edwin K

    2012-02-15

    The genetic risk factors for chronic obstructive pulmonary disease (COPD) are still largely unknown. To date, genome-wide association studies (GWASs) of limited size have identified several novel risk loci for COPD at CHRNA3/CHRNA5/IREB2, HHIP and FAM13A; additional loci may be identified through larger studies. We performed a GWAS using a total of 3499 cases and 1922 control subjects from four cohorts: the Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE); the Normative Aging Study (NAS) and National Emphysema Treatment Trial (NETT); Bergen, Norway (GenKOLS); and the COPDGene study. Genotyping was performed on Illumina platforms with additional markers imputed using 1000 Genomes data; results were summarized using fixed-effect meta-analysis. We identified a new genome-wide significant locus on chromosome 19q13 (rs7937, OR = 0.74, P = 2.9 × 10(-9)). Genotyping this single nucleotide polymorphism (SNP) and another nearby SNP in linkage disequilibrium (rs2604894) in 2859 subjects from the family-based International COPD Genetics Network study (ICGN) demonstrated supportive evidence for association for COPD (P = 0.28 and 0.11 for rs7937 and rs2604894), pre-bronchodilator FEV(1) (P = 0.08 and 0.04) and severe (GOLD 3&4) COPD (P = 0.09 and 0.017). This region includes RAB4B, EGLN2, MIA and CYP2A6, and has previously been identified in association with cigarette smoking behavior.

  10. Family-based genome-wide association study of frontal θ oscillations identifies potassium channel gene KCNJ6.

    Science.gov (United States)

    Kang, S J; Rangaswamy, M; Manz, N; Wang, J-C; Wetherill, L; Hinrichs, T; Almasy, L; Brooks, A; Chorlian, D B; Dick, D; Hesselbrock, V; Kramer, J; Kuperman, S; Nurnberger, J; Rice, J; Schuckit, M; Tischfield, J; Bierut, L J; Edenberg, H J; Goate, A; Foroud, T; Porjesz, B

    2012-08-01

    Event-related oscillations (EROs) represent highly heritable neuroelectric correlates of cognitive processes that manifest deficits in alcoholics and in offspring at high risk to develop alcoholism. Theta ERO to targets in the visual oddball task has been shown to be an endophenotype for alcoholism. A family-based genome-wide association study was performed for the frontal theta ERO phenotype using 634 583 autosomal single nucleotide polymorphisms (SNPs) genotyped in 1560 family members from 117 families densely affected by alcohol use disorders, recruited in the Collaborative Study on the Genetics of Alcoholism. Genome-wide significant association was found with several SNPs on chromosome 21 in KCNJ6 (a potassium inward rectifier channel; KIR3.2/GIRK2), with the most significant SNP at P = 4.7 × 10(-10)). The same SNPs were also associated with EROs from central and parietal electrodes, but with less significance, suggesting that the association is frontally focused. One imputed synonymous SNP in exon four, highly correlated with our top three SNPs, was significantly associated with the frontal theta ERO phenotype. These results suggest KCNJ6 or its product GIRK2 account for some of the variations in frontal theta band oscillations. GIRK2 receptor activation contributes to slow inhibitory postsynaptic potentials that modulate neuronal excitability, and therefore influence neuronal networks.

  11. Genome-wide association study of prostate-specific antigen levels identifies novel loci independent of prostate cancer

    Science.gov (United States)

    Hoffmann, Thomas J.; Passarelli, Michael N.; Graff, Rebecca E.; Emami, Nima C.; Sakoda, Lori C.; Jorgenson, Eric; Habel, Laurel A.; Shan, Jun; Ranatunga, Dilrini K.; Quesenberry, Charles P.; Chao, Chun R.; Ghai, Nirupa R.; Aaronson, David; Presti, Joseph; Nordström, Tobias; Wang, Zhaoming; Berndt, Sonja I.; Chanock, Stephen J.; Mosley, Jonathan D.; Klein, Robert J.; Middha, Mridu; Lilja, Hans; Melander, Olle; Kvale, Mark N.; Kwok, Pui-Yan; Schaefer, Catherine; Risch, Neil; Van Den Eeden, Stephen K.; Witte, John S.

    2017-01-01

    Prostate-specific antigen (PSA) levels have been used for detection and surveillance of prostate cancer (PCa). However, factors other than PCa—such as genetics—can impact PSA. Here we present findings from a genome-wide association study (GWAS) of PSA in 28,503 Kaiser Permanente whites and 17,428 men from replication cohorts. We detect 40 genome-wide significant (P<5 × 10−8) single-nucleotide polymorphisms (SNPs): 19 novel, 15 previously identified for PSA (14 of which were also PCa-associated), and 6 previously identified for PCa only. Further analysis incorporating PCa cases suggests that at least half of the 40 SNPs are PSA-associated independent of PCa. The 40 SNPs explain 9.5% of PSA variation in non-Hispanic whites, and the remaining GWAS SNPs explain an additional 31.7%; this percentage is higher in younger men, supporting the genetic basis of PSA levels. These findings provide important information about genetic markers for PSA that may improve PCa screening, thereby reducing over-diagnosis and over-treatment. PMID:28139693

  12. TMPRSS9 and GRIN2B are associated with neuroticism: a genome-wide association study in a European sample.

    Science.gov (United States)

    Aragam, Nagesh; Wang, Ke-Sheng; Anderson, James L; Liu, Xuefeng

    2013-06-01

    Major depression disorder (MDD) is a complex and chronic disease that ranks fourth as cause of disability worldwide. About 14 million adults in the USA are believed to have MDD, and an estimated 75 % attempt suicide making MDD a major public health problem. Neuroticism has been recognized as an endophenotype of MDD; however, few genome-wide association (GWA) analyses of neuroticism as a quantitative trait have been reported to date. The aim of this study is to identify genome-wide genetic variants affecting neuroticism using a European sample. A linear regression model was used to analyze the association with neuroticism as a continuous trait in the Netherlands Study of Depression and Anxiety and Netherlands Twin Registry population-based sample of 2,748 individuals with Perlegen 600K single nucleotide polymorphisms (SNPs). In addition, the neuroticism-associated genes/loci of the top 20 SNPs (p personality disorder (ASPD) in an Australian twin family study. Through GWA analysis, 32 neuroticism-associated SNPs (p neuroticism. Our findings provide a basis for replication in other populations to elucidate the potential role of these genetic variants in neuroticism and MDD along with a possible relationship between ASPD and neuroticism.

  13. European genome-wide association study identifies SLC14A1 as a new urinary bladder cancer susceptibility gene.

    Science.gov (United States)

    Rafnar, Thorunn; Vermeulen, Sita H; Sulem, Patrick; Thorleifsson, Gudmar; Aben, Katja K; Witjes, J Alfred; Grotenhuis, Anne J; Verhaegh, Gerald W; Hulsbergen-van de Kaa, Christina A; Besenbacher, Soren; Gudbjartsson, Daniel; Stacey, Simon N; Gudmundsson, Julius; Johannsdottir, Hrefna; Bjarnason, Hjordis; Zanon, Carlo; Helgadottir, Hafdis; Jonasson, Jon Gunnlaugur; Tryggvadottir, Laufey; Jonsson, Eirikur; Geirsson, Gudmundur; Nikulasson, Sigfus; Petursdottir, Vigdis; Bishop, D Timothy; Chung-Sak, Sei; Choudhury, Ananya; Elliott, Faye; Barrett, Jennifer H; Knowles, Margaret A; de Verdier, Petra J; Ryk, Charlotta; Lindblom, Annika; Rudnai, Peter; Gurzau, Eugene; Koppova, Kvetoslava; Vineis, Paolo; Polidoro, Silvia; Guarrera, Simonetta; Sacerdote, Carlotta; Panadero, Angeles; Sanz-Velez, José I; Sanchez, Manuel; Valdivia, Gabriel; Garcia-Prats, Maria D; Hengstler, Jan G; Selinski, Silvia; Gerullis, Holger; Ovsiannikov, Daniel; Khezri, Abdolaziz; Aminsharifi, Alireza; Malekzadeh, Mahyar; van den Berg, Leonard H; Ophoff, Roel A; Veldink, Jan H; Zeegers, Maurice P; Kellen, Eliane; Fostinelli, Jacopo; Andreoli, Daniele; Arici, Cecilia; Porru, Stefano; Buntinx, Frank; Ghaderi, Abbas; Golka, Klaus; Mayordomo, José I; Matullo, Giuseppe; Kumar, Rajiv; Steineck, Gunnar; Kiltie, Anne E; Kong, Augustine; Thorsteinsdottir, Unnur; Stefansson, Kari; Kiemeney, Lambertus A

    2011-11-01

    Three genome-wide association studies in Europe and the USA have reported eight urinary bladder cancer (UBC) susceptibility loci. Using extended case and control series and 1000 Genomes imputations of 5 340 737 single-nucleotide polymorphisms (SNPs), we searched for additional loci in the European GWAS. The discovery sample set consisted of 1631 cases and 3822 controls from the Netherlands and 603 cases and 37 781 controls from Iceland. For follow-up, we used 3790 cases and 7507 controls from 13 sample sets of European and Iranian ancestry. Based on the discovery analysis, we followed up signals in the urea transporter (UT) gene SLC14A. The strongest signal at this locus was represented by a SNP in intron 3, rs17674580, that reached genome-wide significance in the overall analysis of the discovery and follow-up groups: odds ratio = 1.17, P = 7.6 × 10(-11). SLC14A1 codes for UTs that define the Kidd blood group and are crucial for the maintenance of a constant urea concentration gradient in the renal medulla and, through this, the kidney's ability to concentrate urine. It is speculated that rs17674580, or other sequence variants in LD with it, indirectly modifies UBC risk by affecting urine production. If confirmed, this would support the 'urogenous contact hypothesis' that urine production and voiding frequency modify the risk of UBC.

  14. Role of DISC1 interacting proteins in schizophrenia risk from genome-wide analysis of missense SNPs.

    Science.gov (United States)

    Costas, Javier; Suárez-Rama, Jose Javier; Carrera, Noa; Paz, Eduardo; Páramo, Mario; Agra, Santiago; Brenlla, Julio; Ramos-Ríos, Ramón; Arrojo, Manuel

    2013-11-01

    A balanced translocation affecting DISC1 cosegregates with several psychiatric disorders, including schizophrenia, in a Scottish family. DISC1 is a hub protein of a network of protein-protein interactions involved in multiple developmental pathways within the brain. Gene set-based analysis has been proposed as an alternative to individual analysis of single nucleotide polymorphisms (SNPs) to get information from genome-wide association studies. In this work, we tested for an overrepresentation of the DISC1 interacting proteins within the top results of our ranked list of genes based on our previous genome-wide association study of missense SNPs in schizophrenia. Our data set consisted of 5100 common missense SNPs genotyped in 476 schizophrenic patients and 447 control subjects from Galicia, NW Spain. We used a modification of the Gene Set Enrichment Analysis adapted for SNPs, as implemented in the GenGen software. The analysis detected an overrepresentation of the DISC1 interacting proteins (permuted P-value=0.0158), indicative of the role of this gene set in schizophrenia risk. We identified seven leading-edge genes, MACF1, UTRN, DST, DISC1, KIF3A, SYNE1, and AKAP9, responsible for the overrepresentation. These genes are involved in neuronal cytoskeleton organization and intracellular transport through the microtubule cytoskeleton, suggesting that these processes may be impaired in schizophrenia.

  15. A genome-wide association study identifies risk loci to equine recurrent uveitis in German warmblood horses.

    Science.gov (United States)

    Kulbrock, Maike; Lehner, Stefanie; Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar

    2013-01-01

    Equine recurrent uveitis (ERU) is a common eye disease affecting up to 3-15% of the horse population. A genome-wide association study (GWAS) using the Illumina equine SNP50 bead chip was performed to identify loci conferring risk to ERU. The sample included a total of 144 German warmblood horses. A GWAS showed a significant single nucleotide polymorphism (SNP) on horse chromosome (ECA) 20 at 49.3 Mb, with IL-17A and IL-17F being the closest genes. This locus explained a fraction of 23% of the phenotypic variance for ERU. A GWAS taking into account the severity of ERU, revealed a SNP on ECA18 nearby to the crystalline gene cluster CRYGA-CRYGF. For both genomic regions on ECA18 and 20, significantly associated haplotypes containing the genome-wide significant SNPs could be demonstrated. In conclusion, our results are indicative for a genetic component regulating the possible critical role of IL-17A and IL-17F in the pathogenesis of ERU. The associated SNP on ECA18 may be indicative for cataract formation in the course of ERU.

  16. A genome-wide association study identifies risk loci to equine recurrent uveitis in German warmblood horses.

    Directory of Open Access Journals (Sweden)

    Maike Kulbrock

    Full Text Available Equine recurrent uveitis (ERU is a common eye disease affecting up to 3-15% of the horse population. A genome-wide association study (GWAS using the Illumina equine SNP50 bead chip was performed to identify loci conferring risk to ERU. The sample included a total of 144 German warmblood horses. A GWAS showed a significant single nucleotide polymorphism (SNP on horse chromosome (ECA 20 at 49.3 Mb, with IL-17A and IL-17F being the closest genes. This locus explained a fraction of 23% of the phenotypic variance for ERU. A GWAS taking into account the severity of ERU, revealed a SNP on ECA18 nearby to the crystalline gene cluster CRYGA-CRYGF. For both genomic regions on ECA18 and 20, significantly associated haplotypes containing the genome-wide significant SNPs could be demonstrated. In conclusion, our results are indicative for a genetic component regulating the possible critical role of IL-17A and IL-17F in the pathogenesis of ERU. The associated SNP on ECA18 may be indicative for cataract formation in the course of ERU.

  17. Genome wide association analysis of the 16th QTL- MAS Workshop dataset using the Random Forest machine learning approach

    Science.gov (United States)

    2014-01-01

    Background Genome wide association studies are now widely used in the livestock sector to estimate the association among single nucleotide polymorphisms (SNPs) distributed across the whole genome and one or more trait. As computational power increases, the use of machine learning techniques to analyze large genome wide datasets becomes possible. Methods The objective of this study was to identify SNPs associated with the three traits simulated in the 16th MAS-QTL workshop dataset using the Random Forest (RF) approach. The approach was applied to single and multiple trait estimated breeding values, and on yield deviations and to compare them with the results of the GRAMMAR-CG method. Results The two QTL mapping methods used, GRAMMAR-CG and RF, were successful in identifying the main QTLs for trait 1 on chromosomes 1 and 4, for trait 2 on chromosomes 1, 4 and 5 and for trait 3 on chromosomes 1, 2 and 3. Conclusions The results of the RF approach were confirmed by the GRAMMAR-CG method and validated by the effective QTL position, even if their approach to unravel cryptic genetic structure is different. Furthermore, both methods showed complementary findings. However, when the variance explained by the QTL is low, they both failed to detect significant associations. PMID:25519518

  18. Detection of genetic variants affecting cattle behaviour and their impact on milk production: a genome-wide association study.

    Science.gov (United States)

    Friedrich, Juliane; Brand, Bodo; Ponsuksili, Siriluck; Graunke, Katharina L; Langbein, Jan; Knaust, Jacqueline; Kühn, Christa; Schwerin, Manfred

    2016-02-01

    Behaviour traits of cattle have been reported to affect important production traits, such as meat quality and milk performance as well as reproduction and health. Genetic predisposition is, together with environmental stimuli, undoubtedly involved in the development of behaviour phenotypes. Underlying molecular mechanisms affecting behaviour in general and behaviour and productions traits in particular still have to be studied in detail. Therefore, we performed a genome-wide association study in an F2 Charolais × German Holstein cross-breed population to identify genetic variants that affect behaviour-related traits assessed in an open-field and novel-object test and analysed their putative impact on milk performance. Of 37,201 tested single nucleotide polymorphism (SNPs), four showed a genome-wide and 37 a chromosome-wide significant association with behaviour traits assessed in both tests. Nine of the SNPs that were associated with behaviour traits likewise showed a nominal significant association with milk performance traits. On chromosomes 14 and 29, six SNPs were identified to be associated with exploratory behaviour and inactivity during the novel-object test as well as with milk yield traits. Least squares means for behaviour and milk performance traits for these SNPs revealed that genotypes associated with higher inactivity and less exploratory behaviour promote higher milk yields. Whether these results are due to molecular mechanisms simultaneously affecting behaviour and milk performance or due to a behaviour predisposition, which causes indirect effects on milk performance by influencing individual reactivity, needs further investigation.

  19. Genome-wide association study identifies five new schizophrenia loci

    DEFF Research Database (Denmark)

    Ripke, Stephan; Sanders, Alan R; Kendler, Kenneth S;

    2011-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yiel...

  20. A genome-wide association study of anorexia nervosa

    NARCIS (Netherlands)

    Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, Annemarie; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M

    2014-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countri

  1. Genome-wide association study identifies five new schizophrenia loci

    NARCIS (Netherlands)

    Ripke, Stephan; Sanders, Alan R.; Kendler, Kenneth S.; Levinson, Douglas F.; Sklar, Pamela; Holmans, Peter A.; Lin, Dan-Yu; Duan, Jubao; Ophoff, Roel A.; Andreassen, Ole A.; Scolnick, Edward; Cichon, Sven; Clair, David St.; Corvin, Aiden; Gurling, Hugh; Werge, Thomas; Rujescu, Dan; Blackwood, Douglas H. R.; Pato, Carlos N.; Malhotra, Anil K.; Purcell, Shaun; Dudbridge, Frank; Neale, Benjamin M.; Rossin, Lizzy; Visscher, Peter M.; Posthuma, Danielle; Ruderfer, Douglas M.; Fanous, Ayman; Stefansson, Hreinn; Steinberg, Stacy; Mowry, Bryan J.; Golimbet, Vera; De Hert, Marc; Jonsson, Erik G.; Bitter, Istvan; Pietilainen, Olli P. H.; Collier, David A.; Tosato, Sarah; Agartz, Ingrid; Albus, Margot; Alexander, Madeline; Amdur, Richard L.; Amin, Farooq; Bass, Nicholas; Bergen, Sarah E.; Black, Donald W.; Borglum, Anders D.; Brown, Matthew A.; Bruggeman, Richard; Buccola, Nancy G.; Byerley, William F.; Cahn, Wiepke; Cantor, Rita M.; Carr, Vaughan J.; Catts, Stanley V.; Choudhury, Khalid; Cloninger, C. Robert; Cormican, Paul; Craddock, Nicholas; Danoy, Patrick A.; Datta, Susmita; De Haan, Lieuwe; Demontis, Ditte; Dikeos, Dimitris; Djurovic, Srdjan; Donnelly, Peter; Donohoe, Gary; Duong, Linh; Dwyer, Sarah; Fink-Jensen, Anders; Freedman, Robert; Freimer, Nelson B.; Friedl, Marion; Georgieva, Lyudmila; Giegling, Ina; Gill, Michael; Glenthoj, Birte; Godard, Stephanie; Hamshere, Marian; Hansen, Mark; Hansen, Thomas; Hartmann, Annette M.; Henskens, Frans A.; Hougaard, David M.; Hultman, Christina M.; Ingason, Andres; Jablensky, Assen V.; Jakobsen, Klaus D.; Jay, Maurice; Juergens, Gesche; Kahn, Renes; Keller, Matthew C.; Kenis, Gunter; Kenny, Elaine; Kim, Yunjung; Kirov, George K.; Konnerth, Heike; Konte, Bettina; Krabbendam, Lydia; Krasucki, Robert; Lasseter, Virginia K.; Laurent, Claudine; Lawrence, Jacob; Lencz, Todd; Lerer, F. Bernard; Liang, Kung-Yee; Lichtenstein, Paul; Lieberman, Jeffrey A.; Linszen, Don H.; Lonnqvist, Jouko; Loughland, Carmel M.; Maclean, Alan W.; Maher, Brion S.; Maier, Wolfgang; Mallet, Jacques; Malloy, Pat; Mattheisen, Manuel; Mattingsdal, Morten; McGhee, Kevin A.; McGrath, John J.; McIntosh, Andrew; McLean, Duncan E.; McQuillin, Andrew; Melle, Ingrid; Michie, Patricia T.; Milanova, Vihra; Morris, Derek W.; Mors, Ole; Mortensen, Preben B.; Moskvina, Valentina; Muglia, Pierandrea; Myin-Germeys, Inez; Nertney, Deborah A.; Nestadt, Gerald; Nielsen, Jimmi; Nikolov, Ivan; Nordentoft, Merete; Norton, Nadine; Noethen, Markus M.; O'Dushlaine, Colm T.; Olincy, Ann; Olsen, Line; O'Neill, F. Anthony; Orntoft, Torben F.; Owen, Michael J.; Pantelis, Christos; Papadimitriou, George; Pato, Michele T.; Peltonen, Leena; Petursson, Hannes; Pickard, Ben; Pimm, Jonathan; Pulver, Ann E.; Puri, Vinay; Quested, Digby; Quinn, Emma M.; Rasmussen, Henrik B.; Rethelyi, Janos M.; Ribble, Robert; Rietschel, Marcella; Riley, Brien P.; Ruggeri, Mirella; Schall, Ulrich; Schulze, Thomas G.; Schwab, Sibylle G.; Scott, Rodney J.; Shi, Jianxin; Sigurdsson, Engilbert; Silverman, Jeremy M.; Spencer, Chris C. A.; Stefansson, Kari; Strange, Amy; Strengman, Eric; Stroup, T. Scott; Suvisaari, Jaana; Terenius, Lars; Thirumalai, Srinivasa; Thygesen, Johan H.; Timm, Sally; Toncheva, Draga; van den Oord, Edwin; van Os, Jim; van Winkel, Ruud; Veldink, Jan; Walsh, Dermot; Wang, August G.; Wiersma, Durk; Wildenauer, Dieter B.; Williams, Hywel J.; Williams, Nigel M.; Wormley, Brandon; Zammit, Stan; Sullivan, Patrick F.; O'Donovan, Michael C.; Daly, Mark J.; Gejman, Pablo V.

    2011-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded

  2. Genome-wide association study identifies five new schizophrenia loci.

    LENUS (Irish Health Repository)

    Ripke, Stephan

    2011-10-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10(-11)) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10(-9)), ANK3 (rs10994359, P = 2.5 × 10(-8)) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10(-9)).

  3. Genome-wide significant risk associations for mucinous ovarian carcinoma

    DEFF Research Database (Denmark)

    Kelemen, Linda E; Lawrenson, Kate; Tyrer, Jonathan;

    2015-01-01

    Genome-wide association studies have identified several risk associations for ovarian carcinomas but not for mucinous ovarian carcinomas (MOCs). Our analysis of 1,644 MOC cases and 21,693 controls with imputation identified 3 new risk associations: rs752590 at 2q13 (P = 3.3 × 10(-8)), rs711830 at...

  4. A single-nucleotide deletion in the POMP 5' UTR causes a transcriptional switch and altered epidermal proteasome distribution in KLICK genodermatosis.

    Science.gov (United States)

    Dahlqvist, Johanna; Klar, Joakim; Tiwari, Neha; Schuster, Jens; Törmä, Hans; Badhai, Jitendra; Pujol, Ramon; van Steensel, Maurice A M; Brinkhuizen, Tjinta; Brinkhuijzen, Tjinta; Gijezen, Lieke; Chaves, Antonio; Tadini, Gianluca; Vahlquist, Anders; Dahl, Niklas

    2010-04-09

    KLICK syndrome is a rare autosomal-recessive skin disorder characterized by palmoplantar keratoderma, linear hyperkeratotic papules, and ichthyosiform scaling. In order to establish the genetic cause of this disorder, we collected DNA samples from eight European probands. Using high-density genome-wide SNP analysis, we identified a 1.5 Mb homozygous candidate region on chromosome 13q. Sequence analysis of the ten annotated genes in the candidate region revealed homozygosity for a single-nucleotide deletion at position c.-95 in the proteasome maturation protein (POMP) gene, in all probands. The deletion is included in POMP transcript variants with long 5' untranslated regions (UTRs) and was associated with a marked increase of these transcript variants in keratinocytes from KLICK patients. POMP is a ubiquitously expressed protein and functions as a chaperone for proteasome maturation. Immunohistochemical analysis of skin biopsies from KLICK patients revealed an altered epidermal distribution of POMP, the proteasome subunit proteins alpha 7 and beta 5, and the ER stress marker CHOP. Our results suggest that KLICK syndrome is caused by a single-nucleotide deletion in the 5' UTR of POMP resulting in altered distribution of POMP in epidermis and a perturbed formation of the outermost layers of the skin. These findings imply that the proteasome has a prominent role in the terminal differentiation of human epidermis.

  5. Genome wide analysis of drug-induced torsades de pointes: lack of common variants with large effect sizes.

    Directory of Open Access Journals (Sweden)

    Elijah R Behr

    Full Text Available Marked prolongation of the QT interval on the electrocardiogram associated with the polymorphic ventricular tachycardia Torsades de Pointes is a serious adverse event during treatment with antiarrhythmic drugs and other culprit medications, and is a common cause for drug relabeling and withdrawal. Although clinical risk factors have been identified, the syndrome remains unpredictable in an individual patient. Here we used genome-wide association analysis to search for common predisposing genetic variants. Cases of drug-induced Torsades de Pointes (diTdP, treatment tolerant controls, and general population controls were ascertained across multiple sites using common definitions, and genotyped on the Illumina 610k or 1M-Duo BeadChips. Principal Components Analysis was used to select 216 Northwestern European diTdP cases and 771 ancestry-matched controls, including treatment-tolerant and general population subjects. With these sample sizes, there is 80% power to detect a variant at genome-wide significance with minor allele frequency of 10% and conferring an odds ratio of ≥2.7. Tests of association were carried out for each single nucleotide polymorphism (SNP by logistic regression adjusting for gender and population structure. No SNP reached genome wide-significance; the variant with the lowest P value was rs2276314, a non-synonymous coding variant in C18orf21 (p  =  3×10(-7, odds ratio = 2, 95% confidence intervals: 1.5-2.6. The haplotype formed by rs2276314 and a second SNP, rs767531, was significantly more frequent in controls than cases (p  =  3×10(-9. Expanding the number of controls and a gene-based analysis did not yield significant associations. This study argues that common genomic variants do not contribute importantly to risk for drug-induced Torsades de Pointes across multiple drugs.

  6. The challenges of genome-wide interaction studies: lessons to learn from the analysis of HDL blood levels.

    Directory of Open Access Journals (Sweden)

    Elisabeth M van Leeuwen

    Full Text Available Genome-wide association studies (GWAS have revealed 74 single nucleotide polymorphisms (SNPs associated with high-density lipoprotein cholesterol (HDL blood levels. This study is, to our knowledge, the first genome-wide interaction study (GWIS to identify SNP×SNP interactions associated with HDL levels. We performed a GWIS in the Rotterdam Study (RS cohort I (RS-I using the GLIDE tool which leverages the massively parallel computing power of Graphics Processing Units (GPUs to perform linear regression on all genome-wide pairs of SNPs. By performing a meta-analysis together with Rotterdam Study cohorts II and III (RS-II and RS-III, we were able to filter 181 interaction terms with a p-value<1 · 10-8 that replicated in the two independent cohorts. We were not able to replicate any of these interaction term in the AGES, ARIC, CHS, ERF, FHS and NFBC-66 cohorts (Ntotal = 30,011 when adjusting for multiple testing. Our GWIS resulted in the consistent finding of a possible interaction between rs774801 in ARMC8 (ENSG00000114098 and rs12442098 in SPATA8 (ENSG00000185594 being associated with HDL levels. However, p-values do not reach the preset Bonferroni correction of the p-values. Our study suggest that even for highly genetically determined traits such as HDL the sample sizes needed to detect SNP×SNP interactions are large and the 2-step filtering approaches do not yield a solution. Here we present our analysis plan and our reservations concerning GWIS.

  7. Promising Loci and Genes for Yolk and Ovary Weight in Chickens Revealed by a Genome-Wide Association Study.

    Directory of Open Access Journals (Sweden)

    Congjiao Sun

    Full Text Available Because it serves as the cytoplasm of the oocyte and provides a large amount of reserves, the egg yolk has biological significance for developing embryos. The ovary and its hierarchy of follicles are the main reproductive organs responsible for yolk deposition in chickens. However, the genetic architecture underlying the yolk and ovarian follicle weights remains elusive. Here, we measured the yolk weight (YW at 11 age points from onset of egg laying to 72 weeks of age and measured the follicle weight (FW and ovary weight (OW at 73 weeks as part of a comprehensive genome-wide association study (GWAS in 1,534 F2 hens derived from reciprocal crosses between White Leghorn (WL and Dongxiang chickens (DX. For all ages, YWs exhibited moderate single nucleotide polymorphism (SNP-based heritability estimates (0.25-0.38, while the estimates for FW (0.16 and OW (0.20 were relatively low. Independent univariate genome-wide screens for each trait identified 12, 3, and 31 novel significant associations with YW, FW, and OW, respectively. A list of candidate genes such as ZAR1, STARD13, ACER1b, ACSBG2, and DHRS12 were identified for having a plausible function in yolk and follicle development. These genes are important to the initiation of embryogenesis, lipid transport, lipoprotein synthesis, lipid droplet promotion, and steroid hormone metabolism, respectively. Our study provides for the first time a genome-wide association (GWA analysis for follicle and ovary weight. Identification of the promising loci as well as potential candidate genes will greatly advance our understanding of the genetic basis underlying dynamic yolk weight and ovarian follicle development and has practical significance in breeding programs for the alteration of yolk weight at different age points.

  8. Genome-wide association study for circulating levels of PAI-1 provides novel insights into its regulation

    Science.gov (United States)

    Huang, Jie; Sabater-Lleal, Maria; Asselbergs, Folkert W.; Tregouet, David; Shin, So-Youn; Ding, Jingzhong; Baumert, Jens; Oudot-Mellakh, Tiphaine; Folkersen, Lasse; Johnson, Andrew D.; Smith, Nicholas L.; Williams, Scott M.; Ikram, Mohammad A.; Kleber, Marcus E.; Becker, Diane M.; Truong, Vinh; Mychaleckyj, Josyf C.; Tang, Weihong; Yang, Qiong; Sennblad, Bengt; Moore, Jason H.; Williams, Frances M. K.; Dehghan, Abbas; Silbernagel, Günther; Schrijvers, Elisabeth M. C.; Smith, Shelly; Karakas, Mahir; Tofler, Geoffrey H.; Silveira, Angela; Navis, Gerjan J.; Lohman, Kurt; Chen, Ming-Huei; Peters, Annette; Goel, Anuj; Hopewell, Jemma C.; Chambers, John C.; Saleheen, Danish; Lundmark, Per; Psaty, Bruce M.; Strawbridge, Rona J.; Boehm, Bernhard O.; Carter, Angela M.; Meisinger, Christa; Peden, John F.; Bis, Joshua C.; McKnight, Barbara; Öhrvik, John; Taylor, Kent; Franzosi, Maria Grazia; Seedorf, Udo; Collins, Rory; Franco-Cereceda, Anders; Syvänen, Ann-Christine; Goodall, Alison H.; Yanek, Lisa R.; Cushman, Mary; Müller-Nurasyid, Martina; Folsom, Aaron R.; Basu, Saonli; Matijevic, Nena; van Gilst, Wiek H.; Kooner, Jaspal S.; Hofman, Albert; Danesh, John; Clarke, Robert; Meigs, James B.; Kathiresan, Sekar; Reilly, Muredach P.; Klopp, Norman; Harris, Tamara B.; Winkelmann, Bernhard R.; Grant, Peter J.; Hillege, Hans L.; Watkins, Hugh; Spector, Timothy D.; Becker, Lewis C.; Tracy, Russell P.; März, Winfried; Uitterlinden, Andre G.; Eriksson, Per; Cambien, Francois; Morange, Pierre-Emmanuel; Koenig, Wolfgang; Soranzo, Nicole; van der Harst, Pim; Liu, Yongmei

    2012-01-01

    We conducted a genome-wide association study to identify novel associations between genetic variants and circulating plasminogen activator inhibitor-1 (PAI-1) concentration, and examined functional implications of variants and genes that were discovered. A discovery meta-analysis was performed in 19 599 subjects, followed by replication analysis of genome-wide significant (P < 5 × 10−8) single nucleotide polymorphisms (SNPs) in 10 796 independent samples. We further examined associations with type 2 diabetes and coronary artery disease, assessed the functional significance of the SNPs for gene expression in human tissues, and conducted RNA-silencing experiments for one novel association. We confirmed the association of the 4G/5G proxy SNP rs2227631 in the promoter region of SERPINE1 (7q22.1) and discovered genome-wide significant associations at 3 additional loci: chromosome 7q22.1 close to SERPINE1 (rs6976053, discovery P = 3.4 × 10−10); chromosome 11p15.2 within ARNTL (rs6486122, discovery P = 3.0 × 10−8); and chromosome 3p25.2 within PPARG (rs11128603, discovery P = 2.9 × 10−8). Replication was achieved for the 7q22.1 and 11p15.2 loci. There was nominal association with type 2 diabetes and coronary artery disease at ARNTL (P < .05). Functional studies identified MUC3 as a candidate gene for the second association signal on 7q22.1. In summary, SNPs in SERPINE1 and ARNTL and an SNP associated with the expression of MUC3 were robustly associated with circulating levels of PAI-1. PMID:22990020

  9. Unidimensional nonnegative scaling for genome-wide linkage disequilibrium maps.

    Science.gov (United States)

    Liao, Haiyong; Ng, Michael; Fung, Eric; Sham, Pak C

    2008-01-01

    The main aim of this paper is to propose and develop a unidimensional nonnegative scaling model to construct Linkage Disequilibrium (LD) maps. The proposed constrained scaling model can be efficiently solved by transforming it to an unconstrained model. The method is implemented in PC Clusters at Hong Kong Baptist University. The LD maps are constructed for four populations from Hapmap data sets with chromosomes of several ten thousand Single Nucleotide Polymorphisms (SNPs). The similarities and dissimilarities of the LD maps are studied and analysed. Computational results are also reported to show the effectiveness of the method using parallel computation.

  10. Evaluation of the lasso and the elastic net in genome-wide association studies.

    Science.gov (United States)

    Waldmann, Patrik; Mészáros, Gábor; Gredler, Birgit; Fuerst, Christian; Sölkner, Johann

    2013-01-01

    The number of publications performing genome-wide association studies (GWAS) has increased dramatically. Penalized regression approaches have been developed to overcome the challenges caused by the high dimensional data, but these methods are relatively new in the GWAS field. In this study we have compared the statistical performance of two methods (the least absolute shrinkage and selection operator-lasso and the elastic net) on two simulated data sets and one real data set from a 50 K genome-wide single nucleotide polymorphism (SNP) panel of 5570 Fleckvieh bulls. The first simulated data set displays moderate to high linkage disequilibrium between SNPs, whereas the second simulated data set from the QTLMAS 2010 workshop is biologically more complex. We used cross-validation to find the optimal value of regularization parameter λ with both minimum MSE and minimum MSE + 1SE of minimum MSE. The optimal λ values were used for variable selection. Based on the first simulated data, we found that the minMSE in general picked up too many SNPs. At minMSE + 1SE, the lasso didn't acquire any false positives, but selected too few correct SNPs. The elastic net provided the best compromise between few false positives and many correct selections when the penalty weight α was around 0.1. However, in our simulation setting, this α value didn't result in the lowest minMSE + 1SE. The number of selected SNPs from the QTLMAS 2010 data was after correction for population structure 82 and 161 for the lasso and the elastic net, respectively. In the Fleckvieh data set after population structure correction lasso and the elastic net identified from 1291 to 1966 important SNPs for milk fat content, with major peaks on chromosomes 5, 14, 15, and 20. Hence, we can conclude that it is important to analyze GWAS data with both the lasso and the elastic net and an alternative tuning criterion to minimum MSE is needed for variable selection.

  11. Evaluation of the lasso and the elastic net in genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Patrik eWaldmann

    2013-12-01

    Full Text Available The number of publications performing genome-wide association studies (GWAS has increased dramatically. Penalized regression approaches have been developed to overcome the challenges caused by the high dimensional data, but these methods are relatively new in the GWAS area. In this study we have compared the statistical performance of two methods (the least absolute shrinkage and selection operator - lasso and the elastic net on two simulated data sets and one real data set from a 50K genome-wide single nucleotide polymorphism (SNP panel of 5,570 Fleckvieh bulls. The first simulated data set displays moderate to high linkage disequilibrium between SNPs, whereas the second simulated data set from the QTLMAS 2010 workshop is biologically more complex. We used cross-validation to find the optimal value of regularization parameter λ with both minimum MSE and minimum MSE + 1 SE of minimum MSE. The optimal λ values were used for variable selection. Based on the first simulated data, we found that the minMSE in general picked up too many SNPs. At minMSE+1SE, the lasso didn’t acquire any false positives, but selected too few correct SNPs. The elastic net provided the best compromise between few false positives and many correct selections when the penalty weight α was around 0.1. However, in our simulation setting, this α value didn’t result in the lowest minMSE+1SE. The number of selected SNPs from the QTLMAS 2010 data was after correction for population structure 82 and 161 for the lasso and the elastic net, respectively. In the Fleckvieh data set after population structure correction lasso and the elastic net identified from 1,291 to 1,966 important SNPs for milk fat content, with major peaks on chromosomes 5, 14, 15 and 20. Hence, we can conclude that it is important to analyze GWAS data with both the lasso and the elastic net and an alternative tuning criterion to minimum MSE is needed for variable selection.

  12. A genome-wide association study of pulmonary function measures in the Framingham Heart Study.

    Directory of Open Access Journals (Sweden)

    Jemma B Wilk

    2009-03-01

    Full Text Available The ratio of forced expiratory volume in one second to forced vital capacity (FEV(1/FVC is a measure used to diagnose airflow obstruction and is highly heritable. We performed a genome-wide association study in 7,691 Framingham Heart Study participants to identify single-nucleotide polymorphisms (SNPs associated with the FEV(1/FVC ratio, analyzed as a percent of the predicted value. Identified SNPs were examined in an independent set of 835 Family Heart Study participants enriched for airflow obstruction. Four SNPs in tight linkage disequilibrium on chromosome 4q31 were associated with the percent predicted FEV(1/FVC ratio with p-values of genome-wide significance in the Framingham sample (best p-value = 3.6e-09. One of the four chromosome 4q31 SNPs (rs13147758; p-value 2.3e-08 in Framingham was genotyped in the Family Heart Study and produced evidence of association with the same phenotype, percent predicted FEV(1/FVC (p-value = 2.0e-04. The effect estimates for association in the Framingham and Family Heart studies were in the same direction, with the minor allele (G associated with higher FEV(1/FVC ratio levels. Results from the Family Heart Study demonstrated that the association extended to FEV(1 and dichotomous airflow obstruction phenotypes, particularly among smokers. The SNP rs13147758 was associated with the percent predicted FEV(1/FVC ratio in independent samples from the Framingham and Family Heart Studies producing a combined p-value of 8.3e-11, and this region of chromosome 4 around 145.68 megabases was associated with COPD in three additional populations reported in the accompanying manuscript. The associated SNPs do not lie within a gene transcript but are near the hedgehog-interacting protein (HHIP gene and several expressed sequence tags cloned from fetal lung. Though it is unclear what gene or regulatory effect explains the association, the region warrants further investigation.

  13. Frontotemporal dementia and its subtypes: a genome-wide association study

    Science.gov (United States)

    Ferrari, Raffaele; Hernandez, Dena G; Nalls, Michael A; Rohrer, Jonathan D; Ramasamy, Adaikalavan; Kwok, John B J; Dobson-Stone, Carol; Brooks, William S; Schofield, Peter R; Halliday, Glenda M; Hodges, John R; Piguet, Olivier; Bartley, Lauren; Thompson, Elizabeth; Haan, Eric; Hernández, Isabel; Ruiz, Agustín; Boada, Mercè; Borroni, Barbara; Padovani, Alessandro; Cruchaga, Carlos; Cairns, Nigel J; Benussi, Luisa; Binetti, Giuliano; Ghidoni, Roberta; Forloni, Gianluigi; Galimberti, Daniela; Fenoglio, Chiara; Serpente, Maria; Scarpini, Elio; Clarimón, Jordi; Lleó, Alberto; Blesa, Rafael; Waldö, Maria Landqvist; Nilsson, Karin; Nilsson, Christer; Mackenzie, Ian R A; Hsiung, Ging-Yuek R; Mann, David M A; Grafman, Jordan; Morris, Christopher M; Attems, Johannes; Griffiths, Timothy D; McKeith, Ian G; Thomas, Alan J; Pietrini, P; Huey, Edward D; Wassermann, Eric M; Baborie, Atik; Jaros, Evelyn; Tierney, Michael C; Pastor, Pau; Razquin, Cristina; Ortega-Cubero, Sara; Alonso, Elena; Perneczky, Robert; Diehl-Schmid, Janine; Alexopoulos, Panagiotis; Kurz, Alexander; Rainero, Innocenzo; Rubino, Elisa; Pinessi, Lorenzo; Rogaeva, Ekaterina; George-Hyslop, Peter St; Rossi, Giacomina; Tagliavini, Fabrizio; Giaccone, Giorgio; Rowe, James B; Schlachetzki, J C M; Uphill, James; Collinge, John; Mead, S; Danek, Adrian; Van Deerlin, Vivianna M; Grossman, Murray; Trojanowsk, John Q; van der Zee, Julie; Deschamps, William; Van Langenhove, Tim; Cruts, Marc; Van Broeckhoven, Christine; Cappa, Stefano F; Le Ber, Isabelle; Hannequin, Didier; Golfier, Véronique; Vercelletto, Martine; Brice, Alexis; Nacmias, Benedetta; Sorbi, Sandro; Bagnoli, Silvia; Piaceri, Irene; Nielsen, Jørgen E; Hjermind, Lena E; Riemenschneider, Matthias; Mayhaus, Manuel; Ibach, Bernd; Gasparoni, Gilles; Pichler, Sabrina; Gu, Wei; Rossor, Martin N; Fox, Nick C; Warren, Jason D; Spillantini, Maria Grazia; Morris, Huw R; Rizzu, Patrizia; Heutink, Peter; Snowden, Julie S; Rollinson, Sara; Richardson, Anna; Gerhard, Alexander; Bruni, Amalia C; Maletta, Raffaele; Frangipane, Francesca; Cupidi, Chiara; Bernardi, Livia; Anfossi, Maria; Gallo, Maura; Conidi, Maria Elena; Smirne, Nicoletta; Rademakers, Rosa; Baker, Matt; Dickson, Dennis W; Graff-Radford, Neill R; Petersen, Ronald C; Knopman, David; Josephs, Keith A; Boeve, Bradley F; Parisi, Joseph E; Seeley, William W; Miller, Bruce L; Karydas, Anna M; Rosen, Howard; van Swieten, John C; Dopper, Elise G P; Seelaar, Harro; Pijnenburg, Yolande AL; Scheltens, Philip; Logroscino, Giancarlo; Capozzo, Rosa; Novelli, Valeria; Puca, Annibale A; Franceschi, M; Postiglione, Alfredo; Milan, Graziella; Sorrentino, Paolo; Kristiansen, Mark; Chiang, Huei-Hsin; Graff, Caroline; Pasquier, Florence; Rollin, Adeline; Deramecourt, Vincent; Lebert, Florence; Kapogiannis, Dimitrios; Ferrucci, Luigi; Pickering-Brown, Stuart; Singleton, Andrew B; Hardy, John; Momeni, Parastoo

    2014-01-01

    Summary Background Frontotemporal dementia (FTD) is a complex disorder characterised by a broad range of clinical manifestations, differential pathological signatures, and genetic variability. Mutations in three genes—MAPT, GRN, and C9orf72—have been associated with FTD. We sought to identify novel genetic risk loci associated with the disorder. Methods We did a two-stage genome-wide association study on clinical FTD, analysing samples from 3526 patients with FTD and 9402 healthy controls. All participants had European ancestry. In the discovery phase (samples from 2154 patients with FTD and 4308 controls), we did separate association analyses for each FTD subtype (behavioural variant FTD, semantic dementia, progressive non-fluent aphasia, and FTD overlapping with motor neuron disease [FTD-MND]), followed by a meta-analysis of the entire dataset. We carried forward replication of the novel suggestive loci in an independent sample series (samples from 1372 patients and 5094 controls) and then did joint phase and brain expression and methylation quantitative trait loci analyses for the associated (p<5 × 10−8) and suggestive single-nucleotide polymorphisms. Findings We identified novel associations exceeding the genome-wide significance threshold (p<5 × 10−8) that encompassed the HLA locus at 6p21.3 in the entire cohort. We also identified a potential novel locus at 11q14, encompassing RAB38/CTSC, for the behavioural FTD subtype. Analysis of expression and methylation quantitative trait loci data suggested that these loci might affect expression and methylation incis. Interpretation Our findings suggest that immune system processes (link to 6p21.3) and possibly lysosomal and autophagy pathways (link to 11q14) are potentially involved in FTD. Our findings need to be replicated to better define the association of the newly identified loci with disease and possibly to shed light on the pathomechanisms contributing to FTD. Funding The National Institute of

  14. Genome-wide analysis of over 106 000 individuals identifies 9 neuroticism-associated loci.

    Science.gov (United States)

    Smith, D J; Escott-Price, V; Davies, G; Bailey, M E S; Colodro-Conde, L; Ward, J; Vedernikov, A; Marioni, R; Cullen, B; Lyall, D; Hagenaars, S P; Liewald, D C M; Luciano, M; Gale, C R; Ritchie, S J; Hayward, C; Nicholl, B; Bulik-Sullivan, B; Adams, M; Couvy-Duchesne, B; Graham, N; Mackay, D; Evans, J; Smith, B H; Porteous, D J; Medland, S E; Martin, N G; Holmans, P; McIntosh, A M; Pell, J P; Deary, I J; O'Donovan, M C

    2016-06-01

    Neuroticism is a personality trait of fundamental importance for psychological well-being and public health. It is strongly associated with major depressive disorder (MDD) and several other psychiatric conditions. Although neuroticism is heritable, attempts to identify the alleles involved in previous studies have been limited by relatively small sample sizes. Here we report a combined meta-analysis of genome-wide association study (GWAS) of neuroticism that includes 91 370 participants from the UK Biobank cohort, 6659 participants from the Generation Scotland: Scottish Family Health Study (GS:SFHS) and 8687 participants from a QIMR (Queensland Institute of Medical Research) Berghofer Medical Research Institute (QIMR) cohort. All participants were assessed using the same neuroticism instrument, the Eysenck Personality Questionnaire-Revised (EPQ-R-S) Short Form's Neuroticism scale. We found a single-nucleotide polymorphism-based heritability estimate for neuroticism of ∼15% (s.e.=0.7%). Meta-analysis identified nine novel loci associated with neuroticism. The strongest evidence for association was at a locus on chromosome 8 (P=1.5 × 10(-15)) spanning 4 Mb and containing at least 36 genes. Other associated loci included interesting candidate genes on chromosome 1 (GRIK3 (glutamate receptor ionotropic kainate 3)), chromosome 4 (KLHL2 (Kelch-like protein 2)), chromosome 17 (CRHR1 (corticotropin-releasing hormone receptor 1) and MAPT (microtubule-associated protein Tau)) and on chromosome 18 (CELF4 (CUGBP elav-like family member 4)). We found no evidence for genetic differences in the common allelic architecture of neuroticism by sex. By comparing our findings with those of the Psychiatric Genetics Consortia, we identified a strong genetic correlation between neuroticism and MDD and a less strong but significant genetic correlation with schizophrenia, although not with bipolar disorder. Polygenic risk scores derived from the primary UK Biobank sample captured

  15. Genome-wide association study identifies genetic loci associated with iron deficiency.

    Directory of Open Access Journals (Sweden)

    Christine E McLaren

    Full Text Available The existence of multiple inherited disorders of iron metabolism in man, rodents and other vertebrates suggests genetic contributions to iron deficiency. To identify new genomic locations associated with iron deficiency, a genome-wide association study (GWAS was performed using DNA collected from white men aged≥25 y and women≥50 y in the Hemochromatosis and Iron Overload Screening (HEIRS Study with serum ferritin (SF≤12 µg/L (cases and iron replete controls (SF>100 µg/L in men, SF>50 µg/L in women. Regression analysis was used to examine the association between case-control status (336 cases, 343 controls and quantitative serum iron measures and 331,060 single nucleotide polymorphism (SNP genotypes, with replication analyses performed in a sample of 71 cases and 161 controls from a population of white male and female veterans screened at a US Veterans Affairs (VA medical center. Five SNPs identified in the GWAS met genome-wide statistical significance for association with at least one iron measure, rs2698530 on chr. 2p14; rs3811647 on chr. 3q22, a known SNP in the transferrin (TF gene region; rs1800562 on chr. 6p22, the C282Y mutation in the HFE gene; rs7787204 on chr. 7p21; and rs987710 on chr. 22q11 (GWAS observed P<1.51×10(-7 for all. An association between total iron binding capacity and SNP rs3811647 in the TF gene (GWAS observed P=7.0×10(-9, corrected P=0.012 was replicated within the VA samples (observed P=0.012. Associations with the C282Y mutation in the HFE gene also were replicated. The joint analysis of the HEIRS and VA samples revealed strong associations between rs2698530 on chr. 2p14 and iron status outcomes. These results confirm a previously-described TF polymorphism and implicate one potential new locus as a target for gene identification.

  16. Genome-wide association study for cheese yield and curd nutrient recovery in dairy cows.

    Science.gov (United States)

    Dadousis, C; Biffani, S; Cipolat-Gotet, C; Nicolazzi, E L; Rosa, G J M; Gianola, D; Rossoni, A; Santus, E; Bittante, G; Cecchinato, A

    2017-02-01

    Cheese production and consumption are increasing in many countries worldwide. As a result, interest has increased in strategies for genetic selection of individuals for technological traits of milk related to cheese yield (CY) in dairy cattle breeding. However, little is known about the genetic background of a cow's ability to produce cheese. Recently, a relatively large panel (1,264 cows) of different measures of individual cow CY and milk nutrient and energy recoveries in the cheese (REC) became available. Genetic analyses showed considerable variation for CY and for aptitude to retain high proportions of fat, protein, and water in the coagulum. For the dairy industry, these characteristics are of major economic importance. Nevertheless, use of this knowledge in dairy breeding is hampered by high costs, intense labor requirement, and lack of appropriate technology. However, in the era of genomics, new possibilities are available for animal breeding and genetic improvement. For example, identification of genomic regions involved in cow CY might provide potential for marker-assisted selection. The objective of this study was to perform genome-wide association studies on different CY and REC measures. Milk and DNA samples from 1,152 Italian Brown Swiss cows were used. Three CY traits expressing the weight (wt) of fresh curd (%CYCURD), curd solids (%CYSOLIDS), and curd moisture (%CYWATER) as a percentage of weight of milk processed, and 4 REC (RECFAT, RECPROTEIN, RECSOLIDS, and RECENERGY, calculated as the % ratio between the nutrient in curd and the corresponding nutrient in processed milk) were analyzed. Animals were genotyped with the Illumina BovineSNP50 Bead Chip v.2. Single marker regressions were fitted using the GenABEL R package (genome-wide association using mixed model and regression-genomic control). In total, 103 significant associations (88 single nucleotide polymorphisms) were identified in 10 chromosomes (2, 6, 9, 11, 12, 14, 18, 19, 27, 28). For

  17. Genome-wide analysis of over 106 000 individuals identifies 9 neuroticism-associated loci

    Science.gov (United States)

    Smith, D J; Escott-Price, V; Davies, G; Bailey, M E S; Colodro-Conde, L; Ward, J; Vedernikov, A; Marioni, R; Cullen, B; Lyall, D; Hagenaars, S P; Liewald, D C M; Luciano, M; Gale, C R; Ritchie, S J; Hayward, C; Nicholl, B; Bulik-Sullivan, B; Adams, M; Couvy-Duchesne, B; Graham, N; Mackay, D; Evans, J; Smith, B H; Porteous, D J; Medland, S E; Martin, N G; Holmans, P; McIntosh, A M; Pell, J P; Deary, I J; O'Donovan, M C

    2016-01-01

    Neuroticism is a personality trait of fundamental importance for psychological well-being and public health. It is strongly associated with major depressive disorder (MDD) and several other psychiatric conditions. Although neuroticism is heritable, attempts to identify the alleles involved in previous studies have been limited by relatively small sample sizes. Here we report a combined meta-analysis of genome-wide association study (GWAS) of neuroticism that includes 91 370 participants from the UK Biobank cohort, 6659 participants from the Generation Scotland: Scottish Family Health Study (GS:SFHS) and 8687 participants from a QIMR (Queensland Institute of Medical Research) Berghofer Medical Research Institute (QIMR) cohort. All participants were assessed using the same neuroticism instrument, the Eysenck Personality Questionnaire-Revised (EPQ-R-S) Short Form's Neuroticism scale. We found a single-nucleotide polymorphism-based heritability estimate for neuroticism of ∼15% (s.e.=0.7%). Meta-analysis identified nine novel loci associated with neuroticism. The strongest evidence for association was at a locus on chromosome 8 (P=1.5 × 10−15) spanning 4 Mb and containing at least 36 genes. Other associated loci included interesting candidate genes on chromosome 1 (GRIK3 (glutamate receptor ionotropic kainate 3)), chromosome 4 (KLHL2 (Kelch-like protein 2)), chromosome 17 (CRHR1 (corticotropin-releasing hormone receptor 1) and MAPT (microtubule-associated protein Tau)) and on chromosome 18 (CELF4 (CUGBP elav-like family member 4)). We found no evidence for genetic differences in the common allelic architecture of neuroticism by sex. By comparing our findings with those of the Psychiatric Genetics Consortia, we identified a strong genetic correlation between neuroticism and MDD and a less strong but significant genetic correlation with schizophrenia, although not with bipolar disorder. Polygenic risk scores derived from the primary UK Biobank sample captured

  18. Genome wide copy number analysis of single cells

    Science.gov (United States)

    Baslan, Timour; Kendall, Jude; Rodgers, Linda; Cox, Hilary; Riggs, Mike; Stepansky, Asya; Troge, Jennifer; Ravi, Kandasamy; Esposito, Diane; Lakshmi, B.; Wigler, Michael; Navin, Nicholas; Hicks, James

    2016-01-01

    Summary Copy number variation (CNV) is increasingly recognized as an important contributor to phenotypic variation in health and disease. Most methods for determining CNV rely on admixtures of cells, where information regarding genetic heterogeneity is lost. Here, we present a protocol that allows for the genome wide copy number analysis of single nuclei isolated from mixed populations of cells. Single nucleus sequencing (SNS), combines flow sorting of single nuclei based on DNA content, whole genome amplification (WGA), followed by next generation sequencing to quantize genomic intervals in a genome wide manner. Multiplexing of single cells is discussed. Additionally, we outline informatic approaches that correct for biases inherent in the WGA procedure and allow for accurate determination of copy number profiles. All together, the protocol takes ~3 days from flow cytometry to sequence-ready DNA libraries. PMID:22555242

  19. Genome-wide patterns of selection in 230 ancient Eurasians

    Science.gov (United States)

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R.; Llamas, Bastien; Dryomov, Stanislav; Pickrel, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vayacheslav; Rojo Guerra, Manuel A.; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W.; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2016-01-01

    Ancient DNA makes it possible to directly witness natural selection by analyzing samples from populations before, during and after adaptation events. Here we report the first scan for selection using ancient DNA, capitalizing on the largest genome-wide dataset yet assembled: 230 West Eurasians dating to between 6500 and 1000 BCE, including 163 with newly reported data. The new samples include the first genome-wide data from the Anatolian Neolithic culture whose genetic material we extracted from the DNA-rich petrous bone and who we show were members of the population that was the source of Europe’s first farmers. We also report a complete transect of the steppe region in Samara between 5500 and 1200 BCE that allows us to recognize admixture from at least two external sources into steppe populations during this period. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  20. Genome-wide association studies and resting heart rate

    DEFF Research Database (Denmark)

    Oskari Kilpeläinen, Tuomas

    2016-01-01

    Genome-wide association studies (GWASs) have revolutionized the search for genetic variants regulating resting heart rate. In the last 10 years, GWASs have led to the identification of at least 21 novel heart rate loci. These discoveries have provided valuable insights into the mechanisms...... and pathways that regulate heart rate and link heart rate to cardiovascular morbidity and mortality. GWASs capture majority of genetic variation in a population sample by utilizing high-throughput genotyping chips measuring genotypes for up to several millions of SNPs across the genome in thousands...... of individuals. This allows the identification of the strongest heart rate associated signals at genome-wide level. While GWASs provide robust statistical evidence of the association of a given genetic locus with heart rate, they are only the starting point for detailed follow-up studies to locate the causal...

  1. Genome-wide association studies in pediatric endocrinology.

    Science.gov (United States)

    Dauber, Andrew; Hirschhorn, Joel N

    2011-01-01

    Genome-wide association (GWA) studies are a powerful tool for understanding the genetic underpinnings of human disease. In this article, we briefly review the role and findings of GWA studies in type 1 diabetes, stature, pubertal timing, obesity, and vitamin D deficiency. We then discuss the present and future implications of these findings with regards to disease prediction, uncovering basic biology, and the development of novel therapeutic agents.

  2. Genome-wide association study of relative telomere length.

    Science.gov (United States)

    Prescott, Jennifer; Kraft, Peter; Chasman, Daniel I; Savage, Sharon A; Mirabello, Lisa; Berndt, Sonja I; Weissfeld, Joel L; Han, Jiali; Hayes, Richard B; Chanock, Stephen J; Hunter, David J; De Vivo, Immaculata

    2011-05-10

    Telomere function is essential to maintaining the physical integrity of linear chromosomes and healthy human aging. The probability of forming proper telomere structures depends on the length of the telomeric DNA tract. We attempted to identify common genetic variants associated with log relative telomere length using genome-wide genotyping data on 3,554 individuals from the Nurses' Health Study and the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial that took part in the National Cancer Institute Cancer Genetic Markers of Susceptibility initiative for breast and prostate cancer. After genotyping 64 independent SNPs selected for replication in additional Nurses' Health Study and Women's Genome Health Study participants, we did not identify genome-wide significant loci; however, we replicated the inverse association of log relative telomere length with the minor allele variant [C] of rs16847897 at the TERC locus (per allele β = -0.03, P = 0.003) identified by a previous genome-wide association study. We did not find evidence for an association with variants at the OBFC1 locus or other loci reported to be associated with telomere length. With this sample size we had >80% power to detect β estimates as small as ±0.10 for SNPs with minor allele frequencies of ≥0.15 at genome-wide significance. However, power is greatly reduced for β estimates smaller than ±0.10, such as those for variants at the TERC locus. In general, common genetic variants associated with telomere length homeostasis have been difficult to detect. Potential biological and technical issues are discussed.

  3. Genome-wide association study of relative telomere length.

    Directory of Open Access Journals (Sweden)

    Jennifer Prescott

    Full Text Available Telomere function is essential to maintaining the physical integrity of linear chromosomes and healthy human aging. The probability of forming proper telomere structures depends on the length of the telomeric DNA tract. We attempted to identify common genetic variants associated with log relative telomere length using genome-wide genotyping data on 3,554 individuals from the Nurses' Health Study and the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial that took part in the National Cancer Institute Cancer Genetic Markers of Susceptibility initiative for breast and prostate cancer. After genotyping 64 independent SNPs selected for replication in additional Nurses' Health Study and Women's Genome Health Study participants, we did not identify genome-wide significant loci; however, we replicated the inverse association of log relative telomere length with the minor allele variant [C] of rs16847897 at the TERC locus (per allele β = -0.03, P = 0.003 identified by a previous genome-wide association study. We did not find evidence for an association with variants at the OBFC1 locus or other loci reported to be associated with telomere length. With this sample size we had >80% power to detect β estimates as small as ±0.10 for SNPs with minor allele frequencies of ≥0.15 at genome-wide significance. However, power is greatly reduced for β estimates smaller than ±0.10, such as those for variants at the TERC locus. In general, common genetic variants associated with telomere length homeostasis have been difficult to detect. Potential biological and technical issues are discussed.

  4. Genome-Wide Prediction of C. elegans Genetic Interactions

    OpenAIRE

    Zhong, Weiwei; Sternberg, Paul W.

    2006-01-01

    To obtain a global view of functional interactions among genes in a metazoan genome, we computationally integrated interactome data, gene expression data, phenotype data, and functional annotation data from three model organisms—Saccharomyces cerevisiae, Caenorhabditis elegans, and Drosophila melanogaster—and predicted genome-wide genetic interactions in C. elegans. The resulting genetic interaction network (consisting of 18,183 interactions) provides a framework for system-level understandin...

  5. Integrative genome-wide approaches in embryonic stem cell research.

    Science.gov (United States)

    Zhang, Xinyue; Huang, Jing

    2010-10-01

    Embryonic stem (ES) cells are derived from blastocysts. They can differentiate into the three embryonic germ layers and essentially any type of somatic cells. They therefore hold great potential in tissue regeneration therapy. The ethical issues associated with the use of human embryonic stem cells are resolved by the technical break-through of generating induced pluripotent stem (iPS) cells from various types of somatic cells. However, how ES and iPS cells self-renew and maintain their pluripotency is still largely unknown in spite of the great progress that has been made in the last two decades. Integrative genome-wide approaches, such as the gene expression microarray, chromatin immunoprecipitation based microarray (ChIP-chip) and chromatin immunoprecipitation followed by massive parallel sequencing (ChIP-seq) offer unprecedented opportunities to elucidate the mechanism of the pluripotency, reprogramming and DNA damage response of ES and iPS cells. This frontier article summarizes the fundamental biological questions about ES and iPS cells and reviews the recent advances in ES and iPS cell research using genome-wide technologies. To this end, we offer our perspectives on the future of genome-wide studies on stem cells.

  6. No evidence for genome-wide interactions on plasma fibrinogen by smoking, alcohol consumption and body mass index: results from meta-analyses of 80,607 subjects.

    Directory of Open Access Journals (Sweden)

    Jens Baumert

    Full Text Available Plasma fibrinogen is an acute phase protein playing an important role in the blood coagulation cascade having strong associations with smoking, alcohol consumption and body mass index (BMI. Genome-wide association studies (GWAS have identified a variety of gene regions associated with elevated plasma fibrinogen concentrations. However, little is yet known about how associations between environmental factors and fibrinogen might be modified by genetic variation. Therefore, we conducted large-scale meta-analyses of genome-wide interaction studies to identify possible interactions of genetic variants and smoking status, alcohol consumption or BMI on fibrinogen concentration. The present study included 80,607 subjects of European ancestry from 22 studies. Genome-wide interaction analyses were performed separately in each study for about 2.6 million single nucleotide polymorphisms (SNPs across the 22 autosomal chromosomes. For each SNP and risk factor, we performed a linear regression under an additive genetic model including an interaction term between SNP and risk factor. Interaction estimates were meta-analysed using a fixed-effects model. No genome-wide significant interaction with smoking status, alcohol consumption or BMI was observed in the meta-analyses. The most suggestive interaction was found for smoking and rs10519203, located in the LOC123688 region on chromosome 15, with a p value of 6.2 × 10(-8. This large genome-wide interaction study including 80,607 participants found no strong evidence of interaction between genetic variants and smoking status, alcohol consumption or BMI on fibrinogen concentrations. Further studies are needed to yield deeper insight in the interplay between environmental factors and gene variants on the regulation of fibrinogen concentrations.

  7. A genome-wide association study of emotion dysregulation: Evidence for interleukin 2 receptor alpha.

    Science.gov (United States)

    Powers, Abigail; Almli, Lynn; Smith, Alicia; Lori, Adriana; Leveille, Jen; Ressler, Kerry J; Jovanovic, Tanja; Bradley, Bekh

    2016-12-01

    Emotion dysregulation has been implicated as a risk factor for many psychiatric conditions. Therefore, examining genetic risk associated with emotion dysregulation could help inform cross-disorder risk more generally. A genome-wide association study (GWAS) of emotion dysregulation using single nucleotide polymorphism (SNP) array technology was conducted in a highly traumatized, minority, urban sample (N = 2600, males = 774). Post-hoc analyses examined associations between SNPs identified in the GWAS and current depression, posttraumatic stress disorder (PTSD), and history of suicide attempt. Methylation quantitative trait loci were identified and gene set enrichment analyses were used to broadly determine biological processes involved with these SNPs. Among males, SNP rs6602398, located within the interleukin receptor 2A gene, IL2RA, was significantly associated with emotion dysregulation (p = 1.1 × 10(-8)). Logistic regression analyses revealed this SNP was significantly associated with depression (Exp(B) = 2.67, p < 0.001) and PTSD (Exp(B) = 2.07, p < 0.01). This SNP was associated with differential DNA methylation (p < 0.05) suggesting it may be functionally active. Finally, through gene set enrichment analyses, ten psychiatric disease pathways (adjusted p < 0.01) and the calcium signaling pathway (adjusted p = 0.008) were significantly associated with emotion dysregulation. We found initial evidence for an association between emotion dysregulation and genetic risk loci that have already been implicated in medical disorders that have high comorbidity with psychiatric disorders. Our results provide further evidence that emotion dysregulation can be understood as a potential psychiatric cross-disorder risk factor, and that sex differences across these phenotypes may be critical. Continued research into genetic and biological risk associated with emotion dysregulation is needed.

  8. Prediction of disease and phenotype associations from genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Stephanie N Lewis

    Full Text Available BACKGROUND: Genome wide association studies (GWAS have proven useful as a method for identifying genetic variations associated with diseases. In this study, we analyzed GWAS data for 61 diseases and phenotypes to elucidate common associations based on single nucleotide polymorphisms (SNP. The study was an expansion on a previous study on identifying disease associations via data from a single GWAS on seven diseases. METHODOLOGY/PRINCIPAL FINDINGS: Adjustments to the originally reported study included expansion of the SNP dataset using Linkage Disequilibrium (LD and refinement of the four levels of analysis to encompass SNP, SNP block, gene, and pathway level comparisons. A pair-wise comparison between diseases and phenotypes was performed at each level and the Jaccard similarity index was used to measure the degree of association between two diseases/phenotypes. Disease relatedness networks (DRNs were used to visualize our results. We saw predominant relatedness between Multiple Sclerosis, type 1 diabetes, and rheumatoid arthritis for the first three levels of analysis. Expected relatedness was also seen between lipid- and blood-related traits. CONCLUSIONS/SIGNIFICANCE: The predominant associations between Multiple Sclerosis, type 1 diabetes, and rheumatoid arthritis can be validated by clinical studies. The diseases have been proposed to share a systemic inflammation phenotype that can result in progression of additional diseases in patients with one of these three diseases. We also noticed unexpected relationships between metabolic and neurological diseases at the pathway comparison level. The less significant relationships found between diseases require a more detailed literature review to determine validity of the predictions. The results from this study serve as a first step towards a better understanding of seemingly unrelated diseases and phenotypes with similar symptoms or modes of treatment.

  9. Genome-wide association study of smoking behaviours in patients with COPD.

    Science.gov (United States)

    Siedlinski, Mateusz; Cho, Michael H; Bakke, Per; Gulsvik, Amund; Lomas, David A; Anderson, Wayne; Kong, Xiangyang; Rennard, Stephen I; Beaty, Terri H; Hokanson, John E; Crapo, James D; Silverman, Edwin K

    2011-10-01

    Background Cigarette smoking is a major risk factor for chronic obstructive pulmonary disease (COPD) and COPD severity. Previous genome-wide association studies (GWAS) have identified numerous single nucleotide polymorphisms (SNPs) associated with the number of cigarettes smoked per day (CPD) and a dopamine beta-hydroxylase (DBH) locus associated with smoking cessation in multiple populations. Objective To identify SNPs associated with lifetime average and current CPD, age at smoking initiation, and smoking cessation in patients with COPD. Methods GWAS were conducted in four independent cohorts encompassing 3441 ever-smoking patients with COPD (Global Initiative for Obstructive Lung Disease stage II or higher). Untyped SNPs were imputed using the HapMap (phase II) panel. Results from all cohorts were meta-analysed. Results Several SNPs near the HLA region on chromosome 6p21 and in an intergenic region on chromosome 2q21 showed associations with age at smoking initiation, both with the lowest p=2×10(-7). No SNPs were associated with lifetime average CPD, current CPD or smoking cessation with p<10(-6). Nominally significant associations with candidate SNPs within cholinergic receptors, nicotinic, alpha 3/5 (CHRNA3/CHRNA5; eg, p=0.00011 for SNP rs1051730) and cytochrome P450, family 2, subfamily A, polypeptide 6 (CYP2A6; eg, p=2.78×10(-5) for a non-synonymous SNP rs1801272) regions were observed for lifetime average CPD, however only CYP2A6 showed evidence of significant association with current CPD. A candidate SNP (rs3025343) in DBH was significantly (p=0.015) associated with smoking cessation. Conclusion The authors identified two candidate regions associated with age at smoking initiation in patients with COPD. Associations of CHRNA3/CHRNA5 and CYP2A6 loci with CPD and DBH with smoking cessation are also likely of importance in the smoking behaviours of patients with COPD.

  10. A genome-wide linkage study of individuals with high scores on NEO personality traits.

    Science.gov (United States)

    Amin, N; Schuur, M; Gusareva, E S; Isaacs, A; Aulchenko, Y S; Kirichenko, A V; Zorkoltseva, I V; Axenovich, T I; Oostra, B A; Janssens, A C J W; van Duijn, C M

    2012-10-01

    The NEO-Five-Factor Inventory divides human personality traits into five dimensions: neuroticism, extraversion, openness, conscientiousness and agreeableness. In this study, we sought to identify regions harboring genes with large effects on the five NEO personality traits by performing genome-wide linkage analysis of individuals scoring in the extremes of these traits (>90th percentile). Affected-only linkage analysis was performed using an Illumina 6K linkage array in a family-based study, the Erasmus Rucphen Family study. We subsequently determined whether distinct, segregating haplotypes found with linkage analysis were associated with the trait of interest in the population. Finally, a dense single-nucleotide polymorphism genotyping array (Illumina 318K) was used to search for copy number variations (CNVs) in the associated regions. In the families with extreme phenotype scores, we found significant evidence of linkage for conscientiousness to 20p13 (rs1434789, log of odds (LOD)=5.86) and suggestive evidence of linkage (LOD >2.8) for neuroticism to 19q, 21q and 22q, extraversion to 1p, 1q, 9p and12q, openness to 12q and 19q, and agreeableness to 2p, 6q, 17q and 21q. Further analysis determined haplotypes in 21q22 for neuroticism (P-values = 0.009, 0.007), in 17q24 for agreeableness (marginal P-value = 0.018) and in 20p13 for conscientiousness (marginal P-values = 0.058, 0.038) segregating in families with large contributions to the LOD scores. No evidence for CNVs in any of the associated regions was found. Our findings imply that there may be genes with relatively large effects involved in personality traits, which may be identified with next-generation sequencing techniques.

  11. Genome-Wide Association Study on Male Genital Shape and Size in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Baku Takahara

    Full Text Available Male genital morphology of animals with internal fertilization and promiscuous mating systems have been one of the most diverse and rapidly evolving morphological traits. The male genital morphology in general is known to have low phenotypic and genetic variations, but the genetic basis of the male genital variation remains unclear. Drosophila melanogaster and its closely related species are morphologically very similar, but the shapes of the posterior lobe, a cuticular projection on the male genital arch are distinct from each other, representing a model system for studying the genetic basis of male genital morphology. In this study, we used highly inbred whole genome sequenced strains of D. melanogaster to perform genome wide association analysis on posterior lobe morphology. We quantified the outline shape of posterior lobes with Fourier coefficients obtained from elliptic Fourier analysis and performed principal component analysis, and posterior lobe size. The first and second principal components (PC1 and PC2 explained approximately 88% of the total variation of the posterior lobe shape. We then examined the association between the principal component scores and posterior lobe size and 1902142 single nucleotide polymorphisms (SNPs. As a result, we obtained 15, 14 and 15 SNPs for PC1, PC2 and posterior lobe size with P-values smaller than 10(-5. Based on the location of the SNPs, 13, 13 and six protein coding genes were identified as potential candidates for PC1, PC2 and posterior lobe size, respectively. In addition to the previous findings showing that the intraspecific posterior shape variation are regulated by multiple QTL with strong effects, the present study suggests that the intraspecific variation may be under polygenic regulation with a number of loci with small effects. Further studies are required for investigating whether these candidate genes are responsible for the intraspecific posterior lobe shape variation.

  12. Genome-wide association analysis of oxidative stress resistance in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Allison L Weber

    Full Text Available BACKGROUND: Aerobic organisms are susceptible to damage by reactive oxygen species. Oxidative stress resistance is a quantitative trait with population variation attributable to the interplay between genetic and environmental factors. Drosophila melanogaster provides an ideal system to study the genetics of variation for resistance to oxidative stress. METHODS AND FINDINGS: We used 167 wild-derived inbred lines of the Drosophila Genetic Reference Panel for a genome-wide association study of acute oxidative stress resistance to two oxidizing agents, paraquat and menadione sodium bisulfite. We found significant genetic variation for both stressors. Single nucleotide polymorphisms (SNPs associated with variation in oxidative stress resistance were often sex-specific and agent-dependent, with a small subset common for both sexes or treatments. Associated SNPs had moderately large effects, with an inverse relationship between effect size and allele frequency. Linear models with up to 12 SNPs explained 67-79% and 56-66% of the phenotypic variance for resistance to paraquat and menadione sodium bisulfite, respectively. Many genes implicated were novel with no known role in oxidative stress resistance. Bioinformatics analyses revealed a cellular network comprising DNA metabolism and neuronal development, consistent with targets of oxidative stress-inducing agents. We confirmed associations of seven candidate genes associated with natural variation in oxidative stress resistance through mutational analysis. CONCLUSIONS: We identified novel candidate genes associated with variation in resistance to oxidative stress that have context-dependent effects. These results form the basis for future translational studies to identify oxidative stress susceptibility/resistance genes that are evolutionary conserved and might play a role in human disease.

  13. Genome-wide Association Studies Identify Genetic Loci Associated With Albuminuria in Diabetes

    Science.gov (United States)

    Tin, Adrienne; Sorice, Rossella; Gorski, Mathias; Yeo, Nan Cher; Chu, Audrey Y.; Li, Man; Li, Yong; Mijatovic, Vladan; Ko, Yi-An; Taliun, Daniel; Luciani, Alessandro; Chen, Ming-Huei; Yang, Qiong; Foster, Meredith C.; Olden, Matthias; Hiraki, Linda T.; Tayo, Bamidele O.; Fuchsberger, Christian; Dieffenbach, Aida Karina; Shuldiner, Alan R.; Smith, Albert V.; Zappa, Allison M.; Lupo, Antonio; Kollerits, Barbara; Ponte, Belen; Stengel, Bénédicte; Krämer, Bernhard K.; Paulweber, Bernhard; Mitchell, Braxton D.; Hayward, Caroline; Helmer, Catherine; Meisinger, Christa; Gieger, Christian; Shaffer, Christian M.; Müller, Christian; Langenberg, Claudia; Ackermann, Daniel; Siscovick, David; Boerwinkle, Eric; Kronenberg, Florian; Ehret, Georg B.; Homuth, Georg; Waeber, Gerard; Navis, Gerjan; Gambaro, Giovanni; Malerba, Giovanni; Eiriksdottir, Gudny; Li, Guo; Wichmann, H. Erich; Grallert, Harald; Wallaschofski, Henri; Völzke, Henry; Brenner, Herrmann; Kramer, Holly; Leach, I. Mateo; Rudan, Igor; Hillege, Hans L.; Beckmann, Jacques S.; Lambert, Jean Charles; Luan, Jian'an; Zhao, Jing Hua; Chalmers, John; Coresh, Josef; Denny, Joshua C.; Butterbach, Katja; Launer, Lenore J.; Ferrucci, Luigi; Kedenko, Lyudmyla; Haun, Margot; Metzger, Marie; Woodward, Mark; Hoffman, Matthew J.; Nauck, Matthias; Waldenberger, Melanie; Pruijm, Menno; Bochud, Murielle; Rheinberger, Myriam; Verweij, Niek; Wareham, Nicholas J.; Endlich, Nicole; Soranzo, Nicole; Polasek, Ozren; van der Harst, Pim; Pramstaller, Peter Paul; Vollenweider, Peter; Wild, Philipp S.; Gansevoort, Ron T.; Rettig, Rainer; Biffar, Reiner; Carroll, Robert J.; Katz, Ronit; Loos, Ruth J.F.; Hwang, Shih-Jen; Coassin, Stefan; Bergmann, Sven; Rosas, Sylvia E.; Stracke, Sylvia; Harris, Tamara B.; Corre, Tanguy; Zeller, Tanja; Illig, Thomas; Aspelund, Thor; Tanaka, Toshiko; Lendeckel, Uwe; Völker, Uwe; Gudnason, Vilmundur; Chouraki, Vincent; Koenig, Wolfgang; Kutalik, Zoltan; O'Connell, Jeffrey R.; Parsa, Afshin; Heid, Iris M.; Paterson, Andrew D.; de Boer, Ian H.; Devuyst, Olivier; Lazar, Jozef; Endlich, Karlhans; Susztak, Katalin; Tremblay, Johanne; Hamet, Pavel; Jacob, Howard J.; Böger, Carsten A.

    2016-01-01

    Elevated concentrations of albumin in the urine, albuminuria, are a hallmark of diabetic kidney disease and are associated with an increased risk for end-stage renal disease and cardiovascular events. To gain insight into the pathophysiological mechanisms underlying albuminuria, we conducted meta-analyses of genome-wide association studies and independent replication in up to 5,825 individuals of European ancestry with diabetes and up to 46,061 without diabetes, followed by functional studies. Known associations of variants in CUBN, encoding cubilin, with the urinary albumin-to-creatinine ratio (UACR) were confirmed in the overall sample (P = 2.4 × 10−10). Gene-by-diabetes interactions were detected and confirmed for variants in HS6ST1 and near RAB38/CTSC. Single nucleotide polymorphisms at these loci demonstrated a genetic effect on UACR in individuals with but not without diabetes. The change in the average UACR per minor allele was 21% for HS6ST1 (P = 6.3 × 10–7) and 13% for RAB38/CTSC (P = 5.8 × 10−7). Experiments using streptozotocin-induced diabetic Rab38 knockout and control rats showed higher urinary albumin concentrations and reduced amounts of megalin and cubilin at the proximal tubule cell surface in Rab38 knockout versus control rats. Relative expression of RAB38 was higher in tubuli of patients with diabetic kidney disease compared with control subjects. The loci identified here confirm known pathways and highlight novel pathways influencing albuminuria. PMID:26631737

  14. Patterns of Genome-Wide Variation in Glossina fuscipes fuscipes Tsetse Flies from Uganda.

    Science.gov (United States)

    Gloria-Soria, Andrea; Dunn, W Augustine; Telleria, Erich L; Evans, Benjamin R; Okedi, Loyce; Echodu, Richard; Warren, Wesley C; Montague, Michael J; Aksoy, Serap; Caccone, Adalgisa

    2016-06-01

    The tsetse fly Glossina fuscipes fuscipes (Gff) is the insect vector of the two forms of Human African Trypanosomiasis (HAT) that exist in Uganda. Understanding Gff population dynamics, and the underlying genetics of epidemiologically relevant phenotypes is key to reducing disease transmission. Using ddRAD sequence technology, complemented with whole-genome sequencing, we developed a panel of ∼73,000 single-nucleotide polymorphisms (SNPs) distributed across the Gff genome that can be used for population genomics and to perform genome-wide-association studies. We used these markers to estimate genomic patterns of linkage disequilibrium (LD) in Gff, and used the information, in combination with outlier-locus detection tests, to identify candidate regions of the genome under selection. LD in individual populations decays to half of its maximum value (r(2) max/2) between 1359 and 2429 bp. The overall LD estimated for the species reaches r(2) max/2 at 708 bp, an order of magnitude slower than in Drosophila Using 53 infected (Trypanosoma spp.) and uninfected flies from four genetically distinct Ugandan populations adapted to different environmental conditions, we were able to identify SNPs associated with the infection status of the fly and local environmental adaptation. The extent of LD in Gff likely facilitated the detection of loci under selection, despite the small sample size. Furthermore, it is probable that LD in the regions identified is much higher than the average genomic LD due to strong selection. Our results show that even modest sample sizes can reveal significant genetic associations in this species, which has implications for future studies given the difficulties of collecting field specimens with contrasting phenotypes for association analysis.

  15. Genome-Wide Association of CKD Progression: The Chronic Renal Insufficiency Cohort Study.

    Science.gov (United States)

    Parsa, Afshin; Kanetsky, Peter A; Xiao, Rui; Gupta, Jayanta; Mitra, Nandita; Limou, Sophie; Xie, Dawei; Xu, Huichun; Anderson, Amanda Hyre; Ojo, Akinlolu; Kusek, John W; Lora, Claudia M; Hamm, L Lee; He, Jiang; Sandholm, Niina; Jeff, Janina; Raj, Dominic E; Böger, Carsten A; Bottinger, Erwin; Salimi, Shabnam; Parekh, Rulan S; Adler, Sharon G; Langefeld, Carl D; Bowden, Donald W; Groop, Per-Henrik; Forsblom, Carol; Freedman, Barry I; Lipkowitz, Michael; Fox, Caroline S; Winkler, Cheryl A; Feldman, Harold I

    2017-03-01

    The rate of decline of renal function varies significantly among individuals with CKD. To understand better the contribution of genetics to CKD progression, we performed a genome-wide association study among participants in the Chronic Renal Insufficiency Cohort Study. Our outcome of interest was CKD progression measured as change in eGFR over time among 1331 blacks and 1476 whites with CKD. We stratified all analyses by race and subsequently, diabetes status. Single-nucleotide polymorphisms (SNPs) that surpassed a significance threshold of P<1×10(-6) for association with eGFR slope were selected as candidates for follow-up and secondarily tested for association with proteinuria and time to ESRD. We identified 12 such SNPs among black patients and six such SNPs among white patients. We were able to conduct follow-up analyses of three candidate SNPs in similar (replication) cohorts and eight candidate SNPs in phenotype-related (validation) cohorts. Among blacks without diabetes, rs653747 in LINC00923 replicated in the African American Study of Kidney Disease and Hypertension cohort (discovery P=5.42×10(-7); replication P=0.039; combined P=7.42×10(-9)). This SNP also associated with ESRD (hazard ratio, 2.0 (95% confidence interval, 1.5 to 2.7); P=4.90×10(-6)). Similarly, rs931891 in LINC00923 associated with eGFR decline (P=1.44×10(-4)) in white patients without diabetes. In summary, SNPs in LINC00923, an RNA gene expressed in the kidney, significantly associated with CKD progression in individuals with nondiabetic CKD. However, the lack of equivalent cohorts hampered replication for most discovery loci. Further replication of our findings in comparable study populations is warranted.

  16. Genome-wide association study of treatment refractory schizophrenia in Han Chinese.

    Directory of Open Access Journals (Sweden)

    Ying-Jay Liou

    Full Text Available We report the first genome-wide association study of a joint analysis using 795 Han Chinese individuals with treatment-refractory schizophrenia (TRS and 806 controls. Three loci showed suggestive significant association with TRS were identified. These loci include: rs10218843 (P = 3.04 × 10(-7 and rs11265461 (P = 1.94 × 10(-7 are adjacent to signaling lymphocytic activation molecule family member 1 (SLAMF1; rs4699030 (P = 1.94 × 10(-6 and rs230529 (P = 1.74 × 10(-7 are located in the gene nuclear factor of kappa light polypeptide gene enhancer in B-cells 1 (NFKB1; and rs13049286 (P = 3.05 × 10(-5 and rs3827219 (P = 1.66 × 10(-5 fall in receptor-interacting serine/threonine-protein kinase 4 (RIPK4. One isolated single nucleotide polymorphism (SNP, rs739617 (P = 3.87 × 10(-5 was also identified to be associated with TRS. The -94delATTG allele (rs28362691 located in the promoter region of NFKB1 was identified by resequencing and was found to associate with TRS (P = 4.85 × 10(-6. The promoter assay demonstrated that the -94delATTG allele had a significant lower promoter activity than the -94insATTG allele in the SH-SY5Y cells. This study suggests that rs28362691 in NFKB1 might be involved in the development of TRS.

  17. Genome-wide association studies in nephrology: using known associations for data checks.

    Science.gov (United States)

    Wuttke, Matthias; Schaefer, Franz; Wong, Craig S; Köttgen, Anna

    2015-02-01

    Prior to conducting genome-wide association studies (GWAS) of renal traits and diseases, systematic checks to ensure data integrity and analytical work flow should be conducted. Using positive controls (ie, known associations between a single-nucleotide polymorphism [SNP] and a corresponding trait) allows for identifying errors that are not apparent solely from global evaluation of summary statistics. Strong genetic control associations of chronic kidney disease (CKD), as derived from GWAS, are lacking in the non-African ancestry CKD population; thus, in this perspective, we provide examples of and considerations for using positive controls among patients with CKD. Using data from individuals with CKD who participated in the CRIC (Chronic Renal Insufficiency Cohort) Study or PediGFR (Pediatric Investigation for Genetic Factors Linked to Renal Progression) Consortium, we evaluated 2 kinds of positive control traits: traits unrelated to kidney function (bilirubin level and body height) and those related to kidney function (cystatin C and urate levels). For the former, the proportion of variance in the control trait that is explained by the control SNP is the main determinant of the strength of the observable association, irrespective of adjustment for kidney function. For the latter, adjustment for kidney function can be effective in uncovering known associations among patients with CKD. For instance, in 1,092 participants in the PediGFR Consortium, the P value for the association of cystatin C concentrations and rs911119 in the CST3 gene decreased from 2.7×10(-3) to 2.4×10(-8) upon adjustment for serum creatinine-based estimated glomerular filtration rate. In this perspective, we give recommendations for the appropriate selection of control traits and SNPs that can be used for data checks prior to conducting GWAS among patients with CKD.

  18. Genome-wide identification of genetic determinants for the cytotoxicity of perifosine

    Directory of Open Access Journals (Sweden)

    Zhang Wei

    2008-09-01

    Full Text Available Abstract Perifosine belongs to the class of alkylphospholipid analogues, which act primarily at the cell membrane, thereby targeting signal transduction pathways. In phase I/II clinical trials, perifosine has induced tumour regression and caused disease stabilisation in a variety of tumour types. The genetic determinants responsible for its cytotoxicity have not been comprehensively studied, however. We performed a genome-wide analysis to identify genes whose expression levels or genotypic variation were correlated with the cytotoxicity of perifosine, using public databases on the US National Cancer Institute (NCI-60 human cancer cell lines. For demonstrating drug specificity, the NCI Standard Agent Database (including 171 drugs acting through a variety of mechanisms was used as a control. We identified agents with similar cytotoxicity profiles to that of perifosine in compounds used in the NCI drug screen. Furthermore, Gene Ontology and pathway analyses were carried out on genes more likely to be perifosine specific. The results suggested that genes correlated with perifosine cytotoxicity are connected by certain known pathways that lead to the mitogen-activated protein kinase signalling pathway and apoptosis. Biological processes such as 'response to stress', 'inflammatory response' and 'ubiquitin cycle' were enriched among these genes. Three single nucleotide polymorphisms (SNPs located in CACNA2DI and EXOC4 were found to be correlated with perifosine cytotoxicity. Our results provided a manageable list of genes whose expression levels or genotypic variation were strongly correlated with the cytotoxcity of perifosine. These genes could be targets for further studies using candidate-gene approaches. The results also provided insights into the pharmacodynamics of perifosine.

  19. Genome-wide study of gene variants associated with differential cardiovascular event reduction by pravastatin therapy.

    Directory of Open Access Journals (Sweden)

    Dov Shiffman

    Full Text Available Statin therapy reduces the risk of coronary heart disease (CHD, however, the person-to-person variability in response to statin therapy is not well understood. We have investigated the effect of genetic variation on the reduction of CHD events by pravastatin. First, we conducted a genome-wide association study of 682 CHD cases from the Cholesterol and Recurrent Events (CARE trial and 383 CHD cases from the West of Scotland Coronary Prevention Study (WOSCOPS, two randomized, placebo-controlled studies of pravastatin. In a combined case-only analysis, 79 single nucleotide polymorphisms (SNPs were associated with differential CHD event reduction by pravastatin according to genotype (P<0.0001, and these SNPs were analyzed in a second stage that included cases as well as non-cases from CARE and WOSCOPS and patients from the PROspective Study of Pravastatin in the Elderly at Risk/PHArmacogenomic study of Statins in the Elderly at risk for cardiovascular disease (PROSPER/PHASE, a randomized placebo controlled study of pravastatin in the elderly. We found that one of these SNPs (rs13279522 was associated with differential CHD event reduction by pravastatin therapy in all 3 studies: P = 0.002 in CARE, P = 0.01 in WOSCOPS, P = 0.002 in PROSPER/PHASE. In a combined analysis of CARE, WOSCOPS, and PROSPER/PHASE, the hazard ratio for CHD when comparing pravastatin with placebo decreased by a factor of 0.63 (95% CI: 0.52 to 0.75 for each extra copy of the minor allele (P = 4.8 × 10(-7. This SNP is located in DnaJ homolog subfamily C member 5B (DNAJC5B and merits investigation in additional randomized studies of pravastatin and other statins.

  20. Genome-wide association study of metabolic traits reveals novel gene-metabolite-disease links.

    Directory of Open Access Journals (Sweden)

    Rico Rueedi

    2014-02-01

    Full Text Available Metabolic traits are molecular phenotypes that can drive clinical phenotypes and may predict disease progression. Here, we report results from a metabolome- and genome-wide association study on (1H-NMR urine metabolic profiles. The study was conducted within an untargeted approach, employing a novel method for compound identification. From our discovery cohort of 835 Caucasian individuals who participated in the CoLaus study, we identified 139 suggestively significant (P<5×10(-8 and independent associations between single nucleotide polymorphisms (SNP and metabolome features. Fifty-six of these associations replicated in the TasteSensomics cohort, comprising 601 individuals from São Paulo of vastly diverse ethnic background. They correspond to eleven gene-metabolite associations, six of which had been previously identified in the urine metabolome and three in the serum metabolome. Our key novel findings are the associations of two SNPs with NMR spectral signatures pointing to fucose (rs492602, P = 6.9×10(-44 and lysine (rs8101881, P = 1.2×10(-33, respectively. Fine-mapping of the first locus pinpointed the FUT2 gene, which encodes a fucosyltransferase enzyme and has previously been associated with Crohn's disease. This implicates fucose as a potential prognostic disease marker, for which there is already published evidence from a mouse model. The second SNP lies within the SLC7A9 gene, rare mutations of which have been linked to severe kidney damage. The replication of previous associations and our new discoveries demonstrate the potential of untargeted metabolomics GWAS to robustly identify molecular disease markers.

  1. Patterns of Genome-Wide Variation in Glossina fuscipes fuscipes Tsetse Flies from Uganda

    Directory of Open Access Journals (Sweden)

    Andrea Gloria-Soria

    2016-06-01

    Full Text Available The tsetse fly Glossina fuscipes fuscipes (Gff is the insect vector of the two forms of Human African Trypanosomiasis (HAT that exist in Uganda. Understanding Gff population dynamics, and the underlying genetics of epidemiologically relevant phenotypes is key to reducing disease transmission. Using ddRAD sequence technology, complemented with whole-genome sequencing, we developed a panel of ∼73,000 single-nucleotide polymorphisms (SNPs distributed across the Gff genome that can be used for population genomics and to perform genome-wide-association studies. We used these markers to estimate genomic patterns of linkage disequilibrium (LD in Gff, and used the information, in combination with outlier-locus detection tests, to identify candidate regions of the genome under selection. LD in individual populations decays to half of its maximum value (r2max/2 between 1359 and 2429 bp. The overall LD estimated for the species reaches r2max/2 at 708 bp, an order of magnitude slower than in Drosophila. Using 53 infected (Trypanosoma spp. and uninfected flies from four genetically distinct Ugandan populations adapted to different environmental conditions, we were able to identify SNPs associated with the infection status of the fly and local environmental adaptation. The extent of LD in Gff likely facilitated the detection of loci under selection, despite the small sample size. Furthermore, it is probable that LD in the regions identified is much higher than the average genomic LD due to strong selection. Our results show that even modest sample sizes can reveal significant genetic associations in this species, which has implications for future studies given the difficulties of collecting field specimens with contrasting phenotypes for association analysis.

  2. Genome-wide SNP discovery in mungbean by Illumina HiSeq.

    Science.gov (United States)

    Van, Kyujung; Kang, Yang Jae; Han, Kwang-Soo; Lee, Yeong-Ho; Gwag, Jae-Gyun; Moon, Jung-Kyung; Lee, Suk-Ha

    2013-08-01

    Mungbean [Vigna radiata (L.) Wilczek], a self-pollinated diploid plant with 2n = 22 chromosomes, is an important legume crop with a high-quality amino acid profile. Sequence variation at the whole-genome level was examined by comparing two mungbean cultivars, Sunhwanokdu and Gyeonggijaerae 5, using Illumina HiSeq sequencing data. More than 40 billion bp from both mungbean cultivars were sequenced to a depth of 72×. After de novo assembly of Sunhwanokdu contigs by ABySS 1.3.2 (N50 = 9,958 bp), those longer than 10 kb were aligned with Gyeonggijaerae 5 reads using the Burrows-Wheeler Aligner. SAMTools was used for retrieving single nucleotide polymorphisms (SNPs) between Sunhwanokdu and Gyeonggijaerae 5, defining the lowest and highest depths as 5 and 100, respectively, and the sequence quality as 100. Of the 305,504 single-base changes identified, 40,503 SNPs were considered heterozygous in Gyeonggijaerae 5. Among the remaining 265,001 SNPs, 65.9 % (174,579 cases) were transitions and 34.1 % (90,422 cases) were transversions. For SNP validation, a total of 42 SNPs were chosen among Sunhwanokdu contigs longer than 10 kb and sharing at least 80 % sequence identity with common bean expressed sequence tags as determined with est2genome. Using seven mungbean cultivars from various origins in addition to Sunhwanokdu and Gyeonggijaerae 5, most of the SNPs identified by bioinformatics tools were confirmed by Sanger sequencing. These genome-wide SNP markers could enrich the current molecular resources and might be of value for the construction of a mungbean genetic map and the investigation of genetic diversity.

  3. A Genome-Wide Association Study of Vertical Cup-Disc Ratio in a Latino Population

    Science.gov (United States)

    Nannini, Drew R.; Torres, Mina; Chen, Yii-Der I.; Taylor, Kent D.; Rotter, Jerome I.; Varma, Rohit; Gao, Xiaoyi

    2017-01-01

    Purpose Vertical cup-disc ratio (VCDR) is used as a clinical assessment measure to identify and monitor glaucomatous damage to the optic nerve. Previous genetic studies conducted in European and Asian populations have identified many loci associated with VCDR. The genetic factors in other ethnic populations, such as Latino, influencing VCDR remain to be determined. Here, we describe the first genome-wide association study (GWAS) on VCDR in Latino individuals. Methods We conducted this GWAS on VCDR using 4537 Latino individuals who were genotyped by using either the Illumina OmniExpress BeadChip (∼730K markers) or the Illumina Hispanic/SOL BeadChip (∼2.5 million markers). Study subjects were 40 years of age and older. Linear regression, adjusting for age, sex, and principal components of genetic ancestry, was conducted to assess the associations between single nucleotide polymorphisms (SNPs) and VCDR. We imputed SNPs from the 1000 Genomes Project to integrate additional SNPs not directly genotyped. Results We replicated two previously reported SNPs that reached GWAS significance, rs1900005 and rs7916697, in the ATOH7-PBLD region, as well as identified two suggestive associations in the CDC7-TGFBR3 region on chromosome 1p22.1 and in the ZNF770-DPH6 region on chromosome 15q14. We discovered a novel SNP, rs56238729 (P = 1.22 × 10−13), in the ATOH7-PBLD region that is significantly associated with VCDR in Latino individuals. We replicated eight previously reported regions, including COL8A1, CDKN2B-CDKN2BAS, BMP2, and CHEK2 (P < 2.17 × 10−3). Conclusions Our results discovered a novel SNP that is significantly associated with VCDR in Latino individuals and confirmed previously reported loci, providing further insight into the genetic architecture of VCDR. PMID:28061514

  4. Hippocampal atrophy as a quantitative trait in a genome-wide association study identifying novel susceptibility genes for Alzheimer's disease.

    Directory of Open Access Journals (Sweden)

    Steven G Potkin

    Full Text Available BACKGROUND: With the exception of APOE epsilon4 allele, the common genetic risk factors for sporadic Alzheimer's Disease (AD are unknown. METHODS AND FINDINGS: We completed a genome-wide association study on 381 participants in the ADNI (Alzheimer's Disease Neuroimaging Initiative study. Samples were genotyped using the Illumina Human610-Quad BeadChip. 516,645 unique Single Nucleotide Polymorphisms (SNPs were included in the analysis following quality control measures. The genotype data and raw genetic data are freely available for download (LONI, http://www.loni.ucla.edu/ADNI/Data/. Two analyses were completed: a standard case-control analysis, and a novel approach using hippocampal atrophy measured on MRI as an objectively defined, quantitative phenotype. A General Linear Model was applied to identify SNPs for which there was an interaction between the genotype and diagnosis on the quantitative trait. The case-control analysis identified APOE and a new risk gene, TOMM40 (translocase of outer mitochondrial membrane 40, at a genome-wide significance level of < or =10(-6 (10(-11 for a haplotype. TOMM40 risk alleles were approximately twice as frequent in AD subjects as controls. The quantitative trait analysis identified 21 genes or chromosomal areas with at least one SNP with a p-value < or =10(-6, which can be considered potential "new" candidate loci to explore in the etiology of sporadic AD. These candidates included EFNA5, CAND1, MAGI2, ARSB, and PRUNE2, genes involved in the regulation of protein degradation, apoptosis, neuronal loss and neurodevelopment. Thus, we identified common genetic variants associated with the increased risk of developing AD in the ADNI cohort, and present publicly available genome-wide data. Supportive evidence based on case-control studies and biological plausibility by gene annotation is provided. Currently no available sample with both imaging and genetic data is available for replication. CONCLUSIONS: Using

  5. A genome-wide association study in chronic obstructive pulmonary disease (COPD: identification of two major susceptibility loci.

    Directory of Open Access Journals (Sweden)

    Sreekumar G Pillai

    2009-03-01

    Full Text Available There is considerable variability in the susceptibility of smokers to develop chronic obstructive pulmonary disease (COPD. The only known genetic risk factor is severe deficiency of alpha(1-antitrypsin, which is present in 1-2% of individuals with COPD. We conducted a genome-wide association study (GWAS in a homogenous case-control cohort from Bergen, Norway (823 COPD cases and 810 smoking controls and evaluated the top 100 single nucleotide polymorphisms (SNPs in the family-based International COPD Genetics Network (ICGN; 1891 Caucasian individuals from 606 pedigrees study. The polymorphisms that showed replication were further evaluated in 389 subjects from the US National Emphysema Treatment Trial (NETT and 472 controls from the Normative Aging Study (NAS and then in a fourth cohort of 949 individuals from 127 extended pedigrees from the Boston Early-Onset COPD population. Logistic regression models with adjustments of covariates were used to analyze the case-control populations. Family-based association analyses were conducted for a diagnosis of COPD and lung function in the family populations. Two SNPs at the alpha-nicotinic acetylcholine receptor (CHRNA 3/5 locus were identified in the genome-wide association study. They showed unambiguous replication in the ICGN family-based analysis and in the NETT case-control analysis with combined p-values of 1.48 x 10(-10, (rs8034191 and 5.74 x 10(-10 (rs1051730. Furthermore, these SNPs were significantly associated with lung function in both the ICGN and Boston Early-Onset COPD populations. The C allele of the rs8034191 SNP was estimated to have a population attributable risk for COPD of 12.2%. The association of hedgehog interacting protein (HHIP locus on chromosome 4 was also consistently replicated, but did not reach genome-wide significance levels. Genome-wide significant association of the HHIP locus with lung function was identified in the Framingham Heart study (Wilk et al., companion article

  6. Genome-wide association study for subclinical atherosclerosis in major arterial territories in the NHLBI's Framingham Heart Study

    Directory of Open Access Journals (Sweden)

    Hwang Shih-Jen

    2007-09-01

    Full Text Available Abstract Introduction Subclinical atherosclerosis (SCA measures in multiple arterial beds are heritable phenotypes that are associated with increased incidence of cardiovascular disease. We conducted a genome-wide association study (GWAS for SCA measurements in the community-based Framingham Heart Study. Methods Over 100,000 single nucleotide polymorphisms (SNPs were genotyped (Human 100K GeneChip, Affymetrix in 1345 subjects from 310 families. We calculated sex-specific age-adjusted and multivariable-adjusted residuals in subjects tested for quantitative SCA phenotypes, including ankle-brachial index, coronary artery calcification and abdominal aortic calcification using multi-detector computed tomography, and carotid intimal medial thickness (IMT using carotid ultrasonography. We evaluated associations of these phenotypes with 70,987 autosomal SNPs with minor allele frequency ≥ 0.10, call rate ≥ 80%, and Hardy-Weinberg p-value ≥ 0.001 in samples ranging from 673 to 984 subjects, using linear regression with generalized estimating equations (GEE methodology and family-based association testing (FBAT. Variance components LOD scores were also calculated. Results There was no association result meeting criteria for genome-wide significance, but our methods identified 11 SNPs with p -5 by GEE and five SNPs with p -5 by FBAT for multivariable-adjusted phenotypes. Among the associated variants were SNPs in or near genes that may be considered candidates for further study, such as rs1376877 (GEE p ABI2 for maximum internal carotid artery IMT and rs4814615 (FBAT p = 0.000003, located in PCSK2 for maximum common carotid artery IMT. Modest significant associations were noted with various SCA phenotypes for variants in previously reported atherosclerosis candidate genes, including NOS3 and ESR1. Associations were also noted of a region on chromosome 9p21 with CAC phenotypes that confirm associations with coronary heart disease and CAC in two

  7. Impact of Il28b-related single nucleotide polymorphisms on liver transient elastography in chronic hepatitis C infection.

    Directory of Open Access Journals (Sweden)

    Magdalena Ydreborg

    Full Text Available BACKGROUND AND AIMS: Recently, several genome-wide association studies have revealed that single nucleotide polymorphisms (SNPs in proximity to IL28B predict spontaneous clearance of hepatitis C virus (HCV infection as well as outcome following pegylated interferon and ribavirin therapy among genotype 1 infected patients. Additionally the presence of the otherwise favorable IL28B genetic variants in the context of HCV genotype 3 infection reportedly entail more pronounced liver fibrosis and steatosis. The present study aimed to evaluate the impact of IL28B SNP variability on liver stiffness as accessed by transient elastography. METHODS: Seven hundred and seventy-one Swedish HCV infected patients sequentially undergoing liver stiffness measurement by means of Fibroscan® in the context of a real-life trial had samples available for IL28B genotyping (rs12979860 and HCV genotyping. RESULTS: CC(rs12979860 was more common among HCV genotype 2 or 3 infected treatment-naïve patients than among those infected with genotype 1 (P<0.0001. Additionally CC(rs12979860 among HCV genotype 3 infected patients was associated with higher liver stiffness values (P = 0.004, and higher AST to platelet ratio index (APRI; p = 0.02 as compared to carriers of the T allele. Among HCV genotype 1 infected patients, CC(rs12979860 was significantly associated with higher viral load (P = 0.001, with a similar non-significant trend noted among HCV genotype 3 infected patients. CONCLUSION: This study confirms previous reports that the CC(rs12979860 SNP is associated with more pronounced liver pathology in patients chronically infected with HCV genotype 3 as compared to genotype 1, suggesting that IL28B genetic variants differently regulates the course of HCV infection across HCV genotypes.

  8. Identification and validation of single nucleotide polymorphic markers linked to Ug99 stem rust resistance in spring wheat

    Science.gov (United States)

    Chao, Shiaoman; Singh, Ravi P.; Sorrells, Mark E.

    2017-01-01

    Wheat stem rust (Puccinia graminis f. sp. tritici Eriks. and E. Henn.) is one of the most destructive diseases world-wide. Races belonging to Ug99 (or TTKSK) continue to cause crop losses in East Africa and threaten global wheat production. Developing and deploying wheat varieties with multiple race-specific genes or complex adult plant resistance is necessary to achieve durability. In the present study, we applied genome-wide association studies (GWAS) for identifying loci associated with the Ug99 stem rust resistance (SR) in a panel of wheat lines developed at the International Maize and Wheat Improvement Center (CIMMYT). Genotyping was carried out using the wheat 9K iSelect single nucleotide polymorphism (SNP) chip. Phenotyping was done in the field in Kenya by infection of Puccinia graminis f. sp. tritici race TTKST, the Sr24-virulent variant of Ug99. Marker-trait association identified 12 SNP markers significantly associated with resistance. Among them, 7 were mapped on five chromosomes. Markers located on chromosomes 4A and 4B overlapped with the location of the Ug99 resistance genes SrND643 and Sr37, respectively. Markers identified on 7DL were collocated with Sr25. Additional significant markers were located in the regions where no Sr gene has been reported. The chromosome location for five of the SNP markers was unknown. A BLASTN search of the NCBI database using the flanking sequences of the SNPs associated with Ug99 resistance revealed that several markers were linked to plant disease resistance analogues, while others were linked to regulatory factors or metabolic enzymes. A KASP (Kompetitive Allele Specific PCR) assay was used for validating six marker loci linked to genes with resistance to Ug99. Of those, four co-segregated with the Sr25-pathotypes while the rest identified unknown resistance genes. With further investigation, these markers can be used for marker-assisted selection in breeding for Ug99 stem rust resistance in wheat. PMID:28241006

  9. Optimization of Bartonella henselae multilocus sequence typing scheme using single-nucleotide polymorphism analysis of SOLiD sequence data

    Institute of Scientific and Technical Information of China (English)

    ZHAO Fan; Gemma Chaloner; Alistair Darby; SONG Xiu-ping; LI Dong-mei; Richard Birtles; LIU Qi-yong

    2012-01-01

    Background Multi-locus sequence typing (MLST) is widely used to explore the population structure of numerous bacterial pathogens.However,for genotypically-restricted pathogens,the sensitivity of MLST is limited by a paucity of variation within selected loci.For Bartonella henselae (B.henselae),although the MLST scheme currently used has been proven useful in defining the overall population structure of the species,its reliability for the accurate delineation of closely-related sequence types,between which allelic variation is usually limited to,at most,one or two nucleotide polymorphisms.Exploitation of high-throughput sequencing data allows a more informed selection of MLST loci and thus,potentially,a means of enhancing the sensitivity of the schemes they comprise.Methods We carried out SOLiD resequencing on 12 representative B.henselae isolates and explored these data using single nucleotide polymorphism (SNP) analysis.We determined the number and distribution of SNPs in the genes targeted by the established MLST scheme and modified the position of loci within these genes to capture as much genetic variation as possible.Results Using genome-wide SNP data,we found the distribution of SNPs within each open reading frame (ORF) of MLST loci,which were not represented by the established B.henselae MLST scheme.We then modified the position of loci in the MLST scheme to better reflect the polymorphism in the ORF as a whole.The use of amended loci in this scheme allowed previously indistinguishable ST1 strains to be differentiated.However,the diversity of B.henselae was still rare in China.Conclusions Our study demonstrates the use of SNP analysis to facilitate the selection of MLST loci to augment the currently-described scheme for B.henselae.And the diversity among B.henselae strains in China is markedly less than that observed in B.henselae populations elsewhere in the world.

  10. Complex-disease networks of trait-associated single-nucleotide polymorphisms (SNPs) unveiled by information theory

    Science.gov (United States)

    Li, Haiquan; Lee, Younghee; Chen, James L; Rebman, Ellen; Li, Jianrong

    2012-01-01

    Objective Thousands of complex-disease single-nucleotide polymorphisms (SNPs) have been discovered in genome-wide association studies (GWAS). However, these intragenic SNPs have not been collectively mined to unveil the genetic architecture between complex clinical traits. The authors hypothesize that biological annotations of host genes of trait-associated SNPs may reveal the biomolecular modularity across complex-disease traits and offer insights for drug repositioning. Methods Trait-to-polymorphism (SNPs) associations confirmed in GWAS were used. A novel method to quantify trait–trait similarity anchored in Gene Ontology annotations of human proteins and information theory was developed. The results were then validated with the shortest paths of physical protein interactions between biologically similar traits. Results A network was constructed consisting of 280 significant intertrait similarities among 177 disease traits, which covered 1438 well-validated disease-associated SNPs. Thirty-nine percent of intertrait connections were confirmed by curators, and the following additional studies demonstrated the validity of a proportion of the remainder. On a phenotypic trait level, higher Gene Ontology similarity between proteins correlated with smaller ‘shortest distance’ in protein interaction networks of complexly inherited diseases (Spearman p<2.2×10−16). Further, ‘cancer traits’ were similar to one another, as were ‘metabolic syndrome traits’ (Fisher's exact test p=0.001 and 3.5×10−7, respectively). Conclusion An imputed disease network by information-anchored functional similarity from GWAS trait-associated SNPs is reported. It is also demonstrated that small shortest paths of protein interactions correlate with complex-disease function. Taken together, these findings provide the framework for investigating drug targets with unbiased functional biomolecular networks rather than worn-out single-gene and subjective canonical pathway approaches

  11. Associations of Six Single Nucleotide Polymorphisms in Obesity-Related Genes With BMI and Risk of Obesity in Chinese Children

    Science.gov (United States)

    Wu, Lijun; Xi, Bo; Zhang, Meixian; Shen, Yue; Zhao, Xiaoyuan; Cheng, Hong; Hou, Dongqing; Sun, Dandan; Ott, Jurg; Wang, Xingyu; Mi, Jie

    2010-01-01

    OBJECTIVE Childhood obesity strongly predisposes to some adult diseases. Recently, genome-wide association (GWA) studies in Caucasians identified multiple single nucleotide polymorphisms (SNPs) associated with BMI and obesity. The associations of those SNPs with BMI and obesity among other ethnicities are not fully described, especially in children. Among those previously identified SNPs, we selected six (rs7138803, rs1805081, rs6499640, rs17782313, rs6265, and rs10938397, in or near obesity-related genes FAIM2, NPC1, FTO, MC4R, BDNF, and GNPDA2, respectively) because of the relatively high minor allele frequencies in Chinese individuals and tested the associations of the SNPs with BMI and obesity in Chinese children. RESEARCH DESIGN AND METHODS We investigated the associations of these SNPs with BMI and obesity in school-aged children. A total of 3,503 children participated in the study, including 1,229 obese, 655 overweight, and 1,619 normal-weight children (diagnosed by the Chinese age- and sex-specific BMI cutoffs). RESULTS After age and sex adjustment and correction for multiple testing, the SNPs rs17782313, rs6265, and rs10938397 were associated with BMI (P = 1.0 × 10−5, 0.038, and 0.00093, respectively) and also obesity (P = 5.0 × 10−6, 0.043, and 0.00085, respectively) in the Chinese children. The SNPs rs17782313 and rs10938397 were also significantly associated with waist circumference, waist-to-height ratio, and fat mass percentage. CONCLUSIONS Results of this study support obesity-related genes in adults as important genes for BMI variation in children and suggest that some SNPs identified by GWA studies in Caucasians also confer risk for obesity in Chinese children. PMID:20843981

  12. Integrating multiple genomic data to predict disease-causing nonsynonymous single nucleotide variants in exome sequencing studies.

    Directory of Open Access Journals (Sweden)

    Jiaxin Wu

    2014-03-01

    Full Text Available Exome sequencing has been widely used in detecting pathogenic nonsynonymous single nucleotide variants (SNVs for human inherited diseases. However, traditional statistical genetics methods are ineffective in analyzing exome sequencing data, due to such facts as the large number of sequenced variants, the presence of non-negligible fraction of pathogenic rare variants or de novo mutations, and the limited size of affected and normal populations. Indeed, prevalent applications of exome sequencing have been appealing for an effective computational method for identifying causative nonsynonymous SNVs from a large number of sequenced variants. Here, we propose a bioinformatics approach called SPRING (Snv PRioritization via the INtegration of Genomic data for identifying pathogenic nonsynonymous SNVs for a given query disease. Based on six functional effect scores calculated by existing methods (SIFT, PolyPhen2, LRT, MutationTaster, GERP and PhyloP and five association scores derived from a variety of genomic data sources (gene ontology, protein-protein interactions, protein sequences, protein domain annotations and gene pathway annotations, SPRING calculates the statistical significance that an SNV is causative for a query disease and hence provides a means of prioritizing candidate SNVs. With a series of comprehensive validation experiments, we demonstrate that SPRING is valid for diseases whose genetic bases are either partly known or completely unknown and effective for diseases with a variety of inheritance styles. In applications of our method to real exome sequencing data sets, we show the capability of SPRING in detecting causative de novo mutations for autism, epileptic encephalopathies and intellectual disability. We further provide an online service, the standalone software and genome-wide predictions of causative SNVs for 5,080 diseases at http://bioinfo.au.tsinghua.edu.cn/spring.

  13. Association between single nucleotide polymorphisms on chromosome 17q and the risk of prostate cancer in a Chinese population

    Institute of Scientific and Technical Information of China (English)

    Chang-Hu Zhou; Dong Wei; Ze Yang; Jian-Ye Wang; Su-Yan Cao; Xiao-Hong Shi; Yao-Guang Zhang; Ming Liu; Xin Wang; Jin Huang; Yi-Ge Yang

    2011-01-01

    In European populations,7 single nucleotide polymorphisms(SNPs) on chromosome 17q,3 SNPs on 17812,and 4 SNPs on 17824.3 were recently identified to be closely related to the risk of prostate cancer by a genome-wide association study.In Japanese populations,the correlation between 2 SNPs on 17q and the risk of prostate cancer and tumor aggressiveness was also confirmed by a large-scale experiment.However,whether 17q is associated with prostate cancer and its clinical manifestations in Chinese populations is still unknown.Therefore,we conducted a case-control study in a northern Chinese population and tested 2 SNPs,rs4430796 and rs1859962,on 17q in 124 prostate cancer patients and 111 controls using polymerase chain reaction-high resolution melting curve (PCR-HRM) combined with sequencing.We analyzed the association of the 2 SNPs with the risk of prostate cancer as well as patients'lifestyles,onset ages,Gleason scores,PSA levels,and pathologic stages.We found a significant difference in the G allele of SNP rs1859962(P=0.035,OR=1.51,95% CI=1.03-2.21) but not in the rs4430796 genotype frequency or allele frequency distribution between prostate cancer patients and the controls(P>0.05).Neither of the SNPs was significantly associated with the onset age,Gleason score,PSA level,pathologic stage,or other clinical indicators of patients with prostate cancer(P>0.05).Our results show that polymorphism of the G allele of SNP rs1859962 is associated with the risk of prostate cancer in a Chinese population.

  14. Additive and epistatic genome-wide association for growth and ultrasound scan measures of carcass-related traits in Brahman cattle.

    Science.gov (United States)

    Ali, A A; Khatkar, M S; Kadarmideen, H N; Thomson, P C

    2015-04-01

    Genome-wide association studies are routinely used to identify genomic regions associated with traits of interest. However, this ignores an important class of genomic associations, that of epistatic interactions. A genome-wide interaction analysis between single nucleotide polymorphisms (SNPs) using highly dense markers can detect epistatic interactions, but is a difficult task due to multiple testing and computational demand. However, It is important for revealing complex trait heredity. This study considers analytical methods that detect statistical interactions between pairs of loci. We investigated a three-stage modelling procedure: (i) a model without the SNP to estimate the variance components; (ii) a model with the SNP using variance component estimates from (i), thus avoiding iteration; and (iii) using the significant SNPs from (ii) for genome-wide epistasis analysis. We fitted these three-stage models to field data for growth and ultrasound measures for subcutaneous fat thickness in Brahman cattle. The study demonstrated the usefulness of modelling epistasis in the analysis of complex traits as it revealed extra sources of genetic variation and identified potential candidate genes affecting the concentration of insulin-like growth factor-1 and ultrasound scan measure of fat depth traits. Information about epistasis can add to our understanding of the complex genetic networks that form the fundamental basis of biological systems.

  15. Characterization of single nucleotide polymorphism markers for the green sea turtle (Chelonia mydas).

    Science.gov (United States)

    Roden, Suzanne E; Dutton, Peter H; Morin, Phillip A

    2009-05-01

    We present data on 29 new single nucleotide polymorphism assays for the green sea turtle, Chelonia mydas. DNA extracts from 39 green turtles were used for two methods of single nucleotide polymorphism discovery. The first approach employed an amplified fragment length polymorphism technique. The second technique screened a microsatellite library. Allele-specific amplification assays were developed for high-throughput single nucleotide polymorphism genotyping and tested on two Pacific C. mydas nesting populations. Observed heterozygosities ranged from 0 to 0.95 for a Hawaiian population and from 0 to 0.85 for a Galapagos population. Each of the populations had one locus out of Hardy-Weinberg equilibrium, SSCM2b and SSCM5 for Hawaii and Galapagos, respectively. No loci showed significant genotypic linkage disequilibrium across an expanded set of four Pacific nesting populations. However, two loci, SSCM4 and SSCM10b showed linkage disequilibrium across three populations indicating possible association.

  16. Genome-wide Association Study of Obsessive-Compulsive Disorder

    Science.gov (United States)

    Stewart, S Evelyn; Yu, Dongmei; Scharf, Jeremiah M; Neale, Benjamin M; Fagerness, Jesen A; Mathews, Carol A; Arnold, Paul D; Evans, Patrick D; Gamazon, Eric R; Osiecki, Lisa; McGrath, Lauren; Haddad, Stephen; Crane, Jacquelyn; Hezel, Dianne; Illman, Cornelia; Mayerfeld, Catherine; Konkashbaev, Anuar; Liu, Chunyu; Pluzhnikov, Anna; Tikhomirov, Anna; Edlund, Christopher K; Rauch, Scott L; Moessner, Rainald; Falkai, Peter; Maier, Wolfgang; Ruhrmann, Stephan; Grabe, Hans-Jörgen; Lennertz, Leonard; Wagner, Michael; Bellodi, Laura; Cavallini, Maria Cristina; Richter, Margaret A; Cook, Edwin H; Kennedy, James L; Rosenberg, David; Stein, Dan J; Hemmings, Sian MJ; Lochner, Christine; Azzam, Amin; Chavira, Denise A; Fournier, Eduardo; Garrido, Helena; Sheppard, Brooke; Umaña, Paul; Murphy, Dennis L; Wendland, Jens R; Veenstra-VanderWeele, Jeremy; Denys, Damiaan; Blom, Rianne; Deforce, Dieter; Van Nieuwerburgh, Filip; Westenberg, Herman GM; Walitza, Susanne; Egberts, Karin; Renner, Tobias; Miguel, Euripedes Constantino; Cappi, Carolina; Hounie, Ana G; Conceição do Rosário, Maria; Sampaio, Aline S; Vallada, Homero; Nicolini, Humberto; Lanzagorta, Nuria; Camarena, Beatriz; Delorme, Richard; Leboyer, Marion; Pato, Carlos N; Pato, Michele T; Voyiaziakis, Emanuel; Heutink, Peter; Cath, Danielle C; Posthuma, Danielle; Smit, Jan H; Samuels, Jack; Bienvenu, O Joseph; Cullen, Bernadette; Fyer, Abby J; Grados, Marco A; Greenberg, Benjamin D; McCracken, James T; Riddle, Mark A; Wang, Ying; Coric, Vladimir; Leckman, James F; Bloch, Michael; Pittenger, Christopher; Eapen, Valsamma; Black, Donald W; Ophoff, Roel A; Strengman, Eric; Cusi, Daniele; Turiel, Maurizio; Frau, Francesca; Macciardi, Fabio; Gibbs, J Raphael; Cookson, Mark R; Singleton, Andrew; Hardy, John; Crenshaw, Andrew T; Parkin, Melissa A; Mirel, Daniel B; Conti, David V; Purcell, Shaun; Nestadt, Gerald; Hanna, Gregory L; Jenike, Michael A; Knowles, James A; Cox, Nancy; Pauls, David L

    2014-01-01

    Obsessive-compulsive disorder (OCD) is a common, debilitating neuropsychiatric illness with complex genetic etiology. The International OCD Foundation Genetics Collaborative (IOCDF-GC) is a multi-national collaboration established to discover the genetic variation predisposing to OCD. A set of individuals affected with DSM-IV OCD, a subset of their parents, and unselected controls, were genotyped with several different Illumina SNP microarrays. After extensive data cleaning, 1,465 cases, 5,557 ancestry-matched controls and 400 complete trios remained, with a common set of 469,410 autosomal and 9,657 X-chromosome SNPs. Ancestry-stratified case-control association analyses were conducted for three genetically-defined subpopulations and combined in two meta-analyses, with and without the trio-based analysis. In the case-control analysis, the lowest two p-values were located within DLGAP1 (p=2.49×10-6 and p=3.44×10-6), a member of the neuronal postsynaptic density complex. In the trio analysis, rs6131295, near BTBD3, exceeded the genome-wide significance threshold with a p-value=3.84 × 10-8. However, when trios were meta-analyzed with the combined case-control samples, the p-value for this variant was 3.62×10-5, losing genome-wide significance. Although no SNPs were identified to be associated with OCD at a genome-wide significant level in the combined trio-case-control sample, a significant enrichment of methylation-QTLs (p<0.001) and frontal lobe eQTLs (p=0.001) was observed within the top-ranked SNPs (p<0.01) from the trio-case-control analysis, suggesting these top signals may have a broad role in gene expression in the brain, and possibly in the etiology of OCD. PMID:22889921

  17. Combined analysis of three genome-wide association studies on vWF and FVIII plasma levels

    Directory of Open Access Journals (Sweden)

    Cohen William

    2011-08-01

    Full Text Available Abstract Background Elevated levels of factor VIII (FVIII and von Willebrand Factor (vWF are well-established risk factors for cardiovascular diseases, in particular venous thrombosis. Although high, the heritability of these traits is poorly explained by the genetic factors known so far. The aim of this work was to identify novel single nucleotide polymorphisms (SNPs that could influence the variability of these traits. Methods Three independent genome-wide association studies for vWF plasma levels and FVIII activity were conducted and their results were combined into a meta-analysis totalling 1,624 subjects. Results No single nucleotide polymorphism (SNP reached the study-wide significance level of 1.12 × 10-7 that corresponds to the Bonferroni correction for the number of tested SNPs. Nevertheless, the recently discovered association of STXBP5, STX2, TC2N and CLEC4M genes with vWF levels and that of SCARA5 and STAB2 genes with FVIII levels were confirmed in this meta-analysis. Besides, among the fifteen novel SNPs showing promising association at p -5 with either vWF or FVIII levels in the meta-analysis, one located in ACCN1 gene also showed weak association (P = 0.0056 with venous thrombosis in a sample of 1,946 cases and 1,228 controls. Conclusions This study has generated new knowledge on genomic regions deserving further investigations in the search for genetic factors influencing vWF and FVIII plasma levels, some potentially implicated in VT, as well as providing some supporting evidence of previously identified genes.

  18. [Genome-wide association study for adolescent idiopathic scoliosis].

    Science.gov (United States)

    Ogura, Yoji; Kou, Ikuyo; Scoliosis, Japan; Matsumoto, Morio; Watanabe, Kota; Ikegawa, Shiro

    2016-04-01

    Adolescent idiopathic scoliosis(AIS)is a polygenic disease. Genome-wide association studies(GWASs)have been performed for a lot of polygenic diseases. For AIS, we conducted GWAS and identified the first AIS locus near LBX1. After the discovery, we have extended our study by increasing the numbers of subjects and SNPs. In total, our Japanese GWAS has identified four susceptibility genes. GWASs for AIS have also been performed in the USA and China, which identified one and three susceptibility genes, respectively. Here we review GWASs in Japan and abroad and functional analysis to clarify the pathomechanism of AIS.

  19. Genome-wide association studies and contribution to cardiovascular physiology.

    Science.gov (United States)

    Munroe, Patricia B; Tinker, Andrew

    2015-09-01

    The study of family pedigrees with rare monogenic cardiovascular disorders has revealed new molecular players in physiological processes. Genome-wide association studies of complex traits with a heritable component may afford a similar and potentially intellectually richer opportunity. In this review we focus on the interpretation of genetic associations and the issue of causality in relation to known and potentially new physiology. We mainly discuss cardiometabolic traits as it reflects our personal interests, but the issues pertain broadly in many other disciplines. We also describe some of the resources that are now available that may expedite follow up of genetic association signals into observations on causal mechanisms and pathophysiology.

  20. Genome-wide prediction of C. elegans genetic interactions.

    Science.gov (United States)

    Zhong, Weiwei; Sternberg, Paul W

    2006-03-10

    To obtain a global view of functional interactions among genes in a metazoan genome, we computationally integrated interactome data, gene expression data, phenotype data, and functional annotation data from three model organisms-Saccharomyces cerevisiae, Caenorhabditis elegans, and Drosophila melanogaster-and predicted genome-wide genetic interactions in C. elegans. The resulting genetic interaction network (consisting of 18,183 interactions) provides a framework for system-level understanding of gene functions. We experimentally tested the predicted interactions for two human disease-related genes and identified 14 new modifiers.

  1. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.

    Science.gov (United States)

    Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G

    2000-12-15

    The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.

  2. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    Directory of Open Access Journals (Sweden)

    Varun Warrier

    Full Text Available Asperger Syndrome (AS is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC, which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448 were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448 lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.

  3. Planning and executing a genome wide association study (GWAS).

    Science.gov (United States)

    Sale, Michèle M; Mychaleckyj, Josyf C; Chen, Wei-Min

    2009-01-01

    In recent years, genome-wide association approaches have proven a powerful and successful strategy to identify genetic contributors to complex traits, including a number of endocrine disorders. Their success has meant that genome wide association studies (GWAS) are fast becoming the default study design for discovery of new genetic variants that influence a clinical trait or phenotype. This chapter focuses on a number of key elements that require consideration for the successful conduct of a GWAS. Although many of the considerations are common to any genetic study, the greater cost, extreme multiple testing, and greater openness to data sharing require specific awareness and planning by investigators. In the section on designing a GWAS, we reflect on ethical considerations, study design, selection of phenotype/s, power considerations, sample tracking and storage issues, and genotyping product selection. During execution, important considerations include DNA quantity and preparation, genotyping methods, quality control checks of genotype data, in silico genotyping (imputation), tests of association, and replication of association signals. Although the field of human genetics is rapidly evolving, recent experiences can help guide an investigator in making practical and methodological choices that will eventually determine the overall quality of GWAS results. Given the investment to recruit patient populations or cohorts that are powered for a GWAS, and the still substantial costs associated with genotyping, it is helpful to be aware of these aspects to maximize the likelihood of success, especially where there is an opportunity for implementing them prospectively.

  4. Genome-wide mapping of DNA methylation in chicken.

    Directory of Open Access Journals (Sweden)

    Qinghe Li

    Full Text Available Cytosine DNA methylation is an important epigenetic modification termed as the fifth base that functions in diverse processes. Till now, the genome-wide DNA methylation maps of many organisms has been reported, such as human, Arabidopsis, rice and silkworm, but the methylation pattern of bird remains rarely studied. Here we show the genome-wide DNA methylation map of bird, using the chicken as a model organism and an immunocapturing approach followed by high-throughput sequencing. In both of the red jungle fowl and the avian broiler, DNA methylation was described separately for the liver and muscle tissue. Generally, chicken displays analogous methylation pattern with that of animals and plants. DNA methylation is enriched in the gene body regions and the repetitive sequences, and depleted in the transcription start site (TSS and the transcription termination site (TTS. Most of the CpG islands in the chicken genome are kept in unmethylated state. Promoter methylation is negatively correlated with the gene expression level, indicating its suppressive role in regulating gene transcription. This work contributes to our understanding of epigenetics in birds.

  5. Genome-wide mapping of DNA strand breaks.

    Directory of Open Access Journals (Sweden)

    Frédéric Leduc

    Full Text Available Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed "damaged DNA immunoprecipitation" (dDIP, uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage.

  6. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    Science.gov (United States)

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.

  7. Genome-wide interrogation identifies YAP1 variants associated with survival of small-cell lung cancer patients.

    Science.gov (United States)

    Wu, Chen; Xu, Binghe; Yuan, Peng; Miao, Xiaoping; Liu, Yu; Guan, Yin; Yu, Dianke; Xu, Jian; Zhang, Tongwen; Shen, Hongbing; Wu, Tangchun; Lin, Dongxin

    2010-12-01

    Although most patients with small-cell lung cancer respond to chemotherapy, the survival time is highly diverse. We conducted a genome-wide analysis to examine whether germline genetic variations are prognostic factors in small-cell lung cancer patients treated with the same chemotherapy regimen. Genome-wide scan of single nucleotide polymorphisms (SNP) was performed using blood DNA to identify genotypes associated with overall survival in 245 patients treated with platinum-based chemotherapy, and the results were replicated in another independent set of 305 patients. Associations were estimated by Cox models and function of the variants was examined by biochemical assays. We found that rs1820453 T>G SNP within the promoter region of YAP1 on chromosome 11q22 and rs716274 A>G SNP in the region of downstream of DYNC2H1 on chromosome 11q22.3 are associated with small-cell lung cancer survival. In pooled analysis of 2 independent cohorts, the adjusted hazard ratio for patients with the rs1820453 TG or GG genotype was 1.49 (95% CI, 1.19-1.85; P = 0.0004) and 1.65 (95% CI, 1.36-2.01; P = 4.76 × 10(-7)), respectively, compared with the TT genotype; and for patients with the rs716274 AG or GG genotype was 1.83 (95% CI, 1.47-2.29; P = 8.74 × 10(-8)) and 2.96 (95% CI, 1.90-4.62; P = 1.59 × 10(-6)), respectively, compared with the AA genotype. Functional analysis showed that the rs1820453 T>G change creates a transcriptional factor binding site and results in downregulation of YAP1 expression. These results suggest that YAP1 may play an important role in prognosis of small-cell lung cancer patients treated with platinum-based chemotherapy.

  8. Genome-wide association study of autistic-like traits in a general population study of young adults

    Directory of Open Access Journals (Sweden)

    Rachel Maree Jones

    2013-10-01

    Full Text Available Research has proposed that autistic-like traits in the general population lie on a continuum, with clinical Autism Spectrum Disorder (ASD representing the extreme end of this distribution. Inherent in this proposal is that biological mechanisms associated with clinical ASD may also underpin variation in autistic-like traits within the general population. A genome-wide association study using 2,462,046 single nucleotide polymorphisms (SNPs was undertaken for ASD in 965 individuals from the Western Australian Pregnancy Cohort (Raine Study. No SNP associations reached genome-wide significance (p < 5.0 x 10-8. However, investigations into nominal observed SNP associations (p < 1.0 x 10-5 add support to two positional candidate genes previously implicated in ASD aetiology, PRKCB1 and CBLN1.The rs198198 SNP (p = 9.587 x 10-6, is located within an intron of the protein kinase C, beta 1 (PRKCB1 gene on chromosome 16p11. The PRKCB1 gene has been previously reported in linkage and association studies for ASD, and its mRNA expression has been shown to be significantly down regulated in ASD cases compared with controls. The rs16946931 SNP (p = 1.78 x 10-6 is located in a region flanking the Cerebellin 1 (CBLN1 gene on chromosome 16q12.1. The CBLN1 gene is involved with synaptogenesis and is part of a gene family previously implicated in ASD. This GWA study is only the second to examine SNPs associated with autistic-like traits in the general population, and provides evidence to support roles for the PRKCB1 and CBLN1 genes in risk of clinical ASD.

  9. Genome-wide association study identifies novel loci associated with serum level of vitamin B12 in Chinese men.

    Science.gov (United States)

    Lin, Xiaoling; Lu, Daru; Gao, Yong; Tao, Sha; Yang, Xiaobo; Feng, Junjie; Tan, Aihua; Zhang, Haiying; Hu, Yanling; Qin, Xue; Kim, Seong-Tae; Peng, Tao; Li, Li; Mo, Linjian; Zhang, Shijun; Trent, Jeffrey M; Mo, Zengnan; Zheng, S Lilly; Xu, Jianfeng; Sun, Jielin

    2012-06-01

    Vitamin B12 (VitB12 or cobalamin) is an essential cofactor in several metabolic pathways. Clinically, VitB12 deficiency is associated with pernicious anemia, neurodegenerative disorder, cardiovascular disease and gastrointestinal disease. Although previous genome-wide association studies (GWAS) identified several genes, including FUT2, CUBN, TCN1 and MUT, that may influence VitB12 levels in European populations, common genetic determinants of VitB12 remain largely unknown, especially in Asian populations. Here we performed a GWAS in 1999 healthy Chinese men and replicated the top findings in an independent Chinese sample with 1496 subjects. We identified four novel genomic loci that were significantly associated with serum level of VitB12 at a genome-wide significance level of 5.00 × 10(-8). These four loci were MS4A3 (11q12.1; rs2298585; P= 2.64 × 10(-15)), CLYBL (13q32; rs41281112; P= 9.23 × 10(-10)), FUT6 (19p13.3; rs3760776; P= 3.68 × 10(-13)) and 5q32 region (rs10515552; P= 3.94 × 10(-8)). In addition, we also confirmed the association with the serum level of VitB12 for the previously reported FUT2 gene and identified one novel non-synonymous single-nucleotide polymorphism in FUT2 gene in this Chinese population (19q13.33; rs1047781; P= 3.62 × 10(-36)). The new loci identified offer new insights into the biochemical pathways involved in determining the serum level of VitB12 and provide opportunities to better delineate the role of VitB12 in health and disease.

  10. Genomic risk profiling of ischemic stroke: results of an international genome-wide association meta-analysis.

    Directory of Open Access Journals (Sweden)

    James F Meschia

    Full Text Available INTRODUCTION: Familial aggregation of ischemic stroke derives from shared genetic and environmental factors. We present a meta-analysis of genome-wide association scans (GWAS from 3 cohorts to identify the contribution of common variants to ischemic stroke risk. METHODS: This study involved 1464 ischemic stroke cases and 1932 controls. Cases were genotyped using the Illumina 610 or 660 genotyping arrays; controls, with Illumina HumanHap 550Kv1 or 550Kv3 genotyping arrays. Imputation was performed with the 1000 Genomes European ancestry haplotypes (August 2010 release as a reference. A total of 5,156,597 single-nucleotide polymorphisms (SNPs were incorporated into the fixed effects meta-analysis. All SNPs associated with ischemic stroke (P<1×10(-5 were incorporated into a multivariate risk profile model. RESULTS: No SNP reached genome-wide significance for ischemic stroke (P<5×10(-8. Secondary analysis identified a significant cumulative effect for age at onset of stroke (first versus fifth quintile of cumulative profiles based on SNPs associated with late onset, ß = 14.77 [10.85,18.68], P = 5.5×10(-12, as well as a strong effect showing increased risk across samples with a high propensity for stroke among samples with enriched counts of suggestive risk alleles (P<5×10(-6. Risk profile scores based only on genomic information offered little incremental prediction. DISCUSSION: There is little evidence of a common genetic variant contributing to moderate risk of ischemic stroke. Quintiles based on genetic loading of alleles associated with a younger age at onset of ischemic stroke revealed a significant difference in age at onset between those in the upper and lower quintiles. Using common variants from GWAS and imputation, genomic profiling remains inferior to family history of stroke for defining risk. Inclusion of genomic (rare variant information may be required to improve clinical risk profiling.

  11. A genome-wide association study to detect genetic variation for postpartum dysgalactia syndrome in five commercial pig breeding lines.

    Science.gov (United States)

    Preissler, Regine; Tetens, Jens; Reiners, Kerstin; Looft, Holger; Kemper, Nicole

    2013-08-01

    Postpartum dysgalactia syndrome (PDS) in sows is an important disease after parturition with a relevant economic impact, affecting the health and welfare of both sows and piglets. The genetic background of this disease has been discussed and its heritability estimated, but further genetic analyses are lacking in detail. The aim of the current study was to detect loci affecting the susceptibility to PDS through a genome-wide association approach. The study was designed as a family-based association study with matched sampling of affected sows and healthy half- or full-sib control sows on six farms. For the study, 597 sows (322 affected vs. 275 healthy control sows) were genotyped on 62 163 single nucleotide polymorphisms (SNPs) using the Illumina PorcineSNP60 BeadChip. After quality control, 585 sows (314 affected vs. 271 healthy control sows) and 49 740 SNPs remained for further analysis. Statistics were performed mainly with the r package genabel and included a principal component analysis. A statistically significant genome-wide associated SNP was identified on porcine chromosome (SSC) 17. Further promising results with moderate significance were detected on SSC 13 and on an unplaced scaffold with an older annotation on SSC 15. The PRICKLE2 and NRP2 genes were identified as candidate genes near associated SNPs. Several quantitative trait loci (QTL) have been previously described in these genomic regions, including QTL for mammary gland condition, as teat number and non-functional nipples QTL, as well as QTL for body temperature and gestation length.

  12. Novel Rheumatoid Arthritis Susceptibility Locus at 22q12 Identified in an Extended UK Genome-Wide Association Study

    Science.gov (United States)

    Orozco, Gisela; Viatte, Sebastien; Bowes, John; Martin, Paul; Wilson, Anthony G; Morgan, Ann W; Steer, Sophia; Wordsworth, Paul; Hocking, Lynne J; Barton, Anne; Worthington, Jane; Eyre, Stephen

    2014-01-01

    Objective The number of confirmed rheumatoid arthritis (RA) loci currently stands at 32, but many lines of evidence indicate that expansion of existing genome-wide association studies (GWAS) enhances the power to detect additional loci. This study was undertaken to extend our previous RA GWAS in a UK cohort, adding more independent RA cases and healthy controls, with the aim of detecting novel association signals for susceptibility to RA in a homogeneous UK cohort. Methods A total of 3,223 UK RA cases and 5,272 UK controls were available for association analyses, with the extension adding 1,361 cases and 2,334 controls to the original GWAS data set. The genotype data for all RA cases were imputed using the Impute program version 2. After stringent quality control thresholds were applied, 3,034 cases and 5,271 controls (1,831,729 single-nucleotide polymorphisms [SNPs]) were available for analysis. Association testing was performed using Plink software. Results The analyses indicated a suggestive association with susceptibility to RA (P < 0.0001) for 6 novel RA loci that have been previously found to be associated with other autoimmune diseases; these 6 SNPs were genotyped in independent samples. Two of the associated loci were validated, one of which was associated with RA at genome-wide levels of significance in the combined analysis, identifying a novel RA locus at 22q12 (P = 6.9 × 10−9). In addition, most of the previously known RA susceptibility loci were confirmed to be associated with RA, and for 16 of the loci, the strength of the association was increased. Conclusion This study identified a new RA locus mapping to 22q12. These results support the notion that increasing the power of GWAS enhances novel gene discovery. PMID:24449572

  13. Genome wide association identifies PPFIA1 as a candidate gene for acute lung injury risk following major trauma.

    Directory of Open Access Journals (Sweden)

    Jason D Christie

    Full Text Available Acute Lung Injury (ALI is a syndrome with high associated mortality characterized by severe hypoxemia and pulmonary infiltrates in patients with critical illness. We conducted the first investigation to use the genome wide association (GWA approach to identify putative risk variants for ALI. Genome wide genotyping was performed using the Illumina Human Quad 610 BeadChip. We performed a two-stage GWA study followed by a third stage of functional characterization. In the discovery phase (Phase 1, we compared 600 European American trauma-associated ALI cases with 2266 European American population-based controls. We carried forward the top 1% of single nucleotide polymorphisms (SNPs at p<0.01 to a replication phase (Phase 2 comprised of a nested case-control design sample of 212 trauma-associated ALI cases and 283 at-risk trauma non-ALI controls from ongoing cohort studies. SNPs that replicated at the 0.05 level in Phase 2 were subject to functional validation (Phase 3 using expression quantitative trait loci (eQTL analyses in stimulated B-lymphoblastoid cell lines (B-LCL in family trios. 159 SNPs from the discovery phase replicated in Phase 2, including loci with prior evidence for a role in ALI pathogenesis. Functional evaluation of these replicated SNPs revealed rs471931 on 11q13.3 to exert a cis-regulatory effect on mRNA expression in the PPFIA1 gene (p = 0.0021. PPFIA1 encodes liprin alpha, a protein involved in cell adhesion, integrin expression, and cell-matrix interactions. This study supports the feasibility of future multi-center GWA investigations of ALI risk, and identifies PPFIA1 as a potential functional candidate ALI risk gene for future research.

  14. Genome-wide association study of cognitive functions and educational attainment in UK Biobank (N=112 151)

    Science.gov (United States)

    Davies, G; Marioni, R E; Liewald, D C; Hill, W D; Hagenaars, S P; Harris, S E; Ritchie, S J; Luciano, M; Fawns-Ritchie, C; Lyall, D; Cullen, B; Cox, S R; Hayward, C; Porteous, D J; Evans, J; McIntosh, A M; Gallacher, J; Craddock, N; Pell, J P; Smith, D J; Gale, C R; Deary, I J

    2016-01-01

    People's differences in cognitive functions are partly heritable and are associated with important life outcomes. Previous genome-wide association (GWA) studies of cognitive functions have found evidence for polygenic effects yet, to date, there are few replicated genetic associations. Here we use data from the UK Biobank sample to investigate the genetic contributions to variation in tests of three cognitive functions and in educational attainment. GWA analyses were performed for verbal–numerical reasoning (N=36 035), memory (N=112 067), reaction time (N=111 483) and for the attainment of a college or a university degree (N=111 114). We report genome-wide significant single-nucleotide polymorphism (SNP)-based associations in 20 genomic regions, and significant gene-based findings in 46 regions. These include findings in the ATXN2, CYP2DG, APBA1 and CADM2 genes. We report replication of these hits in published GWA studies of cognitive function, educational attainment and childhood intelligence. There is also replication, in UK Biobank, of SNP hits reported previously in GWA studies of educational attainment and cognitive function. GCTA-GREML analyses, using common SNPs (minor allele frequency>0.01), indicated significant SNP-based heritabilities of 31% (s.e.m.=1.8%) for verbal–numerical reasoning, 5% (s.e.m.=0.6%) for memory, 11% (s.e.m.=0.6%) for reaction time and 21% (s.e.m.=0.6%) for educational attainment. Polygenic score analyses indicate that up to 5% of the variance in cognitive test scores can be predicted in an independent cohort. The genomic regions identified include several novel loci, some of which have been associated with intracranial volume, neurodegeneration, Alzheimer's disease and schizophrenia. PMID:27046643

  15. Gene-environment interaction effects on lung function- a genome-wide association study within the Framingham heart study

    Science.gov (United States)

    2013-01-01

    Background Previous studies in occupational exposure and lung function have focused only on the main effect of occupational exposure or genetics on lung function. Some disease-susceptible genes may be missed due to their low marginal effects, despite potential involvement in the disease process through interactions with the environment. Through comprehensive genome-wide gene-environment interaction studies, we can uncover these susceptibility genes. Our objective in this study was to explore gene by occupational exposure interaction effects on lung function using both the individual SNPs approach and the genetic network approach. Methods The study population comprised the Offspring Cohort and the Third Generation from the Framingham Heart Study. We used forced expiratory volume in one second (FEV1) and ratio of FEV1 to forced vital capacity (FVC) as outcomes. Occupational exposures were classified using a population-specific job exposure matrix. We performed genome-wide gene-environment interaction analysis, using the Affymetrix 550 K mapping array for genotyping. A linear regression-based generalized estimating equation was applied to account for within-family relatedness. Network analysis was conducted using results from single-nucleotide polymorphism (SNP)-level analyses and from gene expression study results. Results There were 4,785 participants in total. SNP-level analysis and network analysis identified SNP rs9931086 (Pinteraction =1.16 × 10-7) in gene SLC38A8, which may significantly modify the effects of occupational exposure on FEV1. Genes identified from the network analysis included CTLA-4, HDAC, and PPAR-alpha. Conclusions Our study implies that SNP rs9931086 in SLC38A8 and genes CTLA-4, HDAC, and PPAR-alpha, which are related to inflammatory processes, may modify the effect of occupational exposure on lung function. PMID:24289273

  16. Cross-Disorder Genome-Wide Analyses Suggest a Complex Genetic Relationship Between Tourette Syndrome and Obsessive-Compulsive Disorder

    Science.gov (United States)

    Yu, Dongmei; Mathews, Carol A.; Scharf, Jeremiah M.; Neale, Benjamin M.; Davis, Lea K.; Gamazon, Eric R.; Derks, Eske M.; Evans, Patrick; Edlund, Christopher K.; Crane, Jacquelyn; Fagerness, Jesen A.; Osiecki, Lisa; Gallagher, Patience; Gerber, Gloria; Haddad, Stephen; Illmann, Cornelia; McGrath, Lauren M.; Mayerfeld, Catherine; Arepalli, Sampath; Barlassina, Cristina; Barr, Cathy L.; Bellodi, Laura; Benarroch, Fortu; Berrió, Gabriel Bedoya; Bienvenu, O. Joseph; Black, Donald; Bloch, Michael H.; Brentani, Helena; Bruun, Ruth D.; Budman, Cathy L.; Camarena, Beatriz; Campbell, Desmond D.; Cappi, Carolina; Cardona Silgado, Julio C.; Cavallini, Maria C.; Chavira, Denise A.; Chouinard, Sylvain; Cook, Edwin H.; Cookson, M. R.; Coric, Vladimir; Cullen, Bernadette; Cusi, Daniele; Delorme, Richard; Denys, Damiaan; Dion, Yves; Eapen, Valsama; Egberts, Karin; Falkai, Peter; Fernandez, Thomas; Fournier, Eduardo; Garrido, Helena; Geller, Daniel; Gilbert, Donald; Girard, Simon L.; Grabe, Hans J.; Grados, Marco A.; Greenberg, Benjamin D.; Gross-Tsur, Varda; Grünblatt, Edna; Hardy, John; Heiman, Gary A.; Hemmings, Sian M.J.; Herrera, Luis D.; Hezel, Dianne M.; Hoekstra, Pieter J.; Jankovic, Joseph; Kennedy, James L.; King, Robert A.; Konkashbaev, Anuar I.; Kremeyer, Barbara; Kurlan, Roger; Lanzagorta, Nuria; Leboyer, Marion; Leckman, James F.; Lennertz, Leonhard; Liu, Chunyu; Lochner, Christine; Lowe, Thomas L.; Lupoli, Sara; Macciardi, Fabio; Maier, Wolfgang; Manunta, Paolo; Marconi, Maurizio; McCracken, James T.; Mesa Restrepo, Sandra C.; Moessner, Rainald; Moorjani, Priya; Morgan, Jubel; Muller, Heike; Murphy, Dennis L.; Naarden, Allan L.; Ochoa, William Cornejo; Ophoff, Roel A.; Pakstis, Andrew J.; Pato, Michele T.; Pato, Carlos N.; Piacentini, John; Pittenger, Christopher; Pollak, Yehuda; Rauch, Scott L.; Renner, Tobias; Reus, Victor I.; Richter, Margaret A.; Riddle, Mark A.; Robertson, Mary M.; Romero, Roxana; Rosário, Maria C.; Rosenberg, David; Ruhrmann, Stephan; Sabatti, Chiara; Salvi, Erika; Sampaio, Aline S.; Samuels, Jack; Sandor, Paul; Service, Susan K.; Sheppard, Brooke; Singer, Harvey S.; Smit, Jan H.; Stein, Dan J.; Strengman, Eric; Tischfield, Jay A.; Turiel, Maurizio; Valencia Duarte, Ana V.; Vallada, Homero; Veenstra-VanderWeele, Jeremy; Walitza, Susanne; Walkup, John; Wang, Ying; Weale, Mike; Weiss, Robert; Wendland, Jens R.; Westenberg, Herman G.M.; Yao, Yin; Hounie, Ana G.; Miguel, Euripedes C.; Nicolini, Humberto; Wagner, Michael; Ruiz-Linares, Andres; Cath, Danielle C.; McMahon, William; Posthuma, Danielle; Oostra, Ben A.; Nestadt, Gerald; Rouleau, Guy A.; Purcell, Shaun; Jenike, Michael A.; Heutink, Peter; Hanna, Gregory L.; Conti, David V.; Arnold, Paul D.; Freimer, Nelson; Stewart, S. Evelyn; Knowles, James A.; Cox, Nancy J.; Pauls, David L.

    2014-01-01

    Obsessive-compulsive disorder (OCD) and Tourette Syndrome (TS) are highly heritable neurodevelopmental disorders that are thought to share genetic risk factors. However, the identification of definitive susceptibility genes for these etiologically complex disorders remains elusive. Here, we report a combined genome-wide association study (GWAS) of TS and OCD in 2723 cases (1310 with OCD, 834 with TS, 579 with OCD plus TS/chronic tics (CT)), 5667 ancestry-matched controls, and 290 OCD parent-child trios. Although no individual single nucleotide polymorphisms (SNPs) achieved genome-wide significance, the GWAS signals were enriched for SNPs strongly associated with variations in brain gene expression levels, i.e. expression quantitative loci (eQTLs), suggesting the presence of true functional variants that contribute to risk of these disorders. Polygenic score analyses identified a significant polygenic component for OCD (p=2×10−4), predicting 3.2% of the phenotypic variance in an independent data set. In contrast, TS had a smaller, non-significant polygenic component, predicting only 0.6% of the phenotypic variance (p=0.06). No significant polygenic signal was detected across the two disorders, although the sample is likely underpowered to detect a modest shared signal. Furthermore, the OCD polygenic signal was significantly attenuated when cases with both OCD and TS/CT were included in the analysis (p=0.01). Previous work has shown that TS and OCD have some degree of shared genetic variation. However, the data from this study suggest that there are also distinct components to the genetic architectures of TS and OCD. Furthermore, OCD with co-occurring TS/CT may have different underlying genetic susceptibility compared to OCD alone. PMID:25158072

  17. LNA-enhanced detection of single nucleotide polymorphisms in the apolipoprotein E

    DEFF Research Database (Denmark)

    Jacobsen, Nana; Bentzen, Joan; Meldgaard, Michael;

    2002-01-01

    Genotyping of single nucleotide polymorphisms (SNPs) in large populations presents a great challenge, especially if the SNPs are embedded in GC-rich regions, such as the codon 112 SNP in the human apolipoprotein E (apoE). In the present study, we have used immobilized locked nucleic acid (LNA) ca...

  18. Development and characterization of 35 single nucleotide polymorphism markers for the brown alga Fucus vesiculosus

    NARCIS (Netherlands)

    Canovas, Fernando; Mota, Catarina; Ferreira-Costa, Joana; Serrao, Ester; Coyer, Jim; Olsen, Jeanine; Pearson, Gareth

    2011-01-01

    We characterized 35 single nucleotide polymorphism (SNP) markers for the brown alga Fucus vesiculosus. Based on existing Fucus Expressed Sequence Tag libraries for heat and desiccation-stressed tissue, SNPs were developed and confirmed by re-sequencing cDNA from a diverse panel of individuals. SNP l

  19. Twin Probes as a Novel Tool for the Detection of Single-Nucleotide Polymorphisms

    NARCIS (Netherlands)

    Ergen, Erhan; Weber, Markus; Jacob, Josemon; Herrmann, Andreas; Müllen, Klaus

    2006-01-01

    Single-nucleotide polymorphisms (SNPs) are the most common form of DNA sequence variation. There is a strong interest from both academy and industry to develop rapid, sensitive and cost effective methods for SNP detection. Here we report a novel structural concept for DNA detection based on fluoresc

  20. Subtyping of Salmonella enterica subspecies I using single nucleotide polymorphisms in adenylate cyclase (cyaA)

    Science.gov (United States)

    Methods to rapidly identify serotypes of Salmonella enterica subspecies I are of vital importance for protecting the safety of food. To supplement the serotyping method dkgB-linked intergenic sequence ribotyping (ISR), single nucleotide polymorphisms (SNPs) were characterized within adenylate cyclas...

  1. Increasing the number of single nucleotide polymorphisms used in genomic evaluation of dairy cattle

    Science.gov (United States)

    GeneSeek designed a new version of the GeneSeek Genomic Profiler HD BeadChip for Dairy Cattle, which had >77,000 single nucleotide polymorphisms (SNPs). A set of >140,000 SNPs was selected that included all SNPs on the existing GeneSeek chip, all SNPs used in U.S. national genomic evaluations, SNPs ...

  2. Combined array-comparative genomic hybridization and single-nucleotide polymorphism-loss of heterozygosity analysis reveals complex genetic alterations in cervical cancer

    Directory of Open Access Journals (Sweden)

    Kenter Gemma G

    2007-02-01

    Full Text Available Abstract Background Cervical carcinoma develops as a result of multiple genetic alterations. Different studies investigated genomic alterations in cervical cancer mainly by means of metaphase comparative genomic hybridization (mCGH and microsatellite marker analysis for the detection of loss of heterozygosity (LOH. Currently, high throughput methods such as array comparative genomic hybridization (array CGH, single nucleotide polymorphism array (SNP array and gene expression arrays are available to study genome-wide alterations. Integration of these 3 platforms allows detection of genomic alterations at high resolution and investigation of an association between copy number changes and expression. Results Genome-wide copy number and genotype analysis of 10 cervical cancer cell lines by array CGH and SNP array showed highly complex large-scale alterations. A comparison between array CGH and SNP array revealed that the overall concordance in detection of the same areas with copy number alterations (CNA was above 90%. The use of SNP arrays demonstrated that about 75% of LOH events would not have been found by methods which screen for copy number changes, such as array CGH, since these were LOH events without CNA. Regions frequently targeted by CNA, as determined by array CGH, such as amplification of 5p and 20q, and loss of 8p were confirmed by fluorescent in situ hybridization (FISH. Genome-wide, we did not find a correlation between copy-number and gene expression. At chromosome arm 5p however, 22% of the genes were significantly upregulated in cell lines with amplifications as compared to cell lines without amplifications, as measured by gene expression arrays. For 3 genes, SKP2, ANKH and TRIO, expression differences were confirmed by quantitative real-time PCR (qRT-PCR. Conclusion This study showed that copy number data retrieved from either array CGH or SNP array are comparable and that the integration of genome-wide LOH, copy number and gene

  3. Layers of epistasis: genome-wide regulatory networks and network approaches to genome-wide association studies

    Science.gov (United States)

    Cowper-Sal·lari, Richard; Cole, Michael D.; Karagas, Margaret R.; Lupien, Mathieu; Moore, Jason H.

    2010-01-01

    The conceptual foundation of the genome-wide association study (GWAS) has advanced unchecked since its conception. A revision might seem premature as the potential of GWAS has not been fully realized. Multiple technical and practical limitations need to be overcome before GWAS can be fairly criticized. But with the completion of hundreds of studies and a deeper understanding of the genetic architecture of disease, warnings are being raised. The results compiled to date indicate that risk-associated variants lie predominantly in non-coding regions of the genome. Additionally, alternative methodologies are uncovering large and heterogeneous sets of rare variants underlying disease. The fear is that, even in its fulfilment, the current GWAS paradigm might be incapable of dissecting all kinds of phenotypes. In the following text we review several initiatives that aim to overcome these limitations. The overarching theme of these studies is the inclusion of biological knowledge to both the analysis and interpretation of genotyping data. GWAS is uninformed of biology by design and although there is some virtue in its simplicity it is also its most conspicuous deficiency. We propose a framework in which to integrate these novel approaches, both empirical and theoretical, in the form of a genome-wide regulatory network (GWRN). By processing experimental data into networks, emerging data types based on chromatin-immunoprecipitation are made computationally tractable. This will give GWAS re-analysis efforts the most current and relevant substrates, and root them firmly on our knowledge of human disease. PMID:21197657

  4. A genome-wide association study of optic disc parameters.

    Directory of Open Access Journals (Sweden)

    Wishal D Ramdas

    2010-06-01

    Full Text Available The optic nerve head is involved in many ophthalmic disorders, including common diseases such as myopia and open-angle glaucoma. Two of the most important parameters are the size of the optic disc area and the vertical cup-disc ratio (VCDR. Both are highly heritable but genetically largely undetermined. We performed a meta-analysis of genome-wide association (GWA data to identify genetic variants associated with optic disc area and VCDR. The gene discovery included 7,360 unrelated individuals from the population-based Rotterdam Study I and Rotterdam Study II cohorts. These cohorts revealed two genome-wide significant loci for optic disc area, rs1192415 on chromosome 1p22 (p = 6.72x10(-19 within 117 kb of the CDC7 gene and rs1900004 on chromosome 10q21.3-q22.1 (p = 2.67x10(-33 within 10 kb of the ATOH7 gene. They revealed two genome-wide significant loci for VCDR, rs1063192 on chromosome 9p21 (p = 6.15x10(-11 in the CDKN2B gene and rs10483727 on chromosome 14q22.3-q23 (p = 2.93x10(-10 within 40 kbp of the SIX1 gene. Findings were replicated in two independent Dutch cohorts (Rotterdam Study III and Erasmus Rucphen Family study; N = 3,612, and the TwinsUK cohort (N = 843. Meta-analysis with the replication cohorts confirmed the four loci and revealed a third locus at 16q12.1 associated with optic disc area, and four other loci at 11q13, 13q13, 17q23 (borderline significant, and 22q12.1 for VCDR. ATOH7 was also associated with VCDR independent of optic disc area. Three of the loci were marginally associated with open-angle glaucoma. The protein pathways in which the loci of optic disc area are involved overlap with those identified for VCDR, suggesting a common genetic origin.

  5. Genome-wide transcriptional reprogramming under drought stress

    KAUST Repository

    Chen, Hao

    2012-01-01

    Soil water deficit is one of the major factors limiting plant productivity. Plants cope with this adverse environmental condition by coordinating the up- or downregulation of an array of stress responsive genes. Reprogramming the expression of these genes leads to rebalanced development and growth that are in concert with the reduced water availability and that ultimately confer enhanced stress tolerance. Currently, several techniques have been employed to monitor genome-wide transcriptional reprogramming under drought stress. The results from these high throughput studies indicate that drought stress-induced transcriptional reprogramming is dynamic, has temporal and spatial specificity, and is coupled with the circadian clock and phytohormone signaling pathways. © 2012 Springer-Verlag Berlin Heidelberg. All rights are reserved.

  6. Genome-wide measurement of RNA folding energies.

    Science.gov (United States)

    Wan, Yue; Qu, Kun; Ouyang, Zhengqing; Kertesz, Michael; Li, Jun; Tibshirani, Robert; Makino, Debora L; Nutter, Robert C; Segal, Eran; Chang, Howard Y

    2012-10-26

    RNA structural transitions are important in the function and regulation of RNAs. Here, we reveal a layer of transcriptome organization in the form of RNA folding energies. By probing yeast RNA structures at different temperatures, we obtained relative melting temperatures (Tm) for RNA structures in over 4000 transcripts. Specific signatures of RNA Tm demarcated the polarity of mRNA open reading frames and highlighted numerous candidate regulatory RNA motifs in 3' untranslated regions. RNA Tm distinguished noncoding versus coding RNAs and identified mRNAs with distinct cellular functions. We identified thousands of putative RNA thermometers, and their presence is predictive of the pattern of RNA decay in vivo during heat shock. The exosome complex recognizes unpaired bases during heat shock to degrade these RNAs, coupling intrinsic structural stabilities to gene regulation. Thus, genome-wide structural dynamics of RNA can parse functional elements of the transcriptome and reveal diverse biological insights.

  7. Genome-wide association studies in pediatric chronic kidney disease.

    Science.gov (United States)

    Gupta, Jayanta; Kanetsky, Peter A; Wuttke, Matthias; Köttgen, Anna; Schaefer, Franz; Wong, Craig S

    2016-08-01

    The genome-wide association study (GWAS) has become an established scientific method that provides an unbiased screen for genetic loci potentially associated with phenotypes of clinical interest, such as chronic kidney disease (CKD). Thus, GWAS provides opportunities to gain new perspectives regarding the genetic architecture of CKD progression by identifying new candidate genes and targets for intervention. As such, it has become an important arm of translational science providing a complementary line of investigation to identify novel therapeutics to treat CKD. In this review, we describe the method and the challenges of performing GWAS in the pediatric CKD population. We also provide an overview of successful GWAS for kidney disease, and we discuss the established pediatric CKD cohorts in North America and Europe that are poised to identify genetic risk variants associated with CKD progression.

  8. Genome-wide genetic changes during modern breeding of maize.

    Science.gov (United States)

    Jiao, Yinping; Zhao, Hainan; Ren, Longhui; Song, Weibin; Zeng, Biao; Guo, Jinjie; Wang, Baobao; Liu, Zhipeng; Chen, Jing; Li, Wei; Zhang, Mei; Xie, Shaojun; Lai, Jinsheng

    2012-06-03

    The success of modern maize breeding has been demonstrated by remarkable increases in productivity over the last four decades. However, the underlying genetic changes correlated with these gains remain largely unknown. We report here the sequencing of 278 temperate maize inbred lines from different stages of breeding history, including deep resequencing of 4 lines with known pedigree information. The results show that modern breeding has introduced highly dynamic genetic changes into the maize genome. Artificial selection has affected thousands of targets, including genes and non-genic regions, leading to a reduction in nucleotide diversity and an increase in the proportion of rare alleles. Genetic changes during breeding happen rapidly, with extensive variation (SNPs, indels and copy-number variants (CNVs)) occurring, even within identity-by-descent regions. Our genome-wide assessment of genetic changes during modern maize breeding provides new strategies as well as practical targets for future crop breeding and biotechnology.

  9. DNA Break Mapping Reveals Topoisomerase II Activity Genome-Wide

    Directory of Open Access Journals (Sweden)

    Laura Baranello

    2014-07-01

    Full Text Available Genomic DNA is under constant assault by endogenous and exogenous DNA damaging agents. DNA breakage can represent a major threat to genome integrity but can also be necessary for genome function. Here we present approaches to map DNA double-strand breaks (DSBs and single-strand breaks (SSBs at the genome-wide scale by two methods called DSB- and SSB-Seq, respectively. We tested these methods in human colon cancer cells and validated the results using the Topoisomerase II (Top2-poisoning agent etoposide (ETO. Our results show that the combination of ETO treatment with break-mapping techniques is a powerful method to elaborate the pattern of Top2 enzymatic activity across the genome.

  10. Chapter 10: Mining genome-wide genetic markers.

    Directory of Open Access Journals (Sweden)

    Xiang Zhang

    Full Text Available Genome-wide association study (GWAS aims to discover genetic factors underlying phenotypic traits. The large number of genetic factors poses both computational and statistical challenges. Various computational approaches have been developed for large scale GWAS. In this chapter, we will discuss several widely used computational approaches in GWAS. The following topics will be covered: (1 An introduction to the background of GWAS. (2 The existing computational approaches that are widely used in GWAS. This will cover single-locus, epistasis detection, and machine learning methods that have been recently developed in biology, statistic, and computer science communities. This part will be the main focus of this chapter. (3 The limitations of current approaches and future directions.

  11. High-resolution genome-wide mapping of histone modifications.

    Science.gov (United States)

    Roh, Tae-young; Ngau, Wing Chi; Cui, Kairong; Landsman, David; Zhao, Keji

    2004-08-01

    The expression patterns of eukaryotic genomes are controlled by their chromatin structure, consisting of nucleosome subunits in which DNA of approximately 146 bp is wrapped around a core of 8 histone molecules. Post-translational histone modifications play an essential role in modifying chromatin structure. Here we apply a combination of SAGE and chromatin immunoprecipitation (ChIP) protocols to determine the distribution of hyperacetylated histones H3 and H4 in the Saccharomyces cerevisiae genome. We call this approach genome-wide mapping technique (GMAT). Using GMAT, we find that the highest acetylation levels are detected in the 5' end of a gene's coding region, but not in the promoter. Furthermore, we show that the histone acetyltransferase, GCN5p, regulates H3 acetylation in the promoter and 5' end of the coding regions. These findings indicate that GMAT should find valuable applications in mapping target sites of chromatin-modifying enzymes.

  12. Genome-Wide Association Study of Coronary Artery Disease

    Directory of Open Access Journals (Sweden)

    Naomi Ogawa

    2010-01-01

    Full Text Available Coronary artery disease (CAD is a multifactorial disease with environmental and genetic determinants. The genetic determinants of CAD have previously been explored by the candidate gene approach. Recently, the data from the International HapMap Project and the development of dense genotyping chips have enabled us to perform genome-wide association studies (GWAS on a large number of subjects without bias towards any particular candidate genes. In 2007, three chip-based GWAS simultaneously revealed the significant association between common variants on chromosome 9p21 and CAD. This association was replicated among other ethnic groups and also in a meta-analysis. Further investigations have detected several other candidate loci associated with CAD. The chip-based GWAS approach has identified novel and unbiased genetic determinants of CAD and these insights provide the important direction to better understand the pathogenesis of CAD and to develop new and improved preventive measures and treatments for CAD.

  13. Genome-wide expression profiling of complex regional pain syndrome.

    Directory of Open Access Journals (Sweden)

    Eun-Heui Jin

    Full Text Available Complex regional pain syndrome (CRPS is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II and 5 controls (cut-off value: 1.5-fold change and p<0.05. Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1, matrix metalloproteinase 9 (MMP9, alanine aminopeptidase N (ANPEP, l-histidine decarboxylase (HDC, granulocyte colony-stimulating factor 3 receptor (G-CSF3R, and signal transducer and activator of transcription 3 (STAT3 genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR. We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10(-4. The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression.

  14. A genome-wide association study of anorexia nervosa

    Science.gov (United States)

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Landt, Margarita CT Slof-Op t; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O'Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2015-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10-7) in SOX2OT and rs17030795 (P=5.84×10-6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10-6) between CUL3 and FAM124B and rs1886797 (P=8.05×10-6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4×10-6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:24514567

  15. Genome-wide association studies in asthma: progress and pitfalls

    Directory of Open Access Journals (Sweden)

    March ME

    2015-01-01

    Full Text Available Michael E March,1 Patrick MA Sleiman,1,2 Hakon Hakonarson1,2 1Center for Applied Genomics, Children's Hospital of Philadelphia Research Institute, 2Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Abstract: Genetic studies of asthma have revealed that there is considerable heritability to the phenotype. An extensive history of candidate-gene studies has identified a long list of genes associated with immune function that are potentially involved in asthma pathogenesis. However, many of the results of candidate-gene studies have failed to be replicated, leaving in question the true impact of the implicated biological pathways on asthma. With the advent of genome-wide association studies, geneticists are able to examine the association of hundreds of thousands of genetic markers with a phenotype, allowing the hypothesis-free identification of variants associated with disease. Many such studies examining asthma or related phenotypes have been published, and several themes have begun to emerge regarding the biological pathways underpinning asthma. The results of many genome-wide association studies have currently not been replicated, and the large sample sizes required for this experimental strategy invoke difficulties with sample stratification and phenotypic heterogeneity. Recently, large collaborative groups of researchers have formed consortia focused on asthma, with the goals of sharing material and data and standardizing diagnosis and experimental methods. Additionally, research has begun to focus on genetic variants that affect the response to asthma medications and on the biology that generates the heterogeneity in the asthma phenotype. As this work progresses, it will move asthma patients closer to more specific, personalized medicine. Keywords: asthma, genetics, GWAS, pharmacogenetics, biomarkers

  16. Genome-Wide Association Study of Meiotic Recombination Phenotypes

    Science.gov (United States)

    Begum, Ferdouse; Chowdhury, Reshmi; Cheung, Vivian G.; Sherman, Stephanie L.; Feingold, Eleanor

    2016-01-01

    Meiotic recombination is an essential step in gametogenesis, and is one that also generates genetic diversity. Genome-wide association studies (GWAS) and molecular studies have identified genes that influence of human meiotic recombination. RNF212 is associated with total or average number of recombination events, and PRDM9 is associated with the locations of hotspots, or sequences where crossing over appears to cluster. In addition, a common inversion on chromosome 17 is strongly associated with recombination. Other genes have been identified by GWAS, but those results have not been replicated. In this study, using new datasets, we characterized additional recombination phenotypes to uncover novel candidates and further dissect the role of already known loci. We used three datasets totaling 1562 two-generation families, including 3108 parents with 4304 children. We estimated five different recombination phenotypes including two novel phenotypes (average recombination counts within recombination hotspots and outside of hotspots) using dense SNP array genotype data. We then performed gender-specific and combined-sex genome-wide association studies (GWAS) meta-analyses. We replicated associations for several previously reported recombination genes, including RNF212 and PRDM9. By looking specifically at recombination events outside of hotspots, we showed for the first time that PRDM9 has different effects in males and females. We identified several new candidate loci, particularly for recombination events outside of hotspots. These include regions near the genes SPINK6, EVC2, ARHGAP25, and DLGAP2. This study expands our understanding of human meiotic recombination by characterizing additional features that vary across individuals, and identifying regulatory variants influencing the numbers and locations of recombination events. PMID:27733454

  17. Single nucleotide polymorphism discovery in rainbow trout by deep sequencing of a reduced representation library

    Directory of Open Access Journals (Sweden)

    Salem Mohamed

    2009-11-01

    Full Text Available Abstract Background To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs have been used for single nucleotide polymorphism (SNP discovery in salmonids. In those strategies, the salmonid semi-tetraploid genomes often led to assemblies of paralogous sequences and therefore resulted in a high rate of false positive SNP identification. Sequencing genomic DNA using primers identified from ESTs proved to be an effective but time consuming methodology of SNP identification in rainbow trout, therefore not suitable for high throughput SNP discovery. In this study, we employed a high-throughput strategy that used pyrosequencing technology to generate data from a reduced representation library constructed with genomic DNA pooled from 96 unrelated rainbow trout that represent the National Center for Cool and Cold Water Aquaculture (NCCCWA broodstock population. Results The reduced representation library consisted of 440 bp fragments resulting from complete digestion with the restriction enzyme HaeIII; sequencing produced 2,000,000 reads providing an average 6 fold coverage of the estimated 150,000 unique genomic restriction fragments (300,000 fragment ends. Three independent data analyses identified 22,022 to 47,128 putative SNPs on 13,140 to 24,627 independent contigs. A set of 384 putative SNPs, randomly selected from the sets produced by the three analyses were genotyped on individual fish to determine the validation rate of putative SNPs among analyses, distinguish apparent SNPs that actually represent paralogous loci in the tetraploid genome, examine Mendelian segregation, and place the validated SNPs on the rainbow trout linkage map. Approximately 48% (183 of the putative SNPs were validated; 167 markers were successfully incorporated into the rainbow trout linkage map. In

  18. Meta-analysis of genome-wide association studies in African Americans provides insights into the genetic architecture of type 2 diabetes

    DEFF Research Database (Denmark)

    Ng, Maggie C Y; Shriner, Daniel; Chen, Brian H

    2014-01-01

    Type 2 diabetes (T2D) is more prevalent in African Americans than in Europeans. However, little is known about the genetic risk in African Americans despite the recent identification of more than 70 T2D loci primarily by genome-wide association studies (GWAS) in individuals of European ancestry....... In order to investigate the genetic architecture of T2D in African Americans, the MEta-analysis of type 2 DIabetes in African Americans (MEDIA) Consortium examined 17 GWAS on T2D comprising 8,284 cases and 15,543 controls in African Americans in stage 1 analysis. Single nucleotide polymorphisms (SNPs...... × 10(-8), odds ratio (OR)  = 1.09 to 1.36). Fine-mapping revealed that 88 of 158 previously identified T2D or glucose homeostasis loci demonstrated nominal to highly significant association (2.2 × 10(-23)

  19. An endophenotype approach to the genetics of alcohol dependence: a genome wide association study of fast beta EEG in families of African ancestry.

    Science.gov (United States)

    Meyers, J L; Zhang, J; Wang, J C; Su, J; Kuo, S I; Kapoor, M; Wetherill, L; Bertelsen, S; Lai, D; Salvatore, J E; Kamarajan, C; Chorlian, D; Agrawal, A; Almasy, L; Bauer, L; Bucholz, K K; Chan, G; Hesselbrock, V; Koganti, L; Kramer, J; Kuperman, S; Manz, N; Pandey, A; Seay, M; Scott, D; Taylor, R E; Dick, D M; Edenberg, H J; Goate, A; Foroud, T; Porjesz, B

    2017-01-10

    Fast beta (20-28 Hz) electroencephalogram (EEG) oscillatory activity may be a useful endophenotype for studying the genetics of disorders characterized by neural hyperexcitability, including substance use disorders (SUDs). However, the genetic underpinnings of fast beta EEG have not previously been studied in a population of African-American ancestry (AA). In a sample of 2382 AA individuals from 482 families drawn from the Collaborative Study on the Genetics of Alcoholism (COGA), we performed a genome-wide association study (GWAS) on resting-state fast beta EEG power. To further characterize our genetic findings, we examined the functional and clinical/behavioral significance of GWAS variants. Ten correlated single-nucleotide polymorphisms (SNPs) (r(2)>0.9) located in an intergenic region on chromosome 3q26 were associated with fast beta EEG power at PMolecular Psychiatry advance online publication, 10 January 2017; doi:10.1038/mp.2016.239.

  20. Comparative analysis of genome-wide divergence, domestication footprints and genome-wide association study of root traits for Gossypium hirsutum and Gossypium barbadense

    Science.gov (United States)

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...

  1. ChIP on SNP-chip for genome-wide analysis of human histone H4 hyperacetylation

    Directory of Open Access Journals (Sweden)

    Porter Christopher J

    2007-09-01

    Full Text Available Abstract Background SNP microarrays are designed to genotype Single Nucleotide Polymorphisms (SNPs. These microarrays report hybridization of DNA fragments and therefore can be used for the purpose of detecting genomic fragments. Results Here, we demonstrate that a SNP microarray can be effectively used in this way to perform chromatin immunoprecipitation (ChIP on chip as an alternative to tiling microarrays. We illustrate this novel application by mapping whole genome histone H4 hyperacetylation in human myoblasts and myotubes. We detect clusters of hyperacetylated histone H4, often spanning across up to 300 kilobases of genomic sequence. Using complementary genome-wide analyses of gene expression by DNA microarray we demonstrate that these clusters of hyperacetylated histone H4 tend to be associated with expressed genes. Conclusion The use of a SNP array for a ChIP-on-chip application (ChIP on SNP-chip will be of great value to laboratories whose interest is the determination of general rules regarding the relationship of specific chromatin modifications to transcriptional status throughout the genome and to examine the asymmetric modification of chromatin at heterozygous loci.

  2. Influence of Feature Encoding and Choice of Classifier on Disease Risk Prediction in Genome-Wide Association Studies.

    Directory of Open Access Journals (Sweden)

    Florian Mittag

    Full Text Available Various attempts have been made to predict the individual disease risk based on genotype data from genome-wide association studies (GWAS. However, most studies only investigated one or two classification algorithms and feature encoding schemes. In this study, we applied seven different classification algorithms on GWAS case-control data sets for seven different diseases to create models for disease risk prediction. Further, we used three different encoding schemes for the genotypes of single nucleotide polymorphisms (SNPs and investigated their influence on the predictive performance of these models. Our study suggests that an additive encoding of the SNP data should be the preferred encoding scheme, as it proved to yield the best predictive performances for all algorithms and data sets. Furthermore, our results showed that the differences between most state-of-the-art classification algorithms are not statistically significant. Consequently, we recommend to prefer algorithms with simple models like the linear support vector machine (SVM as they allow for better subsequent interpretation without significant loss of accuracy.

  3. Accounting for selection and correlation in the analysis of two-stage genome-wide association studies.

    Science.gov (United States)

    Robertson, David S; Prevost, A Toby; Bowden, Jack

    2016-10-01

    The problem of selection bias has long been recognized in the analysis of two-stage trials, where promising candidates are selected in stage 1 for confirmatory analysis in stage 2. To efficiently correct for bias, uniformly minimum variance conditionally unbiased estimators (UMVCUEs) have been proposed for a wide variety of trial settings, but where the population parameter estimates are assumed to be independent. We relax this assumption and derive the UMVCUE in the multivariate normal setting with an arbitrary known covariance structure. One area of application is the estimation of odds ratios (ORs) when combining a genome-wide scan with a replication study. Our framework explicitly accounts for correlated single nucleotide polymorphisms, as might occur due to linkage disequilibrium. We illustrate our approach on the measurement of the association between 11 genetic variants and the risk of Crohn's disease, as reported in Parkes and others (2007. Sequence variants in the autophagy gene IRGM and multiple other replicating loci contribute to Crohn's disease susceptibility. Nat. Gen. 39: (7), 830-832.), and show that the estimated ORs can vary substantially if both selection and correlation are taken into account.

  4. Characterization of genome-wide SNPs for the water flea Daphnia pulicaria generated by genotyping-by-sequencing (GBS)

    Science.gov (United States)

    Muñoz, Joaquín; Chaturvedi, Anurag; De Meester, Luc; Weider, Lawrence J.

    2016-01-01

    The keystone aquatic herbivore Daphnia has been studied for more than 150 years in the context of evolution, ecology and ecotoxicology. Although it is rapidly becoming an emergent model for environmental and population genomics, there have been limited genome-wide level studies in natural populations. We report a unique resource of novel Single Nucleotide Polymorphic (SNP) markers for Daphnia pulicaria using the reduction in genomic complexity with the restriction enzymes approach, genotyping-by-sequencing. Using the genome of D. pulex as a reference, SNPs were scored for 53 clones from five natural populations that varied in lake trophic status. Our analyses resulted in 32,313 highly confident and bi-allelic SNP markers. 1,364 outlier SNPs were mapped on the annotated D. pulex genome, which identified 2,335 genes, including 565 within functional genes. Out of 885 EuKaryotic Orthologous Groups that we found from outlier SNPs, 294 were involved in three metabolic and four regulatory pathways. Bayesian-clustering analyses showed two distinct population clusters representing the possible combined effects of geography and lake trophic status. Our results provide an invaluable tool for future population genomics surveys in Daphnia targeting informative regions related to physiological processes that can be linked to the ecology of this emerging eco-responsive taxon. PMID:27346179

  5. A genome-wide analysis of the response to inhaled β2-agonists in chronic obstructive pulmonary disease.

    Science.gov (United States)

    Hardin, M; Cho, M H; McDonald, M-L; Wan, E; Lomas, D A; Coxson, H O; MacNee, W; Vestbo, J; Yates, J C; Agusti, A; Calverley, P M A; Celli, B; Crim, C; Rennard, S; Wouters, E; Bakke, P; Bhatt, S P; Kim, V; Ramsdell, J; Regan, E A; Make, B J; Hokanson, J E; Crapo, J D; Beaty, T H; Hersh, C P

    2016-08-01

    Short-acting β2-agonist bronchodilators are the most common medications used in treating chronic obstructive pulmonary disease (COPD). Genetic variants determining bronchodilator responsiveness (BDR) in COPD have not been identified. We performed a genome-wide association study (GWAS) of BDR in 5789 current or former smokers with COPD in one African-American and four white populations. BDR was defined as the quantitative spirometric response to inhaled β2-agonists. We combined results in a meta-analysis. In the meta-analysis, single-nucleotide polymorphisms (SNPs) in the genes KCNK1 (P=2.02 × 10(-7)) and KCNJ2 (P=1.79 × 10(-7)) were the top associations with BDR. Among African Americans, SNPs in CDH13 were significantly associated with BDR (P=5.1 × 10(-9)). A nominal association with CDH13 was identified in a gene-based analysis in all subjects. We identified suggestive association with BDR among COPD subjects for variants near two potassium channel genes (KCNK1 and KCNJ2). SNPs in CDH13 were significantly associated with BDR in African Americans.The Pharmacogenomics Journal advance online publication, 27 October 2015; doi:10.1038/tpj.2015.65.

  6. Characterization of genome-wide SNPs for the water flea Daphnia pulicaria generated by genotyping-by-sequencing (GBS).

    Science.gov (United States)

    Muñoz, Joaquín; Chaturvedi, Anurag; De Meester, Luc; Weider, Lawrence J

    2016-06-27

    The keystone aquatic herbivore Daphnia has been studied for more than 150 years in the context of evolution, ecology and ecotoxicology. Although it is rapidly becoming an emergent model for environmental and population genomics, there have been limited genome-wide level studies in natural populations. We report a unique resource of novel Single Nucleotide Polymorphic (SNP) markers for Daphnia pulicaria using the reduction in genomic complexity with the restriction enzymes approach, genotyping-by-sequencing. Using the genome of D. pulex as a reference, SNPs were scored for 53 clones from five natural populations that varied in lake trophic status. Our analyses resulted in 32,313 highly confident and bi-allelic SNP markers. 1,364 outlier SNPs were mapped on the annotated D. pulex genome, which identified 2,335 genes, including 565 within functional genes. Out of 885 EuKaryotic Orthologous Groups that we found from outlier SNPs, 294 were involved in three metabolic and four regulatory pathways. Bayesian-clustering analyses showed two distinct population clusters representing the possible combined effects of geography and lake trophic status. Our results provide an invaluable tool for future population genomics surveys in Daphnia targeting informative regions related to physiological processes that can be linked to the ecology of this emerging eco-responsive taxon.

  7. Genome-wide association study reveals putative regulators of bioenergy traits in Populus deltoides

    Energy Technology Data Exchange (ETDEWEB)

    Fahrenkrog, Annette M. [School of Forest Resources and Conservation, University of Florida, PO Box 110410 Gainesville FL 32611 USA; Plant Molecular and Cellular Biology Graduate Program, University of Florida, PO Box 110690 Gainesville FL 32610 USA; Neves, Leandro G. [School of Forest Resources and Conservation, University of Florida, PO Box 110410 Gainesville FL 32611 USA; Plant Molecular and Cellular Biology Graduate Program, University of Florida, PO Box 110690 Gainesville FL 32610 USA; Resende, Márcio F. R. [School of Forest Resources and Conservation, University of Florida, PO Box 110410 Gainesville FL 32611 USA; Genetics and Genomics Graduate Program, University of Florida, PO Box 103610 Gainesville FL 32610 USA; Vazquez, Ana I. [Department of Epidemiology and Biostatistics, Michigan State University, 909 Fee Road East Lansing MI 48824 USA; de los Campos, Gustavo [Department of Epidemiology and Biostatistics, Michigan State University, 909 Fee Road East Lansing MI 48824 USA; Statistics Department, Michigan State University, 619 Red Cedar Road MI 48824 USA; Dervinis, Christopher [School of Forest Resources and Conservation, University of Florida, PO Box 110410 Gainesville FL 32611 USA; Sykes, Robert [National Renewable Energy Laboratory, 15013 Denver West Parkway Golden CO 80401 USA; Davis, Mark [National Renewable Energy Laboratory, 15013 Denver West Parkway Golden CO 80401 USA; Davenport, Ruth [Biology Department, University of Florida, PO Box 118525 Gainesville FL 32611 USA; Barbazuk, William B. [Plant Molecular and Cellular Biology Graduate Program, University of Florida, PO Box 110690 Gainesville FL 32610 USA; Biology Department, University of Florida, PO Box 118525 Gainesville FL 32611 USA; University of Florida Genetics Institute, University of Florida, PO Box 103610 Gainesville FL 32611 USA; Kirst, Matias [School of Forest Resources and Conservation, University of Florida, PO Box 110410 Gainesville FL 32611 USA; Plant Molecular and Cellular Biology Graduate Program, University of Florida, PO Box 110690 Gainesville FL 32610 USA; University of Florida Genetics Institute, University of Florida, PO Box 103610 Gainesville FL 32611 USA

    2016-09-06

    Genome-wide association studies (GWAS) have been used extensively to dissect the genetic regulation of complex traits in plants. These studies have focused largely on the analysis of common genetic variants despite the abundance of rare polymorphisms in several species, and their potential role in trait variation. Here, we conducted the first GWAS in Populus deltoides, a genetically diverse keystone forest species in North America and an important short rotation woody crop for the bioenergy industry. We searched for associations between eight growth and wood composition traits, and common and low-frequency single-nucleotide polymorphisms detected by targeted resequencing of 18 153 genes in a population of 391 unrelated individuals. To increase power to detect associations with low-frequency variants, multiple-marker association tests were used in combination with single-marker association tests. Significant associations were discovered for all phenotypes and are indicative that low-frequency polymorphisms contribute to phenotypic variance of several bioenergy traits. Our results suggest that both common and low-frequency variants need to be considered for a comprehensive understanding of the genetic regulation of complex traits, particularly in species that carry large numbers of rare polymorphisms. These polymorphisms may be critical for the development of specialized plant feedstocks for bioenergy.

  8. A genome-wide association study identifies variants in KCNIP4 associated with ACE inhibitor-induced cough.

    Science.gov (United States)

    Mosley, J D; Shaffer, C M; Van Driest, S L; Weeke, P E; Wells, Q S; Karnes, J H; Velez Edwards, D R; Wei, W-Q; Teixeira, P L; Bastarache, L; Crawford, D C; Li, R; Manolio, T A; Bottinger, E P; McCarty, C A; Linneman, J G; Brilliant, M H; Pacheco, J A; Thompson, W; Chisholm, R L; Jarvik, G P; Crosslin, D R; Carrell, D S; Baldwin, E; Ralston, J; Larson, E B; Grafton, J; Scrol, A; Jouni, H; Kullo, I J; Tromp, G; Borthwick, K M; Kuivaniemi, H; Carey, D J; Ritchie, M D; Bradford, Y; Verma, S S; Chute, C G; Veluchamy, A; Siddiqui, M K; Palmer, C N A; Doney, A; MahmoudPour, S H; Maitland-van der Zee, A H; Morris, A D; Denny, J C; Roden, D M

    2016-06-01

    The most common side effect of angiotensin-converting enzyme inhibitor (ACEi) drugs is cough. We conducted a genome-wide association study (GWAS) of ACEi-induced cough among 7080 subjects of diverse ancestries in the Electronic Medical Records and Genomics (eMERGE) network. Cases were subjects diagnosed with ACEi-induced cough. Controls were subjects with at least 6 months of ACEi use and no cough. A GWAS (1595 cases and 5485 controls) identified associations on chromosome 4 in an intron of KCNIP4. The strongest association was at rs145489027 (minor allele frequency=0.33, odds ratio (OR)=1.3 (95% confidence interval (CI): 1.2-1.4), P=1.0 × 10(-8)). Replication for six single-nucleotide polymorphisms (SNPs) in KCNIP4 was tested in a second eMERGE population (n=926) and in the Genetics of Diabetes Audit and Research in Tayside, Scotland (GoDARTS) cohort (n=4309). Replication was observed at rs7675300 (OR=1.32 (1.01-1.70), P=0.04) in eMERGE and at rs16870989 and rs1495509 (OR=1.15 (1.01-1.30), P=0.03 for both) in GoDARTS. The combined association at rs1495509 was significant (OR=1.23 (1.15-1.32), P=1.9 × 10(-9)). These results indicate that SNPs in KCNIP4 may modulate ACEi-induced cough risk.

  9. Genome-wide association implicates numerous genes underlying ecological trait variation in natural populations of Populus trichocarpa.

    Science.gov (United States)

    McKown, Athena D; Klápště, Jaroslav; Guy, Robert D; Geraldes, Armando; Porth, Ilga; Hannemann, Jan; Friedmann, Michael; Muchero, Wellington; Tuskan, Gerald A; Ehlting, Jürgen; Cronk, Quentin C B; El-Kassaby, Yousry A; Mansfield, Shawn D; Douglas, Carl J

    2014-07-01

    In order to uncover the genetic basis of phenotypic trait variation, we used 448 unrelated wild accessions of black cottonwood (Populus trichocarpa) from much of its range in western North America. Extensive data from large-scale trait phenotyping (with spatial and temporal replications within a common garden) and genotyping (with a 34 K Populus single nucleotide polymorphism (SNP) array) of all accessions were used for gene discovery in a genome-wide association study (GWAS). We performed GWAS with 40 biomass, ecophysiology and phenology traits and 29,355 filtered SNPs representing 3518 genes. The association analyses were carried out using a Unified Mixed Model accounting for population structure effects among accessions. We uncovered 410 significant SNPs using a Bonferroni-corrected threshold (P<1.7×10(-6)). Markers were found across 19 chromosomes, explained 1-13% of trait variation, and implicated 275 unique genes in trait associations. Phenology had the largest number of associated genes (240 genes), followed by biomass (53 genes) and ecophysiology traits (25 genes). The GWAS results propose numerous loci for further investigation. Many traits had significant associations with multiple genes, underscoring their genetic complexity. Genes were also identified with multiple trait associations within and/or across trait categories. In some cases, traits were genetically correlated while in others they were not.

  10. The role of height-associated loci identified in genome wide association studies in the determination of pediatric stature

    Directory of Open Access Journals (Sweden)

    Frackelton Edward C

    2010-06-01

    Full Text Available Abstract Background Human height is considered highly heritable and correlated with certain disorders, such as type 2 diabetes and cancer. Despite environmental influences, genetic factors are known to play an important role in stature determination. A number of genetic determinants of adult height have already been established through genome wide association studies. Methods To examine 51 single nucleotide polymorphisms (SNPs corresponding to the 46 previously reported genomic loci for height in 8,184 European American children with height measurements. We leveraged genotyping data from our ongoing GWA study of height variation in children in order to query the 51 SNPs in this pediatric cohort. Results Sixteen of these SNPs yielded at least nominally significant association to height, representing fifteen different loci including EFEMP1-PNPT1, GPR126, C6orf173, SPAG17, Histone class 1, HLA class III and GDF5-UQCC. Other loci revealed no evidence for association, including HMGA1 and HMGA2. For the 16 associated variants, the genotype score explained 1.64% of the total variation for height z-score. Conclusion Among 46 loci that have been reported to associate with adult height to date, at least 15 also contribute to the determination of height in childhood.

  11. Evaluation of results from genome-wide studies of language and reading in a novel independent dataset.

    Science.gov (United States)

    Carrion-Castillo, A; van Bergen, E; Vino, A; van Zuijen, T; de Jong, P F; Francks, C; Fisher, S E

    2016-07-01

    Recent genome-wide association scans (GWAS) for reading and language abilities have pin-pointed promising new candidate loci. However, the potential contributions of these loci remain to be validated. In this study, we tested 17 of the most significantly associated single nucleotide polymorphisms (SNPs) from these GWAS studies (P < 10(-6) in the original studies) in a new independent population dataset from the Netherlands: known as Familial Influences on Literacy Abilities. This dataset comprised 483 children from 307 nuclear families and 505 adults (including parents of participating children), and provided adequate statistical power to detect the effects that were previously reported. The following measures of reading and language performance were collected: word reading fluency, nonword reading fluency, phonological awareness and rapid automatized naming. Two SNPs (rs12636438 and rs7187223) were associated with performance in multivariate and univariate testing, but these did not remain significant after correction for multiple testing. Another SNP (rs482700) was only nominally associated in the multivariate test. For the rest of the SNPs, we did not find supportive evidence of association. The findings may reflect differences between our study and the previous investigations with respect to the language of testing, the exact tests used and the recruitment criteria. Alternatively, most of the prior reported associations may have been false positives. A larger scale GWAS meta-analysis than those previously performed will likely be required to obtain robust insights into the genomic architecture underlying reading and language.

  12. Pathway analysis of expression-related SNPs on genome-wide association study of basal cell carcinoma

    Science.gov (United States)

    Li, Xin; Liang, Liming; De Vivo, Immaculata; Tang, Jean Y.; Han, Jiali

    2016-01-01

    Genome-wide association studies (GWASs) have primarily focused on the association between individual genetic markers and risk of disease. We applied a novel approach that integrates skin expression-related single-nucleotide polymorphisms (eSNPs) and pathway analysis for GWAS of basal cell carcinoma (BCC) to identify potential novel biological pathways. We evaluated the associations between 70,932 skin eSNPs and risk of BCC among 2,323 cases and 7,275 controls of European ancestry, and then assigned them to the pathways defined by KEGG, GO, and BioCarta databases. Three KEGG pathways (colorectal cancer, actin cytoskeleton, and BCC), two GO pathways (cellular component disassembly in apoptosis, and nucleus organization), and four BioCarta pathways (Ras signaling, T cell receptor signaling, natural killer cell-mediated cytotoxicity, and links between Pyk2 and Map Kinases) showed significant association with BCC risk with p-value<0.05 and FDR<0.2. These pathways also ranked at top in sensitivity analyses. Two positive controls in KEGG, the hedgehog pathway and the BCC pathway, showed significant association with BCC risk in both main and sensitivity analyses. Our results indicate that SNPs that are undetectable by conventional GWASs are significantly associated with BCC when tested as pathways. Biological studies of these gene groups suggest their potential roles in the etiology of BCC. PMID:27367190

  13. Phenotype definition is a main point in genome-wide association studies for bovine Mycobacterium avium ssp. paratuberculosis infection status.

    Science.gov (United States)

    Küpper, J; Brandt, H; Donat, K; Erhardt, G

    2014-10-01

    Paratuberculosis caused by Mycobacterium avium ssp. paratuberculosis (MAP) causes economic losses and is present in dairy herds worldwide. Different studies used different diagnostic tests to detect infection status and are the basis of genome-wide association (GWA) studies with inconsistent results. Therefore, the aim of this study was to identify and compare genomic regions associated with MAP susceptibility in the same cohort of cattle using different diagnostic tests. The GWA study was performed in German Holsteins within a case-control assay using 305 cows tested for MAP by fecal culture and additional with four different commercial ELISA-tests. Genotyping was performed with the Illumina Bovine SNP50 BeadChip. The results using fecal culture or ELISA test led to the identification of different genetic loci. Two single-nucleotide polymorphisms showed significant association with the ELISA-status. However, no significant association for MAP infection could be confirmed. Our results show that the definition of the MAP-phenotype has an important impact on the outcome of GWA studies for paratuberculosis.

  14. Genome-wide high-resolution mapping of UV-induced mitotic recombination events in Saccharomyces cerevisiae.

    Directory of Open Access Journals (Sweden)

    Yi Yin

    2013-10-01

    Full Text Available In the yeast Saccharomyces cerevisiae and most other eukaryotes, mitotic recombination is important for the repair of double-stranded DNA breaks (DSBs. Mitotic recombination between homologous chromosomes can result in loss of heterozygosity (LOH. In this study, LOH events induced by ultraviolet (UV light are mapped throughout the genome to a resolution of about 1 kb using single-nucleotide polymorphism (SNP microarrays. UV doses that have little effect on the viability of diploid cells stimulate crossovers more than 1000-fold in wild-type cells. In addition, UV stimulates recombination in G1-synchronized cells about 10-fold more efficiently than in G2-synchronized cells. Importantly, at high doses of UV, most conversion events reflect the repair of two sister chromatids that are broken at approximately the same position whereas at low doses, most conversion events reflect the repair of a single broken chromatid. Genome-wide mapping of about 380 unselected crossovers, break-induced replication (BIR events, and gene conversions shows that UV-induced recombination events occur throughout the genome without pronounced hotspots, although the ribosomal RNA gene cluster has a significantly lower frequency of crossovers.

  15. Genome-Wide Association and Transcriptome Analyses Reveal Candidate Genes Underlying Yield-determining Traits in Brassica napus

    Science.gov (United States)

    Lu, Kun; Peng, Liu; Zhang, Chao; Lu, Junhua; Yang, Bo; Xiao, Zhongchun; Liang, Ying; Xu, Xingfu; Qu, Cunmin; Zhang, Kai; Liu, Liezhao; Zhu, Qinlong; Fu, Minglian; Yuan, Xiaoyan; Li, Jiana

    2017-01-01

    Yield is one of the most important yet complex crop traits. To improve our understanding of the genetic basis of yield establishment, and to identify candidate genes responsible for yield improvement in Brassica napus, we performed genome-wide association studies (GWAS) for seven yield-determining traits [main inflorescence pod number (MIPN), branch pod number (BPN), pod number per plant (PNP), seed number per pod (SPP), thousand seed weight, main inflorescence yield (MIY), and branch yield], using data from 520 diverse B. napus accessions from two different yield environments. In total, we detected 128 significant single nucleotide polymorphisms (SNPs), 93 of which were revealed as novel by integrative analysis. A combination of GWAS and transcriptome sequencing on 21 haplotype blocks from samples pooled by four extremely high-yielding or low-yielding accessions revealed the differential expression of 14 crucial candiate genes (such as Bna.MYB83, Bna.SPL5, and Bna.ROP3) associated with multiple traits or containing multiple SNPs associated with the same trait. Functional annotation and expression pattern analyses further demonstrated that these 14 candiate genes might be important in developmental processes and biomass accumulation, thus affecting the yield establishment of B. napus. These results provide valuable information for understanding the genetic mechanisms underlying the establishment of high yield in B. napus, and lay the foundation for developing high-yielding B. napus varieties. PMID:28261256

  16. Goldsurfer2 (Gs2: A comprehensive tool for the analysis and visualization of genome wide association studies

    Directory of Open Access Journals (Sweden)

    Barnes Michael R

    2008-03-01

    Full Text Available Abstract Background Genome wide association (GWA studies are now being widely undertaken aiming to find the link between genetic variations and common diseases. Ideally, a well-powered GWA study will involve the measurement of hundreds of thousands of single nucleotide polymorphisms (SNPs in thousands of individuals. The sheer volume of data generated by these experiments creates very high analytical demands. There are a number of important steps during the analysis of such data, many of which may present severe bottlenecks. The data need to be imported and reviewed to perform initial quality control (QC before proceeding to association testing. Evaluation of results may involve further statistical analysis, such as permutation testing, or further QC of associated markers, for example, reviewing raw genotyping intensities. Finally significant associations need to be prioritised using functional and biological interpretation methods, browsing available biological annotation, pathway information and patterns of linkage disequilibrium (LD. Results We have developed an interactive and user-friendly graphical application to be used in all steps in GWA projects from initial data QC and analysis to biological evaluation and validation of results. The program is implemented in Java and can be used on all platforms. Conclusion Very large data sets (e.g. 500 k markers and 5000 samples can be quality assessed, rapidly analysed and integrated with genomic sequence information. Candidate SNPs can be selected and functionally evaluated.

  17. Computational methods using genome-wide association studies to predict radiotherapy complications and to identify correlative molecular processes

    Science.gov (United States)

    Oh, Jung Hun; Kerns, Sarah; Ostrer, Harry; Powell, Simon N.; Rosenstein, Barry; Deasy, Joseph O.

    2017-02-01

    The biological cause of clinically observed variability of normal tissue damage following radiotherapy is poorly understood. We hypothesized that machine/statistical learning methods using single nucleotide polymorphism (SNP)-based genome-wide association studies (GWAS) would identify groups of patients of differing complication risk, and furthermore could be used to identify key biological sources of variability. We developed a novel learning algorithm, called pre-conditioned random forest regression (PRFR), to construct polygenic risk models using hundreds of SNPs, thereby capturing genomic features that confer small differential risk. Predictive models were trained and validated on a cohort of 368 prostate cancer patients for two post-radiotherapy clinical endpoints: late rectal bleeding and erectile dysfunction. The proposed method results in better predictive performance compared with existing computational methods. Gene ontology enrichment analysis and protein-protein interaction network analysis are used to identify key biological processes and proteins that were plausible based on other published studies. In conclusion, we confirm that novel machine learning methods can produce large predictive models (hundreds of SNPs), yielding clinically useful risk stratification models, as well as identifying important underlying biological processes in the radiation damage and tissue repair process. The methods are generally applicable to GWAS data and are not specific to radiotherapy endpoints.

  18. Genome-wide assessment for genetic variants associated with ventricular dysfunction after primary coronary artery bypass graft surgery.

    Directory of Open Access Journals (Sweden)

    Amanda A Fox

    Full Text Available BACKGROUND: Postoperative ventricular dysfunction (VnD occurs in 9-20% of coronary artery bypass graft (CABG surgical patients and is associated with increased postoperative morbidity and mortality. Understanding genetic causes of postoperative VnD should enhance patient risk stratification and improve treatment and prevention strategies. We aimed to determine if genetic variants associate with occurrence of in-hospital VnD after CABG surgery. METHODS: A genome-wide association study identified single nucleotide polymorphisms (SNPs associated with postoperative VnD in male subjects of European ancestry undergoing isolated primary CABG surgery with cardiopulmonary bypass. VnD was defined as the need for ≥2 inotropes or mechanical ventricular support after CABG surgery. Validated SNPs were assessed further in two replication CABG cohorts and meta-analysis was performed. RESULTS: Over 100 SNPs were associated with VnD (P2.1 of developing in-hospital VnD after CABG surgery. However, three genetic loci identified by meta-analysis were more modestly associated with development of postoperative VnD. Studies of larger cohorts to assess these loci as well as to define other genetic mechanisms and related biology that link genetic variants to postoperative ventricular dysfunction are warranted.

  19. A genome-wide association study of direct gestation length in US Holstein and Italian Brown populations.

    Science.gov (United States)

    Maltecca, C; Gray, K A; Weigel, K A; Cassady, J P; Ashwell, M

    2011-12-01

    Direct gestation length influences economically important traits in dairy cattle that are related to birth and peri-natal survival of the calf. The objective of this study was to identify single nucleotide polymorphisms (SNPs) that are significantly associated with direct gestation length through a genome-wide association study. Data used in the analysis included 7,308,194 cow gestation lengths from daughters of 4743 United States Holstein sires in the Cooperative Dairy DNA Repository population and 580,157 gestation lengths from 749 sires in the Italian Brown population. Association analysis included 36,768 and 35,082 SNPs spanning all autosomes for Holstein and Brown Swiss, respectively. Multiple shrinkage Bayesian was employed. Estimates of heritability for both populations were moderate, with values of 0.32 (±0.03) and 0.29 (±0.02) for Holstein and Brown Swiss, respectively. A panel of SNPs was identified, which included SNPs that have significant effects on direct gestation length, of which the strongest candidate region is located on chromosome 18. Two regions not previously linked to direct calving ease and calf survival were identified on chromosome 7 and 28, corresponding to regions that contain genes related to embryonic development and foetal development. SNPs were also identified in regions that have been previously mapped for calving difficulty and longevity. This study identifies target regions for the investigation of direct foetal effects, which are a significant factor in determining the ease of calving.

  20. Genome-wide association study identifies loci and candidate genes for meat quality traits in Simmental beef cattle.

    Science.gov (United States)

    Xia, Jiangwei; Qi, Xin; Wu, Yang; Zhu, Bo; Xu, Lingyang; Zhang, Lupei; Gao, Xue; Chen, Yan; Li, Junya; Gao, Huijiang

    2016-06-01

    Improving meat quality is the best way to enhance profitability and strengthen competitiveness in beef industry. Identification of genetic variants that control beef quality traits can help breeders design optimal breeding programs to achieve this goal. We carried out a genome-wide association study for meat quality traits in 1141 Simmental cattle using the Illumina Bovine HD 770K SNP array to identify the candidate genes and genomic regions associated with meat quality traits for beef cattle, including fat color, meat color, marbling score, longissimus muscle area, and shear force. In our study, we identified twenty significant single-nucleotide polymorphisms (SNPs) (p meat quality traits. Notably, we observed several SNPs were in or near eleven genes which have been reported previously, including TMEM236, SORL1, TRDN, S100A10, AP2S1, KCTD16, LOC506594, DHX15, LAMA4, PREX1, and BRINP3. We identified a haplotype block on BTA13 containing five significant SNPs associated with fat color trait. We also found one of 19 SNPs was associated with multiple traits (shear force and longissimus muscle area) on BTA7. Our results offer valuable insights to further explore the potential mechanism of meat quality traits in Simmental beef cattle.

  1. Genome-wide association study of positive emotion identifies a genetic variant and a role for microRNAs

    Science.gov (United States)

    Wingo, Aliza P.; Almli, Lynn M.; Stevens, Jennifer S.; Jovanovic, Tanja; Wingo, Thomas S.; Tharp, Gregory; Li, Yujing; Lori, Adriana; Briscione, Maria; Jin, Peng; Binder, Elisabeth B.; Bradley, Bekh; Gibson, Greg; Ressler, Kerry J.

    2016-01-01

    Positive affect denotes a state of pleasurable engagement with the environment eliciting positive emotion such as contentment, enthusiasm, or happiness. Positive affect is associated with favorable psychological, physical, and economic outcomes in many longitudinal studies. With a heritability of ≤64%, positive affect is substantially influenced by genetic factors; however, our understanding of genetic pathways underlying individual differences in positive affect is still limited. Here, through a genome-wide association study (GWAS) of positive affect in African American participants, we identify a single nucleotide polymorphism (SNP), rs322931, significantly associated with positive affect at pmiR-181b in human brain and blood, greater nucleus accumbens reactivity to positive emotional stimuli, and enhanced fear inhibition. Prior studies have suggested that miR-181a is part of the reward neurocircuitry. Taken together, we identify a novel genetic variant for further elucidation of genetic underpinning of positive affect that mediates positive emotionality potentially via the nucleus accumbens and miR-181. PMID:27595594

  2. Genome-wide association study of autistic-like traits in a general population study of young adults.

    Science.gov (United States)

    Jones, Rachel Maree; Cadby, Gemma; Melton, Phillip E; Abraham, Lawrence J; Whitehouse, Andrew J; Moses, Eric K

    2013-01-01

    Lay abstract: It has been proposed that autistic-like traits in the general population lie on a continuum, with clinical Autism Spectrum Disorder (ASD), representing the extreme end of this distribution. The current study undertook a genome-wide association (GWA) scan of 965 young Western Australian adults to identify novel risk variants associated with autistic-like traits. No associations reached genome-wide significance; however, a review of nominally associated single nucleotide polymorphisms (SNPs) indicated two positional candidate loci that have been previously implicated in autistic-like trait etiology. Scientific abstract: Research has proposed that autistic-like traits in the general population lie on a continuum, with clinical ASD representing the extreme end of this distribution. Inherent in this proposal is that biological mechanisms associated with clinical ASD may also underpin variation in autistic-like traits within the general population. A GWA study using 2,462,046 SNPs was undertaken for ASD in 965 individuals from the Western Australian Pregnancy Cohort (Raine) Study. No SNP associations reached genome-wide significance (p CBLN1. The rs198198 SNP (p = 9.587 × 10(-6)), is located within an intron of the protein kinase C, beta 1 (PRKCB1) gene on chromosome 16p11. The PRKCB1 gene has been previously reported in linkage and association studies for ASD, and its mRNA expression has been shown to be significantly down regulated in ASD cases compared with controls. The rs16946931 SNP (p = 1.78 × 10(-6)) is located in a region flanking the Cerebellin 1 (CBLN1) gene on chromosome 16q12.1. The CBLN1 gene is involved with synaptogenesis and is part of a gene family previously implicated in ASD. This GWA study is only the second to examine SNPs associated with autistic-like traits in the general population, and provides evidence to support roles for the PRKCB1 and CBLN1 genes in risk of clinical ASD.

  3. Genome-wide association of coagulation properties, curd firmness modeling, protein percentage, and acidity in milk from Brown Swiss cows.

    Science.gov (United States)

    Dadousis, C; Biffani, S; Cipolat-Gotet, C; Nicolazzi, E L; Rossoni, A; Santus, E; Bittante, G; Cecchinato, A

    2016-05-01

    Cheese production is increasing in many countries, and a desire toward genetic selection for milk coagulation properties in dairy cattle breeding exists. However, measurements of individual cheesemaking properties are hampered by high costs and labor, whereas traditional single-point milk coagulation properties (MCP) are sometimes criticized. Nevertheless, new modeling of the entire curd firmness and syneresis process (CFt equation) offers new insight into the cheesemaking process. Moreover, identification of genomic regions regulating milk cheesemaking properties might enhance direct selection of individuals in breeding programs based on cheese ability rather than related milk components. Therefore, the objective of this study was to perform genome-wide association studies to identify genomic regions linked to traditional MCP and new CFt parameters, milk acidity (pH), and milk protein percentage. Milk and DNA samples from 1,043 Italian Brown Swiss cows were used. Milk pH and 3 MCP traits were grouped together to represent the MCP set. Four CFt equation parameters, 2 derived traits, and protein percentage were considered as the second group of traits (CFt set). Animals were genotyped with the Illumina SNP50 BeadChip v.2 (Illumina Inc., San Diego, CA). Multitrait animal models were used to estimate variance components. For genome-wide association studies, the genome-wide association using mixed model and regression-genomic control approach was used. In total, 106 significant marker traits associations and 66 single nucleotide polymorphisms were identified on 12 chromosomes (1, 6, 9, 11, 13, 15, 16, 19, 20, 23, 26, and 28). Sharp peaks were detected at 84 to 88 Mbp on Bos taurus autosome (BTA) 6, with a peak at 87.4 Mbp in the region harboring the casein genes. Evidence of quantitative trait loci at 82.6 and 88.4 Mbp on the same chromosome was found. All chromosomes but BTA6, BTA11, and BTA28 were associated with only one trait. Only BTA6 was in common between MCP

  4. Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Rekaya R

    2016-11-01

    Full Text Available Romdhane Rekaya,1–3 Shannon Smith,4 El Hamidi Hay,5 Nourhene Farhat,6 Samuel E Aggrey3,7 1Department of Animal and Dairy Science, College of Agricultural and Environmental Sciences, 2Department of Statistics, Franklin College of Arts and Sciences, 3Institute of Bioinformatics, The University of Georgia, Athens, GA, 4Zoetis, Kalamazoo, MI, 5United States Department of Agriculture, Agricultural Research Service, Beltsville, MD, 6Carolinas HealthCare System Blue Ridge, Morganton, NC, 7Department of Poultry Science, College of Agricultural and Environmental Sciences, University of Georgia, Athens, GA, USA Abstract: Errors in the binary status of some response traits are frequent in human, animal, and plant applications. These error rates tend to differ between cases and controls because diagnostic and screening tests have different sensitivity and specificity. This increases the inaccuracies of classifying individuals into correct groups, giving rise to both false-positive and false-negative cases. The analysis of these noisy binary responses due to misclassification will undoubtedly reduce the statistical power of genome-wide association studies (GWAS. A threshold model that accommodates varying diagnostic errors between cases and controls was investigated. A simulation study was carried out where several binary data sets (case–control were generated with varying effects for the most influential single nucleotide polymorphisms (SNPs and different diagnostic error rate for cases and controls. Each simulated data set consisted of 2000 individuals. Ignoring misclassification resulted in biased estimates of true influential SNP effects and inflated estimates for true noninfluential markers. A substantial reduction in bias and increase in accuracy ranging from 12% to 32% was observed when the misclassification procedure was invoked. In fact, the majority of influential SNPs that were not identified using the noisy data were captured using the

  5. Genome-wide association study to identify the genetic determinants of otitis media susceptibility in childhood.

    Directory of Open Access Journals (Sweden)

    Marie S Rye

    Full Text Available BACKGROUND: Otitis media (OM is a common childhood disease characterised by middle ear inflammation and effusion. Susceptibility to recurrent acute OM (rAOM; ≥ 3 episodes of AOM in 6 months and chronic OM with effusion (COME; MEE ≥ 3 months is 40-70% heritable. Few underlying genes have been identified to date, and no genome-wide association study (GWAS of OM has been reported. METHODS AND FINDINGS: Data for 2,524,817 single nucleotide polymorphisms (SNPs; 535,544 quality-controlled SNPs genotyped by Illumina 660W-Quad; 1,989,273 by imputation were analysed for association with OM in 416 cases and 1,075 controls from the Western Australian Pregnancy Cohort (Raine Study. Logistic regression analyses under an additive model undertaken in GenABEL/ProbABEL adjusting for population substructure using principal components identified SNPs at CAPN14 (rs6755194: OR = 1.90; 95%CI 1.47-2.45; P(adj-PCA = 8.3 × 10(-7 on chromosome 2p23.1 as the top hit, with independent effects (rs1862981: OR = 1.60; 95%CI 1.29-1.99; P(adj-PCA = 2.2 × 10(-5 observed at the adjacent GALNT14 gene. In a gene-based analysis in VEGAS, BPIFA3 (P(Gene = 2 × 10(-5 and BPIFA1 (P(Gene = 1.07 × 10(-4 in the BPIFA gene cluster on chromosome 20q11.21 were the top hits. In all, 32 genomic regions show evidence of association (P(adj-PCA<10(-5 in this GWAS, with pathway analysis showing a connection between top candidates and the TGFβ pathway. However, top and tag-SNP analysis for seven selected candidate genes in this pathway did not replicate in 645 families (793 affected individuals from the Western Australian Family Study of Otitis Media (WAFSOM. Lack of replication may be explained by sample size, difference in OM disease severity between primary and replication cohorts or due to type I error in the primary GWAS. CONCLUSIONS: This first discovery GWAS for an OM phenotype has identified CAPN14 and GALNT14 on chromosome 2p23.1 and the BPIFA gene cluster on chromosome 20q11.21 as

  6. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa.

    Directory of Open Access Journals (Sweden)

    Parameswari Paul

    Full Text Available Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa. Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309. Chromosomal mapping of the B. rapa Aux/IAA (BrIAA genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA-BrIAA and 36 cross species (BrIAA-AtIAA IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in home developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we belive our study would provide a foundation in understanding the Aux/IAA genes in B. rapa.

  7. Genome-wide SNP identification in multiple morphotypes of allohexaploid tall fescue (Festuca arundinacea Schreb

    Directory of Open Access Journals (Sweden)

    Hand Melanie L

    2012-06-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs provide essential tools for the advancement of research in plant genomics, and the development of SNP resources for many species has been accelerated by the capabilities of second-generation sequencing technologies. The current study aimed to develop and use a novel bioinformatic pipeline to generate a comprehensive collection of SNP markers within the agriculturally important pasture grass tall fescue; an outbreeding allopolyploid species displaying three distinct morphotypes: Continental, Mediterranean and rhizomatous. Results A bioinformatic pipeline was developed that successfully identified SNPs within genotypes from distinct tall fescue morphotypes, following the sequencing of 414 polymerase chain reaction (PCR – generated amplicons using 454 GS FLX technology. Equivalent amplicon sets were derived from representative genotypes of each morphotype, including six Continental, five Mediterranean and one rhizomatous. A total of 8,584 and 2,292 SNPs were identified with high confidence within the Continental and Mediterranean morphotypes respectively. The success of the bioinformatic approach was demonstrated through validation (at a rate of 70% of a subset of 141 SNPs using both SNaPshot™ and GoldenGate™ assay chemistries. Furthermore, the quantitative genotyping capability of the GoldenGate™ assay revealed that approximately 30% of the putative SNPs were accessible to co-dominant scoring, despite the hexaploid genome structure. The sub-genome-specific origin of each SNP validated from Continental tall fescue was predicted using a phylogenetic approach based on comparison with orthologous sequences from predicted progenitor species. Conclusions Using the appropriate bioinformatic approach, amplicon resequencing based on 454 GS FLX technology is an effective method for the identification of polymorphic SNPs within the genomes of Continental and Mediterranean tall fescue. The

  8. Functional Characterization of Single-Nucleotide Polymorphisms in the Human Undifferentiated Embryonic-Cell Transcription Factor 1 Gene

    NARCIS (Netherlands)

    Thummer, Rajkumar P.; Drenth-Diephuis, Loes J.; Carney, Karen E.; Eggen, Bart J. L.

    2010-01-01

    Single-nucleotide polymorphisms (SNPs) are single-nucleotide sequence variations between individuals. Two missense SNPs are present in the human undifferentiated embryonic-cell transcription factor 1 (UTF1) gene and their consequences for UTF1 function are investigated in this study. Expression of t

  9. IL-18 single nucleotide polymorphisms in hematologic malignancies with HLA matched sibling donor allogeneic hematopoietic stem cell transplantation

    Institute of Scientific and Technical Information of China (English)

    蔡小矜

    2014-01-01

    Objective To explore the impact of interleukin-18(IL-18)single nucleotide polymorphisms on outcomes of hematologic malignancies with HLA-matched sibling donor hematopoietic stem cell transplantation(allo-HSCT).Methods Single-nucleotide polymorphisms in IL-18 promoter was detected by PCR-sequence-specific primer analysis(PCR-SSP)in 93 recipients and their HLA matched sibling donors.Hematopoietic reconstitution,

  10. A genome wide association study for backfat thickness in Italian Large White pigs highlights new regions affecting fat deposition including neuronal genes

    Directory of Open Access Journals (Sweden)

    Fontanesi Luca

    2012-11-01

    Full Text Available Abstract Background Carcass fatness is an important trait in most pig breeding programs. Following market requests, breeding plans for fresh pork consumption are usually designed to reduce carcass fat content and increase lean meat deposition. However, the Italian pig industry is mainly devoted to the production of Protected Designation of Origin dry cured hams: pigs are slaughtered at around 160 kg of live weight and the breeding goal aims at maintaining fat coverage, measured as backfat thickness to avoid excessive desiccation of the hams. This objective has shaped the genetic pool of Italian heavy pig breeds for a few decades. In this study we applied a selective genotyping approach within a population of ~ 12,000 performance tested Italian Large White pigs. Within this population, we selectively genotyped 304 pigs with extreme and divergent backfat thickness estimated breeding value by the Illumina PorcineSNP60 BeadChip and performed a genome wide association study to identify loci associated to this trait. Results We identified 4 single nucleotide polymorphisms with P≤5.0E-07 and additional 119 ones with 5.0E-07 Conclusions Further investigations are needed to evaluate the effects of the identified single nucleotide polymorphisms associated with backfat thickness on other traits as a pre-requisite for practical applications in breeding programs. Reported results could improve our understanding of the biology of fat metabolism and deposition that could also be relevant for other mammalian species including humans, confirming the role of neuronal genes on obesity.

  11. Semantically enabling a genome-wide association study database

    Directory of Open Access Journals (Sweden)

    Beck Tim

    2012-12-01

    Full Text Available Abstract Background The amount of data generated from genome-wide association studies (GWAS has grown rapidly, but considerations for GWAS phenotype data reuse and interchange have not kept pace. This impacts on the work of GWAS Central – a free and open access resource for the advanced querying and comparison of summary-level genetic association data. The benefits of employing ontologies for standardising and structuring data are widely accepted. The complex spectrum of observed human phenotypes (and traits, and the requirement for cross-species phenotype comparisons, calls for reflection on the most appropriate solution for the organisation of human phenotype data. The Semantic Web provides standards for the possibility of further integration of GWAS data and the ability to contribute to the web of Linked Data. Results A pragmatic consideration when applying phenotype ontologies to GWAS data is the ability to retrieve all data, at the most granular level possible, from querying a single ontology graph. We found the Medical Subject Headings (MeSH terminology suitable for describing all traits (diseases and medical signs and symptoms at various levels of granularity and the Human Phenotype Ontology (HPO most suitable for describing phenotypic abnormalities (medical signs and symptoms at the most granular level. Diseases within MeSH are mapped to HPO to infer the phenotypic abnormalities associated with diseases. Building on the rich semantic phenotype annotation layer, we are able to make cross-species phenotype comparisons and publish a core subset of GWAS data as RDF nanopublications. Conclusions We present a methodology for applying phenotype annotations to a comprehensive genome-wide association dataset and for ensuring compatibility with the Semantic Web. The annotations are used to assist with cross-species genotype and phenotype comparisons. However, further processing and deconstructions of terms may be required to facilitate automatic

  12. Machine Learning Techniques for Single Nucleotide Polymorphism—Disease Classification Models in Schizophrenia

    Directory of Open Access Journals (Sweden)

    Cristian R. Munteanu

    2010-07-01

    Full Text Available Single nucleotide polymorphisms (SNPs can be used as inputs in disease computational studies such as pattern searching and classification models. Schizophrenia is an example of a complex disease with an important social impact. The multiple causes of this disease create the need of new genetic or proteomic patterns that can diagnose patients using biological information. This work presents a computational study of disease machine learning classification models using only single nucleotide polymorphisms at the HTR2A and DRD3 genes from Galician (Northwest Spain schizophrenic patients. These classification models establish for the first time, to the best knowledge of the authors, a relationship between the sequence of the nucleic acid molecule and schizophrenia (Quantitative Genotype – Disease Relationships that can automatically recognize schizophrenia DNA sequences and correctly classify between 78.3–93.8% of schizophrenia subjects when using datasets which include simulated negative subjects and a linear artificial neural network.

  13. Genome-wide sequence variations among Mycobacterium avium subspecies paratuberculosis.

    Directory of Open Access Journals (Sweden)

    Chung-Yi eHsu

    2011-12-01

    Full Text Available Mycobacterium avium subspecies paratuberculosis (M. ap, the causative agent of Johne’s disease (JD, infects many farmed ruminants, wildlife animals and humans. To better understand the molecular pathogenesis of these infections, we analyzed the whole genome sequences of several M. ap and M. avium subspecies avium (M. avium strains isolated from various hosts and environments. Using Next-generation sequencing technology, all 6 M. ap isolates showed a high percentage of homology (98% to the reference genome sequence of M. ap K-10 isolated from cattle. However, 2 M. avium isolates (DT 78 and Env 77 showed significant sequence diversity from the reference strain M. avium 104. The genomes of M. avium isolates DT 78 and Env 77 exhibited only 87% and 40% homology, respectively, to the M. avium 104 reference genome. Within the M. ap isolates, genomic rearrangements (insertions/deletions, Indels were not detected, and only unique single nucleotide polymorphisms (SNPs were observed among the 6 M. ap strains. While most of the SNPs (~100 in M. ap genomes were non-synonymous, a total of ~ 6000 SNPs were detected among M. avium genomes, most of them were synonymous suggesting a differential selective pressure between M. ap and M. avium isolates. In addition, SNPs-based phylo-genomic analysis showed that isolates from goat and Oryx are closely related to the cattle (K-10 strain while the human isolate (M. ap 4B is closely related to the environmental strains, indicating environmental source to human infections. Overall, SNPs were the most common variations among M. ap isolates while SNPs in addition to Indels were prevalent among M. avium isolates. Genomic variations will be useful in designing host-specific markers for the analysis of mycobacterial evolution and for developing novel diagnostics directed against Johne’s disease in animals.

  14. Efficient single nucleotide polymorphism discovery in laboratory rat strains using wild rat-derived SNP candidates

    OpenAIRE

    Hedrich Hans J; Wedekind Dirk; Zeegers Dimphy; Guryev Victor; Smits Bart MG; Cuppen Edwin

    2005-01-01

    Abstract Background The laboratory rat (Rattus norvegicus) is an important model for studying many aspects of human health and disease. Detailed knowledge on genetic variation between strains is important from a biomedical, particularly pharmacogenetic point of view and useful for marker selection for genetic cloning and association studies. Results We show that Single Nucleotide Polymorphisms (SNPs) in commonly used rat strains are surprisingly well represented in wild rat isolates. Shotgun ...

  15. High-Resolution Mapping of Structural Mutations in Prostate Cancer with Single Nucleotide Polymorphism Arrays

    Science.gov (United States)

    2006-11-01

    recurrent phyllodes tumor and fibroa- denoma of breast using single nucleotide polymorphism arrays. Breast Cancer Res Treat 2006; 97:301–309. 21...neutral LOH). Interestingly, copy-neutral LOH, which is undetectable by conventional CGH methods, represents up to 80% of LOH events in some tumor ...the notion that LOH represents a key mechanism for tumor suppressor inactivation. Indeed, nearly all common tumor suppressor genes occur in regions

  16. Mining of haplotype-based expressed sequence tag single nucleotide polymorphisms in citrus

    OpenAIRE

    Chen, Chunxian; Gmitter Jr, Fred G

    2013-01-01

    Background Single nucleotide polymorphisms (SNPs), the most abundant variations in a genome, have been widely used in various studies. Detection and characterization of citrus haplotype-based expressed sequence tag (EST) SNPs will greatly facilitate further utilization of these gene-based resources. Results In this paper, haplotype-based SNPs were mined out of publicly available citrus expressed sequence tags (ESTs) from different citrus cultivars (genotypes) individually and collectively for...

  17. Single Nucleotide Polymorphism Microarray Analysis in Cortisol-Secreting Adrenocortical Adenomas Identifies New Candidate Genes and Pathways

    Directory of Open Access Journals (Sweden)

    Cristina L. Ronchi

    2012-03-01

    Full Text Available The genetic mechanisms underlying adrenocortical tumor development are still largely unknown. We used high-resolution single nucleotide polymorphism microarrays (Affymetrix SNP 6.0 to detect copy number alterations (CNAs and copy-neutral losses of heterozygosity (cnLOH in 15 cortisol-secreting adrenocortical adenomas with matched blood samples. We focused on microalterations aiming to discover new candidate genes involved in early tumorigenesis and/or autonomous cortisol secretion. We identified 962 CNAs with a median of 18 CNAs per sample. Half of them involved noncoding regions, 89% were less than 100 kb, and 28% were found in at least two samples. The most frequently gained regions were 5p15.33, 6q16.1, 7p22.3-22.2, 8q24.3, 9q34.2-34.3, 11p15.5, 11q11, 12q12, 16q24.3, 20p11.1-20q21.11, and Xq28 (≥20% of cases, most of them being identified in the same three adenomas. These regions contained among others genes like NOTCH1, CYP11B2, HRAS, and IGF2. Recurrent losses were less common and smaller than gains, being mostly localized at 1p, 6q, and 11q. Pathway analysis revealed that Notch signaling was the most frequently altered. We identified 46 recurrent CNAs that each affected a single gene (31 gains and 15 losses, including genes involved in steroidogenesis (CYP11B1 or tumorigenesis (CTNNB1, EPHA7, SGK1, STIL, FHIT. Finally, 20 small cnLOH in four cases affecting 15 known genes were found. Our findings provide the first high-resolution genome-wide view of chromosomal changes in cortisol-secreting adenomas and identify novel candidate genes, such as HRAS, EPHA7, and SGK1. Furthermore, they implicate that the Notch1 signaling pathway might be involved in the molecular pathogenesis of adrenocortical tumors.

  18. Genetic analysis of the cardiac methylome at single nucleotide resolution in a model of human cardiovascular disease.

    Directory of Open Access Journals (Sweden)

    Michelle D Johnson

    2014-12-01

    Full Text Available Epigenetic marks such as cytosine methylation are important determinants of cellular and whole-body phenotypes. However, the extent of, and reasons for inter-individual differences in cytosine methylation, and their association with phenotypic variation are poorly characterised. Here we present the first genome-wide study of cytosine methylation at single-nucleotide resolution in an animal model of human disease. We used whole-genome bisulfite sequencing in the spontaneously hypertensive rat (SHR, a model of cardiovascular disease, and the Brown Norway (BN control strain, to define the genetic architecture of cytosine methylation in the mammalian heart and to test for association between methylation and pathophysiological phenotypes. Analysis of 10.6 million CpG dinucleotides identified 77,088 CpGs that were differentially methylated between the strains. In F1 hybrids we found 38,152 CpGs showing allele-specific methylation and 145 regions with parent-of-origin effects on methylation. Cis-linkage explained almost 60% of inter-strain variation in methylation at a subset of loci tested for linkage in a panel of recombinant inbred (RI strains. Methylation analysis in isolated cardiomyocytes showed that in the majority of cases methylation differences in cardiomyocytes and non-cardiomyocytes were strain-dependent, confirming a strong genetic component for cytosine methylation. We observed preferential nucleotide usage associated with increased and decreased methylation that is remarkably conserved across species, suggesting a common mechanism for germline control of inter-individual variation in CpG methylation. In the RI strain panel, we found significant correlation of CpG methylation and levels of serum chromogranin B (CgB, a proposed biomarker of heart failure, which is evidence for a link between germline DNA sequence variation, CpG methylation differences and pathophysiological phenotypes in the SHR strain. Together, these results will

  19. Integrative analysis of single nucleotide polymorphisms and gene expression efficiently distinguishes samples from closely related ethnic populations

    Directory of Open Access Journals (Sweden)

    Yang Hsin-Chou

    2012-07-01

    Full Text Available Abstract Background Ancestry informative markers (AIMs are a type of genetic marker that is informative for tracing the ancestral ethnicity of individuals. Application of AIMs has gained substantial attention in population genetics, forensic sciences, and medical genetics. Single nucleotide polymorphisms (SNPs, the materials of AIMs, are useful for classifying individuals from distinct continental origins but cannot discriminate individuals with subtle genetic differences from closely related ancestral lineages. Proof-of-principle studies have shown that gene expression (GE also is a heritable human variation that exhibits differential intensity distributions among ethnic groups. GE supplies ethnic information supplemental to SNPs; this motivated us to integrate SNP and GE markers to construct AIM panels with a reduced number of required markers and provide high accuracy in ancestry inference. Few studies in the literature have considered GE in this aspect, and none have integrated SNP and GE markers to aid classification of samples from closely related ethnic populations. Results We integrated a forward variable selection procedure into flexible discriminant analysis to identify key SNP and/or GE markers with the highest cross-validation prediction accuracy. By analyzing genome-wide SNP and/or GE markers in 210 independent samples from four ethnic groups in the HapMap II Project, we found that average testing accuracies for a majority of classification analyses were quite high, except for SNP-only analyses that were performed to discern study samples containing individuals from two close Asian populations. The average testing accuracies ranged from 0.53 to 0.79 for SNP-only analyses and increased to around 0.90 when GE markers were integrated together with SNP markers for the classification of samples from closely related Asian populations. Compared to GE-only analyses, integrative analyses of SNP and GE markers showed comparable testing

  20. Association of Nitric Oxide Synthase and Matrix Metalloprotease Single Nucleotide Polymorphisms with Preeclampsia and Its Complications.

    Directory of Open Access Journals (Sweden)

    Daniela P Leonardo

    Full Text Available Preeclampsia is one of the leading causes of maternal and neonatal morbidity and mortality in the world, but its appearance is still unpredictable and its pathophysiology has not been entirely elucidated. Genetic studies have associated single nucleotide polymorphisms in genes encoding nitric oxide synthase and matrix metalloproteases with preeclampsia, but the results are largely inconclusive across different populations.To investigate the association of single nucleotide polymorphisms (SNPs in NOS3 (G894T, T-786C, and a variable number of tandem repetitions VNTR in intron 4, MMP2 (C-1306T, and MMP9 (C-1562T genes with preeclampsia in patients from Southeastern Brazil.This prospective case-control study enrolled 77 women with preeclampsia and 266 control pregnant women. Clinical data were collected to assess risk factors and the presence of severe complications, such as eclampsia and HELLP (hemolysis, elevated liver enzymes, and low platelets syndrome.We found a significant association between the single nucleotide polymorphism NOS3 T-786C and preeclampsia, independently from age, height, weight, or the other SNPs studied, and no association was found with the other polymorphisms. Age and history of preeclampsia were also identified as risk factors. The presence of at least one polymorphic allele for NOS3 T-786C was also associated with the occurrence of eclampsia or HELLP syndrome among preeclamptic women.Our data support that the NOS3 T-786C SNP is associated with preeclampsia and the severity of its complications.

  1. Genome-wide transcriptome analysis of 150 cell samples†

    Science.gov (United States)

    Russom, Aman; Xiao, Wenzhong; Wilhelmy, Julie; Wang, Shenglong; Heath, Joe Don; Kurn, Nurith; Tompkins, Ronald G.; Davis, Ronald W.; Toner, Mehmet

    2013-01-01

    A major challenge in molecular biology is interrogating the human transcriptome on a genome wide scale when only a limited amount of biological sample is available for analysis. Current methodologies using microarray technologies for simultaneously monitoring mRNA transcription levels require nanogram amounts of total RNA. To overcome the sample size limitation of current technologies, we have developed a method to probe the global gene expression in biological samples as small as 150 cells, or the equivalent of approximately 300 pg total RNA. The new method employs microfluidic devices for the purification of total RNA from mammalian cells and ultra-sensitive whole transcriptome amplification techniques. We verified that the RNA integrity is preserved through the isolation process, accomplished highly reproducible whole transcriptome analysis, and established high correlation between repeated isolations of 150 cells and the same cell culture sample. We validated the technology by demonstrating that the combined microfluidic and amplification protocol is capable of identifying biological pathways perturbed by stimulation, which are consistent with the information recognized in bulk-isolated samples. PMID:20023796

  2. Genome-wide transcriptome analysis of 150 cell samples.

    Science.gov (United States)

    Irimia, Daniel; Mindrinos, Michael; Russom, Aman; Xiao, Wenzhong; Wilhelmy, Julie; Wang, Shenglong; Heath, Joe Don; Kurn, Nurith; Tompkins, Ronald G; Davis, Ronald W; Toner, Mehmet

    2009-01-01

    A major challenge in molecular biology is interrogating the human transcriptome on a genome wide scale when only a limited amount of biological sample is available for analysis. Current methodologies using microarray technologies for simultaneously monitoring mRNA transcription levels require nanogram amounts of total RNA. To overcome the sample size limitation of current technologies, we have developed a method to probe the global gene expression in biological samples as small as 150 cells, or the equivalent of approximately 300 pg total RNA. The new method employs microfluidic devices for the purification of total RNA from mammalian cells and ultra-sensitive whole transcriptome amplification techniques. We verified that the RNA integrity is preserved through the isolation process, accomplished highly reproducible whole transcriptome analysis, and established high correlation between repeated isolations of 150 cells and the same cell culture sample. We validated the technology by demonstrating that the combined microfluidic and amplification protocol is capable of identifying biological pathways perturbed by stimulation, which are consistent with the information recognized in bulk-isolated samples.

  3. Genome-wide studies of telomere biology in budding yeast

    Directory of Open Access Journals (Sweden)

    Yaniv Harari

    2014-03-01

    Full Text Available Telomeres are specialized DNA-protein structures at the ends of eukaryotic chromosomes. Telomeres are essential for chromosomal stability and integrity, as they prevent chromosome ends from being recognized as double strand breaks. In rapidly proliferating cells, telomeric DNA is synthesized by the enzyme telomerase, which copies a short template sequence within its own RNA moiety, thus helping to solve the “end-replication problem”, in which information is lost at the ends of chromosomes with each DNA replication cycle. The basic mechanisms of telomere length, structure and function maintenance are conserved among eukaryotes. Studies in the yeast Saccharomyces cerevisiae have been instrumental in deciphering the basic aspects of telomere biology. In the last decade, technical advances, such as the availability of mutant collections, have allowed carrying out systematic genome-wide screens for mutants affecting various aspects of telomere biology. In this review we summarize these efforts, and the insights that this Systems Biology approach has produced so far.

  4. Genome-Wide Analysis of Human Metapneumovirus Evolution

    Science.gov (United States)

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most of school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs. PMID:27046055

  5. Genome-Wide Analysis of DNA Methylation in Human Amnion

    Science.gov (United States)

    Kim, Jinsil; Pitlick, Mitchell M.; Christine, Paul J.; Schaefer, Amanda R.; Saleme, Cesar; Comas, Belén; Cosentino, Viviana; Gadow, Enrique; Murray, Jeffrey C.

    2013-01-01

    The amnion is a specialized tissue in contact with the amniotic fluid, which is in a constantly changing state. To investigate the importance of epigenetic events in this tissue in the physiology and pathophysiology of pregnancy, we performed genome-wide DNA methylation profiling of human amnion from term (with and without labor) and preterm deliveries. Using the Illumina Infinium HumanMethylation27 BeadChip, we identified genes exhibiting differential methylation associated with normal labor and preterm birth. Functional analysis of the differentially methylated genes revealed biologically relevant enriched gene sets. Bisulfite sequencing analysis of the promoter region of the oxytocin receptor (OXTR) gene detected two CpG dinucleotides showing significant methylation differences among the three groups of samples. Hypermethylation of the CpG island of the solute carrier family 30 member 3 (SLC30A3) gene in preterm amnion was confirmed by methylation-specific PCR. This work provides preliminary evidence that DNA methylation changes in the amnion may be at least partially involved in the physiological process of labor and the etiology of preterm birth and suggests that DNA methylation profiles, in combination with other biological data, may provide valuable insight into the mechanisms underlying normal and pathological pregnancies. PMID:23533356

  6. A genome wide dosage suppressor network reveals genomic robustness

    Science.gov (United States)

    Patra, Biranchi; Kon, Yoshiko; Yadav, Gitanjali; Sevold, Anthony W.; Frumkin, Jesse P.; Vallabhajosyula, Ravishankar R.; Hintze, Arend; Østman, Bjørn; Schossau, Jory; Bhan, Ashish; Marzolf, Bruz; Tamashiro, Jenna K.; Kaur, Amardeep; Baliga, Nitin S.; Grayhack, Elizabeth J.; Adami, Christoph; Galas, David J.; Raval, Alpan; Phizicky, Eric M.; Ray, Animesh

    2017-01-01

    Genomic robustness is the extent to which an organism has evolved to withstand the effects of deleterious mutations. We explored the extent of genomic robustness in budding yeast by genome wide dosage suppressor analysis of 53 conditional lethal mutations in cell division cycle and RNA synthesis related genes, revealing 660 suppressor interactions of which 642 are novel. This collection has several distinctive features, including high co-occurrence of mutant-suppressor pairs within protein modules, highly correlated functions between the pairs and higher diversity of functions among the co-suppressors than previously observed. Dosage suppression of essential genes encoding RNA polymerase subunits and chromosome cohesion complex suggests a surprising degree of functional plasticity of macromolecular complexes, and the existence of numerous degenerate pathways for circumventing the effects of potentially lethal mutations. These results imply that organisms and cancer are likely able to exploit the genomic robustness properties, due the persistence of cryptic gene and pathway functions, to generate variation and adapt to selective pressures. PMID:27899637

  7. Genome-wide search for strabismus susceptibility loci.

    Directory of Open Access Journals (Sweden)

    Fujiwara H

    2003-06-01

    Full Text Available The purpose of this study was to search for chromosomal susceptibility loci for comitant strabismus. Genomic DNA was isolated from 10mL blood taken from each member of 30 nuclear families in which 2 or more siblings are affected by either esotropia or exotropia. A genome-wide search was performed with amplification by polymerase chain reaction of 400 markers in microsatellite regions with approximately 10 cM resolution. For each locus, non-parametric affected sib-pair analysis and non-parametric linkage analysis for multiple pedigrees (Genehunter software, http://linkage.rockefeller.edu/soft/ were used to calculate multipoint lod scores and non-parametric linkage (NPL scores, respectively. In sib-pair analysis, lod scores showed basically flat lines with several peaks of 0.25 on all chromosomes. In non-parametric linkage analysis for multiple pedigrees, NPL scores showed one peak as high as 1.34 on chromosomes 1, 2, 4, 7, 10, 15, and 16, while 2 such peaks were found on chromosomes 3, 9, 11, 12, 18, and 20. Non-parametric linkage analysis for multiple pedigrees of 30 families with comitant strabismus suggested a number of chromosomal susceptibility loci. Our ongoing study involving a larger number of families will refine the accuracy of statistical analysis to pinpoint susceptibility loci for comitant strabismus.

  8. Genome-wide association study of proneness to anger.

    Directory of Open Access Journals (Sweden)

    Eric Mick

    Full Text Available BACKGROUND: Community samples suggest that approximately 1 in 20 children and adults exhibit clinically significant anger, hostility, and aggression. Individuals with dysregulated emotional control have a greater lifetime burden of psychiatric morbidity, severe impairment in role functioning, and premature mortality due to cardiovascular disease. METHODS: With publically available data secured from dbGaP, we conducted a genome-wide association study of proneness to anger using the Spielberger State-Trait Anger Scale in the Atherosclerosis Risk in Communities (ARIC study (n = 8,747. RESULTS: Subjects were, on average, 54 (range 45-64 years old at baseline enrollment, 47% (n = 4,117 were male, and all were of European descent by self-report. The mean Angry Temperament and Angry Reaction scores were 5.8 ± 1.8 and 7.6 ± 2.2. We observed a nominally significant finding (p = 2.9E-08, λ = 1.027 - corrected pgc = 2.2E-07, λ = 1.0015 on chromosome 6q21 in the gene coding for the non-receptor protein-tyrosine kinase, Fyn. CONCLUSIONS: Fyn interacts with NDMA receptors and inositol-1,4,5-trisphosphate (IP3-gated channels to regulate calcium influx and intracellular release in the post-synaptic density. These results suggest that signaling pathways regulating intracellular calcium homeostasis, which are relevant to memory, learning, and neuronal survival, may in part underlie the expression of Angry Temperament.

  9. Insights into kidney diseases from genome-wide association studies.

    Science.gov (United States)

    Wuttke, Matthias; Köttgen, Anna

    2016-09-01

    Over the past decade, genome-wide association studies (GWAS) have considerably improved our understanding of the genetic basis of kidney function and disease. Population-based studies, used to investigate traits that define chronic kidney disease (CKD), have identified >50 genomic regions in which common genetic variants associate with estimated glomerular filtration rate or urinary albumin-to-creatinine ratio. Case-control studies, used to study specific CKD aetiologies, have yielded risk loci for specific kidney diseases such as IgA nephropathy and membranous nephropathy. In this Review, we summarize important findings from GWAS and clinical and experimental follow-up studies. We also compare risk allele frequency, effect sizes, and specificity in GWAS of CKD-defining traits and GWAS of specific CKD aetiologies and the implications for study design. Genomic regions identified in GWAS of CKD-defining traits can contain causal genes for monogenic kidney diseases. Population-based research on kidney function traits can therefore generate insights into more severe forms of kidney diseases. Experimental follow-up studies have begun to identify causal genes and variants, which are potential therapeutic targets, and suggest mechanisms underlying the high allele frequency of causal variants. GWAS are thus a useful approach to advance knowledge in nephrology.

  10. Genome-wide analyses of small noncoding RNAs in streptococci

    Directory of Open Access Journals (Sweden)

    Nadja ePatenge

    2015-05-01

    Full Text Available Streptococci represent a diverse group of Gram-positive bacteria, which colonize a wide range of hosts among animals and humans. Streptococcal species occur as commensal as well as pathogenic organisms. Many of the pathogenic species can cause severe, invasive infections in their hosts leading to a high morbidity and mortality. The consequence is a tremendous suffering on the part of men and livestock besides the significant financial burden in the agricultural and healthcare sectors. An environmentally stimulated and tightly controlled expression of virulence factor genes is of fundamental importance for streptococcal pathogenicity. Bacterial small noncoding RNAs (sRNAs modulate the expression of genes involved in stress response, sugar metabolism, surface composition, and other properties that are related to bacterial virulence. Even though the regulatory character is shared by this class of RNAs, variation on the molecular level results in a high diversity of functional mechanisms. The knowledge about the role of sRNAs in streptococci is still limited, but in recent years, genome-wide screens for sRNAs have been conducted in an increasing number of species. Bioinformatics prediction approaches have been employed as well as expression analyses by classical array techniques or next generation sequencing. This review will give an overview of whole genome screens for sRNAs in streptococci with a focus on describing the different methods and comparing their outcome considering sRNA conservation among species, functional similarities, and relevance for streptococcal infection.

  11. Identification of differential translation in genome wide studies.

    Science.gov (United States)

    Larsson, Ola; Sonenberg, Nahum; Nadon, Robert

    2010-12-14

    Regulation of gene expression through translational control is a fundamental mechanism implicated in many biological processes ranging from memory formation to innate immunity and whose dysregulation contributes to human diseases. Genome wide analyses of translational control strive to identify differential translation independent of cytosolic mRNA levels. For this reason, most studies measure genes' translation levels as log ratios (translation levels divided by corresponding cytosolic mRNA levels obtained in parallel). Counterintuitively, arising from a mathematical necessity, these log ratios tend to be highly correlated with the cytosolic mRNA levels. Accordingly, they do not effectively correct for cytosolic mRNA level and generate substantial numbers of biological false positives and false negatives. We show that analysis of partial variance, which produces estimates of translational activity that are independent of cytosolic mRNA levels, is a superior alternative. When combined with a variance shrinkage method for estimating error variance, analysis of partial variance has the additional benefit of having greater statistical power and identifying fewer genes as translationally regulated resulting merely from unrealistically low variance estimates rather than from large changes in translational activity. In contrast to log ratios, this formal analytical approach estimates translation effects in a statistically rigorous manner, eliminates the need for inefficient and error-prone heuristics, and produces results that agree with biological function. The method is applicable to datasets obtained from both the commonly used polysome microarray method and the sequencing-based ribosome profiling method.

  12. Genome-Wide Analysis of DNA Methylation in Human Amnion

    Directory of Open Access Journals (Sweden)

    Jinsil Kim

    2013-01-01

    Full Text Available The amnion is a specialized tissue in contact with the amniotic fluid, which is in a constantly changing state. To investigate the importance of epigenetic events in this tissue in the physiology and pathophysiology of pregnancy, we performed genome-wide DNA methylation profiling of human amnion from term (with and without labor and preterm deliveries. Using the Illumina Infinium HumanMethylation27 BeadChip, we identified genes exhibiting differential methylation associated with normal labor and preterm birth. Functional analysis of the differentially methylated genes revealed biologically relevant enriched gene sets. Bisulfite sequencing analysis of the promoter region of the oxytocin receptor (OXTR gene detected two CpG dinucleotides showing significant methylation differences among the three groups of samples. Hypermethylation of the CpG island of the solute carrier family 30 member 3 (SLC30A3 gene in preterm amnion was confirmed by methylation-specific PCR. This work provides preliminary evidence that DNA methylation changes in the amnion may be at least partially involved in the physiological process of labor and the etiology of preterm birth and suggests that DNA methylation profiles, in combination with other biological data, may provide valuable insight into the mechanisms underlying normal and pathological pregnancies.

  13. Reducing dimensionality for prediction of genome-wide breeding values

    Directory of Open Access Journals (Sweden)

    Woolliams John A

    2009-03-01

    Full Text Available Abstract Partial least square regression (PLSR and principal component regression (PCR are methods designed for situations where the number of predictors is larger than the number of records. The aim was to compare the accuracy of genome-wide breeding values (EBV produced using PLSR and PCR with a Bayesian method, 'BayesB'. Marker densities of 1, 2, 4 and 8 Ne markers/Morgan were evaluated when the effective population size (Ne was 100. The correlation between true breeding value and estimated breeding value increased with density from 0.611 to 0.681 and 0.604 to 0.658 using PLSR and PCR respectively, with an overall advantage to PLSR of 0.016 (s.e = 0.008. Both methods gave a lower accuracy compared to the 'BayesB', for which accuracy increased from 0.690 to 0.860. PLSR and PCR appeared less responsive to increased marker density with the advantage of 'BayesB' increasing by 17% from a marker density of 1 to 8Ne/M. PCR and PLSR showed greater bias than 'BayesB' in predicting breeding values at all densities. Although, the PLSR and PCR were computationally faster and simpler, these advantages do not outweigh the reduction in accuracy, and there is a benefit in obtaining relevant prior information from the distribution of gene effects.

  14. Genome-wide identification of KANADI1 target genes.

    Directory of Open Access Journals (Sweden)

    Paz Merelo

    Full Text Available Plant organ development and polarity establishment is mediated by the action of several transcription factors. Among these, the KANADI (KAN subclade of the GARP protein family plays important roles in polarity-associated processes during embryo, shoot and root patterning. In this study, we have identified a set of potential direct target genes of KAN1 through a combination of chromatin immunoprecipitation/DNA sequencing (ChIP-Seq and genome-wide transcriptional profiling using tiling arrays. Target genes are over-represented for genes involved in the regulation of organ development as well as in the response to auxin. KAN1 affects directly the expression of several genes previously shown to be important in the establishment of polarity during lateral organ and vascular tissue development. We also show that KAN1 controls through its target genes auxin effects on organ development at different levels: transport and its regulation, and signaling. In addition, KAN1 regulates genes involved in the response to abscisic acid, jasmonic acid, brassinosteroids, ethylene, cytokinins and gibberellins. The role of KAN1 in organ polarity is antagonized by HD-ZIPIII transcription factors, including REVOLUTA (REV. A comparison of their target genes reveals that the REV/KAN1 module acts in organ patterning through opposite regulation of shared targets. Evidence of mutual repression between closely related family members is also shown.

  15. Genome-Wide Analysis of Human MicroRNA Stability

    Directory of Open Access Journals (Sweden)

    Yang Li

    2013-01-01

    Full Text Available Increasing studies have shown that microRNA (miRNA stability plays important roles in physiology. However, the global picture of miRNA stability remains largely unknown. Here, we had analyzed genome-wide miRNA stability across 10 diverse cell types using miRNA arrays. We found that miRNA stability shows high dynamics and diversity both within individual cells and across cell types. Strikingly, we observed a negative correlation between miRNA stability and miRNA expression level, which is different from current findings on other biological molecules such as proteins and mRNAs that show positive and not negative correlations between stability and expression level. This finding indicates that miRNA has a distinct action mode, which we called “rapid production, rapid turnover; slow production, slow turnover.” This mode further suggests that high expression miRNAs normally degrade fast and may endow the cell with special properties that facilitate cellular status-transition. Moreover, we revealed that the stability of miRNAs is affected by cohorts of factors that include miRNA targets, transcription factors, nucleotide content, evolution, associated disease, and environmental factors. Together, our results provided an extensive description of the global landscape, dynamics, and distinct mode of human miRNA stability, which provide help in investigating their functions in physiology and pathophysiology.

  16. Genome-wide linkage analysis for human longevity

    DEFF Research Database (Denmark)

    Beekman, Marian; Blanché, Hélène; Perola, Markus;

    2013-01-01

    Clear evidence exists for heritability of human longevity, and much interest is focused on identifying genes associated with longer lives. To identify such longevity alleles, we performed the largest genome-wide linkage scan thus far reported. Linkage analyses included 2118 nonagenarian Caucasian...... sibling pairs that have been enrolled in 15 study centers of 11 European countries as part of the Genetics of Healthy Aging (GEHA) project. In the joint linkage analyses, we observed four regions that show linkage with longevity; chromosome 14q11.2 (LOD = 3.47), chromosome 17q12-q22 (LOD = 2.......02 and P-value = 1.0 × 10(-5) , respectively. In the largest linkage scan thus far performed for human familial longevity, we confirm that the APOE locus is a longevity gene and that additional longevity loci may be identified at 14q11.2, 17q12-q22, and 19p13.3-p13.11. As the latter linkage results...

  17. A genome-wide meta-analysis of nodular sclerosing Hodgkin lymphoma identifies risk loci at 6p21.32.

    Science.gov (United States)

    Cozen, Wendy; Li, Dalin; Best, Timothy; Van Den Berg, David J; Gourraud, Pierre-Antoine; Cortessis, Victoria K; Skol, Andrew D; Mack, Thomas M; Glaser, Sally L; Weiss, Lawrence M; Nathwani, Bharat N; Bhatia, Smita; Schumacher, Fredrick R; Edlund, Christopher K; Hwang, Amie E; Slager, Susan L; Fredericksen, Zachary S; Strong, Louise C; Habermann, Thomas M; Link, Brian K; Cerhan, James R; Robison, Leslie L; Conti, David V; Onel, Kenan

    2012-01-12

    Nodular sclerosing Hodgkin lymphoma (NSHL) is a distinct, highly heritable Hodgkin lymphoma subtype. We undertook a genome-wide meta-analysis of 393 European-origin adolescent/young adult NSHL patients and 3315 controls using the Illumina Human610-Quad Beadchip and Affymetrix Genome-Wide Human SNP Array 6.0. We identified 3 single nucleotide polymorphisms (SNPs) on chromosome 6p21.32 that were significantly associated with NSHL risk: rs9268542 (P = 5.35 × 10(-10)), rs204999 (P = 1.44 × 10(-9)), and rs2858870 (P = 1.69 × 10(-8)). We also confirmed a previously reported association in the same region, rs6903608 (P = 3.52 × 10(-10)). rs204999 and rs2858870 were weakly correlated (r(2) = 0.257), and the remaining pairs of SNPs were not correlated (r(2) analysis, the DRB1*0701 allele was associated with a decreased risk of NSHL (OR = 0.5, 95% confidence interval = 0.4, 0.7). These data support the importance of the HLA class II region in NSHL etiology.

  18. A genome-wide screening and SNPs-to-genes approach to identify novel genetic risk factors associated with frontotemporal dementia.

    Science.gov (United States)

    Ferrari, Raffaele; Grassi, Mario; Salvi, Erika; Borroni, Barbara; Palluzzi, Fernando; Pepe, Daniele; D'Avila, Francesca; Padovani, Alessandro; Archetti, Silvana; Rainero, Innocenzo; Rubino, Elisa; Pinessi, Lorenzo; Benussi, Luisa; Binetti, Giuliano; Ghidoni, Roberta; Galimberti, Daniela; Scarpini, Elio; Serpente, Maria; Rossi, Giacomina; Giaccone, Giorgio; Tagliavini, Fabrizio; Nacmias, Benedetta; Piaceri, Irene; Bagnoli, Silvia; Bruni, Amalia C; Maletta, Raffaele G; Bernardi, Livia; Postiglione, Alfredo; Milan, Graziella; Franceschi, Massimo; Puca, Annibale A; Novelli, Valeria; Barlassina, Cristina; Glorioso, Nicola; Manunta, Paolo; Singleton, Andrew; Cusi, Daniele; Hardy, John; Momeni, Parastoo

    2015-10-01

    Frontotemporal dementia (FTD) is the second most prevalent form of early onset dementia after Alzheimer's disease (AD). We performed a case-control association study in an Italian FTD cohort (n = 530) followed by the novel single nucleotide polymorphisms (SNPs)-to-genes approach and functional annotation analysis. We identified 2 novel potential loci for FTD. Suggestive SNPs reached p-values ∼10(-7) and odds ratio > 2.5 (2p16.3) and 1.5 (17q25.3). Suggestive alleles at 17q25.3 identified a disease-associated haplotype causing decreased expression of -cis genes such as RFNG and AATK involved in neuronal genesis and differentiation and axon outgrowth, respectively. We replicated this locus through the SNPs-to-genes approach. Our functional annotation analysis indicated significant enrichment for functions of the brain (neuronal genesis, differentiation, and maturation), the synapse (neurotransmission and synapse plasticity), and elements of the immune system, the latter supporting our recent international FTD-genome-wide association study. This is the largest genome-wide study in Italian FTD to date. Although our results are not conclusive, we set the basis for future replication studies and identification of susceptible molecular mechanisms involved in FTD pathogenesis.

  19. Meta-Analysis of Genome-Wide Association Studies for Abdominal Aortic Aneurysm Identifies Four New Disease-Specific Risk Loci

    Science.gov (United States)

    Tromp, Gerard; Kuivaniemi, Helena; Gretarsdottir, Solveig; Baas, Annette F.; Giusti, Betti; Strauss, Ewa; van‘t Hof, Femke N.G.; Webb, Thomas R.; Erdman, Robert; Ritchie, Marylyn D.; Elmore, James R.; Verma, Anurag; Pendergrass, Sarah; Kullo, Iftikhar J.; Ye, Zi; Peissig, Peggy L.; Gottesman, Omri; Verma, Shefali S.; Malinowski, Jennifer; Rasmussen-Torvik, Laura J.; Borthwick, Kenneth M.; Smelser, Diane T.; Crosslin, David R.; de Andrade, Mariza; Ryer, Evan J.; McCarty, Catherine A.; Böttinger, Erwin P.; Pacheco, Jennifer A.; Crawford, Dana C.; Carrell, David S.; Gerhard, Glenn S.; Franklin, David P.; Carey, David J.; Phillips, Victoria L.; Williams, Michael J.A.; Wei, Wenhua; Blair, Ross; Hill, Andrew A.; Vasudevan, Thodor M.; Lewis, David R.; Thomson, Ian A.; Krysa, Jo; Hill, Geraldine B.; Roake, Justin; Merriman, Tony R.; Oszkinis, Grzegorz; Galora, Silvia; Saracini, Claudia; Abbate, Rosanna; Pulli, Raffaele; Pratesi, Carlo; Saratzis, Athanasios; Verissimo, Ana R.; Bumpstead, Suzannah; Badger, Stephen A.; Clough, Rachel E.; Cockerill, Gillian; Hafez, Hany; Scott, D. Julian A.; Futers, T. Simon; Romaine, Simon P.R.; Bridge, Katherine; Griffin, Kathryn J.; Bailey, Marc A.; Smith, Alberto; Thompson, Matthew M.; van Bockxmeer, Frank M.; Matthiasson, Stefan E.; Thorleifsson, Gudmar; Thorsteinsdottir, Unnur; Blankensteijn, Jan D.; Teijink, Joep A.W.; Wijmenga, Cisca; de Graaf, Jacqueline; Kiemeney, Lambertus A.; Lindholt, Jes S.; Hughes, Anne; Bradley, Declan T.; Stirrups, Kathleen; Golledge, Jonathan; Norman, Paul E.; Powell, Janet T.; Humphries, Steve E.; Hamby, Stephen E.; Goodall, Alison H.; Nelson, Christopher P.; Sakalihasan, Natzi; Courtois, Audrey; Ferrell, Robert E.; Eriksson, Per; Folkersen, Lasse; Franco-Cereceda, Anders; Eicher, John D.; Johnson, Andrew D.; Betsholtz, Christer; Ruusalepp, Arno; Franzén, Oscar; Schadt, Eric E.; Björkegren, Johan L.M.; Lipovich, Leonard; Drolet, Anne M.; Verhoeven, Eric L.; Zeebregts, Clark J.; Geelkerken, Robert H.; van Sambeek, Marc R.; van Sterkenburg, Steven M.; de Vries, Jean-Paul; Stefansson, Kari; Thompson, John R.; de Bakker, Paul I.W.; Deloukas, Panos; Sayers, Robert D.; Harrison, Seamus C.; van Rij, Andre M.; Samani, Nilesh J.

    2017-01-01

    Rationale: Abdominal aortic aneurysm (AAA) is a complex disease with both genetic and environmental risk factors. Together, 6 previously identified risk loci only explain a small proportion of the heritability of AAA. Objective: To identify additional AAA risk loci using data from all available genome-wide association studies. Methods and Results: Through a meta-analysis of 6 genome-wide association study data sets and a validation study totaling 10 204 cases and 107 766 controls, we identified 4 new AAA risk loci: 1q32.3 (SMYD2), 13q12.11 (LINC00540), 20q13.12 (near PCIF1/MMP9/ZNF335), and 21q22.2 (ERG). In various database searches, we observed no new associations between the lead AAA single nucleotide polymorphisms and coronary artery disease, blood pressure, lipids, or diabetes mellitus. Network analyses identified ERG, IL6R, and LDLR as modifiers of MMP9, with a direct interaction between ERG and MMP9. Conclusions: The 4 new risk loci for AAA seem to be specific for AAA compared with other cardiovascular diseases and related traits suggesting that traditional cardiovascular risk factor management may only have limited value in preventing the progression of aneurysmal disease. PMID:27899403

  20. Genome-wide association mapping in a wild avian population identifies a link between genetic and phenotypic variation in a life-history trait.

    Science.gov (United States)

    Husby, Arild; Kawakami, Takeshi; Rönnegård, Lars; Smeds, Linnéa; Ellegren, Hans; Qvarnström, Anna

    2015-05-07

    Understanding the genetic basis of traits involved in adaptation is a major challenge in evolutionary biology but remains poorly understood. Here, we use genome-wide association mapping using a custom 50 k single nucleotide polymorphism (SNP) array in a natural population of collared flycatchers to examine the genetic basis of clutch size, an important life-history trait in many animal species. We found evidence for an association on chromosome 18 where one SNP significant at the genome-wide level explained 3.9% of the phenotypic variance. We also detected two suggestive quantitative trait loci (QTLs) on chromosomes 9 and 26. Fitness differences among genotypes were generally weak and not significant, although there was some indication of a sex-by-genotype interaction for lifetime reproductive success at the suggestive QTL on chromosome 26. This implies that sexual antagonism may play a role in maintaining genetic variation at this QTL. Our findings provide candidate regions for a classic avian life-history trait that will be useful for future studies examining the molecular and cellular function of, as well as evolutionary mechanisms operating at, these loci.

  1. Meta-analysis of genome-wide association studies identifies multiple lung cancer susceptibility loci in never-smoking Asian women.

    Science.gov (United States)

    Wang, Zhaoming; Seow, Wei Jie; Shiraishi, Kouya; Hsiung, Chao A; Matsuo, Keitaro; Liu, Jie; Chen, Kexin; Yamji, Taiki; Yang, Yang; Chang, I-Shou; Wu, Chen; Hong, Yun-Chul; Burdett, Laurie; Wyatt, Kathleen; Chung, Charles C; Li, Shengchao A; Yeager, Meredith; Hutchinson, Amy; Hu, Wei; Caporaso, Neil; Landi, Maria T; Chatterjee, Nilanjan; Song, Minsun; Fraumeni, Joseph F; Kohno, Takashi; Yokota, Jun; Kunitoh, Hideo; Ashikawa, Kyota; Momozawa, Yukihide; Daigo, Yataro; Mitsudomi, Tetsuya; Yatabe, Yasushi; Hida, Toyoaki; Hu, Zhibin; Dai, Juncheng; Ma, Hongxia; Jin, Guangfu; Song, Bao; Wang, Zhehai; Cheng, Sensen; Yin, Zhihua; Li, Xuelian; Ren, Yangwu; Guan, Peng; Chang, Jiang; Tan, Wen; Chen, Chien-Jen; Chang, Gee-Chen; Tsai, Ying-Huang; Su, Wu-Chou; Chen, Kuan-Yu; Huang, Ming-Shyan; Chen, Yuh-Min; Zheng, Hong; Li, Haixin; Cui, Ping; Guo, Huan; Xu, Ping; Liu, Li; Iwasaki, Motoki; Shimazu, Taichi; Tsugane, Shoichiro; Zhu, Junjie; Jiang, Gening; Fei, Ke; Park, Jae Yong; Kim, Yeul Hong; Sung, Jae Sook; Park, Kyong Hwa; Kim, Young Tae; Jung, Yoo Jin; Kang, Chang Hyun; Park, In Kyu; Kim, Hee Nam; Jeon, Hyo-Sung; Choi, Jin Eun; Choi, Yi Young; Kim, Jin Hee; Oh, In-Jae; Kim, Young-Chul; Sung, Sook Whan; Kim, Jun Suk; Yoon, Ho-Il; Kweon, Sun-Seog; Shin, Min-Ho; Seow, Adeline; Chen, Ying; Lim, Wei-Yen; Liu, Jianjun; Wong, Maria Pik; Lee, Victor Ho Fun; Bassig, Bryan A; Tucker, Margaret; Berndt, Sonja I; Chow, Wong-Ho; Ji, Bu-Tian; Wang, Junwen; Xu, Jun; Sihoe, Alan Dart Loon; Ho, James C M; Chan, John K C; Wang, Jiu-Cun; Lu, Daru; Zhao, Xueying; Zhao, Zhenhong; Wu, Junjie; Chen, Hongyan; Jin, Li; Wei, Fusheng; Wu, Guoping; An, She-Juan; Zhang, Xu-Chao; Su, Jian; Wu, Yi-Long; Gao, Yu-Tang; Xiang, Yong-Bing; He, Xingzhou; Li, Jihua; Zheng, Wei; Shu, Xiao-Ou; Cai, Qiuyin; Klein, Robert; Pao, William; Lawrence, Charles; Hosgood, H Dean; Hsiao, Chin-Fu; Chien, Li-Hsin; Chen, Ying-Hsiang; Chen, Chung-Hsing; Wang, Wen-Chang; Chen, Chih-Yi; Wang, Chih-Liang; Yu, Chong-Jen; Chen, Hui-Ling; Su, Yu-Chun; Tsai, Fang-Yu; Chen, Yi-Song; Li, Yao-Jen; Yang, Tsung-Ying; Lin, Chien-Chung; Yang, Pan-Chyr; Wu, Tangchun; Lin, Dongxin; Zhou, Baosen; Yu, Jinming; Shen, Hongbing; Kubo, Michiaki; Chanock, Stephen J; Rothman, Nathaniel; Lan, Qing

    2016-02-01

    Genome-wide association studies (GWAS) of lung cancer in Asian never-smoking women have previously identified six susceptibility loci associated with lung cancer risk. To further discover new susceptibility loci, we imputed data from four GWAS of Asian non-smoking female lung cancer (6877 cases and 6277 controls) using the 1000 Genomes Project (Phase 1 Release 3) data as the reference and genotyped additional samples (5878 cases and 7046 controls) for possible replication. In our meta-analysis, three new loci achieved genome-wide significance, marked by single nucleotide polymorphism (SNP) rs7741164 at 6p21.1 (per-allele odds ratio (OR) = 1.17; P = 5.8 × 10(-13)), rs72658409 at 9p21.3 (per-allele OR = 0.77; P = 1.41 × 10(-10)) and rs11610143 at 12q13.13 (per-allele OR = 0.89; P = 4.96 × 10(-9)). These findings identified new genetic susceptibility alleles for lung cancer in never-smoking women in Asia and merit follow-up to understand their biological underpinnings.

  2. A genome-wide screening and SNPs-to-genes approach to identify novel genetic risk factors associated with frontotemporal dementia

    Science.gov (United States)

    Ferrari, Raffaele; Grassi, Mario; Salvi, Erika; Borroni, Barbara; Palluzzi, Fernando; Pepe, Daniele; D'Avila, Francesca; Padovani, Alessandro; Archetti, Silvana; Rainero, Innocenzo; Rubino, Elisa; Pinessi, Lorenzo; Benussi, Luisa; Binetti, Giuliano; Ghidoni, Roberta; Galimberti, Daniela; Scarpini, Elio; Serpente, Maria; Rossi, Giacomina; Giaccone, Giorgio; Tagliavini, Fabrizio; Nacmias, Benedetta; Piaceri, Irene; Bagnoli, Silvia; Bruni, Amalia C.; Maletta, Raffaele G.; Bernardi, Livia; Postiglione, Alfredo; Milan, Graziella; Franceschi, Massimo; Puca, Annibale A.; Novelli, Valeria; Barlassina, Cristina; Glorioso, Nicola; Manunta, Paolo; Singleton, Andrew; Cusi, Daniele; Hardy, John; Momeni, Parastoo

    2015-01-01

    Frontotemporal dementia (FTD) is the second most prevalent form of early onset dementia after Alzheimer's disease (AD). We performed a case-control association study in an Italian FTD cohort (n = 530) followed by the novel single nucleotide polymorphisms (SNPs)-to-genes approach and functional annotation analysis. We identified 2 novel potential loci for FTD. Suggestive SNPs reached p-values ∼10−7 and odds ratio > 2.5 (2p16.3) and 1.5 (17q25.3). Suggestive alleles at 17q25.3 identified a disease-associated haplotype causing decreased expression of –cis genes such as RFNG and AATK involved in neuronal genesis and differentiation and axon outgrowth, respectively. We replicated this locus through the SNPs-to-genes approach. Our functional annotation analysis indicated significant enrichment for functions of the brain (neuronal genesis, differentiation, and maturation), the synapse (neurotransmission and synapse plasticity), and elements of the immune system, the latter supporting our recent international FTD–genome-wide association study. This is the largest genome-wide study in Italian FTD to date. Although our results are not conclusive, we set the basis for future replication studies and identification of susceptible molecular mechanisms involved in FTD pathogenesis. PMID:26154020

  3. Genome-wide association study for levels of total serum IgE identifies HLA-C in a Japanese population.

    Directory of Open Access Journals (Sweden)

    Yohei Yatagai

    Full Text Available Most of the previously reported loci for total immunoglobulin E (IgE levels are related to Th2 cell-dependent pathways. We undertook a genome-wide association study (GWAS to identify genetic loci responsible for IgE regulation. A total of 479,940 single nucleotide polymorphisms (SNPs were tested for association with total serum IgE levels in 1180 Japanese adults. Fine-mapping with SNP imputation demonstrated 6 candidate regions: the PYHIN1/IFI16, MHC classes I and II, LEMD2, GRAMD1B, and chr13∶60576338 regions. Replication of these candidate loci in each region was assessed in 2 independent Japanese cohorts (n = 1110 and 1364, respectively. SNP rs3130941 in the HLA-C region was consistently associated with total IgE levels in 3 independent populations, and the meta-analysis yielded genome-wide significance (P = 1.07×10(-10. Using our GWAS results, we also assessed the reproducibility of previously reported gene associations with total IgE levels. Nine of 32 candidate genes identified by a literature search were associated with total IgE levels after correction for multiple testing. Our findings demonstrate that SNPs in the HLA-C region are strongly associated with total serum IgE levels in the Japanese population and that some of the previously reported genetic associations are replicated across ethnic groups.

  4. Upregulation of c-MYC in cis through a Large Chromatin Loop Linked to a Cancer Risk-Associated Single-Nucleotide Polymorphism in Colorectal Cancer Cells▿

    Science.gov (United States)

    Wright, Jason B.; Brown, Seth J.; Cole, Michael D.

    2010-01-01

    Genome-wide association studies have mapped many single-nucleotide polymorphisms (SNPs) that are linked to cancer risk, but the mechanism by which most SNPs promote cancer remains undefined. The rs6983267 SNP at 8q24 has been associated with many cancers, yet the SNP falls 335 kb from the nearest gene, c-MYC. We show that the beta-catenin-TCF4 transcription factor complex binds preferentially to the cancer risk-associated rs6983267(G) allele in colon cancer cells. We also show that the rs6983267 SNP has enhancer-related histone marks and can form a 335-kb chromatin loop to interact with the c-MYC promoter. Finally, we show that the SNP has no effect on the efficiency of chromatin looping to the c-MYC promoter but that the cancer risk-associated SNP enhances the expression of the linked c-MYC allele. Thus, cancer risk is a direct consequence of elevated c-MYC expression from increased distal enhancer activity and not from reorganization/creation of the large chromatin loop. The findings of these studies support a mechanism for intergenic SNPs that can promote cancer through the regulation of distal genes by utilizing preexisting large chromatin loops. PMID:20065031

  5. Association of a single nucleotide polymorphism at 6q25.1,rs2046210, with endometrial cancer risk among Chinese women

    Institute of Scientific and Technical Information of China (English)

    Guoliang Li; Qiuyin Cai; Yong-Bing Xiang; Regina Courtney; Jia-Rong Cheng; Bo Huang; Ji-Rong Long; Hui Cai; Wei Zheng; Xiao-Ou Shu

    2011-01-01

    A recent genome-wide association study identified a new susceptibility locus for breast cancer, rs2046210, which is a single nucleotide polymorphism (SNP) located upstream of the estrogen receptor α (ESR1) gene on chromosome 6q25.1. Given that endometrial cancer shares many risk factors with breast cancer and both are related to estrogen exposure and that rs2046210 is in close proximity to the ESR1 gene, we evaluated the association of SNP rs2046210 with endometrial cancer risk among 953 cases and 947 controls in a population-based, case-control study conducted in Shanghai, China. Logistic regression models were used to derive odds ratios (ORs) and 95% confidence intervals (95% Cis) after adjusting for potential confounders. We found that the A allele of rs2046210, linked to an increased risk of breast cancer, was associated with increased but not statistically significant risk of endometrial cancer (OR = 1.16, 95% CI = 0.96-1.41 for the GA and AA genotypes compared with the GG genotype); the association was stronger among post-menopausal women (OR = 1.28, 95% CI = 1.00-1.65). The association tended to be stronger among women with higher or longer estrogen exposure than among women with relatively lower or shorter exposure to estrogen. Our study suggests that rs2046210 may play a role in the etiology of endometrial cancer. Additional studies are needed to confirm our findings.

  6. Identification of neural outgrowth genes using genome-wide RNAi.

    Directory of Open Access Journals (Sweden)

    Katharine J Sepp

    2008-07-01

    Full Text Available While genetic screens have identified many genes essential for neurite outgrowth, they have been limited in their ability to identify neural genes that also have earlier critical roles in the gastrula, or neural genes for which maternally contributed RNA compensates for gene mutations in the zygote. To address this, we developed methods to screen the Drosophila genome using RNA-interference (RNAi on primary neural cells and present the results of the first full-genome RNAi screen in neurons. We used live-cell imaging and quantitative image analysis to characterize the morphological phenotypes of fluorescently labelled primary neurons and glia in response to RNAi-mediated gene knockdown. From the full genome screen, we focused our analysis on 104 evolutionarily conserved genes that when downregulated by RNAi, have morphological defects such as reduced axon extension, excessive branching, loss of fasciculation, and blebbing. To assist in the phenotypic analysis of the large data sets, we generated image analysis algorithms that could assess the statistical significance of the mutant phenotypes. The algorithms were essential for the analysis of the thousands of images generated by the screening process and will become a valuable tool for future genome-wide screens in primary neurons. Our analysis revealed unexpected, essential roles in neurite outgrowth for genes representing a wide range of functional categories including signalling molecules, enzymes, channels, receptors, and cytoskeletal proteins. We also found that genes known to be involved in protein and vesicle trafficking showed similar RNAi phenotypes. We confirmed phenotypes of the protein trafficking genes Sec61alpha and Ran GTPase using Drosophila embryo and mouse embryonic cerebral cortical neurons, respectively. Collectively, our results showed that RNAi phenotypes in primary neural culture can parallel in vivo phenotypes, and the screening technique can be used to identify many new

  7. Genome-wide survey for biologically functional pseudogenes.

    Directory of Open Access Journals (Sweden)

    Orjan Svensson

    2006-05-01

    Full Text Available According to current estimates there exist about 20,000 pseudogenes in a mammalian genome. The vast majority of these are disabled and nonfunctional copies of protein-coding genes which, therefore, evolve neutrally. Recent findings that a Makorin1 pseudogene, residing on mouse Chromosome 5, is, indeed, in vivo vital and also evolutionarily preserved, encouraged us to conduct a genome-wide survey for other functional pseudogenes in human, mouse, and chimpanzee. We identify to our knowledge the first examples of conserved pseudogenes common to human and mouse, originating from one duplication predating the human-mouse species split and having evolved as pseudogenes since the species split. Functionality is one possible way to explain the apparently contradictory properties of such pseudogene pairs, i.e., high conservation and ancient origin. The hypothesis of functionality is tested by comparing expression evidence and synteny of the candidates with proper test sets. The tests suggest potential biological function. Our candidate set includes a small set of long-lived pseudogenes whose unknown potential function is retained since before the human-mouse species split, and also a larger group of primate-specific ones found from human-chimpanzee searches. Two processed sequences are notable, their conservation since the human-mouse split being as high as most protein-coding genes; one is derived from the protein Ataxin 7-like 3 (ATX7NL3, and one from the Spinocerebellar ataxia type 1 protein (ATX1. Our approach is comparative and can be applied to any pair of species. It is implemented by a semi-automated pipeline based on cross-species BLAST comparisons and maximum-likelihood phylogeny estimations. To separate pseudogenes from protein-coding genes, we use standard methods, utilizing in-frame disablements, as well as a probabilistic filter based on Ka/Ks ratios.

  8. Genephony: a knowledge management tool for genome-wide research

    Science.gov (United States)

    Nuzzo, Angelo; Riva, Alberto

    2009-01-01

    Background One of the consequences of the rapid and widespread adoption of high-throughput experimental technologies is an exponential increase of the amount of data produced by genome-wide experiments. Researchers increasingly need to handle very large volumes of heterogeneous data, including both the data generated by their own experiments and the data retrieved from publicly available repositories of genomic knowledge. Integration, exploration, manipulation and interpretation of data and information therefore need to become as automated as possible, since their scale and breadth are, in general, beyond the limits of what individual researchers and the basic data management tools in normal use can handle. This paper describes Genephony, a tool we are developing to address these challenges. Results We describe how Genephony can be used to manage large datesets of genomic information, integrating them with existing knowledge repositories. We illustrate its functionalities with an example of a complex annotation task, in which a set of SNPs coming from a genotyping experiment is annotated with genes known to be associated to a phenotype of interest. We show how, thanks to the modular architecture of Genephony and its user-friendly interface, this task can be performed in a few simple steps. Conclusion Genephony is an online tool for the manipulation of large datasets of genomic information. It can be used as a browser for genomic data, as a high-throughput annotation tool, and as a knowledge discovery tool. It is designed to be easy to use, flexible and extensible. Its knowledge management engine provides fine-grained control over individual data elements, as well as efficient operations on large datasets. PMID:19728881

  9. Genome-wide promoter methylome of small renal masses.

    Directory of Open Access Journals (Sweden)

    Ilsiya Ibragimova

    Full Text Available The majority of renal cell carcinoma (RCC is now incidentally detected and presents as small renal masses (SRMs defined as ≤ 4 cm in size. SRMs are heterogeneous comprising several histological types of RCC each with different biology and behavior, and benign tumors mainly oncocytoma. The varied prognosis of the different types of renal tumor has implications for management options. A key epigenetic alteration involved in the initiation and progression of cancer is aberrant methylation in the promoter region of a gene. The hypermethylation is associated with transcriptional repression and is an important mechanism of inactivation of tumor suppressor genes in neoplastic cells. We have determined the genome-wide promoter methylation profiles of 47 pT1a and 2 pT1b clear cell, papillary or chromophobe RCC, 25 benign renal oncocytoma ≤ 4 cm and 4 normal renal parenchyma specimens by Infinium HumanMethylation27 beadchip technology. We identify gene promoter hypermethylation signatures that distinguish clear cell and papillary from each other, from chromophobe and oncocytoma, and from normal renal cells. Pairwise comparisons revealed genes aberrantly hypermethylated in a tumor type but unmethylated in normal, and often unmethylated in the other renal tumor types. About 0.4% to 1.7% of genes comprised the promoter methylome in SRMs. The Infinium methylation score for representative genes was verified by gold standard technologies. The genes identified as differentially methylated implicate pathways involved in metabolism, tissue response to injury, epithelial to mesenchymal transition (EMT, signal transduction and G-protein coupled receptors (GPCRs, cancer, and stem cell regulation in the biology of RCC. Our findings contribute towards an improved understanding of the development of RCC, the different biology and behavior of histological types, and discovery of molecular subtypes. The differential methylation signatures may have utility in early

  10. Genome-wide examination of myoblast cell cycle withdrawal duringdifferentiation

    Energy Technology Data Exchange (ETDEWEB)

    Shen, Xun; Collier, John Michael; Hlaing, Myint; Zhang, Leanne; Delshad, Elizabeth H.; Bristow, James; Bernstein, Harold S.

    2002-12-02

    Skeletal and cardiac myocytes cease division within weeks of birth. Although skeletal muscle retains limited capacity for regeneration through recruitment of satellite cells, resident populations of adult myocardial stem cells have not been identified. Because cell cycle withdrawal accompanies myocyte differentiation, we hypothesized that C2C12 cells, a mouse myoblast cell line previously used to characterize myocyte differentiation, also would provide a model for studying cell cycle withdrawal during differentiation. C2C12 cells were differentiated in culture medium containing horse serum and harvested at various time points to characterize the expression profiles of known cell cycle and myogenic regulatory factors by immunoblot analysis. BrdU incorporation decreased dramatically in confluent cultures 48 hr after addition of horse serum, as cells started to form myotubes. This finding was preceded by up-regulation of MyoD, followed by myogenin, and activation of Bcl-2. Cyclin D1 was expressed in proliferating cultures and became undetectable in cultures containing 40 percent fused myotubes, as levels of p21(WAF1/Cip1) increased and alpha-actin became detectable. Because C2C12 myoblasts withdraw from the cell cycle during myocyte differentiation following a course that recapitulates this process in vivo, we performed a genome-wide screen to identify other gene products involved in this process. Using microarrays containing approximately 10,000 minimally redundant mouse sequences that map to the UniGene database of the National Center for Biotechnology Information, we compared gene expression profiles between proliferating, differentiating, and differentiated C2C12 cells and verified candidate genes demonstrating differential expression by RT-PCR. Cluster analysis of differentially expressed genes revealed groups of gene products involved in cell cycle withdrawal, muscle differentiation, and apoptosis. In addition, we identified several genes, including DDAH2 and Ly

  11. Genome-wide signatures of convergent evolution in echolocating mammals.

    Science.gov (United States)

    Parker, Joe; Tsagkogeorga, Georgia; Cotton, James A; Liu, Yuan; Provero, Paolo; Stupka, Elia; Rossiter, Stephen J

    2013-10-10

    Evolution is typically thought to proceed through divergence of genes, proteins and ultimately phenotypes. However, similar traits might also evolve convergently in unrelated taxa owing to similar selection pressures. Adaptive phenotypic convergence is widespread in nature, and recent results from several genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution, although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show that convergence is not a rare process restricted to several loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four newly sequenced bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the bottlenose dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Unexpectedly, we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognized.

  12. Probabilistic protein function prediction from heterogeneous genome-wide data.

    Directory of Open Access Journals (Sweden)

    Naoki Nariai

    Full Text Available Dramatic improvements in high throughput sequencing technologies have led to a staggering growth in the number of predicted genes. However, a large fraction of these newly discovered genes do not have a functional assignment. Fortunately, a variety of novel high-throughput genome-wide functional screening technologies provide important clues that shed light on gene function. The integration of heterogeneous data to predict protein function has been shown to improve the accuracy of automated gene annotation systems. In this paper, we propose and evaluate a probabilistic approach for protein function prediction that integrates protein-protein interaction (PPI data, gene expression data, protein motif information, mutant phenotype data, and protein localization data. First, functional linkage graphs are constructed from PPI data and gene expression data, in which an edge between nodes (proteins represents evidence for functional similarity. The assumption here is that graph neighbors are more likely to share protein function, compared to proteins that are not neighbors. The functional linkage graph model is then used in concert with protein domain, mutant phenotype and protein localization data to produce a functional prediction. Our method is applied to the functional prediction of Saccharomyces cerevisiae genes, using Gene Ontology (GO terms as the basis of our annotation. In a cross validation study we show that the integrated model increases recall by 18%, compared to using PPI data alone at the 50% precision. We also show that the integrated predictor is significantly better than each individual predictor. However, the observed improvement vs. PPI depends on both the new source of data and the functional category to be predicted. Surprisingly, in some contexts integration hurts overall prediction accuracy. Lastly, we provide a comprehensive assignment of putative GO terms to 463 proteins that currently have no assigned function.

  13. Improved statistics for genome-wide interaction analysis.

    Science.gov (United States)

    Ueki, Masao; Cordell, Heather J

    2012-01-01

    Recently, Wu and colleagues [1] proposed two novel statistics for genome-wide interaction analysis using case/control or case-only data. In computer simulations, their proposed case/control statistic outperformed competing approaches, including the fast-epistasis option in PLINK and logistic regression analysis under the correct model; however, reasons for its superior performance were not fully explored. Here we investigate the theoretical properties and performance of Wu et al.'s proposed statistics and explain why, in some circumstances, they outperform competing approaches. Unfortunately, we find minor errors in the formulae for their statistics, resulting in tests that have higher than nominal type 1 error. We also find minor errors in PLINK's fast-epistasis and case-only statistics, although theory and simulations suggest that these errors have only negligible effect on type 1 error. We propose adjusted versions of all four statistics that, both theoretically and in computer simulations, maintain correct type 1 error rates under the null hypothesis. We also investigate statistics based on correlation coefficients that maintain similar control of type 1 error. Although designed to test specifically for interaction, we show that some of these previously-proposed statistics can, in fact, be sensitive to main effects at one or both loci, particularly in the presence of linkage disequilibrium. We propose two new "joint effects" statistics that, provided the disease is rare, are sensitive only to genuine interaction effects. In computer simulations we find, in most situations considered, that highest power is achieved by analysis under the correct genetic model. Such an analysis is unachievable in practice, as we do not know this model. However, generally high power over a wide range of scenarios is exhibited by our joint effects and adjusted Wu statistics. We recommend use of these alternative or adjusted statistics and urge caution when using Wu et al

  14. Genome-wide methylation analyses in glioblastoma multiforme.

    Directory of Open Access Journals (Sweden)

    Rose K Lai

    Full Text Available Few studies had investigated genome-wide methylation in glioblastoma multiforme (GBM. Our goals were to study differential methylation across the genome in gene promoters using an array-based method, as well as repetitive elements using surrogate global methylation markers. The discovery sample set for this study consisted of 54 GBM from Columbia University and Case Western Reserve University, and 24 brain controls from the New York Brain Bank. We assembled a validation dataset using methylation data of 162 TCGA GBM and 140 brain controls from dbGAP. HumanMethylation27 Analysis Bead-Chips (Illumina were used to interrogate 26,486 informative CpG sites in both the discovery and validation datasets. Global methylation levels were assessed by analysis of L1 retrotransposon (LINE1, 5 methyl-deoxycytidine (5m-dC and 5 hydroxylmethyl-deoxycytidine (5hm-dC in the discovery dataset. We validated a total of 1548 CpG sites (1307 genes that were differentially methylated in GBM compared to controls. There were more than twice as many hypomethylated genes as hypermethylated ones. Both the discovery and validation datasets found 5 tumor methylation classes. Pathway analyses showed that the top ten pathways in hypomethylated genes were all related to functions of innate and acquired immunities. Among hypermethylated pathways, transcriptional regulatory network in embryonic stem cells was the most significant. In the study of global methylation markers, 5m-dC level was the best discriminant among methylation classes, whereas in survival analyses, high level of LINE1 methylation was an independent, favorable prognostic factor in the discovery dataset. Based on a pathway approach, hypermethylation in genes that control stem cell differentiation were significant, poor prognostic factors of overall survival in both the discovery and validation datasets. Approaches that targeted these methylated genes may be a future therapeutic goal.

  15. A genome-wide association study for venous thromboembolism: the extended cohorts for heart and aging research in genomic epidemiology (CHARGE) consortium.

    Science.gov (United States)

    Tang, Weihong; Teichert, Martina; Chasman, Daniel I; Heit, John A; Morange, Pierre-Emmanuel; Li, Guo; Pankratz, Nathan; Leebeek, Frank W; Paré, Guillaume; de Andrade, Mariza; Tzourio, Christophe; Psaty, Bruce M; Basu, Saonli; Ruiter, Rikje; Rose, Lynda; Armasu, Sebastian M; Lumley, Thomas; Heckbert, Susan R; Uitterlinden, André G; Lathrop, Mark; Rice, Kenneth M; Cushman, Mary; Hofman, Albert; Lambert, Jean-Charles; Glazer, Nicole L; Pankow, James S; Witteman, Jacqueline C; Amouyel, Philippe; Bis, Joshua C; Bovill, Edwin G; Kong, Xiaoxiao; Tracy, Russell P; Boerwinkle, Eric; Rotter, Jerome I; Trégouët, David-Alexandre; Loth, Daan W; Stricker, Bruno H Ch; Ridker, Paul M; Folsom, Aaron R; Smith, Nicholas L

    2013-07-01

    Venous thromboembolism (VTE) is a common, heritable disease resulting in high rates of hospitalization and mortality. Yet few associations between VTE and genetic variants, all in the coagulation pathway, have been established. To identify additional genetic determinants of VTE, we conducted a two-stage genome-wide association study (GWAS) among individuals of European ancestry in the extended cohorts for heart and aging research in genomic epidemiology (CHARGE) VTE consortium. The discovery GWAS comprised 1,618 incident VTE cases out of 44,499 participants from six community-based studies. Genotypes for genome-wide single-nucleotide polymorphisms (SNPs) were imputed to approximately 2.5 million SNPs in HapMap and association with VTE assessed using study-design appropriate regression methods. Meta-analysis of these results identified two known loci, in F5 and ABO. Top 1,047 tag SNPs (P ≤ 0.0016) from the discovery GWAS were tested for association in an additional 3,231 cases and 3,536 controls from three case-control studies. In the combined data from these two stages, additional genome-wide significant associations were observed on 4q35 at F11 (top SNP rs4253399, intronic to F11) and on 4q28 at FGG (rs6536024, 9.7 kb from FGG; P FGG locus were not completely explained by previously reported variants. Loci at or near SUSD1 and OTUD7A showed borderline yet novel associations (P FGG loci for VTE. Future studies are warranted to better characterize the associations with F11 and FGG and to replicate the new candidate associations.

  16. Polygenic transmission and complex neuro developmental network for attention deficit hyperactivity disorder: genome-wide association study of both common and rare variants.

    Science.gov (United States)

    Yang, Li; Neale, Benjamin M; Liu, Lu; Lee, S Hong; Wray, Naomi R; Ji, Ning; Li, Haimei; Qian, Qiujin; Wang, Dongliang; Li, Jun; Faraone, Stephen V; Wang, Yufeng; Doyle, Alysa E; Reif, Andreas; Rothenberger, Aribert; Franke, Barbara; Sonuga-Barke, Edmund J S; Steinhausen, Hans-Christoph; Buitelaar, Jan K; Kuntsi, Jonna; Biederman, Joseph; Lesch, Klaus-Peter; Kent, Lindsey; Asherson, Philip; Oades, Robert D; Loo, Sandra K; Nelson, Stan F; Faraone, Stephen V; Smalley, Susan L; Banaschewski, Tobias; Arias Vasquez, Alejandro; Todorov, Alexandre; Charach, Alice; Miranda, Ana; Warnke, Andreas; Thapar, Anita; Neale, Benjamin M; Cormand, Bru; Freitag, Christine; Mick, Eric; Mulas, Fernando; Middleton, Frank; HakonarsonHakonarson, Hakon; Palmason, Haukur; Schäfer, Helmut; Roeyers, Herbert; McGough, James J; Romanos, Jasmin; Crosbie, Jennifer; Meyer, Jobst; Ramos-Quiroga, Josep Antoni; Sergeant, Joseph; Elia, Josephine; Langely, Kate; Nisenbaum, Laura; Romanos, Marcel; Daly, Mark J; Ribasés, Marta; Gill, Michael; O'Donovan, Michael; Owen, Michael; Casas, Miguel; Bayés, Mònica; Lambregts-Rommelse, Nanda; Williams, Nigel; Holmans, Peter; Anney, Richard J L; Ebstein, Richard P; Schachar, Russell; Medland, Sarah E; Ripke, Stephan; Walitza, Susanne; Nguyen, Thuy Trang; Renner, Tobias J; Hu, Xiaolan

    2013-07-01

    Attention-deficit hyperactivity disorder (ADHD) is a complex polygenic disorder. This study aimed to discover common and rare DNA variants associated with ADHD in a large homogeneous Han Chinese ADHD case-control sample. The sample comprised 1,040 cases and 963 controls. All cases met DSM-IV ADHD diagnostic criteria. We used the Affymetrix6.0 array to assay both single nucleotide polymorphisms (SNPs) and copy number variants (CNVs). Genome-wide association analyses were performed using PLINK. SNP-heritability and SNP-genetic correlations with ADHD in Caucasians were estimated with genome-wide complex trait analysis (GCTA). Pathway analyses were performed using the Interval enRICHment Test (INRICH), the Disease Association Protein-Protein Link Evaluator (DAPPLE), and the Genomic Regions Enrichment of Annotations Tool (GREAT). We did not find genome-wide significance for single SNPs but did find an increased burden of large, rare CNVs in the ADHD sample (P = 0.038). SNP-heritability was estimated to be 0.42 (standard error, 0.13, P = 0.0017) and the SNP-genetic correlatio