Neale, Benjamin M.; Medland, Sarah; Ripke, Stephan; Anney, Richard J. L.; Asherson, Philip; Buitelaar, Jan; Franke, Barbara; Gill, Michael; Kent, Lindsey; Holmans, Peter; Middleton, Frank; Thapar, Anita; Lesch, Klaus-Peter; Faraone, Stephen V.; Daly, Mark; Nguyen, Thuy Trang; Schafer, Helmut; Steinhausen, Hans-Christoph; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Freitag, Christine; Meyer, Jobst; Palmason, Haukur; Rothenberger, Aribert; Hawi, Ziarih; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph
Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. Thus additional genome-wide association studies (GWAS) are needed. Method: We used case-control analyses of 896 cases…
Menke, Andreas; Domschke, Katharina; Czamara, Darina; Klengel, Torsten; Hennings, Johannes; Lucae, Susanne; Baune, Bernhard T; Arolt, Volker; Müller-Myhsok, Bertram; Holsboer, Florian; Binder, Elisabeth B
Emergence of suicidal ideation (TESI) during treatment with antidepressants in major depression led to a black box warning. We performed a genome-wide association study to identify genetic markers, which increase the risk for this serious side effect. TESI was evaluated in depressed in-patients (N=397) and defined by an emergence of suicidal thoughts during hospitalization without suicidal thoughts at admission using the suicide item (3) of the Hamilton Depression Rating Scale. Genotype distribution of 405.383 single-nucleotide polymorphisms (SNPs) in patients with TESI (N=32/8.1%) was compared to patients without increase in suicidal ideation (N=329/82.9%) and to a subgroup never reported suicidal ideation (N=79/19.9%). Top results were analyzed in an independent sample (N=501). None variant reached genome-wide significance, the best associated SNP was rs1630535 (p-value=1.3 × 10(-7)). The top 79 SNPs could be analyzed in an independent sample, and 14 variants showed nominal significant association with the same risk allele in the replication sample. A discriminant analysis classifying patients using these 79 SNPs revealed a 91% probability to classify TESI vs non-TESI cases correctly in the replication sample. Although our data need to be interpreted carefully owing to the small numbers in both cohorts, they suggest that a combination of genetic markers might indeed be used to identify patients at risk for TESI.
Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C
Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for (i) organizational aspects of GWAMAs, and for (ii) QC...... a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for the use of a powerful and flexible software package called EasyQC. Precise timings will be greatly influenced by consortium size. For consortia of comparable...
Full Text Available Twin registries around the globe have collected DNA samples from large numbers of monozygotic and dizygotic twins. The twin sample collections are frequently used as controls in disease-specific studies together with non-twins. This approach is unbiased under the hypothesis that twins and singletons are comparable in terms of allele frequencies; i.e. there are no genetic variants associated with being a twin per se. To test this hypothesis we performed a genome-wide association study comparing the allele frequency of 572,352 single nucleotide polymorphisms (SNPs in 1,413 monozygotic (MZ and 5,451 dizygotic (DZ twins with 3,720 healthy singletons. Twins and singletons have been genotyped using the same platform. SNPs showing association with being a twin at P-value < 1 × 10(-5 were selected for replication analysis in 1,492 twins (463 MZ and 1,029 DZ and 1,880 singletons from Finland. No SNPs reached genome-wide significance (P-value < 5 × 10(-8 in the main analysis combining MZ and DZ twins. In a secondary analysis including only DZ twins two SNPs (rs2033541 close to ADAMTSL1 and rs4149283 close to ABCA1 were genome-wide significant after meta-analysis with the Finnish population. The estimated proportion of variance on the liability scale explained by all SNPs was 0.08 (P-value=0.003 when MZ and DZ were considered together and smaller for MZ (0.06, P-value=0.10 compared to DZ (0.09, P-value=0.003 when analyzed separately. In conclusion, twins and singletons can be used in genetic studies together with general population samples without introducing large bias. Further research is needed to explore genetic variances associated with DZ twinning.
Neale, Benjamin M.; Medland, Sarah; Ripke, Stephan; Anney, Richard J.L.; Asherson, Philip; Buitelaar, Jan; Franke, Barbara; Gill, Michael; Kent, Lindsey; Holmans, Peter; Middleton, Frank; Thapar, Anita; Lesch, Klaus-Peter; Faraone, Stephen V.; Daly, Mark; Nguyen, Thuy Trang; Schäfer, Helmut; Steinhausen, Hans-Christoph; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Freitag, Christine; Meyer, Jobst; Palmason, Haukur; Rothenberger, Aribert; Hawi, Ziarih; Sergeant, Joseph; Roeyers, Herbert; Biederman, Joseph
Objective Although twin and family studies have shown attention deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. Thus, additional genomewide association studies (GWAS) are needed. Method We used case-control analyses of 896 cases with DSM-IV ADHD genotyped using the Affymetrix 5.0 array and 2,455 repository controls screened for psychotic and bipolar symptoms genotyped using Affymetrix 6.0 arrays. A consensus SNP set was imputed using BEAGLE 3.0, resulting in an analysis dataset of 1,033,244 SNPs. The data were analyzed using a generalized linear model. Results No genome-wide significant associations were found. The most significant results implicated the following genes: PRKG1, FLNC, TCERG1L, PPM1H, NXPH1, PPM1H, CDH13, HK1 and HKDC1. Conclusions The current analyses are a useful addition to the present literature and will make a valuable contribution to future meta-analyses. The candidate gene findings are consistent with a prior meta-analysis in suggesting that the effects of ADHD risk variants must, individually, be very small and/or include multiple rare alleles. PMID:20732627
Malovini, Alberto; Rognoni, Carla; Puca, Annibale; Bellazzi, Riccardo
Experimental errors in the genotyping phases of a Genome-Wide Association Study (GWAS) can lead to false positive findings and to spurious associations. An appropriate quality control phase could minimize the effects of this kind of errors. Several filtering criteria can be used to perform quality control. Currently, no formal methods have been proposed for taking into account at the same time these criteria and the experimenter's preferences. In this paper we propose two strategies for setting appropriate genotyping rate thresholds for GWAS quality control. These two approaches are based on the Multi-Criteria Decision Making theory. We have applied our method on a real dataset composed by 734 individuals affected by Arterial Hypertension (AH) and 486 nonagenarians without history of AH. The proposed strategies appear to deal with GWAS quality control in a sound way, as they lead to rationalize and make explicit the experimenter's choices thus providing more reproducible results.
Zhang, Xiang-Yuan; He, Chao; Ye, Bing-Yu; Xie, De-Jian; Shi, Ming-Lei; Zhang, Yan; Shen, Wen-Long; Li, Ping; Zhao, Zhi-Hu
Highest-throughput chromosome conformation capture (Hi-C) is one of the key assays for genome- wide chromatin interaction studies. It is a time-consuming process that involves many steps and many different kinds of reagents, consumables, and equipments. At present, the reproducibility is unsatisfactory. By optimizing the key steps of the Hi-C experiment, such as crosslinking, pretreatment of digestion, inactivation of restriction enzyme, and in situ ligation etc., we established a robust Hi-C procedure and prepared two biological replicates of Hi-C libraries from the GM12878 cells. After preliminary quality control by Sanger sequencing, the two replicates were high-throughput sequenced. The bioinformatics analysis of the raw sequencing data revealed the mapping-ability and pair-mate rate of the raw data were around 90% and 72%, respectively. Additionally, after removal of self-circular ligations and dangling-end products, more than 96% of the valid pairs were reached. Genome-wide interactome profiling shows clear topological associated domains (TADs), which is consistent with previous reports. Further correlation analysis showed that the two biological replicates strongly correlate with each other in terms of both bin coverage and all bin pairs. All these results indicated that the optimized Hi-C procedure is robust and stable, which will be very helpful for the wide applications of the Hi-C assay.
Full Text Available We investigated the ability of several principal components analysis (PCA-based strategies to detect and control for population stratification using data from a multi-center study of epithelial ovarian cancer among women of European-American ethnicity. These include a correction based on an ancestry informative markers (AIMs panel designed to capture European ancestral variation and corrections utilizing un-thinned genome-wide SNP data; case-control samples were drawn from four geographically distinct North-American sites. The AIMs-only and genome-wide first principal components (PC1 both corresponded to the previously described North or Northwest-Southeast axis of European variation. We found that the genome-wide PCA captured this primary dimension of variation more precisely and identified additional axes of genome-wide variation of relevance to epithelial ovarian cancer. Associations evident between the genome-wide PCs and study site corroborate North American immigration history and suggest that undiscovered dimensions of variation lie within Northern Europe. The structure captured by the genome-wide PCA was also found within control individuals and did not reflect the case-control variation present in the data. The genome-wide PCA highlighted three regions of local LD, corresponding to the lactase (LCT gene on chromosome 2, the human leukocyte antigen system (HLA on chromosome 6 and to a common inversion polymorphism on chromosome 8. These features did not compromise the efficacy of PCs from this analysis for ancestry control. This study concludes that although AIMs panels are a cost-effective way of capturing population structure, genome-wide data should preferably be used when available.
Shefali S Verma
Full Text Available The electronic MEdical Records and GEnomics (eMERGE network brings together DNA biobanks linked to electronic health records (EHRs from multiple institutions. Approximately 52,000 DNA samples from distinct individuals have been genotyped using genome-wide SNP arrays across the nine sites of the network. The eMERGE Coordinating Center and the Genomics Workgroup developed a pipeline to impute and merge genomic data across the different SNP arrays to maximize sample size and power to detect associations with a variety of clinical endpoints. The 1000 Genomes cosmopolitan reference panel was used for imputation. Imputation results were evaluated using the following metrics: accuracy of imputation, allelic R2 (estimated correlation between the imputed and true genotypes, and the relationship between allelic R2 and minor allele frequency. Computation time and memory resources required by two different software packages (BEAGLE and IMPUTE2 were also evaluated. A number of challenges were encountered due to the complexity of using two different imputation software packages, multiple ancestral populations, and many different genotyping platforms. We present lessons learned and describe the pipeline implemented here to impute and merge genomic data sets. The eMERGE imputed dataset will serve as a valuable resource for discovery, leveraging the clinical data that can be mined from the EHR.
Zheng, Gang; Joo, Jungnam; Lin, Jing-Ping; Stylianou, Mario; Waclawiw, Myron A; Geller, Nancy L
In whole-genome association studies, at the first stage, all markers are tested for association and their test statistics or p-values are ranked. At the second stage, some most significant markers are further analyzed by more powerful statistical methods. This helps reduce the number of hypotheses to be corrected for in multiple testing. Ranks of true associations in genome-wide scans using a single test statistic have been studied. In a case-control design for association, the trend test has been proposed. However, three different trend tests, optimal for the recessive, additive, and dominant models, respectively, are available for each marker. Because the true genetic model is unknown, we rank markers based on multiple test statistics or test statistics robust to model mis-specification. We studied this problem with application to Problem 3 of Genetic Analysis Workshop 15. An independent simulation study was also conducted to further evaluate the proposed procedure.
Chen, Zhongxue; Huang, Hanwen; Ng, Hon Keung Tony
In genome-wide association studies (GWAS), multiple diseases with shared controls is one of the case-control study designs. If data obtained from these studies are appropriately analyzed, this design can have several advantages such as improving statistical power in detecting associations and reducing the time and cost in the data collection process. In this paper, we propose a study design for GWAS which involves multiple diseases but without controls. We also propose corresponding statistical data analysis strategy for GWAS with multiple diseases but no controls. Through a simulation study, we show that the statistical association test with the proposed study design is more powerful than the test with single disease sharing common controls, and it has comparable power to the overall test based on the whole dataset including the controls. We also apply the proposed method to a real GWAS dataset to illustrate the methodologies and the advantages of the proposed design. Some possible limitations of this study design and testing method and their solutions are also discussed. Our findings indicate that the proposed study design and statistical analysis strategy could be more efficient than the usual case-control GWAS as well as those with shared controls. Copyright © 2012 Elsevier B.V. All rights reserved.
DNA methylation is the most studied epigenetic modification, and altered DNA methylation patterns have been identified in cancer and more recently also in many other complex diseases. Furthermore, DNA methylation is influenced by a variety of environmental factors, and the analysis of DNA methylation patterns might allow deciphering previous exposure. Although a large number of techniques to study DNA methylation either genome-wide or at specific loci have been devised, they all are based on a limited number of principles for differentiating the methylation state, viz., methylation-specific/methylation-dependent restriction enzymes, antibodies or methyl-binding proteins, chemical-based enrichment, or bisulfite conversion. Second-generation sequencing has largely replaced microarrays as readout platform and is also becoming more popular for locus-specific DNA methylation analysis. In this chapter, the currently used methods for both genome-wide and locus-specific analysis of 5-methylcytosine and as its oxidative derivatives, such as 5-hydroxymethylcytosine, are reviewed in detail, and the advantages and limitations of each approach are discussed. Furthermore, emerging technologies avoiding PCR amplification and allowing a direct readout of DNA methylation are summarized, together with novel applications, such as the detection of DNA methylation in single cells or in circulating cell-free DNA.
Neale, B.M.; Medland, S.; Ripke, S.; Anney, R.J.; Asherson, P.; Buitelaar, J.K.; Franke, B.; Gill, M.; Kent, L.; Holmans, P.; Middleton, F.; Thapar, A.; Lesch, K.P.; Faraone, S.V.; Daly, M.; Nguyen, T.T.; Schafer, H.; Steinhausen, H.C.; Reif, A.; Renner, T.J.; Romanos, M.; Romanos, J.; Warnke, A.; Walitza, S.; Freitag, C.; Meyer, J.; Palmason, H.; Rothenberger, A.; Hawi, Z.; Sergeant, J.A.; Roeyers, H.; Mick, E.; Biederman, J.
OBJECTIVE: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. Thus additional genomewide association studies (GWAS) are needed.
Lasky-Su, J.; Won, S.; Mick, E.; Anney, R.J.; Franke, B.; Neale, B.; Biederman, J.; Smalley, S.L.; Loo, S.K.; Todorov, A.; Faraone, S.V.; Weiss, S.T.; Lange, C.
Large numbers of control individuals with genome-wide genotype data are now available through various databases. These controls are regularly used in case-control genome-wide association studies (GWAS) to increase the statistical power. Controls are often "unselected" for the disease of interest and
Braxton D Mitchell
Full Text Available Genome-wide association studies (GWAS are widely applied to identify susceptibility loci for a variety of diseases using genotyping arrays that interrogate known polymorphisms throughout the genome. A particular strength of GWAS is that it is unbiased with respect to specific genomic elements (e.g., coding or regulatory regions of genes, and it has revealed important associations that would have never been suspected based on prior knowledge or assumptions. To date, the discovered SNPs associated with complex human traits tend to have small effect sizes, requiring very large sample sizes to achieve robust statistical power. To address these issues, a number of efficient strategies have emerged for conducting GWAS, including combining study results across multiple studies using meta-analysis, collecting cases through electronic health records, and using samples collected from other studies as controls that have already been genotyped and made publicly available (e.g., through deposition of de-identified data into dbGaP or EGA.In certain scenarios, it may be attractive to use already genotyped controls and divert resources to standardized collection, phenotyping, and genotyping of cases only. This strategy, however, requires that careful attention be paid to the choice of public controls and to the comparability of genetic data between cases and the public controls to ensure that any allele frequency differences observed between groups is attributable to locus-specific effects rather than to a systematic bias due to poor matching (population stratification or differential genotype calling (batch effects.The goal of this paper is to describe some of the potential pitfalls in using previously genotyped control data. We focus on considerations related to the choice of control groups, the use of different genotyping platforms, and approaches to deal with population stratification when cases and controls are genotyped across different platforms.
Ali, Fadhaa; Zhang, Jian
Multilocus haplotype analysis of candidate variants with genome wide association studies (GWAS) data may provide evidence of association with disease, even when the individual loci themselves do not. Unfortunately, when a large number of candidate variants are investigated, identifying risk haplotypes can be very difficult. To meet the challenge, a number of approaches have been put forward in recent years. However, most of them are not directly linked to the disease-penetrances of haplotypes and thus may not be efficient. To fill this gap, we propose a mixture model-based approach for detecting risk haplotypes. Under the mixture model, haplotypes are clustered directly according to their estimated disease penetrances. A theoretical justification of the above model is provided. Furthermore, we introduce a hypothesis test for haplotype inheritance patterns which underpin this model. The performance of the proposed approach is evaluated by simulations and real data analysis. The results show that the proposed approach outperforms an existing multiple testing method.
Bai, Xufeng; Zhao, Hu; Huang, Yong; Xie, Weibo; Han, Zhongmin; Zhang, Bo; Guo, Zilong; Yang, Lin; Dong, Haijiao; Xue, Weiya; Li, Guangwei; Hu, Gang; Hu, Yong; Xing, Yongzhong
Panicle architecture determines the number of spikelets per panicle (SPP) and is highly associated with grain yield in rice ( L.). Understanding the genetic basis of panicle architecture is important for improving the yield of rice grain. In this study, we dissected panicle architecture traits into eight components, which were phenotyped from a germplasm collection of 529 cultivars. Multiple regression analysis revealed that the number of secondary branch (NSB) was the major factor that contributed to SPP. Genome-wide association analysis was performed independently for the eight particle architecture traits observed in the and rice subpopulations compared with the whole rice population. In total, 30 loci were associated with these traits. Of these, 13 loci were closely linked to known panicle architecture genes, and 17 novel loci were repeatedly identified in different environments. An association signal cluster was identified for NSB and number of spikelets per secondary branch (NSSB) in the region of 31.6 to 31.7 Mb on chromosome 4. In addition to the common associations detected in both and subpopulations, many associated loci were unique to one subpopulation. For example, and were specifically associated with panicle length (PL) in and rice, respectively. Moreover, the -mediated flowering genes and were associated with the formation of panicle architecture in rice. These results suggest that different gene networks regulate panicle architecture in and rice. Copyright © 2016 Crop Science Society of America.
Fei, Gao; Luo, Yonglun; Li, Shengting
Animal breeding via Somatic Cell Nuclear Transfer (SCNT) has enormous potential in agriculture and biomedicine. However, concerns about whether SCNT animals are as healthy or epigenetically normal as conventionally bred ones are raised as the efficiency of cloning by SCNT is much lower than natural...... breeding or In-vitro fertilization (IVF). Thus, we have conducted a genome-wide gene expression and DNA methylation profiling between phenotypically normal cloned pigs and control pigs in two tissues (muscle and liver), using Affymetrix Porcine expression array as well as modified methylation......-specific digital karyotyping (MMSDK) and Solexa sequencing technology. Typical tissue-specific differences with respect to both gene expression and DNA methylation were observed in muscle and liver from cloned as well as control pigs. Gene expression profiles were highly similar between cloned pigs and controls...
Mhatre, Sharayu; Wang, Zhaoming; Nagrani, Rajini; Badwe, Rajendra; Chiplunkar, Shubhada; Mittal, Balraj; Yadav, Saurabh; Zhang, Haoyu; Chung, Charles C; Patil, Prachi; Chanock, Stephen; Dikshit, Rajesh; Chatterjee, Nilanjan; Rajaraman, Preetha
Gallbladder cancer is highly lethal, with notable differences in incidence by geography and ethnic background. The aim of this study was to identify common genetic susceptibility alleles for gallbladder cancer. In this case-control genome-wide association study (GWAS), we did a genome-wide scan of gallbladder cancer cases and hospital visitor controls, both of Indian descent, followed by imputation across the genome. Cases were patients aged 20-80 years with microscopically confirmed primary gallbladder cancer diagnosed or treated at Tata Memorial Hospital, Mumbai, India, and enrolled in the study between Sept 12, 2010, and June 8, 2015. We only included patients who had been diagnosed less than 1 year before the date of enrolment and excluded patients with any other malignancies. We recruited visitor controls aged 20-80 years with no history of cancer visiting all departments or units of Tata Memorial Hospital during the same time period and frequency matched them to cases on the basis of age, sex, and current region of residence. We estimated association using logistic regression, adjusting for age, sex, and five eigenvectors. We recruited samples for a replication cohort from patients visiting Tata Memorial Hospital between Aug 4, 2015, and May 17, 2016, and patients visiting the Sanjay Gandhi Postgraduate Institute of Medical Sciences, Lucknow, India, between July, 2010, and May, 2015. We used the same inclusion and exclusion criteria for the replication set. We examined three of the most significant single-nucleotide polymorphisms (SNPs) in the replication cohort and did a meta-analysis of the GWAS discovery and replication sets to get combined estimates of association. The discovery cohort comprised 1042 gallbladder cancer cases and 1709 controls and the replication cohort contained 428 gallbladder cancer cases and 420 controls. We observed genome-wide significant associations for several markers in the chromosomal region 7q21.12 harbouring both the ABCB1 and
Oluf Dimitri Røe
Full Text Available BACKGROUND: Malignant pleural mesothelioma is considered an almost incurable tumour with increasing incidence worldwide. It usually develops in the parietal pleura, from mesothelial lining or submesothelial cells, subsequently invading the visceral pleura. Chromosomal and genomic aberrations of mesothelioma are diverse and heterogenous. Genome-wide profiling of mesothelioma versus parietal and visceral normal pleural tissue could thus reveal novel genes and pathways explaining its aggressive phenotype. METHODOLOGY AND PRINCIPAL FINDINGS: Well-characterised tissue from five mesothelioma patients and normal parietal and visceral pleural samples from six non-cancer patients were profiled by Affymetrix oligoarray of 38 500 genes. The lists of differentially expressed genes tested for overrepresentation in KEGG PATHWAYS (Kyoto Encyclopedia of Genes and Genomes and GO (gene ontology terms revealed large differences of expression between visceral and parietal pleura, and both tissues differed from mesothelioma. Cell growth and intrinsic resistance in tumour versus parietal pleura was reflected in highly overexpressed cell cycle, mitosis, replication, DNA repair and anti-apoptosis genes. Several genes of the "salvage pathway" that recycle nucleobases were overexpressed, among them TYMS, encoding thymidylate synthase, the main target of the antifolate drug pemetrexed that is active in mesothelioma. Circadian rhythm genes were expressed in favour of tumour growth. The local invasive, non-metastatic phenotype of mesothelioma, could partly be due to overexpression of the known metastasis suppressors NME1 and NME2. Down-regulation of several tumour suppressor genes could contribute to mesothelioma progression. Genes involved in cell communication were down-regulated, indicating that mesothelioma may shield itself from the immune system. Similarly, in non-cancer parietal versus visceral pleura signal transduction, soluble transporter and adhesion genes were
Preuss, Michael; König, Inke R; Thompson, John R; Erdmann, Jeanette; Absher, Devin; Assimes, Themistocles L; Blankenberg, Stefan; Boerwinkle, Eric; Chen, Li; Cupples, L Adrienne; Hall, Alistair S; Halperin, Eran; Hengstenberg, Christian; Holm, Hilma; Laaksonen, Reijo; Li, Mingyao; März, Winfried; McPherson, Ruth; Musunuru, Kiran; Nelson, Christopher P; Burnett, Mary Susan; Epstein, Stephen E; O'Donnell, Christopher J; Quertermous, Thomas; Rader, Daniel J; Roberts, Robert; Schillert, Arne; Stefansson, Kari; Stewart, Alexandre F R; Thorleifsson, Gudmar; Voight, Benjamin F; Wells, George A; Ziegler, Andreas; Kathiresan, Sekar; Reilly, Muredach P; Samani, Nilesh J; Schunkert, Heribert
Recent genome-wide association studies (GWAS) of myocardial infarction (MI) and other forms of coronary artery disease (CAD) have led to the discovery of at least 13 genetic loci. In addition to the effect size, power to detect associations is largely driven by sample size. Therefore, to maximize the chance of finding novel susceptibility loci for CAD and MI, the Coronary ARtery DIsease Genome-wide Replication And Meta-analysis (CARDIoGRAM) consortium was formed. CARDIoGRAM combines data from all published and several unpublished GWAS in individuals with European ancestry; includes >22 000 cases with CAD, MI, or both and >60 000 controls; and unifies samples from the Atherosclerotic Disease VAscular functioN and genetiC Epidemiology study, CADomics, Cohorts for Heart and Aging Research in Genomic Epidemiology, deCODE, the German Myocardial Infarction Family Studies I, II, and III, Ludwigshafen Risk and Cardiovascular Heath Study/AtheroRemo, MedStar, Myocardial Infarction Genetics Consortium, Ottawa Heart Genomics Study, PennCath, and the Wellcome Trust Case Control Consortium. Genotyping was carried out on Affymetrix or Illumina platforms followed by imputation of genotypes in most studies. On average, 2.2 million single nucleotide polymorphisms were generated per study. The results from each study are combined using meta-analysis. As proof of principle, we meta-analyzed risk variants at 9p21 and found that rs1333049 confers a 29% increase in risk for MI per copy (P=2×10⁻²⁰). CARDIoGRAM is poised to contribute to our understanding of the role of common genetic variation on risk for CAD and MI.
Full Text Available We propose a two-stage penalized logistic regression approach to case-control genome-wide association studies. This approach consists of a screening stage and a selection stage. In the screening stage, main-effect and interaction-effect features are screened by using L1-penalized logistic like-lihoods. In the selection stage, the retained features are ranked by the logistic likelihood with the smoothly clipped absolute deviation (SCAD penalty (Fan and Li, 2001 and Jeffrey’s Prior penalty (Firth, 1993, a sequence of nested candidate models are formed, and the models are assessed by a family of extended Bayesian information criteria (J. Chen and Z. Chen, 2008. The proposed approach is applied to the analysis of the prostate cancer data of the Cancer Genetic Markers of Susceptibility (CGEMS project in the National Cancer Institute, USA. Simulation studies are carried out to compare the approach with the pair-wise multiple testing approach (Marchini et al. 2005 and the LASSO-patternsearch algorithm (Shi et al. 2007.
Dugué, Pierre-Antoine; Brinkman, Maree T; Milne, Roger L; Wong, Ee Ming; FitzGerald, Liesel M; Bassett, Julie K; Joo, Jihoon E; Jung, Chol-Hee; Makalic, Enes; Schmidt, Daniel F; Park, Daniel J; Chung, Jessica; Ta, Anthony D; Bolton, Damien M; Lonie, Andrew; Longano, Anthony; Hopper, John L; Severi, Gianluca; Saffery, Richard; English, Dallas R; Southey, Melissa C; Giles, Graham G
Global DNA methylation has been reported to be associated with urothelial cell carcinoma (UCC) by studies using blood samples collected at diagnosis. Using the Illumina HumanMethylation450 assay, we derived genome-wide measures of blood DNA methylation and assessed them for their prospective association with UCC risk. We used 439 case-control pairs from the Melbourne Collaborative Cohort Study matched on age, sex, country of birth, DNA sample type, and collection period. Conditional logistic regression was used to compute odds ratios (OR) of UCC risk per s.d. of each genome-wide measure of DNA methylation and 95% confidence intervals (CIs), adjusted for potential confounders. We also investigated associations by disease subtype, sex, smoking, and time since blood collection. The risk of superficial UCC was decreased for individuals with higher levels of our genome-wide DNA methylation measure (OR=0.71, 95% CI: 0.54-0.94; P=0.02). This association was particularly strong for current smokers at sample collection (OR=0.47, 95% CI: 0.27-0.83). Intermediate levels of our genome-wide measure were associated with decreased risk of invasive UCC. Some variation was observed between UCC subtypes and the location and regulatory function of the CpGs included in the genome-wide measures of methylation. Higher levels of our genome-wide DNA methylation measure were associated with decreased risk of superficial UCC and intermediate levels were associated with reduced risk of invasive disease. These findings require replication by other prospective studies.
Stephen N White
Full Text Available BACKGROUND: Like human immunodeficiency virus (HIV, ovine lentivirus (OvLV is macrophage-tropic and causes lifelong infection. OvLV infects one quarter of U.S. sheep and induces pneumonia and body condition wasting. There is no vaccine to prevent OvLV infection and no cost-effective treatment for infected animals. However, breed differences in prevalence and proviral concentration have indicated a genetic basis for susceptibility to OvLV. A recent study identified TMEM154 variants in OvLV susceptibility. The objective here was to identify additional loci associated with odds and/or control of OvLV infection. METHODOLOGY/PRINCIPAL FINDINGS: This genome-wide association study (GWAS included 964 sheep from Rambouillet, Polypay, and Columbia breeds with serological status and proviral concentration phenotypes. Analytic models accounted for breed and age, as well as genotype. This approach identified TMEM154 (nominal P=9.2×10(-7; empirical P=0.13, provided 12 additional genomic regions associated with odds of infection, and provided 13 regions associated with control of infection (all nominal P<1 × 10(-5. Rapid decline of linkage disequilibrium with distance suggested many regions included few genes each. Genes in regions associated with odds of infection included DPPA2/DPPA4 (empirical P=0.006, and SYTL3 (P=0.051. Genes in regions associated with control of infection included a zinc finger cluster (ZNF192, ZSCAN16, ZNF389, and ZNF165; P=0.001, C19orf42/TMEM38A (P=0.047, and DLGAP1 (P=0.092. CONCLUSIONS/SIGNIFICANCE: These associations provide targets for mutation discovery in sheep susceptibility to OvLV. Aside from TMEM154, these genes have not been associated previously with lentiviral infection in any species, to our knowledge. Further, data from other species suggest functional hypotheses for future testing of these genes in OvLV and other lentiviral infections. Specifically, SYTL3 binds and may regulate RAB27A, which is required for enveloped
Full Text Available This review aimed to arrange the process of a systematic review of genome-wide association studies in order to practice and apply a genome-wide meta-analysis (GWMA. The process has a series of five steps: searching and selection, extraction of related information, evaluation of validity, meta-analysis by type of genetic model, and evaluation of heterogeneity. In contrast to intervention meta-analyses, GWMA has to evaluate the Hardy–Weinberg equilibrium (HWE in the third step and conduct meta-analyses by five potential genetic models, including dominant, recessive, homozygote contrast, heterozygote contrast, and allelic contrast in the fourth step. The ‘genhwcci’ and ‘metan’ commands of STATA software evaluate the HWE and calculate a summary effect size, respectively. A meta-regression using the ‘metareg’ command of STATA should be conducted to evaluate related factors of heterogeneities.
Shim, Sungryul; Kim, Jiyoung; Jung, Wonguen; Shin, In-Soo; Bae, Jong-Myon
This review aimed to arrange the process of a systematic review of genome-wide association studies in order to practice and apply a genome-wide meta-analysis (GWMA). The process has a series of five steps: searching and selection, extraction of related information, evaluation of validity, meta-analysis by type of genetic model, and evaluation of heterogeneity. In contrast to intervention meta-analyses, GWMA has to evaluate the Hardy-Weinberg equilibrium (HWE) in the third step and conduct meta-analyses by five potential genetic models, including dominant, recessive, homozygote contrast, heterozygote contrast, and allelic contrast in the fourth step. The 'genhwcci' and 'metan' commands of STATA software evaluate the HWE and calculate a summary effect size, respectively. A meta-regression using the 'metareg' command of STATA should be conducted to evaluate related factors of heterogeneities.
Chengming Sun; Benqi Wang; Lei Yan; Kaining Hu; Sheng Liu; Yongming Zhou; Chunyun Guan; Zhenqian Zhang; Jiana Li; Jiefu Zhang; Song Chen; Jing Wen; Chaozhi Ma; Jinxing Tu; Jinxiong Shen
Plant height is a key morphological trait of rapeseed. In this study, we measured plant height of a rapeseed population across six environments. This population contains 476 inbred lines representing the major Chinese rapeseed genepool and 44 lines from other countries. The 60K Brassica Infinium® SNP array was utilized to genotype the association panel. A genome-wide association study (GWAS) was performed via three methods, including a robust, novel, nonparametric Anderson-Darling (A-D) test....
Sarah Vanessa Hatzig; Matthias eFrisch; Frank eBreuer; Nathalie eNesi; Sylvie eDucournau; Marie-Hélène eWagner; Gunhild eLeckband; Amine eAbbadi; Snowdon, Rod J.
Rapid and uniform seed germination is a crucial prerequisite for crop establishment and high yield levels in crop production. A disclosure of genetic factors contributing to adequate seed vigour would help to further increase yield potential and stability. Here we carried out a genome-wide association study in order to define genomic regions influencing seed germination and early seedling growth in oilseed rape (Brassica napus L.). A population of 248 genetically diverse winter-type B. napus ...
Scherer Stephen W
Full Text Available Abstract Background Several statistical tests have been developed for analyzing genome-wide association data by incorporating gene pathway information in terms of gene sets. Using these methods, hundreds of gene sets are typically tested, and the tested gene sets often overlap. This overlapping greatly increases the probability of generating false positives, and the results obtained are difficult to interpret, particularly when many gene sets show statistical significance. Results We propose a flexible statistical framework to circumvent these problems. Inspired by spatial scan statistics for detecting clustering of disease occurrence in the field of epidemiology, we developed a scan statistic to extract disease-associated gene clusters from a whole gene pathway. Extracting one or a few significant gene clusters from a global pathway limits the overall false positive probability, which results in increased statistical power, and facilitates the interpretation of test results. In the present study, we applied our method to genome-wide association data for rare copy-number variations, which have been strongly implicated in common diseases. Application of our method to a simulated dataset demonstrated the high accuracy of this method in detecting disease-associated gene clusters in a whole gene pathway. Conclusions The scan statistic approach proposed here shows a high level of accuracy in detecting gene clusters in a whole gene pathway. This study has provided a sound statistical framework for analyzing genome-wide rare CNV data by incorporating topological information on the gene pathway.
Hinds, David A; Buil, Alfonso; Ziemek, Daniel; Martinez-Perez, Angel; Malik, Rainer; Folkersen, Lasse; Germain, Marine; Mälarstig, Anders; Brown, Andrew; Soria, Jose Manuel; Dichgans, Martin; Bing, Nan; Franco-Cereceda, Anders; Souto, Juan Carlos; Dermitzakis, Emmanouil T; Hamsten, Anders; Worrall, Bradford B; Tung, Joyce Y; Sabater-Lleal, Maria
Thrombotic diseases are among the leading causes of morbidity and mortality in the world. To add insights into the genetic regulation of thrombotic disease, we conducted a genome-wide association study (GWAS) of 6135 self-reported blood clots events and 252 827 controls of European ancestry belonging to the 23andMe cohort of research participants. Eight loci exceeded genome-wide significance. Among the genome-wide significant results, our study replicated previously known venous thromboembolism (VTE) loci near the F5, FGA-FGG, F11, F2, PROCR and ABO genes, and the more recently discovered locus near SLC44A2 In addition, our study reports for the first time a genome-wide significant association between rs114209171, located upstream of the F8 structural gene, and thrombosis risk. Analyses of expression profiles and expression quantitative trait loci across different tissues suggested SLC44A2, ILF3 and AP1M2 as the three most plausible candidate genes for the chromosome 19 locus, our only genome-wide significant thrombosis-related locus that does not harbor likely coagulation-related genes. In addition, we present data showing that this locus also acts as a novel risk factor for stroke and coronary artery disease (CAD). In conclusion, our study reveals novel common genetic risk factors for VTE, stroke and CAD and provides evidence that self-reported data on blood clots used in a GWAS yield results that are comparable with those obtained using clinically diagnosed VTE. This observation opens up the potential for larger meta-analyses, which will enable elucidation of the genetics of thrombotic diseases, and serves as an example for the genetic study of other diseases. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: firstname.lastname@example.org.
Fullerton, Stephanie M; Wolf, Wendy A; Brothers, Kyle B; Clayton, Ellen Wright; Crawford, Dana C; Denny, Joshua C; Greenland, Philip; Koenig, Barbara A; Leppig, Kathleen A; Lindor, Noralane M; McCarty, Catherine A; McGuire, Amy L; McPeek Hinz, Eugenia R; Mirel, Daniel B; Ramos, Erin M; Ritchie, Marylyn D; Smith, Maureen E; Waudby, Carol J; Burke, Wylie; Jarvik, Gail P
Return of individual genetic results to research participants, including participants in archives and biorepositories, is receiving increased attention. However, few groups have deliberated on specific results or weighed deliberations against relevant local contextual factors. The Electronic Medical Records and Genomics (eMERGE) Network, which includes five biorepositories conducting genome-wide association studies, convened a return of results oversight committee to identify potentially returnable results. Network-wide deliberations were then brought to local constituencies for final decision making. Defining results that should be considered for return required input from clinicians with relevant expertise and much deliberation. The return of results oversight committee identified two sex chromosomal anomalies, Klinefelter syndrome and Turner syndrome, as well as homozygosity for factor V Leiden, as findings that could warrant reporting. Views about returning findings of HFE gene mutations associated with hemochromatosis were mixed due to low penetrance. Review of electronic medical records suggested that most participants with detected abnormalities were unaware of these findings. Local considerations relevant to return varied and, to date, four sites have elected not to return findings (return was not possible at one site). The eMERGE experience reveals the complexity of return of results decision making and provides a potential deliberative model for adoption in other collaborative contexts.
Jiang, Libo; Zhang, Miaomiao; Sang, Mengmeng; Ye, Meixia; Wu, Rongling
Evo-devo is a theory proposed to study how phenotypes evolve by comparing the developmental processes of different organisms or the same organism experiencing changing environments. It has been recognized that nonallelic interactions at different genes or quantitative trait loci, known as epistasis, may play a pivotal role in the evolution of development, but it has proven difficult to quantify and elucidate this role into a coherent picture. We implement a high-dimensional genome-wide association study model into the evo-devo paradigm and pack it into the R-based Evo-Devo-EpiR, aimed at facilitating the genome-wide landscaping of epistasis for the diversification of phenotypic development. By analyzing a high-throughput assay of DNA markers and their pairs simultaneously, Evo-Devo-EpiR is equipped with a capacity to systematically characterize various epistatic interactions that impact on the pattern and timing of development and its evolution. Enabling a global search for all possible genetic interactions for developmental processes throughout the whole genome, Evo-Devo-EpiR provides a computational tool to illustrate a precise genotype-phenotype map at interface between epistasis, development and evolution. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: email@example.com.
Sun, Chengming; Wang, Benqi; Yan, Lei; Hu, Kaining; Liu, Sheng; Zhou, Yongming; Guan, Chunyun; Zhang, Zhenqian; Li, Jiana; Zhang, Jiefu; Chen, Song; Wen, Jing; Ma, Chaozhi; Tu, Jinxing; Shen, Jinxiong; Fu, Tingdong; Yi, Bin
Plant height is a key morphological trait of rapeseed. In this study, we measured plant height of a rapeseed population across six environments. This population contains 476 inbred lines representing the major Chinese rapeseed genepool and 44 lines from other countries. The 60K Brassica Infinium® SNP array was utilized to genotype the association panel. A genome-wide association study (GWAS) was performed via three methods, including a robust, novel, nonparametric Anderson-Darling (A-D) test. Consequently, 68 loci were identified as significantly associated with plant height (P 0.1), we found plausible candidates orthologous to the documented Arabidopsis genes involved in height regulation. One significant association found by GWAS colocalized with the established height locus BnRGA in rapeseed. Our results provide insights into the genetic basis of plant height in rapeseed and may facilitate marker-based breeding.
Full Text Available Plant height is a key morphological trait of rapeseed. In this study, we measured plant height of a rapeseed population across six environments. This population contains 476 inbred lines representing the major Chinese rapeseed genepool and 44 lines from other countries. The 60K Brassica Infinium® SNP array was utilized to genotype the association panel. A genome-wide association study (GWAS was performed via three methods, including a robust, novel, nonparametric Anderson-Darling (A-D test. Consequently, 68 loci were identified as significantly associated with plant height (P 0.1, we found plausible candidates orthologous to the documented Arabidopsis genes involved in height regulation. One significant association found by GWAS colocalized with the established height locus BnRGA in rapeseed. Our results provide insights into the genetic basis of plant height in rapeseed and may facilitate marker-based breeding.
Full Text Available Majority of rapeseed cultivars shatter seeds upon maturity especially under hot-dry and windy conditions, reducing yield and gross margin return to growers. Here, we identified quantitative trait loci (QTL for resistance to pod shatter in unstructured diverse panel of 143 rapeseed accessions, and two structured populations derived from bi-parental doubled haploid (DH and inter-mated (IF2 crosses derived from R1 (resistant to pod shattering and R2 (prone to pod shattering accessions. Genome-wide association analysis identified six significant QTL for resistance to pod shatter located on chromosomes A01, A06, A07, A09, C02 and C05. Two of the QTL, qSRI.A09 delimited with the SNP marker Bn-A09-p30171993 (A09 and qSRI.A06 delimited with the SNP marker Bn-A06-p115948 (A06 could be repeatedly detected across environments in diversity panel, DH and IF2 populations, suggesting that at least two loci on chromosomes A06 and A09 were the main contributors to pod shatter resistance in Chinese germplasm. Significant SNP markers identified in this study especially those appeared repeatedly across environments provide a cost-effective and an efficient method for introgression and pyramiding of favorable alleles for pod shatter resistance via marker-assisted selection in rapeseed improvement programs.
Hatzig, Sarah V; Frisch, Matthias; Breuer, Frank; Nesi, Nathalie; Ducournau, Sylvie; Wagner, Marie-Helene; Leckband, Gunhild; Abbadi, Amine; Snowdon, Rod J
Rapid and uniform seed germination is a crucial prerequisite for crop establishment and high yield levels in crop production. A disclosure of genetic factors contributing to adequate seed vigor would help to further increase yield potential and stability. Here we carried out a genome-wide association study in order to define genomic regions influencing seed germination and early seedling growth in oilseed rape (Brassica napus L.). A population of 248 genetically diverse winter-type B. napus accessions was genotyped with the Brassica 60k SNP Illumina genotyping array. Automated high-throughput in vitro phenotyping provided extensive data for multiple traits related to germination and early vigor, such as germination speed, absolute germination rate and radicle elongation. The data obtained indicate that seed germination and radicle growth are strongly environmentally dependent, but could nevertheless be substantially improved by genomic-based breeding. Conditions during seed production and storage were shown to have a profound effect on seed vigor, and a variable manifestation of seed dormancy appears to contribute to differences in germination performance in B. napus. Several promising positional and functional candidate genes could be identified within the genomic regions associated with germination speed, absolute germination rate, radicle growth and thousand seed weight. These include B. napus orthologs of the Arabidopsis thaliana genes SNOWY COTYLEDON 1 (SCO1), ARABIDOPSIS TWO-COMPONENT RESPONSE REGULATOR (ARR4), and ARGINYL-t-RNA PROTEIN TRANSFERASE 1 (ATE1), which have been shown previously to play a role in seed germination and seedling growth in A. thaliana.
Sarah Vanessa Hatzig
Full Text Available Rapid and uniform seed germination is a crucial prerequisite for crop establishment and high yield levels in crop production. A disclosure of genetic factors contributing to adequate seed vigour would help to further increase yield potential and stability. Here we carried out a genome-wide association study in order to define genomic regions influencing seed germination and early seedling growth in oilseed rape (Brassica napus L.. A population of 248 genetically diverse winter-type B. napus accessions was genotyped with the Brassica 60kSNP Illumina genotyping array. Automated high-throughput in vitro phenotyping provided extensive data for multiple traits related to germination and early vigour, such as germination speed, absolute germination rate and radicle elongation. The data obtained indicate that seed germination and radicle growth are strongly environmentally dependent, but could nevertheless be substantially improved by genomic-based breeding,. Conditions during seed production and storage were shown to have a profound effect on seed vigour, and a variable manifestation of seed dormancy appears to contribute to differences in germination performance in B. napus. Several promising positional and functional candidate genes could be identified within the genomic regions associated with germination speed, absolute germination rate, radicle growth and thousand seed weight. These include B. napus orthologues of the Arabidopsis thaliana genes SNOWY COTYLEDON 1 (SCO1, ARABIDOPSIS TWO-COMPONENT RESPONSE REGULATOR (ARR4 and ARGINYL-t-RNA PROTEIN TRANSFERASE 1 (ATE1, which have been shown previously to play a role in seed germination and seedling growth in A. thaliana.
Bauchet, Guillaume; Grenier, Stéphane; Samson, Nicolas; Bonnet, Julien; Grivet, Laurent; Causse, Mathilde
A panel of 300 tomato accessions including breeding materials was built and characterized with >11,000 SNP. A population structure in six subgroups was identified. Strong heterogeneity in linkage disequilibrium and recombination landscape among groups and chromosomes was shown. GWAS identified several associations for fruit weight, earliness and plant growth. Genome-wide association studies (GWAS) have become a method of choice in quantitative trait dissection. First limited to highly polymorphic and outcrossing species, it is now applied in horticultural crops, notably in tomato. Until now GWAS in tomato has been performed on panels of heirloom and wild accessions. Using modern breeding materials would be of direct interest for breeding purpose. To implement GWAS on a large panel of 300 tomato accessions including 168 breeding lines, this study assessed the genetic diversity and linkage disequilibrium decay and revealed the population structure and performed GWA experiment. Genetic diversity and population structure analyses were based on molecular markers (>11,000 SNP) covering the whole genome. Six genetic subgroups were revealed and associated to traits of agronomical interest, such as fruit weight and disease resistance. Estimates of linkage disequilibrium highlighted the heterogeneity of its decay among genetic subgroups. Haplotype definition allowed a fine characterization of the groups and their recombination landscape revealing the patterns of admixture along the genome. Selection footprints showed results in congruence with introgressions. Taken together, all these elements refined our knowledge of the genetic material included in this panel and allowed the identification of several associations for fruit weight, plant growth and earliness, deciphering the genetic architecture of these complex traits and identifying several new loci useful for tomato breeding.
Andersen, Vibeke; Ernst, Anja; Sventoraityte, Jurgita; Kupcinskas, Limas; Jacobsen, Bent A; Krarup, Henrik B; Vogel, Ulla; Jonaitis, Laimas; Denapiene, Goda; Kiudelis, Gediminas; Balschun, Tobias; Franke, Andre
Differences in the genetic architecture of inflammatory bowel disease between different European countries and ethnicities have previously been reported. In the present study, we wanted to assess the role of 11 newly identified UC risk variants, derived from a recent European UC genome wide association study (GWAS) (Franke et al., 2010), for 1) association with UC in the Nordic countries, 2) for population heterogeneity between the Nordic countries and the rest of Europe, and, 3) eventually, to drive some of the previous findings towards overall genome-wide significance. Eleven SNPs were replicated in a Danish sample consisting of 560 UC patients and 796 controls and nine missing SNPs of the German GWAS study were successfully genotyped in the Baltic sample comprising 441 UC cases and 1156 controls. The independent replication data was then jointly analysed with the original data and systematic comparisons of the findings between ethnicities were made. Pearson's χ2, Breslow-Day (BD) and Cochran-Mantel-Haenszel (CMH) tests were used for association analyses and heterogeneity testing. The rs5771069 (IL17REL) SNP was not associated with UC in the Danish panel. The rs5771069 (IL17REL) SNP was significantly associated with UC in the combined Baltic, Danish and Norwegian UC study sample driven by the Norwegian panel (OR = 0.89, 95% CI: 0.79-0.98, P = 0.02). No association was found between rs7809799 (SMURF1/KPNA7) and UC (OR = 1.20, 95% CI: 0.95-1.52, P = 0.10) or between UC and all other remaining SNPs. We had 94% chance of detecting an association for rs7809799 (SMURF1/KPNA7) in the combined replication sample, whereas the power were 55% or lower for the remaining SNPs.Statistically significant PBD was found for OR heterogeneity between the combined Baltic, Danish, and Norwegian panel versus the combined German, British, Belgian, and Greek panel (rs7520292 (P = 0.001), rs12518307 (P = 0.007), and rs2395609 (TCP11) (P = 0.01), respectively).No SNP reached genome-wide
Hinds, David A; Buil, Alfonso; Ziemek, Daniel
belonging to the 23andMe cohort of research participants. Eight loci exceeded genome-wide significance. Among the genome-wide significant results, our study replicated previously known venous thromboembolism (VTE) loci near the F5, FGA-FGG, F11, F2, PROCR and ABO genes, and the more recently discovered...
Vaithilingam, R D; Safii, S H; Baharuddin, N A; Ng, C C; Cheong, S C; Bartold, P M; Schaefer, A S; Loos, B G
Studies to elucidate the role of genetics as a risk factor for periodontal disease have gone through various phases. In the majority of cases, the initial 'hypothesis-dependent' candidate-gene polymorphism studies did not report valid genetic risk loci. Following a large-scale replication study, these initially positive results are believed to be caused by type 1 errors. However, susceptibility genes, such as CDKN2BAS (Cyclin Dependend KiNase 2B AntiSense RNA; alias ANRIL [ANtisense Rna In the Ink locus]), glycosyltransferase 6 domain containing 1 (GLT6D1) and cyclooxygenase 2 (COX2), have been reported as conclusive risk loci of periodontitis. The search for genetic risk factors accelerated with the advent of 'hypothesis-free' genome-wide association studies (GWAS). However, despite many different GWAS being performed for almost all human diseases, only three GWAS on periodontitis have been published - one reported genome-wide association of GLT6D1 with aggressive periodontitis (a severe phenotype of periodontitis), whereas the remaining two, which were performed on patients with chronic periodontitis, were not able to find significant associations. This review discusses the problems faced and the lessons learned from the search for genetic risk variants of periodontitis. Current and future strategies for identifying genetic variance in periodontitis, and the importance of planning a well-designed genetic study with large and sufficiently powered case-control samples of severe phenotypes, are also discussed. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Full Text Available Plant architecture is vital not only for crop yield, but also for field management, such as mechanical harvesting. The branch angle is one of the key factors determining plant architecture. With the aim of revealing the genetic control underlying branch angle in rapeseed (Brassica napus L., the positional variation of branch angles on individual plants was evaluated, and the branch angle increased with the elevation of branch position. Furthermore, three middle branches of individual plants were selected to measure the branch angle because they exhibited the most representative phenotypic values. An association panel with 472 diverse accessions was estimated for branch angle trait in six environments and genotyped with a 60K Brassica Infinium® SNP array. As a result of association mapping, 46 and 38 significantly-associated loci were detected using a mixed linear model (MLM and a multi-locus random-SNP-effect mixed linear model (MRMLM, which explained up to 62.2 and 66.2% of the cumulative phenotypic variation, respectively. Numerous highly-promising candidate genes were identified by annotating against Arabidopsis thaliana homologous, including some first found in rapeseed, such as TAC1, SGR1, SGR3, and SGR5. These findings reveal the genetic control underlying branch angle and provide insight into genetic improvements that are possible in the plant architecture of rapeseed.
Li, Hongge; Zhang, Liping; Hu, Jihong; Zhang, Fugui; Chen, Biyun; Xu, Kun; Gao, Guizhen; Li, Hao; Zhang, Tianyao; Li, Zaiyun; Wu, Xiaoming
Plant architecture is vital not only for crop yield, but also for field management, such as mechanical harvesting. The branch angle is one of the key factors determining plant architecture. With the aim of revealing the genetic control underlying branch angle in rapeseed (Brassica napus L.), the positional variation of branch angles on individual plants was evaluated, and the branch angle increased with the elevation of branch position. Furthermore, three middle branches of individual plants were selected to measure the branch angle because they exhibited the most representative phenotypic values. An association panel with 472 diverse accessions was estimated for branch angle trait in six environments and genotyped with a 60K Brassica Infinium® SNP array. As a result of association mapping, 46 and 38 significantly-associated loci were detected using a mixed linear model (MLM) and a multi-locus random-SNP-effect mixed linear model (MRMLM), which explained up to 62.2 and 66.2% of the cumulative phenotypic variation, respectively. Numerous highly-promising candidate genes were identified by annotating against Arabidopsis thaliana homologous, including some first found in rapeseed, such as TAC1, SGR1, SGR3, and SGR5. These findings reveal the genetic control underlying branch angle and provide insight into genetic improvements that are possible in the plant architecture of rapeseed.
Quaye, Lydia; Nicolaou, Nicos; Shane, Scott; Mangino, Massimo
To identify specific genetic variants influencing the phenotype of entrepreneurship, we conducted a genome-wide association study (GWAS) with 3,933 Caucasian females from the TwinsUK Adult Twin Registry. Following stringent genotype quality control, GWAF (genome-wide association analyses for family data) software was used to assess the association…
Jiang, Wei; Yu, Weichuan
In genome-wide association studies (GWASs) of common diseases/traits, we often analyze multiple GWASs with the same phenotype together to discover associated genetic variants with higher power. Since it is difficult to access data with detailed individual measurements, summary-statistics-based meta-analysis methods have become popular to jointly analyze datasets from multiple GWASs. In this paper, we propose a novel summary-statistics-based joint analysis method based on controlling the joint local false discovery rate (Jlfdr). We prove that our method is the most powerful summary-statistics-based joint analysis method when controlling the false discovery rate at a certain level. In particular, the Jlfdr-based method achieves higher power than commonly used meta-analysis methods when analyzing heterogeneous datasets from multiple GWASs. Simulation experiments demonstrate the superior power of our method over meta-analysis methods. Also, our method discovers more associations than meta-analysis methods from empirical datasets of four phenotypes. The R-package is available at: http://bioinformatics.ust.hk/Jlfdr.html . firstname.lastname@example.org. Supplementary data are available at Bioinformatics online.
Zheng, Ming; Peng, Cheng; Liu, Hongfang; Tang, Min; Yang, Hongli; Li, Xiaokang; Liu, Jinglin; Sun, Xingchao; Wang, Xinfa; Xu, Junfeng; Hua, Wei; Wang, Hanzhong
Plant architecture is crucial for rapeseed yield and is determined by plant height (PH), branch initiation height (BIH), branch number (BN) and leaf and inflorescence morphology. In this study, we measured three major factors (PH, BIH, and BN) in a panel of 333 rapeseed accessions across 4 years. A genome-wide association study (GWAS) was performed via Q + K model and the panel was genotyped using the 60 k Brassica Infinium SNP array. We identified seven loci for PH, four for BIH, and five for BN. Subsequently, by determining linkage disequilibrium (LD) decay associated with 38 significant SNPs, we gained 31, 15, and 17 candidate genes for these traits, respectively. We also showed that PH is significantly correlated with BIH, while no other correlation was revealed. Notably, a GA signaling gene (BnRGA) and a flowering gene (BnFT) located on chromosome A02 were identified as the most likely candidate genes associated with PH regulation. Furthermore, a meristem initiation gene (BnLOF2) and a NAC domain transcriptional factor (BnCUC3) that may be associated with BN were identified on the chromosome A07. This study reveals novel insight into the genetic control of plant architecture and may facilitate marker-based breeding for rapeseed.
Full Text Available Plant architecture is crucial for rapeseed yield and is determined by plant height (PH, branch initiation height (BIH, branch number (BN and leaf and inflorescence morphology. In this study, we measured three major factors (PH, BIH, and BN in a panel of 333 rapeseed accessions across 4 years. A genome-wide association study (GWAS was performed via Q + K model and the panel was genotyped using the 60 k Brassica Infinium SNP array. We identified seven loci for PH, four for BIH, and five for BN. Subsequently, by determining linkage disequilibrium (LD decay associated with 38 significant SNPs, we gained 31, 15, and 17 candidate genes for these traits, respectively. We also showed that PH is significantly correlated with BIH, while no other correlation was revealed. Notably, a GA signaling gene (BnRGA and a flowering gene (BnFT located on chromosome A02 were identified as the most likely candidate genes associated with PH regulation. Furthermore, a meristem initiation gene (BnLOF2 and a NAC domain transcriptional factor (BnCUC3 that may be associated with BN were identified on the chromosome A07. This study reveals novel insight into the genetic control of plant architecture and may facilitate marker-based breeding for rapeseed.
Shu, Xiaoli; Rasmussen, Søren K.
Genome-wide association studies (GWAS) for amylose, amylopectin and β-glucan concentration in a collection of 254 European spring barley varieties allowed to identify 20, 17, and 21 single nucleotide polymorphic (SNP) markers, respectively, associated with these important grain quality traits. Negative correlations between the content of amylose and β-glucan (R = −0.62, P < 0.01) and amylopectin and β-glucan (R = −0.487, P < 0.01) were found in this large collection of spring barley varieties. Besides HvCslF6, amo1 and AGPL2, sex6, and waxy were identified among the major genes responsible for β-glucan, amylose and amylopectin content, respectively. Several minor genes like HvGSL4, HvGSL3, and HvCesA6, PWD were also detected by GWAS for the first time. Furthermore, the gene encoding β-fructofuranosidase, located on the short arm of chromosome 7H at 1.49 cM, and SRF6, encoding “leucine-rich repeat receptor kinase protein” on chromosome 2 H, are proposed to be new candidate genes for amylopectin formation in barley endosperm. Several of the associated SNPs on chromosome 1, 5, 6, and 7H mapped to overlapping regions containing QTLs and genes controlling the three grain constituents. In particular chromosomes 5 and 7H carry many QTLs controlling barley grain quality. Amylose, amylopectin and β-glucan were interacted among each other through a metabolic network connected by UDP showing pleiotropic effects. Taken together, these results showed that cereal quality traits related each other and regulated through an interaction network, the identified major genes and genetic regions for amylose, amylopectin and β-glucan is a helpful for further research on carbohydrates and barley breeding. PMID:24860587
Shu, Xiaoli; Rasmussen, Søren Kjærsgaard
Genome-wide association studies (GWAS) for amylose, amylopectin and β-glucan concentration in a collection of 254 European spring barley varieties allowed to identify 20, 17 and 21 single nucleotide polymorphic (SNP) markers, respectively, associated with these important grain quality traits. Neg...
Liu, Li; Pertsemlidis, Alexander; Ding, Liang-Hao; Story, Michael D; Steinberg, Martin H; Sebastiani, Paola; Hoppe, Carolyn; Ballas, Samir K; Pace, Betty S
Sickle cell disease (SCD) is a group of inherited blood disorders that have in common a mutation in the sixth codon of the β-globin (HBB) gene on chromosome 11. However, people with the same genetic mutation display a wide range of clinical phenotypes. Fetal hemoglobin (HbF) expression is an important genetic modifier of SCD complications leading to milder symptoms and improved long-term survival. Therefore, we performed a genome-wide association study (GWAS) using a case-control experimental design in 244 African Americans with SCD to discover genetic factors associated with HbF expression. The case group consisted of subjects with HbF≥8.6% (133 samples) and control group subjects with HbF≤£3.1% (111 samples). Our GWAS results replicated SNPs previously identified in an erythroid-specific enhancer region located in the second intron of the BCL11A gene associated with HbF expression. In addition, we identified SNPs in the SPARC, GJC1, EFTUD2 and JAZF1 genes as novel candidates associated with HbF levels. To gain insights into mechanisms of globin gene regulation in the HBB locus, linkage disequilibrium (LD) and haplotype analyses were conducted. We observed strong LD in the low HbF group in contrast to a loss of LD and greater number of haplotypes in the high HbF group. A search of known HBB locus regulatory elements identified SNPs 5' of δ-globin located in an HbF silencing region. In particular, SNP rs4910736 created a binding site for a known transcription repressor GFi1 which is a candidate protein for further investigation. Another HbF-associated SNP, rs2855122 in the cAMP response element upstream of Gγ-globin, was analyzed for functional relevance. Studies performed with siRNA-mediated CREB binding protein (CBP) knockdown in primary erythroid cells demonstrated γ-globin activation and HbF induction, supporting a repressor role for CBP. This study identifies possible molecular determinants of HbF production. © 2016 by the Society for Experimental
Knudsen, Gitte Maegaard; Nielsen, Jesper Boye; Marvig, Rasmus Lykke
of 80 isolates of Listeria monocytogenes sampled from Danish food processing plants over a time-period of 20 years, and analyzed the sequences together with 10 public available reference genomes to advance our understanding of inter- and intra-plant genomic diversity of L. monocytogenes. Except......Whole genome sequencing is increasing used in epidemiology, e.g. for tracing outbreaks of food-borne diseases. This requires in-depth understanding of pathogen emergence, persistence, and genomic diversity along the food production chain including in food processing plants. We sequenced the genomes...
Giampetruzzi, Annalisa; Saponari, Maria; Loconsole, Giuliana; Boscia, Donato; Savino, Vito Nicola; Almeida, Rodrigo P P; Zicca, Stefania; Landa, Blanca B; Chacón-Diaz, Carlos; Saldarelli, Pasquale
Xylella fastidiosa is a plant-pathogenic bacterium recently introduced in Europe that is causing decline in olive trees in the South of Italy. Genetic studies have consistently shown that the bacterial genotype recovered from infected olive trees belongs to the sequence type ST53 within subspecies pauca. This genotype, ST53, has also been reported to occur in Costa Rica. The ancestry of ST53 was recently clarified, showing it contains alleles that are monophyletic with those of subsp. pauca in South America. To more robustly determine the phylogenetic placement of ST53 within X. fastidiosa, we performed a comparative analysis based on single nucleotide polymorphisms (SNPs) and the study of the pan-genome of the 27 currently public available whole genome sequences of X. fastidiosa. The resulting maximum-parsimony and maximum likelihood trees constructed using the SNPs and the pan-genome analysis are consistent with previously described X. fastidiosa taxonomy, distinguishing the subsp. fastidiosa, multiplex, pauca, sandyi, and morus. Within the subsp. pauca, the Italian and three Costa Rican isolates, all belonging to ST53, formed a compact phylotype in a clade divergent from the South American pauca isolates, also distinct from the recently described coffee isolate CFBP8072 imported into Europe from Ecuador. These findings were also supported by the gene characterization of a conjugative plasmid shared by all the four ST53 isolates. Furthermore, isolates of the ST53 clade possess an exclusive locus encoding a putative ATP-binding protein belonging to the family of histidine kinase-like ATPase gene, which is not present in isolates from the subspecies multiplex, sandyi, and pauca, but was detected in ST21 isolates of the subspecies fastidiosa from Costa Rica. The clustering and distinctiveness of the ST53 isolates supports the hypothesis of their common origin, and the limited genetic diversity among these isolates suggests this is an emerging clade within subsp
Schellenberg Gerard D
Full Text Available Abstract Background Alzheimer's disease (AD is common and highly heritable with many genes and gene variants associated with AD in one or more studies, including APOE ε2/ε3/ε4. However, the genetic backgrounds for normal cognition, mild cognitive impairment (MCI and AD in terms of changes in cerebrospinal fluid (CSF levels of Aβ1-42, T-tau, and P-tau181P, have not been clearly delineated. We carried out a genome-wide association study (GWAS in order to better define the genetic backgrounds to these three states in relation to CSF levels. Methods Subjects were participants in the Alzheimer's Disease Neuroimaging Initiative (ADNI. The GWAS dataset consisted of 818 participants (mainly Caucasian genotyped using the Illumina Human Genome 610 Quad BeadChips. This sample included 410 subjects (119 Normal, 115 MCI and 176 AD with measurements of CSF Aβ1-42, T-tau, and P-tau181P Levels. We used PLINK to find genetic associations with the three CSF biomarker levels. Association of each of the 498,205 SNPs was tested using additive, dominant, and general association models while considering APOE genotype and age. Finally, an effort was made to better identify relevant biochemical pathways for associated genes using the ALIGATOR software. Results We found that there were some associations with APOE genotype although CSF levels were about the same for each subject group; CSF Aβ1-42 levels decreased with APOE gene dose for each subject group. T-tau levels tended to be higher among AD cases than among normal subjects. From adjusted result using APOE genotype and age as covariates, no SNP was associated with CSF levels among AD subjects. CYP19A1 'aromatase' (rs2899472, NCAM2, and multiple SNPs located on chromosome 10 near the ARL5B gene demonstrated the strongest associations with Aβ1-42 in normal subjects. Two genes found to be near the top SNPs, CYP19A1 (rs2899472, p = 1.90 × 10-7 and NCAM2 (rs1022442, p = 2.75 × 10-7 have been reported as genetic
Kelemen, Linda E; Lawrenson, Kate; Tyrer, Jonathan
Genome-wide association studies have identified several risk associations for ovarian carcinomas but not for mucinous ovarian carcinomas (MOCs). Our analysis of 1,644 MOC cases and 21,693 controls with imputation identified 3 new risk associations: rs752590 at 2q13 (P = 3.3 × 10(-8)), rs711830 at...
We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10(-11)) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10(-9)), ANK3 (rs10994359, P = 2.5 × 10(-8)) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10(-9)).
Daetwyler, H.D.; Villanueva, B.; Bijma, P.; Woolliams, J.A.
Traditional selection methods, such as sib and best linear unbiased prediction (BLUP) selection, which increased genetic gain by increasing accuracy of evaluation have also led to an increased rate of inbreeding per generation (¿FG). This is not necessarily the case with genome-wide selection, which
C4BPB/C4BPA is a new susceptibility locus for venous thrombosis with unknown protein S-independent mechanism: results from genome-wide association and gene expression analyses followed by case-control studies.
Buil, Alfonso; Trégouët, David-Alexandre; Souto, Juan Carlos; Saut, Noémie; Germain, Marine; Rotival, Maxime; Tiret, Laurence; Cambien, Françcois; Lathrop, Mark; Zeller, Tanja; Alessi, Marie-Christine; Rodriguez de Cordoba, Santiago; Münzel, Thomas; Wild, Philipp; Fontcuberta, Jordi; Gagnon, France; Emmerich, Joseph; Almasy, Laura; Blankenberg, Stefan; Soria, José-Manuel; Morange, Pierre-Emmanuel
Through its binding with protein S (PS), a key element of the coagulation/fibrinolysis cascade, the C4b-binding protein (C4BP) has been hypothesized to be involved in the susceptibility to venous thrombosis (VT). To identify genetic factors that may influence the plasma levels of the 3 C4BP existing isoforms, alpha(7)beta(1), alpha(6)beta(1), and alpha(7)beta(0), we conducted a genome-wide association study by analyzing 283 437 single nucleotide polymorphisms (SNPs) in the Genetic Analysis of Idiopathic Thrombophilia (GAIT) study composed of 352 persons. Three SNPs at the C4BPB/C4BPA locus were found genome-wide significantly associated with alpha(7)beta(0) levels. One of these SNPs was further found to explain approximately 11% of the variability of mRNA C4BPA expression in the Gutenberg Heart Study composed of 1490 persons, with no effect on C4BPB mRNA expression. The allele associated with increased alpha(7)beta(0) plasma levels and increased C4BPA expression was further found associated with increased risk of VT (odds ratio [OR] = 1.24 [1.03-1.53]) in 2 independent case-control studies (MARseille THrombosis Association study [MARTHA] and FActeurs de RIsque et de récidives de la maladie thromboembolique VEineuse [FARIVE]) gathering 1706 cases and 1379 controls. This SNP was not associated with free PS or total PS. In conclusion, we observed strong evidence that the C4BPB/C4BPA locus is a new susceptibility locus for VT through a PS-independent mechanism that remains to be elucidated.
Pappa, I.; St Pourcain, B.; Benke, K.S.; Cavadino, A.; Hakulinen, C.; Nivard, M.G.; Nolte, I.M.; Tiesler, C.M.T.; Bakermans-Kranenburg, M.J.; Davies, G.E.; Evans, D.M.; Geoffroy, M.C.; Grallert, H.; Blokhuis, M.M.; Hudziak, J.J.; Kemp, J.P.; Keltikangas-Järvinen, L.; McMahon, G.; Mileva-Seitz, V.R.; Motazedi, E.; Power, C.; Raitakari, O.T.; Ring, S.M.; Rivadeneira, F.; Rodriguez, A.; Scheet, P.; Seppälä, I.; Snieder, H.; Standl, M.; Thiering, E.; Timpson, N.J.; Veenstra, R.; Velders, F.P.; Whitehouse, A.J.O.; Davey Smith, G.; Heinrich, J.; Hypponen, E.; Lehtimäki, T.; Middeldorp, C.M.; Oldehinkel, A.J.; Pennell, C.E.; Boomsma, D.I.; Tiemeier, H.
Individual differences in aggressive behavior emerge in early childhood and predict persisting behavioral problems and disorders. Studies of antisocial and severe aggression in adulthood indicate substantial underlying biology. However, little attention has been given to genome-wide approaches of
Waage, Johannes; Baurecht, Hansjörg; Hotze, Melanie; Strachan, David P; Curtin, John A; Bønnelykke, Klaus; Tian, Chao; Takahashi, Atsushi; Esparza-Gordillo, Jorge; Alves, Alexessander Couto; Thyssen, Jacob P; den Dekker, Herman T; Ferreira, Manuel A; Altmaier, Elisabeth; Sleiman, Patrick MA; Xiao, Feng Li; Gonzalez, Juan R; Marenholz, Ingo; Kalb, Birgit; Yanes, Maria Pino; Xu, Cheng-Jian; Carstensen, Lisbeth; Groen-Blokhuis, Maria M; Venturini, Cristina; Pennell, Craig E; Barton, Sheila J; Levin, Albert M; Curjuric, Ivan; Bustamante, Mariona; Kreiner-Møller, Eskil; Lockett, Gabrielle A; Bacelis, Jonas; Bunyavanich, Supinda; Myers, Rachel A; Matanovic, Anja; Kumar, Ashish; Tung, Joyce Y; Hirota, Tomomitsu; Kubo, Michiaki; McArdle, Wendy L; Henderson, A J; Kemp, John P; Zheng, Jie; Smith, George Davey; Rüschendorf, Franz; Bauerfeind, Anja; Lee-Kirsch, Min Ae; Arnold, Andreas; Homuth, Georg; Schmidt, Carsten O; Mangold, Elisabeth; Cichon, Sven; Keil, Thomas; Rodríguez, Elke; Peters, Annette; Franke, Andre; Lieb, Wolfgang; Novak, Natalija; Fölster-Holst, Regina; Horikoshi, Momoko; Pekkanen, Juha; Sebert, Sylvain; Husemoen, Lise L; Grarup, Niels; de Jongste, Johan C; Rivadeneira, Fernando; Hofman, Albert; Jaddoe, Vincent WV; Pasmans, Suzanne GMA; Elbert, Niels J; Uitterlinden, André G; Marks, Guy B; Thompson, Philip J; Matheson, Melanie C; Robertson, Colin F; Ried, Janina S; Li, Jin; Zuo, Xian Bo; Zheng, Xiao Dong; Yin, Xian Yong; Sun, Liang Dan; McAleer, Maeve A; O'Regan, Grainne M; Fahy, Caoimhe MR; Campbell, Linda E; Macek, Milan; Kurek, Michael; Hu, Donglei; Eng, Celeste; Postma, Dirkje S; Feenstra, Bjarke; Geller, Frank; Hottenga, Jouke Jan; Middeldorp, Christel M; Hysi, Pirro; Bataille, Veronique; Spector, Tim; Tiesler, Carla MT; Thiering, Elisabeth; Pahukasahasram, Badri; Yang, James J; Imboden, Medea; Huntsman, Scott; Vilor-Tejedor, Natàlia; Relton, Caroline L; Myhre, Ronny; Nystad, Wenche; Custovic, Adnan; Weiss, Scott T; Meyers, Deborah A; Söderhäll, Cilla; Melén, Erik; Ober, Carole; Raby, Benjamin A; Simpson, Angela; Jacobsson, Bo; Holloway, John W; Bisgaard, Hans; Sunyer, Jordi; Hensch, Nicole M Probst; Williams, L Keoki; Godfrey, Keith M; Wang, Carol A; Boomsma, Dorret I; Melbye, Mads; Koppelman, Gerard H; Jarvis, Deborah; McLean, WH Irwin; Irvine, Alan D; Zhang, Xue Jun; Hakonarson, Hakon; Gieger, Christian; Burchard, Esteban G; Martin, Nicholas G; Duijts, Liesbeth; Linneberg, Allan; Jarvelin, Marjo-Riitta; Noethen, Markus M; Lau, Susanne; Hübner, Norbert; Lee, Young-Ae; Tamari, Mayumi; Hinds, David A; Glass, Daniel; Brown, Sara J; Heinrich, Joachim; Evans, David M; Weidinger, Stephan
Genetic association studies have identified 21 loci associated with atopic dermatitis risk predominantly in populations of European ancestry. To identify further susceptibility loci for this common complex skin disease, we performed a meta-analysis of >15 million genetic variants in 21,399 cases and 95,464 controls from populations of European, African, Japanese and Latino ancestry, followed by replication in 32,059 cases and 228,628 controls from 18 studies. We identified 10 novel risk loci, bringing the total number of known atopic dermatitis risk loci to 31 (with novel secondary signals at 4 of these). Notably, the new loci include candidate genes with roles in regulation of innate host defenses and T-cell function, underscoring the important contribution of (auto-)immune mechanisms to atopic dermatitis pathogenesis. PMID:26482879
William S Bush
Full Text Available Genome-wide association studies (GWAS have evolved over the last ten years into a powerful tool for investigating the genetic architecture of human disease. In this work, we review the key concepts underlying GWAS, including the architecture of common diseases, the structure of common human genetic variation, technologies for capturing genetic information, study designs, and the statistical methods used for data analysis. We also look forward to the future beyond GWAS.
Rauch, Alexander; Mandrup, Susanne
number of technologies that can be used to obtain genome-wide insight into how genomes are reprogrammed during development and in response to specific external signals. By applying such technologies, we have begun to reveal the cross-talk between metabolism and the genome, i.e., how genomes...... are reprogrammed in response to metabolites, and how the regulation of metabolic networks is coordinated at the genomic level....
Yan, Fang; Yu, Yiyang; Gozzi, Kevin; Chen, Yun; Guo, Jian-Hua; Chai, Yunrong
Bacillus cereus is a soil-dwelling Gram-positive bacterium capable of forming structured multicellular communities, or biofilms. However, the regulatory pathways controlling biofilm formation are less well understood in B. cereus In this work, we developed a method to study B. cereus biofilms formed at the air-liquid interface. We applied two genome-wide approaches, random transposon insertion mutagenesis to identify genes that are potentially important for biofilm formation, and transcriptome analyses by RNA sequencing (RNA-seq) to characterize genes that are differentially expressed in B. cereus when cells were grown in a biofilm-inducing medium. For the first approach, we identified 23 genes whose disruption by transposon insertion led to altered biofilm phenotypes. Based on the predicted function, they included genes involved in processes such as nucleotide biosynthesis, iron salvage, and antibiotic production, as well as genes encoding an ATP-dependent protease and transcription regulators. Transcriptome analyses identified about 500 genes that were differentially expressed in cells grown under biofilm-inducing conditions. One particular set of those genes may contribute to major metabolic shifts, leading to elevated production of small volatile molecules. Selected volatile molecules were shown to stimulate robust biofilm formation in B. cereus Our studies represent a genome-wide investigation of B. cereus biofilm formation.IMPORTANCE In this work, we established a robust method for B. cereus biofilm studies and applied two genome-wide approaches, transposon insertion mutagenesis and transcriptome analyses by RNA-seq, to identify genes and pathways that are potentially important for biofilm formation in B. cereus We discovered dozens of genes and two major metabolic shifts that seem to be important for biofilm formation in B. cereus Our study represents a genome-wide investigation on B. cereus biofilm formation. Copyright © 2017 American Society for
Lang, M; Leménager, T; Streit, F; Fauth-Bühler, M; Frank, J; Juraeva, D; Witt, S H; Degenhardt, F; Hofmann, A; Heilmann-Heimbach, S; Kiefer, F; Brors, B; Grabe, H-J; John, U; Bischof, A; Bischof, G; Völker, U; Homuth, G; Beutel, M; Lind, P A; Medland, S E; Slutske, W S; Martin, N G; Völzke, H; Nöthen, M M; Meyer, C; Rumpf, H-J; Wurst, F M; Rietschel, M; Mann, K F
Pathological gambling is a behavioural addiction with negative economic, social, and psychological consequences. Identification of contributing genes and pathways may improve understanding of aetiology and facilitate therapy and prevention. Here, we report the first genome-wide association study of pathological gambling. Our aims were to identify pathways involved in pathological gambling, and examine whether there is a genetic overlap between pathological gambling and alcohol dependence. Four hundred and forty-five individuals with a diagnosis of pathological gambling according to the Diagnostic and Statistical Manual of Mental Disorders were recruited in Germany, and 986 controls were drawn from a German general population sample. A genome-wide association study of pathological gambling comprising single marker, gene-based, and pathway analyses, was performed. Polygenic risk scores were generated using data from a German genome-wide association study of alcohol dependence. No genome-wide significant association with pathological gambling was found for single markers or genes. Pathways for Huntington's disease (P-value=6.63×10(-3)); 5'-adenosine monophosphate-activated protein kinase signalling (P-value=9.57×10(-3)); and apoptosis (P-value=1.75×10(-2)) were significant. Polygenic risk score analysis of the alcohol dependence dataset yielded a one-sided nominal significant P-value in subjects with pathological gambling, irrespective of comorbid alcohol dependence status. The present results accord with previous quantitative formal genetic studies which showed genetic overlap between non-substance- and substance-related addictions. Furthermore, pathway analysis suggests shared pathology between Huntington's disease and pathological gambling. This finding is consistent with previous imaging studies. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Gupta, Vipin; Khadgawat, Rajesh; Sachdeva, Mohinder Pal
The successful advent of a genome-wide approach in association studies raises the hopes of human geneticists for solving a genetic maze of complex traits especially the disorders. This approach, which is replete with the application of cutting-edge technology and supported by big science projects (like Human Genome Project; and even more importantly the International HapMap Project) and various important databases (SNP database, CNV database, etc.), has had unprecedented success in rapidly uncovering many of the genetic determinants of complex disorders. The magnitude of this approach in the genetics of classical anthropological variables like height, skin color, eye color, and other genome diversity projects has certainly expanded the horizons of molecular anthropology. Therefore, in this article we have proposed a genome-wide association approach in molecular anthropological studies by providing lessons from the exemplary study of the Wellcome Trust Case Control Consortium. We have also highlighted the importance and uniqueness of Indian population groups in facilitating the design and finding optimum solutions for other genome-wide association-related challenges.
Ripke, Stephan; Sanders, Alan R; Kendler, Kenneth S
in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10(-9)), ANK3 (rs10994359, P = 2.5 × 10(-8)) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10(-9)).......We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis...... an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism...
Stewart, S Evelyn; Yu, Dongmei; Scharf, Jeremiah M; Neale, Benjamin M; Fagerness, Jesen A; Mathews, Carol A; Arnold, Paul D; Evans, Patrick D; Gamazon, Eric R; Osiecki, Lisa; McGrath, Lauren; Haddad, Stephen; Crane, Jacquelyn; Hezel, Dianne; Illman, Cornelia; Mayerfeld, Catherine; Konkashbaev, Anuar; Liu, Chunyu; Pluzhnikov, Anna; Tikhomirov, Anna; Edlund, Christopher K; Rauch, Scott L; Moessner, Rainald; Falkai, Peter; Maier, Wolfgang; Ruhrmann, Stephan; Grabe, Hans-Jörgen; Lennertz, Leonard; Wagner, Michael; Bellodi, Laura; Cavallini, Maria Cristina; Richter, Margaret A; Cook, Edwin H; Kennedy, James L; Rosenberg, David; Stein, Dan J; Hemmings, Sian MJ; Lochner, Christine; Azzam, Amin; Chavira, Denise A; Fournier, Eduardo; Garrido, Helena; Sheppard, Brooke; Umaña, Paul; Murphy, Dennis L; Wendland, Jens R; Veenstra-VanderWeele, Jeremy; Denys, Damiaan; Blom, Rianne; Deforce, Dieter; Van Nieuwerburgh, Filip; Westenberg, Herman GM; Walitza, Susanne; Egberts, Karin; Renner, Tobias; Miguel, Euripedes Constantino; Cappi, Carolina; Hounie, Ana G; Conceição do Rosário, Maria; Sampaio, Aline S; Vallada, Homero; Nicolini, Humberto; Lanzagorta, Nuria; Camarena, Beatriz; Delorme, Richard; Leboyer, Marion; Pato, Carlos N; Pato, Michele T; Voyiaziakis, Emanuel; Heutink, Peter; Cath, Danielle C; Posthuma, Danielle; Smit, Jan H; Samuels, Jack; Bienvenu, O Joseph; Cullen, Bernadette; Fyer, Abby J; Grados, Marco A; Greenberg, Benjamin D; McCracken, James T; Riddle, Mark A; Wang, Ying; Coric, Vladimir; Leckman, James F; Bloch, Michael; Pittenger, Christopher; Eapen, Valsamma; Black, Donald W; Ophoff, Roel A; Strengman, Eric; Cusi, Daniele; Turiel, Maurizio; Frau, Francesca; Macciardi, Fabio; Gibbs, J Raphael; Cookson, Mark R; Singleton, Andrew; Hardy, John; Crenshaw, Andrew T; Parkin, Melissa A; Mirel, Daniel B; Conti, David V; Purcell, Shaun; Nestadt, Gerald; Hanna, Gregory L; Jenike, Michael A; Knowles, James A; Cox, Nancy; Pauls, David L
Obsessive-compulsive disorder (OCD) is a common, debilitating neuropsychiatric illness with complex genetic etiology. The International OCD Foundation Genetics Collaborative (IOCDF-GC) is a multi-national collaboration established to discover the genetic variation predisposing to OCD. A set of individuals affected with DSM-IV OCD, a subset of their parents, and unselected controls, were genotyped with several different Illumina SNP microarrays. After extensive data cleaning, 1,465 cases, 5,557 ancestry-matched controls and 400 complete trios remained, with a common set of 469,410 autosomal and 9,657 X-chromosome SNPs. Ancestry-stratified case-control association analyses were conducted for three genetically-defined subpopulations and combined in two meta-analyses, with and without the trio-based analysis. In the case-control analysis, the lowest two p-values were located within DLGAP1 (p=2.49×10-6 and p=3.44×10-6), a member of the neuronal postsynaptic density complex. In the trio analysis, rs6131295, near BTBD3, exceeded the genome-wide significance threshold with a p-value=3.84 × 10-8. However, when trios were meta-analyzed with the combined case-control samples, the p-value for this variant was 3.62×10-5, losing genome-wide significance. Although no SNPs were identified to be associated with OCD at a genome-wide significant level in the combined trio-case-control sample, a significant enrichment of methylation-QTLs (p<0.001) and frontal lobe eQTLs (p=0.001) was observed within the top-ranked SNPs (p<0.01) from the trio-case-control analysis, suggesting these top signals may have a broad role in gene expression in the brain, and possibly in the etiology of OCD. PMID:22889921
Stadler, Zsofia K.; Thom, Peter; Robson, Mark E.; Weitzel, Jeffrey N.; Kauff, Noah D.; Hurley, Karen E.; Devlin, Vincent; Gold, Bert; Klein, Robert J.; Offit, Kenneth
Knowledge of the inherited risk for cancer is an important component of preventive oncology. In addition to well-established syndromes of cancer predisposition, much remains to be discovered about the genetic variation underlying susceptibility to common malignancies. Increased knowledge about the human genome and advances in genotyping technology have made possible genome-wide association studies (GWAS) of human diseases. These studies have identified many important regions of genetic variation associated with an increased risk for human traits and diseases including cancer. Understanding the principles, major findings, and limitations of GWAS is becoming increasingly important for oncologists as dissemination of genomic risk tests directly to consumers is already occurring through commercial companies. GWAS have contributed to our understanding of the genetic basis of cancer and will shed light on biologic pathways and possible new strategies for targeted prevention. To date, however, the clinical utility of GWAS-derived risk markers remains limited. PMID:20585100
Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J
The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53-3.14), P=1.9 × 10(-5)). Two polymorphisms at 6p21.2 LINC00951-LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37-1.85), P=1.6 × 10(-9)) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder.
Full Text Available Drug addiction is a costly and recurring healthcare problem, necessitating a need to understand risk factors and mechanisms of addiction, and to identify new biomarkers. To date, genome-wide association studies (GWAS for heroin addiction have been limited; moreover they have been restricted to examining samples of European and African-American origin due to difficulty of recruiting samples from other populations. This is the first study to test a Han Chinese population; we performed a GWAS on a homogeneous sample of 370 Han Chinese subjects diagnosed with heroin dependence using the DSM-IV criteria and 134 ethnically matched controls. Analysis using the diagnostic criteria of heroin dependence yielded suggestive evidence for association between variants in the genes CCDC42 (coiled coil domain 42; p = 2.8x10-7 and BRSK2 (BR serine/threonine 2; p = 4.110-6. In addition, we found evidence for risk variants within the ARHGEF10 (Rho guanine nucleotide exchange factor 10 gene on chromosome 8 and variants in a region on chromosome 20q13, which is gene-poor but has a concentration of mRNAs and predicted miRNAs. Gene-based association analysis identified genome-wide significant association between variants in CCDC42 and heroin addiction. Additionally, when we investigated shared risk variants between heroin addiction and risk of other addiction-related and psychiatric phenotypes using polygenic risk scores, we found a suggestive relationship with variants predicting tobacco addiction, and a significant relationship with variants predicting schizophrenia. Our genome wide association study of heroin dependence provides data in a novel sample, with functionally plausible results and evidence of genetic data of value to the field.
Scharf, Jeremiah M.; Yu, Dongmei; Mathews, Carol A.; Neale, Benjamin M.; Stewart, S. Evelyn; Fagerness, Jesen A; Evans, Patrick; Gamazon, Eric; Edlund, Christopher K.; Service, Susan; Tikhomirov, Anna; Osiecki, Lisa; Illmann, Cornelia; Pluzhnikov, Anna; Konkashbaev, Anuar; Davis, Lea K; Han, Buhm; Crane, Jacquelyn; Moorjani, Priya; Crenshaw, Andrew T.; Parkin, Melissa A.; Reus, Victor I.; Lowe, Thomas L.; Rangel-Lugo, Martha; Chouinard, Sylvain; Dion, Yves; Girard, Simon; Cath, Danielle C; Smit, Jan H; King, Robert A.; Fernandez, Thomas; Leckman, James F.; Kidd, Kenneth K.; Kidd, Judith R.; Pakstis, Andrew J.; State, Matthew; Herrera, Luis Diego; Romero, Roxana; Fournier, Eduardo; Sandor, Paul; Barr, Cathy L; Phan, Nam; Gross-Tsur, Varda; Benarroch, Fortu; Pollak, Yehuda; Budman, Cathy L.; Bruun, Ruth D.; Erenberg, Gerald; Naarden, Allan L; Lee, Paul C; Weiss, Nicholas; Kremeyer, Barbara; Berrío, Gabriel Bedoya; Campbell, Desmond; Silgado, Julio C. Cardona; Ochoa, William Cornejo; Restrepo, Sandra C. Mesa; Muller, Heike; Duarte, Ana V. Valencia; Lyon, Gholson J; Leppert, Mark; Morgan, Jubel; Weiss, Robert; Grados, Marco A.; Anderson, Kelley; Davarya, Sarah; Singer, Harvey; Walkup, John; Jankovic, Joseph; Tischfield, Jay A.; Heiman, Gary A.; Gilbert, Donald L.; Hoekstra, Pieter J.; Robertson, Mary M.; Kurlan, Roger; Liu, Chunyu; Gibbs, J. Raphael; Singleton, Andrew; Hardy, John; Strengman, Eric; Ophoff, Roel; Wagner, Michael; Moessner, Rainald; Mirel, Daniel B.; Posthuma, Danielle; Sabatti, Chiara; Eskin, Eleazar; Conti, David V.; Knowles, James A.; Ruiz-Linares, Andres; Rouleau, Guy A.; Purcell, Shaun; Heutink, Peter; Oostra, Ben A.; McMahon, William; Freimer, Nelson; Cox, Nancy J.; Pauls, David L.
Tourette Syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel, and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (p<5 × 10−8); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (p=1.85 × 10−6). A secondary analysis including an additional 211 cases and 285 controls from two closely-related Latin-American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (p=3.6 × 10−7 for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924
Dumitrescu, Logan; Ritchie, Marylyn D; Denny, Joshua C; El Rouby, Nihal M; McDonough, Caitrin W; Bradford, Yuki; Ramirez, Andrea H; Bielinski, Suzette J; Basford, Melissa A; Chai, High Seng; Peissig, Peggy; Carrell, David; Pathak, Jyotishman; Rasmussen, Luke V; Wang, Xiaoming; Pacheco, Jennifer A; Kho, Abel N; Hayes, M Geoffrey; Matsumoto, Martha; Smith, Maureen E; Li, Rongling; Cooper-DeHoff, Rhonda M; Kullo, Iftikhar J; Chute, Christopher G; Chisholm, Rex L; Jarvik, Gail P; Larson, Eric B; Carey, David; McCarty, Catherine A; Williams, Marc S; Roden, Dan M; Bottinger, Erwin; Johnson, Julie A; de Andrade, Mariza; Crawford, Dana C
Resistant hypertension is defined as high blood pressure that remains above treatment goals in spite of the concurrent use of three antihypertensive agents from different classes. Despite the important health consequences of resistant hypertension, few studies of resistant hypertension have been conducted. To perform a genome-wide association study for resistant hypertension, we defined and identified cases of resistant hypertension and hypertensives with treated, controlled hypertension among >47,500 adults residing in the US linked to electronic health records (EHRs) and genotyped as part of the electronic MEdical Records & GEnomics (eMERGE) Network. Electronic selection logic using billing codes, laboratory values, text queries, and medication records was used to identify resistant hypertension cases and controls at each site, and a total of 3,006 cases of resistant hypertension and 876 controlled hypertensives were identified among eMERGE Phase I and II sites. After imputation and quality control, a total of 2,530,150 SNPs were tested for an association among 2,830 multi-ethnic cases of resistant hypertension and 876 controlled hypertensives. No test of association was genome-wide significant in the full dataset or in the dataset limited to European American cases (n = 1,719) and controls (n = 708). The most significant finding was CLNK rs13144136 at p = 1.00x10-6 (odds ratio = 0.68; 95% CI = 0.58-0.80) in the full dataset with similar results in the European American only dataset. We also examined whether SNPs known to influence blood pressure or hypertension also influenced resistant hypertension. None was significant after correction for multiple testing. These data highlight both the difficulties and the potential utility of EHR-linked genomic data to study clinically-relevant traits such as resistant hypertension.
Beaupere, Carine; Chen, Rosalyn B; Pelosi, William; Labunskyy, Vyacheslav M
Translation of mRNA into proteins is a complex process involving several layers of regulation. It is often assumed that changes in mRNA transcription reflect changes in protein synthesis, but many exceptions have been observed. Recently, a technique called ribosome profiling (or Ribo-Seq) has emerged as a powerful method that allows identification, with high accuracy, which regions of mRNA are translated into proteins and quantification of translation at the genome-wide level. Here, we present a generalized protocol for genome-wide quantification of translation using Ribo-Seq in budding yeast. In addition, combining Ribo-Seq data with mRNA abundance measurements allows us to simultaneously quantify translation efficiency of thousands of mRNA transcripts in the same sample and compare changes in these parameters in response to experimental manipulations or in different physiological states. We describe a detailed protocol for generation of ribosome footprints using nuclease digestion, isolation of intact ribosome-footprint complexes via sucrose gradient fractionation, and preparation of DNA libraries for deep sequencing along with appropriate quality controls necessary to ensure accurate analysis of in vivo translation.
Full Text Available Asperger Syndrome (AS is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC, which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448 were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448 lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.
Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Baron-Cohen, Simon
Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.
Weinsheimer, Shantel; Bendjilali, Nasrine; Nelson, Jeffrey; Guo, Diana E; Zaroff, Jonathan G; Sidney, Stephen; McCulloch, Charles E; Al-Shahi Salman, Rustam; Berg, Jonathan N; Koeleman, Bobby P C; Simon, Matthias; Bostroem, Azize; Fontanella, Marco; Sturiale, Carmelo L; Pola, Roberto; Puca, Alfredo; Lawton, Michael T; Young, William L; Pawlikowska, Ludmila; Klijn, Catharina J M; Kim, Helen
The pathogenesis of sporadic brain arteriovenous malformations (BAVMs) remains unknown, but studies suggest a genetic component. We estimated the heritability of sporadic BAVM and performed a genome-wide association study (GWAS) to investigate association of common single nucleotide polymorphisms (SNPs) with risk of sporadic BAVM in the international, multicentre Genetics of Arteriovenous Malformation (GEN-AVM) consortium. The Caucasian discovery cohort included 515 BAVM cases and 1191 controls genotyped using Affymetrix genome-wide SNP arrays. Genotype data were imputed to 1000 Genomes Project data, and well-imputed SNPs (>0.01 minor allele frequency) were analysed for association with BAVM. 57 top BAVM-associated SNPs (51 SNPs with pJAG1 and BNC2. Of the 6 candidate SNPs, 2 in ACVRL1 and MMP3 had a nominal p<0.05 in the replication cohort. We performed the first GWAS of sporadic BAVM in the largest BAVM cohort assembled to date. No GWAS SNPs were replicated, suggesting that common SNPs do not contribute strongly to BAVM susceptibility. However, heritability estimates suggest a modest but significant genetic contribution. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Blokland, Gabriëlla A M; Wallace, Angus K; Hansell, Narelle K; Thompson, Paul M; Hickie, Ian B; Montgomery, Grant W; Martin, Nicholas G; McMahon, Katie L; de Zubicaray, Greig I; Wright, Margaret J
In a population-based genome-wide association (GWA) study of n-back working memory task-related brain activation, we extracted the average percent BOLD signal change (2-back minus 0-back) from 46 regions-of-interest (ROIs) in functional MRI scans from 863 healthy twins and siblings. ROIs were obtained by creating spheres around group random effects analysis local maxima, and by thresholding a voxel-based heritability map of working memory brain activation at 50%. Quality control for test-retest reliability and heritability of ROI measures yielded 20 reliable (r>0.7) and heritable (h(2)>20%) ROIs. For GWA analysis, the cohort was divided into a discovery (n=679) and replication (n=97) sample. No variants survived the stringent multiple-testing-corrected genome-wide significance threshold (pmemory. Variants identified here may be relevant to (the susceptibility to) common disorders affecting brain function. Copyright © 2016 Elsevier B.V. All rights reserved.
Ballard, David; Abraham, Clara; Cho, Judy; Zhao, Hongyu
The use of biological annotation such as genes and pathways in the analysis of gene expression data has aided the identification of genes for follow-up studies and suggested functional information to uncharacterized genes. Several studies have applied similar methods to genome wide association studies and identified a number of disease related pathways. However, many questions remain on how to best approach this problem, such as whether there is a need to obtain a score to summarize association evidence at the gene level, and whether a pathway, dominated by just a few highly significant genes, is of interest. We evaluated the performance of two pathway-based methods (Random Set, and Binomial approximation to the hypergeometric test) based on their applications to three data sets of Crohn's disease. We consider both the disease status as a phenotype as well as the residuals after conditioning on IL23R, a known Crohn's related gene, as a phenotype. Our results show that Random Set method has the most power to identify disease related pathways. We confirm previously reported disease related pathways and provide evidence for IL-2 Receptor Beta Chain in T cell Activation and IL-9 signaling as Crohn's disease associated pathways. Our results highlight the need to apply powerful gene score methods prior to pathway enrichment tests, and that controlling for genes that attain genome wide significance enable further biological insight.
Full Text Available Abstract Background The use of biological annotation such as genes and pathways in the analysis of gene expression data has aided the identification of genes for follow-up studies and suggested functional information to uncharacterized genes. Several studies have applied similar methods to genome wide association studies and identified a number of disease related pathways. However, many questions remain on how to best approach this problem, such as whether there is a need to obtain a score to summarize association evidence at the gene level, and whether a pathway, dominated by just a few highly significant genes, is of interest. Methods We evaluated the performance of two pathway-based methods (Random Set, and Binomial approximation to the hypergeometric test based on their applications to three data sets of Crohn's disease. We consider both the disease status as a phenotype as well as the residuals after conditioning on IL23R, a known Crohn's related gene, as a phenotype. Results Our results show that Random Set method has the most power to identify disease related pathways. We confirm previously reported disease related pathways and provide evidence for IL-2 Receptor Beta Chain in T cell Activation and IL-9 signaling as Crohn's disease associated pathways. Conclusions Our results highlight the need to apply powerful gene score methods prior to pathway enrichment tests, and that controlling for genes that attain genome wide significance enable further biological insight.
Full Text Available Lipoprotein-associated phospholipase A(2 (Lp-PLA(2 is an emerging risk factor and therapeutic target for cardiovascular disease. The activity and mass of this enzyme are heritable traits, but major genetic determinants have not been explored in a systematic, genome-wide fashion. We carried out a genome-wide association study of Lp-PLA(2 activity and mass in 6,668 Caucasian subjects from the population-based Framingham Heart Study. Clinical data and genotypes from the Affymetrix 550K SNP array were obtained from the open-access Framingham SHARe project. Each polymorphism that passed quality control was tested for associations with Lp-PLA(2 activity and mass using linear mixed models implemented in the R statistical package, accounting for familial correlations, and controlling for age, sex, smoking, lipid-lowering-medication use, and cohort. For Lp-PLA(2 activity, polymorphisms at four independent loci reached genome-wide significance, including the APOE/APOC1 region on chromosome 19 (p = 6 x 10(-24; CELSR2/PSRC1 on chromosome 1 (p = 3 x 10(-15; SCARB1 on chromosome 12 (p = 1x10(-8 and ZNF259/BUD13 in the APOA5/APOA1 gene region on chromosome 11 (p = 4 x 10(-8. All of these remained significant after accounting for associations with LDL cholesterol, HDL cholesterol, or triglycerides. For Lp-PLA(2 mass, 12 SNPs achieved genome-wide significance, all clustering in a region on chromosome 6p12.3 near the PLA2G7 gene. Our analyses demonstrate that genetic polymorphisms may contribute to inter-individual variation in Lp-PLA(2 activity and mass.
Jost, Daniel; Everaers, Ralf
Correspondences between functional and thermodynamic melting properties in a genome are being increasingly employed for ab initio gene finding and for the interpretation of the evolution of genomes. Here we present the first systematic genome wide comparison between biologically coding domains and thermodynamically stable regions. In particular, we develop statistical methods to estimate the reliability of the resulting predictions. Not surprisingly, we find that the success of the approach depends on the difference in GC content between the coding and the non-coding parts of the genome and on the percentage of coding base-pairs in the sequence. These prerequisites vary strongly between species, where we observe no systematic differences between eukaryotes and prokaryotes. We find a number of organisms in which the strong correlation of coding domains and thermodynamically stable regions allows us to identify putative exons or genes to complement existing approaches. In contrast to previous investigations along these lines we have not employed the Poland-Scheraga (PS) model of DNA melting but use the earlier Zimm-Bragg (ZB) model. The Ising-like form of the ZB model can be viewed as an approximation to the PS model, with averaged loop entropies included into the cooperative factor [Formula: see text]. This results in a speed-up by a factor of 20-100 compared to the Fixman-Freire algorithm for the solution of the PS model. We show that for genomic sequences the resulting systematic errors are negligible compared to the parameterization uncertainty of the models. We argue that for limited computing resources, available CPU power is better invested in broadening the statistical base for genomic investigations than in marginal improvements of the description of the physical melting behavior.
Gois, I B; Borém, A; Cristofani-Yaly, M; de Resende, M D V; Azevedo, C F; Bastianel, M; Novelli, V M; Machado, M A
Genome wide selection (GWS) is essential for the genetic improvement of perennial species such as Citrus because of its ability to increase gain per unit time and to enable the efficient selection of characteristics with low heritability. This study assessed GWS efficiency in a population of Citrus and compared it with selection based on phenotypic data. A total of 180 individual trees from a cross between Pera sweet orange (Citrus sinensis Osbeck) and Murcott tangor (Citrus sinensis Osbeck x Citrus reticulata Blanco) were evaluated for 10 characteristics related to fruit quality. The hybrids were genotyped using 5287 DArT_seq TM (diversity arrays technology) molecular markers and their effects on phenotypes were predicted using the random regression - best linear unbiased predictor (rr-BLUP) method. The predictive ability, prediction bias, and accuracy of GWS were estimated to verify its effectiveness for phenotype prediction. The proportion of genetic variance explained by the markers was also computed. The heritability of the traits, as determined by markers, was 16-28%. The predictive ability of these markers ranged from 0.53 to 0.64, and the regression coefficients between predicted and observed phenotypes were close to unity. Over 35% of the genetic variance was accounted for by the markers. Accuracy estimates with GWS were lower than those obtained by phenotypic analysis; however, GWS was superior in terms of genetic gain per unit time. Thus, GWS may be useful for Citrus breeding as it can predict phenotypes early and accurately, and reduce the length of the selection cycle. This study demonstrates the feasibility of genomic selection in Citrus.
Nalls, Mike A.; Pankratz, Nathan; Lill, Christina M.; Do, Chuong B.; Hernandez, Dena G.; Saad, Mohamad; DeStefano, Anita L.; Kara, Eleanna; Bras, Jose; Sharma, Manu; Schulte, Claudia; Keller, Margaux F.; Arepalli, Sampath; Letson, Christopher; Edsall, Connor; Stefansson, Hreinn; Liu, Xinmin; Pliner, Hannah; Lee, Joseph H.; Cheng, Rong; Ikram, M. Arfan; Ioannidis, John P. A.; Hadjigeorgiou, Georgios M.; Bis, Joshua C.; Martinez, Maria; Perlmutter, Joel S.; Goate, Alison; Marder, Karen; Fiske, Brian; Sutherland, Margaret; Xiromerisiou, Georgia; Myers, Richard H.; Clark, Lorraine N.; Stefansson, Kari; Hardy, John A.; Heutink, Peter; Chen, Honglei; Wood, Nicholas W.; Houlden, Henry; Payami, Haydeh; Brice, Alexis; Scott, William K.; Gasser, Thomas; Bertram, Lars; Eriksson, Nicholas; Foroud, Tatiana; Singleton, Andrew B.; Plagnol, Vincent; Sheerin, Una-Marie; Simón-Sánchez, Javier; Lesage, Suzanne; Sveinbjörnsdóttir, Sigurlaug; Barker, Roger; Ben-Shlomo, Yoav; Berendse, Henk W.; Berg, Daniela; Bhatia, Kailash; de Bie, Rob M. A.; Biffi, Alessandro; Bloem, Bas; Bochdanovits, Zoltan; Bonin, Michael; Bras, Jose M.; Brockmann, Kathrin; Brooks, Janet; Burn, David J.; Charlesworth, Gavin; Chinnery, Patrick F.; Chong, Sean; Clarke, Carl E.; Cookson, Mark R.; Cooper, J. Mark; Corvol, Jean Christophe; Counsell, Carl; Damier, Philippe; Dartigues, Jean-François; Deloukas, Panos; Deuschl, Günther; Dexter, David T.; van Dijk, Karin D.; Dillman, Allissa; Durif, Frank; Dürr, Alexandra; Edkins, Sarah; Evans, Jonathan R.; Foltynie, Thomas; Dong, Jing; Gardner, Michelle; Gibbs, J. Raphael; Gray, Emma; Guerreiro, Rita; Harris, Clare; van Hilten, Jacobus J.; Hofman, Albert; Hollenbeck, Albert; Holton, Janice; Hu, Michele; Huang, Xuemei; Wurster, Isabel; Mätzler, Walter; Hudson, Gavin; Hunt, Sarah E.; Huttenlocher, Johanna; Illig, Thomas; Jónsson, Pálmi V.; Lambert, Jean-Charles; Langford, Cordelia; Lees, Andrew; Lichtner, Peter; Limousin, Patricia; Lopez, Grisel; Lorenz, Delia; McNeill, Alisdair; Moorby, Catriona; Moore, Matthew; Morris, Huw R.; Morrison, Karen E.; Mudanohwo, Ese; O'Sullivan, Sean S.; Pearson, Justin; Pétursson, Hjörvar; Pollak, Pierre; Post, Bart; Potter, Simon; Ravina, Bernard; Revesz, Tamas; Riess, Olaf; Rivadeneira, Fernando; Rizzu, Patrizia; Ryten, Mina; Sawcer, Stephen; Schapira, Anthony; Scheffer, Hans; Shaw, Karen; Shoulson, Ira; Sidransky, Ellen; Smith, Colin; Spencer, Chris C. A.; Stefánsson, Hreinn; Bettella, Francesco; Stockton, Joanna D.; Strange, Amy; Talbot, Kevin; Tanner, Carlie M.; Tashakkori-Ghanbaria, Avazeh; Tison, François; Trabzuni, Daniah; Traynor, Bryan J.; Uitterlinden, André G.; Velseboer, Daan; Vidailhet, Marie; Walker, Robert; van de Warrenburg, Bart; Wickremaratchi, Mirdhu; Williams, Nigel; Williams-Gray, Caroline H.; Winder-Rhodes, Sophie; Stefánsson, Kári; Hardy, John; Factor, S.; Higgins, D.; Evans, S.; Shill, H.; Stacy, M.; Danielson, J.; Marlor, L.; Williamson, K.; Jankovic, J.; Hunter, C.; Simon, D.; Ryan, P.; Scollins, L.; Saunders-Pullman, R.; Boyar, K.; Costan-Toth, C.; Ohmann, E.; Sudarsky, L.; Joubert, C.; Friedman, J.; Chou, K.; Fernandez, H.; Lannon, M.; Galvez-Jimenez, N.; Podichetty, A.; Thompson, K.; Lewitt, P.; Deangelis, M.; O'Brien, C.; Seeberger, L.; Dingmann, C.; Judd, D.; Marder, K.; Fraser, J.; Harris, J.; Bertoni, J.; Peterson, C.; Rezak, M.; Medalle, G.; Chouinard, S.; Panisset, M.; Hall, J.; Poiffaut, H.; Calabrese, V.; Roberge, P.; Wojcieszek, J.; Belden, J.; Jennings, D.; Marek, K.; Mendick, S.; Reich, S.; Dunlop, B.; Jog, M.; Horn, C.; Uitti, R.; Turk, M.; Ajax, T.; Mannetter, J.; Sethi, K.; Carpenter, J.; Dill, B.; Hatch, L.; Ligon, K.; Narayan, S.; Blindauer, K.; Abou-Samra, K.; Petit, J.; Elmer, L.; Aiken, E.; Davis, K.; Schell, C.; Wilson, S.; Velickovic, M.; Koller, W.; Phipps, S.; Feigin, A.; Gordon, M.; Hamann, J.; Licari, E.; Marotta-Kollarus, M.; Shannon, B.; Winnick, R.; Simuni, T.; Videnovic, A.; Kaczmarek, A.; Williams, K.; Wolff, M.; Rao, J.; Cook, M.; Fernandez, M.; Kostyk, S.; Hubble, J.; Campbell, A.; Reider, C.; Seward, A.; Camicioli, R.; Carter, J.; Nutt, J.; Andrews, P.; Morehouse, S.; Stone, C.; Mendis, T.; Grimes, D.; Alcorn-Costa, C.; Gray, P.; Haas, K.; Vendette, J.; Sutton, J.; Hutchinson, B.; Young, J.; Rajput, A.; Klassen, L.; Shirley, T.; Manyam, B.; Simpson, P.; Whetteckey, J.; Wulbrecht, B.; Truong, D.; Pathak, M.; Frei, K.; Luong, N.; Tra, T.; Tran, A.; Vo, J.; Lang, A.; Kleiner- Fisman, G.; Nieves, A.; Johnston, L.; So, J.; Podskalny, G.; Giffin, L.; Atchison, P.; Allen, C.; Martin, W.; Wieler, M.; Suchowersky, O.; Furtado, S.; Klimek, M.; Hermanowicz, N.; Niswonger, S.; Shults, C.; Fontaine, D.; Aminoff, M.; Christine, C.; Diminno, M.; Hevezi, J.; Dalvi, A.; Kang, U.; Richman, J.; Uy, S.; Sahay, A.; Gartner, M.; Schwieterman, D.; Hall, D.; Leehey, M.; Culver, S.; Derian, T.; Demarcaida, T.; Thurlow, S.; Rodnitzky, R.; Dobson, J.; Lyons, K.; Pahwa, R.; Gales, T.; Thomas, S.; Shulman, L.; Weiner, W.; Dustin, K.; Singer, C.; Zelaya, L.; Tuite, P.; Hagen, V.; Rolandelli, S.; Schacherer, R.; Kosowicz, J.; Gordon, P.; Werner, J.; Serrano, C.; Roque, S.; Kurlan, R.; Berry, D.; Gardiner, I.; Hauser, R.; Sanchez-Ramos, J.; Zesiewicz, T.; Delgado, H.; Price, K.; Rodriguez, P.; Wolfrath, S.; Pfeiffer, R.; Davis, L.; Pfeiffer, B.; Dewey, R.; Hayward, B.; Johnson, A.; Meacham, M.; Estes, B.; Walker, F.; Hunt, V.; O'Neill, C.; Racette, B.; Swisher, L.; Dijamco, Cheri; Conley, Emily Drabant; Dorfman, Elizabeth; Tung, Joyce Y.; Hinds, David A.; Mountain, Joanna L.; Wojcicki, Anne; Lew, M.; Klein, C.; Golbe, L.; Growdon, J.; Wooten, G. F.; Watts, R.; Guttman, M.; Goldwurm, S.; Saint-Hilaire, M. H.; Baker, K.; Litvan, I.; Nicholson, G.; Nance, M.; Drasby, E.; Isaacson, S.; Burn, D.; Pramstaller, P.; Al-hinti, J.; Moller, A.; Sherman, S.; Roxburgh, R.; Slevin, J.; Perlmutter, J.; Mark, M. H.; Huggins, N.; Pezzoli, G.; Massood, T.; Itin, I.; Corbett, A.; Chinnery, P.; Ostergaard, K.; Snow, B.; Cambi, F.; Kay, D.; Samii, A.; Agarwal, P.; Roberts, J. W.; Higgins, D. S.; Molho, Eric; Rosen, Ami; Montimurro, J.; Martinez, E.; Griffith, A.; Kusel, V.; Yearout, D.; Zabetian, C.; Clark, L. N.; Liu, X.; Lee, J. H.; Taub, R. Cheng; Louis, E. D.; Cote, L. J.; Waters, C.; Ford, B.; Fahn, S.; Vance, Jeffery M.; Beecham, Gary W.; Martin, Eden R.; Nuytemans, Karen; Pericak-Vance, Margaret A.; Haines, Jonathan L.; DeStefano, Anita; Seshadri, Sudha; Choi, Seung Hoan; Frank, Samuel; Psaty, Bruce M.; Rice, Kenneth; Longstreth, W. T.; Ton, Thanh G. N.; Jain, Samay; van Duijn, Cornelia M.; Verlinden, Vincent J.; Koudstaal, Peter J.; Singleton, Andrew; Cookson, Mark; Hernandez, Dena; Nalls, Michael; Zonderman, Alan; Ferrucci, Luigi; Johnson, Robert; Longo, Dan; O'Brien, Richard; Traynor, Bryan; Troncoso, Juan; van der Brug, Marcel; Zielke, Ronald; Weale, Michael; Ramasamy, Adaikalavan; Dardiotis, Efthimios; Tsimourtou, Vana; Spanaki, Cleanthe; Plaitakis, Andreas; Bozi, Maria; Stefanis, Leonidas; Vassilatis, Dimitris; Koutsis, Georgios; Panas, Marios; Lunnon, Katie; Lupton, Michelle; Powell, John; Parkkinen, Laura; Ansorge, Olaf
We conducted a meta-analysis of Parkinson's disease genome-wide association studies using a common set of 7,893,274 variants across 13,708 cases and 95,282 controls. Twenty-six loci were identified as having genome-wide significant association; these and 6 additional previously reported loci were
Full Text Available Michael E March,1 Patrick MA Sleiman,1,2 Hakon Hakonarson1,2 1Center for Applied Genomics, Children's Hospital of Philadelphia Research Institute, 2Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Abstract: Genetic studies of asthma have revealed that there is considerable heritability to the phenotype. An extensive history of candidate-gene studies has identified a long list of genes associated with immune function that are potentially involved in asthma pathogenesis. However, many of the results of candidate-gene studies have failed to be replicated, leaving in question the true impact of the implicated biological pathways on asthma. With the advent of genome-wide association studies, geneticists are able to examine the association of hundreds of thousands of genetic markers with a phenotype, allowing the hypothesis-free identification of variants associated with disease. Many such studies examining asthma or related phenotypes have been published, and several themes have begun to emerge regarding the biological pathways underpinning asthma. The results of many genome-wide association studies have currently not been replicated, and the large sample sizes required for this experimental strategy invoke difficulties with sample stratification and phenotypic heterogeneity. Recently, large collaborative groups of researchers have formed consortia focused on asthma, with the goals of sharing material and data and standardizing diagnosis and experimental methods. Additionally, research has begun to focus on genetic variants that affect the response to asthma medications and on the biology that generates the heterogeneity in the asthma phenotype. As this work progresses, it will move asthma patients closer to more specific, personalized medicine. Keywords: asthma, genetics, GWAS, pharmacogenetics, biomarkers
Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J
The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53–3.14), P=1.9 × 10-5). Two polymorphisms at 6p21.2 LINC00951–LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37–1.85), P=1.6 × 10−9) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967
Full Text Available Complex regional pain syndrome (CRPS is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II and 5 controls (cut-off value: 1.5-fold change and p<0.05. Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1, matrix metalloproteinase 9 (MMP9, alanine aminopeptidase N (ANPEP, l-histidine decarboxylase (HDC, granulocyte colony-stimulating factor 3 receptor (G-CSF3R, and signal transducer and activator of transcription 3 (STAT3 genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR. We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10(-4. The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression.
Jin, Eun-Heui; Zhang, Enji; Ko, Youngkwon; Sim, Woo Seog; Moon, Dong Eon; Yoon, Keon Jung; Hong, Jang Hee; Lee, Won Hyung
Complex regional pain syndrome (CRPS) is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II) and 5 controls (cut-off value: 1.5-fold change and p<0.05). Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1), matrix metalloproteinase 9 (MMP9), alanine aminopeptidase N (ANPEP), l-histidine decarboxylase (HDC), granulocyte colony-stimulating factor 3 receptor (G-CSF3R), and signal transducer and activator of transcription 3 (STAT3) genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10−4). The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression. PMID:24244504
Roh, Tae-young; Ngau, Wing Chi; Cui, Kairong; Landsman, David; Zhao, Keji
The expression patterns of eukaryotic genomes are controlled by their chromatin structure, consisting of nucleosome subunits in which DNA of approximately 146 bp is wrapped around a core of 8 histone molecules. Post-translational histone modifications play an essential role in modifying chromatin structure. Here we apply a combination of SAGE and chromatin immunoprecipitation (ChIP) protocols to determine the distribution of hyperacetylated histones H3 and H4 in the Saccharomyces cerevisiae genome. We call this approach genome-wide mapping technique (GMAT). Using GMAT, we find that the highest acetylation levels are detected in the 5' end of a gene's coding region, but not in the promoter. Furthermore, we show that the histone acetyltransferase, GCN5p, regulates H3 acetylation in the promoter and 5' end of the coding regions. These findings indicate that GMAT should find valuable applications in mapping target sites of chromatin-modifying enzymes.
Rahmioglu, N.; Macgregor, S.; Drong, A.W.; Hedman, A.K.; Harris, H.R.; Randall, J.C.; Prokopenko, I.; Hottenga, J.J.; Boomsma, D.I.; Nyholt, DR; Morris, A.P.; Montgomery, G.W.; Missmer, S.A.; Lindgren, C.M.; Zondervan, K.T.
Endometriosis is a chronic inflammatory condition in women that results in pelvic pain and subfertility, and has been associated with decreased body mass index (BMI). Genetic variants contributing to the heritable component have started to emerge from genome-wide association studies (GWAS), although
Witoelar, Aree; Jansen, Iris E; Wang, Yunpeng; Desikan, Rahul S; Gibbs, J Raphael; Blauwendraat, Cornelis; Thompson, Wesley K; Hernandez, Dena G; Djurovic, Srdjan; Schork, Andrew J; Bettella, Francesco; Ellinghaus, David; Franke, Andre; Lie, Benedicte A; McEvoy, Linda K; Karlsen, Tom H; Lesage, Suzanne; Morris, Huw R; Brice, Alexis; Wood, Nicholas W; Heutink, Peter; Hardy, John; Singleton, Andrew B; Dale, Anders M; Gasser, Thomas; Andreassen, Ole A; Sharma, Manu
Recent genome-wide association studies (GWAS) and pathway analyses supported long-standing observations of an association between immune-mediated diseases and Parkinson disease (PD). The post-GWAS era provides an opportunity for cross-phenotype analyses between different complex phenotypes. To test the hypothesis that there are common genetic risk variants conveying risk of both PD and autoimmune diseases (ie, pleiotropy) and to identify new shared genetic variants and their pathways by applying a novel statistical framework in a genome-wide approach. Using the conjunction false discovery rate method, this study analyzed GWAS data from a selection of archetypal autoimmune diseases among 138 511 individuals of European ancestry and systemically investigated pleiotropy between PD and type 1 diabetes, Crohn disease, ulcerative colitis, rheumatoid arthritis, celiac disease, psoriasis, and multiple sclerosis. NeuroX data (6927 PD cases and 6108 controls) were used for replication. The study investigated the biological correlation between the top loci through protein-protein interaction and changes in the gene expression and methylation levels. The dates of the analysis were June 10, 2015, to March 4, 2017. The primary outcome was a list of novel loci and their pathways involved in PD and autoimmune diseases. Genome-wide conjunctional analysis identified 17 novel loci at false discovery rate less than 0.05 with overlap between PD and autoimmune diseases, including known PD loci adjacent to GAK, HLA-DRB5, LRRK2, and MAPT for rheumatoid arthritis, ulcerative colitis and Crohn disease. Replication confirmed the involvement of HLA, LRRK2, MAPT, TRIM10, and SETD1A in PD. Among the novel genes discovered, WNT3, KANSL1, CRHR1, BOLA2, and GUCY1A3 are within a protein-protein interaction network with known PD genes. A subset of novel loci was significantly associated with changes in methylation or expression levels of adjacent genes. The study findings provide novel mechanistic
Vu, Thanh Long; Chatzivasileiadis, Spyros; Chiang, Hsiao-Dong
demand and hopefully stabilize the system. This traditional emergency control results in interrupted service with severe economic damage to customers. Also, such control is usually less effective due to the lack of coordination among protective devices. In this paper, we propose a novel structural...... emergency control to render post-fault dynamics from the critical/emergency fault cleared state to the stable equilibrium point. This is a new control paradigm that does not rely on any continuous measurement or load shedding, as in the classical setup. Instead, the grid is made stable by discretely...... relocating the equilibrium point and its stability region, such that the system is consecutively attracted from the fault-cleared state back to the original equilibrium point. The proposed control is designed by solving linear and convex optimization problems, making it possibly scalable to large-scale power...
Full Text Available While genome-wide association studies are ongoing to identify sequence variation influencing susceptibility to major depressive disorder (MDD, epigenetic marks, such as DNA methylation, which can be influenced by environment, might also play a role. Here we present the first genome-wide DNA methylation (DNAm scan in MDD. We compared 39 postmortem frontal cortex MDD samples to 26 controls. DNA was hybridized to our Comprehensive High-throughput Arrays for Relative Methylation (CHARM platform, covering 3.5 million CpGs. CHARM identified 224 candidate regions with DNAm differences >10%. These regions are highly enriched for neuronal growth and development genes. Ten of 17 regions for which validation was attempted showed true DNAm differences; the greatest were in PRIMA1, with 12-15% increased DNAm in MDD (p = 0.0002-0.0003, and a concomitant decrease in gene expression. These results must be considered pilot data, however, as we could only test replication in a small number of additional brain samples (n = 16, which showed no significant difference in PRIMA1. Because PRIMA1 anchors acetylcholinesterase in neuronal membranes, decreased expression could result in decreased enzyme function and increased cholinergic transmission, consistent with a role in MDD. We observed decreased immunoreactivity for acetylcholinesterase in MDD brain with increased PRIMA1 DNAm, non-significant at p = 0.08.While we cannot draw firm conclusions about PRIMA1 DNAm in MDD, the involvement of neuronal development genes across the set showing differential methylation suggests a role for epigenetics in the illness. Further studies using limbic system brain regions might shed additional light on this role.
Feng, Yue; Lu, Qing; Zhai, Rongrong; Zhang, Mengchen; Xu, Qun; Yang, Yaolong; Wang, Shan; Yuan, Xiaoping; Yu, Hanyong; Wang, Yiping; Wei, Xinghua
Using genome-wide association mapping, 47 SNPs within 27 significant loci were identified for four grain shape traits, and 424 candidate genes were predicted from public database. Grain shape is a key determinant of grain yield and quality in rice (Oryza sativa L.). However, our knowledge of genes controlling rice grain shape remains limited. Genome-wide association mapping based on linkage disequilibrium (LD) has recently emerged as an effective approach for identifying genes or quantitative trait loci (QTL) underlying complex traits in plants. In this study, association mapping based on 5291 single nucleotide polymorphisms (SNPs) was conducted to identify significant loci associated with grain shape traits in a global collection of 469 diverse rice accessions. A total of 47 SNPs were located in 27 significant loci for four grain traits, and explained ~44.93-65.90 % of the phenotypic variation for each trait. In total, 424 candidate genes within a 200 kb extension region (±100 kb of each locus) of these loci were predicted. Of them, the cloned genes GS3 and qSW5 showed very strong effects on grain length and grain width in our study. Comparing with previously reported QTLs for grain shape traits, we found 11 novel loci, including 3, 3, 2 and 3 loci for grain length, grain width, grain length-width ratio and thousand grain weight, respectively. Validation of these new loci would be performed in the future studies. These results revealed that besides GS3 and qSW5, multiple novel loci and mechanisms were involved in determining rice grain shape. These findings provided valuable information for understanding of the genetic control of grain shape and molecular marker assistant selection (MAS) breeding in rice.
Weinsheimer, S.; Bendjilali, N.; Nelson, J.; Guo, D.E.; Zaroff, J.G.; Sidney, S.; McCulloch, C.E.; Salman, R. Al-Shahi; Berg, J.N.; Koeleman, B.P.C.; Simon, M.; Bostroem, A.; Fontanella, M.; Sturiale, C.L.; Pola, R.; Puca, A.; Lawton, M.T.; Young, W.L.; Pawlikowska, L.; Klijn, C.J.M.; Kim, H.
BACKGROUND: The pathogenesis of sporadic brain arteriovenous malformations (BAVMs) remains unknown, but studies suggest a genetic component. We estimated the heritability of sporadic BAVM and performed a genome-wide association study (GWAS) to investigate association of common single nucleotide
Jin, Ying; Birlea, Stanca A.; Fain, Pamela R.; Ferrara, Tracey M.; Ben, Songtao; Riccardi, Sheri L.; Cole, Joanne B.; Gowan, Katherine; Holland, Paulene J.; Bennett, Dorothy C.; Luiten, Rosalie M.; Wolkerstorfer, Albert; van der Veen, J. P. Wietze; Hartmann, Anke; Eichner, Saskia; Schuler, Gerold; van Geel, Nanja; Lambert, Jo; Kemp, E. Helen; Gawkrodger, David J.; Weetman, Anthony P.; Taïeb, Alain; Jouary, Thomas; Ezzedine, Khaled; Wallace, Margaret R.; McCormack, Wayne T.; Picardo, Mauro; Leone, Giovanni; Overbeck, Andreas; Silverberg, Nanette B.; Spritz, Richard A.
We previously reported a genome-wide association study (GWAS) identifying 14 susceptibility loci for generalized vitiligo. We report here a second GWAS (450 individuals with vitiligo (cases) and 3,182 controls), an independent replication study (1,440 cases and 1,316 controls) and a meta-analysis
Levinson, Douglas F; Shi, Jianxin; Wang, Kai
The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs).......The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs)....
Fanous, Ayman H; Zhou, Baiyu; Aggen, Steven H
Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia.......Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia....
Sung, Myong-Hee; Baek, Songjoon; Hager, Gordon L
High-throughput sequencing technologies have allowed many gene locus-level molecular biology assays to become genome-wide profiling methods. DNA-cleaving enzymes such as DNase I have been used to probe accessible chromatin. The accessible regions contain functional regulatory sites, including promoters, insulators and enhancers. Deep sequencing of DNase-seq libraries and computational analysis of the cut profiles have been used to infer protein occupancy in the genome at the nucleotide level, a method introduced as 'digital genomic footprinting'. The approach has been proposed as an attractive alternative to the analysis of transcription factors (TFs) by chromatin immunoprecipitation followed by sequencing (ChIP-seq), and in theory it should overcome antibody issues, poor resolution and batch effects. Recent reports point to limitations of the DNase-based genomic footprinting approach and call into question the scope of detectable protein occupancy, especially for TFs with short-lived chromatin binding. The genomics community is grappling with issues concerning the utility of genomic footprinting and is reassessing the proposed approaches in terms of robust deliverables. Here we summarize the consensus as well as different views emerging from recent reports, and we describe the remaining issues and hurdles for genomic footprinting.
Full Text Available Abstract Background Despite the recent success of genome-wide association studies in identifying novel loci contributing effects to complex human traits, such as type 2 diabetes and obesity, much of the genetic component of variation in these phenotypes remains unexplained. One way to improving power to detect further novel loci is through meta-analysis of studies from the same population, increasing the sample size over any individual study. Although statistical software analysis packages incorporate routines for meta-analysis, they are ill equipped to meet the challenges of the scale and complexity of data generated in genome-wide association studies. Results We have developed flexible, open-source software for the meta-analysis of genome-wide association studies. The software incorporates a variety of error trapping facilities, and provides a range of meta-analysis summary statistics. The software is distributed with scripts that allow simple formatting of files containing the results of each association study and generate graphical summaries of genome-wide meta-analysis results. Conclusions The GWAMA (Genome-Wide Association Meta-Analysis software has been developed to perform meta-analysis of summary statistics generated from genome-wide association studies of dichotomous phenotypes or quantitative traits. Software with source files, documentation and example data files are freely available online at http://www.well.ox.ac.uk/GWAMA.
Mogensen, Mette Sloth; Karlskov-Mortensen, Peter; Proschowsky, Helle Friis
with intervertebral disc calcification in Dachshund through a genome-wide association (GWA) study. Based on thorough radiographic examinations, 48 cases with >= 6 disc calcifications or surgically treated for disc herniation and 46 controls with 0-1 disc calcifications were identified. GWA using the Illumina Canine......HD BeadChip identified a locus on chromosome 12 from 36.8 to 38.6 Mb with 36 markers reaching genome-wide significance (P-genome = 0.00001-0.026). This study suggests that a major locus on chromosome 12 harbors genetic variations affecting the development of intervertebral disc calcification in Dachshund....
Hayden, Lystra P; Cho, Michael H; McDonald, Merry-Lynn N; Crapo, James D; Beaty, Terri H; Silverman, Edwin K; Hersh, Craig P
Previous studies have indicated that in adult smokers, a history of childhood pneumonia is associated with reduced lung function and chronic obstructive pulmonary disease. There have been few previous investigations using genome-wide association studies to investigate genetic predisposition to pneumonia. This study aims to identify the genetic variants associated with the development of pneumonia during childhood and over the course of the lifetime. Study subjects included current and former smokers with and without chronic obstructive pulmonary disease participating in the COPDGene Study. Pneumonia was defined by subject self-report, with childhood pneumonia categorized as having the first episode at pneumonia (843 cases, 9,091 control subjects) and lifetime pneumonia (3,766 cases, 5,659 control subjects) were performed separately in non-Hispanic whites and African Americans. Non-Hispanic white and African American populations were combined in the meta-analysis. Top genetic variants from childhood pneumonia were assessed in network analysis. No single-nucleotide polymorphisms reached genome-wide significance, although we identified potential regions of interest. In the childhood pneumonia analysis, this included variants in NGR1 (P = 6.3 × 10(-8)), PAK6 (P = 3.3 × 10(-7)), and near MATN1 (P = 2.8 × 10(-7)). In the lifetime pneumonia analysis, this included variants in LOC339862 (P = 8.7 × 10(-7)), RAPGEF2 (P = 8.4 × 10(-7)), PHACTR1 (P = 6.1 × 10(-7)), near PRR27 (P = 4.3 × 10(-7)), and near MCPH1 (P = 2.7 × 10(-7)). Network analysis of the genes associated with childhood pneumonia included top networks related to development, blood vessel morphogenesis, muscle contraction, WNT signaling, DNA damage, apoptosis, inflammation, and immune response (P ≤ 0.05). We have identified genes potentially associated with the risk of pneumonia. Further research will be required to confirm these
Nannya, Yasuhito; Taura, Kenjiro; Kurokawa, Mineo; Chiba, Shigeru; Ogawa, Seishi
.... We investigated the extent to which this trade-off limits the genome-wide power with this approach by simulating a large number of case-control panels based on the empirical data from the HapMap Project...
. Q16832. Homo sapiens. ROR1 HUMAN. Q01973. Homo sapiens. ROR2 HUMAN. Q01974. Homo sapiens. RYK HUMAN. P34925. Homo sapiens. INSR HUMAN. P06213. Homo sapiens. FGR1 DROME. Q07407. Drosophila melanogaster.
Receptor tyrosine kinases (RTKs) are key components of cell–cell signalling required for growth and development of multicellular organisms. It is therefore likely that the divergence of RTKs and associated components played a significant role in the evolution of multicellular organisms. We have carried out the present study ...
domain-based screening using Hidden Markov Models (HMMs) for RTKs in Genomescan predicted gene models of the Hydra magnipapillata genome .... its speed and minimal errors in tree topologies. A bootstrap test for all trees was ... according to the classification in RTKdb (Grassot et al. 2003). Another analysis along ...
Berardinelli, Ricardo Porto; Correa, Kleber Macedo; Moura Filho, Nelson Barboza de; Fernandez, Carlos Antonio [TRANSPETRO, Rio de Janeiro, RJ (Brazil); Matos, Jose Eduardo Nogueira de [PETROBRAS, Rio de Janeiro, RJ (Brazil)
The operation of liquefied natural gas (LNG) is pioneering within the PETROBRAS System. PETROBRAS Transporte - TRANSPETRO is going to operate two flexible LNG terminals, located in Ceara and Rio de Janeiro. In accordance with the Corporate Health, Safety and Environmental (HSE) Directive - Training, Education and Awareness, PETROBRAS Transporte S.A. - TRANSPETRO has prepared an action plan with the objective of ensuring the operational safety of the undertaking. Among other actions a training program for the emergency control of LNG will be inserted into the timetable. The above mentioned training program was held over a period of 20 hours, and was divided between theory and practice. In the theoretical part, the characteristics of the product, the history of accidents and the emergency response procedures were covered. In the practical part, 3000 gallons of LNG were utilized where the behavior of the product could be confirmed following a confined leak, thereby verifying the efficacy of the emergency control resources. The teaching process of the course was developed in the company through the preparation of specific procedures, emergency plans and the formation of internal instructors. (author)
Stahl, Eli A.; Raychaudhuri, Soumya; Remmers, Elaine F.; Xie, Gang; Eyre, Stephen; Thomson, Brian P.; Li, Yonghong; Kurreeman, Fina A. S.; Zhernakova, Alexandra; Hinks, Anne; Guiducci, Candace; Chen, Robert; Alfredsson, Lars; Amos, Christopher I.; Ardlie, Kristin G.; Barton, Anne; Bowes, John; Brouwer, Elisabeth; Burtt, Noel P.; Catanese, Joseph J.; Coblyn, Jonathan; Coenen, Marieke J. H.; Costenbader, Karen H.; Criswell, Lindsey A.; Crusius, J. Bart A.; Cui, Jing; de Bakker, Paul I. W.; De Jager, Philip L.; Ding, Bo; Emery, Paul; Flynn, Edward; Harrison, Pille; Hocking, Lynne J.; Huizinga, Tom W. J.; Kastner, Daniel L.; Ke, Xiayi; Lee, Annette T.; Liu, Xiangdong; Martin, Paul; Morgan, Ann W.; Padyukov, Leonid; Posthumus, Marcel D.; Radstake, Timothy R. D. J.; Reid, David M.; Seielstad, Mark; Seldin, Michael F.; Shadick, Nancy A.; Steer, Sophia; Tak, Paul P.; Thomson, Wendy; van der Helm-van Mil, Annette H. M.; van der Horst-Bruinsma, Irene E.; van der Schoot, C. Ellen; van Riel, Piet L. C. M.; Weinblatt, Michael E.; Wilson, Anthony G.; Wolbink, Gert Jan; Wordsworth, B. Paul; Wijmenga, Cisca; Karlson, Elizabeth W.; Toes, Rene E. M.; de Vries, Niek; Begovich, Ann B.; Worthington, Jane; Siminovitch, Katherine A.; Gregersen, Peter K.; Klareskog, Lars; Plenge, Robert M.
To identify new genetic risk factors for rheumatoid arthritis, we conducted a genome-wide association study meta-analysis of 5,539 autoantibody-positive individuals with rheumatoid arthritis (cases) and 20,169 controls of European descent, followed by replication in an independent set of 6,768
Hamzic, Edin; Bed'hom, Bertrand; Hérault, Frédéric
Use of genetic tools for improvement of host’s response is considered as a promising complementary approach for coccidiosis control. Therefore, we performed genome wide association study (GWAS) for response to Eimeria maxima challenge in broilers. The challenge was done on 2024 Cobb500 broilers. ...
Geller, Frank; Feenstra, Bjarke; Carstensen, Lisbeth
Hypospadias is a common congenital condition in boys in which the urethra opens on the underside of the penis. We performed a genome-wide association study on 1,006 surgery-confirmed hypospadias cases and 5,486 controls from Denmark. After replication genotyping of an additional 1,972 cases and 1...
Scott, Robert A; Scott, Laura J; Mägi, Reedik
To characterise type 2 diabetes (T2D) associated variation across the allele frequency spectrum, we conducted a meta-analysis of genome-wide association data from 26,676 T2D cases and 132,532 controls of European ancestry after imputation using the 1000 Genomes multi-ethnic reference panel. Promi...
Scott, Robert A; Scott, Laura J; Mägi, Reedik; Marullo, Letizia; Gaulton, Kyle J; Kaakinen, Marika; Pervjakova, Natalia; Pers, Tune H; Johnson, Andrew D; Eicher, John D; Jackson, Anne U; Ferreira, Teresa; Lee, Yeji; Ma, Clement; Steinthorsdottir, Valgerdur; Thorleifsson, Gudmar; Qi, Lu; Van Zuydam, Natalie R; Mahajan, Anubha; Chen, Han; Almgren, Peter; Voight, Ben F; Grallert, Harald; Müller-Nurasyid, Martina; Ried, Janina S; Rayner, William N; Robertson, Neil; Karssen, Lennart C; van Leeuwen, Elisabeth M; Willems, Sara M; Fuchsberger, Christian; Kwan, Phoenix; Teslovich, Tanya M; Chanda, Pritam; Li, Man; Lu, Yingchang; Dina, Christian; Thuillier, Dorothee; Yengo, Loic; Jiang, Longda; Sparso, Thomas; Kestler, Hans A; Chheda, Himanshu; Eisele, Lewin; Gustafsson, Stefan; Frånberg, Mattias; Strawbridge, Rona J; Benediktsson, Rafn; Hreidarsson, Astradur B; Kong, Augustine; Sigurðsson, Gunnar; Kerrison, Nicola D; Luan, Jian'an; Liang, Liming; Meitinger, Thomas; Roden, Michael; Thorand, Barbara; Esko, Tõnu; Mihailov, Evelin; Fox, Caroline; Liu, Ching-Ti; Rybin, Denis; Isomaa, Bo; Lyssenko, Valeriya; Tuomi, Tiinamaija; Couper, David J; Pankow, James S; Grarup, Niels; Have, Christian T; Jørgensen, Marit E; Jørgensen, Torben; Linneberg, Allan; Cornelis, Marilyn C; van Dam, Rob M; Hunter, David J; Kraft, Peter; Sun, Qi; Edkins, Sarah; Owen, Katharine R; Perry, John Rb; Wood, Andrew R; Zeggini, Eleftheria; Tajes-Fernandes, Juan; Abecasis, Goncalo R; Bonnycastle, Lori L; Chines, Peter S; Stringham, Heather M; Koistinen, Heikki A; Kinnunen, Leena; Sennblad, Bengt; Mühleisen, Thomas W; Nöthen, Markus M; Pechlivanis, Sonali; Baldassarre, Damiano; Gertow, Karl; Humphries, Steve E; Tremoli, Elena; Klopp, Norman; Meyer, Julia; Steinbach, Gerald; Wennauer, Roman; Eriksson, Johan G; Mӓnnistö, Satu; Peltonen, Leena; Tikkanen, Emmi; Charpentier, Guillaume; Eury, Elodie; Lobbens, Stéphane; Gigante, Bruna; Leander, Karin; McLeod, Olga; Bottinger, Erwin P; Gottesman, Omri; Ruderfer, Douglas; Blüher, Matthias; Kovacs, Peter; Tonjes, Anke; Maruthur, Nisa M; Scapoli, Chiara; Erbel, Raimund; Jöckel, Karl-Heinz; Moebus, Susanne; de Faire, Ulf; Hamsten, Anders; Stumvoll, Michael; Deloukas, Panagiotis; Donnelly, Peter J; Frayling, Timothy M; Hattersley, Andrew T; Ripatti, Samuli; Salomaa, Veikko; Pedersen, Nancy L; Boehm, Bernhard O; Bergman, Richard N; Collins, Francis S; Mohlke, Karen L; Tuomilehto, Jaakko; Hansen, Torben; Pedersen, Oluf; Barroso, Inês; Lannfelt, Lars; Ingelsson, Erik; Lind, Lars; Lindgren, Cecilia M; Cauchi, Stephane; Froguel, Philippe; Loos, Ruth Jf; Balkau, Beverley; Boeing, Heiner; Franks, Paul W; Gurrea, Aurelio Barricarte; Palli, Domenico; van der Schouw, Yvonne T|info:eu-repo/dai/nl/073449253; Altshuler, David; Groop, Leif C; Langenberg, Claudia; Wareham, Nicholas J; Sijbrands, Eric; van Duijn, Cornelia M; Florez, Jose C; Meigs, James B; Boerwinkle, Eric; Gieger, Christian; Strauch, Konstantin; Metspalu, Andres; Morris, Andrew D; Palmer, Colin Na; Hu, Frank B; Thorsteinsdottir, Unnur; Stefansson, Kari; Dupuis, Josée; Morris, Andrew P; Boehnke, Michael; McCarthy, Mark I; Prokopenko, Inga
To characterise type 2 diabetes (T2D) associated variation across the allele frequency spectrum, we conducted a meta-analysis of genome-wide association data from 26,676 T2D cases and 132,532 controls of European ancestry after imputation using the 1000 Genomes multi-ethnic reference panel.
Cerhan, James R.; Berndt, Sonja I.; Vijai, Joseph; Ghesquières, Hervé; McKay, James; Wang, Sophia S.; Wang, Zhaoming; Yeager, Meredith; Conde, Lucia; De Bakker, Paul I W; Nieters, Alexandra; Cox, David; Burdett, Laurie; Monnereau, Alain; Flowers, Christopher R.; De Roos, Anneclaire J.; Brooks-Wilson, Angela R.; Lan, Qing; Severi, Gianluca; Melbye, Mads; Gu, Jian; Jackson, Rebecca D.; Kane, Eleanor; Teras, Lauren R.; Purdue, Mark P.; Vajdic, Claire M.; Spinelli, John J.; Giles, Graham G.; Albanes, Demetrius; Kelly, Rachel S.; Zucca, Mariagrazia; Bertrand, Kimberly A.; Zeleniuch-Jacquotte, Anne; Lawrence, Charles; Hutchinson, Amy; Zhi, Degui; Habermann, Thomas M.; Link, Brian K.; Novak, Anne J.; Dogan, Ahmet; Asmann, Yan W.; Liebow, Mark; Thompson, Carrie A.; Ansell, Stephen M.; Witzig, Thomas E.; Weiner, George J.; Veron, Amelie S.; Zelenika, Diana; Tilly, Hervé; Haioun, Corinne; Molina, Thierry Jo; Hjalgrim, Henrik; Glimelius, Bengt; Adami, Hans Olov; Bracci, Paige M.; Riby, Jacques; Smith, Martyn T.; Holly, Elizabeth A.; Cozen, Wendy; Hartge, Patricia; Morton, Lindsay M.; Severson, Richard K.; Tinker, Lesley F.; North, Kari E.; Becker, Nikolaus; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Foretova, Lenka; Maynadie, Marc; Staines, Anthony; Lightfoot, Tracy; Crouch, Simon; Smith, Alex; Roman, Eve; Diver, W. Ryan; Offit, Kenneth; Zelenetz, Andrew; Klein, Robert J.; Villano, Danylo J.; Zheng, Tongzhang; Zhang, Yawei; Holford, Theodore R.; Kricker, Anne; Turner, Jenny; Southey, Melissa C.; Clavel, Jacqueline; Virtamo, Jarmo; Weinstein, Stephanie; Riboli, Elio; Vineis, Paolo; Kaaks, Rudolph; Trichopoulos, Dimitrios; Vermeulen, Roel C H; Boeing, Heiner; Tjonneland, Anne; Angelucci, Emanuele; Di Lollo, Simonetta; Rais, Marco; Birmann, Brenda M.; Laden, Francine; Giovannucci, Edward; Kraft, Peter; Huang, Jinyan; Ma, Baoshan; Ye, Yuanqing; Chiu, Brian C H; Sampson, Joshua; Liang, Liming; Park, Ju Hyun; Chung, Charles C.; Weisenburger, Dennis D.; Chatterjee, Nilanjan; Fraumeni, Joseph F.; Slager, Susan L.; Wu, Xifeng; De Sanjose, Silvia; Smedby, Karin E.; Salles, Gilles; Skibola, Christine F.; Rothman, Nathaniel; Chanock, Stephen J.
Diffuse large B cell lymphoma (DLBCL) is the most common lymphoma subtype and is clinically aggressive. To identify genetic susceptibility loci for DLBCL, we conducted a meta-analysis of 3 new genome-wide association studies (GWAS) and 1 previous scan, totaling 3,857 cases and 7,666 controls of
Urrestarazu, Jorge; Muranty, Hélène; Denancé, Caroline; Leforestier, Diane; Ravon, Elisa; Guyader, Arnaud; Guisnel, Rémi; Feugey, Laurence; Aubourg, Sébastien; Celton, Jean Marc; Daccord, Nicolas; Dondini, Luca; Gregori, Roberto; Lateur, Marc; Houben, Patrick; Ordidge, Matthew; Paprstein, Frantisek; Sedlak, Jiri; Nybom, Hilde; Garkava-Gustavsson, Larisa; Troggio, Michela; Bianco, Luca; Velasco, Riccardo; Poncet, Charles; Théron, Anthony; Moriya, Shigeki; Bink, Marco C.A.M.; Laurens, François; Tartarini, Stefano; Durel, Charles Eric
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes
Thomas Danielsen, E.; E. Møller, Morten; Yamanaka, Naoki
Steroid hormones control important developmental processes and are linked to many diseases. To systematically identify genes and pathways required for steroid production, we performed a Drosophila genome-wide in vivo RNAi screen and identified 1,906 genes with potential roles in steroidogenesis...
Khor, Chiea Chuen; Davila, Sonia; Breunis, Willemijn B.; Lee, Yi-Ching; Shimizu, Chisato; Wright, Victoria J.; Yeung, Rae S. M.; Tan, Dennis E. K.; Sim, Kar Seng; Wang, Jie Jin; Wong, Tien Yin; Pang, Junxiong; Mitchell, Paul; Cimaz, Rolando; Dahdah, Nagib; Cheung, Yiu-Fai; Huang, Guo-Ying; Yang, Wanling; Park, In-Sook; Lee, Jong-Keuk; Wu, Jer-Yuarn; Levin, Michael; Burns, Jane C.; Burgner, David; Kuijpers, Taco W.; Hibberd, Martin L.; Lau, Yu-Lung; Zhang, Jing; Ma, Xiao-Jing; Liu, Fang; Wu, Lin; Yoo, Jeong-Jin; Hong, Soo-Jong; Kim, Kwi-Joo; Kim, Jae-Jung; Park, Young-Mi; Mi Hong, Young; Sohn, Sejung; Young Jang, Gi; Ha, Kee-Soo; Nam, Hyo-Kyoung; Byeon, Jung-Hye; Weon Yun, Sin; Ki Han, Myung; Lee, Kyung-Yil; Hwang, Ja-Young; Rhim, Jung-Woo; Seob Song, Min; Lee, Hyoung-Doo; Kim, Dong Soo; Lee, Jae-Moo; Chang, Jeng-Sheng; Tsai, Fuu-Jen; Liang, Chi-Di; Chen, Ming-Ren; Chi, Hsin; Chiu, Nan-Chang; Huang, Fu-Yuan; Chang, Luan-Yin; Huang, Li-Min; Kuo, Ho-Chang; Huang, Kao-Pin; Lee, Meng-Luen; Hwang, Betau; Huang, Yhu-Chering; Lee, Pi-Chang; Odam, Miranda; Christiansen, Frank T.; Witt, Campbell; Goldwater, Paul; Curtis, Nigel; Palasanthiran, Pamela; Ziegler, John; Nissen, Michael; Nourse, Clare; Kuipers, Irene M.; Ottenkamp, Jaap J.; Geissler, Judy; Biezeveld, Maarten; Tacke, Carline; Filippini, Luc; Brogan, Paul; Klein, Nigel; Shah, Vanita; Dillon, Michael; Booy, Robert; Shingadia, Delane; Bose, Anu; Mukasa, Thomas; Tulloh, Robert; Michie, Colin; Newburger, Jane W.; Baker, Annette L.; Rowley, Anne H.; Shulman, Stanford T.; Mason, Wilbert; Takahashi, Masato; Melish, Marian E.; Tremoulet, Adriana H.; Viswanathan, Ananth; Rochtchina, Elena; Attia, John; Scott, Rodney; Holliday, Elizabeth; Harrap, Stephen
Kawasaki disease is a systemic vasculitis of unknown etiology, with clinical observations suggesting a substantial genetic contribution to disease susceptibility. We conducted a genome-wide association study and replication analysis in 2,173 individuals with Kawasaki disease and 9,383 controls from
Jensen, Majken Karoline; Pers, Tune Hannes; Dworzynski, Piotr
in genes associated with risk of coronary heart disease (CHD). Methods and Results-Genome-wide association analyses of approximately approximate to 700 000 single-nucleotide polymorphisms in 899 incident CHD cases and 1823 age-and sex-matched controls within the Nurses' Health and the Health Professionals...
Siggia Eric D
Full Text Available Abstract Background To explain the vastly different phenotypes exhibited by the same organism under different conditions, it is essential that we understand how the organism's genes are coordinately regulated. While there are many excellent tools for predicting sequences encoding proteins or RNA genes, few algorithms exist to predict regulatory sequences on a genome wide scale with no prior information. Results To identify motifs involved in the control of transcription, an algorithm was developed that searches upstream of operons for improbably frequent dimers. The algorithm was applied to the B. subtilis genome, which is predicted to encode for approximately 200 DNA binding proteins. The dimers found to be over-represented could be clustered into 317 distinct groups, each thought to represent a class of motifs uniquely recognized by some transcription factor. For each cluster of dimers, a representative weight matrix was derived and scored over the regions upstream of the operons to predict the sites recognized by the cluster's factor, and a putative regulon of the operons immediately downstream of the sites was inferred. The distribution in number of operons per predicted regulon is comparable to that for well characterized transcription factors. The most highly over-represented dimers matched σA, the T-box, and σW sites. We have evidence to suggest that at least 52 of our clusters of dimers represent actual regulatory motifs, based on the groups' weight matrix matches to experimentally characterized sites, the functional similarity of the component operons of the groups' regulons, and the positional biases of the weight matrix matches. All predictions are assigned a significance value, and thresholds are set to avoid false positives. Where possible, we examine our false negatives, drawing examples from known regulatory motifs and regulons inferred from RNA expression data. Conclusions We have demonstrated that in the case of B. subtilis
Wang, Hansong; Burnett, Terrilea; Kono, Suminori; Haiman, Christopher A; Iwasaki, Motoki; Wilkens, Lynne R; Loo, Lenora W M; Van Den Berg, David; Kolonel, Laurence N; Henderson, Brian E; Keku, Temitope O; Sandler, Robert S; Signorello, Lisa B; Blot, William J; Newcomb, Polly A; Pande, Mala; Amos, Christopher I; West, Dee W; Bézieau, Stéphane; Berndt, Sonja I; Zanke, Brent W; Hsu, Li; Lindor, Noralane M; Haile, Robert W; Hopper, John L; Jenkins, Mark A; Gallinger, Steven; Casey, Graham; Stenzel, Stephanie L; Schumacher, Fredrick R; Peters, Ulrike; Gruber, Stephen B; Tsugane, Shoichiro; Stram, Daniel O; Le Marchand, Loïc
The genetic basis of sporadic colorectal cancer (CRC) is not well explained by known risk polymorphisms. Here we perform a meta-analysis of two genome-wide association studies in 2,627 cases and 3,797 controls of Japanese ancestry and 1,894 cases and 4,703 controls of African ancestry, to identify genetic variants that contribute to CRC susceptibility. We replicate genome-wide statistically significant associations (Pvalue of association mapping in diverse populations.
Full Text Available Streptococci represent a diverse group of Gram-positive bacteria, which colonize a wide range of hosts among animals and humans. Streptococcal species occur as commensal as well as pathogenic organisms. Many of the pathogenic species can cause severe, invasive infections in their hosts leading to a high morbidity and mortality. The consequence is a tremendous suffering on the part of men and livestock besides the significant financial burden in the agricultural and healthcare sectors. An environmentally stimulated and tightly controlled expression of virulence factor genes is of fundamental importance for streptococcal pathogenicity. Bacterial small noncoding RNAs (sRNAs modulate the expression of genes involved in stress response, sugar metabolism, surface composition, and other properties that are related to bacterial virulence. Even though the regulatory character is shared by this class of RNAs, variation on the molecular level results in a high diversity of functional mechanisms. The knowledge about the role of sRNAs in streptococci is still limited, but in recent years, genome-wide screens for sRNAs have been conducted in an increasing number of species. Bioinformatics prediction approaches have been employed as well as expression analyses by classical array techniques or next generation sequencing. This review will give an overview of whole genome screens for sRNAs in streptococci with a focus on describing the different methods and comparing their outcome considering sRNA conservation among species, functional similarities, and relevance for streptococcal infection.
Mick, Eric; McGough, James; Deutsch, Curtis K; Frazier, Jean A; Kennedy, David; Goldberg, Robert J
Community samples suggest that approximately 1 in 20 children and adults exhibit clinically significant anger, hostility, and aggression. Individuals with dysregulated emotional control have a greater lifetime burden of psychiatric morbidity, severe impairment in role functioning, and premature mortality due to cardiovascular disease. With publically available data secured from dbGaP, we conducted a genome-wide association study of proneness to anger using the Spielberger State-Trait Anger Scale in the Atherosclerosis Risk in Communities (ARIC) study (n = 8,747). Subjects were, on average, 54 (range 45-64) years old at baseline enrollment, 47% (n = 4,117) were male, and all were of European descent by self-report. The mean Angry Temperament and Angry Reaction scores were 5.8 ± 1.8 and 7.6 ± 2.2. We observed a nominally significant finding (p = 2.9E-08, λ = 1.027 - corrected pgc = 2.2E-07, λ = 1.0015) on chromosome 6q21 in the gene coding for the non-receptor protein-tyrosine kinase, Fyn. Fyn interacts with NDMA receptors and inositol-1,4,5-trisphosphate (IP3)-gated channels to regulate calcium influx and intracellular release in the post-synaptic density. These results suggest that signaling pathways regulating intracellular calcium homeostasis, which are relevant to memory, learning, and neuronal survival, may in part underlie the expression of Angry Temperament.
Full Text Available BACKGROUND: Community samples suggest that approximately 1 in 20 children and adults exhibit clinically significant anger, hostility, and aggression. Individuals with dysregulated emotional control have a greater lifetime burden of psychiatric morbidity, severe impairment in role functioning, and premature mortality due to cardiovascular disease. METHODS: With publically available data secured from dbGaP, we conducted a genome-wide association study of proneness to anger using the Spielberger State-Trait Anger Scale in the Atherosclerosis Risk in Communities (ARIC study (n = 8,747. RESULTS: Subjects were, on average, 54 (range 45-64 years old at baseline enrollment, 47% (n = 4,117 were male, and all were of European descent by self-report. The mean Angry Temperament and Angry Reaction scores were 5.8 ± 1.8 and 7.6 ± 2.2. We observed a nominally significant finding (p = 2.9E-08, λ = 1.027 - corrected pgc = 2.2E-07, λ = 1.0015 on chromosome 6q21 in the gene coding for the non-receptor protein-tyrosine kinase, Fyn. CONCLUSIONS: Fyn interacts with NDMA receptors and inositol-1,4,5-trisphosphate (IP3-gated channels to regulate calcium influx and intracellular release in the post-synaptic density. These results suggest that signaling pathways regulating intracellular calcium homeostasis, which are relevant to memory, learning, and neuronal survival, may in part underlie the expression of Angry Temperament.
Full Text Available Plant organ development and polarity establishment is mediated by the action of several transcription factors. Among these, the KANADI (KAN subclade of the GARP protein family plays important roles in polarity-associated processes during embryo, shoot and root patterning. In this study, we have identified a set of potential direct target genes of KAN1 through a combination of chromatin immunoprecipitation/DNA sequencing (ChIP-Seq and genome-wide transcriptional profiling using tiling arrays. Target genes are over-represented for genes involved in the regulation of organ development as well as in the response to auxin. KAN1 affects directly the expression of several genes previously shown to be important in the establishment of polarity during lateral organ and vascular tissue development. We also show that KAN1 controls through its target genes auxin effects on organ development at different levels: transport and its regulation, and signaling. In addition, KAN1 regulates genes involved in the response to abscisic acid, jasmonic acid, brassinosteroids, ethylene, cytokinins and gibberellins. The role of KAN1 in organ polarity is antagonized by HD-ZIPIII transcription factors, including REVOLUTA (REV. A comparison of their target genes reveals that the REV/KAN1 module acts in organ patterning through opposite regulation of shared targets. Evidence of mutual repression between closely related family members is also shown.
Beekman, Marian; Blanché, Hélène; Perola, Markus
Clear evidence exists for heritability of human longevity, and much interest is focused on identifying genes associated with longer lives. To identify such longevity alleles, we performed the largest genome-wide linkage scan thus far reported. Linkage analyses included 2118 nonagenarian Caucasian.......95), chromosome 19p13.3-p13.11 (LOD = 3.76), and chromosome 19q13.11-q13.32 (LOD = 3.57). To fine map these regions linked to longevity, we performed association analysis using GWAS data in a subgroup of 1228 unrelated nonagenarian and 1907 geographically matched controls. Using a fixed-effect meta-analysis.......02 and P-value = 1.0 × 10(-5) , respectively. In the largest linkage scan thus far performed for human familial longevity, we confirm that the APOE locus is a longevity gene and that additional longevity loci may be identified at 14q11.2, 17q12-q22, and 19p13.3-p13.11. As the latter linkage results...
Pappa, Irene; St Pourcain, Beate; Benke, Kelly; Cavadino, Alana; Hakulinen, Christian; Nivard, Michel G; Nolte, Ilja M; Tiesler, Carla M T; Bakermans-Kranenburg, Marian J; Davies, Gareth E; Evans, David M; Geoffroy, Marie-Claude; Grallert, Harald; Groen-Blokhuis, Maria M; Hudziak, James J; Kemp, John P; Keltikangas-Järvinen, Liisa; McMahon, George; Mileva-Seitz, Viara R; Motazedi, Ehsan; Power, Christine; Raitakari, Olli T; Ring, Susan M; Rivadeneira, Fernando; Rodriguez, Alina; Scheet, Paul A; Seppälä, Ilkka; Snieder, Harold; Standl, Marie; Thiering, Elisabeth; Timpson, Nicholas J; Veenstra, René; Velders, Fleur P; Whitehouse, Andrew J O; Smith, George Davey; Heinrich, Joachim; Hypponen, Elina; Lehtimäki, Terho; Middeldorp, Christel M; Oldehinkel, Albertine J; Pennell, Craig E; Boomsma, Dorret I; Tiemeier, Henning
Individual differences in aggressive behavior emerge in early childhood and predict persisting behavioral problems and disorders. Studies of antisocial and severe aggression in adulthood indicate substantial underlying biology. However, little attention has been given to genome-wide approaches of aggressive behavior in children. We analyzed data from nine population-based studies and assessed aggressive behavior using well-validated parent-reported questionnaires. This is the largest sample exploring children's aggressive behavior to date (N = 18,988), with measures in two developmental stages (N = 15,668 early childhood and N = 16,311 middle childhood/early adolescence). First, we estimated the additive genetic variance of children's aggressive behavior based on genome-wide SNP information, using genome-wide complex trait analysis (GCTA). Second, genetic associations within each study were assessed using a quasi-Poisson regression approach, capturing the highly right-skewed distribution of aggressive behavior. Third, we performed meta-analyses of genome-wide associations for both the total age-mixed sample and the two developmental stages. Finally, we performed a gene-based test using the summary statistics of the total sample. GCTA quantified variance tagged by common SNPs (10-54%). The meta-analysis of the total sample identified one region in chromosome 2 (2p12) at near genome-wide significance (top SNP rs11126630, P = 5.30 × 10(-8) ). The separate meta-analyses of the two developmental stages revealed suggestive evidence of association at the same locus. The gene-based analysis indicated association of variation within AVPR1A with aggressive behavior. We conclude that common variants at 2p12 show suggestive evidence for association with childhood aggression. Replication of these initial findings is needed, and further studies should clarify its biological meaning. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Klein Robert J
Full Text Available Abstract Background Genome-wide association studies are a promising new tool for deciphering the genetics of complex diseases. To choose the proper sample size and genotyping platform for such studies, power calculations that take into account genetic model, tag SNP selection, and the population of interest are required. Results The power of genome-wide association studies can be computed using a set of tag SNPs and a large number of genotyped SNPs in a representative population, such as available through the HapMap project. As expected, power increases with increasing sample size and effect size. Power also depends on the tag SNPs selected. In some cases, more power is obtained by genotyping more individuals at fewer SNPs than fewer individuals at more SNPs. Conclusion Genome-wide association studies should be designed thoughtfully, with the choice of genotyping platform and sample size being determined from careful power calculations.
Full Text Available Although great progress in genome-wide association studies (GWAS has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked. The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001
Rose K Lai
Full Text Available Few studies had investigated genome-wide methylation in glioblastoma multiforme (GBM. Our goals were to study differential methylation across the genome in gene promoters using an array-based method, as well as repetitive elements using surrogate global methylation markers. The discovery sample set for this study consisted of 54 GBM from Columbia University and Case Western Reserve University, and 24 brain controls from the New York Brain Bank. We assembled a validation dataset using methylation data of 162 TCGA GBM and 140 brain controls from dbGAP. HumanMethylation27 Analysis Bead-Chips (Illumina were used to interrogate 26,486 informative CpG sites in both the discovery and validation datasets. Global methylation levels were assessed by analysis of L1 retrotransposon (LINE1, 5 methyl-deoxycytidine (5m-dC and 5 hydroxylmethyl-deoxycytidine (5hm-dC in the discovery dataset. We validated a total of 1548 CpG sites (1307 genes that were differentially methylated in GBM compared to controls. There were more than twice as many hypomethylated genes as hypermethylated ones. Both the discovery and validation datasets found 5 tumor methylation classes. Pathway analyses showed that the top ten pathways in hypomethylated genes were all related to functions of innate and acquired immunities. Among hypermethylated pathways, transcriptional regulatory network in embryonic stem cells was the most significant. In the study of global methylation markers, 5m-dC level was the best discriminant among methylation classes, whereas in survival analyses, high level of LINE1 methylation was an independent, favorable prognostic factor in the discovery dataset. Based on a pathway approach, hypermethylation in genes that control stem cell differentiation were significant, poor prognostic factors of overall survival in both the discovery and validation datasets. Approaches that targeted these methylated genes may be a future therapeutic goal.
Lai, Rose K; Chen, Yanwen; Guan, Xiaowei; Nousome, Darryl; Sharma, Charu; Canoll, Peter; Bruce, Jeffrey; Sloan, Andrew E; Cortes, Etty; Vonsattel, Jean-Paul; Su, Tao; Delgado-Cruzata, Lissette; Gurvich, Irina; Santella, Regina M; Ostrom, Quinn; Lee, Annette; Gregersen, Peter; Barnholtz-Sloan, Jill
Few studies had investigated genome-wide methylation in glioblastoma multiforme (GBM). Our goals were to study differential methylation across the genome in gene promoters using an array-based method, as well as repetitive elements using surrogate global methylation markers. The discovery sample set for this study consisted of 54 GBM from Columbia University and Case Western Reserve University, and 24 brain controls from the New York Brain Bank. We assembled a validation dataset using methylation data of 162 TCGA GBM and 140 brain controls from dbGAP. HumanMethylation27 Analysis Bead-Chips (Illumina) were used to interrogate 26,486 informative CpG sites in both the discovery and validation datasets. Global methylation levels were assessed by analysis of L1 retrotransposon (LINE1), 5 methyl-deoxycytidine (5m-dC) and 5 hydroxylmethyl-deoxycytidine (5hm-dC) in the discovery dataset. We validated a total of 1548 CpG sites (1307 genes) that were differentially methylated in GBM compared to controls. There were more than twice as many hypomethylated genes as hypermethylated ones. Both the discovery and validation datasets found 5 tumor methylation classes. Pathway analyses showed that the top ten pathways in hypomethylated genes were all related to functions of innate and acquired immunities. Among hypermethylated pathways, transcriptional regulatory network in embryonic stem cells was the most significant. In the study of global methylation markers, 5m-dC level was the best discriminant among methylation classes, whereas in survival analyses, high level of LINE1 methylation was an independent, favorable prognostic factor in the discovery dataset. Based on a pathway approach, hypermethylation in genes that control stem cell differentiation were significant, poor prognostic factors of overall survival in both the discovery and validation datasets. Approaches that targeted these methylated genes may be a future therapeutic goal.
María José Aranzana
Full Text Available There is currently tremendous interest in the possibility of using genome-wide association mapping to identify genes responsible for natural variation, particularly for human disease susceptibility. The model plant Arabidopsis thaliana is in many ways an ideal candidate for such studies, because it is a highly selfing hermaphrodite. As a result, the species largely exists as a collection of naturally occurring inbred lines, or accessions, which can be genotyped once and phenotyped repeatedly. Furthermore, linkage disequilibrium in such a species will be much more extensive than in a comparable outcrossing species. We tested the feasibility of genome-wide association mapping in A. thaliana by searching for associations with flowering time and pathogen resistance in a sample of 95 accessions for which genome-wide polymorphism data were available. In spite of an extremely high rate of false positives due to population structure, we were able to identify known major genes for all phenotypes tested, thus demonstrating the potential of genome-wide association mapping in A. thaliana and other species with similar patterns of variation. The rate of false positives differed strongly between traits, with more clinal traits showing the highest rate. However, the false positive rates were always substantial regardless of the trait, highlighting the necessity of an appropriate genomic control in association studies.
Song, Chi; Chen, Gary K.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; Chanock, Stephen J.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Chris A.; Stram, Daniel O.
Genome-wide association studies (GWAS) simultaneously investigating hundreds of thousands of single nucleotide polymorphisms (SNP) have become a powerful tool in the investigation of new disease susceptibility loci. Haplotypes are sometimes thought to be superior to SNPs and are promising in genetic association analyses. The application of genome-wide haplotype analysis, however, is hindered by the complexity of haplotypes themselves and sophistication in computation. We systematically analyzed the haplotype effects for breast cancer risk among 5,761 African American women (3,016 cases and 2,745 controls) using a sliding window approach on the genome-wide scale. Three regions on chromosomes 1, 4 and 18 exhibited moderate haplotype effects. Furthermore, among 21 breast cancer susceptibility loci previously established in European populations, 10p15 and 14q24 are likely to harbor novel haplotype effects. We also proposed a heuristic of determining the significance level and the effective number of independent tests by the permutation analysis on chromosome 22 data. It suggests that the effective number was approximately half of the total (7,794 out of 15,645), thus the half number could serve as a quick reference to evaluating genome-wide significance if a similar sliding window approach of haplotype analysis is adopted in similar populations using similar genotype density. PMID:23468962
Full Text Available Genome-wide association studies (GWAS simultaneously investigating hundreds of thousands of single nucleotide polymorphisms (SNP have become a powerful tool in the investigation of new disease susceptibility loci. Haplotypes are sometimes thought to be superior to SNPs and are promising in genetic association analyses. The application of genome-wide haplotype analysis, however, is hindered by the complexity of haplotypes themselves and sophistication in computation. We systematically analyzed the haplotype effects for breast cancer risk among 5,761 African American women (3,016 cases and 2,745 controls using a sliding window approach on the genome-wide scale. Three regions on chromosomes 1, 4 and 18 exhibited moderate haplotype effects. Furthermore, among 21 breast cancer susceptibility loci previously established in European populations, 10p15 and 14q24 are likely to harbor novel haplotype effects. We also proposed a heuristic of determining the significance level and the effective number of independent tests by the permutation analysis on chromosome 22 data. It suggests that the effective number was approximately half of the total (7,794 out of 15,645, thus the half number could serve as a quick reference to evaluating genome-wide significance if a similar sliding window approach of haplotype analysis is adopted in similar populations using similar genotype density.
Shi, Zhen; Fujii, Kotaro; Kovary, Kyle M; Genuth, Naomi R; Röst, Hannes L; Teruel, Mary N; Barna, Maria
Emerging studies have linked the ribosome to more selective control of gene regulation. However, an outstanding question is whether ribosome heterogeneity at the level of core ribosomal proteins (RPs) exists and enables ribosomes to preferentially translate specific mRNAs genome-wide. Here, we measured the absolute abundance of RPs in translating ribosomes and profiled transcripts that are enriched or depleted from select subsets of ribosomes within embryonic stem cells. We find that heterogeneity in RP composition endows ribosomes with differential selectivity for translating subpools of transcripts, including those controlling metabolism, cell cycle, and development. As an example, mRNAs enriched in binding to RPL10A/uL1-containing ribosomes are shown to require RPL10A/uL1 for their efficient translation. Within several of these transcripts, this level of regulation is mediated, at least in part, by internal ribosome entry sites. Together, these results reveal a critical functional link between ribosome heterogeneity and the post-transcriptional circuitry of gene expression. Copyright © 2017 Elsevier Inc. All rights reserved.
van Es, Michael A.; Veldink, Jan H.; Saris, Christiaan G. J.; Blauw, Hylke M.; van Vught, Paul W. J.; Birve, Anna; Lemmens, Robin; Schelhaas, Helenius J.; Groen, Ewout J. N.; Huisman, Mark H. B.; van der Kooi, Anneke J.; de Visser, Marianne; Dahlberg, Caroline; Estrada, Karol; Rivadeneira, Fernando; Hofman, Albert; Zwarts, Machiel J.; van Doormaal, Perry T. C.; Rujescu, Dan; Strengman, Eric; Giegling, Ina; Muglia, Pierandrea; Tomik, Barbara; Slowik, Agnieszka; Uitterlinden, Andre G.; Hendrich, Corinna; Waibel, Stefan; Meyer, Thomas; Ludolph, Albert C.; Glass, Jonathan D.; Purcell, Shaun; Cichon, Sven; Nöthen, Markus M.; Wichmann, H.-Erich; Schreiber, Stefan; Vermeulen, Sita H. H. M.; Kiemeney, Lambertus A.; Wokke, John H. J.; Cronin, Simon; McLaughlin, Russell L.; Hardiman, Orla; Fumoto, Katsumi; Pasterkamp, R. Jeroen; Meininger, Vincent; Melki, Judith; Leigh, P. Nigel; Shaw, Christopher E.; Landers, John E.; Al-Chalabi, Ammar; Brown, Robert H.; Robberecht, Wim; Andersen, Peter M.; Ophoff, Roel A.; van den Berg, Leonard H.
We conducted a genome-wide association study among 2,323 individuals with sporadic amyotrophic lateral sclerosis (ALS) and 9,013 control subjects and evaluated all SNPs with P <1.0 x 10(-4) in a second, independent cohort of 2,532 affected individuals and 5,940 controls. Analysis of the genome-wide
Transcription regulation is important for nearly all cellular processes. To understand how transcription is regulated by different regulatory complexes, DNA microarray expression analysis is used to determine the genome-wide changes in mRNA levels upon deletion of individual factors that belong to
Karlsen, Tom H.; Franke, Andre; Melum, Espen; Kaser, Arthur; Hov, Johannes Roksund; Balschun, Tobias; Lie, Benedicte A.; Bergquist, Annika; Schramm, Christoph; Weismueller, Tobias J.; Gotthardt, Daniel; Rust, Christian; Philipp, Eva E. R.; Fritz, Teresa; Henckaerts, Liesbet; Weersma, Rinse K.; Stokkers, Pieter; Ponsioen, Cyriel Y.; Wijmenga, Cisca; Sterneck, Martina; Nothnagel, Michael; Hampe, Jochen; Teufel, Andreas; Runz, Heiko; Rosenstiel, Philip; Stiehl, Adolf; Vermeire, Severine; Beuers, Ulrich; Manns, Michael P.; Schrumpf, Erik; Boberg, Kirsten Muri; Schreiber, Stefan
BACKGROUND & AIMS: We aimed to characterize the genetic susceptibility to primary sclerosing cholangitis (PSC) by means of a genome-wide association analysis of single nucleotide polymorphism (SNP) markers. METHODS: A total of 443,816 SNPs on the Affymetrix SNP Array 5.0 (Affymetrix, Santa Clara,
Ripke, Stephan; Sanders, Alan R.; Kendler, Kenneth S.; Levinson, Douglas F.; Sklar, Pamela; Holmans, Peter A.; Lin, Dan-Yu; Duan, Jubao; Ophoff, Roel A.; Andreassen, Ole A.; Scolnick, Edward; Cichon, Sven; Clair, David St.; Corvin, Aiden; Gurling, Hugh; Werge, Thomas; Rujescu, Dan; Blackwood, Douglas H. R.; Pato, Carlos N.; Malhotra, Anil K.; Purcell, Shaun; Dudbridge, Frank; Neale, Benjamin M.; Rossin, Lizzy; Visscher, Peter M.; Posthuma, Danielle; Ruderfer, Douglas M.; Fanous, Ayman; Stefansson, Hreinn; Steinberg, Stacy; Mowry, Bryan J.; Golimbet, Vera; De Hert, Marc; Jonsson, Erik G.; Bitter, Istvan; Pietilainen, Olli P. H.; Collier, David A.; Tosato, Sarah; Agartz, Ingrid; Albus, Margot; Alexander, Madeline; Amdur, Richard L.; Amin, Farooq; Bass, Nicholas; Bergen, Sarah E.; Black, Donald W.; Borglum, Anders D.; Brown, Matthew A.; Bruggeman, Richard; Buccola, Nancy G.; Byerley, William F.; Cahn, Wiepke; Cantor, Rita M.; Carr, Vaughan J.; Catts, Stanley V.; Choudhury, Khalid; Cloninger, C. Robert; Cormican, Paul; Craddock, Nicholas; Danoy, Patrick A.; Datta, Susmita; De Haan, Lieuwe; Demontis, Ditte; Dikeos, Dimitris; Djurovic, Srdjan; Donnelly, Peter; Donohoe, Gary; Duong, Linh; Dwyer, Sarah; Fink-Jensen, Anders; Freedman, Robert; Freimer, Nelson B.; Friedl, Marion; Georgieva, Lyudmila; Giegling, Ina; Gill, Michael; Glenthoj, Birte; Godard, Stephanie; Hamshere, Marian; Hansen, Mark; Hansen, Thomas; Hartmann, Annette M.; Henskens, Frans A.; Hougaard, David M.; Hultman, Christina M.; Ingason, Andres; Jablensky, Assen V.; Jakobsen, Klaus D.; Jay, Maurice; Juergens, Gesche; Kahn, Renes; Keller, Matthew C.; Kenis, Gunter; Kenny, Elaine; Kim, Yunjung; Kirov, George K.; Konnerth, Heike; Konte, Bettina; Krabbendam, Lydia; Krasucki, Robert; Lasseter, Virginia K.; Laurent, Claudine; Lawrence, Jacob; Lencz, Todd; Lerer, F. Bernard; Liang, Kung-Yee; Lichtenstein, Paul; Lieberman, Jeffrey A.; Linszen, Don H.; Lonnqvist, Jouko; Loughland, Carmel M.; Maclean, Alan W.; Maher, Brion S.; Maier, Wolfgang; Mallet, Jacques; Malloy, Pat; Mattheisen, Manuel; Mattingsdal, Morten; McGhee, Kevin A.; McGrath, John J.; McIntosh, Andrew; McLean, Duncan E.; McQuillin, Andrew; Melle, Ingrid; Michie, Patricia T.; Milanova, Vihra; Morris, Derek W.; Mors, Ole; Mortensen, Preben B.; Moskvina, Valentina; Muglia, Pierandrea; Myin-Germeys, Inez; Nertney, Deborah A.; Nestadt, Gerald; Nielsen, Jimmi; Nikolov, Ivan; Nordentoft, Merete; Norton, Nadine; Noethen, Markus M.; O'Dushlaine, Colm T.; Olincy, Ann; Olsen, Line; O'Neill, F. Anthony; Orntoft, Torben F.; Owen, Michael J.; Pantelis, Christos; Papadimitriou, George; Pato, Michele T.; Peltonen, Leena; Petursson, Hannes; Pickard, Ben; Pimm, Jonathan; Pulver, Ann E.; Puri, Vinay; Quested, Digby; Quinn, Emma M.; Rasmussen, Henrik B.; Rethelyi, Janos M.; Ribble, Robert; Rietschel, Marcella; Riley, Brien P.; Ruggeri, Mirella; Schall, Ulrich; Schulze, Thomas G.; Schwab, Sibylle G.; Scott, Rodney J.; Shi, Jianxin; Sigurdsson, Engilbert; Silverman, Jeremy M.; Spencer, Chris C. A.; Stefansson, Kari; Strange, Amy; Strengman, Eric; Stroup, T. Scott; Suvisaari, Jaana; Terenius, Lars; Thirumalai, Srinivasa; Thygesen, Johan H.; Timm, Sally; Toncheva, Draga; van den Oord, Edwin; van Os, Jim; van Winkel, Ruud; Veldink, Jan; Walsh, Dermot; Wang, August G.; Wiersma, Durk; Wildenauer, Dieter B.; Williams, Hywel J.; Williams, Nigel M.; Wormley, Brandon; Zammit, Stan; Sullivan, Patrick F.; O'Donovan, Michael C.; Daly, Mark J.; Gejman, Pablo V.
We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded
Microarray based comparative genome-wide expression profiling of major subtypes of leukemia. ... similar patterns of result in terms of gene expression but it demonstrates statistically significant relationship only among CML and ALL which are of myeloid and lymphoid origin, respectively, in contrast to other combinations.
Smith, Paula; McGuffog, Lesley; Easton, Douglas F.; Mann, Graham J.; Pupo, Gulietta M.; Newman, Beth; Chenevix-Trench, Georgia; Szabo, Csilla; Southey, Melissa; Renard, Hélène; Odefrey, Fabrice; Lynch, Henry; Stoppa-Lyonnet, Dominique; Couch, Fergus; Hopper, John L.; Giles, Graham G.; McCredie, Margaret R. E.; Buys, Saundra; Andrulis, Irene; Senie, Ruby; Goldgar, David E.; Oldenburg, Rogier; Kroeze-Jansema, Karin; Kraan, Jaennelle; Meijers-Heijboer, Hanne; Klijn, Jan G. M.; van Asperen, Christi; van Leeuwen, Inge; Vasen, Hans F. A.; Cornelisse, Cees J.; Devilee, Peter; Baskcomb, Linda; Seal, Sheila; Barfoot, Rita; Mangion, Jon; Hall, Anita; Edkins, Sarah; Rapley, Elizabeth; Wooster, Richard; Chang-Claude, Jenny; Eccles, Diana; Evans, D. Gareth; Futreal, P. Andrew; Nathanson, Katherine L.; Weber, Barbara L.; Rahman, Nazneen; Stratton, Michael R.
Mutations in known breast cancer susceptibility genes account for a minority of the familial aggregation of the disease. To search for further breast cancer susceptibility genes, we performed a combined analysis of four genome-wide linkage screens, which included a total of 149 multiple case breast
Luyf, A. C. M.; de Gast, J.; van Kampen, A. H. C.
Motivation: To enhance the exploration of gene expression data in a metabolic context, one requires an application that allows the integration of this data and which represents this data in a (genome-wide) metabolic map. The layout of this metabolic map must be highly flexible to enable discoveries
S. Walter (Stefan); G. Atzmon (Gil); E.W. Demerath (Ellen); M. Garcia (Melissa); R.C. Kaplan (Robert); M. Kumari (Meena); K.L. Lunetta (Kathryn); Y. Milaneschi (Yuri); T. Tanaka (Toshiko); G.J. Tranah (Gregory); U. Völker (Uwe); L. Yu (Lei); A.M. Arnold (Alice); E.J. Benjamin (Emelia); R. Biffar (Reiner); A.S. Buchman (Aron); E.A. Boerwinkle (Eric); D. Couper (David); P.L. de Jager (Philip); D.A. Evans (Denis); T.B. Harris (Tamara); W. Hoffmann (Wolfgang); A. Hofman (Albert); D. Karasik (David); D.P. Kiel (Douglas); T. Kocher (Thomas); M. Kuningas (Maris); L.J. Launer (Lenore); K. Lohman (Kurt); P.L. Lutsey (Pamela); J.P. Mackenbach (Johan); K. Marciante (Kristin); B.M. Psaty (Bruce); E.M. Reiman (Eric); J.I. Rotter (Jerome); S. Seshadri (Sudha); M.D. Shardell (Michelle); A.V. Smith (Albert Vernon); P. Tikka-Kleemola (Päivi); J. Walston (Jeremy); M.C. Zillikens (Carola); S. Bandinelli (Stefania); S.E. Baumeister (Sebastian); D.A. Bennett (David); L. Ferrucci (Luigi); V. Gudnason (Vilmundur); M. Kivimaki (Mika); Y. Liu (Yongmei); J. Murabito (Joanne); A.B. Newman (Anne); H.W. Tiemeier (Henning); N. Franceschini (Nora)
textabstractHuman longevity and healthy aging show moderate heritability (20%-50%). We conducted a meta-analysis of genome-wide association studies from 9 studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for 2 outcomes: (1) all-cause mortality, and (2)
Beurden, van S.J.; Peeters, B.P.H.; Rottier, P.J.M.; Davison, A.A.; Engelsma, M.Y.
Background Whereas temporal gene expression in mammalian herpesviruses has been studied extensively, little is known about gene expression in fish herpesviruses. Here we report a genome-wide transcription analysis of a fish herpesvirus, anguillid herpesvirus 1, in cell culture, studied during the
Genome-wide patterns of variation across individuals provide most powerful source of data for uncovering the history of migration, expansion, and adaptation of the human population. The arrival of new technologies that type more than millions of the single nucleotide polymorphisms (SNPs) in a single experiment has ...
Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y|info:eu-repo/dai/nl/369207084; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, Annemarie|info:eu-repo/dai/nl/191430129; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M
Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14
Lachmeijer, AMA; Arngrimsson, R; Bastiaans, EJ; Frigge, ML; Pals, G; Sigurdardottir, S; Stefansson, H; Palsson, B; Nicolae, D; Kong, A; Aarnoudse, JG; Gulcher, [No Value; Dekker, GA; ten Kate, LP; Stefansson, K
Preeclampsia, hallmarked by de novo hypertension and proteinuria in pregnancy, has a familial tendency. Recently, a large Icelandic genome-wide scan provided evidence for a maternal susceptibility locus for preeclampsia on chromosome 2p13 which was confirmed by a genome scan from Australia and New
Verkouteren, Joris A. C.; Hofman, Albert; Uitterlinden, André G.; Kraft, Peter; Turman, Constance; Han, Jiali; Cho, Eunyoung; Murabito, Joanne M.; Levy, Daniel; Qureshi, Abrar A.; Nijsten, Tamar
There is strong evidence for a role of environmental risk factors involved in susceptibility to develop multiple keratinocyte cancers (mKCs), but whether genes are also involved in mKCs susceptibility has not been thoroughly investigated. We investigated whether single nucleotide polymorphisms (SNPs) are associated with susceptibility for mKCs. A genome-wide association study (GWAS) of 1,666 cases with mKCs and 1,950 cases with single KC (sKCs; controls) from Harvard cohorts (the Nurses' Health Study [NHS], NHS II, and the Health Professionals Follow-Up Study) and the Framingham Heart Study was carried-out using over 8 million SNPs (stage-1). We sought to replicate the most significant statistical associations (p-value≤ 5.5x10-6) in an independent cohort of 574 mKCs and 872 sKCs from the Rotterdam Study. In the discovery stage, 40 SNPs with suggestive associations (p-value ≤5.5x10-6) were identified, with eight independent SNPs tagging all 40 SNPs. The most significant SNP was located at chromosome 9 (rs7468390; p-value = 3.92x10-7). In stage-2, none of these SNPs replicated and only two of them were associated with mKCs in the same direction in the combined meta-analysis. We tested the associations for 19 previously reported basal cell carcinoma-related SNPs (candidate gene association analysis), and found that rs1805007 (MC1R locus) was significantly associated with risk of mKCs (p-value = 2.80x10-4). Although the suggestive SNPs with susceptibility for mKCs were not replicated, we found that previously identified BCC variants may also be associated with mKC, which the most significant association (rs1805007) located at the MC1R gene. PMID:28081215
Luba M Pardo
Full Text Available There is strong evidence for a role of environmental risk factors involved in susceptibility to develop multiple keratinocyte cancers (mKCs, but whether genes are also involved in mKCs susceptibility has not been thoroughly investigated. We investigated whether single nucleotide polymorphisms (SNPs are associated with susceptibility for mKCs. A genome-wide association study (GWAS of 1,666 cases with mKCs and 1,950 cases with single KC (sKCs; controls from Harvard cohorts (the Nurses' Health Study [NHS], NHS II, and the Health Professionals Follow-Up Study and the Framingham Heart Study was carried-out using over 8 million SNPs (stage-1. We sought to replicate the most significant statistical associations (p-value≤ 5.5x10-6 in an independent cohort of 574 mKCs and 872 sKCs from the Rotterdam Study. In the discovery stage, 40 SNPs with suggestive associations (p-value ≤5.5x10-6 were identified, with eight independent SNPs tagging all 40 SNPs. The most significant SNP was located at chromosome 9 (rs7468390; p-value = 3.92x10-7. In stage-2, none of these SNPs replicated and only two of them were associated with mKCs in the same direction in the combined meta-analysis. We tested the associations for 19 previously reported basal cell carcinoma-related SNPs (candidate gene association analysis, and found that rs1805007 (MC1R locus was significantly associated with risk of mKCs (p-value = 2.80x10-4. Although the suggestive SNPs with susceptibility for mKCs were not replicated, we found that previously identified BCC variants may also be associated with mKC, which the most significant association (rs1805007 located at the MC1R gene.
Full Text Available Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions. The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila-like 2] gene located on 9p21.3 (p = 0.00087. In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026. The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology.
McQueen, Matthew B; Boardman, Jason D; Domingue, Benjamin W; Smolen, Andrew; Tabor, Joyce; Killeya-Jones, Ley; Halpern, Carolyn T; Whitsel, Eric A; Harris, Kathleen Mullan
Here we provide a detailed description of the genome-wide information available on the National Longitudinal Study of Adolescent to Adult Health (Add Health) sibling pair subsample (Harris et al. in Twin Res Hum Genet 16:391-398, 2013). A total of 2,020 samples were genotyped (including duplicates) arising from 1946 Add Health individuals from the sibling pairs subsample. After various steps for quality control (QC) and quality assurance (QA), we have high quality genome-wide data available on 1,888 individuals. In this report, we first highlight the QC and QA steps that were taken to prune the data of poorly performing samples and genetic markers. We further estimate the pairwise biological relationships using genome-wide data and compare those estimates to the assumed relationships in Add Health. Additionally, using genome-wide data from known regional reference populations from Europe, West Africa, North and South America, Japan and China, we estimate the relative genetic ancestry of the respondents. Finally, rather than conducting a traditional cross-sectional genome-wide association study (GWAS) of body mass index (BMI), we opted to utilize the extensive publicly available genome-wide information to conduct a weighted GWAS of longitudinal BMI while accounting for both family and ethnic variation.
Full Text Available Sciatica or the sciatic syndrome is a common and often disabling low back disorder in the working-age population. It has a relatively high heritability but poorly understood molecular mechanisms. The Finnish population is a genetic isolate where small founder population and bottleneck events have led to enrichment of certain rare and low frequency variants. We performed here the first genome-wide association (GWAS and meta-analysis of sciatica. The meta-analysis was conducted across two GWAS covering 291 Finnish sciatica cases and 3671 controls genotyped and imputed at 7.7 million autosomal variants. The most promising loci (p<1x10-6 were replicated in 776 Finnish sciatica patients and 18,489 controls. We identified five intragenic variants, with relatively low frequencies, at two novel loci associated with sciatica at genome-wide significance. These included chr9:14344410:I (rs71321981 at 9p22.3 (NFIB gene; p = 1.30x10-8, MAF = 0.08 and four variants at 15q21.2: rs145901849, rs80035109, rs190200374 and rs117458827 (MYO5A; p = 1.34x10-8, MAF = 0.06; p = 2.32x10-8, MAF = 0.07; p = 3.85x10-8, MAF = 0.06; p = 4.78x10-8, MAF = 0.07, respectively. The most significant association in the meta-analysis, a single base insertion rs71321981 within the regulatory region of the transcription factor NFIB, replicated in an independent Finnish population sample (p = 0.04. Despite identifying 15q21.2 as a promising locus, we were not able to replicate it. It was differentiated; the lead variants within 15q21.2 were more frequent in Finland (6-7% than in other European populations (1-2%. Imputation accuracies of the three significantly associated variants (chr9:14344410:I, rs190200374, and rs80035109 were validated by genotyping. In summary, our results suggest a novel locus, 9p22.3 (NFIB, which may be involved in susceptibility to sciatica. In addition, another locus, 15q21.2, emerged as a promising one, but failed to replicate.
Lemmelä, Susanna; Solovieva, Svetlana; Shiri, Rahman; Benner, Christian; Heliövaara, Markku; Kettunen, Johannes; Anttila, Verneri; Ripatti, Samuli; Perola, Markus; Seppälä, Ilkka; Juonala, Markus; Kähönen, Mika; Salomaa, Veikko; Viikari, Jorma; Raitakari, Olli T; Lehtimäki, Terho; Palotie, Aarno; Viikari-Juntura, Eira; Husgafvel-Pursiainen, Kirsti
Sciatica or the sciatic syndrome is a common and often disabling low back disorder in the working-age population. It has a relatively high heritability but poorly understood molecular mechanisms. The Finnish population is a genetic isolate where small founder population and bottleneck events have led to enrichment of certain rare and low frequency variants. We performed here the first genome-wide association (GWAS) and meta-analysis of sciatica. The meta-analysis was conducted across two GWAS covering 291 Finnish sciatica cases and 3671 controls genotyped and imputed at 7.7 million autosomal variants. The most promising loci (psciatica patients and 18,489 controls. We identified five intragenic variants, with relatively low frequencies, at two novel loci associated with sciatica at genome-wide significance. These included chr9:14344410:I (rs71321981) at 9p22.3 (NFIB gene; p = 1.30x10-8, MAF = 0.08) and four variants at 15q21.2: rs145901849, rs80035109, rs190200374 and rs117458827 (MYO5A; p = 1.34x10-8, MAF = 0.06; p = 2.32x10-8, MAF = 0.07; p = 3.85x10-8, MAF = 0.06; p = 4.78x10-8, MAF = 0.07, respectively). The most significant association in the meta-analysis, a single base insertion rs71321981 within the regulatory region of the transcription factor NFIB, replicated in an independent Finnish population sample (p = 0.04). Despite identifying 15q21.2 as a promising locus, we were not able to replicate it. It was differentiated; the lead variants within 15q21.2 were more frequent in Finland (6-7%) than in other European populations (1-2%). Imputation accuracies of the three significantly associated variants (chr9:14344410:I, rs190200374, and rs80035109) were validated by genotyping. In summary, our results suggest a novel locus, 9p22.3 (NFIB), which may be involved in susceptibility to sciatica. In addition, another locus, 15q21.2, emerged as a promising one, but failed to replicate.
Steinig, Eike J; Neuditschko, Markus; Khatkar, Mehar S; Raadsma, Herman W; Zenger, Kyall R
Network-based approaches are emerging as valuable tools for the analysis of complex genetic structure in wild and captive populations. netview p combines data quality control with the construction of population networks through mutual k-nearest neighbours thresholds applied to genome-wide SNPs. The program is cross-platform compatible, open-source and efficiently operates on data ranging from hundreds to hundreds of thousands of SNPs. The pipeline was used for the analysis of pedigree data from simulated (n = 750, SNPs = 1279) and captive silver-lipped pearl oysters (n = 415, SNPs = 1107), wild populations of the European hake from the Atlantic and Mediterranean (n = 834, SNPs = 380) and grey wolves from North America (n = 239, SNPs = 78 255). The population networks effectively visualize large- and fine-scale genetic structure within and between populations, including family-level structure and relationships. netview p comprises a network-based addition to other population analysis tools and provides user-friendly access to a complex network analysis pipeline through implementation in python. © 2015 John Wiley & Sons Ltd.
Full Text Available Hypoxia-inducible factors (HIFs are a family of evolutionary conserved alpha-beta heterodimeric transcription factors that induce a wide range of genes in response to low oxygen tension. Molecular mechanisms that mediate oxygen-dependent HIF regulation operate at the level of the alpha subunit, controlling protein stability, subcellular localization, and transcriptional coactivator recruitment. We have conducted an unbiased genome-wide RNA interference (RNAi screen in Drosophila cells aimed to the identification of genes required for HIF activity. After 3 rounds of selection, 30 genes emerged as critical HIF regulators in hypoxia, most of which had not been previously associated with HIF biology. The list of genes includes components of chromatin remodeling complexes, transcription elongation factors, and translational regulators. One remarkable hit was the argonaute 1 (ago1 gene, a central element of the microRNA (miRNA translational silencing machinery. Further studies confirmed the physiological role of the miRNA machinery in HIF-dependent transcription. This study reveals the occurrence of novel mechanisms of HIF regulation, which might contribute to developing novel strategies for therapeutic intervention of HIF-related pathologies, including heart attack, cancer, and stroke.
Franke, Andre; Wollstein, Andreas; Teuber, Markus; Wittig, Michael; Lu, Tim; Hoffmann, Katrin; Nürnberg, Peter; Krawczak, Michael; Schreiber, Stefan; Hampe, Jochen
Genome-wide association analysis appears to be a promising way to identify heritable susceptibility factors for complex human disorders. However, the feasibility of large-scale genotyping experiments is currently limited by an incomplete marker coverage of the genome, a restricted understanding of the functional role of given genomic regions, and the small sample sizes used. Thus, genome-wide association analysis will be a screening tool to facilitate subsequent gene discovery rather than a means to completely resolve individual genetic risk profiles. The validation of association findings will continue to rely upon the replication of "leads" in independent samples from either the same or different populations. Even under such pragmatic conditions, the timely analysis of the large data sets in question poses serious technical challenges. We have therefore developed public-domain software, GENOMIZER, that implements the workflow of an association experiment, including data management, single-point and haplotype analysis, "lead" definition, and data visualization. GENOMIZER (www.ikmb.uni-kiel.de/genomizer) comes with a complete user manual, and is open-source software licensed under the GNU Lesser General Public License. We suggest that the use of this software will facilitate the handling and interpretation of the currently emerging genome-wide association data. Copyright 2006 Wiley-Liss, Inc.
Ouzounis Christos A
Full Text Available Abstract Background Genome-wide expression patterns in physiological cardiac hypertrophy. Co-expression patterns in physiological cardiac hypertrophy Results In this study, the first large-scale analysis of publicly available genome-wide expression data of several in vivo murine models of physiological LVH was carried out using network analysis. On evaluating 3 million gene co-expression patterns across 141 relevant microarray experiments, it was found that physiological adaptation is an evolutionarily conserved processes involving preservation of the function of cytochrome c oxidase, induction of autophagy compatible with cell survival, and coordinated regulation of angiogenesis. Conclusion This analysis not only identifies known biological pathways involved in physiological LVH, but also offers novel insights into the molecular basis of this phenotype by identifying key networks of co-expressed genes, as well as their topological and functional properties, using relevant high-quality microarray experiments and network inference.
Caicedo, Ana L; Williamson, Scott H; Hernandez, Ryan D
Domesticated Asian rice (Oryza sativa) is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments......, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs) in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models...... explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection...
Oskari Kilpeläinen, Tuomas
Genome-wide association studies (GWASs) have revolutionized the search for genetic variants regulating resting heart rate. In the last 10 years, GWASs have led to the identification of at least 21 novel heart rate loci. These discoveries have provided valuable insights into the mechanisms...... and pathways that regulate heart rate and link heart rate to cardiovascular morbidity and mortality. GWASs capture majority of genetic variation in a population sample by utilizing high-throughput genotyping chips measuring genotypes for up to several millions of SNPs across the genome in thousands...... of individuals. This allows the identification of the strongest heart rate associated signals at genome-wide level. While GWASs provide robust statistical evidence of the association of a given genetic locus with heart rate, they are only the starting point for detailed follow-up studies to locate the causal...
Abstract The use of genomic technologies in biogerontology has the potential to greatly enhance our understanding of human ageing. High-throughput screens for alleles correlated with survival in long-lived people have uncovered novel genes involved in age-associated disease. Genome-wide longevity studies in simple eukaryotes are identifying evolutionarily conserved pathways that determine longevity. It is hoped that validation of these 'public' aspects of ageing in mice, along with analyses o...
Weinsheimer, Shantel; Bendjilali, Nasrine; Nelson, Jeffrey; Guo, Diana E; Zaroff, Jonathan G.; Sidney, Stephen; McCulloch, Charles E.; Al-Shahi Salman, Rustam; Berg, Jonathan N; Bobby P. C. Koeleman; Simon, Matthias; Bostroem, Azize; Fontanella, Marco; Sturiale, Carmelo L; Pola, Roberto
BACKGROUND: The pathogenesis of sporadic brain arteriovenous malformations (BAVMs) remains unknown, but studies suggest a genetic component. We estimated the heritability of sporadic BAVM and performed a genome-wide association study (GWAS) to investigate association of common single nucleotide polymorphisms (SNPs) with risk of sporadic BAVM in the international, multicentre Genetics of Arteriovenous Malformation (GEN-AVM) consortium.METHODS: The Caucasian discovery cohort included 515 BAVM c...
Wendler, Jason P.; Okombo, John; Amato, Roberto; Miotto, Olivo; Kiara, Steven M.; Mwai, Leah; Pole, Lewa; O'Brien, John; Manske, Magnus; Alcock, Dan; Drury, Eleanor; Sanders, Mandy; Samuel O Oyola; Malangone, Cinzia; Jyothi, Dushyanth
Background Drug resistance remains a chief concern for malaria control. In order to determine the genetic markers of drug resistant parasites, we tested the genome-wide associations (GWA) of sequence-based genotypes from 35 Kenyan P. falciparum parasites with the activities of 22 antimalarial drugs. Methods and Principal Findings Parasites isolated from children with acute febrile malaria were adapted to culture, and sensitivity was determined by in vitro growth in the presence of anti-malari...
Shen, Hao-ran; Qiu, Li-hua; Zhang, Zhi-qing; Qin, Yuan-yuan; Cao, Cong; Di, Wen
Polycystic ovary syndrome (PCOS) is a complex, heterogeneous disorder of uncertain etiology. Recent studies suggested that insulin resistance (IR) plays an important role in the development of PCOS. In the current study, we aimed to investigate the molecular mechanism of IR in PCOS. We employed genome-wide methylated DNA immunoprecipitation (MeDIP) analysis to characterize genes that are differentially methylated in PCOS patients vs. healthy controls. Besides, we also identified the different...
Beecham, Gary W.; Hamilton, Kara; Adam C Naj; Martin, Eden R; Huentelman, Matt; Myers, Amanda J.; Corneveaux, Jason J.; Hardy, John; Vonsattel, Jean-Paul; Younkin, Steven G.; Bennett, David A.; Philip L De Jager; Larson, Eric B.; Crane, Paul K.; Kamboh, M Ilyas
Alzheimer's disease (AD) and related dementias are a major public health challenge and present a therapeutic imperative for which we need additional insight into molecular pathogenesis. We performed a genome-wide association study and analysis of known genetic risk loci for AD dementia using neuropathologic data from 4,914 brain autopsies. Neuropathologic data were used to define clinico-pathologic AD dementia or controls, assess core neuropathologic features of AD (neuritic plaques, NPs; neu...
Weiss, Lauren; Tsang, KM; Croen, LA; Torres, AR; Kharrazi, M.; Delorenze, GN; Windham, GC; Yoshida, CK; Zerbo, O
Effects of parental genotype or parent-offspring genetic interaction are well established in model organisms for a variety of traits. However, these transgenerational genetic models are rarely studied in humans. We have utilized an autism case-control study with 735 mother-child pairs to perform genome-wide screening for maternal genetic effects and maternal-offspring genetic interaction. We used simple models of single locus parent-child interaction and identified suggestive results (P
Olga A Soutourina
Full Text Available Clostridium difficile is an emergent pathogen, and the most common cause of nosocomial diarrhea. In an effort to understand the role of small noncoding RNAs (sRNAs in C. difficile physiology and pathogenesis, we used an in silico approach to identify 511 sRNA candidates in both intergenic and coding regions. In parallel, RNA-seq and differential 5'-end RNA-seq were used for global identification of C. difficile sRNAs and their transcriptional start sites at three different growth conditions (exponential growth phase, stationary phase, and starvation. This global experimental approach identified 251 putative regulatory sRNAs including 94 potential trans riboregulators located in intergenic regions, 91 cis-antisense RNAs, and 66 riboswitches. Expression of 35 sRNAs was confirmed by gene-specific experimental approaches. Some sRNAs, including an antisense RNA that may be involved in control of C. difficile autolytic activity, showed growth phase-dependent expression profiles. Expression of each of 16 predicted c-di-GMP-responsive riboswitches was observed, and experimental evidence for their regulatory role in coordinated control of motility and biofilm formation was obtained. Finally, we detected abundant sRNAs encoded by multiple C. difficile CRISPR loci. These RNAs may be important for C. difficile survival in bacteriophage-rich gut communities. Altogether, this first experimental genome-wide identification of C. difficile sRNAs provides a firm basis for future RNome characterization and identification of molecular mechanisms of sRNA-based regulation of gene expression in this emergent enteropathogen.
Metzakopian, Emmanouil; Strong, Alex; Iyer, Vivek; Hodgkins, Alex; Tzelepis, Konstantinos; Antunes, Liliana; Friedrich, Mathias J; Kang, Qiaohua; Davidson, Teresa; Lamberth, Jacob; Hoffmann, Christina; Davis, Gregory D; Vassiliou, George S; Skarnes, William C; Bradley, Allan
CRISPR-Cas9 technology has accelerated biological research becoming routine for many laboratories. It is rapidly replacing conventional gene editing techniques and has high utility for both genome-wide and gene-focussed applications. Here we present the first individually cloned CRISPR-Cas9 genome wide arrayed sgRNA libraries covering 17,166 human and 20,430 mouse genes at a complexity of 34,332 sgRNAs for human and 40,860 sgRNAs for the mouse genome. For flexibility in generating stable cell lines the sgRNAs have been cloned in a lentivirus backbone containing PiggyBac transposase recognition elements together with fluorescent and drug selection markers. Over 95% of tested sgRNA induced specific DNA cleavage as measured by CEL-1 assays. Furthermore, sgRNA targeting GPI anchor protein pathway genes induced loss of function mutations in human and mouse cell lines measured by FLAER labelling. These arrayed libraries offer the prospect for performing screens on individual genes, combinations as well as larger gene sets. They also facilitate rapid deconvolution of signals from genome-wide screens. This set of vectors provide an organized comprehensive gene editing toolbox of considerable scientific value.
Pritykin, Yuri; Ghersi, Dario; Singh, Mona
Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655
Ransohoff, Katherine J; Wu, Wenting; Cho, Hyunje G; Chahal, Harvind C; Lin, Yuan; Dai, Hong-Ji; Amos, Christopher I; Lee, Jeffrey E; Tang, Jean Y; Hinds, David A; Han, Jiali; Wei, Qingyi; Sarin, Kavita Y
Genome-wide association studies have identified 21 susceptibility loci associated with melanoma. These loci implicate genes affecting pigmentation, nevus count, telomere maintenance, and DNA repair in melanoma risk. Here, we report the results of a two-stage genome-wide association study of melanoma. The stage 1 discovery phase consisted of 4,842 self-reported melanoma cases and 286,565 controls of European ancestry from the 23andMe research cohort and the stage 2 replication phase consisted of 1,804 melanoma cases and 1,026 controls from the University of Texas M.D. Anderson Cancer Center. We performed a combined meta-analysis totaling 6,628 melanoma cases and 287,591 controls. Our study replicates 20 of 21 previously known melanoma-loci and confirms the association of the telomerase reverse transcriptase, TERT, with melanoma susceptibility at genome-wide significance. In addition, we uncover a novel polymorphism, rs187843643 (OR = 1.96; 95% CI = [1.54, 2.48]; P = 3.53 x 10-8), associated with melanoma. The SNP rs187842643 lies within a noncoding RNA 177kb downstream of BASP1 (brain associated protein-1). We find that BASP1 expression is suppressed in melanoma as compared with benign nevi, providing additional evidence for a putative role in melanoma pathogenesis.
Mechelli, Rosella; Umeton, Renato; Policano, Claudia
of genes whose products are known to physically interact with environmental factors that may be relevant for disease pathogenesis) analysis of genome-wide association data in multiple sclerosis. We looked for statistical enrichment of associations among interactomes that, at the current state of knowledge......, may be representative of gene-environment interactions of potential, uncertain or unlikely relevance for multiple sclerosis pathogenesis: Epstein-Barr virus, human immunodeficiency virus, hepatitis B virus, hepatitis C virus, cytomegalovirus, HHV8-Kaposi sarcoma, H1N1-influenza, JC virus, human innate....... The P values of all single nucleotide polymorphism mapping to a given interactome were obtained from the last genome-wide association study of the International Multiple Sclerosis Genetics Consortium & the Wellcome Trust Case Control Consortium, 2. The interaction between genotype and Epstein Barr virus...
Roh, Tae-Young; Zhao, Keji
One major postgenomic challenge is to characterize the epigenomes that control genome functions. The epigenomes are mainly defined by the specific association of nonhistone proteins with chromatin and the covalent modifications of chromatin, including DNA methylation and posttranslational histone modifications. The in vivo protein-binding and chromatin-modification patterns can be revealed by the chromatin immunoprecipitation assay (ChIP). By combining the ChIP assays and the serial analysis of gene expression (SAGE) protocols, we have developed an unbiased and high-resolution genome-wide mapping technique (GMAT) to determine the genome-wide protein-targeting and chromatin-modification patterns. GMAT has been successfully applied to mapping the target sites of the histone acetyltransferase, Gcn5p, in yeast and to the discovery of the histone acetylation islands as an epigenetic mark for functional regulatory elements in the human genome.
Mohammadnejad, Afsaneh; Brasch-Andersen, Charlotte; Haagerup, Annette
Background: Allergic Rhinitis (AR) is a complex disorder that affects many people around the world. There is a high genetic contribution to the development of the AR, as twins and family studies have estimated heritability of more than 33%. Due to the complex nature of the disease, single SNP...... analysis has limited power in identifying the genetic variations for AR. We combined genome-wide association analysis (GWAS) with polygenic risk score (PRS) in exploring the genetic basis underlying the disease. Methods: We collected clinical data on 631 Danish subjects with AR cases consisting of 434...... sibling pairs and unrelated individuals and control subjects of 197 unrelated individuals. SNP genotyping was done by Affymetrix Genome-Wide Human SNP Array 5.0. SNP imputation was performed using "IMPUTE2". Using additive effect model, GWAS was conducted in discovery sample, the genotypes...
Sotos syndrome (SS) represents an important human model system for the study of epigenetic regulation; it is an overgrowth\\/intellectual disability syndrome caused by mutations in a histone methyltransferase, NSD1. As layered epigenetic modifications are often interdependent, we propose that pathogenic NSD1 mutations have a genome-wide impact on the most stable epigenetic mark, DNA methylation (DNAm). By interrogating DNAm in SS patients, we identify a genome-wide, highly significant NSD1(+\\/-)-specific signature that differentiates pathogenic NSD1 mutations from controls, benign NSD1 variants and the clinically overlapping Weaver syndrome. Validation studies of independent cohorts of SS and controls assigned 100% of these samples correctly. This highly specific and sensitive NSD1(+\\/-) signature encompasses genes that function in cellular morphogenesis and neuronal differentiation, reflecting cardinal features of the SS phenotype. The identification of SS-specific genome-wide DNAm alterations will facilitate both the elucidation of the molecular pathophysiology of SS and the development of improved diagnostic testing.
Full Text Available Genome-wide association studies (GWAS have successfully identified a number of single-nucleotide polymorphisms (SNPs associated with colorectal cancer (CRC risk. However, these susceptibility loci known today explain only a small fraction of the genetic risk. Gene-gene interaction (GxG is considered to be one source of the missing heritability. To address this, we performed a genome-wide search for pair-wise GxG associated with CRC risk using 8,380 cases and 10,558 controls in the discovery phase and 2,527 cases and 2,658 controls in the replication phase. We developed a simple, but powerful method for testing interaction, which we term the Average Risk Due to Interaction (ARDI. With this method, we conducted a genome-wide search to identify SNPs showing evidence for GxG with previously identified CRC susceptibility loci from 14 independent regions. We also conducted a genome-wide search for GxG using the marginal association screening and examining interaction among SNPs that pass the screening threshold (p<10(-4. For the known locus rs10795668 (10p14, we found an interacting SNP rs367615 (5q21 with replication p = 0.01 and combined p = 4.19×10(-8. Among the top marginal SNPs after LD pruning (n = 163, we identified an interaction between rs1571218 (20p12.3 and rs10879357 (12q21.1 (nominal combined p = 2.51×10(-6; Bonferroni adjusted p = 0.03. Our study represents the first comprehensive search for GxG in CRC, and our results may provide new insight into the genetic etiology of CRC.
Zou, F.; Carrasquillo, M. M.; Pankratz, V. S.; Belbin, O.; Morgan, K.; Allen, M.; Wilcox, S. L.; Ma, L.; Walker, L. P.; Kouri, N.; Burgess, J. D.; Younkin, L. H.; Younkin, Samuel G.; Younkin, C. S.; Bisceglio, G. D.; Crook, J. E.; Dickson, D. W.; Petersen, R. C.; Graff-Radford, N.; Younkin, Steven G.; Ertekin-Taner, N.
Background: Late-onset Alzheimer disease (LOAD) is a common disorder with a substantial genetic component. We postulate that many disease susceptibility variants act by altering gene expression levels. Methods: We measured messenger RNA (mRNA) expression levels of 12 LOAD candidate genes in the cerebella of 200 subjects with LOAD. Using the genotypes from our LOAD genome-wide association study for the cis-single nucleotide polymorphisms (SNPs) (n = 619) of these 12 LOAD candidate genes, we tested for associations with expression levels as endophenotypes. The strongest expression cis-SNP was tested for AD association in 7 independent case-control series (2,280 AD and 2,396 controls). Results: We identified 3 SNPs that associated significantly with IDE (insulin degrading enzyme) expression levels. A single copy of the minor allele for each significant SNP was associated with ∼twofold higher IDE expression levels. The most significant SNP, rs7910977, is 4.2 kb beyond the 3′ end of IDE. The association observed with this SNP was significant even at the genome-wide level (p = 2.7 × 10−8). Furthermore, the minor allele of rs7910977 associated significantly (p = 0.0046) with reduced LOAD risk (OR = 0.81 with a 95% CI of 0.70-0.94), as expected biologically from its association with elevated IDE expression. Conclusions: These results provide strong evidence that IDE is a late-onset Alzheimer disease (LOAD) gene with variants that modify risk of LOAD by influencing IDE expression. They also suggest that the use of expression levels as endophenotypes in genome-wide association studies may provide a powerful approach for the identification of disease susceptibility alleles. GLOSSARY AD = Alzheimer disease; CI = confidence interval; GWAS = genome-wide association study; LOAD = late-onset Alzheimer disease; mRNA = messenger RNA; OR = odds ratio; SNP = single nucleotide polymorphism. PMID:20142614
Wang, Hansong; Burnett, Terrilea; Kono, Suminori; Haiman, Christopher A.; Iwasaki, Motoki; Wilkens, Lynne R.; Lenora W M Loo; Berg, David Van Den; Laurence N Kolonel; Henderson, Brian E.; Keku, Temitope O; Sandler, Robert S.; Signorello, Lisa B.; Blot, William J.; Newcomb, Polly A.
The genetic basis of sporadic colorectal cancer (CRC) is not well explained by known risk polymorphisms. Here we perform a meta-analysis of two genome-wide association studies in 2,627 cases and 3,797 controls of Japanese ancestry and 1,894 cases and 4,703 controls of African ancestry, to identify genetic variants that contribute to CRC susceptibility. We replicate genome-wide statistically significant associations (P < 5×10−8) in 16,823 cases and 18,211 controls of European ancestry. This st...
Full Text Available Individuals with fast nicotine metabolism typically smoke more and thus have a greater risk for smoking-induced diseases. Further, the efficacy of smoking cessation pharmacotherapy is dependent on the rate of nicotine metabolism. Our objective was to use nicotine metabolite ratio (NMR, an established biomarker of nicotine metabolism rate, in a genome-wide association study (GWAS to identify novel genetic variants influencing nicotine metabolism. A heritability estimate of 0.81 (95% CI 0.70-0.88 was obtained for NMR using monozygotic and dizygotic twins of the FinnTwin cohort. We performed a GWAS in cotinine-verified current smokers of three Finnish cohorts (FinnTwin, Young Finns Study, FINRISK2007, followed by a meta-analysis of 1518 subjects, and annotated the genome-wide significant SNPs with methylation quantitative loci (meQTL analyses. We detected association on 19q13 with 719 SNPs exceeding genome-wide significance within a 4.2 Mb region. The strongest evidence for association emerged for CYP2A6 (min p = 5.77E-86, in intron 4, the main metabolic enzyme for nicotine. Other interesting genes with genome-wide significant signals included CYP2B6, CYP2A7, EGLN2, and NUMBL. Conditional analyses revealed three independent signals on 19q13, all located within or in the immediate vicinity of CYP2A6. A genetic risk score constructed using the independent signals showed association with smoking quantity (p = 0.0019 in two independent Finnish samples. Our meQTL results showed that methylation values of 16 CpG sites within the region are affected by genotypes of the genome-wide significant SNPs, and according to causal inference test, for some of the SNPs the effect on NMR is mediated through methylation. To our knowledge, this is the first GWAS on NMR. Our results enclose three independent novel signals on 19q13.2. The detected CYP2A6 variants explain a strikingly large fraction of variance (up to 31% in NMR in these study samples. Further, we provide
Salvi, Erika; Kutalik, Zoltán; Glorioso, Nicola; Benaglio, Paola; Frau, Francesca; Kuznetsova, Tatiana; Arima, Hisatomi; Hoggart, Clive; Tichet, Jean; Nikitin, Yury P.; Conti, Costanza; Seidlerova, Jitka; Tikhonoff, Valérie; Stolarz-Skrzypek, Katarzyna; Johnson, Toby; Devos, Nabila; Zagato, Laura; Guarrera, Simonetta; Zaninello, Roberta; Calabria, Andrea; Stancanelli, Benedetta; Troffa, Chiara; Thijs, Lutgarde; Rizzi, Federica; Simonova, Galina; Lupoli, Sara; Argiolas, Giuseppe; Braga, Daniele; D’Alessio, Maria C.; Ortu, Maria F.; Ricceri, Fulvio; Mercurio, Maurizio; Descombes, Patrick; Marconi, Maurizio; Chalmers, John; Harrap, Stephen; Filipovsky, Jan; Bochud, Murielle; Iacoviello, Licia; Ellis, Justine; Stanton, Alice V.; Laan, Maris; Padmanabhan, Sandosh; Dominiczak, Anna F.; Samani, Nilesh J.; Melander, Olle; Jeunemaitre, Xavier; Manunta, Paolo; Shabo, Amnon; Vineis, Paolo; Cappuccio, Francesco P.; Caulfield, Mark J.; Matullo, Giuseppe; Rivolta, Carlo; Munroe, Patricia B.; Barlassina, Cristina; Staessen, Jan A; Beckmann, Jacques S.; Cusi, Daniele
Essential hypertension is a multi-factorial disorder and is the main risk factor for renal and cardiovascular complications. The research on the genetics of hypertension has been frustrated by the small predictive value of the discovered genetic variants. The HYPERGENES Project investigated associations between genetic variants and essential hypertension pursuing a two-stage study by recruiting cases and controls from extensively characterized cohorts recruited over many years in different European regions. The discovery phase consisted of 1,865 cases and 1,750 controls genotyped with 1M Illumina array. Best hits were followed up in a validation panel of 1,385 cases and 1,246 controls that were genotyped with a custom array of 14,055 markers. We identified a new hypertension susceptibility locus (rs3918226) in the promoter region of the endothelial nitric oxide synthase (eNOS) gene (odds ratio 1.54; 95% CI 1.37-1.73; combined p=2.58·10−13). A meta-analysis, using other in-silico/de novo genotyping data for a total of 21714 subjects, resulted in an overall odds ratio of 1.34 (95% CI 1.25-1.44, p=1.032·10−14). The quantitative analysis on a population-based sample revealed an effect size of 1.91 (95% CI 0.16-3.66) for systolic and 1.40 (95% CI 0.25-2.55) for diastolic blood pressure. We identified in-silico a potential binding site for ETS transcription-factors directly next to rs3918226, suggesting a potential modulation of eNOS expression. Biological evidence links eNOS with hypertension, as it is a critical mediator of cardiovascular homeostasis and blood pressure control via vascular tone regulation. This finding supports the hypothesis that there may be a causal genetic variation at this locus. PMID:22184326
Tamada, Yoshinori; Imoto, Seiya; Araki, Hiromitsu; Nagasaki, Masao; Print, Cristin; Charnock-Jones, D Stephen; Miyano, Satoru
We present a novel algorithm to estimate genome-wide gene networks consisting of more than 20,000 genes from gene expression data using nonparametric Bayesian networks. Due to the difficulty of learning Bayesian network structures, existing algorithms cannot be applied to more than a few thousand genes. Our algorithm overcomes this limitation by repeatedly estimating subnetworks in parallel for genes selected by neighbor node sampling. Through numerical simulation, we confirmed that our algorithm outperformed a heuristic algorithm in a shorter time. We applied our algorithm to microarray data from human umbilical vein endothelial cells (HUVECs) treated with siRNAs, to construct a human genome-wide gene network, which we compared to a small gene network estimated for the genes extracted using a traditional bioinformatics method. The results showed that our genome-wide gene network contains many features of the small network, as well as others that could not be captured during the small network estimation. The results also revealed master-regulator genes that are not in the small network but that control many of the genes in the small network. These analyses were impossible to realize without our proposed algorithm.
Li, Xin; Cornelis, Marilyn C; Liang, Liming; Song, Fengju; De Vivo, Immaculata; Giovannucci, Edward; Tang, Jean Y; Han, Jiali
Animal models have suggested that oral or topical administration of caffeine could inhibit ultraviolet-induced carcinogenesis via the ataxia telangiectasia and rad3 (ATR)-related apoptosis. Previous epidemiological studies have demonstrated that increased caffeine consumption is associated with reduced risk of basal cell carcinoma (BCC). To identify common genetic markers that may modify this association, we tested gene-caffeine intake interaction on BCC risk in a genome-wide analysis. We included 3383 BCC cases and 8528 controls of European ancestry from the Nurses' Health Study and Health Professionals Follow-up Study. Single nucleotide polymorphism (SNP) rs142310826 near the NEIL3 gene showed a genome-wide significant interaction with caffeine consumption (P = 1.78 × 10-8 for interaction) on BCC risk. There was no gender difference for this interaction (P = 0.64 for heterogeneity). NEIL3, a gene belonging to the base excision DNA repair pathway, encodes a DNA glycosylase that recognizes and removes lesions produced by oxidative stress. In addition, we identified several loci with P value for interaction caffeine consumption-related SNPs reported by previous genome-wide association studies and risk of BCC, both individually and jointly, but found no significant association. In sum, we identified a DNA repair gene that could be involved in caffeine-mediated skin tumor inhibition. Further studies are warranted to confirm these findings. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: email@example.com.
Thompson, A G B; Uphill, J; Lowe, J; Porter, M-C; Lukic, A; Carswell, C; Rudge, P; MacKay, A; Collinge, J; Mead, S
Prion diseases are rare neurodegenerative conditions causing highly variable clinical syndromes, which often include prominent neuropsychiatric symptoms. We have recently carried out a clinical study of behavioural and psychiatric symptoms in a large prospective cohort of patients with prion disease in the United Kingdom, allowing us to operationalise specific behavioural/psychiatric phenotypes as traits in human prion disease. Here, we report exploratory genome-wide association analysis on 170 of these patients and 5200 UK controls, looking for single-nucleotide polymorphisms (SNPs) associated with three behavioural/psychiatric phenotypes in the context of prion disease. We also specifically examined a selection of candidate SNPs that have shown genome-wide association with psychiatric conditions in previously published studies, and the codon 129 polymorphism of the prion protein gene, which is known to modify various aspects of the phenotype of prion disease. No SNPs reached genome-wide significance, and there was no evidence of altered burden of known psychiatric risk alleles in relevant prion cases. SNPs showing suggestive evidence of association (Panalysis that may shed clearer light on the biological basis of these complex disease manifestations, and the diseases themselves.
Dogan, Meeshanthini V; Beach, Steven R H; Philibert, Robert A
Smoking is the leading cause of death in the United States. It exerts its effects by increasing susceptibility to a variety of complex disorders among those who smoke, and if pregnant, to their unborn children. In prior efforts to understand the epigenetic mechanisms through which this increased vulnerability is conveyed, a number of investigators have conducted genome wide methylation analyses. Unfortunately, secondary to methodological limitations, these studies were unable to examine methylation in gene regions with significant amounts of genetic variation. Using genome wide genetic and epigenetic data from the Framingham Heart Study, we re-examined the relationship of smoking status to genome wide methylation status. When only methylation status is considered, smoking was significantly associated with differential methylation in 310 genes that map to a variety of biological process and cellular differentiation pathways. However, when SNP effects on the magnitude of smoking associated methylation changes are also considered, cis and trans-interaction effects were noted at a total of 266 and 4353 genes with no marked enrichment for any biological pathways. Furthermore, the SNP variation participating in the significant interaction effects is enriched for loci previously associated with complex medical illnesses. The enlarged scope of the methylome shown to be affected by smoking may better explicate the mediational pathways linking smoking with a myriad of smoking related complex syndromes. Additionally, these results strongly suggest that combined epigenetic and genetic data analyses may be critical for a more complete understanding of the relationship between environmental variables, such as smoking, and pathophysiological outcomes. © 2017 Wiley Periodicals, Inc.
Full Text Available Genetic studies of human local adaptation have been facilitated greatly by recent advances in high-throughput genotyping and sequencing technologies. However, few studies have investigated local adaptation in Asian populations on a genome-wide scale and with a high geographic resolution. In this study, taking advantage of the dense population coverage in Southeast Asia, which is the part of the world least studied in term of natural selection, we depicted genome-wide landscapes of local adaptations in 63 Asian populations representing the majority of linguistic and ethnic groups in Asia. Using genome-wide data analysis, we discovered many genes showing signs of local adaptation or natural selection. Notable examples, such as FOXQ1, MAST2, and CDH4, were found to play a role in hair follicle development and human cancer, signal transduction, and tumor repression, respectively. These showed strong indications of natural selection in Philippine Negritos, a group of aboriginal hunter-gatherers living in the Philippines. MTTP, which has associations with metabolic syndrome, body mass index, and insulin regulation, showed a strong signature of selection in Southeast Asians, including Indonesians. Functional annotation analysis revealed that genes and genetic variants underlying natural selections were generally enriched in the functional category of alternative splicing. Specifically, many genes showing significant difference with respect to allele frequency between northern and southern Asian populations were found to be associated with human height and growth and various immune pathways. In summary, this study contributes to the overall understanding of human local adaptation in Asia and has identified both known and novel signatures of natural selection in the human genome.
Summary Background The epilepsies are a clinically heterogeneous group of neurological disorders. Despite strong evidence for heritability, genome-wide association studies have had little success in identification of risk loci associated with epilepsy, probably because of relatively small sample sizes and insufficient power. We aimed to identify risk loci through meta-analyses of genome-wide association studies for all epilepsy and the two largest clinical subtypes (genetic generalised epilepsy and focal epilepsy). Methods We combined genome-wide association data from 12 cohorts of individuals with epilepsy and controls from population-based datasets. Controls were ethnically matched with cases. We phenotyped individuals with epilepsy into categories of genetic generalised epilepsy, focal epilepsy, or unclassified epilepsy. After standardised filtering for quality control and imputation to account for different genotyping platforms across sites, investigators at each site conducted a linear mixed-model association analysis for each dataset. Combining summary statistics, we conducted fixed-effects meta-analyses of all epilepsy, focal epilepsy, and genetic generalised epilepsy. We set the genome-wide significance threshold at panalysis. Meta-analysis of the all-epilepsy cohort identified loci at 2q24.3 (p=8·71 × 10−10), implicating SCN1A, and at 4p15.1 (p=5·44 × 10−9), harbouring PCDH7, which encodes a protocadherin molecule not previously implicated in epilepsy. For the cohort of genetic generalised epilepsy, we noted a single signal at 2p16.1 (p=9·99 × 10−9), implicating VRK2 or FANCL. No single nucleotide polymorphism achieved genome-wide significance for focal epilepsy. Interpretation This meta-analysis describes a new locus not previously implicated in epilepsy and provides further evidence about the genetic architecture of these disorders, with the ultimate aim of assisting in disease classification and prognosis. The data suggest that
Matthew T. Oetjens
Full Text Available Population stratification or confounding by genetic ancestry is a potential cause of false associations in genetic association studies. Estimation of and adjustment for genetic ancestry has become common practice thanks in part to the availability of ancestry informative markers on genome-wide association study (GWAS arrays. While array data is now widespread, these data are not ubiquitous as several large epidemiologic and clinic-based studies lack genome-wide data. One such large epidemiologic-based study lacking genome-wide data accessible to investigators is the National Health and Nutrition Examination Surveys (NHANES, population-based cross-sectional surveys of Americans linked to demographic, health, and lifestyle data conducted by the Centers for Disease Control and Prevention. DNA samples (n=14,998 were extracted from biospecimens from consented NHANES participants between 1991-1994 (NHANES III, phase 2 and 1999-2002 and represent three major self-identified racial/ethnic groups: non-Hispanic whites (n=6,634, non-Hispanic blacks (n=3,458, and Mexican Americans (n=3,950. We as the Epidemiologic Architecture for Genes Linked to Environment (EAGLE study genotyped candidate gene and GWAS-identified index variants in NHANES as part of the larger Population Architecture using Genomics and Epidemiology (PAGE I study for collaborative genetic association studies. To enable basic quality control such as estimation of genetic ancestry to control for population stratification in NHANES san genome-wide data, we outline here strategies that use limited genetic data to identify the markers optimal for characterizing genetic ancestry. From among 411 and 295 autosomal SNPs available in NHANES III and NHANES 1999-2002, we demonstrate that markers with ancestry information can be identified to estimate global ancestry. Despite limited resolution, global genetic ancestry is highly correlated with self-identified race for the majority of participants
Li, Zhixiu; Brown, Matthew A
Ankylosing spondylitis (AS) is an immune-mediated arthritis which primarily affects the spine and sacroiliac joints. Significant progress has been made in discovery of genetic associations with AS by genome-wide association studies (GWAS) over past decade. These findings have uncovered novel pathways involved pathogenesis of the disease and have led to introduction of novel therapeutic treatments for AS. In this Review, we discuss the genetic variations associated with AS identified by GWAS, the major pathways revealed by these AS-associated variations and critical cell types involved in AS development. PMID:29333268
Dijkstra, A. E.; Smolonska, J.; van den Berge, M.
of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA) study of CMH in Caucasian populations. Methods: GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed...... of SATB1 (4.3610 29) in lung tissue. Presence of CMH was associated with increased SATB1 mRNA expression in bronchial biopsies from COPD patients. SATB1 expression was induced during differentiation of primary human bronchial epithelial cells in culture. Conclusions: Our findings, that SNP rs6577641...
Full Text Available Sudden unexpected death in epilepsy (SUDEP represents the most severe degree of the spectrum of epilepsy severity and is the commonest cause of epilepsy-related premature mortality. The precise pathophysiology and the genetic architecture of SUDEP remain elusive. Aiming to elucidate the genetic basis of SUDEP, we analysed rare, protein-changing variants from whole-exome sequences of 18 people who died of SUDEP, 87 living people with epilepsy and 1479 non-epilepsy disease controls. Association analysis revealed a significantly increased genome-wide polygenic burden per individual in the SUDEP cohort when compared to epilepsy (P = 5.7 × 10−3 and non-epilepsy disease controls (P = 1.2 × 10−3. The polygenic burden was driven both by the number of variants per individual, and over-representation of variants likely to be deleterious in the SUDEP cohort. As determined by this study, more than a thousand genes contribute to the observed polygenic burden within the framework of this study. Subsequent gene-based association analysis revealed five possible candidate genes significantly associated with SUDEP or epilepsy, but no one single gene emerges as common to the SUDEP cases. Our findings provide further evidence for a genetic susceptibility to SUDEP, and suggest an extensive polygenic contribution to SUDEP causation. Thus, an overall increased burden of deleterious variants in a highly polygenic background might be important in rendering a given individual more susceptible to SUDEP. Our findings suggest that exome sequencing in people with epilepsy might eventually contribute to generating SUDEP risk estimates, promoting stratified medicine in epilepsy, with the eventual aim of reducing an individual patient's risk of SUDEP.
Mira, Nuno P.; Teixeira, Miguel Cacho
Abstract Weak acids are widely used as food preservatives (e.g., acetic, propionic, benzoic, and sorbic acids), herbicides (e.g., 2,4-dichlorophenoxyacetic acid), and as antimalarial (e.g., artesunic and artemisinic acids), anticancer (e.g., artesunic acid), and immunosuppressive (e.g., mycophenolic acid) drugs, among other possible applications. The understanding of the mechanisms underlying the adaptive response and resistance to these weak acids is a prerequisite to develop more effective strategies to control spoilage yeasts, and the emergence of resistant weeds, drug resistant parasites or cancer cells. Furthermore, the identification of toxicity mechanisms and resistance determinants to weak acid-based pharmaceuticals increases current knowledge on their cytotoxic effects and may lead to the identification of new drug targets. This review integrates current knowledge on the mechanisms of toxicity and tolerance to weak acid stress obtained in the model eukaryote Saccharomyces cerevisiae using genome-wide approaches and more detailed gene-by-gene analysis. The major features of the yeast response to weak acids in general, and the more specific responses and resistance mechanisms towards a specific weak acid or a group of weak acids, depending on the chemical nature of the side chain R group (R-COOH), are highlighted. The involvement of several transcriptional regulatory networks in the genomic response to different weak acids is discussed, focusing on the regulatory pathways controlled by the transcription factors Msn2p/Msn4p, War1p, Haa1p, Rim101p, and Pdr1p/Pdr3p, which are known to orchestrate weak acid stress response in yeast. The extrapolation of the knowledge gathered in yeast to other eukaryotes is also attempted. PMID:20955006
Wong, Hector R
For nearly a decade, our research group has had the privilege of developing and mining a multicenter, microarray-based, genome-wide expression database of critically ill children (≤10 y of age) with septic shock. Using bioinformatic and systems biology approaches, the expression data generated through this discovery-oriented, exploratory approach have been leveraged for a variety of objectives, which are reviewed here. Fundamental observations include widespread repression of gene programs corresponding to the adaptive immune system and biologically significant differential patterns of gene expression across developmental age groups. The data have also identified gene expression-based subclasses of pediatric septic shock having clinically relevant phenotypic differences. The data have also been leveraged for the discovery of novel therapeutic targets, as well as for the discovery and development of novel stratification and diagnostic biomarkers. Almost a decade of genome-wide expression profiling in pediatric septic shock is now demonstrating tangible results. The studies have progressed from an initial discovery-oriented and exploratory phase to a new phase in which the data are being translated and applied to address several areas of clinical need.
M. Ilyas Kamboh
Full Text Available Background. The persistent presence of antiphospholipid antibodies (APA may lead to the development of primary or secondary antiphospholipid syndrome. Although the genetic basis of APA has been suggested, the identity of the underlying genes is largely unknown. In this study, we have performed a genome-wide association study (GWAS in an effort to identify susceptibility loci/genes for three main APA: anticardiolipin antibodies (ACL, lupus anticoagulant (LAC, and anti-β2 glycoprotein I antibodies (anti-β2GPI. Methods. DNA samples were genotyped using the Affymetrix 6.0 array containing 906,600 single-nucleotide polymorphisms (SNPs. Association of SNPs with the antibody status (positive/negative was tested using logistic regression under the additive model. Results. We have identified a number of suggestive novel loci with P
Ana L Caicedo
Full Text Available Domesticated Asian rice (Oryza sativa is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models to explain contemporary patterns of polymorphisms in rice, including a (i selectively neutral population bottleneck model, (ii bottleneck plus migration model, (iii multiple selective sweeps model, and (iv bottleneck plus selective sweeps model. We find that a simple bottleneck model, which has been the dominant demographic model for domesticated species, cannot explain the derived nucleotide polymorphism site frequency spectrum in rice. Instead, a bottleneck model that incorporates selective sweeps, or a more complex demographic model that includes subdivision and gene flow, are more plausible explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection results only in a local signature of variation.
Lamont R. Jones
Full Text Available Keloids are benign fibroproliferative tumors of the skin which commonly occur after injury mainly in darker skinned patients. Medical treatment is fraught with high recurrence rates mainly because of an incomplete understanding of the biological mechanisms that lead to keloids. The purpose of this project was to examine keloid pathogenesis from the epigenome perspective of DNA methylation. Genome-wide profiling used the Infinium HumanMethylation450 BeadChip to interrogate DNA from 6 fresh keloid and 6 normal skin samples from 12 anonymous donors. A 3-tiered approach was used to call out genes most differentially methylated between keloid and normal. When compared to normal, of the 685 differentially methylated CpGs at Tier 3, 510 were hypomethylated and 175 were hypermethylated with 190 CpGs in promoter and 495 in nonpromoter regions. The 190 promoter region CpGs corresponded to 152 genes: 96 (63% were hypomethylated and 56 (37% hypermethylated. This exploratory genome-wide scan of the keloid methylome highlights a predominance of hypomethylated genomic landscapes, favoring nonpromoter regions. DNA methylation, as an additional mechanism for gene regulation in keloid pathogenesis, holds potential for novel treatments that reverse deleterious epigenetic changes. As an alternative mechanism for regulating genes, epigenetics may explain why gene mutations alone do not provide definitive mechanisms for keloid formation.
Full Text Available Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed "damaged DNA immunoprecipitation" (dDIP, uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage.
Christina L Richards
Full Text Available Organisms in the wild are subject to multiple, fluctuating environmental factors, and it is in complex natural environments that genetic regulatory networks actually function and evolve. We assessed genome-wide gene expression patterns in the wild in two natural accessions of the model plant Arabidopsis thaliana and examined the nature of transcriptional variation throughout its life cycle and gene expression correlations with natural environmental fluctuations. We grew plants in a natural field environment and measured genome-wide time-series gene expression from the plant shoot every three days, spanning the seedling to reproductive stages. We find that 15,352 genes were expressed in the A. thaliana shoot in the field, and accession and flowering status (vegetative versus flowering were strong components of transcriptional variation in this plant. We identified between ∼110 and 190 time-varying gene expression clusters in the field, many of which were significantly overrepresented by genes regulated by abiotic and biotic environmental stresses. The two main principal components of vegetative shoot gene expression (PC(veg correlate to temperature and precipitation occurrence in the field. The largest PC(veg axes included thermoregulatory genes while the second major PC(veg was associated with precipitation and contained drought-responsive genes. By exposing A. thaliana to natural environments in an open field, we provide a framework for further understanding the genetic networks that are deployed in natural environments, and we connect plant molecular genetics in the laboratory to plant organismal ecology in the wild.
Xu, Jiawei; Bao, Xiao; Peng, Zhaofeng; Wang, Linlin; Du, Linqing; Niu, Wenbin; Sun, Yingpu
Polycystic ovary syndrome (PCOS) affects approximately 7% of the reproductive-age women. A growing body of evidence indicated that epigenetic mechanisms contributed to the development of PCOS. The role of DNA modification in human PCOS ovary granulosa cell is still unknown in PCOS progression. Global DNA methylation and hydroxymethylation were detected between PCOS' and controls' granulosa cell. Genome-wide DNA methylation was profiled to investigate the putative function of DNA methylaiton. Selected genes expressions were analyzed between PCOS' and controls' granulosa cell. Our results showed that the granulosa cell global DNA methylation of PCOS patients was significant higher than the controls'. The global DNA hydroxymethylation showed low level and no statistical difference between PCOS and control. 6936 differentially methylated CpG sites were identified between control and PCOS-obesity. 12245 differential methylated CpG sites were detected between control and PCOS-nonobesity group. 5202 methylated CpG sites were significantly differential between PCOS-obesity and PCOS-nonobesity group. Our results showed that DNA methylation not hydroxymethylation altered genome-wide in PCOS granulosa cell. The different methylation genes were enriched in development protein, transcription factor activity, alternative splicing, sequence-specific DNA binding and embryonic morphogenesis. YWHAQ, NCF2, DHRS9 and SCNA were up-regulation in PCOS-obesity patients with no significance different between control and PCOS-nonobesity patients, which may be activated by lower DNA methylaiton. Global and genome-wide DNA methylation alteration may contribute to different genes expression and PCOS clinical pathology.
Full Text Available Abstract Background DNA damage response (DDR plays pivotal roles in maintaining genome integrity and stability. An effective DDR requires the involvement of hundreds of genes that compose a complicated network. Because DDR is highly conserved in evolution, studies in lower eukaryotes can provide valuable information to elucidate the mechanism in higher organisms. Fission yeast (Schizosaccharomyces pombe has emerged as an excellent model for DDR research in recent years. To identify novel genes involved in DDR, we screened a genome-wide S. pombe haploid deletion library against six different DNA damage reagents. The library covered 90.5% of the nonessential genes of S. pombe. Results We have identified 52 genes that were actively involved in DDR. Among the 52 genes, 20 genes were linked to DDR for the first time. Flow cytometry analysis of the repair defective mutants revealed that most of them exhibited a defect in cell cycle progression, and some caused genome instability. Microarray analysis and genetic complementation assays were carried out to characterize 6 of the novel DDR genes in more detail. Data suggested that SPBC2A9.02 and SPAC27D7.08c were required for efficient DNA replication initiation because they interacted genetically with DNA replication initiation proteins Abp1 and Abp2. In addition, deletion of sgf73+, meu29+, sec65+ or pab1+ caused improper cytokinesis and DNA re-replication, which contributed to the diploidization in the mutants. Conclusions A genome-wide screen of genes involved in DDR emphasized the key role of cell cycle control in the DDR network. Characterization of novel genes identified in the screen helps to elucidate the mechanism of the DDR network and provides valuable clues for understanding genome stability in higher eukaryotes.
Hendricks, Elbert; Chevalier, Alain
In earlier work published by the author and co-authors, a dynamic model called a Mean Value Engine Model (MVEM) was developed. This model is physically based and is intended mainly for control applications. It is especially well suited to embedded model applications in engine controllers, susch...... as nonlinear observer based air/fuel ratio and advanced idle speed control. After a brief review of this model, it will be compared with other similar models which can be found in the literature. The attempt will be made to point out the differences between the new modified MVEM and those developed elsewhere...
Chen, Fang; Chen, Gary K; Stram, Daniel O; Millikan, Robert C; Ambrosone, Christine B; John, Esther M; Bernstein, Leslie; Zheng, Wei; Palmer, Julie R; Hu, Jennifer J; Rebbeck, Tim R; Ziegler, Regina G; Nyante, Sarah; Bandera, Elisa V; Ingles, Sue A; Press, Michael F; Ruiz-Narvaez, Edward A; Deming, Sandra L; Rodriguez-Gil, Jorge L; Demichele, Angela; Chanock, Stephen J; Blot, William; Signorello, Lisa; Cai, Qiuyin; Li, Guoliang; Long, Jirong; Huo, Dezheng; Zheng, Yonglan; Cox, Nancy J; Olopade, Olufunmilayo I; Ogundiran, Temidayo O; Adebamowo, Clement; Nathanson, Katherine L; Domchek, Susan M; Simon, Michael S; Hennis, Anselm; Nemesure, Barbara; Wu, Suh-Yuh; Leske, M Cristina; Ambs, Stefan; Hutter, Carolyn M; Young, Alicia; Kooperberg, Charles; Peters, Ulrike; Rhie, Suhn K; Wan, Peggy; Sheng, Xin; Pooler, Loreall C; Van Den Berg, David J; Le Marchand, Loic; Kolonel, Laurence N; Henderson, Brian E; Haiman, Christopher A
Genome-wide association studies (GWAS) in diverse populations are needed to reveal variants that are more common and/or limited to defined populations. We conducted a GWAS of breast cancer in women of African ancestry, with genotyping of >1,000,000 SNPs in 3,153 African American cases and 2,831 controls, and replication testing of the top 66 associations in an additional 3,607 breast cancer cases and 11,330 controls of African ancestry. Two of the 66 SNPs replicated (p women of African ancestry will demand testing of a substantially larger set of markers from stage 1 in a larger replication sample.
Hor, Hyun; Kutalik, Zoltán; Dauvilliers, Yves
Narcolepsy is a rare sleep disorder with the strongest human leukocyte antigen (HLA) association ever reported. Since the associated HLA-DRB1*1501-DQB1*0602 haplotype is common in the general population (15-25%), it has been suggested that it is almost necessary but not sufficient for developing...... narcolepsy. To further define the genetic basis of narcolepsy risk, we performed a genome-wide association study (GWAS) in 562 European individuals with narcolepsy (cases) and 702 ethnically matched controls, with independent replication in 370 cases and 495 controls, all heterozygous for DRB1*1501-DQB1...... ratio = 0.02; P narcolepsy susceptibility....
Reveille, John D; Sims, Anne-Marie; Danoy, Patrick; Evans, David M; Leo, Paul; Pointon, Jennifer J; Jin, Rui; Zhou, Xiaodong; Bradbury, Linda A; Appleton, Louise H; Davis, John C; Diekman, Laura; Doan, Tracey; Dowling, Alison; Duan, Ran; Duncan, Emma L; Farrar, Claire; Hadler, Johanna; Harvey, David; Karaderi, Tugce; Mogg, Rebecca; Pomeroy, Emma; Pryce, Karena; Taylor, Jacqueline; Savage, Laurie; Deloukas, Panos; Kumanduri, Vasudev; Peltonen, Leena; Ring, Sue M; Whittaker, Pamela; Glazov, Evgeny; Thomas, Gethin P; Maksymowych, Walter P; Inman, Robert D; Ward, Michael M; Stone, Millicent A; Weisman, Michael H; Wordsworth, B Paul; Brown, Matthew A
To identify susceptibility loci for ankylosing spondylitis, we undertook a genome-wide association study in 2,053 unrelated ankylosing spondylitis cases among people of European descent and 5,140 ethnically matched controls, with replication in an independent cohort of 898 ankylosing spondylitis cases and 1,518 controls. Cases were genotyped with Illumina HumHap370 genotyping chips. In addition to strong association with the major histocompatibility complex (MHC; P ankylosing spondylitis risk and identifies a major role for the interleukin (IL)-23 and IL-1 cytokine pathways in disease susceptibility. PMID:20062062
February through May is the critical time for wind erosion in Kansas, but wind erosion can happen any time when high winds occur on smooth, wide fields with low vegetation and poor soil structure. The most effective wind erosion control is to ensure a protective cover of residue or growing crop thro...
Chen, Peter E; Shapiro, B Jesse
Significant advances in sequencing technologies and genome-wide association studies (GWAS) have revealed substantial insight into the genetic architecture of human phenotypes. In recent years, the application of this approach in bacteria has begun to reveal the genetic basis of bacterial host preference, antibiotic resistance, and virulence. Here, we consider relevant differences between bacterial and human genome dynamics, apply GWAS to a global sample of Mycobacterium tuberculosis genomes to highlight the impacts of linkage disequilibrium, population stratification, and natural selection, and finally compare the traditional GWAS against phyC, a contrasting method of mapping genotype to phenotype based upon evolutionary convergence. We discuss strengths and weaknesses of both methods, and make suggestions for factors to be considered in future bacterial GWAS. Copyright © 2015 Elsevier Ltd. All rights reserved.
Mohanty, Sujata; Khanna, Radhika
Comparative analysis of multiple genomes of closely or distantly related Drosophila species undoubtedly creates excitement among evolutionary biologists in exploring the genomic changes with an ecology and evolutionary perspective. We present herewith the de novo assembled whole genome sequences of four Drosophila species, D. bipectinata, D. takahashii, D. biarmipes and D. nasuta of Indian origin using Next Generation Sequencing technology on an Illumina platform along with their detailed assembly statistics. The comparative genomics analysis, e.g. gene predictions and annotations, functional and orthogroup analysis of coding sequences and genome wide SNP distribution were performed. The whole genome of Zaprionus indianus of Indian origin published earlier by us and the genome sequences of previously sequenced 12 Drosophila species available in the NCBI database were included in the analysis. The present work is a part of our ongoing genomics project of Indian Drosophila species.
Full Text Available Coronary artery disease (CAD is a multifactorial disease with environmental and genetic determinants. The genetic determinants of CAD have previously been explored by the candidate gene approach. Recently, the data from the International HapMap Project and the development of dense genotyping chips have enabled us to perform genome-wide association studies (GWAS on a large number of subjects without bias towards any particular candidate genes. In 2007, three chip-based GWAS simultaneously revealed the significant association between common variants on chromosome 9p21 and CAD. This association was replicated among other ethnic groups and also in a meta-analysis. Further investigations have detected several other candidate loci associated with CAD. The chip-based GWAS approach has identified novel and unbiased genetic determinants of CAD and these insights provide the important direction to better understand the pathogenesis of CAD and to develop new and improved preventive measures and treatments for CAD.
Lada Artem G
Full Text Available Abstract Clusters of localized hypermutation in human breast cancer genomes, named “kataegis” (from the Greek for thunderstorm, are hypothesized to result from multiple cytosine deaminations catalyzed by AID/APOBEC proteins. However, a direct link between APOBECs and kataegis is still lacking. We have sequenced the genomes of yeast mutants induced in diploids by expression of the gene for PmCDA1, a hypermutagenic deaminase from sea lamprey. Analysis of the distribution of 5,138 induced mutations revealed localized clusters very similar to those found in tumors. Our data provide evidence that unleashed cytosine deaminase activity is an evolutionary conserved, prominent source of genome-wide kataegis events. Reviewers This article was reviewed by: Professor Sandor Pongor, Professor Shamil R. Sunyaev, and Dr Vladimir Kuznetsov.
Full Text Available Genome-wide association study (GWAS aims to discover genetic factors underlying phenotypic traits. The large number of genetic factors poses both computational and statistical challenges. Various computational approaches have been developed for large scale GWAS. In this chapter, we will discuss several widely used computational approaches in GWAS. The following topics will be covered: (1 An introduction to the background of GWAS. (2 The existing computational approaches that are widely used in GWAS. This will cover single-locus, epistasis detection, and machine learning methods that have been recently developed in biology, statistic, and computer science communities. This part will be the main focus of this chapter. (3 The limitations of current approaches and future directions.
Galesloot, Tessel E; Van Steen, Kristel; Kiemeney, Lambertus A L M
Joint association analysis of multiple traits in a genome-wide association study (GWAS), i.e. a multivariate GWAS, offers several advantages over analyzing each trait in a separate GWAS. In this study we directly compared a number of multivariate GWAS methods using simulated data. We focused on six...... methods that are implemented in the software packages PLINK, SNPTEST, MultiPhen, BIMBAM, PCHAT and TATES, and also compared them to standard univariate GWAS, analysis of the first principal component of the traits, and meta-analysis of univariate results. We simulated data (N = 1000) for three...... correlation. We compared the power of the methods using empirically fixed significance thresholds (α = 0.05). Our results showed that the multivariate methods implemented in PLINK, SNPTEST, MultiPhen and BIMBAM performed best for the majority of the tested scenarios, with a notable increase in power...
Henshall, John M
Validation of the results of genome-wide association studies or genomic selection studies is an essential component of the experimental program. Validation allows users to quantify the benefit of applying gene tests or genomic prediction, relative to the costs of implementing the program. Further, if implemented, an appropriate weight in a selection index can only be derived if estimates of the accuracy of genomic predictions are available. In this chapter the reasons for validation are explored, and a range of commonly encountered scenarios described. General principles are stated, and options for performing validation discussed. Designs for validation are heavily dependent on the availability of phenotyped animals, and also on the pedigree structures that characterize the breeding program. Consequently, there is no single plan that is always applicable, and a custom plan often must be developed.
Zhang, Hui; Wang, Zhipeng; Wang, Shouzhi; Li, Hui
Domestic animals are invaluable resources for study of the molecular architecture of complex traits. Although the mapping of quantitative trait loci (QTL) responsible for economically important traits in domestic animals has achieved remarkable results in recent decades, not all of the genetic variation in the complex traits has been captured because of the low density of markers used in QTL mapping studies. The genome wide association study (GWAS), which utilizes high-density single-nucleotide polymorphism (SNP), provides a new way to tackle this issue. Encouraging achievements in dissection of the genetic mechanisms of complex diseases in humans have resulted from the use of GWAS. At present, GWAS has been applied to the field of domestic animal breeding and genetics, and some advances have been made. Many genes or markers that affect economic traits of interest in domestic animals have been identified. In this review, advances in the use of GWAS in domestic animals are described.
Soil water deficit is one of the major factors limiting plant productivity. Plants cope with this adverse environmental condition by coordinating the up- or downregulation of an array of stress responsive genes. Reprogramming the expression of these genes leads to rebalanced development and growth that are in concert with the reduced water availability and that ultimately confer enhanced stress tolerance. Currently, several techniques have been employed to monitor genome-wide transcriptional reprogramming under drought stress. The results from these high throughput studies indicate that drought stress-induced transcriptional reprogramming is dynamic, has temporal and spatial specificity, and is coupled with the circadian clock and phytohormone signaling pathways. © 2012 Springer-Verlag Berlin Heidelberg. All rights are reserved.
Palma-Guerrero, Javier; Hall, Charles R; Kowbel, David; Welch, Juliet; Taylor, John W; Brem, Rachel B; Glass, N Louise
Understanding how genomes encode complex cellular and organismal behaviors has become the outstanding challenge of modern genetics. Unlike classical screening methods, analysis of genetic variation that occurs naturally in wild populations can enable rapid, genome-scale mapping of genotype to phenotype with a medium-throughput experimental design. Here we describe the results of the first genome-wide association study (GWAS) used to identify novel loci underlying trait variation in a microbial eukaryote, harnessing wild isolates of the filamentous fungus Neurospora crassa. We genotyped each of a population of wild Louisiana strains at 1 million genetic loci genome-wide, and we used these genotypes to map genetic determinants of microbial communication. In N. crassa, germinated asexual spores (germlings) sense the presence of other germlings, grow toward them in a coordinated fashion, and fuse. We evaluated germlings of each strain for their ability to chemically sense, chemotropically seek, and undergo cell fusion, and we subjected these trait measurements to GWAS. This analysis identified one gene, NCU04379 (cse-1, encoding a homolog of a neuronal calcium sensor), at which inheritance was strongly associated with the efficiency of germling communication. Deletion of cse-1 significantly impaired germling communication and fusion, and two genes encoding predicted interaction partners of CSE1 were also required for the communication trait. Additionally, mining our association results for signaling and secretion genes with a potential role in germling communication, we validated six more previously unknown molecular players, including a secreted protease and two other genes whose deletion conferred a novel phenotype of increased communication and multi-germling fusion. Our results establish protein secretion as a linchpin of germling communication in N. crassa and shed light on the regulation of communication molecules in this fungus. Our study demonstrates the power
Hayes, Ben; Goddard, Mike
Results from genome-wide association studies in livestock, and humans, has lead to the conclusion that the effect of individual quantitative trait loci (QTL) on complex traits, such as yield, are likely to be small; therefore, a large number of QTL are necessary to explain genetic variation in these traits. Given this genetic architecture, gains from marker-assisted selection (MAS) programs using only a small number of DNA markers to trace a limited number of QTL is likely to be small. This has lead to the development of alternative technology for using the available dense single nucleotide polymorphism (SNP) information, called genomic selection. Genomic selection uses a genome-wide panel of dense markers so that all QTL are likely to be in linkage disequilibrium with at least one SNP. The genomic breeding values are predicted to be the sum of the effect of these SNPs across the entire genome. In dairy cattle breeding, the accuracy of genomic estimated breeding values (GEBV) that can be achieved and the fact that these are available early in life have lead to rapid adoption of the technology. Here, we discuss the design of experiments necessary to achieve accurate prediction of GEBV in future generations in terms of the number of markers necessary and the size of the reference population where marker effects are estimated. We also present a simple method for implementing genomic selection using a genomic relationship matrix. Future challenges discussed include using whole genome sequence data to improve the accuracy of genomic selection and management of inbreeding through genomic relationships.
Wang, Z.; McGlynn, K.A.; Rajpert- de Meyts, E.; Bishop, D.T.; Chung, C.C.; Dalgaard, M.D.; Greene, M.H.; Gupta, R; Grotmol, T.; Haugen, T.B.; Karlsson, R.; Litchfield, K.; Mitra, N.; Nielsen, K.; Pyle, L.C.; Schwartz, S.M.; Thorsson, V.; Vardhanabhuti, S.; Wiklund, F.; Turnbull, C.; Chanock, S.J.; Kanetsky, P.A.; Nathanson, K.L.; Kiemeney, B.; Skotheim, R.I.; Zheng, T.
The international Testicular Cancer Consortium (TECAC) combined five published genome-wide association studies of testicular germ cell tumor (TGCT; 3,558 cases and 13,970 controls) to identify new susceptibility loci. We conducted a fixed-effects meta-analysis, including, to our knowledge, the first
Hansel, Nadia N; Ruczinski, Ingo; Rafaels, Nicholas; Sin, Don D; Daley, Denise; Malinina, Alla; Huang, Lili; Sandford, Andrew; Murray, Tanda; Kim, Yoonhee; Vergara, Candelaria; Heckbert, Susan R; Psaty, Bruce M; Li, Guo; Elliott, W Mark; Aminuddin, Farzian; Dupuis, Josée; O'Connor, George T; Doheny, Kimberly; Scott, Alan F; Boezen, Hendrika; Postma, Dirkje S; Smolonska, Joanna; Zanen, Pieter; Mohamed Hoesein, Firdaus A; de Koning, Harry J; Crystal, Ronald G; Tanaka, Toshiko; Ferrucci, Luigi; Silverman, Edwin; Wan, Emily; Vestbo, Jorgen; Lomas, David A; Connett, John; Wise, Robert A; Neptune, Enid R; Mathias, Rasika A; Paré, Peter D; Beaty, Terri H; Barnes, Kathleen C
Accelerated lung function decline is a key COPD phenotype; however, its genetic control remains largely unknown. We performed a genome-wide association study using the Illumina Human660W-Quad v.1_A BeadChip. Generalized estimation equations were used to assess genetic contributions to lung function
Paternoster, Lavinia; Standl, Marie; Chen, Chih-Mei
population-based cohorts and then examined the ten most strongly associated new susceptibility loci in an additional 5,419 affected individuals and 19,833 controls from 14 studies. Three SNPs reached genome-wide significance in the discovery and replication cohorts combined, including rs479844 upstream...
Paternoster, Lavinia; Standl, Marie; Chen, Chih-Mei; Ramasamy, Adaikalavan; Bonnelykke, Klaus; Duijts, Liesbeth; Ferreira, Manuel A.; Alves, Alexessander Couto; Thyssen, Jacob P.; Albrecht, Eva; Baurecht, Hansjoerg; Feenstra, Bjarke; Sleiman, Patrick M. A.; Hysi, Pirro; Warrington, Nicole M.; Curjuric, Ivan; Myhre, Ronny; Curtin, John A.; Groen-Blokhuis, Maria M.; Kerkhof, Marjan; Saaf, Annika; Franke, Andre; Ellinghaus, David; Foelster-Holst, Regina; Dermitzakis, Emmanouil; Montgomery, Stephen B.; Prokisch, Holger; Heim, Katharina; Hartikainen, Anna-Liisa; Pouta, Anneli; Pekkanen, Juha; Blakemore, Alexandra I. F.; Buxton, Jessica L.; Kaakinen, Marika; Duffy, David L.; Madden, Pamela A.; Heath, Andrew C.; Montgomery, Grant W.; Thompson, Philip J.; Matheson, Melanie C.; Le Souef, Peter; St Pourcain, Beate; Smith, George Davey; Henderson, John; Kemp, John P.; Timpson, Nicholas J.; Deloukas, Panos; Ring, Susan M.; Wichmann, H-Erich; Mueller-Nurasyid, Martina; Novak, Natalija; Klopp, Norman; Rodriguez, Elke; McArdle, Wendy; Linneberg, Allan; Menne, Torkil; Nohr, Ellen A.; Hofman, Albert; Uitterlinden, Andre G.; van Duijin, Cornelia M.; Rivadeneira, Fernando; de Jongste, Johan C.; van der Valk, Ralf J. P.; Wjst, Matthias; Jogi, Rain; Geller, Frank; Boyd, Heather A.; Murray, Jeffrey C.; Kim, Cecilia; Mentch, Frank; March, Michael; Mangino, Massimo; Spector, Tim D.; Bataille, Veronique; Pennell, Craig E.; Holt, Patrick G.; Sly, Peter; Tiesler, Carla M. T.; Thiering, Elisabeth; Illig, Thomas; Imboden, Medea; Nystad, Wenche; Simpson, Angela; Hottenga, Jouke-Jan; Postma, Dirkje; Koppelman, Gerard H.; Smit, Henriette A.; Soderhall, Cilla; Chawes, Bo; Kreiner-Moller, Eskil; Bisgaard, Hans; Melen, Erik; Boomsma, Dorret I.; Custovic, Adnan; Jacobsson, Bo; Probst-Hensch, Nicole M.; Palmer, Lyle J.; Glass, Daniel; Hakonarson, Hakon; Melbye, Mads; Jarvis, Deborah L.; Jaddoe, Vincent W. V.; Gieger, Christian; Strachan, David P.; Martin, Nicholas G.; Jarvelin, Marjo-Riitta; Heinrich, Joachim; Evans, David M.; Weidinger, Stephan
Atopic dermatitis (AD) is a commonly occurring chronic skin disease with high heritability. Apart from filaggrin (FLG), the genes influencing atopic dermatitis are largely unknown. We conducted a genome-wide association meta-analysis of 5,606 affected individuals and 20,565 controls from 16
Mahajan, Anubha; Go, Min Jin; Zhang, Weihua; Below, Jennifer E.; Gaulton, Kyle J.; Ferreira, Teresa; Horikoshi, Momoko; Johnson, Andrew D.; Ng, Maggie C. Y.; Prokopenko, Inga; Saleheen, Danish; Wang, Xu; Zeggini, Eleftheria; Abecasis, Goncalo R.; Adair, Linda S.; Almgren, Peter; Atalay, Mustafa; Aung, Tin; Baldassarre, Damiano; Balkau, Beverley; Bao, Yuqian; Barnett, Anthony H.; Barroso, Ines; Basit, Abdul; Been, Latonya F.; Beilby, John; Bell, Graeme I.; Benediktsson, Rafn; Bergman, Richard N.; Boehm, Bernhard O.; Boerwinkle, Eric; Bonnycastle, Lori L.; Burtt, Noël; Cai, Qiuyin; Campbell, Harry; Carey, Jason; Cauchi, Stephane; Caulfield, Mark; Chan, Juliana C. N.; Chang, Li-Ching; Chang, Tien-Jyun; Chang, Yi-Cheng; Charpentier, Guillaume; Chen, Chien-Hsiun; Chen, Han; Chen, Yuan-Tsong; Chia, Kee-Seng; Chidambaram, Manickam; Chines, Peter S.; Cho, Nam H.; Cho, Young Min; Chuang, Lee-Ming; Collins, Francis S.; Cornelis, Marilyn C.; Couper, David J.; Crenshaw, Andrew T.; van Dam, Rob M.; Danesh, John; Das, Debashish; de Faire, Ulf; Dedoussis, George; Deloukas, Panos; Dimas, Antigone S.; Dina, Christian; Doney, Alex S. F.; Donnelly, Peter J.; Dorkhan, Mozhgan; van Duijn, Cornelia; Dupuis, Josée; Edkins, Sarah; Elliott, Paul; Emilsson, Valur; Erbel, Raimund; Eriksson, Johan G.; Escobedo, Jorge; Esko, Tonu; Eury, Elodie; Florez, Jose C.; Fontanillas, Pierre; Forouhi, Nita G.; Forsen, Tom; Fox, Caroline; Fraser, Ross M.; Frayling, Timothy M.; Froguel, Philippe; Frossard, Philippe; Gao, Yutang; Gertow, Karl; Gieger, Christian; Gigante, Bruna; Grallert, Harald; Grant, George B.; Groop, Leif C.; Groves, Christopher J.; Grundberg, Elin; Guiducci, Candace; Hamsten, Anders; Han, Bok-Ghee; Hara, Kazuo; Hassanali, Neelam; Hattersley, Andrew T.; Hayward, Caroline; Hedman, Asa K.; Herder, Christian; Hofman, Albert; Holmen, Oddgeir L.; Hovingh, Kees; Hreidarsson, Astradur B.; Hu, Cheng; Hu, Frank B.; Hui, Jennie; Humphries, Steve E.; Hunt, Sarah E.; Hunter, David J.; Hveem, Kristian; Hydrie, Zafar I.; Ikegami, Hiroshi; Illig, Thomas; Ingelsson, Erik; Islam, Muhammed; Isomaa, Bo; Jackson, Anne U.; Jafar, Tazeen; James, Alan; Jia, Weiping; Jöckel, Karl-Heinz; Jonsson, Anna; Jowett, Jeremy B. M.; Kadowaki, Takashi; Kang, Hyun Min; Kanoni, Stavroula; Kao, Wen Hong L.; Kathiresan, Sekar; Kato, Norihiro; Katulanda, Prasad; Keinanen-Kiukaanniemi, Sirkka M.; Kelly, Ann M.; Khan, Hassan; Khaw, Kay-Tee; Khor, Chiea-Chuen; Kim, Hyung-Lae; Kim, Sangsoo; Kim, Young Jin; Kinnunen, Leena; Klopp, Norman; Kong, Augustine; Korpi-Hyövälti, Eeva; Kowlessur, Sudhir; Kraft, Peter; Kravic, Jasmina; Kristensen, Malene M.; Krithika, S.; Kumar, Ashish; Kumate, Jesus; Kuusisto, Johanna; Kwak, Soo Heon; Laakso, Markku; Lagou, Vasiliki; Lakka, Timo A.; Langenberg, Claudia; Langford, Cordelia; Lawrence, Robert; Leander, Karin; Lee, Jen-Mai; Lee, Nanette R.; Li, Man; Li, Xinzhong; Li, Yun; Liang, Junbin; Liju, Samuel; Lim, Wei-Yen; Lind, Lars; Lindgren, Cecilia M.; Lindholm, Eero; Liu, Ching-Ti; Liu, Jian Jun; Lobbens, Stéphane; Long, Jirong; Loos, Ruth J. F.; Lu, Wei; Luan, Jian'an; Lyssenko, Valeriya; Ma, Ronald C. W.; Maeda, Shiro; Mägi, Reedik; Männistö, Satu; Matthews, David R.; Meigs, James B.; Melander, Olle; Metspalu, Andres; Meyer, Julia; Mirza, Ghazala; Mihailov, Evelin; Moebus, Susanne; Mohan, Viswanathan; Mohlke, Karen L.; Morris, Andrew D.; Mühleisen, Thomas W.; Müller-Nurasyid, Martina; Musk, Bill; Nakamura, Jiro; Nakashima, Eitaro; Navarro, Pau; Ng, Peng-Keat; Nica, Alexandra C.; Nilsson, Peter M.; Njølstad, Inger; Nöthen, Markus M.; Ohnaka, Keizo; Ong, Twee Hee; Owen, Katharine R.; Palmer, Colin N. A.; Pankow, James S.; Park, Kyong Soo; Parkin, Melissa; Pechlivanis, Sonali; Pedersen, Nancy L.; Peltonen, Leena; Perry, John R. B.; Peters, Annette; Pinidiyapathirage, Janani M.; Platou, Carl G. P.; Potter, Simon; Price, Jackie F.; Qi, Lu; Radha, Venkatesan; Rallidis, Loukianos; Rasheed, Asif; Rathmann, Wolfgang; Rauramaa, Rainer; Raychaudhuri, Soumya; Rayner, N. William; Rees, Simon D.; Rehnberg, Emil; Ripatti, Samuli; Robertson, Neil; Roden, Michael; Rossin, Elizabeth J.; Rudan, Igor; Rybin, Denis; Saaristo, Timo E.; Salomaa, Veikko; Saltevo, Juha; Samuel, Maria; Sanghera, Dharambir K.; Saramies, Jouko; Scott, James; Scott, Laura J.; Scott, Robert A.; Segrè, Ayellet V.; Sehmi, Joban; Sennblad, Bengt; Shah, Nabi; Shah, Sonia; Shera, A. Samad; Shu, Xiao Ou; Shuldiner, Alan R.; Sigurðsson, Gunnar; Sijbrands, Eric; Silveira, Angela; Sim, Xueling; Sivapalaratnam, Suthesh; Small, Kerrin S.; So, Wing Yee; Stančáková, Alena; Stefansson, Kari; Steinbach, Gerald; Steinthorsdottir, Valgerdur; Stirrups, Kathleen; Strawbridge, Rona J.; Stringham, Heather M.; Sun, Qi; Suo, Chen; Syvänen, Ann-Christine; Takayanagi, Ryoichi; Takeuchi, Fumihiko; Tay, Wan Ting; Teslovich, Tanya M.; Thorand, Barbara; Thorleifsson, Gudmar; Thorsteinsdottir, Unnur; Tikkanen, Emmi; Trakalo, Joseph; Tremoli, Elena; Trip, Mieke D.; Tsai, Fuu Jen; Tuomi, Tiinamaija; Tuomilehto, Jaakko; Uitterlinden, Andre G.; Valladares-Salgado, Adan; Vedantam, Sailaja; Veglia, Fabrizio; Voight, Benjamin F.; Wang, Congrong; Wareham, Nicholas J.; Wennauer, Roman; Wickremasinghe, Ananda R.; Wilsgaard, Tom; Wilson, James F.; Wiltshire, Steven; Winckler, Wendy; Wong, Tien Yin; Wood, Andrew R.; Wu, Jer-Yuarn; Wu, Ying; Yamamoto, Ken; Yamauchi, Toshimasa; Yang, Mingyu; Yengo, Loic; Yokota, Mitsuhiro; Young, Robin; Zabaneh, Delilah; Zhang, Fan; Zhang, Rong; Zheng, Wei; Zimmet, Paul Z.; Altshuler, David; Bowden, Donald W.; Cho, Yoon Shin; Cox, Nancy J.; Cruz, Miguel; Hanis, Craig L.; Kooner, Jaspal; Lee, Jong-Young; Seielstad, Mark; teo, Yik Ying; Boehnke, Michael; Parra, Esteban J.; Chambers, John C.; Tai, E. Shyong; McCarthy, Mark I.; Morris, Andrew P.
To further understanding of the genetic basis of type 2 diabetes (T2D) susceptibility, we aggregated published meta-analyses of genome-wide association studies (GWAS), including 26,488 cases and 83,964 controls of European, east Asian, south Asian and Mexican and Mexican American ancestry. We
Paternoster, Lavinia; Standl, Marie; Chen, Chih-Mei
Atopic dermatitis (AD) is a commonly occurring chronic skin disease with high heritability. Apart from filaggrin (FLG), the genes influencing atopic dermatitis are largely unknown. We conducted a genome-wide association meta-analysis of 5,606 affected individuals and 20,565 controls from 16 popul...
Wang, Zhaoming; McGlynn, Katherine A.; Rajpert-De Meyts, Ewa
The international Testicular Cancer Consortium (TECAC) combined five published genome-wide association studies of testicular germ cell tumor (TGCT; 3,558 cases and 13,970 controls) to identify new susceptibility loci. We conducted a fixed-effects meta-analysis, including, to our knowledge, the fi...
Mahajan, Anubha; Go, Min Jin; Zhang, Weihua
To further understanding of the genetic basis of type 2 diabetes (T2D) susceptibility, we aggregated published meta-analyses of genome-wide association studies (GWAS), including 26,488 cases and 83,964 controls of European, east Asian, south Asian and Mexican and Mexican American ancestry. We obs...
Adult offspring from obese (OB) rat dams gain greater body weight and fat mass than controls when fed HFD. At PND21, we examined energy expenditure (EE) (indirect calorimetry), hepatic gene expression (microarrays), and changes in genome-wide and global DNA methylation (enrichment-coupled DNA seque...
Paternoster, L.; Standl, M.; Chen, C.M.; Ramasamy, A.; Bønnelykke, K.; Duijts, L.; Ferreira, M.A.; Couto Alves, A.; Thyssen, J.P.; Albrecht, E.; Baurecht, H.; Feenstra, B.; Sleiman, P.M.A.; Hysi, P.; Warrington, N.M.; Curjuric, I.; Myhre, R.; Curtin, J.A.; Groen-Blokhuis, M.M.; Kerkhof, M.; Sääf, A.; Franke, A.; Ellinghaus, D.; Fölster-Holst, R.; Dermitzakis, E.; Montgomery, S.B.; Prokisch, H.; Heim, K.; Hartikainen, A.-L.; Pouta, A.; Pekkanen, J.; Blakemore, A.I.F.; Buxton, J.L.; Kaakinen, M.; Duffy, DL; Madden, P.A.F.; Heath, A.C.; Montgomery, G.W.; Thompson, P.J.; Matheson, M.C.; Le Souëf, P.; St Pourcain, B.; Davey Smith, G.; Henderson, J.; Kemp, J.P.; Timpson, N.J.; Deloukas, P.; Ring, S.M.; Wichmann, H.-E.; Müller-Nurasyid, M.; Novak, N.; Klopp, N.; Rodríguez, E.; McArdle, W.; Linneberg, A.; Menné, T.; Nohr, E.A.; Hofman, A.; Uitterlinden, A.G.; van Duijn, C.M.; Rivadeneira, F.; de Jongste, J.C.; van der Valk, R.J.P.; Wjst, M.; Jogi, R.; Geller, F.; Boyd, H.A.; Murray, J.C.; Kim, C.; Mentch, F.; March, M.; Mangino, M.; Spector, T.D.; Bataille, V.; Pennell, C.E.; Holt, P.G.; Sly, P.; Tiesler, C.M.T.; Hottenga, J.J.; Boomsma, D.I.; Hakonarson, H.; Melbye, M.; Ljarvis, D.; Jaddoe, V.W.V.; Gieger, C.; Strachan, D.P.; Martin, N.G.; Jarvelin, M.-R.; Heinrich, J.; Evans, D.M.; Weidinger, S.
Atopic dermatitis (AD) is a commonly occurring chronic skin disease with high heritability. Apart from filaggrin (FLG), the genes influencing atopic dermatitis are largely unknown. We conducted a genome-wide association meta-analysis of 5,606 affected individuals and 20,565 controls from 16
Nielsen, Ronni; Mandrup, Susanne
The recent advances in high-throughput sequencing combined with various other technologies have allowed detailed and genome-wide insight into the transcriptional networks that control adipogenesis. Chromatin immunoprecipitation (ChIP) combined with high-throughput sequencing (ChIP-seq) is one of ...
Kim, Jin Hee; Park, Kyunghee; Yim, Seon-Hee; Choi, Jin Eun; Sung, Jae Sook; Park, Ju-Yeon; Choi, Yi Young; Jeon, Hyo-Sung; Park, Jae Yong; Yoon, Hyoung Kyu; Kim, Yeul Hong; Yoo, Byung Su; Kim, Young Tae; Hu, Hae-Jin; Chung, Yeun-Jun; Kim, Ho; Sung, Sook Whan; Hong, Yun-Chul
Lung cancer in never-smokers ranks as the seventh most common cause of cancer death worldwide, and the incidence of lung cancer in non-smoking Korean women appears to be steadily increasing. To identify the effect of genetic polymorphisms on lung cancer risk in non-smoking Korean women, we conducted a genome-wide association study of Korean female non-smokers with lung cancer. We analyzed 440,794 genotype data of 285 cases and 1,455 controls, and nineteen SNPs were associated with lung cancer development (P Korean non-smoking women.
Urrestarazu, Jorge; Muranty, Helene; Denance, Caroline; Leforestier, Diane; RAVON, ELISA; Guyader, Arnaud; Guisnel, Remi; Feugey, Laurence; Aubourg, Sebastien; Celton, Jean-Marc; Daccord, Nicolas; Dondini, Luca; Gregori, Roberto; Lateur, Marc; Houben, Patrick
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom (R) Apple 480 K SNP array. We ran GWAS with a multi-locus mi...
Jorge Urrestarazu; Jorge Urrestarazu; Jorge Urrestarazu; Hélène Muranty; Caroline Denancé; Diane Leforestier; Elisa Ravon; Arnaud Guyader; Rémi Guisnel; Laurence Feugey; Sébastien Aubourg; Jean-Marc Celton; Nicolas Daccord; Luca Dondini; Roberto Gregori
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed ...
Urrestarazu, Jorge; Muranty, Hélène; Denancé, Caroline; Leforestier, Diane; Ravon, Elisa; Guyader, Arnaud; Guisnel, Rémi; Feugey, Laurence; Aubourg, Sébastien; Celton, Jean-Marc; Daccord, Nicolas; Dondini, Luca; Gregori, Roberto; Lateur, Marc; Houben, Patrick
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed ...
Thomas Danielsen, E.; E. Møller, Morten; Yamanaka, Naoki
Steroid hormones control important developmental processes and are linked to many diseases. To systematically identify genes and pathways required for steroid production, we performed a Drosophila genome-wide in vivo RNAi screen and identified 1,906 genes with potential roles in steroidogenesis...... and developmental programs. In addition, we demonstrate the existence of an autophagosomal cholesterol mobilization mechanism and show that activation of this system rescues Niemann-Pick type C1 deficiency that causes a disorder characterized by cholesterol accumulation. These cholesterol-trafficking mechanisms...
Raphaëlle Teresa Matilde Maria Prinsen; Maria Giuseppina Strillacci; Fausta Schiavini; Attilio Rossoni; Birgit Gredler; Marlies Alexandra Dolezal; Anna Bieber; Alessandro Bagnato
Detecting Copy Number Variation (CNV) in cattle provides the opportunity to study their association with quantitative traits (Winchester et al., 2009; Zhang et al., 2009; Hou et al., 2011; Clop et al., 2012; de Almeida et al., 2016;). The aim of this study was to map CNVs in 1,410 Brown Swiss males and females using Illumina BovineHD Genotyping BeadChip data and to perform a genome-wide association analysis for production functional and health traits. After quality control, CNVs were called w...
Full Text Available Farnesoid X receptor (FXR, NR1H4 is a ligand-activated transcription factor, belonging to the nuclear receptor superfamily. FXR is highly expressed in the liver and is essential in regulating bile acid homeostasis. FXR deficiency is implicated in numerous liver diseases and mice with modulation of FXR have been used as animal models to study liver physiology and pathology. We have reported genome-wide binding of FXR in mice by chromatin immunoprecipitation - deep sequencing (ChIP-seq, with results indicating that FXR may be involved in regulating diverse pathways in liver. However, limited information exists for the functions of human FXR and the suitability of using murine models to study human FXR functions.In the current study, we performed ChIP-seq in primary human hepatocytes (PHHs treated with a synthetic FXR agonist, GW4064 or DMSO control. In parallel, RNA deep sequencing (RNA-seq and RNA microarray were performed for GW4064 or control treated PHHs and wild type mouse livers, respectively.ChIP-seq showed similar profiles of genome-wide FXR binding in humans and mice in terms of motif analysis and pathway prediction. However, RNA-seq and microarray showed more different transcriptome profiles between PHHs and mouse livers upon GW4064 treatment.In summary, we have established genome-wide human FXR binding and transcriptome profiles. These results will aid in determining the human FXR functions, as well as judging to what level the mouse models could be used to study human FXR functions.
Yu, Ying-Ying; Sun, Cui-Xiang; Liu, Yin-Kun; Li, Yan; Wang, Li; Zhang, Wei
To compare genome-wide DNA methylation profiles in ovary tissue from women with polycystic ovary syndrome (PCOS) and healthy controls. Case-control study matched for age and body mass index. University-affiliated hospital. Ten women with PCOS who underwent ovarian drilling to induce ovulation and 10 healthy women who were undergoing laparoscopic sterilization, hysterectomy for benign conditions, diagnostic laparoscopy for pelvic pain, or oophorectomy for nonovarian indications. None. Genome-wide DNA methylation patterns determined by immunoprecipitation and microarray (MeDIP-chip) analysis. The methylation levels were statistically significantly higher in CpG island shores (CGI shores), which lie outside of core promoter regions, and lower within gene bodies in women with PCOS relative to the controls. In addition, high CpG content promoters were the most frequently hypermethylated promoters in PCOS ovaries but were more often hypomethylated in controls. Second, 872 CGIs, specifically methylated in PCOS, represented 342 genes that could be associated with various molecular functions, including protein binding, hormone activity, and transcription regulator activity. Finally, methylation differences were validated in seven genes by methylation-specific polymerase chain reaction. These genes correlated to several functional families related to the pathogenesis of PCOS and may be potential biomarkers for this disease. Our results demonstrated that epigenetic modification differs between PCOS and normal ovaries, which may help to further understand the pathophysiology of this disease. Copyright © 2015 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Jennifer K Lowe
Full Text Available It has been argued that the limited genetic diversity and reduced allelic heterogeneity observed in isolated founder populations facilitates discovery of loci contributing to both Mendelian and complex disease. A strong founder effect, severe isolation, and substantial inbreeding have dramatically reduced genetic diversity in natives from the island of Kosrae, Federated States of Micronesia, who exhibit a high prevalence of obesity and other metabolic disorders. We hypothesized that genetic drift and possibly natural selection on Kosrae might have increased the frequency of previously rare genetic variants with relatively large effects, making these alleles readily detectable in genome-wide association analysis. However, mapping in large, inbred cohorts introduces analytic challenges, as extensive relatedness between subjects violates the assumptions of independence upon which traditional association test statistics are based. We performed genome-wide association analysis for 15 quantitative traits in 2,906 members of the Kosrae population, using novel approaches to manage the extreme relatedness in the sample. As positive controls, we observe association to known loci for plasma cholesterol, triglycerides, and C-reactive protein and to a compelling candidate loci for thyroid stimulating hormone and fasting plasma glucose. We show that our study is well powered to detect common alleles explaining >/=5% phenotypic variance. However, no such large effects were observed with genome-wide significance, arguing that even in such a severely inbred population, common alleles typically have modest effects. Finally, we show that a majority of common variants discovered in Caucasians have indistinguishable effect sizes on Kosrae, despite the major differences in population genetics and environment.
Lim, J H; Kim, D J; Lee, D E; Han, J Y; Chung, J H; Ahn, H K; Lee, S W; Lim, D H; Lee, Y S; Park, S Y; Ryu, H M
Down syndrome (DS) is the most common aneuploidy, caused by an extra copy of all or part of chromosome 21 (chr21). Differential microRNA (miRNA) expression is involved in many human diseases including DS. However, the genome-wide changes in miRNA expression in DS fetal placentas have yet to be determined, and the function of these changes is also unclear. We profiled genome-wide miRNA expression in placenta samples from euploid or DS fetuses by using microarray technology and predicted the functions of differentially expressed miRNAs using bioinformatics tools. Thirty-four miRNAs were significantly differentially expressed in the DS placenta compared with the normal placenta (16 up-regulated and 18 down-regulated). However, expression of chr21-derived miRNAs did not change. Predicted target genes included 7434 genes targeted by up-regulated miRNAs and 6071 genes targeted by down-regulated miRNAs. Seventy-six of these target genes were located on chr21 (10 genes controlled by down-regulated miRNAs and 34 genes by up-regulated miRNAs, and 32 genes by both). Target genes on chr21 were significantly associated with DS and DS-related disorders, such as mental retardation, neurobehavioral manifestations, and congenital abnormalities. To our knowledge, this is the first genome-wide study to comprehensively survey placental miRNAs in DS fetuses. Our results provide new insight into miRNA expression in placentas of fetuses with DS. Additionally, our findings indicate that the differentially expressed miRNAs in the DS placenta may potentially affect various pathways related to DS pathogenesis. Copyright © 2015 Elsevier Ltd. All rights reserved.
Dalrymple Brian P
Full Text Available Abstract Background Gene regulation by transcription factors (TF is species, tissue and time specific. To better understand how the genetic code controls gene expression in bovine muscle we associated gene expression data from developing Longissimus thoracis et lumborum skeletal muscle with bovine promoter sequence information. Results We created a highly conserved genome-wide promoter landscape comprising 87,408 interactions relating 333 TFs with their 9,242 predicted target genes (TGs. We discovered that the complete set of predicted TGs share an average of 2.75 predicted TF binding sites (TFBSs and that the average co-expression between a TF and its predicted TGs is higher than the average co-expression between the same TF and all genes. Conversely, pairs of TFs sharing predicted TGs showed a co-expression correlation higher that pairs of TFs not sharing TGs. Finally, we exploited the co-occurrence of predicted TFBS in the context of muscle-derived functionally-coherent modules including cell cycle, mitochondria, immune system, fat metabolism, muscle/glycolysis, and ribosome. Our findings enabled us to reverse engineer a regulatory network of core processes, and correctly identified the involvement of E2F1, GATA2 and NFKB1 in the regulation of cell cycle, fat, and muscle/glycolysis, respectively. Conclusion The pivotal implication of our research is two-fold: (1 there exists a robust genome-wide expression signal between TFs and their predicted TGs in cattle muscle consistent with the extent of promoter sharing; and (2 this signal can be exploited to recover the cellular mechanisms underpinning transcription regulation of muscle structure and development in bovine. Our study represents the first genome-wide report linking tissue specific co-expression to co-regulation in a non-model vertebrate.
Saura, María; Fernández, Almudena; Rodríguez, M Carmen; Toro, Miguel A; Barragán, Carmen; Fernández, Ana I; Villanueva, Beatriz
Maintaining genetic variation and controlling the increase in inbreeding are crucial requirements in animal conservation programs. The most widely accepted strategy for achieving these objectives is to maximize the effective population size by minimizing the global coancestry obtained from a particular pedigree. However, for most natural or captive populations genealogical information is absent. In this situation, microsatellites have been traditionally the markers of choice to characterize genetic variation, and several estimators of genealogical coefficients have been developed using marker data, with unsatisfactory results. The development of high-throughput genotyping techniques states the necessity of reviewing the paradigm that genealogical coancestry is the best parameter for measuring genetic diversity. In this study, the Illumina PorcineSNP60 BeadChip was used to obtain genome-wide estimates of rates of coancestry and inbreeding and effective population size for an ancient strain of Iberian pigs that is now in serious danger of extinction and for which very accurate genealogical information is available (the Guadyerbas strain). Genome-wide estimates were compared with those obtained from microsatellite and from pedigree data. Estimates of coancestry and inbreeding computed from the SNP chip were strongly correlated with genealogical estimates and these correlations were substantially higher than those between microsatellite and genealogical coefficients. Also, molecular coancestry computed from SNP information was a better predictor of genealogical coancestry than coancestry computed from microsatellites. Rates of change in coancestry and inbreeding and effective population size estimated from molecular data were very similar to those estimated from genealogical data. However, estimates of effective population size obtained from changes in coancestry or inbreeding differed. Our results indicate that genome-wide information represents a useful alternative
Full Text Available Although family history is a risk factor for pancreatic adenocarcinoma, much of the genetic etiology of this disease remains unknown. While genome-wide association studies have identified some common single nucleotide polymorphisms (SNPs associated with pancreatic cancer risk, these SNPs do not explain all the heritability of this disease. We hypothesized that copy number variation (CNVs in the genome may play a role in genetic predisposition to pancreatic adenocarcinoma. Here, we report a genome-wide analysis of CNVs in a small hospital-based, European ancestry cohort of pancreatic cancer cases and controls. Germline CNV discovery was performed using the Illumina Human CNV370 platform in 223 pancreatic cancer cases (both sporadic and familial and 169 controls. Following stringent quality control, we asked if global CNV burden was a risk factor for pancreatic cancer. Finally, we performed in silico CNV genotyping and association testing to discover novel CNV risk loci. When we examined the global CNV burden, we found no strong evidence that CNV burden plays a role in pancreatic cancer risk either overall or specifically in individuals with a family history of the disease. Similarly, we saw no significant evidence that any particular CNV is associated with pancreatic cancer risk. Taken together, these data suggest that CNVs do not contribute substantially to the genetic etiology of pancreatic cancer, though the results are tempered by small sample size and large experimental variability inherent in array-based CNV studies
Li, Changgui; Li, Zhiqiang; Liu, Shiguo; Wang, Can; Han, Lin; Cui, Lingling; Zhou, Jingguo; Zou, Hejian; Liu, Zhen; Chen, Jianhua; Cheng, Xiaoyu; Zhou, Zhaowei; Ding, Chengcheng; Wang, Meng; Chen, Tong; Cui, Ying; He, Hongmei; Zhang, Keke; Yin, Congcong; Wang, Yunlong; Xing, Shichao; Li, Baojie; Ji, Jue; Jia, Zhaotong; Ma, Lidan; Niu, Jiapeng; Xin, Ying; Liu, Tian; Chu, Nan; Yu, Qing; Ren, Wei; Wang, Xuefeng; Zhang, Aiqing; Sun, Yuping; Wang, Haili; Lu, Jie; Li, Yuanyuan; Qing, Yufeng; Chen, Gang; Wang, Yangang; Zhou, Li; Niu, Haitao; Liang, Jun; Dong, Qian; Li, Xinde; Mi, Qing-Sheng; Shi, Yongyong
Gout is one of the most common types of inflammatory arthritis, caused by the deposition of monosodium urate crystals in and around the joints. Previous genome-wide association studies (GWASs) have identified many genetic loci associated with raised serum urate concentrations. However, hyperuricemia alone is not sufficient for the development of gout arthritis. Here we conduct a multistage GWAS in Han Chinese using 4,275 male gout patients and 6,272 normal male controls (1,255 cases and 1,848 controls were genome-wide genotyped), with an additional 1,644 hyperuricemic controls. We discover three new risk loci, 17q23.2 (rs11653176, P=1.36 × 10−13, BCAS3), 9p24.2 (rs12236871, P=1.48 × 10−10, RFX3) and 11p15.5 (rs179785, P=1.28 × 10−8, KCNQ1), which contain inflammatory candidate genes. Our results suggest that these loci are most likely related to the progression from hyperuricemia to inflammatory gout, which will provide new insights into the pathogenesis of gout arthritis. PMID:25967671
Full Text Available Abstract Background The amount of data generated from genome-wide association studies (GWAS has grown rapidly, but considerations for GWAS phenotype data reuse and interchange have not kept pace. This impacts on the work of GWAS Central – a free and open access resource for the advanced querying and comparison of summary-level genetic association data. The benefits of employing ontologies for standardising and structuring data are widely accepted. The complex spectrum of observed human phenotypes (and traits, and the requirement for cross-species phenotype comparisons, calls for reflection on the most appropriate solution for the organisation of human phenotype data. The Semantic Web provides standards for the possibility of further integration of GWAS data and the ability to contribute to the web of Linked Data. Results A pragmatic consideration when applying phenotype ontologies to GWAS data is the ability to retrieve all data, at the most granular level possible, from querying a single ontology graph. We found the Medical Subject Headings (MeSH terminology suitable for describing all traits (diseases and medical signs and symptoms at various levels of granularity and the Human Phenotype Ontology (HPO most suitable for describing phenotypic abnormalities (medical signs and symptoms at the most granular level. Diseases within MeSH are mapped to HPO to infer the phenotypic abnormalities associated with diseases. Building on the rich semantic phenotype annotation layer, we are able to make cross-species phenotype comparisons and publish a core subset of GWAS data as RDF nanopublications. Conclusions We present a methodology for applying phenotype annotations to a comprehensive genome-wide association dataset and for ensuring compatibility with the Semantic Web. The annotations are used to assist with cross-species genotype and phenotype comparisons. However, further processing and deconstructions of terms may be required to facilitate automatic
Stricker, Georg; Engelhardt, Alexander; Schulz, Daniel; Schmid, Matthias; Tresch, Achim; Gagneur, Julien
Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) is a widely used approach to study protein-DNA interactions. Often, the quantities of interest are the differential occupancies relative to controls, between genetic backgrounds, treatments, or combinations thereof. Current methods for differential occupancy of ChIP-Seq data rely however on binning or sliding window techniques, for which the choice of the window and bin sizes are subjective. Here, we present GenoGAM (Genome-wide Generalized Additive Model), which brings the well-established and flexible generalized additive models framework to genomic applications using a data parallelism strategy. We model ChIP-Seq read count frequencies as products of smooth functions along chromosomes. Smoothing parameters are objectively estimated from the data by cross-validation, eliminating ad hoc binning and windowing needed by current approaches. GenoGAM provides base-level and region-level significance testing for full factorial designs. Application to a ChIP-Seq dataset in yeast showed increased sensitivity over existing differential occupancy methods while controlling for type I error rate. By analyzing a set of DNA methylation data and illustrating an extension to a peak caller, we further demonstrate the potential of GenoGAM as a generic statistical modeling tool for genome-wide assays. Software is available from Bioconductor: https://www.bioconductor.org/packages/release/bioc/html/GenoGAM.html . firstname.lastname@example.org. Supplementary information is available at Bioinformatics online.
Todd, John A; Walker, Neil M; Cooper, Jason D; Smyth, Deborah J; Downes, Kate; Plagnol, Vincent; Bailey, Rebecca; Nejentsev, Sergey; Field, Sarah F; Payne, Felicity; Lowe, Christopher E; Szeszko, Jeffrey S; Hafler, Jason P; Zeitels, Lauren; Yang, Jennie H M; Vella, Adrian; Nutland, Sarah; Stevens, Helen E; Schuilenburg, Helen; Coleman, Gillian; Maisuria, Meeta; Meadows, William; Smink, Luc J; Healy, Barry; Burren, Oliver S; Lam, Alex A C; Ovington, Nigel R; Allen, James; Adlem, Ellen; Leung, Hin-Tak; Wallace, Chris; Howson, Joanna M M; Guja, Cristian; Ionescu-Tirgoviste, Constantin; Simmonds, Matthew J; Heward, Joanne M; Gough, Stephen CL; Dunger, David B; Wicker, Linda S; Clayton, David G
The Wellcome Trust Case Control Consortium (WTCCC) primary genome-wide association (GWA) scan1 on seven diseases, including the multifactorial, autoimmune disease, type 1 diabetes (T1D), shows significant association (P < 5 × 10−7 between T1D and six chromosome regions: 12q24, 12q13, 16p13, 18p11, 12p13 and 4q27. Here, we attempted to validate these and six other top findings in 4,000 individuals with T1D, 5,000 controls and 2,997 family trios that were independent of the WTCCC study. We confirmed unequivocally the associations of 12q24, 12q13, 16p13 and 18p11 (Pfollow-up ≤ 1.35 × 10−9; Poverall ≤ 1.15 × 10−14), leaving eight regions with small effects or false-positive associations with T1D. We also obtained evidence for chromosome 18q22 (Poverall = 1.38 × 10−8) from a genome-wide association study of nonsynonymous SNPs. Several regions, including 18q22 and 18p11, showed association with autoimmune thyroid disease. This study increases the number of T1D loci with compelling evidence from six to at least ten. PMID:17554260
Full Text Available Abstract Background Addison's disease (AD is caused by an autoimmune destruction of the adrenal cortex. The pathogenesis is multi-factorial, involving genetic components and hitherto unknown environmental factors. The aim of the present study was to investigate if gene dosage in the form of copy number variation (CNV could add to the repertoire of genetic susceptibility to autoimmune AD. Methods A genome-wide study using the Affymetrix GeneChip® Genome-Wide Human SNP Array 6.0 was conducted in 26 patients with AD. CNVs in selected genes were further investigated in a larger material of patients with autoimmune AD (n = 352 and healthy controls (n = 353 by duplex Taqman real-time polymerase chain reaction assays. Results We found that low copy number of UGT2B28 was significantly more frequent in AD patients compared to controls; conversely high copy number of ADAM3A was associated with AD. Conclusions We have identified two novel CNV associations to ADAM3A and UGT2B28 in AD. The mechanism by which this susceptibility is conferred is at present unclear, but may involve steroid inactivation (UGT2B28 and T cell maturation (ADAM3A. Characterization of these proteins may unravel novel information on the pathogenesis of autoimmunity.
Bønnelykke, Klaus; Matheson, Melanie C; Pers, Tune Hannes
and HLA-B. All the top SNPs were associated with allergic symptoms in an independent study. Risk-associated variants at these ten loci were estimated to account for at least 25% of allergic sensitization and allergic rhinitis. Understanding the molecular mechanisms underlying these associations may......Allergen-specific immunoglobulin E (present in allergic sensitization) has a central role in the pathogenesis of allergic disease. We performed the first large-scale genome-wide association study (GWAS) of allergic sensitization in 5,789 affected individuals and 10,056 controls and followed up...... the top SNP at each of 26 loci in 6,114 affected individuals and 9,920 controls. We increased the number of susceptibility loci with genome-wide significant association with allergic sensitization from three to ten, including SNPs in or near TLR6, C11orf30, STAT6, SLC25A46, HLA-DQB1, IL1RL1, LPP, MYC, IL2...
Green, Elaine K; Di Florio, Arianna; Forty, Liz; Gordon-Smith, Katherine; Grozeva, Detelina; Fraser, Christine; Richards, Alexander L; Moran, Jennifer L; Purcell, Shaun; Sklar, Pamela; Kirov, George; Owen, Michael J; O'Donovan, Michael C; Craddock, Nick; Jones, Lisa; Jones, Ian R
Studies have suggested that Research Diagnostic Criteria for Schizoaffective Disorder Bipolar type (RDC-SABP) might identify a more genetically homogenous subgroup of bipolar disorder. Aiming to identify loci associated with RDC-SABP, we have performed a replication study using independent RDC-SABP cases (n = 144) and controls (n = 6,559), focusing on the 10 loci that reached a p-value disorder sample. Combining the WTCCC and replication datasets by meta-analysis (combined RDC-SABP, n = 423, controls, n = 9,494), we observed genome-wide significant association at one SNP, rs2352974, located within the intron of the gene TRAIP on chromosome 3p21.31 (p-value, 4.37 × 10 -8 ). This locus did not reach genome-wide significance in bipolar disorder or schizophrenia large Psychiatric Genomic Consortium datasets, suggesting that it may represent a relatively specific genetic risk for the bipolar subtype of schizoaffective disorder. © 2017 Wiley Periodicals, Inc.
Johansen, Christopher T.; Wang, Jian; Lanktree, Matthew B.; Cao, Henian; McIntyre, Adam D.; Ban, Matthew R.; Martins, Rebecca A.; Kennedy, Brooke A.; Hassell, Reina G.; Visser, Maartje E.; Schwartz, Stephen M.; Voight, Benjamin F.; Elosua, Roberto; Salomaa, Veikko; O'Donnell, Christopher J.; Dallinga-Thie, Geesje M.; Anand, Sonia S.; Yusuf, Salim; Huff, Murray W.; Kathiresan, Sekar; Hegele, Robert A.
Genome-wide association studies (GWAS) have replicably identified multiple loci associated with population-based plasma lipid concentrations1-5. Common genetic variants at these loci together explain HTG) patients revealed that common variants in APOA5, GCKR, LPL and APOB genes were associated with the HTG phenotype at genome-wide significance. We subsequently resequenced protein coding regions of these genes and found a significant burden of 154 rare missense or nonsense variants in 438 HTG patients, in contrast to 53 variants in 327 controls (P=6.2X10-8); this corresponds to a carrier frequency of 28.1% of HTG patients and 15.3% of controls (P=2.6X10-5). Many rare variants were predicted in silico to have compromised function; additionally some had previously demonstrated dysfunctionality in vitro. Rare variants in these 4 genes explained 1.1% of total variation in HTG diagnoses. Our study demonstrates a marked mutation skew that likely contributes to disease pathophysiology in patients with HTG. PMID:20657596
De Vivo, Immaculata; Prescott, Jennifer; Setiawan, Veronica Wendy; Olson, Sara H; Wentzensen, Nicolas; Attia, John; Black, Amanda; Brinton, Louise; Chen, Chu; Chen, Constance; Cook, Linda S; Crous-Bou, Marta; Doherty, Jennifer; Dunning, Alison M; Easton, Douglas F; Friedenreich, Christine M; Garcia-Closas, Montserrat; Gaudet, Mia M; Haiman, Christopher; Hankinson, Susan E; Hartge, Patricia; Henderson, Brian E; Holliday, Elizabeth; Horn-Ross, Pamela L; Hunter, David J; Le Marchand, Loic; Liang, Xiaolin; Lissowska, Jolanta; Long, Jirong; Lu, Lingeng; Magliocco, Anthony M; McEvoy, Mark; O'Mara, Tracy A; Orlow, Irene; Painter, Jodie N; Pooler, Loreall; Rastogi, Radhai; Rebbeck, Timothy R; Risch, Harvey; Sacerdote, Carlotta; Schumacher, Fredrick; Scott, Rodney J; Sheng, Xin; Shu, Xiao-ou; Spurdle, Amanda B; Thompson, Deborah; Vanden Berg, David; Weiss, Noel S; Xia, Lucy; Xiang, Yong-Bing; Yang, Hannah P; Yu, Herbert; Zheng, Wei; Chanock, Stephen; Kraft, Peter
Endometrial cancer (EC), a neoplasm of the uterine epithelial lining, is the most common gynecological malignancy in developed countries and the fourth most common cancer among US women. Women with a family history of EC have an increased risk for the disease, suggesting that inherited genetic factors play a role. We conducted a two-stage genome-wide association study of Type I EC. Stage 1 included 5,472 women (2,695 cases and 2,777 controls) of European ancestry from seven studies. We selected independent single-nucleotide polymorphisms (SNPs) that displayed the most significant associations with EC in Stage 1 for replication among 17,948 women (4,382 cases and 13,566 controls) in a multiethnic population (African America, Asian, Latina, Hawaiian and European ancestry), from nine studies. Although no novel variants reached genome-wide significance, we replicated previously identified associations with genetic markers near the HNF1B locus. Our findings suggest that larger studies with specific tumor classification are necessary to identify novel genetic polymorphisms associated with EC susceptibility.
Bao, Feng; Deng, Yue; Zhao, Yanyu; Suo, Jinli; Dai, Qionghai
In genome-wide association studies (GWAS), the acquired sequential data may exhibit imbalance structure: abundant control vs. limited case samples. Such sample imbalance issue is particularly serious when investigating rare diseases or common diseases on rare populations. Conventional GWAS methods may suffer from severe statistic biases to the major group, leading to power losses in uncovering true suspicious loci. We introduce a boosting correction method termed as Bosco to deal with such imbalanced problem. Bosco is motivated by the boost learning theory in machine learning and is implemented in a coarse-to-fine learning framework: the coarse step assigns importance scores for all samples in the major group and the fine step calculates P -values by a weighted logistic regression. On simulated data sets, we demonstrate the proposed methods can dramatically improve the discovery power even on extremely imbalanced datasets, with well controlling the false positives. The Bosco is also applied to a genome-scale gastric cancer data set to conduct genome-wide analysis. Our method replicates existing reported findings (from the likelihood ratio test) with high statistical significance and shows the ability to identify new suspicious SNPs.
Stahl, Eli A.; Raychaudhuri, Soumya; Remmers, Elaine F.; Xie, Gang; Eyre, Stephen; Thomson, Brian P.; Li, Yonghong; Kurreeman, Fina A. S.; Zhernakova, Alexandra; Hinks, Anne; Guiducci, Candace; Chen, Robert; Alfredsson, Lars; Amos, Christopher I.; Ardlie, Kristin G.; Barton, Anne; Bowes, John; Brouwer, Elisabeth; Burtt, Noel P.; Catanese, Joseph J.; Coblyn, Jonathan; Coenen, Marieke JH; Costenbader, Karen H.; Criswell, Lindsey A.; Crusius, J. Bart A.; Cui, Jing; de Bakker, Paul I.W.; De Jager, Phillip L.; Ding, Bo; Emery, Paul; Flynn, Edward; Harrison, Pille; Hocking, Lynne J.; Huizinga, Tom W. J.; Kastner, Daniel L.; Ke, Xiayi; Lee, Annette T.; Liu, Xiangdong; Martin, Paul; Morgan, Ann W.; Padyukov, Leonid; Posthumus, Marcel D.; Radstake, Timothy RDJ; Reid, David M.; Seielstad, Mark; Seldin, Michael F.; Shadick, Nancy A.; Steer, Sophia; Tak, Paul P.; Thomson, Wendy; van der Helm-van Mil, Annette H. M.; van der Horst-Bruinsma, Irene E.; van der Schoot, C. Ellen; van Riel, Piet LCM; Weinblatt, Michael E.; Wilson, Anthony G.; Wolbink, Gert Jan; Wordsworth, Paul; Wijmenga, Cisca; Karlson, Elizabeth W.; Toes, Rene E. M.; de Vries, Niek; Begovich, Ann B.; Worthington, Jane; Siminovitch, Katherine A.; Gregersen, Peter K.; Klareskog, Lars; Plenge, Robert M.
To identify novel genetic risk factors for rheumatoid arthritis (RA), we conducted a genome-wide association study (GWAS) meta-analysis of 5,539 autoantibody positive RA cases and 20,169 controls of European descent, followed by replication in an independent set of 6,768 RA cases and 8,806 controls. Of 34 SNPs selected for replication, 7 novel RA risk alleles were identified at genome-wide significance (P<5×10−8) in analysis of all 41,282 samples. The associated SNPs are near genes of known immune function, including IL6ST, SPRED2, RBPJ, CCR6, IRF5, and PXK. We also refined the risk alleles at two established RA risk loci (IL2RA and CCL21) and confirmed the association at AFF3. These new associations bring the total number of confirmed RA risk loci to 31 among individuals of European ancestry. An additional 11 SNPs replicated at P<0.05, many of which are validated autoimmune risk alleles, suggesting that most represent bona fide RA risk alleles. PMID:20453842
Spain, S L; Pedroso, I; Kadeva, N; Miller, M B; Iacono, W G; McGue, M; Stergiakouli, E; Davey Smith, G; Putallaz, M; Lubinski, D; Meaburn, E L; Plomin, R; Simpson, M A
Although individual differences in intelligence (general cognitive ability) are highly heritable, molecular genetic analyses to date have had limited success in identifying specific loci responsible for its heritability. This study is the first to investigate exome variation in individuals of extremely high intelligence. Under the quantitative genetic model, sampling from the high extreme of the distribution should provide increased power to detect associations. We therefore performed a case-control association analysis with 1409 individuals drawn from the top 0.0003 (IQ >170) of the population distribution of intelligence and 3253 unselected population-based controls. Our analysis focused on putative functional exonic variants assayed on the Illumina HumanExome BeadChip. We did not observe any individual protein-altering variants that are reproducibly associated with extremely high intelligence and within the entire distribution of intelligence. Moreover, no significant associations were found for multiple rare alleles within individual genes. However, analyses using genome-wide similarity between unrelated individuals (genome-wide complex trait analysis) indicate that the genotyped functional protein-altering variation yields a heritability estimate of 17.4% (s.e. 1.7%) based on a liability model. In addition, investigation of nominally significant associations revealed fewer rare alleles associated with extremely high intelligence than would be expected under the null hypothesis. This observation is consistent with the hypothesis that rare functional alleles are more frequently detrimental than beneficial to intelligence.
Goelzer, Anne; Muntel, Jan; Chubukov, Victor; Jules, Matthieu; Prestel, Eric; Nölker, Rolf; Mariadassou, Mahendra; Aymerich, Stéphane; Hecker, Michael; Noirot, Philippe; Becher, Dörte; Fromion, Vincent
Predicting resource allocation between cell processes is the primary step towards decoding the evolutionary constraints governing bacterial growth under various conditions. Quantitative prediction at genome-scale remains a computational challenge as current methods are limited by the tractability of the problem or by simplifying hypotheses. Here, we show that the constraint-based modeling method Resource Balance Analysis (RBA), calibrated using genome-wide absolute protein quantification data, accurately predicts resource allocation in the model bacterium Bacillus subtilis for a wide range of growth conditions. The regulation of most cellular processes is consistent with the objective of growth rate maximization except for a few suboptimal processes which likely integrate more complex objectives such as coping with stressful conditions and survival. As a proof of principle by using simulations, we illustrated how calibrated RBA could aid rational design of strains for maximizing protein production, offering new opportunities to investigate design principles in prokaryotes and to exploit them for biotechnological applications. Copyright © 2015 International Metabolic Engineering Society. Published by Elsevier Inc. All rights reserved.
Emily R Davenport
Full Text Available The bacterial composition of the human fecal microbiome is influenced by many lifestyle factors, notably diet. It is less clear, however, what role host genetics plays in dictating the composition of bacteria living in the gut. In this study, we examined the association of ~200K host genotypes with the relative abundance of fecal bacterial taxa in a founder population, the Hutterites, during two seasons (n = 91 summer, n = 93 winter, n = 57 individuals collected in both. These individuals live and eat communally, minimizing variation due to environmental exposures, including diet, which could potentially mask small genetic effects. Using a GWAS approach that takes into account the relatedness between subjects, we identified at least 8 bacterial taxa whose abundances were associated with single nucleotide polymorphisms in the host genome in each season (at genome-wide FDR of 20%. For example, we identified an association between a taxon known to affect obesity (genus Akkermansia and a variant near PLD1, a gene previously associated with body mass index. Moreover, we replicate a previously reported association from a quantitative trait locus (QTL mapping study of fecal microbiome abundance in mice (genus Lactococcus, rs3747113, P = 3.13 x 10-7. Finally, based on the significance distribution of the associated microbiome QTLs in our study with respect to chromatin accessibility profiles, we identified tissues in which host genetic variation may be acting to influence bacterial abundance in the gut.
Full Text Available Abstract Background With the availability of large-scale genome-wide association study (GWAS data, choosing an optimal set of SNPs for disease susceptibility prediction is a challenging task. This study aimed to use single nucleotide polymorphisms (SNPs to predict psoriasis from searching GWAS data. Methods Totally we had 2,798 samples and 451,724 SNPs. Process for searching a set of SNPs to predict susceptibility for psoriasis consisted of two steps. The first one was to search top 1,000 SNPs with high accuracy for prediction of psoriasis from GWAS dataset. The second one was to search for an optimal SNP subset for predicting psoriasis. The sequential information bottleneck (sIB method was compared with classical linear discriminant analysis(LDA for classification performance. Results The best test harmonic mean of sensitivity and specificity for predicting psoriasis by sIB was 0.674(95% CI: 0.650-0.698, while only 0.520(95% CI: 0.472-0.524 was reported for predicting disease by LDA. Our results indicate that the new classifier sIB performs better than LDA in the study. Conclusions The fact that a small set of SNPs can predict disease status with average accuracy of 68% makes it possible to use SNP data for psoriasis prediction.
Davenport, Emily R.; Cusanovich, Darren A.; Michelini, Katelyn; Barreiro, Luis B.; Ober, Carole; Gilad, Yoav
The bacterial composition of the human fecal microbiome is influenced by many lifestyle factors, notably diet. It is less clear, however, what role host genetics plays in dictating the composition of bacteria living in the gut. In this study, we examined the association of ~200K host genotypes with the relative abundance of fecal bacterial taxa in a founder population, the Hutterites, during two seasons (n = 91 summer, n = 93 winter, n = 57 individuals collected in both). These individuals live and eat communally, minimizing variation due to environmental exposures, including diet, which could potentially mask small genetic effects. Using a GWAS approach that takes into account the relatedness between subjects, we identified at least 8 bacterial taxa whose abundances were associated with single nucleotide polymorphisms in the host genome in each season (at genome-wide FDR of 20%). For example, we identified an association between a taxon known to affect obesity (genus Akkermansia) and a variant near PLD1, a gene previously associated with body mass index. Moreover, we replicate a previously reported association from a quantitative trait locus (QTL) mapping study of fecal microbiome abundance in mice (genus Lactococcus, rs3747113, P = 3.13 x 10−7). Finally, based on the significance distribution of the associated microbiome QTLs in our study with respect to chromatin accessibility profiles, we identified tissues in which host genetic variation may be acting to influence bacterial abundance in the gut. PMID:26528553
Quintales, Luis; Vázquez, Enrique; Antequera, Francisco
Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several experimental protocols have been developed for this purpose that include the enzymatic digestion, chemical cleavage or immunoprecipitation of chromatin followed by next-generation sequencing of the resulting DNA fragments. Here, we compare the performance and resolution of these methods from the initial biochemical steps through the alignment of the millions of short-sequence reads to a reference genome to the final computational analysis to generate genome-wide maps of nucleosome occupancy. Because of the lack of a unified protocol to process data sets obtained through the different approaches, we have developed a new computational tool (NUCwave), which facilitates their analysis, comparison and assessment and will enable researchers to choose the most suitable method for any particular purpose. NUCwave is freely available at http://nucleosome.usal.es/nucwave along with a step-by-step protocol for its use. © The Author 2014. Published by Oxford University Press. For Permissions, please email: email@example.com.
Full Text Available Allergic diseases are complex diseases caused by a combination of genetic and environmental factors. To determine the genetic components of these diseases and to discover the genes and cellular pathways underlying them, a large number of genetic studies have been conducted. Progress in genetics enables us to conduct genome-wide association studies (GWASs, which is a comprehensive and unbiased approach to identify susceptibility loci for multifactorial diseases. Recent GWASs have convincingly detected a large number of loci associated with allergic diseases. Candidate genes in the susceptibility loci suggest roles for epithelial barrier functions, innate-adaptive immunity, IL-1 family signaling, regulatory T cells and the vitamin D pathway in the pathogenesis of allergic diseases. Interestingly, the IL1RL1, HLA, IL13 and C11orf30 regions are overlapping susceptibility loci among atopic dermatitis and asthma or allergic rhinitis. Although a more complete collection of associated genes and pathways is needed, biologic insights revealed by GWASs improve our understanding of the pathophysiology of human allergic diseases and contribute to the development of better treatment and preventive strategies.
Terracciano, Antonio; Tanaka, Toshiko; Sutin, Angelina R.; Sanna, Serena; Deiana, Barbara; Lai, Sandra; Uda, Manuela; Schlessinger, David; Abecasis, Gonçalo R.; Ferrucci, Luigi; Costa, Paul T.
Background Independent of temporal circumstances, some individuals have greater susceptibility to depressive affects, such as feelings of guilt, sadness, hopelessness, and loneliness. Identifying the genetic variants that contribute to these individual differences can point to biological pathways etiologically involved in psychiatric disorders. Methods Genome-wide association scans (GWA or GWAS) for the Depression scale of the Revised NEO Personality Inventory (NEO-PI-R) in community-based samples from a genetically homogeneous area of Sardinia, Italy (N = 3,972) and from the Baltimore Longitudinal Study of Aging in the US (N = 839). Results Meta-analytic results for genotyped or imputed single nucleotide polymorphisms (SNPs) indicate that the strongest association signals for trait depression were found in RORA (rs12912233; p= 6 × 10−7), a gene involved in circadian rhythm. A plausible biological association was also found with SNPs within GRM8 (rs17864092; p = 5 × 10−6), a metabotropic receptor for glutamate, a major excitatory neurotransmitter in the central nervous system. Conclusions These findings suggest shared genetic basis underlying the continuum from personality traits to psychopathology. PMID:20800221
Woolliams John A
Full Text Available Abstract Partial least square regression (PLSR and principal component regression (PCR are methods designed for situations where the number of predictors is larger than the number of records. The aim was to compare the accuracy of genome-wide breeding values (EBV produced using PLSR and PCR with a Bayesian method, 'BayesB'. Marker densities of 1, 2, 4 and 8 Ne markers/Morgan were evaluated when the effective population size (Ne was 100. The correlation between true breeding value and estimated breeding value increased with density from 0.611 to 0.681 and 0.604 to 0.658 using PLSR and PCR respectively, with an overall advantage to PLSR of 0.016 (s.e = 0.008. Both methods gave a lower accuracy compared to the 'BayesB', for which accuracy increased from 0.690 to 0.860. PLSR and PCR appeared less responsive to increased marker density with the advantage of 'BayesB' increasing by 17% from a marker density of 1 to 8Ne/M. PCR and PLSR showed greater bias than 'BayesB' in predicting breeding values at all densities. Although, the PLSR and PCR were computationally faster and simpler, these advantages do not outweigh the reduction in accuracy, and there is a benefit in obtaining relevant prior information from the distribution of gene effects.
Full Text Available Selenium is an essential trace element and circulating selenium concentrations have been associated with a wide range of diseases. Candidate gene studies suggest that circulating selenium concentrations may be impacted by genetic variation; however, no study has comprehensively investigated this hypothesis. Therefore, we conducted a two-stage genome-wide association study to identify genetic variants associated with serum selenium concentrations in 1203 European descents from two cohorts: the Prostate, Lung, Colorectal, and Ovarian (PLCO Cancer Screening and the Women’s Health Initiative (WHI. We tested association between 2,474,333 single nucleotide polymorphisms (SNPs and serum selenium concentrations using linear regression models. In the first stage (PLCO 41 SNPs clustered in 15 regions had p < 1 × 10−5. None of these 41 SNPs reached the significant threshold (p = 0.05/15 regions = 0.003 in the second stage (WHI. Three SNPs had p < 0.05 in the second stage (rs1395479 and rs1506807 in 4q34.3/AGA-NEIL3; and rs891684 in 17q24.3/SLC39A11 and had p between 2.62 × 10−7 and 4.04 × 10−7 in the combined analysis (PLCO + WHI. Additional studies are needed to replicate these findings. Identification of genetic variation that impacts selenium concentrations may contribute to a better understanding of which genes regulate circulating selenium concentrations.
Full Text Available Cytosine DNA methylation (CDM is a highly abundant, heritable but reversible chemical modification to the genome. Herein, a machine learning approach was applied to analyze the accumulation of epigenetic marks in methylomes of 152 ecotypes and 85 silencing mutants of Arabidopsis thaliana. In an information-thermodynamics framework, two measurements were used: (1 the amount of information gained/lost with the CDM changes I R and (2 the uncertainty of not observing a SNP L C R . We hypothesize that epigenetic marks are chromosomal footprints accounting for different ontogenetic and phylogenetic histories of individual populations. A machine learning approach is proposed to verify this hypothesis. Results support the hypothesis by the existence of discriminatory information (DI patterns of CDM able to discriminate between individuals and between individual subpopulations. The statistical analyses revealed a strong association between the topologies of the structured population of Arabidopsis ecotypes based on I R and on LCR, respectively. A statistical-physical relationship between I R and L C R was also found. Results to date imply that the genome-wide distribution of CDM changes is not only part of the biological signal created by the methylation regulatory machinery, but ensures the stability of the DNA molecule, preserving the integrity of the genetic message under continuous stress from thermal fluctuations in the cell environment.
Full Text Available In eukaryotes, 3′ untranslated regions (UTRs play important roles in regulating posttranscriptional gene expression. The 3′UTR is defined by regulated cleavage/polyadenylation of the pre-mRNA. The advent of next-generation sequencing technology has now enabled us to identify these events on a genome-wide scale. In this study, we used poly(A-position profiling by sequencing (3P-Seq to capture all poly(A sites across the genome of the freshwater planarian, Schmidtea mediterranea, an ideal model system for exploring the process of regeneration and stem cell function. We identified the 3′UTRs for ∼14,000 transcripts and thus improved the existing gene annotations. We found 97 transcripts, which are polyadenylated within an internal exon, resulting in the shrinking of the ORF and loss of a predicted protein domain. Around 40% of the transcripts in planaria were alternatively polyadenylated (ApA, resulting either in an altered 3′UTR or a change in coding sequence. We identified specific ApA transcript isoforms that were subjected to miRNA mediated gene regulation using degradome sequencing. In this study, we also confirmed a tissue-specific expression pattern for alternate polyadenylated transcripts. The insights from this study highlight the potential role of ApA in regulating the gene expression essential for planarian regeneration.
Full Text Available Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed.We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality and EDAR (associated with hair thickness were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9 were associated with pre-weaning gain in our previous genome-wide association study.Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding.
Wang, Huihua; Zhang, Li; Cao, Jiaxve; Wu, Mingming; Ma, Xiaomeng; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Wei, Caihong; Du, Lixin
Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed. We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness) were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study. Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding.
Full Text Available The amnion is a specialized tissue in contact with the amniotic fluid, which is in a constantly changing state. To investigate the importance of epigenetic events in this tissue in the physiology and pathophysiology of pregnancy, we performed genome-wide DNA methylation profiling of human amnion from term (with and without labor and preterm deliveries. Using the Illumina Infinium HumanMethylation27 BeadChip, we identified genes exhibiting differential methylation associated with normal labor and preterm birth. Functional analysis of the differentially methylated genes revealed biologically relevant enriched gene sets. Bisulfite sequencing analysis of the promoter region of the oxytocin receptor (OXTR gene detected two CpG dinucleotides showing significant methylation differences among the three groups of samples. Hypermethylation of the CpG island of the solute carrier family 30 member 3 (SLC30A3 gene in preterm amnion was confirmed by methylation-specific PCR. This work provides preliminary evidence that DNA methylation changes in the amnion may be at least partially involved in the physiological process of labor and the etiology of preterm birth and suggests that DNA methylation profiles, in combination with other biological data, may provide valuable insight into the mechanisms underlying normal and pathological pregnancies.
Lakshmanan, Vairavan; Bansal, Dhiru; Kulkarni, Jahnavi; Poduval, Deepak; Krishna, Srikar; Sasidharan, Vidyanand; Anand, Praveen; Seshasayee, Aswin; Palakodeti, Dasaradhi
In eukaryotes, 3' untranslated regions (UTRs) play important roles in regulating posttranscriptional gene expression. The 3'UTR is defined by regulated cleavage/polyadenylation of the pre-mRNA. The advent of next-generation sequencing technology has now enabled us to identify these events on a genome-wide scale. In this study, we used poly(A)-position profiling by sequencing (3P-Seq) to capture all poly(A) sites across the genome of the freshwater planarian, Schmidtea mediterranea, an ideal model system for exploring the process of regeneration and stem cell function. We identified the 3'UTRs for ∼14,000 transcripts and thus improved the existing gene annotations. We found 97 transcripts, which are polyadenylated within an internal exon, resulting in the shrinking of the ORF and loss of a predicted protein domain. Around 40% of the transcripts in planaria were alternatively polyadenylated (ApA), resulting either in an altered 3'UTR or a change in coding sequence. We identified specific ApA transcript isoforms that were subjected to miRNA mediated gene regulation using degradome sequencing. In this study, we also confirmed a tissue-specific expression pattern for alternate polyadenylated transcripts. The insights from this study highlight the potential role of ApA in regulating the gene expression essential for planarian regeneration. Copyright © 2016 Lakshmanan et al.
Xavier, Alencar; Muir, William M; Rainey, Katy Martin
Many economically important traits in plant breeding have low heritability or are difficult to measure. For these traits, genomic selection has attractive features and may boost genetic gains. Our goal was to evaluate alternative scenarios to implement genomic selection for yield components in soybean (Glycine max L. merr). We used a nested association panel with cross validation to evaluate the impacts of training population size, genotyping density, and prediction model on the accuracy of genomic prediction. Our results indicate that training population size was the factor most relevant to improvement in genome-wide prediction, with greatest improvement observed in training sets up to 2000 individuals. We discuss assumptions that influence the choice of the prediction model. Although alternative models had minor impacts on prediction accuracy, the most robust prediction model was the combination of reproducing kernel Hilbert space regression and BayesB. Higher genotyping density marginally improved accuracy. Our study finds that breeding programs seeking efficient genomic selection in soybeans would best allocate resources by investing in a representative training set. Copyright © 2016 Xavie et al.
Llinares-López, Felipe; Papaxanthos, Laetitia; Bodenham, Dean; Roqueiro, Damian; Borgwardt, Karsten
Genetic heterogeneity is the phenomenon that distinct genetic variants may give rise to the same phenotype. The recently introduced algorithm Fast Automatic Interval Search ( FAIS ) enables the genome-wide search of candidate regions for genetic heterogeneity in the form of any contiguous sequence of variants, and achieves high computational efficiency and statistical power. Although FAIS can test all possible genomic regions for association with a phenotype, a key limitation is its inability to correct for confounders such as gender or population structure, which may lead to numerous false-positive associations. We propose FastCMH , a method that overcomes this problem by properly accounting for categorical confounders, while still retaining statistical power and computational efficiency. Experiments comparing FastCMH with FAIS and multiple kinds of burden tests on simulated data, as well as on human and Arabidopsis samples, demonstrate that FastCMH can drastically reduce genomic inflation and discover associations that are missed by standard burden tests. An R package fastcmh is available on CRAN and the source code can be found at: https://www.bsse.ethz.ch/mlcb/research/bioinformatics-and-computational-biology/fastcmh.html. firstname.lastname@example.org. Supplementary data are available at Bioinformatics online.
Moorjani, Priya; Patterson, Nick; Loh, Po-Ru; Lipson, Mark; Kisfali, Péter; Melegh, Bela I; Bonin, Michael; Kádaši, Ludevít; Rieß, Olaf; Berger, Bonnie; Reich, David; Melegh, Béla
The Roma people, living throughout Europe and West Asia, are a diverse population linked by the Romani language and culture. Previous linguistic and genetic studies have suggested that the Roma migrated into Europe from South Asia about 1,000-1,500 years ago. Genetic inferences about Roma history have mostly focused on the Y chromosome and mitochondrial DNA. To explore what additional information can be learned from genome-wide data, we analyzed data from six Roma groups that we genotyped at hundreds of thousands of single nucleotide polymorphisms (SNPs). We estimate that the Roma harbor about 80% West Eurasian ancestry-derived from a combination of European and South Asian sources-and that the date of admixture of South Asian and European ancestry was about 850 years before present. We provide evidence for Eastern Europe being a major source of European ancestry, and North-west India being a major source of the South Asian ancestry in the Roma. By computing allele sharing as a measure of linkage disequilibrium, we estimate that the migration of Roma out of the Indian subcontinent was accompanied by a severe founder event, which appears to have been followed by a major demographic expansion after the arrival in Europe.
Ramachandran, Dhanya; Zeng, Zhen; Locke, Adam E; Mulle, Jennifer G; Bean, Lora J H; Rosser, Tracie C; Dooley, Kenneth J; Cua, Clifford L; Capone, George T; Reeves, Roger H; Maslen, Cheryl L; Cutler, David J; Feingold, Eleanor; Sherman, Stephanie L; Zwick, Michael E
The goal of this study was to identify the contribution of common genetic variants to Down syndrome-associated atrioventricular septal defect, a severe heart abnormality. Compared with the euploid population, infants with Down syndrome, or trisomy 21, have a 2000-fold increased risk of presenting with atrioventricular septal defects. The cause of this increased risk remains elusive. Here we present data from the largest heart study conducted to date on a trisomic background by using a carefully characterized collection of individuals from extreme ends of the phenotypic spectrum. We performed a genome-wide association study using logistic regression analysis on 452 individuals with Down syndrome, consisting of 210 cases with complete atrioventricular septal defects and 242 controls with structurally normal hearts. No individual variant achieved genome-wide significance. We identified four disomic regions (1p36.3, 5p15.31, 8q22.3, and 17q22) and two trisomic regions on chromosome 21 (around PDXK and KCNJ6 genes) that merit further investigation in large replication studies. Our data show that a few common genetic variants of large effect size (odds ratio >2.0) do not account for the elevated risk of Down syndrome-associated atrioventricular septal defects. Instead, multiple variants of low-to-moderate effect sizes may contribute to this elevated risk, highlighting the complex genetic architecture of atrioventricular septal defects even in the highly susceptible Down syndrome population. Copyright © 2015 Ramachandran et al.
Spycher, Ben D.; Henderson, John; Granell, Raquel; Evans, David M.; Smith, George Davey; Timpson, Nicholas J.; Sterne, Jonathan A. C.
Background Childhood wheezing and asthma vary greatly in clinical presentation and time course. The extent to which phenotypic variation reflects heterogeneity in disease pathways is unclear. Objective To assess the extent to which single nucleotide polymorphisms (SNPs) associated with childhood asthma in a genome-wide association study are predictive of asthma-related phenotypes. Methods In 8365 children from a population based birth cohort, the Avon Longitudinal Study of Parents and Children, allelic scores were derived based on between 10 and 215,443 SNPs ranked according to inverse of the p-value for their association with physician diagnosed asthma in an independent genome-wide association study (6176 cases and 7111 controls). We assessed the predictive value of allelic scores for asthma-related outcomes at age 7-9 years (physician’s diagnosis, longitudinal wheezing phenotypes, and measurements of pulmonary function, bronchial responsiveness and atopy). Results Scores based on the 46 highest-ranked SNPs were associated with the symptom-based phenotypes persistent (Pasthma (Patopy (Pasthma phenotypes. Conclusion The genetic origins of asthma are diverse and: some pathways are specific to wheezing syndromes while others are shared with atopy and bronchial hyper-responsiveness. Out study also provides evidence of aetiological differences among wheezing syndromes. PMID:22846752
Shimizu, S; Momozawa, Y; Takahashi, A; Nagasawa, T; Ashikawa, K; Terada, Y; Izumi, Y; Kobayashi, H; Tsuji, M; Kubo, M; Furuichi, Y
Periodontitis is a multifactorial disease in which bacterial, lifestyle, and genetic factors are involved. Although previous genetic association studies identified several susceptibility genes for periodontitis in European populations, there is little information for Asian populations. Here, we conducted a genome-wide association study and a replication study consisting of 2,760 Japanese periodontitis patients and 15,158 Japanese controls. Although single-nucleotide polymorphisms that surpassed a stringent genome-wide significance threshold (P periodontitis: KCNQ5 on chromosome 6q13 (rs9446777, P = 4.83 × 10(-6), odds ratio = 0.82) and GPR141-NME8 at chromosome 7p14.1 (rs2392510, P = 4.17 × 10(-6), odds ratio = 0.87). A stratified analysis indicated that the GPR141-NME8 locus had a strong genetic effect on the susceptibility to generalized periodontitis in Japanese individuals with a history of smoking. In conclusion, this study identified 2 suggestive loci for periodontitis in a Japanese population. This study should contribute to a further understanding of genetic factors for enhanced susceptibility to periodontitis. © International & American Associations for Dental Research 2015.
Mühleisen, Thomas W; Leber, Markus; Schulze, Thomas G; Strohmaier, Jana; Degenhardt, Franziska; Treutlein, Jens; Mattheisen, Manuel; Forstner, Andreas J; Schumacher, Johannes; Breuer, René; Meier, Sandra; Herms, Stefan; Hoffmann, Per; Lacour, André; Witt, Stephanie H; Reif, Andreas; Müller-Myhsok, Bertram; Lucae, Susanne; Maier, Wolfgang; Schwarz, Markus; Vedder, Helmut; Kammerer-Ciernioch, Jutta; Pfennig, Andrea; Bauer, Michael; Hautzinger, Martin; Moebus, Susanne; Priebe, Lutz; Czerski, Piotr M; Hauser, Joanna; Lissowska, Jolanta; Szeszenia-Dabrowska, Neonila; Brennan, Paul; McKay, James D; Wright, Adam; Mitchell, Philip B; Fullerton, Janice M; Schofield, Peter R; Montgomery, Grant W; Medland, Sarah E; Gordon, Scott D; Martin, Nicholas G; Krasnow, Valery; Chuchalin, Alexander; Babadjanova, Gulja; Pantelejeva, Galina; Abramova, Lilia I; Tiganov, Alexander S; Polonikov, Alexey; Khusnutdinova, Elza; Alda, Martin; Grof, Paul; Rouleau, Guy A; Turecki, Gustavo; Laprise, Catherine; Rivas, Fabio; Mayoral, Fermin; Kogevinas, Manolis; Grigoroiu-Serbanescu, Maria; Propping, Peter; Becker, Tim; Rietschel, Marcella; Nöthen, Markus M; Cichon, Sven
Bipolar disorder (BD) is a common and highly heritable mental illness and genome-wide association studies (GWAS) have robustly identified the first common genetic variants involved in disease aetiology. The data also provide strong evidence for the presence of multiple additional risk loci, each contributing a relatively small effect to BD susceptibility. Large samples are necessary to detect these risk loci. Here we present results from the largest BD GWAS to date by investigating 2.3 million single-nucleotide polymorphisms (SNPs) in a sample of 24,025 patients and controls. We detect 56 genome-wide significant SNPs in five chromosomal regions including previously reported risk loci ANK3, ODZ4 and TRANK1, as well as the risk locus ADCY2 (5p15.31) and a region between MIR2113 and POU3F2 (6q16.1). ADCY2 is a key enzyme in cAMP signalling and our finding provides new insights into the biological mechanisms involved in the development of BD.
Kathryn M Tsang
Full Text Available Effects of parental genotype or parent-offspring genetic interaction are well established in model organisms for a variety of traits. However, these transgenerational genetic models are rarely studied in humans. We have utilized an autism case-control study with 735 mother-child pairs to perform genome-wide screening for maternal genetic effects and maternal-offspring genetic interaction. We used simple models of single locus parent-child interaction and identified suggestive results (P<10(-4 that cannot be explained by main effects, but no genome-wide significant signals. Some of these maternal and maternal-child associations were in or adjacent to autism candidate genes including: PCDH9, FOXP1, GABRB3, NRXN1, RELN, MACROD2, FHIT, RORA, CNTN4, CNTNAP2, FAM135B, LAMA1, NFIA, NLGN4X, RAPGEF4, and SDK1. We attempted validation of potential autism association under maternal-specific models using maternal-paternal comparison in family-based GWAS datasets. Our results suggest that further study of parental genetic effects and parent-child interaction in autism is warranted.
Tsang, Kathryn M; Croen, Lisa A; Torres, Anthony R; Kharrazi, Martin; Delorenze, Gerald N; Windham, Gayle C; Yoshida, Cathleen K; Zerbo, Ousseny; Weiss, Lauren A
Effects of parental genotype or parent-offspring genetic interaction are well established in model organisms for a variety of traits. However, these transgenerational genetic models are rarely studied in humans. We have utilized an autism case-control study with 735 mother-child pairs to perform genome-wide screening for maternal genetic effects and maternal-offspring genetic interaction. We used simple models of single locus parent-child interaction and identified suggestive results (Peffects, but no genome-wide significant signals. Some of these maternal and maternal-child associations were in or adjacent to autism candidate genes including: PCDH9, FOXP1, GABRB3, NRXN1, RELN, MACROD2, FHIT, RORA, CNTN4, CNTNAP2, FAM135B, LAMA1, NFIA, NLGN4X, RAPGEF4, and SDK1. We attempted validation of potential autism association under maternal-specific models using maternal-paternal comparison in family-based GWAS datasets. Our results suggest that further study of parental genetic effects and parent-child interaction in autism is warranted.
Ishii, Atsushi; Yamaji, Keita; Uemoto, Yoshinobu; Sasago, Nanae; Kobayashi, Eiji; Kobayashi, Naohiko; Matsuhashi, Tamako; Maruyama, Shin; Matsumoto, Hirokazu; Sasazaki, Shinji; Mannen, Hideyuki
Fatty acid composition is one of the important traits in beef. The aim of this study was to identify candidate genomic regions for fatty acid composition by genome-wide association study with 50 K single nucleotide polymorphism (SNP) array in Japanese Black cattle. A total of 461 individuals and 40 657 SNPs were used in this study. We applied genome-wide rapid association using mixed model and regression (GRAMMAR) and genomic control approaches to estimate the associations between genotypes and fatty acid composition. In addition, two SNPs in fatty acid synthase (FASN) (T1952A) and stearoyl-CoA desaturase (SCD) (V293A) genes were also genotyped. Association analysis revealed that 30 significant SNPs for several fatty acids (C14:0, C14:1, C16:1 and C18:1) were located in the BTA19 FASN gene located within this region but the FASN mutation had no significant effect on any traits. We also detected one significant SNP for C18:1 on BTA23 and two SNPs for C16:0 on BTA25. The region around 17 Mb on BTA26 harbored two significant SNPs for C14:1 and SNP in SCD in this region showed the strongest association with C14:1. This study demonstrated novel candidate regions in BTA19, 23 and 25 for fatty acid composition. © 2013 Japanese Society of Animal Science.
Wang, Kai; Zhang, Haitao; Kugathasan, Subra; Annese, Vito; Bradfield, Jonathan P.; Russell, Richard K.; Sleiman, Patrick M.A.; Imielinski, Marcin; Glessner, Joseph; Hou, Cuiping; Wilson, David C.; Walters, Thomas; Kim, Cecilia; Frackelton, Edward C.; Lionetti, Paolo; Barabino, Arrigo; Van Limbergen, Johan; Guthery, Stephen; Denson, Lee; Piccoli, David; Li, Mingyao; Dubinsky, Marla; Silverberg, Mark; Griffiths, Anne; Grant, Struan F.A.; Satsangi, Jack; Baldassano, Robert; Hakonarson, Hakon
Previous genome-wide association (GWA) studies typically focus on single-locus analysis, which may not have the power to detect the majority of genuinely associated loci. Here, we applied pathway analysis using Affymetrix SNP genotype data from the Wellcome Trust Case Control Consortium (WTCCC) and uncovered significant association between Crohn Disease (CD) and the IL12/IL23 pathway, harboring 20 genes (p = 8 × 10−5). Interestingly, the pathway contains multiple genes (IL12B and JAK2) or homologs of genes (STAT3 and CCR6) that were recently identified as genuine susceptibility genes only through meta-analysis of several GWA studies. In addition, the pathway contains other susceptibility genes for CD, including IL18R1, JUN, IL12RB1, and TYK2, which do not reach genome-wide significance by single-marker association tests. The observed pathway-specific association signal was subsequently replicated in three additional GWA studies of European and African American ancestry generated on the Illumina HumanHap550 platform. Our study suggests that examination beyond individual SNP hits, by focusing on genetic networks and pathways, is important to unleashing the true power of GWA studies. PMID:19249008
Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue
Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884
Down Thomas A
Full Text Available Abstract Background DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS" but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq not to be biological transcription factor binding sites ("empirical TFBS". We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs as well as the subset of those that reside outside CpG islands have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally we further identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs implicating that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or whose binding is not regulated by DNA methylation. Conclusions Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation.
Full Text Available Abstract Background A recent genome wide association study in 1017 African Americans identified several single nucleotide polymorphisms that reached genome-wide significance for systolic blood pressure. We attempted to replicate these findings in an independent sample of 2474 unrelated African Americans in the Milwaukee metropolitan area; 53% were women and 47% were hypertensives. Methods We evaluated sixteen top associated SNPs from the above genome wide association study for hypertension as a binary trait or blood pressure as a continuous trait. In addition, we evaluated eight single nucleotide polymorphisms located in two genes (STK-39 and CDH-13 found to be associated with systolic and diastolic blood pressures by other genome wide association studies in European and Amish populations. TaqMan MGB-based chemistry with fluorescent probes was used for genotyping. We had an adequate sample size (80% power to detect an effect size of 1.2-2.0 for all the single nucleotide polymorphisms for hypertension as a binary trait, and 1% variance in blood pressure as a continuous trait. Quantitative trait analyses were performed both by excluding and also by including subjects on anti-hypertensive therapy (after adjustments were made for anti-hypertensive medications. Results For all 24 SNPs, no statistically significant differences were noted in the minor allele frequencies between cases and controls. One SNP (rs2146204 showed borderline association (p = 0.006 with hypertension status using recessive model and systolic blood pressure (p = 0.02, but was not significant after adjusting for multiple comparisons. In quantitative trait analyses, among normotensives only, rs12748299 was associated with SBP (p = 0.002. In addition, several nominally significant associations were noted with SBP and DBP among normotensives but none were statistically significant. Conclusions This study highlights the importance of replication to confirm the validity of genome wide
Katharine J Sepp
Full Text Available While genetic screens have identified many genes essential for neurite outgrowth, they have been limited in their ability to identify neural genes that also have earlier critical roles in the gastrula, or neural genes for which maternally contributed RNA compensates for gene mutations in the zygote. To address this, we developed methods to screen the Drosophila genome using RNA-interference (RNAi on primary neural cells and present the results of the first full-genome RNAi screen in neurons. We used live-cell imaging and quantitative image analysis to characterize the morphological phenotypes of fluorescently labelled primary neurons and glia in response to RNAi-mediated gene knockdown. From the full genome screen, we focused our analysis on 104 evolutionarily conserved genes that when downregulated by RNAi, have morphological defects such as reduced axon extension, excessive branching, loss of fasciculation, and blebbing. To assist in the phenotypic analysis of the large data sets, we generated image analysis algorithms that could assess the statistical significance of the mutant phenotypes. The algorithms were essential for the analysis of the thousands of images generated by the screening process and will become a valuable tool for future genome-wide screens in primary neurons. Our analysis revealed unexpected, essential roles in neurite outgrowth for genes representing a wide range of functional categories including signalling molecules, enzymes, channels, receptors, and cytoskeletal proteins. We also found that genes known to be involved in protein and vesicle trafficking showed similar RNAi phenotypes. We confirmed phenotypes of the protein trafficking genes Sec61alpha and Ran GTPase using Drosophila embryo and mouse embryonic cerebral cortical neurons, respectively. Collectively, our results showed that RNAi phenotypes in primary neural culture can parallel in vivo phenotypes, and the screening technique can be used to identify many new
Barbara E Stranger
Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.
Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.
Christopoulou, Marilena; Wo, Sebastian Reyes-Chin; Kozik, Alex; McHale, Leah K; Truco, Maria-Jose; Wroblewski, Tadeusz; Michelmore, Richard W
Genome-wide motif searches identified 1134 genes in the lettuce reference genome of cv. Salinas that are potentially involved in pathogen recognition, of which 385 were predicted to encode nucleotide binding-leucine rich repeat receptor (NLR) proteins. Using a maximum-likelihood approach, we grouped the NLRs into 25 multigene families and 17 singletons. Forty-one percent of these NLR-encoding genes belong to three families, the largest being RGC16 with 62 genes in cv. Salinas. The majority of NLR-encoding genes are located in five major resistance clusters (MRCs) on chromosomes 1, 2, 3, 4, and 8 and cosegregate with multiple disease resistance phenotypes. Most MRCs contain primarily members of a single NLR gene family but a few are more complex. MRC2 spans 73 Mb and contains 61 NLRs of six different gene families that cosegregate with nine disease resistance phenotypes. MRC3, which is 25 Mb, contains 22 RGC21 genes and colocates with Dm13. A library of 33 transgenic RNA interference tester stocks was generated for functional analysis of NLR-encoding genes that cosegregated with disease resistance phenotypes in each of the MRCs. Members of four NLR-encoding families, RGC1, RGC2, RGC21, and RGC12 were shown to be required for 16 disease resistance phenotypes in lettuce. The general composition of MRCs is conserved across different genotypes; however, the specific repertoire of NLR-encoding genes varied particularly of the rapidly evolving Type I genes. These tester stocks are valuable resources for future analyses of additional resistance phenotypes. Copyright © 2015 Christopoulou et al.
Background The Hepatitis B Virus (HBV) HBx regulatory protein is required for HBV replication and involved in HBV-related carcinogenesis. HBx interacts with chromatin modifying enzymes and transcription factors to modulate histone post-translational modifications and to regulate viral cccDNA transcription and cellular gene expression. Aiming to identify genes and non-coding RNAs (ncRNAs) directly targeted by HBx, we performed a chromatin immunoprecipitation sequencing (ChIP-Seq) to analyse HBV recruitment on host cell chromatin in cells replicating HBV. Results ChIP-Seq high throughput sequencing of HBx-bound fragments was used to obtain a high-resolution, unbiased, mapping of HBx binding sites across the genome in HBV replicating cells. Protein-coding genes and ncRNAs involved in cell metabolism, chromatin dynamics and cancer were enriched among HBx targets together with genes/ncRNAs known to modulate HBV replication. The direct transcriptional activation of genes/miRNAs that potentiate endocytosis (Ras-related in brain (RAB) GTPase family) and autophagy (autophagy related (ATG) genes, beclin-1, miR-33a) and the transcriptional repression of microRNAs (miR-138, miR-224, miR-576, miR-596) that directly target the HBV pgRNA and would inhibit HBV replication, contribute to HBx-mediated increase of HBV replication. Conclusions Our ChIP-Seq analysis of HBx genome wide chromatin recruitment defined the repertoire of genes and ncRNAs directly targeted by HBx and led to the identification of new mechanisms by which HBx positively regulates cccDNA transcription and HBV replication.
Darcy, Diana; Atwal, Paldeep Singh; Angell, Cathy; Gadi, Inder; Wallerstein, Robert
We report on a 6-month-old girl with two apparent cell lines; one with trisomy 21, and the other with paternal genome-wide uniparental isodisomy (GWUPiD), identified using single nucleotide polymorphism (SNP) based microarray and microsatellite analysis of polymorphic loci. The patient has Beckwith-Wiedemann syndrome (BWS) due to paternal uniparental disomy (UPD) at chromosome location 11p15 (UPD 11p15), which was confirmed through methylation analysis. Hyperinsulinemic hypoglycemia is present, which is associated with paternal UPD 11p15.5; and she likely has medullary nephrocalcinosis, which is associated with paternal UPD 20, although this was not biochemically confirmed. Angelman syndrome (AS) analysis was negative but this testing is not completely informative; she has no specific features of AS. Clinical features of this patient include: dysmorphic features consistent with trisomy 21, tetralogy of Fallot, hemihypertrophy, swirled skin hyperpigmentation, hepatoblastoma, and Wilms tumor. Her karyotype is 47,XX,+21/46,XX, and microarray results suggest that the cell line with trisomy 21 is biparentally inherited and represents 40-50% of the genomic material in the tested specimen. The difference in the level of cytogenetically detected mosaicism versus the level of mosaicism observed via microarray analysis is likely caused by differences in the test methodologies. While a handful of cases of mosaic paternal GWUPiD have been reported, this patient is the only reported case that also involves trisomy 21. Other GWUPiD patients have presented with features associated with multiple imprinted regions, as does our patient. © 2015 Wiley Periodicals, Inc.
Shen, Xun; Collier, John Michael; Hlaing, Myint; Zhang, Leanne; Delshad, Elizabeth H.; Bristow, James; Bernstein, Harold S.
Skeletal and cardiac myocytes cease division within weeks of birth. Although skeletal muscle retains limited capacity for regeneration through recruitment of satellite cells, resident populations of adult myocardial stem cells have not been identified. Because cell cycle withdrawal accompanies myocyte differentiation, we hypothesized that C2C12 cells, a mouse myoblast cell line previously used to characterize myocyte differentiation, also would provide a model for studying cell cycle withdrawal during differentiation. C2C12 cells were differentiated in culture medium containing horse serum and harvested at various time points to characterize the expression profiles of known cell cycle and myogenic regulatory factors by immunoblot analysis. BrdU incorporation decreased dramatically in confluent cultures 48 hr after addition of horse serum, as cells started to form myotubes. This finding was preceded by up-regulation of MyoD, followed by myogenin, and activation of Bcl-2. Cyclin D1 was expressed in proliferating cultures and became undetectable in cultures containing 40 percent fused myotubes, as levels of p21(WAF1/Cip1) increased and alpha-actin became detectable. Because C2C12 myoblasts withdraw from the cell cycle during myocyte differentiation following a course that recapitulates this process in vivo, we performed a genome-wide screen to identify other gene products involved in this process. Using microarrays containing approximately 10,000 minimally redundant mouse sequences that map to the UniGene database of the National Center for Biotechnology Information, we compared gene expression profiles between proliferating, differentiating, and differentiated C2C12 cells and verified candidate genes demonstrating differential expression by RT-PCR. Cluster analysis of differentially expressed genes revealed groups of gene products involved in cell cycle withdrawal, muscle differentiation, and apoptosis. In addition, we identified several genes, including DDAH2 and Ly
Bartels, Meike; Saviouk, Viatcheslav; de Moor, Marleen H M; Willemsen, Gonneke; van Beijsterveldt, Toos C E M; Hottenga, Jouke-Jan; de Geus, Eco J C; Boomsma, Dorret I
Causes of individual differences in happiness, as assessed with the Subjective Happiness Scale, are investigated in a large of sample twins and siblings from the Netherlands Twin Register. Over 12,000 twins and siblings, average age 24.7 years (range 12 to 88), took part in the study. A genetic model with an age by sex design was fitted to the data with structural equation modeling in Mx. The heritability of happiness was estimated at 22% for males and 41% in females. No effect of age was observed. To identify the genomic regions contributing to this heritability, a genome-wide linkage study for happiness was conducted in sibling pairs. A subsample of 1157 offspring from 441 families was genotyped with an average of 371 micro-satellite markers per individual. Phenotype and genotype data were analyzed in MERLIN with multipoint variance component linkage analysis and age and sex as covariates. A linkage signal (logarithm of odds score 2.73, empirical p value 0.095) was obtained at the end of the long arm of chromosome 19 for marker D19S254 at 110 cM. A second suggestive linkage peak was found at the short arm of chromosome 1 (LOD of 2.37) at 153 cM, marker D1S534 (empirical p value of .209). These two regions of interest are not overlapping with the regions found for contrasting phenotypes (such as depression, which is negatively associated with happiness). Further linkage and future association studies are warranted.
Ibragimova, Ilsiya; Slifker, Michael J.; Maradeo, Marie E.; Banumathy, Gowrishankar; Dulaimi, Essel; Uzzo, Robert G.; Cairns, Paul
The majority of renal cell carcinoma (RCC) is now incidentally detected and presents as small renal masses (SRMs) defined as ≤4 cm in size. SRMs are heterogeneous comprising several histological types of RCC each with different biology and behavior, and benign tumors mainly oncocytoma. The varied prognosis of the different types of renal tumor has implications for management options. A key epigenetic alteration involved in the initiation and progression of cancer is aberrant methylation in the promoter region of a gene. The hypermethylation is associated with transcriptional repression and is an important mechanism of inactivation of tumor suppressor genes in neoplastic cells. We have determined the genome-wide promoter methylation profiles of 47 pT1a and 2 pT1b clear cell, papillary or chromophobe RCC, 25 benign renal oncocytoma ≤4 cm and 4 normal renal parenchyma specimens by Infinium HumanMethylation27 beadchip technology. We identify gene promoter hypermethylation signatures that distinguish clear cell and papillary from each other, from chromophobe and oncocytoma, and from normal renal cells. Pairwise comparisons revealed genes aberrantly hypermethylated in a tumor type but unmethylated in normal, and often unmethylated in the other renal tumor types. About 0.4% to 1.7% of genes comprised the promoter methylome in SRMs. The Infinium methylation score for representative genes was verified by gold standard technologies. The genes identified as differentially methylated implicate pathways involved in metabolism, tissue response to injury, epithelial to mesenchymal transition (EMT), signal transduction and G-protein coupled receptors (GPCRs), cancer, and stem cell regulation in the biology of RCC. Our findings contribute towards an improved understanding of the development of RCC, the different biology and behavior of histological types, and discovery of molecular subtypes. The differential methylation signatures may have utility in early detection and
Full Text Available The majority of renal cell carcinoma (RCC is now incidentally detected and presents as small renal masses (SRMs defined as ≤ 4 cm in size. SRMs are heterogeneous comprising several histological types of RCC each with different biology and behavior, and benign tumors mainly oncocytoma. The varied prognosis of the different types of renal tumor has implications for management options. A key epigenetic alteration involved in the initiation and progression of cancer is aberrant methylation in the promoter region of a gene. The hypermethylation is associated with transcriptional repression and is an important mechanism of inactivation of tumor suppressor genes in neoplastic cells. We have determined the genome-wide promoter methylation profiles of 47 pT1a and 2 pT1b clear cell, papillary or chromophobe RCC, 25 benign renal oncocytoma ≤ 4 cm and 4 normal renal parenchyma specimens by Infinium HumanMethylation27 beadchip technology. We identify gene promoter hypermethylation signatures that distinguish clear cell and papillary from each other, from chromophobe and oncocytoma, and from normal renal cells. Pairwise comparisons revealed genes aberrantly hypermethylated in a tumor type but unmethylated in normal, and often unmethylated in the other renal tumor types. About 0.4% to 1.7% of genes comprised the promoter methylome in SRMs. The Infinium methylation score for representative genes was verified by gold standard technologies. The genes identified as differentially methylated implicate pathways involved in metabolism, tissue response to injury, epithelial to mesenchymal transition (EMT, signal transduction and G-protein coupled receptors (GPCRs, cancer, and stem cell regulation in the biology of RCC. Our findings contribute towards an improved understanding of the development of RCC, the different biology and behavior of histological types, and discovery of molecular subtypes. The differential methylation signatures may have utility in early
Wollstein, Andreas; Lao, Oscar; Becker, Christian; Brauer, Silke; Trent, Ronald J; Nürnberg, Peter; Stoneking, Mark; Kayser, Manfred
The human history of Oceania comprises two extremes: the initial colonizations of Near Oceania, one of the oldest out-of-Africa migrations, and of Remote Oceania, the most recent expansion into unoccupied territories. Genetic studies, mostly using uniparentally inherited DNA, have shed some light on human origins in Oceania, particularly indicating that Polynesians are of mixed East Asian and Near Oceanian ancestry. Here, we use ∼1 million single nucleotide polymorphisms (SNPs) to investigate the demographic history of Oceania in a more detailed manner. We developed a new approach to account for SNP ascertainment bias, used approximate Bayesian computation simulations to choose the best-fitting model of population history, and estimated demographic parameters. We find that the ancestors of Near Oceanians diverged from ancestral Eurasians ∼27 thousand years ago (kya), suggesting separate initial occupations of both territories. The genetic admixture in Polynesian history between East Asians (∼87%) and Near Oceanians (∼13%) occurred ∼3 kya, prior to the colonization of Polynesia. Fijians are of Polynesian (∼65%) and additional Near Oceanian (∼35%) ancestry not found in Polynesians, with this admixture occurring considerably after the initial settlement of Remote Oceania. Our data support a greater contribution of East Asian women than men in the admixture history of Remote Oceania and highlight population substructure in Polynesia and New Guinea. Despite the inherent ascertainment bias, genome-wide SNP data provide new insights into the genetic history of Oceana. Our approach to correct for ascertainment bias and obtain reliable inferences concerning demographic history should prove useful in other such studies. Copyright © 2010 Elsevier Ltd. All rights reserved.
Akkelies E Dijkstra
Full Text Available Chronic mucus hypersecretion (CMH is associated with an increased frequency of respiratory infections, excess lung function decline, and increased hospitalisation and mortality rates in the general population. It is associated with smoking, but it is unknown why only a minority of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA study of CMH in Caucasian populations.GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years. Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism (SNP.A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10(-6, OR = 1.17, located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1 on chromosome 3. The risk allele (G was associated with higher mRNA expression of SATB1 (4.3×10(-9 in lung tissue. Presence of CMH was associated with increased SATB1 mRNA expression in bronchial biopsies from COPD patients. SATB1 expression was induced during differentiation of primary human bronchial epithelial cells in culture.Our findings, that SNP rs6577641 is associated with CMH in multiple cohorts and is a cis-eQTL for SATB1, together with our additional observation that SATB1 expression increases during epithelial differentiation provide suggestive evidence that SATB1 is a gene that affects CMH.
Parker, Joe; Tsagkogeorga, Georgia; Cotton, James A; Liu, Yuan; Provero, Paolo; Stupka, Elia; Rossiter, Stephen J
Evolution is typically thought to proceed through divergence of genes, proteins and ultimately phenotypes. However, similar traits might also evolve convergently in unrelated taxa owing to similar selection pressures. Adaptive phenotypic convergence is widespread in nature, and recent results from several genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution, although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show that convergence is not a rare process restricted to several loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four newly sequenced bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the bottlenose dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Unexpectedly, we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognized.
Full Text Available Dramatic improvements in high throughput sequencing technologies have led to a staggering growth in the number of predicted genes. However, a large fraction of these newly discovered genes do not have a functional assignment. Fortunately, a variety of novel high-throughput genome-wide functional screening technologies provide important clues that shed light on gene function. The integration of heterogeneous data to predict protein function has been shown to improve the accuracy of automated gene annotation systems. In this paper, we propose and evaluate a probabilistic approach for protein function prediction that integrates protein-protein interaction (PPI data, gene expression data, protein motif information, mutant phenotype data, and protein localization data. First, functional linkage graphs are constructed from PPI data and gene expression data, in which an edge between nodes (proteins represents evidence for functional similarity. The assumption here is that graph neighbors are more likely to share protein function, compared to proteins that are not neighbors. The functional linkage graph model is then used in concert with protein domain, mutant phenotype and protein localization data to produce a functional prediction. Our method is applied to the functional prediction of Saccharomyces cerevisiae genes, using Gene Ontology (GO terms as the basis of our annotation. In a cross validation study we show that the integrated model increases recall by 18%, compared to using PPI data alone at the 50% precision. We also show that the integrated predictor is significantly better than each individual predictor. However, the observed improvement vs. PPI depends on both the new source of data and the functional category to be predicted. Surprisingly, in some contexts integration hurts overall prediction accuracy. Lastly, we provide a comprehensive assignment of putative GO terms to 463 proteins that currently have no assigned function.
Bierut, Laura Jean; Madden, Pamela A.F; Breslau, Naomi; Johnson, Eric O; Hatsukami, Dorothy; Pomerleau, Ovide F; Swan, Gary E; Rutter, Joni; Bertelsen, Sarah; Fox, Louis; Fugman, Douglas; Goate, Alison M; Hinrichs, Anthony L; Konvicka, Karel; Martin, Nicholas G; Montgomery, Grant W; Saccone, Nancy L; Saccone, Scott F; Wang, Jen C; Chase, Gary A; Rice, John P; Ballinger, Dennis G
.... To identify novel genes for which natural variation contributes to the development of nicotine dependence, we performed a comprehensive genome wide association study using nicotine dependent smokers...
al. (2007) “Efficient mapping of mendelian traits in dogs through genome-wide association.” Nat Genet 39:1321-1328. 12 Distribution A...collected data to genetically map superior intelligence in the military working dog. A behavioral testing regimen was developed by canine cognitive expert Dr...TERMS Military working dog genome-wide association study genetic marker intelligence 16
Eeles, Rosalind A.; Kote-Jarai, Zsofia; Olama, Ali Amin Al; Giles, Graham G.; Guy, Michelle; Severi, Gianluca; Muir, Kenneth; Hopper, John L.; Henderson, Brian E.; Haiman, Christopher A.; Schleutker, Johanna; Hamdy, Freddie C.; Neal, David E.; Donovan, Jenny L.; Stanford, Janet L.; Ostrander, Elaine A.; Ingles, Sue A.; John, Esther M.; Thibodeau, Stephen N.; Schaid, Daniel; Park, Jong Y.; Spurdle, Amanda; Clements, Judith; Dickinson, Joanne L.; Maier, Christiane; Vogel, Walther; Dörk, Thilo; Rebbeck, Timothy R.; Cooney, Kathleen A.; Cannon-Albright, Lisa; Chappuis, Pierre O.; Hutter, Pierre; Zeegers, Maurice; Kaneva, Radka; Zhang, Hong-Wei; Lu, Yong-Jie; Foulkes, William D.; English, Dallas R.; Leongamornlert, Daniel A.; Tymrakiewicz, Malgorzata; Morrison, Jonathan; Ardern-Jones, Audrey T.; Hall, Amanda L.; O’Brien, Lynne T.; Wilkinson, Rosemary A.; Saunders, Edward J.; Page, Elizabeth C.; Sawyer, Emma J.; Edwards, Stephen M.; Dearnaley, David P.; Horwich, Alan; Huddart, Robert A.; Khoo, Vincent S.; Parker, Christopher C.; Van As, Nicholas; Woodhouse, Christopher J.; Thompson, Alan; Christmas, Tim; Ogden, Chris; Cooper, Colin S.; Southey, Melissa C.; Lophatananon, Artitaya; Liu, Jo-Fen; Kolonel, Laurence N.; Le Marchand, Loic; Wahlfors, Tiina; Tammela, Teuvo L.; Auvinen, Anssi; Lewis, Sarah J.; Cox, Angela; FitzGerald, Liesel M.; Koopmeiners, Joseph S.; Karyadi, Danielle M.; Kwon, Erika M.; Stern, Mariana C.; Corral, Roman; Joshi, Amit D.; Shahabi, Ahva; McDonnell, Shannon K.; Sellers, Thomas A; Pow-Sang, Julio; Chambers, Suzanne; Aitken, Joanne; Gardiner, R.A. (Frank); Batra, Jyotsna; Kedda, Mary Anne; Lose, Felicity; Polanowski, Andrea; Patterson, Briony; Serth, Jürgen; Meyer, Andreas; Luedeke, Manuel; Stefflova, Klara; Ray, Anna M.; Lange, Ethan M.; Farnham, Jim; Khan, Humera; Slavov, Chavdar; Mitkova, Atanaska; Cao, Guangwen; Easton, Douglas F.
Prostate cancer (PrCa) is the most frequently diagnosed male cancer in developed countries. To identify common PrCa susceptibility alleles, we have previously conducted a genome-wide association study in which 541, 129 SNPs were genotyped in 1,854 PrCa cases with clinically detected disease and 1,894 controls. We have now evaluated promising associations in a second stage, in which we genotyped 43,671 SNPs in 3,650 PrCa cases and 3,940 controls, and a third stage, involving an additional 16,229 cases and 14,821 controls from 21 studies. In addition to previously identified loci, we identified a further seven new prostate cancer susceptibility loci on chromosomes 2, 4, 8, 11, and 22 (P=1.6×10−8 to P=2.7×10−33). PMID:19767753
Lu, Yi; Vitart, Veronique; Burdon, Kathryn P; Khor, Chiea Chuen; Bykhovskaya, Yelena; Mirshahi, Alireza; Hewitt, Alex W; Koehn, Demelza; Hysi, Pirro G; Ramdas, Wishal D; Zeller, Tanja; Vithana, Eranga N; Cornes, Belinda K; Tay, Wan-Ting; Tai, E Shyong; Cheng, Ching-Yu; Liu, Jianjun; Foo, Jia-Nee; Saw, Seang Mei; Thorleifsson, Gudmar; Stefansson, Kari; Dimasi, David P; Mills, Richard A; Mountain, Jenny; Ang, Wei; Hoehn, René; Verhoeven, Virginie J M; Grus, Franz; Wolfs, Roger; Castagne, Raphaële; Lackner, Karl J; Springelkamp, Henriët; Yang, Jian; Jonasson, Fridbert; Leung, Dexter Y L; Chen, Li J; Tham, Clement C Y; Rudan, Igor; Vatavuk, Zoran; Hayward, Caroline; Gibson, Jane; Cree, Angela J; MacLeod, Alex; Ennis, Sarah; Polasek, Ozren; Campbell, Harry; Wilson, James F; Viswanathan, Ananth C; Fleck, Brian; Li, Xiaohui; Siscovick, David; Taylor, Kent D; Rotter, Jerome I; Yazar, Seyhan; Ulmer, Megan; Li, Jun; Yaspan, Brian L; Ozel, Ayse B; Richards, Julia E; Moroi, Sayoko E; Haines, Jonathan L; Kang, Jae H; Pasquale, Louis R; Allingham, R Rand; Ashley-Koch, Allison; Mitchell, Paul; Wang, Jie Jin; Wright, Alan F; Pennell, Craig; Spector, Timothy D; Young, Terri L; Klaver, Caroline C W; Martin, Nicholas G; Montgomery, Grant W; Anderson, Michael G; Aung, Tin; Willoughby, Colin E; Wiggs, Janey L; Pang, Chi P; Thorsteinsdottir, Unnur; Lotery, Andrew J; Hammond, Christopher J; van Duijn, Cornelia M; Hauser, Michael A; Rabinowitz, Yaron S; Pfeiffer, Norbert; Mackey, David A; Craig, Jamie E; Macgregor, Stuart; Wong, Tien Y
Central corneal thickness (CCT) is associated with eye conditions including keratoconus and glaucoma. We performed a meta-analysis on >20,000 individuals in European and Asian populations that identified 16 new loci associated with CCT at genome-wide significance (P keratoconus in 2 cohorts with 874 cases and 6,085 controls (rs2721051 near FOXO1 had odds ratio (OR) = 1.62, 95% confidence interval (CI) = 1.4-1.88, P = 2.7 × 10(-10), and rs4894535 in FNDC3B had OR = 1.47, 95% CI = 1.29-1.68, P = 4.9 × 10(-9)). FNDC3B was also associated with primary open-angle glaucoma (P = 5.6 × 10(-4); tested in 3 cohorts with 2,979 cases and 7,399 controls). Further analyses implicate the collagen and extracellular matrix pathways in the regulation of CCT.
Full Text Available Hutchinson–Gilford progeria syndrome (HGPS patients do not develop cancer despite a significant accumulation of DNA damage in their cells. We have recently reported that HGPS cells are refractory to experimental oncogenic transformation and we identified the bromodomain-containing 4 protein (BRD4 as a mediator of the transformation resistance. ChIP-sequencing experiments revealed distinct genome-wide binding patterns for BRD4 in HGPS cells when compared to control wild type cells. Here we provide a detailed description of the ChIP-seq dataset (NCBI GEO accession number GSE61325, the specific and common BRD4 binding sites between HGPS and control cells, and the data analysis procedure associated with the publication by Fernandez et al., 2014 in Cell Reports 9, 248-260 .
Full Text Available The aim of this study was to identify the evidence of recent selection based on estimation of the integrated Haplotype Score (iHS, population differentiation index (FST and characterize affected regions near QTL associated with traits under strong selection in Pinzgau cattle. In total 21 Austrian and 19 Slovak purebreed bulls genotyped with Illumina bovineHD and bovineSNP50 BeadChip were used to identify genomic regions under selection. Only autosomal loci with call rate higher than 90%, minor allele frequency higher than 0.01 and Hardy-Weinberg equlibrium limit of 0.001 were included in the subsequent analyses of selection sweeps presence. The final dataset was consisted from 30538 SNPs with 81.86 kb average adjacent SNPs spacing. The iHS score were averaged into non-overlapping 500 kb segments across the genome. The FST values were also plotted against genome position based on sliding windows approach and averaged over 8 consecutive SNPs. Based on integrated Haplotype Score evaluation only 7 regions with iHS score higher than 1.7 was found. The average iHS score observed for each adjacent syntenic regions indicated slight effect of recent selection in analysed group of Pinzgau bulls. The level of genetic differentiation between Austrian and Slovak bulls estimated based on FST index was low. Only 24% of FST values calculated for each SNP was greather than 0.01. By using sliding windows approach was found that 5% of analysed windows had higher value than 0.01. Our results indicated use of similar selection scheme in breeding programs of Slovak and Austrian Pinzgau bulls. The evidence for genome-wide association between signatures of selection and regions affecting complex traits such as milk production was insignificant, because the loci in segments identified as affected by selection were very distant from each other. Identification of genomic regions that may be under pressure of selection for phenotypic traits to better understanding of the
Full Text Available Abstract Background Genome-wide computational analysis of alternative splicing (AS in several flowering plants has revealed that pre-mRNAs from about 30% of genes undergo AS. Chlamydomonas, a simple unicellular green alga, is part of the lineage that includes land plants. However, it diverged from land plants about one billion years ago. Hence, it serves as a good model system to study alternative splicing in early photosynthetic eukaryotes, to obtain insights into the evolution of this process in plants, and to compare splicing in simple unicellular photosynthetic and non-photosynthetic eukaryotes. We performed a global analysis of alternative splicing in Chlamydomonas reinhardtii using its recently completed genome sequence and all available ESTs and cDNAs. Results Our analysis of AS using BLAT and a modified version of the Sircah tool revealed AS of 498 transcriptional units with 611 events, representing about 3% of the total number of genes. As in land plants, intron retention is the most prevalent form of AS. Retained introns and skipped exons tend to be shorter than their counterparts in constitutively spliced genes. The splice site signals in all types of AS events are weaker than those in constitutively spliced genes. Furthermore, in alternatively spliced genes, the prevalent splice form has a stronger splice site signal than the non-prevalent form. Analysis of constitutively spliced introns revealed an over-abundance of motifs with simple repetitive elements in comparison to introns involved in intron retention. In almost all cases, AS results in a truncated ORF, leading to a coding sequence that is around 50% shorter than the prevalent splice form. Using RT-PCR we verified AS of two genes and show that they produce more isoforms than indicated by EST data. All cDNA/EST alignments and splice graphs are provided in a website at http://combi.cs.colostate.edu/as/chlamy. Conclusions The extent of AS in Chlamydomonas that we observed is much
Full Text Available Abstract Background Somatic Copy Number Alterations (CNAs in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC, a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1 exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2 performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3 iteratively detecting Significant Copy Number Aberrations (SCAs and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. Results We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma. When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC or tumor suppressor genes (e.g., CDKN2A/B. Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. Conclusions Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes
Full Text Available Abstract Background SNP genotyping arrays have been developed to characterize single-nucleotide polymorphisms (SNPs and DNA copy number variations (CNVs. Nonparametric and model-based statistical algorithms have been developed to detect CNVs from SNP data using the marker intensities. However, these algorithms lack specificity to detect small CNVs owing to the high false positive rate when calling CNVs based on the intensity values. Therefore, the resulting association tests lack power even if the CNVs affecting disease risk are common. An alternative procedure called PennCNV uses information from both the marker intensities as well as the genotypes and therefore has increased sensitivity. Results By using the hidden Markov model (HMM implemented in PennCNV to derive the probabilities of different copy number states which we subsequently used in a logistic regression model, we developed a new genome-wide algorithm to detect CNV associations with diseases. We compared this new method with association test applied to the most probable copy number state for each individual that is provided by PennCNV after it performs an initial HMM analysis followed by application of the Viterbi algorithm, which removes information about copy number probabilities. In one of our simulation studies, we showed that for large CNVs (number of SNPs ≥ 10, the association tests based on PennCNV calls gave more significant results, but the new algorithm retained high power. For small CNVs (number of SNPs 10, the logistic algorithm provided smaller average p-values (e.g., p = 7.54e - 17 when relative risk RR = 3.0 in all the scenarios and could capture signals that PennCNV did not (e.g., p = 0.020 when RR = 3.0. From a second set of simulations, we showed that the new algorithm is more powerful in detecting disease associations with small CNVs (number of SNPs ranging from 3 to 5 under different penetrance models (e.g., when RR = 3.0, for relatively weak signals, power = 0
Nicholette D Palmer
Full Text Available African Americans are disproportionately affected by type 2 diabetes (T2DM yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide Association Study (GWAS using the Affymetrix 6.0 array in 965 African-American cases with T2DM and end-stage renal disease (T2DM-ESRD and 1029 population-based controls. The most significant SNPs (n = 550 independent loci were genotyped in a replication cohort and 122 SNPs (n = 98 independent loci were further tested through genotyping three additional validation cohorts followed by meta-analysis in all five cohorts totaling 3,132 cases and 3,317 controls. Twelve SNPs had evidence of association in the GWAS (P<0.0071, were directionally consistent in the Replication cohort and were associated with T2DM in subjects without nephropathy (P<0.05. Meta-analysis in all cases and controls revealed a single SNP reaching genome-wide significance (P<2.5×10(-8. SNP rs7560163 (P = 7.0×10(-9, OR (95% CI = 0.75 (0.67-0.84 is located intergenically between RND3 and RBM43. Four additional loci (rs7542900, rs4659485, rs2722769 and rs7107217 were associated with T2DM (P<0.05 and reached more nominal levels of significance (P<2.5×10(-5 in the overall analysis and may represent novel loci that contribute to T2DM. We have identified novel T2DM-susceptibility variants in the African-American population. Notably, T2DM risk was associated with the major allele and implies an interesting genetic architecture in this population. These results suggest that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations.
McKay, James D
Genome-wide association studies (GWAS) have been successful in identifying common genetic variation involved in susceptibility to etiologically complex disease. We conducted a GWAS to identify common genetic variation involved in susceptibility to upper aero-digestive tract (UADT) cancers. Genome-wide genotyping was carried out using the Illumina HumanHap300 beadchips in 2,091 UADT cancer cases and 3,513 controls from two large European multi-centre UADT cancer studies, as well as 4,821 generic controls. The 19 top-ranked variants were investigated further in an additional 6,514 UADT cancer cases and 7,892 controls of European descent from an additional 13 UADT cancer studies participating in the INHANCE consortium. Five common variants presented evidence for significant association in the combined analysis (p ≤ 5 × 10⁻⁷). Two novel variants were identified, a 4q21 variant (rs1494961, p = 1×10⁻⁸) located near DNA repair related genes HEL308 and FAM175A (or Abraxas) and a 12q24 variant (rs4767364, p =2 × 10⁻⁸) located in an extended linkage disequilibrium region that contains multiple genes including the aldehyde dehydrogenase 2 (ALDH2) gene. Three remaining variants are located in the ADH gene cluster and were identified previously in a candidate gene study involving some of these samples. The association between these three variants and UADT cancers was independently replicated in 5,092 UADT cancer cases and 6,794 controls non-overlapping samples presented here (rs1573496-ADH7, p = 5 × 10⁻⁸); rs1229984-ADH1B, p = 7 × 10⁻⁹; and rs698-ADH1C, p = 0.02). These results implicate two variants at 4q21 and 12q24 and further highlight three ADH variants in UADT cancer susceptibility.
James D McKay
Full Text Available Genome-wide association studies (GWAS have been successful in identifying common genetic variation involved in susceptibility to etiologically complex disease. We conducted a GWAS to identify common genetic variation involved in susceptibility to upper aero-digestive tract (UADT cancers. Genome-wide genotyping was carried out using the Illumina HumanHap300 beadchips in 2,091 UADT cancer cases and 3,513 controls from two large European multi-centre UADT cancer studies, as well as 4,821 generic controls. The 19 top-ranked variants were investigated further in an additional 6,514 UADT cancer cases and 7,892 controls of European descent from an additional 13 UADT cancer studies participating in the INHANCE consortium. Five common variants presented evidence for significant association in the combined analysis (p ≤ 5 × 10⁻⁷. Two novel variants were identified, a 4q21 variant (rs1494961, p = 1×10⁻⁸ located near DNA repair related genes HEL308 and FAM175A (or Abraxas and a 12q24 variant (rs4767364, p =2 × 10⁻⁸ located in an extended linkage disequilibrium region that contains multiple genes including the aldehyde dehydrogenase 2 (ALDH2 gene. Three remaining variants are located in the ADH gene cluster and were identified previously in a candidate gene study involving some of these samples. The association between these three variants and UADT cancers was independently replicated in 5,092 UADT cancer cases and 6,794 controls non-overlapping samples presented here (rs1573496-ADH7, p = 5 × 10⁻⁸; rs1229984-ADH1B, p = 7 × 10⁻⁹; and rs698-ADH1C, p = 0.02. These results implicate two variants at 4q21 and 12q24 and further highlight three ADH variants in UADT cancer susceptibility.
Castaldi, Peter J; Cho, Michael H; Litonjua, Augusto A; Bakke, Per; Gulsvik, Amund; Lomas, David A; Anderson, Wayne; Beaty, Terri H; Hokanson, John E; Crapo, James D; Laird, Nan; Silverman, Edwin K
Two recent metaanalyses of genome-wide association studies conducted by the CHARGE and SpiroMeta consortia identified novel loci yielding evidence of association at or near genome-wide significance (GWS) with FEV(1) and FEV(1)/FVC. We hypothesized that a subset of these markers would also be associated with chronic obstructive pulmonary disease (COPD) susceptibility. Thirty-two single-nucleotide polymorphisms (SNPs) in or near 17 genes in 11 previously identified GWS spirometric genomic regions were tested for association with COPD status in four COPD case-control study samples (NETT/NAS, the Norway case-control study, ECLIPSE, and the first 1,000 subjects in COPDGene; total sample size, 3,456 cases and 1,906 controls). In addition to testing the 32 spirometric GWS SNPs, we tested a dense panel of imputed HapMap2 SNP markers from the 17 genes located near the 32 GWS SNPs and in a set of 21 well studied COPD candidate genes. Of the previously identified GWS spirometric genomic regions, three loci harbored SNPs associated with COPD susceptibility at a 5% false discovery rate: the 4q24 locus including FLJ20184/INTS12/GSTCD/NPNT, the 6p21 locus including AGER and PPT2, and the 5q33 locus including ADAM19. In conclusion, markers previously associated at or near GWS with spirometric measures were tested for association with COPD status in data from four COPD case-control studies, and three loci showed evidence of association with COPD susceptibility at a 5% false discovery rate.
Eimear E Kenny
Full Text Available Crohn's disease (CD is a complex disorder resulting from the interaction of intestinal microbiota with the host immune system in genetically susceptible individuals. The largest meta-analysis of genome-wide association to date identified 71 CD-susceptibility loci in individuals of European ancestry. An important epidemiological feature of CD is that it is 2-4 times more prevalent among individuals of Ashkenazi Jewish (AJ descent compared to non-Jewish Europeans (NJ. To explore genetic variation associated with CD in AJs, we conducted a genome-wide association study (GWAS by combining raw genotype data across 10 AJ cohorts consisting of 907 cases and 2,345 controls in the discovery stage, followed up by a replication study in 971 cases and 2,124 controls. We confirmed genome-wide significant associations of 9 known CD loci in AJs and replicated 3 additional loci with strong signal (p<5×10⁻⁶. Novel signals detected among AJs were mapped to chromosomes 5q21.1 (rs7705924, combined p = 2×10⁻⁸; combined odds ratio OR = 1.48, 2p15 (rs6545946, p = 7×10⁻⁹; OR = 1.16, 8q21.11 (rs12677663, p = 2×10⁻⁸; OR = 1.15, 10q26.3 (rs10734105, p = 3×10⁻⁸; OR = 1.27, and 11q12.1 (rs11229030, p = 8×10⁻⁹; OR = 1.15, implicating biologically plausible candidate genes, including RPL7, CPAMD8, PRG2, and PRG3. In all, the 16 replicated and newly discovered loci, in addition to the three coding NOD2 variants, accounted for 11.2% of the total genetic variance for CD risk in the AJ population. This study demonstrates the complementary value of genetic studies in the Ashkenazim.
Neale, B.M.; Medland, S.E.; Ripke, S.; Asherson, P.; Franke, B.; Lesch, K.P.; Faraone, S.V.; Nguyen, T.T.; Schafer, H.; Holmans, P.; Daly, M.; Steinhausen, H.C.; Freitag, C.; Reif, A.; Renner, T.J.; Romanos, M.; Romanos, J.; Walitza, S.; Warnke, A.; Meyer, J.; Palmason, H.; Buitelaar, J.K.; Arias Vasquez, A.; Lambregts-Rommelse, N.N.J.; Gill, M.; Anney, R.J.; Langely, K.; O'Donovan, M.; Williams, N.; Owen, M.; Thapar, A.; Kent, L.; Sergeant, J.A.; Roeyers, H.; Mick, E.; Biederman, J.; Doyle, A.; Smalley, S.; Loo, S.; Hakonarson, H.; Elia, J.; Todorov, A.A.; Miranda, A.; Mulas, F.; Ebstein, R.P.; Rothenberger, A.; Banaschewski, T.; Oades, R.D.; Sonuga-Barke, E.J.S.; McGough, J.; Nisenbaum, L.; Middleton, F.; Hu, X.; Nelson, S.
OBJECTIVE: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded
Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.
Objective: Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of…
Nieuwboer, H.A.; Pool, R.; Dolan, C.V.; Boomsma, D.I.; Nivard, M.G.
Here we present a method of genome-wide inferred study (GWIS) that provides an approximation of genome-wide association study (GWAS) summary statistics for a variable that is a function of phenotypes for which GWAS summary statistics, phenotypic means, and covariances are available. A GWIS can be
Gao, L.; Smit, Matthijs; Oord, J.J. van den; Goeman, J.J.; Verdegaal, E.M.; Burg, S.H. van der; Stas, M.; Beck, S.; Gruis, N.A.; Tensen, C.P.; Willemze, R.; Peeper, D.S.; Doorn, R. van
The involvement of epigenetic alterations in the pathogenesis of melanoma is increasingly recognized. Here, we performed genome-wide DNA methylation analysis of primary cutaneous melanoma and benign melanocytic nevus interrogating 14 495 genes using BeadChip technology. This genome-wide view of
Paternoster, L.; Evans, D. M.; Nohr, E. A.
Background: Thirty-two common variants associated with body mass index (BMI) have been identified in genome-wide association studies, explaining,similar to 1.45% of BMI variation in general population cohorts. We performed a genome-wide association study in a sample of young adults enriched...
Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan
Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…
Hoskins, Roger A.; Landolin, Jane M.; Brown, James B.; Sandler, Jeremy E.; Takahashi, Hazuki; Lassmann, Timo; Yu, Charles; Booth, Benjamin W.; Zhang, Dayu; Wan, Kenneth H.; Yang, Li; Boley, Nathan; Andrews, Justen; Kaufman, Thomas C.; Graveley, Brenton R.; Bickel, Peter J.; Carninci, Piero; Carlson, Joseph W.; Celniker, Susan E.
Core promoters are critical regions for gene regulation in higher eukaryotes. However, the boundaries of promoter regions, the relative rates of initiation at the transcription start sites (TSSs) distributed within them, and the functional significance of promoter architecture remain poorly understood. We produced a high-resolution map of promoters active in the Drosophila melanogaster embryo by integrating data from three independent and complementary methods: 21 million cap analysis of gene expression (CAGE) tags, 1.2 million RNA ligase mediated rapid amplification of cDNA ends (RLMRACE) reads, and 50,000 cap-trapped expressed sequence tags (ESTs). We defined 12,454 promoters of 8037 genes. Our analysis indicates that, due to non-promoter-associated RNA background signal, previous studies have likely overestimated the number of promoter-associated CAGE clusters by fivefold. We show that TSS distributions form a complex continuum of shapes, and that promoters active in the embryo and adult have highly similar shapes in 95% of cases. This suggests that these distributions are generally determined by static elements such as local DNA sequence and are not modulated by dynamic signals such as histone modifications. Transcription factor binding motifs are differentially enriched as a function of promoter shape, and peaked promoter shape is correlated with both temporal and spatial regulation of gene expression. Our results contribute to the emerging view that core promoters are functionally diverse and control patterning of gene expression in Drosophila and mammals.
Anttila, Verneri; Stefansson, Hreinn; Kallela, Mikko
1835740 on chromosome 8q22.1 to be associated with migraine (P = 5.38 × 10¿¿, odds ratio = 1.23, 95% CI 1.150-1.324) in a genome-wide association study of 2,731 migraine cases ascertained from three European headache clinics and 10,747 population-matched controls. The association was replicated in 3...
Sevillano Del Aguila, C.A.; Soares Lopes, M.; Harlizius, B; Hanenberg, E.H.A.T.; Knol, E F; Bastiaansen, J. W. M.
Background Cryptorchidism and scrotal/inguinal hernia are the most frequent congenital defects in pigs. Identification of genomic regions that control these congenital defects is of great interest to breeding programs, both from an animal welfare point of view as well as for economic reasons. The aim of this genome-wide association study (GWAS) was to identify single nucleotide polymorphisms (SNPs) that are strongly associated with these congenital defects. Genotypes were available for 2570 L...
Sevillano, Claudia A.; Marcos S. Lopes; Harlizius, Barbara; Hanenberg, Egiel HAT; Knol, Egbert F; Bastiaansen, John WM
Background Cryptorchidism and scrotal/inguinal hernia are the most frequent congenital defects in pigs. Identification of genomic regions that control these congenital defects is of great interest to breeding programs, both from an animal welfare point of view as well as for economic reasons. The aim of this genome-wide association study (GWAS) was to identify single nucleotide polymorphisms (SNPs) that are strongly associated with these congenital defects. Genotypes were available for 2570 L...
Earp, Madalene A; Linda E. Kelemen; Magliocco, Anthony M.; Swenerton, Kenneth D.; Chenevix-Trench, Georgia; Lu, Yi; Hein, Alexander; Ekici, Arif B.; Beckmann, Matthias W.; Fasching, Peter A; Lambrechts, Diether; Despierre, Evelyn; VERGOTE, Ignace; Lambrechts, Sandrina; Doherty, Jennifer A.
Epithelial ovarian cancer (EOC) is a heterogeneous cancer with both genetic and environmental risk factors. Variants influencing the risk of developing the less-common EOC subtypes have not been fully investigated. We performed a genome-wide association study (GWAS) of EOC according to subtype by pooling genomic DNA from 545 cases and 398 controls of European descent, and testing for allelic associations. We evaluated for replication 188 variants from the GWAS (56 variants for mucinous, 55 fo...
Earp, Madalene A; Linda E. Kelemen; Magliocco, Anthony M.; Swenerton, Kenneth D.; Chenevix–Trench, Georgia; Lu, Yi; Hein, Alexander; Ekici, Arif B.; Beckmann, Matthias W.; Fasching, Peter A; Lambrechts, Diether; Despierre, Evelyn; VERGOTE, Ignace; Lambrechts, Sandrina; Doherty, Jennifer A.
Epithelial ovarian cancer (EOC) is a heterogeneous cancer with both genetic and environmental risk factors. Variants influencing the risk of developing the less-common EOC subtypes have not been fully investigated. We performed a genome-wide association study (GWAS) of EOC according to subtype by pooling genomic DNA from 545 cases and 398 controls of European descent, and testing for allelic associations. We evaluated for replication 188 variants from the GWAS (56 variants for mucinous, 55 fo...
Sampson, JN; Wheeler, WA; Yeager, M.; Panagiotou, O.; Wang, Z; Berndt, SI; Lan, Q; Abnet, CC; Amundadottir, LT; Figueroa, JD; Landi, Mt; Mirabello, L.; Savage, SA; TAYLOR, PR; De Vivo, I
Background: Studies of related individuals have consistently demonstrated notable familial aggregation of cancer. We aim to estimate the heritability and genetic correlation attributable to the additive effects of common SNPs for cancer at 13 anatomical sites. Methods: Between 2007 and 2014, the US National Cancer Institute has generated data from genome-wide association studies (GWAS) for 49,492 cancer cases and 34,131 controls. We apply novel mixed model methodology (GCTA) to this GWAS data...
Wu, Qingyao; Ye, Yunming; Liu, Yang; Ng, Michael K
For high dimensional genome-wide association (GWA) case-control data of complex disease, there are usually a large portion of single-nucleotide polymorphisms (SNPs) that are irrelevant with the disease. A simple random sampling method in random forest using default mtry parameter to choose feature subspace, will select too many subspaces without informative SNPs. Exhaustive searching an optimal mtry is often required in order to include useful and relevant SNPs and get rid of vast of non-informative SNPs. However, it is too time-consuming and not favorable in GWA for high-dimensional data. The main aim of this paper is to propose a stratified sampling method for feature subspace selection to generate decision trees in a random forest for GWA high-dimensional data. Our idea is to design an equal-width discretization scheme for informativeness to divide SNPs into multiple groups. In feature subspace selection, we randomly select the same number of SNPs from each group and combine them to form a subspace to generate a decision tree. The advantage of this stratified sampling procedure can make sure each subspace contains enough useful SNPs, but can avoid a very high computational cost of exhaustive search of an optimal mtry, and maintain the randomness of a random forest. We employ two genome-wide SNP data sets (Parkinson case-control data comprised of 408 803 SNPs and Alzheimer case-control data comprised of 380 157 SNPs) to demonstrate that the proposed stratified sampling method is effective, and it can generate better random forest with higher accuracy and lower error bound than those by Breiman's random forest generation method. For Parkinson data, we also show some interesting genes identified by the method, which may be associated with neurological disorders for further biological investigations.
breeding programs. Seeding emergence rate (SER) and tillering are the most important agronomic traits affecting biomass and grain yield potential in wheat. Seedling emergence rate is an important target for judging the quality of seeds, and it is a reflection of seedling vigor and growth that indirectly affects yields. Several ...
Cunninghame Graham, Deborah S
Genome-wide association studies (GWAS) have been shown to be a powerful way of identifying novel susceptibility genes in systemic lupus erythematosus (SLE), as demonstrated by a series of publications in the past year. Lupus has been a late-comer to the GWAS community, being preceded by success stories for the GWAS approach in other autoimmune diseases, including type I diabetes, ankylosing spondylitis, rheumatoid arthritis, Crohn's disease and ulcerative colitis. The paper by Suarez-Gestal and colleagues seeks to exploit the wealth of data available from a total of four GWAS in SLE, three in European-American populations and one in a Swedish population. The authors describe replication of ten lupus susceptibility alleles in a Spanish SLE case-control study. PMID:19664177
Full Text Available Metastasis remains the primary cause of patient morbidity and mortality in solid tumors and is due to the action of a large number of tumor-autonomous and non-autonomous factors. Here we report the results of a genome-wide integrated strategy to identify novel metastasis susceptibility candidate genes and molecular pathways in breast cancer metastasis. This analysis implicates a number of transcriptional regulators and suggests cell-mediated immunity is an important determinant. Moreover, the analysis identified novel or FDA-approved drugs as potentially useful for anti-metastatic therapy. Further explorations implementing this strategy may therefore provide a variety of information for clinical applications in the control and treatment of advanced neoplastic disease.
Schumacher, Fredrick R.; Berndt, Sonja I.; Siddiq, Afshan
published GWAS with 7358 PrCa cases and 6732 controls, we identified a new susceptibility locus associated with overall PrCa risk at 2q37.3 (rs2292884, P= 4.3 x 10–8). We also confirmed a locus suggested by an earlier GWAS at 12q13 (rs902774, P= 8.6 x 10–9). The estimated per-allele odds ratios......Prostate cancer (PrCa) is the most common non-skin cancer diagnosed among males in developed countries and the second leading cause of cancer mortality, yet little is known regarding its etiology and factors that influence clinical outcome. Genome-wide association studies (GWAS) of PrCa have...
Aaron L. Statham
Full Text Available DNA methylation and nucleosome positioning are two key mechanisms that contribute to the epigenetic control of gene expression. During carcinogenesis, the expression of many genes is altered alongside extensive changes in the epigenome, with repressed genes often being associated with local DNA hypermethylation and gain of nucleosomes at their promoters. However the spectrum of alterations that occur at distal regulatory regions has not been extensively studied. To address this we used Nucleosome Occupancy and Methylation sequencing (NOMe-seq to compare the genome-wide DNA methylation and nucleosome occupancy profiles between normal and cancer cell line models of the breast and prostate. Here we describe the bioinformatic pipeline and methods that we developed for the processing and analysis of the NOMe-seq data published by (Taberlay et al., 2014  and deposited in the Gene Expression Omnibus with accession GSE57498.
Xiang, Zuoshuang; Qin, Tingting; Qin, Zhaohui S; He, Yongqun
The large amount of literature in the post-genomics era enables the study of gene interactions and networks using all available articles published for a specific organism. MeSH is a controlled vocabulary of medical and scientific terms that is used by biomedical scientists to manually index articles in the PubMed literature database. We hypothesized that genome-wide gene-MeSH term associations from the PubMed literature database could be used to predict implicit gene-to-gene relationships and networks. While the gene-MeSH associations have been used to detect gene-gene interactions in some studies, different methods have not been well compared, and such a strategy has not been evaluated for a genome-wide literature analysis. Genome-wide literature mining of gene-to-gene interactions allows ranking of the best gene interactions and investigation of comprehensive biological networks at a genome level. The genome-wide GenoMesh literature mining algorithm was developed by sequentially generating a gene-article matrix, a normalized gene-MeSH term matrix, and a gene-gene matrix. The gene-gene matrix relies on the calculation of pairwise gene dissimilarities based on gene-MeSH relationships. An optimized dissimilarity score was identified from six well-studied functions based on a receiver operating characteristic (ROC) analysis. Based on the studies with well-studied Escherichia coli and less-studied Brucella spp., GenoMesh was found to accurately identify gene functions using weighted MeSH terms, predict gene-gene interactions not reported in the literature, and cluster all the genes studied from an organism using the MeSH-based gene-gene matrix. A web-based GenoMesh literature mining program is also available at: http://genomesh.hegroup.org. GenoMesh also predicts gene interactions and networks among genes associated with specific MeSH terms or user-selected gene lists. The GenoMesh algorithm and web program provide the first genome-wide, MeSH-based literature mining
Liao, X J; Lia, L; Zhang, Z Y; Long, Y; Yang, B; Ruan, G R; Su, Y; Ai, H S; Zhang, W C; Deng, W Y; Xiao, S J; Ren, J; Ding, N S; Huang, L S
Umbilical hernia (UH) is a complex disorder caused by both genetic and environmental factors. UH brings animal welfare problems and severe economic loss to the pig industry. Until now, the genetic basis of UH is poorly understood. The high-density 60K porcine SNP array enables the rapid application of genome-wide association study (GWAS) to identify genetic loci for phenotypic traits at genome wide scale in pigs. The objective of this research was to identify susceptibility loci for swine umbilical hernia using the GWAS approach. We genotyped 478 piglets from 142 families representing three Western commercial breeds with the Illumina PorcineSNP60 BeadChip. Then significant SNPs were detected by GWAS using ROADTRIPS (Robust Association-Detection Test for Related Individuals with Population Substructure) software base on a Bonferroni corrected threshold (P = 1.67E-06) or suggestive threshold (P = 3.34E-05) and false discovery rate (FDR = 0.05). After quality control, 29,924 qualified SNPs and 472 piglets were used for GWAS. Two suggestive loci predisposing to pig UH were identified at 44.25MB on SSC2 (rs81358018, P = 3.34E-06, FDR = 0.049933) and at 45.90MB on SSC17 (rs81479278, P = 3.30E-06, FDR = 0.049933) in Duroc population, respectively. And no SNP was detected to be associated with pig UH at significant level in neither Landrace nor Large White population. Furthermore, we carried out a meta-analysis in the combined pure-breed population containing all the 472 piglets. rs81479278 (P = 1.16E-06, FDR = 0.022475) was identified to associate with pig UH at genome-wide significant level. SRC was characterized as plausible candidate gene for susceptibility to pig UH according to its genomic position and biological functions. To our knowledge, this study gives the first description of GWAS identifying susceptibility loci for umbilical hernia in pigs. Our findings provide deeper insights to the genetic architecture of umbilical hernia in pigs.
Full Text Available Retrovirus (RV is efficient for gene transfer and integration in dividing cells of diverse organisms. RV provides a powerful tool for insertional mutagenesis (IM to identify and functionally analyze genes essential for normal and pathological processes. Here we report RV-mediated gene transfer and genome-wide IM in fish stem cells from medaka and zebrafish. Three RVs were produced for fish cell transduction: rvLegfp and rvLcherry produce green fluorescent protein (GFP and mCherry fluorescent protein respectively under control of human cytomegalovirus immediate early promoter upon any chromosomal integration, whereas rvGTgfp contains a splicing acceptor and expresses GFP only upon gene trapping (GT via intronic in-frame integration and spliced to endogenous active genes. We show that rvLegfp and rvLcherry produce a transduction efficiency of 11~23% in medaka and zebrafish stem cell lines, which is as 30~67% efficient as the positive control in NIH/3T3. Upon co-infection with rvGTgfp and rvLcherry, GFP-positive cells were much fewer than Cherry-positive cells, consistent with rareness of productive gene trapping events versus random integration. Importantly, rvGTgfp infection in the medaka haploid embryonic stem (ES cell line HX1 generated GTgfp insertion on all 24 chromosomes of the haploid genome. Similar to the mammalian haploid cells, these insertion events were presented predominantly in intergenic regions and introns but rarely in exons. RV-transduced HX1 retained the ES cell properties such as stable growth, embryoid body formation and pluripotency gene expression. Therefore, RV is proficient for gene transfer and IM in fish stem cells. Our results open new avenue for genome-wide IM in medaka haploid ES cells in culture.
Karunas, A S; Iunusbaev, B B; Fedorova, Iu Iu; Gimalova, G F; Ramazanova, N N; Gur'eva, L L; Mukhtarova, L A; Zagidullin, Sh Z; Etkina, E I; Khusnutdinova, E K
Bronchial asthma is a chronic inflammatory respiratory disease that is caused by the complex interaction of environmental influences and genetic susceptibility. The first genome-wide association study of bronchial asthma discovered a significant association between SNPs within 17q12-21 genomic region and childhood bronchial asthma in individuals of European descent. Association with this genomic region was then replicated in a number of independent samples of European and Asian descent. Here we report results of the first genome-wide association study of bronchial asthma in the Volga-Ural region of Russia. The present study includes 358 unrelated patients with physician-diagnosed bronchial asthma and 369 disease-free control subjects of different ethnic origin (Russians, Tatars and Bashkirs). Genotyping of DNA samples was carried out using the Illumina Human610 quad array as a part of GABRIEL project (contract from the EC No LSHB-CT-2006-018996). After QC filtering procedures, a final set of 550915 SNPs genotyped in 330 cases and 348 controls was tested for association with bronchial asthma. Five markers on chromosome 17q12-21 showed statistically significant association with bronchial asthma (p < or = 4.79 x 10(-7)). SNP rs7216389 with the strongest evidence for association (p = 1.01 x 10(-7)) is located within the first intron of the GSDMB gene. Evidence for association was stronger with childhood-onset asthma (p = 1.97 x 10(-6) for SNP rs7216389) compared to late-onset asthma (p = 1.8 x 10(-4) for SNP rs7216389). Our replication study using three SNPs within GSDMB gene confirmed association with only childhood-onset asthma. In summary, these results suggest an important role for genetic variants within 17q12-q21 region in the development of bronchial asthma in the Volga-Ural region of Russia.
Full Text Available Abstract Background Dependences on addictive substances are substantially-heritable complex disorders whose molecular genetic bases have been partially elucidated by studies that have largely focused on research volunteers, including those recruited in Baltimore. Maryland. Subjects recruited from the Baltimore site of the Epidemiological Catchment Area (ECA study provide a potentially-useful comparison group for possible confounding features that might arise from selecting research volunteer samples of substance dependent and control individuals. We now report novel SNP (single nucleotide polymorphism genome wide association (GWA results for vulnerability to substance dependence in ECA participants, who were initially ascertained as members of a probability sample from Baltimore, and compare the results to those from ethnically-matched Baltimore research volunteers. Results We identify substantial overlap between the home address zip codes reported by members of these two samples. We find overlapping clusters of SNPs whose allele frequencies differ with nominal significance between substance dependent vs control individuals in both samples. These overlapping clusters of nominally-positive SNPs identify 172 genes in ways that are never found by chance in Monte Carlo simulation studies. Comparison with data from human expressed sequence tags suggests that these genes are expressed in brain, especially in hippocampus and amygdala, to extents that are greater than chance. Conclusion The convergent results from these probability sample and research volunteer sample datasets support prior genome wide association results. They fail to support the idea that large portions of the molecular genetic results for vulnerability to substance dependence derive from factors that are limited to research volunteers.
Liu, Qizhi; Wang, Yunzhi; Lin, Fan; Zhang, Lei; Li, Yan; Ge, Ruowen; Hong, Yunhan
Retrovirus (RV) is efficient for gene transfer and integration in dividing cells of diverse organisms. RV provides a powerful tool for insertional mutagenesis (IM) to identify and functionally analyze genes essential for normal and pathological processes. Here we report RV-mediated gene transfer and genome-wide IM in fish stem cells from medaka and zebrafish. Three RVs were produced for fish cell transduction: rvLegfp and rvLcherry produce green fluorescent protein (GFP) and mCherry fluorescent protein respectively under control of human cytomegalovirus immediate early promoter upon any chromosomal integration, whereas rvGTgfp contains a splicing acceptor and expresses GFP only upon gene trapping (GT) via intronic in-frame integration and spliced to endogenous active genes. We show that rvLegfp and rvLcherry produce a transduction efficiency of 11~23% in medaka and zebrafish stem cell lines, which is as 30~67% efficient as the positive control in NIH/3T3. Upon co-infection with rvGTgfp and rvLcherry, GFP-positive cells were much fewer than Cherry-positive cells, consistent with rareness of productive gene trapping events versus random integration. Importantly, rvGTgfp infection in the medaka haploid embryonic stem (ES) cell line HX1 generated GTgfp insertion on all 24 chromosomes of the haploid genome. Similar to the mammalian haploid cells, these insertion events were presented predominantly in intergenic regions and introns but rarely in exons. RV-transduced HX1 retained the ES cell properties such as stable growth, embryoid body formation and pluripotency gene expression. Therefore, RV is proficient for gene transfer and IM in fish stem cells. Our results open new avenue for genome-wide IM in medaka haploid ES cells in culture.
Full Text Available The emergence and re-emergence of plant pathogenic microorganisms are processes that imply perturbations in both host and pathogen ecological niches. Global change is largely assumed to drive the emergence of new etiological agents by altering the equilibrium of the ecological habitats which in turn places hosts more in contact with pathogen reservoirs. In this context, the number of epidemics is expected to increase dramatically in the next coming decades both in wild and crop plants. Under these considerations, the identification of the genetic variants underlying natural variation of resistance is a pre-requisite to estimate the adaptive potential of wild plant populations and to develop new breeding resistant cultivars. On the other hand, the prediction of pathogen's genetic determinants underlying disease emergence can help to identify plant resistance alleles. In the genomic era, whole genome sequencing combined with the development of statistical methods led to the emergence of Genome Wide Association (GWA mapping, a powerful tool for detecting genomic regions associated with natural variation of disease resistance in both wild and cultivated plants. However, GWA mapping has been less employed for the detection of genetic variants associated with pathogenicity in microbes. Here, we reviewed GWA studies performed either in plants or in pathogenic microorganisms (bacteria, fungi and oomycetes. In addition, we highlighted the benefits and caveats of the emerging joint GWA mapping approach that allows for the simultaneous identification of genes interacting between genomes of both partners. Finally, based on co-evolutionary processes in wild populations, we highlighted a phenotyping-free joint GWA mapping approach as a promising tool for describing the molecular landscape underlying plant - microbe interactions.
Erin N Smith
Full Text Available Although a highly heritable and disabling disease, bipolar disorder's (BD genetic variants have been challenging to identify. We present new genotype data for 1,190 cases and 401 controls and perform a genome-wide association study including additional samples for a total of 2,191 cases and 1,434 controls. We do not detect genome-wide significant associations for individual loci; however, across all SNPs, we show an association between the power to detect effects calculated from a previous genome-wide association study and evidence for replication (P = 1.5×10(-7. To demonstrate that this result is not likely to be a false positive, we analyze replication rates in a large meta-analysis of height and show that, in a large enough study, associations replicate as a function of power, approaching a linear relationship. Within BD, SNPs near exons exhibit a greater probability of replication, supporting an enrichment of reproducible associations near functional regions of genes. These results indicate that there is likely common genetic variation associated with BD near exons (±10 kb that could be identified in larger studies and, further, provide a framework for assessing the potential for replication when combining results from multiple studies.
Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis
Almasy, L, Blangero, J. (2009) Human QTL linkage mapping. Genetica 136:333-340. Amos, CI. (2007) Successful design and conduct of genome-wide...quantitative trait loci. Genetica 136:237-243. Skol AD, Scott LJ, Abecasis GR, Boehnke M. (2006) Joint analysis is more efficient than replication
Lane, Jérôme; McLaren, Paul J.; Dorrell, Lucy; Shianna, Kevin V.; Stemke, Amanda; Pelak, Kimberly; Moore, Stephen; Oldenburg, Johannes; Alvarez-Roman, Maria Teresa; Angelillo-Scherrer, Anne; Boehlen, Francoise; Bolton-Maggs, Paula H.B.; Brand, Brigit; Brown, Deborah; Chiang, Elaine; Cid-Haro, Ana Rosa; Clotet, Bonaventura; Collins, Peter; Colombo, Sara; Dalmau, Judith; Fogarty, Patrick; Giangrande, Paul; Gringeri, Alessandro; Iyer, Rathi; Katsarou, Olga; Kempton, Christine; Kuriakose, Philip; Lin, Judith; Makris, Mike; Manco-Johnson, Marilyn; Tsakiris, Dimitrios A.; Martinez-Picado, Javier; Mauser-Bunschoten, Evelien; Neff, Anne; Oka, Shinichi; Oyesiku, Lara; Parra, Rafael; Peter-Salonen, Kristiina; Powell, Jerry; Recht, Michael; Shapiro, Amy; Stine, Kimo; Talks, Katherine; Telenti, Amalio; Wilde, Jonathan; Yee, Thynn Thynn; Wolinsky, Steven M.; Martinson, Jeremy; Hussain, Shehnaz K.; Bream, Jay H.; Jacobson, Lisa P.; Carrington, Mary; Goedert, James J.; Haynes, Barton F.; McMichael, Andrew J.; Goldstein, David B.; Fellay, Jacques
Human genetic variation contributes to differences in susceptibility to HIV-1 infection. To search for novel host resistance factors, we performed a genome-wide association study (GWAS) in hemophilia patients highly exposed to potentially contaminated factor VIII infusions. Individuals with hemophilia A and a documented history of factor VIII infusions before the introduction of viral inactivation procedures (1979–1984) were recruited from 36 hemophilia treatment centers (HTCs), and their genome-wide genetic variants were compared with those from matched HIV-infected individuals. Homozygous carriers of known CCR5 resistance mutations were excluded. Single nucleotide polymorphisms (SNPs) and inferred copy number variants (CNVs) were tested using logistic regression. In addition, we performed a pathway enrichment analysis, a heritability analysis, and a search for epistatic interactions with CCR5 Δ32 heterozygosity. A total of 560 HIV-uninfected cases were recruited: 36 (6.4%) were homozygous for CCR5 Δ32 or m303. After quality control and SNP imputation, we tested 1 081 435 SNPs and 3686 CNVs for association with HIV-1 serostatus in 431 cases and 765 HIV-infected controls. No SNP or CNV reached genome-wide significance. The additional analyses did not reveal any strong genetic effect. Highly exposed, yet uninfected hemophiliacs form an ideal study group to investigate host resistance factors. Using a genome-wide approach, we did not detect any significant associations between SNPs and HIV-1 susceptibility, indicating that common genetic variants of major effect are unlikely to explain the observed resistance phenotype in this population. PMID:23372042
Full Text Available The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs with minor allele frequency (MAF ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10−12. Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized.
Yildiz, Gokhan; Arslan-Ergul, Ayca; Bagislar, Sevgi; Konu, Ozlen; Yuzugullu, Haluk; Gursoy-Yuzugullu, Ozge; Ozturk, Nuri; Ozen, Cigdem; Ozdag, Hilal; Erdal, Esra; Karademir, Sedat; Sagol, Ozgul; Mizrak, Dilsa; Bozkaya, Hakan; Ilk, Hakki Gokhan; Ilk, Ozlem; Bilen, Biter; Cetin-Atalay, Rengul; Akar, Nejat; Ozturk, Mehmet
Senescence is a permanent proliferation arrest in response to cell stress such as DNA damage. It contributes strongly to tissue aging and serves as a major barrier against tumor development. Most tumor cells are believed to bypass the senescence barrier (become "immortal") by inactivating growth control genes such as TP53 and CDKN2A. They also reactivate telomerase reverse transcriptase. Senescence-to-immortality transition is accompanied by major phenotypic and biochemical changes mediated by genome-wide transcriptional modifications. This appears to happen during hepatocellular carcinoma (HCC) development in patients with liver cirrhosis, however, the accompanying transcriptional changes are virtually unknown. We investigated genome-wide transcriptional changes related to the senescence-to-immortality switch during hepatocellular carcinogenesis. Initially, we performed transcriptome analysis of senescent and immortal clones of Huh7 HCC cell line, and identified genes with significant differential expression to establish a senescence-related gene list. Through the analysis of senescence-related gene expression in different liver tissues we showed that cirrhosis and HCC display expression patterns compatible with senescent and immortal phenotypes, respectively; dysplasia being a transitional state. Gene set enrichment analysis revealed that cirrhosis/senescence-associated genes were preferentially expressed in non-tumor tissues, less malignant tumors, and differentiated or senescent cells. In contrast, HCC/immortality genes were up-regulated in tumor tissues, or more malignant tumors and progenitor cells. In HCC tumors and immortal cells genes involved in DNA repair, cell cycle, telomere extension and branched chain amino acid metabolism were up-regulated, whereas genes involved in cell signaling, as well as in drug, lipid, retinoid and glycolytic metabolism were down-regulated. Based on these distinctive gene expression features we developed a 15-gene
Silvia Teresa Rodríguez-Ramilo
Full Text Available Estimates of effective population size in the Holstein cattle breed have usually been low despite the large number of animals that constitute this breed. Effective population size is inversely related to the rates at which coancestry and inbreeding increase and these rates have been high as a consequence of intense and accurate selection. Traditionally, coancestry and inbreeding coefficients have been calculated from pedigree data. However, the development of genome-wide single nucleotide polymorphisms has increased the interest of calculating these coefficients from molecular data in order to improve their accuracy. In this study, genomic estimates of coancestry, inbreeding and effective population size were obtained in the Spanish Holstein population and then compared with pedigree-based estimates. A total of 11,135 animals genotyped with the Illumina BovineSNP50 BeadChip were available for the study. After applying filtering criteria, the final genomic dataset included 36,693 autosomal SNPs and 10,569 animals. Pedigree data from those genotyped animals included 31,203 animals. These individuals represented only the last five generations in order to homogenise the amount of pedigree information across animals. Genomic estimates of coancestry and inbreeding were obtained from identity by descent segments (coancestry or runs of homozygosity (inbreeding. The results indicate that the percentage of variance of pedigree-based coancestry estimates explained by genomic coancestry estimates was higher than that for inbreeding. Estimates of effective population size obtained from genome-wide and pedigree information were consistent and ranged from about 66 to 79. These low values emphasize the need of controlling the rate of increase of coancestry and inbreeding in Holstein selection programmes.
Gorski, Mathias; Tin, Adrienne; Garnaas, Maija; McMahon, Gearoid M.; Chu, Audrey Y.; Tayo, Bamidele O.; Pattaro, Cristian; Teumer, Alexander; Chasman, Daniel I.; Chalmers, John; Hamet, Pavel; Tremblay, Johanne; Woodward, Marc; Aspelund, Thor; Eiriksdottir, Gudny; Gudnason, Vilmundur; Harris, Tammara B.; Launer, Lenore J.; Smith, Albert V.; Mitchell, Braxton D.; O'Connell, Jeffrey R.; Shuldiner, Alan R.; Coresh, Josef; Li, Man; Freudenberger, Paul; Hofer, Edith; Schmidt, Helena; Schmidt, Reinhold; Holliday, Elizabeth G.; Mitchell, Paul; Wang, Jie Jin; de Boer, Ian H.; Li, Guo; Siscovick, David S.; Kutalik, Zoltan; Corre, Tanguy; Vollenweider, Peter; Waeber, Gérard; Gupta, Jayanta; Kanetsky, Peter A.; Hwang, Shih-Jen; Olden, Matthias; Yang, Qiong; de Andrade, Mariza; Atkinson, Elizabeth J.; Kardia, Sharon L.R.; Turner, Stephen T.; Stafford, Jeanette M.; Ding, Jingzhong; Liu, Yongmei; Barlassina, Cristina; Cusi, Daniele; Salvi, Erika; Staessen, Jan A; Ridker, Paul M; Grallert, Harald; Meisinger, Christa; Müller-Nurasyid, Martina; Krämer, Bernhard K.; Kramer, Holly; Rosas, Sylvia E.; Nolte, Ilja M.; Penninx, Brenda W.; Snieder, Harold; Del Greco, Fabiola; Franke, Andre; Nöthlings, Ute; Lieb, Wolfgang; Bakker, Stephan J.L.; Gansevoort, Ron T.; van der Harst, Pim; Dehghan, Abbas; Franco, Oscar H.; Hofman, Albert; Rivadeneira, Fernando; Sedaghat, Sanaz; Uitterlinden, André G.; Coassin, Stefan; Haun, Margot; Kollerits, Barbara; Kronenberg, Florian; Paulweber, Bernhard; Aumann, Nicole; Endlich, Karlhans; Pietzner, Mike; Völker, Uwe; Rettig, Rainer; Chouraki, Vincent; Helmer, Catherine; Lambert, Jean-Charles; Metzger, Marie; Stengel, Benedicte; Lehtimäki, Terho; Lyytikäinen, Leo-Pekka; Raitakari, Olli; Johnson, Andrew; Parsa, Afshin; Bochud, Murielle; Heid, Iris M.; Goessling, Wolfram; Köttgen, Anna; Kao, H. Linda; Fox, Caroline S.; Böger, Carsten A.
Genome wide association studies (GWAS) have identified multiple loci associated with cross-sectional eGFR, but a systematic genetic analysis of kidney function decline over time is missing. Here we conducted a GWAS meta-analysis among 63,558 participants of European descent, initially from 16 cohorts with serial kidney function measurements within the CKDGen Consortium, followed by independent replication among additional participants from 13 cohorts. In stage 1 GWAS meta-analysis, SNPs at MEOX2, GALNT11, IL1RAP, NPPA, HPCAL1 and CDH23 showed the strongest associations for at least one trait, in addition to the known UMOD locus which showed genome-wide significance with an annual change in eGFR. In stage 2 meta-analysis, the significant association at UMOD was replicated. Associations at GALNT11 with Rapid Decline (annual eGFRdecline of 3ml/min/1.73m2 or more), and CDH23 with eGFR change among those with CKD showed significant suggestive evidence of replication. Combined stage 1 and 2 meta-analyses showed significance for UMOD, GALNT11 and CDH23. Morpholino knockdowns of galnt11 and cdh23 in zebrafish embryos each had signs of severe edema 72 hours after gentamicin treatment compared to controls, but no gross morphological renal abnormalities before gentamicin administration. Thus, our results suggest a role in the deterioration of kidney function for the loci GALNT11 and CDH23, and show that the UMOD locus is significantly associated with kidney function decline. PMID:25493955
Rodríguez-Ramilo, Silvia Teresa; Fernández, Jesús; Toro, Miguel Angel; Hernández, Delfino; Villanueva, Beatriz
Estimates of effective population size in the Holstein cattle breed have usually been low despite the large number of animals that constitute this breed. Effective population size is inversely related to the rates at which coancestry and inbreeding increase and these rates have been high as a consequence of intense and accurate selection. Traditionally, coancestry and inbreeding coefficients have been calculated from pedigree data. However, the development of genome-wide single nucleotide polymorphisms has increased the interest of calculating these coefficients from molecular data in order to improve their accuracy. In this study, genomic estimates of coancestry, inbreeding and effective population size were obtained in the Spanish Holstein population and then compared with pedigree-based estimates. A total of 11,135 animals genotyped with the Illumina BovineSNP50 BeadChip were available for the study. After applying filtering criteria, the final genomic dataset included 36,693 autosomal SNPs and 10,569 animals. Pedigree data from those genotyped animals included 31,203 animals. These individuals represented only the last five generations in order to homogenise the amount of pedigree information across animals. Genomic estimates of coancestry and inbreeding were obtained from identity by descent segments (coancestry) or runs of homozygosity (inbreeding). The results indicate that the percentage of variance of pedigree-based coancestry estimates explained by genomic coancestry estimates was higher than that for inbreeding. Estimates of effective population size obtained from genome-wide and pedigree information were consistent and ranged from about 66 to 79. These low values emphasize the need of controlling the rate of increase of coancestry and inbreeding in Holstein selection programmes.
Bhattacharya, Soumyaroop; Go, Diana; Krenitsky, Daria L.; Huyck, Heidi L.; Solleti, Siva Kumar; Lunger, Valerie A.; Metlay, Leon; Srisuma, Sorachai; Wert, Susan E.; Pryhuber, Gloria S.
Rationale: Bronchopulmonary dysplasia (BPD) is a major complication of premature birth. Risk factors for BPD are complex and include prenatal infection and O2 toxicity. BPD pathology is equally complex and characterized by inflammation and dysmorphic airspaces and vasculature. Due to the limited availability of clinical samples, an understanding of the molecular pathogenesis of this disease and its causal mechanisms and associated biomarkers is limited. Objectives: Apply genome-wide expression profiling to define pathways affected in BPD lungs. Methods: Lung tissue was obtained at autopsy from 11 BPD cases and 17 age-matched control subjects without BPD. RNA isolated from these tissue samples was interrogated using microarrays. Standard gene selection and pathway analysis methods were applied to the data set. Abnormal expression patterns were validated by quantitative reverse transcriptase–polymerase chain reaction and immunohistochemistry. Measurements and Main Results: We identified 159 genes differentially expressed in BPD tissues. Pathway analysis indicated previously appreciated (e.g., DNA damage regulation of cell cycle) as well as novel (e.g., B-cell development) biological functions were affected. Three of the five most highly induced genes were mast cell (MC)-specific markers. We confirmed an increased accumulation of connective tissue MCTC (chymase expressing) mast cells in BPD tissues. Increased expression of MCTC markers was also demonstrated in an animal model of BPD-like pathology. Conclusions: We present a unique genome-wide expression data set from human BPD lung tissue. Our data provide information on gene expression patterns associated with BPD and facilitated the discovery that MCTC accumulation is a prominent feature of this disease. These observations have significant clinical and mechanistic implications. PMID:22723293
Full Text Available The use of genome-wide single nucleotide polymorphism (SNP data has recently proven useful in the study of human population structure. We have studied the internal genetic structure of the Swedish population using more than 350,000 SNPs from 1525 Swedes from all over the country genotyped on the Illumina HumanHap550 array. We have also compared them to 3212 worldwide reference samples, including Finns, northern Germans, British and Russians, based on the more than 29,000 SNPs that overlap between the Illumina and Affymetrix 250K Sty arrays. The Swedes--especially southern Swedes--were genetically close to the Germans and British, while their genetic distance to Finns was substantially longer. The overall structure within Sweden appeared clinal, and the substructure in the southern and middle parts was subtle. In contrast, the northern part of Sweden, Norrland, exhibited pronounced genetic differences both within the area and relative to the rest of the country. These distinctive genetic features of Norrland probably result mainly from isolation by distance and genetic drift caused by low population density. The internal structure within Sweden (F(ST = 0.0005 between provinces was stronger than that in many Central European populations, although smaller than what has been observed for instance in Finland; importantly, it is of the magnitude that may hamper association studies with a moderate number of markers if cases and controls are not properly matched geographically. Overall, our results underline the potential of genome-wide data in analyzing substructure in populations that might otherwise appear relatively homogeneous, such as the Swedes.
Full Text Available Dogs, with their breed-determined limited genetic background, are great models of human disease including cancer. Canine B-cell lymphoma and hemangiosarcoma are both malignancies of the hematologic system that are clinically and histologically similar to human B-cell non-Hodgkin lymphoma and angiosarcoma, respectively. Golden retrievers in the US show significantly elevated lifetime risk for both B-cell lymphoma (6% and hemangiosarcoma (20%. We conducted genome-wide association studies for hemangiosarcoma and B-cell lymphoma, identifying two shared predisposing loci. The two associated loci are located on chromosome 5, and together contribute ~20% of the risk of developing these cancers. Genome-wide p-values for the top SNP of each locus are 4.6×10-7 and 2.7×10-6, respectively. Whole genome resequencing of nine cases and controls followed by genotyping and detailed analysis identified three shared and one B-cell lymphoma specific risk haplotypes within the two loci, but no coding changes were associated with the risk haplotypes. Gene expression analysis of B-cell lymphoma tumors revealed that carrying the risk haplotypes at the first locus is associated with down-regulation of several nearby genes including the proximal gene TRPC6, a transient receptor Ca2+-channel involved in T-cell activation, among other functions. The shared risk haplotype in the second locus overlaps the vesicle transport and release gene STX8. Carrying the shared risk haplotype is associated with gene expression changes of 100 genes enriched for pathways involved in immune cell activation. Thus, the predisposing germ-line mutations in B-cell lymphoma and hemangiosarcoma appear to be regulatory, and affect pathways involved in T-cell mediated immune response in the tumor. This suggests that the interaction between the immune system and malignant cells plays a common role in the tumorigenesis of these relatively different cancers.
Tonomura, Noriko; Elvers, Ingegerd; Thomas, Rachael; Megquier, Kate; Turner-Maier, Jason; Howald, Cedric; Sarver, Aaron L.; Swofford, Ross; Frantz, Aric M.; Ito, Daisuke; Mauceli, Evan; Arendt, Maja; Noh, Hyun Ji; Koltookian, Michele; Biagi, Tara; Fryc, Sarah; Williams, Christina; Avery, Anne C.; Kim, Jong-Hyuk; Barber, Lisa; Burgess, Kristine; Lander, Eric S.; Karlsson, Elinor K.; Azuma, Chieko
Dogs, with their breed-determined limited genetic background, are great models of human disease including cancer. Canine B-cell lymphoma and hemangiosarcoma are both malignancies of the hematologic system that are clinically and histologically similar to human B-cell non-Hodgkin lymphoma and angiosarcoma, respectively. Golden retrievers in the US show significantly elevated lifetime risk for both B-cell lymphoma (6%) and hemangiosarcoma (20%). We conducted genome-wide association studies for hemangiosarcoma and B-cell lymphoma, identifying two shared predisposing loci. The two associated loci are located on chromosome 5, and together contribute ~20% of the risk of developing these cancers. Genome-wide p-values for the top SNP of each locus are 4.6×10-7 and 2.7×10-6, respectively. Whole genome resequencing of nine cases and controls followed by genotyping and detailed analysis identified three shared and one B-cell lymphoma specific risk haplotypes within the two loci, but no coding changes were associated with the risk haplotypes. Gene expression analysis of B-cell lymphoma tumors revealed that carrying the risk haplotypes at the first locus is associated with down-regulation of several nearby genes including the proximal gene TRPC6, a transient receptor Ca2+-channel involved in T-cell activation, among other functions. The shared risk haplotype in the second locus overlaps the vesicle transport and release gene STX8. Carrying the shared risk haplotype is associated with gene expression changes of 100 genes enriched for pathways involved in immune cell activation. Thus, the predisposing germ-line mutations in B-cell lymphoma and hemangiosarcoma appear to be regulatory, and affect pathways involved in T-cell mediated immune response in the tumor. This suggests that the interaction between the immune system and malignant cells plays a common role in the tumorigenesis of these relatively different cancers. PMID:25642983
Full Text Available BACKGROUND: Venous Thrombosis (VT is a common multifactorial disease associated with a major public health burden. Genetics factors are known to contribute to the susceptibility of the disease but how many genes are involved and their contribution to VT risk still remain obscure. We aimed to identify genetic variants associated with VT risk. METHODOLOGY/PRINCIPAL FINDINGS: We conducted a genome-wide association study (GWAS based on 551,141 SNPs genotyped in 1,542 cases and 1,110 controls. Twelve SNPs reached the genome-wide significance level of 2.0×10(-8 and encompassed four known VT-associated loci, ABO, F5, F11 and FGG. By means of haplotype analyses, we also provided novel arguments in favor of a role of HIVEP1, PROCR and STAB2, three loci recently hypothesized to participate in the susceptibility to VT. However, no novel VT-associated loci came out of our GWAS. Using a recently proposed statistical methodology, we also showed that common variants could explain about 35% of the genetic variance underlying VT susceptibility among which 3% could be attributable to the main identified VT loci. This analysis additionally suggested that the common variants left to be identified are not uniformly distributed across the genome and that chromosome 20, itself, could contribute to ∼7% of the total genetic variance. CONCLUSIONS/SIGNIFICANCE: This study might also provide a valuable source of information to expand our understanding of biological mechanisms regulating quantitative biomarkers for VT.
Nannya, Yasuhito; Taura, Kenjiro; Kurokawa, Mineo; Chiba, Shigeru; Ogawa, Seishi
With recent advances in high-throughput single nucleotide polymorphism (SNP) typing technologies, genome-wide association studies have become a realistic approach to identify the causative genes that are responsible for common diseases of complex genetic traits. In this strategy, a trade-off between the increased genome coverage and a chance of finding SNPs incidentally showing a large statistics becomes serious due to extreme multiple-hypothesis testing. We investigated the extent to which this trade-off limits the genome-wide power with this approach by simulating a large number of case-control panels based on the empirical data from the HapMap Project. In our simulations, statistical costs of multiple hypothesis testing were evaluated by empirically calculating distributions of the maximum value of the chi(2) statistics for a series of marker sets having increasing numbers of SNPs, which were used to determine a genome-wide threshold in the following power simulations. With a practical study size, the cost of multiple testing largely offsets the potential benefits from increased genome coverage given modest genetic effects and/or low frequencies of causal alleles. In most realistic scenarios, increasing genome coverage becomes less influential on the power, while sample size is the predominant determinant of the feasibility of genome-wide association tests. Increasing genome coverage without corresponding increase in sample size will only consume resources without little gain in power. For common causal alleles with relatively large effect sizes [genotype relative risk > or =1.7], we can expect satisfactory power with currently available large-scale genotyping platforms using realistic sample size ( approximately 1000 per arm).
Full Text Available Abstract Background RNA interference (RNAi has been seen as a revolution in functional genomics and system biology. Genome-wide RNAi research relies on the development of RNAi high-throughput screening (HTS assays. One of the most fundamental challenges in RNAi HTS is to glean biological significance from mounds of data, which relies on the development of effective analytic methods for selecting effective small interfering RNAs (siRNAs. Findings Based on a recently proposed parameter, strictly standardized mean difference (SSMD, I propose an analytic method for genome-wide screens of effective siRNAs through assessing and testing the size of siRNA effects. Central to this method is the capability of SSMD in quantifying siRNA effects. This method has relied on normal approximation, which works only in the primary screens but not in the confirmatory screens. In this paper, I explore the non-central t-distribution property of SSMD estimates and use this property to extend the SSMD-based method so that it works effectively in either primary or confirmatory screens as well as in any HTS screens with or without replicates. The SSMD-based method maintains a balanced control of false positives and false negatives. Conclusion The central interest in genome-wide RNAi research is the selection of effective siRNAs which relies on the development of analytic methods to measure the size of siRNA effects. The new analytic method for hit selection provided in this paper offers a good analytic tool for selecting effective siRNAs, better than current analytic methods, and thus may have broad utility in genome-wide RNAi research.
Pandey, Rajan; Mohmmed, Asif; Pierrot, Christine; Khalife, Jamal; Malhotra, Pawan; Gupta, Dinesh
Eukaryotic cellular machineries are intricately regulated by several molecular mechanisms involving transcriptional control, post-translational control and post-translational modifications of proteins (PTMs...
Hoeffding, Louise K; Rosengren, Anders; Thygesen, Johan H; Schmock, Henriette; Werge, Thomas; Hansen, Thomas
Epidemiological studies have documented higher than expected comorbidity (or, in some cases, inverse comorbidity) between schizophrenia and several autoimmune disorders. It remains unknown whether this comorbidity reflects shared genetic susceptibility loci. The present study aimed to investigate whether verified genome wide significant variants of autoimmune disorders confer risk of schizophrenia, which could suggest a common genetic basis. Seven hundred and fourteen genome wide significant risk variants of 25 autoimmune disorders were extracted from the NHGRI GWAS catalogue and examined for association to schizophrenia in the Psychiatric Genomics Consortium schizophrenia GWAS samples (36,989 cases and 113,075 controls). Two independent loci at 4q24 and 6p21.32-33 originally identified from GWAS of autoimmune diseases were found genome wide associated with schizophrenia (1.7 × 10(-8 )≥( )p ≥ 4.0 × 10(-21)). While these observations confirm the existence of shared genetic susceptibility loci between schizophrenia and autoimmune diseases, the findings did not show a significant enrichment. The findings do not support a genetic overlap in common SNPs between autoimmune diseases and schizophrenia that in part could explain the observed comorbidity from epidemiological studies.
Nguyen, Thanh-Tung; Huang, Joshua; Wu, Qingyao; Nguyen, Thuy; Li, Mark
Single-nucleotide polymorphisms (SNPs) selection and identification are the most important tasks in Genome-wide association data analysis. The problem is difficult because genome-wide association data is very high dimensional and a large portion of SNPs in the data is irrelevant to the disease. Advanced machine learning methods have been successfully used in Genome-wide association studies (GWAS) for identification of genetic variants that have relatively big effects in some common, complex diseases. Among them, the most successful one is Random Forests (RF). Despite of performing well in terms of prediction accuracy in some data sets with moderate size, RF still suffers from working in GWAS for selecting informative SNPs and building accurate prediction models. In this paper, we propose to use a new two-stage quality-based sampling method in random forests, named ts-RF, for SNP subspace selection for GWAS. The method first applies p-value assessment to find a cut-off point that separates informative and irrelevant SNPs in two groups. The informative SNPs group is further divided into two sub-groups: highly informative and weak informative SNPs. When sampling the SNP subspace for building trees for the forest, only those SNPs from the two sub-groups are taken into account. The feature subspaces always contain highly informative SNPs when used to split a node at a tree. This approach enables one to generate more accurate trees with a lower prediction error, meanwhile possibly avoiding overfitting. It allows one to detect interactions of multiple SNPs with the diseases, and to reduce the dimensionality and the amount of Genome-wide association data needed for learning the RF model. Extensive experiments on two genome-wide SNP data sets (Parkinson case-control data comprised of 408,803 SNPs and Alzheimer case-control data comprised of 380,157 SNPs) and 10 gene data sets have demonstrated that the proposed model significantly reduced prediction errors and outperformed
Sanders, Jennifer; Samuelson, David J
Human population-based genome-wide association (GWA) studies identify low penetrance breast cancer risk alleles; however, GWA studies alone do not definitively determine causative genes or mechanisms. Stringent genome- wide statistical significance level requirements, set to avoid false-positive associations, yield many false-negative associations. Laboratory rats (Rattus norvegicus) are useful to study many aspects of breast cancer, including genetic susceptibility. Several rat mammary cancer associated loci have been identified using genetic linkage and congenic strain based-approaches. Here, we sought to determine the amount of overlap between GWA study nominated human breast and rat mammary cancer susceptibility loci. We queried published GWA studies to identify two groups of SNPs, one that reached genome-wide significance and one comprised of SNPs failing a validation step and not reaching genome- wide significance. Human genome locations of these SNPs were compared to known rat mammary carcinoma susceptibility loci to determine if risk alleles existed in both species. Rat genome regions not known to associate with mammary cancer risk were randomly selected as control regions. Significantly more human breast cancer risk GWA study nominated SNPs mapped at orthologs of rat mammary cancer loci than to regions not known to contain rat mammary cancer loci. The rat genome was useful to predict associations that had met human genome-wide significance criteria and weaker associations that had not. Integration of human and rat comparative genomics may be useful to parse out false-negative associations in GWA studies of breast cancer risk.
Johnson, M; Brennecke, S; Iversen, A-C; East, C; Olsen, G; Kent, J; Dyer, T; Said, J; Roten, L; Abraham, L; Zwart, J-A; Winsvold, B; Håberg, A; Huentelman, M; Krokan, H; Gabrielsen, M; Austgulen, R; Blangero, J; Moses, E
We have successfully utilized a family-based study design to localize several positional candidate preeclampsia susceptibility genes to chromosomes 2q22(ACVR2A,LCT,LRP1B,RND3,GCA),5q (ERAP2) and 13q(TNFSF13B). We now report on our continued positional cloning efforts using an alternative genome-wide association (GWA) mapping strategy in large Caucasian case-control cohorts from Australia and Norway. To identify maternal genetic risk loci for preeclampsia. The unrelated Australian samples (545 cases,547 controls) were genotyped using Illumina BeadChip technology (700K loci) and have been analyzed using PLINK. All unrelated Norwegian samples were genotyped across several Illumina BeadChip substrates and consist of 847 cases (700K loci) and 638 controls. The Norwegian control samples originate from other HUNT studies pertaining to migraine (n=95,700K loci), lung cancer (n=89,370K loci) and normal brain pathology (n=454,2.5M loci). To analyze a concordant set of 2.5-3 million genotypes across all Norwegian samples we are currently using MaCH to impute those loci not directly genotyped. The Norwegian GWA data will be analyzed in SOLAR utilizing empirical kinship estimates to account for any distant relatedness. 1078 Australian samples (538 cases,540 controls) and 648, 175 SNPs passed our quality control metrics. Two SNP associations (rs7579169,p=3.6×10(-7); rs12711941,p=4.3×10(-7)) satisfied our genome-wide significant threshold (p1.56) and are in strong linkage disequilibrium with each other (r(2)>0.9) but not with any other genotyped SNP ±200kb. The analysis of the Norwegian GWAS is underway. The Australian GWAS has identified a novel preeclampsia risk locus on chromosome 2q. The INHBB gene closest to our SNP associations is a plausible positional candidate susceptibility gene. There is a substantive body of evidence implicating inhibins, activins and other members of the TGF-βsuperfamily to have a role in the development of preeclampsia. The biological connection
Ramachandran, Dhanya; Zeng, Zhen; Locke, Adam E; Mulle, Jennifer G; Bean, Lora J H; Rosser, Tracie C; Dooley, Kenneth J; Cua, Clifford L; Capone, George T; Reeves, Roger H; Maslen, Cheryl L; Cutler, David J; Feingold, Eleanor; Sherman, Stephanie L; Zwick, Michael E
.... We performed a genome-wide association study using logistic regression analysis on 452 individuals with Down syndrome, consisting of 210 cases with complete atrioventricular septal defects and 242...
Chen, Ying; Lee, Siu F.; Blanc, Eric; Reuter, Caroline; Wertheim, Bregje; Martinez-Diaz, Pedro; Hoffmann, Ary A.; Partridge, Linda
Clinal variation in quantitative traits is widespread, but its genetic basis awaits identification. Drosophila melanogaster shows adaptive, clinal variation in traits such as body size along latitudinal gradients on multiple continents. To investigate genome wide transcription differentiation
Martin, Simon H; Dasmahapatra, Kanchon K; Nadeau, Nicola J; Salazar, Camilo; Walters, James R; Simpson, Fraser; Blaxter, Mark; Manica, Andrea; Mallet, James; Jiggins, Chris D
Most speciation events probably occur gradually, without complete and immediate reproductive isolation, but the full extent of gene flow between diverging species has rarely been characterized on a genome-wide scale...
Kim, Deog Kyeom; Cho, Michael H; Hersh, Craig P
Rationale: A genome-wide association study (GWAS) for circulating chronic obstructive pulmonary disease (COPD) biomarkers could identify genetic determinants of biomarker levels and COPD susceptibility. Objectives: To identify genetic variants of circulating protein biomarkers and novel genetic...
Gretarsdottir, Solveig; Baas, Annette F.; Thorleifsson, Gudmar; Holm, Hilma; den Heijer, Martin; de Vries, Jean-Paul P. M.; Kranendonk, Steef E.; Zeebregts, Clark J. A. M.; van Sterkenburg, Steven M.; Geelkerken, Robert H.; van Rij, Andre M.; Williams, Michael J. A.; Boll, Albert P. M.; Kostic, Jelena P.; Jonasdottir, Adalbjorg; Jonasdottir, Aslaug; Walters, G. Bragi; Masson, Gisli; Sulem, Patrick; Saemundsdottir, Jona; Mouy, Magali; Magnusson, Kristinn P.; Tromp, Gerard; Elmore, James R.; Sakalihasan, Natzi; Limet, Raymond; Defraigne, Jean-Olivier; Ferrell, Robert E.; Ronkainen, Antti; Ruigrok, Ynte M.; Wijmenga, Cisca; Grobbee, Diederick E.; Shah, Svati H.; Granger, Christopher B.; Quyyumi, Arshed A.; Vaccarino, Viola; Patel, Riyaz S.; Zafari, A. Maziar; Levey, Allan I.; Austin, Harland; Girelli, Domenico; Pignatti, Pier Franco; Olivieri, Oliviero; Martinelli, Nicola; Malerba, Giovanni; Trabetti, Elisabetta; Becker, Lewis C.; Becker, Diane M.; Reilly, Muredach P.; Rader, Daniel J.; Mueller, Thomas; Dieplinger, Benjamin; Haltmayer, Meinhard; Urbonavicius, Sigitas; Lindblad, Bengt; Gottsater, Anders; Gaetani, Eleonora; Pola, Roberto; Wells, Philip; Rodger, Marc; Forgie, Melissa; Langlois, Nicole; Corral, Javier; Vicente, Vicente; Fontcuberta, Jordi; Espana, Francisco; Grarup, Niels; Jorgensen, Torben; Witte, Daniel R.; Hansen, Torben; Pedersen, Oluf; Aben, Katja K.; de Graaf, Jacqueline; Holewijn, Suzanne; Folkersen, Lasse; Franco-Cereceda, Anders; Eriksson, Per; Collier, David A.; Stefansson, Hreinn; Steinthorsdottir, Valgerdur; Rafnar, Thorunn; Valdimarsson, Einar M.; Magnadottir, Hulda B.; Sveinbjornsdottir, Sigurlaug; Olafsson, Isleifur; Magnusson, Magnus Karl; Palmason, Robert; Haraldsdottir, Vilhelmina; Andersen, Karl; Onundarson, Pall T.; Thorgeirsson, Gudmundur; Kiemeney, Lambertus A.; Powell, Janet T.; Carey, David J.; Kuivaniemi, Helena; Lindholt, Jes S.; Jones, Gregory T.; Kong, Augustine; Blankensteijn, Jan D.; Matthiasson, Stefan E.; Thorsteinsdottir, Unnur; Stefansson, Kari
We performed a genome-wide association study on 1,292 individuals with abdominal aortic aneurysms (AAAs) and 30,503 controls from Iceland and The Netherlands, with a follow-up of top markers in up to 3,267 individuals with AAAs and 7,451 controls. The A allele of rs7025486 on 9q33 was found to
Gretarsdottir, Solveig; Baas, Annette F; Thorleifsson, Gudmar
We performed a genome-wide association study on 1,292 individuals with abdominal aortic aneurysms (AAAs) and 30,503 controls from Iceland and The Netherlands, with a follow-up of top markers in up to 3,267 individuals with AAAs and 7,451 controls. The A allele of rs7025486 on 9q33 was found to as...
Purdue, Mark P.; Johansson, Mattias; Zelenika, Diana; Toro, Jorge R.; Scelo, Ghislaine; Moore, Lee E.; Prokhortchouk, Egor; Wu, Xifeng; Kiemeney, Lambertus A.; Gaborieau, Valerie; Jacobs, Kevin B.; Chow, Wong-Ho; Zaridze, David; Matveev, Vsevolod; Lubinski, Jan; Trubicka, Joanna; Szeszenia-Dabrowska, Neonila; Lissowska, Jolanta; Rudnai, Peter; Fabianova, Eleonora; Bucur, Alexandru; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Boffetta, Paolo; Colt, Joanne S.; Davis, Faith G.; Schwartz, Kendra L.; Banks, Rosamonde E.; Selby, Peter J.; Harnden, Patricia; Berg, Christine D.; Hsing, Ann W.; Grubb, Robert L.; Boeing, Heiner; Vineis, Paolo; Clavel-Chapelon, Francoise; Palli, Domenico; Tumino, Rosario; Krogh, Vittorio; Panico, Salvatore; Duell, Eric J.; Quiros, Jose Ramon; Sanchez, Maria-Jose; Navarro, Carmen; Ardanaz, Eva; Dorronsoro, Miren; Khaw, Kay-Tee; Allen, Naomi E.; Bueno-de-Mesquita, H. Bas; Peeters, Petra H. M.; Trichopoulos, Dimitrios; Linseisen, Jakob; Ljungberg, Borje; Overvad, Kim; Tjonneland, Anne; Romieu, Isabelle; Riboli, Elio; Mukeria, Anush; Shangina, Oxana; Stevens, Victoria L.; Thun, Michael J.; Diver, W. Ryan; Gapstur, Susan M.; Pharoah, Paul D.; Easton, Douglas F.; Albanes, Demetrius; Weinstein, Stephanie J.; Virtamo, Jarmo; Vatten, Lars; Hveem, Kristian; Njolstad, Inger; Tell, Grethe S.; Stoltenberg, Camilla; Kumar, Rajiv; Koppova, Kvetoslava; Cussenot, Olivier; Benhamou, Simone; Oosterwijk, Egbert; Vermeulen, Sita H.; Aben, Katja K. H.; van der Marel, Saskia L.; Ye, Yuanqing; Wood, Christopher G.; Pu, Xia; Mazur, Alexander M.; Boulygina, Eugenia S.; Chekanov, Nikolai N.; Foglio, Mario; Lechner, Doris; Gut, Ivo; Heath, Simon; Blanche, Helene; Hutchinson, Amy; Thomas, Gilles; Wang, Zhaoming; Yeager, Meredith; Fraumeni, Joseph F.; Skryabin, Konstantin G.; McKay, James D.; Rothman, Nathaniel; Chanock, Stephen J.; Lathrop, Mark; Brennan, Paul
We conducted a two-stage genome-wide association study of renal cell carcinoma (RCC) in 3,772 affected individuals (cases) and 8,505 controls of European background from 11 studies and followed up 6 SNPs in 3 replication studies of 2,198 cases and 4,918 controls. Two loci on the regions of 2p21 and
IRVINE, ALAN; CORVIN, AIDEN; MORRIS, DEREK
PUBLISHED Export Date: 3 March 2015 Atopic dermatitis and psoriasis are the two most common immune-mediated inflammatory disorders affecting the skin. Genome-wide studies demonstrate a high degree of genetic overlap, but these diseases have mutually exclusive clinical phenotypes and opposing im- mune mechanisms. Despite their prevalence, atopic dermatitis and psoriasis very rarely co-occur within one individual. By utilizing genome-wide association study and ImmunoChip data fro...
Vink, Jacqueline M; Smit, August B; de Geus, Eco J C
For the identification of genes associated with smoking initiation and current smoking, genome-wide association analyses were carried out in 3497 subjects. Significant genes that replicated in three independent samples (n = 405, 5810, and 1648) were visualized into a biologically meaningful network......) and cell-adhesion molecules (e.g., CDH23). We conclude that a network-based genome-wide association approach can identify genes influencing smoking behavior....
Villanueva, Pia; Newbury, Dianne F; Jara, Lilian; De Barbieri, Zulema; Mirza, Ghazala; Palomino, Hernán M; Fernández, María Angélica; Cazier, Jean-Baptiste; Monaco, Anthony P; Palomino, Hernán
Specific language impairment (SLI) is an unexpected deficit in the acquisition of language skills and affects between 5 and 8% of pre-school children. Despite its prevalence and high heritability, our understanding of the aetiology of this disorder is only emerging. In this paper, we apply genome-wide techniques to investigate an isolated Chilean population who exhibit an increased frequency of SLI. Loss of heterozygosity (LOH) mapping and parametric and non-parametric linkage analyses indicate that complex genetic factors are likely to underlie susceptibility to SLI in this population. Across all analyses performed, the most consistently implicated locus was on chromosome 7q. This locus achieved highly significant linkage under all three non-parametric models (max NPL=6.73, P=4.0 × 10−11). In addition, it yielded a HLOD of 1.24 in the recessive parametric linkage analyses and contained a segment that was homozygous in two affected individuals. Further, investigation of this region identified a two-SNP haplotype that occurs at an increased frequency in language-impaired individuals (P=0.008). We hypothesise that the linkage regions identified here, in particular that on chromosome 7, may contain variants that underlie the high prevalence of SLI observed in this isolated population and may be of relevance to other populations affected by language impairments. PMID:21248734
Villanueva, Pia; Newbury, Dianne F; Jara, Lilian; De Barbieri, Zulema; Mirza, Ghazala; Palomino, Hernán M; Fernández, María Angélica; Cazier, Jean-Baptiste; Monaco, Anthony P; Palomino, Hernán
Specific language impairment (SLI) is an unexpected deficit in the acquisition of language skills and affects between 5 and 8% of pre-school children. Despite its prevalence and high heritability, our understanding of the aetiology of this disorder is only emerging. In this paper, we apply genome-wide techniques to investigate an isolated Chilean population who exhibit an increased frequency of SLI. Loss of heterozygosity (LOH) mapping and parametric and non-parametric linkage analyses indicate that complex genetic factors are likely to underlie susceptibility to SLI in this population. Across all analyses performed, the most consistently implicated locus was on chromosome 7q. This locus achieved highly significant linkage under all three non-parametric models (max NPL = 6.73, P = 4.0 × 10(-11)). In addition, it yielded a HLOD of 1.24 in the recessive parametric linkage analyses and contained a segment that was homozygous in two affected individuals. Further, investigation of this region identified a two-SNP haplotype that occurs at an increased frequency in language-impaired individuals (P = 0.008). We hypothesise that the linkage regions identified here, in particular that on chromosome 7, may contain variants that underlie the high prevalence of SLI observed in this isolated population and may be of relevance to other populations affected by language impairments.
Grant, Struan F A; Hakonarson, Hakon
there is a revolution occurring in single nucleotide polymorphism (SNP) genotyping technology, with high-throughput methods now allowing large numbers of SNPs (10(5)-10(6)) to be genotyped in large cohort studies. This has enabled large-scale genome-wide association (GWA) studies in complex diseases, such as diabetes, asthma, and inflammatory bowel disease, to be undertaken for the first time. the GWA approach serves the critical need for a comprehensive and unbiased strategy to identify causal genes related to complex disease, and is rapidly replacing the more traditional candidate gene studies and microsatellite-based linkage mapping approaches that have dominated gene discovery attempts for common diseases. As a consequence of employing array-based technologies, over the last 3 years dramatic discoveries of key variants involved in multiple complex diseases and related traits have been reported in the top scientific literature and, most importantly, have been largely replicated by independent investigator groups. As a consequence, several novel genes have been identified, most notably in the metabolic, cardiovascular, autoimmune, and oncology disease areas, that are clearly rooted in the biology of these disorders. These discoveries have opened up new avenues for investigators to address novel molecular pathways that were not previously linked to or thought of in relation with these diseases. this review provides a synopsis of recent advances and what we may expect to still emerge from this field.
Cuypers, Bart; Van den Broeck, Frederik; Van Reet, Nick; Meehan, Conor J; Cauchard, Julien; Wilkes, Jonathan M; Claes, Filip; Goddeeris, Bruno; Birhanu, Hadush; Dujardin, Jean-Claude; Laukens, Kris; Büscher, Philippe; Deborggraeve, Stijn
Trypanosomes cause a variety of diseases in man and domestic animals in Africa, Latin America, and Asia. In the Trypanozoon subgenus, Trypanosoma brucei gambiense and Trypanosoma brucei rhodesiense cause human African trypanosomiasis, whereas Trypanosoma brucei brucei, Trypanosoma evansi, and Trypanosoma equiperdum are responsible for nagana, surra, and dourine in domestic animals, respectively. The genetic relationships between T. evansi and T. equiperdum and other Trypanozoon species remain unclear because the majority of phylogenetic analyses has been based on only a few genes. In this study, we have conducted a phylogenetic analysis based on genome-wide SNP analysis comprising 56 genomes from the Trypanozoon subgenus. Our data reveal that T. equiperdum has emerged at least once in Eastern Africa and T. evansi at two independent occasions in Western Africa. The genomes within the T. equiperdum and T. evansi monophyletic clusters show extremely little variation, probably due to the clonal spread linked to the independence from tsetse flies for their transmission. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Broeckx, Bart J G; Derrien, Thomas; Mottier, Stéphanie; Wucher, Valentin; Cadieu, Edouard; Hédan, Benoît; Le Béguec, Céline; Botherel, Nadine; Lindblad-Toh, Kerstin; Saunders, Jimmy H; Deforce, Dieter; André, Catherine; Peelman, Luc; Hitte, Christophe
Genome-wide association studies (GWAS) are widely used to identify loci associated with phenotypic traits in the domestic dog that has emerged as a model for Mendelian and complex traits. However, a disadvantage of GWAS is that it always requires subsequent fine-mapping or sequencing to pinpoint causal mutations. Here, we performed whole exome sequencing (WES) and canine high-density (cHD) SNP genotyping of 28 dogs from 3 breeds to compare the SNP and linkage disequilibrium characteristics together with the power and mapping precision of exome-guided GWAS (EG-GWAS) versus cHD-based GWAS. Using simulated phenotypes, we showed that EG-GWAS has a higher power than cHD to detect associations within target regions and less power outside target regions, with power being influenced further by sample size and SNP density. We analyzed two real phenotypes (hair length and furnishing), that are fixed in certain breeds to characterize mapping precision of the known causal mutations. EG-GWAS identified the associated exonic and 3'UTR variants within the FGF5 and RSPO2 genes, respectively, with only a few samples per breed. In conclusion, we demonstrated that EG-GWAS can identify loci associated with Mendelian phenotypes both within and across breeds.
The single-nucleotide polymorphism (SNP) rs10503253, located within the CUB and Sushi multiple domains-1 (CSMD1) gene on 8p23.2, was recently identified as genome-wide significant for schizophrenia (SZ), but is of unknown function. We investigated the neurocognitive effects of this CSMD1 variant in vivo in patients and healthy participants using behavioral and imaging measures of brain structure and function. We compared carriers and non-carriers of the risk \\'A\\' allele on measures of neuropsychological performance typically impaired in SZ (general cognitive ability, episodic and working memory and attentional control) in independent samples of Irish patients (n = 387) and controls (n = 171) and German patients (205) and controls (n = 533). Across these groups, the risk \\'A\\' allele at CSMD1 was associated with deleterious effects across a number of neurocognitive phenotypes. Specifically, the risk allele was associated with poorer performance on neuropsychological measures of general cognitive ability and memory function but not attentional control. These effects, while significant, were subtle, and varied between samples. Consistent with previous evidence suggesting that CSMD1 may be involved in brain mechanisms related to memory and learning, these data appear to reflect the deleterious effects of the identified \\'A\\' risk allele on neurocognitive function, possibly as part of the mechanism by which CSMD1 is associated with SZ risk.
Liu, Jinglan; Zhang, Zhe; Bando, Masashige; Itoh, Takehiko; Deardorff, Matthew A.; Li, Jennifer R.; Clark, Dinah; Kaur, Maninder; Tatsuro, Kondo; Kline, Antonie D.; Chang, Celia; Vega, Hugo; Jackson, Laird G.; Spinner, Nancy B.; Shirahige, Katsuhiko; Krantz, Ian D.
The cohesin complex has recently been shown to be a key regulator of eukaryotic gene expression, although the mechanisms by which it exerts its effects are poorly understood. We have undertaken a genome-wide analysis of DNA methylation in cohesin-deficient cell lines from probands with Cornelia de Lange syndrome (CdLS). Heterozygous mutations in NIPBL, SMC1A and SMC3 genes account for ∼65% of individuals with CdLS. SMC1A and SMC3 are subunits of the cohesin complex that controls sister chromatid cohesion, whereas NIPBL facilitates cohesin loading and unloading. We have examined the methylation status of 27 578 CpG dinucleotides in 72 CdLS and control samples. We have documented the DNA methylation pattern in human lymphoblastoid cell lines (LCLs) as well as identified specific differential DNA methylation in CdLS. Subgroups of CdLS probands and controls can be classified using selected CpG loci. The X chromosome was also found to have a unique DNA methylation pattern in CdLS. Cohesin preferentially binds to hypo-methylated DNA in control LCLs, whereas the differential DNA methylation alters cohesin binding in CdLS. Our results suggest that in addition to DNA methylation multiple mechanisms may be involved in transcriptional regulation in human cells and in the resultant gene misexpression in CdLS. PMID:20448023
Full Text Available Aging of biological systems is controlled by various processes which have a potential impact on gene expression. Here we report a genome-wide transcriptome analysis of the fungal aging model Podospora anserina. Total RNA of three individuals of defined age were pooled and analyzed by SuperSAGE (serial analysis of gene expression. A bioinformatics analysis identified different molecular pathways to be affected during aging. While the abundance of transcripts linked to ribosomes and to the proteasome quality control system were found to decrease during aging, those associated with autophagy increase, suggesting that autophagy may act as a compensatory quality control pathway. Transcript profiles associated with the energy metabolism including mitochondrial functions were identified to fluctuate during aging. Comparison of wild-type transcripts, which are continuously down-regulated during aging, with those down-regulated in the long-lived, copper-uptake mutant grisea, validated the relevance of age-related changes in cellular copper metabolism. Overall, we (i present a unique age-related data set of a longitudinal study of the experimental aging model P. anserina which represents a reference resource for future investigations in a variety of organisms, (ii suggest autophagy to be a key quality control pathway that becomes active once other pathways fail, and (iii present testable predictions for subsequent experimental investigations.
Full Text Available Abstract Background Lymphocytes act as a major component of the adaptive immune system, taking very crucial responsibility for immunity. Differences in proportions of T-cell subpopulations in peripheral blood among individuals under same conditions provide evidence of genetic control on these traits, but little is known about the genetic mechanism of them, especially in swine. Identification of the genetic control on these variants may help the genetic improvement of immune capacity through selection. Results To identify genomic regions responsible for these immune traits in swine, a genome-wide association study was conducted. A total of 675 pigs of three breeds were involved in the study. At 21 days of age, all individuals were vaccinated with modified live classical swine fever vaccine. Blood samples were collected when the piglets were 20 and 35 days of age, respectively. Seven traits, including the proportions of CD4+, CD8+, CD4+CD8+, CD4+CD8−, CD4−CD8+, CD4−CD8− and the ratio of CD4+ to CD8+ T cells were measured at the two ages. All the samples were genotyped for 62,163 single nucleotide polymorphisms (SNP using the Illumina porcineSNP60k BeadChip. 40833 SNPs were selected after quality control for association tests between SNPs and each immune trait considered based on a single-locus regression model. To tackle the issue of multiple testing in GWAS, 10,000 permutations were performed to determine the chromosome-wise and genome-wise significance levels of association tests. In total, 61 SNPs with chromosome-wise significance level and 3 SNPs with genome-wise significance level were identified. 27 significant SNPs were located within the immune-related QTL regions reported in previous studies. Furthermore, several significant SNPs fell into the regions harboring known immunity-related genes, 14 of them fell into the regions which harbor some known T cell-related genes. Conclusions Our study demonstrated that genome-wide association
Orr, Nick; Lemnrau, Alina; Cooke, Rosie
We conducted a genome-wide association study of male breast cancer comprising 823 cases and 2,795 controls of European ancestry, with validation in independent sample sets totaling 438 cases and 474 controls. A SNP in RAD51B at 14q24.1 was significantly associated with male breast cancer risk (P...... = 3.02 × 10(-13); odds ratio (OR) = 1.57). We also refine association at 16q12.1 to a SNP within TOX3 (P = 3.87 × 10(-15); OR = 1.50)....
Sklar, Pamela; Ripke, Stephan; Laura J. Scott; Andreassen, Ole A; Cichon, Sven; Craddock, Nick; Edenberg, Howard J; Nurnberger, John I.; Rietschel, Marcella; Blackwood, Douglas; Corvin, Aiden; Flickinger, Matthew; Guan, Weihua; Mattingsdal, Morten; McQuillin, Andrew
We conducted a combined genome-wide association (GWAS) analysis of 7,481 individuals affected with bipolar disorder and 9,250 control individuals within the Psychiatric Genomewide Association Study Consortium Bipolar Disorder group (PGC-BD). We performed a replication study in which we tested 34 independent SNPs in 4,493 independent bipolar disorder cases and 42,542 independent controls and found strong evidence for replication. In the replication sample, 18 of 34 SNPs had P value < 0.05, and...
Cichon, Sven; Mühleisen, Thomas W.; Degenhardt, Franziska A.; Mattheisen, Manuel; Miró, Xavier; Strohmaier, Jana; Steffens, Michael; Meesters, Christian; Herms, Stefan; Weingarten, Moritz; Priebe, Lutz; Haenisch, Britta; Alexander, Michael; Vollmer, Jennifer; Breuer, René
We conducted a genome-wide association study (GWAS) and a follow-up study of bipolar disorder (BD), a common neuropsychiatric disorder. In the GWAS, we investigated 499,494 autosomal and 12,484 X-chromosomal SNPs in 682 patients with BD and in 1300 controls. In the first follow-up step, we tested the most significant 48 SNPs in 1729 patients with BD and in 2313 controls. Eight SNPs showed nominally significant association with BD and were introduced to a meta-analysis of the GWAS and the firs...
Choudhry, Shweta; Deshpande, Archana; Qiao, Liang; Beckman, Kenneth; Sen, Saunak; Baskin, Laurence S.
Purpose Hypospadias is one of the most frequent genital malformations in the male newborn, and results from abnormal penile and urethral development. The etiology of hypospadias remains largely unknown despite intensive investigations. Fetal androgens have a crucial role in genital differentiation. Recent studies have suggested that molecular mechanisms that underlie the effects of androgens on the fetus may involve disruption of epigenetic programming of gene expression during development. We assessed whether epigenetic modification of DNA methylation is associated with hypospadias in a case-control study of 12 hypospadias and 8 control subjects. Materials and Methods Genome-wide DNA methylation profiling was performed on the study subjects using the Illumina Infinium® HumanMethylation450 Bead-Chip, which enables the direct investigation of methylation status of more than 485,000 individual CpG sites throughout the genome. The methylation level at each CpG site was compared between cases and controls using the t test and logistic regression. Results We identified 14 CpG sites that were associated with hypospadias with p hypospadias using a unique and novel epigenetic approach. Our findings suggest DNA methylation patterns are useful in identifying new genes such as SCARB1 and MYBPH that may be involved in the etiology of hypospadias. PMID:22906644
Foo, Jia Nee; Tan, Louis C; Irwan, Ishak D; Au, Wing-Lok; Low, Hui Qi; Prakash, Kumar-M; Ahmad-Annuar, Azlina; Bei, Jinxin; Chan, Anne Yy; Chen, Chiung Mei; Chen, Yi-Chun; Chung, Sun Ju; Deng, Hao; Lim, Shen-Yang; Mok, Vincent; Pang, Hao; Pei, Zhong; Peng, Rong; Shang, Hui-Fang; Song, Kyuyoung; Tan, Ai Huey; Wu, Yih-Ru; Aung, Tin; Cheng, Ching-Yu; Chew, Fook Tim; Chew, Soo-Hong; Chong, Siow-Ann; Ebstein, Richard P; Lee, Jimmy; Saw, Seang-Mei; Seow, Adeline; Subramaniam, Mythily; Tai, E-Shyong; Vithana, Eranga N; Wong, Tien-Yin; Heng, Khai Koon; Meah, Wee-Yang; Khor, Chiea Chuen; Liu, Hong; Zhang, Furen; Liu, Jianjun; Tan, Eng-King
Genome-wide association studies (GWAS) on Parkinson's disease (PD) have mostly been done in Europeans and Japanese. No study has been done in Han Chinese, which make up nearly a fifth of the world population. We conducted the first Han Chinese GWAS analysing a total of 22,729 subjects (5,125 PD cases and 17,604 controls) from Singapore, Hong Kong, Malaysia, Korea, mainland China and Taiwan. We performed imputation, merging and logistic regression analyses of 2,402,394 SNPs passing quality control filters in 779 PD cases, 13,227 controls, adjusted for the first three principal components. 90 SNPs with association P Asian PD. We also identified significant (P Asian-specific loci with large effects (OR > 1.45) on PD risk. Our results also demonstrate some differences in the genetic contribution to PD between Europeans and Asians. Further pan-ethnic meta-analysis with European GWAS cohorts may unravel new PD loci. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: email@example.com.
Hu, Nan; Wang, Chaoyu; Hu, Ying; Yang, Howard H; Giffen, Carol; Tang, Ze-Zhong; Han, Xiao-Yu; Goldstein, Alisa M; Emmert-Buck, Michael R; Buetow, Kenneth H; Taylor, Philip R; Lee, Maxwell P
Whole genome association studies of complex human diseases represent a new paradigm in the postgenomic era. In this study, we report application of the Affymetrix, Inc. (Santa Clara, CA) high-density single nucleotide polymorphism (SNP) array containing 11,555 SNPs in a pilot case-control study of esophageal squamous cell carcinoma (ESCC) that included the analysis of germ line samples from 50 ESCC patients and 50 matched controls. The average genotyping call rate for the 100 samples analyzed was 96%. Using the generalized linear model (GLM) with adjustment for potential confounders and multiple comparisons, we identified 37 SNPs associated with disease, assuming a recessive mode of transmission; similarly, 48 SNPs were identified assuming a dominant mode and 53 SNPs in a continuous mode. When the 37 SNPs identified from the GLM recessive mode were used in a principal components analysis, the first principal component correctly predicted 46 of 50 cases and 47 of 50 controls. Among all the SNPs selected from GLMs for the three modes of transmission, 39 could be mapped to 1 of 33 genes. Many of these genes are involved in various cancers, including GASC1, shown previously to be amplified in ESCCs, and EPHB1 and PIK3C3. In conclusion, we have shown the feasibility of the Affymetrix 10K SNP array in genome-wide association studies of common cancers and identified new candidate loci to study in ESCC.
van Werven, F.J.
Transcription, the synthesis of RNA from a DNA template, is a well-controlled process. TATA binding protein (TBP) recruitment to promoters is essential for transcription by all three RNA polymerases, and often is the rate-limiting step of transcription initiation. TBP is incorporated into different
Kim, Kyung Won; Myers, Rachel A; Lee, Ji Hyun; Igartua, Catherine; Lee, Kyung Eun; Kim, Yoon Hee; Kim, Eun-Jin; Yoon, Dankyu; Lee, Joo-Shil; Hirota, Tomomitsu; Tamari, Mayumi; Takahashi, Atsushi; Kubo, Michiaki; Choi, Je-Min; Kim, Kyu-Earn; Nicolae, Dan L; Ober, Carole; Sohn, Myung Hyun
Atopic dermatitis (AD) is a heterogeneous chronic inflammatory skin disease. Most AD during infancy resolves during childhood, but moderate-to-severe AD with allergic sensitization is more likely to persist into adulthood and more often occurs with other allergic diseases. We sought to find susceptibility loci by performing the first genome-wide association study (GWAS) of AD in Korean children with recalcitrant AD, which was defined as moderate-to-severe AD with allergic sensitization. Our study included 246 children with recalcitrant AD and 551 adult control subjects with a negative history of both allergic disease and allergic sensitization. DNA from these subjects was genotyped; sets of common single nucleotide polymorphisms (SNPs) were imputed and used in the GWAS after quality control checks. SNPs at a region on 13q21.31 were associated with recalcitrant AD at a genome-wide threshold of significance (P < 2.0 × 10(-8)). These associated SNPs are more than 1 Mb from the closest gene, protocadherin (PCDH)9. SNPs at 4 additional loci had P values of less than 1 × 10(-6), including SNPs at or near the neuroblastoma amplified sequence (NBAS; 2p24.3), thymus-expressed molecule involved in selection (THEMIS; 6q22.33), GATA3 (10p14), and S-phase cyclin A-associated protein in the ER (SCAPER; 15q24.3) genes. Further analysis of total serum IgE levels suggested 13q21.31 might be primarily an IgE locus, and analyses of published data demonstrated that SNPs at the 15q24.3 region are expression quantitative trait loci for 2 nearby genes, ISL2 and proline-serine-threonine phosphatase interacting protein 1 (PSTPIP1), in immune cells. Our GWAS of recalcitrant AD identified new susceptibility regions containing genes involved in epithelial cell function and immune dysregulation, 2 key features of AD, and potentially extend our understanding of their role in pathogenesis. Copyright © 2015 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All
Saxena, Richa; Plenge, Robert M; Bjonnes, Andrew C; Dashti, Hassan S; Okada, Yukinori; Gad El Haq, Wessam; Hammoudeh, Mohammed; Al Emadi, Samar; Masri, Basel K; Halabi, Hussein; Badsha, Humeira; Uthman, Imad W; Margolin, Lauren; Gupta, Namrata; Mahfoud, Ziyad R; Kapiri, Marianthi; Dargham, Soha R; Aranki, Grace; Kazkaz, Layla A; Arayssi, Thurayya
Genetic factors underlying susceptibility to rheumatoid arthritis (RA) in Arab populations are largely unknown. This genome-wide association study (GWAS) was undertaken to explore the generalizability of previously reported RA loci to Arab subjects and to discover new Arab-specific genetic loci. The Genetics of Rheumatoid Arthritis in Some Arab States Study was designed to examine the genetics and clinical features of RA patients from Jordan, the Kingdom of Saudi Arabia, Lebanon, Qatar, and the United Arab Emirates. In total, >7 million single-nucleotide polymorphisms (SNPs) were tested for association with RA overall and with seropositive or seronegative RA in 511 RA cases and 352 healthy controls. In addition, replication of 15 signals was attempted in 283 RA cases and 221 healthy controls. A genetic risk score of 68 known RA SNPs was also examined in this study population. Three loci (HLA region, intergenic 5q13, and 17p13 at SMTNL2/GGT6) reached genome-wide significance in the analyses of association with RA and with seropositive RA, and for all 3 loci, evidence of independent replication was demonstrated. Consistent with the findings in European and East Asian populations, the association of RA with HLA-DRB1 amino acid position 11 conferred the strongest effect (P = 4.8 × 10(-16) ), and a weighted genetic risk score of previously associated RA loci was found to be associated with RA (P = 3.41 × 10(-5) ) and with seropositive RA (P = 1.48 × 10(-6) ) in this population. In addition, 2 novel associations specific to Arab populations were found at the 5q13 and 17p13 loci. This first RA GWAS in Arab populations confirms that established HLA-region and known RA risk alleles contribute strongly to the risk and severity of disease in some Arab groups, suggesting that the genetic architecture of RA is similar across ethnic groups. Moreover, this study identified 2 novel RA risk loci in Arabs, offering further population-specific insights into the
Schurink, Anouk; Wolc, Anna; Ducro, Bart J; Frankena, Klaas; Garrick, Dorian J; Dekkers, Jack C M; van Arendonk, Johan A M
Insect bite hypersensitivity is a common allergic disease in horse populations worldwide. Insect bite hypersensitivity is affected by both environmental and genetic factors. However, little is known about genes contributing to the genetic variance associated with insect bite hypersensitivity. Therefore, the aim of our study was to identify and quantify genomic associations with insect bite hypersensitivity in Shetland pony mares and Icelandic horses in the Netherlands. Data on 200 Shetland pony mares and 146 Icelandic horses were collected according to a matched case-control design. Cases and controls were matched on various factors (e.g. region, sire) to minimize effects of population stratification. Breed-specific genome-wide association studies were performed using 70 k single nucleotide polymorphisms genotypes. Bayesian variable selection method Bayes-C with a threshold model implemented in GenSel software was applied. A 1 Mb non-overlapping window approach that accumulated contributions of adjacent single nucleotide polymorphisms was used to identify associated genomic regions. The percentage of variance explained by all single nucleotide polymorphisms was 13% in Shetland pony mares and 28% in Icelandic horses. The 20 non-overlapping windows explaining the largest percentages of genetic variance were found on nine chromosomes in Shetland pony mares and on 14 chromosomes in Icelandic horses. Overlap in identified associated genomic regions between breeds would suggest interesting candidate regions to follow-up on. Such regions common to both breeds (within 15 Mb) were found on chromosomes 3, 7, 11, 20 and 23. Positional candidate genes within 2 Mb from the associated windows were identified on chromosome 20 in both breeds. Candidate genes are within the equine lymphocyte antigen class II region, which evokes an immune response by recognizing many foreign molecules. The genome-wide association study identified several genomic regions associated with insect bite
Full Text Available Abstract Background Insect bite hypersensitivity is a common allergic disease in horse populations worldwide. Insect bite hypersensitivity is affected by both environmental and genetic factors. However, little is known about genes contributing to the genetic variance associated with insect bite hypersensitivity. Therefore, the aim of our study was to identify and quantify genomic associations with insect bite hypersensitivity in Shetland pony mares and Icelandic horses in the Netherlands. Methods Data on 200 Shetland pony mares and 146 Icelandic horses were collected according to a matched case–control design. Cases and controls were matched on various factors (e.g. region, sire to minimize effects of population stratification. Breed-specific genome-wide association studies were performed using 70 k single nucleotide polymorphisms genotypes. Bayesian variable selection method Bayes-C with a threshold model implemented in GenSel software was applied. A 1 Mb non-overlapping window approach that accumulated contributions of adjacent single nucleotide polymorphisms was used to identify associated genomic regions. Results The percentage of variance explained by all single nucleotide polymorphisms was 13% in Shetland pony mares and 28% in Icelandic horses. The 20 non-overlapping windows explaining the largest percentages of genetic variance were found on nine chromosomes in Shetland pony mares and on 14 chromosomes in Icelandic horses. Overlap in identified associated genomic regions between breeds would suggest interesting candidate regions to follow-up on. Such regions common to both breeds (within 15 Mb were found on chromosomes 3, 7, 11, 20 and 23. Positional candidate genes within 2 Mb from the associated windows were identified on chromosome 20 in both breeds. Candidate genes are within the equine lymphocyte antigen class II region, which evokes an immune response by recognizing many foreign molecules. Conclusions The genome-wide association
Stewart, SE; Yu, D; Scharf, JM; Neale, BM; Fagerness, JA; Mathews, CA; Arnold, PD; Evans, PD; Gamazon, ER; Osiecki, L; McGrath, L; Haddad, S; Crane, J; Hezel, D; Illman, C
Obsessive-compulsive disorder (OCD) is a common, debilitating neuropsychiatric illness with complex genetic etiology. The International OCD Foundation Genetics Collaborative (IOCDF-GC) is a multi-national collaboration established to discover the genetic variation predisposing to OCD. A set of individuals affected with DSM-IV OCD, a subset of their parents, and unselected controls, were genotyped with several different Illumina SNP microarrays. After extensive data cleaning, 1465 cases, 5557 ...
Rucker, J J H; Breen, G; Pinto, D; Pedroso, I; Lewis, C M; Cohen-Woods, S; Uher, R; Schosser, A; Rivera, M; Aitchison, K J; Craddock, N; Owen, M J; Jones, L; Jones, I; Korszun, A; Muglia, P; Barnes, M R; Preisig, M; Mors, O; Gill, M; Maier, W; Rice, J; Rietschel, M; Holsboer, F; Farmer, A E; Craig, I W; Scherer, S W; McGuffin, P
Large, rare copy number variants (CNVs) have been implicated in a variety of psychiatric disorders, but the role of CNVs in recurrent depression is unclear. We performed a genome-wide analysis of large, rare CNVs in 3106 cases of recurrent depression, 459 controls screened for lifetime-absence of psychiatric disorder and 5619 unscreened controls from phase 2 of the Wellcome Trust Case Control Consortium (WTCCC2). We compared the frequency of cases with CNVs against the frequency observed in each control group, analysing CNVs over the whole genome, genic, intergenic, intronic and exonic regions. We found that deletion CNVs were associated with recurrent depression, whereas duplications were not. The effect was significant when comparing cases with WTCCC2 controls (P=7.7 × 10(-6), odds ratio (OR) =1.25 (95% confidence interval (CI) 1.13-1.37)) and to screened controls (P=5.6 × 10(-4), OR=1.52 (95% CI 1.20-1.93). Further analysis showed that CNVs deleting protein coding regions were largely responsible for the association. Within an analysis of regions previously implicated in schizophrenia, we found an overall enrichment of CNVs in our cases when compared with screened controls (P=0.019). We observe an ordered increase of samples with deletion CNVs, with the lowest proportion seen in screened controls, the next highest in unscreened controls and the highest in cases. This may suggest that the absence of deletion CNVs, especially in genes, is associated with resilience to recurrent depression.
Easton, Douglas F.; Pooley, Karen A.; Dunning, Alison M.; Pharoah, Paul D. P.; Thompson, Deborah; Ballinger, Dennis G.; Struewing, Jeffery P.; Morrison, Jonathan; Field, Helen; Luben, Robert; Wareham, Nicholas; Ahmed, Shahana; Healey, Catherine S.; Bowman, Richard; Meyer, Kerstin B.; Haiman, Christopher A.; Kolonel, Laurence K.; Henderson, Brian E.; Marchand, Loic Le; Brennan, Paul; Sangrajrang, Suleeporn; Gaborieau, Valerie; Odefrey, Fabrice; Shen, Chen-Yang; Wu, Pei-Ei; Wang, Hui-Chun; Eccles, Diana; Evans, D. Gareth; Peto, Julian; Fletcher, Olivia; Johnson, Nichola; Seal, Sheila; Stratton, Michael R.; Rahman, Nazneen; Chenevix-Trench, Georgia; Bojesen, Stig E.; Nordestgaard, Børge G.; Axelsson, Christen K.; Garcia-Closas, Montserrat; Brinton, Louise; Chanock, Stephen; Lissowska, Jolanta; Peplonska, Beata; Nevanlinna, Heli; Fagerholm, Rainer; Eerola, Hannaleena; Kang, Daehee; Yoo, Keun-Young; Noh, Dong-Young; Ahn, Sei-Hyun; Hunter, David J.; Hankinson, Susan E.; Cox, David G.; Hall, Per; Wedren, Sara; Liu, Jianjun; Low, Yen-Ling; Bogdanova, Natalia; Schürmann, Peter; Dörk, Thilo; Tollenaar, Rob A. E. M.; Jacobi, Catharina E.; Devilee, Peter; Klijn, Jan G. M.; Sigurdson, Alice J.; Doody, Michele M.; Alexander, Bruce H.; Zhang, Jinghui; Cox, Angela; Brock, Ian W.; MacPherson, Gordon; Reed, Malcolm W. R.; Couch, Fergus J.; Goode, Ellen L.; Olson, Janet E.; Meijers-Heijboer, Hanne; van den Ouweland, Ans; Uitterlinden, André; Rivadeneira, Fernando; Milne, Roger L.; Ribas, Gloria; Gonzalez-Neira, Anna; Benitez, Javier; Hopper, John L.; McCredie, Margaret; Southey, Melissa; Giles, Graham G.; Schroen, Chris; Justenhoven, Christina; Brauch, Hiltrud; Hamann, Ute; Ko, Yon-Dschun; Spurdle, Amanda B.; Beesley, Jonathan; Chen, Xiaoqing; Mannermaa, Arto; Kosma, Veli-Matti; Kataja, Vesa; Hartikainen, Jaana; Day, Nicholas E.; Cox, David R.; Ponder, Bruce A. J.; Luccarini, Craig; Conroy, Don; Shah, Mitul; Munday, Hannah; Jordan, Clare; Perkins, Barbara; West, Judy; Redman, Karen; Driver, Kristy; Aghmesheh, Morteza; Amor, David; Andrews, Lesley; Antill, Yoland; Armes, Jane; Armitage, Shane; Arnold, Leanne; Balleine, Rosemary; Begley, Glenn; Beilby, John; Bennett, Ian; Bennett, Barbara; Berry, Geoffrey; Blackburn, Anneke; Brennan, Meagan; Brown, Melissa; Buckley, Michael; Burke, Jo; Butow, Phyllis; Byron, Keith; Callen, David; Campbell, Ian; Chenevix-Trench, Georgia; Clarke, Christine; Colley, Alison; Cotton, Dick; Cui, Jisheng; Culling, Bronwyn; Cummings, Margaret; Dawson, Sarah-Jane; Dixon, Joanne; Dobrovic, Alexander; Dudding, Tracy; Edkins, Ted; Eisenbruch, Maurice; Farshid, Gelareh; Fawcett, Susan; Field, Michael; Firgaira, Frank; Fleming, Jean; Forbes, John; Friedlander, Michael; Gaff, Clara; Gardner, Mac; Gattas, Mike; George, Peter; Giles, Graham; Gill, Grantley; Goldblatt, Jack; Greening, Sian; Grist, Scott; Haan, Eric; Harris, Marion; Hart, Stewart; Hayward, Nick; Hopper, John; Humphrey, Evelyn; Jenkins, Mark; Jones, Alison; Kefford, Rick; Kirk, Judy; Kollias, James; Kovalenko, Sergey; Lakhani, Sunil; Leary, Jennifer; Lim, Jacqueline; Lindeman, Geoff; Lipton, Lara; Lobb, Liz; Maclurcan, Mariette; Mann, Graham; Marsh, Deborah; McCredie, Margaret; McKay, Michael; McLachlan, Sue Anne; Meiser, Bettina; Milne, Roger; Mitchell, Gillian; Newman, Beth; O'Loughlin, Imelda; Osborne, Richard; Peters, Lester; Phillips, Kelly; Price, Melanie; Reeve, Jeanne; Reeve, Tony; Richards, Robert; Rinehart, Gina; Robinson, Bridget; Rudzki, Barney; Salisbury, Elizabeth; Sambrook, Joe; Saunders, Christobel; Scott, Clare; Scott, Elizabeth; Scott, Rodney; Seshadri, Ram; Shelling, Andrew; Southey, Melissa; Spurdle, Amanda; Suthers, Graeme; Taylor, Donna; Tennant, Christopher; Thorne, Heather; Townshend, Sharron; Tucker, Kathy; Tyler, Janet; Venter, Deon; Visvader, Jane; Walpole, Ian; Ward, Robin; Waring, Paul; Warner, Bev; Warren, Graham; Watson, Elizabeth; Williams, Rachael; Wilson, Judy; Winship, Ingrid; Young, Mary Ann; Bowtell, David; Green, Adele; deFazio, Anna; Chenevix-Trench, Georgia; Gertig, Dorota; Webb, Penny
Breast cancer exhibits familial aggregation, consistent with variation in genetic susceptibility to the disease. Known susceptibility genes account for less than 25% of the familial risk of breast cancer, and the residual genetic variance is likely to be due to variants conferring more moderate risks. To identify further susceptibility alleles, we conducted a two-stage genome-wide association study in 4,398 breast cancer cases and 4,316 controls, followed by a third stage in which 30 single nucleotide polymorphisms (SNPs) were tested for confirmation in 21,860 cases and 22,578 controls from 22 studies. We used 227,876 SNPs that were estimated to correlate with 77% of known common SNPs in Europeans at r2>0.5. SNPs in five novel independent loci exhibited strong and consistent evidence of association with breast cancer (P<10−7). Four of these contain plausible causative genes (FGFR2, TNRC9, MAP3K1 and LSP1). At the second stage, 1,792 SNPs were significant at the P<0.05 level compared with an estimated 1,343 that would be expected by chance, indicating that many additional common susceptibility alleles may be identifiable by this approach. PMID:17529967
Eser, Umut; Falleur-Fettig, Melody; Johnson, Amy; Skotheim, Jan M
In budding yeast, commitment to cell division corresponds to activating the positive feedback loop of G1 cyclins controlled by the transcription factors SBF and MBF. This pair of transcription factors has over 200 targets, implying that cell-cycle commitment coincides with genome-wide changes in transcription. Here, we find that genes within this regulon have a well-defined distribution of transcriptional activation times. Combinatorial use of SBF and MBF results in a logical OR function for gene expression and partially explains activation timing. Activation of G1 cyclin expression precedes the activation of the bulk of the G1/S regulon, ensuring that commitment to cell division occurs before large-scale changes in transcription. Furthermore, we find similar positive feedback-first regulation in the yeasts S. bayanus and S. cerevisiae, as well as human cells. The widespread use of the feedback-first motif in eukaryotic cell-cycle control, implemented by nonorthologous proteins, suggests its frequent deployment at cellular transitions. Copyright © 2011 Elsevier Inc. All rights reserved.
Chen, Fang; Chen, Gary K.; Stram, Daniel O.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Palmer, Julie R.; Hu, Jennifer J.; Rebbeck, Tim R.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Ruiz-Narvaez, Edward A.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; DeMichele, Angela; Chanock, Stephen J.; Blot, William; Signorello, Lisa; Cai, Qiuyin; Li, Guoliang; Long, Jirong; Huo, Dezheng; Zheng, Yonglan; Cox, Nancy J.; Olopade, Olufunmilayo I.; Ogundiran, Temidayo O.; Adebamowo, Clement; Nathanson, Katherine L.; Domchek, Susan M.; Simon, Michael S.; Hennis, Anselm; Nemesure, Barbara; Wu, Suh-Yuh; Leske, M. Cristina; Ambs, Stefan; Hutter, Carolyn M.; Young, Alicia; Kooperberg, Charles; Peters, Ulrike; Rhie, Suhn K.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Christopher A.
Genome-wide association studies (GWAS) in diverse populations are needed to reveal variants that are more common and/or limited to defined populations. We conducted a GWAS of breast cancer in women of African ancestry, with genotyping of > 1,000,000 SNPs in 3,153 African American cases and 2,831 controls, and replication testing of the top 66 associations in an additional 3,607 breast cancer cases and 11,330 controls of African ancestry. Two of the 66 SNPs replicated (p < 0.05) in stage 2, which reached statistical significance levels of 10−6 and 10−5 in the stage 1 and 2 combined analysis (rs4322600 at chromosome 14q31: OR = 1.18, p = 4.3×10−6; rs10510333 at chromosome 3p26: OR = 1.15, p = 1.5×10−5). These suggestive risk loci have not been identified in previous GWAS in other populations and will need to be examined in additional samples. Identification of novel risk variants for breast cancer in women of African ancestry will demand testing of a substantially larger set of markers from stage 1 in a larger replication sample. PMID:22923054
Legge, S E; Hamshere, M L; Ripke, S; Pardinas, A F; Goldstein, J I; Rees, E; Richards, A L; Leonenko, G; Jorskog, L F; Goldstein, Jacqueline I; Jarskog, L Fredrik; Hilliard, Chris; Alfirevic, Ana; Duncan, Laramie; Fourches, Denis; Huang, Hailiang; Lek, Monkol; Neale, Benjamin M; Ripke, Stephan; Shianna, Kevin; Szatkiewicz, Jin P; Tropsha, Alexander; van den Oord, Edwin JCG; Cascorbi, Ingolf; Dettling, Michael; Gazit, Ephraim; Goff, Donald C; Holden, Arthur L; Kelly, Deanna L; Malhotra, Anil K; Nielsen, Jimmi; Pirmohamed, Munir; Rujescu, Dan; Werge, Thomas; Levy, Deborah L; Josiassen, Richard C; Kennedy, James L; Lieberman, Jeffrey A; Daly, Mark J; Sullivan, Patrick F; Chambert, K D; Collier, D A; Genovese, G; Giegling, I; Holmans, P; Jonasdottir, A; Kirov, G; McCarroll, S A; MacCabe, J H; Mantripragada, K; Moran, J L; Neale, B M; Stefansson, H; Rujescu, D; Daly, M J; Sullivan, P F; Owen, M J; O'Donovan, M C; Walters, J T R
The antipsychotic clozapine is uniquely effective in the management of schizophrenia; however, its use is limited by its potential to induce agranulocytosis. The causes of this, and of its precursor neutropenia, are largely unknown, although genetic factors have an important role. We sought risk alleles for clozapine-associated neutropenia in a sample of 66 cases and 5583 clozapine-treated controls, through a genome-wide association study (GWAS), imputed human leukocyte antigen (HLA) alleles, exome array and copy-number variation (CNV) analyses. We then combined associated variants in a meta-analysis with data from the Clozapine-Induced Agranulocytosis Consortium (up to 163 cases and 7970 controls). In the largest combined sample to date, we identified a novel association with rs149104283 (odds ratio (OR)=4.32, P=1.79 × 10−8), intronic to transcripts of SLCO1B3 and SLCO1B7, members of a family of hepatic transporter genes previously implicated in adverse drug reactions including simvastatin-induced myopathy and docetaxel-induced neutropenia. Exome array analysis identified gene-wide associations of uncommon non-synonymous variants within UBAP2 and STARD9. We additionally provide independent replication of a previously identified variant in HLA-DQB1 (OR=15.6, P=0.015, positive predictive value=35.1%). These results implicate biological pathways through which clozapine may act to cause this serious adverse effect. PMID:27400856
Jeffrey A Gross
Full Text Available Suicide and suicide attempts are complex behaviors that result from the interaction of different factors, including genetic variants that increase the predisposition to suicidal behaviors. Copy number variations (CNVs are deletions or duplications of a segment of DNA usually larger than one kilobase. These structural genetic changes, although quite rare, have been associated with genetic liability to mental disorders, such as autism, schizophrenia, and bipolar disorder. No genome-wide level studies have been published investigating the potential role of CNVs in suicidal behaviors. Based on single-nucleotide polymorphism array data, we followed the Penn-CNV standards to detect CNVs in 1,608 subjects, comprising 475 suicide and suicide attempt cases and 1,133 controls. Although the initial algorithms determined the presence of CNVs on chromosomes 6 and 12 in seven and eight cases, respectively, compared with none of the controls, visual inspection of the raw data did not support this finding. Furthermore we were unable to validate these findings by CNV-specific real-time polymerase chain reaction. Additionally, rare CNV burden analysis did not find an association between the frequency or length of rare CNVs and suicidal behavior in our sample population. Although our findings suggest CNVs do not play an important role in the etiology of suicidal behaviors, they are not inconsistent with the strong evidence from the literature suggesting that other genetic variants account for a portion of the total phenotypic variability in suicidal behavior.
Garcia-Closas, Montserrat; Couch, Fergus J; Lindstrom, Sara; Michailidou, Kyriaki; Schmidt, Marjanka K; Brook, Mark N; orr, Nick; Rhie, Suhn Kyong; Riboli, Elio; Feigelson, Heather s; Le Marchand, Loic; Buring, Julie E; Eccles, Diana; Miron, Penelope; Fasching, Peter A; Brauch, Hiltrud; Chang-Claude, Jenny; Carpenter, Jane; Godwin, Andrew K; Nevanlinna, Heli; Giles, Graham G; Cox, Angela; Hopper, John L; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Dicks, Ed; Howat, Will J; Schoof, Nils; Bojesen, Stig E; Lambrechts, Diether; Broeks, Annegien; Andrulis, Irene L; Guénel, Pascal; Burwinkel, Barbara; Sawyer, Elinor J; Hollestelle, Antoinette; Fletcher, Olivia; Winqvist, Robert; Brenner, Hermann; Mannermaa, Arto; Hamann, Ute; Meindl, Alfons; Lindblom, Annika; Zheng, Wei; Devillee, Peter; Goldberg, Mark S; Lubinski, Jan; Kristensen, Vessela; Swerdlow, Anthony; Anton-Culver, Hoda; Dörk, Thilo; Muir, Kenneth; Matsuo, Keitaro; Wu, Anna H; Radice, Paolo; Teo, Soo Hwang; Shu, Xiao-Ou; Blot, William; Kang, Daehee; Hartman, Mikael; Sangrajrang, Suleeporn; Shen, Chen-Yang; Southey, Melissa C; Park, Daniel J; Hammet, Fleur; Stone, Jennifer; Veer, Laura J Van’t; Rutgers, Emiel J; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Peto, Julian; Schrauder, Michael G; Ekici, Arif B; Beckmann, Matthias W; Silva, Isabel dos Santos; Johnson, Nichola; Warren, Helen; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Marme, Federick; Schneeweiss, Andreas; Sohn, Christof; Truong, Therese; Laurent-Puig, Pierre; Kerbrat, Pierre; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Milne, Roger L; Perez, Jose Ignacio Arias; Menéndez, Primitiva; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Lichtner, Peter; Lochmann, Magdalena; Justenhoven, Christina; Ko, Yon-Dschun; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Greco, Dario; Heikkinen, Tuomas; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Antonenkova, Natalia N; Margolin, Sara; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Balleine, Rosemary; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Neven, Patrick; Dieudonné, Anne-Sophie; Leunen, Karin; Rudolph, Anja; Nickels, Stefan; Flesch-Janys, Dieter; Peterlongo, Paolo; Peissel, Bernard; Bernard, Loris; Olson, Janet E; Wang, Xianshu; Stevens, Kristen; Severi, Gianluca; Baglietto, Laura; Mclean, Catriona; Coetzee, Gerhard A; Feng, Ye; Henderson, Brian E; Schumacher, Fredrick; Bogdanova, Natalia V; Labrèche, France; Dumont, Martine; Yip, Cheng Har; Taib, Nur Aishah Mohd; Cheng, Ching-Yu; Shrubsole, Martha; Long, Jirong; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Kauppila, Saila; knight, Julia A; Glendon, Gord; Mulligan, Anna Marie; Tollenaar, Robertus A E M; Seynaeve, Caroline M; Kriege, Mieke; Hooning, Maartje J; Van den Ouweland, Ans M W; Van Deurzen, Carolien H M; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Balasubramanian, Sabapathy P; Cross, Simon S; Reed, Malcolm W R; Signorello, Lisa; Cai, Qiuyin; Shah, Mitul; Miao, Hui; Chan, Ching Wan; Chia, Kee Seng; Jakubowska, Anna; Jaworska, Katarzyna; Durda, Katarzyna; Hsiung, Chia-Ni; Wu, Pei-Ei; Yu, Jyh-Cherng; Ashworth, Alan; Jones, Michael; Tessier, Daniel C; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Vincent, Daniel; Bacot, Francois; Ambrosone, Christine B; Bandera, Elisa V; John, Esther M; Chen, Gary K; Hu, Jennifer J; Rodriguez-gil, Jorge L; Bernstein, Leslie; Press, Michael F; Ziegler, Regina G; Millikan, Robert M; Deming-Halverson, Sandra L; Nyante, Sarah; Ingles, Sue A; Waisfisz, Quinten; Tsimiklis, Helen; Makalic, Enes; Schmidt, Daniel; Bui, Minh; Gibson, Lorna; Müller-Myhsok, Bertram; Schmutzler, Rita K; Hein, Rebecca; Dahmen, Norbert; Beckmann, Lars; Aaltonen, Kirsimari; Czene, Kamila; Irwanto, Astrid; Liu, Jianjun; Turnbull, Clare; Rahman, Nazneen; Meijers-Heijboer, Hanne; Uitterlinden, Andre G; Rivadeneira, Fernando; Olswold, Curtis; Slager, Susan; Pilarski, Robert; Ademuyiwa, Foluso; Konstantopoulou, Irene; Martin, Nicholas G; Montgomery, Grant W; Slamon, Dennis J; Rauh, Claudia; Lux, Michael P; Jud, Sebastian M; Bruning, Thomas; Weaver, Joellen; Sharma, Priyanka; Pathak, Harsh; Tapper, Will; Gerty, Sue; Durcan, Lorraine; Trichopoulos, Dimitrios; Tumino, Rosario; Peeters, Petra H; Kaaks, Rudolf; Campa, Daniele; Canzian, Federico; Weiderpass, Elisabete; Johansson, Mattias; Khaw, Kay-Tee; Travis, Ruth; Clavel-Chapelon, Françoise; Kolonel, Laurence N; Chen, Constance; Beck, Andy; Hankinson, Susan E; Berg, Christine D; Hoover, Robert N; Lissowska, Jolanta; Figueroa, Jonine D; Chasman, Daniel I; Gaudet, Mia M; Diver, W Ryan; Willett, Walter C; Hunter, David J; Simard, Jacques; Benitez, Javier; Dunning, Alison M; Sherman, Mark E; Chenevix-Trench, Georgia; Chanock, Stephen J; Hall, Per; Pharoah, Paul D P; Vachon, Celine; Easton, Douglas F; Haiman, Christopher A; Kraft, Peter
Estrogen receptor (ER)-negative tumors represent 20–30% of all breast cancers, with a higher proportion occurring in younger women and women of African ancestry1. The etiology2 and clinical behavior3 of ER-negative tumors are different from those of tumors expressing ER (ER positive), including differences in genetic predisposition4. To identify susceptibility loci specific to ER-negative disease, we combined in a meta-analysis 3 genome-wide association studies of 4,193 ER-negative breast cancer cases and 35,194 controls with a series of 40 follow-up studies (6,514 cases and 41,455 controls), genotyped using a custom Illumina array, iCOGS, developed by the Collaborative Oncological Gene-environment Study (COGS). SNPs at four loci, 1q32.1 (MDM4, P = 2.1 × 10−12 and LGR6, P = 1.4 × 10−8), 2p24.1 (P = 4.6 × 10−8) and 16q12.2 (FTO, P = 4.0 × 10−8), were associated with ER-negative but not ER-positive breast cancer (P > 0.05). These findings provide further evidence for distinct etiological pathways associated with invasive ER-positive and ER-negative breast cancers. PMID:23535733
Garcia-Closas, Montserrat; Couch, Fergus J; Lindstrom, Sara; Michailidou, Kyriaki; Schmidt, Marjanka K; Brook, Mark N; Orr, Nick; Rhie, Suhn Kyong; Riboli, Elio; Feigelson, Heather S; Le Marchand, Loic; Buring, Julie E; Eccles, Diana; Miron, Penelope; Fasching, Peter A; Brauch, Hiltrud; Chang-Claude, Jenny; Carpenter, Jane; Godwin, Andrew K; Nevanlinna, Heli; Giles, Graham G; Cox, Angela; Hopper, John L; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Dicks, Ed; Howat, Will J; Schoof, Nils; Bojesen, Stig E; Lambrechts, Diether; Broeks, Annegien; Andrulis, Irene L; Guénel, Pascal; Burwinkel, Barbara; Sawyer, Elinor J; Hollestelle, Antoinette; Fletcher, Olivia; Winqvist, Robert; Brenner, Hermann; Mannermaa, Arto; Hamann, Ute; Meindl, Alfons; Lindblom, Annika; Zheng, Wei; Devillee, Peter; Goldberg, Mark S; Lubinski, Jan; Kristensen, Vessela; Swerdlow, Anthony; Anton-Culver, Hoda; Dörk, Thilo; Muir, Kenneth; Matsuo, Keitaro; Wu, Anna H; Radice, Paolo; Teo, Soo Hwang; Shu, Xiao-Ou; Blot, William; Kang, Daehee; Hartman, Mikael; Sangrajrang, Suleeporn; Shen, Chen-Yang; Southey, Melissa C; Park, Daniel J; Hammet, Fleur; Stone, Jennifer; Veer, Laura J Van't; Rutgers, Emiel J; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Peto, Julian; Schrauder, Michael G; Ekici, Arif B; Beckmann, Matthias W; Dos Santos Silva, Isabel; Johnson, Nichola; Warren, Helen; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Marme, Federick; Schneeweiss, Andreas; Sohn, Christof; Truong, Therese; Laurent-Puig, Pierre; Kerbrat, Pierre; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Milne, Roger L; Perez, Jose Ignacio Arias; Menéndez, Primitiva; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Lichtner, Peter; Lochmann, Magdalena; Justenhoven, Christina; Ko, Yon-Dschun; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Greco, Dario; Heikkinen, Tuomas; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Antonenkova, Natalia N; Margolin, Sara; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Balleine, Rosemary; Tseng, Chiu-Chen; Berg, David Van Den; Stram, Daniel O; Neven, Patrick; Dieudonné, Anne-Sophie; Leunen, Karin; Rudolph, Anja; Nickels, Stefan; Flesch-Janys, Dieter; Peterlongo, Paolo; Peissel, Bernard; Bernard, Loris; Olson, Janet E; Wang, Xianshu; Stevens, Kristen; Severi, Gianluca; Baglietto, Laura; McLean, Catriona; Coetzee, Gerhard A; Feng, Ye; Henderson, Brian E; Schumacher, Fredrick; Bogdanova, Natalia V; Labrèche, France; Dumont, Martine; Yip, Cheng Har; Taib, Nur Aishah Mohd; Cheng, Ching-Yu; Shrubsole, Martha; Long, Jirong; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Kauppila, Saila; Knight, Julia A; Glendon, Gord; Mulligan, Anna Marie; Tollenaar, Robertus A E M; Seynaeve, Caroline M; Kriege, Mieke; Hooning, Maartje J; van den Ouweland, Ans M W; van Deurzen, Carolien H M; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Balasubramanian, Sabapathy P; Cross, Simon S; Reed, Malcolm W R; Signorello, Lisa; Cai, Qiuyin; Shah, Mitul; Miao, Hui; Chan, Ching Wan; Chia, Kee Seng; Jakubowska, Anna; Jaworska, Katarzyna; Durda, Katarzyna; Hsiung, Chia-Ni; Wu, Pei-Ei; Yu, Jyh-Cherng; Ashworth, Alan; Jones, Michael; Tessier, Daniel C; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Vincent, Daniel; Bacot, Francois; Ambrosone, Christine B; Bandera, Elisa V; John, Esther M; Chen, Gary K; Hu, Jennifer J; Rodriguez-Gil, Jorge L; Bernstein, Leslie; Press, Michael F; Ziegler, Regina G; Millikan, Robert M; Deming-Halverson, Sandra L; Nyante, Sarah; Ingles, Sue A; Waisfisz, Quinten; Tsimiklis, Helen; Makalic, Enes; Schmidt, Daniel; Bui, Minh; Gibson, Lorna; Müller-Myhsok, Bertram; Schmutzler, Rita K; Hein, Rebecca; Dahmen, Norbert; Beckmann, Lars; Aaltonen, Kirsimari; Czene, Kamila; Irwanto, Astrid; Liu, Jianjun; Turnbull, Clare; Rahman, Nazneen; Meijers-Heijboer, Hanne; Uitterlinden, Andre G; Rivadeneira, Fernando; Olswold, Curtis; Slager, Susan; Pilarski, Robert; Ademuyiwa, Foluso; Konstantopoulou, Irene; Martin, Nicholas G; Montgomery, Grant W; Slamon, Dennis J; Rauh, Claudia; Lux, Michael P; Jud, Sebastian M; Bruning, Thomas; Weaver, Joellen; Sharma, Priyanka; Pathak, Harsh; Tapper, Will; Gerty, Sue; Durcan, Lorraine; Trichopoulos, Dimitrios; Tumino, Rosario; Peeters, Petra H; Kaaks, Rudolf; Campa, Daniele; Canzian, Federico; Weiderpass, Elisabete; Johansson, Mattias; Khaw, Kay-Tee; Travis, Ruth; Clavel-Chapelon, Françoise; Kolonel, Laurence N; Chen, Constance; Beck, Andy; Hankinson, Susan E; Berg, Christine D; Hoover, Robert N; Lissowska, Jolanta; Figueroa, Jonine D; Chasman, Daniel I; Gaudet, Mia M; Diver, W Ryan; Willett, Walter C; Hunter, David J; Simard, Jacques; Benitez, Javier; Dunning, Alison M; Sherman, Mark E; Chenevix-Trench, Georgia; Chanock, Stephen J; Hall, Per; Pharoah, Paul D P; Vachon, Celine; Easton, Douglas F; Haiman, Christopher A; Kraft, Peter
Estrogen receptor (ER)-negative tumors represent 20-30% of all breast cancers, with a higher proportion occurring in younger women and women of African ancestry. The etiology and clinical behavior of ER-negative tumors are different from those of tumors expressing ER (ER positive), including differences in genetic predisposition. To identify susceptibility loci specific to ER-negative disease, we combined in a meta-analysis 3 genome-wide association studies of 4,193 ER-negative breast cancer cases and 35,194 controls with a series of 40 follow-up studies (6,514 cases and 41,455 controls), genotyped using a custom Illumina array, iCOGS, developed by the Collaborative Oncological Gene-environment Study (COGS). SNPs at four loci, 1q32.1 (MDM4, P = 2.1 × 10(-12) and LGR6, P = 1.4 × 10(-8)), 2p24.1 (P = 4.6 × 10(-8)) and 16q12.2 (FTO, P = 4.0 × 10(-8)), were associated with ER-negative but not ER-positive breast cancer (P > 0.05). These findings provide further evidence for distinct etiological pathways associated with invasive ER-positive and ER-negative breast cancers.
Full Text Available MiRNAs impact on the control of cell fate by regulating gene expression at the post-transcriptional level. Here, using mammalian muscle differentiation as a model and a phenotypic loss-of-function screen, we explored the function of miRNAs at the genome-wide level. We found that the depletion of a high number of miRNAs (63 impacted on differentiation of human muscle precursors, underscoring the importance of this post-transcriptional mechanism of gene regulation. Interestingly, a comparison with miRNA expression profiles revealed that most of the hit miRNAs did not show any significant variations of expression during differentiation. These constitutively expressed miRNAs might be required for basic and/or essential cell function, or else might be regulated at the post-transcriptional level. MiRNA inhibition yielded a variety of phenotypes, reflecting the widespread miRNA involvement in differentiation. Using a functional screen (the STarS--Suppressor Target Screen--approach, i. e. concomitant knockdown of miRNAs and of candidate target proteins, we discovered miRNA protein targets that are previously uncharacterized controllers of muscle-cell terminal differentiation. Our results provide a strategy for functional annotation of the human miRnome.
Jorgenson, Eric; Makki, Nadja; Shen, Ling; Chen, David C.; Tian, Chao; Eckalbar, Walter L.; Hinds, David; Ahituv, Nadav; Avins, Andrew
Inguinal hernia repair is one of the most commonly performed operations in the world, yet little is known about the genetic mechanisms that predispose individuals to develop inguinal hernias. We perform a genome-wide association analysis of surgically confirmed inguinal hernias in 72,805 subjects (5,295 cases and 67,510 controls) and confirm top associations in an independent cohort of 92,444 subjects with self-reported hernia repair surgeries (9,701 cases and 82,743 controls). We identify four novel inguinal hernia susceptibility loci in the regions of EFEMP1, WT1, EBF2 and ADAMTS6. Moreover, we observe expression of all four genes in mouse connective tissue and network analyses show an important role for two of these genes (EFEMP1 and WT1) in connective tissue maintenance/homoeostasis. Our findings provide insight into the aetiology of hernia development and highlight genetic pathways for studies of hernia development and its treatment. PMID:26686553
Raman, Harsh; Raman, Rosy; Kilian, Andrzej; Detering, Frank; Carling, Jason; Coombes, Neil; Diffey, Simon; Kadkol, Gururaj; Edwards, David; McCully, Margaret; Ruperao, Pradeep; Parkin, Isobel A. P.; Batley, Jacqueline; Luckett, David J.; Wratten, Neil
Resistance to pod shattering (shatter resistance) is a target trait for global rapeseed (canola, Brassica napus L.), improvement programs to minimise grain loss in the mature standing crop, and during windrowing and mechanical harvest. We describe the genetic basis of natural variation for shatter resistance in B. napus and show that several quantitative trait loci (QTL) control this trait. To identify loci underlying shatter resistance, we used a novel genotyping-by-sequencing approach DArT-Seq. QTL analysis detected a total of 12 significant QTL on chromosomes A03, A07, A09, C03, C04, C06, and C08; which jointly account for approximately 57% of the genotypic variation in shatter resistance. Through Genome-Wide Association Studies, we show that a large number of loci, including those that are involved in shattering in Arabidopsis, account for variation in shatter resistance in diverse B. napus germplasm. Our results indicate that genetic diversity for shatter resistance genes in B. napus is limited; many of the genes that might control this trait were not included during the natural creation of this species, or were not retained during the domestication and selection process. We speculate that valuable diversity for this trait was lost during the natural creation of B. napus. To improve shatter resistance, breeders will need to target the introduction of useful alleles especially from genotypes of other related species of Brassica, such as those that we have identified. PMID:25006804
Polesskaya, Anna; Degerny, Cindy; Pinna, Guillaume; Maury, Yves; Kratassiouk, Gueorgui; Mouly, Vincent; Morozova, Nadya; Kropp, Jeremie; Frandsen, Niels; Harel-Bellan, Annick
MiRNAs impact on the control of cell fate by regulating gene expression at the post-transcriptional level. Here, using mammalian muscle differentiation as a model and a phenotypic loss-of-function screen, we explored the function of miRNAs at the genome-wide level. We found that the depletion of a high number of miRNAs (63) impacted on differentiation of human muscle precursors, underscoring the importance of this post-transcriptional mechanism of gene regulation. Interestingly, a comparison with miRNA expression profiles revealed that most of the hit miRNAs did not show any significant variations of expression during differentiation. These constitutively expressed miRNAs might be required for basic and/or essential cell function, or else might be regulated at the post-transcriptional level. MiRNA inhibition yielded a variety of phenotypes, reflecting the widespread miRNA involvement in differentiation. Using a functional screen (the STarS--Suppressor Target Screen--approach, i. e. concomitant knockdown of miRNAs and of candidate target proteins), we discovered miRNA protein targets that are previously uncharacterized controllers of muscle-cell terminal differentiation. Our results provide a strategy for functional annotation of the human miRnome.
Kinnersley, Ben; Labussière, Marianne; Holroyd, Amy; Di Stefano, Anna-Luisa; Broderick, Peter; Vijayakrishnan, Jayaram; Mokhtari, Karima; Delattre, Jean-Yves; Gousias, Konstantinos; Schramm, Johannes; Schoemaker, Minouk J.; Fleming, Sarah J.; Herms, Stefan; Heilmann, Stefanie; Schreiber, Stefan; Wichmann, Heinz-Erich; Nöthen, Markus M.; Swerdlow, Anthony; Lathrop, Mark; Simon, Matthias; Bondy, Melissa; Sanson, Marc; Houlston, Richard S.
Previous genome-wide association studies (GWASs) have shown that common genetic variation contributes to the heritable risk of glioma. To identify new glioma susceptibility loci, we conducted a meta-analysis of four GWAS (totalling 4,147 cases and 7,435 controls), with imputation using 1000 Genomes and UK10K Project data as reference. After genotyping an additional 1,490 cases and 1,723 controls we identify new risk loci for glioblastoma (GBM) at 12q23.33 (rs3851634, near POLR3B, P=3.02 × 10−9) and non-GBM at 10q25.2 (rs11196067, near VTI1A, P=4.32 × 10−8), 11q23.2 (rs648044, near ZBTB16, P=6.26 × 10−11), 12q21.2 (rs12230172, P=7.53 × 10−11) and 15q24.2 (rs1801591, near ETFA, P=5.71 × 10−9). Our findings provide further insights into the genetic basis of the different glioma subtypes. PMID:26424050
Köttgen, Anna; Albrecht, Eva; Teumer, Alexander; Vitart, Veronique; Krumsiek, Jan; Hundertmark, Claudia; Pistis, Giorgio; Ruggiero, Daniela; O’Seaghdha, Conall M; Haller, Toomas; Yang, Qiong; Tanaka, Toshiko; Johnson, Andrew D; Kutalik, Zoltán; Smith, Albert V; Shi, Julia; Struchalin, Maksim; Middelberg, Rita P S; Brown, Morris J; Gaffo, Angelo L; Pirastu, Nicola; Li, Guo; Hayward, Caroline; Zemunik, Tatijana; Huffman, Jennifer; Yengo, Loic; Zhao, Jing Hua; Demirkan, Ayse; Feitosa, Mary F; Liu, Xuan; Malerba, Giovanni; Lopez, Lorna M; van der Harst, Pim; Li, Xinzhong; Kleber, Marcus E; Hicks, Andrew A; Nolte, Ilja M; Johansson, Asa; Murgia, Federico; Wild, Sarah H; Bakker, Stephan J L; Peden, John F; Dehghan, Abbas; Steri, Maristella; Tenesa, Albert; Lagou, Vasiliki; Salo, Perttu; Mangino, Massimo; Rose, Lynda M; Lehtimäki, Terho; Woodward, Owen M; Okada, Yukinori; Tin, Adrienne; Müller, Christian; Oldmeadow, Christopher; Putku, Margus; Czamara, Darina; Kraft, Peter; Frogheri, Laura; Thun, Gian Andri; Grotevendt, Anne; Gislason, Gauti Kjartan; Harris, Tamara B; Launer, Lenore J; McArdle, Patrick; Shuldiner, Alan R; Boerwinkle, Eric; Coresh, Josef; Schmidt, Helena; Schallert, Michael; Martin, Nicholas G; Montgomery, Grant W; Kubo, Michiaki; Nakamura, Yusuke; Tanaka, Toshihiro; Munroe, Patricia B; Samani, Nilesh J; Jacobs, David R; Liu, Kiang; D’Adamo, Pio; Ulivi, Sheila; Rotter, Jerome I; Psaty, Bruce M; Vollenweider, Peter; Waeber, Gerard; Campbell, Susan; Devuyst, Olivier; Navarro, Pau; Kolcic, Ivana; Hastie, Nicholas; Balkau, Beverley; Froguel, Philippe; Esko, Tõnu; Salumets, Andres; Khaw, Kay Tee; Langenberg, Claudia; Wareham, Nicholas J; Isaacs, Aaron; Kraja, Aldi; Zhang, Qunyuan; Wild, Philipp S; Scott, Rodney J; Holliday, Elizabeth G; Org, Elin; Viigimaa, Margus; Bandinelli, Stefania; Metter, Jeffrey E; Lupo, Antonio; Trabetti, Elisabetta; Sorice, Rossella; Döring, Angela; Lattka, Eva; Strauch, Konstantin; Theis, Fabian; Waldenberger, Melanie; Wichmann, H-Erich; Davies, Gail; Gow, Alan J; Bruinenberg, Marcel; Study, LifeLines Cohort; Stolk, Ronald P; Kooner, Jaspal S; Zhang, Weihua; Winkelmann, Bernhard R; Boehm, Bernhard O; Lucae, Susanne; Penninx, Brenda W; Smit, Johannes H; Curhan, Gary; Mudgal, Poorva; Plenge, Robert M; Portas, Laura; Persico, Ivana; Kirin, Mirna; Wilson, James F; Leach, Irene Mateo; van Gilst, Wiek H; Goel, Anuj; Ongen, Halit; Hofman, Albert; Rivadeneira, Fernando; Uitterlinden, Andre G; Imboden, Medea; von Eckardstein, Arnold; Cucca, Francesco; Nagaraja, Ramaiah; Piras, Maria Grazia; Nauck, Matthias; Schurmann, Claudia; Budde, Kathrin; Ernst, Florian; Farrington, Susan M; Theodoratou, Evropi; Prokopenko, Inga; Stumvoll, Michael; Jula, Antti; Perola, Markus; Salomaa, Veikko; Shin, So-Youn; Spector, Tim D; Sala, Cinzia; Ridker, Paul M; Kähönen, Mika; Viikari, Jorma; Hengstenberg, Christian; Nelson, Christopher P; Consortium, CARDIoGRAM; Consortium, DIAGRAM; Consortium, ICBP; Consortium, MAGIC; Meschia, James F; Nalls, Michael A; Sharma, Pankaj; Singleton, Andrew B; Kamatani, Naoyuki; Zeller, Tanja; Burnier, Michel; Attia, John; Laan, Maris; Klopp, Norman; Hillege, Hans L; Kloiber, Stefan; Choi, Hyon; Pirastu, Mario; Tore, Silvia; Probst-Hensch, Nicole M; Völzke, Henry; Gudnason, Vilmundur; Parsa, Afshin; Schmidt, Reinhold; Whitfield, John B; Fornage, Myriam; Gasparini, Paolo; Siscovick, David S; Polašek, Ozren; Campbell, Harry; Rudan, Igor; Bouatia-Naji, Nabila; Metspalu, Andres; Loos, Ruth J F; van Duijn, Cornelia M; Borecki, Ingrid B; Ferrucci, Luigi; Gambaro, Giovanni; Deary, Ian J; Wolffenbuttel, Bruce H R; Chambers, John C; März, Winfried; Pramstaller, Peter P; Snieder, Harold; Gyllensten, Ulf; Wright, Alan F; Navis, Gerjan; Watkins, Hugh; Witteman, Jacqueline C M; Sanna, Serena; Schipf, Sabine; Dunlop, Malcolm G; Tönjes, Anke; Ripatti, Samuli; Soranzo, Nicole; Toniolo, Daniela; Chasman, Daniel I; Raitakari, Olli; Kao, W H Linda; Ciullo, Marina; Fox, Caroline S; Caulfield, Mark; Bochud, Murielle; Gieger, Christian
Elevated serum urate concentrations can cause gout, a prevalent and painful inflammatory arthritis. By combining data from >140,000 individuals of European ancestry within the Global Urate Genetics Consortium (GUGC), we identified and replicated 28 genome-wide significant loci in association with serum urate concentrations (18 new regions in or near TRIM46, INHBB, SFMBT1, TMEM171, VEGFA, BAZ1B, PRKAG2, STC1, HNF4G, A1CF, ATXN2, UBE2Q2, IGF1R, NFAT5, MAF, HLF, ACVR1B-ACVRL1 and B3GNT4). Associations for many of the loci were of similar magnitude in individuals of non-European ancestry. We further characterized these loci for associations with gout, transcript expression and the fractional excretion of urate. Network analyses implicate the inhibins-activins signaling pathways and glucose metabolism in systemic urate control. New candidate genes for serum urate concentration highlight the importance of metabolic control of urate production and excretion, which may have implications for the treatment and prevention of gout. PMID:23263486
Lu, Yi; Vitart, Veronique; Burdon, Kathryn P; Khor, Chiea Chuen; Bykhovskaya, Yelena; Mirshahi, Alireza; Hewitt, Alex W; Koehn, Demelza; Hysi, Pirro G; Ramdas, Wishal D; Zeller, Tanja; Vithana, Eranga N; Cornes, Belinda K; Tay, Wan-Ting; Tai, E Shyong; Cheng, Ching-Yu; Liu, Jianjun; Foo, Jia-Nee; Saw, Seang Mei; Thorleifsson, Gudmar; Stefansson, Kari; Dimasi, David P; Mills, Richard A; Mountain, Jenny; Ang, Wei; Hoehn, René; Verhoeven, Virginie J M; Grus, Franz; Wolfs, Roger; Castagne, Raphaële; Lackner, Karl J; Springelkamp, Henriët; Yang, Jian; Jonasson, Fridbert; Leung, Dexter Y L; Chen, Li J; Tham, Clement C Y; Rudan, Igor; Vatavuk, Zoran; Hayward, Caroline; Gibson, Jane; Cree, Angela J; MacLeod, Alex; Ennis, Sarah; Polasek, Ozren; Campbell, Harry; Wilson, James F; Viswanathan, Ananth C; Fleck, Brian; Li, Xiaohui; Siscovick, David; Taylor, Kent D; Rotter, Jerome I; Yazar, Seyhan; Ulmer, Megan; Li, Jun; Yaspan, Brian L; Ozel, Ayse B; Richards, Julia E; Moroi, Sayoko E; Haines, Jonathan L; Kang, Jae H; Pasquale, Louis R; Allingham, R Rand; Ashley-Koch, Allison; Mitchell, Paul; Wang, Jie Jin; Wright, Alan F; Pennell, Craig; Spector, Timothy D; Young, Terri L; Klaver, Caroline C W; Martin, Nicholas G; Montgomery, Grant W; Anderson, Michael G; Aung, Tin; Willoughby, Colin E; Wiggs, Janey L; Pang, Chi P; Thorsteinsdottir, Unnur; Lotery, Andrew J; Hammond, Christopher J; van Duijn, Cornelia M; Hauser, Michael A; Rabinowitz, Yaron S; Pfeiffer, Norbert; Mackey, David A; Craig, Jamie E; Macgregor, Stuart; Wong, Tien Y
Central corneal thickness (CCT) is associated with eye conditions including keratoconus and glaucoma. We performed a meta-analysis on >20,000 individuals in European and Asian populations that identified 16 new loci associated with CCT at genome-wide significance (P < 5 × 10−8). We further showed that 2 CCT-associated loci, FOXO1 and FNDC3B, conferred relatively large risks for keratoconus in 2 cohorts with 874 cases and 6,085 controls (rs2721051 near FOXO1 had odds ratio (OR) = 1.62, 95% confidence interval (CI) = 1.4–1.88, P = 2.7 × 10−10, and rs4894535 in FNDC3B had OR = 1.47, 95% CI = 1.29–1.68, P = 4.9 × 10−9). FNDC3B was also associated with primary open-angle glaucoma (P = 5.6 × 10−4; tested in 3 cohorts with 2,979 cases and 7,399 controls). Further analyses implicate the collagen and extracellular matrix pathways in the regulation of CCT. PMID:23291589
Full Text Available To investigate the underlying mechanisms of T2D pathogenesis, we looked for diabetes susceptibility genes that increase the risk of type 2 diabetes (T2D in a Han Chinese population. A two-stage genome-wide association (GWA study was conducted, in which 995 patients and 894 controls were genotyped using the Illumina HumanHap550-Duo BeadChip for the first genome scan stage. This was further replicated in 1,803 patients and 1,473 controls in stage 2. We found two loci not previously associated with diabetes susceptibility in and around the genes protein tyrosine phosphatase receptor type D (PTPRD (P = 8.54x10(-10; odds ratio [OR] = 1.57; 95% confidence interval [CI] = 1.36-1.82, and serine racemase (SRR (P = 3.06x10(-9; OR = 1.28; 95% CI = 1.18-1.39. We also confirmed that variants in KCNQ1 were associated with T2D risk, with the strongest signal at rs2237895 (P = 9.65x10(-10; OR = 1.29, 95% CI = 1.19-1.40. By identifying two novel genetic susceptibility loci in a Han Chinese population and confirming the involvement of KCNQ1, which was previously reported to be associated with T2D in Japanese and European descent populations, our results may lead to a better understanding of differences in the molecular pathogenesis of T2D among various populations.
Ghoussaini, Maya; Fletcher, Olivia; Michailidou, Kyriaki; Turnbull, Clare; Schmidt, Marjanka K; Dicks, Ed; Dennis, Joe; Wang, Qin; Humphreys, Manjeet K; Luccarini, Craig; Baynes, Caroline; Conroy, Don; Maranian, Melanie; Ahmed, Shahana; Driver, Kristy; Johnson, Nichola; Orr, Nicholas; Silva, Isabel dos Santos; Waisfisz, Quinten; Meijers-Heijboer, Hanne; Uitterlinden, Andre G.; Rivadeneira, Fernando; Hall, Per; Czene, Kamila; Irwanto, Astrid; Liu, Jianjun; Nevanlinna, Heli; Aittomäki, Kristiina; Blomqvist, Carl; Meindl, Alfons; Schmutzler, Rita K; Müller-Myhsok, Bertram; Lichtner, Peter; Chang-Claude, Jenny; Hein, Rebecca; Nickels, Stefan; Flesch-Janys, Dieter; Tsimiklis, Helen; Makalic, Enes; Schmidt, Daniel; Bui, Minh; Hopper, John L; Apicella, Carmel; Park, Daniel J; Southey, Melissa; Hunter, David J; Chanock, Stephen J; Broeks, Annegien; Verhoef, Senno; Hogervorst, Frans BL; Fasching, Peter A.; Lux, Michael P.; Beckmann, Matthias W.; Ekici, Arif B.; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Marme, Frederik; Schneeweiss, Andreas; Sohn, Christof; Burwinkel, Barbara; Guénel, Pascal; Truong, Thérèse; Cordina-Duverger, Emilie; Menegaux, Florence; Bojesen, Stig E; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Milne, Roger L.; Alonso, M. Rosario; González-Neira, Anna; Benítez, Javier; Anton-Culver, Hoda; Ziogas, Argyrios; Bernstein, Leslie; Dur, Christina Clarke; Brenner, Hermann; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Justenhoven, Christina; Brauch, Hiltrud; Brüning, Thomas; Wang-Gohrke, Shan; Eilber, Ursula; Dörk, Thilo; Schürmann, Peter; Bremer, Michael; Hillemanns, Peter; Bogdanova, Natalia V.; Antonenkova, Natalia N.; Rogov, Yuri I.; Karstens, Johann H.; Bermisheva, Marina; Prokofieva, Darya; Khusnutdinova, Elza; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Lambrechts, Diether; Yesilyurt, Betul T.; Floris, Giuseppe; Leunen, Karin; Manoukian, Siranoush; Bonanni, Bernardo; Fortuzzi, Stefano; Peterlongo, Paolo; Couch, Fergus J; Wang, Xianshu; Stevens, Kristen; Lee, Adam; Giles, Graham G.; Baglietto, Laura; Severi, Gianluca; McLean, Catriona; Alnæs, Grethe Grenaker; Kristensen, Vessela; Børrensen-Dale, Anne-Lise; John, Esther M.; Miron, Alexander; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Kauppila, Saila; Andrulis, Irene L.; Glendon, Gord; Mulligan, Anna Marie; Devilee, Peter; van Asperen, Christie J.; Tollenaar, Rob A.E.M.; Seynaeve, Caroline; Figueroa, Jonine D; Garcia-Closas, Montserrat; Brinton, Louise; Lissowska, Jolanta; Hooning, Maartje J.; Hollestelle, Antoinette; Oldenburg, Rogier A.; van den Ouweland, Ans M.W.; Cox, Angela; Reed, Malcolm WR; Shah, Mitul; Jakubowska, Ania; Lubinski, Jan; Jaworska, Katarzyna; Durda, Katarzyna; Jones, Michael; Schoemaker, Minouk; Ashworth, Alan; Swerdlow, Anthony; Beesley, Jonathan; Chen, Xiaoqing; Muir, Kenneth R; Lophatananon, Artitaya; Rattanamongkongul, Suthee; Chaiwerawattana, Arkom; Kang, Daehee; Yoo, Keun-Young; Noh, Dong-Young; Shen, Chen-Yang; Yu, Jyh-Cherng; Wu, Pei-Ei; Hsiung, Chia-Ni; Perkins, Annie; Swann, Ruth; Velentzis, Louiza; Eccles, Diana M; Tapper, Will J; Gerty, Susan M; Graham, Nikki J; Ponder, Bruce A. J.; Chenevix-Trench, Georgia; Pharoah, Paul D.P.; Lathrop, Mark; Dunning, Alison M.; Rahman, Nazneen; Peto, Julian; Easton, Douglas F
Breast cancer is the most common cancer among women. To date, 22 common breast cancer susceptibility loci have been identified accounting for ~ 8% of the heritability of the disease. We followed up 72 promising associations from two independent Genome Wide Association Studies (GWAS) in ~70,000 cases and ~68,000 controls from 41 case-control studies and nine breast cancer GWAS. We identified three new breast cancer risk loci on 12p11 (rs10771399; P=2.7 × 10−35), 12q24 (rs1292011; P=4.3×10−19) and 21q21 (rs2823093; P=1.1×10−12). SNP rs10771399 was associated with similar relative risks for both estrogen receptor (ER)-negative and ER-positive breast cancer, whereas the other two loci were associated only with ER-positive disease. Two of the loci lie in regions that contain strong plausible candidate genes: PTHLH (12p11) plays a crucial role in mammary gland development and the establishment of bone metastasis in breast cancer, while NRIP1 (21q21) encodes an ER co-factor and has a role in the regulation of breast cancer cell growth. PMID:22267197
We conducted a genome-wide scan for visceral leishmaniasis in mixed-breed dogs from a highly endemic area in Brazil using 149,648 single nucleotide polymorphism (SNP) markers genotyped in 20 cases and 28 controls. Using a mixed model approach, we found two candidate loci on canine autosomes 1 and 2....
Surakka, I.; Isaacs, A.; Karssen, L. C.
Recent genome-wide association (GWA) studies described 95 loci controlling serum lipid levels. These common variants explain similar to 25% of the heritability of the phenotypes. To date, no unbiased screen for gene-environment interactions for circulating lipids has been reported. We screened fo...
Steven G Potkin
Full Text Available BACKGROUND: With the exception of APOE epsilon4 allele, the common genetic risk factors for sporadic Alzheimer's Disease (AD are unknown. METHODS AND FINDINGS: We completed a genome-wide association study on 381 participants in the ADNI (Alzheimer's Disease Neuroimaging Initiative study. Samples were genotyped using the Illumina Human610-Quad BeadChip. 516,645 unique Single Nucleotide Polymorphisms (SNPs were included in the analysis following quality control measures. The genotype data and raw genetic data are freely available for download (LONI, http://www.loni.ucla.edu/ADNI/Data/. Two analyses were completed: a standard case-control analysis, and a novel approach using hippocampal atrophy measured on MRI as an objectively defined, quantitative phenotype. A General Linear Model was applied to identify SNPs for which there was an interaction between the genotype and diagnosis on the quantitative trait. The case-control analysis identified APOE and a new risk gene, TOMM40 (translocase of outer mitochondrial membrane 40, at a genome-wide significance level of < or =10(-6 (10(-11 for a haplotype. TOMM40 risk alleles were approximately twice as frequent in AD subjects as controls. The quantitative trait analysis identified 21 genes or chromosomal areas with at least one SNP with a p-value < or =10(-6, which can be considered potential "new" candidate loci to explore in the etiology of sporadic AD. These candidates included EFNA5, CAND1, MAGI2, ARSB, and PRUNE2, genes involved in the regulation of protein degradation, apoptosis, neuronal loss and neurodevelopment. Thus, we identified common genetic variants associated with the increased risk of developing AD in the ADNI cohort, and present publicly available genome-wide data. Supportive evidence based on case-control studies and biological plausibility by gene annotation is provided. Currently no available sample with both imaging and genetic data is available for replication. CONCLUSIONS: Using
Pillai, Sreekumar G.; Shianna, Kevin V.; Need, Anna C.; Feng, Sheng; Hersh, Craig P.; Bakke, Per; Gulsvik, Amund; Ruppert, Andreas; Lødrup Carlsen, Karin C.; Roses, Allen; Anderson, Wayne; Rennard, Stephen I.; Lomas, David A.; Silverman, Edwin K.; Goldstein, David B.
There is considerable variability in the susceptibility of smokers to develop chronic obstructive pulmonary disease (COPD). The only known genetic risk factor is severe deficiency of α1-antitrypsin, which is present in 1–2% of individuals with COPD. We conducted a genome-wide association study (GWAS) in a homogenous case-control cohort from Bergen, Norway (823 COPD cases and 810 smoking controls) and evaluated the top 100 single nucleotide polymorphisms (SNPs) in the family-based International COPD Genetics Network (ICGN; 1891 Caucasian individuals from 606 pedigrees) study. The polymorphisms that showed replication were further evaluated in 389 subjects from the US National Emphysema Treatment Trial (NETT) and 472 controls from the Normative Aging Study (NAS) and then in a fourth cohort of 949 individuals from 127 extended pedigrees from the Boston Early-Onset COPD population. Logistic regression models with adjustments of covariates were used to analyze the case-control populations. Family-based association analyses were conducted for a diagnosis of COPD and lung function in the family populations. Two SNPs at the α-nicotinic acetylcholine receptor (CHRNA 3/5) locus were identified in the genome-wide association study. They showed unambiguous replication in the ICGN family-based analysis and in the NETT case-control analysis with combined p-values of 1.48×10−10, (rs8034191) and 5.74×10−10 (rs1051730). Furthermore, these SNPs were significantly associated with lung function in both the ICGN and Boston Early-Onset COPD populations. The C allele of the rs8034191 SNP was estimated to have a population attributable risk for COPD of 12.2%. The association of hedgehog interacting protein (HHIP) locus on chromosome 4 was also consistently replicated, but did not reach genome-wide significance levels. Genome-wide significant association of the HHIP locus with lung function was identified in the Framingham Heart study (Wilk et al., companion article in this issue
Sreekumar G Pillai
Full Text Available There is considerable variability in the susceptibility of smokers to develop chronic obstructive pulmonary disease (COPD. The only known genetic risk factor is severe deficiency of alpha(1-antitrypsin, which is present in 1-2% of individuals with COPD. We conducted a genome-wide association study (GWAS in a homogenous case-control cohort from Bergen, Norway (823 COPD cases and 810 smoking controls and evaluated the top 100 single nucleotide polymorphisms (SNPs in the family-based International COPD Genetics Network (ICGN; 1891 Caucasian individuals from 606 pedigrees study. The polymorphisms that showed replication were further evaluated in 389 subjects from the US National Emphysema Treatment Trial (NETT and 472 controls from the Normative Aging Study (NAS and then in a fourth cohort of 949 individuals from 127 extended pedigrees from the Boston Early-Onset COPD population. Logistic regression models with adjustments of covariates were used to analyze the case-control populations. Family-based association analyses were conducted for a diagnosis of COPD and lung function in the family populations. Two SNPs at the alpha-nicotinic acetylcholine receptor (CHRNA 3/5 locus were identified in the genome-wide association study. They showed unambiguous replication in the ICGN family-based analysis and in the NETT case-control analysis with combined p-values of 1.48 x 10(-10, (rs8034191 and 5.74 x 10(-10 (rs1051730. Furthermore, these SNPs were significantly associated with lung function in both the ICGN and Boston Early-Onset COPD populations. The C allele of the rs8034191 SNP was estimated to have a population attributable risk for COPD of 12.2%. The association of hedgehog interacting protein (HHIP locus on chromosome 4 was also consistently replicated, but did not reach genome-wide significance levels. Genome-wide significant association of the HHIP locus with lung function was identified in the Framingham Heart study (Wilk et al., companion article
Chang, Cheng; Zhang, Kaida; Veluchamy, Abirami; Hébert, Harry L.; Looker, Helen C.; Colhoun, Helen M.; Palmer, Colin N. A.; Meng, Weihua
Purpose Diabetic cataract is one of the major eye complications of diabetes. It was reported that cataract occurs two to five times more frequently in patients with diabetes compared with those with no diabetes. The purpose of this study was to identify genetic contributors of diabetic cataract based on a genome-wide association approach using a well-defined Scottish diabetic cohort. Methods We adapted linked e-health records to define diabetic cataract. A diabetic cataract case in this study was defined as a type 2 diabetic patient who has ever been recorded in the linked e-health records to have cataracts in both eyes or who had previous cataract extraction surgeries in at least one eye. A control in this study was defined as a type 2 diabetic individual who has never been diagnosed as cataract in the linked e-health records and had no history of cataract surgeries. A standard genome-wide association approach was applied. Results Overall, we have 2341 diabetic cataract cases and 2878 controls in the genetics of diabetes audit and research in Tayside Scotland (GoDARTS) dataset. We found that the P value of rs2283290 in the CACNA1C gene was 8.81 × 10−10, which has reached genome-wide significance. We also identified that the blood calcium level was statistically different between diabetic cataract cases and controls. Conclusions We identified supporting evidence that CACNA1C gene is associated with diabetic cataract. The role of calcium in the cataractogenesis needs to be reevaluated in future studies. PMID:27124316
Full Text Available Abstract Background Specific genetic contributions for preeclampsia (PE are currently unknown. This genome-wide association study (GWAS aims to identify maternal single nucleotide polymorphisms (SNPs and copy-number variants (CNVs involved in the etiology of PE. Methods A genome-wide scan was performed on 177 PE cases (diagnosed according to National Heart, Lung and Blood Institute guidelines and 116 normotensive controls. White female study subjects from Iowa were genotyped on Affymetrix SNP 6.0 microarrays. CNV calls made using a combination of four detection algorithms (Birdseye, Canary, PennCNV, and QuantiSNP were merged using CNVision and screened with stringent prioritization criteria. Due to limited DNA quantities and the deleterious nature of copy-number deletions, it was decided a priori that only deletions would be selected for assay on the entire case-control dataset using quantitative real-time PCR. Results The top four SNP candidates had an allelic or genotypic p-value between 10-5 and 10-6, however, none surpassed the Bonferroni-corrected significance threshold. Three recurrent rare deletions meeting prioritization criteria detected in multiple cases were selected for targeted genotyping. A locus of particular interest was found showing an enrichment of case deletions in 19q13.31 (5/169 cases and 1/114 controls, which encompasses the PSG11 gene contiguous to a highly plastic genomic region. All algorithm calls for these regions were assay confirmed. Conclusions CNVs may confer risk for PE and represent interesting regions that warrant further investigation. Top SNP candidates identified from the GWAS, although not genome-wide significant, may be useful to inform future studies in PE genetics.
Wang, Yue; Goh, Wilson; Wong, Limsoon; Montana, Giovanni
Multivariate quantitative traits arise naturally in recent neuroimaging genetics studies, in which both structural and functional variability of the human brain is measured non-invasively through techniques such as magnetic resonance imaging (MRI). There is growing interest in detecting genetic variants associated with such multivariate traits, especially in genome-wide studies. Random forests (RFs) classifiers, which are ensembles of decision trees, are amongst the best performing machine learning algorithms and have been successfully employed for the prioritisation of genetic variants in case-control studies. RFs can also be applied to produce gene rankings in association studies with multivariate quantitative traits, and to estimate genetic similarities measures that are predictive of the trait. However, in studies involving hundreds of thousands of SNPs and high-dimensional traits, a very large ensemble of trees must be inferred from the data in order to obtain reliable rankings, which makes the application of these algorithms computationally prohibitive. We have developed a parallel version of the RF algorithm for regression and genetic similarity learning tasks in large-scale population genetic association studies involving multivariate traits, called PaRFR (Parallel Random Forest Regression). Our implementation takes advantage of the MapReduce programming model and is deployed on Hadoop, an open-source software framework that supports data-intensive distributed applications. Notable speed-ups are obtained by introducing a distance-based criterion for node splitting in the tree estimation process. PaRFR has been applied to a genome-wide association study on Alzheimer's disease (AD) in which the quantitative trait consists of a high-dimensional neuroimaging phenotype describing longitudinal changes in the human brain structure. PaRFR provides a ranking of SNPs associated to this trait, and produces pair-wise measures of genetic proximity that can be directly
Motivation Multivariate quantitative traits arise naturally in recent neuroimaging genetics studies, in which both structural and functional variability of the human brain is measured non-invasively through techniques such as magnetic resonance imaging (MRI). There is growing interest in detecting genetic variants associated with such multivariate traits, especially in genome-wide studies. Random forests (RFs) classifiers, which are ensembles of decision trees, are amongst the best performing machine learning algorithms and have been successfully employed for the prioritisation of genetic variants in case-control studies. RFs can also be applied to produce gene rankings in association studies with multivariate quantitative traits, and to estimate genetic similarities measures that are predictive of the trait. However, in studies involving hundreds of thousands of SNPs and high-dimensional traits, a very large ensemble of trees must be inferred from the data in order to obtain reliable rankings, which makes the application of these algorithms computationally prohibitive. Results We have developed a parallel version of the RF algorithm for regression and genetic similarity learning tasks in large-scale population genetic association studies involving multivariate traits, called PaRFR (Parallel Random Forest Regression). Our implementation takes advantage of the MapReduce programming model and is deployed on Hadoop, an open-source software framework that supports data-intensive distributed applications. Notable speed-ups are obtained by introducing a distance-based criterion for node splitting in the tree estimation process. PaRFR has been applied to a genome-wide association study on Alzheimer's disease (AD) in which the quantitative trait consists of a high-dimensional neuroimaging phenotype describing longitudinal changes in the human brain structure. PaRFR provides a ranking of SNPs associated to this trait, and produces pair-wise measures of genetic proximity
Dadousis, C; Biffani, S; Cipolat-Gotet, C; Nicolazzi, E L; Rosa, G J M; Gianola, D; Rossoni, A; Santus, E; Bittante, G; Cecchinato, A
Cheese production and consumption are increasing in many countries worldwide. As a result, interest has increased in strategies for genetic selection of individuals for technological traits of milk related to cheese yield (CY) in dairy cattle breeding. However, little is known about the genetic background of a cow's ability to produce cheese. Recently, a relatively large panel (1,264 cows) of different measures of individual cow CY and milk nutrient and energy recoveries in the cheese (REC) became available. Genetic analyses showed considerable variation for CY and for aptitude to retain high proportions of fat, protein, and water in the coagulum. For the dairy industry, these characteristics are of major economic importance. Nevertheless, use of this knowledge in dairy breeding is hampered by high costs, intense labor requirement, and lack of appropriate technology. However, in the era of genomics, new possibilities are available for animal breeding and genetic improvement. For example, identification of genomic regions involved in cow CY might provide potential for marker-assisted selection. The objective of this study was to perform genome-wide association studies on different CY and REC measures. Milk and DNA samples from 1,152 Italian Brown Swiss cows were used. Three CY traits expressing the weight (wt) of fresh curd (%CYCURD), curd solids (%CYSOLIDS), and curd moisture (%CYWATER) as a percentage of weight of milk processed, and 4 REC (RECFAT, RECPROTEIN, RECSOLIDS, and RECENERGY, calculated as the % ratio between the nutrient in curd and the corresponding nutrient in processed milk) were analyzed. Animals were genotyped with the Illumina BovineSNP50 Bead Chip v.2. Single marker regressions were fitted using the GenABEL R package (genome-wide association using mixed model and regression-genomic control). In total, 103 significant associations (88 single nucleotide polymorphisms) were identified in 10 chromosomes (2, 6, 9, 11, 12, 14, 18, 19, 27, 28). For
Full Text Available Aquaporins (AQPs are essential membrane proteins that play critical role in the transport of water and many other solutes across cell membranes. In this study, a comprehensive genome-wide analysis identified 40 AQP genes in chickpea (Cicer arietinum L.. A complete overview of the chickpea AQP (CaAQP gene family is presented, including their chromosomal locations, gene structure, phylogeny, gene duplication, conserved functional motifs, gene expression, and conserved promoter motifs. To understand AQP's evolution, a comparative analysis of chickpea AQPs with AQP orthologs from soybean, Medicago, common bean and Arabidopsis was performed. The chickpea AQP genes were found on all of the chickpea chromosomes, except chromosome 7, with a maximum of six genes on chromosome 6 and a minimum of one gene on chromosome 5. Gene duplication analysis indicated that the expansion of chickpea AQP gene family might have been due to segmental and tandem duplications. CaAQPs were grouped into four subfamilies including 15 NOD26-like intrinsic proteins (NIPs, 13 tonoplast intrinsic proteins (TIPs, eight plasma membrane intrinsic proteins (PIPs and four small basic intrinsic proteins (SIPs based on sequence similarities and phylogenetic position. Gene structure analysis revealed a highly conserved exon-intron pattern within CaAQP subfamilies supporting the CaAQP family classification. Functional prediction based on conserved Ar/R selectivity filters, Froger's residues and specificity-determining positions suggested wide differences in substrate specificity among the subfamilies of CaAQPs. Expression analysis of the AQP genes indicated that some of the genes are tissue-specific, whereas few other AQP genes showed differential expression in response to biotic and abiotic stresses. Promoter profiling of CaAQP genes for conserved cis-acting regulatory elements revealed enrichment of cis-elements involved in circadian control, light response, defense and stress
Gary W Beecham
Full Text Available Alzheimer's disease (AD and related dementias are a major public health challenge and present a therapeutic imperative for which we need additional insight into molecular pathogenesis. We performed a genome-wide association study and analysis of known genetic risk loci for AD dementia using neuropathologic data from 4,914 brain autopsies. Neuropathologic data were used to define clinico-pathologic AD dementia or controls, assess core neuropathologic features of AD (neuritic plaques, NPs; neurofibrillary tangles, NFTs, and evaluate commonly co-morbid neuropathologic changes: cerebral amyloid angiopathy (CAA, Lewy body disease (LBD, hippocampal sclerosis of the elderly (HS, and vascular brain injury (VBI. Genome-wide significance was observed for clinico-pathologic AD dementia, NPs, NFTs, CAA, and LBD with a number of variants in and around the apolipoprotein E gene (APOE. GalNAc transferase 7 (GALNT7, ATP-Binding Cassette, Sub-Family G (WHITE, Member 1 (ABCG1, and an intergenic region on chromosome 9 were associated with NP score; and Potassium Large Conductance Calcium-Activated Channel, Subfamily M, Beta Member 2 (KCNMB2 was strongly associated with HS. Twelve of the 21 non-APOE genetic risk loci for clinically-defined AD dementia were confirmed in our clinico-pathologic sample: CR1, BIN1, CLU, MS4A6A, PICALM, ABCA7, CD33, PTK2B, SORL1, MEF2C, ZCWPW1, and CASS4 with 9 of these 12 loci showing larger odds ratio in the clinico-pathologic sample. Correlation of effect sizes for risk of AD dementia with effect size for NFTs or NPs showed positive correlation, while those for risk of VBI showed a moderate negative correlation. The other co-morbid neuropathologic features showed only nominal association with the known AD loci. Our results discovered new genetic associations with specific neuropathologic features and aligned known genetic risk for AD dementia with specific neuropathologic changes in the largest brain autopsy study of AD and related
Robin P Smith
Full Text Available Inter-individual variation in gene regulatory elements is hypothesized to play a causative role in adverse drug reactions and reduced drug activity. However, relatively little is known about the location and function of drug-dependent elements. To uncover drug-associated elements in a genome-wide manner, we performed RNA-seq and ChIP-seq using antibodies against the pregnane X receptor (PXR and three active regulatory marks (p300, H3K4me1, H3K27ac on primary human hepatocytes treated with rifampin or vehicle control. Rifampin and PXR were chosen since they are part of the CYP3A4 pathway, which is known to account for the metabolism of more than 50% of all prescribed drugs. We selected 227 proximal promoters for genes with rifampin-dependent expression or nearby PXR/p300 occupancy sites and assayed their ability to induce luciferase in rifampin-treated HepG2 cells, finding only 10 (4.4% that exhibited drug-dependent activity. As this result suggested a role for distal enhancer modules, we searched more broadly to identify 1,297 genomic regions bearing a conditional PXR occupancy as well as all three active regulatory marks. These regions are enriched near genes that function in the metabolism of xenobiotics, specifically members of the cytochrome P450 family. We performed enhancer assays in rifampin-treated HepG2 cells for 42 of these sequences as well as 7 sequences that overlap linkage-disequilibrium blocks defined by lead SNPs from pharmacogenomic GWAS studies, revealing 15/42 and 4/7 to be functional enhancers, respectively. A common African haplotype in one of these enhancers in the GSTA locus was found to exhibit potential rifampin hypersensitivity. Combined, our results further suggest that enhancers are the predominant targets of rifampin-induced PXR activation, provide a genome-wide catalog of PXR targets and serve as a model for the identification of drug-responsive regulatory elements.
Full Text Available Central obesity, measured by waist circumference (WC or waist-hip ratio (WHR, is a marker of body fat distribution. Although obesity disproportionately affects minority populations, few studies have conducted genome-wide association study (GWAS of fat distribution among those of predominantly African ancestry (AA. We performed GWAS of WC and WHR, adjusted and unadjusted for BMI, in up to 33,591 and 27,350 AA individuals, respectively. We identified loci associated with fat distribution in AA individuals using meta-analyses of GWA results for WC and WHR (stage 1. Overall, 25 SNPs with single genomic control (GC-corrected p-values<5.0 × 10(-6 were followed-up (stage 2 in AA with WC and with WHR. Additionally, we interrogated genomic regions of previously identified European ancestry (EA WHR loci among AA. In joint analysis of association results including both Stage 1 and 2 cohorts, 2 SNPs demonstrated association, rs2075064 at LHX2, p = 2.24×10(-8 for WC-adjusted-for-BMI, and rs6931262 at RREB1, p = 2.48×10(-8 for WHR-adjusted-for-BMI. However, neither signal was genome-wide significant after double GC-correction (LHX2: p = 6.5 × 10(-8; RREB1: p = 5.7 × 10(-8. Six of fourteen previously reported loci for waist in EA populations were significant (p<0.05 divided by the number of independent SNPs within the region in AA studied here (TBX15-WARS2, GRB14, ADAMTS9, LY86, RSPO3, ITPR2-SSPN. Further, we observed associations with metabolic traits: rs13389219 at GRB14 associated with HDL-cholesterol, triglycerides, and fasting insulin, and rs13060013 at ADAMTS9 with HDL-cholesterol and fasting insulin. Finally, we observed nominal evidence for sexual dimorphism, with stronger results in AA women at the GRB14 locus (p for interaction = 0.02. In conclusion, we identified two suggestive loci associated with fat distribution in AA populations in addition to confirming 6 loci previously identified in populations of EA. These findings reinforce the concept
Wuttke, Matthias; Schaefer, Franz; Wong, Craig S; Köttgen, Anna
Prior to conducting genome-wide association studies (GWAS) of renal traits and diseases, systematic checks to ensure data integrity and analytical work flow should be conducted. Using positive controls (ie, known associations between a single-nucleotide polymorphism [SNP] and a corresponding trait) allows for identifying errors that are not apparent solely from global evaluation of summary statistics. Strong genetic control associations of chronic kidney disease (CKD), as derived from GWAS, are lacking in the non-African ancestry CKD population; thus, in this perspective, we provide examples of and considerations for using positive controls among patients with CKD. Using data from individuals with CKD who participated in the CRIC (Chronic Renal Insufficiency Cohort) Study or PediGFR (Pediatric Investigation for Genetic Factors Linked to Renal Progression) Consortium, we evaluated 2 kinds of positive control traits: traits unrelated to kidney function (bilirubin level and body height) and those related to kidney function (cystatin C and urate levels). For the former, the proportion of variance in the control trait that is explained by the control SNP is the main determinant of the strength of the observable association, irrespective of adjustment for kidney function. For the latter, adjustment for kidney function can be effective in uncovering known associations among patients with CKD. For instance, in 1,092 participants in the PediGFR Consortium, the P value for the association of cystatin C concentrations and rs911119 in the CST3 gene decreased from 2.7×10(-3) to 2.4×10(-8) upon adjustment for serum creatinine-based estimated glomerular filtration rate. In this perspective, we give recommendations for the appropriate selection of control traits and SNPs that can be used for data checks prior to conducting GWAS among patients with CKD. Copyright © 2015 National Kidney Foundation, Inc. Published by Elsevier Inc. All rights reserved.
Cho, Michael H; Castaldi, Peter J; Wan, Emily S
The genetic risk factors for chronic obstructive pulmonary disease (COPD) are still largely unknown. To date, genome-wide association studies (GWASs) of limited size have identified several novel risk loci for COPD at CHRNA3/CHRNA5/IREB2, HHIP and FAM13A; additional loci may be identified through...... larger studies. We performed a GWAS using a total of 3499 cases and 1922 control subjects from four cohorts: the Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE); the Normative Aging Study (NAS) and National Emphysema Treatment Trial (NETT); Bergen, Norway (Gen...
Angarica, Vladimir Espinosa; Del Sol, Antonio
Epigenetics play a central role in the regulation of many important cellular processes, and dysregulations at the epigenetic level could be the source of serious pathologies, such as neurological disorders affecting brain development, neurodegeneration, and intellectual disability. Despite significant technological advances for epigenetic profiling, there is still a need for a systematic understanding of how epigenetics shapes cellular circuitry, and disease pathogenesis. The development of accurate computational approaches for analyzing complex epigenetic profiles is essential for disentangling the mechanisms underlying cellular development, and the intricate interaction networks determining and sensing chromatin modifications and DNA methylation to control gene expression. In this chapter, we review the recent advances in the field of "computational epigenetics," including computational methods for processing different types of epigenetic data, prediction of chromatin states, and study of protein dynamics. We also discuss how "computational epigenetics" has complemented the fast growth in the generation of epigenetic data for uncovering the main differences and similarities at the epigenetic level between individuals and the mechanisms underlying disease onset and progression.
Sprung, Carl N; Yang, Yuqing; Forrester, Helen B; Li, Jason; Zaitseva, Marina; Cann, Leonie; Restall, Tina; Anderson, Robin L; Crosbie, Jeffrey C; Rogers, Peter A W
The majority of cancer patients achieve benefit from radiotherapy. A significant limitation of radiotherapy is its relatively low therapeutic index, defined as the maximum radiation dose that causes acceptable normal tissue damage to the minimum dose required to achieve tumor control. Recently, a new radiotherapy modality using synchrotron-generated X-ray microbeam radiotherapy has been demonstrated in animal models to ablate tumors with concurrent sparing of normal tissue. Very little work has been undertaken into the cellular and molecular mechanisms that differentiate microbeam radiotherapy from broad beam. The purpose of this study was to investigate and compare the whole genome transcriptional response of in vivo microbeam radiotherapy versus broad beam irradiated tumors. We hypothesized that gene expression changes after microbeam radiotherapy are different from those seen after broad beam. We found that in EMT6.5 tumors at 4-48 h postirradiation, microbeam radiotherapy differentially regulates a number of genes, including major histocompatibility complex (MHC) class II antigen gene family members, and other immunity-related genes including Ciita, Ifng, Cxcl1, Cxcl9, Indo and Ubd when compared to broad beam. Our findings demonstrate molecular differences in the tumor response to microbeam versus broad beam irradiation and these differences provide insight into the underlying mechanisms of microbeam radiotherapy and broad beam.
Lewsey, Mathew G; Hardcastle, Thomas J; Melnyk, Charles W; Molnar, Attila; Valli, Adrián; Urich, Mark A; Nery, Joseph R; Baulcombe, David C; Ecker, Joseph R
RNA silencing at the transcriptional and posttranscriptional levels regulates endogenous gene expression, controls invading transposable elements (TEs), and protects the cell against viruses. Key components of the mechanism are small RNAs (sRNAs) of 21-24 nt that guide the silencing machinery to their nucleic acid targets in a nucleotide sequence-specific manner. Transcriptional gene silencing is associated with 24-nt sRNAs and RNA-directed DNA methylation (RdDM) at cytosine residues in three DNA sequence contexts (CG, CHG, and CHH). We previously demonstrated that 24-nt sRNAs are mobile from shoot to root in Arabidopsis thaliana and confirmed that they mediate DNA methylation at three sites in recipient cells. In this study, we extend this finding by demonstrating that RdDM of thousands of loci in root tissues is dependent upon mobile sRNAs from the shoot and that mobile sRNA-dependent DNA methylation occurs predominantly in non-CG contexts. Mobile sRNA-dependent non-CG methylation is largely dependent on the DOMAINS REARRANGED METHYLTRANSFERASES 1/2 (DRM1/DRM2) RdDM pathway but is independent of the CHROMOMETHYLASE (CMT)2/3 DNA methyltransferases. Specific superfamilies of TEs, including those typically found in gene-rich euchromatic regions, lose DNA methylation in a mutant lacking 22- to 24-nt sRNAs (dicer-like 2, 3, 4 triple mutant). Transcriptome analyses identified a small number of genes whose expression in roots is associated with mobile sRNAs and connected to DNA methylation directly or indirectly. Finally, we demonstrate that sRNAs from shoots of one accession move across a graft union and target DNA methylation de novo at normally unmethylated sites in the genomes of root cells from a different accession.
Full Text Available Anxiety disorders (ADs are common mental disorders caused by a combination of genetic and environmental factors. Since ADs are highly comorbid with each other, partially due to shared genetic basis, studying AD phenotypes in a coordinated manner may be a powerful strategy for identifying potential genetic loci for ADs. To detect these loci, we performed genome-wide association studies (GWAS of ADs. In addition, as a complementary approach to single-locus analysis, we also conducted gene- and pathway-based analyses. GWAS data were derived from the control sample of the Molecular Genetics of Schizophrenia (MGS project (2,540 European American and 849 African American subjects genotyped on the Affymetrix GeneChip 6.0 array. We applied two phenotypic approaches: (1 categorical case-control comparisons (CC based upon psychiatric diagnoses, and (2 quantitative phenotypic factor scores (FS derived from a multivariate analysis combining information across the clinical phenotypes. Linear and logistic models were used to analyse the association with ADs using FS and CC traits, respectively. At the single locus level, no genome-wide significant association was found. A trans-population gene-based meta-analysis across both ethnic subsamples using FS identified three genes (MFAP3L on 4q32.3, NDUFAB1 and PALB2 on 16p12 with genome-wide significance (false discovery rate (FDR] <5%. At the pathway level, several terms such as transcription regulation, cytokine binding, and developmental process were significantly enriched in ADs (FDR <5%. Our approaches studying ADs as quantitative traits and utilizing the full GWAS data may be useful in identifying susceptibility genes and pathways for ADs.
Liu, Xiaohua; Kelsoe, John R; Greenwood, Tiffany A
Bipolar disorder is a heterogeneous mood disorder associated with several important clinical comorbidities, such as eating disorders. This clinical heterogeneity complicates the identification of genetic variants contributing to bipolar susceptibility. Here we investigate comorbidity of eating disorders as a subphenotype of bipolar disorder to identify genetic variation that is common and unique to both disorders. We performed a genome-wide association analysis contrasting 184 bipolar subjects with eating disorder comorbidity against both 1370 controls and 2006 subjects with bipolar disorder only from the Bipolar Genome Study (BiGS). The most significant genome-wide finding was observed bipolar with comorbid eating disorder vs. controls within SOX2-OT (p=8.9×10(-8) for rs4854912) with a secondary peak in the adjacent FXR1 gene (p=1.2×10(-6) for rs1805576) on chromosome 3q26.33. This region was also the most prominent finding in the case-only analysis (p=3.5×10(-7) and 4.3×10(-6), respectively). Several regions of interest containing genes involved in neurodevelopment and neuroprotection processes were also identified. While our primary finding did not quite reach genome-wide significance, likely due to the relatively limited sample size, these results can be viewed as a replication of a recent study of eating disorders in a large cohort. These findings replicate the prior association of SOX2-OT with eating disorders and broadly support the involvement of neurodevelopmental/neuroprotective mechanisms in the pathophysiology of both disorders. They further suggest that different clinical manifestations of bipolar disorder may reflect differential genetic contributions and argue for the utility of clinical subphenotypes in identifying additional molecular pathways leading to illness. Copyright © 2015 Elsevier B.V. All rights reserved.
Kote-Jarai, Zsofia; Olama, Ali Amin Al; Giles, Graham G
Prostate cancer (PrCa) is the most frequently diagnosed male cancer in developed countries. We conducted a multi-stage genome-wide association study for PrCa and previously reported the results of the first two stages, which identified 16 PrCa susceptibility loci. We report here the results...... of stage 3, in which we evaluated 1,536 SNPs in 4,574 individuals with prostate cancer (cases) and 4,164 controls. We followed up ten new association signals through genotyping in 51,311 samples in 30 studies from the Prostate Cancer Association Group to Investigate Cancer Associated Alterations...... in the Genome (PRACTICAL) consortium. In addition to replicating previously reported loci, we identified seven new prostate cancer susceptibility loci on chromosomes 2p11, 3q23, 3q26, 5p12, 6p21, 12q13 and Xq12 (P = 4.0 × 10(-8) to P = 2.7 × 10(-24)). We also identified a SNP in TERT more strongly associated...
Full Text Available Pancreatic cancer shows very poor prognosis and is the fifth leading cause of cancer death in Japan. Previous studies indicated some genetic factors contributing to the development and progression of pancreatic cancer; however, there are limited reports for common genetic variants to be associated with this disease, especially in the Asian population. We have conducted a genome-wide association study (GWAS using 991 invasive pancreatic ductal adenocarcinoma cases and 5,209 controls, and identified three loci showing significant association (P-value<5x10(-7 with susceptibility to pancreatic cancer. The SNPs that showed significant association carried estimated odds ratios of 1.29, 1.32, and 3.73 with 95% confidence intervals of 1.17-1.43, 1.19-1.47, and 2.24-6.21; P-value of 3.30x10(-7, 3.30x10(-7, and 4.41x10(-7; located on chromosomes 6p25.3, 12p11.21 and 7q36.2, respectively. These associated SNPs are located within linkage disequilibrium blocks containing genes that have been implicated some roles in the oncogenesis of pancreatic cancer.
Wendler, Jason P; Okombo, John; Amato, Roberto; Miotto, Olivo; Kiara, Steven M; Mwai, Leah; Pole, Lewa; O'Brien, John; Manske, Magnus; Alcock, Dan; Drury, Eleanor; Sanders, Mandy; Oyola, Samuel O; Malangone, Cinzia; Jyothi, Dushyanth; Miles, Alistair; Rockett, Kirk A; MacInnis, Bronwyn L; Marsh, Kevin; Bejon, Philip; Nzila, Alexis; Kwiatkowski, Dominic P
Drug resistance remains a chief concern for malaria control. In order to determine the genetic markers of drug resistant parasites, we tested the genome-wide associations (GWA) of sequence-based genotypes from 35 Kenyan P. falciparum parasites with the activities of 22 antimalarial drugs. Parasites isolated from children with acute febrile malaria were adapted to culture, and sensitivity was determined by in vitro growth in the presence of anti-malarial drugs. Parasites were genotyped using whole genome sequencing techniques. Associations between 6250 single nucleotide polymorphisms (SNPs) and resistance to individual anti-malarial agents were determined, with false discovery rate adjustment for multiple hypothesis testing. We identified expected associations in the pfcrt region with chloroquine (CQ) activity, and other novel loci associated with amodiaquine, quinazoline, and quinine activities. Signals for CQ and primaquine (PQ) overlap in and around pfcrt, and interestingly the phenotypes are inversely related for these two drugs. We catalog the variation in dhfr, dhps, mdr1, nhe, and crt, including novel SNPs, and confirm the presence of a dhfr-164L quadruple mutant in coastal Kenya. Mutations implicated in sulfadoxine-pyrimethamine resistance are at or near fixation in this sample set. Sequence-based GWA studies are powerful tools for phenotypic association tests. Using this approach on falciparum parasites from coastal Kenya we identified known and previously unreported genes associated with phenotypic resistance to anti-malarial drugs, and observe in high-resolution haplotype visualizations a possible signature of an inverse selective relationship between CQ and PQ.
Michel, Audrey M; Baranov, Pavel V
Ribosome profiling or ribo-seq is a new technique that provides genome-wide information on protein synthesis (GWIPS) in vivo. It is based on the deep sequencing of ribosome protected mRNA fragments allowing the measurement of ribosome density along all RNA molecules present in the cell. At the same time, the high resolution of this technique allows detailed analysis of ribosome density on individual RNAs. Since its invention, the ribosome profiling technique has been utilized in a range of studies in both prokaryotic and eukaryotic organisms. Several studies have adapted and refined the original ribosome profiling protocol for studying specific aspects of translation. Ribosome profiling of initiating ribosomes has been used to map sites of translation initiation. These studies revealed the surprisingly complex organization of translation initiation sites in eukaryotes. Multiple initiation sites are responsible for the generation of N-terminally extended and truncated isoforms of known proteins as well as for the translation of numerous open reading frames (ORFs), upstream of protein coding ORFs. Ribosome profiling of elongating ribosomes has been used for measuring differential gene expression at the level of translation, the identification of novel protein coding genes and ribosome pausing. It has also provided data for developing quantitative models of translation. Although only a dozen or so ribosome profiling datasets have been published so far, they have already dramatically changed our understanding of translational control and have led to new hypotheses regarding the origin of protein coding genes. © 2013 John Wiley & Sons, Ltd. PMID:23696005
Spötter, Andreas; Gupta, Pooja; Mayer, Manfred; Reinsch, Norbert
Honey bees are exposed to many damaging pathogens and parasites. The most devastating is Varroa destructor, which mainly affects the brood. A promising approach for preventing its spread is to breed Varroa-resistant honey bees. One trait that has been shown to provide significant resistance against the Varroa mite is hygienic behavior, which is a behavioral response of honeybee workers to brood diseases in general. Here, we report the use of an Affymetrix 44K SNP array to analyze SNPs associated with detection and uncapping of Varroa-parasitized brood by individual worker bees (Apis mellifera). For this study, 22 000 individually labeled bees were video-monitored and a sample of 122 cases and 122 controls was collected and analyzed to determine the dependence/independence of SNP genotypes from hygienic and nonhygienic behavior on a genome-wide scale. After false-discovery rate correction of the P values, 6 SNP markers had highly significant associations with the trait investigated (α < 0.01). Inspection of the genomic regions around these SNPs led to the discovery of putative candidate genes. PMID:26774061
Cho, Seoae; Kim, Haseong; Oh, Sohee; Kim, Kyunga; Park, Taesung
The current trend in genome-wide association studies is to identify regions where the true disease-causing genes may lie by evaluating thousands of single-nucleotide polymorphisms (SNPs) across the whole genome. However, many challenges exist in detecting disease-causing genes among the thousands of SNPs. Examples include multicollinearity and multiple testing issues, especially when a large number of correlated SNPs are simultaneously tested. Multicollinearity can often occur when predictor variables in a multiple regression model are highly correlated, and can cause imprecise estimation of association. In this study, we propose a simple stepwise procedure that identifies disease-causing SNPs simultaneously by employing elastic-net regularization, a variable selection method that allows one to address multicollinearity. At Step 1, the single-marker association analysis was conducted to screen SNPs. At Step 2, the multiple-marker association was scanned based on the elastic-net regularization. The proposed approach was applied to the rheumatoid arthritis (RA) case-control data set of Genetic Analysis Workshop 16. While the selected SNPs at the screening step are located mostly on chromosome 6, the elastic-net approach identified putative RA-related SNPs on other chromosomes in an increased proportion. For some of those putative RA-related SNPs, we identified the interactions with sex, a well known factor affecting RA susceptibility.
Nimmakayala, Padma; Abburi, Venkata L; Saminathan, Thangasamy; Alaparthi, Suresh B; Almeida, Aldo; Davenport, Brittany; Nadimi, Marjan; Davidson, Joshua; Tonapi, Krittika; Yadav, Lav; Malkaram, Sridhar; Vajja, Gopinath; Hankins, Gerald; Harris, Robert; Park, Minkyu; Choi, Doil; Stommel, John; Reddy, Umesh K
Accumulated capsaicinoid content and increased fruit size are traits resulting from Capsicum annuum domestication. In this study, we used a diverse collection of C. annuum to generate 66,960 SNPs using genotyping by sequencing. The study identified 1189 haplotypes containing 3413 SNPs. Length of individual linkage disequilibrium (LD) blocks varied along chromosomes, with regions of high and low LD interspersed with an average LD of 139 kb. Principal component analysis (PCA), Bayesian model based population structure analysis and an Euclidean tree built based on identity by state (IBS) indices revealed that the clustering pattern of diverse accessions are in agreement with capsaicin content (CA) and fruit weight (FW) classifications indicating the importance of these traits in shaping modern pepper genome. PCA and IBS were used in a mixed linear model of capsaicin and dihydrocapsaicin content and fruit weight to reduce spurious associations because of confounding effects of subpopulations in genome-wide association study (GWAS). Our GWAS results showed SNPs in Ankyrin-like protein, IKI3 family protein, ABC transporter G family and pentatricopeptide repeat protein are the major markers for capsaicinoids and of 16 SNPs strongly associated with FW in both years of the study, 7 are located in known fruit weight controlling genes.
Kassam, Mohamed; Duvernois-Berthet, Evelyne; Cortijo, Sandra; Takashima, Kazuya; Saze, Hidetoshi; Toyoda, Atsushi; Fujiyama, Asao; Colot, Vincent; Kakutani, Tetsuji
Epigenetic variations of phenotypes, especially those associated with DNA methylation, are often inherited over multiple generations in plants. The active and inactive chromatin states are heritable and can be maintained or even be amplified by positive feedback in a transgenerational manner. However, mechanisms controlling the transgenerational DNA methylation dynamics are largely unknown. As an approach to understand the transgenerational dynamics, we examined long-term effect of impaired DNA methylation in Arabidopsis mutants of the chromatin remodeler gene DDM1 (Decrease in DNA Methylation 1) through whole genome DNA methylation sequencing. The ddm1 mutation induces a drastic decrease in DNA methylation of transposable elements (TEs) and repeats in the initial generation, while also inducing ectopic DNA methylation at hundreds of loci. Unexpectedly, this ectopic methylation can only be seen after repeated self-pollination. The ectopic cytosine methylation is found primarily in the non-CG context and starts from 3’ regions within transcription units and spreads upstream. Remarkably, when chromosomes with reduced DNA methylation were introduced from a ddm1 mutant into a DDM1 wild-type background, the ddm1-derived chromosomes also induced analogous de novo accumulation of DNA methylation in trans. These results lead us to propose a model to explain the transgenerational DNA methylation redistribution by genome-wide negative feedback. The global negative feedback, together with local positive feedback, would ensure robust and balanced differentiation of chromatin states within the genome. PMID:25902052
Full Text Available Down syndrome (DS is caused by triplication of Human chromosome 21 (Hsa21 and associated with an array of deleterious phenotypes, including mental retardation, heart defects and immunodeficiency. Genome-wide expression patterns of uncultured peripheral blood cells are useful to understanding of DS-associated immune dysfunction. We used a Human Exon microarray to characterize gene expression in uncultured peripheral blood cells derived from DS individuals and age-matched controls from two age groups: neonate (N and child (C. A total of 174 transcript clusters (gene-level with eight located on Hsa21 in N group and 383 transcript clusters including 56 on Hsa21 in C group were significantly dysregulated in DS individuals. Microarray data were validated by quantitative polymerase chain reaction. Functional analysis revealed that the dysregulated genes in DS were significantly enriched in two and six KEGG pathways in N and C group, respectively. These pathways included leukocyte trans-endothelial migration, B cell receptor signaling pathway and primary immunodeficiency, etc., which causally implicated dysfunctional immunity in DS. Our results provided a comprehensive picture of gene expression patterns in DS at the two developmental stages and pointed towards candidate genes and molecular pathways potentially associated with the immune dysfunction in DS.
Li, Chong; Jin, Lei; Bai, Yun; Chen, Qimin; Fu, Lijun; Yang, Minjun; Xiao, Huasheng; Zhao, Guoping; Wang, Shengyue
Down syndrome (DS) is caused by triplication of Human chromosome 21 (Hsa21) and associated with an array of deleterious phenotypes, including mental retardation, heart defects and immunodeficiency. Genome-wide expression patterns of uncultured peripheral blood cells are useful to understanding of DS-associated immune dysfunction. We used a Human Exon microarray to characterize gene expression in uncultured peripheral blood cells derived from DS individuals and age-matched controls from two age groups: neonate (N) and child (C). A total of 174 transcript clusters (gene-level) with eight located on Hsa21 in N group and 383 transcript clusters including 56 on Hsa21 in C group were significantly dysregulated in DS individuals. Microarray data were validated by quantitative polymerase chain reaction. Functional analysis revealed that the dysregulated genes in DS were significantly enriched in two and six KEGG pathways in N and C group, respectively. These pathways included leukocyte trans-endothelial migration, B cell receptor signaling pathway and primary immunodeficiency, etc., which causally implicated dysfunctional immunity in DS. Our results provided a comprehensive picture of gene expression patterns in DS at the two developmental stages and pointed towards candidate genes and molecular pathways potentially associated with the immune dysfunction in DS.
Marques, Catarina A; Dickens, Nicholas J; Paape, Daniel; Campbell, Samantha J; McCulloch, Richard
DNA replication initiates on defined genome sites, termed origins. Origin usage appears to follow common rules in the eukaryotic organisms examined to date: all chromosomes are replicated from multiple origins, which display variations in firing efficiency and are selected from a larger pool of potential origins. To ask if these features of DNA replication are true of all eukaryotes, we describe genome-wide origin mapping in the parasite Leishmania. Origin mapping in Leishmania suggests a striking divergence in origin usage relative to characterized eukaryotes, since each chromosome appears to be replicated from a single origin. By comparing two species of Leishmania, we find evidence that such origin singularity is maintained in the face of chromosome fusion or fission events during evolution. Mapping Leishmania origins suggests that all origins fire with equal efficiency, and that the genomic sites occupied by origins differ from related non-origins sites. Finally, we provide evidence that origin location in Leishmania displays striking conservation with Trypanosoma brucei, despite the latter parasite replicating its chromosomes from multiple, variable strength origins. The demonstration of chromosome replication for a single origin in Leishmania, a microbial eukaryote, has implications for the evolution of origin multiplicity and associated controls, and may explain the pervasive aneuploidy that characterizes Leishmania chromosome architecture.
Full Text Available Aromatase inhibitors (AI are extensively used in the treatment of estrogen receptor-positive breast cancers, however resistance to AI treatment is commonly observed. Apart from Estrogen receptor (ERα expression, no predictive biomarkers for response to AI treatment are clinically applied. Yet, since other therapeutic options exist in the clinic, such as tamoxifen, there is an urgent medical need for the development of treatment-selective biomarkers, enabling personalized endocrine treatment selection in breast cancer. In the described dataset, ERα chromatin binding and histone marks H3K4me3 and H3K27me3 were assessed in a genome-wide manner by Chromatin Immunoprecipitation (ChIP combined with massive parallel sequencing (ChIP-seq. These datasets were used to develop a classifier to stratify breast cancer patients on outcome after AI treatment in the metastatic setting. Here we describe in detail the data and quality control metrics, as well as the clinical information associated with the study, published by Jansen et al. . The data is publicly available through the GEO database with accession number GSE40867.
Muslihudeen Abdul-Razaq Abdul-Aziz
Full Text Available As our understanding of the human microbiome expands, impacts on health and disease continue to be revealed. Alterations in the microbiome can result in dysbiosis, which has now been linked to subsequent autoimmune and metabolic diseases, highlighting the need to identify factors that shape the microbiome. Research has identified that the composition and functions of the human microbiome can be influenced by diet, age, gender, and environment. More recently, studies have explored how human genetic variation may also influence the microbiome. Here, we review several recent analytical advances in this new research area, including those that use genome-wide association studies to examine host genome-microbiome interactions, while controlling for the influence of other factors. We find that current research is limited by small sample sizes, lack of cohort replication, and insufficient confirmatory mechanistic studies. In addition, we discuss the importance of understanding long-term interactions between the host genome and microbiome, as well as the potential impacts of disrupting this relationship, and explore new research avenues that may provide information about the co-evolutionary history of humans and their microorganisms.
Wei, Yu-Chung; Wen, Shu-Hui; Chen, Pei-Chun; Wang, Chih-Hao; Hsiao, Chuhsing K
Genome-wide association studies often face the undesirable result of either failing to detect any influential markers at all because of a stringent level for testing error corrections or encountering difficulty in quantifying the importance of markers by their P-values. Advocates of estimation procedures prefer to estimate the proportion of association rather than test significance to avoid overinterpretation. Here, we adopt a Bayesian hierarchical mixture model to estimate directly the proportion of influential markers, and then proceed to a selection procedure based on the Bayes factor (BF). This mixture model is able to accommodate different sources of dependence in the data through only a few parameters. Specifically, we focus on a standardized risk measure of unit variance so that fewer parameters are involved in inference. The expected value of this measure follows a mixture distribution with a mixing probability of association, and it is robust to minor allele frequencies. Furthermore, to select promising markers, we use the magnitude of the BF to represent the strength of evidence in support of the association between markers and disease. We demonstrate this procedure both with simulations and with SNP data from studies on rheumatoid arthritis, coronary artery disease, and Crohn's disease obtained from the Wellcome Trust Case–Control Consortium. This Bayesian procedure outperforms other existing methods in terms of accuracy, power, and computational efficiency. The R code that implements this method is available at http://homepage.ntu.edu.tw/~ckhsiao/Bmix/Bmix.htm. PMID:20407469
Alan L Hutchison
Full Text Available Robust methods for identifying patterns of expression in genome-wide data are important for generating hypotheses regarding gene function. To this end, several analytic methods have been developed for detecting periodic patterns. We improve one such method, JTK_CYCLE, by explicitly calculating the null distribution such that it accounts for multiple hypothesis testing and by including non-sinusoidal reference waveforms. We term this method empirical JTK_CYCLE with asymmetry search, and we compare its performance to JTK_CYCLE with Bonferroni and Benjamini-Hochberg multiple hypothesis testing correction, as well as to five other methods: cyclohedron test, address reduction, stable persistence, ANOVA, and F24. We find that ANOVA, F24, and JTK_CYCLE consistently outperform the other three methods when data are limited and noisy; empirical JTK_CYCLE with asymmetry search gives the greatest sensitivity while controlling for the false discovery rate. Our analysis also provides insight into experimental design and we find that, for a fixed number of samples, better sensitivity and specificity are achieved with higher numbers of replicates than with higher sampling density. Application of the methods to detecting circadian rhythms in a metadataset of microarrays that quantify time-dependent gene expression in whole heads of Drosophila melanogaster reveals annotations that are enriched among genes with highly asymmetric waveforms. These include a wide range of oxidation reduction and metabolic genes, as well as genes with transcripts that have multiple splice forms.
Full Text Available Ovarian cancer has a poor prognosis due to intrinsic or acquired resistance to some cytotoxic drugs, raising the interest in new DNA-binding agents such as mithramycin analogues as potential chemotherapeutic agents in gynecological cancer. Using a genome-wide approach, we have analyzed gene expression in A2780 human ovarian carcinoma cells treated with the novel mithramycin analogue DIG-MSK (demycarosyl-3D-β-D-digitoxosyl-mithramycin SK that binds to C+G-rich DNA sequences. Nanomolar concentrations of DIG-MSK abrogated the expression of genes involved in a variety of cell processes including transcription regulation and tumor development, which resulted in cell death. Some of those genes have been associated with cell proliferation and poor prognosis in ovarian cancer. Sp1 transcription factor regulated most of the genes that were down-regulated by the drug, as well as the up-regulation of other genes mainly involved in response to cell stress. The effect of DIG-MSK in the control of gene expression by other transcription factors was also explored. Some of them, such as CREB, E2F and EGR1, also recognize C/G-rich regions in gene promoters, which encompass potential DIG-MSK binding sites. DIG-MSK affected several biological processes and molecular functions related to transcription and its cellular regulation in A2780 cells, including transcription factor activity. This new compound might be a promising drug for the treatment of ovarian cancer.
Li, Yongsheng; Camarillo, Cynthia; Xu, Juan; Arana, Tania Bedard; Xiao, Yun; Zhao, Zheng; Chen, Hong; Ramirez, Mercedes; Zavala, Juan; Escamilla, Michael A.; Armas, Regina; Mendoza, Ricardo; Ontiveros, Alfonso; Nicolini, Humberto; Jerez Magaña, Alvaro Antonio; Rubin, Lewis P.; Li, Xia; Xu, Chun
Schizophrenia (SZ) and bipolar disorder (BP) are complex genetic disorders. Their appearance is also likely informed by as yet only partially described epigenetic contributions. Using a sequencing-based method for genome-wide analysis, we quantitatively compared the blood DNA methylation landscapes in SZ and BP subjects to control, both in an understudied population, Hispanics along the US-Mexico border. Remarkably, we identified thousands of differentially methylated regions for SZ and BP preferentially located in promoters 3′-UTRs and 5′-UTRs of genes. Distinct patterns of aberrant methylation of promoter sequences were located surrounding transcription start sites. In these instances, aberrant methylation occurred in CpG islands (CGIs) as well as in flanking regions as well as in CGI sparse promoters. Pathway analysis of genes displaying these distinct aberrant promoter methylation patterns showed enhancement of epigenetic changes in numerous genes previously related to psychiatric disorders and neurodevelopment. Integration of gene expression data further suggests that in SZ aberrant promoter methylation is significantly associated with altered gene transcription. In particular, we found significant associations between (1) promoter CGIs hypermethylation with gene repression and (2) CGI 3′-shore hypomethylation with increased gene expression. Finally, we constructed a specific methylation analysis platform that facilitates viewing and comparing aberrant genome methylation in human neuropsychiatric disorders. PMID:25734057
Qian, Meng; Shao, Yongzhao
Most existing association tests for genome-wide association studies (GWASs) fail to account for genetic heterogeneity. Zhou and Pan proposed a binomial-mixture-model-based association test to account for the possible genetic heterogeneity in case-control studies. The idea is elegant, however, the proposed test requires an expectation-maximization (EM)-type iterative algorithm to identify the penalised maximum likelihood estimates and a permutation method to assess p-values. The intensive computational burden induced by the EM-algorithm and the permutation becomes prohibitive for direct applications to GWASs. This paper develops a likelihood ratio test (LRT) for GWASs under genetic heterogeneity based on a more general alternative mixture model. In particular, a closed-form formula for the LRT statistic is derived to avoid the EM-type iterative numerical evaluation. Moreover, an explicit asymptotic null distribution is also obtained, which avoids using the permutation to obtain p-values. Thus, the proposed LRT is easy to implement for GWASs. Furthermore, numerical studies demonstrate that the LRT has power advantages over the commonly used Armitage trend test and other existing association tests under genetic heterogeneity. A breast cancer GWAS dataset is used to illustrate the newly proposed LRT. © 2013 Blackwell Publishing Ltd/University College London.
Bukszár, József; McClay, Joseph L; van den Oord, Edwin J C G
A limitation of current methods used to declare significance in genome-wide association studies (GWAS) is that they do not provide clear information about the probability that GWAS findings are true of false. This lack of information increases the chance of false discoveries and may result in real effects being missed. We propose a method to estimate the posterior probability that a marker has (no) effect given its test statistic value, also called the local false discovery rate (FDR), in the GWAS. A critical step involves the estimation the parameters of the distribution of the true alternative tests. For this, we derived and implemented the real maximum likelihood function, which turned out to provide us with significantly more accurate estimates than the widely used mixture model likelihood. Actual GWAS data are used to illustrate properties of the posterior probability estimates empirically. In addition to evaluating individual markers, a variety of applications are conceivable. For instance, posterior probability estimates can be used to control the FDR more precisely than Benjamini-Hochberg procedure. The codes are freely downloadable from the web site http://www.people.vcu.edu/~jbukszar.
Full Text Available A ligand-activated transcription factor aryl hydrocarbon receptor (AhR is recently revealed to play a key role in embryogenesis and tumorigenesis (Feng et al. , Safe et al.  and 2-(1′H-indole-3′-carbonyl-thiazole-4-carboxylic acid methyl ester (ITE (Song et al.  is an endogenous AhR ligand that possesses anti-tumor activity. In order to gain insights into how ITE acts via the AhR in embryogenesis and tumorigenesis, we analyzed the genome-wide transcriptional profiles of the following three groups of cells: the human glioblastoma U87 parental cells, U87 tumor sphere cells treated with vehicle (DMSO and U87 tumor sphere cells treated with ITE. Here, we provide the details of the sample gathering strategy and show the quality controls and the analyses associated with our gene array data deposited into the Gene Expression Omnibus (GEO under the accession code of GSE67986.
Full Text Available The analysis of genome-wide association studies (GWAS benefits from the investigation of biologically meaningful gene sets, such as gene-interaction networks (pathways. We propose an extension to a successful kernel-based pathway analysis approach by integrating kernel functions into a powerful algorithmic framework for variable selection, to enable investigation of multiple pathways simultaneously. We employ genetic similarity kernels from the logistic kernel machine test (LKMT as base-learners in a boosting algorithm. A model to explain case-control status is created iteratively by selecting pathways that improve its prediction ability. We evaluated our method in simulation studies adopting 50 pathways for different sample sizes and genetic effect strengths. Additionally, we included an exemplary application of kernel boosting to a rheumatoid arthritis and a lung cancer dataset. Simulations indicate that kernel boosting outperforms the LKMT in certain genetic scenarios. Applications to GWAS data on rheumatoid arthritis and lung cancer resulted in sparse models which were based on pathways interpretable in a clinical sense. Kernel boosting is highly flexible in terms of considered variables and overcomes the problem of multiple testing. Additionally, it enables the prediction of clinical outcomes. Thus, kernel boosting constitutes a new, powerful tool in the analysis of GWAS data and towards the understanding of biological processes involved in disease susceptibility.
Full Text Available Cyclins play important roles in cell division and cell expansion. They also interact with cyclin-dependent kinases to control cell cycle progression in plants. Our genome-wide analysis identified 52 expressed cyclin genes in tomato. Phylogenetic analysis of the deduced amino sequences of tomato and Arabidopsis cyclin genes divided them into 10 types, A-, B-, C-, D-, H-, L-, T-, U-, SDS- and J18. Pfam analysis indicated that most tomato cyclins contain a cyclin-N domain. C-, H- and J18 types only contain a cyclin-C domain, and U-type cyclins contain another potential cyclin domain. All of the cyclin genes are distributed throughout the tomato genome except for chromosome 8, and 30 of them were found to be segmentally duplicated; they are found on the duplicate segments of chromosome 1, 2, 3, 4, 5, 6, 10, 11 and 12, suggesting that tomato cyclin genes experienced a mass of segmental duplication. Quantitative real-time polymerase chain reaction analysis indicates that the expression patterns of tomato cyclin genes were significantly different in vegetative and reproductive stages. Transcription of most cyclin genes can be enhanced or repressed by exogenous application of gibberellin, which implies that gibberellin maybe a direct regulator of cyclin genes. The study presented here may be useful as a guide for further functional research on tomato cyclins.
Kristin D Kasschau
Full Text Available Eukaryotes contain a diversified set of small RNA-guided pathways that control genes, repeated sequences, and viruses at the transcriptional and posttranscriptional levels. Genome-wide profiles and analyses of small RNAs, particularly the large class of 24-nucleotide (nt short interfering RNAs (siRNAs, were done for wild-type Arabidopsis thaliana and silencing pathway mutants with defects in three RNA-dependent RNA polymerase (RDR and four Dicer-like (DCL genes. The profiling involved direct analysis using a multiplexed, parallel-sequencing strategy. Small RNA-generating loci, especially those producing predominantly 24-nt siRNAs, were found to be highly correlated with repetitive elements across the genome. These were found to be largely RDR2- and DCL3-dependent, although alternative DCL activities were detected on a widespread level in the absence of DCL3. In contrast, no evidence for RDR2-alternative activities was detected. Analysis of RDR2- and DCL3-dependent small RNA accumulation patterns in and around protein-coding genes revealed that upstream gene regulatory sequences systematically lack siRNA-generating activities. Further, expression profiling suggested that relatively few genes, proximal to abundant 24-nt siRNAs, are regulated directly by RDR2- and DCL3-dependent silencing. We conclude that the widespread accumulation patterns for RDR2- and DCL3-dependent siRNAs throughout the Arabidopsis genome largely reflect mechanisms to silence highly repeated sequences.
Ito, Tasuku; Tarutani, Yoshiaki; To, Taiko Kim; Kassam, Mohamed; Duvernois-Berthet, Evelyne; Cortijo, Sandra; Takashima, Kazuya; Saze, Hidetoshi; Toyoda, Atsushi; Fujiyama, Asao; Colot, Vincent; Kakutani, Tetsuji
Epigenetic variations of phenotypes, especially those associated with DNA methylation, are often inherited over multiple generations in plants. The active and inactive chromatin states are heritable and can be maintained or even be amplified by positive feedback in a transgenerational manner. However, mechanisms controlling the transgenerational DNA methylation dynamics are largely unknown. As an approach to understand the transgenerational dynamics, we examined long-term effect of impaired DNA methylation in Arabidopsis mutants of the chromatin remodeler gene DDM1 (Decrease in DNA Methylation 1) through whole genome DNA methylation sequencing. The ddm1 mutation induces a drastic decrease in DNA methylation of transposable elements (TEs) and repeats in the initial generation, while also inducing ectopic DNA methylation at hundreds of loci. Unexpectedly, this ectopic methylation can only be seen after repeated self-pollination. The ectopic cytosine methylation is found primarily in the non-CG context and starts from 3' regions within transcription units and spreads upstream. Remarkably, when chromosomes with reduced DNA methylation were introduced from a ddm1 mutant into a DDM1 wild-type background, the ddm1-derived chromosomes also induced analogous de novo accumulation of DNA methylation in trans. These results lead us to propose a model to explain the transgenerational DNA methylation redistribution by genome-wide negative feedback. The global negative feedback, together with local positive feedback, would ensure robust and balanced differentiation of chromatin states within the genome.
Yan, Huihuang; Tian, Shulan; Slager, Susan L; Sun, Zhifu; Ordog, Tamas
Epigenetic information encoded in covalent modifications of DNA and histone proteins regulates fundamental biological processes through the action of chromatin regulators, transcription factors, and noncoding RNA species. Epigenetic plasticity enables an organism to respond to developmental and environmental signals without genetic changes. However, aberrant epigenetic control plays a key role in pathogenesis of disease. Normal epigenetic states could be disrupted by detrimental mutations and expression alteration of chromatin regulators or by environmental factors. In this primer, we briefly review the epigenetic basis of human disease and discuss how recent discoveries in this field could be translated into clinical diagnosis, prevention, and treatment. We introduce platforms for mapping genome-wide chromatin accessibility, nucleosome occupancy, DNA-binding proteins, and DNA methylation, primarily focusing on the integration of DNA methylation and chromatin immunoprecipitation-sequencing technologies into disease association studies. We highlight practical considerations in applying high-throughput epigenetic assays and formulating analytical strategies. Finally, we summarize current challenges in sample acquisition, experimental procedures, data analysis, and interpretation and make recommendations on further refinement in these areas. Incorporating epigenomic testing into the clinical research arsenal will greatly facilitate our understanding of the epigenetic basis of disease and help identify novel therapeutic targets. © The Author 2015. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: firstname.lastname@example.org.
Full Text Available The primary goal of genome-wide association studies (GWAS is to discover variants that could lead, in isolation or in combination, to a particular trait or disease. Standard approaches to GWAS, however, are usually based on univariate hypothesis tests and therefore can account neither for correlations due to linkage disequilibrium nor for combinations of several markers. To discover and leverage such potential multivariate interactions, we propose in this work an extension of the Random Forest algorithm tailored for structured GWAS data. In terms of risk prediction, we show empirically on several GWAS datasets that the proposed T-Trees method significantly outperforms both the original Random Forest algorithm and standard linear models, thereby suggesting the actual existence of multivariate non-linear effects due to the combinations of several SNPs. We also demonstrate that variable importances as derived from our method can help identify relevant loci. Finally, we highlight the strong impact that quality control procedures may have, both in terms of predictive power and loci identification. Variable importance results and T-Trees source code are all available at www.montefiore.ulg.ac.be/~botta/ttrees/ and github.com/0asa/TTree-source respectively.
Full Text Available Water soluble carbohydrates (WSC in stems play an important role in buffering grain yield in wheat against biotic and abiotic stresses; however, knowledge of genes controlling WSC is very limited. We conducted a genome-wide association study (GWAS using a high-density 90K SNP array to better understand the genetic basis underlying WSC, and to explore marker-based breeding approaches. WSC was evaluated in an association panel comprising 166 Chinese bread wheat cultivars planted in four environments. Fifty two marker-trait associations (MTAs distributed across 23 loci were identified for phenotypic best linear unbiased estimates (BLUEs, and 11 MTAs were identified in two or more environments. Liner regression showed a clear dependence of WSC BLUE scores on numbers of favorable (increasing WSC content and unfavorable alleles (decreasing WSC, indicating that genotypes with higher numbers of favorable or lower numbers of unfavorable alleles had higher WSC content. In silico analysis of flanking sequences of trait-associated SNPs revealed eight candidate genes related to WSC content grouped into two categories based on the type of encoding proteins, namely, defense response proteins and proteins triggered by environmental stresses. The identified SNPs and candidate genes related to WSC provide opportunities for breeding higher WSC wheat cultivars.
Dong, Yan; Liu, Jindong; Zhang, Yan; Geng, Hongwei; Rasheed, Awais; Xiao, Yonggui; Cao, Shuanghe; Fu, Luping; Yan, Jun; Wen, Weie; Zhang, Yong; Jing, Ruilian; Xia, Xianchun; He, Zhonghu
Water soluble carbohydrates (WSC) in stems play an important role in buffering grain yield in wheat against biotic and abiotic stresses; however, knowledge of genes controlling WSC is very limited. We conducted a genome-wide association study (GWAS) using a high-density 90K SNP array to better understand the genetic basis underlying WSC, and to explore marker-based breeding approaches. WSC was evaluated in an association panel comprising 166 Chinese bread wheat cultivars planted in four environments. Fifty two marker-trait associations (MTAs) distributed across 23 loci were identified for phenotypic best linear unbiased estimates (BLUEs), and 11 MTAs were identified in two or more environments. Liner regression showed a clear dependence of WSC BLUE scores on numbers of favorable (increasing WSC content) and unfavorable alleles (decreasing WSC), indicating that genotypes with higher numbers of favorable or lower numbers of unfavorable alleles had higher WSC content. In silico analysis of flanking sequences of trait-associated SNPs revealed eight candidate genes related to WSC content grouped into two categories based on the type of encoding proteins, namely, defense response proteins and proteins triggered by environmental stresses. The identified SNPs and candidate genes related to WSC provide opportunities for breeding higher WSC wheat cultivars.
Khor, Chiea Chuen; Do, Tan; Jia, Hongyan; Nakano, Masakazu; George, Ronnie; Abu-Amero, Khaled; Duvesh, Roopam; Chen, Li Jia; Li, Zheng; Nongpiur, Monisha E; Perera, Shamira A; Qiao, Chunyan; Wong, Hon-Tym; Sakai, Hiroshi; Barbosa de Melo, Mônica; Lee, Mei-Chin; Chan, Anita S; Azhany, Yaakub; Dao, Thi Lam Huong; Ikeda, Yoko; Perez-Grossmann, Rodolfo A; Zarnowski, Tomasz; Day, Alexander C; Jonas, Jost B; Tam, Pancy O S; Tran, Tuan Anh; Ayub, Humaira; Akhtar, Farah; Micheal, Shazia; Chew, Paul T K; Aljasim, Leyla A; Dada, Tanuj; Luu, Tam Thi; Awadalla, Mona S; Kitnarong, Naris; Wanichwecharungruang, Boonsong; Aung, Yee Yee; Mohamed-Noor, Jelinar; Vijayan, Saravanan; Sarangapani, Sripriya; Husain, Rahat; Jap, Aliza; Baskaran, Mani; Goh, David; Su, Daniel H; Wang, Huaizhou; Yong, Vernon K; Yip, Leonard W; Trinh, Tuyet Bach; Makornwattana, Manchima; Nguyen, Thanh Thu; Leuenberger, Edgar U; Park, Ki-Ho; Wiyogo, Widya Artini; Kumar, Rajesh S; Tello, Celso; Kurimoto, Yasuo; Thapa, Suman S; Pathanapitoon, Kessara; Salmon, John F; Sohn, Yong Ho; Fea, Antonio; Ozaki, Mineo; Lai, Jimmy S M; Tantisevi, Visanee; Khaing, Chaw Chaw; Mizoguchi, Takanori; Nakano, Satoko; Kim, Chan-Yun; Tang, Guangxian; Fan, Sujie; Wu, Renyi; Meng, Hailin; Nguyen, Thi Thuy Giang; Tran, Tien Dat; Ueno, Morio; Martinez, Jose Maria; Ramli, Norlina; Aung, Yin Mon; Reyes, Rigo Daniel; Vernon, Stephen A; Fang, Seng Kheong; Xie, Zhicheng; Chen, Xiao Yin; Foo, Jia Nee; Sim, Kar Seng; Wong, Tina T; Quek, Desmond T; Venkatesh, Rengaraj; Kavitha, Srinivasan; Krishnadas, Subbiah R; Soumittra, Nagaswamy; Shantha, Balekudaru; Lim, Boon-Ang; Ogle, Jeanne; de Vasconcellos, José P C; Costa, Vital P; Abe, Ricardo Y; de Souza, Bruno B; Sng, Chelvin C; Aquino, Maria C; Kosior-Jarecka, Ewa; Fong, Guillermo Barreto; Tamanaja, Vania Castro; Fujita, Ricardo; Jiang, Yuzhen; Waseem, Naushin; Low, Sancy; Pham, Huan Nguyen; Al-Shahwan, Sami; Craven, E Randy; Khan, Muhammad Imran; Dada, Rrima; Mohanty, Kuldeep; Faiq, Muneeb A; Hewitt, Alex W; Burdon, Kathryn P; Gan, Eng Hui; Prutthipongsit, Anuwat; Patthanathamrongkasem, Thipnapa; Catacutan, Mary Ann T; Felarca, Irene R; Liao, Chona S; Rusmayani, Emma; Istiantoro, Vira Wardhana; Consolandi, Giulia; Pignata, Giulia; Lavia, Carlo; Rojanapongpun, Prin; Mangkornkanokpong, Lerprat; Chansangpetch, Sunee; Chan, Jonathan C H; Choy, Bonnie N K; Shum, Jennifer W H; Than, Hlaing May; Oo, Khin Thida; Han, Aye Thi; Yong, Victor H; Ng, Xiao-Yu; Goh, Shuang Ru; Chong, Yaan Fun; Hibberd, Martin L; Seielstad, Mark; Png, Eileen; Dunstan, Sarah J; Chau, Nguyen Van Vinh; Bei, Jinxin; Zeng, Yi Xin; Karkey, Abhilasha; Basnyat, Buddha; Pasutto, Francesca; Paoli, Daniela; Frezzotti, Paolo; Wang, Jie Jin; Mitchell, Paul; Fingert, John H; Allingham, R Rand; Hauser, Michael A; Lim, Soon Thye; Chew, Soo Hong; Ebstein, Richard P; Sakuntabhai, Anavaj; Park, Kyu Hyung; Ahn, Jeeyun; Boland, Greet; Snippe, Harm; Stead, Richard; Quino, Raquel; Zaw, Su Nyunt; Lukasik, Urszula; Shetty, Rohit; Zahari, Mimiwati; Bae, Hyoung Won; Oo, Nay Lin; Kubota, Toshiaki; Manassakorn, Anita; Ho, Wing Lau; Dallorto, Laura; Hwang, Young Hoon; Kiire, Christine A; Kuroda, Masako; Djamal, Zeiras Eka; Peregrino, Jovell Ian M; Ghosh, Arkasubhra; Jeoung, Jin Wook; Hoan, Tung S; Srisamran, Nuttamon; Sandragasu, Thayanithi; Set, Saw Htoo; Doan, Vi Huyen; Bhattacharya, Shomi S; Ho, Ching-Lin; Tan, Donald T; Sihota, Ramanjit; Loon, Seng-Chee; Mori, Kazuhiko; Kinoshita, Shigeru; Hollander, Anneke I den; Qamar, Raheel; Wang, Ya-Xing; Teo, Yik Y; Tai, E-Shyong; Hartleben-Matkin, Curt; Lozano-Giral, David; Saw, Seang Mei; Cheng, Ching-Yu; Zenteno, Juan C; Pang, Chi Pui; Bui, Huong T T; Hee, Owen; Craig, Jamie E; Edward, Deepak P; Yonahara, Michiko; Neto, Jamil Miguel; Guevara-Fujita, Maria L; Xu, Liang; Ritch, Robert; Liza-Sharmini, Ahmad Tajudin; Wong, Tien Y; Al-Obeidan, Saleh; Do, Nhu Hon; Sundaresan, Periasamy; Tham, Clement C; Foster, Paul J; Vijaya, Lingam; Tashiro, Kei; Vithana, Eranga N; Wang, Ningli; Aung, Tin
Primary angle closure glaucoma (PACG) is a major cause of blindness worldwide. We conducted a genome-wide association study (GWAS) followed by replication in a combined total of 10,503 PACG cases and 29,567 controls drawn from 24 countries across Asia, Australia, Europe, North America, and South America. We observed significant evidence of disease association at five new genetic loci upon meta-analysis of all patient collections. These loci are at EPDR1 rs3816415 (odds ratio (OR) = 1.24, P = 5.94 × 10(-15)), CHAT rs1258267 (OR = 1.22, P = 2.85 × 10(-16)), GLIS3 rs736893 (OR = 1.18, P = 1.43 × 10(-14)), FERMT2 rs7494379 (OR = 1.14, P = 3.43 × 10(-11)), and DPM2-FAM102A rs3739821 (OR = 1.15, P = 8.32 × 10(-12)). We also confirmed significant association at three previously described loci (P < 5 × 10(-8) for each sentinel SNP at PLEKHA7, COL11A1, and PCMTD1-ST18), providing new insights into the biology of PACG.
Full Text Available In this study, 796 male Duroc pigs were used to identify genomic regions controlling growth traits. Three production traits were studied: food conversion ratio, days to 100 KG, and average daily gain, using a panel of 39,436 single nucleotide polymorphisms. In total, we detected 11 genome-wide and 162 chromosome-wide single nucleotide polymorphism trait associations. The Gene ontology analysis identified 14 candidate genes close to significant single nucleotide polymorphisms, with growth-related functions: six for days to 100 KG (WT1, FBXO3, DOCK7, PPP3CA, AGPAT9, and NKX6-1, seven for food conversion ratio (MAP2, TBX15, IVL, ARL15, CPS1, VWC2L, and VAV3, and one for average daily gain (COL27A1. Gene ontology analysis indicated that most of the candidate genes are involved in muscle, fat, bone or nervous system development, nutrient absorption, and metabolism, which are all either directly or indirectly related to growth traits in pigs. Additionally, we found four haplotype blocks composed of suggestive single nucleotide polymorphisms located in the growth trait-related quantitative trait loci and further narrowed down the ranges, the largest of which decreased by ~60 Mb. Hence, our results could be used to improve pig production traits by increasing the frequency of favorable alleles via artificial selection.
Federico C F Calboli
Full Text Available Neuroticism is a moderately heritable personality trait considered to be a risk factor for developing major depression, anxiety disorders and dementia. We performed a genome-wide association study in 2,235 participants drawn from a population-based study of neuroticism, making this the largest association study for neuroticism to date. Neuroticism was measured by the Eysenck Personality Questionnaire. After Quality Control, we analysed 430,000 autosomal SNPs together with an additional 1.2 million SNPs imputed with high quality from the Hap Map CEU samples. We found a very small effect of population stratification, corrected using one principal component, and some cryptic kinship that required no correction. NKAIN2 showed suggestive evidence of association with neuroticism as a main effect (p < 10(-6 and GPC6 showed suggestive evidence for interaction with age (p approximately = 10(-7. We found support for one previously-reported association (PDE4D, but failed to replicate other recent reports. These results suggest common SNP variation does not strongly influence neuroticism. Our study was powered to detect almost all SNPs explaining at least 2% of heritability, and so our results effectively exclude the existence of loci having a major effect on neuroticism.
Long, Yi; Su, Ying; Ai, Huashui; Zhang, Zhiyan; Yang, Bin; Ruan, Guorong; Xiao, Shijun; Liao, Xinjun; Ren, Jun; Huang, Lusheng; Ding, Nengshui
Umbilical hernia (UH) is one of the most common congenital defects in pigs, leading to considerable economic loss and serious animal welfare problems. To test whether copy number variations (CNVs) contribute to pig UH, we performed a case-control genome-wide CNV association study on 905 pigs from the Duroc, Landrace and Yorkshire breeds using the Porcine SNP60 BeadChip and penncnv algorithm. We first constructed a genomic map comprising 6193 CNVs that pertain to 737 CNV regions. Then, we identified eight CNVs significantly associated with the risk for UH in the three pig breeds. Six of seven significantly associated CNVs were validated using quantitative real-time PCR. Notably, a rare CNV (CNV14:13030843-13059455) encompassing the NUGGC gene was strongly associated with UH (permutation-corrected P = 0.0015) in Duroc pigs. This CNV occurred exclusively in seven Duroc UH-affected individuals. SNPs surrounding the CNV did not show association signals, indicating that rare CNVs may play an important role in complex pig diseases such as UH. The NUGGC gene has been implicated in human omphalocele and inguinal hernia. Our finding supports that CNVs, including the NUGGC CNV, contribute to the pathogenesis of pig UH. © 2016 Stichting International Foundation for Animal Genetics.
Singhal, Sandeep K; Usmani, Nawaid; Michiels, Stefan; Metzger-Filho, Otto; Saini, Kamal S; Kovalchuk, Olga; Parliament, Matthew
Until recently, an elevated disease risk has been ascribed to a genetic predisposition, however, exciting progress over the past years has discovered alternate elements of inheritance that involve epigenetic regulation. Epigenetic changes are heritably stable alterations that include DNA methylation, histone modifications and RNA-mediated silencing. Aberrant DNA methylation is a common molecular basis for a number of important human diseases, including breast cancer. Changes in DNA methylation profoundly affect global gene expression patterns. What is emerging is a more dynamic and complex association between DNA methylation and gene expression than previously believed. Although many tools have already been developed for analyzing genome-wide gene expression data, tools for analyzing genome-wide DNA methylation have not yet reached the same level of refinement. Here we provide an in-depth analysis of DNA methylation in parallel with gene expression data characteristics and describe the particularities of low-level and high-level analyses of DNA methylation data. Low-level analysis refers to pre-processing of methylation data (i.e. normalization, transformation and filtering), whereas high-level analysis is focused on illustrating the application of the widely used class comparison, class prediction and class discovery methods to DNA methylation data. Furthermore, we investigate the influence of DNA methylation on gene expression by measuring the correlation between the degree of CpG methylation and the level of expression and to explore the pattern of methylation as a function of the promoter region.
Full Text Available Abstract Background Septic shock is a heterogeneous syndrome within which probably exist several biological subclasses. Discovery and identification of septic shock subclasses could provide the foundation for the design of more specifically targeted therapies. Herein we tested the hypothesis that pediatric septic shock subclasses can be discovered through genome-wide expression profiling. Methods Genome-wide expression profiling was conducted using whole blood-derived RNA from 98 children with septic shock, followed by a series of bioinformatic approaches targeted at subclass discovery and characterization. Results Three putative subclasses (subclasses A, B, and C were initially identified based on an empiric, discovery-oriented expression filter and unsupervised hierarchical clustering. Statistical comparison of the three putative subclasses (analysis of variance, Bonferonni correction, P Conclusion Genome-wide expression profiling can identify pediatric septic shock subclasses having clinically relevant phenotypes.