WorldWideScience

Sample records for genome-wide scoring approach

  1. Identification of networks of co-occurring, tumor-related DNA copy number changes using a genome-wide scoring approach.

    Directory of Open Access Journals (Sweden)

    Christiaan Klijn

    2010-01-01

    Full Text Available Tumorigenesis is a multi-step process in which normal cells transform into malignant tumors following the accumulation of genetic mutations that enable them to evade the growth control checkpoints that would normally suppress their growth or result in apoptosis. It is therefore important to identify those combinations of mutations that collaborate in cancer development and progression. DNA copy number alterations (CNAs are one of the ways in which cancer genes are deregulated in tumor cells. We hypothesized that synergistic interactions between cancer genes might be identified by looking for regions of co-occurring gain and/or loss. To this end we developed a scoring framework to separate truly co-occurring aberrations from passenger mutations and dominant single signals present in the data. The resulting regions of high co-occurrence can be investigated for between-region functional interactions. Analysis of high-resolution DNA copy number data from a panel of 95 hematological tumor cell lines correctly identified co-occurring recombinations at the T-cell receptor and immunoglobulin loci in T- and B-cell malignancies, respectively, showing that we can recover truly co-occurring genomic alterations. In addition, our analysis revealed networks of co-occurring genomic losses and gains that are enriched for cancer genes. These networks are also highly enriched for functional relationships between genes. We further examine sub-networks of these networks, core networks, which contain many known cancer genes. The core network for co-occurring DNA losses we find seems to be independent of the canonical cancer genes within the network. Our findings suggest that large-scale, low-intensity copy number alterations may be an important feature of cancer development or maintenance by affecting gene dosage of a large interconnected network of functionally related genes.

  2. Genome-wide approaches to understanding behaviour in Drosophila melanogaster.

    Science.gov (United States)

    Neville, Megan; Goodwin, Stephen F

    2012-09-01

    Understanding how an organism exhibits specific behaviours remains a major and important biological question. Studying behaviour in a simple model organism like the fruit fly Drosophila melanogaster has the advantages of advanced molecular genetics approaches along with well-defined anatomy and physiology. With advancements in functional genomic technologies, researchers are now attempting to uncover genes and pathways involved in complex behaviours on a genome-wide scale. A systems-level network approach, which will include genomic approaches, to study behaviour will be key to understanding the regulation and modulation of behaviours and the importance of context in regulating them.

  3. Integrative genome-wide approaches in embryonic stem cell research.

    Science.gov (United States)

    Zhang, Xinyue; Huang, Jing

    2010-10-01

    Embryonic stem (ES) cells are derived from blastocysts. They can differentiate into the three embryonic germ layers and essentially any type of somatic cells. They therefore hold great potential in tissue regeneration therapy. The ethical issues associated with the use of human embryonic stem cells are resolved by the technical break-through of generating induced pluripotent stem (iPS) cells from various types of somatic cells. However, how ES and iPS cells self-renew and maintain their pluripotency is still largely unknown in spite of the great progress that has been made in the last two decades. Integrative genome-wide approaches, such as the gene expression microarray, chromatin immunoprecipitation based microarray (ChIP-chip) and chromatin immunoprecipitation followed by massive parallel sequencing (ChIP-seq) offer unprecedented opportunities to elucidate the mechanism of the pluripotency, reprogramming and DNA damage response of ES and iPS cells. This frontier article summarizes the fundamental biological questions about ES and iPS cells and reviews the recent advances in ES and iPS cell research using genome-wide technologies. To this end, we offer our perspectives on the future of genome-wide studies on stem cells.

  4. Genome-Wide Approaches to Drosophila Heart Development

    Directory of Open Access Journals (Sweden)

    Manfred Frasch

    2016-05-01

    Full Text Available The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level.

  5. Layers of epistasis: genome-wide regulatory networks and network approaches to genome-wide association studies

    Science.gov (United States)

    Cowper-Sal·lari, Richard; Cole, Michael D.; Karagas, Margaret R.; Lupien, Mathieu; Moore, Jason H.

    2010-01-01

    The conceptual foundation of the genome-wide association study (GWAS) has advanced unchecked since its conception. A revision might seem premature as the potential of GWAS has not been fully realized. Multiple technical and practical limitations need to be overcome before GWAS can be fairly criticized. But with the completion of hundreds of studies and a deeper understanding of the genetic architecture of disease, warnings are being raised. The results compiled to date indicate that risk-associated variants lie predominantly in non-coding regions of the genome. Additionally, alternative methodologies are uncovering large and heterogeneous sets of rare variants underlying disease. The fear is that, even in its fulfilment, the current GWAS paradigm might be incapable of dissecting all kinds of phenotypes. In the following text we review several initiatives that aim to overcome these limitations. The overarching theme of these studies is the inclusion of biological knowledge to both the analysis and interpretation of genotyping data. GWAS is uninformed of biology by design and although there is some virtue in its simplicity it is also its most conspicuous deficiency. We propose a framework in which to integrate these novel approaches, both empirical and theoretical, in the form of a genome-wide regulatory network (GWRN). By processing experimental data into networks, emerging data types based on chromatin-immunoprecipitation are made computationally tractable. This will give GWAS re-analysis efforts the most current and relevant substrates, and root them firmly on our knowledge of human disease. PMID:21197657

  6. Statistical Approaches in Genome-Wide Association Studies

    OpenAIRE

    Yazdani, Akram

    2014-01-01

    Genome-wide association studies, GWAS, typically contain hundreds of thousands single nucleotide polymorphisms, SNPs, genotyped for few numbers of samples. The aim of these studies is to identify regions harboring SNPs or to predict the outcomes of interest. Since the number of predictors in the GWAS far exceeds the number of samples, it is impossible to analyze the data with classical statistical methods. In the current GWAS, the widely applied methods are based on single marker analysis th...

  7. Genome-wide approaches to understanding human ageing

    Directory of Open Access Journals (Sweden)

    Kaeberlein Matt

    2006-06-01

    Full Text Available Abstract The use of genomic technologies in biogerontology has the potential to greatly enhance our understanding of human ageing. High-throughput screens for alleles correlated with survival in long-lived people have uncovered novel genes involved in age-associated disease. Genome-wide longevity studies in simple eukaryotes are identifying evolutionarily conserved pathways that determine longevity. It is hoped that validation of these 'public' aspects of ageing in mice, along with analyses of variation in candidate human ageing genes, will provide targets for future interventions to slow the ageing process and retard the onset of age-associated pathologies.

  8. A genome-wide linkage study of individuals with high scores on NEO personality traits.

    Science.gov (United States)

    Amin, N; Schuur, M; Gusareva, E S; Isaacs, A; Aulchenko, Y S; Kirichenko, A V; Zorkoltseva, I V; Axenovich, T I; Oostra, B A; Janssens, A C J W; van Duijn, C M

    2012-10-01

    The NEO-Five-Factor Inventory divides human personality traits into five dimensions: neuroticism, extraversion, openness, conscientiousness and agreeableness. In this study, we sought to identify regions harboring genes with large effects on the five NEO personality traits by performing genome-wide linkage analysis of individuals scoring in the extremes of these traits (>90th percentile). Affected-only linkage analysis was performed using an Illumina 6K linkage array in a family-based study, the Erasmus Rucphen Family study. We subsequently determined whether distinct, segregating haplotypes found with linkage analysis were associated with the trait of interest in the population. Finally, a dense single-nucleotide polymorphism genotyping array (Illumina 318K) was used to search for copy number variations (CNVs) in the associated regions. In the families with extreme phenotype scores, we found significant evidence of linkage for conscientiousness to 20p13 (rs1434789, log of odds (LOD)=5.86) and suggestive evidence of linkage (LOD >2.8) for neuroticism to 19q, 21q and 22q, extraversion to 1p, 1q, 9p and12q, openness to 12q and 19q, and agreeableness to 2p, 6q, 17q and 21q. Further analysis determined haplotypes in 21q22 for neuroticism (P-values = 0.009, 0.007), in 17q24 for agreeableness (marginal P-value = 0.018) and in 20p13 for conscientiousness (marginal P-values = 0.058, 0.038) segregating in families with large contributions to the LOD scores. No evidence for CNVs in any of the associated regions was found. Our findings imply that there may be genes with relatively large effects involved in personality traits, which may be identified with next-generation sequencing techniques.

  9. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies

    DEFF Research Database (Denmark)

    Bulik-Sullivan, Brendan K.; Loh, Po-Ru; Finucane, Hilary K.

    2015-01-01

    Both polygenicity (many small genetic effects) and confounding biases, such as cryptic relatedness and population stratification, can yield an inflated distribution of test statistics in genome-wide association studies (GWAS). However, current methods cannot distinguish between inflation from...... correction factor than genomic control. We find strong evidence that polygenicity accounts for the majority of the inflation in test statistics in many GWAS of large sample size....

  10. Data Mining Approaches for Genome-Wide Association of Mood Disorders

    Science.gov (United States)

    Pirooznia, Mehdi; Seifuddin, Fayaz; Judy, Jennifer; Mahon, Pamela B.; Potash, James B.; Zandi, Peter P.

    2012-01-01

    Mood disorders are highly heritable forms of major mental illness. A major breakthrough in elucidating the genetic architecture of mood disorders was anticipated with the advent of genome-wide association studies (GWAS). However, to date few susceptibility loci have been conclusively identified. The genetic etiology of mood disorders appears to be quite complex, and as a result, alternative approaches for analyzing GWAS data are needed. Recently, a polygenic scoring approach that captures the effects of alleles across multiple loci was successfully applied to the analysis of GWAS data in schizophrenia and bipolar disorder (BP). However, this method may be overly simplistic in its approach to the complexity of genetic effects. Data mining methods are available that may be applied to analyze the high dimensional data generated by GWAS of complex psychiatric disorders. We sought to compare the performance of five data mining methods, namely, Bayesian Networks (BN), Support Vector Machine (SVM), Random Forest (RF), Radial Basis Function network (RBF), and Logistic Regression (LR), against the polygenic scoring approach in the analysis of GWAS data on BP. The different classification methods were trained on GWAS datasets from the Bipolar Genome Study (2,191 cases with BP and 1,434 controls) and their ability to accurately classify case/control status was tested on a GWAS dataset from the Wellcome Trust Case Control Consortium. The performance of the classifiers in the test dataset was evaluated by comparing area under the receiver operating characteristic curves (AUC). BN performed the best of all the data mining classifiers, but none of these did significantly better than the polygenic score approach. We further examined a subset of SNPs in genes that are expressed in the brain, under the hypothesis that these might be most relevant to BP susceptibility, but all the classifiers performed worse with this reduced set of SNPs. The discriminative accuracy of all of these

  11. Genome-wide analytical approaches for reverse metabolic engineering of industrially relevant phenotypes in yeast

    Science.gov (United States)

    Oud, Bart; Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T

    2012-01-01

    Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. PMID:22152095

  12. Genome-wide analytical approaches for reverse metabolic engineering of industrially relevant phenotypes in yeast.

    Science.gov (United States)

    Oud, Bart; van Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T

    2012-03-01

    Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages.

  13. Integrating experimental and analytic approaches to improve data quality in genome-wide RNAi screens.

    Science.gov (United States)

    Zhang, Xiaohua Douglas; Espeseth, Amy S; Johnson, Eric N; Chin, Jayne; Gates, Adam; Mitnaul, Lyndon J; Marine, Shane D; Tian, Jenny; Stec, Eric M; Kunapuli, Priya; Holder, Dan J; Heyse, Joseph F; Strulovici, Berta; Ferrer, Marc

    2008-06-01

    RNA interference (RNAi) not only plays an important role in drug discovery but can also be developed directly into drugs. RNAi high-throughput screening (HTS) biotechnology allows us to conduct genome-wide RNAi research. A central challenge in genome-wide RNAi research is to integrate both experimental and computational approaches to obtain high quality RNAi HTS assays. Based on our daily practice in RNAi HTS experiments, we propose the implementation of 3 experimental and analytic processes to improve the quality of data from RNAi HTS biotechnology: (1) select effective biological controls; (2) adopt appropriate plate designs to display and/or adjust for systematic errors of measurement; and (3) use effective analytic metrics to assess data quality. The applications in 5 real RNAi HTS experiments demonstrate the effectiveness of integrating these processes to improve data quality. Due to the effectiveness in improving data quality in RNAi HTS experiments, the methods and guidelines contained in the 3 experimental and analytic processes are likely to have broad utility in genome-wide RNAi research.

  14. Multi-criteria decision making approaches for quality control of genome-wide association studies.

    Science.gov (United States)

    Malovini, Alberto; Rognoni, Carla; Puca, Annibale; Bellazzi, Riccardo

    2009-03-01

    Experimental errors in the genotyping phases of a Genome-Wide Association Study (GWAS) can lead to false positive findings and to spurious associations. An appropriate quality control phase could minimize the effects of this kind of errors. Several filtering criteria can be used to perform quality control. Currently, no formal methods have been proposed for taking into account at the same time these criteria and the experimenter's preferences. In this paper we propose two strategies for setting appropriate genotyping rate thresholds for GWAS quality control. These two approaches are based on the Multi-Criteria Decision Making theory. We have applied our method on a real dataset composed by 734 individuals affected by Arterial Hypertension (AH) and 486 nonagenarians without history of AH. The proposed strategies appear to deal with GWAS quality control in a sound way, as they lead to rationalize and make explicit the experimenter's choices thus providing more reproducible results.

  15. Dry and wet approaches for genome-wide functional annotation of conventional and unconventional transcriptional activators

    Directory of Open Access Journals (Sweden)

    Elisabetta Levati

    2016-01-01

    Full Text Available Transcription factors (TFs are master gene products that regulate gene expression in response to a variety of stimuli. They interact with DNA in a sequence-specific manner using a variety of DNA-binding domain (DBD modules. This allows to properly position their second domain, called “effector domain”, to directly or indirectly recruit positively or negatively acting co-regulators including chromatin modifiers, thus modulating preinitiation complex formation as well as transcription elongation. At variance with the DBDs, which are comprised of well-defined and easily recognizable DNA binding motifs, effector domains are usually much less conserved and thus considerably more difficult to predict. Also not so easy to identify are the DNA-binding sites of TFs, especially on a genome-wide basis and in the case of overlapping binding regions. Another emerging issue, with many potential regulatory implications, is that of so-called “moonlighting” transcription factors, i.e., proteins with an annotated function unrelated to transcription and lacking any recognizable DBD or effector domain, that play a role in gene regulation as their second job. Starting from bioinformatic and experimental high-throughput tools for an unbiased, genome-wide identification and functional characterization of TFs (especially transcriptional activators, we describe both established (and usually well affordable as well as newly developed platforms for DNA-binding site identification. Selected combinations of these search tools, some of which rely on next-generation sequencing approaches, allow delineating the entire repertoire of TFs and unconventional regulators encoded by the any sequenced genome.

  16. Elastic-net regularization approaches for genome-wide association studies of rheumatoid arthritis.

    Science.gov (United States)

    Cho, Seoae; Kim, Haseong; Oh, Sohee; Kim, Kyunga; Park, Taesung

    2009-12-15

    The current trend in genome-wide association studies is to identify regions where the true disease-causing genes may lie by evaluating thousands of single-nucleotide polymorphisms (SNPs) across the whole genome. However, many challenges exist in detecting disease-causing genes among the thousands of SNPs. Examples include multicollinearity and multiple testing issues, especially when a large number of correlated SNPs are simultaneously tested. Multicollinearity can often occur when predictor variables in a multiple regression model are highly correlated, and can cause imprecise estimation of association. In this study, we propose a simple stepwise procedure that identifies disease-causing SNPs simultaneously by employing elastic-net regularization, a variable selection method that allows one to address multicollinearity. At Step 1, the single-marker association analysis was conducted to screen SNPs. At Step 2, the multiple-marker association was scanned based on the elastic-net regularization. The proposed approach was applied to the rheumatoid arthritis (RA) case-control data set of Genetic Analysis Workshop 16. While the selected SNPs at the screening step are located mostly on chromosome 6, the elastic-net approach identified putative RA-related SNPs on other chromosomes in an increased proportion. For some of those putative RA-related SNPs, we identified the interactions with sex, a well known factor affecting RA susceptibility.

  17. A network-based approach to prioritize results from genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Nirmala Akula

    Full Text Available Genome-wide association studies (GWAS are a valuable approach to understanding the genetic basis of complex traits. One of the challenges of GWAS is the translation of genetic association results into biological hypotheses suitable for further investigation in the laboratory. To address this challenge, we introduce Network Interface Miner for Multigenic Interactions (NIMMI, a network-based method that combines GWAS data with human protein-protein interaction data (PPI. NIMMI builds biological networks weighted by connectivity, which is estimated by use of a modification of the Google PageRank algorithm. These weights are then combined with genetic association p-values derived from GWAS, producing what we call 'trait prioritized sub-networks.' As a proof of principle, NIMMI was tested on three GWAS datasets previously analyzed for height, a classical polygenic trait. Despite differences in sample size and ancestry, NIMMI captured 95% of the known height associated genes within the top 20% of ranked sub-networks, far better than what could be achieved by a single-locus approach. The top 2% of NIMMI height-prioritized sub-networks were significantly enriched for genes involved in transcription, signal transduction, transport, and gene expression, as well as nucleic acid, phosphate, protein, and zinc metabolism. All of these sub-networks were ranked near the top across all three height GWAS datasets we tested. We also tested NIMMI on a categorical phenotype, Crohn's disease. NIMMI prioritized sub-networks involved in B- and T-cell receptor, chemokine, interleukin, and other pathways consistent with the known autoimmune nature of Crohn's disease. NIMMI is a simple, user-friendly, open-source software tool that efficiently combines genetic association data with biological networks, translating GWAS findings into biological hypotheses.

  18. Distinguishing bacterial pathogens of potato using a genome-wide microarray approach.

    Science.gov (United States)

    Aittamaa, M; Somervuo, P; Pirhonen, M; Mattinen, L; Nissinen, R; Auvinen, P; Valkonen, J P T

    2008-09-01

    A set of 9676 probes was designed for the most harmful bacterial pathogens of potato and tested in a microarray format. Gene-specific probes could be designed for all genes of Pectobacterium atrosepticum, c. 50% of the genes of Streptomyces scabies and c. 30% of the genes of Clavibacter michiganensis ssp. sepedonicus utilizing the whole-genome sequence information available. For Streptomyces turgidiscabies, 226 probes were designed according to the sequences of a pathogenicity island containing important virulence genes. In addition, probes were designed for the virulence-associated nip (necrosis-inducing protein) genes of P. atrosepticum, P. carotovorum and Dickeya dadantii and for the intergenic spacer (IGS) sequences of the 16S-23S rRNA gene region. Ralstonia solanacearum was not included in the study, because it is a quarantine organism and is not presently found in Finland, but a few probes were also designed for this species. The probes contained on average 40 target-specific nucleotides and were synthesized on the array in situ, organized as eight sub-arrays with an identical set of probes which could be used for hybridization with different samples. All bacteria were readily distinguished using a single channel system for signal detection. Nearly all of the c. 1000 probes designed for C. michiganensis ssp. sepedonicus, c. 50% and 40% of the c. 4000 probes designed for the genes of S. scabies and P. atrosepticum, respectively, and over 100 probes for S. turgidiscabies showed significant signals only with the respective species. P. atrosepticum, P. carotovorum and Dickeya strains were all detected with 110 common probes. By contrast, the strains of these species were found to differ in their signal profiles. Probes targeting the IGS region and nip genes could be used to place strains of Dickeya to two groups, which correlated with differences in virulence. Taken together, the approach of using a custom-designed, genome-wide microarray provided a robust means

  19. Molecular characterisation of the poorly differentiated and undifferentiated thyroid carcinomas using genome-wide approaches

    OpenAIRE

    Pita, Jaime Miguel Gomes, 1985-

    2013-01-01

    Tese de doutoramento, Bioquímica (Genética Molecular), Universidade de Lisboa, Faculdade de Ciências, 2013 Poorly differentiated (PDTC) and anaplastic thyroid carcinomas (ATC) are highly malignant tumours composed by dedifferentiated cells, for which current therapeutic options have been ineffective. In the present project, the molecular signatures and genetic alterations associated with these tumours were elucidated, by using genome-wide expression analysis as first assessment. The role ...

  20. Genome-wide identification of Schistosoma japonicum microRNAs using a deep-sequencing approach.

    Directory of Open Access Journals (Sweden)

    Jian Huang

    Full Text Available BACKGROUND: Human schistosomiasis is one of the most prevalent and serious parasitic diseases worldwide. Schistosoma japonicum is one of important pathogens of this disease. MicroRNAs (miRNAs are a large group of non-coding RNAs that play important roles in regulating gene expression and protein translation in animals. Genome-wide identification of miRNAs in a given organism is a critical step to facilitating our understanding of genome organization, genome biology, evolution, and posttranscriptional regulation. METHODOLOGY/PRINCIPAL FINDINGS: We sequenced two small RNA libraries prepared from different stages of the life cycle of S. japonicum, immature schistosomula and mature pairing adults, through a deep DNA sequencing approach, which yielded approximately 12 million high-quality short sequence reads containing a total of approximately 2 million non-redundant tags. Based on a bioinformatics pipeline, we identified 176 new S. japonicum miRNAs, of which some exhibited a differential pattern of expression between the two stages. Although 21 S. japonicum miRNAs are orthologs of known miRNAs within the metazoans, some nucleotides at many positions of Schistosoma miRNAs, such as miR-8, let-7, miR-10, miR-31, miR-92, miR-124, and miR-125, are indeed significantly distinct from other bilaterian orthologs. In addition, both miR-71 and some miR-2 family members in tandem are found to be clustered in a reversal direction model on two genomic loci, and two pairs of novel S. japonicum miRNAs were derived from sense and antisense DNA strands at the same genomic loci. CONCLUSIONS/SIGNIFICANCE: The collection of S. japonicum miRNAs could be used as a new platform to study the genomic structure, gene regulation and networks, evolutionary processes, development, and host-parasite interactions. Some S. japonicum miRNAs and their clusters could represent the ancestral forms of the conserved orthologues and a model for the genesis of novel miRNAs.

  1. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance

    OpenAIRE

    Manning, Alisa K; Hivert, Marie-France; Scott, Robert A.; Grimsby, Jonna L; Bouatia-Naji, Nabila; Chen, Han; Rybin, Denis; Liu, Ching-Ti; Bielak, Lawrence F; Prokopenko, Inga; Amin, Najaf; Barnes, Daniel; Cadby, Gemma; Hottenga, Jouke-Jan; Ingelsson, Erik

    2012-01-01

    Recent genome-wide association studies have described many loci implicated in type 2 diabetes (T2D) pathophysiology and beta-cell dysfunction but have contributed little to the understanding of the genetic basis of insulin resistance. We hypothesized that genes implicated in insulin resistance pathways might be uncovered by accounting for differences in body mass index (BMI) and potential interactions between BMI and genetic variants. We applied a joint meta-analysis approach to test associat...

  2. The complex genetics of gait speed: genome-wide meta-analysis approach

    Science.gov (United States)

    Lunetta, Kathryn L.; Smith, Jennifer A.; Eicher, John D.; Vered, Rotem; Deelen, Joris; Arnold, Alice M.; Buchman, Aron S.; Tanaka, Toshiko; Faul, Jessica D.; Nethander, Maria; Fornage, Myriam; Adams, Hieab H.; Matteini, Amy M.; Callisaya, Michele L.; Smith, Albert V.; Yu, Lei; De Jager, Philip L.; Evans, Denis A.; Gudnason, Vilmundur; Hofman, Albert; Pattie, Alison; Corley, Janie; Launer, Lenore J.; Knopman, Davis S.; Parimi, Neeta; Turner, Stephen T.; Bandinelli, Stefania; Beekman, Marian; Gutman, Danielle; Sharvit, Lital; Mooijaart, Simon P.; Liewald, David C.; Houwing-Duistermaat, Jeanine J.; Ohlsson, Claes; Moed, Matthijs; Verlinden, Vincent J.; Mellström, Dan; van der Geest, Jos N.; Karlsson, Magnus; Hernandez, Dena; McWhirter, Rebekah; Liu, Yongmei; Thomson, Russell; Tranah, Gregory J.; Uitterlinden, Andre G.; Weir, David R.; Zhao, Wei; Starr, John M.; Johnson, Andrew D.; Ikram, M. Arfan; Bennett, David A.; Cummings, Steven R.; Deary, Ian J.; Harris, Tamara B.; Kardia, Sharon L. R.; Mosley, Thomas H.; Srikanth, Velandai K.; Windham, Beverly G.; Newman, Ann B.; Walston, Jeremy D.; Davies, Gail; Evans, Daniel S.; Slagboom, Eline P.; Ferrucci, Luigi; Kiel, Douglas P.; Murabito, Joanne M.; Atzmon, Gil

    2017-01-01

    Emerging evidence suggests that the basis for variation in late-life mobility is attributable, in part, to genetic factors, which may become increasingly important with age. Our objective was to systematically assess the contribution of genetic variation to gait speed in older individuals. We conducted a meta-analysis of gait speed GWASs in 31,478 older adults from 17 cohorts of the CHARGE consortium, and validated our results in 2,588 older adults from 4 independent studies. We followed our initial discoveries with network and eQTL analysis of candidate signals in tissues. The meta-analysis resulted in a list of 536 suggestive genome wide significant SNPs in or near 69 genes. Further interrogation with Pathway Analysis placed gait speed as a polygenic complex trait in five major networks. Subsequent eQTL analysis revealed several SNPs significantly associated with the expression of PRSS16, WDSUB1 and PTPRT, which in addition to the meta-analysis and pathway suggested that genetic effects on gait speed may occur through synaptic function and neuronal development pathways. No genome-wide significant signals for gait speed were identified from this moderately large sample of older adults, suggesting that more refined physical function phenotypes will be needed to identify the genetic basis of gait speed in aging. PMID:28077804

  3. On the analysis of genome-wide association studies in family-based designs: a universal, robust analysis approach and an application to four genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Sungho Won

    2009-11-01

    Full Text Available For genome-wide association studies in family-based designs, we propose a new, universally applicable approach. The new test statistic exploits all available information about the association, while, by virtue of its design, it maintains the same robustness against population admixture as traditional family-based approaches that are based exclusively on the within-family information. The approach is suitable for the analysis of almost any trait type, e.g. binary, continuous, time-to-onset, multivariate, etc., and combinations of those. We use simulation studies to verify all theoretically derived properties of the approach, estimate its power, and compare it with other standard approaches. We illustrate the practical implications of the new analysis method by an application to a lung-function phenotype, forced expiratory volume in one second (FEV1 in 4 genome-wide association studies.

  4. Genome-wide approaches to identify pharmacogenetic contributions to adverse drug reactions.

    Science.gov (United States)

    Nelson, M R; Bacanu, S-A; Mosteller, M; Li, L; Bowman, C E; Roses, A D; Lai, E H; Ehm, M G

    2009-02-01

    Adverse drug reactions (ADRs) have a major impact on patients, physicians, health care providers, regulatory agencies and pharmaceutical companies. Identifying the genetic contributions to ADR risk may lead to a better understanding of the underlying mechanisms, identification of patients at risk and a decrease in the number of events. Technological advances have made the routine monitoring and investigation of the genetic basis of ADRs during clinical trials possible. We demonstrate through simulation that genome-wide genotyping, coupled with the use of clinically matched or population controls, can yield sufficient statistical power to permit the identification of strong genetic predictors of ADR risk in a prospective manner with modest numbers of ADR cases. The results of a 500,000 single nucleotide polymorphism analysis of abacavir-associated hypersensitivity reaction suggest that the known HLA-B gene region could be identified with as few as 15 cases and 200 population controls in a sequential analysis.

  5. α-amanitin resistance in Drosophila melanogaster: A genome-wide association approach

    Science.gov (United States)

    Mitchell, Chelsea L.; Latuszek, Catrina E.; Vogel, Kara R.; Greenlund, Ian M.; Hobmeier, Rebecca E.; Ingram, Olivia K.; Dufek, Shannon R.; Pecore, Jared L.; Nip, Felicia R.; Johnson, Zachary J.; Ji, Xiaohui; Wei, Hairong; Gailing, Oliver

    2017-01-01

    We investigated the mechanisms of mushroom toxin resistance in the Drosophila Genetic Reference Panel (DGRP) fly lines, using genome-wide association studies (GWAS). While Drosophila melanogaster avoids mushrooms in nature, some lines are surprisingly resistant to α-amanitin—a toxin found solely in mushrooms. This resistance may represent a pre-adaptation, which might enable this species to invade the mushroom niche in the future. Although our previous microarray study had strongly suggested that pesticide-metabolizing detoxification genes confer α-amanitin resistance in a Taiwanese D. melanogaster line Ama-KTT, none of the traditional detoxification genes were among the top candidate genes resulting from the GWAS in the current study. Instead, we identified Megalin, Tequila, and widerborst as candidate genes underlying the α-amanitin resistance phenotype in the North American DGRP lines, all three of which are connected to the Target of Rapamycin (TOR) pathway. Both widerborst and Tequila are upstream regulators of TOR, and TOR is a key regulator of autophagy and Megalin-mediated endocytosis. We suggest that endocytosis and autophagy of α-amanitin, followed by lysosomal degradation of the toxin, is one of the mechanisms that confer α-amanitin resistance in the DGRP lines. PMID:28241077

  6. α-amanitin resistance in Drosophila melanogaster: A genome-wide association approach.

    Science.gov (United States)

    Mitchell, Chelsea L; Latuszek, Catrina E; Vogel, Kara R; Greenlund, Ian M; Hobmeier, Rebecca E; Ingram, Olivia K; Dufek, Shannon R; Pecore, Jared L; Nip, Felicia R; Johnson, Zachary J; Ji, Xiaohui; Wei, Hairong; Gailing, Oliver; Werner, Thomas

    2017-01-01

    We investigated the mechanisms of mushroom toxin resistance in the Drosophila Genetic Reference Panel (DGRP) fly lines, using genome-wide association studies (GWAS). While Drosophila melanogaster avoids mushrooms in nature, some lines are surprisingly resistant to α-amanitin-a toxin found solely in mushrooms. This resistance may represent a pre-adaptation, which might enable this species to invade the mushroom niche in the future. Although our previous microarray study had strongly suggested that pesticide-metabolizing detoxification genes confer α-amanitin resistance in a Taiwanese D. melanogaster line Ama-KTT, none of the traditional detoxification genes were among the top candidate genes resulting from the GWAS in the current study. Instead, we identified Megalin, Tequila, and widerborst as candidate genes underlying the α-amanitin resistance phenotype in the North American DGRP lines, all three of which are connected to the Target of Rapamycin (TOR) pathway. Both widerborst and Tequila are upstream regulators of TOR, and TOR is a key regulator of autophagy and Megalin-mediated endocytosis. We suggest that endocytosis and autophagy of α-amanitin, followed by lysosomal degradation of the toxin, is one of the mechanisms that confer α-amanitin resistance in the DGRP lines.

  7. Genome wide association analysis of the 16th QTL- MAS Workshop dataset using the Random Forest machine learning approach

    Science.gov (United States)

    2014-01-01

    Background Genome wide association studies are now widely used in the livestock sector to estimate the association among single nucleotide polymorphisms (SNPs) distributed across the whole genome and one or more trait. As computational power increases, the use of machine learning techniques to analyze large genome wide datasets becomes possible. Methods The objective of this study was to identify SNPs associated with the three traits simulated in the 16th MAS-QTL workshop dataset using the Random Forest (RF) approach. The approach was applied to single and multiple trait estimated breeding values, and on yield deviations and to compare them with the results of the GRAMMAR-CG method. Results The two QTL mapping methods used, GRAMMAR-CG and RF, were successful in identifying the main QTLs for trait 1 on chromosomes 1 and 4, for trait 2 on chromosomes 1, 4 and 5 and for trait 3 on chromosomes 1, 2 and 3. Conclusions The results of the RF approach were confirmed by the GRAMMAR-CG method and validated by the effective QTL position, even if their approach to unravel cryptic genetic structure is different. Furthermore, both methods showed complementary findings. However, when the variance explained by the QTL is low, they both failed to detect significant associations. PMID:25519518

  8. MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study

    Directory of Open Access Journals (Sweden)

    Xue Hong

    2009-01-01

    Full Text Available Abstract Background The interactions of multiple single nucleotide polymorphisms (SNPs are highly hypothesized to affect an individual's susceptibility to complex diseases. Although many works have been done to identify and quantify the importance of multi-SNP interactions, few of them could handle the genome wide data due to the combinatorial explosive search space and the difficulty to statistically evaluate the high-order interactions given limited samples. Results Three comparative experiments are designed to evaluate the performance of MegaSNPHunter. The first experiment uses synthetic data generated on the basis of epistasis models. The second one uses a genome wide study on Parkinson disease (data acquired by using Illumina HumanHap300 SNP chips. The third one chooses the rheumatoid arthritis study from Wellcome Trust Case Control Consortium (WTCCC using Affymetrix GeneChip 500K Mapping Array Set. MegaSNPHunter outperforms the best solution in this area and reports many potential interactions for the two real studies. Conclusion The experimental results on both synthetic data and two real data sets demonstrate that our proposed approach outperforms the best solution that is currently available in handling large-scale SNP data both in terms of speed and in terms of detection of potential interactions that were not identified before. To our knowledge, MegaSNPHunter is the first approach that is capable of identifying the disease-associated SNP interactions from WTCCC studies and is promising for practical disease prognosis.

  9. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance.

    Science.gov (United States)

    Manning, Alisa K; Hivert, Marie-France; Scott, Robert A; Grimsby, Jonna L; Bouatia-Naji, Nabila; Chen, Han; Rybin, Denis; Liu, Ching-Ti; Bielak, Lawrence F; Prokopenko, Inga; Amin, Najaf; Barnes, Daniel; Cadby, Gemma; Hottenga, Jouke-Jan; Ingelsson, Erik; Jackson, Anne U; Johnson, Toby; Kanoni, Stavroula; Ladenvall, Claes; Lagou, Vasiliki; Lahti, Jari; Lecoeur, Cecile; Liu, Yongmei; Martinez-Larrad, Maria Teresa; Montasser, May E; Navarro, Pau; Perry, John R B; Rasmussen-Torvik, Laura J; Salo, Perttu; Sattar, Naveed; Shungin, Dmitry; Strawbridge, Rona J; Tanaka, Toshiko; van Duijn, Cornelia M; An, Ping; de Andrade, Mariza; Andrews, Jeanette S; Aspelund, Thor; Atalay, Mustafa; Aulchenko, Yurii; Balkau, Beverley; Bandinelli, Stefania; Beckmann, Jacques S; Beilby, John P; Bellis, Claire; Bergman, Richard N; Blangero, John; Boban, Mladen; Boehnke, Michael; Boerwinkle, Eric; Bonnycastle, Lori L; Boomsma, Dorret I; Borecki, Ingrid B; Böttcher, Yvonne; Bouchard, Claude; Brunner, Eric; Budimir, Danijela; Campbell, Harry; Carlson, Olga; Chines, Peter S; Clarke, Robert; Collins, Francis S; Corbatón-Anchuelo, Arturo; Couper, David; de Faire, Ulf; Dedoussis, George V; Deloukas, Panos; Dimitriou, Maria; Egan, Josephine M; Eiriksdottir, Gudny; Erdos, Michael R; Eriksson, Johan G; Eury, Elodie; Ferrucci, Luigi; Ford, Ian; Forouhi, Nita G; Fox, Caroline S; Franzosi, Maria Grazia; Franks, Paul W; Frayling, Timothy M; Froguel, Philippe; Galan, Pilar; de Geus, Eco; Gigante, Bruna; Glazer, Nicole L; Goel, Anuj; Groop, Leif; Gudnason, Vilmundur; Hallmans, Göran; Hamsten, Anders; Hansson, Ola; Harris, Tamara B; Hayward, Caroline; Heath, Simon; Hercberg, Serge; Hicks, Andrew A; Hingorani, Aroon; Hofman, Albert; Hui, Jennie; Hung, Joseph; Jarvelin, Marjo-Riitta; Jhun, Min A; Johnson, Paul C D; Jukema, J Wouter; Jula, Antti; Kao, W H; Kaprio, Jaakko; Kardia, Sharon L R; Keinanen-Kiukaanniemi, Sirkka; Kivimaki, Mika; Kolcic, Ivana; Kovacs, Peter; Kumari, Meena; Kuusisto, Johanna; Kyvik, Kirsten Ohm; Laakso, Markku; Lakka, Timo; Lannfelt, Lars; Lathrop, G Mark; Launer, Lenore J; Leander, Karin; Li, Guo; Lind, Lars; Lindstrom, Jaana; Lobbens, Stéphane; Loos, Ruth J F; Luan, Jian'an; Lyssenko, Valeriya; Mägi, Reedik; Magnusson, Patrik K E; Marmot, Michael; Meneton, Pierre; Mohlke, Karen L; Mooser, Vincent; Morken, Mario A; Miljkovic, Iva; Narisu, Narisu; O'Connell, Jeff; Ong, Ken K; Oostra, Ben A; Palmer, Lyle J; Palotie, Aarno; Pankow, James S; Peden, John F; Pedersen, Nancy L; Pehlic, Marina; Peltonen, Leena; Penninx, Brenda; Pericic, Marijana; Perola, Markus; Perusse, Louis; Peyser, Patricia A; Polasek, Ozren; Pramstaller, Peter P; Province, Michael A; Räikkönen, Katri; Rauramaa, Rainer; Rehnberg, Emil; Rice, Ken; Rotter, Jerome I; Rudan, Igor; Ruokonen, Aimo; Saaristo, Timo; Sabater-Lleal, Maria; Salomaa, Veikko; Savage, David B; Saxena, Richa; Schwarz, Peter; Seedorf, Udo; Sennblad, Bengt; Serrano-Rios, Manuel; Shuldiner, Alan R; Sijbrands, Eric J G; Siscovick, David S; Smit, Johannes H; Small, Kerrin S; Smith, Nicholas L; Smith, Albert Vernon; Stančáková, Alena; Stirrups, Kathleen; Stumvoll, Michael; Sun, Yan V; Swift, Amy J; Tönjes, Anke; Tuomilehto, Jaakko; Trompet, Stella; Uitterlinden, Andre G; Uusitupa, Matti; Vikström, Max; Vitart, Veronique; Vohl, Marie-Claude; Voight, Benjamin F; Vollenweider, Peter; Waeber, Gerard; Waterworth, Dawn M; Watkins, Hugh; Wheeler, Eleanor; Widen, Elisabeth; Wild, Sarah H; Willems, Sara M; Willemsen, Gonneke; Wilson, James F; Witteman, Jacqueline C M; Wright, Alan F; Yaghootkar, Hanieh; Zelenika, Diana; Zemunik, Tatijana; Zgaga, Lina; Wareham, Nicholas J; McCarthy, Mark I; Barroso, Ines; Watanabe, Richard M; Florez, Jose C; Dupuis, Josée; Meigs, James B; Langenberg, Claudia

    2012-05-13

    Recent genome-wide association studies have described many loci implicated in type 2 diabetes (T2D) pathophysiology and β-cell dysfunction but have contributed little to the understanding of the genetic basis of insulin resistance. We hypothesized that genes implicated in insulin resistance pathways might be uncovered by accounting for differences in body mass index (BMI) and potential interactions between BMI and genetic variants. We applied a joint meta-analysis approach to test associations with fasting insulin and glucose on a genome-wide scale. We present six previously unknown loci associated with fasting insulin at P < 5 × 10(-8) in combined discovery and follow-up analyses of 52 studies comprising up to 96,496 non-diabetic individuals. Risk variants were associated with higher triglyceride and lower high-density lipoprotein (HDL) cholesterol levels, suggesting a role for these loci in insulin resistance pathways. The discovery of these loci will aid further characterization of the role of insulin resistance in T2D pathophysiology.

  10. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance

    Science.gov (United States)

    Manning, Alisa K.; Hivert, Marie-France; Scott, Robert A.; Grimsby, Jonna L.; Bouatia-Naji, Nabila; Chen, Han; Rybin, Denis; Liu, Ching-Ti; Bielak, Lawrence F.; Prokopenko, Inga; Amin, Najaf; Barnes, Daniel; Cadby, Gemma; Hottenga, Jouke-Jan; Ingelsson, Erik; Jackson, Anne U.; Johnson, Toby; Kanoni, Stavroula; Ladenvall, Claes; Lagou, Vasiliki; Lahti, Jari; Lecoeur, Cecile; Liu, Yongmei; Martinez-Larrad, Maria Teresa; Montasser, May E.; Navarro, Pau; Perry, John R. B.; Rasmussen-Torvik, Laura J.; Salo, Perttu; Sattar, Naveed; Shungin, Dmitry; Strawbridge, Rona J.; Tanaka, Toshiko; van Duijn, Cornelia M.; An, Ping; de Andrade, Mariza; Andrews, Jeanette S.; Aspelund, Thor; Atalay, Mustafa; Aulchenko, Yurii; Balkau, Beverley; Bandinelli, Stefania; Beckmann, Jacques S.; Beilby, John P.; Bellis, Claire; Bergman, Richard N.; Blangero, John; Boban, Mladen; Boehnke, Michael; Boerwinkle, Eric; Bonnycastle, Lori L.; Boomsma, Dorret I.; Borecki, Ingrid B.; Böttcher, Yvonne; Bouchard, Claude; Brunner, Eric; Budimir, Danijela; Campbell, Harry; Carlson, Olga; Chines, Peter S.; Clarke, Robert; Collins, Francis S.; Corbatón-Anchuelo, Arturo; Couper, David; de Faire, Ulf; Dedoussis, George V; Deloukas, Panos; Dimitriou, Maria; Egan, Josephine M; Eiriksdottir, Gudny; Erdos, Michael R.; Eriksson, Johan G.; Eury, Elodie; Ferrucci, Luigi; Ford, Ian; Forouhi, Nita G.; Fox, Caroline S; Franzosi, Maria Grazia; Franks, Paul W; Frayling, Timothy M; Froguel, Philippe; Galan, Pilar; de Geus, Eco; Gigante, Bruna; Glazer, Nicole L.; Goel, Anuj; Groop, Leif; Gudnason, Vilmundur; Hallmans, Göran; Hamsten, Anders; Hansson, Ola; Harris, Tamara B.; Hayward, Caroline; Heath, Simon; Hercberg, Serge; Hicks, Andrew A.; Hingorani, Aroon; Hofman, Albert; Hui, Jennie; Hung, Joseph; Jarvelin, Marjo Riitta; Jhun, Min A.; Johnson, Paul C.D.; Jukema, J Wouter; Jula, Antti; Kao, W.H.; Kaprio, Jaakko; Kardia, Sharon L. R.; Keinanen-Kiukaanniemi, Sirkka; Kivimaki, Mika; Kolcic, Ivana; Kovacs, Peter; Kumari, Meena; Kuusisto, Johanna; Kyvik, Kirsten Ohm; Laakso, Markku; Lakka, Timo; Lannfelt, Lars; Lathrop, G Mark; Launer, Lenore J.; Leander, Karin; Li, Guo; Lind, Lars; Lindstrom, Jaana; Lobbens, Stéphane; Loos, Ruth J. F.; Luan, Jian’an; Lyssenko, Valeriya; Mägi, Reedik; Magnusson, Patrik K. E.; Marmot, Michael; Meneton, Pierre; Mohlke, Karen L.; Mooser, Vincent; Morken, Mario A.; Miljkovic, Iva; Narisu, Narisu; O’Connell, Jeff; Ong, Ken K.; Oostra, Ben A.; Palmer, Lyle J.; Palotie, Aarno; Pankow, James S.; Peden, John F.; Pedersen, Nancy L.; Pehlic, Marina; Peltonen, Leena; Penninx, Brenda; Pericic, Marijana; Perola, Markus; Perusse, Louis; Peyser, Patricia A; Polasek, Ozren; Pramstaller, Peter P.; Province, Michael A.; Räikkönen, Katri; Rauramaa, Rainer; Rehnberg, Emil; Rice, Ken; Rotter, Jerome I.; Rudan, Igor; Ruokonen, Aimo; Saaristo, Timo; Sabater-Lleal, Maria; Salomaa, Veikko; Savage, David B.; Saxena, Richa; Schwarz, Peter; Seedorf, Udo; Sennblad, Bengt; Serrano-Rios, Manuel; Shuldiner, Alan R.; Sijbrands, Eric J.G.; Siscovick, David S.; Smit, Johannes H.; Small, Kerrin S.; Smith, Nicholas L.; Smith, Albert Vernon; Stančáková, Alena; Stirrups, Kathleen; Stumvoll, Michael; Sun, Yan V.; Swift, Amy J.; Tönjes, Anke; Tuomilehto, Jaakko; Trompet, Stella; Uitterlinden, Andre G.; Uusitupa, Matti; Vikström, Max; Vitart, Veronique; Vohl, Marie-Claude; Voight, Benjamin F.; Vollenweider, Peter; Waeber, Gerard; Waterworth, Dawn M; Watkins, Hugh; Wheeler, Eleanor; Widen, Elisabeth; Wild, Sarah H.; Willems, Sara M.; Willemsen, Gonneke; Wilson, James F.; Witteman, Jacqueline C.M.; Wright, Alan F.; Yaghootkar, Hanieh; Zelenika, Diana; Zemunik, Tatijana; Zgaga, Lina; Wareham, Nicholas J.; McCarthy, Mark I.; Barroso, Ines; Watanabe, Richard M.; Florez, Jose C.; Dupuis, Josée; Meigs, James B.; Langenberg, Claudia

    2013-01-01

    Recent genome-wide association studies have described many loci implicated in type 2 diabetes (T2D) pathophysiology and beta-cell dysfunction, but contributed little to our understanding of the genetic basis of insulin resistance. We hypothesized that genes implicated in insulin resistance pathways may be uncovered by accounting for differences in body mass index (BMI) and potential interaction between BMI and genetic variants. We applied a novel joint meta-analytical approach to test associations with fasting insulin (FI) and glucose (FG) on a genome-wide scale. We present six previously unknown FI loci at Pdiscovery and follow-up analyses of 52 studies comprising up to 96,496non-diabetic individuals. Risk variants were associated with higher triglyceride and lower HDL cholesterol levels, suggestive of a role for these FI loci in insulin resistance pathways. The localization of these additional loci will aid further characterization of the role of insulin resistance in T2D pathophysiology. PMID:22581228

  11. Contribution of genome-wide association studies to scientific research: a pragmatic approach to evaluate their impact.

    Directory of Open Access Journals (Sweden)

    Vito A G Ricigliano

    Full Text Available The factual value of genome-wide association studies (GWAS for the understanding of multifactorial diseases is a matter of intense debate. Practical consequences for the development of more effective therapies do not seem to be around the corner. Here we propose a pragmatic and objective evaluation of how much new biology is arising from these studies, with particular attention to the information that can help prioritize therapeutic targets. We chose multiple sclerosis (MS as a paradigm disease and assumed that, in pre-GWAS candidate-gene studies, the knowledge behind the choice of each gene reflected the understanding of the disease prior to the advent of GWAS. Importantly, this knowledge was based mainly on non-genetic, phenotypic grounds. We performed single-gene and pathway-oriented comparisons of old and new knowledge in MS by confronting an unbiased list of candidate genes in pre-GWAS association studies with those genes exceeding the genome-wide significance threshold in GWAS published from 2007 on. At the single gene level, the majority (94 out of 125 of GWAS-discovered variants had never been contemplated as plausible candidates in pre-GWAS association studies. The 31 genes that were present in both pre- and post-GWAS lists may be of particular interest in that they represent disease-associated variants whose pathogenetic relevance is supported at the phenotypic level (i.e. the phenotypic information that steered their selection as candidate genes in pre-GWAS association studies. As such they represent attractive therapeutic targets. Interestingly, our analysis shows that some of these variants are targets of pharmacologically active compounds, including drugs that are already registered for human use. Compared with the above single-gene analysis, at the pathway level GWAS results appear more coherent with previous knowledge, reinforcing some of the current views on MS pathogenesis and related therapeutic research. This study presents a

  12. IGESS: a statistical approach to integrating individual-level genotype data and summary statistics in genome-wide association studies.

    Science.gov (United States)

    Dai, Mingwei; Ming, Jingsi; Cai, Mingxuan; Liu, Jin; Yang, Can; Wan, Xiang; Xu, Zongben

    2017-09-15

    Results from genome-wide association studies (GWAS) suggest that a complex phenotype is often affected by many variants with small effects, known as 'polygenicity'. Tens of thousands of samples are often required to ensure statistical power of identifying these variants with small effects. However, it is often the case that a research group can only get approval for the access to individual-level genotype data with a limited sample size (e.g. a few hundreds or thousands). Meanwhile, summary statistics generated using single-variant-based analysis are becoming publicly available. The sample sizes associated with the summary statistics datasets are usually quite large. How to make the most efficient use of existing abundant data resources largely remains an open question. In this study, we propose a statistical approach, IGESS, to increasing statistical power of identifying risk variants and improving accuracy of risk prediction by i ntegrating individual level ge notype data and s ummary s tatistics. An efficient algorithm based on variational inference is developed to handle the genome-wide analysis. Through comprehensive simulation studies, we demonstrated the advantages of IGESS over the methods which take either individual-level data or summary statistics data as input. We applied IGESS to perform integrative analysis of Crohns Disease from WTCCC and summary statistics from other studies. IGESS was able to significantly increase the statistical power of identifying risk variants and improve the risk prediction accuracy from 63.2% ( ±0.4% ) to 69.4% ( ±0.1% ) using about 240 000 variants. The IGESS software is available at https://github.com/daviddaigithub/IGESS . zbxu@xjtu.edu.cn or xwan@comp.hkbu.edu.hk or eeyang@hkbu.edu.hk. Supplementary data are available at Bioinformatics online.

  13. A Two-Stage Penalized Logistic Regression Approach to Case-Control Genome-Wide Association Studies

    Directory of Open Access Journals (Sweden)

    Jingyuan Zhao

    2012-01-01

    Full Text Available We propose a two-stage penalized logistic regression approach to case-control genome-wide association studies. This approach consists of a screening stage and a selection stage. In the screening stage, main-effect and interaction-effect features are screened by using L1-penalized logistic like-lihoods. In the selection stage, the retained features are ranked by the logistic likelihood with the smoothly clipped absolute deviation (SCAD penalty (Fan and Li, 2001 and Jeffrey’s Prior penalty (Firth, 1993, a sequence of nested candidate models are formed, and the models are assessed by a family of extended Bayesian information criteria (J. Chen and Z. Chen, 2008. The proposed approach is applied to the analysis of the prostate cancer data of the Cancer Genetic Markers of Susceptibility (CGEMS project in the National Cancer Institute, USA. Simulation studies are carried out to compare the approach with the pair-wise multiple testing approach (Marchini et al. 2005 and the LASSO-patternsearch algorithm (Shi et al. 2007.

  14. Novel approach for deriving genome wide SNP analysis data from archived blood spots

    Directory of Open Access Journals (Sweden)

    Fowler Katie E

    2012-09-01

    Full Text Available Abstract Background The ability to transport and store DNA at room temperature in low volumes has the advantage of optimising cost, time and storage space. Blood spots on adapted filter papers are popular for this, with FTA (Flinders Technology Associates Whatman™TM technology being one of the most recent. Plant material, plasmids, viral particles, bacteria and animal blood have been stored and transported successfully using this technology, however the method of porcine DNA extraction from FTA Whatman™TM cards is a relatively new approach, allowing nucleic acids to be ready for downstream applications such as PCR, whole genome amplification, sequencing and subsequent application to single nucleotide polymorphism microarrays has hitherto been under-explored. Findings DNA was extracted from FTA Whatman™TM cards (following adaptations of the manufacturer’s instructions, whole genome amplified and subsequently analysed to validate the integrity of the DNA for downstream SNP analysis. DNA was successfully extracted from 288/288 samples and amplified by WGA. Allele dropout post WGA, was observed in less than 2% of samples and there was no clear evidence of amplification bias nor contamination. Acceptable call rates on porcine SNP chips were also achieved using DNA extracted and amplified in this way. Conclusions DNA extracted from FTA Whatman cards is of a high enough quality and quantity following whole genomic amplification to perform meaningful SNP chip studies.

  15. Evaluation of a Two-Stage Approach in Trans-Ethnic Meta-Analysis in Genome-Wide Association Studies.

    Science.gov (United States)

    Hong, Jaeyoung; Lunetta, Kathryn L; Cupples, L Adrienne; Dupuis, Josée; Liu, Ching-Ti

    2016-05-01

    Meta-analysis of genome-wide association studies (GWAS) has achieved great success in detecting loci underlying human diseases. Incorporating GWAS results from diverse ethnic populations for meta-analysis, however, remains challenging because of the possible heterogeneity across studies. Conventional fixed-effects (FE) or random-effects (RE) methods may not be most suitable to aggregate multiethnic GWAS results because of violation of the homogeneous effect assumption across studies (FE) or low power to detect signals (RE). Three recently proposed methods, modified RE (RE-HE) model, binary-effects (BE) model and a Bayesian approach (Meta-analysis of Transethnic Association [MANTRA]), show increased power over FE and RE methods while incorporating heterogeneity of effects when meta-analyzing trans-ethnic GWAS results. We propose a two-stage approach to account for heterogeneity in trans-ethnic meta-analysis in which we clustered studies with cohort-specific ancestry information prior to meta-analysis. We compare this to a no-prior-clustering (crude) approach, evaluating type I error and power of these two strategies, in an extensive simulation study to investigate whether the two-stage approach offers any improvements over the crude approach. We find that the two-stage approach and the crude approach for all five methods (FE, RE, RE-HE, BE, MANTRA) provide well-controlled type I error. However, the two-stage approach shows increased power for BE and RE-HE, and similar power for MANTRA and FE compared to their corresponding crude approach, especially when there is heterogeneity across the multiethnic GWAS results. These results suggest that prior clustering in the two-stage approach can be an effective and efficient intermediate step in meta-analysis to account for the multiethnic heterogeneity.

  16. The genetic association between personality and major depression or bipolar disorder. A polygenic score analysis using genome-wide association data.

    Science.gov (United States)

    Middeldorp, C M; de Moor, M H M; McGrath, L M; Gordon, S D; Blackwood, D H; Costa, P T; Terracciano, A; Krueger, R F; de Geus, E J C; Nyholt, D R; Tanaka, T; Esko, T; Madden, P A F; Derringer, J; Amin, N; Willemsen, G; Hottenga, J-J; Distel, M A; Uda, M; Sanna, S; Spinhoven, P; Hartman, C A; Ripke, S; Sullivan, P F; Realo, A; Allik, J; Heath, A C; Pergadia, M L; Agrawal, A; Lin, P; Grucza, R A; Widen, E; Cousminer, D L; Eriksson, J G; Palotie, A; Barnett, J H; Lee, P H; Luciano, M; Tenesa, A; Davies, G; Lopez, L M; Hansell, N K; Medland, S E; Ferrucci, L; Schlessinger, D; Montgomery, G W; Wright, M J; Aulchenko, Y S; Janssens, A C J W; Oostra, B A; Metspalu, A; Abecasis, G R; Deary, I J; Räikkönen, K; Bierut, L J; Martin, N G; Wray, N R; van Duijn, C M; Smoller, J W; Penninx, B W J H; Boomsma, D I

    2011-10-18

    The relationship between major depressive disorder (MDD) and bipolar disorder (BD) remains controversial. Previous research has reported differences and similarities in risk factors for MDD and BD, such as predisposing personality traits. For example, high neuroticism is related to both disorders, whereas openness to experience is specific for BD. This study examined the genetic association between personality and MDD and BD by applying polygenic scores for neuroticism, extraversion, openness to experience, agreeableness and conscientiousness to both disorders. Polygenic scores reflect the weighted sum of multiple single-nucleotide polymorphism alleles associated with the trait for an individual and were based on a meta-analysis of genome-wide association studies for personality traits including 13,835 subjects. Polygenic scores were tested for MDD in the combined Genetic Association Information Network (GAIN-MDD) and MDD2000+ samples (N=8921) and for BD in the combined Systematic Treatment Enhancement Program for Bipolar Disorder and Wellcome Trust Case-Control Consortium samples (N=6329) using logistic regression analyses. At the phenotypic level, personality dimensions were associated with MDD and BD. Polygenic neuroticism scores were significantly positively associated with MDD, whereas polygenic extraversion scores were significantly positively associated with BD. The explained variance of MDD and BD, ∼0.1%, was highly comparable to the variance explained by the polygenic personality scores in the corresponding personality traits themselves (between 0.1 and 0.4%). This indicates that the proportions of variance explained in mood disorders are at the upper limit of what could have been expected. This study suggests shared genetic risk factors for neuroticism and MDD on the one hand and for extraversion and BD on the other.

  17. Identifying Loci for the Overlap between Attention-Deficit/Hyperactivity Disorder and Autism Spectrum Disorder Using a Genome-Wide QTL Linkage Approach

    Science.gov (United States)

    Nijmeijer, Judith S.; Arias-Vasquez, Alejandro; Rommelse, Nanda N. J.; Altink, Marieke E.; Anney, Richard J. L.; Asherson, Philip; Banaschewski, Tobias; Buschgens, Cathelijne J. M.; Fliers, Ellen A.; Gill, Michael; Minderaa, Ruud B.; Poustka, Luise; Sergeant, Joseph A.; Buitelaar, Jan K.; Franke, Barbara; Ebstein, Richard P.; Miranda, Ana; Mulas, Fernando; Oades, Robert D.; Roeyers, Herbert; Rothenberger, Aribert; Sonuga-Barke, Edmund J. S.; Steinhausen, Hans-Christoph; Faraone, Stephen V.; Hartman, Catharina A.; Hoekstra, Pieter J.

    2010-01-01

    Objective: The genetic basis for autism spectrum disorder (ASD) symptoms in children with attention-deficit/hyperactivity disorder (ADHD) was addressed using a genome-wide linkage approach. Method: Participants of the International Multi-Center ADHD Genetics study comprising 1,143 probands with ADHD and 1,453 siblings were analyzed. The total and…

  18. Identifying Loci for the Overlap Between Attention-Deficit/Hyperactivity Disorder and Autism Spectrum Disorder Using a Genome-wide QTL Linkage Approach

    NARCIS (Netherlands)

    Nijmeijer, Judith S.; Arias-Vasquez, Alejandro; Rommelse, Nanda N. J.; Altink, Marieke E.; Anney, Richard J. L.; Asherson, Philip; Banaschewski, Tobias; Buschgens, Cathelijne J. M.; Fliers, Ellen A.; Gill, Michael; Minderaa, Ruud B.; Poustka, Luise; Sergeant, Joseph A.; Buitelaar, Jan K.; Franke, Barbara; Ebstein, Richard P.; Miranda, Ana; Mulas, Fernando; Oades, Robert D.; Roeyers, Herbert; Rothenberger, Aribert; Sonuga-Barke, Edmund J. S.; Steinhausen, Hans-Christoph; Faraone, Stephen. V.; Hartman, Catharina A.; Hoekstra, Pieter J.

    Objective: The genetic basis for autism spectrum disorder (ASD) symptoms in children with attention-deficit/hyperactivity disorder (ADHD) was addressed using a genome-wide linkage approach. Method: Participants of the International Multi-Center ADHD Genetics study comprising 1,143 probands with ADHD

  19. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance

    NARCIS (Netherlands)

    Manning, Alisa K.; Hivert, Marie-France; Scott, Robert A.; Grimsby, Jonna L.; Bouatia-Naji, Nabila; Chen, Han; Rybin, Denis; Liu, Ching-Ti; Bielak, Lawrence F.; Prokopenko, Inga; Amin, Najaf; Barnes, Daniel; Cadby, Gemma; Hottenga, Jouke-Jan; Ingelsson, Erik; Jackson, Anne U.; Johnson, Toby; Kanoni, Stavroula; Ladenvall, Claes; Lagou, Vasiliki; Lahti, Jari; Lecoeur, Cecile; Liu, Yongmei; Martinez-Larrad, Maria Teresa; Montasser, May E.; Navarro, Pau; Perry, John R. B.; Rasmussen-Torvik, Laura J.; Salo, Perttu; Sattar, Naveed; Shungin, Dmitry; Strawbridge, Rona J.; Tanaka, Toshiko; van Duijn, Cornelia M.; An, Ping; de Andrade, Mariza; Andrews, Jeanette S.; Aspelund, Thor; Atalay, Mustafa; Aulchenko, Yurii; Balkau, Beverley; Bandinelli, Stefania; Beckmann, Jacques S.; Beilby, John P.; Bellis, Claire; Bergman, Richard N.; Blangero, John; Boban, Mladen; Boehnke, Michael; Boerwinkle, Eric; Bonnycastle, Lori L.; Boomsma, Dorret I.; Borecki, Ingrid B.; Boettcher, Yvonne; Bouchard, Claude; Brunner, Eric; Budimir, Danijela; Campbell, Harry; Carlson, Olga; Chines, Peter S.; Clarke, Robert; Collins, Francis S.; Corbaton-Anchuelo, Arturo; Couper, David; de Faire, Ulf; Dedoussis, George V.; Deloukas, Panos; Dimitriou, Maria; Egan, Josephine M.; Eiriksdottir, Gudny; Erdos, Michael R.; Eriksson, Johan G.; Eury, Elodie; Ferrucci, Luigi; Ford, Ian; Forouhi, Nita G.; Fox, Caroline S.; Franzosi, Maria Grazia; Franks, Paul W.; Frayling, Timothy M.; Froguel, Philippe; Galan, Pilar; de Geus, Eco; Gigante, Bruna; Glazer, Nicole L.; Goel, Anuj; Groop, Leif; Gudnason, Vilmundur; Hallmans, Goeran; Hamsten, Anders; Hansson, Ola; Harris, Tamara B.; Hayward, Caroline; Heath, Simon; Hercberg, Serge; Hicks, Andrew A.; Hingorani, Aroon; Hofman, Albert; Hui, Jennie; Hung, Joseph; Jarvelin, Marjo-Riitta; Jhun, Min A.; Johnson, Paul C. D.; Jukema, J. Wouter; Jula, Antti; Kao, W. H.; Kaprio, Jaakko; Kardia, Sharon L. R.; Keinanen-Kiukaanniemi, Sirkka; Kivimaki, Mika; Kolcic, Ivana; Kovacs, Peter; Kumari, Meena; Kuusisto, Johanna; Kyvik, Kirsten Ohm; Laakso, Markku; Lakka, Timo; Lannfelt, Lars; Lathrop, G. Mark; Launer, Lenore J.; Leander, Karin; Li, Guo; Lind, Lars; Lindstrom, Jaana; Lobbens, Stephane; Loos, Ruth J. F.; Luan, Jian'an; Lyssenko, Valeriya; Magi, Reedik; Magnusson, Patrik K. E.; Marmot, Michael; Meneton, Pierre; Mohlke, Karen L.; Mooser, Vincent; Morken, Mario A.; Miljkovic, Iva; Narisu, Narisu; O'Connell, Jeff; Ong, Ken K.; Oostra, Ben A.; Palmer, Lyle J.; Palotie, Aarno; Pankow, James S.; Peden, John F.; Pedersen, Nancy L.; Pehlic, Marina; Peltonen, Leena; Penninx, Brenda; Pericic, Marijana; Perola, Markus; Perusse, Louis; Peyser, Patricia A.; Polasek, Ozren; Pramstaller, Peter P.; Province, Michael A.; Raikkonen, Katri; Rauramaa, Rainer; Rehnberg, Emil; Rice, Ken; Rotter, Jerome I.; Rudan, Igor; Ruokonen, Aimo; Saaristo, Timo; Sabater-Lleal, Maria; Salomaa, Veikko; Savage, David B.; Saxena, Richa; Schwarz, Peter; Seedorf, Udo; Sennblad, Bengt; Serrano-Rios, Manuel; Shuldiner, Alan R.; Sijbrands, Eric J. G.; Siscovick, David S.; Smit, Johannes H.; Small, Kerrin S.; Smith, Nicholas L.; Smith, Albert Vernon; Stancakova, Alena; Stirrups, Kathleen; Stumvoll, Michael; Sun, Yan V.; Swift, Amy J.; Toenjes, Anke; Tuomilehto, Jaakko; Trompet, Stella; Uitterlinden, Andre G.; Uusitupa, Matti; Vikstrom, Max; Vitart, Veronique; Vohl, Marie-Claude; Voight, Benjamin F.; Vollenweider, Peter; Waeber, Gerard; Waterworth, Dawn M.; Watkins, Hugh; Wheeler, Eleanor; Widen, Elisabeth; Wild, Sarah H.; Willems, Sara M.; Willemsen, Gonneke; Wilson, James F.; Witteman, Jacqueline C. M.; Wright, Alan F.; Yaghootkar, Hanieh; Zelenika, Diana; Zemunik, Tatijana; Zgaga, Lina; Wareham, Nicholas J.; McCarthy, Mark I.; Barroso, Ines; Watanabe, Richard M.; Florez, Jose C.; Dupuis, Josee; Meigs, James B.; Langenberg, Claudia

    2012-01-01

    Recent genome-wide association studies have described many loci implicated in type 2 diabetes (T2D) pathophysiology and beta-cell dysfunction but have contributed little to the understanding of the genetic basis of insulin resistance. We hypothesized that genes implicated in insulin resistance pathw

  20. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance

    DEFF Research Database (Denmark)

    Manning, Alisa K; Hivert, Marie-France; Scott, Robert A

    2012-01-01

    Recent genome-wide association studies have described many loci implicated in type 2 diabetes (T2D) pathophysiology and β-cell dysfunction but have contributed little to the understanding of the genetic basis of insulin resistance. We hypothesized that genes implicated in insulin resistance...... in insulin resistance pathways. The discovery of these loci will aid further characterization of the role of insulin resistance in T2D pathophysiology....

  1. Genome-Wide Locations of Potential Epimutations Associated with Environmentally Induced Epigenetic Transgenerational Inheritance of Disease Using a Sequential Machine Learning Prediction Approach.

    Science.gov (United States)

    Haque, M Muksitul; Holder, Lawrence B; Skinner, Michael K

    2015-01-01

    Environmentally induced epigenetic transgenerational inheritance of disease and phenotypic variation involves germline transmitted epimutations. The primary epimutations identified involve altered differential DNA methylation regions (DMRs). Different environmental toxicants have been shown to promote exposure (i.e., toxicant) specific signatures of germline epimutations. Analysis of genomic features associated with these epimutations identified low-density CpG regions (learning computational approach to predict all potential epimutations in the genome. A number of previously identified sperm epimutations were used as training sets. A novel machine learning approach using a sequential combination of Active Learning and Imbalance Class Learner analysis was developed. The transgenerational sperm epimutation analysis identified approximately 50K individual sites with a 1 kb mean size and 3,233 regions that had a minimum of three adjacent sites with a mean size of 3.5 kb. A select number of the most relevant genomic features were identified with the low density CpG deserts being a critical genomic feature of the features selected. A similar independent analysis with transgenerational somatic cell epimutation training sets identified a smaller number of 1,503 regions of genome-wide predicted sites and differences in genomic feature contributions. The predicted genome-wide germline (sperm) epimutations were found to be distinct from the predicted somatic cell epimutations. Validation of the genome-wide germline predicted sites used two recently identified transgenerational sperm epimutation signature sets from the pesticides dichlorodiphenyltrichloroethane (DDT) and methoxychlor (MXC) exposure lineage F3 generation. Analysis of this positive validation data set showed a 100% prediction accuracy for all the DDT-MXC sperm epimutations. Observations further elucidate the genomic features associated with transgenerational germline epimutations and identify a genome-wide set

  2. Systems genetics and genome-wide association approaches for analysis of feed intake, feed efficiency, and performance in beef cattle

    DEFF Research Database (Denmark)

    Santana, M. H. A.; Freua, M. C.; Do, D. N.

    2016-01-01

    and 943,577 single nucleotide polymorphisms (SNPs), a genome-wide association analysis was performed for dry matter intake, average daily weight gain, feed conversion ratio, and residual feed intake with a Bayesian Lasso procedure. Genes within 50-kb SNPs, most relevant for explaining the genomic variance......Feed intake, feed efficiency, and weight gain are important economic traits of beef cattle in feed lots. In the present study, we investigated the physiological processes underlying such traits from the point of view of systems genetics. Firstly, using data from 1334 Nellore (Bos indicus) cattle...... phenotypes and showed that almost all genomic variances were in the SNPs located in the intergenic and intronic regions. We further identified five main metabolic pathways related to ion transport, body composition, and feed intake control, which influenced the four phenotypes simultaneously. The systems...

  3. Genome-wide search for miRNA-target interactions in Arabidopsis thaliana with an integrated approach

    Directory of Open Access Journals (Sweden)

    Ding Jiandong

    2012-06-01

    Full Text Available Abstract Background MiRNA are about 22nt long small noncoding RNAs that post transcriptionally regulate gene expression in animals, plants and protozoa. Confident identification of MiRNA-Target Interactions (MTI is vital to understand their function. Currently, several integrated computational programs and databases are available for animal miRNAs, the mechanisms of which are significantly different from plant miRNAs. Methods Here we present an integrated MTI prediction and analysis toolkit (imiRTP for Arabidopsis thaliana. It features two important functions: (i combination of several effective plant miRNA target prediction methods provides a sufficiently large MTI candidate set, and (ii different filters allow for an efficient selection of potential targets. The modularity of imiRTP enables the prediction of high quality targets on genome-wide scale. Moreover, predicted MTIs can be presented in various ways, which allows for browsing through the putative target sites as well as conducting simple and advanced analyses. Results Results show that imiRTP could always find high quality candidates compared with single method by choosing appropriate filter and parameter. And we also reveal that a portion of plant miRNA could bind target genes out of coding region. Based on our results, imiRTP could facilitate the further study of Arabidopsis miRNAs in real use. All materials of imiRTP are freely available under a GNU license at (http://admis.fudan.edu.cn/projects/imiRTP.htm.

  4. A Genetic Predictive Model for Canine Hip Dysplasia: Integration of Genome Wide Association Study (GWAS) and Candidate Gene Approaches

    Science.gov (United States)

    Bartolomé, Nerea; Segarra, Sergi; Artieda, Marta; Francino, Olga; Sánchez, Elisenda; Szczypiorska, Magdalena; Casellas, Joaquim; Tejedor, Diego; Cerdeira, Joaquín; Martínez, Antonio; Velasco, Alfonso; Sánchez, Armand

    2015-01-01

    Canine hip dysplasia is one of the most prevalent developmental orthopedic diseases in dogs worldwide. Unfortunately, the success of eradication programs against this disease based on radiographic diagnosis is low. Adding the use of diagnostic genetic tools to the current phenotype-based approach might be beneficial. The aim of this study was to develop a genetic prognostic test for early diagnosis of hip dysplasia in Labrador Retrievers. To develop our DNA test, 775 Labrador Retrievers were recruited. For each dog, a blood sample and a ventrodorsal hip radiograph were taken. Dogs were divided into two groups according to their FCI hip score: control (A/B) and case (D/E). C dogs were not included in the sample. Genetic characterization combining a GWAS and a candidate gene strategy using SNPs allowed a case-control population association study. A mathematical model which included 7 SNPs was developed using logistic regression. The model showed a good accuracy (Area under the ROC curve = 0.85) and was validated in an independent population of 114 dogs. This prognostic genetic test represents a useful tool for choosing the most appropriate therapeutic approach once genetic predisposition to hip dysplasia is known. Therefore, it allows a more individualized management of the disease. It is also applicable during genetic selection processes, since breeders can benefit from the information given by this test as soon as a blood sample can be collected, and act accordingly. In the authors’ opinion, a shift towards genomic screening might importantly contribute to reducing canine hip dysplasia in the future. In conclusion, based on genetic and radiographic information from Labrador Retrievers with hip dysplasia, we developed an accurate predictive genetic test for early diagnosis of hip dysplasia in Labrador Retrievers. However, further research is warranted in order to evaluate the validity of this genetic test in other dog breeds. PMID:25874693

  5. A genetic predictive model for canine hip dysplasia: integration of Genome Wide Association Study (GWAS and candidate gene approaches.

    Directory of Open Access Journals (Sweden)

    Nerea Bartolomé

    Full Text Available Canine hip dysplasia is one of the most prevalent developmental orthopedic diseases in dogs worldwide. Unfortunately, the success of eradication programs against this disease based on radiographic diagnosis is low. Adding the use of diagnostic genetic tools to the current phenotype-based approach might be beneficial. The aim of this study was to develop a genetic prognostic test for early diagnosis of hip dysplasia in Labrador Retrievers. To develop our DNA test, 775 Labrador Retrievers were recruited. For each dog, a blood sample and a ventrodorsal hip radiograph were taken. Dogs were divided into two groups according to their FCI hip score: control (A/B and case (D/E. C dogs were not included in the sample. Genetic characterization combining a GWAS and a candidate gene strategy using SNPs allowed a case-control population association study. A mathematical model which included 7 SNPs was developed using logistic regression. The model showed a good accuracy (Area under the ROC curve = 0.85 and was validated in an independent population of 114 dogs. This prognostic genetic test represents a useful tool for choosing the most appropriate therapeutic approach once genetic predisposition to hip dysplasia is known. Therefore, it allows a more individualized management of the disease. It is also applicable during genetic selection processes, since breeders can benefit from the information given by this test as soon as a blood sample can be collected, and act accordingly. In the authors' opinion, a shift towards genomic screening might importantly contribute to reducing canine hip dysplasia in the future. In conclusion, based on genetic and radiographic information from Labrador Retrievers with hip dysplasia, we developed an accurate predictive genetic test for early diagnosis of hip dysplasia in Labrador Retrievers. However, further research is warranted in order to evaluate the validity of this genetic test in other dog breeds.

  6. A genetic predictive model for canine hip dysplasia: integration of Genome Wide Association Study (GWAS) and candidate gene approaches.

    Science.gov (United States)

    Bartolomé, Nerea; Segarra, Sergi; Artieda, Marta; Francino, Olga; Sánchez, Elisenda; Szczypiorska, Magdalena; Casellas, Joaquim; Tejedor, Diego; Cerdeira, Joaquín; Martínez, Antonio; Velasco, Alfonso; Sánchez, Armand

    2015-01-01

    Canine hip dysplasia is one of the most prevalent developmental orthopedic diseases in dogs worldwide. Unfortunately, the success of eradication programs against this disease based on radiographic diagnosis is low. Adding the use of diagnostic genetic tools to the current phenotype-based approach might be beneficial. The aim of this study was to develop a genetic prognostic test for early diagnosis of hip dysplasia in Labrador Retrievers. To develop our DNA test, 775 Labrador Retrievers were recruited. For each dog, a blood sample and a ventrodorsal hip radiograph were taken. Dogs were divided into two groups according to their FCI hip score: control (A/B) and case (D/E). C dogs were not included in the sample. Genetic characterization combining a GWAS and a candidate gene strategy using SNPs allowed a case-control population association study. A mathematical model which included 7 SNPs was developed using logistic regression. The model showed a good accuracy (Area under the ROC curve = 0.85) and was validated in an independent population of 114 dogs. This prognostic genetic test represents a useful tool for choosing the most appropriate therapeutic approach once genetic predisposition to hip dysplasia is known. Therefore, it allows a more individualized management of the disease. It is also applicable during genetic selection processes, since breeders can benefit from the information given by this test as soon as a blood sample can be collected, and act accordingly. In the authors' opinion, a shift towards genomic screening might importantly contribute to reducing canine hip dysplasia in the future. In conclusion, based on genetic and radiographic information from Labrador Retrievers with hip dysplasia, we developed an accurate predictive genetic test for early diagnosis of hip dysplasia in Labrador Retrievers. However, further research is warranted in order to evaluate the validity of this genetic test in other dog breeds.

  7. Genome-Wide Locations of Potential Epimutations Associated with Environmentally Induced Epigenetic Transgenerational Inheritance of Disease Using a Sequential Machine Learning Prediction Approach.

    Directory of Open Access Journals (Sweden)

    M Muksitul Haque

    Full Text Available Environmentally induced epigenetic transgenerational inheritance of disease and phenotypic variation involves germline transmitted epimutations. The primary epimutations identified involve altered differential DNA methylation regions (DMRs. Different environmental toxicants have been shown to promote exposure (i.e., toxicant specific signatures of germline epimutations. Analysis of genomic features associated with these epimutations identified low-density CpG regions (<3 CpG / 100bp termed CpG deserts and a number of unique DNA sequence motifs. The rat genome was annotated for these and additional relevant features. The objective of the current study was to use a machine learning computational approach to predict all potential epimutations in the genome. A number of previously identified sperm epimutations were used as training sets. A novel machine learning approach using a sequential combination of Active Learning and Imbalance Class Learner analysis was developed. The transgenerational sperm epimutation analysis identified approximately 50K individual sites with a 1 kb mean size and 3,233 regions that had a minimum of three adjacent sites with a mean size of 3.5 kb. A select number of the most relevant genomic features were identified with the low density CpG deserts being a critical genomic feature of the features selected. A similar independent analysis with transgenerational somatic cell epimutation training sets identified a smaller number of 1,503 regions of genome-wide predicted sites and differences in genomic feature contributions. The predicted genome-wide germline (sperm epimutations were found to be distinct from the predicted somatic cell epimutations. Validation of the genome-wide germline predicted sites used two recently identified transgenerational sperm epimutation signature sets from the pesticides dichlorodiphenyltrichloroethane (DDT and methoxychlor (MXC exposure lineage F3 generation. Analysis of this positive validation

  8. A genome-wide screening and SNPs-to-genes approach to identify novel genetic risk factors associated with frontotemporal dementia

    Science.gov (United States)

    Ferrari, Raffaele; Grassi, Mario; Salvi, Erika; Borroni, Barbara; Palluzzi, Fernando; Pepe, Daniele; D'Avila, Francesca; Padovani, Alessandro; Archetti, Silvana; Rainero, Innocenzo; Rubino, Elisa; Pinessi, Lorenzo; Benussi, Luisa; Binetti, Giuliano; Ghidoni, Roberta; Galimberti, Daniela; Scarpini, Elio; Serpente, Maria; Rossi, Giacomina; Giaccone, Giorgio; Tagliavini, Fabrizio; Nacmias, Benedetta; Piaceri, Irene; Bagnoli, Silvia; Bruni, Amalia C.; Maletta, Raffaele G.; Bernardi, Livia; Postiglione, Alfredo; Milan, Graziella; Franceschi, Massimo; Puca, Annibale A.; Novelli, Valeria; Barlassina, Cristina; Glorioso, Nicola; Manunta, Paolo; Singleton, Andrew; Cusi, Daniele; Hardy, John; Momeni, Parastoo

    2015-01-01

    Frontotemporal dementia (FTD) is the second most prevalent form of early onset dementia after Alzheimer's disease (AD). We performed a case-control association study in an Italian FTD cohort (n = 530) followed by the novel single nucleotide polymorphisms (SNPs)-to-genes approach and functional annotation analysis. We identified 2 novel potential loci for FTD. Suggestive SNPs reached p-values ∼10−7 and odds ratio > 2.5 (2p16.3) and 1.5 (17q25.3). Suggestive alleles at 17q25.3 identified a disease-associated haplotype causing decreased expression of –cis genes such as RFNG and AATK involved in neuronal genesis and differentiation and axon outgrowth, respectively. We replicated this locus through the SNPs-to-genes approach. Our functional annotation analysis indicated significant enrichment for functions of the brain (neuronal genesis, differentiation, and maturation), the synapse (neurotransmission and synapse plasticity), and elements of the immune system, the latter supporting our recent international FTD–genome-wide association study. This is the largest genome-wide study in Italian FTD to date. Although our results are not conclusive, we set the basis for future replication studies and identification of susceptible molecular mechanisms involved in FTD pathogenesis. PMID:26154020

  9. A genome-wide screening and SNPs-to-genes approach to identify novel genetic risk factors associated with frontotemporal dementia.

    Science.gov (United States)

    Ferrari, Raffaele; Grassi, Mario; Salvi, Erika; Borroni, Barbara; Palluzzi, Fernando; Pepe, Daniele; D'Avila, Francesca; Padovani, Alessandro; Archetti, Silvana; Rainero, Innocenzo; Rubino, Elisa; Pinessi, Lorenzo; Benussi, Luisa; Binetti, Giuliano; Ghidoni, Roberta; Galimberti, Daniela; Scarpini, Elio; Serpente, Maria; Rossi, Giacomina; Giaccone, Giorgio; Tagliavini, Fabrizio; Nacmias, Benedetta; Piaceri, Irene; Bagnoli, Silvia; Bruni, Amalia C; Maletta, Raffaele G; Bernardi, Livia; Postiglione, Alfredo; Milan, Graziella; Franceschi, Massimo; Puca, Annibale A; Novelli, Valeria; Barlassina, Cristina; Glorioso, Nicola; Manunta, Paolo; Singleton, Andrew; Cusi, Daniele; Hardy, John; Momeni, Parastoo

    2015-10-01

    Frontotemporal dementia (FTD) is the second most prevalent form of early onset dementia after Alzheimer's disease (AD). We performed a case-control association study in an Italian FTD cohort (n = 530) followed by the novel single nucleotide polymorphisms (SNPs)-to-genes approach and functional annotation analysis. We identified 2 novel potential loci for FTD. Suggestive SNPs reached p-values ∼10(-7) and odds ratio > 2.5 (2p16.3) and 1.5 (17q25.3). Suggestive alleles at 17q25.3 identified a disease-associated haplotype causing decreased expression of -cis genes such as RFNG and AATK involved in neuronal genesis and differentiation and axon outgrowth, respectively. We replicated this locus through the SNPs-to-genes approach. Our functional annotation analysis indicated significant enrichment for functions of the brain (neuronal genesis, differentiation, and maturation), the synapse (neurotransmission and synapse plasticity), and elements of the immune system, the latter supporting our recent international FTD-genome-wide association study. This is the largest genome-wide study in Italian FTD to date. Although our results are not conclusive, we set the basis for future replication studies and identification of susceptible molecular mechanisms involved in FTD pathogenesis.

  10. A combined genome-wide linkage and association approach to find susceptibility loci for platelet function phenotypes in European American and African American families with coronary artery disease

    Directory of Open Access Journals (Sweden)

    Wilson Alexander F

    2010-06-01

    Full Text Available Abstract Background The inability of aspirin (ASA to adequately suppress platelet aggregation is associated with future risk of coronary artery disease (CAD. Heritability studies of agonist-induced platelet function phenotypes suggest that genetic variation may be responsible for ASA responsiveness. In this study, we leverage independent information from genome-wide linkage and association data to determine loci controlling platelet phenotypes before and after treatment with ASA. Methods Clinical data on 37 agonist-induced platelet function phenotypes were evaluated before and after a 2-week trial of ASA (81 mg/day in 1231 European American and 846 African American healthy subjects with a family history of premature CAD. Principal component analysis was performed to minimize the number of independent factors underlying the covariance of these various phenotypes. Multi-point sib-pair based linkage analysis was performed using a microsatellite marker set, and single-SNP association tests were performed using markers from the Illumina 1 M genotyping chip from deCODE Genetics, Inc. All analyses were performed separately within each ethnic group. Results Several genomic regions appear to be linked to ASA response factors: a 10 cM region in African Americans on chromosome 5q11.2 had several STRs with suggestive (p-value -4 and significant (p-value -5 linkage to post aspirin platelet response to ADP, and ten additional factors had suggestive evidence for linkage (p-value -4 to thirteen genomic regions. All but one of these factors were aspirin response variables. While the strength of genome-wide SNP association signals for factors showing evidence for linkage is limited, especially at the strict thresholds of genome-wide criteria (N = 9 SNPs for 11 factors, more signals were considered significant when the association signal was weighted by evidence for linkage (N = 30 SNPs. Conclusions Our study supports the hypothesis that platelet phenotypes in

  11. Identification of a common variant affecting human episodic memory performance using a pooled genome-wide association approach: a case study of disease gene identification.

    Science.gov (United States)

    Pawlowski, Traci L; Huentelman, Matthew J

    2011-01-01

    Genome-wide association studies (GWAS) are an important tool for discovering novel genes associated with disease or traits. Careful design of case-control groups greatly facilitates the efficacy of these studies. Here we describe a pooled GWAS study undertaken to find novel genes associated with human episodic memory performance. A genomic locus for the WW and C2 domain-containing 1 protein, KIBRA (also known as WWC1), was found to be associated with memory performance in three cognitively normal cohorts from Switzerland and the USA. This result was further supported by correlation of KIBRA genotype and differences in hippocampal activation as measured by functional magnetic resonance imaging (fMRI). These findings provide an excellent example of the application of GWAS using a pooled genomic DNA approach to successfully identify a locus with strong effects on human memory.

  12. On genome-wide association studies for family-based designs: an integrative analysis approach combining ascertained family samples with unselected controls.

    NARCIS (Netherlands)

    Lasky-Su, J.; Won, S.; Mick, E.; Anney, R.J.; Franke, B.; Neale, B.; Biederman, J.; Smalley, S.L.; Loo, S.K.; Todorov, A.; Faraone, S.V.; Weiss, S.T.; Lange, C.

    2010-01-01

    Large numbers of control individuals with genome-wide genotype data are now available through various databases. These controls are regularly used in case-control genome-wide association studies (GWAS) to increase the statistical power. Controls are often "unselected" for the disease of interest and

  13. Inferring Population Size History from Large Samples of Genome-Wide Molecular Data - An Approximate Bayesian Computation Approach.

    Directory of Open Access Journals (Sweden)

    Simon Boitard

    2016-03-01

    Full Text Available Inferring the ancestral dynamics of effective population size is a long-standing question in population genetics, which can now be tackled much more accurately thanks to the massive genomic data available in many species. Several promising methods that take advantage of whole-genome sequences have been recently developed in this context. However, they can only be applied to rather small samples, which limits their ability to estimate recent population size history. Besides, they can be very sensitive to sequencing or phasing errors. Here we introduce a new approximate Bayesian computation approach named PopSizeABC that allows estimating the evolution of the effective population size through time, using a large sample of complete genomes. This sample is summarized using the folded allele frequency spectrum and the average zygotic linkage disequilibrium at different bins of physical distance, two classes of statistics that are widely used in population genetics and can be easily computed from unphased and unpolarized SNP data. Our approach provides accurate estimations of past population sizes, from the very first generations before present back to the expected time to the most recent common ancestor of the sample, as shown by simulations under a wide range of demographic scenarios. When applied to samples of 15 or 25 complete genomes in four cattle breeds (Angus, Fleckvieh, Holstein and Jersey, PopSizeABC revealed a series of population declines, related to historical events such as domestication or modern breed creation. We further highlight that our approach is robust to sequencing errors, provided summary statistics are computed from SNPs with common alleles.

  14. An integrated transcriptomics-guided genome-wide promoter analysis and next-generation proteomics approach to mine factor(s) regulating cellular differentiation

    Science.gov (United States)

    Mandal, Kamal; Bader, Samuel L.; Kumar, Pankaj; Malakar, Dipankar; Campbell, David S.; Pradhan, Bhola Shankar; Sarkar, Rajesh K.; Wadhwa, Neerja; Sensharma, Souvik; Jain, Vaibhav; Moritz, Robert L.

    2017-01-01

    Abstract Differential next-generation-omics approaches aid in the visualization of biological processes and pave the way for divulging important events and/or interactions leading to a functional output at cellular or systems level. To this end, we undertook an integrated Nextgen transcriptomics and proteomics approach to divulge differential gene expression of infant and pubertal rat Sertoli cells (Sc).Unlike, pubertal Sc, infant Sc are immature and fail to support spermatogenesis. We found exclusive association of 14 and 19 transcription factor binding sites to infantile and pubertal states of Sc, respectively, using differential transcriptomics-guided genome-wide computational analysis of relevant promoters employing 220 Positional Weight Matrices from the TRANSFAC database. Proteomic SWATH-MS analysis provided extensive quantification of nuclear and cytoplasmic protein fractions revealing 1,670 proteins differentially located between the nucleus and cytoplasm of infant Sc and 890 proteins differentially located within those of pubertal Sc. Based on our multi-omics approach, the transcription factor YY1 was identified as one of the lead candidates regulating differentiation of Sc.YY1 was found to have abundant binding sites on promoters of genes upregulated during puberty. To determine its significance, we generated transgenic rats with Sc specific knockdown of YY1 that led to compromised spermatogenesis. PMID:28065881

  15. A Novel Genome-Wide Association Study Approach Using Genotyping by Exome Sequencing Leads to the Identification of a Primary Open Angle Glaucoma Associated Inversion Disrupting ADAMTS17.

    Directory of Open Access Journals (Sweden)

    Oliver P Forman

    Full Text Available Closed breeding populations in the dog in conjunction with advances in gene mapping and sequencing techniques facilitate mapping of autosomal recessive diseases and identification of novel disease-causing variants, often using unorthodox experimental designs. In our investigation we demonstrate successful mapping of the locus for primary open angle glaucoma in the Petit Basset Griffon Vendéen dog breed with 12 cases and 12 controls, using a novel genotyping by exome sequencing approach. The resulting genome-wide association signal was followed up by genome sequencing of an individual case, leading to the identification of an inversion with a breakpoint disrupting the ADAMTS17 gene. Genotyping of additional controls and expression analysis provide strong evidence that the inversion is disease causing. Evidence of cryptic splicing resulting in novel exon transcription as a consequence of the inversion in ADAMTS17 is identified through RNAseq experiments. This investigation demonstrates how a novel genotyping by exome sequencing approach can be used to map an autosomal recessive disorder in the dog, with the use of genome sequencing to facilitate identification of a disease-associated variant.

  16. Handling Permutation in Sequence Comparison: Genome-Wide Enhancer Prediction in Vertebrates by a Novel Non-Linear Alignment Scoring Principle.

    Directory of Open Access Journals (Sweden)

    Dirk Dolle

    Full Text Available Enhancers have been described to evolve by permutation without changing function. This has posed the problem of how to predict enhancer elements that are hidden from alignment-based approaches due to the loss of co-linearity. Alignment-free algorithms have been proposed as one possible solution. However, this approach is hampered by several problems inherent to its underlying working principle. Here we present a new approach, which combines the power of alignment and alignment-free techniques into one algorithm. It allows the prediction of enhancers based on the query and target sequence only, no matter whether the regulatory logic is co-linear or reshuffled. To test our novel approach, we employ it for the prediction of enhancers across the evolutionary distance of ~450Myr between human and medaka. We demonstrate its efficacy by subsequent in vivo validation resulting in 82% (9/11 of the predicted medaka regions showing reporter activity. These include five candidates with partially co-linear and four with reshuffled motif patterns. Orthology in flanking genes and conservation of the detected co-linear motifs indicates that those candidates are likely functionally equivalent enhancers. In sum, our results demonstrate that the proposed principle successfully predicts mutated as well as permuted enhancer regions at an encouragingly high rate.

  17. Unraveling the rat blood genome-wide transcriptome after oral administration of lavender oil by a two-color dye-swap DNA microarray approach

    Directory of Open Access Journals (Sweden)

    Motohide Hori

    2016-06-01

    Full Text Available Lavender oil (LO is a commonly used essential oil in aromatherapy as non-traditional medicine. With an aim to demonstrate LO effects on the body, we have recently established an animal model investigating the influence of orally administered LO in rat tissues, genome-wide. In this brief, we investigate the effect of LO ingestion in the blood of rat. Rats were administered LO at usual therapeutic dose (5 mg/kg in humans, and following collection of the venous blood from the heart and extraction of total RNA, the differentially expressed genes were screened using a 4 × 44-K whole-genome rat chip (Agilent microarray platform; Agilent Technologies, Palo Alto, CA, USA in conjunction with a two-color dye-swap approach. A total of 834 differentially expressed genes in the blood were identified: 362 up-regulated and 472 down-regulated. These genes were functionally categorized using bioinformatics tools. The gene expression inventory of rat blood transcriptome under LO, a first report, has been deposited into the Gene Expression Omnibus (GEO: GSE67499. The data will be a valuable resource in examining the effects of natural products, and which could also serve as a human model for further functional analysis and investigation.

  18. Genome-wide association study of clinical dimensions of schizophrenia

    DEFF Research Database (Denmark)

    Fanous, Ayman H; Zhou, Baiyu; Aggen, Steven H;

    2012-01-01

    Multiple sources of evidence suggest that genetic factors influence variation in clinical features of schizophrenia. The authors present the first genome-wide association study (GWAS) of dimensional symptom scores among individuals with schizophrenia....

  19. Genetic dissection of drought and heat tolerance in chickpea through genome-wide and candidate gene-based association mapping approaches.

    Directory of Open Access Journals (Sweden)

    Mahendar Thudi

    Full Text Available To understand the genetic basis of tolerance to drought and heat stresses in chickpea, a comprehensive association mapping approach has been undertaken. Phenotypic data were generated on the reference set (300 accessions, including 211 mini-core collection accessions for drought tolerance related root traits, heat tolerance, yield and yield component traits from 1-7 seasons and 1-3 locations in India (Patancheru, Kanpur, Bangalore and three locations in Africa (Nairobi, Egerton in Kenya and Debre Zeit in Ethiopia. Diversity Array Technology (DArT markers equally distributed across chickpea genome were used to determine population structure and three sub-populations were identified using admixture model in STRUCTURE. The pairwise linkage disequilibrium (LD estimated using the squared-allele frequency correlations (r2; when r2<0.20 was found to decay rapidly with the genetic distance of 5 cM. For establishing marker-trait associations (MTAs, both genome-wide and candidate gene-sequencing based association mapping approaches were conducted using 1,872 markers (1,072 DArTs, 651 single nucleotide polymorphisms [SNPs], 113 gene-based SNPs and 36 simple sequence repeats [SSRs] and phenotyping data mentioned above employing mixed linear model (MLM analysis with optimum compression with P3D method and kinship matrix. As a result, 312 significant MTAs were identified and a maximum number of MTAs (70 was identified for 100-seed weight. A total of 18 SNPs from 5 genes (ERECTA, 11 SNPs; ASR, 4 SNPs; DREB, 1 SNP; CAP2 promoter, 1 SNP and AMDH, 1SNP were significantly associated with different traits. This study provides significant MTAs for drought and heat tolerance in chickpea that can be used, after validation, in molecular breeding for developing superior varieties with enhanced drought and heat tolerance.

  20. A novel MMP12 locus is associated with large artery atherosclerotic stroke using a genome-wide age-at-onset informed approach.

    Directory of Open Access Journals (Sweden)

    Matthew Traylor

    2014-07-01

    Full Text Available Genome-wide association studies (GWAS have begun to identify the common genetic component to ischaemic stroke (IS. However, IS has considerable phenotypic heterogeneity. Where clinical covariates explain a large fraction of disease risk, covariate informed designs can increase power to detect associations. As prevalence rates in IS are markedly affected by age, and younger onset cases may have higher genetic predisposition, we investigated whether an age-at-onset informed approach could detect novel associations with IS and its subtypes; cardioembolic (CE, large artery atherosclerosis (LAA and small vessel disease (SVD in 6,778 cases of European ancestry and 12,095 ancestry-matched controls. Regression analysis to identify SNP associations was performed on posterior liabilities after conditioning on age-at-onset and affection status. We sought further evidence of an association with LAA in 1,881 cases and 50,817 controls, and examined mRNA expression levels of the nearby genes in atherosclerotic carotid artery plaques. Secondly, we performed permutation analyses to evaluate the extent to which age-at-onset informed analysis improves significance for novel loci. We identified a novel association with an MMP12 locus in LAA (rs660599; p = 2.5×10⁻⁷, with independent replication in a second population (p = 0.0048, OR(95% CI = 1.18(1.05-1.32; meta-analysis p = 2.6×10⁻⁸. The nearby gene, MMP12, was significantly overexpressed in carotid plaques compared to atherosclerosis-free control arteries (p = 1.2×10⁻¹⁵; fold change = 335.6. Permutation analyses demonstrated improved significance for associations when accounting for age-at-onset in all four stroke phenotypes (p<0.001. Our results show that a covariate-informed design, by adjusting for age-at-onset of stroke, can detect variants not identified by conventional GWAS.

  1. Influence of Genetic Variants in EGF and Other Genes on Hematological Traits in Korean Populations by a Genome-Wide Approach

    Directory of Open Access Journals (Sweden)

    Yun Kyoung Kim

    2015-01-01

    Full Text Available Hematological traits are important health indicators and are used as diagnostic clinical parameters for human disorders. Recently, genome-wide association studies (GWAS identified many genetic loci associated with hematological traits in diverse ethnic groups. However, additional GWAS are necessary to elucidate the breadth of genetic variation and the underlying genetic architecture represented by hematological metrics. To identify additional genetic loci influencing hematological traits (such as hematocrit, hemoglobin concentration, white blood cell count, red blood cell count, and platelet count, we conducted GWAS and meta-analyses on data from 12,509 Korean individuals grouped into population-based cohorts. Of interest is EGF, a factor plays a role in the proliferation and differentiation of hematopoietic progenitor cells. We identified a novel EGF variant, which associated with platelet count in our study (Pcombined=2.44×10-15. Our study also replicated 16 genetic associations related to five hematological traits with genome-wide significance (P<5×10-8 that were previously established in other ethnic groups. Of these, variants influencing platelet count are distributed across several genes and have pleiotropic effects in coronary artery disease and dyslipidemia. Our findings may aid in elucidating molecular mechanisms underlying not only hematopoiesis but also inflammatory and cardiovascular diseases.

  2. Profiling genome-wide DNA methylation.

    Science.gov (United States)

    Yong, Wai-Shin; Hsu, Fei-Man; Chen, Pao-Yang

    2016-01-01

    DNA methylation is an epigenetic modification that plays an important role in regulating gene expression and therefore a broad range of biological processes and diseases. DNA methylation is tissue-specific, dynamic, sequence-context-dependent and trans-generationally heritable, and these complex patterns of methylation highlight the significance of profiling DNA methylation to answer biological questions. In this review, we surveyed major methylation assays, along with comparisons and biological examples, to provide an overview of DNA methylation profiling techniques. The advances in microarray and sequencing technologies make genome-wide profiling possible at a single-nucleotide or even a single-cell resolution. These profiling approaches vary in many aspects, such as DNA input, resolution, genomic region coverage, and bioinformatics analysis, and selecting a feasible method requires knowledge of these methods. We first introduce the biological background of DNA methylation and its pattern in plants, animals and fungi. We present an overview of major experimental approaches to profiling genome-wide DNA methylation and hydroxymethylation and then extend to the single-cell methylome. To evaluate these methods, we outline their strengths and weaknesses and perform comparisons across the different platforms. Due to the increasing need to compute high-throughput epigenomic data, we interrogate the computational pipeline for bisulfite sequencing data and also discuss the concept of identifying differentially methylated regions (DMRs). This review summarizes the experimental and computational concepts for profiling genome-wide DNA methylation, followed by biological examples. Overall, this review provides researchers useful guidance for the selection of a profiling method suited to specific research questions.

  3. An endophenotype approach to the genetics of alcohol dependence: a genome wide association study of fast beta EEG in families of African ancestry.

    Science.gov (United States)

    Meyers, J L; Zhang, J; Wang, J C; Su, J; Kuo, S I; Kapoor, M; Wetherill, L; Bertelsen, S; Lai, D; Salvatore, J E; Kamarajan, C; Chorlian, D; Agrawal, A; Almasy, L; Bauer, L; Bucholz, K K; Chan, G; Hesselbrock, V; Koganti, L; Kramer, J; Kuperman, S; Manz, N; Pandey, A; Seay, M; Scott, D; Taylor, R E; Dick, D M; Edenberg, H J; Goate, A; Foroud, T; Porjesz, B

    2017-01-10

    Fast beta (20-28 Hz) electroencephalogram (EEG) oscillatory activity may be a useful endophenotype for studying the genetics of disorders characterized by neural hyperexcitability, including substance use disorders (SUDs). However, the genetic underpinnings of fast beta EEG have not previously been studied in a population of African-American ancestry (AA). In a sample of 2382 AA individuals from 482 families drawn from the Collaborative Study on the Genetics of Alcoholism (COGA), we performed a genome-wide association study (GWAS) on resting-state fast beta EEG power. To further characterize our genetic findings, we examined the functional and clinical/behavioral significance of GWAS variants. Ten correlated single-nucleotide polymorphisms (SNPs) (r(2)>0.9) located in an intergenic region on chromosome 3q26 were associated with fast beta EEG power at PMolecular Psychiatry advance online publication, 10 January 2017; doi:10.1038/mp.2016.239.

  4. Detecting correlation between allele frequencies and environmental variables as a signature of selection. A fast computational approach for genome-wide studies

    DEFF Research Database (Denmark)

    Guillot, Gilles; Vitalis, Renaud; Rouzic, Arnaud le;

    2014-01-01

    Genomic regions (or loci) displaying outstanding correlation with some environmental variables are likely to be under selection and this is the rationale of recent methods of identifying selected loci and retrieving functional information about them. To be efficient, such methods need to be able...... to disentangle the potential effect of environmental variables from the confounding effect of population history. For the routine analysis of genome-wide datasets, one also needs fast inference and model selection algorithms. We propose a method based on an explicit spatial model which is an instance of spatial...... generalized linear mixed model (SGLMM). For inference, we make use of the INLA–SPDE theoretical and computational framework developed by Rue et al. (2009) and Lindgren et al. (2011). The method we propose allows one to quantify the correlation between genotypes and environmental variables. It works...

  5. Analysis of pigmented villonodular synovitis with genome-wide complementary DNA microarray and tissue array technology reveals insight into potential novel therapeutic approaches.

    Science.gov (United States)

    Finis, Katharina; Sültmann, Holger; Ruschhaupt, Markus; Buness, Andreas; Helmchen, Birgit; Kuner, Ruprecht; Gross, Marie-Luise; Fink, Bernd; Schirmacher, Peter; Poustka, Annemarie; Berger, Irina

    2006-03-01

    To characterize the gene expression profile and determine potential diagnostic markers and therapeutic targets in pigmented villonodular synovitis (PVNS). Gene expression patterns in 11 patients with PVNS, 18 patients with rheumatoid arthritis (RA), and 19 patients with osteoarthritis (OA) were investigated using genome-wide complementary DNA microarrays. Validation of differentially expressed genes was performed by real-time quantitative polymerase chain reaction and immunohistochemical analysis on tissue arrays (80 patients with PVNS, 51 patients with RA, and 20 patients with OA). The gene expression profile in PVNS was clearly distinct from those in RA and OA. One hundred forty-one up-regulated genes and 47 down-regulated genes were found in PVNS compared with RA, and 153 up-regulated genes and 89 down-regulated genes were found in PVNS compared with OA (fold change > or = 1.5; Q PVNS were involved in apoptosis regulation, matrix degradation, and inflammation (ALOX5AP, ATP6V1B2, CD53, CHI3L1, CTSL, CXCR4, HSPA8, HSPCA, LAPTM5, MMP9, MOAP1, and SPP1). The gene expression signature in PVNS is similar to that of activated macrophages and is consistent with the local destructive course of the disease. The gene and protein expression patterns suggest that the ongoing proliferation in PVNS is sustained by apoptosis resistance. This result suggests the possibility of a potential novel therapeutic intervention against PVNS.

  6. Comparison on the molecular response profiles between nano zinc oxide (ZnO) particles and free zinc ion using a genome-wide toxicogenomics approach.

    Science.gov (United States)

    Su, Guanyong; Zhang, Xiaowei; Giesy, John P; Musarrat, Javed; Saquib, Quaiser; Alkhedhairy, Abdulaziz A; Yu, Hongxia

    2015-11-01

    Increasing production and applications of nano zinc oxide particles (nano-ZnO) enhances the probability of its exposure in occupational and environmental settings, but toxicity studies are still limited. Taking the free Zn ion (Zn(2+)) as a control, cytotoxicity of a commercially available nano-ZnO was assessed with a 6-h exposure in Escherichia coli (E. coli). The fitted dose-cytotoxicity curve for ZnCl2 was significantly sharper than that from nano-ZnO. Then, a genome-wide gene expression profile following exposure to nano-ZnO was conducted by use of a live cell reporter assay system with library of 1820 modified green fluorescent protein (GFP)-expressing promoter reporter vectors constructed from E. coli K12 strains, which resulted in 387 significantly altered genes in bacterial (p 2) was different from the profile following exposure to 6 mg/L of free zinc ion. The result indicates that these two Zn forms might cause toxicity to bacterial in different modes of action. Our results underscore the importance of understanding the adverse effects elicited by nano-ZnO after entering aquatic environment.

  7. The classification of mRNA expression levels by the phosphorylation state of RNAPII CTD based on a combined genome-wide approach

    Directory of Open Access Journals (Sweden)

    Tachibana Taro

    2011-10-01

    Full Text Available Abstract Background Cellular function is regulated by the balance of stringently regulated amounts of mRNA. Previous reports revealed that RNA polymerase II (RNAPII, which transcribes mRNA, can be classified into the pausing state and the active transcription state according to the phosphorylation state of RPB1, the catalytic subunit of RNAPII. However, genome-wide association between mRNA expression level and the phosphorylation state of RNAPII is unclear. While the functional importance of pausing genes is clear, such as in mouse Embryonic Stem cells for differentiation, understanding this association is critical for distinguishing pausing genes from active transcribing genes in expression profiling data, such as microarrays and RNAseq. Therefore, we examined the correlation between the phosphorylation of RNAPII and mRNA expression levels using a combined analysis by ChIPseq and RNAseq. Results We first performed a precise quantitative measurement of mRNA by performing an optimized calculation in RNAseq. We then visualized the recruitment of various phosphorylated RNAPIIs, such as Ser2P and Ser5P. A combined analysis using optimized RNAseq and ChIPseq for phosphorylated RNAPII revealed that mRNA levels correlate with the various phosphorylation states of RNAPII. Conclusions We demonstrated that the amount of mRNA is precisely reflected by the phased phosphorylation of Ser2 and Ser5. In particular, even the most "pausing" genes, for which only Ser5 is phosphorylated, were detectable at a certain level of mRNA. Our analysis indicated that the complexity of quantitative regulation of mRNA levels could be classified into three categories according to the phosphorylation state of RNAPII.

  8. The classification of mRNA expression levels by the phosphorylation state of RNAPII CTD based on a combined genome-wide approach

    Science.gov (United States)

    2011-01-01

    Background Cellular function is regulated by the balance of stringently regulated amounts of mRNA. Previous reports revealed that RNA polymerase II (RNAPII), which transcribes mRNA, can be classified into the pausing state and the active transcription state according to the phosphorylation state of RPB1, the catalytic subunit of RNAPII. However, genome-wide association between mRNA expression level and the phosphorylation state of RNAPII is unclear. While the functional importance of pausing genes is clear, such as in mouse Embryonic Stem cells for differentiation, understanding this association is critical for distinguishing pausing genes from active transcribing genes in expression profiling data, such as microarrays and RNAseq. Therefore, we examined the correlation between the phosphorylation of RNAPII and mRNA expression levels using a combined analysis by ChIPseq and RNAseq. Results We first performed a precise quantitative measurement of mRNA by performing an optimized calculation in RNAseq. We then visualized the recruitment of various phosphorylated RNAPIIs, such as Ser2P and Ser5P. A combined analysis using optimized RNAseq and ChIPseq for phosphorylated RNAPII revealed that mRNA levels correlate with the various phosphorylation states of RNAPII. Conclusions We demonstrated that the amount of mRNA is precisely reflected by the phased phosphorylation of Ser2 and Ser5. In particular, even the most "pausing" genes, for which only Ser5 is phosphorylated, were detectable at a certain level of mRNA. Our analysis indicated that the complexity of quantitative regulation of mRNA levels could be classified into three categories according to the phosphorylation state of RNAPII. PMID:22011111

  9. A genome-wide association study of hypertension and blood pressure in African Americans.

    Directory of Open Access Journals (Sweden)

    Adebowale Adeyemo

    2009-07-01

    Full Text Available The evidence for the existence of genetic susceptibility variants for the common form of hypertension ("essential hypertension" remains weak and inconsistent. We sought genetic variants underlying blood pressure (BP by conducting a genome-wide association study (GWAS among African Americans, a population group in the United States that is disproportionately affected by hypertension and associated complications, including stroke and kidney diseases. Using a dense panel of over 800,000 SNPs in a discovery sample of 1,017 African Americans from the Washington, D.C., metropolitan region, we identified multiple SNPs reaching genome-wide significance for systolic BP in or near the genes: PMS1, SLC24A4, YWHA7, IPO7, and CACANA1H. Two of these genes, SLC24A4 (a sodium/potassium/calcium exchanger and CACNA1H (a voltage-dependent calcium channel, are potential candidate genes for BP regulation and the latter is a drug target for a class of calcium channel blockers. No variant reached genome wide significance for association with diastolic BP (top scoring SNP rs1867226, p = 5.8 x 10(-7 or with hypertension as a binary trait (top scoring SNP rs9791170, p = 5.1 x 10(-7. We replicated some of the significant SNPs in a sample of West Africans. Pathway analysis revealed that genes harboring top-scoring variants cluster in pathways and networks of biologic relevance to hypertension and BP regulation. This is the first GWAS for hypertension and BP in an African American population. The findings suggests that, in addition to or in lieu of relying solely on replicated variants of moderate-to-large effect reaching genome-wide significance, pathway and network approaches may be useful in identifying and prioritizing candidate genes/loci for further experiments.

  10. Complete genome-wide screening and subtractive genomic approach revealed new virulence factors, potential drug targets against bio-war pathogen Brucella melitensis 16M

    Directory of Open Access Journals (Sweden)

    Pradeepkiran JA

    2015-03-01

    Full Text Available Jangampalli Adi Pradeepkiran,1* Sri Bhashyam Sainath,2,3* Konidala Kranthi Kumar,1 Matcha Bhaskar1 1Division of Animal Biotechnology, Department of Zoology, Sri Venkateswara University, Tirupati, India; 2CIMAR/CIIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, Porto, Portugal, 3Department of Biotechnology, Vikrama Simhapuri University, Nellore, Andhra Pradesh, India *These authors contributed equally to this work Abstract: Brucella melitensis 16M is a Gram-negative coccobacillus that infects both animals and humans. It causes a disease known as brucellosis, which is characterized by acute febrile illness in humans and causes abortions in livestock. To prevent and control brucellosis, identification of putative drug targets is crucial. The present study aimed to identify drug targets in B. melitensis 16M by using a subtractive genomic approach. We used available database repositories (Database of Essential Genes, Kyoto Encyclopedia of Genes and Genomes Automatic Annotation Server, and Kyoto Encyclopedia of Genes and Genomes to identify putative genes that are nonhomologous to humans and essential for pathogen B. melitensis 16M. The results revealed that among 3 Mb genome size of pathogen, 53 putative characterized and 13 uncharacterized hypothetical genes were identified; further, from Basic Local Alignment Search Tool protein analysis, one hypothetical protein showed a close resemblance (50% to Silicibacter pomeroyi DUF1285 family protein (2RE3. A further homology model of the target was constructed using MODELLER 9.12 and optimized through variable target function method by molecular dynamics optimization with simulating annealing. The stereochemical quality of the restrained model was evaluated by PROCHECK, VERIFY-3D, ERRAT, and WHATIF servers. Furthermore, structure-based virtual screening was carried out against the predicted active site of the respective protein using the

  11. Complete genome-wide screening and subtractive genomic approach revealed new virulence factors, potential drug targets against bio-war pathogen Brucella melitensis 16M.

    Science.gov (United States)

    Pradeepkiran, Jangampalli Adi; Sainath, Sri Bhashyam; Kumar, Konidala Kranthi; Bhaskar, Matcha

    2015-01-01

    Brucella melitensis 16M is a Gram-negative coccobacillus that infects both animals and humans. It causes a disease known as brucellosis, which is characterized by acute febrile illness in humans and causes abortions in livestock. To prevent and control brucellosis, identification of putative drug targets is crucial. The present study aimed to identify drug targets in B. melitensis 16M by using a subtractive genomic approach. We used available database repositories (Database of Essential Genes, Kyoto Encyclopedia of Genes and Genomes Automatic Annotation Server, and Kyoto Encyclopedia of Genes and Genomes) to identify putative genes that are nonhomologous to humans and essential for pathogen B. melitensis 16M. The results revealed that among 3 Mb genome size of pathogen, 53 putative characterized and 13 uncharacterized hypothetical genes were identified; further, from Basic Local Alignment Search Tool protein analysis, one hypothetical protein showed a close resemblance (50%) to Silicibacter pomeroyi DUF1285 family protein (2RE3). A further homology model of the target was constructed using MODELLER 9.12 and optimized through variable target function method by molecular dynamics optimization with simulating annealing. The stereochemical quality of the restrained model was evaluated by PROCHECK, VERIFY-3D, ERRAT, and WHATIF servers. Furthermore, structure-based virtual screening was carried out against the predicted active site of the respective protein using the glycerol structural analogs from the PubChem database. We identified five best inhibitors with strong affinities, stable interactions, and also with reliable drug-like properties. Hence, these leads might be used as the most effective inhibitors of modeled protein. The outcome of the present work of virtual screening of putative gene targets might facilitate design of potential drugs for better treatment against brucellosis.

  12. Complete genome-wide screening and subtractive genomic approach revealed new virulence factors, potential drug targets against bio-war pathogen Brucella melitensis 16M

    Science.gov (United States)

    Pradeepkiran, Jangampalli Adi; Sainath, Sri Bhashyam; Kumar, Konidala Kranthi; Bhaskar, Matcha

    2015-01-01

    Brucella melitensis 16M is a Gram-negative coccobacillus that infects both animals and humans. It causes a disease known as brucellosis, which is characterized by acute febrile illness in humans and causes abortions in livestock. To prevent and control brucellosis, identification of putative drug targets is crucial. The present study aimed to identify drug targets in B. melitensis 16M by using a subtractive genomic approach. We used available database repositories (Database of Essential Genes, Kyoto Encyclopedia of Genes and Genomes Automatic Annotation Server, and Kyoto Encyclopedia of Genes and Genomes) to identify putative genes that are nonhomologous to humans and essential for pathogen B. melitensis 16M. The results revealed that among 3 Mb genome size of pathogen, 53 putative characterized and 13 uncharacterized hypothetical genes were identified; further, from Basic Local Alignment Search Tool protein analysis, one hypothetical protein showed a close resemblance (50%) to Silicibacter pomeroyi DUF1285 family protein (2RE3). A further homology model of the target was constructed using MODELLER 9.12 and optimized through variable target function method by molecular dynamics optimization with simulating annealing. The stereochemical quality of the restrained model was evaluated by PROCHECK, VERIFY-3D, ERRAT, and WHATIF servers. Furthermore, structure-based virtual screening was carried out against the predicted active site of the respective protein using the glycerol structural analogs from the PubChem database. We identified five best inhibitors with strong affinities, stable interactions, and also with reliable drug-like properties. Hence, these leads might be used as the most effective inhibitors of modeled protein. The outcome of the present work of virtual screening of putative gene targets might facilitate design of potential drugs for better treatment against brucellosis. PMID:25834405

  13. Mismatch repair genes Mlh1 and Mlh3 modify CAG instability in Huntington's disease mice: genome-wide and candidate approaches.

    Directory of Open Access Journals (Sweden)

    Ricardo Mouro Pinto

    2013-10-01

    Full Text Available The Huntington's disease gene (HTT CAG repeat mutation undergoes somatic expansion that correlates with pathogenesis. Modifiers of somatic expansion may therefore provide routes for therapies targeting the underlying mutation, an approach that is likely applicable to other trinucleotide repeat diseases. Huntington's disease Hdh(Q111 mice exhibit higher levels of somatic HTT CAG expansion on a C57BL/6 genetic background (B6.Hdh(Q111 than on a 129 background (129.Hdh(Q111 . Linkage mapping in (B6x129.Hdh(Q111 F2 intercross animals identified a single quantitative trait locus underlying the strain-specific difference in expansion in the striatum, implicating mismatch repair (MMR gene Mlh1 as the most likely candidate modifier. Crossing B6.Hdh(Q111 mice onto an Mlh1 null background demonstrated that Mlh1 is essential for somatic CAG expansions and that it is an enhancer of nuclear huntingtin accumulation in striatal neurons. Hdh(Q111 somatic expansion was also abolished in mice deficient in the Mlh3 gene, implicating MutLγ (MLH1-MLH3 complex as a key driver of somatic expansion. Strikingly, Mlh1 and Mlh3 genes encoding MMR effector proteins were as critical to somatic expansion as Msh2 and Msh3 genes encoding DNA mismatch recognition complex MutSβ (MSH2-MSH3. The Mlh1 locus is highly polymorphic between B6 and 129 strains. While we were unable to detect any difference in base-base mismatch or short slipped-repeat repair activity between B6 and 129 MLH1 variants, repair efficiency was MLH1 dose-dependent. MLH1 mRNA and protein levels were significantly decreased in 129 mice compared to B6 mice, consistent with a dose-sensitive MLH1-dependent DNA repair mechanism underlying the somatic expansion difference between these strains. Together, these data identify Mlh1 and Mlh3 as novel critical genetic modifiers of HTT CAG instability, point to Mlh1 genetic variation as the likely source of the instability difference in B6 and 129 strains and suggest

  14. Mismatch repair genes Mlh1 and Mlh3 modify CAG instability in Huntington's disease mice: genome-wide and candidate approaches.

    Science.gov (United States)

    Pinto, Ricardo Mouro; Dragileva, Ella; Kirby, Andrew; Lloret, Alejandro; Lopez, Edith; St Claire, Jason; Panigrahi, Gagan B; Hou, Caixia; Holloway, Kim; Gillis, Tammy; Guide, Jolene R; Cohen, Paula E; Li, Guo-Min; Pearson, Christopher E; Daly, Mark J; Wheeler, Vanessa C

    2013-10-01

    The Huntington's disease gene (HTT) CAG repeat mutation undergoes somatic expansion that correlates with pathogenesis. Modifiers of somatic expansion may therefore provide routes for therapies targeting the underlying mutation, an approach that is likely applicable to other trinucleotide repeat diseases. Huntington's disease Hdh(Q111) mice exhibit higher levels of somatic HTT CAG expansion on a C57BL/6 genetic background (B6.Hdh(Q111) ) than on a 129 background (129.Hdh(Q111) ). Linkage mapping in (B6x129).Hdh(Q111) F2 intercross animals identified a single quantitative trait locus underlying the strain-specific difference in expansion in the striatum, implicating mismatch repair (MMR) gene Mlh1 as the most likely candidate modifier. Crossing B6.Hdh(Q111) mice onto an Mlh1 null background demonstrated that Mlh1 is essential for somatic CAG expansions and that it is an enhancer of nuclear huntingtin accumulation in striatal neurons. Hdh(Q111) somatic expansion was also abolished in mice deficient in the Mlh3 gene, implicating MutLγ (MLH1-MLH3) complex as a key driver of somatic expansion. Strikingly, Mlh1 and Mlh3 genes encoding MMR effector proteins were as critical to somatic expansion as Msh2 and Msh3 genes encoding DNA mismatch recognition complex MutSβ (MSH2-MSH3). The Mlh1 locus is highly polymorphic between B6 and 129 strains. While we were unable to detect any difference in base-base mismatch or short slipped-repeat repair activity between B6 and 129 MLH1 variants, repair efficiency was MLH1 dose-dependent. MLH1 mRNA and protein levels were significantly decreased in 129 mice compared to B6 mice, consistent with a dose-sensitive MLH1-dependent DNA repair mechanism underlying the somatic expansion difference between these strains. Together, these data identify Mlh1 and Mlh3 as novel critical genetic modifiers of HTT CAG instability, point to Mlh1 genetic variation as the likely source of the instability difference in B6 and 129 strains and suggest that MLH1

  15. Inbreeding in genome-wide selection

    NARCIS (Netherlands)

    Daetwyler, H.D.; Villanueva, B.; Bijma, P.; Woolliams, J.A.

    2007-01-01

    Traditional selection methods, such as sib and best linear unbiased prediction (BLUP) selection, which increased genetic gain by increasing accuracy of evaluation have also led to an increased rate of inbreeding per generation (¿FG). This is not necessarily the case with genome-wide selection, which

  16. Dating the age of admixture via wavelet transform analysis of genome-wide data

    NARCIS (Netherlands)

    I. Pugach (Irina); R. Matveyev (Rostislav); A. Wollstein (Andreas); M.H. Kayser (Manfred); M. Stoneking (Mark)

    2011-01-01

    textabstractWe describe a PCA-based genome scan approach to analyze genome-wide admixture structure, and introduce wavelet transform analysis as a method for estimating the time of admixture. We test the wavelet transform method with simulations and apply it to genome-wide SNP data from eight admixe

  17. A Genome-Wide Association Search for Type 2 Diabetes Genes in African Americans

    NARCIS (Netherlands)

    Palmer, Nicholette D.; McDonough, Caitrin W.; Hicks, Pamela J.; Roh, Bong H.; Wing, Maria R.; An, S. Sandy; Hester, Jessica M.; Cooke, Jessica N.; Bostrom, Meredith A.; Rudock, Megan E.; Talbert, Matthew E.; Lewis, Joshua P.; Ferrara, Assiamira; Lu, Lingyi; Ziegler, Julie T.; Sale, Michele M.; Divers, Jasmin; Shriner, Daniel; Adeyemo, Adebowale; Rotimi, Charles N.; Ng, Maggie C. Y.; Langefeld, Carl D.; Freedman, Barry I.; Bowden, Donald W.

    2012-01-01

    African Americans are disproportionately affected by type 2 diabetes (T2DM) yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide A

  18. Genome wide copy number analysis of single cells

    Science.gov (United States)

    Baslan, Timour; Kendall, Jude; Rodgers, Linda; Cox, Hilary; Riggs, Mike; Stepansky, Asya; Troge, Jennifer; Ravi, Kandasamy; Esposito, Diane; Lakshmi, B.; Wigler, Michael; Navin, Nicholas; Hicks, James

    2016-01-01

    Summary Copy number variation (CNV) is increasingly recognized as an important contributor to phenotypic variation in health and disease. Most methods for determining CNV rely on admixtures of cells, where information regarding genetic heterogeneity is lost. Here, we present a protocol that allows for the genome wide copy number analysis of single nuclei isolated from mixed populations of cells. Single nucleus sequencing (SNS), combines flow sorting of single nuclei based on DNA content, whole genome amplification (WGA), followed by next generation sequencing to quantize genomic intervals in a genome wide manner. Multiplexing of single cells is discussed. Additionally, we outline informatic approaches that correct for biases inherent in the WGA procedure and allow for accurate determination of copy number profiles. All together, the protocol takes ~3 days from flow cytometry to sequence-ready DNA libraries. PMID:22555242

  19. Genome-wide significant association between alcohol dependence and a variant in the ADH gene cluster.

    Science.gov (United States)

    Frank, Josef; Cichon, Sven; Treutlein, Jens; Ridinger, Monika; Mattheisen, Manuel; Hoffmann, Per; Herms, Stefan; Wodarz, Norbert; Soyka, Michael; Zill, Peter; Maier, Wolfgang; Mössner, Rainald; Gaebel, Wolfgang; Dahmen, Norbert; Scherbaum, Norbert; Schmäl, Christine; Steffens, Michael; Lucae, Susanne; Ising, Marcus; Müller-Myhsok, Bertram; Nöthen, Markus M; Mann, Karl; Kiefer, Falk; Rietschel, Marcella

    2012-01-01

    Alcohol dependence (AD) is an important contributory factor to the global burden of disease. The etiology of AD involves both environmental and genetic factors, and the disorder has a heritability of around 50%. The aim of the present study was to identify susceptibility genes for AD by performing a genome-wide association study (GWAS). The sample comprised 1333 male in-patients with severe AD according to the Diagnostic and Statistical Manual of Mental Disorders, 4th edition, and 2168 controls. These included 487 patients and 1358 controls from a previous GWAS study by our group. All individuals were of German descent. Single-marker tests and a polygenic score-based analysis to assess the combined contribution of multiple markers with small effects were performed. The single nucleotide polymorphism (SNP) rs1789891, which is located between the ADH1B and ADH1C genes, achieved genome-wide significance [P = 1.27E-8, odds ratio (OR) = 1.46]. Other markers from this region were also associated with AD, and conditional analyses indicated that these made a partially independent contribution. The SNP rs1789891 is in complete linkage disequilibrium with the functional Arg272Gln variant (P = 1.24E-7, OR = 1.31) of the ADH1C gene, which has been reported to modify the rate of ethanol oxidation to acetaldehyde in vitro. A polygenic score-based approach produced a significant result (P = 9.66E-9). This is the first GWAS of AD to provide genome-wide significant support for the role of the ADH gene cluster and to suggest a polygenic component to the etiology of AD. The latter result may indicate that many more AD susceptibility genes still await identification.

  20. Meta-analysis of genome-wide association studies of anxiety disorders

    Science.gov (United States)

    Otowa, Takeshi; Hek, Karin; Lee, Minyoung; Byrne, Enda M.; Mirza, Saira S.; Nivard, Michel G.; Bigdeli, Timothy; Aggen, Steven H.; Adkins, Daniel; Wolen, Aaron; Fanous, Ayman; Keller, Matthew C.; Castelao, Enrique; Kutalik, Zoltan; Van der Auwera, Sandra; Homuth, Georg; Nauck, Matthias; Teumer, Alexander; Milaneschi, Yuri; Hottenga, Jouke-Jan; Direk, Nese; Hofman, Albert; Uitterlinden, Andre; Mulder, Cornelis L.; Henders, Anjali K.; Medland, Sarah E.; Gordon, Scott; Heath, Andrew C.; Madden, Pamela A.F.; Pergadia, Michelle; van der Most, Peter J.; Nolte, Ilja M.; van Oort, Floor V.A.; Hartman, Catharina A.; Oldehinkel, Albertine J.; Preisig, Martin; Grabe, Hans Jörgen; Middeldorp, Christel M.; Penninx, Brenda WJH; Boomsma, Dorret; Martin, Nicholas G.; Montgomery, Grant; Maher, Brion S.; van den Oord, Edwin J.; Wray, Naomi R.; Tiemeier, Henning; Hettema, John M.

    2015-01-01

    Anxiety disorders, namely generalized anxiety disorder, panic disorder, and phobias, are common, etiologically complex conditions with a partially genetic basis. Despite differing on diagnostic definitions based upon clinical presentation, anxiety disorders likely represent various expressions of an underlying common diathesis of abnormal regulation of basic threat-response systems. We conducted genome-wide association analyses in nine samples of European ancestry from seven large, independent studies. To identify genetic variants contributing to genetic susceptibility shared across interview-generated DSM-based anxiety disorders, we applied two phenotypic approaches: (1) comparisons between categorical anxiety disorder cases and super-normal controls, and (2) quantitative phenotypic factor scores derived from a multivariate analysis combining information across the clinical phenotypes. We used logistic and linear regression, respectively, to analyze the association between these phenotypes and genome-wide single nucleotide polymorphisms. Meta-analysis for each phenotype combined results across the nine samples for over 18 000 unrelated individuals. Each meta-analysis identified a different genome-wide significant region, with the following markers showing the strongest association: for case-control contrasts, rs1709393 located in an uncharacterized non-coding RNA locus on chromosomal band 3q12.3 (P=1.65×10−8); for factor scores, rs1067327 within CAMKMT encoding the calmodulin-lysine N-methyltransferase on chromosomal band 2p21 (P=2.86×10−9). Independent replication and further exploration of these findings are needed to more fully understand the role of these variants in risk and expression of anxiety disorders. PMID:26754954

  1. Genome-Wide Detection and Analysis of Multifunctional Genes

    Science.gov (United States)

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-01-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  2. Genome-wide identification of enhancer elements.

    Science.gov (United States)

    Tulin, Sarah; Barsi, Julius C; Bocconcelli, Carlo; Smith, Joel

    2016-01-01

    We present a prospective genome-wide regulatory element database for the sea urchin embryo and the modified chromosome capture-related methodology used to create it. The method we developed is termed GRIP-seq for genome-wide regulatory element immunoprecipitation and combines features of chromosome conformation capture, chromatin immunoprecipitation, and paired-end next-generation sequencing with molecular steps that enrich for active cis-regulatory elements associated with basal transcriptional machinery. The first GRIP-seq database, available to the community, comes from S. purpuratus 24 hpf embryos and takes advantage of the extremely well-characterized cis-regulatory elements in this system for validation. In addition, using the GRIP-seq database, we identify and experimentally validate a novel, intronic cis-regulatory element at the onecut locus. We find GRIP-seq signal sensitively identifies active cis-regulatory elements with a high signal-to-noise ratio for both distal and intronic elements. This promising GRIP-seq protocol has the potential to address a rate-limiting step in resolving comprehensive, predictive network models in all systems.

  3. A novel algorithm for simultaneous SNP selection in high-dimensional genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Zuber Verena

    2012-10-01

    Full Text Available Abstract Background Identification of causal SNPs in most genome wide association studies relies on approaches that consider each SNP individually. However, there is a strong correlation structure among SNPs that needs to be taken into account. Hence, increasingly modern computationally expensive regression methods are employed for SNP selection that consider all markers simultaneously and thus incorporate dependencies among SNPs. Results We develop a novel multivariate algorithm for large scale SNP selection using CAR score regression, a promising new approach for prioritizing biomarkers. Specifically, we propose a computationally efficient procedure for shrinkage estimation of CAR scores from high-dimensional data. Subsequently, we conduct a comprehensive comparison study including five advanced regression approaches (boosting, lasso, NEG, MCP, and CAR score and a univariate approach (marginal correlation to determine the effectiveness in finding true causal SNPs. Conclusions Simultaneous SNP selection is a challenging task. We demonstrate that our CAR score-based algorithm consistently outperforms all competing approaches, both uni- and multivariate, in terms of correctly recovered causal SNPs and SNP ranking. An R package implementing the approach as well as R code to reproduce the complete study presented here is available from http://strimmerlab.org/software/care/.

  4. A Bayesian Approach to Learning Scoring Systems.

    Science.gov (United States)

    Ertekin, Şeyda; Rudin, Cynthia

    2015-12-01

    We present a Bayesian method for building scoring systems, which are linear models with coefficients that have very few significant digits. Usually the construction of scoring systems involve manual effort-humans invent the full scoring system without using data, or they choose how logistic regression coefficients should be scaled and rounded to produce a scoring system. These kinds of heuristics lead to suboptimal solutions. Our approach is different in that humans need only specify the prior over what the coefficients should look like, and the scoring system is learned from data. For this approach, we provide a Metropolis-Hastings sampler that tends to pull the coefficient values toward their "natural scale." Empirically, the proposed method achieves a high degree of interpretability of the models while maintaining competitive generalization performances.

  5. Unraveling the Rat Intestine, Spleen and Liver Genome-Wide Transcriptome after the Oral Administration of Lavender Oil by a Two-Color Dye-Swap DNA Microarray Approach.

    Directory of Open Access Journals (Sweden)

    Hiroko Kubo

    Full Text Available The use of lavender oil (LO--a commonly, used oil in aromatherapy, with well-defined volatile components linalool and linalyl acetate--in non-traditional medicine is increasing globally. To understand and demonstrate the potential positive effects of LO on the body, we have established an animal model in this current study, investigating the orally administered LO effects genome wide in the rat small intestine, spleen, and liver. The rats were administered LO at 5 mg/kg (usual therapeutic dose in humans followed by the screening of differentially expressed genes in the tissues, using a 4×44-K whole-genome rat chip (Agilent microarray platform; Agilent Technologies, Palo Alto, CA, USA in conjunction with a dye-swap approach, a novelty of this study. Fourteen days after LO treatment and compared with a control group (sham, a total of 156 and 154 up (≧ 1.5-fold- and down (≦ 0.75-fold-regulated genes, 174 and 66 up- (≧ 1.5-fold- and down (≦ 0.75-fold-regulated genes, and 222 and 322 up- (≧ 1.5-fold- and down (≦ 0.75-fold-regulated genes showed differential expression at the mRNA level in the small intestine, spleen and liver, respectively. The reverse transcription-polymerase chain reaction (RT-PCR validation of highly up- and down-regulated genes confirmed the regulation of the Papd4, Lrp1b, Alb, Cyr61, Cyp2c, and Cxcl1 genes by LO as examples in these tissues. Using bioinformatics, including Ingenuity Pathway Analysis (IPA, differentially expressed genes were functionally categorized by their Gene Ontology (GO and biological function and network analysis, revealing their diverse functions and potential roles in LO-mediated effects in rat. Further IPA analysis in particular unraveled the presence of novel genes, such as Papd4, Or8k5, Gprc5b, Taar5, Trpc6, Pld2 and Onecut3 (up-regulated top molecules and Tnf, Slc45a4, Slc25a23 and Samt4 (down-regulated top molecules, to be influenced by LO treatment in the small intestine, spleen and

  6. Unraveling the Rat Intestine, Spleen and Liver Genome-Wide Transcriptome after the Oral Administration of Lavender Oil by a Two-Color Dye-Swap DNA Microarray Approach.

    Science.gov (United States)

    Kubo, Hiroko; Shibato, Junko; Saito, Tomomi; Ogawa, Tetsuo; Rakwal, Randeep; Shioda, Seiji

    2015-01-01

    The use of lavender oil (LO)--a commonly, used oil in aromatherapy, with well-defined volatile components linalool and linalyl acetate--in non-traditional medicine is increasing globally. To understand and demonstrate the potential positive effects of LO on the body, we have established an animal model in this current study, investigating the orally administered LO effects genome wide in the rat small intestine, spleen, and liver. The rats were administered LO at 5 mg/kg (usual therapeutic dose in humans) followed by the screening of differentially expressed genes in the tissues, using a 4×44-K whole-genome rat chip (Agilent microarray platform; Agilent Technologies, Palo Alto, CA, USA) in conjunction with a dye-swap approach, a novelty of this study. Fourteen days after LO treatment and compared with a control group (sham), a total of 156 and 154 up (≧ 1.5-fold)- and down (≦ 0.75-fold)-regulated genes, 174 and 66 up- (≧ 1.5-fold)- and down (≦ 0.75-fold)-regulated genes, and 222 and 322 up- (≧ 1.5-fold)- and down (≦ 0.75-fold)-regulated genes showed differential expression at the mRNA level in the small intestine, spleen and liver, respectively. The reverse transcription-polymerase chain reaction (RT-PCR) validation of highly up- and down-regulated genes confirmed the regulation of the Papd4, Lrp1b, Alb, Cyr61, Cyp2c, and Cxcl1 genes by LO as examples in these tissues. Using bioinformatics, including Ingenuity Pathway Analysis (IPA), differentially expressed genes were functionally categorized by their Gene Ontology (GO) and biological function and network analysis, revealing their diverse functions and potential roles in LO-mediated effects in rat. Further IPA analysis in particular unraveled the presence of novel genes, such as Papd4, Or8k5, Gprc5b, Taar5, Trpc6, Pld2 and Onecut3 (up-regulated top molecules) and Tnf, Slc45a4, Slc25a23 and Samt4 (down-regulated top molecules), to be influenced by LO treatment in the small intestine, spleen and liver

  7. Genome-wide Analysis of Gene Regulation

    DEFF Research Database (Denmark)

    Chen, Yun

    cells are capable of regulating their gene expression, so that each cell can only express a particular set of genes yielding limited numbers of proteins with specialized functions. Therefore a rigid control of differential gene expression is necessary for cellular diversity. On the other hand, aberrant...... gene regulation will disrupt the cell’s fundamental processes, which in turn can cause disease. Hence, understanding gene regulation is essential for deciphering the code of life. Along with the development of high throughput sequencing (HTS) technology and the subsequent large-scale data analysis......, genome-wide assays have increased our understanding of gene regulation significantly. This thesis describes the integration and analysis of HTS data across different important aspects of gene regulation. Gene expression can be regulated at different stages when the genetic information is passed from gene...

  8. Genetic link between family socioeconomic status and children's educational achievement estimated from genome-wide SNPs.

    Science.gov (United States)

    Krapohl, E; Plomin, R

    2016-03-01

    One of the best predictors of children's educational achievement is their family's socioeconomic status (SES), but the degree to which this association is genetically mediated remains unclear. For 3000 UK-representative unrelated children we found that genome-wide single-nucleotide polymorphisms could explain a third of the variance of scores on an age-16 UK national examination of educational achievement and half of the correlation between their scores and family SES. Moreover, genome-wide polygenic scores based on a previously published genome-wide association meta-analysis of total number of years in education accounted for ~3.0% variance in educational achievement and ~2.5% in family SES. This study provides the first molecular evidence for substantial genetic influence on differences in children's educational achievement and its association with family SES.

  9. Mapping the sensory perception of apple using descriptive sensory evaluation in a genome wide association study.

    Science.gov (United States)

    Amyotte, Beatrice; Bowen, Amy J; Banks, Travis; Rajcan, Istvan; Somers, Daryl J

    2017-01-01

    Breeding apples is a long-term endeavour and it is imperative that new cultivars are selected to have outstanding consumer appeal. This study has taken the approach of merging sensory science with genome wide association analyses in order to map the human perception of apple flavour and texture onto the apple genome. The goal was to identify genomic associations that could be used in breeding apples for improved fruit quality. A collection of 85 apple cultivars was examined over two years through descriptive sensory evaluation by a trained sensory panel. The trained sensory panel scored randomized sliced samples of each apple cultivar for seventeen taste, flavour and texture attributes using controlled sensory evaluation practices. In addition, the apple collection was subjected to genotyping by sequencing for marker discovery. A genome wide association analysis suggested significant genomic associations for several sensory traits including juiciness, crispness, mealiness and fresh green apple flavour. The findings include previously unreported genomic regions that could be used in apple breeding and suggest that similar sensory association mapping methods could be applied in other plants.

  10. Genome-wide association study of smoking initiation and current smoking

    DEFF Research Database (Denmark)

    Vink, Jacqueline M; Smit, August B; de Geus, Eco J C;

    2009-01-01

    For the identification of genes associated with smoking initiation and current smoking, genome-wide association analyses were carried out in 3497 subjects. Significant genes that replicated in three independent samples (n = 405, 5810, and 1648) were visualized into a biologically meaningful network......) and cell-adhesion molecules (e.g., CDH23). We conclude that a network-based genome-wide association approach can identify genes influencing smoking behavior....

  11. Chapter 10: Mining genome-wide genetic markers.

    Directory of Open Access Journals (Sweden)

    Xiang Zhang

    Full Text Available Genome-wide association study (GWAS aims to discover genetic factors underlying phenotypic traits. The large number of genetic factors poses both computational and statistical challenges. Various computational approaches have been developed for large scale GWAS. In this chapter, we will discuss several widely used computational approaches in GWAS. The following topics will be covered: (1 An introduction to the background of GWAS. (2 The existing computational approaches that are widely used in GWAS. This will cover single-locus, epistasis detection, and machine learning methods that have been recently developed in biology, statistic, and computer science communities. This part will be the main focus of this chapter. (3 The limitations of current approaches and future directions.

  12. Genome-wide analysis correlates Ayurveda Prakriti.

    Science.gov (United States)

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K; Prasanna, B V; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S; Dedge, Amrish P; Bharadwaj, Ramachandra; Gangadharan, G G; Nair, Sreekumaran; Gopinath, Puthiya M; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-10-29

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as "Prakriti". To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10(-5)) were significantly different between Prakritis, without any confounding effect of stratification, after 10(6) permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India's traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine.

  13. Planning and executing a genome wide association study (GWAS).

    Science.gov (United States)

    Sale, Michèle M; Mychaleckyj, Josyf C; Chen, Wei-Min

    2009-01-01

    In recent years, genome-wide association approaches have proven a powerful and successful strategy to identify genetic contributors to complex traits, including a number of endocrine disorders. Their success has meant that genome wide association studies (GWAS) are fast becoming the default study design for discovery of new genetic variants that influence a clinical trait or phenotype. This chapter focuses on a number of key elements that require consideration for the successful conduct of a GWAS. Although many of the considerations are common to any genetic study, the greater cost, extreme multiple testing, and greater openness to data sharing require specific awareness and planning by investigators. In the section on designing a GWAS, we reflect on ethical considerations, study design, selection of phenotype/s, power considerations, sample tracking and storage issues, and genotyping product selection. During execution, important considerations include DNA quantity and preparation, genotyping methods, quality control checks of genotype data, in silico genotyping (imputation), tests of association, and replication of association signals. Although the field of human genetics is rapidly evolving, recent experiences can help guide an investigator in making practical and methodological choices that will eventually determine the overall quality of GWAS results. Given the investment to recruit patient populations or cohorts that are powered for a GWAS, and the still substantial costs associated with genotyping, it is helpful to be aware of these aspects to maximize the likelihood of success, especially where there is an opportunity for implementing them prospectively.

  14. Genome-wide mapping of DNA methylation in chicken.

    Directory of Open Access Journals (Sweden)

    Qinghe Li

    Full Text Available Cytosine DNA methylation is an important epigenetic modification termed as the fifth base that functions in diverse processes. Till now, the genome-wide DNA methylation maps of many organisms has been reported, such as human, Arabidopsis, rice and silkworm, but the methylation pattern of bird remains rarely studied. Here we show the genome-wide DNA methylation map of bird, using the chicken as a model organism and an immunocapturing approach followed by high-throughput sequencing. In both of the red jungle fowl and the avian broiler, DNA methylation was described separately for the liver and muscle tissue. Generally, chicken displays analogous methylation pattern with that of animals and plants. DNA methylation is enriched in the gene body regions and the repetitive sequences, and depleted in the transcription start site (TSS and the transcription termination site (TTS. Most of the CpG islands in the chicken genome are kept in unmethylated state. Promoter methylation is negatively correlated with the gene expression level, indicating its suppressive role in regulating gene transcription. This work contributes to our understanding of epigenetics in birds.

  15. Genome-Wide Scan for Methylation Profiles in Keloids

    Directory of Open Access Journals (Sweden)

    Lamont R. Jones

    2015-01-01

    Full Text Available Keloids are benign fibroproliferative tumors of the skin which commonly occur after injury mainly in darker skinned patients. Medical treatment is fraught with high recurrence rates mainly because of an incomplete understanding of the biological mechanisms that lead to keloids. The purpose of this project was to examine keloid pathogenesis from the epigenome perspective of DNA methylation. Genome-wide profiling used the Infinium HumanMethylation450 BeadChip to interrogate DNA from 6 fresh keloid and 6 normal skin samples from 12 anonymous donors. A 3-tiered approach was used to call out genes most differentially methylated between keloid and normal. When compared to normal, of the 685 differentially methylated CpGs at Tier 3, 510 were hypomethylated and 175 were hypermethylated with 190 CpGs in promoter and 495 in nonpromoter regions. The 190 promoter region CpGs corresponded to 152 genes: 96 (63% were hypomethylated and 56 (37% hypermethylated. This exploratory genome-wide scan of the keloid methylome highlights a predominance of hypomethylated genomic landscapes, favoring nonpromoter regions. DNA methylation, as an additional mechanism for gene regulation in keloid pathogenesis, holds potential for novel treatments that reverse deleterious epigenetic changes. As an alternative mechanism for regulating genes, epigenetics may explain why gene mutations alone do not provide definitive mechanisms for keloid formation.

  16. Genome-wide search for strabismus susceptibility loci.

    Directory of Open Access Journals (Sweden)

    Fujiwara H

    2003-06-01

    Full Text Available The purpose of this study was to search for chromosomal susceptibility loci for comitant strabismus. Genomic DNA was isolated from 10mL blood taken from each member of 30 nuclear families in which 2 or more siblings are affected by either esotropia or exotropia. A genome-wide search was performed with amplification by polymerase chain reaction of 400 markers in microsatellite regions with approximately 10 cM resolution. For each locus, non-parametric affected sib-pair analysis and non-parametric linkage analysis for multiple pedigrees (Genehunter software, http://linkage.rockefeller.edu/soft/ were used to calculate multipoint lod scores and non-parametric linkage (NPL scores, respectively. In sib-pair analysis, lod scores showed basically flat lines with several peaks of 0.25 on all chromosomes. In non-parametric linkage analysis for multiple pedigrees, NPL scores showed one peak as high as 1.34 on chromosomes 1, 2, 4, 7, 10, 15, and 16, while 2 such peaks were found on chromosomes 3, 9, 11, 12, 18, and 20. Non-parametric linkage analysis for multiple pedigrees of 30 families with comitant strabismus suggested a number of chromosomal susceptibility loci. Our ongoing study involving a larger number of families will refine the accuracy of statistical analysis to pinpoint susceptibility loci for comitant strabismus.

  17. Accounting for Linkage Disequilibrium in genome scans for selection without individual genotypes: the local score approach.

    Science.gov (United States)

    Fariello, María Inés; Boitard, Simon; Mercier, Sabine; Robelin, David; Faraut, Thomas; Arnould, Cécile; Recoquillay, Julien; Bouchez, Olivier; Salin, Gérald; Dehais, Patrice; Gourichon, David; Leroux, Sophie; Pitel, Frédérique; Leterrier, Christine; SanCristobal, Magali

    2017-04-10

    Detecting genomic footprints of selection is an important step in the understanding of evolution. Accounting for linkage disequilibrium in genome scans increases detection power, but haplotype-based methods require individual genotypes and are not applicable on pool-sequenced samples. We propose to take advantage of the local score approach to account for linkage disequilibrium in genome scans for selection, cumulating (possibly small) signals from single markers over a genomic segment, to clearly pinpoint a selection signal. Using computer simulations, we demonstrate that this approach detects selection with higher power than several state-of-the-art single marker, windowing or haplotype-based approaches. We illustrate this on two benchmark data sets including individual genotypes, for which we obtain similar results with the local score and one haplotype-based approach. Finally, we apply the local score approach to Pool-Seq data obtained from a divergent selection experiment on behavior in quail, and obtain precise and biologically coherent selection signals: while competing methods fail to highlight any clear selection signature, our method detects several regions involving genes known to act on social responsiveness or autistic traits. Although we focus here on the detection of positive selection from multiple population data, the local score approach is general and can be applied to other genome scans for selection or other genome-wide analyses such as GWAS. This article is protected by copyright. All rights reserved.

  18. Genome-wide association analysis of imputed rare variants: application to seven common complex diseases.

    Science.gov (United States)

    Mägi, Reedik; Asimit, Jennifer L; Day-Williams, Aaron G; Zeggini, Eleftheria; Morris, Andrew P

    2012-12-01

    Genome-wide association studies have been successful in identifying loci contributing effects to a range of complex human traits. The majority of reproducible associations within these loci are with common variants, each of modest effect, which together explain only a small proportion of heritability. It has been suggested that much of the unexplained genetic component of complex traits can thus be attributed to rare variation. However, genome-wide association study genotyping chips have been designed primarily to capture common variation, and thus are underpowered to detect the effects of rare variants. Nevertheless, we demonstrate here, by simulation, that imputation from an existing scaffold of genome-wide genotype data up to high-density reference panels has the potential to identify rare variant associations with complex traits, without the need for costly re-sequencing experiments. By application of this approach to genome-wide association studies of seven common complex diseases, imputed up to publicly available reference panels, we identify genome-wide significant evidence of rare variant association in PRDM10 with coronary artery disease and multiple genes in the major histocompatibility complex (MHC) with type 1 diabetes. The results of our analyses highlight that genome-wide association studies have the potential to offer an exciting opportunity for gene discovery through association with rare variants, conceivably leading to substantial advancements in our understanding of the genetic architecture underlying complex human traits.

  19. Genome wide selection in Citrus breeding.

    Science.gov (United States)

    Gois, I B; Borém, A; Cristofani-Yaly, M; de Resende, M D V; Azevedo, C F; Bastianel, M; Novelli, V M; Machado, M A

    2016-10-17

    Genome wide selection (GWS) is essential for the genetic improvement of perennial species such as Citrus because of its ability to increase gain per unit time and to enable the efficient selection of characteristics with low heritability. This study assessed GWS efficiency in a population of Citrus and compared it with selection based on phenotypic data. A total of 180 individual trees from a cross between Pera sweet orange (Citrus sinensis Osbeck) and Murcott tangor (Citrus sinensis Osbeck x Citrus reticulata Blanco) were evaluated for 10 characteristics related to fruit quality. The hybrids were genotyped using 5287 DArT_seq(TM) (diversity arrays technology) molecular markers and their effects on phenotypes were predicted using the random regression - best linear unbiased predictor (rr-BLUP) method. The predictive ability, prediction bias, and accuracy of GWS were estimated to verify its effectiveness for phenotype prediction. The proportion of genetic variance explained by the markers was also computed. The heritability of the traits, as determined by markers, was 16-28%. The predictive ability of these markers ranged from 0.53 to 0.64, and the regression coefficients between predicted and observed phenotypes were close to unity. Over 35% of the genetic variance was accounted for by the markers. Accuracy estimates with GWS were lower than those obtained by phenotypic analysis; however, GWS was superior in terms of genetic gain per unit time. Thus, GWS may be useful for Citrus breeding as it can predict phenotypes early and accurately, and reduce the length of the selection cycle. This study demonstrates the feasibility of genomic selection in Citrus.

  20. Genome-wide association studies in pharmacogenomics of antidepressants.

    Science.gov (United States)

    Lin, Eugene; Lane, Hsien-Yuan

    2015-01-01

    Major depressive disorder (MDD) is one of the most common psychiatric disorders worldwide. Doctors must prescribe antidepressants based on educated guesses due to the fact that it is unmanageable to predict the effectiveness of any particular antidepressant in an individual patient. With the recent advent of scientific research, the genome-wide association study (GWAS) is extensively employed to analyze hundreds of thousands of single nucleotide polymorphisms by high-throughput genotyping technologies. In addition to the candidate-gene approach, the GWAS approach has recently been utilized to investigate the determinants of antidepressant response to therapy. In this study, we reviewed GWAS studies, their limitations and future directions with respect to the pharmacogenomics of antidepressants in MDD.

  1. Genome-Wide Association Study of Coronary Artery Disease

    Directory of Open Access Journals (Sweden)

    Naomi Ogawa

    2010-01-01

    Full Text Available Coronary artery disease (CAD is a multifactorial disease with environmental and genetic determinants. The genetic determinants of CAD have previously been explored by the candidate gene approach. Recently, the data from the International HapMap Project and the development of dense genotyping chips have enabled us to perform genome-wide association studies (GWAS on a large number of subjects without bias towards any particular candidate genes. In 2007, three chip-based GWAS simultaneously revealed the significant association between common variants on chromosome 9p21 and CAD. This association was replicated among other ethnic groups and also in a meta-analysis. Further investigations have detected several other candidate loci associated with CAD. The chip-based GWAS approach has identified novel and unbiased genetic determinants of CAD and these insights provide the important direction to better understand the pathogenesis of CAD and to develop new and improved preventive measures and treatments for CAD.

  2. Genome-wide Association Study of Postburn Scarring Identifies a Novel Protective Variant.

    Science.gov (United States)

    Sood, Ravi F; Hocking, Anne M; Muffley, Lara A; Ga, Maricar; Honari, Shari; Reiner, Alexander P; Gibran, Nicole S

    2015-10-01

    To identify genetic variants associated with the severity of postburn hypertrophic scarring (HTS) using a genome-wide approach. Risk of severe postburn HTS is known to depend on race, but the genetic determinants of HTS are unknown. We conducted a genome-wide association study (GWAS) in a prospective cohort of adults admitted with deep-partial-thickness burns from 2007 through 2014. Scar severity was assessed over time using the Vancouver Scar Scale (VSS), and DNA was genotyped with a >500,000-marker array. We performed association testing of single-nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) >0.01 using linear regression of VSS height score on genotype adjusted for patient and injury characteristics as well as population genetic structure. Array-wide significance was based on Bonferroni correction for multiple testing. Of 538 patients (median age 40 years, median burn size 6.0% of body surface area), 71% were men and 76% were White. The mean VSS height score was 1.2 (range: 0-3). Of 289,639 SNPs tested, a variant in the CUB and Sushi multiple domains 1 (CSMD1) gene (rs11136645; MAF = 0.49), was significantly associated with decreased scar height (regression coefficient = -0.23, P = 7.9 × 10). In the first published GWAS of HTS, we report that a common intronic variant in the CSMD1 gene is associated with reduced severity of postburn HTS. If this association is confirmed in an independent cohort, investigating the potential role of CSMD1 in wound healing may elucidate HTS pathophysiology.

  3. Genome-Wide Association Study of Intelligence: Additive Effects of Novel Brain Expressed Genes

    Science.gov (United States)

    Loo, Sandra K.; Shtir, Corina; Doyle, Alysa E.; Mick, Eric; McGough, James J.; McCracken, James; Biederman, Joseph; Smalley, Susan L.; Cantor, Rita M.; Faraone, Stephen V.; Nelson, Stanley F.

    2012-01-01

    Objective: The purpose of the present study was to identify common genetic variants that are associated with human intelligence or general cognitive ability. Method: We performed a genome-wide association analysis with a dense set of 1 million single-nucleotide polymorphisms (SNPs) and quantitative intelligence scores within an ancestrally…

  4. Genome-Wide Association Study of Intelligence: Additive Effects of Novel Brain Expressed Genes

    Science.gov (United States)

    Loo, Sandra K.; Shtir, Corina; Doyle, Alysa E.; Mick, Eric; McGough, James J.; McCracken, James; Biederman, Joseph; Smalley, Susan L.; Cantor, Rita M.; Faraone, Stephen V.; Nelson, Stanley F.

    2012-01-01

    Objective: The purpose of the present study was to identify common genetic variants that are associated with human intelligence or general cognitive ability. Method: We performed a genome-wide association analysis with a dense set of 1 million single-nucleotide polymorphisms (SNPs) and quantitative intelligence scores within an ancestrally…

  5. Genome-wide association analysis identifies three new susceptibility loci for childhood body mass index

    DEFF Research Database (Denmark)

    Felix, Janine F; Bradfield, Jonathan P; Monnereau, Claire;

    2016-01-01

    A large number of genetic loci are associated with adult body mass index. However, the genetics of childhood body mass index are largely unknown. We performed a meta-analysis of genome-wide association studies of childhood body mass index, using sex- and age-adjusted standard deviation scores. We...

  6. Genome-wide association analysis identifies three new susceptibility loci for childhood body mass index

    NARCIS (Netherlands)

    Felix, Janine F; Bradfield, Jonathan P; Monnereau, Claire; van der Valk, Ralf J P; Stergiakouli, Evie; Chesi, Alessandra; Gaillard, Romy; Feenstra, Bjarke; Thiering, Elisabeth; Kreiner-Møller, Eskil; Mahajan, Anubha; Pitkänen, Niina; Joro, Raimo; Cavadino, Alana; Huikari, Ville; Franks, Steve; Groen-Blokhuis, Maria M; Cousminer, Diana L; Marsh, Julie A; Lehtimäki, Terho; Curtin, John A; Vioque, Jesus; Ahluwalia, Tarunveer S; Myhre, Ronny; Price, Thomas S; Vilor-Tejedor, Natalia; Yengo, Loïc; Grarup, Niels; Ntalla, Ioanna; Ang, Wei; Atalay, Mustafa; Bisgaard, Hans; Blakemore, Alexandra I; Bonnefond, Amelie; Carstensen, Lisbeth; Eriksson, Johan; Flexeder, Claudia; Franke, Lude; Geller, Frank; Geserick, Mandy; Hartikainen, Anna-Liisa; Haworth, Claire M A; Hirschhorn, Joel N; Hofman, Albert; Holm, Jens-Christian; Horikoshi, Momoko; Hottenga, Jouke Jan; Huang, Jinyan; Kadarmideen, Haja N; Kähönen, Mika; Kiess, Wieland; Lakka, Hanna-Maaria; Lakka, Timo A; Lewin, Alexandra M; Liang, Liming; Lyytikäinen, Leo-Pekka; Ma, Baoshan; Magnus, Per; McCormack, Shana E; McMahon, George; Mentch, Frank D; Middeldorp, Christel M; Murray, Clare S; Pahkala, Katja; Pers, Tune H; Pfäffle, Roland; Postma, Dirkje S; Power, Christine; Simpson, Angela; Sengpiel, Verena; Tiesler, Carla M T; Torrent, Maties; Uitterlinden, André G; van Meurs, Joyce B; Vinding, Rebecca; Waage, Johannes; Wardle, Jane; Zeggini, Eleftheria; Zemel, Babette S; Dedoussis, George V; Pedersen, Oluf; Froguel, Philippe; Sunyer, Jordi; Plomin, Robert; Jacobsson, Bo; Hansen, Torben; Gonzalez, Juan R; Custovic, Adnan; Raitakari, Olli T; Pennell, Craig E; Widén, Elisabeth; Boomsma, Dorret I; Koppelman, Gerard H; Sebert, Sylvain; Järvelin, Marjo-Riitta; Hyppönen, Elina; McCarthy, Mark I; Lindi, Virpi; Harri, Niinikoski; Körner, Antje; Bønnelykke, Klaus; Heinrich, Joachim; Melbye, Mads; Rivadeneira, Fernando; Hakonarson, Hakon; Ring, Susan M; Smith, George Davey; Sørensen, Thorkild I A; Timpson, Nicholas J; Grant, Struan F A; Jaddoe, Vincent W V

    2015-01-01

    A large number of genetic loci are associated with adult body mass index. However, the genetics of childhood body mass index are largely unknown. We performed a meta-analysis of genome-wide association studies of childhood body mass index, using sex- and age-adjusted standard deviation scores. We in

  7. Genome-wide association analysis identifies three new susceptibility loci for childhood body mass index

    NARCIS (Netherlands)

    Felix, Janine F.; Bradfield, Jonathan P.; Monnereau, Claire; van der Valk, Ralf J. P.; Stergiakouli, Evie; Chesi, Alessandra; Gaillard, Romy; Feenstra, Bjarke; Thiering, Elisabeth; Kreiner-Moller, Eskil; Mahajan, Anubha; Pitkanen, Niina; Joro, Raimo; Cavadino, Alana; Huikari, Ville; Franks, Steve; Groen-Blokhuis, Maria M.; Cousminer, Diana L.; Marsh, Julie A.; Lehtimaki, Terho; Curtin, John A.; Vioque, Jesus; Ahluwalia, Tarunveer S.; Myhre, Ronny; Price, Thomas S.; Vilor-Tejedor, Natalia; Yengo, Loic; Grarup, Niels; Ntalla, Ioanna; Ang, Wei; Atalay, Mustafa; Bisgaard, Hans; Blakemore, Alexandra I.; Bonnefond, Amelie; Carstensen, Lisbeth; Eriksson, Johan; Flexeder, Claudia; Franke, Lude; Geller, Frank; Geserick, Mandy; Hartikainen, Anna-Liisa; Haworth, Claire M. A.; Hirschhorn, Joel N.; Hofman, Albert; Holm, Jens-Christian; Horikoshi, Momoko; Hottenga, Jouke Jan; Huang, Jinyan; Kadarmideen, Haja N.; Kahonen, Mika; Kiess, Wieland; Lakka, Hanna-Maaria; Lakka, Timo A.; Lewin, Alexandra M.; Liang, Liming; Lyytikainen, Leo-Pekka; Ma, Baoshan; Magnus, Per; McCormack, Shana E.; McMahon, George; Mentch, Frank D.; Middeldorp, Christel M.; Murray, Clare S.; Pahkala, Katja; Pers, Tune H.; Pfaefle, Roland; Postma, Dirkje S.; Power, Christine; Simpson, Angela; Sengpiel, Verena; Tiesler, Carla M. T.; Torrent, Maties; Uitterlinden, Andre G.; van Meurs, Joyce B.; Vinding, Rebecca; Waage, Johannes; Wardle, Jane; Zeggini, Eleftheria; Zemel, Babette S.; Dedoussis, George V.; Pedersen, Oluf; Froguel, Philippe; Sunyer, Jordi; Plomin, Robert; Jacobsson, Bo; Hansen, Torben; Gonzalez, Juan R.; Custovic, Adnan; Raitakari, Olli T.; Pennell, Craig E.; Widen, Elisabeth; Boomsma, Dorret I.; Koppelman, Gerard H.; Sebert, Sylvain; Jarvelin, Marjo-Riitta; Hypponen, Elina; McCarthy, Mark I.; Lindi, Virpi; Harri, Niinikoski; Koerner, Antje; Bonnelykke, Klaus; Heinrich, Joachim; Melbye, Mads; Rivadeneira, Fernando; Hakonarson, Hakon; Ring, Susan M.; Smith, George Davey; Sorensen, Thorkild I. A.; Timpson, Nicholas J.; Grant, Struan F. A.; Jaddoe, Vincent W. V.

    2016-01-01

    A large number of genetic loci are associated with adult body mass index. However, the genetics of childhood body mass index are largely unknown. We performed a meta-analysis of genome-wide association studies of childhood body mass index, using sex-and age-adjusted standard deviation scores. We

  8. High-resolution genome-wide mapping of histone modifications.

    Science.gov (United States)

    Roh, Tae-young; Ngau, Wing Chi; Cui, Kairong; Landsman, David; Zhao, Keji

    2004-08-01

    The expression patterns of eukaryotic genomes are controlled by their chromatin structure, consisting of nucleosome subunits in which DNA of approximately 146 bp is wrapped around a core of 8 histone molecules. Post-translational histone modifications play an essential role in modifying chromatin structure. Here we apply a combination of SAGE and chromatin immunoprecipitation (ChIP) protocols to determine the distribution of hyperacetylated histones H3 and H4 in the Saccharomyces cerevisiae genome. We call this approach genome-wide mapping technique (GMAT). Using GMAT, we find that the highest acetylation levels are detected in the 5' end of a gene's coding region, but not in the promoter. Furthermore, we show that the histone acetyltransferase, GCN5p, regulates H3 acetylation in the promoter and 5' end of the coding regions. These findings indicate that GMAT should find valuable applications in mapping target sites of chromatin-modifying enzymes.

  9. DNA Break Mapping Reveals Topoisomerase II Activity Genome-Wide

    Directory of Open Access Journals (Sweden)

    Laura Baranello

    2014-07-01

    Full Text Available Genomic DNA is under constant assault by endogenous and exogenous DNA damaging agents. DNA breakage can represent a major threat to genome integrity but can also be necessary for genome function. Here we present approaches to map DNA double-strand breaks (DSBs and single-strand breaks (SSBs at the genome-wide scale by two methods called DSB- and SSB-Seq, respectively. We tested these methods in human colon cancer cells and validated the results using the Topoisomerase II (Top2-poisoning agent etoposide (ETO. Our results show that the combination of ETO treatment with break-mapping techniques is a powerful method to elaborate the pattern of Top2 enzymatic activity across the genome.

  10. Genome-wide Pleiotropy Between Parkinson Disease and Autoimmune Diseases.

    Science.gov (United States)

    Witoelar, Aree; Jansen, Iris E; Wang, Yunpeng; Desikan, Rahul S; Gibbs, J Raphael; Blauwendraat, Cornelis; Thompson, Wesley K; Hernandez, Dena G; Djurovic, Srdjan; Schork, Andrew J; Bettella, Francesco; Ellinghaus, David; Franke, Andre; Lie, Benedicte A; McEvoy, Linda K; Karlsen, Tom H; Lesage, Suzanne; Morris, Huw R; Brice, Alexis; Wood, Nicholas W; Heutink, Peter; Hardy, John; Singleton, Andrew B; Dale, Anders M; Gasser, Thomas; Andreassen, Ole A; Sharma, Manu

    2017-07-01

    Recent genome-wide association studies (GWAS) and pathway analyses supported long-standing observations of an association between immune-mediated diseases and Parkinson disease (PD). The post-GWAS era provides an opportunity for cross-phenotype analyses between different complex phenotypes. To test the hypothesis that there are common genetic risk variants conveying risk of both PD and autoimmune diseases (ie, pleiotropy) and to identify new shared genetic variants and their pathways by applying a novel statistical framework in a genome-wide approach. Using the conjunction false discovery rate method, this study analyzed GWAS data from a selection of archetypal autoimmune diseases among 138 511 individuals of European ancestry and systemically investigated pleiotropy between PD and type 1 diabetes, Crohn disease, ulcerative colitis, rheumatoid arthritis, celiac disease, psoriasis, and multiple sclerosis. NeuroX data (6927 PD cases and 6108 controls) were used for replication. The study investigated the biological correlation between the top loci through protein-protein interaction and changes in the gene expression and methylation levels. The dates of the analysis were June 10, 2015, to March 4, 2017. The primary outcome was a list of novel loci and their pathways involved in PD and autoimmune diseases. Genome-wide conjunctional analysis identified 17 novel loci at false discovery rate less than 0.05 with overlap between PD and autoimmune diseases, including known PD loci adjacent to GAK, HLA-DRB5, LRRK2, and MAPT for rheumatoid arthritis, ulcerative colitis and Crohn disease. Replication confirmed the involvement of HLA, LRRK2, MAPT, TRIM10, and SETD1A in PD. Among the novel genes discovered, WNT3, KANSL1, CRHR1, BOLA2, and GUCY1A3 are within a protein-protein interaction network with known PD genes. A subset of novel loci was significantly associated with changes in methylation or expression levels of adjacent genes. The study findings provide novel mechanistic

  11. SmileFinder: a resampling-based approach to evaluate signatures of selection from genome-wide sets of matching allele frequency data in two or more diploid populations.

    Science.gov (United States)

    Guiblet, Wilfried M; Zhao, Kai; O'Brien, Stephen J; Massey, Steven E; Roca, Alfred L; Oleksyk, Taras K

    2015-01-01

    Adaptive alleles may rise in frequency as a consequence of positive selection, creating a pattern of decreased variation in the neighboring loci, known as a selective sweep. When the region containing this pattern is compared to another population with no history of selection, a rise in variance of allele frequencies between populations is observed. One challenge presented by large genome-wide datasets is the ability to differentiate between patterns that are remnants of natural selection from those expected to arise at random and/or as a consequence of selectively neutral demographic forces acting in the population. SmileFinder is a simple program that looks for diversity and divergence patterns consistent with selection sweeps by evaluating allele frequencies in windows, including neighboring loci from two or more populations of a diploid species against the genome-wide neutral expectation. The program calculates the mean of heterozygosity and FST in a set of sliding windows of incrementally increasing sizes, and then builds a resampled distribution (the baseline) of random multi-locus sets matched to the sizes of sliding windows, using an unrestricted sampling. Percentiles of the values in the sliding windows are derived from the superimposed resampled distribution. The resampling can easily be scaled from 1 K to 100 M; the higher the number, the more precise the percentiles ascribed to the extreme observed values. The output from SmileFinder can be used to plot percentile values to look for population diversity and divergence patterns that may suggest past actions of positive selection along chromosome maps, and to compare lists of suspected candidate genes under random gene sets to test for the overrepresentation of these patterns among gene categories. Both applications of the algorithm have already been used in published studies. Here we present a publicly available, open source program that will serve as a useful tool for preliminary scans of selection

  12. Genome-wide association study reveals greater polygenic loading for schizophrenia in cases with a family history of illness.

    Science.gov (United States)

    Bigdeli, Tim B; Ripke, Stephan; Bacanu, Silviu-Alin; Lee, Sang Hong; Wray, Naomi R; Gejman, Pablo V; Rietschel, Marcella; Cichon, Sven; St Clair, David; Corvin, Aiden; Kirov, George; McQuillin, Andrew; Gurling, Hugh; Rujescu, Dan; Andreassen, Ole A; Werge, Thomas; Blackwood, Douglas H R; Pato, Carlos N; Pato, Michele T; Malhotra, Anil K; O'Donovan, Michael C; Kendler, Kenneth S; Fanous, Ayman H

    2016-03-01

    Genome-wide association studies (GWAS) of schizophrenia have yielded more than 100 common susceptibility variants, and strongly support a substantial polygenic contribution of a large number of small allelic effects. It has been hypothesized that familial schizophrenia is largely a consequence of inherited rather than environmental factors. We investigated the extent to which familiality of schizophrenia is associated with enrichment for common risk variants detectable in a large GWAS. We analyzed single nucleotide polymorphism (SNP) data for cases reporting a family history of psychotic illness (N = 978), cases reporting no such family history (N = 4,503), and unscreened controls (N = 8,285) from the Psychiatric Genomics Consortium (PGC1) study of schizophrenia. We used a multinomial logistic regression approach with model-fitting to detect allelic effects specific to either family history subgroup. We also considered a polygenic model, in which we tested whether family history positive subjects carried more schizophrenia risk alleles than family history negative subjects, on average. Several individual SNPs attained suggestive but not genome-wide significant association with either family history subgroup. Comparison of genome-wide polygenic risk scores based on GWAS summary statistics indicated a significant enrichment for SNP effects among family history positive compared to family history negative cases (Nagelkerke's R(2 ) = 0.0021; P = 0.00331; P-value threshold history positive compared to family history negative cases (0.32 and 0.22, respectively; P = 0.031). We found suggestive evidence of allelic effects detectable in large GWAS of schizophrenia that might be specific to particular family history subgroups. However, consideration of a polygenic risk score indicated a significant enrichment among family history positive cases for common allelic effects. Familial illness might, therefore, represent a more heritable form of

  13. Genome-wide association studies of human adiposity: Zooming in on synapses

    DEFF Research Database (Denmark)

    Sandholt, Camilla H.; Grarup, Niels; Pedersen, Oluf;

    2015-01-01

    The decade anniversary for genome-wide assocn. studies (GWAS) is approaching, and this exptl. approach has commenced a deeper understanding of the genetics underlying complex diseases. In obesity genetics the GIANT (Genetic Investigation of ANthropometric Traits) consortium has played a crucial...

  14. A genome-wide screen and linkage mapping for a large pedigree with episodic ataxia.

    Science.gov (United States)

    Cader, M Z; Steckley, J L; Dyment, D A; McLachlan, R S; Ebers, G C

    2005-07-12

    Episodic ataxias are ion channel disorders characterized by attacks of incoordination. The authors performed a genome-wide screen in a large pedigree segregating a novel episodic ataxia and found significant linkage on 1q42 with a multipoint lod score of 3.65. Haplotype analysis and fine mapping yielded a peak 2-point lod score of 4.14 and indicated a 4-cM region on 1q42 that is likely to harbor an episodic ataxia gene.

  15. A resource of genome-wide single-nucleotide polymorphisms generated by RAD tag sequencing in the critically endangered European eel

    DEFF Research Database (Denmark)

    Pujolar, J.M.; Jacobsen, M.W.; Frydenberg, J.

    2013-01-01

    Reduced representation genome sequencing such as restriction-site-associated DNA (RAD) sequencing is finding increased use to identify and genotype large numbers of single-nucleotide polymorphisms (SNPs) in model and nonmodel species. We generated a unique resource of novel SNP markers for the Eu......Reduced representation genome sequencing such as restriction-site-associated DNA (RAD) sequencing is finding increased use to identify and genotype large numbers of single-nucleotide polymorphisms (SNPs) in model and nonmodel species. We generated a unique resource of novel SNP markers...... for the European eel using the RAD sequencing approach that was simultaneously identified and scored in a genome-wide scan of 30 individuals. Whereas genomic resources are increasingly becoming available for this species, including the recent release of a draft genome, no genome-wide set of SNP markers...

  16. Genome-wide association study of proneness to anger.

    Directory of Open Access Journals (Sweden)

    Eric Mick

    Full Text Available BACKGROUND: Community samples suggest that approximately 1 in 20 children and adults exhibit clinically significant anger, hostility, and aggression. Individuals with dysregulated emotional control have a greater lifetime burden of psychiatric morbidity, severe impairment in role functioning, and premature mortality due to cardiovascular disease. METHODS: With publically available data secured from dbGaP, we conducted a genome-wide association study of proneness to anger using the Spielberger State-Trait Anger Scale in the Atherosclerosis Risk in Communities (ARIC study (n = 8,747. RESULTS: Subjects were, on average, 54 (range 45-64 years old at baseline enrollment, 47% (n = 4,117 were male, and all were of European descent by self-report. The mean Angry Temperament and Angry Reaction scores were 5.8 ± 1.8 and 7.6 ± 2.2. We observed a nominally significant finding (p = 2.9E-08, λ = 1.027 - corrected pgc = 2.2E-07, λ = 1.0015 on chromosome 6q21 in the gene coding for the non-receptor protein-tyrosine kinase, Fyn. CONCLUSIONS: Fyn interacts with NDMA receptors and inositol-1,4,5-trisphosphate (IP3-gated channels to regulate calcium influx and intracellular release in the post-synaptic density. These results suggest that signaling pathways regulating intracellular calcium homeostasis, which are relevant to memory, learning, and neuronal survival, may in part underlie the expression of Angry Temperament.

  17. Cancer genetic association studies in the genome-wide age

    OpenAIRE

    Savage, Sharon A

    2008-01-01

    Genome-wide association studies of hundreds of thousands of SNPs have led to a deluge of studies of genetic variation in cancer and other common diseases. Large case–control and cohort studies have identified novel SNPs as markers of cancer risk. Genome-wide association study SNP data have also advanced understanding of population-specific genetic variation. While studies of risk profiles, combinations of SNPs that may increase cancer risk, are not yet clinically applicable, future, large-sca...

  18. Genome-wide polymorphisms show unexpected targets of natural selection

    OpenAIRE

    Pespeni, Melissa H.; Garfield, David A.; Manier, Mollie K; Palumbi, Stephen R.

    2011-01-01

    Natural selection can act on all the expressed genes of an individual, leaving signatures of genetic differentiation or diversity at many loci across the genome. New power to assay these genome-wide effects of selection comes from associating multi-locus patterns of polymorphism with gene expression and function. Here, we performed one of the first genome-wide surveys in a marine species, comparing purple sea urchins, Strongylocentrotus purpuratus, from two distant locations along the species...

  19. Genome-wide association study of multiplex schizophrenia pedigrees

    DEFF Research Database (Denmark)

    Levinson, Douglas F; Shi, Jianxin; Wang, Kai

    2012-01-01

    The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs).......The authors used a genome-wide association study (GWAS) of multiply affected families to investigate the association of schizophrenia to common single-nucleotide polymorphisms (SNPs) and rare copy number variants (CNVs)....

  20. Genome-Wide Association Study of Receptive Language Ability of 12-Year-Olds

    Science.gov (United States)

    Harlaar, Nicole; Meaburn, Emma L.; Hayiou-Thomas, Marianna E.; Davis, Oliver S. P.; Docherty, Sophia; Hanscombe, Ken B.; Haworth, Claire M. A.; Price, Thomas S.; Trzaskowski, Maciej; Dale, Philip S.; Plomin, Robert

    2014-01-01

    Purpose: Researchers have previously shown that individual differences in measures of receptive language ability at age 12 are highly heritable. In the current study, the authors attempted to identify some of the genes responsible for the heritability of receptive language ability using a "genome-wide association" approach. Method: The…

  1. Genome-Wide Association Study for Response to Eimeria maxima Challenge in Broilers

    DEFF Research Database (Denmark)

    Hamzic, Edin; Bed'hom, Bertrand; Hérault, Frédéric

    Use of genetic tools for improvement of host’s response is considered as a promising complementary approach for coccidiosis control. Therefore, we performed genome wide association study (GWAS) for response to Eimeria maxima challenge in broilers. The challenge was done on 2024 Cobb500 broilers. ...

  2. Genome-Wide Association Study of Receptive Language Ability of 12-Year-Olds

    Science.gov (United States)

    Harlaar, Nicole; Meaburn, Emma L.; Hayiou-Thomas, Marianna E.; Davis, Oliver S. P.; Docherty, Sophia; Hanscombe, Ken B.; Haworth, Claire M. A.; Price, Thomas S.; Trzaskowski, Maciej; Dale, Philip S.; Plomin, Robert

    2014-01-01

    Purpose: Researchers have previously shown that individual differences in measures of receptive language ability at age 12 are highly heritable. In the current study, the authors attempted to identify some of the genes responsible for the heritability of receptive language ability using a "genome-wide association" approach. Method: The…

  3. A genome-wide association search for type 2 diabetes genes in African Americans

    DEFF Research Database (Denmark)

    Palmer, Nicholette D; McDonough, Caitrin W; Hicks, Pamela J

    2012-01-01

    African Americans are disproportionately affected by type 2 diabetes (T2DM) yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wid...

  4. Implementing meta-analysis from genome-wide association studies for pork quality traits

    Science.gov (United States)

    Pork quality plays an important role in the meat processing industry, thus different methodologies have been implemented to elucidate the genetic architecture of traits affecting meat quality. One of the most common and widely used approaches is to perform genome-wide association (GWA) studies. Howe...

  5. Genome-wide gene expression changes in an industrial clavulanic acid overproduction strain of Streptomyces clavuligerus

    NARCIS (Netherlands)

    Medema, M.H.; Alam, M.T.; Heijne, W.H.M.; Berg, M.A. van den; Müller, U.; Trefzer, A.; Bovenberg, R.A.L.; Breitling, R.; Takano, E.

    2011-01-01

    To increase production of the important pharmaceutical compound clavulanic acid, a beta-lactamase inhibitor, both random mutagenesis approaches and rational engineering of Streptomyces clavuligerus strains have been extensively applied. Here, for the first time, we compared genome-wide gene expressi

  6. Genome-wide association studies in economics and entrepreneurship research: Promises and limitations

    NARCIS (Netherlands)

    Ph.D. Koellinger (Philipp); M.J.H.M. van der Loos (Matthijs); P.J.F. Groenen (Patrick); A.R. Thurik (Roy); F. Rivadeneira Ramirez (Fernando); F.J.A. van Rooij (Frank); A.G. Uitterlinden (André); A. Hofman (Albert)

    2010-01-01

    textabstractThe recently developed genome-wide association study (GWAS) design enables the identification of genes specifically associated with economic outcomes such as occupational and other choices. This is a promising new approach for economics research which we aim to apply to the choice for en

  7. Genome-wide association studies in economics and entrepreneurship research: promises and limitations

    NARCIS (Netherlands)

    Ph.D. Koellinger (Philipp); M.J.H.M. van der Loos (Matthijs); P.J.F. Groenen (Patrick); A.R. Thurik (Roy); F. Rivadeneira Ramirez (Fernando); F.J.A. van Rooij (Frank)

    2010-01-01

    textabstractThe recently developed genome-wide association study (GWAS) design enables the identification of genes specifically associated with economic outcomes such as occupational and other choices. This is a promising new approach for economics research which we aim to apply to the choice for en

  8. Genome-wide gene expression changes in an industrial clavulanic acid overproduction strain of Streptomyces clavuligerus.

    NARCIS (Netherlands)

    Medema, M.H.; Alam, M.T.; Heijne, W.H.; Berg, M.A.M.C. van den; Muller, U.; Trefzer, A.; Bovenberg, R.A.; Breitling, R.; Takano, E.

    2011-01-01

    To increase production of the important pharmaceutical compound clavulanic acid, a beta-lactamase inhibitor, both random mutagenesis approaches and rational engineering of Streptomyces clavuligerus strains have been extensively applied. Here, for the first time, we compared genome-wide gene expressi

  9. Natural selection on functional modules, a genome-wide analysis.

    Science.gov (United States)

    Serra, François; Arbiza, Leonardo; Dopazo, Joaquín; Dopazo, Hernán

    2011-03-01

    Classically, the functional consequences of natural selection over genomes have been analyzed as the compound effects of individual genes. The current paradigm for large-scale analysis of adaptation is based on the observed significant deviations of rates of individual genes from neutral evolutionary expectation. This approach, which assumed independence among genes, has not been able to identify biological functions significantly enriched in positively selected genes in individual species. Alternatively, pooling related species has enhanced the search for signatures of selection. However, grouping signatures does not allow testing for adaptive differences between species. Here we introduce the Gene-Set Selection Analysis (GSSA), a new genome-wide approach to test for evidences of natural selection on functional modules. GSSA is able to detect lineage specific evolutionary rate changes in a notable number of functional modules. For example, in nine mammal and Drosophilae genomes GSSA identifies hundreds of functional modules with significant associations to high and low rates of evolution. Many of the detected functional modules with high evolutionary rates have been previously identified as biological functions under positive selection. Notably, GSSA identifies conserved functional modules with many positively selected genes, which questions whether they are exclusively selected for fitting genomes to environmental changes. Our results agree with previous studies suggesting that adaptation requires positive selection, but not every mutation under positive selection contributes to the adaptive dynamical process of the evolution of species.

  10. Natural selection on functional modules, a genome-wide analysis.

    Directory of Open Access Journals (Sweden)

    François Serra

    2011-03-01

    Full Text Available Classically, the functional consequences of natural selection over genomes have been analyzed as the compound effects of individual genes. The current paradigm for large-scale analysis of adaptation is based on the observed significant deviations of rates of individual genes from neutral evolutionary expectation. This approach, which assumed independence among genes, has not been able to identify biological functions significantly enriched in positively selected genes in individual species. Alternatively, pooling related species has enhanced the search for signatures of selection. However, grouping signatures does not allow testing for adaptive differences between species. Here we introduce the Gene-Set Selection Analysis (GSSA, a new genome-wide approach to test for evidences of natural selection on functional modules. GSSA is able to detect lineage specific evolutionary rate changes in a notable number of functional modules. For example, in nine mammal and Drosophilae genomes GSSA identifies hundreds of functional modules with significant associations to high and low rates of evolution. Many of the detected functional modules with high evolutionary rates have been previously identified as biological functions under positive selection. Notably, GSSA identifies conserved functional modules with many positively selected genes, which questions whether they are exclusively selected for fitting genomes to environmental changes. Our results agree with previous studies suggesting that adaptation requires positive selection, but not every mutation under positive selection contributes to the adaptive dynamical process of the evolution of species.

  11. Semiparametric score sevel susion: Gaussian sopula approach

    NARCIS (Netherlands)

    Susyanyo, N.; Klaassen, C.A.J.; Veldhuis, R.N.J.; Spreeuwers, L.J.

    2015-01-01

    Score level fusion is an appealing method for combining multi-algorithms, multi- representations, and multi-modality biometrics due to its simplicity. Often, scores are assumed to be independent, but even for dependent scores, accord- ing to the Neyman-Pearson lemma, the likelihood ratio is the opti

  12. Genome-wide analyses of small noncoding RNAs in streptococci

    Directory of Open Access Journals (Sweden)

    Nadja ePatenge

    2015-05-01

    Full Text Available Streptococci represent a diverse group of Gram-positive bacteria, which colonize a wide range of hosts among animals and humans. Streptococcal species occur as commensal as well as pathogenic organisms. Many of the pathogenic species can cause severe, invasive infections in their hosts leading to a high morbidity and mortality. The consequence is a tremendous suffering on the part of men and livestock besides the significant financial burden in the agricultural and healthcare sectors. An environmentally stimulated and tightly controlled expression of virulence factor genes is of fundamental importance for streptococcal pathogenicity. Bacterial small noncoding RNAs (sRNAs modulate the expression of genes involved in stress response, sugar metabolism, surface composition, and other properties that are related to bacterial virulence. Even though the regulatory character is shared by this class of RNAs, variation on the molecular level results in a high diversity of functional mechanisms. The knowledge about the role of sRNAs in streptococci is still limited, but in recent years, genome-wide screens for sRNAs have been conducted in an increasing number of species. Bioinformatics prediction approaches have been employed as well as expression analyses by classical array techniques or next generation sequencing. This review will give an overview of whole genome screens for sRNAs in streptococci with a focus on describing the different methods and comparing their outcome considering sRNA conservation among species, functional similarities, and relevance for streptococcal infection.

  13. Insights into kidney diseases from genome-wide association studies.

    Science.gov (United States)

    Wuttke, Matthias; Köttgen, Anna

    2016-09-01

    Over the past decade, genome-wide association studies (GWAS) have considerably improved our understanding of the genetic basis of kidney function and disease. Population-based studies, used to investigate traits that define chronic kidney disease (CKD), have identified >50 genomic regions in which common genetic variants associate with estimated glomerular filtration rate or urinary albumin-to-creatinine ratio. Case-control studies, used to study specific CKD aetiologies, have yielded risk loci for specific kidney diseases such as IgA nephropathy and membranous nephropathy. In this Review, we summarize important findings from GWAS and clinical and experimental follow-up studies. We also compare risk allele frequency, effect sizes, and specificity in GWAS of CKD-defining traits and GWAS of specific CKD aetiologies and the implications for study design. Genomic regions identified in GWAS of CKD-defining traits can contain causal genes for monogenic kidney diseases. Population-based research on kidney function traits can therefore generate insights into more severe forms of kidney diseases. Experimental follow-up studies have begun to identify causal genes and variants, which are potential therapeutic targets, and suggest mechanisms underlying the high allele frequency of causal variants. GWAS are thus a useful approach to advance knowledge in nephrology.

  14. Identification of differential translation in genome wide studies.

    Science.gov (United States)

    Larsson, Ola; Sonenberg, Nahum; Nadon, Robert

    2010-12-14

    Regulation of gene expression through translational control is a fundamental mechanism implicated in many biological processes ranging from memory formation to innate immunity and whose dysregulation contributes to human diseases. Genome wide analyses of translational control strive to identify differential translation independent of cytosolic mRNA levels. For this reason, most studies measure genes' translation levels as log ratios (translation levels divided by corresponding cytosolic mRNA levels obtained in parallel). Counterintuitively, arising from a mathematical necessity, these log ratios tend to be highly correlated with the cytosolic mRNA levels. Accordingly, they do not effectively correct for cytosolic mRNA level and generate substantial numbers of biological false positives and false negatives. We show that analysis of partial variance, which produces estimates of translational activity that are independent of cytosolic mRNA levels, is a superior alternative. When combined with a variance shrinkage method for estimating error variance, analysis of partial variance has the additional benefit of having greater statistical power and identifying fewer genes as translationally regulated resulting merely from unrealistically low variance estimates rather than from large changes in translational activity. In contrast to log ratios, this formal analytical approach estimates translation effects in a statistically rigorous manner, eliminates the need for inefficient and error-prone heuristics, and produces results that agree with biological function. The method is applicable to datasets obtained from both the commonly used polysome microarray method and the sequencing-based ribosome profiling method.

  15. Genome-Wide Association Studies of the Human Gut Microbiota.

    Directory of Open Access Journals (Sweden)

    Emily R Davenport

    Full Text Available The bacterial composition of the human fecal microbiome is influenced by many lifestyle factors, notably diet. It is less clear, however, what role host genetics plays in dictating the composition of bacteria living in the gut. In this study, we examined the association of ~200K host genotypes with the relative abundance of fecal bacterial taxa in a founder population, the Hutterites, during two seasons (n = 91 summer, n = 93 winter, n = 57 individuals collected in both. These individuals live and eat communally, minimizing variation due to environmental exposures, including diet, which could potentially mask small genetic effects. Using a GWAS approach that takes into account the relatedness between subjects, we identified at least 8 bacterial taxa whose abundances were associated with single nucleotide polymorphisms in the host genome in each season (at genome-wide FDR of 20%. For example, we identified an association between a taxon known to affect obesity (genus Akkermansia and a variant near PLD1, a gene previously associated with body mass index. Moreover, we replicate a previously reported association from a quantitative trait locus (QTL mapping study of fecal microbiome abundance in mice (genus Lactococcus, rs3747113, P = 3.13 x 10-7. Finally, based on the significance distribution of the associated microbiome QTLs in our study with respect to chromatin accessibility profiles, we identified tissues in which host genetic variation may be acting to influence bacterial abundance in the gut.

  16. Genome-wide studies of telomere biology in budding yeast

    Directory of Open Access Journals (Sweden)

    Yaniv Harari

    2014-03-01

    Full Text Available Telomeres are specialized DNA-protein structures at the ends of eukaryotic chromosomes. Telomeres are essential for chromosomal stability and integrity, as they prevent chromosome ends from being recognized as double strand breaks. In rapidly proliferating cells, telomeric DNA is synthesized by the enzyme telomerase, which copies a short template sequence within its own RNA moiety, thus helping to solve the “end-replication problem”, in which information is lost at the ends of chromosomes with each DNA replication cycle. The basic mechanisms of telomere length, structure and function maintenance are conserved among eukaryotes. Studies in the yeast Saccharomyces cerevisiae have been instrumental in deciphering the basic aspects of telomere biology. In the last decade, technical advances, such as the availability of mutant collections, have allowed carrying out systematic genome-wide screens for mutants affecting various aspects of telomere biology. In this review we summarize these efforts, and the insights that this Systems Biology approach has produced so far.

  17. Comparative analysis of methods for genome-wide nucleosome cartography.

    Science.gov (United States)

    Quintales, Luis; Vázquez, Enrique; Antequera, Francisco

    2015-07-01

    Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several experimental protocols have been developed for this purpose that include the enzymatic digestion, chemical cleavage or immunoprecipitation of chromatin followed by next-generation sequencing of the resulting DNA fragments. Here, we compare the performance and resolution of these methods from the initial biochemical steps through the alignment of the millions of short-sequence reads to a reference genome to the final computational analysis to generate genome-wide maps of nucleosome occupancy. Because of the lack of a unified protocol to process data sets obtained through the different approaches, we have developed a new computational tool (NUCwave), which facilitates their analysis, comparison and assessment and will enable researchers to choose the most suitable method for any particular purpose. NUCwave is freely available at http://nucleosome.usal.es/nucwave along with a step-by-step protocol for its use. © The Author 2014. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  18. Novel genetic matching methods for handling population stratification in genome-wide association studies.

    Science.gov (United States)

    Lacour, André; Schüller, Vitalia; Drichel, Dmitriy; Herold, Christine; Jessen, Frank; Leber, Markus; Maier, Wolfgang; Noethen, Markus M; Ramirez, Alfredo; Vaitsiakhovich, Tatsiana; Becker, Tim

    2015-03-14

    A usually confronted problem in association studies is the occurrence of population stratification. In this work, we propose a novel framework to consider population matchings in the contexts of genome-wide and sequencing association studies. We employ pairwise and groupwise optimal case-control matchings and present an agglomerative hierarchical clustering, both based on a genetic similarity score matrix. In order to ensure that the resulting matches obtained from the matching algorithm capture correctly the population structure, we propose and discuss two stratum validation methods. We also invent a decisive extension to the Cochran-Armitage Trend test to explicitly take into account the particular population structure. We assess our framework by simulations of genotype data under the null hypothesis, to affirm that it correctly controls for the type-1 error rate. By a power study we evaluate that structured association testing using our framework displays reasonable power. We compare our result with those obtained from a logistic regression model with principal component covariates. Using the principal components approaches we also find a possible false-positive association to Alzheimer's disease, which is neither supported by our new methods, nor by the results of a most recent large meta analysis or by a mixed model approach. Matching methods provide an alternative handling of confounding due to population stratification for statistical tests for which covariates are hard to model. As a benchmark, we show that our matching framework performs equally well to state of the art models on common variants.

  19. Genome-wide prediction of transcriptional regulatory elements of human promoters using gene expression and promoter analysis data

    Directory of Open Access Journals (Sweden)

    Kim Seon-Young

    2006-07-01

    Full Text Available Abstract Background A complete understanding of the regulatory mechanisms of gene expression is the next important issue of genomics. Many bioinformaticians have developed methods and algorithms for predicting transcriptional regulatory mechanisms from sequence, gene expression, and binding data. However, most of these studies involved the use of yeast which has much simpler regulatory networks than human and has many genome wide binding data and gene expression data under diverse conditions. Studies of genome wide transcriptional networks of human genomes currently lag behind those of yeast. Results We report herein a new method that combines gene expression data analysis with promoter analysis to infer transcriptional regulatory elements of human genes. The Z scores from the application of gene set analysis with gene sets of transcription factor binding sites (TFBSs were successfully used to represent the activity of TFBSs in a given microarray data set. A significant correlation between the Z scores of gene sets of TFBSs and individual genes across multiple conditions permitted successful identification of many known human transcriptional regulatory elements of genes as well as the prediction of numerous putative TFBSs of many genes which will constitute a good starting point for further experiments. Using Z scores of gene sets of TFBSs produced better predictions than the use of mRNA levels of a transcription factor itself, suggesting that the Z scores of gene sets of TFBSs better represent diverse mechanisms for changing the activity of transcription factors in the cell. In addition, cis-regulatory modules, combinations of co-acting TFBSs, were readily identified by our analysis. Conclusion By a strategic combination of gene set level analysis of gene expression data sets and promoter analysis, we were able to identify and predict many transcriptional regulatory elements of human genes. We conclude that this approach will aid in decoding

  20. A Genome-Wide Scan for Breast Cancer Risk Haplotypes among African American Women

    Science.gov (United States)

    Song, Chi; Chen, Gary K.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; Chanock, Stephen J.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Chris A.; Stram, Daniel O.

    2013-01-01

    Genome-wide association studies (GWAS) simultaneously investigating hundreds of thousands of single nucleotide polymorphisms (SNP) have become a powerful tool in the investigation of new disease susceptibility loci. Haplotypes are sometimes thought to be superior to SNPs and are promising in genetic association analyses. The application of genome-wide haplotype analysis, however, is hindered by the complexity of haplotypes themselves and sophistication in computation. We systematically analyzed the haplotype effects for breast cancer risk among 5,761 African American women (3,016 cases and 2,745 controls) using a sliding window approach on the genome-wide scale. Three regions on chromosomes 1, 4 and 18 exhibited moderate haplotype effects. Furthermore, among 21 breast cancer susceptibility loci previously established in European populations, 10p15 and 14q24 are likely to harbor novel haplotype effects. We also proposed a heuristic of determining the significance level and the effective number of independent tests by the permutation analysis on chromosome 22 data. It suggests that the effective number was approximately half of the total (7,794 out of 15,645), thus the half number could serve as a quick reference to evaluating genome-wide significance if a similar sliding window approach of haplotype analysis is adopted in similar populations using similar genotype density. PMID:23468962

  1. Genome-wide association study of chemotherapeutic agent-induced severe neutropenia/leucopenia for patients in Biobank Japan.

    Science.gov (United States)

    Low, Siew-Kee; Chung, Suyoun; Takahashi, Atsushi; Zembutsu, Hitoshi; Mushiroda, Taisei; Kubo, Michiaki; Nakamura, Yusuke

    2013-08-01

    Chemotherapeutic agents are notoriously known to have a narrow therapeutic range that often results in life-threatening toxicity. Hence, it is clinically important to identify the patients who are at high risk for severe toxicity to certain chemotherapy through a pharmacogenomics approach. In this study, we carried out multiple genome-wide association studies (GWAS) of 13 122 cancer patients who received different chemotherapy regimens, including cyclophosphamide- and platinum-based (cisplatin and carboplatin), anthracycline-based (doxorubicin and epirubicin), and antimetabolite-based (5-fluorouracil and gemcitabine) treatment, antimicrotubule agents (paclitaxel and docetaxel), and topoisomerase inhibitors (camptothecin and etoposide), as well as combination therapy with paclitaxel and carboplatin, to identify genetic variants that are associated with the risk of severe neutropenia/leucopenia in the Japanese population. In addition, we used a weighted genetic risk scoring system to evaluate the cumulative effects of the suggestive genetic variants identified from GWAS in order to predict the risk levels of individuals who carry multiple risk alleles. Although we failed to identify genetic variants that surpassed the genome-wide significance level (P < 5.0 × 10(-8) ) through GWAS, probably due to insufficient statistical power and complex clinical features, we were able to shortlist some of the suggestive associated loci. The current study is at the relatively preliminary stage, but does highlight the complexity and problematic issues associated with retrospective pharmacogenomics studies. However, we hope that verification of these genetic variants through local and international collaborations could improve the clinical outcome for cancer patients.

  2. A novel statistic for genome-wide interaction analysis.

    Science.gov (United States)

    Wu, Xuesen; Dong, Hua; Luo, Li; Zhu, Yun; Peng, Gang; Reveille, John D; Xiong, Momiao

    2010-09-23

    Although great progress in genome-wide association studies (GWAS) has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked). The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDRanalysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search significant interaction between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  3. Genome-wide promoter methylome of small renal masses.

    Directory of Open Access Journals (Sweden)

    Ilsiya Ibragimova

    Full Text Available The majority of renal cell carcinoma (RCC is now incidentally detected and presents as small renal masses (SRMs defined as ≤ 4 cm in size. SRMs are heterogeneous comprising several histological types of RCC each with different biology and behavior, and benign tumors mainly oncocytoma. The varied prognosis of the different types of renal tumor has implications for management options. A key epigenetic alteration involved in the initiation and progression of cancer is aberrant methylation in the promoter region of a gene. The hypermethylation is associated with transcriptional repression and is an important mechanism of inactivation of tumor suppressor genes in neoplastic cells. We have determined the genome-wide promoter methylation profiles of 47 pT1a and 2 pT1b clear cell, papillary or chromophobe RCC, 25 benign renal oncocytoma ≤ 4 cm and 4 normal renal parenchyma specimens by Infinium HumanMethylation27 beadchip technology. We identify gene promoter hypermethylation signatures that distinguish clear cell and papillary from each other, from chromophobe and oncocytoma, and from normal renal cells. Pairwise comparisons revealed genes aberrantly hypermethylated in a tumor type but unmethylated in normal, and often unmethylated in the other renal tumor types. About 0.4% to 1.7% of genes comprised the promoter methylome in SRMs. The Infinium methylation score for representative genes was verified by gold standard technologies. The genes identified as differentially methylated implicate pathways involved in metabolism, tissue response to injury, epithelial to mesenchymal transition (EMT, signal transduction and G-protein coupled receptors (GPCRs, cancer, and stem cell regulation in the biology of RCC. Our findings contribute towards an improved understanding of the development of RCC, the different biology and behavior of histological types, and discovery of molecular subtypes. The differential methylation signatures may have utility in early

  4. Genome-wide association study for subclinical atherosclerosis in major arterial territories in the NHLBI's Framingham Heart Study

    Directory of Open Access Journals (Sweden)

    Hwang Shih-Jen

    2007-09-01

    Full Text Available Abstract Introduction Subclinical atherosclerosis (SCA measures in multiple arterial beds are heritable phenotypes that are associated with increased incidence of cardiovascular disease. We conducted a genome-wide association study (GWAS for SCA measurements in the community-based Framingham Heart Study. Methods Over 100,000 single nucleotide polymorphisms (SNPs were genotyped (Human 100K GeneChip, Affymetrix in 1345 subjects from 310 families. We calculated sex-specific age-adjusted and multivariable-adjusted residuals in subjects tested for quantitative SCA phenotypes, including ankle-brachial index, coronary artery calcification and abdominal aortic calcification using multi-detector computed tomography, and carotid intimal medial thickness (IMT using carotid ultrasonography. We evaluated associations of these phenotypes with 70,987 autosomal SNPs with minor allele frequency ≥ 0.10, call rate ≥ 80%, and Hardy-Weinberg p-value ≥ 0.001 in samples ranging from 673 to 984 subjects, using linear regression with generalized estimating equations (GEE methodology and family-based association testing (FBAT. Variance components LOD scores were also calculated. Results There was no association result meeting criteria for genome-wide significance, but our methods identified 11 SNPs with p -5 by GEE and five SNPs with p -5 by FBAT for multivariable-adjusted phenotypes. Among the associated variants were SNPs in or near genes that may be considered candidates for further study, such as rs1376877 (GEE p ABI2 for maximum internal carotid artery IMT and rs4814615 (FBAT p = 0.000003, located in PCSK2 for maximum common carotid artery IMT. Modest significant associations were noted with various SCA phenotypes for variants in previously reported atherosclerosis candidate genes, including NOS3 and ESR1. Associations were also noted of a region on chromosome 9p21 with CAC phenotypes that confirm associations with coronary heart disease and CAC in two

  5. Power analysis for genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Klein Robert J

    2007-08-01

    Full Text Available Abstract Background Genome-wide association studies are a promising new tool for deciphering the genetics of complex diseases. To choose the proper sample size and genotyping platform for such studies, power calculations that take into account genetic model, tag SNP selection, and the population of interest are required. Results The power of genome-wide association studies can be computed using a set of tag SNPs and a large number of genotyped SNPs in a representative population, such as available through the HapMap project. As expected, power increases with increasing sample size and effect size. Power also depends on the tag SNPs selected. In some cases, more power is obtained by genotyping more individuals at fewer SNPs than fewer individuals at more SNPs. Conclusion Genome-wide association studies should be designed thoughtfully, with the choice of genotyping platform and sample size being determined from careful power calculations.

  6. A novel statistic for genome-wide interaction analysis.

    Directory of Open Access Journals (Sweden)

    Xuesen Wu

    2010-09-01

    Full Text Available Although great progress in genome-wide association studies (GWAS has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked. The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001genome-wide interaction analysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search significant interaction between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  7. Robustness of genome-wide scanning using archived dried blood spot samples as a DNA source

    Directory of Open Access Journals (Sweden)

    Børglum Anders D

    2011-07-01

    Full Text Available Abstract Background The search to identify disease-susceptible genes requires access to biological material from numerous well-characterized subjects. Archived residual dried blood spot (DBS samples, also known as Guthrie cards, from national newborn screening programs may provide a DNA source for entire populations. Combined with clinical information from medical registries, DBS samples could provide a rich source for productive research. However, the amounts of DNA which can be extracted from these precious samples are minute and may be prohibitive for numerous genotypings. Previously, we demonstrated that DBS DNA can be whole-genome amplified and used for reliable genetic analysis on different platforms, including genome-wide scanning arrays. However, it remains unclear whether this approach is workable on a large sample scale. We examined the robustness of using DBS samples for whole-genome amplification following genome-wide scanning, using arrays from Illumina and Affymetrix. Results This study is based on 4,641 DBS samples from the Danish Newborn Screening Biobank, extracted for three separate genome-wide association studies. The amount of amplified DNA was significantly (P Conclusion Our study indicates that archived DBS samples from the Danish Newborn Screening Biobank represent a reliable resource of DNA for whole-genome amplification and subsequent genome-wide association studies. With call-rates equivalent to high quality DNA samples, our results point to new opportunities for using the neonatal biobanks available worldwide in the hunt for genetic components of disease.

  8. Genome-wide methylation analyses in glioblastoma multiforme.

    Directory of Open Access Journals (Sweden)

    Rose K Lai

    Full Text Available Few studies had investigated genome-wide methylation in glioblastoma multiforme (GBM. Our goals were to study differential methylation across the genome in gene promoters using an array-based method, as well as repetitive elements using surrogate global methylation markers. The discovery sample set for this study consisted of 54 GBM from Columbia University and Case Western Reserve University, and 24 brain controls from the New York Brain Bank. We assembled a validation dataset using methylation data of 162 TCGA GBM and 140 brain controls from dbGAP. HumanMethylation27 Analysis Bead-Chips (Illumina were used to interrogate 26,486 informative CpG sites in both the discovery and validation datasets. Global methylation levels were assessed by analysis of L1 retrotransposon (LINE1, 5 methyl-deoxycytidine (5m-dC and 5 hydroxylmethyl-deoxycytidine (5hm-dC in the discovery dataset. We validated a total of 1548 CpG sites (1307 genes that were differentially methylated in GBM compared to controls. There were more than twice as many hypomethylated genes as hypermethylated ones. Both the discovery and validation datasets found 5 tumor methylation classes. Pathway analyses showed that the top ten pathways in hypomethylated genes were all related to functions of innate and acquired immunities. Among hypermethylated pathways, transcriptional regulatory network in embryonic stem cells was the most significant. In the study of global methylation markers, 5m-dC level was the best discriminant among methylation classes, whereas in survival analyses, high level of LINE1 methylation was an independent, favorable prognostic factor in the discovery dataset. Based on a pathway approach, hypermethylation in genes that control stem cell differentiation were significant, poor prognostic factors of overall survival in both the discovery and validation datasets. Approaches that targeted these methylated genes may be a future therapeutic goal.

  9. Improved statistics for genome-wide interaction analysis.

    Science.gov (United States)

    Ueki, Masao; Cordell, Heather J

    2012-01-01

    Recently, Wu and colleagues [1] proposed two novel statistics for genome-wide interaction analysis using case/control or case-only data. In computer simulations, their proposed case/control statistic outperformed competing approaches, including the fast-epistasis option in PLINK and logistic regression analysis under the correct model; however, reasons for its superior performance were not fully explored. Here we investigate the theoretical properties and performance of Wu et al.'s proposed statistics and explain why, in some circumstances, they outperform competing approaches. Unfortunately, we find minor errors in the formulae for their statistics, resulting in tests that have higher than nominal type 1 error. We also find minor errors in PLINK's fast-epistasis and case-only statistics, although theory and simulations suggest that these errors have only negligible effect on type 1 error. We propose adjusted versions of all four statistics that, both theoretically and in computer simulations, maintain correct type 1 error rates under the null hypothesis. We also investigate statistics based on correlation coefficients that maintain similar control of type 1 error. Although designed to test specifically for interaction, we show that some of these previously-proposed statistics can, in fact, be sensitive to main effects at one or both loci, particularly in the presence of linkage disequilibrium. We propose two new "joint effects" statistics that, provided the disease is rare, are sensitive only to genuine interaction effects. In computer simulations we find, in most situations considered, that highest power is achieved by analysis under the correct genetic model. Such an analysis is unachievable in practice, as we do not know this model. However, generally high power over a wide range of scenarios is exhibited by our joint effects and adjusted Wu statistics. We recommend use of these alternative or adjusted statistics and urge caution when using Wu et al

  10. Genome-Wide Association of Stem Water Soluble Carbohydrates in Bread Wheat

    Science.gov (United States)

    Dong, Yan; Liu, Jindong; Zhang, Yan; Geng, Hongwei; Rasheed, Awais; Xiao, Yonggui; Cao, Shuanghe; Fu, Luping; Yan, Jun; Wen, Weie; Zhang, Yong; Jing, Ruilian; Xia, Xianchun; He, Zhonghu

    2016-01-01

    Water soluble carbohydrates (WSC) in stems play an important role in buffering grain yield in wheat against biotic and abiotic stresses; however, knowledge of genes controlling WSC is very limited. We conducted a genome-wide association study (GWAS) using a high-density 90K SNP array to better understand the genetic basis underlying WSC, and to explore marker-based breeding approaches. WSC was evaluated in an association panel comprising 166 Chinese bread wheat cultivars planted in four environments. Fifty two marker-trait associations (MTAs) distributed across 23 loci were identified for phenotypic best linear unbiased estimates (BLUEs), and 11 MTAs were identified in two or more environments. Liner regression showed a clear dependence of WSC BLUE scores on numbers of favorable (increasing WSC content) and unfavorable alleles (decreasing WSC), indicating that genotypes with higher numbers of favorable or lower numbers of unfavorable alleles had higher WSC content. In silico analysis of flanking sequences of trait-associated SNPs revealed eight candidate genes related to WSC content grouped into two categories based on the type of encoding proteins, namely, defense response proteins and proteins triggered by environmental stresses. The identified SNPs and candidate genes related to WSC provide opportunities for breeding higher WSC wheat cultivars. PMID:27802269

  11. High-Resolution Genome-Wide Linkage Mapping Identifies Susceptibility Loci for BMI in the Chinese Population

    DEFF Research Database (Denmark)

    Zhang, Dong Feng; Pang, Zengchang; Li, Shuxia

    2012-01-01

    The genetic loci affecting the commonly used BMI have been intensively investigated using linkage approaches in multiple populations. This study aims at performing the first genome-wide linkage scan on BMI in the Chinese population in mainland China with hypothesis that heterogeneity in genetic...... in western countries. Multiple loci showing suggestive linkage were found on chromosome 1 (lod score 2.38 at 242 cM), chromosome 8 (2.48 at 95 cM), and chromosome 14 (2.2 at 89.4 cM). The strong linkage identified in the Chinese subjects that is consistent with that found in populations of European origin...... could suggest the existence of evolutionarily preserved genetic mechanisms for BMI whereas the multiple suggestive loci could represent genetic effect from gene-environment interaction as a result of population-specific environmental adaptation....

  12. Polygenic analysis of genome-wide SNP data identifies common variants on allergic rhinitis

    DEFF Research Database (Denmark)

    Mohammadnejad, Afsaneh; Brasch-Andersen, Charlotte; Haagerup, Annette

    Background: Allergic Rhinitis (AR) is a complex disorder that affects many people around the world. There is a high genetic contribution to the development of the AR, as twins and family studies have estimated heritability of more than 33%. Due to the complex nature of the disease, single SNP...... analysis has limited power in identifying the genetic variations for AR. We combined genome-wide association analysis (GWAS) with polygenic risk score (PRS) in exploring the genetic basis underlying the disease. Methods: We collected clinical data on 631 Danish subjects with AR cases consisting of 434...... sibling pairs and unrelated individuals and control subjects of 197 unrelated individuals. SNP genotyping was done by Affymetrix Genome-Wide Human SNP Array 5.0. SNP imputation was performed using "IMPUTE2". Using additive effect model, GWAS was conducted in discovery sample, the genotypes...

  13. Genome-wide distribution of genetic diversity and linkage disequilibrium in elite sugar beet germplasm

    Directory of Open Access Journals (Sweden)

    Weißleder Knuth

    2011-10-01

    Full Text Available Abstract Background Characterization of population structure and genetic diversity of germplasm is essential for the efficient organization and utilization of breeding material. The objectives of this study were to (i explore the patterns of population structure in the pollen parent heterotic pool using different methods, (ii investigate the genome-wide distribution of genetic diversity, and (iii assess the extent and genome-wide distribution of linkage disequilibrium (LD in elite sugar beet germplasm. Results A total of 264 and 238 inbred lines from the yield type and sugar type inbreds of the pollen parent heterotic gene pools, respectively, which had been genotyped with 328 SNP markers, were used in this study. Two distinct subgroups were detected based on different statistical methods within the elite sugar beet germplasm set, which was in accordance with its breeding history. MCLUST based on principal components, principal coordinates, or lapvectors had high correspondence with the germplasm type information as well as the assignment by STRUCTURE, which indicated that these methods might be alternatives to STRUCTURE for population structure analysis. Gene diversity and modified Roger's distance between the examined germplasm types varied considerably across the genome, which might be due to artificial selection. This observation indicates that population genetic approaches could be used to identify candidate genes for the traits under selection. Due to the fact that r2 >0.8 is required to detect marker-phenotype association explaining less than 1% of the phenotypic variance, our observation of a low proportion of SNP loci pairs showing such levels of LD suggests that the number of markers has to be dramatically increased for powerful genome-wide association mapping. Conclusions We provided a genome-wide distribution map of genetic diversity and linkage disequilibrium for the elite sugar beet germplasm, which is useful for the application of

  14. Genome-wide meta-analyses of smoking behaviors in African Americans.

    Science.gov (United States)

    David, S P; Hamidovic, A; Chen, G K; Bergen, A W; Wessel, J; Kasberger, J L; Brown, W M; Petruzella, S; Thacker, E L; Kim, Y; Nalls, M A; Tranah, G J; Sung, Y J; Ambrosone, C B; Arnett, D; Bandera, E V; Becker, D M; Becker, L; Berndt, S I; Bernstein, L; Blot, W J; Broeckel, U; Buxbaum, S G; Caporaso, N; Casey, G; Chanock, S J; Deming, S L; Diver, W R; Eaton, C B; Evans, D S; Evans, M K; Fornage, M; Franceschini, N; Harris, T B; Henderson, B E; Hernandez, D G; Hitsman, B; Hu, J J; Hunt, S C; Ingles, S A; John, E M; Kittles, R; Kolb, S; Kolonel, L N; Le Marchand, L; Liu, Y; Lohman, K K; McKnight, B; Millikan, R C; Murphy, A; Neslund-Dudas, C; Nyante, S; Press, M; Psaty, B M; Rao, D C; Redline, S; Rodriguez-Gil, J L; Rybicki, B A; Signorello, L B; Singleton, A B; Smoller, J; Snively, B; Spring, B; Stanford, J L; Strom, S S; Swan, G E; Taylor, K D; Thun, M J; Wilson, A F; Witte, J S; Yamamura, Y; Yanek, L R; Yu, K; Zheng, W; Ziegler, R G; Zonderman, A B; Jorgenson, E; Haiman, C A; Furberg, H

    2012-05-22

    The identification and exploration of genetic loci that influence smoking behaviors have been conducted primarily in populations of the European ancestry. Here we report results of the first genome-wide association study meta-analysis of smoking behavior in African Americans in the Study of Tobacco in Minority Populations Genetics Consortium (n = 32,389). We identified one non-coding single-nucleotide polymorphism (SNP; rs2036527[A]) on chromosome 15q25.1 associated with smoking quantity (cigarettes per day), which exceeded genome-wide significance (β = 0.040, s.e. = 0.007, P = 1.84 × 10(-8)). This variant is present in the 5'-distal enhancer region of the CHRNA5 gene and defines the primary index signal reported in studies of the European ancestry. No other SNP reached genome-wide significance for smoking initiation (SI, ever vs never smoking), age of SI, or smoking cessation (SC, former vs current smoking). Informative associations that approached genome-wide significance included three modestly correlated variants, at 15q25.1 within PSMA4, CHRNA5 and CHRNA3 for smoking quantity, which are associated with a second signal previously reported in studies in European ancestry populations, and a signal represented by three SNPs in the SPOCK2 gene on chr10q22.1. The association at 15q25.1 confirms this region as an important susceptibility locus for smoking quantity in men and women of African ancestry. Larger studies will be needed to validate the suggestive loci that did not reach genome-wide significance and further elucidate the contribution of genetic variation to disparities in cigarette consumption, SC and smoking-attributable disease between African Americans and European Americans.

  15. Genome-Wide Architecture of Disease Resistance Genes in Lettuce.

    Science.gov (United States)

    Christopoulou, Marilena; Wo, Sebastian Reyes-Chin; Kozik, Alex; McHale, Leah K; Truco, Maria-Jose; Wroblewski, Tadeusz; Michelmore, Richard W

    2015-10-08

    Genome-wide motif searches identified 1134 genes in the lettuce reference genome of cv. Salinas that are potentially involved in pathogen recognition, of which 385 were predicted to encode nucleotide binding-leucine rich repeat receptor (NLR) proteins. Using a maximum-likelihood approach, we grouped the NLRs into 25 multigene families and 17 singletons. Forty-one percent of these NLR-encoding genes belong to three families, the largest being RGC16 with 62 genes in cv. Salinas. The majority of NLR-encoding genes are located in five major resistance clusters (MRCs) on chromosomes 1, 2, 3, 4, and 8 and cosegregate with multiple disease resistance phenotypes. Most MRCs contain primarily members of a single NLR gene family but a few are more complex. MRC2 spans 73 Mb and contains 61 NLRs of six different gene families that cosegregate with nine disease resistance phenotypes. MRC3, which is 25 Mb, contains 22 RGC21 genes and colocates with Dm13. A library of 33 transgenic RNA interference tester stocks was generated for functional analysis of NLR-encoding genes that cosegregated with disease resistance phenotypes in each of the MRCs. Members of four NLR-encoding families, RGC1, RGC2, RGC21, and RGC12 were shown to be required for 16 disease resistance phenotypes in lettuce. The general composition of MRCs is conserved across different genotypes; however, the specific repertoire of NLR-encoding genes varied particularly of the rapidly evolving Type I genes. These tester stocks are valuable resources for future analyses of additional resistance phenotypes. Copyright © 2015 Christopoulou et al.

  16. Genome-wide inference of regulatory networks in Streptomyces coelicolor

    Directory of Open Access Journals (Sweden)

    Takano Eriko

    2010-10-01

    Full Text Available Abstract Background The onset of antibiotics production in Streptomyces species is co-ordinated with differentiation events. An understanding of the genetic circuits that regulate these coupled biological phenomena is essential to discover and engineer the pharmacologically important natural products made by these species. The availability of genomic tools and access to a large warehouse of transcriptome data for the model organism, Streptomyces coelicolor, provides incentive to decipher the intricacies of the regulatory cascades and develop biologically meaningful hypotheses. Results In this study, more than 500 samples of genome-wide temporal transcriptome data, comprising wild-type and more than 25 regulatory gene mutants of Streptomyces coelicolor probed across multiple stress and medium conditions, were investigated. Information based on transcript and functional similarity was used to update a previously-predicted whole-genome operon map and further applied to predict transcriptional networks constituting modules enriched in diverse functions such as secondary metabolism, and sigma factor. The predicted network displays a scale-free architecture with a small-world property observed in many biological networks. The networks were further investigated to identify functionally-relevant modules that exhibit functional coherence and a consensus motif in the promoter elements indicative of DNA-binding elements. Conclusions Despite the enormous experimental as well as computational challenges, a systems approach for integrating diverse genome-scale datasets to elucidate complex regulatory networks is beginning to emerge. We present an integrated analysis of transcriptome data and genomic features to refine a whole-genome operon map and to construct regulatory networks at the cistron level in Streptomyces coelicolor. The functionally-relevant modules identified in this study pose as potential targets for further studies and verification.

  17. Genome-Wide Association Study of Schizophrenia in Japanese Population

    Science.gov (United States)

    Yamada, Kazuo; Iwayama, Yoshimi; Hattori, Eiji; Iwamoto, Kazuya; Toyota, Tomoko; Ohnishi, Tetsuo; Ohba, Hisako; Maekawa, Motoko; Kato, Tadafumi; Yoshikawa, Takeo

    2011-01-01

    Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS) is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs) in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions). The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila)-like 2] gene located on 9p21.3 (p = 0.00087). In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals) of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026). The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology. PMID:21674006

  18. Genome-wide survey for biologically functional pseudogenes.

    Directory of Open Access Journals (Sweden)

    Orjan Svensson

    2006-05-01

    Full Text Available According to current estimates there exist about 20,000 pseudogenes in a mammalian genome. The vast majority of these are disabled and nonfunctional copies of protein-coding genes which, therefore, evolve neutrally. Recent findings that a Makorin1 pseudogene, residing on mouse Chromosome 5, is, indeed, in vivo vital and also evolutionarily preserved, encouraged us to conduct a genome-wide survey for other functional pseudogenes in human, mouse, and chimpanzee. We identify to our knowledge the first examples of conserved pseudogenes common to human and mouse, originating from one duplication predating the human-mouse species split and having evolved as pseudogenes since the species split. Functionality is one possible way to explain the apparently contradictory properties of such pseudogene pairs, i.e., high conservation and ancient origin. The hypothesis of functionality is tested by comparing expression evidence and synteny of the candidates with proper test sets. The tests suggest potential biological function. Our candidate set includes a small set of long-lived pseudogenes whose unknown potential function is retained since before the human-mouse species split, and also a larger group of primate-specific ones found from human-chimpanzee searches. Two processed sequences are notable, their conservation since the human-mouse split being as high as most protein-coding genes; one is derived from the protein Ataxin 7-like 3 (ATX7NL3, and one from the Spinocerebellar ataxia type 1 protein (ATX1. Our approach is comparative and can be applied to any pair of species. It is implemented by a semi-automated pipeline based on cross-species BLAST comparisons and maximum-likelihood phylogeny estimations. To separate pseudogenes from protein-coding genes, we use standard methods, utilizing in-frame disablements, as well as a probabilistic filter based on Ka/Ks ratios.

  19. Genome-wide association study of schizophrenia in Japanese population.

    Directory of Open Access Journals (Sweden)

    Kazuo Yamada

    Full Text Available Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions. The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila-like 2] gene located on 9p21.3 (p = 0.00087. In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026. The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology.

  20. Probabilistic protein function prediction from heterogeneous genome-wide data.

    Directory of Open Access Journals (Sweden)

    Naoki Nariai

    Full Text Available Dramatic improvements in high throughput sequencing technologies have led to a staggering growth in the number of predicted genes. However, a large fraction of these newly discovered genes do not have a functional assignment. Fortunately, a variety of novel high-throughput genome-wide functional screening technologies provide important clues that shed light on gene function. The integration of heterogeneous data to predict protein function has been shown to improve the accuracy of automated gene annotation systems. In this paper, we propose and evaluate a probabilistic approach for protein function prediction that integrates protein-protein interaction (PPI data, gene expression data, protein motif information, mutant phenotype data, and protein localization data. First, functional linkage graphs are constructed from PPI data and gene expression data, in which an edge between nodes (proteins represents evidence for functional similarity. The assumption here is that graph neighbors are more likely to share protein function, compared to proteins that are not neighbors. The functional linkage graph model is then used in concert with protein domain, mutant phenotype and protein localization data to produce a functional prediction. Our method is applied to the functional prediction of Saccharomyces cerevisiae genes, using Gene Ontology (GO terms as the basis of our annotation. In a cross validation study we show that the integrated model increases recall by 18%, compared to using PPI data alone at the 50% precision. We also show that the integrated predictor is significantly better than each individual predictor. However, the observed improvement vs. PPI depends on both the new source of data and the functional category to be predicted. Surprisingly, in some contexts integration hurts overall prediction accuracy. Lastly, we provide a comprehensive assignment of putative GO terms to 463 proteins that currently have no assigned function.

  1. Genome-wide gene expression analysis of anguillid herpesvirus 1

    NARCIS (Netherlands)

    Beurden, van S.J.; Peeters, B.P.H.; Rottier, P.J.M.; Davison, A.A.; Engelsma, M.Y.

    2013-01-01

    Background Whereas temporal gene expression in mammalian herpesviruses has been studied extensively, little is known about gene expression in fish herpesviruses. Here we report a genome-wide transcription analysis of a fish herpesvirus, anguillid herpesvirus 1, in cell culture, studied during the

  2. Genome-Wide Scan Reveals Mutation Associated with Melanoma

    Science.gov (United States)

    ... Q R S T U V W X Y Z We want to hear from you You are here: News & Events 2017 2016 2015 2014 2013 2012 2011 2010 2009 2008 2007 2006 2005 2004 2003 2002 2001 2000 1999 Spotlight on Research 2012 July 2012 (historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A team of ...

  3. A genome-wide scan for preeclampsia in the Netherlands

    NARCIS (Netherlands)

    Lachmeijer, AMA; Arngrimsson, R; Bastiaans, EJ; Frigge, ML; Pals, G; Sigurdardottir, S; Stefansson, H; Palsson, B; Nicolae, D; Kong, A; Aarnoudse, JG; Gulcher, [No Value; Dekker, GA; ten Kate, LP; Stefansson, K

    2001-01-01

    Preeclampsia, hallmarked by de novo hypertension and proteinuria in pregnancy, has a familial tendency. Recently, a large Icelandic genome-wide scan provided evidence for a maternal susceptibility locus for preeclampsia on chromosome 2p13 which was confirmed by a genome scan from Australia and New

  4. Genome-wide RNA Tomography in the Zebrafish Embryo

    NARCIS (Netherlands)

    Junker, Jan Philipp; Noël, Emily S; Guryev, Victor; Peterson, Kevin A; Shah, Gopi; Huisken, Jan; McMahon, Andrew P; Berezikov, Eugene; Bakkers, Jeroen; van Oudenaarden, Alexander

    2014-01-01

    Advancing our understanding of embryonic development is heavily dependent on identification of novel pathways or regulators. Although genome-wide techniques such as RNA sequencing are ideally suited for discovering novel candidate genes, they are unable to yield spatially resolved information in

  5. Genome-wide RNA Tomography in the zebrafish embryo

    NARCIS (Netherlands)

    Junker, Jan Philipp; Noël, Emily S; Guryev, Victor; Peterson, Kevin A; Shah, Gopi; Huisken, Jan; McMahon, Andrew P; Berezikov, Eugene; Bakkers, Jeroen; van Oudenaarden, Alexander

    2014-01-01

    Advancing our understanding of embryonic development is heavily dependent on identification of novel pathways or regulators. Although genome-wide techniques such as RNA sequencing are ideally suited for discovering novel candidate genes, they are unable to yield spatially resolved information in

  6. Genome-wide association study identifies five new schizophrenia loci

    NARCIS (Netherlands)

    Ripke, Stephan; Sanders, Alan R.; Kendler, Kenneth S.; Levinson, Douglas F.; Sklar, Pamela; Holmans, Peter A.; Lin, Dan-Yu; Duan, Jubao; Ophoff, Roel A.; Andreassen, Ole A.; Scolnick, Edward; Cichon, Sven; Clair, David St.; Corvin, Aiden; Gurling, Hugh; Werge, Thomas; Rujescu, Dan; Blackwood, Douglas H. R.; Pato, Carlos N.; Malhotra, Anil K.; Purcell, Shaun; Dudbridge, Frank; Neale, Benjamin M.; Rossin, Lizzy; Visscher, Peter M.; Posthuma, Danielle; Ruderfer, Douglas M.; Fanous, Ayman; Stefansson, Hreinn; Steinberg, Stacy; Mowry, Bryan J.; Golimbet, Vera; De Hert, Marc; Jonsson, Erik G.; Bitter, Istvan; Pietilainen, Olli P. H.; Collier, David A.; Tosato, Sarah; Agartz, Ingrid; Albus, Margot; Alexander, Madeline; Amdur, Richard L.; Amin, Farooq; Bass, Nicholas; Bergen, Sarah E.; Black, Donald W.; Borglum, Anders D.; Brown, Matthew A.; Bruggeman, Richard; Buccola, Nancy G.; Byerley, William F.; Cahn, Wiepke; Cantor, Rita M.; Carr, Vaughan J.; Catts, Stanley V.; Choudhury, Khalid; Cloninger, C. Robert; Cormican, Paul; Craddock, Nicholas; Danoy, Patrick A.; Datta, Susmita; De Haan, Lieuwe; Demontis, Ditte; Dikeos, Dimitris; Djurovic, Srdjan; Donnelly, Peter; Donohoe, Gary; Duong, Linh; Dwyer, Sarah; Fink-Jensen, Anders; Freedman, Robert; Freimer, Nelson B.; Friedl, Marion; Georgieva, Lyudmila; Giegling, Ina; Gill, Michael; Glenthoj, Birte; Godard, Stephanie; Hamshere, Marian; Hansen, Mark; Hansen, Thomas; Hartmann, Annette M.; Henskens, Frans A.; Hougaard, David M.; Hultman, Christina M.; Ingason, Andres; Jablensky, Assen V.; Jakobsen, Klaus D.; Jay, Maurice; Juergens, Gesche; Kahn, Renes; Keller, Matthew C.; Kenis, Gunter; Kenny, Elaine; Kim, Yunjung; Kirov, George K.; Konnerth, Heike; Konte, Bettina; Krabbendam, Lydia; Krasucki, Robert; Lasseter, Virginia K.; Laurent, Claudine; Lawrence, Jacob; Lencz, Todd; Lerer, F. Bernard; Liang, Kung-Yee; Lichtenstein, Paul; Lieberman, Jeffrey A.; Linszen, Don H.; Lonnqvist, Jouko; Loughland, Carmel M.; Maclean, Alan W.; Maher, Brion S.; Maier, Wolfgang; Mallet, Jacques; Malloy, Pat; Mattheisen, Manuel; Mattingsdal, Morten; McGhee, Kevin A.; McGrath, John J.; McIntosh, Andrew; McLean, Duncan E.; McQuillin, Andrew; Melle, Ingrid; Michie, Patricia T.; Milanova, Vihra; Morris, Derek W.; Mors, Ole; Mortensen, Preben B.; Moskvina, Valentina; Muglia, Pierandrea; Myin-Germeys, Inez; Nertney, Deborah A.; Nestadt, Gerald; Nielsen, Jimmi; Nikolov, Ivan; Nordentoft, Merete; Norton, Nadine; Noethen, Markus M.; O'Dushlaine, Colm T.; Olincy, Ann; Olsen, Line; O'Neill, F. Anthony; Orntoft, Torben F.; Owen, Michael J.; Pantelis, Christos; Papadimitriou, George; Pato, Michele T.; Peltonen, Leena; Petursson, Hannes; Pickard, Ben; Pimm, Jonathan; Pulver, Ann E.; Puri, Vinay; Quested, Digby; Quinn, Emma M.; Rasmussen, Henrik B.; Rethelyi, Janos M.; Ribble, Robert; Rietschel, Marcella; Riley, Brien P.; Ruggeri, Mirella; Schall, Ulrich; Schulze, Thomas G.; Schwab, Sibylle G.; Scott, Rodney J.; Shi, Jianxin; Sigurdsson, Engilbert; Silverman, Jeremy M.; Spencer, Chris C. A.; Stefansson, Kari; Strange, Amy; Strengman, Eric; Stroup, T. Scott; Suvisaari, Jaana; Terenius, Lars; Thirumalai, Srinivasa; Thygesen, Johan H.; Timm, Sally; Toncheva, Draga; van den Oord, Edwin; van Os, Jim; van Winkel, Ruud; Veldink, Jan; Walsh, Dermot; Wang, August G.; Wiersma, Durk; Wildenauer, Dieter B.; Williams, Hywel J.; Williams, Nigel M.; Wormley, Brandon; Zammit, Stan; Sullivan, Patrick F.; O'Donovan, Michael C.; Daly, Mark J.; Gejman, Pablo V.

    2011-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded

  7. A genome-wide association study of anorexia nervosa

    NARCIS (Netherlands)

    Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, Annemarie; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M

    2014-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countri

  8. Genome-Wide Association Analysis in Primary Sclerosing Cholangitis

    NARCIS (Netherlands)

    T.H. Karlsen; A. Franke; E. Melum; A.. Kaser; J.R. Hov; T. Balschun; B.A. Lie; A. Bergquist; C. Schramm; T.J. Weismüller; D. Gotthardt; C. Rust; E.E.R. Philipp; T. Fritz; L. Henckaerts; R. Weersma; P. Stokkers; C.Y. Ponsioen; C. Wijmenga; M. Sterneck; M. Nothnagel; J. Hampe; A. Teufel; H. Runz; P. Rosenstiel; A. Stiehl; S. Vermeire; U. Beuers; M. Manns; E. Schrumpf; K.M. Boberg; S. Schreiber

    2010-01-01

    BACKGROUND & AIMS: We aimed to characterize the genetic susceptibility to primary sclerosing cholangitis (PSC) by means of a genome-wide association analysis of single nucleotide polymorphism (SNP) markers. METHODS: A total of 443,816 SNPs on the Affymetrix SNP Array 5.0 (Affymetrix, Santa Clara, CA

  9. Genome-wide association study of Tourette's syndrome

    NARCIS (Netherlands)

    Scharf, J. M.; Yu, D.; Mathews, C. A.; Neale, B. M.; Stewart, S. E.; Fagerness, J. A.; Evans, P.; Gamazon, E.; Edlund, C. K.; Service, S. K.; Tikhomirov, A.; Osiecki, L.; Illmann, C.; Pluzhnikov, A.; Konkashbaev, A.; Davis, L. K.; Han, B.; Crane, J.; Moorjani, P.; Crenshaw, A. T.; Parkin, M. A.; Reus, V. I.; Lowe, T. L.; Rangel-Lugo, M.; Chouinard, S.; Dion, Y.; Girard, S.; Cath, D. C.; Smit, J. H.; King, R. A.; Fernandez, T. V.; Leckman, J. F.; Kidd, K. K.; Kidd, J. R.; Pakstis, A. J.; State, M. W.; Herrera, L. D.; Romero, R.; Fournier, E.; Sandor, P.; Barr, C. L.; Phan, N.; Gross-Tsur, V.; Benarroch, F.; Pollak, Y.; Budman, C. L.; Bruun, R. D.; Erenberg, G.; Naarden, A. L.; Lee, P. C.; Weiss, N.; Kremeyer, B.; Berrio, G. B.; Campbell, D. D.; Cardona Silgado, J. C.; Ochoa, W. C.; Mesa Restrepo, S. C.; Muller, H.; Valencia Duarte, A. V.; Lyon, G. J.; Leppert, M.; Morgan, J.; Weiss, R.; Grados, M. A.; Anderson, K.; Davarya, S.; Singer, H.; Walkup, J.; Jankovic, J.; Tischfield, J. A.; Heiman, G. A.; Gilbert, D. L.; Hoekstra, P. J.; Robertson, M. M.; Kurlan, R.; Liu, C.; Gibbs, J. R.; Singleton, A.; Hardy, J.; Strengman, E.; Ophoff, R. A.; Wagner, M.; Moessner, R.; Mirel, D. B.; Posthuma, D.; Sabatti, C.; Eskin, E.; Conti, D. V.; Knowles, J. A.; Ruiz-Linares, A.; Rouleau, G. A.; Purcell, S.; Heutink, P.; Oostra, B. A.; McMahon, W. M.; Freimer, N. B.; Cox, N. J.; Pauls, D. L.

    2013-01-01

    Tourette's syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association

  10. Genome-wide association study identifies five new schizophrenia loci

    NARCIS (Netherlands)

    Ripke, Stephan; Sanders, Alan R.; Kendler, Kenneth S.; Levinson, Douglas F.; Sklar, Pamela; Holmans, Peter A.; Lin, Dan-Yu; Duan, Jubao; Ophoff, Roel A.; Andreassen, Ole A.; Scolnick, Edward; Cichon, Sven; Clair, David St.; Corvin, Aiden; Gurling, Hugh; Werge, Thomas; Rujescu, Dan; Blackwood, Douglas H. R.; Pato, Carlos N.; Malhotra, Anil K.; Purcell, Shaun; Dudbridge, Frank; Neale, Benjamin M.; Rossin, Lizzy; Visscher, Peter M.; Posthuma, Danielle; Ruderfer, Douglas M.; Fanous, Ayman; Stefansson, Hreinn; Steinberg, Stacy; Mowry, Bryan J.; Golimbet, Vera; De Hert, Marc; Jonsson, Erik G.; Bitter, Istvan; Pietilainen, Olli P. H.; Collier, David A.; Tosato, Sarah; Agartz, Ingrid; Albus, Margot; Alexander, Madeline; Amdur, Richard L.; Amin, Farooq; Bass, Nicholas; Bergen, Sarah E.; Black, Donald W.; Borglum, Anders D.; Brown, Matthew A.; Bruggeman, Richard; Buccola, Nancy G.; Byerley, William F.; Cahn, Wiepke; Cantor, Rita M.; Carr, Vaughan J.; Catts, Stanley V.; Choudhury, Khalid; Cloninger, C. Robert; Cormican, Paul; Craddock, Nicholas; Danoy, Patrick A.; Datta, Susmita; De Haan, Lieuwe; Demontis, Ditte; Dikeos, Dimitris; Djurovic, Srdjan; Donnelly, Peter; Donohoe, Gary; Duong, Linh; Dwyer, Sarah; Fink-Jensen, Anders; Freedman, Robert; Freimer, Nelson B.; Friedl, Marion; Georgieva, Lyudmila; Giegling, Ina; Gill, Michael; Glenthoj, Birte; Godard, Stephanie; Hamshere, Marian; Hansen, Mark; Hansen, Thomas; Hartmann, Annette M.; Henskens, Frans A.; Hougaard, David M.; Hultman, Christina M.; Ingason, Andres; Jablensky, Assen V.; Jakobsen, Klaus D.; Jay, Maurice; Juergens, Gesche; Kahn, Renes; Keller, Matthew C.; Kenis, Gunter; Kenny, Elaine; Kim, Yunjung; Kirov, George K.; Konnerth, Heike; Konte, Bettina; Krabbendam, Lydia; Krasucki, Robert; Lasseter, Virginia K.; Laurent, Claudine; Lawrence, Jacob; Lencz, Todd; Lerer, F. Bernard; Liang, Kung-Yee; Lichtenstein, Paul; Lieberman, Jeffrey A.; Linszen, Don H.; Lonnqvist, Jouko; Loughland, Carmel M.; Maclean, Alan W.; Maher, Brion S.; Maier, Wolfgang; Mallet, Jacques; Malloy, Pat; Mattheisen, Manuel; Mattingsdal, Morten; McGhee, Kevin A.; McGrath, John J.; McIntosh, Andrew; McLean, Duncan E.; McQuillin, Andrew; Melle, Ingrid; Michie, Patricia T.; Milanova, Vihra; Morris, Derek W.; Mors, Ole; Mortensen, Preben B.; Moskvina, Valentina; Muglia, Pierandrea; Myin-Germeys, Inez; Nertney, Deborah A.; Nestadt, Gerald; Nielsen, Jimmi; Nikolov, Ivan; Nordentoft, Merete; Norton, Nadine; Noethen, Markus M.; O'Dushlaine, Colm T.; Olincy, Ann; Olsen, Line; O'Neill, F. Anthony; Orntoft, Torben F.; Owen, Michael J.; Pantelis, Christos; Papadimitriou, George; Pato, Michele T.; Peltonen, Leena; Petursson, Hannes; Pickard, Ben; Pimm, Jonathan; Pulver, Ann E.; Puri, Vinay; Quested, Digby; Quinn, Emma M.; Rasmussen, Henrik B.; Rethelyi, Janos M.; Ribble, Robert; Rietschel, Marcella; Riley, Brien P.; Ruggeri, Mirella; Schall, Ulrich; Schulze, Thomas G.; Schwab, Sibylle G.; Scott, Rodney J.; Shi, Jianxin; Sigurdsson, Engilbert; Silverman, Jeremy M.; Spencer, Chris C. A.; Stefansson, Kari; Strange, Amy; Strengman, Eric; Stroup, T. Scott; Suvisaari, Jaana; Terenius, Lars; Thirumalai, Srinivasa; Thygesen, Johan H.; Timm, Sally; Toncheva, Draga; van den Oord, Edwin; van Os, Jim; van Winkel, Ruud; Veldink, Jan; Walsh, Dermot; Wang, August G.; Wiersma, Durk; Wildenauer, Dieter B.; Williams, Hywel J.; Williams, Nigel M.; Wormley, Brandon; Zammit, Stan; Sullivan, Patrick F.; O'Donovan, Michael C.; Daly, Mark J.; Gejman, Pablo V.

    2011-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded

  11. Genome-wide association study identifies five new schizophrenia loci.

    LENUS (Irish Health Repository)

    Ripke, Stephan

    2011-10-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10(-11)) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10(-9)), ANK3 (rs10994359, P = 2.5 × 10(-8)) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10(-9)).

  12. Genome-wide association study identifies five new schizophrenia loci

    DEFF Research Database (Denmark)

    Ripke, Stephan; Sanders, Alan R; Kendler, Kenneth S

    2011-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yiel...

  13. Genome-wide significant risk associations for mucinous ovarian carcinoma

    DEFF Research Database (Denmark)

    Kelemen, Linda E; Lawrenson, Kate; Tyrer, Jonathan;

    2015-01-01

    Genome-wide association studies have identified several risk associations for ovarian carcinomas but not for mucinous ovarian carcinomas (MOCs). Our analysis of 1,644 MOC cases and 21,693 controls with imputation identified 3 new risk associations: rs752590 at 2q13 (P = 3.3 × 10(-8)), rs711830 at...

  14. Best diagnostic approach for the genetic evaluation of fetuses after intrauterine death in first, second or third trimester: QF-PCR, karyotyping and/or genome wide SNP array analysis

    NARCIS (Netherlands)

    Kooper, A.J.A.; Faas, B.H.W.; Feenstra, I.; Leeuw, N. de; Smeets, D.F.C.M.

    2014-01-01

    BACKGROUND: The aim of this study was to evaluate the best diagnostic approach for the genetic analysis of samples from first, second and third trimester intrauterine fetal deaths (IUFDs). We examined a total of 417 IUFD samples from fetuses with and without congenital anomalies. On 414 samples, kar

  15. Best diagnostic approach for the genetic evaluation of fetuses after intrauterine death in first, second or third trimester: QF-PCR, karyotyping and/or genome wide SNP array analysis

    NARCIS (Netherlands)

    Kooper, A.J.A.; Faas, B.H.W.; Feenstra, I.; Leeuw, N. de; Smeets, D.F.C.M.

    2014-01-01

    BACKGROUND: The aim of this study was to evaluate the best diagnostic approach for the genetic analysis of samples from first, second and third trimester intrauterine fetal deaths (IUFDs). We examined a total of 417 IUFD samples from fetuses with and without congenital anomalies. On 414 samples,

  16. Mosaicism for genome-wide paternal uniparental disomy with features of multiple imprinting disorders: diagnostic and management issues.

    Science.gov (United States)

    Inbar-Feigenberg, Michal; Choufani, Sanaa; Cytrynbaum, Cheryl; Chen, Yi-An; Steele, Leslie; Shuman, Cheryl; Ray, Peter N; Weksberg, Rosanna

    2013-01-01

    Mosaicism for genome-wide paternal uniparental disomy (UPD) has been reported in only seven live born individuals to date. Clinical presentation includes manifestations of multiple paternal UPD syndromes with high variability, likely due to the variable levels of mosaicism in different somatic tissues. We report an eighth case in a female patient with mosaicism for genome-wide paternal UPD which highlights the complex clinical presentation. Our patient had features of Beckwith-Wiedemann syndrome (BWS), Angelman syndrome, and congenital hyperinsulinism. The clinical findings included prematurity, organomegaly, hemihyperplasia, developmental delay, benign tumors, and cystic lesions. The diagnosis in our patient was established utilizing microarray-based genome-wide DNA methylation analysis performed on leukocyte DNA. Targeted multiplex ligation-dependent probe amplification (MLPA) analysis of chromosome regions 11p15 and 15q13 confirmed mosaicism for paternal UPD at these genomic regions. This case represents the first report of microarray-based genome-wide DNA methylation analysis in the diagnosis of genome-wide paternal UPD. The application of microarray-based genome-wide DNA methylation analysis on selected individuals with complex clinical presentations could be a valuable diagnostic tool to improve the detection rate of mosaic genome-wide paternal UPD. This approach, which screens many loci simultaneously, is more cost-effective and less labor-intensive than performing multiple targeted DNA methylation-based assays. Identification of individuals with mosaicism for genome-wide paternal UPD is an important goal as it confers a low recurrence risk for the family and identifies individuals who require surveillance due to increased tumor risk.

  17. Performance of polygenic scores for predicting phobic anxiety.

    Directory of Open Access Journals (Sweden)

    Stefan Walter

    Full Text Available CONTEXT: Anxiety disorders are common, with a lifetime prevalence of 20% in the U.S., and are responsible for substantial burdens of disability, missed work days and health care utilization. To date, no causal genetic variants have been identified for anxiety, anxiety disorders, or related traits. OBJECTIVE: To investigate whether a phobic anxiety symptom score was associated with 3 alternative polygenic risk scores, derived from external genome-wide association studies of anxiety, an internally estimated agnostic polygenic score, or previously identified candidate genes. DESIGN: Longitudinal follow-up study. Using linear and logistic regression we investigated whether phobic anxiety was associated with polygenic risk scores derived from internal, leave-one out genome-wide association studies, from 31 candidate genes, and from out-of-sample genome-wide association weights previously shown to predict depression and anxiety in another cohort. SETTING AND PARTICIPANTS: Study participants (n = 11,127 were individuals from the Nurses' Health Study and Health Professionals Follow-up Study. MAIN OUTCOME MEASURE: Anxiety symptoms were assessed via the 8-item phobic anxiety scale of the Crown Crisp Index at two time points, from which a continuous phenotype score was derived. RESULTS: We found no genome-wide significant associations with phobic anxiety. Phobic anxiety was also not associated with a polygenic risk score derived from the genome-wide association study beta weights using liberal p-value thresholds; with a previously published genome-wide polygenic score; or with a candidate gene risk score based on 31 genes previously hypothesized to predict anxiety. CONCLUSION: There is a substantial gap between twin-study heritability estimates of anxiety disorders ranging between 20-40% and heritability explained by genome-wide association results. New approaches such as improved genome imputations, application of gene expression and biological

  18. Genome-wide identification of significant aberrations in cancer genome

    Directory of Open Access Journals (Sweden)

    Yuan Xiguo

    2012-07-01

    Full Text Available Abstract Background Somatic Copy Number Alterations (CNAs in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC, a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1 exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2 performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3 iteratively detecting Significant Copy Number Aberrations (SCAs and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. Results We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma. When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC or tumor suppressor genes (e.g., CDKN2A/B. Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. Conclusions Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes

  19. Privacy-Preserving Data Sharing for Genome-Wide Association Studies

    CERN Document Server

    Uhler, Caroline; Fienberg, Stephen E

    2012-01-01

    Traditional statistical methods for confidentiality protection of statistical databases do not scale well to deal with GWAS (genome-wide association studies) databases especially in terms of guarantees regarding protection from linkage to external information. The more recent concept of differential privacy, introduced by the cryptographic community, is an approach which provides a rigorous definition of privacy with meaningful privacy guarantees in the presence of arbitrary external information, although the guarantees come at a serious price in terms of data utility. Building on such notions, we propose new methods to release aggregate GWAS data without compromising an individual's privacy. We present methods for releasing differentially private minor allele frequencies, chi-square statistics and p-values. We compare these approaches on simulated data and on a GWAS study of canine hair length involving 685 dogs. We also propose a privacy-preserving method for finding genome-wide associations based on a diff...

  20. Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma

    OpenAIRE

    Chambers, John C.; Zhang, Weihua; Sehmi, Joban; Li, Xinzhong; Wass, Mark N; Harst, Pim; Holm, Hilma; Sanna, Serena; Kavousi, Maryam; Baumeister, Sebastian E.; Coin, Lachlan J.; Deng, Guohong; Gieger, Christian; Heard-Costa, Nancy L.; Hottenga, Jouke-Jan

    2011-01-01

    Concentrations of liver enzymes in plasma are widely used as indicators of liver disease. We carried out a genome-wide association study in 61,089 individuals, identifying 42 loci associated with concentrations of liver enzymes in plasma, of which 32 are new associations (P = 10(-8) to P = 10(-190)). We used functional genomic approaches including metabonomic profiling and gene expression analyses to identify probable candidate genes at these regions. We identified 69 candidate genes, includi...

  1. Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma

    OpenAIRE

    Chambers, John C.; Zhang, Weihua; Sehmi, Joban; Li, Xinzhong; Wass, Mark N; Harst, Pim; Holm, Hilma; Sanna, Serena; Kavousi, Maryam; Baumeister, Sebastian E.; Coin, Lachlan J.; Deng, Guohong; Gieger, Christian; Heard-Costa, Nancy L.; Hottenga, Jouke-Jan

    2011-01-01

    Concentrations of liver enzymes in plasma are widely used as indicators of liver disease. We carried out a genome-wide association study in 61,089 individuals, identifying 42 loci associated with concentrations of liver enzymes in plasma, of which 32 are new associations (P = 10−8 to P = 10−190). We used functional genomic approaches including metabonomic profiling and gene expression analyses to identify probable candidate genes at these regions. We identified 69 candidate genes, including g...

  2. Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma.

    OpenAIRE

    Chambers, John C.; Zhang, Weihua; Sehmi, Joban; Li, Xinzhong; Wass, Mark N; Harst, Pim; Holm, Hilma; Sanna, Serena; Kavousi, Maryam; Baumeister, Sebastian E.; Coin, Lachlan J.; Abecasis, Goncalo R.; Ahmadi, Kourosh R; Boomsma, Dorret I; Caulfield, Mark

    2011-01-01

    Concentrations of liver enzymes in plasma are widely used as indicators of liver disease. We carried out a genome-wide association study in 61,089 individuals, identifying 42 loci associated with concentrations of liver enzymes in plasma, of which 32 are new associations (P = 10(-8) to P = 10(-190)). We used functional genomic approaches including metabonomic profiling and gene expression analyses to identify probable candidate genes at these regions. We identified 69 candidate genes, includi...

  3. Genome-Wide Scan for Adaptive Divergence and Association with Population-Specific Covariates

    OpenAIRE

    Gautier, Mathieu

    2015-01-01

    In population genomics studies, accounting for the neutral covariance structure across population allele frequencies is critical to improve the robustness of genome-wide scan approaches. Elaborating on the BayEnv model, this study investigates several modeling extensions (i) to improve the estimation accuracy of the population covariance matrix and all the related measures, (ii) to identify significantly overly differentiated SNPs based on a calibration procedure of the XtX statistics, and (i...

  4. Mapping the sensory perception of apple using descriptive sensory evaluation in a genome wide association study

    OpenAIRE

    Amyotte, Beatrice; Bowen, Amy J.; Banks, Travis; Rajcan, Istvan; Somers, Daryl J.

    2017-01-01

    Breeding apples is a long-term endeavour and it is imperative that new cultivars are selected to have outstanding consumer appeal. This study has taken the approach of merging sensory science with genome wide association analyses in order to map the human perception of apple flavour and texture onto the apple genome. The goal was to identify genomic associations that could be used in breeding apples for improved fruit quality. A collection of 85 apple cultivars was examined over two years thr...

  5. Epigenomic model of cardiac enhancers with application to genome wide association studies.

    Science.gov (United States)

    Sahu, Avinash Das; Aniba, Radhouane; Chang, Yen-Pei Christy; Hannenhalli, Sridhar

    2013-01-01

    Mammalian gene regulation is often mediated by distal enhancer elements, in particular, for tissue specific and developmental genes. Computational identification of enhancers is difficult because they do not exhibit clear location preference relative to their target gene and also because they lack clearly distinguishing genomic features. This represents a major challenge in deciphering transcriptional regulation. Recent ChIP-seq based genome-wide investigation of epigenomic modifications have revealed that enhancers are often enriched for certain epigenomic marks. Here we utilize the epigenomic data in human heart tissue along with validated human heart enhancers to develop a Support Vector Machine (SVM) model of cardiac enhancers. Cross-validation classification accuracy of our model was 84% and 92% on positive and negative sets respectively with ROC AUC = 0.92. More importantly, while P300 binding has been used as gold standard for enhancers, our model can distinguish P300-bound validated enhancers from other P300-bound regions that failed to exhibit enhancer activity in transgenic mouse. While GWAS studies reveal polymorphic regions associated with certain phenotypes, they do not immediately provide causality. Next, we hypothesized that genomic regions containing a GWAS SNP associated with a cardiac phenotype might contain another SNP in a cardiac enhancer, which presumably mediates the phenotype. Starting with a comprehensive set of SNPs associated with cardiac phenotypes in GWAS studies, we scored other SNPs in LD with the GWAS SNP according to its probability of being an enhancer and choose one with best score in the LD as enhancer. We found that our predicted enhancers are enriched for known cardiac transcriptional regulator motifs and are likely to regulate the nearby gene. Importantly, these tendencies are more favorable for the predicted enhancers compared with an approach that uses P300 binding as a marker of enhancer activity.

  6. Genome-wide association analyses using electronic health records identify new loci influencing blood pressure variation.

    Science.gov (United States)

    Hoffmann, Thomas J; Ehret, Georg B; Nandakumar, Priyanka; Ranatunga, Dilrini; Schaefer, Catherine; Kwok, Pui-Yan; Iribarren, Carlos; Chakravarti, Aravinda; Risch, Neil

    2017-01-01

    Longitudinal electronic health records on 99,785 Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort individuals provided 1,342,814 systolic and diastolic blood pressure measurements for a genome-wide association study on long-term average systolic, diastolic, and pulse pressure. We identified 39 new loci among 75 genome-wide significant loci (P ≤ 5 × 10(-8)), with most replicating in the combined International Consortium for Blood Pressure (ICBP; n = 69,396) and UK Biobank (UKB; n = 152,081) studies. Combining GERA with ICBP yielded 36 additional new loci, with most replicating in UKB. Combining all three studies (n = 321,262) yielded 241 additional genome-wide significant loci, although no replication sample was available for these. All associated loci explained 2.9%, 2.5%, and 3.1% of variation in systolic, diastolic, and pulse pressure, respectively, in GERA non-Hispanic whites. Using multiple blood pressure measurements in GERA doubled the variance explained. A normalized risk score was associated with time to onset of hypertension (hazards ratio = 1.18, P = 8.2 × 10(-45)). Expression quantitative trait locus analysis of blood pressure loci showed enrichment in aorta and tibial artery.

  7. Using Ontology Fingerprints to Evaluate Genome-wide Association Results

    OpenAIRE

    Lam Tsoi; Michael Boehnke; Richard Klein; Jim Zheng

    2009-01-01

    We describe an approach to characterize genes or phenotypes via ontology fingerprints which are composed of Gene Ontology (GO) terms overrepresented among those PubMed abstracts linked to the genes or phenotypes. We then quantify the biological relevance between genes and phenotypes by comparing their ontology fingerprints to calculate a similarity score. We validated this approach by correctly identifying genes belong to their biological pathways with high accuracy, and applied this approach...

  8. Genome-wide pathway association studies of multiple correlated quantitative phenotypes using principle component analyses.

    Directory of Open Access Journals (Sweden)

    Feng Zhang

    Full Text Available Genome-wide pathway association studies provide novel insight into the biological mechanism underlying complex diseases. Current pathway association studies primarily focus on single important disease phenotype, which is sometimes insufficient to characterize the clinical manifestations of complex diseases. We present a multi-phenotypes pathway association study(MPPAS approach using principle component analysis(PCA. In our approach, PCA is first applied to multiple correlated quantitative phenotypes for extracting a set of orthogonal phenotypic components. The extracted phenotypic components are then used for pathway association analysis instead of original quantitative phenotypes. Four statistics were proposed for PCA-based MPPAS in this study. Simulations using the real data from the HapMap project were conducted to evaluate the power and type I error rates of PCA-based MPPAS under various scenarios considering sample sizes, additive and interactive genetic effects. A real genome-wide association study data set of bone mineral density (BMD at hip and spine were also analyzed by PCA-based MPPAS. Simulation studies illustrated the performance of PCA-based MPPAS for identifying the causal pathways underlying complex diseases. Genome-wide MPPAS of BMD detected associations between BMD and KENNY_CTNNB1_TARGETS_UP as well as LONGEVITYPATHWAY pathways in this study. We aim to provide a applicable MPPAS approach, which may help to gain deep understanding the potential biological mechanism of association results for complex diseases.

  9. Genome-wide association studies and resting heart rate

    DEFF Research Database (Denmark)

    Oskari Kilpeläinen, Tuomas

    2016-01-01

    Genome-wide association studies (GWASs) have revolutionized the search for genetic variants regulating resting heart rate. In the last 10 years, GWASs have led to the identification of at least 21 novel heart rate loci. These discoveries have provided valuable insights into the mechanisms...... and pathways that regulate heart rate and link heart rate to cardiovascular morbidity and mortality. GWASs capture majority of genetic variation in a population sample by utilizing high-throughput genotyping chips measuring genotypes for up to several millions of SNPs across the genome in thousands...... of individuals. This allows the identification of the strongest heart rate associated signals at genome-wide level. While GWASs provide robust statistical evidence of the association of a given genetic locus with heart rate, they are only the starting point for detailed follow-up studies to locate the causal...

  10. Genome-wide patterns of selection in 230 ancient Eurasians

    Science.gov (United States)

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R.; Llamas, Bastien; Dryomov, Stanislav; Pickrel, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vayacheslav; Rojo Guerra, Manuel A.; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W.; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2016-01-01

    Ancient DNA makes it possible to directly witness natural selection by analyzing samples from populations before, during and after adaptation events. Here we report the first scan for selection using ancient DNA, capitalizing on the largest genome-wide dataset yet assembled: 230 West Eurasians dating to between 6500 and 1000 BCE, including 163 with newly reported data. The new samples include the first genome-wide data from the Anatolian Neolithic culture whose genetic material we extracted from the DNA-rich petrous bone and who we show were members of the population that was the source of Europe’s first farmers. We also report a complete transect of the steppe region in Samara between 5500 and 1200 BCE that allows us to recognize admixture from at least two external sources into steppe populations during this period. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  11. Genome-wide patterns of nucleotide polymorphism in domesticated rice

    DEFF Research Database (Denmark)

    Caicedo, Ana L; Williamson, Scott H; Hernandez, Ryan D

    2007-01-01

    Domesticated Asian rice (Oryza sativa) is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments......, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs) in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models...... explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection...

  12. Genome-Wide Association Study of Polymorphisms Predisposing to Bronchiolitis

    Science.gov (United States)

    Pasanen, Anu; Karjalainen, Minna K.; Bont, Louis; Piippo-Savolainen, Eija; Ruotsalainen, Marja; Goksör, Emma; Kumawat, Kuldeep; Hodemaekers, Hennie; Nuolivirta, Kirsi; Jartti, Tuomas; Wennergren, Göran; Hallman, Mikko; Rämet, Mika; Korppi, Matti

    2017-01-01

    Bronchiolitis is a major cause of hospitalization among infants. Severe bronchiolitis is associated with later asthma, suggesting a common genetic predisposition. Genetic background of bronchiolitis is not well characterized. To identify polymorphisms associated with bronchiolitis, we conducted a genome-wide association study (GWAS) in which 5,300,000 single nucleotide polymorphisms (SNPs) were tested for association in a Finnish–Swedish population of 217 children hospitalized for bronchiolitis and 778 controls. The most promising SNPs (n = 77) were genotyped in a Dutch replication population of 416 cases and 432 controls. Finally, we used a set of 202 Finnish bronchiolitis cases to further investigate candidate SNPs. We did not detect genome-wide significant associations, but several suggestive association signals (p bronchiolitis. These preliminary findings require further validation in a larger sample size. PMID:28139761

  13. Genome-Wide Prediction of C. elegans Genetic Interactions

    OpenAIRE

    Zhong, Weiwei; Sternberg, Paul W.

    2006-01-01

    To obtain a global view of functional interactions among genes in a metazoan genome, we computationally integrated interactome data, gene expression data, phenotype data, and functional annotation data from three model organisms—Saccharomyces cerevisiae, Caenorhabditis elegans, and Drosophila melanogaster—and predicted genome-wide genetic interactions in C. elegans. The resulting genetic interaction network (consisting of 18,183 interactions) provides a framework for system-level understandin...

  14. Genome-wide association study of relative telomere length.

    Science.gov (United States)

    Prescott, Jennifer; Kraft, Peter; Chasman, Daniel I; Savage, Sharon A; Mirabello, Lisa; Berndt, Sonja I; Weissfeld, Joel L; Han, Jiali; Hayes, Richard B; Chanock, Stephen J; Hunter, David J; De Vivo, Immaculata

    2011-05-10

    Telomere function is essential to maintaining the physical integrity of linear chromosomes and healthy human aging. The probability of forming proper telomere structures depends on the length of the telomeric DNA tract. We attempted to identify common genetic variants associated with log relative telomere length using genome-wide genotyping data on 3,554 individuals from the Nurses' Health Study and the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial that took part in the National Cancer Institute Cancer Genetic Markers of Susceptibility initiative for breast and prostate cancer. After genotyping 64 independent SNPs selected for replication in additional Nurses' Health Study and Women's Genome Health Study participants, we did not identify genome-wide significant loci; however, we replicated the inverse association of log relative telomere length with the minor allele variant [C] of rs16847897 at the TERC locus (per allele β = -0.03, P = 0.003) identified by a previous genome-wide association study. We did not find evidence for an association with variants at the OBFC1 locus or other loci reported to be associated with telomere length. With this sample size we had >80% power to detect β estimates as small as ±0.10 for SNPs with minor allele frequencies of ≥0.15 at genome-wide significance. However, power is greatly reduced for β estimates smaller than ±0.10, such as those for variants at the TERC locus. In general, common genetic variants associated with telomere length homeostasis have been difficult to detect. Potential biological and technical issues are discussed.

  15. Genome-wide association study of relative telomere length.

    Directory of Open Access Journals (Sweden)

    Jennifer Prescott

    Full Text Available Telomere function is essential to maintaining the physical integrity of linear chromosomes and healthy human aging. The probability of forming proper telomere structures depends on the length of the telomeric DNA tract. We attempted to identify common genetic variants associated with log relative telomere length using genome-wide genotyping data on 3,554 individuals from the Nurses' Health Study and the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial that took part in the National Cancer Institute Cancer Genetic Markers of Susceptibility initiative for breast and prostate cancer. After genotyping 64 independent SNPs selected for replication in additional Nurses' Health Study and Women's Genome Health Study participants, we did not identify genome-wide significant loci; however, we replicated the inverse association of log relative telomere length with the minor allele variant [C] of rs16847897 at the TERC locus (per allele β = -0.03, P = 0.003 identified by a previous genome-wide association study. We did not find evidence for an association with variants at the OBFC1 locus or other loci reported to be associated with telomere length. With this sample size we had >80% power to detect β estimates as small as ±0.10 for SNPs with minor allele frequencies of ≥0.15 at genome-wide significance. However, power is greatly reduced for β estimates smaller than ±0.10, such as those for variants at the TERC locus. In general, common genetic variants associated with telomere length homeostasis have been difficult to detect. Potential biological and technical issues are discussed.

  16. Genome-wide association studies in pediatric endocrinology.

    Science.gov (United States)

    Dauber, Andrew; Hirschhorn, Joel N

    2011-01-01

    Genome-wide association (GWA) studies are a powerful tool for understanding the genetic underpinnings of human disease. In this article, we briefly review the role and findings of GWA studies in type 1 diabetes, stature, pubertal timing, obesity, and vitamin D deficiency. We then discuss the present and future implications of these findings with regards to disease prediction, uncovering basic biology, and the development of novel therapeutic agents.

  17. Identification of Transcribed Enhancers by Genome-Wide Chromatin Immunoprecipitation Sequencing.

    Science.gov (United States)

    Blinka, Steven; Reimer, Michael H; Pulakanti, Kirthi; Pinello, Luca; Yuan, Guo-Cheng; Rao, Sridhar

    2017-01-01

    Recent work has shown that RNA polymerase II-mediated transcription at distal cis-regulatory elements serves as a mark of highly active enhancers. Production of noncoding RNAs at enhancers, termed eRNAs, correlates with higher expression of genes that the enhancer interacts with; hence, eRNAs provide a new tool to model gene activity in normal and disease tissues. Moreover, this unique class of noncoding RNA has diverse roles in transcriptional regulation. Transcribed enhancers can be identified by a common signature of epigenetic marks by overlaying a series of genome-wide chromatin immunoprecipitation and RNA sequencing datasets. A computational approach to filter non-enhancer elements and other classes of noncoding RNAs is essential to not cloud downstream analysis. Here we present a protocol that combines wet and dry bench methods to accurately identify transcribed enhancers genome-wide as well as an experimental procedure to validate these datasets.

  18. The HapMap and genome-wide association studies in diagnosis and therapy.

    Science.gov (United States)

    Manolio, Teri A; Collins, Francis S

    2009-01-01

    The International HapMap Project produced a genome-wide database of human genetic variation for use in genetic association studies of common diseases. The initial output of these studies has been overwhelming, with over 150 risk loci identified in studies of more than 60 common diseases and traits. These associations have suggested previously unsuspected etiologic pathways for common diseases that will be of use in identifying new therapeutic targets and developing targeted interventions based on genetically defined risk. Here we examine the development and application of the HapMap to genome-wide association (GWA) studies; present and future technologies for GWA research; current major efforts in GWA studies; successes and limitations of the GWA approach in identifying polymorphisms related to complex diseases; data release and privacy polices; use of these findings by clinicians, the public, and academic physicians; and sources of ongoing authoritative information on this rapidly evolving field.

  19. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions.

    Directory of Open Access Journals (Sweden)

    Wei Li

    Full Text Available Copy-number variations (CNV, loss of heterozygosity (LOH, and uniparental disomy (UPD are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS, is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs. In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information.

  20. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions.

    Science.gov (United States)

    Wang, Yu; Li, Wei; Xia, Yingying; Wang, Chongzhi; Tang, Y Tom; Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian

    2014-01-01

    Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information.

  1. Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies.

    Directory of Open Access Journals (Sweden)

    Clive J Hoggart

    2008-07-01

    Full Text Available Testing one SNP at a time does not fully realise the potential of genome-wide association studies to identify multiple causal variants, which is a plausible scenario for many complex diseases. We show that simultaneous analysis of the entire set of SNPs from a genome-wide study to identify the subset that best predicts disease outcome is now feasible, thanks to developments in stochastic search methods. We used a Bayesian-inspired penalised maximum likelihood approach in which every SNP can be considered for additive, dominant, and recessive contributions to disease risk. Posterior mode estimates were obtained for regression coefficients that were each assigned a prior with a sharp mode at zero. A non-zero coefficient estimate was interpreted as corresponding to a significant SNP. We investigated two prior distributions and show that the normal-exponential-gamma prior leads to improved SNP selection in comparison with single-SNP tests. We also derived an explicit approximation for type-I error that avoids the need to use permutation procedures. As well as genome-wide analyses, our method is well-suited to fine mapping with very dense SNP sets obtained from re-sequencing and/or imputation. It can accommodate quantitative as well as case-control phenotypes, covariate adjustment, and can be extended to search for interactions. Here, we demonstrate the power and empirical type-I error of our approach using simulated case-control data sets of up to 500 K SNPs, a real genome-wide data set of 300 K SNPs, and a sequence-based dataset, each of which can be analysed in a few hours on a desktop workstation.

  2. Current approaches of genome-wide association studies

    Institute of Scientific and Technical Information of China (English)

    Jianfeng Xu

    2008-01-01

    @@ With rapid advances in high-throughput genotyping technology and the great increase in information available on SNPs throughout the genuine, genuine-wide association(GWA) studies have now become feasible.

  3. Genome-wide nucleosome map and cytosine methylation levels of an ancient human genome

    DEFF Research Database (Denmark)

    Pedersen, Jakob Skou; Valen, Eivind; Velazquez, Amhed Missael Vargas;

    2014-01-01

    data generated from hair shafts of a 4000-yr-old Paleo-Eskimo belonging to the Saqqaq culture, we generate the first ancient nucleosome map coupled with a genome-wide survey of cytosine methylation levels. The validity of both nucleosome map and methylation levels were confirmed by the recovery...... of the expected signals at promoter regions, exon/intron boundaries, and CTCF sites. The top-scoring nucleosome calls revealed distinct DNA positioning biases, attesting to nucleotide-level accuracy. The ancient methylation levels exhibited high conservation over time, clustering closely with modern hair tissues...

  4. Genome-wide scan in Portuguese Island families implicates multiple loci in bipolar disorder: fine mapping adds support on chromosomes 6 and 11.

    Science.gov (United States)

    Pato, Carlos N; Pato, M T; Kirby, A; Petryshen, T L; Medeiros, H; Carvalho, C; Macedo, A; Dourado, A; Coelho, I; Valente, J; Soares, M J; Ferreira, C P; Lei, M; Verner, A; Hudson, T J; Morley, C P; Kennedy, J L; Azevedo, M H; Daly, M J; Sklar, P

    2004-05-15

    As part of an extensive study in the Portuguese Island population of families with multiple patients suffering from bipolar disorder and schizophrenia, we performed an initial genome-wide scan of 16 extended families with bipolar disorder that identified three regions on chromosomes 2, 11, and 19 with genome-wide suggestive linkage and several other regions, including chromosome 6q, also approached suggestive levels of significance. Dick et al. [2003: Am J Hum Genet 73:107-114] recently reported in a study of 250 families with bipolar disorder a maxLOD score of 3.61 near marker D6S1021 on chromosome 6q. This study replicates this finding having detected a peak NPL = 2.02 (P = 0.025) with the same marker D6S1021(104.7 Mb). Higher-density mapping provided additional support for loci on chromosome 6 including marker D6S1021 with an NPL = 2.59 (P = 0.0068) and peaking at marker D6S1639 (125 Mb) with an NPL = 3.06 (P = 0.0019). A similar pattern was detected with higher-density mapping of chromosome 11 with an NPL = 3.15 (P = 0.0014) at marker D11S1883 (63.1 Mb). Simulations at the density of our fine mapping data indicate that less than 1 scan out of 10 would find two such scores genome-wide in the same scan by chance. Our findings provide additional support for a susceptibility locus for bipolar disorder on 6q, as well as, suggesting the importance of denser scans. Published 2004 Wiley-Liss, Inc.

  5. A genome-wide association study using international breeding-evaluation data identifies major loci affecting production traits and stature in the Brown Swiss cattle breed

    Directory of Open Access Journals (Sweden)

    Guo Jiazhong

    2012-10-01

    Full Text Available Abstract Background The genome-wide association study (GWAS is a useful approach to identify genes affecting economically important traits in dairy cattle. Here, we report the results from a GWAS based on high-density SNP genotype data and estimated breeding values for nine production, fertility, body conformation, udder health and workability traits in the Brown Swiss cattle population that is part of the international genomic evaluation program. Result GWASs were performed using 50 k SNP chip data and deregressed estimated breeding values (DEBVs for nine traits from between 2061 and 5043 bulls that were part of the international genomic evaluation program coordinated by Interbull Center. The nine traits were milk yield (MY, fat yield (FY, protein yield (PY, lactating cow’s ability to recycle after calving (CRC, angularity (ANG, body depth (BDE, stature (STA, milk somatic cell score (SCS and milk speed (MSP. Analyses were performed using a linear mixed model correcting for population confounding. A total of 74 SNPs were detected to be genome-wide significantly associated with one or several of the nine analyzed traits. The strongest signal was identified on chromosome 25 for milk production traits, stature and body depth. Other signals were on chromosome 11 for angularity, chromosome 24 for somatic cell score, and chromosome 6 for milking speed. Some signals overlapped with earlier reported QTL for similar traits in other cattle populations and were located close to interesting candidate genes worthy of further investigations. Conclusions Our study shows that international genetic evaluation data is a useful resource for identifying genetic factors influencing complex traits in livestock. Several genome wide significant association signals could be identified in the Brown Swiss population, including a major signal on BTA25. Our findings report several associations and plausible candidate genes that deserve further exploration in other

  6. Genome-wide association study identifies multiple loci associated with bladder cancer risk

    Science.gov (United States)

    Figueroa, Jonine D.; Ye, Yuanqing; Siddiq, Afshan; Garcia-Closas, Montserrat; Chatterjee, Nilanjan; Prokunina-Olsson, Ludmila; Cortessis, Victoria K.; Kooperberg, Charles; Cussenot, Olivier; Benhamou, Simone; Prescott, Jennifer; Porru, Stefano; Dinney, Colin P.; Malats, Núria; Baris, Dalsu; Purdue, Mark; Jacobs, Eric J.; Albanes, Demetrius; Wang, Zhaoming; Deng, Xiang; Chung, Charles C.; Tang, Wei; Bas Bueno-de-Mesquita, H.; Trichopoulos, Dimitrios; Ljungberg, Börje; Clavel-Chapelon, Françoise; Weiderpass, Elisabete; Krogh, Vittorio; Dorronsoro, Miren; Travis, Ruth; Tjønneland, Anne; Brenan, Paul; Chang-Claude, Jenny; Riboli, Elio; Conti, David; Gago-Dominguez, Manuela; Stern, Mariana C.; Pike, Malcolm C.; Van Den Berg, David; Yuan, Jian-Min; Hohensee, Chancellor; Rodabough, Rebecca; Cancel-Tassin, Geraldine; Roupret, Morgan; Comperat, Eva; Chen, Constance; De Vivo, Immaculata; Giovannucci, Edward; Hunter, David J.; Kraft, Peter; Lindstrom, Sara; Carta, Angela; Pavanello, Sofia; Arici, Cecilia; Mastrangelo, Giuseppe; Kamat, Ashish M.; Lerner, Seth P.; Barton Grossman, H.; Lin, Jie; Gu, Jian; Pu, Xia; Hutchinson, Amy; Burdette, Laurie; Wheeler, William; Kogevinas, Manolis; Tardón, Adonina; Serra, Consol; Carrato, Alfredo; García-Closas, Reina; Lloreta, Josep; Schwenn, Molly; Karagas, Margaret R.; Johnson, Alison; Schned, Alan; Armenti, Karla R.; Hosain, G.M.; Andriole, Gerald; Grubb, Robert; Black, Amanda; Ryan Diver, W.; Gapstur, Susan M.; Weinstein, Stephanie J.; Virtamo, Jarmo; Haiman, Chris A.; Landi, Maria T.; Caporaso, Neil; Fraumeni, Joseph F.; Vineis, Paolo; Wu, Xifeng; Silverman, Debra T.; Chanock, Stephen; Rothman, Nathaniel

    2014-01-01

    Candidate gene and genome-wide association studies (GWAS) have identified 11 independent susceptibility loci associated with bladder cancer risk. To discover additional risk variants, we conducted a new GWAS of 2422 bladder cancer cases and 5751 controls, followed by a meta-analysis with two independently published bladder cancer GWAS, resulting in a combined analysis of 6911 cases and 11 814 controls of European descent. TaqMan genotyping of 13 promising single nucleotide polymorphisms with P < 1 × 10−5 was pursued in a follow-up set of 801 cases and 1307 controls. Two new loci achieved genome-wide statistical significance: rs10936599 on 3q26.2 (P = 4.53 × 10−9) and rs907611 on 11p15.5 (P = 4.11 × 10−8). Two notable loci were also identified that approached genome-wide statistical significance: rs6104690 on 20p12.2 (P = 7.13 × 10−7) and rs4510656 on 6p22.3 (P = 6.98 × 10−7); these require further studies for confirmation. In conclusion, our study has identified new susceptibility alleles for bladder cancer risk that require fine-mapping and laboratory investigation, which could further understanding into the biological underpinnings of bladder carcinogenesis. PMID:24163127

  7. Genome-wide association study for wool production traits in a Chinese Merino sheep population.

    Directory of Open Access Journals (Sweden)

    Zhipeng Wang

    Full Text Available Genome-wide association studies (GWAS provide a powerful approach for identifying quantitative trait loci without prior knowledge of location or function. To identify loci associated with wool production traits, we performed a genome-wide association study on a total of 765 Chinese Merino sheep (JunKen type genotyped with 50 K single nucleotide polymorphisms (SNPs. In the present study, five wool production traits were examined: fiber diameter, fiber diameter coefficient of variation, fineness dispersion, staple length and crimp. We detected 28 genome-wide significant SNPs for fiber diameter, fiber diameter coefficient of variation, fineness dispersion, and crimp trait in the Chinese Merino sheep. About 43% of the significant SNP markers were located within known or predicted genes, including YWHAZ, KRTCAP3, TSPEAR, PIK3R4, KIF16B, PTPN3, GPRC5A, DDX47, TCF9, TPTE2, EPHA5 and NBEA genes. Our results not only confirm the results of previous reports, but also provide a suite of novel SNP markers and candidate genes associated with wool traits. Our findings will be useful for exploring the genetic control of wool traits in sheep.

  8. Genome-wide association study for wool production traits in a Chinese Merino sheep population.

    Science.gov (United States)

    Wang, Zhipeng; Zhang, Hui; Yang, Hua; Wang, Shouzhi; Rong, Enguang; Pei, Wenyu; Li, Hui; Wang, Ning

    2014-01-01

    Genome-wide association studies (GWAS) provide a powerful approach for identifying quantitative trait loci without prior knowledge of location or function. To identify loci associated with wool production traits, we performed a genome-wide association study on a total of 765 Chinese Merino sheep (JunKen type) genotyped with 50 K single nucleotide polymorphisms (SNPs). In the present study, five wool production traits were examined: fiber diameter, fiber diameter coefficient of variation, fineness dispersion, staple length and crimp. We detected 28 genome-wide significant SNPs for fiber diameter, fiber diameter coefficient of variation, fineness dispersion, and crimp trait in the Chinese Merino sheep. About 43% of the significant SNP markers were located within known or predicted genes, including YWHAZ, KRTCAP3, TSPEAR, PIK3R4, KIF16B, PTPN3, GPRC5A, DDX47, TCF9, TPTE2, EPHA5 and NBEA genes. Our results not only confirm the results of previous reports, but also provide a suite of novel SNP markers and candidate genes associated with wool traits. Our findings will be useful for exploring the genetic control of wool traits in sheep.

  9. Gene expression levels as endophenotypes in genome-wide association studies of Alzheimer disease

    Science.gov (United States)

    Zou, F.; Carrasquillo, M. M.; Pankratz, V. S.; Belbin, O.; Morgan, K.; Allen, M.; Wilcox, S. L.; Ma, L.; Walker, L. P.; Kouri, N.; Burgess, J. D.; Younkin, L. H.; Younkin, Samuel G.; Younkin, C. S.; Bisceglio, G. D.; Crook, J. E.; Dickson, D. W.; Petersen, R. C.; Graff-Radford, N.; Younkin, Steven G.; Ertekin-Taner, N.

    2010-01-01

    Background: Late-onset Alzheimer disease (LOAD) is a common disorder with a substantial genetic component. We postulate that many disease susceptibility variants act by altering gene expression levels. Methods: We measured messenger RNA (mRNA) expression levels of 12 LOAD candidate genes in the cerebella of 200 subjects with LOAD. Using the genotypes from our LOAD genome-wide association study for the cis-single nucleotide polymorphisms (SNPs) (n = 619) of these 12 LOAD candidate genes, we tested for associations with expression levels as endophenotypes. The strongest expression cis-SNP was tested for AD association in 7 independent case-control series (2,280 AD and 2,396 controls). Results: We identified 3 SNPs that associated significantly with IDE (insulin degrading enzyme) expression levels. A single copy of the minor allele for each significant SNP was associated with ∼twofold higher IDE expression levels. The most significant SNP, rs7910977, is 4.2 kb beyond the 3′ end of IDE. The association observed with this SNP was significant even at the genome-wide level (p = 2.7 × 10−8). Furthermore, the minor allele of rs7910977 associated significantly (p = 0.0046) with reduced LOAD risk (OR = 0.81 with a 95% CI of 0.70-0.94), as expected biologically from its association with elevated IDE expression. Conclusions: These results provide strong evidence that IDE is a late-onset Alzheimer disease (LOAD) gene with variants that modify risk of LOAD by influencing IDE expression. They also suggest that the use of expression levels as endophenotypes in genome-wide association studies may provide a powerful approach for the identification of disease susceptibility alleles. GLOSSARY AD = Alzheimer disease; CI = confidence interval; GWAS = genome-wide association study; LOAD = late-onset Alzheimer disease; mRNA = messenger RNA; OR = odds ratio; SNP = single nucleotide polymorphism. PMID:20142614

  10. Implementing meta-analysis from genome-wide association studies for pork quality traits.

    Science.gov (United States)

    Bernal Rubio, Y L; Gualdrón Duarte, J L; Bates, R O; Ernst, C W; Nonneman, D; Rohrer, G A; King, D A; Shackelford, S D; Wheeler, T L; Cantet, R J C; Steibel, J P

    2015-12-01

    Pork quality plays an important role in the meat processing industry. Thus, different methodologies have been implemented to elucidate the genetic architecture of traits affecting meat quality. One of the most common and widely used approaches is to perform genome-wide association (GWA) studies. However, a limitation of many GWA in animal breeding is the limited power due to small sample sizes in animal populations. One alternative is to implement a meta-analysis of GWA (MA-GWA) combining results from independent association studies. The objective of this study was to identify significant genomic regions associated with meat quality traits by performing MA-GWA for 8 different traits in 3 independent pig populations. Results from MA-GWA were used to search for genes possibly associated with the set of evaluated traits. Data from 3 pig data sets (U.S. Meat Animal Research Center, commercial, and Michigan State University Pig Resource Population) were used. A MA was implemented by combining -scores derived for each SNP in every population and then weighting them using the inverse of estimated variance of SNP effects. A search for annotated genes retrieved genes previously reported as candidates for shear force (calpain-1 catalytic subunit [] and calpastatin []), as well as for ultimate pH, purge loss, and cook loss (protein kinase, AMP-activated, γ 3 noncatalytic subunit []). In addition, novel candidate genes were identified for intramuscular fat and cook loss (acyl-CoA synthetase family member 3 mitochondrial []) and for the objective measure of muscle redness, CIE a* (glycogen synthase 1, muscle [] and ferritin, light polypeptide []). Thus, implementation of MA-GWA allowed integration of results for economically relevant traits and identified novel genes to be tested as candidates for meat quality traits in pig populations.

  11. Genome-wide Association Study of Obsessive-Compulsive Disorder

    Science.gov (United States)

    Stewart, S Evelyn; Yu, Dongmei; Scharf, Jeremiah M; Neale, Benjamin M; Fagerness, Jesen A; Mathews, Carol A; Arnold, Paul D; Evans, Patrick D; Gamazon, Eric R; Osiecki, Lisa; McGrath, Lauren; Haddad, Stephen; Crane, Jacquelyn; Hezel, Dianne; Illman, Cornelia; Mayerfeld, Catherine; Konkashbaev, Anuar; Liu, Chunyu; Pluzhnikov, Anna; Tikhomirov, Anna; Edlund, Christopher K; Rauch, Scott L; Moessner, Rainald; Falkai, Peter; Maier, Wolfgang; Ruhrmann, Stephan; Grabe, Hans-Jörgen; Lennertz, Leonard; Wagner, Michael; Bellodi, Laura; Cavallini, Maria Cristina; Richter, Margaret A; Cook, Edwin H; Kennedy, James L; Rosenberg, David; Stein, Dan J; Hemmings, Sian MJ; Lochner, Christine; Azzam, Amin; Chavira, Denise A; Fournier, Eduardo; Garrido, Helena; Sheppard, Brooke; Umaña, Paul; Murphy, Dennis L; Wendland, Jens R; Veenstra-VanderWeele, Jeremy; Denys, Damiaan; Blom, Rianne; Deforce, Dieter; Van Nieuwerburgh, Filip; Westenberg, Herman GM; Walitza, Susanne; Egberts, Karin; Renner, Tobias; Miguel, Euripedes Constantino; Cappi, Carolina; Hounie, Ana G; Conceição do Rosário, Maria; Sampaio, Aline S; Vallada, Homero; Nicolini, Humberto; Lanzagorta, Nuria; Camarena, Beatriz; Delorme, Richard; Leboyer, Marion; Pato, Carlos N; Pato, Michele T; Voyiaziakis, Emanuel; Heutink, Peter; Cath, Danielle C; Posthuma, Danielle; Smit, Jan H; Samuels, Jack; Bienvenu, O Joseph; Cullen, Bernadette; Fyer, Abby J; Grados, Marco A; Greenberg, Benjamin D; McCracken, James T; Riddle, Mark A; Wang, Ying; Coric, Vladimir; Leckman, James F; Bloch, Michael; Pittenger, Christopher; Eapen, Valsamma; Black, Donald W; Ophoff, Roel A; Strengman, Eric; Cusi, Daniele; Turiel, Maurizio; Frau, Francesca; Macciardi, Fabio; Gibbs, J Raphael; Cookson, Mark R; Singleton, Andrew; Hardy, John; Crenshaw, Andrew T; Parkin, Melissa A; Mirel, Daniel B; Conti, David V; Purcell, Shaun; Nestadt, Gerald; Hanna, Gregory L; Jenike, Michael A; Knowles, James A; Cox, Nancy; Pauls, David L

    2014-01-01

    Obsessive-compulsive disorder (OCD) is a common, debilitating neuropsychiatric illness with complex genetic etiology. The International OCD Foundation Genetics Collaborative (IOCDF-GC) is a multi-national collaboration established to discover the genetic variation predisposing to OCD. A set of individuals affected with DSM-IV OCD, a subset of their parents, and unselected controls, were genotyped with several different Illumina SNP microarrays. After extensive data cleaning, 1,465 cases, 5,557 ancestry-matched controls and 400 complete trios remained, with a common set of 469,410 autosomal and 9,657 X-chromosome SNPs. Ancestry-stratified case-control association analyses were conducted for three genetically-defined subpopulations and combined in two meta-analyses, with and without the trio-based analysis. In the case-control analysis, the lowest two p-values were located within DLGAP1 (p=2.49×10-6 and p=3.44×10-6), a member of the neuronal postsynaptic density complex. In the trio analysis, rs6131295, near BTBD3, exceeded the genome-wide significance threshold with a p-value=3.84 × 10-8. However, when trios were meta-analyzed with the combined case-control samples, the p-value for this variant was 3.62×10-5, losing genome-wide significance. Although no SNPs were identified to be associated with OCD at a genome-wide significant level in the combined trio-case-control sample, a significant enrichment of methylation-QTLs (p<0.001) and frontal lobe eQTLs (p=0.001) was observed within the top-ranked SNPs (p<0.01) from the trio-case-control analysis, suggesting these top signals may have a broad role in gene expression in the brain, and possibly in the etiology of OCD. PMID:22889921

  12. Genome-wide landscapes of human local adaptation in Asia.

    Directory of Open Access Journals (Sweden)

    Wei Qian

    Full Text Available Genetic studies of human local adaptation have been facilitated greatly by recent advances in high-throughput genotyping and sequencing technologies. However, few studies have investigated local adaptation in Asian populations on a genome-wide scale and with a high geographic resolution. In this study, taking advantage of the dense population coverage in Southeast Asia, which is the part of the world least studied in term of natural selection, we depicted genome-wide landscapes of local adaptations in 63 Asian populations representing the majority of linguistic and ethnic groups in Asia. Using genome-wide data analysis, we discovered many genes showing signs of local adaptation or natural selection. Notable examples, such as FOXQ1, MAST2, and CDH4, were found to play a role in hair follicle development and human cancer, signal transduction, and tumor repression, respectively. These showed strong indications of natural selection in Philippine Negritos, a group of aboriginal hunter-gatherers living in the Philippines. MTTP, which has associations with metabolic syndrome, body mass index, and insulin regulation, showed a strong signature of selection in Southeast Asians, including Indonesians. Functional annotation analysis revealed that genes and genetic variants underlying natural selections were generally enriched in the functional category of alternative splicing. Specifically, many genes showing significant difference with respect to allele frequency between northern and southern Asian populations were found to be associated with human height and growth and various immune pathways. In summary, this study contributes to the overall understanding of human local adaptation in Asia and has identified both known and novel signatures of natural selection in the human genome.

  13. A signal-to-noise approach to score normalization

    NARCIS (Netherlands)

    Arampatzis, A.; Kamps, J.; Cheung, D.; Song, I.-Y.; Chu, W.; Hu, X.; Lin, J.; Li, J.; Peng, Z.

    2009-01-01

    Score normalization is indispensable in distributed retrieval and fusion or meta-search where merging of result-lists is required. Distributional approaches to score normalization with reference to relevance, such as binary mixture models like the normal-exponential, suffer from lack of universality

  14. Genome-wide prediction of C. elegans genetic interactions.

    Science.gov (United States)

    Zhong, Weiwei; Sternberg, Paul W

    2006-03-10

    To obtain a global view of functional interactions among genes in a metazoan genome, we computationally integrated interactome data, gene expression data, phenotype data, and functional annotation data from three model organisms-Saccharomyces cerevisiae, Caenorhabditis elegans, and Drosophila melanogaster-and predicted genome-wide genetic interactions in C. elegans. The resulting genetic interaction network (consisting of 18,183 interactions) provides a framework for system-level understanding of gene functions. We experimentally tested the predicted interactions for two human disease-related genes and identified 14 new modifiers.

  15. Genome-wide association studies and contribution to cardiovascular physiology.

    Science.gov (United States)

    Munroe, Patricia B; Tinker, Andrew

    2015-09-01

    The study of family pedigrees with rare monogenic cardiovascular disorders has revealed new molecular players in physiological processes. Genome-wide association studies of complex traits with a heritable component may afford a similar and potentially intellectually richer opportunity. In this review we focus on the interpretation of genetic associations and the issue of causality in relation to known and potentially new physiology. We mainly discuss cardiometabolic traits as it reflects our personal interests, but the issues pertain broadly in many other disciplines. We also describe some of the resources that are now available that may expedite follow up of genetic association signals into observations on causal mechanisms and pathophysiology.

  16. [Genome-wide association study for adolescent idiopathic scoliosis].

    Science.gov (United States)

    Ogura, Yoji; Kou, Ikuyo; Scoliosis, Japan; Matsumoto, Morio; Watanabe, Kota; Ikegawa, Shiro

    2016-04-01

    Adolescent idiopathic scoliosis(AIS)is a polygenic disease. Genome-wide association studies(GWASs)have been performed for a lot of polygenic diseases. For AIS, we conducted GWAS and identified the first AIS locus near LBX1. After the discovery, we have extended our study by increasing the numbers of subjects and SNPs. In total, our Japanese GWAS has identified four susceptibility genes. GWASs for AIS have also been performed in the USA and China, which identified one and three susceptibility genes, respectively. Here we review GWASs in Japan and abroad and functional analysis to clarify the pathomechanism of AIS.

  17. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.

    Science.gov (United States)

    Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G

    2000-12-15

    The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.

  18. A comparison in association and linkage genome-wide scans for alcoholism susceptibility genes using single-nucleotide polymorphisms.

    Science.gov (United States)

    Chiu, Yen-Feng; Liu, Su-Yun; Tsai, Ya-Yu

    2005-12-30

    We conducted genome-wide linkage scans using both microsatellite and single-nucleotide polymorphism (SNP) markers. Regions showing the strongest evidence of linkage to alcoholism susceptibility genes were identified. Haplotype analyses using a sliding-window approach for SNPs in these regions were performed. In addition, we performed a genome-wide association scan using SNP data. SNPs in these regions with evidence of association (P alcoholism (the most significant SNP had a p-value of 0.030) as those identified from association genomic screening (the most significant SNP had a p-value of 2.0 x 10(-8)).

  19. A genome-wide association study of female sexual dysfunction.

    Directory of Open Access Journals (Sweden)

    Andrea Burri

    Full Text Available BACKGROUND: Female sexual dysfunction (FSD is an important but controversial problem with serious negative impact on women's quality of life. Data from twin studies have shown a genetic contribution to the development and maintenance of FSD. METHODOLOGY/PRINCIPAL FINDINGS: We performed a genome-wide association study (GWAS on 2.5 million single-nucleotide polymorphisms (SNPs in 1,104 female twins (25-81 years of age in a population-based register and phenotypic data on lifelong sexual functioning. Although none reached conventional genome-wide level of significance (10 × -8, we found strongly suggestive associations with the phenotypic dimension of arousal (rs13202860, P = 1.2 × 10(-7; rs1876525, P = 1.2 × 10(-7; and rs13209281 P = 8.3 × 10(-7 on chromosome 6, around 500 kb upstream of the locus HTR1E (5-hydroxytryptamine receptor 1E locus, related to the serotonin brain pathways. We could not replicate previously reported candidate SNPs associated with FSD in the DRD4, 5HT2A and IL-1B loci. CONCLUSIONS/SIGNIFICANCE: We report the first GWAS of FSD symptoms in humans. This has pointed to several "risk alleles" and the implication of the serotonin and GABA pathways. Ultimately, understanding key mechanisms via this research may lead to new FSD treatments and inform clinical practice and developments in psychiatric nosology.

  20. A genome-wide association study of aging.

    Science.gov (United States)

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W; Garcia, Melissa E; Kaplan, Robert C; Kumari, Meena; Lunetta, Kathryn L; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J; Biffar, Reiner; Buchman, Aron S; Boerwinkle, Eric; Couper, David; De Jager, Philip L; Evans, Denis A; Harris, Tamara B; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J; Lohman, Kurt K; Lutsey, Pamela L; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M; Reiman, Eric M; Rotter, Jerome I; Seshadri, Sudha; Shardell, Michelle D; Smith, Albert V; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M Carola; Bandinelli, Stefania; Baumeister, Sebastian E; Bennett, David A; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M; Newman, Anne B; Tiemeier, Henning; Franceschini, Nora

    2011-11-01

    Human longevity and healthy aging show moderate heritability (20%-50%). We conducted a meta-analysis of genome-wide association studies from 9 studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for 2 outcomes: (1) all-cause mortality, and (2) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10(-8)). We found 14 independent SNPs that predicted risk of death, and 8 SNPs that predicted event-free survival (p < 10(-5)). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer's disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity.

  1. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    Directory of Open Access Journals (Sweden)

    Varun Warrier

    Full Text Available Asperger Syndrome (AS is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC, which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448 were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448 lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.

  2. Genome-wide association studies of obesity and metabolic syndrome.

    Science.gov (United States)

    Fall, Tove; Ingelsson, Erik

    2014-01-25

    Until just a few years ago, the genetic determinants of obesity and metabolic syndrome were largely unknown, with the exception of a few forms of monogenic extreme obesity. Since genome-wide association studies (GWAS) became available, large advances have been made. The first single nucleotide polymorphism robustly associated with increased body mass index (BMI) was in 2007 mapped to a gene with for the time unknown function. This gene, now known as fat mass and obesity associated (FTO) has been repeatedly replicated in several ethnicities and is affecting obesity by regulating appetite. Since the first report from a GWAS of obesity, an increasing number of markers have been shown to be associated with BMI, other measures of obesity or fat distribution and metabolic syndrome. This systematic review of obesity GWAS will summarize genome-wide significant findings for obesity and metabolic syndrome and briefly give a few suggestions of what is to be expected in the next few years. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  3. Genome-wide mapping of DNA strand breaks.

    Directory of Open Access Journals (Sweden)

    Frédéric Leduc

    Full Text Available Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed "damaged DNA immunoprecipitation" (dDIP, uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage.

  4. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    Science.gov (United States)

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.

  5. Genome-wide patterns of Arabidopsis gene expression in nature.

    Directory of Open Access Journals (Sweden)

    Christina L Richards

    Full Text Available Organisms in the wild are subject to multiple, fluctuating environmental factors, and it is in complex natural environments that genetic regulatory networks actually function and evolve. We assessed genome-wide gene expression patterns in the wild in two natural accessions of the model plant Arabidopsis thaliana and examined the nature of transcriptional variation throughout its life cycle and gene expression correlations with natural environmental fluctuations. We grew plants in a natural field environment and measured genome-wide time-series gene expression from the plant shoot every three days, spanning the seedling to reproductive stages. We find that 15,352 genes were expressed in the A. thaliana shoot in the field, and accession and flowering status (vegetative versus flowering were strong components of transcriptional variation in this plant. We identified between ∼110 and 190 time-varying gene expression clusters in the field, many of which were significantly overrepresented by genes regulated by abiotic and biotic environmental stresses. The two main principal components of vegetative shoot gene expression (PC(veg correlate to temperature and precipitation occurrence in the field. The largest PC(veg axes included thermoregulatory genes while the second major PC(veg was associated with precipitation and contained drought-responsive genes. By exposing A. thaliana to natural environments in an open field, we provide a framework for further understanding the genetic networks that are deployed in natural environments, and we connect plant molecular genetics in the laboratory to plant organismal ecology in the wild.

  6. Genome-wide patterns of nucleotide polymorphism in domesticated rice.

    Directory of Open Access Journals (Sweden)

    Ana L Caicedo

    2007-09-01

    Full Text Available Domesticated Asian rice (Oryza sativa is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models to explain contemporary patterns of polymorphisms in rice, including a (i selectively neutral population bottleneck model, (ii bottleneck plus migration model, (iii multiple selective sweeps model, and (iv bottleneck plus selective sweeps model. We find that a simple bottleneck model, which has been the dominant demographic model for domesticated species, cannot explain the derived nucleotide polymorphism site frequency spectrum in rice. Instead, a bottleneck model that incorporates selective sweeps, or a more complex demographic model that includes subdivision and gene flow, are more plausible explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection results only in a local signature of variation.

  7. Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.

    Directory of Open Access Journals (Sweden)

    Jing Chen

    Full Text Available Identifying transcription factors (TF involved in producing a genome-wide transcriptional profile is an essential step in building mechanistic model that can explain observed gene expression data. We developed a statistical framework for constructing genome-wide signatures of TF activity, and for using such signatures in the analysis of gene expression data produced by complex transcriptional regulatory programs. Our framework integrates ChIP-seq data and appropriately matched gene expression profiles to identify True REGulatory (TREG TF-gene interactions. It provides genome-wide quantification of the likelihood of regulatory TF-gene interaction that can be used to either identify regulated genes, or as genome-wide signature of TF activity. To effectively use ChIP-seq data, we introduce a novel statistical model that integrates information from all binding "peaks" within 2 Mb window around a gene's transcription start site (TSS, and provides gene-level binding scores and probabilities of regulatory interaction. In the second step we integrate these binding scores and regulatory probabilities with gene expression data to assess the likelihood of True REGulatory (TREG TF-gene interactions. We demonstrate the advantages of TREG framework in identifying genes regulated by two TFs with widely different distribution of functional binding events (ERα and E2f1. We also show that TREG signatures of TF activity vastly improve our ability to detect involvement of ERα in producing complex diseases-related transcriptional profiles. Through a large study of disease-related transcriptional signatures and transcriptional signatures of drug activity, we demonstrate that increase in statistical power associated with the use of TREG signatures makes the crucial difference in identifying key targets for treatment, and drugs to use for treatment. All methods are implemented in an open-source R package treg. The package also contains all data used in the analysis

  8. Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture

    NARCIS (Netherlands)

    S.I. Berndt (Sonja); S. Gustafsson (Stefan); R. Mägi (Reedik); A. Ganna (Andrea); E. Wheeler (Eleanor); M.F. Feitosa (Mary Furlan); A.E. Justice (Anne); K.L. Monda (Keri); D.C. Croteau-Chonka (Damien); F.R. Day (Felix); T. Esko (Tõnu); M. Fall (Magnus); T. Ferreira (Teresa); D. Gentilini (Davide); A.U. Jackson (Anne); J. Luan; J.C. Randall (Joshua); S. Vedantam (Sailaja); C.J. Willer (Cristen); T.W. Winkler (Thomas); A.R. Wood (Andrew); T. Workalemahu (Tsegaselassie); Y.-J. Hu (Yi-Juan); S.H. Lee (Sang Hong); L. Liang (Liming); D.Y. Lin (Dan); J. Min (Josine); B.M. Neale (Benjamin); G. Thorleifsson (Gudmar); J. Yang (Jian); E. Albrecht (Eva); N. Amin (Najaf); J.L. Bragg-Gresham (Jennifer L.); G. Cadby (Gemma); M. den Heijer (Martin); N. Eklund (Niina); K. Fischer (Krista); A. Goel (Anuj); J.J. Hottenga (Jouke Jan); J.E. Huffman (Jennifer); I. Jarick (Ivonne); A. Johansson (Åsa); T. Johnson (Toby); S. Kanoni (Stavroula); M.E. Kleber (Marcus); I.R. König (Inke); K. Kristiansson (Kati); Z. Kutalik (Zoltán); C. Lamina (Claudia); C. Lecoeur (Cécile); G. Li (Guo); M. Mangino (Massimo); W.L. McArdle (Wendy); M.C. Medina-Gomez (Carolina); M. Müller-Nurasyid (Martina); J.S. Ngwa; I.M. Nolte (Ilja); L. Paternoster (Lavinia); S. Pechlivanis (Sonali); M. Perola (Markus); M.J. Peters (Marjolein); M. Preuss (Michael); L.M. Rose (Lynda); J. Shi (Jianxin); D. Shungin (Dmitry); G.D. Smith; R.J. Strawbridge (Rona); I. Surakka (Ida); A. Teumer (Alexander); M.D. Trip (Mieke); J.P. Tyrer (Jonathan); J.V. van Vliet-Ostaptchouk (Jana); L. Vandenput (Liesbeth); L. Waite (Lindsay); J.H. Zhao (Jing); D. Absher (Devin); F.W. Asselbergs (Folkert); M. Atalay (Mustafa); A.P. Attwood (Antony); A.J. Balmforth (Anthony); D.C.G. Basart (Dick); J.P. Beilby (John); L.L. Bonnycastle (Lori); P. Brambilla (Paolo); M. Bruinenberg (M.); H. Campbell (Harry); D.I. Chasman (Daniel); P.S. Chines (Peter); F.S. Collins (Francis); J. Connell (John); W. O Cookson (William); U. de Faire (Ulf); F. de Vegt (Femmie); M. Dei (Mariano); M. Dimitriou (Maria); T. Edkins (Ted); K. Estrada Gil (Karol); D.M. Evans (David); M. Farrall (Martin); F. Ferrario (Franco); J. Ferrières (Jean); L. Franke (Lude); F. Frau (Francesca); P.V. Gejman (Pablo); H. Grallert (Harald); H. Grönberg (Henrik); V. Gudnason (Vilmundur); A. Hall (Anne); A.S. Hall (Alistair); A.L. Hartikainen; C. Hayward (Caroline); N.L. Heard-Costa (Nancy); A.C. Heath (Andrew); J. Hebebrand (Johannes); G. Homuth (Georg); F.B. Hu (Frank); S.E. Hunt (Sarah); E. Hyppönen (Elina); C. Iribarren (Carlos); K.B. Jacobs (Kevin); J.-O. Jansson (John-Olov); A. Jula (Antti); M. Kähönen (Mika); S. Kathiresan (Sekar); F. Kee (F.); K-T. Khaw (Kay-Tee); M. Kivimaki (Mika); W. Koenig (Wolfgang); A. Kraja (Aldi); M. Kumari (Meena); K. Kuulasmaa (Kari); J. Kuusisto (Johanna); J. Laitinen (Jaana); T.A. Lakka (Timo); C. Langenberg (Claudia); L.J. Launer (Lenore); L. Lind (Lars); J. Lindstrom (Jaana); J. Liu (Jianjun); A. Liuzzi (Antonio); M.L. Lokki; M. Lorentzon (Mattias); P.A. Madden (Pamela); P.K. Magnusson (Patrik); P. Manunta (Paolo); D. Marek (Diana); W. März (Winfried); I.M. Leach (Irene Mateo); B. McKnight (Barbara); S.E. Medland (Sarah Elizabeth); E. Mihailov (Evelin); L. Milani (Lili); G.W. Montgomery (Grant); V. Mooser (Vincent); T.W. Mühleisen (Thomas); P. Munroe (Patricia); A.W. Musk (Arthur); N. Narisu (Narisu); G. Navis (Gerjan); G. Nicholson (Ggeorge); C. Nohr (Christian); K. Ong (Ken); B.A. Oostra (Ben); C.N.A. Palmer (Colin); A. Palotie (Aarno); J. Peden (John); N. Pedersen; A. Peters (Annette); O. Polasek (Ozren); A. Pouta (Anneli); P.P. Pramstaller (Peter Paul); I. Prokopenko (Inga); C. Pütter (Carolin); A. Radhakrishnan (Aparna); O. Raitakari (Olli); A. Rendon (Augusto); F. Rivadeneira Ramirez (Fernando); I. Rudan (Igor); T. Saaristo (Timo); J.G. Sambrook (Jennifer); A.R. Sanders (Alan); S. Sanna (Serena); J. Saramies (Jouko); S. Schipf (Sabine); S. Schreiber (Stefan); H. Schunkert (Heribert); S.-Y. Shin; S. Signorini (Stefano); J. Sinisalo (Juha); B. Skrobek (Boris); N. Soranzo (Nicole); A. Stancáková (Alena); K. Stark (Klaus); J. Stephens (Jonathan); K. Stirrups (Kathy); R.P. Stolk (Ronald); M. Stumvoll (Michael); A.J. Swift (Amy); E.V. Theodoraki (Eirini); B. Thorand (Barbara); D.-A. Tregouet (David-Alexandre); E. Tremoli (Elena); M.M. van der Klauw (Melanie); J.B.J. van Meurs (Joyce); S.H.H.M. Vermeulen (Sita); J. Viikari (Jorma); J. Virtamo (Jarmo); V. Vitart (Veronique); G. Waeber (Gérard); Z. Wang (Zhaoming); E. Widen (Elisabeth); S.H. Wild (Sarah); G.A.H.M. Willemsen (Gonneke); B. Winkelmann; J.C.M. Witteman (Jacqueline); B.H.R. Wolffenbuttel (Bruce); A. Wong (Andrew); A.F. Wright (Alan); M.C. Zillikens (Carola); P. Amouyel (Philippe); B.O. Boehm (Bernhard); E.A. Boerwinkle (Eric); D.I. Boomsma (Dorret); M. Caulfield (Mark); S.J. Chanock (Stephen); L.A. Cupples (Adrienne); D. Cusi (Daniele); G.V. Dedoussis (George); J. Erdmann (Jeanette); J.G. Eriksson (Johan); P.W. Franks (Paul); P. Froguel (Philippe); C. Gieger (Christian); U. Gyllensten (Ulf); A. Hamsten (Anders); T.B. Harris (Tamara); C. Hengstenberg (Christian); A.A. Hicks (Andrew); A. Hingorani (Aroon); A. Hinney (Anke); A. Hofman (Albert); G.K. Hovingh (Kees); K. Hveem (Kristian); T. Illig (Thomas); M.-R. Jarvelin (Marjo-Riitta); K.-H. Jöckel (Karl-Heinz); S. Keinanen-Kiukaanniemi (Sirkka); L.A.L.M. Kiemeney (Bart); D. Kuh (Diana); M. Laakso (Markku); T. Lehtimäki (Terho); D.F. Levinson (Douglas); N.G. Martin (Nicholas); A. Metspalu (Andres); A.D. Morris (Andrew); M.S. Nieminen (Markku); I. Njølstad (Inger); C. Ohlsson (Claes); A.J. Oldehinkel (Albertine); W.H. Ouwehand (Willem); C. Palmer (Cameron); B.W.J.H. Penninx (Brenda); C. Power (Christopher); M.A. Province (Mike); B.M. Psaty (Bruce); L. Qi (Lu); R. Rauramaa (Rainer); P.M. Ridker (Paul); S. Ripatti (Samuli); V. Salomaa (Veikko); N.J. Samani (Nilesh); H. Snieder (Harold); H.G. Sorensen; T.D. Spector (Timothy); J-A. Zwart (John-Anker); A. Tönjes (Anke); J. Tuomilehto (Jaakko); A.G. Uitterlinden (André); M. Uusitupa (Matti); P. van der Harst (Pim); P. Vollenweider (Peter); H. Wallaschofski (Henri); N.J. Wareham (Nick); H. Watkins (Hugh); H.E. Wichmann (Heinz Erich); J.F. Wilson (James F); G.R. Abecasis (Gonçalo); T.L. Assimes (Themistocles); I. Barroso (Inês); M. Boehnke (Michael); I.B. Borecki (Ingrid); P. Deloukas (Panagiotis); C. Fox (Craig); T.M. Frayling (Timothy); L. Groop (Leif); T. Haritunian (Talin); I.M. Heid (Iris); D. Hunter (David); R.C. Kaplan (Robert); F. Karpe (Fredrik); M.F. Moffatt (Miriam); K.L. Mohlke (Karen); J.R. O´Connell; Y. Pawitan (Yudi); E.E. Schadt (Eric); D. Schlessinger (David); V. Steinthorsdottir (Valgerdur); D.P. Strachan (David); U. Thorsteinsdottir (Unnur); C.M. van Duijn (Cock); P.M. Visscher (Peter); A.M. Di Blasio (Anna Maria); J.N. Hirschhorn (Joel); C.M. Lindgren (Cecilia); A.D. Morris (Andrew); D. Meyre (David); A. Scherag (Andre); M.I. McCarthy (Mark); E.K. Speliotes (Elizabeth); K.E. North (Kari); R.J.F. Loos (Ruth); E. Ingelsson (Erik)

    2013-01-01

    textabstractApproaches exploiting trait distribution extremes may be used to identify loci associated with common traits, but it is unknown whether these loci are generalizable to the broader population. In a genome-wide search for loci associated with the upper versus the lower 5th percentiles of b

  9. Loci associated with ischaemic stroke and its subtypes (SiGN) : A genome-wide association study

    NARCIS (Netherlands)

    Pulit, SL; Algra, A; de Bakker, Paul I W

    2016-01-01

    Background: The discovery of disease-associated loci through genome-wide association studies (GWAS) is the leading genetic approach to the identification of novel biological pathways underlying diseases in humans. Until recently, GWAS in ischaemic stroke have been limited by small sample sizes and

  10. Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture

    DEFF Research Database (Denmark)

    Berndt, Sonja I; Gustafsson, Stefan; Mägi, Reedik

    2013-01-01

    Approaches exploiting trait distribution extremes may be used to identify loci associated with common traits, but it is unknown whether these loci are generalizable to the broader population. In a genome-wide search for loci associated with the upper versus the lower 5th percentiles of body mass ...

  11. Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture

    NARCIS (Netherlands)

    Berndt, Sonja I; Gustafsson, Stefan; Mägi, Reedik; Ganna, Andrea; Wheeler, Eleanor; Feitosa, Mary F; Justice, Anne E; Monda, Keri L; Croteau-Chonka, Damien C; Day, Felix R; Esko, Tõnu; Fall, Tove; Ferreira, Teresa; Gentilini, Davide; Jackson, Anne U; Luan, Jian'an; Randall, Joshua C; Vedantam, Sailaja; Willer, Cristen J; Winkler, Thomas W; Wood, Andrew R; Workalemahu, Tsegaselassie; Hu, Yi-Juan; Lee, Sang Hong; Liang, Liming; Lin, Dan-Yu; Min, Josine L; Neale, Benjamin M; Thorleifsson, Gudmar; Yang, Jian; Albrecht, Eva; Amin, Najaf; Bragg-Gresham, Jennifer L; Cadby, Gemma; den Heijer, Martin; Eklund, Niina; Fischer, Krista; Goel, Anuj; Hottenga, Jouke-Jan; Huffman, Jennifer E; Jarick, Ivonne; Johansson, Åsa; Johnson, Toby; Kanoni, Stavroula; Kleber, Marcus E; König, Inke R; Kristiansson, Kati; Kutalik, Zoltán; Lamina, Claudia; Lecoeur, Cecile; Li, Guo; Mangino, Massimo; McArdle, Wendy L; Medina-Gomez, Carolina; Müller-Nurasyid, Martina; Ngwa, Julius S; Nolte, Ilja M; Paternoster, Lavinia; Pechlivanis, Sonali; Perola, Markus; Peters, Marjolein J; Preuss, Michael; Rose, Lynda M; Shi, Jianxin; Shungin, Dmitry; Smith, Albert Vernon; Strawbridge, Rona J; Surakka, Ida; Teumer, Alexander; Trip, Mieke D; Tyrer, Jonathan; Van Vliet-Ostaptchouk, Jana V; Vandenput, Liesbeth; Waite, Lindsay L; Zhao, Jing Hua; Absher, Devin; Asselbergs, Folkert W; Atalay, Mustafa; Attwood, Antony P; Balmforth, Anthony J; Basart, Hanneke; Beilby, John; Bonnycastle, Lori L; Brambilla, Paolo; Bruinenberg, Marcel; Campbell, Harry; Chasman, Daniel I; Chines, Peter S; Collins, Francis S; Connell, John M; Cookson, William O; de Faire, Ulf; de Vegt, Femmie; Dei, Mariano; Dimitriou, Maria; Edkins, Sarah; Estrada, Karol; Evans, David M; Farrall, Martin; Ferrario, Marco M; Ferrières, Jean; Franke, Lude; Frau, Francesca; Gejman, Pablo V; Grallert, Harald; Grönberg, Henrik; Gudnason, Vilmundur; Hall, Alistair S; Hall, Per; Hartikainen, Anna-Liisa; Hayward, Caroline; Heard-Costa, Nancy L; Heath, Andrew C; Hebebrand, Johannes; Homuth, Georg; Hu, Frank B; Hunt, Sarah E; Hyppönen, Elina; Iribarren, Carlos; Jacobs, Kevin B; Jansson, John-Olov; Jula, Antti; Kähönen, Mika; Kathiresan, Sekar; Kee, Frank; Khaw, Kay-Tee; Kivimäki, Mika; Koenig, Wolfgang; Kraja, Aldi T; Kumari, Meena; Kuulasmaa, Kari; Kuusisto, Johanna; Laitinen, Jaana H; Lakka, Timo A; Langenberg, Claudia; Launer, Lenore J; Lind, Lars; Lindström, Jaana; Liu, Jianjun; Liuzzi, Antonio; Lokki, Marja-Liisa; Lorentzon, Mattias; Madden, Pamela A; Magnusson, Patrik K; Manunta, Paolo; Marek, Diana; März, Winfried; Mateo Leach, Irene; McKnight, Barbara; Medland, Sarah E; Mihailov, Evelin; Milani, Lili; Montgomery, Grant W; Mooser, Vincent; Mühleisen, Thomas W; Munroe, Patricia B; Musk, Arthur W; Narisu, Narisu; Navis, Gerjan; Nicholson, George; Nohr, Ellen A; Ong, Ken K; Oostra, Ben A; Palmer, Colin N A; Palotie, Aarno; Peden, John F; Pedersen, Nancy; Peters, Annette; Polasek, Ozren; Pouta, Anneli; Pramstaller, Peter P; Prokopenko, Inga; Pütter, Carolin; Radhakrishnan, Aparna; Raitakari, Olli; Rendon, Augusto; Rivadeneira, Fernando; Rudan, Igor; Saaristo, Timo E; Sambrook, Jennifer G; Sanders, Alan R; Sanna, Serena; Saramies, Jouko; Schipf, Sabine; Schreiber, Stefan; Schunkert, Heribert; Shin, So-Youn; Signorini, Stefano; Sinisalo, Juha; Skrobek, Boris; Soranzo, Nicole; Stančáková, Alena; Stark, Klaus; Stephens, Jonathan C; Stirrups, Kathleen; Stolk, Ronald P; Stumvoll, Michael; Swift, Amy J; Theodoraki, Eirini V; Thorand, Barbara; Tregouet, David-Alexandre; Tremoli, Elena; Van der Klauw, Melanie M; van Meurs, Joyce B J; Vermeulen, Sita H; Viikari, Jorma; Virtamo, Jarmo; Vitart, Veronique; Waeber, Gérard; Wang, Zhaoming; Widén, Elisabeth; Wild, Sarah H; Willemsen, Gonneke; Winkelmann, Bernhard R; Witteman, Jacqueline C M; Wolffenbuttel, Bruce H R; Wong, Andrew; Wright, Alan F; Zillikens, M Carola; Amouyel, Philippe; Boehm, Bernhard O; Boerwinkle, Eric; Boomsma, Dorret I; Caulfield, Mark J; Chanock, Stephen J; Cupples, L Adrienne; Cusi, Daniele; Dedoussis, George V; Erdmann, Jeanette; Eriksson, Johan G; Franks, Paul W; Froguel, Philippe; Gieger, Christian; Gyllensten, Ulf; Hamsten, Anders; Harris, Tamara B; Hengstenberg, Christian; Hicks, Andrew A; Hingorani, Aroon; Hinney, Anke; Hofman, Albert; Hovingh, Kees G; Hveem, Kristian; Illig, Thomas; Jarvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Keinanen-Kiukaanniemi, Sirkka M; Kiemeney, Lambertus A; Kuh, Diana; Laakso, Markku; Lehtimäki, Terho; Levinson, Douglas F; Martin, Nicholas G; Metspalu, Andres; Morris, Andrew D; Nieminen, Markku S; Njølstad, Inger; Ohlsson, Claes; Oldehinkel, Albertine J; Ouwehand, Willem H; Palmer, Lyle J; Penninx, Brenda; Power, Chris; Province, Michael A; Psaty, Bruce M; Qi, Lu; Rauramaa, Rainer; Ridker, Paul M; Ripatti, Samuli; Salomaa, Veikko; Samani, Nilesh J; Snieder, Harold; Sørensen, Thorkild I A; Spector, Timothy D; Stefansson, Kari; Tönjes, Anke; Tuomilehto, Jaakko; Uitterlinden, André G; Uusitupa, Matti; van der Harst, Pim; Vollenweider, Peter; Wallaschofski, Henri; Wareham, Nicholas J; Watkins, Hugh; Wichmann, H-Erich; Wilson, James F; Abecasis, Goncalo R; Assimes, Themistocles L; Barroso, Inês; Boehnke, Michael; Borecki, Ingrid B; Deloukas, Panos; Fox, Caroline S; Frayling, Timothy; Groop, Leif C; Haritunian, Talin; Heid, Iris M; Hunter, David; Kaplan, Robert C; Karpe, Fredrik; Moffatt, Miriam F; Mohlke, Karen L; O'Connell, Jeffrey R; Pawitan, Yudi; Schadt, Eric E; Schlessinger, David; Steinthorsdottir, Valgerdur; Strachan, David P; Thorsteinsdottir, Unnur; van Duijn, Cornelia M; Visscher, Peter M; Di Blasio, Anna Maria; Hirschhorn, Joel N; Lindgren, Cecilia M; Morris, Andrew P; Meyre, David; Scherag, André; McCarthy, Mark I; Speliotes, Elizabeth K; North, Kari E; Loos, Ruth J F; Ingelsson, Erik

    2013-01-01

    Approaches exploiting trait distribution extremes may be used to identify loci associated with common traits, but it is unknown whether these loci are generalizable to the broader population. In a genome-wide search for loci associated with the upper versus the lower 5th percentiles of body mass ind

  12. Cross-Cancer Genome-Wide Analysis of Lung, Ovary, Breast, Prostate, and Colorectal Cancer Reveals Novel Pleiotropic Associations

    NARCIS (Netherlands)

    Fehringer, Gordon; Kraft, Peter; Pharoah, Paul D.; Eeles, Rosalind A.; Chatterjee, Nilanjan; Schumacher, Fredrick R.; Schildkraut, Joellen M.; Lindstrom, Sara; Brennan, Paul; Bickeboller, Heike; Houlston, Richard S.; Landi, Maria Teresa; Caporaso, Neil; Risch, Angela; Al Olama, Ali Amin; Berndt, Sonja I.; Giovannucci, Edward L.; Gronberg, Henrik; Kote-Jarai, Zsofia; Ma, Jing; Muir, Kenneth; Stampfer, Meir J.; Stevens, Victoria L.; Wiklund, Fredrik; Willett, Walter C.; Goode, Ellen L.; Permuth, Jennifer B.; Risch, Harvey A.; Reid, Brett M.; Bezieau, Stephane; Brenner, Hermann; Chan, Andrew T.; Chang-Claude, Jenny; Hudson, Thomas J.; Kocarnik, Jonathan K.; Newcomb, Polly A.; Schoen, Robert E.; Slattery, Martha L.; White, Emily; Adank, Muriel A.; Ahsan, Habibul; Aittomaki, Kristiina; Baglietto, Laura; Blomquist, Carl; Canzian, Federico; Czene, Kamila; dos-Santos-Silva, Isabel; Eliassen, A. Heather; Figueroa, Jonine D.; Flesch-Janys, Dieter; Fletcher, Olivia; Garcia-Closas, Montserrat; Gaudet, Mia M.; Johnson, Nichola; Hall, Per; Hazra, Aditi; Hein, Rebecca; Hofman, Albert; Hopper, John L.; Irwanto, Astrid; Johansson, Mattias; Kaaks, Rudolf; Kibriya, Muhammad G.; Lichtner, Peter; Liu, Jianjun; Lund, Eiliv; Makalic, Enes; Meindl, Alfons; Muller-Myhsok, Bertram; Muranen, Taru A.; Nevanlinna, Heli; Peeters, Petra H.; Peto, Julian; Prentice, Ross L.; Rahman, Nazneen; Sanchez, Maria Jose; Schmidt, Daniel F.; Schmutzler, Rita K.; Southey, Melissa C.; Tamimi, Rulla; Travis, Ruth C.; Turnbull, Clare; Uitterlinden, Andre G.; Wang, Zhaoming; Whittemore, Alice S.; Yang, Xiaohong R.; Zheng, Wei; Buchanan, Daniel D.; Casey, Graham; Conti, David V.; Edlund, Christopher K.; Gallinger, Steven; Haile, Robert W.; Jenkins, Mark; Le Marchand, Loic; Li, Li; Lindor, Noralene M.; Schmit, Stephanie L.; Thibodeau, Stephen N.; Woods, Michael O.; Rafnar, Thorunn; Gudmundsson, Julius; Stacey, Simon N.; Stefansson, Kari; Sulem, Patrick; Chen, Y. Ann; Tyrer, Jonathan P.; Christiani, David C.; Wei, Yongyue; Shen, Hongbing; Hu, Zhibin; Shu, Xiao-Ou; Shiraishi, Kouya; Takahashi, Atsushi; Bosse, Yohan; Obeidat, Ma'en; Nickle, David; Timens, Wim; Freedman, Matthew L.; Li, Qiyuan; Seminara, Daniela; Chanock, Stephen J.; Gong, Jian; Peters, Ulrike; Gruber, Stephen B.; Amos, Christopher I.; Sellers, Thomas A.; Easton, Douglas F.; Hunter, David J.; Haiman, Christopher A.; Henderson, Brian E.; Hung, Rayjean J.

    2016-01-01

    Identifying genetic variants with pleiotropic associations can uncover common pathways influencing multiple cancers. We took a two-stage approach to conduct genome-wide association studies for lung, ovary, breast, prostate, and colorectal cancer from the GAME-ON/GECCO Network (61,851 cases, 61,820 c

  13. Cross-cancer genome-wide analysis of lung, ovary, breast, prostate, and colorectal cancer reveals novel pleiotropic associations

    NARCIS (Netherlands)

    Fehringer, G. (Gordon); P. Kraft (Peter); P.D.P. Pharoah (Paul); R. Eeles (Rosalind); Chatterjee, N. (Nilanjan); F.R. Schumacher (Fredrick R); J.M. Schildkraut (Joellen); S. Lindstrom (Stephen); P. Brennan (Paul); H. Bickeböller (Heike); R. Houlston (Richard); M.T. Landi (Maria Teresa); N.E. Caporaso (Neil); Risch, A. (Angela); A.A. Al Olama (Ali Amin); S.I. Berndt (Sonja); Giovannucci, E.L. (Edward L.); H. Grönberg (Henrik); Z. Kote-Jarai; Ma, J. (Jing); K.R. Muir (K.); M.J. Stampfer (Meir J.); Stevens, V.L. (Victoria L.); F. Wiklund (Fredrik); W.C. Willett (Walter C.); E.L. Goode (Ellen); Permuth, J.B. (Jennifer B.); H. Risch (Harvey); Reid, B.M. (Brett M.); Bezieau, S. (Stephane); H. Brenner (Hermann); Chan, A.T. (Andrew T.); J. Chang-Claude (Jenny); T.J. Hudson (Thomas); Kocarnik, J.K. (Jonathan K.); P. Newcomb (Polly); Schoen, R.E. (Robert E.); Slattery, M.L. (Martha L.); White, E. (Emily); M.A. Adank (Muriel); H. Ahsan (Habibul); K. Aittomäki (Kristiina); Baglietto, L. (Laura); Blomquist, C. (Carl); F. Canzian (Federico); K. Czene (Kamila); I. dos Santos Silva (Isabel); Eliassen, A.H. (A. Heather); J.D. Figueroa (Jonine); D. Flesch-Janys (Dieter); O. Fletcher (Olivia); M. García-Closas (Montserrat); M.M. Gaudet (Mia); Johnson, N. (Nichola); P. Hall (Per); A. Hazra (Aditi); R. Hein (Rebecca); Hofman, A. (Albert); J.L. Hopper (John); A. Irwanto (Astrid); M. Johansson (Mattias); R. Kaaks (Rudolf); M.G. Kibriya (Muhammad); P. Lichtner (Peter); J. Liu (Jianjun); E. Lund (Eiliv); Makalic, E. (Enes); A. Meindl (Alfons); B. Müller-Myhsok (B.); Muranen, T.A. (Taru A.); H. Nevanlinna (Heli); P.H.M. Peeters; J. Peto (Julian); R. Prentice (Ross); N. Rahman (Nazneen); M.-J. Sanchez (Maria-Jose); D.F. Schmidt (Daniel); R.K. Schmutzler (Rita); M.C. Southey (Melissa); Tamimi, R. (Rulla); S.P.L. Travis (Simon); C. Turnbull (Clare); Uitterlinden, A.G. (Andre G.); Z. Wang (Zhaoming); A.S. Whittemore (Alice); X.R. Yang (Xiaohong); W. Zheng (Wei); D. Buchanan (Daniel); G. Casey (Graham); G. Conti (Giario); C.K. Edlund (Christopher); S. Gallinger (Steve); R. Haile (Robert); M. Jenkins (Mark); Marchand, L. (Loïcle); Li, L. (Li); N.M. Lindor (Noralane); Schmit, S.L. (Stephanie L.); S.N. Thibodeau (Stephen); M.O. Woods (Michael); T. Rafnar (Thorunn); J. Gudmundsson (Julius); S.N. Stacey (Simon); Stefansson, K. (Kari); P. Sulem (Patrick); Chen, Y.A. (Y. Ann); J.P. Tyrer (Jonathan); Christiani, D.C. (David C.); Wei, Y. (Yongyue); H. Shen (Hongbing); Z. Hu (Zhibin); X.-O. Shu (Xiao-Ou); Shiraishi, K. (Kouya); A. Takahashi (Atsushi); Y. Bossé (Yohan); M. Obeidat; D.C. Nickle (David C.); W. Timens (Wim); M. Freedman (Matthew); Li, Q. (Qiyuan); D. Seminara (Daniela); S.J. Chanock (Stephen); Gong, J. (Jian); U. Peters (Ulrike); S.B. Gruber (Stephen); Amos, C.I. (Christopher I.); T.A. Sellers (Thomas A.); D.F. Easton (Douglas F.); D. Hunter (David); C.A. Haiman (Christopher A.); B.E. Henderson (Brian); R.J. Hung (Rayjean)

    2016-01-01

    textabstractIdentifying genetic variants with pleiotropic associations can uncover common pathways influencing multiple cancers. We took a two-stage approach to conduct genome-wide association studies for lung, ovary, breast, prostate, and colorectal cancer from the GAME-ON/GECCO Network (61,851 cas

  14. Cross-cancer genome-wide analysis of lung, ovary, breast, prostate, and colorectal cancer reveals novel pleiotropic associations

    NARCIS (Netherlands)

    Fehringer, Gordon; Kraft, Peter; Pharoah, Paul D.; Eeles, Rosalind A.; Chatterjee, Nilanjan; Schumacher, Fredrick R.; Schildkraut, Joellen M.; Lindström, Sara; Brennan, Paul; Bickeböller, Heike; Houlston, Richard S.; Landi, Maria Teresa; Caporaso, Neil; Risch, Angela; Al Olama, Ali Amin; Berndt, Sonja I.; Giovannucci, Edward L.; Grönberg, Henrik; Kote-Jarai, Zsofia; Ma, Jing; Muir, Kenneth; Stampfer, Meir J.; Stevens, Victoria L.; Wiklund, Fredrik; Willett, Walter C.; Goode, Ellen L.; Permuth, Jennifer B.; Risch, Harvey A.; Reid, Brett M.; Bezieau, Stephane; Brenner, Hermann; Chan, Andrew T.; Chang-Claude, Jenny; Hudson, Thomas J.; Kocarnik, Jonathan K.; Newcomb, Polly A.; Schoen, Robert E.; Slattery, Martha L.; White, Emily; Adank, Muriel A.; Ahsan, Habibul; Aittomäki, Kristiina; Baglietto, Laura; Blomquist, Carl; Canzian, Federico; Czene, Kamila; Dos-Santos-silva, Isabel; Eliassen, A. Heather; Figueroa, Jonine D.; Flesch-Janys, Dieter; Fletcher, Olivia; Garcia-Closas, Montserrat; Gaudet, Mia M.; Johnson, Nichola; Hall, Per; Hazra, Aditi; Hein, Rebecca; Hofman, Albert; Hopper, John L.; Irwanto, Astrid; Johansson, Mattias; Kaaks, Rudolf; Kibriya, Muhammad G.; Lichtner, Peter; Liu, Jianjun; Lund, Eiliv; Makalic, Enes; Meindl, Alfons; Müller-Myhsok, Bertram; Muranen, Taru A.; Nevanlinna, Heli; Peeters, Petra H.; Peto, Julian; Prentice, Ross L.; Rahman, Nazneen; Sanchez, Maria Jose; Schmidt, Daniel F.; Schmutzler, Rita K.; Southey, Melissa C.; Tamimi, Rulla; Travis, Ruth C.; Turnbull, Clare; Uitterlinden, Andre G.; Wang, Zhaoming; Whittemore, Alice S.; Yang, Xiaohong R.; Zheng, Wei; Buchanan, Daniel D.; Casey, Graham; Conti, David V.; Edlund, Christopher K.; Gallinger, Steven; Haile, Robert W.; Jenkins, Mark; Marchand, Loïcle; Li, Li; Lindor, Noralene M.; Schmit, Stephanie L.; Thibodeau, Stephen N.; Woods, Michael O.; Rafnar, Thorunn; Gudmundsson, Julius; Stacey, Simon N.; Stefansson, Kari; Sulem, Patrick; Chen, Y. Ann; Tyrer, Jonathan P.; Christiani, David C.; Wei, Yongyue; Shen, Hongbing; Hu, Zhibin; Shu, Xiao Ou; Shiraishi, Kouya; Takahashi, Atsushi; Bossé, Yohan; Obeidat, Ma'en; Nickle, David; Timens, Wim; Freedman, Matthew L.; Li, Qiyuan; Seminara, Daniela; Chanock, Stephen J.; Gong, Jian; Peters, Ulrike; Gruber, Stephen B.; Amos, Christopher I.; Sellers, Thomas A.; Easton, Douglas F.; Hunter, David J.; Haiman, Christopher A.; Henderson, Brian E.; Hung, Rayjean J.

    2016-01-01

    Identifying genetic variants with pleiotropic associations can uncover common pathways influencing multiple cancers. We took a two-stage approach to conduct genome-wide association studies for lung, ovary, breast, prostate, and colorectal cancer from the GAME-ON/GECCO Network (61,851 cases, 61,820 c

  15. Cross-cancer genome-wide analysis of lung, ovary, breast, prostate and colorectal cancer reveals novel pleiotropic associations

    NARCIS (Netherlands)

    Fehringer, Gordon; Kraft, Peter; Pharoah, Paul D P; Eeles, Rosalind A; Chatterjee, Nilanjan; Schumacher, Fredrick R; Schildkraut, Joellen M; Lindstrom, Sara; Brennan, Paul; Bickeböller, Heike; Houlston, Richard S; Landi, Maria Teresa; Caporaso, Neil E; Risch, Angela; Amin Al Olama, Ali; Berndt, Sonja I; Giovannucci, Edward; Gronberg, Henrik; Kote-Jarai, Zsofia; Ma, Jing; Muir, Kenneth; Stampfer, Meir J; Stevens, Victoria L; Wiklund, Fredrik; Willett, Walter C; Goode, Ellen L; Permuth, Jennifer B; Risch, Harvey A; Reid, Brett M; Bezieau, Stéphane; Brenner, Hermann; Chan, Andrew T; Chang-Claude, Jenny; Hudson, Thomas J; Kocarnik, Jonathan; Newcomb, Polly A; Schoen, Robert E; Slattery, Martha L; White, Emily; Adank, Muriel A; Ahsan, Habibul; Aittomaki, Kristiina; Baglietto, Laura; Blomquist, Carl; Canzian, Federico; Czene, Kamila; Dos Santos Silva, Isabel; Eliassen, A Heather; Figueroa, Jonine D; Flesch-Janys, Dieter; Fletcher, Olivia; Garcia-Closas, Montserrat; Gaudet, Mia M; Johnson, Nichola; Hall, Per; Hazra, Aditi; Hein, Rebecca; Hofman, Albert; Hopper, John L; Irwanto, Astrid; Johansson, Mattias; Kaaks, Rudolf; Kibriya, Muhammad G; Lichtner, Peter; Liu, Jian Jun; Lund, Eiliv; Makalic, Enes; Meindl, Alfons; Müller-Myhsok, Bertram; Muranen, Taru A; Nevanlinna, Heli; Peeters, Petra H; Peto, Julian; Prentice, Ross L; Rahman, Nazneen; Sanchez, Maria-Jose; Schmidt, Daniel F; Schmutzler, Rita K; Southey, Melissa C; Tamimi, Rulla M; Travis, Ruth C; Turnbull, Clare; Uitterlinden, Andre G; Wang, Zhaoming; Whittemore, Alice S; Yang, Xiaohong R; Zheng, Wei; Rafnar, Thorunn; Gudmundsson, Julius; Stacey, Simon N; Stefansson, Kari; Sulem, Patrick; Chen, Y Ann; Tyrer, Jonathan P; Christiani, David C; Wei, Yongyue; Shen, Hongbing; Hu, Zhibin; Shu, Xiao-Ou; Shiraishi, Kouya; Takahashi, Atsushi; Bossé, Yohan; Obeidat, Ma'en; Nickle, David; Timens, Wim; Freedman, Matthew L; Li, Qiyuan; Seminara, Daniela; Chanock, Stephen J; Gong, Jian; Peters, Ulrike; Gruber, Stephen B; Amos, Christopher I; Sellers, Thomas A; Easton, Douglas F; Hunter, David J; Haiman, Christopher A; Henderson, Brian E; Hung, Rayjean J

    2016-01-01

    Identifying genetic variants with pleiotropic associations can uncover common pathways influencing multiple cancers. We took a two-staged approach to conduct genome-wide association studies for lung, ovary, breast, prostate and colorectal cancer from the GAME-ON/GECCO Network (61,851 cases, 61,820 c

  16. Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture

    NARCIS (Netherlands)

    Berndt, Sonja I; Gustafsson, Stefan; Mägi, Reedik; Ganna, Andrea; Wheeler, Eleanor; Feitosa, Mary F; Justice, Anne E; Monda, Keri L; Croteau-Chonka, Damien C; Day, Felix R; Esko, Tõnu; Fall, Tove; Ferreira, Teresa; Gentilini, Davide; Jackson, Anne U; Luan, Jian'an; Randall, Joshua C; Vedantam, Sailaja; Willer, Cristen J; Winkler, Thomas W; Wood, Andrew R; Workalemahu, Tsegaselassie; Hu, Yi-Juan; Lee, Sang Hong; Liang, Liming; Lin, Dan-Yu; Min, Josine L; Neale, Benjamin M; Thorleifsson, Gudmar; Yang, Jian; Albrecht, Eva; Amin, Najaf; Bragg-Gresham, Jennifer L; Cadby, Gemma; den Heijer, Martin; Eklund, Niina; Fischer, Krista; Goel, Anuj; Hottenga, Jouke-Jan; Huffman, Jennifer E; Jarick, Ivonne; Johansson, Åsa; Johnson, Toby; Kanoni, Stavroula; Kleber, Marcus E; König, Inke R; Kristiansson, Kati; Kutalik, Zoltán; Lamina, Claudia; Lecoeur, Cecile; Li, Guo; Mangino, Massimo; McArdle, Wendy L; Medina-Gomez, Carolina; Müller-Nurasyid, Martina; Ngwa, Julius S; Nolte, Ilja M; Paternoster, Lavinia; Pechlivanis, Sonali; Perola, Markus; Peters, Marjolein J; Preuss, Michael; Rose, Lynda M; Shi, Jianxin; Shungin, Dmitry; Smith, Albert Vernon; Strawbridge, Rona J; Surakka, Ida; Teumer, Alexander; Trip, Mieke D; Tyrer, Jonathan; Van Vliet-Ostaptchouk, Jana V; Vandenput, Liesbeth; Waite, Lindsay L; Zhao, Jing Hua; Absher, Devin; Asselbergs, Folkert W; Atalay, Mustafa; Attwood, Antony P; Balmforth, Anthony J; Basart, Hanneke; Beilby, John; Bonnycastle, Lori L; Brambilla, Paolo; Bruinenberg, Marcel; Campbell, Harry; Chasman, Daniel I; Chines, Peter S; Collins, Francis S; Connell, John M; Cookson, William O; de Faire, Ulf; de Vegt, Femmie; Dei, Mariano; Dimitriou, Maria; Edkins, Sarah; Estrada, Karol; Evans, David M; Farrall, Martin; Ferrario, Marco M; Ferrières, Jean; Franke, Lude; Frau, Francesca; Gejman, Pablo V; Grallert, Harald; Grönberg, Henrik; Gudnason, Vilmundur; Hall, Alistair S; Hall, Per; Hartikainen, Anna-Liisa; Hayward, Caroline; Heard-Costa, Nancy L; Heath, Andrew C; Hebebrand, Johannes; Homuth, Georg; Hu, Frank B; Hunt, Sarah E; Hyppönen, Elina; Iribarren, Carlos; Jacobs, Kevin B; Jansson, John-Olov; Jula, Antti; Kähönen, Mika; Kathiresan, Sekar; Kee, Frank; Khaw, Kay-Tee; Kivimäki, Mika; Koenig, Wolfgang; Kraja, Aldi T; Kumari, Meena; Kuulasmaa, Kari; Kuusisto, Johanna; Laitinen, Jaana H; Lakka, Timo A; Langenberg, Claudia; Launer, Lenore J; Lind, Lars; Lindström, Jaana; Liu, Jianjun; Liuzzi, Antonio; Lokki, Marja-Liisa; Lorentzon, Mattias; Madden, Pamela A; Magnusson, Patrik K; Manunta, Paolo; Marek, Diana; März, Winfried; Mateo Leach, Irene; McKnight, Barbara; Medland, Sarah E; Mihailov, Evelin; Milani, Lili; Montgomery, Grant W; Mooser, Vincent; Mühleisen, Thomas W; Munroe, Patricia B; Musk, Arthur W; Narisu, Narisu; Navis, Gerjan; Nicholson, George; Nohr, Ellen A; Ong, Ken K; Oostra, Ben A; Palmer, Colin N A; Palotie, Aarno; Peden, John F; Pedersen, Nancy; Peters, Annette; Polasek, Ozren; Pouta, Anneli; Pramstaller, Peter P; Prokopenko, Inga; Pütter, Carolin; Radhakrishnan, Aparna; Raitakari, Olli; Rendon, Augusto; Rivadeneira, Fernando; Rudan, Igor; Saaristo, Timo E; Sambrook, Jennifer G; Sanders, Alan R; Sanna, Serena; Saramies, Jouko; Schipf, Sabine; Schreiber, Stefan; Schunkert, Heribert; Shin, So-Youn; Signorini, Stefano; Sinisalo, Juha; Skrobek, Boris; Soranzo, Nicole; Stančáková, Alena; Stark, Klaus; Stephens, Jonathan C; Stirrups, Kathleen; Stolk, Ronald P; Stumvoll, Michael; Swift, Amy J; Theodoraki, Eirini V; Thorand, Barbara; Tregouet, David-Alexandre; Tremoli, Elena; Van der Klauw, Melanie M; van Meurs, Joyce B J; Vermeulen, Sita H; Viikari, Jorma; Virtamo, Jarmo; Vitart, Veronique; Waeber, Gérard; Wang, Zhaoming; Widén, Elisabeth; Wild, Sarah H; Willemsen, Gonneke; Winkelmann, Bernhard R; Witteman, Jacqueline C M; Wolffenbuttel, Bruce H R; Wong, Andrew; Wright, Alan F; Zillikens, M Carola; Amouyel, Philippe; Boehm, Bernhard O; Boerwinkle, Eric; Boomsma, Dorret I; Caulfield, Mark J; Chanock, Stephen J; Cupples, L Adrienne; Cusi, Daniele; Dedoussis, George V; Erdmann, Jeanette; Eriksson, Johan G; Franks, Paul W; Froguel, Philippe; Gieger, Christian; Gyllensten, Ulf; Hamsten, Anders; Harris, Tamara B; Hengstenberg, Christian; Hicks, Andrew A; Hingorani, Aroon; Hinney, Anke; Hofman, Albert; Hovingh, Kees G; Hveem, Kristian; Illig, Thomas; Jarvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Keinanen-Kiukaanniemi, Sirkka M; Kiemeney, Lambertus A; Kuh, Diana; Laakso, Markku; Lehtimäki, Terho; Levinson, Douglas F; Martin, Nicholas G; Metspalu, Andres; Morris, Andrew D; Nieminen, Markku S; Njølstad, Inger; Ohlsson, Claes; Oldehinkel, Albertine J; Ouwehand, Willem H; Palmer, Lyle J; Penninx, Brenda; Power, Chris; Province, Michael A; Psaty, Bruce M; Qi, Lu; Rauramaa, Rainer; Ridker, Paul M; Ripatti, Samuli; Salomaa, Veikko; Samani, Nilesh J; Snieder, Harold; Sørensen, Thorkild I A; Spector, Timothy D; Stefansson, Kari; Tönjes, Anke; Tuomilehto, Jaakko; Uitterlinden, André G; Uusitupa, Matti; van der Harst, Pim; Vollenweider, Peter; Wallaschofski, Henri; Wareham, Nicholas J; Watkins, Hugh; Wichmann, H-Erich; Wilson, James F; Abecasis, Goncalo R; Assimes, Themistocles L; Barroso, Inês; Boehnke, Michael; Borecki, Ingrid B; Deloukas, Panos; Fox, Caroline S; Frayling, Timothy; Groop, Leif C; Haritunian, Talin; Heid, Iris M; Hunter, David; Kaplan, Robert C; Karpe, Fredrik; Moffatt, Miriam F; Mohlke, Karen L; O'Connell, Jeffrey R; Pawitan, Yudi; Schadt, Eric E; Schlessinger, David; Steinthorsdottir, Valgerdur; Strachan, David P; Thorsteinsdottir, Unnur; van Duijn, Cornelia M; Visscher, Peter M; Di Blasio, Anna Maria; Hirschhorn, Joel N; Lindgren, Cecilia M; Morris, Andrew P; Meyre, David; Scherag, André; McCarthy, Mark I; Speliotes, Elizabeth K; North, Kari E; Loos, Ruth J F; Ingelsson, Erik

    Approaches exploiting trait distribution extremes may be used to identify loci associated with common traits, but it is unknown whether these loci are generalizable to the broader population. In a genome-wide search for loci associated with the upper versus the lower 5th percentiles of body mass

  17. Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture

    NARCIS (Netherlands)

    S.I. Berndt (Sonja); S. Gustafsson (Stefan); R. Mägi (Reedik); A. Ganna (Andrea); E. Wheeler (Eleanor); M.F. Feitosa (Mary Furlan); A.E. Justice (Anne); K.L. Monda (Keri); D.C. Croteau-Chonka (Damien); F.R. Day (Felix); T. Esko (Tõnu); M. Fall (Magnus); T. Ferreira (Teresa); D. Gentilini (Davide); A.U. Jackson (Anne); J. Luan; J.C. Randall (Joshua); S. Vedantam (Sailaja); C.J. Willer (Cristen); T.W. Winkler (Thomas); A.R. Wood (Andrew); T. Workalemahu (Tsegaselassie); Y.-J. Hu (Yi-Juan); S.H. Lee (Sang Hong); L. Liang (Liming); D.Y. Lin (Dan); J. Min (Josine); B.M. Neale (Benjamin); G. Thorleifsson (Gudmar); J. Yang (Jian); E. Albrecht (Eva); N. Amin (Najaf); J.L. Bragg-Gresham (Jennifer L.); G. Cadby (Gemma); M. den Heijer (Martin); N. Eklund (Niina); K. Fischer (Krista); A. Goel (Anuj); J.J. Hottenga (Jouke Jan); J.E. Huffman (Jennifer); I. Jarick (Ivonne); A. Johansson (Åsa); T. Johnson (Toby); S. Kanoni (Stavroula); M.E. Kleber (Marcus); I.R. König (Inke); K. Kristiansson (Kati); Z. Kutalik (Zoltán); C. Lamina (Claudia); C. Lecoeur (Cécile); G. Li (Guo); M. Mangino (Massimo); W.L. McArdle (Wendy); M.C. Medina-Gomez (Carolina); M. Müller-Nurasyid (Martina); J.S. Ngwa; I.M. Nolte (Ilja); L. Paternoster (Lavinia); S. Pechlivanis (Sonali); M. Perola (Markus); M.J. Peters (Marjolein); M. Preuss (Michael); L.M. Rose (Lynda); J. Shi (Jianxin); D. Shungin (Dmitry); G.D. Smith; R.J. Strawbridge (Rona); I. Surakka (Ida); A. Teumer (Alexander); M.D. Trip (Mieke); J.P. Tyrer (Jonathan); J.V. van Vliet-Ostaptchouk (Jana); L. Vandenput (Liesbeth); L. Waite (Lindsay); J.H. Zhao (Jing); D. Absher (Devin); F.W. Asselbergs (Folkert); M. Atalay (Mustafa); A.P. Attwood (Antony); A.J. Balmforth (Anthony); D.C.G. Basart (Dick); J.P. Beilby (John); L.L. Bonnycastle (Lori); P. Brambilla (Paolo); M. Bruinenberg (M.); H. Campbell (Harry); D.I. Chasman (Daniel); P.S. Chines (Peter); F.S. Collins (Francis); J. Connell (John); W. O Cookson (William); U. de Faire (Ulf); F. de Vegt (Femmie); M. Dei (Mariano); M. Dimitriou (Maria); T. Edkins (Ted); K. Estrada Gil (Karol); D.M. Evans (David); M. Farrall (Martin); F. Ferrario (Franco); J. Ferrières (Jean); L. Franke (Lude); F. Frau (Francesca); P.V. Gejman (Pablo); H. Grallert (Harald); H. Grönberg (Henrik); V. Gudnason (Vilmundur); A. Hall (Anne); A.S. Hall (Alistair); A.L. Hartikainen; C. Hayward (Caroline); N.L. Heard-Costa (Nancy); A.C. Heath (Andrew); J. Hebebrand (Johannes); G. Homuth (Georg); F.B. Hu (Frank); S.E. Hunt (Sarah); E. Hyppönen (Elina); C. Iribarren (Carlos); K.B. Jacobs (Kevin); J.-O. Jansson (John-Olov); A. Jula (Antti); M. Kähönen (Mika); S. Kathiresan (Sekar); F. Kee (F.); K-T. Khaw (Kay-Tee); M. Kivimaki (Mika); W. Koenig (Wolfgang); A. Kraja (Aldi); M. Kumari (Meena); K. Kuulasmaa (Kari); J. Kuusisto (Johanna); J. Laitinen (Jaana); T.A. Lakka (Timo); C. Langenberg (Claudia); L.J. Launer (Lenore); L. Lind (Lars); J. Lindstrom (Jaana); J. Liu (Jianjun); A. Liuzzi (Antonio); M.L. Lokki; M. Lorentzon (Mattias); P.A. Madden (Pamela); P.K. Magnusson (Patrik); P. Manunta (Paolo); D. Marek (Diana); W. März (Winfried); I.M. Leach (Irene Mateo); B. McKnight (Barbara); S.E. Medland (Sarah Elizabeth); E. Mihailov (Evelin); L. Milani (Lili); G.W. Montgomery (Grant); V. Mooser (Vincent); T.W. Mühleisen (Thomas); P. Munroe (Patricia); A.W. Musk (Arthur); N. Narisu (Narisu); G. Navis (Gerjan); G. Nicholson (Ggeorge); C. Nohr (Christian); K. Ong (Ken); B.A. Oostra (Ben); C.N.A. Palmer (Colin); A. Palotie (Aarno); J. Peden (John); N. Pedersen; A. Peters (Annette); O. Polasek (Ozren); A. Pouta (Anneli); P.P. Pramstaller (Peter Paul); I. Prokopenko (Inga); C. Pütter (Carolin); A. Radhakrishnan (Aparna); O. Raitakari (Olli); A. Rendon (Augusto); F. Rivadeneira Ramirez (Fernando); I. Rudan (Igor); T. Saaristo (Timo); J.G. Sambrook (Jennifer); A.R. Sanders (Alan); S. Sanna (Serena); J. Saramies (Jouko); S. Schipf (Sabine); S. Schreiber (Stefan); H. Schunkert (Heribert); S.-Y. Shin; S. Signorini (Stefano); J. Sinisalo (Juha); B. Skrobek (Boris); N. Soranzo (Nicole); A. Stancáková (Alena); K. Stark (Klaus); J. Stephens (Jonathan); K. Stirrups (Kathy); R.P. Stolk (Ronald); M. Stumvoll (Michael); A.J. Swift (Amy); E.V. Theodoraki (Eirini); B. Thorand (Barbara); D.-A. Tregouet (David-Alexandre); E. Tremoli (Elena); M.M. van der Klauw (Melanie); J.B.J. van Meurs (Joyce); S.H.H.M. Vermeulen (Sita); J. Viikari (Jorma); J. Virtamo (Jarmo); V. Vitart (Veronique); G. Waeber (Gérard); Z. Wang (Zhaoming); E. Widen (Elisabeth); S.H. Wild (Sarah); G.A.H.M. Willemsen (Gonneke); B. Winkelmann; J.C.M. Witteman (Jacqueline); B.H.R. Wolffenbuttel (Bruce); A. Wong (Andrew); A.F. Wright (Alan); M.C. Zillikens (Carola); P. Amouyel (Philippe); B.O. Boehm (Bernhard); E.A. Boerwinkle (Eric); D.I. Boomsma (Dorret); M. Caulfield (Mark); S.J. Chanock (Stephen); L.A. Cupples (Adrienne); D. Cusi (Daniele); G.V. Dedoussis (George); J. Erdmann (Jeanette); J.G. Eriksson (Johan); P.W. Franks (Paul); P. Froguel (Philippe); C. Gieger (Christian); U. Gyllensten (Ulf); A. Hamsten (Anders); T.B. Harris (Tamara); C. Hengstenberg (Christian); A.A. Hicks (Andrew); A. Hingorani (Aroon); A. Hinney (Anke); A. Hofman (Albert); G.K. Hovingh (Kees); K. Hveem (Kristian); T. Illig (Thomas); M.-R. Jarvelin (Marjo-Riitta); K.-H. Jöckel (Karl-Heinz); S. Keinanen-Kiukaanniemi (Sirkka); L.A.L.M. Kiemeney (Bart); D. Kuh (Diana); M. Laakso (Markku); T. Lehtimäki (Terho); D.F. Levinson (Douglas); N.G. Martin (Nicholas); A. Metspalu (Andres); A.D. Morris (Andrew); M.S. Nieminen (Markku); I. Njølstad (Inger); C. Ohlsson (Claes); A.J. Oldehinkel (Albertine); W.H. Ouwehand (Willem); C. Palmer (Cameron); B.W.J.H. Penninx (Brenda); C. Power (Christopher); M.A. Province (Mike); B.M. Psaty (Bruce); L. Qi (Lu); R. Rauramaa (Rainer); P.M. Ridker (Paul); S. Ripatti (Samuli); V. Salomaa (Veikko); N.J. Samani (Nilesh); H. Snieder (Harold); H.G. Sorensen; T.D. Spector (Timothy); J-A. Zwart (John-Anker); A. Tönjes (Anke); J. Tuomilehto (Jaakko); A.G. Uitterlinden (André); M. Uusitupa (Matti); P. van der Harst (Pim); P. Vollenweider (Peter); H. Wallaschofski (Henri); N.J. Wareham (Nick); H. Watkins (Hugh); H.E. Wichmann (Heinz Erich); J.F. Wilson (James F); G.R. Abecasis (Gonçalo); T.L. Assimes (Themistocles); I. Barroso (Inês); M. Boehnke (Michael); I.B. Borecki (Ingrid); P. Deloukas (Panagiotis); C. Fox (Craig); T.M. Frayling (Timothy); L. Groop (Leif); T. Haritunian (Talin); I.M. Heid (Iris); D. Hunter (David); R.C. Kaplan (Robert); F. Karpe (Fredrik); M.F. Moffatt (Miriam); K.L. Mohlke (Karen); J.R. O´Connell; Y. Pawitan (Yudi); E.E. Schadt (Eric); D. Schlessinger (David); V. Steinthorsdottir (Valgerdur); D.P. Strachan (David); U. Thorsteinsdottir (Unnur); C.M. van Duijn (Cock); P.M. Visscher (Peter); A.M. Di Blasio (Anna Maria); J.N. Hirschhorn (Joel); C.M. Lindgren (Cecilia); A.D. Morris (Andrew); D. Meyre (David); A. Scherag (Andre); M.I. McCarthy (Mark); E.K. Speliotes (Elizabeth); K.E. North (Kari); R.J.F. Loos (Ruth); E. Ingelsson (Erik)

    2013-01-01

    textabstractApproaches exploiting trait distribution extremes may be used to identify loci associated with common traits, but it is unknown whether these loci are generalizable to the broader population. In a genome-wide search for loci associated with the upper versus the lower 5th percentiles of

  18. Genome-wide scan of healthy human connectome discovers SPON1 gene variant influencing dementia severity

    Science.gov (United States)

    Jahanshad, Neda; Rajagopalan, Priya; Hua, Xue; Hibar, Derrek P.; Nir, Talia M.; Toga, Arthur W.; Jack, Clifford R.; Saykin, Andrew J.; Green, Robert C.; Weiner, Michael W.; Medland, Sarah E.; Montgomery, Grant W.; Hansell, Narelle K.; McMahon, Katie L.; de Zubicaray, Greig I.; Martin, Nicholas G.; Wright, Margaret J.; Thompson, Paul M.; Weiner, Michael; Aisen, Paul; Weiner, Michael; Aisen, Paul; Petersen, Ronald; Jack, Clifford R.; Jagust, William; Trojanowski, John Q.; Toga, Arthur W.; Beckett, Laurel; Green, Robert C.; Saykin, Andrew J.; Morris, John; Liu, Enchi; Green, Robert C.; Montine, Tom; Petersen, Ronald; Aisen, Paul; Gamst, Anthony; Thomas, Ronald G.; Donohue, Michael; Walter, Sarah; Gessert, Devon; Sather, Tamie; Beckett, Laurel; Harvey, Danielle; Gamst, Anthony; Donohue, Michael; Kornak, John; Jack, Clifford R.; Dale, Anders; Bernstein, Matthew; Felmlee, Joel; Fox, Nick; Thompson, Paul; Schuff, Norbert; Alexander, Gene; DeCarli, Charles; Jagust, William; Bandy, Dan; Koeppe, Robert A.; Foster, Norm; Reiman, Eric M.; Chen, Kewei; Mathis, Chet; Morris, John; Cairns, Nigel J.; Taylor-Reinwald, Lisa; Trojanowki, J.Q.; Shaw, Les; Lee, Virginia M.Y.; Korecka, Magdalena; Toga, Arthur W.; Crawford, Karen; Neu, Scott; Saykin, Andrew J.; Foroud, Tatiana M.; Potkin, Steven; Shen, Li; Khachaturian, Zaven; Frank, Richard; Snyder, Peter J.; Molchan, Susan; Kaye, Jeffrey; Quinn, Joseph; Lind, Betty; Dolen, Sara; Schneider, Lon S.; Pawluczyk, Sonia; Spann, Bryan M.; Brewer, James; Vanderswag, Helen; Heidebrink, Judith L.; Lord, Joanne L.; Petersen, Ronald; Johnson, Kris; Doody, Rachelle S.; Villanueva-Meyer, Javier; Chowdhury, Munir; Stern, Yaakov; Honig, Lawrence S.; Bell, Karen L.; Morris, John C.; Ances, Beau; Carroll, Maria; Leon, Sue; Mintun, Mark A.; Schneider, Stacy; Marson, Daniel; Griffith, Randall; Clark, David; Grossman, Hillel; Mitsis, Effie; Romirowsky, Aliza; deToledo-Morrell, Leyla; Shah, Raj C.; Duara, Ranjan; Varon, Daniel; Roberts, Peggy; Albert, Marilyn; Onyike, Chiadi; Kielb, Stephanie; Rusinek, Henry; de Leon, Mony J.; Glodzik, Lidia; De Santi, Susan; Doraiswamy, P. Murali; Petrella, Jeffrey R.; Coleman, R. Edward; Arnold, Steven E.; Karlawish, Jason H.; Wolk, David; Smith, Charles D.; Jicha, Greg; Hardy, Peter; Lopez, Oscar L.; Oakley, MaryAnn; Simpson, Donna M.; Porsteinsson, Anton P.; Goldstein, Bonnie S.; Martin, Kim; Makino, Kelly M.; Ismail, M. Saleem; Brand, Connie; Mulnard, Ruth A.; Thai, Gaby; Mc-Adams-Ortiz, Catherine; Womack, Kyle; Mathews, Dana; Quiceno, Mary; Diaz-Arrastia, Ramon; King, Richard; Weiner, Myron; Martin-Cook, Kristen; DeVous, Michael; Levey, Allan I.; Lah, James J.; Cellar, Janet S.; Burns, Jeffrey M.; Anderson, Heather S.; Swerdlow, Russell H.; Apostolova, Liana; Lu, Po H.; Bartzokis, George; Silverman, Daniel H.S.; Graff-Radford, Neill R.; Parfitt, Francine; Johnson, Heather; Farlow, Martin R.; Hake, Ann Marie; Matthews, Brandy R.; Herring, Scott; van Dyck, Christopher H.; Carson, Richard E.; MacAvoy, Martha G.; Chertkow, Howard; Bergman, Howard; Hosein, Chris; Black, Sandra; Stefanovic, Bojana; Caldwell, Curtis; Hsiung, Ging-Yuek Robin; Feldman, Howard; Mudge, Benita; Assaly, Michele; Kertesz, Andrew; Rogers, John; Trost, Dick; Bernick, Charles; Munic, Donna; Kerwin, Diana; Mesulam, Marek-Marsel; Lipowski, Kristina; Wu, Chuang-Kuo; Johnson, Nancy; Sadowsky, Carl; Martinez, Walter; Villena, Teresa; Turner, Raymond Scott; Johnson, Kathleen; Reynolds, Brigid; Sperling, Reisa A.; Johnson, Keith A.; Marshall, Gad; Frey, Meghan; Yesavage, Jerome; Taylor, Joy L.; Lane, Barton; Rosen, Allyson; Tinklenberg, Jared; Sabbagh, Marwan; Belden, Christine; Jacobson, Sandra; Kowall, Neil; Killiany, Ronald; Budson, Andrew E.; Norbash, Alexander; Johnson, Patricia Lynn; Obisesan, Thomas O.; Wolday, Saba; Bwayo, Salome K.; Lerner, Alan; Hudson, Leon; Ogrocki, Paula; Fletcher, Evan; Carmichael, Owen; Olichney, John; DeCarli, Charles; Kittur, Smita; Borrie, Michael; Lee, T.-Y.; Bartha, Rob; Johnson, Sterling; Asthana, Sanjay; Carlsson, Cynthia M.; Potkin, Steven G.; Preda, Adrian; Nguyen, Dana; Tariot, Pierre; Fleisher, Adam; Reeder, Stephanie; Bates, Vernice; Capote, Horacio; Rainka, Michelle; Scharre, Douglas W.; Kataki, Maria; Zimmerman, Earl A.; Celmins, Dzintra; Brown, Alice D.; Pearlson, Godfrey D.; Blank, Karen; Anderson, Karen; Saykin, Andrew J.; Santulli, Robert B.; Schwartz, Eben S.; Sink, Kaycee M.; Williamson, Jeff D.; Garg, Pradeep; Watkins, Franklin; Ott, Brian R.; Querfurth, Henry; Tremont, Geoffrey; Salloway, Stephen; Malloy, Paul; Correia, Stephen; Rosen, Howard J.; Miller, Bruce L.; Mintzer, Jacobo; Longmire, Crystal Flynn; Spicer, Kenneth; Finger, Elizabeth; Rachinsky, Irina; Rogers, John; Kertesz, Andrew; Drost, Dick

    2013-01-01

    Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer’s disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, high-angular resolution diffusion MRI. We adapted GWASs to screen the brain’s connectivity pattern, allowing us to discover genetic variants that affect the human brain’s wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer’s disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases. PMID:23471985

  19. Genome-wide Association Study of Personality Traits in the Long Life Family Study

    Directory of Open Access Journals (Sweden)

    Harold T Bae

    2013-05-01

    Full Text Available Personality traits have been shown to be associated with longevity and healthy aging. In order to discover novel genetic modifiers associated with personality traits as related with longevity, we performed a genome-wide association study (GWAS on personality factors assessed by NEO-FFI in individuals enrolled in the Long Life Family Study (LLFS, a study of 583 families (N up to 4595 with clustering for longevity in the United States and Denmark. Three SNPs, in almost perfect LD, associated with agreeableness reached genome-wide significance (p<10-8 and replicated in an additional sample of 1279 LLFS subjects, although one (rs9650241 failed to replicate and the other two were not available in two independent replication cohorts, the Baltimore Longitudinal Study of Aging and the New England Centenarian Study. Based on 10,000,000 permutations, the empirical p-value of 2X10-7 was observed for the genome-wide significant SNPs. Seventeen SNPs that reached marginal statistical significance in the two previous GWASs (p-value < 10-4 and 10-5, were also marginally significantly associated in this study (p-value < 0.05, although none of the associations passed the Bonferroni correction. In addition, we tested age-by-SNP interactions and found some significant associations. Since scores of personality traits in LLFS subjects change in the oldest ages, and genetic factors outweigh environmental factors to achieve extreme ages, these age-by-SNP interactions could be a proxy for complex gene-gene interactions affecting personality traits and longevity.

  20. A Latent Class Approach to Estimating Test-Score Reliability

    Science.gov (United States)

    van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas

    2011-01-01

    This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…

  1. Propensity Score Analysis: An Alternative Statistical Approach for HRD Researchers

    Science.gov (United States)

    Keiffer, Greggory L.; Lane, Forrest C.

    2016-01-01

    Purpose: This paper aims to introduce matching in propensity score analysis (PSA) as an alternative statistical approach for researchers looking to make causal inferences using intact groups. Design/methodology/approach: An illustrative example demonstrated the varying results of analysis of variance, analysis of covariance and PSA on a heuristic…

  2. A genome-wide association study of optic disc parameters.

    Directory of Open Access Journals (Sweden)

    Wishal D Ramdas

    2010-06-01

    Full Text Available The optic nerve head is involved in many ophthalmic disorders, including common diseases such as myopia and open-angle glaucoma. Two of the most important parameters are the size of the optic disc area and the vertical cup-disc ratio (VCDR. Both are highly heritable but genetically largely undetermined. We performed a meta-analysis of genome-wide association (GWA data to identify genetic variants associated with optic disc area and VCDR. The gene discovery included 7,360 unrelated individuals from the population-based Rotterdam Study I and Rotterdam Study II cohorts. These cohorts revealed two genome-wide significant loci for optic disc area, rs1192415 on chromosome 1p22 (p = 6.72x10(-19 within 117 kb of the CDC7 gene and rs1900004 on chromosome 10q21.3-q22.1 (p = 2.67x10(-33 within 10 kb of the ATOH7 gene. They revealed two genome-wide significant loci for VCDR, rs1063192 on chromosome 9p21 (p = 6.15x10(-11 in the CDKN2B gene and rs10483727 on chromosome 14q22.3-q23 (p = 2.93x10(-10 within 40 kbp of the SIX1 gene. Findings were replicated in two independent Dutch cohorts (Rotterdam Study III and Erasmus Rucphen Family study; N = 3,612, and the TwinsUK cohort (N = 843. Meta-analysis with the replication cohorts confirmed the four loci and revealed a third locus at 16q12.1 associated with optic disc area, and four other loci at 11q13, 13q13, 17q23 (borderline significant, and 22q12.1 for VCDR. ATOH7 was also associated with VCDR independent of optic disc area. Three of the loci were marginally associated with open-angle glaucoma. The protein pathways in which the loci of optic disc area are involved overlap with those identified for VCDR, suggesting a common genetic origin.

  3. Genome-wide association study of antisocial personality disorder

    Science.gov (United States)

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-01-01

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53–3.14), P=1.9 × 10-5). Two polymorphisms at 6p21.2 LINC00951–LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37–1.85), P=1.6 × 10−9) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967

  4. Genome Wide Examination of Allelic Loss in Lobular and Ductal Breast Cancer

    Science.gov (United States)

    2004-07-01

    assay with data from array comparative genomic hybridization (CGH) on the same tumors. We find almost complete concordance with LOH as defined by the...Facility for assis- HuSNP assay on PEP material may be an acceptable tance in the hybridization and analysis of the HuSNP arrays, approach to genome-wide...Levine D, it is still a low-density map, with an average of one SNP Rabinovitch P, Reid B: 17p (p53) allelic losses, 4N (G2/ tetraploid ) site per 8.5 Mb in

  5. Principal components analysis corrects for stratification in genome-wide association studies.

    Science.gov (United States)

    Price, Alkes L; Patterson, Nick J; Plenge, Robert M; Weinblatt, Michael E; Shadick, Nancy A; Reich, David

    2006-08-01

    Population stratification--allele frequency differences between cases and controls due to systematic ancestry differences-can cause spurious associations in disease studies. We describe a method that enables explicit detection and correction of population stratification on a genome-wide scale. Our method uses principal components analysis to explicitly model ancestry differences between cases and controls. The resulting correction is specific to a candidate marker's variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. Our simple, efficient approach can easily be applied to disease studies with hundreds of thousands of markers.

  6. Genome-wide SNP identification in multiple morphotypes of allohexaploid tall fescue (Festuca arundinacea Schreb

    Directory of Open Access Journals (Sweden)

    Hand Melanie L

    2012-06-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs provide essential tools for the advancement of research in plant genomics, and the development of SNP resources for many species has been accelerated by the capabilities of second-generation sequencing technologies. The current study aimed to develop and use a novel bioinformatic pipeline to generate a comprehensive collection of SNP markers within the agriculturally important pasture grass tall fescue; an outbreeding allopolyploid species displaying three distinct morphotypes: Continental, Mediterranean and rhizomatous. Results A bioinformatic pipeline was developed that successfully identified SNPs within genotypes from distinct tall fescue morphotypes, following the sequencing of 414 polymerase chain reaction (PCR – generated amplicons using 454 GS FLX technology. Equivalent amplicon sets were derived from representative genotypes of each morphotype, including six Continental, five Mediterranean and one rhizomatous. A total of 8,584 and 2,292 SNPs were identified with high confidence within the Continental and Mediterranean morphotypes respectively. The success of the bioinformatic approach was demonstrated through validation (at a rate of 70% of a subset of 141 SNPs using both SNaPshot™ and GoldenGate™ assay chemistries. Furthermore, the quantitative genotyping capability of the GoldenGate™ assay revealed that approximately 30% of the putative SNPs were accessible to co-dominant scoring, despite the hexaploid genome structure. The sub-genome-specific origin of each SNP validated from Continental tall fescue was predicted using a phylogenetic approach based on comparison with orthologous sequences from predicted progenitor species. Conclusions Using the appropriate bioinformatic approach, amplicon resequencing based on 454 GS FLX technology is an effective method for the identification of polymorphic SNPs within the genomes of Continental and Mediterranean tall fescue. The

  7. Genome-wide association studies in pediatric chronic kidney disease.

    Science.gov (United States)

    Gupta, Jayanta; Kanetsky, Peter A; Wuttke, Matthias; Köttgen, Anna; Schaefer, Franz; Wong, Craig S

    2016-08-01

    The genome-wide association study (GWAS) has become an established scientific method that provides an unbiased screen for genetic loci potentially associated with phenotypes of clinical interest, such as chronic kidney disease (CKD). Thus, GWAS provides opportunities to gain new perspectives regarding the genetic architecture of CKD progression by identifying new candidate genes and targets for intervention. As such, it has become an important arm of translational science providing a complementary line of investigation to identify novel therapeutics to treat CKD. In this review, we describe the method and the challenges of performing GWAS in the pediatric CKD population. We also provide an overview of successful GWAS for kidney disease, and we discuss the established pediatric CKD cohorts in North America and Europe that are poised to identify genetic risk variants associated with CKD progression.

  8. Genome-wide genetic changes during modern breeding of maize.

    Science.gov (United States)

    Jiao, Yinping; Zhao, Hainan; Ren, Longhui; Song, Weibin; Zeng, Biao; Guo, Jinjie; Wang, Baobao; Liu, Zhipeng; Chen, Jing; Li, Wei; Zhang, Mei; Xie, Shaojun; Lai, Jinsheng

    2012-06-03

    The success of modern maize breeding has been demonstrated by remarkable increases in productivity over the last four decades. However, the underlying genetic changes correlated with these gains remain largely unknown. We report here the sequencing of 278 temperate maize inbred lines from different stages of breeding history, including deep resequencing of 4 lines with known pedigree information. The results show that modern breeding has introduced highly dynamic genetic changes into the maize genome. Artificial selection has affected thousands of targets, including genes and non-genic regions, leading to a reduction in nucleotide diversity and an increase in the proportion of rare alleles. Genetic changes during breeding happen rapidly, with extensive variation (SNPs, indels and copy-number variants (CNVs)) occurring, even within identity-by-descent regions. Our genome-wide assessment of genetic changes during modern maize breeding provides new strategies as well as practical targets for future crop breeding and biotechnology.

  9. AID/APOBEC cytosine deaminase induces genome-wide kataegis

    Directory of Open Access Journals (Sweden)

    Lada Artem G

    2012-12-01

    Full Text Available Abstract Clusters of localized hypermutation in human breast cancer genomes, named “kataegis” (from the Greek for thunderstorm, are hypothesized to result from multiple cytosine deaminations catalyzed by AID/APOBEC proteins. However, a direct link between APOBECs and kataegis is still lacking. We have sequenced the genomes of yeast mutants induced in diploids by expression of the gene for PmCDA1, a hypermutagenic deaminase from sea lamprey. Analysis of the distribution of 5,138 induced mutations revealed localized clusters very similar to those found in tumors. Our data provide evidence that unleashed cytosine deaminase activity is an evolutionary conserved, prominent source of genome-wide kataegis events. Reviewers This article was reviewed by: Professor Sandor Pongor, Professor Shamil R. Sunyaev, and Dr Vladimir Kuznetsov.

  10. Genome-wide association study of serum selenium concentrations

    DEFF Research Database (Denmark)

    Gong, Jian; Hsu, Li; Harrison, Tabitha

    2013-01-01

    Selenium is an essential trace element and circulating selenium concentrations have been associated with a wide range of diseases. Candidate gene studies suggest that circulating selenium concentrations may be impacted by genetic variation; however, no study has comprehensively investigated...... this hypothesis. Therefore, we conducted a two-stage genome-wide association study to identify genetic variants associated with serum selenium concentrations in 1203 European descents from two cohorts: the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening and the Women’s Health Initiative (WHI). We...... tested association between 2,474,333 single nucleotide polymorphisms (SNPs) and serum selenium concentrations using linear regression models. In the first stage (PLCO) 41 SNPs clustered in 15 regions had p

  11. Genome-wide measurement of RNA folding energies.

    Science.gov (United States)

    Wan, Yue; Qu, Kun; Ouyang, Zhengqing; Kertesz, Michael; Li, Jun; Tibshirani, Robert; Makino, Debora L; Nutter, Robert C; Segal, Eran; Chang, Howard Y

    2012-10-26

    RNA structural transitions are important in the function and regulation of RNAs. Here, we reveal a layer of transcriptome organization in the form of RNA folding energies. By probing yeast RNA structures at different temperatures, we obtained relative melting temperatures (Tm) for RNA structures in over 4000 transcripts. Specific signatures of RNA Tm demarcated the polarity of mRNA open reading frames and highlighted numerous candidate regulatory RNA motifs in 3' untranslated regions. RNA Tm distinguished noncoding versus coding RNAs and identified mRNAs with distinct cellular functions. We identified thousands of putative RNA thermometers, and their presence is predictive of the pattern of RNA decay in vivo during heat shock. The exosome complex recognizes unpaired bases during heat shock to degrade these RNAs, coupling intrinsic structural stabilities to gene regulation. Thus, genome-wide structural dynamics of RNA can parse functional elements of the transcriptome and reveal diverse biological insights.

  12. Genome-wide transcriptional reprogramming under drought stress

    KAUST Repository

    Chen, Hao

    2012-01-01

    Soil water deficit is one of the major factors limiting plant productivity. Plants cope with this adverse environmental condition by coordinating the up- or downregulation of an array of stress responsive genes. Reprogramming the expression of these genes leads to rebalanced development and growth that are in concert with the reduced water availability and that ultimately confer enhanced stress tolerance. Currently, several techniques have been employed to monitor genome-wide transcriptional reprogramming under drought stress. The results from these high throughput studies indicate that drought stress-induced transcriptional reprogramming is dynamic, has temporal and spatial specificity, and is coupled with the circadian clock and phytohormone signaling pathways. © 2012 Springer-Verlag Berlin Heidelberg. All rights are reserved.

  13. A comparison of multivariate genome-wide association methods

    DEFF Research Database (Denmark)

    Galesloot, Tessel E; Van Steen, Kristel; Kiemeney, Lambertus A L M

    2014-01-01

    Joint association analysis of multiple traits in a genome-wide association study (GWAS), i.e. a multivariate GWAS, offers several advantages over analyzing each trait in a separate GWAS. In this study we directly compared a number of multivariate GWAS methods using simulated data. We focused on six...... methods that are implemented in the software packages PLINK, SNPTEST, MultiPhen, BIMBAM, PCHAT and TATES, and also compared them to standard univariate GWAS, analysis of the first principal component of the traits, and meta-analysis of univariate results. We simulated data (N = 1000) for three...... correlation. We compared the power of the methods using empirically fixed significance thresholds (α = 0.05). Our results showed that the multivariate methods implemented in PLINK, SNPTEST, MultiPhen and BIMBAM performed best for the majority of the tested scenarios, with a notable increase in power...

  14. Genome-Wide Association Study of Meiotic Recombination Phenotypes

    Science.gov (United States)

    Begum, Ferdouse; Chowdhury, Reshmi; Cheung, Vivian G.; Sherman, Stephanie L.; Feingold, Eleanor

    2016-01-01

    Meiotic recombination is an essential step in gametogenesis, and is one that also generates genetic diversity. Genome-wide association studies (GWAS) and molecular studies have identified genes that influence of human meiotic recombination. RNF212 is associated with total or average number of recombination events, and PRDM9 is associated with the locations of hotspots, or sequences where crossing over appears to cluster. In addition, a common inversion on chromosome 17 is strongly associated with recombination. Other genes have been identified by GWAS, but those results have not been replicated. In this study, using new datasets, we characterized additional recombination phenotypes to uncover novel candidates and further dissect the role of already known loci. We used three datasets totaling 1562 two-generation families, including 3108 parents with 4304 children. We estimated five different recombination phenotypes including two novel phenotypes (average recombination counts within recombination hotspots and outside of hotspots) using dense SNP array genotype data. We then performed gender-specific and combined-sex genome-wide association studies (GWAS) meta-analyses. We replicated associations for several previously reported recombination genes, including RNF212 and PRDM9. By looking specifically at recombination events outside of hotspots, we showed for the first time that PRDM9 has different effects in males and females. We identified several new candidate loci, particularly for recombination events outside of hotspots. These include regions near the genes SPINK6, EVC2, ARHGAP25, and DLGAP2. This study expands our understanding of human meiotic recombination by characterizing additional features that vary across individuals, and identifying regulatory variants influencing the numbers and locations of recombination events. PMID:27733454

  15. Genome-wide association and genomic selection in animal breeding.

    Science.gov (United States)

    Hayes, Ben; Goddard, Mike

    2010-11-01

    Results from genome-wide association studies in livestock, and humans, has lead to the conclusion that the effect of individual quantitative trait loci (QTL) on complex traits, such as yield, are likely to be small; therefore, a large number of QTL are necessary to explain genetic variation in these traits. Given this genetic architecture, gains from marker-assisted selection (MAS) programs using only a small number of DNA markers to trace a limited number of QTL is likely to be small. This has lead to the development of alternative technology for using the available dense single nucleotide polymorphism (SNP) information, called genomic selection. Genomic selection uses a genome-wide panel of dense markers so that all QTL are likely to be in linkage disequilibrium with at least one SNP. The genomic breeding values are predicted to be the sum of the effect of these SNPs across the entire genome. In dairy cattle breeding, the accuracy of genomic estimated breeding values (GEBV) that can be achieved and the fact that these are available early in life have lead to rapid adoption of the technology. Here, we discuss the design of experiments necessary to achieve accurate prediction of GEBV in future generations in terms of the number of markers necessary and the size of the reference population where marker effects are estimated. We also present a simple method for implementing genomic selection using a genomic relationship matrix. Future challenges discussed include using whole genome sequence data to improve the accuracy of genomic selection and management of inbreeding through genomic relationships.

  16. Genome-wide significant loci for addiction and anxiety

    Science.gov (United States)

    Hodgson, K.; Almasy, L.; Knowles, E.E.M.; Kent, J.W.; Curran, J.E.; Dyer, T.D.; Göring, H.H.H.; Olvera, R.L.; Fox, P.T.; Pearlson, G.D.; Krystal, J.H.; Duggirala, R.; Blangero, J.; Glahn, D.C.

    2017-01-01

    Background Psychiatric comorbidity is common among individuals with addictive disorders, with patients frequently suffering from anxiety disorders. While the genetic architecture of comorbid addictive and anxiety disorders remains unclear, elucidating the genes involved could provide important insights into the underlying etiology. Methods Here we examine a sample of 1284 Mexican-Americans from randomly selected extended pedigrees. Variance decomposition methods were used to examine the role of genetics in addiction phenotypes (lifetime history of alcohol dependence, drug dependence or chronic smoking) and various forms of clinically relevant anxiety. Genome-wide univariate and bivariate linkage scans were conducted to localize the chromosomal regions influencing these traits. Results Addiction phenotypes and anxiety were shown to be heritable and univariate genome-wide linkage scans revealed significant quantitative trait loci for drug dependence (14q13.2–q21.2, LOD = 3.322) and a broad anxiety phenotype (12q24.32–q24.33, LOD = 2.918). Significant positive genetic correlations were observed between anxiety and each of the addiction subtypes (ρg = 0.550–0.655) and further investigation with bivariate linkage analyses identified significant pleiotropic signals for alcohol dependence-anxiety (9q33.1–q33.2, LOD = 3.054) and drug dependence-anxiety (18p11.23–p11.22, LOD = 3.425). Conclusions This study confirms the shared genetic underpinnings of addiction and anxiety and identifies genomic loci involved in the etiology of these comorbid disorders. The linkage signal for anxiety on 12q24 spans the location of TMEM132D, an emerging gene of interest from previous GWAS of anxiety traits, whilst the bivariate linkage signal identified for anxiety-alcohol on 9q33 peak coincides with a region where rare CNVs have been associated with psychiatric disorders. Other signals identified implicate novel regions of the genome in addiction genetics. PMID:27318301

  17. A genome-wide methylation study on obesity

    Science.gov (United States)

    Xu, Xiaojing; Su, Shaoyong; Barnes, Vernon A.; De Miguel, Carmen; Pollock, Jennifer; Ownby, Dennis; Shi, Huidong; Zhu, Haidong; Snieder, Harold; Wang, Xiaoling

    2013-01-01

    Besides differential methylation, DNA methylation variation has recently been proposed and demonstrated to be a potential contributing factor to cancer risk. Here we aim to examine whether differential variability in methylation is also an important feature of obesity, a typical non-malignant common complex disease. We analyzed genome-wide methylation profiles of over 470,000 CpGs in peripheral blood samples from 48 obese and 48 lean African-American youth aged 14–20 y old. A substantial number of differentially variable CpG sites (DVCs), using statistics based on variances, as well as a substantial number of differentially methylated CpG sites (DMCs), using statistics based on means, were identified. Similar to the findings in cancers, DVCs generally exhibited an outlier structure and were more variable in cases than in controls. By randomly splitting the current sample into a discovery and validation set, we observed that both the DVCs and DMCs identified from the first set could independently predict obesity status in the second set. Furthermore, both the genes harboring DMCs and the genes harboring DVCs showed significant enrichment of genes identified by genome-wide association studies on obesity and related diseases, such as hypertension, dyslipidemia, type 2 diabetes and certain types of cancers, supporting their roles in the etiology and pathogenesis of obesity. We generalized the recent finding on methylation variability in cancer research to obesity and demonstrated that differential variability is also an important feature of obesity-related methylation changes. Future studies on the epigenetics of obesity will benefit from both statistics based on means and statistics based on variances. PMID:23644594

  18. Genome-wide association studies in asthma: progress and pitfalls

    Directory of Open Access Journals (Sweden)

    March ME

    2015-01-01

    Full Text Available Michael E March,1 Patrick MA Sleiman,1,2 Hakon Hakonarson1,2 1Center for Applied Genomics, Children's Hospital of Philadelphia Research Institute, 2Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Abstract: Genetic studies of asthma have revealed that there is considerable heritability to the phenotype. An extensive history of candidate-gene studies has identified a long list of genes associated with immune function that are potentially involved in asthma pathogenesis. However, many of the results of candidate-gene studies have failed to be replicated, leaving in question the true impact of the implicated biological pathways on asthma. With the advent of genome-wide association studies, geneticists are able to examine the association of hundreds of thousands of genetic markers with a phenotype, allowing the hypothesis-free identification of variants associated with disease. Many such studies examining asthma or related phenotypes have been published, and several themes have begun to emerge regarding the biological pathways underpinning asthma. The results of many genome-wide association studies have currently not been replicated, and the large sample sizes required for this experimental strategy invoke difficulties with sample stratification and phenotypic heterogeneity. Recently, large collaborative groups of researchers have formed consortia focused on asthma, with the goals of sharing material and data and standardizing diagnosis and experimental methods. Additionally, research has begun to focus on genetic variants that affect the response to asthma medications and on the biology that generates the heterogeneity in the asthma phenotype. As this work progresses, it will move asthma patients closer to more specific, personalized medicine. Keywords: asthma, genetics, GWAS, pharmacogenetics, biomarkers

  19. A genome-wide association study of anorexia nervosa

    Science.gov (United States)

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Landt, Margarita CT Slof-Op t; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O'Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2015-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10-7) in SOX2OT and rs17030795 (P=5.84×10-6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10-6) between CUL3 and FAM124B and rs1886797 (P=8.05×10-6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4×10-6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:24514567

  20. A genome-wide association study of anorexia nervosa.

    Science.gov (United States)

    Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, A A; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M

    2014-10-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countries (15 sites) and 14 860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery data sets. Seventy-six (72 independent) single nucleotide polymorphisms were taken forward for in silico (two data sets) or de novo (13 data sets) replication genotyping in 2677 independent AN cases and 8629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication data sets comprised 5551 AN cases and 21 080 controls. AN subtype analyses (1606 AN restricting; 1445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01 × 10(-7)) in SOX2OT and rs17030795 (P=5.84 × 10(-6)) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76 × 10(-)(6)) between CUL3 and FAM124B and rs1886797 (P=8.05 × 10(-)(6)) near SPATA13. Comparing discovery with replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4 × 10(-6)), strongly suggesting that true findings exist but our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field.

  1. Genome-wide expression profiling of complex regional pain syndrome.

    Directory of Open Access Journals (Sweden)

    Eun-Heui Jin

    Full Text Available Complex regional pain syndrome (CRPS is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II and 5 controls (cut-off value: 1.5-fold change and p<0.05. Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1, matrix metalloproteinase 9 (MMP9, alanine aminopeptidase N (ANPEP, l-histidine decarboxylase (HDC, granulocyte colony-stimulating factor 3 receptor (G-CSF3R, and signal transducer and activator of transcription 3 (STAT3 genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR. We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10(-4. The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression.

  2. Genome-wide uniparental disomy screen in human discarded morphologically abnormal embryos.

    Science.gov (United States)

    Xu, Jiawei; Zhang, Meixiang; Niu, Wenbin; Yao, Guidong; Sun, Bo; Bao, Xiao; Wang, Linlin; Du, Linqing; Sun, Yingpu

    2015-07-21

    Uniparental disomy (UPD) has been shown to be rare in human normal blastocysts, but its frequency in discarded morphologically abnormal embryos and its relevance to embryonic self-correction of aneuploid remains unknown. The aim of this study was to detect UPD in discarded morphologically abnormal embryos. Both discarded morphologically abnormal embryos, including zero-pronuclear zygotes (0PN), one-pronuclear zygotes (1PN), three-pronuclear zygotes (3PN) and 2PN embryos scored as low development potential were cultured into blastocysts then underwent trophectoderm biopsy. Genome-wide UPD screening of the trophectoderm of 241 discarded morphologically abnormal embryo sourced blastocysts showed that UPD occurred in nine embryos. Five embryos exhibited UPDs with euploid chromosomes, and four displayed UPDs with chromosomal aneuploid. The percentage of UPDs among the morphologically abnormal sourced blastocysts was 3.73%, which is significant higher than the percentage observed in normal blastocysts. The frequency of UPD in 3PN-sourced blastocysts was 7.69%, which is significantly higher than that in normal blastocysts. This study provides the first systematic genome-wide profile of UPD in discarded morphologically abnormal embryos. Our results indicated that UPD may be a common phenomenon in discarded morphologically abnormal embryos and may be relevant to human embryonic self-correction.

  3. Genome-wide association and linkage analyses localize a progressive retinal atrophy locus in Persian cats.

    Science.gov (United States)

    Alhaddad, Hasan; Gandolfi, Barbara; Grahn, Robert A; Rah, Hyung-Chul; Peterson, Carlyn B; Maggs, David J; Good, Kathryn L; Pedersen, Niels C; Lyons, Leslie A

    2014-08-01

    Hereditary eye diseases of animals serve as excellent models of human ocular disorders and assist in the development of gene and drug therapies for inherited forms of blindness. Several primary hereditary eye conditions affecting various ocular tissues and having different rates of progression have been documented in domestic cats. Gene therapy for canine retinopathies has been successful, thus the cat could be a gene therapy candidate for other forms of retinal degenerations. The current study investigates a hereditary, autosomal recessive, retinal degeneration specific to Persian cats. A multi-generational pedigree segregating for this progressive retinal atrophy was genotyped using a 63 K SNP array and analyzed via genome-wide linkage and association methods. A multi-point parametric linkage analysis localized the blindness phenotype to a ~1.75 Mb region with significant LOD scores (Z ≈ 14, θ = 0.00) on cat chromosome E1. Genome-wide TDT, sib-TDT, and case-control analyses also consistently supported significant association within the same region on chromosome E1, which is homologous to human chromosome 17. Using haplotype analysis, a ~1.3 Mb region was identified as highly associated for progressive retinal atrophy in Persian cats. Several candidate genes within the region are reasonable candidates as a potential causative gene and should be considered for molecular analyses.

  4. Genome-wide association study of a quantitative disordered gambling trait.

    Science.gov (United States)

    Lind, Penelope A; Zhu, Gu; Montgomery, Grant W; Madden, Pamela A F; Heath, Andrew C; Martin, Nicholas G; Slutske, Wendy S

    2013-05-01

    Disordered gambling is a moderately heritable trait, but the underlying genetic basis is largely unknown. We performed a genome-wide association study (GWAS) for disordered gambling using a quantitative factor score in 1312 twins from 894 Australian families. Association was conducted for 2 381 914 single-nucleotide polymorphisms (SNPs) using the family-based association test in Merlin followed by gene and pathway enrichment analyses. Although no SNP reached genome-wide significance, six achieved P-values Secondary case-control analyses found two SNPs on chromosome 9 (rs1106076 and rs12305135 near VLDLR) and rs10812227 near FZD10 on chromosome 12 to be significantly associated with lifetime Diagnostic and Statistical Manual of Mental Disorders, fourth edition pathological gambling and South Oaks Gambling Screen classified probable pathological gambling status. Furthermore, several addiction-related pathways were enriched for SNPs associated with disordered gambling. Finally, gene-based analysis of 24 candidate genes for dopamine agonist-induced gambling in individuals with Parkinson's disease suggested an enrichment of SNPs associated with disordered gambling. We report the first GWAS of disordered gambling. While further replication is required, the identification of susceptibility loci and biological pathways will be important in characterizing the biological mechanisms that underpin disordered gambling.

  5. Genome-wide association study identifies four loci associated with eruption of permanent teeth.

    Directory of Open Access Journals (Sweden)

    Frank Geller

    2011-09-01

    Full Text Available The sequence and timing of permanent tooth eruption is thought to be highly heritable and can have important implications for the risk of malocclusion, crowding, and periodontal disease. We conducted a genome-wide association study of number of permanent teeth erupted between age 6 and 14 years, analyzed as age-adjusted standard deviation score averaged over multiple time points, based on childhood records for 5,104 women from the Danish National Birth Cohort. Four loci showed association at P<5×10(-8 and were replicated in four independent study groups from the United States and Denmark with a total of 3,762 individuals; all combined P-values were below 10(-11. Two loci agreed with previous findings in primary tooth eruption and were also known to influence height and breast cancer, respectively. The two other loci pointed to genomic regions without any previous significant genome-wide association study results. The intronic SNP rs7924176 in ADK could be linked to gene expression in monocytes. The combined effect of the four genetic variants was most pronounced between age 10 and 12 years, where children with 6 to 8 delayed tooth eruption alleles had on average 3.5 (95% confidence interval: 2.9-4.1 fewer permanent teeth than children with 0 or 1 of these alleles.

  6. Genome-wide association study of insect bite hypersensitivity in Dutch Shetland pony mares.

    Science.gov (United States)

    Schurink, A; Ducro, B J; Bastiaansen, J W M; Frankena, K; van Arendonk, J A M

    2013-02-01

    Insect bite hypersensitivity (IBH) is the most common allergic disease present in horses worldwide. It has been shown that IBH is under genetic control, but the knowledge of associated genes is limited. We conducted a genome-wide association study to identify and quantify genomic regions contributing to IBH in the Dutch Shetland pony population. A total of 97 cases and 91 controls were selected and matched on withers height, coat colour and pedigree to minimise the population stratification. A blood sample was collected from participating Shetland pony mares, their IBH phenotype was scored and the owner filled in a questionnaire. A total of 40 021 single-nucleotide polymorphisms (SNPs) were fitted in a univariable logistic model fitting an additive effect. Analysis revealed no effects of population stratification. Significant associations with IBH were detected for 24 SNPs on 12 chromosomes [-log(10) (P-value) > 2.5]. Odds ratios of allele substitution effects of the unfavourable allele were between 1.94 and 5.95. The most significant SNP was found on chromosome 27, with an odds ratio of 2.31 and with an allele frequency of the unfavourable allele of 0.72 in cases and 0.53 in controls. Genome-wide association studies on additional horse populations are desired to validate the identified associations, to identify the genes involved in IBH and to develop genomic tools to decrease IBH prevalence.

  7. A genome-wide association study of Cloninger’s Temperament scales: Implications for the evolutionary genetics of personality

    OpenAIRE

    Verweij, Karin J.H.; Zietsch, Brendan P.; Medland, Sarah E.; Gordon, Scott D.; Benyamin, Beben; Nyholt, Dale R.; McEvoy, Brian P.; Sullivan, Patrick F.; Heath, Andrew C.; Pamela A F Madden; Henders, Anjali K.; Montgomery, Grant W.; Martin, Nicholas G.; Wray, Naomi R

    2010-01-01

    Variation in personality traits is 30% to 60% attributed to genetic influences. Attempts to unravel these genetic influences at the molecular level have, so far, been inconclusive. We performed the first genome-wide association study of Cloninger’s temperament scales in a sample of 5117 individuals, in order to identify common genetic variants underlying variation in personality. Participants’ scores on Harm Avoidance, Novelty Seeking, Reward Dependence, and Persistence were tested for associ...

  8. Comparative analysis of genome-wide divergence, domestication footprints and genome-wide association study of root traits for Gossypium hirsutum and Gossypium barbadense

    Science.gov (United States)

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...

  9. Why breeding values estimated using familial data should not be used for genome-wide association studies.

    Science.gov (United States)

    Ekine, Chinyere C; Rowe, Suzanne J; Bishop, Stephen C; de Koning, Dirk-Jan

    2014-02-19

    In animal breeding, the genetic potential of an animal is summarized as its estimated breeding value, which is derived from its own performance as well as the performance of related individuals. Here, we illustrate why estimated breeding values are not suitable as a phenotype for genome-wide association studies. We simulated human-type and pig-type pedigrees with a range of quantitative trait loci (QTL) effects (0.5-3% of phenotypic variance) and heritabilities (0.3-0.8). We analyzed 1000 replicates of each scenario with four models: (a) a full mixed model including a polygenic effect, (b) a regression analysis using the residual of a mixed model as a trait score (so called GRAMMAR approach), (c) a regression analysis using the estimated breeding value as a trait score, and (d) a regression analysis that uses the raw phenotype as a trait score. We show that using breeding values as a trait score gives very high false-positive rates (up 14% in human pedigrees and >60% in pig pedigrees). Simulations based on a real pedigree show that additional generations of pedigree increase the type I error. Including the family relationship as a random effect provides the greatest power to detect QTL while controlling for type I error at the desired level and providing the most accurate estimates of the QTL effect. Both the use of residuals and the use of breeding values result in deflated estimates of the QTL effect. We derive the contributions of QTL effects to the breeding value and residual and show how this affects the estimates.

  10. Genome-wide search for susceptibility genes to type 2 diabetes in West Africans: potential role of C-peptide.

    Science.gov (United States)

    Chen, Guanjie; Adeyemo, Adebowale; Zhou, Jie; Chen, Yuanxiu; Huang, Hanxia; Doumatey, Ayo; Lashley, Kerrie; Agyenim-Boateng, Kofi; Eghan, Benjamin A; Acheampong, Joseph; Fasanmade, Olufemi; Johnson, Thomas; Okafor, Godfrey; Oli, Johnnie; Amoah, Albert; Rotimi, Charles

    2007-12-01

    C-peptide is a substance that the pancreas releases into the circulation in equimolar amounts to insulin and has demonstrated important physiological effects which relate to the vascular field, in particular the microcirculation. For this analysis, we included 321 full and 36 half sibling pairs affected with type 2 diabetes (T2D) from West Africa. A genome-wide panel of 390 tri-nucleotide and tetra-nucleotide repeats with an average distance of 8.9 cM was performed on a total of 691 persons. Variance components based on multipoint linkage approach as implemented in SOLAR were performed for log C-peptide. Significant linkage evidences were observed on 10q23 at D10S2327 with a LOD score of 4.04 (nominal p-value=0.000008, empirical p-value=0.0004); and on 4p15 at D4S2632 with a LOD score of 3.48 (nominal p-value=0.000031, empirical p-value=0.0013). Other suggestive evidence of linkage were observed on 15q14 at D15S659 with a LOD score 2.41 (nominal p-value=0.000435, empirical p-value=0.0068), and on 18p11 near D18S976 with a LOD score 2.18 (nominal p-value=0.000771 and empirical p-value=0.0094). Interestingly, five positional candidate genes for diabetes and related complications are located in our linkage region (the pituitary adenylate cyclase activating polypeptide (PACAP in 18p11); the peroxisome proliferator-activated receptor gamma coactivator 1 (PPARGC1 in 4p15); PTEN, PPP1R5, and IDE located in 10q23. In conclusion, we identified four major genetic loci (10q23, 4p15, 15q14, and 18p11) influencing C-peptide concentration in West Africans with T2D.

  11. Genome-wide Association Study for Radiographic Vertebral Fractures: A Potential Role for the 16q24 BMD Locus versus Lessons Learned from Challenging Phenotype Definition.

    Science.gov (United States)

    Oei, Ling; Estrada, Karol; Duncan, Emma L; Christiansen, Claus; Liu, Ching-Ti; Langdahl, Bente L; Obermayer-Pietsch, Barbara; Riancho, José A; Prince, Richard L; van Schoor, Natasja M; McCloskey, Eugene; Hsu, Yi-Hsiang; Evangelou, Evangelos; Ntzani, Evangelia; Evans, David M; Alonso, Nerea; Husted, Lise B; Valero, Carmen; Hernandez, Jose L; Lewis, Joshua R; Kaptoge, Stephen K; Zhu, Kun; Cupples, L Adrienne; Medina-Gómez, Carolina; Vandenput, Liesbeth; Kim, Ghi Su; Lee, Seung Hun; Castaño-Betancourt, Martha C; Oei, Edwin H G; Martinez, Josefina; Daroszewska, Anna; van der Klift, Marjolein; Mellström, Dan; Herrera, Lizbeth; Karlsson, Magnus K; Hofman, Albert; Ljunggren, Osten; Pols, Huibert A P; Stolk, Lisette; van Meurs, Joyce B J; Ioannidis, John P A; Zillikens, M Carola; Lips, Paul; Karasik, David; Uitterlinden, André G; Styrkarsdottir, Unnur; Brown, Matthew A; Koh, Jung-Min; Richards, J Brent; Reeve, Jonathan; Ohlsson, Claes; Ralston, Stuart H; Kiel, Douglas P; Rivadeneira, Fernando

    2013-10-25

    Vertebral fracture risk is a heritable complex trait. The aim of this study was to identify genetic susceptibility factors for osteoporotic vertebral fractures applying a genome-wide association study (GWAS) approach. The GWAS discovery was based on the Rotterdam Study, a population-based study of elderly Dutch individuals aged >55years; and comprising 329 cases and 2,666 controls with radiographic scoring (McCloskey-Kanis) and genetic data. Replication of one top-associated SNP was pursued by de-novo genotyping of 15 independent studies across Europe, the United States, Australia and one Asian study. Radiographic vertebral fracture assessment was performed using McCloskey-Kanis or Genant semi-quantitative definitions. SNPs were analyzed in relation to vertebral fracture using logistic regression models corrected for age and sex. Fixed effects inverse variance and Han-Eskin alternative random effects meta-analyses were applied. Genome-wide significance was set at P 1.25) may still be consistent with an effect size definitions are needed to confirm or reject the involvement of this locus on the risk for vertebral fractures. © 2013.

  12. Genome-wide association mapping of quantitative traits in a breeding population of sugarcane.

    Science.gov (United States)

    Racedo, Josefina; Gutiérrez, Lucía; Perera, María Francisca; Ostengo, Santiago; Pardo, Esteban Mariano; Cuenya, María Inés; Welin, Bjorn; Castagnaro, Atilio Pedro

    2016-06-24

    Molecular markers associated with relevant agronomic traits could significantly reduce the time and cost involved in developing new sugarcane varieties. Previous sugarcane genome-wide association analyses (GWAS) have found few molecular markers associated with relevant traits at plant-cane stage. The aim of this study was to establish an appropriate GWAS to find molecular markers associated with yield related traits consistent across harvesting seasons in a breeding population. Sugarcane clones were genotyped with DArT (Diversity Array Technology) and TRAP (Target Region Amplified Polymorphism) markers, and evaluated for cane yield (CY) and sugar content (SC) at two locations during three successive crop cycles. GWAS mapping was applied within a novel mixed-model framework accounting for population structure with Principal Component Analysis scores as random component. A total of 43 markers significantly associated with CY in plant-cane, 42 in first ratoon, and 41 in second ratoon were detected. Out of these markers, 20 were associated with CY in 2 years. Additionally, 38 significant associations for SC were detected in plant-cane, 34 in first ratoon, and 47 in second ratoon. For SC, one marker-trait association was found significant for the 3 years of the study, while twelve markers presented association for 2 years. In the multi-QTL model several markers with large allelic substitution effect were found. Sequences of four DArT markers showed high similitude and e-value with coding sequences of Sorghum bicolor, confirming the high gene microlinearity between sorghum and sugarcane. In contrast with other sugarcane GWAS studies reported earlier, the novel methodology to analyze multi-QTLs through successive crop cycles used in the present study allowed us to find several markers associated with relevant traits. Combining existing phenotypic trial data and genotypic DArT and TRAP marker characterizations within a GWAS approach including population structure as

  13. Principles guiding embryo selection following genome-wide haplotyping of preimplantation embryos.

    Science.gov (United States)

    Dimitriadou, Eftychia; Melotte, Cindy; Debrock, Sophie; Esteki, Masoud Zamani; Dierickx, Kris; Voet, Thierry; Devriendt, Koen; de Ravel, Thomy; Legius, Eric; Peeraer, Karen; Meuleman, Christel; Vermeesch, Joris Robert

    2017-03-01

    How to select and prioritize embryos during PGD following genome-wide haplotyping? In addition to genetic disease-specific information, the embryo selected for transfer is based on ranking criteria including the existence of mitotic and/or meiotic aneuploidies, but not carriership of mutations causing recessive disorders. Embryo selection for monogenic diseases has been mainly performed using targeted disease-specific assays. Recently, these targeted approaches are being complemented by generic genome-wide genetic analysis methods such as karyomapping or haplarithmisis, which are based on genomic haplotype reconstruction of cell(s) biopsied from embryos. This provides not only information about the inheritance of Mendelian disease alleles but also about numerical and structural chromosome anomalies and haplotypes genome-wide. Reflections on how to use this information in the diagnostic laboratory are lacking. We present the results of the first 101 PGD cycles (373 embryos) using haplarithmisis, performed in the Centre for Human Genetics, UZ Leuven. The questions raised were addressed by a multidisciplinary team of clinical geneticist, fertility specialists and ethicists. Sixty-three couples enrolled in the genome-wide haplotyping-based PGD program. Families presented with either inherited genetic variants causing known disorders and/or chromosomal rearrangements that could lead to unbalanced translocations in the offspring. Embryos were selected based on the absence or presence of the disease allele, a trisomy or other chromosomal abnormality leading to known developmental disorders. In addition, morphologically normal Day 5 embryos were prioritized for transfer based on the presence of other chromosomal imbalances and/or carrier information. Some of the choices made and principles put forward are specific for cleavage-stage-based genetic testing. The proposed guidelines are subject to continuous update based on the accumulating knowledge from the implementation of

  14. A Genome-Wide RNAi Screen for Factors Involved in Neuronal Specification in Caenorhabditis elegans.

    Directory of Open Access Journals (Sweden)

    Richard J Poole

    2011-06-01

    Full Text Available One of the central goals of developmental neurobiology is to describe and understand the multi-tiered molecular events that control the progression of a fertilized egg to a terminally differentiated neuron. In the nematode Caenorhabditis elegans, the progression from egg to terminally differentiated neuron has been visually traced by lineage analysis. For example, the two gustatory neurons ASEL and ASER, a bilaterally symmetric neuron pair that is functionally lateralized, are generated from a fertilized egg through an invariant sequence of 11 cellular cleavages that occur stereotypically along specific cleavage planes. Molecular events that occur along this developmental pathway are only superficially understood. We take here an unbiased, genome-wide approach to identify genes that may act at any stage to ensure the correct differentiation of ASEL. Screening a genome-wide RNAi library that knocks-down 18,179 genes (94% of the genome, we identified 245 genes that affect the development of the ASEL neuron, such that the neuron is either not generated, its fate is converted to that of another cell, or cells from other lineage branches now adopt ASEL fate. We analyze in detail two factors that we identify from this screen: (1 the proneural gene hlh-14, which we find to be bilaterally expressed in the ASEL/R lineages despite their asymmetric lineage origins and which we find is required to generate neurons from several lineage branches including the ASE neurons, and (2 the COMPASS histone methyltransferase complex, which we find to be a critical embryonic inducer of ASEL/R asymmetry, acting upstream of the previously identified miRNA lsy-6. Our study represents the first comprehensive, genome-wide analysis of a single neuronal cell fate decision. The results of this analysis provide a starting point for future studies that will eventually lead to a more complete understanding of how individual neuronal cell types are generated from a single

  15. Genome-wide association studies in an isolated founder population from the Pacific Island of Kosrae.

    Directory of Open Access Journals (Sweden)

    Jennifer K Lowe

    2009-02-01

    Full Text Available It has been argued that the limited genetic diversity and reduced allelic heterogeneity observed in isolated founder populations facilitates discovery of loci contributing to both Mendelian and complex disease. A strong founder effect, severe isolation, and substantial inbreeding have dramatically reduced genetic diversity in natives from the island of Kosrae, Federated States of Micronesia, who exhibit a high prevalence of obesity and other metabolic disorders. We hypothesized that genetic drift and possibly natural selection on Kosrae might have increased the frequency of previously rare genetic variants with relatively large effects, making these alleles readily detectable in genome-wide association analysis. However, mapping in large, inbred cohorts introduces analytic challenges, as extensive relatedness between subjects violates the assumptions of independence upon which traditional association test statistics are based. We performed genome-wide association analysis for 15 quantitative traits in 2,906 members of the Kosrae population, using novel approaches to manage the extreme relatedness in the sample. As positive controls, we observe association to known loci for plasma cholesterol, triglycerides, and C-reactive protein and to a compelling candidate loci for thyroid stimulating hormone and fasting plasma glucose. We show that our study is well powered to detect common alleles explaining >/=5% phenotypic variance. However, no such large effects were observed with genome-wide significance, arguing that even in such a severely inbred population, common alleles typically have modest effects. Finally, we show that a majority of common variants discovered in Caucasians have indistinguishable effect sizes on Kosrae, despite the major differences in population genetics and environment.

  16. Utilizing twins as controls for non-twin case-materials in genome wide association studies.

    Directory of Open Access Journals (Sweden)

    Andrea Ganna

    Full Text Available Twin registries around the globe have collected DNA samples from large numbers of monozygotic and dizygotic twins. The twin sample collections are frequently used as controls in disease-specific studies together with non-twins. This approach is unbiased under the hypothesis that twins and singletons are comparable in terms of allele frequencies; i.e. there are no genetic variants associated with being a twin per se. To test this hypothesis we performed a genome-wide association study comparing the allele frequency of 572,352 single nucleotide polymorphisms (SNPs in 1,413 monozygotic (MZ and 5,451 dizygotic (DZ twins with 3,720 healthy singletons. Twins and singletons have been genotyped using the same platform. SNPs showing association with being a twin at P-value < 1 × 10(-5 were selected for replication analysis in 1,492 twins (463 MZ and 1,029 DZ and 1,880 singletons from Finland. No SNPs reached genome-wide significance (P-value < 5 × 10(-8 in the main analysis combining MZ and DZ twins. In a secondary analysis including only DZ twins two SNPs (rs2033541 close to ADAMTSL1 and rs4149283 close to ABCA1 were genome-wide significant after meta-analysis with the Finnish population. The estimated proportion of variance on the liability scale explained by all SNPs was 0.08 (P-value=0.003 when MZ and DZ were considered together and smaller for MZ (0.06, P-value=0.10 compared to DZ (0.09, P-value=0.003 when analyzed separately. In conclusion, twins and singletons can be used in genetic studies together with general population samples without introducing large bias. Further research is needed to explore genetic variances associated with DZ twinning.

  17. Genome-wide association study of coronary and aortic calcification in lung cancer screening CT

    Science.gov (United States)

    de Vos, Bob D.; van Setten, Jessica; de Jong, Pim A.; Mali, Willem P.; Oudkerk, Matthijs; Viergever, Max A.; Išgum, Ivana

    2016-03-01

    Arterial calcification has been related to cardiovascular disease (CVD) and osteoporosis. However, little is known about the role of genetics and exact pathways leading to arterial calcification and its relation to bone density changes indicating osteoporosis. In this study, we conducted a genome-wide association study of arterial calcification burden, followed by a look-up of known single nucleotide polymorphisms (SNPs) for coronary artery disease (CAD) and myocardial infarction (MI), and bone mineral density (BMD) to test for a shared genetic basis between the traits. The study included a subcohort of the Dutch-Belgian lung cancer screening trial comprised of 2,561 participants. Participants underwent baseline CT screening in one of two hospitals participating in the trial. Low-dose chest CT images were acquired without contrast enhancement and without ECG-synchronization. In these images coronary and aortic calcifications were identified automatically. Subsequently, the detected calcifications were quantified using coronary artery calcium Agatston and volume scores. Genotype data was available for these participants. A genome-wide association study was conducted on 10,220,814 SNPs using a linear regression model. To reduce multiple testing burden, known CAD/MI and BMD SNPs were specifically tested (45 SNPs from the CARDIoGRAMplusC4D consortium and 60 SNPS from the GEFOS consortium). No novel significant SNPs were found. Significant enrichment for CAD/MI SNPs was observed in testing Agatston and coronary artery calcium volume scores. Moreover, a significant enrichment of BMD SNPs was shown in aortic calcium volume scores. This may indicate genetic relation of BMD SNPs and arterial calcification burden.

  18. Developments in obesity genetics in the era of genome-wide association studies.

    Science.gov (United States)

    Day, Felix R; Loos, Ruth J F

    2011-01-01

    Obesity is an important factor contributing to the global burden of morbidity and mortality. By identifying obesity susceptibility genes, scientists aim to elucidate some of its aetiology. Early studies used candidate gene and genome-wide linkage approaches to search for such genes with limited success. However, the advent of genome-wide association studies (GWAS) has dramatically increased the pace of gene discovery. So far, GWAS have identified at least 50 loci robustly associated with body mass index (BMI), waist-to-hip ratio, body fat percentage and extreme obesity. Some of these have been shown to replicate in non-white populations and in children and adolescents. Furthermore, for some loci interaction studies have shown that the BMI-increasing effect is attenuated in physically active individuals. Despite many successful discoveries, the effect sizes of the established loci are small, and combined they explain only a fraction of the inter-individual variation in BMI. The low predictive value means that their value in mainstream health care is limited. However, as most of these newly established loci were not previously linked to obesity, they may provide new insights into body weight regulation. Continued efforts in gene discovery, using a range of approaches, will be needed to increase our understanding of obesity. Copyright © 2011 S. Karger AG, Basel.

  19. GenoGAM: genome-wide generalized additive models for ChIP-Seq analysis.

    Science.gov (United States)

    Stricker, Georg; Engelhardt, Alexander; Schulz, Daniel; Schmid, Matthias; Tresch, Achim; Gagneur, Julien

    2017-08-01

    Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) is a widely used approach to study protein-DNA interactions. Often, the quantities of interest are the differential occupancies relative to controls, between genetic backgrounds, treatments, or combinations thereof. Current methods for differential occupancy of ChIP-Seq data rely however on binning or sliding window techniques, for which the choice of the window and bin sizes are subjective. Here, we present GenoGAM (Genome-wide Generalized Additive Model), which brings the well-established and flexible generalized additive models framework to genomic applications using a data parallelism strategy. We model ChIP-Seq read count frequencies as products of smooth functions along chromosomes. Smoothing parameters are objectively estimated from the data by cross-validation, eliminating ad hoc binning and windowing needed by current approaches. GenoGAM provides base-level and region-level significance testing for full factorial designs. Application to a ChIP-Seq dataset in yeast showed increased sensitivity over existing differential occupancy methods while controlling for type I error rate. By analyzing a set of DNA methylation data and illustrating an extension to a peak caller, we further demonstrate the potential of GenoGAM as a generic statistical modeling tool for genome-wide assays. Software is available from Bioconductor: https://www.bioconductor.org/packages/release/bioc/html/GenoGAM.html . gagneur@in.tum.de. Supplementary information is available at Bioinformatics online.

  20. A genome-wide SNP scan accelerates trait-regulatory genomic loci identification in chickpea

    Science.gov (United States)

    Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C.L.L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

    2015-01-01

    We identified 44844 high-quality SNPs by sequencing 92 diverse chickpea accessions belonging to a seed and pod trait-specific association panel using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays. A GWAS (genome-wide association study) in an association panel of 211, including the 92 sequenced accessions, identified 22 major genomic loci showing significant association (explaining 23–47% phenotypic variation) with pod and seed number/plant and 100-seed weight. Eighteen trait-regulatory major genomic loci underlying 13 robust QTLs were validated and mapped on an intra-specific genetic linkage map by QTL mapping. A combinatorial approach of GWAS, QTL mapping and gene haplotype-specific LD mapping and transcript profiling uncovered one superior haplotype and favourable natural allelic variants in the upstream regulatory region of a CesA-type cellulose synthase (Ca_Kabuli_CesA3) gene regulating high pod and seed number/plant (explaining 47% phenotypic variation) in chickpea. The up-regulation of this superior gene haplotype correlated with increased transcript expression of Ca_Kabuli_CesA3 gene in the pollen and pod of high pod/seed number accession, resulting in higher cellulose accumulation for normal pollen and pollen tube growth. A rapid combinatorial genome-wide SNP genotyping-based approach has potential to dissect complex quantitative agronomic traits and delineate trait-regulatory genomic loci (candidate genes) for genetic enhancement in crop plants, including chickpea. PMID:26058368

  1. Genome-wide detection of genes targeted by non-Ig somatic hypermutation in lymphoma.

    Directory of Open Access Journals (Sweden)

    Yanwen Jiang

    Full Text Available The processes of somatic hypermutation (SHM and class switch recombination introduced by activation-induced cytosine deaminase (AICDA at the Immunoglobulin (Ig loci are key steps for creating a pool of diversified antibodies in germinal center B cells (GCBs. Unfortunately, AICDA can also accidentally introduce mutations at bystander loci, particularly within the 5' regulatory regions of proto-oncogenes relevant to diffuse large B cell lymphomas (DLBCL. Since current methods for genomewide sequencing such as Exon Capture and RNAseq only target mutations in coding regions, to date non-Ig promoter SHMs have been studied only in a handful genes. We designed a novel approach integrating bioinformatics tools with next generation sequencing technology to identify regulatory loci targeted by SHM genome-wide. We observed increased numbers of SHM associated sequence variant hotspots in lymphoma cells as compared to primary normal germinal center B cells. Many of these SHM hotspots map to genes that have not been reported before as mutated, including BACH2, BTG2, CXCR4, CIITA, EBF1, PIM2, and TCL1A, etc., all of which have potential roles in B cell survival, differentiation, and malignant transformation. In addition, using BCL6 and BACH2 as examples, we demonstrated that SHM sites identified in these 5' regulatory regions greatly altered their transcription activities in a reporter assay. Our approach provides a first cost-efficient, genome-wide method to identify regulatory mutations and non-Ig SHM hotspots.

  2. Genome-Wide Association Study Identifies Novel Loci Associated With Diisocyanate-Induced Occupational Asthma

    Science.gov (United States)

    Yucesoy, Berran; Kaufman, Kenneth M.; Lummus, Zana L.; Weirauch, Matthew T.; Zhang, Ge; Cartier, André; Boulet, Louis-Philippe; Sastre, Joaquin; Quirce, Santiago; Tarlo, Susan M.; Cruz, Maria-Jesus; Munoz, Xavier; Harley, John B.; Bernstein, David I.

    2015-01-01

    Diisocyanates, reactive chemicals used to produce polyurethane products, are the most common causes of occupational asthma. The aim of this study is to identify susceptibility gene variants that could contribute to the pathogenesis of diisocyanate asthma (DA) using a Genome-Wide Association Study (GWAS) approach. Genome-wide single nucleotide polymorphism (SNP) genotyping was performed in 74 diisocyanate-exposed workers with DA and 824 healthy controls using Omni-2.5 and Omni-5 SNP microarrays. We identified 11 SNPs that exceeded genome-wide significance; the strongest association was for the rs12913832 SNP located on chromosome 15, which has been mapped to the HERC2 gene (p = 6.94 × 10−14). Strong associations were also found for SNPs near the ODZ3 and CDH17 genes on chromosomes 4 and 8 (rs908084, p = 8.59 × 10−9 and rs2514805, p = 1.22 × 10−8, respectively). We also prioritized 38 SNPs with suggestive genome-wide significance (p < 1 × 10−6). Among them, 17 SNPs map to the PITPNC1, ACMSD, ZBTB16, ODZ3, and CDH17 gene loci. Functional genomics data indicate that 2 of the suggestive SNPs (rs2446823 and rs2446824) are located within putative binding sites for the CCAAT/Enhancer Binding Protein (CEBP) and Hepatocyte Nuclear Factor 4, Alpha transcription factors (TFs), respectively. This study identified SNPs mapping to the HERC2, CDH17, and ODZ3 genes as potential susceptibility loci for DA. Pathway analysis indicated that these genes are associated with antigen processing and presentation, and other immune pathways. Overlap of 2 suggestive SNPs with likely TF binding sites suggests possible roles in disruption of gene regulation. These results provide new insights into the genetic architecture of DA and serve as a basis for future functional and mechanistic studies. PMID:25918132

  3. Comparing State SAT Scores Using a Mixture Modeling Approach

    Science.gov (United States)

    Kim, YoungKoung Rachel

    2009-01-01

    Presented at the national conference for AERA (American Educational Research Association) in April 2009. The large variability of SAT taker population across states makes state-by-state comparisons of the SAT scores challenging. Using a mixture modeling approach, therefore, the current study presents a method of identifying subpopulations in terms…

  4. Semantically enabling a genome-wide association study database

    Directory of Open Access Journals (Sweden)

    Beck Tim

    2012-12-01

    Full Text Available Abstract Background The amount of data generated from genome-wide association studies (GWAS has grown rapidly, but considerations for GWAS phenotype data reuse and interchange have not kept pace. This impacts on the work of GWAS Central – a free and open access resource for the advanced querying and comparison of summary-level genetic association data. The benefits of employing ontologies for standardising and structuring data are widely accepted. The complex spectrum of observed human phenotypes (and traits, and the requirement for cross-species phenotype comparisons, calls for reflection on the most appropriate solution for the organisation of human phenotype data. The Semantic Web provides standards for the possibility of further integration of GWAS data and the ability to contribute to the web of Linked Data. Results A pragmatic consideration when applying phenotype ontologies to GWAS data is the ability to retrieve all data, at the most granular level possible, from querying a single ontology graph. We found the Medical Subject Headings (MeSH terminology suitable for describing all traits (diseases and medical signs and symptoms at various levels of granularity and the Human Phenotype Ontology (HPO most suitable for describing phenotypic abnormalities (medical signs and symptoms at the most granular level. Diseases within MeSH are mapped to HPO to infer the phenotypic abnormalities associated with diseases. Building on the rich semantic phenotype annotation layer, we are able to make cross-species phenotype comparisons and publish a core subset of GWAS data as RDF nanopublications. Conclusions We present a methodology for applying phenotype annotations to a comprehensive genome-wide association dataset and for ensuring compatibility with the Semantic Web. The annotations are used to assist with cross-species genotype and phenotype comparisons. However, further processing and deconstructions of terms may be required to facilitate automatic

  5. A Probabilistic Genome-Wide Gene Reading Frame Sequence Model

    DEFF Research Database (Denmark)

    Have, Christian Theil; Mørk, Søren

    We introduce a new type of probabilistic sequence model, that model the sequential composition of reading frames of genes in a genome. Our approach extends gene finders with a model of the sequential composition of genes at the genome-level -- effectively producing a sequential genome annotation...... and are evaluated by the effect on prediction performance. Since bacterial gene finding to a large extent is a solved problem it forms an ideal proving ground for evaluating the explicit modeling of larger scale gene sequence composition of genomes. We conclude that the sequential composition of gene reading frames...... as output. The model can be used to obtain the most probable genome annotation based on a combination of i: a gene finder score of each gene candidate and ii: the sequence of the reading frames of gene candidates through a genome. The model --- as well as a higher order variant --- is developed and tested...

  6. Genome Wide Association Study (GWAS) between Attention Deficit Hyperactivity Disorder (ADHD) and Obsessive Compulsive Disorder (OCD)

    Science.gov (United States)

    Ritter, McKenzie L.; Guo, Wei; Samuels, Jack F.; Wang, Ying; Nestadt, Paul S.; Krasnow, Janice; Greenberg, Benjamin D.; Fyer, Abby J.; McCracken, James T.; Geller, Daniel A.; Murphy, Dennis L.; Knowles, James A.; Grados, Marco A.; Riddle, Mark A.; Rasmussen, Steven A.; McLaughlin, Nicole C.; Nurmi, Erika L.; Askland, Kathleen D.; Cullen, Bernadette; Piacentini, John; Pauls, David L.; Bienvenu, Joseph; Stewart, Evelyn; Goes, Fernando S.; Maher, Brion; Pulver, Ann E.; Mattheisen, Manuel; Qian, Ji; Nestadt, Gerald; Shugart, Yin Yao

    2017-01-01

    Objective: The aim of this study was to identify any potential genetic overlap between attention deficit hyperactivity disorder (ADHD) and obsessive compulsive disorder (OCD). We hypothesized that since these disorders share a sub-phenotype, they may share common risk alleles. In this manuscript, we report the overlap found between these two disorders. Methods: A meta-analysis was conducted between ADHD and OCD, and polygenic risk scores (PRS) were calculated for both disorders. In addition, a protein-protein analysis was completed in order to examine the interactions between proteins; p-values for the protein-protein interaction analysis was calculated using permutation. Conclusion: None of the single nucleotide polymorphisms (SNPs) reached genome wide significance and there was little evidence of genetic overlap between ADHD and OCD. PMID:28386217

  7. Characterization of genome-wide SNPs for the water flea Daphnia pulicaria generated by genotyping-by-sequencing (GBS).

    Science.gov (United States)

    Muñoz, Joaquín; Chaturvedi, Anurag; De Meester, Luc; Weider, Lawrence J

    2016-06-27

    The keystone aquatic herbivore Daphnia has been studied for more than 150 years in the context of evolution, ecology and ecotoxicology. Although it is rapidly becoming an emergent model for environmental and population genomics, there have been limited genome-wide level studies in natural populations. We report a unique resource of novel Single Nucleotide Polymorphic (SNP) markers for Daphnia pulicaria using the reduction in genomic complexity with the restriction enzymes approach, genotyping-by-sequencing. Using the genome of D. pulex as a reference, SNPs were scored for 53 clones from five natural populations that varied in lake trophic status. Our analyses resulted in 32,313 highly confident and bi-allelic SNP markers. 1,364 outlier SNPs were mapped on the annotated D. pulex genome, which identified 2,335 genes, including 565 within functional genes. Out of 885 EuKaryotic Orthologous Groups that we found from outlier SNPs, 294 were involved in three metabolic and four regulatory pathways. Bayesian-clustering analyses showed two distinct population clusters representing the possible combined effects of geography and lake trophic status. Our results provide an invaluable tool for future population genomics surveys in Daphnia targeting informative regions related to physiological processes that can be linked to the ecology of this emerging eco-responsive taxon.

  8. Characterization of genome-wide SNPs for the water flea Daphnia pulicaria generated by genotyping-by-sequencing (GBS)

    Science.gov (United States)

    Muñoz, Joaquín; Chaturvedi, Anurag; De Meester, Luc; Weider, Lawrence J.

    2016-01-01

    The keystone aquatic herbivore Daphnia has been studied for more than 150 years in the context of evolution, ecology and ecotoxicology. Although it is rapidly becoming an emergent model for environmental and population genomics, there have been limited genome-wide level studies in natural populations. We report a unique resource of novel Single Nucleotide Polymorphic (SNP) markers for Daphnia pulicaria using the reduction in genomic complexity with the restriction enzymes approach, genotyping-by-sequencing. Using the genome of D. pulex as a reference, SNPs were scored for 53 clones from five natural populations that varied in lake trophic status. Our analyses resulted in 32,313 highly confident and bi-allelic SNP markers. 1,364 outlier SNPs were mapped on the annotated D. pulex genome, which identified 2,335 genes, including 565 within functional genes. Out of 885 EuKaryotic Orthologous Groups that we found from outlier SNPs, 294 were involved in three metabolic and four regulatory pathways. Bayesian-clustering analyses showed two distinct population clusters representing the possible combined effects of geography and lake trophic status. Our results provide an invaluable tool for future population genomics surveys in Daphnia targeting informative regions related to physiological processes that can be linked to the ecology of this emerging eco-responsive taxon. PMID:27346179

  9. Genome-wide significant risk associations for mucinous ovarian carcinoma

    Science.gov (United States)

    Kelemen, Linda E.; Lawrenson, Kate; Tyrer, Jonathan; Li, Qiyuan; M. Lee, Janet; Seo, Ji-Heui; Phelan, Catherine M.; Beesley, Jonathan; Chen, Xiaoqin; Spindler, Tassja J.; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Baker, Helen; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chen, Y. Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Engelholm, Svend Aage; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Kjaer, Susanne K.; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moes-Sosnowska, Joanna; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Paul, James; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Wlodzimierz, Sawicki; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A.; Freedman, Matthew L.; Chenevix-Trench, Georgia; Pharoah, Paul D.; Gayther, Simon A.; Berchuck, Andrew

    2015-01-01

    Genome-wide association studies have identified several risk associations for ovarian carcinomas (OC) but not for mucinous ovarian carcinomas (MOC). Genotypes from OC cases and controls were imputed into the 1000 Genomes Project reference panel. Analysis of 1,644 MOC cases and 21,693 controls identified three novel risk associations: rs752590 at 2q13 (P = 3.3 × 10−8), rs711830 at 2q31.1 (P = 7.5 × 10−12) and rs688187 at 19q13.2 (P = 6.8 × 10−13). Expression Quantitative Trait Locus (eQTL) analysis in ovarian and colorectal tumors (which are histologically similar to MOC) identified significant eQTL associations for HOXD9 at 2q31.1 in ovarian (P = 4.95 × 10−4, FDR = 0.003) and colorectal (P = 0.01, FDR = 0.09) tumors, and for PAX8 at 2q13 in colorectal tumors (P = 0.03, FDR = 0.09). Chromosome conformation capture analysis identified interactions between the HOXD9 promoter and risk SNPs at 2q31.1. Overexpressing HOXD9 in MOC cells augmented the neoplastic phenotype. These findings provide the first evidence for MOC susceptibility variants and insights into the underlying biology of the disease. PMID:26075790

  10. Genome-Wide Analysis of Human MicroRNA Stability

    Directory of Open Access Journals (Sweden)

    Yang Li

    2013-01-01

    Full Text Available Increasing studies have shown that microRNA (miRNA stability plays important roles in physiology. However, the global picture of miRNA stability remains largely unknown. Here, we had analyzed genome-wide miRNA stability across 10 diverse cell types using miRNA arrays. We found that miRNA stability shows high dynamics and diversity both within individual cells and across cell types. Strikingly, we observed a negative correlation between miRNA stability and miRNA expression level, which is different from current findings on other biological molecules such as proteins and mRNAs that show positive and not negative correlations between stability and expression level. This finding indicates that miRNA has a distinct action mode, which we called “rapid production, rapid turnover; slow production, slow turnover.” This mode further suggests that high expression miRNAs normally degrade fast and may endow the cell with special properties that facilitate cellular status-transition. Moreover, we revealed that the stability of miRNAs is affected by cohorts of factors that include miRNA targets, transcription factors, nucleotide content, evolution, associated disease, and environmental factors. Together, our results provided an extensive description of the global landscape, dynamics, and distinct mode of human miRNA stability, which provide help in investigating their functions in physiology and pathophysiology.

  11. Genome-wide association study and premature ovarian failure.

    Science.gov (United States)

    Christin-Maitre, S; Tachdjian, G

    2010-05-01

    Premature ovarian failure (POF) is defined as an amenorrhea for more than 4months, associated with elevated gonadotropins, usually higher than 20mIU/ml, occurring in a woman before the age of 40. Some candidate genes have been identified in the past 15years, such as FOXL2, FSHR, BMP15, GDF9, Xfra premutation. However, POF etiology remains unknown in more than 90% of cases. The first strategy to identify candidate gene, apart from studying genes involved in ovarian failure in animal models, relies on the study of X chromosome deletions and X;autosome translocations in patients. The second strategy is based on linkage analysis, the third one on Comparative Genomic Hybridization (CGH) array. The latest strategy relies on Genome-Wide Association Studies (GWAS). This technique consists in screening single nucleotide polymorphisms (SNPs) in patients and controls. So far, three studies have been performed and have identified different loci potentially linked to POF, such as PTHB1 and ADAMTS19. However, replications in independent cohorts need to be performed. GWAS studies on large cohorts of women with POF should find new candidate genes in the near future.

  12. Reconstructing Roma history from genome-wide data.

    Directory of Open Access Journals (Sweden)

    Priya Moorjani

    Full Text Available The Roma people, living throughout Europe and West Asia, are a diverse population linked by the Romani language and culture. Previous linguistic and genetic studies have suggested that the Roma migrated into Europe from South Asia about 1,000-1,500 years ago. Genetic inferences about Roma history have mostly focused on the Y chromosome and mitochondrial DNA. To explore what additional information can be learned from genome-wide data, we analyzed data from six Roma groups that we genotyped at hundreds of thousands of single nucleotide polymorphisms (SNPs. We estimate that the Roma harbor about 80% West Eurasian ancestry-derived from a combination of European and South Asian sources-and that the date of admixture of South Asian and European ancestry was about 850 years before present. We provide evidence for Eastern Europe being a major source of European ancestry, and North-west India being a major source of the South Asian ancestry in the Roma. By computing allele sharing as a measure of linkage disequilibrium, we estimate that the migration of Roma out of the Indian subcontinent was accompanied by a severe founder event, which appears to have been followed by a major demographic expansion after the arrival in Europe.

  13. Genome-Wide Association Study of Serum Selenium Concentrations

    Directory of Open Access Journals (Sweden)

    Ulrike Peters

    2013-05-01

    Full Text Available Selenium is an essential trace element and circulating selenium concentrations have been associated with a wide range of diseases. Candidate gene studies suggest that circulating selenium concentrations may be impacted by genetic variation; however, no study has comprehensively investigated this hypothesis. Therefore, we conducted a two-stage genome-wide association study to identify genetic variants associated with serum selenium concentrations in 1203 European descents from two cohorts: the Prostate, Lung, Colorectal, and Ovarian (PLCO Cancer Screening and the Women’s Health Initiative (WHI. We tested association between 2,474,333 single nucleotide polymorphisms (SNPs and serum selenium concentrations using linear regression models. In the first stage (PLCO 41 SNPs clustered in 15 regions had p < 1 × 10−5. None of these 41 SNPs reached the significant threshold (p = 0.05/15 regions = 0.003 in the second stage (WHI. Three SNPs had p < 0.05 in the second stage (rs1395479 and rs1506807 in 4q34.3/AGA-NEIL3; and rs891684 in 17q24.3/SLC39A11 and had p between 2.62 × 10−7 and 4.04 × 10−7 in the combined analysis (PLCO + WHI. Additional studies are needed to replicate these findings. Identification of genetic variation that impacts selenium concentrations may contribute to a better understanding of which genes regulate circulating selenium concentrations.

  14. Genome-wide identification of KANADI1 target genes.

    Directory of Open Access Journals (Sweden)

    Paz Merelo

    Full Text Available Plant organ development and polarity establishment is mediated by the action of several transcription factors. Among these, the KANADI (KAN subclade of the GARP protein family plays important roles in polarity-associated processes during embryo, shoot and root patterning. In this study, we have identified a set of potential direct target genes of KAN1 through a combination of chromatin immunoprecipitation/DNA sequencing (ChIP-Seq and genome-wide transcriptional profiling using tiling arrays. Target genes are over-represented for genes involved in the regulation of organ development as well as in the response to auxin. KAN1 affects directly the expression of several genes previously shown to be important in the establishment of polarity during lateral organ and vascular tissue development. We also show that KAN1 controls through its target genes auxin effects on organ development at different levels: transport and its regulation, and signaling. In addition, KAN1 regulates genes involved in the response to abscisic acid, jasmonic acid, brassinosteroids, ethylene, cytokinins and gibberellins. The role of KAN1 in organ polarity is antagonized by HD-ZIPIII transcription factors, including REVOLUTA (REV. A comparison of their target genes reveals that the REV/KAN1 module acts in organ patterning through opposite regulation of shared targets. Evidence of mutual repression between closely related family members is also shown.

  15. Genome-Wide Analysis of DNA Methylation in Human Amnion

    Directory of Open Access Journals (Sweden)

    Jinsil Kim

    2013-01-01

    Full Text Available The amnion is a specialized tissue in contact with the amniotic fluid, which is in a constantly changing state. To investigate the importance of epigenetic events in this tissue in the physiology and pathophysiology of pregnancy, we performed genome-wide DNA methylation profiling of human amnion from term (with and without labor and preterm deliveries. Using the Illumina Infinium HumanMethylation27 BeadChip, we identified genes exhibiting differential methylation associated with normal labor and preterm birth. Functional analysis of the differentially methylated genes revealed biologically relevant enriched gene sets. Bisulfite sequencing analysis of the promoter region of the oxytocin receptor (OXTR gene detected two CpG dinucleotides showing significant methylation differences among the three groups of samples. Hypermethylation of the CpG island of the solute carrier family 30 member 3 (SLC30A3 gene in preterm amnion was confirmed by methylation-specific PCR. This work provides preliminary evidence that DNA methylation changes in the amnion may be at least partially involved in the physiological process of labor and the etiology of preterm birth and suggests that DNA methylation profiles, in combination with other biological data, may provide valuable insight into the mechanisms underlying normal and pathological pregnancies.

  16. Psoriasis prediction from genome-wide SNP profiles

    Directory of Open Access Journals (Sweden)

    Fang Xiangzhong

    2011-01-01

    Full Text Available Abstract Background With the availability of large-scale genome-wide association study (GWAS data, choosing an optimal set of SNPs for disease susceptibility prediction is a challenging task. This study aimed to use single nucleotide polymorphisms (SNPs to predict psoriasis from searching GWAS data. Methods Totally we had 2,798 samples and 451,724 SNPs. Process for searching a set of SNPs to predict susceptibility for psoriasis consisted of two steps. The first one was to search top 1,000 SNPs with high accuracy for prediction of psoriasis from GWAS dataset. The second one was to search for an optimal SNP subset for predicting psoriasis. The sequential information bottleneck (sIB method was compared with classical linear discriminant analysis(LDA for classification performance. Results The best test harmonic mean of sensitivity and specificity for predicting psoriasis by sIB was 0.674(95% CI: 0.650-0.698, while only 0.520(95% CI: 0.472-0.524 was reported for predicting disease by LDA. Our results indicate that the new classifier sIB performs better than LDA in the study. Conclusions The fact that a small set of SNPs can predict disease status with average accuracy of 68% makes it possible to use SNP data for psoriasis prediction.

  17. Reducing dimensionality for prediction of genome-wide breeding values

    Directory of Open Access Journals (Sweden)

    Woolliams John A

    2009-03-01

    Full Text Available Abstract Partial least square regression (PLSR and principal component regression (PCR are methods designed for situations where the number of predictors is larger than the number of records. The aim was to compare the accuracy of genome-wide breeding values (EBV produced using PLSR and PCR with a Bayesian method, 'BayesB'. Marker densities of 1, 2, 4 and 8 Ne markers/Morgan were evaluated when the effective population size (Ne was 100. The correlation between true breeding value and estimated breeding value increased with density from 0.611 to 0.681 and 0.604 to 0.658 using PLSR and PCR respectively, with an overall advantage to PLSR of 0.016 (s.e = 0.008. Both methods gave a lower accuracy compared to the 'BayesB', for which accuracy increased from 0.690 to 0.860. PLSR and PCR appeared less responsive to increased marker density with the advantage of 'BayesB' increasing by 17% from a marker density of 1 to 8Ne/M. PCR and PLSR showed greater bias than 'BayesB' in predicting breeding values at all densities. Although, the PLSR and PCR were computationally faster and simpler, these advantages do not outweigh the reduction in accuracy, and there is a benefit in obtaining relevant prior information from the distribution of gene effects.

  18. A genome wide dosage suppressor network reveals genomic robustness

    Science.gov (United States)

    Patra, Biranchi; Kon, Yoshiko; Yadav, Gitanjali; Sevold, Anthony W.; Frumkin, Jesse P.; Vallabhajosyula, Ravishankar R.; Hintze, Arend; Østman, Bjørn; Schossau, Jory; Bhan, Ashish; Marzolf, Bruz; Tamashiro, Jenna K.; Kaur, Amardeep; Baliga, Nitin S.; Grayhack, Elizabeth J.; Adami, Christoph; Galas, David J.; Raval, Alpan; Phizicky, Eric M.; Ray, Animesh

    2017-01-01

    Genomic robustness is the extent to which an organism has evolved to withstand the effects of deleterious mutations. We explored the extent of genomic robustness in budding yeast by genome wide dosage suppressor analysis of 53 conditional lethal mutations in cell division cycle and RNA synthesis related genes, revealing 660 suppressor interactions of which 642 are novel. This collection has several distinctive features, including high co-occurrence of mutant-suppressor pairs within protein modules, highly correlated functions between the pairs and higher diversity of functions among the co-suppressors than previously observed. Dosage suppression of essential genes encoding RNA polymerase subunits and chromosome cohesion complex suggests a surprising degree of functional plasticity of macromolecular complexes, and the existence of numerous degenerate pathways for circumventing the effects of potentially lethal mutations. These results imply that organisms and cancer are likely able to exploit the genomic robustness properties, due the persistence of cryptic gene and pathway functions, to generate variation and adapt to selective pressures. PMID:27899637

  19. Genome-wide transcriptome analysis of 150 cell samples†

    Science.gov (United States)

    Russom, Aman; Xiao, Wenzhong; Wilhelmy, Julie; Wang, Shenglong; Heath, Joe Don; Kurn, Nurith; Tompkins, Ronald G.; Davis, Ronald W.; Toner, Mehmet

    2013-01-01

    A major challenge in molecular biology is interrogating the human transcriptome on a genome wide scale when only a limited amount of biological sample is available for analysis. Current methodologies using microarray technologies for simultaneously monitoring mRNA transcription levels require nanogram amounts of total RNA. To overcome the sample size limitation of current technologies, we have developed a method to probe the global gene expression in biological samples as small as 150 cells, or the equivalent of approximately 300 pg total RNA. The new method employs microfluidic devices for the purification of total RNA from mammalian cells and ultra-sensitive whole transcriptome amplification techniques. We verified that the RNA integrity is preserved through the isolation process, accomplished highly reproducible whole transcriptome analysis, and established high correlation between repeated isolations of 150 cells and the same cell culture sample. We validated the technology by demonstrating that the combined microfluidic and amplification protocol is capable of identifying biological pathways perturbed by stimulation, which are consistent with the information recognized in bulk-isolated samples. PMID:20023796

  20. Genome-wide transcriptome analysis of 150 cell samples.

    Science.gov (United States)

    Irimia, Daniel; Mindrinos, Michael; Russom, Aman; Xiao, Wenzhong; Wilhelmy, Julie; Wang, Shenglong; Heath, Joe Don; Kurn, Nurith; Tompkins, Ronald G; Davis, Ronald W; Toner, Mehmet

    2009-01-01

    A major challenge in molecular biology is interrogating the human transcriptome on a genome wide scale when only a limited amount of biological sample is available for analysis. Current methodologies using microarray technologies for simultaneously monitoring mRNA transcription levels require nanogram amounts of total RNA. To overcome the sample size limitation of current technologies, we have developed a method to probe the global gene expression in biological samples as small as 150 cells, or the equivalent of approximately 300 pg total RNA. The new method employs microfluidic devices for the purification of total RNA from mammalian cells and ultra-sensitive whole transcriptome amplification techniques. We verified that the RNA integrity is preserved through the isolation process, accomplished highly reproducible whole transcriptome analysis, and established high correlation between repeated isolations of 150 cells and the same cell culture sample. We validated the technology by demonstrating that the combined microfluidic and amplification protocol is capable of identifying biological pathways perturbed by stimulation, which are consistent with the information recognized in bulk-isolated samples.

  1. Reconstructing Roma History from Genome-Wide Data

    Science.gov (United States)

    Moorjani, Priya; Patterson, Nick; Loh, Po-Ru; Lipson, Mark; Kisfali, Péter; Melegh, Bela I.; Bonin, Michael; Kádaši, Ľudevít; Rieß, Olaf; Berger, Bonnie; Reich, David; Melegh, Béla

    2013-01-01

    The Roma people, living throughout Europe and West Asia, are a diverse population linked by the Romani language and culture. Previous linguistic and genetic studies have suggested that the Roma migrated into Europe from South Asia about 1,000–1,500 years ago. Genetic inferences about Roma history have mostly focused on the Y chromosome and mitochondrial DNA. To explore what additional information can be learned from genome-wide data, we analyzed data from six Roma groups that we genotyped at hundreds of thousands of single nucleotide polymorphisms (SNPs). We estimate that the Roma harbor about 80% West Eurasian ancestry–derived from a combination of European and South Asian sources–and that the date of admixture of South Asian and European ancestry was about 850 years before present. We provide evidence for Eastern Europe being a major source of European ancestry, and North-west India being a major source of the South Asian ancestry in the Roma. By computing allele sharing as a measure of linkage disequilibrium, we estimate that the migration of Roma out of the Indian subcontinent was accompanied by a severe founder event, which appears to have been followed by a major demographic expansion after the arrival in Europe. PMID:23516520

  2. Genome-wide association study of circulating retinol levels.

    Science.gov (United States)

    Mondul, Alison M; Yu, Kai; Wheeler, William; Zhang, Hong; Weinstein, Stephanie J; Major, Jacqueline M; Cornelis, Marilyn C; Männistö, Satu; Hazra, Aditi; Hsing, Ann W; Jacobs, Kevin B; Eliassen, Heather; Tanaka, Toshiko; Reding, Douglas J; Hendrickson, Sara; Ferrucci, Luigi; Virtamo, Jarmo; Hunter, David J; Chanock, Stephen J; Kraft, Peter; Albanes, Demetrius

    2011-12-01

    Retinol is one of the most biologically active forms of vitamin A and is hypothesized to influence a wide range of human diseases including asthma, cardiovascular disease, infectious diseases and cancer. We conducted a genome-wide association study of 5006 Caucasian individuals drawn from two cohorts of men: the Alpha-Tocopherol, Beta-Carotene Cancer Prevention (ATBC) Study and the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial. We identified two independent single-nucleotide polymorphisms associated with circulating retinol levels, which are located near the transthyretin (TTR) and retinol binding protein 4 (RBP4) genes which encode major carrier proteins of retinol: rs1667255 (P =2.30× 10(-17)) and rs10882272 (P =6.04× 10(-12)). We replicated the association with rs10882272 in RBP4 in independent samples from the Nurses' Health Study and the Invecchiare in Chianti Study (InCHIANTI) that included 3792 women and 504 men (P =9.49× 10(-5)), but found no association for retinol with rs1667255 in TTR among women, thus suggesting evidence for gender dimorphism (P-interaction=1.31× 10(-5)). Discovery of common genetic variants associated with serum retinol levels may provide further insight into the contribution of retinol and other vitamin A compounds to the development of cancer and other complex diseases.

  3. A genome-wide association study in multiple system atrophy

    Science.gov (United States)

    Sailer, Anna; Nalls, Michael A.; Schulte, Claudia; Federoff, Monica; Price, T. Ryan; Lees, Andrew; Ross, Owen A.; Dickson, Dennis W.; Mok, Kin; Mencacci, Niccolo E.; Schottlaender, Lucia; Chelban, Viorica; Ling, Helen; O'Sullivan, Sean S.; Wood, Nicholas W.; Traynor, Bryan J.; Ferrucci, Luigi; Federoff, Howard J.; Mhyre, Timothy R.; Morris, Huw R.; Deuschl, Günther; Quinn, Niall; Widner, Hakan; Albanese, Alberto; Infante, Jon; Bhatia, Kailash P.; Poewe, Werner; Oertel, Wolfgang; Höglinger, Günter U.; Wüllner, Ullrich; Goldwurm, Stefano; Pellecchia, Maria Teresa; Ferreira, Joaquim; Tolosa, Eduardo; Bloem, Bastiaan R.; Rascol, Olivier; Meissner, Wassilios G.; Hardy, John A.; Revesz, Tamas; Holton, Janice L.; Gasser, Thomas; Wenning, Gregor K.; Singleton, Andrew B.

    2016-01-01

    Objective: To identify genetic variants that play a role in the pathogenesis of multiple system atrophy (MSA), we undertook a genome-wide association study (GWAS). Methods: We performed a GWAS with >5 million genotyped and imputed single nucleotide polymorphisms (SNPs) in 918 patients with MSA of European ancestry and 3,864 controls. MSA cases were collected from North American and European centers, one third of which were neuropathologically confirmed. Results: We found no significant loci after stringent multiple testing correction. A number of regions emerged as potentially interesting for follow-up at p < 1 × 10−6, including SNPs in the genes FBXO47, ELOVL7, EDN1, and MAPT. Contrary to previous reports, we found no association of the genes SNCA and COQ2 with MSA. Conclusions: We present a GWAS in MSA. We have identified several potentially interesting gene loci, including the MAPT locus, whose significance will have to be evaluated in a larger sample set. Common genetic variation in SNCA and COQ2 does not seem to be associated with MSA. In the future, additional samples of well-characterized patients with MSA will need to be collected to perform a larger MSA GWAS, but this initial study forms the basis for these next steps. PMID:27629089

  4. Genome-Wide Analysis of DNA Methylation in Human Amnion

    Science.gov (United States)

    Kim, Jinsil; Pitlick, Mitchell M.; Christine, Paul J.; Schaefer, Amanda R.; Saleme, Cesar; Comas, Belén; Cosentino, Viviana; Gadow, Enrique; Murray, Jeffrey C.

    2013-01-01

    The amnion is a specialized tissue in contact with the amniotic fluid, which is in a constantly changing state. To investigate the importance of epigenetic events in this tissue in the physiology and pathophysiology of pregnancy, we performed genome-wide DNA methylation profiling of human amnion from term (with and without labor) and preterm deliveries. Using the Illumina Infinium HumanMethylation27 BeadChip, we identified genes exhibiting differential methylation associated with normal labor and preterm birth. Functional analysis of the differentially methylated genes revealed biologically relevant enriched gene sets. Bisulfite sequencing analysis of the promoter region of the oxytocin receptor (OXTR) gene detected two CpG dinucleotides showing significant methylation differences among the three groups of samples. Hypermethylation of the CpG island of the solute carrier family 30 member 3 (SLC30A3) gene in preterm amnion was confirmed by methylation-specific PCR. This work provides preliminary evidence that DNA methylation changes in the amnion may be at least partially involved in the physiological process of labor and the etiology of preterm birth and suggests that DNA methylation profiles, in combination with other biological data, may provide valuable insight into the mechanisms underlying normal and pathological pregnancies. PMID:23533356

  5. MGAS: a powerful tool for multivariate gene-based genome-wide association analysis.

    Science.gov (United States)

    Van der Sluis, Sophie; Dolan, Conor V; Li, Jiang; Song, Youqiang; Sham, Pak; Posthuma, Danielle; Li, Miao-Xin

    2015-04-01

    Standard genome-wide association studies, testing the association between one phenotype and a large number of single nucleotide polymorphisms (SNPs), are limited in two ways: (i) traits are often multivariate, and analysis of composite scores entails loss in statistical power and (ii) gene-based analyses may be preferred, e.g. to decrease the multiple testing problem. Here we present a new method, multivariate gene-based association test by extended Simes procedure (MGAS), that allows gene-based testing of multivariate phenotypes in unrelated individuals. Through extensive simulation, we show that under most trait-generating genotype-phenotype models MGAS has superior statistical power to detect associated genes compared with gene-based analyses of univariate phenotypic composite scores (i.e. GATES, multiple regression), and multivariate analysis of variance (MANOVA). Re-analysis of metabolic data revealed 32 False Discovery Rate controlled genome-wide significant genes, and 12 regions harboring multiple genes; of these 44 regions, 30 were not reported in the original analysis. MGAS allows researchers to conduct their multivariate gene-based analyses efficiently, and without the loss of power that is often associated with an incorrectly specified genotype-phenotype models. MGAS is freely available in KGG v3.0 (http://statgenpro.psychiatry.hku.hk/limx/kgg/download.php). Access to the metabolic dataset can be requested at dbGaP (https://dbgap.ncbi.nlm.nih.gov/). The R-simulation code is available from http://ctglab.nl/people/sophie_van_der_sluis. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.

  6. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    Energy Technology Data Exchange (ETDEWEB)

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMahon, Katherine D.; Malmstrom, Rex R.

    2014-06-18

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.

  7. Genome-wide association study identifies candidate markers for bull fertility in Holstein dairy cattle.

    Science.gov (United States)

    Peñagaricano, F; Weigel, K A; Khatib, H

    2012-07-01

    The decline in the reproductive efficiency of dairy cattle has become a challenging problem worldwide. Female fertility is now taken into account in breeding goals while generally less attention is given to male fertility. The objective of this study was to perform a genome-wide association study in Holstein bulls to identify genetic variants significantly related to sire conception rate (SCR), a new phenotypic evaluation of bull fertility. The analysis included 1755 sires with SCR data and 38,650 single nucleotide polymorphisms (SNPs) spanning the entire bovine genome. Associations between SNPs and SCR were analyzed using a mixed linear model that included a random polygenic effect and SNP genotype either as a linear covariate or as a categorical variable. A multiple testing correction approach was used to account for the correlation between SNPs because of linkage disequilibrium. After genome-wide correction, eight SNPs showed significant association with SCR. Some of these SNPs are located close to or in the middle of genes with functions related to male fertility, such as the sperm acrosome reaction, chromatin remodeling during the spermatogenesis, and the meiotic process during male germ cell maturation. Some SNPs showed marked dominance effects, which provide more evidence for the relevance of non-additive effects in traits closely related to fitness such as fertility. The results could contribute to the identification of genes and pathways associated with male fertility in dairy cattle.

  8. Genome-wide identification of expression quantitative trait loci (eQTLs in human heart.

    Directory of Open Access Journals (Sweden)

    Tamara T Koopmann

    Full Text Available In recent years genome-wide association studies (GWAS have uncovered numerous chromosomal loci associated with various electrocardiographic traits and cardiac arrhythmia predisposition. A considerable fraction of these loci lie within inter-genic regions. The underlying trait-associated variants likely reside in regulatory regions and exert their effect by modulating gene expression. Hence, the key to unraveling the molecular mechanisms underlying these cardiac traits is to interrogate variants for association with differential transcript abundance by expression quantitative trait locus (eQTL analysis. In this study we conducted an eQTL analysis of human heart. For a total of 129 left ventricular samples that were collected from non-diseased human donor hearts, genome-wide transcript abundance and genotyping was determined using microarrays. Each of the 18,402 transcripts and 897,683 SNP genotypes that remained after pre-processing and stringent quality control were tested for eQTL effects. We identified 771 eQTLs, regulating 429 unique transcripts. Overlaying these eQTLs with cardiac GWAS loci identified novel candidates for studies aimed at elucidating the functional and transcriptional impact of these loci. Thus, this work provides for the first time a comprehensive eQTL map of human heart: a powerful and unique resource that enables systems genetics approaches for the study of cardiac traits.

  9. A powerful test of independent assortment that determines genome-wide significance quickly and accurately.

    Science.gov (United States)

    Stewart, W C L; Hager, V R

    2016-08-01

    In the analysis of DNA sequences on related individuals, most methods strive to incorporate as much information as possible, with little or no attention paid to the issue of statistical significance. For example, a modern workstation can easily handle the computations needed to perform a large-scale genome-wide inheritance-by-descent (IBD) scan, but accurate assessment of the significance of that scan is often hindered by inaccurate approximations and computationally intensive simulation. To address these issues, we developed gLOD-a test of co-segregation that, for large samples, models chromosome-specific IBD statistics as a collection of stationary Gaussian processes. With this simple model, the parametric bootstrap yields an accurate and rapid assessment of significance-the genome-wide corrected P-value. Furthermore, we show that (i) under the null hypothesis, the limiting distribution of the gLOD is the standard Gumbel distribution; (ii) our parametric bootstrap simulator is approximately 40 000 times faster than gene-dropping methods, and it is more powerful than methods that approximate the adjusted P-value; and, (iii) the gLOD has the same statistical power as the widely used maximum Kong and Cox LOD. Thus, our approach gives researchers the ability to determine quickly and accurately the significance of most large-scale IBD scans, which may contain multiple traits, thousands of families and tens of thousands of DNA sequences.

  10. Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes

    Directory of Open Access Journals (Sweden)

    Sungkyoung Choi

    2016-12-01

    Full Text Available The success of genome-wide association studies (GWASs has enabled us to improve risk assessment and provide novel genetic variants for diagnosis, prevention, and treatment. However, most variants discovered by GWASs have been reported to have very small effect sizes on complex human diseases, which has been a big hurdle in building risk prediction models. Recently, many statistical approaches based on penalized regression have been developed to solve the “large p and small n” problem. In this report, we evaluated the performance of several statistical methods for predicting a binary trait: stepwise logistic regression (SLR, least absolute shrinkage and selection operator (LASSO, and Elastic-Net (EN. We first built a prediction model by combining variable selection and prediction methods for type 2 diabetes using Affymetrix Genome-Wide Human SNP Array 5.0 from the Korean Association Resource project. We assessed the risk prediction performance using area under the receiver operating characteristic curve (AUC for the internal and external validation datasets. In the internal validation, SLR-LASSO and SLR-EN tended to yield more accurate predictions than other combinations. During the external validation, the SLR-SLR and SLR-EN combinations achieved the highest AUC of 0.726. We propose these combinations as a potentially powerful risk prediction model for type 2 diabetes.

  11. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    Energy Technology Data Exchange (ETDEWEB)

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMcahon, Katherine D.; Mamlstrom, Rex R.

    2014-05-12

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ecotype model? of diversification, but not previously observed in natural populations.

  12. Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes

    Science.gov (United States)

    Choi, Sungkyoung; Bae, Sunghwan

    2016-01-01

    The success of genome-wide association studies (GWASs) has enabled us to improve risk assessment and provide novel genetic variants for diagnosis, prevention, and treatment. However, most variants discovered by GWASs have been reported to have very small effect sizes on complex human diseases, which has been a big hurdle in building risk prediction models. Recently, many statistical approaches based on penalized regression have been developed to solve the “large p and small n” problem. In this report, we evaluated the performance of several statistical methods for predicting a binary trait: stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), and Elastic-Net (EN). We first built a prediction model by combining variable selection and prediction methods for type 2 diabetes using Affymetrix Genome-Wide Human SNP Array 5.0 from the Korean Association Resource project. We assessed the risk prediction performance using area under the receiver operating characteristic curve (AUC) for the internal and external validation datasets. In the internal validation, SLR-LASSO and SLR-EN tended to yield more accurate predictions than other combinations. During the external validation, the SLR-SLR and SLR-EN combinations achieved the highest AUC of 0.726. We propose these combinations as a potentially powerful risk prediction model for type 2 diabetes.

  13. Comparison of genome-wide selection strategies to identify furfural tolerance genes in Escherichia coli.

    Science.gov (United States)

    Glebes, Tirzah Y; Sandoval, Nicholas R; Gillis, Jacob H; Gill, Ryan T

    2015-01-01

    Engineering both feedstock and product tolerance is important for transitioning towards next-generation biofuels derived from renewable sources. Tolerance to chemical inhibitors typically results in complex phenotypes, for which multiple genetic changes must often be made to confer tolerance. Here, we performed a genome-wide search for furfural-tolerant alleles using the TRackable Multiplex Recombineering (TRMR) method (Warner et al. (2010), Nature Biotechnology), which uses chromosomally integrated mutations directed towards increased or decreased expression of virtually every gene in Escherichia coli. We employed various growth selection strategies to assess the role of selection design towards growth enrichments. We also compared genes with increased fitness from our TRMR selection to those from a previously reported genome-wide identification study of furfural tolerance genes using a plasmid-based genomic library approach (Glebes et al. (2014) PLOS ONE). In several cases, growth improvements were observed for the chromosomally integrated promoter/RBS mutations but not for the plasmid-based overexpression constructs. Through this assessment, four novel tolerance genes, ahpC, yhjH, rna, and dicA, were identified and confirmed for their effect on improving growth in the presence of furfural.

  14. Infection and inflammation in schizophrenia and bipolar disorder: a genome wide study for interactions with genetic variation.

    Directory of Open Access Journals (Sweden)

    Dimitrios Avramopoulos

    Full Text Available Inflammation and maternal or fetal infections have been suggested as risk factors for schizophrenia (SZ and bipolar disorder (BP. It is likely that such environmental effects are contingent on genetic background. Here, in a genome-wide approach, we test the hypothesis that such exposures increase the risk for SZ and BP and that the increase is dependent on genetic variants. We use genome-wide genotype data, plasma IgG antibody measurements against Toxoplasma gondii, Herpes simplex virus type 1, Cytomegalovirus, Human Herpes Virus 6 and the food antigen gliadin as well as measurements of C-reactive protein (CRP, a peripheral marker of inflammation. The subjects are SZ cases, BP cases, parents of cases and screened controls. We look for higher levels of our immunity/infection variables and interactions between them and common genetic variation genome-wide. We find many of the antibody measurements higher in both disorders. While individual tests do not withstand correction for multiple comparisons, the number of nominally significant tests and the comparisons showing the expected direction are in significant excess (permutation p=0.019 and 0.004 respectively. We also find CRP levels highly elevated in SZ, BP and the mothers of BP cases, in agreement with existing literature, but possibly confounded by our inability to correct for smoking or body mass index. In our genome-wide interaction analysis no signal reached genome-wide significance, yet many plausible candidate genes emerged. In a hypothesis driven test, we found multiple interactions among SZ-associated SNPs in the HLA region on chromosome 6 and replicated an interaction between CMV infection and genotypes near the CTNNA3 gene reported by a recent GWAS. Our results support that inflammatory processes and infection may modify the risk for psychosis and suggest that the genotype at SZ-associated HLA loci modifies the effect of these variables on the risk to develop SZ.

  15. Infection and inflammation in schizophrenia and bipolar disorder: a genome wide study for interactions with genetic variation.

    Science.gov (United States)

    Avramopoulos, Dimitrios; Pearce, Brad D; McGrath, John; Wolyniec, Paula; Wang, Ruihua; Eckart, Nicole; Hatzimanolis, Alexandros; Goes, Fernando S; Nestadt, Gerald; Mulle, Jennifer; Coneely, Karen; Hopkins, Myfanwy; Ruczinski, Ingo; Yolken, Robert; Pulver, Ann E

    2015-01-01

    Inflammation and maternal or fetal infections have been suggested as risk factors for schizophrenia (SZ) and bipolar disorder (BP). It is likely that such environmental effects are contingent on genetic background. Here, in a genome-wide approach, we test the hypothesis that such exposures increase the risk for SZ and BP and that the increase is dependent on genetic variants. We use genome-wide genotype data, plasma IgG antibody measurements against Toxoplasma gondii, Herpes simplex virus type 1, Cytomegalovirus, Human Herpes Virus 6 and the food antigen gliadin as well as measurements of C-reactive protein (CRP), a peripheral marker of inflammation. The subjects are SZ cases, BP cases, parents of cases and screened controls. We look for higher levels of our immunity/infection variables and interactions between them and common genetic variation genome-wide. We find many of the antibody measurements higher in both disorders. While individual tests do not withstand correction for multiple comparisons, the number of nominally significant tests and the comparisons showing the expected direction are in significant excess (permutation p=0.019 and 0.004 respectively). We also find CRP levels highly elevated in SZ, BP and the mothers of BP cases, in agreement with existing literature, but possibly confounded by our inability to correct for smoking or body mass index. In our genome-wide interaction analysis no signal reached genome-wide significance, yet many plausible candidate genes emerged. In a hypothesis driven test, we found multiple interactions among SZ-associated SNPs in the HLA region on chromosome 6 and replicated an interaction between CMV infection and genotypes near the CTNNA3 gene reported by a recent GWAS. Our results support that inflammatory processes and infection may modify the risk for psychosis and suggest that the genotype at SZ-associated HLA loci modifies the effect of these variables on the risk to develop SZ.

  16. Genome-wide DNA methylation profiling of cell-free serum DNA in esophageal adenocarcinoma and Barrett esophagus.

    Science.gov (United States)

    Zhai, Rihong; Zhao, Yang; Su, Li; Cassidy, Lauren; Liu, Geoffrey; Christiani, David C

    2012-01-01

    Aberrant DNA methylation (DNAm) is a feature of most types of cancers. Genome-wide DNAm profiling has been performed successfully on tumor tissue DNA samples. However, the invasive procedure limits the utility of tumor tissue for epidemiological studies. While recent data indicate that cell-free circulating DNAm (cfDNAm) profiles reflect DNAm status in corresponding tumor tissues, no studies have examined the association of cfDNAm with cancer or precursors on a genome-wide scale. The objective of this pilot study was to evaluate the putative significance of genome-wide cfDNAm profiles in esophageal adenocarcinoma (EA) and Barrett esophagus (BE, EA precursor). We performed genome-wide DNAm profiling in EA tissue DNA (n = 8) and matched serum DNA (n = 8), in serum DNA of BE (n = 10), and in healthy controls (n = 10) using the Infinium HumanMethylation27 BeadChip that covers 27,578 CpG loci in 14,495 genes. We found that cfDNAm profiles were highly correlated to DNAm profiles in matched tumor tissue DNA (r = 0.92) in patients with EA. We selected the most differentially methylated loci to perform hierarchical clustering analysis. We found that 911 loci can discriminate perfectly between EA and control samples, 554 loci can separate EA from BE samples, and 46 loci can distinguish BE from control samples. These results suggest that genome-wide cfDNAm profiles are highly consistent with DNAm profiles detected in corresponding tumor tissues. Differential cfDNAm profiling may be a useful approach for the noninvasive screening of EA and EA premalignant lesions.

  17. Genome-wide DNA Methylation Profiling of Cell-Free Serum DNA in Esophageal Adenocarcinoma and Barrett Esophagus

    Directory of Open Access Journals (Sweden)

    Rihong Zhai

    2012-01-01

    Full Text Available Aberrant DNA methylation (DNAm is a feature of most types of cancers. Genome-wide DNAm profiling has been performed successfully on tumor tissue DNA samples. However, the invasive procedure limits the utility of tumor tissue for epidemiological studies. While recent data indicate that cell-free circulating DNAm (cfDNAm profiles reflect DNAm status in corresponding tumor tissues, no studies have examined the association of cfDNAm with cancer or precursors on a genome-wide scale. The objective of this pilot study was to evaluate the putative significance of genome-wide cfDNAm profiles in esophageal adenocarcinoma (EA and Barrett esophagus (BE, EA precursor. We performed genome-wide DNAm profiling in EA tissue DNA (n = 8 and matched serum DNA (n = 8, in serum DNA of BE (n = 10, and in healthy controls (n = 10 using the Infinium HumanMethylation27 BeadChip that covers 27,578 CpG loci in 14,495 genes. We found that cfDNAm profiles were highly correlated to DNAm profiles in matched tumor tissue DNA (r = 0.92 in patients with EA. We selected the most differentially methylated loci to perform hierarchical clustering analysis. We found that 911 loci can discriminate perfectly between EA and control samples, 554 loci can separate EA from BE samples, and 46 loci can distinguish BE from control samples. These results suggest that genome-wide cfDNAm profiles are highly consistent with DNAm profiles detected in corresponding tumor tissues. Differential cfDNAm profiling may be a useful approach for the noninvasive screening of EA and EA premalignant lesions.

  18. Genome-wide association study of tick resistance in South African Nguni cattle.

    Science.gov (United States)

    Mapholi, N O; Maiwashe, A; Matika, O; Riggio, V; Bishop, S C; MacNeil, M D; Banga, C; Taylor, J F; Dzama, K

    2016-04-01

    Ticks and tick-borne diseases are among the main causes of economic loss in the South African cattle industry through high morbidity and mortality rates. Concerns of the general public regarding chemical residues may tarnish their perceptions of food safety and environmental health when the husbandry of cattle includes frequent use of acaricides to manage ticks. The primary objective of this study was to identify single nucleotide polymorphism (SNP) markers associated with host resistance to ticks in South African Nguni cattle. Tick count data were collected monthly from 586 Nguni cattle reared in four herds under natural grazing conditions over a period of two years. The counts were recorded for six species of ticks attached in eight anatomical locations on the animals and were summed by species and anatomical location. This gave rise to 63 measured phenotypes or traits, with results for 12 of these traits being reported here. Tick count (x) data were transformed using log10(x+1) and the resulting values were examined for normality. DNA was extracted from hair and blood samples and was genotyped using the Illumina BovineSNP50 assay. After quality control (call rate >90%, minor allele frequency >0.02), 40,436 SNPs were retained for analysis. Genetic parameters were estimated and association analysis for tick resistance was carried out using two approaches: a genome-wide association (GWA) analysis using the GenABEL package and a regional heritability mapping (RHM) analysis. The Bonferroni genome-wide (PAmblyomma hebraeum (the vector for Heartwater disease) being the dominant species. Heritability estimates (h(2)) from the fitted animal and sire models ranged from 0.02±0.00 to 0.17±0.04 for the transformed tick count data. Several genomic regions harbouring quantitative trait loci (QTL) were identified for different tick count traits by both the GWA and RHM approaches. Three genome-wide significant regions on chromosomes 7, 10 and 19 were identified for total tick

  19. Mosaic paternal genome-wide uniparental isodisomy with down syndrome.

    Science.gov (United States)

    Darcy, Diana; Atwal, Paldeep Singh; Angell, Cathy; Gadi, Inder; Wallerstein, Robert

    2015-10-01

    We report on a 6-month-old girl with two apparent cell lines; one with trisomy 21, and the other with paternal genome-wide uniparental isodisomy (GWUPiD), identified using single nucleotide polymorphism (SNP) based microarray and microsatellite analysis of polymorphic loci. The patient has Beckwith-Wiedemann syndrome (BWS) due to paternal uniparental disomy (UPD) at chromosome location 11p15 (UPD 11p15), which was confirmed through methylation analysis. Hyperinsulinemic hypoglycemia is present, which is associated with paternal UPD 11p15.5; and she likely has medullary nephrocalcinosis, which is associated with paternal UPD 20, although this was not biochemically confirmed. Angelman syndrome (AS) analysis was negative but this testing is not completely informative; she has no specific features of AS. Clinical features of this patient include: dysmorphic features consistent with trisomy 21, tetralogy of Fallot, hemihypertrophy, swirled skin hyperpigmentation, hepatoblastoma, and Wilms tumor. Her karyotype is 47,XX,+21[19]/46,XX[4], and microarray results suggest that the cell line with trisomy 21 is biparentally inherited and represents 40-50% of the genomic material in the tested specimen. The difference in the level of cytogenetically detected mosaicism versus the level of mosaicism observed via microarray analysis is likely caused by differences in the test methodologies. While a handful of cases of mosaic paternal GWUPiD have been reported, this patient is the only reported case that also involves trisomy 21. Other GWUPiD patients have presented with features associated with multiple imprinted regions, as does our patient. © 2015 Wiley Periodicals, Inc.

  20. Susceptibility to Chronic Mucus Hypersecretion, a Genome Wide Association Study

    Science.gov (United States)

    Dijkstra, Akkelies E.; Smolonska, Joanna; van den Berge, Maarten; Wijmenga, Ciska; Zanen, Pieter; Luinge, Marjan A.; Platteel, Mathieu; Lammers, Jan-Willem; Dahlback, Magnus; Tosh, Kerrie; Hiemstra, Pieter S.; Sterk, Peter J.; Spira, Avi; Vestbo, Jorgen; Nordestgaard, Borge G.; Benn, Marianne; Nielsen, Sune F.; Dahl, Morten; Verschuren, W. Monique; Picavet, H. Susan J.; Smit, Henriette A.; Owsijewitsch, Michael; Kauczor, Hans U.; de Koning, Harry J.; Nizankowska-Mogilnicka, Eva; Mejza, Filip; Nastalek, Pawel; van Diemen, Cleo C.; Cho, Michael H.; Silverman, Edwin K.; Crapo, James D.; Beaty, Terri H.; Lomas, David A.; Bakke, Per; Gulsvik, Amund; Bossé, Yohan; Obeidat, M. A.; Loth, Daan W.; Lahousse, Lies; Rivadeneira, Fernando; Uitterlinden, Andre G.; Hofman, Andre; Stricker, Bruno H.; Brusselle, Guy G.; van Duijn, Cornelia M.; Brouwer, Uilke; Koppelman, Gerard H.; Vonk, Judith M.; Nawijn, Martijn C.; Groen, Harry J. M.; Timens, Wim; Boezen, H. Marike; Postma, Dirkje S.

    2014-01-01

    Background Chronic mucus hypersecretion (CMH) is associated with an increased frequency of respiratory infections, excess lung function decline, and increased hospitalisation and mortality rates in the general population. It is associated with smoking, but it is unknown why only a minority of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA) study of CMH in Caucasian populations. Methods GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years). Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism (SNP). Results A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10−6, OR = 1.17), located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1) on chromosome 3. The risk allele (G) was associated with higher mRNA expression of SATB1 (4.3×10−9) in lung tissue. Presence of CMH was associated with increased SATB1 mRNA expression in bronchial biopsies from COPD patients. SATB1 expression was induced during differentiation of primary human bronchial epithelial cells in culture. Conclusions Our findings, that SNP rs6577641 is associated with CMH in multiple cohorts and is a cis-eQTL for SATB1, together with our additional observation that SATB1 expression increases during epithelial differentiation provide suggestive evidence that SATB1 is a gene that affects CMH. PMID:24714607

  1. Genome-wide examination of myoblast cell cycle withdrawal duringdifferentiation

    Energy Technology Data Exchange (ETDEWEB)

    Shen, Xun; Collier, John Michael; Hlaing, Myint; Zhang, Leanne; Delshad, Elizabeth H.; Bristow, James; Bernstein, Harold S.

    2002-12-02

    Skeletal and cardiac myocytes cease division within weeks of birth. Although skeletal muscle retains limited capacity for regeneration through recruitment of satellite cells, resident populations of adult myocardial stem cells have not been identified. Because cell cycle withdrawal accompanies myocyte differentiation, we hypothesized that C2C12 cells, a mouse myoblast cell line previously used to characterize myocyte differentiation, also would provide a model for studying cell cycle withdrawal during differentiation. C2C12 cells were differentiated in culture medium containing horse serum and harvested at various time points to characterize the expression profiles of known cell cycle and myogenic regulatory factors by immunoblot analysis. BrdU incorporation decreased dramatically in confluent cultures 48 hr after addition of horse serum, as cells started to form myotubes. This finding was preceded by up-regulation of MyoD, followed by myogenin, and activation of Bcl-2. Cyclin D1 was expressed in proliferating cultures and became undetectable in cultures containing 40 percent fused myotubes, as levels of p21(WAF1/Cip1) increased and alpha-actin became detectable. Because C2C12 myoblasts withdraw from the cell cycle during myocyte differentiation following a course that recapitulates this process in vivo, we performed a genome-wide screen to identify other gene products involved in this process. Using microarrays containing approximately 10,000 minimally redundant mouse sequences that map to the UniGene database of the National Center for Biotechnology Information, we compared gene expression profiles between proliferating, differentiating, and differentiated C2C12 cells and verified candidate genes demonstrating differential expression by RT-PCR. Cluster analysis of differentially expressed genes revealed groups of gene products involved in cell cycle withdrawal, muscle differentiation, and apoptosis. In addition, we identified several genes, including DDAH2 and Ly

  2. Susceptibility to chronic mucus hypersecretion, a genome wide association study.

    Directory of Open Access Journals (Sweden)

    Akkelies E Dijkstra

    Full Text Available BACKGROUND: Chronic mucus hypersecretion (CMH is associated with an increased frequency of respiratory infections, excess lung function decline, and increased hospitalisation and mortality rates in the general population. It is associated with smoking, but it is unknown why only a minority of smokers develops CMH. A plausible explanation for this phenomenon is a predisposing genetic constitution. Therefore, we performed a genome wide association (GWA study of CMH in Caucasian populations. METHODS: GWA analysis was performed in the NELSON-study using the Illumina 610 array, followed by replication and meta-analysis in 11 additional cohorts. In total 2,704 subjects with, and 7,624 subjects without CMH were included, all current or former heavy smokers (≥20 pack-years. Additional studies were performed to test the functional relevance of the most significant single nucleotide polymorphism (SNP. RESULTS: A strong association with CMH, consistent across all cohorts, was observed with rs6577641 (p = 4.25×10(-6, OR = 1.17, located in intron 9 of the special AT-rich sequence-binding protein 1 locus (SATB1 on chromosome 3. The risk allele (G was associated with higher mRNA expression of SATB1 (4.3×10(-9 in lung tissue. Presence of CMH was associated with increased SATB1 mRNA expression in bronchial biopsies from COPD patients. SATB1 expression was induced during differentiation of primary human bronchial epithelial cells in culture. CONCLUSIONS: Our findings, that SNP rs6577641 is associated with CMH in multiple cohorts and is a cis-eQTL for SATB1, together with our additional observation that SATB1 expression increases during epithelial differentiation provide suggestive evidence that SATB1 is a gene that affects CMH.

  3. Genephony: a knowledge management tool for genome-wide research

    Science.gov (United States)

    Nuzzo, Angelo; Riva, Alberto

    2009-01-01

    Background One of the consequences of the rapid and widespread adoption of high-throughput experimental technologies is an exponential increase of the amount of data produced by genome-wide experiments. Researchers increasingly need to handle very large volumes of heterogeneous data, including both the data generated by their own experiments and the data retrieved from publicly available repositories of genomic knowledge. Integration, exploration, manipulation and interpretation of data and information therefore need to become as automated as possible, since their scale and breadth are, in general, beyond the limits of what individual researchers and the basic data management tools in normal use can handle. This paper describes Genephony, a tool we are developing to address these challenges. Results We describe how Genephony can be used to manage large datesets of genomic information, integrating them with existing knowledge repositories. We illustrate its functionalities with an example of a complex annotation task, in which a set of SNPs coming from a genotyping experiment is annotated with genes known to be associated to a phenotype of interest. We show how, thanks to the modular architecture of Genephony and its user-friendly interface, this task can be performed in a few simple steps. Conclusion Genephony is an online tool for the manipulation of large datasets of genomic information. It can be used as a browser for genomic data, as a high-throughput annotation tool, and as a knowledge discovery tool. It is designed to be easy to use, flexible and extensible. Its knowledge management engine provides fine-grained control over individual data elements, as well as efficient operations on large datasets. PMID:19728881

  4. Genome-wide identification of direct HBx genomic targets

    KAUST Repository

    Guerrieri, Francesca

    2017-02-17

    Background The Hepatitis B Virus (HBV) HBx regulatory protein is required for HBV replication and involved in HBV-related carcinogenesis. HBx interacts with chromatin modifying enzymes and transcription factors to modulate histone post-translational modifications and to regulate viral cccDNA transcription and cellular gene expression. Aiming to identify genes and non-coding RNAs (ncRNAs) directly targeted by HBx, we performed a chromatin immunoprecipitation sequencing (ChIP-Seq) to analyse HBV recruitment on host cell chromatin in cells replicating HBV. Results ChIP-Seq high throughput sequencing of HBx-bound fragments was used to obtain a high-resolution, unbiased, mapping of HBx binding sites across the genome in HBV replicating cells. Protein-coding genes and ncRNAs involved in cell metabolism, chromatin dynamics and cancer were enriched among HBx targets together with genes/ncRNAs known to modulate HBV replication. The direct transcriptional activation of genes/miRNAs that potentiate endocytosis (Ras-related in brain (RAB) GTPase family) and autophagy (autophagy related (ATG) genes, beclin-1, miR-33a) and the transcriptional repression of microRNAs (miR-138, miR-224, miR-576, miR-596) that directly target the HBV pgRNA and would inhibit HBV replication, contribute to HBx-mediated increase of HBV replication. Conclusions Our ChIP-Seq analysis of HBx genome wide chromatin recruitment defined the repertoire of genes and ncRNAs directly targeted by HBx and led to the identification of new mechanisms by which HBx positively regulates cccDNA transcription and HBV replication.

  5. Genome-Wide Associations of Gene Expression Variation in Humans.

    Directory of Open Access Journals (Sweden)

    2005-12-01

    Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.

  6. Genome-wide associations of gene expression variation in humans.

    Directory of Open Access Journals (Sweden)

    Barbara E Stranger

    2005-12-01

    Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.

  7. Genome-wide signatures of convergent evolution in echolocating mammals.

    Science.gov (United States)

    Parker, Joe; Tsagkogeorga, Georgia; Cotton, James A; Liu, Yuan; Provero, Paolo; Stupka, Elia; Rossiter, Stephen J

    2013-10-10

    Evolution is typically thought to proceed through divergence of genes, proteins and ultimately phenotypes. However, similar traits might also evolve convergently in unrelated taxa owing to similar selection pressures. Adaptive phenotypic convergence is widespread in nature, and recent results from several genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution, although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show that convergence is not a rare process restricted to several loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four newly sequenced bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the bottlenose dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Unexpectedly, we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognized.

  8. Identification of neural outgrowth genes using genome-wide RNAi.

    Directory of Open Access Journals (Sweden)

    Katharine J Sepp

    2008-07-01

    Full Text Available While genetic screens have identified many genes essential for neurite outgrowth, they have been limited in their ability to identify neural genes that also have earlier critical roles in the gastrula, or neural genes for which maternally contributed RNA compensates for gene mutations in the zygote. To address this, we developed methods to screen the Drosophila genome using RNA-interference (RNAi on primary neural cells and present the results of the first full-genome RNAi screen in neurons. We used live-cell imaging and quantitative image analysis to characterize the morphological phenotypes of fluorescently labelled primary neurons and glia in response to RNAi-mediated gene knockdown. From the full genome screen, we focused our analysis on 104 evolutionarily conserved genes that when downregulated by RNAi, have morphological defects such as reduced axon extension, excessive branching, loss of fasciculation, and blebbing. To assist in the phenotypic analysis of the large data sets, we generated image analysis algorithms that could assess the statistical significance of the mutant phenotypes. The algorithms were essential for the analysis of the thousands of images generated by the screening process and will become a valuable tool for future genome-wide screens in primary neurons. Our analysis revealed unexpected, essential roles in neurite outgrowth for genes representing a wide range of functional categories including signalling molecules, enzymes, channels, receptors, and cytoskeletal proteins. We also found that genes known to be involved in protein and vesicle trafficking showed similar RNAi phenotypes. We confirmed phenotypes of the protein trafficking genes Sec61alpha and Ran GTPase using Drosophila embryo and mouse embryonic cerebral cortical neurons, respectively. Collectively, our results showed that RNAi phenotypes in primary neural culture can parallel in vivo phenotypes, and the screening technique can be used to identify many new

  9. Genome-wide identification of novel small RNAs in Pseudomonas aeruginosa

    DEFF Research Database (Denmark)

    Gómez Lozano, María

    and RNA sequencing (RNAseq) technologies. The latter approach, in particular, has revolutionized sRNA discovery by enabling interrogation of the transcriptome at unprecedented depths. The size and complexity of the P. aeruginosa genome suggests that it encodes many hitherto undetected sRNAs. In this study...... of regulatory sRNAs in P. aeruginosa and the approach described here may be applied to identify sRNAs in any bacterium under different growth and stress conditions. In addition the role of sRNA OsiS was investigated. OsiS was identified in our genome-wide search of sRNAs in P. aeruginosa. OsiS is highly...... to know the exact nature of the interaction between these two sRNAs. Notably, OsiS is, to the best of our knowledge, the first sRNA whose main function seems to be regulating the cellular levels of another sRNA....

  10. Application of genome-wide single nucleotide polymorphism typing: simple association and beyond.

    Directory of Open Access Journals (Sweden)

    J Raphael Gibbs

    2006-10-01

    Full Text Available The International HapMap Project and the arrival of technologies that type more than 100,000 SNPs in a single experiment have made genome-wide single nucleotide polymorphism (GW-SNP assay a realistic endeavor. This has sparked considerable debate regarding the promise of GW-SNP typing to identify genetic association in disease. As has already been shown, this approach has the potential to localize common genetic variation underlying disease risk. The data provided from this technology also lends itself to several other lines of investigation; autozygosity mapping in consanguineous families and outbred populations, direct detection of structural variation, admixture analysis, and other population genetic approaches. In this review we will discuss the potential uses and practical application of GW-SNP typing including those above and beyond simple association testing.

  11. Preliminary Clinical Evaluation: The What, Where, How Approach to Scoring

    OpenAIRE

    Wright, C.; Akimoto, T.

    2016-01-01

    The SCoR is driving for preliminary clinical evaluation (PCE) however; currently there is no method of quantification to assess quality. FRCR has an approach to quantify comments in the rapid reporting examination (CR2B).The aim of this project was to develop a robust scoring system that enables comprehensive image evaluation regardless of profession. An image test bank was administered using RadBench with equal prevalence of normal /abnormal. A random sample of attempts was selected to pilot...

  12. Genome-wide association study of generalized anxiety symptoms in the Hispanic Community Health Study/Study of Latinos.

    Science.gov (United States)

    Dunn, Erin C; Sofer, Tamar; Gallo, Linda C; Gogarten, Stephanie M; Kerr, Kathleen F; Chen, Chia-Yen; Stein, Murray B; Ursano, Robert J; Guo, Xiuqing; Jia, Yucheng; Qi, Qibin; Rotter, Jerome I; Argos, Maria; Cai, Jianwen; Penedo, Frank J; Perreira, Krista; Wassertheil-Smoller, Sylvia; Smoller, Jordan W

    2017-03-01

    Although generalized anxiety disorder (GAD) is heritable and aggregates in families, no genomic loci associated with GAD have been reported. We aimed to discover potential loci by conducting a genome-wide analysis of GAD symptoms in a large, population-based sample of Hispanic/Latino adults. Data came from 12,282 participants (aged 18-74) in the Hispanic Community Health Study/Study of Latinos. Using a shortened Spielberger Trait Anxiety measure, we analyzed the following: (i) a GAD symptoms score restricted to the three items tapping diagnostic features of GAD as defined by DSM-V; and (ii) a total trait anxiety score based on summing responses to all ten items. We first calculated the heritability due to common variants (h(2)SNP ) and then conducted a genome-wide association study (GWAS) of GAD symptoms. Replication was attempted in three independent Hispanic cohorts (Multi-Ethnic Study of Atherosclerosis, Women's Health Initiative, Army STARRS). The GAD symptoms score showed evidence of modest heritability (7.2%; P = 0.03), while the total trait anxiety score did not (4.97%; P = 0.20). One genotyped SNP (rs78602344) intronic to thrombospondin 2 (THBS2) was nominally associated (P = 5.28 × 10(-8) ) in the primary analysis adjusting for psychiatric medication use and significantly associated with the GAD symptoms score in the analysis excluding medication users (P = 4.18 × 10(-8) ). However, meta-analysis of the replication samples did not support this association. Although we identified a genome-wide significant locus in this sample, we were unable to replicate this finding. Evidence for heritability was also only detected for GAD symptoms, and not the trait anxiety measure, suggesting differential genetic influences within the domain of trait anxiety. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  13. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Development of Advanced Classification Algorithm for Genome-Wide Single Nucleotide Polymorphism (SNP) Data Analysis

    Science.gov (United States)

    2011-04-01

    al. (2007) “Efficient mapping of mendelian traits in dogs through genome-wide association.” Nat Genet 39:1321-1328. 12 Distribution A...collected data to genetically map superior intelligence in the military working dog. A behavioral testing regimen was developed by canine cognitive expert Dr...TERMS Military working dog genome-wide association study genetic marker intelligence 16

  14. A genome-wide MeSH-based literature mining system predicts implicit gene-to-gene relationships and networks.

    Science.gov (United States)

    Xiang, Zuoshuang; Qin, Tingting; Qin, Zhaohui S; He, Yongqun

    2013-10-16

    The large amount of literature in the post-genomics era enables the study of gene interactions and networks using all available articles published for a specific organism. MeSH is a controlled vocabulary of medical and scientific terms that is used by biomedical scientists to manually index articles in the PubMed literature database. We hypothesized that genome-wide gene-MeSH term associations from the PubMed literature database could be used to predict implicit gene-to-gene relationships and networks. While the gene-MeSH associations have been used to detect gene-gene interactions in some studies, different methods have not been well compared, and such a strategy has not been evaluated for a genome-wide literature analysis. Genome-wide literature mining of gene-to-gene interactions allows ranking of the best gene interactions and investigation of comprehensive biological networks at a genome level. The genome-wide GenoMesh literature mining algorithm was developed by sequentially generating a gene-article matrix, a normalized gene-MeSH term matrix, and a gene-gene matrix. The gene-gene matrix relies on the calculation of pairwise gene dissimilarities based on gene-MeSH relationships. An optimized dissimilarity score was identified from six well-studied functions based on a receiver operating characteristic (ROC) analysis. Based on the studies with well-studied Escherichia coli and less-studied Brucella spp., GenoMesh was found to accurately identify gene functions using weighted MeSH terms, predict gene-gene interactions not reported in the literature, and cluster all the genes studied from an organism using the MeSH-based gene-gene matrix. A web-based GenoMesh literature mining program is also available at: http://genomesh.hegroup.org. GenoMesh also predicts gene interactions and networks among genes associated with specific MeSH terms or user-selected gene lists. The GenoMesh algorithm and web program provide the first genome-wide, MeSH-based literature mining

  15. Genome-wide identification of the regulatory targets of a transcription factor using biochemical characterization and computational genomic analysis

    Directory of Open Access Journals (Sweden)

    Jolly Emmitt R

    2005-11-01

    Full Text Available Abstract Background A major challenge in computational genomics is the development of methodologies that allow accurate genome-wide prediction of the regulatory targets of a transcription factor. We present a method for target identification that combines experimental characterization of binding requirements with computational genomic analysis. Results Our method identified potential target genes of the transcription factor Ndt80, a key transcriptional regulator involved in yeast sporulation, using the combined information of binding affinity, positional distribution, and conservation of the binding sites across multiple species. We have also developed a mathematical approach to compute the false positive rate and the total number of targets in the genome based on the multiple selection criteria. Conclusion We have shown that combining biochemical characterization and computational genomic analysis leads to accurate identification of the genome-wide targets of a transcription factor. The method can be extended to other transcription factors and can complement other genomic approaches to transcriptional regulation.

  16. A genome-wide tree- and forest-based association analysis of comorbidity of alcoholism and smoking

    OpenAIRE

    Ye, Yuanqing; Zhong, Xiaoyun; Zhang, Heping

    2005-01-01

    Genetic mechanisms underlying alcoholism are complex. Understanding the etiology of alcohol dependence and its comorbid conditions such as smoking is important because of the significant health concerns. In this report, we describe a method based on classification trees and deterministic forests for association studies to perform a genome-wide joint association analysis of alcoholism and smoking. This approach is used to analyze the single-nucleotide polymorphism data from the Collaborative S...

  17. Comparison of three summary statistics for ranking genes in genome-wide association studies.

    Science.gov (United States)

    Freytag, Saskia; Bickeböller, Heike

    2014-05-20

    Problems associated with insufficient power have haunted the analysis of genome-wide association studies and are likely to be the main challenge for the analysis of next-generation sequencing data. Ranking genes according to their strength of association with the investigated phenotype is one solution. To obtain rankings for genes, researchers can draw from a wide range of statistics summarizing the relationships between variants mapped to a gene and the phenotype. Hence, it is of interest to explore the performance of these statistics in the context of rankings. To this end, we conducted a simulation study (limited to genes of equal sizes) of three different summary statistics examining the ability to rank genes in a meaningful order. The weighted sum of squared marginal score test (Pan, 2009), RareCover algorithm (Bahtia et al., 2010) and the elastic net regularization (Zou and Hastie, 2005) were chosen, because they can handle common as well as rare variants. The test based on the score statistic outperformed both other methods in almost all investigated scenarios. It was the only measure to consistently detect genes with interacting causal variants. However, the RareCover algorithm proved better at identifying genes including causal variants with small effect sizes and low minor allele frequency than the weighted sum of squared marginal score test. The performance of the elastic net regularization was unimpressive for all but the simplest scenarios. Copyright © 2013 John Wiley & Sons, Ltd.

  18. Genome-wide analysis of alternative splicing in Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    Thomas Julie

    2010-02-01

    Full Text Available Abstract Background Genome-wide computational analysis of alternative splicing (AS in several flowering plants has revealed that pre-mRNAs from about 30% of genes undergo AS. Chlamydomonas, a simple unicellular green alga, is part of the lineage that includes land plants. However, it diverged from land plants about one billion years ago. Hence, it serves as a good model system to study alternative splicing in early photosynthetic eukaryotes, to obtain insights into the evolution of this process in plants, and to compare splicing in simple unicellular photosynthetic and non-photosynthetic eukaryotes. We performed a global analysis of alternative splicing in Chlamydomonas reinhardtii using its recently completed genome sequence and all available ESTs and cDNAs. Results Our analysis of AS using BLAT and a modified version of the Sircah tool revealed AS of 498 transcriptional units with 611 events, representing about 3% of the total number of genes. As in land plants, intron retention is the most prevalent form of AS. Retained introns and skipped exons tend to be shorter than their counterparts in constitutively spliced genes. The splice site signals in all types of AS events are weaker than those in constitutively spliced genes. Furthermore, in alternatively spliced genes, the prevalent splice form has a stronger splice site signal than the non-prevalent form. Analysis of constitutively spliced introns revealed an over-abundance of motifs with simple repetitive elements in comparison to introns involved in intron retention. In almost all cases, AS results in a truncated ORF, leading to a coding sequence that is around 50% shorter than the prevalent splice form. Using RT-PCR we verified AS of two genes and show that they produce more isoforms than indicated by EST data. All cDNA/EST alignments and splice graphs are provided in a website at http://combi.cs.colostate.edu/as/chlamy. Conclusions The extent of AS in Chlamydomonas that we observed is much

  19. A Genome-wide Pleiotropy Scan for Prostate Cancer Risk

    Science.gov (United States)

    Panagiotou, Orestis A; Travis, Ruth C; Campa, Daniele; Berndt, Sonja I.; Lindstrom, Sara; Kraft, Peter; Schumacher, Fredrick R.; Siddiq, Afshan; Papatheodorou, Stefania I.; Stanford, Janet L.; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie J.; Diver, W. Ryan; Gapstur, Susan M.; Stevens, Victoria L.; Boeing, Heiner; Bueno-de-Mesquita, H. Bas; Gurrea, Aurelio Barricarte; Kaaks, Rudolf; Khaw, Kay-Tee; Krogh, Vittorio; Overvad, Kim; Riboli, Elio; Trichopoulos, Dimitrios; Giovannucci, Edward; Stampfer, Meir; Haiman, Christopher; Henderson, Brian; Le Marchand, Loic; Gaziano, J. Michael; Hunter, DavidJ.; Koutros, Stella; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.; Wacholder, Sholom; Key, Timothy J.; Tsilidis, Konstantinos K

    2014-01-01

    Background No single-nucleotide polymorphisms (SNPs) specific for aggressive prostate cancer have been identified in genome-wide association studies (GWAS). Objective To test if SNPs associated with other traits may also affect the risk of aggressive prostate cancer. Design, setting, and participants SNPs implicated in any phenotype other than prostate cancer (p ≤ 10−7) were identified through the catalog of published GWAS and tested in 2891 aggressive prostate cancer cases and 4592 controls from the Breast and Prostate Cancer Cohort Consortium (BPC3). The 40 most significant SNPs were followed up in 4872 aggressive prostate cancer cases and 24 534 controls from the Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) consortium. Outcome measurements and statistical analysis Odds ratios (ORs) and 95% confidence intervals (CIs) for aggressive prostate cancer were estimated. Results and limitations A total of 4666 SNPs were evaluated by the BPC3. Two signals were seen in regions already reported for prostate cancer risk. rs7014346 at 8q24.21 was marginally associated with aggressive prostate cancer in the BPC3 trial (p = 1.6 × 10-6), whereas after meta-analysis by PRACTICAL the summary OR was 1.21 (95%CI 1.16–1.27; p = 3.22 × 10−18). rs9900242 at 17q24.3 was also marginally associated with aggressive disease in the meta-analysis (OR 0.90, 95% CI 0.86–0.94; p = 2.5 × 10−6). Neither of these SNPs remained statistically significant when conditioning on correlated known prostate cancer SNPs. The meta-analysis by BPC3 and PRACTICAL identified a third promising signal, marked by rs16844874 at 2q34, independent of known prostate cancer loci (OR 1.12,95% CI 1.06–1.19; p = 4.67 × 10−5); it has been shown that SNPs correlated with this signal affect glycine concentrations. The main limitation is the heterogeneity in the definition of aggressive prostate cancer between BPC3 and PRACTICAL. Conclusions We did

  20. Genome-wide association study of colorectal cancer in Hispanics

    Science.gov (United States)

    Schmit, Stephanie L.; Schumacher, Fredrick R.; Edlund, Christopher K.; Conti, David V.; Ihenacho, Ugonna; Wan, Peggy; Van Den Berg, David; Casey, Graham; Fortini, Barbara K.; Lenz, Heinz-Josef; Tusié-Luna, Teresa; Aguilar-Salinas, Carlos A.; Moreno-Macías, Hortensia; Huerta-Chagoya, Alicia; Ordóñez-Sánchez, María Luisa; Rodríguez-Guillén, Rosario; Cruz-Bautista, Ivette; Rodríguez-Torres, Maribel; Muñóz-Hernández, Linda Liliana; Arellano-Campos, Olimpia; Gómez, Donají; Alvirde, Ulices; González-Villalpando, Clicerio; González-Villalpando, María Elena; Le Marchand, Loic; Haiman, Christopher A.; Figueiredo, Jane C.

    2016-01-01

    Genome-wide association studies (GWAS) have identified 58 susceptibility alleles across 37 regions associated with the risk of colorectal cancer (CRC) with P < 5×10−8. Most studies have been conducted in non-Hispanic whites and East Asians; however, the generalizability of these findings and the potential for ethnic-specific risk variation in Hispanic and Latino (HL) individuals have been largely understudied. We describe the first GWAS of common genetic variation contributing to CRC risk in HL (1611 CRC cases and 4330 controls). We also examine known susceptibility alleles and implement imputation-based fine-mapping to identify potential ethnicity-specific association signals in known risk regions. We discovered 17 variants across 4 independent regions that merit further investigation due to suggestive CRC associations (P < 1×10−6) at 1p34.3 (rs7528276; Odds Ratio (OR) = 1.86 [95% confidence interval (CI): 1.47–2.36); P = 2.5×10−7], 2q23.3 (rs1367374; OR = 1.37 (95% CI: 1.21–1.55); P = 4.0×10−7), 14q24.2 (rs143046984; OR = 1.65 (95% CI: 1.36–2.01); P = 4.1×10−7) and 16q12.2 [rs142319636; OR = 1.69 (95% CI: 1.37–2.08); P=7.8×10−7]. Among the 57 previously published CRC susceptibility alleles with minor allele frequency ≥1%, 76.5% of SNPs had a consistent direction of effect and 19 (33.3%) were nominally statistically significant (P < 0.05). Further, rs185423955 and rs60892987 were identified as novel secondary susceptibility variants at 3q26.2 (P = 5.3×10–5) and 11q12.2 (P = 6.8×10−5), respectively. Our findings demonstrate the importance of fine mapping in HL. These results are informative for variant prioritization in functional studies and future risk prediction modeling in minority populations. PMID:27207650

  1. Opportunistic splitting for scheduling using a score-based approach

    KAUST Repository

    Rashid, Faraan

    2012-06-01

    We consider the problem of scheduling a user in a multi-user wireless environment in a distributed manner. The opportunistic splitting algorithm is applied to find the best group of users without reporting the channel state information to the centralized scheduler. The users find the best among themselves while requiring just a ternary feedback from the common receiver at the end of each mini-slot. The original splitting algorithm is modified to handle users with asymmetric channel conditions. We use a score-based approach with the splitting algorithm to introduce time and throughput fairness while exploiting the multi-user diversity of the network. Analytical and simulation results are given to show that the modified score-based splitting algorithm works well as a fair scheduling scheme with good spectral efficiency and reduced feedback. © 2012 IEEE.

  2. Genome-Wide Association Mapping for Cell Wall Composition and Properties in Temperate Grasses

    DEFF Research Database (Denmark)

    Bellucci, Andrea

    -glucans. Plant cell wall biosynthesis is regulated by a large number of genes and regulatory factors but very few of these are known and characterized. This PhD project aimed to the identification of putative candidate genes involved in plant cell wall composition and properties using a genome wide (GWAS......) approach. The species investigate were wheat, barley and B. distachyon, considered a model plant for temperate cereals. Agronomical traits as yield and plant height were also included in the analysis along with cell wall composition and saccharification properties. Several marker-trait associations were......Plant cell wall confers flexibility, support for the vital processes of the plant and resistance to abiotic stresses and pathogen. It is constituted by a complex matrix of cellulose, hemicellulose, pectins and polyphenolic compounds as lignin. These main components interact with each other...

  3. Protein Interaction-Based Genome-Wide Analysis of Incident Coronary Heart Disease

    DEFF Research Database (Denmark)

    Jensen, Majken Karoline; Pers, Tune Hannes; Dworzynski, Piotr

    2011-01-01

    Background-Network-based approaches may leverage genome-wide association (GWA) analysis by testing for the aggregate association across several pathway members. We aimed to examine if networks of genes that represent experimentally determined protein-protein interactions (PPIs) are enriched...... are involved in abnormal cardiovascular system physiological features based on knockout mice (4-fold enrichment; Fisher exact test, P = 0.006). Ingenuity pathway analysis revealed that canonical pathways, especially related to blood pressure regulation, were significantly enriched in the genes from the top...... complex. Conclusions-The integration of a GWA study with PPI data successfully identifies a set of candidate susceptibility genes for incident CHD that would have been missed in single-marker GWA analysis. (Circ Cardiovasc Genet. 2011; 4:549-556.)...

  4. SOAPsplice: genome-wide ab initio detection of splice junctions from RNA-Seq data

    Directory of Open Access Journals (Sweden)

    Songbo eHuang

    2011-07-01

    Full Text Available RNA-Seq, a method using next generation sequencing technologies to sequence the transcriptome, facilitates genome-wide analysis of splice junction sites. In this paper, we introduce SOAPsplice, a robust tool to detect splice junctions using RNA-Seq data without using any information of known splice junctions. SOAPsplice uses a novel two-step approach consisting of first identifying as many reasonable splice junction candidates as possible, and then, filtering the false positives with two effective filtering strategies. In both simulated and real datasets, SOAPsplice is able to detect many reliable splice junctions with low false positive rate. The improvement gained by SOAPsplice, when compared to other existing tools, becomes more obvious when the depth of sequencing is low. SOAPsplice is freely available at http://soap.genomics.org.cn/soapsplice.html.

  5. Moving towards system genetics through multiple trait analysis in genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Daniel eShriner

    2012-01-01

    Full Text Available Association studies are a staple of genotype-phenotype mapping studies, whether they are based on single markers, haplotypes, candidate genes, genome-wide genotypes, or whole genome sequences. Although genetic epidemiological studies typically contain data collected on multiple traits which themselves are often correlated, most analyses have been performed on single traits. Here, I review several methods that have been developed to perform multiple trait analysis. These methods range from traditional multivariate models for systems of equations to recently developed graphical approaches based on network theory. The application of network theory to genetics is termed systems genetics and has the potential to address long-standing questions in genetics about complex processes such as coordinate regulation, homeostasis, and pleiotropy.

  6. Advances in plant cell type-specific genome-wide studies of gene expression

    Institute of Scientific and Technical Information of China (English)

    Ying WANG; Yuling JIAO

    2011-01-01

    Cell is the functional unit of life.To study the complex interactions of systems of biological molecules,it is crucial to dissect these molecules at the cell level.In recent years,major progresses have been made by plant biologists to profile gene expression in specific cell types at the genome-wide level.Approaches based on the isolation of cells,polysomes or nuclei have been developed and successfully used for studying the cell types from distinct organs of several plant species.These cell-level data sets revealed previously unrecognized cellular properties,such as cell-specific gene expression modules and hormone response centers,and should serve as essential resources for functional genomic analyses.Newly developed technologies are more affordable to many laboratories and should help to provide new insights at the cellular resolution in the near future.

  7. Genome-wide association study for ovarian cancer susceptibility using pooled DNA

    DEFF Research Database (Denmark)

    Lu, Yi; Chen, Xiaoqing; Beesley, Jonathan;

    2012-01-01

    Recent Genome-Wide Association Studies (GWAS) have identified four low-penetrance ovarian cancer susceptibility loci. We hypothesized that further moderate- or low-penetrance variants exist among the subset of single-nucleotide polymorphisms (SNPs) not well tagged by the genotyping arrays used...... in a much larger stage 2 set of 4,651 cases and 6,966 controls from the Ovarian Cancer Association Consortium. Given that most of the top 20 SNPs from pooling were validated in the same samples by individual genotyping, the lack of replication is likely to be due to the relatively small sample size in our...... stage 1 GWAS rather than due to problems with the pooling approach. We conclude that there are unlikely to be any moderate or large effects on ovarian cancer risk untagged by less dense arrays. However, our study lacked power to make clear statements on the existence of hitherto untagged small...

  8. Genome-wide gene expression changes in an industrial clavulanic acid overproduction strain of Streptomyces clavuligerus.

    Science.gov (United States)

    Medema, Marnix H; Alam, Mohammad T; Heijne, Wilbert H M; van den Berg, Marco A; Müller, Ulrike; Trefzer, Axel; Bovenberg, Roel A L; Breitling, Rainer; Takano, Eriko

    2011-03-01

    To increase production of the important pharmaceutical compound clavulanic acid, a β-lactamase inhibitor, both random mutagenesis approaches and rational engineering of Streptomyces clavuligerus strains have been extensively applied. Here, for the first time, we compared genome-wide gene expression of an industrial S. clavuligerus strain, obtained through iterative mutagenesis, with that of the wild-type strain. Intriguingly, we found that the majority of the changes contributed not to a complex rewiring of primary metabolism but consisted of a simple upregulation of various antibiotic biosynthesis gene clusters. A few additional transcriptional changes in primary metabolism at key points seem to divert metabolic fluxes to the biosynthetic precursors for clavulanic acid. In general, the observed changes largely coincide with genes that have been targeted by rational engineering in recent years, yet the presence of a number of previously unexplored genes clearly demonstrates that functional genomic analysis can provide new leads for strain improvement in biotechnology.

  9. Genome-wide de Novo Prediction of Proximal and Distal Tissue-Specific Enhancers

    Energy Technology Data Exchange (ETDEWEB)

    Loots, G G; Ovcharenko, I V

    2005-11-03

    Determining how transcriptional regulatory networks are encoded in the human genome is essential for understanding how cellular processes are directed. Here, we present a novel approach for systematically predicting tissue specific regulatory elements (REs) that blends genome-wide expression profiling, vertebrate genome comparisons, and pattern analysis of transcription factor binding sites. This analysis yields 4,670 candidate REs in the human genome with distinct tissue specificities, the majority of which reside far away from transcription start sites. We identify key transcription factors (TFs) for 34 distinct tissues and demonstrate that tissue-specific gene expression relies on multiple regulatory pathways employing similar, but different cohorts of interacting TFs. The methods and results we describe provide a global view of tissue specific gene regulation in humans, and propose a strategy for deciphering the transcriptional regulatory code in eukaryotes.

  10. Genome-wide analyses for dissecting gene regulatory networks in the shoot apical meristem.

    Science.gov (United States)

    Bustamante, Mariana; Matus, José Tomás; Riechmann, José Luis

    2016-03-01

    Shoot apical meristem activity is controlled by complex regulatory networks in which components such as transcription factors, miRNAs, small peptides, hormones, enzymes and epigenetic marks all participate. Many key genes that determine the inherent characteristics of the shoot apical meristem have been identified through genetic approaches. Recent advances in genome-wide studies generating extensive transcriptomic and DNA-binding datasets have increased our understanding of the interactions within the regulatory networks that control the activity of the meristem, identifying new regulators and uncovering connections between previously unlinked network components. In this review, we focus on recent studies that illustrate the contribution of whole genome analyses to understand meristem function. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  11. RNA 3D modules in genome-wide predictions of RNA 2D structure

    DEFF Research Database (Denmark)

    Theis, Corinna; Zirbel, Craig L; Zu Siederdissen, Christian Höner

    2015-01-01

    Recent experimental and computational progress has revealed a large potential for RNA structure in the genome. This has been driven by computational strategies that exploit multiple genomes of related organisms to identify common sequences and secondary structures. However, these computational...... approaches have two main challenges: they are computationally expensive and they have a relatively high false discovery rate (FDR). Simultaneously, RNA 3D structure analysis has revealed modules composed of non-canonical base pairs which occur in non-homologous positions, apparently by independent evolution....... These modules can, for example, occur inside structural elements which in RNA 2D predictions appear as internal loops. Hence one question is if the use of such RNA 3D information can improve the prediction accuracy of RNA secondary structure at a genome-wide level. Here, we use RNAz in combination with 3D...

  12. Genome-wide Escherichia coli stress response and improved tolerance towards industrially relevant chemicals

    DEFF Research Database (Denmark)

    Rau, Martin Holm; Calero Valdayo, Patricia; Lennen, Rebecca

    2016-01-01

    of transcription changes within and between chemical groups, with functions such as energy metabolism, stress response, membrane modification, transporters and iron metabolism being affected. Regulon enrichment analysis identified key regulators likely mediating the transcriptional response, including CRP, Rpo...... approach was employed to understand the chemical stress response of Escherichia coli, including a genome-wide screen for mutants with increased fitness during chemical stress. Twelve chemicals with significant production potential were selected, consisting of organic solvent-like chemicals (butanol...... stress identified 294 enriched and 336 depleted mutants and experimental validation revealed up to 60 % increase in mutant growth rates. Mutants enriched in several conditions contained, among others, insertions in genes of the Mar-Sox-Rob regulon as well as transcription and translation related gene...

  13. A genome-wide analysis of small regulatory RNAs in the human pathogen group A Streptococcus.

    Directory of Open Access Journals (Sweden)

    Nataly Perez

    Full Text Available The coordinated regulation of gene expression is essential for pathogens to infect and cause disease. A recently appreciated mechanism of regulation is that afforded by small regulatory RNA (sRNA molecules. Here, we set out to assess the prevalence of sRNAs in the human bacterial pathogen group A Streptococcus (GAS. Genome-wide identification of candidate GAS sRNAs was performed through a tiling Affymetrix microarray approach and identified 40 candidate sRNAs within the M1T1 GAS strain MGAS2221. Together with a previous bioinformatic approach this brings the number of novel candidate sRNAs in GAS to 75, a number that approximates the number of GAS transcription factors. Transcripts were confirmed by Northern blot analysis for 16 of 32 candidate sRNAs tested, and the abundance of several of these sRNAs were shown to be temporally regulated. Six sRNAs were selected for further study and the promoter, transcriptional start site, and Rho-independent terminator identified for each. Significant variation was observed between the six sRNAs with respect to their stability during growth, and with respect to their inter- and/or intra-serotype-specific levels of abundance. To start to assess the contribution of sRNAs to gene regulation in M1T1 GAS we deleted the previously described sRNA PEL from four clinical isolates. Data from genome-wide expression microarray, quantitative RT-PCR, and Western blot analyses are consistent with PEL having no regulatory function in M1T1 GAS. The finding that candidate sRNA molecules are prevalent throughout the GAS genome provides significant impetus to the study of this fundamental gene-regulatory mechanism in an important human pathogen.

  14. Genome-wide signatures of 'rearrangement hotspots' within segmental duplications in humans.

    Directory of Open Access Journals (Sweden)

    Mohammed Uddin

    Full Text Available The primary objective of this study was to create a genome-wide high resolution map (i.e., >100 bp of 'rearrangement hotspots' which can facilitate the identification of regions capable of mediating de novo deletions or duplications in humans. A hierarchical method was employed to fragment segmental duplications (SDs into multiple smaller SD units. Combining an end space free pairwise alignment algorithm with a 'seed and extend' approach, we have exhaustively searched 409 million alignments to detect complex structural rearrangements within the reference-guided assembly of the NA18507 human genome (18× coverage, including the previously identified novel 4.8 Mb sequence from de novo assembly within this genome. We have identified 1,963 rearrangement hotspots within SDs which encompass 166 genes and display an enrichment of duplicated gene nucleotide variants (DNVs. These regions are correlated with increased non-allelic homologous recombination (NAHR event frequency which presumably represents the origin of copy number variations (CNVs and pathogenic duplications/deletions. Analysis revealed that 20% of the detected hotspots are clustered within the proximal and distal SD breakpoints flanked by the pathogenic deletions/duplications that have been mapped for 24 NAHR-mediated genomic disorders. FISH Validation of selected complex regions revealed 94% concordance with in silico localization of the highly homologous derivatives. Other results from this study indicate that intra-chromosomal recombination is enhanced in genic compared with agenic duplicated regions, and that gene desert regions comprising SDs may represent reservoirs for creation of novel genes. The generation of genome-wide signatures of 'rearrangement hotspots', which likely serve as templates for NAHR, may provide a powerful approach towards understanding the underlying mutational mechanism(s for development of constitutional and acquired diseases.

  15. Genome-wide interaction-based association analysis identified multiple new susceptibility Loci for common diseases.

    Directory of Open Access Journals (Sweden)

    Yang Liu

    2011-03-01

    Full Text Available Genome-wide interaction-based association (GWIBA analysis has the potential to identify novel susceptibility loci. These interaction effects could be missed with the prevailing approaches in genome-wide association studies (GWAS. However, no convincing loci have been discovered exclusively from GWIBA methods, and the intensive computation involved is a major barrier for application. Here, we developed a fast, multi-thread/parallel program named "pair-wise interaction-based association mapping" (PIAM for exhaustive two-locus searches. With this program, we performed a complete GWIBA analysis on seven diseases with stringent control for false positives, and we validated the results for three of these diseases. We identified one pair-wise interaction between a previously identified locus, C1orf106, and one new locus, TEC, that was specific for Crohn's disease, with a Bonferroni corrected P < 0.05 (P = 0.039. This interaction was replicated with a pair of proxy linked loci (P = 0.013 on an independent dataset. Five other interactions had corrected P < 0.5. We identified the allelic effect of a locus close to SLC7A13 for coronary artery disease. This was replicated with a linked locus on an independent dataset (P = 1.09 × 10⁻⁷. Through a local validation analysis that evaluated association signals, rather than locus-based associations, we found that several other regions showed association/interaction signals with nominal P < 0.05. In conclusion, this study demonstrated that the GWIBA approach was successful for identifying novel loci, and the results provide new insights into the genetic architecture of common diseases. In addition, our PIAM program was capable of handling very large GWAS datasets that are likely to be produced in the future.

  16. Case-Control Genome-Wide Association Study of Attention-Deficit/Hyperactivity Disorder

    Science.gov (United States)

    Neale, Benjamin M.; Medland, Sarah; Ripke, Stephan; Anney, Richard J. L.; Asherson, Philip; Buitelaar, Jan; Franke, Barbara; Gill, Michael; Kent, Lindsey; Holmans, Peter; Middleton, Frank; Thapar, Anita; Lesch, Klaus-Peter; Faraone, Stephen V.; Daly, Mark; Nguyen, Thuy Trang; Schafer, Helmut; Steinhausen, Hans-Christoph; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Freitag, Christine; Meyer, Jobst; Palmason, Haukur; Rothenberger, Aribert; Hawi, Ziarih; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. Thus additional genome-wide association studies (GWAS) are needed. Method: We used case-control analyses of 896 cases…

  17. Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder

    Science.gov (United States)

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.

    2010-01-01

    Objective: Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of…

  18. Meta-Analysis of Genome-Wide Association Studies of Attention-Deficit/Hyperactivity Disorder

    Science.gov (United States)

    Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…

  19. Genome-wide screening and identification of antigens for rickettsial vaccine development

    Science.gov (United States)

    The capacity to identify immunogens for vaccine development by genome-wide screening has been markedly enhanced by the availability of complete microbial genome sequences coupled to rapid proteomic and bioinformatic analysis. Critical to this genome-wide screening is in vivo testing in the context o...

  20. Genome-Wide Association Study and Linkage Analysis of the Healthy Aging Index

    DEFF Research Database (Denmark)

    Minster, Ryan L; Sanders, Jason L; Singh, Jatinder;

    2015-01-01

    BACKGROUND: The Healthy Aging Index (HAI) is a tool for measuring the extent of health and disease across multiple systems. METHODS: We conducted a genome-wide association study and a genome-wide linkage analysis to map quantitative trait loci associated with the HAI and a modified HAI weighted...

  1. Meta-analysis of genome-wide association studies of attention-deficit/hyperactivity disorder.

    NARCIS (Netherlands)

    Neale, B.M.; Medland, S.E.; Ripke, S.; Asherson, P.; Franke, B.; Lesch, K.P.; Faraone, S.V.; Nguyen, T.T.; Schafer, H.; Holmans, P.; Daly, M.; Steinhausen, H.C.; Freitag, C.; Reif, A.; Renner, T.J.; Romanos, M.; Romanos, J.; Walitza, S.; Warnke, A.; Meyer, J.; Palmason, H.; Buitelaar, J.K.; Vasquez, A.A.; Lambregts-Rommelse, N.N.J.; Gill, M.; Anney, R.J.; Langely, K.; O'Donovan, M.; Williams, N.; Owen, M.; Thapar, A.; Kent, L.; Sergeant, J.A.; Roeyers, H.; Mick, E.; Biederman, J.; Doyle, A.; Smalley, S.; Loo, S.; Hakonarson, H.; Elia, J.; Todorov, A.; Miranda, A.; Mulas, F.; Ebstein, R.P.; Rothenberger, A.; Banaschewski, T.; Oades, R.D.; Sonuga-Barke, E.; McGough, J.; Nisenbaum, L.; Middleton, F.; Hu, X.; Nelson, S.

    2010-01-01

    OBJECTIVE: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded signifi

  2. Meta-Analysis of Genome-Wide Association Studies of Attention-Deficit/Hyperactivity Disorder

    Science.gov (United States)

    Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…

  3. SUSCEPTIBILITY LOCI FOR UMBILICAL HERNIA IN SWINE DETECTED BY GENOME-WIDE ASSOCIATION.

    Science.gov (United States)

    Liao, X J; Lia, L; Zhang, Z Y; Long, Y; Yang, B; Ruan, G R; Su, Y; Ai, H S; Zhang, W C; Deng, W Y; Xiao, S J; Ren, J; Ding, N S; Huang, L S

    2015-10-01

    Umbilical hernia (UH) is a complex disorder caused by both genetic and environmental factors. UH brings animal welfare problems and severe economic loss to the pig industry. Until now, the genetic basis of UH is poorly understood. The high-density 60K porcine SNP array enables the rapid application of genome-wide association study (GWAS) to identify genetic loci for phenotypic traits at genome wide scale in pigs. The objective of this research was to identify susceptibility loci for swine umbilical hernia using the GWAS approach. We genotyped 478 piglets from 142 families representing three Western commercial breeds with the Illumina PorcineSNP60 BeadChip. Then significant SNPs were detected by GWAS using ROADTRIPS (Robust Association-Detection Test for Related Individuals with Population Substructure) software base on a Bonferroni corrected threshold (P = 1.67E-06) or suggestive threshold (P = 3.34E-05) and false discovery rate (FDR = 0.05). After quality control, 29,924 qualified SNPs and 472 piglets were used for GWAS. Two suggestive loci predisposing to pig UH were identified at 44.25MB on SSC2 (rs81358018, P = 3.34E-06, FDR = 0.049933) and at 45.90MB on SSC17 (rs81479278, P = 3.30E-06, FDR = 0.049933) in Duroc population, respectively. And no SNP was detected to be associated with pig UH at significant level in neither Landrace nor Large White population. Furthermore, we carried out a meta-analysis in the combined pure-breed population containing all the 472 piglets. rs81479278 (P = 1.16E-06, FDR = 0.022475) was identified to associate with pig UH at genome-wide significant level. SRC was characterized as plausible candidate gene for susceptibility to pig UH according to its genomic position and biological functions. To our knowledge, this study gives the first description of GWAS identifying susceptibility loci for umbilical hernia in pigs. Our findings provide deeper insights to the genetic architecture of umbilical hernia in pigs.

  4. Exploring genome-wide - dietary heme iron intake interactions and the risk of type 2 diabetes

    Directory of Open Access Journals (Sweden)

    Louis Robert Pasquale

    2013-01-01

    Full Text Available Aims/hypothesis: Genome-wide association studies have identified over 50 new genetic loci for type 2 diabetes (T2D. Several studies conclude that higher dietary heme iron intake increases the risk of T2D. Therefore we assessed whether the relation between genetic loci and type 2 diabetes is modified by dietary heme iron intake. Methods: We used Affymetrix Genome-Wide Human 6.0 array data (681,770 single nucleotide polymorphisms (SNPs and dietary information collected in the Health Professionals Follow-up Study (n=725 cases; n=1,273 controls and the Nurses’ Health Study (n=1,081 cases; n=1,692 controls. We assessed whether genome-wide SNPs or iron metabolism SNPs interacted with dietary heme iron intake in relation to T2D, testing for associations in each cohort separately and then meta-analyzing to pool the results. Finally, we created 1,000 synthetic pathways matched to an iron metabolism pathway on number of genes, and number of SNPs in each gene. We compared the iron metabolic pathway SNPs with these synthetic SNP assemblies in their relation to T2D to assess if the pathway as a whole interacts with dietary heme iron intake.Results: Using a genomic approach, we found no significant gene-environment interactions with dietary heme iron intake in relation to T2D (top SNP in pooled analysis: intergenic rs10980508; p=1.03E-06 > Bonferroni corrected p=7.33E-08. Furthermore, no SNP in the iron metabolic pathway significantly interacted with dietary heme iron intake (top SNP in pooled analysis: rs1805313; p=1.14E-03 > Bonferroni corrected p=2.10E-04. Finally, neither the main genetic effects (pooled empirical p by SNP=0.41, nor gene – dietary heme-iron interactions (pooled empirical p value for the interactions=0.72 were significant for the iron metabolic pathway as a whole. Conclusions: We found no significant interactions between dietary heme iron intake and common SNPs in relation to T2D.

  5. A genome-wide association search for type 2 diabetes genes in African Americans.

    Directory of Open Access Journals (Sweden)

    Nicholette D Palmer

    Full Text Available African Americans are disproportionately affected by type 2 diabetes (T2DM yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide Association Study (GWAS using the Affymetrix 6.0 array in 965 African-American cases with T2DM and end-stage renal disease (T2DM-ESRD and 1029 population-based controls. The most significant SNPs (n = 550 independent loci were genotyped in a replication cohort and 122 SNPs (n = 98 independent loci were further tested through genotyping three additional validation cohorts followed by meta-analysis in all five cohorts totaling 3,132 cases and 3,317 controls. Twelve SNPs had evidence of association in the GWAS (P<0.0071, were directionally consistent in the Replication cohort and were associated with T2DM in subjects without nephropathy (P<0.05. Meta-analysis in all cases and controls revealed a single SNP reaching genome-wide significance (P<2.5×10(-8. SNP rs7560163 (P = 7.0×10(-9, OR (95% CI = 0.75 (0.67-0.84 is located intergenically between RND3 and RBM43. Four additional loci (rs7542900, rs4659485, rs2722769 and rs7107217 were associated with T2DM (P<0.05 and reached more nominal levels of significance (P<2.5×10(-5 in the overall analysis and may represent novel loci that contribute to T2DM. We have identified novel T2DM-susceptibility variants in the African-American population. Notably, T2DM risk was associated with the major allele and implies an interesting genetic architecture in this population. These results suggest that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations.

  6. VIGoR: Variational Bayesian Inference for Genome-Wide Regression

    Directory of Open Access Journals (Sweden)

    Akio Onogi

    2016-04-01

    Full Text Available Genome-wide regression using a number of genome-wide markers as predictors is now widely used for genome-wide association mapping and genomic prediction. We developed novel software for genome-wide regression which we named VIGoR (variational Bayesian inference for genome-wide regression. Variational Bayesian inference is computationally much faster than widely used Markov chain Monte Carlo algorithms. VIGoR implements seven regression methods, and is provided as a command line program package for Linux/Mac, and as a cross-platform R package. In addition to model fitting, cross-validation and hyperparameter tuning using cross-validation can be automatically performed by modifying a single argument. VIGoR is available at https://github.com/Onogi/VIGoR. The R package is also available at https://cran.r-project.org/web/packages/VIGoR/index.html.

  7. More heritability probably captured by psoriasis genome-wide association study in Han Chinese.

    Science.gov (United States)

    Jiang, Long; Liu, Lu; Cheng, Yuyan; Lin, Yan; Shen, Changbing; Zhu, Caihong; Yang, Sen; Yin, Xianyong; Zhang, Xuejun

    2015-11-15

    Missing heritability is a common problem in genome-wide association studies in complex diseases/traits. To quantify the unbiased heritability estimate, we applied the phenotype correlation-genotype correlation regression in psoriasis genome-wide association data in Han Chinese which comprises 1139 cases and 1132 controls. We estimated that 45.7% heritability of psoriasis in Han Chinese were captured by common variants (s.e.=12.5%), which reinforced that the majority of psoriasis heritability can be covered by common variants in genome-wide association data (68.2%). The results provided evidence that the heritability covered by psoriasis genome-wide genotyping data was probably underestimated in previous restricted maximum likelihood method. Our study highlights the broad role of common variants in the etiology of psoriasis and sheds light on the possibility to identify more common variants of small effect by increasing the sample size in psoriasis genome-wide association studies.

  8. Evaluation of genome-wide power of genetic association studies based on empirical data from the HapMap project.

    Science.gov (United States)

    Nannya, Yasuhito; Taura, Kenjiro; Kurokawa, Mineo; Chiba, Shigeru; Ogawa, Seishi

    2007-10-15

    With recent advances in high-throughput single nucleotide polymorphism (SNP) typing technologies, genome-wide association studies have become a realistic approach to identify the causative genes that are responsible for common diseases of complex genetic traits. In this strategy, a trade-off between the increased genome coverage and a chance of finding SNPs incidentally showing a large statistics becomes serious due to extreme multiple-hypothesis testing. We investigated the extent to which this trade-off limits the genome-wide power with this approach by simulating a large number of case-control panels based on the empirical data from the HapMap Project. In our simulations, statistical costs of multiple hypothesis testing were evaluated by empirically calculating distributions of the maximum value of the chi(2) statistics for a series of marker sets having increasing numbers of SNPs, which were used to determine a genome-wide threshold in the following power simulations. With a practical study size, the cost of multiple testing largely offsets the potential benefits from increased genome coverage given modest genetic effects and/or low frequencies of causal alleles. In most realistic scenarios, increasing genome coverage becomes less influential on the power, while sample size is the predominant determinant of the feasibility of genome-wide association tests. Increasing genome coverage without corresponding increase in sample size will only consume resources without little gain in power. For common causal alleles with relatively large effect sizes [genotype relative risk > or =1.7], we can expect satisfactory power with currently available large-scale genotyping platforms using realistic sample size ( approximately 1000 per arm).

  9. A Preliminary Genome-Wide Association Study of Pain-Related Fear: Implications for Orofacial Pain

    Directory of Open Access Journals (Sweden)

    Cameron L. Randall

    2017-01-01

    Full Text Available Background. Acute and chronic orofacial pain can significantly impact overall health and functioning. Associations between fear of pain and the experience of orofacial pain are well-documented, and environmental, behavioral, and cognitive components of fear of pain have been elucidated. Little is known, however, regarding the specific genes contributing to fear of pain. Methods. A genome-wide association study (GWAS; N=990 was performed to identify plausible genes that may predispose individuals to various levels of fear of pain. The total score and three subscales (fear of minor, severe, and medical/dental pain of the Fear of Pain Questionnaire-9 (FPQ-9 were modeled in a variance components modeling framework to test for genetic association with 8.5 M genetic variants across the genome, while adjusting for sex, age, education, and income. Results. Three genetic loci were significantly associated with fear of minor pain (8q24.13, 8p21.2, and 6q26; p<5×10-8 for all near the genes TMEM65, NEFM, NEFL, AGPAT4, and PARK2. Other suggestive loci were found for the fear of pain total score and each of the FPQ-9 subscales. Conclusions. Multiple genes were identified as possible candidates contributing to fear of pain. The findings may have implications for understanding and treating chronic orofacial pain.

  10. Stepwise Distributed Open Innovation Contests for Software Development: Acceleration of Genome-Wide Association Analysis.

    Science.gov (United States)

    Hill, Andrew; Loh, Po-Ru; Bharadwaj, Ragu B; Pons, Pascal; Shang, Jingbo; Guinan, Eva; Lakhani, Karim; Kilty, Iain; Jelinsky, Scott A

    2017-05-01

    The association of differing genotypes with disease-related phenotypic traits offers great potential to both help identify new therapeutic targets and support stratification of patients who would gain the greatest benefit from specific drug classes. Development of low-cost genotyping and sequencing has made collecting large-scale genotyping data routine in population and therapeutic intervention studies. In addition, a range of new technologies is being used to capture numerous new and complex phenotypic descriptors. As a result, genotype and phenotype datasets have grown exponentially. Genome-wide association studies associate genotypes and phenotypes using methods such as logistic regression. As existing tools for association analysis limit the efficiency by which value can be extracted from increasing volumes of data, there is a pressing need for new software tools that can accelerate association analyses on large genotype-phenotype datasets. Using open innovation (OI) and contest-based crowdsourcing, the logistic regression analysis in a leading, community-standard genetics software package (PLINK 1.07) was substantially accelerated. OI allowed us to do this in computational, numeric, and algorithmic approaches was identified that accelerated the logistic regression in PLINK 1.07 by 18- to 45-fold. Combining contest-derived logistic regression code with coarse-grained parallelization, multithreading, and associated changes to data initialization code further developed through distributed innovation, we achieved an end-to-end speedup of 591-fold for a data set size of 6678 subjects by 645 863 variants, compared to PLINK 1.07's logistic regression. This represents a reduction in run time from 4.8 hours to 29 seconds. Accelerated logistic regression code developed in this project has been incorporated into the PLINK2 project. Using iterative competition-based OI, we have developed a new, faster implementation of logistic regression for genome-wide association

  11. Genome-wide functional analysis of human cell-cycle regulators

    Science.gov (United States)

    Mukherji, Mridul; Bell, Russell; Supekova, Lubica; Wang, Yan; Orth, Anthony P.; Batalov, Serge; Miraglia, Loren; Huesken, Dieter; Lange, Joerg; Martin, Christopher; Sahasrabudhe, Sudhir; Reinhardt, Mischa; Natt, Francois; Hall, Jonathan; Mickanin, Craig; Labow, Mark; Chanda, Sumit K.; Cho, Charles Y.; Schultz, Peter G.

    2006-01-01

    Human cells have evolved complex signaling networks to coordinate the cell cycle. A detailed understanding of the global regulation of this fundamental process requires comprehensive identification of the genes and pathways involved in the various stages of cell-cycle progression. To this end, we report a genome-wide analysis of the human cell cycle, cell size, and proliferation by targeting >95% of the protein-coding genes in the human genome using small interfering RNAs (siRNAs). Analysis of >2 million images, acquired by quantitative fluorescence microscopy, showed that depletion of 1,152 genes strongly affected cell-cycle progression. These genes clustered into eight distinct phenotypic categories based on phase of arrest, nuclear area, and nuclear morphology. Phase-specific networks were built by interrogating knowledge-based and physical interaction databases with identified genes. Genome-wide analysis of cell-cycle regulators revealed a number of kinase, phosphatase, and proteolytic proteins and also suggests that processes thought to regulate G1-S phase progression like receptor-mediated signaling, nutrient status, and translation also play important roles in the regulation of G2/M phase transition. Moreover, 15 genes that are integral to TNF/NF-κB signaling were found to regulate G2/M, a previously unanticipated role for this pathway. These analyses provide systems-level insight into both known and novel genes as well as pathways that regulate cell-cycle progression, a number of which may provide new therapeutic approaches for the treatment of cancer. PMID:17001007

  12. Identification of a novel susceptibility locus for juvenile idiopathic arthritis by genome-wide association analysis

    Science.gov (United States)

    Hinks, Anne; Barton, Anne; Shephard, Neil; Eyre, Steve; Bowes, John; Cargill, Michele; Wang, Eric; Ke, Xiayi; Kennedy, Giulia C; John, Sally; Worthington, Jane; Thomson, Wendy

    2009-01-01

    Objective Juvenile idiopathic arthritis (JIA) is a chronic rheumatic disease of childhood. Two well-established genetic factors known to contribute to JIA susceptibility, HLA and PTPN22, account for less than half of the genetic susceptibility to disease; therefore, additional genetic factors have yet to be identified. The purpose of this study was to perform a systematic search of the genome to identify novel susceptibility loci for JIA. Methods A genome-wide association study using Affymetrix GeneChip 100K arrays was performed in a discovery cohort (279 cases and 184 controls). Single-nucleotide polymorphisms (SNPs) showing the most significant differences between cases and controls were then genotyped in a validation sample of cases (n = 321) and controls, combined with control data from the 1958 UK birth cohort (n = 2,024). In one region in which association was confirmed, fine-mapping was performed (654 cases and 1,847 controls). Results Of the 112 SNPs that were significantly associated with JIA in the discovery cohort, 6 SNPs were associated with JIA in the independent validation cohort. The most strongly associated SNP mapped to the HLA region, while the second strongest association was with a SNP within the VTCN1 gene. Fine-mapping of that gene was performed, and 10 SNPs were found to be associated with JIA. Conclusion This study is the first to successfully apply a SNP-based genome-wide association approach to the investigation of JIA. The replicated association with markers in the VTCN1 gene defined an additional susceptibility locus for JIA and implicates a novel pathway in the pathogenesis of this chronic disease of childhood. PMID:19116933

  13. Genome-wide interaction study of smoking and bladder cancer risk

    Science.gov (United States)

    Figueroa, Jonine D.; Han, Summer S.; Garcia-Closas, Montserrat; Baris, Dalsu; Jacobs, Eric J.; Kogevinas, Manolis; Schwenn, Molly; Malats, Nuria; Johnson, Alison; Purdue, Mark P.; Caporaso, Neil; Landi, Maria Teresa; Prokunina-Olsson, Ludmila; Wang, Zhaoming; Hutchinson, Amy; Burdette, Laurie; Wheeler, William; Vineis, Paolo; Siddiq, Afshan; Cortessis, Victoria K.; Kooperberg, Charles; Cussenot, Olivier; Benhamou, Simone; Prescott, Jennifer; Porru, Stefano; Bueno-de-Mesquita, H.Bas; Trichopoulos, Dimitrios; Ljungberg, Börje; Clavel-Chapelon, Françoise; Weiderpass, Elisabete; Krogh, Vittorio; Dorronsoro, Miren; Travis, Ruth; Tjønneland, Anne; Brenan, Paul; Chang-Claude, Jenny; Riboli, Elio; Conti, David; Gago-Dominguez, Manuela; Stern, Mariana C.; Pike, Malcolm C.; Van Den Berg, David; Yuan, Jian-Min; Hohensee, Chancellor; Rodabough, Rebecca; Cancel-Tassin, Geraldine; Roupret, Morgan; Comperat, Eva; Chen, Constance; De Vivo, Immaculata; Giovannucci, Edward; Hunter, David J.; Kraft, Peter; Lindstrom, Sara; Carta, Angela; Pavanello, Sofia; Arici, Cecilia; Mastrangelo, Giuseppe; Karagas, Margaret R.; Schned, Alan; Armenti, Karla R.; Hosain, G.M.Monawar; Haiman, Chris A.; Fraumeni, Joseph F.; Chanock, Stephen J.; Chatterjee, Nilanjan; Rothman, Nathaniel; Silverman, Debra T.

    2014-01-01

    Bladder cancer is a complex disease with known environmental and genetic risk factors. We performed a genome-wide interaction study (GWAS) of smoking and bladder cancer risk based on primary scan data from 3002 cases and 4411 controls from the National Cancer Institute Bladder Cancer GWAS. Alternative methods were used to evaluate both additive and multiplicative interactions between individual single nucleotide polymorphisms (SNPs) and smoking exposure. SNPs with interaction P values < 5 × 10− 5 were evaluated further in an independent dataset of 2422 bladder cancer cases and 5751 controls. We identified 10 SNPs that showed association in a consistent manner with the initial dataset and in the combined dataset, providing evidence of interaction with tobacco use. Further, two of these novel SNPs showed strong evidence of association with bladder cancer in tobacco use subgroups that approached genome-wide significance. Specifically, rs1711973 (FOXF2) on 6p25.3 was a susceptibility SNP for never smokers [combined odds ratio (OR) = 1.34, 95% confidence interval (CI) = 1.20–1.50, P value = 5.18 × 10− 7]; and rs12216499 (RSPH3-TAGAP-EZR) on 6q25.3 was a susceptibility SNP for ever smokers (combined OR = 0.75, 95% CI = 0.67–0.84, P value = 6.35 × 10− 7). In our analysis of smoking and bladder cancer, the tests for multiplicative interaction seemed to more commonly identify susceptibility loci with associations in never smokers, whereas the additive interaction analysis identified more loci with associations among smokers—including the known smoking and NAT2 acetylation interaction. Our findings provide additional evidence of gene–environment interactions for tobacco and bladder cancer. PMID:24662972

  14. Genome wide association mapping for grain shape traits in indica rice.

    Science.gov (United States)

    Feng, Yue; Lu, Qing; Zhai, Rongrong; Zhang, Mengchen; Xu, Qun; Yang, Yaolong; Wang, Shan; Yuan, Xiaoping; Yu, Hanyong; Wang, Yiping; Wei, Xinghua

    2016-10-01

    Using genome-wide association mapping, 47 SNPs within 27 significant loci were identified for four grain shape traits, and 424 candidate genes were predicted from public database. Grain shape is a key determinant of grain yield and quality in rice (Oryza sativa L.). However, our knowledge of genes controlling rice grain shape remains limited. Genome-wide association mapping based on linkage disequilibrium (LD) has recently emerged as an effective approach for identifying genes or quantitative trait loci (QTL) underlying complex traits in plants. In this study, association mapping based on 5291 single nucleotide polymorphisms (SNPs) was conducted to identify significant loci associated with grain shape traits in a global collection of 469 diverse rice accessions. A total of 47 SNPs were located in 27 significant loci for four grain traits, and explained ~44.93-65.90 % of the phenotypic variation for each trait. In total, 424 candidate genes within a 200 kb extension region (±100 kb of each locus) of these loci were predicted. Of them, the cloned genes GS3 and qSW5 showed very strong effects on grain length and grain width in our study. Comparing with previously reported QTLs for grain shape traits, we found 11 novel loci, including 3, 3, 2 and 3 loci for grain length, grain width, grain length-width ratio and thousand grain weight, respectively. Validation of these new loci would be performed in the future studies. These results revealed that besides GS3 and qSW5, multiple novel loci and mechanisms were involved in determining rice grain shape. These findings provided valuable information for understanding of the genetic control of grain shape and molecular marker assistant selection (MAS) breeding in rice.

  15. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis

    Science.gov (United States)

    2011-09-01

    Almasy, L, Blangero, J. (2009) Human QTL linkage mapping. Genetica 136:333-340. Amos, CI. (2007) Successful design and conduct of genome-wide...quantitative trait loci. Genetica 136:237-243. Skol AD, Scott LJ, Abecasis GR, Boehnke M. (2006) Joint analysis is more efficient than replication

  16. Genome-wide association of bipolar disorder suggests an enrichment of replicable associations in regions near genes.

    Directory of Open Access Journals (Sweden)

    Erin N Smith

    2011-06-01

    Full Text Available Although a highly heritable and disabling disease, bipolar disorder's (BD genetic variants have been challenging to identify. We present new genotype data for 1,190 cases and 401 controls and perform a genome-wide association study including additional samples for a total of 2,191 cases and 1,434 controls. We do not detect genome-wide significant associations for individual loci; however, across all SNPs, we show an association between the power to detect effects calculated from a previous genome-wide association study and evidence for replication (P = 1.5×10(-7. To demonstrate that this result is not likely to be a false positive, we analyze replication rates in a large meta-analysis of height and show that, in a large enough study, associations replicate as a function of power, approaching a linear relationship. Within BD, SNPs near exons exhibit a greater probability of replication, supporting an enrichment of reproducible associations near functional regions of genes. These results indicate that there is likely common genetic variation associated with BD near exons (±10 kb that could be identified in larger studies and, further, provide a framework for assessing the potential for replication when combining results from multiple studies.

  17. A scan statistic to extract causal gene clusters from case-control genome-wide rare CNV data

    Directory of Open Access Journals (Sweden)

    Scherer Stephen W

    2011-05-01

    Full Text Available Abstract Background Several statistical tests have been developed for analyzing genome-wide association data by incorporating gene pathway information in terms of gene sets. Using these methods, hundreds of gene sets are typically tested, and the tested gene sets often overlap. This overlapping greatly increases the probability of generating false positives, and the results obtained are difficult to interpret, particularly when many gene sets show statistical significance. Results We propose a flexible statistical framework to circumvent these problems. Inspired by spatial scan statistics for detecting clustering of disease occurrence in the field of epidemiology, we developed a scan statistic to extract disease-associated gene clusters from a whole gene pathway. Extracting one or a few significant gene clusters from a global pathway limits the overall false positive probability, which results in increased statistical power, and facilitates the interpretation of test results. In the present study, we applied our method to genome-wide association data for rare copy-number variations, which have been strongly implicated in common diseases. Application of our method to a simulated dataset demonstrated the high accuracy of this method in detecting disease-associated gene clusters in a whole gene pathway. Conclusions The scan statistic approach proposed here shows a high level of accuracy in detecting gene clusters in a whole gene pathway. This study has provided a sound statistical framework for analyzing genome-wide rare CNV data by incorporating topological information on the gene pathway.

  18. The challenges of genome-wide interaction studies: lessons to learn from the analysis of HDL blood levels.

    Science.gov (United States)

    van Leeuwen, Elisabeth M; Smouter, Françoise A S; Kam-Thong, Tony; Karbalai, Nazanin; Smith, Albert V; Harris, Tamara B; Launer, Lenore J; Sitlani, Colleen M; Li, Guo; Brody, Jennifer A; Bis, Joshua C; White, Charles C; Jaiswal, Alok; Oostra, Ben A; Hofman, Albert; Rivadeneira, Fernando; Uitterlinden, Andre G; Boerwinkle, Eric; Ballantyne, Christie M; Gudnason, Vilmundur; Psaty, Bruce M; Cupples, L Adrienne; Järvelin, Marjo-Riitta; Ripatti, Samuli; Isaacs, Aaron; Müller-Myhsok, Bertram; Karssen, Lennart C; van Duijn, Cornelia M

    2014-01-01

    Genome-wide association studies (GWAS) have revealed 74 single nucleotide polymorphisms (SNPs) associated with high-density lipoprotein cholesterol (HDL) blood levels. This study is, to our knowledge, the first genome-wide interaction study (GWIS) to identify SNP×SNP interactions associated with HDL levels. We performed a GWIS in the Rotterdam Study (RS) cohort I (RS-I) using the GLIDE tool which leverages the massively parallel computing power of Graphics Processing Units (GPUs) to perform linear regression on all genome-wide pairs of SNPs. By performing a meta-analysis together with Rotterdam Study cohorts II and III (RS-II and RS-III), we were able to filter 181 interaction terms with a p-valueSPATA8 (ENSG00000185594) being associated with HDL levels. However, p-values do not reach the preset Bonferroni correction of the p-values. Our study suggest that even for highly genetically determined traits such as HDL the sample sizes needed to detect SNP×SNP interactions are large and the 2-step filtering approaches do not yield a solution. Here we present our analysis plan and our reservations concerning GWIS.

  19. Genome-wide association of bipolar disorder suggests an enrichment of replicable associations in regions near genes.

    Directory of Open Access Journals (Sweden)

    Erin N Smith

    2011-06-01

    Full Text Available Although a highly heritable and disabling disease, bipolar disorder's (BD genetic variants have been challenging to identify. We present new genotype data for 1,190 cases and 401 controls and perform a genome-wide association study including additional samples for a total of 2,191 cases and 1,434 controls. We do not detect genome-wide significant associations for individual loci; however, across all SNPs, we show an association between the power to detect effects calculated from a previous genome-wide association study and evidence for replication (P = 1.5×10(-7. To demonstrate that this result is not likely to be a false positive, we analyze replication rates in a large meta-analysis of height and show that, in a large enough study, associations replicate as a function of power, approaching a linear relationship. Within BD, SNPs near exons exhibit a greater probability of replication, supporting an enrichment of reproducible associations near functional regions of genes. These results indicate that there is likely common genetic variation associated with BD near exons (±10 kb that could be identified in larger studies and, further, provide a framework for assessing the potential for replication when combining results from multiple studies.

  20. Univariate/multivariate genome-wide association scans using data from families and unrelated samples.

    Directory of Open Access Journals (Sweden)

    Lei Zhang

    Full Text Available As genome-wide association studies (GWAS are becoming more popular, two approaches, among others, could be considered in order to improve statistical power for identifying genes contributing subtle to moderate effects to human diseases. The first approach is to increase sample size, which could be achieved by combining both unrelated and familial subjects together. The second approach is to jointly analyze multiple correlated traits. In this study, by extending generalized estimating equations (GEEs, we propose a simple approach for performing univariate or multivariate association tests for the combined data of unrelated subjects and nuclear families. In particular, we correct for population stratification by integrating principal component analysis and transmission disequilibrium test strategies. The proposed method allows for multiple siblings as well as missing parental information. Simulation studies show that the proposed test has improved power compared to two popular methods, EIGENSTRAT and FBAT, by analyzing the combined data, while correcting for population stratification. In addition, joint analysis of bivariate traits has improved power over univariate analysis when pleiotropic effects are present. Application to the Genetic Analysis Workshop 16 (GAW16 data sets attests to the feasibility and applicability of the proposed method.

  1. Genome-wide family-based linkage analysis of exome chip variants and cardiometabolic risk.

    Science.gov (United States)

    Hellwege, Jacklyn N; Palmer, Nicholette D; Raffield, Laura M; Ng, Maggie C Y; Hawkins, Gregory A; Long, Jirong; Lorenzo, Carlos; Norris, Jill M; Ida Chen, Y-D; Speliotes, Elizabeth K; Rotter, Jerome I; Langefeld, Carl D; Wagenknecht, Lynne E; Bowden, Donald W

    2014-05-01

    Linkage analysis of complex traits has had limited success in identifying trait-influencing loci. Recently, coding variants have been implicated as the basis for some biomedical associations. We tested whether coding variants are the basis for linkage peaks of complex traits in 42 African-American (n = 596) and 90 Hispanic (n = 1,414) families in the Insulin Resistance Atherosclerosis Family Study (IRASFS) using Illumina HumanExome Beadchips. A total of 92,157 variants in African Americans (34%) and 81,559 (31%) in Hispanics were polymorphic and tested using two-point linkage and association analyses with 37 cardiometabolic phenotypes. In African Americans 77 LOD scores greater than 3 were observed. The highest LOD score was 4.91 with the APOE SNP rs7412 (MAF = 0.13) with plasma apolipoprotein B (ApoB). This SNP was associated with ApoB (P-value = 4 × 10(-19)) and accounted for 16.2% of the variance in African Americans. In Hispanic families, 104 LOD scores were greater than 3. The strongest evidence of linkage (LOD = 4.29) was with rs5882 (MAF = 0.46) in CETP with HDL. CETP variants were strongly associated with HDL (0.00049 evidence of strong linkage in this genome wide survey of primarily coding variants was uncommon. Loci with strong evidence of linkage was characterized by large contributions to the variance, and, in these cases, are common variants. Less compelling evidence of linkage and association was observed with additional loci that may require larger family sets to confirm. © 2014 WILEY PERIODICALS, INC.

  2. Genome Wide Association Study to predict severe asthma exacerbations in children using random forests classifiers

    Directory of Open Access Journals (Sweden)

    Litonjua Augusto A

    2011-06-01

    Full Text Available Abstract Background Personalized health-care promises tailored health-care solutions to individual patients based on their genetic background and/or environmental exposure history. To date, disease prediction has been based on a few environmental factors and/or single nucleotide polymorphisms (SNPs, while complex diseases are usually affected by many genetic and environmental factors with each factor contributing a small portion to the outcome. We hypothesized that the use of random forests classifiers to select SNPs would result in an improved predictive model of asthma exacerbations. We tested this hypothesis in a population of childhood asthmatics. Methods In this study, using emergency room visits or hospitalizations as the definition of a severe asthma exacerbation, we first identified a list of top Genome Wide Association Study (GWAS SNPs ranked by Random Forests (RF importance score for the CAMP (Childhood Asthma Management Program population of 127 exacerbation cases and 290 non-exacerbation controls. We predict severe asthma exacerbations using the top 10 to 320 SNPs together with age, sex, pre-bronchodilator FEV1 percentage predicted, and treatment group. Results Testing in an independent set of the CAMP population shows that severe asthma exacerbations can be predicted with an Area Under the Curve (AUC = 0.66 with 160-320 SNPs in comparison to an AUC score of 0.57 with 10 SNPs. Using the clinical traits alone yielded AUC score of 0.54, suggesting the phenotype is affected by genetic as well as environmental factors. Conclusions Our study shows that a random forests algorithm can effectively extract and use the information contained in a small number of samples. Random forests, and other machine learning tools, can be used with GWAS studies to integrate large numbers of predictors simultaneously.

  3. Local Genealogies in a Linear Mixed Model for Genome-wide Association Mapping in Complex Pedigreed Populations

    DEFF Research Database (Denmark)

    Sahana, Goutam; Mailund, Thomas; Lund, Mogens Sandø

    2011-01-01

    be extended to incorporate other effects in a straightforward and rigorous fashion. Here, we present a complementary approach, called ‘GENMIX (genealogy based mixed model)’ which combines advantages from two powerful GWAS methods: genealogy-based haplotype grouping and MMA. Subjects and Methods: We validated......Introduction: The state-of-the-art for dealing with multiple levels of relationship among the samples in genome-wide association studies (GWAS) is unified mixed model analysis (MMA). This approach is very flexible, can be applied to both family-based and population-based samples, and can...

  4. Genome-wide linkage scan for loci associated with epilepsy in Belgian shepherd dogs

    Directory of Open Access Journals (Sweden)

    Regan Kelly R

    2010-05-01

    Full Text Available Abstract Background Idiopathic epilepsy in the Belgian shepherd dog is known to have a substantial genetic component. The objective of this study was to identify genomic regions associated with the expression of generalized seizures in the Belgian Tervuren and Sheepdog. Results DNA from 366 dogs, of which 74 were classified as epileptic, representing two extended families were subjected to a genome-wide linkage scan using 410 microsatellite markers yielding informative coverage averaging 5.95 ± 0.21 Mb. Though previous studies based on pedigree analyses proposed a major gene of influence, the present study demonstrated the trait to be highly polygenic. Studies of complex disorders in humans indicate that a liberal composite evaluation of genetic linkage is needed to identify underlying quantitative trait loci (QTLs. Four chromosomes yielded tentative linkage based upon LOD scores in excess of 1.0. Possible QTLs within these regions were supported also by analyses of multipoint linkage, allele frequency, TDT, and transmission of haplotype blocks. Conclusions Taken together the data tentatively indicate six QTLs, three on CFA 2, and one on each of CFA 6, 12, and 37, that support fine mapping for mutations associated with epilepsy in the Belgian shepherd. The study also underscores the complexity of genomic linkage studies for polygenic disorders.

  5. PRIMUS: rapid reconstruction of pedigrees from genome-wide estimates of identity by descent.

    Science.gov (United States)

    Staples, Jeffrey; Qiao, Dandi; Cho, Michael H; Silverman, Edwin K; Nickerson, Deborah A; Below, Jennifer E

    2014-11-06

    Understanding and correctly utilizing relatedness among samples is essential for genetic analysis; however, managing sample records and pedigrees can often be error prone and incomplete. Data sets ascertained by random sampling often harbor cryptic relatedness that can be leveraged in genetic analyses for maximizing power. We have developed a method that uses genome-wide estimates of pairwise identity by descent to identify families and quickly reconstruct and score all possible pedigrees that fit the genetic data by using up to third-degree relatives, and we have included it in the software package PRIMUS (Pedigree Reconstruction and Identification of the Maximally Unrelated Set). Here, we validate its performance on simulated, clinical, and HapMap pedigrees. Among these samples, we demonstrate that PRIMUS can verify reported pedigree structures and identify cryptic relationships. Finally, we show that PRIMUS reconstructed pedigrees, all of which were previously unknown, for 203 families from a cohort collected in Starr County, TX (1,890 samples).

  6. Genome-Wide Association Analyses Point to Candidate Genes for Electric Shock Avoidance in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Mirjam Appel

    Full Text Available Electric shock is a common stimulus for nociception-research and the most widely used reinforcement in aversive associative learning experiments. Yet, nothing is known about the mechanisms it recruits at the periphery. To help fill this gap, we undertook a genome-wide association analysis using 38 inbred Drosophila melanogaster strains, which avoided shock to varying extents. We identified 514 genes whose expression levels and/ or sequences co-varied with shock avoidance scores. We independently scrutinized 14 of these genes using mutants, validating the effect of 7 of them on shock avoidance. This emphasizes the value of our candidate gene list as a guide for follow-up research. In addition, by integrating our association results with external protein-protein interaction data we obtained a shock avoidance-associated network of 38 genes. Both this network and the original candidate list contained a substantial number of genes that affect mechanosensory bristles, which are hair-like organs distributed across the fly's body. These results may point to a potential role for mechanosensory bristles in shock sensation. Thus, we not only provide a first list of candidate genes for shock avoidance, but also point to an interesting new hypothesis on nociceptive mechanisms.

  7. Computer vision and machine learning for robust phenotyping in genome-wide studies

    Science.gov (United States)

    Zhang, Jiaoping; Naik, Hsiang Sing; Assefa, Teshale; Sarkar, Soumik; Reddy, R. V. Chowda; Singh, Arti; Ganapathysubramanian, Baskar; Singh, Asheesh K.

    2017-01-01

    Traditional evaluation of crop biotic and abiotic stresses are time-consuming and labor-intensive limiting the ability to dissect the genetic basis of quantitative traits. A machine learning (ML)-enabled image-phenotyping pipeline for the genetic studies of abiotic stress iron deficiency chlorosis (IDC) of soybean is reported. IDC classification and severity for an association panel of 461 diverse plant-introduction accessions was evaluated using an end-to-end phenotyping workflow. The workflow consisted of a multi-stage procedure including: (1) optimized protocols for consistent image capture across plant canopies, (2) canopy identification and registration from cluttered backgrounds, (3) extraction of domain expert informed features from the processed images to accurately represent IDC expression, and (4) supervised ML-based classifiers that linked the automatically extracted features with expert-rating equivalent IDC scores. ML-generated phenotypic data were subsequently utilized for the genome-wide association study and genomic prediction. The results illustrate the reliability and advantage of ML-enabled image-phenotyping pipeline by identifying previously reported locus and a novel locus harboring a gene homolog involved in iron acquisition. This study demonstrates a promising path for integrating the phenotyping pipeline into genomic prediction, and provides a systematic framework enabling robust and quicker phenotyping through ground-based systems. PMID:28272456

  8. Genome-Wide Association Analyses Point to Candidate Genes for Electric Shock Avoidance in Drosophila melanogaster.

    Science.gov (United States)

    Appel, Mirjam; Scholz, Claus-Jürgen; Müller, Tobias; Dittrich, Marcus; König, Christian; Bockstaller, Marie; Oguz, Tuba; Khalili, Afshin; Antwi-Adjei, Emmanuel; Schauer, Tamas; Margulies, Carla; Tanimoto, Hiromu; Yarali, Ayse

    2015-01-01

    Electric shock is a common stimulus for nociception-research and the most widely used reinforcement in aversive associative learning experiments. Yet, nothing is known about the mechanisms it recruits at the periphery. To help fill this gap, we undertook a genome-wide association analysis using 38 inbred Drosophila melanogaster strains, which avoided shock to varying extents. We identified 514 genes whose expression levels and/ or sequences co-varied with shock avoidance scores. We independently scrutinized 14 of these genes using mutants, validating the effect of 7 of them on shock avoidance. This emphasizes the value of our candidate gene list as a guide for follow-up research. In addition, by integrating our association results with external protein-protein interaction data we obtained a shock avoidance-associated network of 38 genes. Both this network and the original candidate list contained a substantial number of genes that affect mechanosensory bristles, which are hair-like organs distributed across the fly's body. These results may point to a potential role for mechanosensory bristles in shock sensation. Thus, we not only provide a first list of candidate genes for shock avoidance, but also point to an interesting new hypothesis on nociceptive mechanisms.

  9. Computer vision and machine learning for robust phenotyping in genome-wide studies.

    Science.gov (United States)

    Zhang, Jiaoping; Naik, Hsiang Sing; Assefa, Teshale; Sarkar, Soumik; Reddy, R V Chowda; Singh, Arti; Ganapathysubramanian, Baskar; Singh, Asheesh K

    2017-03-08

    Traditional evaluation of crop biotic and abiotic stresses are time-consuming and labor-intensive limiting the ability to dissect the genetic basis of quantitative traits. A machine learning (ML)-enabled image-phenotyping pipeline for the genetic studies of abiotic stress iron deficiency chlorosis (IDC) of soybean is reported. IDC classification and severity for an association panel of 461 diverse plant-introduction accessions was evaluated using an end-to-end phenotyping workflow. The workflow consisted of a multi-stage procedure including: (1) optimized protocols for consistent image capture across plant canopies, (2) canopy identification and registration from cluttered backgrounds, (3) extraction of domain expert informed features from the processed images to accurately represent IDC expression, and (4) supervised ML-based classifiers that linked the automatically extracted features with expert-rating equivalent IDC scores. ML-generated phenotypic data were subsequently utilized for the genome-wide association study and genomic prediction. The results illustrate the reliability and advantage of ML-enabled image-phenotyping pipeline by identifying previously reported locus and a novel locus harboring a gene homolog involved in iron acquisition. This study demonstrates a promising path for integrating the phenotyping pipeline into genomic prediction, and provides a systematic framework enabling robust and quicker phenotyping through ground-based systems.

  10. Replication of a Genome-Wide Association Study of Birth Weight in Preterm Neonates

    Science.gov (United States)

    Ryckman, Kelli K; Feenstra, Bjarke; Shaffer, John R.; Bream, Elise NA; Geller, Frank; Feingold, Eleanor; Weeks, Daniel E; Gadow, Enrique; Cosentino, Viviana; Saleme, Cesar; Simhan, Hyagriv N; Merrill, David; Fong, Chin-To; Busch, Tamara; Berends, Susan K; Comas, Belen; Camelo, Jorge L; Boyd, Heather; Laurie, Cathy; Crosslin, David; Zhang, Qi; Doheny, Kim F; Pugh, Elizabeth; Melbye, Mads; Marazita, Mary L; Dagle, John M; Murray, Jeffrey C

    2011-01-01

    Objective To examine associations in a preterm population between rs9883204 in ADCY5 and rs900400 near LEKR1 and CCNL1 with birth weight. Both markers were associated with birth weight in a term population in a recent genome-wide association (GWA) study by Freathy et al. Study design A meta-analysis of mother and infant samples was performed for associations of rs900400 and rs9883204 with birth weight in 393 families from the U.S., 265 families from Argentina and 735 mother-infant pairs from Denmark. Z scores adjusted for infant sex and gestational age were generated for each population separately and regressed on allele counts. Association evidence was combined across sites by inverse-variance weighted meta-analysis. Results Each additional C allele of rs900400 (LEKR1/CCNL1) in infants was marginally associated with a 0.069 standard deviation (SD) lower birth weight (95% CI = −0.159 – 0.022, P = 0.068). This result was slightly more pronounced after adjusting for smoking (P = 0.036). There were no significant associations identified with rs9883204 or in maternal samples. Conclusions These results indicate the potential importance of this marker on birth weight irrespective of gestational age. PMID:21885063

  11. SiRNA sequence model: redesign algorithm based on available genome-wide libraries.

    Science.gov (United States)

    Kozak, Karol

    2013-12-01

    The evolution of RNA interference (RNAi) and the development of technologies exploiting its biology have enabled scientists to rapidly examine the consequences of depleting a particular gene product in cells. Design tools have been developed based on experimental data to increase the knockdown efficiency of siRNAs. Not all siRNAs that are developed to a given target mRNA are equally effective. Currently available design algorithms take an accession, identify conserved regions among their transcript space, find accessible regions within the mRNA, design all possible siRNAs for these regions, filter them based on multi-scores thresholds, and then perform off-target filtration. These different criteria are used by commercial suppliers to produce siRNA genome-wide libraries for different organisms. In this article, we analyze existing siRNA design algorithms and evaluate weight of design parameters for libraries produced in the last decade. We proved that not all essential parameters are currently applied by siRNA vendors. Based on our evaluation results, we were able to suggest an siRNA sequence pattern. The findings in our study can be useful for commercial vendors improving the design of RNAi constructs, by addressing both the issue of potency and the issue of specificity.

  12. Genome-wide meta-analysis of systolic blood pressure in children with sickle cell disease.

    Directory of Open Access Journals (Sweden)

    Pallav Bhatnagar

    Full Text Available In pediatric sickle cell disease (SCD patients, it has been reported that higher systolic blood pressure (SBP is associated with increased risk of a silent cerebral infarction (SCI. SCI is a major cause of neurologic morbidity in children with SCD, and blood pressure is a potential modulator of clinical manifestations of SCD; however, the risk factors underlying these complications are not well characterized. The aim of this study was to identify genetic variants that influence SBP in an African American population in the setting of SCD, and explore the use of SBP as an endo-phenotype for SCI. We conducted a genome-wide meta-analysis for SBP using two SCD cohorts, as well as a candidate screen based on published SBP loci. A total of 1,617 patients were analyzed, and while no SNP reached genome-wide significance (P-value<5.0 x 10(-8, a number of suggestive candidate loci were identified. The most significant SNP, rs7952106 (P-value=8.57 x 10(-7, was in the DRD2 locus on chromosome 11. In a gene-based association analysis, MIR4301 (micro-RNA4301, which resides in an intron of DRD2, was the most significant gene (P-value=5.2 x 10(-5. Examining 27 of the previously reported SBP associated SNPs, 4 SNPs were nominally significant. A genetic risk score was constructed to assess the aggregated genetic effect of the published SBP variants, demonstrating a significant association (P=0.05. In addition, we also assessed whether these variants are associated with SCI, validating the use of SBP as an endo-phenotype for SCI. Three SNPs were nominally associated, and only rs2357790 (5' CACNB2 was significant for both SBP and SCI. None of these SNPs retained significance after Bonferroni correction. Taken together, our results suggest the importance of DRD2 genetic variation in the modulation of SBP, and extend the aggregated importance of previously reported SNPs in the modulation of SBP in an African American cohort, more specifically in children with SCD.

  13. Genome-wide analysis of over 106 000 individuals identifies 9 neuroticism-associated loci

    Science.gov (United States)

    Smith, D J; Escott-Price, V; Davies, G; Bailey, M E S; Colodro-Conde, L; Ward, J; Vedernikov, A; Marioni, R; Cullen, B; Lyall, D; Hagenaars, S P; Liewald, D C M; Luciano, M; Gale, C R; Ritchie, S J; Hayward, C; Nicholl, B; Bulik-Sullivan, B; Adams, M; Couvy-Duchesne, B; Graham, N; Mackay, D; Evans, J; Smith, B H; Porteous, D J; Medland, S E; Martin, N G; Holmans, P; McIntosh, A M; Pell, J P; Deary, I J; O'Donovan, M C

    2016-01-01

    Neuroticism is a personality trait of fundamental importance for psychological well-being and public health. It is strongly associated with major depressive disorder (MDD) and several other psychiatric conditions. Although neuroticism is heritable, attempts to identify the alleles involved in previous studies have been limited by relatively small sample sizes. Here we report a combined meta-analysis of genome-wide association study (GWAS) of neuroticism that includes 91 370 participants from the UK Biobank cohort, 6659 participants from the Generation Scotland: Scottish Family Health Study (GS:SFHS) and 8687 participants from a QIMR (Queensland Institute of Medical Research) Berghofer Medical Research Institute (QIMR) cohort. All participants were assessed using the same neuroticism instrument, the Eysenck Personality Questionnaire-Revised (EPQ-R-S) Short Form's Neuroticism scale. We found a single-nucleotide polymorphism-based heritability estimate for neuroticism of ∼15% (s.e.=0.7%). Meta-analysis identified nine novel loci associated with neuroticism. The strongest evidence for association was at a locus on chromosome 8 (P=1.5 × 10−15) spanning 4 Mb and containing at least 36 genes. Other associated loci included interesting candidate genes on chromosome 1 (GRIK3 (glutamate receptor ionotropic kainate 3)), chromosome 4 (KLHL2 (Kelch-like protein 2)), chromosome 17 (CRHR1 (corticotropin-releasing hormone receptor 1) and MAPT (microtubule-associated protein Tau)) and on chromosome 18 (CELF4 (CUGBP elav-like family member 4)). We found no evidence for genetic differences in the common allelic architecture of neuroticism by sex. By comparing our findings with those of the Psychiatric Genetics Consortia, we identified a strong genetic correlation between neuroticism and MDD and a less strong but significant genetic correlation with schizophrenia, although not with bipolar disorder. Polygenic risk scores derived from the primary UK Biobank sample captured

  14. Genome-wide analysis of over 106 000 individuals identifies 9 neuroticism-associated loci.

    Science.gov (United States)

    Smith, D J; Escott-Price, V; Davies, G; Bailey, M E S; Colodro-Conde, L; Ward, J; Vedernikov, A; Marioni, R; Cullen, B; Lyall, D; Hagenaars, S P; Liewald, D C M; Luciano, M; Gale, C R; Ritchie, S J; Hayward, C; Nicholl, B; Bulik-Sullivan, B; Adams, M; Couvy-Duchesne, B; Graham, N; Mackay, D; Evans, J; Smith, B H; Porteous, D J; Medland, S E; Martin, N G; Holmans, P; McIntosh, A M; Pell, J P; Deary, I J; O'Donovan, M C

    2016-06-01

    Neuroticism is a personality trait of fundamental importance for psychological well-being and public health. It is strongly associated with major depressive disorder (MDD) and several other psychiatric conditions. Although neuroticism is heritable, attempts to identify the alleles involved in previous studies have been limited by relatively small sample sizes. Here we report a combined meta-analysis of genome-wide association study (GWAS) of neuroticism that includes 91 370 participants from the UK Biobank cohort, 6659 participants from the Generation Scotland: Scottish Family Health Study (GS:SFHS) and 8687 participants from a QIMR (Queensland Institute of Medical Research) Berghofer Medical Research Institute (QIMR) cohort. All participants were assessed using the same neuroticism instrument, the Eysenck Personality Questionnaire-Revised (EPQ-R-S) Short Form's Neuroticism scale. We found a single-nucleotide polymorphism-based heritability estimate for neuroticism of ∼15% (s.e.=0.7%). Meta-analysis identified nine novel loci associated with neuroticism. The strongest evidence for association was at a locus on chromosome 8 (P=1.5 × 10(-15)) spanning 4 Mb and containing at least 36 genes. Other associated loci included interesting candidate genes on chromosome 1 (GRIK3 (glutamate receptor ionotropic kainate 3)), chromosome 4 (KLHL2 (Kelch-like protein 2)), chromosome 17 (CRHR1 (corticotropin-releasing hormone receptor 1) and MAPT (microtubule-associated protein Tau)) and on chromosome 18 (CELF4 (CUGBP elav-like family member 4)). We found no evidence for genetic differences in the common allelic architecture of neuroticism by sex. By comparing our findings with those of the Psychiatric Genetics Consortia, we identified a strong genetic correlation between neuroticism and MDD and a less strong but significant genetic correlation with schizophrenia, although not with bipolar disorder. Polygenic risk scores derived from the primary UK Biobank sample captured

  15. Genome-wide association meta-analysis of neuropathologic features of Alzheimer's disease and related dementias.

    Directory of Open Access Journals (Sweden)

    Gary W Beecham

    2014-09-01

    Full Text Available Alzheimer's disease (AD and related dementias are a major public health challenge and present a therapeutic imperative for which we need additional insight into molecular pathogenesis. We performed a genome-wide association study and analysis of known genetic risk loci for AD dementia using neuropathologic data from 4,914 brain autopsies. Neuropathologic data were used to define clinico-pathologic AD dementia or controls, assess core neuropathologic features of AD (neuritic plaques, NPs; neurofibrillary tangles, NFTs, and evaluate commonly co-morbid neuropathologic changes: cerebral amyloid angiopathy (CAA, Lewy body disease (LBD, hippocampal sclerosis of the elderly (HS, and vascular brain injury (VBI. Genome-wide significance was observed for clinico-pathologic AD dementia, NPs, NFTs, CAA, and LBD with a number of variants in and around the apolipoprotein E gene (APOE. GalNAc transferase 7 (GALNT7, ATP-Binding Cassette, Sub-Family G (WHITE, Member 1 (ABCG1, and an intergenic region on chromosome 9 were associated with NP score; and Potassium Large Conductance Calcium-Activated Channel, Subfamily M, Beta Member 2 (KCNMB2 was strongly associated with HS. Twelve of the 21 non-APOE genetic risk loci for clinically-defined AD dementia were confirmed in our clinico-pathologic sample: CR1, BIN1, CLU, MS4A6A, PICALM, ABCA7, CD33, PTK2B, SORL1, MEF2C, ZCWPW1, and CASS4 with 9 of these 12 loci showing larger odds ratio in the clinico-pathologic sample. Correlation of effect sizes for risk of AD dementia with effect size for NFTs or NPs showed positive correlation, while those for risk of VBI showed a moderate negative correlation. The other co-morbid neuropathologic features showed only nominal association with the known AD loci. Our results discovered new genetic associations with specific neuropathologic features and aligned known genetic risk for AD dementia with specific neuropathologic changes in the largest brain autopsy study of AD and related

  16. Genome-wide reconstruction of complex structural variants using read clouds.

    Science.gov (United States)

    Spies, Noah; Weng, Ziming; Bishara, Alex; McDaniel, Jennifer; Catoe, David; Zook, Justin M; Salit, Marc; West, Robert B; Batzoglou, Serafim; Sidow, Arend

    2017-09-01

    In read cloud approaches, microfluidic partitioning of long genomic DNA fragments and barcoding of shorter fragments derived from these fragments retains long-range information in short sequencing reads. This combination of short reads with long-range information represents a powerful alternative to single-molecule long-read sequencing. We develop Genome-wide Reconstruction of Complex Structural Variants (GROC-SVs) for SV detection and assembly from read cloud data and apply this method to Illumina-sequenced 10x Genomics sarcoma and breast cancer data sets. Compared with short-fragment sequencing, GROC-SVs substantially improves the specificity of breakpoint detection at comparable sensitivity. This approach also performs sequence assembly across multiple breakpoints simultaneously, enabling the reconstruction of events exhibiting remarkable complexity. We show that chromothriptic rearrangements occurred before copy number amplifications, and that rates of single-nucleotide variants and SVs are not correlated. Our results support the use of read cloud approaches to advance the characterization of large and complex structural variation.

  17. A genome-wide, fine-scale map of natural pigmentation variation in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Héloïse Bastide

    2013-06-01

    Full Text Available Various approaches can be applied to uncover the genetic basis of natural phenotypic variation, each with their specific strengths and limitations. Here, we use a replicated genome-wide association approach (Pool-GWAS to fine-scale map genomic regions contributing to natural variation in female abdominal pigmentation in Drosophila melanogaster, a trait that is highly variable in natural populations and highly heritable in the laboratory. We examined abdominal pigmentation phenotypes in approximately 8000 female European D. melanogaster, isolating 1000 individuals with extreme phenotypes. We then used whole-genome Illumina sequencing to identify single nucleotide polymorphisms (SNPs segregating in our sample, and tested these for associations with pigmentation by contrasting allele frequencies between replicate pools of light and dark individuals. We identify two small regions near the pigmentation genes tan and bric-à-brac 1, both corresponding to known cis-regulatory regions, which contain SNPs showing significant associations with pigmentation variation. While the Pool-GWAS approach suffers some limitations, its cost advantage facilitates replication and it can be applied to any non-model system with an available reference genome.

  18. Single- and Bayesian Multi-Marker Genome-Wide Association for Haematological Parameters in Pigs.

    Directory of Open Access Journals (Sweden)

    Siriluck Ponsuksili

    Full Text Available Haematological traits are important traits that show associations with immune and metabolic status, as well as diseases in humans and animals. Mapping genome regions that affect the blood cell traits can contribute to the identification of genomic features useable as biomarkers for immune, disease and metabolic status. A genome-wide association study (GWAS was conducted using PorcineSNP60 BeadChips. Single-marker and Bayesian multi-marker approaches were integrated to identify genomic regions and corresponding genes overlapping for both methods. GWAS was performed for haematological traits of 591 German Landrace pig. Heritability estimates for haematological traits were medium to high. In total 252 single SNPs associated with 12 haematological traits were identified (NegLog10 of p-value > 5. The Bayesian multi-marker approach revealed 102 QTL regions across the genome, indicated by 1-Mb windows with contribution to additive genetic variance above 0.5%. The integration of both methods resulted in 24 overlapping QTL regions. This study identified overlapping QTL regions from single- and multi-marker approaches for haematological traits. Identifying candidate genes that affect blood cell traits provides the first step towards the understanding of the molecular basis of haematological phenotypes.

  19. Genome-wide association studies of female reproduction in tropically adapted beef cattle

    National Research Council Canada - National Science Library

    Hawken, R J; Zhang, Y D; Fortes, M R S; Collis, E; Barris, W C; Corbet, N J; Williams, P J; Fordyce, G; Holroyd, R G; Walkley, J R W; Barendse, W; Johnston, D J; Prayaga, K C; Tier, B; Reverter, A; Lehnert, S A

    2012-01-01

    .... To elucidate the genetics underlying reproduction in beef cattle, we performed a genome-wide association study using the bovine SNP50 chip in 2 tropically adapted beef cattle breeds, Brahman and Tropical Composite...

  20. Mammalian RNA polymerase II core promoters: insights from genome-wide studies

    DEFF Research Database (Denmark)

    Sandelin, Albin; Carninci, Piero; Lenhard, Boris

    2007-01-01

    The identification and characterization of mammalian core promoters and transcription start sites is a prerequisite to understanding how RNA polymerase II transcription is controlled. New experimental technologies have enabled genome-wide discovery and characterization of core promoters, revealin...

  1. Reconstructing Genome-Wide Protein–Protein Interaction Networks Using Multiple Strategies with Homologous Mapping

    Science.gov (United States)

    Lo, Yu-Shu; Huang, Sing-Han; Luo, Yong-Chun; Lin, Chun-Yu; Yang, Jinn-Moon

    2015-01-01

    PPIs share similar biological processes and cellular components, and the reconstructed genome-wide PPI network can reflect network topology and modularity. We believe that our method is useful for inferring reliable PPIs and reconstructing a comprehensive PPI network of an interesting organism. PMID:25602759

  2. Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci

    OpenAIRE

    Stahl, Eli A; Raychaudhuri, Soumya; Remmers, Elaine F.; Xie, Gang; Eyre, Stephen; Thomson, Brian P.; Li, Yonghong; Kurreeman, Fina A. S.; Zhernakova, Alexandra; Hinks, Anne; Guiducci, Candace; Chen, Robert; Alfredsson, Lars; Amos, Christopher I.; Ardlie, Kristin G.

    2010-01-01

    To identify novel genetic risk factors for rheumatoid arthritis (RA), we conducted a genome-wide association study (GWAS) meta-analysis of 5,539 autoantibody positive RA cases and 20,169 controls of European descent, followed by replication in an independent set of 6,768 RA cases and 8,806 controls. Of 34 SNPs selected for replication, 7 novel RA risk alleles were identified at genome-wide significance (P

  3. A genome-wide 20 K citrus microarray for gene expression analysis

    OpenAIRE

    Martinez-Godoy, M Angeles; Mauri, Nuria; Juarez, Jose; Marques, M Carmen; Santiago, Julia; Forment, Javier; Gadea, Jose

    2008-01-01

    Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with a high genome coverage. In the last years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-wide cDNA...

  4. Pedigree reconstruction with genome-wide markers in potato

    Science.gov (United States)

    Reliable pedigree information facilitates a scientific approach to breeding, but errors can be introduced in many stages of a breeding program. Our objective was to use single nucleotide polymorphisms (SNPs) to check the pedigree records of elite North American potato germplasm. A population of 635 ...

  5. An integration of genome-wide association study and gene expression profiling to prioritize the discovery of novel susceptibility loci for osteoporosis-related traits

    NARCIS (Netherlands)

    C.Y. Hsu (Chao); M.C. Zillikens (Carola); S.G. Wilson (Scott); C.R. Farber (Charles); S. Demissie (Serkalem); N. Soranzo (Nicole); E.N. Bianchi (Estelle); E. Grundberg (Elin); L. Liang (Liming); J.B. Richards (Brent); K. Estrada Gil (Karol); Y. Zhou (Yanhua); A. van Nas (Atila); M.F. Moffatt (Miriam); G. Zhai (Guangju); A. Hofman (Albert); J.B.J. van Meurs (Joyce); H.A.P. Pols (Huib); R.I. Price (Roger Ian); O. Nilsson (Ola); T. Pastinen (Tomi); L.A. Cupples (Adrienne); A.J. Lusis (Aldons Jake); E.E. Schadt (Eric); A.G. Uitterlinden (André); D.P. Kiel (Douglas); F. Rivadeneira Ramirez (Fernando); T.D. Spector (Timothy); D. Karasik (David); S.L. Ferrari (Serge)

    2010-01-01

    textabstractOsteoporosis is a complex disorder and commonly leads to fractures in elderly persons. Genome-wide association studies (GWAS) have become an unbiased approach to identify variations in the genome that potentially affect health. However, the genetic variants identified so far only explain

  6. Genome-wide scan for visceral leishmaniasis in mixed-breed dogs identifies candidate genes involved in T helper cells and macrophage signaling

    Science.gov (United States)

    We conducted a genome-wide scan for visceral leishmaniasis in mixed-breed dogs from a highly endemic area in Brazil using 149,648 single nucleotide polymorphism (SNP) markers genotyped in 20 cases and 28 controls. Using a mixed model approach, we found two candidate loci on canine autosomes 1 and 2....

  7. A Genome-Wide Test of the Differential Susceptibility Hypothesis Reveals a Genetic Predictor of Differential Response to Psychological Treatments for Child Anxiety Disorders.

    Science.gov (United States)

    Keers, Robert; Coleman, Jonathan R I; Lester, Kathryn J; Roberts, Susanna; Breen, Gerome; Thastum, Mikael; Bögels, Susan; Schneider, Silvia; Heiervang, Einar; Meiser-Stedman, Richard; Nauta, Maaike; Creswell, Cathy; Thirlwall, Kerstin; Rapee, Ronald M; Hudson, Jennifer L; Lewis, Cathryn; Plomin, Robert; Eley, Thalia C

    2016-01-01

    The differential susceptibly hypothesis suggests that certain genetic variants moderate the effects of both negative and positive environments on mental health and may therefore be important predictors of response to psychological treatments. Nevertheless, the identification of such variants has so far been limited to preselected candidate genes. In this study we extended the differential susceptibility hypothesis from a candidate gene to a genome-wide approach to test whether a polygenic score of environmental sensitivity predicted response to cognitive behavioural therapy (CBT) in children with anxiety disorders. We identified variants associated with environmental sensitivity using a novel method in which within-pair variability in emotional problems in 1,026 monozygotic twin pairs was examined as a function of the pairs' genotype. We created a polygenic score of environmental sensitivity based on the whole-genome findings and tested the score as a moderator of parenting on emotional problems in 1,406 children and response to individual, group and brief parent-led CBT in 973 children with anxiety disorders. The polygenic score significantly moderated the effects of parenting on emotional problems and the effects of treatment. Individuals with a high score responded significantly better to individual CBT than group CBT or brief parent-led CBT (remission rates: 70.9, 55.5 and 41.6%, respectively). Pending successful replication, our results should be considered exploratory. Nevertheless, if replicated, they suggest that individuals with the greatest environmental sensitivity may be more likely to develop emotional problems in adverse environments but also benefit more from the most intensive types of treatment. © 2016 S. Karger AG, Basel.

  8. A Genome-Wide Test of the Differential Susceptibility Hypothesis Reveals a Genetic Predictor of Differential Response to Psychological Treatments for Child Anxiety Disorders

    Science.gov (United States)

    Keers, Robert; Coleman, Jonathan R.I.; Lester, Kathryn J.; Roberts, Susanna; Breen, Gerome; Thastum, Mikael; Bögels, Susan; Schneider, Silvia; Heiervang, Einar; Meiser-Stedman, Richard; Nauta, Maaike; Creswell, Cathy; Thirlwall, Kerstin; Rapee, Ronald M.; Hudson, Jennifer L.; Lewis, Cathryn; Plomin, Robert; Eley, Thalia C.

    2016-01-01

    Background The differential susceptibly hypothesis suggests that certain genetic variants moderate the effects of both negative and positive environments on mental health and may therefore be important predictors of response to psychological treatments. Nevertheless, the identification of such variants has so far been limited to preselected candidate genes. In this study we extended the differential susceptibility hypothesis from a candidate gene to a genome-wide approach to test whether a polygenic score of environmental sensitivity predicted response to cognitive behavioural therapy (CBT) in children with anxiety disorders. Methods We identified variants associated with environmental sensitivity using a novel method in which within-pair variability in emotional problems in 1,026 monozygotic twin pairs was examined as a function of the pairs' genotype. We created a polygenic score of environmental sensitivity based on the whole-genome findings and tested the score as a moderator of parenting on emotional problems in 1,406 children and response to individual, group and brief parent-led CBT in 973 children with anxiety disorders. Results The polygenic score significantly moderated the effects of parenting on emotional problems and the effects of treatment. Individuals with a high score responded significantly better to individual CBT than group CBT or brief parent-led CBT (remission rates: 70.9, 55.5 and 41.6%, respectively). Conclusions Pending successful replication, our results should be considered exploratory. Nevertheless, if replicated, they suggest that individuals with the greatest environmental sensitivity may be more likely to develop emotional problems in adverse environments but also benefit more from the most intensive types of treatment. PMID:27043157

  9. Incorporating biological pathways via a Markov random field model in genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Min Chen

    2011-04-01

    Full Text Available Genome-wide association studies (GWAS examine a large number of markers across the genome to identify associations between genetic variants and disease. Most published studies examine only single markers, which may be less informative than considering multiple markers and multiple genes jointly because genes may interact with each other to affect disease risk. Much knowledge has been accumulated in the literature on biological pathways and interactions. It is conceivable that appropriate incorporation of such prior knowledge may improve the likelihood of making genuine discoveries. Although a number of methods have been developed recently to prioritize genes using prior biological knowledge, such as pathways, most methods treat genes in a specific pathway as an exchangeable set without considering the topological structure of a pathway. However, how genes are related with each other in a pathway may be very informative to identify association signals. To make use of the connectivity information among genes in a pathway in GWAS analysis, we propose a Markov Random Field (MRF model to incorporate pathway topology for association analysis. We show that the conditional distribution of our MRF model takes on a simple logistic regression form, and we propose an iterated conditional modes algorithm as well as a decision theoretic approach for statistical inference of each gene's association with disease. Simulation studies show that our proposed framework is more effective to identify genes associated with disease than a single gene-based method. We also illustrate the usefulness of our approach through its applications to a real data example.

  10. Genome-wide protein-protein interactions and protein function exploration in cyanobacteria.

    Science.gov (United States)

    Lv, Qi; Ma, Weimin; Liu, Hui; Li, Jiang; Wang, Huan; Lu, Fang; Zhao, Chen; Shi, Tieliu

    2015-10-22

    Genome-wide network analysis is well implemented to study proteins of unknown function. Here, we effectively explored protein functions and the biological mechanism based on inferred high confident protein-protein interaction (PPI) network in cyanobacteria. We integrated data from seven different sources and predicted 1,997 PPIs, which were evaluated by experiments in molecular mechanism, text mining of literatures in proved direct/indirect evidences, and "interologs" in conservation. Combined the predicted PPIs with known PPIs, we obtained 4,715 no-redundant PPIs (involving 3,231 proteins covering over 90% of genome) to generate the PPI network. Based on the PPI network, terms in Gene ontology (GO) were assigned to function-unknown proteins. Functional modules were identified by dissecting the PPI network into sub-networks and analyzing pathway enrichment, with which we investigated novel function of underlying proteins in protein complexes and pathways. Examples of photosynthesis and DNA repair indicate that the network approach is a powerful tool in protein function analysis. Overall, this systems biology approach provides a new insight into posterior functional analysis of PPIs in cyanobacteria.

  11. Next-generation libraries for robust RNA interference-based genome-wide screens.

    Science.gov (United States)

    Kampmann, Martin; Horlbeck, Max A; Chen, Yuwen; Tsai, Jordan C; Bassik, Michael C; Gilbert, Luke A; Villalta, Jacqueline E; Kwon, S Chul; Chang, Hyeshik; Kim, V Narry; Weissman, Jonathan S

    2015-06-30

    Genetic screening based on loss-of-function phenotypes is a powerful discovery tool in biology. Although the recent development of clustered regularly interspaced short palindromic repeats (CRISPR)-based screening approaches in mammalian cell culture has enormous potential, RNA interference (RNAi)-based screening remains the method of choice in several biological contexts. We previously demonstrated that ultracomplex pooled short-hairpin RNA (shRNA) libraries can largely overcome the problem of RNAi off-target effects in genome-wide screens. Here, we systematically optimize several aspects of our shRNA library, including the promoter and microRNA context for shRNA expression, selection of guide strands, and features relevant for postscreen sample preparation for deep sequencing. We present next-generation high-complexity libraries targeting human and mouse protein-coding genes, which we grouped into 12 sublibraries based on biological function. A pilot screen suggests that our next-generation RNAi library performs comparably to current CRISPR interference (CRISPRi)-based approaches and can yield complementary results with high sensitivity and high specificity.

  12. BioMet Toolbox: genome-wide analysis of metabolism

    OpenAIRE

    Cvijovic, M.; R. Olivares-Hernandez; Agren, R.; Dahr, N.; Vongsangnak, W.; Nookaew, I.; K. R. Patil; Nielsen, J.

    2010-01-01

    The rapid progress of molecular biology tools for directed genetic modifications, accurate quantitative experimental approaches, high-throughput measurements, together with development of genome sequencing has made the foundation for a new area of metabolic engineering that is driven by metabolic models. Systematic analysis of biological processes by means of modelling and simulations has made the identification of metabolic networks and prediction of metabolic capabilities under different co...

  13. TATES: efficient multivariate genotype-phenotype analysis for genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Sophie van der Sluis

    Full Text Available To date, the genome-wide association study (GWAS is the primary tool to identify genetic variants that cause phenotypic variation. As GWAS analyses are generally univariate in nature, multivariate phenotypic information is usually reduced to a single composite score. This practice often results in loss of statistical power to detect causal variants. Multivariate genotype-phenotype methods do exist but attain maximal power only in special circumstances. Here, we present a new multivariate method that we refer to as TATES (Trait-based Association Test that uses Extended Simes procedure, inspired by the GATES procedure proposed by Li et al (2011. For each component of a multivariate trait, TATES combines p-values obtained in standard univariate GWAS to acquire one trait-based p-value, while correcting for correlations between components. Extensive simulations, probing a wide variety of genotype-phenotype models, show that TATES's false positive rate is correct, and that TATES's statistical power to detect causal variants explaining 0.5% of the variance can be 2.5-9 times higher than the power of univariate tests based on composite scores and 1.5-2 times higher than the power of the standard MANOVA. Unlike other multivariate methods, TATES detects both genetic variants that are common to multiple phenotypes and genetic variants that are specific to a single phenotype, i.e. TATES provides a more complete view of the genetic architecture of complex traits. As the actual causal genotype-phenotype model is usually unknown and probably phenotypically and genetically complex, TATES, available as an open source program, constitutes a powerful new multivariate strategy that allows researchers to identify novel causal variants, while the complexity of traits is no longer a limiting factor.

  14. Genome-wide linkage analysis of severe, early-onset chronic obstructive pulmonary disease: airflow obstruction and chronic bronchitis phenotypes.

    Science.gov (United States)

    Silverman, Edwin K; Mosley, Jonathan D; Palmer, Lyle J; Barth, Matthew; Senter, Jody M; Brown, Alison; Drazen, Jeffrey M; Kwiatkowski, David J; Chapman, Harold A; Campbell, Edward J; Province, Michael A; Rao, D C; Reilly, John J; Ginns, Leo C; Speizer, Frank E; Weiss, Scott T

    2002-03-15

    Familial aggregation of chronic obstructive pulmonary disease (COPD) has been demonstrated, but linkage analysis of COPD-related phenotypes has not been reported previously. An autosomal 10 cM genome-wide scan of short tandem repeat (STR) polymorphic markers was analyzed for linkage to COPD-related phenotypes in 585 members of 72 pedigrees ascertained through severe, early-onset COPD probands without severe alpha1-antitrypsin deficiency. Multipoint non-parametric linkage analysis (using the ALLEGRO program) was performed for qualitative phenotypes including moderate airflow obstruction [forced expiratory volume at one second (FEV(1)) < 60% predicted, FEV(1)/FVC < 90% predicted], mild airflow obstruction (FEV(1) < 80% predicted, FEV(1)/FVC < 90% predicted) and chronic bronchitis. The strongest evidence for linkage in all subjects was observed at chromosomes 12 (LOD = 1.70) and 19 (LOD = 1.54) for moderate airflow obstruction, chromosomes 8 (LOD = 1.36) and 19 (LOD = 1.09) for mild airflow obstruction and chromosomes 19 (LOD = 1.21) and 22 (LOD = 1.37) for chronic bronchitis. Restricting analysis to cigarette smokers only provided increased evidence for linkage of mild airflow obstruction and chronic bronchitis to several genomic regions; for mild airflow obstruction in smokers only, the maximum LOD was 1.64 at chromosome 19, whereas for chronic bronchitis in smokers only, the maximum LOD was 2.08 at chromosome 22. On chromosome 12p, 12 additional STR markers were genotyped, which provided additional support for an airflow obstruction locus in that region with a non-parametric multipoint approach for moderate airflow obstruction (LOD = 2.13) and mild airflow obstruction (LOD = 1.43). Using a dominant model with the STR markers on 12p, two point parametric linkage analysis of all subjects demonstrated a maximum LOD score of 2.09 for moderate airflow obstruction and 2.61 for mild airflow obstruction. In smokers only, the maximum two point LOD score for mild airflow

  15. Genome-wide association study of insect bite hypersensitivity in two horse populations in the Netherlands

    Directory of Open Access Journals (Sweden)

    Schurink Anouk

    2012-10-01

    Full Text Available Abstract Background Insect bite hypersensitivity is a common allergic disease in horse populations worldwide. Insect bite hypersensitivity is affected by both environmental and genetic factors. However, little is known about genes contributing to the genetic variance associated with insect bite hypersensitivity. Therefore, the aim of our study was to identify and quantify genomic associations with insect bite hypersensitivity in Shetland pony mares and Icelandic horses in the Netherlands. Methods Data on 200 Shetland pony mares and 146 Icelandic horses were collected according to a matched case–control design. Cases and controls were matched on various factors (e.g. region, sire to minimize effects of population stratification. Breed-specific genome-wide association studies were performed using 70 k single nucleotide polymorphisms genotypes. Bayesian variable selection method Bayes-C with a threshold model implemented in GenSel software was applied. A 1 Mb non-overlapping window approach that accumulated contributions of adjacent single nucleotide polymorphisms was used to identify associated genomic regions. Results The percentage of variance explained by all single nucleotide polymorphisms was 13% in Shetland pony mares and 28% in Icelandic horses. The 20 non-overlapping windows explaining the largest percentages of genetic variance were found on nine chromosomes in Shetland pony mares and on 14 chromosomes in Icelandic horses. Overlap in identified associated genomic regions between breeds would suggest interesting candidate regions to follow-up on. Such regions common to both breeds (within 15 Mb were found on chromosomes 3, 7, 11, 20 and 23. Positional candidate genes within 2 Mb from the associated windows were identified on chromosome 20 in both breeds. Candidate genes are within the equine lymphocyte antigen class II region, which evokes an immune response by recognizing many foreign molecules. Conclusions The genome-wide association

  16. Genome-wide and expression analysis of protein phosphatase 2C in rice and Arabidopsis

    Directory of Open Access Journals (Sweden)

    Jakab Stephen

    2008-11-01

    Full Text Available Abstract Background The protein phosphatase 2Cs (PP2Cs from various organisms have been implicated to act as negative modulators of protein kinase pathways involved in diverse environmental stress responses and developmental processes. A genome-wide overview of the PP2C gene family in plants is not yet available. Results A comprehensive computational analysis identified 80 and 78 PP2C genes in Arabidopsis thaliana (AtPP2Cs and Oryza sativa (OsPP2Cs, respectively, which denotes the PP2C gene family as one of the largest families identified in plants. Phylogenic analysis divided PP2Cs in Arabidopsis and rice into 13 and 11 subfamilies, respectively, which are supported by the analyses of gene structures and protein motifs. Comparative analysis between the PP2C genes in Arabidopsis and rice identified common and lineage-specific subfamilies and potential 'gene birth-and-death' events. Gene duplication analysis reveals that whole genome and chromosomal segment duplications mainly contributed to the expansion of both OsPP2Cs and AtPP2Cs, but tandem or local duplication occurred less frequently in Arabidopsis than rice. Some protein motifs are widespread among the PP2C proteins, whereas some other motifs are specific to only one or two subfamilies. Expression pattern analysis suggests that 1 most PP2C genes play functional roles in multiple tissues in both species, 2 the induced expression of most genes in subfamily A by diverse stimuli indicates their primary role in stress tolerance, especially ABA response, and 3 the expression pattern of subfamily D members suggests that they may constitute positive regulators in ABA-mediated signaling pathways. The analyses of putative upstream regulatory elements by two approaches further support the functions of subfamily A in ABA signaling, and provide insights into the shared and different transcriptional regulation machineries in dicots and monocots. Conclusion This comparative genome-wide overview of the PP

  17. Evaluation of the lasso and the elastic net in genome-wide association studies.

    Science.gov (United States)

    Waldmann, Patrik; Mészáros, Gábor; Gredler, Birgit; Fuerst, Christian; Sölkner, Johann

    2013-01-01

    The number of publications performing genome-wide association studies (GWAS) has increased dramatically. Penalized regression approaches have been developed to overcome the challenges caused by the high dimensional data, but these methods are relatively new in the GWAS field. In this study we have compared the statistical performance of two methods (the least absolute shrinkage and selection operator-lasso and the elastic net) on two simulated data sets and one real data set from a 50 K genome-wide single nucleotide polymorphism (SNP) panel of 5570 Fleckvieh bulls. The first simulated data set displays moderate to high linkage disequilibrium between SNPs, whereas the second simulated data set from the QTLMAS 2010 workshop is biologically more complex. We used cross-validation to find the optimal value of regularization parameter λ with both minimum MSE and minimum MSE + 1SE of minimum MSE. The optimal λ values were used for variable selection. Based on the first simulated data, we found that the minMSE in general picked up too many SNPs. At minMSE + 1SE, the lasso didn't acquire any false positives, but selected too few correct SNPs. The elastic net provided the best compromise between few false positives and many correct selections when the penalty weight α was around 0.1. However, in our simulation setting, this α value didn't result in the lowest minMSE + 1SE. The number of selected SNPs from the QTLMAS 2010 data was after correction for population structure 82 and 161 for the lasso and the elastic net, respectively. In the Fleckvieh data set after population structure correction lasso and the elastic net identified from 1291 to 1966 important SNPs for milk fat content, with major peaks on chromosomes 5, 14, 15, and 20. Hence, we can conclude that it is important to analyze GWAS data with both the lasso and the elastic net and an alternative tuning criterion to minimum MSE is needed for variable selection.

  18. Evaluation of the lasso and the elastic net in genome-wide association studies

    Directory of Open Access Journals (Sweden)

    Patrik eWaldmann

    2013-12-01

    Full Text Available The number of publications performing genome-wide association studies (GWAS has increased dramatically. Penalized regression approaches have been developed to overcome the challenges caused by the high dimensional data, but these methods are relatively new in the GWAS area. In this study we have compared the statistical performance of two methods (the least absolute shrinkage and selection operator - lasso and the elastic net on two simulated data sets and one real data set from a 50K genome-wide single nucleotide polymorphism (SNP panel of 5,570 Fleckvieh bulls. The first simulated data set displays moderate to high linkage disequilibrium between SNPs, whereas the second simulated data set from the QTLMAS 2010 workshop is biologically more complex. We used cross-validation to find the optimal value of regularization parameter λ with both minimum MSE and minimum MSE + 1 SE of minimum MSE. The optimal λ values were used for variable selection. Based on the first simulated data, we found that the minMSE in general picked up too many SNPs. At minMSE+1SE, the lasso didn’t acquire any false positives, but selected too few correct SNPs. The elastic net provided the best compromise between few false positives and many correct selections when the penalty weight α was around 0.1. However, in our simulation setting, this α value didn’t result in the lowest minMSE+1SE. The number of selected SNPs from the QTLMAS 2010 data was after correction for population structure 82 and 161 for the lasso and the elastic net, respectively. In the Fleckvieh data set after population structure correction lasso and the elastic net identified from 1,291 to 1,966 important SNPs for milk fat content, with major peaks on chromosomes 5, 14, 15 and 20. Hence, we can conclude that it is important to analyze GWAS data with both the lasso and the elastic net and an alternative tuning criterion to minimum MSE is needed for variable selection.

  19. A genome-wide association study identifies protein quantitative trait loci (pQTLs.

    Directory of Open Access Journals (Sweden)

    David Melzer

    2008-05-01

    Full Text Available There is considerable evidence that human genetic variation influences gene expression. Genome-wide studies have revealed that mRNA levels are associated with genetic variation in or close to the gene coding for those mRNA transcripts - cis effects, and elsewhere in the genome - trans effects. The role of genetic variation in determining protein levels has not been systematically assessed. Using a genome-wide association approach we show that common genetic variation influences levels of clinically relevant proteins in human serum and plasma. We evaluated the role of 496,032 polymorphisms on levels of 42 proteins measured in 1200 fasting individuals from the population based InCHIANTI study. Proteins included insulin, several interleukins, adipokines, chemokines, and liver function markers that are implicated in many common diseases including metabolic, inflammatory, and infectious conditions. We identified eight Cis effects, including variants in or near the IL6R (p = 1.8x10(-57, CCL4L1 (p = 3.9x10(-21, IL18 (p = 6.8x10(-13, LPA (p = 4.4x10(-10, GGT1 (p = 1.5x10(-7, SHBG (p = 3.1x10(-7, CRP (p = 6.4x10(-6 and IL1RN (p = 7.3x10(-6 genes, all associated with their respective protein products with effect sizes ranging from 0.19 to 0.69 standard deviations per allele. Mechanisms implicated include altered rates of cleavage of bound to unbound soluble receptor (IL6R, altered secretion rates of different sized proteins (LPA, variation in gene copy number (CCL4L1 and altered transcription (GGT1. We identified one novel trans effect that was an association between ABO blood group and tumour necrosis factor alpha (TNF-alpha levels (p = 6.8x10(-40, but this finding was not present when TNF-alpha was measured using a different assay , or in a second study, suggesting an assay-specific association. Our results show that protein levels share some of the features of the genetics of gene expression. These include the presence of strong genetic effects in cis

  20. Genome-wide association for abdominal subcutaneous and visceral adipose reveals a novel locus for visceral fat in women.

    Directory of Open Access Journals (Sweden)

    Caroline S Fox

    Full Text Available Body fat distribution, particularly centralized obesity, is associated with metabolic risk above and beyond total adiposity. We performed genome-wide association of abdominal adipose depots quantified using computed tomography (CT to uncover novel loci for body fat distribution among participants of European ancestry. Subcutaneous and visceral fat were quantified in 5,560 women and 4,997 men from 4 population-based studies. Genome-wide genotyping was performed using standard arrays and imputed to ~2.5 million Hapmap SNPs. Each study performed a genome-wide association analysis of subcutaneous adipose tissue (SAT, visceral adipose tissue (VAT, VAT adjusted for body mass index, and VAT/SAT ratio (a metric of the propensity to store fat viscerally as compared to subcutaneously in the overall sample and in women and men separately. A weighted z-score meta-analysis was conducted. For the VAT/SAT ratio, our most significant p-value was rs11118316 at LYPLAL1 gene (p = 3.1 × 10E-09, previously identified in association with waist-hip ratio. For SAT, the most significant SNP was in the FTO gene (p = 5.9 × 10E-08. Given the known gender differences in body fat distribution, we performed sex-specific analyses. Our most significant finding was for VAT in women, rs1659258 near THNSL2 (p = 1.6 × 10-08, but not men (p = 0.75. Validation of this SNP in the GIANT consortium data demonstrated a similar sex-specific pattern, with observed significance in women (p = 0.006 but not men (p = 0.24 for BMI and waist circumference (p = 0.04 [women], p = 0.49 [men]. Finally, we interrogated our data for the 14 recently published loci for body fat distribution (measured by waist-hip ratio adjusted for BMI; associations were observed at 7 of these loci. In contrast, we observed associations at only 7/32 loci previously identified in association with BMI; the majority of overlap was observed with SAT. Genome-wide association for visceral and subcutaneous fat revealed a

  1. Genome Wide Search for Biomarkers to Diagnose Yersinia Infections.

    Science.gov (United States)

    Kalia, Vipin Chandra; Kumar, Prasun

    2015-12-01

    Bacterial identification on the basis of the highly conserved 16S rRNA (rrs) gene is limited by its presence in multiple copies and a very high level of similarity among them. The need is to look for other genes with unique characteristics to be used as biomarkers. Fifty-one sequenced genomes belonging to 10 different Yersinia species were used for searching genes common to all the genomes. Out of 304 common genes, 34 genes of sizes varying from 0.11 to 4.42 kb, were selected and subjected to in silico digestion with 10 different Restriction endonucleases (RE) (4-6 base cutters). Yersinia species have 6-7 copies of rrs per genome, which are difficult to distinguish by multiple sequence alignments or their RE digestion patterns. However, certain unique combinations of other common gene sequences-carB, fadJ, gluM, gltX, ileS, malE, nusA, ribD, and rlmL and their RE digestion patterns can be used as markers for identifying 21 strains belonging to 10 Yersinia species: Y. aldovae, Y. enterocolitica, Y. frederiksenii, Y. intermedia, Y. kristensenii, Y. pestis, Y. pseudotuberculosis, Y. rohdei, Y. ruckeri, and Y. similis. This approach can be applied for rapid diagnostic applications.

  2. Genome-Wide Screening of Genes Required for Glycosylphosphatidylinositol Biosynthesis.

    Directory of Open Access Journals (Sweden)

    Yao Rong

    Full Text Available Glycosylphosphatidylinositol (GPI is synthesized and transferred to proteins in the endoplasmic reticulum (ER. GPI-anchored proteins are then transported from the ER to the plasma membrane through the Golgi apparatus. To date, at least 17 steps have been identified to be required for the GPI biosynthetic pathway. Here, we aimed to establish a comprehensive screening method to identify genes involved in GPI biosynthesis using mammalian haploid screens. Human haploid cells were mutagenized by the integration of gene trap vectors into the genome. Mutagenized cells were then treated with a bacterial pore-forming toxin, aerolysin, which binds to GPI-anchored proteins for targeting to the cell membrane. Cells that showed low surface expression of CD59, a GPI-anchored protein, were further enriched for. Gene trap insertion sites in the non-selected population and in the enriched population were determined by deep sequencing. This screening enriched 23 gene regions among the 26 known GPI biosynthetic genes, which when mutated are expected to decrease the surface expression of GPI-anchored proteins. Our results indicate that the forward genetic approach using haploid cells is a useful and powerful technique to identify factors involved in phenotypes of interest.

  3. Genome-wide DNA methylation maps in follicular lymphoma cells determined by methylation-enriched bisulfite sequencing.

    Directory of Open Access Journals (Sweden)

    Jeong-Hyeon Choi

    Full Text Available BACKGROUND: Follicular lymphoma (FL is a form of non-Hodgkin's lymphoma (NHL that arises from germinal center (GC B-cells. Despite the significant advances in immunotherapy, FL is still not curable. Beyond transcriptional profiling and genomics datasets, there currently is no epigenome-scale dataset or integrative biology approach that can adequately model this disease and therefore identify novel mechanisms and targets for successful prevention and treatment of FL. METHODOLOGY/PRINCIPAL FINDINGS: We performed methylation-enriched genome-wide bisulfite sequencing of FL cells and normal CD19(+ B-cells using 454 sequencing technology. The methylated DNA fragments were enriched with methyl-binding proteins, treated with bisulfite, and sequenced using the Roche-454 GS FLX sequencer. The total number of bases covered in the human genome was 18.2 and 49.3 million including 726,003 and 1.3 million CpGs in FL and CD19(+ B-cells, respectively. 11,971 and 7,882 methylated regions of interest (MRIs were identified respectively. The genome-wide distribution of these MRIs displayed significant differences between FL and normal B-cells. A reverse trend in the distribution of MRIs between the promoter and the gene body was observed in FL and CD19(+ B-cells. The MRIs identified in FL cells also correlated well with transcriptomic data and ChIP-on-Chip analyses of genome-wide histone modifications such as tri-methyl-H3K27, and tri-methyl-H3K4, indicating a concerted epigenetic alteration in FL cells. CONCLUSIONS/SIGNIFICANCE: This study is the first to provide a large scale and comprehensive analysis of the DNA methylation sequence composition and distribution in the FL epigenome. These integrated approaches have led to the discovery of novel and frequent targets of aberrant epigenetic alterations. The genome-wide bisulfite sequencing approach developed here can be a useful tool for profiling DNA methylation in clinical samples.

  4. Recent advances in globin research using genome-wide association studies and gene editing.

    Science.gov (United States)

    Orkin, Stuart H

    2016-03-01

    A long-sought goal in the hemoglobin field has been an improved understanding of the mechanisms that regulate the switch from fetal (HbF) to adult (HbA) hemoglobin during development. With such knowledge, the hope is that strategies for directed reactivation of HbF in adults could be devised as an approach to therapy for the β-hemoglobinopathies thalassemia and sickle cell disease. Recent genome-wide association studies (GWAS) led to identification of three loci (BCL11A, HBS1L-MYB, and the β-globin cluster itself) in which natural genetic variation is correlated with different HbF levels in populations. Here, the central role of BCL11A in control of HbF is reviewed from the perspective of how findings may be translated to gene therapy in the not-too-distant future. This summary traces the evolution of recent studies from the initial recognition of BCL11A through GWAS to identification of critical sequences in an enhancer required for its erythroid-specific expression, thereby highlighting an Achilles heel for genome editing.

  5. Genome-wide association study identifies novel breast cancer susceptibility loci

    Science.gov (United States)

    Easton, Douglas F.; Pooley, Karen A.; Dunning, Alison M.; Pharoah, Paul D. P.; Thompson, Deborah; Ballinger, Dennis G.; Struewing, Jeffery P.; Morrison, Jonathan; Field, Helen; Luben, Robert; Wareham, Nicholas; Ahmed, Shahana; Healey, Catherine S.; Bowman, Richard; Meyer, Kerstin B.; Haiman, Christopher A.; Kolonel, Laurence K.; Henderson, Brian E.; Marchand, Loic Le; Brennan, Paul; Sangrajrang, Suleeporn; Gaborieau, Valerie; Odefrey, Fabrice; Shen, Chen-Yang; Wu, Pei-Ei; Wang, Hui-Chun; Eccles, Diana; Evans, D. Gareth; Peto, Julian; Fletcher, Olivia; Johnson, Nichola; Seal, Sheila; Stratton, Michael R.; Rahman, Nazneen; Chenevix-Trench, Georgia; Bojesen, Stig E.; Nordestgaard, Børge G.; Axelsson, Christen K.; Garcia-Closas, Montserrat; Brinton, Louise; Chanock, Stephen; Lissowska, Jolanta; Peplonska, Beata; Nevanlinna, Heli; Fagerholm, Rainer; Eerola, Hannaleena; Kang, Daehee; Yoo, Keun-Young; Noh, Dong-Young; Ahn, Sei-Hyun; Hunter, David J.; Hankinson, Susan E.; Cox, David G.; Hall, Per; Wedren, Sara; Liu, Jianjun; Low, Yen-Ling; Bogdanova, Natalia; Schürmann, Peter; Dörk, Thilo; Tollenaar, Rob A. E. M.; Jacobi, Catharina E.; Devilee, Peter; Klijn, Jan G. M.; Sigurdson, Alice J.; Doody, Michele M.; Alexander, Bruce H.; Zhang, Jinghui; Cox, Angela; Brock, Ian W.; MacPherson, Gordon; Reed, Malcolm W. R.; Couch, Fergus J.; Goode, Ellen L.; Olson, Janet E.; Meijers-Heijboer, Hanne; van den Ouweland, Ans; Uitterlinden, André; Rivadeneira, Fernando; Milne, Roger L.; Ribas, Gloria; Gonzalez-Neira, Anna; Benitez, Javier; Hopper, John L.; McCredie, Margaret; Southey, Melissa; Giles, Graham G.; Schroen, Chris; Justenhoven, Christina; Brauch, Hiltrud; Hamann, Ute; Ko, Yon-Dschun; Spurdle, Amanda B.; Beesley, Jonathan; Chen, Xiaoqing; Mannermaa, Arto; Kosma, Veli-Matti; Kataja, Vesa; Hartikainen, Jaana; Day, Nicholas E.; Cox, David R.; Ponder, Bruce A. J.; Luccarini, Craig; Conroy, Don; Shah, Mitul; Munday, Hannah; Jordan, Clare; Perkins, Barbara; West, Judy; Redman, Karen; Driver, Kristy; Aghmesheh, Morteza; Amor, David; Andrews, Lesley; Antill, Yoland; Armes, Jane; Armitage, Shane; Arnold, Leanne; Balleine, Rosemary; Begley, Glenn; Beilby, John; Bennett, Ian; Bennett, Barbara; Berry, Geoffrey; Blackburn, Anneke; Brennan, Meagan; Brown, Melissa; Buckley, Michael; Burke, Jo; Butow, Phyllis; Byron, Keith; Callen, David; Campbell, Ian; Chenevix-Trench, Georgia; Clarke, Christine; Colley, Alison; Cotton, Dick; Cui, Jisheng; Culling, Bronwyn; Cummings, Margaret; Dawson, Sarah-Jane; Dixon, Joanne; Dobrovic, Alexander; Dudding, Tracy; Edkins, Ted; Eisenbruch, Maurice; Farshid, Gelareh; Fawcett, Susan; Field, Michael; Firgaira, Frank; Fleming, Jean; Forbes, John; Friedlander, Michael; Gaff, Clara; Gardner, Mac; Gattas, Mike; George, Peter; Giles, Graham; Gill, Grantley; Goldblatt, Jack; Greening, Sian; Grist, Scott; Haan, Eric; Harris, Marion; Hart, Stewart; Hayward, Nick; Hopper, John; Humphrey, Evelyn; Jenkins, Mark; Jones, Alison; Kefford, Rick; Kirk, Judy; Kollias, James; Kovalenko, Sergey; Lakhani, Sunil; Leary, Jennifer; Lim, Jacqueline; Lindeman, Geoff; Lipton, Lara; Lobb, Liz; Maclurcan, Mariette; Mann, Graham; Marsh, Deborah; McCredie, Margaret; McKay, Michael; McLachlan, Sue Anne; Meiser, Bettina; Milne, Roger; Mitchell, Gillian; Newman, Beth; O'Loughlin, Imelda; Osborne, Richard; Peters, Lester; Phillips, Kelly; Price, Melanie; Reeve, Jeanne; Reeve, Tony; Richards, Robert; Rinehart, Gina; Robinson, Bridget; Rudzki, Barney; Salisbury, Elizabeth; Sambrook, Joe; Saunders, Christobel; Scott, Clare; Scott, Elizabeth; Scott, Rodney; Seshadri, Ram; Shelling, Andrew; Southey, Melissa; Spurdle, Amanda; Suthers, Graeme; Taylor, Donna; Tennant, Christopher; Thorne, Heather; Townshend, Sharron; Tucker, Kathy; Tyler, Janet; Venter, Deon; Visvader, Jane; Walpole, Ian; Ward, Robin; Waring, Paul; Warner, Bev; Warren, Graham; Watson, Elizabeth; Williams, Rachael; Wilson, Judy; Winship, Ingrid; Young, Mary Ann; Bowtell, David; Green, Adele; deFazio, Anna; Chenevix-Trench, Georgia; Gertig, Dorota; Webb, Penny

    2009-01-01

    Breast cancer exhibits familial aggregation, consistent with variation in genetic susceptibility to the disease. Known susceptibility genes account for less than 25% of the familial risk of breast cancer, and the residual genetic variance is likely to be due to variants conferring more moderate risks. To identify further susceptibility alleles, we conducted a two-stage genome-wide association study in 4,398 breast cancer cases and 4,316 controls, followed by a third stage in which 30 single nucleotide polymorphisms (SNPs) were tested for confirmation in 21,860 cases and 22,578 controls from 22 studies. We used 227,876 SNPs that were estimated to correlate with 77% of known common SNPs in Europeans at r2>0.5. SNPs in five novel independent loci exhibited strong and consistent evidence of association with breast cancer (P<10−7). Four of these contain plausible causative genes (FGFR2, TNRC9, MAP3K1 and LSP1). At the second stage, 1,792 SNPs were significant at the P<0.05 level compared with an estimated 1,343 that would be expected by chance, indicating that many additional common susceptibility alleles may be identifiable by this approach. PMID:17529967

  6. Genome-wide evidence for positive selection and recombination in Actinobacillus pleuropneumoniae

    Directory of Open Access Journals (Sweden)

    Zhou Rui

    2011-07-01

    Full Text Available Abstract Background Actinobacillus pleuropneumoniae is an economically important animal pathogen that causes contagious pleuropneumonia in pigs. Currently, the molecular evolutionary trajectories for this pathogenic bacterium remain to require a better elucidation under the help of comparative genomics data. For this reason, we employed a comparative phylogenomic approach to obtain a comprehensive understanding of roles of natural selective pressure and homologous recombination during adaptation of this pathogen to its swine host. Results In this study, 12 A. pleuropneumoniae genomes were used to carry out a phylogenomic analyses. We identified 1,587 orthologous core genes as an initial data set for the estimation of genetic recombination and positive selection. Based on the analyses of four recombination tests, 23% of the core genome of A. pleuropneumoniae showed strong signals for intragenic homologous recombination. Furthermore, the selection analyses indicated that 57 genes were undergoing significant positive selection. Extensive function properties underlying these positively selected genes demonstrated that genes coding for products relevant to bacterial surface structures and pathogenesis are prone to natural selective pressure, presumably due to their potential roles in the avoidance of the porcine immune system. Conclusions Overall, substantial genetic evidence was shown to indicate that recombination and positive selection indeed play a crucial role in the adaptive evolution of A. pleuropneumoniae. The genome-wide profile of positively selected genes and/or amino acid residues will provide valuable targets for further research into the mechanisms of immune evasion and host-pathogen interactions for this serious swine pathogen.

  7. [Importance of modern genome-wide studies for the risk of myocardial infarction].

    Science.gov (United States)

    Kessler, T; Erdmann, J; Schunkert, H

    2014-02-01

    The individual genetic susceptibility is a cornerstone in the pathogenesis of coronary artery disease (CAD). The search for the genetic background and the subsequently altered molecular mechanisms has been ineffective for several years. The increase in genome-wide association studies in recent years has changed the scenario and more than 40 variants have so far been identified to be highly significantly associated with CAD and the risk of myocardial infarction (MI). Whereas most of these findings affect frequent polymorphisms, exome-wide sequencing in families with a high prevalence of CAD revealed mutations with a high penetrance and as a consequence a high risk of suffering from MI. The findings allow a deeper insight into functional mechanisms involved in the pathogenesis of atherosclerosis. Furthermore, the data enables validation of the numerous epidemiologically identified risk markers with respect to the causal role in the development of CAD, making the genetic architecture of CAD much more transparent. Nevertheless, individual risk prediction has only made weak progress in the face of the new findings. Every individual without exception carries numerous risk alleles even when the number and effect strength shows individual differences. Thus, a varying degree of genetic susceptibility is shared by all of us. Current research is therefore focusing on the functional integration of genetic information to discover new approaches to prevention and therapy.

  8. Genome-wide identification and characterization of WRKY gene family in peanut

    Directory of Open Access Journals (Sweden)

    Hui eSong

    2016-04-01

    Full Text Available WRKY, an important transcription factor family, is widely distributed in the plant kingdom. Many reports focused on analysis of phylogenetic relationship and biological function of WRKY protein at the whole genome level in different plant species. However, little is known about WRKY proteins in the genome of Arachis species and their response to salicylic acid (SA and jasmonic acid (JA treatment. In this study, we identified 77 and 75 WRKY proteins from the two wild ancestral diploid genomes of cultivated tetraploid peanut, Arachis duranensis and Arachis ipaënsis, using bioinformatics approaches. Most peanut WRKY coding genes were located on A. duranensis chromosome A6 and A. ipaënsis chromosome B3, while the least number of WRKY genes was found in chromosome 9. The WRKY orthologous gene pairs in A. duranensis and A. ipaënsis chromosomes were highly syntenic. Our analysis indicated that segmental duplication events played a major role in AdWRKY and AiWRKY genes, and strong purifying selection was observed in gene duplication pairs. Furthermore, we translate the knowledge gained from the genome-wide analysis result of wild ancestral peanut to cultivated peanut to reveal that gene activities of specific cultivated peanut WRKY gene were changed due to SA and JA treatment. Peanut WRKY7, 8 and 13 genes were down-regulated, whereas WRKY1 and 12 genes were up-regulated with SA and JA treatment. These results could provide valuable information for peanut improvement.

  9. Genome-wide scans provide evidence for positive selection of genes implicated in Lassa fever.

    Science.gov (United States)

    Andersen, Kristian G; Shylakhter, Ilya; Tabrizi, Shervin; Grossman, Sharon R; Happi, Christian T; Sabeti, Pardis C

    2012-03-19

    Rapidly evolving viruses and other pathogens can have an immense impact on human evolution as natural selection acts to increase the prevalence of genetic variants providing resistance to disease. With the emergence of large datasets of human genetic variation, we can search for signatures of natural selection in the human genome driven by such disease-causing microorganisms. Based on this approach, we have previously hypothesized that Lassa virus (LASV) may have been a driver of natural selection in West African populations where Lassa haemorrhagic fever is endemic. In this study, we provide further evidence for this notion. By applying tests for selection to genome-wide data from the International Haplotype Map Consortium and the 1000 Genomes Consortium, we demonstrate evidence for positive selection in LARGE and interleukin 21 (IL21), two genes implicated in LASV infectivity and immunity. We further localized the signals of selection, using the recently developed composite of multiple signals method, to introns and putative regulatory regions of those genes. Our results suggest that natural selection may have targeted variants giving rise to alternative splicing or differential gene expression of LARGE and IL21. Overall, our study supports the hypothesis that selective pressures imposed by LASV may have led to the emergence of particular alleles conferring resistance to Lassa fever, and opens up new avenues of research pursuit.

  10. Genome-wide transcriptional response of a Saccharomyces cerevisiae strain with an altered redox metabolism.

    Science.gov (United States)

    Bro, Christoffer; Regenberg, Birgitte; Nielsen, Jens

    2004-02-05

    The genome-wide transcriptional response of a Saccharomyces cerevisiae strain deleted in GDH1 that encodes a NADP(+)-dependent glutamate dehydrogenase was compared to a wild-type strain under anaerobic steady-state conditions. The GDH1-deleted strain has a significantly reduced NADPH requirement, and therefore, an altered redox metabolism. Identification of genes with significantly changed expression using a t-test and a Bonferroni correction yielded only 16 transcripts when accepting two false-positives, and 7 of these were Open Reading Frames (ORFs) with unknown function. Among the 16 transcripts the only one with a direct link to redox metabolism was GND1, encoding phosphogluconate dehydrogenase. To extract additional information we analyzed the transcription data for a gene subset consisting of all known genes encoding metabolic enzymes that use NAD(+) or NADP(+). The subset was analyzed for genes with significantly changed expression again with a t-test and correction for multiple testing. This approach was found to enrich the analysis since GND1, ZWF1 and ALD6, encoding the most important enzymes for regeneration of NADPH under anaerobic conditions, were down-regulated together with eight other genes encoding NADP(H)-dependent enzymes. This indicates a possible common redox-dependent regulation of these genes. Furthermore, we showed that it might be necessary to analyze the expression of a subset of genes to extract all available information from global transcription analysis.

  11. Genome-wide gene-environment interactions on quantitative traits using family data.

    Science.gov (United States)

    Sitlani, Colleen M; Dupuis, Josée; Rice, Kenneth M; Sun, Fangui; Pitsillides, Achilleas N; Cupples, L Adrienne; Psaty, Bruce M

    2016-07-01

    Gene-environment interactions may provide a mechanism for targeting interventions to those individuals who would gain the most benefit from them. Searching for interactions agnostically on a genome-wide scale requires large sample sizes, often achieved through collaboration among multiple studies in a consortium. Family studies can contribute to consortia, but to do so they must account for correlation within families by using specialized analytic methods. In this paper, we investigate the performance of methods that account for within-family correlation, in the context of gene-environment interactions with binary exposures and quantitative outcomes. We simulate both cross-sectional and longitudinal measurements, and analyze the simulated data taking family structure into account, via generalized estimating equations (GEE) and linear mixed-effects models. With sufficient exposure prevalence and correct model specification, all methods perform well. However, when models are misspecified, mixed modeling approaches have seriously inflated type I error rates. GEE methods with robust variance estimates are less sensitive to model misspecification; however, when exposures are infrequent, GEE methods require modifications to preserve type I error rate. We illustrate the practical use of these methods by evaluating gene-drug interactions on fasting glucose levels in data from the Framingham Heart Study, a cohort that includes related individuals.

  12. Pseudo-Seq: Genome-Wide Detection of Pseudouridine Modifications in RNA.

    Science.gov (United States)

    Carlile, Thomas M; Rojas-Duran, Maria F; Gilbert, Wendy V

    2015-01-01

    RNA molecules contain a variety of chemically diverse, posttranscriptionally modified bases. The most abundant modified base found in cellular RNAs, pseudouridine (Ψ), has recently been mapped to hundreds of sites in mRNAs, many of which are dynamically regulated. Though the pseudouridine landscape has been determined in only a few cell types and growth conditions, the enzymes responsible for mRNA pseudouridylation are universally conserved, suggesting many novel pseudouridylated sites remain to be discovered. Here, we present Pseudo-seq, a technique that allows the identification of sites of pseudouridylation genome-wide with single-nucleotide resolution. In this chapter, we provide a detailed description of Pseudo-seq. We include protocols for RNA isolation from Saccharomyces cerevisiae, Pseudo-seq library preparation, and data analysis, including descriptions of processing and mapping of sequencing reads, computational identification of sites of pseudouridylation, and assignment of sites to specific pseudouridine synthases. The approach presented here is readily adaptable to any cell or tissue type from which high-quality mRNA can be isolated. Identification of novel pseudouridylation sites is an important first step in elucidating the regulation and functions of these modifications.

  13. A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis

    Directory of Open Access Journals (Sweden)

    Mezey Jason G

    2010-01-01

    Full Text Available Abstract Background The success achieved by genome-wide association (GWA studies in the identification of candidate loci for complex diseases has been accompanied by an inability to explain the bulk of heritability. Here, we describe the algorithm V-Bay, a variational Bayes algorithm for multiple locus GWA analysis, which is designed to identify weaker associations that may contribute to this missing heritability. Results V-Bay provides a novel solution to the computational scaling constraints of most multiple locus methods and can complete a simultaneous analysis of a million genetic markers in a few hours, when using a desktop. Using a range of simulated genetic and GWA experimental scenarios, we demonstrate that V-Bay is highly accurate, and reliably identifies associations that are too weak to be discovered by single-marker testing approaches. V-Bay can also outperform a multiple locus analysis method based on the lasso, which has similar scaling properties for large numbers of genetic markers. For demonstration purposes, we also use V-Bay to confirm associations with gene expression in cell lines derived from the Phase II individuals of HapMap. Conclusions V-Bay is a versatile, fast, and accurate multiple locus GWA analysis tool for the practitioner interested in identifying weaker associations without high false positive rates.

  14. Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes.

    Science.gov (United States)

    Wang, Da-Wei; Li, Da; Wang, Junjun; Zhao, Yue; Wang, Zhaojun; Yue, Guidong; Liu, Xin; Qin, Huanju; Zhang, Kunpu; Dong, Lingli; Wang, Daowen

    2017-03-16

    Gliadins, specified by six compound chromosomal loci (Gli-A1/B1/D1 and Gli-A2/B2/D2) in hexaploid bread wheat, are the dominant carriers of celiac disease (CD) epitopes. Because of their complexity, genome-wide characterization of gliadins is a strong challenge. Here, we approached this challenge by combining transcriptomic, proteomic and bioinformatic investigations. Through third-generation RNA sequencing, full-length transcripts were identified for 52 gliadin genes in the bread wheat cultivar Xiaoyan 81. Of them, 42 were active and predicted to encode 25 α-, 11 γ-, one δ- and five ω-gliadins. Comparative proteomic analysis between Xiaoyan 81 and six newly-developed mutants each lacking one Gli locus indicated the accumulation of 38 gliadins in the mature grains. A novel group of α-gliadins (the CSTT group) was recognized to contain very few or no CD epitopes. The δ-gliadins identified here or previously did not carry CD epitopes. Finally, the mutant lacking Gli-D2 showed significant reductions in the most celiac-toxic α-gliadins and derivative CD epitopes. The insights and resources generated here should aid further studies on gliadin functions in CD and the breeding of healthier wheat.

  15. Recent advances in the genome-wide study of DNA replication origins in yeast

    Directory of Open Access Journals (Sweden)

    Chong ePeng

    2015-02-01

    Full Text Available DNA replication, one of the central events in the cell cycle, is the basis of biological inheritance. In order to be duplicated, a DNA double helix must be opened at defined sites, which are called DNA replication origins (ORIs. Unlike in bacteria, where replication initiates from a single replication origin, multiple origins are utilized in the eukaryotic genome. Among them, the ORIs in budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe have been best characterized. In recent years, advances in DNA microarray and next-generation sequencing technologies have increased the number of yeast species involved in ORIs research dramatically. The ORIs in some nonconventional yeast species such as Kluyveromyces lactis and Pichia pastoris have also been genome-widely identified. Relevant databases of replication origins in yeast were constructed, then the comparative genomic analysis can be carried out. Here, we review several experimental approaches that have been used to map replication origins in yeast and some of the available web resources related to yeast ORIs. We also discuss the sequence characteristics and chromosome structures of ORIs in the four yeast species, which can be utilized to improve the replication origins prediction.

  16. A genome-wide survey of genetic variation in gorillas using reduced representation sequencing.

    Directory of Open Access Journals (Sweden)

    Aylwyn Scally

    Full Text Available All non-human great apes are endangered in the wild, and it is therefore important to gain an understanding of their demography and genetic diversity. Whole genome assembly projects have provided an invaluable foundation for understanding genetics in all four genera, but to date genetic studies of multiple individuals within great ape species have largely been confined to mitochondrial DNA and a small number of other loci. Here, we present a genome-wide survey of genetic variation in gorillas using a reduced representation sequencing approach, focusing on the two lowland subspecies. We identify 3,006,670 polymorphic sites in 14 individuals: 12 western lowland gorillas (Gorilla gorilla gorilla and 2 eastern lowland gorillas (Gorilla beringei graueri. We find that the two species are genetically distinct, based on levels of heterozygosity and patterns of allele sharing. Focusing on the western lowland population, we observe evidence for population substructure, and a deficit of rare genetic variants suggesting a recent episode of population contraction. In western lowland gorillas, there is an elevation of variation towards telomeres and centromeres on the chromosomal scale. On a finer scale, we find substantial variation in genetic diversity, including a marked reduction close to the major histocompatibility locus, perhaps indicative of recent strong selection there. These findings suggest that despite their maintaining an overall level of genetic diversity equal to or greater than that of humans, population decline, perhaps associated with disease, has been a significant factor in recent and long-term pressures on wild gorilla populations.

  17. Genome-wide functional screen identifies a compendium of genes affecting sensitivity to tamoxifen.

    Science.gov (United States)

    Mendes-Pereira, Ana M; Sims, David; Dexter, Tim; Fenwick, Kerry; Assiotis, Ioannis; Kozarewa, Iwanka; Mitsopoulos, Costas; Hakas, Jarle; Zvelebil, Marketa; Lord, Christopher J; Ashworth, Alan

    2012-02-21

    Therapies that target estrogen signaling have made a very considerable contribution to reducing mortality from breast cancer. However, resistance to tamoxifen remains a major clinical problem. Here we have used a genome-wide functional profiling approach to identify multiple genes that confer resistance or sensitivity to tamoxifen. Combining whole-genome shRNA screening with massively parallel sequencing, we have profiled the impact of more than 56,670 RNA interference reagents targeting 16,487 genes on the cellular response to tamoxifen. This screen, along with subsequent validation experiments, identifies a compendium of genes whose silencing causes tamoxifen resistance (including BAP1, CLPP, GPRC5D, NAE1, NF1, NIPBL, NSD1, RAD21, RARG, SMC3, and UBA3) and also a set of genes whose silencing causes sensitivity to this endocrine agent (C10orf72, C15orf55/NUT, EDF1, ING5, KRAS, NOC3L, PPP1R15B, RRAS2, TMPRSS2, and TPM4). Multiple individual genes, including NF1, a regulator of RAS signaling, also correlate with clinical outcome after tamoxifen treatment.

  18. Genome-wide association studies and the genetic dissection of complex traits

    Science.gov (United States)

    Sebastiani, Paola; Timofeev, Nadia; Dworkis, Daniel A.; Perls, Thomas T.; Steinberg, Martin H.

    2010-01-01

    The availability of affordable high throughput technology for parallel genotyping has opened the field of genetics to genome-wide association studies (GWAS), and in the last few years hundreds of articles reporting results of GWAS for a variety of heritable traits have been published. What do these results tell us? Although GWAS have discovered a few hundred reproducible associations, this number is underwhelming in relation to the huge amount of data produced, and challenges the conjecture that common variants may be the genetic causes of common diseases. We argue that the massive amount of genetic data that result from these studies remains largely unexplored and unexploited because of the challenge of mining and modeling enormous data sets, the difficulty of using nontraditional computational techniques and the focus of accepted statistical analyses on controlling the false positive rate rather than limiting the false negative rate. In this article, we will review the common approach to analysis of GWAS data and then discuss options to learn more from these data. We will use examples from our ongoing studies of sickle cell anemia and also GWAS in multigenic traits. PMID:19569043

  19. Pathway analysis for genome-wide association study of lung cancer in Han Chinese population.

    Directory of Open Access Journals (Sweden)

    Ruyang Zhang

    Full Text Available Genome-wide association studies (GWAS have identified a number of genetic variants associated with lung cancer risk. However, these loci explain only a small fraction of lung cancer hereditability and other variants with weak effect may be lost in the GWAS approach due to the stringent significance level after multiple comparison correction. In this study, in order to identify important pathways involving the lung carcinogenesis, we performed a two-stage pathway analysis in GWAS of lung cancer in Han Chinese using gene set enrichment analysis (GSEA method. Predefined pathways by BioCarta and KEGG databases were systematically evaluated on Nanjing study (Discovery stage: 1,473 cases and 1,962 controls and the suggestive pathways were further to be validated in Beijing study (Replication stage: 858 cases and 1,115 controls. We found that four pathways (achPathway, metPathway, At1rPathway and rac1Pathway were consistently significant in both studies and the P values for combined dataset were 0.012, 0.010, 0.022 and 0.005 respectively. These results were stable after sensitivity analysis based on gene definition and gene overlaps between pathways. These findings may provide new insights into the etiology of lung cancer.

  20. P-value based analysis for shared controls design in genome-wide association studies.

    Science.gov (United States)

    Zaykin, Dmitri V; Kozbur, Damian O

    2010-11-01

    An appealing genome-wide association study design compares one large control group against several disease samples. A pioneering study by the Wellcome Trust Case Control Consortium that employed such a design has identified multiple susceptibility regions, many of which have been independently replicated. While reusing a control sample provides effective utilization of data, it also creates correlation between association statistics across diseases. An observation of a large association statistic for one of the diseases may greatly increase chances of observing a spuriously large association for a different disease. Accounting for the correlation is also particularly important when screening for SNPs that might be involved in a set of diseases with overlapping etiology. We describe methods that correct association statistics for dependency due to shared controls, and we describe ways to obtain a measure of overall evidence and to combine association signals across multiple diseases. The methods we describe require no access to individual subject data, instead, they efficiently utilize information contained in P-values for association reported for individual diseases. P-value based combined tests for association are flexible and essentially as powerful as the approach based on aggregating the individual subject data.

  1. Network-Assisted Investigation of Combined Causal Signals from Genome-Wide Association Studies in Schizophrenia

    Science.gov (United States)

    Jia, Peilin; Wang, Lily; Fanous, Ayman H.; Pato, Carlos N.; Edwards, Todd L.; Zhao, Zhongming

    2012-01-01

    With the recent success of genome-wide association studies (GWAS), a wealth of association data has been accomplished for more than 200 complex diseases/traits, proposing a strong demand for data integration and interpretation. A combinatory analysis of multiple GWAS datasets, or an integrative analysis of GWAS data and other high-throughput data, has been particularly promising. In this study, we proposed an integrative analysis framework of multiple GWAS datasets by overlaying association signals onto the protein-protein interaction network, and demonstrated it using schizophrenia datasets. Building on a dense module search algorithm, we first searched for significantly enriched subnetworks for schizophrenia in each single GWAS dataset and then implemented a discovery-evaluation strategy to identify module genes with consistent association signals. We validated the module genes in an independent dataset, and also examined them through meta-analysis of the related SNPs using multiple GWAS datasets. As a result, we identified 205 module genes with a joint effect significantly associated with schizophrenia; these module genes included a number of well-studied candidate genes such as DISC1, GNA12, GNA13, GNAI1, GPR17, and GRIN2B. Further functional analysis suggested these genes are involved in neuronal related processes. Additionally, meta-analysis found that 18 SNPs in 9 module genes had P metadatasets. This approach can be applied to any other complex diseases/traits where multiple GWAS datasets are available. PMID:22792057

  2. Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes

    Science.gov (United States)

    Wang, Da-Wei; Li, Da; Wang, Junjun; Zhao, Yue; Wang, Zhaojun; Yue, Guidong; Liu, Xin; Qin, Huanju; Zhang, Kunpu; Dong, Lingli; Wang, Daowen

    2017-01-01

    Gliadins, specified by six compound chromosomal loci (Gli-A1/B1/D1 and Gli-A2/B2/D2) in hexaploid bread wheat, are the dominant carriers of celiac disease (CD) epitopes. Because of their complexity, genome-wide characterization of gliadins is a strong challenge. Here, we approached this challenge by combining transcriptomic, proteomic and bioinformatic investigations. Through third-generation RNA sequencing, full-length transcripts were identified for 52 gliadin genes in the bread wheat cultivar Xiaoyan 81. Of them, 42 were active and predicted to encode 25 α-, 11 γ-, one δ- and five ω-gliadins. Comparative proteomic analysis between Xiaoyan 81 and six newly-developed mutants each lacking one Gli locus indicated the accumulation of 38 gliadins in the mature grains. A novel group of α-gliadins (the CSTT group) was recognized to contain very few or no CD epitopes. The δ-gliadins identified here or previously did not carry CD epitopes. Finally, the mutant lacking Gli-D2 showed significant reductions in the most celiac-toxic α-gliadins and derivative CD epitopes. The insights and resources generated here should aid further studies on gliadin functions in CD and the breeding of healthier wheat. PMID:28300172

  3. Incorporating group correlations in genome-wide association studies using smoothed group Lasso.

    Science.gov (United States)

    Liu, Jin; Huang, Jian; Ma, Shuangge; Wang, Kai

    2013-04-01

    In genome-wide association studies, penalization is an important approach for identifying genetic markers associated with disease. Motivated by the fact that there exists natural grouping structure in single nucleotide polymorphisms and, more importantly, such groups are correlated, we propose a new penalization method for group variable selection which can properly accommodate the correlation between adjacent groups. This method is based on a combination of the group Lasso penalty and a quadratic penalty on the difference of regression coefficients of adjacent groups. The new method is referred to as smoothed group Lasso (SGL). It encourages group sparsity and smoothes regression coefficients for adjacent groups. Canonical correlations are applied to the weights between groups in the quadratic difference penalty. We first derive a GCD algorithm for computing the solution path with linear regression model. The SGL method is further extended to logistic regression for binary response. With the assistance of the majorize-minimization algorithm, the SGL penalized logistic regression turns out to be an iteratively penalized least-square problem. We also suggest conducting principal component analysis to reduce the dimensionality within groups. Simulation studies are used to evaluate the finite sample performance. Comparison with group Lasso shows that SGL is more effective in selecting true positives. Two datasets are analyzed using the SGL method.

  4. Scalable privacy-preserving data sharing methodology for genome-wide association studies.

    Science.gov (United States)

    Yu, Fei; Fienberg, Stephen E; Slavković, Aleksandra B; Uhler, Caroline

    2014-08-01

    The protection of privacy of individual-level information in genome-wide association study (GWAS) databases has been a major concern of researchers following the publication of "an attack" on GWAS data by Homer et al. (2008). Traditional statistical methods for confidentiality and privacy protection of statistical databases do not scale well to deal with GWAS data, especially in terms of guarantees regarding protection from linkage to external information. The more recent concept of differential privacy, introduced by the cryptographic community, is an approach that provides a rigorous definition of privacy with meaningful privacy guarantees in the presence of arbitrary external information, although the guarantees may come at a serious price in terms of data utility. Building on such notions, Uhler et al. (2013) proposed new methods to release aggregate GWAS data without compromising an individual's privacy. We extend the methods developed in Uhler et al. (2013) for releasing differentially-private χ(2)-statistics by allowing for arbitrary number of cases and controls, and for releasing differentially-private allelic test statistics. We also provide a new interpretation by assuming the controls' data are known, which is a realistic assumption because some GWAS use publicly available data as controls. We assess the performance of the proposed methods through a risk-utility analysis on a real data set consisting of DNA samples collected by the Wellcome Trust Case Control Consortium and compare the methods with the differentially-private release mechanism proposed by Johnson and Shmatikov (2013).