Gregersen, Vivi Raundahl; Bertelsen, Henriette Pasgaard; Poulsen, Nina Aagaard
The cheese renneting process is affected by a number of factors associated to milk composition and a number of Danish Holsteins has previously been identified to have poor milk coagulation ability. Therefore, the aim of this study was to identify genomic regions affecting the technological...
Laffin, Jennifer J S; Raca, Gordana; Jackson, Craig A; Strand, Edythe A; Jakielski, Kathy J; Shriberg, Lawrence D
The goal of this study was to identify new candidate genes and genomic copy-number variations associated with a rare, severe, and persistent speech disorder termed childhood apraxia of speech. Childhood apraxia of speech is the speech disorder segregating with a mutation in FOXP2 in a multigenerational London pedigree widely studied for its role in the development of speech-language in humans. A total of 24 participants who were suspected to have childhood apraxia of speech were assessed using a comprehensive protocol that samples speech in challenging contexts. All participants met clinical-research criteria for childhood apraxia of speech. Array comparative genomic hybridization analyses were completed using a customized 385K Nimblegen array (Roche Nimblegen, Madison, WI) with increased coverage of genes and regions previously associated with childhood apraxia of speech. A total of 16 copy-number variations with potential consequences for speech-language development were detected in 12 or half of the 24 participants. The copy-number variations occurred on 10 chromosomes, 3 of which had two to four candidate regions. Several participants were identified with copy-number variations in two to three regions. In addition, one participant had a heterozygous FOXP2 mutation and a copy-number variation on chromosome 2, and one participant had a 16p11.2 microdeletion and copy-number variations on chromosomes 13 and 14. Findings support the likelihood of heterogeneous genomic pathways associated with childhood apraxia of speech.
Full Text Available Human height is a highly heritable trait considered as an important factor for health. There has been limited success in identifying the genetic factors underlying height variation. We aim to identify sequence variants associated with adult height by a genome-wide association study of copy number variants (CNVs in Chinese.Genome-wide CNV association analyses were conducted in 1,625 unrelated Chinese adults and sex specific subgroup for height variation, respectively. Height was measured with a stadiometer. Affymetrix SNP6.0 genotyping platform was used to identify copy number polymorphisms (CNPs. We constructed a genomic map containing 1,009 CNPs in Chinese individuals and performed a genome-wide association study of CNPs with height.We detected 10 significant association signals for height (p<0.05 in the whole population, 9 and 11 association signals for Chinese female and male population, respectively. A copy number polymorphism (CNP12587, chr18:54081842-54086942, p = 2.41 × 10(-4 was found to be significantly associated with height variation in Chinese females even after strict Bonferroni correction (p = 0.048. Confirmatory real time PCR experiments lent further support for CNV validation. Compared to female subjects with two copies of the CNP, carriers of three copies had an average of 8.1% decrease in height. An important candidate gene, ubiquitin-protein ligase NEDD4-like (NEDD4L, was detected at this region, which plays important roles in bone metabolism by binding to bone formation regulators.Our findings suggest the important genetic variants underlying height variation in Chinese.
Oyebola, Kolapo M; Idowu, Emmanuel T; Olukosi, Yetunde A; Awolola, Taiwo S; Amambua-Ngwa, Alfred
The burden of falciparum malaria is especially high in sub-Saharan Africa. Differences in pressure from host immunity and antimalarial drugs lead to adaptive changes responsible for high level of genetic variations within and between the parasite populations. Population-specific genetic studies to survey for genes under positive or balancing selection resulting from drug pressure or host immunity will allow for refinement of interventions. We performed a pooled sequencing (pool-seq) of the genomes of 100 Plasmodium falciparum isolates from Nigeria. We explored allele-frequency based neutrality test (Tajima's D) and integrated haplotype score (iHS) to identify genes under selection. Fourteen shared iHS regions that had at least 2 SNPs with a score > 2.5 were identified. These regions code for genes that were likely to have been under strong directional selection. Two of these genes were the chloroquine resistance transporter (CRT) on chromosome 7 and the multidrug resistance 1 (MDR1) on chromosome 5. There was a weak signature of selection in the dihydrofolate reductase (DHFR) gene on chromosome 4 and MDR5 genes on chromosome 13, with only 2 and 3 SNPs respectively identified within the iHS window. We observed strong selection pressure attributable to continued chloroquine and sulfadoxine-pyrimethamine use despite their official proscription for the treatment of uncomplicated malaria. There was also a major selective sweep on chromosome 6 which had 32 SNPs within the shared iHS region. Tajima's D of circumsporozoite protein (CSP), erythrocyte-binding antigen (EBA-175), merozoite surface proteins - MSP3 and MSP7, merozoite surface protein duffy binding-like (MSPDBL2) and serine repeat antigen (SERA-5) were 1.38, 1.29, 0.73, 0.84 and 0.21, respectively. We have demonstrated the use of pool-seq to understand genomic patterns of selection and variability in P. falciparum from Nigeria, which bears the highest burden of infections. This investigation identified known
Stephen N White
Full Text Available BACKGROUND: Like human immunodeficiency virus (HIV, ovine lentivirus (OvLV is macrophage-tropic and causes lifelong infection. OvLV infects one quarter of U.S. sheep and induces pneumonia and body condition wasting. There is no vaccine to prevent OvLV infection and no cost-effective treatment for infected animals. However, breed differences in prevalence and proviral concentration have indicated a genetic basis for susceptibility to OvLV. A recent study identified TMEM154 variants in OvLV susceptibility. The objective here was to identify additional loci associated with odds and/or control of OvLV infection. METHODOLOGY/PRINCIPAL FINDINGS: This genome-wide association study (GWAS included 964 sheep from Rambouillet, Polypay, and Columbia breeds with serological status and proviral concentration phenotypes. Analytic models accounted for breed and age, as well as genotype. This approach identified TMEM154 (nominal P=9.2×10(-7; empirical P=0.13, provided 12 additional genomic regions associated with odds of infection, and provided 13 regions associated with control of infection (all nominal P<1 × 10(-5. Rapid decline of linkage disequilibrium with distance suggested many regions included few genes each. Genes in regions associated with odds of infection included DPPA2/DPPA4 (empirical P=0.006, and SYTL3 (P=0.051. Genes in regions associated with control of infection included a zinc finger cluster (ZNF192, ZSCAN16, ZNF389, and ZNF165; P=0.001, C19orf42/TMEM38A (P=0.047, and DLGAP1 (P=0.092. CONCLUSIONS/SIGNIFICANCE: These associations provide targets for mutation discovery in sheep susceptibility to OvLV. Aside from TMEM154, these genes have not been associated previously with lentiviral infection in any species, to our knowledge. Further, data from other species suggest functional hypotheses for future testing of these genes in OvLV and other lentiviral infections. Specifically, SYTL3 binds and may regulate RAB27A, which is required for enveloped
Skibola, Christine F.; Berndt, Sonja I.; Vijai, Joseph; Conde, Lucia; Wang, Zhaoming; Yeager, Meredith; de Bakker, Paul I. W.; Birmann, Brenda M.; Vajdic, Claire M.; Foo, Jia-Nee; Bracci, Paige M.; Vermeulen, Roel C. H.; Slager, Susan L.; de Sanjose, Silvia; Wang, Sophia S.; Linet, Martha S.; Salles, Gilles; Lan, Qing; Severi, Gianluca; Hjalgrim, Henrik; Lightfoot, Tracy; Melbye, Mads; Gu, Jian; Ghesquieres, Herve; Link, Brian K.; Morton, Lindsay M.; Holly, Elizabeth A.; Smith, Alex; Tinker, Lesley F.; Teras, Lauren R.; Kricker, Anne; Becker, Nikolaus; Purdue, Mark P.; Spinelli, John J.; Zhang, Yawei; Giles, Graham G.; Vineis, Paolo; Monnereau, Alain; Bertrand, Kimberly A.; Albanes, Demetrius; Zeleniuch-Jacquotte, Anne; Gabbas, Attilio; Chung, Charles C.; Burdett, Laurie; Hutchinson, Amy; Lawrence, Charles; Montalvan, Rebecca; Liang, Liming; Huang, Jinyan; Ma, Baoshan; Liu, Jianjun; Adami, Hans-Olov; Glimelius, Bengt; Ye, Yuanqing; Nowakowski, Grzegorz S.; Dogan, Ahmet; Thompson, Carrie A.; Habermann, Thomas M.; Novak, Anne J.; Liebow, Mark; Witzig, Thomas E.; Weiner, George J.; Schenk, Maryjean; Hartge, Patricia; De Roos, Anneclaire J.; Cozen, Wendy; Zhi, Degui; Akers, Nicholas K.; Riby, Jacques; Smith, Martyn T.; Lacher, Mortimer; Villano, Danylo J.; Maria, Ann; Roman, Eve; Kane, Eleanor; Jackson, Rebecca D.; North, Kari E.; Diver, W. Ryan; Turner, Jenny; Armstrong, Bruce K.; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Foretova, Lenka; Maynadie, Marc; Staines, Anthony; McKay, James; Brooks-Wilson, Angela R.; Zheng, Tongzhang; Holford, Theodore R.; Chamosa, Saioa; Kaaks, Rudolph; Kelly, Rachel S.; Ohlsson, Bodil; Travis, Ruth C.; Weiderpass, Elisabete; Clave, Jacqueline; Giovannucci, Edward; Kraft, Peter; Virtamo, Jarmo; Mazza, Patrizio; Cocco, Pierluigi; Ennas, Maria Grazia; Chiu, Brian C. H.; Fraumeni, Joseph R.; Nieters, Alexandra; Offit, Kenneth; Wu, Xifeng; Cerhan, James R.; Smedby, Karin E.; Chanock, Stephen J.; Rothman, Nathaniel
Genome-wide association studies (GWASs) of follicular lymphoma (FL) have previously identified human leukocyte antigen (HLA) gene variants. To identify additional FL susceptibility loci, we conducted a large-scale two-stage GWAS in 4,523 case subjects and 13,344 control subjects of European
Wan, Zi Yi; Xia, Jun Hong; Lin, Grace; Wang, Le; Lin, Valerie C. L.; Yue, Gen Hua
Sexual dimorphism is an interesting biological phenomenon. Previous studies showed that DNA methylation might play a role in sexual dimorphism. However, the overall picture of the genome-wide methylation landscape in sexually dimorphic species remains unclear. We analyzed the DNA methylation landscape and transcriptome in hybrid tilapia (Oreochromis spp.) using whole genome bisulfite sequencing (WGBS) and RNA-sequencing (RNA-seq). We found 4,757 sexually dimorphic differentially methylated regions (DMRs), with significant clusters of DMRs located on chromosomal regions associated with sex determination. CpG methylation in promoter regions was negatively correlated with the gene expression level. MAPK/ERK pathway was upregulated in male tilapia. We also inferred active cis-regulatory regions (ACRs) in skeletal muscle tissues from WGBS datasets, revealing sexually dimorphic cis-regulatory regions. These results suggest that DNA methylation contribute to sex-specific phenotypes and serve as resources for further investigation to analyze the functions of these regions and their contributions towards sexual dimorphisms. PMID:27782217
Rahmatalla, Siham A; Arends, Danny; Reissmann, Monika; Said Ahmed, Ammar; Wimmers, Klaus; Reyer, Henry; Brockmann, Gudrun A
Sudan is endowed with a variety of indigenous goat breeds which are used for meat and milk production and which are well adapted to the local environment. The aim of the present study was to determine the genetic diversity and relationship within and between the four main Sudanese breeds of Nubian, Desert, Taggar and Nilotic goats. Using the 50 K SNP chip, 24 animals of each breed were genotyped. More than 96% of high quality SNPs were polymorphic with an average minor allele frequency of 0.3. In all breeds, no significant difference between observed (0.4) and expected (0.4) heterozygosity was found and the inbreeding coefficients (F IS ) did not differ from zero. F st coefficients for the genetic distance between breeds also did not significantly deviate from zero. In addition, the analysis of molecular variance revealed that 93% of the total variance in the examined population can be explained by differences among individuals, while only 7% result from differences between the breeds. These findings provide evidence for high genetic diversity and little inbreeding within breeds on one hand, and low diversity between breeds on the other hand. Further examinations using Nei's genetic distance and STRUCTURE analysis clustered Taggar goats distinct from the other breeds. In a principal component (PC) analysis, PC1 could separate Taggar, Nilotic and a mix of Nubian and Desert goats into three groups. The SNPs that contributed strongly to PC1 showed high F st values in Taggar goat versus the other goat breeds. PCA allowed us to identify target genomic regions which contain genes known to influence growth, development, bone formation and the immune system. The information on the genetic variability and diversity in this study confirmed that Taggar goat is genetically different from the other goat breeds in Sudan. The SNPs identified by the first principal components show high F st values in Taggar goat and allowed to identify candidate genes which can be used in the
to varying degrees of dyspnea (respiratory distress), cachexia (body condition wasting), mastitis , arthritis, and/or encephalitis [5,6]. One of the...General Transcription Factor IIH, polypeptide 5), the gene order does not agree with other mammal genomes including cow , human, dog, and mouse, and it may
Doran, Anthony G; Berry, Donagh P; Creevey, Christopher J
Four traits related to carcass performance have been identified as economically important in beef production: carcass weight, carcass fat, carcass conformation of progeny and cull cow carcass weight. Although Holstein-Friesian cattle are primarily utilized for milk production, they are also an important source of meat for beef production and export. Because of this, there is great interest in understanding the underlying genomic structure influencing these traits. Several genome-wide association studies have identified regions of the bovine genome associated with growth or carcass traits, however, little is known about the mechanisms or underlying biological pathways involved. This study aims to detect regions of the bovine genome associated with carcass performance traits (employing a panel of 54,001 SNPs) using measures of genetic merit (as predicted transmitting abilities) for 5,705 Irish Holstein-Friesian animals. Candidate genes and biological pathways were then identified for each trait under investigation. Following adjustment for false discovery (q-value carcass traits using a single SNP regression approach. Using a Bayesian approach, 46 QTL were associated (posterior probability > 0.5) with at least one of the four traits. In total, 557 unique bovine genes, which mapped to 426 human orthologs, were within 500kbs of QTL found associated with a trait using the Bayesian approach. Using this information, 24 significantly over-represented pathways were identified across all traits. The most significantly over-represented biological pathway was the peroxisome proliferator-activated receptor (PPAR) signaling pathway. A large number of genomic regions putatively associated with bovine carcass traits were detected using two different statistical approaches. Notably, several significant associations were detected in close proximity to genes with a known role in animal growth such as glucagon and leptin. Several biological pathways, including PPAR signaling, were
Sahana, Goutam; Kadlecová, Veronika; Hornshøj, Henrik
Feed conversion ratio (FCR) is an economically important trait in pigs and feed accounts for a significant proportion of the costs involved in pig production. In this study we used a high density SNP chip panel, Porcine SNP60 BeadChip, to identify association between FCR and SNP markers and to st...
Pan-Genome Analysis of Human Gastric Pathogen H. pylori: Comparative Genomics and Pathogenomics Approaches to Identify Regions Associated with Pathogenicity and Prediction of Potential Core Therapeutic Targets
Ali, Amjad; Naz, Anam; Soares, Siomar C.
-genome approach; the predicted conserved gene families (1,193) constitute similar to 77% of the average H. pylori genome and 45% of the global gene repertoire of the species. Reverse vaccinology strategies have been adopted to identify and narrow down the potential core-immunogenic candidates. Total of 28 nonhost....... Pan-genome analyses of the global representative H. pylori isolates consisting of 39 complete genomes are presented in this paper. Phylogenetic analyses have revealed close relationships among geographically diverse strains of H. pylori. The conservation among these genomes was further analyzed by pan...
Full Text Available Preeclampsia (PE is a leading cause of perinatal morbidity and mortality. However, as a common form of PE, the etiology of late-onset PE is elusive. We analyzed 5-methylcytosine (5mC and 5-hydroxymethylcytosine (5hmC levels in the placentas of late-onset severe PE patients (n = 4 and normal controls (n = 4 using a (hydroxymethylated DNA immunoprecipitation approach combined with deep sequencing ([h]MeDIP-seq, and the results were verified by (hMeDIP-qPCR. The most significant differentially methylated regions (DMRs were verified by MassARRAY EppiTYPER in an enlarged sample size (n = 20. Bioinformatics analysis identified 714 peaks of 5mC that were associated with 403 genes and 119 peaks of 5hmC that were associated with 61 genes, thus showing significant differences between the PE patients and the controls (>2-fold, p<0.05. Further, only one gene, PTPRN2, had both 5mC and 5hmC changes in patients. The ErbB signaling pathway was enriched in those 403 genes that had significantly different 5mC level between the groups. This genome-wide mapping of 5mC and 5hmC in late-onset severe PE and normal controls demonstrates that both 5mC and 5hmC play epigenetic roles in the regulation of the disease, but work independently. We reveal the genome-wide mapping of DNA methylation and DNA hydroxymethylation in late-onset PE placentas for the first time, and the identified ErbB signaling pathway and the gene PTPRN2 may be relevant to the epigenetic pathogenesis of late-onset PE.
Taras K Oleksyk
Full Text Available When a selective sweep occurs in the chromosomal region around a target gene in two populations that have recently separated, it produces three dramatic genomic consequences: 1 decreased multi-locus heterozygosity in the region; 2 elevated or diminished genetic divergence (F(ST of multiple polymorphic variants adjacent to the selected locus between the divergent populations, due to the alternative fixation of alleles; and 3 a consequent regional increase in the variance of F(ST (S(2F(ST for the same clustered variants, due to the increased alternative fixation of alleles in the loci surrounding the selection target. In the first part of our study, to search for potential targets of directional selection, we developed and validated a resampling-based computational approach; we then scanned an array of 31 different-sized moving windows of SNP variants (5-65 SNPs across the human genome in a set of European and African American population samples with 183,997 SNP loci after correcting for the recombination rate variation. The analysis revealed 180 regions of recent selection with very strong evidence in either population or both. In the second part of our study, we compared the newly discovered putative regions to those sites previously postulated in the literature, using methods based on inspecting patterns of linkage disequilibrium, population divergence and other methodologies. The newly found regions were cross-validated with those found in nine other studies that have searched for selection signals. Our study was replicated especially well in those regions confirmed by three or more studies. These validated regions were independently verified, using a combination of different methods and different databases in other studies, and should include fewer false positives. The main strength of our analysis method compared to others is that it does not require dense genotyping and therefore can be used with data from population-based genome SNP scans
Strange, Amy; Bellenguez, Céline; Sim, Xueling; Luben, Robert; Hysi, Pirro G.; Ramdas, Wishal D.; van Koolwijk, Leonieke M.E.; Freeman, Colin; Pirinen, Matti; Su, Zhan; Band, Gavin; Pearson, Richard; Vukcevic, Damjan; Langford, Cordelia; Deloukas, Panos
To discover quantitative trait loci for intraocular pressure, a major risk factor for glaucoma and the only modifiable one, we performed a genome-wide association study on a discovery cohort of 2175 individuals from Sydney, Australia. We found a novel association between intraocular pressure and a common variant at 7p21 near to GLCCI1 and ICA1. The findings in this region were confirmed through two UK replication cohorts totalling 4866 individuals (rs59072263, P(combined) = 1.10 × 10(-8)). A ...
Pandey, Manish K; Khan, Aamir W; Singh, Vikas K; Vishwakarma, Manish K; Shasidhar, Yaduru; Kumar, Vinay; Garg, Vanika; Bhat, Ramesh S; Chitikineni, Annapurna; Janila, Pasupuleti; Guo, Baozhu; Varshney, Rajeev K
Rust and late leaf spot (LLS) are the two major foliar fungal diseases in groundnut, and their co-occurrence leads to significant yield loss in addition to the deterioration of fodder quality. To identify candidate genomic regions controlling resistance to rust and LLS, whole-genome resequencing (WGRS)-based approach referred as 'QTL-seq' was deployed. A total of 231.67 Gb raw and 192.10 Gb of clean sequence data were generated through WGRS of resistant parent and the resistant and susceptible bulks for rust and LLS. Sequence analysis of bulks for rust and LLS with reference-guided resistant parent assembly identified 3136 single-nucleotide polymorphisms (SNPs) for rust and 66 SNPs for LLS with the read depth of ≥7 in the identified genomic region on pseudomolecule A03. Detailed analysis identified 30 nonsynonymous SNPs affecting 25 candidate genes for rust resistance, while 14 intronic and three synonymous SNPs affecting nine candidate genes for LLS resistance. Subsequently, allele-specific diagnostic markers were identified for three SNPs for rust resistance and one SNP for LLS resistance. Genotyping of one RIL population (TAG 24 × GPBD 4) with these four diagnostic markers revealed higher phenotypic variation for these two diseases. These results suggest usefulness of QTL-seq approach in precise and rapid identification of candidate genomic regions and development of diagnostic markers for breeding applications. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Full Text Available Chiari-like malformation (CM is a developmental abnormality of the craniocervical junction that is common in the Griffon Bruxellois (GB breed with an estimated prevalence of 65%. This disease is characterized by overcrowding of the neural parenchyma at the craniocervical junction and disturbance of cerebrospinal fluid (CSF flow. The most common clinical sign is pain either as a direct consequence of CM or neuropathic pain as a consequence of secondary syringomyelia. The etiology of CM remains unknown but genetic factors play an important role. To investigate the genetic complexity of the disease, a quantitative trait locus (QTL approach was adopted. A total of 14 quantitative skull and atlas measurements were taken and were tested for association to CM. Six traits were found to be associated to CM and were subjected to a whole-genome association study using the Illumina canine high density bead chip in 74 GB dogs (50 affected and 24 controls. Linear and mixed regression analyses identified associated single nucleotide polymorphisms (SNPs on 5 Canis Familiaris Autosomes (CFAs: CFA2, CFA9, CFA12, CFA14 and CFA24. A reconstructed haplotype of 0.53 Mb on CFA2 strongly associated to the height of the cranial fossa (diameter F and an haplotype of 2.5 Mb on CFA14 associated to both the height of the rostral part of the caudal cranial fossa (AE and the height of the brain (FG were significantly associated to CM after 10 000 permutations strengthening their candidacy for this disease (P = 0.0421, P = 0.0094 respectively. The CFA2 QTL harbours the Sall-1 gene which is an excellent candidate since its orthologue in humans is mutated in Townes-Brocks syndrome which has previously been associated to Chiari malformation I. Our study demonstrates the implication of multiple traits in the etiology of CM and has successfully identified two new QTL associated to CM and a potential candidate gene.
Nakajima, Masahiro; Takahashi, Atsushi; Kou, Ikuyo; Rodriguez-Fontenla, Cristina; Gomez-Reino, Juan J.; Furuichi, Tatsuya; Dai, Jin; Sudo, Akihiro; Uchida, Atsumasa; Fukui, Naoshi; Kubo, Michiaki; Kamatani, Naoyuki; Tsunoda, Tatsuhiko; Malizos, Konstantinos N.; Tsezou, Aspasia; Gonzalez, Antonio; Nakamura, Yusuke; Ikegawa, Shiro
Osteoarthritis (OA) is a common disease that has a definite genetic component. Only a few OA susceptibility genes that have definite functional evidence and replication of association have been reported, however. Through a genome-wide association study and a replication using a total of ∼4,800 Japanese subjects, we identified two single nucleotide polymorphisms (SNPs) (rs7775228 and rs10947262) associated with susceptibility to knee OA. The two SNPs were in a region containing HLA class II/III genes and their association reached genome-wide significance (combined P = 2.43×10−8 for rs7775228 and 6.73×10−8 for rs10947262). Our results suggest that immunologic mechanism is implicated in the etiology of OA. PMID:20305777
Juraeva, Dilafruz; Haenisch, Britta; Zapatka, Marc; Frank, Josef; Witt, Stephanie H.; Mühleisen, Thomas W.; Treutlein, Jens; Strohmaier, Jana; Meier, Sandra; Degenhardt, Franziska; Giegling, Ina; Ripke, Stephan; Leber, Markus; Lange, Christoph; Schulze, Thomas G.; Mössner, Rainald; Nenadic, Igor; Sauer, Heinrich; Rujescu, Dan; Maier, Wolfgang; Børglum, Anders; Ophoff, Roel; Cichon, Sven; Nöthen, Markus M.; Rietschel, Marcella; Mattheisen, Manuel; Brors, Benedikt; Kahn, René S.; Cahn, Wiepke; Linszen, Don H.; de Haan, Lieuwe; van Os, Jim; Krabbendam, Lydia; Myin-Germeys, Inez; Wiersma, Durk; Bruggeman, Richard; Mors, O.; Børglum, A. D.; Mortensen, P. B.; Pedersen, C. B.; Demontis, D.; Grove, J.; Mattheisen, M.; Hougaard, D. M.
In the present study, an integrated hierarchical approach was applied to: (1) identify pathways associated with susceptibility to schizophrenia; (2) detect genes that may be potentially affected in these pathways since they contain an associated polymorphism; and (3) annotate the functional
Roshandel, Delnaz; Gubitosi-Klug, Rose; Bull, Shelley B; Canty, Angelo J; Pezzolesi, Marcus G; King, George L; Keenan, Hillary A; Snell-Bergeon, Janet K; Maahs, David M; Klein, Ronald; Klein, Barbara E K; Orchard, Trevor J; Costacou, Tina; Weedon, Michael N; Oram, Richard A; Paterson, Andrew D
The aim of this study was to identify genetic variants associated with beta cell function in type 1 diabetes, as measured by serum C-peptide levels, through meta-genome-wide association studies (meta-GWAS). We performed a meta-GWAS to combine the results from five studies in type 1 diabetes with cross-sectionally measured stimulated, fasting or random C-peptide levels, including 3479 European participants. The p values across studies were combined, taking into account sample size and direction of effect. We also performed separate meta-GWAS for stimulated (n = 1303), fasting (n = 2019) and random (n = 1497) C-peptide levels. In the meta-GWAS for stimulated/fasting/random C-peptide levels, a SNP on chromosome 1, rs559047 (Chr1:238753916, T>A, minor allele frequency [MAF] 0.24-0.26), was associated with C-peptide (p = 4.13 × 10 -8 ), meeting the genome-wide significance threshold (p C>T, MAF 0.07-0.10, p = 8.43 × 10 -8 ). In the stimulated C-peptide meta-GWAS, rs61211515 (Chr6:30100975, T/-, MAF 0.17-0.19) in the MHC region was associated with stimulated C-peptide (β [SE] = - 0.39 [0.07], p = 9.72 × 10 -8 ). rs61211515 was also associated with the rate of stimulated C-peptide decline over time in a subset of individuals (n = 258) with annual repeated measures for up to 6 years (p = 0.02). In the meta-GWAS of random C-peptide, another MHC region, SNP rs3135002 (Chr6:32668439, C>A, MAF 0.02-0.06), was associated with C-peptide (p = 3.49 × 10 -8 ). Conditional analyses suggested that the three identified variants in the MHC region were independent of each other. rs9260151 and rs3135002 have been associated with type 1 diabetes, whereas rs559047 and rs61211515 have not been associated with a risk of developing type 1 diabetes. We identified a locus on chromosome 1 and multiple variants in the MHC region, at least some of which were distinct from type 1 diabetes risk loci, that were associated with C
Full Text Available Abstract Background Cancer-related genes show racial differences. Therefore, identification and characterization of DNA copy number alteration regions in different racial groups helps to dissect the mechanism of tumorigenesis. Methods Array-comparative genomic hybridization (array-CGH was analyzed for DNA copy number profile in 40 Asian and 20 Caucasian lung cancer patients. Three methods including MetaCore analysis for disease and pathway correlations, concordance analysis between array-CGH database and the expression array database, and literature search for copy number variation genes were performed to select novel lung cancer candidate genes. Four candidate oncogenes were validated for DNA copy number and mRNA and protein expression by quantitative polymerase chain reaction (qPCR, chromogenic in situ hybridization (CISH, reverse transcriptase-qPCR (RT-qPCR, and immunohistochemistry (IHC in more patients. Results We identified 20 chromosomal imbalance regions harboring 459 genes for Caucasian and 17 regions containing 476 genes for Asian lung cancer patients. Seven common chromosomal imbalance regions harboring 117 genes, included gain on 3p13-14, 6p22.1, 9q21.13, 13q14.1, and 17p13.3; and loss on 3p22.2-22.3 and 13q13.3 were found both in Asian and Caucasian patients. Gene validation for four genes including ARHGAP19 (10q24.1 functioning in Rho activity control, FRAT2 (10q24.1 involved in Wnt signaling, PAFAH1B1 (17p13.3 functioning in motility control, and ZNF322A (6p22.1 involved in MAPK signaling was performed using qPCR and RT-qPCR. Mean gene dosage and mRNA expression level of the four candidate genes in tumor tissues were significantly higher than the corresponding normal tissues (PP=0.06. In addition, CISH analysis of patients indicated that copy number amplification indeed occurred for ARHGAP19 and ZNF322A genes in lung cancer patients. IHC analysis of paraffin blocks from Asian Caucasian patients demonstrated that the frequency of
Lo, Fang-Yi; Nandi, Suvobroto; Salgia, Ravi; Wang, Yi-Ching; Chang, Jer-Wei; Chang, I-Shou; Chen, Yann-Jang; Hsu, Han-Shui; Huang, Shiu-Feng Kathy; Tsai, Fang-Yu; Jiang, Shih Sheng; Kanteti, Rajani
Cancer-related genes show racial differences. Therefore, identification and characterization of DNA copy number alteration regions in different racial groups helps to dissect the mechanism of tumorigenesis. Array-comparative genomic hybridization (array-CGH) was analyzed for DNA copy number profile in 40 Asian and 20 Caucasian lung cancer patients. Three methods including MetaCore analysis for disease and pathway correlations, concordance analysis between array-CGH database and the expression array database, and literature search for copy number variation genes were performed to select novel lung cancer candidate genes. Four candidate oncogenes were validated for DNA copy number and mRNA and protein expression by quantitative polymerase chain reaction (qPCR), chromogenic in situ hybridization (CISH), reverse transcriptase-qPCR (RT-qPCR), and immunohistochemistry (IHC) in more patients. We identified 20 chromosomal imbalance regions harboring 459 genes for Caucasian and 17 regions containing 476 genes for Asian lung cancer patients. Seven common chromosomal imbalance regions harboring 117 genes, included gain on 3p13-14, 6p22.1, 9q21.13, 13q14.1, and 17p13.3; and loss on 3p22.2-22.3 and 13q13.3 were found both in Asian and Caucasian patients. Gene validation for four genes including ARHGAP19 (10q24.1) functioning in Rho activity control, FRAT2 (10q24.1) involved in Wnt signaling, PAFAH1B1 (17p13.3) functioning in motility control, and ZNF322A (6p22.1) involved in MAPK signaling was performed using qPCR and RT-qPCR. Mean gene dosage and mRNA expression level of the four candidate genes in tumor tissues were significantly higher than the corresponding normal tissues (P<0.001~P=0.06). In addition, CISH analysis of patients indicated that copy number amplification indeed occurred for ARHGAP19 and ZNF322A genes in lung cancer patients. IHC analysis of paraffin blocks from Asian Caucasian patients demonstrated that the frequency of PAFAH1B1 protein overexpression was 68
Full Text Available The genus Helicobacter is a group of Gram-negative, helical-shaped pathogens consisting of at least 36 bacterial species. Helicobacter pylori (H. pylori, infecting more than 50% of the human population, is considered as the major cause of gastritis, peptic ulcer, and gastric cancer. However, the genetic underpinnings of H. pylori that are responsible for its large scale epidemic and gastrointestinal environment adaption within human beings remain unclear. Core-pan genome analysis was performed among 75 representative H. pylori and 24 non-pylori Helicobacter genomes. There were 1173 conserved protein families of H. pylori and 673 of all 99 Helicobacter genus strains. We found 79 genome unique regions, a total of 202,359bp, shared by at least 80% of the H. pylori but lacked in non-pylori Helicobacter species. The operons, genes, and sRNAs within the H. pylori unique regions were considered as potential ones associated with its pathogenicity and adaptability, and the relativity among them has been partially confirmed by functional annotation analysis. However, functions of at least 54 genes and 10 sRNAs were still unclear. Our analysis of protein-protein interaction showed that 30 genes within them may have the cooperation relationship.
Solomon, N.M.; Ross, S.; Morgan, T.; Belsky, J.L.; Hol, F.A.; Karnes, P.; Hopwood, N.J.; Myers, S.E.; Tan, A.; Warne, G.L.; Forrest, S.M.; Thomas, P.Q.
INTRODUCTION: Array comparative genomic hybridisation (array CGH) is a powerful method that detects alteration of gene copy number with greater resolution and efficiency than traditional methods. However, its ability to detect disease causing duplications in constitutional genomic DNA has not been
Marilyn C Cornelis
Full Text Available We report the first genome-wide association study of habitual caffeine intake. We included 47,341 individuals of European descent based on five population-based studies within the United States. In a meta-analysis adjusted for age, sex, smoking, and eigenvectors of population variation, two loci achieved genome-wide significance: 7p21 (P = 2.4 × 10(-19, near AHR, and 15q24 (P = 5.2 × 10(-14, between CYP1A1 and CYP1A2. Both the AHR and CYP1A2 genes are biologically plausible candidates as CYP1A2 metabolizes caffeine and AHR regulates CYP1A2.
Rasmussen, Simon; Nielsen, Henrik Bjørn; Jarmer, Hanne Østergaard
The majority of all genes have so far been identified and annotated systematically through in silico gene finding. Here we report the finding of 3662 strand-specific transcriptionally active regions (TARs) in the genome of Bacillus subtilis by the use of tiling arrays. We have measured the genome...
Murat, Claude; Zampieri, Elisa; Vallino, Marta; Daghino, Stefania; Perotto, Silvia; Bonfante, Paola
Characterization of genomic variation among different microbial species, or different strains of the same species, is a field of significant interest with a wide range of potential applications. We have investigated the genomic variation in mycorrhizal fungal genomes through genomic suppressive subtractive hybridization. The comparison was between phylogenetically distant and close truffle species (Tuber spp.), and between isolates of the ericoid mycorrhizal fungus Oidiodendron maius featuring different degrees of metal tolerance. In the interspecies experiment, almost all the sequences that were identified in the Tuber melanosporum genome and absent in Tuber borchii and Tuber indicum corresponded to transposable elements. In the intraspecies comparison, some specific sequences corresponded to regions coding for enzymes, among them a glutathione synthetase known to be involved in metal tolerance. This approach is a quick and rather inexpensive tool to develop molecular markers for mycorrhizal fungi tracking and barcoding, to identify functional genes and to investigate the genome plasticity, adaptation and evolution. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Investigators with The Cancer Genome Atlas (TCGA) Research Network have identified novel genomic and molecular characteristics of cervical cancer that will aid in subclassification of the disease and may help target therapies that are most appropriate for each patient.
Full Text Available Similar to other malignancies, urothelial carcinoma (UC is characterized by specific recurrent chromosomal aberrations and gene mutations. However, the interconnection between specific genomic alterations, and how patterns of chromosomal alterations adhere to different molecular subgroups of UC, is less clear. We applied tiling resolution array CGH to 146 cases of UC and identified a number of regions harboring recurrent focal genomic amplifications and deletions. Several potential oncogenes were included in the amplified regions, including known oncogenes like E2F3, CCND1, and CCNE1, as well as new candidate genes, such as SETDB1 (1q21, and BCL2L1 (20q11. We next combined genome profiling with global gene expression, gene mutation, and protein expression data and identified two major genomic circuits operating in urothelial carcinoma. The first circuit was characterized by FGFR3 alterations, overexpression of CCND1, and 9q and CDKN2A deletions. The second circuit was defined by E3F3 amplifications and RB1 deletions, as well as gains of 5p, deletions at PTEN and 2q36, 16q, 20q, and elevated CDKN2A levels. TP53/MDM2 alterations were common for advanced tumors within the two circuits. Our data also suggest a possible RAS/RAF circuit. The tumors with worst prognosis showed a gene expression profile that indicated a keratinized phenotype. Taken together, our integrative approach revealed at least two separate networks of genomic alterations linked to the molecular diversity seen in UC, and that these circuits may reflect distinct pathways of tumor development.
Full Text Available Identifying the signals of artificial selection can contribute to further shaping economically important traits. Here, a chicken 600k SNP-array was employed to detect the signals of artificial selection using 331 individuals from 9 breeds, including Jingfen (JF, Jinghong (JH, Araucanas (AR, White Leghorn (WL, Pekin-Bantam (PB, Shamo (SH, Gallus-Gallus-Spadiceus (GA, Rheinlander (RH and Vorwerkhuhn (VO. Per the population genetic structure, 9 breeds were combined into 5 breed-pools, and a 'two-step' strategy was used to reveal the signals of artificial selection. GA, which has little artificial selection, was defined as the reference population, and a total of 204, 155, 305 and 323 potential artificial selection signals were identified in AR_VO, PB, RH_WL and JH_JF, respectively. We also found signals derived from standing and de-novo genetic variations have contributed to adaptive evolution during artificial selection. Further enrichment analysis suggests that the genomic regions of artificial selection signals harbour genes, including THSR, PTHLH and PMCH, responsible for economic traits, such as fertility, growth and immunization. Overall, this study found a series of genes that contribute to the improvement of chicken breeds and revealed the genetic mechanisms of adaptive evolution, which can be used as fundamental information in future chicken functional genomics study.
Gonzalez-Perez, Abel; Mustonen, Ville; Reva, Boris
The International Cancer Genome Consortium (ICGC) aims to catalog genomic abnormalities in tumors from 50 different cancer types. Genome sequencing reveals hundreds to thousands of somatic mutations in each tumor but only a minority of these drive tumor progression. We present the result of discu......The International Cancer Genome Consortium (ICGC) aims to catalog genomic abnormalities in tumors from 50 different cancer types. Genome sequencing reveals hundreds to thousands of somatic mutations in each tumor but only a minority of these drive tumor progression. We present the result...... of discussions within the ICGC on how to address the challenge of identifying mutations that contribute to oncogenesis, tumor maintenance or response to therapy, and recommend computational techniques to annotate somatic variants and predict their impact on cancer phenotype....
Brankovics, Balázs; Zhang, Hao; van Diepeningen, Anne D; van der Lee, Theo A J; Waalwijk, Cees; de Hoog, G Sybren
GRAbB (Genomic Region Assembly by Baiting) is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often
Lopera, Juan G.; Falendysz, Elizabeth A.; Rocke, Tonie E.; Osorio, Jorge E.
Monkeypox virus (MPXV) is an emerging pathogen from Africa that causes disease similar to smallpox. Two clades with different geographic distributions and virulence have been described. Here, we utilized bioinformatic tools to identify genomic regions in MPXV containing multiple virulence genes and explored their roles in pathogenicity; two selected regions were then deleted singularly or in combination. In vitro and in vivostudies indicated that these regions play a significant role in MPXV replication, tissue spread, and mortality in mice. Interestingly, while deletion of either region led to decreased virulence in mice, one region had no effect on in vitro replication. Deletion of both regions simultaneously also reduced cell culture replication and significantly increased the attenuation in vivo over either single deletion. Attenuated MPXV with genomic deletions present a safe and efficacious tool in the study of MPX pathogenesis and in the identification of genetic factors associated with virulence.
Full Text Available Major advances in wheat production are needed to address global food insecurity under future climate conditions, such as high temperatures. The grain yield of bread wheat (Triticum aestivum L. is a quantitatively inherited complex trait that is strongly influenced by interacting genetic and environmental factors. Here, we conducted global QTL analysis for five yield-related traits, including spike yield, yield components and plant height (PH, in the Nongda3338/Jingdong6 doubled haploid (DH population using a high-density SNP and SSR-based genetic map. A total of 12 major genomic regions with stable QTL controlling yield-related traits were detected on chromosomes 1B, 2A, 2B, 2D, 3A, 4A, 4B, 4D, 5A, 6A, and 7A across 12 different field trials with timely sown (normal and late sown (heat stress conditions. Co-location of yield components revealed significant tradeoffs between thousand grain weight (TGW and grain number per spike (GNS on chromosome 4A. Dissection of a “QTL-hotspot” region for grain weight on chromosome 4B was helpful in marker-assisted selection (MAS breeding. Moreover, this study identified a novel QTL for heat susceptibility index of thousand grain weight (HSITGW on chromosome 4BL that explains approximately 10% of phenotypic variation. QPh.cau-4B.2, QPh.cau-4D.1 and QPh.cau-2D.3 were coincident with the dwarfing genes Rht1, Rht2, and Rht8, and haplotype analysis revealed their pleiotropic architecture with yield components. Overall, our findings will be useful for elucidating the genetic architecture of yield-related traits and developing new wheat varieties with high and stable yield.
Fletez-Brant, Christopher; Lee, Dongwon; McCallion, Andrew S.; Beer, Michael A.
Massively parallel sequencing technologies have made the generation of genomic data sets a routine component of many biological investigations. For example, Chromatin immunoprecipitation followed by sequence assays detect genomic regions bound (directly or indirectly) by specific factors, and DNase-seq identifies regions of open chromatin. A major bottleneck in the interpretation of these data is the identification of the underlying DNA sequence code that defines, and ultimately facilitates prediction of, these transcription factor (TF) bound or open chromatin regions. We have recently developed a novel computational methodology, which uses a support vector machine (SVM) with kmer sequence features (kmer-SVM) to identify predictive combinations of short transcription factor-binding sites, which determine the tissue specificity of these genomic assays (Lee, Karchin and Beer, Discriminative prediction of mammalian enhancers from DNA sequence. Genome Res. 2011; 21:2167–80). This regulatory information can (i) give confidence in genomic experiments by recovering previously known binding sites, and (ii) reveal novel sequence features for subsequent experimental testing of cooperative mechanisms. Here, we describe the development and implementation of a web server to allow the broader research community to independently apply our kmer-SVM to analyze and interpret their genomic datasets. We analyze five recently published data sets and demonstrate how this tool identifies accessory factors and repressive sequence elements. kmer-SVM is available at http://kmersvm.beerlab.org. PMID:23771147
We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10(-11)) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10(-9)), ANK3 (rs10994359, P = 2.5 × 10(-8)) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10(-9)).
Eliot C Bush
Full Text Available An important challenge for human evolutionary biology is to understand the genetic basis of human-chimpanzee differences. One influential idea holds that such differences depend, to a large extent, on adaptive changes in gene expression. An important step in assessing this hypothesis involves gaining a better understanding of selective constraint on noncoding regions of hominid genomes. In noncoding sequence, functional elements are frequently small and can be separated by large nonfunctional regions. For this reason, constraint in hominid genomes is likely to be patchy. Here we use conservation in more distantly related mammals and amniotes as a way of identifying small sequence windows that are likely to be functional. We find that putatively functional noncoding elements defined in this manner are subject to significant selective constraint in hominids.
Full Text Available An important challenge for human evolutionary biology is to understand the genetic basis of human-chimpanzee differences. One influential idea holds that such differences depend, to a large extent, on adaptive changes in gene expression. An important step in assessing this hypothesis involves gaining a better understanding of selective constraint on noncoding regions of hominid genomes. In noncoding sequence, functional elements are frequently small and can be separated by large nonfunctional regions. For this reason, constraint in hominid genomes is likely to be patchy. Here we use conservation in more distantly related mammals and amniotes as a way of identifying small sequence windows that are likely to be functional. We find that putatively functional noncoding elements defined in this manner are subject to significant selective constraint in hominids.
Full Text Available Abstract Background With the recent advances and availability of various high-throughput sequencing technologies, data on many molecular aspects, such as gene regulation, chromatin dynamics, and the three-dimensional organization of DNA, are rapidly being generated in an increasing number of laboratories. The variation in biological context, and the increasingly dispersed mode of data generation, imply a need for precise, interoperable and flexible representations of genomic features through formats that are easy to parse. A host of alternative formats are currently available and in use, complicating analysis and tool development. The issue of whether and how the multitude of formats reflects varying underlying characteristics of data has to our knowledge not previously been systematically treated. Results We here identify intrinsic distinctions between genomic features, and argue that the distinctions imply that a certain variation in the representation of features as genomic tracks is warranted. Four core informational properties of tracks are discussed: gaps, lengths, values and interconnections. From this we delineate fifteen generic track types. Based on the track type distinctions, we characterize major existing representational formats and find that the track types are not adequately supported by any single format. We also find, in contrast to the XML formats, that none of the existing tabular formats are conveniently extendable to support all track types. We thus propose two unified formats for track data, an improved XML format, BioXSD 1.1, and a new tabular format, GTrack 1.0. Conclusions The defined track types are shown to capture relevant distinctions between genomic annotation tracks, resulting in varying representational needs and analysis possibilities. The proposed formats, GTrack 1.0 and BioXSD 1.1, cater to the identified track distinctions and emphasize preciseness, flexibility and parsing convenience.
Li, Yifeng; Shi, Wenqiang; Wasserman, Wyeth W
In the human genome, 98% of DNA sequences are non-protein-coding regions that were previously disregarded as junk DNA. In fact, non-coding regions host a variety of cis-regulatory regions which precisely control the expression of genes. Thus, Identifying active cis-regulatory regions in the human genome is critical for understanding gene regulation and assessing the impact of genetic variation on phenotype. The developments of high-throughput sequencing and machine learning technologies make it possible to predict cis-regulatory regions genome wide. Based on rich data resources such as the Encyclopedia of DNA Elements (ENCODE) and the Functional Annotation of the Mammalian Genome (FANTOM) projects, we introduce DECRES based on supervised deep learning approaches for the identification of enhancer and promoter regions in the human genome. Due to their ability to discover patterns in large and complex data, the introduction of deep learning methods enables a significant advance in our knowledge of the genomic locations of cis-regulatory regions. Using models for well-characterized cell lines, we identify key experimental features that contribute to the predictive performance. Applying DECRES, we delineate locations of 300,000 candidate enhancers genome wide (6.8% of the genome, of which 40,000 are supported by bidirectional transcription data), and 26,000 candidate promoters (0.6% of the genome). The predicted annotations of cis-regulatory regions will provide broad utility for genome interpretation from functional genomics to clinical applications. The DECRES model demonstrates potentials of deep learning technologies when combined with high-throughput sequencing data, and inspires the development of other advanced neural network models for further improvement of genome annotations.
Keel, B N; Nonneman, D J; Rohrer, G A
Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a more significant effect on phenotypic variation than do other types of genetic variants. Hence, a comprehensive list of these functional variants would be of considerable interest in swine genomic studies, particularly those targeting fertility and production traits. Whole-genome sequence was obtained from 72 of the founders of an intensely phenotyped experimental swine herd at the U.S. Meat Animal Research Center (USMARC). These animals included all 24 of the founding boars (12 Duroc and 12 Landrace) and 48 Yorkshire-Landrace composite sows. Sequence reads were mapped to the Sscrofa10.2 genome build, resulting in a mean of 6.1 fold (×) coverage per genome. A total of 22 342 915 high confidence SNPs were identified from the sequenced genomes. These included 21 million previously reported SNPs and 79% of the 62 163 SNPs on the PorcineSNP60 BeadChip assay. Variation was detected in the coding sequence or untranslated regions (UTRs) of 87.8% of the genes in the porcine genome: loss-of-function variants were predicted in 504 genes, 10 202 genes contained nonsynonymous variants, 10 773 had variation in UTRs and 13 010 genes contained synonymous variants. Approximately 139 000 SNPs were classified as loss-of-function, nonsynonymous or regulatory, which suggests that over 99% of the variation detected in our pigs could potentially be ignored, allowing us to focus on a much smaller number of functional SNPs during future analyses. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.
The objective of this research was to identify genomic regions associated with clinical mastitis (MAST) in US Holsteins using producer-reported data. Genome-wide association studies (GWAS) were performed on deregressed PTA using GEMMA v. 0.94. Genotypes included 60,671 SNP for all predictor bulls (n...
In the first study of its kind, an international team of genomics researchers has identified new regions of the human genome that are associated with skin color variation in some African populations, opening new avenues for research on skin diseases and cancer in all populations.
Tjong Kim Sang, E.; Wieling, Martijn; Kroon, Martin; van Noord, Gertjan; Bouma, Gosse
The Syntactic Atlas of Dutch Dialects (SAND) is a database of syntactic features observed in the language spoken by people from different dialect regions in The Netherlands and Flanders. We would like to know how specific syntactic features are for the different dialects. For this purpose we try to
Maeda, Toyoki; Chijiiwa, Yoshiharu; Tsuji, Hideo; Sakoda, Saburo; Tani, Kenzaburo; Suzuki, Tomokazu
In this study, a mouse genomic region is identified that undergoes DNA rearrangement and yields circular DNA in brain during embryogenesis. External region-directed inverse polymerase chain reaction on circular DNA extracted from late embryonic brain tissue repeatedly detected DNA of this region containing recombination joints. Wide-range genomic PCR and digestion-circularization PCR analysis showed this region underwent recombination accompanied with deletion of intervening sequences, including the circularized regions. This region was mapped by fluorescence in situ hybridization to C1 on mouse chromosome 16, where no gene and no physiological DNA rearrangement had been identified. DNA sequence in the region has segmental homology to an orthologous region on human chromosome 3q.13. These observations demonstrated somatic DNA recombination yielding genomic deletions in brain during embryogenesis
Mendoza-Porras, Omar; Botwright, Natasha A; McWilliam, Sean M; Cook, Mathew T; Harris, James O; Wijffels, Gene; Colgrave, Michelle L
Aside from their critical role in reproduction, abalone gonads serve as an indicator of sexual maturity and energy balance, two key considerations for effective abalone culture. Temperate abalone farmers face issues with tank restocking with highly marketable abalone owing to inefficient spawning induction methods. The identification of key proteins in sexually mature abalone will serve as the foundation for a greater understanding of reproductive biology. Addressing this knowledge gap is the first step towards improving abalone aquaculture methods. Proteomic profiling of female and male gonads of greenlip abalone, Haliotis laevigata, was undertaken using liquid chromatography-mass spectrometry. Owing to the incomplete nature of abalone protein databases, in addition to searching against two publicly available databases, a custom database comprising genomic data was used. Overall, 162 and 110 proteins were identified in females and males respectively with 40 proteins common to both sexes. For proteins involved in sexual maturation, sperm and egg structure, motility, acrosomal reaction and fertilization, 23 were identified only in females, 18 only in males and 6 were common. Gene ontology analysis revealed clear differences between the female and male protein profiles reflecting a higher rate of protein synthesis in the ovary and higher metabolic activity in the testis. A comprehensive mass spectrometry-based analysis was performed to profile the abalone gonad proteome providing the foundation for future studies of reproduction in abalone. Key proteins involved in both reproduction and energy balance were identified. Genomic resources were utilised to build a database of molluscan proteins yielding >60% more protein identifications than in a standard workflow employing public protein databases. Copyright © 2014 Elsevier B.V. All rights reserved.
Laakso, H.; Santolík, Ondřej; Horne, R.; Kolmašová, Ivana; Escoubet, P.; Masson, A.; Taylor, P.
Roč. 42, č. 9 (2015), s. 3141-3149 ISSN 0094-8276 R&D Projects: GA MŠk LH12231 Institutional support: RVO:68378289 Keywords : plasmaspheric hiss * plasmaspheric drainage plumes * plasmasphere * equatorial region of plumes Subject RIV: BL - Plasma and Gas Discharge Physics Impact factor: 4.212, year: 2015 http://onlinelibrary.wiley.com/doi/10.1002/2015GL063755/full
Alexander, Roger P; Fang, Gang; Rozowsky, Joel; Snyder, Michael; Gerstein, Mark B
Most of the human genome consists of non-protein-coding DNA. Recently, progress has been made in annotating these non-coding regions through the interpretation of functional genomics experiments and comparative sequence analysis. One can conceptualize functional genomics analysis as involving a sequence of steps: turning the output of an experiment into a 'signal' at each base pair of the genome; smoothing this signal and segmenting it into small blocks of initial annotation; and then clustering these small blocks into larger derived annotations and networks. Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis.
Full Text Available Many applications of high throughput sequencing rely on the availability of an accurate reference genome. Variant calling often produces large data sets that cannot be realistically validated and which may contain large numbers of false-positives. Errors in the reference assembly increase the number of false-positives. While resources are available to aid in the filtering of variants from human data, for other species these do not yet exist and strict filtering techniques must be employed which are more likely to exclude true-positives. This work assesses the accuracy of the pig reference genome (Sscrofa10.2 using whole genome sequencing reads from the Duroc sow whose genome the assembly was based on. Indicators of structural variation including high regional coverage, unexpected insert sizes, improper pairing and homozygous variants were used to identify low quality (LQ regions of the assembly. Low coverage (LC regions were also identified and analyzed separately. The LQ regions covered 13.85% of the genome, the LC regions covered 26.6% of the genome and combined (LQLC they covered 33.07% of the genome. Over half of dbSNP variants were located in the LQLC regions. Of CNVRs identified in a previous study, 86.3% were located in the LQLC regions. The regions were also enriched for gene predictions from RNA-seq data with 42.98% falling in the LQLC regions. Excluding variants in the LQ, LC or LQLC from future analyses will help reduce the number of false-positive variant calls. Researchers using WGS data should be aware that the current pig reference genome does not give an accurate representation of the copy number of alleles in the original Duroc sow’s genome.
Full Text Available Genetic and genomic studies highlight the substantial complexity and heterogeneity of human cancers and emphasize the general lack of therapeutics that can match this complexity. With the goal of expanding opportunities for drug discovery, we describe an approach that makes use of a phenotype-based screen combined with the use of multiple cancer cell lines. In particular, we have used the NCI-60 cancer cell line panel that includes drug sensitivity measures for over 40,000 compounds assayed on 59 independent cells lines. Targets are cancer-relevant phenotypes represented as gene expression signatures that are used to identify cells within the NCI-60 panel reflecting the signature phenotype and then connect to compounds that are selectively active against those cells. As a proof-of-concept, we show that this strategy effectively identifies compounds with selectivity to the RAS or PI3K pathways. We have then extended this strategy to identify compounds that have activity towards cells exhibiting the basal phenotype of breast cancer, a clinically-important breast cancer characterized as ER-, PR-, and Her2- that lacks viable therapeutic options. One of these compounds, Simvastatin, has previously been shown to inhibit breast cancer cell growth in vitro and importantly, has been associated with a reduction in ER-, PR- breast cancer in a clinical study. We suggest that this approach provides a novel strategy towards identification of therapeutic agents based on clinically relevant phenotypes that can augment the conventional strategies of target-based screens.
Jacobs, D. J.; Thorpe, M. F.; Kuhn, L. A.
In proteins it is possible to separate hard covalent forces involving bond lengths and bond angles from other weak forces. We model the microstructure of the protein as a generic bar-joint truss framework, where the hard covalent forces and strong hydrogen bonds are regarded as rigid bar constraints. We study the mechanical stability of proteins using FIRST (Floppy Inclusions and Rigid Substructure Topography) based on a recently developed combinatorial constraint counting algorithm (the 3D Pebble Game), which is a generalization of the 2D pebble game (D. J. Jacobs and M. F. Thorpe, ``Generic Rigidity: The Pebble Game'', Phys. Rev. Lett.) 75, 4051-4054 (1995) for the special class of bond-bending networks (D. J. Jacobs, "Generic Rigidity in Three Dimensional Bond-bending Networks", Preprint Aug (1997)). This approach is useful in identifying rigid motifs and flexible linkages in proteins, and thereby determines the essential degrees of freedom. We will show some preliminary results from the FIRST analysis on the myohemerythrin and lyozyme proteins.
Takezawa, Yusuke; Kikuchi, Atsuo; Haginoya, Kazuhiro; Niihori, Tetsuya; Numata-Uematsu, Yurika; Inui, Takehiko; Yamamura-Suzuki, Saeko; Miyabayashi, Takuya; Anzai, Mai; Suzuki-Muromoto, Sato; Okubo, Yukimune; Endo, Wakaba; Togashi, Noriko; Kobayashi, Yasuko; Onuma, Akira; Funayama, Ryo; Shirota, Matsuyuki; Nakayama, Keiko; Aoki, Yoko; Kure, Shigeo
Cerebral palsy is a common, heterogeneous neurodevelopmental disorder that causes movement and postural disabilities. Recent studies have suggested genetic diseases can be misdiagnosed as cerebral palsy. We hypothesized that two simple criteria, that is, full-term births and nonspecific brain MRI findings, are keys to extracting masqueraders among cerebral palsy cases due to the following: (1) preterm infants are susceptible to multiple environmental factors and therefore demonstrate an increased risk of cerebral palsy and (2) brain MRI assessment is essential for excluding environmental causes and other particular disorders. A total of 107 patients-all full-term births-without specific findings on brain MRI were identified among 897 patients diagnosed with cerebral palsy who were followed at our center. DNA samples were available for 17 of the 107 cases for trio whole-exome sequencing and array comparative genomic hybridization. We prioritized variants in genes known to be relevant in neurodevelopmental diseases and evaluated their pathogenicity according to the American College of Medical Genetics guidelines. Pathogenic/likely pathogenic candidate variants were identified in 9 of 17 cases (52.9%) within eight genes: CTNNB1 , CYP2U1 , SPAST , GNAO1 , CACNA1A , AMPD2 , STXBP1 , and SCN2A . Five identified variants had previously been reported. No pathogenic copy number variations were identified. The AMPD2 missense variant and the splice-site variants in CTNNB1 and AMPD2 were validated by in vitro functional experiments. The high rate of detecting causative genetic variants (52.9%) suggests that patients diagnosed with cerebral palsy in full-term births without specific MRI findings may include genetic diseases masquerading as cerebral palsy.
Bailey, Peter; Chang, David K; Nones, Katia; Johns, Amber L; Patch, Ann-Marie; Gingras, Marie-Claude; Miller, David K; Christ, Angelika N; Bruxner, Tim J C; Quinn, Michael C; Nourse, Craig; Murtaugh, L Charles; Harliwong, Ivon; Idrisoglu, Senel; Manning, Suzanne; Nourbakhsh, Ehsan; Wani, Shivangi; Fink, Lynn; Holmes, Oliver; Chin, Venessa; Anderson, Matthew J; Kazakoff, Stephen; Leonard, Conrad; Newell, Felicity; Waddell, Nick; Wood, Scott; Xu, Qinying; Wilson, Peter J; Cloonan, Nicole; Kassahn, Karin S; Taylor, Darrin; Quek, Kelly; Robertson, Alan; Pantano, Lorena; Mincarelli, Laura; Sanchez, Luis N; Evers, Lisa; Wu, Jianmin; Pinese, Mark; Cowley, Mark J; Jones, Marc D; Colvin, Emily K; Nagrial, Adnan M; Humphrey, Emily S; Chantrill, Lorraine A; Mawson, Amanda; Humphris, Jeremy; Chou, Angela; Pajic, Marina; Scarlett, Christopher J; Pinho, Andreia V; Giry-Laterriere, Marc; Rooman, Ilse; Samra, Jaswinder S; Kench, James G; Lovell, Jessica A; Merrett, Neil D; Toon, Christopher W; Epari, Krishna; Nguyen, Nam Q; Barbour, Andrew; Zeps, Nikolajs; Moran-Jones, Kim; Jamieson, Nigel B; Graham, Janet S; Duthie, Fraser; Oien, Karin; Hair, Jane; Grützmann, Robert; Maitra, Anirban; Iacobuzio-Donahue, Christine A; Wolfgang, Christopher L; Morgan, Richard A; Lawlor, Rita T; Corbo, Vincenzo; Bassi, Claudio; Rusev, Borislav; Capelli, Paola; Salvia, Roberto; Tortora, Giampaolo; Mukhopadhyay, Debabrata; Petersen, Gloria M; Munzy, Donna M; Fisher, William E; Karim, Saadia A; Eshleman, James R; Hruban, Ralph H; Pilarsky, Christian; Morton, Jennifer P; Sansom, Owen J; Scarpa, Aldo; Musgrove, Elizabeth A; Bailey, Ulla-Maja Hagbo; Hofmann, Oliver; Sutherland, Robert L; Wheeler, David A; Gill, Anthony J; Gibbs, Richard A; Pearson, John V; Waddell, Nicola; Biankin, Andrew V; Grimmond, Sean M
Integrated genomic analysis of 456 pancreatic ductal adenocarcinomas identified 32 recurrently mutated genes that aggregate into 10 pathways: KRAS, TGF-β, WNT, NOTCH, ROBO/SLIT signalling, G1/S transition, SWI-SNF, chromatin modification, DNA repair and RNA processing. Expression analysis defined 4 subtypes: (1) squamous; (2) pancreatic progenitor; (3) immunogenic; and (4) aberrantly differentiated endocrine exocrine (ADEX) that correlate with histopathological characteristics. Squamous tumours are enriched for TP53 and KDM6A mutations, upregulation of the TP63∆N transcriptional network, hypermethylation of pancreatic endodermal cell-fate determining genes and have a poor prognosis. Pancreatic progenitor tumours preferentially express genes involved in early pancreatic development (FOXA2/3, PDX1 and MNX1). ADEX tumours displayed upregulation of genes that regulate networks involved in KRAS activation, exocrine (NR5A2 and RBPJL), and endocrine differentiation (NEUROD1 and NKX2-2). Immunogenic tumours contained upregulated immune networks including pathways involved in acquired immune suppression. These data infer differences in the molecular evolution of pancreatic cancer subtypes and identify opportunities for therapeutic development.
Mourier, Tobias; Willerslev, Eske
in generating and maintaining retroelement-free regions in the human genome. METHODOLOGY/PRINCIPAL FINDINGS: Based on the known transcriptional properties of retroelements, we expect long interspersed elements (LINEs) to be able to display a high degree of transcriptional interference. In contrast, we expect......BACKGROUND: Eukaryotic genomes are scattered with retroelements that proliferate through retrotransposition. Although retroelements make up around 40 percent of the human genome, large regions are found to be completely devoid of retroelements. This has been hypothesised to be a result of genomic...... activity of LINEs has been identified previously. CONCLUSIONS/SIGNIFICANCE: Our observations are consistent with the notion that selection against transcriptional interference has contributed to the maintenance and/or generation of retroelement-free regions in the human genome....
Full Text Available Speciation is a continuous process and analysis of species pairs at different stages of divergence provides insight into how it unfolds. Previous genomic studies on young species pairs have revealed peaks of divergence and heterogeneous genomic differentiation. Yet less known is how localised peaks of differentiation progress to genome-wide divergence during the later stages of speciation in the presence of persistent gene flow. Spanning the speciation continuum, stickleback species pairs are ideal for investigating how genomic divergence builds up during speciation. However, attention has largely focused on young postglacial species pairs, with little knowledge of the genomic signatures of divergence and introgression in older stickleback systems. The Japanese stickleback species pair, composed of the Pacific Ocean three-spined stickleback (Gasterosteus aculeatus and the Japan Sea stickleback (G. nipponicus, which co-occur in the Japanese islands, is at a late stage of speciation. Divergence likely started well before the end of the last glacial period and crosses between Japan Sea females and Pacific Ocean males result in hybrid male sterility. Here we use coalescent analyses and Approximate Bayesian Computation to show that the two species split approximately 0.68-1 million years ago but that they have continued to exchange genes at a low rate throughout divergence. Population genomic data revealed that, despite gene flow, a high level of genomic differentiation is maintained across the majority of the genome. However, we identified multiple, small regions of introgression, occurring mainly in areas of low recombination rate. Our results demonstrate that a high level of genome-wide divergence can establish in the face of persistent introgression and that gene flow can be localized to small genomic regions at the later stages of speciation with gene flow.
Ravinet, Mark; Yoshida, Kohta; Shigenobu, Shuji; Toyoda, Atsushi; Fujiyama, Asao; Kitano, Jun
Speciation is a continuous process and analysis of species pairs at different stages of divergence provides insight into how it unfolds. Previous genomic studies on young species pairs have revealed peaks of divergence and heterogeneous genomic differentiation. Yet less known is how localised peaks of differentiation progress to genome-wide divergence during the later stages of speciation in the presence of persistent gene flow. Spanning the speciation continuum, stickleback species pairs are ideal for investigating how genomic divergence builds up during speciation. However, attention has largely focused on young postglacial species pairs, with little knowledge of the genomic signatures of divergence and introgression in older stickleback systems. The Japanese stickleback species pair, composed of the Pacific Ocean three-spined stickleback (Gasterosteus aculeatus) and the Japan Sea stickleback (G. nipponicus), which co-occur in the Japanese islands, is at a late stage of speciation. Divergence likely started well before the end of the last glacial period and crosses between Japan Sea females and Pacific Ocean males result in hybrid male sterility. Here we use coalescent analyses and Approximate Bayesian Computation to show that the two species split approximately 0.68-1 million years ago but that they have continued to exchange genes at a low rate throughout divergence. Population genomic data revealed that, despite gene flow, a high level of genomic differentiation is maintained across the majority of the genome. However, we identified multiple, small regions of introgression, occurring mainly in areas of low recombination rate. Our results demonstrate that a high level of genome-wide divergence can establish in the face of persistent introgression and that gene flow can be localized to small genomic regions at the later stages of speciation with gene flow.
Full Text Available Abstract Changes in DNA copy number are one of the hallmarks of the genetic instability common to most human cancers. Previous micro-array-based methods have been used to identify chromosomal gains and losses; however, they are unable to genotype alleles at the level of single nucleotide polymorphisms (SNPs. Here we describe a novel algorithm that uses a recently developed high-density oligonucleotide array-based SNP genotyping method, whole genome sampling analysis (WGSA, to identify genome-wide chromosomal gains and losses at high resolution. WGSA simultaneously genotypes over 10,000 SNPs by allele-specific hybridisation to perfect match (PM and mismatch (MM probes synthesised on a single array. The copy number algorithm jointly uses PM intensity and discrimination ratios between paired PM and MM intensity values to identify and estimate genetic copy number changes. Values from an experimental sample are compared with SNP-specific distributions derived from a reference set containing over 100 normal individuals to gain statistical power. Genomic regions with statistically significant copy number changes can be identified using both single point analysis and contiguous point analysis of SNP intensities. We identified multiple regions of amplification and deletion using a panel of human breast cancer cell lines. We verified these results using an independent method based on quantitative polymerase chain reaction and found that our approach is both sensitive and specific and can tolerate samples which contain a mixture of both tumour and normal DNA. In addition, by using known allele frequencies from the reference set, statistically significant genomic intervals can be identified containing contiguous stretches of homozygous markers, potentially allowing the detection of regions undergoing loss of heterozygosity (LOH without the need for a matched normal control sample. The coupling of LOH analysis, via SNP genotyping, with copy number
Grigoriev, Igor V.; Banks, Jo Ann; Nishiyama, Tomoaki; Hasebe, Mitsuyasu; Bowman, John L.; Gribskov, Michael; dePamphilis, Claude; Albert, Victor A.; Aono, Naoki; Aoyama, Tsuyoshi; Ambrose, Barbara A.; Ashton, Neil W.; Axtell, Michael J.; Barker, Elizabeth; Barker, Michael S.; Bennetzen, Jeffrey L.; Bonawitz, Nicholas D.; Chapple, Clint; Cheng, Chaoyang; Correa, Luiz Gustavo Guedes; Dacre, Michael; DeBarry, Jeremy; Dreyer, Ingo; Elias, Marek; Engstrom, Eric M.; Estelle, Mark; Feng, Liang; Finet, Cedric; Floyd, Sandra K.; Frommer, Wolf B.; Fujita, Tomomichi; Gramzow, Lydia; Gutensohn, Michael; Harholt, Jesper; Hattori, Mitsuru; Heyl, Alexander; Hirai, Tadayoshi; Hiwatashi, Yuji; Ishikawa, Masaki; Iwata, Mineko; Karol, Kenneth G.; Koehler, Barbara; Kolukisaoglu, Uener; Kubo, Minoru; Kurata, Tetsuya; Lalonde, Sylvie; Li, Kejie; Li, Ying; Litt, Amy; Lyons, Eric; Manning, Gerard; Maruyama, Takeshi; Michael, Todd P.; Mikami, Koji; Miyazaki, Saori; Morinaga, Shin-ichi; Murata, Takashi; Mueller-Roeber, Bernd; Nelson, David R.; Obara, Mari; Oguri, Yasuko; Olmstead, Richard G.; Onodera, Naoko; Petersen, Bent Larsen; Pils, Birgit; Prigge, Michael; Rensing, Stefan A.; Riano-Pachon, Diego Mauricio; Roberts, Alison W.; Sato, Yoshikatsu; Scheller, Henrik Vibe; Schulz, Burkhard; Schulz, Christian; Shakirov, Eugene V.; Shibagaki, Nakako; Shinohara, Naoki; Shippen, Dorothy E.; Sorensen, Iben; Sotooka, Ryo; Sugimoto, Nagisa; Sugita, Mamoru; Sumikawa, Naomi; Tanurdzic, Milos; Theilsen, Gunter; Ulvskov, Peter; Wakazuki, Sachiko; Weng, Jing-Ke; Willats, William W.G.T.; Wipf, Daniel; Wolf, Paul G.; Yang, Lixing; Zimmer, Andreas D.; Zhu, Qihui; Mitros, Therese; Hellsten, Uffe; Loque, Dominique; Otillar, Robert; Salamov, Asaf; Schmutz, Jeremy; Shapiro, Harris; Lindquist, Erika; Lucas, Susan; Rokhsar, Daniel
We report the genome sequence of the nonseed vascular plant, Selaginella moellendorffii, and by comparative genomics identify genes that likely played important roles in the early evolution of vascular plants and their subsequent evolution
Haanes, E J; Tomlinson, C C
Canine herpesvirus (CHV) is an alpha-herpesvirus of limited pathogenicity in healthy adult dogs and infectivity of the virus appears to be largely limited to cells of canine origin. CHV's low virulence and species specificity make it an attractive candidate for a recombinant vaccine vector to protect dogs against a variety of pathogens. As part of the analysis of the CHV genome, the authors determined the complete nucleotide sequence of the CHV US region as well as portions of the flanking inverted repeats. Seven full open reading frames (ORFs) encoding proteins larger than 100 amino acids were identified within, or partially within the CHV US: cUS2, cUS3, cUS4, cUS6, cUS7, cUS8 and cUS9; which are homologs of the herpes simplex virus type-1 US2; protein kinase; gG, gD, gI, gE; and US9 genes, respectively. An eighth ORF was identified in the inverted repeat region, cIR6, a homolog of the equine herpesvirus type-1 IR6 gene. The authors identified and mapped most of the major transcripts for the predicted CHV US ORFs by Northern analysis.
Full Text Available High-altitude hypoxia (reduced inspired oxygen tension due to decreased barometric pressure exerts severe physiological stress on the human body. Two high-altitude regions where humans have lived for millennia are the Andean Altiplano and the Tibetan Plateau. Populations living in these regions exhibit unique circulatory, respiratory, and hematological adaptations to life at high altitude. Although these responses have been well characterized physiologically, their underlying genetic basis remains unknown. We performed a genome scan to identify genes showing evidence of adaptation to hypoxia. We looked across each chromosome to identify genomic regions with previously unknown function with respect to altitude phenotypes. In addition, groups of genes functioning in oxygen metabolism and sensing were examined to test the hypothesis that particular pathways have been involved in genetic adaptation to altitude. Applying four population genetic statistics commonly used for detecting signatures of natural selection, we identified selection-nominated candidate genes and gene regions in these two populations (Andeans and Tibetans separately. The Tibetan and Andean patterns of genetic adaptation are largely distinct from one another, with both populations showing evidence of positive natural selection in different genes or gene regions. Interestingly, one gene previously known to be important in cellular oxygen sensing, EGLN1 (also known as PHD2, shows evidence of positive selection in both Tibetans and Andeans. However, the pattern of variation for this gene differs between the two populations. Our results indicate that several key HIF-regulatory and targeted genes are responsible for adaptation to high altitude in Andeans and Tibetans, and several different chromosomal regions are implicated in the putative response to selection. These data suggest a genetic role in high-altitude adaption and provide a basis for future genotype/phenotype association
Parker, Brian J; Moltke, Ida; Roth, Adam; Washietl, Stefan; Wen, Jiayu; Kellis, Manolis; Breaker, Ronald; Pedersen, Jakob Skou
Regulatory RNA structures are often members of families with multiple paralogous instances across the genome. Family members share functional and structural properties, which allow them to be studied as a whole, facilitating both bioinformatic and experimental characterization. We have developed a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein-coding regions comprising 725 individual structures, including 48 families with known structural RNA elements. Known families identified include both noncoding RNAs, e.g., miRNAs and the recently identified MALAT1/MEN β lincRNA family; and cis-regulatory structures, e.g., iron-responsive elements. We also identify tens of new families supported by strong evolutionary evidence and other statistical evidence, such as GO term enrichments. For some of these, detailed analysis has led to the formulation of specific functional hypotheses. Examples include two hypothesized auto-regulatory feedback mechanisms: one involving six long hairpins in the 3'-UTR of MAT2A, a key metabolic gene that produces the primary human methyl donor S-adenosylmethionine; the other involving a tRNA-like structure in the intron of the tRNA maturation gene POP1. We experimentally validate the predicted MAT2A structures. Finally, we identify potential new regulatory networks, including large families of short hairpins enriched in immunity-related genes, e.g., TNF, FOS, and CTLA4, which include known transcript destabilizing elements. Our findings exemplify the diversity of post-transcriptional regulation and provide a resource for further characterization of new regulatory mechanisms and families of noncoding RNAs.
Full Text Available Blackleg, caused by Leptosphaeria maculans, is a significant disease which affects the sustainable production of canola. This study reports a genome-wide association study based on 18,804 polymorphic SNPs to identify loci associated with qualitative and quantitative resistance to L. maculans. Genomic regions delimited with 503 significant SNP markers, that are associated with resistance evaluated using 12 single spore isolates and pathotypes from four canola stubble were identified. Several significant associations were detected at known disease resistance loci including in the vicinity of recently cloned Rlm2/LepR3 genes, and at new loci on chromosomes A01/C01, A02/C02, A03/C03, A05/C05, A06, A08, and A09. In addition, we validated statistically significant associations on A01, A07 and A10 in four genetic mapping populations, demonstrating that GWAS marker loci are indeed associated with resistance to L. maculans. One of the novel loci identified for the first time, Rlm12, conveys adult plant resistance and mapped within 13.2 kb from Arabidopsis R gene of TIR-NBS class. We showed that resistance loci are located in the vicinity of R genes of A. thaliana and B. napus on the sequenced genome of B. napus cv. Darmor-bzh. Significantly associated SNP markers provide a valuable tool to enrich germplasm for favorable alleles in order to improve the level of resistance to L. maculans in canola.
Jin, Eun-Heui; Zhang, Enji; Ko, Youngkwon; Sim, Woo Seog; Moon, Dong Eon; Yoon, Keon Jung; Hong, Jang Hee; Lee, Won Hyung
Complex regional pain syndrome (CRPS) is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II) and 5 controls (cut-off value: 1.5-fold change and pCRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10−4). The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression. PMID:24244504
Levy, Daniel; Neuhausen, Susan L; Hunt, Steven C; Kimura, Masayuki; Hwang, Shih-Jen; Chen, Wei; Bis, Joshua C; Fitzpatrick, Annette L; Smith, Erin; Johnson, Andrew D; Gardner, Jeffrey P; Srinivasan, Sathanur R; Schork, Nicholas; Rotter, Jerome I; Herbig, Utz; Psaty, Bruce M; Sastrasinh, Malinee; Murray, Sarah S; Vasan, Ramachandran S; Province, Michael A; Glazer, Nicole L; Lu, Xiaobin; Cao, Xiaojian; Kronmal, Richard; Mangino, Massimo; Soranzo, Nicole; Spector, Tim D; Berenson, Gerald S; Aviv, Abraham
Telomeres are engaged in a host of cellular functions, and their length is regulated by multiple genes. Telomere shortening, in the course of somatic cell replication, ultimately leads to replicative senescence. In humans, rare mutations in genes that regulate telomere length have been identified in monogenic diseases such as dyskeratosis congenita and idiopathic pulmonary fibrosis, which are associated with shortened leukocyte telomere length (LTL) and increased risk for aplastic anemia. Shortened LTL is observed in a host of aging-related complex genetic diseases and is associated with diminished survival in the elderly. We report results of a genome-wide association study of LTL in a consortium of four observational studies (n = 3,417 participants with LTL and genome-wide genotyping). SNPs in the regions of the oligonucleotide/oligosaccharide-binding folds containing one gene (OBFC1; rs4387287; P = 3.9 x 10(-9)) and chemokine (C-X-C motif) receptor 4 gene (CXCR4; rs4452212; P = 2.9 x 10(-8)) were associated with LTL at a genome-wide significance level (P a gene associated with LTL (P = 1.1 x 10(-5)). The identification of OBFC1 through genome-wide association as a locus for interindividual variation in LTL in the general population advances the understanding of telomere biology in humans and may provide insights into aging-related disorders linked to altered LTL dynamics.
Zhao, Meicheng; Zhi, Hui; Doust, Andrew N; Li, Wei; Wang, Yongfang; Li, Haiquan; Jia, Guanqing; Wang, Yongqiang; Zhang, Ning; Diao, Xianmin
The Setaria genus is increasingly of interest to researchers, as its two species, S. viridis and S. italica, are being developed as models for understanding C4 photosynthesis and plant functional genomics. The genome constitution of Setaria species has been studied in the diploid species S. viridis, S. adhaerans and S. grisebachii, where three genomes A, B and C were identified respectively. Two allotetraploid species, S. verticillata and S. faberi, were found to have AABB genomes, and one autotetraploid species, S. queenslandica, with an AAAA genome, has also been identified. The genomes and genome constitutions of most other species remain unknown, even though it was thought there are approximately 125 species in the genus distributed world-wide. GISH was performed to detect the genome constitutions of Eurasia species of S. glauca, S. plicata, and S. arenaria, with the known A, B and C genomes as probes. No or very poor hybridization signal was detected indicating that their genomes are different from those already described. GISH was also performed reciprocally between S. glauca, S. plicata, and S. arenaria genomes, but no hybridization signals between each other were found. The two sets of chromosomes of S. lachnea both hybridized strong signals with only the known C genome of S. grisebachii. Chromosomes of Qing 9, an accession formerly considered as S. viridis, hybridized strong signal only to B genome of S. adherans. Phylogenetic trees constructed with 5S rDNA and knotted1 markers, clearly classify the samples in this study into six clusters, matching the GISH results, and suggesting that the F genome of S. arenaria is basal in the genus. Three novel genomes in the Setaria genus were identified and designated as genome D (S. glauca), E (S. plicata) and F (S. arenaria) respectively. The genome constitution of tetraploid S. lachnea is putatively CCC'C'. Qing 9 is a B genome species indigenous to China and is hypothesized to be a newly identified species. The
Walter, Vonn; Du, Ying; Danilova, Ludmila; Hayward, Michele C; Hayes, D Neil
Integrated analyses of multiple genomic datatypes are now common in cancer profiling studies. Such data present opportunities for numerous computational experiments, yet analytic pipelines are limited. Tools such as the cBioPortal and Regulome Explorer, although useful, are not easy to access programmatically or to implement locally. Here, we introduce the MVisAGe R package, which allows users to quantify gene-level associations between two genomic datatypes to investigate the effect of genomic alterations (e.g., DNA copy number changes on gene expression). Visualizing Pearson/Spearman correlation coefficients according to the genomic positions of the underlying genes provides a powerful yet novel tool for conducting exploratory analyses. We demonstrate its utility by analyzing three publicly available cancer datasets. Our approach highlights canonical oncogenes in chr11q13 that displayed the strongest associations between expression and copy number, including CCND1 and CTTN , genes not identified by copy number analysis in the primary reports. We demonstrate highly concordant usage of shared oncogenes on chr3q, yet strikingly diverse oncogene usage on chr11q as a function of HPV infection status. Regions of chr19 that display remarkable associations between methylation and gene expression were identified, as were previously unreported miRNA-gene expression associations that may contribute to the epithelial-to-mesenchymal transition. Significance: This study presents an important bioinformatics tool that will enable integrated analyses of multiple genomic datatypes. Cancer Res; 78(12); 3375-85. ©2018 AACR . ©2018 American Association for Cancer Research.
Okbay, Aysu; P. Beauchamp, Jonathan; Alan Fontana, Mark
-nucleotide polymorphisms associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural......Educational attainment is strongly influenced by social and other environmental factors, but genetic factors are estimated to account for at least 20% of the variation across individuals1. Here we report the results of a genome-wide association study (GWAS) for educational attainment that extends...... development. Our findings demonstrate that, even for a behavioural phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because educational attainment is measured in large numbers of individuals...
Morrison, Carl D.; Liu, Pengyuan; Woloszynska-Read, Anna; Zhang, Jianmin; Luo, Wei; Qin, Maochun; Bshara, Wiam; Conroy, Jeffrey M.; Sabatini, Linda; Vedell, Peter; Xiong, Donghai; Liu, Song; Wang, Jianmin; Shen, He; Li, Yinwei; Omilian, Angela R.; Hill, Annette; Head, Karen; Guru, Khurshid; Kunnev, Dimiter; Leach, Robert; Eng, Kevin H.; Darlak, Christopher; Hoeflich, Christopher; Veeranki, Srividya; Glenn, Sean; You, Ming; Pruitt, Steven C.; Johnson, Candace S.; Trump, Donald L.
Using complete genome analysis, we sequenced five bladder tumors accrued from patients with muscle-invasive transitional cell carcinoma of the urinary bladder (TCC-UB) and identified a spectrum of genomic aberrations. In three tumors, complex genotype changes were noted. All three had tumor protein p53 mutations and a relatively large number of single-nucleotide variants (SNVs; average of 11.2 per megabase), structural variants (SVs; average of 46), or both. This group was best characterized by chromothripsis and the presence of subclonal populations of neoplastic cells or intratumoral mutational heterogeneity. Here, we provide evidence that the process of chromothripsis in TCC-UB is mediated by nonhomologous end-joining using kilobase, rather than megabase, fragments of DNA, which we refer to as “stitchers,” to repair this process. We postulate that a potential unifying theme among tumors with the more complex genotype group is a defective replication–licensing complex. A second group (two bladder tumors) had no chromothripsis, and a simpler genotype, WT tumor protein p53, had relatively few SNVs (average of 5.9 per megabase) and only a single SV. There was no evidence of a subclonal population of neoplastic cells. In this group, we used a preclinical model of bladder carcinoma cell lines to study a unique SV (translocation and amplification) of the gene glutamate receptor ionotropic N-methyl D-aspertate as a potential new therapeutic target in bladder cancer. PMID:24469795
de Boer, Ynto S; van Gerven, Nicole M F; Zwiers, Antonie; Verwer, Bart J; van Hoek, Bart; van Erpecum, Karel J; Beuers, Ulrich; van Buuren, Henk R; Drenth, Joost P H; den Ouden, Jannie W; Verdonk, Robert C; Koek, Ger H; Brouwer, Johannes T; Guichelaar, Maureen M J; Vrolijk, Jan M; Kraal, Georg; Mulder, Chris J J; van Nieuwkerk, Carin M J; Fischer, Janett; Berg, Thomas; Stickel, Felix; Sarrazin, Christoph; Schramm, Christoph; Lohse, Ansgar W; Weiler-Normann, Christina; Lerch, Markus M; Nauck, Matthias; Völzke, Henry; Homuth, Georg; Bloemena, Elisabeth; Verspaget, Hein W; Kumar, Vinod; Zhernakova, Alexandra; Wijmenga, Cisca; Franke, Lude; Bouma, Gerd
Autoimmune hepatitis (AIH) is an uncommon autoimmune liver disease of unknown etiology. We used a genome-wide approach to identify genetic variants that predispose individuals to AIH. We performed a genome-wide association study of 649 adults in The Netherlands with AIH type 1 and 13,436 controls. Initial associations were further analyzed in an independent replication panel comprising 451 patients with AIH type 1 in Germany and 4103 controls. We also performed an association analysis in the discovery cohort using imputed genotypes of the major histocompatibility complex region. We associated AIH with a variant in the major histocompatibility complex region at rs2187668 (P = 1.5 × 10(-78)). Analysis of this variant in the discovery cohort identified HLA-DRB1*0301 (P = 5.3 × 10(-49)) as a primary susceptibility genotype and HLA-DRB1*0401 (P = 2.8 × 10(-18)) as a secondary susceptibility genotype. We also associated AIH with variants of SH2B3 (rs3184504, 12q24; P = 7.7 × 10(-8)) and CARD10 (rs6000782, 22q13.1; P = 3.0 × 10(-6)). In addition, strong inflation of association signal was found with single-nucleotide polymorphisms associated with other immune-mediated diseases, including primary sclerosing cholangitis and primary biliary cirrhosis, but not with single-nucleotide polymorphisms associated with other genetic traits. In a genome-wide association study, we associated AIH type 1 with variants in the major histocompatibility complex region, and identified variants of SH2B3and CARD10 as likely risk factors. These findings support a complex genetic basis for AIH pathogenesis and indicate that part of the genetic susceptibility overlaps with that for other immune-mediated liver diseases. Copyright © 2014 AGA Institute. Published by Elsevier Inc. All rights reserved.
Parker, Brian John; Moltke, Ida; Roth, Adam
a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein...
Seo, Beomseok; Kim, Chuna; Hills, Mark; Sung, Sanghyun; Kim, Hyesook; Kim, Eunkyeong; Lim, Daisy S; Oh, Hyun-Seok; Choi, Rachael Mi Jung; Chun, Jongsik; Shim, Jaegal; Lee, Junho
Cells surviving crisis are often tumorigenic and their telomeres are commonly maintained through the reactivation of telomerase. However, surviving cells occasionally activate a recombination-based mechanism called alternative lengthening of telomeres (ALT). Here we establish stably maintained survivors in telomerase-deleted Caenorhabditis elegans that escape from sterility by activating ALT. ALT survivors trans-duplicate an internal genomic region, which is already cis-duplicated to chromosome ends, across the telomeres of all chromosomes. These 'Template for ALT' (TALT) regions consist of a block of genomic DNA flanked by telomere-like sequences, and are different between two genetic background. We establish a model that an ancestral duplication of a donor TALT region to a proximal telomere region forms a genomic reservoir ready to be incorporated into telomeres on ALT activation.
The genome of Saccharomyces cerevisiae contains several duplicated regions. The recent sequencing results of several yeast species suggest that the duplicated regions found in the modern Saccharomyces species are probably the result of a single gross duplication, as well as a series of sporadic...
Raviram, Ramya; Rocha, Pedro P; Müller, Christian L; Miraldi, Emily R; Badri, Sana; Fu, Yi; Swanzey, Emily; Proudhon, Charlotte; Snetkova, Valentina; Bonneau, Richard; Skok, Jane A
4C-Seq has proven to be a powerful technique to identify genome-wide interactions with a single locus of interest (or "bait") that can be important for gene regulation. However, analysis of 4C-Seq data is complicated by the many biases inherent to the technique. An important consideration when dealing with 4C-Seq data is the differences in resolution of signal across the genome that result from differences in 3D distance separation from the bait. This leads to the highest signal in the region immediately surrounding the bait and increasingly lower signals in far-cis and trans. Another important aspect of 4C-Seq experiments is the resolution, which is greatly influenced by the choice of restriction enzyme and the frequency at which it can cut the genome. Thus, it is important that a 4C-Seq analysis method is flexible enough to analyze data generated using different enzymes and to identify interactions across the entire genome. Current methods for 4C-Seq analysis only identify interactions in regions near the bait or in regions located in far-cis and trans, but no method comprehensively analyzes 4C signals of different length scales. In addition, some methods also fail in experiments where chromatin fragments are generated using frequent cutter restriction enzymes. Here, we describe 4C-ker, a Hidden-Markov Model based pipeline that identifies regions throughout the genome that interact with the 4C bait locus. In addition, we incorporate methods for the identification of differential interactions in multiple 4C-seq datasets collected from different genotypes or experimental conditions. Adaptive window sizes are used to correct for differences in signal coverage in near-bait regions, far-cis and trans chromosomes. Using several datasets, we demonstrate that 4C-ker outperforms all existing 4C-Seq pipelines in its ability to reproducibly identify interaction domains at all genomic ranges with different resolution enzymes.
Okbay, Aysu; Beauchamp, Jonathan P.; Fontana, Mark A.; Lee, James J.; Pers, Tune H.; Rietveld, Cornelius A.; Turley, Patrick; Chen, Guo-Bo; Emilsson, Valur; Meddens, S. Fleur W.; Oskarsson, Sven; Pickrell, Joseph K.; Thom, Kevin; Timshel, Pascal; de Vlaming, Ronald; Abdellaoui, Abdel; Ahluwalia, Tarunveer S.; Bacelis, Jonas; Baumbach, Clemens; Bjornsdottir, Gyda; Brandsma, Johannes H.; Concas, Maria Pina; Derringer, Jaime; Furlotte, Nicholas A.; Galesloot, Tessel E.; Girotto, Giorgia; Gupta, Richa; Hall, Leanne M.; Harris, Sarah E.; Hofer, Edith; Horikoshi, Momoko; Huffman, Jennifer E.; Kaasik, Kadri; Kalafati, Ioanna P.; Karlsson, Robert; Kong, Augustine; Lahti, Jari; van der Lee, Sven J.; de Leeuw, Christiaan; Lind, Penelope A.; Lindgren, Karl-Oskar; Liu, Tian; Mangino, Massimo; Marten, Jonathan; Mihailov, Evelin; Miller, Michael B.; van der Most, Peter J.; Oldmeadow, Christopher; Payton, Antony; Pervjakova, Natalia; Peyrot, Wouter J.; Qian, Yong; Raitakari, Olli; Rueedi, Rico; Salvi, Erika; Schmidt, Börge; Schraut, Katharina E.; Shi, Jianxin; Smith, Albert V.; Poot, Raymond A.; Pourcain, Beate; Teumer, Alexander; Thorleifsson, Gudmar; Verweij, Niek; Vuckovic, Dragana; Wellmann, Juergen; Westra, Harm-Jan; Yang, Jingyun; Zhao, Wei; Zhu, Zhihong; Alizadeh, Behrooz Z.; Amin, Najaf; Bakshi, Andrew; Baumeister, Sebastian E.; Biino, Ginevra; Bønnelykke, Klaus; Boyle, Patricia A.; Campbell, Harry; Cappuccio, Francesco P.; Davies, Gail; De Neve, Jan-Emmanuel; Deloukas, Panos; Demuth, Ilja; Ding, Jun; Eibich, Peter; Eisele, Lewin; Eklund, Niina; Evans68, David M.; Faul, Jessica D.; Feitosa, Mary F.; Forstner, Andreas J.; Gandin, Ilaria; Gunnarsson, Bjarni; Halldórsson, Bjarni V.; Harris, Tamara B.; Heath, Andrew C.; Hocking, Lynne J.; Holliday, Elizabeth G.; Homuth, Georg; Horan, Michael A.; Hottenga, Jouke-Jan; de Jager, Philip L.; Joshi, Peter K.; Jugessur, Astanand; Kaakinen, Marika A.; Kähönen, Mika; Kanoni, Stavroula; Keltigangas-Järvinen, Liisa; Kiemeney, Lambertus A.L.M.; Kolcic, Ivana; Koskinen, Seppo; Kraja, Aldi T.; Kroh, Martin; Kutalik, Zoltan; Latvala, Antti; Launer, Lenore J.; Lebreton, Maël P.; Levinson, Douglas F.; Lichtenstein, Paul; Lichtner, Peter; Liewald, David C.M.; Loukola, Anu; Madden, Pamela A.; Mägi, Reedik; Mäki-Opas, Tomi; Marioni, Riccardo E.; Marques-Vidal, Pedro; Meddens, Gerardus A.; McMahon, George; Meisinger, Christa; Meitinger, Thomas; Milaneschi, Yusplitri; Milani, Lili; Montgomery, Grant W.; Myhre, Ronny; Nelson, Christopher P.; Nyholt, Dale R.; Ollier, William E.R.; Palotie, Aarno; Paternoster, Lavinia; Pedersen, Nancy L.; Petrovic, Katja E.; Porteous, David J.; Räikkönen, Katri; Ring, Susan M.; Robino, Antonietta; Rostapshova, Olga; Rudan, Igor; Rustichini, Aldo; Salomaa, Veikko; Sanders, Alan R.; Sarin, Antti-Pekka; Schmidt, Helena; Scott, Rodney J.; Smith, Blair H.; Smith, Jennifer A.; Staessen, Jan A.; Steinhagen-Thiessen, Elisabeth; Strauch, Konstantin; Terracciano, Antonio; Tobin, Martin D.; Ulivi, Sheila; Vaccargiu, Simona; Quaye, Lydia; van Rooij, Frank J.A.; Venturini, Cristina; Vinkhuyzen, Anna A.E.; Völker, Uwe; Völzke, Henry; Vonk, Judith M.; Vozzi, Diego; Waage, Johannes; Ware, Erin B.; Willemsen, Gonneke; Attia, John R.; Bennett, David A.; Berger, Klaus; Bertram, Lars; Bisgaard, Hans; Boomsma, Dorret I.; Borecki, Ingrid B.; Bultmann, Ute; Chabris, Christopher F.; Cucca, Francesco; Cusi, Daniele; Deary, Ian J.; Dedoussis, George V.; van Duijn, Cornelia M.; Eriksson, Johan G.; Franke, Barbara; Franke, Lude; Gasparini, Paolo; Gejman, Pablo V.; Gieger, Christian; Grabe, Hans-Jörgen; Gratten, Jacob; Groenen, Patrick J.F.; Gudnason, Vilmundur; van der Harst, Pim; Hayward, Caroline; Hinds, David A.; Hoffmann, Wolfgang; Hyppönen, Elina; Iacono, William G.; Jacobsson, Bo; Järvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Kaprio, Jaakko; Kardia, Sharon L.R.; Lehtimäki, Terho; Lehrer, Steven F.; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; Metspalu, Andres; Pendleton, Neil; Penninx, Brenda W.J.H.; Perola, Markus; Pirastu, Nicola; Pirastu, Mario; Polasek, Ozren; Posthuma, Danielle; Power, Christine; Province, Michael A.; Samani, Nilesh J.; Schlessinger, David; Schmidt, Reinhold; Sørensen, Thorkild I.A.; Spector, Tim D.; Stefansson, Kari; Thorsteinsdottir, Unnur; Thurik, A. Roy; Timpson, Nicholas J.; Tiemeier, Henning; Tung, Joyce Y.; Uitterlinden, André G.; Vitart, Veronique; Vollenweider, Peter; Weir, David R.; Wilson, James F.; Wright, Alan F.; Conley, Dalton C.; Krueger, Robert F.; Smith, George Davey; Hofman, Albert; Laibson, David I.; Medland, Sarah E.; Meyer, Michelle N.; Yang, Jian; Johannesson, Magnus; Visscher, Peter M.; Esko, Tõnu; Koellinger, Philipp D.; Cesarini, David; Benjamin, Daniel J.
Summary Educational attainment (EA) is strongly influenced by social and other environmental factors, but genetic factors are also estimated to account for at least 20% of the variation across individuals1. We report the results of a genome-wide association study (GWAS) for EA that extends our earlier discovery sample1,2 of 101,069 individuals to 293,723 individuals, and a replication in an independent sample of 111,349 individuals from the UK Biobank. We now identify 74 genome-wide significant loci associated with number of years of schooling completed. Single-nucleotide polymorphisms (SNPs) associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioral phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because EA is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric disease. PMID:27225129
Ripke, S.; Sanders, A. R.; Kendler, K. S.; Levinson, D. F.; Sklar, P.; Holmans, P. A.; Lin, D. Y.; Duan, J.; Ophoff, R. A.; Andreassen, O. A.; Scolnick, E.; Cichon, S.; St Clair, D.; Corvin, A.; Gurling, H.; Werge, T.; Rujescu, D.; Blackwood, D. H.; Pato, C. N.; Malhotra, A. K.; Purcell, S.; Dudbridge, F.; Neale, B. M.; Rossin, L.; Visscher, P. M.; Posthuma, D.; Ruderfer, D. M.; Fanous, A.; Stefansson, H.; Steinberg, S.; Mowry, B. J.; Golimbet, V.; de Hert, M.; Jonsson, E. G.; Bitter, I.; Pietilainen, O. P.; Collier, D. A.; Tosato, S.; Agartz, I.; Albus, M.; Alexander, M.; Amdur, R. L.; Amin, F.; Bass, N.; Bergen, S. E.; Black, D. W.; Borglum, A. D.; Brown, M. A.; Bruggeman, R.; Buccola, N. G.; Byerley, W. F.; Cahn, W.; Cantor, R. M.; Carr, V. J.; Catts, S. V.; Choudhury, K.; Cloninger, C. R.; Cormican, P.; Craddock, N.; Danoy, P. A.; Datta, S.; de Haan, L.; Demontis, D.; Dikeos, D.; Djurovic, S.; Donnely, P.; Donohoe, G.; Duong, L.; Dwyer, S.; Fink-Jensen, A.; Freedman, R.; Freimer, N. B.; Friedl, M.; Georgieva, L.; Giegling, I.; Gill, M.; Glenthoj, B.; Godard, S.; Hamshere, M.; Hansen, M.; Hartmann, A. M.; Henskens, F. A.; Hougaard, D. M.; Hultman, C. M.; Ingason, A.; Jablensky, A. V.; Jakobsen, K. D.; Jay, M.; Jurgens, G.; Kahn, R. S.; Keller, M. C.; Kenis, G.; Kenny, E.; Kim, Y.; Kirov, G. K.; Konnerth, H.; Konte, B.; Krabbendam, L.; Krasucki, R.; Lasseter, V. K.; Laurent, C.; Lawrence, J.; Lencz, T.; Lerer, F. B.; Liang, K. Y.; Lichtenstein, P.; Lieberman, J. A.; Linszen, D. H.; Lonnqvist, J.; Loughland, C. M.; Maclean, A. W.; Maher, B. S.; Maier, W.; Mallet, J.; Malloy, P.; Mattheisen, M.; Mattingsdal, M.; McGhee, K. A.; McGrath, J. J.; McIntosh, A.; McLean, D. E.; McQuillin, A.; Melle, I.; Michie, P. T.; Milanova, V.; Morris, D. W.; Mors, O.; Mortensen, P. B.; Moskvina, V.; Muglia, P.; Myin-Germeys, I.; Nertney, D. A.; Nestadt, G.; Nielsen, J.; Nikolov, I.; Nordentoft, M.; Norton, N.; Nothen, M. M.; O'Dushlaine, C. T.; Olincy, A.; Olsen, L.; O'Neill, F. A.; Orntoft, T. F.; Owen, M. J.; Pantelis, C.; Papadimitriou, G.; Pato, M. T.; Peltonen, L.; Petursson, H.; Pickard, B.; Pimm, J.; Pulver, A. E.; Puri, V.; Quested, D.; Quinn, E. M.; Rasmussen, H. B.; Rethelyi, J. M.; Ribble, R.; Rietschel, M.; Riley, B. P.; Ruggeri, M.; Schall, U.; Schulze, T. G.; Schwab, S. G.; Scott, R. J.; Shi, J.; Sigurdsson, E.; Silvermann, J. M.; Spencer, C. C.; Stefansson, K.; Strange, A.; Strengman, E.; Stroup, T. S.; Suvisaari, J.; Terenius, L.; Thirumalai, S.; Thygesen, J. H.; Timm, S.; Toncheva, D.; van den Oord, E.; van Os, J.; van Winkel, R.; Veldink, J.; Walsh, D.; Wang, A. G.; Wiersma, D.; Wildenauer, D. B.; Williams, H. J.; Williams, N. M.; Wormley, B.; Zammit, S.; Sullivan, P. F.; O'Donovan, M. C.; Daly, M. J.; Gejman, P. V.
We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded
Hestand, Matthew S.; Kalbfleisch, Theodore S.; Coleman, Stephen J.
Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced m...... and appear to be small errors in the equine reference genome, since they are also identified as homozygous variants by genomic DNA resequencing of the reference horse. Taken together, we provide a resource of equine mRNA structures and protein coding variants that will enhance equine and cross...
Himes, Blanca E.; Sheppard, Keith; Berndt, Annerose; Leme, Adriana S.; Myers, Rachel A.; Gignoux, Christopher R.; Levin, Albert M.; Gauderman, W. James; Yang, James J.; Mathias, Rasika A.; Romieu, Isabelle; Torgerson, Dara G.; Roth, Lindsey A.; Huntsman, Scott; Eng, Celeste; Klanderman, Barbara; Ziniti, John; Senter-Sylvia, Jody; Szefler, Stanley J.; Lemanske, Robert F.; Zeiger, Robert S.; Strunk, Robert C.; Martinez, Fernando D.; Boushey, Homer; Chinchilli, Vernon M.; Israel, Elliot; Mauger, David; Koppelman, Gerard H.; Postma, Dirkje S.; Nieuwenhuis, Maartje A. E.; Vonk, Judith M.; Lima, John J.; Irvin, Charles G.; Peters, Stephen P.; Kubo, Michiaki; Tamari, Mayumi; Nakamura, Yusuke; Litonjua, Augusto A.; Tantisira, Kelan G.; Raby, Benjamin A.; Bleecker, Eugene R.; Meyers, Deborah A.; London, Stephanie J.; Barnes, Kathleen C.; Gilliland, Frank D.; Williams, L. Keoki; Burchard, Esteban G.; Nicolae, Dan L.; Ober, Carole; DeMeo, Dawn L.; Silverman, Edwin K.; Paigen, Beverly; Churchill, Gary; Shapiro, Steve D.; Weiss, Scott
Asthma is a common chronic respiratory disease characterized by airway hyperresponsiveness (AHR). The genetics of asthma have been widely studied in mouse and human, and homologous genomic regions have been associated with mouse AHR and human asthma-related phenotypes. Our goal was to identify
Kohane Isaac S
Full Text Available Abstract Background Genomic sequencing of SNPs is increasingly prevalent, though the amount of familial information these data contain has not been quantified. Methods We provide a framework for measuring the risk to siblings of a patient's SNP genotype disclosure, and demonstrate that sibling SNP genotypes can be inferred with substantial accuracy. Results Extending this inference technique, we determine that a very low number of matches at commonly varying SNPs is sufficient to confirm sib-ship, demonstrating that published sequence data can reliably be used to derive sibling identities. Using HapMap trio data, at SNPs where one child is homozygotic major, with a minor allele frequency ≤ 0.20, (N = 452684, 65.1% we achieve 91.9% inference accuracy for sibling genotypes. Conclusion These findings demonstrate that substantial discrimination and privacy risks arise from use of inferred familial genomic data.
Full Text Available Questions of understanding and quantifying the representation and amount of information in organisms have become a central part of biological research, as they potentially hold the key to fundamental advances. In this paper, we demonstrate the use of information-theoretic tools for the task of identifying segments of biomolecules (DNA or RNA that are statistically correlated. We develop a precise and reliable methodology, based on the notion of mutual information, for finding and extracting statistical as well as structural dependencies. A simple threshold function is defined, and its use in quantifying the level of significance of dependencies between biological segments is explored. These tools are used in two specific applications. First, they are used for the identification of correlations between different parts of the maize zmSRp32 gene. There, we find significant dependencies between the 5Ã¢Â€Â² untranslated region in zmSRp32 and its alternatively spliced exons. This observation may indicate the presence of as-yet unknown alternative splicing mechanisms or structural scaffolds. Second, using data from the FBI's combined DNA index system (CODIS, we demonstrate that our approach is particularly well suited for the problem of discovering short tandem repeatsÃ¢Â€Â”an application of importance in genetic profiling.
Graham, Rikki M A; Hiley, Lester; Rathnayake, Irani U; Jennison, Amy V
Salmonella enterica is a major cause of gastroenteritis and foodborne illness in Australia where notification rates in the state of Queensland are the highest in the country. S. Enteritidis is among the five most common serotypes reported in Queensland and it is a priority for epidemiological surveillance due to concerns regarding its emergence in Australia. Using whole genome sequencing, we have analysed the genomic epidemiology of 217 S. Enteritidis isolates from Queensland, and observed that they fall into three distinct clades, which we have differentiated as Clades A, B and C. Phage types and MLST sequence types differed between the clades and comparative genomic analysis has shown that each has a unique profile of prophage and genomic islands. Several of the phage regions present in the S. Enteritidis reference strain P125109 were absent in Clades A and C, and these clades also had difference in the presence of pathogenicity islands, containing complete SPI-6 and SPI-19 regions, while P125109 does not. Antimicrobial resistance markers were found in 39 isolates, all but one of which belonged to Clade B. Phylogenetic analysis of the Queensland isolates in the context of 170 international strains showed that Queensland Clade B isolates group together with the previously identified global clade, while the other two clades are distinct and appear largely restricted to Australia. Locally sourced environmental isolates included in this analysis all belonged to Clades A and C, which is consistent with the theory that these clades are a source of locally acquired infection, while Clade B isolates are mostly travel related.
Ping, Zheng; Siegal, Gene P.; Almeida, Jonas S.; Schnitt, Stuart J.; Shen, Dejun
Background: Genetics and genomics have radically altered our understanding of breast cancer progression. However, the genomic basis of various histopathologic features of breast cancer is not yet well-defined. Materials and Methods: The Cancer Genome Atlas (TCGA) is an international database containing a large collection of human cancer genome sequencing data. cBioPortal is a web tool developed for mining these sequencing data. We performed mining of TCGA sequencing data in an attempt to characterize the genomic features correlated with breast cancer histopathology. We first assessed the quality of the TCGA data using a group of genes with known alterations in various cancers. Both genome-wide gene mutation and copy number changes as well as a group of genes with a high frequency of genetic changes were then correlated with various histopathologic features of invasive breast cancer. Results: Validation of TCGA data using a group of genes with known alterations in breast cancer suggests that the TCGA has accurately documented the genomic abnormalities of multiple malignancies. Further analysis of TCGA breast cancer sequencing data shows that accumulation of specific genomic defects is associated with higher tumor grade, larger tumor size and receptor negativity. Distinct groups of genomic changes were found to be associated with the different grades of invasive ductal carcinoma. The mutator role of the TP53 gene was validated by genomic sequencing data of invasive breast cancer and TP53 mutation was found to play a critical role in defining high tumor grade. Conclusions: Data mining of the TCGA genome sequencing data is an innovative and reliable method to help characterize the genomic abnormalities associated with histopathologic features of invasive breast cancer. PMID:24672738
Full Text Available Background: Genetics and genomics have radically altered our understanding of breast cancer progression. However, the genomic basis of various histopathologic features of breast cancer is not yet well-defined. Materials and Methods: The Cancer Genome Atlas (TCGA is an international database containing a large collection of human cancer genome sequencing data. cBioPortal is a web tool developed for mining these sequencing data. We performed mining of TCGA sequencing data in an attempt to characterize the genomic features correlated with breast cancer histopathology. We first assessed the quality of the TCGA data using a group of genes with known alterations in various cancers. Both genome-wide gene mutation and copy number changes as well as a group of genes with a high frequency of genetic changes were then correlated with various histopathologic features of invasive breast cancer. Results: Validation of TCGA data using a group of genes with known alterations in breast cancer suggests that the TCGA has accurately documented the genomic abnormalities of multiple malignancies. Further analysis of TCGA breast cancer sequencing data shows that accumulation of specific genomic defects is associated with higher tumor grade, larger tumor size and receptor negativity. Distinct groups of genomic changes were found to be associated with the different grades of invasive ductal carcinoma. The mutator role of the TP53 gene was validated by genomic sequencing data of invasive breast cancer and TP53 mutation was found to play a critical role in defining high tumor grade. Conclusions: Data mining of the TCGA genome sequencing data is an innovative and reliable method to help characterize the genomic abnormalities associated with histopathologic features of invasive breast cancer.
Full Text Available BACKGROUND: Eukaryotic genomes are scattered with retroelements that proliferate through retrotransposition. Although retroelements make up around 40 percent of the human genome, large regions are found to be completely devoid of retroelements. This has been hypothesised to be a result of genomic regions being intolerant to insertions of retroelements. The inadvertent transcriptional activity of retroelements may affect neighbouring genes, which in turn could be detrimental to an organism. We speculate that such retroelement transcription, or transcriptional interference, is a contributing factor in generating and maintaining retroelement-free regions in the human genome. METHODOLOGY/PRINCIPAL FINDINGS: Based on the known transcriptional properties of retroelements, we expect long interspersed elements (LINEs to be able to display a high degree of transcriptional interference. In contrast, we expect short interspersed elements (SINEs to display very low levels of transcriptional interference. We find that genomic regions devoid of long interspersed elements (LINEs are enriched for protein-coding genes, but that this is not the case for regions devoid of short interspersed elements (SINEs. This is expected if genes are subject to selection against transcriptional interference. We do not find microRNAs to be associated with genomic regions devoid of either SINEs or LINEs. We further observe an increased relative activity of genes overlapping LINE-free regions during early embryogenesis, where activity of LINEs has been identified previously. CONCLUSIONS/SIGNIFICANCE: Our observations are consistent with the notion that selection against transcriptional interference has contributed to the maintenance and/or generation of retroelement-free regions in the human genome.
... institutional gaps or inadequacies are found, regions should be identified keeping in mind which agencies would... section 208 of the Federal Water Pollution Control Act, with underground injection control agencies...
Full Text Available Complex regional pain syndrome (CRPS is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II and 5 controls (cut-off value: 1.5-fold change and p<0.05. Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1, matrix metalloproteinase 9 (MMP9, alanine aminopeptidase N (ANPEP, l-histidine decarboxylase (HDC, granulocyte colony-stimulating factor 3 receptor (G-CSF3R, and signal transducer and activator of transcription 3 (STAT3 genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR. We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10(-4. The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression.
Johnson Todd A
Full Text Available Abstract Background The strong linkage disequilibrium (LD recently found in genic or exonic regions of the human genome demonstrated that LD can be increased by evolutionary mechanisms that select for functionally important loci. This suggests that LD might be stronger in regions conserved among species than in non-conserved regions, since regions exposed to natural selection tend to be conserved. To assess this hypothesis, we used genome-wide polymorphism data from the HapMap project and investigated LD within DNA sequences conserved between the human and mouse genomes. Results Unexpectedly, we observed that LD was significantly weaker in conserved regions than in non-conserved regions. To investigate why, we examined sequence features that may distort the relationship between LD and conserved regions. We found that interspersed repeats, and not other sequence features, were associated with the weak LD tendency in conserved regions. To appropriately understand the relationship between LD and conserved regions, we removed the effect of repetitive elements and found that the high degree of sequence conservation was strongly associated with strong LD in coding regions but not with that in non-coding regions. Conclusion Our work demonstrates that the degree of sequence conservation does not simply increase LD as predicted by the hypothesis. Rather, it implies that purifying selection changes the polymorphic patterns of coding sequences but has little influence on the patterns of functional units such as regulatory elements present in non-coding regions, since the former are generally restricted by the constraint of maintaining a functional protein product across multiple exons while the latter may exist more as individually isolated units.
Sahu Binod B
Full Text Available Abstract Background Molecular markers facilitate both genotype identification, essential for modern animal and plant breeding, and the isolation of genes based on their map positions. Advancements in sequencing technology have made possible the identification of single nucleotide polymorphisms (SNPs for any genomic regions. Here a sequence based polymorphic (SBP marker technology for generating molecular markers for targeted genomic regions in Arabidopsis is described. Results A ~3X genome coverage sequence of the Arabidopsis thaliana ecotype, Niederzenz (Nd-0 was obtained by applying Illumina's sequencing by synthesis (Solexa technology. Comparison of the Nd-0 genome sequence with the assembled Columbia-0 (Col-0 genome sequence identified putative single nucleotide polymorphisms (SNPs throughout the entire genome. Multiple 75 base pair Nd-0 sequence reads containing SNPs and originating from individual genomic DNA molecules were the basis for developing co-dominant SBP markers. SNPs containing Col-0 sequences, supported by transcript sequences or sequences from multiple BAC clones, were compared to the respective Nd-0 sequences to identify possible restriction endonuclease enzyme site variations. Small amplicons, PCR amplified from both ecotypes, were digested with suitable restriction enzymes and resolved on a gel to reveal the sequence based polymorphisms. By applying this technology, 21 SBP markers for the marker poor regions of the Arabidopsis map representing polymorphisms between Col-0 and Nd-0 ecotypes were generated. Conclusions The SBP marker technology described here allowed the development of molecular markers for targeted genomic regions of Arabidopsis. It should facilitate isolation of co-dominant molecular markers for targeted genomic regions of any animal or plant species, whose genomic sequences have been assembled. This technology will particularly facilitate the development of high density molecular marker maps, essential for
Lu, Qiongshi; Hu, Yiming; Sun, Jiehuan; Cheng, Yuwei; Cheung, Kei-Hoi; Zhao, Hongyu
Identifying functional regions in the human genome is a major goal in human genetics. Great efforts have been made to functionally annotate the human genome either through computational predictions, such as genomic conservation, or high-throughput experiments, such as the ENCODE project. These efforts have resulted in a rich collection of functional annotation data of diverse types that need to be jointly analyzed for integrated interpretation and annotation. Here we present GenoCanyon, a whole-genome annotation method that performs unsupervised statistical learning using 22 computational and experimental annotations thereby inferring the functional potential of each position in the human genome. With GenoCanyon, we are able to predict many of the known functional regions. The ability of predicting functional regions as well as its generalizable statistical framework makes GenoCanyon a unique and powerful tool for whole-genome annotation. The GenoCanyon web server is available at http://genocanyon.med.yale.edu.
The results of this thesis show that the probability of introgression of a putative transgene to wild relatives indeed depends strongly on the insertion location of the transgene. The study of genomic selection patterns can identify crop genomic regions under negative selection in multiple
Puente, Xose S.; Pinyol, Magda; Quesada, Víctor; Conde, Laura; Ordóñez, Gonzalo R.; Villamor, Neus; Escaramis, Georgia; Jares, Pedro; Beà, Sílvia; González-Díaz, Marcos; Bassaganyas, Laia; Baumann, Tycho; Juan, Manel; López-Guerra, Mónica; Colomer, Dolors; Tubío, José M. C.; López, Cristina; Navarro, Alba; Tornador, Cristian; Aymerich, Marta; Rozman, María; Hernández, Jesús M.; Puente, Diana A.; Freije, José M. P.; Velasco, Gloria; Gutiérrez-Fernández, Ana; Costa, Dolors; Carrió, Anna; Guijarro, Sara; Enjuanes, Anna; Hernández, Lluís; Yagüe, Jordi; Nicolás, Pilar; Romeo-Casabona, Carlos M.; Himmelbauer, Heinz; Castillo, Ester; Dohm, Juliane C.; de Sanjosé, Silvia; Piris, Miguel A.; de Alava, Enrique; Miguel, Jesús San; Royo, Romina; Gelpí, Josep L.; Torrents, David; Orozco, Modesto; Pisano, David G.; Valencia, Alfonso; Guigó, Roderic; Bayés, Mónica; Heath, Simon; Gut, Marta; Klatt, Peter; Marshall, John; Raine, Keiran; Stebbings, Lucy A.; Futreal, P. Andrew; Stratton, Michael R.; Campbell, Peter J.; Gut, Ivo; López-Guillermo, Armando; Estivill, Xavier; Montserrat, Emili; López-Otín, Carlos; Campo, Elías
Chronic lymphocytic leukaemia (CLL), the most frequent leukaemia in adults in Western countries, is a heterogeneous disease with variable clinical presentation and evolution1,2. Two major molecular subtypes can be distinguished, characterized respectively by a high or low number of somatic hypermutations in the variable region of immunoglobulin genes3,4. The molecular changes leading to the pathogenesis of the disease are still poorly understood. Here we performed whole-genome sequencing of four cases of CLL and identified 46 somatic mutations that potentially affect gene function. Further analysis of these mutations in 363 patients with CLL identified four genes that are recurrently mutated: notch 1 (NOTCH1), exportin 1 (XPO1), myeloid differentiation primary response gene 88 (MYD88) and kelch-like 6 (KLHL6). Mutations in MYD88 and KLHL6 are predominant in cases of CLL with mutated immunoglobulin genes, whereas NOTCH1 and XPO1 mutations are mainly detected in patients with unmutated immunoglobulins. The patterns of somatic mutation, supported by functional and clinical analyses, strongly indicate that the recurrent NOTCH1, MYD88 and XPO1 mutations are oncogenic changes that contribute to the clinical evolution of the disease. To our knowledge, this is the first comprehensive analysis of CLL combining whole-genome sequencing with clinical characteristics and clinical outcomes. It highlights the usefulness of this approach for the identification of clinically relevant mutations in cancer. PMID:21642962
Full Text Available Inbreeding has long been recognized as a primary cause of fitness reduction in both wild and domesticated populations. Consanguineous matings cause inheritance of haplotypes that are identical by descent (IBD and result in homozygous stretches along the genome of the offspring. Size and position of regions of homozygosity (ROHs are expected to correlate with genomic features such as GC content and recombination rate, but also direction of selection. Thus, ROHs should be non-randomly distributed across the genome. Therefore, demographic history may not fully predict the effects of inbreeding. The porcine genome has a relatively heterogeneous distribution of recombination rate, making Sus scrofa an excellent model to study the influence of both recombination landscape and demography on genomic variation. This study utilizes next-generation sequencing data for the analysis of genomic ROH patterns, using a comparative sliding window approach. We present an in-depth study of genomic variation based on three different parameters: nucleotide diversity outside ROHs, the number of ROHs in the genome, and the average ROH size. We identified an abundance of ROHs in all genomes of multiple pigs from commercial breeds and wild populations from Eurasia. Size and number of ROHs are in agreement with known demography of the populations, with population bottlenecks highly increasing ROH occurrence. Nucleotide diversity outside ROHs is high in populations derived from a large ancient population, regardless of current population size. In addition, we show an unequal genomic ROH distribution, with strong correlations of ROH size and abundance with recombination rate and GC content. Global gene content does not correlate with ROH frequency, but some ROH hotspots do contain positive selected genes in commercial lines and wild populations. This study highlights the importance of the influence of demography and recombination on homozygosity in the genome to understand
Jorjani, Hadi; Zavolan, Mihaela
Accurate identification of transcription start sites (TSSs) is an essential step in the analysis of transcription regulatory networks. In higher eukaryotes, the capped analysis of gene expression technology enabled comprehensive annotation of TSSs in genomes such as those of mice and humans. In bacteria, an equivalent approach, termed differential RNA sequencing (dRNA-seq), has recently been proposed, but the application of this approach to a large number of genomes is hindered by the paucity of computational analysis methods. With few exceptions, when the method has been used, annotation of TSSs has been largely done manually. In this work, we present a computational method called 'TSSer' that enables the automatic inference of TSSs from dRNA-seq data. The method rests on a probabilistic framework for identifying both genomic positions that are preferentially enriched in the dRNA-seq data as well as preferentially captured relative to neighboring genomic regions. Evaluating our approach for TSS calling on several publicly available datasets, we find that TSSer achieves high consistency with the curated lists of annotated TSSs, but identifies many additional TSSs. Therefore, TSSer can accelerate genome-wide identification of TSSs in bacterial genomes and can aid in further characterization of bacterial transcription regulatory networks. TSSer is freely available under GPL license at http://www.clipz.unibas.ch/TSSer/index.php
Faucon, Frederic; Dusfour, Isabelle; Gaude, Thierry; Navratil, Vincent; Boyer, Frederic; Chandre, Fabrice; Sirisopa, Patcharawan; Thanispong, Kanutcharee; Juntarajumnong, Waraporn; Poupardin, Rodolphe; Chareonviriyaphap, Theeraphap; Girod, Romain; Corbel, Vincent; Reynaud, Stephane; David, Jean-Philippe
The capacity of mosquitoes to resist insecticides threatens the control of diseases such as dengue and malaria. Until alternative control tools are implemented, characterizing resistance mechanisms is crucial for managing resistance in natural populations. Insecticide biodegradation by detoxification enzymes is a common resistance mechanism; however, the genomic changes underlying this mechanism have rarely been identified, precluding individual resistance genotyping. In particular, the role of copy number variations (CNVs) and polymorphisms of detoxification enzymes have never been investigated at the genome level, although they can represent robust markers of metabolic resistance. In this context, we combined target enrichment with high-throughput sequencing for conducting the first comprehensive screening of gene amplifications and polymorphisms associated with insecticide resistance in mosquitoes. More than 760 candidate genes were captured and deep sequenced in several populations of the dengue mosquito Ae. aegypti displaying distinct genetic backgrounds and contrasted resistance levels to the insecticide deltamethrin. CNV analysis identified 41 gene amplifications associated with resistance, most affecting cytochrome P450s overtranscribed in resistant populations. Polymorphism analysis detected more than 30,000 variants and strong selection footprints in specific genomic regions. Combining Bayesian and allele frequency filtering approaches identified 55 nonsynonymous variants strongly associated with resistance. Both CNVs and polymorphisms were conserved within regions but differed across continents, confirming that genomic changes underlying metabolic resistance to insecticides are not universal. By identifying novel DNA markers of insecticide resistance, this study opens the way for tracking down metabolic changes developed by mosquitoes to resist insecticides within and among populations. PMID:26206155
Full Text Available Progress in understanding complex genetic diseases has been bolstered by synthetic approaches that overlay diverse data types and analyses to identify functionally important genes. Pre-term birth (PTB, a major complication of pregnancy, is a leading cause of infant mortality worldwide. A major obstacle in addressing PTB is that the mechanisms controlling parturition and birth timing remain poorly understood. Integrative approaches that overlay datasets derived from comparative genomics with function-derived ones have potential to advance our understanding of the genetics of birth timing, and thus provide insights into the genes that may contribute to PTB. We intersected data from fast evolving coding and non-coding gene regions in the human and primate lineage with data from genes expressed in the placenta, from genes that show enriched expression only in the placenta, as well as from genes that are differentially expressed in four distinct PTB clinical subtypes. A large fraction of genes that are expressed in placenta, and differentially expressed in PTB clinical subtypes (23-34% are fast evolving, and are associated with functions that include adhesion neurodevelopmental and immune processes. Functional categories of genes that express fast evolution in coding regions differ from those linked to fast evolution in non-coding regions. Finally, there is a surprising lack of overlap between fast evolving genes that are differentially expressed in four PTB clinical subtypes. Integrative approaches, especially those that incorporate evolutionary perspectives, can be successful in identifying potential genetic contributions to complex genetic diseases, such as PTB.
Full Text Available GRAbB (Genomic Region Assembly by Baiting is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often neglected or poorly assembled, although they contain interesting information from phylogenetic or epidemiologic perspectives, but also single copy regions can be assembled. The program is capable of targeting multiple regions within a single run. Furthermore, GRAbB can be used to extract specific loci from NGS data, based on homology, like sequences that are used for barcoding. To make the assembly specific, a known part of the region, such as the sequence of a PCR amplicon or a homologous sequence from a related species must be specified. By assembling only the region of interest, the assembly process is computationally much less demanding and may lead to assemblies of better quality. In this study the different applications and functionalities of the program are demonstrated such as: exhaustive assembly (rDNA region and mitochondrial genome, extracting homologous regions or genes (IGS, RPB1, RPB2 and TEF1a, as well as extracting multiple regions within a single run. The program is also compared with MITObim, which is meant for the exhaustive assembly of a single target based on a similar query sequence. GRAbB is shown to be more efficient than MITObim in terms of speed, memory and disk usage. The other functionalities (handling multiple targets simultaneously and extracting homologous regions of the new program are not matched by other programs. The program is available with explanatory documentation at https://github.com/b-brankovics/grabb. GRAbB has been tested on Ubuntu (12.04 and 14.04, Fedora (23, CentOS (7.1.1503 and Mac OS X (10.7. Furthermore, GRAbB is available as a docker repository: brankovics/grabb (https://hub.docker.com/r/brankovics/grabb/.
Katherine S Pollard
Full Text Available Comparative genomics allow us to search the human genome for segments that were extensively changed in the last approximately 5 million years since divergence from our common ancestor with chimpanzee, but are highly conserved in other species and thus are likely to be functional. We found 202 genomic elements that are highly conserved in vertebrates but show evidence of significantly accelerated substitution rates in human. These are mostly in non-coding DNA, often near genes associated with transcription and DNA binding. Resequencing confirmed that the five most accelerated elements are dramatically changed in human but not in other primates, with seven times more substitutions in human than in chimp. The accelerated elements, and in particular the top five, show a strong bias for adenine and thymine to guanine and cytosine nucleotide changes and are disproportionately located in high recombination and high guanine and cytosine content environments near telomeres, suggesting either biased gene conversion or isochore selection. In addition, there is some evidence of directional selection in the regions containing the two most accelerated regions. A combination of evolutionary forces has contributed to accelerated evolution of the fastest evolving elements in the human genome.
Sergey I Nikolaev
Full Text Available Detection of the rare polymorphisms and causative mutations of genetic diseases in a targeted genomic area has become a major goal in order to understand genomic and phenotypic variability. We have interrogated repeat-masked regions of 8.9 Mb on human chromosomes 21 (7.8 Mb and 7 (1.1 Mb from an individual from the International HapMap Project (NA12872. We have optimized a method of genomic selection for high throughput sequencing. Microarray-based selection and sequencing resulted in 260-fold enrichment, with 41% of reads mapping to the target region. 83% of SNPs in the targeted region had at least 4-fold sequence coverage and 54% at least 15-fold. When assaying HapMap SNPs in NA12872, our sequence genotypes are 91.3% concordant in regions with coverage > or = 4-fold, and 97.9% concordant in regions with coverage > or = 15-fold. About 81% of the SNPs recovered with both thresholds are listed in dbSNP. We observed that regions with low sequence coverage occur in close proximity to low-complexity DNA. Validation experiments using Sanger sequencing were performed for 46 SNPs with 15-20 fold coverage, with a confirmation rate of 96%, suggesting that DNA selection provides an accurate and cost-effective method for identifying rare genomic variants.
Preston, Mark D.; Campino, Susana; Assefa, Samuel A.; Echeverry, Diego F.; Ocholla, Harold; Amambua-Ngwa, Alfred; Stewart, Lindsay B.; Conway, David J.; Borrmann, Steffen; Michon, Pascal; Zongo, Issaka; Oué draogo, Jean-Bosco; Djimde, Abdoulaye A.; Doumbo, Ogobara K.; Nosten, Francois; Pain, Arnab; Bousema, Teun; Drakeley, Chris J.; Fairhurst, Rick M.; Sutherland, Colin J.; Roper, Cally; Clark, Taane G.
Malaria is a major public health problem that is actively being addressed in a global eradication campaign. Increased population mobility through international air travel has elevated the risk of re-introducing parasites to elimination areas and dispersing drug-resistant parasites to new regions. A simple genetic marker that quickly and accurately identifies the geographic origin of infections would be a valuable public health tool for locating the source of imported outbreaks. Here we analyse the mitochondrion and apicoplast genomes of 711 Plasmodium falciparum isolates from 14 countries, and find evidence that they are non-recombining and co-inherited. The high degree of linkage produces a panel of relatively few single-nucleotide polymorphisms (SNPs) that is geographically informative. We design a 23-SNP barcode that is highly predictive (?92%) and easily adapted to aid case management in the field and survey parasite migration worldwide. 2014 Macmillan Publishers Limited. All rights reserved.
Preston, Mark D.
Malaria is a major public health problem that is actively being addressed in a global eradication campaign. Increased population mobility through international air travel has elevated the risk of re-introducing parasites to elimination areas and dispersing drug-resistant parasites to new regions. A simple genetic marker that quickly and accurately identifies the geographic origin of infections would be a valuable public health tool for locating the source of imported outbreaks. Here we analyse the mitochondrion and apicoplast genomes of 711 Plasmodium falciparum isolates from 14 countries, and find evidence that they are non-recombining and co-inherited. The high degree of linkage produces a panel of relatively few single-nucleotide polymorphisms (SNPs) that is geographically informative. We design a 23-SNP barcode that is highly predictive (?92%) and easily adapted to aid case management in the field and survey parasite migration worldwide. 2014 Macmillan Publishers Limited. All rights reserved.
Full Text Available Abstract Re-emergence of schistosomiasis in regions of China where control programs have ceased requires development of molecular-genetic tools to track gene flow and assess genetic diversity of Schistosoma populations. We identified many microsatellite loci in the draft genome of Schistosoma japonicum using defined search criteria and selected a subset for further analysis. From an initial panel of 50 loci, 20 new microsatellites were selected for eventual optimization and application to a panel of worms from endemic areas. All but one of the selected microsatellites contain simple tri-nucleotide repeats. Moderate to high levels of polymorphism were detected. Numbers of alleles ranged from 6 to 14 and observed heterozygosity was always >0.6. The loci reported here will facilitate high resolution population-genetic studies on schistosomes in re-emergent foci.
Full Text Available The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. Here we perform a comprehensive analysis using multiple test statistics to identify regions under selection in 509 dogs from 46 diverse breeds using a newly developed high-density genotyping array consisting of >170,000 evenly spaced SNPs. We first identify 44 genomic regions exhibiting extreme differentiation across multiple breeds. Genetic variation in these regions correlates with variation in several phenotypic traits that vary between breeds, and we identify novel associations with both morphological and behavioral traits. We next scan the genome for signatures of selective sweeps in single breeds, characterized by long regions of reduced heterozygosity and fixation of extended haplotypes. These scans identify hundreds of regions, including 22 blocks of homozygosity longer than one megabase in certain breeds. Candidate selection loci are strongly enriched for developmental genes. We chose one highly differentiated region, associated with body size and ear morphology, and characterized it using high-throughput sequencing to provide a list of variants that may directly affect these traits. This study provides a catalogue of genomic regions showing extreme reduction in genetic variation or population differentiation in dogs, including many linked to phenotypic variation. The many blocks of reduced haplotype diversity observed across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease.
Yauk, Carole Lyn; Argueso, J. Lucas; Auerbach, Scott S.; Awadalla, Philip; Davis, Sean R.; DeMarini, David M.; Douglas, George R.; Dubrova, Yuri E.; Elespuru, Rosalie K.; Glover, Thomas W.; Hales, Barbara F.; Hurles, Matthew E.; Klein, Catherine B.; Lupski, James R.; Manchester, David K.; Marchetti, Francesco; Montpetit, Alexandre; Mulvihill, John J.; Robaire, Bernard; Robbins, Wendie A.; Rouleau, Guy A.; Shaughnessy, Daniel T.; Somers, Christopher M.; Taylor, James G.; Trasler, Jacquetta; Waters, Michael D.; Wilson, Thomas E.; Witt, Kristine L.; Bishop, Jack B.
Next-generation sequencing technologies can now be used to directly measure heritable de novo DNA sequence mutations in humans. However, these techniques have not been used to examine environmental factors that induce such mutations and their associated diseases. To address this issue, a working group on environmentally induced germline mutation analysis (ENIGMA) met in October 2011 to propose the necessary foundational studies, which include sequencing of parent–offspring trios from highly exposed human populations, and controlled dose–response experiments in animals. These studies will establish background levels of variability in germline mutation rates and identify environmental agents that influence these rates and heritable disease. Guidance for the types of exposures to examine come from rodent studies that have identified agents such as cancer chemotherapeutic drugs, ionizing radiation, cigarette smoke, and air pollution as germ-cell mutagens. Research is urgently needed to establish the health consequences of parental exposures on subsequent generations. PMID:22935230
Lu, Wei; Wise, Michael J; Tay, Chin Yen; Windsor, Helen M; Marshall, Barry J; Peacock, Christopher; Perkins, Tim
Isolates of Helicobacter pylori can be classified phylogeographically. High genetic diversity and rapid microevolution are a hallmark of H. pylori genomes, a phenomenon that is proposed to play a functional role in persistence and colonization of diverse human populations. To provide further genomic evidence in the lineage of H. pylori and to further characterize diverse strains of this pathogen in different human populations, we report the finished genome sequence of Sahul64, an H. pylori strain isolated from an indigenous Australian. Our analysis identified genes that were highly divergent compared to the 38 publically available genomes, which include genes involved in the biosynthesis and modification of lipopolysaccharide, putative prophage genes, restriction modification components, and hypothetical genes. Furthermore, the virulence-associated vacA locus is a pseudogene and the cag pathogenicity island (cagPAI) is not present. However, the genome does contain a gene cluster associated with pathogenicity, including dupA. Our analysis found that with the addition of Sahul64 to the 38 genomes, the core genome content of H. pylori is reduced by approximately 14% (∼170 genes) and the pan-genome has expanded from 2,070 to 2,238 genes. We have identified three putative horizontally acquired regions, including one that is likely to have been acquired from the closely related Helicobacter cetorum prior to speciation. Our results suggest that Sahul64, with the absence of cagPAI, highly divergent cell envelope proteins, and a predicted nontransportable VacA protein, could be more highly adapted to ancient indigenous Australian people but with lower virulence potential compared to other sequenced and cagPAI-positive H. pylori strains.
... are often identified as regions susceptible to seasonal blooms of harmful ... that the bay acts as a net importer of bottom water and net exporter of surface waters over a synoptic cycle. This ... waves or wind stress on the surface friction layer.
Chen, Tsute; Gajare, Prasad; Olsen, Ingar; Dewhirst, Floyd E.
ABSTRACT The advent of next generation sequencing is producing more genomic sequences for various strains of many human oral microbial species and allows for insightful functional comparisons at both intra- and inter-species levels. This study performed in-silico functional comparisons for currently available genomic sequences of major species associated with periodontitis including Aggregatibacter actinomycetemcomitans (AA), Porphyromonas gingivalis (PG), Treponema denticola (TD), and Tannerella forsythia (TF), as well as several cariogenic and commensal streptococcal species. Complete or draft sequences were annotated with the RAST to infer structured functional subsystems for each genome. The subsystems profiles were clustered to groups of functions with similar patterns. Functional enrichment and depletion were evaluated based on hypergeometric distribution to identify subsystems that are unique or missing between two groups of genomes. Unique or missing metabolic pathways and biological functions were identified in different species. For example, components involved in flagellar motility were found only in the motile species TD, as expected, with few exceptions scattered in several streptococcal species, likely associated with chemotaxis. Transposable elements were only found in the two Bacteroidales species PG and TF, and half of the AA genomes. Genes involved in CRISPR were prevalent in most oral species. Furthermore, prophage related subsystems were also commonly found in most species except for PG and Streptococcus mutans, in which very few genomes contain prophage components. Comparisons between pathogenic (P) and nonpathogenic (NP) genomes also identified genes potentially important for virulence. Two such comparisons were performed between AA (P) and several A. aphrophilus (NP) strains, and between S. mutans + S. sobrinus (P) and other oral streptococcal species (NP). This comparative genomics approach can be readily used to identify functions unique to
Full Text Available The spatial structures of cities have changed dramatically with rapid socio-economic development in ways that are not well understood. To support urban structural analysis and rational planning, we propose a framework to identify urban functional regions and quantitatively explore the intensity of the interactions between them, thus increasing the understanding of urban structures. A method for the identification of functional regions via spatial semantics is proposed, which involves two steps: (1 the study area is classified into three types of functional regions using taxi origin/destination (O/D flows; and (2 the spatial semantics for the three types of functional regions are demonstrated based on point-of-interest (POI categories. To validate the existence of urban functional regions, we explored the intensity of interactions quantitatively between them. A case study using POI data and taxi trajectory data from Beijing validates the proposed framework. The results show that the proposed framework can be used to identify urban functional regions and promotes an enhanced understanding of urban structures.
Rachel A Mann
Full Text Available The plant pathogen Erwinia amylovora can be divided into two host-specific groupings; strains infecting a broad range of hosts within the Rosaceae subfamily Spiraeoideae (e.g., Malus, Pyrus, Crataegus, Sorbus and strains infecting Rubus (raspberries and blackberries. Comparative genomic analysis of 12 strains representing distinct populations (e.g., geographic, temporal, host origin of E. amylovora was used to describe the pan-genome of this major pathogen. The pan-genome contains 5751 coding sequences and is highly conserved relative to other phytopathogenic bacteria comprising on average 89% conserved, core genes. The chromosomes of Spiraeoideae-infecting strains were highly homogeneous, while greater genetic diversity was observed between Spiraeoideae- and Rubus-infecting strains (and among individual Rubus-infecting strains, the majority of which was attributed to variable genomic islands. Based on genomic distance scores and phylogenetic analysis, the Rubus-infecting strain ATCC BAA-2158 was genetically more closely related to the Spiraeoideae-infecting strains of E. amylovora than it was to the other Rubus-infecting strains. Analysis of the accessory genomes of Spiraeoideae- and Rubus-infecting strains has identified putative host-specific determinants including variation in the effector protein HopX1(Ea and a putative secondary metabolite pathway only present in Rubus-infecting strains.
Singer, Meromit; Engström, Alexander; Schönhuth, Alexander; Pachter, Lior
Recent experimental and computational work confirms that CpGs can be unmethylated inside coding exons, thereby showing that codons may be subjected to both genomic and epigenomic constraint. It is therefore of interest to identify coding CpG islands (CCGIs) that are regions inside exons enriched for CpGs. The difficulty in identifying such islands is that coding exons exhibit sequence biases determined by codon usage and constraints that must be taken into account. We present a method for finding CCGIs that showcases a novel approach we have developed for identifying regions of interest that are significant (with respect to a Markov chain) for the counts of any pattern. Our method begins with the exact computation of tail probabilities for the number of CpGs in all regions contained in coding exons, and then applies a greedy algorithm for selecting islands from among the regions. We show that the greedy algorithm provably optimizes a biologically motivated criterion for selecting islands while controlling the false discovery rate. We applied this approach to the human genome (hg18) and annotated CpG islands in coding exons. The statistical criterion we apply to evaluating islands reduces the number of false positives in existing annotations, while our approach to defining islands reveals significant numbers of undiscovered CCGIs in coding exons. Many of these appear to be examples of functional epigenetic specialization in coding exons.
Consistent but indirect evidence has implicated genetic factors in smoking behavior. We report meta-analyses of several smoking phenotypes within cohorts of the Tobacco and Genetics Consortium (n = 74,053). We also partnered with the European Network of Genetic and Genomic Epidemiology (ENGAGE) and Oxford-GlaxoSmithKline (Ox-GSK) consortia to follow up the 15 most significant regions (n > 140,000). We identified three loci associated with number of cigarettes smoked per day. The strongest association was a synonymous 15q25 SNP in the nicotinic receptor gene CHRNA3 (rs1051730[A], beta = 1.03, standard error (s.e.) = 0.053, P = 2.8 x 10(-73)). Two 10q25 SNPs (rs1329650[G], beta = 0.367, s.e. = 0.059, P = 5.7 x 10(-10); and rs1028936[A], beta = 0.446, s.e. = 0.074, P = 1.3 x 10(-9)) and one 9q13 SNP in EGLN2 (rs3733829[G], beta = 0.333, s.e. = 0.058, P = 1.0 x 10(-8)) also exceeded genome-wide significance for cigarettes per day. For smoking initiation, eight SNPs exceeded genome-wide significance, with the strongest association at a nonsynonymous SNP in BDNF on chromosome 11 (rs6265[C], odds ratio (OR) = 1.06, 95% confidence interval (Cl) 1.04-1.08, P = 1.8 x 10(-8)). One SNP located near DBH on chromosome 9 (rs3025343[G], OR = 1.12, 95% Cl 1.08-1.18, P = 3.6 x 10(-8)) was significantly associated with smoking cessation.
Lu, Xinguo; Lu, Jibo
Integrative analysis of molecular mechanics underlying cancer can distinguish interactions that cannot be revealed based on one kind of data for the appropriate diagnosis and treatment of cancer patients. Tumor samples exhibit heterogeneity in omics data, such as somatic mutations, Copy Number Variations CNVs), gene expression profiles and so on. In this paper we combined gene co-expression modules and mutation modulators separately in tumor patients to obtain the candidate driver genes for resistant and sensitive tumor from the heterogeneous data. The final list of modulators identified are well known in biological processes associated with ovarian cancer, such as CCL17, CACTIN, CCL16, CCL22, APOB, KDF1, CCL11, HNF1B, LRG1, MED1 and so on, which can help to facilitate the discovery of biomarkers, molecular diagnostics, and drug discovery.
Background: The purpose of this study is to: i) develop a computational model of promoters of human histone-encoding genes (shortly histone genes), an important class of genes that participate in various critical cellular processes, ii) use the model so developed to identify regions across the human genome that have similar structure as promoters of histone genes; such regions could represent potential genomic regulatory regions, e.g. promoters, of genes that may be coregulated with histone genes, and iii/ identify in this way genes that have high likelihood of being coregulated with the histone genes.Results: We successfully developed a histone promoter model using a comprehensive collection of histone genes. Based on leave-one-out cross-validation test, the model produced good prediction accuracy (94.1% sensitivity, 92.6% specificity, and 92.8% positive predictive value). We used this model to predict across the genome a number of genes that shared similar promoter structures with the histone gene promoters. We thus hypothesize that these predicted genes could be coregulated with histone genes. This hypothesis matches well with the available gene expression, gene ontology, and pathways data. Jointly with promoters of the above-mentioned genes, we found a large number of intergenic regions with similar structure as histone promoters.Conclusions: This study represents one of the most comprehensive computational analyses conducted thus far on a genome-wide scale of promoters of human histone genes. Our analysis suggests a number of other human genes that share a high similarity of promoter structure with the histone genes and thus are highly likely to be coregulated, and consequently coexpressed, with the histone genes. We also found that there are a large number of intergenic regions across the genome with their structures similar to promoters of histone genes. These regions may be promoters of yet unidentified genes, or may represent remote control regions that
Full Text Available 4C-Seq has proven to be a powerful technique to identify genome-wide interactions with a single locus of interest (or "bait" that can be important for gene regulation. However, analysis of 4C-Seq data is complicated by the many biases inherent to the technique. An important consideration when dealing with 4C-Seq data is the differences in resolution of signal across the genome that result from differences in 3D distance separation from the bait. This leads to the highest signal in the region immediately surrounding the bait and increasingly lower signals in far-cis and trans. Another important aspect of 4C-Seq experiments is the resolution, which is greatly influenced by the choice of restriction enzyme and the frequency at which it can cut the genome. Thus, it is important that a 4C-Seq analysis method is flexible enough to analyze data generated using different enzymes and to identify interactions across the entire genome. Current methods for 4C-Seq analysis only identify interactions in regions near the bait or in regions located in far-cis and trans, but no method comprehensively analyzes 4C signals of different length scales. In addition, some methods also fail in experiments where chromatin fragments are generated using frequent cutter restriction enzymes. Here, we describe 4C-ker, a Hidden-Markov Model based pipeline that identifies regions throughout the genome that interact with the 4C bait locus. In addition, we incorporate methods for the identification of differential interactions in multiple 4C-seq datasets collected from different genotypes or experimental conditions. Adaptive window sizes are used to correct for differences in signal coverage in near-bait regions, far-cis and trans chromosomes. Using several datasets, we demonstrate that 4C-ker outperforms all existing 4C-Seq pipelines in its ability to reproducibly identify interaction domains at all genomic ranges with different resolution enzymes.
Yan, Hong-Bin; Lou, Zhong-Zi; Li, Li; Brindley, Paul J; Zheng, Yadong; Luo, Xuenong; Hou, Junling; Guo, Aijiang; Jia, Wan-Zhong; Cai, Xuepeng
Cysticercosis remains a major neglected tropical disease of humanity in many regions, especially in sub-Saharan Africa, Central America and elsewhere. Owing to the emerging drug resistance and the inability of current drugs to prevent re-infection, identification of novel vaccines and chemotherapeutic agents against Taenia solium and related helminth pathogens is a public health priority. The T. solium genome and the predicted proteome were reported recently, providing a wealth of information from which new interventional targets might be identified. In order to characterize and classify the entire repertoire of protease-encoding genes of T. solium, which act fundamental biological roles in all life processes, we analyzed the predicted proteins of this cestode through a combination of bioinformatics tools. Functional annotation was performed to yield insights into the signaling processes relevant to the complex developmental cycle of this tapeworm and to highlight a suite of the proteases as potential intervention targets. Within the genome of this helminth parasite, we identified 200 open reading frames encoding proteases from five clans, which correspond to 1.68% of the 11,902 protein-encoding genes predicted to be present in its genome. These proteases include calpains, cytosolic, mitochondrial signal peptidases, ubiquitylation related proteins, and others. Many not only show significant similarity to proteases in the Conserved Domain Database but have conserved active sites and catalytic domains. KEGG Automatic Annotation Server (KAAS) analysis indicated that ~60% of these proteases share strong sequence identities with proteins of the KEGG database, which are involved in human disease, metabolic pathways, genetic information processes, cellular processes, environmental information processes and organismal systems. Also, we identified signal peptides and transmembrane helices through comparative analysis with classes of important regulatory proteases
Sørensen, Lars P; Janss, Luc; Madsen, Per
was used. There was a clear difference in the region-wise patterns of genomic correlation among combinations of traits, with distinctive peaks indicating the presence of pleiotropic QTL. CONCLUSIONS: The results show that it is possible to estimate, genome-wide and region-wise genomic (co)variances......BACKGROUND: Multi-trait genomic models in a Bayesian context can be used to estimate genomic (co)variances, either for a complete genome or for genomic regions (e.g. per chromosome) for the purpose of multi-trait genomic selection or to gain further insight into the genomic architecture of related...... with a common prior distribution for the marker allele substitution effects and estimation of the hyperparameters in this prior distribution from the progeny means data. From the Markov chain Monte Carlo samples of the allele substitution effects, genomic (co)variances were calculated on a whole-genome level...
Persson, U.; Möller, B.; Werner, S.
This study presents a methodology to assess annual excess heat volumes from fuel combustion activities in energy and industry sector facilities based on carbon dioxide emission data. The aim is to determine regional balances of excess heat relative heat demands for all third level administrative regions in the European Union (EU) and to identify strategic regions suitable for large-scale implementation of district heating. The approach is motivated since the efficiency of current supply structures to meet building heat demands, mainly characterised by direct use of primary energy sources, is low and improvable. District heating is conceived as an urban supply side energy efficiency measure employable to enhance energy system efficiency by increased excess heat recoveries; hereby reducing primary energy demands by fuel substitution. However, the importance of heat has long been underestimated in EU decarbonisation strategies and local heat synergies have often been overlooked in energy models used for such scenarios. Study results indicate that 46% of all excess heat in EU27, corresponding to 31% of total building heat demands, is located within identified strategic regions. Still, a realisation of these rich opportunities will require higher recognition of the heat sector in future EU energy policy. - Highlights: • EU27 energy and industry sector heat recycling resources are mapped and quantified. • Target regions for large-scale implementation of district heating are identified. • 46% of total EU27 excess heat volume is seized in 63 strategic heat synergy regions. • Large urban zones have lead roles to play in transition to sustainability in Europe. • Higher recognition of heat sector is needed in future EU energy policy for realisation
Full Text Available Identification of genetic polymorphisms and subsequent development of molecular markers is important for marker assisted breeding of superior cultivars of economically important species. Sweet cherry (Prunus avium L. is an economically important non-climacteric tree fruit crop in the Rosaceae family and has undergone a genetic bottleneck due to breeding, resulting in limited genetic diversity in the germplasm that is utilized for breeding new cultivars. Therefore, it is critical to recognize the best platforms for identifying genome-wide polymorphisms that can help identify, and consequently preserve, the diversity in a genetically constrained species. For the identification of polymorphisms in five closely related genotypes of sweet cherry, a gel-based approach (TRAP, reduced representation sequencing (TRAPseq, a 6k cherry SNParray, and whole genome sequencing (WGS approaches were evaluated in the identification of genome-wide polymorphisms in sweet cherry cultivars. All platforms facilitated detection of polymorphisms among the genotypes with variable efficiency. In assessing multiple SNP detection platforms, this study has demonstrated that a combination of appropriate approaches is necessary for efficient polymorphism identification, especially between closely related cultivars of a species. The information generated in this study provides a valuable resource for future genetic and genomic studies in sweet cherry, and the insights gained from the evaluation of multiple approaches can be utilized for other closely related species with limited genetic diversity in the breeding germplasm. Keywords: Polymorphisms, Prunus avium, Next-generation sequencing, Target region amplification polymorphism (TRAP, Genetic diversity, SNParray, Reduced representation sequencing, Whole genome sequencing (WGS
Full Text Available Melampsora larici-populina is a fungal pathogen responsible for foliar rust disease on poplar trees, which causes damage to forest plantations worldwide, particularly in Northern Europe. The reference genome of the isolate 98AG31 was previously sequenced using a whole genome shotgun strategy, revealing a large genome of 101 megabases containing 16,399 predicted genes, which included secreted protein genes representing poplar rust candidate effectors. In the present study, the genomes of 15 isolates collected over the past 20 years throughout the French territory, representing distinct virulence profiles, were characterized by massively parallel sequencing to assess genetic variation in the poplar rust fungus. Comparison to the reference genome revealed striking structural variations. Analysis of coverage and sequencing depth identified large missing regions between isolates related to the mating type loci. More than 611,824 single-nucleotide polymorphism (SNP positions were uncovered overall, indicating a remarkable level of polymorphism. Based on the accumulation of non-synonymous substitutions in coding sequences and the relative frequencies of synonymous and non-synonymous polymorphisms (i.e. PN/PS, we identify candidate genes that may be involved in fungal pathogenesis. Correlation between non-synonymous SNPs in genes encoding secreted proteins and pathotypes of the studied isolates revealed candidate genes potentially related to virulences 1, 6 and 8 of the poplar rust fungus.
Gussow, Ayal B; Copeland, Brett R; Dhindsa, Ryan S; Wang, Quanli; Petrovski, Slavé; Majoros, William H; Allen, Andrew S; Goldstein, David B
There is broad agreement that genetic mutations occurring outside of the protein-coding regions play a key role in human disease. Despite this consensus, we are not yet capable of discerning which portions of non-coding sequence are important in the context of human disease. Here, we present Orion, an approach that detects regions of the non-coding genome that are depleted of variation, suggesting that the regions are intolerant of mutations and subject to purifying selection in the human lineage. We show that Orion is highly correlated with known intolerant regions as well as regions that harbor putatively pathogenic variation. This approach provides a mechanism to identify pathogenic variation in the human non-coding genome and will have immediate utility in the diagnostic interpretation of patient genomes and in large case control studies using whole-genome sequences.
Cornick, Jennifer E.; Chaguza, Chrispin; Yalcin, Feyruz; Harris, Simon R.; Gray, Katherine J.; Kiran, Anmol M.; Molyneux, Elizabeth; French, Neil; Faragher, Brian E.; Everett, Dean B.; Bentley, Stephen D.
Streptococcus pneumoniae is a nasopharyngeal commensal that occasionally invades normally sterile sites to cause bloodstream infection and meningitis. Although the pneumococcal population structure and evolutionary genetics are well defined, it is not clear whether pneumococci that cause meningitis are genetically distinct from those that do not. Here, we used whole-genome sequencing of 140 isolates of S. pneumoniae recovered from bloodstream infection (n = 70) and meningitis (n = 70) to compare their genetic contents. By fitting a double-exponential decaying-function model, we show that these isolates share a core of 1,427 genes (95% confidence interval [CI], 1,425 to 1,435 genes) and that there is no difference in the core genome or accessory gene content from these disease manifestations. Gene presence/absence alone therefore does not explain the virulence behavior of pneumococci that reach the meninges. Our analysis, however, supports the requirement of a range of previously described virulence factors and vaccine candidates for both meningitis- and bacteremia-causing pneumococci. This high-resolution view suggests that, despite considerable competency for genetic exchange, all pneumococci are under considerable pressure to retain key components advantageous for colonization and transmission and that these components are essential for access to and survival in sterile sites. PMID:26259813
Ming TIAN,Rui HAO,Suyun FANG,Yanqiang WANG,Xiaorong GU,Chungang FENG,Xiaoxiang HU,Ning LI
Full Text Available A unique characteristic of the Silkie chicken is its fibromelanosis phenotype. The dermal layer of its skin, its connective tissue and shank dermis are hyperpigmented. This dermal hyperpigmentation phenotype is controlled by the sex-linked inhibitor of dermal melanin gene (ID and the dominant fibromelanosis allele. This study attempted to confirm the genomic region associated with ID. By genotyping, ID was found to be closely linked to the region between GGA_rs16127903 and GGA_rs14685542 (8406919 bp on chromosome Z, which contains ten functional genes. The expression of these genes was characterized in the embryo and 4 days after hatching and it was concluded that MTAP, encoding methylthioadenosinephosphorylase, would be the most likely candidate gene. Finally, target DNA capture and sequence analysis was performed, but no specific SNP(s was found in the targeted region of the Silkie genome. Further work is necessary to identify the causal ID mutation located on chromosome Z.
Full Text Available This data article contains the benchmark dataset for training and testing iRNA-Methyl, a web-server predictor for identifying N6-methyladenosine sites in RNA (Chen et al., 2015 . It can also be used to develop other predictors for identifying N6-methyladenosine sites in the Saccharomyces cerevisiae genome.
Christine E McLaren
Full Text Available The existence of multiple inherited disorders of iron metabolism in man, rodents and other vertebrates suggests genetic contributions to iron deficiency. To identify new genomic locations associated with iron deficiency, a genome-wide association study (GWAS was performed using DNA collected from white men aged≥25 y and women≥50 y in the Hemochromatosis and Iron Overload Screening (HEIRS Study with serum ferritin (SF≤12 µg/L (cases and iron replete controls (SF>100 µg/L in men, SF>50 µg/L in women. Regression analysis was used to examine the association between case-control status (336 cases, 343 controls and quantitative serum iron measures and 331,060 single nucleotide polymorphism (SNP genotypes, with replication analyses performed in a sample of 71 cases and 161 controls from a population of white male and female veterans screened at a US Veterans Affairs (VA medical center. Five SNPs identified in the GWAS met genome-wide statistical significance for association with at least one iron measure, rs2698530 on chr. 2p14; rs3811647 on chr. 3q22, a known SNP in the transferrin (TF gene region; rs1800562 on chr. 6p22, the C282Y mutation in the HFE gene; rs7787204 on chr. 7p21; and rs987710 on chr. 22q11 (GWAS observed P<1.51×10(-7 for all. An association between total iron binding capacity and SNP rs3811647 in the TF gene (GWAS observed P=7.0×10(-9, corrected P=0.012 was replicated within the VA samples (observed P=0.012. Associations with the C282Y mutation in the HFE gene also were replicated. The joint analysis of the HEIRS and VA samples revealed strong associations between rs2698530 on chr. 2p14 and iron status outcomes. These results confirm a previously-described TF polymorphism and implicate one potential new locus as a target for gene identification.
Saija J Ahonen
Full Text Available Glaucoma is an optic neuropathy and one of the leading causes of blindness. Its hereditary forms are classified into primary closed-angle (PCAG, primary open-angle (POAG and primary congenital glaucoma (PCG. Although many loci have been mapped in human, only a few genes have been identified that are associated with the development of glaucoma and the genetic basis of the disease remains poorly understood. Glaucoma has also been described in many dog breeds, including Dandie Dinmont Terriers (DDT in which it is a late-onset (>7 years disease. We designed clinical and genetic studies to better define the clinical features of glaucoma in the DDT and to identify the genetic cause. Clinical diagnosis was based on ophthalmic examinations of the affected dogs and 18 additionally investigated unaffected DDTs. We collected DNA from over 400 DTTs and a genome wide association study was performed in a cohort of 23 affected and 23 controls, followed by a fine mapping, a replication study and candidate gene sequencing. The clinical study suggested that ocular abnormalities including abnormal iridocorneal angles and pectinate ligament dysplasia are common (50% and 72%, respectively in the breed and the disease resembles human PCAG. The genetic study identified a novel 9.5 Mb locus on canine chromosome 8 including the 1.6 Mb best associated region (p = 1.63 × 10(-10, OR = 32 for homozygosity. Mutation screening in five candidate genes did not reveal any causative variants. This study indicates that although ocular abnormalities are common in DDTs, the genetic risk for glaucoma is conferred by a novel locus on CFA8. The canine locus shares synteny to a region in human chromosome 14q, which harbors several loci associated with POAG and PCG. Our study reveals a new locus for canine glaucoma and ongoing molecular studies will likely help to understand the genetic etiology of the disease.
Guo, Xingyi; Shi, Jiajun; Cai, Qiuyin; Shu, Xiao-Ou; He, Jing; Wen, Wanqing; Allen, Jamie; Pharoah, Paul; Dunning, Alison; Hunter, David J; Kraft, Peter; Easton, Douglas F; Zheng, Wei; Long, Jirong
Functional disruptions of susceptibility genes by large genomic structure variant (SV) deletions in germlines are known to be associated with cancer risk. However, few studies have been conducted to systematically search for SV deletions in breast cancer susceptibility genes. We analysed deep (> 30x) whole-genome sequencing (WGS) data generated in blood samples from 128 breast cancer patients of Asian and European descent with either a strong family history of breast cancer or early cancer onset disease. To identify SV deletions in known or suspected breast cancer susceptibility genes, we used multiple SV calling tools including Genome STRiP, Delly, Manta, BreakDancer and Pindel. SV deletions were detected by at least three of these bioinformatics tools in five genes. Specifically, we identified heterozygous deletions covering a fraction of the coding regions of BRCA1 (with approximately 80kb in two patients), and TP53 genes (with ∼1.6 kb in two patients), and of intronic regions (∼1 kb) of the PALB2 (one patient), PTEN (three patients) and RAD51C genes (one patient). We confirmed the presence of these deletions using real-time quantitative PCR (qPCR). Our study identified novel SV deletions in breast cancer susceptibility genes and the identification of such SV deletions may improve clinical testing.
Höglund, Johanna; Guldbrandtsen, Bernt; Lund, Mogens Sandø
6 QTL were detected for FTI: one QTL on each of BTA7, BTA20, BTA23, BTA25, and two QTL on BTA9 (QTL9–1 and QTL9–2). In the second step, ICF showed association with the QTL regions on BTA7, QTL9–2 QTL2 on BTA9, and BTA25, AIS for cows on BTA20 and BTA23, AIS for heifers on QTL9–2 on BTA9, IFL...... for cows on BTA20, BTA23 and BTA25, IFL for heifers on BTA7 and QTL9-2 on BTA9, NRR for heifers on BTA7 and BTA23, and NRR for cows on BTA23. Conclusion: The genome wide association study presented here revealed 6 genomic regions associated with FTI. Screening these 6 QTL regions for the underlying female...... quantitative trait locus regions were re-analyzed using a linear mixed model (animal model) for both FTI and its component traits AIS, NRR, IFL and ICF. The underlying traits were analyzed separately for heifers (first parity cows) and cows (later parity cows) for AIS, NRR, and IFL. Results: In the first step...
Saito, Kuniaki; Mukasa, Akitake; Nagae, Genta; Aihara, Koki; Otani, Ryohei; Takayanagi, Shunsaku; Omata, Mayu; Tanaka, Shota; Shibahara, Junji; Takahashi, Miwako; Momose, Toshimitsu; Shimamura, Teppei; Miyano, Satoru; Narita, Yoshitaka; Ueki, Keisuke; Nishikawa, Ryo; Nagane, Motoo; Aburatani, Hiroyuki; Saito, Nobuhito
Low-grade gliomas often undergo malignant progression, and these transformations are a leading cause of death in patients with low-grade gliomas. However, the molecular mechanisms underlying malignant tumor progression are still not well understood. Recent evidence indicates that epigenetic deregulation is an important cause of gliomagenesis; therefore, we examined the impact of epigenetic changes during malignant progression of low-grade gliomas. Specifically, we used the Illumina Infinium Human Methylation 450K BeadChip to perform genome-wide DNA methylation analysis of 120 gliomas and four normal brains. This study sample included 25 matched-pairs of initial low-grade gliomas and recurrent tumors (temporal heterogeneity) and 20 of the 25 recurring tumors recurred as malignant progressions, and one matched-pair of newly emerging malignant lesions and pre-existing lesions (spatial heterogeneity). Analyses of methylation profiles demonstrated that most low-grade gliomas in our sample (43/51; 84%) had a CpG island methylator phenotype (G-CIMP). Remarkably, approximately 50% of secondary glioblastomas that had progressed from low-grade tumors with the G-CIMP status exhibited a characteristic partial demethylation of genomic DNA during malignant progression, but other recurrent gliomas showed no apparent change in DNA methylation pattern. Interestingly, we found that most loci that were demethylated during malignant progression were located outside of CpG islands. The information of histone modifications patterns in normal human astrocytes and embryonal stem cells also showed that the ratio of active marks at the site corresponding to DNA demethylated loci in G-CIMP-demethylated tumors was significantly lower; this finding indicated that most demethylated loci in G-CIMP-demethylated tumors were likely transcriptionally inactive. A small number of the genes that were upregulated and had demethylated CpG islands were associated with cell cycle-related pathway. In
Background Apple is an economically important fruit crop worldwide. Developing a genetic linkage map is a critical step towards mapping and cloning of genes responsible for important horticultural traits in apple. To facilitate linkage map construction, we surveyed and characterized the distribution and frequency of perfect microsatellites in assembled contig sequences of the apple genome. Results A total of 28,538 SSRs have been identified in the apple genome, with an overall density of 40.8 SSRs per Mb. Di-nucleotide repeats are the most frequent microsatellites in the apple genome, accounting for 71.9% of all microsatellites. AT/TA repeats are the most frequent in genomic regions, accounting for 38.3% of all the G-SSRs, while AG/GA dimers prevail in transcribed sequences, and account for 59.4% of all EST-SSRs. A total set of 310 SSRs is selected to amplify eight apple genotypes. Of these, 245 (79.0%) are found to be polymorphic among cultivars and wild species tested. AG/GA motifs in genomic regions have detected more alleles and higher PIC values than AT/TA or AC/CA motifs. Moreover, AG/GA repeats are more variable than any other dimers in apple, and should be preferentially selected for studies, such as genetic diversity and linkage map construction. A total of 54 newly developed apple SSRs have been genetically mapped. Interestingly, clustering of markers with distorted segregation is observed on linkage groups 1, 2, 10, 15, and 16. A QTL responsible for malic acid content of apple fruits is detected on linkage group 8, and accounts for ~13.5% of the observed phenotypic variation. Conclusions This study demonstrates that di-nucleotide repeats are prevalent in the apple genome and that AT/TA and AG/GA repeats are the most frequent in genomic and transcribed sequences of apple, respectively. All SSR motifs identified in this study as well as those newly mapped SSRs will serve as valuable resources for pursuing apple genetic studies, aiding the apple breeding
Ren, Shancheng; Wei, Gong-Hong; Liu, Dongbing
BACKGROUND: Global disparities in prostate cancer (PCa) incidence highlight the urgent need to identify genomic abnormalities in prostate tumors in different ethnic populations including Asian men. OBJECTIVE: To systematically explore the genomic complexity and define disease-driven genetic......-scale and comprehensive genomic data of prostate cancer from Asian population. Identification of these genetic alterations may help advance prostate cancer diagnosis, prognosis, and treatment....... alterations in PCa. DESIGN, SETTING, AND PARTICIPANTS: The study sequenced whole-genome and transcriptome of tumor-benign paired tissues from 65 treatment-naive Chinese PCa patients. Subsequent targeted deep sequencing of 293 PCa-relevant genes was performed in another cohort of 145 prostate tumors. OUTCOME...
Tsai, Chia-Ti; Hsieh, Chia-Shan; Chang, Sheng-Nan; Chuang, Eric Y.; Ueng, Kwo-Chang; Tsai, Chin-Feng; Lin, Tsung-Hsien; Wu, Cho-Kai; Lee, Jen-Kuang; Lin, Lian-Yu; Wang, Yi-Chih; Yu, Chih-Chieh; Lai, Ling-Ping; Tseng, Chuen-Den; Hwang, Juey-Jen; Chiang, Fu-Tien; Lin, Jiunn-Lee
Atrial fibrillation (AF) is the most common sustained cardiac arrhythmia. Previous genome-wide association studies had identified single-nucleotide polymorphisms in several genomic regions to be associated with AF. In human genome, copy number variations (CNVs) are known to contribute to disease susceptibility. Using a genome-wide multistage approach to identify AF susceptibility CNVs, we here show a common 4,470-bp diallelic CNV in the first intron of potassium interacting channel 1 gene (KCNIP1) is strongly associated with AF in Taiwanese populations (odds ratio=2.27 for insertion allele; P=6.23 × 10−24). KCNIP1 insertion is associated with higher KCNIP1 mRNA expression. KCNIP1-encoded protein potassium interacting channel 1 (KCHIP1) is physically associated with potassium Kv channels and modulates atrial transient outward current in cardiac myocytes. Overexpression of KCNIP1 results in inducible AF in zebrafish. In conclusions, a common CNV in KCNIP1 gene is a genetic predictor of AF risk possibly pointing to a functional pathway. PMID:26831368
Full Text Available Current target enrichment systems for large-scale next-generation sequencing typically require synthetic oligonucleotides used as capture reagents to isolate sequences of interest. The majority of target enrichment reagents are focused on gene coding regions or promoters en masse. Here we introduce development of a customizable targeted capture system using biotinylated RNA probe baits transcribed from sheared bacterial artificial chromosome clone templates that enables capture of large, contiguous blocks of the genome for sequencing applications. This clone adapted template capture hybridization sequencing (CATCH-Seq procedure can be used to capture both coding and non-coding regions of a gene, and resolve the boundaries of copy number variations within a genomic target site. Furthermore, libraries constructed with methylated adapters prior to solution hybridization also enable targeted bisulfite sequencing. We applied CATCH-Seq to diverse targets ranging in size from 125 kb to 3.5 Mb. Our approach provides a simple and cost effective alternative to other capture platforms because of template-based, enzymatic probe synthesis and the lack of oligonucleotide design costs. Given its similarity in procedure, CATCH-Seq can also be performed in parallel with commercial systems.
Jiao, Hong; Arner, Peter; Hoffstedt, Johan
Recent genome-wide association (GWA) analyses have identified common single nucleotide polymorphisms (SNPs) that are associated with obesity. However, the reported genetic variation in obesity explains only a minor fraction of the total genetic variation expected to be present in the population....... Thus many genetic variants controlling obesity remain to be identified. The aim of this study was to use GWA followed by multiple stepwise validations to identify additional genes associated with obesity....
Christensen, Kim; Manani, Kishan A.; Peters, Nicholas S.
Atrial fibrillation (AF) is the most common abnormal heart rhythm and the single biggest cause of stroke. Ablation, destroying regions of the atria, is applied largely empirically and can be curative but with a disappointing clinical success rate. We design a simple model of activation wave front propagation on an anisotropic structure mimicking the branching network of heart muscle cells. This integration of phenomenological dynamics and pertinent structure shows how AF emerges spontaneously when the transverse cell-to-cell coupling decreases, as occurs with age, beyond a threshold value. We identify critical regions responsible for the initiation and maintenance of AF, the ablation of which terminates AF. The simplicity of the model allows us to calculate analytically the risk of arrhythmia and express the threshold value of transversal cell-to-cell coupling as a function of the model parameters. This threshold value decreases with increasing refractory period by reducing the number of critical regions which can initiate and sustain microreentrant circuits. These biologically testable predictions might inform ablation therapies and arrhythmic risk assessment.
Full Text Available Protein interaction networks are an important part of the post-genomic effort to integrate a part-list view of the cell into system-level understanding. Using a set of 11 yeast genomes we show that combining comparative genomics and secondary structure information greatly increases consensus-based prediction of SH3 targets. Benchmarking of our method against positive and negative standards gave 83% accuracy with 26% coverage. The concept of an optimal divergence time for effective comparative genomics studies was analyzed, demonstrating that genomes of species that diverged very recently from Saccharomyces cerevisiae(S. mikatae, S. bayanus, and S. paradoxus, or a long time ago (Neurospora crassa and Schizosaccharomyces pombe, contain less information for accurate prediction of SH3 targets than species within the optimal divergence time proposed. We also show here that intrinsically disordered SH3 domain targets are more probable sites of interaction than equivalent sites within ordered regions. Our findings highlight several novel S. cerevisiae SH3 protein interactions, the value of selection of optimal divergence times in comparative genomics studies, and the importance of intrinsic disorder for protein interactions. Based on our results we propose novel roles for the S. cerevisiae proteins Abp1p in endocytosis and Hse1p in endosome protein sorting.
Full Text Available Protein interaction networks are an important part of the post-genomic effort to integrate a part-list view of the cell into system-level understanding. Using a set of 11 yeast genomes we show that combining comparative genomics and secondary structure information greatly increases consensus-based prediction of SH3 targets. Benchmarking of our method against positive and negative standards gave 83% accuracy with 26% coverage. The concept of an optimal divergence time for effective comparative genomics studies was analyzed, demonstrating that genomes of species that diverged very recently from Saccharomyces cerevisiae(S. mikatae, S. bayanus, and S. paradoxus, or a long time ago (Neurospora crassa and Schizosaccharomyces pombe, contain less information for accurate prediction of SH3 targets than species within the optimal divergence time proposed. We also show here that intrinsically disordered SH3 domain targets are more probable sites of interaction than equivalent sites within ordered regions. Our findings highlight several novel S. cerevisiae SH3 protein interactions, the value of selection of optimal divergence times in comparative genomics studies, and the importance of intrinsic disorder for protein interactions. Based on our results we propose novel roles for the S. cerevisiae proteins Abp1p in endocytosis and Hse1p in endosome protein sorting.
Lutz, Sharon M; Cho, Michael H; Young, Kendra; Hersh, Craig P; Castaldi, Peter J; McDonald, Merry-Lynn; Regan, Elizabeth; Mattheisen, Manuel; DeMeo, Dawn L; Parker, Margaret; Foreman, Marilyn; Make, Barry J; Jensen, Robert L; Casaburi, Richard; Lomas, David A; Bhatt, Surya P; Bakke, Per; Gulsvik, Amund; Crapo, James D; Beaty, Terri H; Laird, Nan M; Lange, Christoph; Hokanson, John E; Silverman, Edwin K
Pulmonary function decline is a major contributor to morbidity and mortality among smokers. Post bronchodilator FEV1 and FEV1/FVC ratio are considered the standard assessment of airflow obstruction. We performed a genome-wide association study (GWAS) in 9919 current and former smokers in the COPDGene study (6659 non-Hispanic Whites [NHW] and 3260 African Americans [AA]) to identify associations with spirometric measures (post-bronchodilator FEV1 and FEV1/FVC). We also conducted meta-analysis of FEV1 and FEV1/FVC GWAS in the COPDGene, ECLIPSE, and GenKOLS cohorts (total n = 13,532). Among NHW in the COPDGene cohort, both measures of pulmonary function were significantly associated with SNPs at the 15q25 locus [containing CHRNA3/5, AGPHD1, IREB2, CHRNB4] (lowest p-value = 2.17 × 10(-11)), and FEV1/FVC was associated with a genomic region on chromosome 4 [upstream of HHIP] (lowest p-value = 5.94 × 10(-10)); both regions have been previously associated with COPD. For the meta-analysis, in addition to confirming associations to the regions near CHRNA3/5 and HHIP, genome-wide significant associations were identified for FEV1 on chromosome 1 [TGFB2] (p-value = 8.99 × 10(-9)), 9 [DBH] (p-value = 9.69 × 10(-9)) and 19 [CYP2A6/7] (p-value = 3.49 × 10(-8)) and for FEV1/FVC on chromosome 1 [TGFB2] (p-value = 8.99 × 10(-9)), 4 [FAM13A] (p-value = 3.88 × 10(-12)), 11 [MMP3/12] (p-value = 3.29 × 10(-10)) and 14 [RIN3] (p-value = 5.64 × 10(-9)). In a large genome-wide association study of lung function in smokers, we found genome-wide significant associations at several previously described loci with lung function or COPD. We additionally identified a novel genome-wide significant locus with FEV1 on chromosome 9 [DBH] in a meta-analysis of three study populations.
Full Text Available Abstract Background Specific genetic contributions for preeclampsia (PE are currently unknown. This genome-wide association study (GWAS aims to identify maternal single nucleotide polymorphisms (SNPs and copy-number variants (CNVs involved in the etiology of PE. Methods A genome-wide scan was performed on 177 PE cases (diagnosed according to National Heart, Lung and Blood Institute guidelines and 116 normotensive controls. White female study subjects from Iowa were genotyped on Affymetrix SNP 6.0 microarrays. CNV calls made using a combination of four detection algorithms (Birdseye, Canary, PennCNV, and QuantiSNP were merged using CNVision and screened with stringent prioritization criteria. Due to limited DNA quantities and the deleterious nature of copy-number deletions, it was decided a priori that only deletions would be selected for assay on the entire case-control dataset using quantitative real-time PCR. Results The top four SNP candidates had an allelic or genotypic p-value between 10-5 and 10-6, however, none surpassed the Bonferroni-corrected significance threshold. Three recurrent rare deletions meeting prioritization criteria detected in multiple cases were selected for targeted genotyping. A locus of particular interest was found showing an enrichment of case deletions in 19q13.31 (5/169 cases and 1/114 controls, which encompasses the PSG11 gene contiguous to a highly plastic genomic region. All algorithm calls for these regions were assay confirmed. Conclusions CNVs may confer risk for PE and represent interesting regions that warrant further investigation. Top SNP candidates identified from the GWAS, although not genome-wide significant, may be useful to inform future studies in PE genetics.
Hammarlöf, Disa L; Canals, Rocío; Hinton, Jay C D
The availability of thousands of genome sequences of bacterial pathogens poses a particular challenge because each genome contains hundreds of genes of unknown function (FUN). How can we easily discover which FUN genes encode important virulence factors? One solution is to combine two different functional genomic approaches. First, transcriptomics identifies bacterial FUN genes that show differential expression during the process of mammalian infection. Second, global mutagenesis identifies individual FUN genes that the pathogen requires to cause disease. The intersection of these datasets can reveal a small set of candidate genes most likely to encode novel virulence attributes. We demonstrate this approach with the Salmonella infection model, and propose that a similar strategy could be used for other bacterial pathogens. Copyright © 2013 Elsevier Ltd. All rights reserved.
Weidinger, Stephan; Willis-Owen, Saffron A G; Kamatani, Yoichiro; Baurecht, Hansjörg; Morar, Nilesh; Liang, Liming; Edser, Pauline; Street, Teresa; Rodriguez, Elke; O'Regan, Grainne M; Beattie, Paula; Fölster-Holst, Regina; Franke, Andre; Novak, Natalija; Fahy, Caoimhe M; Winge, Mårten C G; Kabesch, Michael; Illig, Thomas; Heath, Simon; Söderhäll, Cilla; Melén, Erik; Pershagen, Göran; Kere, Juha; Bradley, Maria; Lieden, Agne; Nordenskjold, Magnus; Harper, John I; McLean, W H Irwin; Brown, Sara J; Cookson, William O C; Lathrop, G Mark; Irvine, Alan D; Moffatt, Miriam F
Atopic dermatitis (AD) is the most common dermatological disease of childhood. Many children with AD have asthma and AD shares regions of genetic linkage with psoriasis, another chronic inflammatory skin disease. We present here a genome-wide association study (GWAS) of childhood-onset AD in 1563 European cases with known asthma status and 4054 European controls. Using Illumina genotyping followed by imputation, we generated 268 034 consensus genotypes and in excess of 2 million single nucleotide polymorphisms (SNPs) for analysis. Association signals were assessed for replication in a second panel of 2286 European cases and 3160 European controls. Four loci achieved genome-wide significance for AD and replicated consistently across all cohorts. These included the epidermal differentiation complex (EDC) on chromosome 1, the genomic region proximal to LRRC32 on chromosome 11, the RAD50/IL13 locus on chromosome 5 and the major histocompatibility complex (MHC) on chromosome 6; reflecting action of classical HLA alleles. We observed variation in the contribution towards co-morbid asthma for these regions of association. We further explored the genetic relationship between AD, asthma and psoriasis by examining previously identified susceptibility SNPs for these diseases. We found considerable overlap between AD and psoriasis together with variable coincidence between allergic rhinitis (AR) and asthma. Our results indicate that the pathogenesis of AD incorporates immune and epidermal barrier defects with combinations of specific and overlapping effects at individual loci.
Li, M-H; Tiirikka, T; Kantanen, J
In sheep, coat colour (and pattern) is one of the important traits of great biological, economic and social importance. However, the genetics of sheep coat colour has not yet been fully clarified. We conducted a genome-wide association study of sheep coat colours by genotyping 47 303 single-nucleotide polymorphisms (SNPs) in the Finnsheep population in Finland. We identified 35 SNPs associated with all the coat colours studied, which cover genomic regions encompassing three kno...
Tran, Hue T M; Ramaraj, Thiruvarangan; Furtado, Agnelo; Lee, Leonard Slade; Henry, Robert J
Arabica coffee (Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high- or low-caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long-read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCOs were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms (SNPs) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway-based analysis, 65 caffeine-associated SNPs were discovered, among which 11 SNPs were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Sahl, Jason W.; Gillece, John D.; Schupp, James M.; Waddell, Victor G.; Driebe, Elizabeth M.; Engelthaler, David M.; Keim, Paul
Acinetobacter baumannii is an emergent and global nosocomial pathogen. In addition to A. baumannii, other Acinetobacter species, especially those in the Acinetobacter calcoaceticus-baumannii (Acb) complex, have also been associated with serious human infection. Although mechanisms of attachment, persistence on abiotic surfaces, and pathogenesis in A. baumannii have been identified, the genetic mechanisms that explain the emergence of A. baumannii as the most widespread and virulent Acinetobacter species are not fully understood. Recent whole genome sequencing has provided insight into the phylogenetic structure of the genus Acinetobacter. However, a global comparison of genomic features between Acinetobacter spp. has not been described in the literature. In this study, 136 Acinetobacter genomes, including 67 sequenced in this study, were compared to identify the acquisition and loss of genes in the expansion of the Acinetobacter genus. A whole genome phylogeny confirmed that A. baumannii is a monophyletic clade and that the larger Acb complex is also a well-supported monophyletic group. The whole genome phylogeny provided the framework for a global genomic comparison based on a blast score ratio (BSR) analysis. The BSR analysis demonstrated that specific genes have been both lost and acquired in the evolution of A. baumannii. In addition, several genes associated with A. baumannii pathogenesis were found to be more conserved in the Acb complex, and especially in A. baumannii, than in other Acinetobacter genomes; until recently, a global analysis of the distribution and conservation of virulence factors across the genus was not possible. The results demonstrate that the acquisition of specific virulence factors has likely contributed to the widespread persistence and virulence of A. baumannii. The identification of novel features associated with transcriptional regulation and acquired by clades in the Acb complex presents targets for better understanding the
Ventura, Marco; Canchaya, Carlos; Bernini, Valentina; Altermann, Eric; Barrangou, Rodolphe; McGrath, Stephen; Claesson, Marcus J.; Li, Yin; Leahy, Sinead; Walker, Carey D.; Zink, Ralf; Neviani, Erasmo; Steele, Jim; Broadbent, Jeff; Klaenhammer, Todd R.; Fitzgerald, Gerald F.; O'Toole, Paul W.; van Sinderen, Douwe
Lactobacillus gasseri ATCC 33323, Lactobacillus salivarius subsp. salivarius UCC 118, and Lactobacillus casei ATCC 334 contain one (LgaI), four (Sal1, Sal2, Sal3, Sal4), and one (Lca1) distinguishable prophage sequences, respectively. Sequence analysis revealed that LgaI, Lca1, Sal1, and Sal2 prophages belong to the group of Sfi11-like pac site and cos site Siphoviridae, respectively. Phylogenetic investigation of these newly described prophage sequences revealed that they have not followed an evolutionary development similar to that of their bacterial hosts and that they show a high degree of diversity, even within a species. The attachment sites were determined for all these prophage elements; LgaI as well as Sal1 integrates in tRNA genes, while prophage Sal2 integrates in a predicted arginino-succinate lyase-encoding gene. In contrast, Lca1 and the Sal3 and Sal4 prophage remnants are integrated in noncoding regions in the L. casei ATCC 334 and L. salivarius UCC 118 genomes. Northern analysis showed that large parts of the prophage genomes are transcriptionally silent and that transcription is limited to genome segments located near the attachment site. Finally, pulsed-field gel electrophoresis followed by Southern blot hybridization with specific prophage probes indicates that these prophage sequences are narrowly distributed within lactobacilli. PMID:16672450
Full Text Available Blown pack spoilage (BPS is a major issue for the beef industry. Aetiological agents of BPS involve members of a group of Clostridium species, including Clostridium estertheticum which has the ability to produce gas, mostly carbon dioxide, under anaerobic psychotrophic growth conditions. This spore-forming bacterium grows slowly under laboratory conditions, and it can take up to 3 months to produce a workable culture. These characteristics have limited the study of this commercially challenging bacterium. Consequently information on this bacterium is limited and no effective controls are currently available to confidently detect and manage this production risk. In this study the complete genome of Clostridium estertheticum DSM 8809 was determined by SMRT® sequencing. The genome consists of a circular chromosome of 4.7 Mbp along with a single plasmid carrying a potential tellurite resistance gene tehB and a Tn3-like resolvase-encoding gene tnpR. The genome sequence was searched for central metabolic pathways that would support its biochemical profile and several enzymes contributing to this phenotype were identified. Several putative antibiotic/biocide/metal resistance-encoding genes and virulence factors were also identified in the genome, a feature that requires further research. The availability of the genome sequence will provide a basic blueprint from which to develop valuable biomarkers that could support and improve the detection and control of this bacterium along the beef production chain.
C.M. Lindgren (Cecilia); I.M. Heid (Iris); J.C. Randall (Joshua); C. Lamina (Claudia); V. Steinthorsdottir (Valgerdur); L. Qi (Lu); E.K. Speliotes (Elizabeth); G. Thorleifsson (Gudmar); C.J. Willer (Cristen); B.M. Herrera (Blanca); A.U. Jackson (Anne); N. Lim (Noha); P. Scheet (Paul); N. Soranzo (Nicole); N. Amin (Najaf); Y.S. Aulchenko (Yurii); J.C. Chambers (John); A. Drong (Alexander); J. Luan; H.N. Lyon (Helen); F. Rivadeneira Ramirez (Fernando); S. Sanna (Serena); N.J. Timpson (Nicholas); M.C. Zillikens (Carola); H.Z. Jing; P. Almgren (Peter); S. Bandinelli (Stefania); A.J. Bennett (Amanda); R.N. Bergman (Richard); L.L. Bonnycastle (Lori); S. Bumpstead (Suzannah); S.J. Chanock (Stephen); L. Cherkas (Lynn); P.S. Chines (Peter); L. Coin (Lachlan); C. Cooper (Charles); G. Crawford (Gabe); A. Doering (Angela); A. Dominiczak (Anna); A.S.F. Doney (Alex); S. Ebrahim (Shanil); P. Elliott (Paul); M.R. Erdos (Michael); K. Estrada Gil (Karol); L. Ferrucci (Luigi); G. Fischer (Guido); N.G. Forouhi (Nita); C. Gieger (Christian); H. Grallert (Harald); C.J. Groves (Christopher); S.M. Grundy (Scott); C. Guiducci (Candace); D. Hadley (David); A. Hamsten (Anders); A.S. Havulinna (Aki); A. Hofman (Albert); R. Holle (Rolf); J.W. Holloway (John); T. Illig (Thomas); B. Isomaa (Bo); L.C. Jacobs (Leonie); K. Jameson (Karen); P. Jousilahti (Pekka); F. Karpe (Fredrik); J. Kuusisto (Johanna); J. Laitinen (Jaana); G.M. Lathrop (Mark); D.A. Lawlor (Debbie); M. Mangino (Massimo); W.L. McArdle (Wendy); T. Meitinger (Thomas); M.A. Morken (Mario); A.P. Morris (Andrew); P. Munroe (Patricia); N. Narisu (Narisu); A. Nordström (Anna); B.A. Oostra (Ben); C.N.A. Palmer (Colin); F. Payne (Felicity); J. Peden (John); I. Prokopenko (Inga); F. Renström (Frida); A. Ruokonen (Aimo); V. Salomaa (Veikko); M.S. Sandhu (Manjinder); L.J. Scott (Laura); A. Scuteri (Angelo); K. Silander (Kaisa); K. Song (Kijoung); X. Yuan (Xin); H.M. Stringham (Heather); A.J. Swift (Amy); T. Tuomi (Tiinamaija); M. Uda (Manuela); P. Vollenweider (Peter); G. Waeber (Gérard); C. Wallace (Chris); G.B. Walters (Bragi); M.N. Weedon (Michael); J.C.M. Witteman (Jacqueline); C. Zhang (Cuilin); M. Caulfield (Mark); F.S. Collins (Francis); G.D. Smith; I.N.M. Day (Ian); P.W. Franks (Paul); A.T. Hattersley (Andrew); F.B. Hu (Frank); M.-R. Jarvelin (Marjo-Riitta); A. Kong (Augustine); J.S. Kooner (Jaspal); M. Laakso (Markku); E. Lakatta (Edward); V. Mooser (Vincent); L. Peltonen (Leena Johanna); N.J. Samani (Nilesh); T.D. Spector (Timothy); D.P. Strachan (David); T. Tanaka (Toshiko); J. Tuomilehto (Jaakko); A.G. Uitterlinden (André); P. Tikka-Kleemola (Päivi); N.J. Wareham (Nick); H. Watkins (Hugh); D. Waterworth (Dawn); M. Boehnke (Michael); P. Deloukas (Panagiotis); L. Groop (Leif); D.J. Hunter (David); U. Thorsteinsdottir (Unnur); D. Schlessinger (David); H.E. Wichmann (Erich); T.M. Frayling (Timothy); G.R. Abecasis (Gonçalo); J.N. Hirschhorn (Joel); R.J.F. Loos (Ruth); J-A. Zwart (John-Anker); K.L. Mohlke (Karen); I.E. Barroso (Inês); M.I. McCarthy (Mark)
textabstractTo identify genetic loci influencing central obesity and fat distribution, we performed a meta-analysis of 16 genome-wide association studies (GWAS, N = 38,580) informative for adult waist circumference (WC) and waist-hip ratio (WHR). We selected 26 SNPs for follow-up, for which the
Kote-Jarai, Zsofia; Olama, Ali Amin Al; Giles, Graham G
Prostate cancer (PrCa) is the most frequently diagnosed male cancer in developed countries. We conducted a multi-stage genome-wide association study for PrCa and previously reported the results of the first two stages, which identified 16 PrCa susceptibility loci. We report here the results of st...
This article explores the impact of the mapping work of the Human Genome Project on individuals with mental retardation and the negative effects of genetic testing. The potential to identify disabilities and the concept of eugenics are discussed, along with ethical issues surrounding potential genetic therapies. (Contains references.) (CR)
Ghoussaini, Maya; Fletcher, Olivia; Michailidou, Kyriaki
Breast cancer is the most common cancer among women. To date, 22 common breast cancer susceptibility loci have been identified accounting for ∼8% of the heritability of the disease. We attempted to replicate 72 promising associations from two independent genome-wide association studies (GWAS...
Nicolas, Aude; Kenna, Kevin P.; Renton, Alan E.; Ticozzi, Nicola; Faghri, Faraz; Chia, Ruth; Dominov, Janice A.; Kenna, Brendan J.; Nalls, Mike A.; Keagle, Pamela; Rivera, Alberto M.; van Rheenen, Wouter; Murphy, Natalie A.; van Vugt, Joke J.F.A.; Geiger, Joshua T.; van der Spek, Rick; Pliner, Hannah A.; Smith, Bradley N.; Marangi, Giuseppe; Topp, Simon D.; Abramzon, Yevgeniya; Gkazi, Athina Soragia; Eicher, John D.; Kenna, Aoife; Logullo, Francesco O.; Simone, Isabella L.; Logroscino, Giancarlo; Salvi, Fabrizio; Bartolomei, Ilaria; Borghero, Giuseppe; Murru, Maria Rita; Costantino, Emanuela; Pani, Carla; Puddu, Roberta; Caredda, Carla; Piras, Valeria; Tranquilli, Stefania; Cuccu, Stefania; Corongiu, Daniela; Melis, Maurizio; Milia, Antonio; Marrosu, Francesco; Marrosu, Maria Giovanna; Floris, Gianluca; Cannas, Antonino; Capasso, Margherita; Caponnetto, Claudia; Mancardi, Gianluigi; Origone, Paola; Mandich, Paola; Conforti, Francesca L.; Cavallaro, Sebastiano; Mora, Gabriele; Marinou, Kalliopi; Sideri, Riccardo; Penco, Silvana; Mosca, Lorena; Lunetta, Christian; Pinter, Giuseppe Lauria; Corbo, Massimo; Riva, Nilo; Carrera, Paola; Volanti, Paolo; Mandrioli, Jessica; Fini, Nicola; Fasano, Antonio; Tremolizzo, Lucio; Arosio, Alessandro; Ferrarese, Carlo; Trojsi, Francesca; Tedeschi, Gioacchino; Monsurrò, Maria Rosaria; Piccirillo, Giovanni; Femiano, Cinzia; Ticca, Anna; Ortu, Enzo; La Bella, Vincenzo; Spataro, Rossella; Colletti, Tiziana; Sabatelli, Mario; Zollino, Marcella; Conte, Amelia; Luigetti, Marco; Lattante, Serena; Marangi, Giuseppe; Santarelli, Marialuisa; Petrucci, Antonio; Pugliatti, Maura; Pirisi, Angelo; Parish, Leslie D.; Occhineri, Patrizia; Giannini, Fabio; Battistini, Stefania; Ricci, Claudia; Benigni, Michele; Cau, Tea B.; Loi, Daniela; Calvo, Andrea; Moglia, Cristina; Brunetti, Maura; Barberis, Marco; Restagno, Gabriella; Casale, Federico; Marrali, Giuseppe; Fuda, Giuseppe; Ossola, Irene; Cammarosano, Stefania; Canosa, Antonio; Ilardi, Antonio; Manera, Umberto; Grassano, Maurizio; Tanel, Raffaella; Pisano, Fabrizio; Mora, Gabriele; Calvo, Andrea; Mazzini, Letizia; Riva, Nilo; Mandrioli, Jessica; Caponnetto, Claudia; Battistini, Stefania; Volanti, Paolo; La Bella, Vincenzo; Conforti, Francesca L.; Borghero, Giuseppe; Messina, Sonia; Simone, Isabella L.; Trojsi, Francesca; Salvi, Fabrizio; Logullo, Francesco O.; D'Alfonso, Sandra; Corrado, Lucia; Capasso, Margherita; Ferrucci, Luigi; Harms, Matthew B.; Goldstein, David B.; Shneider, Neil A.; Goutman, Stephen A.; Simmons, Zachary; Miller, Timothy M.; Chandran, Siddharthan; Pal, Suvankar; Manousakis, George; Appel, Stanley H.; Simpson, Ericka; Wang, Leo; Baloh, Robert H.; Gibson, Summer B.; Bedlack, Richard; Lacomis, David; Sareen, Dhruv; Sherman, Alexander; Bruijn, Lucie; Penny, Michelle; Moreno, Cristiane de Araujo Martins; Kamalakaran, Sitharthan; Goldstein, David B.; Allen, Andrew S.; Appel, Stanley; Baloh, Robert H.; Bedlack, Richard S.; Boone, Braden E.; Brown, Robert; Carulli, John P.; Chesi, Alessandra; Chung, Wendy K.; Cirulli, Elizabeth T.; Cooper, Gregory M.; Couthouis, Julien; Day-Williams, Aaron G.; Dion, Patrick A.; Gibson, Summer B.; Gitler, Aaron D.; Glass, Jonathan D.; Goldstein, David B.; Han, Yujun; Harms, Matthew B.; Harris, Tim; Hayes, Sebastian D.; Jones, Angela L.; Keebler, Jonathan; Krueger, Brian J.; Lasseigne, Brittany N.; Levy, Shawn E.; Lu, Yi Fan; Maniatis, Tom; McKenna-Yasek, Diane; Miller, Timothy M.; Myers, Richard M.; Petrovski, Slavé; Pulst, Stefan M.; Raphael, Alya R.; Ravits, John M.; Ren, Zhong; Rouleau, Guy A.; Sapp, Peter C.; Shneider, Neil A.; Simpson, Ericka; Sims, Katherine B.; Staropoli, John F.; Waite, Lindsay L.; Wang, Quanli; Wimbish, Jack R.; Xin, Winnie W.; Gitler, Aaron D.; Harris, Tim; Myers, Richard M.; Phatnani, Hemali; Kwan, Justin; Sareen, Dhruv; Broach, James R.; Simmons, Zachary; Arcila-Londono, Ximena; Lee, Edward B.; Van Deerlin, Vivianna M.; Shneider, Neil A.; Fraenkel, Ernest; Ostrow, Lyle W.; Baas, Frank; Zaitlen, Noah; Berry, James D.; Malaspina, Andrea; Fratta, Pietro; Cox, Gregory A.; Thompson, Leslie M.; Finkbeiner, Steve; Dardiotis, Efthimios; Miller, Timothy M.; Chandran, Siddharthan; Pal, Suvankar; Hornstein, Eran; MacGowan, Daniel J.L.; Heiman-Patterson, Terry D.; Hammell, Molly G.; Patsopoulos, Nikolaos A.; Dubnau, Joshua; Nath, Avindra; Phatnani, Hemali; Musunuri, Rajeeva Lochan; Evani, Uday Shankar; Abhyankar, Avinash; Zody, Michael C.; Kaye, Julia; Finkbeiner, Steven; Wyman, Stacia K.; LeNail, Alexander; Lima, Leandro; Fraenkel, Ernest; Rothstein, Jeffrey D.; Svendsen, Clive N.; Thompson, Leslie M.; Van Eyk, Jenny; Maragakis, Nicholas J.; Berry, James D.; Glass, Jonathan D.; Miller, Timothy M.; Kolb, Stephen J.; Baloh, Robert H.; Cudkowicz, Merit; Baxi, Emily; Kaye, Julia; Finkbeiner, Steven; Wyman, Stacia K.; Finkbeiner, Steven; LeNail, Alex; Lima, Leandro; Fraenkel, Ernest; Fraenkel, Ernest; Svendsen, Clive N.; Svendsen, Clive N.; Thompson, Leslie M.; Thompson, Leslie M.; Van Eyk, Jennifer E.; Berry, James D.; Berry, James D.; Miller, Timothy M.; Kolb, Stephen J.; Cudkowicz, Merit; Cudkowicz, Merit; Baxi, Emily; Benatar, Michael; Taylor, J. Paul; Wu, Gang; Rampersaud, Evadnie; Wuu, Joanne; Rademakers, Rosa; Züchner, Stephan; Schule, Rebecca; McCauley, Jacob; Hussain, Sumaira; Cooley, Anne; Wallace, Marielle; Clayman, Christine; Barohn, Richard; Statland, Jeffrey; Ravits, John M.; Swenson, Andrea; Jackson, Carlayne; Trivedi, Jaya; Khan, Shaida; Katz, Jonathan; Jenkins, Liberty; Burns, Ted; Gwathmey, Kelly; Caress, James; McMillan, Corey; Elman, Lauren; Pioro, Erik P.; Heckmann, Jeannine; So, Yuen; Walk, David; Maiser, Samuel; Zhang, Jinghui; Benatar, Michael; Taylor, J. Paul; Taylor, J. Paul; Rampersaud, Evadnie; Wu, Gang; Wuu, Joanne; Silani, Vincenzo; Ticozzi, Nicola; Gellera, Cinzia; Ratti, Antonia; Taroni, Franco; Lauria, Giuseppe; Verde, Federico; Fogh, Isabella; Tiloca, Cinzia; Comi, Giacomo P.; Sorarù, Gianni; Cereda, Cristina; D'Alfonso, Sandra; Corrado, Lucia; De Marchi, Fabiola; Corti, Stefania; Ceroni, Mauro; Mazzini, Letizia; Siciliano, Gabriele; Filosto, Massimiliano; Inghilleri, Maurizio; Peverelli, Silvia; Colombrita, Claudia; Poletti, Barbara; Maderna, Luca; Del Bo, Roberto; Gagliardi, Stella; Querin, Giorgia; Bertolin, Cinzia; Pensato, Viviana; Castellotti, Barbara; Lauria, Giuseppe; Verde, Federico; Fogh, Isabella; Tiloca, Cinzia; Fogh, Isabella; Comi, Giacomo P.; Sorarù, Gianni; Cereda, Cristina; Camu, William; Mouzat, Kevin; Lumbroso, Serge; Corcia, Philippe; Meininger, Vincent; Besson, Gérard; Lagrange, Emmeline; Clavelou, Pierre; Guy, Nathalie; Couratier, Philippe; Vourch, Patrick; Danel, Véronique; Bernard, Emilien; Lemasson, Gwendal; Corcia, Philippe; Laaksovirta, Hannu; Myllykangas, Liisa; Jansson, Lilja; Valori, Miko; Ealing, John; Hamdalla, Hisham; Rollinson, Sara; Pickering-Brown, Stuart; Orrell, Richard W.; Sidle, Katie C.; Malaspina, Andrea; Hardy, John; Singleton, Andrew B.; Johnson, Janel O.; Arepalli, Sampath; Sapp, Peter C.; McKenna-Yasek, Diane; Polak, Meraida; Asress, Seneshaw; Al-Sarraj, Safa; King, Andrew; Troakes, Claire; Vance, Caroline; de Belleroche, Jacqueline; Baas, Frank; ten Asbroek, Anneloor L.M.A.; Muñoz-Blanco, José Luis; Hernandez, Dena G.; Ding, Jinhui; Gibbs, J. Raphael; Scholz, Sonja W.; Scholz, Sonja W.; Floeter, Mary Kay; Campbell, Roy H.; Landi, Francesco; Bowser, Robert; Pulst, Stefan M.; Ravits, John M.; MacGowan, Daniel J.L.; Kirby, Janine; Pioro, Erik P.; Pamphlett, Roger; Broach, James; Gerhard, Glenn; Dunckley, Travis L.; Brady, Christopher B.; Brady, Christopher B.; Kowall, Neil W.; Troncoso, Juan C.; Le Ber, Isabelle; Mouzat, Kevin; Lumbroso, Serge; Mouzat, Kevin; Lumbroso, Serge; Heiman-Patterson, Terry D.; Heiman-Patterson, Terry D.; Kamel, Freya; Van Den Bosch, Ludo; Van Den Bosch, Ludo; Baloh, Robert H.; Strom, Tim M.; Meitinger, Thomas; Strom, Tim M.; Shatunov, Aleksey; Van Eijk, Kristel R.; de Carvalho, Mamede; de Carvalho, Mamede; Kooyman, Maarten; Middelkoop, Bas; Moisse, Matthieu; McLaughlin, Russell; Van Es, Michael A.; Weber, Markus; Boylan, Kevin B.; Van Blitterswijk, Marka; Rademakers, Rosa; Morrison, Karen; Basak, A. Nazli; Mora, Jesús S.; Drory, Vivian; Shaw, Pamela; Turner, Martin R.; Talbot, Kevin; Hardiman, Orla; Williams, Kelly L.; Fifita, Jennifer A.; Nicholson, Garth A.; Blair, Ian P.; Nicholson, Garth A.; Rouleau, Guy A.; Esteban-Pérez, Jesús; García-Redondo, Alberto; Al-Chalabi, Ammar; Al Kheifat, Ahmad; Al-Chalabi, Ammar; Andersen, Peter M.; Basak, A. Nazli; Blair, Ian P.; Chio, Adriano; Cooper-Knock, Jonathan; Corcia, Philippe; Couratier, Philippe; de Carvalho, Mamede; Dekker, Annelot; Drory, Vivian; Redondo, Alberto Garcia; Gotkine, Marc; Hardiman, Orla; Hide, Winston; Iacoangeli, Alfredo; Glass, Jonathan D.; Kenna, Kevin P.; Kiernan, Matthew; Kooyman, Maarten; Landers, John E.; McLaughlin, Russell; Middelkoop, Bas; Mill, Jonathan; Neto, Miguel Mitne; Moisse, Matthieu; Pardina, Jesus Mora; Morrison, Karen; Newhouse, Stephen; Pinto, Susana; Pulit, Sara; Robberecht, Wim; Shatunov, Aleksey; Shaw, Pamela; Shaw, Chris; Silani, Vincenzo; Sproviero, William; Tazelaar, Gijs; Ticozzi, Nicola; Van Damme, Philip; van den Berg, Leonard; van der Spek, Rick; Van Eijk, Kristel R.; Van Es, Michael A.; van Rheenen, Wouter; van Vugt, Joke J.F.A.; Veldink, Jan H.; Weber, Markus; Williams, Kelly L.; Van Damme, Philip; Robberecht, Wim; Zatz, Mayana; Robberecht, Wim; Bauer, Denis C.; Twine, Natalie A.; Rogaeva, Ekaterina; Zinman, Lorne; Ostrow, Lyle W.; Maragakis, Nicholas J.; Rothstein, Jeffrey D.; Simmons, Zachary; Cooper-Knock, Johnathan; Brice, Alexis; Goutman, Stephen A.; Feldman, Eva L.; Gibson, Summer B.; Taroni, Franco; Ratti, Antonia; Ratti, Antonia; Gellera, Cinzia; Van Damme, Philip; Robberecht, Wim; Fratta, Pietro; Sabatelli, Mario; Lunetta, Christian; Ludolph, Albert C.; Andersen, Peter M.; Weishaupt, Jochen H.; Camu, William; Trojanowski, John Q.; Van Deerlin, Vivianna M.; Brown, Robert H.; van den Berg, Leonard; Veldink, Jan H.; Harms, Matthew B.; Glass, Jonathan D.; Stone, David J.; Tienari, Pentti; Silani, Vincenzo; Silani, Vincenzo; Chiò, Adriano; Shaw, Christopher E.; Chiò, Adriano; Traynor, Bryan J.; Landers, John E.; Traynor, Bryan J.
To identify novel genes associated with ALS, we undertook two lines of investigation. We carried out a genome-wide association study comparing 20,806 ALS cases and 59,804 controls. Independently, we performed a rare variant burden analysis comparing 1,138 index familial ALS cases and 19,494
Kottgen, A.; Albrecht, E.; Teumer, A.; Vitart, V.; Krumsiek, J.; Hundertmark, C.; Pistis, G.; Ruggiero, D.; O'Seaghdha, C.M.; Haller, T.; Yang, Q.; Johnson, A.D.; Kutalik, Z.; Smith, A.V.; Shi, J.L.; Struchalin, M.; Middelberg, R.P.S.; Brown, M.J.; Gaffo, A.L.; Pirastu, N.; Li, G.; Hayward, C.; Zemunik, T.; Huffman, J.; Yengo, L.; Zhao, J.H.; Demirkan, A.; Feitosa, M.F.; Liu, X.; Malerba, G.; Lopez, L.M.; van der Harst, P.; Li, X.Z.; Kleber, M.E.; Hicks, A.A.; Nolte, I.M.; Johansson, A.; Murgia, F.; Wild, S.H.; Bakker, S.J.L.; Peden, J.F.; Dehghan, A.; Steri, M.; Tenesa, A.; Lagou, V.; Salo, P.; Mangino, M.; Rose, L.M.; Lehtimaki, T.; Woodward, O.M.; Okada, Y.; Tin, A.; Muller, C.; Oldmeadow, C.; Putku, M.; Czamara, D.; Kraft, P.; Frogheri, L.; Thun, G.A.; Grotevendt, A.; Gislason, G.K.; Harris, T.B.; Launer, L.J.; McArdle, P.; Shuldiner, A.R.; Boerwinkle, E.; Coresh, J.; Schmidt, H.; Schallert, M.; Martin, N.G.; Montgomery, G.W.; Kubo, M.; Nakamura, Y.; Tanaka, T.; Munroe, P.B.; Samani, N.J.; Jacobs, D.R.; Liu, K.; d'Adamo, P.; Ulivi, S.; Rotter, J.I.; Psaty, B.M.; Vollenweider, P.; Waeber, G.; Campbell, S.; Devuyst, O.; Navarro, P.; Kolcic, I.; Hastie, N.; Balkau, B.; Froguel, P.; Esko, T.; Salumets, A.; Khaw, K.T.; Langenberg, C.; Wareham, N.J.; Isaacs, A.; Kraja, A.; Zhang, Q.Y.; Penninx, B.W.J.H.; Smit, J.H.; Bochud, M.; Gieger, C.
Elevated serum urate concentrations can cause gout, a prevalent and painful inflammatory arthritis. By combining data from >140,000 individuals of European ancestry within the Global Urate Genetics Consortium (GUGC), we identified and replicated 28 genome-wide significant loci in association with
Köttgen, Anna; Albrecht, Eva; Teumer, Alexander; Vitart, Veronique; Krumsiek, Jan; Hundertmark, Claudia; Pistis, Giorgio; Ruggiero, Daniela; O'Seaghdha, Conall M; Haller, Toomas; Yang, Qiong; Tanaka, Toshiko; Johnson, Andrew D; Kutalik, Zoltán; Smith, Albert V; Shi, Julia; Struchalin, Maksim; Middelberg, Rita P S; Brown, Morris J; Gaffo, Angelo L; Pirastu, Nicola; Li, Guo; Hayward, Caroline; Zemunik, Tatijana; Huffman, Jennifer; Yengo, Loic; Zhao, Jing Hua; Demirkan, Ayse; Feitosa, Mary F; Liu, Xuan; Malerba, Giovanni; Lopez, Lorna M; van der Harst, Pim; Li, Xinzhong; Kleber, Marcus E; Hicks, Andrew A; Nolte, Ilja M; Johansson, Asa; Murgia, Federico; Bakker, Stephan J L; Lagou, Vasiliki; Bruinenberg, Marcel; Stolk, Ronald P; Penninx, Brenda W; Mateo Leach, Irene; van Gilst, Wiek H; Hillege, Hans L; Wolffenbuttel, Bruce H R; Snieder, Harold; Navis, Gerjan
Elevated serum urate concentrations can cause gout, a prevalent and painful inflammatory arthritis. By combining data from >140,000 individuals of European ancestry within the Global Urate Genetics Consortium (GUGC), we identified and replicated 28 genome-wide significant loci in association with
Cerhan, James R.; Berndt, Sonja I.; Vijai, Joseph; Ghesquières, Hervé; McKay, James; Wang, Sophia S.; Wang, Zhaoming; Yeager, Meredith; Conde, Lucia; De Bakker, Paul I W; Nieters, Alexandra; Cox, David; Burdett, Laurie; Monnereau, Alain; Flowers, Christopher R.; De Roos, Anneclaire J.; Brooks-Wilson, Angela R.; Lan, Qing; Severi, Gianluca; Melbye, Mads; Gu, Jian; Jackson, Rebecca D.; Kane, Eleanor; Teras, Lauren R.; Purdue, Mark P.; Vajdic, Claire M.; Spinelli, John J.; Giles, Graham G.; Albanes, Demetrius; Kelly, Rachel S.; Zucca, Mariagrazia; Bertrand, Kimberly A.; Zeleniuch-Jacquotte, Anne; Lawrence, Charles; Hutchinson, Amy; Zhi, Degui; Habermann, Thomas M.; Link, Brian K.; Novak, Anne J.; Dogan, Ahmet; Asmann, Yan W.; Liebow, Mark; Thompson, Carrie A.; Ansell, Stephen M.; Witzig, Thomas E.; Weiner, George J.; Veron, Amelie S.; Zelenika, Diana; Tilly, Hervé; Haioun, Corinne; Molina, Thierry Jo; Hjalgrim, Henrik; Glimelius, Bengt; Adami, Hans Olov; Bracci, Paige M.; Riby, Jacques; Smith, Martyn T.; Holly, Elizabeth A.; Cozen, Wendy; Hartge, Patricia; Morton, Lindsay M.; Severson, Richard K.; Tinker, Lesley F.; North, Kari E.; Becker, Nikolaus; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Foretova, Lenka; Maynadie, Marc; Staines, Anthony; Lightfoot, Tracy; Crouch, Simon; Smith, Alex; Roman, Eve; Diver, W. Ryan; Offit, Kenneth; Zelenetz, Andrew; Klein, Robert J.; Villano, Danylo J.; Zheng, Tongzhang; Zhang, Yawei; Holford, Theodore R.; Kricker, Anne; Turner, Jenny; Southey, Melissa C.; Clavel, Jacqueline; Virtamo, Jarmo; Weinstein, Stephanie; Riboli, Elio; Vineis, Paolo; Kaaks, Rudolph; Trichopoulos, Dimitrios; Vermeulen, Roel C H; Boeing, Heiner; Tjonneland, Anne; Angelucci, Emanuele; Di Lollo, Simonetta; Rais, Marco; Birmann, Brenda M.; Laden, Francine; Giovannucci, Edward; Kraft, Peter; Huang, Jinyan; Ma, Baoshan; Ye, Yuanqing; Chiu, Brian C H; Sampson, Joshua; Liang, Liming; Park, Ju Hyun; Chung, Charles C.; Weisenburger, Dennis D.; Chatterjee, Nilanjan; Fraumeni, Joseph F.; Slager, Susan L.; Wu, Xifeng; De Sanjose, Silvia; Smedby, Karin E.; Salles, Gilles; Skibola, Christine F.; Rothman, Nathaniel; Chanock, Stephen J.
Diffuse large B cell lymphoma (DLBCL) is the most common lymphoma subtype and is clinically aggressive. To identify genetic susceptibility loci for DLBCL, we conducted a meta-analysis of 3 new genome-wide association studies (GWAS) and 1 previous scan, totaling 3,857 cases and 7,666 controls of
Hara, Kazuo; Fujita, Hayato; Johnson, Todd A
Although over 60 loci for type 2 diabetes (T2D) have been identified, there still remains a large genetic component to be clarified. To explore unidentified loci for T2D, we performed a genome-wide association study (GWAS) of 6 209 637 single-nucleotide polymorphisms (SNPs), which were directly g...
Guidarelli Jack W
Full Text Available Abstract Background: The highly dimensional data produced by functional genomic (FG studies makes it difficult to visualize relationships between gene products and experimental conditions (i.e., assays. Although dimensionality reduction methods such as principal component analysis (PCA have been very useful, their application to identify assay-specific signatures has been limited by the lack of appropriate methodologies. This article proposes a new and powerful PCA-based method for the identification of assay-specific gene signatures in FG studies. Results: The proposed method (PM is unique for several reasons. First, it is the only one, to our knowledge, that uses gene contribution, a product of the loading and expression level, to obtain assay signatures. The PM develops and exploits two types of assay-specific contribution plots, which are new to the application of PCA in the FG area. The first type plots the assay-specific gene contribution against the given order of the genes and reveals variations in distribution between assay-specific gene signatures as well as outliers within assay groups indicating the degree of importance of the most dominant genes. The second type plots the contribution of each gene in ascending or descending order against a constantly increasing index. This type of plots reveals assay-specific gene signatures defined by the inflection points in the curve. In addition, sharp regions within the signature define the genes that contribute the most to the signature. We proposed and used the curvature as an appropriate metric to characterize these sharp regions, thus identifying the subset of genes contributing the most to the signature. Finally, the PM uses the full dataset to determine the final gene signature, thus eliminating the chance of gene exclusion by poor screening in earlier steps. The strengths of the PM are demonstrated using a simulation study, and two studies of real DNA microarray data – a study of
Cormier, Fabien; Le Gouis, Jacques; Dubreuil, Pierre; Lafarge, Stéphane; Praud, Sébastien
This study identified 333 genomic regions associated to 28 traits related to nitrogen use efficiency in European winter wheat using genome-wide association in a 214-varieties panel experimented in eight environments. Improving nitrogen use efficiency is a key factor to sustainably ensure global production increase. However, while high-throughput screening methods remain at a developmental stage, genetic progress may be mainly driven by marker-assisted selection. The objective of this study was to identify chromosomal regions associated with nitrogen use efficiency-related traits in bread wheat (Triticum aestivum L.) using a genome-wide association approach. Two hundred and fourteen European elite varieties were characterised for 28 traits related to nitrogen use efficiency in eight environments in which two different nitrogen fertilisation levels were tested. The genome-wide association study was carried out using 23,603 SNP with a mixed model for taking into account parentage relationships among varieties. We identified 1,010 significantly associated SNP which defined 333 chromosomal regions associated with at least one trait and found colocalisations for 39 % of these chromosomal regions. A method based on linkage disequilibrium to define the associated region was suggested and discussed with reference to false positive rate. Through a network approach, colocalisations were analysed and highlighted the impact of genomic regions controlling nitrogen status at flowering, precocity, and nitrogen utilisation on global agronomic performance. We were able to explain 40 ± 10 % of the total genetic variation. Numerous colocalisations with previously published genomic regions were observed with such candidate genes as Ppd-D1, Rht-D1, NADH-Gogat, and GSe. We highlighted selection pressure on yield and nitrogen utilisation discussing allele frequencies in associated regions.
Seyerle, Amanda A; Lin, Henry J; Gogarten, Stephanie M; Stilp, Adrienne; Méndez Giráldez, Raul; Soliman, Elsayed; Baldassari, Antoine; Graff, Mariaelisa; Heckbert, Susan; Kerr, Kathleen F; Kooperberg, Charles; Rodriguez, Carlos; Guo, Xiuqing; Yao, Jie; Sotoodehnia, Nona; Taylor, Kent D; Whitsel, Eric A; Rotter, Jerome I; Laurie, Cathy C; Avery, Christy L
PR interval (PR) is a heritable electrocardiographic measure of atrial and atrioventricular nodal conduction. Changes in PR duration may be associated with atrial fibrillation, heart failure and all-cause mortality. Hispanic/Latino populations have high burdens of cardiovascular morbidity and mortality, are highly admixed and represent exceptional opportunities for novel locus identification. However, they remain chronically understudied. We present the first genome-wide association study (GWAS) of PR in 14 756 participants of Hispanic/Latino ancestry from three studies. Study-specific summary results of the association between 1000 Genomes Phase 1 imputed single-nucleotide polymorphisms (SNPs) and PR assumed an additive genetic model and were adjusted for global ancestry, study centre/region and clinical covariates. Results were combined using fixed-effects, inverse variance weighted meta-analysis. Sequential conditional analyses were used to identify independent signals. Replication of novel loci was performed in populations of Asian, African and European descent. ENCODE and RoadMap data were used to annotate results. We identified a novel genome-wide association (PPR at ID2 (rs6730558), which replicated in Asian and European populations (PPR loci to Hispanics/Latinos. Bioinformatics annotation provided evidence for regulatory function in cardiac tissue. Further, for six loci that generalised, the Hispanic/Latino index SNP was genome-wide significant and identical to (or in high linkage disequilibrium with) the previously identified GWAS lead SNP. Our results suggest that genetic determinants of PR are consistent across race/ethnicity, but extending studies to admixed populations can identify novel associations, underscoring the importance of conducting genetic studies in diverse populations. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise
Chandra Sekhar Reddy Chilamakuri
Full Text Available Accurate functional annotation of protein sequences is hampered by important factors such as the failure of sequence search methods to identify relationships and the inherent diversity in function of proteins related at low sequence similarities. Earlier, we had employed intermediate sequence search approach to establish new domain relationships in the unassigned regions of gene products at the whole genome level by taking Mycoplasma gallisepticum as a specific example and established new domain relationships. In this paper, we report a detailed comparison of the conservation status of the domain and domain architectures of the gene products that bear our newly predicted domains amongst 14 other Mycoplasma genomes and reported the probable implications for the organisms. Some of the domain associations, observed in Mycoplasma that afflict humans and other non-human primates, are involved in regulation of solute transport and DNA binding suggesting specific modes of host-pathogen interactions.
Ruth, Katherine S; Campbell, Purdey J; Chew, Shelby; Lim, Ee Mun; Hadlow, Narelle; Stuckey, Bronwyn G A; Brown, Suzanne J; Feenstra, Bjarke; Joseph, John; Surdulescu, Gabriela L; Zheng, Hou Feng; Richards, J Brent; Murray, Anna; Spector, Tim D; Wilson, Scott G; Perry, John R B
Genetic factors contribute strongly to sex hormone levels, yet knowledge of the regulatory mechanisms remains incomplete. Genome-wide association studies (GWAS) have identified only a small number of loci associated with sex hormone levels, with several reproductive hormones yet to be assessed. The aim of the study was to identify novel genetic variants contributing to the regulation of sex hormones. We performed GWAS using genotypes imputed from the 1000 Genomes reference panel. The study used genotype and phenotype data from a UK twin register. We included 2913 individuals (up to 294 males) from the Twins UK study, excluding individuals receiving hormone treatment. Phenotypes were standardised for age, sex, BMI, stage of menstrual cycle and menopausal status. We tested 7,879,351 autosomal SNPs for association with levels of dehydroepiandrosterone sulphate (DHEAS), oestradiol, free androgen index (FAI), follicle-stimulating hormone (FSH), luteinizing hormone (LH), prolactin, progesterone, sex hormone-binding globulin and testosterone. Eight independent genetic variants reached genome-wide significance (P<5 × 10(-8)), with minor allele frequencies of 1.3-23.9%. Novel signals included variants for progesterone (P=7.68 × 10(-12)), oestradiol (P=1.63 × 10(-8)) and FAI (P=1.50 × 10(-8)). A genetic variant near the FSHB gene was identified which influenced both FSH (P=1.74 × 10(-8)) and LH (P=3.94 × 10(-9)) levels. A separate locus on chromosome 7 was associated with both DHEAS (P=1.82 × 10(-14)) and progesterone (P=6.09 × 10(-14)). This study highlights loci that are relevant to reproductive function and suggests overlap in the genetic basis of hormone regulation.
Kumar, Narender; Mariappan, Vanitha; Baddam, Ramani; Lankapalli, Aditya K; Shaik, Sabiha; Goh, Khean-Lee; Loke, Mun Fai; Perkins, Tim; Benghezal, Mohammed; Hasnain, Seyed E; Vadivelu, Jamuna; Marshall, Barry J; Ahmed, Niyaz
The discordant prevalence of Helicobacter pylori and its related diseases, for a long time, fostered certain enigmatic situations observed in the countries of the southern world. Variation in H. pylori infection rates and disease outcomes among different populations in multi-ethnic Malaysia provides a unique opportunity to understand dynamics of host-pathogen interaction and genome evolution. In this study, we extensively analyzed and compared genomes of 27 Malaysian H. pylori isolates and identified three major phylogeographic lineages: hspEastAsia, hpEurope and hpSouthIndia. The analysis of the virulence genes within the core genome, however, revealed a comparable pathogenic potential of the strains. In addition, we identified four genes limited to strains of East-Asian lineage. Our analyses identified a few strain-specific genes encoding restriction modification systems and outlined 311 core genes possibly under differential evolutionary constraints, among the strains representing different ethnic groups. The cagA and vacA genes also showed variations in accordance with the host genetic background of the strains. Moreover, restriction modification genes were found to be significantly enriched in East-Asian strains. An understanding of these variations in the genome content would provide significant insights into various adaptive and host modulation strategies harnessed by H. pylori to effectively persist in a host-specific manner. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Boone, Philip M.; Soens, Zachry T.; Campbell, Ian M.; Stankiewicz, Pawel; Cheung, Sau Wai; Patel, Ankita; Beaudet, Arthur L.; Plon, Sharon E.; Shaw, Chad A.; McGuire, Amy L.; Lupski, James R.
Purpose Mutational load of susceptibility variants has not been studied on a genomic scale in a clinical population, nor has the potential to identify these mutations as incidental findings during clinical testing been systematically ascertained. Methods Array comparative genomic hybridization, a method for genome-wide detection of DNA copy-number variants, was performed clinically on DNA from 9,005 individuals. Copy-number variants encompassing or disrupting single genes were identified and analyzed for their potential to confer predisposition to dominant, adult-onset disease. Multigene copy-number variants affecting dominant, adult-onset cancer syndrome genes were also assessed. Results In our cohort, 83 single-gene copy-number variants affected 40 unique genes associated with dominant, adult-onset disorders and unrelated to the patients’ referring diagnoses (i.e., incidental) were found. Fourteen of these copy-number variants are likely disease-predisposing, 25 are likely benign, and 44 are of unknown clinical consequence. When incidental copy-number variants spanning up to 20 genes were considered, 27 copy-number variants affected 17 unique genes associated with dominant, adult-onset cancer predisposition. Conclusion Copy-number variants potentially conferring susceptibility to adult-onset disease can be identified as incidental findings during routine genome-wide testing. Some of these mutations may be medically actionable, enabling disease surveillance or prevention; however, most incidentally observed single-gene copy-number variants are currently of unclear significance to the patient. PMID:22878507
Full Text Available A genome-wide association study was performed to identify genetic factors involved in susceptibility to psoriasis (PS and psoriatic arthritis (PSA, inflammatory diseases of the skin and joints in humans. 223 PS cases (including 91 with PSA were genotyped with 311,398 single nucleotide polymorphisms (SNPs, and results were compared with those from 519 Northern European controls. Replications were performed with an independent cohort of 577 PS cases and 737 controls from the U.S., and 576 PSA patients and 480 controls from the U.K.. Strongest associations were with the class I region of the major histocompatibility complex (MHC. The most highly associated SNP was rs10484554, which lies 34.7 kb upstream from HLA-C (P = 7.8x10(-11, GWA scan; P = 1.8x10(-30, replication; P = 1.8x10(-39, combined; U.K. PSA: P = 6.9x10(-11. However, rs2395029 encoding the G2V polymorphism within the class I gene HCP5 (combined P = 2.13x10(-26 in U.S. cases yielded the highest ORs with both PS and PSA (4.1 and 3.2 respectively. This variant is associated with low viral set point following HIV infection and its effect is independent of rs10484554. We replicated the previously reported association with interleukin 23 receptor and interleukin 12B (IL12B polymorphisms in PS and PSA cohorts (IL23R: rs11209026, U.S. PS, P = 1.4x10(-4; U.K. PSA: P = 8.0x10(-4; IL12B:rs6887695, U.S. PS, P = 5x10(-5 and U.K. PSA, P = 1.3x10(-3 and detected an independent association in the IL23R region with a SNP 4 kb upstream from IL12RB2 (P = 0.001. Novel associations replicated in the U.S. PS cohort included the region harboring lipoma HMGIC fusion partner (LHFP and conserved oligomeric golgi complex component 6 (COG6 genes on chromosome 13q13 (combined P = 2x10(-6 for rs7993214; OR = 0.71, the late cornified envelope gene cluster (LCE from the Epidermal Differentiation Complex (PSORS4 (combined P = 6.2x10(-5 for rs6701216; OR 1.45 and a region of LD at 15q21 (combined P = 2.9x10(-5 for rs
Full Text Available Methanopyrus spp. are usually isolated from harsh niches, such as high osmotic pressure and extreme temperature. However, the molecular mechanisms for their environmental adaption are poorly understood. Archaeal species is commonly considered as primitive organism. The evolutional placement of archaea is a fundamental and intriguing scientific question. We sequenced the genomes of Methanopyrus strains SNP6 and KOL6 isolated from the Atlantic and Iceland, respectively. Comparative genomic analysis revealed genetic diversity and instability implicated in niche adaption, including a number of transporter- and integrase/transposase-related genes. Pan-genome analysis also defined the gene pool of Methanopyrus spp., in addition of ~120-Kb genomic region of plasticity impacting cognate genomic architecture. We believe that Methanopyrus genomics could facilitate efficient investigation/recognition of archaeal phylogenetic diverse patterns, as well as improve understanding of biological roles and significance of these versatile microbes.
Full Text Available Abstract Background Linkage analyses strongly suggest a number of QTL for production, health and conformation traits in the middle part of bovine chromosome 6 (BTA6. The identification of the molecular background underlying the genetic variation at the QTL and subsequent functional studies require a well-annotated gene sequence map of the critical QTL intervals. To complete the sequence map of the defined subchromosomal regions on BTA6 poorly covered with comparative gene information, we focused on targeted isolation of transcribed sequences from bovine bacterial artificial chromosome (BAC clones mapped to the QTL intervals. Results Using the method of exon trapping, 92 unique exon trapping sequences (ETS were discovered in a chromosomal region of poor gene coverage. Sequence identity to the current NCBI sequence assembly for BTA6 was detected for 91% of unique ETS. Comparative sequence similarity search revealed that 11% of the isolated ETS displayed high similarity to genomic sequences located on the syntenic chromosomes of the human and mouse reference genome assemblies. Nearly a third of the ETS identified similar equivalent sequences in genomic sequence scaffolds from the alternative Celera-based sequence assembly of the human genome. Screening gene, EST, and protein databases detected 17% of ETS with identity to known transcribed sequences. Expression analysis of a subset of the ETS showed that most ETS (84% displayed a distinctive expression pattern in a multi-tissue panel of a lactating cow verifying their existence in the bovine transcriptome. Conclusion The results of our study demonstrate that the exon trapping method based on region-specific BAC clones is very useful for targeted screening for novel transcripts located within a defined chromosomal region being deficiently endowed with annotated gene information. The majority of identified ETS represents unknown noncoding sequences in intergenic regions on BTA6 displaying a
Harvey Steven P
Full Text Available Abstract Background The facultative, intracellular bacterium Burkholderia pseudomallei is the causative agent of melioidosis, a serious infectious disease of humans and animals. We identified and categorized tandem repeat arrays and their distribution throughout the genome of B. pseudomallei strain K96243 in order to develop a genetic typing method for B. pseudomallei. We then screened 104 of the potentially polymorphic loci across a diverse panel of 31 isolates including B. pseudomallei, B. mallei and B. thailandensis in order to identify loci with varying degrees of polymorphism. A subset of these tandem repeat arrays were subsequently developed into a multiple-locus VNTR analysis to examine 66 B. pseudomallei and 21 B. mallei isolates from around the world, as well as 95 lineages from a serial transfer experiment encompassing ~18,000 generations. Results B. pseudomallei contains a preponderance of tandem repeat loci throughout its genome, many of which are duplicated elsewhere in the genome. The majority of these loci are composed of repeat motif lengths of 6 to 9 bp with 4 to 10 repeat units and are predominately located in intergenic regions of the genome. Across geographically diverse B. pseudomallei and B.mallei isolates, the 32 VNTR loci displayed between 7 and 28 alleles, with Nei's diversity values ranging from 0.47 and 0.94. Mutation rates for these loci are comparable (>10-5 per locus per generation to that of the most diverse tandemly repeated regions found in other less diverse bacteria. Conclusion The frequency, location and duplicate nature of tandemly repeated regions within the B. pseudomallei genome indicate that these tandem repeat regions may play a role in generating and maintaining adaptive genomic variation. Multiple-locus VNTR analysis revealed extensive diversity within the global isolate set containing B. pseudomallei and B. mallei, and it detected genotypic differences within clonal lineages of both species that were
Cordes, Alexander; Gehrke, Birgit; Römisch, Roman; Rammer, Christian; Schliessler, Paula; Wassmann, Pia
[Introduction ...] Overall, this report is structured as follows: the next chapter (2) briefly outlines the relevance of regional trade indicators for determining the competitiveness of a region. In chapter 3, the methodology for the calculation of regional trade performance indicators is introduced, and the elementary results are described. Chapter 4 presents an econometric analysis relating key regional characteristics to international success of local industries. Based upon the regional di...
Full Text Available Autism is a common heritable neurodevelopmental disorder with complex etiology. Several genome-wide linkage and association scans have been carried out to identify regions harboring genes related to autism or autism spectrum disorders, with mixed results. Given the overlap in autism features with genetic abnormalities known to be associated with imprinting, one possible reason for lack of consistency would be the influence of parent-of-origin effects that may mask the ability to detect linkage and association.We have performed a genome-wide linkage scan that accounts for potential parent-of-origin effects using 16,311 SNPs among families from the Autism Genetic Resource Exchange (AGRE and the National Institute of Mental Health (NIMH autism repository. We report parametric (GH, Genehunter and allele-sharing linkage (Aspex results using a broad spectrum disorder case definition. Paternal-origin genome-wide statistically significant linkage was observed on chromosomes 4 (LOD(GH = 3.79, empirical p<0.005 and LOD(Aspex = 2.96, p = 0.008, 15 (LOD(GH = 3.09, empirical p<0.005 and LOD(Aspex = 3.62, empirical p = 0.003 and 20 (LOD(GH = 3.36, empirical p<0.005 and LOD(Aspex = 3.38, empirical p = 0.006.These regions may harbor imprinted sites associated with the development of autism and offer fruitful domains for molecular investigation into the role of epigenetic mechanisms in autism.
Full Text Available Reliable identification of copy number aberrations (CNA from comparative genomic hybridization data would be improved by the availability of a generalised method for processing large datasets. To this end, we developed swatCGH, a data analysis framework and region detection heuristic for computational grids. swatCGH analyses sequentially displaced (sliding windows of neighbouring probes and applies adaptive thresholds of varying stringency to identify the 10% of each chromosome that contains the most frequently occurring CNAs. We used the method to analyse a published dataset, comparing data preprocessed using four different DNA segmentation algorithms, and two methods for prioritising the detected CNAs. The consolidated list of the most commonly detected aberrations confirmed the value of swatCGH as a simplified high-throughput method for identifying biologically significant CNA regions of interest.
Full Text Available BACKGROUND: It is difficult to identify copy number variations (CNV in normal human genomic data due to noise and non-linear relationships between different genomic regions and signal intensity. A high-resolution array comparative genomic hybridization (aCGH containing 42 million probes, which is very large compared to previous arrays, was recently published. Most existing CNV detection algorithms do not work well because of noise associated with the large amount of input data and because most of the current methods were not designed to analyze normal human samples. Normal human genome analysis often requires a joint approach across multiple samples. However, the majority of existing methods can only identify CNVs from a single sample. METHODOLOGY AND PRINCIPAL FINDINGS: We developed a multi-sample-based genomic variations detector (MGVD that uses segmentation to identify common breakpoints across multiple samples and a k-means-based clustering strategy. Unlike previous methods, MGVD simultaneously considers multiple samples with different genomic intensities and identifies CNVs and CNV zones (CNVZs; CNVZ is a more precise measure of the location of a genomic variant than the CNV region (CNVR. CONCLUSIONS AND SIGNIFICANCE: We designed a specialized algorithm to detect common CNVs from extremely high-resolution multi-sample aCGH data. MGVD showed high sensitivity and a low false discovery rate for a simulated data set, and outperformed most current methods when real, high-resolution HapMap datasets were analyzed. MGVD also had the fastest runtime compared to the other algorithms evaluated when actual, high-resolution aCGH data were analyzed. The CNVZs identified by MGVD can be used in association studies for revealing relationships between phenotypes and genomic aberrations. Our algorithm was developed with standard C++ and is available in Linux and MS Windows format in the STL library. It is freely available at: http://embio.yonsei.ac.kr/~Park/mgvd.php.
Kantor, G.J.; Deiss-Tolbert, D.M.
Size separation after UV-endonuclease digestion of DNA from UV-irradiated human cells using denaturing conditions fractionates the genome based on cyclobutane pyrimidine dimer content. We have examined the largest molecules available (50-80 kb; about 5% of the DNA) after fractionation and those of average size (5-15 kb) for content of some specific genes. We find that the largest molecules are not a representative sampling of the genome. Three contiguous genes located in a G+C-rich isochore (tyrosine hydroxylase, insulin, insulin-like growth factor II) have concentrations two to three times greater in the largest molecules. This shows that this genomic region has fewer pyrimidine dimers than most other genomic regions. In contrast, the β-actin genomic region, which has a similar G+C content, has an equal concentration in both fractions as do the p53 and β-globin genomic regions, which are A+T-rich. These data show that DNA damage in the form of cyclobutane pyrimidine dimers occurs with different probabilities in specific isochores. Part of the reason may be the relative G-C content, but other factors must play a significant role. We also report that the transcriptionally inactive insulin region is repaired at the genome-overall rate in normal cells and is not repaired in xeroderma pigmentosum complementation group C cells. (author)
Huerta, Araceli M.; Francino, M. Pilar; Morett, Enrique; Collado-Vides, Julio
The evolutionary processes operating in the DNA regions that participate in the regulation of gene expression are poorly understood. In Escherichia coli, we have established a sequence pattern that distinguishes regulatory from nonregulatory regions. The density of promoter-like sequences, that are recognizable by RNA polymerase and may function as potential promoters, is high within regulatory regions, in contrast to coding regions and regions located between convergently-transcribed genes. Moreover, functional promoter sites identified experimentally are often found in the subregions of highest density of promoter-like signals, even when individual sites with higher binding affinity for RNA polymerase exist elsewhere within the regulatory region. In order to investigate the generality of this pattern, we have used position weight matrices describing the -35 and -10 promoter boxes of E. coli to search for these motifs in 43 additional genomes belonging to most established bacterial phyla, after specific calibration of the matrices according to the base composition of the noncoding regions of each genome. We have found that all bacterial species analyzed contain similar promoter-like motifs, and that, in most cases, these motifs follow the same genomic distribution observed in E. coli. Differential densities between regulatory and nonregulatory regions are detectable in most bacterial genomes, with the exception of those that have experienced evolutionary extreme genome reduction. Thus, the phylogenetic distribution of this pattern mirrors that of genes and other genomic features that require weak selection to be effective in order to persist. On this basis, we suggest that the loss of differential densities in the reduced genomes of host-restricted pathogens and symbionts is the outcome of a process of genome degradation resulting from the decreased efficiency of purifying selection in highly structured small populations. This implies that the differential
Berndt, S.I.; Skibola, C.F.; Joseph, V.; Camp, N.J.; Nieters, A.; Wang, Z.; Cozen, W.; Monnereau, A.; Wang, S.S.; Kelly, R.S.; Lan, Q.; Teras, L.R.; Chatterjee, N.; Chung, C.C.; Yeager, M.
Genome-wide association studies (GWAS) have previously identified 13 loci associated with risk of chronic lymphocytic leukemia or small lymphocytic lymphoma (CLL). To identify additional CLL susceptibility loci, we conducted the largest meta-analysis for CLL thus far, including four GWAS with a total of 3,100 individuals with CLL (cases) and 7,667 controls. In the meta-analysis, we identified ten independent associated SNPs in nine new loci at 10q23.31 (ACTA2 or FAS (ACTA2/FAS), P = 1.22 × 10...
Full Text Available Duplications play a significant role in both extremes of the phenotypic spectrum of newly arising mutations: they can have severe deleterious effects (e.g. duplications underlie a variety of diseases but can also be highly advantageous. The phenotypic potential of newly arisen duplications has stimulated wide interest in both the mutational and selective processes shaping these variants in the genome. Here we take advantage of the Drosophila simulans-Drosophila melanogaster genetic system to further our understanding of both processes. Regarding mutational processes, the study of two closely related species allows investigation of the potential existence of shared duplication hotspots, and the similarities and differences between the two genomes can be used to dissect its underlying causes. Regarding selection, the difference in the effective population size between the two species can be leveraged to ask questions about the strength of selection acting on different classes of duplications. In this study, we conducted a survey of duplication polymorphisms in 14 different lines of D. simulans using tiling microarrays and combined it with an analogous survey for the D. melanogaster genome. By integrating the two datasets, we identified duplication hotspots conserved between the two species. However, unlike the duplication hotspots identified in mammalian genomes, Drosophila duplication hotspots are not associated with sequences of high sequence identity capable of mediating non-allelic homologous recombination. Instead, Drosophila duplication hotspots are associated with late-replicating regions of the genome, suggesting a link between DNA replication and duplication rates. We also found evidence supporting a higher effectiveness of selection on duplications in D. simulans than in D. melanogaster. This is also true for duplications segregating at high frequency, where we find evidence in D. simulans that a sizeable fraction of these mutations is
Zhang, Yan-Cong; Lin, Kui
Overlapping genes (OGs) represent one type of widespread genomic feature in bacterial genomes and have been used as rare genomic markers in phylogeny inference of closely related bacterial species. However, the inference may experience a decrease in performance for phylogenomic analysis of too closely or too distantly related genomes. Another drawback of OGs as phylogenetic markers is that they usually take little account of the effects of genomic rearrangement on the similarity estimation, such as intra-chromosome/genome translocations, horizontal gene transfer, and gene losses. To explore such effects on the accuracy of phylogeny reconstruction, we combine phylogenetic signals of OGs with collinear genomic regions, here called locally collinear blocks (LCBs). By putting these together, we refine our previous metric of pairwise similarity between two closely related bacterial genomes. As a case study, we used this new method to reconstruct the phylogenies of 88 Enterobacteriale genomes of the class Gammaproteobacteria. Our results demonstrated that the topological accuracy of the inferred phylogeny was improved when both OGs and LCBs were simultaneously considered, suggesting that combining these two phylogenetic markers may reduce, to some extent, the influence of gene loss on phylogeny inference. Such phylogenomic studies, we believe, will help us to explore a more effective approach to increasing the robustness of phylogeny reconstruction of closely related bacterial organisms. PMID:26715828
Barban, Nicola; Jansen, Rick; de Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J.; Tropf, Felix C.; Shen, Xia; Wilson, James F.; Chasman, Daniel I.; Nolte, Ilja M.; Tragante, Vinicius; van der Laan, Sander W.; Perry, John R. B.; Kong, Augustine; Ahluwalia, Tarunveer; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F.; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E; Turman, Constance; Concas, Maria Pina; Cordell, Heather J.; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F.; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J.; Gieger, Christian; Gunderson, Erica P.; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K.; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A.; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F.; McMahon, George; Meddens, S. Fleur W.; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A.; Monnereau, Claire; van der Most, Peter J.; Myhre, Ronny; Nalls, Mike A.; Nutile, Teresa; Panagiota, Kalafati Ioanna; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B.; Rich-Edwards, Janet; Rietveld, Cornelius A.; Robino, Antonietta; Rose, Lynda M.; Rueedi, Rico; Ryan, Kathy; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A.; Stolk, Lisette; Streeten, Elizabeth; Tonjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V.; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I.; Buring, Julie E.; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R.; Cucca, Francesco; Daniela, Toniolo; Davey-Smith, George; Deary, Ian J.; Dedoussis, George; Deloukas, Panos; van Duijn, Cornelia M.; de Geus, Eco JC.; Eriksson, Johan G.; Evans, Denis A.; Faul, Jessica D.; Felicita, Sala Cinzia; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J.F.; de Haan, Hugoline G.; Haerting, Johannes; Harris, Tamara B.; Heath, Andrew C.; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G; Hopper, John; Hypponen, Elina; Jacobsson, Bo; Jaddoe, Vincent W. V.; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L.R.; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William; Lai, Sandra; Lehtimäki, Terho; Liewald, David C; Lindgren, Cecilia; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; McQuillan, Ruth; Medland, Sarah E.; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Michela, Traglia; Milani, Lili; Mitchell, Paul; Montgomery, Grant W.; Mook-Kanamori, Dennis; de Mutsert, Renée; Nohr, Ellen A; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K.; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda WJH; Perola, Markus; Peyser, Patricia A.; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J.; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M.; Ring, Susan M.; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D.; Starr, John M.; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A.; Thorsteinsdottir, Unnur; Thurik, A. Roy; Timpson, Nicholas J.; Tönjes, Anke; Tung, Joyce Y.; Uitterlinden, André G.; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G.; Wang, Jie Jin; Wareham, Nicholas J.; Weir, David R.; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F.; Zondervan, Krina T.; Stefansson, Kari; Krueger, Robert F.; Lee, James J.; Benjamin, Daniel J.; Cesarini, David; Koellinger, Philipp D.; den Hoed, Marcel; Snieder, Harold; Mills, Melinda C.
The genetic architecture of human reproductive behavior – age at first birth (AFB) and number of children ever born (NEB) – has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified and the underlying mechanisms of AFB and NEB are poorly understood. We report the largest genome-wide association study to date of both sexes including 251,151 individuals for AFB and 343,072 for NEB. We identified 12 independent loci that are significantly associated with AFB and/or NEB in a SNP-based genome-wide association study, and four additional loci in a gene-based effort. These loci harbor genes that are likely to play a role – either directly or by affecting non-local gene expression – in human reproduction and infertility, thereby increasing our understanding of these complex traits. PMID:27798627
Full Text Available We performed a genome wide analysis of 164 urothelial carcinoma samples and 27 bladder cancer cell lines to identify copy number changes associated with disease characteristics, and examined the association of amplification events with stage and grade of disease. Multiplex inversion probe (MIP analysis, a recently developed genomic technique, was used to study 80 urothelial carcinomas to identify mutations and copy number changes. Selected amplification events were then analyzed in a validation cohort of 84 bladder cancers by multiplex ligation-dependent probe assay (MLPA. In the MIP analysis, 44 regions of significant copy number change were identified using GISTIC. Nine gene-containing regions of amplification were selected for validation in the second cohort by MLPA. Amplification events at these 9 genomic regions were found to correlate strongly with stage, being seen in only 2 of 23 (9% Ta grade 1 or 1-2 cancers, in contrast to 31 of 61 (51% Ta grade 3 and T2 grade 2 cancers, p<0.001. These observations suggest that analysis of genomic amplification of these 9 regions might help distinguish non-invasive from invasive urothelial carcinoma, although further study is required. Both MIP and MLPA methods perform well on formalin-fixed paraffin-embedded DNA, enhancing their potential clinical use. Furthermore several of the amplified genes identified here (ERBB2, MDM2, CCND1 are potential therapeutic targets.
Michailidou, Kyriaki; Beesley, Jonathan; Lindstrom, Sara; Canisius, Sander; Dennis, Joe; Lush, Michael J; Maranian, Mel J; Bolla, Manjeet K; Wang, Qin; Shah, Mitul; Perkins, Barbara J; Czene, Kamila; Eriksson, Mikael; Darabi, Hatef; Brand, Judith S; Bojesen, Stig E; Nordestgaard, Børge G; Flyger, Henrik; Nielsen, Sune F; Rahman, Nazneen; Turnbull, Clare; Fletcher, Olivia; Peto, Julian; Gibson, Lorna; dos-Santos-Silva, Isabel; Chang-Claude, Jenny; Flesch-Janys, Dieter; Rudolph, Anja; Eilber, Ursula; Behrens, Sabine; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Khan, Sofia; Aaltonen, Kirsimari; Ahsan, Habibul; Kibriya, Muhammad G; Whittemore, Alice S; John, Esther M; Malone, Kathleen E; Gammon, Marilie D; Santella, Regina M; Ursin, Giske; Makalic, Enes; Schmidt, Daniel F; Casey, Graham; Hunter, David J; Gapstur, Susan M; Gaudet, Mia M; Diver, W Ryan; Haiman, Christopher A; Schumacher, Fredrick; Henderson, Brian E; Le Marchand, Loic; Berg, Christine D; Chanock, Stephen J; Figueroa, Jonine; Hoover, Robert N; Lambrechts, Diether; Neven, Patrick; Wildiers, Hans; van Limbergen, Erik; Schmidt, Marjanka K; Broeks, Annegien; Verhoef, Senno; Cornelissen, Sten; Couch, Fergus J; Olson, Janet E; Hallberg, Emily; Vachon, Celine; Waisfisz, Quinten; Meijers-Heijboer, Hanne; Adank, Muriel A; van der Luijt, Rob B; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K; Yoo, Keun-Young; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Tajima, Kazuo; Guénel, Pascal; Truong, Thérèse; Mulot, Claire; Sanchez, Marie; Burwinkel, Barbara; Marme, Frederik; Surowy, Harald; Sohn, Christof; Wu, Anna H; Tseng, Chiu-chen; Van Den Berg, David; Stram, Daniel O; González-Neira, Anna; Benitez, Javier; Zamora, M Pilar; Perez, Jose Ignacio Arias; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Cox, Angela; Cross, Simon S; Reed, Malcolm W R; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Mulligan, Anna Marie; Sawyer, Elinor J; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Lindblom, Annika; Margolin, Sara; Teo, Soo Hwang; Yip, Cheng Har; Taib, Nur Aishah Mohd; Tan, Gie-Hooi; Hooning, Maartje J; Hollestelle, Antoinette; Martens, John W M; Collée, J Margriet; Blot, William; Signorello, Lisa B; Cai, Qiuyin; Hopper, John L; Southey, Melissa C; Tsimiklis, Helen; Apicella, Carmel; Shen, Chen-Yang; Hsiung, Chia-Ni; Wu, Pei-Ei; Hou, Ming-Feng; Kristensen, Vessela N; Nord, Silje; Alnaes, Grethe I Grenaker; Giles, Graham G; Milne, Roger L; McLean, Catriona; Canzian, Federico; Trichopoulos, Dimitrios; Peeters, Petra; Lund, Eiliv; Sund, Malin; Khaw, Kay-Tee; Gunter, Marc J; Palli, Domenico; Mortensen, Lotte Maxild; Dossus, Laure; Huerta, Jose-Maria; Meindl, Alfons; Schmutzler, Rita K; Sutter, Christian; Yang, Rongxi; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Hartman, Mikael; Miao, Hui; Chia, Kee Seng; Chan, Ching Wan; Fasching, Peter A; Hein, Alexander; Beckmann, Matthias W; Haeberle, Lothar; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk J; Swerdlow, Anthony J; Brinton, Louise; Garcia-Closas, Montserrat; Zheng, Wei; Halverson, Sandra L; Shrubsole, Martha; Long, Jirong; Goldberg, Mark S; Labrèche, France; Dumont, Martine; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Brauch, Hiltrud; Hamann, Ute; Brüning, Thomas; Radice, Paolo; Peterlongo, Paolo; Manoukian, Siranoush; Bernard, Loris; Bogdanova, Natalia V; Dörk, Thilo; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; Van Asperen, Christi J; Jakubowska, Anna; Lubinski, Jan; Jaworska, Katarzyna; Huzarski, Tomasz; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Slager, Susan; Toland, Amanda E; Ambrosone, Christine B; Yannoukakos, Drakoulis; Kabisch, Maria; Torres, Diana; Neuhausen, Susan L; Anton-Culver, Hoda; Luccarini, Craig; Baynes, Caroline; Ahmed, Shahana; Healey, Catherine S; Tessier, Daniel C; Vincent, Daniel; Bacot, Francois; Pita, Guillermo; Alonso, M Rosario; Álvarez, Nuria; Herrero, Daniel; Simard, Jacques; Pharoah, Paul P D P; Kraft, Peter; Dunning, Alison M; Chenevix-Trench, Georgia; Hall, Per; Easton, Douglas F
Genome-wide association studies (GWAS) and large-scale replication studies have identified common variants in 79 loci associated with breast cancer, explaining ∼14% of the familial risk of the disease. To identify new susceptibility loci, we performed a meta-analysis of 11 GWAS, comprising 15,748 breast cancer cases and 18,084 controls together with 46,785 cases and 42,892 controls from 41 studies genotyped on a 211,155-marker custom array (iCOGS). Analyses were restricted to women of European ancestry. We generated genotypes for more than 11 million SNPs by imputation using the 1000 Genomes Project reference panel, and we identified 15 new loci associated with breast cancer at P association analysis with ChIP-seq chromatin binding data in mammary cell lines and ChIA-PET chromatin interaction data from ENCODE, we identified likely target genes in two regions: SETBP1 at 18q12.3 and RNF115 and PDZK1 at 1q21.1. One association appears to be driven by an amino acid substitution encoded in EXO1.
Noor, Dzul Azri Mohamed; Jeyapalan, Jennie N; Alhazmi, Safiah; Carr, Matthew; Squibb, Benjamin; Wallace, Claire; Tan, Christopher; Cusack, Martin; Hughes, Jaime; Reader, Tom; Shipley, Janet; Sheer, Denise; Scotting, Paul J
Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome-wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours' biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription-quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes.
Okbay, Aysu; Beauchamp, Jonathan; Fontana, M.A. (Mark Alan); Lee, James J.; Pers, Tune; Rietveld, C.A. (Cornelius A.); Turley, Patrick; Chen, G.-B. (Guo-Bo); Emilsson, Valur; Meddens, S.F.W. (S. Fleur W.); Oskarsson, S. (Sven); Pickrell, J.K. (Joseph K.); Thom, K. (Kevin); Timshel, P. (Pascal); Vlaming, Ronald
textabstractEducational attainment is strongly influenced by social and other environmental factors, but genetic factors are estimated to account for at least 20% of the variation across individuals. Here we report the results of a genome-wide association study (GWAS) for educational attainment that extends our earlier discovery sample of 101,069 individuals to 293,723 individuals, and a replication study in an independent sample of 111,349 individuals from the UK Biobank. We identify 74 geno...
Feltus Frank A
Full Text Available Abstract Background Switchgrass, a C4 species and a warm-season grass native to the prairies of North America, has been targeted for development into an herbaceous biomass fuel crop. Genetic improvement of switchgrass feedstock traits through marker-assisted breeding and biotechnology approaches calls for genomic tools development. Establishment of integrated physical and genetic maps for switchgrass will accelerate mapping of value added traits useful to breeding programs and to isolate important target genes using map based cloning. The reported polyploidy series in switchgrass ranges from diploid (2X = 18 to duodecaploid (12X = 108. Like in other large, repeat-rich plant genomes, this genomic complexity will hinder whole genome sequencing efforts. An extensive physical map providing enough information to resolve the homoeologous genomes would provide the necessary framework for accurate assembly of the switchgrass genome. Results A switchgrass BAC library constructed by partial digestion of nuclear DNA with EcoRI contains 147,456 clones covering the effective genome approximately 10 times based on a genome size of 3.2 Gigabases (~1.6 Gb effective. Restriction digestion and PFGE analysis of 234 randomly chosen BACs indicated that 95% of the clones contained inserts, ranging from 60 to 180 kb with an average of 120 kb. Comparative sequence analysis of two homoeologous genomic regions harboring orthologs of the rice OsBRI1 locus, a low-copy gene encoding a putative protein kinase and associated with biomass, revealed that orthologous clones from homoeologous chromosomes can be unambiguously distinguished from each other and correctly assembled to respective fingerprint contigs. Thus, the data obtained not only provide genomic resources for further analysis of switchgrass genome, but also improve efforts for an accurate genome sequencing strategy. Conclusions The construction of the first switchgrass BAC library and comparative analysis of
Full Text Available Hypertension is a heritable and major contributor to the global burden of disease. The sum of rare and common genetic variants robustly identified so far explain only 1%-2% of the population variation in BP and hypertension. This suggests the existence of more undiscovered common variants. We conducted a genome-wide association study in 1,621 hypertensive cases and 1,699 controls and follow-up validation analyses in 19,845 cases and 16,541 controls using an extreme case-control design. We identified a locus on chromosome 16 in the 5' region of Uromodulin (UMOD; rs13333226, combined P value of 3.6 × 10⁻¹¹. The minor G allele is associated with a lower risk of hypertension (OR [95%CI]: 0.87 [0.84-0.91], reduced urinary uromodulin excretion, better renal function; and each copy of the G allele is associated with a 7.7% reduction in risk of CVD events after adjusting for age, sex, BMI, and smoking status (H.R. = 0.923, 95% CI 0.860-0.991; p = 0.027. In a subset of 13,446 individuals with estimated glomerular filtration rate (eGFR measurements, we show that rs13333226 is independently associated with hypertension (unadjusted for eGFR: 0.89 [0.83-0.96], p = 0.004; after eGFR adjustment: 0.89 [0.83-0.96], p = 0.003. In clinical functional studies, we also consistently show the minor G allele is associated with lower urinary uromodulin excretion. The exclusive expression of uromodulin in the thick portion of the ascending limb of Henle suggests a putative role of this variant in hypertension through an effect on sodium homeostasis. The newly discovered UMOD locus for hypertension has the potential to give new insights into the role of uromodulin in BP regulation and to identify novel drugable targets for reducing cardiovascular risk.
Allison David B
Full Text Available Abstract Background HIV susceptibility and pathogenicity exhibit both interindividual and intergroup variability. The etiology of intergroup variability is still poorly understood, and could be partly linked to genetic differences among racial/ethnic groups. These genetic differences may be traceable to different regimes of natural selection in the 60,000 years since the human radiation out of Africa. Here, we examine population differentiation and haplotype patterns at several loci identified through genome-wide association studies on HIV-1 control, as determined by viral-load setpoint, in European and African-American populations. We use genome-wide data from the Human Genome Diversity Project, consisting of 53 world-wide populations, to compare measures of FST and relative extended haplotype homozygosity (REHH at these candidate loci to the rest of the respective chromosome. Results We find that the Europe-Middle East and Europe-South Asia pairwise FST in the most strongly associated region are elevated compared to most pairwise comparisons with the sub-Saharan African group, which exhibit very low FST. We also find genetic signatures of recent positive selection (higher REHH at these associated regions among all groups except for sub-Saharan Africans and Native Americans. This pattern is consistent with one in which genetic differentiation, possibly due to diversifying/positive selection, occurred at these loci among Eurasians. Conclusions These findings are concordant with those from earlier studies suggesting recent evolutionary change at immunity-related genomic regions among Europeans, and shed light on the potential genetic and evolutionary origin of population differences in HIV-1 control.
Lindström, Sara; Thompson, Deborah J.; Paterson, Andrew D.; Li, Jingmei; Gierach, Gretchen L.; Scott, Christopher; Stone, Jennifer; Douglas, Julie A.; dos-Santos-Silva, Isabel; Fernandez-Navarro, Pablo; Verghase, Jajini; Smith, Paula; Brown, Judith; Luben, Robert; Wareham, Nicholas J.; Loos, Ruth J.F.; Heit, John A.; Pankratz, V. Shane; Norman, Aaron; Goode, Ellen L.; Cunningham, Julie M.; deAndrade, Mariza; Vierkant, Robert A.; Czene, Kamila; Fasching, Peter A.; Baglietto, Laura; Southey, Melissa C.; Giles, Graham G.; Shah, Kaanan P.; Chan, Heang-Ping; Helvie, Mark A.; Beck, Andrew H.; Knoblauch, Nicholas W.; Hazra, Aditi; Hunter, David J.; Kraft, Peter; Pollan, Marina; Figueroa, Jonine D.; Couch, Fergus J.; Hopper, John L.; Hall, Per; Easton, Douglas F.; Boyd, Norman F.; Vachon, Celine M.; Tamimi, Rulla M.
Mammographic density reflects the amount of stromal and epithelial tissues in relation to adipose tissue in the breast and is a strong risk factor for breast cancer. Here we report the results from meta-analysis of genome-wide association studies (GWAS) of three mammographic density phenotypes: dense area, non-dense area and percent density in up to 7,916 women in stage 1 and an additional 10,379 women in stage 2. We identify genome-wide significant (P<5×10−8) loci for dense area (AREG, ESR1, ZNF365, LSP1/TNNT3, IGF1, TMEM184B, SGSM3/MKL1), non-dense area (8p11.23) and percent density (PRDM6, 8p11.23, TMEM184B). Four of these regions are known breast cancer susceptibility loci, and four additional regions were found to be associated with breast cancer (P<0.05) in a large meta-analysis. These results provide further evidence of a shared genetic basis between mammographic density and breast cancer and illustrate the power of studying intermediate quantitative phenotypes to identify putative disease susceptibility loci. PMID:25342443
Firnhaber Christopher B
Full Text Available Abstract Background The Notch signaling pathway regulates a diverse array of developmental processes, and aberrant Notch signaling can lead to diseases, including cancer. To obtain a more comprehensive understanding of the genetic network that integrates into Notch signaling, we performed a genome-wide RNAi screen in Drosophila cell culture to identify genes that modify Notch-dependent transcription. Results Employing complementary data analyses, we found 399 putative modifiers: 189 promoting and 210 antagonizing Notch activated transcription. These modifiers included several known Notch interactors, validating the robustness of the assay. Many novel modifiers were also identified, covering a range of cellular localizations from the extracellular matrix to the nucleus, as well as a large number of proteins with unknown function. Chromatin-modifying proteins represent a major class of genes identified, including histone deacetylase and demethylase complex components and other chromatin modifying, remodeling and replacement factors. A protein-protein interaction map of the Notch-dependent transcription modifiers revealed that a large number of the identified proteins interact physically with these core chromatin components. Conclusions The genome-wide RNAi screen identified many genes that can modulate Notch transcriptional output. A protein interaction map of the identified genes highlighted a network of chromatin-modifying enzymes and remodelers that regulate Notch transcription. Our results open new avenues to explore the mechanisms of Notch signal regulation and the integration of this pathway into diverse cellular processes.
Pollard, Katherine S; Salama, Sofie R; King, Bryan
Comparative genomics allow us to search the human genome for segments that were extensively changed in the last approximately 5 million years since divergence from our common ancestor with chimpanzee, but are highly conserved in other species and thus are likely to be functional. We found 202...... genomic elements that are highly conserved in vertebrates but show evidence of significantly accelerated substitution rates in human. These are mostly in non-coding DNA, often near genes associated with transcription and DNA binding. Resequencing confirmed that the five most accelerated elements...... contributed to accelerated evolution of the fastest evolving elements in the human genome....
BACKGROUND: Genome sequencing and bioinformatics have provided the full hypothetical proteome of many pathogenic organisms. Advances in microarray and mass spectrometry have also yielded large output datasets of possible target proteins\\/genes. However, the challenge remains to identify new targets for drug discovery from this wealth of information. Further analysis includes bioinformatics and\\/or molecular biology tools to validate the findings. This is time consuming and expensive, and could fail to yield novel drugs if protein purification and crystallography is impossible. To pre-empt this, a researcher may want to rapidly filter the output datasets for proteins that show good homology to proteins that have already been structurally characterised or proteins that are already targets for known drugs. Critically, those researchers developing novel antibiotics need to select out the proteins that show close homology to any human proteins, as future inhibitors are likely to cross-react with the host protein, causing off-target toxicity effects later in clinical trials. METHODOLOGY\\/PRINCIPAL FINDINGS: To solve many of these issues, we have developed a free online resource called Genomes2Drugs which ranks sequences to identify proteins that are (i) homologous to previously crystallized proteins or (ii) targets of known drugs, but are (iii) not homologous to human proteins. When tested using the Plasmodium falciparum malarial genome the program correctly enriched the ranked list of proteins with known drug target proteins. CONCLUSIONS\\/SIGNIFICANCE: Genomes2Drugs rapidly identifies proteins that are likely to succeed in drug discovery pipelines. This free online resource helps in the identification of potential drug targets. Importantly, the program further highlights proteins that are likely to be inhibited by FDA-approved drugs. These drugs can then be rapidly moved into Phase IV clinical studies under \\'change-of-application\\' patents.
Full Text Available BACKGROUND: Genome sequencing and bioinformatics have provided the full hypothetical proteome of many pathogenic organisms. Advances in microarray and mass spectrometry have also yielded large output datasets of possible target proteins/genes. However, the challenge remains to identify new targets for drug discovery from this wealth of information. Further analysis includes bioinformatics and/or molecular biology tools to validate the findings. This is time consuming and expensive, and could fail to yield novel drugs if protein purification and crystallography is impossible. To pre-empt this, a researcher may want to rapidly filter the output datasets for proteins that show good homology to proteins that have already been structurally characterised or proteins that are already targets for known drugs. Critically, those researchers developing novel antibiotics need to select out the proteins that show close homology to any human proteins, as future inhibitors are likely to cross-react with the host protein, causing off-target toxicity effects later in clinical trials. METHODOLOGY/PRINCIPAL FINDINGS: To solve many of these issues, we have developed a free online resource called Genomes2Drugs which ranks sequences to identify proteins that are (i homologous to previously crystallized proteins or (ii targets of known drugs, but are (iii not homologous to human proteins. When tested using the Plasmodium falciparum malarial genome the program correctly enriched the ranked list of proteins with known drug target proteins. CONCLUSIONS/SIGNIFICANCE: Genomes2Drugs rapidly identifies proteins that are likely to succeed in drug discovery pipelines. This free online resource helps in the identification of potential drug targets. Importantly, the program further highlights proteins that are likely to be inhibited by FDA-approved drugs. These drugs can then be rapidly moved into Phase IV clinical studies under 'change-of-application' patents.
Cornelia Di Gaetano
Full Text Available The peculiar position of Sardinia in the Mediterranean sea has rendered its population an interesting biogeographical isolate. The aim of this study was to investigate the genetic population structure, as well as to estimate Runs of Homozygosity and regions under positive selection, using about 1.2 million single nucleotide polymorphisms genotyped in 1077 Sardinian individuals. Using four different methods--fixation index, inflation factor, principal component analysis and ancestry estimation--we were able to highlight, as expected for a genetic isolate, the high internal homogeneity of the island. Sardinians showed a higher percentage of genome covered by RoHs>0.5 Mb (F(RoH%0.5 when compared to peninsular Italians, with the only exception of the area surrounding Alghero. We furthermore identified 9 genomic regions showing signs of positive selection and, we re-captured many previously inferred signals. Other regions harbor novel candidate genes for positive selection, like TMEM252, or regions containing long non coding RNA. With the present study we confirmed the high genetic homogeneity of Sardinia that may be explained by the shared ancestry combined with the action of evolutionary forces.
Kulbrock, Maike; Lehner, Stefanie; Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar
Equine recurrent uveitis (ERU) is a common eye disease affecting up to 3–15% of the horse population. A genome-wide association study (GWAS) using the Illumina equine SNP50 bead chip was performed to identify loci conferring risk to ERU. The sample included a total of 144 German warmblood horses. A GWAS showed a significant single nucleotide polymorphism (SNP) on horse chromosome (ECA) 20 at 49.3 Mb, with IL-17A and IL-17F being the closest genes. This locus explained a fraction of 23% of the phenotypic variance for ERU. A GWAS taking into account the severity of ERU, revealed a SNP on ECA18 nearby to the crystalline gene cluster CRYGA-CRYGF. For both genomic regions on ECA18 and 20, significantly associated haplotypes containing the genome-wide significant SNPs could be demonstrated. In conclusion, our results are indicative for a genetic component regulating the possible critical role of IL-17A and IL-17F in the pathogenesis of ERU. The associated SNP on ECA18 may be indicative for cataract formation in the course of ERU. PMID:23977091
Greub, Gilbert; Kebbi-Beghdadi, Carole; Bertelli, Claire; Collyn, François; Riederer, Beat M; Yersin, Camille; Croxatto, Antony; Raoult, Didier
With the availability of new generation sequencing technologies, bacterial genome projects have undergone a major boost. Still, chromosome completion needs a costly and time-consuming gap closure, especially when containing highly repetitive elements. However, incomplete genome data may be sufficiently informative to derive the pursued information. For emerging pathogens, i.e. newly identified pathogens, lack of release of genome data during gap closure stage is clearly medically counterproductive. We thus investigated the feasibility of a dirty genome approach, i.e. the release of unfinished genome sequences to develop serological diagnostic tools. We showed that almost the whole genome sequence of the emerging pathogen Parachlamydia acanthamoebae was retrieved even with relatively short reads from Genome Sequencer 20 and Solexa. The bacterial proteome was analyzed to select immunogenic proteins, which were then expressed and used to elaborate the first steps of an ELISA. This work constitutes the proof of principle for a dirty genome approach, i.e. the use of unfinished genome sequences of pathogenic bacteria, coupled with proteomics to rapidly identify new immunogenic proteins useful to develop in the future specific diagnostic tests such as ELISA, immunohistochemistry and direct antigen detection. Although applied here to an emerging pathogen, this combined dirty genome sequencing/proteomic approach may be used for any pathogen for which better diagnostics are needed. These genome sequences may also be very useful to develop DNA based diagnostic tests. All these diagnostic tools will allow further evaluations of the pathogenic potential of this obligate intracellular bacterium.
Daniëlle van Manen
Full Text Available BACKGROUND: AIDS develops typically after 7-11 years of untreated HIV-1 infection, with extremes of very rapid disease progression (15 years. To reveal additional host genetic factors that may impact on the clinical course of HIV-1 infection, we designed a genome-wide association study (GWAS in 404 participants of the Amsterdam Cohort Studies on HIV-1 infection and AIDS. METHODS: The association of SNP genotypes with the clinical course of HIV-1 infection was tested in Cox regression survival analyses using AIDS-diagnosis and AIDS-related death as endpoints. RESULTS: Multiple, not previously identified SNPs, were identified to be strongly associated with disease progression after HIV-1 infection, albeit not genome-wide significant. However, three independent SNPs in the top ten associations between SNP genotypes and time between seroconversion and AIDS-diagnosis, and one from the top ten associations between SNP genotypes and time between seroconversion and AIDS-related death, had P-values smaller than 0.05 in the French Genomics of Resistance to Immunodeficiency Virus cohort on disease progression. CONCLUSIONS: Our study emphasizes that the use of different phenotypes in GWAS may be useful to unravel the full spectrum of host genetic factors that may be associated with the clinical course of HIV-1 infection.
van Manen, Daniëlle; Delaneau, Olivier; Kootstra, Neeltje A.; Boeser-Nunnink, Brigitte D.; Limou, Sophie; Bol, Sebastiaan M.; Burger, Judith A.; Zwinderman, Aeilko H.; Moerland, Perry D.; van 't Slot, Ruben; Zagury, Jean-François; van 't Wout, Angélique B.; Schuitemaker, Hanneke
Background AIDS develops typically after 7–11 years of untreated HIV-1 infection, with extremes of very rapid disease progression (15 years). To reveal additional host genetic factors that may impact on the clinical course of HIV-1 infection, we designed a genome-wide association study (GWAS) in 404 participants of the Amsterdam Cohort Studies on HIV-1 infection and AIDS. Methods The association of SNP genotypes with the clinical course of HIV-1 infection was tested in Cox regression survival analyses using AIDS-diagnosis and AIDS-related death as endpoints. Results Multiple, not previously identified SNPs, were identified to be strongly associated with disease progression after HIV-1 infection, albeit not genome-wide significant. However, three independent SNPs in the top ten associations between SNP genotypes and time between seroconversion and AIDS-diagnosis, and one from the top ten associations between SNP genotypes and time between seroconversion and AIDS-related death, had P-values smaller than 0.05 in the French Genomics of Resistance to Immunodeficiency Virus cohort on disease progression. Conclusions Our study emphasizes that the use of different phenotypes in GWAS may be useful to unravel the full spectrum of host genetic factors that may be associated with the clinical course of HIV-1 infection. PMID:21811574
Johnston, Henry Richard; Hu, Yi-Juan; Gao, Jingjing; O'Connor, Timothy D; Abecasis, Gonçalo R; Wojcik, Genevieve L; Gignoux, Christopher R; Gourraud, Pierre-Antoine; Lizee, Antoine; Hansen, Mark; Genuario, Rob; Bullis, Dave; Lawley, Cindy; Kenny, Eimear E; Bustamante, Carlos; Beaty, Terri H; Mathias, Rasika A; Barnes, Kathleen C; Qin, Zhaohui S
A primary goal of The Consortium on Asthma among African-ancestry Populations in the Americas (CAAPA) is to develop an 'African Diaspora Power Chip' (ADPC), a genotyping array consisting of tagging SNPs, useful in comprehensively identifying African specific genetic variation. This array is designed based on the novel variation identified in 642 CAAPA samples of African ancestry with high coverage whole genome sequence data (~30× depth). This novel variation extends the pattern of variation catalogued in the 1000 Genomes and Exome Sequencing Projects to a spectrum of populations representing the wide range of West African genomic diversity. These individuals from CAAPA also comprise a large swath of the African Diaspora population and incorporate historical genetic diversity covering nearly the entire Atlantic coast of the Americas. Here we show the results of designing and producing such a microchip array. This novel array covers African specific variation far better than other commercially available arrays, and will enable better GWAS analyses for researchers with individuals of African descent in their study populations. A recent study cataloging variation in continental African populations suggests this type of African-specific genotyping array is both necessary and valuable for facilitating large-scale GWAS in populations of African ancestry.
Singer Peter A
Full Text Available Abstract Background While innovations in medicine, science and technology have resulted in improved health and quality of life for many people, the benefits of modern medicine continue to elude millions of people in many parts of the world. To assess the potential of genomics to address health needs in EMR, the World Health Organization's Eastern Mediterranean Regional Office and the University of Toronto Joint Centre for Bioethics jointly organized a Genomics and Public Health Policy Executive Course, held September 20th–23rd, 2003, in Muscat, Oman. The 4-day course was sponsored by WHO-EMRO with additional support from the Canadian Program in Genomics and Global Health. The overall objective of the course was to collectively explore how to best harness genomics to improve health in the region. This article presents the course findings and recommendations for genomics policy in EMR. Methods The course brought together senior representatives from academia, biotechnology companies, regulatory bodies, media, voluntary, and legal organizations to engage in discussion. Topics covered included scientific advances in genomics, followed by innovations in business models, public sector perspectives, ethics, legal issues and national innovation systems. Results A set of recommendations, summarized below, was formulated for the Regional Office, the Member States and for individuals. • Advocacy for genomics and biotechnology for political leadership; • Networking between member states to share information, expertise, training, and regional cooperation in biotechnology; coordination of national surveys for assessment of health biotechnology innovation systems, science capacity, government policies, legislation and regulations, intellectual property policies, private sector activity; • Creation in each member country of an effective National Body on genomics, biotechnology and health to: - formulate national biotechnology strategies - raise
Lay Person Interpretation: Injectional anthrax has been plaguing heroin drug users across Europe for more than 10 years. In order to better understand this outbreak, we assessed genomic relationships of all available injectional anthrax strains from four countries spanning a >12 year period. Very few differences were identified using genome-based analysis, but these differentiated the isolates into two distinct clusters. This strongly supports a hypothesis of at least two separate anthrax spore contamination events perhaps during the drug production processes. Identification of two events would not have been possible from standard epidemiological analysis. These comprehensive data will be invaluable for classifying future injectional anthrax isolates and for future geographic attribution.
Mohammadnejad, Afsaneh; Brasch-Andersen, Charlotte; Haagerup, Annette
Background: Allergic Rhinitis (AR) is a complex disorder that affects many people around the world. There is a high genetic contribution to the development of the AR, as twins and family studies have estimated heritability of more than 33%. Due to the complex nature of the disease, single SNP...... analysis has limited power in identifying the genetic variations for AR. We combined genome-wide association analysis (GWAS) with polygenic risk score (PRS) in exploring the genetic basis underlying the disease. Methods: We collected clinical data on 631 Danish subjects with AR cases consisting of 434...... sibling pairs and unrelated individuals and control subjects of 197 unrelated individuals. SNP genotyping was done by Affymetrix Genome-Wide Human SNP Array 5.0. SNP imputation was performed using "IMPUTE2". Using additive effect model, GWAS was conducted in discovery sample, the genotypes...
Full Text Available The microarray dataset attached to this report is related to the research article with the title: “A genomic approach to susceptibility and pathogenesis leads to identifying potential novel therapeutic targets in androgenetic alopecia” (Dey-Rao and Sinha, 2017 . Male-pattern hair loss that is induced by androgens (testosterone in genetically predisposed individuals is known as androgenetic alopecia (AGA. The raw dataset is being made publicly available to enable critical and/or extended analyses. Our related research paper utilizes the attached raw dataset, for genome-wide gene-expression associated investigations. Combined with several in silico bioinformatics-based analyses we were able to delineate five strategic molecular elements as potential novel targets towards future AGA-therapy.
Olm, Matthew R.; Morowitz, Michael J.
ABSTRACT Antibiotic resistance in pathogens is extensively studied, and yet little is known about how antibiotic resistance genes of typical gut bacteria influence microbiome dynamics. Here, we leveraged genomes from metagenomes to investigate how genes of the premature infant gut resistome correspond to the ability of bacteria to survive under certain environmental and clinical conditions. We found that formula feeding impacts the resistome. Random forest models corroborated by statistical tests revealed that the gut resistome of formula-fed infants is enriched in class D beta-lactamase genes. Interestingly, Clostridium difficile strains harboring this gene are at higher abundance in formula-fed infants than C. difficile strains lacking this gene. Organisms with genes for major facilitator superfamily drug efflux pumps have higher replication rates under all conditions, even in the absence of antibiotic therapy. Using a machine learning approach, we identified genes that are predictive of an organism’s direction of change in relative abundance after administration of vancomycin and cephalosporin antibiotics. The most accurate results were obtained by reducing annotated genomic data to five principal components classified by boosted decision trees. Among the genes involved in predicting whether an organism increased in relative abundance after treatment are those that encode subclass B2 beta-lactamases and transcriptional regulators of vancomycin resistance. This demonstrates that machine learning applied to genome-resolved metagenomics data can identify key genes for survival after antibiotics treatment and predict how organisms in the gut microbiome will respond to antibiotic administration. IMPORTANCE The process of reconstructing genomes from environmental sequence data (genome-resolved metagenomics) allows unique insight into microbial systems. We apply this technique to investigate how the antibiotic resistance genes of bacteria affect their ability to
Thomas Danielsen, E.; E. Møller, Morten; Yamanaka, Naoki
Steroid hormones control important developmental processes and are linked to many diseases. To systematically identify genes and pathways required for steroid production, we performed a Drosophila genome-wide in vivo RNAi screen and identified 1,906 genes with potential roles in steroidogenesis...... and developmental timing. Here, we use our screen as a resource to identify mechanisms regulating intracellular levels of cholesterol, a substrate for steroidogenesis. We identify a conserved fatty acid elongase that underlies a mechanism that adjusts cholesterol trafficking and steroidogenesis with nutrition...... and developmental programs. In addition, we demonstrate the existence of an autophagosomal cholesterol mobilization mechanism and show that activation of this system rescues Niemann-Pick type C1 deficiency that causes a disorder characterized by cholesterol accumulation. These cholesterol-trafficking mechanisms...
Zhang, Hao; Shaffer, John R.; Hansen, Thomas; Esserlind, Ann-Louise; Boyd, Heather A.; Nohr, Ellen A.; Timpson, Nicholas J.; Fatemifar, Ghazaleh; Paternoster, Lavinia; Evans, David M.; Weyant, Robert J.; Levy, Steven M.; Lathrop, Mark; Smith, George Davey; Murray, Jeffrey C.; Olesen, Jes; Werge, Thomas; Marazita, Mary L.; Sørensen, Thorkild I. A.; Melbye, Mads
The sequence and timing of permanent tooth eruption is thought to be highly heritable and can have important implications for the risk of malocclusion, crowding, and periodontal disease. We conducted a genome-wide association study of number of permanent teeth erupted between age 6 and 14 years, analyzed as age-adjusted standard deviation score averaged over multiple time points, based on childhood records for 5,104 women from the Danish National Birth Cohort. Four loci showed association at Peruption and were also known to influence height and breast cancer, respectively. The two other loci pointed to genomic regions without any previous significant genome-wide association study results. The intronic SNP rs7924176 in ADK could be linked to gene expression in monocytes. The combined effect of the four genetic variants was most pronounced between age 10 and 12 years, where children with 6 to 8 delayed tooth eruption alleles had on average 3.5 (95% confidence interval: 2.9–4.1) fewer permanent teeth than children with 0 or 1 of these alleles. PMID:21931568
Kumar, Nitin; Cai, Haoyang; von Mering, Christian; Baudis, Michael
Regional genomic copy number alterations (CNA) are observed in the vast majority of cancers. Besides specifically targeting well-known, canonical oncogenes, CNAs may also play more subtle roles in terms of modulating genetic potential and broad gene expression patterns of developing tumors. Any significant differences in the overall CNA patterns between different cancer types may thus point towards specific biological mechanisms acting in those cancers. In addition, differences among CNA profiles may prove valuable for cancer classifications beyond existing annotation systems. We have analyzed molecular-cytogenetic data from 25579 tumors samples, which were classified into 160 cancer types according to the International Classification of Disease (ICD) coding system. When correcting for differences in the overall CNA frequencies between cancer types, related cancers were often found to cluster together according to similarities in their CNA profiles. Based on a randomization approach, distance measures from the cluster dendrograms were used to identify those specific genomic regions that contributed significantly to this signal. This approach identified 43 non-neutral genomic regions whose propensity for the occurrence of copy number alterations varied with the type of cancer at hand. Only a subset of these identified loci overlapped with previously implied, highly recurrent (hot-spot) cytogenetic imbalance regions. Thus, for many genomic regions, a simple null-hypothesis of independence between cancer type and relative copy number alteration frequency can be rejected. Since a subset of these regions display relatively low overall CNA frequencies, they may point towards second-tier genomic targets that are adaptively relevant but not necessarily essential for cancer development.
Gallus, Susanne; Janke, Axel
Abstract Phylogenetic reconstruction from transposable elements (TEs) offers an additional perspective to study evolutionary processes. However, detecting phylogenetically informative TE insertions requires tedious experimental work, limiting the power of phylogenetic inference. Here, we analyzed the genomes of seven bear species using high-throughput sequencing data to detect thousands of TE insertions. The newly developed pipeline for TE detection called TeddyPi (TE detection and discovery for Phylogenetic Inference) identified 150,513 high-quality TE insertions in the genomes of ursine and tremarctine bears. By integrating different TE insertion callers and using a stringent filtering approach, the TeddyPi pipeline produced highly reliable TE insertion calls, which were confirmed by extensive in vitro validation experiments. Analysis of single nucleotide substitutions in the flanking regions of the TEs shows that these substitutions correlate with the phylogenetic signal from the TE insertions. Our phylogenomic analyses show that TEs are a major driver of genomic variation in bears and enabled phylogenetic reconstruction of a well-resolved species tree, despite strong signals for incomplete lineage sorting and introgression. The analyses show that the Asiatic black, sun, and sloth bear form a monophyletic clade, in which phylogenetic incongruence originates from incomplete lineage sorting. TeddyPi is open source and can be adapted to various TE and structural variation callers. The pipeline makes it possible to confidently extract thousands of TE insertions even from low-coverage genomes (∼10×) of nonmodel organisms. This opens new possibilities for biologists to study phylogenies and evolutionary processes as well as rates and patterns of (retro-)transposition and structural variation. PMID:28985298
Lammers, Fritjof; Gallus, Susanne; Janke, Axel; Nilsson, Maria A
Phylogenetic reconstruction from transposable elements (TEs) offers an additional perspective to study evolutionary processes. However, detecting phylogenetically informative TE insertions requires tedious experimental work, limiting the power of phylogenetic inference. Here, we analyzed the genomes of seven bear species using high-throughput sequencing data to detect thousands of TE insertions. The newly developed pipeline for TE detection called TeddyPi (TE detection and discovery for Phylogenetic Inference) identified 150,513 high-quality TE insertions in the genomes of ursine and tremarctine bears. By integrating different TE insertion callers and using a stringent filtering approach, the TeddyPi pipeline produced highly reliable TE insertion calls, which were confirmed by extensive in vitro validation experiments. Analysis of single nucleotide substitutions in the flanking regions of the TEs shows that these substitutions correlate with the phylogenetic signal from the TE insertions. Our phylogenomic analyses show that TEs are a major driver of genomic variation in bears and enabled phylogenetic reconstruction of a well-resolved species tree, despite strong signals for incomplete lineage sorting and introgression. The analyses show that the Asiatic black, sun, and sloth bear form a monophyletic clade, in which phylogenetic incongruence originates from incomplete lineage sorting. TeddyPi is open source and can be adapted to various TE and structural variation callers. The pipeline makes it possible to confidently extract thousands of TE insertions even from low-coverage genomes (∼10×) of nonmodel organisms. This opens new possibilities for biologists to study phylogenies and evolutionary processes as well as rates and patterns of (retro-)transposition and structural variation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Betty M Booker
Full Text Available The molecular events leading to the development of the bat wing remain largely unknown, and are thought to be caused, in part, by changes in gene expression during limb development. These expression changes could be instigated by variations in gene regulatory enhancers. Here, we used a comparative genomics approach to identify regions that evolved rapidly in the bat ancestor, but are highly conserved in other vertebrates. We discovered 166 bat accelerated regions (BARs that overlap H3K27ac and p300 ChIP-seq peaks in developing mouse limbs. Using a mouse enhancer assay, we show that five Myotis lucifugus BARs drive gene expression in the developing mouse limb, with the majority showing differential enhancer activity compared to the mouse orthologous BAR sequences. These include BAR116, which is located telomeric to the HoxD cluster and had robust forelimb expression for the M. lucifugus sequence and no activity for the mouse sequence at embryonic day 12.5. Developing limb expression analysis of Hoxd10-Hoxd13 in Miniopterus natalensis bats showed a high-forelimb weak-hindlimb expression for Hoxd10-Hoxd11, similar to the expression trend observed for M. lucifugus BAR116 in mice, suggesting that it could be involved in the regulation of the bat HoxD complex. Combined, our results highlight novel regulatory regions that could be instrumental for the morphological differences leading to the development of the bat wing.
Full Text Available Interaction between HBV and host genome integrations in hepatocellular carcinoma (HCC development is a complex process and the mechanism is still unclear. Here we described in details the quality controls and data mining of aCGH and transcriptome sequencing data on 50 HCC samples from the Chinese patients, published by Dong et al. (2015 (GEO#: GSE65486. In additional to the HBV-MLL4 integration discovered, we also investigated the genetic aberrations of HBV and host genes as well as their genetic interactions. We reported human genome copy number changes and frequent transcriptome variations (e.g. TP53, CTNNB1 mutation, especially MLL family mutations in this cohort of the patients. For HBV genotype C, we identified a novel linkage disequilibrium region covering HBV replication regulatory elements, including basal core promoter, DR1, epsilon and poly-A regions, which is associated with HBV core antigen over-expression and almost exclusive to HBV-MLL4 integration.
Glebes, Tirzah Y; Sandoval, Nicholas R; Gillis, Jacob H; Gill, Ryan T
Engineering both feedstock and product tolerance is important for transitioning towards next-generation biofuels derived from renewable sources. Tolerance to chemical inhibitors typically results in complex phenotypes, for which multiple genetic changes must often be made to confer tolerance. Here, we performed a genome-wide search for furfural-tolerant alleles using the TRackable Multiplex Recombineering (TRMR) method (Warner et al. (2010), Nature Biotechnology), which uses chromosomally integrated mutations directed towards increased or decreased expression of virtually every gene in Escherichia coli. We employed various growth selection strategies to assess the role of selection design towards growth enrichments. We also compared genes with increased fitness from our TRMR selection to those from a previously reported genome-wide identification study of furfural tolerance genes using a plasmid-based genomic library approach (Glebes et al. (2014) PLOS ONE). In several cases, growth improvements were observed for the chromosomally integrated promoter/RBS mutations but not for the plasmid-based overexpression constructs. Through this assessment, four novel tolerance genes, ahpC, yhjH, rna, and dicA, were identified and confirmed for their effect on improving growth in the presence of furfural. © 2014 Wiley Periodicals, Inc.
McCoy, Thomas H; Castro, Victor M; Snapper, Leslie A; Hart, Kamber L; Perlis, Roy H
Biobanks and national registries represent a powerful tool for genomic discovery, but rely on diagnostic codes that may be unreliable and fail to capture the relationship between related diagnoses. We developed an efficient means of conducting genome-wide association studies using combinations of diagnostic codes from electronic health records (EHR) for 10845 participants in a biobanking program at two large academic medical centers. Specifically, we applied latent Dirichilet allocation to fit 50 disease topics based on diagnostic codes, then conducted genome-wide common-variant association for each topic. In sensitivity analysis, these results were contrasted with those obtained from traditional single-diagnosis phenome-wide association analysis, as well as those in which only a subset of diagnostic codes are included per topic. In meta-analysis across three biobank cohorts, we identified 23 disease-associated loci with p<1e-15, including previously associated autoimmune disease loci. In all cases, observed significant associations were of greater magnitude than for single phenome-wide diagnostic codes, and incorporation of less strongly-loading diagnostic codes enhanced association. This strategy provides a more efficient means of phenome-wide association in biobanks with coded clinical data.
Karlas, Alexander; Berre, Stefano; Couderc, Thérèse; Varjak, Margus; Braun, Peter; Meyer, Michael; Gangneux, Nicolas; Karo-Astover, Liis; Weege, Friderike; Raftery, Martin; Schönrich, Günther; Klemm, Uwe; Wurzlbauer, Anne; Bracher, Franz; Merits, Andres; Meyer, Thomas F; Lecuit, Marc
Chikungunya virus (CHIKV) is a globally spreading alphavirus against which there is no commercially available vaccine or therapy. Here we use a genome-wide siRNA screen to identify 156 proviral and 41 antiviral host factors affecting CHIKV replication. We analyse the cellular pathways in which human proviral genes are involved and identify druggable targets. Twenty-one small-molecule inhibitors, some of which are FDA approved, targeting six proviral factors or pathways, have high antiviral activity in vitro, with low toxicity. Three identified inhibitors have prophylactic antiviral effects in mouse models of chikungunya infection. Two of them, the calmodulin inhibitor pimozide and the fatty acid synthesis inhibitor TOFA, have a therapeutic effect in vivo when combined. These results demonstrate the value of loss-of-function screening and pathway analysis for the rational identification of small molecules with therapeutic potential and pave the way for the development of new, host-directed, antiviral agents.
Marenholz, Ingo; Grosche, Sarah; Kalb, Birgit; Rüschendorf, Franz; Blümchen, Katharina; Schlags, Rupert; Harandi, Neda; Price, Mareike; Hansen, Gesine; Seidenberg, Jürgen; Röblitz, Holger; Yürek, Songül; Tschirner, Sebastian; Hong, Xiumei; Wang, Xiaobin; Homuth, Georg; Schmidt, Carsten O; Nöthen, Markus M; Hübner, Norbert; Niggemann, Bodo; Beyer, Kirsten; Lee, Young-Ae
Genetic factors and mechanisms underlying food allergy are largely unknown. Due to heterogeneity of symptoms a reliable diagnosis is often difficult to make. Here, we report a genome-wide association study on food allergy diagnosed by oral food challenge in 497 cases and 2387 controls. We identify five loci at genome-wide significance, the clade B serpin (SERPINB) gene cluster at 18q21.3, the cytokine gene cluster at 5q31.1, the filaggrin gene, the C11orf30/LRRC32 locus, and the human leukocyte antigen (HLA) region. Stratifying the results for the causative food demonstrates that association of the HLA locus is peanut allergy-specific whereas the other four loci increase the risk for any food allergy. Variants in the SERPINB gene cluster are associated with SERPINB10 expression in leukocytes. Moreover, SERPINB genes are highly expressed in the esophagus. All identified loci are involved in immunological regulation or epithelial barrier function, emphasizing the role of both mechanisms in food allergy.
We conducted a combined genome-wide association study (GWAS) of 7,481 individuals with bipolar disorder (cases) and 9,250 controls as part of the Psychiatric GWAS Consortium. Our replication study tested 34 SNPs in 4,496 independent cases with bipolar disorder and 42,422 independent controls and found that 18 of 34 SNPs had P < 0.05, with 31 of 34 SNPs having signals with the same direction of effect (P = 3.8 × 10(-7)). An analysis of all 11,974 bipolar disorder cases and 51,792 controls confirmed genome-wide significant evidence of association for CACNA1C and identified a new intronic variant in ODZ4. We identified a pathway comprised of subunits of calcium channels enriched in bipolar disorder association intervals. Finally, a combined GWAS analysis of schizophrenia and bipolar disorder yielded strong association evidence for SNPs in CACNA1C and in the region of NEK4-ITIH1-ITIH3-ITIH4. Our replication results imply that increasing sample sizes in bipolar disorder will confirm many additional loci.
Hertel, Robert; Rodríguez, David Pintor; Hollensteiner, Jacqueline; Dietrich, Sascha; Leimbach, Andreas; Hoppert, Michael; Liesegang, Heiko; Volland, Sonja
Prophages are viruses, which have integrated their genomes into the genome of a bacterial host. The status of the prophage genome can vary from fully intact with the potential to form infective particles to a remnant state where only a few phage genes persist. Prophages have impact on the properties of their host and are therefore of great interest for genomic research and strain design. Here we present a genome- and next generation sequencing (NGS)-based approach for identification and activity evaluation of prophage regions. Seven prophage or prophage-like regions were identified in the genome of Bacillus licheniformis DSM13. Six of these regions show similarity to members of the Siphoviridae phage family. The remaining region encodes the B. licheniformis orthologue of the PBSX prophage from Bacillus subtilis. Analysis of isolated phage particles (induced by mitomycin C) from the wild-type strain and prophage deletion mutant strains revealed activity of the prophage regions BLi_Pp2 (PBSX-like), BLi_Pp3 and BLi_Pp6. In contrast to BLi_Pp2 and BLi_Pp3, neither phage DNA nor phage particles of BLi_Pp6 could be visualized. However, the ability of prophage BLi_Pp6 to generate particles could be confirmed by sequencing of particle-protected DNA mapping to prophage locus BLi_Pp6. The introduced NGS-based approach allows the investigation of prophage regions and their ability to form particles. Our results show that this approach increases the sensitivity of prophage activity analysis and can complement more conventional approaches such as transmission electron microscopy (TEM). PMID:25811873
Liu, Yawen; Gao, Hui; Marstrand, Troels Torben
, but there are also regions that are bound by ERalpha only in the presence of ERbeta, as well as regions that are selectively bound by either receptor. Analysis of bound regions shows that regions bound by ERalpha have distinct properties in terms of genome landscape, sequence features, and conservation compared...
Hauser, Frank; Williamson, Michael; Cazzamali, Giuseppe
insect genome, that of the fruitfly Drosophila melanogaster, was sequenced in 2000, and about 200 GPCRs have been annnotated in this model insect. About 50 of these receptors were predicted to have neuropeptides or protein hormones as their ligands. Since 2000, the cDNAs of most of these candidate...... receptors have been cloned and for many receptors the endogenous ligand has been identified. In this review, we will give an update about the current knowledge of all Drosophila neuropeptide and protein hormone receptors, and discuss their phylogenetic relationships. Udgivelsesdato: 2006-Feb...
Cancer genome characterization efforts now provide an initial view of the somatic alterations in primary tumors. However, most point mutations occur at low frequency, and the function of these alleles remains undefined. We have developed a scalable systematic approach to interrogate the function of cancer-associated gene variants. We subjected 474 mutant alleles curated from 5,338 tumors to pooled in vivo tumor formation assays and gene expression profiling. We identified 12 transforming alleles, including two in genes (PIK3CB, POT1) that have not been shown to be tumorigenic.
Low, G W; Chattopadhyay, B; Garg, K M; Irestedt, M; Ericson, Pgp; Yap, G; Tang, Q; Wu, S; Rheindt, F E
Invasive species exert a serious impact on native fauna and flora and have been the target of many eradication and management efforts worldwide. However, a lack of data on population structure and history, exacerbated by the recency of many species introductions, limits the efficiency with which such species can be kept at bay. In this study we generated a novel genome of high assembly quality and genotyped 4735 genome-wide single nucleotide polymorphic (SNP) markers from 78 individuals of an invasive population of the Javan Myna Acridotheres javanicus across the island of Singapore. We inferred limited population subdivision at a micro-geographic level, a genetic patch size (~13-14 km) indicative of a pronounced dispersal ability, and barely an increase in effective population size since introduction despite an increase of four to five orders of magnitude in actual population size, suggesting that low population-genetic diversity following a bottleneck has not impeded establishment success. Landscape genomic analyses identified urban features, such as low-rise neighborhoods, that constitute pronounced barriers to gene flow. Based on our data, we consider an approach targeting the complete eradication of Javan Mynas across Singapore to be unfeasible. Instead, a mixed approach of localized mitigation measures taking into account urban geographic features and planning policy may be the most promising avenue to reducing the adverse impacts of this urban pest. Our study demonstrates how genomic methods can directly inform the management and control of invasive species, even in geographically limited datasets with high gene flow rates.
Zhu, Y.; Jong, M.C.; Frazer, K.A.; Gong, E.; Krauss, R.M.; Cheng, J.F.; Boffelli, D.; Rubin, E.M.
To accelerate the biological annotation of novel genes discovered in sequenced of mammalian genomes, we are creating large deletions in the mouse genome targeted to include clusters of such genes. Here we describe the targeted deletion of a 450 kb region on mouse chromosome 11 which, based on computational analysis of the deleted murine sequences and human 5q orthologous sequences, codes for nine putative genes. Mice homozygous for the deletion had a variety of abnormalities including severe hypertriglyceridemia, hepatic and cardiac enlargement, growth retardation and premature mortality. Analysis of triglyceride metabolism in these animals demonstrated a several-fold increase in hepatic very-low density lipoprotein (VLDL) triglyceride secretion, the most prevalent mechanism responsible for hypertriglyceridemia in humans. A series of mouse BAC and human YAC transgenes covering different intervals of the 450 kb deleted region were assessed for their ability to complement the deletion induced abnormalities. These studies revealed that OCTN2, a gene recently shown to play a role in carnitine transport, was able to correct the triglyceride abnormalities. The discovery of this previously unappreciated relationship between OCTN2, carnitine and hepatic triglyceride production is of particular importance due to the clinical consequence of hypertriglyceridemia and the paucity of genes known to modulate triglyceride secretion.
Jennifer L Bolton
Full Text Available Variation in plasma levels of cortisol, an essential hormone in the stress response, is associated in population-based studies with cardio-metabolic, inflammatory and neuro-cognitive traits and diseases. Heritability of plasma cortisol is estimated at 30-60% but no common genetic contribution has been identified. The CORtisol NETwork (CORNET consortium undertook genome wide association meta-analysis for plasma cortisol in 12,597 Caucasian participants, replicated in 2,795 participants. The results indicate that <1% of variance in plasma cortisol is accounted for by genetic variation in a single region of chromosome 14. This locus spans SERPINA6, encoding corticosteroid binding globulin (CBG, the major cortisol-binding protein in plasma, and SERPINA1, encoding α1-antitrypsin (which inhibits cleavage of the reactive centre loop that releases cortisol from CBG. Three partially independent signals were identified within the region, represented by common SNPs; detailed biochemical investigation in a nested sub-cohort showed all these SNPs were associated with variation in total cortisol binding activity in plasma, but some variants influenced total CBG concentrations while the top hit (rs12589136 influenced the immunoreactivity of the reactive centre loop of CBG. Exome chip and 1000 Genomes imputation analysis of this locus in the CROATIA-Korcula cohort identified missense mutations in SERPINA6 and SERPINA1 that did not account for the effects of common variants. These findings reveal a novel common genetic source of variation in binding of cortisol by CBG, and reinforce the key role of CBG in determining plasma cortisol levels. In turn this genetic variation may contribute to cortisol-associated degenerative diseases.
Jing, Shengli; Zhang, Lei; Ma, Yinhua; Liu, Bingfang; Zhao, Yan; Yu, Hangjin; Zhou, Xi; Qin, Rui; Zhu, Lili; He, Guangcun
Insects and plants have coexisted for over 350 million years and their interactions have affected ecosystems and agricultural practices worldwide. Variation in herbivorous insects' virulence to circumvent host resistance has been extensively documented. However, despite decades of investigation, the genetic foundations of virulence are currently unknown. The brown planthopper (Nilaparvata lugens) is the most destructive rice (Oryza sativa) pest in the world. The identification of the resistance gene Bph1 and its introduction in commercial rice varieties prompted the emergence of a new virulent brown planthopper biotype that was able to break the resistance conferred by Bph1. In this study, we aimed to construct a high density linkage map for the brown planthopper and identify the loci responsible for its virulence in order to determine their genetic architecture. Based on genotyping data for hundreds of molecular markers in three mapping populations, we constructed the most comprehensive linkage map available for this species, covering 96.6% of its genome. Fifteen chromosomes were anchored with 124 gene-specific markers. Using genome-wide scanning and interval mapping, the Qhp7 locus that governs preference for Bph1 plants was mapped to a 0.1 cM region of chromosome 7. In addition, two major QTLs that govern the rate of insect growth on resistant rice plants were identified on chromosomes 5 (Qgr5) and 14 (Qgr14). This is the first study to successfully locate virulence in the genome of this important agricultural insect by marker-based genetic mapping. Our results show that the virulence which overcomes the resistance conferred by Bph1 is controlled by a few major genes and that the components of virulence originate from independent genetic characters. The isolation of these loci will enable the elucidation of the molecular mechanisms underpinning the rice-brown planthopper interaction and facilitate the development of durable approaches for controlling this most
Full Text Available Insects and plants have coexisted for over 350 million years and their interactions have affected ecosystems and agricultural practices worldwide. Variation in herbivorous insects' virulence to circumvent host resistance has been extensively documented. However, despite decades of investigation, the genetic foundations of virulence are currently unknown. The brown planthopper (Nilaparvata lugens is the most destructive rice (Oryza sativa pest in the world. The identification of the resistance gene Bph1 and its introduction in commercial rice varieties prompted the emergence of a new virulent brown planthopper biotype that was able to break the resistance conferred by Bph1. In this study, we aimed to construct a high density linkage map for the brown planthopper and identify the loci responsible for its virulence in order to determine their genetic architecture. Based on genotyping data for hundreds of molecular markers in three mapping populations, we constructed the most comprehensive linkage map available for this species, covering 96.6% of its genome. Fifteen chromosomes were anchored with 124 gene-specific markers. Using genome-wide scanning and interval mapping, the Qhp7 locus that governs preference for Bph1 plants was mapped to a 0.1 cM region of chromosome 7. In addition, two major QTLs that govern the rate of insect growth on resistant rice plants were identified on chromosomes 5 (Qgr5 and 14 (Qgr14. This is the first study to successfully locate virulence in the genome of this important agricultural insect by marker-based genetic mapping. Our results show that the virulence which overcomes the resistance conferred by Bph1 is controlled by a few major genes and that the components of virulence originate from independent genetic characters. The isolation of these loci will enable the elucidation of the molecular mechanisms underpinning the rice-brown planthopper interaction and facilitate the development of durable approaches for
Ojeda-Gonzalez, A.; Prestes, A.; Klausner, V. [Laboratory of Physics and Astronomy, IP and D/Universidade do Vale do Paraíba—UNIVAP, São José dos Campos, SP (Brazil); Mendes, O. [Division of Space Geophysics, National Institute for Space Research, São José dos Campos, SP (Brazil); Calzadilla, A. [Department of Space Geophysics, Institute of Geophysics and Astronomy, Havana (Cuba); Domingues, M. O., E-mail: email@example.com [Associate Laboratory of Applied Computing and Mathematics, National Institute for Space Research, São José dos Campos, SP (Brazil)
Spatio-temporal entropy (STE) analysis is used as an alternative mathematical tool to identify possible magnetic cloud (MC) candidates. We analyze Interplanetary Magnetic Field (IMF) data using a time interval of only 10 days. We select a convenient data interval of 2500 records moving forward by 200 record steps until the end of the time series. For every data segment, the STE is calculated at each step. During an MC event, the STE reaches values close to zero. This extremely low value of STE is due to MC structure features. However, not all of the magnetic components in MCs have STE values close to zero at the same time. For this reason, we create a standardization index (the so-called Interplanetary Entropy, IE, index). This index is a worthwhile effort to develop new tools to help diagnose ICME structures. The IE was calculated using a time window of one year (1999), and it has a success rate of 70% over other identifiers of MCs. The unsuccessful cases (30%) are caused by small and weak MCs. The results show that the IE methodology identified 9 of 13 MCs, and emitted nine false alarm cases. In 1999, a total of 788 windows of 2500 values existed, meaning that the percentage of false alarms was 1.14%, which can be considered a good result. In addition, four time windows, each of 10 days, are studied, where the IE method was effective in finding MC candidates. As a novel result, two new MCs are identified in these time windows.
Background: Access to sheep genome sequences significantly improves the chances of identifying genes that may influence the health, welfare, and productivity of these animals. Methods: A public, searchable DNA sequence resource for U.S. sheep was created with whole genome sequence (WGS) of 96 rams. ...
Nalls, Mike A.; Pankratz, Nathan; Lill, Christina M.; Do, Chuong B.; Hernandez, Dena G.; Saad, Mohamad; DeStefano, Anita L.; Kara, Eleanna; Bras, Jose; Sharma, Manu; Schulte, Claudia; Keller, Margaux F.; Arepalli, Sampath; Letson, Christopher; Edsall, Connor; Stefansson, Hreinn; Liu, Xinmin; Pliner, Hannah; Lee, Joseph H.; Cheng, Rong; Ikram, M. Arfan; Ioannidis, John P. A.; Hadjigeorgiou, Georgios M.; Bis, Joshua C.; Martinez, Maria; Perlmutter, Joel S.; Goate, Alison; Marder, Karen; Fiske, Brian; Sutherland, Margaret; Xiromerisiou, Georgia; Myers, Richard H.; Clark, Lorraine N.; Stefansson, Kari; Hardy, John A.; Heutink, Peter; Chen, Honglei; Wood, Nicholas W.; Houlden, Henry; Payami, Haydeh; Brice, Alexis; Scott, William K.; Gasser, Thomas; Bertram, Lars; Eriksson, Nicholas; Foroud, Tatiana; Singleton, Andrew B.; Plagnol, Vincent; Sheerin, Una-Marie; Simón-Sánchez, Javier; Lesage, Suzanne; Sveinbjörnsdóttir, Sigurlaug; Barker, Roger; Ben-Shlomo, Yoav; Berendse, Henk W.; Berg, Daniela; Bhatia, Kailash; de Bie, Rob M. A.; Biffi, Alessandro; Bloem, Bas; Bochdanovits, Zoltan; Bonin, Michael; Bras, Jose M.; Brockmann, Kathrin; Brooks, Janet; Burn, David J.; Charlesworth, Gavin; Chinnery, Patrick F.; Chong, Sean; Clarke, Carl E.; Cookson, Mark R.; Cooper, J. Mark; Corvol, Jean Christophe; Counsell, Carl; Damier, Philippe; Dartigues, Jean-François; Deloukas, Panos; Deuschl, Günther; Dexter, David T.; van Dijk, Karin D.; Dillman, Allissa; Durif, Frank; Dürr, Alexandra; Edkins, Sarah; Evans, Jonathan R.; Foltynie, Thomas; Dong, Jing; Gardner, Michelle; Gibbs, J. Raphael; Gray, Emma; Guerreiro, Rita; Harris, Clare; van Hilten, Jacobus J.; Hofman, Albert; Hollenbeck, Albert; Holton, Janice; Hu, Michele; Huang, Xuemei; Wurster, Isabel; Mätzler, Walter; Hudson, Gavin; Hunt, Sarah E.; Huttenlocher, Johanna; Illig, Thomas; Jónsson, Pálmi V.; Lambert, Jean-Charles; Langford, Cordelia; Lees, Andrew; Lichtner, Peter; Limousin, Patricia; Lopez, Grisel; Lorenz, Delia; McNeill, Alisdair; Moorby, Catriona; Moore, Matthew; Morris, Huw R.; Morrison, Karen E.; Mudanohwo, Ese; O'Sullivan, Sean S.; Pearson, Justin; Pétursson, Hjörvar; Pollak, Pierre; Post, Bart; Potter, Simon; Ravina, Bernard; Revesz, Tamas; Riess, Olaf; Rivadeneira, Fernando; Rizzu, Patrizia; Ryten, Mina; Sawcer, Stephen; Schapira, Anthony; Scheffer, Hans; Shaw, Karen; Shoulson, Ira; Sidransky, Ellen; Smith, Colin; Spencer, Chris C. A.; Stefánsson, Hreinn; Bettella, Francesco; Stockton, Joanna D.; Strange, Amy; Talbot, Kevin; Tanner, Carlie M.; Tashakkori-Ghanbaria, Avazeh; Tison, François; Trabzuni, Daniah; Traynor, Bryan J.; Uitterlinden, André G.; Velseboer, Daan; Vidailhet, Marie; Walker, Robert; van de Warrenburg, Bart; Wickremaratchi, Mirdhu; Williams, Nigel; Williams-Gray, Caroline H.; Winder-Rhodes, Sophie; Stefánsson, Kári; Hardy, John; Factor, S.; Higgins, D.; Evans, S.; Shill, H.; Stacy, M.; Danielson, J.; Marlor, L.; Williamson, K.; Jankovic, J.; Hunter, C.; Simon, D.; Ryan, P.; Scollins, L.; Saunders-Pullman, R.; Boyar, K.; Costan-Toth, C.; Ohmann, E.; Sudarsky, L.; Joubert, C.; Friedman, J.; Chou, K.; Fernandez, H.; Lannon, M.; Galvez-Jimenez, N.; Podichetty, A.; Thompson, K.; Lewitt, P.; Deangelis, M.; O'Brien, C.; Seeberger, L.; Dingmann, C.; Judd, D.; Marder, K.; Fraser, J.; Harris, J.; Bertoni, J.; Peterson, C.; Rezak, M.; Medalle, G.; Chouinard, S.; Panisset, M.; Hall, J.; Poiffaut, H.; Calabrese, V.; Roberge, P.; Wojcieszek, J.; Belden, J.; Jennings, D.; Marek, K.; Mendick, S.; Reich, S.; Dunlop, B.; Jog, M.; Horn, C.; Uitti, R.; Turk, M.; Ajax, T.; Mannetter, J.; Sethi, K.; Carpenter, J.; Dill, B.; Hatch, L.; Ligon, K.; Narayan, S.; Blindauer, K.; Abou-Samra, K.; Petit, J.; Elmer, L.; Aiken, E.; Davis, K.; Schell, C.; Wilson, S.; Velickovic, M.; Koller, W.; Phipps, S.; Feigin, A.; Gordon, M.; Hamann, J.; Licari, E.; Marotta-Kollarus, M.; Shannon, B.; Winnick, R.; Simuni, T.; Videnovic, A.; Kaczmarek, A.; Williams, K.; Wolff, M.; Rao, J.; Cook, M.; Fernandez, M.; Kostyk, S.; Hubble, J.; Campbell, A.; Reider, C.; Seward, A.; Camicioli, R.; Carter, J.; Nutt, J.; Andrews, P.; Morehouse, S.; Stone, C.; Mendis, T.; Grimes, D.; Alcorn-Costa, C.; Gray, P.; Haas, K.; Vendette, J.; Sutton, J.; Hutchinson, B.; Young, J.; Rajput, A.; Klassen, L.; Shirley, T.; Manyam, B.; Simpson, P.; Whetteckey, J.; Wulbrecht, B.; Truong, D.; Pathak, M.; Frei, K.; Luong, N.; Tra, T.; Tran, A.; Vo, J.; Lang, A.; Kleiner- Fisman, G.; Nieves, A.; Johnston, L.; So, J.; Podskalny, G.; Giffin, L.; Atchison, P.; Allen, C.; Martin, W.; Wieler, M.; Suchowersky, O.; Furtado, S.; Klimek, M.; Hermanowicz, N.; Niswonger, S.; Shults, C.; Fontaine, D.; Aminoff, M.; Christine, C.; Diminno, M.; Hevezi, J.; Dalvi, A.; Kang, U.; Richman, J.; Uy, S.; Sahay, A.; Gartner, M.; Schwieterman, D.; Hall, D.; Leehey, M.; Culver, S.; Derian, T.; Demarcaida, T.; Thurlow, S.; Rodnitzky, R.; Dobson, J.; Lyons, K.; Pahwa, R.; Gales, T.; Thomas, S.; Shulman, L.; Weiner, W.; Dustin, K.; Singer, C.; Zelaya, L.; Tuite, P.; Hagen, V.; Rolandelli, S.; Schacherer, R.; Kosowicz, J.; Gordon, P.; Werner, J.; Serrano, C.; Roque, S.; Kurlan, R.; Berry, D.; Gardiner, I.; Hauser, R.; Sanchez-Ramos, J.; Zesiewicz, T.; Delgado, H.; Price, K.; Rodriguez, P.; Wolfrath, S.; Pfeiffer, R.; Davis, L.; Pfeiffer, B.; Dewey, R.; Hayward, B.; Johnson, A.; Meacham, M.; Estes, B.; Walker, F.; Hunt, V.; O'Neill, C.; Racette, B.; Swisher, L.; Dijamco, Cheri; Conley, Emily Drabant; Dorfman, Elizabeth; Tung, Joyce Y.; Hinds, David A.; Mountain, Joanna L.; Wojcicki, Anne; Lew, M.; Klein, C.; Golbe, L.; Growdon, J.; Wooten, G. F.; Watts, R.; Guttman, M.; Goldwurm, S.; Saint-Hilaire, M. H.; Baker, K.; Litvan, I.; Nicholson, G.; Nance, M.; Drasby, E.; Isaacson, S.; Burn, D.; Pramstaller, P.; Al-hinti, J.; Moller, A.; Sherman, S.; Roxburgh, R.; Slevin, J.; Perlmutter, J.; Mark, M. H.; Huggins, N.; Pezzoli, G.; Massood, T.; Itin, I.; Corbett, A.; Chinnery, P.; Ostergaard, K.; Snow, B.; Cambi, F.; Kay, D.; Samii, A.; Agarwal, P.; Roberts, J. W.; Higgins, D. S.; Molho, Eric; Rosen, Ami; Montimurro, J.; Martinez, E.; Griffith, A.; Kusel, V.; Yearout, D.; Zabetian, C.; Clark, L. N.; Liu, X.; Lee, J. H.; Taub, R. Cheng; Louis, E. D.; Cote, L. J.; Waters, C.; Ford, B.; Fahn, S.; Vance, Jeffery M.; Beecham, Gary W.; Martin, Eden R.; Nuytemans, Karen; Pericak-Vance, Margaret A.; Haines, Jonathan L.; DeStefano, Anita; Seshadri, Sudha; Choi, Seung Hoan; Frank, Samuel; Psaty, Bruce M.; Rice, Kenneth; Longstreth, W. T.; Ton, Thanh G. N.; Jain, Samay; van Duijn, Cornelia M.; Verlinden, Vincent J.; Koudstaal, Peter J.; Singleton, Andrew; Cookson, Mark; Hernandez, Dena; Nalls, Michael; Zonderman, Alan; Ferrucci, Luigi; Johnson, Robert; Longo, Dan; O'Brien, Richard; Traynor, Bryan; Troncoso, Juan; van der Brug, Marcel; Zielke, Ronald; Weale, Michael; Ramasamy, Adaikalavan; Dardiotis, Efthimios; Tsimourtou, Vana; Spanaki, Cleanthe; Plaitakis, Andreas; Bozi, Maria; Stefanis, Leonidas; Vassilatis, Dimitris; Koutsis, Georgios; Panas, Marios; Lunnon, Katie; Lupton, Michelle; Powell, John; Parkkinen, Laura; Ansorge, Olaf
We conducted a meta-analysis of Parkinson's disease genome-wide association studies using a common set of 7,893,274 variants across 13,708 cases and 95,282 controls. Twenty-six loci were identified as having genome-wide significant association; these and 6 additional previously reported loci were
Tsai, Kevin J; Lu, Mei-Yeh Jade; Yang, Kai-Jung; Li, Mengyun; Teng, Yuchuan; Chen, Shihmay; Ku, Maurice S B; Li, Wen-Hsiung
The diploid C 4 plant foxtail millet (Setaria italica L. Beauv.) is an important crop in many parts of Africa and Asia for the vast consumption of its grain and ability to grow in harsh environments, but remains understudied in terms of complete genomic architecture. To date, there have been only two genome assembly and annotation efforts with neither assembly reaching over 86% of the estimated genome size. We have combined de novo assembly with custom reference-guided improvements on a popular cultivar of foxtail millet and have achieved a genome assembly of 477 Mbp in length, which represents over 97% of the estimated 490 Mbp. The assembly anchors over 98% of the predicted genes to the nine assembled nuclear chromosomes and contains more functional annotation gene models than previous assemblies. Our annotation has identified a large number of unique gene ontology terms related to metabolic activities, a region of chromosome 9 with several growth factor proteins, and regions syntenic with pearl millet or maize genomic regions that have been previously shown to affect growth. The new assembly and annotation for this important species can be used for detailed investigation and future innovations in growth for millet and other grains.
Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich
The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.
Full Text Available Dogs, with their breed-determined limited genetic background, are great models of human disease including cancer. Canine B-cell lymphoma and hemangiosarcoma are both malignancies of the hematologic system that are clinically and histologically similar to human B-cell non-Hodgkin lymphoma and angiosarcoma, respectively. Golden retrievers in the US show significantly elevated lifetime risk for both B-cell lymphoma (6% and hemangiosarcoma (20%. We conducted genome-wide association studies for hemangiosarcoma and B-cell lymphoma, identifying two shared predisposing loci. The two associated loci are located on chromosome 5, and together contribute ~20% of the risk of developing these cancers. Genome-wide p-values for the top SNP of each locus are 4.6×10-7 and 2.7×10-6, respectively. Whole genome resequencing of nine cases and controls followed by genotyping and detailed analysis identified three shared and one B-cell lymphoma specific risk haplotypes within the two loci, but no coding changes were associated with the risk haplotypes. Gene expression analysis of B-cell lymphoma tumors revealed that carrying the risk haplotypes at the first locus is associated with down-regulation of several nearby genes including the proximal gene TRPC6, a transient receptor Ca2+-channel involved in T-cell activation, among other functions. The shared risk haplotype in the second locus overlaps the vesicle transport and release gene STX8. Carrying the shared risk haplotype is associated with gene expression changes of 100 genes enriched for pathways involved in immune cell activation. Thus, the predisposing germ-line mutations in B-cell lymphoma and hemangiosarcoma appear to be regulatory, and affect pathways involved in T-cell mediated immune response in the tumor. This suggests that the interaction between the immune system and malignant cells plays a common role in the tumorigenesis of these relatively different cancers.
Bhaskar, Anand; Song, Yun S
The sample frequency spectrum (SFS) is a widely-used summary statistic of genomic variation in a sample of homologous DNA sequences. It provides a highly efficient dimensional reduction of large-scale population genomic data and its mathematical dependence on the underlying population demography is well understood, thus enabling the development of efficient inference algorithms. However, it has been recently shown that very different population demographies can actually generate the same SFS for arbitrarily large sample sizes. Although in principle this nonidentifiability issue poses a thorny challenge to statistical inference, the population size functions involved in the counterexamples are arguably not so biologically realistic. Here, we revisit this problem and examine the identifiability of demographic models under the restriction that the population sizes are piecewise-defined where each piece belongs to some family of biologically-motivated functions. Under this assumption, we prove that the expected SFS of a sample uniquely determines the underlying demographic model, provided that the sample is sufficiently large. We obtain a general bound on the sample size sufficient for identifiability; the bound depends on the number of pieces in the demographic model and also on the type of population size function in each piece. In the cases of piecewise-constant, piecewise-exponential and piecewise-generalized-exponential models, which are often assumed in population genomic inferences, we provide explicit formulas for the bounds as simple functions of the number of pieces. Lastly, we obtain analogous results for the "folded" SFS, which is often used when there is ambiguity as to which allelic type is ancestral. Our results are proved using a generalization of Descartes' rule of signs for polynomials to the Laplace transform of piecewise continuous functions.
Bhaskar, Anand; Song, Yun S.
The sample frequency spectrum (SFS) is a widely-used summary statistic of genomic variation in a sample of homologous DNA sequences. It provides a highly efficient dimensional reduction of large-scale population genomic data and its mathematical dependence on the underlying population demography is well understood, thus enabling the development of efficient inference algorithms. However, it has been recently shown that very different population demographies can actually generate the same SFS for arbitrarily large sample sizes. Although in principle this nonidentifiability issue poses a thorny challenge to statistical inference, the population size functions involved in the counterexamples are arguably not so biologically realistic. Here, we revisit this problem and examine the identifiability of demographic models under the restriction that the population sizes are piecewise-defined where each piece belongs to some family of biologically-motivated functions. Under this assumption, we prove that the expected SFS of a sample uniquely determines the underlying demographic model, provided that the sample is sufficiently large. We obtain a general bound on the sample size sufficient for identifiability; the bound depends on the number of pieces in the demographic model and also on the type of population size function in each piece. In the cases of piecewise-constant, piecewise-exponential and piecewise-generalized-exponential models, which are often assumed in population genomic inferences, we provide explicit formulas for the bounds as simple functions of the number of pieces. Lastly, we obtain analogous results for the “folded” SFS, which is often used when there is ambiguity as to which allelic type is ancestral. Our results are proved using a generalization of Descartes’ rule of signs for polynomials to the Laplace transform of piecewise continuous functions. PMID:28018011
Anthon, Christian; Tafer, Hakim; Havgaard, Jakob H
BACKGROUND: Annotating mammalian genomes for noncoding RNAs (ncRNAs) is nontrivial since far from all ncRNAs are known and the computational models are resource demanding. Currently, the human genome holds the best mammalian ncRNA annotation, a result of numerous efforts by several groups. However......, a more direct strategy is desired for the increasing number of sequenced mammalian genomes of which some, such as the pig, are relevant as disease models and production animals. RESULTS: We present a comprehensive annotation of structured RNAs in the pig genome. Combining sequence and structure...... lncRNA loci, 11 conflicts of annotation, and 3,183 ncRNA genes. The ncRNA genes comprise 359 miRNAs, 8 ribozymes, 185 rRNAs, 638 snoRNAs, 1,030 snRNAs, 810 tRNAs and 153 ncRNA genes not belonging to the here fore mentioned classes. When running the pipeline on a local shuffled version of the genome...
Full Text Available Of all the meat quality traits, tenderness is considered the most important with regard to eating quality and market value. In this study we have utilised genome wide association studies (GWAS for peak shear force (PSF of loin muscle as a measure of tenderness for 1,976 crossbred commercial pigs, genotyped for 42,721 informative SNPs using the Illumina PorcineSNP60 Beadchip. Four 1 Mb genomic regions, three on SSC2 (at 4 Mb, 5 Mb and 109 Mb and one on SSC17 (at 20 Mb, were detected which collectively explained about 15.30% and 3.07% of the total genetic and phenotypic variance for PSF respectively. Markers ASGA0008566, ASGA0008695, DRGA0003285 and ASGA0075615 in the four regions were strongly associated with the effects. Analysis of the reference genome sequence in the region with the most important SNPs for SSC2_5 identified FRMD8, SLC25A45 and LTBP3 as potential candidate genes for meat tenderness on the basis of functional annotation of these genes. The region SSC2_109 was close to a previously reported candidate gene CAST; however, the very weak LD between DRGA0003285 (the best marker representing region SSC2_109 and CAST indicated the potential for additional genes which are distinct from, or interact with, CAST to affect meat tenderness. Limited information of known genes in regions SSC2_109 and SSC17_20 restricts further analysis. Re-sequencing of these regions for informative animals may help to resolve the molecular architecture and identify new candidate genes and causative mutations affecting this trait. These findings contribute significantly to our knowledge of the genomic regions affecting pork shear force and will potentially lead to new insights into the molecular mechanisms regulating meat tenderness.
Full Text Available Central corneal thickness (CCT is one of the most heritable ocular traits and it is also a phenotypic risk factor for primary open angle glaucoma (POAG. The present study uses the BXD Recombinant Inbred (RI strains to identify novel quantitative trait loci (QTLs modulating CCT in the mouse with the potential of identifying a molecular link between CCT and risk of developing POAG. The BXD RI strain set was used to define mammalian genomic loci modulating CCT, with a total of 818 corneas measured from 61 BXD RI strains (between 60-100 days of age. The mice were anesthetized and the eyes were positioned in front of the lens of the Phoenix Micron IV Image-Guided OCT system or the Bioptigen OCT system. CCT data for each strain was averaged and used to QTLs modulating this phenotype using the bioinformatics tools on GeneNetwork (www.genenetwork.org. The candidate genes and genomic loci identified in the mouse were then directly compared with the summary data from a human POAG genome wide association study (NEIGHBORHOOD to determine if any genomic elements modulating mouse CCT are also risk factors for POAG.This analysis revealed one significant QTL on Chr 13 and a suggestive QTL on Chr 7. The significant locus on Chr 13 (13 to 19 Mb was examined further to define candidate genes modulating this eye phenotype. For the Chr 13 QTL in the mouse, only one gene in the region (Pou6f2 contained nonsynonymous SNPs. Of these five nonsynonymous SNPs in Pou6f2, two resulted in changes in the amino acid proline which could result in altered secondary structure affecting protein function. The 7 Mb region under the mouse Chr 13 peak distributes over 2 chromosomes in the human: Chr 1 and Chr 7. These genomic loci were examined in the NEIGHBORHOOD database to determine if they are potential risk factors for human glaucoma identified using meta-data from human GWAS. The top 50 hits all resided within one gene (POU6F2, with the highest significance level of p = 10-6 for
Waldram, Alison; Dolan, Gayle; Ashton, Philip M; Jenkins, Claire; Dallman, Timothy J
The unprecedented level of bacterial strain discrimination provided by whole genome sequencing (WGS) presents new challenges with respect to the utility and interpretation of the data. Whole genome sequences from 1445 isolates of Salmonella belonging to the most commonly identified serotypes in England and Wales isolated between April and August 2014 were analysed. Single linkage single nucleotide polymorphism thresholds at the 10, 5 and 0 level were explored for evidence of epidemiological links between clustered cases. Analysis of the WGS data organised 566 of the 1445 isolates into 32 clusters of five or more. A statistically significant epidemiological link was identified for 17 clusters. The clusters were associated with foreign travel (n = 8), consumption of Chinese takeaways (n = 4), chicken eaten at home (n = 2), and one each of the following; eating out, contact with another case in the home and contact with reptiles. In the same time frame, one cluster was detected using traditional outbreak detection methods. WGS can be used for the highly specific and highly sensitive detection of biologically related isolates when epidemiological links are obscured. Improvements in the collection of detailed, standardised exposure information would enhance cluster investigations. Copyright © 2017 Elsevier Ltd. All rights reserved.
Kevin A Kwei
Full Text Available Pancreatobiliary cancers have among the highest mortality rates of any cancer type. Discovering the full spectrum of molecular genetic alterations may suggest new avenues for therapy. To catalogue genomic alterations, we carried out array-based genomic profiling of 31 exocrine pancreatic cancers and 6 distal bile duct cancers, expanded as xenografts to enrich the tumor cell fraction. We identified numerous focal DNA amplifications and deletions, including in 19% of pancreatobiliary cases gain at cytoband 18q11.2, a locus uncommonly amplified in other tumor types. The smallest shared amplification at 18q11.2 included GATA6, a transcriptional regulator previously linked to normal pancreas development. When amplified, GATA6 was overexpressed at both the mRNA and protein levels, and strong immunostaining was observed in 25 of 54 (46% primary pancreatic cancers compared to 0 of 33 normal pancreas specimens surveyed. GATA6 expression in xenografts was associated with specific microarray gene-expression patterns, enriched for GATA binding sites and mitochondrial oxidative phosphorylation activity. siRNA mediated knockdown of GATA6 in pancreatic cancer cell lines with amplification led to reduced cell proliferation, cell cycle progression, and colony formation. Our findings indicate that GATA6 amplification and overexpression contribute to the oncogenic phenotypes of pancreatic cancer cells, and identify GATA6 as a candidate lineage-specific oncogene in pancreatobiliary cancer, with implications for novel treatment strategies.
Blyth, Julie; Makrantoni, Vasso; Barton, Rachael E.; Spanos, Christos; Rappsilber, Juri; Marston, Adele L.
Meiosis is a specialized cell division that generates gametes, such as eggs and sperm. Errors in meiosis result in miscarriages and are the leading cause of birth defects; however, the molecular origins of these defects remain unknown. Studies in model organisms are beginning to identify the genes and pathways important for meiosis, but the parts list is still poorly defined. Here we present a comprehensive catalog of genes important for meiosis in the fission yeast, Schizosaccharomyces pombe. Our genome-wide functional screen surveyed all nonessential genes for roles in chromosome segregation and spore formation. Novel genes important at distinct stages of the meiotic chromosome segregation and differentiation program were identified. Preliminary characterization implicated three of these genes in centrosome/spindle pole body, centromere, and cohesion function. Our findings represent a near-complete parts list of genes important for meiosis in fission yeast, providing a valuable resource to advance our molecular understanding of meiosis. PMID:29259000
Wen, Yan; Wang, Wenyu; Guo, Xiong; Zhang, Feng
: Pleiotropy is common in the genetic architectures of complex diseases. To the best of our knowledge, no analysis tool has been developed for identifying pleiotropic pathways using multiple genome-wide association study (GWAS) summaries by now. Here, we present PAPA, a flexible tool for pleiotropic pathway analysis utilizing GWAS summary results. The performance of PAPA was validated using publicly available GWAS summaries of body mass index and waist-hip ratio of the GIANT datasets. PAPA identified a set of pleiotropic pathways, which have been demonstrated to be involved in the development of obesity. PAPA program, document and illustrative example are available at http://sourceforge.net/projects/papav1/files/ : firstname.lastname@example.org Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: email@example.com.
Anttila, Verneri; Winsvold, Bendik S; Gormley, Padhraig; Kurth, Tobias; Bettella, Francesco; McMahon, George; Kallela, Mikko; Malik, Rainer; de Vries, Boukje; Terwindt, Gisela; Medland, Sarah E; Todt, Unda; McArdle, Wendy L; Quaye, Lydia; Koiranen, Markku; Ikram, M Arfan; Lehtimäki, Terho; Stam, Anine H; Ligthart, Lannie; Wedenoja, Juho; Dunham, Ian; Neale, Benjamin M; Palta, Priit; Hamalainen, Eija; Schürks, Markus; Rose, Lynda M; Buring, Julie E; Ridker, Paul M; Steinberg, Stacy; Stefansson, Hreinn; Jakobsson, Finnbogi; Lawlor, Debbie A; Evans, David M; Ring, Susan M; Färkkilä, Markus; Artto, Ville; Kaunisto, Mari A; Freilinger, Tobias; Schoenen, Jean; Frants, Rune R; Pelzer, Nadine; Weller, Claudia M; Zielman, Ronald; Heath, Andrew C; Madden, Pamela A F; Montgomery, Grant W; Martin, Nicholas G; Borck, Guntram; Göbel, Hartmut; Heinze, Axel; Heinze-Kuhn, Katja; Williams, Frances M K; Hartikainen, Anna-Liisa; Pouta, Anneli; van den Ende, Joyce; Uitterlinden, Andre G; Hofman, Albert; Amin, Najaf; Hottenga, Jouke-Jan; Vink, Jacqueline M; Heikkilä, Kauko; Alexander, Michael; Muller-Myhsok, Bertram; Schreiber, Stefan; Meitinger, Thomas; Wichmann, Heinz Erich; Aromaa, Arpo; Eriksson, Johan G; Traynor, Bryan; Trabzuni, Daniah; Rossin, Elizabeth; Lage, Kasper; Jacobs, Suzanne B R; Gibbs, J Raphael; Birney, Ewan; Kaprio, Jaakko; Penninx, Brenda W; Boomsma, Dorret I; van Duijn, Cornelia; Raitakari, Olli; Jarvelin, Marjo-Riitta; Zwart, John-Anker; Cherkas, Lynn; Strachan, David P; Kubisch, Christian; Ferrari, Michel D; van den Maagdenberg, Arn M J M; Dichgans, Martin; Wessman, Maija; Smith, George Davey; Stefansson, Kari; Daly, Mark J; Nyholt, Dale R; Chasman, Daniel; Palotie, Aarno
Migraine is the most common brain disorder, affecting approximately 14% of the adult population, but its molecular mechanisms are poorly understood. We report the results of a meta-analysis across 29 genome-wide association studies, including a total of 23,285 individuals with migraine (cases) and 95,425 population-matched controls. We identified 12 loci associated with migraine susceptibility (P<5×10(-8)). Five loci are new: near AJAP1 at 1p36, near TSPAN2 at 1p13, within FHL5 at 6q16, within C7orf10 at 7p14 and near MMP16 at 8q21. Three of these loci were identified in disease subgroup analyses. Brain tissue expression quantitative trait locus analysis suggests potential functional candidate genes at four loci: APOA1BP, TBC1D7, FUT9, STAT6 and ATP5B.
Feenstra, Bjarke; Bager, Peter; Liu, Xueping
BACKGROUND: Inflammation of the tonsils is a normal response to infection, but some individuals experience recurrent, severe tonsillitis and massive hypertrophy of the tonsils in which case surgical removal of the tonsils may be considered. OBJECTIVE: To identify common genetic variants associate...... the molecular mechanisms underlying the genetic association involve general lymphoid hyper-reaction throughout the mucosa-associated lymphoid tissue system.......BACKGROUND: Inflammation of the tonsils is a normal response to infection, but some individuals experience recurrent, severe tonsillitis and massive hypertrophy of the tonsils in which case surgical removal of the tonsils may be considered. OBJECTIVE: To identify common genetic variants associated...... with tonsillectomy. METHODS: We used tonsillectomy information from Danish health registers and carried out a genome-wide association study comprising 1464 patients and 12 019 controls of Northwestern European ancestry, with replication in an independent sample set of 1575 patients and 1367 controls. RESULTS...
Irvine, B. J.; Fleskens, L.; Kirkby, M. J.
The DESIRE project has trialled a series of sustainable land management (SLM) technologies. These technologies have been identified as being beneficial in mitigating land degradation by local stakeholders from a range of semi-arid study sites. The field results and the qualitative WOCAT technology assessment ftom across the study sites have been used to develop the adapted PESERA SLM model. This paper considers the development of the adapted PESERA SLM model and the potential for applying locally successful SLM technologies across a wider range of climatic and environmental conditions with respect to degradation risk, biomass production and the investment cost interface (PESERA/DESMICE). The integrate PESERA/DESMICE model contributes to the policy debate by providing a biophysical and socio-economic assessment of technology and policy scenarios.
Sahana, Goutam; Guldbrandtsen, Bernt; Bendixen, Christian
Six genomic regions affecting clinical mastitis were identified through a GWAS study with imputed BovineHD chip genotype data in the Nordic Holstein cattle population. The association analyses were carried out using a SNP-by-SNP analysis by fitting the regression of allele dosage and a polygenic...... Effect Predictor (VEP) vers. 2.6 using ENSEMBL vers. 67 databases. Candidate polymorphisms affecting clinical mastitis were selected based on their association with the traits and functional annotations. A strong positional candidate gene for mastitis resistance on chromosome-6 is the NPFFR2 which...... Factor Receptor Alpha (LIFR) emerged as a strong candidate gene for mastitis resistance. The LIFR gene is involved in acute phase response and is expressed in saliva and mammary gland....
Wray, Naomi R; Ripke, Stephan; Mattheisen, Manuel; Trzaskowski, Maciej; Byrne, Enda M; Abdellaoui, Abdel; Adams, Mark J; Agerbo, Esben; Air, Tracy M; Andlauer, Till M F; Bacanu, Silviu-Alin; Bækvad-Hansen, Marie; Beekman, Aartjan F T; Bigdeli, Tim B; Binder, Elisabeth B; Blackwood, Douglas R H; Bryois, Julien; Buttenschøn, Henriette N; Bybjerg-Grauholm, Jonas; Cai, Na; Castelao, Enrique; Christensen, Jane Hvarregaard; Clarke, Toni-Kim; Coleman, Jonathan I R; Colodro-Conde, Lucía; Couvy-Duchesne, Baptiste; Craddock, Nick; Crawford, Gregory E; Crowley, Cheynna A; Dashti, Hassan S; Davies, Gail; Deary, Ian J; Degenhardt, Franziska; Derks, Eske M; Direk, Nese; Dolan, Conor V; Dunn, Erin C; Eley, Thalia C; Eriksson, Nicholas; Escott-Price, Valentina; Kiadeh, Farnush Hassan Farhadi; Finucane, Hilary K; Forstner, Andreas J; Frank, Josef; Gaspar, Héléna A; Gill, Michael; Giusti-Rodríguez, Paola; Goes, Fernando S; Gordon, Scott D; Grove, Jakob; Hall, Lynsey S; Hannon, Eilis; Hansen, Christine Søholm; Hansen, Thomas F; Herms, Stefan; Hickie, Ian B; Hoffmann, Per; Homuth, Georg; Horn, Carsten; Hottenga, Jouke-Jan; Hougaard, David M; Hu, Ming; Hyde, Craig L; Ising, Marcus; Jansen, Rick; Jin, Fulai; Jorgenson, Eric; Knowles, James A; Kohane, Isaac S; Kraft, Julia; Kretzschmar, Warren W; Krogh, Jesper; Kutalik, Zoltán; Lane, Jacqueline M; Li, Yihan; Li, Yun; Lind, Penelope A; Liu, Xiaoxiao; Lu, Leina; MacIntyre, Donald J; MacKinnon, Dean F; Maier, Robert M; Maier, Wolfgang; Marchini, Jonathan; Mbarek, Hamdi; McGrath, Patrick; McGuffin, Peter; Medland, Sarah E; Mehta, Divya; Middeldorp, Christel M; Mihailov, Evelin; Milaneschi, Yuri; Milani, Lili; Mill, Jonathan; Mondimore, Francis M; Montgomery, Grant W; Mostafavi, Sara; Mullins, Niamh; Nauck, Matthias; Ng, Bernard; Nivard, Michel G; Nyholt, Dale R; O'Reilly, Paul F; Oskarsson, Hogni; Owen, Michael J; Painter, Jodie N; Pedersen, Carsten Bøcker; Pedersen, Marianne Giørtz; Peterson, Roseann E; Pettersson, Erik; Peyrot, Wouter J; Pistis, Giorgio; Posthuma, Danielle; Purcell, Shaun M; Quiroz, Jorge A; Qvist, Per; Rice, John P; Riley, Brien P; Rivera, Margarita; Saeed Mirza, Saira; Saxena, Richa; Schoevers, Robert; Schulte, Eva C; Shen, Ling; Shi, Jianxin; Shyn, Stanley I; Sigurdsson, Engilbert; Sinnamon, Grant B C; Smit, Johannes H; Smith, Daniel J; Stefansson, Hreinn; Steinberg, Stacy; Stockmeier, Craig A; Streit, Fabian; Strohmaier, Jana; Tansey, Katherine E; Teismann, Henning; Teumer, Alexander; Thompson, Wesley; Thomson, Pippa A; Thorgeirsson, Thorgeir E; Tian, Chao; Traylor, Matthew; Treutlein, Jens; Trubetskoy, Vassily; Uitterlinden, André G; Umbricht, Daniel; Van der Auwera, Sandra; van Hemert, Albert M; Viktorin, Alexander; Visscher, Peter M; Wang, Yunpeng; Webb, Bradley T; Weinsheimer, Shantel Marie; Wellmann, Jürgen; Willemsen, Gonneke; Witt, Stephanie H; Wu, Yang; Xi, Hualin S; Yang, Jian; Zhang, Futao; Arolt, Volker; Baune, Bernhard T; Berger, Klaus; Boomsma, Dorret I; Cichon, Sven; Dannlowski, Udo; de Geus, E C J; DePaulo, J Raymond; Domenici, Enrico; Domschke, Katharina; Esko, Tõnu; Grabe, Hans J; Hamilton, Steven P; Hayward, Caroline; Heath, Andrew C; Hinds, David A; Kendler, Kenneth S; Kloiber, Stefan; Lewis, Glyn; Li, Qingqin S; Lucae, Susanne; Madden, Pamela F A; Magnusson, Patrik K; Martin, Nicholas G; McIntosh, Andrew M; Metspalu, Andres; Mors, Ole; Mortensen, Preben Bo; Müller-Myhsok, Bertram; Nordentoft, Merete; Nöthen, Markus M; O'Donovan, Michael C; Paciga, Sara A; Pedersen, Nancy L; Penninx, Brenda W J H; Perlis, Roy H; Porteous, David J; Potash, James B; Preisig, Martin; Rietschel, Marcella; Schaefer, Catherine; Schulze, Thomas G; Smoller, Jordan W; Stefansson, Kari; Tiemeier, Henning; Uher, Rudolf; Völzke, Henry; Weissman, Myrna M; Werge, Thomas; Winslow, Ashley R; Lewis, Cathryn M; Levinson, Douglas F; Breen, Gerome; Børglum, Anders D; Sullivan, Patrick F
Major depressive disorder (MDD) is a common illness accompanied by considerable morbidity, mortality, costs, and heightened risk of suicide. We conducted a genome-wide association meta-analysis based in 135,458 cases and 344,901 controls and identified 44 independent and significant loci. The genetic findings were associated with clinical features of major depression and implicated brain regions exhibiting anatomical differences in cases. Targets of antidepressant medications and genes involved in gene splicing were enriched for smaller association signal. We found important relationships of genetic risk for major depression with educational attainment, body mass, and schizophrenia: lower educational attainment and higher body mass were putatively causal, whereas major depression and schizophrenia reflected a partly shared biological etiology. All humans carry lesser or greater numbers of genetic risk factors for major depression. These findings help refine the basis of major depression and imply that a continuous measure of risk underlies the clinical phenotype.
Falkenberg, K J; Newbold, A; Gould, C M; Luu, J; Trapani, J A; Matthews, G M; Simpson, K J; Johnstone, R W
Vorinostat is an FDA-approved histone deacetylase inhibitor (HDACi) that has proven clinical success in some patients; however, it remains unclear why certain patients remain unresponsive to this agent and other HDACis. Constitutive STAT (signal transducer and activator of transcription) activation, overexpression of prosurvival Bcl-2 proteins and loss of HR23B have been identified as potential biomarkers of HDACi resistance; however, none have yet been used to aid the clinical utility of HDACi. Herein, we aimed to further elucidate vorinostat-resistance mechanisms through a functional genomics screen to identify novel genes that when knocked down by RNA interference (RNAi) sensitized cells to vorinostat-induced apoptosis. A synthetic lethal functional screen using a whole-genome protein-coding RNAi library was used to identify genes that when knocked down cooperated with vorinostat to induce tumor cell apoptosis in otherwise resistant cells. Through iterative screening, we identified 10 vorinostat-resistance candidate genes that sensitized specifically to vorinostat. One of these vorinostat-resistance genes was GLI1, an oncogene not previously known to regulate the activity of HDACi. Treatment of vorinostat-resistant cells with the GLI1 small-molecule inhibitor, GANT61, phenocopied the effect of GLI1 knockdown. The mechanism by which GLI1 loss of function sensitized tumor cells to vorinostat-induced apoptosis is at least in part through interactions with vorinostat to alter gene expression in a manner that favored apoptosis. Upon GLI1 knockdown and vorinostat treatment, BCL2L1 expression was repressed and overexpression of BCL2L1 inhibited GLI1-knockdown-mediated vorinostat sensitization. Taken together, we present the identification and characterization of GLI1 as a new HDACi resistance gene, providing a strong rationale for development of GLI1 inhibitors for clinical use in combination with HDACi therapy.
Dodda, Subba Reddy; Aich, Aparajita; Sarkar, Nibedita; Jain, Piyush; Jain, Sneha; Mondal, Sudipa; Aikat, Kaustav; Mukhopadhyay, Sudit S.
Thermostable glucose tolerant β-glucosidase from Aspergillus species has attracted worldwide interest for their potentiality in industrial applications and bioethanol production. A strain of Aspergillus fumigatus (AfNITDGPKA3) identified by our laboratory from straw retting ground showed higher cellulase activity, specifically the β-glucosidase activity, compared to other contemporary strains. Though A. fumigatus has been known for high cellulase activity, detailed identification and characterization of the cellulase genes from their genome is yet to be done. In this work we have been analyzed the cellulase genes from the genome sequence database of Aspergillus fumigatus (Af293). Genome analysis suggests two cellobiohydrolase, eleven endoglucanase and seventeen β-glucosidase genes present. β-Glucosidase genes belong to either Glycohydro1 (GH1 or Bgl1) or Glycohydro3 (GH3 or Bgl3) family. The sequence similarity suggests that Bgl1 and Bgl3 of A. fumagatus are phylogenetically close to those of A. fisheri and A. oryzae. The modelled structure of the Bgl1 predicts the (β/α)8 barrel type structure with deep and narrow active site, whereas, Bgl3 shows the (α/β)8 barrel and (α/β)6 sandwich structure with shallow and open active site. Docking results suggest that amino acids Glu544, Glu466, Trp408,Trp567,Tyr44,Tyr222,Tyr770,Asp844,Asp537,Asn212,Asn217 of Bgl3 and Asp224,Asn242,Glu440, Glu445, Tyr367, Tyr365,Thr994,Trp435,Trp446 of Bgl1 are involved in the hydrolysis. Binding affinity analyses suggest that Bgl3 and Bgl1 enzymes are more active on the substrates like 4-methylumbelliferyl glycoside (MUG) and p-nitrophenyl-β-D-1, 4-glucopyranoside (pNPG) than on cellobiose. Further docking with glucose suggests that Bgl1 is more glucose tolerant than Bgl3. Analysis of the Aspergillus fumigatus genome may help to identify a β-glucosidase enzyme with better property and the structural information may help to develop an engineered recombinant enzyme.
The investigations described in this dissertation were designed to determine the transcriptionally active DNA sequences of IIR region and to identify the viral mRNA transcribed from the transcriptionally most active DNA sequences of that region during late phase of HCMV Towne infection. Preliminary transcriptional studies which included the hybridization of a southern blot of XbaI digested entire HCMV genome to 32 P-labelled late phase infected cell A + RNA, indicated that late viral transcripts homologous to XbaI Q fragment of IIR region were very highly abundant while XbaI Q fragment showed a very low transcriptional activity. To facilitate further analysis of late transcription of IIR region, the entire DNA sequences of IIR region were molecularly cloned as U, S, and H BamHI fragments in pACYC-184 plasmid vector. In addition, to be used in future studies on other regions of the genome, except for y and c' smaller fragments the entire 240 kb HCMV genome was cloned as BamHI fragments in the same vector. Furthermore, the U, S, and H BamHI fragments were mapped with six other restriction enzymes in order to use that mapping data in subsequent transcriptional analysis of the IIR region. Further localization of transcriptionally active DNA sequences within IIR region was achieved by hybridization of southern blots of restricted U, S, and H BamHI fragments with 3' 32 P-labelled infected cell late A + RNA. The 1.5 kb EcooRI subfragments of S BamHI fragment and the adjoining 0.72 kb XhoI subfragment of H BamHI fragment revealed the highest level of transcription, although the remainder of the S fragment was also transcribed at a substantial level. The U fragment and the remainder of the H fragment was transcribed at a very low level
Liu, Jiaxuan; Zhao, Wei; Ware, Erin B; Turner, Stephen T; Mosley, Thomas H; Smith, Jennifer A
Genetic variations in apolipoprotein E (APOE) and proximal genes (PVRL2, TOMM40, and APOC1) are associated with cognitive function and dementia, particularly Alzheimer's disease. Epigenetic mechanisms such as DNA methylation play a central role in the regulation of gene expression. Recent studies have found evidence that DNA methylation may contribute to the pathogenesis of dementia, but its association with cognitive function in populations without dementia remains unclear. We assessed DNA methylation levels of 48 CpG sites in the APOE genomic region in peripheral blood leukocytes collected from 289 African Americans (mean age = 67 years) from the Genetic Epidemiology Network of Arteriopathy (GENOA) study. Using linear regression, we examined the relationship between methylation in the APOE genomic region and multiple cognitive measures including learning, memory, processing speed, concentration, language and global cognitive function. We identified eight CpG sites in three genes (PVRL2, TOMM40, and APOE) that showed an inverse association between methylation level and delayed recall, a measure of memory, after adjusting for age and sex (False Discovery Rate q-value accounting for known genetic predictors for cognition. Our findings highlight the important role of epigenetic mechanisms in influencing cognitive performance, and suggest that changes in blood methylation may be an early indicator of individuals at risk for dementia as well as potential targets for intervention in asymptomatic populations.
Bramon, Elvira; Pirinen, Matti; Strange, Amy; Lin, Kuang; Freeman, Colin; Bellenguez, Céline; Su, Zhan; Band, Gavin; Pearson, Richard; Vukcevic, Damjan; Langford, Cordelia; Deloukas, Panos; Hunt, Sarah; Gray, Emma; Dronov, Serge
Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories.
Kaas, Christian Schrøder; Kristensen, Claus; Betenbaugh, Michael J.
Background: The DHFR negative CHO DXB11 cell line (also known as DUX-B11 and DUKX) was historically the first CHO cell line to be used for large scale production of heterologous proteins and is still used for production of a number of complex proteins. Results: Here we present the genomic sequence...... of the CHO DXB11 genome sequenced to a depth of 33x. Overall a significant genomic drift was seen favoring GC -> AT point mutations in line with the chemical mutagenesis strategy used for generation of the cell line. The sequencing depth for each gene in the genome revealed distinct peaks at sequencing...... in eight additional analyzed CHO genomes (15-20% haploidy) but not in the genome of the Chinese hamster. The dhfr gene is confirmed to be haploid in CHO DXB11; transcriptionally active and the remaining allele contains a G410C point mutation causing a Thr137Arg missense mutation. We find similar to 2...
Evangelou, Evangelos; Kerkhof, Hanneke J; Styrkarsdottir, Unnur
Osteoarthritis (OA) is the most common form of arthritis with a clear genetic component. To identify novel loci associated with hip OA we performed a meta-analysis of genome-wide association studies (GWAS) on European subjects.......Osteoarthritis (OA) is the most common form of arthritis with a clear genetic component. To identify novel loci associated with hip OA we performed a meta-analysis of genome-wide association studies (GWAS) on European subjects....
Full Text Available Bacillus cereus is a bacterial pathogen that is responsible for many recurrent disease outbreaks due to food contamination. Spores and biofilms are considered the most important reservoirs of B. cereus in contaminated fresh vegetables and fruits. Biofilms are bacterial communities that are difficult to eradicate from biotic and abiotic surfaces because of their stable and extremely strong extracellular matrix. These extracellular matrixes contain exopolysaccharides, proteins, extracellular DNA, and other minor components. Although B. cereus can form biofilms, the bacterial features governing assembly of the protective extracellular matrix are not known. Using the well-studied bacterium B. subtilis as a model, we identified two genomic loci in B. cereus, which encodes two orthologs of the amyloid-like protein TasA of B. subtilis and a SipW signal peptidase. Deletion of this genomic region in B. cereus inhibited biofilm assembly; notably, mutation of the putative signal peptidase SipW caused the same phenotype. However, mutations in tasA or calY did not completely prevent biofilm formation; strains that were mutated for either of these genes formed phenotypically different surface attached biofilms. Electron microscopy studies revealed that TasA polymerizes to form long and abundant fibers on cell surfaces, whereas CalY does not aggregate similarly. Heterologous expression of this amyloid-like cassette in a B. subtilis strain lacking the factors required for the assembly of TasA amyloid-like fibers revealed i the involvement of this B. cereus genomic region in formation of the air-liquid interphase pellicles and ii the intrinsic ability of TasA to form fibers similar to the amyloid-like fibers produced by its B. subtilis ortholog.
Reyes-Solis, Guadalupe Del Carmen; Saavedra-Rodriguez, Karla; Suarez, Adriana Flores; Black, William C
The mosquito Aedes aegypti is the principal vector of dengue and yellow fever flaviviruses. Temephos is an organophosphate insecticide used globally to suppress Ae. aegypti larval populations but resistance has evolved in many locations. Quantitative Trait Loci (QTL) controlling temephos survival in Ae. aegypti larvae were mapped in a pair of F3 advanced intercross lines arising from temephos resistant parents from Solidaridad, México and temephos susceptible parents from Iquitos, Peru. Two sets of 200 F3 larvae were exposed to a discriminating dose of temephos and then dead larvae were collected and preserved for DNA isolation every two hours up to 16 hours. Larvae surviving longer than 16 hours were considered resistant. For QTL mapping, single nucleotide polymorphisms (SNPs) were identified at 23 single copy genes and 26 microsatellite loci of known physical positions in the Ae. aegypti genome. In both reciprocal crosses, Multiple Interval Mapping identified eleven QTL associated with time until death. In the Solidaridad×Iquitos (SLD×Iq) cross twelve were associated with survival but in the reciprocal IqxSLD cross, only six QTL were survival associated. Polymorphisms at acetylcholine esterase (AchE) loci 1 and 2 were not associated with either resistance phenotype suggesting that target site insensitivity is not an organophosphate resistance mechanism in this region of México. Temephos resistance is under the control of many metabolic genes of small effect and dispersed throughout the Ae. aegypti genome.
McAllister, T A; Meale, S J; Valle, E; Guan, L L; Zhou, M; Kelly, W J; Henderson, G; Attwood, G T; Janssen, P H
Globally, methane (CH4) emissions account for 40% to 45% of greenhouse gas emissions from ruminant livestock, with over 90% of these emissions arising from enteric fermentation. Reduction of carbon dioxide to CH4 is critical for efficient ruminal fermentation because it prevents the accumulation of reducing equivalents in the rumen. Methanogens exist in a symbiotic relationship with rumen protozoa and fungi and within biofilms associated with feed and the rumen wall. Genomics and transcriptomics are playing an increasingly important role in defining the ecology of ruminal methanogenesis and identifying avenues for its mitigation. Metagenomic approaches have provided information on changes in abundances as well as the species composition of the methanogen community among ruminants that vary naturally in their CH4 emissions, their feed efficiency, and their response to CH4 mitigators. Sequencing the genomes of rumen methanogens has provided insight into surface proteins that may prove useful in the development of vaccines and has allowed assembly of biochemical pathways for use in chemogenomic approaches to lowering ruminal CH4 emissions. Metagenomics and metatranscriptomic analysis of entire rumen microbial communities are providing new perspectives on how methanogens interact with other members of this ecosystem and how these relationships may be altered to reduce methanogenesis. Identification of community members that produce antimethanogen agents that either inhibit or kill methanogens could lead to the identification of new mitigation approaches. Discovery of a lytic archaeophage that specifically lyses methanogens is 1 such example. Efforts in using genomic data to alter methanogenesis have been hampered by a lack of sequence information that is specific to the microbial community of the rumen. Programs such as Hungate1000 and the Global Rumen Census are increasing the breadth and depth of our understanding of global ruminal microbial communities, steps that
Cheng, Yu-Ching; Stanne, Tara M.; Giese, Anne-Katrin; Ho, Weang Kee; Traylor, Matthew; Amouyel, Philippe; Holliday, Elizabeth G.; Malik, Rainer; Xu, Huichun; Kittner, Steven J.; Cole, John W.; O’Connell, Jeffrey R.; Danesh, John; Rasheed, Asif; Zhao, Wei; Engelter, Stefan; Grond-Ginsbach, Caspar; Kamatani, Yoichiro; Lathrop, Mark; Leys, Didier; Thijs, Vincent; Metso, Tiina M.; Tatlisumak, Turgut; Pezzini, Alessandro; Parati, Eugenio A.; Norrving, Bo; Bevan, Steve; Rothwell, Peter M; Sudlow, Cathie; Slowik, Agnieszka; Lindgren, Arne; Walters, Matthew R; Jannes, Jim; Shen, Jess; Crosslin, David; Doheny, Kimberly; Laurie, Cathy C.; Kanse, Sandip M.; Bis, Joshua C.; Fornage, Myriam; Mosley, Thomas H.; Hopewell, Jemma C.; Strauch, Konstantin; Müller-Nurasyid, Martina; Gieger, Christian; Waldenberger, Melanie; Peters, Annette; Meisinger, Christine; Ikram, M. Arfan; Longstreth, WT; Meschia, James F.; Seshadri, Sudha; Sharma, Pankaj; Worrall, Bradford; Jern, Christina; Levi, Christopher; Dichgans, Martin; Boncoraglio, Giorgio B.; Markus, Hugh S.; Debette, Stephanie; Rolfs, Arndt; Saleheen, Danish; Mitchell, Braxton D.
Background and Purpose Although a genetic contribution to ischemic stroke is well recognized, only a handful of stroke loci have been identified by large-scale genetic association studies to date. Hypothesizing that genetic effects might be stronger for early- versus late-onset stroke, we conducted a two-stage meta-analysis of genome-wide association studies (GWAS), focusing on stroke cases with an age of onset genetic variants at loci with association Pstroke susceptibility locus at 10q25 reached genome-wide significance in the combined analysis of all samples from the Discovery and Follow-up Stages (rs11196288, OR=1.41, P=9.5×10−9). The associated locus is in an intergenic region between TCF7L2 and HABP2. In a further analysis in an independent sample, we found that two SNPs in high linkage disequilibrium with rs11196288 were significantly associated with total plasma factor VII-activating protease levels, a product of HABP2. Conclusions HABP2, which encodes an extracellular serine protease involved in coagulation, fibrinolysis, and inflammatory pathways, may be a genetic susceptibility locus for early-onset stroke. PMID:26732560
Kao, Chung-Feng; Jia, Peilin; Zhao, Zhongming; Kuo, Po-Hsiu
Major depressive disorder (MDD) has caused a substantial burden of disease worldwide with moderate heritability. Despite efforts through conducting numerous association studies and now, genome-wide association (GWA) studies, the success of identifying susceptibility loci for MDD has been limited, which is partially attributed to the complex nature of depression pathogenesis. A pathway-based analytic strategy to investigate the joint effects of various genes within specific biological pathways has emerged as a powerful tool for complex traits. The present study aimed to identify enriched pathways for depression using a GWA dataset for MDD. For each gene, we estimated its gene-wise p value using combined and minimum p value, separately. Canonical pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) and BioCarta were used. We employed four pathway-based analytic approaches (gene set enrichment analysis, hypergeometric test, sum-square statistic, sum-statistic). We adjusted for multiple testing using Benjamini & Hochberg's method to report significant pathways. We found 17 significantly enriched pathways for depression, which presented low-to-intermediate crosstalk. The top four pathways were long-term depression (p⩽1×10-5), calcium signalling (p⩽6×10-5), arrhythmogenic right ventricular cardiomyopathy (p⩽1.6×10-4) and cell adhesion molecules (p⩽2.2×10-4). In conclusion, our comprehensive pathway analyses identified promising pathways for depression that are related to neurotransmitter and neuronal systems, immune system and inflammatory response, which may be involved in the pathophysiological mechanisms underlying depression. We demonstrated that pathway enrichment analysis is promising to facilitate our understanding of complex traits through a deeper interpretation of GWA data. Application of this comprehensive analytic strategy in upcoming GWA data for depression could validate the findings reported in this study.
Nazari-Ghadikolaei, Anahit; Mehrabani-Yeganeh, Hassan; Miarei-Aashtiani, Seyed R; Staiger, Elizabeth A; Rashidi, Amir; Huson, Heather J
The Markhoz goat provides an opportunity to study the genetics underlying coat color and mohair traits of an Angora type goat using genome-wide association studies (GWAS). This indigenous Iranian breed is valued for its quality mohair used in ceremonial garments and has the distinction of exhibiting an array of coat colors including black, brown, and white. Here, we performed 16 GWAS for different fleece (mohair) traits and coat color in 228 Markhoz goats sampled from the Markhoz Goat Research Station in Sanandaj, Kurdistan province, located in western Iran using the Illumina Caprine 50K beadchip. The Efficient Mixed Model Linear analysis was used to identify genomic regions with potential candidate genes contributing to coat color and mohair characteristics while correcting for population structure. Significant associations to coat color were found within or near the ASIP, ITCH, AHCY , and RALY genes on chromosome 13 for black and brown coat color and the KIT and PDGFRA genes on chromosome 6 for white coat color. Individual mohair traits were analyzed for genetic association along with principal components that allowed for a broader perspective of combined traits reflecting overall mohair quality and volume. A multitude of markers demonstrated significant association to mohair traits highlighting potential candidate genes of POU1F1 on chromosome 1 for mohair quality, MREG on chromosome 2 for mohair volume, DUOX1 on chromosome 10 for yearling fleece weight, and ADGRV1 on chromosome 7 for grease percentage. Variation in allele frequencies and haplotypes were identified for coat color and differentiated common markers associated with both brown and black coat color. This demonstrates the potential for genetic markers to be used in future breeding programs to improve selection for coat color and mohair traits. Putative candidate genes, both novel and previously identified in other species or breeds, require further investigation to confirm phenotypic causality and
Full Text Available The Markhoz goat provides an opportunity to study the genetics underlying coat color and mohair traits of an Angora type goat using genome-wide association studies (GWAS. This indigenous Iranian breed is valued for its quality mohair used in ceremonial garments and has the distinction of exhibiting an array of coat colors including black, brown, and white. Here, we performed 16 GWAS for different fleece (mohair traits and coat color in 228 Markhoz goats sampled from the Markhoz Goat Research Station in Sanandaj, Kurdistan province, located in western Iran using the Illumina Caprine 50K beadchip. The Efficient Mixed Model Linear analysis was used to identify genomic regions with potential candidate genes contributing to coat color and mohair characteristics while correcting for population structure. Significant associations to coat color were found within or near the ASIP, ITCH, AHCY, and RALY genes on chromosome 13 for black and brown coat color and the KIT and PDGFRA genes on chromosome 6 for white coat color. Individual mohair traits were analyzed for genetic association along with principal components that allowed for a broader perspective of combined traits reflecting overall mohair quality and volume. A multitude of markers demonstrated significant association to mohair traits highlighting potential candidate genes of POU1F1 on chromosome 1 for mohair quality, MREG on chromosome 2 for mohair volume, DUOX1 on chromosome 10 for yearling fleece weight, and ADGRV1 on chromosome 7 for grease percentage. Variation in allele frequencies and haplotypes were identified for coat color and differentiated common markers associated with both brown and black coat color. This demonstrates the potential for genetic markers to be used in future breeding programs to improve selection for coat color and mohair traits. Putative candidate genes, both novel and previously identified in other species or breeds, require further investigation to confirm phenotypic
Winkelmann Bernhard R
Full Text Available Abstract Background Genome-wide association studies (GWAS have identified new candidate genes for the occurrence of acute coronary syndrome (ACS, but possible effects of such genes on survival following ACS have yet to be investigated. Methods We examined 95 polymorphisms in 69 distinct gene regions identified in a GWAS for premature myocardial infarction for their association with post-ACS mortality among 811 whites recruited from university-affiliated hospitals in Kansas City, Missouri. We then sought replication of a positive genetic association in a large, racially diverse cohort of myocardial infarction patients (N = 2284 using Kaplan-Meier survival analyses and Cox regression to adjust for relevant covariates. Finally, we investigated the apparent association further in 6086 additional coronary artery disease patients. Results After Cox adjustment for other ACS risk factors, of 95 SNPs tested in 811 whites only the association with the rs6922269 in MTHFD1L was statistically significant, with a 2.6-fold mortality hazard (P = 0.007. The recessive A/A genotype was of borderline significance in an age- and race-adjusted analysis of the entire combined cohort (N = 3095; P = 0.052, but this finding was not confirmed in independent cohorts (N = 6086. Conclusions We found no support for the hypothesis that the GWAS-identified variants in this study substantially alter the probability of post-ACS survival. Large-scale, collaborative, genome-wide studies may be required in order to detect genetic variants that are robustly associated with survival in patients with coronary artery disease.
Urasaki, Naoya; Takagi, Hiroki; Natsume, Satoshi; Uemura, Aiko; Taniai, Naoki; Miyagi, Norimichi; Fukushima, Mai; Suzuki, Shouta; Tarora, Kazuhiko; Tamaki, Moritoshi; Sakamoto, Moriaki; Terauchi, Ryohei; Matsumura, Hideo
Bitter gourd (Momordica charantia) is an important vegetable and medicinal plant in tropical and subtropical regions globally. In this study, the draft genome sequence of a monoecious bitter gourd inbred line, OHB3-1, was analyzed. Through Illumina sequencing and de novo assembly, scaffolds of 285.5 Mb in length were generated, corresponding to ∼84% of the estimated genome size of bitter gourd (339 Mb). In this draft genome sequence, 45,859 protein-coding gene loci were identified, and transposable elements accounted for 15.3% of the whole genome. According to synteny mapping and phylogenetic analysis of conserved genes, bitter gourd was more related to watermelon (Citrullus lanatus) than to cucumber (Cucumis sativus) or melon (C. melo). Using RAD-seq analysis, 1507 marker loci were genotyped in an F2 progeny of two bitter gourd lines, resulting in an improved linkage map, comprising 11 linkage groups. By anchoring RAD tag markers, 255 scaffolds were assigned to the linkage map. Comparative analysis of genome sequences and predicted genes determined that putative trypsin-inhibitor and ribosome-inactivating genes were distinctive in the bitter gourd genome. These genes could characterize the bitter gourd as a medicinal plant. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Wain, Louise V; Verwoert, Germaine C; O’Reilly, Paul F; Shi, Gang; Johnson, Toby; Johnson, Andrew D; Bochud, Murielle; Rice, Kenneth M; Henneman, Peter; Smith, Albert V; Ehret, Georg B; Amin, Najaf; Larson, Martin G; Mooser, Vincent; Hadley, David; Dörr, Marcus; Bis, Joshua C; Aspelund, Thor; Esko, Tõnu; Janssens, A Cecile JW; Zhao, Jing Hua; Heath, Simon; Laan, Maris; Fu, Jingyuan; Pistis, Giorgio; Luan, Jian’an; Arora, Pankaj; Lucas, Gavin; Pirastu, Nicola; Pichler, Irene; Jackson, Anne U; Webster, Rebecca J; Zhang, Feng; Peden, John F; Schmidt, Helena; Tanaka, Toshiko; Campbell, Harry; Igl, Wilmar; Milaneschi, Yuri; Hotteng, Jouke-Jan; Vitart, Veronique; Chasman, Daniel I; Trompet, Stella; Bragg-Gresham, Jennifer L; Alizadeh, Behrooz Z; Chambers, John C; Guo, Xiuqing; Lehtimäki, Terho; Kühnel, Brigitte; Lopez, Lorna M; Polašek, Ozren; Boban, Mladen; Nelson, Christopher P; Morrison, Alanna C; Pihur, Vasyl; Ganesh, Santhi K; Hofman, Albert; Kundu, Suman; Mattace-Raso, Francesco US; Rivadeneira, Fernando; Sijbrands, Eric JG; Uitterlinden, Andre G; Hwang, Shih-Jen; Vasan, Ramachandran S; Wang, Thomas J; Bergmann, Sven; Vollenweider, Peter; Waeber, Gérard; Laitinen, Jaana; Pouta, Anneli; Zitting, Paavo; McArdle, Wendy L; Kroemer, Heyo K; Völker, Uwe; Völzke, Henry; Glazer, Nicole L; Taylor, Kent D; Harris, Tamara B; Alavere, Helene; Haller, Toomas; Keis, Aime; Tammesoo, Mari-Liis; Aulchenko, Yurii; Barroso, Inês; Khaw, Kay-Tee; Galan, Pilar; Hercberg, Serge; Lathrop, Mark; Eyheramendy, Susana; Org, Elin; Sõber, Siim; Lu, Xiaowen; Nolte, Ilja M; Penninx, Brenda W; Corre, Tanguy; Masciullo, Corrado; Sala, Cinzia; Groop, Leif; Voight, Benjamin F; Melander, Olle; O’Donnell, Christopher J; Salomaa, Veikko; d’Adamo, Adamo Pio; Fabretto, Antonella; Faletra, Flavio; Ulivi, Sheila; Del Greco, M Fabiola; Facheris, Maurizio; Collins, Francis S; Bergman, Richard N; Beilby, John P; Hung, Joseph; Musk, A William; Mangino, Massimo; Shin, So-Youn; Soranzo, Nicole; Watkins, Hugh; Goel, Anuj; Hamsten, Anders; Gider, Pierre; Loitfelder, Marisa; Zeginigg, Marion; Hernandez, Dena; Najjar, Samer S; Navarro, Pau; Wild, Sarah H; Corsi, Anna Maria; Singleton, Andrew; de Geus, Eco JC; Willemsen, Gonneke; Parker, Alex N; Rose, Lynda M; Buckley, Brendan; Stott, David; Orru, Marco; Uda, Manuela; van der Klauw, Melanie M; Zhang, Weihua; Li, Xinzhong; Scott, James; Chen, Yii-Der Ida; Burke, Gregory L; Kähönen, Mika; Viikari, Jorma; Döring, Angela; Meitinger, Thomas; Davies, Gail; Starr, John M; Emilsson, Valur; Plump, Andrew; Lindeman, Jan H; ’t Hoen, Peter AC; König, Inke R; Felix, Janine F; Clarke, Robert; Hopewell, Jemma C; Ongen, Halit; Breteler, Monique; Debette, Stéphanie; DeStefano, Anita L; Fornage, Myriam; Mitchell, Gary F; Smith, Nicholas L; Holm, Hilma; Stefansson, Kari; Thorleifsson, Gudmar; Thorsteinsdottir, Unnur; Samani, Nilesh J; Preuss, Michael; Rudan, Igor; Hayward, Caroline; Deary, Ian J; Wichmann, H-Erich; Raitakari, Olli T; Palmas, Walter; Kooner, Jaspal S; Stolk, Ronald P; Jukema, J Wouter; Wright, Alan F; Boomsma, Dorret I; Bandinelli, Stefania; Gyllensten, Ulf B; Wilson, James F; Ferrucci, Luigi; Schmidt, Reinhold; Farrall, Martin; Spector, Tim D; Palmer, Lyle J; Tuomilehto, Jaakko; Pfeufer, Arne; Gasparini, Paolo; Siscovick, David; Altshuler, David; Loos, Ruth JF; Toniolo, Daniela; Snieder, Harold; Gieger, Christian; Meneton, Pierre; Wareham, Nicholas J; Oostra, Ben A; Metspalu, Andres; Launer, Lenore; Rettig, Rainer; Strachan, David P; Beckmann, Jacques S; Witteman, Jacqueline CM; Erdmann, Jeanette; van Dijk, Ko Willems; Boerwinkle, Eric; Boehnke, Michael; Ridker, Paul M; Jarvelin, Marjo-Riitta; Chakravarti, Aravinda; Abecasis, Goncalo R; Gudnason, Vilmundur; Newton-Cheh, Christopher; Levy, Daniel; Munroe, Patricia B; Psaty, Bruce M; Caulfield, Mark J; Rao, Dabeeru C
Numerous genetic loci influence systolic blood pressure (SBP) and diastolic blood pressure (DBP) in Europeans 1-3. We now report genome-wide association studies of pulse pressure (PP) and mean arterial pressure (MAP). In discovery (N=74,064) and follow-up studies (N=48,607), we identified at genome-wide significance (P= 2.7×10-8 to P=2.3×10-13) four novel PP loci (at 4q12 near CHIC2/PDGFRAI, 7q22.3 near PIK3CG, 8q24.12 in NOV, 11q24.3 near ADAMTS-8), two novel MAP loci (3p21.31 in MAP4, 10q25.3 near ADRB1) and one locus associated with both traits (2q24.3 near FIGN) which has recently been associated with SBP in east Asians. For three of the novel PP signals, the estimated effect for SBP was opposite to that for DBP, in contrast to the majority of common SBP- and DBP-associated variants which show concordant effects on both traits. These findings indicate novel genetic mechanisms underlying blood pressure variation, including pathways that may differentially influence SBP and DBP. PMID:21909110
Xia, Jun Hong; Li, Hong Lian; Zhang, Yong; Meng, Zi Ning; Lin, Hao Ran
Fish species inhabitating seawater (SW) or freshwater (FW) habitats have to develop genetic adaptations to alternative environment factors, especially salinity. Functional consequences of the protein variations associated with habitat environments in fish mitochondrial genomes have not yet received much attention. We analyzed 829 complete fish mitochondrial genomes and compared the amino acid differences of 13 mitochondrial protein families between FW and SW fish groups. We identified 47 specificity determining sites (SDS) that associated with FW or SW environments from 12 mitochondrial protein families. Thirty-two (68%) of the SDS sites are hydrophobic, 13 (28%) are neutral, and the remaining sites are acidic or basic. Seven of those SDS from ND1, ND2 and ND5 were scored as probably damaging to the protein structures. Furthermore, phylogenetic tree based Bayes Empirical Bayes analysis also detected 63 positive sites associated with alternative habitat environments across ten mtDNA proteins. These signatures could be important for studying mitochondrial genetic variation relevant to fish physiology and ecology.
Li, Changgui; Li, Zhiqiang; Liu, Shiguo; Wang, Can; Han, Lin; Cui, Lingling; Zhou, Jingguo; Zou, Hejian; Liu, Zhen; Chen, Jianhua; Cheng, Xiaoyu; Zhou, Zhaowei; Ding, Chengcheng; Wang, Meng; Chen, Tong; Cui, Ying; He, Hongmei; Zhang, Keke; Yin, Congcong; Wang, Yunlong; Xing, Shichao; Li, Baojie; Ji, Jue; Jia, Zhaotong; Ma, Lidan; Niu, Jiapeng; Xin, Ying; Liu, Tian; Chu, Nan; Yu, Qing; Ren, Wei; Wang, Xuefeng; Zhang, Aiqing; Sun, Yuping; Wang, Haili; Lu, Jie; Li, Yuanyuan; Qing, Yufeng; Chen, Gang; Wang, Yangang; Zhou, Li; Niu, Haitao; Liang, Jun; Dong, Qian; Li, Xinde; Mi, Qing-Sheng; Shi, Yongyong
Gout is one of the most common types of inflammatory arthritis, caused by the deposition of monosodium urate crystals in and around the joints. Previous genome-wide association studies (GWASs) have identified many genetic loci associated with raised serum urate concentrations. However, hyperuricemia alone is not sufficient for the development of gout arthritis. Here we conduct a multistage GWAS in Han Chinese using 4,275 male gout patients and 6,272 normal male controls (1,255 cases and 1,848 controls were genome-wide genotyped), with an additional 1,644 hyperuricemic controls. We discover three new risk loci, 17q23.2 (rs11653176, P=1.36 × 10−13, BCAS3), 9p24.2 (rs12236871, P=1.48 × 10−10, RFX3) and 11p15.5 (rs179785, P=1.28 × 10−8, KCNQ1), which contain inflammatory candidate genes. Our results suggest that these loci are most likely related to the progression from hyperuricemia to inflammatory gout, which will provide new insights into the pathogenesis of gout arthritis. PMID:25967671
Li, Changgui; Li, Zhiqiang; Liu, Shiguo; Wang, Can; Han, Lin; Cui, Lingling; Zhou, Jingguo; Zou, Hejian; Liu, Zhen; Chen, Jianhua; Cheng, Xiaoyu; Zhou, Zhaowei; Ding, Chengcheng; Wang, Meng; Chen, Tong; Cui, Ying; He, Hongmei; Zhang, Keke; Yin, Congcong; Wang, Yunlong; Xing, Shichao; Li, Baojie; Ji, Jue; Jia, Zhaotong; Ma, Lidan; Niu, Jiapeng; Xin, Ying; Liu, Tian; Chu, Nan; Yu, Qing; Ren, Wei; Wang, Xuefeng; Zhang, Aiqing; Sun, Yuping; Wang, Haili; Lu, Jie; Li, Yuanyuan; Qing, Yufeng; Chen, Gang; Wang, Yangang; Zhou, Li; Niu, Haitao; Liang, Jun; Dong, Qian; Li, Xinde; Mi, Qing-Sheng; Shi, Yongyong
Gout is one of the most common types of inflammatory arthritis, caused by the deposition of monosodium urate crystals in and around the joints. Previous genome-wide association studies (GWASs) have identified many genetic loci associated with raised serum urate concentrations. However, hyperuricemia alone is not sufficient for the development of gout arthritis. Here we conduct a multistage GWAS in Han Chinese using 4,275 male gout patients and 6,272 normal male controls (1,255 cases and 1,848 controls were genome-wide genotyped), with an additional 1,644 hyperuricemic controls. We discover three new risk loci, 17q23.2 (rs11653176, P=1.36 × 10(-13), BCAS3), 9p24.2 (rs12236871, P=1.48 × 10(-10), RFX3) and 11p15.5 (rs179785, P=1.28 × 10(-8), KCNQ1), which contain inflammatory candidate genes. Our results suggest that these loci are most likely related to the progression from hyperuricemia to inflammatory gout, which will provide new insights into the pathogenesis of gout arthritis.
Trung Anh Trieu
Full Text Available Mucorales are an emerging group of human pathogens that are responsible for the lethal disease mucormycosis. Unfortunately, functional studies on the genetic factors behind the virulence of these organisms are hampered by their limited genetic tractability, since they are reluctant to classical genetic tools like transposable elements or gene mapping. Here, we describe an RNAi-based functional genomic platform that allows the identification of new virulence factors through a forward genetic approach firstly described in Mucorales. This platform contains a whole-genome collection of Mucor circinelloides silenced transformants that presented a broad assortment of phenotypes related to the main physiological processes in fungi, including virulence, hyphae morphology, mycelial and yeast growth, carotenogenesis and asexual sporulation. Selection of transformants with reduced virulence allowed the identification of mcplD, which encodes a Phospholipase D, and mcmyo5, encoding a probably essential cargo transporter of the Myosin V family, as required for a fully virulent phenotype of M. circinelloides. Knock-out mutants for those genes showed reduced virulence in both Galleria mellonella and Mus musculus models, probably due to a delayed germination and polarized growth within macrophages. This study provides a robust approach to study virulence in Mucorales and as a proof of concept identified new virulence determinants in M. circinelloides that could represent promising targets for future antifungal therapies.
Full Text Available BACKGROUND: With the availability of new generation sequencing technologies, bacterial genome projects have undergone a major boost. Still, chromosome completion needs a costly and time-consuming gap closure, especially when containing highly repetitive elements. However, incomplete genome data may be sufficiently informative to derive the pursued information. For emerging pathogens, i.e. newly identified pathogens, lack of release of genome data during gap closure stage is clearly medically counterproductive. METHODS/PRINCIPAL FINDINGS: We thus investigated the feasibility of a dirty genome approach, i.e. the release of unfinished genome sequences to develop serological diagnostic tools. We showed that almost the whole genome sequence of the emerging pathogen Parachlamydia acanthamoebae was retrieved even with relatively short reads from Genome Sequencer 20 and Solexa. The bacterial proteome was analyzed to select immunogenic proteins, which were then expressed and used to elaborate the first steps of an ELISA. CONCLUSIONS/SIGNIFICANCE: This work constitutes the proof of principle for a dirty genome approach, i.e. the use of unfinished genome sequences of pathogenic bacteria, coupled with proteomics to rapidly identify new immunogenic proteins useful to develop in the future specific diagnostic tests such as ELISA, immunohistochemistry and direct antigen detection. Although applied here to an emerging pathogen, this combined dirty genome sequencing/proteomic approach may be used for any pathogen for which better diagnostics are needed. These genome sequences may also be very useful to develop DNA based diagnostic tests. All these diagnostic tools will allow further evaluations of the pathogenic potential of this obligate intracellular bacterium.
Kristopher J. L. Irizarry
Full Text Available Comparative genomics approaches provide a means of leveraging functional genomics information from a highly annotated model organism’s genome (such as the mouse genome in order to make physiological inferences about the role of genes and proteins in a less characterized organism’s genome (such as the Burmese python. We employed a comparative genomics approach to produce the functional annotation of Python bivittatus genes encoding proteins associated with sperm phenotypes. We identify 129 gene-phenotype relationships in the python which are implicated in 10 specific sperm phenotypes. Results obtained through our systematic analysis identified subsets of python genes exhibiting associations with gene ontology annotation terms. Functional annotation data was represented in a semantic scatter plot. Together, these newly annotated Python bivittatus genome resources provide a high resolution framework from which the biology relating to reptile spermatogenesis, fertility, and reproduction can be further investigated. Applications of our research include (1 production of genetic diagnostics for assessing fertility in domestic and wild reptiles; (2 enhanced assisted reproduction technology for endangered and captive reptiles; and (3 novel molecular targets for biotechnology-based approaches aimed at reducing fertility and reproduction of invasive reptiles. Additional enhancements to reptile genomic resources will further enhance their value.
Zheng, Yonglan; Ogundiran, Temidayo O; Falusi, Adeyinka G; Nathanson, Katherine L; John, Esther M; Hennis, Anselm J M; Ambs, Stefan; Domchek, Susan M; Rebbeck, Timothy R; Simon, Michael S; Nemesure, Barbara; Wu, Suh-Yuh; Leske, Maria Cristina; Odetunde, Abayomi; Niu, Qun; Zhang, Jing; Afolabi, Chibuzor; Gamazon, Eric R; Cox, Nancy J; Olopade, Christopher O; Olopade, Olufunmilayo I; Huo, Dezheng
Numerous single nucleotide polymorphisms (SNPs) associated with breast cancer susceptibility have been identified by genome-wide association studies (GWAS). However, these SNPs were primarily discovered and validated in women of European and Asian ancestry. Because linkage disequilibrium is ancestry-dependent and heterogeneous among racial/ethnic populations, we evaluated common genetic variants at 22 GWAS-identified breast cancer susceptibility loci in a pooled sample of 1502 breast cancer cases and 1378 controls of African ancestry. None of the 22 GWAS index SNPs could be validated, challenging the direct generalizability of breast cancer risk variants identified in Caucasians or Asians to other populations. Novel breast cancer risk variants for women of African ancestry were identified in regions including 5p12 (odds ratio [OR] = 1.40, 95% confidence interval [CI] = 1.11-1.76; P = 0.004), 5q11.2 (OR = 1.22, 95% CI = 1.09-1.36; P = 0.00053) and 10p15.1 (OR = 1.22, 95% CI = 1.08-1.38; P = 0.0015). We also found positive association signals in three regions (6q25.1, 10q26.13 and 16q12.1-q12.2) previously confirmed by fine mapping in women of African ancestry. In addition, polygenic model indicated that eight best markers in this study, compared with 22 GWAS-identified SNPs, could better predict breast cancer risk in women of African ancestry (per-allele OR = 1.21, 95% CI = 1.16-1.27; P = 9.7 × 10(-16)). Our results demonstrate that fine mapping is a powerful approach to better characterize the breast cancer risk alleles in diverse populations. Future studies and new GWAS in women of African ancestry hold promise to discover additional variants for breast cancer susceptibility with clinical implications throughout the African diaspora.
Rudiger Hamm; Christiane Goebel
The development and support of clusters is an issue that became quite popular by players dealing with regional economic policy. But before a regional development agency can start to implement a cluster-oriented strategy there a two question that have to be answered: 1. What are the regional fields of competence (cluster potentials) that fulfill the requirements for a cluster-oriented regional development policy? 2. If you find such regional fields of competence, are the enterprises willing to...
Greally, John M
To test whether regions undergoing genomic imprinting have unique genomic characteristics, imprinted and nonimprinted human loci were compared for nucleotide and retroelement composition. Maternally and paternally expressed subgroups of imprinted genes were found to differ in terms of guanine and cytosine, CpG, and retroelement content, indicating a segregation into distinct genomic compartments. Imprinted regions have been normally permissive to L1 long interspersed transposable element retroposition during mammalian evolution but universally and significantly lack short interspersed transposable elements (SINEs). The primate-specific Alu SINEs, as well as the more ancient mammalian-wide interspersed repeat SINEs, are found at significantly low densities in imprinted regions. The latter paleogenomic signature indicates that the sequence characteristics of currently imprinted regions existed before the mammalian radiation. Transitions from imprinted to nonimprinted genomic regions in cis are characterized by a sharp inflection in SINE content, demonstrating that this genomic characteristic can help predict the presence and extent of regions undergoing imprinting. During primate evolution, SINE accumulation in imprinted regions occurred at a decreased rate compared with control loci. The constraint on SINE accumulation in imprinted regions may be mediated by an active selection process. This selection could be because of SINEs attracting and spreading methylation, as has been found at other loci. Methylation-induced silencing could lead to deleterious consequences at imprinted loci, where inactivation of one allele is already established, and expression is often essential for embryonic growth and survival.
Seung Hwan Lee
Full Text Available This genome-wide association study (GWAS was conducted to identify major loci that are significantly associated with carcass weight, and their effects, in order to provide increased understanding of the genetic architecture of carcass weight in Hanwoo. This genome-wide association study identified one major chromosome region ranging from 23 Mb to 25 Mb on chromosome 14 as being associated with carcass weight in Hanwoo. Significant Bonferroni-corrected genome-wide associations (P<1.52×10(-6 were detected for 6 Single Nucleotide Polymorphic (SNP loci for carcass weight on chromosome 14. The most significant SNP was BTB-01280026 (P = 4.02×10(-11, located in the 25 Mb region on Bos taurus autosome 14 (BTA14. The other 5 significant SNPs were Hapmap27934-BTC-065223 (P = 4.04×10(-11 in 25.2 Mb, BTB-01143580 (P = 6.35×10(-11 in 24.3 Mb, Hapmap30932-BTC-011225 (P = 5.92×10(-10 in 24.8 Mb, Hapmap27112-BTC-063342 (P = 5.18×10(-9 in 25.4 Mb, and Hapmap24414-BTC-073009 (P = 7.38×10(-8 in 25.4 Mb, all on BTA 14. One SNP (BTB-01143580; P = 6.35×10(-11 lies independently from the other 5 SNPs. The 5 SNPs that lie together showed a large Linkage disequilibrium (LD block (block size of 553 kb with LD coefficients ranging from 0.53 to 0.89 within the block. The most significant SNPs accounted for 6.73% to 10.55% of additive genetic variance, which is quite a large proportion of the total additive genetic variance. The most significant SNP (BTB-01280026; P = 4.02×10(-11 had 16.96 kg of allele substitution effect, and the second most significant SNP (Hapmap27934-BTC-065223; P = 4.04×10(-11 had 18.06 kg of effect on carcass weight, which correspond to 44% and 47%, respectively, of the phenotypic standard deviation for carcass weight in Hanwoo cattle. Our results demonstrated that carcass weight was affected by a major Quantitative Trait Locus (QTL with a large effect and by many SNPs with small effects that are normally
Full Text Available Cholangiocarcinoma (CCA is an aggressive malignancy of the bile ducts, with poor prognosis and limited treatment options. Here, we describe the integrated analysis of somatic mutations, RNA expression, copy number, and DNA methylation by The Cancer Genome Atlas of a set of predominantly intrahepatic CCA cases and propose a molecular classification scheme. We identified an IDH mutant-enriched subtype with distinct molecular features including low expression of chromatin modifiers, elevated expression of mitochondrial genes, and increased mitochondrial DNA copy number. Leveraging the multi-platform data, we observed that ARID1A exhibited DNA hypermethylation and decreased expression in the IDH mutant subtype. More broadly, we found that IDH mutations are associated with an expanded histological spectrum of liver tumors with molecular features that stratify with CCA. Our studies reveal insights into the molecular pathogenesis and heterogeneity of cholangiocarcinoma and provide classification information of potential therapeutic significance.
Garcia-Closas, Montserrat; Couch, Fergus J; Lindstrom, Sara
differences in genetic predisposition. To identify susceptibility loci specific to ER-negative disease, we combined in a meta-analysis 3 genome-wide association studies of 4,193 ER-negative breast cancer cases and 35,194 controls with a series of 40 follow-up studies (6,514 cases and 41,455 controls......), genotyped using a custom Illumina array, iCOGS, developed by the Collaborative Oncological Gene-environment Study (COGS). SNPs at four loci, 1q32.1 (MDM4, P = 2.1 × 10(-12) and LGR6, P = 1.4 × 10(-8)), 2p24.1 (P = 4.6 × 10(-8)) and 16q12.2 (FTO, P = 4.0 × 10(-8)), were associated with ER-negative but not ER...
Huang, Mingtao; Bai, Yunpeng; Sjostrom, Staffan L.
There is an increasing demand for biotech-based production of recombinant proteins for use as pharmaceuticals in the food and feed industry and in industrial applications. Yeast Saccharomyces cerevisiae is among preferred cell factories for recombinant protein production, and there is increasing...... interest in improving its protein secretion capacity. Due to the complexity of the secretory machinery in eukaryotic cells, it is difficult to apply rational engineering for construction of improved strains. Here we used high-throughput microfluidics for the screening of yeast libraries, generated by UV...... mutagenesis. Several screening and sorting rounds resulted in the selection of eight yeast clones with significantly improved secretion of recombinant a-amylase. Efficient secretion was genetically stable in the selected clones. We performed whole-genome sequencing of the eight clones and identified 330...
Li, Zhengcao; Chen, Jiucheng; Wang, Zhen; Pan, Yuchun; Wang, Qishan; Xu, Ningying; Wang, Zhengguang
Chinese pigs have been undergoing both natural and artificial selection for thousands of years. Jinhua pigs are of great importance, as they can be a valuable model for exploring the genetic mechanisms linked to meat quality and other traits such as disease resistance, reproduction and production. The purpose of this study was to identify distinctive footprints of selection between Jinhua pigs and other breeds utilizing genome-wide SNP data. Genotyping by genome reducing and sequencing was implemented in order to perform cross-population extended haplotype homozygosity to reveal strong signatures of selection for those economically important traits. This work was performed at a 2% genome level, which comprised 152 006 SNPs genotyped in a total of 517 individuals. Population-specific footprints of selective sweeps were searched for in the genome of Jinhua pigs using six native breeds and three European breeds as reference groups. Several candidate genes associated with meat quality, health and reproduction, such as GH1, CRHR2, TRAF4 and CCK, were found to be overlapping with the significantly positive outliers. Additionally, the results revealed that some genomic regions associated with meat quality, immune response and reproduction in Jinhua pigs have evolved directionally under domestication and subsequent selections. The identified genes and biological pathways in Jinhua pigs showed different selection patterns in comparison with the Chinese and European breeds. © 2016 Stichting International Foundation for Animal Genetics.
Felix, Janine F.; Bradfield, Jonathan P.; Monnereau, Claire; van der Valk, Ralf J.P.; Stergiakouli, Evie; Chesi, Alessandra; Gaillard, Romy; Feenstra, Bjarke; Thiering, Elisabeth; Kreiner-Møller, Eskil; Mahajan, Anubha; Pitkänen, Niina; Joro, Raimo; Cavadino, Alana; Huikari, Ville; Franks, Steve; Groen-Blokhuis, Maria M.; Cousminer, Diana L.; Marsh, Julie A.; Lehtimäki, Terho; Curtin, John A.; Vioque, Jesus; Ahluwalia, Tarunveer S.; Myhre, Ronny; Price, Thomas S.; Vilor-Tejedor, Natalia; Yengo, Loïc; Grarup, Niels; Ntalla, Ioanna; Ang, Wei; Atalay, Mustafa; Bisgaard, Hans; Blakemore, Alexandra I.; Bonnefond, Amelie; Carstensen, Lisbeth; Eriksson, Johan; Flexeder, Claudia; Franke, Lude; Geller, Frank; Geserick, Mandy; Hartikainen, Anna-Liisa; Haworth, Claire M.A.; Hirschhorn, Joel N.; Hofman, Albert; Holm, Jens-Christian; Horikoshi, Momoko; Hottenga, Jouke Jan; Huang, Jinyan; Kadarmideen, Haja N.; Kähönen, Mika; Kiess, Wieland; Lakka, Hanna-Maaria; Lakka, Timo A.; Lewin, Alexandra M.; Liang, Liming; Lyytikäinen, Leo-Pekka; Ma, Baoshan; Magnus, Per; McCormack, Shana E.; McMahon, George; Mentch, Frank D.; Middeldorp, Christel M.; Murray, Clare S.; Pahkala, Katja; Pers, Tune H.; Pfäffle, Roland; Postma, Dirkje S.; Power, Christine; Simpson, Angela; Sengpiel, Verena; Tiesler, Carla M. T.; Torrent, Maties; Uitterlinden, André G.; van Meurs, Joyce B.; Vinding, Rebecca; Waage, Johannes; Wardle, Jane; Zeggini, Eleftheria; Zemel, Babette S.; Dedoussis, George V.; Pedersen, Oluf; Froguel, Philippe; Sunyer, Jordi; Plomin, Robert; Jacobsson, Bo; Hansen, Torben; Gonzalez, Juan R.; Custovic, Adnan; Raitakari, Olli T.; Pennell, Craig E.; Widén, Elisabeth; Boomsma, Dorret I.; Koppelman, Gerard H.; Sebert, Sylvain; Järvelin, Marjo-Riitta; Hyppönen, Elina; McCarthy, Mark I.; Lindi, Virpi; Harri, Niinikoski; Körner, Antje; Bønnelykke, Klaus; Heinrich, Joachim; Melbye, Mads; Rivadeneira, Fernando; Hakonarson, Hakon; Ring, Susan M.; Smith, George Davey; Sørensen, Thorkild I.A.; Timpson, Nicholas J.; Grant, Struan F.A.; Jaddoe, Vincent W.V.
A large number of genetic loci are associated with adult body mass index. However, the genetics of childhood body mass index are largely unknown. We performed a meta-analysis of genome-wide association studies of childhood body mass index, using sex- and age-adjusted standard deviation scores. We included 35 668 children from 20 studies in the discovery phase and 11 873 children from 13 studies in the replication phase. In total, 15 loci reached genome-wide significance (P-value < 5 × 10−8) in the joint discovery and replication analysis, of which 12 are previously identified loci in or close to ADCY3, GNPDA2, TMEM18, SEC16B, FAIM2, FTO, TFAP2B, TNNI3K, MC4R, GPR61, LMX1B and OLFM4 associated with adult body mass index or childhood obesity. We identified three novel loci: rs13253111 near ELP3, rs8092503 near RAB27B and rs13387838 near ADAM23. Per additional risk allele, body mass index increased 0.04 Standard Deviation Score (SDS) [Standard Error (SE) 0.007], 0.05 SDS (SE 0.008) and 0.14 SDS (SE 0.025), for rs13253111, rs8092503 and rs13387838, respectively. A genetic risk score combining all 15 SNPs showed that each additional average risk allele was associated with a 0.073 SDS (SE 0.011, P-value = 3.12 × 10−10) increase in childhood body mass index in a population of 1955 children. This risk score explained 2% of the variance in childhood body mass index. This study highlights the shared genetic background between childhood and adult body mass index and adds three novel loci. These loci likely represent age-related differences in strength of the associations with body mass index. PMID:26604143
Full Text Available Summary: The emergence of influenza A viruses (IAVs from zoonotic reservoirs poses a great threat to human health. As seasonal vaccines are ineffective against zoonotic strains, and newly transmitted viruses can quickly acquire drug resistance, there remains a need for host-directed therapeutics against IAVs. Here, we performed a genome-scale CRISPR/Cas9 knockout screen in human lung epithelial cells with a human isolate of an avian H5N1 strain. Several genes involved in sialic acid biosynthesis and related glycosylation pathways were highly enriched post-H5N1 selection, including SLC35A1, a sialic acid transporter essential for IAV receptor expression and thus viral entry. Importantly, we have identified capicua (CIC as a negative regulator of cell-intrinsic immunity, as loss of CIC resulted in heightened antiviral responses and restricted replication of multiple viruses. Therefore, our study demonstrates that the CRISPR/Cas9 system can be utilized for the discovery of host factors critical for the replication of intracellular pathogens. : Using a genome-wide CRISPR/Cas9 screen, Han et al. demonstrate that the major hit, the sialic acid transporter SLC35A1, is an essential host factor for IAV entry. In addition, they identify the DNA-binding transcriptional repressor CIC as a negative regulator of cell-intrinsic immunity. Keywords: CRISPR/Cas9 screen, GeCKO, influenza virus, host factors, sialic acid pathway, SLC35A1, Capicua, CIC, cell-intrinsic immunity, H5N1
Full Text Available There is considerable evidence that human genetic variation influences gene expression. Genome-wide studies have revealed that mRNA levels are associated with genetic variation in or close to the gene coding for those mRNA transcripts - cis effects, and elsewhere in the genome - trans effects. The role of genetic variation in determining protein levels has not been systematically assessed. Using a genome-wide association approach we show that common genetic variation influences levels of clinically relevant proteins in human serum and plasma. We evaluated the role of 496,032 polymorphisms on levels of 42 proteins measured in 1200 fasting individuals from the population based InCHIANTI study. Proteins included insulin, several interleukins, adipokines, chemokines, and liver function markers that are implicated in many common diseases including metabolic, inflammatory, and infectious conditions. We identified eight Cis effects, including variants in or near the IL6R (p = 1.8x10(-57, CCL4L1 (p = 3.9x10(-21, IL18 (p = 6.8x10(-13, LPA (p = 4.4x10(-10, GGT1 (p = 1.5x10(-7, SHBG (p = 3.1x10(-7, CRP (p = 6.4x10(-6 and IL1RN (p = 7.3x10(-6 genes, all associated with their respective protein products with effect sizes ranging from 0.19 to 0.69 standard deviations per allele. Mechanisms implicated include altered rates of cleavage of bound to unbound soluble receptor (IL6R, altered secretion rates of different sized proteins (LPA, variation in gene copy number (CCL4L1 and altered transcription (GGT1. We identified one novel trans effect that was an association between ABO blood group and tumour necrosis factor alpha (TNF-alpha levels (p = 6.8x10(-40, but this finding was not present when TNF-alpha was measured using a different assay , or in a second study, suggesting an assay-specific association. Our results show that protein levels share some of the features of the genetics of gene expression. These include the presence of strong genetic effects in cis
Full Text Available Abstract Background The scope of our understanding of the evolutionary history between viruses and animals is limited. The fact that the recent availability of many complete insect virus genomes and vertebrate genomes as well as the ability to screen these sequences makes it possible to gain a new perspective insight into the evolutionary interaction between insect viruses and vertebrates. This study is to determine the possibility of existence of sequence identity between the genomes of insect viruses and vertebrates, attempt to explain this phenomenon in term of genetic mobile element, and try to investigate the evolutionary relationship between these short regions of identity among these species. Results Some of studied insect viruses contain variable numbers of short regions of sequence identity to the genomes of vertebrate with nucleotide sequence length from 28 bp to 124 bp. They are found to locate in multiple sites of the vertebrate genomes. The ontology of animal genes with identical regions involves in several processes including chromatin remodeling, regulation of apoptosis, signaling pathway, nerve system development and some enzyme-like catalysis. Phylogenetic analysis reveals that at least some short regions of sequence identity in the genomes of vertebrate are derived the ancestral of insect viruses. Conclusion Short regions of sequence identity were found in the vertebrates and insect viruses. These sequences played an important role not only in the long-term evolution of vertebrates, but also in promotion of insect virus. This typical win-win strategy may come from natural selection.
Fan, Gaowei; Li, Jinming
The scope of our understanding of the evolutionary history between viruses and animals is limited. The fact that the recent availability of many complete insect virus genomes and vertebrate genomes as well as the ability to screen these sequences makes it possible to gain a new perspective insight into the evolutionary interaction between insect viruses and vertebrates. This study is to determine the possibility of existence of sequence identity between the genomes of insect viruses and vertebrates, attempt to explain this phenomenon in term of genetic mobile element, and try to investigate the evolutionary relationship between these short regions of identity among these species. Some of studied insect viruses contain variable numbers of short regions of sequence identity to the genomes of vertebrate with nucleotide sequence length from 28 bp to 124 bp. They are found to locate in multiple sites of the vertebrate genomes. The ontology of animal genes with identical regions involves in several processes including chromatin remodeling, regulation of apoptosis, signaling pathway, nerve system development and some enzyme-like catalysis. Phylogenetic analysis reveals that at least some short regions of sequence identity in the genomes of vertebrate are derived the ancestral of insect viruses. Short regions of sequence identity were found in the vertebrates and insect viruses. These sequences played an important role not only in the long-term evolution of vertebrates, but also in promotion of insect virus. This typical win-win strategy may come from natural selection.
Meyer, Michael J; Geske, Philip; Yu, Haiyuan
Biological sequence databases are integral to efforts to characterize and understand biological molecules and share biological data. However, when analyzing these data, scientists are often left holding disparate biological currency-molecular identifiers from different databases. For downstream applications that require converting the identifiers themselves, there are many resources available, but analyzing associated loci and variants can be cumbersome if data is not given in a form amenable to particular analyses. Here we present BISQUE, a web server and customizable command-line tool for converting molecular identifiers and their contained loci and variants between different database conventions. BISQUE uses a graph traversal algorithm to generalize the conversion process for residues in the human genome, genes, transcripts and proteins, allowing for conversion across classes of molecules and in all directions through an intuitive web interface and a URL-based web service. BISQUE is freely available via the web using any major web browser (http://bisque.yulab.org/). Source code is available in a public GitHub repository (https://github.com/hyulab/BISQUE). firstname.lastname@example.org Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: email@example.com.
Cecilia M Lindgren
Full Text Available To identify genetic loci influencing central obesity and fat distribution, we performed a meta-analysis of 16 genome-wide association studies (GWAS, N = 38,580 informative for adult waist circumference (WC and waist-hip ratio (WHR. We selected 26 SNPs for follow-up, for which the evidence of association with measures of central adiposity (WC and/or WHR was strong and disproportionate to that for overall adiposity or height. Follow-up studies in a maximum of 70,689 individuals identified two loci strongly associated with measures of central adiposity; these map near TFAP2B (WC, P = 1.9x10(-11 and MSRA (WC, P = 8.9x10(-9. A third locus, near LYPLAL1, was associated with WHR in women only (P = 2.6x10(-8. The variants near TFAP2B appear to influence central adiposity through an effect on overall obesity/fat-mass, whereas LYPLAL1 displays a strong female-only association with fat distribution. By focusing on anthropometric measures of central obesity and fat distribution, we have identified three loci implicated in the regulation of human adiposity.
Knowles, Emma E M; Carless, Melanie A; de Almeida, Marcio A A; Curran, Joanne E; McKay, D Reese; Sprooten, Emma; Dyer, Thomas D; Göring, Harald H; Olvera, Rene; Fox, Peter; Almasy, Laura; Duggirala, Ravi; Kent, Jack W; Blangero, John; Glahn, David C
It is well established that risk for developing psychosis is largely mediated by the influence of genes, but identifying precisely which genes underlie that risk has been problematic. Focusing on endophenotypes, rather than illness risk, is one solution to this problem. Impaired cognition is a well-established endophenotype of psychosis. Here we aimed to characterize the genetic architecture of cognition using phenotypically detailed models as opposed to relying on general IQ or individual neuropsychological measures. In so doing we hoped to identify genes that mediate cognitive ability, which might also contribute to psychosis risk. Hierarchical factor models of genetically clustered cognitive traits were subjected to linkage analysis followed by QTL region-specific association analyses in a sample of 1,269 Mexican American individuals from extended pedigrees. We identified four genome wide significant QTLs, two for working and two for spatial memory, and a number of plausible and interesting candidate genes. The creation of detailed models of cognition seemingly enhanced the power to detect genetic effects on cognition and provided a number of possible candidate genes for psychosis. © 2013 Wiley Periodicals, Inc.
Full Text Available BACKGROUND: Hepatic insulin resistance impairs insulin's ability to suppress hepatic glucose production (HGP and contributes to the development of type 2 diabetes (T2D. Although the interests to discover novel genes that modulate insulin sensitivity and HGP are high, it remains challenging to have a human cell based system to identify novel genes. METHODOLOGY/PRINCIPAL FINDINGS: To identify genes that modulate hepatic insulin signaling and HGP, we generated a human cell line stably expressing beta-lactamase under the control of the human glucose-6-phosphatase (G6PC promoter (AH-G6PC cells. Both beta-lactamase activity and endogenous G6PC mRNA were increased in AH-G6PC cells by a combination of dexamethasone and pCPT-cAMP, and reduced by insulin. A 4-gene High-Throughput-Genomics assay was developed to concomitantly measure G6PC and pyruvate-dehydrogenase-kinase-4 (PDK4 mRNA levels. Using this assay, we screened an siRNA library containing pooled siRNA targeting 6650 druggable genes and identified 614 hits that lowered G6PC expression without increasing PDK4 mRNA levels. Pathway analysis indicated that siRNA-mediated knockdown (KD of genes known to positively or negatively affect insulin signaling increased or decreased G6PC mRNA expression, respectively, thus validating our screening platform. A subset of 270 primary screen hits was selected and 149 hits were confirmed by target gene KD by pooled siRNA and 7 single siRNA for each gene to reduce G6PC expression in 4-gene HTG assay. Subsequently, pooled siRNA KD of 113 genes decreased PEPCK and/or PGC1alpha mRNA expression thereby demonstrating their role in regulating key gluconeogenic genes in addition to G6PC. Last, KD of 61 of the above 113 genes potentiated insulin-stimulated Akt phosphorylation, suggesting that they suppress gluconeogenic gene by enhancing insulin signaling. CONCLUSIONS/SIGNIFICANCE: These results support the proposition that the proteins encoded by the genes identified in
Full Text Available Kernel starch content is an important trait in maize (Zea mays L. as it accounts for 65% to 75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60% to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001, among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437 is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops.
Full Text Available Abstract Background The integrative analysis of multiple genomics data often requires that genome coordinates-based signals have to be associated with proximal genes. The relative location of a genomic region with respect to the gene (gene area is important for functional data interpretation; hence algorithms that match regions to genes should be able to deliver insight into this information. Results In this work we review the tools that are publicly available for making region-to-gene associations. We also present a novel method, RGmatch, a flexible and easy-to-use Python tool that computes associations either at the gene, transcript, or exon level, applying a set of rules to annotate each region-gene association with the region location within the gene. RGmatch can be applied to any organism as long as genome annotation is available. Furthermore, we qualitatively and quantitatively compare RGmatch to other tools. Conclusions RGmatch simplifies the association of a genomic region with its closest gene. At the same time, it is a powerful tool because the rules used to annotate these associations are very easy to modify according to the researcher’s specific interests. Some important differences between RGmatch and other similar tools already in existence are RGmatch’s flexibility, its wide range of user options, compatibility with any annotatable organism, and its comprehensive and user-friendly output.
Full Text Available DNA methylation at CpG islands (CGIs is one of the most intensively studied epigenetic mechanisms. It is fundamental for cellular differentiation and control of transcriptional potential. DNA methylation is involved also in several processes that are central to evolutionary biology, including phenotypic plasticity and evolvability. In this study, we explored the relationship between CpG islands methylation and signatures of selective pressure in Homo Sapiens, using a computational biology approach. By analyzing methylation data of 25 cell lines from the Encyclopedia of DNA Elements (ENCODE Consortium, we compared the DNA methylation of CpG islands in genomic regions under selective pressure with the methylation of CpG islands in the remaining part of the genome. To define genomic regions under selective pressure, we used three different methods, each oriented to provide distinct information about selective events. Independently of the method and of the cell type used, we found evidences of undermethylation of CGIs in human genomic regions under selective pressure. Additionally, by analyzing SNP frequency in CpG islands, we demonstrated that CpG islands in regions under selective pressure show lower genetic variation. Our findings suggest that the CpG islands in regions under selective pressure seem to be somehow more "protected" from methylation when compared with other regions of the genome.
Okbay, Aysu; Baselmans, B.M.L. (Bart M.L.); Neve, Jan-Emmanuel; Turley, Patrick; Nivard, Michel; Fontana, M.A. (Mark Alan); Meddens, S.F.W. (S. Fleur W.); Linnér, R.K. (Richard Karlsson); Rietveld, C.A. (Cornelius A); Derringer, J.; Gratten, Jacob; Lee, James J.; Liu, J.Z. (Jimmy Z); Vlaming, Ronald; SAhluwalia, T. (Tarunveer)
textabstractVery few genetic variants have been associated with depression and neuroticism, likely because of limitations on sample size in previous studies. Subjective well-being, a phenotype that is genetically correlated with both of these traits, has not yet been studied with genome-wide data. We conducted genome-wide association studies of three phenotypes: subjective well-being (n = 298,420), depressive symptoms (n = 161,460), and neuroticism (n = 170,911). We identify 3 variants associ...
Wei, Guifang; Pan, Li; Du, Huimin; Chen, Junyi; Zhao, Liping
Bacterial populations common to healthy human guts may play important roles in human health. A new strategy for discovering genomic sequences as markers for these bacteria was developed using Enterobacterial Repetitive Intergenic Consensus (ERIC)-PCR fingerprinting. Structural features within microbial communities are compared with ERIC-PCR followed by DNA hybridization to identify genomic fragments shared by samples from healthy human individuals. ERIC-PCR profiles of fecal samples from 12 diseased or healthy human and piglet subjects demonstrated stable, unique banding patterns for each individual tested. Sequence homology of DNA fragments in bands of identical size was examined between samples by hybridization under high stringency conditions with DIG-labeled ERIC-PCR products derived from the fecal sample of one healthy child. Comparative analysis of the hybridization profiles with the original agarose fingerprints identified three predominant bands as signatures for populations associated with healthy human guts with sizes of 500, 800 and 1000 bp. Clone library profiling of the three bands produced 17 genome fragments, three of which showed high similarity only with regions of the Bacteroides thetaiotaomicron genome, while the remainder were orphan sequences. Association of these sequences with healthy guts was validated by sequence-selective PCR experiments, which showed that a single fragment was present in all 32 healthy humans and 13 healthy piglets tested. Two fragments were present in the healthy human group and in 18 children with non-infectious diarrhea but not in eight children with infectious diarrhea. Genome fragments identified with this novel strategy may be used as genome-specific markers for dynamic monitoring and sequence-guided isolation of functionally important bacterial populations in complex communities such as human gut microflora.
Burkholderia cenocepacia infection often leads to fatal cepacia syndrome in cystic fibrosis patients. However, antibiotic therapy rarely results in complete eradication of the pathogen due to its intrinsic resistance to many clinically available antibiotics. Recent attention has turned to the identification of essential genes as the proteins encoded by these genes may serve as potential targets for development of novel antimicrobials. In this study, we utilized TraDIS (Transposon Directed Insertion-site Sequencing) as a genome-wide screening tool to facilitate the identification of B. cenocepacia genes essential for its growth and viability. A transposon mutant pool consisting of approximately 500,000 mutants was successfully constructed, with more than 400,000 unique transposon insertion sites identified by computational analysis of TraDIS datasets. The saturated library allowed for the identification of 383 genes that were predicted to be essential in B. cenocepacia. We extended the application of TraDIS to identify conditionally essential genes required for in vitro growth and revealed an additional repertoire of 439 genes to be crucial for B. cenocepacia growth under nutrient-depleted conditions. The library of B. cenocepacia mutants can subsequently be subjected to various biologically related conditions to facilitate the discovery of genes involved in niche adaptation as well as pathogenicity and virulence.
Nicolas, Aude; Kenna, Kevin P; Renton, Alan E; Ticozzi, Nicola; Faghri, Faraz; Chia, Ruth; Dominov, Janice A; Kenna, Brendan J; Nalls, Mike A; Keagle, Pamela; Rivera, Alberto M; van Rheenen, Wouter; Murphy, Natalie A; van Vugt, Joke J F A; Geiger, Joshua T; Van der Spek, Rick A; Pliner, Hannah A; Shankaracharya; Smith, Bradley N; Marangi, Giuseppe; Topp, Simon D; Abramzon, Yevgeniya; Gkazi, Athina Soragia; Eicher, John D; Kenna, Aoife; Mora, Gabriele; Calvo, Andrea; Mazzini, Letizia; Riva, Nilo; Mandrioli, Jessica; Caponnetto, Claudia; Battistini, Stefania; Volanti, Paolo; La Bella, Vincenzo; Conforti, Francesca L; Borghero, Giuseppe; Messina, Sonia; Simone, Isabella L; Trojsi, Francesca; Salvi, Fabrizio; Logullo, Francesco O; D'Alfonso, Sandra; Corrado, Lucia; Capasso, Margherita; Ferrucci, Luigi; Moreno, Cristiane de Araujo Martins; Kamalakaran, Sitharthan; Goldstein, David B; Gitler, Aaron D; Harris, Tim; Myers, Richard M; Phatnani, Hemali; Musunuri, Rajeeva Lochan; Evani, Uday Shankar; Abhyankar, Avinash; Zody, Michael C; Kaye, Julia; Finkbeiner, Steven; Wyman, Stacia K; LeNail, Alex; Lima, Leandro; Fraenkel, Ernest; Svendsen, Clive N; Thompson, Leslie M; Van Eyk, Jennifer E; Berry, James D; Miller, Timothy M; Kolb, Stephen J; Cudkowicz, Merit; Baxi, Emily; Benatar, Michael; Taylor, J Paul; Rampersaud, Evadnie; Wu, Gang; Wuu, Joanne; Lauria, Giuseppe; Verde, Federico; Fogh, Isabella; Tiloca, Cinzia; Comi, Giacomo P; Sorarù, Gianni; Cereda, Cristina; Corcia, Philippe; Laaksovirta, Hannu; Myllykangas, Liisa; Jansson, Lilja; Valori, Miko; Ealing, John; Hamdalla, Hisham; Rollinson, Sara; Pickering-Brown, Stuart; Orrell, Richard W; Sidle, Katie C; Malaspina, Andrea; Hardy, John; Singleton, Andrew B; Johnson, Janel O; Arepalli, Sampath; Sapp, Peter C; McKenna-Yasek, Diane; Polak, Meraida; Asress, Seneshaw; Al-Sarraj, Safa; King, Andrew; Troakes, Claire; Vance, Caroline; de Belleroche, Jacqueline; Baas, Frank; Ten Asbroek, Anneloor L M A; Muñoz-Blanco, José Luis; Hernandez, Dena G; Ding, Jinhui; Gibbs, J Raphael; Scholz, Sonja W; Floeter, Mary Kay; Campbell, Roy H; Landi, Francesco; Bowser, Robert; Pulst, Stefan M; Ravits, John M; MacGowan, Daniel J L; Kirby, Janine; Pioro, Erik P; Pamphlett, Roger; Broach, James; Gerhard, Glenn; Dunckley, Travis L; Brady, Christopher B; Kowall, Neil W; Troncoso, Juan C; Le Ber, Isabelle; Mouzat, Kevin; Lumbroso, Serge; Heiman-Patterson, Terry D; Kamel, Freya; Van Den Bosch, Ludo; Baloh, Robert H; Strom, Tim M; Meitinger, Thomas; Shatunov, Aleksey; Van Eijk, Kristel R; de Carvalho, Mamede; Kooyman, Maarten; Middelkoop, Bas; Moisse, Matthieu; McLaughlin, Russell L; Van Es, Michael A; Weber, Markus; Boylan, Kevin B; Van Blitterswijk, Marka; Rademakers, Rosa; Morrison, Karen E; Basak, A Nazli; Mora, Jesús S; Drory, Vivian E; Shaw, Pamela J; Turner, Martin R; Talbot, Kevin; Hardiman, Orla; Williams, Kelly L; Fifita, Jennifer A; Nicholson, Garth A; Blair, Ian P; Rouleau, Guy A; Esteban-Pérez, Jesús; García-Redondo, Alberto; Al-Chalabi, Ammar; Rogaeva, Ekaterina; Zinman, Lorne; Ostrow, Lyle W; Maragakis, Nicholas J; Rothstein, Jeffrey D; Simmons, Zachary; Cooper-Knock, Johnathan; Brice, Alexis; Goutman, Stephen A; Feldman, Eva L; Gibson, Summer B; Taroni, Franco; Ratti, Antonia; Gellera, Cinzia; Van Damme, Philip; Robberecht, Wim; Fratta, Pietro; Sabatelli, Mario; Lunetta, Christian; Ludolph, Albert C; Andersen, Peter M; Weishaupt, Jochen H; Camu, William; Trojanowski, John Q; Van Deerlin, Vivianna M; Brown, Robert H; van den Berg, Leonard H; Veldink, Jan H; Harms, Matthew B; Glass, Jonathan D; Stone, David J; Tienari, Pentti; Silani, Vincenzo; Chiò, Adriano; Shaw, Christopher E; Traynor, Bryan J; Landers, John E
To identify novel genes associated with ALS, we undertook two lines of investigation. We carried out a genome-wide association study comparing 20,806 ALS cases and 59,804 controls. Independently, we performed a rare variant burden analysis comparing 1,138 index familial ALS cases and 19,494 controls. Through both approaches, we identified kinesin family member 5A (KIF5A) as a novel gene associated with ALS. Interestingly, mutations predominantly in the N-terminal motor domain of KIF5A are causative for two neurodegenerative diseases: hereditary spastic paraplegia (SPG10) and Charcot-Marie-Tooth type 2 (CMT2). In contrast, ALS-associated mutations are primarily located at the C-terminal cargo-binding tail domain and patients harboring loss-of-function mutations displayed an extended survival relative to typical ALS cases. Taken together, these results broaden the phenotype spectrum resulting from mutations in KIF5A and strengthen the role of cytoskeletal defects in the pathogenesis of ALS. Copyright © 2018 Elsevier Inc. All rights reserved.
Easton, Douglas F.; Pooley, Karen A.; Dunning, Alison M.; Pharoah, Paul D. P.; Thompson, Deborah; Ballinger, Dennis G.; Struewing, Jeffery P.; Morrison, Jonathan; Field, Helen; Luben, Robert; Wareham, Nicholas; Ahmed, Shahana; Healey, Catherine S.; Bowman, Richard; Meyer, Kerstin B.; Haiman, Christopher A.; Kolonel, Laurence K.; Henderson, Brian E.; Marchand, Loic Le; Brennan, Paul; Sangrajrang, Suleeporn; Gaborieau, Valerie; Odefrey, Fabrice; Shen, Chen-Yang; Wu, Pei-Ei; Wang, Hui-Chun; Eccles, Diana; Evans, D. Gareth; Peto, Julian; Fletcher, Olivia; Johnson, Nichola; Seal, Sheila; Stratton, Michael R.; Rahman, Nazneen; Chenevix-Trench, Georgia; Bojesen, Stig E.; Nordestgaard, Børge G.; Axelsson, Christen K.; Garcia-Closas, Montserrat; Brinton, Louise; Chanock, Stephen; Lissowska, Jolanta; Peplonska, Beata; Nevanlinna, Heli; Fagerholm, Rainer; Eerola, Hannaleena; Kang, Daehee; Yoo, Keun-Young; Noh, Dong-Young; Ahn, Sei-Hyun; Hunter, David J.; Hankinson, Susan E.; Cox, David G.; Hall, Per; Wedren, Sara; Liu, Jianjun; Low, Yen-Ling; Bogdanova, Natalia; Schürmann, Peter; Dörk, Thilo; Tollenaar, Rob A. E. M.; Jacobi, Catharina E.; Devilee, Peter; Klijn, Jan G. M.; Sigurdson, Alice J.; Doody, Michele M.; Alexander, Bruce H.; Zhang, Jinghui; Cox, Angela; Brock, Ian W.; MacPherson, Gordon; Reed, Malcolm W. R.; Couch, Fergus J.; Goode, Ellen L.; Olson, Janet E.; Meijers-Heijboer, Hanne; van den Ouweland, Ans; Uitterlinden, André; Rivadeneira, Fernando; Milne, Roger L.; Ribas, Gloria; Gonzalez-Neira, Anna; Benitez, Javier; Hopper, John L.; McCredie, Margaret; Southey, Melissa; Giles, Graham G.; Schroen, Chris; Justenhoven, Christina; Brauch, Hiltrud; Hamann, Ute; Ko, Yon-Dschun; Spurdle, Amanda B.; Beesley, Jonathan; Chen, Xiaoqing; Mannermaa, Arto; Kosma, Veli-Matti; Kataja, Vesa; Hartikainen, Jaana; Day, Nicholas E.; Cox, David R.; Ponder, Bruce A. J.; Luccarini, Craig; Conroy, Don; Shah, Mitul; Munday, Hannah; Jordan, Clare; Perkins, Barbara; West, Judy; Redman, Karen; Driver, Kristy; Aghmesheh, Morteza; Amor, David; Andrews, Lesley; Antill, Yoland; Armes, Jane; Armitage, Shane; Arnold, Leanne; Balleine, Rosemary; Begley, Glenn; Beilby, John; Bennett, Ian; Bennett, Barbara; Berry, Geoffrey; Blackburn, Anneke; Brennan, Meagan; Brown, Melissa; Buckley, Michael; Burke, Jo; Butow, Phyllis; Byron, Keith; Callen, David; Campbell, Ian; Chenevix-Trench, Georgia; Clarke, Christine; Colley, Alison; Cotton, Dick; Cui, Jisheng; Culling, Bronwyn; Cummings, Margaret; Dawson, Sarah-Jane; Dixon, Joanne; Dobrovic, Alexander; Dudding, Tracy; Edkins, Ted; Eisenbruch, Maurice; Farshid, Gelareh; Fawcett, Susan; Field, Michael; Firgaira, Frank; Fleming, Jean; Forbes, John; Friedlander, Michael; Gaff, Clara; Gardner, Mac; Gattas, Mike; George, Peter; Giles, Graham; Gill, Grantley; Goldblatt, Jack; Greening, Sian; Grist, Scott; Haan, Eric; Harris, Marion; Hart, Stewart; Hayward, Nick; Hopper, John; Humphrey, Evelyn; Jenkins, Mark; Jones, Alison; Kefford, Rick; Kirk, Judy; Kollias, James; Kovalenko, Sergey; Lakhani, Sunil; Leary, Jennifer; Lim, Jacqueline; Lindeman, Geoff; Lipton, Lara; Lobb, Liz; Maclurcan, Mariette; Mann, Graham; Marsh, Deborah; McCredie, Margaret; McKay, Michael; McLachlan, Sue Anne; Meiser, Bettina; Milne, Roger; Mitchell, Gillian; Newman, Beth; O'Loughlin, Imelda; Osborne, Richard; Peters, Lester; Phillips, Kelly; Price, Melanie; Reeve, Jeanne; Reeve, Tony; Richards, Robert; Rinehart, Gina; Robinson, Bridget; Rudzki, Barney; Salisbury, Elizabeth; Sambrook, Joe; Saunders, Christobel; Scott, Clare; Scott, Elizabeth; Scott, Rodney; Seshadri, Ram; Shelling, Andrew; Southey, Melissa; Spurdle, Amanda; Suthers, Graeme; Taylor, Donna; Tennant, Christopher; Thorne, Heather; Townshend, Sharron; Tucker, Kathy; Tyler, Janet; Venter, Deon; Visvader, Jane; Walpole, Ian; Ward, Robin; Waring, Paul; Warner, Bev; Warren, Graham; Watson, Elizabeth; Williams, Rachael; Wilson, Judy; Winship, Ingrid; Young, Mary Ann; Bowtell, David; Green, Adele; deFazio, Anna; Chenevix-Trench, Georgia; Gertig, Dorota; Webb, Penny
Breast cancer exhibits familial aggregation, consistent with variation in genetic susceptibility to the disease. Known susceptibility genes account for less than 25% of the familial risk of breast cancer, and the residual genetic variance is likely to be due to variants conferring more moderate risks. To identify further susceptibility alleles, we conducted a two-stage genome-wide association study in 4,398 breast cancer cases and 4,316 controls, followed by a third stage in which 30 single nucleotide polymorphisms (SNPs) were tested for confirmation in 21,860 cases and 22,578 controls from 22 studies. We used 227,876 SNPs that were estimated to correlate with 77% of known common SNPs in Europeans at r2>0.5. SNPs in five novel independent loci exhibited strong and consistent evidence of association with breast cancer (P<10−7). Four of these contain plausible causative genes (FGFR2, TNRC9, MAP3K1 and LSP1). At the second stage, 1,792 SNPs were significant at the P<0.05 level compared with an estimated 1,343 that would be expected by chance, indicating that many additional common susceptibility alleles may be identifiable by this approach. PMID:17529967
Full Text Available Hedgehog (Hh proteins are secreted molecules that function as organizers in animal development. In addition to being palmitoylated, Hh is the only metazoan protein known to possess a covalently-linked cholesterol moiety. The absence of either modification severely disrupts the organization of numerous tissues during development. It is currently not known how lipid-modified Hh is secreted and released from producing cells. We have performed a genome-wide RNAi screen in Drosophila melanogaster cells to identify regulators of Hh secretion. We found that cholesterol-modified Hh secretion is strongly dependent on coat protein complex I (COPI but not COPII vesicles, suggesting that cholesterol modification alters the movement of Hh through the early secretory pathway. We provide evidence that both proteolysis and cholesterol modification are necessary for the efficient trafficking of Hh through the ER and Golgi. Finally, we identified several putative regulators of protein secretion and demonstrate a role for some of these genes in Hh and Wingless (Wg morphogen secretion in vivo. These data open new perspectives for studying how morphogen secretion is regulated, as well as provide insight into regulation of lipid-modified protein secretion.
Wong, Yee-Chin; Abd El Ghany, Moataz; Naeem, Raeece; Lee, Kok-Wei; Tan, Yung-Chie; Pain, Arnab; Nathan, Sheila
Burkholderia cenocepacia infection often leads to fatal cepacia syndrome in cystic fibrosis patients. However, antibiotic therapy rarely results in complete eradication of the pathogen due to its intrinsic resistance to many clinically available antibiotics. Recent attention has turned to the identification of essential genes as the proteins encoded by these genes may serve as potential targets for development of novel antimicrobials. In this study, we utilized TraDIS (Transposon Directed Insertion-site Sequencing) as a genome-wide screening tool to facilitate the identification of B. cenocepacia genes essential for its growth and viability. A transposon mutant pool consisting of approximately 500,000 mutants was successfully constructed, with more than 400,000 unique transposon insertion sites identified by computational analysis of TraDIS datasets. The saturated library allowed for the identification of 383 genes that were predicted to be essential in B. cenocepacia. We extended the application of TraDIS to identify conditionally essential genes required for in vitro growth and revealed an additional repertoire of 439 genes to be crucial for B. cenocepacia growth under nutrient-depleted conditions. The library of B. cenocepacia mutants can subsequently be subjected to various biologically related conditions to facilitate the discovery of genes involved in niche adaptation as well as pathogenicity and virulence.
Kiel, Mark J; Velusamy, Thirunavukkarasu; Betz, Bryan L; Zhao, Lili; Weigelin, Helmut G; Chiang, Mark Y; Huebner-Chan, David R; Bailey, Nathanael G; Yang, David T; Bhagat, Govind; Miranda, Roberto N; Bahler, David W; Medeiros, L Jeffrey; Lim, Megan S; Elenitoba-Johnson, Kojo S J
Splenic marginal zone lymphoma (SMZL), the most common primary lymphoma of spleen, is poorly understood at the genetic level. In this study, using whole-genome DNA sequencing (WGS) and confirmation by Sanger sequencing, we observed mutations identified in several genes not previously known to be recurrently altered in SMZL. In particular, we identified recurrent somatic gain-of-function mutations in NOTCH2, a gene encoding a protein required for marginal zone B cell development, in 25 of 99 (∼25%) cases of SMZL and in 1 of 19 (∼5%) cases of nonsplenic MZLs. These mutations clustered near the C-terminal proline/glutamate/serine/threonine (PEST)-rich domain, resulting in protein truncation or, rarely, were nonsynonymous substitutions affecting the extracellular heterodimerization domain (HD). NOTCH2 mutations were not present in other B cell lymphomas and leukemias, such as chronic lymphocytic leukemia/small lymphocytic lymphoma (CLL/SLL; n = 15), mantle cell lymphoma (MCL; n = 15), low-grade follicular lymphoma (FL; n = 44), hairy cell leukemia (HCL; n = 15), and reactive lymphoid hyperplasia (n = 14). NOTCH2 mutations were associated with adverse clinical outcomes (relapse, histological transformation, and/or death) among SMZL patients (P = 0.002). These results suggest that NOTCH2 mutations play a role in the pathogenesis and progression of SMZL and are associated with a poor prognosis.
Full Text Available The intestinal epithelium is the most rapidly self-renewing tissue in adult animals and maintained by intestinal stem cells (ISCs in both Drosophila and mammals. To comprehensively identify genes and pathways that regulate ISC fates, we performed a genome-wide transgenic RNAi screen in adult Drosophila intestine and identified 405 genes that regulate ISC maintenance and lineage-specific differentiation. By integrating these genes into publicly available interaction databases, we further developed functional networks that regulate ISC self-renewal, ISC proliferation, ISC maintenance of diploid status, ISC survival, ISC-to-enterocyte (EC lineage differentiation, and ISC-to-enteroendocrine (EE lineage differentiation. By comparing regulators among ISCs, female germline stem cells, and neural stem cells, we found that factors related to basic stem cell cellular processes are commonly required in all stem cells, and stem-cell-specific, niche-related signals are required only in the unique stem cell type. Our findings provide valuable insights into stem cell maintenance and lineage-specific differentiation.
Full Text Available Burkholderia cenocepacia infection often leads to fatal cepacia syndrome in cystic fibrosis patients. However, antibiotic therapy rarely results in complete eradication of the pathogen due to its intrinsic resistance to many clinically available antibiotics. Recent attention has turned to the identification of essential genes as the proteins encoded by these genes may serve as potential targets for development of novel antimicrobials. In this study, we utilized TraDIS (Transposon Directed Insertion-site Sequencing as a genome-wide screening tool to facilitate the identification of B. cenocepacia genes essential for its growth and viability. A transposon mutant pool consisting of approximately 500,000 mutants was successfully constructed, with more than 400,000 unique transposon insertion sites identified by computational analysis of TraDIS datasets. The saturated library allowed for the identification of 383 genes that were predicted to be essential in B. cenocepacia. We extended the application of TraDIS to identify conditionally essential genes required for in vitro growth and revealed an additional repertoire of 439 genes to be crucial for B. cenocepacia growth under nutrient-depleted conditions. The library of B. cenocepacia mutants can subsequently be subjected to various biologically related conditions to facilitate the discovery of genes involved in niche adaptation as well as pathogenicity and virulence.
Michailidou, Kyriaki; Beesley, Jonathan; Lindstrom, Sara; Canisius, Sander; Dennis, Joe; Lush, Michael J.; Maranian, Mel J.; Bolla, Manjeet K.; Wang, Qin; Shah, Mitul; Perkins, Barbara J.; Czene, Kamila; Eriksson, Mikael; Darabi, Hatef; Brand, Judith S.; Bojesen, Stig E.; Nordestgaard, Borge G.; Flyger, Henrik; Nielsen, Sune F.; Rahman, Nazneen; Turnbull, Clare; Fletcher, Olivia; Peto, Julian; Gibson, Lorna; dos-Santos-Silva, Isabel; Chang-Claude, Jenny; Flesch-Janys, Dieter; Rudolph, Anja; Eilber, Ursula; Behrens, Sabine; Nevanlinna, Heli; Muranen, Taru A.; Aittomaki, Kristiina; Blomqvist, Carl; Khan, Sofia; Aaltonen, Kirsimari; Ahsan, Habibul; Kibriya, Muhammad G.; Whittemore, Alice S.; John, Esther M.; Malone, Kathleen E.; Gammon, Marilie D.; Santella, Regina M.; Ursin, Giske; Makalic, Enes; Schmidt, Daniel F.; Casey, Graham; Hunter, David J.; Gapstur, Susan M.; Gaudet, Mia M.; Diver, W. Ryan; Haiman, Christopher A.; Schumacher, Fredrick; Henderson, Brian E.; Le Marchand, Loic; Berg, Christine D.; Chanock, Stephen J.; Figueroa, Jonine; Hoover, Robert N.; Lambrechts, Diether; Neven, Patrick; Wildiers, Hans; van Limbergen, Erik; Schmidt, Marjanka K.; Broeks, Annegien; Verhoef, Senno; Cornelissen, Sten; Couch, Fergus J.; Olson, Janet E.; Hallberg, Emily; Vachon, Celine; Waisfisz, Quinten; Meijers-Heijboer, Hanne; Adank, Muriel A.; van der Luijt, Rob B.; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K.; Yoo, Keun-Young; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Tajima, Kazuo; Guenel, Pascal; Truong, Therese; Mulot, Claire; Sanchez, Marie; Burwinkel, Barbara; Marme, Frederik; Surowy, Harald; Sohn, Christof; Wu, Anna H.; Tseng, Chiu-chen; Van den Berg, David; Stram, Daniel O.; Gonzalez-Neira, Anna; Benitez, Javier; Zamora, M. Pilar; Arias Perez, Jose Ignacio; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Cox, Angela; Cross, Simon S.; Reed, Malcolm W. R.; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Mulligan, Anna Marie; Sawyer, Elinor J.; Tomlinson, Ian; Kerin, Michael J.; Miller, Nicola; Lindblom, Annika; Margolin, Sara; Teo, Soo Hwang; Yip, Cheng Har; Taib, Nur Aishah Mohd; Tan, Gie-Hooi; Hooning, Maartje J.; Hollestelle, Antoinette; Martens, John W. M.; Collee, J. Margriet; Blot, William; Signorello, Lisa B.; Cai, Qiuyin; Hopper, John L.; Southey, Melissa C.; Tsimiklis, Helen; Apicella, Carmel; Shen, Chen-Yang; Hsiung, Chia-Ni; Wu, Pei-Ei; Hou, Ming-Feng; Kristensen, Vessela N.; Nord, Silje; Alnaes, Grethe I. Grenaker; Giles, Graham G.; Milne, Roger L.; McLean, Catriona; Canzian, Federico; Trichopoulos, Dimitrios; Peeters, Petra; Lund, Eiliv; Sund, Malin; Khaw, Kay-Tee; Gunter, Marc J.; Palli, Domenico; Mortensen, Lotte Maxild; Dossus, Laure; Huerta, Jose-Maria; Meindl, Alfons; Schmutzler, Rita K.; Sutter, Christian; Yang, Rongxi; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Hartman, Mikael; Miao, Hui; Chia, Kee Seng; Chan, Ching Wan; Fasching, Peter A.; Hein, Alexander; Beckmann, Matthias W.; Haeberle, Lothar; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk J.; Swerdlow, Anthony J.; Brinton, Louise; Garcia-Closas, Montserrat; Zheng, Wei; Halverson, Sandra L.; Shrubsole, Martha; Long, Jirong; Goldberg, Mark S.; Labreche, France; Dumont, Martine; Winqvist, Robert; Pylkas, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Brauch, Hiltrud; Hamann, Ute; Bruening, Thomas; Radice, Paolo; Peterlongo, Paolo; Manoukian, Siranoush; Bernard, Loris; Bogdanova, Natalia V.; Doerk, Thilo; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Devilee, Peter; Tollenaar, Robert A. E. M.; Seynaeve, Caroline; Van Asperen, Christi J.; Jakubowska, Anna; Lubinski, Jan; Jaworska, Katarzyna; Huzarski, Tomasz; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; Mckay, James; Slager, Susan; Toland, Amanda E.; Ambrosone, Christine B.; Yannoukakos, Drakoulis; Kabisch, Maria; Torres, Diana; Neuhausen, Susan L.; Anton-Culver, Hoda; Luccarini, Craig; Baynes, Caroline; Ahmed, Shahana; Healey, Catherine S.; Tessier, Daniel C.; Vincent, Daniel; Bacot, Francois; Pita, Guillermo; Rosario Alonso, M.; Alvarez, Nuria; Herrero, Daniel; Simard, Jacques; Pharoah, Paul P. D. P.; Kraft, Peter; Dunning, Alison M.; Chenevix-Trench, Georgia; Hall, Per; Easton, Douglas F.
Genome-wide association studies (GWAS) and large-scale replication studies have identified common variants in 79 loci associated with breast cancer, explaining similar to 14% of the familial risk of the disease. To identify new susceptibility loci, we performed a meta-analysis of 11 GWAS, comprising
K. Michailidou (Kyriaki); J. Beesley (Jonathan); S. Lindstrom (Stephen); S. Canisius (Sander); J. Dennis (Joe); M. Lush (Michael); M. Maranian (Melanie); M.K. Bolla (Manjeet); Q. Wang (Qing); M. Shah (Mitul); B. Perkins (Barbara); K. Czene (Kamila); M. Eriksson (Mikael); H. Darabi (Hatef); J.S. Brand (Judith S.); S.E. Bojesen (Stig); B.G. Nordestgaard (Børge); H. Flyger (Henrik); S.F. Nielsen (Sune); N. Rahman (Nazneen); C. Turnbull (Clare); O. Fletcher (Olivia); J. Peto (Julian); L.J. Gibson (Lorna); I. dos Santos Silva (Isabel); J. Chang-Claude (Jenny); D. Flesch-Janys (Dieter); A. Rudolph (Anja); U. Eilber (Ursula); T.W. Behrens (Timothy); H. Nevanlinna (Heli); T.A. Muranen (Taru); K. Aittomäki (Kristiina); C. Blomqvist (Carl); S. Khan (Sofia); K. Aaltonen (Kirsimari); H. Ahsan (Habibul); M.G. Kibriya (Muhammad); A.S. Whittemore (Alice S.); E.M. John (Esther M.); K.E. Malone (Kathleen E.); M.D. Gammon (Marilie); R.M. Santella (Regina M.); G. Ursin (Giske); E. Makalic (Enes); D.F. Schmidt (Daniel); G. Casey (Graham); D.J. Hunter (David J.); S.M. Gapstur (Susan M.); M.M. Gaudet (Mia); W.R. Diver (Ryan); C.A. Haiman (Christopher A.); F.R. Schumacher (Fredrick); B.E. Henderson (Brian); L. Le Marchand (Loic); C.D. Berg (Christine); S.J. Chanock (Stephen); J.D. Figueroa (Jonine); R.N. Hoover (Robert N.); D. Lambrechts (Diether); P. Neven (Patrick); H. Wildiers (Hans); E. van Limbergen (Erik); M.K. Schmidt (Marjanka); A. Broeks (Annegien); S. Verhoef; S. Cornelissen (Sten); F.J. Couch (Fergus); J.E. Olson (Janet); B. Hallberg (Boubou); C. Vachon (Celine); Q. Waisfisz (Quinten); E.J. Meijers-Heijboer (Hanne); M.A. Adank (Muriel); R.B. van der Luijt (Rob); J. Li (Jingmei); J. Liu (Jianjun); M.K. Humphreys (Manjeet); D. Kang (Daehee); J.-Y. Choi (Ji-Yeob); S.K. Park (Sue K.); K.Y. Yoo; K. Matsuo (Keitaro); H. Ito (Hidemi); H. Iwata (Hiroji); K. Tajima (Kazuo); P. Guénel (Pascal); T. Truong (Thérèse); C. Mulot (Claire); M. Sanchez (Marie); B. Burwinkel (Barbara); F. Marme (Federick); H. Surowy (Harald); C. Sohn (Christof); A.H. Wu (Anna H); C.-C. Tseng (Chiu-chen); D. Van Den Berg (David); D.O. Stram (Daniel O.); A. González-Neira (Anna); J. Benítez (Javier); M.P. Zamora (Pilar); J.I.A. Perez (Jose Ignacio Arias); X.-O. Shu (Xiao-Ou); W. Lu (Wei); Y. Gao; H. Cai (Hui); A. Cox (Angela); S.S. Cross (Simon); M.W.R. Reed (Malcolm); I.L. Andrulis (Irene); J.A. Knight (Julia); G. Glendon (Gord); A.-M. Mulligan (Anna-Marie); E.J. Sawyer (Elinor); I.P. Tomlinson (Ian); M. Kerin (Michael); N. Miller (Nicola); A. Lindblom (Annika); S. Margolin (Sara); S.H. Teo (Soo Hwang); C.H. Yip (Cheng Har); N.A.M. Taib (Nur Aishah Mohd); G.-H. Tan (Gie-Hooi); M.J. Hooning (Maartje); A. Hollestelle (Antoinette); J.W.M. Martens (John); J.M. Collée (Margriet); W.J. Blot (William); L.B. Signorello (Lisa B.); Q. Cai (Qiuyin); J. Hopper (John); M.C. Southey (Melissa); H. Tsimiklis (Helen); C. Apicella (Carmel); C-Y. Shen (Chen-Yang); C.-N. Hsiung (Chia-Ni); P.-E. Wu (Pei-Ei); M.-F. Hou (Ming-Feng); V. Kristensen (Vessela); S. Nord (Silje); G.G. Alnæs (Grethe); G.G. Giles (Graham G.); R.L. Milne (Roger); C.A. McLean (Catriona Ann); F. Canzian (Federico); D. Trichopoulos (Dimitrios); P.H.M. Peeters; E. Lund (Eiliv); R. Sund (Reijo); K.T. Khaw; M.J. Gunter (Marc J.); D. Palli (Domenico); L.M. Mortensen (Lotte Maxild); L. Dossus (Laure); J.-M. Huerta (Jose-Maria); A. Meindl (Alfons); R.K. Schmutzler (Rita); C. Sutter (Christian); R. Yang (Rongxi); K. Muir (Kenneth); A. Lophatananon (Artitaya); S. Stewart-Brown (Sarah); P. Siriwanarangsan (Pornthep); J.M. Hartman (Joost); X. Miao; K.S. Chia (Kee Seng); C.W. Chan (Ching Wan); P.A. Fasching (Peter); R. Hein (Rebecca); M.W. Beckmann (Matthias); L. Haeberle (Lothar); H. Brenner (Hermann); A.K. Dieffenbach (Aida Karina); V. Arndt (Volker); C. Stegmaier (Christa); A. Ashworth (Alan); N. Orr (Nick); M. Schoemaker (Minouk); A.J. Swerdlow (Anthony ); L.A. Brinton (Louise); M. García-Closas (Montserrat); W. Zheng (Wei); S.L. Halverson (Sandra L.); M. Shrubsole (Martha); J. Long (Jirong); M.S. Goldberg (Mark); F. Labrèche (France); M. Dumont (Martine); R. Winqvist (Robert); K. Pykäs (Katri); A. Jukkola-Vuorinen (Arja); M. Grip (Mervi); H. Brauch (Hiltrud); U. Hamann (Ute); T. Brüning (Thomas); P. Radice (Paolo); P. Peterlongo (Paolo); S. Manoukian (Siranoush); L. Bernard (Loris); N.V. Bogdanova (Natalia); T. Dörk (Thilo); A. Mannermaa (Arto); V. Kataja (Vesa); V-M. Kosma (Veli-Matti); J.M. Hartikainen (J.); P. Devilee (Peter); R.A.E.M. Tollenaar (Rob); C.M. Seynaeve (Caroline); C.J. van Asperen (Christi); A. Jakubowska (Anna); J. Lubinski (Jan); K. Jaworska (Katarzyna); T. Huzarski (Tomasz); S. Sangrajrang (Suleeporn); V. Gaborieau (Valerie); P. Brennan (Paul); J.D. McKay (James); S. Slager (Susan); A.E. Toland (Amanda); C.B. Ambrosone (Christine); D. Yannoukakos (Drakoulis); M. Kabisch (Maria); D. Torres (Diana); S.L. Neuhausen (Susan); H. Anton-Culver (Hoda); C. Luccarini (Craig); C. Baynes (Caroline); S. Ahmed (Shahana); S. Healey (Sue); D.C. Tessier (Daniel C.); D. Vincent (Daniel); F. Bacot (Francois); G. Pita (Guillermo); M.R. Alonso (Rosario); N. Álvarez (Nuria); D. Herrero (Daniel); J. Simard (Jacques); P.P.D.P. Pharoah (Paul P.D.P.); P. Kraft (Peter); A.M. Dunning (Alison); G. Chenevix-Trench (Georgia); P. Hall (Per); D.F. Easton (Douglas)
textabstractGenome-wide association studies (GWAS) and large-scale replication studies have identified common variants in 79 loci associated with breast cancer, explaining ∼14% of the familial risk of the disease. To identify new susceptibility loci, we performed a meta-analysis of 11 GWAS,
Deelen, Joris; Beekman, Marian; Uh, Hae-Won
By studying the loci which contribute to human longevity, we aim to identify mechanisms that contribute to healthy aging. To identify such loci, we performed a genome-wide association study (GWAS) comparing 403 unrelated nonagenarians from long-living families included in the Leiden Longevity Stu...
Full Text Available Twenty-six Salmonella enterica serovar Eko isolated from various sources in Nigeria were investigated by whole genome sequencing to identify the source of human infections. Diversity among the isolates was observed and camel and cattle were identified as the primary reservoirs and the most likely source of the human infections.
Leekitcharoenphon, Pimlapas; Raufu, Ibrahim; Thorup Nielsen, Mette
Twenty-six Salmonella enterica serovar Eko isolated from various sources in Nigeria were investigated by whole genome sequencing to identify the source of human infections. Diversity among the isolates was observed and camel and cattle were identified as the primary reservoirs and the most likely...
Michailidou, Kyriaki; Beesley, Jonathan; Lindstrom, Sara
Genome-wide association studies (GWAS) and large-scale replication studies have identified common variants in 79 loci associated with breast cancer, explaining ∼14% of the familial risk of the disease. To identify new susceptibility loci, we performed a meta-analysis of 11 GWAS, comprising 15,748...
Cho, Michael H; Castaldi, Peter J; Wan, Emily S
The genetic risk factors for chronic obstructive pulmonary disease (COPD) are still largely unknown. To date, genome-wide association studies (GWASs) of limited size have identified several novel risk loci for COPD at CHRNA3/CHRNA5/IREB2, HHIP and FAM13A; additional loci may be identified through...
Conall M O'Seaghdha
Full Text Available Calcium is vital to the normal functioning of multiple organ systems and its serum concentration is tightly regulated. Apart from CASR, the genes associated with serum calcium are largely unknown. We conducted a genome-wide association meta-analysis of 39,400 individuals from 17 population-based cohorts and investigated the 14 most strongly associated loci in ≤ 21,679 additional individuals. Seven loci (six new regions in association with serum calcium were identified and replicated. Rs1570669 near CYP24A1 (P = 9.1E-12, rs10491003 upstream of GATA3 (P = 4.8E-09 and rs7481584 in CARS (P = 1.2E-10 implicate regions involved in Mendelian calcemic disorders: Rs1550532 in DGKD (P = 8.2E-11, also associated with bone density, and rs7336933 near DGKH/KIAA0564 (P = 9.1E-10 are near genes that encode distinct isoforms of diacylglycerol kinase. Rs780094 is in GCKR. We characterized the expression of these genes in gut, kidney, and bone, and demonstrate modulation of gene expression in bone in response to dietary calcium in mice. Our results shed new light on the genetics of calcium homeostasis.
Aswad, Amr; Katzourakis, Aris
Herpesviridae is a diverse family of large and complex pathogens whose genomes are extremely difficult to sequence. This is particularly true for clinical samples, and if the virus, host, or both genomes are being sequenced for the first time. Although herpesviruses are known to occasionally integrate in host genomes, and can also be inherited in a Mendelian fashion, they are notably absent from the genomic fossil record comprised of endogenous viral elements (EVEs). Here, we combine paleovirological and metagenomic approaches to both explore the constituent viral diversity of mammalian genomes and search for endogenous herpesviruses. We describe the first endogenous herpesvirus from the genome of the Philippine tarsier, belonging to the Roseolovirus genus, and characterize its highly defective genome that is integrated and flanked by unambiguous host DNA. From a draft assembly of the aye-aye genome, we use bioinformatic tools to reveal over 100,000 bp of a novel rhadinovirus that is the first lemur gammaherpesvirus, closely related to Kaposi's sarcoma-associated virus. We also identify 58 genes of Pan paniscus lymphocryptovirus 1, the bonobo equivalent of human Epstein-Barr virus. For each of the viruses, we postulate gene function via comparative analysis to known viral relatives. Most notably, the evidence from gene content and phylogenetics suggests that the aye-aye sequences represent the most basal known rhadinovirus, and indicates that tumorigenic herpesviruses have been infecting primates since their emergence in the late Cretaceous. Overall, these data show that a genomic fossil record of herpesviruses exists despite their extremely large genomes, and expands the known diversity of Herpesviridae, which will aid the characterization of pathogenesis. Our analytical approach illustrates the benefit of intersecting evolutionary approaches with metagenomics, genetics and paleovirology. PMID:24945689
Full Text Available Abstract Background Temperature adaptation is one of the most important determinants of distribution and population size of organisms in nature. Recently, quantitative trait loci (QTL mapping and gene expression profiling approaches have been used for detecting candidate genes for heat resistance. However, the resolution of QTL mapping is not high enough to examine the individual effects of various genes in each QTL. Heat stress-responsive genes, characterized by gene expression profiling studies, are not necessarily responsible for heat resistance. Some of these genes may be regulated in association with the heat stress response of other genes. Results To evaluate which heat-responsive genes are potential candidates for heat resistance with higher resolution than previous QTL mapping studies, we performed genome-wide deficiency screen for QTL for heat resistance. We screened 439 isogenic deficiency strains from the DrosDel project, covering 65.6% of the Drosophila melanogaster genome in order to map QTL for thermal resistance. As a result, we found 19 QTL for heat resistance, including 3 novel QTL outside the QTL found in previous studies. Conclusion The QTL found in this study encompassed 19 heat-responsive genes found in the previous gene expression profiling studies, suggesting that they were strong candidates for heat resistance. This result provides new insights into the genetic architecture of heat resistance. It also emphasizes the advantages of genome-wide deficiency screen using isogenic deficiency libraries.
Yang, Xiaowen; Li, Yajie; Zang, Juan; Li, Yexia; Bie, Pengfei; Lu, Yanli; Wu, Qingmin
Brucella spp. are facultative intracellular pathogens, that cause a contagious zoonotic disease, that can result in such outcomes as abortion or sterility in susceptible animal hosts and grave, debilitating illness in humans. For deciphering the survival mechanism of Brucella spp. in vivo, 42 Brucella complete genomes from NCBI were analyzed for the pan-genome and core genome by identification of their composition and function of Brucella genomes. The results showed that the total 132,143 protein-coding genes in these genomes were divided into 5369 clusters. Among these, 1710 clusters were associated with the core genome, 1182 clusters with strain-specific genes and 2477 clusters with dispensable genomes. COG analysis indicated that 44 % of the core genes were devoted to metabolism, which were mainly responsible for energy production and conversion (COG category C), and amino acid transport and metabolism (COG category E). Meanwhile, approximately 35 % of the core genes were in positive selection. In addition, 1252 potential essential genes were predicted in the core genome by comparison with a prokaryote database of essential genes. The results suggested that the core genes in Brucella genomes are relatively conservation, and the energy and amino acid metabolism play a more important role in the process of growth and reproduction in Brucella spp. This study might help us to better understand the mechanisms of Brucella persistent infection and provide some clues for further exploring the gene modules of the intracellular survival in Brucella spp.
Hu, Zheng; Zhu, Da; Wang, Wei
Human papillomavirus (HPV) integration is a key genetic event in cervical carcinogenesis1. By conducting whole-genome sequencing and high-throughput viral integration detection, we identified 3,667 HPV integration breakpoints in 26 cervical intraepithelial neoplasias, 104 cervical carcinomas and ...
Bidon, Tobias; Schreck, Nancy; Hailer, Frank; Nilsson, Maria A; Janke, Axel
The male-inherited Y chromosome is the major haploid fraction of the mammalian genome, rendering Y-linked sequences an indispensable resource for evolutionary research. However, despite recent large-scale genome sequencing approaches, only a handful of Y chromosome sequences have been characterized to date, mainly in model organisms. Using polar bear (Ursus maritimus) genomes, we compare two different in silico approaches to identify Y-linked sequences: 1) Similarity to known Y-linked genes and 2) difference in the average read depth of autosomal versus sex chromosomal scaffolds. Specifically, we mapped available genomic sequencing short reads from a male and a female polar bear against the reference genome and identify 112 Y-chromosomal scaffolds with a combined length of 1.9 Mb. We verified the in silico findings for the longer polar bear scaffolds by male-specific in vitro amplification, demonstrating the reliability of the average read depth approach. The obtained Y chromosome sequences contain protein-coding sequences, single nucleotide polymorphisms, microsatellites, and transposable elements that are useful for evolutionary studies. A high-resolution phylogeny of the polar bear patriline shows two highly divergent Y chromosome lineages, obtained from analysis of the identified Y scaffolds in 12 previously published male polar bear genomes. Moreover, we find evidence of gene conversion among ZFX and ZFY sequences in the giant panda lineage and in the ancestor of ursine and tremarctine bears. Thus, the identification of Y-linked scaffold sequences from unordered genome sequences yields valuable data to infer phylogenomic and population-genomic patterns in bears. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Bidon, Tobias; Schreck, Nancy; Hailer, Frank; Nilsson, Maria A.; Janke, Axel
The male-inherited Y chromosome is the major haploid fraction of the mammalian genome, rendering Y-linked sequences an indispensable resource for evolutionary research. However, despite recent large-scale genome sequencing approaches, only a handful of Y chromosome sequences have been characterized to date, mainly in model organisms. Using polar bear (Ursus maritimus) genomes, we compare two different in silico approaches to identify Y-linked sequences: 1) Similarity to known Y-linked genes and 2) difference in the average read depth of autosomal versus sex chromosomal scaffolds. Specifically, we mapped available genomic sequencing short reads from a male and a female polar bear against the reference genome and identify 112 Y-chromosomal scaffolds with a combined length of 1.9 Mb. We verified the in silico findings for the longer polar bear scaffolds by male-specific in vitro amplification, demonstrating the reliability of the average read depth approach. The obtained Y chromosome sequences contain protein-coding sequences, single nucleotide polymorphisms, microsatellites, and transposable elements that are useful for evolutionary studies. A high-resolution phylogeny of the polar bear patriline shows two highly divergent Y chromosome lineages, obtained from analysis of the identified Y scaffolds in 12 previously published male polar bear genomes. Moreover, we find evidence of gene conversion among ZFX and ZFY sequences in the giant panda lineage and in the ancestor of ursine and tremarctine bears. Thus, the identification of Y-linked scaffold sequences from unordered genome sequences yields valuable data to infer phylogenomic and population-genomic patterns in bears. PMID:26019166
Adam H Freedman
Full Text Available Controlling for background demographic effects is important for accurately identifying loci that have recently undergone positive selection. To date, the effects of demography have not yet been explicitly considered when identifying loci under selection during dog domestication. To investigate positive selection on the dog lineage early in the domestication, we examined patterns of polymorphism in six canid genomes that were previously used to infer a demographic model of dog domestication. Using an inferred demographic model, we computed false discovery rates (FDR and identified 349 outlier regions consistent with positive selection at a low FDR. The signals in the top 100 regions were frequently centered on candidate genes related to brain function and behavior, including LHFPL3, CADM2, GRIK3, SH3GL2, MBP, PDE7B, NTAN1, and GLRA1. These regions contained significant enrichments in behavioral ontology categories. The 3rd top hit, CCRN4L, plays a major role in lipid metabolism, that is supported by additional metabolism related candidates revealed in our scan, including SCP2D1 and PDXC1. Comparing our method to an empirical outlier approach that does not directly account for demography, we found only modest overlaps between the two methods, with 60% of empirical outliers having no overlap with our demography-based outlier detection approach. Demography-aware approaches have lower-rates of false discovery. Our top candidates for selection, in addition to expanding the set of neurobehavioral candidate genes, include genes related to lipid metabolism, suggesting a dietary target of selection that was important during the period when proto-dogs hunted and fed alongside hunter-gatherers.
Feng, Wenyi; Collingwood, David; Boeck, Max E; Fox, Lindsay A; Alvino, Gina M; Fangman, Walton L; Raghuraman, Mosur K; Brewer, Bonita J
During DNA replication one or both strands transiently become single stranded: first at the sites where initiation of DNA synthesis occurs (known as origins of replication) and subsequently on the lagging strands of replication forks as discontinuous Okazaki fragments are generated. We report a genome-wide analysis of single-stranded DNA (ssDNA) formation in the presence of hydroxyurea during DNA replication in wild-type and checkpoint-deficient rad53 Saccharomyces cerevisiae cells. In wild-type cells, ssDNA was first observed at a subset of replication origins and later 'migrated' bi-directionally, suggesting that ssDNA formation is associated with continuously moving replication forks. In rad53 cells, ssDNA was observed at virtually every known origin, but remained there over time, suggesting that replication forks stall. Telomeric regions seemed to be particularly sensitive to the loss of Rad53 checkpoint function. Replication origins in Schizosaccharomyces pombe were also mapped using our method.
Rice, K L; Lin, X; Wolniak, K; Ebert, B L; Berkofsky-Fessler, W; Buzzai, M; Sun, Y; Xi, C; Elkin, P; Levine, R; Golub, T; Gilliland, D G; Crispino, J D; Licht, J D; Zhang, W
Polycythemia vera (PV), essential thrombocythemia and primary myelofibrosis, are myeloproliferative neoplasms (MPNs) with distinct clinical features and are associated with the JAK2V617F mutation. To identify genomic anomalies involved in the pathogenesis of these disorders, we profiled 87 MPN patients using Affymetrix 250K single-nucleotide polymorphism (SNP) arrays. Aberrations affecting chr9 were the most frequently observed and included 9pLOH (n=16), trisomy 9 (n=6) and amplifications of 9p13.3–23.3 (n=1), 9q33.1–34.13 (n=1) and 9q34.13 (n=6). Patients with trisomy 9 were associated with elevated JAK2V617F mutant allele burden, suggesting that gain of chr9 represents an alternative mechanism for increasing JAK2V617F dosage. Gene expression profiling of patients with and without chr9 abnormalities (+9, 9pLOH), identified genes potentially involved in disease pathogenesis including JAK2, STAT5B and MAPK14. We also observed recurrent gains of 1p36.31–36.33 (n=6), 17q21.2–q21.31 (n=5) and 17q25.1–25.3 (n=5) and deletions affecting 18p11.31–11.32 (n=8). Combined SNP and gene expression analysis identified aberrations affecting components of a non-canonical PRC2 complex (EZH1, SUZ12 and JARID2) and genes comprising a ‘HSC signature' (MLLT3, SMARCA2 and PBX1). We show that NFIB, which is amplified in 7/87 MPN patients and upregulated in PV CD34+ cells, protects cells from apoptosis induced by cytokine withdrawal
Blackmon Barbara P
Full Text Available Abstract Background BAC-based physical maps provide for sequencing across an entire genome or a selected sub-genomic region of biological interest. Such a region can be approached with next-generation whole-genome sequencing and assembly as if it were an independent small genome. Using the minimum tiling path as a guide, specific BAC clones representing the prioritized genomic interval are selected, pooled, and used to prepare a sequencing library. Results This pooled BAC approach was taken to sequence and assemble a QTL-rich region, of ~3 Mbp and represented by twenty-seven BACs, on linkage group 5 of the Theobroma cacao cv. Matina 1-6 genome. Using various mixtures of read coverages from paired-end and linear 454 libraries, multiple assemblies of varied quality were generated. Quality was assessed by comparing the assembly of 454 reads with a subset of ten BACs individually sequenced and assembled using Sanger reads. A mixture of reads optimal for assembly was identified. We found, furthermore, that a quality assembly suitable for serving as a reference genome template could be obtained even with a reduced depth of sequencing coverage. Annotation of the resulting assembly revealed several genes potentially responsible for three T. cacao traits: black pod disease resistance, bean shape index, and pod weight. Conclusions Our results, as with other pooled BAC sequencing reports, suggest that pooling portions of a minimum tiling path derived from a BAC-based physical map is an effective method to target sub-genomic regions for sequencing. While we focused on a single QTL region, other QTL regions of importance could be similarly sequenced allowing for biological discovery to take place before a high quality whole-genome assembly is completed.
Christina L. Zheng
Full Text Available Somatic mutations in cancer are more frequent in heterochromatic and late-replicating regions of the genome. We report that regional disparities in mutation density are virtually abolished within transcriptionally silent genomic regions of cutaneous squamous cell carcinomas (cSCCs arising in an XPC−/− background. XPC−/− cells lack global genome nucleotide excision repair (GG-NER, thus establishing differential access of DNA repair machinery within chromatin-rich regions of the genome as the primary cause for the regional disparity. Strikingly, we find that increasing levels of transcription reduce mutation prevalence on both strands of gene bodies embedded within H3K9me3-dense regions, and only to those levels observed in H3K9me3-sparse regions, also in an XPC-dependent manner. Therefore, transcription appears to reduce mutation prevalence specifically by relieving the constraints imposed by chromatin structure on DNA repair. We model this relationship among transcription, chromatin state, and DNA repair, revealing a new, personalized determinant of cancer risk.
Nobile, C.; Romeo, G.
A method for partial digestion of total human DNA with restriction enzymes has been developed on the basis of a principle already utilized by P.A. Whittaker and E. Southern for the analysis of phage lambda recombinants. Total human DNA irradiated with uv light of 254 nm is partially digested by restriction enzymes that recognize sequences containing adjacent thymidines because of TT dimer formation. The products resulting from partial digestion of specific genomic regions are detected in Southern blots by genomic-unique DNA probes with high reproducibility. This procedure is rapid and simple to perform because the same conditions of uv irradiation are used for different enzymes and probes. It is shown that restriction site polymorphisms occurring in the genomic regions analyzed are recognized by the allelic partial digest patterns they determine
Full Text Available Coronary artery disease (CAD is a leading cause of death world-wide, and most cases have a complex, multifactorial aetiology that includes a substantial heritable component. Identification of new genes involved in CAD may inform pathogenesis and provide new therapeutic targets. The PROCARDIS study recruited 2,658 affected sibling pairs (ASPs with onset of CAD before age 66 y from four European countries to map susceptibility loci for CAD. ASPs were defined as having CAD phenotype if both had CAD, or myocardial infarction (MI phenotype if both had a MI. In a first study, involving a genome-wide linkage screen, tentative loci were mapped to Chromosomes 3 and 11 with the CAD phenotype (1,464 ASPs, and to Chromosome 17 with the MI phenotype (739 ASPs. In a second study, these loci were examined with a dense panel of grid-tightening markers in an independent set of families (1,194 CAD and 344 MI ASPs. This replication study showed a significant result on Chromosome 17 (MI phenotype; p = 0.009 after adjustment for three independent replication tests. An exclusion analysis suggests that further genes of effect size lambda(sib > 1.24 are unlikely to exist in these populations of European ancestry. To our knowledge, this is the first genome-wide linkage analysis to map, and replicate, a CAD locus. The region on Chromosome 17 provides a compelling target within which to identify novel genes underlying CAD. Understanding the genetic aetiology of CAD may lead to novel preventative and/or therapeutic strategies.
Full Text Available Most of the previously reported loci for total immunoglobulin E (IgE levels are related to Th2 cell-dependent pathways. We undertook a genome-wide association study (GWAS to identify genetic loci responsible for IgE regulation. A total of 479,940 single nucleotide polymorphisms (SNPs were tested for association with total serum IgE levels in 1180 Japanese adults. Fine-mapping with SNP imputation demonstrated 6 candidate regions: the PYHIN1/IFI16, MHC classes I and II, LEMD2, GRAMD1B, and chr13∶60576338 regions. Replication of these candidate loci in each region was assessed in 2 independent Japanese cohorts (n = 1110 and 1364, respectively. SNP rs3130941 in the HLA-C region was consistently associated with total IgE levels in 3 independent populations, and the meta-analysis yielded genome-wide significance (P = 1.07×10(-10. Using our GWAS results, we also assessed the reproducibility of previously reported gene associations with total IgE levels. Nine of 32 candidate genes identified by a literature search were associated with total IgE levels after correction for multiple testing. Our findings demonstrate that SNPs in the HLA-C region are strongly associated with total serum IgE levels in the Japanese population and that some of the previously reported genetic associations are replicated across ethnic groups.
Sud, Amit; Thomsen, Hauke; Law, Philip J.
Several susceptibility loci for classical Hodgkin lymphoma have been reported. However, much of the heritable risk is unknown. Here, we perform a meta-analysis of two existing genome-wide association studies, a new genome-wide association study, and replication totalling 5,314 cases and 16,749 co...
Sud, A. (Amit); Thomsen, H. (Hauke); Law, P.J. (Philip J.); A. Försti (Asta); Filho, M.I.D.S. (Miguel Inacio Da Silva); Holroyd, A. (Amy); P. Broderick (Peter); Orlando, G. (Giulia); Lenive, O. (Oleg); Wright, L. (Lauren); R. Cooke (Rosie); D.F. Easton (Douglas); P.D.P. Pharoah (Paul); A.M. Dunning (Alison); J. Peto (Julian); F. Canzian (Federico); Eeles, R. (Rosalind); Z. Kote-Jarai; K.R. Muir (K.); Pashayan, N. (Nora); B.E. Henderson (Brian); C.A. Haiman (Christopher); S. Benlloch (Sara); F.R. Schumacher (Fredrick R); Olama, A.A.A. (Ali Amin Al); S.I. Berndt (Sonja); G. Conti (Giario); F. Wiklund (Fredrik); S.J. Chanock (Stephen); Stevens, V.L. (Victoria L.); C.M. Tangen (Catherine M.); Batra, J. (Jyotsna); Clements, J. (Judith); H. Grönberg (Henrik); Schleutker, J. (Johanna); D. Albanes (Demetrius); Weinstein, S. (Stephanie); K. Wolk (Kerstin); West, C. (Catharine); Mucci, L. (Lorelei); Cancel-Tassin, G. (Géraldine); Koutros, S. (Stella); Sorensen, K.D. (Karina Dalsgaard); L. Maehle; D. Neal (David); S.P.L. Travis (Simon); Hamilton, R.J. (Robert J.); S.A. Ingles (Sue); B.S. Rosenstein (Barry S.); Lu, Y.-J. (Yong-Jie); Giles, G.G. (Graham G.); A. Kibel (Adam); Vega, A. (Ana); M. Kogevinas (Manolis); Penney, K.L. (Kathryn L.); Park, J.Y. (Jong Y.); Stanford, J.L. (Janet L.); C. Cybulski (Cezary); B.G. Nordestgaard (Børge); Brenner, H. (Hermann); Maier, C. (Christiane); Kim, J. (Jeri); E.M. John (Esther); P.J. Teixeira; Neuhausen, S.L. (Susan L.); De Ruyck, K. (Kim); Razack, A. (Azad); Newcomb, L.F. (Lisa F.); Lessel, D. (Davor); Kaneva, R. (Radka); N. Usmani (Nawaid); F. Claessens; Townsend, P.A. (Paul A.); Dominguez, M.G. (Manuela Gago); Roobol, M.J. (Monique J.); F. Menegaux (Florence); P. Hoffmann (Per); M.M. Nöthen (Markus); K.-H. JöCkel (Karl-Heinz); Strandmann, E.P.V. (Elke Pogge Von); Lightfoot, T. (Tracy); Kane, E. (Eleanor); Roman, E. (Eve); Lake, A. (Annette); Montgomery, D. (Dorothy); Jarrett, R.F. (Ruth F.); A.J. Swerdlow (Anthony ); A. Engert (Andreas); N. Orr (Nick); K. Hemminki (Kari); Houlston, R.S. (Richard S.)
textabstractSeveral susceptibility loci for classical Hodgkin lymphoma have been reported. However, much of the heritable risk is unknown. Here, we perform a meta-analysis of two existing genome-wide association studies, a new genome-wide association study, and replication totalling 5,314 cases and
Wang, Kai; Yuen, Siu Tsan; Xu, Jiangchun; Lee, Siu Po; Yan, Helen H N; Shi, Stephanie T; Siu, Hoi Cheong; Deng, Shibing; Chu, Kent Man; Law, Simon; Chan, Kok Hoe; Chan, Annie S Y; Tsui, Wai Yin; Ho, Siu Lun; Chan, Anthony K W; Man, Jonathan L K; Foglizzo, Valentina; Ng, Man Kin; Chan, April S; Ching, Yick Pang; Cheng, Grace H W; Xie, Tao; Fernandez, Julio; Li, Vivian S W; Clevers, Hans; Rejto, Paul A; Mao, Mao; Leung, Suet Yi
Gastric cancer is a heterogeneous disease with diverse molecular and histological subtypes. We performed whole-genome sequencing in 100 tumor-normal pairs, along with DNA copy number, gene expression and methylation profiling, for integrative genomic analysis. We found subtype-specific genetic and
Marcos De Donato
Full Text Available KCNQ1OT1 is located in the region with the highest number of genes showing genomic imprinting, but the mechanisms controlling the genes under its influence have not been fully elucidated. Therefore, we conducted a comparative analysis of the KCNQ1/KCNQ1OT1-CDKN1C region to study its conservation across the best assembled eutherian mammalian genomes sequenced to date and analyzed potential elements that may be implicated in the control of genomic imprinting in this region. The genomic features in these regions from human, mouse, cattle, and dog show a higher number of genes and CpG islands (detected using cpgplot from EMBOSS, but lower number of repetitive elements (including short interspersed nuclear elements and long interspersed nuclear elements, compared with their whole chromosomes (detected by RepeatMasker. The KCNQ1OT1-CDKN1C region contains the highest number of conserved noncoding sequences (CNS among mammals, where we found 16 regions containing about 38 different highly conserved repetitive elements (using mVista, such as LINE1 elements: L1M4, L1MB7, HAL1, L1M4a, L1Med, and an LTR element: MLT1H. From these elements, we found 74 CNS showing high sequence identity (>70% between human, cattle, and mouse, from which we identified 13 motifs (using Multiple Em for Motif Elicitation/Motif Alignment and Search Tool with a significant probability of occurrence, 3 of which were the most frequent and were used to find transcription factor–binding sites. We detected several transcription factors (using JASPAR suite from the families SOX, FOX, and GATA. A phylogenetic analysis of these CNS from human, marmoset, mouse, rat, cattle, dog, horse, and elephant shows branches with high levels of support and very similar phylogenetic relationships among these groups, confirming previous reports. Our results suggest that functional DNA elements identified by comparative genomics in a region densely populated with imprinted mammalian genes may be
Klein, Alison P; Wolpin, Brian M; Risch, Harvey A; Stolzenberg-Solomon, Rachael Z; Mocci, Evelina; Zhang, Mingfeng; Canzian, Federico; Childs, Erica J; Hoskins, Jason W; Jermusyk, Ashley; Zhong, Jun; Chen, Fei; Albanes, Demetrius; Andreotti, Gabriella; Arslan, Alan A; Babic, Ana; Bamlet, William R; Beane-Freeman, Laura; Berndt, Sonja I; Blackford, Amanda; Borges, Michael; Borgida, Ayelet; Bracci, Paige M; Brais, Lauren; Brennan, Paul; Brenner, Hermann; Bueno-de-Mesquita, Bas; Buring, Julie; Campa, Daniele; Capurso, Gabriele; Cavestro, Giulia Martina; Chaffee, Kari G; Chung, Charles C; Cleary, Sean; Cotterchio, Michelle; Dijk, Frederike; Duell, Eric J; Foretova, Lenka; Fuchs, Charles; Funel, Niccola; Gallinger, Steven; M Gaziano, J Michael; Gazouli, Maria; Giles, Graham G; Giovannucci, Edward; Goggins, Michael; Goodman, Gary E; Goodman, Phyllis J; Hackert, Thilo; Haiman, Christopher; Hartge, Patricia; Hasan, Manal; Hegyi, Peter; Helzlsouer, Kathy J; Herman, Joseph; Holcatova, Ivana; Holly, Elizabeth A; Hoover, Robert; Hung, Rayjean J; Jacobs, Eric J; Jamroziak, Krzysztof; Janout, Vladimir; Kaaks, Rudolf; Khaw, Kay-Tee; Klein, Eric A; Kogevinas, Manolis; Kooperberg, Charles; Kulke, Matthew H; Kupcinskas, Juozas; Kurtz, Robert J; Laheru, Daniel; Landi, Stefano; Lawlor, Rita T; Lee, I-Min; LeMarchand, Loic; Lu, Lingeng; Malats, Núria; Mambrini, Andrea; Mannisto, Satu; Milne, Roger L; Mohelníková-Duchoňová, Beatrice; Neale, Rachel E; Neoptolemos, John P; Oberg, Ann L; Olson, Sara H; Orlow, Irene; Pasquali, Claudio; Patel, Alpa V; Peters, Ulrike; Pezzilli, Raffaele; Porta, Miquel; Real, Francisco X; Rothman, Nathaniel; Scelo, Ghislaine; Sesso, Howard D; Severi, Gianluca; Shu, Xiao-Ou; Silverman, Debra; Smith, Jill P; Soucek, Pavel; Sund, Malin; Talar-Wojnarowska, Renata; Tavano, Francesca; Thornquist, Mark D; Tobias, Geoffrey S; Van Den Eeden, Stephen K; Vashist, Yogesh; Visvanathan, Kala; Vodicka, Pavel; Wactawski-Wende, Jean; Wang, Zhaoming; Wentzensen, Nicolas; White, Emily; Yu, Herbert; Yu, Kai; Zeleniuch-Jacquotte, Anne; Zheng, Wei; Kraft, Peter; Li, Donghui; Chanock, Stephen; Obazee, Ofure; Petersen, Gloria M; Amundadottir, Laufey T
In 2020, 146,063 deaths due to pancreatic cancer are estimated to occur in Europe and the United States combined. To identify common susceptibility alleles, we performed the largest pancreatic cancer GWAS to date, including 9040 patients and 12,496 controls of European ancestry from the Pancreatic Cancer Cohort Consortium (PanScan) and the Pancreatic Cancer Case-Control Consortium (PanC4). Here, we find significant evidence of a novel association at rs78417682 (7p12/TNS3, P = 4.35 × 10 -8 ). Replication of 10 promising signals in up to 2737 patients and 4752 controls from the PANcreatic Disease ReseArch (PANDoRA) consortium yields new genome-wide significant loci: rs13303010 at 1p36.33 (NOC2L, P = 8.36 × 10 -14 ), rs2941471 at 8q21.11 (HNF4G, P = 6.60 × 10 -10 ), rs4795218 at 17q12 (HNF1B, P = 1.32 × 10 -8 ), and rs1517037 at 18q21.32 (GRP, P = 3.28 × 10 -8 ). rs78417682 is not statistically significantly associated with pancreatic cancer in PANDoRA. Expression quantitative trait locus analysis in three independent pancreatic data sets provides molecular support of NOC2L as a pancreatic cancer susceptibility gene.
Full Text Available The pathogenesis of dengue hemorrhagic fever (DHF, following dengue virus (DENV infection, is a complex and poorly understood phenomenon. In view of the clinical need of identifying patients with higher likelihood of developing this severe outcome, we undertook a comparative genome-wide association analysis of epitope variants from sequences available in the ViPR database that have been reported to be differentially related to dengue fever and DHF. Having enumerated the incriminated epitope variants, we determined the corresponding HLA alleles in the context of which DENV infection could potentially precipitate DHF. Our analysis considered the development of DHF in three different perspectives: (a as a consequence of primary DENV infection, (b following secondary DENV infection with a heterologous serotype, (c as a result of DENV infection following infection with related flaviviruses like Zika virus, Japanese Encephalitis virus, West Nile virus, etc. Subject to experimental validation, these viral and host markers would be valuable in triaging DENV-infected patients for closer supervision owing to the relatively higher risk of poor prognostic outcome and also for the judicious allocation of scarce institutional resources during large outbreaks.
Garcia-Closas, Montserrat; Couch, Fergus J; Lindstrom, Sara; Michailidou, Kyriaki; Schmidt, Marjanka K; Brook, Mark N; orr, Nick; Rhie, Suhn Kyong; Riboli, Elio; Feigelson, Heather s; Le Marchand, Loic; Buring, Julie E; Eccles, Diana; Miron, Penelope; Fasching, Peter A; Brauch, Hiltrud; Chang-Claude, Jenny; Carpenter, Jane; Godwin, Andrew K; Nevanlinna, Heli; Giles, Graham G; Cox, Angela; Hopper, John L; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Dicks, Ed; Howat, Will J; Schoof, Nils; Bojesen, Stig E; Lambrechts, Diether; Broeks, Annegien; Andrulis, Irene L; Guénel, Pascal; Burwinkel, Barbara; Sawyer, Elinor J; Hollestelle, Antoinette; Fletcher, Olivia; Winqvist, Robert; Brenner, Hermann; Mannermaa, Arto; Hamann, Ute; Meindl, Alfons; Lindblom, Annika; Zheng, Wei; Devillee, Peter; Goldberg, Mark S; Lubinski, Jan; Kristensen, Vessela; Swerdlow, Anthony; Anton-Culver, Hoda; Dörk, Thilo; Muir, Kenneth; Matsuo, Keitaro; Wu, Anna H; Radice, Paolo; Teo, Soo Hwang; Shu, Xiao-Ou; Blot, William; Kang, Daehee; Hartman, Mikael; Sangrajrang, Suleeporn; Shen, Chen-Yang; Southey, Melissa C; Park, Daniel J; Hammet, Fleur; Stone, Jennifer; Veer, Laura J Van’t; Rutgers, Emiel J; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Peto, Julian; Schrauder, Michael G; Ekici, Arif B; Beckmann, Matthias W; Silva, Isabel dos Santos; Johnson, Nichola; Warren, Helen; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Marme, Federick; Schneeweiss, Andreas; Sohn, Christof; Truong, Therese; Laurent-Puig, Pierre; Kerbrat, Pierre; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Milne, Roger L; Perez, Jose Ignacio Arias; Menéndez, Primitiva; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Lichtner, Peter; Lochmann, Magdalena; Justenhoven, Christina; Ko, Yon-Dschun; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Greco, Dario; Heikkinen, Tuomas; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Antonenkova, Natalia N; Margolin, Sara; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Balleine, Rosemary; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Neven, Patrick; Dieudonné, Anne-Sophie; Leunen, Karin; Rudolph, Anja; Nickels, Stefan; Flesch-Janys, Dieter; Peterlongo, Paolo; Peissel, Bernard; Bernard, Loris; Olson, Janet E; Wang, Xianshu; Stevens, Kristen; Severi, Gianluca; Baglietto, Laura; Mclean, Catriona; Coetzee, Gerhard A; Feng, Ye; Henderson, Brian E; Schumacher, Fredrick; Bogdanova, Natalia V; Labrèche, France; Dumont, Martine; Yip, Cheng Har; Taib, Nur Aishah Mohd; Cheng, Ching-Yu; Shrubsole, Martha; Long, Jirong; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Kauppila, Saila; knight, Julia A; Glendon, Gord; Mulligan, Anna Marie; Tollenaar, Robertus A E M; Seynaeve, Caroline M; Kriege, Mieke; Hooning, Maartje J; Van den Ouweland, Ans M W; Van Deurzen, Carolien H M; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Balasubramanian, Sabapathy P; Cross, Simon S; Reed, Malcolm W R; Signorello, Lisa; Cai, Qiuyin; Shah, Mitul; Miao, Hui; Chan, Ching Wan; Chia, Kee Seng; Jakubowska, Anna; Jaworska, Katarzyna; Durda, Katarzyna; Hsiung, Chia-Ni; Wu, Pei-Ei; Yu, Jyh-Cherng; Ashworth, Alan; Jones, Michael; Tessier, Daniel C; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Vincent, Daniel; Bacot, Francois; Ambrosone, Christine B; Bandera, Elisa V; John, Esther M; Chen, Gary K; Hu, Jennifer J; Rodriguez-gil, Jorge L; Bernstein, Leslie; Press, Michael F; Ziegler, Regina G; Millikan, Robert M; Deming-Halverson, Sandra L; Nyante, Sarah; Ingles, Sue A; Waisfisz, Quinten; Tsimiklis, Helen; Makalic, Enes; Schmidt, Daniel; Bui, Minh; Gibson, Lorna; Müller-Myhsok, Bertram; Schmutzler, Rita K; Hein, Rebecca; Dahmen, Norbert; Beckmann, Lars; Aaltonen, Kirsimari; Czene, Kamila; Irwanto, Astrid; Liu, Jianjun; Turnbull, Clare; Rahman, Nazneen; Meijers-Heijboer, Hanne; Uitterlinden, Andre G; Rivadeneira, Fernando; Olswold, Curtis; Slager, Susan; Pilarski, Robert; Ademuyiwa, Foluso; Konstantopoulou, Irene; Martin, Nicholas G; Montgomery, Grant W; Slamon, Dennis J; Rauh, Claudia; Lux, Michael P; Jud, Sebastian M; Bruning, Thomas; Weaver, Joellen; Sharma, Priyanka; Pathak, Harsh; Tapper, Will; Gerty, Sue; Durcan, Lorraine; Trichopoulos, Dimitrios; Tumino, Rosario; Peeters, Petra H; Kaaks, Rudolf; Campa, Daniele; Canzian, Federico; Weiderpass, Elisabete; Johansson, Mattias; Khaw, Kay-Tee; Travis, Ruth; Clavel-Chapelon, Françoise; Kolonel, Laurence N; Chen, Constance; Beck, Andy; Hankinson, Susan E; Berg, Christine D; Hoover, Robert N; Lissowska, Jolanta; Figueroa, Jonine D; Chasman, Daniel I; Gaudet, Mia M; Diver, W Ryan; Willett, Walter C; Hunter, David J; Simard, Jacques; Benitez, Javier; Dunning, Alison M; Sherman, Mark E; Chenevix-Trench, Georgia; Chanock, Stephen J; Hall, Per; Pharoah, Paul D P; Vachon, Celine; Easton, Douglas F; Haiman, Christopher A; Kraft, Peter
Estrogen receptor (ER)-negative tumors represent 20–30% of all breast cancers, with a higher proportion occurring in younger women and women of African ancestry1. The etiology2 and clinical behavior3 of ER-negative tumors are different from those of tumors expressing ER (ER positive), including differences in genetic predisposition4. To identify susceptibility loci specific to ER-negative disease, we combined in a meta-analysis 3 genome-wide association studies of 4,193 ER-negative breast cancer cases and 35,194 controls with a series of 40 follow-up studies (6,514 cases and 41,455 controls), genotyped using a custom Illumina array, iCOGS, developed by the Collaborative Oncological Gene-environment Study (COGS). SNPs at four loci, 1q32.1 (MDM4, P = 2.1 × 10−12 and LGR6, P = 1.4 × 10−8), 2p24.1 (P = 4.6 × 10−8) and 16q12.2 (FTO, P = 4.0 × 10−8), were associated with ER-negative but not ER-positive breast cancer (P > 0.05). These findings provide further evidence for distinct etiological pathways associated with invasive ER-positive and ER-negative breast cancers. PMID:23535733
Full Text Available Glioblastoma Multiforme (GBM cells are highly invasive, infiltrating into the surrounding normal brain tissue, making it impossible to completely eradicate GBM tumors by surgery or radiation. Increasing evidence also shows that these migratory cells are highly resistant to cytotoxic reagents, but decreasing their migratory capability can re-sensitize them to chemotherapy. These evidences suggest that the migratory cell population may serve as a better therapeutic target for more effective treatment of GBM. In order to understand the regulatory mechanism underlying the motile phenotype, we carried out a genome-wide RNAi screen for genes inhibiting the migration of GBM cells. The screening identified a total of twenty-five primary hits; seven of them were confirmed by secondary screening. Further study showed that three of the genes, FLNA, KHSRP and HCFC1, also functioned in vivo, and knocking them down caused multifocal tumor in a mouse model. Interestingly, two genes, KHSRP and HCFC1, were also found to be correlated with the clinical outcome of GBM patients. These two genes have not been previously associated with cell migration.
Patxi San Martin-Uriz
Full Text Available Acidiphilium spp. are conspicuous dwellers of acidic, metal-rich environments. Indeed, they are among the most metal-resistant organisms; yet little is known about the mechanisms behind the metal tolerance in this genus. Acidiphilium sp. PM is an environmental isolate from Rio Tinto, an acidic, metal-laden river located in southwestern Spain. The characterization of its metal resistance revealed a remarkable ability to tolerate high Ni concentrations. Here we report the screening of a genomic library of Acidiphilium sp. PM to identify genes involved in Ni resistance. This approach revealed seven different genes conferring Ni resistance to E. coli, two of which form an operon encoding the ATP-dependent protease HslVU (ClpQY. This protease was found to enhance resistance to both Ni and Co in E. coli, a function not previously reported. Other Ni-resistance determinants include genes involved in lipopolysaccharide biosynthesis and the synthesis of branched amino acids. The diversity of molecular functions of the genes recovered in the screening suggests that Ni resistance in Acidiphilium sp. PM probably relies on different molecular mechanisms.
De Martino, Andrea; De Martino, Daniele; Mulet, Roberto; Pagnani, Andrea
The stoichiometry of a metabolic network gives rise to a set of conservation laws for the aggregate level of specific pools of metabolites, which, on one hand, pose dynamical constraints that cross-link the variations of metabolite concentrations and, on the other, provide key insight into a cell's metabolic production capabilities. When the conserved quantity identifies with a chemical moiety, extracting all such conservation laws from the stoichiometry amounts to finding all non-negative integer solutions of a linear system, a programming problem known to be NP-hard. We present an efficient strategy to compute the complete set of integer conservation laws of a genome-scale stoichiometric matrix, also providing a certificate for correctness and maximality of the solution. Our method is deployed for the analysis of moiety conservation relationships in two large-scale reconstructions of the metabolism of the bacterium E. coli, in six tissue-specific human metabolic networks, and, finally, in the human reactome as a whole, revealing that bacterial metabolism could be evolutionarily designed to cover broader production spectra than human metabolism. Convergence to the full set of moiety conservation laws in each case is achieved in extremely reduced computing times. In addition, we uncover a scaling relation that links the size of the independent pool basis to the number of metabolites, for which we present an analytical explanation.
Andrea De Martino
Full Text Available The stoichiometry of a metabolic network gives rise to a set of conservation laws for the aggregate level of specific pools of metabolites, which, on one hand, pose dynamical constraints that cross-link the variations of metabolite concentrations and, on the other, provide key insight into a cell's metabolic production capabilities. When the conserved quantity identifies with a chemical moiety, extracting all such conservation laws from the stoichiometry amounts to finding all non-negative integer solutions of a linear system, a programming problem known to be NP-hard. We present an efficient strategy to compute the complete set of integer conservation laws of a genome-scale stoichiometric matrix, also providing a certificate for correctness and maximality of the solution. Our method is deployed for the analysis of moiety conservation relationships in two large-scale reconstructions of the metabolism of the bacterium E. coli, in six tissue-specific human metabolic networks, and, finally, in the human reactome as a whole, revealing that bacterial metabolism could be evolutionarily designed to cover broader production spectra than human metabolism. Convergence to the full set of moiety conservation laws in each case is achieved in extremely reduced computing times. In addition, we uncover a scaling relation that links the size of the independent pool basis to the number of metabolites, for which we present an analytical explanation.
Yuen, Ryan KC; Merico, Daniele; Bookman, Matt; Howe, Jennifer L; Thiruvahindrapuram, Bhooma; Patel, Rohan V; Whitney, Joe; Deflaux, Nicole; Bingham, Jonathan; Wang, Zhuozhi; Pellecchia, Giovanna; Buchanan, Janet A; Walker, Susan; Marshall, Christian R; Uddin, Mohammed; Zarrei, Mehdi; Deneault, Eric; D’Abate, Lia; Chan, Ada JS; Koyanagi, Stephanie; Paton, Tara; Pereira, Sergio L; Hoang, Ny; Engchuan, Worrawat; Higginbotham, Edward J; Ho, Karen; Lamoureux, Sylvia; Li, Weili; MacDonald, Jeffrey R; Nalpathamkalam, Thomas; Sung, Wilson WL; Tsoi, Fiona J; Wei, John; Xu, Lizhen; Tasse, Anne-Marie; Kirby, Emily; Van Etten, William; Twigger, Simon; Roberts, Wendy; Drmic, Irene; Jilderda, Sanne; Modi, Bonnie MacKinnon; Kellam, Barbara; Szego, Michael; Cytrynbaum, Cheryl; Weksberg, Rosanna; Zwaigenbaum, Lonnie; Woodbury-Smith, Marc; Brian, Jessica; Senman, Lili; Iaboni, Alana; Doyle-Thomas, Krissy; Thompson, Ann; Chrysler, Christina; Leef, Jonathan; Savion-Lemieux, Tal; Smith, Isabel M; Liu, Xudong; Nicolson, Rob; Seifer, Vicki; Fedele, Angie; Cook, Edwin H; Dager, Stephen; Estes, Annette; Gallagher, Louise; Malow, Beth A; Parr, Jeremy R; Spence, Sarah J; Vorstman, Jacob; Frey, Brendan J; Robinson, James T; Strug, Lisa J; Fernandez, Bridget A; Elsabbagh, Mayada; Carter, Melissa T; Hallmayer, Joachim; Knoppers, Bartha M; Anagnostou, Evdokia; Szatmari, Peter; Ring, Robert H; Glazer, David; Pletcher, Mathew T; Scherer, Stephen W
We are performing whole genome sequencing (WGS) of families with Autism Spectrum Disorder (ASD) to build a resource, named MSSNG, to enable the sub-categorization of phenotypes and underlying genetic factors involved. Here, we report WGS of 5,205 samples from families with ASD, accompanied by clinical information, creating a database accessible in a cloud platform, and through an internet portal with controlled access. We found an average of 73.8 de novo single nucleotide variants and 12.6 de novo insertion/deletions (indels) or copy number variations (CNVs) per ASD subject. We identified 18 new candidate ASD-risk genes such as MED13 and PHF3, and found that participants bearing mutations in susceptibility genes had significantly lower adaptive ability (p=6×10−4). In 294/2,620 (11.2%) of ASD cases, a molecular basis could be determined and 7.2% of these carried CNV/chromosomal abnormalities, emphasizing the importance of detecting all forms of genetic variation as diagnostic and therapeutic targets in ASD. PMID:28263302
Full Text Available Intracellular bacterial pathogens are metabolically adapted to grow within mammalian cells. While these adaptations are fundamental to the ability to cause disease, we know little about the relationship between the pathogen's metabolism and virulence. Here we used an integrative Metabolic Analysis Tool that combines transcriptome data with genome-scale metabolic models to define the metabolic requirements of Listeria monocytogenes during infection. Twelve metabolic pathways were identified as differentially active during L. monocytogenes growth in macrophage cells. Intracellular replication requires de novo synthesis of histidine, arginine, purine, and branch chain amino acids (BCAAs, as well as catabolism of L-rhamnose and glycerol. The importance of each metabolic pathway during infection was confirmed by generation of gene knockout mutants in the respective pathways. Next, we investigated the association of these metabolic requirements in the regulation of L. monocytogenes virulence. Here we show that limiting BCAA concentrations, primarily isoleucine, results in robust induction of the master virulence activator gene, prfA, and the PrfA-regulated genes. This response was specific and required the nutrient responsive regulator CodY, which is known to bind isoleucine. Further analysis demonstrated that CodY is involved in prfA regulation, playing a role in prfA activation under limiting conditions of BCAAs. This study evidences an additional regulatory mechanism underlying L. monocytogenes virulence, placing CodY at the crossroads of metabolism and virulence.
Adams, Hieab HH; Hibar, Derrek P; Chouraki, Vincent; Stein, Jason L; Nyquist, Paul A; Rentería, Miguel E; Trompet, Stella; Arias-Vasquez, Alejandro; Seshadri, Sudha; Desrivières, Sylvane; Beecham, Ashley H; Jahanshad, Neda; Wittfeld, Katharina; Van der Lee, Sven J; Abramovic, Lucija; Alhusaini, Saud; Amin, Najaf; Andersson, Micael; Arfanakis, Konstantinos; Aribisala, Benjamin S; Armstrong, Nicola J; Athanasiu, Lavinia; Axelsson, Tomas; Beiser, Alexa; Bernard, Manon; Bis, Joshua C; Blanken, Laura ME; Blanton, Susan H; Bohlken, Marc M; Boks, Marco P; Bralten, Janita; Brickman, Adam M; Carmichael, Owen; Chakravarty, M Mallar; Chauhan, Ganesh; Chen, Qiang; Ching, Christopher RK; Cuellar-Partida, Gabriel; Den Braber, Anouk; Doan, Nhat Trung; Ehrlich, Stefan; Filippi, Irina; Ge, Tian; Giddaluru, Sudheer; Goldman, Aaron L; Gottesman, Rebecca F; Greven, Corina U; Grimm, Oliver; Griswold, Michael E; Guadalupe, Tulio; Hass, Johanna; Haukvik, Unn K; Hilal, Saima; Hofer, Edith; Hoehn, David; Holmes, Avram J; Hoogman, Martine; Janowitz, Deborah; Jia, Tianye; Kasperaviciute, Dalia; Kim, Sungeun; Klein, Marieke; Kraemer, Bernd; Lee, Phil H; Liao, Jiemin; Liewald, David CM; Lopez, Lorna M; Luciano, Michelle; Macare, Christine; Marquand, Andre; Matarin, Mar; Mather, Karen A; Mattheisen, Manuel; Mazoyer, Bernard; McKay, David R; McWhirter, Rebekah; Milaneschi, Yuri; Mirza-Schreiber, Nazanin; Muetzel, Ryan L; Maniega, Susana Muñoz; Nho, Kwangsik; Nugent, Allison C; Olde Loohuis, Loes M; Oosterlaan, Jaap; Papmeyer, Martina; Pappa, Irene; Pirpamer, Lukas; Pudas, Sara; Pütz, Benno; Rajan, Kumar B; Ramasamy, Adaikalavan; Richards, Jennifer S; Risacher, Shannon L; Roiz-Santiañez, Roberto; Rommelse, Nanda; Rose, Emma J; Royle, Natalie A; Rundek, Tatjana; Sämann, Philipp G; Satizabal, Claudia L; Schmaal, Lianne; Schork, Andrew J; Shen, Li; Shin, Jean; Shumskaya, Elena; Smith, Albert V; Sprooten, Emma; Strike, Lachlan T; Teumer, Alexander; Thomson, Russell; Tordesillas-Gutierrez, Diana; Toro, Roberto; Trabzuni, Daniah; Vaidya, Dhananjay; Van der Grond, Jeroen; Van der Meer, Dennis; Van Donkelaar, Marjolein MJ; Van Eijk, Kristel R; Van Erp, Theo GM; Van Rooij, Daan; Walton, Esther; Westlye, Lars T; Whelan, Christopher D; Windham, Beverly G; Winkler, Anderson M; Woldehawariat, Girma; Wolf, Christiane; Wolfers, Thomas; Xu, Bing; Yanek, Lisa R; Yang, Jingyun; Zijdenbos, Alex; Zwiers, Marcel P; Agartz, Ingrid; Aggarwal, Neelum T; Almasy, Laura; Ames, David; Amouyel, Philippe; Andreassen, Ole A; Arepalli, Sampath; Assareh, Amelia A; Barral, Sandra; Bastin, Mark E; Becker, Diane M; Becker, James T; Bennett, David A; Blangero, John; van Bokhoven, Hans; Boomsma, Dorret I; Brodaty, Henry; Brouwer, Rachel M; Brunner, Han G; Buckner, Randy L; Buitelaar, Jan K; Bulayeva, Kazima B; Cahn, Wiepke; Calhoun, Vince D; Cannon, Dara M; Cavalleri, Gianpiero L; Chen, Christopher; Cheng, Ching-Yu; Cichon, Sven; Cookson, Mark R; Corvin, Aiden; Crespo-Facorro, Benedicto; Curran, Joanne E; Czisch, Michael; Dale, Anders M; Davies, Gareth E; De Geus, Eco JC; De Jager, Philip L; de Zubicaray, Greig I; Delanty, Norman; Depondt, Chantal; DeStefano, Anita L; Dillman, Allissa; Djurovic, Srdjan; Donohoe, Gary; Drevets, Wayne C; Duggirala, Ravi; Dyer, Thomas D; Erk, Susanne; Espeseth, Thomas; Evans, Denis A; Fedko, Iryna O; Fernández, Guillén; Ferrucci, Luigi; Fisher, Simon E; Fleischman, Debra A; Ford, Ian; Foroud, Tatiana M; Fox, Peter T; Francks, Clyde; Fukunaga, Masaki; Gibbs, J Raphael; Glahn, David C; Gollub, Randy L; Göring, Harald HH; Grabe, Hans J; Green, Robert C; Gruber, Oliver; Gudnason, Vilmundur; Guelfi, Sebastian; Hansell, Narelle K; Hardy, John; Hartman, Catharina A; Hashimoto, Ryota; Hegenscheid, Katrin; Heinz, Andreas; Le Hellard, Stephanie; Hernandez, Dena G; Heslenfeld, Dirk J; Ho, Beng-Choon; Hoekstra, Pieter J; Hoffmann, Wolfgang; Hofman, Albert; Holsboer, Florian; Homuth, Georg; Hosten, Norbert; Hottenga, Jouke-Jan; Hulshoff Pol, Hilleke E; Ikeda, Masashi; Ikram, M Kamran; Jack, Clifford R; Jenkinson, Mark; Johnson, Robert; Jönsson, Erik G; Jukema, J Wouter; Kahn, René S; Kanai, Ryota; Kloszewska, Iwona; Knopman, David S; Kochunov, Peter; Kwok, John B; Lawrie, Stephen M; Lemaître, Hervé; Liu, Xinmin; Longo, Dan L; Longstreth, WT; Lopez, Oscar L; Lovestone, Simon; Martinez, Oliver; Martinot, Jean-Luc; Mattay, Venkata S; McDonald, Colm; McIntosh, Andrew M; McMahon, Katie L; McMahon, Francis J; Mecocci, Patrizia; Melle, Ingrid; Meyer-Lindenberg, Andreas; Mohnke, Sebastian; Montgomery, Grant W; Morris, Derek W; Mosley, Thomas H; Mühleisen, Thomas W; Müller-Myhsok, Bertram; Nalls, Michael A; Nauck, Matthias; Nichols, Thomas E; Niessen, Wiro J; Nöthen, Markus M; Nyberg, Lars; Ohi, Kazutaka; Olvera, Rene L; Ophoff, Roel A; Pandolfo, Massimo; Paus, Tomas; Pausova, Zdenka; Penninx, Brenda WJH; Pike, G Bruce; Potkin, Steven G; Psaty, Bruce M; Reppermund, Simone; Rietschel, Marcella; Roffman, Joshua L; Romanczuk-Seiferth, Nina; Rotter, Jerome I; Ryten, Mina; Sacco, Ralph L; Sachdev, Perminder S; Saykin, Andrew J; Schmidt, Reinhold; Schofield, Peter R; Sigurdsson, Sigurdur; Simmons, Andy; Singleton, Andrew; Sisodiya, Sanjay M; Smith, Colin; Smoller, Jordan W; Soininen, Hilkka; Srikanth, Velandai; Steen, Vidar M; Stott, David J; Sussmann, Jessika E; Thalamuthu, Anbupalam; Tiemeier, Henning; Toga, Arthur W; Traynor, Bryan J; Troncoso, Juan; Turner, Jessica A; Tzourio, Christophe; Uitterlinden, Andre G; Valdés Hernández, Maria C; Van der Brug, Marcel; Van der Lugt, Aad; Van der Wee, Nic JA; Van Duijn, Cornelia M; Van Haren, Neeltje EM; Van 't Ent, Dennis; Van Tol, Marie-Jose; Vardarajan, Badri N; Veltman, Dick J; Vernooij, Meike W; Völzke, Henry; Walter, Henrik; Wardlaw, Joanna M; Wassink, Thomas H; Weale, Michael E; Weinberger, Daniel R; Weiner, Michael W; Wen, Wei; Westman, Eric; White, Tonya; Wong, Tien Y; Wright, Clinton B; Zielke, H Ronald; Zonderman, Alan B; Deary, Ian J; DeCarli, Charles; Schmidt, Helena; Martin, Nicholas G; De Craen, Anton JM; Wright, Margaret J; Launer, Lenore J; Schumann, Gunter; Fornage, Myriam; Franke, Barbara; Debette, Stéphanie; Medland, Sarah E; Ikram, M Arfan; Thompson, Paul M
Intracranial volume reflects the maximally attained brain size during development, and remains stable with loss of tissue in late life. It is highly heritable, but the underlying genes remain largely undetermined. In a genome-wide association study of 32,438 adults, we discovered five novel loci for intracranial volume and confirmed two known signals. Four of the loci are also associated with adult human stature, but these remained associated with intracranial volume after adjusting for height. We found a high genetic correlation with child head circumference (ρgenetic=0.748), which indicated a similar genetic background and allowed for the identification of four additional loci through meta-analysis (Ncombined = 37,345). Variants for intracranial volume were also related to childhood and adult cognitive function, Parkinson’s disease, and enriched near genes involved in growth pathways including PI3K–AKT signaling. These findings identify biological underpinnings of intracranial volume and provide genetic support for theories on brain reserve and brain overgrowth. PMID:27694991
Edifizi, Diletta; Schumacher, Björn
DNA damage causally contributes to aging and age-related diseases. The declining functioning of tissues and organs during aging can lead to the increased risk of succumbing to aging-associated diseases. Congenital syndromes that are caused by heritable mutations in DNA repair pathways lead to cancer susceptibility and accelerated aging, thus underlining the importance of genome maintenance for withstanding aging. High-throughput mass-spectrometry-based approaches have recently contributed to identifying signalling response networks and gaining a more comprehensive understanding of the physiological adaptations occurring upon unrepaired DNA damage. The insulin-like signalling pathway has been implicated in a DNA damage response (DDR) network that includes epidermal growth factor (EGF)-, AMP-activated protein kinases (AMPK)- and the target of rapamycin (TOR)-like signalling pathways, which are known regulators of growth, metabolism, and stress responses. The same pathways, together with the autophagy-mediated proteostatic response and the decline in energy metabolism have also been found to be similarly regulated during natural aging, suggesting striking parallels in the physiological adaptation upon persistent DNA damage due to DNA repair defects and long-term low-level DNA damage accumulation occurring during natural aging. These insights will be an important starting point to study the interplay between signalling networks involved in progeroid syndromes that are caused by DNA repair deficiencies and to gain new understanding of the consequences of DNA damage in the aging process.
Yaw Shin Ooi
Full Text Available The enveloped alphaviruses include important and emerging human pathogens such as Chikungunya virus and Eastern equine encephalitis virus. Alphaviruses enter cells by clathrin-mediated endocytosis, and exit by budding from the plasma membrane. While there has been considerable progress in defining the structure and function of the viral proteins, relatively little is known about the host factors involved in alphavirus infection. We used a genome-wide siRNA screen to identify host factors that promote or inhibit alphavirus infection in human cells. Fuzzy homologue (FUZ, a protein with reported roles in planar cell polarity and cilia biogenesis, was required for the clathrin-dependent internalization of both alphaviruses and the classical endocytic ligand transferrin. The tetraspanin membrane protein TSPAN9 was critical for the efficient fusion of low pH-triggered virus with the endosome membrane. FUZ and TSPAN9 were broadly required for infection by the alphaviruses Sindbis virus, Semliki Forest virus, and Chikungunya virus, but were not required by the structurally-related flavivirus Dengue virus. Our results highlight the unanticipated functions of FUZ and TSPAN9 in distinct steps of alphavirus entry and suggest novel host proteins that may serve as targets for antiviral therapy.
Full Text Available Pseudomonas aeruginosa is a human opportunistic pathogen that causes mortality in cystic fibrosis and immunocompromised patients. While many virulence factors of this pathogen have already been identified, several remain to be discovered. In this respect we set an unprecedented genome-wide screen of a P. aeruginosa expression library based on a yeast growth phenotype. 51 candidates were selected in a three-round screening process. The robustness of the screen was validated by the selection of three well known secreted proteins including one demonstrated virulence factor, the protease LepA. Further in silico sorting of the 51 candidates highlighted three potential new Pseudomonas effector candidates (Pec. By testing the cytotoxicity of wild type P. aeruginosa vs pec mutants towards macrophages and the virulence in the Caenorhabditis elegans model, we demonstrated that the three selected Pecs are novel virulence factors of P. aeruginosa. Additional cellular localization experiments in the host revealed specific localization for Pec1 and Pec2 that could inform about their respective functions.
Allan, Kristina J; Mahoney, Douglas J; Baird, Stephen D; Lefebvre, Charles A; Stojdl, David F
High-throughput genome-wide RNAi (RNA interference) screening technology has been widely used for discovering host factors that impact virus replication. Here we present the application of this technology to uncovering host targets that specifically modulate the replication of Maraba virus, an oncolytic rhabdovirus, and vaccinia virus with the goal of enhancing therapy. While the protocol has been tested for use with oncolytic Maraba virus and oncolytic vaccinia virus, this approach is applicable to other oncolytic viruses and can also be utilized for identifying host targets that modulate virus replication in mammalian cells in general. This protocol describes the development and validation of an assay for high-throughput RNAi screening in mammalian cells, the key considerations and preparation steps important for conducting a primary high-throughput RNAi screen, and a step-by-step guide for conducting a primary high-throughput RNAi screen; in addition, it broadly outlines the methods for conducting secondary screen validation and tertiary validation studies. The benefit of high-throughput RNAi screening is that it allows one to catalogue, in an extensive and unbiased fashion, host factors that modulate any aspect of virus replication for which one can develop an in vitro assay such as infectivity, burst size, and cytotoxicity. It has the power to uncover biotherapeutic targets unforeseen based on current knowledge.
Garcia-Closas, Montserrat; Couch, Fergus J; Lindstrom, Sara; Michailidou, Kyriaki; Schmidt, Marjanka K; Brook, Mark N; Orr, Nick; Rhie, Suhn Kyong; Riboli, Elio; Feigelson, Heather S; Le Marchand, Loic; Buring, Julie E; Eccles, Diana; Miron, Penelope; Fasching, Peter A; Brauch, Hiltrud; Chang-Claude, Jenny; Carpenter, Jane; Godwin, Andrew K; Nevanlinna, Heli; Giles, Graham G; Cox, Angela; Hopper, John L; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Dicks, Ed; Howat, Will J; Schoof, Nils; Bojesen, Stig E; Lambrechts, Diether; Broeks, Annegien; Andrulis, Irene L; Guénel, Pascal; Burwinkel, Barbara; Sawyer, Elinor J; Hollestelle, Antoinette; Fletcher, Olivia; Winqvist, Robert; Brenner, Hermann; Mannermaa, Arto; Hamann, Ute; Meindl, Alfons; Lindblom, Annika; Zheng, Wei; Devillee, Peter; Goldberg, Mark S; Lubinski, Jan; Kristensen, Vessela; Swerdlow, Anthony; Anton-Culver, Hoda; Dörk, Thilo; Muir, Kenneth; Matsuo, Keitaro; Wu, Anna H; Radice, Paolo; Teo, Soo Hwang; Shu, Xiao-Ou; Blot, William; Kang, Daehee; Hartman, Mikael; Sangrajrang, Suleeporn; Shen, Chen-Yang; Southey, Melissa C; Park, Daniel J; Hammet, Fleur; Stone, Jennifer; Veer, Laura J Van't; Rutgers, Emiel J; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Peto, Julian; Schrauder, Michael G; Ekici, Arif B; Beckmann, Matthias W; Dos Santos Silva, Isabel; Johnson, Nichola; Warren, Helen; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Marme, Federick; Schneeweiss, Andreas; Sohn, Christof; Truong, Therese; Laurent-Puig, Pierre; Kerbrat, Pierre; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Milne, Roger L; Perez, Jose Ignacio Arias; Menéndez, Primitiva; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Lichtner, Peter; Lochmann, Magdalena; Justenhoven, Christina; Ko, Yon-Dschun; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Greco, Dario; Heikkinen, Tuomas; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Antonenkova, Natalia N; Margolin, Sara; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Balleine, Rosemary; Tseng, Chiu-Chen; Berg, David Van Den; Stram, Daniel O; Neven, Patrick; Dieudonné, Anne-Sophie; Leunen, Karin; Rudolph, Anja; Nickels, Stefan; Flesch-Janys, Dieter; Peterlongo, Paolo; Peissel, Bernard; Bernard, Loris; Olson, Janet E; Wang, Xianshu; Stevens, Kristen; Severi, Gianluca; Baglietto, Laura; McLean, Catriona; Coetzee, Gerhard A; Feng, Ye; Henderson, Brian E; Schumacher, Fredrick; Bogdanova, Natalia V; Labrèche, France; Dumont, Martine; Yip, Cheng Har; Taib, Nur Aishah Mohd; Cheng, Ching-Yu; Shrubsole, Martha; Long, Jirong; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Kauppila, Saila; Knight, Julia A; Glendon, Gord; Mulligan, Anna Marie; Tollenaar, Robertus A E M; Seynaeve, Caroline M; Kriege, Mieke; Hooning, Maartje J; van den Ouweland, Ans M W; van Deurzen, Carolien H M; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Balasubramanian, Sabapathy P; Cross, Simon S; Reed, Malcolm W R; Signorello, Lisa; Cai, Qiuyin; Shah, Mitul; Miao, Hui; Chan, Ching Wan; Chia, Kee Seng; Jakubowska, Anna; Jaworska, Katarzyna; Durda, Katarzyna; Hsiung, Chia-Ni; Wu, Pei-Ei; Yu, Jyh-Cherng; Ashworth, Alan; Jones, Michael; Tessier, Daniel C; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Vincent, Daniel; Bacot, Francois; Ambrosone, Christine B; Bandera, Elisa V; John, Esther M; Chen, Gary K; Hu, Jennifer J; Rodriguez-Gil, Jorge L; Bernstein, Leslie; Press, Michael F; Ziegler, Regina G; Millikan, Robert M; Deming-Halverson, Sandra L; Nyante, Sarah; Ingles, Sue A; Waisfisz, Quinten; Tsimiklis, Helen; Makalic, Enes; Schmidt, Daniel; Bui, Minh; Gibson, Lorna; Müller-Myhsok, Bertram; Schmutzler, Rita K; Hein, Rebecca; Dahmen, Norbert; Beckmann, Lars; Aaltonen, Kirsimari; Czene, Kamila; Irwanto, Astrid; Liu, Jianjun; Turnbull, Clare; Rahman, Nazneen; Meijers-Heijboer, Hanne; Uitterlinden, Andre G; Rivadeneira, Fernando; Olswold, Curtis; Slager, Susan; Pilarski, Robert; Ademuyiwa, Foluso; Konstantopoulou, Irene; Martin, Nicholas G; Montgomery, Grant W; Slamon, Dennis J; Rauh, Claudia; Lux, Michael P; Jud, Sebastian M; Bruning, Thomas; Weaver, Joellen; Sharma, Priyanka; Pathak, Harsh; Tapper, Will; Gerty, Sue; Durcan, Lorraine; Trichopoulos, Dimitrios; Tumino, Rosario; Peeters, Petra H; Kaaks, Rudolf; Campa, Daniele; Canzian, Federico; Weiderpass, Elisabete; Johansson, Mattias; Khaw, Kay-Tee; Travis, Ruth; Clavel-Chapelon, Françoise; Kolonel, Laurence N; Chen, Constance; Beck, Andy; Hankinson, Susan E; Berg, Christine D; Hoover, Robert N; Lissowska, Jolanta; Figueroa, Jonine D; Chasman, Daniel I; Gaudet, Mia M; Diver, W Ryan; Willett, Walter C; Hunter, David J; Simard, Jacques; Benitez, Javier; Dunning, Alison M; Sherman, Mark E; Chenevix-Trench, Georgia; Chanock, Stephen J; Hall, Per; Pharoah, Paul D P; Vachon, Celine; Easton, Douglas F; Haiman, Christopher A; Kraft, Peter
Estrogen receptor (ER)-negative tumors represent 20-30% of all breast cancers, with a higher proportion occurring in younger women and women of African ancestry. The etiology and clinical behavior of ER-negative tumors are different from those of tumors expressing ER (ER positive), including differences in genetic predisposition. To identify susceptibility loci specific to ER-negative disease, we combined in a meta-analysis 3 genome-wide association studies of 4,193 ER-negative breast cancer cases and 35,194 controls with a series of 40 follow-up studies (6,514 cases and 41,455 controls), genotyped using a custom Illumina array, iCOGS, developed by the Collaborative Oncological Gene-environment Study (COGS). SNPs at four loci, 1q32.1 (MDM4, P = 2.1 × 10(-12) and LGR6, P = 1.4 × 10(-8)), 2p24.1 (P = 4.6 × 10(-8)) and 16q12.2 (FTO, P = 4.0 × 10(-8)), were associated with ER-negative but not ER-positive breast cancer (P > 0.05). These findings provide further evidence for distinct etiological pathways associated with invasive ER-positive and ER-negative breast cancers.
Swindell, William R.; Johnston, Andrew; Carbajal, Steve; Han, Gangwen; Wohn, Christian; Lu, Jun; Xing, Xianying; Nair, Rajan P.; Voorhees, John J.; Elder, James T.; Wang, Xiao-Jing; Sano, Shigetoshi; Prens, Errol P.; DiGiovanni, John; Pittelkow, Mark R.; Ward, Nicole L.; Gudjonsson, Johann E.
Development of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features of the human disease, and standardized validation criteria for psoriasis mouse models have not been widely applied. In this study, whole-genome transcriptional profiling is used to compare gene expression patterns manifested by human psoriatic skin lesions with those that occur in five psoriasis mouse models (K5-Tie2, imiquimod, K14-AREG, K5-Stat3C and K5-TGFbeta1). While the cutaneous gene expression profiles associated with each mouse phenotype exhibited statistically significant similarity to the expression profile of psoriasis in humans, each model displayed distinctive sets of similarities and differences in comparison to human psoriasis. For all five models, correspondence to the human disease was strong with respect to genes involved in epidermal development and keratinization. Immune and inflammation-associated gene expression, in contrast, was more variable between models as compared to the human disease. These findings support the value of all five models as research tools, each with identifiable areas of convergence to and divergence from the human disease. Additionally, the approach used in this paper provides an objective and quantitative method for evaluation of proposed mouse models of psoriasis, which can be strategically applied in future studies to score strengths of mouse phenotypes relative to specific aspects of human psoriasis. PMID:21483750
Full Text Available DNA damage causally contributes to aging and age-related diseases. The declining functioning of tissues and organs during aging can lead to the increased risk of succumbing to aging-associated diseases. Congenital syndromes that are caused by heritable mutations in DNA repair pathways lead to cancer susceptibility and accelerated aging, thus underlining the importance of genome maintenance for withstanding aging. High-throughput mass-spectrometry-based approaches have recently contributed to identifying signalling response networks and gaining a more comprehensive understanding of the physiological adaptations occurring upon unrepaired DNA damage. The insulin-like signalling pathway has been implicated in a DNA damage response (DDR network that includes epidermal growth factor (EGF-, AMP-activated protein kinases (AMPK- and the target of rapamycin (TOR-like signalling pathways, which are known regulators of growth, metabolism, and stress responses. The same pathways, together with the autophagy-mediated proteostatic response and the decline in energy metabolism have also been found to be similarly regulated during natural aging, suggesting striking parallels in the physiological adaptation upon persistent DNA damage due to DNA repair defects and long-term low-level DNA damage accumulation occurring during natural aging. These insights will be an important starting point to study the interplay between signalling networks involved in progeroid syndromes that are caused by DNA repair deficiencies and to gain new understanding of the consequences of DNA damage in the aging process.
Composite interval mapping identified a total of three. QTLs on linkage ..... Soybean seeds decline in quality faster than seeds of other crops (Fabrizius et al. 1999). ... harvest and postharvest management practices (Lewis et al. 1998). Cho and ...
Khan, Aziz; Mathelier, Anthony
A common task for scientists relies on comparing lists of genes or genomic regions derived from high-throughput sequencing experiments. While several tools exist to intersect and visualize sets of genes, similar tools dedicated to the visualization of genomic region sets are currently limited. To address this gap, we have developed the Intervene tool, which provides an easy and automated interface for the effective intersection and visualization of genomic region or list sets, thus facilitating their analysis and interpretation. Intervene contains three modules: venn to generate Venn diagrams of up to six sets, upset to generate UpSet plots of multiple sets, and pairwise to compute and visualize intersections of multiple sets as clustered heat maps. Intervene, and its interactive web ShinyApp companion, generate publication-quality figures for the interpretation of genomic region and list sets. Intervene and its web application companion provide an easy command line and an interactive web interface to compute intersections of multiple genomic and list sets. They have the capacity to plot intersections using easy-to-interpret visual approaches. Intervene is developed and designed to meet the needs of both computer scientists and biologists. The source code is freely available at https://bitbucket.org/CBGR/intervene , with the web application available at https://asntech.shinyapps.io/intervene .
Liu, Fan; van der Lijn, Fedde; Schurmann, Claudia; Zhu, Gu; Chakravarty, M. Mallar; Hysi, Pirro G.; Wollstein, Andreas; Lao, Oscar; de Bruijne, Marleen; Ikram, M. Arfan; van der Lugt, Aad; Rivadeneira, Fernando; Uitterlinden, André G.; Hofman, Albert; Niessen, Wiro J.; Homuth, Georg; de Zubicaray, Greig; McMahon, Katie L.; Thompson, Paul M.; Daboul, Amro; Puls, Ralf; Hegenscheid, Katrin; Bevan, Liisa; Pausova, Zdenka; Medland, Sarah E.; Montgomery, Grant W.; Wright, Margaret J.; Wicking, Carol; Boehringer, Stefan; Spector, Timothy D.; Paus, Tomáš; Martin, Nicholas G.; Biffar, Reiner; Kayser, Manfred
Inter-individual variation in facial shape is one of the most noticeable phenotypes in humans, and it is clearly under genetic regulation; however, almost nothing is known about the genetic basis of normal human facial morphology. We therefore conducted a genome-wide association study for facial shape phenotypes in multiple discovery and replication cohorts, considering almost ten thousand individuals of European descent from several countries. Phenotyping of facial shape features was based on landmark data obtained from three-dimensional head magnetic resonance images (MRIs) and two-dimensional portrait images. We identified five independent genetic loci associated with different facial phenotypes, suggesting the involvement of five candidate genes—PRDM16, PAX3, TP63, C5orf50, and COL17A1—in the determination of the human face. Three of them have been implicated previously in vertebrate craniofacial development and disease, and the remaining two genes potentially represent novel players in the molecular networks governing facial development. Our finding at PAX3 influencing the position of the nasion replicates a recent GWAS of facial features. In addition to the reported GWA findings, we established links between common DNA variants previously associated with NSCL/P at 2p21, 8q24, 13q31, and 17q22 and normal facial-shape variations based on a candidate gene approach. Overall our study implies that DNA variants in genes essential for craniofacial development contribute with relatively small effect size to the spectrum of normal variation in human facial morphology. This observation has important consequences for future studies aiming to identify more genes involved in the human facial morphology, as well as for potential applications of DNA prediction of facial shape such as in future forensic applications. PMID:23028347
Full Text Available Inter-individual variation in facial shape is one of the most noticeable phenotypes in humans, and it is clearly under genetic regulation; however, almost nothing is known about the genetic basis of normal human facial morphology. We therefore conducted a genome-wide association study for facial shape phenotypes in multiple discovery and replication cohorts, considering almost ten thousand individuals of European descent from several countries. Phenotyping of facial shape features was based on landmark data obtained from three-dimensional head magnetic resonance images (MRIs and two-dimensional portrait images. We identified five independent genetic loci associated with different facial phenotypes, suggesting the involvement of five candidate genes--PRDM16, PAX3, TP63, C5orf50, and COL17A1--in the determination of the human face. Three of them have been implicated previously in vertebrate craniofacial development and disease, and the remaining two genes potentially represent novel players in the molecular networks governing facial development. Our finding at PAX3 influencing the position of the nasion replicates a recent GWAS of facial features. In addition to the reported GWA findings, we established links between common DNA variants previously associated with NSCL/P at 2p21, 8q24, 13q31, and 17q22 and normal facial-shape variations based on a candidate gene approach. Overall our study implies that DNA variants in genes essential for craniofacial development contribute with relatively small effect size to the spectrum of normal variation in human facial morphology. This observation has important consequences for future studies aiming to identify more genes involved in the human facial morphology, as well as for potential applications of DNA prediction of facial shape such as in future forensic applications.
Kommadath, A.; Nie, H.; Groenen, M.A.M.; Pas, te M.F.W.; Veerkamp, R.F.; Smits, M.A.
Eukaryotic genes are distributed along chromosomes as clusters of highly expressed genes termed RIDGEs (Regions of IncreaseD Gene Expression) and lowly expressed genes termed anti-RIDGEs, interspersed among genes expressed at intermediate levels or not expressed. Previous studies based on this
Full Text Available Phospho- and sphingolipids are crucial cellular and intracellular compounds. These lipids are required for active transport, a number of enzymatic processes, membrane formation, and cell signalling. Disruption of their metabolism leads to several diseases, with diverse neurological, psychiatric, and metabolic consequences. A large number of phospholipid and sphingolipid species can be detected and measured in human plasma. We conducted a meta-analysis of five European family-based genome-wide association studies (N = 4034 on plasma levels of 24 sphingomyelins (SPM, 9 ceramides (CER, 57 phosphatidylcholines (PC, 20 lysophosphatidylcholines (LPC, 27 phosphatidylethanolamines (PE, and 16 PE-based plasmalogens (PLPE, as well as their proportions in each major class. This effort yielded 25 genome-wide significant loci for phospholipids (smallest P-value = 9.88×10(-204 and 10 loci for sphingolipids (smallest P-value = 3.10×10(-57. After a correction for multiple comparisons (P-value<2.2×10(-9, we observed four novel loci significantly associated with phospholipids (PAQR9, AGPAT1, PKD2L1, PDXDC1 and two with sphingolipids (PLD2 and APOE explaining up to 3.1% of the variance. Further analysis of the top findings with respect to within class molar proportions uncovered three additional loci for phospholipids (PNLIPRP2, PCDH20, and ABDH3 suggesting their involvement in either fatty acid elongation/saturation processes or fatty acid specific turnover mechanisms. Among those, 14 loci (KCNH7, AGPAT1, PNLIPRP2, SYT9, FADS1-2-3, DLG2, APOA1, ELOVL2, CDK17, LIPC, PDXDC1, PLD2, LASS4, and APOE mapped into the glycerophospholipid and 12 loci (ILKAP, ITGA9, AGPAT1, FADS1-2-3, APOA1, PCDH20, LIPC, PDXDC1, SGPP1, APOE, LASS4, and PLD2 to the sphingolipid pathways. In large meta-analyses, associations between FADS1-2-3 and carotid intima media thickness, AGPAT1 and type 2 diabetes, and APOA1 and coronary artery disease were observed. In conclusion, our
Full Text Available BACKGROUND: Medulloblastoma is the most common malignant brain tumor in children. Despite recent improvements in cure rates, prediction of disease outcome remains a major challenge and survivors suffer from serious therapy-related side-effects. Recent data showed that patients with WNT-activated tumors have a favorable prognosis, suggesting that these patients could be treated less intensively, thereby reducing the side-effects. This illustrates the potential benefits of a robust classification of medulloblastoma patients and a detailed knowledge of associated biological mechanisms. METHODS AND FINDINGS: To get a better insight into the molecular biology of medulloblastoma we established mRNA expression profiles of 62 medulloblastomas and analyzed 52 of them also by comparative genomic hybridization (CGH arrays. Five molecular subtypes were identified, characterized by WNT signaling (A; 9 cases, SHH signaling (B; 15 cases, expression of neuronal differentiation genes (C and D; 16 and 11 cases, respectively or photoreceptor genes (D and E; both 11 cases. Mutations in beta-catenin were identified in all 9 type A tumors, but not in any other tumor. PTCH1 mutations were exclusively identified in type B tumors. CGH analysis identified several fully or partly subtype-specific chromosomal aberrations. Monosomy of chromosome 6 occurred only in type A tumors, loss of 9q mostly occurred in type B tumors, whereas chromosome 17 aberrations, most common in medulloblastoma, were strongly associated with type C or D tumors. Loss of the inactivated X-chromosome was highly specific for female cases of type C, D and E tumors. Gene expression levels faithfully reflected the chromosomal copy number changes. Clinicopathological features significantly different between the 5 subtypes included metastatic disease and age at diagnosis and histology. Metastatic disease at diagnosis was significantly associated with subtypes C and D and most strongly with subtype E
Tang, Yew Chung; Ho, Szu-Chi; Tan, Elisabeth; Ng, Alvin Wei Tian; McPherson, John R; Goh, Germaine Yen Lin; Teh, Bin Tean; Bard, Frederic; Rozen, Steven G
Phosphatase and tensin homolog (PTEN) is one of the most frequently inactivated tumor suppressors in breast cancer. While PTEN itself is not considered a druggable target, PTEN synthetic-sick or synthetic-lethal (PTEN-SSL) genes are potential drug targets in PTEN-deficient breast cancers. Therefore, with the aim of identifying potential targets for precision breast cancer therapy, we sought to discover PTEN-SSL genes present in a broad spectrum of breast cancers. To discover broad-spectrum PTEN-SSL genes in breast cancer, we used a multi-step approach that started with (1) a genome-wide short interfering RNA (siRNA) screen of ~ 21,000 genes in a pair of isogenic human mammary epithelial cell lines, followed by (2) a short hairpin RNA (shRNA) screen of ~ 1200 genes focused on hits from the first screen in a panel of 11 breast cancer cell lines; we then determined reproducibility of hits by (3) identification of overlaps between our results and reanalyzed data from 3 independent gene-essentiality screens, and finally, for selected candidate PTEN-SSL genes we (4) confirmed PTEN-SSL activity using either drug sensitivity experiments in a panel of 19 cell lines or mutual exclusivity analysis of publicly available pan-cancer somatic mutation data. The screens (steps 1 and 2) and the reproducibility analysis (step 3) identified six candidate broad-spectrum PTEN-SSL genes (PIK3CB, ADAMTS20, AP1M2, HMMR, STK11, and NUAK1). PIK3CB was previously identified as PTEN-SSL, while the other five genes represent novel PTEN-SSL candidates. Confirmation studies (step 4) provided additional evidence that NUAK1 and STK11 have PTEN-SSL patterns of activity. Consistent with PTEN-SSL status, inhibition of the NUAK1 protein kinase by the small molecule drug HTH-01-015 selectively impaired viability in multiple PTEN-deficient breast cancer cell lines, while mutations affecting STK11 and PTEN were largely mutually exclusive across large pan-cancer data sets. Six genes showed PTEN
Schrimpf, Rahel; Gottschalk, Maren; Metzger, Julia; Martinsson, Gunilla; Sieme, Harald; Distl, Ottmar
Stallion fertility is an economically important trait due to the increase of artificial insemination in horses. The availability of whole genome sequence data facilitates identification of rare high-impact variants contributing to stallion fertility. The aim of our study was to genotype rare high-impact variants retrieved from next-generation sequencing (NGS)-data of 11 horses in order to unravel harmful genetic variants in large samples of stallions. Gene ontology (GO) terms and search results from public databases were used to obtain a comprehensive list of human und mice genes predicted to participate in the regulation of male reproduction. The corresponding equine orthologous genes were searched in whole genome sequence data of seven stallions and four mares and filtered for high-impact genetic variants using SnpEFF, SIFT and Polyphen 2 software. All genetic variants with the missing homozygous mutant genotype were genotyped on 337 fertile stallions of 19 breeds using KASP genotyping assays or PCR-RFLP. Mixed linear model analysis was employed for an association analysis with de-regressed estimated breeding values of the paternal component of the pregnancy rate per estrus (EBV-PAT). We screened next generation sequenced data of whole genomes from 11 horses for equine genetic variants in 1194 human and mice genes involved in male fertility and linked through common gene ontology (GO) with male reproductive processes. Variants were filtered for high-impact on protein structure and validated through SIFT and Polyphen 2. Only those genetic variants were followed up when the homozygote mutant genotype was missing in the detection sample comprising 11 horses. After this filtering process, 17 single nucleotide polymorphism (SNPs) were left. These SNPs were genotyped in 337 fertile stallions of 19 breeds using KASP genotyping assays or PCR-RFLP. An association analysis in 216 Hanoverian stallions revealed a significant association of the splice-site disruption variant
Higgins, Michael J.; Day, Colleen D.; Smilinich, Nancy J.; Ni, L.; Cooper, Paul R.; Nowak, Norma J.; Davies, Chris; de Jong, Pieter J.; Hejtmancik, Fielding; Evans, Glen A.; Smith, Richard J.H.; Shows, Thomas B.
Usher syndrome 1C (USH1C) is a congenital condition manifesting profound hearing loss, the absence of vestibular function, and eventual retinal degeneration. The USH1C locus has been mapped genetically to a 2- to 3-cM interval in 11p14–15.1 between D11S899 and D11S861. In an effort to identify the USH1C disease gene we have isolated the region between these markers in yeast artificial chromosomes (YACs) using a combination of STS content mapping and Alu–PCR hybridization. The YAC contig is ∼3.5 Mb and has located several other loci within this interval, resulting in the order CEN-LDHA-SAA1-TPH-D11S1310-(D11S1888/KCNC1)-MYOD1-D11S902D11S921-D11S1890-TEL. Subsequent haplotyping and homozygosity analysis refined the location of the disease gene to a 400-kb interval between D11S902 and D11S1890 with all affected individuals being homozygous for the internal marker D11S921. To facilitate gene identification, the critical region has been converted into P1 artificial chromosome (PAC) clones using sequence-tagged sites (STSs) mapped to the YAC contig, Alu–PCR products generated from the YACs, and PAC end probes. A contig of >50 PAC clones has been assembled between D11S1310 and D11S1890, confirming the order of markers used in haplotyping. Three PAC clones representing nearly two-thirds of the USH1C critical region have been sequenced. PowerBLAST analysis identified six clusters of expressed sequence tags (ESTs), two known genes (BIR,SUR1) mapped previously to this region, and a previously characterized but unmapped gene NEFA (DNA binding/EF hand/acidic amino-acid-rich). GRAIL analysis identified 11 CpG islands and 73 exons of excellent quality. These data allowed the construction of a transcription map for the USH1C critical region, consisting of three known genes and six or more novel transcripts. Based on their map location, these loci represent candidate disease loci for USH1C. The NEFA gene was assessed as the USH1C locus by the sequencing of an amplified NEFA
Blanca E Himes
Full Text Available Asthma is a common chronic respiratory disease characterized by airway hyperresponsiveness (AHR. The genetics of asthma have been widely studied in mouse and human, and homologous genomic regions have been associated with mouse AHR and human asthma-related phenotypes. Our goal was to identify asthma-related genes by integrating AHR associations in mouse with human genome-wide association study (GWAS data. We used Efficient Mixed Model Association (EMMA analysis to conduct a GWAS of baseline AHR measures from males and females of 31 mouse strains. Genes near or containing SNPs with EMMA p-values <0.001 were selected for further study in human GWAS. The results of the previously reported EVE consortium asthma GWAS meta-analysis consisting of 12,958 diverse North American subjects from 9 study centers were used to select a subset of homologous genes with evidence of association with asthma in humans. Following validation attempts in three human asthma GWAS (i.e., Sepracor/LOCCS/LODO/Illumina, GABRIEL, DAG and two human AHR GWAS (i.e., SHARP, DAG, the Kv channel interacting protein 4 (KCNIP4 gene was identified as nominally associated with both asthma and AHR at a gene- and SNP-level. In EVE, the smallest KCNIP4 association was at rs6833065 (P-value 2.9e-04, while the strongest associations for Sepracor/LOCCS/LODO/Illumina, GABRIEL, DAG were 1.5e-03, 1.0e-03, 3.1e-03 at rs7664617, rs4697177, rs4696975, respectively. At a SNP level, the strongest association across all asthma GWAS was at rs4697177 (P-value 1.1e-04. The smallest P-values for association with AHR were 2.3e-03 at rs11947661 in SHARP and 2.1e-03 at rs402802 in DAG. Functional studies are required to validate the potential involvement of KCNIP4 in modulating asthma susceptibility and/or AHR. Our results suggest that a useful approach to identify genes associated with human asthma is to leverage mouse AHR association data.
Full Text Available Abstract Background To have an insight into the Mayetiola destructor (Hessian fly genome, we performed an in silico comparative genomic analysis utilizing genetic mapping, genomic sequence and EST sequence data along with data available from public databases. Results Chromosome walking and FISH were utilized to identify a contig of 50 BAC clones near the telomere of the short arm of Hessian fly chromosome X2 and near the avirulence gene vH13. These clones enabled us to correlate physical and genetic distance in this region of the Hessian fly genome. Sequence data from these BAC ends encompassing a 760 kb region, and a fully sequenced and assembled 42.6 kb BAC clone, was utilized to perform a comparative genomic study. In silico gene prediction combined with BLAST analyses was used to determine putative orthology to the sequenced dipteran genomes of the fruit fly, Drosophila melanogaster, and the malaria mosquito, Anopheles gambiae, and to infer evolutionary relationships. Conclusion This initial effort enables us to advance our understanding of the structure, composition and evolution of the genome of this important agricultural pest and is an invaluable tool for a whole genome sequencing effort.
Roe Bruce A
Full Text Available Abstract Background Recent genome sequencing enables mega-base scale comparisons between related genomes. Comparisons between animals, plants, fungi, and bacteria demonstrate extensive synteny tempered by rearrangements. Within the legume plant family, glimpses of synteny have also been observed. Characterizing syntenic relationships in legumes is important in transferring knowledge from model legumes to crops that are important sources of protein, fixed nitrogen, and health-promoting compounds. Results We have uncovered two large soybean regions exhibiting synteny with M. truncatula and with a network of segmentally duplicated regions in Arabidopsis. In all, syntenic regions comprise over 500 predicted genes spanning 3 Mb. Up to 75% of soybean genes are colinear with M. truncatula, including one region in which 33 of 35 soybean predicted genes with database support are colinear to M. truncatula. In some regions, 60% of soybean genes share colinearity with a network of A. thaliana duplications. One region is especially interesting because this 500 kbp segment of soybean is syntenic to two paralogous regions in M. truncatula on different chromosomes. Phylogenetic analysis of individual genes within these regions demonstrates that one is orthologous to the soybean region, with which it also shows substantially denser synteny and significantly lower levels of synonymous nucleotide substitutions. The other M. truncatula region is inferred to be paralogous, presumably resulting from a duplication event preceding speciation. Conclusion The presence of well-defined M. truncatula segments showing orthologous and paralogous relationships with soybean allows us to explore the evolution of contiguous genomic regions in the context of ancient genome duplication and speciation events.
Avila Cobos, Francisco; Anckaert, Jasper; Volders, Pieter-Jan; Everaert, Celine; Rombaut, Dries; Vandesompele, Jo; De Preter, Katleen; Mestdagh, Pieter
Reconstructing transcript models from RNA-sequencing (RNA-seq) data and establishing these as independent transcriptional units can be a challenging task. Current state-of-the-art tools for long non-coding RNA (lncRNA) annotation are mainly based on evolutionary constraints, which may result in false negatives due to the overall limited conservation of lncRNAs. To tackle this problem we have developed the Zipper plot, a novel visualization and analysis method that enables users to simultaneously interrogate thousands of human putative transcription start sites (TSSs) in relation to various features that are indicative for transcriptional activity. These include publicly available CAGE-sequencing, ChIP-sequencing and DNase-sequencing datasets. Our method only requires three tab-separated fields (chromosome, genomic coordinate of the TSS and strand) as input and generates a report that includes a detailed summary table, a Zipper plot and several statistics derived from this plot. Using the Zipper plot, we found evidence of transcription for a set of well-characterized lncRNAs and observed that fewer mono-exonic lncRNAs have CAGE peaks overlapping with their TSSs compared to multi-exonic lncRNAs. Using publicly available RNA-seq data, we found more than one hundred cases where junction reads connected protein-coding gene exons with a downstream mono-exonic lncRNA, revealing the need for a careful evaluation of lncRNA 5'-boundaries. Our method is implemented using the statistical programming language R and is freely available as a webtool.
Zhang, Yang; Devries, Mark E; Skolnick, Jeffrey
G protein-coupled receptors (GPCRs), encoded by about 5% of human genes, comprise the largest family of integral membrane proteins and act as cell surface receptors responsible for the transduction of endogenous signal into a cellular response. Although tertiary structural information is crucial for function annotation and drug design, there are few experimentally determined GPCR structures. To address this issue, we employ the recently developed threading assembly refinement (TASSER) method to generate structure predictions for all 907 putative GPCRs in the human genome. Unlike traditional homology modeling approaches, TASSER modeling does not require solved homologous template structures; moreover, it often refines the structures closer to native. These features are essential for the comprehensive modeling of all human GPCRs when close homologous templates are absent. Based on a benchmarked confidence score, approximately 820 predicted models should have the correct folds. The majority of GPCR models share the characteristic seven-transmembrane helix topology, but 45 ORFs are predicted to have different structures. This is due to GPCR fragments that are predominantly from extracellular or intracellular domains as well as database annotation errors. Our preliminary validation includes the automated modeling of bovine rhodopsin, the only solved GPCR in the Protein Data Bank. With homologous templates excluded, the final model built by TASSER has a global C(alpha) root-mean-squared deviation from native of 4.6 angstroms, with a root-mean-squared deviation in the transmembrane helix region of 2.1 angstroms. Models of several representative GPCRs are compared with mutagenesis and affinity labeling data, and consistent agreement is demonstrated. Structure clustering of the predicted models shows that GPCRs with similar structures tend to belong to a similar functional class even when their sequences are diverse. These results demonstrate the usefulness and robustness
Full Text Available G protein-coupled receptors (GPCRs, encoded by about 5% of human genes, comprise the largest family of integral membrane proteins and act as cell surface receptors responsible for the transduction of endogenous signal into a cellular response. Although tertiary structural information is crucial for function annotation and drug design, there are few experimentally determined GPCR structures. To address this issue, we employ the recently developed threading assembly refinement (TASSER method to generate structure predictions for all 907 putative GPCRs in the human genome. Unlike traditional homology modeling approaches, TASSER modeling does not require solved homologous template structures; moreover, it often refines the structures closer to native. These features are essential for the comprehensive modeling of all human GPCRs when close homologous templates are absent. Based on a benchmarked confidence score, approximately 820 predicted models should have the correct folds. The majority of GPCR models share the characteristic seven-transmembrane helix topology, but 45 ORFs are predicted to have different structures. This is due to GPCR fragments that are predominantly from extracellular or intracellular domains as well as database annotation errors. Our preliminary validation includes the automated modeling of bovine rhodopsin, the only solved GPCR in the Protein Data Bank. With homologous templates excluded, the final model built by TASSER has a global C(alpha root-mean-squared deviation from native of 4.6 angstroms, with a root-mean-squared deviation in the transmembrane helix region of 2.1 angstroms. Models of several representative GPCRs are compared with mutagenesis and affinity labeling data, and consistent agreement is demonstrated. Structure clustering of the predicted models shows that GPCRs with similar structures tend to belong to a similar functional class even when their sequences are diverse. These results demonstrate the usefulness
Boschiero, Clarissa; Moreira, Gabriel Costa Monteiro; Gheyas, Almas Ara; Godoy, Thaís Fernanda; Gasparin, Gustavo; Mariani, Pilar Drummond Sampaio Corrêa; Paduan, Marcela; Cesar, Aline Silva Mello; Ledur, Mônica Corrêa; Coutinho, Luiz Lehmann
Meat and egg-type chickens have been selected for several generations for different traits. Artificial and natural selection for different phenotypes can change frequency of genetic variants, leaving particular genomic footprints throghtout the genome. Thus, the aims of this study were to sequence 28 chickens from two Brazilian lines (meat and white egg-type) and use this information to characterize genome-wide genetic variations, identify putative regions under selection using Fst method, and find putative pathways under selection. A total of 13.93 million SNPs and 1.36 million INDELs were identified, with more variants detected from the broiler (meat-type) line. Although most were located in non-coding regions, we identified 7255 intolerant non-synonymous SNPs, 512 stopgain/loss SNPs, 1381 frameshift and 1094 non-frameshift INDELs that may alter protein functions. Genes harboring intolerant non-synonymous SNPs affected metabolic pathways related mainly to reproduction and endocrine systems in the white-egg layer line, and lipid metabolism and metabolic diseases in the broiler line. Fst analysis in sliding windows, using SNPs and INDELs separately, identified over 300 putative regions of selection overlapping with more than 250 genes. For the first time in chicken, INDEL variants were considered for selection signature analysis, showing high level of correlation in results between SNP and INDEL data. The putative regions of selection signatures revealed interesting candidate genes and pathways related to important phenotypic traits in chicken, such as lipid metabolism, growth, reproduction, and cardiac development. In this study, Fst method was applied to identify high confidence putative regions under selection, providing novel insights into selection footprints that can help elucidate the functional mechanisms underlying different phenotypic traits relevant to meat and egg-type chicken lines. In addition, we generated a large catalog of line-specific and common
Leekitcharoenphon, Pimlapas; Kaas, Rolf Sommer; Thomsen, Martin Christen Frølund
identify SNPs and construct phylogenetic trees from WGS as well as from assembled genomes or contigs. WGS data in fastq format are aligned to reference genomes by BWA while contigs in fasta format are processed by Nucmer. SNPs are concatenated based on position on reference genome and a tree is constructed...... to differentiate and classify isolates. One of the successfully and broadly used methods is analysis of single nucletide polymorphisms (SNPs). Currently, there are different tools and methods to identify SNPs including various options and cut-off values. Furthermore, all current methods require bioinformatic...... skills. Thus, we lack a standard and simple automatic tool to determine SNPs and construct phylogenetic tree from WGS data. Results Here we introduce snpTree, a server for online-automatic SNPs analysis. This tool is composed of different SNPs analysis suites, perl and python scripts. snpTree can...
Speliotes, Elizabeth K; Yerges-Armstrong, Laura M; Wu, Jun
steatosis, a non-invasive measure of NAFLD, in large population based samples. Using variance components methods, we show that CT hepatic steatosis is heritable (~26%-27%) in family-based Amish, Family Heart, and Framingham Heart Studies (n¿=¿880 to 3,070). By carrying out a fixed-effects meta......-analysis of genome-wide association (GWA) results between CT hepatic steatosis and ~2.4 million imputed or genotyped SNPs in 7,176 individuals from the Old Order Amish, Age, Gene/Environment Susceptibility-Reykjavik study (AGES), Family Heart, and Framingham Heart Studies, we identify variants associated at genome......Nonalcoholic fatty liver disease (NAFLD) clusters in families, but the only known common genetic variants influencing risk are near PNPLA3. We sought to identify additional genetic variants influencing NAFLD using genome-wide association (GWA) analysis of computed tomography (CT) measured hepatic...
Liu, Xiaohua; Kelsoe, John R; Greenwood, Tiffany A
Bipolar disorder is a heterogeneous mood disorder associated with several important clinical comorbidities, such as eating disorders. This clinical heterogeneity complicates the identification of genetic variants contributing to bipolar susceptibility. Here we investigate comorbidity of eating disorders as a subphenotype of bipolar disorder to identify genetic variation that is common and unique to both disorders. We performed a genome-wide association analysis contrasting 184 bipolar subjects with eating disorder comorbidity against both 1370 controls and 2006 subjects with bipolar disorder only from the Bipolar Genome Study (BiGS). The most significant genome-wide finding was observed bipolar with comorbid eating disorder vs. controls within SOX2-OT (p=8.9×10(-8) for rs4854912) with a secondary peak in the adjacent FXR1 gene (p=1.2×10(-6) for rs1805576) on chromosome 3q26.33. This region was also the most prominent finding in the case-only analysis (p=3.5×10(-7) and 4.3×10(-6), respectively). Several regions of interest containing genes involved in neurodevelopment and neuroprotection processes were also identified. While our primary finding did not quite reach genome-wide significance, likely due to the relatively limited sample size, these results can be viewed as a replication of a recent study of eating disorders in a large cohort. These findings replicate the prior association of SOX2-OT with eating disorders and broadly support the involvement of neurodevelopmental/neuroprotective mechanisms in the pathophysiology of both disorders. They further suggest that different clinical manifestations of bipolar disorder may reflect differential genetic contributions and argue for the utility of clinical subphenotypes in identifying additional molecular pathways leading to illness. Copyright © 2015 Elsevier B.V. All rights reserved.
Pearson, Hillary; Granados, Diana Paola; Durette, Chantal; Bonneil, Eric; Courcelles, Mathieu; Rodenbrock, Anja; Laverdure, Jean-Philippe; Côté, Caroline; Thibault, Pierre
MHC class I–associated peptides (MAPs) define the immune self for CD8+ T lymphocytes and are key targets of cancer immunosurveillance. Here, the goals of our work were to determine whether the entire set of protein-coding genes could generate MAPs and whether specific features influence the ability of discrete genes to generate MAPs. Using proteogenomics, we have identified 25,270 MAPs isolated from the B lymphocytes of 18 individuals who collectively expressed 27 high-frequency HLA-A,B allotypes. The entire MAP repertoire presented by these 27 allotypes covered only 10% of the exomic sequences expressed in B lymphocytes. Indeed, 41% of expressed protein-coding genes generated no MAPs, while 59% of genes generated up to 64 MAPs, often derived from adjacent regions and presented by different allotypes. We next identified several features of transcripts and proteins associated with efficient MAP production. From these data, we built a logistic regression model that predicts with good accuracy whether a gene generates MAPs. Our results show preferential selection of MAPs from a limited repertoire of proteins with distinctive features. The notion that the MHC class I immunopeptidome presents only a small fraction of the protein-coding genome for monitoring by the immune system has profound implications in autoimmunity and cancer immunology. PMID:27841757
Full Text Available Manganese (Mn is an essential micro-nutrient for plants, but flooded rice fields can accumulate high levels of Mn2+ leading to Mn toxicity. Here, we present a genome-wide association study (GWAS to identify candidate loci conferring Mn toxicity tolerance in rice (Oryza sativa L.. A diversity panel of 288 genotypes was grown in hydroponic solutions in a greenhouse under optimal and toxic Mn concentrations. We applied a Mn toxicity treatment (5 ppm Mn2+, 3 weeks at twelve days after transplanting. Mn toxicity caused moderate damage in rice in terms of biomass loss and symptom formation despite extremely high shoot Mn concentrations ranging from 2.4 to 17.4 mg g-1. The tropical japonica subpopulation was more sensitive to Mn toxicity than other subpopulations. Leaf damage symptoms were significantly correlated with Mn uptake into shoots. Association mapping was conducted for seven traits using 416741 single nucleotide polymorphism (SNP markers using a mixed linear model, and detected six significant associations for the traits shoot manganese concentration and relative shoot length. Candidate regions contained genes coding for a heavy metal transporter, peroxidase precursor and Mn2+ ion binding proteins. The significant marker SNP-2.22465867 caused an amino acid change in a gene (LOC_Os02g37170 with unknown function. This study demonstrated significant natural variation in rice for Mn toxicity tolerance and the possibility of using GWAS to unravel genetic factors responsible for such complex traits.
Xu, Xiao; Sun, Xin; Hu, Xue-Song; Zhuang, Yan; Liu, Yue-Chen; Meng, Hao; Miao, Lin; Yu, He; Luo, Shu-Jin
Domestic cats exhibit abundant variations in tail morphology and serve as an excellent model to study the development and evolution of vertebrate tails. Cats with shortened and kinked tails were first recorded in the Malayan archipelago by Charles Darwin in 1868 and remain quite common today in Southeast and East Asia. To elucidate the genetic basis of short tails in Asian cats, we built a pedigree of 13 cats segregating at the trait with a founder from southern China and performed linkage mapping based on whole genome sequencing data from the pedigree. The short-tailed trait was mapped to a 5.6 Mb region of Chr E1, within which the substitution c. 5T > C in the somite segmentation-related gene HES7 was identified as the causal mutation resulting in a missense change (p.V2A). Validation in 245 unrelated cats confirmed the correlation between HES7-c. 5T > C and Chinese short-tailed feral cats as well as the Japanese Bobtail breed, indicating a common genetic basis of the two. In addition, some of our sampled kinked-tailed cats could not be explained by either HES7 or the Manx-related T-box, suggesting at least three independent events in the evolution of domestic cats giving rise to short-tailed traits.
Wang, Zhaoming; McGlynn, Katherine A.; Rajpert-De Meyts, Ewa
The international Testicular Cancer Consortium (TECAC) combined five published genome-wide association studies of testicular germ cell tumor (TGCT; 3,558 cases and 13,970 controls) to identify new susceptibility loci. We conducted a fixed-effects meta-analysis, including, to our knowledge, the fi...
Goode, Ellen L; Chenevix-Trench, Georgia; Song, Honglin
Ovarian cancer accounts for more deaths than all other gynecological cancers combined. To identify common low-penetrance ovarian cancer susceptibility genes, we conducted a genome-wide association study of 507,094 SNPs in 1,768 individuals with ovarian cancer (cases) and 2,354 controls, with foll...
Jin, Ying; Andersen, Genevieve; Yorgov, Daniel; Ferrara, Tracey M.; Ben, Songtao; Brownson, Kelly M.; Holland, Paulene J.; Birlea, Stanca A.; Siebert, Janet; Hartmann, Anke; Lienert, Anne; van Geel, Nanja; Lambert, Jo; Luiten, Rosalie M.; Wolkerstorfer, Albert; Wietze van der Veen, J. P.; Bennett, Dorothy C.; Taïeb, Alain; Ezzedine, Khaled; Kemp, E. Helen; Gawkrodger, David J.; Weetman, Anthony P.; Kõks, Sulev; Prans, Ele; Kingo, Külli; Karelson, Maire; Wallace, Margaret R.; McCormack, Wayne T.; Overbeck, Andreas; Moretti, Silvia; Colucci, Roberta; Picardo, Mauro; Silverberg, Nanette B.; Olsson, Mats; Valle, Yan; Korobko, Igor; Böhm, Markus; Lim, Henry W.; Hamzavi, Iltefat; Zhou, Li; Mi, Qing-Sheng; Fain, Pamela R.; Santorico, Stephanie A.; Spritz, Richard A.
Vitiligo is an autoimmune disease in which depigmented skin results from the destruction of melanocytes, with epidemiological association with other autoimmune diseases. In previous linkage and genome-wide association studies (GWAS1 and GWAS2), we identified 27 vitiligo susceptibility loci in
Lan, Q.; Hsiung, C.A.; Matsuo, K.; Hong, Y.C.; Seow, A.; Wang, Z.; Hosgood, H.D.; Chen, K.; Wang, J.C.; Chatterjee, N.; Hu, W.; Wong, M.P.; Zheng, W.; Caporaso, N.; Park, J.Y.; Chen, C.J.; Kim, Y.H.; Kim, Y.T.; Landi, M.T.; Shen, H.; Lawrence, C.; Burdett, L.; Yeager, M.; Yuenger, J.; Jacobs, K.B.; Chang, I.S.; Mitsudomi, T.; Kim, H.N.; Chang, G.C.; Bassig, B.A.; Tucker, M.; Wei, F.; Yin, Y.; Wu, C.; An, S.J.; Qian, B.; Lee, V.H.; Lu, D.; Liu, J.; Jeon, H.S.; Hsiao, C.F.; Sung, J.S.; Kim, J.H.; Gao, Y.T.; Tsai, Y.H.; Jung, Y.J.; Guo, H.; Hu, Z.; Hutchinson, A.; Wang, W.C.; Klein, R.; Chung, C.C.; Oh, I.J.; Chen, K.Y.; Berndt, S.I.; He, X.; Wu, W.; Chang, J.; Zhang, X.C.; Huang, M.S.; Zheng, H.; Wang, J.; Zhao, X.|info:eu-repo/dai/nl/413577805; Li, Y.; Choi, J.E.; Su, W.C.; Park, K.H.; Sung, S.W.; Shu, X.O.; Chen, Y.M.; Liu, L.; Kang, C.H.; Hu, L.; Chen, C.H.; Pao, W.; Kim, Y.C.; Yang, T.Y.; Xu, J.; Guan, P.; Tan, W.; Su, J.; Wang, C.L.; Li, H.; Sihoe, A.D.; Zhao, Z.|info:eu-repo/dai/nl/304120995; Chen, Y.; Choi, Y.Y.; Hung, J.Y.; Kim, J.S.; Yoon, H.I.; Cai, Q.; Lin, C.C.; Park, I.K.; Xu, P.; Dong, J.; Kim, C.; He, Q; Perng, R.P.; Kohno, T.; Kweon, S.S.; Chen, C.Y.; Vermeulen, R.|info:eu-repo/dai/nl/216532620; Wu, J.; Lim, W.Y.; Chen, K.C.; Chow, W.H.; Ji, B.T.; Chan, J.K.; Chu, M.; Li, Y.J.; Yokota, J.; Li, J.; Chen, H.; Xiang, Y.B.; Yu, C.J.; Kunitoh, H.; Wu, G.; Jin, L.; Lo, Y.L.; Shiraishi, K.; Chen, Y.H.; Lin, H.C.; Wu, T.; WU, Y.; Yang, P.C.; Zhou, B.; Shin, M.H.; Fraumeni, J.F.; Lin, D.; Chanock, S.J.; Rothman, N.
To identify common genetic variants that contribute to lung cancer susceptibility, we conducted a multistage genome-wide association study of lung cancer in Asian women who never smoked. We scanned 5,510 never-smoking female lung cancer cases and 4,544 controls drawn from 14 studies from mainland
S.I. Berndt (Sonja); S. Gustafsson (Stefan); R. Mägi (Reedik); A. Ganna (Andrea); E. Wheeler (Eleanor); M.F. Feitosa (Mary Furlan); A.E. Justice (Anne); K.L. Monda (Keri); D.C. Croteau-Chonka (Damien); F.R. Day (Felix); T. Esko (Tõnu); M. Fall (Magnus); T. Ferreira (Teresa); D. Gentilini (Davide); A.U. Jackson (Anne); J. Luan; J.C. Randall (Joshua); S. Vedantam (Sailaja); C.J. Willer (Cristen); T.W. Winkler (Thomas); A.R. Wood (Andrew); T. Workalemahu (Tsegaselassie); Y.-J. Hu (Yi-Juan); S.H. Lee (Sang Hong); L. Liang (Liming); D.Y. Lin (Dan); J. Min (Josine); B.M. Neale (Benjamin); G. Thorleifsson (Gudmar); J. Yang (Jian); E. Albrecht (Eva); N. Amin (Najaf); J.L. Bragg-Gresham (Jennifer L.); G. Cadby (Gemma); M. den Heijer (Martin); N. Eklund (Niina); K. Fischer (Krista); A. Goel (Anuj); J.J. Hottenga (Jouke Jan); J.E. Huffman (Jennifer); I. Jarick (Ivonne); A. Johansson (Åsa); T. Johnson (Toby); S. Kanoni (Stavroula); M.E. Kleber (Marcus); I.R. König (Inke); K. Kristiansson (Kati); Z. Kutalik (Zoltán); C. Lamina (Claudia); C. Lecoeur (Cécile); G. Li (Guo); M. Mangino (Massimo); W.L. McArdle (Wendy); M.C. Medina-Gomez (Carolina); M. Müller-Nurasyid (Martina); J.S. Ngwa; I.M. Nolte (Ilja); L. Paternoster (Lavinia); S. Pechlivanis (Sonali); M. Perola (Markus); M.J. Peters (Marjolein); M. Preuss (Michael); L.M. Rose (Lynda); J. Shi (Jianxin); D. Shungin (Dmitry); G.D. Smith; R.J. Strawbridge (Rona); I. Surakka (Ida); A. Teumer (Alexander); M.D. Trip (Mieke); J.P. Tyrer (Jonathan); J.V. van Vliet-Ostaptchouk (Jana); L. Vandenput (Liesbeth); L. Waite (Lindsay); J.H. Zhao (Jing Hua); D. Absher (Devin); F.W. Asselbergs (Folkert); M. Atalay (Mustafa); A.P. Attwood (Antony); A.J. Balmforth (Anthony); D.C.G. Basart (Dick); J.P. Beilby (John); L.L. Bonnycastle (Lori); P. Brambilla (Paolo); M. Bruinenberg (M.); H. Campbell (Harry); D.I. Chasman (Daniel); P.S. Chines (Peter); F.S. Collins (Francis); J. Connell (John); W. O Cookson (William); U. de Faire (Ulf); F. de Vegt (Femmie); M. Dei (Mariano); M. Dimitriou (Maria); T. Edkins (Ted); K. Estrada Gil (Karol); D.M. Evans (David); M. Farrall (Martin); F. Ferrario (Franco); J. Ferrières (Jean); L. Franke (Lude); F. Frau (Francesca); P.V. Gejman (Pablo); H. Grallert (Harald); H. Grönberg (Henrik); V. Gudnason (Vilmundur); A. Hall (Anne); A.S. Hall (Alistair); A.L. Hartikainen; C. Hayward (Caroline); N.L. Heard-Costa (Nancy); A.C. Heath (Andrew); J. Hebebrand (Johannes); G. Homuth (Georg); F.B. Hu (Frank); S.E. Hunt (Sarah); E. Hyppönen (Elina); C. Iribarren (Carlos); K.B. Jacobs (Kevin); J.-O. Jansson (John-Olov); A. Jula (Antti); M. Kähönen (Mika); S. Kathiresan (Sekar); F. Kee (F.); K-T. Khaw (Kay-Tee); M. Kivimaki (Mika); W. Koenig (Wolfgang); A. Kraja (Aldi); M. Kumari (Meena); K. Kuulasmaa (Kari); J. Kuusisto (Johanna); J. Laitinen (Jaana); T.A. Lakka (Timo); C. Langenberg (Claudia); L.J. Launer (Lenore); L. Lind (Lars); J. Lindstrom (Jaana); J. Liu (Jianjun); A. Liuzzi (Antonio); M.L. Lokki; M. Lorentzon (Mattias); P.A. Madden (Pamela); P.K. Magnusson (Patrik); P. Manunta (Paolo); D. Marek (Diana); W. März (Winfried); I.M. Leach (Irene Mateo); B. McKnight (Barbara); S.E. Medland (Sarah Elizabeth); E. Mihailov (Evelin); L. Milani (Lili); G.W. Montgomery (Grant); V. Mooser (Vincent); T.W. Mühleisen (Thomas); P. Munroe (Patricia); A.W. Musk (Arthur); N. Narisu (Narisu); G. Navis (Gerjan); G. Nicholson (Ggeorge); C. Nohr (Christian); K. Ong (Ken); B.A. Oostra (Ben); C.N.A. Palmer (Colin); A. Palotie (Aarno); J. Peden (John); N. Pedersen; A. Peters (Annette); O. Polasek (Ozren); A. Pouta (Anneli); P.P. Pramstaller (Peter Paul); I. Prokopenko (Inga); C. Pütter (Carolin); A. Radhakrishnan (Aparna); O. Raitakari (Olli); A. Rendon (Augusto); F. Rivadeneira Ramirez (Fernando); I. Rudan (Igor); T. Saaristo (Timo); J.G. Sambrook (Jennifer); A.R. Sanders (Alan); S. Sanna (Serena); J. Saramies (Jouko); S. Schipf (Sabine); S. Schreiber (Stefan); H. Schunkert (Heribert); S.-Y. Shin; S. Signorini (Stefano); J. Sinisalo (Juha); B. Skrobek (Boris); N. Soranzo (Nicole); A. Stancáková (Alena); K. Stark (Klaus); J. Stephens (Jonathan); K. Stirrups (Kathy); R.P. Stolk (Ronald); M. Stumvoll (Michael); A.J. Swift (Amy); E.V. Theodoraki (Eirini); B. Thorand (Barbara); D.-A. Tregouet (David-Alexandre); E. Tremoli (Elena); M.M. van der Klauw (Melanie); J.B.J. van Meurs (Joyce); S.H.H.M. Vermeulen (Sita); J. Viikari (Jorma); J. Virtamo (Jarmo); V. Vitart (Veronique); G. Waeber (Gérard); Z. Wang (Zhaoming); E. Widen (Elisabeth); S.H. Wild (Sarah); G.A.H.M. Willemsen (Gonneke); B. Winkelmann; J.C.M. Witteman (Jacqueline); B.H.R. Wolffenbuttel (Bruce); A. Wong (Andrew); A.F. Wright (Alan); M.C. Zillikens (Carola); P. Amouyel (Philippe); B.O. Boehm (Bernhard); E.A. Boerwinkle (Eric); D.I. Boomsma (Dorret); M. Caulfield (Mark); S.J. Chanock (Stephen); L.A. Cupples (Adrienne); D. Cusi (Daniele); G.V. Dedoussis (George); J. Erdmann (Jeanette); J.G. Eriksson (Johan); P.W. Franks (Paul); P. Froguel (Philippe); C. Gieger (Christian); U. Gyllensten (Ulf); A. Hamsten (Anders); T.B. Harris (Tamara); C. Hengstenberg (Christian); A.A. Hicks (Andrew); A. Hingorani (Aroon); A. Hinney (Anke); A. Hofman (Albert); G.K. Hovingh (Kees); K. Hveem (Kristian); T. Illig (Thomas); M.-R. Jarvelin (Marjo-Riitta); K.-H. Jöckel (Karl-Heinz); S. Keinanen-Kiukaanniemi (Sirkka); L.A.L.M. Kiemeney (Bart); D. Kuh (Diana); M. Laakso (Markku); T. Lehtimäki (Terho); D.F. Levinson (Douglas); N.G. Martin (Nicholas); A. Metspalu (Andres); A.D. Morris (Andrew); M.S. Nieminen (Markku); I. Njølstad (Inger); C. Ohlsson (Claes); A.J. Oldehinkel (Albertine); W.H. Ouwehand (Willem); C. Palmer (Cameron); B.W.J.H. Penninx (Brenda); C. Power (Christopher); M.A. Province (Mike); B.M. Psaty (Bruce); L. Qi (Lu); R. Rauramaa (Rainer); P.M. Ridker (Paul); S. Ripatti (Samuli); V. Salomaa (Veikko); N.J. Samani (Nilesh); H. Snieder (Harold); H.G. Sorensen; T.D. Spector (Timothy); J-A. Zwart (John-Anker); A. Tönjes (Anke); J. Tuomilehto (Jaakko); A.G. Uitterlinden (André); M. Uusitupa (Matti); P. van der Harst (Pim); P. Vollenweider (Peter); H. Wallaschofski (Henri); N.J. Wareham (Nick); H. Watkins (Hugh); H.E. Wichmann (Heinz Erich); J.F. Wilson (James F); G.R. Abecasis (Gonçalo); T.L. Assimes (Themistocles); I.E. Barroso (Inês); M. Boehnke (Michael); I.B. Borecki (Ingrid); P. Deloukas (Panagiotis); C. Fox (Craig); T.M. Frayling (Timothy); L. Groop (Leif); T. Haritunian (Talin); I.M. Heid (Iris); D. Hunter (David); R.C. Kaplan (Robert); F. Karpe (Fredrik); M.F. Moffatt (Miriam); K.L. Mohlke (Karen); J.R. O´Connell; Y. Pawitan (Yudi); E.E. Schadt (Eric); D. Schlessinger (David); V. Steinthorsdottir (Valgerdur); D.P. Strachan (David); U. Thorsteinsdottir (Unnur); C.M. van Duijn (Cornelia); P.M. Visscher (Peter); A.M. Di Blasio (Anna Maria); J.N. Hirschhorn (Joel); C.M. Lindgren (Cecilia); A.D. Morris (Andrew); D. Meyre (David); A. Scherag (Andre); M.I. McCarthy (Mark); E.K. Speliotes (Elizabeth); K.E. North (Kari); R.J.F. Loos (Ruth); E. Ingelsson (Erik)
textabstractApproaches exploiting trait distribution extremes may be used to identify loci associated with common traits, but it is unknown whether these loci are generalizable to the broader population. In a genome-wide search for loci associated with the upper versus the lower 5th percentiles of
Berndt, Sonja I; Gustafsson, Stefan; Mägi, Reedik; Ganna, Andrea; Wheeler, Eleanor; Feitosa, Mary F; Justice, Anne E; Monda, Keri L; Croteau-Chonka, Damien C; Day, Felix R; Esko, Tõnu; Fall, Tove; Ferreira, Teresa; Gentilini, Davide; Jackson, Anne U; Luan, Jian'an; Randall, Joshua C; Vedantam, Sailaja; Willer, Cristen J; Winkler, Thomas W; Wood, Andrew R; Workalemahu, Tsegaselassie; Hu, Yi-Juan; Lee, Sang Hong; Liang, Liming; Lin, Dan-Yu; Min, Josine L; Neale, Benjamin M; Thorleifsson, Gudmar; Yang, Jian; Albrecht, Eva; Amin, Najaf; Bragg-Gresham, Jennifer L; Cadby, Gemma; den Heijer, Martin; Eklund, Niina; Fischer, Krista; Goel, Anuj; Hottenga, Jouke-Jan; Huffman, Jennifer E; Jarick, Ivonne; Johansson, Åsa; Johnson, Toby; Kanoni, Stavroula; Kleber, Marcus E; König, Inke R; Kristiansson, Kati; Kutalik, Zoltán; Lamina, Claudia; Lecoeur, Cecile; Li, Guo; Mangino, Massimo; McArdle, Wendy L; Medina-Gomez, Carolina; Müller-Nurasyid, Martina; Ngwa, Julius S; Nolte, Ilja M; Paternoster, Lavinia; Pechlivanis, Sonali; Perola, Markus; Peters, Marjolein J; Preuss, Michael; Rose, Lynda M; Shi, Jianxin; Shungin, Dmitry; Smith, Albert Vernon; Strawbridge, Rona J; Surakka, Ida; Teumer, Alexander; Trip, Mieke D; Tyrer, Jonathan; Van Vliet-Ostaptchouk, Jana V; Vandenput, Liesbeth; Waite, Lindsay L; Zhao, Jing Hua; Absher, Devin; Asselbergs, Folkert W; Atalay, Mustafa; Attwood, Antony P; Balmforth, Anthony J; Basart, Hanneke; Beilby, John; Bonnycastle, Lori L; Brambilla, Paolo; Bruinenberg, Marcel; Campbell, Harry; Chasman, Daniel I; Chines, Peter S; Collins, Francis S; Connell, John M; Cookson, William O; de Faire, Ulf; de Vegt, Femmie; Dei, Mariano; Dimitriou, Maria; Edkins, Sarah; Estrada, Karol; Evans, David M; Farrall, Martin; Ferrario, Marco M; Ferrières, Jean; Franke, Lude; Frau, Francesca; Gejman, Pablo V; Grallert, Harald; Grönberg, Henrik; Gudnason, Vilmundur; Hall, Alistair S; Hall, Per; Hartikainen, Anna-Liisa; Hayward, Caroline; Heard-Costa, Nancy L; Heath, Andrew C; Hebebrand, Johannes; Homuth, Georg; Hu, Frank B; Hunt, Sarah E; Hyppönen, Elina; Iribarren, Carlos; Jacobs, Kevin B; Jansson, John-Olov; Jula, Antti; Kähönen, Mika; Kathiresan, Sekar; Kee, Frank; Khaw, Kay-Tee; Kivimäki, Mika; Koenig, Wolfgang; Kraja, Aldi T; Kumari, Meena; Kuulasmaa, Kari; Kuusisto, Johanna; Laitinen, Jaana H; Lakka, Timo A; Langenberg, Claudia; Launer, Lenore J; Lind, Lars; Lindström, Jaana; Liu, Jianjun; Liuzzi, Antonio; Lokki, Marja-Liisa; Lorentzon, Mattias; Madden, Pamela A; Magnusson, Patrik K; Manunta, Paolo; Marek, Diana; März, Winfried; Mateo Leach, Irene; McKnight, Barbara; Medland, Sarah E; Mihailov, Evelin; Milani, Lili; Montgomery, Grant W; Mooser, Vincent; Mühleisen, Thomas W; Munroe, Patricia B; Musk, Arthur W; Narisu, Narisu; Navis, Gerjan; Nicholson, George; Nohr, Ellen A; Ong, Ken K; Oostra, Ben A; Palmer, Colin N A; Palotie, Aarno; Peden, John F; Pedersen, Nancy; Peters, Annette; Polasek, Ozren; Pouta, Anneli; Pramstaller, Peter P; Prokopenko, Inga; Pütter, Carolin; Radhakrishnan, Aparna; Raitakari, Olli; Rendon, Augusto; Rivadeneira, Fernando; Rudan, Igor; Saaristo, Timo E; Sambrook, Jennifer G; Sanders, Alan R; Sanna, Serena; Saramies, Jouko; Schipf, Sabine; Schreiber, Stefan; Schunkert, Heribert; Shin, So-Youn; Signorini, Stefano; Sinisalo, Juha; Skrobek, Boris; Soranzo, Nicole; Stančáková, Alena; Stark, Klaus; Stephens, Jonathan C; Stirrups, Kathleen; Stolk, Ronald P; Stumvoll, Michael; Swift, Amy J; Theodoraki, Eirini V; Thorand, Barbara; Tregouet, David-Alexandre; Tremoli, Elena; Van der Klauw, Melanie M; van Meurs, Joyce B J; Vermeulen, Sita H; Viikari, Jorma; Virtamo, Jarmo; Vitart, Veronique; Waeber, Gérard; Wang, Zhaoming; Widén, Elisabeth; Wild, Sarah H; Willemsen, Gonneke; Winkelmann, Bernhard R; Witteman, Jacqueline C M; Wolffenbuttel, Bruce H R; Wong, Andrew; Wright, Alan F; Zillikens, M Carola; Amouyel, Philippe; Boehm, Bernhard O; Boerwinkle, Eric; Boomsma, Dorret I; Caulfield, Mark J; Chanock, Stephen J; Cupples, L Adrienne; Cusi, Daniele; Dedoussis, George V; Erdmann, Jeanette; Eriksson, Johan G; Franks, Paul W; Froguel, Philippe; Gieger, Christian; Gyllensten, Ulf; Hamsten, Anders; Harris, Tamara B; Hengstenberg, Christian; Hicks, Andrew A; Hingorani, Aroon; Hinney, Anke; Hofman, Albert; Hovingh, Kees G; Hveem, Kristian; Illig, Thomas; Jarvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Keinanen-Kiukaanniemi, Sirkka M; Kiemeney, Lambertus A; Kuh, Diana; Laakso, Markku; Lehtimäki, Terho; Levinson, Douglas F; Martin, Nicholas G; Metspalu, Andres; Morris, Andrew D; Nieminen, Markku S; Njølstad, Inger; Ohlsson, Claes; Oldehinkel, Albertine J; Ouwehand, Willem H; Palmer, Lyle J; Penninx, Brenda; Power, Chris; Province, Michael A; Psaty, Bruce M; Qi, Lu; Rauramaa, Rainer; Ridker, Paul M; Ripatti, Samuli; Salomaa, Veikko; Samani, Nilesh J; Snieder, Harold; Sørensen, Thorkild I A; Spector, Timothy D; Stefansson, Kari; Tönjes, Anke; Tuomilehto, Jaakko; Uitterlinden, André G; Uusitupa, Matti; van der Harst, Pim; Vollenweider, Peter; Wallaschofski, Henri; Wareham, Nicholas J; Watkins, Hugh; Wichmann, H-Erich; Wilson, James F; Abecasis, Goncalo R; Assimes, Themistocles L; Barroso, Inês; Boehnke, Michael; Borecki, Ingrid B; Deloukas, Panos; Fox, Caroline S; Frayling, Timothy; Groop, Leif C; Haritunian, Talin; Heid, Iris M; Hunter, David; Kaplan, Robert C; Karpe, Fredrik; Moffatt, Miriam F; Mohlke, Karen L; O'Connell, Jeffrey R; Pawitan, Yudi; Schadt, Eric E; Schlessinger, David; Steinthorsdottir, Valgerdur; Strachan, David P; Thorsteinsdottir, Unnur; van Duijn, Cornelia M; Visscher, Peter M; Di Blasio, Anna Maria; Hirschhorn, Joel N; Lindgren, Cecilia M; Morris, Andrew P; Meyre, David; Scherag, André; McCarthy, Mark I; Speliotes, Elizabeth K; North, Kari E; Loos, Ruth J F; Ingelsson, Erik
Approaches exploiting trait distribution extremes may be used to identify loci associated with common traits, but it is unknown whether these loci are generalizable to the broader population. In a genome-wide search for loci associated with the upper versus the lower 5th percentiles of body mass
Berndt, Sonja I.; Gustafsson, Stefan; Mägi, Reedik; Ganna, Andrea; Wheeler, Eleanor; Feitosa, Mary F.; Justice, Anne E.; Monda, Keri L.; Croteau-Chonka, Damien C.; Day, Felix R.; Esko, Tõnu; Fall, Tove; Ferreira, Teresa; Gentilini, Davide; Jackson, Anne U.; Luan, Jian'an; Randall, Joshua C.; Vedantam, Sailaja; Willer, Cristen J.; Winkler, Thomas W.; Wood, Andrew R.; Workalemahu, Tsegaselassie; Hu, Yi-Juan; Lee, Sang Hong; Liang, Liming; Lin, Dan-Yu; Min, Josine L.; Neale, Benjamin M.; Thorleifsson, Gudmar; Yang, Jian; Albrecht, Eva; Amin, Najaf; Bragg-Gresham, Jennifer L.; Cadby, Gemma; den Heijer, Martin; Eklund, Niina; Fischer, Krista; Goel, Anuj; Hottenga, Jouke-Jan; Huffman, Jennifer E.; Jarick, Ivonne; Johansson, Asa; Johnson, Toby; Kanoni, Stavroula; Kleber, Marcus E.; König, Inke R.; Kristiansson, Kati; Kutalik, Zoltán; Lamina, Claudia; Lecoeur, Cecile; Li, Guo; Mangino, Massimo; McArdle, Wendy L.; Medina-Gomez, Carolina; Müller-Nurasyid, Martina; Ngwa, Julius S.; Nolte, Ilja M.; Paternoster, Lavinia; Pechlivanis, Sonali; Perola, Markus; Peters, Marjolein J.; Preuss, Michael; Rose, Lynda M.; Shi, Jianxin; Shungin, Dmitry; Smith, Albert Vernon; Strawbridge, Rona J.; Surakka, Ida; Teumer, Alexander; Trip, Mieke D.; Tyrer, Jonathan; van Vliet-Ostaptchouk, Jana V.; Vandenput, Liesbeth; Waite, Lindsay L.; Zhao, Jing Hua; Absher, Devin; Asselbergs, Folkert W.; Atalay, Mustafa; Attwood, Antony P.; Balmforth, Anthony J.; Basart, Hanneke; Beilby, John; Bonnycastle, Lori L.; Brambilla, Paolo; Bruinenberg, Marcel; Campbell, Harry; Chasman, Daniel I.; Chines, Peter S.; Collins, Francis S.; Connell, John M.; Cookson, William O.; de Faire, Ulf; de Vegt, Femmie; dei, Mariano; Dimitriou, Maria; Edkins, Sarah; Estrada, Karol; Evans, David M.; Farrall, Martin; Ferrario, Marco M.; Ferrières, Jean; Franke, Lude; Frau, Francesca; Gejman, Pablo V.; Grallert, Harald; Grönberg, Henrik; Gudnason, Vilmundur; Hall, Alistair S.; Hall, Per; Hartikainen, Anna-Liisa; Hayward, Caroline; Heard-Costa, Nancy L.; Heath, Andrew C.; Hebebrand, Johannes; Homuth, Georg; Hu, Frank B.; Hunt, Sarah E.; Hyppönen, Elina; Iribarren, Carlos; Jacobs, Kevin B.; Jansson, John-Olov; Jula, Antti; Kähönen, Mika; Kathiresan, Sekar; Kee, Frank; Khaw, Kay-Tee; Kivimäki, Mika; Koenig, Wolfgang; Kraja, Aldi T.; Kumari, Meena; Kuulasmaa, Kari; Kuusisto, Johanna; Laitinen, Jaana H.; Lakka, Timo A.; Langenberg, Claudia; Launer, Lenore J.; Lind, Lars; Lindström, Jaana; Liu, Jianjun; Liuzzi, Antonio; Lokki, Marja-Liisa; Lorentzon, Mattias; Madden, Pamela A.; Magnusson, Patrik K.; Manunta, Paolo; Marek, Diana; März, Winfried; Mateo Leach, Irene; McKnight, Barbara; Medland, Sarah E.; Mihailov, Evelin; Milani, Lili; Montgomery, Grant W.; Mooser, Vincent; Mühleisen, Thomas W.; Munroe, Patricia B.; Musk, Arthur W.; Narisu, Narisu; Navis, Gerjan; Nicholson, George; Nohr, Ellen A.; Ong, Ken K.; Oostra, Ben A.; Palmer, Colin N. A.; Palotie, Aarno; Peden, John F.; Pedersen, Nancy; Peters, Annette; Polasek, Ozren; Pouta, Anneli; Pramstaller, Peter P.; Prokopenko, Inga; Pütter, Carolin; Radhakrishnan, Aparna; Raitakari, Olli; Rendon, Augusto; Rivadeneira, Fernando; Rudan, Igor; Saaristo, Timo E.; Sambrook, Jennifer G.; Sanders, Alan R.; Sanna, Serena; Saramies, Jouko; Schipf, Sabine; Schreiber, Stefan; Schunkert, Heribert; Shin, So-Youn; Signorini, Stefano; Sinisalo, Juha; Skrobek, Boris; Soranzo, Nicole; Stančáková, Alena; Stark, Klaus; Stephens, Jonathan C.; Stirrups, Kathleen; Stolk, Ronald P.; Stumvoll, Michael; Swift, Amy J.; Theodoraki, Eirini V.; Thorand, Barbara; Tregouet, David-Alexandre; Tremoli, Elena; van der Klauw, Melanie M.; van Meurs, Joyce B. J.; Vermeulen, Sita H.; Viikari, Jorma; Virtamo, Jarmo; Vitart, Veronique; Waeber, Gérard; Wang, Zhaoming; Widén, Elisabeth; Wild, Sarah H.; Willemsen, Gonneke; Winkelmann, Bernhard R.; Witteman, Jacqueline C. M.; Wolffenbuttel, Bruce H. R.; Wong, Andrew; Wright, Alan F.; Zillikens, M. Carola; Amouyel, Philippe; Boehm, Bernhard O.; Boerwinkle, Eric; Boomsma, Dorret I.; Caulfield, Mark J.; Chanock, Stephen J.; Cupples, L. Adrienne; Cusi, Daniele; Dedoussis, George V.; Erdmann, Jeanette; Eriksson, Johan G.; Franks, Paul W.; Froguel, Philippe; Gieger, Christian; Gyllensten, Ulf; Hamsten, Anders; Harris, Tamara B.; Hengstenberg, Christian; Hicks, Andrew A.; Hingorani, Aroon; Hinney, Anke; Hofman, Albert; Hovingh, Kees G.; Hveem, Kristian; Illig, Thomas; Jarvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Keinanen-Kiukaanniemi, Sirkka M.; Kiemeney, Lambertus A.; Kuh, Diana; Laakso, Markku; Lehtimäki, Terho; Levinson, Douglas F.; Martin, Nicholas G.; Metspalu, Andres; Morris, Andrew D.; Nieminen, Markku S.; Njølstad, Inger; Ohlsson, Claes; Oldehinkel, Albertine J.; Ouwehand, Willem H.; Palmer, Lyle J.; Penninx, Brenda; Power, Chris; Province, Michael A.; Psaty, Bruce M.; Qi, Lu; Rauramaa, Rainer; Ridker, Paul M.; Ripatti, Samuli; Salomaa, Veikko; Samani, Nilesh J.; Snieder, Harold; Sørensen, Thorkild I. A.; Spector, Timothy D.; Stefansson, Kari; Tönjes, Anke; Tuomilehto, Jaakko; Uitterlinden, André G.; Uusitupa, Matti; van der Harst, Pim; Vollenweider, Peter; Wallaschofski, Henri; Wareham, Nicholas J.; Watkins, Hugh; Wichmann, H.-Erich; Wilson, James F.; Abecasis, Goncalo R.; Assimes, Themistocles L.; Barroso, Inês; Boehnke, Michael; Borecki, Ingrid B.; Deloukas, Panos; Fox, Caroline S.; Frayling, Timothy; Groop, Leif C.; Haritunian, Talin; Heid, Iris M.; Hunter, David; Kaplan, Robert C.; Karpe, Fredrik; Moffatt, Miriam F.; Mohlke, Karen L.; O'Connell, Jeffrey R.; Pawitan, Yudi; Schadt, Eric E.; Schlessinger, David; Steinthorsdottir, Valgerdur; Strachan, David P.; Thorsteinsdottir, Unnur; van Duijn, Cornelia M.; Visscher, Peter M.; Di Blasio, Anna Maria; Hirschhorn, Joel N.; Lindgren, Cecilia M.; Morris, Andrew P.; Meyre, David; Scherag, André; McCarthy, Mark I.; Speliotes, Elizabeth K.; North, Kari E.; Loos, Ruth J. F.; Ingelsson, Erik
Approaches exploiting trait distribution extremes may be used to identify loci associated with common traits, but it is unknown whether these loci are generalizable to the broader population. In a genome-wide search for loci associated with the upper versus the lower 5th percentiles of body mass
Bramon, Elvira; Pirinen, Matti; Strange, Amy; Lin, Kuang; Freeman, Colin; Bellenguez, Céline; Su, Zhan; Band, Gavin; Pearson, Richard; Vukcevic, Damjan; Langford, Cordelia; Deloukas, Panos; Hunt, Sarah; Gray, Emma; Dronov, Serge; Potter, Simon C.; Tashakkori-Ghanbaria, Avazeh; Edkins, Sarah; Bumpstead, Suzannah J.; Arranz, Maria J.; Bakker, Steven; Bender, Stephan; Bruggeman, Richard; Cahn, Wiepke; Chandler, David; Collier, David A.; Crespo-Facorro, Benedicto; Dazzan, Paola; de Haan, Lieuwe; Di Forti, Marta; Dragović, Milan; Giegling, Ina; Hall, Jeremy; Iyegbe, Conrad; Jablensky, Assen; Kahn, René S.; Kalaydjieva, Luba; Kravariti, Eugenia; Lawrie, Stephen; Linszen, Don H.; Mata, Ignacio; McDonald, Colm; McIntosh, Andrew; Myin-Germeys, Inez; Ophoff, Roel A.; Pariante, Carmine M.; Paunio, Tiina; Picchioni, Marco; Ripke, Stephan; Rujescu, Dan
Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories. 1239 cases with schizophrenia, schizoaffective disorder, or psychotic
Bramon, Elvira; Pirinen, Matti; Strange, Amy; Lin, Kuang; Freeman, Colin; Bellenguez, Celine; Su, Zhan; Band, Gavin; Pearson, Richard; Vukcevic, Damjan; Langford, Cordelia; Deloukas, Panos; Hunt, Sarah; Gray, Emma; Dronov, Serge; Potter, Simon C.; Tashakkori-Ghanbaria, Avazeh; Edkins, Sarah; Bumpstead, Suzannah J.; Arranz, Maria J.; Bakker, Steven; Bender, Stephan; Bruggeman, Richard; Cahn, Wiepke; Chandler, David; Collier, David A.; Crespo-Facorro, Benedicto; Dazzan, Paola; de Haan, Lieuwe; di Forti, Marta; Dragovic, Milan; Giegling, Ina; Hall, Jeremy; Iyegbe, Conrad; Jablensky, Assen; Kahn, Rene S.; Kalaydjieva, Luba; Kravariti, Eugenia; Lawrie, Stephen; Lins-Zen, Don H.; Mata, Ignacio; McDonald, Colm; McIntosh, Andrew; Myin-Germeys, Inez; Ophoff, Roel A.; Pariante, Carmine M.; Paunio, Tiina; Picchioni, Marco; Ripke, Stephan; Wiersma, Durk
Background: Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories. Methods: 1239 cases with schizophrenia, schizoaffective
Imamura, Minako; Takahashi, Atsushi; Yamauchi, Toshimasa
Genome-wide association studies (GWAS) have identified more than 80 susceptibility loci for type 2 diabetes (T2D), but most of its heritability still remains to be elucidated. In this study, we conducted a meta-analysis of GWAS for T2D in the Japanese population. Combined data from discovery and ...
Berndt, Sonja I; Gustafsson, Stefan; Mägi, Reedik
Approaches exploiting trait distribution extremes may be used to identify loci associated with common traits, but it is unknown whether these loci are generalizable to the broader population. In a genome-wide search for loci associated with the upper versus the lower 5th percentiles of body mass ...
Tönjes, Anke; Scholz, Markus; Krüger, Jacqueline; Krause, Kerstin; Schleinitz, Dorit; Kirsten, Holger; Gebhardt, Claudia; Marzi, Carola; Grallert, Harald; Ladenvall, Claes; Heyne, Henrike; Laurila, Esa; Kriebel, Jennifer; Meisinger, Christa; Rathmann, Wolfgang; Gieger, Christian; Groop, Leif; Prokopenko, Inga; Isomaa, Bo; Beutner, Frank; Kratzsch, Jürgen; Fischer-Rosinsky, Antje; Pfeiffer, Andreas; Krohn, Knut; Spranger, Joachim; Thiery, Joachim; Blüher, Matthias; Stumvoll, Michael; Kovacs, Peter
Progranulin is a secreted protein with important functions in processes including immune and inflammatory response, metabolism and embryonic development. The present study aimed at identification of genetic factors determining progranulin concentrations. We conducted a genome-wide association meta-analysis for serum progranulin in three independent cohorts from Europe: Sorbs (N = 848) and KORA (N = 1628) from Germany and PPP-Botnia (N = 335) from Finland (total N = 2811). Single nucleotide polymorphisms (SNPs) associated with progranulin levels were replicated in two additional German cohorts: LIFE-Heart Study (Leipzig; N = 967) and Metabolic Syndrome Berlin Potsdam (Berlin cohort; N = 833). We measured mRNA expression of genes in peripheral blood mononuclear cells (PBMC) by micro-arrays and performed mRNA expression quantitative trait and expression-progranulin association studies to functionally substantiate identified loci. Finally, we conducted siRNA silencing experiments in vitro to validate potential candidate genes within the associated loci. Heritability of circulating progranulin levels was estimated at 31.8% and 26.1% in the Sorbs and LIFE-Heart cohort, respectively. SNPs at three loci reached study-wide significance (rs660240 in CELSR2-PSRC1-MYBPHL-SORT1, rs4747197 in CDH23-PSAP and rs5848 in GRN) explaining 19.4%/15.0% of the variance and 61%/57% of total heritability in the Sorbs/LIFE-Heart Study. The strongest evidence for association was at rs660240 (P = 5.75 × 10-50), which was also associated with mRNA expression of PSRC1 in PBMC (P = 1.51 × 10-21). Psrc1 knockdown in murine preadipocytes led to a consecutive 30% reduction in progranulin secretion. In conclusion, the present meta-GWAS combined with mRNA expression identified three loci associated with progranulin and supports the role of PSRC1 in the regulation of progranulin secretion. © The Author(s) 2017. Published by Oxford University Press. All rights
Gog, Julia R; Lever, Andrew M L; Skittrall, Jordan P
We present a fast, robust and parsimonious approach to detecting signals in an ordered sequence of numbers. Our motivation is in seeking a suitable method to take a sequence of scores corresponding to properties of positions in virus genomes, and find outlying regions of low scores. Suitable statistical methods without using complex models or making many assumptions are surprisingly lacking. We resolve this by developing a method that detects regions of low score within sequences of real numbers. The method makes no assumptions a priori about the length of such a region; it gives the explicit location of the region and scores it statistically. It does not use detailed mechanistic models so the method is fast and will be useful in a wide range of applications. We present our approach in detail, and test it on simulated sequences. We show that it is robust to a wide range of signal morphologies, and that it is able to capture multiple signals in the same sequence. Finally we apply it to viral genomic data to identify regions of evolutionary conservation within influenza and rotavirus.
C.E. Elks (Cathy); J.R.B. Perry (John); P. Sulem (Patrick); D.I. Chasman (Daniel); N. Franceschini (Nora); C. He (Chunyan); K.L. Lunetta (Kathryn); J.A. Visser (Jenny); E.M. Byrne (Enda); D.L. Cousminer (Diana); D.F. Gudbjartsson (Daniel); T. Esko (Tõnu); B. Feenstra (Bjarke); J.J. Hottenga (Jouke Jan); D.L. Koller (Daniel); Z. Kutalik (Zoltán); P. Lin (Peng); M. Mangino (Massimo); M. Marongiu (Mara); P.F. McArdle (Patrick); A.V. Smith (Albert Vernon); L. Stolk (Lisette); S. van Wingerden (Sophie); J.H. Zhao (Jing Hua); E. Albrecht (Eva); T. Corre (Tanguy); E. Ingelsson (Erik); C. Hayward (Caroline); P.K. Magnusson (Patrik); S. Ulivi (Shelia); N.M. Warrington (Nicole); L. Zgaga (Lina); H. Alavere (Helene); N. Amin (Najaf); T. Aspelund (Thor); S. Bandinelli (Stefania); I.E. Barroso (Inês); G. Berenson (Gerald); S.M. Bergmann (Sven); H. Blackburn (Hannah); E.A. Boerwinkle (Eric); J.E. Buring (Julie); F. Busonero; H. Campbell (Harry); S.J. Chanock (Stephen); W. Chen (Wei); M. Cornelis (Marilyn); D.J. Couper (David); A.D. Coviello (Andrea); P. d' Adamo (Pio); U. de Faire (Ulf); E.J.C. de Geus (Eco); P. Deloukas (Panagiotis); A. Döring (Angela); D.F. Easton (Douglas); G. Eiriksdottir (Gudny); V. Emilsson (Valur); J.G. Eriksson (Johan); L. Ferrucci (Luigi); A.R. Folsom (Aaron); T. Foroud (Tatiana); M. Garcia (Melissa); P. Gasparini (Paolo); F. Geller (Frank); C. Gieger (Christian); V. Gudnason (Vilmundur); A.S. Hall (Alistair); S.E. Hankinson (Susan); L. Ferreli (Liana); A.C. Heath (Andrew); D.G. Hernandez (Dena); A. Hofman (Albert); F.B. Hu (Frank); T. Illig (Thomas); M.R. Järvelin; A.D. Johnson (Andrew); D. Karasik (David); K-T. Khaw (Kay-Tee); D.P. Kiel (Douglas); T.O. Kilpelänen (Tuomas); I. Kolcic (Ivana); P. Kraft (Peter); L.J. Launer (Lenore); J.S.E. Laven (Joop); S. Li (Shengxu); J. Liu (Jianjun); D. Levy (Daniel); N.G. Martin (Nicholas); M. Melbye (Mads); V. Mooser (Vincent); J.C. Murray (Jeffrey); M.A. Nalls (Michael); P. Navarro (Pau); M. Nelis (Mari); A.R. Ness (Andrew); K. Northstone (Kate); B.A. Oostra (Ben); M. Peacock (Munro); C. Palmer (Cameron); A. Palotie (Aarno); G. Paré (Guillaume); A.N. Parker (Alex); N.L. Pedersen (Nancy); L. Peltonen (Leena Johanna); C.E. Pennell (Craig); P.D.P. Pharoah (Paul); O. Polasek (Ozren); A.S. Plump (Andrew); A. Pouta (Anneli); E. Porcu (Eleonora); T. Rafnar (Thorunn); J.P. Rice (John); S.M. Ring (Susan); F. Rivadeneira Ramirez (Fernando); I. Rudan (Igor); C. Sala (Cinzia); V. Salomaa (Veikko); S. Sanna (Serena); D. Schlessinger; N.J. Schork (Nicholas); A. Scuteri (Angelo); A.V. Segrè (Ayellet); A.R. Shuldiner (Alan); N. Soranzo (Nicole); U. Sovio (Ulla); S.R. Srinivasan (Sathanur); D.P. Strachan (David); M.L. Tammesoo; E. Tikkanen (Emmi); D. Toniolo (Daniela); K. Tsui (Kim); L. Tryggvadottir (Laufey); J.P. Tyrer (Jonathan); M. Uda (Manuela); R.M. van Dam (Rob); J.B.J. van Meurs (Joyce); P. Vollenweider (Peter); G. Waeber (Gérard); N.J. Wareham (Nick); D. Waterworth (Dawn); H.E. Wichmann (Heinz Erich); G.A.H.M. Willemsen (Gonneke); J.F. Wilson (James); A.F. Wright (Alan); L. Young (Lauren); G. Zhai (Guangju); W.V. Zhuang; L.J. Bierut (Laura); D.I. Boomsma (Dorret); H.A. Boyd (Heather); L. Crisponi (Laura); E.W. Demerath (Ellen); P. Tikka-Kleemola (Päivi); M.J. Econs (Michael); T.B. Harris (Tamara); D. Hunter (David); R.J.F. Loos (Ruth); A. Metspalu (Andres); G.W. Montgomery (Grant); P.M. Ridker (Paul); T.D. Spector (Tim); E.A. Streeten (Elizabeth); K. Stefansson (Kari); U. Thorsteinsdottir (Unnur); A.G. Uitterlinden (André); E. Widen (Elisabeth); J. Murabito (Joanne); K. Ong (Ken); M.N. Weedon (Michael)
textabstractTo identify loci for age at menarche, we performed a meta-analysis of 32 genome-wide association studies in 87,802 women of European descent, with replication in up to 14,731 women. In addition to the known loci at LIN28B (P = 5.4 × 10 -60) and 9q31.2 (P = 2.2 × 10 -33), we identified 30
Full Text Available Abstract Background We have used the genomic data in the Integrated Microbial Genomes system of the Department of Energy’s Joint Genome Institute to make predictions about rhizobial open reading frames that play a role in nodulation of host plants. The genomic data was screened by searching for ORFs conserved in α-proteobacterial rhizobia, but not conserved in closely-related non-nitrogen-fixing α-proteobacteria. Results Using this approach, we identified many genes known to be involved in nodulation or nitrogen fixation, as well as several new candidate genes. We knocked out selected new genes and assayed for the presence of nodulation phenotypes and/or nodule-specific expression. One of these genes, SMc00911, is strongly expressed by bacterial cells within host plant nodules, but is expressed minimally by free-living bacterial cells. A strain carrying an insertion mutation in SMc00911 is not defective in the symbiosis with host plants, but in contrast to expectations, this mutant strain is able to out-compete the S. meliloti 1021 wild type strain for nodule occupancy in co-inoculation experiments. The SMc00911 ORF is predicted to encode a “SodM-like” (superoxide dismutase-like protein containing a rhodanese sulfurtransferase domain at the N-terminus and a chromate-resistance superfamily domain at the C-terminus. Several other ORFs (SMb20360, SMc01562, SMc01266, SMc03964, and the SMc01424-22 operon identified in the screen are expressed at a moderate level by bacteria within nodules, but not by free-living bacteria. Conclusions Based on the analysis of ORFs identified in this study, we conclude that this comparative genomics approach can identify rhizobial genes involved in the nitrogen-fixing symbiosis with host plants, although none of the newly identified genes were found to be essential for this process.
Delgado, Dayana A; Zhang, Chenan; Chen, Lin S; Gao, Jianjun; Roy, Shantanu; Shinkle, Justin; Sabarinathan, Mekala; Argos, Maria; Tong, Lin; Ahmed, Alauddin; Islam, Tariqul; Rakibuz-Zaman, Muhammad; Sarwar, Golam; Shahriar, Hasan; Rahman, Mahfuzar; Yunus, Mohammad; Jasmine, Farzana; Kibriya, Muhammad G; Ahsan, Habibul; Pierce, Brandon L
Leucocyte telomere length (TL) is a potential biomarker of ageing and risk for age-related disease. Leucocyte TL is heritable and shows substantial differences by race/ethnicity. Recent genome-wide association studies (GWAS) report ~10 loci harbouring SNPs associated with leucocyte TL, but these studies focus primarily on populations of European ancestry. This study aims to enhance our understanding of genetic determinants of TL across populations. We performed a GWAS of TL using data on 5075 Bangladeshi adults. We measured TL using one of two technologies (qPCR or a Luminex-based method) and used standardised variables as TL phenotypes. Our results replicate previously reported associations in the TERC and TERT regions (P=2.2×10 -8 and P=6.4×10 -6 , respectively). We observed a novel association signal in the RTEL1 gene (intronic SNP rs2297439; P=2.82×10 -7 ) that is independent of previously reported TL-associated SNPs in this region. The minor allele for rs2297439 is common in South Asian populations (≥0.25) but at lower frequencies in other populations (eg, 0.07 in Northern Europeans). Among the eight other previously reported association signals, all were directionally consistent with our study, but only rs8105767 ( ZNF208 ) was nominally significant (P=0.003). SNP-based heritability estimates were as high as 44% when analysing close relatives but much lower when analysing distant relatives only. In this first GWAS of TL in a South Asian population, we replicate some, but not all, of the loci reported in prior GWAS of individuals of European ancestry, and we identify a novel second association signal at the RTEL1 locus. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Champeimont, Raphaël; Laine, Elodie; Hu, Shuang-Wei; Penin, Francois; Carbone, Alessandra
A novel computational approach of coevolution analysis allowed us to reconstruct the protein-protein interaction network of the Hepatitis C Virus (HCV) at the residue resolution. For the first time, coevolution analysis of an entire viral genome was realized, based on a limited set of protein sequences with high sequence identity within genotypes. The identified coevolving residues constitute highly relevant predictions of protein-protein interactions for further experimental identification of HCV protein complexes. The method can be used to analyse other viral genomes and to predict the associated protein interaction networks.
Full Text Available ALK-break positive non-small cell lung cancer (NSCLC patients initially respond to crizotinib, but resistance occurs inevitably. In this study we aimed to identify fusion genes in crizotinib resistant tumor samples. Re-biopsies of three patients were subjected to paired-end RNA sequencing to identify fusion genes using deFuse and EricScript. The IGV browser was used to determine presence of known resistance-associated mutations. Sanger sequencing was used to validate fusion genes and digital droplet PCR to validate mutations. ALK fusion genes were detected in all three patients with EML4 being the fusion partner. One patient had no additional fusion genes. Another patient had one additional fusion gene, but without a predicted open reading frame (ORF. The third patient had three additional fusion genes, of which two were derived from the same chromosomal region as the EML4-ALK. A predicted ORF was identified only in the CLIP4-VSNL1 fusion product. The fusion genes validated in the post-treatment sample were also present in the biopsy before crizotinib. ALK mutations (p.C1156Y and p.G1269A detected in the re-biopsies of two patients, were not detected in pre-treatment biopsies. In conclusion, fusion genes identified in our study are unlikely to be involved in crizotinib resistance based on presence in pre-treatment biopsies. The detection of ALK mutations in post-treatment tumor samples of two patients underlines their role in crizotinib resistance.
Tran Thi Tuyet, H.; Zwart, M.P.; Phuong, N.T.; Oanh, D.T.H.; Jong, de M.C.M.; Vlak, J.M.
Sequence comparisons of the genomes of white spot syndrome virus (WSSV) strains have identified regions containing variable-length insertions/deletions (i.e. indels). Indel-I and Indel-II, positioned between open reading frames (ORFs) 14/15 and 23/24, respectively, are the largest and the most
Tabassum, Rubina; Chauhan, Ganesh; Dwivedi, Om Prakash; Mahajan, Anubha; Jaiswal, Alok; Kaur, Ismeet; Bandesh, Khushdeep; Singh, Tejbir; Mathai, Benan John; Pandey, Yogesh; Chidambaram, Manickam; Sharma, Amitabh; Chavali, Sreenivas; Sengupta, Shantanu; Ramakrishnan, Lakshmi; Venkatesh, Pradeep; Aggarwal, Sanjay K; Ghosh, Saurabh; Prabhakaran, Dorairaj; Srinath, Reddy K; Saxena, Madhukar; Banerjee, Monisha; Mathur, Sandeep; Bhansali, Anil; Shah, Viral N; Madhu, Sri Venkata; Marwaha, Raman K; Basu, Analabha; Scaria, Vinod; McCarthy, Mark I; Venkatesan, Radha; Mohan, Viswanathan; Tandon, Nikhil; Bharadwaj, Dwaipayan
Indians undergoing socioeconomic and lifestyle transitions will be maximally affected by epidemic of type 2 diabetes (T2D). We conducted a two-stage genome-wide association study of T2D in 12,535 Indians, a less explored but high-risk group. We identified a new type 2 diabetes-associated locus at 2q21, with the lead signal being rs6723108 (odds ratio 1.31; P = 3.32 × 10⁻⁹). Imputation analysis refined the signal to rs998451 (odds ratio 1.56; P = 6.3 × 10⁻¹²) within TMEM163 that encodes a probable vesicular transporter in nerve terminals. TMEM163 variants also showed association with decreased fasting plasma insulin and homeostatic model assessment of insulin resistance, indicating a plausible effect through impaired insulin secretion. The 2q21 region also harbors RAB3GAP1 and ACMSD; those are involved in neurologic disorders. Forty-nine of 56 previously reported signals showed consistency in direction with similar effect sizes in Indians and previous studies, and 25 of them were also associated (P < 0.05). Known loci and the newly identified 2q21 locus altogether explained 7.65% variance in the risk of T2D in Indians. Our study suggests that common susceptibility variants for T2D are largely the same across populations, but also reveals a population-specific locus and provides further insights into genetic architecture and etiology of T2D.
Bryant Susan V
Full Text Available Abstract Background The basis of genome size variation remains an outstanding question because DNA sequence data are lacking for organisms with large genomes. Sixteen BAC clones from the Mexican axolotl (Ambystoma mexicanum: c-value = 32 × 109 bp were isolated and sequenced to characterize the structure of genic regions. Results Annotation of genes within BACs showed that axolotl introns are on average 10× longer than orthologous vertebrate introns and they are predicted to contain more functional elements, including miRNAs and snoRNAs. Loci were discovered within BACs for two novel EST transcripts that are differentially expressed during spinal cord regeneration and skin metamorphosis. Unexpectedly, a third novel gene was also discovered while manually annotating BACs. Analysis of human-axolotl protein-coding sequences suggests there are 2% more lineage specific genes in the axolotl genome than the human genome, but the great majority (86% of genes between axolotl and human are predicted to be 1:1 orthologs. Considering that axolotl genes are on average 5× larger than human genes, the genic component of the salamander genome is estimated to be incredibly large, approximately 2.8 gigabases! Conclusion This study shows that a large salamander genome has a correspondingly large genic component, primarily because genes have incredibly long introns. These intronic sequences may harbor novel coding and non-coding sequences that regulate biological processes that are unique to salamanders.
Jacqueline Zoe-Munn Chan
Full Text Available Two bacteriophages, RPP1 and RLP1, infecting members of the marine Roseobacter clade were isolated from seawater. Their linear genomes are 74.7 and 74.6 kb and encode 91 and 92 coding DNA sequences, respectively. Around 30% of these are homologous to genes found in Enterobacter phage N4. Comparative genomics of these two new Roseobacter phages and twenty-three other sequenced N4-like phages (three infecting members of the Roseobacter lineage and twenty infecting other Gammaproteobacteria revealed that N4-like phages share a core genome of 14 genes responsible for control of gene expression, replication and virion proteins. Phylogenetic analysis of these genes placed the five N4-like roseophages (RN4 into a distinct subclade. Analysis of the RN4 phage genomes revealed they share a further 19 genes of which nine are found exclusively in RN4 phages and four appear to have been acquired from their bacterial hosts. Proteomic analysis of the RPP1 and RLP1 virions identified a second structural module present in the RN4 phages similar to that found in the Pseudomonas N4-like phage LIT1. Searches of various metagenomic databases, included the GOS database, using CDS sequences from RPP1 suggests these phages are widely distributed in marine environments in particular in the open ocean environment.
Mavian, Carla; López-Bueno, Alberto; Balseiro, Ana; Casais, Rosa; Alcamí, Antonio; Alejo, Alí
Worldwide amphibian population declines have been ascribed to global warming, increasing pollution levels, and other factors directly related to human activities. These factors may additionally be favoring the emergence of novel pathogens. In this report, we have determined the complete genome sequence of the emerging common midwife toad ranavirus (CMTV), which has caused fatal disease in several amphibian species across Europe. Phylogenetic and gene content analyses of the first complete genomic sequence from a ranavirus isolated in Europe show that CMTV is an amphibian-like ranavirus (ALRV). However, the CMTV genome structure is novel and represents an intermediate evolutionary stage between the two previously described ALRV groups. We find that CMTV clusters with several other ranaviruses isolated from different hosts and locations which might also be included in this novel ranavirus group. This work sheds light on the phylogenetic relationships within this complex group of emerging, disease-causing viruses.
Wanzek, Katharina; Schwindt, Eike; Capra, John A.; Paeschke, Katrin
The regulation of replication is essential to preserve genome integrity. Mms1 is part of the E3 ubiquitin ligase complex that is linked to replication fork progression. By identifying Mms1 binding sites genome-wide in Saccharomyces cerevisiae we connected Mms1 function to genome integrity and
Ghaffari, Pouyan; Mardinoglu, Adil; Asplund, Anna
Human cancer cell lines are used as important model systems to study molecular mechanisms associated with tumor growth, hereunder how genomic and biological heterogeneity found in primary tumors affect cellular phenotypes. We reconstructed Genome scale metabolic models (GEMs) for eleven cell lines...... based on RNA-Seq data and validated the functionality of these models with data from metabolite profiling. We used cell line-specific GEMs to analyze the differences in the metabolism of cancer cell lines, and to explore the heterogeneous expression of the metabolic subsystems. Furthermore, we predicted...... for inhibition of cell growth may provide leads for the development of efficient cancer treatment strategies....
Robinson, Hannah; Hickey, Lee; Richard, Cecile; Mace, Emma; Kelly, Alison; Borrell, Andrew; Franckowiak, Jerome; Fox, Glen
Water availability is a major limiting factor for crop production, making drought adaptation and its many component traits a desirable attribute of plant cultivars. Previous studies in cereal crops indicate that root traits expressed at early plant developmental stages, such as seminal root angle and root number, are associated with water extraction at different depths. Here, we conducted the first study to map seminal root traits in barley ( L.). Using a recently developed high-throughput phenotyping method, a panel of 30 barley genotypes and a doubled-haploid (DH) population (ND24260 × 'Flagship') comprising 330 lines genotyped with diversity array technology (DArT) markers were evaluated for seminal root angle (deviation from vertical) and root number under controlled environmental conditions. A high degree of phenotypic variation was observed in the panel of 30 genotypes: 13.5 to 82.2 and 3.6 to 6.9° for root angle and root number, respectively. A similar range was observed in the DH population: 16.4 to 70.5 and 3.6 to 6.5° for root angle and number, respectively. Seven quantitative trait loci (QTL) for seminal root traits (root angle, two QTL; root number, five QTL) were detected in the DH population. A major QTL influencing both root angle and root number (/) was positioned on chromosome 5HL. Across-species analysis identified 10 common genes underlying root trait QTL in barley, wheat ( L.), and sorghum [ (L.) Moench]. Here, we provide insight into seminal root phenotypes and provide a first look at the genetics controlling these traits in barley. Copyright © 2016 Crop Science Society of America.
Full Text Available Water availability is a major limiting factor for crop production, making drought adaptation and its many component traits a desirable attribute of plant cultivars. Previous studies in cereal crops indicate that root traits expressed at early plant developmental stages, such as seminal root angle and root number, are associated with water extraction at different depths. Here, we conducted the first study to map seminal root traits in barley ( L.. Using a recently developed high-throughput phenotyping method, a panel of 30 barley genotypes and a doubled-haploid (DH population (ND24260 × ‘Flagship’ comprising 330 lines genotyped with diversity array technology (DArT markers were evaluated for seminal root angle (deviation from vertical and root number under controlled environmental conditions. A high degree of phenotypic variation was observed in the panel of 30 genotypes: 13.5 to 82.2 and 3.6 to 6.9° for root angle and root number, respectively. A similar range was observed in the DH population: 16.4 to 70.5 and 3.6 to 6.5° for root angle and number, respectively. Seven quantitative trait loci (QTL for seminal root traits (root angle, two QTL; root number, five QTL were detected in the DH population. A major QTL influencing both root angle and root number (/ was positioned on chromosome 5HL. Across-species analysis identified 10 common genes underlying root trait QTL in barley, wheat ( L., and sorghum [ (L. Moench]. Here, we provide insight into seminal root phenotypes and provide a first look at the genetics controlling these traits in barley.
Zena T Wolf
Full Text Available Cleft lip with or without cleft palate (CL/P is the most commonly occurring craniofacial birth defect. We provide insight into the genetic etiology of this birth defect by performing genome-wide association studies in two species: dogs and humans. In the dog, a genome-wide association study of 7 CL/P cases and 112 controls from the Nova Scotia Duck Tolling Retriever (NSDTR breed identified a significantly associated region on canine chromosome 27 (unadjusted p=1.1 x 10(-13; adjusted p= 2.2 x 10(-3. Further analysis in NSDTR families and additional full sibling cases identified a 1.44 Mb homozygous haplotype (chromosome 27: 9.29 - 10.73 Mb segregating with a more complex phenotype of cleft lip, cleft palate, and syndactyly (CLPS in 13 cases. Whole-genome sequencing of 3 CLPS cases and 4 controls at 15X coverage led to the discovery of a frameshift mutation within ADAMTS20 (c.1360_1361delAA (p.Lys453Ilefs*3, which segregated concordant with the phenotype. In a parallel study in humans, a family-based association analysis (DFAM of 125 CL/P cases, 420 unaffected relatives, and 392 controls from a Guatemalan cohort, identified a suggestive association (rs10785430; p =2.67 x 10-6 with the same gene, ADAMTS20. Sequencing of cases from the Guatemalan cohort was unable to identify a causative mutation within the coding region of ADAMTS20, but four coding variants were found in additional cases of CL/P. In summary, this study provides genetic evidence for a role of ADAMTS20 in CL/P development in dogs and as a candidate gene for CL/P development in humans.
Wolf, Zena T; Brand, Harrison A; Shaffer, John R; Leslie, Elizabeth J; Arzi, Boaz; Willet, Cali E; Cox, Timothy C; McHenry, Toby; Narayan, Nicole; Feingold, Eleanor; Wang, Xioajing; Sliskovic, Saundra; Karmi, Nili; Safra, Noa; Sanchez, Carla; Deleyiannis, Frederic W B; Murray, Jeffrey C; Wade, Claire M; Marazita, Mary L; Bannasch, Danika L
Cleft lip with or without cleft palate (CL/P) is the most commonly occurring craniofacial birth defect. We provide insight into the genetic etiology of this birth defect by performing genome-wide association studies in two species: dogs and humans. In the dog, a genome-wide association study of 7 CL/P cases and 112 controls from the Nova Scotia Duck Tolling Retriever (NSDTR) breed identified a significantly associated region on canine chromosome 27 (unadjusted p=1.1 x 10(-13); adjusted p= 2.2 x 10(-3)). Further analysis in NSDTR families and additional full sibling cases identified a 1.44 Mb homozygous haplotype (chromosome 27: 9.29 - 10.73 Mb) segregating with a more complex phenotype of cleft lip, cleft palate, and syndactyly (CLPS) in 13 cases. Whole-genome sequencing of 3 CLPS cases and 4 controls at 15X coverage led to the discovery of a frameshift mutation within ADAMTS20 (c.1360_1361delAA (p.Lys453Ilefs*3)), which segregated concordant with the phenotype. In a parallel study in humans, a family-based association analysis (DFAM) of 125 CL/P cases, 420 unaffected relatives, and 392 controls from a Guatemalan cohort, identified a suggestive association (rs10785430; p =2.67 x 10-6) with the same gene, ADAMTS20. Sequencing of cases from the Guatemalan cohort was unable to identify a causative mutation within the coding region of ADAMTS20, but four coding variants were found in additional cases of CL/P. In summary, this study provides genetic evidence for a role of ADAMTS20 in CL/P development in dogs and as a candidate gene for CL/P development in humans.
Kim, Sangkyu; Welsh, David A; Myers, Leann; Cherry, Katie E; Wyckoff, Jennifer; Jazwinski, S Michal
We have completed a genome-wide linkage scan for healthy aging using data collected from a family study, followed by fine-mapping by association in a separate population, the first such attempt reported. The family cohort consisted of parents of age 90 or above and their children ranging in age from 50 to 80. As a quantitative measure of healthy aging, we used a frailty index, called FI34, based on 34 health and function variables. The linkage scan found a single significant linkage peak on chromosome 12. Using an independent cohort of unrelated nonagenarians, we carried out a fine-scale association mapping of the region suggestive of linkage and identified three sites associated with healthy aging. These healthy-aging sites (HASs) are located in intergenic regions at 12q13-14. HAS-1 has been previously associated with multiple diseases, and an enhancer was recently mapped and experimentally validated within the site. HAS-2 is a previously uncharacterized site possessing genomic features suggestive of enhancer activity. HAS-3 contains features associated with Polycomb repression. The HASs also contain variants associated with exceptional longevity, based on a separate analysis. Our results provide insight into functional genomic networks involving non-coding regulatory elements that are involved in healthy aging and longevity.
Silar, Philippe; Barreau, Christian; Debuchy, Robert; Kicka, Sébastien; Turcq, Béatrice; Sainsard-Chanet, Annie; Sellem, Carole H; Billault, Alain; Cattolico, Laurence; Duprat, Simone; Weissenbach, Jean
A Podospora anserina BAC library of 4800 clones has been constructed in the vector pBHYG allowing direct selection in fungi. Screening of the BAC collection for centromeric sequences of chromosome V allowed the recovery of clones localized on either sides of the centromere, but no BAC clone was found to contain the centromere. Seven BAC clones containing 322,195 and 156,244bp from either sides of the centromeric region were sequenced and annotated. One 5S rRNA gene, 5 tRNA genes, and 163 putative coding sequences (CDS) were identified. Among these, only six CDS seem specific to P. anserina. The gene density in the centromeric region is approximately one gene every 2.8kb. Extrapolation of this gene density to the whole genome of P. anserina suggests that the genome contains about 11,000 genes. Synteny analyses between P. anserina and Neurospora crassa show that co-linearity extends at the most to a few genes, suggesting rapid genome rearrangements between these two species.
Washietl, Stefan; Pedersen, Jakob Skou; Korbel, Jan O
Functional RNA structures play an important role both in the context of noncoding RNA transcripts as well as regulatory elements in mRNAs. Here we present a computational study to detect functional RNA structures within the ENCODE regions of the human genome. Since structural RNAs in general lack...... with the GENCODE annotation points to functional RNAs in all genomic contexts, with a slightly increased density in 3'-UTRs. While we estimate a significant false discovery rate of approximately 50%-70% many of the predictions can be further substantiated by additional criteria: 248 loci are predicted by both RNAz...
Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K P; Woo, Patrick C Y
Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10-49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n=2), Pichia (Candida) norvegensis (n=2), Candida tropicalis (n=1) and Saccharomyces cerevisiae (n=1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study.
Crucello, Aline; Sforça, Danilo Augusto; Horta, Maria Augusta Crivelente; dos Santos, Clelton Aparecido; Viana, Américo José Carvalho; Beloti, Lilian Luzia; de Toledo, Marcelo Augusto Szymanski; Vincentz, Michel; Kuroshu, Reginaldo Massanobu; de Souza, Anete Pereira
Trichoderma harzianum IOC-3844 secretes high levels of cellulolytic-active enzymes and is therefore a promising strain for use in biotechnological applications in second-generation bioethanol production. However, the T. harzianum biomass degradation mechanism has not been well explored at the genetic level. The present work investigates six genomic regions (~150 kbp each) in this fungus that are enriched with genes related to biomass conversion. A BAC library consisting of 5,760 clones was constructed, with an average insert length of 90 kbp. The assembled BAC sequences revealed 232 predicted genes, 31.5% of which were related to catabolic pathways, including those involved in biomass degradation. An expression profile analysis based on RNA-Seq data demonstrated that putative regulatory elements, such as membrane transport proteins and transcription factors, are located in the same genomic regions as genes related to carbohydrate metabolism and exhibit similar expression profiles. Thus, we demonstrate a rapid and efficient tool that focuses on specific genomic regions by combining a BAC library with transcriptomic data. This is the first BAC-based structural genomic study of the cellulolytic fungus T. harzianum, and its findings provide new perspectives regarding the use of this species in biomass degradation processes.
H. Furberg (Helena); Y. Kim (Yunjung); J. Dackor (Jennifer); E.A. Boerwinkle (Eric); N. Franceschini (Nora); D. Ardissino (Diego); L. Bernardinelli (Luisa); P.M. Mannucci (Pier); F. Mauri (Francesco); P.A. Merlini (Piera); D. Absher (Devin); T.L. Assimes (Themistocles); S.P. Fortmann (Stephen); C. Iribarren (Carlos); J.W. Knowles (Joshua); T. Quertermous (Thomas); L. Ferrucci (Luigi); T. Tanaka (Toshiko); J.C. Bis (Joshua); T. Haritunians (Talin); B. McKnight (Barbara); B.M. Psaty (Bruce); K.D. Taylor (Kent); E.L. Thacker (Evan); P. Almgren (Peter); L. Groop (Leif); C. Ladenvall (Claes); M. Boehnke (Michael); A.U. Jackson (Anne); K.L. Mohlke (Karen); H.M. Stringham (Heather); J. Tuomilehto (Jaakko); E.J. Benjamin (Emelia); S.J. Hwang; D. Levy (Daniel); S.R. Preis; R.S. Vasan (Ramachandran Srini); J. Duan (Jubao); P.V. Gejman (Pablo); D.F. Levinson (Douglas); A.R. Sanders (Alan); J. Shi (Jianxin); E.H. Lips (Esther); J.D. McKay (James); A. Agudo (Antonio); L. Barzan (Luigi); V. Bencko (Vladimir); S. Benhamou (Simone); X. Castellsagué (Xavier); C. Canova (Cristina); D.I. Conway (David); E. Fabianova (Eleonora); L. Foretova (Lenka); V. Janout (Vladimir); C.M. Healy (Claire); I. Holcátová (Ivana); K. Kjaerheim (Kristina); P. Lagiou; J. Lissowska (Jolanta); R. Lowry (Ray); T.V. MacFarlane (Tatiana); D. Mates (Dana); L. Richiardi (Lorenzo); P. Rudnai (Peter); N. Szeszenia-Dabrowska (Neonilia); D. Zaridze; A. Znaor (Ariana); M. Lathrop (Mark); P. Brennan (Paul); S. Bandinelli (Stefania); T.M. Frayling (Timothy); J.M. Guralnik (Jack); Y. Milaneschi (Yuri); J.R.B. Perry (John); D. Altshuler (David); R. Elosua (Roberto); S. Kathiresan (Sekar); G. Lucas (Gavin); O. Melander (Olle); V. Salomaa (Veikko); S.M. Schwartz (Stephen); B.F. Voight (Benjamin); B.W.J.H. Penninx (Brenda); J.H. Smit (Johannes); N. Vogelzangs (Nicole); D.I. Boomsma (Dorret); E.J.C. de Geus (Eco); J.M. Vink (Jacqueline); G.A.H.M. Willemsen (Gonneke); S.J. Chanock (Stephen); F. Gu (Fangyi); S.E. Hankinson (Susan); D. Hunter (David); A. Hofman (Albert); H.W. Tiemeier (Henning); A.G. Uitterlinden (André); P. Tikka-Kleemola (Päivi); S. Walter (Stefan); D.I. Chasman (Daniel); B.M. Everett (Brendan); G. Pare (Guillaume); P.M. Ridker (Paul); M.D. Li (Ming); H.H. Maes (Hermine); J. Audrain-Mcgovern (Janet); D. Posthuma (Danielle); L.M. Thornton (Laura); C. Lerman (Caryn); J. Kaprio (Jaakko); J.E. Rose (Jed); J.P.A. Ioannidis (John); P. Kraft (Peter); D.Y. Lin (Dan); P.F. Sullivan (Patrick); C.J. O'Donnell (Christopher)
textabstractConsistent but indirect evidence has implicated genetic factors in smoking behavior. We report meta-analyses of several smoking phenotypes within cohorts of the Tobacco and Genetics Consortium (n = 74,053). We also partnered with the European Network of Genetic and Genomic Epidemiology
Anttila, Verneri; Winsvold, Bendik S; Gormley, Padhraig
Migraine is the most common brain disorder, affecting approximately 14% of the adult population, but its molecular mechanisms are poorly understood. We report the results of a meta-analysis across 29 genome-wide association studies, including a total of 23,285 individuals with migraine (cases) an...
Bønnelykke, Klaus; Matheson, Melanie C; Pers, Tune Hannes
Allergen-specific immunoglobulin E (present in allergic sensitization) has a central role in the pathogenesis of allergic disease. We performed the first large-scale genome-wide association study (GWAS) of allergic sensitization in 5,789 affected individuals and 10,056 controls and followed up th...
Cornelis, M. C.; Byrne, E. M.; Esko, T.; Nalls, M. A.; Ganna, A.; Paynter, N.; Monda, K. L.; Amin, N.; Fischer, K.; Renstrom, F.; Ngwa, J. S.; Huikari, V.; Cavadino, A.; Nolte, I. M.; Teumer, A.; Yu, K.; Marques-Vidal, P.; Rawal, R.; Manichaikul, A.; Wojczynski, M. K.; Vink, J. M.; Zhao, J. H.; Burlutsky, G.; Lahti, J.; Mikkilä, V.; Lemaitre, R. N.; Eriksson, J.; Musani, S. K.; Tanaka, T.; Geller, F.; Luan, J.; Hui, J.; Mägi, R.; Dimitriou, M.; Garcia, M. E.; Ho, W.-K.; Wright, M. J.; Rose, L. M.; Magnusson, P. K. E.; Pedersen, N. L.; Couper, D.; Oostra, B. A.; Hofman, A.; Ikram, M. A.; Tiemeier, H. W.; Uitterlinden, A. G.; van Rooij, F. J. A.; Barroso, I.; Johansson, I.; Xue, L.; Kaakinen, M.; Milani, L.; Power, C.; Snieder, H.; Stolk, R. P.; Baumeister, S. E.; Biffar, R.; Gu, F.; Bastardot, F.; Kutalik, Z.; Jacobs, D. R.; Forouhi, N. G.; Mihailov, E.; Lind, L.; Lindgren, C.; Michaëlsson, K.; Morris, A.; Jensen, M.; Khaw, K.-T.; Luben, R. N.; Wang, J. J.; Männistö, S.; Perälä, M.-M.; Kähönen, M.; Lehtimäki, T.; Viikari, J.; Mozaffarian, D.; Mukamal, K.; Psaty, B. M.; Döring, A.; Heath, A. C.; Montgomery, G. W.; Dahmen, N.; Carithers, T.; Tucker, K. L.; Ferrucci, L.; Boyd, H. A.; Melbye, M.; Treur, J. L.; Mellström, D.; Hottenga, J. J.; Prokopenko, I.; Tönjes, A.; Deloukas, P.; Kanoni, S.; Lorentzon, M.; Houston, D. K.; Liu, Y.; Danesh, J.; Rasheed, A.; Mason, M. A.; Zonderman, A. B.; Franke, L.; Kristal, B. S.; Karjalainen, J.; Reed, D. R.; Westra, H.-J.; Evans, M. K.; Saleheen, D.; Harris, T. B.; Dedoussis, G.; Curhan, G.; Stumvoll, M.; Beilby, J.; Pasquale, L. R.; Feenstra, B.; Bandinelli, S.; Ordovas, J. M.; Chan, A. T.; Peters, U.; Ohlsson, C.; Gieger, C.; Martin, N. G.; Waldenberger, M.; Siscovick, D. S.; Raitakari, O.; Eriksson, J. G.; Mitchell, P.; Hunter, D. J.; Kraft, P.; Rimm, E. B.; Boomsma, D. I.; Borecki, I. B.; Loos, R. J. F.; Wareham, N. J.; Vollenweider, P.; Caporaso, N.; Grabe, H. J.; Neuhouser, M. L.; Wolffenbuttel, B. H. R.; Hu, F. B.; Hyppönen, E.; Järvelin, M.-R.; Cupples, L. A.; Franks, P. W.; Ridker, P. M.; van Duijn, C. M.; Heiss, G.; Metspalu, A.; North, K. E.; Ingelsson, E.; Nettleton, J. A.; van Dam, R. M.; Chasman, D. I.; Nalls, Michael A.; Plagnol, Vincent; Hernandez, Dena G.; Sharma, Manu; Sheerin, Una-Marie; Saad, Mohamad; Simón-Sánchez, Javier; Schulte, Claudia; Lesage, Suzanne; Sveinbjörnsdóttir, Sigurlaug; Arepalli, Sampath; Barker, Roger; Ben-Shlomo, Yoav; Berendse, Henk W.; Berg, Daniela; Bhatia, Kailash; de Bie, Rob M. A.; Biffi, Alessandro; Bloem, Bas; Bochdanovits, Zoltan; Bonin, Michael; Bras, M.; Brockmann, Kathrin; Brooks, Janet; Burn, David J.; Charlesworth, Gavin; Chen, Honglei; Chinnery, Patrick F.; Chong, Sean; Clarke, Carl E.; Cookson, Mark R.; Cooper, J. Mark; Corvol, Jean Christophe; Counsell, Carl; Damier, Philippe; Dartigues, Jean-François; Deloukas, Panos; Deuschl, Günther; Dexter, David T.; van Dijk, Karin D.; Dillman, Allissa; Durif, Frank; Dürr, Alexandra; Edkins, Sarah; Evans, Jonathan R.; Foltynie, Thomas; Dong, Jing; Gardner, Michelle; Gibbs, J. Raphael; Goate, Alison; Gray, Emma; Guerreiro, Rita; Harris, Clare; van Hilten, Jacobus J.; Hofman, Albert; Hollenbeck, Albert; Holton, Janice; Hu, Michele; Huang, Xuemei; Hershey, Milton S.; Wurster, Isabel; Mätzler, Walter; Hudson, Gavin; Hunt, Sarah E.; Huttenlocher, Johanna; Illig, Thomas; München, Helmholtz Zentrum; Jónsson, Pálmi V.; Lambert, Jean-Charles; Langford, Cordelia; Lees, Andrew; Lichtner, Peter; Limousin, Patricia; Lopez, Grisel; Lorenz, Delia; McNeill, Alisdair; Moorby, Catriona; Moore, Matthew; Morris, Huw R.; Morrison, Karen E.; O' Sullivan, Sean S.; Pearson, Justin; Perlmutter, Joel S.; Pétursson, Hjörvar; Pollak, Pierre; Potter, Simon; Ravina, Bernard; Revesz, Tamas; Riess, Olaf; Rivadeneira, Fernando; Rizzu, Patrizia; Ryten, Mina; Sawcer, Stephen; Schapira, Anthony; Scheffer, Hans; Shaw, Karen; Sidransky, Ellen; Smith, Colin; Spencer, Chris C. A.; Stefánsson, Hreinn; Bettella, Francesco; Stockton, Joanna D.; Strange, Amy; Talbot, Kevin; Tanner, M.; Tashakkori-Ghanbaria, Avazeh; Tison, François; Trabzuni, Daniah; Traynor, Bryan J.; Uitterlinden, André G.; Velseboer, Daan; Vidailhet, Marie; Walker, Robert; van de Warrenburg, Bart; Wickremaratchi, Mirdhu; Williams, Nigel; Williams-Gray, Caroline H.; Winder-Rhodes, Sophie; Stefánsson, Kári; Martinez, Maria; Sabatier, Paul; Wood, Nicholas W.; Hardy, John; Heutink, Peter; Brice, Alexis; Gasser, Thomas; Singleton, Andrew B.; Singleton, Andrew; Cookson, Mark; Hernandez, Dena; Nalls, Michael; Zonderman, Alan; Ferrucci, Luigi; Johnson, Robert; Longo, Dan; O'Brien, Richard; Traynor, Bryan; Troncoso, Juan; van der Brug, Marcel; Zielke, Ronald; Weale, Michael; Ramasamy, Adaikalavan; Box, P. O.
Coffee, a major dietary source of caffeine, is among the most widely consumed beverages in the world and has received considerable attention regarding health risks and benefits. We conducted a genome-wide (GW) meta-analysis of predominately regular-type coffee consumption (cups per day) among up to
Kelly Ivors; Matteo Garbelotto; Ineke De Vries; Peter Bonants
Investigating the population genetics of Phytophthora ramorum, the causal agent of sudden oak death (SOD), is critical to understanding the biology and epidemiology of this important phytopathogen. Raw sequence data (445,000 reads) of P. ramorum was provided by the Joint Genome Institute. Our objective was to develop and utilize...
Adams, Hieab H H; Hibar, Derrek P; Chouraki, Vincent; Stein, Jason L; Nyquist, Paul A; Rentería, Miguel E; Trompet, Stella; Arias-Vasquez, Alejandro; Seshadri, Sudha; Desrivières, Sylvane; Beecham, Ashley H; Jahanshad, Neda; Wittfeld, Katharina; Van der Lee, Sven J; Abramovic, Lucija; Alhusaini, Saud; Amin, Najaf; Andersson, Micael; Arfanakis, Konstantinos; Aribisala, Benjamin S; Armstrong, Nicola J; Athanasiu, Lavinia; Axelsson, Tomas; Beiser, Alexa; Bernard, Manon; Bis, Joshua C; Blanken, Laura M E; Blanton, Susan H; Bohlken, Marc M; Boks, Marco P; Bralten, Janita; Brickman, Adam M; Carmichael, Owen; Chakravarty, M Mallar; Chauhan, Ganesh; Chen, Qiang; Ching, Christopher R K; Cuellar-Partida, Gabriel; Braber, Anouk Den; Doan, Nhat Trung; Ehrlich, Stefan; Filippi, Irina; Ge, Tian; Giddaluru, Sudheer; Goldman, Aaron L; Gottesman, Rebecca F; Greven, Corina U; Grimm, Oliver; Griswold, Michael E; Guadalupe, Tulio; Hass, Johanna; Haukvik, Unn K; Hilal, Saima; Hofer, Edith; Hoehn, David; Holmes, Avram J; Hoogman, Martine; Janowitz, Deborah; Jia, Tianye; Kasperaviciute, Dalia; Kim, Sungeun; Klein, Marieke; Kraemer, Bernd; Lee, Phil H; Liao, Jiemin; Liewald, David C M; Lopez, Lorna M; Luciano, Michelle; Macare, Christine; Marquand, Andre; Matarin, Mar; Mather, Karen A; Mattheisen, Manuel; Mazoyer, Bernard; McKay, David R; McWhirter, Rebekah; Milaneschi, Yuri; Mirza-Schreiber, Nazanin; Muetzel, Ryan L; Maniega, Susana Muñoz; Nho, Kwangsik; Nugent, Allison C; Loohuis, Loes M Olde; Oosterlaan, Jaap; Papmeyer, Martina; Pappa, Irene; Pirpamer, Lukas; Pudas, Sara; Pütz, Benno; Rajan, Kumar B; Ramasamy, Adaikalavan; Richards, Jennifer S; Risacher, Shannon L; Roiz-Santiañez, Roberto; Rommelse, Nanda; Rose, Emma J; Royle, Natalie A; Rundek, Tatjana; Sämann, Philipp G; Satizabal, Claudia L; Schmaal, Lianne; Schork, Andrew J; Shen, Li; Shin, Jean; Shumskaya, Elena; Smith, Albert V; Sprooten, Emma; Strike, Lachlan T; Teumer, Alexander; Thomson, Russell; Tordesillas-Gutierrez, Diana; Toro, Roberto; Trabzuni, Daniah; Vaidya, Dhananjay; Van der Grond, Jeroen; Van der Meer, Dennis; Van Donkelaar, Marjolein M J; Van Eijk, Kristel R; Van Erp, Theo G M; Van Rooij, Daan; Walton, Esther; Westlye, Lars T; Whelan, Christopher D; Windham, Beverly G; Winkler, Anderson M; Woldehawariat, Girma; Wolf, Christiane; Wolfers, Thomas; Xu, Bing; Yanek, Lisa R; Yang, Jingyun; Zijdenbos, Alex; Zwiers, Marcel P; Agartz, Ingrid; Aggarwal, Neelum T; Almasy, Laura; Ames, David; Amouyel, Philippe; Andreassen, Ole A; Arepalli, Sampath; Assareh, Amelia A; Barral, Sandra; Bastin, Mark E; Becker, Diane M; Becker, James T; Bennett, David A; Blangero, John; van Bokhoven, Hans; Boomsma, Dorret I; Brodaty, Henry; Brouwer, Rachel M; Brunner, Han G; Buckner, Randy L; Buitelaar, Jan K; Bulayeva, Kazima B; Cahn, Wiepke; Calhoun, Vince D; Cannon, Dara M; Cavalleri, Gianpiero L; Chen, Christopher; Cheng, Ching-Yu; Cichon, Sven; Cookson, Mark R; Corvin, Aiden; Crespo-Facorro, Benedicto; Curran, Joanne E; Czisch, Michael; Dale, Anders M; Davies, Gareth E; De Geus, Eco J C; De Jager, Philip L; de Zubicaray, Greig I; Delanty, Norman; Depondt, Chantal; DeStefano, Anita L; Dillman, Allissa; Djurovic, Srdjan; Donohoe, Gary; Drevets, Wayne C; Duggirala, Ravi; Dyer, Thomas D; Erk, Susanne; Espeseth, Thomas; Evans, Denis A; Fedko, Iryna O; Fernández, Guillén; Ferrucci, Luigi; Fisher, Simon E; Fleischman, Debra A; Ford, Ian; Foroud, Tatiana M; Fox, Peter T; Francks, Clyde; Fukunaga, Masaki; Gibbs, J Raphael; Glahn, David C; Gollub, Randy L; Göring, Harald H H; Grabe, Hans J; Green, Robert C; Gruber, Oliver; Gudnason, Vilmundur; Guelfi, Sebastian; Hansell, Narelle K; Hardy, John; Hartman, Catharina A; Hashimoto, Ryota; Hegenscheid, Katrin; Heinz, Andreas; Le Hellard, Stephanie; Hernandez, Dena G; Heslenfeld, Dirk J; Ho, Beng-Choon; Hoekstra, Pieter J; Hoffmann, Wolfgang; Hofman, Albert; Holsboer, Florian; Homuth, Georg; Hosten, Norbert; Hottenga, Jouke-Jan; Pol, Hilleke E Hulshoff; Ikeda, Masashi; Ikram, M Kamran; Jack, Clifford R; Jenkinson, Mark; Johnson, Robert; Jönsson, Erik G; Jukema, J Wouter; Kahn, René S; Kanai, Ryota; Kloszewska, Iwona; Knopman, David S; Kochunov, Peter; Kwok, John B; Lawrie, Stephen M; Lemaître, Hervé; Liu, Xinmin; Longo, Dan L; Longstreth, W T; Lopez, Oscar L; Lovestone, Simon; Martinez, Oliver; Martinot, Jean-Luc; Mattay, Venkata S; McDonald, Colm; McIntosh, Andrew M; McMahon, Katie L; McMahon, Francis J; Mecocci, Patrizia; Melle, Ingrid; Meyer-Lindenberg, Andreas; Mohnke, Sebastian; Montgomery, Grant W; Morris, Derek W; Mosley, Thomas H; Mühleisen, Thomas W; Müller-Myhsok, Bertram; Nalls, Michael A; Nauck, Matthias; Nichols, Thomas E; Niessen, Wiro J; Nöthen, Markus M; Nyberg, Lars; Ohi, Kazutaka; Olvera, Rene L; Ophoff, Roel A; Pandolfo, Massimo; Paus, Tomas; Pausova, Zdenka; Penninx, Brenda W J H; Pike, G Bruce; Potkin, Steven G; Psaty, Bruce M; Reppermund, Simone; Rietschel, Marcella; Roffman, Joshua L; Romanczuk-Seiferth, Nina; Rotter, Jerome I; Ryten, Mina; Sacco, Ralph L; Sachdev, Perminder S; Saykin, Andrew J; Schmidt, Reinhold; Schofield, Peter R; Sigurdsson, Sigurdur; Simmons, Andy; Singleton, Andrew; Sisodiya, Sanjay M; Smith, Colin; Smoller, Jordan W; Soininen, Hilkka; Srikanth, Velandai; Steen, Vidar M; Stott, David J; Sussmann, Jessika E; Thalamuthu, Anbupalam; Tiemeier, Henning; Toga, Arthur W; Traynor, Bryan J; Troncoso, Juan; Turner, Jessica A; Tzourio, Christophe; Uitterlinden, Andre G; Hernández, Maria C Valdés; Van der Brug, Marcel; Van der Lugt, Aad; Van der Wee, Nic J A; Van Duijn, Cornelia M; Van Haren, Neeltje E M; Van T Ent, Dennis; Van Tol, Marie-Jose; Vardarajan, Badri N; Veltman, Dick J; Vernooij, Meike W; Völzke, Henry; Walter, Henrik; Wardlaw, Joanna M; Wassink, Thomas H; Weale, Michael E; Weinberger, Daniel R; Weiner, Michael W; Wen, Wei; Westman, Eric; White, Tonya; Wong, Tien Y; Wright, Clinton B; Zielke, H Ronald; Zonderman, Alan B; Deary, Ian J; DeCarli, Charles; Schmidt, Helena; Martin, Nicholas G; De Craen, Anton J M; Wright, Margaret J; Launer, Lenore J; Schumann, Gunter; Fornage, Myriam; Franke, Barbara; Debette, Stéphanie; Medland, Sarah E; Ikram, M Arfan; Thompson, Paul M
Intracranial volume reflects the maximally attained brain size during development, and remains stable with loss of tissue in late life. It is highly heritable, but the underlying genes remain largely undetermined. In a genome-wide association study of 32,438 adults, we discovered five previously
Yuen, Ryan K C; Merico, Daniele; Bookman, Matt; Howe, Jennifer L.; Thiruvahindrapuram, Bhooma; Patel, Rohan V.; Whitney, Joe; Deflaux, Nicole; Bingham, Jonathan; Wang, Zhuozhi; Pellecchia, Giovanna; Buchanan, Janet A.; Walker, Susan; Marshall, Christian R.; Uddin, Mohammed; Zarrei, Mehdi; Deneault, Eric; D'Abate, Lia; Chan, Ada J S; Koyanagi, Stephanie; Paton, Tara; Pereira, Sergio L.; Hoang, Ny; Engchuan, Worrawat; Higginbotham, Edward J.; Ho, Karen; Lamoureux, Sylvia; Li, Weili; MacDonald, Jeffrey R.; Nalpathamkalam, Thomas; Sung, Wilson W L; Tsoi, Fiona J.; Wei, John; Xu, Lizhen; Tasse, Anne Marie; Kirby, Emily; Van Etten, William; Twigger, Simon; Roberts, Wendy; Drmic, Irene; Jilderda, Sanne; Modi, Bonnie Mackinnon; Kellam, Barbara; Szego, Michael; Cytrynbaum, Cheryl; Weksberg, Rosanna; Zwaigenbaum, Lonnie; Woodbury-Smith, Marc; Brian, Jessica; Senman, Lili; Iaboni, Alana; Doyle-Thomas, Krissy; Thompson, Ann; Chrysler, Christina; Leef, Jonathan; Savion-Lemieux, Tal; Smith, Isabel M.; Liu, Xudong; Nicolson, Rob; Seifer, Vicki; Fedele, Angie; Cook, Edwin H.; Dager, Stephen; Estes, Annette; Gallagher, Louise; Malow, Beth A.; Parr, Jeremy R.; Spence, Sarah J.; Vorstman, Jacob; Frey, Brendan J.; Robinson, James T.; Strug, Lisa J.; Fernandez, Bridget A.; Elsabbagh, Mayada; Carter, Melissa T.; Hallmayer, Joachim; Knoppers, Bartha M.; Anagnostou, Evdokia; Szatmari, Peter; Ring, Robert H.; Glazer, David; Pletcher, Mathew T.; Scherer, Stephen W.
We are performing whole-genome sequencing of families with autism spectrum disorder (ASD) to build a resource (MSSNG) for subcategorizing the phenotypes and underlying genetic factors involved. Here we report sequencing of 5,205 samples from families with ASD, accompanied by clinical information,
Peifer, Martin; Fernandez-Cuesta, Lynnette; Sos, Martin L.; George, Julie; Seidel, Danila; Kasper, Lawryn H.; Plenker, Dennis; Leenders, Frauke; Sun, Ruping; Zander, Thomas; Menon, Roopika; Koker, Mirjam; Dahmen, Ilona; Mueller, Christian; Di Cerbo, Vincenzo; Schildhaus, Hans-Ulrich; Altmueller, Janine; Baessmann, Ingelore; Becker, Christian; de Wilde, Bram; Vandesompele, Jo; Boehm, Diana; Ansen, Sascha; Gabler, Franziska; Wilkening, Ines; Heynck, Stefanie; Heuckmann, Johannes M.; Lu, Xin; Carter, Scott L.; Cibulskis, Kristian; Banerji, Shantanu; Getz, Gad; Park, Kwon-Sik; Rauh, Daniel; Gruetter, Christian; Fischer, Matthias; Pasqualucci, Laura; Wright, Gavin; Wainer, Zoe; Russell, Prudence; Petersen, Iver; Chen, Yuan; Stoelben, Erich; Ludwig, Corinna; Schnabel, Philipp; Hoffmann, Hans; Muley, Thomas; Brockmann, Michael; Engel-Riedel, Walburga; Muscarella, Lucia A.; Fazio, Vito M.; Groen, Harry; Timens, Wim; Sietsma, Hannie; Thunnissen, Erik; Smit, Egbert; Heideman, Danielle A. M.; Snijders, Peter J. F.; Cappuzzo, Federico; Ligorio, Claudia; Damiani, Stefania; Field, John; Solberg, Steinar; Brustugun, Odd Terje; Lund-Iversen, Marius; Saenger, Joerg; Clement, Joachim H.; Soltermann, Alex; Moch, Holger; Weder, Walter; Solomon, Benjamin; Soria, Jean-Charles; Validire, Pierre; Besse, Benjamin; Brambilla, Elisabeth; Brambilla, Christian; Lantuejoul, Sylvie; Lorimier, Philippe; Schneider, Peter M.; Hallek, Michael; Pao, William; Meyerson, Matthew; Sage, Julien; Shendure, Jay; Schneider, Robert; Buettner, Reinhard; Wolf, Juergen; Nuernberg, Peter; Perner, Sven; Heukamp, Lukas C.; Brindle, Paul K.; Haas, Stefan; Thomas, Roman K.
Small-cell lung cancer (SCLC) is an aggressive lung tumor subtype with poor prognosis(1-3). We sequenced 29 SCLC exomes, 2 genomes and 15 transcriptomes and found an extremely high mutation rate of 7.4 +/- 1 protein-changing mutations per million base pairs. Therefore, we conducted integrated
Mitchell, Jonathan S; Li, Ni; Weinhold, Niels
Multiple myeloma (MM) is a plasma cell malignancy with a significant heritable basis. Genome-wide association studies have transformed our understanding of MM predisposition, but individual studies have had limited power to discover risk loci. Here we perform a meta-analysis of these GWAS, add a ...
Cornelis, M. C.; Byrne, E. M.; Esko, T.; Nalls, M. A.; Ganna, A.; Paynter, N.; Monda, K. L.; Amin, N.; Fischer, K.; Renstrom, F.; Ngwa, J. S.; Huikari, V.; Cavadino, A.; Nolte, I. M.; Teumer, A.; Yu, K.; Marques-Vidal, P.; Rawal, R.; Manichaikul, A.; Wojczynski, M. K.; Vink, J. M.; Zhao, J. H.; Burlutsky, G.; Lahti, J.; Mikkila, V.; Lemaitre, R. N.; Eriksson, J.; Musani, S. K.; Tanaka, T.; Geller, F.; Luan, J.; Hui, J.; Maegi, R.; Dimitriou, M.; Garcia, M. E.; Ho, W-K; Wright, M. J.; Rose, L. M.; Magnusson, P. K. E.; Pedersen, N. L.; Couper, D.; Oostra, B. A.; Hofman, A.; Ikram, M. A.; Tiemeier, H. W.; Uitterlinden, A. G.; van Rooij, F. J. A.; Barroso, I.; Johansson, I.; Xue, L.; Kaakinen, M.; Milani, L.; Power, C.; Snieder, H.; Stolk, R. P.; Baumeister, S. E.; Biffar, R.; Gu, F.; Bastardot, F.; Kutalik, Z.; Jacobs, D. R.; Forouhi, N. G.; Mihailov, E.; Lind, L.; Lindgren, C.; Michaelsson, K.; Morris, A.; Jensen, M.; Khaw, K-T; Luben, R. N.; Wang, J. J.; Mannisto, S.; Perala, M-M; Kahonen, M.; Lehtimaki, T.; Viikari, J.; Mozaffarian, D.; Mukamal, K.; Psaty, B. M.; Doering, A.; Heath, A. C.; Montgomery, G. W.; Dahmen, N.; Carithers, T.; Tucker, K. L.; Ferrucci, L.; Boyd, H. A.; Melbye, M.; Treur, J. L.; Mellstrom, D.; Hottenga, J. J.; Prokopenko, I.; Toenjes, A.; Deloukas, P.; Kanoni, S.; Lorentzon, M.; Houston, D. K.; Liu, Y.; Danesh, J.; Rasheed, A.; Mason, M. A.; Zonderman, A. B.; Franke, L.; Kristal, B. S.; Karjalainen, J.; Reed, D. R.; Westra, H-J; Evans, M. K.; Saleheen, D.; Harris, T. B.; Dedoussis, G.; Curhan, G.; Stumvoll, M.; Beilby, J.; Pasquale, L. R.; Feenstra, B.; Bandinelli, S.; Ordovas, J. M.; Chan, A. T.; Peters, U.; Ohlsson, C.; Gieger, C.; Martin, N. G.; Waldenberger, M.; Siscovick, D. S.; Raitakari, O.; Eriksson, J. G.; Mitchell, P.; Hunter, D. J.; Kraft, P.; Rimm, E. B.; Boomsma, D. I.; Borecki, I. B.; Loos, R.