The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data between researchers, launched today with a visit from Vice President Joe Biden to the operations center at the University of Chicago.
Aklujkar, Muktak; Krushkal, Julia; DiBartolo, Genevieve; Lapidus, Alla; Land, Miriam L.; Lovley, Derek R.
Background: The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results: The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recently in its evolutionary history. Vestiges of the molybdate (ModE) regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion: The genomic evidence suggests that metabolism, physiology and regulation of gene expression in G. metallireducens may be dramatically different from other Geobacteraceae.
Sørensen, Peter; Edwards, Stefan McKinnon; Rohde, Palle Duun
-additive genetic mechanisms. These modeling approaches have proven to be highly useful to determine population genetic parameters as well as prediction of genetic risk or value. We present a series of statistical modelling approaches that use prior biological information for evaluating the collective action......Whole-genome sequences and multiple trait phenotypes from large numbers of individuals will soon be available in many populations. Well established statistical modeling approaches enable the genetic analyses of complex trait phenotypes while accounting for a variety of additive and non...... regions and gene ontologies) that provide better model fit and increase predictive ability of the statistical model for this trait....
Falling costs in genomic laboratory experiments have led to a steady increase of genomic feature and variation data. Multiple genomic data formats exist for sharing these data, and whilst they are similar, they are addressing slightly different data viewpoints and are consequently not fully compatible with each other. The fragmentation of data format specifications makes it hard to integrate and interpret data for further analysis with information from multiple data providers. As a solution, a new ontology is presented here for annotating and representing genomic feature and variation dataset contents. The Genomic Feature and Variation Ontology (GFVO) specifically addresses genomic data as it is regularly shared using the GFF3 (incl. FASTA), GTF, GVF and VCF file formats. GFVO simplifies data integration and enables linking of genomic annotations across datasets through common semantics of genomic types and relations. Availability and implementation. The latest stable release of the ontology is available via its base URI; previous and development versions are available at the ontology’s GitHub repository: https://github.com/BioInterchange/Ontologies; versions of the ontology are indexed through BioPortal (without external class-/property-equivalences due to BioPortal release 4.10 limitations); examples and reference documentation is provided on a separate web-page: http://www.biointerchange.org/ontologies.html. GFVO version 1.0.2 is licensed under the CC0 1.0 Universal license (https://creativecommons.org/publicdomain/zero/1.0) and therefore de facto within the public domain; the ontology can be appropriated without attribution for commercial and non-commercial use.
Full Text Available Abstract Background Sequence data and other characters from mitochondrial genomes (gene translocations, secondary structure of RNA molecules are useful in phylogenetic studies among metazoan animals from population to phylum level. Moreover, the comparison of complete mitochondrial sequences gives valuable information about the evolution of small genomes, e.g. about different mechanisms of gene translocation, gene duplication and gene loss, or concerning nucleotide frequency biases. The Peracarida (gammarids, isopods, etc. comprise about 21,000 species of crustaceans, living in many environments from deep sea floor to arid terrestrial habitats. Ligia oceanica is a terrestrial isopod living at rocky seashores of the european North Sea and Atlantic coastlines. Results The study reveals the first complete mitochondrial DNA sequence from a peracarid crustacean. The mitochondrial genome of Ligia oceanica is a circular double-stranded DNA molecule, with a size of 15,289 bp. It shows several changes in mitochondrial gene order compared to other crustacean species. An overview about mitochondrial gene order of all crustacean taxa yet sequenced is also presented. The largest non-coding part (the putative mitochondrial control region of the mitochondrial genome of Ligia oceanica is unexpectedly not AT-rich compared to the remainder of the genome. It bears two repeat regions (4× 10 bp and 3× 64 bp, and a GC-rich hairpin-like secondary structure. Some of the transfer RNAs show secondary structures which derive from the usual cloverleaf pattern. While some tRNA genes are putative targets for RNA editing, trnR could not be localized at all. Conclusion Gene order is not conserved among Peracarida, not even among isopods. The two isopod species Ligia oceanica and Idotea baltica show a similarly derived gene order, compared to the arthropod ground pattern and to the amphipod Parhyale hawaiiensis, suggesting that most of the translocation events were already
LIUCai-ling; ZHUXiang-xiong; FuWen-jun; ZHAOMu-jun
Polydnavirus was purified from the calyx fluid of Cotesia plutellae ovary. The genomic features of C. plutellae polydnavirus (CpPDV) were investigated. The viral genome consists of at least 12 different segments and the aggregate genome size is a lower estimate of 80kbp. By partial digestion of CpPDV DNA with BamHI and subsequent ligation with BamHI-cut plasmid Bluescript, a representative library of CpPDV genome was obtained.
Join us for a live, moderated discussion about two NCI efforts to expand access to cancer genomics data: the Genomic Data Commons and Genomic Cloud Pilots. NCI subject matters experts will include Louis M. Staudt, M.D., Ph.D., Director Center for Cancer Genomics, Warren Kibbe, Ph.D., Director, NCI Center for Biomedical Informatics and Information Technology, and moderated by Anthony Kerlavage, Ph.D., Chief, Cancer Informatics Branch, Center for Biomedical Informatics and Information Technology. We welcome your questions before and during the Hangout on Twitter using the hashtag #AskNCI.
Investigators with The Cancer Genome Atlas (TCGA) Research Network have identified molecular characteristics of a type of breast cancer, invasive lobular carcinoma (ILC), that distinguishes it from invasive ductal carcinoma (IDC), the most common invasive breast cancer subtype.
Deng, Zhong; Wang, Zhuo; Lieberman, Paul M.
Genome maintenance mechanisms actively suppress genetic instability associated with cancer and aging. Some viruses provoke genetic instability by subverting the host’s control of genome maintenance. Viruses have their own specialized strategies for genome maintenance, which can mimic and modify host cell processes. Here, we review some of the common features of genome maintenance utilized by viruses and host chromosomes, with a particular focus on terminal repeat (TR) elements. The TRs of cellular chromosomes, better known as telomeres, have well-established roles in cellular chromosome stability. Cellular telomeres are themselves maintained by viral-like mechanisms, including self-propagation by reverse transcription, recombination, and retrotransposition. Viral TR elements, like cellular telomeres, are essential for viral genome stability and propagation. We review the structure and function of viral repeat elements and discuss how they may share telomere-like structures and genome protection functions. We consider how viral infections modulate telomere regulatory factors for viral repurposing and can alter normal host telomere structure and chromosome stability. Understanding the common strategies of viral and cellular genome maintenance may provide new insights into viral–host interactions and the mechanisms driving genetic instability in cancer. PMID:23293769
Zee, C.S; Segall, H.D.; Miller, C.; Ahmad, J.; McComb, J.G.; Han, J.S.; Park, S.H.
While many medulloblastomas have characteristic features on computed tomography (CT), a significant number have atypical features, including a cystic or necrotic component, calcification, hemorrhage, lack of contrast enhancement, and eccentric location, and/or direct supratentorial extension. Of 30 consecutive untreated cases reviewed by the authors, 14 (47%) had such findings. Failure to make the proper diagnosis will result in some cases if these features are not recognized as possible signs of medulloblastoma
Full Text Available Tuberculosis (TB is a disease caused by Mycobacterium tuberculosis. Despite the availability of novel therapeutic approaches, TB is considered as one of the leading causes of death due to infectious diseases worldwide. Alveolar macrophages are the first line of defense against M. tuberculosis; they ingest and sequester the bacilli within granulomatous structures. Control and resolution of the infection requires activated T lymphocytes as well as Th1 cytokines. There are two forms of TB: active TB and latent TB. Latent TB is a state in which M. tuberculosis survives in the body without causing overt signs and symptoms. People with latent TB are noncontagious. However, M. tuberculosis can become active in the body, multiply, and cause overt TB. Sarcoidosis, on the other hand, is an autoimmune disease of unknown etiology which can affect multiple systems of the body. Nonspecific constitutional symptoms, such as fever, fatigue, malaise, and weight loss, are present in approximately one-third of patients. Chest X-ray usually shows hilar and mediastinal lymphadenopathy. Although the lungs are the most common sites of inflammation, sarcoidosis can also involve other organs, such as the eyes (intraocular and adnexal, skin, lymph nodes, salivary glands, heart, spleen, liver, and the nervous system. Recent investigations have provided further insights into the genetic basis of sarcoidosis and the way genotype determines the clinical presentation and phenotype of patients. Histopathologic features are usually insufficient for diagnosis of sarcoidosis. Diagnosis of sarcoidosis in endemic areas for TB can become a great challenge. Both TB and sarcoidosis are granulomatous diseases; TB is characterized by caseating granulomas, whereas sarcoidosis is characterized by noncaseating granulomas. New cases of sarcoidosis are increasingly being diagnosed in areas endemic for TB due to increased orientation of physicians and availability of diagnostic modalities
Ali, Shahid; Amina, Bibi; Anwar, Saneela; Minhas, Rashid; Parveen, Nazia; Nawaz, Uzma; Azam, Syed Sikandar; Abbasi, Amir Ali
To elucidate important cellular and molecular interactions that regulate patterning and skeletal development, vertebrate limbs served as a model organ. A growing body of evidence from detailed studies on a subset of limb regulators like the HOXD cluster or SHH, reveals the importance of enhancers in limb related developmental and disease processes. Exploiting the recent genome-wide availability of functionally confirmed enhancer dataset, this study establishes regulatory interactions for dozens of human limb developmental genes. From these data, it appears that the long-range regulatory interactions are fairly common during limb development. This observation highlights the significance of chromosomal breaks/translocations in human limb deformities. Transcriptional factor (TF) analysis predicts that the differentiation of early nascent limb-bud into future territories entail distinct TF interaction networks. Conclusively, an important motivation for annotating the human limb specific regulatory networks is to pave way for the systematic exploration of their role in disease and evolution. Copyright © 2016. Published by Elsevier Inc.
Sarah M. Peterson
Full Text Available The human genome encodes for over 1800 microRNAs, which are short noncoding RNA molecules that function to regulate gene expression post-transcriptionally. Due to the potential for one microRNA to target multiple gene transcripts, microRNAs are recognized as a major mechanism to regulate gene expression and mRNA translation. Computational prediction of microRNA targets is a critical initial step in identifying microRNA:mRNA target interactions for experimental validation. The available tools for microRNA target prediction encompass a range of different computational approaches, from the modeling of physical interactions to the incorporation of machine learning. This review provides an overview of the major computational approaches to microRNA target prediction. Our discussion highlights three tools for their ease of use, reliance on relatively updated versions of miRBase, and range of capabilities, and these are DIANA-microT-CDS, miRanda-mirSVR, and TargetScan. In comparison across all microRNA target prediction tools, four main aspects of the microRNA:mRNA target interaction emerge as common features on which most target prediction is based: seed match, conservation, free energy, and site accessibility. This review explains these features and identifies how they are incorporated into currently available target prediction tools. MicroRNA target prediction is a dynamic field with increasing attention on development of new analysis tools. This review attempts to provide a comprehensive assessment of these tools in a manner that is accessible across disciplines. Understanding the basis of these prediction methodologies will aid in user selection of the appropriate tools and interpretation of the tool output.
Guo, F-B; Xia, Z-K; Wei, W; Zhao, H-L
We performed statistical analyses of five conserved features of genomic islands of bacteria. Analyses were made based on 104 known genomic islands, which were identified by comparative methods. Four of these features include sequence size, abnormal G+C content, flanking tRNA gene, and embedded mobility gene, which are frequently investigated. One relatively new feature, G+C homogeneity, was also investigated. Among the 104 known genomic islands, 88.5% were found to fall in the typical length of 10-200 kb and 80.8% had G+C deviations with absolute values larger than 2%. For the 88 genomic islands whose hosts have been sequenced and annotated, 52.3% of them were found to have flanking tRNA genes and 64.7% had embedded mobility genes. For the homogeneity feature, 85% had an h homogeneity index less than 0.1, indicating that their G+C content is relatively uniform. Taking all the five features into account, 87.5% of 88 genomic islands had three of them. Only one genomic island had only one conserved feature and none of the genomic islands had zero features. These statistical results should help to understand the general structure of known genomic islands. We found that larger genomic islands tend to have relatively small G+C deviations relative to absolute values. For example, the absolute G+C deviations of 9 genomic islands longer than 100,000 bp were all less than 5%. This is a novel but reasonable result given that larger genomic islands should have greater restrictions in their G+C contents, in order to maintain the stable G+C content of the recipient genome.
Ebeid, M.; Steiner, A.
Radiographs of an injured or infected bovine foot can be tricky to interpret - the anatomy is complex, and the signs may be subtle. This guide leads you through the classic radiographic features of several common foot conditions
Genomics was introduced with big promises and expectations of its future contribution to our society. Medical genomics was introduced as that which would lay the foundation for a revolution in our management of common diseases. Genomics would lead the way towards a future of personalised medicine.
Carrasco-Gutierrez, Carlos Enrique; Reis Gomes, Fábio Augusto
The aim of this work is to analyze the business cycles of Mercosur member countries in order to investigate their degree of synchronization. The econometric model uses the Beveridge-Nelson-Stock-Watson multivariate trend-cycle decomposition, taking into account the presence of common features such as common trend and common cycle. Once the business cycles are estimated, their degree of synchronization is analyzed by means of linear correlation in time domain and coherence and phase in f...
This article features Richard J. Klarchek Information Commons (IC) at Loyola University Chicago, an all-glass library building on the shore of Chicago's Lake Michigan that is not only a state-of-the-art digital research library and study space--it also runs on cutting-edge energy technology. The building has attracted attention and visitors from…
Sims, Gregory E.; Kim, Sung-Hou
A whole-genome phylogeny of the Escherichia coli/Shigella group was constructed by using the feature frequency profile (FFP) method. This alignment-free approach uses the frequencies of l-mer features of whole genomes to infer phylogenic distances. We present two phylogenies that accentuate different aspects of E. coli/Shigella genomic evolution: (i) one based on the compositions of all possible features of length l = 24 (∼8.4 million features), which are likely to reveal the phenetic grouping and relationship among the organisms and (ii) the other based on the compositions of core features with low frequency and low variability (∼0.56 million features), which account for ∼69% of all commonly shared features among 38 taxa examined and are likely to have genome-wide lineal evolutionary signal. Shigella appears as a single clade when all possible features are used without filtering of noncore features. However, results using core features show that Shigella consists of at least two distantly related subclades, implying that the subclades evolved into a single clade because of a high degree of convergence influenced by mobile genetic elements and niche adaptation. In both FFP trees, the basal group of the E. coli/Shigella phylogeny is the B2 phylogroup, which contains primarily uropathogenic strains, suggesting that the E. coli/Shigella ancestor was likely a facultative or opportunistic pathogen. The extant commensal strains diverged relatively late and appear to be the result of reductive evolution of genomes. We also identify clade distinguishing features and their associated genomic regions within each phylogroup. Such features may provide useful information for understanding evolution of the groups and for quick diagnostic identification of each phylogroup. PMID:21536867
Ping, Zheng; Siegal, Gene P.; Almeida, Jonas S.; Schnitt, Stuart J.; Shen, Dejun
Background: Genetics and genomics have radically altered our understanding of breast cancer progression. However, the genomic basis of various histopathologic features of breast cancer is not yet well-defined. Materials and Methods: The Cancer Genome Atlas (TCGA) is an international database containing a large collection of human cancer genome sequencing data. cBioPortal is a web tool developed for mining these sequencing data. We performed mining of TCGA sequencing data in an attempt to characterize the genomic features correlated with breast cancer histopathology. We first assessed the quality of the TCGA data using a group of genes with known alterations in various cancers. Both genome-wide gene mutation and copy number changes as well as a group of genes with a high frequency of genetic changes were then correlated with various histopathologic features of invasive breast cancer. Results: Validation of TCGA data using a group of genes with known alterations in breast cancer suggests that the TCGA has accurately documented the genomic abnormalities of multiple malignancies. Further analysis of TCGA breast cancer sequencing data shows that accumulation of specific genomic defects is associated with higher tumor grade, larger tumor size and receptor negativity. Distinct groups of genomic changes were found to be associated with the different grades of invasive ductal carcinoma. The mutator role of the TP53 gene was validated by genomic sequencing data of invasive breast cancer and TP53 mutation was found to play a critical role in defining high tumor grade. Conclusions: Data mining of the TCGA genome sequencing data is an innovative and reliable method to help characterize the genomic abnormalities associated with histopathologic features of invasive breast cancer. PMID:24672738
Full Text Available Background: Genetics and genomics have radically altered our understanding of breast cancer progression. However, the genomic basis of various histopathologic features of breast cancer is not yet well-defined. Materials and Methods: The Cancer Genome Atlas (TCGA is an international database containing a large collection of human cancer genome sequencing data. cBioPortal is a web tool developed for mining these sequencing data. We performed mining of TCGA sequencing data in an attempt to characterize the genomic features correlated with breast cancer histopathology. We first assessed the quality of the TCGA data using a group of genes with known alterations in various cancers. Both genome-wide gene mutation and copy number changes as well as a group of genes with a high frequency of genetic changes were then correlated with various histopathologic features of invasive breast cancer. Results: Validation of TCGA data using a group of genes with known alterations in breast cancer suggests that the TCGA has accurately documented the genomic abnormalities of multiple malignancies. Further analysis of TCGA breast cancer sequencing data shows that accumulation of specific genomic defects is associated with higher tumor grade, larger tumor size and receptor negativity. Distinct groups of genomic changes were found to be associated with the different grades of invasive ductal carcinoma. The mutator role of the TP53 gene was validated by genomic sequencing data of invasive breast cancer and TP53 mutation was found to play a critical role in defining high tumor grade. Conclusions: Data mining of the TCGA genome sequencing data is an innovative and reliable method to help characterize the genomic abnormalities associated with histopathologic features of invasive breast cancer.
Christiansen, Jens H; Christensen, Jeppe Høy; Grünbaum, Thor
Spatial features of an object can be specified using two different response types: either by use of symbols or motorically by directly acting upon the object. Is this response dichotomy reflected in a dual representation of the visual world: one for perception and one for action? Previously, symb...... of matching object-processing characteristics is also in agreement with the idea of a common representation driving both response types....
Desmedt, Christine; Zoppoli, Gabriele; Sotiriou, Christos; Salgado, Roberto
Accounting for 10-15% of all breast neoplasms, invasive lobular breast cancer (ILC) is the second most common histological subtype of breast cancer after invasive ductal breast cancer (IDC). Understanding ILC biology, which differs from IDC in terms of clinical presentation, treatment response, relapse timing and patterns, is essential in order to adopt novel, disease-specific management strategies. While the contribution of the histological subtypes to tumour biology has been poorly investigated and acknowledged in the past, recently several major, independent efforts have led to the assembly and molecular characterization of well-annotated ILC case sets. In this review, we provide a critical overview of the literature exploring ILC, through comprehensive and multiomic methods. The first part specifically focuses on ILC transcriptomic features by reviewing the intrinsic molecular subtypes, the application of gene expression scores for the prediction of recurrence, and the identification of gene expression subtypes. The second part describes the main research efforts that lead to the identification of the genomic landscape of ILC, with a special focus to findings that differentiate ILC from IDC and carry potential clinical relevance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Luckow, Kasper Søe; Thomsen, Bent; Frost, Christian
We present a novel tool for statically determining the Worst Case Execution Time (WCET) of Java Bytecode-based programs called Tool for Execution Time Analysis of Java bytecode (TetaJ). This tool differentiates itself from existing tools by separating the individual constituents of the execution...... environment into independent components. The prime benefit is that it can be used for execution environments featuring common embedded processors and software implementations of the JVM. TetaJ employs a model checking approach for statically determining WCET where the Java program, the JVM, and the hardware...
Full Text Available Abstract Background Milkweeds (Asclepias L. have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L. could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. Results A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp and 5S rDNA (120 bp sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp, with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae unigenes (median coverage of 0.29× and 66% of single copy orthologs (COSII in asterids (median coverage of 0.14×. From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites and phylogenetics (low-copy nuclear genes studies were developed. Conclusions The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species
Straub, Shannon C K; Fishbein, Mark; Livshultz, Tatyana; Foster, Zachary; Parks, Matthew; Weitemier, Kevin; Cronn, Richard C; Liston, Aaron
Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives. This study represents a first
Full Text Available Background: Common variable immunodeficiency (CVID is one of the most common symptomatic primary immunodeficiency syndromes. The purpose of this article was to broaden our knowledge about CVID for better diagnosis and treatment. Methods: Clinical and immunological features of 40 Chinese patients with CVID were analyzed retrospectively. Results: The median age at onset was 11-year-old (range 4-51 years. The median age at diagnosis was 14.5-year-old (range 5-66 years. The average time of delay in diagnosis was 5.3 years (range 1-41 years. The most common main complaint was fever due to infections (35 cases, 87.5%. Pneumonia (28 cases, 70% was the most common type of infections. Bronchiectasis was present in 6 patients (15%. Autoimmune disease was detected in 6 cases of CVID, and malignancy in 2 cases. The median total serum levels of IgG, IgA, and IgM at diagnosis were 1.07 g/L, 0.07 g/L, and 0.28 g/L, respectively. The percentages of CD3− /CD19 + B-cells were 1%-3.14%. Conclusions: Infection is the most frequent presentation of CVID. Patients with unexplainable infections should receive further examination including serum immunoglobulin (Ig and lymphocyte subset analysis. Regular and sufficient substitution with Ig is recommended.
Rohde, Palle Duun; Østergaard, Solveig; Kristensen, Torsten Nygaard
to investigate locomotor activity, and applied genomic feature prediction models to identify gene ontology (GO) cate- gories predictive of this phenotype. Next, we applied the covariance association test to partition the genomic variance of the predictive GO terms to the genes within these terms. We...... then functionally assessed whether the identified candidate genes affected locomotor activity by reducing gene expression using RNA interference. In five of the seven candidate genes tested, reduced gene expression altered the phenotype. The ranking of genes within the predictive GO term was highly correlated......Understanding the genetic underpinnings of complex traits requires knowledge of the genetic variants that contribute to phenotypic variability. Reliable statistical approaches are needed to obtain such knowledge. In genome-wide association studies, variants are tested for association with trait...
Zhang, Wei; Zeng, Erliang; Liu, Dan; Jones, Stuart E; Emrich, Scott
Recently, the utility of trait-based approaches for microbial communities has been identified. Increasing availability of whole genome sequences provide the opportunity to explore the genetic foundations of a variety of functional traits. We proposed a machine learning framework to quantitatively link the genomic features with functional traits. Genes from bacteria genomes belonging to different functional traits were grouped to Cluster of Orthologs (COGs), and were used as features. Then, TF-IDF technique from the text mining domain was applied to transform the data to accommodate the abundance and importance of each COG. After TF-IDF processing, COGs were ranked using feature selection methods to identify their relevance to the functional trait of interest. Extensive experimental results demonstrated that functional trait related genes can be detected using our method. Further, the method has the potential to provide novel biological insights.
I. B. Bovina
Full Text Available We discuss the first phase results of a research project dedicated to study of suicide representations in youth. In the framework of structural approach to social representations, we study features of structure and content of social representations of suiciders in two groups of young people (the criterion for group allocation was their acquaintance with people who has suicide attempts. Our sample (N = 106 consisted of representatives of several youth groups (students and working youths with specialized secondary, higher or incomplete higher education, aged 18 to 35 years (M = 23,48 years, SD = 4,36 years: 67 women and 39 men. The 1st group includes respondents personally acquainted with suicide attempters (44 respondents, the 2nd group – respondents without such experience. The subject of research were common representations of suiciders. We tested assumptions about the specificity of protective functions of social representations, as well as consistency of representations in the two groups of respondents.
Holly H Ganz
Full Text Available Here we present vB_BanS-Tsamsa, a novel temperate phage isolated from Bacillus anthracis, the agent responsible for anthrax infections in wildlife, livestock and humans. Tsamsa phage is a giant siphovirus (order Caudovirales, featuring a long, flexible and non-contractile tail of 440 nm (not including baseplate structure and an isometric head of 82 nm in diameter. We induced Tsamsa phage in samples from two different carcass sites in Etosha National Park, Namibia. The Tsamsa phage genome is the largest sequenced Bacillus siphovirus, containing 168,876 bp and 272 ORFs. The genome features an integrase/recombinase enzyme, indicative of a temperate lifestyle. Among bacterial strains tested, the phage infected only certain members of the Bacillus cereus sensu lato group (B. anthracis, B. cereus and B. thuringiensis and exhibited moderate specificity for B. anthracis. Tsamsa lysed seven out of 25 B. cereus strains, two out of five B. thuringiensis strains and six out of seven B. anthracis strains tested. It did not lyse B. anthracis PAK-1, an atypical strain that is also resistant to both gamma phage and cherry phage. The Tsamsa endolysin features a broader lytic spectrum than the phage host range, indicating possible use of the enzyme in Bacillus biocontrol.
Full Text Available Abstract Background Common carp is one of the most important aquaculture teleost fish in the world. Common carp and other closely related Cyprinidae species provide over 30% aquaculture production in the world. However, common carp genomic resources are still relatively underdeveloped. BAC end sequences (BES are important resources for genome research on BAC-anchored genetic marker development, linkage map and physical map integration, and whole genome sequence assembling and scaffolding. Result To develop such valuable resources in common carp (Cyprinus carpio, a total of 40,224 BAC clones were sequenced on both ends, generating 65,720 clean BES with an average read length of 647 bp after sequence processing, representing 42,522,168 bp or 2.5% of common carp genome. The first survey of common carp genome was conducted with various bioinformatics tools. The common carp genome contains over 17.3% of repetitive elements with GC content of 36.8% and 518 transposon ORFs. To identify and develop BAC-anchored microsatellite markers, a total of 13,581 microsatellites were detected from 10,355 BES. The coding region of 7,127 genes were recognized from 9,443 BES on 7,453 BACs, with 1,990 BACs have genes on both ends. To evaluate the similarity to the genome of closely related zebrafish, BES of common carp were aligned against zebrafish genome. A total of 39,335 BES of common carp have conserved homologs on zebrafish genome which demonstrated the high similarity between zebrafish and common carp genomes, indicating the feasibility of comparative mapping between zebrafish and common carp once we have physical map of common carp. Conclusion BAC end sequences are great resources for the first genome wide survey of common carp. The repetitive DNA was estimated to be approximate 28% of common carp genome, indicating the higher complexity of the genome. Comparative analysis had mapped around 40,000 BES to zebrafish genome and established over 3
Background Common carp is one of the most important aquaculture teleost fish in the world. Common carp and other closely related Cyprinidae species provide over 30% aquaculture production in the world. However, common carp genomic resources are still relatively underdeveloped. BAC end sequences (BES) are important resources for genome research on BAC-anchored genetic marker development, linkage map and physical map integration, and whole genome sequence assembling and scaffolding. Result To develop such valuable resources in common carp (Cyprinus carpio), a total of 40,224 BAC clones were sequenced on both ends, generating 65,720 clean BES with an average read length of 647 bp after sequence processing, representing 42,522,168 bp or 2.5% of common carp genome. The first survey of common carp genome was conducted with various bioinformatics tools. The common carp genome contains over 17.3% of repetitive elements with GC content of 36.8% and 518 transposon ORFs. To identify and develop BAC-anchored microsatellite markers, a total of 13,581 microsatellites were detected from 10,355 BES. The coding region of 7,127 genes were recognized from 9,443 BES on 7,453 BACs, with 1,990 BACs have genes on both ends. To evaluate the similarity to the genome of closely related zebrafish, BES of common carp were aligned against zebrafish genome. A total of 39,335 BES of common carp have conserved homologs on zebrafish genome which demonstrated the high similarity between zebrafish and common carp genomes, indicating the feasibility of comparative mapping between zebrafish and common carp once we have physical map of common carp. Conclusion BAC end sequences are great resources for the first genome wide survey of common carp. The repetitive DNA was estimated to be approximate 28% of common carp genome, indicating the higher complexity of the genome. Comparative analysis had mapped around 40,000 BES to zebrafish genome and established over 3,100 microsyntenies, covering over 50% of
Reva, Oleg N.; Hallin, Peter Fischer; Willenbrock, Hanni
The global feature of the completely sequenced Alcanivorax borkumensis SK2 type strain chromosome is its symmetry and homogeneity. The origin and terminus of replication are located opposite to each other in the chromosome and are discerned with high signal to noise ratios by maximal oligonucleot......The global feature of the completely sequenced Alcanivorax borkumensis SK2 type strain chromosome is its symmetry and homogeneity. The origin and terminus of replication are located opposite to each other in the chromosome and are discerned with high signal to noise ratios by maximal...... oligonucleotide usage biases on the leading and lagging strand. Genomic DNA structure is rather uniform throughout the chromosome with respect to intrinsic curvature, position preference or base stacking energy. The orthologs and paralogs of A. borkumensis genes with the highest sequence homology were found...
Full Text Available BACKGROUND: Miniature inverted-repeat transposable elements (MITEs, which are common in eukaryotic genomes, are small non-coding elements that transpose by utilizing transposases encoded by autonomous transposons. Recent genome-wide analyses and cross-mobilization assays have greatly improved our knowledge on MITE proliferation, however, specific mechanisms for the origin and evolution of MITEs are still unclear. PRINCIPAL FINDINGS: A group of coral MITEs called CMITE were identified from two corals, Acropora millepora and Acropora palmata. CMITEs conform to many common characteristics of MITEs, but also present several unusual features. The most unusual feature of CMITEs is conservation of the internal region, which is more conserved between MITE families than the TIRs. The origin of this internal region remains unknown, although we found one CMITE family that seems to be derived from a piggyBac-like transposon in A. millepora. CMITEs can form tandem arrays, suggesting an unconventional way for MITEs to increase copy numbers. We also describe a case in which a novel transposable element was created by a CMITE insertion event. CONCLUSIONS: To our knowledge, this is the first report of identification of MITEs from coral genomes. Proliferation of CMITEs seems to be related to the transposition machinery of piggyBac-like autonomous transposons. The highly conserved internal region of CMITEs suggests a potential role for this region in their successful transposition. However, the origin of these unusual features in CMITEs remains unclear, and thus represents an intriguing topic for future investigations.
Wang, Shi; Zhang, Lingling; Meyer, Eli; Matz, Mikhail V
Miniature inverted-repeat transposable elements (MITEs), which are common in eukaryotic genomes, are small non-coding elements that transpose by utilizing transposases encoded by autonomous transposons. Recent genome-wide analyses and cross-mobilization assays have greatly improved our knowledge on MITE proliferation, however, specific mechanisms for the origin and evolution of MITEs are still unclear. A group of coral MITEs called CMITE were identified from two corals, Acropora millepora and Acropora palmata. CMITEs conform to many common characteristics of MITEs, but also present several unusual features. The most unusual feature of CMITEs is conservation of the internal region, which is more conserved between MITE families than the TIRs. The origin of this internal region remains unknown, although we found one CMITE family that seems to be derived from a piggyBac-like transposon in A. millepora. CMITEs can form tandem arrays, suggesting an unconventional way for MITEs to increase copy numbers. We also describe a case in which a novel transposable element was created by a CMITE insertion event. To our knowledge, this is the first report of identification of MITEs from coral genomes. Proliferation of CMITEs seems to be related to the transposition machinery of piggyBac-like autonomous transposons. The highly conserved internal region of CMITEs suggests a potential role for this region in their successful transposition. However, the origin of these unusual features in CMITEs remains unclear, and thus represents an intriguing topic for future investigations.
van Passel, Mark Wj
Horizontal gene transfer constitutes a powerful and innovative force in evolution, but often little is known about the actual origins of transferred genes. Sequence alignments are generally of limited use in tracking the original donor, since still only a small fraction of the total genetic diversity is thought to be uncovered. Alternatively, approaches based on similarities in the genome specific relative oligonucleotide frequencies do not require alignments. Even though the exact origins of horizontally transferred genes may still not be established using these compositional analyses, it does suggest that compositionally very similar regions are likely to have had a common origin. These analyses have shown that up to a third of large acquired gene clusters that reside in the same genome are compositionally very similar, indicative of a shared origin. This brings us closer to uncovering the original donors of horizontally transferred genes, and could help in elucidating possible regulatory interactions between previously unlinked sequences.
Joshua D. Campbell
Full Text Available Summary: This integrated, multiplatform PanCancer Atlas study co-mapped and identified distinguishing molecular features of squamous cell carcinomas (SCCs from five sites associated with smoking and/or human papillomavirus (HPV. SCCs harbor 3q, 5p, and other recurrent chromosomal copy-number alterations (CNAs, DNA mutations, and/or aberrant methylation of genes and microRNAs, which are correlated with the expression of multi-gene programs linked to squamous cell stemness, epithelial-to-mesenchymal differentiation, growth, genomic integrity, oxidative damage, death, and inflammation. Low-CNA SCCs tended to be HPV(+ and display hypermethylation with repression of TET1 demethylase and FANCF, previously linked to predisposition to SCC, or harbor mutations affecting CASP8, RAS-MAPK pathways, chromatin modifiers, and immunoregulatory molecules. We uncovered hypomethylation of the alternative promoter that drives expression of the ΔNp63 oncogene and embedded miR944. Co-expression of immune checkpoint, T-regulatory, and Myeloid suppressor cells signatures may explain reduced efficacy of immune therapy. These findings support possibilities for molecular classification and therapeutic approaches. : Campbell et al. reveal that squamous cell cancers from different tissue sites may be distinguished from other cancers and subclassified molecularly by recurrent alterations in chromosomes, DNA methylation, messenger and microRNA expression, or by mutations. These affect squamous cell pathways and programs that provide candidates for therapy. Keywords: genomics, transcriptomics, proteomics, head and neck squamous cell carcinoma, lung squamous cell carcinoma, esophageal squamous cell carcinoma, cervical squamous cell carcinoma, bladder carcinoma with squamous differentiation, human papillomavirus
Investigators with The Cancer Genome Atlas (TCGA) Research Network have identified novel genomic and molecular characteristics of cervical cancer that will aid in subclassification of the disease and may help target therapies that are most appropriate for each patient.
Alessandro M. Varani
Full Text Available The Xylella fastidiosa comparative genomic database is a scientific resource with the aim to provide a user-friendly interface for accessing high-quality manually curated genomic annotation and comparative sequence analysis, as well as for identifying and mapping prophage-like elements, a marked feature of Xylella genomes. Here we describe a database and tools for exploring the biology of this important plant pathogen. The hallmarks of this database are the high quality genomic annotation, the functional and comparative genomic analysis and the identification and mapping of prophage-like elements. It is available from web site http://www.xylella.lncc.br.
Lyne, Mike; Smith, Richard N; Lyne, Rachel; Aleksic, Jelena; Hu, Fengyuan; Kalderimis, Alex; Stepan, Radek; Micklem, Gos
Common metabolic and endocrine diseases such as diabetes affect millions of people worldwide and have a major health impact, frequently leading to complications and mortality. In a search for better prevention and treatment, there is ongoing research into the underlying molecular and genetic bases of these complex human diseases, as well as into the links with risk factors such as obesity. Although an increasing number of relevant genomic and proteomic data sets have become available, the quantity and diversity of the data make their efficient exploitation challenging. Here, we present metabolicMine, a data warehouse with a specific focus on the genomics, genetics and proteomics of common metabolic diseases. Developed in collaboration with leading UK metabolic disease groups, metabolicMine integrates data sets from a range of experiments and model organisms alongside tools for exploring them. The current version brings together information covering genes, proteins, orthologues, interactions, gene expression, pathways, ontologies, diseases, genome-wide association studies and single nucleotide polymorphisms. Although the emphasis is on human data, key data sets from mouse and rat are included. These are complemented by interoperation with the RatMine rat genomics database, with a corresponding mouse version under development by the Mouse Genome Informatics (MGI) group. The web interface contains a number of features including keyword search, a library of Search Forms, the QueryBuilder and list analysis tools. This provides researchers with many different ways to analyse, view and flexibly export data. Programming interfaces and automatic code generation in several languages are supported, and many of the features of the web interface are available through web services. The combination of diverse data sets integrated with analysis tools and a powerful query system makes metabolicMine a valuable research resource. The web interface makes it accessible to first
Coupe, Robert L.M.
Common causes of hair loss include androgenic hair loss, alopecia areata, trichotillomania, tinea capitis, telogen effluvium, and traction alopecia. The author discusses their distinguishing clinical features and those of less common alopecias.
Jensen, Mark A; Ferretti, Vincent; Grossman, Robert L; Staudt, Louis M
The National Cancer Institute Genomic Data Commons (GDC) is an information system for storing, analyzing, and sharing genomic and clinical data from patients with cancer. The recent high-throughput sequencing of cancer genomes and transcriptomes has produced a big data problem that precludes many cancer biologists and oncologists from gleaning knowledge from these data regarding the nature of malignant processes and the relationship between tumor genomic profiles and treatment response. The GDC aims to democratize access to cancer genomic data and to foster the sharing of these data to promote precision medicine approaches to the diagnosis and treatment of cancer.
Kravatsky, Yuri V; Chechetkin, Vladimir R; Tchurikov, Nikolai A; Kravatskaya, Galina I
The broad class of tasks in genetics and epigenetics can be reduced to the study of various features that are distributed over the genome (genome tracks). The rapid and efficient processing of the huge amount of data stored in the genome-scale databases cannot be achieved without the software packages based on the analytical criteria. However, strong inhomogeneity of genome tracks hampers the development of relevant statistics. We developed the criteria for the assessment of genome track inhomogeneity and correlations between two genome tracks. We also developed a software package, Genome Track Analyzer, based on this theory. The theory and software were tested on simulated data and were applied to the study of correlations between CpG islands and transcription start sites in the Homo sapiens genome, between profiles of protein-binding sites in chromosomes of Drosophila melanogaster, and between DNA double-strand breaks and histone marks in the H. sapiens genome. Significant correlations between transcription start sites on the forward and the reverse strands were observed in genomes of D. melanogaster, Caenorhabditis elegans, Mus musculus, H. sapiens, and Danio rerio. The observed correlations may be related to the regulation of gene expression in eukaryotes. Genome Track Analyzer is freely available at http://ancorr.eimb.ru/. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Xu, Peng; Zhang, Xiaofeng; Wang, Xumin; Li, Jiongtang; Liu, Guiming; Kuang, Youyi; Xu, Jian; Zheng, Xianhu; Ren, Lufeng; Wang, Guoliang; Zhang, Yan; Huo, Linhe; Zhao, Zixia; Cao, Dingchen; Lu, Cuiyun; Li, Chao; Zhou, Yi; Liu, Zhanjiang; Fan, Zhonghua; Shan, Guangle; Li, Xingang; Wu, Shuangxiu; Song, Lipu; Hou, Guangyuan; Jiang, Yanliang; Jeney, Zsigmond; Yu, Dan; Wang, Li; Shao, Changjun; Song, Lai; Sun, Jing; Ji, Peifeng; Wang, Jian; Li, Qiang; Xu, Liming; Sun, Fanyue; Feng, Jianxin; Wang, Chenghui; Wang, Shaolin; Wang, Baosen; Li, Yan; Zhu, Yaping; Xue, Wei; Zhao, Lan; Wang, Jintu; Gu, Ying; Lv, Weihua; Wu, Kejing; Xiao, Jingfa; Wu, Jiayan; Zhang, Zhang; Yu, Jun; Sun, Xiaowen
The common carp, Cyprinus carpio, is one of the most important cyprinid species and globally accounts for 10% of freshwater aquaculture production. Here we present a draft genome of domesticated C. carpio (strain Songpu), whose current assembly contains 52,610 protein-coding genes and approximately 92.3% coverage of its paleotetraploidized genome (2n = 100). The latest round of whole-genome duplication has been estimated to have occurred approximately 8.2 million years ago. Genome resequencing of 33 representative individuals from worldwide populations demonstrates a single origin for C. carpio in 2 subspecies (C. carpio Haematopterus and C. carpio carpio). Integrative genomic and transcriptomic analyses were used to identify loci potentially associated with traits including scaling patterns and skin color. In combination with the high-resolution genetic map, the draft genome paves the way for better molecular studies and improved genome-assisted breeding of C. carpio and other closely related species.
Full Text Available Aquaporins (Aqps are integral membrane proteins that facilitate the transport of water and small solutes across cell membranes. Among vertebrate species, Aqps are highly conserved in both gene structure and amino acid sequence. These proteins are vital for maintaining water homeostasis in living organisms, especially for aquatic animals such as teleost fish. Studies on teleost Aqps are mainly limited to several model species with diploid genomes. Common carp, which has a tetraploidized genome, is one of the most common aquaculture species being adapted to a wide range of aquatic environments. The complete common carp genome has recently been released, providing us the possibility for gene evolution of aqp gene family after whole genome duplication.In this study, we identified a total of 37 aqp genes from common carp genome. Phylogenetic analysis revealed that most of aqps are highly conserved. Comparative analysis was performed across five typical vertebrate genomes. We found that almost all of the aqp genes in common carp were duplicated in the evolution of the gene family. We postulated that the expansion of the aqp gene family in common carp was the result of an additional whole genome duplication event and that the aqp gene family in other teleosts has been lost in their evolution history with the reason that the functions of genes are redundant and conservation. Expression patterns were assessed in various tissues, including brain, heart, spleen, liver, intestine, gill, muscle, and skin, which demonstrated the comprehensive expression profiles of aqp genes in the tetraploidized genome. Significant gene expression divergences have been observed, revealing substantial expression divergences or functional divergences in those duplicated aqp genes post the latest WGD event.To some extent, the gene families are also considered as a unique source for evolutionary studies. Moreover, the whole set of common carp aqp gene family provides an
A.C.J.W. Janssens (Cécile); P. Tikka-Kleemola (Päivi)
textabstractThe translation of emerging genomic knowledge into public health and clinical care is one of the major challenges for the coming decades. At the moment, genome-based prediction of common diseases, such as type 2 diabetes, coronary heart disease and cancer, is still not informative. Our
Lamas, Natalia Silva; Fontenele, Rafaela Salgado; Melo, Fernando Lucas; Costa, Antonio Felix; Varsani, Arvind; Ribeiro, Simone Graça
A new genomovirus has been identified in three common bean plants in Brazil. This virus has a circular genome of 2,220 nucleotides and 3 major open reading frames. It shares 80.7% genome-wide pairwise identity with a genomovirus recovered from Tongan fruit bat guano. Copyright © 2016 Lamas et al.
Stavrovskaya, Elena D; Niranjan, Tejasvi; Fertig, Elana J; Wheelan, Sarah J; Favorov, Alexander V; Mironov, Andrey A
Genomics features with similar genome-wide distributions are generally hypothesized to be functionally related, for example, colocalization of histones and transcription start sites indicate chromatin regulation of transcription factor activity. Therefore, statistical algorithms to perform spatial, genome-wide correlation among genomic features are required. Here, we propose a method, StereoGene, that rapidly estimates genome-wide correlation among pairs of genomic features. These features may represent high-throughput data mapped to reference genome or sets of genomic annotations in that reference genome. StereoGene enables correlation of continuous data directly, avoiding the data binarization and subsequent data loss. Correlations are computed among neighboring genomic positions using kernel correlation. Representing the correlation as a function of the genome position, StereoGene outputs the local correlation track as part of the analysis. StereoGene also accounts for confounders such as input DNA by partial correlation. We apply our method to numerous comparisons of ChIP-Seq datasets from the Human Epigenome Atlas and FANTOM CAGE to demonstrate its wide applicability. We observe the changes in the correlation between epigenomic features across developmental trajectories of several tissue types consistent with known biology and find a novel spatial correlation of CAGE clusters with donor splice sites and with poly(A) sites. These analyses provide examples for the broad applicability of StereoGene for regulatory genomics. The StereoGene C ++ source code, program documentation, Galaxy integration scripts and examples are available from the project homepage http://stereogene.bioinf.fbb.msu.ru/. email@example.com. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: firstname.lastname@example.org
Zhang, Yan-Cong; Lin, Kui
Overlapping genes (OGs) represent one type of widespread genomic feature in bacterial genomes and have been used as rare genomic markers in phylogeny inference of closely related bacterial species. However, the inference may experience a decrease in performance for phylogenomic analysis of too closely or too distantly related genomes. Another drawback of OGs as phylogenetic markers is that they usually take little account of the effects of genomic rearrangement on the similarity estimation, such as intra-chromosome/genome translocations, horizontal gene transfer, and gene losses. To explore such effects on the accuracy of phylogeny reconstruction, we combine phylogenetic signals of OGs with collinear genomic regions, here called locally collinear blocks (LCBs). By putting these together, we refine our previous metric of pairwise similarity between two closely related bacterial genomes. As a case study, we used this new method to reconstruct the phylogenies of 88 Enterobacteriale genomes of the class Gammaproteobacteria. Our results demonstrated that the topological accuracy of the inferred phylogeny was improved when both OGs and LCBs were simultaneously considered, suggesting that combining these two phylogenetic markers may reduce, to some extent, the influence of gene loss on phylogeny inference. Such phylogenomic studies, we believe, will help us to explore a more effective approach to increasing the robustness of phylogeny reconstruction of closely related bacterial organisms. PMID:26715828
Parker, K; Pabla, R; Hay, N; Ayliffe, P
Ter Haar syndrome is a rare genetic syndrome with <30 cases reported worldwide. There is nothing within the published literature regarding the dental development and dental features of these patients. This case series examines three patients with Ter Haar syndrome and tracks their dental development and identifies common dental and skeletal features. All three patients received dental treatment and regular follow-up at Great Ormond Street Hospital Dental Department. These patients have many common dental and craniofacial features which poses the question as to whether these features are due to Ter Haar syndrome.
Lubke, G.H.; Laurin, C.; Amin, N.; Hottenga, J.J.; Willemsen, G.; Grootheest, G.; Abdellaoui, A.; Karssen, L.C.; Oostra, B.A.; van Duijn, C.M.; Penninx, B.W.J.H.; Boomsma, D.I.
The heritability of borderline personality (BP) features has been established in multiple twin and family studies. Using data from the borderline subscale of the Personality Assessment Inventory Borderline Features Scale (PAI-BOR) collected in two Dutch cohorts (N=7125), the Netherlands Twin
Winsor, Geoffrey L; Griffiths, Emma J; Lo, Raymond; Dhillon, Bhavjinder K; Shay, Julie A; Brinkman, Fiona S L
The Pseudomonas Genome Database (http://www.pseudomonas.com) is well known for the application of community-based annotation approaches for producing a high-quality Pseudomonas aeruginosa PAO1 genome annotation, and facilitating whole-genome comparative analyses with other Pseudomonas strains. To aid analysis of potentially thousands of complete and draft genome assemblies, this database and analysis platform was upgraded to integrate curated genome annotations and isolate metadata with enhanced tools for larger scale comparative analysis and visualization. Manually curated gene annotations are supplemented with improved computational analyses that help identify putative drug targets and vaccine candidates or assist with evolutionary studies by identifying orthologs, pathogen-associated genes and genomic islands. The database schema has been updated to integrate isolate metadata that will facilitate more powerful analysis of genomes across datasets in the future. We continue to place an emphasis on providing high-quality updates to gene annotations through regular review of the scientific literature and using community-based approaches including a major new Pseudomonas community initiative for the assignment of high-quality gene ontology terms to genes. As we further expand from thousands of genomes, we plan to provide enhancements that will aid data visualization and analysis arising from whole-genome comparative studies including more pan-genome and population-based approaches. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Rohde, Palle Duun; Østergaard, Solveig; Kristensen, Torsten Nygaard
Understanding the genetic underpinnings of complex traits requires knowledge of the genetic variants that contribute to phenotypic variability. Reliable statistical approaches are needed to obtain such knowledge. In genome-wide association studies, variants are tested for association with trait...... then functionally assessed whether the identified candidate genes affected locomotor activity by reducing gene expression using RNA interference. In five of the seven candidate genes tested, reduced gene expression altered the phenotype. The ranking of genes within the predictive GO term was highly correlated...
Fédrigo, Olivier; Haygood, Ralph; Mukherjee, Sayan; Wray, Gregory A.
Variation in gene expression is an important contributor to phenotypic diversity within and between species. Although this variation often has a genetic component, identification of the genetic variants driving this relationship remains challenging. In particular, measurements of gene expression usually do not reveal whether the genetic basis for any observed variation lies in cis or in trans to the gene, a distinction that has direct relevance to the physical location of the underlying genetic variant, and which may also impact its evolutionary trajectory. Allelic imbalance measurements identify cis-acting genetic effects by assaying the relative contribution of the two alleles of a cis-regulatory region to gene expression within individuals. Identification of patterns that predict commonly imbalanced genes could therefore serve as a useful tool and also shed light on the evolution of cis-regulatory variation itself. Here, we show that sequence motifs, polymorphism levels, and divergence levels around a gene can be used to predict commonly imbalanced genes in a human data set. Reduction of this feature set to four factors revealed that only one factor significantly differentiated between commonly imbalanced and nonimbalanced genes. We demonstrate that these results are consistent between the original data set and a second published data set in humans obtained using different technical and statistical methods. Finally, we show that variation in the single allelic imbalance-associated factor is partially explained by the density of genes in the region of a target gene (allelic imbalance is less probable for genes in gene-dense regions), and, to a lesser extent, the evenness of expression of the gene across tissues and the magnitude of negative selection on putative regulatory regions of the gene. These results suggest that the genomic distribution of functional cis-regulatory variants in the human genome is nonrandom, perhaps due to local differences in evolutionary
Lyu, Haomin; He, Ziwen; Wu, Chung-I; Shi, Suhua
Several clades of mangrove trees independently invade the interface between land and sea at the margin of woody plant distribution. As phenotypic convergence among mangroves is common, the possibility of convergent adaptation in their genomes is quite intriguing. To study this molecular convergence, we sequenced multiple mangrove genomes. In this study, we focused on the evolution of transposable elements (TEs) in relation to the genome size evolution. TEs, generally considered genomic parasites, are the most common components of woody plant genomes. Analyzing the long terminal repeat-retrotransposon (LTR-RT) type of TE, we estimated their death rates by counting solo-LTRs and truncated elements. We found that all lineages of mangroves massively and convergently reduce TE loads in comparison to their nonmangrove relatives; as a consequence, genome size reduction happens independently in all six mangrove lineages; TE load reduction in mangroves can be attributed to the paucity of young elements; the rarity of young LTR-RTs is a consequence of fewer births rather than access death. In conclusion, mangrove genomes employ a convergent strategy of TE load reduction by suppressing element origination in their independent adaptation to a new environment. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
While the 1970's have been called the environmental years, the 1990's could be seen as the genome years. As the challenge to map and to sequence the human genome mobilized the scientific community, risks and benefits of information and uses that would derive from this project have also raised ethical issues at the international level. The particular interest of the 1997 UNESCO Declaration relies on the fact that it emphasizes both the scientific importance of genetics and the appropriate reinforcement of human rights in this area. It considers the human genome, at least symbolically, as the common heritage of humanity.
Full Text Available Qat (Catha edulis, Celastraceae is a woody evergreen species with great economic and cultural importance. It is cultivated for its stimulant alkaloids cathine and cathinone in East Africa and southwest Arabia. However, genome information, especially DNA sequence resources, for C. edulis are limited, hindering studies regarding interspecific and intraspecific relationships. Herein, the complete chloroplast (cp genome of Catha edulis is reported. This genome is 157,960 bp in length with 37% GC content and is structurally arranged into two 26,577 bp inverted repeats and two single-copy areas. The size of the small single-copy and the large single-copy regions were 18,491 bp and 86,315 bp, respectively. The C. edulis cp genome consists of 129 coding genes including 37 transfer RNA (tRNA genes, 8 ribosomal RNA (rRNA genes, and 84 protein coding genes. For those genes, 112 are single copy genes and 17 genes are duplicated in two inverted regions with seven tRNAs, four rRNAs, and six protein coding genes. The phylogenetic relationships resolved from the cp genome of qat and 32 other species confirms the monophyly of Celastraceae. The cp genomes of C. edulis, Euonymus japonicus and seven Celastraceae species lack the rps16 intron, which indicates an intron loss took place among an ancestor of this family. The cp genome of C. edulis provides a highly valuable genetic resource for further phylogenomic research, barcoding and cp transformation in Celastraceae.
Tembrock, Luke R.; Zheng, Shaoyu; Wu, Zhiqiang
Qat (Catha edulis, Celastraceae) is a woody evergreen species with great economic and cultural importance. It is cultivated for its stimulant alkaloids cathine and cathinone in East Africa and southwest Arabia. However, genome information, especially DNA sequence resources, for C. edulis are limited, hindering studies regarding interspecific and intraspecific relationships. Herein, the complete chloroplast (cp) genome of Catha edulis is reported. This genome is 157,960 bp in length with 37% GC content and is structurally arranged into two 26,577 bp inverted repeats and two single-copy areas. The size of the small single-copy and the large single-copy regions were 18,491 bp and 86,315 bp, respectively. The C. edulis cp genome consists of 129 coding genes including 37 transfer RNA (tRNA) genes, 8 ribosomal RNA (rRNA) genes, and 84 protein coding genes. For those genes, 112 are single copy genes and 17 genes are duplicated in two inverted regions with seven tRNAs, four rRNAs, and six protein coding genes. The phylogenetic relationships resolved from the cp genome of qat and 32 other species confirms the monophyly of Celastraceae. The cp genomes of C. edulis, Euonymus japonicus and seven Celastraceae species lack the rps16 intron, which indicates an intron loss took place among an ancestor of this family. The cp genome of C. edulis provides a highly valuable genetic resource for further phylogenomic research, barcoding and cp transformation in Celastraceae. PMID:29425128
Joshua D. Campbell; Christina Yau; Reanne Bowlby; Yuexin Liu; Kevin Brennan; Huihui Fan; Alison M. Taylor; Chen Wang; Vonn Walter; Rehan Akbani; Lauren Averett Byers; Chad J. Creighton; Cristian Coarfa; Juliann Shih; Andrew D. Cherniack
Summary: This integrated, multiplatform PanCancer Atlas study co-mapped and identified distinguishing molecular features of squamous cell carcinomas (SCCs) from five sites associated with smoking and/or human papillomavirus (HPV). SCCs harbor 3q, 5p, and other recurrent chromosomal copy-number alterations (CNAs), DNA mutations, and/or aberrant methylation of genes and microRNAs, which are correlated with the expression of multi-gene programs linked to squamous cell stemness, epithelial-to-mes...
Stephen B Montgomery
Full Text Available Population-scale genome sequencing allows the characterization of functional effects of a broad spectrum of genetic variants underlying human phenotypic variation. Here, we investigate the influence of rare and common genetic variants on gene expression patterns, using variants identified from sequencing data from the 1000 genomes project in an African and European population sample and gene expression data from lymphoblastoid cell lines. We detect comparable numbers of expression quantitative trait loci (eQTLs when compared to genotypes obtained from HapMap 3, but as many as 80% of the top expression quantitative trait variants (eQTVs discovered from 1000 genomes data are novel. The properties of the newly discovered variants suggest that mapping common causal regulatory variants is challenging even with full resequencing data; however, we observe significant enrichment of regulatory effects in splice-site and nonsense variants. Using RNA sequencing data, we show that 46.2% of nonsynonymous variants are differentially expressed in at least one individual in our sample, creating widespread potential for interactions between functional protein-coding and regulatory variants. We also use allele-specific expression to identify putative rare causal regulatory variants. Furthermore, we demonstrate that outlier expression values can be due to rare variant effects, and we approximate the number of such effects harboured in an individual by effect size. Our results demonstrate that integration of genomic and RNA sequencing analyses allows for the joint assessment of genome sequence and genome function.
Farfel, Jose M; Yu, Lei; Buchman, Aron S; Schneider, Julie A; De Jager, Philip L; Bennett, David A
To investigate the associations of previously reported Alzheimer disease (AD) dementia genomic variants with common neuropathologies. This is a postmortem study including 1,017 autopsied participants from 2 clinicopathologic cohorts. Analyses focused on 22 genomic variants associated with AD dementia in large-scale case-control genome-wide association study (GWAS) meta-analyses. The neuropathologic traits of interest were a pathologic diagnosis of AD according to NIA-Reagan criteria, macroscopic and microscopic infarcts, Lewy bodies (LB), and hippocampal sclerosis. For each variant, multiple logistic regression was used to investigate its association with neuropathologic traits, adjusting for age, sex, and subpopulation structure. We also conducted power analyses to estimate the sample sizes required to detect genome-wide significance (p dementia variants are not likely to be detected for association with pathologic AD with a sample size in excess of the largest GWAS meta-analyses of AD dementia. Many recently discovered genomic variants for AD dementia are not associated with the pathology of AD. Some genomic variants for AD dementia appear to be associated with other common neuropathologies. © 2016 American Academy of Neurology.
Cornick, Jennifer E.; Chaguza, Chrispin; Yalcin, Feyruz; Harris, Simon R.; Gray, Katherine J.; Kiran, Anmol M.; Molyneux, Elizabeth; French, Neil; Faragher, Brian E.; Everett, Dean B.; Bentley, Stephen D.
Streptococcus pneumoniae is a nasopharyngeal commensal that occasionally invades normally sterile sites to cause bloodstream infection and meningitis. Although the pneumococcal population structure and evolutionary genetics are well defined, it is not clear whether pneumococci that cause meningitis are genetically distinct from those that do not. Here, we used whole-genome sequencing of 140 isolates of S. pneumoniae recovered from bloodstream infection (n = 70) and meningitis (n = 70) to compare their genetic contents. By fitting a double-exponential decaying-function model, we show that these isolates share a core of 1,427 genes (95% confidence interval [CI], 1,425 to 1,435 genes) and that there is no difference in the core genome or accessory gene content from these disease manifestations. Gene presence/absence alone therefore does not explain the virulence behavior of pneumococci that reach the meninges. Our analysis, however, supports the requirement of a range of previously described virulence factors and vaccine candidates for both meningitis- and bacteremia-causing pneumococci. This high-resolution view suggests that, despite considerable competency for genetic exchange, all pneumococci are under considerable pressure to retain key components advantageous for colonization and transmission and that these components are essential for access to and survival in sterile sites. PMID:26259813
Promponas Vasilis J
Full Text Available Abstract Background The availability of increasing amounts of sequence data from completely sequenced genomes boosts the development of new computational methods for automated genome annotation and comparative genomics. Therefore, there is a need for tools that facilitate the visualization of raw data and results produced by bioinformatics analysis, providing new means for interactive genome exploration. Visual inspection can be used as a basis to assess the quality of various analysis algorithms and to aid in-depth genomic studies. Results GeneViTo is a JAVA-based computer application that serves as a workbench for genome-wide analysis through visual interaction. The application deals with various experimental information concerning both DNA and protein sequences (derived from public sequence databases or proprietary data sources and meta-data obtained by various prediction algorithms, classification schemes or user-defined features. Interaction with a Graphical User Interface (GUI allows easy extraction of genomic and proteomic data referring to the sequence itself, sequence features, or general structural and functional features. Emphasis is laid on the potential comparison between annotation and prediction data in order to offer a supplement to the provided information, especially in cases of "poor" annotation, or an evaluation of available predictions. Moreover, desired information can be output in high quality JPEG image files for further elaboration and scientific use. A compilation of properly formatted GeneViTo input data for demonstration is available to interested readers for two completely sequenced prokaryotes, Chlamydia trachomatis and Methanococcus jannaschii. Conclusions GeneViTo offers an inspectional view of genomic functional elements, concerning data stemming both from database annotation and analysis tools for an overall analysis of existing genomes. The application is compatible with Linux or Windows ME-2000-XP operating
We report on a comprehensive examination of the exotic superconductors (the materials so-labelled by Uemura and co-workers), to determine as far as possible the true systematics among their many anomalous features. In the crystal-chemistry aspects as well as in the electronic properties, we find features which appear to be universal for these materials, and also features which are clearly not universal but which are common enough to be considered typical for these materials. A number of implications are presented. It appears that all of these materials are sharing some ''new'' pairing mechanism, usually in addition to the conventional phonon mechanism
Wang, Shuliang; Chi, Hehua; Yuan, Hanning; Geng, Jing
Human facial expressions are key ingredient to convert an individual's innate emotion in communication. However, the variation of facial expressions affects the reliable identification of human emotions. In this paper, we present a cloud model to extract facial features for representing human emotion. First, the uncertainties in facial expression are analyzed in the context of cloud model. The feature extraction and representation algorithm is established under cloud generators. With forward cloud generator, facial expression images can be re-generated as many as we like for visually representing the extracted three features, and each feature shows different roles. The effectiveness of the computing model is tested on Japanese Female Facial Expression database. Three common features are extracted from seven facial expression images. Finally, the paper is concluded and remarked.
Gayk, Zach G; Le Duc, Diana; Horn, Jeffrey; Lindsay, Alec R
The common loon (Gavia immer) is one of five species that comprise the avian order Gaviiformes. Loons are specialized divers, reaching depths up to 60 m while staying submerged for intervals up to three minutes. In this study we used comparative genomics to investigate the genetic basis of the common loon adaptations to its ecological niche. We used Illumina short read DNA sequence data from a female bird to produce a draft assembly of the common loon (Gavia immer) genome. We identified 14,169 common loon genes, which based on well-resolved avian genomes, represent approximately 80.7% of common loon genes. Evolutionary analyses between common loon and Adelie penguin (Pygoscelis adeliae), red-throated loon (Gavia stellata), chicken (Gallus gallus), northern fulmar (Fulmarus glacialis), and rock pigeon (Columba livia) show 164 positively selected genes in common and red-throated loons. These genes were enriched for a number of protein classes, including those involved in muscle tissue development, immunoglobulin function, hemoglobin iron binding, G-protein coupled receptors, and ATP metabolism. Signatures of positive selection in these areas suggest the genus Gavia may have adapted for underwater diving by modulating their oxidative and metabolic pathways. While more research is required, these adaptations likely result in (1) compensations in oxygen respiration and energetic metabolism, (2) low-light visual acuity, and (3) elevated solute exchange. This work represents the first effort to understand the genomic adaptations of the common loon as well as other Gavia and may have implications for subsequent studies that target particular genes for loon population genetic, ecological or conservation studies.
A genome-wide association study (GWAS) was conducted to explore the genetic basis of variation for symbiotic nitrogen fixation (SNF) and related traits in the Andean diversity panel (ADP) comprised of 259 common bean (Phaseolus vulgaris) genotypes. The ADP was evaluated for SNF and related traits in...
Gary W Beecham
Full Text Available Alzheimer's disease (AD and related dementias are a major public health challenge and present a therapeutic imperative for which we need additional insight into molecular pathogenesis. We performed a genome-wide association study and analysis of known genetic risk loci for AD dementia using neuropathologic data from 4,914 brain autopsies. Neuropathologic data were used to define clinico-pathologic AD dementia or controls, assess core neuropathologic features of AD (neuritic plaques, NPs; neurofibrillary tangles, NFTs, and evaluate commonly co-morbid neuropathologic changes: cerebral amyloid angiopathy (CAA, Lewy body disease (LBD, hippocampal sclerosis of the elderly (HS, and vascular brain injury (VBI. Genome-wide significance was observed for clinico-pathologic AD dementia, NPs, NFTs, CAA, and LBD with a number of variants in and around the apolipoprotein E gene (APOE. GalNAc transferase 7 (GALNT7, ATP-Binding Cassette, Sub-Family G (WHITE, Member 1 (ABCG1, and an intergenic region on chromosome 9 were associated with NP score; and Potassium Large Conductance Calcium-Activated Channel, Subfamily M, Beta Member 2 (KCNMB2 was strongly associated with HS. Twelve of the 21 non-APOE genetic risk loci for clinically-defined AD dementia were confirmed in our clinico-pathologic sample: CR1, BIN1, CLU, MS4A6A, PICALM, ABCA7, CD33, PTK2B, SORL1, MEF2C, ZCWPW1, and CASS4 with 9 of these 12 loci showing larger odds ratio in the clinico-pathologic sample. Correlation of effect sizes for risk of AD dementia with effect size for NFTs or NPs showed positive correlation, while those for risk of VBI showed a moderate negative correlation. The other co-morbid neuropathologic features showed only nominal association with the known AD loci. Our results discovered new genetic associations with specific neuropathologic features and aligned known genetic risk for AD dementia with specific neuropathologic changes in the largest brain autopsy study of AD and related
Huckins, L M; Hatzikotoulas, K; Southam, L; Thornton, L M; Steinberg, J; Aguilera-McKay, F; Treasure, J; Schmidt, U; Gunasinghe, C; Romero, A; Curtis, C; Rhodes, D; Moens, J; Kalsi, G; Dempster, D; Leung, R; Keohane, A; Burghardt, R; Ehrlich, S; Hebebrand, J; Hinney, A; Ludolph, A; Walton, E; Deloukas, P; Hofman, A; Palotie, A; Palta, P; van Rooij, F J A; Stirrups, K; Adan, R; Boni, C; Cone, R; Dedoussis, G; van Furth, E; Gonidakis, F; Gorwood, P; Hudson, J; Kaprio, J; Kas, M; Keski-Rahonen, A; Kiezebrink, K; Knudsen, G-P; Slof-Op 't Landt, M C T; Maj, M; Monteleone, A M; Monteleone, P; Raevuori, A H; Reichborn-Kjennerud, T; Tozzi, F; Tsitsika, A; van Elburg, A; Adan, R A H; Alfredsson, L; Ando, T; Andreassen, O A; Aschauer, H; Baker, J H; Barrett, J C; Bencko, V; Bergen, A W; Berrettini, W H; Birgegard, A; Boni, C; Boraska Perica, V; Brandt, H; Breen, G; Bulik, C M; Carlberg, L; Cassina, M; Cichon, S; Clementi, M; Cohen-Woods, S; Coleman, J; Cone, R D; Courtet, P; Crawford, S; Crow, S; Crowley, J; Danner, U N; Davis, O S P; de Zwaan, M; Dedoussis, G; Degortes, D; DeSocio, J E; Dick, D M; Dikeos, D; Dina, C; Ding, B; Dmitrzak-Weglarz, M; Docampo, E; Duncan, L; Egberts, K; Ehrlich, S; Escaramís, G; Esko, T; Espeseth, T; Estivill, X; Favaro, A; Fernández-Aranda, F; Fichter, M M; Finan, C; Fischer, K; Floyd, J A B; Foretova, L; Forzan, M; Franklin, C S; Gallinger, S; Gambaro, G; Gaspar, H A; Giegling, I; Gonidakis, F; Gorwood, P; Gratacos, M; Guillaume, S; Guo, Y; Hakonarson, H; Halmi, K A; Hatzikotoulas, K; Hauser, J; Hebebrand, J; Helder, S; Herms, S; Herpertz-Dahlmann, B; Herzog, W; Hilliard, C E; Hinney, A; Hübel, C; Huckins, L M; Hudson, J I; Huemer, J; Inoko, H; Janout, V; Jiménez-Murcia, S; Johnson, C; Julià, A; Juréus, A; Kalsi, G; Kaminska, D; Kaplan, A S; Kaprio, J; Karhunen, L; Karwautz, A; Kas, M J H; Kaye, W; Kennedy, J L; Keski-Rahkonen, A; Kiezebrink, K; Klareskog, L; Klump, K L; Knudsen, G P S; Koeleman, B P C; Koubek, D; La Via, M C; Landén, M; Le Hellard, S; Levitan, R D; Li, D; Lichtenstein, P; Lilenfeld, L; Lissowska, J; Lundervold, A; Magistretti, P; Maj, M; Mannik, K; Marsal, S; Martin, N; Mattingsdal, M; McDevitt, S; McGuffin, P; Merl, E; Metspalu, A; Meulenbelt, I; Micali, N; Mitchell, J; Mitchell, K; Monteleone, P; Monteleone, A M; Mortensen, P; Munn-Chernoff, M A; Navratilova, M; Nilsson, I; Norring, C; Ntalla, I; Ophoff, R A; O'Toole, J K; Palotie, A; Pante, J; Papezova, H; Pinto, D; Rabionet, R; Raevuori, A; Rajewski, A; Ramoz, N; Rayner, N W; Reichborn-Kjennerud, T; Ripatti, S; Roberts, M; Rotondo, A; Rujescu, D; Rybakowski, F; Santonastaso, P; Scherag, A; Scherer, S W; Schmidt, U; Schork, N J; Schosser, A; Slachtova, L; Sladek, R; Slagboom, P E; Slof-Op 't Landt, M C T; Slopien, A; Soranzo, N; Southam, L; Steen, V M; Strengman, E; Strober, M; Sullivan, P F; Szatkiewicz, J P; Szeszenia-Dabrowska, N; Tachmazidou, I; Tenconi, E; Thornton, L M; Tortorella, A; Tozzi, F; Treasure, J; Tsitsika, A; Tziouvas, K; van Elburg, A A; van Furth, E F; Wagner, G; Walton, E; Watson, H; Wichmann, H-E; Widen, E; Woodside, D B; Yanovski, J; Yao, S; Yilmaz, Z; Zeggini, E; Zerwas, S; Zipfel, S; Collier, D A; Sullivan, P F; Breen, G; Bulik, C M; Zeggini, E
Anorexia nervosa (AN) is a complex neuropsychiatric disorder presenting with dangerously low body weight, and a deep and persistent fear of gaining weight. To date, only one genome-wide significant locus associated with AN has been identified. We performed an exome-chip based genome-wide association studies (GWAS) in 2158 cases from nine populations of European origin and 15 485 ancestrally matched controls. Unlike previous studies, this GWAS also probed association in low-frequency and rare variants. Sixteen independent variants were taken forward for in silico and de novo replication (11 common and 5 rare). No findings reached genome-wide significance. Two notable common variants were identified: rs10791286, an intronic variant in OPCML (P=9.89 × 10−6), and rs7700147, an intergenic variant (P=2.93 × 10−5). No low-frequency variant associations were identified at genome-wide significance, although the study was well-powered to detect low-frequency variants with large effect sizes, suggesting that there may be no AN loci in this genomic search space with large effect sizes. PMID:29155802
Xiao-Wen, Jia; Hai-Yun, Liu; Wen-Tao, Zhang; Lin, Zhao; Jian-Qiao, Meng; Guo-Dong, Liu; Xiao-Li, Dong; Zhi-An, Ren; Wei, Yi; Guang-Can, Che; Zhong-Xian, Zhao; Gang, Wu; Rong-Hua, Liu; Xian-Hui, Chen; Gen-Fu, Chen; Nan-Lin, Wang; Yong, Zhu; Xiao-Yang, Wang; Gui-Ling, Wang; Yong, Zhou
High resolution photoemission measurements are carried out on non-superconducting LaFeAsO parent compound and various superconducting RFeAs(O 1-x F x ) (R=La, Ce and Pr) compounds. It is found that the parent LaFeAsO compound shows a metallic character. By extensive measurements, several common features are identified in the electronic structure of these Fe-based compounds: (1) 0.2 eV feature in the valence band, (2) a universal 13-16 meV feature, (3) near Ef spectral weight suppression with decreasing temperature. These universal features can provide important information about band structure, superconducting gap and pseudogap in these Fe-based materials
Pozzoli, Uberto; Elgar, Greg; Cagliani, Rachele; Riva, Laura; Comi, Giacomo P.; Bresolin, Nereo; Bardoni, Alessandra; Sironi, Manuela
The human DMD gene is the largest known to date, spanning > 2000 kb on the X chromosome. The gene size is mainly accounted for by huge intronic regions. We sequenced 190 kb of Fugu rubripes (pufferfish) genomic DNA corresponding to the complete dystrophin gene (FrDMD) and provide the first report of gene structure and sequence comparison among dystrophin genomic sequences from different vertebrate organisms. Almost all intron positions and phases are conserved between FrDMD and its mammalian counterparts, and the predicted protein product of the Fugu gene displays 55% identity and 71% similarity to human dystrophin. In analogy to the human gene, FrDMD presents several-fold longer than average intronic regions. Analysis of intron sequences of the human and murine genes revealed that they are extremely conserved in size and that a similar fraction of total intron length is represented by repetitive elements; moreover, our data indicate that intron expansion through repeat accumulation in the two orthologs is the result of independent insertional events. The hypothesis that intron length might be functionally relevant to the DMD gene regulation is proposed and substantiated by the finding that dystrophin intron gigantism is common to the three vertebrate genes. [Supplemental material is available online at www.genome.org.] PMID:12727896
Fletez-Brant, Christopher; Lee, Dongwon; McCallion, Andrew S.; Beer, Michael A.
Massively parallel sequencing technologies have made the generation of genomic data sets a routine component of many biological investigations. For example, Chromatin immunoprecipitation followed by sequence assays detect genomic regions bound (directly or indirectly) by specific factors, and DNase-seq identifies regions of open chromatin. A major bottleneck in the interpretation of these data is the identification of the underlying DNA sequence code that defines, and ultimately facilitates prediction of, these transcription factor (TF) bound or open chromatin regions. We have recently developed a novel computational methodology, which uses a support vector machine (SVM) with kmer sequence features (kmer-SVM) to identify predictive combinations of short transcription factor-binding sites, which determine the tissue specificity of these genomic assays (Lee, Karchin and Beer, Discriminative prediction of mammalian enhancers from DNA sequence. Genome Res. 2011; 21:2167–80). This regulatory information can (i) give confidence in genomic experiments by recovering previously known binding sites, and (ii) reveal novel sequence features for subsequent experimental testing of cooperative mechanisms. Here, we describe the development and implementation of a web server to allow the broader research community to independently apply our kmer-SVM to analyze and interpret their genomic datasets. We analyze five recently published data sets and demonstrate how this tool identifies accessory factors and repressive sequence elements. kmer-SVM is available at http://kmersvm.beerlab.org. PMID:23771147
Full Text Available Abstract Background Bovine herpesvirus 4 (BoHV-4 is a useful model for the human pathogenic gammaherpesviruses Epstein-Barr virus and Kaposi's Sarcoma-associated Herpesvirus. Although genome manipulations of this virus have been greatly facilitated by the cloning of the BoHV-4 V.test strain as a Bacterial Artificial Chromosome (BAC, the lack of a complete genome sequence for this strain limits its experimental use. Methods In this study, we have determined the complete sequence of BoHV-4 V.test strain by a pyrosequencing approach. Results The long unique coding region (LUR consists of 108,241 bp encoding at least 79 open reading frames and is flanked by several polyrepetitive DNA units (prDNA. As previously suggested, we showed that the prDNA unit located at the left prDNA-LUR junction (prDNA-G differs from the other prDNA units (prDNA-inner. Namely, the prDNA-G unit lacks the conserved pac-2 cleavage and packaging signal in its right terminal region. Based on the mechanisms of cleavage and packaging of herpesvirus genomes, this feature implies that only genomes bearing left and right end prDNA units are encapsulated into virions. Conclusions In this study, we have determined the complete genome sequence of the BAC-cloned BoHV-4 V.test strain and identified genome organization features that could be important in other herpesviruses.
Zhang, Yu; Zhou, Guoxu; Jin, Jing; Wang, Xingyu; Cichocki, Andrzej
Canonical correlation analysis (CCA) has been successfully applied to steady-state visual evoked potential (SSVEP) recognition for brain-computer interface (BCI) application. Although the CCA method outperforms the traditional power spectral density analysis through multi-channel detection, it requires additionally pre-constructed reference signals of sine-cosine waves. It is likely to encounter overfitting in using a short time window since the reference signals include no features from training data. We consider that a group of electroencephalogram (EEG) data trials recorded at a certain stimulus frequency on a same subject should share some common features that may bear the real SSVEP characteristics. This study therefore proposes a common feature analysis (CFA)-based method to exploit the latent common features as natural reference signals in using correlation analysis for SSVEP recognition. Good performance of the CFA method for SSVEP recognition is validated with EEG data recorded from ten healthy subjects, in contrast to CCA and a multiway extension of CCA (MCCA). Experimental results indicate that the CFA method significantly outperformed the CCA and the MCCA methods for SSVEP recognition in using a short time window (i.e., less than 1s). The superiority of the proposed CFA method suggests it is promising for the development of a real-time SSVEP-based BCI. Copyright © 2014 Elsevier B.V. All rights reserved.
Ekins, Sean; Freundlich, Joel S; Coffee, Megan
We are currently faced with a global infectious disease crisis which has been anticipated for decades. While many promising biotherapeutics are being tested, the search for a small molecule has yet to deliver an approved drug or therapeutic for the Ebola or similar filoviruses that cause haemorrhagic fever. Two recent high throughput screens published in 2013 did however identify several hits that progressed to animal studies that are FDA approved drugs used for other indications. The current computational analysis uses these molecules from two different structural classes to construct a common features pharmacophore. This ligand-based pharmacophore implicates a possible common target or mechanism that could be further explored. A recent structure based design project yielded nine co-crystal structures of pyrrolidinone inhibitors bound to the viral protein 35 (VP35). When receptor-ligand pharmacophores based on the analogs of these molecules and the protein structures were constructed, the molecular features partially overlapped with the common features of solely ligand-based pharmacophore models based on FDA approved drugs. These previously identified FDA approved drugs with activity against Ebola were therefore docked into this protein. The antimalarials chloroquine and amodiaquine docked favorably in VP35. We propose that these drugs identified to date as inhibitors of the Ebola virus may be targeting VP35. These computational models may provide preliminary insights into the molecular features that are responsible for their activity against Ebola virus in vitro and in vivo and we propose that this hypothesis could be readily tested.
Kasperaviciūte, Dalia; Catarino, Claudia B; Heinzen, Erin L; Depondt, Chantal; Cavalleri, Gianpiero L; Caboclo, Luis O; Tate, Sarah K; Jamnadas-Khoda, Jenny; Chinthapalli, Krishna; Clayton, Lisa M S; Shianna, Kevin V; Radtke, Rodney A; Mikati, Mohamad A; Gallentine, William B; Husain, Aatif M; Alhusaini, Saud; Leppert, David; Middleton, Lefkos T; Gibson, Rachel A; Johnson, Michael R; Matthews, Paul M; Hosford, David; Heuser, Kjell; Amos, Leslie; Ortega, Marcos; Zumsteg, Dominik; Wieser, Heinz-Gregor; Steinhoff, Bernhard J; Krämer, Günter; Hansen, Jörg; Dorn, Thomas; Kantanen, Anne-Mari; Gjerstad, Leif; Peuralinna, Terhi; Hernandez, Dena G; Eriksson, Kai J; Kälviäinen, Reetta K; Doherty, Colin P; Wood, Nicholas W; Pandolfo, Massimo; Duncan, John S; Sander, Josemir W; Delanty, Norman; Goldstein, David B; Sisodiya, Sanjay M
Partial epilepsies have a substantial heritability. However, the actual genetic causes are largely unknown. In contrast to many other common diseases for which genetic association-studies have successfully revealed common variants associated with disease risk, the role of common variation in partial epilepsies has not yet been explored in a well-powered study. We undertook a genome-wide association-study to identify common variants which influence risk for epilepsy shared amongst partial epilepsy syndromes, in 3445 patients and 6935 controls of European ancestry. We did not identify any genome-wide significant association. A few single nucleotide polymorphisms may warrant further investigation. We exclude common genetic variants with effect sizes above a modest 1.3 odds ratio for a single variant as contributors to genetic susceptibility shared across the partial epilepsies. We show that, at best, common genetic variation can only have a modest role in predisposition to the partial epilepsies when considered across syndromes in Europeans. The genetic architecture of the partial epilepsies is likely to be very complex, reflecting genotypic and phenotypic heterogeneity. Larger meta-analyses are required to identify variants of smaller effect sizes (odds ratio<1.3) or syndrome-specific variants. Further, our results suggest research efforts should also be directed towards identifying the multiple rare variants likely to account for at least part of the heritability of the partial epilepsies. Data emerging from genome-wide association-studies will be valuable during the next serious challenge of interpreting all the genetic variation emerging from whole-genome sequencing studies.
Mourad, Raphaël; Cuvier, Olivier
Recent advances in long-range Hi-C contact mapping have revealed the importance of the 3D structure of chromosomes in gene expression. A current challenge is to identify the key molecular drivers of this 3D structure. Several genomic features, such as architectural proteins and functional elements, were shown to be enriched at topological domain borders using classical enrichment tests. Here we propose multiple logistic regression to identify those genomic features that positively or negatively influence domain border establishment or maintenance. The model is flexible, and can account for statistical interactions among multiple genomic features. Using both simulated and real data, we show that our model outperforms enrichment test and non-parametric models, such as random forests, for the identification of genomic features that influence domain borders. Using Drosophila Hi-C data at a very high resolution of 1 kb, our model suggests that, among architectural proteins, BEAF-32 and CP190 are the main positive drivers of 3D domain borders. In humans, our model identifies well-known architectural proteins CTCF and cohesin, as well as ZNF143 and Polycomb group proteins as positive drivers of domain borders. The model also reveals the existence of several negative drivers that counteract the presence of domain borders including P300, RXRA, BCL11A and ELK1.
Wang, Yiheng; Liu, Tong; Xu, Dong; Shi, Huidong; Zhang, Chaoyang; Mo, Yin-Yuan; Wang, Zheng
The hypo- or hyper-methylation of the human genome is one of the epigenetic features of leukemia. However, experimental approaches have only determined the methylation state of a small portion of the human genome. We developed deep learning based (stacked denoising autoencoders, or SdAs) software named "DeepMethyl" to predict the methylation state of DNA CpG dinucleotides using features inferred from three-dimensional genome topology (based on Hi-C) and DNA sequence patterns. We used the experimental data from immortalised myelogenous leukemia (K562) and healthy lymphoblastoid (GM12878) cell lines to train the learning models and assess prediction performance. We have tested various SdA architectures with different configurations of hidden layer(s) and amount of pre-training data and compared the performance of deep networks relative to support vector machines (SVMs). Using the methylation states of sequentially neighboring regions as one of the learning features, an SdA achieved a blind test accuracy of 89.7% for GM12878 and 88.6% for K562. When the methylation states of sequentially neighboring regions are unknown, the accuracies are 84.82% for GM12878 and 72.01% for K562. We also analyzed the contribution of genome topological features inferred from Hi-C. DeepMethyl can be accessed at http://dna.cs.usm.edu/deepmethyl/.
Edwards, Stefan McKinnon
externally founded information, such as KEGG pathways, Gene Ontology gene sets, or genomic features, and estimate the joint contribution of the genetic variants within these sets to complex trait phenotypes. The analysis of complex trait phenotypes is hampered by the myriad of genes that control the trait...
Cosentino Lagomarsino, Marco; Sellerio, Alessandro L; Heijning, Philip D; Bassetti, Bruno
Protein domains can be used to study proteome evolution at a coarse scale. In particular, they are found on genomes with notable statistical distributions. It is known that the distribution of domains with a given topology follows a power law. We focus on a further aspect: these distributions, and the number of distinct topologies, follow collective trends, or scaling laws, depending on the total number of domains only, and not on genome-specific features. We present a stochastic duplication/innovation model, in the class of the so-called 'Chinese restaurant processes', that explains this observation with two universal parameters, representing a minimal number of domains and the relative weight of innovation to duplication. Furthermore, we study a model variant where new topologies are related to occurrence in genomic data, accounting for fold specificity. Both models have general quantitative agreement with data from hundreds of genomes, which indicates that the domains of a genome are built with a combination of specificity and robust self-organizing phenomena. The latter are related to the basic evolutionary 'moves' of duplication and innovation, and give rise to the observed scaling laws, a priori of the specific evolutionary history of a genome. We interpret this as the concurrent effect of neutral and selective drives, which increase duplication and decrease innovation in larger and more complex genomes. The validity of our model would imply that the empirical observation of a small number of folds in nature may be a consequence of their evolution.
Mavian, Carla; López-Bueno, Alberto; Balseiro, Ana; Casais, Rosa; Alcamí, Antonio; Alejo, Alí
Worldwide amphibian population declines have been ascribed to global warming, increasing pollution levels, and other factors directly related to human activities. These factors may additionally be favoring the emergence of novel pathogens. In this report, we have determined the complete genome sequence of the emerging common midwife toad ranavirus (CMTV), which has caused fatal disease in several amphibian species across Europe. Phylogenetic and gene content analyses of the first complete genomic sequence from a ranavirus isolated in Europe show that CMTV is an amphibian-like ranavirus (ALRV). However, the CMTV genome structure is novel and represents an intermediate evolutionary stage between the two previously described ALRV groups. We find that CMTV clusters with several other ranaviruses isolated from different hosts and locations which might also be included in this novel ranavirus group. This work sheds light on the phylogenetic relationships within this complex group of emerging, disease-causing viruses.
Choi, Kyoung Su; Park, SeonJoo
Aster spathulifolius, a member of the Asteraceae family, is distributed along the coast of Japan and Korea. This plant is used for medicinal and ornamental purposes. The complete chloroplast (cp) genome of A. sphathulifolius consists of 149,473 bp that include a pair of inverted repeats of 24,751 bp separated by a large single copy region of 81,998 bp and a small single copy region of 17,973 bp. The chloroplast genome contains 78 coding genes, four rRNA genes and 29 tRNA genes. When compared to other cpDNA sequences of Asteraceae, A. spathulifolius showed the closest relationship with Jacobaea vulgaris, and its atpB gene was found to be a pseudogene, unlike J. vulgaris. Furthermore, evaluation of the gene compositions of J. vulgaris, Helianthus annuus, Guizotia abyssinica and A. spathulifolius revealed that 13.6-kb showed inversion from ndhF to rps15, unlike Lactuca of Asteraceae. Comparison of the synonymous (Ks) and nonsynonymous (Ka) substitution rates with J. vulgaris revealed that synonymous genes related to a small subunit of the ribosome showed the highest value (0.1558), while nonsynonymous rates of genes related to ATP synthase genes were highest (0.0118). These findings revealed that substitution has occurred at similar rates in most genes, and the substitution rates suggested that most genes is a purified selection. Copyright © 2015 Elsevier B.V. All rights reserved.
Lisa C Crossman
Full Text Available This work centres on the genomic comparisons of two closely-related nitrogen-fixing symbiotic bacteria, Rhizobium leguminosarum biovar viciae 3841 and Rhizobium etli CFN42. These strains maintain a stable genomic core that is also common to other rhizobia species plus a very variable and significant accessory component. The chromosomes are highly syntenic, whereas plasmids are related by fewer syntenic blocks and have mosaic structures. The pairs of plasmids p42f-pRL12, p42e-pRL11 and p42b-pRL9 as well large parts of p42c with pRL10 are shown to be similar, whereas the symbiotic plasmids (p42d and pRL10 are structurally unrelated and seem to follow distinct evolutionary paths. Even though purifying selection is acting on the whole genome, the accessory component is evolving more rapidly. This component is constituted largely for proteins for transport of diverse metabolites and elements of external origin. The present analysis allows us to conclude that a heterogeneous and quickly diversifying group of plasmids co-exists in a common genomic framework.
Background Common carp (Cyprinus carpio) is thought to have undergone one extra round of genome duplication compared to zebrafish. Transcriptome analysis has been used to study the existence and timing of genome duplication in species for which genome sequences are incomplete. Large-scale transcriptome data for the common carp genome should help reveal the timing of the additional duplication event. Results We have sequenced the transcriptome of common carp using 454 pyrosequencing. After assembling the 454 contigs and the published common carp sequences together, we obtained 49,669 contigs and identified genes using homology searches and an ab initio method. We identified 4,651 orthologous pairs between common carp and zebrafish and found 129,984 paralogous pairs within the common carp. An estimation of the synonymous substitution rate in the orthologous pairs indicated that common carp and zebrafish diverged 120 million years ago (MYA). We identified one round of genome duplication in common carp and estimated that it had occurred 5.6 to 11.3 MYA. In zebrafish, no genome duplication event after speciation was observed, suggesting that, compared to zebrafish, common carp had undergone an additional genome duplication event. We annotated the common carp contigs with Gene Ontology terms and KEGG pathways. Compared with zebrafish gene annotations, we found that a set of biological processes and pathways were enriched in common carp. Conclusions The assembled contigs helped us to estimate the time of the fourth-round of genome duplication in common carp. The resource that we have built as part of this study will help advance functional genomics and genome annotation studies in the future. PMID:22424280
Inda, M.A.; van Batenburg, M.F.; Roos, M.; Belloum, A.S.Z.; Vasunin, D.; Wibisono, A.; van Kampen, A.H.C.; Breit, T.M.
Background: Chromosome location is often used as a scaffold to organize genomic information in both the living cell and molecular biological research. Thus, ever-increasing amounts of data about genomic features are stored in public databases and can be readily visualized by genome browsers. To
Inda, Marcia A.; van Batenburg, Marinus F.; Roos, Marco; Belloum, Adam Sz; Vasunin, Dmitry; Wibisono, Adianto; van Kampen, Antoine H. C.; Breit, Timo M.
ABSTRACT: BACKGROUND: Chromosome location is often used as a scaffold to organize genomic information in both the living cell and molecular biological research. Thus, ever-increasing amounts of data about genomic features are stored in public databases and can be readily visualized by genome
Simola, Daniel F.; Wissler, Lothar; Donahue, Greg
Genomes of eusocial insects code for dramatic examples of phenotypic plasticity and social organization. We compared the genomes of seven ants, the honeybee, and various solitary insects to examine whether eusocial lineages share distinct features of genomic organization. Each ant lineage contain...
Strozzi, Francesco; Aerts, Jan
The Ensembl database makes genomic features available via its Genome Browser. It is also possible to access the underlying data through a Perl API for advanced querying. We have developed a full-featured Ruby API to the Ensembl databases, providing the same functionality as the Perl interface with additional features. A single Ruby API is used to access different releases of the Ensembl databases and is also able to query multi-species databases. Most functionality of the API is provided using the ActiveRecord pattern. The library depends on introspection to make it release independent. The API is available through the Rubygem system and can be installed with the command gem install ruby-ensembl-api.
Full Text Available The aim of this study was to reveal the molecular mechanism involved in multidrug resistance and virulence of Campylobacter jejuni isolated from broiler chickens. The virulence of six multidrug resistant C. jejuni was determined by in vitro and in vivo methods. The de novo whole genome sequencing technology and molecular biology methods were used to analyze the genomic features associated with the multidrug resistance and virulence of a selected isolate (C. jejuni 1655. The comparative genomic analyses revealed a large number of single nucleotide polymorphisms, deletions, rearrangements, and inversions in C. jejuni 1655 compared to reference C. jejuni genomes. The co-emergence of Thr-86-Ile mutation in gyrA gene, A2075G mutation in 23S rRNA gene, tetO, aphA and aadE genes and pTet plasmid in C. jejuni 1655 contributed its multidrug resistance to fluoroquinolones, macrolides, tetracycline and aminoglycosides. The combination of multiple virulence genes may work together to confer the relative higher virulence in C. jejuni 1655. The co-existence of mobile gene elements (e.g. pTet and CRISPR-Cas system in C. jejuni 1655 may play an important role in the gene transfer and immune defense. The present study provides basic information of phenotypic and genomic features of C. jejuni 1655, a strain recently isolated from a chicken displaying multidrug resistance and relatively high level of virulence.
Full Text Available Clustering is a popular technique for explorative analysis of data, as it can reveal subgroupings and similarities between data in an unsupervised manner. While clustering is routinely applied to gene expression data, there is a lack of appropriate general methodology for clustering of sequence-level genomic and epigenomic data, e.g. ChIP-based data. We here introduce a general methodology for clustering data sets of coordinates relative to a genome assembly, i.e. genomic tracks. By defining appropriate feature extraction approaches and similarity measures, we allow biologically meaningful clustering to be performed for genomic tracks using standard clustering algorithms. An implementation of the methodology is provided through a tool, ClusTrack, which allows fine-tuned clustering analyses to be specified through a web-based interface. We apply our methods to the clustering of occupancy of the H3K4me1 histone modification in samples from a range of different cell types. The majority of samples form meaningful subclusters, confirming that the definitions of features and similarity capture biological, rather than technical, variation between the genomic tracks. Input data and results are available, and can be reproduced, through a Galaxy Pages document at http://hyperbrowser.uio.no/hb/u/hb-superuser/p/clustrack. The clustering functionality is available as a Galaxy tool, under the menu option "Specialized analyzis of tracks", and the submenu option "Cluster tracks based on genome level similarity", at the Genomic HyperBrowser server: http://hyperbrowser.uio.no/hb/.
Ma, Ling; Liu, Xiabi; Fei, Baowei
Common CT imaging signs of lung diseases (CISLs) are defined as the imaging signs that frequently appear in lung CT images from patients. CISLs play important roles in the diagnosis of lung diseases. This paper proposes a novel learning method, namely learning with distribution of optimized feature (DOF), to effectively recognize the characteristics of CISLs. We improve the classification performance by learning the optimized features under different distributions. Specifically, we adopt the minimum spanning tree algorithm to capture the relationship between features and discriminant ability of features for selecting the most important features. To overcome the problem of various distributions in one CISL, we propose a hierarchical learning method. First, we use an unsupervised learning method to cluster samples into groups based on their distribution. Second, in each group, we use a supervised learning method to train a model based on their categories of CISLs. Finally, we obtain multiple classification decisions from multiple trained models and use majority voting to achieve the final decision. The proposed approach has been implemented on a set of 511 samples captured from human lung CT images and achieves a classification accuracy of 91.96%. The proposed DOF method is effective and can provide a useful tool for computer-aided diagnosis of lung diseases on CT images.
Full Text Available Genome editing describes a variety of molecular biology applications enabling targeted and precise alterations of the genomes of plants, animals and microorganisms. These rapidly developing techniques are likely to revolutionize the breeding of new crop varieties. Since genome editing can lead to the development of plants that could also have come into existence naturally or by conventional breeding techniques, there are strong arguments that these cases should not be classified as genetically modified organisms (GMOs and be regulated no differently from conventionally bred crops. If a specific regulation would be regarded necessary, the application of genome editing for crop development may challenge risk assessment and post-market monitoring. In the session “Plant genome editing—any novel features to consider for ERA and regulation?” held at the 14th ISBGMO, scientists from various disciplines as well as regulators, risk assessors and potential users of the new technologies were brought together for a knowledge-based discussion to identify knowledge gaps and analyze scenarios for the introduction of genome-edited crops into the environment. It was aimed to enable an open exchange forum on the regulatory approaches, ethical aspects and decision-making considerations.
Scheuner, Maren T; Sieverding, Pauline; Shekelle, Paul G
The greatest public health benefit of advances in understanding the human genome may be realized for common chronic diseases such as cardiovascular disease, diabetes mellitus, and cancer. Attempts to integrate such knowledge into clinical practice are still in the early stages, and as a result, many questions surround the current state of this translation. To synthesize current information on genetic health services for common adult-onset conditions by examining studies that have addressed the outcomes, consumer information needs, delivery, and challenges in integrating these services. MEDLINE articles published between January 2000 and February 2008. Original research articles and systematic reviews dealing with common chronic adult-onset conditions were reviewed. A total of 3371 citations were reviewed, 170 articles retrieved, and 68 articles included in the analysis. Data were independently extracted by one reviewer and checked by another with disagreement resolved by consensus. Variables assessed included study design and 4 key areas: outcomes of genomic medicine, consumer information needs, delivery of genomic medicine, and challenges and barriers to integration of genomic medicine. Sixty-eight articles contributed data to the synthesis: 5 systematic reviews, 8 experimental studies, 35 surveys, 7 pre/post studies, 3 observational studies, and 10 qualitative reports. Three systematic reviews, 4 experimental studies, and 9 additional studies reported on outcomes of genetic services. Generally there were modest positive effects on psychological outcomes such as worry and anxiety, behavioral outcomes have shown mixed results, and clinical outcomes were less well studied. One systematic review, 1 randomized controlled trial, and 14 other studies assessed consumer information needs and found in general that genetics knowledge was reported to be low but that attitudes were generally positive. Three randomized controlled trials and 13 other studies assessed how
Gutiérrez, Pablo; Alzate, Juan; Yepes, Mauricio Salazar; Marín, Mauricio
Colletotrichum lindemuthianum is the causal agent of anthracnose in common bean (Phaseolus vulgaris), one of the most limiting factors for this crop in South and Central America. In this work, the mitochondrial sequence of a Colombian isolate of C. lindemuthianum obtained from a common bean plant (var. Cargamanto) with anthracnose symptoms is presented. The mtDNA codes for 13 proteins of the respiratory chain, 1 ribosomal protein, 2 homing endonucleases, 2 ribosomal RNAs and 28 tRNAs. This is the first report of a complete mtDNA genome sequence from C. lindemuthianum.
Full Text Available Computational methods to identify functional genomic elements using genetic information have been very successful in determining gene structure and in identifying a handful of cis-regulatory elements. But the vast majority of regulatory elements have yet to be discovered, and it has become increasingly apparent that their discovery will not come from using genetic information alone. Recently, high-throughput technologies have enabled the creation of information-rich epigenetic maps, most notably for histone modifications. However, tools that search for functional elements using this epigenetic information have been lacking. Here, we describe an unsupervised learning method called ChromaSig to find, in an unbiased fashion, commonly occurring chromatin signatures in both tiling microarray and sequencing data. Applying this algorithm to nine chromatin marks across a 1% sampling of the human genome in HeLa cells, we recover eight clusters of distinct chromatin signatures, five of which correspond to known patterns associated with transcriptional promoters and enhancers. Interestingly, we observe that the distinct chromatin signatures found at enhancers mark distinct functional classes of enhancers in terms of transcription factor and coactivator binding. In addition, we identify three clusters of novel chromatin signatures that contain evolutionarily conserved sequences and potential cis-regulatory elements. Applying ChromaSig to a panel of 21 chromatin marks mapped genomewide by ChIP-Seq reveals 16 classes of genomic elements marked by distinct chromatin signatures. Interestingly, four classes containing enrichment for repressive histone modifications appear to be locally heterochromatic sites and are enriched in quickly evolving regions of the genome. The utility of this approach in uncovering novel, functionally significant genomic elements will aid future efforts of genome annotation via chromatin modifications.
Full Text Available BACKGROUND: Medulloblastoma is the most common malignant brain tumor in children. Despite recent improvements in cure rates, prediction of disease outcome remains a major challenge and survivors suffer from serious therapy-related side-effects. Recent data showed that patients with WNT-activated tumors have a favorable prognosis, suggesting that these patients could be treated less intensively, thereby reducing the side-effects. This illustrates the potential benefits of a robust classification of medulloblastoma patients and a detailed knowledge of associated biological mechanisms. METHODS AND FINDINGS: To get a better insight into the molecular biology of medulloblastoma we established mRNA expression profiles of 62 medulloblastomas and analyzed 52 of them also by comparative genomic hybridization (CGH arrays. Five molecular subtypes were identified, characterized by WNT signaling (A; 9 cases, SHH signaling (B; 15 cases, expression of neuronal differentiation genes (C and D; 16 and 11 cases, respectively or photoreceptor genes (D and E; both 11 cases. Mutations in beta-catenin were identified in all 9 type A tumors, but not in any other tumor. PTCH1 mutations were exclusively identified in type B tumors. CGH analysis identified several fully or partly subtype-specific chromosomal aberrations. Monosomy of chromosome 6 occurred only in type A tumors, loss of 9q mostly occurred in type B tumors, whereas chromosome 17 aberrations, most common in medulloblastoma, were strongly associated with type C or D tumors. Loss of the inactivated X-chromosome was highly specific for female cases of type C, D and E tumors. Gene expression levels faithfully reflected the chromosomal copy number changes. Clinicopathological features significantly different between the 5 subtypes included metastatic disease and age at diagnosis and histology. Metastatic disease at diagnosis was significantly associated with subtypes C and D and most strongly with subtype E
Need, Anna C.; Attix, Deborah K.; McEvoy, Jill M.; Cirulli, Elizabeth T.; Linney, Kristen L.; Hunt, Priscilla; Ge, Dongliang; Heinzen, Erin L.; Maia, Jessica M.; Shianna, Kevin V.; Weale, Michael E.; Cherkas, Lynn F.; Clement, Gail; Spector, Tim D.; Gibson, Greg; Goldstein, David B.
Psychiatric disorders such as schizophrenia are commonly accompanied by cognitive impairments that are treatment resistant and crucial to functional outcome. There has been great interest in studying cognitive measures as endophenotypes for psychiatric disorders, with the hope that their genetic basis will be clearer. To investigate this, we performed a genome-wide association study involving 11 cognitive phenotypes from the Cambridge Neuropsychological Test Automated Battery. We showed these measures to be heritable by comparing the correlation in 100 monozygotic and 100 dizygotic twin pairs. The full battery was tested in ∼750 subjects, and for spatial and verbal recognition memory, we investigated a further 500 individuals to search for smaller genetic effects. We were unable to find any genome-wide significant associations with either SNPs or common copy number variants. Nor could we formally replicate any polymorphism that has been previously associated with cognition, although we found a weak signal of lower than expected P-values for variants in a set of 10 candidate genes. We additionally investigated SNPs in genomic loci that have been shown to harbor rare variants that associate with neuropsychiatric disorders, to see if they showed any suggestion of association when considered as a separate set. Only NRXN1 showed evidence of significant association with cognition. These results suggest that common genetic variation does not strongly influence cognition in healthy subjects and that cognitive measures do not represent a more tractable genetic trait than clinical endpoints such as schizophrenia. We discuss a possible role for rare variation in cognitive genomics. PMID:19734545
Full Text Available Drosophila melanogaster is an excellent model to explore the molecular exchanges that occur between an animal intestine and associated microbes. Previous studies in Drosophila uncovered a sophisticated web of host responses to intestinal bacteria. The outcomes of these responses define critical events in the host, such as the establishment of immune responses, access to nutrients, and the rate of larval development. Despite our steady march towards illuminating the host machinery that responds to bacterial presence in the gut, there are significant gaps in our understanding of the microbial products that influence bacterial association with a fly host. We sequenced and characterized the genomes of three common Drosophila-associated microbes: Lactobacillus plantarum, Lactobacillus brevis and Acetobacter pasteurianus. For each species, we compared the genomes of Drosophila-associated strains to the genomes of strains isolated from alternative sources. We found that environmental Lactobacillus strains readily associated with adult Drosophila and were similar to fly isolates in terms of genome organization. In contrast, we identified a strain of A. pasteurianus that apparently fails to associate with adult Drosophila due to an inability to grow on fly nutrient food. Comparisons between association competent and incompetent A. pasteurianus strains identified a short list of candidate genes that may contribute to survival on fly medium. Many of the gene products unique to fly-associated strains have established roles in the stabilization of host-microbe interactions. These data add to a growing body of literature that examines the microbial perspective of host-microbe relationships.
Wilson, Shane; Fitzsimons, Michael; Ferguson, Martin; Heath, Allison; Jensen, Mark; Miller, Josh; Murphy, Mark W; Porter, James; Sahni, Himanso; Staudt, Louis; Tang, Yajing; Wang, Zhining; Yu, Christine; Zhang, Junjun; Ferretti, Vincent; Grossman, Robert L
The NCI Genomic Data Commons (GDC) was launched in 2016 and makes available over 4 petabytes (PB) of cancer genomic and associated clinical data to the research community. This dataset continues to grow and currently includes over 14,500 patients. The GDC is an example of a biomedical data commons, which collocates biomedical data with storage and computing infrastructure and commonly used web services, software applications, and tools to create a secure, interoperable, and extensible resource for researchers. The GDC is (i) a data repository for downloading data that have been submitted to it, and also a system that (ii) applies a common set of bioinformatics pipelines to submitted data; (iii) reanalyzes existing data when new pipelines are developed; and (iv) allows users to build their own applications and systems that interoperate with the GDC using the GDC Application Programming Interface (API). We describe the GDC API and how it has been used both by the GDC itself and by third parties. Cancer Res; 77(21); e15-18. ©2017 AACR . ©2017 American Association for Cancer Research.
Narla, S T; Lee, Y-W; Benson, C A; Sarder, P; Brennand, K J; Stachowiak, E K; Stachowiak, M K
The watershed-hypothesis of schizophrenia asserts that over 200 different mutations dysregulate distinct pathways that converge on an unspecified common mechanism(s) that controls disease ontogeny. Consistent with this hypothesis, our RNA-sequencing of neuron committed cells (NCCs) differentiated from established iPSCs of 4 schizophrenia patients and 4 control subjects uncovered a dysregulated transcriptome of 1349 mRNAs common to all patients. Data reveals a global dysregulation of developmental genome, deconstruction of coordinated mRNA networks, and the formation of aberrant, new coordinated mRNA networks indicating a concerted action of the responsible factor(s). Sequencing of miRNA transcriptomes demonstrated an overexpression of 16 miRNAs and deconstruction of interactive miRNA-mRNA networks in schizophrenia NCCs. ChiPseq revealed that the nuclear (n) form of FGFR1, a pan-ontogenic regulator, is overexpressed in schizophrenia NCCs and overtargets dysregulated mRNA and miRNA genes. The nFGFR1 targeted 54% of all human gene promoters and 84.4% of schizophrenia dysregulated genes. The upregulated genes reside within major developmental pathways that control neurogenesis and neuron formation, whereas downregulated genes are involved in oligodendrogenesis. Our results indicate (i) an early (preneuronal) genomic etiology of schizophrenia, (ii) dysregulated genes and new coordinated gene networks are common to unrelated cases of schizophrenia, (iii) gene dysregulations are accompanied by increased nFGFR1-genome interactions, and (iv) modeling of increased nFGFR1 by an overexpression of a nFGFR1 lead to up or downregulation of selected genes as observed in schizophrenia NCCs. Together our results designate nFGFR1 signaling as a potential common dysregulated mechanism in investigated patients and potential therapeutic target in schizophrenia. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Novakovic, Boris; Evain-Brion, Danièle; Murthi, Padma; Fournier, Thiery; Saffery, Richard
placental development and function.-Novakovic, B., Evain-Brion, D., Murthi, P., Fournier, T., Saffery, R. Variable DAXX gene methylation is a common feature of placental trophoblast differentiation, preeclampsia, and response to hypoxia. © FASEB.
Bukiya, Anna N; Dopico, Alejandro M
Cholesterol-protein interactions are essential for the architectural organization of cell membranes and for lipid metabolism. While cholesterol-sensing motifs in transmembrane proteins have been identified, little is known about cholesterol recognition by soluble proteins. We reviewed the structural characteristics of binding sites for cholesterol and cholesterol sulfate from crystallographic structures available in the Protein Data Bank. This analysis unveiled key features of cholesterol-binding sites that are present in either all or the majority of sites: i ) the cholesterol molecule is generally positioned between protein domains that have an organized secondary structure; ii ) the cholesterol hydroxyl/sulfo group is often partnered by Asn, Gln, and/or Tyr, while the hydrophobic part of cholesterol interacts with Leu, Ile, Val, and/or Phe; iii ) cholesterol hydrogen-bonding partners are often found on α-helices, while amino acids that interact with cholesterol's hydrophobic core have a slight preference for β-strands and secondary structure-lacking protein areas; iv ) the steroid's C21 and C26 constitute the "hot spots" most often seen for steroid-protein hydrophobic interactions; v ) common "cold spots" are C8-C10, C13, and C17, at which contacts with the proteins were not detected. Several common features we identified for soluble protein-steroid interaction appear evolutionarily conserved. Copyright © 2017 by the American Society for Biochemistry and Molecular Biology, Inc.
Ding, Yiliang; Tang, Yin; Kwok, Chun Kit; Zhang, Yu; Bevilacqua, Philip C; Assmann, Sarah M
RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.
Boraska, Vesna; Jerončić, Ana; Colonna, Vincenza; Southam, Lorraine; Nyholt, Dale R.; William Rayner, Nigel; Perry, John R.B.; Toniolo, Daniela; Albrecht, Eva; Ang, Wei; Bandinelli, Stefania; Barbalic, Maja; Barroso, Inês; Beckmann, Jacques S.; Biffar, Reiner; Boomsma, Dorret; Campbell, Harry; Corre, Tanguy; Erdmann, Jeanette; Esko, Tõnu; Fischer, Krista; Franceschini, Nora; Frayling, Timothy M.; Girotto, Giorgia; Gonzalez, Juan R.; Harris, Tamara B.; Heath, Andrew C.; Heid, Iris M.; Hoffmann, Wolfgang; Hofman, Albert; Horikoshi, Momoko; Hua Zhao, Jing; Jackson, Anne U.; Hottenga, Jouke-Jan; Jula, Antti; Kähönen, Mika; Khaw, Kay-Tee; Kiemeney, Lambertus A.; Klopp, Norman; Kutalik, Zoltán; Lagou, Vasiliki; Launer, Lenore J.; Lehtimäki, Terho; Lemire, Mathieu; Lokki, Marja-Liisa; Loley, Christina; Luan, Jian'an; Mangino, Massimo; Mateo Leach, Irene; Medland, Sarah E.; Mihailov, Evelin; Montgomery, Grant W.; Navis, Gerjan; Newnham, John; Nieminen, Markku S.; Palotie, Aarno; Panoutsopoulou, Kalliope; Peters, Annette; Pirastu, Nicola; Polašek, Ozren; Rehnström, Karola; Ripatti, Samuli; Ritchie, Graham R.S.; Rivadeneira, Fernando; Robino, Antonietta; Samani, Nilesh J.; Shin, So-Youn; Sinisalo, Juha; Smit, Johannes H.; Soranzo, Nicole; Stolk, Lisette; Swinkels, Dorine W.; Tanaka, Toshiko; Teumer, Alexander; Tönjes, Anke; Traglia, Michela; Tuomilehto, Jaakko; Valsesia, Armand; van Gilst, Wiek H.; van Meurs, Joyce B.J.; Smith, Albert Vernon; Viikari, Jorma; Vink, Jacqueline M.; Waeber, Gerard; Warrington, Nicole M.; Widen, Elisabeth; Willemsen, Gonneke; Wright, Alan F.; Zanke, Brent W.; Zgaga, Lina; Boehnke, Michael; d'Adamo, Adamo Pio; de Geus, Eco; Demerath, Ellen W.; den Heijer, Martin; Eriksson, Johan G.; Ferrucci, Luigi; Gieger, Christian; Gudnason, Vilmundur; Hayward, Caroline; Hengstenberg, Christian; Hudson, Thomas J.; Järvelin, Marjo-Riitta; Kogevinas, Manolis; Loos, Ruth J.F.; Martin, Nicholas G.; Metspalu, Andres; Pennell, Craig E.; Penninx, Brenda W.; Perola, Markus; Raitakari, Olli; Salomaa, Veikko; Schreiber, Stefan; Schunkert, Heribert; Spector, Tim D.; Stumvoll, Michael; Uitterlinden, André G.; Ulivi, Sheila; van der Harst, Pim; Vollenweider, Peter; Völzke, Henry; Wareham, Nicholas J.; Wichmann, H.-Erich; Wilson, James F.; Rudan, Igor; Xue, Yali; Zeggini, Eleftheria
The male-to-female sex ratio at birth is constant across world populations with an average of 1.06 (106 male to 100 female live births) for populations of European descent. The sex ratio is considered to be affected by numerous biological and environmental factors and to have a heritable component. The aim of this study was to investigate the presence of common allele modest effects at autosomal and chromosome X variants that could explain the observed sex ratio at birth. We conducted a large-scale genome-wide association scan (GWAS) meta-analysis across 51 studies, comprising overall 114 863 individuals (61 094 women and 53 769 men) of European ancestry and 2 623 828 common (minor allele frequency >0.05) single-nucleotide polymorphisms (SNPs). Allele frequencies were compared between men and women for directly-typed and imputed variants within each study. Forward-time simulations for unlinked, neutral, autosomal, common loci were performed under the demographic model for European populations with a fixed sex ratio and a random mating scheme to assess the probability of detecting significant allele frequency differences. We do not detect any genome-wide significant (P < 5 × 10−8) common SNP differences between men and women in this well-powered meta-analysis. The simulated data provided results entirely consistent with these findings. This large-scale investigation across ∼115 000 individuals shows no detectable contribution from common genetic variants to the observed skew in the sex ratio. The absence of sex-specific differences is useful in guiding genetic association study design, for example when using mixed controls for sex-biased traits. PMID:22843499
Friedlander, Terence W; Roy, Ritu; Tomlins, Scott A; Ngo, Vy T; Kobayashi, Yasuko; Azameera, Aruna; Rubin, Mark A; Pienta, Kenneth J; Chinnaiyan, Arul; Ittmann, Michael M; Ryan, Charles J; Paris, Pamela L
Progression of primary prostate cancer to castration-resistant prostate cancer (CRPC) is associated with numerous genetic and epigenetic alterations that are thought to promote survival at metastatic sites. In this study, we investigated gene copy number and CpG methylation status in CRPC to gain insight into specific pathophysiologic pathways that are active in this advanced form of prostate cancer. Our analysis defined and validated 495 genes exhibiting significant differences in CRPC in gene copy number, including gains in androgen receptor (AR) and losses of PTEN and retinoblastoma 1 (RB1). Significant copy number differences existed between tumors with or without AR gene amplification, including a common loss of AR repressors in AR-unamplified tumors. Simultaneous gene methylation and allelic deletion occurred frequently in RB1 and HSD17B2, the latter of which is involved in testosterone metabolism. Lastly, genomic DNA from most CRPC was hypermethylated compared with benign prostate tissue. Our findings establish a comprehensive methylation signature that couples epigenomic and structural analyses, thereby offering insights into the genomic alterations in CRPC that are associated with a circumvention of hormonal therapy. Genes identified in this integrated genomic study point to new drug targets in CRPC, an incurable disease state which remains the chief therapeutic challenge. ©2012 AACR.
Suzuki, Shino; Kuenen, J Gijs; Schipper, Kira; van der Velde, Suzanne; Ishii, Shun'ichi; Wu, Angela; Sorokin, Dimitry Y; Tenney, Aaron; Meng, XianYing; Morrill, Penny L; Kamagata, Yoichi; Muyzer, Gerard; Nealson, Kenneth H
Serpentinization, or the aqueous alteration of ultramafic rocks, results in challenging environments for life in continental sites due to the combination of extremely high pH, low salinity and lack of obvious electron acceptors and carbon sources. Nevertheless, certain Betaproteobacteria have been frequently observed in such environments. Here we describe physiological and genomic features of three related Betaproteobacterial strains isolated from highly alkaline (pH 11.6) serpentinizing springs at The Cedars, California. All three strains are obligate alkaliphiles with an optimum for growth at pH 11 and are capable of autotrophic growth with hydrogen, calcium carbonate and oxygen. The three strains exhibit differences, however, regarding the utilization of organic carbon and electron acceptors. Their global distribution and physiological, genomic and transcriptomic characteristics indicate that the strains are adapted to the alkaline and calcium-rich environments represented by the terrestrial serpentinizing ecosystems. We propose placing these strains in a new genus 'Serpentinomonas'.
Benoit, Joshua B; Adelman, Zach N; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C; Szuter, Elise M; Hagan, Richard W; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M; Nelson, David R; Rosendale, Andrew J; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R; Ioannidis, Panagiotis; Waterhouse, Robert M; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J Spencer; Gondhalekar, Ameya D; Scharf, Michael E; Peterson, Brittany F; Raje, Kapil R; Hottel, Benjamin A; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S T; Duncan, Elizabeth J; Murali, Shwetha C; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C; Muzny, Donna M; Wheeler, David; Panfilio, Kristen A; Vargas Jentzsch, Iris M; Vargo, Edward L; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T; Anderson, Michelle A E; Jones, Jeffery W; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D; Attardo, Geoffrey M; Robertson, Hugh M; Zdobnov, Evgeny M; Ribeiro, Jose M C; Gibbs, Richard A; Werren, John H; Palli, Subba R; Schal, Coby; Richards, Stephen
The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host-symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human-bed bug and symbiont-bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite.
Benoit, Joshua B.; Adelman, Zach N.; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C.; Szuter, Elise M.; Hagan, Richard W.; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M.; Nelson, David R.; Rosendale, Andrew J.; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M.; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R.; Ioannidis, Panagiotis; Waterhouse, Robert M.; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J. Spencer; Gondhalekar, Ameya D.; Scharf, Michael E.; Peterson, Brittany F.; Raje, Kapil R.; Hottel, Benjamin A.; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S. T.; Duncan, Elizabeth J.; Murali, Shwetha C.; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L.; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C.; Muzny, Donna M.; Wheeler, David; Panfilio, Kristen A.; Vargas Jentzsch, Iris M.; Vargo, Edward L.; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T.; Anderson, Michelle A. E.; Jones, Jeffery W.; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D.; Attardo, Geoffrey M.; Robertson, Hugh M.; Zdobnov, Evgeny M.; Ribeiro, Jose M. C.; Gibbs, Richard A.; Werren, John H.; Palli, Subba R.; Schal, Coby; Richards, Stephen
The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host–symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human–bed bug and symbiont–bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite. PMID:26836814
Scarlet S Shell
Full Text Available RNA-seq technologies have provided significant insight into the transcription networks of mycobacteria. However, such studies provide no definitive information on the translational landscape. Here, we use a combination of high-throughput transcriptome and proteome-profiling approaches to more rigorously understand protein expression in two mycobacterial species. RNA-seq and ribosome profiling in Mycobacterium smegmatis, and transcription start site (TSS mapping and N-terminal peptide mass spectrometry in Mycobacterium tuberculosis, provide complementary, empirical datasets to examine the congruence of transcription and translation in the Mycobacterium genus. We find that nearly one-quarter of mycobacterial transcripts are leaderless, lacking a 5' untranslated region (UTR and Shine-Dalgarno ribosome-binding site. Our data indicate that leaderless translation is a major feature of mycobacterial genomes and is comparably robust to leadered initiation. Using translational reporters to systematically probe the cis-sequence requirements of leaderless translation initiation in mycobacteria, we find that an ATG or GTG at the mRNA 5' end is both necessary and sufficient. This criterion, together with our ribosome occupancy data, suggests that mycobacteria encode hundreds of small, unannotated proteins at the 5' ends of transcripts. The conservation of small proteins in both mycobacterial species tested suggests that some play important roles in mycobacterial physiology. Our translational-reporter system further indicates that mycobacterial leadered translation initiation requires a Shine Dalgarno site in the 5' UTR and that ATG, GTG, TTG, and ATT codons can robustly initiate translation. Our combined approaches provide the first comprehensive view of mycobacterial gene structures and their non-canonical mechanisms of protein expression.
Full Text Available Abstract Background Pichia pastoris is widely used as a production platform for heterologous proteins and model organism for organelle proliferation. Without a published genome sequence available, strain and process development relied mainly on analogies to other, well studied yeasts like Saccharomyces cerevisiae. Results To investigate specific features of growth and protein secretion, we have sequenced the 9.4 Mb genome of the type strain DSMZ 70382 and analyzed the secretome and the sugar transporters. The computationally predicted secretome consists of 88 ORFs. When grown on glucose, only 20 proteins were actually secreted at detectable levels. These data highlight one major feature of P. pastoris, namely the low contamination of heterologous proteins with host cell protein, when applying glucose based expression systems. Putative sugar transporters were identified and compared to those of related yeast species. The genome comprises 2 homologs to S. cerevisiae low affinity transporters and 2 to high affinity transporters of other Crabtree negative yeasts. Contrary to other yeasts, P. pastoris possesses 4 H+/glycerol transporters. Conclusion This work highlights significant advantages of using the P. pastoris system with glucose based expression and fermentation strategies. As only few proteins and no proteases are actually secreted on glucose, it becomes evident that cell lysis is the relevant cause of proteolytic degradation of secreted proteins. The endowment with hexose transporters, dominantly of the high affinity type, limits glucose uptake rates and thus overflow metabolism as observed in S. cerevisiae. The presence of 4 genes for glycerol transporters explains the high specific growth rates on this substrate and underlines the suitability of a glycerol/glucose based fermentation strategy. Furthermore, we present an open access web based genome browser http://www.pichiagenome.org.
Aklujkar, Muktak [University of Massachusetts, Amherst; Haveman, Shelley [University of Massachusetts, Amherst; DiDonatoJr, Raymond [University of Massachusetts, Amherst; Chertkov, Olga [Los Alamos National Laboratory (LANL); Han, Cliff [Los Alamos National Laboratory (LANL); Land, Miriam L [ORNL; Brown, Peter [University of Massachusetts, Amherst; Lovley, Derek [University of Massachusetts, Amherst
Background: The bacterium Pelobacter carbinolicus is able to grow by fermentation, syntrophic hydrogen/formate transfer, or electron transfer to sulfur from short-chain alcohols, hydrogen or formate; it does not oxidize acetate and is not known to ferment any sugars or grow autotrophically. The genome of P. carbinolicus was sequenced in order to understand its metabolic capabilities and physiological features in comparison with its relatives, acetate-oxidizing Geobacter species. Results: Pathways were predicted for catabolism of known substrates: 2,3-butanediol, acetoin, glycerol, 1,2-ethanediol, ethanolamine, choline and ethanol. Multiple isozymes of 2,3-butanediol dehydrogenase, ATP synthase and [FeFe]-hydrogenase were differentiated and assigned roles according to their structural properties and genomic contexts. The absence of asparagine synthetase and the presence of a mutant tRNA for asparagine encoded among RNA-active enzymes suggest that P. carbinolicus may make asparaginyl-tRNA in a novel way. Catabolic glutamate dehydrogenases were discovered, implying that the tricarboxylic acid (TCA) cycle can function catabolically. A phosphotransferase system for uptake of sugars was discovered, along with enzymes that function in 2,3-butanediol production. Pyruvate: ferredoxin/flavodoxin oxidoreductase was identified as a potential bottleneck in both the supply of oxaloacetate for oxidation of acetate by the TCA cycle and the connection of glycolysis to production of ethanol. The P. carbinolicus genome was found to encode autotransporters and various appendages, including three proteins with similarity to the geopilin of electroconductive nanowires. Conclusions: Several surprising metabolic capabilities and physiological features were predicted from the genome of P. carbinolicus, suggesting that it is more versatile than anticipated.
Orlando, Lori A; Sperber, Nina R; Voils, Corrine; Nichols, Marshall; Myers, Rachel A; Wu, R Ryanne; Rakhra-Burris, Tejinder; Levy, Kenneth D; Levy, Mia; Pollin, Toni I; Guan, Yue; Horowitz, Carol R; Ramos, Michelle; Kimmel, Stephen E; McDonough, Caitrin W; Madden, Ebony B; Damschroder, Laura J
PurposeImplementation research provides a structure for evaluating the clinical integration of genomic medicine interventions. This paper describes the Implementing Genomics in Practice (IGNITE) Network's efforts to promote (i) a broader understanding of genomic medicine implementation research and (ii) the sharing of knowledge generated in the network.MethodsTo facilitate this goal, the IGNITE Network Common Measures Working Group (CMG) members adopted the Consolidated Framework for Implementation Research (CFIR) to guide its approach to identifying constructs and measures relevant to evaluating genomic medicine as a whole, standardizing data collection across projects, and combining data in a centralized resource for cross-network analyses.ResultsCMG identified 10 high-priority CFIR constructs as important for genomic medicine. Of those, eight did not have standardized measurement instruments. Therefore, we developed four survey tools to address this gap. In addition, we identified seven high-priority constructs related to patients, families, and communities that did not map to CFIR constructs. Both sets of constructs were combined to create a draft genomic medicine implementation model.ConclusionWe developed processes to identify constructs deemed valuable for genomic medicine implementation and codified them in a model. These resources are freely available to facilitate knowledge generation and sharing across the field.
O. M. Kunah
Full Text Available The role of geomorphological ecogeographical variables have been shown, which are received by means of the digital elevation model created on the basis of remote sensing data as markers of an ecological niche of weeds on an example common milkweed (Asclepias syriaca L.. The research range chooses territory which is in settlement Vovnjanka district (the Poltava region. The range has the linear sizes of 26 kilometres in a direction from the east on the west and 15 kilometres in a direction from the north on the south, the range total area makes 390 км2. As geomorphological variables the topographical wetness index, topographic position index, mass balance index, erosion LS-factor, direct and disseminated insolation, altitude above channel network, multiresolution valley bottom flatness, multiresolution ridge top flatness index, vector ruggedness measure have been considered. It is established, that on set of the geomorphological indicators received by means of digital model of a relief, it is possible to assert, that within a separate agricultural field a wide variety of microconditions which is caused by relief features is formed. Possibly, the variation of thermal and water modes, moisture redistribution, and also productivity mechanical processings of soil and efforts under the control of number of weeds make a background in which limits there is possible a moving of weed plants, including common milkweed.
Wu, Chen-Chi; Chen, Yuh-Shyang; Chen, Pei-Jer; Hsu, Chuan-Jen
The purpose of the study was to investigate the etiological factors and the audiological data of different types of inner ear malformations, which the authors thought might be helpful in elucidating the inter-relation among malformations and shedding light on pathogenesis. Retrospective study from 1998 to 2002 at a tertiary care university hospital. One hundred sixty consecutive children with a total of 302 affected ears undergoing high-resolution computed tomography of the temporal bone for sensorineural hearing loss were enrolled. The image results were correlated with causes and origins, hearing loss patterns, hearing levels, and audiogram configurations. Inner ear malformation was present in 114 (38%) ears. The most common malformations were enlarged vestibular aqueduct, incomplete partition of cochlea (Mondini dysplasia), large vestibule, and semicircular canal dysplasia, presenting either as isolated finding or in combination. Eighty-four (74%) ears had abnormalities confined to these four malformations; only 30 (26%) ears showed other malformations. Patients with complex of enlarged vestibular aqueduct, Mondini dysplasia, large vestibule, and semicircular canal dysplasia (EMVS complex) demonstrated a significantly higher incidence of fluctuating hearing loss (93%) and a better hearing level compared with those with other malformations. Homogeneity in audiological features among these four malformations was also disclosed. The authors identified a distinct clinical entity, the EMVS complex, which is characterized by fluctuating hearing loss and a better hearing level. The authors proposed that malformations belonging to this complex result from a common pathogenetic mechanism.
Jones, Matt; Love, Bradley C.
Historically, accounts of object representation and perceived similarity have focused on intrinsic features. Although more recent accounts have explored how objects, scenes, and situations containing common relational structures come to be perceived as similar, less is known about how the perceived similarity of parts or objects embedded within these relational systems is affected. The current studies test the hypothesis that objects situated in common relational systems come to be perceived as more similar. Similarity increases most for objects playing the same role within a relation (e.g., predator), but also increases for objects playing different roles within the same relation (e.g., the predator or prey role in the hunts relation) regardless of whether the objects participate in the same instance of the relation. This pattern of results can be captured by extending existing models that extract meaning from text corpora so that they are sensitive to the verb-specific thematic roles that objects fill. Alternative explanations based on analogical and inferential processes are also considered, as well as the implications of the current findings to research in language processing, personality and person perception, decision making, and category learning. PMID:17094958
Full Text Available Abstract Background Protein-protein interaction data used in the creation or prediction of molecular networks is usually obtained from large scale or high-throughput experiments. This experimental data is liable to contain a large number of spurious interactions. Hence, there is a need to validate the interactions and filter out the incorrect data before using them in prediction studies. Results In this study, we use a combination of 3 genomic features – structurally known interacting Pfam domains, Gene Ontology annotations and sequence homology – as a means to assign reliability to the protein-protein interactions in Saccharomyces cerevisiae determined by high-throughput experiments. Using Bayesian network approaches, we show that protein-protein interactions from high-throughput data supported by one or more genomic features have a higher likelihood ratio and hence are more likely to be real interactions. Our method has a high sensitivity (90% and good specificity (63%. We show that 56% of the interactions from high-throughput experiments in Saccharomyces cerevisiae have high reliability. We use the method to estimate the number of true interactions in the high-throughput protein-protein interaction data sets in Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens to be 27%, 18% and 68% respectively. Our results are available for searching and downloading at http://helix.protein.osaka-u.ac.jp/htp/. Conclusion A combination of genomic features that include sequence, structure and annotation information is a good predictor of true interactions in large and noisy high-throughput data sets. The method has a very high sensitivity and good specificity and can be used to assign a likelihood ratio, corresponding to the reliability, to each interaction.
Michael B Walker
Full Text Available Arrangements of genes along chromosomes are a product of evolutionary processes, and we can expect that preferable arrangements will prevail over the span of evolutionary time, often being reflected in the non-random clustering of structurally and/or functionally related genes. Such non-random arrangements can arise by two distinct evolutionary processes: duplications of DNA sequences that give rise to clusters of genes sharing both sequence similarity and common sequence features and the migration together of genes related by function, but not by common descent. To provide a background for distinguishing between the two, which is important for future efforts to unravel the evolutionary processes involved, we here provide a description of the extent to which ancestrally related genes are found in proximity.Towards this purpose, we combined information from five genomic datasets, InterPro, SCOP, PANTHER, Ensembl protein families, and Ensembl gene paralogs. The results are provided in publicly available datasets (http://cgd.jax.org/datasets/clustering/paraclustering.shtml describing the extent to which ancestrally related genes are in proximity beyond what is expected by chance (i.e. form paraclusters in the human and nine other vertebrate genomes, as well as the D. melanogaster, C. elegans, A. thaliana, and S. cerevisiae genomes. With the exception of Saccharomyces, paraclusters are a common feature of the genomes we examined. In the human genome they are estimated to include at least 22% of all protein coding genes. Paraclusters are far more prevalent among some gene families than others, are highly species or clade specific and can evolve rapidly, sometimes in response to environmental cues. Altogether, they account for a large portion of the functional clustering previously reported in several genomes.
Full Text Available Dogs, with their breed-determined limited genetic background, are great models of human disease including cancer. Canine B-cell lymphoma and hemangiosarcoma are both malignancies of the hematologic system that are clinically and histologically similar to human B-cell non-Hodgkin lymphoma and angiosarcoma, respectively. Golden retrievers in the US show significantly elevated lifetime risk for both B-cell lymphoma (6% and hemangiosarcoma (20%. We conducted genome-wide association studies for hemangiosarcoma and B-cell lymphoma, identifying two shared predisposing loci. The two associated loci are located on chromosome 5, and together contribute ~20% of the risk of developing these cancers. Genome-wide p-values for the top SNP of each locus are 4.6×10-7 and 2.7×10-6, respectively. Whole genome resequencing of nine cases and controls followed by genotyping and detailed analysis identified three shared and one B-cell lymphoma specific risk haplotypes within the two loci, but no coding changes were associated with the risk haplotypes. Gene expression analysis of B-cell lymphoma tumors revealed that carrying the risk haplotypes at the first locus is associated with down-regulation of several nearby genes including the proximal gene TRPC6, a transient receptor Ca2+-channel involved in T-cell activation, among other functions. The shared risk haplotype in the second locus overlaps the vesicle transport and release gene STX8. Carrying the shared risk haplotype is associated with gene expression changes of 100 genes enriched for pathways involved in immune cell activation. Thus, the predisposing germ-line mutations in B-cell lymphoma and hemangiosarcoma appear to be regulatory, and affect pathways involved in T-cell mediated immune response in the tumor. This suggests that the interaction between the immune system and malignant cells plays a common role in the tumorigenesis of these relatively different cancers.
Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R.; Correia, Catarina; Abrahams, Brett S.; Sykes, Nuala; Pagnamenta, Alistair T.; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J.; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F.; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R.; Casallo, Guillermo; Casey, Jillian; Chu, Su H.; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L.; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A.; Folstein, Susan E.; Fombonne, Eric; Freitag, Christine M.; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T.; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J.; Hakonarson, Hakon; Heron, Elizabeth A.; Hill, Matthew; Holt, Richard; Howe, Jennifer L.; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M.; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M.; Lamb, Janine A.; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L.; Lionel, Anath C.; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C.; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R.; McConachie, Helen; McDougle, Christopher J.; McGrath, Jane; McMahon, William M.; Melhem, Nadine M.; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J.; Mirza, Ghazala K.; Munson, Jeff; Nelson, Stanley F.; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R.; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L.; Bierut, Laura J.; Rice, John P.; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C.; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P.; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B.; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H.; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L.; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D.; Cantor, Rita M.; Cook, Edwin H.; Coon, Hilary; Cuccaro, Michael L.; Gallagher, Louise; Geschwind, Daniel H.; Gill, Michael; Haines, Jonathan L.; Miller, Judith; Monaco, Anthony P.; Nurnberger, John I.; Paterson, Andrew D.; Pericak-Vance, Margaret A.; Schellenberg, Gerard D.; Scherer, Stephen W.; Sutcliffe, James S.; Szatmari, Peter; Vicente, Astrid M.; Vieland, Veronica J.; Wijsman, Ellen M.; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim
Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10−8. When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10−8 threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C. PMID:20663923
Wood, Andrew R; Esko, Tonu; Yang, Jian; Vedantam, Sailaja; Pers, Tune H; Gustafsson, Stefan; Chu, Audrey Y; Estrada, Karol; Luan, Jian'an; Kutalik, Zoltán; Amin, Najaf; Buchkovich, Martin L; Croteau-Chonka, Damien C; Day, Felix R; Duan, Yanan; Fall, Tove; Fehrmann, Rudolf; Ferreira, Teresa; Jackson, Anne U; Karjalainen, Juha; Lo, Ken Sin; Locke, Adam E; Mägi, Reedik; Mihailov, Evelin; Porcu, Eleonora; Randall, Joshua C; Scherag, André; Vinkhuyzen, Anna A E; Westra, Harm-Jan; Winkler, Thomas W; Workalemahu, Tsegaselassie; Zhao, Jing Hua; Absher, Devin; Albrecht, Eva; Anderson, Denise; Baron, Jeffrey; Beekman, Marian; Demirkan, Ayse; Ehret, Georg B; Feenstra, Bjarke; Feitosa, Mary F; Fischer, Krista; Fraser, Ross M; Goel, Anuj; Gong, Jian; Justice, Anne E; Kanoni, Stavroula; Kleber, Marcus E; Kristiansson, Kati; Lim, Unhee; Lotay, Vaneet; Lui, Julian C; Mangino, Massimo; Mateo Leach, Irene; Medina-Gomez, Carolina; Nalls, Michael A; Nyholt, Dale R; Palmer, Cameron D; Pasko, Dorota; Pechlivanis, Sonali; Prokopenko, Inga; Ried, Janina S; Ripke, Stephan; Shungin, Dmitry; Stancáková, Alena; Strawbridge, Rona J; Sung, Yun Ju; Tanaka, Toshiko; Teumer, Alexander; Trompet, Stella; van der Laan, Sander W; van Setten, Jessica; Van Vliet-Ostaptchouk, Jana V; Wang, Zhaoming; Yengo, Loïc; Zhang, Weihua; Afzal, Uzma; Arnlöv, Johan; Arscott, Gillian M; Bandinelli, Stefania; Barrett, Amy; Bellis, Claire; Bennett, Amanda J; Berne, Christian; Blüher, Matthias; Bolton, Jennifer L; Böttcher, Yvonne; Boyd, Heather A; Bruinenberg, Marcel; Buckley, Brendan M; Buyske, Steven; Caspersen, Ida H; Chines, Peter S; Clarke, Robert; Claudi-Boehm, Simone; Cooper, Matthew; Daw, E Warwick; De Jong, Pim A; Deelen, Joris; Delgado, Graciela; Denny, Josh C; Dhonukshe-Rutten, Rosalie; Dimitriou, Maria; Doney, Alex S F; Dörr, Marcus; Eklund, Niina; Eury, Elodie; Folkersen, Lasse; Garcia, Melissa E; Geller, Frank; Giedraitis, Vilmantas; Go, Alan S; Grallert, Harald; Grammer, Tanja B; Gräßler, Jürgen; Grönberg, Henrik; de Groot, Lisette C P G M; Groves, Christopher J; Haessler, Jeffrey; Hall, Per; Haller, Toomas; Hallmans, Goran; Hannemann, Anke; Hartman, Catharina A; Hassinen, Maija; Hayward, Caroline; Heard-Costa, Nancy L; Helmer, Quinta; Hemani, Gibran; Henders, Anjali K; Hillege, Hans L; Hlatky, Mark A; Hoffmann, Wolfgang; Hoffmann, Per; Holmen, Oddgeir; Houwing-Duistermaat, Jeanine J; Illig, Thomas; Isaacs, Aaron; James, Alan L; Jeff, Janina; Johansen, Berit; Johansson, Åsa; Jolley, Jennifer; Juliusdottir, Thorhildur; Junttila, Juhani; Kho, Abel N; Kinnunen, Leena; Klopp, Norman; Kocher, Thomas; Kratzer, Wolfgang; Lichtner, Peter; Lind, Lars; Lindström, Jaana; Lobbens, Stéphane; Lorentzon, Mattias; Lu, Yingchang; Lyssenko, Valeriya; Magnusson, Patrik K E; Mahajan, Anubha; Maillard, Marc; McArdle, Wendy L; McKenzie, Colin A; McLachlan, Stela; McLaren, Paul J; Menni, Cristina; Merger, Sigrun; Milani, Lili; Moayyeri, Alireza; Monda, Keri L; Morken, Mario A; Müller, Gabriele; Müller-Nurasyid, Martina; Musk, Arthur W; Narisu, Narisu; Nauck, Matthias; Nolte, Ilja M; Nöthen, Markus M; Oozageer, Laticia; Pilz, Stefan; Rayner, Nigel W; Renstrom, Frida; Robertson, Neil R; Rose, Lynda M; Roussel, Ronan; Sanna, Serena; Scharnagl, Hubert; Scholtens, Salome; Schumacher, Fredrick R; Schunkert, Heribert; Scott, Robert A; Sehmi, Joban; Seufferlein, Thomas; Shi, Jianxin; Silventoinen, Karri; Smit, Johannes H; Smith, Albert Vernon; Smolonska, Joanna; Stanton, Alice V; Stirrups, Kathleen; Stott, David J; Stringham, Heather M; Sundström, Johan; Swertz, Morris A; Syvänen, Ann-Christine; Tayo, Bamidele O; Thorleifsson, Gudmar; Tyrer, Jonathan P; van Dijk, Suzanne; van Schoor, Natasja M; van der Velde, Nathalie; van Heemst, Diana; van Oort, Floor V A; Vermeulen, Sita H; Verweij, Niek; Vonk, Judith M; Waite, Lindsay L; Waldenberger, Melanie; Wennauer, Roman; Wilkens, Lynne R; Willenborg, Christina; Wilsgaard, Tom; Wojczynski, Mary K; Wong, Andrew; Wright, Alan F; Zhang, Qunyuan; Arveiler, Dominique; Bakker, Stephan J L; Beilby, John; Bergman, Richard N; Bergmann, Sven; Biffar, Reiner; Blangero, John; Boomsma, Dorret I; Bornstein, Stefan R; Bovet, Pascal; Brambilla, Paolo; Brown, Morris J; Campbell, Harry; Caulfield, Mark J; Chakravarti, Aravinda; Collins, Rory; Collins, Francis S; Crawford, Dana C; Cupples, L Adrienne; Danesh, John; de Faire, Ulf; den Ruijter, Hester M; Erbel, Raimund; Erdmann, Jeanette; Eriksson, Johan G; Farrall, Martin; Ferrannini, Ele; Ferrières, Jean; Ford, Ian; Forouhi, Nita G; Forrester, Terrence; Gansevoort, Ron T; Gejman, Pablo V; Gieger, Christian; Golay, Alain; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Haas, David W; Hall, Alistair S; Harris, Tamara B; Hattersley, Andrew T; Heath, Andrew C; Hengstenberg, Christian; Hicks, Andrew A; Hindorff, Lucia A; Hingorani, Aroon D; Hofman, Albert; Hovingh, G Kees; Humphries, Steve E; Hunt, Steven C; Hypponen, Elina; Jacobs, Kevin B; Jarvelin, Marjo-Riitta; Jousilahti, Pekka; Jula, Antti M; Kaprio, Jaakko; Kastelein, John J P; Kayser, Manfred; Kee, Frank; Keinanen-Kiukaanniemi, Sirkka M; Kiemeney, Lambertus A; Kooner, Jaspal S; Kooperberg, Charles; Koskinen, Seppo; Kovacs, Peter; Kraja, Aldi T; Kumari, Meena; Kuusisto, Johanna; Lakka, Timo A; Langenberg, Claudia; Le Marchand, Loic; Lehtimäki, Terho; Lupoli, Sara; Madden, Pamela A F; Männistö, Satu; Manunta, Paolo; Marette, André; Matise, Tara C; McKnight, Barbara; Meitinger, Thomas; Moll, Frans L; Montgomery, Grant W; Morris, Andrew D; Morris, Andrew P; Murray, Jeffrey C; Nelis, Mari; Ohlsson, Claes; Oldehinkel, Albertine J; Ong, Ken K; Ouwehand, Willem H; Pasterkamp, Gerard; Peters, Annette; Pramstaller, Peter P; Price, Jackie F; Qi, Lu; Raitakari, Olli T; Rankinen, Tuomo; Rao, D C; Rice, Treva K; Ritchie, Marylyn; Rudan, Igor; Salomaa, Veikko; Samani, Nilesh J; Saramies, Jouko; Sarzynski, Mark A; Schwarz, Peter E H; Sebert, Sylvain; Sever, Peter; Shuldiner, Alan R; Sinisalo, Juha; Steinthorsdottir, Valgerdur; Stolk, Ronald P; Tardif, Jean-Claude; Tönjes, Anke; Tremblay, Angelo; Tremoli, Elena; Virtamo, Jarmo; Vohl, Marie-Claude; Amouyel, Philippe; Asselbergs, Folkert W; Assimes, Themistocles L; Bochud, Murielle; Boehm, Bernhard O; Boerwinkle, Eric; Bottinger, Erwin P; Bouchard, Claude; Cauchi, Stéphane; Chambers, John C; Chanock, Stephen J; Cooper, Richard S; de Bakker, Paul I W; Dedoussis, George; Ferrucci, Luigi; Franks, Paul W; Froguel, Philippe; Groop, Leif C; Haiman, Christopher A; Hamsten, Anders; Hayes, M Geoffrey; Hui, Jennie; Hunter, David J; Hveem, Kristian; Jukema, J Wouter; Kaplan, Robert C; Kivimaki, Mika; Kuh, Diana; Laakso, Markku; Liu, Yongmei; Martin, Nicholas G; März, Winfried; Melbye, Mads; Moebus, Susanne; Munroe, Patricia B; Njølstad, Inger; Oostra, Ben A; Palmer, Colin N A; Pedersen, Nancy L; Perola, Markus; Pérusse, Louis; Peters, Ulrike; Powell, Joseph E; Power, Chris; Quertermous, Thomas; Rauramaa, Rainer; Reinmaa, Eva; Ridker, Paul M; Rivadeneira, Fernando; Rotter, Jerome I; Saaristo, Timo E; Saleheen, Danish; Schlessinger, David; Slagboom, P Eline; Snieder, Harold; Spector, Tim D; Strauch, Konstantin; Stumvoll, Michael; Tuomilehto, Jaakko; Uusitupa, Matti; van der Harst, Pim; Völzke, Henry; Walker, Mark; Wareham, Nicholas J; Watkins, Hugh; Wichmann, H-Erich; Wilson, James F; Zanen, Pieter; Deloukas, Panos; Heid, Iris M; Lindgren, Cecilia M; Mohlke, Karen L; Speliotes, Elizabeth K; Thorsteinsdottir, Unnur; Barroso, Inês; Fox, Caroline S; North, Kari E; Strachan, David P; Beckmann, Jacques S; Berndt, Sonja I; Boehnke, Michael; Borecki, Ingrid B; McCarthy, Mark I; Metspalu, Andres; Stefansson, Kari; Uitterlinden, André G; van Duijn, Cornelia M; Franke, Lude; Willer, Cristen J; Price, Alkes L; Lettre, Guillaume; Loos, Ruth J F; Weedon, Michael N; Ingelsson, Erik; O'Connell, Jeffrey R; Abecasis, Goncalo R; Chasman, Daniel I; Goddard, Michael E; Visscher, Peter M; Hirschhorn, Joel N; Frayling, Timothy M
Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ∼2,000, ∼3,700 and ∼9,500 SNPs explained ∼21%, ∼24% and ∼29% of phenotypic variance. Furthermore, all common variants together captured 60% of heritability. The 697 variants clustered in 423 loci were enriched for genes, pathways and tissue types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/β-catenin and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants.
Chu, Audrey Y; Estrada, Karol; Luan, Jian’an; Kutalik, Zoltán; Amin, Najaf; Buchkovich, Martin L; Croteau-Chonka, Damien C; Day, Felix R; Duan, Yanan; Fall, Tove; Fehrmann, Rudolf; Ferreira, Teresa; Jackson, Anne U; Karjalainen, Juha; Lo, Ken Sin; Locke, Adam E; Mägi, Reedik; Mihailov, Evelin; Porcu, Eleonora; Randall, Joshua C; Scherag, André; Vinkhuyzen, Anna AE; Westra, Harm-Jan; Winkler, Thomas W; Workalemahu, Tsegaselassie; Zhao, Jing Hua; Absher, Devin; Albrecht, Eva; Anderson, Denise; Baron, Jeffrey; Beekman, Marian; Demirkan, Ayse; Ehret, Georg B; Feenstra, Bjarke; Feitosa, Mary F; Fischer, Krista; Fraser, Ross M; Goel, Anuj; Gong, Jian; Justice, Anne E; Kanoni, Stavroula; Kleber, Marcus E; Kristiansson, Kati; Lim, Unhee; Lotay, Vaneet; Lui, Julian C; Mangino, Massimo; Leach, Irene Mateo; Medina-Gomez, Carolina; Nalls, Michael A; Nyholt, Dale R; Palmer, Cameron D; Pasko, Dorota; Pechlivanis, Sonali; Prokopenko, Inga; Ried, Janina S; Ripke, Stephan; Shungin, Dmitry; Stancáková, Alena; Strawbridge, Rona J; Sung, Yun Ju; Tanaka, Toshiko; Teumer, Alexander; Trompet, Stella; van der Laan, Sander W; van Setten, Jessica; Van Vliet-Ostaptchouk, Jana V; Wang, Zhaoming; Yengo, Loïc; Zhang, Weihua; Afzal, Uzma; Ärnlöv, Johan; Arscott, Gillian M; Bandinelli, Stefania; Barrett, Amy; Bellis, Claire; Bennett, Amanda J; Berne, Christian; Blüher, Matthias; Bolton, Jennifer L; Böttcher, Yvonne; Boyd, Heather A; Bruinenberg, Marcel; Buckley, Brendan M; Buyske, Steven; Caspersen, Ida H; Chines, Peter S; Clarke, Robert; Claudi-Boehm, Simone; Cooper, Matthew; Daw, E Warwick; De Jong, Pim A; Deelen, Joris; Delgado, Graciela; Denny, Josh C; Dhonukshe-Rutten, Rosalie; Dimitriou, Maria; Doney, Alex SF; Dörr, Marcus; Eklund, Niina; Eury, Elodie; Folkersen, Lasse; Garcia, Melissa E; Geller, Frank; Giedraitis, Vilmantas; Go, Alan S; Grallert, Harald; Grammer, Tanja B; Gräßler, Jürgen; Grönberg, Henrik; de Groot, Lisette C.P.G.M.; Groves, Christopher J; Haessler, Jeffrey; Hall, Per; Haller, Toomas; Hallmans, Goran; Hannemann, Anke; Hartman, Catharina A; Hassinen, Maija; Hayward, Caroline; Heard-Costa, Nancy L; Helmer, Quinta; Hemani, Gibran; Henders, Anjali K; Hillege, Hans L; Hlatky, Mark A; Hoffmann, Wolfgang; Hoffmann, Per; Holmen, Oddgeir; Houwing-Duistermaat, Jeanine J; Illig, Thomas; Isaacs, Aaron; James, Alan L; Jeff, Janina; Johansen, Berit; Johansson, Åsa; Jolley, Jennifer; Juliusdottir, Thorhildur; Junttila, Juhani; Kho, Abel N; Kinnunen, Leena; Klopp, Norman; Kocher, Thomas; Kratzer, Wolfgang; Lichtner, Peter; Lind, Lars; Lindström, Jaana; Lobbens, Stéphane; Lorentzon, Mattias; Lu, Yingchang; Lyssenko, Valeriya; Magnusson, Patrik KE; Mahajan, Anubha; Maillard, Marc; McArdle, Wendy L; McKenzie, Colin A; McLachlan, Stela; McLaren, Paul J; Menni, Cristina; Merger, Sigrun; Milani, Lili; Moayyeri, Alireza; Monda, Keri L; Morken, Mario A; Müller, Gabriele; Müller-Nurasyid, Martina; Musk, Arthur W; Narisu, Narisu; Nauck, Matthias; Nolte, Ilja M; Nöthen, Markus M; Oozageer, Laticia; Pilz, Stefan; Rayner, Nigel W; Renstrom, Frida; Robertson, Neil R; Rose, Lynda M; Roussel, Ronan; Sanna, Serena; Scharnagl, Hubert; Scholtens, Salome; Schumacher, Fredrick R; Schunkert, Heribert; Scott, Robert A; Sehmi, Joban; Seufferlein, Thomas; Shi, Jianxin; Silventoinen, Karri; Smit, Johannes H; Smith, Albert Vernon; Smolonska, Joanna; Stanton, Alice V; Stirrups, Kathleen; Stott, David J; Stringham, Heather M; Sundström, Johan; Swertz, Morris A; Syvänen, Ann-Christine; Tayo, Bamidele O; Thorleifsson, Gudmar; Tyrer, Jonathan P; van Dijk, Suzanne; van Schoor, Natasja M; van der Velde, Nathalie; van Heemst, Diana; van Oort, Floor VA; Vermeulen, Sita H; Verweij, Niek; Vonk, Judith M; Waite, Lindsay L; Waldenberger, Melanie; Wennauer, Roman; Wilkens, Lynne R; Willenborg, Christina; Wilsgaard, Tom; Wojczynski, Mary K; Wong, Andrew; Wright, Alan F; Zhang, Qunyuan; Arveiler, Dominique; Bakker, Stephan JL; Beilby, John; Bergman, Richard N; Bergmann, Sven; Biffar, Reiner; Blangero, John; Boomsma, Dorret I; Bornstein, Stefan R; Bovet, Pascal; Brambilla, Paolo; Brown, Morris J; Campbell, Harry; Caulfield, Mark J; Chakravarti, Aravinda; Collins, Rory; Collins, Francis S; Crawford, Dana C; Cupples, L Adrienne; Danesh, John; de Faire, Ulf; den Ruijter, Hester M; Erbel, Raimund; Erdmann, Jeanette; Eriksson, Johan G; Farrall, Martin; Ferrannini, Ele; Ferrières, Jean; Ford, Ian; Forouhi, Nita G; Forrester, Terrence; Gansevoort, Ron T; Gejman, Pablo V; Gieger, Christian; Golay, Alain; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Haas, David W; Hall, Alistair S; Harris, Tamara B; Hattersley, Andrew T; Heath, Andrew C; Hengstenberg, Christian; Hicks, Andrew A; Hindorff, Lucia A; Hingorani, Aroon D; Hofman, Albert; Hovingh, G Kees; Humphries, Steve E; Hunt, Steven C; Hypponen, Elina; Jacobs, Kevin B; Jarvelin, Marjo-Riitta; Jousilahti, Pekka; Jula, Antti M; Kaprio, Jaakko; Kastelein, John JP; Kayser, Manfred; Kee, Frank; Keinanen-Kiukaanniemi, Sirkka M; Kiemeney, Lambertus A; Kooner, Jaspal S; Kooperberg, Charles; Koskinen, Seppo; Kovacs, Peter; Kraja, Aldi T; Kumari, Meena; Kuusisto, Johanna; Lakka, Timo A; Langenberg, Claudia; Le Marchand, Loic; Lehtimäki, Terho; Lupoli, Sara; Madden, Pamela AF; Männistö, Satu; Manunta, Paolo; Marette, André; Matise, Tara C; McKnight, Barbara; Meitinger, Thomas; Moll, Frans L; Montgomery, Grant W; Morris, Andrew D; Morris, Andrew P; Murray, Jeffrey C; Nelis, Mari; Ohlsson, Claes; Oldehinkel, Albertine J; Ong, Ken K; Ouwehand, Willem H; Pasterkamp, Gerard; Peters, Annette; Pramstaller, Peter P; Price, Jackie F; Qi, Lu; Raitakari, Olli T; Rankinen, Tuomo; Rao, DC; Rice, Treva K; Ritchie, Marylyn; Rudan, Igor; Salomaa, Veikko; Samani, Nilesh J; Saramies, Jouko; Sarzynski, Mark A; Schwarz, Peter EH; Sebert, Sylvain; Sever, Peter; Shuldiner, Alan R; Sinisalo, Juha; Steinthorsdottir, Valgerdur; Stolk, Ronald P; Tardif, Jean-Claude; Tönjes, Anke; Tremblay, Angelo; Tremoli, Elena; Virtamo, Jarmo; Vohl, Marie-Claude; Amouyel, Philippe; Asselbergs, Folkert W; Assimes, Themistocles L; Bochud, Murielle; Boehm, Bernhard O; Boerwinkle, Eric; Bottinger, Erwin P; Bouchard, Claude; Cauchi, Stéphane; Chambers, John C; Chanock, Stephen J; Cooper, Richard S; de Bakker, Paul IW; Dedoussis, George; Ferrucci, Luigi; Franks, Paul W; Froguel, Philippe; Groop, Leif C; Haiman, Christopher A; Hamsten, Anders; Hayes, M Geoffrey; Hui, Jennie; Hunter, David J.; Hveem, Kristian; Jukema, J Wouter; Kaplan, Robert C; Kivimaki, Mika; Kuh, Diana; Laakso, Markku; Liu, Yongmei; Martin, Nicholas G; März, Winfried; Melbye, Mads; Moebus, Susanne; Munroe, Patricia B; Njølstad, Inger; Oostra, Ben A; Palmer, Colin NA; Pedersen, Nancy L; Perola, Markus; Pérusse, Louis; Peters, Ulrike; Powell, Joseph E; Power, Chris; Quertermous, Thomas; Rauramaa, Rainer; Reinmaa, Eva; Ridker, Paul M; Rivadeneira, Fernando; Rotter, Jerome I; Saaristo, Timo E; Saleheen, Danish; Schlessinger, David; Slagboom, P Eline; Snieder, Harold; Spector, Tim D; Strauch, Konstantin; Stumvoll, Michael; Tuomilehto, Jaakko; Uusitupa, Matti; van der Harst, Pim; Völzke, Henry; Walker, Mark; Wareham, Nicholas J; Watkins, Hugh; Wichmann, H-Erich; Wilson, James F; Zanen, Pieter; Deloukas, Panos; Heid, Iris M; Lindgren, Cecilia M; Mohlke, Karen L; Speliotes, Elizabeth K; Thorsteinsdottir, Unnur; Barroso, Inês; Fox, Caroline S; North, Kari E; Strachan, David P; Beckmann, Jacques S.; Berndt, Sonja I; Boehnke, Michael; Borecki, Ingrid B; McCarthy, Mark I; Metspalu, Andres; Stefansson, Kari; Uitterlinden, André G; van Duijn, Cornelia M; Franke, Lude; Willer, Cristen J; Price, Alkes L.; Lettre, Guillaume; Loos, Ruth JF; Weedon, Michael N; Ingelsson, Erik; O’Connell, Jeffrey R; Abecasis, Goncalo R; Chasman, Daniel I; Goddard, Michael E
Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explain one-fifth of heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ~2,000, ~3,700 and ~9,500 SNPs explained ~21%, ~24% and ~29% of phenotypic variance. Furthermore, all common variants together captured the majority (60%) of heritability. The 697 variants clustered in 423 loci enriched for genes, pathways, and tissue-types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/beta-catenin, and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants. PMID:25282103
Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10(-8). When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner\\'s curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10(-8) threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C.
Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R; Correia, Catarina; Abrahams, Brett S; Sykes, Nuala; Pagnamenta, Alistair T; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R; Casallo, Guillermo; Casey, Jillian; Chu, Su H; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A; Folstein, Susan E; Fombonne, Eric; Freitag, Christine M; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J; Hakonarson, Hakon; Heron, Elizabeth A; Hill, Matthew; Holt, Richard; Howe, Jennifer L; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M; Lamb, Janine A; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L; Lionel, Anath C; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R; McConachie, Helen; McDougle, Christopher J; McGrath, Jane; McMahon, William M; Melhem, Nadine M; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J; Mirza, Ghazala K; Munson, Jeff; Nelson, Stanley F; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L; Bierut, Laura J; Rice, John P; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D; Cantor, Rita M; Cook, Edwin H; Coon, Hilary; Cuccaro, Michael L; Gallagher, Louise; Geschwind, Daniel H; Gill, Michael; Haines, Jonathan L; Miller, Judith; Monaco, Anthony P; Nurnberger, John I; Paterson, Andrew D; Pericak-Vance, Margaret A; Schellenberg, Gerard D; Scherer, Stephen W; Sutcliffe, James S; Szatmari, Peter; Vicente, Astrid M; Vieland, Veronica J; Wijsman, Ellen M; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim
Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10(-8). When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10(-8) threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C.
Cazzato, Daniele; Castori, Marco; Lombardi, Raffaella; Caravello, Francesca; Bella, Eleonora Dalla; Petrucci, Antonio; Grammatico, Paola; Dordoni, Chiara; Colombi, Marina
Objective: To investigate the involvement of small nerve fibers in Ehlers-Danlos syndrome (EDS). Methods: Patients diagnosed with EDS underwent clinical, neurophysiologic, and skin biopsy assessment. We recorded sensory symptoms and signs and evaluated presence and severity of neuropathic pain according to the Douleur Neuropathique 4 (DN4) and ID Pain questionnaires and the Numeric Rating Scale (NRS). Sensory action potential amplitude and conduction velocity of sural nerve was recorded. Skin biopsy was performed at distal leg and intraepidermal nerve fiber density (IENFD) obtained and referred to published sex- and age-adjusted normative reference values. Results: Our cohort included 20 adults with joint hypermobility syndrome/hypermobility EDS, 3 patients with vascular EDS, and 1 patient with classic EDS. All except one patient had neuropathic pain according to DN4 and ID Pain questionnaires and reported 7 or more symptoms at the Small Fiber Neuropathy Symptoms Inventory Questionnaire. Pain intensity was moderate (NRS ≥4 and <7) in 8 patients and severe (NRS ≥7) in 11 patients. Sural nerve conduction study was normal in all patients. All patients showed a decrease of IENFD consistent with the diagnosis of small fiber neuropathy (SFN), regardless of the EDS type. Conclusions: SFN is a common feature in adults with EDS. Skin biopsy could be considered an additional diagnostic tool to investigate pain manifestations in EDS. PMID:27306637
Obert, Alexandre; Gierski, Fabien; Calmus, Arnaud; Flucher, Aurélie; Portefaix, Christophe; Pierot, Laurent; Kaladjian, Arthur; Caillies, Stéphanie
Irony is a kind of figurative language used by a speaker to say something that contrasts with the context and, to some extent, lends humor to a situation. However, little is known about the brain regions that specifically support the processing of these two common features of irony. The present study had two main aims: (i) investigate the neural basis of irony processing, by delivering short ironic spoken sentences (and their literal counterparts) to participants undergoing fMRI; and (ii) assess the neural effect of two irony parameters, obtained from normative studies: degree of contrast and humor appreciation. Results revealed activation of the bilateral inferior frontal gyrus (IFG), posterior part of the left superior temporal gyrus, medial frontal cortex, and left caudate during irony processing, suggesting the involvement of both semantic and theory-of-mind networks. Parametric models showed that contrast was specifically associated with the activation of bilateral frontal and subcortical areas, and that these regions were also sensitive to humor, as shown by a conjunction analysis. Activation of the bilateral IFG is consistent with the literature on humor processing, and reflects incongruity detection/resolution processes. Moreover, the activation of subcortical structures can be related to the reward processing of social events. PMID:27851821
Bastir, Markus; Godoy, Paula; Rosas, Antonio
Sexual dimorphism in the human craniofacial system is an important feature of intraspecific variation in recent and fossil humans. Although several studies have reported different morphological patterns of sexual dimorphism in different populations, this study searches for common morphological aspects related to functional anatomy of the respiratory apparatus. 3D geometric morphometrics were used to test the hypothesis that due to higher daily energy expenditure and associated greater respiratory air consumption as well as differences in body composition, males should have absolutely and relatively greater air passages in the bony cranial airways than females. We measured 25 3D landmarks in five populations (N = 212) of adult humans from different geographic regions. Male average cranial airways were larger in centroid sizes than female ones. Males tended to show relatively taller piriform apertures and, more consistently, relatively taller internal nasal cavities and choanae than females. Multivariate regressions and residual analysis further indicated that after standardizing to the same size, males still show relatively larger airway passages than females. Because the dimensions of the choanae are limiting factors for air transmission towards the noncranial part of the respiratory system, the identified sex-specific differences in cranial airways, possibly shared among human populations, may be linked with sex-specific differences in body size, composition, and energetics. These findings may be important to understanding trends in hominin facial evolution. Copyright © 2011 Wiley-Liss, Inc.
Christopher A Lavender
Full Text Available Antisense transcription is a prevalent feature at mammalian promoters. Previous studies have primarily focused on antisense transcription initiating upstream of genes. Here, we characterize promoter-proximal antisense transcription downstream of gene transcription starts sites in human breast cancer cells, investigating the genomic context of downstream antisense transcription. We find extensive correlations between antisense transcription and features associated with the chromatin environment at gene promoters. Antisense transcription downstream of promoters is widespread, with antisense transcription initiation observed within 2 kb of 28% of gene transcription start sites. Antisense transcription initiates between nucleosomes regularly positioned downstream of these promoters. The nucleosomes between gene and downstream antisense transcription start sites carry histone modifications associated with active promoters, such as H3K4me3 and H3K27ac. This region is bound by chromatin remodeling and histone modifying complexes including SWI/SNF subunits and HDACs, suggesting that antisense transcription or resulting RNA transcripts contribute to the creation and maintenance of a promoter-associated chromatin environment. Downstream antisense transcription overlays additional regulatory features, such as transcription factor binding, DNA accessibility, and the downstream edge of promoter-associated CpG islands. These features suggest an important role for antisense transcription in the regulation of gene expression and the maintenance of a promoter-associated chromatin environment.
Boocock, James; Chagné, David; Merriman, Tony R; Black, Michael A
Copy number variation (CNV) is a common feature of eukaryotic genomes, and a growing body of evidence suggests that genes affected by CNV are enriched in processes that are associated with environmental responses. Here we use next generation sequence (NGS) data to detect copy-number variable regions (CNVRs) within the Malus x domestica genome, as well as to examine their distribution and impact. CNVRs were detected using NGS data derived from 30 accessions of M. x domestica analyzed using the read-depth method, as implemented in the CNVrd2 software. To improve the reliability of our results, we developed a quality control and analysis procedure that involved checking for organelle DNA, not repeat masking, and the determination of CNVR identity using a permutation testing procedure. Overall, we identified 876 CNVRs, which spanned 3.5 % of the apple genome. To verify that detected CNVRs were not artifacts, we analyzed the B- allele-frequencies (BAF) within a single nucleotide polymorphism (SNP) array dataset derived from a screening of 185 individual apple accessions and found the CNVRs were enriched for SNPs having aberrant BAFs (P apple scab. We present the first analysis and catalogue of CNVRs in the M. x domestica genome. The enrichment of the CNVRs with R gene models and their overlap with gene loci of agricultural significance draw attention to a form of unexplored genetic variation in apple. This research will underpin further investigation of the role that CNV plays within the apple genome.
Wu, Jing; Zhu, Jifeng; Wang, Lanfen; Wang, Shumin
Nucleotide-binding site and leucine-rich repeat (NBS-LRR) genes represent the largest and most important disease resistance genes in plants. The genome sequence of the common bean ( Phaseolus vulgaris L.) provides valuable data for determining the genomic organization of NBS-LRR genes. However, data on the NBS-LRR genes in the common bean are limited. In total, 178 NBS-LRR-type genes and 145 partial genes (with or without a NBS) located on 11 common bean chromosomes were identified from genome sequences database. Furthermore, 30 NBS-LRR genes were classified into Toll/interleukin-1 receptor (TIR)-NBS-LRR (TNL) types, and 148 NBS-LRR genes were classified into coiled-coil (CC)-NBS-LRR (CNL) types. Moreover, the phylogenetic tree supported the division of these PvNBS genes into two obvious groups, TNL types and CNL types. We also built expression profiles of NBS genes in response to anthracnose and common bacterial blight using qRT-PCR. Finally, we detected nine disease resistance loci for anthracnose (ANT) and seven for common bacterial blight (CBB) using the developed NBS-SSR markers. Among these loci, NSSR24, NSSR73, and NSSR265 may be located at new regions for ANT resistance, while NSSR65 and NSSR260 may be located at new regions for CBB resistance. Furthermore, we validated NSSR24, NSSR65, NSSR73, NSSR260, and NSSR265 using a new natural population. Our results provide useful information regarding the function of the NBS-LRR proteins and will accelerate the functional genomics and evolutionary studies of NBS-LRR genes in food legumes. NBS-SSR markers represent a wide-reaching resource for molecular breeding in the common bean and other food legumes. Collectively, our results should be of broad interest to bean scientists and breeders.
Grace, Christy Rani R; Perrin, Marilyn H; Cantle, Jeffrey P; Vale, Wylie W; Rivier, Jean E; Riek, Roland
Members of the corticoliberin family include the corticotropin releasing factors (CRFs), sauvagine, the urotensins, and urocortin 1 (Ucn1), which bind to both the CRF receptors CRF-R1 and CRF-R2, and the urocortins 2 (Ucn2) and 3 (Ucn3), which are selective agonists of CRF-R2. Structure activity relationship studies led to several potent and long-acting analogues with selective binding to either one of the receptors. NMR structures of six ligands of this family (the antagonists astressin B and astressin2-B, the agonists stressin1, and the natural ligands human Ucn1, Ucn2, and Ucn3) were determined in DMSO. These six peptides show differences in binding affinities, receptor-selectivity, and NMR structure. Overall, their backbones are alpha-helical, with a small kink or a turn around residues 25-27, resulting in a helix-loop-helix motif. The C-terminal helices are of amphipathic nature, whereas the N-terminal helices vary in their amphipathicity. The C-terminal helices thereby assume a conformation very similar to that of astressin bound to the ECD1 of CRF-R2 recently reported by our group.1 On the basis of an analysis of the observed 3D structures and relative potencies of [Ala]-substituted analogues, it is proposed that both helices could play a crucial role in receptor binding and selectivity. In conclusion, the C-terminal helices may interact along their hydrophobic faces with the ECD1, whereas the entire N-terminal helical surface may be involved in receptor activation. On the basis of the common and divergent features observed in the 3D structures of these ligands, multiple binding models are proposed that may explain their plurality of actions.
Paul M Armistead
Full Text Available Minor histocompatibility antigens (mHA mediate much of the graft vs. leukemia (GvL effect and graft vs. host disease (GvHD in patients who undergo allogeneic stem cell transplantation (SCT. Therapeutic decision making and treatments based upon mHAs will require the evaluation of multiple candidate mHAs and the selection of those with the potential to have the greatest impact on clinical outcomes. We hypothesized that common, immunodominant mHAs, which are presented by HLA-A, B, and C molecules, can mediate clinically significant GvL and/or GvHD, and that these mHAs can be identified through association of genomic data with clinical outcomes.Because most mHAs result from donor/recipient cSNP disparities, we genotyped 57 myeloid leukemia patients and their donors at 13,917 cSNPs. We correlated the frequency of genetically predicted mHA disparities with clinical evidence of an immune response and then computationally screened all peptides mapping to the highly associated cSNPs for their ability to bind to HLA molecules. As proof-of-concept, we analyzed one predicted antigen, T4A, whose mHA mismatch trended towards improved overall and disease free survival in our cohort. T4A mHA mismatches occurred at the maximum theoretical frequency for any given SCT. T4A-specific CD8+ T lymphocytes (CTLs were detected in 3 of 4 evaluable post-transplant patients predicted to have a T4A mismatch.Our method is the first to combine clinical outcomes data with genomics and bioinformatics methods to predict and confirm a mHA. Refinement of this method should enable the discovery of clinically relevant mHAs in the majority of transplant patients and possibly lead to novel immunotherapeutics.
Full Text Available BACKGROUND: The chicken is an important agricultural and avian-model species. A survey of gene expression in a range of different tissues will provide a benchmark for understanding expression levels under normal physiological conditions in birds. With expression data for birds being very scant, this benchmark is of particular interest for comparative expression analysis among various terrestrial vertebrates. METHODOLOGY/PRINCIPAL FINDINGS: We carried out a gene expression survey in eight major chicken tissues using whole genome microarrays. A global picture of gene expression is presented for the eight tissues, and tissue specific as well as common gene expression were identified. A Gene Ontology (GO term enrichment analysis showed that tissue-specific genes are enriched with GO terms reflecting the physiological functions of the specific tissue, and housekeeping genes are enriched with GO terms related to essential biological functions. Comparisons of structural genomic features between tissue-specific genes and housekeeping genes show that housekeeping genes are more compact. Specifically, coding sequence and particularly introns are shorter than genes that display more variation in expression between tissues, and in addition intergenic space was also shorter. Meanwhile, housekeeping genes are more likely to co-localize with other abundantly or highly expressed genes on the same chromosomal regions. Furthermore, comparisons of gene expression in a panel of five common tissues between birds, mammals and amphibians showed that the expression patterns across tissues are highly similar for orthologous genes compared to random gene pairs within each pair-wise comparison, indicating a high degree of functional conservation in gene expression among terrestrial vertebrates. CONCLUSIONS: The housekeeping genes identified in this study have shorter gene length, shorter coding sequence length, shorter introns, and shorter intergenic regions, there seems
Harowicz, Michael R.; Marks, Jeffrey R.; Marcom, P. Kelly; Mazurowski, Maciej A.
Medical oncologists increasingly rely on expensive genomic analysis to stratify patients for different treatment. The genomic markers are able to divide patients into groups that behave differently in terms of tumor presentation, likelihood of metastatic spread, and response to chemotherapy and radiation therapy. In recent years there has been a rapid increase in the number of genomic tests available, like the Oncotype DX test, which provides the risk of cancer recurrence for a subset of patients. Radiogenomics, a new field that investigates the relationship between imaging phenotypes and genomic characteristics, may offer a less expensive and less invasive imaging surrogate for molecular subtype and Oncotype DX recurrence score (ODRS). This retrospective study analyzes the relationship between Breast Imaging-Reporting and Data System (BI-RADS) features as assessed by radiologists on mammograms with molecular subtype and ODRS. We used data from patients with BI-RADS features (shape or margin) and a genomic feature (subtype or ODRS) for the following cohort: shape vs. subtype (n=69), margin vs. subtype (n=78), shape vs. ODRS (n=20), and margin vs. ODRS (n=18). The association between features was assessed using a Fisher's exact test. Our results show that shape assessed by radiologists according to the BI-RADS lexicon is associated with molecular subtype (p=0.0171), while BI-RADS features of shape and margin were not significantly associated with ODRS (p=0.7839, p=0.6047 respectively).
Michael J Considine
Full Text Available Polyploidization results in genome duplication and is an important step in evolution and speciation. The Malus genome confirmed that this genus was derived through auto-polyploidization, yet the genetic and meiotic mechanisms for polyploidization, particularly for aneuploidization, are unclear in this genus or other woody perennials. In fact the contribution of aneuploidization remains poorly understood throughout Plantae. We add to this knowledge by characterization of eupolyploidization and aneuploidization in 27,542 F₁ seedlings from seven diploid Malus populations using cytology and microsatellite markers. We provide the first evidence that aneuploidy exceeds eupolyploidy in the diploid crosses, suggesting aneuploidization is a leading cause of genome duplication. Gametes from diploid Malus had a unique combinational pattern; ova preserved euploidy exclusively, while spermatozoa presented both euploidy and aneuploidy. All non-reduced gametes were genetically heterozygous, indicating first-division restitution was the exclusive mode for Malus eupolyploidization and aneuploidization. Chromosome segregation pattern among aneuploids was non-uniform, however, certain chromosomes were associated for aneuploidization. This study is the first to provide molecular evidence for the contribution of heterozygous non-reduced gametes to fitness in polyploids and aneuploids. Aneuploidization can increase, while eupolyploidization may decrease genetic diversity in their newly established populations. Auto-triploidization is important for speciation in the extant Malus. The features of Malus polyploidization confer genetic stability and diversity, and present heterozygosity, heterosis and adaptability for evolutionary selection. A protocol using co-dominant markers was proposed for accelerating apple triploid breeding program. A path was postulated for evolution of numerically odd basic chromosomes. The model for Malus derivation was considerably revised
Considine, Michael J; Wan, Yizhen; D'Antuono, Mario F; Zhou, Qian; Han, Mingyu; Gao, Hua; Wang, Man
Polyploidization results in genome duplication and is an important step in evolution and speciation. The Malus genome confirmed that this genus was derived through auto-polyploidization, yet the genetic and meiotic mechanisms for polyploidization, particularly for aneuploidization, are unclear in this genus or other woody perennials. In fact the contribution of aneuploidization remains poorly understood throughout Plantae. We add to this knowledge by characterization of eupolyploidization and aneuploidization in 27,542 F₁ seedlings from seven diploid Malus populations using cytology and microsatellite markers. We provide the first evidence that aneuploidy exceeds eupolyploidy in the diploid crosses, suggesting aneuploidization is a leading cause of genome duplication. Gametes from diploid Malus had a unique combinational pattern; ova preserved euploidy exclusively, while spermatozoa presented both euploidy and aneuploidy. All non-reduced gametes were genetically heterozygous, indicating first-division restitution was the exclusive mode for Malus eupolyploidization and aneuploidization. Chromosome segregation pattern among aneuploids was non-uniform, however, certain chromosomes were associated for aneuploidization. This study is the first to provide molecular evidence for the contribution of heterozygous non-reduced gametes to fitness in polyploids and aneuploids. Aneuploidization can increase, while eupolyploidization may decrease genetic diversity in their newly established populations. Auto-triploidization is important for speciation in the extant Malus. The features of Malus polyploidization confer genetic stability and diversity, and present heterozygosity, heterosis and adaptability for evolutionary selection. A protocol using co-dominant markers was proposed for accelerating apple triploid breeding program. A path was postulated for evolution of numerically odd basic chromosomes. The model for Malus derivation was considerably revised. Impacts of
Full Text Available Lung adenocarcinoma (LUAD is the most common histologic subtype of lung cancer and has a high risk of distant metastasis at every disease stage. We aimed to characterize the genomic landscape of LUAD and identify mutation signatures associated with tumor progression.We performed an integrative genomic analysis, incorporating whole exome sequencing (WES, determination of DNA copy number and DNA methylation, and transcriptome sequencing for 101 LUAD samples from the Environment And Genetics in Lung cancer Etiology (EAGLE study. We detected driver genes by testing whether the nonsynonymous mutation rate was significantly higher than the background mutation rate and replicated our findings in public datasets with 724 samples. We performed subclonality analysis for mutations based on mutant allele data and copy number alteration data. We also tested the association between mutation signatures and clinical outcomes, including distant metastasis, survival, and tumor grade. We identified and replicated two novel candidate driver genes, POU class 4 homeobox 2 (POU4F2 (mutated in 9 [8.9%] samples and ZKSCAN1 (mutated in 6 [5.9%] samples, and characterized their major deleterious mutations. ZKSCAN1 was part of a mutually exclusive gene set that included the RTK/RAS/RAF pathway genes BRAF, EGFR, KRAS, MET, and NF1, indicating an important driver role for this gene. Moreover, we observed strong associations between methylation in specific genomic regions and somatic mutation patterns. In the tumor evolution analysis, four driver genes had a significantly lower fraction of subclonal mutations (FSM, including TP53 (p = 0.007, KEAP1 (p = 0.012, STK11 (p = 0.0076, and EGFR (p = 0.0078, suggesting a tumor initiation role for these genes. Subclonal mutations were significantly enriched in APOBEC-related signatures (p < 2.5×10-50. The total number of somatic mutations (p = 0.0039 and the fraction of transitions (p = 5.5×10-4 were associated with increased risk of
Grossmann, Patrick; Gutman, David A.; Dunn, William D. Jr; Holder, Chad A.; Aerts, Hugo J. W. L.
Glioblastoma (GBM) tumors exhibit strong phenotypic differences that can be quantified using magnetic resonance imaging (MRI), but the underlying biological drivers of these imaging phenotypes remain largely unknown. An Imaging-Genomics analysis was performed to reveal the mechanistic associations between MRI derived quantitative volumetric tumor phenotype features and molecular pathways. One hundred fourty one patients with presurgery MRI and survival data were included in our analysis. Volumetric features were defined, including the necrotic core (NE), contrast-enhancement (CE), abnormal tumor volume assessed by post-contrast T1w (tumor bulk or TB), tumor-associated edema based on T2-FLAIR (ED), and total tumor volume (TV), as well as ratios of these tumor components. Based on gene expression where available (n = 91), pathway associations were assessed using a preranked gene set enrichment analysis. These results were put into context of molecular subtypes in GBM and prognostication. Volumetric features were significantly associated with diverse sets of biological processes (FDR < 0.05). While NE and TB were enriched for immune response pathways and apoptosis, CE was associated with signal transduction and protein folding processes. ED was mainly enriched for homeostasis and cell cycling pathways. ED was also the strongest predictor of molecular GBM subtypes (AUC = 0.61). CE was the strongest predictor of overall survival (C-index = 0.6; Noether test, p = 4x10 −4 ). GBM volumetric features extracted from MRI are significantly enriched for information about the biological state of a tumor that impacts patient outcomes. Clinical decision-support systems could exploit this information to develop personalized treatment strategies on the basis of noninvasive imaging. The online version of this article (doi:10.1186/s12885-016-2659-5) contains supplementary material, which is available to authorized users
Wei, Guifang; Pan, Li; Du, Huimin; Chen, Junyi; Zhao, Liping
Bacterial populations common to healthy human guts may play important roles in human health. A new strategy for discovering genomic sequences as markers for these bacteria was developed using Enterobacterial Repetitive Intergenic Consensus (ERIC)-PCR fingerprinting. Structural features within microbial communities are compared with ERIC-PCR followed by DNA hybridization to identify genomic fragments shared by samples from healthy human individuals. ERIC-PCR profiles of fecal samples from 12 diseased or healthy human and piglet subjects demonstrated stable, unique banding patterns for each individual tested. Sequence homology of DNA fragments in bands of identical size was examined between samples by hybridization under high stringency conditions with DIG-labeled ERIC-PCR products derived from the fecal sample of one healthy child. Comparative analysis of the hybridization profiles with the original agarose fingerprints identified three predominant bands as signatures for populations associated with healthy human guts with sizes of 500, 800 and 1000 bp. Clone library profiling of the three bands produced 17 genome fragments, three of which showed high similarity only with regions of the Bacteroides thetaiotaomicron genome, while the remainder were orphan sequences. Association of these sequences with healthy guts was validated by sequence-selective PCR experiments, which showed that a single fragment was present in all 32 healthy humans and 13 healthy piglets tested. Two fragments were present in the healthy human group and in 18 children with non-infectious diarrhea but not in eight children with infectious diarrhea. Genome fragments identified with this novel strategy may be used as genome-specific markers for dynamic monitoring and sequence-guided isolation of functionally important bacterial populations in complex communities such as human gut microflora.
Henderickx, David; Maetens, Kathleen; Soetens, Eric
Briand (J Exp Psychol Hum Percept Perform 24:1243-1256, 1998) and Briand and Klein (J Exp Psychol Hum Percept Perform 13:228-241, 1987) demonstrated that spatial cueing effects are larger for detecting conjunction of features than for detecting simple features when spatial attention is oriented exogenously, and not when attention is oriented endogenously. Their results were interpreted as if only exogenous attention affects the posterior spatial attention system that performs the feature binding function attributed to spatial attention by Treisman's feature integration theory (FIT; 1980). In a series of 6 experiments, we attempted to replicate Briand's findings. Manipulations of distractor string size and symmetry of stimulus presentation left and right from fixation were implemented in Posner's cueing paradigm. The data indicate that both exogenous and endogenous cueing address the same attentional mechanism needed for feature binding. The results also limit the generalisability of Briand's proposal concerning the role of exogenous attention in feature integration. Furthermore, the importance to control the effect of unintended attentional capture in a cueing task is demonstrated.
A.R. Wood (Andrew); T. Esko (Tõnu); J. Yang (Jian); S. Vedantam (Sailaja); T.H. Pers (Tune); S. Gustafsson (Stefan); A.Y. Chu (Audrey Y); K. Estrada Gil (Karol); J. Luan; Z. Kutalik; N. Amin (Najaf); M.L. Buchkovich (Martin); D.C. Croteau-Chonka (Damien); F.R. Day (Felix); Y. Duan (Yanan); M. Fall (Magnus); R.S.N. Fehrmann (Rudolf); T. Ferreira (Teresa); A.U. Jackson (Anne); J. Karjalainen (Juha); K.S. Lo (Ken Sin); A. Locke (Adam); R. Mägi (Reedik); E. Mihailov (Evelin); E. Porcu (Eleonora); J.C. Randall (Joshua); A. Scherag (Andre); A.A.E. Vinkhuyzen (Anna A.); H.J. Westra (Harm-Jan); T.W. Winkler (Thomas W.); T. Workalemahu (Tsegaselassie); J.H. Zhao (Jing Hua); D. Absher (Devin); E. Albrecht (Eva); J. Baron (Jeffrey); M. Beekman (Marian); A. Demirkan (Ayşe); G.B. Ehret (Georg); B. Feenstra; M.F. Feitosa (Mary Furlan); K. Fischer (Krista); R.M. Fraser (Ross); A. Goel (Anuj); J. Gong (Jian); A.E. Justice (Anne); S. Kanoni (Stavroula); M.E. Kleber (Marcus); K. Kristiansson (Kati); U. Lim (Unhee); V. Lotay (Vaneet); J.C. Lui (Julian C); M. Mangino (Massimo); I.M. Leach (Irene Mateo); M.C. Medina-Gomez (Carolina); M.A. Nalls (Michael); A.S. Dimas (Antigone); C. Palmer (Cameron); D. Pasko (Dorota); S. Pechlivanis (Sonali); I. Prokopenko (Inga); J.S. Ried (Janina); S. Ripke (Stephan); D. Shungin (Dmitry); A. Stancáková (Alena); R.J. Strawbridge (Rona); Y.J. Sung (Yun Ju); T. Tanaka (Toshiko); A. Teumer (Alexander); S. Trompet (Stella); S.W. Van Der Laan (Sander W.); J. van Setten (Jessica); J.V. van Vliet-Ostaptchouk (Jana); Z. Wang (Zhaoming); L. Yengo (Loic); W. Zhang (Weihua); U. Afzal (Uzma); J. Ärnlöv (Johan); G.M. Arscott (Gillian M.); S. Bandinelli (Stefania); A. Barrett (Angela); C. Bellis (Claire); A.J. Bennett (Amanda); C. Berne (Christian); M. Blüher (Matthias); J.L. Bolton (Jennifer); Y. Böttcher (Yvonne); H.A. Boyd; M. Bruinenberg (M.); B.M. Buckley (Brendan M.); S. Buyske (Steven); I.H. Caspersen (Ida H.); P.S. Chines (Peter); R. Clarke (Robert); S. Claudi-Boehm (Simone); M.N. Cooper (Matthew); E.W. Daw (E Warwick); P.A. De Jong (Pim A); J. Deelen (Joris); G. Delgado; J.C. Denny (Josh C); R.A.M. Dhonukshe-Rutten (Rosalie); M. Dimitriou (Maria); A.S.F. Doney (Alex); M. Dörr (Marcus); N. Eklund (Niina); E. Eury (Elodie); L. Folkersen (Lasse); M. Garcia (Melissa); F. Geller (Frank); V. Giedraitis (Vilmantas); A. Go (Attie); H. Grallert (Harald); T.B. Grammer (Tanja B); J. Gräßler (Jürgen); H. Grönberg (Henrik); L.C.P.G.M. de Groot (Lisette); C.J. Groves (Christopher J.); J. Haessler (Jeff); P. Hall (Per); T. Haller (Toomas); G. Hallmans (Göran); M. Hannemann (Mario); C.A. Hartman (Catharina); M. Hassinen (Maija); C. Hayward (Caroline); N.L. Heard-Costa (Nancy); Q. Helmer (Quinta); G. Hemani; A.K. Henders (Anjali); H.L. Hillege (Hans); M.A. Hlatky (Mark); W. Hoffmann (Wolfgang); P. Hoffmann (Per); O.L. Holmen (Oddgeir); J.J. Houwing-Duistermaat (Jeanine); T. Illig (Thomas); A. Isaacs (Aaron); A.L. James (Alan); J. Jeff (Janina); B. Johansen (Berit); A. Johansson (Åsa); G.J. Jolley (Jason); T. Juliusdottir (Thorhildur); M.J. Junttila (Juhani); M.M.L. Kho (Marcia); L. Kinnunen (Leena); N. Klopp (Norman); T. Kocher; W. Kratzer (Wolfgang); P. Lichtner (Peter); L. Lind (Lars); J. Lindström (Jaana); S. Lobbens (Stéphane); M. Lorentzon (Mattias); Y. Lu (Yingchang); V. Lyssenko (Valeriya); P.K. Magnusson (Patrik); A. Mahajan (Anubha); M. Maillard (Marc); W.L. McArdle (Wendy); C.A. McKenzie (Colin A.); S. McLachlan (Stela); P.J. McLaren (Paul J); C. Menni (Cristina); S. Merger (Sigrun); L. Milani (Lili); A. Moayyeri (Alireza); K.L. Monda (Keri); M.A. Morken (Mario); G. Müller (Gabriele); M. Müller-Nurasyid (Martina); A.W. Musk (Arthur); N. Narisu (Narisu); M. Nauck (Matthias); I.M. Nolte (Ilja M.); M.M. Nöthen (Markus); L. Oozageer (Laticia); S. Pilz (Stefan); N.W. Rayner (Nigel William); F. Renström (Frida); N.R. Robertson (Neil R.); L.M. Rose (Lynda M.); R. Roussel (Ronan); S. Sanna (Serena); H. Scharnagl (Hubert); S. Scholtens (Salome); F.R. Schumacher (Fredrick R); H. Schunkert (Heribert); R.A. Scott (Robert); J.S. Sehmi (Joban); T. Seufferlein (Thomas); J. Shi (Jianxin); K. Silventoinen (Karri); J.H. Smit (Johannes); G.D. Smith; J. Smolonska (Joanna); A. Stanton (Alice); K. Stirrups (Kathy); D.J. Stott (David J); H.M. Stringham (Heather); J. Sundstrom (Johan); M. Swertz (Morris); A.C. Syvanen; B. Tayo (Bamidele); G. Thorleifsson (Gudmar); J.P. Tyrer (Jonathan); S. Van Dijk (Suzanne); N.M. van Schoor (Natasja); N. van der Velde (Nathalie); D. van Heemst (Diana); F.V.A. Van Oort (Floor V A); S.H.H.M. Vermeulen (Sita); N. Verweij (Niek); J.M. Vonk (Judith M); L. Waite (Lindsay); M. Waldenberger (Melanie); R. Wennauer (Roman); L.R. Wilkens (Lynne R.); C. Willenborg (Christina); T. Wilsgaard (Tom); M.K. Wojczynski (Mary ); A. Wong (Andrew); A. Wright (Alan); Q. Zhang (Qunyuan); D. Arveiler (Dominique); S.J.L. Bakker (Stephan); J. Beilby (John); R.N. Bergman (Richard); S.M. Bergmann (Sven); R. Biffar; J. Blangero (John); D.I. Boomsma (Dorret); S.R. Bornstein (Stefan R.); P. Bovet (Pascal); P. Brambilla (Paolo); M.J. Brown (Morris); H. Campbell (Harry); M. Caulfield (Mark); A. Chakravarti (Aravinda); F.S. Collins (Francis); D.C. Crawford (Dana); L.A. Cupples (Adrienne); J. Danesh (John); U. de Faire (Ulf); H.M. den Ruijter (Hester ); R. Erbel (Raimund); J. Erdmann (Jeanette); J. Eriksson; M. Farrall (Martin); E. Ferrannini (Ele); J. Ferrieres (Jean); I. Ford; N.G. Forouhi (Nita); T. Forrester (Terrence); R.T. Gansevoort (Ron); P.V. Gejman (Pablo); C. Gieger (Christian); A. Golay (Alain); R.F. Gottesman (Rebecca); V. Gudnason (Vilmundur); U. Gyllensten (Ulf); D.W. Haas (David W); A.S. Hall (Alistair); T.B. Harris (Tamara); A.T. Hattersley (Andrew); A.C. Heath (Andrew C); C. Hengstenberg (Christian); A.A. Hicks (Andrew); L.A. Hindorff (Lucia A); A. Hingorani (Aroon); A. Hofman (Albert); G.K. Hovingh (Kees); S.E. Humphries (Steve E.); S.C. Hunt (Steven); E. Hypponen (Elina); K.B. Jacobs (Kevin); M.-R. Jarvelin (Marjo-Riitta); P. Jousilahti (Pekka); A. Jula (Antti); J. Kaprio (Jaakko); J.J.P. Kastelein (John); M.H. Kayser (Manfred); F. Kee (Frank); S. Keinanen-Kiukaanniemi (Sirkka); L.A.L.M. Kiemeney (Bart); J.S. Kooner (Jaspal S.); C. Kooperberg (Charles); S. Koskinen (Seppo); P. Kovacs (Peter); A. Kraja (Aldi); M. Kumari (Meena); J. Kuusisto (Johanna); T.A. Lakka (Timo); C. Langenberg (Claudia); L. Le Marchand (Loic); T. Lehtimäki (Terho); S. Lupoli (Sara); P.A. Madden; S. Männistö (Satu); P. Manunta (Paolo); A. Marette (Andre'); T.C. Matise (Tara C.); B. McKnight (Barbara); T. Meitinger (Thomas); F.L. Moll (Frans); G.W. Montgomery (Grant W.); A.D. Morris (Andrew); A.P. Morris (Andrew); J.C. Murray (Jeffrey); M. Nelis (Mari); C. Ohlsson (Claes); A.J. Oldehinkel (Albertine); K.K. Ong (Ken K.); W.H. Ouwehand (Willem); G. Pasterkamp (Gerard); A. Peters (Annette); P.P. Pramstaller (Peter Paul); J.F. Price (Jackie F.); L. Qi (Lu); O. Raitakari (Olli); T. Rankinen (Tuomo); D.C. Rao (Dabeeru C.); T.K. Rice (Treva K.); M.D. Ritchie (Marylyn D.); I. Rudan (Igor); V. Salomaa (Veikko); N.J. Samani (Nilesh); J. Saramies (Jouko); M.A. Sarzynski (Mark A.); P.E.H. Schwarz (Peter E. H.); S. Sebert (Sylvain); P. Sever (Peter); A.R. Shuldiner (Alan); J. Sinisalo (Juha); V. Steinthorsdottir (Valgerdur); R.P. Stolk; J.-C. Tardif (Jean-Claude); A. Tönjes (Anke); A. Tremblay (Angelo); E. Tremoli (Elena); J. Virtamo (Jarmo); M.-C. Vohl (Marie-Claude); P. Amouyel (Philippe); F.W. Asselbergs (Folkert W.); T.L. Assimes (Themistocles); M. Bochud (Murielle); B.O. Boehm (Bernhard); E.A. Boerwinkle (Eric); E.P. Bottinger (Erwin P.); C. Bouchard (Claude); S. Cauchi (Stéphane); J.C. Chambers (John C.); S.J. Chanock (Stephen); R.S. Cooper (Richard S.); P.I.W. de Bakker (Paul); G.V. Dedoussis (George); L. Ferrucci (Luigi); P.W. Franks; P. Froguel (Philippe); L. Groop (Leif); C.A. Haiman (Christopher); A. Hamsten (Anders); M.G. Hayes (M. Geoffrey); J. Hui (Jennie); D. Hunter (David); K. Hveem (Kristian); J.W. Jukema (Jan Wouter); R.C. Kaplan (Robert); M. Kivimaki (Mika); D. Kuh (Diana); M. Laakso (Markku); Y. Liu (YongMei); N.G. Martin (Nicholas); W. März (Winfried); M. Melbye (Mads); S. Moebus (Susanne); P. Munroe (Patricia); I. Njølstad (Inger); B.A. Oostra (Ben); C.N.A. Palmer (Colin); N.L. Pedersen (Nancy L.); M. Perola (Markus); L. Perusse (Louis); U. Peters (Ulrike); J.E. Powell (Joseph); C. Power (Christine); T. Quertermous (Thomas); R. Rauramaa (Rainer); E. Reinmaa (Eva); P.M. Ridker (Paul); F. Rivadeneira Ramirez (Fernando); J.I. Rotter (Jerome I.); T. Saaristo (Timo); D. Saleheen; D. Schlessinger (David); P.E. Slagboom (P Eline); H. Snieder (Harold); T.D. Spector (Timothy); K. Strauch (Konstantin); M. Stumvoll (Michael); J. Tuomilehto (Jaakko); M. Uusitupa (Matti); P. van der Harst (Pim); H. Völzke (Henry); M. Walker (Mark); N.J. Wareham (Nick); H. Watkins (Hugh); H.E. Wichmann (Heinz Erich); J.F. Wilson (James F); P. Zanen (Pieter); P. Deloukas (Panagiotis); I.M. Heid (Iris); C.M. Lindgren (Cecilia); K.L. Mohlke (Karen); E.K. Speliotes (Elizabeth); U. Thorsteinsdottir (Unnur); I.E. Barroso (Inês); C.S. Fox (Caroline S.); K.E. North (Kari); D.P. Strachan (David P.); J.S. Beckmann (Jacques); S.I. Berndt (Sonja); M. Boehnke (Michael); I.B. Borecki (Ingrid); M.I. McCarthy (Mark); A. Metspalu (Andres); J-A. Zwart (John-Anker); A.G. Uitterlinden (André); C.M. van Duijn (Cornelia); L. Franke (Lude); C.J. Willer (Cristen); A. Price (Alkes); G. Lettre (Guillaume); R.J.F. Loos (Ruth); M.N. Weedon (Michael); E. Ingelsson (Erik); J.R. O´Connell; G.R. Abecasis (Gonçalo); D.I. Chasman (Daniel); D. Anderson (Denise); M.E. Goddard (Michael); P.M. Visscher (Peter); J.N. Hirschhorn (Joel); T.M. Frayling (Timothy)
textabstractUsing genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated
Lund, Mogens Sandø; de Ross, Sander PW; de Vries, Alfred G
Background Size of the reference population and reliability of phenotypes are crucial factors influencing the reliability of genomic predictions. It is therefore useful to combine closely related populations. Increased accuracies of genomic predictions depend on the number of individuals added to...
Wood, Andrew R.; Esko, Tonu; Yang, Jian; Vedantam, Sailaja; Pers, Tune H.; Gustafsson, Stefan; Chu, Audrey Y.; Estrada, Karol; Luan, Jian'an; Kutalik, Zoltán; Amin, Najaf; Buchkovich, Martin L.; Croteau-Chonka, Damien C.; Day, Felix R.; Duan, Yanan; Fall, Tove; Fehrmann, Rudolf; Ferreira, Teresa; Jackson, Anne U.; Karjalainen, Juha; Lo, Ken Sin; Locke, Adam E.; Mägi, Reedik; Mihailov, Evelin; Porcu, Eleonora; Randall, Joshua C.; Scherag, André; Vinkhuyzen, Anna A. E.; Westra, Harm-Jan; Winkler, Thomas W.; Workalemahu, Tsegaselassie; Zhao, Jing Hua; Absher, Devin; Albrecht, Eva; Anderson, Denise; Baron, Jeffrey; Beekman, Marian; Demirkan, Ayse; Ehret, Georg B.; Feenstra, Bjarke; Feitosa, Mary F.; Fischer, Krista; Fraser, Ross M.; Goel, Anuj; Gong, Jian; Justice, Anne E.; Kanoni, Stavroula; Kleber, Marcus E.; Kristiansson, Kati; Lim, Unhee; Lotay, Vaneet; Lui, Julian C.; Mangino, Massimo; Mateo Leach, Irene; Medina-Gomez, Carolina; Nalls, Michael A.; Nyholt, Dale R.; Palmer, Cameron D.; Pasko, Dorota; Pechlivanis, Sonali; Prokopenko, Inga; Ried, Janina S.; Ripke, Stephan; Shungin, Dmitry; Stancáková, Alena; Strawbridge, Rona J.; Sung, Yun Ju; Tanaka, Toshiko; Teumer, Alexander; Trompet, Stella; van der Laan, Sander W.; van Setten, Jessica; van Vliet-Ostaptchouk, Jana V.; Wang, Zhaoming; Yengo, Loïc; Zhang, Weihua; Afzal, Uzma; Arnlöv, Johan; Arscott, Gillian M.; Bandinelli, Stefania; Barrett, Amy; Bellis, Claire; Bennett, Amanda J.; Berne, Christian; Blüher, Matthias; Bolton, Jennifer L.; Böttcher, Yvonne; Boyd, Heather A.; Bruinenberg, Marcel; Buckley, Brendan M.; Buyske, Steven; Caspersen, Ida H.; Chines, Peter S.; Clarke, Robert; Claudi-Boehm, Simone; Cooper, Matthew; Daw, E. Warwick; de Jong, Pim A.; Deelen, Joris; Delgado, Graciela; Denny, Josh C.; Dhonukshe-Rutten, Rosalie; Dimitriou, Maria; Doney, Alex S. F.; Dörr, Marcus; Eklund, Niina; Eury, Elodie; Folkersen, Lasse; Garcia, Melissa E.; Geller, Frank; Giedraitis, Vilmantas; Go, Alan S.; Grallert, Harald; Grammer, Tanja B.; Gräßler, Jürgen; Grönberg, Henrik; de Groot, Lisette C. P. G. M.; Groves, Christopher J.; Haessler, Jeffrey; Hall, Per; Haller, Toomas; Hallmans, Goran; Hannemann, Anke; Hartman, Catharina A.; Hassinen, Maija; Hayward, Caroline; Heard-Costa, Nancy L.; Helmer, Quinta; Hemani, Gibran; Henders, Anjali K.; Hillege, Hans L.; Hlatky, Mark A.; Hoffmann, Wolfgang; Hoffmann, Per; Holmen, Oddgeir; Houwing-Duistermaat, Jeanine J.; Illig, Thomas; Isaacs, Aaron; James, Alan L.; Jeff, Janina; Johansen, Berit; Johansson, Åsa; Jolley, Jennifer; Juliusdottir, Thorhildur; Junttila, Juhani; Kho, Abel N.; Kinnunen, Leena; Klopp, Norman; Kocher, Thomas; Kratzer, Wolfgang; Lichtner, Peter; Lind, Lars; Lindström, Jaana; Lobbens, Stéphane; Lorentzon, Mattias; Lu, Yingchang; Lyssenko, Valeriya; Magnusson, Patrik K. E.; Mahajan, Anubha; Maillard, Marc; McArdle, Wendy L.; McKenzie, Colin A.; McLachlan, Stela; McLaren, Paul J.; Menni, Cristina; Merger, Sigrun; Milani, Lili; Moayyeri, Alireza; Monda, Keri L.; Morken, Mario A.; Müller, Gabriele; Müller-Nurasyid, Martina; Musk, Arthur W.; Narisu, Narisu; Nauck, Matthias; Nolte, Ilja M.; Nöthen, Markus M.; Oozageer, Laticia; Pilz, Stefan; Rayner, Nigel W.; Renstrom, Frida; Robertson, Neil R.; Rose, Lynda M.; Roussel, Ronan; Sanna, Serena; Scharnagl, Hubert; Scholtens, Salome; Schumacher, Fredrick R.; Schunkert, Heribert; Scott, Robert A.; Sehmi, Joban; Seufferlein, Thomas; Shi, Jianxin; Silventoinen, Karri; Smit, Johannes H.; Smith, Albert Vernon; Smolonska, Joanna; Stanton, Alice V.; Stirrups, Kathleen; Stott, David J.; Stringham, Heather M.; Sundström, Johan; Swertz, Morris A.; Syvänen, Ann-Christine; Tayo, Bamidele O.; Thorleifsson, Gudmar; Tyrer, Jonathan P.; van Dijk, Suzanne; van Schoor, Natasja M.; van der Velde, Nathalie; van Heemst, Diana; van Oort, Floor V. A.; Vermeulen, Sita H.; Verweij, Niek; Vonk, Judith M.; Waite, Lindsay L.; Waldenberger, Melanie; Wennauer, Roman; Wilkens, Lynne R.; Willenborg, Christina; Wilsgaard, Tom; Wojczynski, Mary K.; Wong, Andrew; Wright, Alan F.; Zhang, Qunyuan; Arveiler, Dominique; Bakker, Stephan J. L.; Beilby, John; Bergman, Richard N.; Bergmann, Sven; Biffar, Reiner; Blangero, John; Boomsma, Dorret I.; Bornstein, Stefan R.; Bovet, Pascal; Brambilla, Paolo; Brown, Morris J.; Campbell, Harry; Caulfield, Mark J.; Chakravarti, Aravinda; Collins, Rory; Collins, Francis S.; Crawford, Dana C.; Cupples, L. Adrienne; Danesh, John; de Faire, Ulf; den Ruijter, Hester M.; Erbel, Raimund; Erdmann, Jeanette; Eriksson, Johan G.; Farrall, Martin; Ferrannini, Ele; Ferrières, Jean; Ford, Ian; Forouhi, Nita G.; Forrester, Terrence; Gansevoort, Ron T.; Gejman, Pablo V.; Gieger, Christian; Golay, Alain; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Haas, David W.; Hall, Alistair S.; Harris, Tamara B.; Hattersley, Andrew T.; Heath, Andrew C.; Hengstenberg, Christian; Hicks, Andrew A.; Hindorff, Lucia A.; Hingorani, Aroon D.; Hofman, Albert; Hovingh, G. Kees; Humphries, Steve E.; Hunt, Steven C.; Hypponen, Elina; Jacobs, Kevin B.; Jarvelin, Marjo-Riitta; Jousilahti, Pekka; Jula, Antti M.; Kaprio, Jaakko; Kastelein, John J. P.; Kayser, Manfred; Kee, Frank; Keinanen-Kiukaanniemi, Sirkka M.; Kiemeney, Lambertus A.; Kooner, Jaspal S.; Kooperberg, Charles; Koskinen, Seppo; Kovacs, Peter; Kraja, Aldi T.; Kumari, Meena; Kuusisto, Johanna; Lakka, Timo A.; Langenberg, Claudia; Le Marchand, Loic; Lehtimäki, Terho; Lupoli, Sara; Madden, Pamela A. F.; Männistö, Satu; Manunta, Paolo; Marette, André; Matise, Tara C.; McKnight, Barbara; Meitinger, Thomas; Moll, Frans L.; Montgomery, Grant W.; Morris, Andrew D.; Morris, Andrew P.; Murray, Jeffrey C.; Nelis, Mari; Ohlsson, Claes; Oldehinkel, Albertine J.; Ong, Ken K.; Ouwehand, Willem H.; Pasterkamp, Gerard; Peters, Annette; Pramstaller, Peter P.; Price, Jackie F.; Qi, Lu; Raitakari, Olli T.; Rankinen, Tuomo; Rao, D. C.; Rice, Treva K.; Ritchie, Marylyn; Rudan, Igor; Salomaa, Veikko; Samani, Nilesh J.; Saramies, Jouko; Sarzynski, Mark A.; Schwarz, Peter E. H.; Sebert, Sylvain; Sever, Peter; Shuldiner, Alan R.; Sinisalo, Juha; Steinthorsdottir, Valgerdur; Stolk, Ronald P.; Tardif, Jean-Claude; Tönjes, Anke; Tremblay, Angelo; Tremoli, Elena; Virtamo, Jarmo; Vohl, Marie-Claude; Amouyel, Philippe; Asselbergs, Folkert W.; Assimes, Themistocles L.; Bochud, Murielle; Boehm, Bernhard O.; Boerwinkle, Eric; Bottinger, Erwin P.; Bouchard, Claude; Cauchi, Stéphane; Chambers, John C.; Chanock, Stephen J.; Cooper, Richard S.; de Bakker, Paul I. W.; Dedoussis, George; Ferrucci, Luigi; Franks, Paul W.; Froguel, Philippe; Groop, Leif C.; Haiman, Christopher A.; Hamsten, Anders; Hayes, M. Geoffrey; Hui, Jennie; Hunter, David J.; Hveem, Kristian; Jukema, J. Wouter; Kaplan, Robert C.; Kivimaki, Mika; Kuh, Diana; Laakso, Markku; Liu, Yongmei; Martin, Nicholas G.; März, Winfried; Melbye, Mads; Moebus, Susanne; Munroe, Patricia B.; Njølstad, Inger; Oostra, Ben A.; Palmer, Colin N. A.; Pedersen, Nancy L.; Perola, Markus; Pérusse, Louis; Peters, Ulrike; Powell, Joseph E.; Power, Chris; Quertermous, Thomas; Rauramaa, Rainer; Reinmaa, Eva; Ridker, Paul M.; Rivadeneira, Fernando; Rotter, Jerome I.; Saaristo, Timo E.; Saleheen, Danish; Schlessinger, David; Slagboom, P. Eline; Snieder, Harold; Spector, Tim D.; Strauch, Konstantin; Stumvoll, Michael; Tuomilehto, Jaakko; Uusitupa, Matti; van der Harst, Pim; Völzke, Henry; Walker, Mark; Wareham, Nicholas J.; Watkins, Hugh; Wichmann, H.-Erich; Wilson, James F.; Zanen, Pieter; Deloukas, Panos; Heid, Iris M.; Lindgren, Cecilia M.; Mohlke, Karen L.; Speliotes, Elizabeth K.; Thorsteinsdottir, Unnur; Barroso, Inês; Fox, Caroline S.; North, Kari E.; Strachan, David P.; Beckmann, Jacques S.; Berndt, Sonja I.; Boehnke, Michael; Borecki, Ingrid B.; McCarthy, Mark I.; Metspalu, Andres; Stefansson, Kari; Uitterlinden, André G.; van Duijn, Cornelia M.; Franke, Lude; Willer, Cristen J.; Price, Alkes L.; Lettre, Guillaume; Loos, Ruth J. F.; Weedon, Michael N.; Ingelsson, Erik; O'Connell, Jeffrey R.; Abecasis, Goncalo R.; Chasman, Daniel I.; Goddard, Michael E.; Visscher, Peter M.; Hirschhorn, Joel N.; Frayling, Timothy M.; McCarty, Catherine A.; Starren, Justin; Peissig, Peggy; Berg, Richard; Rasmussen, Luke; Linneman, James; Miller, Aaron; Choudary, Vidhu; Chen, Lin; Waudby, Carol; Kitchner, Terrie; Reeser, Jonathan; Fost, Norman; Wilke, Russell A.; Chisholm, Rex L.; Avila, Pedro C.; Greenland, Philip; Hayes, M. Geoff; Kho, Abel; Kibbe, Warren A.; Lemke, Amy A.; Lowe, William L.; Smith, Maureen E.; Wolf, Wendy A.; Pacheco, Jennifer A.; Thompson, William K.; Humowiecki, Joel; Law, May; Chute, Christopher; Kullo, Iftikar; Koenig, Barbara; de Andrade, Mariza; Bielinski, Suzette; Pathak, Jyotishman; Savova, Guergana; Wu, Joel; Henriksen, Joan; Ding, Keyue; Hart, Lacey; Palbicki, Jeremy; Larson, Eric B.; Newton, Katherine; Ludman, Evette; Spangler, Leslie; Hart, Gene; Carrell, David; Jarvik, Gail; Crane, Paul; Burke, Wylie; Fullerton, Stephanie Malia; Trinidad, Susan Brown; Carlson, Chris; Hutchinson, Fred; McDavid, Andrew; Roden, Dan M.; Clayton, Ellen; Haines, Jonathan L.; Masys, Daniel R.; Churchill, Larry R.; Cornfield, Daniel; Crawford, Dana; Darbar, Dawood; Denny, Joshua C.; Malin, Bradley A.; Ritchie, Marylyn D.; Schildcrout, Jonathan S.; Xu, Hua; Ramirez, Andrea Havens; Basford, Melissa; Pulley, Jill; Alizadeh, Behrooz Z.; de Boer, Rudolf A.; Boezen, H. Marike; van der Klauw, Melanie M.; Navis, Gerjan; Ormel, Johan; Postma, Dirkje S.; Rosmalen, Judith G. M.; Slaets, Joris P.; Wolffenbuttel, Bruce H. R.; Wijmenga, Cisca; Kathiresan, Sekar; Voight, Benjamin F.; Purcell, Shaun; Musunuru, Kiran; Ardissino, Diego; Mannucci, Pier M.; Anand, Sonia; Engert, James C.; Reilly, Muredach P.; Rader, Daniel J.; Morgan, Thomas; Spertus, John A.; Stoll, Monika; Girelli, Domenico; McKeown, Pascal P.; Patterson, Chris C.; Siscovick, David S.; O'Donnell, Christopher J.; Elosua, Roberto; Peltonen, Leena; Schwartz, Stephen M.; Melander, Olle; Altshuler, David; Merlini, Pier Angelica; Berzuini, Carlo; Bernardinelli, Luisa; Peyvandi, Flora; Tubaro, Marco; Celli, Patrizia; Ferrario, Maurizio; Fetiveau, Raffaela; Marziliano, Nicola; Casari, Giorgio; Galli, Michele; Ribichini, Flavio; Rossi, Marco; Bernardi, Francesco; Zonzin, Pietro; Piazza, Alberto; Yee, Jean; Friedlander, Yechiel; Marrugat, Jaume; Lucas, Gavin; Subirana, Isaac; Sala, Joan; Ramos, Rafael; Meigs, James B.; Williams, Gordon; Nathan, David M.; MacRae, Calum A.; Havulinna, Aki S.; Berglund, Goran; Asselta, Rosanna; Duga, Stefano; Spreafico, Marta; Daly, Mark J.; Nemesh, James; Korn, Joshua M.; McCarroll, Steven A.; Surti, Aarti; Guiducci, Candace; Gianniny, Lauren; Mirel, Daniel; Parkin, Melissa; Burtt, Noel; Gabriel, Stacey B.; Thompson, John R.; Braund, Peter S.; Wright, Benjamin J.; Balmforth, Anthony J.; Ball, Stephen G.; Schunkert, I. Heribert; Linsel-Nitschke, Patrick; Lieb, Wolfgang; Ziegler, Andreas; König, Inke R.; Fischer, Marcus; Stark, Klaus; Grosshennig, Anika; Preuss, Michael; Schreiber, Stefan; Ouwehand, Willem; Scholz, Michael; Cambien, Francois; Goodall, Alison; Li, Mingyao; Chen, Zhen; Wilensky, Robert; Matthai, William; Qasim, Atif; Hakonarson, Hakon H.; Devaney, Joe; Burnett, Mary-Susan; Pichard, Augusto D.; Kent, Kenneth M.; Satler, Lowell; Lindsay, Joseph M.; Waksman, Ron; Knouff, Christopher W.; Waterworth, Dawn M.; Walker, Max C.; Mooser, Vincent; Epstein, Stephen E.; Scheffold, Thomas; Berger, Klaus; Huge, Andreas; Martinelli, Nicola; Olivieri, Oliviero; Corrocher, Roberto; Hólm, Hilma; Do, Ron; Xie, Changchun; Siscovick, David; Matise, Tara; Buyske, Steve; Higashio, Julia; Williams, Rasheeda; Nato, Andrew; Ambite, Jose Luis; Deelman, Ewa; Manolio, Teri; Hindorff, Lucia; Heiss, Gerardo; Taylor, Kira; Franceschini, Nora; Avery, Christy; Graff, Misa; Lin, Danyu; Quibrera, Miguel; Cochran, Barbara; Kao, Linda; Umans, Jason; Cole, Shelley; MacCluer, Jean; Person, Sharina; Pankow, James; Gross, Myron; Fornage, Myriam; Durda, Peter; Jenny, Nancy; Patsy, Bruce; Arnold, Alice; Buzkova, Petra; Haines, Jonathan; Murdock, Deborah; Glenn, Kim; Brown-Gentry, Kristin; Thornton-Wells, Tricia; Dumitrescu, Logan; Bush, William S.; Mitchell, Sabrina L.; Goodloe, Robert; Wilson, Sarah; Boston, Jonathan; Malinowski, Jennifer; Restrepo, Nicole; Oetjens, Matthew; Fowke, Jay; Zheng, Wei; Spencer, Kylee; Pendergrass, Sarah; Le Marchand, Loïc; Wilkens, Lynne; Park, Lani; Tiirikainen, Maarit; Kolonel, Laurence; Cheng, Iona; Wang, Hansong; Shohet, Ralph; Haiman, Christopher; Stram, Daniel; Henderson, Brian; Monroe, Kristine; Schumacher, Fredrick; Anderson, Garnet; Prentice, Ross; LaCroix, Andrea; Wu, Chunyuan; Carty, Cara; Rosse, Stephanie; Young, Alicia; Haessler, Jeff; Kocarnik, Jonathan; Lin, Yi; Jackson, Rebecca; Duggan, David; Kuller, Lew
Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ∼2,000, ∼3,700
Bastiaansen, J.W.M.; Bink, M.C.A.M.; Coster, A.; Maliepaard, C.A.; Calus, M.P.L.
Background - Genomic selection, the use of markers across the whole genome, receives increasing amounts of attention and is having more and more impact on breeding programs. Development of statistical and computational methods to estimate breeding values based on markers is a very active area of
Bouska, Alyssa; Bi, Chengfeng; Lone, Waseem; Zhang, Weiwei; Kedwaii, Ambreen; Heavican, Tayla; Lachel, Cynthia M; Yu, Jiayu; Ferro, Roberto; Eldorghamy, Nanees; Greiner, Timothy C; Vose, Julie; Weisenburger, Dennis D; Gascoyne, Randy D; Rosenwald, Andreas; Ott, German; Campo, Elias; Rimsza, Lisa M; Jaffe, Elaine S; Braziel, Rita M; Siebert, Reiner; Miles, Rodney R; Dave, Sandeep; Reddy, Anupama; Delabie, Jan; Staudt, Louis M; Song, Joo Y; McKeithan, Timothy W; Fu, Kai; Green, Michael; Chan, Wing C; Iqbal, Javeed
The adult high-grade B-cell lymphomas sharing molecular features with Burkitt lymphoma (BL) are highly aggressive lymphomas with poor clinical outcome. High-resolution structural and functional genomic analysis of adult Burkitt lymphoma (BL) and high-grade B-cell lymphoma with BL gene signature (adult-molecularly defined BL [mBL]) revealed the MYC-ARF-p53 axis as the primary deregulated pathway. Adult-mBL had either unique or more frequent genomic aberrations (del13q14, del17p, gain8q24, and gain18q21) compared with pediatric-mBL, but shared commonly mutated genes. Mutations in genes promoting the tonic B-cell receptor (BCR)→PI3K pathway ( TCF3 and ID3 ) did not differ by age, whereas effectors of chronic BCR→NF-κB signaling were associated with adult-mBL. A subset of adult-mBL had BCL2 translocation and mutation and elevated BCL2 mRNA and protein expression, but had a mutation profile similar to mBL. These double-hit lymphomas may have arisen from a tumor precursor that acquired both BCL2 and MYC translocations and/or KMT2D ( MLL2 ) mutation. Gain/amplification of MIR17HG and its paralogue loci was observed in 50% of adult-mBL. In vitro studies suggested miR-17∼92 's role in constitutive activation of BCR signaling and sensitivity to ibrutinib. Overall integrative analysis identified an interrelated gene network affected by copy number and mutation, leading to disruption of the p53 pathway and the BCR→PI3K or NF-κB activation, which can be further exploited in vivo by small-molecule inhibitors for effective therapy in adult-mBL.
Bartsch, Conny; Weiss, Michael; Kipper, Silke
Sexual ornamentation may be related to the degree of paternal care and the 'good-parent' model predicts that male secondary characters honestly advertise paternal investment. In most birds, males are involved in bringing up the young and successful reproduction highly depends on male contribution during breeding. In passerines, male song is indicative of male attributes and for few species it has been shown that song features also signal paternal investment to females. Males of nightingales Luscinia megarhynchos are famous for their elaborate singing but so far there is only little knowledge on the role of male song in intersexual communication, and it is unknown whether male song predicts male parenting abilities. Using RFID technology to record male feeding visits to the nest, we found that nightingale males substantially contribute to chick feeding. Also, we analyzed male nocturnal song with focus on song features that have been shown to signal male quality before. We found that several song features, namely measures of song complexity and song sequencing, were correlated with male feeding rates. Moreover, the combination of these song features had strong predictive power for male contribution to nestling feeding. Since male nightingales are involved in chick rearing, paternal investment might be a crucial variable for female mate choice in this species. Females may assess future paternal care on the basis of song features identified in our study and thus these features may have evolved to signal direct benefits to females. Additionally we underline the importance of multiple acoustic cues for female mating decisions especially in species with complex song such as the nightingale.
He, Yan; Wang, Minghui; Dukowic-Schulze, Stefanie; Zhou, Adele; Tiang, Choon-Lin; Shilo, Shay; Sidhu, Gaganpreet K; Eichten, Steven; Bradbury, Peter; Springer, Nathan M; Buckler, Edward S; Levy, Avraham A; Sun, Qi; Pillardy, Jaroslaw; Kianian, Penny M A; Kianian, Shahryar F; Chen, Changbin; Pawlowski, Wojciech P
Meiotic recombination is the most important source of genetic variation in higher eukaryotes. It is initiated by formation of double-strand breaks (DSBs) in chromosomal DNA in early meiotic prophase. The DSBs are subsequently repaired, resulting in crossovers (COs) and noncrossovers (NCOs). Recombination events are not distributed evenly along chromosomes but cluster at recombination hotspots. How specific sites become hotspots is poorly understood. Studies in yeast and mammals linked initiation of meiotic recombination to active chromatin features present upstream from genes, such as absence of nucleosomes and presence of trimethylation of lysine 4 in histone H3 (H3K4me3). Core recombination components are conserved among eukaryotes, but it is unclear whether this conservation results in universal characteristics of recombination landscapes shared by a wide range of species. To address this question, we mapped meiotic DSBs in maize, a higher eukaryote with a large genome that is rich in repetitive DNA. We found DSBs in maize to be frequent in all chromosome regions, including sites lacking COs, such as centromeres and pericentromeric regions. Furthermore, most DSBs are formed in repetitive DNA, predominantly Gypsy retrotransposons, and only one-quarter of DSB hotspots are near genes. Genic and nongenic hotspots differ in several characteristics, and only genic DSBs contribute to crossover formation. Maize hotspots overlap regions of low nucleosome occupancy but show only limited association with H3K4me3 sites. Overall, maize DSB hotspots exhibit distribution patterns and characteristics not reported previously in other species. Understanding recombination patterns in maize will shed light on mechanisms affecting dynamics of the plant genome.
Full Text Available There is a surge of studies confirming that old age spares the ability to bind in visual working memory (VWM multiple features within singular object representations. Furthermore, it has been suggested that such ability may also be independent of the cultural background of the assessed individual. However, this evidence has been gathered with tasks that use arbitrary bindings of unfamiliar features. Whether age spares memory binding functions when the memoranda are features of everyday life objects remains less well explored. The present study investigated the influence of age, memory delay, and education, on conjunctive binding functions responsible for representing everyday items in VWM. We asked 32 healthy young and 41 healthy older adults to perform a memory binding task. During the task, participants saw visual arrays of objects, colours, or coloured objects presented for 6 s. Immediately after they were asked either to select the objects or the colours that were presented during the study display from larger sets of objects or colours, or to recombine them by selecting from such sets the objects and their corresponding colours. This procedure was repeated immediately after but this time providing a 30 s unfiled delay. We manipulated familiarity by presenting congruent and incongruent object-colour pairings. The results showed that the ability to bind intrinsic features in VWM does not decline with age even when these features belong to everyday items and form novel or well-known associations. Such preserved memory binding abilities held across memory delays. The impact of feature congruency on item-recognition appears to be greater in older than in younger adults. This suggests that long-term memory (LTM supports binding functions carried out in VWM for familiar everyday items and older adults still benefit from this LTM support. We have expanded the evidence supporting the lack of age effects on VWM binding functions to new feature and
Full Text Available We report the chloroplast (cp genome sequence of tartary buckwheat (Fagopyrum tataricum obtained by next-generation sequencing technology and compared this with the previously reported common buckwheat (F. esculentum ssp. ancestrale cp genome. The cp genome of F. tataricum has a total sequence length of 159,272 bp, which is 327 bp shorter than the common buckwheat cp genome. The cp gene content, order, and orientation are similar to those of common buckwheat, but with some structural variation at tandem and palindromic repeat frequencies and junction areas. A total of seven InDels (around 100 bp were found within the intergenic sequences and the ycf1 gene. Copy number variation of the 21-bp tandem repeat varied in F. tataricum (four repeats and F. esculentum (one repeat, and the InDel of the ycf1 gene was 63 bp long. Nucleotide and amino acid have highly conserved coding sequence with about 98% homology and four genes--rpoC2, ycf3, accD, and clpP--have high synonymous (Ks value. PCR based InDel markers were applied to diverse genetic resources of F. tataricum and F. esculentum, and the amplicon size was identical to that expected in silico. Therefore, these InDel markers are informative biomarkers to practically distinguish raw or processed buckwheat products derived from F. tataricum and F. esculentum.
Catellicoccus marimammalium is a relatively uncharacterized Gram-positive, facultative anaerobe with potential utility as an indicator of waterfowl fecal contamination. Here we report an annotated draft genome sequence that suggests this organism may be a symbiotic gut microbe.
Full Text Available The National Prevention Plan (NPP 2010-2012, approved by the Agreement between the Government, the Regions and Autonomous Provinces of Trento and Bolzano on 29 April 2010, called for Regions to adopt, by 31/12/2010, the Regional Prevention Plan (RPP for implementing the interventions provided by the NPP 2010-2012.This article has considered and compared the different RPP’s. In an attempt to provide an outlook on the future medical prevention plans over the next few years in Italy, a comparison has been made between the RPP from 19 Regions and the Autonomous Province of Trento. This work has been focused on the actions identified in regional plans as a priority concerning the major common and innovative elements.The analysis of each RPP revealed a common plan to chronic degenerative diseases, because of the aging of the population in every Region of Italy. Other important common targets are: surveillance systems, vaccination programs and screening programs. Toscana and Liguria, more than other Regions, are engaged in the creation of networks involving various social actors. In some Regions there are projects aimed at eliminating social, economic or gender inequities, such as the project “women’s health” in the Region of Puglia. Toscana and Emilia-Romagna Plans pay attention to environment and pollution issues.Despite social, environmental and economic differences, the various Regions have common principles, concerning: life style, surveillance, vaccination and the screening for cancer.
Hacker, William C; Li, Shuxiang; Elcock, Adrian H
We describe structural models of the Escherichia coli chromosome in which the positions of all 4.6 million nucleotides of each DNA strand are resolved. Models consistent with two basic chromosomal orientations, differing in their positioning of the origin of replication, have been constructed. In both types of model, the chromosome is partitioned into plectoneme-abundant and plectoneme-free regions, with plectoneme lengths and branching patterns matching experimental distributions, and with spatial distributions of highly-transcribed chromosomal regions matching recent experimental measurements of the distribution of RNA polymerases. Physical analysis of the models indicates that the effective persistence length of the DNA and relative contributions of twist and writhe to the chromosome's negative supercoiling are in good correspondence with experimental estimates. The models exhibit characteristics similar to those of 'fractal globules,' and even the most genomically-distant parts of the chromosome can be physically connected, through paths combining linear diffusion and inter-segmental transfer, by an average of only ∼10 000 bp. Finally, macrodomain structures and the spatial distributions of co-expressed genes are analyzed: the latter are shown to depend strongly on the overall orientation of the chromosome. We anticipate that the models will prove useful in exploring other static and dynamic features of the bacterial chromosome. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Background Brachyspira spp. colonize the intestines of some mammalian and avian species and show different degrees of enteropathogenicity. Brachyspira intermedia can cause production losses in chickens and strain PWS/AT now becomes the fourth genome to be completed in the genus Brachyspira. Results 15 classes of unique and shared genes were analyzed in B. intermedia, B. murdochii, B. hyodysenteriae and B. pilosicoli. The largest number of unique genes was found in B. intermedia and B. murdochii. This indicates the presence of larger pan-genomes. In general, hypothetical protein annotations are overrepresented among the unique genes. A 3.2 kb plasmid was found in B. intermedia strain PWS/AT. The plasmid was also present in the B. murdochii strain but not in nine other Brachyspira isolates. Within the Brachyspira genomes, genes had been translocated and also frequently switched between leading and lagging strands, a process that can be followed by different AT-skews in the third positions of synonymous codons. We also found evidence that bacteriophages were being remodeled and genes incorporated into them. Conclusions The accessory gene pool shapes species-specific traits. It is also influenced by reductive genome evolution and horizontal gene transfer. Gene-transfer events can cross both species and genus boundaries and bacteriophages appear to play an important role in this process. A mechanism for horizontal gene transfer appears to be gene translocations leading to remodeling of bacteriophages in combination with broad tropism. PMID:21816042
Churikov, A.V.; L'vov, A.L.; Gamayunova, I.M.; Shirokov, A.V.
Electrochemical behaviour of Li-electrode in LiAlCl 4 solutions in thienyl chloride and LiBF 4 solutions in γ-butyrolactone is studied as well as Li-electrode with Li 2 CO 3 protected film in LiClO 4 solution in mixed solvent (propylene carbonate and dimethoxyethane). Common regularities of Li-electrode electrochemical kinetic are discussed. Methods of electrode impedance spectroscopy and pulse voltametry are used for investigations
Full Text Available Abstract Background Fabaceae (legumes is one of the largest families of flowering plants, and some members are important crops. In contrast to what we know about their great diversity or economic importance, our knowledge at the genomic level of chloroplast genomes (cpDNAs or plastomes for these crops is limited. Results We sequenced the complete genome of the common bean (Phaseolus vulgaris cv. Negro Jamapa chloroplast. The plastome of P. vulgaris is a 150,285 bp circular molecule. It has gene content similar to that of other legume plastomes, but contains two pseudogenes, rpl33 and rps16. A distinct inversion occurred at the junction points of trnH-GUG/rpl14 and rps19/rps8, as in adzuki bean 1. These two pseudogenes and the inversion were confirmed in 10 varieties representing the two domestication centers of the bean. Genomic comparative analysis indicated that inversions generally occur in legume plastomes and the magnitude and localization of insertions/deletions (indels also vary. The analysis of repeat sequences demonstrated that patterns and sequences of tandem repeats had an important impact on sequence diversification between legume plastomes and tandem repeats did not belong to dispersed repeats. Interestingly, P. vulgaris plastome had higher evolutionary rates of change on both genomic and gene levels than G. max, which could be the consequence of pressure from both mutation and natural selection. Conclusion Legume chloroplast genomes are widely diversified in gene content, gene order, indel structure, abundance and localization of repetitive sequences, intracellular sequence exchange and evolutionary rates. The P. vulgaris plastome is a rapidly evolving genome.
Tamara Vladimirovna Kuprina
Full Text Available The article discusses the features of migration flows in Russia, CIS and abroad in terms of transnationality and translocality. Special attention is paid to migration flows from Asia, depending on the gender perception. During the analysis, the preferences of migrants of different age groups are determined. It shows their desire for a more independent financial and professional life, relationship with family responsibilities, plans for the future. The data obtained by the method of continuous selection have led to the following conclusions: migration is a multidimensional phenomenon that requires multidisciplinary research; the heterogeneity of migration flows to the socio-cultural and gender features must be taken into account; the problem of illegal migration requires special solutions; in connection with the emigration of highly skilled professionals and immigration of low-skilled personnel, a professional important indicator of migratory replacement and attracting compatriots from abroad are important. The data obtained can be used for ethnic and cultural planning in order to improve production activities that affect the quality of solving of production tasks and programming adaptation of migrants as the significant impact of intangible factors on the development of society has been noted by economists for a long time. Thus, the economic sustainability requires new thinking based on socio-cultural sustainability determined by such global phenomena as international migration flows.
Full Text Available Cyprinids are the most important group of farmed fish globally in terms of production volume, with common carp (Cyprinus carpio being one of the most valuable species of the group. The use of modern selective breeding methods in carp is at a formative stage, implying a large scope for genetic improvement of key production traits. In the current study, a population of 1,425 carp juveniles, originating from a partial factorial cross between 40 sires and 20 dams, was used for investigating the potential of genomic selection (GS for juvenile growth, an exemplar polygenic production trait. RAD sequencing was used to identify and genotype SNP markers for subsequent parentage assignment, construction of a medium density genetic map (12,311 SNPs, genome-wide association study (GWAS, and testing of GS. A moderate heritability was estimated for body length of carp at 120 days (as a proxy of juvenile growth of 0.33 (s.e. 0.05. No genome-wide significant QTL was identified using a single marker GWAS approach. Genomic prediction of breeding values outperformed pedigree-based prediction, resulting in 18% improvement in prediction accuracy. The impact of reduced SNP densities on prediction accuracy was tested by varying minor allele frequency (MAF thresholds, with no drop in prediction accuracy until the MAF threshold is set <0.3 (2,744 SNPs. These results point to the potential for GS to improve economically important traits in common carp breeding programs.
I. V. Sharkova
Full Text Available The algorithm of differential diagnosis of the two most common genetic variants the limb-girdle muscular dystrophy (LGMD2A and DMD, developed on the basis of a comprehensive survey of 85 patients with a diagnosis specification using techniques of DNA analysis. It is shown that the accurate diagnosis of LGMD genetic types should be based on the results of the clinical and genealogical, biochemical and molecular genetic analysis. The proposed algorithm will significantly reduces the economic and time costs with expensive DNA testing.
Sánchez-Vega, Francisco; Gotea, Valer; Margolin, Gennady; Elnitski, Laura
The term CpG island methylator phenotype (CIMP) has been used to describe widespread DNA hypermethylation at CpG-rich genomic regions affecting clinically distinct subsets of cancer patients. Even though there have been numerous studies of CIMP in individual cancer types, a uniform analysis across tissues is still lacking. We analyze genome-wide patterns of CpG island hypermethylation in 5,253 solid epithelial tumors from 15 cancer types from TCGA and 23 cancer cell lines from ENCODE. We identify differentially methylated loci that define CIMP+ and CIMP- samples, and we use unsupervised clustering to provide a robust molecular stratification of tumor methylomes for 12 cancer types and all cancer cell lines. With a minimal set of 89 discriminative loci, we demonstrate accurate pan-cancer separation of the 12 CIMP+/- subpopulations, based on their average levels of methylation. Tumor samples in different CIMP subclasses show distinctive correlations with gene expression profiles and recurrence of somatic mutations, copy number variations, and epigenetic silencing. Enrichment analyses indicate shared canonical pathways and upstream regulators for CIMP-targeted regions across cancer types. Furthermore, genomic alterations showing consistent associations with CIMP+/- status include genes involved in DNA repair, chromatin remodeling genes, and several histone methyltransferases. Associations of CIMP status with specific clinical features, including overall survival in several cancer types, highlight the importance of the CIMP+/- designation for individual tumor evaluation and personalized medicine. We present a comprehensive computational study of CIMP that reveals pan-cancer commonalities and tissue-specific differences underlying concurrent hypermethylation of CpG islands across tumors. Our stratification of solid tumors and cancer cell lines based on CIMP status is data-driven and agnostic to tumor type by design, which protects against known biases that have hindered
Elijah R Behr
Full Text Available Marked prolongation of the QT interval on the electrocardiogram associated with the polymorphic ventricular tachycardia Torsades de Pointes is a serious adverse event during treatment with antiarrhythmic drugs and other culprit medications, and is a common cause for drug relabeling and withdrawal. Although clinical risk factors have been identified, the syndrome remains unpredictable in an individual patient. Here we used genome-wide association analysis to search for common predisposing genetic variants. Cases of drug-induced Torsades de Pointes (diTdP, treatment tolerant controls, and general population controls were ascertained across multiple sites using common definitions, and genotyped on the Illumina 610k or 1M-Duo BeadChips. Principal Components Analysis was used to select 216 Northwestern European diTdP cases and 771 ancestry-matched controls, including treatment-tolerant and general population subjects. With these sample sizes, there is 80% power to detect a variant at genome-wide significance with minor allele frequency of 10% and conferring an odds ratio of ≥2.7. Tests of association were carried out for each single nucleotide polymorphism (SNP by logistic regression adjusting for gender and population structure. No SNP reached genome wide-significance; the variant with the lowest P value was rs2276314, a non-synonymous coding variant in C18orf21 (p = 3×10(-7, odds ratio = 2, 95% confidence intervals: 1.5-2.6. The haplotype formed by rs2276314 and a second SNP, rs767531, was significantly more frequent in controls than cases (p = 3×10(-9. Expanding the number of controls and a gene-based analysis did not yield significant associations. This study argues that common genomic variants do not contribute importantly to risk for drug-induced Torsades de Pointes across multiple drugs.
de Queiroz, Casley Borges; Correia, Hilberty L Nunes; Menicucci, Renato Pedrozo; Vidigal, Pedro M Pereira; de Queiroz, Marisa Vieira
Colletotrichum lindemuthianum is the causal agent of anthracnose in common beans, one of the main limiting factors of their culture. Here, we report for the first time, to our knowledge, a draft of the complete genome sequences of two isolates belonging to 83.501 and 89 A 2 2-3 of C. lindemutuianum . Copyright © 2017 de Queiroz et al.
Juliana Morini Küpper Cardoso Perseguini
Full Text Available The common bean (Phaseolus vulgaris L. is the world's most important legume for human consumption. Anthracnose (ANT; Colletotrichum lindemuthianum and angular leaf spot (ALS; Pseudocercospora griseola are complex diseases that cause major yield losses in common bean. Depending on the cultivar and environmental conditions, anthracnose and angular leaf spot infections can reduce crop yield drastically. This study aimed to estimate linkage disequilibrium levels and identify quantitative resistance loci (QRL controlling resistance to both ANT and ALS diseases of 180 accessions of common bean using genome-wide association analysis. A randomized complete block design with four replicates was performed for the ANT and ALS experiments, with four plants per genotype in each replicate. Association mapping analyses were performed for ANT and ALS using a mixed linear model approach implemented in TASSEL. A total of 17 and 11 significant statistically associations involving SSRs were detected for ANT and ALS resistance loci, respectively. Using SNPs, 21 and 17 significant statistically associations were obtained for ANT and angular ALS, respectively, providing more associations with this marker. The SSR-IAC167 and PvM95 markers, both located on chromosome Pv03, and the SNP scaffold00021_89379, were associated with both diseases. The other markers were distributed across the entire common bean genome, with chromosomes Pv03 and Pv08 showing the greatest number of loci associated with ANT resistance. The chromosome Pv04 was the most saturated one, with six markers associated with ALS resistance. The telomeric region of this chromosome showed four markers located between approximately 2.5 Mb and 4.4 Mb. Our results demonstrate the great potential of genome-wide association studies to identify QRLs related to ANT and ALS in common bean. The results indicate a quantitative and complex inheritance pattern for both diseases in common bean. Our findings will
Perseguini, Juliana Morini Küpper Cardoso; Oblessuc, Paula Rodrigues; Rosa, João Ricardo Bachega Feijó; Gomes, Kleber Alves; Chiorato, Alisson Fernando; Carbonell, Sérgio Augusto Morais; Garcia, Antonio Augusto Franco; Vianello, Rosana Pereira; Benchimol-Reis, Luciana Lasry
The common bean (Phaseolus vulgaris L.) is the world's most important legume for human consumption. Anthracnose (ANT; Colletotrichum lindemuthianum) and angular leaf spot (ALS; Pseudocercospora griseola) are complex diseases that cause major yield losses in common bean. Depending on the cultivar and environmental conditions, anthracnose and angular leaf spot infections can reduce crop yield drastically. This study aimed to estimate linkage disequilibrium levels and identify quantitative resistance loci (QRL) controlling resistance to both ANT and ALS diseases of 180 accessions of common bean using genome-wide association analysis. A randomized complete block design with four replicates was performed for the ANT and ALS experiments, with four plants per genotype in each replicate. Association mapping analyses were performed for ANT and ALS using a mixed linear model approach implemented in TASSEL. A total of 17 and 11 significant statistically associations involving SSRs were detected for ANT and ALS resistance loci, respectively. Using SNPs, 21 and 17 significant statistically associations were obtained for ANT and angular ALS, respectively, providing more associations with this marker. The SSR-IAC167 and PvM95 markers, both located on chromosome Pv03, and the SNP scaffold00021_89379, were associated with both diseases. The other markers were distributed across the entire common bean genome, with chromosomes Pv03 and Pv08 showing the greatest number of loci associated with ANT resistance. The chromosome Pv04 was the most saturated one, with six markers associated with ALS resistance. The telomeric region of this chromosome showed four markers located between approximately 2.5 Mb and 4.4 Mb. Our results demonstrate the great potential of genome-wide association studies to identify QRLs related to ANT and ALS in common bean. The results indicate a quantitative and complex inheritance pattern for both diseases in common bean. Our findings will contribute to more
Hultsch, T; Albers, M W; Schreiber, S L; Hohman, R J
Investigations of the actions and interactions of the immunophilin ligands FK506, cyclosporin A (CsA), rapamycin, and 506BD suggest that complexes of FK506 with an FK506-binding protein or of CsA with a cyclophilin (CsA-binding protein) inhibit the T-cell receptor-mediated signal transduction that results in the transcription of interleukin 2. Now we report an identical spectrum of activities of FK506, CsA, rapamycin, and 506BD on IgE receptor-mediated signal transduction that results in exocytosis of secretory granules from the rat basophilic leukemia cell line RBL-2H3, a mast cell model. Both FK506 and CsA inhibit receptor-mediated exocytosis (CsA IC50 = 200 nM; FK506 IC50 = 2 nM) without affecting early receptor-associated events (hydrolysis of phosphatidylinositol, synthesis and release of eicosanoids, uptake of Ca2+). In contrast, rapamycin and 506BD, which share common structural elements with FK506, by themselves have no effect on IgE receptor-mediated exocytosis. Both compounds, however, prevent inhibition by FK506 but not by CsA. Affinity chromatography with FK506, CsA, and rapamycin matrices indicates that the same set of immunophilins present in RBL-2H3 cells have been found in Jurkat T cells and calf thymus; however, the relative amounts of these proteins differ in the two cell types. These results suggest the existence of a common step in cytoplasmic signaling in T cells and mast cells that may be part of a general signaling mechanism. Images PMID:1712484
Full Text Available CD4+Foxp3+ Treg cells are essential for maintaining self-tolerance and preventing excessive immune responses. In the context of Th1 immune responses, co-expression of the Th1 transcription factor T-bet with Foxp3 is essential for Treg cells to control Th1 responses. T-bet-dependent expression of CXCR3 directs Treg cells to the site of inflammation. However, the suppressive mediators enabling effective control of Th1 responses at this site are unknown. In this study, we determined the signature of CXCR3+ Treg cells arising in Th1 settings and defined universal features of Treg cells in this context using multiple Th1-dominated infection models. Our analysis defined a set of Th1-specific co-inhibitory receptors and cytotoxic molecules that are specifically expressed in Treg cells during Th1 immune responses in mice and humans. Among these, we identified the novel co-inhibitory receptor CD85k as a functional predictor for Treg-mediated suppression specifically of Th1 responses, which could be explored therapeutically for selective immune suppression in autoimmunity.
Roux, Simon; Enault, Francois; Ravet, Viviane; Colombet, Jonathan; Bettarel, Yvan; Auguet, Jean-Christophe; Bouvier, Thierry; Lucas-Staat, Soizick; Vellet, Agnès; Prangishvili, David; Forterre, Patrick; Debroas, Didier; Sime-Ngando, Telesphore
Microbial communities from hypersaline ponds, dominated by halophilic archaea, are considered specific of such extreme conditions. The associated viral communities have accordingly been shown to display specific features, such as similar morphologies among different sites. However, little is known about the genetic diversity of these halophilic viral communities across the Earth. Here, we studied viral communities in hypersaline ponds sampled on the coast of Senegal (8-36% of salinity) using metagenomics approach, and compared them with hypersaline viromes from Australia and Spain. The specificity of hyperhalophilic viruses could first be demonstrated at a community scale, salinity being a strong discriminating factor between communities. For the major viral group detected in all samples (Caudovirales), only a limited number of halophilic Caudovirales clades were highlighted. These clades gather viruses from different continents and display consistent genetic composition, indicating that they represent related lineages with a worldwide distribution. Non-tailed hyperhalophilic viruses display a greater rate of gene transfer and recombination, with uncharacterized genes conserved across different kind of viruses and plasmids. Thus, hypersaline viral communities around the world appear to form a genetically consistent community that are likely to harbour new genes coding for enzymes specifically adapted to these environments. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Koide, Michi; Koike, Fumihiro; Nagata, Tetsuo
A previous study of subvalence s-shell photoionization of potassium [Koide et al.: J. Phys. Soc. Jpn. 71 (2002) 1676] has been extended to the cases of heavier alkaline atoms Rb and Cs. We have measured the photoion time-of-flight spectra using monochromatized synchrotron radiation. Dual windows resonance structure previously observed in K was also found in Rb and Cs, suggesting that those structure are general features in alkaline atoms. We have observed also the Rydberg series of resonances that appear in dual windows. Our data analysis shows that the resonance widths are broad when compared with its rare gas neighbors. Based on multiconfiguration Dirac-Fock calculations, the Rydberg series of resonances were assigned to the 4s 1 4p 6 5s5p excitations embedded in the 4p 5 5s continua for Rb and to the 5s 1 5p 6 6s6p excitations embedded in the 5p 5 6s continua for Cs. (author)
Peter K Joshi
Full Text Available The analysis of less common variants in genome-wide association studies promises to elucidate complex trait genetics but is hampered by low power to reliably detect association. We show that addition of population-specific exome sequence data to global reference data allows more accurate imputation, particularly of less common SNPs (minor allele frequency 1-10% in two very different European populations. The imputation improvement corresponds to an increase in effective sample size of 28-38%, for SNPs with a minor allele frequency in the range 1-3%.
Thuan, Nguyen Huy; Dhakal, Dipesh; Pokhrel, Anaya Raj; Chu, Luan Luong; Van Pham, Thi Thuy; Shrestha, Anil; Sohng, Jae Kyung
Streptomyces peucetius ATCC 27952 produces two major anthracyclines, doxorubicin (DXR) and daunorubicin (DNR), which are potent chemotherapeutic agents for the treatment of several cancers. In order to gain detailed insight on genetics and biochemistry of the strain, the complete genome was determined and analyzed. The result showed that its complete sequence contains 7187 protein coding genes in a total of 8,023,114 bp, whereas 87% of the genome contributed to the protein coding region. The genomic sequence included 18 rRNA, 66 tRNAs, and 3 non-coding RNAs. In silico studies predicted ~ 68 biosynthetic gene clusters (BCGs) encoding diverse classes of secondary metabolites, including non-ribosomal polyketide synthase (NRPS), polyketide synthase (PKS I, II, and III), terpenes, and others. Detailed analysis of the genome sequence revealed versatile biocatalytic enzymes such as cytochrome P450 (CYP), electron transfer systems (ETS) genes, methyltransferase (MT), glycosyltransferase (GT). In addition, numerous functional genes (transporter gene, SOD, etc.) and regulatory genes (afsR-sp, metK-sp, etc.) involved in the regulation of secondary metabolites were found. This minireview summarizes the genome-based genome mining (GM) of diverse BCGs and genome exploration (GE) of versatile biocatalytic enzymes, and other enzymes involved in maintenance and regulation of metabolism of S. peucetius. The detailed analysis of genome sequence provides critically important knowledge useful in the bioengineering of the strain or harboring catalytically efficient enzymes for biotechnological applications.
Andrew J K Phillips
Full Text Available Mammalian sleep varies widely, ranging from frequent napping in rodents to consolidated blocks in primates and unihemispheric sleep in cetaceans. In humans, rats, mice and cats, sleep patterns are orchestrated by homeostatic and circadian drives to the sleep-wake switch, but it is not known whether this system is ubiquitous among mammals. Here, changes of just two parameters in a recent quantitative model of this switch are shown to reproduce typical sleep patterns for 17 species across 7 orders. Furthermore, the parameter variations are found to be consistent with the assumptions that homeostatic production and clearance scale as brain volume and surface area, respectively. Modeling an additional inhibitory connection between sleep-active neuronal populations on opposite sides of the brain generates unihemispheric sleep, providing a testable hypothetical mechanism for this poorly understood phenomenon. Neuromodulation of this connection alone is shown to account for the ability of fur seals to transition between bihemispheric sleep on land and unihemispheric sleep in water. Determining what aspects of mammalian sleep patterns can be explained within a single framework, and are thus universal, is essential to understanding the evolution and function of mammalian sleep. This is the first demonstration of a single model reproducing sleep patterns for multiple different species. These wide-ranging findings suggest that the core physiological mechanisms controlling sleep are common to many mammalian orders, with slight evolutionary modifications accounting for interspecies differences.
Howe, Kristy B.; Coates, Peter S.; Delehanty, David J.
Common Raven (Corvus corax) numbers and distribution are increasing throughout the sagebrush steppe, influencing avian communities in complex ways. Anthropogenic structures are thought to increase raven populations by providing food and nesting subsidies, which is cause for concern because ravens are important nest predators of sensitive species, including Greater Sage-Grouse (Centrocercus urophasianus). During 2007–2009, we located raven nests in southeastern Idaho and conducted a resource selection analysis. We measured variables at multiple spatial scales for 72 unique nest locations, including landscape-level vegetation characteristics and anthropogenic structures. Using generalized linear mixed models and an information-theoretic approach, we found a 31% decrease in the odds of nesting by ravens for every 1 km increase in distance away from a transmission line. Furthermore, a 100-m increase in distance away from the edge of two different land cover types decreased the odds of nesting by 20%, and an increase in the amount of edge by 1 km within an area of 102.1 ha centered on the nest increased the odds of nesting by 49%. A post hoc analysis revealed that ravens were most likely to nest near edges of adjoining big sagebrush (Artemisia tridentata) and land cover types that were associated with direct human disturbance or fire. These findings contribute to our understanding of raven expansion into rural environments and could be used to make better-informed conservation decisions, especially in the face of increasing renewable energy development.
Konold, Timm; Bone, Gemma; Vidal-Diez, Alberto; Tortosa, Raul; Davis, Andrew; Dexter, Glenda; Hill, Peter; Jeffrey, Martin; Simmons, Marion M; Chaplin, Melanie J; Bellworthy, Susan J; Berthelin-Baker, Christine
The variability in the clinical or pathological presentation of transmissible spongiform encephalopathies (TSEs) in sheep, such as scrapie and bovine spongiform encephalopathy (BSE), has been attributed to prion protein genotype, strain, breed, clinical duration, dose, route and type of inoculum and the age at infection. The study aimed to describe the clinical signs in sheep infected with the BSE agent throughout its clinical course to determine whether the clinical signs were as variable as described for classical scrapie in sheep. The clinical signs were compared to BSE-negative sheep to assess if disease-specific clinical markers exist. Forty-seven (34%) of 139 sheep, which comprised 123 challenged sheep and 16 undosed controls, were positive for BSE. Affected sheep belonged to five different breeds and three different genotypes (ARQ/ARQ, VRQ/VRQ and AHQ/AHQ). None of the controls or BSE exposed sheep with ARR alleles were positive. Pruritus was present in 41 (87%) BSE positive sheep; the remaining six were judged to be pre-clinically infected. Testing of the response to scratching along the dorsum of a sheep proved to be a good indicator of clinical disease with a test sensitivity of 85% and specificity of 98% and usually coincided with weight loss. Clinical signs that were displayed significantly earlier in BSE positive cases compared to negative cases were behavioural changes, pruritic behaviour, a positive scratch test, alopecia, skin lesions, teeth grinding, tremor, ataxia, loss of weight and loss of body condition. The frequency and severity of each specific clinical sign usually increased with the progression of disease over a period of 16-20 weeks. Our results suggest that BSE in sheep presents with relatively uniform clinical signs, with pruritus of increased severity and abnormalities in behaviour or movement as the disease progressed. Based on the studied sheep, these clinical features appear to be independent of breed, affected genotype, dose, route
Full Text Available Abstract Background The variability in the clinical or pathological presentation of transmissible spongiform encephalopathies (TSEs in sheep, such as scrapie and bovine spongiform encephalopathy (BSE, has been attributed to prion protein genotype, strain, breed, clinical duration, dose, route and type of inoculum and the age at infection. The study aimed to describe the clinical signs in sheep infected with the BSE agent throughout its clinical course to determine whether the clinical signs were as variable as described for classical scrapie in sheep. The clinical signs were compared to BSE-negative sheep to assess if disease-specific clinical markers exist. Results Forty-seven (34% of 139 sheep, which comprised 123 challenged sheep and 16 undosed controls, were positive for BSE. Affected sheep belonged to five different breeds and three different genotypes (ARQ/ARQ, VRQ/VRQ and AHQ/AHQ. None of the controls or BSE exposed sheep with ARR alleles were positive. Pruritus was present in 41 (87% BSE positive sheep; the remaining six were judged to be pre-clinically infected. Testing of the response to scratching along the dorsum of a sheep proved to be a good indicator of clinical disease with a test sensitivity of 85% and specificity of 98% and usually coincided with weight loss. Clinical signs that were displayed significantly earlier in BSE positive cases compared to negative cases were behavioural changes, pruritic behaviour, a positive scratch test, alopecia, skin lesions, teeth grinding, tremor, ataxia, loss of weight and loss of body condition. The frequency and severity of each specific clinical sign usually increased with the progression of disease over a period of 16–20 weeks. Conclusion Our results suggest that BSE in sheep presents with relatively uniform clinical signs, with pruritus of increased severity and abnormalities in behaviour or movement as the disease progressed. Based on the studied sheep, these clinical features appear to
Huang, Ya-Yi; Cho, Shu-Ting; Haryono, Mindia; Kuo, Chih-Horng
Common bermudagrass (Cynodon dactylon (L.) Pers.) belongs to the subfamily Chloridoideae of the Poaceae family, one of the most important plant families ecologically and economically. This grass has a long connection with human culture but its systematics is relatively understudied. In this study, we sequenced and investigated the chloroplast genome of common bermudagrass, which is 134,297 bp in length with two single copy regions (LSC: 79,732 bp; SSC: 12,521 bp) and a pair of inverted repeat (IR) regions (21,022 bp). The annotation contains a total of 128 predicted genes, including 82 protein-coding, 38 tRNA, and 8 rRNA genes. Additionally, our in silico analyses identified 10 sets of repeats longer than 20 bp and predicted the presence of 36 RNA editing sites. Overall, the chloroplast genome of common bermudagrass resembles those from other Poaceae lineages. Compared to most angiosperms, the accD gene and the introns of both clpP and rpoC1 genes are missing. Additionally, the ycf1, ycf2, ycf15, and ycf68 genes are pseudogenized and two genome rearrangements exist. Our phylogenetic analysis based on 47 chloroplast protein-coding genes supported the placement of common bermudagrass within Chloridoideae. Our phylogenetic character mapping based on the parsimony principle further indicated that the loss of the accD gene and clpP introns, the pseudogenization of four ycf genes, and the two rearrangements occurred only once after the most recent common ancestor of the Poaceae diverged from other monocots, which could explain the unusual long branch leading to the Poaceae when phylogeny is inferred based on chloroplast sequences.
Inda, Márcia A; van Batenburg, Marinus F; Roos, Marco; Belloum, Adam S Z; Vasunin, Dmitry; Wibisono, Adianto; van Kampen, Antoine H C; Breit, Timo M
Chromosome location is often used as a scaffold to organize genomic information in both the living cell and molecular biological research. Thus, ever-increasing amounts of data about genomic features are stored in public databases and can be readily visualized by genome browsers. To perform in silico experimentation conveniently with this genomics data, biologists need tools to process and compare datasets routinely and explore the obtained results interactively. The complexity of such experimentation requires these tools to be based on an e-Science approach, hence generic, modular, and reusable. A virtual laboratory environment with workflows, workflow management systems, and Grid computation are therefore essential. Here we apply an e-Science approach to develop SigWin-detector, a workflow-based tool that can detect significantly enriched windows of (genomic) features in a (DNA) sequence in a fast and reproducible way. For proof-of-principle, we utilize a biological use case to detect regions of increased and decreased gene expression (RIDGEs and anti-RIDGEs) in human transcriptome maps. We improved the original method for RIDGE detection by replacing the costly step of estimation by random sampling with a faster analytical formula for computing the distribution of the null hypothesis being tested and by developing a new algorithm for computing moving medians. SigWin-detector was developed using the WS-VLAM workflow management system and consists of several reusable modules that are linked together in a basic workflow. The configuration of this basic workflow can be adapted to satisfy the requirements of the specific in silico experiment. As we show with the results from analyses in the biological use case on RIDGEs, SigWin-detector is an efficient and reusable Grid-based tool for discovering windows enriched for features of a particular type in any sequence of values. Thus, SigWin-detector provides the proof-of-principle for the modular e-Science based concept
Full Text Available The candidate phylum Poribacteria is one of the most dominant and widespread members of the microbial communities residing within marine sponges. Cell compartmentalization had been postulated along with their discovery about a decade ago and their phylogenetic association to the Planctomycetes, Verrucomicrobia, Chlamydiae superphylum was proposed soon thereafter. In the present study we revised these features based on genomic data obtained from six poribacterial single cells. We propose that Poribacteria form a distinct monophyletic phylum contiguous to the PVC superphylum together with other candidate phyla. Our genomic analyses supported the possibility of cell compartmentalization in form of bacterial microcompartments. Further analyses of eukaryote-like protein domains stressed the importance of such proteins with features including tetratricopeptide repeats, leucin rich repeats as well as low density lipoproteins receptor repeats, the latter of which are reported here for the first time from a sponge symbiont. Finally, examining the most abundant protein domain family on poribacterial genomes revealed diverse phyH family proteins, some of which may be related to dissolved organic posphorus uptake.
Friso, Simonetta; Choi, Sang-Woon; Girelli, Domenico; Mason, Joel B.; Dolnikowski, Gregory G.; Bagley, Pamela J.; Olivieri, Oliviero; Jacques, Paul F.; Rosenberg, Irwin H.; Corrocher, Roberto; Selhub, Jacob
DNA methylation, an essential epigenetic feature of DNA that modulates gene expression and genomic integrity, is catalyzed by methyltransferases that use the universal methyl donor S-adenosyl-l-methionine. Methylenetetrahydrofolate reductase (MTHFR) catalyzes the synthesis of 5-methyltetrahydrofolate (5-methylTHF), the methyl donor for synthesis of methionine from homocysteine and precursor of S-adenosyl-l-methionine. In the present study we sought to determine the effect of folate status on genomic DNA methylation with an emphasis on the interaction with the common C677T mutation in the MTHFR gene. A liquid chromatography/MS method for the analysis of nucleotide bases was used to assess genomic DNA methylation in peripheral blood mononuclear cell DNA from 105 subjects homozygous for this mutation (T/T) and 187 homozygous for the wild-type (C/C) MTHFR genotype. The results show that genomic DNA methylation directly correlates with folate status and inversely with plasma homocysteine (tHcy) levels (P < 0.01). T/T genotypes had a diminished level of DNA methylation compared with those with the C/C wild-type (32.23 vs.62.24 ng 5-methylcytosine/μg DNA, P < 0.0001). When analyzed according to folate status, however, only the T/T subjects with low levels of folate accounted for the diminished DNA methylation (P < 0.0001). Moreover, in T/T subjects DNA methylation status correlated with the methylated proportion of red blood cell folate and was inversely related to the formylated proportion of red blood cell folates (P < 0.03) that is known to be solely represented in those individuals. These results indicate that the MTHFR C677T polymorphism influences DNA methylation status through an interaction with folate status. PMID:11929966
Gao, Rongbao; Bai, Tian; Li, Xiaodan; Xiong, Ying; Huang, Yiwei; Pan, Ming; Zhang, Ye; Bo, Hong; Zou, Shumei; Shu, Yuelong
H9N2 avian influenza virus circulates widely in poultry and has been responsible for sporadic human infections in several regions. Few studies have been conducted on the pathogenicity of H9N2 AIV isolates that have different genomic features. We compared the pathology induced by a novel reassortant H9N2 virus and two currently circulating H9N2 viruses that have different genomic features in ferrets. The results showed that the three viruses can induce infections with various amounts of viral shedding in ferrets. The novel H9N2 induced respiratory infection, but no pathological lesions were observed in lung tissues. The other two viruses induced mild to intermediate pathological lesions in lung tissues, although the clinical signs presented mildly in ferrets. The pathological lesions presented a diversity consistent with viral replication in ferrets. Copyright © 2015 Elsevier Inc. All rights reserved.
Rodríguez-Luna, Stefany Daniela; Cruz Vázquez, Angélica Patricia; Jiménez Suárez, Verónica; Rodríguez-Sanoja, Romina; Alvarez-Buylla, Elena R.; Sánchez, Sergio
Endophytic bacteria are wide-spread and associated with plant physiological benefits, yet their genomes and secondary metabolites remain largely unidentified. In this study, we explored the genome of the endophyte Streptomyces scabrisporus NF3 for discovery of potential novel molecules as well as genes and metabolites involved in host interactions. The complete genomes of seven Streptomyces and three other more distantly related bacteria were used to define the functional landscape of this unique microbe. The S. scabrisporus NF3 genome is larger than the average Streptomyces genome and not structured for an obligate endosymbiotic lifestyle; this and the fact that can grow in R2YE media implies that it could include a soil-living stage. The genome displays an enrichment of genes associated with amino acid production, protein secretion, secondary metabolite and antioxidants production and xenobiotic degradation, indicating that S. scabrisporus NF3 could contribute to the metabolic enrichment of soil microbial communities and of its hosts. Importantly, besides its metabolic advantages, the genome showed evidence for differential functional specificity and diversification of plant interaction molecules, including genes for the production of plant hormones, stress resistance molecules, chitinases, antibiotics and siderophores. Given the diversity of S. scabrisporus mechanisms for host upkeep, we propose that these strategies were necessary for its adaptation to plant hosts and to face changes in environmental conditions. PMID:29447216
Corina Diana Ceapă
Full Text Available Endophytic bacteria are wide-spread and associated with plant physiological benefits, yet their genomes and secondary metabolites remain largely unidentified. In this study, we explored the genome of the endophyte Streptomyces scabrisporus NF3 for discovery of potential novel molecules as well as genes and metabolites involved in host interactions. The complete genomes of seven Streptomyces and three other more distantly related bacteria were used to define the functional landscape of this unique microbe. The S. scabrisporus NF3 genome is larger than the average Streptomyces genome and not structured for an obligate endosymbiotic lifestyle; this and the fact that can grow in R2YE media implies that it could include a soil-living stage. The genome displays an enrichment of genes associated with amino acid production, protein secretion, secondary metabolite and antioxidants production and xenobiotic degradation, indicating that S. scabrisporus NF3 could contribute to the metabolic enrichment of soil microbial communities and of its hosts. Importantly, besides its metabolic advantages, the genome showed evidence for differential functional specificity and diversification of plant interaction molecules, including genes for the production of plant hormones, stress resistance molecules, chitinases, antibiotics and siderophores. Given the diversity of S. scabrisporus mechanisms for host upkeep, we propose that these strategies were necessary for its adaptation to plant hosts and to face changes in environmental conditions.
Fontenelle, Leonardo F; Oostermeijer, Sanne; Harrison, Ben J; Pantelis, Christos; Yücel, Murat
The basic concepts underlying compulsive, impulsive and addictive behaviours overlap, which may help explain why laymen use these expressions interchangeably. Although there has been a large research effort to better characterize and disentangle these behaviours, clinicians and scientists are still unable to clearly differentiate them. Accordingly, obsessive-compulsive disorder (OCD), impulse control disorders (ICD) and substance-related disorders (SUD) overlap on different levels, including phenomenology, co-morbidity, neurocircuitry, neurocognition, neurochemistry and family history. In this review we summarize these issues with particular emphasis on the role of the opioid system in the pathophysiology and treatment of OCD, ICD and SUD. We postulate that with progression and chronicity of OCD, the proportion of the OCD-related behaviours (e.g. checking, washing, ordering and hoarding, among others) that are driven by impulsive 'rash' processes increase as involvement of more ventral striatal circuits becomes prominent. In contrast, as SUD and ICD progress, the proportion of the SUD- and ICD-related behaviours that are driven by compulsive 'habitual' processes increase as involvement of more dorsal striatal circuits become prominent. We are not arguing that, with time, ICD becomes OCD or vice versa. Instead, we are proposing that these disorders may acquire qualities of the other with time. In other words, while patients with ICD/SUD may develop 'compulsive impulsions', patients with OCD may exhibit 'impulsive compulsions'. There are many potential implications of our model. Theoretically, OCD patients exhibiting impulsive or addictive features could be managed with drugs that address the quality of the underlying drives and the involvement of neural systems. For example, agents for the reduction or prevention of relapse of addiction (e.g. heavy drinking), which modulate the cortico-mesolimbic dopamine system through the opioid (e.g. buprenorphine and naltrexone
Mohammadnejad, Afsaneh; Brasch-Andersen, Charlotte; Haagerup, Annette
Background: Allergic Rhinitis (AR) is a complex disorder that affects many people around the world. There is a high genetic contribution to the development of the AR, as twins and family studies have estimated heritability of more than 33%. Due to the complex nature of the disease, single SNP...... analysis has limited power in identifying the genetic variations for AR. We combined genome-wide association analysis (GWAS) with polygenic risk score (PRS) in exploring the genetic basis underlying the disease. Methods: We collected clinical data on 631 Danish subjects with AR cases consisting of 434...... sibling pairs and unrelated individuals and control subjects of 197 unrelated individuals. SNP genotyping was done by Affymetrix Genome-Wide Human SNP Array 5.0. SNP imputation was performed using "IMPUTE2". Using additive effect model, GWAS was conducted in discovery sample, the genotypes...
Background Machine learning techniques are becoming useful as an alternative approach to conventional medical diagnosis or prognosis as they are good for handling noisy and incomplete data, and significant results can be attained despite a small sample size. Traditionally, clinicians make prognostic decisions based on clinicopathologic markers. However, it is not easy for the most skilful clinician to come out with an accurate prognosis by using these markers alone. Thus, there is a need to use genomic markers to improve the accuracy of prognosis. The main aim of this research is to apply a hybrid of feature selection and machine learning methods in oral cancer prognosis based on the parameters of the correlation of clinicopathologic and genomic markers. Results In the first stage of this research, five feature selection methods have been proposed and experimented on the oral cancer prognosis dataset. In the second stage, the model with the features selected from each feature selection methods are tested on the proposed classifiers. Four types of classifiers are chosen; these are namely, ANFIS, artificial neural network, support vector machine and logistic regression. A k-fold cross-validation is implemented on all types of classifiers due to the small sample size. The hybrid model of ReliefF-GA-ANFIS with 3-input features of drink, invasion and p63 achieved the best accuracy (accuracy = 93.81%; AUC = 0.90) for the oral cancer prognosis. Conclusions The results revealed that the prognosis is superior with the presence of both clinicopathologic and genomic markers. The selected features can be investigated further to validate the potential of becoming as significant prognostic signature in the oral cancer studies. PMID:23725313
Ginsburg, Geoffrey S.; Kuderer, Nicole M.
Despite stunning advances in our understanding of the genetics and the molecular basis for cancer, many patients with cancer are not yet receiving therapy tailored specifically to their tumor biology. The translation of these advances into clinical practice has been hindered, in part, by the lack of evidence for biomarkers supporting the personalized medicine approach. Most stakeholders agree that the translation of biomarkers into clinical care requires evidence of clinical utility. The highest level of evidence comes from randomized controlled clinical trials (RCTs). However, in many instances, there may be no RCTs that are feasible for assessing the clinical utility of potentially valuable genomic biomarkers. In the absence of RCTs, evidence generation will require well-designed cohort studies for comparative effectiveness research (CER) that link detailed clinical information to tumor biology and genomic data. CER also uses systematic reviews, evidence-quality appraisal, and health outcomes research to provide a methodologic framework for assessing biologic patient subgroups. Rapid learning health care (RLHC) is a model in which diverse data are made available, ideally in a robust and real-time fashion, potentially facilitating CER and personalized medicine. Nonetheless, to realize the full potential of personalized care using RLHC requires advances in CER and biostatistics methodology and the development of interoperable informatics systems, which has been recognized by the National Cancer Institute's program for CER and personalized medicine. The integration of CER methodology and genomics linked to RLHC should enhance, expedite, and expand the evidence generation required for fully realizing personalized cancer care. PMID:23071236
Full Text Available Organisms are remarkably adapted to diverse environments by specialized metabolisms, morphology, or behaviors. To address the molecular mechanisms underlying environmental adaptation, we have utilized a Drosophila melanogaster line, termed "Dark-fly", which has been maintained in constant dark conditions for 57 years (1400 generations. We found that Dark-fly exhibited higher fecundity in dark than in light conditions, indicating that Dark-fly possesses some traits advantageous in darkness. Using next-generation sequencing technology, we determined the whole genome sequence of Dark-fly and identified approximately 220,000 single nucleotide polymorphisms (SNPs and 4,700 insertions or deletions (InDels in the Dark-fly genome compared to the genome of the Oregon-R-S strain, a control strain. 1.8% of SNPs were classified as non-synonymous SNPs (nsSNPs: i.e., they alter the amino acid sequence of gene products. Among them, we detected 28 nonsense mutations (i.e., they produce a stop codon in the protein sequence in the Dark-fly genome. These included genes encoding an olfactory receptor and a light receptor. We also searched runs of homozygosity (ROH regions as putative regions selected during the population history, and found 21 ROH regions in the Dark-fly genome. We identified 241 genes carrying nsSNPs or InDels in the ROH regions. These include a cluster of alpha-esterase genes that are involved in detoxification processes. Furthermore, analysis of structural variants in the Dark-fly genome showed the deletion of a gene related to fatty acid metabolism. Our results revealed unique features of the Dark-fly genome and provided a list of potential candidate genes involved in environmental adaptation.
Jamie A. O'Rourke
Full Text Available A small fast neutron mutant population has been established from Phaseolus vulgaris cv. Red Hawk. We leveraged the available P. vulgaris genome sequence and high throughput next generation DNA sequencing to examine the genomic structure of five Phaseolus vulgaris cv. Red Hawk fast neutron mutants with striking visual phenotypes. Analysis of these genomes identified three classes of structural variation; between cultivar variation, natural variation within the fast neutron mutant population, and fast neutron induced mutagenesis. Our analyses focused on the latter two classes. We identified 23 large deletions (>40 bp common to multiple individuals, illustrating residual heterogeneity and regions of structural variation within the common bean cv. Red Hawk. An additional 18 large deletions were identified in individual mutant plants. These deletions, ranging in size from 40 bp to 43,000 bp, are potentially the result of fast neutron mutagenesis. Six of the 18 deletions lie near or within gene coding regions, identifying potential candidate genes causing the mutant phenotype.
Chen, Zhijun; Wu, Chaozhong; Zhong, Ming; Lyu, Nengchao; Huang, Zhen
Drowsy/distracted driving has become one of the leading causes of traffic crash. Only certain particular drowsy/distracted driving behaviors have been studied by previous studies, which are mainly based on dedicated sensor devices such as bio and visual sensors. The objective of this study is to extract the common features for identifying drowsy/distracted driving through a set of common vehicle motion parameters. An intelligent vehicle was used to collect vehicle motion parameters. Fifty licensed drivers (37 males and 13 females, M=32.5 years, SD=6.2) were recruited to carry out road experiments in Wuhan, China and collecting vehicle motion data under four driving scenarios including talking, watching roadside, drinking and under the influence of drowsiness. For the first scenario, the drivers were exposed to a set of questions and asked to repeat a few sentences that had been proved valid in inducing driving distraction. Watching roadside, drinking and driving under drowsiness were assessed by an observer and self-reporting from the drivers. The common features of vehicle motions under four types of drowsy/distracted driving were analyzed using descriptive statistics and then Wilcoxon rank sum test. The results indicated that there was a significant difference of lateral acceleration rates and yaw rate acceleration between "normal driving" and drowsy/distracted driving. Study results also shown that, under drowsy/distracted driving, the lateral acceleration rates and yaw rate acceleration were significantly larger from the normal driving. The lateral acceleration rates were shown to suddenly increase or decrease by more than 2.0m/s(3) and the yaw rate acceleration by more than 2.5°/s(2). The standard deviation of acceleration rate (SDA) and standard deviation of yaw rate acceleration (SDY) were identified to as the common features of vehicle motion for distinguishing the drowsy/distracted driving from the normal driving. In order to identify a time window for
Shaligram, Shraddha; Kumbhare, Shreyas V; Dhotre, Dhiraj P; Muddeshwar, Manohar G; Kapley, Atya; Joseph, Neetha; Purohit, Hemant P; Shouche, Yogesh S; Pawar, Shrikant P
Genomic studies provide deeper insights into secondary metabolites produced by diverse bacterial communities, residing in various environmental niches. This study aims to understand the potential of a biosurfactant producing Bacillus sp. AM13, isolated from soil. An integrated approach of genomic and chemical analysis was employed to characterize the antibacterial lipopeptide produced by the strain AM13. Genome analysis revealed that strain AM13 harbors a nonribosomal peptide synthetase (NRPS) cluster; highly similar with known biosynthetic gene clusters from surfactin family: lichenysin (85 %) and surfactin (78 %). These findings were substantiated with supplementary experiments of oil displacement assay and surface tension measurements, confirming the biosurfactant production. Further investigation using LCMS approach exhibited similarity of the biomolecule with biosurfactants of the surfactin family. Our consolidated effort of functional genomics provided chemical as well as genetic leads for understanding the biochemical characteristics of the bioactive compound.
Tesfalem R Zere
Full Text Available The two-component signal transduction system BarA-UvrY of Escherichia coli and its orthologs globally regulate metabolism, motility, biofilm formation, stress resistance, virulence of pathogens and quorum sensing by activating the transcription of genes for regulatory sRNAs, e.g. CsrB and CsrC in E. coli. These sRNAs act by sequestering the RNA binding protein CsrA (RsmA away from lower affinity mRNA targets. In this study, we used ChIP-exo to identify, at single nucleotide resolution, genomic sites for UvrY (SirA binding in E. coli and Salmonella enterica. The csrB and csrC genes were the strongest targets of crosslinking, which required UvrY phosphorylation by the BarA sensor kinase. Crosslinking occurred at two sites, an inverted repeat sequence far upstream of the promoter and a site near the -35 sequence. DNAse I footprinting revealed specific binding of UvrY in vitro only to the upstream site, indicative of additional binding requirements and/or indirect binding to the downstream site. Additional genes, including cspA, encoding the cold-shock RNA-binding protein CspA, showed weaker crosslinking and modest or negligible regulation by UvrY. We conclude that the global effects of UvrY/SirA on gene expression are primarily mediated by activating csrB and csrC transcription. We also used in vivo crosslinking and other experimental approaches to reveal new features of csrB/csrC regulation by the DeaD and SrmB RNA helicases, IHF, ppGpp and DksA. Finally, the phylogenetic distribution of BarA-UvrY was analyzed and found to be uniquely characteristic of γ-Proteobacteria and strongly anti-correlated with fliW, which encodes a protein that binds to CsrA and antagonizes its activity in Bacillus subtilis. We propose that BarA-UvrY and orthologous TCS transcribe sRNA antagonists of CsrA throughout the γ-Proteobacteria, but rarely or never perform this function in other species.
Morgan, William F.
A number of nontargeted and delayed effects associated with radiation exposure have now been described. These include radiation-induced genomic instability, death-inducing and bystander effects, clastogenic factors and transgenerational effects. It is unlikely that these nontargeted effects are directly induced by cellular irradiation. Instead, it is proposed that some as yet to be identified secreted factor can be produced by irradiated cells that can stimulate effects in nonirradiated cells (death-inducing and bystander effects, clastogenic factors) and perpetuate genomic instability in the clonally expanded progeny of an irradiated cell. The proposed factor must be soluble and capable of being transported between cells by cell-to-cell gap junction communication channels. Furthermore, it must have the potential to stimulate cellular cytokines and/or reactive oxygen species. While it is difficult to imagine a role for such a secreted factor in contributing to transgenerational effects, the other nontargeted effects of radiation may all share a common mechanism.
De Petris, Giovanni; Dhungel, Bal M; Chen, Longwen; Chang, Yu-Hui H
Common variable immunodeficiency (CVID) is associated with an increased risk of gastric cancer. The aim of the study was to determine the morphological features of CVID-associated gastric adenocarcinoma (CAGA) and of the background gastritis. The population of gastric cancer patients with CVID of Mayo Clinic in the period 2000-2010 was studied; 6 cases of CVID (2 males, 4 females, average age 47 years, age range 26-71 years) were found in 5793 patients with gastric cancer in the study period. Each patient underwent gastric resection for which histology slides were reviewed. Chronic gastritis variables, CVID-related findings, and features of the adenocarcinoma were recorded. CAGA was of intestinal type, with high number of intratumoral lymphocytes (ITLs). Cancer was diagnosed in younger patients than in the overall population of gastric cancer. Severe atrophic metaplastic pangastritis with extensive dysplasia was present in the background in 4 cases, with features of lymphocytic gastritis in 2 cases. Features of CVID (plasma cells paucity in 4 of 6 cases, lymphoid nodules prominent in four cases) could be detected. In summary, gastric adenocarcinoma at young age with ITLs, accompanied by atrophic metaplastic pangastritis, should alert the pathologist of the possibility of CAGA. It follows that, in presence of those characteristics, the search of CVID-associated abnormalities should be undertaken in the nonneoplastic tissues. © The Author(s) 2014.
The complete mitochondrial genome of the enigmatic bigheadedturtle (Platysternon): description of unusual genomic features and thereconciliation of phylogenetic hypotheses based on mitochondrial andnuclear DNA
Parham, James F.; Feldman, Chris R.; Boore, Jeffrey L.
The big-headed turtle (Platysternon megacephalum) from east Asia is the sole living representative of a poorly-studied turtle lineage (Platysternidae). It has no close living relatives, and its phylogenetic position within turtles is one of the outstanding controversies in turtle systematics. Platysternon was traditionally considered to be close to snapping turtles (Chelydridae) based on some studies of its morphology and mitochondrial (mt) DNA, however, other studies of morphology and nuclear (nu) DNA do not support that hypothesis. We sequenced the complete mt genome of Platysternon and the nearly complete mt genomes of two other relevant turtles and compared them to turtle mt genomes from the literature to form the largest molecular dataset used to date to address this issue. The resulting phylogeny robustly rejects the placement of Platysternon with Chelydridae, but instead shows that it is a member of the Testudinoidea, a diverse, nearly globally-distributed group that includes pond turtles and tortoises. We also discovered that Platysternon mtDNA has large-scale gene rearrangements and possesses two, nearly identical, control regions, features that distinguish it from all other studied turtles. Our study robustly determines the phylogenetic placement of Platysternon and provides a well-resolved outline of major turtle lineages, while demonstrating the significantly greater resolving power of comparing large amounts of mt sequence over that of short fragments. Earlier phylogenies placing Platysternon with chelydrids required a temporal gap in the fossil record that is now unnecessary. The duplicated control regions and gene rearrangements of the Platysternon mt DNA probably resulted from the duplication of part of the genome and then the subsequent loss of redundant genes. Although it is possible that having two control regions may provide some advantage, explaining why the control regions would be maintained while some of the duplicated genes were eroded
Full Text Available The complex process of allopolyploid speciation includes various mechanisms ranging from species crosses and hybrid genome doubling to genome alterations and the establishment of new allopolyploids as persisting natural entities. Currently, little is known about the genetic mechanisms that underlie hybrid genome doubling, despite the fact that natural allopolyploid formation is highly dependent on this phenomenon. We examined the genetic basis for the spontaneous genome doubling of triploid F1 hybrids between the direct ancestors of allohexaploid common wheat (Triticum aestivum L., AABBDD genome, namely Triticumturgidum L. (AABB genome and Aegilopstauschii Coss. (DD genome. An Ae. tauschii intraspecific lineage that is closely related to the D genome of common wheat was identified by population-based analysis. Two representative accessions, one that produces a high-genome-doubling-frequency hybrid when crossed with a T. turgidum cultivar and the other that produces a low-genome-doubling-frequency hybrid with the same cultivar, were chosen from that lineage for further analyses. A series of investigations including fertility analysis, immunostaining, and quantitative trait locus (QTL analysis showed that (1 production of functional unreduced gametes through nonreductional meiosis is an early step key to successful hybrid genome doubling, (2 first division restitution is one of the cytological mechanisms that cause meiotic nonreduction during the production of functional male unreduced gametes, and (3 six QTLs in the Ae. tauschii genome, most of which likely regulate nonreductional meiosis and its subsequent gamete production processes, are involved in hybrid genome doubling. Interlineage comparisons of Ae. tauschii's ability to cause hybrid genome doubling suggested an evolutionary model for the natural variation pattern of the trait in which non-deleterious mutations in six QTLs may have important roles. The findings of this study demonstrated
Full Text Available Abstract Background The QTLMAS XVth dataset consisted of pedigree, marker genotypes and quantitative trait performances of animals with a sib family structure. Pedigree and genotypes concerned 3,000 progenies among those 2,000 were phenotyped. The trait was regulated by 8 QTLs which displayed additive, imprinting or epistatic effects. The 1,000 unphenotyped progenies were considered as candidates to selection and their Genomic Estimated Breeding Values (GEBV were evaluated by participants of the XVth QTLMAS workshop. This paper aims at comparing the GEBV estimation results obtained by seven participants to the workshop. Methods From the known QTL genotypes of each candidate, two "true" genomic values (TV were estimated by organizers: the genotypic value of the candidate (TGV and the expectation of its progeny genotypic values (TBV. GEBV were computed by the participants following different statistical methods: random linear models (including BLUP and Ridge Regression, selection variable techniques (LASSO, Elastic Net and Bayesian methods. Accuracy was evaluated by the correlation between TV (TGV or TBV and GEBV presented by participants. Rank correlation of the best 10% of individuals and error in predictions were also evaluated. Bias was tested by regression of TV on GEBV. Results Large differences between methods were found for all criteria and type of genetic values (TGV, TBV. In general, the criteria ranked consistently methods belonging to the same family. Conclusions Bayesian methods - A
John A Capra
Full Text Available G-quadruplex DNA is a four-stranded DNA structure formed by non-Watson-Crick base pairing between stacked sets of four guanines. Many possible functions have been proposed for this structure, but its in vivo role in the cell is still largely unresolved. We carried out a genome-wide survey of the evolutionary conservation of regions with the potential to form G-quadruplex DNA structures (G4 DNA motifs across seven yeast species. We found that G4 DNA motifs were significantly more conserved than expected by chance, and the nucleotide-level conservation patterns suggested that the motif conservation was the result of the formation of G4 DNA structures. We characterized the association of conserved and non-conserved G4 DNA motifs in Saccharomyces cerevisiae with more than 40 known genome features and gene classes. Our comprehensive, integrated evolutionary and functional analysis confirmed the previously observed associations of G4 DNA motifs with promoter regions and the rDNA, and it identified several previously unrecognized associations of G4 DNA motifs with genomic features, such as mitotic and meiotic double-strand break sites (DSBs. Conserved G4 DNA motifs maintained strong associations with promoters and the rDNA, but not with DSBs. We also performed the first analysis of G4 DNA motifs in the mitochondria, and surprisingly found a tenfold higher concentration of the motifs in the AT-rich yeast mitochondrial DNA than in nuclear DNA. The evolutionary conservation of the G4 DNA motif and its association with specific genome features supports the hypothesis that G4 DNA has in vivo functions that are under evolutionary constraint.
Full Text Available Uterine leiomyosarcoma (LMS is the worst malignancy among the gynecologic cancers. Uterine leiomyoma (LM, a benign tumor of myometrial origin, is the most common among women of childbearing age. Because of their similar symptoms, it is difficult to preoperatively distinguish the two conditions only by ultrasound and pelvic MRI. While histopathological diagnosis is currently the main approach used to distinguish them postoperatively, unusual histologic variants of LM tend to be misdiagnosed as LMS. Therefore, development of molecular diagnosis as an alternative or confirmatory means will help to diagnose LMS more accurately. We adopted omics-based technologies to identify genome-wide features to distinguish LMS from LM and revealed that copy number, gene expression, and DNA methylation profiles successfully distinguished these tumors. LMS was found to possess features typically observed in malignant solid tumors, such as extensive chromosomal abnormalities, overexpression of cell cycle-related genes, hypomethylation spreading through large genomic regions, and frequent hypermethylation at the polycomb group target genes and protocadherin genes. We also identified candidate expression and DNA methylation markers, which will facilitate establishing postoperative molecular diagnostic tests based on conventional quantitative assays. Our results demonstrate the feasibility of establishing such tests and the possibility of developing preoperative and noninvasive methods.
Full Text Available Abstract Duplications and rearrangements of coding genes are major themes in the evolution of mitochondrial genomes, bearing important consequences in the function of mitochondria and the fitness of organisms. Yu et al. (BMC Genomics 2008, 9:477 reported the complete mt genome sequence of the oyster Crassostrea hongkongensis (16,475 bp and found that a DNA segment containing four tRNA genes (trnK1, trnC, trnQ1 and trnN, a duplicated (rrnS and a split rRNA gene (rrnL5' was absent compared with that of two other Crassostrea species. It was suggested that the absence was a novel case of "tandem duplication-random loss" with evolutionary significance. We independently sequenced the complete mt genome of three C. hongkongensis individuals, all of which were 18,622 bp and contained the segment that was missing in Yu et al.'s sequence. Further, we designed primers, verified sequences and demonstrated that the sequence loss in Yu et al.'s study was an artifact caused by placing primers in a duplicated region. The duplication and split of ribosomal RNA genes are unique for Crassostrea oysters and not lost in C. hongkongensis. Our study highlights the need for caution when amplifying and sequencing through duplicated regions of the genome.
Lautenbach, Denise M; Christensen, Kurt D; Sparks, Jeffrey A; Green, Robert C
Communicating genetic risk information in ways that maximize understanding and promote health is increasingly important given the rapidly expanding availability and capabilities of genomic technologies. A well-developed literature on risk communication in general provides guidance for best practices, including presentation of information in multiple formats, attention to framing effects, use of graphics, sensitivity to the way numbers are presented, parsimony of information, attentiveness to emotions, and interactivity as part of the communication process. Challenges to communicating genetic risk information include deciding how best to tailor it, streamlining the process, deciding what information to disclose, accepting that communications may have limited influence, and understanding the impact of context. Meeting these challenges has great potential for empowering individuals to adopt healthier lifestyles and improve public health, but will require multidisciplinary approaches and collaboration.
Hughes, Joseph; Saucedo, Bernardo; Rijks, Jolianne; Kik, Marja; Haenen, Olga L. M.; Engelsma, Marc Y.; Gröne, Andrea; Verheije, M. Helene; Wilkie, Gavin
A ranavirus associated with mass mortalities in wild water frogs (Pelophylax spp.) and other amphibians in the Netherlands since 2010 was isolated, and its complete genome sequence was determined. The virus has a genome of 107,772 bp and shows 96.5% sequence identity with the common midwife toad virus from Spain. PMID:25540340
Almstrup, Kristian; Hoei-Hansen, Christina E; Wirkner, Ute
in their stoichiometry on progression into embryonic carcinoma. We compared the CIS expression profile with patterns reported in embryonic stem cells (ESCs), which revealed a substantial overlap that may be as high as 50%. We also demonstrated an over-representation of expressed genes in regions of 17q and 12, reported......Carcinoma in situ (CIS) is the common precursor of histologically heterogeneous testicular germ cell tumors (TGCTs), which in recent decades have markedly increased and now are the most common malignancy of young men. Using genome-wide gene expression profiling, we identified >200 genes highly...
Rochi, Lucia; Diéguez, María José; Burguener, Germán; Darino, Martín Alejandro; Pergolesi, María Fernanda; Ingala, Lorena Romina; Cuyeu, Alba Romina; Turjanski, Adrián; Kreff, Enrique Domingo; Sacco, Francisco
Rust fungi are one of the most devastating pathogens of crop plants. The biotrophic fungus Puccinia sorghi Schwein (Ps) is responsible for maize common rust, an endemic disease of maize (Zea mays L.) in Argentina that causes significant yield losses in corn production. In spite of this, the Ps genomic sequence was not available. We used Illumina sequencing to rapidly produce the 99.6Mbdraft genome sequence of Ps race RO10H11247, derived from a single-uredinial isolate from infected maize leaves collected in the Argentine Corn Belt Region during 2010. High quality reads were obtained from 200bppaired-end and 5000bpmate-paired libraries and assembled in 15,722 scaffolds. A pipeline which combined an ab initio program with homology-based models and homology to in planta enriched ESTs from four cereal pathogenic fungus (the three sequenced wheat rusts and Ustilago maydis) was used to identify 21,087 putative coding sequences, of which 1599 might be part of the Ps RO10H11247 secretome. Among the 458 highly conserved protein families from the euKaryotic Orthologous Groups (KOG) that occur in a wide range of eukaryotic organisms, 97.5% have at least one member with high homology in the Ps assembly (TBlastN, E-value⩽e-10) covering more than 50% of the length of the KOG protein. Comparative studies with the three sequenced wheat rust fungus, and microsynteny analysis involving Puccinia striiformis f. sp. tritici (Pst, wheat stripe rust fungus), support the quality achieved. The results presented here show the effectiveness of the Illumina strategy for sequencing dikaryotic genomes of non-model organisms and provides reliable DNA sequence information for genomic studies, including pathogenic mechanisms of this maize fungus and molecular marker design. Copyright © 2016 Elsevier Inc. All rights reserved.
Roca, Xavier; Olson, Andrew J; Rao, Atmakuri R
Many human diseases, including Fanconi anemia, hemophilia B, neurofibromatosis, and phenylketonuria, can be caused by 5'-splice-site (5'ss) mutations that are not predicted to disrupt splicing, according to position weight matrices. By using comparative genomics, we identify pairwise dependencies...
Tian, Ai-Guo; Wang, Jun; Cui, Peng
to be fast-evolving. Soybean unigenes with no match to genes within the Arabidopsis genome were identified as soybean-specific genes. These genes were mainly involved in nodule development and the synthesis of seed storage proteins. In addition, we also identified 61 genes regulated by salicylic acid, 1...
Lai, Yiling; Liu, Keke; Zhang, Xinyu; Zhang, Xiaoling; Li, Kuan; Wang, Niuniu; Shu, Chi; Wu, Yunpeng; Wang, Chengshu; Bushley, Kathryn E.; Xiang, Meichun; Liu, Xingzhong
Hirsutella minnesotensis [Ophiocordycipitaceae (Hypocreales, Ascomycota)] is a dominant endoparasitic fungus by using conidia that adhere to and penetrate the secondary stage juveniles of soybean cyst nematode. Its genome was de novo sequenced and compared with five entomopathogenic fungi in the Hypocreales and three nematode-trapping fungi in the Orbiliales (Ascomycota). The genome of H. minnesotensis is 51.4 Mb and encodes 12,702 genes enriched with transposable elements up to 32%. Phylogenomic analysis revealed that H. minnesotensis was diverged from entomopathogenic fungi in Hypocreales. Genome of H. minnesotensis is similar to those of entomopathogenic fungi to have fewer genes encoding lectins for adhesion and glycoside hydrolases for cellulose degradation, but is different from those of nematode-trapping fungi to possess more genes for protein degradation, signal transduction, and secondary metabolism. Those results indicate that H. minnesotensis has evolved different mechanism for nematode endoparasitism compared with nematode-trapping fungi. Transcriptomics analyses for the time-scale parasitism revealed the upregulations of lectins, secreted proteases and the genes for biosynthesis of secondary metabolites that could be putatively involved in host surface adhesion, cuticle degradation, and host manipulation. Genome and transcriptome analyses provided comprehensive understanding of the evolution and lifestyle of nematode endoparasitism. PMID:25359922
Manaresi, Elisabetta; Conti, Ilaria; Bua, Gloria; Bonvicini, Francesca; Gallinella, Giorgio
Central to genetic studies for Parvovirus B19 (B19V) is the availability of genomic clones that may possess functional competence and ability to generate infectious virus. In our study, we established a new model genetic system for Parvovirus B19. A synthetic approach was followed, by design of a reference genome sequence, by generation of a corresponding artificial construct and its molecular cloning in a complete and functional form, and by setup of an efficient strategy to generate infectious virus, via transfection in UT7/EpoS1 cells and amplification in erythroid progenitor cells. The synthetic genome was able to generate virus with biological properties paralleling those of native virus, its infectious activity being dependent on the preservation of self-complementarity and sequence heterogeneity within the terminal regions. A virus of defined genome sequence, obtained from controlled cell culture conditions, can constitute a reference tool for investigation of the structural and functional characteristics of the virus. Copyright © 2017 Elsevier Inc. All rights reserved.
Jennifer L Bolton
Full Text Available Variation in plasma levels of cortisol, an essential hormone in the stress response, is associated in population-based studies with cardio-metabolic, inflammatory and neuro-cognitive traits and diseases. Heritability of plasma cortisol is estimated at 30-60% but no common genetic contribution has been identified. The CORtisol NETwork (CORNET consortium undertook genome wide association meta-analysis for plasma cortisol in 12,597 Caucasian participants, replicated in 2,795 participants. The results indicate that <1% of variance in plasma cortisol is accounted for by genetic variation in a single region of chromosome 14. This locus spans SERPINA6, encoding corticosteroid binding globulin (CBG, the major cortisol-binding protein in plasma, and SERPINA1, encoding α1-antitrypsin (which inhibits cleavage of the reactive centre loop that releases cortisol from CBG. Three partially independent signals were identified within the region, represented by common SNPs; detailed biochemical investigation in a nested sub-cohort showed all these SNPs were associated with variation in total cortisol binding activity in plasma, but some variants influenced total CBG concentrations while the top hit (rs12589136 influenced the immunoreactivity of the reactive centre loop of CBG. Exome chip and 1000 Genomes imputation analysis of this locus in the CROATIA-Korcula cohort identified missense mutations in SERPINA6 and SERPINA1 that did not account for the effects of common variants. These findings reveal a novel common genetic source of variation in binding of cortisol by CBG, and reinforce the key role of CBG in determining plasma cortisol levels. In turn this genetic variation may contribute to cortisol-associated degenerative diseases.
Full Text Available Abstract Background Chromosome location is often used as a scaffold to organize genomic information in both the living cell and molecular biological research. Thus, ever-increasing amounts of data about genomic features are stored in public databases and can be readily visualized by genome browsers. To perform in silico experimentation conveniently with this genomics data, biologists need tools to process and compare datasets routinely and explore the obtained results interactively. The complexity of such experimentation requires these tools to be based on an e-Science approach, hence generic, modular, and reusable. A virtual laboratory environment with workflows, workflow management systems, and Grid computation are therefore essential. Findings Here we apply an e-Science approach to develop SigWin-detector, a workflow-based tool that can detect significantly enriched windows of (genomic features in a (DNA sequence in a fast and reproducible way. For proof-of-principle, we utilize a biological use case to detect regions of increased and decreased gene expression (RIDGEs and anti-RIDGEs in human transcriptome maps. We improved the original method for RIDGE detection by replacing the costly step of estimation by random sampling with a faster analytical formula for computing the distribution of the null hypothesis being tested and by developing a new algorithm for computing moving medians. SigWin-detector was developed using the WS-VLAM workflow management system and consists of several reusable modules that are linked together in a basic workflow. The configuration of this basic workflow can be adapted to satisfy the requirements of the specific in silico experiment. Conclusion As we show with the results from analyses in the biological use case on RIDGEs, SigWin-detector is an efficient and reusable Grid-based tool for discovering windows enriched for features of a particular type in any sequence of values. Thus, SigWin-detector provides the
Liu, Chunyu; Fetterman, Jessica L; Liu, Poching; Luo, Yan; Larson, Martin G; Vasan, Ramachandran S; Zhu, Jun; Levy, Daniel
Increasing evidence implicates mitochondrial dysfunction in aging and age-related conditions. But little is known about the molecular basis for this connection. A possible cause may be mutations in the mitochondrial DNA (mtDNA), which are often heteroplasmic-the joint presence of different alleles at a single locus in the same individual. However, the involvement of mtDNA heteroplasmy in aging and age-related conditions has not been investigated thoroughly. We deep-sequenced the complete mtDNA genomes of 356 Framingham Heart Study participants (52% women, mean age 43, mean coverage 4570-fold), identified 2880 unique mutations and comprehensively annotated them by MITOMAP and PolyPhen-2. We discovered 11 heteroplasmic "hot" spots [NADH dehydrogenase (ND) subunit 1, 4, 5 and 6 genes, n = 7; cytochrome c oxidase I (COI), n = 2; 16S rRNA, n = 1; D-loop, n = 1] for which the alternative-to-reference allele ratios significantly increased with advancing age (Bonferroni correction p < 0.001). Four of these heteroplasmic mutations in ND and COI genes were predicted to be deleterious nonsynonymous mutations which may have direct impact on ATP production. We confirmed previous findings that healthy individuals carry many low-frequency heteroplasmy mutations with potentially deleterious effects. We hypothesize that the effect of a single deleterious heteroplasmy may be minimal due to a low mutant-to-wildtype allele ratio, whereas the aggregate effects of many deleterious mutations may cause changes in mitochondrial function and contribute to age-related diseases. The identification of age-related mtDNA mutations is an important step to understand the genetic architecture of age-related diseases and may uncover novel therapeutic targets for such diseases.
Dey, Siddharth S; Foley, Jonathan E; Limsirichai, Prajit; Schaffer, David V; Arkin, Adam P
While gene expression noise has been shown to drive dramatic phenotypic variations, the molecular basis for this variability in mammalian systems is not well understood. Gene expression has been shown to be regulated by promoter architecture and the associated chromatin environment. However, the exact contribution of these two factors in regulating expression noise has not been explored. Using a dual-reporter lentiviral model system, we deconvolved the influence of the promoter sequence to systematically study the contribution of the chromatin environment at different genomic locations in regulating expression noise. By integrating a large-scale analysis to quantify mRNA levels by smFISH and protein levels by flow cytometry in single cells, we found that mean expression and noise are uncorrelated across genomic locations. Furthermore, we showed that this independence could be explained by the orthogonal control of mean expression by the transcript burst size and noise by the burst frequency. Finally, we showed that genomic locations displaying higher expression noise are associated with more repressed chromatin, thereby indicating the contribution of the chromatin environment in regulating expression noise. © 2015 The Authors. Published under the terms of the CC BY 4.0 license.
Zagradišnik, Boris; Krgović, Danijela; Herodež, Špela Stangler; Zagorac, Andreja; Ćižmarević, Bogdan; Vokač, Nadja Kokalj
Copy number variations (CNSs) of large genomic regions are an important mechanism implicated in the development of head and neck cancer, however, for most changes their exact role is not well understood. The aim of this study was to find possible associations between gains/losses of genomic regions and clinically distinct subgroups of head and neck cancer patients. Array comparative genomic hybridization (aCGH) analysis was performed on DNA samples in 64 patients with cancer in oral cavity, oropharynx or hypopharynx. Overlapping genomic regions created from gains and losses were used for statistical analysis. Following regions were overrepresented: in tumors with stage I or II a gain of 2.98 Mb on 6p21.2-p11 and a gain of 7.4 Mb on 8q11.1-q11.23; in tumors with grade I histology a gain of 1.1 Mb on 8q24.13, a loss of a large part of p arm of chromosome 3, a loss of a 1.24 Mb on 6q14.3, and a loss of terminal 32 Mb region of 8p23.3; in cases with affected lymph nodes a gain of 0.75 Mb on 3q24, and a gain of 0.9 Mb on 3q26.32-q26.33; in cases with unaffected lymph nodes a gain of 1.1 Mb on 8q23.3, in patients not treated with surgery a gain of 12.2 Mb on 7q21.3-q22.3 and a gain of 0.33 Mb on 20q11.22. Our study identified several genomic regions of interest which appear to be associated with various clinically distinct subgroups of head and neck cancer. They represent a potentially important source of biomarkers useful for the clinical management of head and neck cancer. In particular, the PIK3CA and AGTR1 genes could be singled out to predict the lymph node involvement.
Full Text Available Abstract Background Cowpea (Vigna unguiculata L. Walp is an important food and fodder legume of the semiarid tropics and subtropics worldwide, especially in sub-Saharan Africa. High density genetic linkage maps are needed for marker assisted breeding but are not available for cowpea. A single feature polymorphism (SFP is a microarray-based marker which can be used for high throughput genotyping and high density mapping. Results Here we report detection and validation of SFPs in cowpea using a readily available soybean (Glycine max genome array. Robustified projection pursuit (RPP was used for statistical analysis using RNA as a surrogate for DNA. Using a 15% outlying score cut-off, 1058 potential SFPs were enumerated between two parents of a recombinant inbred line (RIL population segregating for several important traits including drought tolerance, Fusarium and brown blotch resistance, grain size and photoperiod sensitivity. Sequencing of 25 putative polymorphism-containing amplicons yielded a SFP probe set validation rate of 68%. Conclusion We conclude that the Affymetrix soybean genome array is a satisfactory platform for identification of some 1000's of SFPs for cowpea. This study provides an example of extension of genomic resources from a well supported species to an orphan crop. Presumably, other legume systems are similarly tractable to SFP marker development using existing legume array resources.
Soo Chan Lee
Full Text Available Microsporidia are obligate intracellular, eukaryotic pathogens that infect a wide range of animals from nematodes to humans, and in some cases, protists. The preponderance of evidence as to the origin of the microsporidia reveals a close relationship with the fungi, either within the kingdom or as a sister group to it. Recent phylogenetic studies and gene order analysis suggest that microsporidia share a particularly close evolutionary relationship with the zygomycetes.Here we expanded this analysis and also examined a putative sex-locus for variability between microsporidian populations. Whole genome inspection reveals a unique syntenic gene pair (RPS9-RPL21 present in the vast majority of fungi and the microsporidians but not in other eukaryotic lineages. Two other unique gene fusions (glutamyl-prolyl tRNA synthetase and ubiquitin-ribosomal subunit S30 that are present in metazoans, choanoflagellates, and filasterean opisthokonts are unfused in the fungi and microsporidians. One locus previously found to be conserved in many microsporidian genomes is similar to the sex locus of zygomycetes in gene order and architecture. Both sex-related and sex loci harbor TPT, HMG, and RNA helicase genes forming a syntenic gene cluster. We sequenced and analyzed the sex-related locus in 11 different Encephalitozoon cuniculi isolates and the sibling species E. intestinalis (3 isolates and E. hellem (1 isolate. There was no evidence for an idiomorphic sex-related locus in this Encephalitozoon species sample. According to sequence-based phylogenetic analyses, the TPT and RNA helicase genes flanking the HMG genes are paralogous rather than orthologous between zygomycetes and microsporidians.The unique genomic hallmarks between microsporidia and fungi are independent of sequence based phylogenetic comparisons and further contribute to define the borders of the fungal kingdom and support the classification of microsporidia as unusual derived fungi. And the sex
Full Text Available The Humboldt Sulfuretum (HS, in the productive Humboldt Eastern Boundary Current Upwelling Ecosystem, extends under the hypoxic waters of the Peru-Chile Undercurrent (ca. 6°S and ca. 36°S. Studies show that primeval sulfuretums held diverse prokaryotic life, and, while rare today, still sustain species-rich giant sulfur-oxidizing bacterial communities. We here present the genomic features of a new bacteria of the HS, "Candidatus Venteria ishoeyi" ("Ca. V. ishoeyi" in the family Thiotrichaceae.Three identical filaments were micro-manipulated from reduced sediments collected off central Chile; their DNA was extracted, amplified, and sequenced by a Roche 454 GS FLX platform. Using three sequenced libraries and through de novo genome assembly, a draft genome of 5.7 Mbp, 495 scaffolds, and a N50 of 70 kbp, was obtained. The 16S rRNA gene phylogenetic analysis showed that "Ca. V. ishoeyi" is related to non-vacuolate forms presently known as Beggiatoa or Beggiatoa-like forms. The complete set of genes involved in respiratory nitrate-reduction to dinitrogen was identified in "Ca. V. ishoeyi"; including genes likely leading to ammonification. As expected, the sulfur-oxidation pathway reported for other sulfur-oxidizing bacteria were deduced and also, key inorganic and organic carbon acquisition related genes were identified. Unexpectedly, the genome of "Ca. V. ishoeyi" contained numerous CRISPR repeats and an I-F CRISPR-Cas type system gene coding array. Findings further show that, as a member of an eons-old marine ecosystem, "Ca. V. ishoeyi" contains the needed metabolic plasticity for life in an increasingly oxygenated and variable ocean.
Wu, Jing; Chen, Jibao; Wang, Lanfen; Wang, Shumin
WRKY transcription factor plays a key role in drought stress. However, the characteristics of the WRKY gene family in the common bean ( Phaseolus vulgaris L.) are unknown. In this study, we identified 88 complete WRKY proteins from the draft genome sequence of the "G19833" common bean. The predicted genes were non-randomly distributed in all chromosomes. Basic information, amino acid motifs, phylogenetic tree and the expression patterns of PvWRKY genes were analyzed, and the proteins were classified into groups 1, 2, and 3. Group 2 was further divided into five subgroups: 2a, 2b, 2c, 2d, and 2e. Finally, we detected 19 WRKY genes that were responsive to drought stress using qRT-PCR; 11 were down-regulated, and 8 were up-regulated under drought stress. This study comprehensively examines WRKY proteins in the common bean, a model food legume, and it provides a foundation for the functional characterization of the WRKY family and opportunities for understanding the mechanisms of drought stress tolerance in this plant.
Full Text Available Genetic analyses have recently been carried out on present-day Tuscans (Central Italy in order to investigate their presumable recent Near East ancestry in connection with the long-standing debate on the origins of the Etruscan civilization. We retrieved mitogenomes and genome-wide SNP data from 110 Tuscans analyzed within the context of The 1000 Genome Project. For phylogeographic and evolutionary analysis we made use of a large worldwide database of entire mitogenomes (>26,000 and partial control region sequences (>180,000.Different analyses reveal the presence of typical Near East haplotypes in Tuscans representing isolated members of various mtDNA phylogenetic branches. As a whole, the Near East component in Tuscan mitogenomes can be estimated at about 8%; a proportion that is comparable to previous estimates but significantly lower than admixture estimates obtained from autosomal SNP data (21%. Phylogeographic and evolutionary inter-population comparisons indicate that the main signal of Near Eastern Tuscan mitogenomes comes from Iran.Mitogenomes of recent Near East origin in present-day Tuscans do not show local or regional variation. This points to a demographic scenario that is compatible with a recent arrival of Near Easterners to this region in Italy with no founder events or bottlenecks.
Background Through the wealth of information contained within them, genome-wide association studies (GWAS) have the potential to provide researchers with a systematic means of associating genetic variants with a wide variety of disease phenotypes. Due to the limitations of approaches that have analyzed single variants one at a time, it has been proposed that the genetic basis of these disorders could be determined through detailed analysis of the genetic variants themselves and in conjunction with one another. The construction of models that account for these subsets of variants requires methodologies that generate predictions based on the total risk of a particular group of polymorphisms. However, due to the excessive number of variants, constructing these types of models has so far been computationally infeasible. Results We have implemented an algorithm, known as greedy RLS, that we use to perform the first known wrapper-based feature selection on the genome-wide level. The running time of greedy RLS grows linearly in the number of training examples, the number of features in the original data set, and the number of selected features. This speed is achieved through computational short-cuts based on matrix calculus. Since the memory consumption in present-day computers can form an even tighter bottleneck than running time, we also developed a space efficient variation of greedy RLS which trades running time for memory. These approaches are then compared to traditional wrapper-based feature selection implementations based on support vector machines (SVM) to reveal the relative speed-up and to assess the feasibility of the new algorithm. As a proof of concept, we apply greedy RLS to the Hypertension – UK National Blood Service WTCCC dataset and select the most predictive variants using 3-fold external cross-validation in less than 26 minutes on a high-end desktop. On this dataset, we also show that greedy RLS has a better classification performance on independent
Yin, Chuanlin; Li, Meizhen; Hu, Jian; Lang, Kun; Chen, Qiming; Liu, Jinding; Guo, Dianhao; He, Kang; Dong, Yipei; Luo, Jiapeng; Song, Zhenkun; Walters, James R; Zhang, Wenqing; Li, Fei; Chen, Xuexin
Parasitoid wasps are well-known natural enemies of major agricultural pests and arthropod borne diseases. The parasitoid wasp Macrocentrus cingulum (Hymenoptera: Braconidae) has been widely used to control the notorious insect pests Ostrinia furnacalis (Asian Corn Borer) and O. nubilalis (European corn borer). One striking phenomenon exhibited by M. cingulum is polyembryony, the formation of multiple genetically identical offspring from a single zygote. Moreover, M. cingulum employs a passive parasitic strategy by preventing the host's immune system from recognizing the embryo as a foreign body. Thus, the embryos evade the host's immune system and are not encapsulated by host hemocytes. Unfortunately, the mechanism of both polyembryony and immune evasion remains largely unknown. We report the genome of the parasitoid wasp M. cingulum. Comparative genomics analysis of M. cingulum and other 11 insects were conducted, finding some gene families with apparent expansion or contraction which might be linked to the parasitic behaviors or polyembryony of M. cingulum. Moreover, we present the evidence that the microRNA miR-14b regulates the polyembryonic development of M. cingulum by targeting the c-Myc Promoter-binding Protein 1 (MBP-1), histone-lysine N-methyltransferase 2E (KMT2E) and segmentation protein Runt. In addition, Hemomucin, an O-glycosylated transmembrane protein, protects the endoparasitoid wasp larvae from being encapsulated by host hemocytes. Motif and domain analysis showed that only the hemomucin in two endoparasitoids, M. cingulum and Venturia canescens, possessing the ability of passive immune evasion has intact mucin domain and similar O-glycosylation patterns, indicating that the hemomucin is a key factor modulating the immune evasion. The microRNA miR-14b participates in the regulation of polyembryonic development, and the O-glycosylation of the mucin domain in the hemomucin confers the passive immune evasion in this wasp. These key findings provide
Craig, Francesco; Lamanna, Anna Linda; Margari, Francesco; Matera, Emilia; Simone, Marta; Margari, Lucia
Recent studies support several overlapping traits between autism spectrum disorders (ASD) and attention-deficit/hyperactivity disorder (ADHD), assuming the existence of a combined phenotype. The aim of our study was to evaluate the common or distinctive clinical features between ASD and ADHD in order to identify possible different phenotypes that could have a clinical value. We enrolled 181 subjects divided into four diagnostic groups: ADHD group, ASD group, ASD+ADHD group (that met diagnostic criteria for both ASD and ADHD), and control group. Intelligent quotient (IQ), emotional and behavior problems, ADHD symptoms, ASD symptoms, and adaptive behaviors were investigated through the following test: Wechsler Intelligence Scale for Children, Wechsler Preschool and Primary Scale of Intelligence or Leiter International Performances Scale Revised, Child Behavior Checklist, Conners' Rating Scales-Revised, SNAP-IV Rating Scale, the Social Communication Questionnaire, Vineland Adaptive Behavior Scales. The ASD+ADHD group differs from ADHD or ASD in some domains such as lower IQ mean level and a higher autistic symptoms severity. However, the ASD+ADHD group shares inattention and hyperactivity deficit and some emotional and behavior problems with the ADHD group, while it shares adaptive behavior impairment with ASD group. These findings provide a new understanding of clinical manifestation of ASD+ADHD phenotype, they may also inform a novel treatment target. © 2015 The Authors Autism Research published by Wiley Periodicals, Inc. on behalf of International Society for Autism Research.
Full Text Available The magnetic field sensors enabling birds to extract orientational information from the Earth's magnetic field have remained enigmatic. Our previously published results from homing pigeons have made us suggest that the iron containing sensory dendrites in the inner dermal lining of the upper beak are a candidate structure for such an avian magnetometer system. Here we show that similar structures occur in two species of migratory birds (garden warbler, Sylvia borin and European robin, Erithacus rubecula and a non-migratory bird, the domestic chicken (Gallus gallus. In all these bird species, histological data have revealed dendrites of similar shape and size, all containing iron minerals within distinct subcellular compartments of nervous terminals of the median branch of the Nervus ophthalmicus. We also used microscopic X-ray absorption spectroscopy analyses to identify the involved iron minerals to be almost completely Fe III-oxides. Magnetite (Fe II/III may also occur in these structures, but not as a major Fe constituent. Our data suggest that this complex dendritic system in the beak is a common feature of birds, and that it may form an essential sensory basis for the evolution of at least certain types of magnetic field guided behavior.
Falkenberg, Gerald; Fleissner, Gerta; Schuchardt, Kirsten; Kuehbacher, Markus; Thalau, Peter; Mouritsen, Henrik; Heyers, Dominik; Wellenreuther, Gerd; Fleissner, Guenther
The magnetic field sensors enabling birds to extract orientational information from the Earth's magnetic field have remained enigmatic. Our previously published results from homing pigeons have made us suggest that the iron containing sensory dendrites in the inner dermal lining of the upper beak are a candidate structure for such an avian magnetometer system. Here we show that similar structures occur in two species of migratory birds (garden warbler, Sylvia borin and European robin, Erithacus rubecula) and a non-migratory bird, the domestic chicken (Gallus gallus). In all these bird species, histological data have revealed dendrites of similar shape and size, all containing iron minerals within distinct subcellular compartments of nervous terminals of the median branch of the Nervus ophthalmicus. We also used microscopic X-ray absorption spectroscopy analyses to identify the involved iron minerals to be almost completely Fe III-oxides. Magnetite (Fe II/III) may also occur in these structures, but not as a major Fe constituent. Our data suggest that this complex dendritic system in the beak is a common feature of birds, and that it may form an essential sensory basis for the evolution of at least certain types of magnetic field guided behavior.
Brown, T. A. (Terence A.)
... of genome expression and replication processes, and transcriptomics and proteomics. This text is richly illustrated with clear, easy-to-follow, full color diagrams, which are downloadable from the book's website...
Chen, Zhi-Teng; Du, Yu-Zhou
The complete mitochondrial genomes (mitogenomes) of Taeniopteryx ugola and Doddsia occidentalis (Plecoptera: Taeniopterygidae) were firstly sequenced from the family Taeniopterygidae. The 15,353-bp long mitogenome of T. ugola and the 16,020-bp long mitogenome of D. occidentalis each contained 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), two ribosomal RNA genes (rRNAs) and a control region (CR). The mitochondrial gene arrangement of the two taeniopterygids and other stoneflies was identical with the putative ancestral mitogenome of Drosophila yakuba. Most PCGs used standard ATN start codons and TAN termination codons. Twenty-one of the 22 tRNAs in each mitogenome could fold into the cloverleaf secondary structures, while the dihydrouridine (DHU) arm of trnSer (AGN) was reduced or absent. Stem-loop (SL) structures, poly-T stretch, poly-[AT] n stretch and tandem repeats were found in the CRs of the two mitogenomes. The phylogenetic analyses using Bayesian inference (BI) and maximum likelihood methods (ML) generated identical results, both supporting the monophyly of all stonefly families and the two infraorders, Systellognatha and Euholognatha. Taeniopterygidae was grouped with another two families from Euholognatha. The relationships within Plecoptera were recovered as (((Perlidae+Peltoperlidae)+((Pteronarcyidae+Chloroperlidae)+Styloperlidae))+((Capniidae+Taeniopterygidae)+Nemouridae))+Gripopterygidae. Copyright © 2017 Elsevier B.V. All rights reserved.
Delphine M Saulnier
Full Text Available The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to differ with respect to antimicrobial production, biofilm formation, and immunomodulation. To explain possible mechanisms of survival in the host and probiosis, we completed a detailed genomic comparison of two breast milk-derived isolates representative of each group: an established probiotic strain (L. reuteri ATCC 55730 and a strain with promising probiotic features (L. reuteri ATCC PTA 6475. Transcriptomes of L. reuteri strains in different growth phases were monitored using strain-specific microarrays, and compared using a pan-metabolic model representing all known metabolic reactions present in these strains. Both strains contained candidate genes involved in the survival and persistence in the gut such as mucus-binding proteins and enzymes scavenging reactive oxygen species. A large operon predicted to encode the synthesis of an exopolysaccharide was identified in strain 55730. Both strains were predicted to produce health-promoting factors, including antimicrobial agents and vitamins (folate, vitamin B(12. Additionally, a complete pathway for thiamine biosynthesis was predicted in strain 55730 for the first time in this species. Candidate genes responsible for immunomodulatory properties of each strain were identified by transcriptomic comparisons. The production of bioactive metabolites by human-derived probiotics may be predicted using metabolic modeling and transcriptomics. Such strategies may facilitate selection and optimization of probiotics for health promotion, disease prevention and amelioration.
Saulnier, Delphine M; Santos, Filipe; Roos, Stefan; Mistretta, Toni-Ann; Spinler, Jennifer K; Molenaar, Douwe; Teusink, Bas; Versalovic, James
The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to differ with respect to antimicrobial production, biofilm formation, and immunomodulation. To explain possible mechanisms of survival in the host and probiosis, we completed a detailed genomic comparison of two breast milk-derived isolates representative of each group: an established probiotic strain (L. reuteri ATCC 55730) and a strain with promising probiotic features (L. reuteri ATCC PTA 6475). Transcriptomes of L. reuteri strains in different growth phases were monitored using strain-specific microarrays, and compared using a pan-metabolic model representing all known metabolic reactions present in these strains. Both strains contained candidate genes involved in the survival and persistence in the gut such as mucus-binding proteins and enzymes scavenging reactive oxygen species. A large operon predicted to encode the synthesis of an exopolysaccharide was identified in strain 55730. Both strains were predicted to produce health-promoting factors, including antimicrobial agents and vitamins (folate, vitamin B(12)). Additionally, a complete pathway for thiamine biosynthesis was predicted in strain 55730 for the first time in this species. Candidate genes responsible for immunomodulatory properties of each strain were identified by transcriptomic comparisons. The production of bioactive metabolites by human-derived probiotics may be predicted using metabolic modeling and transcriptomics. Such strategies may facilitate selection and optimization of probiotics for health promotion, disease prevention and amelioration.
Pilgrim, Jack; Ander, Mats; Garros, Claire; Baylis, Matthew; Hurst, Gregory D D; Siozios, Stefanos
There is increasing interest in the heritable bacteria of invertebrate vectors of disease as they present novel targets for control initiatives. Previous studies on biting midges (Culicoides spp.), known to transmit several RNA viruses of veterinary importance, have revealed infections with the endosymbiotic bacteria, Wolbachia and Cardinium. However, rickettsial symbionts in these vectors are underexplored. Here, we present the genome of a previously uncharacterized Rickettsia endosymbiont from Culicoides newsteadi (RiCNE). This genome presents unique features potentially associated with host invasion and adaptation, including genes for the complete non-oxidative phase of the pentose phosphate pathway, and others predicted to mediate lipopolysaccharides and cell wall modification. Screening of 414 Culicoides individuals from 29 Palearctic or Afrotropical species revealed that Rickettsia represent a widespread but previously overlooked association, reaching high frequencies in midge populations and present in 38% of the species tested. Sequence typing clusters the Rickettsia within the Torix group of the genus, a group known to infect several aquatic and hematophagous taxa. FISH analysis indicated the presence of Rickettsia bacteria in ovary tissue, indicating their maternal inheritance. Given the importance of biting midges as vectors, a key area of future research is to establish the impact of this endosymbiont on vector competence. © 2017 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.
Rohde, Palle Duun; Demontis, Ditte; Børglum, Anders
is enriched for causal variants. Here we apply the GFBLUP model to a small schizophrenia case-control study to test the promise of this model on psychiatric disorders, and hypothesize that the performance will be increased when applying the model to a larger ADHD case-control study if the genomic feature...... contains the causal variants. Materials and Methods: The schizophrenia study consisted of 882 controls and 888 schizophrenia cases genotyped for 520,000 SNPs. The ADHD study contained 25,954 controls and 16,663 ADHD cases with 8,4 million imputed genotypes. Results: The predictive ability for schizophrenia.......6% for the null model). Conclusion: The improvement in predictive ability for schizophrenia was marginal, however, greater improvement is expected for the larger ADHD data....
Liu, Xiang; Li, Shangqi; Peng, Wenzhu; Feng, Shuaisheng; Feng, Jianxin; Mahboob, Shahid; Al-Ghanim, Khalid A; Xu, Peng
The ATP-binding cassette (ABC) gene family is considered to be one of the largest gene families in all forms of prokaryotic and eukaryotic life. Although the ABC transporter genes have been annotated in some species, detailed information about the ABC superfamily and the evolutionary characterization of ABC genes in common carp (Cyprinus carpio) are still unclear. In this research, we identified 61 ABC transporter genes in the common carp genome. Phylogenetic analysis revealed that they could be classified into seven subfamilies, namely 11 ABCAs, six ABCBs, 19 ABCCs, eight ABCDs, two ABCEs, four ABCFs, and 11 ABCGs. Comparative analysis of the ABC genes in seven vertebrate species including common carp, showed that at least 10 common carp genes were retained from the third round of whole genome duplication, while 12 duplicated ABC genes may have come from the fourth round of whole genome duplication. Gene losses were also observed for 14 ABC genes. Expression profiles of the 61 ABC genes in six common carp tissues (brain, heart, spleen, kidney, intestine, and gill) revealed extensive functional divergence among the ABC genes. Different copies of some genes had tissue-specific expression patterns, which may indicate some gene function specialization. This study provides essential genomic resources for future studies in common carp.
Peng, Wenzhu; Feng, Shuaisheng; Feng, Jianxin; Mahboob, Shahid; Al-Ghanim, Khalid A.
The ATP-binding cassette (ABC) gene family is considered to be one of the largest gene families in all forms of prokaryotic and eukaryotic life. Although the ABC transporter genes have been annotated in some species, detailed information about the ABC superfamily and the evolutionary characterization of ABC genes in common carp (Cyprinus carpio) are still unclear. In this research, we identified 61 ABC transporter genes in the common carp genome. Phylogenetic analysis revealed that they could be classified into seven subfamilies, namely 11 ABCAs, six ABCBs, 19 ABCCs, eight ABCDs, two ABCEs, four ABCFs, and 11 ABCGs. Comparative analysis of the ABC genes in seven vertebrate species including common carp, showed that at least 10 common carp genes were retained from the third round of whole genome duplication, while 12 duplicated ABC genes may have come from the fourth round of whole genome duplication. Gene losses were also observed for 14 ABC genes. Expression profiles of the 61 ABC genes in six common carp tissues (brain, heart, spleen, kidney, intestine, and gill) revealed extensive functional divergence among the ABC genes. Different copies of some genes had tissue-specific expression patterns, which may indicate some gene function specialization. This study provides essential genomic resources for future studies in common carp. PMID:27058731
Full Text Available The ATP-binding cassette (ABC gene family is considered to be one of the largest gene families in all forms of prokaryotic and eukaryotic life. Although the ABC transporter genes have been annotated in some species, detailed information about the ABC superfamily and the evolutionary characterization of ABC genes in common carp (Cyprinus carpio are still unclear. In this research, we identified 61 ABC transporter genes in the common carp genome. Phylogenetic analysis revealed that they could be classified into seven subfamilies, namely 11 ABCAs, six ABCBs, 19 ABCCs, eight ABCDs, two ABCEs, four ABCFs, and 11 ABCGs. Comparative analysis of the ABC genes in seven vertebrate species including common carp, showed that at least 10 common carp genes were retained from the third round of whole genome duplication, while 12 duplicated ABC genes may have come from the fourth round of whole genome duplication. Gene losses were also observed for 14 ABC genes. Expression profiles of the 61 ABC genes in six common carp tissues (brain, heart, spleen, kidney, intestine, and gill revealed extensive functional divergence among the ABC genes. Different copies of some genes had tissue-specific expression patterns, which may indicate some gene function specialization. This study provides essential genomic resources for future studies in common carp.
Cyber security measures are generally implemented at nuclear facilities to protect against cyber-attacks that may compromise safety. The implementation of such cyber security measures may vary based on site specific requirements and each country's regulatory frameworks. Safety measures and cyber security measures for a nuclear power plant should be designed and implemented so that they do not compromise one another. This common position is intended to only apply to systems classified to the highest level of safety. The Digital Instrumentation and Controls Working Group (DICWG) has agreed that a common position on this topic is warranted given the increase of use of Digital I and C in new reactor designs, its safety implications, and the need to develop a common understanding from the perspectives of regulatory authorities. This action follows the DICWG examination of the regulatory requirements of the participating members and of relevant industry standards and IAEA documents. The DICWG proposes a common position based on its recent experience with the new reactor application reviews and operating plant issues
Comparative Genomic Analyses of Multiple Pseudomonas Strains Infecting Corylus avellana Trees Reveal the Occurrence of Two Genetic Clusters with Both Common and Distinctive Virulence and Fitness Traits
Marcelletti, Simone; Scortichini, Marco
The European hazelnut (Corylus avellana) is threatened in Europe by several pseudomonads which cause symptoms ranging from twig dieback to tree death. A comparison of the draft genomes of nine Pseudomonas strains isolated from symptomatic C. avellana trees was performed to identify common and distinctive genomic traits. The thorough assessment of genetic relationships among the strains revealed two clearly distinct clusters: P. avellanae and P. syringae. The latter including the pathovars avellanae, coryli and syringae. Between these two clusters, no recombination event was found. A genomic island of approximately 20 kb, containing the hrp/hrc type III secretion system gene cluster, was found to be present without any genomic difference in all nine pseudomonads. The type III secretion system effector repertoires were remarkably different in the two groups, with P. avellanae showing a higher number of effectors. Homologue genes of the antimetabolite mangotoxin and ice nucleation activity clusters were found solely in all P. syringae pathovar strains, whereas the siderophore yersiniabactin was only present in P. avellanae. All nine strains have genes coding for pectic enzymes and sucrose metabolism. By contrast, they do not have genes coding for indolacetic acid and anti-insect toxin. Collectively, this study reveals that genomically different Pseudomonas can converge on the same host plant by suppressing the host defence mechanisms with the use of different virulence weapons. The integration into their genomes of a horizontally acquired genomic island could play a fundamental role in their evolution, perhaps giving them the ability to exploit new ecological niches. PMID:26147218
Oliveira José L
Full Text Available Abstract Background The evolutionary forces that determine the arrangement of synonymous codons within open reading frames and fine tune mRNA translation efficiency are not yet understood. In order to tackle this question we have carried out a large scale study of codon-triplet contexts in 11 fungal species to unravel associations or relationships between codons present at the ribosome A-, P- and E-sites during each decoding cycle. Results Our analysis unveiled high bias within the context of codon-triplets, in particular strong preference for triplets of identical codons. We have also identified a surprisingly large number of codon-triplet combinations that vanished from fungal ORFeomes. Candida albicans exacerbated these features, showed an unbalanced tRNA population for decoding its pool of codons and used near-cognate decoding for a large set of codons, suggesting that unique evolutionary forces shaped the evolution of its ORFeome. Conclusion We have developed bioinformatics tools for large-scale analysis of codon-triplet contexts. These algorithms identified codon-triplets context biases, allowed for large scale comparative codon-triplet analysis, and identified rules governing codon-triplet context. They could also detect alterations to the standard genetic code.
Dietzgen, Ralf G.; Kondo, Hideki; Goodin, Michael M.; Kurath, Gael; Vasilakis, Nikos
The family Rhabdoviridae consists of mostly enveloped, bullet-shaped or bacilliform viruses with a negative-sense, single-stranded RNA genome that infect vertebrates, invertebrates or plants. This ecological diversity is reflected by the diversity and complexity of their genomes. Five canonical structural protein genes are conserved in all rhabdoviruses, but may be overprinted, overlapped or interspersed with several novel and diverse accessory genes. This review gives an overview of the characteristics and diversity of rhabdoviruses, their taxonomic classification, replication mechanism, properties of classical rhabdoviruses such as rabies virus and rhabdoviruses with complex genomes, rhabdoviruses infecting aquatic species, and plant rhabdoviruses with both mono- and bipartite genomes.
Full Text Available BACKGROUND: It is difficult to identify copy number variations (CNV in normal human genomic data due to noise and non-linear relationships between different genomic regions and signal intensity. A high-resolution array comparative genomic hybridization (aCGH containing 42 million probes, which is very large compared to previous arrays, was recently published. Most existing CNV detection algorithms do not work well because of noise associated with the large amount of input data and because most of the current methods were not designed to analyze normal human samples. Normal human genome analysis often requires a joint approach across multiple samples. However, the majority of existing methods can only identify CNVs from a single sample. METHODOLOGY AND PRINCIPAL FINDINGS: We developed a multi-sample-based genomic variations detector (MGVD that uses segmentation to identify common breakpoints across multiple samples and a k-means-based clustering strategy. Unlike previous methods, MGVD simultaneously considers multiple samples with different genomic intensities and identifies CNVs and CNV zones (CNVZs; CNVZ is a more precise measure of the location of a genomic variant than the CNV region (CNVR. CONCLUSIONS AND SIGNIFICANCE: We designed a specialized algorithm to detect common CNVs from extremely high-resolution multi-sample aCGH data. MGVD showed high sensitivity and a low false discovery rate for a simulated data set, and outperformed most current methods when real, high-resolution HapMap datasets were analyzed. MGVD also had the fastest runtime compared to the other algorithms evaluated when actual, high-resolution aCGH data were analyzed. The CNVZs identified by MGVD can be used in association studies for revealing relationships between phenotypes and genomic aberrations. Our algorithm was developed with standard C++ and is available in Linux and MS Windows format in the STL library. It is freely available at: http://embio.yonsei.ac.kr/~Park/mgvd.php.
Debowski, Katharina; Drummer, Charis; Lentes, Jana; Cors, Maren; Dressel, Ralf; Lingner, Thomas; Salinas-Riester, Gabriela; Fuchs, Sigrid; Sasaki, Erika; Behr, Rüdiger
Embryonic stem cells (ESCs) are useful for the study of embryonic development. However, since research on naturally conceived human embryos is limited, non-human primate (NHP) embryos and NHP ESCs represent an excellent alternative to the corresponding human entities. Though, ESC lines derived from naturally conceived NHP embryos are still very rare. Here, we report the generation and characterization of four novel ESC lines derived from natural preimplantation embryos of the common marmoset monkey (Callithrix jacchus). For the first time we document derivation of NHP ESCs derived from morula stages. We show that quantitative chromosome-wise transcriptome analyses precisely reflect trisomies present in both morula-derived ESC lines. We also demonstrate that the female ESC lines exhibit different states of X-inactivation which is impressively reflected by the abundance of the lncRNA X inactive-specific transcript (XIST). The novel marmoset ESC lines will promote basic primate embryo and ESC studies as well as preclinical testing of ESC-based regenerative approaches in NHP.
Hung, Rayjean J; Ulrich, Cornelia M; Goode, Ellen L; Brhane, Yonathan; Muir, Kenneth; Chan, Andrew T; Marchand, Loic Le; Schildkraut, Joellen; Witte, John S; Eeles, Rosalind; Boffetta, Paolo; Spitz, Margaret R; Poirier, Julia G; Rider, David N; Fridley, Brooke L; Chen, Zhihua; Haiman, Christopher; Schumacher, Fredrick; Easton, Douglas F; Landi, Maria Teresa; Brennan, Paul; Houlston, Richard; Christiani, David C; Field, John K; Bickeböller, Heike; Risch, Angela; Kote-Jarai, Zsofia; Wiklund, Fredrik; Grönberg, Henrik; Chanock, Stephen; Berndt, Sonja I; Kraft, Peter; Lindström, Sara; Al Olama, Ali Amin; Song, Honglin; Phelan, Catherine; Wentzensen, Nicholas; Peters, Ulrike; Slattery, Martha L; Sellers, Thomas A; Casey, Graham; Gruber, Stephen B; Hunter, David J; Amos, Christopher I; Henderson, Brian
Inflammation has been hypothesized to increase the risk of cancer development as an initiator or promoter, yet no large-scale study of inherited variation across cancer sites has been conducted. We conducted a cross-cancer genomic analysis for the inflammation pathway based on 48 genome-wide association studies within the National Cancer Institute GAME-ON Network across five common cancer sites, with a total of 64 591 cancer patients and 74 467 control patients. Subset-based meta-analysis was used to account for possible disease heterogeneity, and hierarchical modeling was employed to estimate the effect of the subcomponents within the inflammation pathway. The network was visualized by enrichment map. All statistical tests were two-sided. We identified three pleiotropic loci within the inflammation pathway, including one novel locus in Ch12q24 encoding SH2B3 (rs3184504), which reached GWAS significance with a P value of 1.78 x 10(-8), and it showed an association with lung cancer (P = 2.01 x 10(-6)), colorectal cancer (GECCO P = 6.72x10(-6); CORECT P = 3.32x10(-5)), and breast cancer (P = .009). We also identified five key subpathway components with genetic variants that are relevant for the risk of these five cancer sites: inflammatory response for colorectal cancer (P = .006), inflammation related cell cycle gene for lung cancer (P = 1.35x10(-6)), and activation of immune response for ovarian cancer (P = .009). In addition, sequence variations in immune system development played a role in breast cancer etiology (P = .001) and innate immune response was involved in the risk of both colorectal (P = .022) and ovarian cancer (P = .003). Genetic variations in inflammation and its related subpathway components are keys to the development of lung, colorectal, ovary, and breast cancer, including SH2B3, which is associated with lung, colorectal, and breast cancer. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e
Gerritsen, Alida T; New, Daniel D; Robison, Barrie D; Rashed, Arash; Hohenlohe, Paul; Forney, Larry; Rashidi, Mahnaz; Wilson, Cathy M; Settles, Matthew L
We report here the full mitochondrial genome sequence of Limonius californicus, a species of click beetle that is an agricultural pest in its larval form. The circular genome is 16.5 kb and contains 13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes. Copyright © 2016 Gerritsen et al.
Henzy, Jamie E; Gifford, Robert J; Johnson, Welkin E; Coffin, John M
Endogenous retroviruses (ERVs) represent ancestral sequences of modern retroviruses or their extinct relatives. The majority of ERVs cluster alongside exogenous retroviruses into two main groups based on phylogenetic analyses of the reverse transcriptase (RT) enzyme. Class I includes gammaretroviruses, and class II includes lentiviruses and alpha-, beta-, and deltaretroviruses. However, analyses of the transmembrane subunit (TM) of the envelope glycoprotein (env) gene result in a different topology for some retroviruses, suggesting recombination events in which heterologous env sequences have been acquired. We previously demonstrated that the TM sequences of five of the six genera of orthoretroviruses can be divided into three types, each of which infects a distinct set of vertebrate classes. Moreover, these classes do not always overlap the host range of the associated RT classes. Thus, recombination resulting in acquisition of a heterologous env gene could in theory facilitate cross-species transmissions across vertebrate classes, for example, from mammals to reptiles. Here we characterized a family of class II avian ERVs, "TgERV-F," that acquired a mammalian gammaretroviral env sequence. Although TgERV-F clusters near a sister clade to alpharetroviruses, its genome also has some features of betaretroviruses. We offer evidence that this unusual recombinant has circulated among several avian orders and may still have infectious members. In addition to documenting the infection of a nongalliform avian species by a mammalian retrovirus, TgERV-F also underscores the importance of env sequences in reconstructing phylogenies and supports a possible role for env swapping in allowing cross-species transmissions across wide taxonomic distances. Retroviruses can sometimes acquire an envelope gene (env) from a distantly related retrovirus. Since env is a key determinant of host range, such an event affects the host range of the recombinant virus and can lead to the creation
Almstrup, Kristian; Hoei-Hansen, Christina E; Wirkner, Ute; Blake, Jonathon; Schwager, Christian; Ansorge, Wilhelm; Nielsen, John E; Skakkebaek, Niels E; Rajpert-De Meyts, Ewa; Leffers, Henrik
Carcinoma in situ (CIS) is the common precursor of histologically heterogeneous testicular germ cell tumors (TGCTs), which in recent decades have markedly increased and now are the most common malignancy of young men. Using genome-wide gene expression profiling, we identified >200 genes highly expressed in testicular CIS, including many never reported in testicular neoplasms. Expression was further verified by semiquantitative reverse transcription-PCR and in situ hybridization. Among the highest expressed genes were NANOG and POU5F1, and reverse transcription-PCR revealed possible changes in their stoichiometry on progression into embryonic carcinoma. We compared the CIS expression profile with patterns reported in embryonic stem cells (ESCs), which revealed a substantial overlap that may be as high as 50%. We also demonstrated an over-representation of expressed genes in regions of 17q and 12, reported as unstable in cultured ESCs. The close similarity between CIS and ESCs explains the pluripotency of CIS. Moreover, the findings are consistent with an early prenatal origin of TGCTs and thus suggest that etiologic factors operating in utero are of primary importance for the incidence trends of TGCTs. Finally, some of the highly expressed genes identified in this study are promising candidates for new diagnostic markers for CIS and/or TGCTs.
Chiang, Hsueh-Sheng; Eroh, Justin; Spence, Jeffrey S; Motes, Michael A; Maguire, Mandy J; Krawczyk, Daniel C; Brier, Matthew R; Hart, John; Kraut, Michael A
How the brain combines the neural representations of features that comprise an object in order to activate a coherent object memory is poorly understood, especially when the features are presented in different modalities (visual vs. auditory) and domains (verbal vs. nonverbal). We examined this question using three versions of a modified Semantic Object Retrieval Test, where object memory was probed by a feature presented as a written word, a spoken word, or a picture, followed by a second feature always presented as a visual word. Participants indicated whether each feature pair elicited retrieval of the memory of a particular object. Sixteen subjects completed one of the three versions (N=48 in total) while their EEG were recorded simultaneously. We analyzed EEG data in four separate frequency bands (delta: 1-4Hz, theta: 4-7Hz; alpha: 8-12Hz; beta: 13-19Hz) using a multivariate data-driven approach. We found that alpha power time-locked to response was modulated by both cross-modality (visual vs. auditory) and cross-domain (verbal vs. nonverbal) probing of semantic object memory. In addition, retrieval trials showed greater changes in all frequency bands compared to non-retrieval trials across all stimulus types in both response-locked and stimulus-locked analyses, suggesting dissociable neural subcomponents involved in binding object features to retrieve a memory. We conclude that these findings support both modality/domain-dependent and modality/domain-independent mechanisms during semantic object memory retrieval. Copyright © 2016 Elsevier B.V. All rights reserved.
Chen, Zhi-Teng; Du, Yu-Zhou
The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer ( AGN ), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae.
Full Text Available Various attempts have been made to predict the individual disease risk based on genotype data from genome-wide association studies (GWAS. However, most studies only investigated one or two classification algorithms and feature encoding schemes. In this study, we applied seven different classification algorithms on GWAS case-control data sets for seven different diseases to create models for disease risk prediction. Further, we used three different encoding schemes for the genotypes of single nucleotide polymorphisms (SNPs and investigated their influence on the predictive performance of these models. Our study suggests that an additive encoding of the SNP data should be the preferred encoding scheme, as it proved to yield the best predictive performances for all algorithms and data sets. Furthermore, our results showed that the differences between most state-of-the-art classification algorithms are not statistically significant. Consequently, we recommend to prefer algorithms with simple models like the linear support vector machine (SVM as they allow for better subsequent interpretation without significant loss of accuracy.
Dietzgen, Ralf G; Kondo, Hideki; Goodin, Michael M; Kurath, Gael; Vasilakis, Nikos
The family Rhabdoviridae consists of mostly enveloped, bullet-shaped or bacilliform viruses with a negative-sense, single-stranded RNA genome that infect vertebrates, invertebrates or plants. This ecological diversity is reflected by the diversity and complexity of their genomes. Five canonical structural protein genes are conserved in all rhabdoviruses, but may be overprinted, overlapped or interspersed with several novel and diverse accessory genes. This review gives an overview of the characteristics and diversity of rhabdoviruses, their taxonomic classification, replication mechanism, properties of classical rhabdoviruses such as rabies virus and rhabdoviruses with complex genomes, rhabdoviruses infecting aquatic species, and plant rhabdoviruses with both mono- and bipartite genomes. Copyright Â© 2016 Elsevier B.V. All rights reserved.
Dietzgen, Ralf G.; Kondo, Hideki; Goodin, Michael M.; Kurath, Gael; Vasilakis, Nikos
The family Rhabdoviridae consists of mostly enveloped, bullet-shaped or bacilliform viruses with a negative-sense, single-stranded RNA genome that infect vertebrates, invertebrates or plants. This ecological diversity is reflected by the diversity and complexity of their genomes. Five canonical structural protein genes are conserved in all rhabdoviruses, but may be overprinted, overlapped or interspersed with several novel and diverse accessory genes. This review gives an overview of the char...
de Queiroz, Casley Borges; Santana, Mateus Ferreira; Pereira Vidigal, Pedro M; de Queiroz, Marisa Vieira
Fungi of the genus Colletotrichum are economically important and are used as models in plant-pathogen interaction studies. In this study, the complete mitochondrial genomes of two Colletotrichum lindemuthianum isolates were sequenced and compared with the mitochondrial genomes of seven species of Colletotrichum. The mitochondrial genome of C. lindemuthianum is a typical circular molecule 37,446 bp (isolate 89 A 2 2-3) and 37,440 bp (isolate 83.501) in length. The difference of six nucleotides between the two genomes is the result of a deletion in the ribosomal protein S3 (rps3) gene in the 83.501 isolate. In addition, substitution of adenine for guanine within the rps3 gene in the mitochondrial genome of the 83.501 isolate was observed. Compared to the previously sequenced C. lindemuthianum mitochondrial genome, an exon no annotated in the cytochrome c oxidase I (cox1) gene and a non-conserved open reading frame (ncORF) were observed. The size of the mitochondrial genomes of the seven species of Colletotrichum was highly variable, being attributed mainly to the ncORF, ranging from one to 10 and also from introns ranging from one to 11 and which encode a total of up to nine homing endonucleases. This paper reports for the first time by means of transcriptome that then ncORFs are transcribed in Colletotrichum spp. Phylogeny data revealed that core mitochondrial genes could be used as an alternative in phylogenetic relationship studies in Colletotrichum spp. This work contributes to the genetic and biological knowledge of Colletotrichum spp., which is of great economic and scientific importance.
Nikki S Vyas
Full Text Available Background: The quality and limitations of digital slides are not fully known. We aimed to estimate intrapathologist discrepancy in detecting specific microscopic features on glass slides and digital slides created by scanning at ×20. Methods: Hematoxylin and eosin and periodic acid-Schiff glass slides were digitized using the Mirax Scan (Carl Zeiss Inc., Germany. Six pathologists assessed 50-71 digital slides. We recorded objective magnification, total time, and detection of the following: Mast cells; eosinophils; plasma cells; pigmented macrophages; melanin in the epidermis; fungal bodies; neutrophils; civatte bodies; parakeratosis; and sebocytes. This process was repeated using the corresponding glass slides after 3 weeks. The diagnosis was not required. Results: The mean time to assess digital slides was 176.77 s and 137.61 s for glass slides (P < 0.001, 99% confidence interval [CI]. The mean objective magnification used to detect features using digital slides was 18.28 and 14.07 for glass slides (P < 0.001, 99.99% CI. Parakeratosis, civatte bodies, pigmented macrophages, melanin in the epidermis, mast cells, eosinophils, plasma cells, and neutrophils, were identified at lower objectives on glass slides (P = 0.023-0.001, 95% CI. Average intraobserver concordance ranged from κ = 0.30 to κ = 0.78. Features with poor to fair average concordance were: Melanin in the epidermis (κ = 0.15-0.58; plasma cells (κ = 0.15-0.49; and neutrophils (κ = 0.12-0.48. Features with moderate average intrapathologist concordance were: parakeratosis (κ = 0.21-0.61; civatte bodies (κ = 0.21-0.71; pigment-laden macrophages (κ = 0.34-0.66; mast cells (κ = 0.29-0.78; and eosinophils (κ = 0.31-0.79. The average intrapathologist concordance was good for sebocytes (κ = 0.51-1.00 and fungal bodies (κ = 0.47-0.76. Conclusions: Telepathology using digital slides scanned at ×20 is sufficient for detection of histopathologic features routinely encountered in
Saulnier, Delphine M.; Santos, Filipe; Roos, Stefan; Mistretta, Toni-Ann; Spinler, Jennifer K.; Molenaar, Douwe; Teusink, Bas; Versalovic, James
The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to differ with respect to antimicrobial production, biofilm formation, and immunomodulation. To explain possible mechanisms of survival in the host and probiosis, we completed a detailed genomic comparison...
Full Text Available We are currently faced with a global infectious disease crisis which has been anticipated for decades. While many promising biotherapeutics are being tested, the search for a small molecule has yet to deliver an approved drug or therapeutic for the Ebola or similar filoviruses that cause haemorrhagic fever. Two recent high throughput screens published in 2013 did however identify several hits that progressed to animal studies that are FDA approved drugs used for other indications. The current computational analysis uses these molecules from two different structural classes to construct a common features pharmacophore. This ligand-based pharmacophore implicates a possible common target or mechanism that could be further explored. A recent structure based design project yielded nine co-crystal structures of pyrrolidinone inhibitors bound to the viral protein 35 (VP35. When receptor-ligand pharmacophores based on the analogs of these molecules and the protein structures were constructed, the molecular features partially overlapped with the common features of solely ligand-based pharmacophore models based on FDA approved drugs. These previously identified FDA approved drugs with activity against Ebola were therefore docked into this protein. The antimalarials chloroquine and amodiaquine docked favorably in VP35. We propose that these drugs identified to date as inhibitors of the Ebola virus may be targeting VP35. These computational models may provide preliminary insights into the molecular features that are responsible for their activity against Ebola virus in vitro and in vivo and we propose that this hypothesis could be readily tested.
Full Text Available We are currently faced with a global infectious disease crisis which has been anticipated for decades. While many promising biotherapeutics are being tested, the search for a small molecule has yet to deliver an approved drug or therapeutic for the Ebola or similar filoviruses that cause haemorrhagic fever. Two recent high throughput screens published in 2013 did however identify several hits that progressed to animal studies that are FDA approved drugs used for other indications. The current computational analysis uses these molecules from two different structural classes to construct a common features pharmacophore. This ligand-based pharmacophore implicates a possible common target or mechanism that could be further explored. A recent structure based design project yielded nine co-crystal structures of pyrrolidinone inhibitors bound to the viral protein 35 (VP35. When receptor-ligand pharmacophores based on the analogs of these molecules and the protein structures were constructed, the molecular features partially overlapped with the common features of solely ligand-based pharmacophore models based on FDA approved drugs. These previously identified FDA approved drugs with activity against Ebola were therefore docked into this protein. The antimalarials chloroquine and amodiaquine docked favorably in VP35. We propose that these drugs identified to date as inhibitors of the Ebola virus may be targeting VP35. These computational models may provide preliminary insights into the molecular features that are responsible for their activity against Ebola virus in vitro and in vivo and we propose that this hypothesis could be readily tested.
William A. MacDonald
Full Text Available Genomic imprinting is a form of epigenetic inheritance whereby the regulation of a gene or chromosomal region is dependent on the sex of the transmitting parent. During gametogenesis, imprinted regions of DNA are differentially marked in accordance to the sex of the parent, resulting in parent-specific expression. While mice are the primary research model used to study genomic imprinting, imprinted regions have been described in a broad variety of organisms, including other mammals, plants, and insects. Each of these organisms employs multiple, interrelated, epigenetic mechanisms to maintain parent-specific expression. While imprinted genes and imprint control regions are often species and locus-specific, the same suites of epigenetic mechanisms are often used to achieve imprinted expression. This review examines some examples of the epigenetic mechanisms responsible for genomic imprinting in mammals, plants, and insects.
Edsgard, Stefan Daniel; Dalgaard, Marlene Danner; Weinhold, Nils
Testicular germ cell cancer (TGCC) is one of the most heritable forms of cancer. Previous genome-wide association studies have focused on single nucleotide polymorphisms, largely ignoring the influence of copy number variants (CNVs). Here we present a genome-wide study of CNV on a cohort of 212...... of rare CNVs related to cell migration (false-discovery rate = 0.021, 1.8% of cases and 1.1% of controls). Dysregulation during migration of primordial germ cells has previously been suspected to be a part of TGCC development and this set of multiple rare variants may thereby have a minor contribution...
Jiang, Wen-kai; Liu, Yun-long; Xia, En-hua; Gao, Li-zhi
The evolution of genes and genomes after polyploidization has been the subject of extensive studies in evolutionary biology and plant sciences. While a significant number of duplicated genes are rapidly removed during a process called fractionation, which operates after the whole-genome duplication (WGD), another considerable number of genes are retained preferentially, leading to the phenomenon of biased gene retention. However, the evolutionary mechanisms underlying gene retention after WGD remain largely unknown. Through genome-wide analyses of sequence and functional data, we comprehensively investigated the relationships between gene features and the retention probability of duplicated genes after WGDs in six plant genomes, Arabidopsis (Arabidopsis thaliana), poplar (Populus trichocarpa), soybean (Glycine max), rice (Oryza sativa), sorghum (Sorghum bicolor), and maize (Zea mays). The results showed that multiple gene features were correlated with the probability of gene retention. Using a logistic regression model based on principal component analysis, we resolved evolutionary rate, structural complexity, and GC3 content as the three major contributors to gene retention. Cluster analysis of these features further classified retained genes into three distinct groups in terms of gene features and evolutionary behaviors. Type I genes are more prone to be selected by dosage balance; type II genes are possibly subject to subfunctionalization; and type III genes may serve as potential targets for neofunctionalization. This study highlights that gene features are able to act jointly as primary forces when determining the retention and evolution of WGD-derived duplicated genes in flowering plants. These findings thus may help to provide a resolution to the debate on different evolutionary models of gene fates after WGDs. PMID:23396833
T.A. Knoch (Tobias); M. Göcker; R. Lohner (Rudolf); A. Abuseiris (Anis); F.G. Grosveld (Frank)
textabstractThe sequential organization of genomes, i.e. the relations between distant base pairs and regions within sequences, and its connection to the three-dimensional organization of genomes is still a largely unresolved problem. Long-range power-law correlations were found using correlation
Saulnier, D.M.A.; Santos, F.; Roos, S.; Mistretta, T.-A.; Spinler, J.K.; Molenaar, D.; Teusink, B.; Versalovic, J.
The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to
Saulnier, D.M.; Santos, F.; Roos, S.; Mistretta, T.A.; Spinler, J.K.; Molenaar, D.; Teusink, B.; Versalovic, J.
The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to
Saulnier, D.M.; santos, F.; Roos, S.; Mistretta, T.A.; Spinler, J.K.; Molenaar, D.; Teusink, B.; Versalovic, J.
The genomes of four Lactobacillus reuteri strains isolated from human breast milk and the gastrointestinal tract have been recently sequenced as part of the Human Microbiome Project. Preliminary genome comparisons suggested that these strains belong to two different clades, previously shown to
Esserlind, A-L; Christensen, A F; Le, H
Genetic factors contribute to the aetiology of the prevalent form of migraine without aura (MO) and migraine with typical aura (MTA). Due to the complex inheritance of MO and MTA, the genetic background is still not fully established. In a population-based genome-wide association study by Chasman...
Daniel Antunes Silva Pereira
Full Text Available OBJECTIVE: To describe the characteristics of a cohort of patients with lung-dominant connective tissue disease (LD-CTD. METHODS: This was a retrospective study of patients with interstitial lung disease (ILD, positive antinuclear antibody (ANA results (≥ 1/320, with or without specific autoantibodies, and at least one clinical feature suggestive of connective tissue disease (CTD. RESULTS: Of the 1,998 patients screened, 52 initially met the criteria for a diagnosis of LD-CTD: 37% were male; the mean age at diagnosis was 56 years; and the median follow-up period was 48 months. During follow-up, 8 patients met the criteria for a definitive diagnosis of a CTD. The remaining 44 patients comprised the LD-CTD group, in which the most prevalent extrathoracic features were arthralgia, gastroesophageal reflux disease, and Raynaud's phenomenon. The most prevalent autoantibodies in this group were ANA (89% and anti-SSA (anti-Ro, 27%. The mean baseline and final FVC was 69.5% and 74.0% of the predicted values, respectively (p > 0.05. Nonspecific interstitial pneumonia and usual interstitial pneumonia patterns were found in 45% and 9% of HRCT scans, respectively; 36% of the scans were unclassifiable. A similar prevalence was noted in histological samples. Diffuse esophageal dilatation was identified in 52% of HRCT scans. Nailfold capillaroscopy was performed in 22 patients; 17 showed a scleroderma pattern. CONCLUSIONS: In our LD-CTD group, there was predominance of females and the patients showed mild spirometric abnormalities at diagnosis, with differing underlying ILD patterns that were mostly unclassifiable on HRCT and by histology. We found functional stability on follow-up. Esophageal dilatation on HRCT and scleroderma pattern on nailfold capillaroscopy were frequent findings and might come to serve as diagnostic criteria.
Dreger, Dayna L.; Davis, Brian W.; Cocco, Raffaella; Sechi, Sara; Di Cerbo, Alessandro; Parker, Heidi G.; Polli, Michele; Marelli, Stefano P.; Crepaldi, Paola; Ostrander, Elaine A.
The island inhabitants of Sardinia have long been a focus for studies of complex human traits due to their unique ancestral background and population isolation reflecting geographic and cultural restriction. Population isolates share decreased genomic diversity, increased linkage disequilibrium, and increased inbreeding coefficients. In many regions, dogs and humans have been exposed to the same natural and artificial forces of environment, growth, and migration. Distinct dog breeds have arisen through human-driven selection of characteristics to meet an ideal standard of appearance and function. The Fonni’s Dog, an endemic dog population on Sardinia, has not been subjected to an intensive system of artificial selection, but rather has developed alongside the human population of Sardinia, influenced by geographic isolation and unregulated selection based on its environmental adaptation and aptitude for owner-desired behaviors. Through analysis of 28 dog breeds, represented with whole-genome sequences from 13 dogs and ∼170,000 genome-wide single nucleotide variants from 155 dogs, we have produced a genomic illustration of the Fonni’s Dog. Genomic patterns confirm within-breed similarity, while population and demographic analyses provide spatial identity of Fonni’s Dog to other Mediterranean breeds. Investigation of admixture and fixation indices reveals insights into the involvement of Fonni’s Dogs in breed development throughout the Mediterranean. We describe how characteristics of population isolates are reflected in dog breeds that have undergone artificial selection, and are mirrored in the Fonni’s Dog through traditional isolating factors that affect human populations. Lastly, we show that the genetic history of Fonni’s Dog parallels demographic events in local human populations. PMID:27519604
The author tries to identify V. Shukshin and Azerbaijani writer's reference points during creation of the concept of character in article The characteristic features which are common for heroes in works by I. Malik-zade and for heroes in stories by V. Shukshin. In this way present article can be considered as an original breakthrough, the first a swallow in enlightening of works by Shukshin and the representative of the national literature.
Full Text Available Background: Cornelia de Lange Syndrome (CdLS is a rare but severe clinically heterogeneous developmental disorder characterized by facial dysmorphia, growth and cognitive retardation, and abnormalities of limb development. Objectives: To determine the pathogenesis of a patient with CdLS. Methods: We studied a patient with CdLS by whole exome sequencing, karyotyping and Agilent CGH Array. The results were confirmed by quantitative real-time PCR analysis of the patient and her parents. Further comparison of our patient and cases with partially overlapping deletions retrieved from the literature and databases was undertaken. Results: Whole exome sequencing had excluded the mutation of cohesion genes such as NIPBL,SMC1A and SMC3. The result of karyotyping showed a deletion of chromosome 9q31.1-q32 and the result of Agilent CGH Array further displayed a 12.01-Mb region of deletion at chromosome bands 9q31.1-q32. Reported cases with the deletion of 9q31.1-q32 share similar features with our CdLS patient. One of the genes in the deleted region, SMC2, belongs to the Structural Maintenance of Chromosomes (SMC family and regulates gene expression and DNA repair. Conclusions: Patients carrying the deletion of 9q31.1-q32 showed similar phenotypes with CdLS.
Duplessis, Sebastien; Cuomo, Christina A.; Lin, Yao-Cheng; Aerts, Andrea; Tisserant, Emilie; Veneault-Fourrey, Claire; Joly, David L.; Hacquard, Stephane; Amselem, Joelle; Cantarel, Brandi; Chiu, Readman; Couthinho, Pedro; Feau, Nicolas; Field, Matthew; Frey, Pascal; Gelhaye, Eric; Goldberg, Jonathan; Grabherr, Manfred; Kodira, Chinnappa; Kohler, Annegret; Kues, Ursula; Lindquist, Erika; Lucas, Susan; Mago, Rohit; Mauceli, Evan; Morin, Emmanuelle; Murat, Claude; Pangilinan, Jasmyn L.; Park, Robert; Pearson, Matthew; Quesneville, Hadi; Rouhier, Nicolas; Sakthikumar, Sharadha; Salamov, Asaf A.; Schmutz, Jeremy; Selles, Benjamin; Shapiro, Harris; Tangay, Philippe; Tuskan, Gerald A.; Peer, Yves Van de; Henrissat, Bernard; Rouze, Pierre; Ellis, Jeffrey G.; Dodds, Peter N.; Schein, Jacqueline E.; Zhong, Shaobin; Hamelin, Richard C.; Grigoriev, Igor V.; Szabo, Les J.; Martin1, Francis
Rust fungi are some of the most devastating pathogens of crop plants. They are obligate biotrophs, which extract nutrients only from living plant tissues and cannot grow apart from their hosts. Their lifestyle has slowed the dissection of molecular mechanisms underlying host invasion and avoidance or suppression of plant innate immunity. We sequenced the 101 mega base pair genome of Melampsora larici-populina, the causal agent of poplar leaf rust, and the 89 mega base pair genome of Puccinia graminis f. sp. tritici, the causal agent of wheat and barley stem rust. We then compared the 16,841 predicted proteins of M. larici-populina to the 18,241 predicted proteins of P. graminis f. sp tritici. Genomic features related to their obligate biotrophic life-style include expanded lineage-specific gene families, a large repertoire of effector-like small secreted proteins (SSPs), impaired nitrogen and sulfur assimilation pathways, and expanded families of amino-acid, oligopeptide and hexose membrane transporters. The dramatic upregulation of transcripts coding for SSPs, secreted hydrolytic enzymes, and transporters in planta suggests that they play a role in host infection and nutrient acquisition. Some of these genomic hallmarks are mirrored in the genomes of other microbial eukaryotes that have independently evolved to infect plants, indicating convergent adaptation to a biotrophic existence inside plant cells
Rao, A; Net, J [University of Miami, Miami, Florida (United States); Brandt, K [Mayo Clinic, Rochester, Minnesota (United States); Huang, E [National Cancer Institute, NIH, Bethesda, MD (United States); Freymann, J; Kirby, J [Leidos Biomedical Research Inc., Frederick, MD (United States); Burnside, E [University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin (United States); Morris, E; Sutton, E [Memorial Sloan Kettering Cancer Center, New York, NY (United States); Bonaccio, E [Roswell Park Cancer Institute, Buffalo, NY (United States); Giger, M; Jaffe, C [Univ Chicago, Chicago, IL (United States); Ganott, M; Zuley, M [University of Pittsburgh Medical Center - Magee Womens Hospital, Pittsburgh, Pennsylvania (United States); Le-Petross, H [MD Anderson Cancer Center, Houston, TX (United States); Dogan, B [UT MDACC, Houston, TX (United States); Whitman, G [UTMDACC, Houston, TX (United States)
Purpose: To determine associations between radiologist-annotated MRI features and genomic measurements in breast invasive carcinoma (BRCA) from the Cancer Genome Atlas (TCGA). Methods: 98 TCGA patients with BRCA were assessed by a panel of radiologists (TCGA Breast Phenotype Research Group) based on a variety of mass and non-mass features according to the Breast Imaging Reporting and Data System (BI-RADS). Batch corrected gene expression data was obtained from the TCGA Data Portal. The Kruskal-Wallis test was used to assess correlations between categorical image features and tumor-derived genomic features (such as gene pathway activity, copy number and mutation characteristics). Image-derived features were also correlated with estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2/neu) status. Multiple hypothesis correction was done using Benjamini-Hochberg FDR. Associations at an FDR of 0.1 were selected for interpretation. Results: ER status was associated with rim enhancement and peritumoral edema. PR status was associated with internal enhancement. Several components of the PI3K/Akt pathway were associated with rim enhancement as well as heterogeneity. In addition, several components of cell cycle regulation and cell division were associated with imaging characteristics.TP53 and GATA3 mutations were associated with lesion size. MRI features associated with TP53 mutation status were rim enhancement and peritumoral edema. Rim enhancement was associated with activity of RB1, PIK3R1, MAP3K1, AKT1,PI3K, and PIK3CA. Margin status was associated with HIF1A/ARNT, Ras/ GTP/PI3K, KRAS, and GADD45A. Axillary lymphadenopathy was associated with RB1 and BCL2L1. Peritumoral edema was associated with Aurora A/GADD45A, BCL2L1, CCNE1, and FOXA1. Heterogeneous internal nonmass enhancement was associated with EGFR, PI3K, AKT1, HF/MET, and EGFR/Erbb4/neuregulin 1. Diffuse nonmass enhancement was associated with HGF/MET/MUC20/SHIP
Cronk, Quentin; Hidalgo, Oriane; Pellicer, Jaume; Percy, Diana; Leitch, Ilia J
The common stinging nettle, Urtica dioica L. sensu lato, is an invertebrate "superhost", its clonal patches maintaining large populations of insects and molluscs. It is extremely widespread in Europe and highly variable, and two ploidy levels (diploid and tetraploid) are known. However, geographical patterns in cytotype variation require further study. We assembled a collection of nettles in conjunction with a transect of Europe from the Aegean to Arctic Norway (primarily conducted to examine the diversity of Salix and Salix -associated insects). Using flow cytometry to measure genome size, our sample of 29 plants reveals 5 diploids and 24 tetraploids. Two diploids were found in SE Europe (Bulgaria and Romania) and three diploids in S. Finland. More detailed cytotype surveys in these regions are suggested. The tetraploid genome size (2C value) varied between accessions from 2.36 to 2.59 pg. The diploids varied from 1.31 to 1.35 pg per 2C nucleus, equivalent to a haploid genome size of c. 650 Mbp. Within the tetraploids, we find that the most northerly samples (from N. Finland and arctic Norway) have a generally higher genome size. This is possibly indicative of a distinct population in this region.
Liu, Wen; Ghouri, Fozia; Yu, Hang; Li, Xiang; Yu, Shuhong; Shahid, Muhammad Qasim; Liu, Xiangdong
Common wild rice (Oryza rufipogon Griff.) is an important germplasm for rice breeding, which contains many resistance genes. Re-sequencing provides an unprecedented opportunity to explore the abundant useful genes at whole genome level. Here, we identified the nucleotide-binding site leucine-rich repeat (NBS-LRR) encoding genes by re-sequencing of two wild rice lines (i.e. Huaye 1 and Huaye 2) that were developed from common wild rice. We obtained 128 to 147 million reads with approximately 32.5-fold coverage depth, and uniquely covered more than 89.6% (> = 1 fold) of reference genomes. Two wild rice lines showed high SNP (single-nucleotide polymorphisms) variation rate in 12 chromosomes against the reference genomes of Nipponbare (japonica cultivar) and 93-11 (indica cultivar). InDels (insertion/deletion polymorphisms) count-length distribution exhibited normal distribution in the two lines, and most of the InDels were ranged from -5 to 5 bp. With reference to the Nipponbare genome sequence, we detected a total of 1,209,308 SNPs, 161,117 InDels and 4,192 SVs (structural variations) in Huaye 1, and 1,387,959 SNPs, 180,226 InDels and 5,305 SVs in Huaye 2. A total of 44.9% and 46.9% genes exhibited sequence variations in two wild rice lines compared to the Nipponbare and 93-11 reference genomes, respectively. Analysis of NBS-LRR mutant candidate genes showed that they were mainly distributed on chromosome 11, and NBS domain was more conserved than LRR domain in both wild rice lines. NBS genes depicted higher levels of genetic diversity in Huaye 1 than that found in Huaye 2. Furthermore, protein-protein interaction analysis showed that NBS genes mostly interacted with the cytochrome C protein (Os05g0420600, Os01g0885000 and BGIOSGA038922), while some NBS genes interacted with heat shock protein, DNA-binding activity, Phosphoinositide 3-kinase and a coiled coil region. We explored abundant NBS-LRR encoding genes in two common wild rice lines through genome wide re
Hadjithomas, Michalis; Chen, I-Min A; Chu, Ken; Huang, Jinghua; Ratner, Anna; Palaniappan, Krishna; Andersen, Evan; Markowitz, Victor; Kyrpides, Nikos C; Ivanova, Natalia N
Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). In order to advance the exploration of microbial secondary metabolism, we developed the largest publically available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic gene clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualization features, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Zhu, Ling; Shahid, Muhammad A; Markham, John; Browning, Glenn F; Noormohammadi, Amir H; Marenda, Marc S
The bacterial pathogen Mycoplasma synoviae can cause subclinical respiratory disease, synovitis, airsacculitis and reproductive tract disease in poultry and is a major cause of economic loss worldwide. The M. synoviae strain MS-H was developed by chemical mutagenesis of an Australian isolate and has been used as a live attenuated vaccine in many countries over the past two decades. As a result it may now be the most prevalent strain of M. synoviae globally. Differentiation of the MS-H vaccine from local field strains is important for epidemiological investigations and is often required for registration of the vaccine. The complete genomic sequence of the MS-H strain was determined using a combination of Illumina and Nanopore methods and compared to WVU-1853, the M. synoviae type strain isolated in the USA 30 years before the parent strain of MS-H, and MS53, a more recent isolate from Brazil. The vaccine strain genome had a slightly larger number of pseudogenes than the two other strains and contained a unique 55 kb chromosomal inversion partially affecting a putative genomic island. Variations in gene content were also noted, including a deoxyribose-phosphate aldolase (deoC) fragment and an ATP-dependent DNA helicase gene found only in MS-H. Some of these sequences may have been acquired horizontally from other avian mycoplasma species. MS-H was somewhat more similar to WVU-1853 than to MS53. The genome sequence of MS-H will enable identification of vaccine-specific genetic markers for use as diagnostic and epidemiological tools to better control M. synoviae.
Sadhu, Abhishek; Bhadra, Sreetama; Bandyopadhyay, Maumita
Cytological parameters such as chromosome numbers and genome sizes of plants are used routinely for studying evolutionary aspects of polyploid plants. Members of Zingiberaceae show a wide range of inter- and intrageneric variation in their reproductive habits and ploidy levels. Conventional cytological study in this group of plants is severely hampered by the presence of diverse secondary metabolites, which also affect their genome size estimation using flow cytometry. None of the several nuclei isolation buffers used in flow cytometry could be used very successfully for members of Zingiberaceae to isolate good quality nuclei from both shoot and root tissues. The competency of eight nuclei isolation buffers was compared with a newly formulated buffer, MB01, in six different genera of Zingiberaceae based on the fluorescence intensity of propidium iodide-stained nuclei using flow cytometric parameters, namely coefficient of variation of the G 0 /G 1 peak, debris factor and nuclei yield factor. Isolated nuclei were studied using fluorescence microscopy and bio-scanning electron microscopy to analyse stain-nuclei interaction and nuclei topology, respectively. Genome contents of 21 species belonging to these six genera were determined using MB01. Flow cytometric parameters showed significant differences among the analysed buffers. MB01 exhibited the best combination of analysed parameters; photomicrographs obtained from fluorescence and electron microscopy supported the superiority of MB01 buffer over other buffers. Among the 21 species studied, nuclear DNA contents of 14 species are reported for the first time. Results of the present study substantiate the enhanced efficacy of MB01, compared to other buffers tested, in the generation of acceptable cytograms from all species of Zingiberaceae studied. Our study facilitates new ways of sample preparation for further flow cytometric analysis of genome size of other members belonging to this highly complex polyploid family
Gemperle-Britschgi, Corinne; Iorgulescu, Daniela; Mager, Monica Alina; Anton-Paduraru, Dana; Vulturar, Romana; Thöny, Beat
The mutation spectrum for the phenylalanine hydroxylase (PAH) gene was investigated in a cohort of 84 hyperphenylalaninemia (HPA) patients from Romania identified through newborn screening or neurometabolic investigations. Differential diagnosis identified 81 patients with classic PAH deficiency while 3 had tetrahydropterin-cofactor deficiency and/or remained uncertain due to insufficient specimen. PAH-genetic analysis included a combination of Sanger sequencing of exons and exon–intron boundaries, MLPA and NGS with genomic DNA, and cDNA analysis from immortalized lymphoblasts. A diagnostic efficiency of 99.4% was achieved, as for one allele (out of a total of 162 alleles) no mutation could be identified. The most prevalent mutation was p.Arg408Trp which was found in ~ 38% of all PKU alleles. Three novel mutations were identified, including the two missense mutations p.Gln226Lys and p.Tyr268Cys that were both disease causing by prediction algorithms, and the large genomic deletion EX6del7831 (c.509 + 4140_706 + 510del7831) that resulted in skipping of exon 6 based on PAH-cDNA analysis in immortalized lymphocytes. The genomic deletion was present in a heterozygous state in 12 patients, i.e. in ~ 8% of all the analyzed PKU alleles, and might have originated from a Romanian founder.
Full Text Available Brachial circumference (BC, also known as upper arm or mid arm circumference, can be used as an indicator of muscle mass and fat tissue, which are distributed differently in men and women. Analysis of anthropometric measures of peripheral fat distribution such as BC could help in understanding the complex pathophysiology behind overweight and obesity. The purpose of this study is to identify genetic variants associated with BC through a large-scale genome-wide association scan (GWAS meta-analysis. We used fixed-effects meta-analysis to synthesise summary results across 14 GWAS discovery and 4 replication cohorts comprising overall 22,376 individuals (12,031 women and 10,345 men of European ancestry. Individual analyses were carried out for men, women, and combined across sexes using linear regression and an additive genetic model: adjusted for age and adjusted for age and BMI. We prioritised signals for follow-up in two-stages. We did not detect any signals reaching genome-wide significance. The FTO rs9939609 SNP showed nominal evidence for association (p<0.05 in the age-adjusted strata for men and across both sexes. In this first GWAS meta-analysis for BC to date, we have not identified any genome-wide significant signals and do not observe robust association of previously established obesity loci with BC. Large-scale collaborations will be necessary to achieve higher power to detect loci underlying BC.
Deng, Han-Xiang; Zhai, Hong; Bigio, Eileen H; Yan, Jianhua; Fecto, Faisal; Ajroud, Kaouther; Mishra, Manjari; Ajroud-Driss, Senda; Heller, Scott; Sufit, Robert; Siddique, Nailah; Mugnaini, Enrico; Siddique, Teepu
Amyotrophic lateral sclerosis (ALS) is a fatal disorder of motor neuron degeneration. Most cases of ALS are sporadic (SALS), but about 5 to 10% of ALS cases are familial (FALS). Recent studies have shown that mutations in FUS are causal in approximately 4 to 5% of FALS and some apparent SALS cases. The pathogenic mechanism of the mutant FUS-mediated ALS and potential roles of FUS in non-FUS ALS remain to be investigated. Immunostaining was performed on postmortem spinal cords from 78 ALS cases, including SALS (n = 52), ALS with dementia (ALS/dementia, n = 10), and FALS (n = 16). In addition, postmortem brains or spinal cords from 22 cases with or without frontotemporal lobar degeneration were also studied. In total, 100 cases were studied. FUS-immunoreactive inclusions were observed in spinal anterior horn neurons in all SALS and FALS cases, except for those with SOD1 mutations. The FUS-containing inclusions were also immunoreactive with antibodies to TDP43, p62, and ubiquitin. A fraction of tested FUS antibodies recognized FUS inclusions, and specific antigen retrieval protocol appeared to be important for detection of the skein-like FUS inclusions. Although mutations in FUS account for only a small fraction of FALS and SALS, our data suggest that FUS protein may be a common component of the cellular inclusions in non-SOD1 ALS and some other neurodegenerative conditions, implying a shared pathogenic pathway underlying SALS, non-SOD1 FALS, ALS/dementia, and related disorders. Our data also indicate that SOD1-linked ALS may have a pathogenic pathway distinct from SALS and other types of FALS.
Schwarz, Erika N; Ruhlman, Tracey A; Weng, Mao-Lun; Khiyami, Mohammad A; Sabir, Jamal S M; Hajarah, Nahid H; Alharbi, Njud S; Rabah, Samar O; Jansen, Robert K
This study represents the most comprehensive plastome-wide comparison of nucleotide substitution rates across the three subfamilies of Fabaceae: Caesalpinioideae, Mimosoideae, and Papilionoideae. Caesalpinioid and mimosoid legumes have large, unrearranged plastomes compared with papilionoids, which exhibit varying levels of rearrangement including the loss of the inverted repeat (IR) in the IR-lacking clade (IRLC). Using 71 genes common to 39 legume taxa representing all the three subfamilies, we show that papilionoids consistently have higher nucleotide substitution rates than caesalpinioids and mimosoids, and rates in the IRLC papilionoids are generally higher than those in the IR-containing papilionoids. Unsurprisingly, this pattern was significantly correlated with growth habit as most papilionoids are herbaceous, whereas caesalpinioids and mimosoids are largely woody. Both nonsynonymous (dN) and synonymous (dS) substitution rates were also correlated with several biological features including plastome size and plastomic rearrangements such as the number of inversions and indels. In agreement with previous reports, we found that genes in the IR exhibit between three and fourfold reductions in the substitution rates relative to genes within the large single-copy or small single-copy regions. Furthermore, former IR genes in IR-lacking taxa exhibit accelerated rates compared with genes contained in the IR.
Santini, Sebastien; Jeudy, Sandra; Bartoli, Julia; Poirot, Olivier; Lescot, Magali; Abergel, Chantal; Barbe, Valérie; Wommack, K. Eric; Noordeloos, Anna A. M.; Brussaard, Corina P. D.; Claverie, Jean-Michel
Large dsDNA viruses are involved in the population control of many globally distributed species of eukaryotic phytoplankton and have a prominent role in bloom termination. The genus Phaeocystis (Haptophyta, Prymnesiophyceae) includes several high-biomass-forming phytoplankton species, such as Phaeocystis globosa, the blooms of which occur mostly in the coastal zone of the North Atlantic and the North Sea. Here, we report the 459,984-bp-long genome sequence of P. globosa virus strain PgV-16T, encoding 434 proteins and eight tRNAs and, thus, the largest fully sequenced genome to date among viruses infecting algae. Surprisingly, PgV-16T exhibits no phylogenetic affinity with other viruses infecting microalgae (e.g., phycodnaviruses), including those infecting Emiliania huxleyi, another ubiquitous bloom-forming haptophyte. Rather, PgV-16T belongs to an emerging clade (the Megaviridae) clustering the viruses endowed with the largest known genomes, including Megavirus, Mimivirus (both infecting acanthamoeba), and a virus infecting the marine microflagellate grazer Cafeteria roenbergensis. Seventy-five percent of the best matches of PgV-16T–predicted proteins correspond to two viruses [Organic Lake phycodnavirus (OLPV)1 and OLPV2] from a hypersaline lake in Antarctica (Organic Lake), the hosts of which are unknown. As for OLPVs and other Megaviridae, the PgV-16T sequence data revealed the presence of a virophage-like genome. However, no virophage particle was detected in infected P. globosa cultures. The presence of many genes found only in Megaviridae in its genome and the presence of an associated virophage strongly suggest that PgV-16T shares a common ancestry with the largest known dsDNA viruses, the host range of which already encompasses the earliest diverging branches of domain Eukarya. PMID:23754393
Gregory E. Perry
Full Text Available Resistance to common bacterial blight, caused by Xanthomonas axonopodis pv. phaseoli, in Phaseolus vulgaris is conditioned by several loci on different chromosomes. Previous studies with OAC-Rex, a CBB-resistant, white bean variety of Mesoamerican origin, identified two resistance loci associated with the molecular markers Pv-CTT001 and SU91, on chromosome 4 and 8, respectively. Resistance to CBB is assumed to be derived from an interspecific cross with Phaseolus acutifolius in the pedigree of OAC-Rex. Our current whole genome sequencing effort with OAC-Rex provided the opportunity to compare its genome in the regions associated with CBB resistance with the v1.0 release of the P. vulgaris line G19833, which is a large seeded bean of Andean origin, and (assumed to be CBB susceptible.. In addition, the genomic regions containing SAP6, a marker associated with P. vulgaris-derived CBB-resistance on chromosome 10, were compared. These analyses indicated that gene content was highly conserved between G19833 and OAC-Rex across the regions examined (>80%. However, fifty-nine genes unique to OAC Rex were identified, with resistance gene homologues making up the largest category (10 genes identified. Two unique genes in OAC-Rex located within the SU91 resistance QTL have homology to P. acutifolius ESTs and may be potential sources of CBB resistance. As the genomic sequence assembly of OAC-Rex is completed, we expect that further comparisons between it and the G19833 genome will lead to a greater understanding of CBB resistance in bean.
Pierre J G M de Wit
Full Text Available We sequenced and compared the genomes of the Dothideomycete fungal plant pathogens Cladosporium fulvum (Cfu (syn. Passalora fulva and Dothistroma septosporum (Dse that are closely related phylogenetically, but have different lifestyles and hosts. Although both fungi grow extracellularly in close contact with host mesophyll cells, Cfu is a biotroph infecting tomato, while Dse is a hemibiotroph infecting pine. The genomes of these fungi have a similar set of genes (70% of gene content in both genomes are homologs, but differ significantly in size (Cfu >61.1-Mb; Dse 31.2-Mb, which is mainly due to the difference in repeat content (47.2% in Cfu versus 3.2% in Dse. Recent adaptation to different lifestyles and hosts is suggested by diverged sets of genes. Cfu contains an α-tomatinase gene that we predict might be required for detoxification of tomatine, while this gene is absent in Dse. Many genes encoding secreted proteins are unique to each species and the repeat-rich areas in Cfu are enriched for these species-specific genes. In contrast, conserved genes suggest common host ancestry. Homologs of Cfu effector genes, including Ecp2 and Avr4, are present in Dse and induce a Cf-Ecp2- and Cf-4-mediated hypersensitive response, respectively. Strikingly, genes involved in production of the toxin dothistromin, a likely virulence factor for Dse, are conserved in Cfu, but their expression differs markedly with essentially no expression by Cfu in planta. Likewise, Cfu has a carbohydrate-degrading enzyme catalog that is more similar to that of necrotrophs or hemibiotrophs and a larger pectinolytic gene arsenal than Dse, but many of these genes are not expressed in planta or are pseudogenized. Overall, comparison of their genomes suggests that these closely related plant pathogens had a common ancestral host but since adapted to different hosts and lifestyles by a combination of differentiated gene content, pseudogenization, and gene regulation.
de Wit, Pierre J. G. M.; van der Burgt, Ate; Okmen, Bilal; Stergiopoulos, Ioannis; Abd-Elsalam, Kamel A.; Aerts, Andrea L.; Bahkali, Ali H.; Beenen, Henriek G.; Chettri, Oranav; Cos, Murray P.; Datema, Erwin; de Vries, Ronald P.; DHillon, Braham; Ganley, Austen R.; Griffiths, Scott A.; Guo, Yanan; Gamelin, Richard C.; Henrissat, Bernard; Kabir, M. Shahjahan; Jashni, Mansoor Karimi; Kema, Gert; Klaubauf, Sylvia; Lapidus, Alla; Levasseur, Anthony; Lindquist, Erika; Mehrabi, Rahim; Ohm, Robin A.; Owen, Timothy J.; Salamov, Asaf; Schwelm, Arne; Schijlen, Elio; Sun, Hui; van den Burg, Harrold A.; van Burg, Roeland C. H. J.; Zhang, Shuguang; Goodwin, Stephen B.; Grigoriev, Igor V.; Collemare, Jerome; Bradshaw, Rosie E.
We sequenced and compared the genomes of the Dothideomycete fungal plant pathogens Cladosporium fulvum (Cfu) (syn. Passalora fulva) and Dothistroma septosporum (Dse) that are closely related phylogenetically, but have different lifestyles and hosts. Although both fungi grow extracellularly in close contact with host mesophyll cells, Cfu is a biotroph infecting tomato, while Dse is a hemibiotroph infecting pine. The genomes of these fungi have a similar set of genes (70percent of gene content in both genomes are homologs), but differ significantly in size (Cfu >61.1-Mb; Dse 31.2-Mb), which is mainly due to the difference in repeat content (47.2percent in Cfu versus 3.2percent in Dse). Recent adaptation to different lifestyles and hosts is suggested by diverged sets of genes. Cfu contains an tomatinase gene that we predict might be required for detoxification of tomatine, while this gene is absent in Dse. Many genes encoding secreted proteins are unique to each species and the repeat-rich areas in Cfu are enriched for these species-specific genes. In contrast, conserved genes suggest common host ancestry. Homologs of Cfu effector genes, including Ecp2 and Avr4, are present in Dse and induce a Cf-Ecp2- and Cf-4-mediated hypersensitive response, respectively. Strikingly, genes involved in production of the toxin dothistromin, a likely virulence factor for Dse, are conserved in Cfu, but their expression differs markedly with essentially no expression by Cfu in planta. Likewise, Cfu has a carbohydrate-degrading enzyme catalog that is more similar to that of necrotrophs or hemibiotrophs and a larger pectinolytic gene arsenal than Dse, but many of these genes are not expressed in planta or are pseudogenized. Overall, comparison of their genomes suggests that these closely related plant pathogens had a common ancestral host but since adapted to different hosts and lifestyles by a combination of differentiated gene content, pseudogenization, and gene regulation.
Ruan, Yunfeng; Shen, Lu; Zou, Yan; Qi, Zhengnan; Yin, Jun; Jiang, Jie; Guo, Liang; He, Lin; Chen, Zijiang; Tang, Zisheng; Qin, Shengying
Many species of the genus Prevotella are pathogens that cause oral diseases. Prevotella intermedia is known to cause various oral disorders e.g. periodontal disease, periapical periodontitis and noma as well as colonize in the respiratory tract and be associated with cystic fibrosis and chronic bronchitis. It is of clinical significance to identify the main drive of its various adaptation and pathogenicity. In order to explore the intra-species genetic differences among strains of Prevotella intermedia of different niches, we isolated a strain Prevotella intermedia ZT from the infected root canal of a Chinese patient with periapical periodontitis and gained a draft genome sequence. We annotated the genome and compared it with the genomes of other taxa in the genus Prevotella. The raw data set, consisting of approximately 65X-coverage reads, was trimmed and assembled into contigs from which 2165 ORFs were predicted. The comparison of the Prevotella intermedia ZT genome sequence with the published genome sequence of Prevotella intermedia 17 and Prevotella intermedia ATCC25611 revealed that ~14% of the genes were strain-specific. The Preveotella intermedia strains share a set of conserved genes contributing to its adaptation and pathogenic and possess strain-specific genes especially those involved in adhesion and secreting bacteriocin. The Prevotella intermedia ZT shares similar gene content with other taxa of genus Prevotella. The genomes of the genus Prevotella is highly dynamic with relative conserved parts: on average, about half of the genes in one Prevotella genome were not included in another genome of the different Prevotella species. The degree of conservation varied with different pathways: the ability of amino acid biosynthesis varied greatly with species but the pathway of cell wall components biosynthesis were nearly constant. Phylogenetic tree shows that the taxa from different niches are scarcely distributed among clades. Prevotella intermedia ZT
Farkas, Szilvia L; Benko, Mária; Elo, Péter; Ursu, Krisztina; Dán, Adám; Ahne, Winfried; Harrach, Balázs
Approximately 60% of the genome of an adenovirus isolated from a corn snake (Elaphe guttata) was cloned and sequenced. The results of homology searches showed that the genes of the corn snake adenovirus (SnAdV-1) were closest to their counterparts in members of the recently proposed new genus ATADENOVIRUS: In phylogenetic analyses of the complete hexon and protease genes, SnAdV-1 indeed clustered together with the atadenoviruses. The characteristic features in the genome organization of SnAdV-1 included the presence of a gene homologous to that for protein p32K, the lack of structural proteins V and IX and the absence of homologues of the E1A and E3 regions. These characteristics are in accordance with the genus-defining markers of atadenoviruses. Comparison of the cleavage sites of the viral protease in core protein pVII also confirmed SnAdV-1 as a candidate member of the genus ATADENOVIRUS: Thus, the hypothesis on the possible reptilian origin of atadenoviruses (Harrach, Acta Veterinaria Hungarica 48, 484-490, 2000) seems to be supported. However, the base composition of DNA sequence (>18 kb) determined from the SnAdV-1 genome showed an equilibrated GC content of 51%, which is unusual for an atadenovirus.
Sanborn, Adrian; Rao, Suhas; Huang, Su-Chen; Durand, Neva; Huntley, Miriam; Jewett, Andrew; Bochkov, Ivan; Chinnappan, Dharmaraj; Cutkosky, Ashok; Li, Jian; Geeting, Kristopher; McKenna, Doug; Stamenova, Elena; Gnirke, Andreas; Melnikov, Alexandre; Lander, Eric; Aiden, Erez
Our recent kilobase-resolution genome-wide maps of DNA self-contacts demonstrated that mammalian genomes are organized into domains and loops demarcated by the DNA-binding protein CTCF. Here, we combine these maps with new Hi-C, microscopy, and genome-editing experiments to study the physical structure of chromatin fibers, domains, and loops. We find that domains are inconsistent with equilibrium and fractal models. Instead, we use physical simulations to study two models of genome folding. In one, intermonomer attraction during condensation leads to formation of an anisotropic ``tension globule.'' In the other, CTCF and cohesin act together to extrude unknotted loops. Both models are consistent with the observed domains and loops. However, the extrusion model explains a far wider array of observations, such as why the CTCF-binding motifs at pairs of loop anchors lie in the convergent orientation. Finally, we perform 13 genome-editing experiments examining the effect of altering CTCF-binding sites on chromatin folding. The extrusion model predicts in silico the experimental maps using only CTCF-binding sites. Thus, we show that it is possible to disrupt, restore, and move loops and domains using targeted mutations as small as a single base pair.
Full Text Available In genome analysis, k-mer-based comparison methods have become standard tools. However, even though they are able to deliver reliable results, other algorithms seem to work better in some cases. To improve k-mer-based DNA sequence analysis and comparison, we successfully checked whether adding positional resolution is beneficial for finding and/or comparing interesting organizational structures. A simple but efficient algorithm for extracting and saving local k-mer spectra (frequency distribution of k-mers was developed and used. The results were analyzed by including positional information based on visualizations as genomic maps and by applying basic vector correlation methods. This analysis was concentrated on small word lengths (1 ≤ k ≤ 4 on relatively small viral genomes of Papillomaviridae and Herpesviridae, while also checking its usability for larger sequences, namely human chromosome 2 and the homologous chromosomes (2A, 2B of a chimpanzee. Using this alignment-free analysis, several regions with specific characteristics in Papillomaviridae and Herpesviridae formerly identified by independent, mostly alignment-based methods, were confirmed. Correlations between the k-mer content and several genes in these genomes have been found, showing similarities between classified and unclassified viruses, which may be potentially useful for further taxonomic research. Furthermore, unknown k-mer correlations in the genomes of Human Herpesviruses (HHVs, which are probably of major biological function, are found and described. Using the chromosomes of a chimpanzee and human that are currently known, identities between the species on every analyzed chromosome were reproduced. This demonstrates the feasibility of our approach for large data sets of complex genomes. Based on these results, we suggest k-mer analysis with positional resolution as a method for closing a gap between the effectiveness of alignment-based methods (like NCBI BLAST and the
Gronwald John W
Full Text Available Abstract Background Common bean (Phaseolus vulgaris L. and soybean (Glycine max both belong to the Phaseoleae tribe and share significant coding sequence homology. This suggests that the GeneChip® Soybean Genome Array (soybean GeneChip may be used for gene expression studies using common bean. Results To evaluate the utility of the soybean GeneChip for transcript profiling of common bean, we hybridized cRNAs purified from nodule, leaf, and root of common bean and soybean in triplicate to the soybean GeneChip. Initial data analysis showed a decreased sensitivity and accuracy of measuring differential gene expression in common bean cross-species hybridization (CSH GeneChip data compared to that of soybean. We employed a method that masked putative probes targeting inter-species variable (ISV regions between common bean and soybean. A masking signal intensity threshold was selected that optimized both sensitivity and accuracy of measuring differential gene expression. After masking for ISV regions, the number of differentially-expressed genes identified in common bean was increased by 2.8-fold reflecting increased sensitivity. Quantitative RT-PCR (qRT-PCR analysis of 20 randomly selected genes and purine-ureide pathway genes demonstrated an increased accuracy of measuring differential gene expression after masking for ISV regions. We also evaluated masked probe frequency per probe set to gain insight into the sequence divergence pattern between common bean and soybean. The sequence divergence pattern analysis suggested that the genes for basic cellular functions and metabolism were highly conserved between soybean and common bean. Additionally, our results show that some classes of genes, particularly those associated with environmental adaptation, are highly divergent. Conclusions The soybean GeneChip is a suitable cross-species platform for transcript profiling in common bean when used in combination with the masking protocol described. In
Full Text Available During the Deepwater Horizon (DWH oil spill in the Gulf of Mexico a deep-sea hydrocarbon plume developed resulting in a rapid succession of bacteria. Colwellia eventually supplanted Oceanospirillales, which dominated the plume early in the spill. These successional changes may have resulted, in part, from the changing composition and abundance of hydrocarbons over time. Colwellia abundance peaked when gaseous and simple aromatic hydrocarbons increased, yet the metabolic pathway used by Colwellia in hydrocarbon disposition is unknown. Here we used single-cell genomics to gain insights into the genome properties of a Colwellia enriched during the DWH deep-sea plume. A single amplified genome (SAG of a Colwellia cell isolated from a DWH plume, closely related (avg. 98% 16S rRNA gene similarity to other plume Colwellia, was sequenced and annotated. The SAG was similar to the sequenced isolate Colwellia psychrerythraea 34H (84% avg. nucleotide identity. Both had genes for denitrification, chemotaxis and motility, adaptations to cold environments, and a suite of nutrient acquisition genes. The Colwellia SAG may be capable of gaseous and aromatic hydrocarbon degradation, which contrasts with a DWH plume Oceanospirillales SAG genome which encoded non-gaseous n-alkane and cycloalkane degradation. The disparate hydrocarbon degradation pathways are consistent with hydrocarbons that were abundant at different times in the deep-sea plume; first, non-gaseous n-alkanes and cycloalkanes that could be degraded by Oceanospirillales, followed by gaseous, and simple aromatic hydrocarbons that may have been degraded by Colwellia. These insights into the genomic properties of a Colwellia species, which were supported by existing metagenomic sequence data from the plume and DWH contaminated sediments, help further our understanding of the successional changes in the dominant microbial players in the plume over the course of the DWH spill.
Böhme, Ulrike; Otto, Thomas D.; Cotton, James A.; Steinbiss, Sascha; Sanders, Mandy; Oyola, Samuel O.; Nicot, Antoine; Gandon, Sylvain; Patra, Kailash P.; Herd, Colin; Bushell, Ellen; Modrzynska, Katarzyna K.; Billker, Oliver; Vinetz, Joseph M.; Rivero, Ana; Newbold, Chris I.; Berriman, Matthew
Avian malaria parasites are prevalent around the world and infect a wide diversity of bird species. Here, we report the sequencing and analysis of high-quality draft genome sequences for two avian malaria species, Plasmodium relictum and Plasmodium gallinaceum. We identify 50 genes that are specific to avian malaria, located in an otherwise conserved core of the genome that shares gene synteny with all other sequenced malaria genomes. Phylogenetic analysis suggests that the avian malaria species form an outgroup to the mammalian Plasmodium species, and using amino acid divergence between species, we estimate the avian- and mammalian-infective lineages diverged in the order of 10 million years ago. Consistent with their phylogenetic position, we identify orthologs of genes that had previously appeared to be restricted to the clades of parasites containing Plasmodium falciparum and Plasmodium vivax, the species with the greatest impact on human health. From these orthologs, we explore differential diversifying selection across the genus and show that the avian lineage is remarkable in the extent to which invasion-related genes are evolving. The subtelomeres of the P. relictum and P. gallinaceum genomes contain several novel gene families, including an expanded surf multigene family. We also identify an expansion of reticulocyte binding protein homologs in P. relictum, and within these proteins, we detect distinct regions that are specific to nonhuman primate, humans, rodent, and avian hosts. For the first time in the Plasmodium lineage, we find evidence of transposable elements, including several hundred fragments of LTR-retrotransposons in both species and an apparently complete LTR-retrotransposon in the genome of P. gallinaceum. PMID:29500236
Goswami, Sathi; Sanyal, Sulagna; Chakraborty, Payal; Das, Chandrima; Sarkar, Munna
NSAIDs are the most common class of painkillers and anti-inflammatory agents. They also show other functions like chemoprevention and chemosuppression for which they act at the protein but not at the genome level since they are mostly anions at physiological pH, which prohibit their approach to the poly-anionic DNA. Complexing the drugs with bioactive metal obliterate their negative charge and allow them to bind to the DNA, thereby, opening the possibility of genome level interaction. To test this hypothesis, we present the interaction of a traditional NSAID, Piroxicam and its copper complex with core histone and chromatin. Spectroscopy, DLS, and SEM studies were applied to see the effect of the interaction on the structure of histone/chromatin. This was coupled with MTT assay, immunoblot analysis, confocal microscopy, micro array analysis and qRT-PCR. The interaction of Piroxicam and its copper complex with histone/chromatin results in structural alterations. Such structural alterations can have different biological manifestations, but to test our hypothesis, we have focused only on the accompanied modulations at the epigenomic/genomic level. The complex, showed alteration of key epigenetic signatures implicated in transcription in the global context, although Piroxicam caused no significant changes. We have correlated such alterations caused by the complex with the changes in global gene expression and validated the candidate gene expression alterations. Our results provide the proof of concept that DNA binding ability of the copper complexes of a traditional NSAID, opens up the possibility of modulations at the epigenomic/genomic level. Copyright © 2017 Elsevier B.V. All rights reserved.
Orozco, Gisela; Goh, Chee L; Al Olama, Ali Amin; Benlloch-Garcia, Sara; Govindasami, Koveela; Guy, Michelle; Muir, Kenneth R; Giles, Graham G; Severi, Gianluca; Neal, David E; Hamdy, Freddie C; Donovan, Jenny L; Kote-Jarai, Zsofia; Easton, Douglas F; Eyre, Steve; Eeles, Rosalind A
WHAT'S KNOWN ON THE SUBJECT? AND WHAT DOES THE STUDY ADD?: The link between inflammation and cancer has long been reported and inflammation is thought to play a role in the pathogenesis of many cancers, including prostate cancer (PrCa). Over the last 5 years, genome-wide association studies (GWAS) have reported numerous susceptibility loci that predispose individuals to many different traits. The present study aims to ascertain if there are common genetic risk profiles that might predispose individuals to both PrCa and the autoimmune inflammatory condition, rheumatoid arthritis. These results could have potential public heath impact in terms of screening and chemoprevention. To investigate if potential common pathways exist for the pathogenesis of autoimmune disease and prostate cancer (PrCa). To ascertain if the single nucleotide polymorphisms (SNPs) reported by genome-wide association studies (GWAS) as being associated with susceptibility to PrCa are also associated with susceptibility to the autoimmune disease rheumatoid arthritis (RA). The original Wellcome Trust Case Control Consortium (WTCCC) UK RA GWAS study was expanded to include a total of 3221 cases and 5272 controls. In all, 37 germline autosomal SNPs at genome-wide significance associated with PrCa risk were identified from a UK/Australian PrCa GWAS. Allele frequencies were compared for these 37 SNPs between RA cases and controls using a chi-squared trend test and corrected for multiple testing (Bonferroni). In all, 33 SNPs were able to be analysed in the RA dataset. Proxies could not be located for the SNPs in 3q26, 5p15 and for two SNPs in 17q12. After applying a Bonferroni correction for the number of SNPs tested, the SNP mapping to CCHCR1 (rs130067) retained statistically significant evidence for association (P = 6 × 10(-4) ; odds ratio [OR] = 1.15, 95% CI: 1.06-1.24); this has also been associated with psoriasis. However, further analyses showed that the association of this allele was due to
Behr, Elijah R.; Ritchie, Marylyn D.; Tanaka, Toshihiro; Kääb, Stefan; Crawford, Dana C.; Nicoletti, Paola; Floratos, Aris; Sinner, Moritz F.; Kannankeril, Prince J.; Wilde, Arthur A. M.; Bezzina, Connie R.; Schulze-Bahr, Eric; Zumhagen, Sven; Guicheney, Pascale; Bishopric, Nanette H.; Marshall, Vanessa; Shakir, Saad; Dalageorgou, Chrysoula; Bevan, Steve; Jamshidi, Yalda; Bastiaenen, Rachel; Myerburg, Robert J.; Schott, Jean-Jacques; Camm, A. John; Steinbeck, Gerhard; Norris, Kris; Altman, Russ B.; Tatonetti, Nicholas P.; Jeffery, Steve; Kubo, Michiaki; Nakamura, Yusuke; Shen, Yufeng; George, Alfred L.; Roden, Dan M.
Marked prolongation of the QT interval on the electrocardiogram associated with the polymorphic ventricular tachycardia Torsades de Pointes is a serious adverse event during treatment with antiarrhythmic drugs and other culprit medications, and is a common cause for drug relabeling and withdrawal.
T. Tanaka (Toshiko); J.S. Ngwa; F.J.A. van Rooij (Frank); M.C. Zillikens (Carola); M.K. Wojczynski (Mary ); A.C. Frazier-Wood (Alexis); D.K. Houston (Denise); S. Kanoni (Stavroula); R.N. Lemaitre (Rozenn ); J. Luan; V. Mikkilä (Vera); F. Renström (Frida); E. Sonestedt (Emily); J.H. Zhao (Jing Hua); A.Y. Chu (Audrey); L. Qi (Lu); D.I. Chasman (Daniel); M.C. De Oliveira Otto (Marcia); E.J. Dhurandhar (Emily); M.F. Feitosa (Mary Furlan); I. Johansson (Ingegerd); K-T. Khaw (Kay-Tee); K. Lohman (Kurt); A. Manichaikul (Ani); N.M. McKeown (Nicola ); D. Mozaffarian (Dariush); A.B. Singleton (Andrew); K. Stirrups (Kathy); J. Viikari (Jorma); Z. Ye (Zheng); S. Bandinelli (Stefania); I.E. Barroso (Inês); P. Deloukas (Panagiotis); N.G. Forouhi (Nita); A. Hofman (Albert); Y. Liu (YongMei); L.-P. Lyytikäinen (Leo-Pekka); K.E. North (Kari); M. Dimitriou (Maria); G. Hallmans (Göran); M. Kähönen (Mika); C. Langenberg (Claudia); J.M. Ordovas (Jose); A.G. Uitterlinden (André); F.B. Hu (Frank); I.-P. Kalafati (Ioanna-Panagiota); O. Raitakari (Olli); O.H. Franco (Oscar); A. Johnson (Anthony); V. Emilsson (Valur); J.A. Schrack (Jennifer); R.D. Semba; D.S. Siscovick (David); D.K. Arnett (Donna); I.B. Borecki (Ingrid); P.W. Franks (Paul); S.B. Kritchevsky (Stephen); R.J.F. Loos (Ruth); M. Orho-Melander (Marju); J.I. Rotter (Jerome); N.J. Wareham (Nick); J.C.M. Witteman (Jacqueline); L. Ferrucci (Luigi); G.V. Dedoussis (George); L.A. Cupples (Adrienne); J.A. Nettleton (Jennifer )
textabstractBackground: Macronutrient intake varies substantially between individuals, and there is evidence that this variation is partly accounted for by genetic variants. Objective: The objective of the study was to identify common genetic variants that are associated with macronutrient intake.
Gong, Yiwen; Feng, Shuaisheng; Li, Shangqi; Zhang, Yan; Zhao, Zixia; Hu, Mou; Xu, Peng; Jiang, Yanliang
The Toll-like receptor (TLR) gene family is a class of conserved pattern recognition receptors, which play an essential role in innate immunity providing efficient defense against invading microbial pathogens. Although TLRs have been extensively characterized in both invertebrates and vertebrates, a comprehensive analysis of TLRs in common carp is lacking. In the present study, we have conducted the first genome-wide systematic analysis of common carp (Cyprinus carpio) TLR genes. A set of 27 common carp TLR genes were identified and characterized. Sequence similarity analysis, functional domain prediction and phylogenetic analysis supported their annotation and orthologies. By examining the gene copy number of TLR genes across several vertebrates, gene duplications and losses were observed. The expression patterns of TLR genes were examined during early developmental stages and in various healthy tissues, and the results showed that TLR genes were ubiquitously expressed, indicating a likely role in maintaining homeostasis. Moreover, the differential expression of TLRs was examined after Aeromons hydrophila infection, and showed that most TLR genes were induced, with diverse patterns. TLR1, TLR4-2, TLR4-3, TLR22-2, TLR22-3 were significantly up-regulated at minimum one timepoint, whereas TLR2-1, TLR4-1, TLR7-1 and TLR7-2 were significantly down-regulated. Our results suggested that TLR genes play critical roles in the common carp immune response. Collectively, our findings provide fundamental genomic resources for future studies on fish disease management and disease-resistance selective breeding strategy development. Copyright © 2017 Elsevier Inc. All rights reserved.
Full Text Available Upon infection, many RNA viruses reorganize their capsid for release of the genome into the host cell cytosol for replication. Often, this process is triggered by receptor binding and/or by the acidic environment in endosomes. In the genus Enterovirus, which includes more than 150 human rhinovirus (HRV serotypes causing the common cold, there is persuasive evidence that the viral RNA exits single-stranded through channels formed in the protein shell. We have determined the time-dependent emergence of the RNA ends from HRV2 on incubation of virions at 56°C using hybridization with specific oligonucleotides and detection by fluorescence correlation spectroscopy. We report that psoralen UV crosslinking prevents complete RNA release, allowing for identification of the sequences remaining inside the capsid. We also present the structure of uncoating intermediates in which parts of the RNA are condensed and take the form of a rod that is directed roughly towards a two-fold icosahedral axis, the presumed RNA exit point. Taken together, in contrast to schemes frequently depicted in textbooks and reviews, our findings demonstrate that exit of the RNA starts from the 3'-end. This suggests that packaging also occurs in an ordered manner resulting in the 3'-poly-(A tail becoming located close to a position of pore formation during conversion of the virion into a subviral particle. This directional genome release may be common to many icosahedral non-enveloped single-stranded RNA viruses.
Theo H M Smits
Full Text Available BACKGROUND: Pantoea vagans is a commercialized biological control agent used against the pome fruit bacterial disease fire blight, caused by Erwinia amylovora. Compared to other biocontrol agents, relatively little is currently known regarding Pantoea genetics. Better understanding of antagonist mechanisms of action and ecological fitness is critical to improving efficacy. PRINCIPAL FINDINGS: Genome analysis indicated two major factors Contribute to biocontrol activity: competition for limiting substrates and antibacterial metabolite production. Pathways for utilization of a broad diversity of sugars and acquisition of iron were identified. Metabolism of sorbitol by P. vagans C9-1 may be a major metabolic feature in biocontrol of fire blight. Biosynthetic genes for the antibacterial peptide pantocin A were found on a chromosomal 28-kb genomic island, and for dapdiamide E on the plasmid pPag2. There was no evidence of potential virulence factors that could enable an animal or phytopathogenic lifestyle and no indication of any genetic-based biosafety risk in the antagonist. CONCLUSIONS: Identifying key determinants contributing to disease suppression allows the development of procedures to follow their expression in planta and the genome sequence contributes to rationale risk assessment regarding the use of the biocontrol strain in agricultural systems.
Nasrullah, Izza; Butt, Azeem M; Tahir, Shifa; Idrees, Muhammad; Tong, Yigang
The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV have evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.
Carvalho Carla M
Full Text Available Abstract Background Campylobacter is the leading cause of foodborne diseases worldwide. Bacteriophages (phages are naturally occurring predators of bacteria, ubiquitous in the environment, with high host specificity and thus considered an appealing option to control bacterial pathogens. Nevertheless for an effective use of phages as antimicrobial agents, it is important to understand phage biology which renders crucial the analysis of phage genomes and proteomes. The lack of sequence data from Campylobacter phages adds further importance to these studies. Methods vB_CcoM-IBB_35 is a broad lytic spectrum Myoviridae Campylobacter phage with high potential for therapeutic use. The genome of this phage was obtained by pyrosequencing and the sequence data was further analyzed. The proteomic analysis was performed by SDS-PAGE and Mass spectrometry. Results and conclusions The DNA sequence data of vB_CcoM-IBB_35 consists of five contigs for a total of 172,065 bp with an average GC content of 27%. Attempts to close the gaps between contigs were unsuccessful since the DNA preparations appear to contain substances that inhibited Taq and ϕ29 polymerases. From the 210 identified ORFs, around 60% represent proteins that were not functionally assigned. Homology exists with members of the Teequatrovirinae namely for T4 proteins involved in morphogenesis, nucleotide metabolism, transcription, DNA replication and recombination. Tandem mass spectrometric analysis revealed 38 structural proteins as part of the mature phage particle. Conclusions Genes encoding proteins involved in the carbohydrate metabolism along with several incidences of gene duplications, split genes with inteins and introns have been rarely found in other phage genomes yet are found in this phage. We identified the genes encoding for tail fibres and for the lytic cassette, this later, expressing enzymes for bacterial capsular polysaccharides (CPS degradation, which has not been reported
Ogi, Miki; Yano, Yoshihiko; Chikahira, Masatsugu; Takai, Denshi; Oshibe, Tomohiro; Arashiro, Takeshi; Hanaoka, Nozomu; Fujimoto, Tsuguto; Hayashi, Yoshitake
Coxsackievirus A6 (CV-A6) is an enterovirus, which is known to cause herpangina. However, since 2009 it has frequently been isolated from children with hand, foot, and mouth disease (HFMD). In Japan, CV-A6 has been linked to HFMD outbreaks in 2011 and 2013. In this study, the full-length genome sequencing of CV-A6 strains were analyzed to identify the association with clinical manifestations. Five thousand six hundred and twelve children with suspected enterovirus infection (0-17 years old) between 1999 and 2013 in Hyogo Prefecture, Japan, were enrolled. Enterovirus infection was confirmed with reverse transcriptase-PCR in 753 children (791 samples), 127 of whom (133 samples) were positive for CV-A6 based on the direct sequencing of the VP4 region. The complete genomes of CV-A6 from 22 positive patients with different clinical manifestations were investigated. A phylogenetic analysis divided these 22 strains into two clusters based on the VP1 region; cluster I contained strains collected in 1999-2009 and mostly related to herpangina, and cluster II contained strains collected in 2011-2013 and related to HFMD outbreak. Based on the full-length polyprotein analysis, the amino acid differences between the strains in cluster I and II were 97.7 ± 0.28%. Amino acid differences were detected in 17 positions within the polyprotein. Strains collected in 1999-2009 and those in 2011-2013 were separately clustered by phylogenetic analysis based on 5'UTR and 3Dpol region, as well as VP1 region. In conclusion, HFMD outbreaks by CV-A6 were recently frequent in Japan and the accumulation of genomic change might be associated with the clinical course. © 2017 Wiley Periodicals, Inc.
Omotayo Opemipo Oyedara
Full Text Available Bdellovibrio spp. are predatory bacteria with great potential as antimicrobial agents. Studies have shown that members of the genus Bdellovibrio exhibit peculiar characteristics that influence their ecological adaptations. In this study, whole genomes of two different Bdellovibrio spp. designated SKB1291214 and SSB218315 isolated from soil were sequenced. The core genes shared by all the Bdellovibrio spp. considered for the pangenome analysis including the epibiotic B. exovorus were 795. The number of unique genes identified in Bdellovibrio spp. SKB1291214, SSB218315, W, and B. exovorus JJS was 1343, 113, 857, and 1572, respectively. These unique genes encode hydrolytic, chemotaxis, and transporter proteins which might be useful for predation in the Bdellovibrio strains. Furthermore, the two Bdellovibrio strains exhibited differences based on the % GC content, amino acid identity, and 16S rRNA gene sequence. The 16S rRNA gene sequence of Bdellovibrio sp. SKB1291214 shared 99% identity with that of an uncultured Bdellovibrio sp. clone 12L 106 (a pairwise distance of 0.008 and 95–97% identity (a pairwise distance of 0.043 with that of other culturable terrestrial Bdellovibrio spp., including strain SSB218315. In Bdellovibrio sp. SKB1291214, 174 bp sequence was inserted at the host interaction (hit locus region usually attributed to prey attachment, invasion, and development of host independent Bdellovibrio phenotypes. Also, a gene equivalent to Bd0108 in B. bacteriovorus HD100 was not conserved in Bdellovibrio sp. SKB1291214. The results of this study provided information on the genetic characteristics and diversity of the genus Bdellovibrio that can contribute to their successful applications as a biocontrol agent.
Yamashita, Rikiya; Yamaoka, Toshihide; Nishitai, Ryuta; Isoda, Hiroyoshi; Taura, Kojiro; Arizono, Shigeki; Furuta, Akihiro; Ohno, Tsuyoshi; Ono, Ayako; Togashi, Kaori
This study aimed to evaluate the common features and variations of portal vein anatomy in right-sided round ligament (RSRL), which can help propose a method to detect and diagnose this anomaly. In this retrospective study of 14 patients with RSRL, the branching order of the portal tree was analyzed, with special focus on the relationship between the dorsal branch of the right anterior segmental portal vein (P A-D ) and the lateral segmental portal vein (P LL ), to determine the common features. The configuration of the portal vein from the main portal trunk to the right umbilical portion (RUP), the inclination of the RUP, and the number and thickness of the ramifications branching from the right anterior segmental portal vein (P A ) were evaluated for variations. In all subjects, the diverging point of the P A-D was constantly distal to that of the P LL . The portal vein configuration was I- and Z-shaped in nine and five subjects, respectively. The RUP was tilted to the right in all subjects. In Z-shaped subjects, the portal trunk between the branching point of the right posterior segmental portal vein and that of the P LL was tilted to the left in one subject and was almost parallel to the vertical plane in four subjects. Multiple ramifications were radially distributed from the P A in eight subjects, whereas one predominant P A-D branched from the P A in six subjects. Based on the diverging points of the P A-D and P LL , we proposed a three-step method for the detection and diagnosis of RSRL.
Hu, Guang Fu; Liu, Xiang Jiang; Zou, Gui Wei; Li, Zhong; Liang, Hong-Wei; Hu, Shao-Na
We sequenced the complete mitogenomes of (Cyprinus carpio haematopterus) and Russian scattered scale mirror carp (Cyprinus carpio carpio). Comparison of these two mitogenomes revealed that the mitogenomes of these two common carp strains were remarkably similar in genome length, gene order and content, and AT content. There were only 55 bp variations in 16,581 nucleotides. About 1 bp variation was located in rRNAs, 2 bp in tRNAs, 9 bp in the control region and 43 bp in protein-coding genes. Furthermore, forty-three variable nucleotides in the protein-coding genes of the two strains led to four variable amino acids, which were located in the ND2, ATPase 6, ND5 and ND6 genes, respectively.
Full Text Available In the years 2001 and 2002, the study was conducted in six experiments in order to examine the conditioning of the length of stem variability and its impact on cropping features of determinate and indeterminate cultivars of common vetch. Rainfall in June and July as well as during the whole growing season was positively correlated with length of stem, but negatively correlated with seed yield, to a larger extent in the group of indeterminate cultivars than in the determinate one. Duration of blooming stage, length of stem, and seed yield showed the largest variability in both groups. Increase in length of stem of plants of indeterminate cultivars led to the delay in maturation, to less even maturation, and to the decrease in the thousand seed weight and seed yield. Increase in length of stem of plants of determinate cultivars delayed reaching the phase of technical maturation and decreased evenness of plant maturation. Determinate growth of common vetch did not lead to the reduction of lodging.
Murciano-Calles, Javier; Güell-Bosch, Jofre; Villegas, Sandra; Martinez, Jose C
PDZ domains are protein-protein interaction modules sharing the same structural arrangement. To discern whether they display common features in their unfolding/misfolding behaviour we have analyzed in this work the unfolding thermodynamics, together with the misfolding kinetics, of the PDZ fold using three archetypical examples: the second and third PDZ domains of the PSD95 protein and the Erbin PDZ domain. Results showed that all domains passed through a common intermediate, which populated upon unfolding, and that this in turn drove the misfolding towards worm-like fibrillar structures. Thus, the unfolding/misfolding behaviour appears to be shared within these domains. We have also analyzed how this landscape can be modified upon the inclusion of extra-elements, as it is in the nNOS PDZ domain, or the organization of swapped species, as happens in the second PDZ domain of the ZO2 protein. Although the intermediates still formed upon thermal unfolding, the misfolding was prevented to varying degrees.
Full Text Available In this study, 13 polymorphic microsatellite markers were isolated from the Phaseolus vulgaris L. (common bean by using the Fast Isolation by AFLP of Sequence COntaining Repeats (FIASCO protocol. These markers revealed two to seven alleles, with an average of 3.64 alleles per locus. The polymorphic information content (PIC values ranged from 0.055 to 0.721 over 13 loci, with a mean value of 0.492, and 7 loci having PIC greater than 0.5. The expected heterozygosity (HE and observed heterozygosity (HO levels ranged from 0.057 to 0.814 and from 0.026 to 0.531, respectively. Cross-species amplification of the 13 prime pairs was performed in its related specie of Vigna unguiculata L. Seven out of all these markers showed cross-species transferability. These markers will be useful for future genetic diversity and population genetics studies for this agricultural specie and its related species.
Tu, Shiqi; Yuan, Guo-Cheng; Shao, Zhen
Recently, long non-coding RNAs (lncRNAs) have emerged as an important class of molecules involved in many cellular processes. One of their primary functions is to shape epigenetic landscape through interactions with chromatin modifying proteins. However, mechanisms contributing to the specificity of such interactions remain poorly understood. Here we took the human and mouse lncRNAs that were experimentally determined to have physical interactions with Polycomb repressive complex 2 (PRC2), and systematically investigated the sequence features of these lncRNAs by developing a new computational pipeline for sequences composition analysis, in which each sequence is considered as a series of transitions between adjacent nucleotides. Through that, PRC2-binding lncRNAs were found to be associated with a set of distinctive and evolutionarily conserved sequence features, which can be utilized to distinguish them from the others with considerable accuracy. We further identified fragments of PRC2-binding lncRNAs that are enriched with these sequence features, and found they show strong PRC2-binding signals and are more highly conserved across species than the other parts, implying their functional importance.
Foyer, Christine H; Verrall, Susan R; Hancock, Robert D
Phloem-feeding insects (PFIs), of which aphids are the largest group, are major agricultural pests causing extensive damage to crop plants. In contrast to chewing insects, the nature of the plant response to PFIs remains poorly characterized. Scrutiny of the literature concerning transcriptional responses of model and crop plant species to PFIs reveals surprisingly little consensus with respect to the transcripts showing altered abundance following infestation. Nevertheless, core features of the transcriptional response to PFIs can be defined in Arabidopsis thaliana. This comparison of the PFI-associated transcriptional response observed in A. thaliana infested by the generalists Myzus persicae and Bemisia tabaci with the specialist Brevicoryne brassicae highlights the importance of calcium-dependent and receptor kinase-associated signalling. We discuss these findings within the context of the complex cross-talk between the different hormones regulating basal immune response mechanisms in plants. We identify PFI-responsive genes, highlighting the importance of cell wall-associated kinases in plant-PFI interactions, as well as the significant role of kinases containing the domain of unknown function 26. A common feature of plant-PFI interaction is enhanced abundance of transcripts encoding WRKY transcription factors. However, significant divergence was observed with respect to secondary metabolism dependent upon the insect attacker. Transcripts encoding enzymes and proteins associated with glucosinolate metabolism were decreased following attack by the generalist M. persicae but not by the specialist B. brassicae. This analysis provides a comprehensive overview of the molecular patterns associated with the plant response to PFIs and suggests that plants recognize and respond to perturbations in the cell wall occurring during PFI infestation. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights
John P Miller
Full Text Available A genome-scale RNAi screen was performed in a mammalian cell-based assay to identify modifiers of mutant huntingtin toxicity. Ontology analysis of suppressor data identified processes previously implicated in Huntington's disease, including proteolysis, glutamate excitotoxicity, and mitochondrial dysfunction. In addition to established mechanisms, the screen identified multiple components of the RRAS signaling pathway as loss-of-function suppressors of mutant huntingtin toxicity in human and mouse cell models. Loss-of-function in orthologous RRAS pathway members also suppressed motor dysfunction in a Drosophila model of Huntington's disease. Abnormal activation of RRAS and a down-stream effector, RAF1, was observed in cellular models and a mouse model of Huntington's disease. We also observe co-localization of RRAS and mutant huntingtin in cells and in mouse striatum, suggesting that activation of R-Ras may occur through protein interaction. These data indicate that mutant huntingtin exerts a pathogenic effect on this pathway that can be corrected at multiple intervention points including RRAS, FNTA/B, PIN1, and PLK1. Consistent with these results, chemical inhibition of farnesyltransferase can also suppress mutant huntingtin toxicity. These data suggest that pharmacological inhibition of RRAS signaling may confer therapeutic benefit in Huntington's disease.
Chiocci, F. L.; Gorini, C.; Ercilla, G.; Sakellariou, D.; Casalbore, D.; Ridente, D.
46,000 km of densely settled coastlines characterise the Mediterranean Sea. The region connects three continents, where the population doubled in the last 20 years, and among which, trade, maritime transports and migratory fluxes have been increasing. Moreover, the Mediterranean is by far the world's largest tourist destination, attracting almost a third of international tourists and generating more than a quarter of tourism-related revenues worldwide. The Mediterranean area lays in a plate boundary zone highly active in terms of seismicity, volcanism and submarine geological processes that over recent time have repeatedly demonstrated to be able to generate catastrophic events.. As an example 98 tsunamis where recorded in the Mediterranean on historical times (on average one every century). This census do not encompasses small events, such as minor tsunamis generated by submarine landslides that can produce serious damage in the near field. In Stromboli volcanic island (Southern Tyrrhenian Sea) for instance, the frequency of such events accounts for 5 events over the last century Mapping the seafloor for geohazard assessment becomes, therefore, especially important for the sustainable development of the marine and coastal areas both economically and socially. The increasing amount of high resolution seafloor mapping data allows defining geohazard features such as volcanic vents, active faults, submarine landslide, canyon head migrating bedforms fluid expulsion structure with a detail able to highlight even locally dangerous situations. If the marine geoscience community will be able to build common standards to interpret and cartographically represent the marine geohazard features private industries and public agencies will benefit of an unvaluable tool which will help in better exploit the marine resource and/or preserve the marine and coastal environment. This contribution will present spectacular examples of marine geohazards from the Mediterranean Seas; the
Galperin Michael Y
Full Text Available Abstract Background Comparative analysis of sequenced genomes reveals numerous instances of apparent horizontal gene transfer (HGT, at least in prokaryotes, and indicates that lineage-specific gene loss might have been even more common in evolution. This complicates the notion of a species tree, which needs to be re-interpreted as a prevailing evolutionary trend, rather than the full depiction of evolution, and makes reconstruction of ancestral genomes a non-trivial task. Results We addressed the problem of constructing parsimonious scenarios for individual sets of orthologous genes given a species tree. The orthologous sets were taken from the database of Clusters of Orthologous Groups of proteins (COGs. We show that the phyletic patterns (patterns of presence-absence in completely sequenced genomes of almost 90% of the COGs are inconsistent with the hypothetical species tree. Algorithms were developed to reconcile the phyletic patterns with the species tree by postulating gene loss, COG emergence and HGT (the latter two classes of events were collectively treated as gene gains. We prove that each of these algorithms produces a parsimonious evolutionary scenario, which can be represented as mapping of loss and gain events on the species tree. The distribution of the evolutionary events among the tree nodes substantially depends on the underlying assumptions of the reconciliation algorithm, e.g. whether or not independent gene gains (gain after loss after gain are permitted. Biological considerations suggest that, on average, gene loss might be a more likely event than gene gain. Therefore different gain penalties were used and the resulting series of reconstructed gene sets for the last universal common ancestor (LUCA of the extant life forms were analysed. The number of genes in the reconstructed LUCA gene sets grows as the gain penalty increases. However, qualitative examination of the LUCA versions reconstructed with different gain penalties
Full Text Available To evaluate the potential of genomic selection (GS, a selection experiment with GS and phenotypic selection (PS was performed in an allogamous crop, common buckwheat (Fagopyrum esculentum Moench. To indirectly select for seed yield per unit area, which cannot be measured on a single-plant basis, a selection index was constructed from seven agro-morphological traits measurable on a single plant basis. Over 3 years, we performed two GS and one PS cycles per year for improvement in the selection index. In GS, a prediction model was updated every year on the basis of genotypes of 14,598–50,000 markers and phenotypes. Plants grown from seeds derived from a series of generations of GS and PS populations were evaluated for the traits in the selection index and other yield-related traits. GS resulted in a 20.9% increase and PS in a 15.0% increase in the selection index in comparison with the initial population. Although the level of linkage disequilibrium in the breeding population was low, the target trait was improved with GS. Traits with higher weights in the selection index were improved more than those with lower weights, especially when prediction accuracy was high. No trait changed in an unintended direction in either GS or PS. The accuracy of genomic prediction models built in the first cycle decreased in the later cycles because the genetic bottleneck through the selection cycles changed linkage disequilibrium patterns in the breeding population. The present study emphasizes the importance of updating models in GS and demonstrates the potential of GS in mass selection of allogamous crop species, and provided a pilot example of successful application of GS to plant breeding.
Yabe, Shiori; Hara, Takashi; Ueno, Mariko; Enoki, Hiroyuki; Kimura, Tatsuro; Nishimura, Satoru; Yasui, Yasuo; Ohsawa, Ryo; Iwata, Hiroyoshi
To evaluate the potential of genomic selection (GS), a selection experiment with GS and phenotypic selection (PS) was performed in an allogamous crop, common buckwheat ( Fagopyrum esculentum Moench). To indirectly select for seed yield per unit area, which cannot be measured on a single-plant basis, a selection index was constructed from seven agro-morphological traits measurable on a single plant basis. Over 3 years, we performed two GS and one PS cycles per year for improvement in the selection index. In GS, a prediction model was updated every year on the basis of genotypes of 14,598-50,000 markers and phenotypes. Plants grown from seeds derived from a series of generations of GS and PS populations were evaluated for the traits in the selection index and other yield-related traits. GS resulted in a 20.9% increase and PS in a 15.0% increase in the selection index in comparison with the initial population. Although the level of linkage disequilibrium in the breeding population was low, the target trait was improved with GS. Traits with higher weights in the selection index were improved more than those with lower weights, especially when prediction accuracy was high. No trait changed in an unintended direction in either GS or PS. The accuracy of genomic prediction models built in the first cycle decreased in the later cycles because the genetic bottleneck through the selection cycles changed linkage disequilibrium patterns in the breeding population. The present study emphasizes the importance of updating models in GS and demonstrates the potential of GS in mass selection of allogamous crop species, and provided a pilot example of successful application of GS to plant breeding.
Nock, Nl; Zhang, Lx
Methods that can evaluate aggregate effects of rare and common variants are limited. Therefore, we applied a two-stage approach to evaluate aggregate gene effects in the 1000 Genomes Project data, which contain 24,487 single-nucleotide polymorphisms (SNPs) in 697 unrelated individuals from 7 populations. In stage 1, we identified potentially interesting genes (PIGs) as those having at least one SNP meeting Bonferroni correction using univariate, multiple regression models. In stage 2, we evaluate aggregate PIG effects on trait, Q1, by modeling each gene as a latent construct, which is defined by multiple common and rare variants, using the multivariate statistical framework of structural equation modeling (SEM). In stage 1, we found that PIGs varied markedly between a randomly selected replicate (replicate 137) and 100 other replicates, with the exception of FLT1. In stage 1, collapsing rare variants decreased false positives but increased false negatives. In stage 2, we developed a good-fitting SEM model that included all nine genes simulated to affect Q1 (FLT1, KDR, ARNT, ELAV4, FLT4, HIF1A, HIF3A, VEGFA, VEGFC) and found that FLT1 had the largest effect on Q1 (βstd = 0.33 ± 0.05). Using replicate 137 estimates as population values, we found that the mean relative bias in the parameters (loadings, paths, residuals) and their standard errors across 100 replicates was on average, less than 5%. Our latent variable SEM approach provides a viable framework for modeling aggregate effects of rare and common variants in multiple genes, but more elegant methods are needed in stage 1 to minimize type I and type II error.
Lee, Eduardo; Helt, Gregg A; Reese, Justin T; Munoz-Torres, Monica C; Childers, Chris P; Buels, Robert M; Stein, Lincoln; Holmes, Ian H; Elsik, Christine G; Lewis, Suzanna E
Web Apollo is the first instantaneous, collaborative genomic annotation editor available on the web. One of the natural consequences following from current advances in sequencing technology is that there are more and more researchers sequencing new genomes. These researchers require tools to describe the functional features of their newly sequenced genomes. With Web Apollo researchers can use any of the common browsers (for example, Chrome or Firefox) to jointly analyze and precisely describe the features of a genome in real time, whether they are in the same room or working from opposite sides of the world.
Full Text Available Grain weight, an essential yield component, is under strong genetic control and markedly influenced by the environment. Here, by genome-wide association analysis with a panel of 94 elite common wheat varieties, 37 loci were found significantly associated with thousand-grain weight (TGW in one or more environments differing in water and fertiliser levels. Five loci were stably associated with TGW under all 12 environments examined. Their elite alleles had positive effects on TGW. Four, two, three, and two loci were consistently associated with TGW in the irrigated and fertilised (IF, rainfed (RF, reduced nitrogen (RN, and reduced phosphorus (RP environments. The elite alleles of the IF-specific loci enhanced TGW under well-resourced conditions, whereas those of the RF-, RN-, or RP-specific loci conferred tolerance to the TGW decrease when irrigation, nitrogen, or phosphorus were reduced. Moreover, the elite alleles of the environment-independent and -specific loci often acted additively to enhance TGW. Four additional loci were found associated with TGW in specific locations, one of which was shown to contribute to the TGW difference between two experimental sites. Further analysis of 14 associated loci revealed that nine affected both grain length and width, whereas the remaining loci influenced either grain length or width, indicating that these loci control grain weight by regulating kernel size. Finally, the elite allele of Xpsp3152 frequently co-segregated with the larger grain haplotype of TaGW2-6A, suggesting probable genetic and functional linkages between Xpsp3152 and GW2 that are important for grain weight control in cereal plants. Our study provides new knowledge on TGW control in elite common wheat lines, which may aid the improvement of wheat grain weight trait in further research.
Identification and Mapping of Simple Sequence Repeat Markers from Common Bean (Phaseolus vulgaris L. Bacterial Artificial Chromosome End Sequences for Genome Characterization and Genetic–Physical Map Integration
Juana M. Córdoba
Full Text Available Microsatellite markers or simple sequence repeat (SSR loci are useful for diversity characterization and genetic–physical mapping. Different in silico microsatellite search methods have been developed for mining bacterial artificial chromosome (BAC end sequences for SSRs. The overall goal of this study was genome characterization based on SSRs in 89,017 BAC end sequences (BESs from the G19833 common bean ( L. library. Another objective was to identify new SSR taking into account three tandem motif identification programs (Automated Microsatellite Marker Development [AMMD], Tandem Repeats Finder [TRF], and SSRLocator [SSRL]. Among the microsatellite search engines, SSRL identified the highest number of SSRs; however, when primer design was attempted, the number dropped due to poor primer design regions. Automated Microsatellite Marker Development software identified many SSRs with valuable AT/TA or AG/TC motifs, while TRF found fewer SSRs and produced no primers. A subgroup of 323 AT-rich, di-, and trinucleotide SSRs were selected from the AMMD results and used in a parental survey with DOR364 and G19833, of which 75 could be mapped in the corresponding population; these represented 4052 BAC clones. Together with 92 previously mapped BES- and 114 non-BES-derived markers, a total of 280 SSRs were included in the polymerase chain reaction (PCR-based map, integrating a total of 8232 BAC clones in 162 contigs from the physical map.
Pareja, Fresia; Geyer, Felipe C; Kumar, Rahul; Selenica, Pier; Piscuoglio, Salvatore; Ng, Charlotte K Y; Burke, Kathleen A; Edelweiss, Marcia; Murray, Melissa P; Brogi, Edi; Weigelt, Britta; Reis-Filho, Jorge S
Breast fibroepithelial lesions (fibroadenomas and phyllodes tumors) are underpinned by recurrent MED12 exon 2 mutations, which are more common in fibroadenomas and benign phyllodes tumors. TERT promoter hotspot mutations have been documented in phyllodes tumors, and found to be more frequent in borderline and malignant lesions. Several lines of evidence suggest that a subset of phyllodes tumors might arise from fibroadenomas. Here we sought to investigate the genetic differences between phyllodes tumors with fibroadenoma-like areas vs. those without. We retrieved data for 16 borderline/ malignant phyllodes tumors, including seven phyllodes tumors with fibroadenoma-like areas and nine phyllodes tumors without fibroadenoma-like areas, which had been previously subjected to targeted capture massively parallel sequencing. Whilst MED12 exon 2 mutations were significantly more frequent in tumors with fibroadenoma-like areas (71 vs. 11%), an enrichment in genetic alterations targeting bona fide cancer genes was found in those without fibroadenoma-like areas, in particular in EGFR mutations and amplifications (78 vs. 14%). No significant difference in the frequency of TERT genetic alterations was observed (71% in cases with fibroadenoma-like areas vs 56% in those without fibroadenoma-like areas). Our data suggest that the development of phyllodes tumors might follow two different evolutionary pathways: a MED12 -mutant pathway that involves the progression from a fibroadenoma to a malignant phyllodes tumor; and a MED12 -wild-type pathway, where malignant phyllodes tumors arise de novo through the acquisition of genetic alterations targeting cancer genes. Additional studies are warranted to confirm our observations and define whether the outcome differs between both pathways.
Crkvenjakov, R.; Drmanac, R.
A genome is the sum total of the DNA sequences in the cells of an individual organism. The common usage that species possess genomes comes naturally to biochemists, who have shown that all protein and nucleic acid molecules are at the same time species- and individual-specific, with minor individual variations being superimposed on a consensus sequence that is constant for a species. By extension, this property is attributed to the common features of DNA in the chromosomes of members of a given species and is called species genome. Our proposal for the definition of a biological species is as follows: A species comprises a group of actual and potential biological organisms built according to a unique genome program that is recorded, and at least in part expressed, in the structures of their genomic nucleic acid molecule(s), having intragroup sequence differences which can be fully interconverted in the process of organismal reproduction.
M. Schaap (Michiel); R.J.L.F. Lemmers (Richard); R. Maassen (Roel); P.J. van der Vliet (Patrick); L.F. Hoogerheide (Lennart); H.K. van Dijk (Herman); N. Basturk (Nalan); P. de Knijff (Peter); S.M. van der Maarel (Silvère)
textabstractBackground: Macrosatellite repeats (MSRs), usually spanning hundreds of kilobases of genomic DNA, comprise a significant proportion of the human genome. Because of their highly polymorphic nature, MSRs represent an extreme example of copy number variation, but their structure and
Schaap, M.; Lemmers, R.J.L.F.; Maassen, R.; van der Vliet, P.J.; Hoogerheide, L.F.; van Dijk, H.K.; Basturk, N.; de Knijff, P.; van der Maarel, S.M.
Background: Macrosatellite repeats (MSRs), usually spanning hundreds of kilobases of genomic DNA, comprise a significant proportion of the human genome. Because of their highly polymorphic nature, MSRs represent an extreme example of copy number variation, but their structure and function is largely
Crkvenjakov, R.; Dramanac, R.
A genome is the sum total of the DNA sequences in the cells of an individual organism. The common usage that species possess genomes comes naturally to biochemists, who have shown that all protein and nucleic acid molecules are at the same time species and individual-specific, with minor individual variations being superimposed on a consensus sequence that is constant for a species. By extension, this property is attributed to the common features of DNA in the chromosomes of members of a given species and is called (species) genome. The definition of species based on chromosomes, genes, or genome common to its member organisms has been implied or mentioned in passing numerous times. Some population biologists think that members of species have similar ``homeostatic genotypes,`` which are to a degree resistant to mutation or environmental change in the production of a basic phenotype.
Full Text Available Introduction: Escherichia coli O103:H2 occurs as verotoxigenic E. coli (VTEC carrying only vtx1 or vtx2 or both variants, but also as vtx-negative atypical enteropathogenic E. coli (aEPEC. The majority of E. coli O103:H2 identified from cases of human disease are caused by the VTEC form. If aEPEC strains frequently acquire verotoxin genes and become VTEC, they must be considered a significant public health concern. In this study, we have characterized and compared aEPEC and VTEC isolates of E. coli O103:H2 from Swedish cattle. Methods: Fourteen isolates of E. coli O103:H2 with and without verotoxin genes were collected from samples of cattle feces taken during a nationwide cattle prevalence study 2011–2012. Isolates were sequenced with a 2×100 bp setup on a HiSeq2500 instrument producing >100× coverage per isolate. Single-nucleotide polymorphism (SNP typing was performed using the genome analysis tool kit (GATK. Virulence genes and other regions of interest were detected. Susceptibility to transduction by two verotoxin-encoding phages was investigated for one representative aEPEC O103:H2 isolate. Results and Discussion: This study shows that aEPEC O103:H2 is more commonly found (64% than VTEC O103:H2 (36% in the Swedish cattle reservoir. The only verotoxin gene variant identified was vtx1a. Phylogenetic comparison by SNP analysis indicates that while certain subgroups of aEPEC and VTEC are closely related and have otherwise near identical virulence gene repertoires, they belong to separate lineages. This indicates that the uptake or loss of verotoxin genes is a rare event in the natural cattle environment of these bacteria. However, a representative of a VTEC-like aEPEC O103:H2 subgroup could be stably lysogenized by a vtx-encoding phage in vitro.
Cho, Eun Young; Jang, Yangsoo; Shin, Eun Soon; Jang, Hye Yoon; Yoo, Yeon-Kyeong; Kim, Sook; Jang, Ji Hyun; Lee, Ji Yeon; Yun, Min Hye; Park, Min Young; Chae, Jey Sook; Lim, Jin Woo; Shin, Dong Jik; Park, Sungha; Lee, Jong Ho; Han, Bok Ghee; Rae, Kim Hyung; Cardon, Lon R; Morris, Andrew P; Lee, Jong Eun; Clarke, Geraldine M
Background Recent genome-wide association (GWA) studies have identified and replicated several genetic loci associated with the risk of development of coronary artery disease (CAD) in samples from populations of Caucasian and Asian descent. However, only chromosome 9p21 has been confirmed as a major susceptibility locus conferring risk for development of CAD across multiple ethnic groups. The authors aimed to find evidence of further similarities and differences in genetic risk of CAD between Korean and other populations. Methods The authors performed a GWA study comprising 230 cases and 290 controls from a Korean population typed on 490 032 single nucleotide polymorphisms (SNPs). A total of 3148 SNPs were taken forward for genotyping in a subsequent replication study using an independent sample of 1172 cases and 1087 controls from the same population. Results The association previously observed on chromosome 9p21 was independently replicated (p=3.08e–07). Within this region, the same risk haplotype was observed in samples from both Korea and of Western European descent, suggesting that the causal mutation carried on this background occurred on a single ancestral allele. Other than 9p21, the authors were unable to replicate any of the previously reported signals for association with CAD. Furthermore, no evidence of association was found at chromosome 1q41 for risk of myocardial infarction, previously identified as conferring risk in a Japanese population. Conclusion A common causal variant is likely to be responsible for risk of CAD in Korean and Western European populations at chromosome 9p21.3. Further investigations are required to confirm non-replication of any other cross-race genetic risk factors. PMID:27325954
Xu, Zong-Chang; Kong, Yingzhen
Cellulose-synthase proteins (CESAs) are membrane localized proteins and they form protein complexes to produce cellulose in the plasma membrane. CESA proteins play very important roles in cell wall construction during plant growth and development. In this study, a total of 21 NtCESA gene sequences were identified by using PF03552 conserved protein sequence and 10 AtCESA protein sequences of Arabidopsis thaliana to blast against the common tobacco (Nicotiana tabacum L.) genome database with TBLASTN protocol. We analyzed the physical and chemical properties of protein sequences based on some software or on-line analysis tools. The results showed that there were no significant variances in terms of the physical and chemical properties of the 21 NtCESA proteins. First, phylogenetic tree analysis showed that 21 NtCESA genes and 10 AtCESA genes were clustered into five groups, and the gene structures were similar among the genes that are clustered into the same group. Second, in all of the 21 NtCESA proteins the conserved zinc finger domain was identified in the N-terminus, transmembrane domains were identified in the C-terminus and the DDD-QXXRW conserved domains were also identified. Third, gene expression analysis results indicated that most NtCESA genes were expressed in roots and leaves of seedling or mature tissues of tobacco, seeds and callus tissues. The genes that clustered into the same group share similar expression patterns. Importantly, NtCESA proteins that are involved in secondary cell wall cellulose synthesis have two extra transmembrane domains compared with that involved in primary cell wall cellulose biosynthesis. In addition, subcellular localization results showed that NtCESA9 and NtCESA14 were two plasma membrane anchored proteins. This study will lay a foundation for further functional characterization of these NtCESA genes.
Sukhamrit Kaur; Sandeep Kaur
Abstract Genomics is study of genome which provides large amount of data for which large storage and computation power is needed. These issues are solved by cloud computing that provides various cloud platforms for genomics. These platforms provides many services to user like easy access to data easy sharing and transfer providing storage in hundreds of terabytes more computational power. Some cloud platforms are Google genomics DNAnexus and Globus genomics. Various features of cloud computin...
The "science commons," knowledge that is widely accessible at low or no cost, is a uniquely important input to scientific advance and cumulative technological innovation. It is primarily, although not exclusively, funded by government and nonprofit sources. Much of it is produced at academic research centers, although some academic science is proprietary and some privately funded R&D enters the science commons. Science in general aspires to Mertonian norms of openness, universality, objectivity, and critical inquiry. The science commons diverges from proprietary science primarily in being open and being very broadly available. These features make the science commons particularly valuable for advancing knowledge, for training innovators who will ultimately work in both public and private sectors, and in providing a common stock of knowledge upon which all players-both public and private-can draw readily. Open science plays two important roles that proprietary R&D cannot: it enables practical benefits even in the absence of profitable markets for goods and services, and its lays a shared foundation for subsequent private R&D. The history of genomics in the period 1992-2004, covering two periods when genomic startup firms attracted significant private R&D investment, illustrates these features of how a science commons contributes value. Commercial interest in genomics was intense during this period. Fierce competition between private sector and public sector genomics programs was highly visible. Seemingly anomalous behavior, such as private firms funding "open science," can be explained by unusual business dynamics between established firms wanting to preserve a robust science commons to prevent startup firms from limiting established firms' freedom to operate. Deliberate policies to create and protect a large science commons were pursued by nonprofit and government funders of genomics research, such as the Wellcome Trust and National Institutes of Health. These
We sequenced and compared the genomes of Dothideomycete fungal plant pathogens Cladosporium fulvum and Dothistroma septosporum that are related phylogenetically, but have different lifestyles and infect different hosts. C. fulvum is a biotroph that infects tomato, while D. septosporum is a hemibiotr...
Wang, Xianshu; Pankratz, V. Shane; Fredericksen, Zachary; Tarrell, Robert; Karaus, Mary; McGuffog, Lesley; Pharaoh, Paul D. P.; Ponder, Bruce A. J.; Dunning, Alison M.; Peock, Susan; Cook, Margaret; Oliver, Clare; Frost, Debra; Sinilnikova, Olga M.; Stoppa-Lyonnet, Dominique; Mazoyer, Sylvie; Houdayer, Claude; Hogervorst, Frans B. L.; Hooning, Maartje J.; Ligtenberg, Marjolijn J.; Spurdle, Amanda; Chenevix-Trench, Georgia; Schmutzler, Rita K.; Wappenschmidt, Barbara; Engel, Christoph; Meindl, Alfons; Domchek, Susan M.; Nathanson, Katherine L.; Rebbeck, Timothy R.; Singer, Christian F.; Gschwantler-Kaulich, Daphne; Dressler, Catherina; Fink, Anneliese; Szabo, Csilla I.; Zikan, Michal; Foretova, Lenka; Claes, Kathleen; Thomas, Gilles; Hoover, Robert N.; Hunter, David J.; Chanock, Stephen J.; Easton, Douglas F.; Antoniou, Antonis C.; Couch, Fergus J.; Gregory, Helen; Miedzybrodzka, Zosia; Morrison, Patrick; Cole, Trevor; McKeown, Carole; Taylor, Amy; Donaldson, Alan; Paterson, Joan; Murray, Alexandra; Rogers, Mark; McCann, Emma; Kennedy, John; Barton, David; Porteous, Mary; Brewer, Carole; Kivuva, Emma; Searle, Anne; Goodman, Selina; Davidson, Rosemarie; Murday, Victoria; Bradshaw, Nicola; Snadden, Lesley; Longmuir, Mark; Watt, Catherine; Izatt, Louise; Pichert, Gabriella; Langman, Caroline; Dorkins, Huw; Barwell, Julian; Chu, Carol; Bishop, Tim; Miller, Julie; Ellis, Ian; Evans, D. Gareth; Lalloo, Fiona; Holt, Felicity; Male, Alison; Robinson, Anne; Gardiner, Carol; Douglas, Fiona; Claber, Oonagh; Walker, Lisa; McLeod, Diane; Eeles, Ros; Shanley, Susan; Rahman, Nazneen; Houlston, Richard; Bancroft, Elizabeth; D'Mello, Lucia; Page, Elizabeth; Ardern-Jones, Audrey; Mitra, Anita; Cook, Jackie; Quarrell, Oliver; Bardsley, Cathryn; Hodgson, Shirley; Goff, Sheila; Brice, Glen; Winchester, Lizzie; Eccles, Diana; Lucassen, Anneke; Crawford, Gillian; Tyler, Emma; McBride, Donna; Bérard, Léon; Sinilnikova, Olga; Barjhoux, Laure; Giraud, Sophie; Léone, Mélanie; Gauthier-Villars, Marion; Moncoutier, Virginie; Belotti, Muriel; de Pauw, Antoine; Bressac-de-Paillerets, Brigitte; Remenieras, Audrey; Byrde, Véronique; Caron, Olivier; Lenoir, Gilbert; Bignon, Yves-Jean; Uhrhammer, Nancy; Lasset, Christine; Bonadona, Valérie; Hardouin, Agnès; Berthet, Pascaline; Sobol, Hagay; Bourdon, Violaine; Eisinger, Françoise; Coulet, Florence; Colas, Chrystelle; Soubrier, Florent; Coupier, Isabelle; Payrat, Jean-Philippe; Fournier, Joëlle; Révillion, Françoise; Vennin, Philippe; Adenis, Claude; Rouleau, Etienne; Lidereau, Rosette; Demange, Liliane; Nogues, Catherine; Muller, Danièle; Fricker, Jean-Pierre; Longy, Michel; Sevenet, Nicolas; Toulas, Christine; Guimbaud, Rosine; Gladieff, Laurence; Feillel, Viviane; Leroux, Dominique; Dreyfus, Hélèn; Rebischung, Christine; Cassini, Cécile; Olivier-Faivre, Laurence; Prieur, Fabienne; Ferrer, Sandra Fert; Frénay, Marc; Vénat-Bouvet, Laurence; Lynch, Henry T.; Hogervorst, Frans; Vernhoef, Senno; Pijpe, Anouk; van 't Veer, Laura; van Leeuwen, Flora; Rookus, Matti; Collée, Margriet; van den Ouweland, Ans; Kriege, Mieke; Schutte, Mieke; Hooning, Maartje; Seynaeve, Caroline; van Asperen, Christi; Wijnen, Juul; Vreeswijk, Maaike; Tollenaar, Rob; Devilee, Peter; Ligtenberg, Marjolijn; Hoogerbrugge, Nicoline; Ausems, Margreet; van der Luijt, Rob; Aalfs, Cora; van Os, Theo; Gille, Hans; Waisfisz, Quinten; Meijers-Heijboer, Hanne; Gomez-Garcia, Encarna; van Roozendaal, Kees; Blok, Marinus; Oosterwijk, Jan; van der Hout, Annemieke; Mourits, Marian; Vasen, Hans; Szabo, Csilla; Pohlreich, Petr; Kleibl, Zdenek; Machackova, Eva; Lukesova, Miroslava; de Leeneer, Kim; Poppe, Bruce; de Paepe, Anne
Recent studies have identified single nucleotide polymorphisms (SNPs) that significantly modify breast cancer risk in BRCA1 and BRCA2 mutation carriers. Since these risk modifiers were originally identified as genetic risk factors for breast cancer in genome-wide association studies (GWASs),
T.A. Knoch (Tobias); A. Abuseiris (Anis); M. Lesnussa (Michael); F.N. Kepper (Nick); R.M. de Graaf (Rob); F.G. Grosveld (Frank)
textabstractThe amount of information is growing exponentially with ever-new technologies emerging and is believed to be always at the limit. In contrast, huge resources are obviously available, which are underused in the IT sector, similar as e.g. in the renewable energy sector. Genome research is
A group of patients with Marfan's syndrome, who have finger and toe contractures, displays tendons' alterations upon an ultrasound examination: are these features common among classical Marfan patients?
Melchiorre, Daniela; Pratelli, Elisa; Torricelli, Elena; Sofi, Francesco; Abbate, Rosanna; Matucci-Cerinic, Marco; Gensini, GianFranco; Pepe, Guglielmina
The involvement of the musculoskeletal system with other mild pleiotropic manifestations represents a clinical criterion, called "systemic features," to d iagnose Marfan's syndrome. We aimed to investigate the features of the hands and feet redressable contractures present in a group of Marfan patients. In 13 patients with previously diagnosed Marfan's syndrome, an accurate clinical examination was performed. In particular the characterization of the musculoskeletal system by visual analogic scale to measure muscle pain (VAS) and muscle strength (MRC system) was carried out; the Beighton scale score was used to evaluate the articular hypermobility. Ultrasound examination (US) was performed to detect deep-superficial flexor tendons and extensor tendons of both hands, and the short and long flexor and extensor tendons of the fingers and toes in static and dynamic positions. The ImageJ program was adopted to measure a profile of tendon echo-intensity. A reduction of the thickness of all tendons was detected by US in our patients; the VAS and Beighton scale scores were in normal ranges. The profile of tendon echo-intensity showed different textural details in all Marfan patients. This study provides evidence for other contractures' localization, and for altered findings of the tendons in patients with Marfan syndrome and finger/toe contractures. These changes may be associated with structural modifications in connective tissue.
Karpinets, Tatiana V [ORNL; Park, Byung H [ORNL; Syed, Mustafa H [ORNL; Klotz, Martin G [University of North Carolina, Charlotte; Uberbacher, Edward C [ORNL
Most bacterial symbionts of plants are phenotypically characterized by their parasitic or matualistic relationship with the host; however, the genomic characteristics that likely discriminate mutualistic symbionts from pathogens of plants are poorly understood. This study comparatively analyzed the genomes of 54 plant-symbiontic bacteria, 27 mutualists and 27 pathogens, to discover genomic determinants of their parasitic and mutualistic nature in terms of protein family domains, KEGG orthologous groups, metabolic pathways and families of carbohydrate-active enzymes (CAZymes). We further used all bacteria with sequenced genomesl, published microarrays and transcriptomics experimental datasets, and literature to validate and to explore results of the comparison. The analysis revealed that genomes of mutualists are larger in size and higher in GC content and encode greater molecular, functional and metabolic diversity than the investigated genomes of pathogens. This enriched molecular and functional enzyme diversity included constructive biosynthetic signatures of CAZymes and metabolic pathways in genomes of mutualists compared with catabolic signatures dominant in the genomes of pathogens. Another discriminative characteristic of mutualists is the co-occurence of gene clusters required for the expression and function of nitrogenase and RuBisCO. Analysis of previously published experimental data indicate that nitrogen-fixing mutualists may employ Rubisco to fix CO2 not in the canonical Calvin-Benson-Basham cycle but in a novel metabolic pathway, here called Rubisco-based glycolysis , to increase efficiency of sugar utilization during the symbiosis with plants. An important discriminative characteristic of plant pathogenic bacteria is two groups of genes likely encoding effector proteins involved in host invasion and a genomic locus encoding a putative secretion system that includes a DUF1525 domain protein conserved in pathogens of plants and of other organisms. The
Sandholt, Camilla Helene; Mogensen, Mette S; Borch-Johnsen, Knut
The INSIG2 rs7566605 and PFKP rs6602024 polymorphisms have been identified as obesity gene variants in genome-wide association (GWA) studies. However, replication has been contradictory for both variants. The aims of this study were to validate these obesity-associations through case-control stud......The INSIG2 rs7566605 and PFKP rs6602024 polymorphisms have been identified as obesity gene variants in genome-wide association (GWA) studies. However, replication has been contradictory for both variants. The aims of this study were to validate these obesity-associations through case......-control studies and analyses of obesity-related quantitative traits. Moreover, since environmental and genetic factors may modulate the impact of a genetic variant, we wanted to perform such interaction analyses. We focused on physical activity as an environmental risk factor, and on the GWA identified obesity...
Strawbridge, Rona; Dupuis, Josée; Prokopenko, Inga; Barker, Adam; Ahlqvist, Emma; Rybin, Denis; Petrie, John; Bouatia-Naji, Nabila; Dimas, Antigone; Wheeler, Eleanor; Chen, Han; Voight, Benjamin; Taneera, Jalal; Kanoni, Stavroula; Peden, John
textabstractOBJECTIVE - Proinsulin is a precursor of mature insulin and C-peptide. Higher circulating proinsulin levels are associated with impaired b-cell function, raised glucose levels, insulin resistance, and type 2 diabetes (T2D). Studies of the insulin processing pathway could provide new insights about T2D pathophysiology. RESEARCH DESIGN AND METHODS - We have conducted a meta-analysis of genome-wide association tests of ;2.5 million genotyped or imputed single nucleotide polymorphisms...
Henríquez-Hernández, Luis Alberto; Pinar, Beatriz; Carmona-Vigo, Ruth
PURPOSE: To investigate the genomic signaling that defines sensitive lymphocytes to radiation and if such molecular profiles are consistent with clinical toxicity; trying to disclose the radiobiology mechanisms behind these cellular processes. PATIENTS AND METHODS: Twelve consecutive patients...... suffering from locally advanced breast cancer and treated with high-dose hyperfractionated radiotherapy were recruited. Initial DNA damage was measured by pulsed-field gel electrophoresis and radiation-induced apoptosis was measured by flow cytometry. Gene expression was assessed by DNA microarray. RESULTS...
Full Text Available Abstract Genomics is study of genome which provides large amount of data for which large storage and computation power is needed. These issues are solved by cloud computing that provides various cloud platforms for genomics. These platforms provides many services to user like easy access to data easy sharing and transfer providing storage in hundreds of terabytes more computational power. Some cloud platforms are Google genomics DNAnexus and Globus genomics. Various features of cloud computing to genomics are like easy access and sharing of data security of data less cost to pay for resources but still there are some demerits like large time needed to transfer data less network bandwidth.
Hsu, Yi-Hsiang; Liu, Youfang; Hannan, Marian T.; Maixner, William; Smith, Shad B.; Diatchenko, Luda; Golightly, Yvonne M.; Menz, Hylton B.; Kraus, Virginia B.; Doherty, Michael; Wilson, A.G.; Jordan, Joanne M.
Objective Hallux valgus (HV) affects ~36% of Caucasian adults. Although considered highly heritable, the underlying genetic determinants are unclear. We conducted the first genome-wide association study (GWAS) aimed to identify genetic variants associated with HV. Methods HV was assessed in 3 Caucasian cohorts (n=2,263, n=915, and n=1,231 participants, respectively). In each cohort, a GWAS was conducted using 2.5M imputed single nucleotide polymorphisms (SNPs). Mixed-effect regression with the additive genetic model adjusted for age, sex, weight and within-family correlations was used for both sex-specific and combined analyses. To combine GWAS results across cohorts, fixed-effect inverse-variance meta-analyses were used. Following meta-analyses, top-associated findings were also examined in an African American cohort (n=327). Results The proportion of HV variance explained by genome-wide genotyped SNPs was 50% in men and 48% in women. A higher proportion of genetic determinants of HV was sex-specific. The most significantly associated SNP in men was rs9675316 located on chr17q23-a24 near the AXIN2 gene (p=5.46×10−7); the most significantly associated SNP in women was rs7996797 located on chr13q14.1-q14.2 near the ESD gene (p=7.21×10−7). Genome-wide significant SNP-by-sex interaction was found for SNP rs1563374 located on chr11p15.1 near the MRGPRX3 gene (interaction p-value =4.1×10−9). The association signals diminished when combining men and women. Conclusion Findings suggest that the potential pathophysiological mechanisms of HV are complex and strongly underlined by sex-specific interactions. The identified genetic variants imply contribution of biological pathways observed in osteoarthritis as well as new pathways, influencing skeletal development and inflammation. PMID:26337638
Brown, Sean G; Publicover, Stephen J; Mansell, Steven A; Lishko, Polina V; Williams, Hannah L; Ramalingam, Mythili; Wilson, Stuart M; Barratt, Christopher L R; Sutton, Keith A; Da Silva, Sarah Martins
(+)) conductance and resting membrane potential (Vm) and signalling/motility assays were used to assess functional characteristics of sperm from IVF and ICSI patient samples. The mean Vm and outward membrane conductance in sperm from IVF and ICSI patients were not significantly different from those of control (donor) sperm prepared under the same conditions, but variation between individuals was significantly greater (P25%). In particular, in ≈10% of patients (7/81), we observed either a negligible outward conductance (4 patients) or an enhanced inward current (3 patients), both of which caused depolarization of Vm. Analysis of clinical data from the IVF patients showed significant association of depolarized Vm (≥0 mV) with low fertilization rate (P= 0.012). Spermatozoa with electrophysiological abnormities (conductance and Vm) responded normally to progesterone with elevation of [Ca(2+)]i and penetration of viscous medium, indicating retention of cation channel of sperm (CatSper) channel function. For practical, technical, ethical and logistical reasons, we could not obtain sufficient additional semen samples from men with conductance abnormalities to establish the cause of the conductance defects. Full exome sequencing was only available in two men with conductance defects. These data add significantly to the understanding of the role of ion channels in human sperm function and its impact on male fertility. Impaired potassium channel conductance (Gm) and/or Vm regulation is both common and complex in human spermatozoa and importantly is associated with impaired fertilization capacity when the Vm of cells is completely depolarized. The majority of the data were obtained using funding from MRC project grants (#MR/K013343/1, MR/012492/1). Additional funding was provided by NHS Tayside, TENOVUS, Chief Scientist Office NRS Fellowship and University of Abertay. The authors declare that there is no conflict of interest. Not applicable. © The Author 2016. Published by Oxford
Vilella Albert J
Full Text Available Abstract Background DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. Results We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i exhaustive population-genetic analyses including those based on the coalescent theory; ii analysis adapted to the shallow data generated by the high-throughput genome projects; iii use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v visualization of the results integrated with current genome annotations in commonly available genome browsers. Conclusion VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data.
Full Text Available Abstract Background Rhizobium tropici CIAT 899 and Rhizobium sp. PRF 81 are α-Proteobacteria that establish nitrogen-fixing symbioses with a range of legume hosts. These strains are broadly used in commercial inoculants for application to common bean (Phaseolus vulgaris in South America and Africa. Both strains display intrinsic resistance to several abiotic stressful conditions such as low soil pH and high temperatures, which are common in tropical environments, and to several antimicrobials, including pesticides. The genetic determinants of these interesting characteristics remain largely unknown. Results Genome sequencing revealed that CIAT 899 and PRF 81 share a highly-conserved symbiotic plasmid (pSym that is present also in Rhizobium leucaenae CFN 299, a rhizobium displaying a similar host range. This pSym seems to have arisen by a co-integration event between two replicons. Remarkably, three distinct nodA genes were found in the pSym, a characteristic that may contribute to the broad host range of these rhizobia. Genes for biosynthesis and modulation of plant-hormone levels were also identified in the pSym. Analysis of genes involved in stress response showed that CIAT 899 and PRF 81 are well equipped to cope with low pH, high temperatures and also with oxidative and osmotic stresses. Interestingly, the genomes of CIAT 899 and PRF 81 had large numbers of genes encoding drug-efflux systems, which may explain their high resistance to antimicrobials. Genome analysis also revealed a wide array of traits that may allow these strains to be successful rhizosphere colonizers, including surface polysaccharides, uptake transporters and catabolic enzymes for nutrients, diverse iron-acquisition systems, cell wall-degrading enzymes, type I and IV pili, and novel T1SS and T5SS secreted adhesins. Conclusions Availability of the complete genome sequences of CIAT 899 and PRF 81 may be exploited in further efforts to understand the interaction of tropical
Toxoplasma gondii is an important protozoan parasite that infects all warm-blooded animals and causes opportunistic infections in immuno-compromised humans. Its closest relative, Neospora caninum, is an important veterinary pathogen that causes spontaneous abortion in livestock. Comparative genomics of these two closely related coccidians has been of particular interest to identify genes that contribute to varied host cell specificity and disease. Here, we describe a manual evaluation of these genomes based on strand-specific RNA sequencing and shotgun proteomics from the invasive tachyzoite stages of these two parasites. We have corrected predicted structures of over one third of the previously annotated gene models and have annotated untranslated regions (UTRs) in over half of the predicted protein-coding genes. We observe distinctly long UTRs in both the organisms, almost four times longer than other model eukaryotes. We have also identified a putative set of cis-natural antisense transcripts (cis-NATs) and long intergenic non-coding RNAs (lincRNAs). We have significantly improved the annotation quality in these genomes that would serve as a manually curated dataset for Toxoplasma and Neospora research communities.
Hysi, Pirro G.; Mahroo, Omar A.; Cumberland, Phillippa; Wojciechowski, Robert; Williams, Katie M.; Young, Terri L.; Mackey, David A.; Rahi, Jugnoo S.; Hammond, Christopher J.
IMPORTANCE To date, relatively few genes responsible for a fraction of heritability have been identified by means of large genetic association studies of refractive error. OBJECTIVE To explore the genetic mechanisms that lead to refractive error in the general population. DESIGN, SETTING, AND PARTICIPANTS Genome-wide association studies were carried out in 2 British population-based independent cohorts (N = 5928 participants) to identify genes moderately associated with refractive error. MAIN OUTCOMES AND MEASURES Enrichment analyses were used to identify sets of genes overrepresented in both cohorts. Enriched groups of genes were compared between both participating cohorts as a further measure against random noise. RESULTS Groups of genes enriched at highly significant statistical levels were remarkably consistent in both cohorts. In particular, these results indicated that plasma membrane (P = 7.64 × 10−30), cell-cell adhesion (P = 2.42 × 10−18), synaptic transmission (P = 2.70 × 10−14), calcium ion binding (P = 3.55 × 10−15), and cation channel activity (P = 2.77 × 10−14) were significantly overrepresented in relation to refractive error. CONCLUSIONS AND RELEVANCE These findings provide evidence that development of refractive error in the general population is related to the intensity of photosignal transduced from the retina, which may have implications for future interventions to minimize this disorder. Pathways connected to the procession of the nerve impulse are major mechanisms involved in the development of refractive error in populations of European origin. PMID:24264139
Quentin Cronk; Oriane Hidalgo; Jaume Pellicer; Diana Percy; Ilia Leitch
Abstract Background The common stinging nettle, Urtica dioica L. sensu lato, is an invertebrate "superhost", its clonal patches maintaining large populations of insects and molluscs. It is extremely widespread in Europe and highly variable, and two ploidy levels (diploid and tetraploid) are known. However, geographical patterns in cytotype variation require further study. New information We assembled a collection of nettles in conjunction with a transect of Europe from the Aegean to Arctic No...
Simakov, Oleg; Marletaz, Ferdinand; Cho, Sung-Jin; Edsinger-Gonzales, Eric; Havlak, Paul; Hellsten, Uffe; Kuo, Dian-Han; Larsson, Tomas; Lv, Jie; Arendt, Detlev; Savage, Robert; Osoegawa, Kazutoyo; de Jong, Pieter; Grimwood, Jane; Chapman, Jarrod A.; Shapiro, Harris; Otillar, Robert P.; Terry, Astrid Y.; Boore, Jeffrey L.; Grigoriev, Igor V.; Lindberg, David R.; Seaver, Elaine C.; Weisblat, David A.; Putnam, Nicholas H.; Rokhsar, Daniel S.; Aerts, Andrea
Current genomic perspectives on animal diversity neglect two prominent phyla, the molluscs and annelids, that together account for nearly one-third of known marine species and are important both ecologically and as experimental systems in classical embryology1, 2, 3. Here we describe the draft genomes of the owl limpet (Lottia gigantea), a marine polychaete (Capitella teleta) and a freshwater leech (Helobdella robusta), and compare them with other animal genomes to investigate the origin and diversification of bilaterians from a genomic perspective. We find that the genome organization, gene structure and functional content of these species are more similar to those of some invertebrate deuterostome genomes (for example, amphioxus and sea urchin) than those of other protostomes that have been sequenced to date (flies, nematodes and flatworms). The conservation of these genomic features enables us to expand the inventory of genes present in the last common bilaterian ancestor, establish the tripartite diversification of bilaterians using multiple genomic characteristics and identify ancient conserved long- and short-range genetic linkages across metazoans. Superimposed on this broadly conserved pan-bilaterian background we find examples of lineage-specific genome evolution, including varying rates of rearrangement, intron gain and loss, expansions and contractions of gene families, and the evolution of clade-specific genes that produce the unique content of each genome.
Qin, Qi-Long; Xie, Bin-Bin; Yu, Yong; Shu, Yan-Li; Rong, Jin-Cheng; Zhang, Yan-Jiao; Zhao, Dian-Li; Chen, Xiu-Lan; Zhang, Xi-Ying; Chen, Bo; Zhou, Bai-Cheng; Zhang, Yu-Zhong
To what extent the genomes of different species belonging to one genus can be diverse and the relationship between genomic differentiation and environmental factor remain unclear for oceanic bacteria. With many new bacterial genera and species being isolated from marine environments, this question warrants attention. In this study, we sequenced all the type strains of the published species of Glaciecola, a recently defined cold-adapted genus with species from diverse marine locations, to study the genomic diversity and cold-adaptation strategy in this genus.The genome size diverged widely from 3.08 to 5.96 Mb, which can be explained by massive gene gain and loss events. Horizontal gene transfer and new gene emergence contributed substantially to the genome size expansion. The genus Glaciecola had an open pan-genome. Comparative genomic research indicated that species of the genus Glaciecola had high diversity in genome size, gene content and genetic relatedness. This may be prevalent in marine bacterial genera considering the dynamic and complex environments of the ocean. Species of Glaciecola had some common genomic features related to cold adaptation, which enable them to thrive and play a role in biogeochemical cycle in the cold marine environments.
Daunert, Sylvia; Sittampalam, Gurusingham Sitta; Goldschmidt-Clermont, Pascal J
Alzheimer's drugs are failing at a rate of 99.6%, and success rate for drugs designed to help patients with this form of dementia is 47 times less than for drugs designed to help patients with cancers ( www.scientificamerican.com/article/why-alzheimer-s-drugs-keep-failing/2014 ). How can it be so difficult to produce a valuable drug for Alzheimer's disease? Each human has a unique genetic and epigenetic makeup, thus endowing individuals with a highly unique complement of genes, polymorphisms, mutations, RNAs, proteins, lipids, and complex sugars, resulting in distinct genome, proteome, metabolome, and also microbiome identity. This editorial is taking into account the uniqueness of each individual and surrounding environment, and stresses the point that a more accurate definition of a "common" disorder could be simply the amalgamation of a myriad of "rare" diseases. These rare diseases are being grouped together because they share a rather constant complement of common features and, indeed, generally respond to empirically developed treatments, leading to a positive outcome consistently. We make the case that it is highly unlikely that such treatments, despite their statistical success measured with large cohorts using standardized clinical research, will be effective on all patients until we increase the depth and fidelity of our understanding of the individual "rare" diseases that are grouped together in the "buckets" of common illnesses. Antioxid. Redox Signal. 27, 511-516.
Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín
Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org.
Full Text Available Abstract Background The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes. Results PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function. Conclusion PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any
Vang, Russell; Levine, Douglas A; Soslow, Robert A; Zaloudek, Charles; Shih, Ie-Ming; Kurman, Robert J
The Cancer Genome Atlas has reported that 96% of ovarian high-grade serous carcinomas (HGSCs) have TP53 somatic mutations suggesting that mutation of this gene is a defining feature of this neoplasm. In the current study, 5 gynecologic pathologists independently evaluated hematoxylin and eosin slides of 14 available cases from The Cancer Genome Atlas classified as HGSC that lacked a TP53 mutation. The histologic diagnoses rendered by these pathologists and the accompanying molecular genetic data are the subject of this report. Only 1 case (Case 5), which contained a homozygous deletion of TP53, had unanimous interobserver agreement for a diagnosis of pure HGSC. In 1 case (Case 3), all 5 observers (100%) rendered a diagnosis of HGSC; however, 3 observers (60%) noted that the histologic features were not classic for HGSC and suggested this case may have arisen from a low-grade serous carcinoma (arisen from an alternate pathway compared with the usual HGSC). In 2 cases (Cases 4 and 12), only 3 observers (60%) in each case, respectively, interpreted it as having a component of HGSC. In the remaining 10 (71%) of tumors (Cases 1, 2, 6-11, 13, and 14), the consensus diagnosis was not HGSC, with individual diagnoses including low-grade serous carcinoma, high-grade endometrioid carcinoma, HGSC, metastatic carcinoma, clear cell carcinoma, atypical proliferative (borderline) serous tumor, and adenocarcinoma, not otherwise specified. Therefore, 13 (93%) of the tumors (Cases 1-4 and 6-14) were either not a pure HGSC or represented a diagnosis other than HGSC, all with molecular results not characteristic of HGSC. Accordingly, our review of the TP53 wild-type HGSCs reported in The Cancer Genome Atlas suggests that 100% of de novo HGSCs contain TP53 somatic mutations or deletions, with the exception of the rare HGSCs that develop from a low-grade serous tumor precursor. We, therefore, propose that lack of molecular alterations of TP53 are essentially inconsistent with the
Focus on the Involvement of the Nose and Paranasal Sinuses in Eosinophilic Granulomatosis with Polyangiitis (Churg-Strauss Syndrome): Nasal Cytology Reveals Infiltration of Eosinophils as a Very Common Feature.
Seccia, Veronica; Baldini, Chiara; Latorre, Manuela; Gelardi, Matteo; Dallan, Iacopo; Cristofani-Mencacci, Lodovica; Sellari-Franceschini, Stefano; Bartoli, Maria Laura; Bacci, Elena; Paggiaro, Pierluigi
Eosinophilic granulomatosis with polyangiitis (EGPA) is a necrotizing vasculitis that predominantly affects small- to medium-sized vessels. It is characterized by a wide spectrum of extrapulmonary symptoms, including sinonasal and paranasal sinus abnormalities. These are the most common features of this disease, constituting diagnostic criteria for EGPA. However, the actual clinical features, cellular mechanisms and impact on patients' quality of life (QoL) are still a matter of study. Thirty-nine EGPA patients underwent multidimensional rhinological evaluations, including rhinofibroscopy, nasal cytology, and QoL questionnaires. This was coupled with respiratory and rheumatological assessments. Twenty-eight patients were diagnosed with chronic rhinosinusitis (CRS). Of these, 18 had nasal polyposis (NP). Chronic rhinitis was diagnosed in 10 patients. Of these, 3 had allergic rhinitis (AR) and seven had non-AR (NAR). Overall, only 1 patient (2.6%) was normal. Nasal cytology showed that hypereosinophilia was present in 17/28 patients with CRS, 4/7 patients with NAR and all patients with AR. SNOT-22 and SF-36 showed a severe impact of nasal symptoms on QoL. No differences in asthma control or rheumatological patterns for EGPA were observed among patients with or without NP. Even when the rheumatological assessment scored EGPA "under control" according to the Birmingham Vasculitis Activity Score and Vasculitis Damage Index, sinonasal diseases and related nasal inflammatory processes were not controlled. Therefore, there is a need for clinical monitoring and targeted treatment to control the inflammatory processes and improve the QoL of EGPA patients. © 2018 S. Karger AG, Basel.
Go, Yun Young; Bailey, Ernest; Cook, Deborah G.; Coleman, Stephen J.; MacLeod, James N.; Chen, Kuey-Chu; Timoney, Peter J.; Balasuriya, Udeni B. R.
Previously, we have shown that horses could be divided into susceptible and resistant groups based on an in vitro assay using dual-color flow cytometric analysis of CD3+ T cells infected with equine arteritis virus (EAV). Here, we demonstrate that the differences in in vitro susceptibility of equine CD3+ T lymphocytes to EAV infection have a genetic basis. To investigate the possible hereditary basis for this trait, we conducted a genome-wide association study (GWAS) to compare susceptible and resistant phenotypes. Testing of 267 DNA samples from four horse breeds that had a susceptible or a resistant CD3+ T lymphocyte phenotype using both Illumina Equine SNP50 BeadChip and Sequenom's MassARRAY system identified a common, genetically dominant haplotype associated with the susceptible phenotype in a region of equine chromosome 11 (ECA11), positions 49572804 to 49643932. The presence of a common haplotype indicates that the trait occurred in a common ancestor of all four breeds, suggesting that it may be segregated among other modern horse breeds. Biological pathway analysis revealed several cellular genes within this region of ECA11 encoding proteins associated with virus attachment and entry, cytoskeletal organization, and NF-κB pathways that may be associated with the trait responsible for the in vitro susceptibility/resistance of CD3+ T lymphocytes to EAV infection. The data presented in this study demonstrated a strong association of genetic markers with the trait, representing de facto proof that the trait is under genetic control. To our knowledge, this is the first GWAS of an equine infectious disease and the first GWAS of equine viral arteritis. PMID:21994447
Objectivity of rights must be rebuilt in a dimension not only structural but also functional, despite being a "thing" that exactly repeats itself, it differs depending on what the person intended to make.
Full Text Available Glycated hemoglobin (HbA1c is used to diagnose type 2 diabetes (T2D and assess glycemic control in patients with diabetes. Previous genome-wide association studies (GWAS have identified 18 HbA1c-associated genetic variants. These variants proved to be classifiable by their likely biological action as erythrocytic (also associated with erythrocyte traits or glycemic (associated with other glucose-related traits. In this study, we tested the hypotheses that, in a very large scale GWAS, we would identify more genetic variants associated with HbA1c and that HbA1c variants implicated in erythrocytic biology would affect the diagnostic accuracy of HbA1c. We therefore expanded the number of HbA1c-associated loci and tested the effect of genetic risk-scores comprised of erythrocytic or glycemic variants on incident diabetes prediction and on prevalent diabetes screening performance. Throughout this multiancestry study, we kept a focus on interancestry differences in HbA1c genetics performance that might influence race-ancestry differences in health outcomes.Using genome-wide association meta-analyses in up to 159,940 individuals from 82 cohorts of European, African, East Asian, and South Asian ancestry, we identified 60 common genetic variants associated with HbA1c. We classified variants as implicated in glycemic, erythrocytic, or unclassified biology and tested whether additive genetic scores of erythrocytic variants (GS-E or glycemic variants (GS-G were associated with higher T2D incidence in multiethnic longitudinal cohorts (N = 33,241. Nineteen glycemic and 22 erythrocytic variants were associated with HbA1c at genome-wide significance. GS-G was associated with higher T2D risk (incidence OR = 1.05, 95% CI 1.04-1.06, per HbA1c-raising allele, p = 3 × 10-29; whereas GS-E was not (OR = 1.00, 95% CI 0.99-1.01, p = 0.60. In Europeans and Asians, erythrocytic variants in aggregate had only modest effects on the diagnostic accuracy of HbA1c. Yet, in
Kumagai, Masahiko; Tanaka, Tsuyoshi; Ohyanagi, Hajime; Hsing, Yue-Ie C.; Itoh, Takeshi
This chapter summarizes recent data obtained from genome sequencing, annotation projects, and studies on the genome diversity of Oryza sativa and related Oryza species. O. sativa, commonly known as Asian rice, is the first monocot species whose complete genome sequence was deciphered based on physical mapping by an international collaborative effort. This genome, along with its accurate and comprehensive annotation, has become an indispensable foundation for crop genomics and breeding. With the development of innovative sequencing technologies, genomic studies of O. sativa have dramatically increased; in particular, a large number of cultivars and wild accessions have been sequenced and compared with the reference rice genome. Since de novo genome sequencing has become cost-effective, the genome of African cultivated rice, O. glaberrima, has also been determined. Comparative genomic studies have highlighted the independent domestication processes of different rice species, but it also turned out that Asian and African rice share a common gene set that has experienced similar artificial selection. An international project aimed at constructing reference genomes and examining the genome diversity of wild Oryza species is currently underway, and the genomes of some species are publicly available. This project provides a platform for investigations such as the evolution, development, polyploidization, and improvement of crops. Studies on the genomic diversity of Oryza species, including wild species, should provide new insights to solve the problem of growing food demands in the face of rapid climatic changes.
This chapter summarizes recent data obtained from genome sequencing, annotation projects, and studies on the genome diversity of Oryza sativa and related Oryza species. O. sativa, commonly known as Asian rice, is the first monocot species whose complete genome sequence was deciphered based on physical mapping by an international collaborative effort. This genome, along with its accurate and comprehensive annotation, has become an indispensable foundation for crop genomics and breeding. With the development of innovative sequencing technologies, genomic studies of O. sativa have dramatically increased; in particular, a large number of cultivars and wild accessions have been sequenced and compared with the reference rice genome. Since de novo genome sequencing has become cost-effective, the genome of African cultivated rice, O. glaberrima, has also been determined. Comparative genomic studies have highlighted the independent domestication processes of different rice species, but it also turned out that Asian and African rice share a common gene set that has experienced similar artificial selection. An international project aimed at constructing reference genomes and examining the genome diversity of wild Oryza species is currently underway, and the genomes of some species are publicly available. This project provides a platform for investigations such as the evolution, development, polyploidization, and improvement of crops. Studies on the genomic diversity of Oryza species, including wild species, should provide new insights to solve the problem of growing food demands in the face of rapid climatic changes.
Chen, Fei; Dong, Wei; Zhang, Jiawei; Guo, Xinyue; Chen, Junhao; Wang, Zhengjia; Lin, Zhenguo; Tang, Haibao; Zhang, Liangsheng
Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of human, animals, and the planet earth. Despite the numerous advances in genome reports or sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in the database reconstruction in the last few years, here we provide a comprehensive review for three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of databases and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular for specific group of researchers, while a timely-updated comprehensive database is more powerful for address of major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote the collaborative studies of important questions in plant biology.
Full Text Available BACKGROUND: Spirodela polyrhiza is a species of the order Alismatales, which represent the basal lineage of monocots with more ancestral features than the Poales. Its complete sequence of the mitochondrial (mt genome could provide clues for the understanding of the evolution of mt genomes in plant. METHODS: Spirodela polyrhiza mt genome was sequenced from total genomic DNA without physical separation of chloroplast and nuclear DNA using the SOLiD platform. Using a genome copy number sensitive assembly algorithm, the mt genome was successfully assembled. Gap closure and accuracy was determined with PCR products sequenced with the dideoxy method. CONCLUSIONS: This is the most compact monocot mitochondrial genome with 228,493 bp. A total of 57 genes encode 35 known proteins, 3 ribosomal RNAs, and 19 tRNAs that recognize 15 amino acids. There are about 600 RNA editing sites predicted and three lineage specific protein-coding-gene losses. The mitochondrial genes, pseudogenes, and other hypothetical genes (ORFs cover 71,783 bp (31.0% of the genome. Imported plastid DNA accounts for an additional 9,295 bp (4.1% of the mitochondrial DNA. Absence of transposable element sequences suggests that very few nuclear sequences have migrated into Spirodela mtDNA. Phylogenetic analysis of conserved protein-coding genes suggests that Spirodela shares the common ancestor with other monocots, but there is no obvious synteny between Spirodela and rice mtDNAs. After eliminating genes, introns, ORFs, and plastid-derived DNA, nearly four-fifths of the Spirodela mitochondrial genome is of unknown origin and function. Although it contains a similar chloroplast DNA content and range of RNA editing as other monocots, it is void of nuclear insertions, active gene loss, and comprises large regions of sequences of unknown origin in non-coding regions. Moreover, the lack of synteny with known mitochondrial genomic sequences shed new light on the early evolution of monocot
Schaub Jr, Gary John
(PME)? I suggest three alternative paths that increased cooperation in PME at the level of the command and staff course could take: a Nordic Defence College, standardized national command and staff courses, and a core curriculum of common courses for common purposes. I conclude with a discussion of how...
Olszak, Andrzej; Jørgensen, Bo Nørregaard
Java programs called Featureous that addresses this issue. Featureous allows a programmer to easily establish feature-code traceability links and to analyze their characteristics using a number of visualizations. Featureous is an extension to the NetBeans IDE, and can itself be extended by third...
There are many common software patterns and utilities for the ORNL Quantum Computing Institute that can and should be shared across projects. Otherwise we find duplication of code which adds unwanted complexity. This is a software product seeks to alleviate this by providing common utilities such as object factories, graph data structures, parameter input mechanisms, etc., for other software products within the ORNL Quantum Computing Institute. This work enables pure basic research, has no export controlled utilities, and has no real commercial value.
Baran, Joachim; Durgahee, Bibi Sehnaaz Begum; Eilbeck, Karen; Antezana, Erick; Hoehndorf, Robert; Dumontier, Michel
Availability and implementation. The latest stable release of the ontology is available via its base URI; previous and development versions are available at the ontology’s GitHub repository: https://github.com/BioInterchange/Ontologies; versions of the ontology are indexed through BioPortal (without external class-/property-equivalences due to BioPortal release 4.10 limitations); examples and reference documentation is provided on a separate web-page: http://www.biointerchange.org/ontologies.html. GFVO version 1.0.2 is licensed under the CC0 1.0 Universal license (https://creativecommons.org/publicdomain/zero/1.0) and therefore de facto within the public domain; the ontology can be appropriated without attribution for commercial and non-commercial use.
Barbrook, Adrian C.
Dinoflagellate algae of the genus Symbiodinium form important symbioses within corals and other benthic marine animals. Dinoflagellates possess an extremely reduced plastid genome relative to those examined in plants and other algae. In dinoflagellates the plastid genes are located on small plasmids, commonly referred to as \\'minicircles\\'. However, the chloroplast genomes of dinoflagellates have only been extensively characterised from a handful of species. There is also evidence of considerable variation in the chloroplast genome organisation across those species that have been examined. We therefore characterised the chloroplast genome from an environmental coral isolate, in this case containing a symbiont belonging to the Symbiodinium sp. clade C3. The gene content of the genome is well conserved with respect to previously characterised genomes. However, unlike previously characterised dinoflagellate chloroplast genomes we did not identify any \\'empty\\' minicircles. The sequences of this chloroplast genome show a high rate of evolution relative to other algal species. Particularly notable was a surprisingly high level of sequence divergence within the core polypeptides of photosystem I, the reasons for which are currently unknown. This chloroplast genome also possesses distinctive codon usage and GC content. These features suggest that chloroplast genomes in Symbiodinium are highly plastic. © 2013 Adrian C. Barbrook.
Barbrook, Adrian C.; Voolstra, Christian R.; Howe, Christopher J.
Dinoflagellate algae of the genus Symbiodinium form important symbioses within corals and other benthic marine animals. Dinoflagellates possess an extremely reduced plastid genome relative to those examined in plants and other algae. In dinoflagellates the plastid genes are located on small plasmids, commonly referred to as 'minicircles'. However, the chloroplast genomes of dinoflagellates have only been extensively characterised from a handful of species. There is also evidence of considerable variation in the chloroplast genome organisation across those species that have been examined. We therefore characterised the chloroplast genome from an environmental coral isolate, in this case containing a symbiont belonging to the Symbiodinium sp. clade C3. The gene content of the genome is well conserved with respect to previously characterised genomes. However, unlike previously characterised dinoflagellate chloroplast genomes we did not identify any 'empty' minicircles. The sequences of this chloroplast genome show a high rate of evolution relative to other algal species. Particularly notable was a surprisingly high level of sequence divergence within the core polypeptides of photosystem I, the reasons for which are currently unknown. This chloroplast genome also possesses distinctive codon usage and GC content. These features suggest that chloroplast genomes in Symbiodinium are highly plastic. © 2013 Adrian C. Barbrook.
Full Text Available Transposons are ubiquitous genomic components that play pivotal roles in plant gene and genome evolution. We analyzed two genome sequences of common bean (Phaseolus vulgaris and identified a new CACTA transposon family named pvCACTA1. The family is extremely abundant, as more than 12,000 pvCACTA1 elements were found. To our knowledge, this is the most abundant CACTA family reported thus far. The computational and fluorescence in situ hybridization (FISH analyses indicated that the pvCACTA1 elements were concentrated in terminal regions of chromosomes and frequently generated AT-rich 3 bp target site duplications (TSD, WWW, W is A or T. Comparative analysis of the common bean genomes from two domesticated genetic pools revealed that new insertions or excisions of pvCACTA1 elements occurred after the divergence of the two common beans, and some of the polymorphic elements likely resulted in variation in gene sequences. pvCACTA1 elements were detected in related species but not outside the Phaseolus genus. We calculated the molecular evolutionary rate of pvCACTA1 transposons using orthologous elements that indicated that most transposition events likely occurred before the divergence of the two gene pools. These results reveal unique features and evolution of this new transposon family in the common bean genome.
Taam, Ronald E.; Ricker, Paul M.
The common envelope phase of binary star evolution plays a central role in many evolutionary pathways leading to the formation of compact objects in short period systems. Using three dimensional hydrodynamical computations, we review the major features of this evolutionary phase, focusing on the
Tavares, Sílvia; Ramos, Ana Paula; Pires, Ana Sofia; Azinheira, Helena G; Caldeirinha, Patrícia; Link, Tobias; Abranches, Rita; Silva, Maria do Céu; Voegele, Ralf T; Loureiro, João; Talhinhas, Pedro
Rust fungi (Basidiomycota, Pucciniales) are biotrophic plant pathogens which exhibit diverse complexities in their life cycles and host ranges. The completion of genome sequencing of a few rust fungi has revealed the occurrence of large genomes. Sequencing efforts for other rust fungi have been hampered by uncertainty concerning their genome sizes. Flow cytometry was recently applied to estimate the genome size of a few rust fungi, and confirmed the occurrence of large genomes in this order (averaging 225.3 Mbp, while the average for Basidiomycota was 49.9 Mbp and was 37.7 Mbp for all fungi). In this work, we have used an innovative and simple approach to simultaneously isolate nuclei from the rust and its host plant in order to estimate the genome size of 30 rust species by flow cytometry. Genome sizes varied over 10-fold, from 70 to 893 Mbp, with an average genome size value of 380.2 Mbp. Compared to the genome sizes of over 1800 fungi, Gymnosporangium confusum possesses the largest fungal genome ever reported (893.2 Mbp). Moreover, even the smallest rust genome determined in this study is larger than the vast majority of fungal genomes (94%). The average genome size of the Pucciniales is now of 305.5 Mbp, while the average Basidiomycota genome size has shifted to 70.4 Mbp and the average for all fungi reached 44.2 Mbp. Despite the fact that no correlation could be drawn between the genome sizes, the phylogenomics or the life cycle of rust fungi, it is interesting to note that rusts with Fabaceae hosts present genomes clearly larger than those with Poaceae hosts. Although this study comprises only a small fraction of the more than 7000 rust species described, it seems already evident that the Pucciniales represent a group where genome size expansion could be a common characteristic. This is in sharp contrast to sister taxa, placing this order in a relevant position in fungal genomics research.
Home; Journals; Resonance – Journal of Science Education. Feature Article. Articles in Resonance – Journal of Science Education. Volume 1 Issue 1 January 1996 pp 80-85 Feature Article. What's New in Computers Windows 95 · Vijnan Shastri · More Details Fulltext PDF. Volume 1 Issue 1 January 1996 pp 86-89 Feature ...
En Creative Commons licens giver en forfatter mulighed for at udbyde sit værk i en alternativ licensløsning, som befinder sig på forskellige trin på en skala mellem yderpunkterne "All rights reserved" og "No rights reserved". Derved opnås licensen "Some rights reserved"......En Creative Commons licens giver en forfatter mulighed for at udbyde sit værk i en alternativ licensløsning, som befinder sig på forskellige trin på en skala mellem yderpunkterne "All rights reserved" og "No rights reserved". Derved opnås licensen "Some rights reserved"...
SCP: Creative Commons licensing for open access publishing, Open Access Law journal-author agreements for converting journals to open access, and the Scholar's Copyright Addendum Engine for retaining rights to self-archive in meaningful formats and locations for future re-use. More than 250 science and technology journals already publish under Creative Commons licensing while 35 law journals utilize the Open Access Law agreements. The Addendum Engine is a new tool created in partnership with SPARC and U.S. universities. View John Wilbanks's biography
Cai, Binghuang; Li, Biao; Kiga, Nikki; Thusberg, Janita; Bergquist, Timothy; Chen, Yun-Ching; Niknafs, Noushin; Carter, Hannah; Tokheim, Collin; Beleva-Guthrie, Violeta; Douville, Christopher; Bhattacharya, Rohit; Yeo, Hui Ting Grace; Fan, Jean; Sengupta, Sohini; Kim, Dewey; Cline, Melissa; Turner, Tychele; Diekhans, Mark; Zaucha, Jan; Pal, Lipika R; Cao, Chen; Yu, Chen-Hsin; Yin, Yizhou; Carraro, Marco; Giollo, Manuel; Ferrari, Carlo; Leonardi, Emanuela; Tosatto, Silvio C E; Bobe, Jason; Ball, Madeleine; Hoskins, Roger A; Repo, Susanna; Church, George; Brenner, Steven E; Moult, John; Gough, Julian; Stanke, Mario; Karchin, Rachel; Mooney, Sean D
The advent of next-generation sequencing has dramatically decreased the cost for whole-genome sequencing and increased the viability for its application in research and clinical care. The Personal Genome Project (PGP) provides unrestricted access to genomes of individuals and their associated phenotypes. This resource enabled the Critical Assessment of Genome Interpretation (CAGI) to create a community challenge to assess the bioinformatics community's ability to predict traits from whole genomes. In the CAGI PGP challenge, researchers were asked to predict whether an individual had a particular trait or profile based on their whole genome. Several approaches were used to assess submissions, including ROC AUC (area under receiver operating characteristic curve), probability rankings, the number of correct predictions, and statistical significance simulations. Overall, we found that prediction of individual traits is difficult, relying on a strong knowledge of trait frequency within the general population, whereas matching genomes to trait profiles relies heavily upon a small number of common traits including ancestry, blood type, and eye color. When a rare genetic disorder is present, profiles can be matched when one or more pathogenic variants are identified. Prediction accuracy has improved substantially over the last 6 years due to improved methodology and a better understanding of features. © 2017 Wiley Periodicals, Inc.
Checcucci, Alice; Mengoni, Alessio
Integrated Microbial Genomes and Metagenomes (IMG) is a biocomputational system that allows to provide information and support for annotation and comparative analysis of microbial genomes and metagenomes. IMG has been developed by the US Department of Energy (DOE)-Joint Genome Institute (JGI). IMG platform contains both draft and complete genomes, sequenced by Joint Genome Institute and other public and available genomes. Genomes of strains belonging to Archaea, Bacteria, and Eukarya domains are present as well as those of viruses and plasmids. Here, we provide some essential features of IMG system and case study for pangenome analysis.
Zhao, Meixia; Du, Jianchang; Lin, Feng; Tong, Chaobo; Yu, Jingyin; Huang, Shunmou; Wang, Xiaowu; Liu, Shengyi; Ma, Jianxin
Recent sequencing of the Brassica rapa and Brassica oleracea genomes revealed extremely contrasting genomic features such as the abundance and distribution of transposable elements between the two genomes. However, whether and how these structural differentiations may have influenced the evolutionary rates of the two genomes since their split from a common ancestor are unknown. Here, we investigated and compared the rates of nucleotide substitution between two long terminal repeats (LTRs) of individual orthologous LTR-retrotransposons, the rates of synonymous and non-synonymous substitution among triplicated genes retained in both genomes from a shared whole genome triplication event, and the rates of genetic recombination estimated/deduced by the comparison of physical and genetic distances along chromosomes and ratios of solo LTRs to intact elements. Overall, LTR sequences and genic sequences showed more rapid nucleotide substitution in B. rapa than in B. oleracea. Synonymous substitution of triplicated genes retained from a shared whole genome triplication was detected at higher rates in B. rapa than in B. oleracea. Interestingly, non-synonymous substitution was observed at lower rates in the former than in the latter, indicating shifted densities of purifying selection between the two genomes. In addition to evolutionary asymmetry, orthologous genes differentially regulated and/or disrupted by transposable elements between the two genomes were also characterized. Our analyses suggest that local genomic and epigenomic features, such as recombination rates and chromatin dynamics reshaped by independent proliferation of transposable elements and elimination between the two genomes, are perhaps partially the causes and partially the outcomes of the observed inter-specific asymmetric evolution. © 2013 Purdue University The Plant Journal © 2013 John Wiley & Sons Ltd.
Non-random features in the genomic data are usually biologically meaningful. The key is to choose the feature well. Having a p-value based score prioritizes the findings. If two proteins share a unusually large number of common interaction partners, they tend to be involved in the same biological process. We used this finding to predict the functions of 81 un-annotated proteins in yeast.
Stano, Matej; Klucar, Lubos
phiGENOME is a web-based genome browser generating dynamic and interactive graphical representation of phage genomes stored in the phiSITE, database of gene regulation in bacteriophages. phiGENOME is an integral part of the phiSITE web portal (http://www.phisite.org/phigenome) and it was optimised for visualisation of phage genomes with the emphasis on the gene regulatory elements. phiGENOME consists of three components: (i) genome map viewer built using Adobe Flash technology, providing dynamic and interactive graphical display of phage genomes; (ii) sequence browser based on precisely formatted HTML tags, providing detailed exploration of genome features on the sequence level and (iii) regulation illustrator, based on Scalable Vector Graphics (SVG) and designed for graphical representation of gene regulations. Bringing 542 complete genome sequences accompanied with their rich annotations and references, makes phiGENOME a unique information resource in the field of phage genomics. Copyright Â© 2011 Elsevier Inc. All rights reserved.
In referring to issues confronting the energy field in this region and options to be exercised in the future, I would like to mention the fundamental condition of the utmost importance. That can be summed up as follows: any subject in energy area can never be solved by one country alone, given the geographical and geopolitical characteristics intrinsically possessed by energy. So, a regional approach is needed and it is especially necessary for the main players in the region to jointly address problems common to them. Though it may be a matter to be pursued in the distant future, I am personally dreaming a 'Common Energy Market for Northeast Asia,' in which member countries' interests are adjusted so that the market can be integrated and the region can become a most economically efficient market, thus formulating an effective power to encounter the outside. It should be noted that Europe needed forty years to integrate its market as the unified common market. It is necessary for us to follow a number of steps over the period to eventually materialize our common market concept, too. Now is the time for us to take a first step to lay the foundation for our descendants to enjoy prosperity from such a common market.
Anderson, Iain; Sorokin, Alexei; Kapatral, Vinayak; Reznik, Gary; Bhattacharya, Anamitra; Mikhailova, Natalia; Burd, Henry; Joukov, Victor; Kaznadzey, Denis; Walunas, Theresa; D' Souza, Mark; Larsen, Niels; Pusch,Gordon; Liolios, Konstantinos; Grechkin, Yuri; Lapidus, Alla; Goltsman,Eugene; Chu, Lien; Fonstein, Michael; Ehrlich, S. Dusko; Overbeek, Ross; Kyrpides, Nikos; Ivanova, Natalia
Genome features of the Bacillus cereus group genomes (representative strains of Bacillus cereus, Bacillus anthracis and Bacillus thuringiensis sub spp israelensis) were analyzed and compared with the Bacillus subtilis genome. A core set of 1,381 protein families among the four Bacillus genomes, with an additional set of 933 families common to the B. cereus group, was identified. Differences in signal transduction pathways, membrane transporters, cell surface structures, cell wall, and S-layer proteins suggesting differences in their phenotype were identified. The B. cereus group has signal transduction systems including a tyrosine kinase related to two-component system histidine kinases from B. subtilis. A model for regulation of the stress responsive sigma factor sigmaB in the B. cereus group different from the well studied regulation in B. subtilis has been proposed. Despite a high degree of chromosomal synteny among these genomes, significant differences in cell wall and spore coat proteins that contribute to the survival and adaptation in specific hosts has been identified.
Sadarangani, Vineet; Datta, Sunando; Arunachalam, Manonmani
Type III secretion system (T3SS) plays an important role in virulence or symbiosis of many pathogenic or symbiotic bacteria [CHM 2:291-294, 2007; Physiology (Bethesda) 20:326-339, 2005]. T3SS acts like a tunnel between a bacterium and its host through which the bacterium injects 'effector' proteins into the latter [Nature 444:567-573, 2006; COSB 18:258-266, 2008]. The effectors spatially and temporally modify the host signalling pathways [FEMS Microbiol Rev 35:1100-1125, 2011; Cell Host Microbe5:571-579, 2009]. In spite its crucial role in host-pathogen interaction, the study of T3SS and the associated effectors has been limited to a few bacteria [Cell Microbiol 13:1858-1869, 2011; Nat Rev Microbiol 6:11-16, 2008; Mol Microbiol 80:1420-1438, 2011]. Before one set out to perform systematic experimental studies on an unknown set of bacteria it would be beneficial to identify the potential candidates by developing an in silico screening algorithm. A system level study would also be advantageous over traditional laboratory methods to extract an overriding theme for host-pathogen interaction, if any, from the vast resources of data generated by sequencing multiple bacterial genomes. We have developed an in silico protocol in which the most conserved set of T3SS proteins was used as the query against the entire bacterial database with increasingly stringent search parameters. It enabled us to identify several uncharacterized T3SS positive bacteria. We adopted a similar strategy to predict the presence of the already known effectors in the newly identified T3SS positive bacteria. The huge resources of biochemical data [FEMS Microbiol Rev 35:1100-1125, 2011; Cell Host Microbe 5:571-579, 2009; BMC Bioinformatics 7(11):S4, 2010] on the T3SS effectors enabled us to search for the common theme in T3SS mediated pathogenesis. We identified few cellular signalling networks in the host, which are manipulated by most of the T3SS containing pathogens. We went on to look for
How are independent schools to be useful to the wider world? Beyond their common commitment to educate their students for meaningful lives in service of the greater good, can they educate a broader constituency and, thus, share their resources and skills more broadly? Their answers to this question will be shaped by their independence. Any…
Jukam, David; Shariati, S Ali M; Skotheim, Jan M
The first major developmental transition in vertebrate embryos is the maternal-to-zygotic transition (MZT) when maternal mRNAs are degraded and zygotic transcription begins. During the MZT, the embryo takes charge of gene expression to control cell differentiation and further development. This spectacular organismal transition requires nuclear reprogramming and the initiation of RNAPII at thousands of promoters. Zygotic genome activation (ZGA) is mechanistically coordinated with other embryonic events, including changes in the cell cycle, chromatin state, and nuclear-to-cytoplasmic component ratios. Here, we review progress in understanding vertebrate ZGA dynamics in frogs, fish, mice, and humans to explore differences and emphasize common features. Copyright © 2017 Elsevier Inc. All rights reserved.
Valles, Yvonne; Boore, Jeffrey L.
Progress in both molecular techniques and phylogeneticmethods has challenged many of the interpretations of traditionaltaxonomy. One example is in the recognition of the animal superphylumLophotrochozoa (annelids, mollusks, echiurans, platyhelminthes,brachiopods, and other phyla), although the relationships within thisgroup and the inclusion of some phyla remain uncertain. While much ofthis progress in phylogenetic reconstruction has been based on comparingsingle gene sequences, we are beginning to see the potential of comparinglarge-scale features of genomes, such as the relative order of genes.Even though tremendous progress is being made on the sequencedetermination of whole nuclear genomes, the dataset of choice forgenome-level characters for many animals across a broad taxonomic rangeremains mitochondrial genomes. We review here what is known aboutmitochondrial genomes of the lophotrochozoans and discuss the promisethat this dataset will enable insight into theirrelationships.
Huang, Chao; Thompson, Paul; Wang, Yalin; Yu, Yang; Zhang, Jingwen; Kong, Dehan; Colen, Rivka R; Knickmeyer, Rebecca C; Zhu, Hongtu
Functional phenotypes (e.g., subcortical surface representation), which commonly arise in imaging genetic studies, have been used to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. However, existing statistical methods largely ignore the functional features (e.g., functional smoothness and correlation). The aim of this paper is to develop a functional genome-wide association analysis (FGWAS) framework to efficiently carry out whole-genome analyses of functional phenotypes. FGWAS consists of three components: a multivariate varying coefficient model, a global sure independence screening procedure, and a test procedure. Compared with the standard multivariate regression model, the multivariate varying coefficient model explicitly models the functional features of functional phenotypes through the integration of smooth coefficient functions and functional principal component analysis. Statistically, compared with existing methods for genome-wide association studies (GWAS), FGWAS can substantially boost the detection power for discovering important genetic variants influencing brain structure and function. Simulation studies show that FGWAS outperforms existing GWAS methods for searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. We have successfully applied FGWAS to large-scale analysis of data from the Alzheimer's Disease Neuroimaging Initiative for 708 subjects, 30,000 vertices on the left and right hippocampal surfaces, and 501,584 SNPs. Copyright © 2017 Elsevier Inc. All rights reserved.
DeLong, Edward F
The complete genome sequence of Thermoplasma acidophilum, an acid- and heat-loving archaeon, has recently been reported. Comparative genomic analysis of this 'extremophile' is providing new insights into the metabolic machinery, ecology and evolution of thermophilic archaea.
Full Text Available The dynamics of reductive genome evolution for eukaryotes living inside other eukaryotic cells are poorly understood compared to well-studied model systems involving obligate intracellular bacteria. Here we present 8.5 Mb of sequence from the genome of the microsporidian Trachipleistophora hominis, isolated from an HIV/AIDS patient, which is an outgroup to the smaller compacted-genome species that primarily inform ideas of evolutionary mode for these enormously successful obligate intracellular parasites. Our data provide detailed information on the gene content, genome architecture and intergenic regions of a larger microsporidian genome, while comparative analyses allowed us to infer genomic features and metabolism of the common ancestor of the species investigated. Gene length reduction and massive loss of metabolic capacity in the common ancestor was accompanied by the evolution of novel microsporidian-specific protein families, whose conservation among microsporidians, against a background of reductive evolution, suggests they may have important functions in their parasitic lifestyle. The ancestor had already lost many metabolic pathways but retained glycolysis and the pentose phosphate pathway to provide cytosolic ATP and reduced coenzymes, and it had a minimal mitochondrion (mitosome making Fe-S clusters but not ATP. It possessed bacterial-like nucleotide transport proteins as a key innovation for stealing host-generated ATP, the machinery for RNAi, key elements of the early secretory pathway, canonical eukaryotic as well as microsporidian-specific regulatory elements, a diversity of repetitive and transposable elements, and relatively low average gene density. Microsporidian genome evolution thus appears to have proceeded in at least two major steps: an ancestral remodelling of the proteome upon transition to intracellular parasitism that involved reduction but also selective expansion, followed by a secondary compaction of genome
Bennetzen, Jeffrey L.; SanMiguel, Phillip; Chen, Mingsheng; Tikhonov, Alexander; Francki, Michael; Avramova, Zoya
For the most part, studies of grass genome structure have been limited to the generation of whole-genome genetic maps or the fine structure and sequence analysis of single genes or gene clusters. We have investigated large contiguous segments of the genomes of maize, sorghum, and rice, primarily focusing on intergenic spaces. Our data indicate that much (>50%) of the maize genome is composed of interspersed repetitive DNAs, primarily nested retrotransposons that in...
Norrild, Bodil; Guldberg, Per; Ralfkiær, Elisabeth Methner
Almost all cells in the human body contain a complete copy of the genome with an estimated number of 25,000 genes. The sequences of these genes make up about three percent of the genome and comprise the inherited set of genetic information. The genome also contains information that determines whe...
Full Text Available We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those for sequence analysis, differential expression analysis and visualization.
Lawrence, Michael; Huber, Wolfgang; Pagès, Hervé; Aboyoun, Patrick; Carlson, Marc; Gentleman, Robert; Morgan, Martin T; Carey, Vincent J
We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those for sequence analysis, differential expression analysis and visualization.
Gaitatzes, Athanasios; Johnson, Sarah H; Smadbeck, James B; Vasmatzis, George
The ability to produce and analyze whole genome sequencing (WGS) data from samples with structural variations (SV) generated the need to visualize such abnormalities in simplified plots. Conventional two-dimensional representations of WGS data frequently use either circular or linear layouts. There are several diverse advantages regarding both these representations, but their major disadvantage is that they do not use the two-dimensional space very efficiently. We propose a layout, termed the Genome U-Plot, which spreads the chromosomes on a two-dimensional surface and essentially quadruples the spatial resolution. We present the Genome U-Plot for producing clear and intuitive graphs that allows researchers to generate novel insights and hypotheses by visualizing SVs such as deletions, amplifications, and chromoanagenesis events. The main features of the Genome U-Plot are its layered layout, its high spatial resolution and its improved aesthetic qualities. We compare conventional visualization schemas with the Genome U-Plot using visualization metrics such as number of line crossings and crossing angle resolution measures. Based on our metrics, we improve the readability of the resulting graph by at least 2-fold, making apparent important features and making it easy to identify important genomic changes. A whole genome visualization tool with high spatial resolution and improved aesthetic qualities. An implementation and documentation of the Genome U-Plot is publicly available at https://github.com/gaitat/GenomeUPlot. email@example.com. Supplementary data are available at Bioinformatics online.
Sasaki, Naobumi V; Sato, Naoki
Cyanobacteria, which perform oxygen-evolving photosynthesis as do chloroplasts of plants and algae, are one of the best-studied prokaryotic phyla and one from which many representative genomes have been sequenced. Lack of a suitable comparative genomic database has been a problem in cyanobacterial genomics because many proteins involved in physiological functions such as photosynthesis and nitrogen fixation are not catalogued in commonly used databases, such as Clusters of Orthologous Proteins (COG). CyanoClust is a database of homolog groups in cyanobacteria and plastids that are produced by the program Gclust. We have developed a web-server system for the protein homology database featuring cyanobacteria and plastids. Database URL: http://cyanoclust.c.u-tokyo.ac.jp/.
Zhang, Xinjian; Zhang, Guangzhi; Yang, Hetong
Arthrobacter species are found ecologically diverse and can survive in various environments. Many strains of these species have metabolic versatility and can degrade many environmental pollutants. Arthrobacter species are thought to play important roles in catabolism of environmental pollutants in nature. In recent years, the genomes of many Arthrobacter strains have been sequenced, which provides comprehensive information to clarify the molecular mechanisms related to environmental adaptability of Arthrobacter species. These genomics findings revealed several features that are commonly observed in Arthrobacter strains allowing for survival under stressful conditions. These include an array of genes associated with sigma factors and responses to oxidative, osmotic, starvation and temperature stresses. The genomics basis of their environmental adaptability are reviewed, which is expected to provide useful information for applying Arthrobacter strains in pollution remediation and shed some light on other bacterial environmental adaptability researches.
Analysis of ORF5 and Full-Length Genome Sequences of Porcine Reproductive and Respiratory Syndrome Virus Isolates of Genotypes 1 and 2 Retrieved Worldwide Provides Evidence that Recombination Is a Common Phenomenon and May Produce Mosaic Isolates
Martín-Valls, G. E.; Kvisgaard, Lise Kirstine; Tello, M.
Recombination is currently recognized as a factor for high genetic diversity, but the frequency of such recombination events and the genome segments involved are not well known. In the present study, we initially focused on the detection of recombinant porcine reproductive and respiratory syndrom...
Wheeler, Eleanor; Leong, Aaron; Liu, Ching-Ti; Hivert, Marie-France; Strawbridge, Rona J; Podmore, Clara; Li, Man; Yao, Jie; Sim, Xueling; Hong, Jaeyoung; Chu, Audrey Y; Zhang, Weihua; Wang, Xu; Chen, Peng; Maruthur, Nisa M; Porneala, Bianca C; Sharp, Stephen J; Jia, Yucheng; Kabagambe, Edmond K; Chang, Li-Ching; Chen, Wei-Min; Elks, Cathy E; Evans, Daniel S; Fan, Qiao; Giulianini, Franco; Go, Min Jin; Hottenga, Jouke-Jan; Hu, Yao; Jackson, Anne U; Kanoni, Stavroula; Kim, Young Jin; Kleber, Marcus E; Ladenvall, Claes; Lecoeur, Cecile; Lim, Sing-Hui; Lu, Yingchang; Mahajan, Anubha; Marzi, Carola; Nalls, Mike A; Navarro, Pau; Nolte, Ilja M; Sanna, Serena; van der Most, Peter J; Van Vliet-Ostaptchouk, Jana V.; Alizadeh, Behrooz Z; Hartman, Catharina A; Swertz, Morris; Oldehinkel, Albertine J; Snieder, Harold; Wolffenbuttel, Bruce H R
Background Glycated hemoglobin (HbA1c) is used to diagnose type 2 diabetes (T2D) and assess glycemic control in patients with diabetes. Previous genome-wide association studies (GWAS) have identified 18 HbA1c-associated genetic variants. These variants proved to be classifiable by their likely
Wheeler, Eleanor; Leong, Aaron; Liu, Ching Ti; Hivert, Marie France; Strawbridge, Rona J.; Podmore, Clara; Li, Man; Yao, Jie; Sim, Xueling; Hong, Jaeyoung; Chu, Audrey Y.; Zhang, Weihua; Wang, Xu; Chen, Peng; Maruthur, Nisa M.; Porneala, Bianca C.; Sharp, Stephen J.; Jia, Yucheng; Kabagambe, Edmond K.; Chang, Li Ching; Chen, Wei Min; Elks, Cathy E.; Evans, Daniel S.; Fan, Qiao; Giulianini, Franco; Go, Min Jin; Hottenga, Jouke Jan; Hu, Yao; Jackson, Anne U.; Kanoni, Stavroula; Kim, Young Jin; Kleber, Marcus E.; Ladenvall, Claes; Lecoeur, Cecile; Lim, Sing Hui; Lu, Yingchang; Mahajan, Anubha; Marzi, Carola; Nalls, Mike A.; Navarro, Pau; Nolte, Ilja M.; Rose, Lynda M.; Rybin, Denis V.; Sanna, Serena; Shi, Yuan; Stram, Daniel O.; Takeuchi, Fumihiko; Tan, Shu Pei; van der Most, Peter J.; Van Vliet-Ostaptchouk, Jana V.; Wong, Andrew; Yengo, Loic; Zhao, Wanting; Goel, Anuj; Martinez Larrad, Maria Teresa; Radke, Dörte; Salo, Perttu; Tanaka, Toshiko; van Iperen, Erik P.A.; Abecasis, Goncalo; Afaq, Saima; Alizadeh, Behrooz Z.; Bertoni, Alain G.; Bonnefond, Amelie; Böttcher, Yvonne; Bottinger, Erwin P.; Campbell, Harry; Carlson, Olga D.; Chen, Chien Hsiun; Cho, Yoon Shin; Garvey, W. Timothy; Gieger, Christian; Goodarzi, Mark O.; Grallert, Harald; Hamsten, Anders; Hartman, Catharina A.; Herder, Christian; Hsiung, Chao Agnes; Huang, Jie; Igase, Michiya; Isono, Masato; Katsuya, Tomohiro; Khor, Chiea Chuen; Kiess, Wieland; Kohara, Katsuhiko; Kovacs, Peter; Lee, Juyoung; Lee, Wen Jane; Lehne, Benjamin; Li, Huaixing; Liu, Jianjun; Lobbens, Stephane; Luan, Jian'an; Lyssenko, Valeriya; Meitinger, Thomas; Miki, Tetsuro; Miljkovic, Iva; Moon, Sanghoon; Mulas, Antonella; Müller, Gabriele; Müller-Nurasyid, Martina; Nagaraja, Ramaiah; Nauck, Matthias; Pankow, James S.; Polasek, Ozren; Prokopenko, Inga; Ramos, Paula S.; Rasmussen-Torvik, Laura; Rathmann, Wolfgang; Rich, Stephen S.; Robertson, Neil R.; Roden, Michael; Roussel, Ronan; Rudan, Igor; Scott, Robert A.; Scott, William R.; Sennblad, Bengt; Siscovick, David S.; Strauch, Konstantin; Sun, Liang; Swertz, Morris; Tajuddin, Salman M.; Taylor, Kent D.; Teo, Yik Ying; Tham, Yih Chung; Tönjes, Anke; Wareham, Nicholas J.; Willemsen, Gonneke; Wilsgaard, Tom; Hingorani, Aroon D.; Egan, Josephine; Ferrucci, Luigi; Hovingh, G. Kees; Jula, Antti; Kivimaki, Mika; Kumari, Meena; Njølstad, Inger; Palmer, Colin N.A.; Serrano Ríos, Manuel; Stumvoll, Michael; Watkins, Hugh; Aung, Tin; Blüher, Matthias; Boehnke, Michael; Boomsma, Dorret I.; Bornstein, Stefan R.; Chambers, John C.; Chasman, Daniel I.; Chen, Yii Der Ida; Chen, Yduan Tsong; Cheng, Ching Yu; Cucca, Francesco; de Geus, Eco J.C.; Deloukas, Panos; Evans, Michele K.; Fornage, Myriam; Friedlander, Yechiel; Froguel, Philippe; Groop, Leif; Gross, Myron D.; Harris, Tamara B.; Hayward, Caroline; Heng, Chew Kiat; Ingelsson, Erik; Kato, Norihiro; Kim, Bong Jo; Koh, Woon Puay; Kooner, Jaspal S.; Körner, Antje; Kuh, Diana; Kuusisto, Johanna; Laakso, Markku; Lin, Xu; Liu, Yongmei; Loos, Ruth J.F.; Magnusson, Patrik K.E.; März, Winfried; McCarthy, Mark I.; Oldehinkel, Albertine J.; Ong, Ken K.; Pedersen, Nancy L.; Pereira, Mark A.; Peters, Annette; Ridker, Paul M.; Sabanayagam, Charumathi; Sale, Michele; Saleheen, Danish; Saltevo, Juha; Schwarz, Peter E.H.; Sheu, Wayne H.H.; Snieder, Harold; Spector, Timothy D.; Tabara, Yasuharu; Tuomilehto, Jaakko; van Dam, Rob M.; Wilson, James G.; Wilson, James F.; Wolffenbuttel, Bruce H.R.; Wong, Tien Yin; Wu, Jer Yuarn; Yuan, Jian Min; Zonderman, Alan B.; Soranzo, Nicole; Guo, Xiuqing; Roberts, David J.; Florez, Jose C.; Sladek, Robert; Dupuis, Josée; Morris, Andrew P.; Tai, E. Shyong; Selvin, Elizabeth; Rotter, Jerome I.; Langenberg, Claudia; Barroso, Inês; Meigs, James B.
Background: Glycated hemoglobin (HbA1c) is used to diagnose type 2 diabetes (T2D) and assess glycemic control in patients with diabetes. Previous genome-wide association studies (GWAS) have identified 18 HbA1c-associated genetic variants. These variants proved to be classifiable by their likely
Wheeler, Eleanor; Leong, Aaron; Liu, Ching-Ti; Hivert, Marie-France; Strawbridge, Rona J.; Podmore, Clara; Li, Man; Yao, Jie; Sim, Xueling; Hong, Jaeyoung; Chu, Audrey Y.; Zhang, Weihua; Wang, Xu; Chen, Peng; Maruthur, Nisa M.; Porneala, Bianca C.; Sharp, Stephen J.; Jia, Yucheng; Kabagambe, Edmond K.; Chang, Li-Ching; Chen, Wei-Min; Elks, Cathy E.; Evans, Daniel S.; Fan, Qiao; Giulianini, Franco; Go, Min Jin; Hottenga, Jouke-Jan; Hu, Yao; Jackson, Anne U.; Kanoni, Stavroula; Kim, Young Jin; Kleber, Marcus E.; Ladenvall, Claes; Lecoeur, Cecile; Lim, Sing-Hui; Lu, Yingchang; Mahajan, Anubha; Marzi, Carola; Nalls, Mike A.; Navarro, Pau; Nolte, Ilja M.; Rose, Lynda M.; Rybin, Denis V.; Sanna, Serena; Shi, Yuan; Stram, Daniel O.; Takeuchi, Fumihiko; Tan, Shu Pei; van der Most, Peter J.; van Vliet-Ostaptchouk, Jana V.; Wong, Andrew; Yengo, Loic; Zhao, Wanting; Goel, Anuj; Martinez Larrad, Maria Teresa; Radke, Dorte; Salo, Perttu; Tanaka, Toshiko; van Iperen, Erik P. A.; Abecasis, Goncalo; Afaq, Saima; Alizadeh, Behrooz Z.; Bertoni, Alain G.; Bonnefond, Amelie; Bottcher, Yvonne; Bottinger, Erwin P.; Campbell, Harry; Carlson, Olga D.; Chen, Chien-Hsiun; Cho, Yoon Shin; Garvey, W. Timothy; Gieger, Christian; Goodarzi, Mark O.; Grallert, Harald; Hamsten, Anders; Hartman, Catharina A.; Herder, Christian; Hsiung, Chao Agnes; Huang, Jie; Igase, Michiya; Isono, Masato; Katsuya, Tomohiro; Khor, Chiea-Chuen; Kiess, Wieland; Kohara, Katsuhiko; Kovacs, Peter; Lee, Juyoung; Lee, Wen-Jane; Lehne, Benjamin; Li, Huaixing; Liu, Jianjun; Lobbens, Stephane; Luan, Jian'an; Lyssenko, Valeriya; Meitinger, Thomas; Miki, Tetsuro; Miljkovic, Iva; Moon, Sanghoon; Mulas, Antonella; Muller, Gabriele; Muller-Nurasyid, Martina; Nagaraja, Ramaiah; Nauck, Matthias; Pankow, James S.; Polasek, Ozren; Prokopenko, Inga; Ramos, Paula S.; Rasmussen-Torvik, Laura; Rathmann, Wolfgang; Rich, Stephen S.; Robertson, Neil R.; Roden, Michael; Roussel, Ronan; Rudan, Igor; Scott, Robert A.; Scott, William R.; Sennblad, Bengt; Siscovick, David S.; Strauch, Konstantin; Sun, Liang; Swertz, Morris; Tajuddin, Salman M.; Taylor, Kent D.; teo, Yik-Ying; Tham, Yih Chung; Tonjes, Anke; Wareham, Nicholas J.; Willemsen, Gonneke; Wilsgaard, Tom; Hingorani, Aroon D.; Egan, Josephine; Ferrucci, Luigi; Hovingh, G. Kees; Jula, Antti; Kivimaki, Mika; Kumari, Meena; Njolstad, Inger; Palmer, Colin N. A.; Serrano Rios, Manuel; Stumvoll, Michael; Watkins, Hugh; Aung, Tin; Bluher, Matthias; Boehnke, Michael; Boomsma, Dorret I.; Bornstein, Stefan R.; Chambers, John C.; Chasman, Daniel I.; Chen, Yii-Der Ida; Chen, Yduan-Tsong; Cheng, Ching-Yu; Cucca, Francesco; de Geus, Eco J. C.; Deloukas, Panos; Evans, Michele K.; Fornage, Myriam; Friedlander, Yechiel; Froguel, Philippe; Groop, Leif; Gross, Myron D.; Harris, Tamara B.; Hayward, Caroline; Heng, Chew-Kiat; Ingelsson, Erik; Kato, Norihiro; Kim, Bong-Jo; Koh, Woon-Puay; Kooner, Jaspal S.; Korner, Antje; Kuh, Diana; Kuusisto, Johanna; Laakso, Markku; Lin, Xu; Liu, Yongmei; Loos, Ruth J. F.; Magnusson, Patrik K. E.; Marz, Winfried; McCarthy, Mark I.; Oldehinkel, Albertine J.; Ong, Ken K.; Pedersen, Nancy L.; Pereira, Mark A.; Peters, Annette; Ridker, Paul M.; Sabanayagam, Charumathi; Sale, Michele; Saleheen, Danish; Saltevo, Juha; Schwarz, Peter Eh; Sheu, Wayne H. H.; Snieder, Harold; Spector, Timothy D.; Tabara, Yasuharu; Tuomilehto, Jaakko; van Dam, Rob M.; Wilson, James G.; Wilson, James F.; Wolffenbuttel, Bruce H. R.; Wong, Tien Yin; Wu, Jer-Yuarn; Yuan, Jian-Min; Zonderman, Alan B.; Soranzo, Nicole; Guo, Xiuqing; Roberts, David J.; Florez, Jose C.; Sladek, Robert; Dupuis, Josee; Morris, Andrew P.; Tai, E.-Shyong; Selvin, Elizabeth; Rotter, Jerome I.; Langenberg, Claudia; Barroso, Ines; Meigs, James B.
Background Glycated hemoglobin (HbA1c) is used to diagnose type 2 diabetes (T2D) and assess glycemic control in patients with diabetes. Previous genome-wide association studies (GWAS) have identified 18 HbA1c-associated genetic variants. These variants proved to be classifiable by their likely
Wheeler, Eleanor; Leong, Aaron; Liu, Ching-Ti; Hivert, Marie-France; Strawbridge, Rona J; Podmore, Clara; Li, Man; Yao, Jie; Sim, Xueling; Hong, Jaeyoung; Chu, Audrey Y; Zhang, Weihua; Wang, Xu; Chen, Peng; Maruthur, Nisa M; Porneala, Bianca C; Sharp, Stephen J; Jia, Yucheng; Kabagambe, Edmond K; Chang, Li-Ching; Chen, Wei-Min; Elks, Cathy E; Evans, Daniel S; Fan, Qiao; Giulianini, Franco; Go, Min Jin; Hottenga, Jouke-Jan; Hu, Yao; Jackson, Anne U; Kanoni, Stavroula; Kim, Young Jin; Kleber, Marcus E; Ladenvall, Claes; Lecoeur, Cecile; Lim, Sing-Hui; Lu, Yingchang; Mahajan, Anubha; Marzi, Carola; Nalls, Mike A; Navarro, Pau; Nolte, Ilja M; Rose, Lynda M; Rybin, Denis V; Sanna, Serena; Shi, Yuan; Stram, Daniel O; Takeuchi, Fumihiko; Tan, Shu Pei; van der Most, Peter J; van Vliet-Ostaptchouk, Jana V; Wong, Andrew; Yengo, Loic; Zhao, Wanting; Goel, Anuj; Martinez Larrad, Maria Teresa; Radke, Dörte; Salo, Perttu; Tanaka, Toshiko; van Iperen, Erik P A; Abecasis, Goncalo; Afaq, Saima; Alizadeh, Behrooz Z; Bertoni, Alain G; Bonnefond, Amelie; Böttcher, Yvonne; Bottinger, Erwin P; Campbell, Harry; Carlson, Olga D; Chen, Chien-Hsiun; Cho, Yoon Shin; Garvey, W Timothy; Gieger, Christian; Goodarzi, Mark O; Grallert, Harald; Hamsten, Anders; Hartman, Catharina A; Herder, Christian; Hsiung, Chao Agnes; Huang, Jie; Igase, Michiya; Isono, Masato; Katsuya, Tomohiro; Khor, Chiea-Chuen; Kiess, Wieland; Kohara, Katsuhiko; Kovacs, Peter; Lee, Juyoung; Lee, Wen-Jane; Lehne, Benjamin; Li, Huaixing; Liu, Jianjun; Lobbens, Stephane; Luan, Jian'an; Lyssenko, Valeriya; Meitinger, Thomas; Miki, Tetsuro; Miljkovic, Iva; Moon, Sanghoon; Mulas, Antonella; Müller, Gabriele; Müller-Nurasyid, Martina; Nagaraja, Ramaiah; Nauck, Matthias; Pankow, James S; Polasek, Ozren; Prokopenko, Inga; Ramos, Paula S; Rasmussen-Torvik, Laura J; Rathmann, Wolfgang; Rich, Stephen S; Robertson, Neil R; Roden, Michael; Roussel, Ronan; Rudan, Igor; Scott, Robert A; Scott, William R; Sennblad, Bengt; Siscovick, David S; Strauch, Konstantin; Sun, Shan-Liang; Swertz, Morris a.; Tajuddin, Salman M; Taylor, Kent D; Teo, Yik-Ying; Tham, Yih Chung; Tönjes, Anke; Wareham, Nicholas J; Willemsen, Gonneke; Wilsgaard, Tom; Hingorani, Aroon D; Egan, Josephine; Ferrucci, Luigi; Hovingh, G. Kees; Jula, Antti; Kivimaki, Mika; Kumari, Meena; Njølstad, Inger; Palmer, Colin N A; Serrano Ríos, Manuel; Stumvoll, Michael; Watkins, Hugh; Aung, Tin; Blüher, Matthias; Boehnke, Michael; Boomsma, Dorret I; Bornstein, Stefan R; Chambers, John C; Chasman, Daniel I; Chen, Yii-Der Ida; Chen, Yduan-Tsong; Cheng, Ching-Yu; Cucca, Francesco; de Geus, Eco J C; Deloukas, Panos; Evans, Michele K; Fornage, Myriam; Friedlander, Yechiel; Froguel, Philippe; Groop, Leif; Gross, Myron D; Harris, Tamara B; Hayward, Caroline; Heng, Chew-Kiat; Ingelsson, Erik; Kato, Norihiro; Kim, Bong-Jo; Koh, Woon-Puay; Kooner, Jaspal S; Körner, Antje; Kuh, Diana; Kuusisto, Johanna; Laakso, Markku; Lin, Xu; Liu, YongMei; Loos, Ruth J F; Magnusson, Patrik K E; März, Winfried; McCarthy, Mark I; Oldehinkel, Albertine J; Ong, Ken K; Pedersen, Nancy L; Pereira, Mark A; Peters, Annette; Ridker, Paul M; Sabanayagam, Charumathi; Sale, Michele; Saleheen, Danish; Saltevo, Juha; Schwarz, Peter Eh; Sheu, Wayne H H; Snieder, Harold; Spector, Timothy D; Tabara, Yasuharu; Tuomilehto, Jaakko; van Dam, Rob M; Wilson, James G; Wilson, James F; Wolffenbuttel, Bruce H R; Wong, Tien Yin; Wu, Jer-Yuarn; Yuan, Jian-Min; Zonderman, Alan B; Soranzo, Nicole; Guo, Xiuqing; Roberts, David J; Florez, Jose C; Sladek, Robert; Dupuis, Josée; Morris, Andrew P; Tai, E Shyong; Selvin, Elizabeth; Rotter, Jerome I; Langenberg, Claudia; Barroso, Inês; Meigs, James B
BACKGROUND: Glycated hemoglobin (HbA1c) is used to diagnose type 2 diabetes (T2D) and assess glycemic control in patients with diabetes. Previous genome-wide association studies (GWAS) have identified 18 HbA1c-associated genetic variants. These variants proved to be classifiable by their likely
Elkon, Ran; Agami, Reuven
Genetic variants associated with common diseases are usually located in noncoding parts of the human genome. Delineation of the full repertoire of functional noncoding elements, together with efficient methods for probing their biological roles, is therefore of crucial importance. Over the past decade, DNA accessibility and various epigenetic modifications have been associated with regulatory functions. Mapping these features across the genome has enabled researchers to begin to document the full complement of putative regulatory elements. High-throughput reporter assays to probe the functions of regulatory regions have also been developed but these methods separate putative regulatory elements from the chromosome so that any effects of chromatin context and long-range regulatory interactions are lost. Definitive assignment of function(s) to putative cis-regulatory elements requires perturbation of these elements. Genome-editing technologies are now transforming our ability to perturb regulatory elements across entire genomes. Interpretation of high-throughput genetic screens that incorporate genome editors might enable the construction of an unbiased map of functional noncoding elements in the human genome.
Kergourlay, Gilles; Messaoudi, Soumaya; Dousset, Xavier; Prévost, Hervé
We report the draft genome sequence of Lactobacillus salivarius SMXD51, isolated from the cecum of healthy chickens showing an activity against Campylobacter--the food-borne pathogen that is the most common cause of gastroenteritis in the European Union (EU)--and potentially interesting features for a probiotic strain, explaining our interest in it.
Kulawik, Maciej; Pawełkowicz, Magdalena Ewa; Wojcieszek, Michał; PlÄ der, Wojciech; Nowak, Robert M.
Comparative genomic by increasing information about the genomes sequences available in the databases is a rapidly evolving science. A simple comparison of the general features of genomes such as genome size, number of genes, and chromosome number presents an entry point into comparative genomic analysis. Here we present the utility of the new tool genomecmp for finding rearrangements across the compared sequences and applications in plant comparative genomics.
Feature selection and reduction are key to robust multivariate analyses. In this talk I will focus on pros and cons of various variable selection methods and focus on those that are most relevant in the context of HEP.
National Oceanic and Atmospheric Administration, Department of Commerce — Collection includes a variety of solar feature datasets contributed by a number of national and private solar observatories located worldwide.
U.S. Environmental Protection Agency — This dataset consists of various site features from multiple Superfund sites in U.S. EPA Region 8. These data were acquired from multiple sources at different times...
Lack, Justin B; Lange, Jeremy D; Tang, Alison D; Corbett-Detig, Russell B; Pool, John E
The Drosophila Genome Nexus is a population genomic resource that provides D. melanogaster genomes from multiple sources. To facilitate comparisons across data sets, genomes are aligned using a common reference alignment pipeline which involves two rounds of mapping. Regions of residual heterozygosity, identity-by-descent, and recent population admixture are annotated to enable data filtering based on the user's needs. Here, we present a significant expansion of the Drosophila Genome Nexus, which brings the current data object to a total of 1,121 wild-derived genomes. New additions include 305 previously unpublished genomes from inbred lines representing six population samples in Egypt, Ethiopia, France, and South Africa, along with another 193 genomes added from recently-published data sets. We also provide an aligned D. simulans genome to facilitate divergence comparisons. This improved resource will broaden the range of population genomic questions that can addressed from multi-population allele frequencies and haplotypes in this model species. The larger set of genomes will also enhance the discovery of functionally relevant natural variation that exists within and between populations. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Ladd-Acosta, C; Hansen, K D; Briem, E; Fallin, M D; Kaufmann, W E; Feinberg, A P
Autism spectrum disorders (ASD) are increasingly common neurodevelopmental disorders defined clinically by a triad of features including impairment in social interaction, impairment in communication in social situations and restricted and repetitive patterns of behavior and interests, with considerable phenotypic heterogeneity among individuals. Although heritability estimates for ASD are high, conventional genetic-based efforts to identify genes involved in ASD have yielded only few reproducible candidate genes that account for only a small proportion of ASDs. There is mounting evidence to suggest environmental and epigenetic factors play a stronger role in the etiology of ASD than previously thought. To begin to understand the contribution of epigenetics to ASD, we have examined DNA methylation (DNAm) in a pilot study of postmortem brain tissue from 19 autism cases and 21 unrelated controls, among three brain regions including dorsolateral prefrontal cortex, temporal cortex and cerebellum. We measured over 485,000 CpG loci across a diverse set of functionally relevant genomic regions using the Infinium HumanMethylation450 BeadChip and identified four genome-wide significant differentially methylated regions (DMRs) using a bump hunting approach and a permutation-based multiple testing correction method. We replicated 3/4 DMRs identified in our genome-wide screen in a different set of samples and across different brain regions. The DMRs identified in this study represent suggestive evidence for commonly altered methylation sites in ASD and provide several promising new candidate genes.
Lincoln D Stein
Full Text Available The soil nematodes Caenorhabditis briggsae and Caenorhabditis elegans diverged from a common ancestor roughly 100 million years ago and yet are almost indistinguishable by eye. They have the same chromosome number and genome sizes, and they occupy the same ecological niche. To explore the basis for this striking conservation of structure and function, we have sequenced the C. briggsae genome to a high-quality draft stage and compared it to the finished C. elegans sequence. We predict approximately 19,500 protein-coding genes in the C. briggsae genome, roughly the same as in C. elegans. Of these, 12,200 have clear C. elegans orthologs, a further 6,500 have one or more clearly detectable C. elegans homologs, and approximately 800 C. briggsae genes have no detectable matches in C. elegans. Almost all of the noncoding RNAs (ncRNAs known are shared between the two species. The two genomes exhibit extensive colinearity, and the rate of divergence appears to be higher in the chromosomal arms than in the centers. Operons, a distinctive feature of C. elegans, are highly conserved in C. briggsae, with the arrangement of genes being preserved in 96% of cases. The difference in size between the C. briggsae (estimated at approximately 104 Mbp and C. elegans (100.3 Mbp genomes is almost entirely due to repetitive sequence, which accounts for 22.4% of the C. briggsae genome in contrast to 16.5% of the C. elegans genome. Few, if any, repeat families are shared, suggesting that most were acquired after the two species diverged or are undergoing rapid evolution. Coclustering the C. elegans and C. briggsae proteins reveals 2,169 protein families of two or more members. Most of these are shared between the two species, but some appear to be expanding or contracting, and there seem to be as many as several hundred novel C. briggsae gene families. The C. briggsae draft sequence will greatly improve the annotation of the C. elegans genome. Based on similarity to C
Bale, Tejus A; Abedalthagafi, Malak; Bi, Wenya Linda; Kang, Yun Jee; Merrill, Parker; Dunn, Ian F; Dubuc, Adrian; Charbonneau, Sarah K; Brown, Loreal; Ligon, Azra H; Ramkissoon, Shakti H; Ligon, Keith L
Astroblastomas are rare primary brain tumors, diagnosed based on histologic features. Not currently assigned a WHO grade, they typically display indolent behavior, with occasional variants taking a more aggressive course. We characterized the immunohistochemical characteristics, copy number (high-resolution array comparative genomic hybridization, OncoCopy) and mutational profile (targeted next-generation exome sequencing, OncoPanel) of a cohort of seven biopsies from four patients to identify recurrent genomic events that may help distinguish astroblastomas from other more common high-grade gliomas. We found that tumor histology was variable across patients and between primary and recurrent tumor samples. No common molecular features were identified among the four tumors. Mutations commonly observed in astrocytic tumors (IDH1/2, TP53, ATRX, and PTEN) or ependymoma were not identified. However one case with rapid clinical progression displayed mutations more commonly associated with GBM (NF1(N1054H/K63)*, PIK3CA(R38H) and ERG(A403T)). Conversely, another case, originally classified as glioblastoma with nine-year survival before recurrence, lacked a GBM mutational profile. Other mutations frequently seen in lower grade gliomas (BCOR, BCORL1, ERBB3, MYB, ATM) were also present in several tumors. Copy number changes were variable across tumors. Our findings indicate that astroblastomas have variable growth patterns and morphologic features, posing significant challenges to accurate classification in the absence of diagnostically specific copy number alterations and molecular features. Their histopathologic overlap with glioblastoma will likely confound the observation of long-term GBM "survivors". Further genomic profiling is needed to determine whether these tumors represent a distinct entity and to guide management strategies. Copyright © 2016 Elsevier Inc. All rights reserved.
Soares Siomar de Castro
Full Text Available Microbes are highly diverse and widely distributed organisms. They account for ~60% of Earth’s biomass and new predictions point for the existence of 1011 to 1012 species, which are constantly sharing genes through several different mechanisms. Genomic Islands (GI are critical in this context, as they are large regions acquired through horizontal gene transfer. Also, they present common features like genomic signature deviation, transposase genes, flanking tRNAs and insertion sequences. GIs carry large numbers of genes related to specific lifestyle and are commonly classified in Pathogenicity, Resistance, Metabolic or Symbiotic Islands. With the advent of the next-generation sequencing technologies and the deluge of genomic data, many software tools have been developed that aim to tackle the problem of GI prediction and they are all based on the prediction of GI common features. However, there is still room for the development of new software tools that implements new approaches, such as, machine learning and pangenomics based analyses. Finally, GIs will always hold a potential application in every newly invented genomic approach as they are directly responsible for much of the genomic plasticity of bacteria.
Soares, Siomar de Castro; Oliveira, Letícia de Castro; Jaiswal, Arun Kumar; Azevedo, Vasco
Microbes are highly diverse and widely distributed organisms. They account for ~60% of Earth's biomass and new predictions point for the existence of 1011 to 1012 species, which are constantly sharing genes through several different mechanisms. Genomic Islands (GI) are critical in this context, as they are large regions acquired through horizontal gene transfer. Also, they present common features like genomic signature deviation, transposase genes, flanking tRNAs and insertion sequences. GIs carry large numbers of genes related to specific lifestyle and are commonly classified in Pathogenicity, Resistance, Metabolic or Symbiotic Islands. With the advent of the next-generation sequencing technologies and the deluge of genomic data, many software tools have been developed that aim to tackle the problem of GI prediction and they are all based on the prediction of GI common features. However, there is still room for the development of new software tools that implements new approaches, such as, machine learning and pangenomics based analyses. Finally, GIs will always hold a potential application in every newly invented genomic approach as they are directly responsible for much of the genomic plasticity of bacteria.
Full Text Available This meeting report summarizes the proceedings of the “eGenomics: Cataloguing our Complete Genome Collection III” workshop held September 11–13, 2006, at the National Institute for Environmental eScience (NIEeS, Cambridge, United Kingdom. This 3rd workshop of the Genomic Standards Consortium was divided into two parts. The first half of the three-day workshop was dedicated to reviewing the genomic diversity of our current and future genome and metagenome collection, and exploring linkages to a series of existing projects through formal presentations. The second half was dedicated to strategic discussions. Outcomes of the workshop include a revised “Minimum Information about a Genome Sequence” (MIGS specification (v1.1, consensus on a variety of features to be added to the Genome Catalogue (GCat, agreement by several researchers to adopt MIGS for imminent genome publications, and an agreement by the EBI and NCBI to input their genome collections into GCat for the purpose of quantifying the amount of optional data already available (e.g., for geographic location coordinates and working towards a single, global list of all public genomes and metagenomes.
Juan, Liran; Liu, Yongzhuang; Wang, Yongtian; Teng, Mingxiang; Zang, Tianyi; Wang, Yadong
Families with inherited diseases are widely used in Mendelian/complex disease studies. Owing to the advances in high-throughput sequencing technologies, family genome sequencing becomes more and more prevalent. Visualizing family genomes can greatly facilitate human genetics studies and personalized medicine. However, due to the complex genetic relationships and high similarities among genomes of consanguineous family members, family genomes are difficult to be visualized in traditional genome visualization framework. How to visualize the family genome variants and their functions with integrated pedigree information remains a critical challenge. We developed the Family Genome Browser (FGB) to provide comprehensive analysis and visualization for family genomes. The FGB can visualize family genomes in both individual level and variant level effectively, through integrating genome data with pedigree information. Family genome analysis, including determination of parental origin of the variants, detection of de novo mutations, identification of potential recombination events and identical-by-decent segments, etc., can be performed flexibly. Diverse annotations for the family genome variants, such as dbSNP memberships, linkage disequilibriums, genes, variant effects, potential phenotypes, etc., are illustrated as well. Moreover, the FGB can automatically search de novo mutations and compound heterozygous variants for a selected individual, and guide investigators to find high-risk genes with flexible navigation options. These features enable users to investigate and understand family genomes intuitively and systematically. The FGB is available at http://mlg.hit.edu.cn/FGB/. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: firstname.lastname@example.org.
Kyrpides, Nikos; Anderson, Iain; Rodriguez, Jason; Susanti, Dwi; Porat, Iris; Reich, Claudia; Ulrich, Luke E.; Elkins, James G.; Mavromatis, Kostas; Lykidis, Athanasios; Kim, Edwin; Thompson, Linda S.; Nolan, Matt; Land, Miriam; Copeland, Alex; Lapidus, Alla; Lucas, Susan; Detter, Chris; Zhulin, Igor B.; Olsen, Gary J.; Whitman, William; Mukhopadhyay, Biswarup; Bristow, James; Kyrpides, Nikos
We report the complete genome of Thermofilum pendens, a deep-branching, hyperthermophilic member of the order Thermoproteales within the archaeal kingdom Crenarchaeota. T. pendens is a sulfur-dependent, anaerobic heterotroph isolated from a solfatara in Iceland. It is an extracellular commensal, requiring an extract of Thermoproteus tenax for growth, and the genome sequence reveals that biosynthetic pathways for purines, most amino acids, and most cofactors are absent. In fact T. pendens has fewer biosynthetic enzymes than obligate intracellular parasites, although it does not display other features common among obligate parasites and thus does not appear to be in the process of becoming a parasite. It appears that T. pendens has adapted to life in an environment rich in nutrients. T. pendens was known to utilize peptides as an energy source, but the genome reveals substantial ability to grow on carbohydrates. T. pendens is the first crenarchaeote and only the second archaeon found to have a transporter of the phosphotransferase system. In addition to fermentation, T. pendens may gain energy from sulfur reduction with hydrogen and formate as electron donors. It may also be capable of sulfur-independent growth on formate with formate hydrogenlyase. Additional novel features are the presence of a monomethylamine:corrinoid methyltransferase, the first time this enzyme has been found outside of Methanosarcinales, and a presenilin-related protein. Predicted highly expressed proteins do not include housekeeping genes, and instead include ABC transporters for carbohydrates and peptides, and CRISPR-associated proteins.
Full Text Available Abstract Background Gastropod mitochondrial genomes exhibit an unusually great variety of gene orders compared to other metazoan mitochondrial genome such as e.g those of vertebrates. Hence, gastropod mitochondrial genomes constitute a good model system to study patterns, rates, and mechanisms of mitochondrial genome rearrangement. However, this kind of evolutionary comparative analysis requires a robust phylogenetic framework of the group under study, which has been elusive so far for gastropods in spite of the efforts carried out during the last two decades. Here, we report the complete nucleotide sequence of five mitochondrial genomes of gastropods (Pyramidella dolabrata, Ascobulla fragilis, Siphonaria pectinata, Onchidella celtica, and Myosotella myosotis, and we analyze them together with another ten complete mitochondrial genomes of gastropods currently available in molecular databases in order to reconstruct the phylogenetic relationships among the main lineages of gastropods. Results Comparative analyses with other mollusk mitochondrial genomes allowed us to describe molecular features and general trends in the evolution of mitochondrial genome organization in gastropods. Phylogenetic reconstruction with commonly used methods of phylogenetic inference (ME, MP, ML, BI arrived at a single topology, which was used to reconstruct the evolution of mitochondrial gene rearrangements in the group. Conclusion Four main lineages were identified within gastropods: Caenogastropoda, Vetigastropoda, Patellogastropoda, and Heterobranchia. Caenogastropoda and Vetigastropoda are sister taxa, as well as, Patellogastropoda and Heterobranchia. This result rejects the validity of the derived clade Apogastropoda (Caenogastropoda + Heterobranchia. The position of Patellogastropoda remains unclear likely due to long-branch attraction biases. Within Heterobranchia, the most heterogeneous group of gastropods, neither Euthyneura (because of the inclusion of P
Full Text Available Introduction: Technological advancements rapidly propel the field of genome research. Advances in genetics and genomics such as the sequence of the human genome, the human haplotype map, open access databases, cheaper genotyping and chemical genomics, have transformed basic and translational biomedical research. Several projects in the field of genomic and personalized medicine have been conducted at the Center for Life Sciences in Nazarbayev University. The prioritized areas of research include: genomics of multifactorial diseases, cancer genomics, bioinformatics, genetics of infectious diseases and population genomics. At present, DNA-based risk assessment for common complex diseases, application of molecular signatures for cancer diagnosis and prognosis, genome-guided therapy, and dose selection of therapeutic drugs are the important issues in personalized medicine. Results: To further develop genomic and biomedical projects at Center for Life Sciences, the development of bioinformatics research and infrastructure and the establishment of new collaborations in the field are essential. Widespread use of genetic tools will allow the identification of diseases before the onset of clinical symptoms, the individualization of drug treatment, and could induce individual behavioral changes on the basis of calculated disease risk. However, many challenges remain for the successful translation of genomic knowledge and technologies into health advances, such as medicines and diagnostics. It is important to integrate research and education in the fields of genomics, personalized medicine, and bioinformatics, which will be possible with opening of the new Medical Faculty at Nazarbayev University. People in practice and training need to be educated about the key concepts of genomics and engaged so they can effectively apply their knowledge in a matter that will bring the era of genomic medicine to patient care. This requires the development of well
Standen, Ismo; Christensen, Ole Fredslund
Genomic data are used in animal breeding to assist genetic evaluation. Several models to estimate genomic breeding values have been studied. In general, two approaches have been used. One approach estimates the marker effects first and then, genomic breeding values are obtained by summing marker...... effects. In the second approach, genomic breeding values are estimated directly using an equivalent model with a genomic relationship matrix. Allele coding is the method chosen to assign values to the regression coefficients in the statistical model. A common allele coding is zero for the homozygous...... genotype of the first allele, one for the heterozygote, and two for the homozygous genotype for the other allele. Another common allele coding changes these regression coefficients by subtracting a value from each marker such that the mean of regression coefficients is zero within each marker. We call...
Steven W Cole
Full Text Available A growing literature in human social genomics has begun to analyze how everyday life circumstances influence human gene expression. Social-environmental conditions such as urbanity, low socioeconomic status, social isolation, social threat, and low or unstable social status have been found to associate with differential expression of hundreds of gene transcripts in leukocytes and diseased tissues such as metastatic cancers. In leukocytes, diverse types of social adversity evoke a common conserved transcriptional response to adversity (CTRA characterized by increased expression of proinflammatory genes and decreased expression of genes involved in innate antiviral responses and antibody synthesis. Mechanistic analyses have mapped the neural "social signal transduction" pathways that stimulate CTRA gene expression in response to social threat and may contribute to social gradients in health. Research has also begun to analyze the functional genomics of optimal health and thriving. Two emerging opportunities now stand to revolutionize our understanding of the everyday life of the human genome: network genomics analyses examining how systems-level capabilities emerge from groups of individual socially sensitive genomes and near-real-time transcriptional biofeedback to empirically optimize individual well-being in the context of the unique genetic, geographic, historical, developmental, and social contexts that jointly shape the transcriptional realization of our innate human genomic potential for thriving.
Cox, Veronica L; Saeed Bamashmos, Anas A; Foo, Wai Chin; Gupta, Shiva; Yedururi, Sireesha; Garg, Naveen; Kang, Hyunseon Christine
Lynch syndrome is the most common hereditary cancer syndrome, the most common cause of heritable colorectal cancer, and the only known heritable cause of endometrial cancer. Other cancers associated with Lynch syndrome include cancers of the ovary, stomach, urothelial tract, and small bowel, and less frequently, cancers of the brain, biliary tract, pancreas, and prostate. The oncogenic tendency of Lynch syndrome stems from a set of genomic alterations of mismatch repair proteins. Defunct mismatch repair proteins cause unusually high instability of regions of the genome called microsatellites. Over time, the accumulation of mutations in microsatellites and elsewhere in the genome can affect the production of important cellular proteins, spurring tumorigenesis. Universal testing of colorectal tumors for microsatellite instability (MSI) is now recommended to (a) prevent cases of Lynch syndrome being missed owing to the use of clinical criteria alone, (b) reduce morbidity and mortality among the relatives of affected individuals, and (c) guide management decisions. Organ-specific cancer risks and associated screening paradigms vary according to the sex of the affected individual and the type of germline DNA alteration causing the MSI. Furthermore, Lynch syndrome-associated cancers have different pathologic, radiologic, and clinical features compared with their sporadic counterparts. Most notably, Lynch syndrome-associated tumors tend to be more indolent than non-Lynch syndrome-associated neoplasms and thus may respond differently to traditional chemotherapy regimens. The high MSI in cases of colorectal cancer reflects a difference in the biologic features of the tumor, possibly with a unique susceptibility to immunotherapy. © RSNA, 2018.
Kennedy, C.H.; Fukushima, N.H.; Neft, R.E.; Lechner, J.F.
Alpha particle-emitting radon daughters constitute a risk for development of lung cancer in humans. The development of this disease involves multiple genetic alterations. These changes and the time course they follow are not yet defined despite numerous in vitro endeavors to transform human lung cells with various physical or chemical agents. However, genomic instability, characterized both by structural and numerical chromosomal aberrations and by elevated rates of point mutations, is a common feature of tumor cells. Further, both types of genomic instability have been reported in the noncancerous progeny of normal murine hemopoietic cells exposed in vitro to α-particles. The purpose of this investigation was to determine if genomic instability is also a prominent feature of normal human bronchial epithelial cells exposed to α-particle irradiation from the decay of inhaled radon daughters
Wurch, Louie; Giannone, Richard J; Belisle, Bernard S; Swift, Carolyn; Utturkar, Sagar; Hettich, Robert L; Reysenbach, Anna-Louise; Podar, Mircea
Biological features can be inferred, based on genomic data, for many microbial lineages that remain uncultured. However, cultivation is important for characterizing an organism's physiology and testing its genome-encoded potential. Here we use single-cell genomics to infer cultivation conditions for the isolation of an ectosymbiotic Nanoarchaeota ('Nanopusillus acidilobi') and its host (Acidilobus, a crenarchaeote) from a terrestrial geothermal environment. The cells of 'Nanopusillus' are among the smallest known cellular organisms (100-300 nm). They appear to have a complete genetic information processing machinery, but lack almost all primary biosynthetic functions as well as respiration and ATP synthesis. Genomic and proteomic comparison with its distant relative, the marine Nanoarchaeum equitans illustrate an ancient, common evolutionary history of adaptation of the Nanoarchaeota to ectosymbiosis, so far unique among the Archaea.
the cell nucleus (mitochondrial and chloroplast genomes), and. (3) traits governed ... tively good embryonic development but very poor development of membranes and ... Human homologies for the type of situation described above are naturally ..... imprint; (b) New modifications of the paternal genome in germ cells of each ...
Oers, van M.M.; Vlak, J.M.
Baculovirus genomes are covalently closed circles of double stranded-DNA varying in size between 80 and 180 kilobase-pair. The genomes of more than fourty-one baculoviruses have been sequenced to date. The majority of these (37) are pathogenic to lepidopteran hosts; three infect sawflies
... this database. Top of Page Evaluation of Genomic Applications in Practice and Prevention (EGAPP™) In 2004, the Centers for Disease Control and Prevention launched the EGAPP initiative to establish and test a ... and other applications of genomic technology that are in transition from ...
Hoelzel, A Rus
Ever since its invention, the polymerase chain reaction has been the method of choice for work with ancient DNA. In an application of modern genomic methods to material from the Pleistocene, a recent study has instead undertaken to clone and sequence a portion of the ancient genome of the cave bear.
Zoppi, Nicoletta; Chiarelli, Nicola; Binetti, Silvia; Ritelli, Marco; Colombi, Marina
Hypermobile Ehlers-Danlos syndrome (hEDS) is a heritable connective tissue disorder with unknown molecular basis mainly characterized by generalized joint hypermobility, joint instability complications, and minor skin changes. The phenotypic spectrum is broad and includes multiple associated symptoms shared with chronic inflammatory systemic diseases. The stricter criteria defined in the 2017 EDS nosology leave without an identity many individuals with symptomatic joint hypermobility and/or features of hEDS; for these patients, the term Hypermobility Spectrum Disorders (HSD) was introduced. We previously reported that in vitro cultured hEDS and HSD patients' skin fibroblasts show a disarray of several extracellular matrix (ECM) components and dysregulated expression of genes involved in connective tissue homeostasis and inflammatory/pain/immune responses. Herein, we report that hEDS and HSD skin fibroblasts exhibit in vitro a similar myofibroblast-like phenotype characterized by the organization of α-smooth muscle actin cytoskeleton, expression of OB-cadherin/cadherin-11, enhanced migratory capability associated with augmented levels of the ECM-degrading metalloproteinase-9, and altered expression of the inflammation mediators CCN1/CYR61 and CCN2/CTGF. We demonstrate that in hEDS and HSD cells this fibroblast-to-myofibroblast transition is triggered by a signal transduction pathway that involves αvβ3 integrin-ILK complexes, organized in focal adhesions, and the Snail1/Slug transcription factor, thus providing insights into the molecular mechanisms related to the pathophysiology of these protean disorders. The indistinguishable phenotype identified in hEDS and HSD cells resembles an inflammatory-like condition, which correlates well with the systemic phenotype of patients, and suggests that these multisystemic disorders might be part of a phenotypic continuum rather than representing distinct clinical entities. Copyright © 2018 Elsevier B.V. All rights reserved.
Adams, Mark D. (Case-Western Reserve University)
The availability of complete or nearly complete mouse, human, and rat genomes (in addition to those from many other species) has resulted in a series of new and powerful opportunities to apply the technologies and approaches developed for large-scale genome sequencing to the study of disease. New approaches to biological problems are being explored that involve concepts from computer science such as systems theory and modern large scale computing techniques. A recent project at Celera Genomics involved sequencing protein coding regions from several humans and a chimpanzee. Computational models of evolutionary divergence enabled us to identify genes with unique evolutionary signatures. These genes give us some insight into features that may be uniquely human. The laboratory mouse and rat have long been favorite mammalian models of human disease. Integrated approaches to the study of disease that combine genetics, DNA sequence analysis, and careful analysis of phenotype at a molecular level are becoming more common and powerful. In addition, evaluation of the variation inherent in normal populations is now being used to build networks to describe heart function based on the interaction of multiple phenotypes in randomized populations using a factorial design.
Genomic regions represent features such as gene annotations, transcription factor binding sites and epigenetic modifications. Performing various genomic operations such as identifying overlapping/non-overlapping regions or nearest gene annotations are common research needs. The data can be saved in a database system for easy management, however, there is no comprehensive database built-in algorithm at present to identify overlapping regions. Therefore I have developed a novel region-mapping (RegMap) SQL-based algorithm to perform genomic operations and have benchmarked the performance of different databases. Benchmarking identified that PostgreSQL extracts overlapping regions much faster than MySQL. Insertion and data uploads in PostgreSQL were also better, although general searching capability of both databases was almost equivalent. In addition, using the algorithm pair-wise, overlaps of >1000 datasets of transcription factor binding sites and histone marks, collected from previous publications, were reported and it was found that HNF4G significantly co-locates with cohesin subunit STAG1 (SA1).Inc. © 2015 Wiley Periodicals, Inc.
In this article I discuss studies towards understanding the structure and function of DNA in the context of genomes from the perspective of a chemist. The first area I describe concerns the studies that led to the invention and subsequent development of a method for sequencing DNA on a genome scale at high speed and low cost, now known as Solexa/Illumina sequencing. The second theme will feature the four-stranded DNA structure known as a G-quadruplex with a focus on its fundamental properties, its presence in cellular genomic DNA and the prospects for targeting such a structure in cels with small molecules. The final topic for discussion is naturally occurring chemically modified DNA bases with an emphasis on chemistry for decoding (or sequencing) such modifications in genomic DNA. The genome is a fruitful topic to be further elucidated by the creation and application of chemical approaches. Copyright © 2014 Elsevier Ltd. All rights reserved.
Baig, Abiyad; Colom, Joan; Barrow, Paul
We have performed microbiological and genomic characterization of an historic collection of nine bacteriophages, specifically infecting a K1 E. coli O18:K1:H7 ColV+ strain. These phages were isolated from sewage and tested for their efficacy in vivo for the treatment of systemic E. coli infection...... in a mouse infection model by Smith and Huggins (1982). The aim of the study was to identify common microbiological and genomic characteristics, which co-relate to the performance of these phages in in vivo study. These features will allow an informed selection of phages for use as therapeutic agents...
Full Text Available The insulin-like growth factor 2 receptor (IGF2R is essential for prenatal growth regulation and shows gene dosage effects on fetal weight that can be affected by in-vitro embryo culture. Imprinted maternal expression of murine Igf2r is well documented for all fetal tissues excluding brain, but polymorphic imprinting and biallelic expression were reported for IGF2R in human. These differences have been attributed to evolutionary changes correlated with specific reproductive strategies. However, data from species suitable for testing this hypothesis are lacking. The domestic cow (Bos taurus carries a single conceptus with a similar gestation length as human. We identified 12 heterozygous concepti informative for imprinting studies among 68 Bos taurus fetuses at Day 80 of gestation (28% term and found predominantly maternal IGF2R expression in all fetal tissues but brain, which escapes imprinting. Inter-individual variation in allelic expression bias, i.e. expression of the repressed paternal allele relative to the maternal allele, ranged from 4.6-8.9% in heart, 4.3-10.2% in kidney, 6.1-11.2% in liver, 4.6-15.8% in lung and 3.2-12.2% in skeletal muscle. Allelic bias for mesodermal tissues (heart, skeletal muscle differed significantly (P<0.05 from endodermal tissues (liver, lung. The placenta showed partial imprinting with allelic bias of 22.9-34.7% and differed significantly (P<0.001 from all other tissues. Four informative fetuses were generated by in-vitro fertilization (IVF with embryo culture and two individuals displayed fetal overgrowth. However, there was no evidence for changes in imprinting or DNA methylation after IVF, or correlations between allelic bias and fetal weight. In conclusion, imprinting of Bos taurus IGF2R is similar to mouse except in placenta, which could indicate an effect of reproductive strategy. Common minor inter-individual variation in allelic bias and absence of imprinting abnormalities in IVF fetuses suggest
Fu, Yong-Bi; Peterson, Gregory W; Dong, Yibo
Genotyping-by-sequencing (GBS) has emerged as a useful genomic approach for exploring genome-wide genetic variation. However, GBS commonly samples a genome unevenly and can generate a substantial amount of missing data. These technical features would limit the power of various GBS-based genetic and genomic analyses. Here we present software called IgCoverage for in silico evaluation of genomic coverage through GBS with an individual or pair of restriction enzymes on one sequenced genome, and report a new set of 21 restriction enzyme combinations that can be applied to enhance GBS applications. These enzyme combinations were developed through an application of IgCoverage on 22 plant, animal, and fungus species with sequenced genomes, and some of them were empirically evaluated with different runs of Illumina MiSeq sequencing in 12 plant species. The in silico analysis of 22 organisms revealed up to eight times more genome coverage for the new combinations consisted of pairing four- or five-cutter restriction enzymes than the commonly used enzyme combination PstI + MspI. The empirical evaluation of the new enzyme combination (HinfI + HpyCH4IV) in 12 plant species showed 1.7-6 times more genome coverage than PstI + MspI, and 2.3 times more genome coverage in dicots than monocots. Also, the SNP genotyping in 12 Arabidopsis and 12 rice plants revealed that HinfI + HpyCH4IV generated 7 and 1.3 times more SNPs (with 0-16.7% missing observations) than PstI + MspI, respectively. These findings demonstrate that these novel enzyme combinations can be utilized to increase genome sampling and improve SNP genotyping in various GBS applications. Copyright © 2016 Fu et al.
Full Text Available Algonquian languages are famous for their animacy-based grammatical properties—an animacy based noun classification system and direct/inverse system which gives rise to animacy hierarchy effects in the determination of verb agreement. In this paper I provide new evidence for the proposal that the distinctive properties of these languages is due to the use of participant-based features, rather than spatio-temporal ones, for both nominal and verbal functional categories (Ritter & Wiltschko 2009, 2014. Building on Wiltschko (2012, I develop a formal treatment of the Blackfoot aspectual system that assumes a category Inner Aspect (cf. MacDonald 2008, Travis 1991, 2010. Focusing on lexical aspect in Blackfoot, I demonstrate that the classification of both nouns (Seinsarten and verbs (Aktionsarten is based on animacy, rather than boundedness, resulting in a strikingly different aspectual system for both categories.
Bakker, Freek T.; Lei, Di; Yu, Jiaying
Herbarium genomics is proving promising as next-generation sequencing approaches are well suited to deal with the usually fragmented nature of archival DNA. We show that routine assembly of partial plastome sequences from herbarium specimens is feasible, from total DNA extracts and with specimens...... up to 146 years old. We use genome skimming and an automated assembly pipeline, Iterative Organelle Genome Assembly, that assembles paired-end reads into a series of candidate assemblies, the best one of which is selected based on likelihood estimation. We used 93 specimens from 12 different...... correlation between plastome coverage and nuclear genome size (C value) in our samples, but the range of C values included is limited. Finally, we conclude that routine plastome sequencing from herbarium specimens is feasible and cost-effective (compared with Sanger sequencing or plastome...
Ilia J. Leitch
Full Text Available Monocot genomic diversity includes striking variation at many levels. This paper compares various genomic characters (e.g., range of chromosome numbers and ploidy levels, occurrence of endopolyploidy, GC content, chromosome packaging and organization, genome size between monocots and the remaining angiosperms to discern just how distinctive monocot genomes are. One of the most notable features of monocots is their wide range and diversity of genome sizes, including the species with the largest genome so far reported in plants. This genomic character is analysed in greater detail, within a phylogenetic context. By surveying available genome size and chromosome data it is apparent that different monocot orders follow distinctive modes of genome size and chromosome evolution. Further insights into genome size-evolution and dynamics were obtained using statistical modelling approaches to reconstruct the ancestral genome size at key nodes across the monocot phylogenetic tree. Such approaches reveal that while the ancestral genome size of all monocots was small (1C=1.9 pg, there have been several major increases and decreases during monocot evolution. In addition, notable increases in the rates of genome size-evolution were found in Asparagales and Poales compared with other monocot lineages.
Wolf Yuri I
Full Text Available Abstract Background It is common belief that all cellular life forms on earth have a common origin. This view is supported by the universality of the genetic code and the universal conservation of multiple genes, particularly those that encode key components of the translation system. A remarkable recent study claims to provide a formal, homology independent test of the Universal Common Ancestry hypothesis by comparing the ability of a common-ancestry model and a multiple-ancestry model to predict sequences of universally conserved proteins. Results We devised a computational experiment on a concatenated alignment of universally conserved proteins which shows that the purported demonstration of the universal common ancestry is a trivial consequence of significant sequence similarity between the analyzed proteins. The nature and origin of this similarity are irrelevant for the prediction of "common ancestry" of by the model-comparison approach. Thus, homology (common origin of the compared proteins remains an inference from sequence similarity rather than an independent property demonstrated by the likelihood analysis. Conclusion A formal demonstration of the Universal Common Ancestry hypothesis has not been achieved and is unlikely to be feasible in principle. Nevertheless, the evidence in support of this hypothesis provided by comparative genomics is overwhelming. Reviewers this article was reviewed by William Martin, Ivan Iossifov (nominated by Andrey Rzhetsky and Arcady Mushegian. For the complete reviews, see the Reviewers' Report section.
Full Text Available Abstract Background Within Chordata, the subphyla Vertebrata and Cephalochordata (lancelets are characterized by a remarkable stability of the mitochondrial (mt genome, with constancy of gene content and almost invariant gene order, whereas the limited mitochondrial data on the subphylum Tunicata suggest frequent and extensive gene rearrangements, observed also within ascidians of the same genus. Results To confirm this evolutionary trend and to better understand the evolutionary dynamics of the mitochondrial genome in Tunicata Ascidiacea, we have sequenced and characterized the complete mt genome of two congeneric ascidian species, Phallusia mammillata and Phallusia fumigata (Phlebobranchiata, Ascidiidae. The two mtDNAs are surprisingly rearranged, both with respect to one another and relative to those of other tunicates and chordates, with gene rearrangements affecting both protein-coding and tRNA genes. The new data highlight the extraordinary variability of ascidian mt genome in base composition, tRNA secondary structure, tRNA gene content, and non-coding regions (number, size, sequence and location. Indeed, both Phallusia genomes lack the trnD gene, show loss/acquisition of DHU-arm in two tRNAs, and have a G+C content two-fold higher than other ascidians. Moreover, the mt genome of P. fumigata presents two identical copies of trnI, an extra tRNA gene with uncertain amino acid specificity, and four almost identical sequence regions. In addition, a truncated cytochrome b, lacking a C-terminal tail that commonly protrudes into the mt matrix, has been identified as a new mt feature probably shared by all tunicates. Conclusion The frequent occurrence of major gene order rearrangements in ascidians both at high taxonomic level and within the same genus makes this taxon an excellent model to study the mechanisms of gene rearrangement, and renders the mt genome an invaluable phylogenetic marker to investigate molecular biodiversity and speciation
Seo, Young-Su; Lim, Jae Yun; Park, Jungwook; Kim, Sunyoung; Lee, Hyun-Hee; Cheong, Hoon; Kim, Sang-Mok; Moon, Jae Sun; Hwang, Ingyu
In addition to human and animal diseases, bacteria of the genus Burkholderia can cause plant diseases. The representative species of rice-pathogenic Burkholderia are Burkholderia glumae, B. gladioli, and B. plantarii, which primarily cause grain rot, sheath rot, and seedling blight, respectively, resulting in severe reductions in rice production. Though Burkholderia rice pathogens cause problems in rice-growing countries, comprehensive studies of these rice-pathogenic species aiming to control Burkholderia-mediated diseases are only in the early stages. We first sequenced the complete genome of B. plantarii ATCC 43733T. Second, we conducted comparative analysis of the newly sequenced B. plantarii ATCC 43733T genome with eleven complete or draft genomes of B. glumae and B. gladioli strains. Furthermore, we compared the genome of three rice Burkholderia pathogens with those of other Burkholderia species such as those found in environmental habitats and those known as animal/human pathogens. These B. glumae, B. gladioli, and B. plantarii strains have unique genes involved in toxoflavin or tropolone toxin production and the clustered regularly interspaced short palindromic repeats (CRISPR)-mediated bacterial immune system. Although the genome of B. plantarii ATCC 43733T has many common features with those of B. glumae and B. gladioli, this B. plantarii strain has several unique features, including quorum sensing and CRISPR/CRISPR-associated protein (Cas) systems. The complete genome sequence of B. plantarii ATCC 43733T and publicly available genomes of B. glumae BGR1 and B. gladioli BSR3 enabled comprehensive comparative genome analyses among three rice-pathogenic Burkholderia species responsible for tissue rotting and seedling blight. Our results suggest that B. glumae has evolved rapidly, or has undergone rapid genome rearrangements or deletions, in response to the hosts. It also, clarifies the unique features of rice pathogenic Burkholderia species relative to other
Brown, Nathan M; Mueller, Ryan S; Shepardson, Jonathan W; Landry, Zachary C; Morré, Jeffrey T; Maier, Claudia S; Hardy, F Joan; Dreher, Theo W
Very few closed genomes of the cyanobacteria that commonly produce toxic blooms in lakes and reservoirs are available, limiting our understanding of the properties of these organisms. A new anatoxin-a-producing member of the Nostocaceae, Anabaena sp. WA102, was isolated from a freshwater lake in Washington State, USA, in 2013 and maintained in non-axenic culture. The Anabaena sp. WA102 5.7 Mbp genome assembly has been closed with long-read, single-molecule sequencing and separately a draft genome assembly has been produced with short-read sequencing technology. The closed and draft genome assemblies are compared, showing a correlation between long repeats in the genome and the many gaps in the short-read assembly. Anabaena sp. WA102 encodes anatoxin-a biosynthetic genes, as does its close relative Anabaena sp. AL93 (also introduced in this study). These strains are distinguished by differences in the genes for light-harvesting phycobilins, with Anabaena sp. AL93 possessing a phycoerythrocyanin operon. Biologically relevant structural variants in the Anabaena sp. WA102 genome were detected only by long-read sequencing: a tandem triplication of the anaBCD promoter region in the anatoxin-a synthase gene cluster (not triplicated in Anabaena sp. AL93) and a 5-kbp deletion variant present in two-thirds of the population. The genome has a large number of mobile elements (160). Strikingly, there was no synteny with the genome of its nearest fully assembled relative, Anabaena sp. 90. Structural and functional genome analyses indicate that Anabaena sp. WA102 has a flexible genome. Genome closure, which can be readily achieved with long-read sequencing, reveals large scale (e.g., gene order) and local structural features that should be considered in understanding genome evolution and function.
Full Text Available Abstract Background Information on the occurrence of sequence features in genomes is crucial to comparative genomics, evolutionary analysis, the analyses of regulatory sequences and the quantitative evaluation of sequences. Computing the frequencies and the occurrences of a pattern in complete genomes is time-consuming. Results The proposed database provides information about sequence features generated by exhaustively computing the sequences of the complete genome. The repetitive elements in the eukaryotic genomes, such as LINEs, SINEs, Alu and LTR, are obtained from Repbase. The database supports various complete genomes including human, yeast, worm, and 128 microbial genomes. Conclusions This investigation presents and implements an efficiently computational approach to accumulate the occurrences of the oligonucleotides or patterns in complete genomes. A database is established to maintain the information of the sequence features, including the distributions of oligonucleotide, the gene distribution, the distribution of repetitive elements in genomes and the occurrences of the oligonucleotides. The database can provide more effective and efficient way to access the repetitive features in genomes.
Le Poder, Sophie
A new human coronavirus responsible for severe acute respiratory syndrome (SARS) was identified in 2003, which raised concern about coronaviruses as agents of serious infectious disease. Nevertheless, coronaviruses have been known for about 50 years to be major agents of respiratory, enteric, or systemic infections of domestic and companion animals. Feline and canine coronaviruses are widespread amo