chemical genomics center: Topics by WorldWideScience.org

Sample records for chemical genomics center

Funding Opportunity: Genomic Data Centers

Science.gov (United States)

Funding Opportunity CCG, Funding Opportunity Center for Cancer Genomics, CCG, Center for Cancer Genomics, CCG RFA, Center for cancer genomics rfa, genomic data analysis network, genomic data analysis network centers,
DEFINING THE CHEMICAL SPACE OF PUBLIC GENOMIC ...

Science.gov (United States)

The current project aims to chemically index the genomics content of public genomic databases to make these data accessible in relation to other publicly available, chemically-indexed toxicological information. By defining the chemical space of public genomic data, it is possible to identify classes of chemicals on which to develop methodologies for the integration of chemogenomic data into predictive toxicology. The chemical space of public genomic data will be presented as well as the methodologies and tools developed to identify this chemical space.
Chemical biology on the genome.

Science.gov (United States)

Balasubramanian, Shankar

2014-08-15

In this article I discuss studies towards understanding the structure and function of DNA in the context of genomes from the perspective of a chemist. The first area I describe concerns the studies that led to the invention and subsequent development of a method for sequencing DNA on a genome scale at high speed and low cost, now known as Solexa/Illumina sequencing. The second theme will feature the four-stranded DNA structure known as a G-quadruplex with a focus on its fundamental properties, its presence in cellular genomic DNA and the prospects for targeting such a structure in cells with small molecules. The final topic for discussion is naturally occurring chemically modified DNA bases with an emphasis on chemistry for decoding (or sequencing) such modifications in genomic DNA. The genome is a fruitful topic to be further elucidated by the creation and application of chemical approaches. Copyright © 2014 Elsevier Ltd. All rights reserved.
Structure-based inference of molecular functions of proteins of unknown function from Berkeley Structural Genomics Center

Energy Technology Data Exchange (ETDEWEB)

Kim, Sung-Hou; Shin, Dong Hae; Hou, Jingtong; Chandonia, John-Marc; Das, Debanu; Choi, In-Geol; Kim, Rosalind

2007-09-02

Advances in sequence genomics have resulted in an accumulation of a huge number of protein sequences derived from genome sequences. However, the functions of a large portion of them cannot be inferred based on the current methods of sequence homology detection to proteins of known functions. Three-dimensional structure can have an important impact in providing inference of molecular function (physical and chemical function) of a protein of unknown function. Structural genomics centers worldwide have been determining many 3-D structures of the proteins of unknown functions, and possible molecular functions of them have been inferred based on their structures. Combined with bioinformatics and enzymatic assay tools, the successful acceleration of the process of protein structure determination through high throughput pipelines enables the rapid functional annotation of a large fraction of hypothetical proteins. We present a brief summary of the process we used at the Berkeley Structural Genomics Center to infer molecular functions of proteins of unknown function.
Chemical Security Analysis Center

Data.gov (United States)

Federal Laboratory Consortium — In 2006, by Presidential Directive, DHS established the Chemical Security Analysis Center (CSAC) to identify and assess chemical threats and vulnerabilities in the...
Genome Variation Map: a data repository of genome variations in BIG Data Center

OpenAIRE

Song, Shuhui; Tian, Dongmei; Li, Cuiping; Tang, Bixia; Dong, Lili; Xiao, Jingfa; Bao, Yiming; Zhao, Wenming; He, Hang; Zhang, Zhang

2017-01-01

The Genome Variation Map (GVM; http://bigd.big.ac.cn/gvm/) is a public data repository of genome variations. As a core resource in the BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, GVM is dedicated to collecting, integrating and visualizing genome variations for a wide range of species, accepts submissions of different types of genome variations from all over the world and provides free open access to all publicly available data in support of worldwide research a...
Genome Variation Map: a data repository of genome variations in BIG Data Center.

Science.gov (United States)

Song, Shuhui; Tian, Dongmei; Li, Cuiping; Tang, Bixia; Dong, Lili; Xiao, Jingfa; Bao, Yiming; Zhao, Wenming; He, Hang; Zhang, Zhang

2018-01-04

The Genome Variation Map (GVM; http://bigd.big.ac.cn/gvm/) is a public data repository of genome variations. As a core resource in the BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, GVM is dedicated to collecting, integrating and visualizing genome variations for a wide range of species, accepts submissions of different types of genome variations from all over the world and provides free open access to all publicly available data in support of worldwide research activities. Unlike existing related databases, GVM features integration of a large number of genome variations for a broad diversity of species including human, cultivated plants and domesticated animals. Specifically, the current implementation of GVM not only houses a total of ∼4.9 billion variants for 19 species including chicken, dog, goat, human, poplar, rice and tomato, but also incorporates 8669 individual genotypes and 13 262 manually curated high-quality genotype-to-phenotype associations for non-human species. In addition, GVM provides friendly, intuitive web interfaces for data submission, browsing, search and visualization. Collectively, GVM serves as an important resource for archiving genomic variation data, helpful for better understanding population genetic diversity and deciphering complex mechanisms associated with different phenotypes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genome Variation Map: a data repository of genome variations in BIG Data Center

Science.gov (United States)

Tian, Dongmei; Li, Cuiping; Tang, Bixia; Dong, Lili; Xiao, Jingfa; Bao, Yiming; Zhao, Wenming; He, Hang

2018-01-01

The Genome Variation Map (GVM; http://bigd.big.ac.cn/gvm/) is a public data repository of genome variations. As a core resource in the BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, GVM is dedicated to collecting, integrating and visualizing genome variations for a wide range of species, accepts submissions of different types of genome variations from all over the world and provides free open access to all publicly available data in support of worldwide research activities. Unlike existing related databases, GVM features integration of a large number of genome variations for a broad diversity of species including human, cultivated plants and domesticated animals. Specifically, the current implementation of GVM not only houses a total of ∼4.9 billion variants for 19 species including chicken, dog, goat, human, poplar, rice and tomato, but also incorporates 8669 individual genotypes and 13 262 manually curated high-quality genotype-to-phenotype associations for non-human species. In addition, GVM provides friendly, intuitive web interfaces for data submission, browsing, search and visualization. Collectively, GVM serves as an important resource for archiving genomic variation data, helpful for better understanding population genetic diversity and deciphering complex mechanisms associated with different phenotypes. PMID:29069473
Causes of genome instability: the effect of low dose chemical exposures in modern society

Science.gov (United States)

Langie, Sabine A.S.; Koppen, Gudrun; Desaulniers, Daniel; Al-Mulla, Fahd; Al-Temaimi, Rabeah; Amedei, Amedeo; Azqueta, Amaya; Bisson, William H.; Brown, Dustin; Brunborg, Gunnar; Charles, Amelia K.; Chen, Tao; Colacci, Annamaria; Darroudi, Firouz; Forte, Stefano; Gonzalez, Laetitia; Hamid, Roslida A.; Knudsen, Lisbeth E.; Leyns, Luc; Lopez de Cerain Salsamendi, Adela; Memeo, Lorenzo; Mondello, Chiara; Mothersill, Carmel; Olsen, Ann-Karin; Pavanello, Sofia; Raju, Jayadev; Rojas, Emilio; Roy, Rabindra; Ryan, Elizabeth; Ostrosky-Wegman, Patricia; Salem, Hosni K.; Scovassi, Ivana; Singh, Neetu; Vaccari, Monica; Van Schooten, Frederik J.; Valverde, Mahara; Woodrick, Jordan; Zhang, Luoping; van Larebeke, Nik; Kirsch-Volders, Micheline; Collins, Andrew R.

2015-01-01

Genome instability is a prerequisite for the development of cancer. It occurs when genome maintenance systems fail to safeguard the genome’s integrity, whether as a consequence of inherited defects or induced via exposure to environmental agents (chemicals, biological agents and radiation). Thus, genome instability can be defined as an enhanced tendency for the genome to acquire mutations, ranging from changes to the nucleotide sequence to chromosomal gain, rearrangements or loss. This review raises the hypothesis that, in addition to known human carcinogens, exposure to low doses of other chemicals present in our modern society could contribute to carcinogenesis by indirectly affecting genome stability. The selected chemicals, with the mechanisms of action by which they are proposed to indirectly contribute to genome instability, are: heavy metals (DNA repair, epigenetic modification, DNA damage signaling, telomere length), acrylamide (DNA repair, chromosome segregation), bisphenol A (epigenetic modification, DNA damage signaling, mitochondrial function, chromosome segregation), benomyl (chromosome segregation), quinones (epigenetic modification) and nano-sized particles (epigenetic pathways, mitochondrial function, chromosome segregation, telomere length). The purpose of this review is to describe the crucial aspects of genome instability, to outline the ways in which environmental chemicals can affect this cancer hallmark and to identify candidate chemicals for further study. The overall aim is to make scientists aware of the increasing need to unravel the underlying mechanisms via which chemicals at low doses can induce genome instability and thus promote carcinogenesis. PMID:26106144
Gene discovery by chemical mutagenesis and whole-genome sequencing in Dictyostelium.

Science.gov (United States)

Li, Cheng-Lin Frank; Santhanam, Balaji; Webb, Amanda Nicole; Zupan, Blaž; Shaulsky, Gad

2016-09-01

Whole-genome sequencing is a useful approach for identification of chemical-induced lesions, but previous applications involved tedious genetic mapping to pinpoint the causative mutations. We propose that saturation mutagenesis under low mutagenic loads, followed by whole-genome sequencing, should allow direct implication of genes by identifying multiple independent alleles of each relevant gene. We tested the hypothesis by performing three genetic screens with chemical mutagenesis in the social soil amoeba Dictyostelium discoideum. Through genome sequencing, we successfully identified mutant genes with multiple alleles in near-saturation screens, including resistance to intense illumination and strong suppressors of defects in an allorecognition pathway. We tested the causality of the mutations by comparison to published data and by direct complementation tests, finding both dominant and recessive causative mutations. Therefore, our strategy provides a cost- and time-efficient approach to gene discovery by integrating chemical mutagenesis and whole-genome sequencing. The method should be applicable to many microbial systems, and it is expected to revolutionize the field of functional genomics in Dictyostelium by greatly expanding the mutation spectrum relative to other common mutagenesis methods. © 2016 Li et al.; Published by Cold Spring Harbor Laboratory Press.
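The core idea in this abstract, implicating a gene when several independent alleles of it turn up across mutants, can be illustrated with a small sketch. This is not the authors' pipeline: the input format (tuples of mutant ID, gene name, lesion position), the function name and the toy data are all assumptions made for illustration, and distinct lesion positions are used as a simple stand-in for "independent alleles".

```python
from collections import defaultdict


def genes_with_multiple_alleles(mutations, min_alleles=2):
    """Return {gene: allele_count} for genes hit by several distinct lesions.

    `mutations` is an iterable of (mutant_id, gene, position) tuples.
    This record layout is hypothetical, not taken from the paper.
    """
    alleles = defaultdict(set)
    for _mutant_id, gene, position in mutations:
        # Identical positions are counted once: they may come from
        # sibling mutants rather than independent mutagenic events.
        alleles[gene].add(position)
    return {g: len(p) for g, p in alleles.items() if len(p) >= min_alleles}


# Toy variant calls (invented gene names and coordinates).
calls = [
    ("m1", "geneA", 101),
    ("m2", "geneA", 250),
    ("m3", "geneA", 101),  # same lesion as m1, counted once
    ("m3", "geneB", 77),
    ("m4", "geneC", 12),
    ("m5", "geneA", 400),
]
result = genes_with_multiple_alleles(calls)
print(result)
```

In this toy input only geneA carries more than one distinct lesion, so it is the only candidate returned; a real analysis would also need to account for gene length and background mutation rate.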
Chemical rationale for selection of isolates for genome sequencing

DEFF Research Database (Denmark)

Rank, Christian; Larsen, Thomas Ostenfeld; Frisvad, Jens Christian

The advances in gene sequencing will in the near future enable researchers to affordably acquire the full genomes of handpicked isolates. We here present a method to evaluate the chemical potential of an entire species and select representatives for genome sequencing. The selection criteria for new strains to be sequenced can be manifold, but for studying the functional phenotype, a metabolome-based approach offers a cheap and rapid assessment of critical strains to cover the chemical diversity. We have applied this methodology to the complex A. flavus/A. oryzae group. Though these two species are in principle identical, they represent two different phenotypes. This is clearly presented through a correspondence analysis of selected extrolites, in which the subtle chemical differences are visually dispersed. The results point to a handful of strains, which, if sequenced, will likely enhance our...
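Choosing a handful of strains that together cover the chemical diversity of a group can be framed, in simplified form, as a coverage problem. The sketch below uses a greedy set-cover heuristic, which is a swapped-in technique for illustration and not the correspondence analysis the abstract describes. Strain names, compound labels and the function name are hypothetical.

```python
def select_strains(profiles, max_strains):
    """Greedily pick strains whose metabolite sets add the most new compounds.

    `profiles` maps a strain name to the set of compounds detected in it
    (hypothetical presence/absence data).
    """
    covered = set()
    chosen = []
    remaining = dict(profiles)
    while remaining and len(chosen) < max_strains:
        # Iterate in sorted order so ties are broken deterministically.
        best = max(sorted(remaining), key=lambda s: len(remaining[s] - covered))
        gain = remaining[best] - covered
        if not gain:
            break  # no remaining strain adds a new compound
        chosen.append(best)
        covered |= gain
        del remaining[best]
    return chosen, covered


# Invented presence/absence profiles for four strains.
profiles = {
    "strain_1": {"c1", "c2", "c3"},
    "strain_2": {"c2", "c3"},
    "strain_3": {"c4"},
    "strain_4": {"c1", "c4", "c5"},
}
chosen, covered = select_strains(profiles, max_strains=2)
print(chosen, sorted(covered))
```

With these toy profiles two strains suffice to cover all five compounds. A greedy pick is not guaranteed to be optimal in general, which is one reason an ordination method may be preferred in practice.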
DMS-Seq for In Vivo Genome-wide Mapping of Protein-DNA Interactions and Nucleosome Centers.

Science.gov (United States)

Umeyama, Taichi; Ito, Takashi

2017-10-03

Protein-DNA interactions provide the basis for chromatin structure and gene regulation. Comprehensive identification of protein-occupied sites is thus vital to an in-depth understanding of genome function. Dimethyl sulfate (DMS) is a chemical probe that has long been used to detect footprints of DNA-bound proteins in vitro and in vivo. Here, we describe a genomic footprinting method, dimethyl sulfate sequencing (DMS-seq), which exploits the cell-permeable nature of DMS to obviate the need for nuclear isolation. This feature makes DMS-seq simple in practice and removes the potential risk of protein re-localization during nuclear isolation. DMS-seq successfully detects transcription factors bound to cis-regulatory elements and non-canonical chromatin particles in nucleosome-free regions. Furthermore, an unexpected preference of DMS confers on DMS-seq a unique potential to directly detect nucleosome centers without using genetic manipulation. We expect that DMS-seq will serve as a characteristic method for genome-wide interrogation of in vivo protein-DNA interactions. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Chemical genomic guided engineering of gamma-valerolactone tolerant yeast.

Science.gov (United States)

Bottoms, Scott; Dickinson, Quinn; McGee, Mick; Hinchman, Li; Higbee, Alan; Hebert, Alex; Serate, Jose; Xie, Dan; Zhang, Yaoping; Coon, Joshua J; Myers, Chad L; Landick, Robert; Piotrowski, Jeff S

2018-01-12

Gamma valerolactone (GVL) treatment of lignocellulosic biomass is a promising technology for degradation of biomass for biofuel production; however, GVL is toxic to fermentative microbes. Using a combination of chemical genomics with the yeast (Saccharomyces cerevisiae) deletion collection to identify sensitive and resistant mutants, and chemical proteomics to monitor protein abundance in the presence of GVL, we sought to understand the mechanisms of toxicity and resistance to GVL with the goal of engineering a GVL-tolerant, xylose-fermenting yeast. Chemical genomic profiling of GVL predicted that this chemical affects membranes and membrane-bound processes. We show that GVL causes rapid, dose-dependent cell permeability, and is synergistic with ethanol. Chemical genomic profiling of GVL revealed that deletion of the functionally related enzymes Pad1p and Fdc1p, which act together to decarboxylate cinnamic acid and its derivatives to vinyl forms, increases yeast tolerance to GVL. Further, overexpression of Pad1p sensitizes cells to GVL toxicity. To improve GVL tolerance, we deleted PAD1 and FDC1 in a xylose-fermenting yeast strain. The modified strain exhibited increased anaerobic growth, sugar utilization, and ethanol production in synthetic hydrolysate with 1.5% GVL, and under other conditions. Chemical proteomic profiling of the engineered strain revealed that enzymes involved in ergosterol biosynthesis were more abundant in the presence of GVL compared to the background strain. The engineered GVL strain contained greater amounts of ergosterol than the background strain. We found that GVL exerts toxicity to yeast by compromising cellular membranes, and that this toxicity is synergistic with ethanol. Deletion of PAD1 and FDC1 conferred GVL resistance to a xylose-fermenting yeast strain by increasing ergosterol accumulation in aerobically grown cells. The GVL-tolerant strain fermented sugars in the presence of GVL levels that were inhibitory to the unmodified strain.
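Pooled deletion-collection screens of this kind are commonly scored by comparing each strain's relative abundance with and without the chemical. The sketch below shows one generic way to do that with a log2 ratio of normalized counts. It is an assumption-laden illustration, not the scoring used in the study: the strain labels, counts, pseudocount and threshold are all hypothetical.

```python
import math


def fitness_scores(control, treated, pseudocount=1.0):
    """log2(treated / control) of each strain's share of total reads."""
    c_total = sum(control.values())
    t_total = sum(treated.values())
    scores = {}
    for strain, c_count in control.items():
        c = (c_count + pseudocount) / c_total
        t = (treated.get(strain, 0) + pseudocount) / t_total
        scores[strain] = math.log2(t / c)
    return scores


def classify(scores, threshold=1.0):
    """Split strains into resistant (enriched) and sensitive (depleted)."""
    resistant = sorted(s for s, v in scores.items() if v >= threshold)
    sensitive = sorted(s for s, v in scores.items() if v <= -threshold)
    return resistant, sensitive


# Invented barcode counts for four deletion strains.
control = {"del_A": 1000, "del_B": 1000, "del_C": 1000, "del_D": 1000}
treated = {"del_A": 2400, "del_B": 100, "del_C": 800, "del_D": 700}

scores = fitness_scores(control, treated)
resistant, sensitive = classify(scores)
print(resistant, sensitive)
```

Here del_A is enriched under treatment and del_B is depleted, while the other two fall inside the arbitrary two-fold band. Real analyses typically add replicates and a statistical test rather than a fixed cutoff.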
Genomics:GTL Bioenergy Research Centers White Paper

Energy Technology Data Exchange (ETDEWEB)

Mansfield, Betty Kay [ORNL; Alton, Anita Jean [ORNL; Andrews, Shirley H [ORNL; Bownas, Jennifer Lynn [ORNL; Casey, Denise [ORNL; Martin, Sheryl A [ORNL; Mills, Marissa [ORNL; Nylander, Kim [ORNL; Wyrick, Judy M [ORNL; Drell, Dr. Daniel [Office of Science, Department of Energy; Weatherwax, Sharlene [U.S. Department of Energy; Carruthers, Julie [U.S. Department of Energy

2006-08-01

In his Advanced Energy Initiative announced in January 2006, President George W. Bush committed the nation to new efforts to develop alternative sources of energy to replace imported oil and fossil fuels. Developing cost-effective and energy-efficient methods of producing renewable alternative fuels such as cellulosic ethanol from biomass and solar-derived biofuels will require transformational breakthroughs in science and technology. Incremental improvements in current bioenergy production methods will not suffice. The Genomics:GTL Bioenergy Research Centers will be dedicated to fundamental research on microbe and plant systems with the goal of developing knowledge that will advance biotechnology-based strategies for biofuels production. The aim is to spur substantial progress toward cost-effective production of biologically based renewable energy sources. This document describes the rationale for the establishment of the centers and their objectives in light of the U.S. Department of Energy's mission and goals. Developing energy-efficient and cost-effective methods of producing alternative fuels such as cellulosic ethanol from biomass will require transformational breakthroughs in science and technology. Incremental improvements in current bioenergy-production methods will not suffice. The focus on microbes (for cellular mechanisms) and plants (for source biomass) fundamentally exploits capabilities well known to exist in the microbial world. Thus 'proof of concept' is not required, but considerable basic research into these capabilities remains an urgent priority. Several developments have converged in recent years to suggest that systems biology research into microbes and plants promises solutions that will overcome critical roadblocks on the path to cost-effective, large-scale production of cellulosic ethanol and other renewable energy from biomass. The ability to rapidly sequence the DNA of any organism is a critical part of these new
CoryneCenter – An online resource for the integrated analysis of corynebacterial genome and transcriptome data

Directory of Open Access Journals (Sweden)

Hüser Andrea T

2007-11-01

Background: The introduction of high-throughput genome sequencing and post-genome analysis technologies, e.g. DNA microarray approaches, has created the potential to unravel and scrutinize complex gene-regulatory networks on a large scale. The discovery of transcriptional regulatory interactions has become a major topic in modern functional genomics. Results: To facilitate the analysis of gene-regulatory networks, we have developed CoryneCenter, a web-based resource for the systematic integration and analysis of genome, transcriptome, and gene regulatory information for prokaryotes, especially corynebacteria. For this purpose, we extended and combined the following systems into a common platform: (1) GenDB, an open source genome annotation system, (2) EMMA, a MAGE compliant application for high-throughput transcriptome data storage and analysis, and (3) CoryneRegNet, an ontology-based data warehouse designed to facilitate the reconstruction and analysis of gene regulatory interactions. We demonstrate the potential of CoryneCenter by means of an application example. Using microarray hybridization data, we compare the gene expression of Corynebacterium glutamicum under acetate and glucose feeding conditions: known regulatory networks are confirmed, and CoryneCenter additionally points out further regulatory interactions. Conclusion: CoryneCenter provides more than the sum of its parts. Its novel analysis and visualization features significantly simplify the process of obtaining new biological insights into complex regulatory systems. Although the platform currently focusses on corynebacteria, the integrated tools are by no means restricted to these species, and the presented approach offers a general strategy for the analysis and verification of gene regulatory networks. CoryneCenter provides freely accessible projects with the underlying genome annotation, gene expression, and gene regulation data. The system is publicly available at http://www.CoryneCenter.de.
Genomics in Public Health: Perspective from the Office of Public Health Genomics at the Centers for Disease Control and Prevention (CDC)

Directory of Open Access Journals (Sweden)

Ridgely Fisk Green

2015-09-01

The national effort to use genomic knowledge to save lives is gaining momentum, as illustrated by the inclusion of genomics in key public health initiatives, including Healthy People 2020, and the recent launch of the precision medicine initiative. The Office of Public Health Genomics (OPHG) at the Centers for Disease Control and Prevention (CDC) partners with state public health departments and others to advance the translation of genome-based discoveries into disease prevention and population health. To do this, OPHG has adopted an “identify, inform, and integrate” model: identify evidence-based genomic applications ready for implementation, inform stakeholders about these applications, and integrate these applications into public health at the local, state, and national level. This paper addresses current and future work at OPHG for integrating genomics into public health programs.
Genomics in Public Health: Perspective from the Office of Public Health Genomics at the Centers for Disease Control and Prevention (CDC).

Science.gov (United States)

Green, Ridgely Fisk; Dotson, W David; Bowen, Scott; Kolor, Katherine; Khoury, Muin J

2015-01-01

The national effort to use genomic knowledge to save lives is gaining momentum, as illustrated by the inclusion of genomics in key public health initiatives, including Healthy People 2020, and the recent launch of the precision medicine initiative. The Office of Public Health Genomics (OPHG) at the Centers for Disease Control and Prevention (CDC) partners with state public health departments and others to advance the translation of genome-based discoveries into disease prevention and population health. To do this, OPHG has adopted an "identify, inform, and integrate" model: identify evidence-based genomic applications ready for implementation, inform stakeholders about these applications, and integrate these applications into public health at the local, state, and national level. This paper addresses current and future work at OPHG for integrating genomics into public health programs.
Chemical and Biophysical Modulation of Cas9 for Tunable Genome Engineering.

Science.gov (United States)

Nuñez, James K; Harrington, Lucas B; Doudna, Jennifer A

2016-03-18

The application of the CRISPR-Cas9 system for genome engineering has revolutionized the ability to interrogate genomes of mammalian cells. Programming the Cas9 endonuclease to induce DNA breaks at specified sites is achieved by simply modifying the sequence of its cognate guide RNA. Although Cas9-mediated genome editing has been shown to be highly specific, cleavage events at off-target sites have also been reported. Minimizing, and eventually abolishing, unwanted off-target cleavage remains a major goal of the CRISPR-Cas9 technology before its implementation for therapeutic use. Recent efforts have turned to chemical biology and biophysical approaches to engineer inducible genome editing systems for controlling Cas9 activity at the transcriptional and protein levels. Here, we review recent advancements to modulate Cas9-mediated genome editing by engineering split-Cas9 constructs, inteins, small molecules, protein-based dimerizing domains, and light-inducible systems.
Using Genome Sequence to Enable the Design of Medicines and Chemical Probes.

Science.gov (United States)

Angelbello, Alicia J; Chen, Jonathan L; Childs-Disney, Jessica L; Zhang, Peiyuan; Wang, Zi-Fu; Disney, Matthew D

2018-02-28

Rapid progress in genome sequencing technology has put us firmly into a postgenomic era. A key challenge in biomedical research is harnessing genome sequence to fulfill the promise of personalized medicine. This Review describes how genome sequencing has enabled the identification of disease-causing biomolecules and how these data have been converted into chemical probes of function, preclinical lead modalities, and ultimately U.S. Food and Drug Administration (FDA)-approved drugs. In particular, we focus on the use of oligonucleotide-based modalities to target disease-causing RNAs; small molecules that target DNA, RNA, or protein; the rational repurposing of known therapeutic modalities; and the advantages of pharmacogenetics. Lastly, we discuss the remaining challenges and opportunities in the direct utilization of genome sequence to enable design of medicines.
Combining chemical genomics screens in yeast to reveal spectrum of effects of chemical inhibition of sphingolipid biosynthesis

Directory of Open Access Journals (Sweden)

Giaever Guri

2009-01-01

Background: Single genome-wide screens for the effect of altered gene dosage on drug sensitivity in the model organism Saccharomyces cerevisiae provide only a partial picture of the mechanism of action of a drug. Results: Using the example of the tumor cell invasion inhibitor dihydromotuporamine C, we show that a more complete picture of drug action can be obtained by combining different chemical genomics approaches – analysis of the sensitivity of ρ0 cells lacking mitochondrial DNA, drug-induced haploinsufficiency, suppression of drug sensitivity by gene overexpression and chemical-genetic synthetic lethality screening using strains deleted of nonessential genes. Killing of yeast by this chemical requires a functional mitochondrial electron-transport chain and cytochrome c heme lyase function. However, we find that it does not require genes associated with programmed cell death in yeast. The chemical also inhibits endocytosis and intracellular vesicle trafficking and interferes with vacuolar acidification in yeast and in human cancer cells. These effects can all be ascribed to inhibition of sphingolipid biosynthesis by dihydromotuporamine C. Conclusion: Despite their similar conceptual basis, namely altering drug sensitivity by modifying gene dosage, each of the screening approaches provided a distinct set of information that, when integrated, revealed a more complete picture of the mechanism of action of a drug on cells.

Genomic research perspectives in Kazakhstan

Directory of Open Access Journals (Sweden)

Ainur Akilzhanova

2014-01-01

Introduction: Technological advancements rapidly propel the field of genome research. Advances in genetics and genomics such as the sequence of the human genome, the human haplotype map, open access databases, cheaper genotyping and chemical genomics, have transformed basic and translational biomedical research. Several projects in the field of genomic and personalized medicine have been conducted at the Center for Life Sciences in Nazarbayev University. The prioritized areas of research include: genomics of multifactorial diseases, cancer genomics, bioinformatics, genetics of infectious diseases and population genomics. At present, DNA-based risk assessment for common complex diseases, application of molecular signatures for cancer diagnosis and prognosis, genome-guided therapy, and dose selection of therapeutic drugs are the important issues in personalized medicine. Results: To further develop genomic and biomedical projects at the Center for Life Sciences, the development of bioinformatics research and infrastructure and the establishment of new collaborations in the field are essential. Widespread use of genetic tools will allow the identification of diseases before the onset of clinical symptoms, the individualization of drug treatment, and could induce individual behavioral changes on the basis of calculated disease risk. However, many challenges remain for the successful translation of genomic knowledge and technologies into health advances, such as medicines and diagnostics. It is important to integrate research and education in the fields of genomics, personalized medicine, and bioinformatics, which will be possible with the opening of the new Medical Faculty at Nazarbayev University. People in practice and training need to be educated about the key concepts of genomics and engaged so they can effectively apply their knowledge in a manner that will bring the era of genomic medicine to patient care. This requires the development of well
Molecular Mechanisms Underlying Genomic Instability in Brca-Deficient Cells

Science.gov (United States)

2014-11-01

increased by hydroxyurea, ATR inhibition, deregulated c-Myc expression and by PARPi treatment of BRCA1 deficient cells. This work was recently published...Genome Stability." 6: May 27, 2013-Collaborative Research Center 655 from Cells to Tissues seminar series at the Max-Planck-Institute in Dresden, Germany ...Eisenach, Germany -“Genome Stability during DNA Replication” 8: May 3, 2013- Chemical and Systems Biology Department Seminar Series at Stanford
Genome-wide Escherichia coli stress response and improved tolerance towards industrially relevant chemicals

DEFF Research Database (Denmark)

Rau, Martin Holm; Calero Valdayo, Patricia; Lennen, Rebecca

2016-01-01

Economically viable biobased production of bulk chemicals and biofuels typically requires high product titers. During microbial bioconversion this often leads to product toxicity, and tolerance is therefore a critical element in the engineering of production strains. Here, a systems biology approach was employed to understand the chemical stress response of Escherichia coli, including a genome-wide screen for mutants with increased fitness during chemical stress. Twelve chemicals with significant production potential were selected, consisting of organic solvent-like chemicals (butanol, hydroxy-γ-butyrolactone, 1,4-butanediol, furfural), organic acids (acetate, itaconic acid, levulinic acid, succinic acid), amino acids (serine, threonine) and membrane-intercalating chemicals (decanoic acid, geraniol). The transcriptional response towards these chemicals revealed large overlaps...
A plant-based chemical genomics screen for the identification of flowering inducers.

Science.gov (United States)

Fiers, Martijn; Hoogenboom, Jorin; Brunazzi, Alice; Wennekes, Tom; Angenent, Gerco C; Immink, Richard G H

2017-01-01

Floral timing is a carefully regulated process, in which the plant determines the optimal moment to switch from the vegetative to reproductive phase. While there are numerous genes known that control flowering time, little information is available on chemical compounds that are able to influence this process. We aimed to discover novel compounds that are able to induce flowering in the model plant Arabidopsis. For this purpose we developed a plant-based screening platform that can be used in a chemical genomics study. Here we describe the set-up of the screening platform and various issues and pitfalls that need to be addressed in order to perform a chemical genomics screening on Arabidopsis plantlets. We describe the choice of a molecular marker, in combination with a reporter that is active in plants and is sufficiently sensitive for detection. In this particular screen, the firefly luciferase marker was used, fused to the regulatory sequences of the floral meristem identity gene APETALA1 (AP1), which is an early marker for flowering. Using this screening platform almost 9000 compounds were screened, in triplicate, in 96-well plates at a concentration of 25 µM. One of the identified potential flowering-inducing compounds was studied in more detail and named Flowering1 (F1). F1 turned out to be an analogue of the plant hormone salicylic acid (SA) and appeared to be more potent than SA in the induction of flowering. The effect could be confirmed by watering Arabidopsis plants with SA or F1, in which F1 gave a significant reduction in time to flowering in comparison to SA treatment or the control. In this study a chemical genomics screening platform was developed to discover compounds that can induce flowering in Arabidopsis. This platform was used successfully to identify a compound that can speed up flowering in Arabidopsis.
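To give a sense of the scale implied by screening almost 9000 compounds in triplicate in 96-well plates, the arithmetic can be sketched as follows. The abstract does not state the plate layout, so the number of wells reserved for controls is a hypothetical parameter, and 9000 is used as a round upper bound for "almost 9000".

```python
import math


def plates_needed(n_compounds, replicates, wells_per_plate, control_wells=0):
    """Minimum number of plates, given wells reserved for controls per plate.

    `control_wells` is an assumed layout parameter, not from the abstract.
    """
    usable = wells_per_plate - control_wells
    if usable <= 0:
        raise ValueError("no wells left for test compounds")
    total_wells = n_compounds * replicates
    return math.ceil(total_wells / usable)


# Every well used for a compound (lower bound on plate count).
print(plates_needed(9000, 3, 96))
# Assuming 8 wells per plate are set aside for controls (hypothetical).
print(plates_needed(9000, 3, 96, control_wells=8))
```

Under these assumptions the screen needs 27,000 test wells, which works out to 282 plates with no controls and 307 plates if eight wells per plate are reserved.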
Conservation and divergence of chemical defense system in the tunicate Oikopleura dioica revealed by genome wide response to two xenobiotics

Directory of Open Access Journals (Sweden)

Yadetie Fekadu

2012-02-01

Background: Animals have developed extensive mechanisms of response to xenobiotic chemical attacks. Although recent genome surveys have suggested a broad conservation of the chemical defensome across metazoans, global gene expression responses to xenobiotics have not been well investigated in most invertebrates. Here, we performed a genome survey for key defensome genes in the Oikopleura dioica genome and explored genome-wide gene expression, using high-density tiling arrays with over 2 million probes, in response to two model xenobiotic chemicals: the carcinogenic polycyclic aromatic hydrocarbon benzo[a]pyrene (BaP) and the pharmaceutical compound clofibrate (Clo). Results: Oikopleura genome surveys for key genes of the chemical defensome suggested a reduced repertoire. Not more than 23 cytochrome P450 (CYP) genes could be identified, and neither CYP1 family genes nor their transcriptional activator AhR was detected. These two genes were present in deuterostome ancestors. As in vertebrates, the genotoxic compound BaP induced xenobiotic biotransformation and oxidative stress responsive genes. Notable exceptions were genes of the aryl hydrocarbon receptor (AhR) signaling pathway. Clo also affected the expression of many biotransformation genes and markedly repressed genes involved in energy metabolism and muscle contraction pathways. Conclusions: Oikopleura has the smallest number of CYP genes among sequenced animal genomes and lacks the AhR signaling pathway. However, it appears to have basic xenobiotic inducible biotransformation genes such as a conserved genotoxic stress response gene set. Our genome survey and expression study do not support a role of the AhR signaling pathway in the chemical defense of metazoans prior to the emergence of vertebrates.
Chemical Emulation of Radiation Pinning Center Geometries in High Temperature Superconductors

National Research Council Canada - National Science Library

Weinstein, Roy

2004-01-01

...). Discovery of an entire class of 200-400 nm size, double perovskite pinning centers, (A,B)REBa(sub 2)O(sub 6), led to 20 new chemical "point" pinning centers, and enabled replacement of successful but expensive and radioactive...
Genomic Data Commons and Genomic Cloud Pilots - Google Hangout

Science.gov (United States)

Join us for a live, moderated discussion about two NCI efforts to expand access to cancer genomics data: the Genomic Data Commons and Genomic Cloud Pilots. NCI subject matter experts will include Louis M. Staudt, M.D., Ph.D., Director, Center for Cancer Genomics, and Warren Kibbe, Ph.D., Director, NCI Center for Biomedical Informatics and Information Technology. The discussion will be moderated by Anthony Kerlavage, Ph.D., Chief, Cancer Informatics Branch, Center for Biomedical Informatics and Information Technology. We welcome your questions before and during the Hangout on Twitter using the hashtag #AskNCI.
U.S. Department of Energy's Genomics: GTL Bioenergy Research Centers White Paper

Energy Technology Data Exchange (ETDEWEB)

none,

2006-08-01

The Genomics:GTL Bioenergy Research Centers will be dedicated to fundamental research on microbe and plant systems with the goal of developing knowledge that will advance biotechnology-based strategies for biofuels production. The aim is to spur substantial progress toward cost-effective production of biologically based renewable energy sources. This document describes the rationale for the establishment of the centers and their objectives in light of the U.S. Department of Energy’s mission and goals.
Use of Modern Chemical Protein Synthesis and Advanced Fluorescent Assay Techniques to Experimentally Validate the Functional Annotation of Microbial Genomes

Energy Technology Data Exchange (ETDEWEB)

Kent, Stephen [University of Chicago]

2012-07-20

The objective of this research program was to prototype methods for the chemical synthesis of predicted protein molecules in annotated microbial genomes. High throughput chemical methods were to be used to make large numbers of predicted proteins and protein domains, based on microbial genome sequences. Microscale chemical synthesis methods for the parallel preparation of peptide-thioester building blocks were developed; these peptide segments are used for the parallel chemical synthesis of proteins and protein domains. Ultimately, it is envisaged that these synthetic molecules would be ‘printed’ in spatially addressable arrays. The unique ability of total synthesis to precision label protein molecules with dyes and with chemical or biochemical ‘tags’ can be used to facilitate novel assay technologies adapted from state-of-the art single molecule fluorescence detection techniques. In the future, in conjunction with modern laboratory automation this integrated set of techniques will enable high throughput experimental validation of the functional annotation of microbial genomes.
Genomic analysis of thermophilic Bacillus coagulans strains: efficient producers for platform bio-chemicals.

Science.gov (United States)

Su, Fei; Xu, Ping

2014-01-29

Microbial strains with high substrate efficiency and excellent environmental tolerance are urgently needed for the production of platform bio-chemicals. Bacillus coagulans has these merits; however, little genetic information is available about this species. Here, we determined the genome sequences of five B. coagulans strains, and used a comparative genomic approach to reconstruct the central carbon metabolism of this species to explain their fermentation features. A novel xylose isomerase in the xylose utilization pathway was identified in these strains. Based on a genome-wide positive selection scan, the selection pressure on amino acid metabolism may have played a significant role in the thermal adaptation. We also researched the immune systems of B. coagulans strains, which provide them with acquired resistance to phages and mobile genetic elements. Our genomic analysis provides comprehensive insights into the genetic characteristics of B. coagulans and paves the way for improving and extending the uses of this species.
Center for Integrated Nanotechnologies (CINT) Chemical Release Modeling Evaluation

Energy Technology Data Exchange (ETDEWEB)

Stirrup, Timothy Scott [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)]

2016-12-20

This evaluation documents the methodology and results of chemical release modeling for operations at Building 518, Center for Integrated Nanotechnologies (CINT) Core Facility. This evaluation is intended to supplement an update to the CINT [Standalone] Hazards Analysis (SHA). This evaluation also updates the original [Design] Hazards Analysis (DHA) completed in 2003 during the design and construction of the facility; since the original DHA, additional toxic materials have been evaluated and modeled to confirm the continued low hazard classification of the CINT facility and operations. This evaluation addresses the potential catastrophic release of the current inventory of toxic chemicals at Building 518 based on a standard query in the Chemical Information System (CIS).
Target Selection and Deselection at the Berkeley Structural Genomics Center

Energy Technology Data Exchange (ETDEWEB)

Chandonia, John-Marc; Kim, Sung-Hou; Brenner, Steven E.

2005-03-22

At the Berkeley Structural Genomics Center (BSGC), our goal is to obtain a near-complete structural complement of proteins in the minimal organisms Mycoplasma genitalium and M. pneumoniae, two closely related pathogens. Current targets for structure determination have been selected in six major stages, starting with those predicted to be most tractable to high throughput study and likely to yield new structural information. We report on the process used to select these proteins, as well as our target deselection procedure. Target deselection reduces experimental effort by eliminating targets similar to those recently solved by the structural biology community or other centers. We measure the impact of the 69 structures solved at the BSGC as of July 2004 on structure prediction coverage of the M. pneumoniae and M. genitalium proteomes. The number of Mycoplasma proteins for which the fold could first be reliably assigned based on structures solved at the BSGC (24 M. pneumoniae and 21 M. genitalium) is approximately 25 percent of the total resulting from work at all structural genomics centers and the worldwide structural biology community (94 M. pneumoniae and 86 M. genitalium) during the same period. As the number of structures contributed by the BSGC during that period is less than 1 percent of the total worldwide output, the benefits of a focused target selection strategy are apparent. If the structures of all current targets were solved, the percentage of M. pneumoniae proteins for which folds could be reliably assigned would increase from approximately 57 percent (391 of 687) at present to around 80 percent (550 of 687), and the percentage of the proteome that could be accurately modeled would increase from around 37 percent (254 of 687) to about 64 percent (438 of 687). In M. genitalium, the percentage of the proteome that could be structurally annotated based on structures of our remaining targets would rise from 72 percent (348 of 486) to around 76 percent (371 of 486), with the
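The coverage percentages quoted in the record above can be checked with simple arithmetic. The following is a minimal sketch and is not part of the original record: only the counts come from the abstract, while the dictionary labels and the `percent` helper are illustrative names chosen here.

```python
# Counts quoted in the BSGC abstract: (proteins covered, proteome size).
# The labels and the helper name are illustrative, not from the source.
coverage = {
    "M. pneumoniae, folds assigned (present)": (391, 687),
    "M. pneumoniae, folds assigned (all targets solved)": (550, 687),
    "M. pneumoniae, accurately modeled (present)": (254, 687),
    "M. pneumoniae, accurately modeled (all targets solved)": (438, 687),
    "M. genitalium, structurally annotated (present)": (348, 486),
    "M. genitalium, structurally annotated (remaining targets solved)": (371, 486),
}


def percent(part, whole):
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


for label, (part, whole) in coverage.items():
    print(f"{label}: {percent(part, whole):.1f}%")

# Share of first-time fold assignments attributable to BSGC structures,
# relative to all centers plus the wider structural biology community.
bsgc_share = {
    "M. pneumoniae": percent(24, 94),
    "M. genitalium": percent(21, 86),
}
for species, share in bsgc_share.items():
    print(f"BSGC share of new fold assignments, {species}: {share:.1f}%")
```

Rounded to whole numbers, each ratio matches the percentage stated in the abstract, and both BSGC shares fall close to the quoted "approximately 25 percent".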
BYSTANDER EFFECTS, GENOMIC INSTABILITY, ADAPTIVE RESPONSE AND CANCER RISK ASSESSMENT FOR RADIATION AND CHEMICAL EXPOSURES

Science.gov (United States)

BYSTANDER EFFECTS, GENOMIC INSTABILITY, ADAPTIVE RESPONSE AND CANCER RISK ASSESSMENT FOR RADIATION AND CHEMICAL EXPOSURES. R. Julian Preston, Environmental Carcinogenesis Division, U.S. Environmental Protection Agency, Research Triangle Park, N.C. 27711, USA. There ...
Gene design, cloning and protein-expression methods for high-value targets at the Seattle Structural Genomics Center for Infectious Disease

International Nuclear Information System (INIS)

Raymond, Amy; Haffner, Taryn; Ng, Nathan; Lorimer, Don; Staker, Bart; Stewart, Lance

2011-01-01

An overview of one salvage strategy for high-value SSGCID targets is given. Any structural genomics endeavor, particularly ambitious ones such as the NIAID-funded Seattle Structural Genomics Center for Infectious Disease (SSGCID) and Center for Structural Genomics of Infectious Disease (CSGID), faces technical challenges at all points of the production pipeline. One salvage strategy employed by SSGCID is combined gene engineering and structure-guided construct design to overcome challenges at the levels of protein expression and protein crystallization. Multiple constructs of each target are cloned in parallel using Polymerase Incomplete Primer Extension cloning, and small-scale expressions of these are rapidly analyzed by capillary electrophoresis. Using the methods reported here, which have proven particularly useful for high-value targets, otherwise intractable targets can be resolved.
Causes of genome instability

DEFF Research Database (Denmark)

Langie, Sabine A S; Koppen, Gudrun; Desaulniers, Daniel

2015-01-01

Genome instability is a prerequisite for the development of cancer. It occurs when genome maintenance systems fail to safeguard the genome's integrity, whether as a consequence of inherited defects or induced via exposure to environmental agents (chemicals, biological agents and radiation). Thus... function, chromosome segregation, telomere length). The purpose of this review is to describe the crucial aspects of genome instability, to outline the ways in which environmental chemicals can affect this cancer hallmark and to identify candidate chemicals for further study. The overall aim is to make...
Unexplored therapeutic opportunities in the human genome.

Science.gov (United States)

Oprea, Tudor I; Bologa, Cristian G; Brunak, Søren; Campbell, Allen; Gan, Gregory N; Gaulton, Anna; Gomez, Shawn M; Guha, Rajarshi; Hersey, Anne; Holmes, Jayme; Jadhav, Ajit; Jensen, Lars Juhl; Johnson, Gary L; Karlson, Anneli; Leach, Andrew R; Ma'ayan, Avi; Malovannaya, Anna; Mani, Subramani; Mathias, Stephen L; McManus, Michael T; Meehan, Terrence F; von Mering, Christian; Muthas, Daniel; Nguyen, Dac-Trung; Overington, John P; Papadatos, George; Qin, Jun; Reich, Christian; Roth, Bryan L; Schürer, Stephan C; Simeonov, Anton; Sklar, Larry A; Southall, Noel; Tomita, Susumu; Tudose, Ilinca; Ursu, Oleg; Vidovic, Dušica; Waller, Anna; Westergaard, David; Yang, Jeremy J; Zahoránszky-Köhalmi, Gergely

2018-05-01

A large proportion of biomedical research and the development of therapeutics is focused on a small fraction of the human genome. In a strategic effort to map the knowledge gaps around proteins encoded by the human genome and to promote the exploration of currently understudied, but potentially druggable, proteins, the US National Institutes of Health launched the Illuminating the Druggable Genome (IDG) initiative in 2014. In this article, we discuss how the systematic collection and processing of a wide array of genomic, proteomic, chemical and disease-related resource data by the IDG Knowledge Management Center have enabled the development of evidence-based criteria for tracking the target development level (TDL) of human proteins, which indicates a substantial knowledge deficit for approximately one out of three proteins in the human proteome. We then present spotlights on the TDL categories as well as key drug target classes, including G protein-coupled receptors, protein kinases and ion channels, which illustrate the nature of the unexplored opportunities for biomedical research and therapeutic development.
Use of comparative genomics approaches to characterize interspecies differences in response to environmental chemicals: Challenges, opportunities, and research needs

International Nuclear Information System (INIS)

Burgess-Herbert, Sarah L.; Euling, Susan Y.

2013-01-01

A critical challenge for environmental chemical risk assessment is the characterization and reduction of uncertainties introduced when extrapolating inferences from one species to another. The purpose of this article is to explore the challenges, opportunities, and research needs surrounding the issue of how genomics data and computational and systems level approaches can be applied to inform differences in response to environmental chemical exposure across species. We propose that the data, tools, and evolutionary framework of comparative genomics be adapted to inform interspecies differences in chemical mechanisms of action. We compare and contrast existing approaches, from disciplines as varied as evolutionary biology, systems biology, mathematics, and computer science, that can be used, modified, and combined in new ways to discover and characterize interspecies differences in chemical mechanism of action which, in turn, can be explored for application to risk assessment. We consider how genetic, protein, pathway, and network information can be interrogated from an evolutionary biology perspective to effectively characterize variations in biological processes of toxicological relevance among organisms. We conclude that comparative genomics approaches show promise for characterizing interspecies differences in mechanisms of action, and further, for improving our understanding of the uncertainties inherent in extrapolating inferences across species in both ecological and human health risk assessment. To achieve long-term relevance and consistent use in environmental chemical risk assessment, improved bioinformatics tools, computational methods robust to data gaps, and quantitative approaches for conducting extrapolations across species are critically needed. Specific areas ripe for research to address these needs are recommended
Life Sciences Division and Center for Human Genome Studies 1994

Energy Technology Data Exchange (ETDEWEB)

Cram, L.S.; Stafford, C. [comp.]

1995-09-01

This report summarizes the research and development activities of the Los Alamos National Laboratory's Life Sciences Division and the biological aspects of the Center for Human Genome Studies for the calendar year 1994. The technical portion of the report is divided into two parts: (1) selected research highlights and (2) research projects and accomplishments. The research highlights provide a more detailed description of a select set of projects. A technical description of all projects is presented in sufficient detail so that the informed reader will be able to assess the scope and significance of each project. Summaries useful to the casual reader desiring general information have been prepared by the group leaders and appear in each group overview. Investigators on the staff of the Life Sciences Division will be pleased to provide further information.
Genomic Testing

Science.gov (United States)

... this database. Evaluation of Genomic Applications in Practice and Prevention (EGAPP™): In 2004, the Centers for Disease Control and Prevention launched the EGAPP initiative to establish and test a ... and other applications of genomic technology that are in transition from ...
Population-Based in Vitro Hazard and Concentration–Response Assessment of Chemicals: The 1000 Genomes High-Throughput Screening Study

Science.gov (United States)

Abdo, Nour; Xia, Menghang; Brown, Chad C.; Kosyk, Oksana; Huang, Ruili; Sakamuru, Srilatha; Zhou, Yi-Hui; Jack, John R.; Gallins, Paul; Xia, Kai; Li, Yun; Chiu, Weihsueh A.; Motsinger-Reif, Alison A.; Austin, Christopher P.; Tice, Raymond R.

2015-01-01

Background: Understanding of human variation in toxicity to environmental chemicals remains limited, so human health risk assessments still largely rely on a generic 10-fold factor (10^(1/2) each for toxicokinetics and toxicodynamics) to account for sensitive individuals or subpopulations. Objectives: We tested a hypothesis that population-wide in vitro cytotoxicity screening can rapidly inform both the magnitude of and molecular causes for interindividual toxicodynamic variability. Methods: We used 1,086 lymphoblastoid cell lines from the 1000 Genomes Project, representing nine populations from five continents, to assess variation in cytotoxic response to 179 chemicals. Analysis included assessments of population variation and heritability, and genome-wide association mapping, with attention to phenotypic relevance to human exposures. Results: For about half the tested compounds, cytotoxic response in the 1% most “sensitive” individual occurred at concentrations within a factor of 10^(1/2) (i.e., approximately 3) of that in the median individual; however, for some compounds, this factor was > 10. Genetic mapping suggested important roles for variation in membrane and transmembrane genes, with a number of chemicals showing association with SNP rs13120371 in the solute carrier SLC7A11, previously implicated in chemoresistance. Conclusions: This experimental approach fills critical gaps unaddressed by recent large-scale toxicity testing programs, providing quantitative, experimentally based estimates of human toxicodynamic variability, and also testable hypotheses about mechanisms contributing to interindividual variation. Citation: Abdo N, Xia M, Brown CC, Kosyk O, Huang R, Sakamuru S, Zhou YH, Jack JR, Gallins P, Xia K, Li Y, Chiu WA, Motsinger-Reif AA, Austin CP, Tice RR, Rusyn I, Wright FA. 2015. Population-based in vitro hazard and concentration–response assessment of chemicals: the 1000 Genomes high-throughput screening study. Environ Health Perspect 123:458
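The abstract above splits the generic 10-fold factor into two equal parts, one for toxicokinetics and one for toxicodynamics, each equal to the square root of 10. A minimal sketch of that arithmetic follows; the variable names are illustrative and not taken from the study.

```python
import math

# Generic 10-fold factor, split evenly (multiplicatively) between
# toxicokinetics and toxicodynamics as described in the abstract.
total_factor = 10.0
half_log_factor = math.sqrt(total_factor)  # 10 raised to the power 1/2

# Each half is roughly 3.16, which the abstract rounds to "approximately 3".
print(f"10^(1/2) = {half_log_factor:.2f}")

# Multiplying the two halves recovers the full 10-fold factor.
print(f"product of the two halves = {half_log_factor * half_log_factor:.1f}")
```

The split is multiplicative rather than additive, which is why each component is about 3.16 and not 5.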

Protein structure similarity clustering (PSSC) and natural product structure as inspiration sources for drug development and chemical genomics

NARCIS (Netherlands)

Dekker, Frank J; Koch, Marcus A; Waldmann, Herbert; Dekker, Frans

Finding small molecules that modulate protein function is of primary importance in drug development and in the emerging field of chemical genomics. To facilitate the identification of such molecules, we developed a novel strategy making use of structural conservatism found in protein domain
Genomic Data Commons launches

Science.gov (United States)

The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data between researchers, launched today with a visit from Vice President Joe Biden to the operations center at the University of Chicago.
Aye-aye population genomic analyses highlight an important center of endemism in northern Madagascar.

Science.gov (United States)

Perry, George H; Louis, Edward E; Ratan, Aakrosh; Bedoya-Reina, Oscar C; Burhans, Richard C; Lei, Runhua; Johnson, Steig E; Schuster, Stephan C; Miller, Webb

2013-04-09

We performed a population genomics study of the aye-aye, a highly specialized nocturnal lemur from Madagascar. Aye-ayes have low population densities and extensive range requirements that could make this flagship species particularly susceptible to extinction. Therefore, knowledge of genetic diversity and differentiation among aye-aye populations is critical for conservation planning. Such information may also advance our general understanding of Malagasy biogeography, as aye-ayes have the largest species distribution of any lemur. We generated and analyzed whole-genome sequence data for 12 aye-ayes from three regions of Madagascar (North, West, and East). We found that the North population is genetically distinct, with strong differentiation from other aye-ayes over relatively short geographic distances. For comparison, the average FST value between the North and East aye-aye populations--separated by only 248 km--is over 2.1-times greater than that observed between human Africans and Europeans. This finding is consistent with prior watershed- and climate-based hypotheses of a center of endemism in northern Madagascar. Taken together, these results suggest a strong and long-term biogeographical barrier to gene flow. Thus, the specific attention that should be directed toward preserving large, contiguous aye-aye habitats in northern Madagascar may also benefit the conservation of other distinct taxonomic units. To help facilitate future ecological- and conservation-motivated population genomic analyses by noncomputational biologists, the analytical toolkit used in this study is available on the Galaxy Web site.
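The FST statistic cited in the record above measures how much of the total genetic variation lies between populations. The abstract does not say which estimator the authors used, so the following is only a sketch of the textbook single-locus definition, FST = (H_T - H_S) / H_T, for two equally weighted populations; the function names and the allele frequencies are hypothetical.

```python
def heterozygosity(p):
    """Expected heterozygosity 2p(1-p) at a biallelic locus."""
    return 2.0 * p * (1.0 - p)


def fst_two_populations(p1, p2):
    """Wright's FST = (H_T - H_S) / H_T for two equally weighted populations.

    p1 and p2 are the frequencies of the same allele in each population.
    This is the textbook definition, not necessarily the study's estimator.
    """
    # Mean within-population heterozygosity.
    h_s = (heterozygosity(p1) + heterozygosity(p2)) / 2.0
    # Heterozygosity of the pooled population.
    p_bar = (p1 + p2) / 2.0
    h_t = heterozygosity(p_bar)
    if h_t == 0.0:
        return 0.0  # locus fixed for the same allele in both populations
    return (h_t - h_s) / h_t


# Hypothetical allele frequencies, chosen only for illustration.
print(fst_two_populations(0.5, 0.5))  # identical frequencies: no differentiation
print(fst_two_populations(0.2, 0.8))  # diverged frequencies: FST well above zero
```

Genome-wide analyses typically combine many loci and correct for sample size, so values from this sketch are not comparable with those reported in the study.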
Quantitative Genome-Wide Analysis of Yeast Deletion Strain Sensitivities to Oxidative and Chemical Stress

Directory of Open Access Journals (Sweden)

Stanley Fields

2006-03-01

Understanding the actions of drugs and toxins in a cell is of critical importance to medicine, yet many of the molecular events involved in chemical resistance are relatively uncharacterized. In order to identify the cellular processes and pathways targeted by chemicals, we took advantage of the haploid Saccharomyces cerevisiae deletion strains (Winzeler et al., 1999). Although ~4800 of the strains are viable, the loss of a gene in a pathway affected by a drug can lead to a synthetic lethal effect in which the combination of a deletion and a normally sublethal dose of a chemical results in loss of viability. We carried out genome-wide screens to determine quantitative sensitivities of the deletion set to four chemicals: hydrogen peroxide, menadione, ibuprofen and mefloquine. Hydrogen peroxide and menadione induce oxidative stress in the cell, whereas ibuprofen and mefloquine are toxic to yeast by unknown mechanisms. Here we report the sensitivities of 659 deletion strains that are sensitive to one or more of these four compounds, including 163 multichemical-sensitive strains, 394 strains specific to hydrogen peroxide and/or menadione, 47 specific to ibuprofen and 55 specific to mefloquine. We correlate these results with data from other large-scale studies to yield novel insights into cellular function.
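The four strain categories reported in the record above add up to the stated total of 659 sensitive strains. A minimal sketch of that check follows; the category labels are paraphrases, and the comparison against the viable set is approximate because the abstract gives that number only as ~4800.

```python
# Strain counts quoted in the abstract; the labels are paraphrases.
sensitive_strains = {
    "multichemical-sensitive": 163,
    "specific to hydrogen peroxide and/or menadione": 394,
    "specific to ibuprofen": 47,
    "specific to mefloquine": 55,
}
reported_total = 659
approx_viable_strains = 4800  # the abstract says "~4800", so this is approximate

# The categories should partition the reported set of sensitive strains.
category_sum = sum(sensitive_strains.values())
print(f"sum of categories: {category_sum} (reported: {reported_total})")

# Rough fraction of the viable deletion set that is sensitive to any compound.
fraction_sensitive = reported_total / approx_viable_strains
print(f"approximate fraction of viable strains: {fraction_sensitive:.1%}")
```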
Penicillium arizonense, a new, genome sequenced fungal species, reveals a high chemical diversity in secreted metabolites

DEFF Research Database (Denmark)

Grijseels, Sietske; Nielsen, Jens Christian; Randelovic, Milica

2016-01-01

A new soil-borne species belonging to the Penicillium section Canescentia is described, Penicillium arizonense sp. nov. (type strain CBS 141311T = IBT 12289T). The genome was sequenced and assembled into 33.7 Mb containing 12,502 predicted genes. A phylogenetic assessment based on marker genes...... confirmed the grouping of P. arizonense within section Canescentia. Compared to related species, P. arizonense proved to encode a high number of proteins involved in carbohydrate metabolism, in particular hemicellulases. Mining the genome for genes involved in secondary metabolite biosynthesis resulted...... of biosynthetic gene clusters in P. arizonense responsible for the synthesis of all detected compounds except curvulinic acid. The capacity to produce biomass degrading enzymes and the identification of a high chemical diversity in secreted bioactive secondary metabolites, offers a broad range of potential...
Approaches to advancing quantitative human health risk assessment of environmental chemicals in the post-genomic era.

Science.gov (United States)

Chiu, Weihsueh A; Euling, Susan Y; Scott, Cheryl Siegel; Subramaniam, Ravi P

2013-09-15

The contribution of genomics and associated technologies to human health risk assessment for environmental chemicals has focused largely on elucidating mechanisms of toxicity, as discussed in other articles in this issue. However, there is interest in moving beyond hazard characterization to making more direct impacts on quantitative risk assessment (QRA)--i.e., the determination of toxicity values for setting exposure standards and cleanup values. We propose that the evolution of QRA of environmental chemicals in the post-genomic era will involve three, somewhat overlapping phases in which different types of approaches begin to mature. The initial focus (in Phase I) has been and continues to be on "augmentation" of weight of evidence--using genomic and related technologies qualitatively to increase the confidence in and scientific basis of the results of QRA. Efforts aimed towards "integration" of these data with traditional animal-based approaches, in particular quantitative predictors, or surrogates, for the in vivo toxicity data to which they have been anchored are just beginning to be explored now (in Phase II). In parallel, there is a recognized need for "expansion" of the use of established biomarkers of susceptibility or risk of human diseases and disorders for QRA, particularly for addressing the issues of cumulative assessment and population risk. Ultimately (in Phase III), substantial further advances could be realized by the development of novel molecular and pathway-based biomarkers and statistical and in silico models that build on anticipated progress in understanding the pathways of human diseases and disorders. Such efforts would facilitate a gradual "reorientation" of QRA towards approaches that more directly link environmental exposures to human outcomes. Published by Elsevier Inc.
Approaches to advancing quantitative human health risk assessment of environmental chemicals in the post-genomic era

Energy Technology Data Exchange (ETDEWEB)

Chiu, Weihsueh A., E-mail: chiu.weihsueh@epa.gov [National Center for Environmental Assessment, U.S. Environmental Protection Agency, Washington DC, 20460 (United States); Euling, Susan Y.; Scott, Cheryl Siegel; Subramaniam, Ravi P. [National Center for Environmental Assessment, U.S. Environmental Protection Agency, Washington DC, 20460 (United States)

2013-09-15

The contribution of genomics and associated technologies to human health risk assessment for environmental chemicals has focused largely on elucidating mechanisms of toxicity, as discussed in other articles in this issue. However, there is interest in moving beyond hazard characterization to making more direct impacts on quantitative risk assessment (QRA) — i.e., the determination of toxicity values for setting exposure standards and cleanup values. We propose that the evolution of QRA of environmental chemicals in the post-genomic era will involve three, somewhat overlapping phases in which different types of approaches begin to mature. The initial focus (in Phase I) has been and continues to be on “augmentation” of weight of evidence — using genomic and related technologies qualitatively to increase the confidence in and scientific basis of the results of QRA. Efforts aimed towards “integration” of these data with traditional animal-based approaches, in particular quantitative predictors, or surrogates, for the in vivo toxicity data to which they have been anchored are just beginning to be explored now (in Phase II). In parallel, there is a recognized need for “expansion” of the use of established biomarkers of susceptibility or risk of human diseases and disorders for QRA, particularly for addressing the issues of cumulative assessment and population risk. Ultimately (in Phase III), substantial further advances could be realized by the development of novel molecular and pathway-based biomarkers and statistical and in silico models that build on anticipated progress in understanding the pathways of human diseases and disorders. Such efforts would facilitate a gradual “reorientation” of QRA towards approaches that more directly link environmental exposures to human outcomes.
Approaches to advancing quantitative human health risk assessment of environmental chemicals in the post-genomic era

International Nuclear Information System (INIS)

Chiu, Weihsueh A.; Euling, Susan Y.; Scott, Cheryl Siegel; Subramaniam, Ravi P.

2013-01-01

The contribution of genomics and associated technologies to human health risk assessment for environmental chemicals has focused largely on elucidating mechanisms of toxicity, as discussed in other articles in this issue. However, there is interest in moving beyond hazard characterization to making more direct impacts on quantitative risk assessment (QRA) — i.e., the determination of toxicity values for setting exposure standards and cleanup values. We propose that the evolution of QRA of environmental chemicals in the post-genomic era will involve three, somewhat overlapping phases in which different types of approaches begin to mature. The initial focus (in Phase I) has been and continues to be on “augmentation” of weight of evidence — using genomic and related technologies qualitatively to increase the confidence in and scientific basis of the results of QRA. Efforts aimed towards “integration” of these data with traditional animal-based approaches, in particular quantitative predictors, or surrogates, for the in vivo toxicity data to which they have been anchored are just beginning to be explored now (in Phase II). In parallel, there is a recognized need for “expansion” of the use of established biomarkers of susceptibility or risk of human diseases and disorders for QRA, particularly for addressing the issues of cumulative assessment and population risk. Ultimately (in Phase III), substantial further advances could be realized by the development of novel molecular and pathway-based biomarkers and statistical and in silico models that build on anticipated progress in understanding the pathways of human diseases and disorders. Such efforts would facilitate a gradual “reorientation” of QRA towards approaches that more directly link environmental exposures to human outcomes
IN-SPACE CHEMICAL PROPULSION SYSTEMS AT NASA's MARSHALL SPACE FLIGHT CENTER: HERITAGE AND CAPABILITIES

Science.gov (United States)

McRight, P. S.; Sheehy, J. A.; Blevins, J. A.

2005-01-01

NASA's Marshall Space Flight Center (MSFC) is well known for its contributions to large ascent propulsion systems such as the Saturn V rocket and the Space Shuttle external tank, solid rocket boosters, and main engines. This paper highlights a lesser known but very rich side of MSFC: its heritage in the development of in-space chemical propulsion systems and its current capabilities for spacecraft propulsion system development and chemical propulsion research. The historical narrative describes the flight development activities associated with upper stage main propulsion systems such as the Saturn S-IVB as well as orbital maneuvering and reaction control systems such as the S-IVB auxiliary propulsion system, the Skylab thruster attitude control system, and many more recent activities such as Chandra, the Demonstration of Automated Rendezvous Technology (DART), X-37, the X-38 de-orbit propulsion system, the Interim Control Module, the US Propulsion Module, and multiple technology development activities. This paper also highlights MSFC's advanced chemical propulsion research capabilities, including an overview of the center's Propulsion Systems Department and ongoing activities. The authors highlight near-term and long-term technology challenges to which MSFC research and system development competencies are relevant. This paper concludes by assessing the value of the full range of aforementioned activities, strengths, and capabilities in light of NASA's exploration missions.
Genomic Comparison of Two Family-Level Groups of the Uncultivated NAG1 Archaeal Lineage from Chemically and Geographically Disparate Hot Springs

Directory of Open Access Journals (Sweden)

Eric D. Becraft

2017-10-01

Full Text Available Recent progress based on single-cell genomics and metagenomic investigations of archaea in a variety of extreme environments has led to significant advances in our understanding of the diversity, evolution, and metabolic potential of archaea, yet the vast majority of archaeal diversity remains undersampled. In this work, we coordinated single-cell genomics with metagenomics in order to construct a near-complete genome from a deeply branching uncultivated archaeal lineage sampled from Great Boiling Spring (GBS) in the U.S. Great Basin, Nevada. This taxon is distantly related (distinct families) to an archaeal genome, designated “Novel Archaeal Group 1” (NAG1), which was extracted from a metagenome recovered from an acidic iron spring in Yellowstone National Park (YNP). We compared the metabolic predictions of the NAG1 lineage to better understand how these archaea could inhabit such chemically distinct environments. Similar to the NAG1 population previously studied in YNP, the NAG1 population from GBS is predicted to utilize proteins as a primary carbon source, ferment simple carbon sources, and use oxygen as a terminal electron acceptor under oxic conditions. However, GBS NAG1 populations contained distinct genes involved in central carbon metabolism and electron transfer, including nitrite reductase, which could confer the ability to reduce nitrite under anaerobic conditions. Despite inhabiting chemically distinct environments with large variations in pH, GBS NAG1 populations shared many core genomic and metabolic features with the archaeon identified from YNP, yet were able to carve out a distinct niche at GBS.
DNA-Bank of the Siberian Group Chemical Enterprises workers and Seversk city residents

International Nuclear Information System (INIS)

Freidin, M. B.; Goncharova, I. A.; Karpov, A. B.; Takhauov, R. M.

2004-01-01

According to the most common definition, a DNA-bank is a system for storing genetic material. For nuclear-chemical plant workers, the creation of a DNA-bank is motivated by the need to preserve the hereditary material of these people and their descendants for further evaluation of the consequences of technogenic factors acting on the human genome, using contemporary conceptual and applied advances in genetics. Within the framework of a study of the influence of technogenic factors on the human genome and on the development of genetically caused disorders, the Seversk Biophysical Research Center is creating a DNA-bank of Siberian Group of Chemical Enterprises workers exposed to radiation, their descendants, and inhabitants of ZATO Seversk and Tomsk. The DNA-bank will be the basis for all major research laboratory projects: analysis of the molecular basis of individual radiosensitivity; analysis of the role of technogenic factors in congenital malformations and hereditary diseases in the offspring of nuclear-chemical plant workers; and elaboration of genotype-specific test systems for the prognosis of cancer and of the development of cardiovascular and other common disorders connected with the effect of technogenic factors. Creating the DNA-bank is a technological issue aggravated by ethical problems. Whereas DNA isolation is not a problem today, the ethical complications are widely debated around the world. These questions arise strongly in view of the advances of the Human Genome Project. Informed consent for DNA usage is imperative today. Questions of DNA ownership (whether the owner is the donor or the banker) and of confidentiality, which is difficult to maintain when multiple genetic tests are performed, also remain unresolved. At present, the Genomic Medicine Laboratory holds DNA samples from more than 400 Seversk and Tomsk inhabitants affected with breast and lung cancer. More than 800 blood samples from workers of the main production facilities of the Siberian Group of Chemical Enterprises have been collected. About 1500 DNA samples
The USC Epigenome Center.

Science.gov (United States)

Laird, Peter W

2009-10-01

The University of Southern California (USC, CA, USA) has a long tradition of excellence in epigenetics. With the recent explosive growth and technological maturation of the field of epigenetics, it became clear that a dedicated high-throughput epigenomic data production facility would be needed to remain at the forefront of epigenetic research. To address this need, USC launched the USC Epigenome Center as the first large-scale center in academia dedicated to epigenomic research. The Center is providing high-throughput data production for large-scale genomic and epigenomic studies, and developing novel analysis tools for epigenomic research. This unique facility promises to be a valuable resource for multidisciplinary research, education and training in genomics, epigenomics, bioinformatics, and translational medicine.
Fungal Genomics Program

Energy Technology Data Exchange (ETDEWEB)

Grigoriev, Igor

2012-03-12

The JGI Fungal Genomics Program aims to scale up sequencing and analysis of fungal genomes to explore the diversity of fungi important for energy and the environment, and to promote functional studies on a system level. Combining new sequencing technologies and comparative genomics tools, JGI is now leading the world in fungal genome sequencing and analysis. Over 120 sequenced fungal genomes with analytical tools are available via MycoCosm (www.jgi.doe.gov/fungi), a web-portal for fungal biologists. Our model of interacting with user communities, unique among other sequencing centers, helps organize these communities, improves genome annotation and analysis work, and facilitates new larger-scale genomic projects. This resulted in 20 high-profile papers published in 2011 alone and contributing to the Genomics Encyclopedia of Fungi, which targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts). Our next grand challenges include larger scale exploration of fungal diversity (1000 fungal genomes), developing molecular tools for DOE-relevant model organisms, and analysis of complex systems and metagenomes.
Report from the Third Annual Symposium of the RIKEN-Max Planck Joint Research Center for Systems Chemical Biology.

Science.gov (United States)

Brunschweiger, Andreas

2014-08-15

The third Annual Symposium of the RIKEN-Max Planck Joint Research Center for Systems Chemical Biology was held at Ringberg castle, May 21-24, 2014. At this meeting 45 scientists from Japan and Germany presented the latest results from their research spanning a broad range of topics in chemical biology and glycobiology.
Penicillium arizonense, a new, genome sequenced fungal species, reveals a high chemical diversity in secreted metabolites

Science.gov (United States)

Grijseels, Sietske; Nielsen, Jens Christian; Randelovic, Milica; Nielsen, Jens; Nielsen, Kristian Fog; Workman, Mhairi; Frisvad, Jens Christian

2016-01-01

A new soil-borne species belonging to the Penicillium section Canescentia is described, Penicillium arizonense sp. nov. (type strain CBS 141311T = IBT 12289T). The genome was sequenced and assembled into 33.7 Mb containing 12,502 predicted genes. A phylogenetic assessment based on marker genes confirmed the grouping of P. arizonense within section Canescentia. Compared to related species, P. arizonense proved to encode a high number of proteins involved in carbohydrate metabolism, in particular hemicellulases. Mining the genome for genes involved in secondary metabolite biosynthesis resulted in the identification of 62 putative biosynthetic gene clusters. Extracts of P. arizonense were analysed for secondary metabolites, and austalides, pyripyropenes, tryptoquivalines, fumagillin, pseurotin A, curvulinic acid and xanthoepocin were detected. A comparative analysis against known pathways enabled the proposal of biosynthetic gene clusters in P. arizonense responsible for the synthesis of all detected compounds except curvulinic acid. The capacity to produce biomass-degrading enzymes and the identification of a high chemical diversity in secreted bioactive secondary metabolites offer a broad range of potential industrial applications for the new species P. arizonense. The description and availability of the genome sequence of P. arizonense further provide the basis for biotechnological exploitation of this species. PMID:27739446
Inorganic chemical composition and chemical reactivity of settled dust generated by the World Trade Center building collapse: Chapter 12

Science.gov (United States)

Plumlee, Geoffrey S.; Hageman, Philip L.; Lamothe, Paul J.; Ziegler, Thomas L.; Meeker, Gregory P.; Theodorakos, Peter M.; Brownfield, Isabelle; Adams, Monique G.; Swayze, Gregg A.; Hoefen, Todd M.; Taggart, Joseph E.; Clark, Roger N.; Wilson, S.; Sutley, Stephen J.

2009-01-01

Samples of dust deposited around lower Manhattan by the September 11, 2001, World Trade Center (WTC) collapse have inorganic chemical compositions that result in part from the variable chemical contributions of concrete, gypsum wallboard, glass fibers, window glass, and other materials contained in the buildings. The dust deposits were also modified chemically by variable interactions with rain water or water used in street washing and fire fighting. Chemical leach tests using deionized water as the extraction fluid show the dust samples can be quite alkaline, due primarily to reactions with calcium hydroxide in concrete particles. Calcium and sulfate are the most soluble components in the dust, but many other elements are also readily leached, including metals such as Al, Sb, Mo, Cr, Cu, and Zn. Indoor dust samples produce leachates with higher pH, alkalinity, and dissolved solids than outdoor dust samples, suggesting most outdoor dust had reacted with water and atmospheric carbon dioxide prior to sample collection. Leach tests using simulated lung fluids as the extracting fluid suggest that the dust might also be quite reactive in fluids lining the respiratory tract, resulting in dissolution of some particles and possible precipitation of new phases such as phosphates, carbonates, and silicates. Results of these chemical characterization studies can be used by health scientists as they continue to track and interpret health effects resulting from the short-term exposure to the initial dust cloud and the longer-term exposure to dusts resuspended during cleanup.
Allele coding in genomic evaluation

Directory of Open Access Journals (Sweden)

Christensen Ole F

2011-06-01

Full Text Available. Abstract. Background: Genomic data are used in animal breeding to assist genetic evaluation. Several models to estimate genomic breeding values have been studied. In general, two approaches have been used. One approach estimates the marker effects first, and then genomic breeding values are obtained by summing marker effects. In the second approach, genomic breeding values are estimated directly using an equivalent model with a genomic relationship matrix. Allele coding is the method chosen to assign values to the regression coefficients in the statistical model. A common allele coding is zero for the homozygous genotype of the first allele, one for the heterozygote, and two for the homozygous genotype for the other allele. Another common allele coding changes these regression coefficients by subtracting a value from each marker such that the mean of regression coefficients is zero within each marker. We call this centered allele coding. This study considered effects of different allele coding methods on inference. Both marker-based and equivalent models were considered, and restricted maximum likelihood and Bayesian methods were used in inference. Results: Theoretical derivations showed that parameter estimates and estimated marker effects in marker-based models are the same irrespective of the allele coding, provided that the model has a fixed general mean. For the equivalent models, the same results hold, even though different allele coding methods lead to different genomic relationship matrices. Calculated genomic breeding values are independent of allele coding when the estimate of the general mean is included into the values. Reliabilities of estimated genomic breeding values calculated using elements of the inverse of the coefficient matrix depend on the allele coding because different allele coding methods imply different models. Finally, allele coding affects the mixing of Markov chain Monte Carlo algorithms, with the centered coding being
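The two codings described in this record can be illustrated with a minimal Python sketch. This is not the authors' code: the small genotype matrix and the function names are hypothetical, and the relationship matrix is left unscaled (a plain cross-product of the coded markers) purely for illustration.

```python
from typing import List

Matrix = List[List[float]]


def center_markers(genotypes: Matrix) -> Matrix:
    """Centered allele coding: subtract each marker's mean from its column."""
    n = len(genotypes)
    m = len(genotypes[0])
    means = [sum(row[j] for row in genotypes) / n for j in range(m)]
    return [[row[j] - means[j] for j in range(m)] for row in genotypes]


def relationship_matrix(coded: Matrix) -> Matrix:
    """Unscaled relationship matrix Z Z^T (illustrative assumption, no scaling)."""
    n = len(coded)
    return [
        [sum(a * b for a, b in zip(coded[i], coded[k])) for k in range(n)]
        for i in range(n)
    ]


def row_difference(coded: Matrix, i: int, k: int) -> List[float]:
    """Marker-wise contrast between individuals i and k."""
    return [a - b for a, b in zip(coded[i], coded[k])]


# Hypothetical data: rows are individuals, columns are markers, coded 0/1/2.
genotypes: Matrix = [
    [0, 1, 2],
    [1, 1, 0],
    [2, 0, 1],
    [1, 2, 1],
]

centered = center_markers(genotypes)
g_raw = relationship_matrix(genotypes)
g_centered = relationship_matrix(centered)

# Each centered marker column now has mean zero.
column_sums = [sum(row[j] for row in centered) for j in range(3)]

# The two codings give different relationship matrices ...
print(g_raw[0][0], g_centered[0][0])
# ... but contrasts between individuals are unchanged by centering.
print(row_difference(genotypes, 0, 1), row_difference(centered, 0, 1))
```

In this toy example the relationship matrices differ between codings while the contrasts between individuals do not, which is in line with the abstract's statement that different codings lead to different genomic relationship matrices.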
Effects of immunostimulation on social behavior, chemical communication and genome-wide gene expression in honey bee workers (Apis mellifera)

Directory of Open Access Journals (Sweden)

Richard Freddie-Jeanne

2012-10-01

Full Text Available. Abstract. Background: Social insects, such as honey bees, use molecular, physiological and behavioral responses to combat pathogens and parasites. The honey bee genome contains all of the canonical insect immune response pathways, and several studies have demonstrated that pathogens can activate expression of immune effectors. Honey bees also use behavioral responses, termed social immunity, to collectively defend their hives from pathogens and parasites. These responses include hygienic behavior (where workers remove diseased brood) and allo-grooming (where workers remove ectoparasites from nestmates). We have previously demonstrated that immunostimulation causes changes in the cuticular hydrocarbon profiles of workers, which results in altered worker-worker social interactions. Thus, cuticular hydrocarbons may enable workers to identify sick nestmates, and adjust their behavior in response. Here, we test the specificity of behavioral, chemical and genomic responses to immunostimulation by challenging workers with a panel of different immune stimulants (saline, Sephadex beads and Gram-negative bacteria E. coli). Results: While only bacteria-injected bees elicited altered behavioral responses from healthy nestmates compared to controls, all treatments resulted in significant changes in cuticular hydrocarbon profiles. Immunostimulation caused significant changes in expression of hundreds of genes, the majority of which have not been identified as members of the canonical immune response pathways. Furthermore, several new candidate genes that may play a role in cuticular hydrocarbon biosynthesis were identified. Effects of immune challenge on expression of several genes involved in immune response, cuticular hydrocarbon biosynthesis, and the Notch signaling pathway were confirmed using quantitative real-time PCR. Finally, we identified common genes regulated by pathogen challenge in honey bees and other insects. Conclusions: These results demonstrate that
Genomic mechanisms of stress tolerance for the industrial yeast Saccharomyces cerevisiae against the major chemical classes of inhibitors derived from lignocellulosic biomass conversion

Science.gov (United States)

Scientists at ARS developed a tolerant industrial yeast that is able to reduce major chemical classes of inhibitors into less toxic or nontoxic compounds while producing ethanol. Using genomic studies, we defined mechanisms of in situ detoxification involved in novel gene functions, vital cofactor r...
Application of Chemical Genomics to Plant-Bacteria Communication: A High-Throughput System to Identify Novel Molecules Modulating the Induction of Bacterial Virulence Genes by Plant Signals.

Science.gov (United States)

Vandelle, Elodie; Puttilli, Maria Rita; Chini, Andrea; Devescovi, Giulia; Venturi, Vittorio; Polverari, Annalisa

2017-01-01

The life cycle of bacterial phytopathogens consists of a benign epiphytic phase, during which the bacteria grow in the soil or on the plant surface, and a virulent endophytic phase involving the penetration of host defenses and the colonization of plant tissues. Innovative strategies are urgently required to integrate copper treatments that control the epiphytic phase with complementary tools that control the virulent endophytic phase, thus reducing the quantity of chemicals applied to economically and ecologically acceptable levels. Such strategies include targeted treatments that weaken bacterial pathogens, particularly those inhibiting early infection steps rather than tackling established infections. This chapter describes a reporter gene-based chemical genomic high-throughput screen for the induction of bacterial virulence by plant molecules. Specifically, we describe a chemical genomic screening method to identify agonist and antagonist molecules for the induction of targeted bacterial virulence genes by plant extracts, focusing on the experimental controls required to avoid false positives and thus ensuring the results are reliable and reproducible.

Genomics: The Science and Technology Behind the Human Genome Project (by Charles R. Cantor and Cassandra L. Smith)

Science.gov (United States)

Serra, Martin J. (reviewer)

2000-01-01

Genomics is one of the most rapidly expanding areas of science. This book is an outgrowth of a series of lectures given by one of the former heads (CRC) of the Human Genome Initiative. The book is designed to reach a wide audience, from biologists with little chemical or physical science background through engineers, computer scientists, and physicists with little current exposure to the chemical or biological principles of genetics. The text starts with a basic review of the chemical and biological properties of DNA. However, without either a biochemistry background or a supplemental biochemistry text, this chapter and much of the rest of the text would be difficult to digest. The second chapter is designed to put DNA into the context of the larger chromosomal unit. Specialized chromosomal structures and sequences (centromeres, telomeres) are introduced, leading to a section on chromosome organization and purification. The next 4 chapters cover the physical (hybridization, electrophoresis), chemical (polymerase chain reaction), and biological (genetic) techniques that provide the backbone of genomic analysis. These chapters cover in significant detail the fundamental principles underlying each technique and provide a firm background for the remainder of the text. Chapters 7-9 consider the need and methods for the development of physical maps. Chapter 7 primarily discusses chromosomal localization techniques, including in situ hybridization, FISH, and chromosome paintings. The next two chapters focus on the development of libraries and clones. In particular, Chapter 9 considers the limitations of current mapping and clone production. The current state and future of DNA sequencing are covered in the next three chapters. The first considers the current methods of DNA sequencing, especially gel-based methods of analysis, although other possible approaches (mass spectrometry) are introduced. Much of the chapter addresses the limitations of current methods, including
Localized chemical switching of the charge state of nitrogen-vacancy luminescence centers in diamond

Energy Technology Data Exchange (ETDEWEB)

Shanley, Toby W.; Martin, Aiden A.; Aharonovich, Igor, E-mail: Igor.Aharonovich@uts.edu.au; Toth, Milos, E-mail: Milos.Toth@uts.edu.au [School of Physics and Advanced Materials, University of Technology, Sydney, P.O. Box 123, Broadway, New South Wales 2007 (Australia)

2014-08-11

We present a direct-write chemical technique for controlling the charge state of near-surface nitrogen vacancy centers (NVs) in diamond by surface fluorination. Fluorination of H-terminated diamond is realized by electron beam stimulated desorption of H₂O in the presence of NF₃ and verified with environmental photoyield spectroscopy (EPYS) and photoluminescence (PL) spectroscopy. PL spectra of shallow NVs in H- and F-terminated nanodiamonds show the expected dependence of the NV charge state on their energetic position with respect to the Fermi-level. EPYS reveals a corresponding difference between the ionization potential of H- and F-terminated diamond. The electron beam fluorination process is highly localized and can be used to fluorinate H-terminated diamond, and to increase the population of negatively charged NV centers.
SNUGB: a versatile genome browser supporting comparative and functional fungal genomics

Directory of Open Access Journals (Sweden)

Kim Seungill

2008-12-01

Full Text Available. Abstract. Background: Since the full genome sequences of Saccharomyces cerevisiae were released in 1996, genome sequences of over 90 fungal species have become publicly available. The heterogeneous formats of genome sequences archived in different sequencing centers hampered the integration of the data for efficient and comprehensive comparative analyses. The Comparative Fungal Genomics Platform (CFGP) was developed to archive these data via a single standardized format that can support multifaceted and integrated analyses of the data. To facilitate efficient data visualization and utilization within and across species based on the architecture of CFGP and associated databases, a new genome browser was needed. Results: The Seoul National University Genome Browser (SNUGB) integrates various types of genomic information derived from 98 fungal/oomycete (137 datasets) and 34 plant and animal (38 datasets) species, graphically presents germane features and properties of each genome, and supports comparison between genomes. The SNUGB provides three different forms of the data presentation interface, including diagram, table, and text, and six different display options to support visualization and utilization of the stored information. Information for individual species can be quickly accessed via a new tool named the taxonomy browser. In addition, SNUGB offers four useful data annotation/analysis functions, including 'BLAST annotation.' The modular design of SNUGB makes its adoption to support other comparative genomic platforms easy and facilitates continuous expansion. Conclusion: The SNUGB serves as a powerful platform supporting comparative and functional genomics within the fungal kingdom and also across other kingdoms. All data and functions are available at the web site http://genomebrowser.snu.ac.kr/.
Perfect alignment and preferential orientation of nitrogen-vacancy centers during chemical vapor deposition diamond growth on (111) surfaces

International Nuclear Information System (INIS)

Michl, Julia; Zaiser, Sebastian; Jakobi, Ingmar; Waldherr, Gerald; Dolde, Florian; Neumann, Philipp; Wrachtrup, Jörg; Teraji, Tokuyuki; Doherty, Marcus W.; Manson, Neil B.; Isoya, Junichi

2014-01-01

Synthetic diamond production is a key to the development of quantum metrology and quantum information applications of diamond. The major quantum sensor and qubit candidate in diamond is the nitrogen-vacancy (NV) color center. This lattice defect comes in four different crystallographic orientations leading to an intrinsic inhomogeneity among NV centers, which is undesirable in some applications. Here, we report a microwave plasma-assisted chemical vapor deposition diamond growth technique on (111)-oriented substrates, which yields perfect alignment (94% ± 2%) of as-grown NV centers along a single crystallographic direction. In addition, clear evidence is found that the majority (74% ± 4%) of the aligned NV centers were formed by the nitrogen being first included in the (111) growth surface and then followed by the formation of a neighboring vacancy on top. The achieved homogeneity of the grown NV centers will tremendously benefit quantum information and metrology applications.
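The inhomogeneity caused by the four NV orientations mentioned in this record can be made concrete with a short Python sketch. It assumes the standard picture, which the abstract does not spell out, that the four NV axes lie along the ⟨111⟩ bond directions of the diamond lattice. The function names and the unit-strength test field are hypothetical and serve only as an illustration.

```python
import math
from typing import List, Tuple

Vector = Tuple[float, float, float]

# Assumption: the four possible NV axes are the <111> bond directions.
NV_AXES: List[Vector] = [
    (1, 1, 1),
    (1, -1, -1),
    (-1, 1, -1),
    (-1, -1, 1),
]


def unit(v: Vector) -> Vector:
    """Return v scaled to unit length."""
    norm = math.sqrt(sum(c * c for c in v))
    return (v[0] / norm, v[1] / norm, v[2] / norm)


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def axial_projection(field: Vector, axis: Vector) -> float:
    """Magnitude of the field component along one NV axis."""
    return abs(dot(field, unit(axis)))


# Angle between any two distinct NV axes (the tetrahedral angle).
angle_deg = math.degrees(math.acos(dot(unit(NV_AXES[0]), unit(NV_AXES[1]))))

# Hypothetical unit-strength field applied along the [111] growth direction.
field = unit((1, 1, 1))
projections = [axial_projection(field, axis) for axis in NV_AXES]

print(round(angle_deg, 2))
print([round(p, 3) for p in projections])
```

Under these assumptions, a field along [111] projects fully onto one orientation and only one third as strongly onto the other three, so an ensemble with mixed orientations responds non-uniformly. An ensemble aligned along a single direction, as reported in the abstract, avoids that spread.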
Sequencing the CHO DXB11 genome reveals regional variations in genomic stability and haploidy

DEFF Research Database (Denmark)

Kaas, Christian Schrøder; Kristensen, Claus; Betenbaugh, Michael J.

2015-01-01

Background: The DHFR negative CHO DXB11 cell line (also known as DUX-B11 and DUKX) was historically the first CHO cell line to be used for large scale production of heterologous proteins and is still used for production of a number of complex proteins. Results: Here we present the genomic sequence...... of the CHO DXB11 genome sequenced to a depth of 33x. Overall a significant genomic drift was seen favoring GC -> AT point mutations in line with the chemical mutagenesis strategy used for generation of the cell line. The sequencing depth for each gene in the genome revealed distinct peaks at sequencing...... in eight additional analyzed CHO genomes (15-20% haploidy) but not in the genome of the Chinese hamster. The dhfr gene is confirmed to be haploid in CHO DXB11; transcriptionally active and the remaining allele contains a G410C point mutation causing a Thr137Arg missense mutation. We find ~2...
MIPS: a database for protein sequences and complete genomes.

Science.gov (United States)

Mewes, H W; Hani, J; Pfeiffer, F; Frishman, D

1998-01-01

The MIPS group [Munich Information Center for Protein Sequences of the German National Center for Environment and Health (GSF)] at the Max-Planck-Institute for Biochemistry, Martinsried near Munich, Germany, is involved in a number of data collection activities, including a comprehensive database of the yeast genome, a database reflecting the progress in sequencing the Arabidopsis thaliana genome, the systematic analysis of other small genomes and the collection of protein sequence data within the framework of the PIR-International Protein Sequence Database (described elsewhere in this volume). Through its WWW server (http://www.mips.biochem.mpg.de) MIPS provides access to a variety of generic databases, including a database of protein families as well as automatically generated data by the systematic application of sequence analysis algorithms. The yeast genome sequence and its related information was also compiled on CD-ROM to provide dynamic interactive access to the 16 chromosomes of the first eukaryotic genome unraveled. PMID:9399795
Ideas and Approaches on “Construction of High Level Simulation Experimental Teaching Center of Virtual Chemical Laboratory”

Science.gov (United States)

Zhang, Yunshen

2017-11-01

Guided by the Circular on the Construction of National Virtual Simulation Experimental Teaching Center issued by the National Department of Education, and following the requirements of the construction task and work content, this paper draws on the practice of the simulation experimental teaching center of the virtual chemical laboratory at Tianjin University to strengthen the understanding of virtual simulation experimental teaching centers from three aspects. On this basis, it puts forward specific construction ideas, summarized as “four combinations, five in one, the optimization of the resources and school-enterprise cooperation”, and reports effective explorations of them. Taking the synthesis and analysis of organic compounds as an example, it also shows the powerful functions of the virtual simulation experimental teaching platform in all aspects.
Hazardous Chemicals

Centers for Disease Control (CDC) Podcasts

2007-04-10

Chemicals are a part of our daily lives, providing many products and modern conveniences. With more than three decades of experience, the Centers for Disease Control and Prevention (CDC) has been in the forefront of efforts to protect and assess people's exposure to environmental and hazardous chemicals. This report provides information about hazardous chemicals and useful tips on how to protect you and your family from harmful exposure. Created: 4/10/2007 by CDC National Center for Environmental Health. Date Released: 4/13/2007.
University of Texas Southwestern Medical Center: High-Throughput siRNA Screening of a Non-Small Cell Lung Cancer (NSCLC) Cell Line Panel | Office of Cancer Genomics

Science.gov (United States)

The goal of this project is to use siRNA screens to identify NSCLC-selective siRNAs from two genome-wide libraries that will allow us to functionally define genetic dependencies of subtypes of NSCLC. Using bioinformatics tools, the CTD2 center at the University of Texas Southwestern Medical Center is discovering associations between this functional data (siRNAs) and NSCLC mutational status, methylation arrays, gene expression arrays, and copy number variation data that will help us identify new targets and enrollment biomarkers.
Multiplexed precision genome editing with trackable genomic barcodes in yeast.

Science.gov (United States)

Roy, Kevin R; Smith, Justin D; Vonesch, Sibylle C; Lin, Gen; Tu, Chelsea Szu; Lederer, Alex R; Chu, Angela; Suresh, Sundari; Nguyen, Michelle; Horecka, Joe; Tripathi, Ashutosh; Burnett, Wallace T; Morgan, Maddison A; Schulz, Julia; Orsley, Kevin M; Wei, Wu; Aiyar, Raeka S; Davis, Ronald W; Bankaitis, Vytas A; Haber, James E; Salit, Marc L; St Onge, Robert P; Steinmetz, Lars M

2018-07-01

Our understanding of how genotype controls phenotype is limited by the scale at which we can precisely alter the genome and assess the phenotypic consequences of each perturbation. Here we describe a CRISPR-Cas9-based method for multiplexed accurate genome editing with short, trackable, integrated cellular barcodes (MAGESTIC) in Saccharomyces cerevisiae. MAGESTIC uses array-synthesized guide-donor oligos for plasmid-based high-throughput editing and features genomic barcode integration to prevent plasmid barcode loss and to enable robust phenotyping. We demonstrate that editing efficiency can be increased more than fivefold by recruiting donor DNA to the site of breaks using the LexA-Fkh1p fusion protein. We performed saturation editing of the essential gene SEC14 and identified amino acids critical for chemical inhibition of lipid signaling. We also constructed thousands of natural genetic variants, characterized guide mismatch tolerance at the genome scale, and ascertained that cryptic Pol III termination elements substantially reduce guide efficacy. MAGESTIC will be broadly useful to uncover the genetic basis of phenotypes in yeast.
Origins of chemical diversity of back-arc basin basalts: a segment-scale study of the Eastern Lau Spreading Center

OpenAIRE

Bézos, Antoine; Escrig, Stéphane; Langmuir, Charles H.; Michael, Peter J.; Asimow, Paul D.

2009-01-01

We report major, trace, and volatile element data on basaltic glasses from the northernmost segment of the Eastern Lau Spreading Center (ELSC1) in the Lau back-arc basin to further test and constrain models of back-arc volcanism. The zero-age samples come from 47 precisely collected stations from an 85 km length spreading center. The chemical data covary similarly to other back-arc systems but with tighter correlations and well-developed spatial systematics. We confirm a correlation between v...
A computational genomics pipeline for prokaryotic sequencing projects.

Science.gov (United States)

Kislyuk, Andrey O; Katz, Lee S; Agrawal, Sonia; Hagen, Matthew S; Conley, Andrew B; Jayaraman, Pushkala; Nelakuditi, Viswateja; Humphrey, Jay C; Sammons, Scott A; Govil, Dhwani; Mair, Raydel D; Tatti, Kathleen M; Tondella, Maria L; Harcourt, Brian H; Mayer, Leonard W; Jordan, I King

2010-08-01

New sequencing technologies have accelerated research on prokaryotic genomes and have made genome sequencing operations outside major genome sequencing centers routine. However, no off-the-shelf solution exists for the combined assembly, gene prediction, genome annotation and data presentation necessary to interpret sequencing data. The resulting requirement to invest significant resources into custom informatics support for genome sequencing projects remains a major impediment to the accessibility of high-throughput sequence data. We present a self-contained, automated high-throughput open source genome sequencing and computational genomics pipeline suitable for prokaryotic sequencing projects. The pipeline has been used at the Georgia Institute of Technology and the Centers for Disease Control and Prevention for the analysis of Neisseria meningitidis and Bordetella bronchiseptica genomes. The pipeline is capable of enhanced or manually assisted reference-based assembly using multiple assemblers and modes; gene predictor combining; and functional annotation of genes and gene products. Because every component of the pipeline is executed on a local machine with no need to access resources over the Internet, the pipeline is suitable for projects of a sensitive nature. Annotation of virulence-related features makes the pipeline particularly useful for projects working with pathogenic prokaryotes. The pipeline is licensed under the open-source GNU General Public License and available at the Georgia Tech Neisseria Base (http://nbase.biology.gatech.edu/). The pipeline is implemented with a combination of Perl, Bourne Shell and MySQL and is compatible with Linux and other Unix systems.
Data Mining Supercomputing with SAS JMP® Genomics

Directory of Open Access Journals (Sweden)

Richard S. Segall

2011-02-01

Full Text Available JMP® Genomics is statistical discovery software that can uncover meaningful patterns in high-throughput genomics and proteomics data. JMP® Genomics is designed for biologists, biostatisticians, statistical geneticists, and those engaged in analyzing the vast stores of data that are common in genomic research (SAS, 2009). Data mining was performed using JMP® Genomics on the two collections of microarray databases available from the National Center for Biotechnology Information (NCBI) for lung cancer and breast cancer. The Gene Expression Omnibus (GEO) of NCBI serves as a public repository for a wide range of high-throughput experimental data, including the two collections of lung cancer and breast cancer that were used for this research. The results of applying data mining with JMP® Genomics are shown in this paper with numerous screen shots.
[Development of Plant Metabolomics and Medicinal Plant Genomics].

Science.gov (United States)

Saito, Kazuki

2018-01-01

A variety of chemicals produced by plants, often referred to as 'phytochemicals', have been used as medicines, food, fuels and industrial raw materials. Recent advances in the study of genomics and metabolomics in plant science have accelerated our understanding of the mechanisms, regulation and evolution of the biosynthesis of specialized plant products. We can now address such questions as how the metabolomic diversity of plants originates at the genome level, and how we should apply this knowledge to drug discovery, industry and agriculture. Our research group has focused on metabolomics-based functional genomics over the last 15 years and we have developed a new research area called 'Phytochemical Genomics'. In this review, the development of a research platform for plant metabolomics is discussed first, to provide a better understanding of the chemical diversity of plants. Then, representative applications of metabolomics to functional genomics in a model plant, Arabidopsis thaliana, are described. The extension of integrated multi-omics analyses to non-model specialized plants, e.g., medicinal plants, is presented, including the identification of novel genes, metabolites and networks for the biosynthesis of flavonoids, alkaloids, sulfur-containing metabolites and terpenoids. Further, functional genomics studies on a variety of medicinal plants are presented. I also discuss future trends in pharmacognosy and related sciences.
ENCODE whole-genome data in the UCSC genome browser (2011 update).

Science.gov (United States)

Raney, Brian J; Cline, Melissa S; Rosenbloom, Kate R; Dreszer, Timothy R; Learned, Katrina; Barber, Galt P; Meyer, Laurence R; Sloan, Cricket A; Malladi, Venkat S; Roskin, Krishna M; Suh, Bernard B; Hinrichs, Angie S; Clawson, Hiram; Zweig, Ann S; Kirkup, Vanessa; Fujita, Pauline A; Rhead, Brooke; Smith, Kayla E; Pohl, Andy; Kuhn, Robert M; Karolchik, Donna; Haussler, David; Kent, W James

2011-01-01

The ENCODE project is an international consortium with a goal of cataloguing all the functional elements in the human genome. The ENCODE Data Coordination Center (DCC) at the University of California, Santa Cruz serves as the central repository for ENCODE data. In this role, the DCC offers a collection of high-throughput, genome-wide data generated with technologies such as ChIP-Seq, RNA-Seq, DNA digestion and others. This data helps illuminate transcription factor-binding sites, histone marks, chromatin accessibility, DNA methylation, RNA expression, RNA binding and other cell-state indicators. It includes sequences with quality scores, alignments, signals calculated from the alignments, and in most cases, element or peak calls calculated from the signal data. Each data set is available for visualization and download via the UCSC Genome Browser (http://genome.ucsc.edu/). ENCODE data can also be retrieved using a metadata system that captures the experimental parameters of each assay. The ENCODE web portal at UCSC (http://encodeproject.org/) provides information about the ENCODE data and links for access.
Chemical Elicitors of Antibiotic Biosynthesis in Actinomycetes

Directory of Open Access Journals (Sweden)

Anton P. Tyurin

2018-06-01

Full Text Available Whole genome sequencing of actinomycetes has uncovered a new immense realm of microbial chemistry and biology. Most biosynthetic gene clusters present in genomes were found to remain “silent” under standard cultivation conditions. Some small molecules—chemical elicitors—can be used to induce the biosynthesis of antibiotics in actinobacteria and to expand the chemical diversity of secondary metabolites. Here, we outline a brief account of the basic principles of the search for regulators of this type and their application.
Structural Genomics of Minimal Organisms: Pipeline and Results

Energy Technology Data Exchange (ETDEWEB)

Kim, Sung-Hou; Shin, Dong-Hae; Kim, Rosalind; Adams, Paul; Chandonia, John-Marc

2007-09-14

The initial objective of the Berkeley Structural Genomics Center was to obtain near-complete three-dimensional (3D) structural information for all soluble proteins of two minimal organisms, the closely related pathogens Mycoplasma genitalium and M. pneumoniae. The former has fewer than 500 genes and the latter has fewer than 700 genes. A semiautomated structural genomics pipeline was set up, running from target selection through cloning, expression and purification to structural determination. At the time of this writing, structural information for more than 93 percent of all soluble proteins of M. genitalium is available. This chapter summarizes the approaches taken by the authors' center.
Annual report of the Institute of Physical and Chemical Research, for fiscal 1998

International Nuclear Information System (INIS)

1999-01-01

This annual report presents abstracts of the research, oral presentations and papers reported for fiscal 1998 by each laboratory of RIKEN (the Institute of Physical and Chemical Research). It also lists the themes of special project funding for basic science, grant research, contract research and industrial property, the research subjects of special postdoctoral researchers and junior research associates, and the technology research subjects of technology research fellows. Abstracts of the research, oral presentations and publications reported by the Frontier Research Program, the Brain Science Institute and the RIKEN Genomic Science Center are included. RIKEN Symposia and symposia sponsored by RIKEN are also described. (S.Y.)
Perspectives on Genetic and Genomic Technologies in an Academic Medical Center: The Duke Experience

Science.gov (United States)

Katsanis, Sara Huston; Minear, Mollie A.; Vorderstrasse, Allison; Yang, Nancy; Reeves, Jason W.; Rakhra-Burris, Tejinder; Cook-Deegan, Robert; Ginsburg, Geoffrey S.; Simmons, Leigh Ann

2015-01-01

In this age of personalized medicine, genetic and genomic testing is expected to become instrumental in health care delivery, but little is known about its actual implementation in clinical practice. Methods. We surveyed Duke faculty and healthcare providers to examine the extent of genetic and genomic testing adoption. We assessed providers’ use of genetic and genomic testing options and indications in clinical practice, providers’ awareness of pharmacogenetic applications, and providers’ opinions on returning research-generated genetic test results to participants. Most clinician respondents currently use family history routinely in their clinical practice, but only 18 percent of clinicians use pharmacogenetics. Only two respondents correctly identified the number of drug package inserts with pharmacogenetic indications. We also found strong support for the return of genetic research results to participants. Our results demonstrate that while Duke healthcare providers are enthusiastic about genomic technologies, use of genomic tools outside of research has been limited. Respondents favor return of research-based genetic results to participants, but clinicians lack knowledge about pharmacogenetic applications. We identified challenges faced by this institution when implementing genetic and genomic testing into patient care that should inform a policy and education agenda to improve provider support and clinician-researcher partnerships. PMID:25854543
Perspectives on Genetic and Genomic Technologies in an Academic Medical Center: The Duke Experience

Directory of Open Access Journals (Sweden)

Sara Huston Katsanis

2015-04-01

Full Text Available In this age of personalized medicine, genetic and genomic testing is expected to become instrumental in health care delivery, but little is known about its actual implementation in clinical practice. Methods. We surveyed Duke faculty and healthcare providers to examine the extent of genetic and genomic testing adoption. We assessed providers’ use of genetic and genomic testing options and indications in clinical practice, providers’ awareness of pharmacogenetic applications, and providers’ opinions on returning research-generated genetic test results to participants. Most clinician respondents currently use family history routinely in their clinical practice, but only 18 percent of clinicians use pharmacogenetics. Only two respondents correctly identified the number of drug package inserts with pharmacogenetic indications. We also found strong support for the return of genetic research results to participants. Our results demonstrate that while Duke healthcare providers are enthusiastic about genomic technologies, use of genomic tools outside of research has been limited. Respondents favor return of research-based genetic results to participants, but clinicians lack knowledge about pharmacogenetic applications. We identified challenges faced by this institution when implementing genetic and genomic testing into patient care that should inform a policy and education agenda to improve provider support and clinician-researcher partnerships.

An eMERGE Clinical Center at Partners Personalized Medicine

Directory of Open Access Journals (Sweden)

Jordan W. Smoller

2016-01-01

Full Text Available The integration of electronic medical records (EMRs) and genomic research has become a major component of efforts to advance personalized and precision medicine. The Electronic Medical Records and Genomics (eMERGE) network, initiated in 2007, is an NIH-funded consortium devoted to genomic discovery and implementation research by leveraging biorepositories linked to EMRs. In its most recent phase, eMERGE III, the network is focused on facilitating implementation of genomic medicine by detecting and disclosing rare pathogenic variants in clinically relevant genes. Partners Personalized Medicine (PPM) is a center dedicated to translating personalized medicine into clinical practice within Partners HealthCare. One component of the PPM is the Partners HealthCare Biobank, a biorepository comprising broadly consented DNA samples linked to the Partners longitudinal EMR. In 2015, PPM joined the eMERGE Phase III network. Here we describe the elements of the eMERGE clinical center at PPM, including plans for genomic discovery using EMR phenotypes, evaluation of rare variant penetrance and pleiotropy, and a novel randomized trial of the impact of returning genetic results to patients and clinicians.
Database Resources of the BIG Data Center in 2018.

Science.gov (United States)

2018-01-04

The BIG Data Center at Beijing Institute of Genomics (BIG) of the Chinese Academy of Sciences provides freely open access to a suite of database resources in support of worldwide research activities in both academia and industry. With the vast amounts of omics data generated at ever-greater scales and rates, the BIG Data Center is continually expanding, updating and enriching its core database resources through big-data integration and value-added curation, including BioCode (a repository archiving bioinformatics tool codes), BioProject (a biological project library), BioSample (a biological sample library), Genome Sequence Archive (GSA, a data repository for archiving raw sequence reads), Genome Warehouse (GWH, a centralized resource housing genome-scale data), Genome Variation Map (GVM, a public repository of genome variations), Gene Expression Nebulas (GEN, a database of gene expression profiles based on RNA-Seq data), Methylation Bank (MethBank, an integrated databank of DNA methylomes), and Science Wikis (a series of biological knowledge wikis for community annotations). In addition, three featured web services are provided, viz., BIG Search (search as a service; a scalable inter-domain text search engine), BIG SSO (single sign-on as a service; a user access control system to gain access to multiple independent systems with a single ID and password) and Gsub (submission as a service; a unified submission service for all relevant resources). All of these resources are publicly accessible through the home page of the BIG Data Center at http://bigd.big.ac.cn. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Argonne Chemical Sciences & Engineering - Center for Electrical Energy

Science.gov (United States)

Analysis of Whole-Genome Data in a Public Health Lab

Centers for Disease Control (CDC) Podcasts

2017-10-17

Dr. Kelly Oakeson, a bioinformatics and genomics research analyst with the Utah Department of Health, discusses bioinformatics and genomics research. Created: 10/17/2017 by National Center for Emerging and Zoonotic Infectious Diseases (NCEZID). Date Released: 10/17/2017.
Myeloperoxidase-produced Genomic DNA-centered Radicals and Protection by Resveratrol

Science.gov (United States)

Myeloperoxidase (MPO) released by activated neutrophils, production of hypochlorous acid (HOCl) and oxidation of the genomic DNA in epithelial cells is thought to initiate and promote carcinogenesis. In this study we applied the 5,5-dimethyl-1-pyrroline N-oxide (DMPO)-based immu...
KnowEnG: a knowledge engine for genomics.

Science.gov (United States)

Sinha, Saurabh; Song, Jun; Weinshilboum, Richard; Jongeneel, Victor; Han, Jiawei

2015-11-01

We describe here the vision, motivations, and research plans of the National Institutes of Health Center for Excellence in Big Data Computing at the University of Illinois, Urbana-Champaign. The Center is organized around the construction of "Knowledge Engine for Genomics" (KnowEnG), an E-science framework for genomics where biomedical scientists will have access to powerful methods of data mining, network mining, and machine learning to extract knowledge out of genomics data. The scientist will come to KnowEnG with their own data sets in the form of spreadsheets and ask KnowEnG to analyze those data sets in the light of a massive knowledge base of community data sets called the "Knowledge Network" that will be at the heart of the system. The Center is undertaking discovery projects aimed at testing the utility of KnowEnG for transforming big data to knowledge. These projects span a broad range of biological enquiry, from pharmacogenomics (in collaboration with Mayo Clinic) to transcriptomics of human behavior. © The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
The genome of Eucalyptus grandis

Energy Technology Data Exchange (ETDEWEB)

Myburg, Alexander A.; Grattapaglia, Dario; Tuskan, Gerald A.; Hellsten, Uffe; Hayes, Richard D.; Grimwood, Jane; Jenkins, Jerry; Lindquist, Erika; Tice, Hope; Bauer, Diane; Goodstein, David M.; Dubchak, Inna; Poliakov, Alexandre; Mizrachi, Eshchar; Kullan, Anand R. K.; Hussey, Steven G.; Pinard, Desre; van der Merwe, Karen; Singh, Pooja; van Jaarsveld, Ida; Silva-Junior, Orzenil B.; Togawa, Roberto C.; Pappas, Marilia R.; Faria, Danielle A.; Sansaloni, Carolina P.; Petroli, Cesar D.; Yang, Xiaohan; Ranjan, Priya; Tschaplinski, Timothy J.; Ye, Chu-Yu; Li, Ting; Sterck, Lieven; Vanneste, Kevin; Murat, Florent; Soler, Marçal; Clemente, Hélène San; Saidi, Naijib; Cassan-Wang, Hua; Dunand, Christophe; Hefer, Charles A.; Bornberg-Bauer, Erich; Kersting, Anna R.; Vining, Kelly; Amarasinghe, Vindhya; Ranik, Martin; Naithani, Sushma; Elser, Justin; Boyd, Alexander E.; Liston, Aaron; Spatafora, Joseph W.; Dharmwardhana, Palitha; Raja, Rajani; Sullivan, Christopher; Romanel, Elisson; Alves-Ferreira, Marcio; Külheim, Carsten; Foley, William; Carocha, Victor; Paiva, Jorge; Kudrna, David; Brommonschenkel, Sergio H.; Pasquali, Giancarlo; Byrne, Margaret; Rigault, Philippe; Tibbits, Josquin; Spokevicius, Antanas; Jones, Rebecca C.; Steane, Dorothy A.; Vaillancourt, René E.; Potts, Brad M.; Joubert, Fourie; Barry, Kerrie; Pappas, Georgios J.; Strauss, Steven H.; Jaiswal, Pankaj; Grima-Pettenati, Jacqueline; Salse, Jérôme; Van de Peer, Yves; Rokhsar, Daniel S.; Schmutz, Jeremy

2014-06-11

Eucalypts are the world's most widely planted hardwood trees. Their broad adaptability, rich species diversity, fast growth and superior multipurpose wood have made them a global renewable resource of fiber and energy that mitigates human pressures on natural forests. We sequenced and assembled >94% of the 640 Mbp genome of Eucalyptus grandis into its 11 chromosomes. A set of 36,376 protein-coding genes was predicted, revealing that 34% occur in tandem duplications, the largest proportion found thus far in any plant genome. Eucalypts also show the highest diversity of genes for plant specialized metabolism that act as chemical defence against biotic agents and provide unique pharmaceutical oils. Resequencing of a set of inbred tree genomes revealed regions of strongly conserved heterozygosity, likely hotspots of inbreeding depression. The resequenced genome of the sister species E. globulus underscored the high inter-specific genome colinearity despite substantial genome size variation in the genus. The genome of E. grandis is the first reference for the early diverging Rosid order Myrtales and is placed here basal to the Eurosids. This resource expands knowledge on the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.
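The percentages quoted in this abstract can be converted into approximate absolute figures with a little arithmetic. The sketch below is illustrative only: it treats ">94%" as exactly 94% (a lower bound), and the variable names are hypothetical rather than taken from the paper.

```python
# Figures quoted in the abstract.
GENOME_SIZE_MBP = 640        # estimated E. grandis genome size
ASSEMBLED_FRACTION = 0.94    # ">94%" is treated as a lower bound (assumption)
PREDICTED_GENES = 36_376     # predicted protein-coding genes
TANDEM_FRACTION = 0.34       # share of genes in tandem duplications

# Lower bound on the sequence assembled into the 11 chromosomes, in Mbp.
assembled_mbp = GENOME_SIZE_MBP * ASSEMBLED_FRACTION

# Approximate number of genes that occur in tandem duplications.
tandem_genes = round(PREDICTED_GENES * TANDEM_FRACTION)

# Average gene density over the whole estimated genome.
genes_per_mbp = PREDICTED_GENES / GENOME_SIZE_MBP

print(f"assembled: at least {assembled_mbp:.1f} Mbp")
print(f"tandem-duplicated genes: about {tandem_genes}")
print(f"gene density: about {genes_per_mbp:.1f} genes per Mbp")
```

Under these assumptions, at least about 602 Mbp were assembled and roughly 12,400 genes sit in tandem duplications.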
The BIG Data Center: from deposition to integration to translation.

Science.gov (United States)

2017-01-04

Biological data are generated at unprecedentedly exponential rates, posing considerable challenges in big data deposition, integration and translation. The BIG Data Center, established at Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, provides a suite of database resources, including (i) Genome Sequence Archive, a data repository specialized for archiving raw sequence reads, (ii) Gene Expression Nebulas, a data portal of gene expression profiles based entirely on RNA-Seq data, (iii) Genome Variation Map, a comprehensive collection of genome variations for featured species, (iv) Genome Warehouse, a centralized resource housing genome-scale data with particular focus on economically important animals and plants, (v) Methylation Bank, an integrated database of whole-genome single-base resolution methylomes and (vi) Science Wikis, a central access point for biological wikis developed for community annotations. The BIG Data Center is dedicated to constructing and maintaining biological databases through big data integration and value-added curation, conducting basic research to translate big data into big knowledge and providing freely open access to a variety of data resources in support of worldwide research activities in both academia and industry. All of these resources are publicly available and can be found at http://bigd.big.ac.cn. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genetic and Rare Diseases Information Center (GARD)

Data.gov (United States)

Federal Laboratory Consortium — NCATS collaborates with the National Human Genome Research Institute (NHGRI) to support GARD, a center designed to provide comprehensive information about rare and...
Chemical and UV Mutagenesis.

Science.gov (United States)

Bose, Jeffrey L

2016-01-01

The ability to create mutations is an important step towards understanding bacterial physiology and virulence. While targeted approaches are invaluable, the ability to produce genome-wide random mutations can lead to crucial discoveries. Transposon mutagenesis is a useful approach, but many interesting mutations can be missed by these insertions that interrupt coding and noncoding sequences due to the integration of an entire transposon. Chemical mutagenesis and UV-based random mutagenesis are alternate approaches to isolate mutations of interest with the potential of only single nucleotide changes. Once a standard method, difficulty in identifying mutation sites had decreased the popularity of this technique. However, thanks to the recent emergence of economical whole-genome sequencing, this approach to making mutations can once again become a viable option. Therefore, this chapter provides an overview protocol for random mutagenesis using UV light or DNA-damaging chemicals.
Endocrine Disrupting Chemicals (EDCs)

Science.gov (United States)

Evolutionary Genomics of Life in (and from) the Sea

Energy Technology Data Exchange (ETDEWEB)

Boore, Jeffrey L.; Dehal, Paramvir; Fuerstenberg, Susan I.

2006-01-09

High-throughput genome sequencing centers that were originally built for the Human Genome Project (Lander et al., 2001; Venter et al., 2001) have now become an engine for comparative genomics. The six largest centers alone are now producing over 150 billion nucleotides per year, more than 50 times the amount of DNA in the human genome, and nearly all of this is directed at projects that promise great insights into the pattern and processes of evolution. Unfortunately, this data is being produced at a pace far exceeding the capacity of the scientific community to provide insightful analysis, and few scientists with training and experience in evolutionary biology have played prominent roles to date. One of the consequences is that poor quality analyses are typical; for example, orthology among genes is generally determined by simple measures of sequence similarity, although this approach was discredited by molecular evolutionary biologists decades ago. Here we discuss how genomes are chosen for sequencing and how the scientific community can have input. We describe the PhIGs database and web tools (Dehal and Boore 2005a; http://PhIGs.org), which provide phylogenetic analysis of all gene families for all completely sequenced genomes and the associated 'Synteny Viewer', which allows comparisons of the relative positions of orthologous genes. This is the best tool available for inferring gene function across multiple genomes. We also describe how we have used the PhIGs methods with the whole genome sequences of a tunicate, fish, mouse, and human to conclusively demonstrate that two rounds of whole genome duplication occurred at the base of vertebrates (Dehal and Boore 2005b). This evidence is found in the large scale structure of the positions of paralogous genes that arose from duplications inferred by evolutionary analysis to have occurred at the base of vertebrates.
Anticipation of Personal Genomics Data Enhances Interest and Learning Environment in Genomics and Molecular Biology Undergraduate Courses.

Science.gov (United States)

Weber, K Scott; Jensen, Jamie L; Johnson, Steven M

2015-01-01

An important discussion at colleges is centered on determining more effective models for teaching undergraduates. As personalized genomics has become more common, we hypothesized it could be a valuable tool to make science education more hands on, personal, and engaging for college undergraduates. We hypothesized that providing students with personal genome testing kits would enhance the learning experience of students in two undergraduate courses at Brigham Young University: Advanced Molecular Biology and Genomics. These courses have an emphasis on personal genomics the last two weeks of the semester. Students taking these courses were given the option to receive personal genomics kits in 2014, whereas in 2015 they were not. Students sent their personal genomics samples in on their own and received the data after the course ended. We surveyed students in these courses before and after the two-week emphasis on personal genomics to collect data on whether anticipation of obtaining their own personal genomic data impacted undergraduate student learning. We also tested to see if specific personal genomic assignments improved the learning experience by analyzing the data from the undergraduate students who completed both the pre- and post-course surveys. Anticipation of personal genomic data significantly enhanced student interest and the learning environment based on the time students spent researching personal genomic material and their self-reported attitudes compared to those who did not anticipate getting their own data. Personal genomics homework assignments significantly enhanced the undergraduate student interest and learning based on the same criteria and a personal genomics quiz. We found that for the undergraduate students in both molecular biology and genomics courses, incorporation of personal genomic testing can be an effective educational tool in undergraduate science education.
The Hardwood Tree Improvement and Regeneration Center: its strategic plans for sustaining the hardwood resource

Science.gov (United States)

Charles H. Michler; Michael J. Bosela; Paula M. Pijut; Keith E. Woeste

2003-01-01

A regional center for hardwood tree improvement, genomics, and regeneration research, development and technology transfer will focus on black walnut, black cherry, northern red oak and, in the future, on other fine hardwoods as the effort is expanded. The Hardwood Tree Improvement and Regeneration Center (HTIRC) will use molecular genetics and genomics along with...
The genome sequence of Caenorhabditis briggsae: a platform for comparative genomics.

Directory of Open Access Journals (Sweden)

Lincoln D Stein

2003-11-01

Full Text Available The soil nematodes Caenorhabditis briggsae and Caenorhabditis elegans diverged from a common ancestor roughly 100 million years ago and yet are almost indistinguishable by eye. They have the same chromosome number and genome sizes, and they occupy the same ecological niche. To explore the basis for this striking conservation of structure and function, we have sequenced the C. briggsae genome to a high-quality draft stage and compared it to the finished C. elegans sequence. We predict approximately 19,500 protein-coding genes in the C. briggsae genome, roughly the same as in C. elegans. Of these, 12,200 have clear C. elegans orthologs, a further 6,500 have one or more clearly detectable C. elegans homologs, and approximately 800 C. briggsae genes have no detectable matches in C. elegans. Almost all of the noncoding RNAs (ncRNAs) known are shared between the two species. The two genomes exhibit extensive colinearity, and the rate of divergence appears to be higher in the chromosomal arms than in the centers. Operons, a distinctive feature of C. elegans, are highly conserved in C. briggsae, with the arrangement of genes being preserved in 96% of cases. The difference in size between the C. briggsae (estimated at approximately 104 Mbp) and C. elegans (100.3 Mbp) genomes is almost entirely due to repetitive sequence, which accounts for 22.4% of the C. briggsae genome in contrast to 16.5% of the C. elegans genome. Few, if any, repeat families are shared, suggesting that most were acquired after the two species diverged or are undergoing rapid evolution. Coclustering the C. elegans and C. briggsae proteins reveals 2,169 protein families of two or more members. Most of these are shared between the two species, but some appear to be expanding or contracting, and there seem to be as many as several hundred novel C. briggsae gene families. The C. briggsae draft sequence will greatly improve the annotation of the C. elegans genome. Based on similarity to C
Searching for genomic constraints

Energy Technology Data Exchange (ETDEWEB)

Lio', P. [Cambridge Univ. (United Kingdom). Genetics Dept.]; Ruffo, S. [Florence Univ. (Italy). Fac. di Ingegneria. Dipt. di Energetica 'S. Stecco']

1998-01-01

The authors have analyzed general properties of very long DNA sequences belonging to simple and complex organisms, by using different correlation methods. They have distinguished those base compositional rules that concern the entire genome, which they call 'genomic constraints', from the rules that depend on the 'external natural selection' acting on single genes, i.e. protein-centered constraints. They show that G + C content, purine / pyrimidine distributions and biological complexity of the organism are the most important factors which determine base compositional rules and genome complexity. Three main facts are here reported: bacteria with high G + C content have more restrictions on base composition than those with low G + C content; at constant G + C content more complex organisms, ranging from prokaryotes to higher eukaryotes (e.g. human) display an increase of repeats 10-20 nucleotides long, which are also partly responsible for long-range correlations; word selection of length 3 to 10 is stronger in human and in bacteria for two distinct reasons. With respect to previous studies, they have also compared the genomic sequence of the archaeon Methanococcus jannaschii with those of bacteria and eukaryotes: it sometimes shows an intermediate statistical behaviour.
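As a rough illustration of the base-composition statistics this abstract refers to (G + C content, purine/pyrimidine balance and counts of short 'words'), the sketch below computes them for a toy sequence. It is not the authors' correlation method; the function names and the example sequence are assumptions made purely for illustration.

```python
from collections import Counter

BASES = set("ACGT")


def gc_content(seq: str) -> float:
    """Fraction of unambiguous bases (A, C, G, T) that are G or C."""
    seq = seq.upper()
    total = sum(seq.count(b) for b in BASES)
    if total == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / total


def purine_fraction(seq: str) -> float:
    """Fraction of unambiguous bases that are purines (A or G)."""
    seq = seq.upper()
    total = sum(seq.count(b) for b in BASES)
    if total == 0:
        return 0.0
    return (seq.count("A") + seq.count("G")) / total


def word_counts(seq: str, k: int) -> Counter:
    """Count overlapping words of length k, skipping ambiguous bases."""
    seq = seq.upper()
    words = (seq[i:i + k] for i in range(len(seq) - k + 1))
    return Counter(w for w in words if set(w) <= BASES)


toy = "ATATATGGCC"  # hypothetical sequence, not real genomic data
print(gc_content(toy), purine_fraction(toy))
print(word_counts(toy, 3).most_common(2))
```

Comparing such word counts against the counts expected from base frequencies alone is one simple way to look for over- or under-represented words, which is the kind of signal the abstract calls word selection.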
Searching for genomic constraints

International Nuclear Information System (INIS)

Lio', P.; Ruffo, S.

1998-01-01

The authors have analyzed general properties of very long DNA sequences belonging to simple and complex organisms, by using different correlation methods. They have distinguished those base compositional rules that concern the entire genome, which they call 'genomic constraints', from the rules that depend on the 'external natural selection' acting on single genes, i.e. protein-centered constraints. They show that G + C content, purine / pyrimidine distributions and biological complexity of the organism are the most important factors which determine base compositional rules and genome complexity. Three main facts are here reported: bacteria with high G + C content have more restrictions on base composition than those with low G + C content; at constant G + C content more complex organisms, ranging from prokaryotes to higher eukaryotes (e.g. human) display an increase of repeats 10-20 nucleotides long, which are also partly responsible for long-range correlations; word selection of length 3 to 10 is stronger in human and in bacteria for two distinct reasons. With respect to previous studies, they have also compared the genomic sequence of the archaeon Methanococcus jannaschii with those of bacteria and eukaryotes: it sometimes shows an intermediate statistical behaviour.
Hazardous Chemicals

Centers for Disease Control (CDC) Podcasts

Chemicals are a part of our daily lives, providing many products and modern conveniences. With more than three decades of experience, The Centers for Disease Control and Prevention (CDC) has been in the forefront of efforts to protect and assess people's exposure to environmental and hazardous chemicals. This report provides information about hazardous chemicals and useful tips on how to protect you and your family from harmful exposure.
Journal of Chemical Sciences | Indian Academy of Sciences

Indian Academy of Sciences (India)

School of Chemical & Chemical Engineering, Yancheng Institute of Technology, Yancheng 224051, P. R. China; China-Australia Joint Research Center for Functional Molecular Materials, School of Chemical and Material Engineering, Jiangnan University, Wuxi 214122, P. R. China; China-Australia Joint Research Center ...
Using Genomics for Natural Product Structure Elucidation.

Science.gov (United States)

Tietz, Jonathan I; Mitchell, Douglas A

2016-01-01

Natural products (NPs) are the most historically bountiful source of chemical matter for drug development, especially for anti-infectives. With insights gleaned from genome mining, interest in natural product discovery has been reinvigorated. An essential stage in NP discovery is structural elucidation, which sheds light not only on the chemical composition of a molecule but also its novelty, properties, and derivatization potential. The history of structure elucidation is replete with technique-based revolutions: combustion analysis, crystallography, UV, IR, MS, and NMR have each provided game-changing advances; the latest such advance is genomics. All natural products have a genetic basis, and the ability to obtain and interpret genomic information for structure elucidation is increasingly available at low cost to non-specialists. In this review, we describe the value of genomics as a structural elucidation technique, especially from the perspective of the natural product chemist approaching an unknown metabolite. Herein we first introduce the databases and programs of interest to the natural products chemist, with an emphasis on those currently most suited for general usability. We describe strategies for linking observed natural product-linked phenotypes to their corresponding gene clusters. We then discuss techniques for extracting structural information from genes, illustrated with numerous case examples. We also provide an analysis of the biases and limitations of the field with recommendations for future development. Our overview is not only aimed at biologically-oriented researchers already at ease with bioinformatic techniques, but also, in particular, at natural product, organic, and/or medicinal chemists not previously familiar with genomic techniques.

Research study on analysis/use technologies of genome information; Genome joho kaidoku riyo gijutsu no chosa kenkyu

Energy Technology Data Exchange (ETDEWEB)

NONE

1997-03-01

For wide use of genome information in the industrial field, the required R and D was surveyed from the standpoints of biology and information science. To clarify the present state and issues of the international research on genome analysis, the genome map as well as sequence and function information are first surveyed. The current analysis/use technologies of genome information are analyzed, and the following are summarized: prediction and identification of gene regions in genome sequences, techniques for searching and selecting useful genes, and techniques for predicting the expression of gene functions and the gene-product structure and functions. It is recommended that R and D and data collection/interpretation necessary to clarify inter-gene interactions and information networks should be promoted by integrating Japanese advanced know-how and technologies. As examples of the impact of the research results on industry and society, the present state and future expected effect are summarized for medicines, diagnosis/analysis instruments, chemicals, foods, agriculture, fishery, animal husbandry, electronics, environment and information. 278 refs., 42 figs., 5 tabs.
Genomic and functional features of the biosurfactant producing Bacillus sp. AM13.

Science.gov (United States)

Shaligram, Shraddha; Kumbhare, Shreyas V; Dhotre, Dhiraj P; Muddeshwar, Manohar G; Kapley, Atya; Joseph, Neetha; Purohit, Hemant P; Shouche, Yogesh S; Pawar, Shrikant P

2016-09-01

Genomic studies provide deeper insights into secondary metabolites produced by diverse bacterial communities residing in various environmental niches. This study aims to understand the potential of a biosurfactant-producing Bacillus sp. AM13, isolated from soil. An integrated approach of genomic and chemical analysis was employed to characterize the antibacterial lipopeptide produced by the strain AM13. Genome analysis revealed that strain AM13 harbors a nonribosomal peptide synthetase (NRPS) cluster highly similar to known biosynthetic gene clusters from the surfactin family: lichenysin (85 %) and surfactin (78 %). These findings were substantiated with supplementary experiments of oil displacement assay and surface tension measurements, confirming the biosurfactant production. Further investigation using an LC-MS approach showed similarity of the biomolecule to biosurfactants of the surfactin family. Our consolidated effort of functional genomics provided chemical as well as genetic leads for understanding the biochemical characteristics of the bioactive compound.
Relationship between metabolic and genomic diversity in sesame (Sesamum indicum L.)

Directory of Open Access Journals (Sweden)

Karlovsky Petr

2008-05-01

Background: Diversity estimates in cultivated plants provide a rationale for conservation strategies and support the selection of starting material for breeding programs. Diversity measures applied to crops usually have been limited to the assessment of genome polymorphism at the DNA level. Occasionally, selected morphological features are recorded and the content of key chemical constituents determined, but unbiased and comprehensive chemical phenotypes have not been included systematically in diversity surveys. Our objective in this study was to assess metabolic diversity in sesame by nontargeted metabolic profiling and elucidate the relationship between metabolic and genome diversity in this crop. Results: Ten sesame accessions were selected that represent most of the genome diversity of sesame grown in India, Western Asia, Sudan and Venezuela based on previous AFLP studies. Ethanolic seed extracts were separated by HPLC, metabolites were ionized by positive and negative electrospray and ions were detected with an ion trap mass spectrometer in full-scan mode for m/z from 50 to 1000. Genome diversity was determined by Amplified Fragment Length Polymorphism (AFLP) using eight primer pair combinations. The relationship between biodiversity at the genome and at the metabolome levels was assessed by correlation analysis and multivariate statistics. Conclusion: Patterns of diversity at the genomic and metabolic levels differed, indicating that selection played a significant role in the evolution of metabolic diversity in sesame. This result implies that when used for the selection of genotypes in breeding and conservation, diversity assessment based on neutral DNA markers should be complemented with metabolic profiles. We hypothesize that this applies to all crops with a long history of domestication that possess commercially relevant traits affected by chemical phenotypes.
Making Personalized Health Care Even More Personalized: Insights From Activities of the IOM Genomics Roundtable.

Science.gov (United States)

David, Sean P; Johnson, Samuel G; Berger, Adam C; Feero, W Gregory; Terry, Sharon F; Green, Larry A; Phillips, Robert L; Ginsburg, Geoffrey S

2015-01-01

Genomic research has generated much new knowledge into mechanisms of human disease, with the potential to catalyze novel drug discovery and development, prenatal and neonatal screening, clinical pharmacogenomics, more sensitive risk prediction, and enhanced diagnostics. Genomic medicine, however, has been limited by critical evidence gaps, especially those related to clinical utility and applicability to diverse populations. Genomic medicine may have the greatest impact on health care if it is integrated into primary care, where most health care is received and where evidence supports the value of personalized medicine grounded in continuous healing relationships. Redesigned primary care is the most relevant setting for clinically useful genomic medicine research. Taking insights gained from the activities of the Institute of Medicine (IOM) Roundtable on Translating Genomic-Based Research for Health, we apply lessons learned from the patient-centered medical home national experience to implement genomic medicine in a patient-centered, learning health care system. © 2015 Annals of Family Medicine, Inc.
Annual report of the Institute of Physical and Chemical Research, for fiscal 1999

International Nuclear Information System (INIS)

2000-01-01

The research activities in the Institute of Physical and Chemical Research (RIKEN) for the fiscal year 1999 were briefly described in this report. In addition, the research papers published in the year from the laboratories in RIKEN Wako Main Campus, RIKEN Tsukuba Research Center of Life Science and RIKEN Harima Institute were presented. Moreover, ten special research projects for basic science are now progressing on the following themes: photosynthetic science (artificial photosynthesis and the mechanism of photosynthesis), biodesign research (cellular function system, membranous function system), coherent science research (coherent control for free electron, quantum processing, structural control and coherent molecular interaction), research on multi-bioprobes (development of multi-functional bioactive compounds), research on essential reaction (stereo-control and energy control), atomic-scale sciengineering (phase 2 study), MR science research (phase 2 study), slow quantum beam production of ultra slow highly charged ions and ecomolecular science research (material conversion and biological/chemical conversion for environmental compounds). The research activities of the RIKEN Brain Science Institute and the RIKEN Genomic Sciences Center were also outlined. In the year, RIKEN symposia were held 38 times by various laboratories. Here, the themes of these symposia were listed as well as those of international symposia sponsored by RIKEN Institute. (M.N.)
Evidence that personal genome testing enhances student learning in a course on genomics and personalized medicine.

Directory of Open Access Journals (Sweden)

Keyan Salari

An emerging debate in academic medical centers is not about the need for providing trainees with fundamental education on genomics, but rather the most effective educational models that should be deployed. At Stanford School of Medicine, a novel hands-on genomics course was developed in 2010 that provided students the option to undergo personal genome testing as part of the course curriculum. We hypothesized that use of personal genome testing in the classroom would enhance the learning experience of students. No data currently exist on how such methods impact student learning; thus, we surveyed students before and after the course to determine its impact. We analyzed responses using paired statistics from the 31 medical and graduate students who completed both pre-course and post-course surveys. Participants were stratified by those who did (N = 23) or did not (N = 8) undergo personal genome testing. In reflecting on the experience, 83% of students who underwent testing stated that they were pleased with their decision compared to 12.5% of students who decided against testing (P = 0.00058). Seventy percent of those who underwent personal genome testing self-reported a better understanding of human genetics on the basis of having undergone testing. Further, students who underwent personal genome testing demonstrated an average 31% increase in pre- to post-course scores on knowledge questions (P = 3.5×10⁻⁶); this was significantly higher (P = 0.003) than students who did not undergo testing, who showed a non-significant improvement. Undergoing personal genome testing and using personal genotype data in the classroom enhanced students' self-reported and assessed knowledge of genomics, and did not appear to cause significant anxiety. At least for self-selected students, the incorporation of personal genome testing can be an effective educational tool to teach important concepts of clinical genomic testing.
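The comparison of 83% of 23 students with 12.5% of 8 students is a two-by-two contingency problem. The abstract does not say which test produced its P value, so the following is only a hedged sketch of one standard choice, a two-sided Fisher exact test, written with the Python standard library. The counts 19 of 23 and 1 of 8 are inferred here from the rounded percentages and are an assumption, as is the function name; the result need not reproduce the reported figure.

```python
from math import comb

def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher exact P value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        # Hypergeometric probability of x "successes" in the first row.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    p_obs = prob(a)
    # Sum all tables that are no more probable than the observed one
    # (small tolerance guards against floating-point ties).
    return sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))

# Assumed counts: pleased / not pleased among tested (19, 4) and untested (1, 7).
p_value = fisher_exact_two_sided(19, 4, 1, 7)
print(f"two-sided Fisher exact P = {p_value:.2g}")
```

With groups this small an exact test is usually preferred over a chi-square approximation, which is why it was chosen for the sketch.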
Genomic dissection and prioritizing of candidate genes of QTL for ...

Indian Academy of Sciences (India)

of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN 38163, USA. 5Mudanjiang ..... Fragile X mental retardation gene 1,. −2.1 ... stimulus/stress and signalling associated with acute-phase response were .... This work was supported by the Center of Genomics and Bioinformatics and ...
Characterization of the catalytic center of the Ebola virus L polymerase.

Science.gov (United States)

Schmidt, Marie Luisa; Hoenen, Thomas

2017-10-01

Ebola virus (EBOV) causes a severe hemorrhagic fever in humans and non-human primates. While no licensed therapeutics are available, recently there has been tremendous progress in developing antivirals. Targeting the ribonucleoprotein complex (RNP) proteins, which facilitate genome replication and transcription, and particularly the polymerase L, is a promising antiviral approach since these processes are essential for the virus life cycle. However, until now little is known about L in terms of its structure and function, and in particular the catalytic center of the RNA-dependent RNA polymerase (RdRp) of L, which is one of the most promising molecular targets, has never been experimentally characterized. Using multiple sequence alignments with other negative sense single-stranded RNA viruses we identified the putative catalytic center of the EBOV RdRp. An L protein with mutations in this center was then generated and characterized using various life cycle modelling systems. These systems are based on minigenomes, i.e. miniature versions of the viral genome, in which the viral genes are exchanged against a reporter gene. When such minigenomes are coexpressed with RNP proteins in mammalian cells, the RNP proteins recognize them as authentic templates for replication and transcription, resulting in reporter activity reflecting these processes. Replication-competent minigenome systems indicated that our L catalytic domain mutant was impaired in genome replication and/or transcription, and by using replication-deficient minigenome systems, as well as a novel RT-qPCR-based genome replication assay, we showed that it indeed no longer supported either of these processes. However, it still showed similar expression to wild-type L, and retained its ability to be incorporated into inclusion bodies, which are the sites of EBOV genome replication. We have experimentally defined the catalytic center of the EBOV RdRp, and thus a promising antiviral target regulating an essential
Characterization of the catalytic center of the Ebola virus L polymerase.

Directory of Open Access Journals (Sweden)

Marie Luisa Schmidt

2017-10-01

Ebola virus (EBOV) causes a severe hemorrhagic fever in humans and non-human primates. While no licensed therapeutics are available, recently there has been tremendous progress in developing antivirals. Targeting the ribonucleoprotein complex (RNP) proteins, which facilitate genome replication and transcription, and particularly the polymerase L, is a promising antiviral approach since these processes are essential for the virus life cycle. However, until now little is known about L in terms of its structure and function, and in particular the catalytic center of the RNA-dependent RNA polymerase (RdRp) of L, which is one of the most promising molecular targets, has never been experimentally characterized. Using multiple sequence alignments with other negative sense single-stranded RNA viruses we identified the putative catalytic center of the EBOV RdRp. An L protein with mutations in this center was then generated and characterized using various life cycle modelling systems. These systems are based on minigenomes, i.e. miniature versions of the viral genome, in which the viral genes are exchanged against a reporter gene. When such minigenomes are coexpressed with RNP proteins in mammalian cells, the RNP proteins recognize them as authentic templates for replication and transcription, resulting in reporter activity reflecting these processes. Replication-competent minigenome systems indicated that our L catalytic domain mutant was impaired in genome replication and/or transcription, and by using replication-deficient minigenome systems, as well as a novel RT-qPCR-based genome replication assay, we showed that it indeed no longer supported either of these processes. However, it still showed similar expression to wild-type L, and retained its ability to be incorporated into inclusion bodies, which are the sites of EBOV genome replication. We have experimentally defined the catalytic center of the EBOV RdRp, and thus a promising antiviral target
Synthetic biology to access and expand nature’s chemical diversity

Science.gov (United States)

Smanski, Michael J.; Zhou, Hui; Claesen, Jan; Shen, Ben; Fischbach, Michael; Voigt, Christopher A.

2016-01-01

Bacterial genomes encode the biosynthetic potential to produce hundreds of thousands of complex molecules with diverse applications, from medicine to agriculture and materials. Economically accessing the potential encoded within sequenced genomes promises to reinvigorate waning drug discovery pipelines and provide novel routes to intricate chemicals. This is a tremendous undertaking, as the pathways often comprise dozens of genes spanning as much as 100+ kilobases of DNA, are controlled by complex regulatory networks, and the most interesting molecules are made by non-model organisms. Advances in synthetic biology address these issues, including DNA construction technologies, genetic parts for precision expression control, synthetic regulatory circuits, computer aided design, and multiplexed genome engineering. Collectively, these technologies are moving towards an era when chemicals can be accessed en masse based on sequence information alone. This will enable the harnessing of metagenomic data and massive strain banks for high-throughput molecular discovery and, ultimately, the ability to forward design pathways to complex chemicals not found in nature. PMID:26876034
Genome Sequence of the Freshwater Yangtze Finless Porpoise.

Science.gov (United States)

Yuan, Yuan; Zhang, Peijun; Wang, Kun; Liu, Mingzhong; Li, Jing; Zheng, Jingsong; Wang, Ding; Xu, Wenjie; Lin, Mingli; Dong, Lijun; Zhu, Chenglong; Qiu, Qiang; Li, Songhai

2018-04-16

The Yangtze finless porpoise (Neophocaena asiaeorientalis ssp. asiaeorientalis) is a subspecies of the narrow-ridged finless porpoise (N. asiaeorientalis). In total, 714.28 gigabases (Gb) of raw reads were generated by whole-genome sequencing of the Yangtze finless porpoise, using an Illumina HiSeq 2000 platform. After filtering the low-quality and duplicated reads, we assembled a draft genome of 2.22 Gb, with contig N50 and scaffold N50 values of 46.69 kilobases (kb) and 1.71 megabases (Mb), respectively. We identified 887.63 Mb of repetitive sequences and predicted 18,479 protein-coding genes in the assembled genome. The phylogenetic tree showed a relationship between the Yangtze finless porpoise and the Yangtze River dolphin, which diverged approximately 20.84 million years ago. In comparisons with the genomes of 10 other mammals, we detected 44 species-specific gene families, 164 expanded gene families, and 313 positively selected genes in the Yangtze finless porpoise genome. The assembled genome sequence and underlying sequence data are available at the National Center for Biotechnology Information under BioProject accession number PRJNA433603.
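The contig and scaffold N50 values quoted above are a standard assembly-contiguity statistic: the length L such that sequences of length L or longer contain at least half of all assembled bases. A minimal sketch of the usual definition follows, assuming a plain list of sequence lengths; the function name `n50` and the toy lengths are illustrative and are not taken from the porpoise assembly.

```python
def n50(lengths: list[int]) -> int:
    """Length L such that sequences of length >= L hold at least half of all bases."""
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise AssertionError("unreachable: cumulative sum always reaches half the total")

# Toy contig lengths in kb (hypothetical).
contigs = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(contigs))  # → 8
```

Because N50 depends only on the length distribution, it says nothing about assembly correctness, so it is normally reported alongside other measures such as gene-content completeness.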
NUMERICAL ALGORITHMS AT NON-ZERO CHEMICAL POTENTIAL. PROCEEDINGS OF RIKEN BNL RESEARCH CENTER WORKSHOP, VOLUME 19

International Nuclear Information System (INIS)

Blum, T.; Creutz, M.

1999-01-01

The RIKEN BNL Research Center hosted its 19th workshop April 27th through May 1, 1999. The topic was Numerical Algorithms at Non-Zero Chemical Potential. QCD at a non-zero chemical potential (non-zero density) poses a long-standing unsolved challenge for lattice gauge theory. Indeed, it is the primary unresolved issue in the fundamental formulation of lattice gauge theory. The chemical potential renders conventional lattice actions complex, practically excluding the usual Monte Carlo techniques which rely on a positive definite measure for the partition function. This "sign" problem appears in a wide range of physical systems, ranging from strongly coupled electronic systems to QCD. The lack of a viable numerical technique at non-zero density is particularly acute since new exotic "color superconducting" phases of quark matter have recently been predicted in model calculations. A first principles confirmation of the phase diagram is desirable since experimental verification is not expected soon. At the workshop several proposals for new algorithms were made: cluster algorithms, direct simulation of Grassmann variables, and a bosonization of the fermion determinant. All generated considerable discussion and seem worthy of continued investigation. Several interesting results using conventional algorithms were also presented: condensates in four fermion models, SU(2) gauge theory in fundamental and adjoint representations, and lessons learned from strong coupling, non-zero temperature and heavy quarks applied to non-zero density simulations.
Gene calling and bacterial genome annotation with BG7.

Science.gov (United States)

Tobes, Raquel; Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Kovach, Evdokim; Alekhin, Alexey; Pareja, Eduardo

2015-01-01

New massive sequencing technologies are providing many bacterial genome sequences from diverse taxa but a refined annotation of these genomes is crucial for obtaining scientific findings and new knowledge. Thus, bacterial genome annotation has emerged as a key point to investigate in bacteria. Any efficient tool designed specifically to annotate bacterial genomes sequenced with massively parallel technologies has to consider the specific features of bacterial genomes (absence of introns and scarcity of nonprotein-coding sequence) and of next-generation sequencing (NGS) technologies (presence of errors and not perfectly assembled genomes). These features make it convenient to focus on coding regions and, hence, on protein sequences that are the elements directly related to biological functions. In this chapter we describe how to annotate bacterial genomes with BG7, an open-source tool based on a protein-centered gene calling/annotation paradigm. BG7 is specifically designed for the annotation of bacterial genomes sequenced with NGS. This tool is tolerant of sequence errors and maintains its capabilities when annotating highly fragmented genomes or mixed sequences coming from several genomes (such as those obtained from metagenomic samples). BG7 has been designed with scalability as a requirement, with a computing infrastructure completely based on cloud computing (Amazon Web Services).
UCLA's Molecular Screening Shared Resource: enhancing small molecule discovery with functional genomics and new technology.

Science.gov (United States)

Damoiseaux, Robert

2014-05-01

The Molecular Screening Shared Resource (MSSR) offers a comprehensive range of leading-edge high throughput screening (HTS) services including drug discovery, chemical and functional genomics, and novel methods for nano and environmental toxicology. The MSSR is an open access environment with investigators from UCLA as well as from the entire globe. Industrial clients are equally welcome as are non-profit entities. The MSSR is a fee-for-service entity and does not retain intellectual property. In conjunction with the Center for Environmental Implications of Nanotechnology, the MSSR is unique in its dedicated and ongoing efforts towards high throughput toxicity testing of nanomaterials. In addition, the MSSR engages in technology development eliminating bottlenecks from the HTS workflow and enabling novel assays and readouts currently not available.
Chemical Genomics and Emerging DNA Technologies in the Identification of Drug Mechanisms and Drug Targets

DEFF Research Database (Denmark)

Olsen, Louise Cathrine Braun; Færgeman, Nils J.

2012-01-01

and validate therapeutic targets and to discover drug candidates for rapidly and effectively generating new interventions for human diseases. The recent emergence of genomic technologies and their application on genetically tractable model organisms like Drosophila melanogaster, Caenorhabditis elegans...... critical roles in the genomic age of biological research and drug discovery. In the present review we discuss how simple biological model organisms can be used as screening platforms in combination with emerging genomic technologies to advance the identification of potential drugs and their molecular...
MIPS: a database for genomes and protein sequences.

Science.gov (United States)

Mewes, H W; Frishman, D; Güldener, U; Mannhaupt, G; Mayer, K; Mokrejs, M; Morgenstern, B; Münsterkötter, M; Rudd, S; Weil, B

2002-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) continues to provide genome-related information in a systematic way. MIPS supports both national and European sequencing and functional analysis projects, develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the databases for the comprehensive set of genomes (PEDANT genomes), the database of annotated human EST clusters (HIB), the database of complete cDNAs from the DHGP (German Human Genome Project), as well as the project specific databases for the GABI (Genome Analysis in Plants) and HNB (Helmholtz-Netzwerk Bioinformatik) networks. The Arabidopsis thaliana database (MATDB), the database of mitochondrial proteins (MITOP) and our contribution to the PIR International Protein Sequence Database have been described elsewhere [Schoof et al. (2002) Nucleic Acids Res., 30, 91-93; Scharfe et al. (2000) Nucleic Acids Res., 28, 155-158; Barker et al. (2001) Nucleic Acids Res., 29, 29-32]. All databases described, the protein analysis tools provided and the detailed descriptions of our projects can be accessed through the MIPS World Wide Web server (http://mips.gsf.de).
Natural Product Biosynthetic Diversity and Comparative Genomics of the Cyanobacteria.

Science.gov (United States)

Dittmann, Elke; Gugger, Muriel; Sivonen, Kaarina; Fewer, David P

2015-10-01

Cyanobacteria are an ancient lineage of slow-growing photosynthetic bacteria and a prolific source of natural products with intricate chemical structures and potent biological activities. The bulk of these natural products are known from just a handful of genera. Recent efforts have elucidated the mechanisms underpinning the biosynthesis of a diverse array of natural products from cyanobacteria. Many of the biosynthetic mechanisms are unique to cyanobacteria or rarely described from other organisms. Advances in genome sequence technology have precipitated a deluge of genome sequences for cyanobacteria. This makes it possible to link known natural products to biosynthetic gene clusters but also accelerates the discovery of new natural products through genome mining. These studies demonstrate that cyanobacteria encode a huge variety of cryptic gene clusters for the production of natural products, and the known chemical diversity is likely to be just a fraction of the true biosynthetic capabilities of this fascinating and ancient group of organisms. Copyright © 2015. Published by Elsevier Ltd.
Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing.

Science.gov (United States)

Straub, Shannon C K; Fishbein, Mark; Livshultz, Tatyana; Foster, Zachary; Parks, Matthew; Weitemier, Kevin; Cronn, Richard C; Liston, Aaron

2011-05-04

step in the development of a community resource for further study of plant-insect co-evolution, anti-herbivore defense, floral developmental genetics, reproductive biology, chemical evolution, population genetics, and comparative genomics using milkweeds, and A. syriaca in particular, as ecological and evolutionary models.
Marine Bacterial Genomics

DEFF Research Database (Denmark)

Machado, Henrique

For decades, terrestrial microorganisms have been used as sources of countless enzymes and chemical compounds that have been produced by pharmaceutical and biotech companies and used by mankind. There is a need for new chemical compounds, including antibiotics, new enzymatic activities and new...... microorganisms to be used as cell factories for production. Therefore exploitation of new microbial niches and use of different strategies is an opportunity to boost discoveries. Even though scientists have started to explore several habitats other than the terrestrial ones, the marine environment stands out...... as a hitherto under-explored niche. This thesis work uses high-throughput sequencing technologies on a collection of marine bacteria established during the Galathea 3 expedition, with the purpose of unraveling new biodiversity and new bioactivities. Several tools were used for genomic analysis in order...
Lessons from a phenotyping center revealed by the genome-guided mapping of powdery mildew resistance loci

Science.gov (United States)

The genomics era brought unprecedented tools for genetic analysis of host resistance, but careful attention is needed on obtaining accurate and reproducible phenotypes so that genomic results appropriately reflect biology. Phenotyping host resistance by natural infection in the field can produce var...

Empowering Mayo Clinic Individualized Medicine with Genomic Data Warehousing

Directory of Open Access Journals (Sweden)

Iain Horton

2017-08-01

Individualized medicine enables better diagnoses and treatment decisions for patients and promotes research in understanding the molecular underpinnings of disease. Linking individual patient’s genomic and molecular information with their clinical phenotypes is crucial to these efforts. To address this need, the Center for Individualized Medicine at Mayo Clinic has implemented a genomic data warehouse and a workflow management system to bring data from institutional electronic health records and genomic sequencing data from both clinical and research bioinformatics sources into the warehouse. The system is the foundation for Mayo Clinic to build a suite of tools and interfaces to support various clinical and research use cases. The genomic data warehouse is positioned to play a key role in enhancing the research capabilities and advancing individualized patient care at Mayo Clinic.
Empowering Mayo Clinic Individualized Medicine with Genomic Data Warehousing.

Science.gov (United States)

Horton, Iain; Lin, Yaxiong; Reed, Gay; Wiepert, Mathieu; Hart, Steven

2017-08-22

Individualized medicine enables better diagnoses and treatment decisions for patients and promotes research in understanding the molecular underpinnings of disease. Linking individual patient's genomic and molecular information with their clinical phenotypes is crucial to these efforts. To address this need, the Center for Individualized Medicine at Mayo Clinic has implemented a genomic data warehouse and a workflow management system to bring data from institutional electronic health records and genomic sequencing data from both clinical and research bioinformatics sources into the warehouse. The system is the foundation for Mayo Clinic to build a suite of tools and interfaces to support various clinical and research use cases. The genomic data warehouse is positioned to play a key role in enhancing the research capabilities and advancing individualized patient care at Mayo Clinic.
Chemical and radiation mutagenesis: Induction and detection by whole genome sequencing

Science.gov (United States)

Brachypodium distachyon has emerged as an effective model system to address fundamental questions in grass biology. With its small sequenced genome, short generation time and rapidly expanding array of genetic tools B. distachyon is an ideal system to elucidate the molecular basis of important trai...
EERC Center for Biomass Utilization 2006

Energy Technology Data Exchange (ETDEWEB)

Zygarlicke, Christopher J. [Univ. of North Dakota, Grand Forks, ND (United States). Energy and Environmental Research Center; Hurley, John P. [Univ. of North Dakota, Grand Forks, ND (United States). Energy and Environmental Research Center; Aulich, Ted R. [Univ. of North Dakota, Grand Forks, ND (United States). Energy and Environmental Research Center; Folkedahl, Bruce C. [Univ. of North Dakota, Grand Forks, ND (United States). Energy and Environmental Research Center; Strege, Joshua R. [Univ. of North Dakota, Grand Forks, ND (United States). Energy and Environmental Research Center; Patel, Nikhil [Univ. of North Dakota, Grand Forks, ND (United States). Energy and Environmental Research Center; Shockey, Richard E. [Univ. of North Dakota, Grand Forks, ND (United States). Energy and Environmental Research Center

2009-05-27

The Center for Biomass Utilization® 2006 project at the Energy & Environmental Research Center (EERC) consisted of three tasks related to applied fundamental research focused on converting biomass feedstocks to energy, liquid transportation fuels, and chemicals. Task 1, entitled Thermochemical Conversion of Biomass to Syngas and Chemical Feedstocks, involved three activities. Task 2, entitled Crop Oil Biorefinery Process Development, involved four activities. Task 3, entitled Management, Education, and Outreach, focused on overall project management and providing educational outreach related to biomass technologies through workshops and conferences.
The genomic applications in practice and prevention network.

Science.gov (United States)

Khoury, Muin J; Feero, W Gregory; Reyes, Michele; Citrin, Toby; Freedman, Andrew; Leonard, Debra; Burke, Wylie; Coates, Ralph; Croyle, Robert T; Edwards, Karen; Kardia, Sharon; McBride, Colleen; Manolio, Teri; Randhawa, Gurvaneet; Rasooly, Rebekah; St Pierre, Jeannette; Terry, Sharon

2009-07-01

The authors describe the rationale and initial development of a new collaborative initiative, the Genomic Applications in Practice and Prevention Network. The network convened by the Centers for Disease Control and Prevention and the National Institutes of Health includes multiple stakeholders from academia, government, health care, public health, industry and consumers. The premise of Genomic Applications in Practice and Prevention Network is that there is an unaddressed chasm between gene discoveries and demonstration of their clinical validity and utility. This chasm is due to the lack of readily accessible information about the utility of most genomic applications and the lack of necessary knowledge by consumers and providers to implement what is known. The mission of Genomic Applications in Practice and Prevention Network is to accelerate and streamline the effective integration of validated genomic knowledge into the practice of medicine and public health, by empowering and sponsoring research, evaluating research findings, and disseminating high quality information on candidate genomic applications in practice and prevention. Genomic Applications in Practice and Prevention Network will develop a process that links ongoing collection of information on candidate genomic applications to four crucial domains: (1) knowledge synthesis and dissemination for new and existing technologies, and the identification of knowledge gaps, (2) a robust evidence-based recommendation development process, (3) translation research to evaluate validity, utility and impact in the real world and how to disseminate and implement recommended genomic applications, and (4) programs to enhance practice, education, and surveillance.
Fungal genome resources at NCBI

Science.gov (United States)

Robbertse, B.; Tatusova, T.

2011-01-01

The National Center for Biotechnology Information (NCBI) is well known for the nucleotide sequence archive, GenBank, and the sequence analysis tool BLAST. However, NCBI integrates many types of biomolecular data from a variety of sources and makes them available to the scientific community as interactive web resources as well as organized releases of bulk data. These tools are available to explore and compare fungal genomes. Searching all databases with Fungi [organism] at http://www.ncbi.nlm.nih.gov/ is the quickest way to find resources of interest with fungal entries. Some tools, though, are resource-specific and can be indirectly accessed from a particular database in the Entrez system. These include graphical viewers and comparative analysis tools such as TaxPlot, TaxMap and UniGene DDD (found via UniGene Homepage). Gene and BioProject pages also serve as portals to external data such as community annotation websites, BioGrid and UniProt. There are many different ways of accessing genomic data at NCBI. Depending on the focus and goal of research projects or the level of interest, a user would select a particular route for accessing genomic databases and resources. This review article describes methods of accessing fungal genome data and provides examples that illustrate the use of analysis tools. PMID:22737589
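The "Fungi [organism]" search mentioned in this abstract can also be expressed as a programmatic query. The following is only a minimal sketch, not something the review itself describes: it assumes the NCBI E-utilities ESearch endpoint, and the function name `build_fungi_search_url`, the default database `genome`, and the `retmax` value are illustrative choices. It only builds the URL and performs no network request.

```python
from urllib.parse import urlencode

# ESearch endpoint of NCBI E-utilities (assumed here; the abstract only
# mentions the interactive search on the NCBI home page).
EUTILS_ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_fungi_search_url(db: str = "genome", retmax: int = 20) -> str:
    """Return an ESearch URL restricted to the organism field 'Fungi'.

    `db` and `retmax` are hypothetical defaults chosen for illustration.
    """
    params = {"db": db, "term": "Fungi[organism]", "retmax": retmax}
    # urlencode percent-encodes the square brackets in the search term.
    return f"{EUTILS_ESEARCH}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_fungi_search_url())
```

Passing a different Entrez database name as `db` would point the same organism filter at another resource; which databases are most useful for a given project is left to the reader.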
Genome editing in pluripotent stem cells: research and therapeutic applications

Energy Technology Data Exchange (ETDEWEB)

Deleidi, Michela, E-mail: michela.deleidi@dzne.de [German Center for Neurodegenerative Diseases (DZNE) Tübingen within the Helmholtz Association, Tübingen (Germany); Hertie Institute for Clinical Brain Research, University of Tübingen (Germany); Yu, Cong [Department of Microbiology and Immunology, School of Medicine and Biomedical Sciences, University at Buffalo, New York (United States)

2016-05-06

Recent progress in human pluripotent stem cell (hPSC) and genome editing technologies has opened up new avenues for the investigation of human biology in health and disease as well as the development of therapeutic applications. Gene editing approaches with programmable nucleases have been successfully established in hPSCs and applied to study gene function, develop novel animal models and perform genetic and chemical screens. Several studies now show the successful editing of disease-linked alleles in somatic and patient-derived induced pluripotent stem cells (iPSCs) as well as in animal models. Importantly, initial clinical trials have shown the safety of programmable nucleases for ex vivo somatic gene therapy. In this context, the unlimited proliferation potential and the pluripotent properties of iPSCs may offer advantages for gene targeting approaches. However, many technical and safety issues still need to be addressed before genome-edited iPSCs are translated into the clinical setting. Here, we provide an overview of the available genome editing systems and discuss opportunities and perspectives for their application in basic research and clinical practice, with a particular focus on hPSC based research and gene therapy approaches. Finally, we discuss recent research on human germline genome editing and its social and ethical implications. - Highlights: • Programmable nucleases have proven efficient and specific for genome editing in human pluripotent stem cells (hPSCs). • Genome edited hPSCs can be employed to study gene function in health and disease as well as drug and chemical screens. • Genome edited hPSCs hold great promise for ex vivo gene therapy approaches. • Technical and safety issues should be first addressed to advance the clinical use of gene-edited hPSCs.
Genome editing in pluripotent stem cells: research and therapeutic applications

International Nuclear Information System (INIS)

Deleidi, Michela; Yu, Cong

2016-01-01

Recent progress in human pluripotent stem cell (hPSC) and genome editing technologies has opened up new avenues for the investigation of human biology in health and disease as well as the development of therapeutic applications. Gene editing approaches with programmable nucleases have been successfully established in hPSCs and applied to study gene function, develop novel animal models and perform genetic and chemical screens. Several studies now show the successful editing of disease-linked alleles in somatic and patient-derived induced pluripotent stem cells (iPSCs) as well as in animal models. Importantly, initial clinical trials have shown the safety of programmable nucleases for ex vivo somatic gene therapy. In this context, the unlimited proliferation potential and the pluripotent properties of iPSCs may offer advantages for gene targeting approaches. However, many technical and safety issues still need to be addressed before genome-edited iPSCs are translated into the clinical setting. Here, we provide an overview of the available genome editing systems and discuss opportunities and perspectives for their application in basic research and clinical practice, with a particular focus on hPSC based research and gene therapy approaches. Finally, we discuss recent research on human germline genome editing and its social and ethical implications. - Highlights: • Programmable nucleases have proven efficient and specific for genome editing in human pluripotent stem cells (hPSCs). • Genome edited hPSCs can be employed to study gene function in health and disease as well as drug and chemical screens. • Genome edited hPSCs hold great promise for ex vivo gene therapy approaches. • Technical and safety issues should be first addressed to advance the clinical use of gene-edited hPSCs.
Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing

Directory of Open Access Journals (Sweden)

Weitemier Kevin

2011-05-01

and its relatives. This study represents a first step in the development of a community resource for further study of plant-insect co-evolution, anti-herbivore defense, floral developmental genetics, reproductive biology, chemical evolution, population genetics, and comparative genomics using milkweeds, and A. syriaca in particular, as ecological and evolutionary models.
Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines

NARCIS (Netherlands)

Ellrott, Kyle; Bailey, Matthew H.; Saksena, Gordon; Covington, Kyle R.; Kandoth, Cyriac; Stewart, Chip; Hess, Julian; Ma, Singer; Chiotti, Kami E.; McLellan, Michael; Sofia, Heidi J.; Hutter, Carolyn M.; Getz, Gad; Wheeler, David A.; Ding, Li; Caesar-Johnson, Samantha J.; Demchok, John A.; Felau, Ina; Kasapi, Melpomeni; Ferguson, Martin L.; Hutter, Carolyn M.; Sofia, Heidi J.; Tarnuzzer, Roy; Wang, Zhining; Yang, Liming; Zenklusen, Jean C.; Zhang, Jiashan (Julia); Chudamani, Sudha; Liu, Jia; Lolla, Laxmi; Naresh, Rashi; Pihl, Todd; Sun, Qiang; Wan, Yunhu; Wu, Ye; Cho, Juok; DeFreitas, Timothy; Frazer, Scott; Gehlenborg, Nils; Getz, Gad; Heiman, David I.; Kim, Jaegil; Lawrence, Michael S.; Lin, Pei; Meier, Sam; Noble, Michael S.; Saksena, Gordon; Voet, Doug; Zhang, Hailei; Bernard, Brady; Chambwe, Nyasha; Dhankani, Varsha; Knijnenburg, Theo; Kramer, Roger; Leinonen, Kalle; Liu, Yuexin; Miller, Michael; Reynolds, Sheila; Shmulevich, Ilya; Thorsson, Vesteinn; Zhang, Wei; Akbani, Rehan; Broom, Bradley M.; Hegde, Apurva M.; Ju, Zhenlin; Kanchi, Rupa S.; Korkut, Anil; Li, Jun; Liang, Han; Ling, Shiyun; Liu, Wenbin; Lu, Yiling; Mills, Gordon B.; Ng, Kwok Shing; Rao, Arvind; Ryan, Michael; Wang, Jing; Weinstein, John N.; Zhang, Jiexin; Abeshouse, Adam; Armenia, Joshua; Chakravarty, Debyani; Chatila, Walid K.; de Bruijn, Ino; Gao, Jianjiong; Gross, Benjamin E.; Heins, Zachary J.; Kundra, Ritika; La, Konnor; Ladanyi, Marc; Luna, Augustin; Nissan, Moriah G.; Ochoa, Angelica; Phillips, Sarah M.; Reznik, Ed; Sanchez-Vega, Francisco; Sander, Chris; Schultz, Nikolaus; Sheridan, Robert; Sumer, S. Onur; Sun, Yichao; Taylor, Barry S.; Wang, Jioajiao; Zhang, Hongxin; Anur, Pavana; Peto, Myron; Spellman, Paul; Benz, Christopher; Stuart, Joshua M.; Wong, Christopher K.; Yau, Christina; Hayes, D. 
Neil; Wilkerson, Matthew D.; Ally, Adrian; Balasundaram, Miruna; Bowlby, Reanne; Brooks, Denise; Carlsen, Rebecca; Chuah, Eric; Dhalla, Noreen; Holt, Robert; Jones, Steven J.M.; Kasaian, Katayoon; Lee, Darlene; Ma, Yussanne; Marra, Marco A.; Mayo, Michael; Moore, Richard A.; Mungall, Andrew J.; Mungall, Karen; Robertson, A. Gordon; Sadeghi, Sara; Schein, Jacqueline E.; Sipahimalani, Payal; Tam, Angela; Thiessen, Nina; Tse, Kane; Wong, Tina; Berger, Ashton C.; Beroukhim, Rameen; Cherniack, Andrew D.; Cibulskis, Carrie; Gabriel, Stacey B.; Gao, Galen F.; Ha, Gavin; Meyerson, Matthew; Schumacher, Steven E.; Shih, Juliann; Kucherlapati, Melanie H.; Kucherlapati, Raju S.; Baylin, Stephen; Cope, Leslie; Danilova, Ludmila; Bootwalla, Moiz S.; Lai, Phillip H.; Maglinte, Dennis T.; Van Den Berg, David J.; Weisenberger, Daniel J.; Auman, J. Todd; Balu, Saianand; Bodenheimer, Tom; Fan, Cheng; Hoadley, Katherine A.; Hoyle, Alan P.; Jefferys, Stuart R.; Jones, Corbin D.; Meng, Shaowu; Mieczkowski, Piotr A.; Mose, Lisle E.; Perou, Amy H.; Perou, Charles M.; Roach, Jeffrey; Shi, Yan; Simons, Janae V.; Skelly, Tara; Soloway, Matthew G.; Tan, Donghui; Veluvolu, Umadevi; Fan, Huihui; Hinoue, Toshinori; Laird, Peter W.; Shen, Hui; Zhou, Wanding; Bellair, Michelle; Chang, Kyle; Covington, Kyle; Creighton, Chad J.; Dinh, Huyen; Doddapaneni, Harsha Vardhan; Donehower, Lawrence A.; Drummond, Jennifer; Gibbs, Richard A.; Glenn, Robert; Hale, Walker; Han, Yi; Hu, Jianhong; Korchina, Viktoriya; Lee, Sandra; Lewis, Lora; Li, Wei; Liu, Xiuping; Morgan, Margaret; Morton, Donna; Muzny, Donna; Santibanez, Jireh; Sheth, Margi; Shinbrot, Eve; Wang, Linghua; Wang, Min; Wheeler, David A.; Xi, Liu; Zhao, Fengmei; Hess, Julian; Appelbaum, Elizabeth L.; Bailey, Matthew; Cordes, Matthew G.; Ding, Li; Fronick, Catrina C.; Fulton, Lucinda A.; Fulton, Robert S.; Kandoth, Cyriac; Mardis, Elaine R.; McLellan, Michael D.; Miller, Christopher A.; Schmidt, Heather K.; Wilson, Richard K.; Crain, Daniel; Curley, 
Erin; Gardner, Johanna; Lau, Kevin; Mallery, David; Morris, Scott; Paulauskis, Joseph; Penny, Robert; Shelton, Candace; Shelton, Troy; Sherman, Mark; Thompson, Eric; Yena, Peggy; Bowen, Jay; Gastier-Foster, Julie M.; Gerken, Mark; Leraas, Kristen M.; Lichtenberg, Tara M.; Ramirez, Nilsa C.; Wise, Lisa; Zmuda, Erik; Corcoran, Niall; Costello, Tony; Hovens, Christopher; Carvalho, Andre L.; de Carvalho, Ana C.; Fregnani, José H.; Longatto-Filho, Adhemar; Reis, Rui M.; Scapulatempo-Neto, Cristovam; Silveira, Henrique C.S.; Vidal, Daniel O.; Burnette, Andrew; Eschbacher, Jennifer; Hermes, Beth; Noss, Ardene; Singh, Rosy; Anderson, Matthew L.; Castro, Patricia D.; Ittmann, Michael; Huntsman, David; Kohl, Bernard; Le, Xuan; Thorp, Richard; Andry, Chris; Duffy, Elizabeth R.; Lyadov, Vladimir; Paklina, Oxana; Setdikova, Galiya; Shabunin, Alexey; Tavobilov, Mikhail; McPherson, Christopher; Warnick, Ronald; Berkowitz, Ross; Cramer, Daniel; Feltmate, Colleen; Horowitz, Neil; Kibel, Adam; Muto, Michael; Raut, Chandrajit P.; Malykh, Andrei; Barnholtz-Sloan, Jill S.; Barrett, Wendi; Devine, Karen; Fulop, Jordonna; Ostrom, Quinn T.; Shimmel, Kristen; Wolinsky, Yingli; Sloan, Andrew E.; De Rose, Agostino; Giuliante, Felice; Goodman, Marc; Karlan, Beth Y.; Hagedorn, Curt H.; Eckman, John; Harr, Jodi; Myers, Jerome; Tucker, Kelinda; Zach, Leigh Anne; Deyarmin, Brenda; Hu, Hai; Kvecher, Leonid; Larson, Caroline; Mural, Richard J.; Somiari, Stella; Vicha, Ales; Zelinka, Tomas; Bennett, Joseph; Iacocca, Mary; Rabeno, Brenda; Swanson, Patricia; Latour, Mathieu; Lacombe, Louis; Têtu, Bernard; Bergeron, Alain; McGraw, Mary; Staugaitis, Susan M.; Chabot, John; Hibshoosh, Hanina; Sepulveda, Antonia; Su, Tao; Wang, Timothy; Potapova, Olga; Voronina, Olga; Desjardins, Laurence; Mariani, Odette; Roman-Roman, Sergio; Sastre, Xavier; Stern, Marc Henri; Cheng, Feixiong; Signoretti, Sabina; Berchuck, Andrew; Bigner, Darell; Lipp, Eric; Marks, Jeffrey; McCall, Shannon; McLendon, Roger; Secord, 
Angeles; Sharp, Alexis; Behera, Madhusmita; Brat, Daniel J.; Chen, Amy; Delman, Keith; Force, Seth; Khuri, Fadlo; Magliocca, Kelly; Maithel, Shishir; Olson, Jeffrey J.; Owonikoko, Taofeek; Pickens, Alan; Ramalingam, Suresh; Shin, Dong M.; Sica, Gabriel; Van Meir, Erwin G.; Zhang, Hongzheng; Eijckenboom, Wil; Gillis, Ad; Korpershoek, Esther; Looijenga, Leendert; Oosterhuis, Wolter; Stoop, Hans; van Kessel, Kim E.; Zwarthoff, Ellen C.; Calatozzolo, Chiara; Cuppini, Lucia; Cuzzubbo, Stefania; DiMeco, Francesco; Finocchiaro, Gaetano; Mattei, Luca; Perin, Alessandro; Pollo, Bianca; Chen, Chu; Houck, John; Lohavanichbutr, Pawadee; Hartmann, Arndt; Stoehr, Christine; Stoehr, Robert; Taubert, Helge; Wach, Sven; Wullich, Bernd; Kycler, Witold; Murawa, Dawid; Wiznerowicz, Maciej; Chung, Ki; Edenfield, W. Jeffrey; Martin, Julie; Baudin, Eric; Bubley, Glenn; Bueno, Raphael; De Rienzo, Assunta; Richards, William G.; Kalkanis, Steven; Mikkelsen, Tom; Noushmehr, Houtan; Scarpace, Lisa; Girard, Nicolas; Aymerich, Marta; Campo, Elias; Giné, Eva; Guillermo, Armando López; Van Bang, Nguyen; Hanh, Phan Thi; Phu, Bui Duc; Tang, Yufang; Colman, Howard; Evason, Kimberley; Dottino, Peter R.; Martignetti, John A.; Gabra, Hani; Juhl, Hartmut; Akeredolu, Teniola; Stepa, Serghei; Hoon, Dave; Ahn, Keunsoo; Kang, Koo Jeong; Beuschlein, Felix; Breggia, Anne; Birrer, Michael; Bell, Debra; Borad, Mitesh; Bryce, Alan H.; Castle, Erik; Chandan, Vishal; Cheville, John; Copland, John A.; Farnell, Michael; Flotte, Thomas; Giama, Nasra; Ho, Thai; Kendrick, Michael; Kocher, Jean Pierre; Kopp, Karla; Moser, Catherine; Nagorney, David; O'Brien, Daniel; O'Neill, Brian Patrick; Patel, Tushar; Petersen, Gloria; Que, Florencia; Rivera, Michael; Roberts, Lewis; Smallridge, Robert; Smyrk, Thomas; Stanton, Melissa; Thompson, R. 
Houston; Torbenson, Michael; Yang, Ju Dong; Zhang, Lizhi; Brimo, Fadi; Ajani, Jaffer A.; Angulo Gonzalez, Ana Maria; Behrens, Carmen; Bondaruk, Jolanta; Broaddus, Russell; Czerniak, Bogdan; Esmaeli, Bita; Fujimoto, Junya; Gershenwald, Jeffrey; Guo, Charles; Lazar, Alexander J.; Logothetis, Christopher; Meric-Bernstam, Funda; Moran, Cesar; Ramondetta, Lois; Rice, David; Sood, Anil; Tamboli, Pheroze; Thompson, Timothy; Troncoso, Patricia; Tsao, Anne; Wistuba, Ignacio; Carter, Candace; Haydu, Lauren; Hersey, Peter; Jakrot, Valerie; Kakavand, Hojabr; Kefford, Richard; Lee, Kenneth; Long, Georgina; Mann, Graham; Quinn, Michael; Saw, Robyn; Scolyer, Richard; Shannon, Kerwin; Spillane, Andrew; Stretch, Jonathan; Synott, Maria; Thompson, John; Wilmott, James; Al-Ahmadie, Hikmat; Chan, Timothy A.; Ghossein, Ronald; Gopalan, Anuradha; Levine, Douglas A.; Reuter, Victor; Singer, Samuel; Singh, Bhuvanesh; Tien, Nguyen Viet; Broudy, Thomas; Mirsaidi, Cyrus; Nair, Praveen; Drwiega, Paul; Miller, Judy; Smith, Jennifer; Zaren, Howard; Park, Joong Won; Hung, Nguyen Phi; Kebebew, Electron; Linehan, W. 
Marston; Metwalli, Adam R.; Pacak, Karel; Pinto, Peter A.; Schiffman, Mark; Schmidt, Laura S.; Vocke, Cathy D.; Wentzensen, Nicolas; Worrell, Robert; Yang, Hannah; Moncrieff, Marc; Goparaju, Chandra; Melamed, Jonathan; Pass, Harvey; Botnariuc, Natalia; Caraman, Irina; Cernat, Mircea; Chemencedji, Inga; Clipca, Adrian; Doruc, Serghei; Gorincioi, Ghenadie; Mura, Sergiu; Pirtac, Maria; Stancul, Irina; Tcaciuc, Diana; Albert, Monique; Alexopoulou, Iakovina; Arnaout, Angel; Bartlett, John; Engel, Jay; Gilbert, Sebastien; Parfitt, Jeremy; Sekhon, Harman; Thomas, George; Rassl, Doris M.; Rintoul, Robert C.; Bifulco, Carlo; Tamakawa, Raina; Urba, Walter; Hayward, Nicholas; Timmers, Henri; Antenucci, Anna; Facciolo, Francesco; Grazi, Gianluca; Marino, Mirella; Merola, Roberta; de Krijger, Ronald; Gimenez-Roqueplo, Anne Paule; Piché, Alain; Chevalier, Simone; McKercher, Ginette; Birsoy, Kivanc; Barnett, Gene; Brewer, Cathy; Farver, Carol; Naska, Theresa; Pennell, Nathan A.; Raymond, Daniel; Schilero, Cathy; Smolenski, Kathy; Williams, Felicia; Morrison, Carl; Borgia, Jeffrey A.; Liptay, Michael J.; Pool, Mark; Seder, Christopher W.; Junker, Kerstin; Omberg, Larsson; Dinkin, Mikhail; Manikhas, George; Alvaro, Domenico; Bragazzi, Maria Consiglia; Cardinale, Vincenzo; Carpino, Guido; Gaudio, Eugenio; Chesla, David; Cottingham, Sandra; Dubina, Michael; Moiseenko, Fedor; Dhanasekaran, Renumathy; Becker, Karl Friedrich; Janssen, Klaus Peter; Slotta-Huspenina, Julia; Abdel-Rahman, Mohamed H.; Aziz, Dina; Bell, Sue; Cebulla, Colleen M.; Davis, Amy; Duell, Rebecca; Elder, J. 
Bradley; Hilty, Joe; Kumar, Bahavna; Lang, James; Lehman, Norman L.; Mandt, Randy; Nguyen, Phuong; Pilarski, Robert; Rai, Karan; Schoenfield, Lynn; Senecal, Kelly; Wakely, Paul; Hansen, Paul; Lechan, Ronald; Powers, James; Tischler, Arthur; Grizzle, William E.; Sexton, Katherine C.; Kastl, Alison; Henderson, Joel; Porten, Sima; Waldmann, Jens; Fassnacht, Martin; Asa, Sylvia L.; Schadendorf, Dirk; Couce, Marta; Graefen, Markus; Huland, Hartwig; Sauter, Guido; Schlomm, Thorsten; Simon, Ronald; Tennstedt, Pierre; Olabode, Oluwole; Nelson, Mark; Bathe, Oliver; Carroll, Peter R.; Chan, June M.; Disaia, Philip; Glenn, Pat; Kelley, Robin K.; Landen, Charles N.; Phillips, Joanna; Prados, Michael; Simko, Jeffry; Smith-McCune, Karen; VandenBerg, Scott; Roggin, Kevin; Fehrenbach, Ashley; Kendler, Ady; Sifri, Suzanne; Steele, Ruth; Jimeno, Antonio; Carey, Francis; Forgie, Ian; Mannelli, Massimo; Carney, Michael; Hernandez, Brenda; Campos, Benito; Herold-Mende, Christel; Jungk, Christin; Unterberg, Andreas; von Deimling, Andreas; Bossler, Aaron; Galbraith, Joseph; Jacobus, Laura; Knudson, Michael; Knutson, Tina; Ma, Deqin; Milhem, Mohammed; Sigmund, Rita; Godwin, Andrew K.; Madan, Rashna; Rosenthal, Howard G.; Adebamowo, Clement; Adebamowo, Sally N.; Boussioutas, Alex; Beer, David; Giordano, Thomas; Mes-Masson, Anne Marie; Saad, Fred; Bocklage, Therese; Landrum, Lisa; Mannel, Robert; Moore, Kathleen; Moxley, Katherine; Postier, Russel; Walker, Joan; Zuna, Rosemary; Feldman, Michael; Valdivieso, Federico; Dhir, Rajiv; Luketich, James; Mora Pinero, Edna M.; Quintero-Aguilo, Mario; Carlotti, Carlos Gilberto; Dos Santos, Jose Sebastião; Kemp, Rafael; Sankarankuty, Ajith; Tirapelli, Daniela; Catto, James; Agnew, Kathy; Swisher, Elizabeth; Creaney, Jenette; Robinson, Bruce; Shelley, Carl Simon; Godwin, Eryn M.; Kendall, Sara; Shipman, Cassaundra; Bradford, Carol; Carey, Thomas; Haddad, Andrea; Moyer, Jeffey; Peterson, Lisa; Prince, Mark; Rozek, Laura; Wolf, Gregory; Bowman, Rayleen; 
Fong, Kwun M.; Yang, Ian; Korst, Robert; Rathmell, W. Kimryn; Fantacone-Campbell, J. Leigh; Hooke, Jeffrey A.; Kovatich, Albert J.; Shriver, Craig D.; DiPersio, John; Drake, Bettina; Govindan, Ramaswamy; Heath, Sharon; Ley, Timothy; Van Tine, Brian; Westervelt, Peter; Rubin, Mark A.; Lee, Jung Il; Aredes, Natália D.; Mariamidze, Armaz

2018-01-01

The Cancer Genome Atlas (TCGA) cancer genomics dataset includes over 10,000 tumor-normal exome pairs across 33 different cancer types, in total >400 TB of raw data files requiring analysis. Here we describe the Multi-Center Mutation Calling in Multiple Cancers project, our effort to generate a
Genomic impact of eukaryotic transposable elements.

Science.gov (United States)

Arkhipova, Irina R; Batzer, Mark A; Brosius, Juergen; Feschotte, Cédric; Moran, John V; Schmitz, Jürgen; Jurka, Jerzy

2012-11-21

The third international conference on the genomic impact of eukaryotic transposable elements (TEs) was held 24 to 28 February 2012 at the Asilomar Conference Center, Pacific Grove, CA, USA. Sponsored in part by the National Institutes of Health grant 5 P41 LM006252, the goal of the conference was to bring together researchers from around the world who study the impact and mechanisms of TEs using multiple computational and experimental approaches. The meeting drew close to 170 attendees and included invited floor presentations on the biology of TEs and their genomic impact, as well as numerous talks contributed by young scientists. The workshop talks were devoted to computational analysis of TEs with additional time for discussion of unresolved issues. Also, there was ample opportunity for poster presentations and informal evening discussions. The success of the meeting reflects the important role of Repbase in comparative genomic studies, and emphasizes the need for close interactions between experimental and computational biologists in the years to come.
Trans-generational radiation-induced chromosomal instability in the female enhances the action of chemical mutagens

International Nuclear Information System (INIS)

Camats, Nuria; Garcia, Francisca; Parrilla, Juan Jose; Calaf, Joaquim; Martin, Miguel; Caldes, Montserrat Garcia

2008-01-01

Genomic instability can be produced by ionising radiation, so-called radiation-induced genomic instability, and chemical mutagens. Radiation-induced genomic instability occurs in both germinal and somatic cells and also in the offspring of irradiated individuals, and it is characterised by genetic changes including chromosomal rearrangements. The majority of studies of trans-generational, radiation-induced genomic instability have been described in the male germ line, whereas studies that use the female as a model are scarce. The aim of this work is to determine the radiation-induced effects in the foetal offspring of X-ray-treated female rats and, at the same time, the possible impact of this radiation-induced genomic instability on the action of a chemical mutagen. In order to achieve both goals, the quantity and quality of chromosomal damage were analysed. In order to detect trans-generational genomic instability, a total of 4806 metaphases from foetal tissues from the foetal offspring of X-irradiated female rats (5 Gy, acute dose) were analysed. The study's results showed that there is radiation-induced genomic instability: the number of aberrant metaphases and the breaks per total metaphases studied increased significantly (p ≤ 0.05) relative to the control group. In order to identify how this trans-generational, radiation-induced chromosomal instability could influence the chromosomal behaviour of the offspring of irradiated rat females when exposed to a chemical agent (aphidicolin), a total of 2481 metaphases were studied. The observed results showed that there is an enhancement of the action of the chemical agent: chromosomal breaks per aberrant metaphases show significant differences (p ≤ 0.05) in the X-ray- and aphidicolin-treated group compared with the aphidicolin-treated group. In conclusion, our findings indicate that there is trans-generational, radiation-induced chromosomal instability in the foetal cells
Trans-generational radiation-induced chromosomal instability in the female enhances the action of chemical mutagens

Energy Technology Data Exchange (ETDEWEB)

Camats, Nuria [Institut de Biotecnologia i Biomedicina (IBB), Universitat Autonoma de Barcelona, 08193 Barcelona (Spain); Departament de Biologia Cel.lular, Fisiologia i Immunologia, Universitat Autonoma de Barcelona, 08193 Barcelona (Spain); Garcia, Francisca [Institut de Biotecnologia i Biomedicina (IBB), Universitat Autonoma de Barcelona, 08193 Barcelona (Spain); Parrilla, Juan Jose [Servicio de Ginecologia y Obstetricia, Hospital Universitario Virgen de la Arrixaca, 30120 El Palmar, Murcia (Spain); Calaf, Joaquim [Servei de Ginecologia i Obstetricia, Hospital Universitari de la Santa Creu i Sant Pau, 08025 Barcelona (Spain); Martin, Miguel [Departament de Pediatria, d'Obstetricia i Ginecologia i de Medicina Preventiva, Universitat Autonoma de Barcelona, 08193 Barcelona (Spain); Caldes, Montserrat Garcia [Institut de Biotecnologia i Biomedicina (IBB), Universitat Autonoma de Barcelona, 08193 Barcelona (Spain); Departament de Biologia Cel.lular, Fisiologia i Immunologia, Universitat Autonoma de Barcelona, 08193 Barcelona (Spain)], E-mail: Montserrat.Garcia.Caldes@uab.es

2008-04-02

Genomic instability can be produced by ionising radiation, so-called radiation-induced genomic instability, and chemical mutagens. Radiation-induced genomic instability occurs in both germinal and somatic cells and also in the offspring of irradiated individuals, and it is characterised by genetic changes including chromosomal rearrangements. The majority of studies of trans-generational, radiation-induced genomic instability have been described in the male germ line, whereas studies that use the female as a model are scarce. The aim of this work is to determine the radiation-induced effects in the foetal offspring of X-ray-treated female rats and, at the same time, the possible impact of this radiation-induced genomic instability on the action of a chemical mutagen. In order to achieve both goals, the quantity and quality of chromosomal damage were analysed. In order to detect trans-generational genomic instability, a total of 4806 metaphases from foetal tissues from the foetal offspring of X-irradiated female rats (5 Gy, acute dose) were analysed. The study's results showed that there is radiation-induced genomic instability: the number of aberrant metaphases and the breaks per total metaphases studied increased significantly (p ≤ 0.05) relative to the control group. In order to identify how this trans-generational, radiation-induced chromosomal instability could influence the chromosomal behaviour of the offspring of irradiated rat females when exposed to a chemical agent (aphidicolin), a total of 2481 metaphases were studied. The observed results showed that there is an enhancement of the action of the chemical agent: chromosomal breaks per aberrant metaphases show significant differences (p ≤ 0.05) in the X-ray- and aphidicolin-treated group compared with the aphidicolin-treated group. In conclusion, our findings indicate that there is trans-generational, radiation-induced chromosomal instability in the foetal
ATM signaling and genomic stability in response to DNA damage

International Nuclear Information System (INIS)

Lavin, Martin F.; Birrell, Geoff; Chen, Philip; Kozlov, Sergei; Scott, Shaun; Gueven, Nuri

2005-01-01

DNA double strand breaks represent the most threatening lesion to the integrity of the genome in cells exposed to ionizing radiation and radiomimetic chemicals. Those breaks are recognized, signaled to cell cycle checkpoints and repaired by protein complexes. The product of the gene (ATM) mutated in the human genetic disorder ataxia-telangiectasia (A-T) plays a central role in the recognition and signaling of DNA damage. ATM is one of an ever growing number of proteins which when mutated compromise the stability of the genome and predispose to tumour development. Mechanisms for recognising double strand breaks in DNA, maintaining genome stability and minimizing risk of cancer are discussed.
77 FR 60446 - Center for Scientific Review; Notice of Closed Meeting

Science.gov (United States)

2012-10-03

... Panel; Genomic, Molecular Genetics Variation Studies Using Model Organisms AREA Review. Date: October 19...: David J Remondini, Ph.D., Scientific Review Officer, Center for Scientific Review, National Institutes...
MIPS: analysis and annotation of proteins from whole genomes.

Science.gov (United States)

Mewes, H W; Amid, C; Arnold, R; Frishman, D; Güldener, U; Mannhaupt, G; Münsterkötter, M; Pagel, P; Strack, N; Stümpflen, V; Warfsmann, J; Ruepp, A

2004-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF), Neuherberg, Germany, provides protein sequence-related information based on whole-genome analysis. The main focus of the work is directed toward the systematic organization of sequence-related attributes as gathered by a variety of algorithms, primary information from experimental data together with information compiled from the scientific literature. MIPS maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the database of complete cDNAs (German Human Genome Project, NGFN), the database of mammalian protein-protein interactions (MPPI), the database of FASTA homologies (SIMAP), and the interface for the fast retrieval of protein-associated information (QUIPOS). The Arabidopsis thaliana database, the rice database, the plant EST databases (MATDB, MOsDB, SPUTNIK), as well as the databases for the comprehensive set of genomes (PEDANT genomes) are described elsewhere in the 2003 and 2004 NAR database issues, respectively. All databases described, and the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de).
Evaluation of three automated genome annotations for Halorhabdus utahensis.

Directory of Open Access Journals (Sweden)

Peter Bakke

2009-07-01

Genome annotations are accumulating rapidly and depend heavily on automated annotation systems. Many genome centers offer annotation systems but no one has compared their output in a systematic way to determine accuracy and inherent errors. Errors in the annotations are routinely deposited in databases such as NCBI and used to validate subsequent annotation errors. We submitted the genome sequence of halophilic archaeon Halorhabdus utahensis to be analyzed by three genome annotation services. We have examined the output from each service in a variety of ways in order to compare the methodology and effectiveness of the annotations, as well as to explore the genes, pathways, and physiology of the previously unannotated genome. The annotation services differ considerably in gene calls, features, and ease of use. We had to manually identify the origin of replication and the species-specific consensus ribosome-binding site. Additionally, we conducted laboratory experiments to test H. utahensis growth and enzyme activity. Current annotation practices need to improve in order to more accurately reflect a genome's biological potential. We make specific recommendations that could improve the quality of microbial annotation projects.
Structural genomics of infectious disease drug targets: the SSGCID

International Nuclear Information System (INIS)

Stacy, Robin; Begley, Darren W.; Phan, Isabelle; Staker, Bart L.; Van Voorhis, Wesley C.; Varani, Gabriele; Buchko, Garry W.; Stewart, Lance J.; Myler, Peter J.

2011-01-01

An introduction and overview of the focus, goals and overall mission of the Seattle Structural Genomics Center for Infectious Disease (SSGCID) is given. The Seattle Structural Genomics Center for Infectious Disease (SSGCID) is a consortium of researchers at Seattle BioMed, Emerald BioStructures, the University of Washington and Pacific Northwest National Laboratory that was established to apply structural genomics approaches to drug targets from infectious disease organisms. The SSGCID is currently funded over a five-year period by the National Institute of Allergy and Infectious Diseases (NIAID) to determine the three-dimensional structures of 400 proteins from a variety of Category A, B and C pathogens. Target selection engages the infectious disease research and drug-therapy communities to identify drug targets, essential enzymes, virulence factors and vaccine candidates of biomedical relevance to combat infectious diseases. The protein-expression systems, purified proteins, ligand screens and three-dimensional structures produced by SSGCID constitute a valuable resource for drug-discovery research, all of which is made freely available to the greater scientific community. This issue of Acta Crystallographica Section F, entirely devoted to the work of the SSGCID, covers the details of the high-throughput pipeline and presents a series of structures from a broad array of pathogenic organisms. Here, a background is provided on the structural genomics of infectious disease, the essential components of the SSGCID pipeline are discussed and a survey of progress to date is presented
DHS Office of Health Affairs Chemical Defense Program Analyzes Subway Safety Against Chemical Terrorist Threats

OpenAIRE

Center for Homeland Defense and Security

2012-01-01

Center for Homeland Defense and Security, OUT OF THE CLASSROOM In an article for the journal Domestic Preparedness, Joselito Ignacio examines how to protect subway riders from chemical attacks. Ignacio graduated from the Center for Homeland Defense and Security in...
Enabling a Community to Dissect an Organism: Overview of the Neurospora Functional Genomics Project

OpenAIRE

Dunlap, Jay C.; Borkovich, Katherine A.; Henn, Matthew R.; Turner, Gloria E.; Sachs, Matthew S.; Glass, N. Louise; McCluskey, Kevin; Plamann, Michael; Galagan, James E.; Birren, Bruce W.; Weiss, Richard L.; Townsend, Jeffrey P.; Loros, Jennifer J.; Nelson, Mary Anne; Lambreghts, Randy

2007-01-01

A consortium of investigators is engaged in a functional genomics project centered on the filamentous fungus Neurospora, with an eye to opening up the functional genomic analysis of all the filamentous fungi. The overall goal of the four interdependent projects in this effort is to accomplish functional genomics, annotation, and expression analyses of Neurospora crassa, a filamentous fungus that is an established model for the assemblage of over 250,000 species of nonyeast fungi. Building fr...

Origins of the Human Genome Project.

Science.gov (United States)

Watson, J D; Cook-Deegan, R M

1991-01-01

The Human Genome Project has become a reality. Building on a debate that dates back to 1985, several genome projects are now in full stride around the world, and more are likely to form in the next several years. Italy began its genome program in 1987, and the United Kingdom and U.S.S.R. in 1988. The European communities mounted several genome projects on yeast, bacteria, Drosophila, and Arabidopsis thaliana (a rapidly growing plant with a small genome) in 1988, and in 1990 commenced a new 2-year program on the human genome. In the United States, we have completed the first year of operation of the National Center for Human Genome Research at the National Institutes of Health (NIH), now the largest single funding source for genome research in the world. There have been dedicated budgets focused on genome-scale research at NIH, the U.S. Department of Energy, and the Howard Hughes Medical Institute for several years, and results are beginning to accumulate. There were three annual meetings on genome mapping and sequencing at Cold Spring Harbor, New York, in the spring of 1988, 1989, and 1990; the talks have shifted from a discussion about how to approach problems to presenting results from experiments already performed. We have finally begun to work rather than merely talk. The purpose of genome projects is to assemble data on the structure of DNA in human chromosomes and those of other organisms. A second goal is to develop new technologies to perform mapping and sequencing. There have been impressive technical advances in the past 5 years since the debate about the human genome project began. We are on the verge of beginning pilot projects to test several approaches to sequencing long stretches of DNA, using both automation and manual methods. Ordered sets of yeast artificial chromosome and cosmid clones have been assembled to span more than 2 million base pairs of several human chromosomes, and a region of 10 million base pairs has been assembled for
Protecting genomic data analytics in the cloud: state of the art and opportunities.

Science.gov (United States)

Tang, Haixu; Jiang, Xiaoqian; Wang, Xiaofeng; Wang, Shuang; Sofia, Heidi; Fox, Dov; Lauter, Kristin; Malin, Bradley; Telenti, Amalio; Xiong, Li; Ohno-Machado, Lucila

2016-10-13

The outsourcing of genomic data into public cloud computing settings raises concerns over privacy and security. Significant advancements in secure computation methods have emerged over the past several years, but such techniques need to be rigorously evaluated for their ability to support the analysis of human genomic data in an efficient and cost-effective manner. With respect to public cloud environments, there are concerns about the inadvertent exposure of human genomic data to unauthorized users. In analyses involving multiple institutions, there is additional concern about data being used beyond agreed research scope and being processed in untrusted computational environments, which may not satisfy institutional policies. To systematically investigate these issues, the NIH-funded National Center for Biomedical Computing iDASH (integrating Data for Analysis, 'anonymization' and SHaring) hosted the second Critical Assessment of Data Privacy and Protection competition to assess the capacity of cryptographic technologies for protecting computation over human genomes in the cloud and promoting cross-institutional collaboration. Data scientists were challenged to design and engineer practical algorithms for secure outsourcing of genome computation tasks in working software, whereby analyses are performed only on encrypted data. They were also challenged to develop approaches to enable secure collaboration on data from genomic studies generated by multiple organizations (e.g., medical centers) to jointly compute aggregate statistics without sharing individual-level records. The results of the competition indicated that secure computation techniques can enable comparative analysis of human genomes, but greater efficiency (in terms of compute time and memory utilization) is needed before they are sufficiently practical for real world environments.
Plant-symbiotic fungi as chemical engineers: multi-genome analysis of the clavicipitaceae reveals dynamics of alkaloid loci.

Directory of Open Access Journals (Sweden)

Christopher L Schardl

The fungal family Clavicipitaceae includes plant symbionts and parasites that produce several psychoactive and bioprotective alkaloids. The family includes grass symbionts in the epichloae clade (Epichloë and Neotyphodium species), which are extraordinarily diverse both in their host interactions and in their alkaloid profiles. Epichloae produce alkaloids of four distinct classes, all of which deter insects, and some, including the infamous ergot alkaloids, have potent effects on mammals. The exceptional chemotypic diversity of the epichloae may relate to their broad range of host interactions, whereby some are pathogenic and contagious, others are mutualistic and vertically transmitted (seed-borne), and still others vary in pathogenic or mutualistic behavior. We profiled the alkaloids and sequenced the genomes of 10 epichloae, three ergot fungi (Claviceps species), a morning-glory symbiont (Periglandula ipomoeae), and a bamboo pathogen (Aciculosporium take), and compared the gene clusters for four classes of alkaloids. Results indicated a strong tendency for alkaloid loci to have conserved cores that specify the skeleton structures and peripheral genes that determine chemical variations that are known to affect their pharmacological specificities. Generally, gene locations in cluster peripheries positioned them near to transposon-derived, AT-rich repeat blocks, which were probably involved in gene losses, duplications, and neofunctionalizations. The alkaloid loci in the epichloae had unusual structures riddled with large, complex, and dynamic repeat blocks. This feature was not reflective of overall differences in repeat contents in the genomes, nor was it characteristic of most other specialized metabolism loci. The organization and dynamics of alkaloid loci and abundant repeat blocks in the epichloae suggested that these fungi are under selection for alkaloid diversification. We suggest that such selection is related to the variable life histories
Fenton reaction induced cancer in wild type rats recapitulates genomic alterations observed in human cancer.

Directory of Open Access Journals (Sweden)

Shinya Akatsuka

Iron overload has been associated with carcinogenesis in humans. Intraperitoneal administration of ferric nitrilotriacetate initiates a Fenton reaction in renal proximal tubules of rodents that ultimately leads to a high incidence of renal cell carcinoma (RCC) after repeated treatments. We performed high-resolution microarray comparative genomic hybridization to identify characteristics in the genomic profiles of these oxidative stress-induced rat RCCs. The results revealed extensive large-scale genomic alterations with a preference for deletions. Deletions and amplifications were numerous and sometimes fragmented, demonstrating that a Fenton reaction is a cause of such genomic alterations in vivo. Frequency plotting indicated that two of the most commonly altered loci corresponded to a Cdkn2a/2b deletion and a Met amplification. Tumor sizes were proportionally associated with Met expression and/or amplification, and clustering analysis confirmed our results. Furthermore, we developed a procedure to compare whole genomic patterns of the copy number alterations among different species based on chromosomal syntenic relationship. Patterns of the rat RCCs showed the strongest similarity to the human RCCs among five types of human cancers, followed by human malignant mesothelioma, an iron overload-associated cancer. Therefore, an iron-dependent Fenton chemical reaction causes large-scale genomic alterations during carcinogenesis, which may result in distinct genomic profiles. Based on the characteristics of extensive genome alterations in human cancer, our results suggest that this chemical reaction may play a major role during human carcinogenesis.
Focusing on function to mine cancer genome data | Center for Cancer Research

Science.gov (United States)

CCR scientists have devised a strategy to sift through the tens of thousands of mutations in cancer genome data to find mutations that actually drive the disease. They have used the method to discover that the JNK signaling pathway, which in different contexts can either spur cancerous growth or rein it in, acts as a tumor suppressor in gastric cancers.
Functional RNA structures throughout the Hepatitis C Virus genome.

Science.gov (United States)

Adams, Rebecca L; Pirakitikulr, Nathan; Pyle, Anna Marie

2017-06-01

The single-stranded Hepatitis C Virus (HCV) genome adopts a set of elaborate RNA structures that are involved in every stage of the viral lifecycle. Recent advances in chemical probing, sequencing, and structural biology have facilitated analysis of RNA folding on a genome-wide scale, revealing novel structures and networks of interactions. These studies have underscored the active role played by RNA in every function of HCV and they open the door to new types of RNA-targeted therapeutics. Copyright © 2017 Elsevier B.V. All rights reserved.
Fueling the future with fungal genomics

Energy Technology Data Exchange (ETDEWEB)

Grigoriev, Igor V.; Cullen, Dan; Goodwin, Steve X.; Hibbett, David; Jeffries, Thomas W.; Kubicek, Christian P.; Kuske, Cheryl R.; Magnuson, Jon K.; Martin, Francis; Spatafora, Joe W.; Tsang, Adrian; Baker, Scott E.

2011-07-25

Fungi play important roles across the range of current and future biofuel production processes. From crop/feedstock health to plant biomass saccharification, enzyme production to bioprocesses for producing ethanol, higher alcohols or future hydrocarbon biofuels, fungi are involved. Research and development are underway to understand the underlying biological processes and improve them to make them efficient on an industrial scale. Genomics is the foundation of the systems biology approach that is being used to accelerate the research and development efforts across the spectrum of topic areas that impact biofuels production. In this review, we discuss past, current and future advances made possible by genomic analysis of the fungi that impact plant/feedstock health, degradation of lignocellulosic biomass and fermentation of sugars to ethanol, hydrocarbon biofuels and renewable chemicals.
THE CHEMICAL ABUNDANCES IN THE GALACTIC CENTER FROM THE ATMOSPHERES OF RED SUPERGIANTS

International Nuclear Information System (INIS)

Davies, Ben; Figer, Don F.; Origlia, Livia; Kudritzki, Rolf-Peter; Rich, R. Michael; Najarro, Francisco

2009-01-01

The Galactic center (GC) has experienced a high degree of recent star-forming activity, as evidenced by the large number of massive stars currently residing there. The relative abundances of chemical elements in the GC may provide insights into the origins of this activity. Here, we present high-resolution H-band spectra of two red supergiants (RSGs) in the GC (IRS 7 and VR 5-7), and in combination with spectral synthesis we derive abundances for Fe and C, as well as other α-elements Ca, Si, Mg, Ti, and O. We find that the C depletion in VR 5-7 is consistent with the predictions of evolutionary models of RSGs, while the heavy depletion of C and O in IRS 7's atmosphere is indicative of deep mixing, possibly due to fast initial rotation and/or enhanced mass loss. Our results indicate that the current surface Fe/H content of each star is slightly above solar. However, comparisons to evolutionary models indicate that the initial Fe-to-H ratio was likely closer to solar, and has been driven higher by H depletion at the stars' surface. Overall, we find α-to-Fe ratios for both stars, which are consistent with the thin Galactic disk. These results are consistent with other chemical studies of the GC, given the precision to which abundances can currently be determined. We argue that the GC abundances are consistent with a scenario in which the recent star-forming activity in the GC was fueled by either material traveling down the Bar from the inner disk, or from the winds of stars in the inner bulge, with no need to invoke top-heavy stellar initial mass functions to explain anomalous abundance ratios.
The diploid genome sequence of an individual human.

Directory of Open Access Journals (Sweden)

Samuel Levy

2007-09-01

Presented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2-206 bp), 292,102 heterozygous insertion/deletion events (indels) (1-571 bp), 559,473 homozygous indels (1-82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor; however, they involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.
A Genomic Approach: The Effects of Bisphenol A on Zebrafish

Science.gov (United States)

Genomics, proteomics, and metabolomics are emerging technologies used to analyze the effects of the increasing level of environmental pollutants that are affecting aquatic organisms. Some of these toxins are considered endocrine-disrupting chemicals (EDC) due to their interferenc...
Genome resource banking of biomedically important laboratory animals.

Science.gov (United States)

Agca, Yuksel

2012-11-01

Genome resource banking is the systematic collection, storage, and redistribution of biomaterials in an organized, logistical, and secure manner. Genome cryobanks usually contain biomaterials and associated genomic information essential for progression of biomedicine, human health, and research. In that regard, appropriate genome cryobanks could provide essential biomaterials for both current and future research projects in the form of various cell types and tissues, including sperm, oocytes, embryos, embryonic or adult stem cells, induced pluripotent stem cells, and gonadal tissues. In addition to cryobanked germplasm, cryobanking of DNA, serum, blood products, and tissues from scientifically, economically, and ecologically important species has become a common practice. For revitalization of the whole organism, cryopreserved germplasm, in conjunction with assisted reproductive technologies, offers a powerful approach for research model management, as well as assisting in animal production for agriculture, conservation, and human reproductive medicine. Recently, many developed and developing countries have allocated substantial resources to establish genome resource banks, which are responsible for safeguarding scientifically, economically, and ecologically important wild type, mutant, and transgenic plants, fish, and local livestock breeds, as well as wildlife species. This review is dedicated to the memory of Dr. John K. Critser, who has made profound contributions to the science of cryobiology and establishment of genome research and resources centers for mice, rats, and swine. Emphasis will be given to application of genome resource banks to species with substantial contributions to the advancement of biomedicine and human health. Copyright © 2012 Elsevier Inc. All rights reserved.
Teaching Chemical Engineers about Teaching

Science.gov (United States)

Heath, Daniel E.; Hoy, Mary; Rathman, James F.; Rohdieck, Stephanie

2013-01-01

The Chemical and Biomolecular Engineering Department at The Ohio State University in collaboration with the University Center for the Advancement of Teaching developed the Chemical Engineering Mentored Teaching Experience. The Mentored Teaching Experience is an elective for Ph.D. students interested in pursuing faculty careers. Participants are…
Genomic impact of eukaryotic transposable elements

Directory of Open Access Journals (Sweden)

Arkhipova Irina R

2012-11-01

The third international conference on the genomic impact of eukaryotic transposable elements (TEs) was held 24 to 28 February 2012 at the Asilomar Conference Center, Pacific Grove, CA, USA. Sponsored in part by the National Institutes of Health grant 5 P41 LM006252, the goal of the conference was to bring together researchers from around the world who study the impact and mechanisms of TEs using multiple computational and experimental approaches. The meeting drew close to 170 attendees and included invited floor presentations on the biology of TEs and their genomic impact, as well as numerous talks contributed by young scientists. The workshop talks were devoted to computational analysis of TEs with additional time for discussion of unresolved issues. Also, there was ample opportunity for poster presentations and informal evening discussions. The success of the meeting reflects the important role of Repbase in comparative genomic studies, and emphasizes the need for close interactions between experimental and computational biologists in the years to come.
Design and Implementation of a Randomized Controlled Trial of Genomic Counseling for Patients with Chronic Disease

Directory of Open Access Journals (Sweden)

Kevin Sweet

2014-01-01

We describe the development and implementation of a randomized controlled trial to investigate the impact of genomic counseling on a cohort of patients with heart failure (HF) or hypertension (HTN), managed at a large academic medical center, the Ohio State University Wexner Medical Center (OSUWMC). Our study is built upon the existing Coriell Personalized Medicine Collaborative (CPMC®). OSUWMC patient participants with chronic disease (CD) receive eight actionable complex disease and one pharmacogenomic test report through the CPMC® web portal. Participants are randomized to either in-person post-test genomic counseling (active arm) or web-based-only return of results (control arm). Study-specific surveys measure: (1) change in risk perception; (2) knowledge retention; (3) perceived personal control; (4) health behavior change; and, for the active arm, (5) overall satisfaction with genomic counseling. This ongoing partnership has spurred creation of both infrastructure and procedures necessary for the implementation of genomics and genomic counseling in clinical care and clinical research. This included creation of a comprehensive informed consent document and processes for prospective return of actionable results for multiple complex diseases and pharmacogenomics (PGx) through a web portal, and integration of genomic data files and clinical decision support into an EPIC-based electronic medical record. We present this partnership, the infrastructure, genomic counseling approach, and the challenges that arose in the design and conduct of this ongoing trial to inform subsequent collaborative efforts and best genomic counseling practices.
Pigs in sequence space: A 0.66X coverage pig genome survey based on shotgun sequencing

DEFF Research Database (Denmark)

Wernersson, Rasmus; Schierup, M.H.; Jorgensen, F.G.

2005-01-01

sequences (0.66X coverage) from the pig genome. The data are hereby released (NCBI Trace repository with center name "SDJVP", and project name "Sino-Danish Pig Genome Project") together with an initial evolutionary analysis. The non-repetitive fraction of the sequences was aligned to the UCSC human...
Implementation of genomics research in Africa: challenges and recommendations.

Science.gov (United States)

Adebamowo, Sally N; Francis, Veronica; Tambo, Ernest; Diallo, Seybou H; Landouré, Guida; Nembaware, Victoria; Dareng, Eileen; Muhamed, Babu; Odutola, Michael; Akeredolu, Teniola; Nerima, Barbara; Ozumba, Petronilla J; Mbhele, Slee; Ghanash, Anita; Wachinou, Ablo P; Ngomi, Nicholas

2018-01-01

There is exponential growth in the interest and implementation of genomics research in Africa. This growth has been facilitated by the Human Heredity and Health in Africa (H3Africa) initiative, which aims to promote a contemporary research approach to the study of genomics and environmental determinants of common diseases in African populations. The purpose of this article is to describe important challenges affecting genomics research implementation in Africa. The observations, challenges and recommendations presented in this article were obtained through discussions by African scientists at teleconferences and face-to-face meetings, seminars at consortium conferences and in-depth individual discussions. Challenges affecting genomics research implementation in Africa that are related to limited resources include ill-equipped facilities, poor accessibility to research centers, and a lack of both expertise and an enabling environment for research activities in local hospitals. Challenges related to the research study include delayed funding, extensive procedures and interventions requiring multiple visits, delays setting up research teams and insufficient staff training, language barriers and an underappreciation of cultural norms. While many African countries are struggling to initiate genomics projects, others have set up genomics research facilities that meet international standards. The lessons learned in implementing successful genomics projects in Africa are recommended as strategies to overcome these challenges. These recommendations may guide the development and application of new research programs in low-resource settings.
The Functional Genomics Initiative at Oak Ridge National Laboratory

Energy Technology Data Exchange (ETDEWEB)

Johnson, Dabney; Justice, Monica; Beattle, Ken; Buchanan, Michelle; Ramsey, Michael; Ramsey, Rose; Paulus, Michael; Ericson, Nance; Allison, David; Kress, Reid; Mural, Richard; Uberbacher, Ed; Mann, Reinhold

1997-12-31

The Functional Genomics Initiative at the Oak Ridge National Laboratory integrates outstanding capabilities in mouse genetics, bioinformatics, and instrumentation. The 50 year investment by the DOE in mouse genetics/mutagenesis has created a one-of-a-kind resource for generating mutations and understanding their biological consequences. It is generally accepted that, through the mouse as a surrogate for human biology, we will come to understand the function of human genes. In addition to this world class program in mammalian genetics, ORNL has also been a world leader in developing bioinformatics tools for the analysis, management and visualization of genomic data. Combining this expertise with new instrumentation technologies will provide a unique capability to understand the consequences of mutations in the mouse at both the organism and molecular levels. The goal of the Functional Genomics Initiative is to develop the technology and methodology necessary to understand gene function on a genomic scale and apply these technologies to megabase regions of the human genome. The effort is scoped so as to create an effective and powerful resource for functional genomics. ORNL is partnering with the Joint Genome Institute and other large scale sequencing centers to sequence several multimegabase regions of both human and mouse genomic DNA, to identify all the genes in these regions, and to conduct fundamental surveys to examine gene function at the molecular and organism level. The Initiative is designed to be a pilot for larger scale deployment in the post-genome era. Technologies will be applied to the examination of gene expression and regulation, metabolism, gene networks, physiology and development.
Protecting genomic data analytics in the cloud: state of the art and opportunities

Directory of Open Access Journals (Sweden)

Haixu Tang

2016-10-01

The outsourcing of genomic data into public cloud computing settings raises concerns over privacy and security. Significant advancements in secure computation methods have emerged over the past several years, but such techniques need to be rigorously evaluated for their ability to support the analysis of human genomic data in an efficient and cost-effective manner. With respect to public cloud environments, there are concerns about the inadvertent exposure of human genomic data to unauthorized users. In analyses involving multiple institutions, there is additional concern about data being used beyond agreed research scope and being processed in untrusted computational environments, which may not satisfy institutional policies. To systematically investigate these issues, the NIH-funded National Center for Biomedical Computing iDASH (integrating Data for Analysis, ‘anonymization’ and SHaring) hosted the second Critical Assessment of Data Privacy and Protection competition to assess the capacity of cryptographic technologies for protecting computation over human genomes in the cloud and promoting cross-institutional collaboration. Data scientists were challenged to design and engineer practical algorithms for secure outsourcing of genome computation tasks in working software, whereby analyses are performed only on encrypted data. They were also challenged to develop approaches to enable secure collaboration on data from genomic studies generated by multiple organizations (e.g., medical centers) to jointly compute aggregate statistics without sharing individual-level records. The results of the competition indicated that secure computation techniques can enable comparative analysis of human genomes, but greater efficiency (in terms of compute time and memory utilization) is needed before they are sufficiently practical for real world environments.
MIPS: analysis and annotation of proteins from whole genomes in 2005.

Science.gov (United States)

Mewes, H W; Frishman, D; Mayer, K F X; Münsterkötter, M; Noubibou, O; Pagel, P; Rattei, T; Oesterheld, M; Ruepp, A; Stümpflen, V

2006-01-01

The Munich Information Center for Protein Sequences (MIPS at the GSF), Neuherberg, Germany, provides resources related to genome information. Manually curated databases for several reference organisms are maintained. Several of these databases are described elsewhere in this and other recent NAR database issues. In a complementary effort, a comprehensive set of >400 genomes automatically annotated with the PEDANT system are maintained. The main goal of our current work on creating and maintaining genome databases is to extend gene centered information to information on interactions within a generic comprehensive framework. We have concentrated our efforts along three lines: (i) the development of suitable comprehensive data structures and database technology, communication and query tools to include a wide range of different types of information enabling the representation of complex information such as functional modules or networks (Genome Research Environment System), (ii) the development of databases covering computable information such as the basic evolutionary relations among all genes, namely SIMAP, the sequence similarity matrix and the CABiNet network analysis framework and (iii) the compilation and manual annotation of information related to interactions such as protein-protein interactions or other types of relations (e.g. MPCDB, MPPI, CYGD). All databases described and the detailed descriptions of our projects can be accessed through the MIPS WWW server (http://mips.gsf.de).
Strain-specific and pooled genome sequences for populations of Drosophila melanogaster from three continents.

Science.gov (United States)

Bergman, Casey M; Haddrill, Penelope R

2015-01-01

To contribute to our general understanding of the evolutionary forces that shape variation in genome sequences in nature, we have sequenced genomes from 50 isofemale lines and six pooled samples from populations of Drosophila melanogaster on three continents. Analysis of raw and reference-mapped reads indicates the quality of these genomic sequence data is very high. Comparison of the predicted and experimentally-determined Wolbachia infection status of these samples suggests that strain or sample swaps are unlikely to have occurred in the generation of these data. Genome sequences are freely available in the European Nucleotide Archive under accession ERP009059. Isofemale lines can be obtained from the Drosophila Species Stock Center.

Ten years of maintaining and expanding a microbial genome and metagenome analysis system.

Science.gov (United States)

Markowitz, Victor M; Chen, I-Min A; Chu, Ken; Pati, Amrita; Ivanova, Natalia N; Kyrpides, Nikos C

2015-11-01

Launched in March 2005, the Integrated Microbial Genomes (IMG) system is a comprehensive data management system that supports multidimensional comparative analysis of genomic data. At the core of the IMG system is a data warehouse that contains genome and metagenome datasets sequenced at the Joint Genome Institute or provided by scientific users, as well as public genome datasets available at the National Center for Biotechnology Information Genbank sequence data archive. Genomes and metagenome datasets are processed using IMG's microbial genome and metagenome sequence data processing pipelines and are integrated into the data warehouse using IMG's data integration toolkits. Microbial genome and metagenome application specific data marts and user interfaces provide access to different subsets of IMG's data and analysis toolkits. This review article revisits IMG's original aims, highlights key milestones reached by the system during the past 10 years, and discusses the main challenges faced by a rapidly expanding system, in particular the complexity of maintaining such a system in an academic setting with limited budgets and computing and data management infrastructure. Copyright © 2015 Elsevier Ltd. All rights reserved.
An automated system designed for large scale NMR data deposition and annotation: application to over 600 assigned chemical shift data entries to the BioMagResBank from the Riken Structural Genomics/Proteomics Initiative internal database

International Nuclear Information System (INIS)

Kobayashi, Naohiro; Harano, Yoko; Tochio, Naoya; Nakatani, Eiichi; Kigawa, Takanori; Yokoyama, Shigeyuki; Mading, Steve; Ulrich, Eldon L.; Markley, John L.; Akutsu, Hideo; Fujiwara, Toshimichi

2012-01-01

Biomolecular NMR chemical shift data are key information for the functional analysis of biomolecules and the development of new techniques for NMR studies utilizing chemical shift statistical information. Structural genomics projects are major contributors to the accumulation of protein chemical shift information. The management of the large quantities of NMR data generated by each project in a local database and the transfer of the data to the public databases are still formidable tasks because of the complicated nature of NMR data. Here we report an automated and efficient system developed for the deposition and annotation of a large number of data sets including ¹H, ¹³C and ¹⁵N resonance assignments used for the structure determination of proteins. We have demonstrated the feasibility of our system by applying it to over 600 entries from the internal database generated by the RIKEN Structural Genomics/Proteomics Initiative (RSGI) to the public database, BioMagResBank (BMRB). We have assessed the quality of the deposited chemical shifts by comparing them with those predicted from the PDB coordinate entry for the corresponding protein. The same comparison for other matched BMRB/PDB entries deposited from 2001–2011 has been carried out and the results suggest that the RSGI entries greatly improved the quality of the BMRB database. Since the entries include chemical shifts acquired under strikingly similar experimental conditions, these NMR data can be expected to be a promising resource to improve current technologies as well as to develop new NMR methods for protein studies.
Fueling the Future with Fungal Genomics

Energy Technology Data Exchange (ETDEWEB)

Grigoriev, Igor V.; Cullen, Daniel; Hibbett, David; Goodwin, Stephen B.; Jeffries, Thomas W.; Kubicek, Christian P.; Kuske, Cheryl; Magnuson, Jon K.; Martin, Francis; Spatafora, Joey; Tsang, Adrian; Baker, Scott E.

2011-04-29

Fungi play important roles across the range of current and future biofuel production processes. From crop/feedstock health to plant biomass saccharification, enzyme production to bioprocesses for producing ethanol, higher alcohols or future hydrocarbon biofuels, fungi are involved. Research and development are underway to understand the underlying biological processes and improve them to make bioenergy production efficient on an industrial scale. Genomics is the foundation of the systems biology approach that is being used to accelerate the research and development efforts across the spectrum of topic areas that impact biofuels production. In this review, we discuss past, current and future advances made possible by genomic analyses of the fungi that impact plant/feedstock health, degradation of lignocellulosic biomass and fermentation of sugars to ethanol, hydrocarbon biofuels and renewable chemicals.
Polymicrogyria-associated epilepsy: a multi-center phenotypic study from the Epilepsy Phenome/Genome Project

Science.gov (United States)

Shain, Catherine; Ramgopal, Sriram; Fallil, Zianka; Parulkar, Isha; Alongi, Richard; Knowlton, Robert; Poduri, Annapurna

2013-01-01

Purpose: Polymicrogyria (PMG) is an epileptogenic malformation of cortical development. We describe the clinical epilepsy and imaging features of a large cohort with PMG-related epilepsy. Methods: Participants were recruited through the Epilepsy Phenome/Genome Project, a multi-center collaborative effort to collect detailed phenotypic data on individuals with epilepsy. We reviewed phenotypic data from participants with epilepsy and PMG. Key Findings: We identified 87 participants, 43 female and 44 male, with PMG and epilepsy. Median age of seizure onset was 3 years (range <1 month to 37 years). Most presented with focal epilepsy (87.4%), some in combination with seizures generalized from onset (23.0%). Focal seizures with dyscognitive features were most common (54.3%). Of those presenting with generalized seizure types, infantile spasms were most prevalent (45.2%). The most common topographic pattern was perisylvian PMG (77.0%), of which the majority was bilateral (56.7%). Generalized PMG presented with an earlier age of seizure onset (median age of 8 months) and an increased prevalence of developmental delay prior to seizure onset (57.1%). Of the focal, unilateral and asymmetric bilateral groups where PMG was more involved in one hemisphere, the majority (71.4%) of participants had seizures that lateralized to the same hemisphere as the PMG or the hemisphere with greater involvement. Significance: Participants with PMG had both focal and generalized onset of seizures. Our data confirm the involvement of known topographic patterns of PMG and suggest that more extensive distributions of PMG present with an earlier age of seizure onset and increased prevalence of developmental delay prior to seizure onset. PMID:23750890
Ensembl Genomes 2016: more genomes, more complexity.

Science.gov (United States)

Kersey, Paul Julian; Allen, James E; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello-Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M; Howe, Kevin L; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M

2016-01-04

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Improvements to PATRIC, the all-bacterial Bioinformatics Database and Analysis Resource Center

Science.gov (United States)

Wattam, Alice R.; Davis, James J.; Assaf, Rida; Boisvert, Sébastien; Brettin, Thomas; Bun, Christopher; Conrad, Neal; Dietrich, Emily M.; Disz, Terry; Gabbard, Joseph L.; Gerdes, Svetlana; Henry, Christopher S.; Kenyon, Ronald W.; Machi, Dustin; Mao, Chunhong; Nordberg, Eric K.; Olsen, Gary J.; Murphy-Olson, Daniel E.; Olson, Robert; Overbeek, Ross; Parrello, Bruce; Pusch, Gordon D.; Shukla, Maulik; Vonstein, Veronika; Warren, Andrew; Xia, Fangfang; Yoo, Hyunseung; Stevens, Rick L.

2017-01-01

The Pathosystems Resource Integration Center (PATRIC) is the bacterial Bioinformatics Resource Center (https://www.patricbrc.org). Recent changes to PATRIC include a redesign of the web interface and some new services that provide users with a platform that takes them from raw reads to an integrated analysis experience. The redesigned interface allows researchers direct access to tools and data, and the emphasis has changed to user-created genome-groups, with detailed summaries and views of the data that researchers have selected. Perhaps the biggest change has been the enhanced capability for researchers to analyze their private data and compare it to the available public data. Researchers can assemble their raw sequence reads and annotate the contigs using RASTtk. PATRIC also provides services for RNA-Seq, variation, model reconstruction and differential expression analysis, all delivered through an updated private workspace. Private data can be compared by ‘virtual integration’ to any of PATRIC's public data. The number of genomes available for comparison in PATRIC has expanded to over 80 000, with a special emphasis on genomes with antimicrobial resistance data. PATRIC uses this data to improve both subsystem annotation and k-mer classification, and tags new genomes as having signatures that indicate susceptibility or resistance to specific antibiotics. PMID:27899627
Genomic technologies in neonatology

Directory of Open Access Journals (Sweden)

L. N. Chernova

2017-01-01

Full Text Available In recent years, there has been a strong trend toward personalized medicine. Advances in the field have forced clinicians, including neonatologists, to take a fresh look at the prevention, management and therapy of various diseases. Foreign, and increasingly Russian, researchers and doctors are focusing on individual genomic data, which make it possible not only to assess the risk of a given pathology but also to successfully apply personalized strategies of prediction, prevention and targeted treatment. This article provides a brief review of the latest achievements of genomic technologies in newborns and examines the problems and potential applications of genomics in promoting the concept of personalized medicine in neonatology. The increasing amount of personalized data is simply impossible for the human mind alone to analyze, so the need for computers and bioinformatics is obvious. The article describes the role of translational bioinformatics in analyzing the results of accumulated fundamental research and integrating them into complete clinical decisions. The latest advances in neonatal translational bioinformatics, such as clinical decision support systems, are considered. These systems help to monitor vital parameters of newborns that influence the course of a particular disease, to calculate the increased risks of developing various pathologies and to select drugs.
Metingear: a development environment for annotating genome-scale metabolic models.

Science.gov (United States)

May, John W; James, A Gordon; Steinbeck, Christoph

2013-09-01

Genome-scale metabolic models often lack annotations that would allow them to be used for further analysis. Previous efforts have focused on associating metabolites in the model with a cross reference, but this can be problematic if the reference is not freely available, multiple resources are used or the metabolite is added from a literature review. Associating each metabolite with chemical structure provides unambiguous identification of the components and a more detailed view of the metabolism. We have developed an open-source desktop application that simplifies the process of adding database cross references and chemical structures to genome-scale metabolic models. Annotated models can be exported to the Systems Biology Markup Language open interchange format. Source code, binaries, documentation and tutorials are freely available at http://johnmay.github.com/metingear. The application is implemented in Java with bundles available for MS Windows and Macintosh OS X.
Accelerator Center for Energy Research (ACER)

Data.gov (United States)

Federal Laboratory Consortium — The Accelerator Center for Energy Research (ACER) exploits radiation chemistry techniques to study chemical reactions (and other phenomena) by subjecting samples to...
phiGENOME: an integrative navigation throughout bacteriophage genomes.

Science.gov (United States)

Stano, Matej; Klucar, Lubos

2011-11-01

phiGENOME is a web-based genome browser that generates dynamic and interactive graphical representations of phage genomes stored in phiSITE, a database of gene regulation in bacteriophages. phiGENOME is an integral part of the phiSITE web portal (http://www.phisite.org/phigenome) and was optimised for visualisation of phage genomes with an emphasis on gene regulatory elements. phiGENOME consists of three components: (i) a genome map viewer built using Adobe Flash technology, providing dynamic and interactive graphical display of phage genomes; (ii) a sequence browser based on precisely formatted HTML tags, providing detailed exploration of genome features at the sequence level; and (iii) a regulation illustrator, based on Scalable Vector Graphics (SVG) and designed for graphical representation of gene regulation. Bringing together 542 complete genome sequences accompanied by their rich annotations and references makes phiGENOME a unique information resource in the field of phage genomics. Copyright © 2011 Elsevier Inc. All rights reserved.
Toward a Standards-Compliant Genomic and Metagenomic Publication Record

DEFF Research Database (Denmark)

Garrity, GM; Field, D; Kyrpides, N

2008-01-01

Increasingly, we are aware as a community of the growing need to manage the avalanche of genomic and metagenomic data, in addition to related data types like ribosomal RNA and barcode sequences, in a way that tightly integrates contextual data with traditional literature in a machine-readable way...... is in the midst of a publishing revolution. This revolution is marked by a growing shift away from a traditional dichotomy between "journal articles" and "database entries" and an increasing adoption of hybrid models of collecting and disseminating scientific information. With respect to genomes and metagenomes...... or communities) such as the call by the GSC for a central repository of Standard Operating Procedures describing the genomic annotation pipelines of the major sequencing centers. We argue that such an "eJournal," published under the Open Access paradigm by the GSC, could be an attractive publishing forum...
Genome Maps, a new generation genome browser.

Science.gov (United States)

Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

2013-07-01

Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org.
Structural Genomics and Drug Discovery for Infectious Diseases

International Nuclear Information System (INIS)

Anderson, W.F.

2009-01-01

The application of structural genomics methods and approaches to proteins from organisms causing infectious diseases is making available the three dimensional structures of many proteins that are potential drug targets and laying the groundwork for structure aided drug discovery efforts. There are a number of structural genomics projects with a focus on pathogens that have been initiated worldwide. The Center for Structural Genomics of Infectious Diseases (CSGID) was recently established to apply state-of-the-art high throughput structural biology technologies to the characterization of proteins from the National Institute for Allergy and Infectious Diseases (NIAID) category A-C pathogens and organisms causing emerging, or re-emerging infectious diseases. The target selection process emphasizes potential biomedical benefits. Selected proteins include known drug targets and their homologs, essential enzymes, virulence factors and vaccine candidates. The Center also provides a structure determination service for the infectious disease scientific community. The ultimate goal is to generate a library of structures that are available to the scientific community and can serve as a starting point for further research and structure aided drug discovery for infectious diseases. To achieve this goal, the CSGID will determine protein crystal structures of 400 proteins and protein-ligand complexes using proven, rapid, highly integrated, and cost-effective methods for such determination, primarily by X-ray crystallography. High throughput crystallographic structure determination is greatly aided by frequent, convenient access to high-performance beamlines at third-generation synchrotron X-ray sources.
Chemical Addressability of Ultraviolet-Inactivated Viral Nanoparticles (VNPs)

Science.gov (United States)

Rae, Chris; Koudelka, Kristopher J.; Destito, Giuseppe; Estrada, Mayra N.; Gonzalez, Maria J.; Manchester, Marianne

2008-01-01

Background: Cowpea Mosaic Virus (CPMV) is increasingly being used as a nanoparticle platform for multivalent display of molecules via chemical bioconjugation to the capsid surface. A growing variety of applications have employed the CPMV multivalent display technology including nanoblock chemistry, in vivo imaging, and materials science. CPMV nanoparticles can be inexpensively produced from experimentally infected cowpea plants at high yields and are extremely stable. Although CPMV has not been shown to replicate in mammalian cells, uptake in mammalian cells does occur in vitro and in vivo. Thus, inactivation of the virus RNA genome is important for biosafety considerations; however, the surface characteristics and chemical reactivity of the particles must be maintained in order to preserve chemical and structural functionality. Methodology/Principal Findings: Short wave (254 nm) UV irradiation was used to crosslink the RNA genome within intact particles. Lower doses of UV previously reported to inactivate CPMV infectivity inhibited symptoms on inoculated leaves but did not prohibit systemic virus spread in plants, whereas higher doses caused aggregation of the particles and an increase in chemical reactivity further indicating broken particles. Intermediate doses of 2.0–2.5 J/cm² were shown to maintain particle structure and chemical reactivity, and cellular binding properties were similar to CPMV-WT. Conclusions: These studies demonstrate that it is possible to inactivate CPMV infectivity while maintaining particle structure and function, thus paving the way for further development of CPMV nanoparticles for in vivo applications. PMID:18830402
Chemical addressability of ultraviolet-inactivated viral nanoparticles (VNPs).

Directory of Open Access Journals (Sweden)

Chris Rae

2008-10-01

Full Text Available Cowpea Mosaic Virus (CPMV) is increasingly being used as a nanoparticle platform for multivalent display of molecules via chemical bioconjugation to the capsid surface. A growing variety of applications have employed the CPMV multivalent display technology including nanoblock chemistry, in vivo imaging, and materials science. CPMV nanoparticles can be inexpensively produced from experimentally infected cowpea plants at high yields and are extremely stable. Although CPMV has not been shown to replicate in mammalian cells, uptake in mammalian cells does occur in vitro and in vivo. Thus, inactivation of the virus RNA genome is important for biosafety considerations; however, the surface characteristics and chemical reactivity of the particles must be maintained in order to preserve chemical and structural functionality. Short wave (254 nm) UV irradiation was used to crosslink the RNA genome within intact particles. Lower doses of UV previously reported to inactivate CPMV infectivity inhibited symptoms on inoculated leaves but did not prohibit systemic virus spread in plants, whereas higher doses caused aggregation of the particles and an increase in chemical reactivity further indicating broken particles. Intermediate doses of 2.0–2.5 J/cm² were shown to maintain particle structure and chemical reactivity, and cellular binding properties were similar to CPMV-WT. These studies demonstrate that it is possible to inactivate CPMV infectivity while maintaining particle structure and function, thus paving the way for further development of CPMV nanoparticles for in vivo applications.
Waste management at the Karlsruhe Nuclear Research Center

International Nuclear Information System (INIS)

Hoehlein, G.; Lins, W.

1982-01-01

In the Karlsruhe Nuclear Research Center the responsibility for waste management is concentrated in the Decontamination Department which serves to collect and transport all liquid waste and solid material from central areas in the center for further waste treatment, clean radioactive equipment for repair and re-use or for recycling of material, remove from the liquid effluents any radioactive and chemical pollutants as specified in legislation on the protection of waters, convert radioactive wastes into mechanically and chemically stable forms allowing them to be transported into a repository. (orig./RW)
Implementation of genomics research in Africa: challenges and recommendations

Science.gov (United States)

Adebamowo, Sally N.; Francis, Veronica; Tambo, Ernest; Diallo, Seybou H.; Landouré, Guida; Nembaware, Victoria; Dareng, Eileen; Muhamed, Babu; Odutola, Michael; Akeredolu, Teniola; Nerima, Barbara; Ozumba, Petronilla J.; Mbhele, Slee; Ghanash, Anita; Wachinou, Ablo P.; Ngomi, Nicholas

2018-01-01

ABSTRACT Background: There is exponential growth in the interest in and implementation of genomics research in Africa. This growth has been facilitated by the Human Heredity and Health in Africa (H3Africa) initiative, which aims to promote a contemporary research approach to the study of genomics and environmental determinants of common diseases in African populations. Objective: The purpose of this article is to describe important challenges affecting genomics research implementation in Africa. Methods: The observations, challenges and recommendations presented in this article were obtained through discussions by African scientists at teleconferences and face-to-face meetings, seminars at consortium conferences and in-depth individual discussions. Results: Challenges affecting genomics research implementation in Africa that are related to limited resources include ill-equipped facilities, poor accessibility to research centers, and a lack of expertise and of an enabling environment for research activities in local hospitals. Challenges related to the research study include delayed funding, extensive procedures and interventions requiring multiple visits, delays in setting up research teams, insufficient staff training, language barriers and an underappreciation of cultural norms. While many African countries are struggling to initiate genomics projects, others have set up genomics research facilities that meet international standards. Conclusions: The lessons learned in implementing successful genomics projects in Africa are recommended as strategies to overcome these challenges. These recommendations may guide the development and application of new research programs in low-resource settings. PMID:29336236
Chemical Control of Plant Growth.

Science.gov (United States)

Agricultural Research Center (USDA), Beltsville, MD.

Seven experiments are presented in this Science Study Aid to help students investigate the control of plant growth with chemicals. Plant growth regulators, weed control, and chemical pruning are the topics studied in the experiments which are based on investigations that have been and are being conducted at the U. S. Agricultural Research Center,…
The Impact of Structural Genomics: Expectations and Outcomes

Energy Technology Data Exchange (ETDEWEB)

Chandonia, John-Marc; Brenner, Steven E.

2005-12-21

Structural Genomics (SG) projects aim to expand our structural knowledge of biological macromolecules, while lowering the average costs of structure determination. We quantitatively analyzed the novelty, cost, and impact of structures solved by SG centers, and contrast these results with traditional structural biology. The first structure from a protein family is particularly important to reveal the fold and ancient relationships to other proteins. In the last year, approximately half of such structures were solved at a SG center rather than in a traditional laboratory. Furthermore, the cost of solving a structure at the most efficient U.S. center has now dropped to one-quarter the estimated cost of solving a structure by traditional methods. However, top structural biology laboratories are much more efficient than the average, and comparable to SG centers despite working on very challenging structures. Moreover, traditional structural biology papers are cited significantly more often, suggesting greater current impact.
Assessment of indoor environment in Paris child day care centers.

Science.gov (United States)

Roda, Célina; Barral, Sophie; Ravelomanantsoa, Hanitriniala; Dusséaux, Murielle; Tribout, Martin; Le Moullec, Yvon; Momas, Isabelle

2011-11-01

Children are sensitive to indoor environmental pollution. Up until now there has been a lack of data on air quality in child day care centers. The aim of this study is to document the indoor environment quality of Paris child day care centers by repeated measurements, and to compare pollutant levels in child day care centers with levels in Paris dwellings. We selected 28 child day care centers frequented by a random sample of babies who participated in the PARIS birth cohort environmental investigation, and visited the child day care centers for one week twice in one year. The biological contaminants assessed were fungi, endotoxin and dust mite allergens; the chemical pollutants were aldehydes, volatile organic compounds and nitrogen dioxide (NO2). Relative humidity, temperature, and carbon dioxide levels were measured simultaneously. A standardized questionnaire was used to gather information about the buildings and their inhabitants. Airborne endotoxin levels in child day care centers were higher than those found in Paris dwellings. Dust mite allergens in child day care centers were below the threshold level for sensitization in the majority of samples, in common with dwelling samples. Penicillium and Cladosporium were the most commonly identified fungal genera. The child day care center indoor/outdoor ratio for most chemical pollutants was above unity except for NO2, the levels for NO2 being significantly higher than those measured in homes. Chemical and biological contamination in child day care centers appears to be low, apart from endotoxin and NO2. Failure to take child exposure in child day care centers into account could result in an overestimation of children's exposure to other pollutants. Copyright © 2011 Elsevier Inc. All rights reserved.

Genomics using the Assembly of the Mink Genome

DEFF Research Database (Denmark)

Guldbrandtsen, Bernt; Cai, Zexi; Sahana, Goutam

2018-01-01

The genome of the American mink (Neovison vison) has recently been sequenced. This opens numerous avenues of research both for studying the basic genetics and physiology of the mink as well as genetic improvement in mink. Using genotyping-by-sequencing (GBS) generated marker data for 2,352 Danish farm...... mink runs of homozygosity (ROH) were detected in mink genomes. Detectable ROH made up on average 1.7% of the genome, indicating the presence of at most a moderate level of genomic inbreeding. The fraction of genome regions found in ROH varied. Ten percent of the included regions were never found in ROH....... The ability to detect ROH in the mink genome also demonstrates the general reliability of the new mink genome assembly. Keywords: american mink, run of homozygosity, genome, selection, genomic inbreeding...
A DNA minor groove electronegative potential genome map based on photo-chemical probing

DEFF Research Database (Denmark)

Lindemose, Søren; Nielsen, Peter Eigil; Hansen, Morten

2011-01-01

The double-stranded DNA of the genome contains both sequence information directly relating to the protein and RNA coding as well as functional and structural information relating to protein recognition. Only recently is the importance of DNA shape in this recognition process being fully appreciat...
Visualization for genomics: the Microbial Genome Viewer.

NARCIS (Netherlands)

Kerkhoven, R.; Enckevort, F.H.J. van; Boekhorst, J.; Molenaar, D; Siezen, R.J.

2004-01-01

SUMMARY: A Web-based visualization tool, the Microbial Genome Viewer, is presented that allows the user to combine complex genomic data in a highly interactive way. This Web tool enables the interactive generation of chromosome wheels and linear genome maps from genome annotation data stored in a
Genome-wide Studies of Mycolic Acid Bacteria: Computational Identification and Analysis of a Minimal Genome

KAUST Repository

Kamanu, Frederick Kinyua

2012-12-01

The mycolic acid bacteria are a distinct suprageneric group of asporogenous Gram-positive, high GC-content bacteria, distinguished by the presence of mycolic acids in their cell envelope. They exhibit great diversity in their cell and morphology; although primarily non-pathogens, this group contains three major pathogens: Mycobacterium leprae, Mycobacterium tuberculosis complex, and Corynebacterium diphtheriae. Although the mycolic acid bacteria are a clearly defined group of bacteria, the taxonomic relationships between its constituent genera and species are less well defined. Two approaches were tested for their suitability in describing the taxonomy of the group. First, a Multilocus Sequence Typing (MLST) experiment was assessed and found to be superior to monophyletic (16S small ribosomal subunit) in delineating a total of 52 mycolic acid bacterial species. Phylogenetic inference was performed using the neighbor-joining method. To further refine phylogenetic analysis and to take advantage of the widespread availability of bacterial genome data, a computational framework that simulates DNA-DNA hybridisation was developed and validated using multiscale bootstrap resampling. The tool classifies microbial genomes based on whole genome DNA, and was deployed as a web application using PHP and JavaScript. It is accessible online at http://cbrc.kaust.edu.sa/dna_hybridization/. A third study applied computational and statistical methods to the identification and analysis of a putative minimal mycolic acid bacterial genome so as to better understand (1) the genomic requirements to encode a mycolic acid bacterial cell and (2) the role and type of genes and genetic elements that lead to the massive increase in genome size in environmental mycolic acid bacteria. Using a reciprocal comparison approach, a total of 690 orthologous gene clusters forming a putative minimal genome were identified across 24 mycolic acid bacterial species. In order to identify new potential drug
Use of Genomic Databases for Inquiry-Based Learning about Influenza

Science.gov (United States)

Ledley, Fred; Ndung'u, Eric

2011-01-01

The genome projects of the past decades have created extensive databases of biological information with applications in both research and education. We describe an inquiry-based exercise that uses one such database, the National Center for Biotechnology Information Influenza Virus Resource, to advance learning about influenza. This database…
Quantum chemical modelling of magnesium centers in LiF crystals

International Nuclear Information System (INIS)

Shlyuger, A.L.; Mysovskij, S.N.; Nepomnyashchikh, A.I.

1989-01-01

It is shown theoretically that optical absorption at 4.0 eV in irradiated LiF crystals is linked with Mg c + v a + v c - centers (M-centers) and results from electron transitions from quasi-local states in the valence band to vetre local state. V k and M-centers resulting from M-center photodecolorization at low temperatures cause optical absorption with maxima at 3.5 and 5.0 eV. M-centers are transformed into M-centers oriented along axis at temperatures higher than 240 K. Optical excitation of M-centers oriented along axis with 5.5 eV maximum results in the initiation of luminescence at 2 eV.
Characterization of genome-reduced Bacillus subtilis strains and their application for the production of guanosine and thymidine.

Science.gov (United States)

Li, Yang; Zhu, Xujun; Zhang, Xueyu; Fu, Jing; Wang, Zhiwen; Chen, Tao; Zhao, Xueming

2016-06-03

Genome streamlining has emerged as an effective strategy to boost the production efficiency of bio-based products. Many efforts have been made to construct desirable chassis cells by reducing the genome size of microbes. It has been reported that the genome-reduced Bacillus subtilis strain MBG874 showed clear advantages for the production of several heterologous enzymes including alkaline cellulase and protease. In addition to enzymes, B. subtilis is also used for the production of chemicals. To our best knowledge, it is still unknown whether genome reduction could be used to optimize the production of chemicals such as nucleoside products. In this study, we constructed a series of genome-reduced strains by deleting non-essential regions in the chromosome of B. subtilis 168. These strains with genome reductions ranging in size from 581.9 to 814.4 kb displayed markedly decreased growth rates, sporulation ratios, transformation efficiencies and maintenance coefficients, as well as increased cell yields. We re-engineered the genome-reduced strains to produce guanosine and thymidine, respectively. The strain BSK814G2, in which purA was knocked out, and prs, purF and guaB were co-overexpressed, produced 115.2 mg/L of guanosine, which was 4.4-fold higher compared to the control strain constructed by introducing the same gene modifications into the parental strain. We also constructed a thymidine producer by deleting the tdk gene and overexpressing the prs, ushA, thyA, dut, and ndk genes from Escherichia coli in strain BSK756, and the resulting strain BSK756T3 accumulated 151.2 mg/L thymidine, showing a 5.2-fold increase compared to the corresponding control strain. Genome-scale genetic manipulation has a variety of effects on the physiological characteristics and cell metabolism of B. subtilis. By introducing specific gene modifications related to guanosine and thymidine accumulation, respectively, we demonstrated that genome-reduced strains had greatly improved
Rapid Prototyping of Microbial Cell Factories via Genome-scale Engineering

Science.gov (United States)

Si, Tong; Xiao, Han; Zhao, Huimin

2014-01-01

Advances in reading, writing and editing genetic materials have greatly expanded our ability to reprogram biological systems at the resolution of a single nucleotide and on the scale of a whole genome. Such capacity has greatly accelerated the cycles of design, build and test to engineer microbes for efficient synthesis of fuels, chemicals and drugs. In this review, we summarize the emerging technologies that have been applied, or are potentially useful for genome-scale engineering in microbial systems. We will focus on the development of high-throughput methodologies, which may accelerate the prototyping of microbial cell factories. PMID:25450192
Toxicology of Chemical Mixtures: A Review of Mixtures Assessment

National Research Council Canada - National Science Library

Bjarnason, Stephen

2004-01-01

.... Recent advances in disciplines such as genomics, proteomics, metabonomics and physiologically-based pharmacokinetic modeling should assist in the hazard assessment of complex chemical mixtures. However, the process of regulatory assessment of these types of exposures will remain both complex and difficult.
A computational approach to chemical etiologies of diabetes

DEFF Research Database (Denmark)

Audouze, Karine Marie Laure; Brunak, Søren; Grandjean, Philippe

2013-01-01

Computational meta-analysis can link environmental chemicals to genes and proteins involved in human diseases, thereby elucidating possible etiologies and pathogeneses of non-communicable diseases. We used an integrated computational systems biology approach to examine possible pathogenetic...... linkages in type 2 diabetes (T2D) through genome-wide associations, disease similarities, and published empirical evidence. Ten environmental chemicals were found to be potentially linked to T2D, the highest scores were observed for arsenic, 2,3,7,8-tetrachlorodibenzo-p-dioxin, hexachlorobenzene...
Economic Aspects of the Chemical Industry

Science.gov (United States)

Koleske, Joseph V.

Within the formal disciplines of science at traditional universities, through the years, chemistry has grown to have a unique status because of its close correspondence with an industry and with a branch of engineering—the chemical industry and chemical engineering. There is no biology industry, but aspects of biology have closely related disciplines such as fish raising and other aquaculture, animal cloning and other facets of agriculture, ethical drugs of pharmaceutical manufacture, genomics, water quality and conservation, and the like. Although there is no physics industry, there are power generation, electricity, computers, optics, magnetic media, and electronics that exist as industries. However, in the case of chemistry, there is a named industry. This unusual correspondence no doubt came about because in the chemical industry one makes things from raw materials—chemicals—and the science, manufacture, and use of chemicals grew up together during the past century or so.
The perennial ryegrass GenomeZipper: targeted use of genome resources for comparative grass genomics.

Science.gov (United States)

Pfeifer, Matthias; Martis, Mihaela; Asp, Torben; Mayer, Klaus F X; Lübberstedt, Thomas; Byrne, Stephen; Frei, Ursula; Studer, Bruno

2013-02-01

Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous assignment of 3,315 out of 8,876 previously unmapped genes to the respective chromosomes. In total, the GenomeZipper incorporates 4,035 conserved grass gene loci, which were used for the first genome-wide sequence divergence analysis between perennial ryegrass, barley, Brachypodium, rice, and sorghum. The perennial ryegrass GenomeZipper is an ordered, information-rich genome scaffold, facilitating map-based cloning and genome assembly in perennial ryegrass and closely related Poaceae species. It also represents a milestone in describing synteny between perennial ryegrass and fully sequenced model grass genomes, thereby increasing our understanding of genome organization and evolution in the most important temperate forage and turf grass species.
MSU-Northern Bio-Energy Center of Excellence

Energy Technology Data Exchange (ETDEWEB)

Kegel, Greg [Montana State Univ., Bozeman, MT (United States); Alcorn-Windy Boy, Jessica [Montana State Univ., Bozeman, MT (United States); Abedin, Md. Joynal [Montana State Univ., Bozeman, MT (United States); Maglinao, Randy [Montana State Univ., Bozeman, MT (United States)

2014-09-30

MSU-Northern established the Bio-Energy Center (the Center) as a Regional Research Center of Excellence to address the obstacles concerning biofuels, feedstock, quality, conversion process, economic viability and public awareness. The Center built its laboratories and expertise in order to research and support product development and commercialization for the bio-energy industry in our region. The Center wanted to support the regional agriculture-based economy by researching biofuels based on feedstocks that can be grown in our region in an environmentally responsible manner. We were also interested in any technology that will improve the emissions and fuel economy performance of heavy-duty diesel engines. The Center had a three-step approach to accomplish these goals: 1. Enhance the Center’s research and testing capabilities. 2. Develop advanced biofuels from locally grown agricultural crops. 3. Educate and outreach for public understanding and acceptance of new technology. The Center was very successful in completing the tasks as outlined in the project plan. Key successes include discovering and patenting a new chemical conversion process for converting camelina oil to jet fuel, as well as promise in developing a heterogeneous Grubbs catalyst to support the new chemical conversion process. The Center also successfully fragmented and deoxygenated naturally occurring lignin with a Ni-NHC catalyst, showing promise for further exploration of using lignin for fuels and fuel additives. This would create another value-added product for lignin that can be sourced from beetle kill trees or waste products from cellulose ethanol fuel facilities.
78 FR 35292 - Center for Scientific Review; Notice of Closed Meetings

Science.gov (United States)

2013-06-12

...: Functional Epigenomics: Developing Tools and Technologies for Manipulation of the Epigenome (R01). Date: July... Special Emphasis Panel; Member Conflict: Genome Integrity and Tumor Progression. Date: July 11, 2013. Time....gov . Name of Committee: Center for Scientific Review Special Emphasis Panel; Member Conflict...
Genome sequence of Vibrio cholerae G4222, a South African clinical isolate

CSIR Research Space (South Africa)

Le Roux, Wouter J

2013-03-01

Full Text Available of Microbiology and Plant Pathology, University of Pretoria, South Africa (b); Center for Microbial Ecology and Genomics, Department of Genetics, University of Pretoria, South Africa (c). Vibrio cholerae, a Gram-negative pathogen autochthonous to the aquatic environment..., is the causative agent of cholera. Here, we report the complete genome sequence of V. cholerae G4222, a clinical isolate from South Africa. Received 17 January 2013. Accepted 8 February 2013. Published 14 March 2013. Citation: le Roux WJ, Chan WY, De Maayer P, Venter SN...
Chemical Engineering at NASA

Science.gov (United States)

Collins, Jacob

2008-01-01

This viewgraph presentation is a review of the career paths for chemical engineers at NASA (specifically NASA Johnson Space Center). The author uses his personal experience and history as an example of the possible career options.
Family genome browser: visualizing genomes with pedigree information.

Science.gov (United States)

Juan, Liran; Liu, Yongzhuang; Wang, Yongtian; Teng, Mingxiang; Zang, Tianyi; Wang, Yadong

2015-07-15

Families with inherited diseases are widely used in Mendelian/complex disease studies. Owing to the advances in high-throughput sequencing technologies, family genome sequencing is becoming increasingly prevalent. Visualizing family genomes can greatly facilitate human genetics studies and personalized medicine. However, due to the complex genetic relationships and high similarities among genomes of consanguineous family members, family genomes are difficult to visualize in traditional genome visualization frameworks. How to visualize the family genome variants and their functions with integrated pedigree information remains a critical challenge. We developed the Family Genome Browser (FGB) to provide comprehensive analysis and visualization for family genomes. The FGB can visualize family genomes effectively at both the individual level and the variant level by integrating genome data with pedigree information. Family genome analysis, including determination of parental origin of the variants, detection of de novo mutations, identification of potential recombination events and identical-by-descent segments, etc., can be performed flexibly. Diverse annotations for the family genome variants, such as dbSNP memberships, linkage disequilibria, genes, variant effects, potential phenotypes, etc., are illustrated as well. Moreover, the FGB can automatically search for de novo mutations and compound heterozygous variants for a selected individual, and guide investigators to find high-risk genes with flexible navigation options. These features enable users to investigate and understand family genomes intuitively and systematically. The FGB is available at http://mlg.hit.edu.cn/FGB/. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Challenges in Strategy and Management of Multinational R&D Centers in Emerging Markets: Perspective from a German Headquarters in the Chemical Sector

Directory of Open Access Journals (Sweden)

Osmar Mitsuo Saito

2013-03-01

Full Text Available The expansion of multinational company (MNC) operations abroad has been an observed trend for decades. What is new is that in recent years research and development (R&D) activities have also become internationalized, with a more intensified focus on emerging countries. Among the implications is the challenge for MNCs to implement effective organizational structures that facilitate the coordination of strategies and R&D management between the headquarters and their global R&D centers. The purpose of this study is to evaluate the strategy from the perspective of the corporate headquarters of a multinational company and the challenges in the formulation of the global R&D strategy and management of each center located in emerging and developed markets. For this reason, we conducted empirical research based on a qualitative, exploratory multiple-case study of a German chemical MNC and its five global R&D centers located in Germany (headquarters), USA, Brazil, China and India. The results suggest the need to create organizational management capabilities for constant re-evaluation of the R&D strategy in order to capture the demands and the temporary windows of opportunity in these markets. These capabilities lead to reducing the strong centralization observed and assigning more responsibilities to the subsidiaries with global R&D center status.
Germplasm Management in the Post-genomics Era-a case study with lettuce

Science.gov (United States)

High-throughput genotyping platforms and next-generation sequencing technologies revolutionized our ways in germplasm characterization. In collaboration with UC Davis Genome Center, we completed a project of genotyping the entire cultivated lettuce (Lactuca sativa L.) collection of 1,066 accessions ...
Genome sequencing reveals complex secondary metabolome in the marine actinomycete Salinispora tropica

Energy Technology Data Exchange (ETDEWEB)

Udwary, Daniel W.; Zeigler, Lisa; Asolkar, Ratnakar; Singan, Vasanth; Lapidus, Alla; Fenical, William; Jensen, Paul R.; Moore, Bradley S.

2007-05-01

Recent fermentation studies have identified actinomycetes of the marine-dwelling genus Salinispora as prolific natural product producers. To further evaluate their biosynthetic potential, we analyzed all identifiable secondary natural product gene clusters from the recently sequenced 5,184,724 bp S. tropica CNB-440 circular genome. Our analysis shows that biosynthetic potential meets or exceeds that shown by previous Streptomyces genome sequences as well as other natural product-producing actinomycetes. The S. tropica genome features nine polyketide synthase systems of every known formally classified family, non-ribosomal peptide synthetases and several hybrid clusters. While a few clusters appear to encode molecules previously identified in Streptomyces species, the majority of the 15 biosynthetic loci are novel. Specific chemical information about putative and observed natural product molecules is presented and discussed. In addition, our bioinformatic analysis was critical for the structure elucidation of the novel polyene macrolactam salinilactam A. This study demonstrates the potential for genomic analysis to complement and strengthen traditional natural product isolation studies and firmly establishes the genus Salinispora as a rich source of novel drug-like molecules.

eGenomics: Cataloguing Our Complete Genome Collection III

Directory of Open Access Journals (Sweden)

Dawn Field

2007-01-01

Full Text Available This meeting report summarizes the proceedings of the “eGenomics: Cataloguing our Complete Genome Collection III” workshop held September 11–13, 2006, at the National Institute for Environmental eScience (NIEeS), Cambridge, United Kingdom. This 3rd workshop of the Genomic Standards Consortium was divided into two parts. The first half of the three-day workshop was dedicated to reviewing the genomic diversity of our current and future genome and metagenome collection, and exploring linkages to a series of existing projects through formal presentations. The second half was dedicated to strategic discussions. Outcomes of the workshop include a revised “Minimum Information about a Genome Sequence” (MIGS) specification (v1.1), consensus on a variety of features to be added to the Genome Catalogue (GCat), agreement by several researchers to adopt MIGS for imminent genome publications, and an agreement by the EBI and NCBI to input their genome collections into GCat for the purpose of quantifying the amount of optional data already available (e.g., for geographic location coordinates) and working towards a single, global list of all public genomes and metagenomes.
Genomic Prediction from Whole Genome Sequence in Livestock: The 1000 Bull Genomes Project

DEFF Research Database (Denmark)

Hayes, Benjamin J; MacLeod, Iona M; Daetwyler, Hans D

Advantages of using whole genome sequence data to predict genomic estimated breeding values (GEBV) include better persistence of accuracy of GEBV across generations and more accurate GEBV across breeds. The 1000 Bull Genomes Project provides a database of whole genome sequenced key ancestor bulls....... In a dairy data set, predictions using BayesRC and imputed sequence data from 1000 Bull Genomes were 2% more accurate than with 800k data. We could demonstrate the method identified causal mutations in some cases. Further improvements will come from more accurate imputation of sequence variant genotypes...
Functional toxicogenomic assessment of triclosan in human HepG2 cells using genome-wide CRISPR-Cas9 screen

Science.gov (United States)

Thousands of chemicals for which limited toxicological data are available are used and then detected in humans and the environment. Rapid and cost-effective approaches for assessing the toxicological properties of chemicals are needed. We used CRISPR-Cas9 functional genomic scree...
Next generation tools for genomic data generation, distribution, and visualization.

Science.gov (United States)

Nix, David A; Di Sera, Tonya L; Dalley, Brian K; Milash, Brett A; Cundick, Robert M; Quinn, Kevin S; Courdy, Samir J

2010-09-09

With the rapidly falling cost and availability of high throughput sequencing and microarray technologies, the bottleneck for effectively using genomic analysis in the laboratory and clinic is shifting to one of effectively managing, analyzing, and sharing genomic data. Here we present three open-source, platform independent, software tools for generating, analyzing, distributing, and visualizing genomic data. These include a next generation sequencing/microarray LIMS and analysis project center (GNomEx); an application for annotating and programmatically distributing genomic data using the community vetted DAS/2 data exchange protocol (GenoPub); and a standalone Java Swing application (GWrap) that makes cutting edge command line analysis tools available to those who prefer graphical user interfaces. Both GNomEx and GenoPub use the rich client Flex/Flash web browser interface to interact with Java classes and a relational database on a remote server. Both employ a public-private user-group security model enabling controlled distribution of patient and unpublished data alongside public resources. As such, they function as genomic data repositories that can be accessed manually or programmatically through DAS/2-enabled client applications such as the Integrated Genome Browser. These tools have gained wide use in our core facilities, research laboratories and clinics and are freely available for non-profit use. See http://sourceforge.net/projects/gnomex/, http://sourceforge.net/projects/genoviz/, and http://sourceforge.net/projects/useq.
Imputation and quality control steps for combining multiple genome-wide datasets

Directory of Open Access Journals (Sweden)

Shefali S Verma

2014-12-01

Full Text Available The electronic MEdical Records and GEnomics (eMERGE) network brings together DNA biobanks linked to electronic health records (EHRs) from multiple institutions. Approximately 52,000 DNA samples from distinct individuals have been genotyped using genome-wide SNP arrays across the nine sites of the network. The eMERGE Coordinating Center and the Genomics Workgroup developed a pipeline to impute and merge genomic data across the different SNP arrays to maximize sample size and power to detect associations with a variety of clinical endpoints. The 1000 Genomes cosmopolitan reference panel was used for imputation. Imputation results were evaluated using the following metrics: accuracy of imputation, allelic R2 (estimated correlation between the imputed and true genotypes), and the relationship between allelic R2 and minor allele frequency. Computation time and memory resources required by two different software packages (BEAGLE and IMPUTE2) were also evaluated. A number of challenges were encountered due to the complexity of using two different imputation software packages, multiple ancestral populations, and many different genotyping platforms. We present lessons learned and describe the pipeline implemented here to impute and merge genomic data sets. The eMERGE imputed dataset will serve as a valuable resource for discovery, leveraging the clinical data that can be mined from the EHR.
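The two evaluation metrics named in the record above can be sketched in a few lines of Python. This is only an illustration under stated assumptions. It treats allelic R2 as the squared Pearson correlation between imputed allele dosages and known genotypes (for example, genotypes masked before imputation), whereas imputation packages usually estimate it from posterior probabilities without access to the truth. The function names `allelic_r2` and `minor_allele_frequency` and the toy data are hypothetical, not part of the eMERGE pipeline.

```python
from typing import Sequence


def allelic_r2(imputed_dosage: Sequence[float], true_genotype: Sequence[float]) -> float:
    """Squared Pearson correlation between imputed dosages and true genotypes (0/1/2)."""
    n = len(imputed_dosage)
    mean_x = sum(imputed_dosage) / n
    mean_y = sum(true_genotype) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(imputed_dosage, true_genotype))
    var_x = sum((x - mean_x) ** 2 for x in imputed_dosage)
    var_y = sum((y - mean_y) ** 2 for y in true_genotype)
    if var_x == 0 or var_y == 0:
        return float("nan")  # undefined for monomorphic markers
    return cov * cov / (var_x * var_y)


def minor_allele_frequency(genotypes: Sequence[int]) -> float:
    """Minor allele frequency from diploid genotypes coded as alternate-allele counts."""
    p = sum(genotypes) / (2 * len(genotypes))
    return min(p, 1 - p)


# Toy data, invented for illustration only.
truth = [0, 1, 2, 1, 0, 2]
dosage = [0.1, 0.9, 1.8, 1.2, 0.0, 2.0]
r2_perfect = allelic_r2(truth, truth)
r2_noisy = allelic_r2(dosage, truth)
maf = minor_allele_frequency(truth)
```

Computing both values per marker and plotting one against the other is one way to look at the relationship between allelic R2 and minor allele frequency that the abstract mentions.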
Automated ensemble assembly and validation of microbial genomes

Science.gov (United States)

2014-01-01

Background The continued democratization of DNA sequencing has sparked a new wave of development of genome assembly and assembly validation methods. As individual research labs, rather than centralized centers, begin to sequence the majority of new genomes, it is important to establish best practices for genome assembly. However, recent evaluations such as GAGE and the Assemblathon have concluded that there is no single best approach to genome assembly. Instead, it is preferable to generate multiple assemblies and validate them to determine which is most useful for the desired analysis; this is a labor-intensive process that is often impossible or unfeasible. Results To encourage best practices supported by the community, we present iMetAMOS, an automated ensemble assembly pipeline; iMetAMOS encapsulates the process of running, validating, and selecting a single assembly from multiple assemblies. iMetAMOS packages several leading open-source tools into a single binary that automates parameter selection and execution of multiple assemblers, scores the resulting assemblies based on multiple validation metrics, and annotates the assemblies for genes and contaminants. We demonstrate the utility of the ensemble process on 225 previously unassembled Mycobacterium tuberculosis genomes as well as a Rhodobacter sphaeroides benchmark dataset. On these real data, iMetAMOS reliably produces validated assemblies and identifies potential contamination without user intervention. In addition, intelligent parameter selection produces assemblies of R. sphaeroides comparable to or exceeding the quality of those from the GAGE-B evaluation, affecting the relative ranking of some assemblers. Conclusions Ensemble assembly with iMetAMOS provides users with multiple, validated assemblies for each genome. Although computationally limited to small or mid-sized genomes, this approach is the most effective and reproducible means for generating high-quality assemblies and enables users to
Identification of the iron-sulfur center of spinach ferredoxin-nitrite reductase as a tetranuclear center, and preliminary EPR studies of mechanism.

Science.gov (United States)

Lancaster, J R; Vega, J M; Kamin, H; Orme-Johnson, N R; Orme-Johnson, W H; Krueger, R J; Siegel, L M

1979-02-25

EPR spectroscopic and chemical analyses of spinach nitrite reductase show that the enzyme contains one reducible iron-sulfur center, and one site for binding either cyanide or nitrite, per siroheme. The heme is nearly all in the high spin ferric state in the enzyme as isolated. The extinction coefficient of the enzyme has been revised to E386 = 7.6 X 10(4) cm-1 (M heme)-1. The iron-sulfur center is reduced with difficulty by agents such as reduced methyl viologen (equilibrated with 1 atm of H2 at pH 7.7 in the presence of hydrogenase) or dithionite. Complexation of the enzyme with CO (a known ligand for nitrite reductase heme) markedly increases the reducibility of the iron-sulfur center. New chemical analyses and reinterpretation of previous data show that the enzyme contains 6 mol of iron and 4 mol of acid-labile S2-/mol of siroheme. The EPR spectrum of reduced nitrite reductase in 80% dimethyl sulfoxide establishes clearly that the enzyme contains a tetranuclear iron-sulfur (Fe4S4) center. The ferriheme and Fe4S4 centers are reduced at similar rates (k = 3 to 4 s-1) by dithionite. The dithionite-reduced Fe4S4 center is rapidly (k = 100 s-1) reoxidized by nitrite. These results indicate a role for the Fe4S4 center in catalysis.
NCI Symposium on Chromosome Biology to bring together internationally renowned experts in the fields of chromosome structure and function | Center for Cancer Research

Science.gov (United States)

The Center for Cancer Research’s Center of Excellence in Chromosome Biology is hosting the “Nuclear Structure, Genome Integrity and Cancer Symposium” on November 30 - December 1, 2016 at the Natcher Conference Center, Bethesda, Maryland. Learn more ...
PERMANENCE OF BIOLOGICAL AND CHEMICAL WARFARE AGENTS IN MUNICIPAL SOLID WASTE LANDFILL LEACHATES

Science.gov (United States)

The objective of this work is to permit EPA/ORD's National Homeland Security Research Center (NHSRC) and Edgewood Chemical Biological Center to collaborate in testing the permanence of biological and chemical warfare agents in municipal solid waste landfills. Research into ...
A survey of single nucleotide polymorphisms identified from whole-genome sequencing and their functional effect in the porcine genome.

Science.gov (United States)

Keel, B N; Nonneman, D J; Rohrer, G A

2017-08-01

Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a more significant effect on phenotypic variation than do other types of genetic variants. Hence, a comprehensive list of these functional variants would be of considerable interest in swine genomic studies, particularly those targeting fertility and production traits. Whole-genome sequence was obtained from 72 of the founders of an intensely phenotyped experimental swine herd at the U.S. Meat Animal Research Center (USMARC). These animals included all 24 of the founding boars (12 Duroc and 12 Landrace) and 48 Yorkshire-Landrace composite sows. Sequence reads were mapped to the Sscrofa10.2 genome build, resulting in a mean of 6.1 fold (×) coverage per genome. A total of 22 342 915 high confidence SNPs were identified from the sequenced genomes. These included 21 million previously reported SNPs and 79% of the 62 163 SNPs on the PorcineSNP60 BeadChip assay. Variation was detected in the coding sequence or untranslated regions (UTRs) of 87.8% of the genes in the porcine genome: loss-of-function variants were predicted in 504 genes, 10 202 genes contained nonsynonymous variants, 10 773 had variation in UTRs and 13 010 genes contained synonymous variants. Approximately 139 000 SNPs were classified as loss-of-function, nonsynonymous or regulatory, which suggests that over 99% of the variation detected in our pigs could potentially be ignored, allowing us to focus on a much smaller number of functional SNPs during future analyses. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.
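The closing claim of the record above follows from simple arithmetic on the reported counts, which the following Python sketch reproduces. The figures are taken from the abstract. The value 139 000 is stated there as approximate, so the derived percentages are approximate too, and the variable names are illustrative only.

```python
# Counts as reported in the abstract.
total_snps = 22_342_915      # high confidence SNPs identified
functional_snps = 139_000    # approx. loss-of-function, nonsynonymous or regulatory SNPs
chip_snps = 62_163           # SNPs on the PorcineSNP60 BeadChip assay
chip_share_detected = 0.79   # share of chip SNPs found among the identified SNPs

# Share of detected variation classified as potentially functional (about 0.6%).
functional_fraction = functional_snps / total_snps

# Remaining share, which the authors suggest could potentially be ignored.
ignorable_fraction = 1 - functional_fraction

# Approximate number of chip SNPs recovered by sequencing.
chip_snps_detected = round(chip_share_detected * chip_snps)

print(f"functional: {functional_fraction:.2%}, ignorable: {ignorable_fraction:.2%}")
print(f"chip SNPs detected: about {chip_snps_detected}")
```

The functional share comes out well under 1%, which is consistent with the statement that over 99% of the detected variation could be set aside in future analyses.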
Genome U-Plot: a whole genome visualization.

Science.gov (United States)

Gaitatzes, Athanasios; Johnson, Sarah H; Smadbeck, James B; Vasmatzis, George

2018-05-15

The ability to produce and analyze whole genome sequencing (WGS) data from samples with structural variations (SV) generated the need to visualize such abnormalities in simplified plots. Conventional two-dimensional representations of WGS data frequently use either circular or linear layouts. Both representations have their advantages, but their major disadvantage is that they do not use the two-dimensional space very efficiently. We propose a layout, termed the Genome U-Plot, which spreads the chromosomes on a two-dimensional surface and essentially quadruples the spatial resolution. We present the Genome U-Plot for producing clear and intuitive graphs that allow researchers to generate novel insights and hypotheses by visualizing SVs such as deletions, amplifications, and chromoanagenesis events. The main features of the Genome U-Plot are its layered layout, its high spatial resolution and its improved aesthetic qualities. We compare conventional visualization schemas with the Genome U-Plot using visualization metrics such as number of line crossings and crossing angle resolution measures. Based on our metrics, we improve the readability of the resulting graph by at least 2-fold, making important features apparent and making it easy to identify important genomic changes. The result is a whole genome visualization tool with high spatial resolution and improved aesthetic qualities. An implementation and documentation of the Genome U-Plot is publicly available at https://github.com/gaitat/GenomeUPlot. Contact: vasmatzis.george@mayo.edu. Supplementary data are available at Bioinformatics online.
Nucleotide excision repair : a multi-step mechanism required to maintain genome integrity

NARCIS (Netherlands)

Moser, Jill

2010-01-01

DNA is continuously exposed to exogenous and genotoxic insults including ionizing and ultraviolet radiation as well as chemical agents. DNA damage can compromise the integrity of the genome and have potentially deleterious effects. Ultraviolet light (UV) can induce the formation of helix distorting
Single-Cell-Genomics-Facilitated Read Binning of Candidate Phylum EM19 Genomes from Geothermal Spring Metagenomes.

Science.gov (United States)

Becraft, Eric D; Dodsworth, Jeremy A; Murugapiran, Senthil K; Ohlsson, J Ingemar; Briggs, Brandon R; Kanbar, Jad; De Vlaminck, Iwijn; Quake, Stephen R; Dong, Hailiang; Hedlund, Brian P; Swingley, Wesley D

2016-02-15

The vast majority of microbial life remains uncatalogued due to the inability to cultivate these organisms in the laboratory. This "microbial dark matter" represents a substantial portion of the tree of life and of the populations that contribute to chemical cycling in many ecosystems. In this work, we leveraged an existing single-cell genomic data set representing the candidate bacterial phylum "Calescamantes" (EM19) to calibrate machine learning algorithms and define metagenomic bins directly from pyrosequencing reads derived from Great Boiling Spring in the U.S. Great Basin. Compared to other assembly-based methods, taxonomic binning with a read-based machine learning approach yielded final assemblies with the highest predicted genome completeness of any method tested. Read-first binning subsequently was used to extract Calescamantes bins from all metagenomes with abundant Calescamantes populations, including metagenomes from Octopus Spring and Bison Pool in Yellowstone National Park and Gongxiaoshe Spring in Yunnan Province, China. Metabolic reconstruction suggests that Calescamantes are heterotrophic, facultative anaerobes, which can utilize oxidized nitrogen sources as terminal electron acceptors for respiration in the absence of oxygen and use proteins as their primary carbon source. Despite their phylogenetic divergence, the geographically separate Calescamantes populations were highly similar in their predicted metabolic capabilities and core gene content, respiring O2, or oxidized nitrogen species for energy conservation in distant but chemically similar hot springs. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Rapid prototyping of microbial cell factories via genome-scale engineering.

Science.gov (United States)

Si, Tong; Xiao, Han; Zhao, Huimin

2015-11-15

Advances in reading, writing and editing genetic materials have greatly expanded our ability to reprogram biological systems at the resolution of a single nucleotide and on the scale of a whole genome. Such capacity has greatly accelerated the cycles of design, build and test to engineer microbes for efficient synthesis of fuels, chemicals and drugs. In this review, we summarize the emerging technologies that have been applied, or are potentially useful for genome-scale engineering in microbial systems. We will focus on the development of high-throughput methodologies, which may accelerate the prototyping of microbial cell factories. Copyright © 2014 Elsevier Inc. All rights reserved.
A Thousand Fly Genomes: An Expanded Drosophila Genome Nexus.

Science.gov (United States)

Lack, Justin B; Lange, Jeremy D; Tang, Alison D; Corbett-Detig, Russell B; Pool, John E

2016-12-01

The Drosophila Genome Nexus is a population genomic resource that provides D. melanogaster genomes from multiple sources. To facilitate comparisons across data sets, genomes are aligned using a common reference alignment pipeline which involves two rounds of mapping. Regions of residual heterozygosity, identity-by-descent, and recent population admixture are annotated to enable data filtering based on the user's needs. Here, we present a significant expansion of the Drosophila Genome Nexus, which brings the current data object to a total of 1,121 wild-derived genomes. New additions include 305 previously unpublished genomes from inbred lines representing six population samples in Egypt, Ethiopia, France, and South Africa, along with another 193 genomes added from recently published data sets. We also provide an aligned D. simulans genome to facilitate divergence comparisons. This improved resource will broaden the range of population genomic questions that can be addressed from multi-population allele frequencies and haplotypes in this model species. The larger set of genomes will also enhance the discovery of functionally relevant natural variation that exists within and between populations. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Visualization for genomics: the Microbial Genome Viewer.

Science.gov (United States)

Kerkhoven, Robert; van Enckevort, Frank H J; Boekhorst, Jos; Molenaar, Douwe; Siezen, Roland J

2004-07-22

A Web-based visualization tool, the Microbial Genome Viewer, is presented that allows the user to combine complex genomic data in a highly interactive way. This Web tool enables the interactive generation of chromosome wheels and linear genome maps from genome annotation data stored in a MySQL database. The generated images are in scalable vector graphics (SVG) format, which is suitable for creating high-quality scalable images and dynamic Web representations. Gene-related data such as transcriptome and time-course microarray experiments can be superimposed on the maps for visual inspection. The Microbial Genome Viewer 1.0 is freely available at http://www.cmbi.kun.nl/MGV
Chemical mutagens, transposons, and transgenes to interrogate gene function in Drosophila melanogaster.

Science.gov (United States)

Venken, Koen J T; Bellen, Hugo J

2014-06-15

The study of genetics, genes, and chromosomal inheritance was initiated by Thomas Morgan in 1910, when the first visible mutations were identified in fruit flies. The field expanded upon the work initiated by Herman Muller in 1926 when he used X-rays to develop the first balancer chromosomes. Today, balancers are still invaluable to maintain mutations and transgenes but the arsenal of tools has expanded vastly and numerous new methods have been developed, many relying on the availability of the genome sequence and transposable elements. Forward genetic screens based on chemical mutagenesis or transposable elements have resulted in the unbiased identification of many novel players involved in processes probed by specific phenotypic assays. Reverse genetic approaches have relied on the availability of a carefully selected set of transposon insertions spread throughout the genome to allow the manipulation of the region in the vicinity of each insertion. Lastly, the ability to transform Drosophila with single copy transgenes using transposons or site-specific integration using the ΦC31 integrase has allowed numerous manipulations, including the ability to create and integrate genomic rescue constructs, generate duplications, RNAi knock-out technology, binary expression systems like the GAL4/UAS system as well as other methods. Here, we will discuss the most useful methodologies to interrogate the fruit fly genome in vivo focusing on chemical mutagenesis, transposons and transgenes. Genome engineering approaches based on nucleases and RNAi technology are discussed in following chapters. Copyright © 2014 Elsevier Inc. All rights reserved.
Strategic plans for the Hardwood Tree Improvement and Regeneration Center

Science.gov (United States)

Charles H. Michler; Keith E. Woeste

2002-01-01

The mission of the Hardwood Tree Improvement and Regeneration Center (HTIRC) at Purdue University is to advance the science of hardwood tree improvement and genomics in the central hardwood region of the United States by: developing and disseminating knowledge on improving the genetic quality of hardwood tree species; conserving fine hardwood germplasm; developing...
Fourteenth-Sixteenth Microbial Genomics Conference-2006-2008

Energy Technology Data Exchange (ETDEWEB)

Miller, Jeffrey H

2011-04-18

The concept of an annual meeting on the E. coli genome was formulated at the Banbury Center Conference on the Genome of E. coli in October, 1991. The first meeting was held on September 10-14, 1992 at the University of Wisconsin, and this was followed by a yearly series of meetings, and by an expansion to include The fourteenth meeting took place September 24-28, 2006 at Lake Arrowhead, CA, the fifteenth September 16-20, 2007 at the University of Maryland, College Park, MD, and the sixteenth September 14-18, 2008 at Lake Arrowhead. The full program for the 16th meeting is attached. There have been rapid and exciting advances in microbial genomics that now make it possible to compare large data sets of sequences from a wide variety of microbial genomes, and from whole microbial communities. Examining the “microbiomes”, the living microbial communities in different host organisms, opens up many possibilities for understanding the landscape presented to pathogenic microorganisms. For quite some time there has been a shifting emphasis from pure sequence data to trying to understand how to use that information to solve biological problems. Towards this end new technologies are being developed and improved. Using genetics, functional genomics, and proteomics has been the recent focus of many different laboratories. A key element is the integration of different aspects of microbiology, sequencing technology, analysis techniques, and bioinformatics. The goal of these conferences is to provide a regular forum for these interactions to occur. While there have been a number of genome conferences, what distinguishes the Microbial Genomics Conference is its emphasis on bringing together biology and genetics with sequencing and bioinformatics. Also, this conference is the longest continuing meeting, now established as a major regular annual meeting. In addition to its coverage of microbial genomes and biodiversity, the meetings also highlight microbial communities and the use of
Somatic, positive and negative domains of the Center for Epidemiological Studies Depression (CES-D) scale: a meta-analysis of genome-wide association studies.

Science.gov (United States)

Demirkan, A; Lahti, J; Direk, N; Viktorin, A; Lunetta, K L; Terracciano, A; Nalls, M A; Tanaka, T; Hek, K; Fornage, M; Wellmann, J; Cornelis, M C; Ollila, H M; Yu, L; Smith, J A; Pilling, L C; Isaacs, A; Palotie, A; Zhuang, W V; Zonderman, A; Faul, J D; Sutin, A; Meirelles, O; Mulas, A; Hofman, A; Uitterlinden, A; Rivadeneira, F; Perola, M; Zhao, W; Salomaa, V; Yaffe, K; Luik, A I; Liu, Y; Ding, J; Lichtenstein, P; Landén, M; Widen, E; Weir, D R; Llewellyn, D J; Murray, A; Kardia, S L R; Eriksson, J G; Koenen, K; Magnusson, P K E; Ferrucci, L; Mosley, T H; Cucca, F; Oostra, B A; Bennett, D A; Paunio, T; Berger, K; Harris, T B; Pedersen, N L; Murabito, J M; Tiemeier, H; van Duijn, C M; Räikkönen, K

2016-06-01

Major depressive disorder (MDD) is moderately heritable; however, genome-wide association studies (GWAS) for MDD, as well as for related continuous outcomes, have not shown consistent results. Attempts to elucidate the genetic basis of MDD may be hindered by heterogeneity in diagnosis. The Center for Epidemiological Studies Depression (CES-D) scale provides a widely used tool for measuring depressive symptoms clustered in four different domains, which can be combined into a total score but can also be analysed as separate symptom domains. We performed a meta-analysis of GWAS of the CES-D symptom clusters. We recruited 12 cohorts with the 20- or 10-item CES-D scale (32 528 persons). One single nucleotide polymorphism (SNP), rs713224, located near the brain-expressed melatonin receptor (MTNR1A) gene, was associated with the somatic complaints domain of depression symptoms, with borderline genome-wide significance (p(discovery) = 3.82 × 10⁻⁸). The SNP was analysed in an additional five cohorts comprising the replication sample (6813 persons). However, the association was not consistent among the replication sample (p(discovery+replication) = 1.10 × 10⁻⁶) with evidence of heterogeneity. Despite the effort to harmonize the phenotypes across cohorts and participants, our study is still underpowered to detect consistent association for depression, even by means of symptom classification. On the contrary, the SNP-based heritability and co-heritability estimation results suggest that a very minor part of the variation could be captured by GWAS, explaining the sparse findings.
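The phrase "borderline genome-wide significance" can be made concrete with a small check. The sketch below is in Python and assumes the conventional threshold of 5 × 10⁻⁸, which the abstract does not state; the function and variable names are illustrative only.

```python
# Conventional genome-wide significance threshold. This is an assumption:
# the abstract does not say which threshold the authors applied.
GENOME_WIDE_ALPHA = 5e-8

# P values for rs713224 as quoted in the abstract.
p_values = {
    "discovery": 3.82e-8,
    "discovery+replication": 1.10e-6,
}


def is_genome_wide_significant(p, alpha=GENOME_WIDE_ALPHA):
    """Return True when p falls below the chosen significance threshold."""
    return p < alpha


results = {stage: is_genome_wide_significant(p) for stage, p in p_values.items()}

for stage, significant in results.items():
    print(f"{stage}: p = {p_values[stage]:.2e}, significant = {significant}")
```

Under that assumed threshold the discovery P value passes only narrowly, while the combined discovery and replication P value does not, which matches the abstract's report that the association was not consistent.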

Progress of CRISPR-Cas Based Genome Editing in Photosynthetic Microbes.

Science.gov (United States)

Naduthodi, Mihris Ibnu Saleem; Barbosa, Maria J; van der Oost, John

2018-02-03

The carbon footprint caused by unsustainable development and its environmental and economic impact has become a major concern in the past few decades. Photosynthetic microbes such as microalgae and cyanobacteria are capable of accumulating value-added compounds from carbon dioxide, and have been regarded as environmentally friendly alternatives to reduce the usage of fossil fuels, thereby contributing to reducing the carbon footprint. This light-driven generation of green chemicals and biofuels has triggered the research for metabolic engineering of these photosynthetic microbes. CRISPR-Cas systems are successfully implemented across a wide range of prokaryotic and eukaryotic species for efficient genome editing. However, the inception of this genome editing tool in microalgal and cyanobacterial species took off rather slowly due to various complications. In this review, we elaborate on the established CRISPR-Cas based genome editing in various microalgal and cyanobacterial species. The complications associated with CRISPR-Cas based genome editing in these species are addressed along with possible strategies to overcome these issues. It is anticipated that in the near future this will result in improving and expanding the microalgal and cyanobacterial genome engineering toolbox. © 2018 The Authors. Biotechnology Journal Published by Wiley-VCH Verlag GmbH & Co. KGaA.
COMPUTATIONAL SCIENCE CENTER

Energy Technology Data Exchange (ETDEWEB)

DAVENPORT,J.

2004-11-01

The Brookhaven Computational Science Center brings together researchers in biology, chemistry, physics, and medicine with applied mathematicians and computer scientists to exploit the remarkable opportunities for scientific discovery which have been enabled by modern computers. These opportunities are especially great in computational biology and nanoscience, but extend throughout science and technology and include for example, nuclear and high energy physics, astrophysics, materials and chemical science, sustainable energy, environment, and homeland security.
Stakeholder consultation insights on the future of genomics at the clinical-public health interface.

Science.gov (United States)

Modell, Stephen M; Kardia, Sharon L R; Citrin, Toby

2014-05-01

In summer 2011, the Centers for Disease Control and Prevention Office of Public Health Genomics conducted a stakeholder consultation, administered by the University of Michigan Center for Public Health and Community Genomics, and Genetic Alliance, to recommend priorities for public health genomics from 2012 through 2017. Sixty-two responses from health professionals, administrators, and members of the public were pooled with 2 sets of key informant interviews and 3 discussion groups. NVivo 9 and manual methods were used to organize themes. This review offers an interim analysis of progress with respect to the final recommendations, which demonstrated a strong interest in moving genomic discoveries toward implementation and comparative effectiveness (T3/T4) translational research. A translational research continuum exists with familial breast and ovarian cancer at one end and prostate cancer at the other. Cascade screening for inherited arrhythmia syndromes and hypercholesterolemia lags stakeholder recommendations in the United States but not in Europe; implementation of health service-based screening for Lynch syndrome, and integration into electronic health information systems, is on pace with the recommended timeline. A number of options exist to address deficits in the funding of translational research, particularly for oncogenomic gene expression profiling. The goal of personalized risk assessment necessitates both research progress (eg, in whole genome sequencing) and provider education in the differentiation of low- vs high-risk status. The public health approach supports an emphasis on genetic test validation while endorsing clinical translation research inclusion of an environmental and population-based perspective. Copyright © 2014 Mosby, Inc. All rights reserved.
A novel genomic alteration of LSAMP associates with aggressive prostate cancer in African American men

DEFF Research Database (Denmark)

Petrovics, Gyorgy; Li, Hua; Stümpel, Tanja

2015-01-01

a systematic whole genome analyses, revealing alterations that differentiate African American (AA) and Caucasian American (CA) CaP genomes. We discovered a recurrent deletion on chromosome 3q13.31 centering on the LSAMP locus that was prevalent in tumors from AA men (cumulative analyses of 435 patients: whole...... genome sequence, 14; FISH evaluations, 101; and SNP array, 320 patients). Notably, carriers of this deletion experienced more rapid disease progression. In contrast, PTEN and ERG common driver alterations in CaP were significantly lower in AA prostate tumors compared to prostate tumors from CA. Moreover...
The Sequenced Angiosperm Genomes and Genome Databases.

Science.gov (United States)

Chen, Fei; Dong, Wei; Zhang, Jiawei; Guo, Xinyue; Chen, Junhao; Wang, Zhengjia; Lin, Zhenguo; Tang, Haibao; Zhang, Liangsheng

2018-01-01

Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of humans, animals, and the planet Earth. Despite the numerous advances in genome reports and sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in database reconstruction in the last few years, here we provide a comprehensive review of three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of database and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular with specific groups of researchers, while a timely-updated comprehensive database is more powerful for addressing major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote collaborative studies of important questions in plant biology.
Carbon nanostructure-based field-effect transistors for label-free chemical/biological sensors.

Science.gov (United States)

Hu, PingAn; Zhang, Jia; Li, Le; Wang, Zhenlong; O'Neill, William; Estrela, Pedro

2010-01-01

Over the past decade, electrical detection of chemical and biological species using novel nanostructure-based devices has attracted significant attention for chemical, genomics, biomedical diagnostics, and drug discovery applications. The use of nanostructured devices in chemical/biological sensors in place of conventional sensing technologies has the advantages of high sensitivity, decreased energy consumption and potentially highly miniaturized integration. Owing to their particular structure, excellent electrical properties and high chemical stability, carbon nanotube and graphene based electrical devices have been widely developed for high performance label-free chemical/biological sensors. Here, we review the latest developments of carbon nanostructure-based transistor sensors in ultrasensitive detection of chemical/biological entities, such as poisonous gases, nucleic acids, proteins and cells.
Museum genomics: low-cost and high-accuracy genetic data from historical specimens.

Science.gov (United States)

Rowe, Kevin C; Singhal, Sonal; Macmanes, Matthew D; Ayroles, Julien F; Morelli, Toni Lyn; Rubidge, Emily M; Bi, Ke; Moritz, Craig C

2011-11-01

Natural history collections are unparalleled repositories of geographical and temporal variation in faunal conditions. Molecular studies offer an opportunity to uncover much of this variation; however, genetic studies of historical museum specimens typically rely on extracting highly degraded and chemically modified DNA samples from skins, skulls or other dried samples. Despite this limitation, obtaining short fragments of DNA sequences using traditional PCR amplification of DNA has been the primary method for genetic study of historical specimens. Few laboratories have succeeded in obtaining genome-scale sequences from historical specimens and then only with considerable effort and cost. Here, we describe a low-cost approach using high-throughput next-generation sequencing to obtain reliable genome-scale sequence data from a traditionally preserved mammal skin and skull using a simple extraction protocol. We show that single-nucleotide polymorphisms (SNPs) from the genome sequences obtained independently from the skin and from the skull are highly repeatable compared to a reference genome. © 2011 Blackwell Publishing Ltd.
Evidence-based design and evaluation of a whole genome sequencing clinical report for the reference microbiology laboratory.

Science.gov (United States)

Crisan, Anamaria; McKee, Geoffrey; Munzner, Tamara; Gardy, Jennifer L

2018-01-01

Microbial genome sequencing is now being routinely used in many clinical and public health laboratories. Understanding how to report complex genomic test results to stakeholders who may have varying familiarity with genomics (including clinicians, laboratorians, epidemiologists, and researchers) is critical to the successful and sustainable implementation of this new technology; however, there are no evidence-based guidelines for designing such a report in the pathogen genomics domain. Here, we describe an iterative, human-centered approach to creating a report template for communicating tuberculosis (TB) genomic test results. We used Design Study Methodology, a human-centered approach drawn from the information visualization domain, to redesign an existing clinical report. We used expert consults and an online questionnaire to discover various stakeholders' needs around the types of data and tasks related to TB that they encounter in their daily workflow. We also evaluated their perceptions of and familiarity with genomic data, as well as its utility at various clinical decision points. These data shaped the design of multiple prototype reports that were compared against the existing report through a second online survey, with the resulting qualitative and quantitative data informing the final, redesigned, report. We recruited 78 participants, 65 of whom were clinicians, nurses, laboratorians, researchers, and epidemiologists involved in TB diagnosis, treatment, and/or surveillance. Our first survey indicated that participants were largely enthusiastic about genomic data, with the majority agreeing on its utility for certain TB diagnosis and treatment tasks and many reporting some confidence in their ability to interpret this type of data (between 58.8% and 94.1%, depending on the specific data type). When we compared our four prototype reports against the existing design, we found that for the majority (86.7%) of design comparisons, participants preferred the
GenColors-based comparative genome databases for small eukaryotic genomes.

Science.gov (United States)

Felder, Marius; Romualdi, Alessandro; Petzold, Andreas; Platzer, Matthias; Sühnel, Jürgen; Glöckner, Gernot

2013-01-01

Many sequence data repositories can give a quick and easily accessible overview on genomes and their annotations. Less widespread is the possibility to compare related genomes with each other in a common database environment. We have previously described the GenColors database system (http://gencolors.fli-leibniz.de) and its applications to a number of bacterial genomes such as Borrelia, Legionella, Leptospira and Treponema. This system has an emphasis on genome comparison. It combines data from related genomes and provides the user with an extensive set of visualization and analysis tools. Eukaryote genomes are normally larger than prokaryote genomes and thus pose additional challenges for such a system. We have, therefore, adapted GenColors to also handle larger datasets of small eukaryotic genomes and to display eukaryotic gene structures. Further recent developments include whole genome views, genome list options and, for bacterial genome browsers, the display of horizontal gene transfer predictions. Two new GenColors-based databases for two fungal species (http://fgb.fli-leibniz.de) and for four social amoebas (http://sacgb.fli-leibniz.de) were set up. Both new resources open up a single entry point for related genomes for the amoebozoa and fungal research communities and other interested users. Comparative genomics approaches are greatly facilitated by these resources.
Implementing genomics and pharmacogenomics in the clinic: The National Human Genome Research Institute's genomic medicine portfolio.

Science.gov (United States)

Manolio, Teri A

2016-10-01

Increasing knowledge about the influence of genetic variation on human health and growing availability of reliable, cost-effective genetic testing have spurred the implementation of genomic medicine in the clinic. As defined by the National Human Genome Research Institute (NHGRI), genomic medicine uses an individual's genetic information in his or her clinical care, and has begun to be applied effectively in areas such as cancer genomics, pharmacogenomics, and rare and undiagnosed diseases. In 2011 NHGRI published its strategic vision for the future of genomic research, including an ambitious research agenda to facilitate and promote the implementation of genomic medicine. To realize this agenda, NHGRI is consulting and facilitating collaborations with the external research community through a series of "Genomic Medicine Meetings," under the guidance and leadership of the National Advisory Council on Human Genome Research. These meetings have identified and begun to address significant obstacles to implementation, such as lack of evidence of efficacy, limited availability of genomics expertise and testing, lack of standards, and difficulties in integrating genomic results into electronic medical records. The six research and dissemination initiatives comprising NHGRI's genomic research portfolio are designed to speed the evaluation and incorporation, where appropriate, of genomic technologies and findings into routine clinical care. Actual adoption of successful approaches in clinical care will depend upon the willingness, interest, and energy of professional societies, practitioners, patients, and payers to promote their responsible use and share their experiences in doing so. Published by Elsevier Ireland Ltd.
Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes

Energy Technology Data Exchange (ETDEWEB)

Ohm, Robin A.; Feau, Nicolas; Henrissat, Bernard; Schoch, Conrad L.; Horwitz, Benjamin A.; Barry, Kerrie W.; Condon, Bradford J.; Copeland, Alex C.; Dhillon, Braham; Glaser, Fabian; Hesse, Cedar N.; Kosti, Idit; LaButti, Kurt; Lindquist, Erika A.; Lucas, Susan; Salamov, Asaf A.; Bradshaw, Rosie E.; Ciuffetti, Lynda; Hamelin, Richard C.; Kema, Gert H. J.; Lawrence, Christopher; Scott, James A.; Spatafora, Joseph W.; Turgeon, B. Gillian; de Wit, Pierre J. G. M.; Zhong, Shaobin; Goodwin, Stephen B.; Grigoriev, Igor V.

2013-03-05

The class of Dothideomycetes is one of the largest and most diverse groups of fungi. Many are plant pathogens and pose a serious threat to agricultural crops that are grown for biofuel, food or feed. Most Dothideomycetes have only a single host plant, and related species can have very diverse hosts. Eighteen genomes of Dothideomycetes have currently been sequenced by the Joint Genome Institute and other sequencing centers. Here we describe the results of comparative analyses of the fungi in this group.
The IGNITE network: a model for genomic medicine implementation and research.

Science.gov (United States)

Weitzel, Kristin Wiisanen; Alexander, Madeline; Bernhardt, Barbara A; Calman, Neil; Carey, David J; Cavallari, Larisa H; Field, Julie R; Hauser, Diane; Junkins, Heather A; Levin, Phillip A; Levy, Kenneth; Madden, Ebony B; Manolio, Teri A; Odgis, Jacqueline; Orlando, Lori A; Pyeritz, Reed; Wu, R Ryanne; Shuldiner, Alan R; Bottinger, Erwin P; Denny, Joshua C; Dexter, Paul R; Flockhart, David A; Horowitz, Carol R; Johnson, Julie A; Kimmel, Stephen E; Levy, Mia A; Pollin, Toni I; Ginsburg, Geoffrey S

2016-01-05

Patients, clinicians, researchers and payers are seeking to understand the value of using genomic information (as reflected by genotyping, sequencing, family history or other data) to inform clinical decision-making. However, challenges exist to widespread clinical implementation of genomic medicine, a prerequisite for developing evidence of its real-world utility. To address these challenges, the National Institutes of Health-funded IGNITE (Implementing GeNomics In pracTicE; www.ignite-genomics.org) Network, comprised of six projects and a coordinating center, was established in 2013 to support the development, investigation and dissemination of genomic medicine practice models that seamlessly integrate genomic data into the electronic health record and that deploy tools for point of care decision making. IGNITE site projects are aligned in their purpose of testing these models, but individual projects vary in scope and design, including exploring genetic markers for disease risk prediction and prevention, developing tools for using family history data, incorporating pharmacogenomic data into clinical care, refining disease diagnosis using sequence-based mutation discovery, and creating novel educational approaches. This paper describes the IGNITE Network and member projects, including network structure, collaborative initiatives, clinical decision support strategies, methods for return of genomic test results, and educational initiatives for patients and providers. Clinical and outcomes data from individual sites and network-wide projects are anticipated to begin being published over the next few years. The IGNITE Network is an innovative series of projects and pilot demonstrations aiming to enhance translation of validated actionable genomic information into clinical settings and develop and use measures of outcome in response to genome-based clinical interventions using a pragmatic framework to provide early data and proofs of concept on the utility of these
Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium.

Science.gov (United States)

Machado, Henrique; Gram, Lone

2017-01-01

Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationships using several analyses (16S rRNA, MLSA, fur , amino-acid usage, ANI), which allowed us to identify two misidentified strains. Genome analyses also revealed occurrence of higher and lower GC content clades, correlating with phylogenetic clusters. Pan- and core-genome analysis revealed the conservation of 25% of the genome throughout the genus, with a large and open pan-genome. The major source of genomic diversity could be traced to the smaller chromosome and plasmids. Several of the physiological traits studied in the genus did not correlate with phylogenetic data. Since horizontal gene transfer (HGT) is often suggested as a source of genetic diversity and a potential driver of genomic evolution in bacterial species, we looked into evidence of such in Photobacterium genomes. Genomic islands were the source of genomic differences between strains of the same species. Also, we found transposase genes and CRISPR arrays that suggest multiple encounters with foreign DNA. Presence of genomic exchange traits was widespread and abundant in the genus, suggesting a role in genomic evolution. The high genetic variability and indications of genetic exchange make it difficult to elucidate genome evolutionary paths and raise the awareness of the roles of foreign DNA in the genomic evolution of environmental organisms.
Rodent malaria parasites : genome organization & comparative genomics

NARCIS (Netherlands)

Kooij, Taco W.A.

2006-01-01

The aim of the studies described in this thesis was to investigate the genome organization of rodent malaria parasites (RMPs) and compare the organization and gene content of the genomes of RMPs and the human malaria parasite P. falciparum. The release of the complete genome sequence of P.
Kernel-based whole-genome prediction of complex traits: a review.

Science.gov (United States)

Morota, Gota; Gianola, Daniel

2014-01-01

Prediction of genetic values has been a focus of applied quantitative genetics since the beginning of the 20th century, with renewed interest following the advent of the era of whole genome-enabled prediction. Opportunities offered by the emergence of high-dimensional genomic data fueled by post-Sanger sequencing technologies, especially molecular markers, have driven researchers to extend Ronald Fisher and Sewall Wright's models to confront new challenges. In particular, kernel methods are gaining consideration as a regression method of choice for genome-enabled prediction. Complex traits are presumably influenced by many genomic regions working in concert with others (clearly so when considering pathways), thus generating interactions. Motivated by this view, a growing number of statistical approaches based on kernels attempt to capture non-additive effects, either parametrically or non-parametrically. This review centers on whole-genome regression using kernel methods applied to a wide range of quantitative traits of agricultural importance in animals and plants. We discuss various kernel-based approaches tailored to capturing total genetic variation, with the aim of arriving at an enhanced predictive performance in the light of available genome annotation information. Connections between prediction machines born in animal breeding, statistics, and machine learning are revisited, and their empirical prediction performance is discussed. Overall, while some encouraging results have been obtained with non-parametric kernels, recovering non-additive genetic variation in a validation dataset remains a challenge in quantitative genetics.
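As a rough illustration of the kernel regression idea summarized in this abstract, the following sketch fits a kernel ridge regression with a Gaussian kernel to a toy marker matrix. It is a minimal sketch and not the authors' implementation: the function names, the bandwidth, the ridge parameter `lam`, and the simulated genotypes are all assumptions made for the example.

```python
import numpy as np

def gaussian_kernel(A, B, bandwidth):
    """Gaussian (RBF) kernel between the rows of A and the rows of B."""
    # Squared Euclidean distance between every pair of rows.
    sq = (A ** 2).sum(axis=1)[:, None] + (B ** 2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * bandwidth ** 2))

def fit_kernel_ridge(X, y, bandwidth, lam):
    """Solve (K + lam * I) alpha = y - mean(y) for the dual weights alpha."""
    K = gaussian_kernel(X, X, bandwidth)
    mu = y.mean()
    alpha = np.linalg.solve(K + lam * np.eye(len(y)), y - mu)
    return mu, alpha

def predict(X_train, X_new, mu, alpha, bandwidth):
    """Predict phenotypes for new genotypes from the fitted dual weights."""
    return mu + gaussian_kernel(X_new, X_train, bandwidth) @ alpha

# Toy data (hypothetical): 60 individuals, 40 markers coded 0/1/2,
# with a phenotype driven by an interaction between two markers.
rng = np.random.default_rng(0)
X = rng.integers(0, 3, size=(60, 40)).astype(float)
y = X[:, 0] * X[:, 1] + rng.normal(0.0, 0.1, size=60)

mu, alpha = fit_kernel_ridge(X[:50], y[:50], bandwidth=6.0, lam=0.5)
y_hat = predict(X[:50], X[50:], mu, alpha, bandwidth=6.0)
print("held-out predictions:", np.round(y_hat, 2))
```

In practice the bandwidth and `lam` would be tuned, for example by cross-validation; the values above are placeholders.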
Kernel-based whole-genome prediction of complex traits: a review

Directory of Open Access Journals (Sweden)

Gota Morota

2014-10-01

Prediction of genetic values has been a focus of applied quantitative genetics since the beginning of the 20th century, with renewed interest following the advent of the era of whole genome-enabled prediction. Opportunities offered by the emergence of high-dimensional genomic data fueled by post-Sanger sequencing technologies, especially molecular markers, have driven researchers to extend Ronald Fisher and Sewall Wright's models to confront new challenges. In particular, kernel methods are gaining consideration as a regression method of choice for genome-enabled prediction. Complex traits are presumably influenced by many genomic regions working in concert with others (clearly so when considering pathways), thus generating interactions. Motivated by this view, a growing number of statistical approaches based on kernels attempt to capture non-additive effects, either parametrically or non-parametrically. This review centers on whole-genome regression using kernel methods applied to a wide range of quantitative traits of agricultural importance in animals and plants. We discuss various kernel-based approaches tailored to capturing total genetic variation, with the aim of arriving at an enhanced predictive performance in the light of available genome annotation information. Connections between prediction machines born in animal breeding, statistics, and machine learning are revisited, and their empirical prediction performance is discussed. Overall, while some encouraging results have been obtained with non-parametric kernels, recovering non-additive genetic variation in a validation dataset remains a challenge in quantitative genetics.
Next generation tools for genomic data generation, distribution, and visualization

Directory of Open Access Journals (Sweden)

Nix David A

2010-09-01

Background: With the rapidly falling cost and increasing availability of high throughput sequencing and microarray technologies, the bottleneck for effectively using genomic analysis in the laboratory and clinic is shifting to one of effectively managing, analyzing, and sharing genomic data. Results: Here we present three open-source, platform independent, software tools for generating, analyzing, distributing, and visualizing genomic data. These include a next generation sequencing/microarray LIMS and analysis project center (GNomEx); an application for annotating and programmatically distributing genomic data using the community vetted DAS/2 data exchange protocol (GenoPub); and a standalone Java Swing application (GWrap) that makes cutting edge command line analysis tools available to those who prefer graphical user interfaces. Both GNomEx and GenoPub use the rich client Flex/Flash web browser interface to interact with Java classes and a relational database on a remote server. Both employ a public-private user-group security model enabling controlled distribution of patient and unpublished data alongside public resources. As such, they function as genomic data repositories that can be accessed manually or programmatically through DAS/2-enabled client applications such as the Integrated Genome Browser. Conclusions: These tools have gained wide use in our core facilities, research laboratories and clinics and are freely available for non-profit use. See http://sourceforge.net/projects/gnomex/, http://sourceforge.net/projects/genoviz/, and http://sourceforge.net/projects/useq.
Genome size analyses of Pucciniales reveal the largest fungal genomes.

Science.gov (United States)

Tavares, Sílvia; Ramos, Ana Paula; Pires, Ana Sofia; Azinheira, Helena G; Caldeirinha, Patrícia; Link, Tobias; Abranches, Rita; Silva, Maria do Céu; Voegele, Ralf T; Loureiro, João; Talhinhas, Pedro

2014-01-01

Rust fungi (Basidiomycota, Pucciniales) are biotrophic plant pathogens which exhibit diverse complexities in their life cycles and host ranges. The completion of genome sequencing of a few rust fungi has revealed the occurrence of large genomes. Sequencing efforts for other rust fungi have been hampered by uncertainty concerning their genome sizes. Flow cytometry was recently applied to estimate the genome size of a few rust fungi, and confirmed the occurrence of large genomes in this order (averaging 225.3 Mbp, while the average for Basidiomycota was 49.9 Mbp and was 37.7 Mbp for all fungi). In this work, we have used an innovative and simple approach to simultaneously isolate nuclei from the rust and its host plant in order to estimate the genome size of 30 rust species by flow cytometry. Genome sizes varied over 10-fold, from 70 to 893 Mbp, with an average genome size value of 380.2 Mbp. Compared to the genome sizes of over 1800 fungi, Gymnosporangium confusum possesses the largest fungal genome ever reported (893.2 Mbp). Moreover, even the smallest rust genome determined in this study is larger than the vast majority of fungal genomes (94%). The average genome size of the Pucciniales is now 305.5 Mbp, while the average Basidiomycota genome size has shifted to 70.4 Mbp and the average for all fungi has reached 44.2 Mbp. Although no correlation could be drawn between the genome sizes, the phylogenomics or the life cycle of rust fungi, it is interesting to note that rusts with Fabaceae hosts present genomes clearly larger than those with Poaceae hosts. Although this study comprises only a small fraction of the more than 7000 rust species described, it already seems evident that the Pucciniales represent a group where genome size expansion could be a common characteristic. This is in sharp contrast to sister taxa, placing this order in a relevant position in fungal genomics research.
Correlation exploration of metabolic and genomic diversity in rice

Directory of Open Access Journals (Sweden)

Shinozaki Kazuo

2009-12-01

Background: It is essential to elucidate the relationship between metabolic and genomic diversity to understand the genetic regulatory networks associated with the changing metabolo-phenotype among natural variation and/or populations. Recent innovations in metabolomics technologies allow us to grasp the comprehensive features of the metabolome. Metabolite quantitative trait analysis is a key approach for the identification of genetic loci involved in metabolite variation using segregated populations. Although several attempts have been made to find correlative relationships between genetic and metabolic diversity among natural populations in various organisms, it is still unclear whether it is possible to discover such correlations between each metabolite and the polymorphisms found at each chromosomal location. To assess the correlative relationship between the metabolic and genomic diversity found in rice accessions, we compared the distance matrices for these two "omics" patterns in the rice accessions. Results: We selected 18 accessions from the world rice collection based on their population structure. To determine the genomic diversity of the rice genome, we genotyped 128 restriction fragment length polymorphism (RFLP) markers to calculate the genetic distance among the accessions. To identify the variations in the metabolic fingerprint, a soluble extract from the seed grain of each accession was analyzed with one dimensional 1H-nuclear magnetic resonance (NMR). We found no correlation between global metabolic diversity and the phylogenetic relationships among the rice accessions (rs = 0.14) by analyzing the distance matrices (calculated from the pattern of the metabolic fingerprint in the 4.29- to 0.71-ppm 1H chemical shift) and the genetic distance on the basis of the RFLP markers. However, local correlation analysis between the distance matrices (derived from each 0.04-ppm integral region of the 1H chemical shift) against genetic
Soil, Groundwater, Surface Water, and Sediments of Kennedy Space Center, Florida: Background Chemical and Physical Characteristics

Science.gov (United States)

Shmalzer, Paul A.; Hensley, Melissa A.; Mota, Mario; Hall, Carlton R.; Dunlevy, Colleen A.

2000-01-01

This study documented the background chemical composition of soils, groundwater, surface water, and sediments of Kennedy Space Center. Two hundred soil samples were collected, 20 each in 10 soil classes. Fifty-one groundwater wells were installed in 4 subaquifers of the Surficial Aquifer and sampled; there were 24 shallow, 16 intermediate, and 11 deep wells. Forty surface water and sediment samples were collected in major watershed basins. All samples were away from sites of known contamination. Samples were analyzed for organochlorine pesticides, aroclors, chlorinated herbicides, polycyclic aromatic hydrocarbons (PAH), total metals, and other parameters. All aroclors (6) were below detection in all media. Some organochlorine pesticides were detected at very low frequencies in soil, sediment, and surface water. Chlorinated herbicides were detected at very low frequencies in soil and sediments. PAH occurred at low frequencies in soil, shallow groundwater, surface water, and sediments. Concentrations of some metals differed among soil classes, with subaquifers and depths, and among watershed basins for surface water but not sediments. Most of the variation in metal concentrations was natural, but agriculture had increased Cr, Cu, Mn, and Zn.

78 FR 31951 - Center for Scientific Review; Notice of Closed Meetings

Science.gov (United States)

2013-05-28

... Emphasis Panel; Fellowships: Genes, Genomes and Genetics. Date: June 24, 2013. Time: 10:00 a.m. to 7:00 p.m... personal privacy. Name of Committee: Center for Scientific Review Special Emphasis Panel; Program Project...; Small Business: Education, Psychology, and Biology in Health Behavior. Date: June 24-25, 2013. Time: 8...
[Complete genome sequencing and sequence analysis of BCG Tice].

Science.gov (United States)

Wang, Zhiming; Pan, Yuanlong; Wu, Jun; Zhu, Baoli

2012-10-04

The objective of this study is to obtain the complete genome sequence of Bacille Calmette-Guérin Tice (BCG Tice), in order to provide more information about the molecular biology of BCG Tice and to design more reasonable vaccines to prevent tuberculosis. We assembled the data from high-throughput sequencing with SOAPdenovo software, obtaining many contigs and scaffolds. Many sequence gaps and physical gaps remained as a result of regional low coverage and low quality. We designed primers at the ends of contigs and performed PCR amplification in order to link these contigs and scaffolds. By using various enzymes for PCR amplification, adjusting PCR reaction conditions, and sequencing constructed clones, all the gaps were closed. We obtained the complete genome sequence of BCG Tice and submitted it to GenBank at the National Center for Biotechnology Information (NCBI). The genome of BCG Tice is 4334064 base pairs in length, with a GC content of 65.65%. The problems and strategies during the finishing step of BCG Tice sequencing are described here, with the hope of affording some experience to those who are involved in the finishing step of genome sequencing. The microarray data were verified by our results.
Universal and idiosyncratic characteristic lengths in bacterial genomes

Science.gov (United States)

Junier, Ivan; Frémont, Paul; Rivoire, Olivier

2018-05-01

In condensed matter physics, simplified descriptions are obtained by coarse-graining the features of a system at a certain characteristic length, defined as the typical length beyond which some properties are no longer correlated. From a physics standpoint, in vitro DNA has thus a characteristic length of 300 base pairs (bp), the Kuhn length of the molecule beyond which correlations in its orientations are typically lost. From a biology standpoint, in vivo DNA has a characteristic length of 1000 bp, the typical length of genes. Since bacteria live in very different physico-chemical conditions and since their genomes lack translational invariance, whether larger, universal characteristic lengths exist is a non-trivial question. Here, we examine this problem by leveraging the large number of fully sequenced genomes available in public databases. By analyzing GC content correlations and the evolutionary conservation of gene contexts (synteny) in hundreds of bacterial chromosomes, we conclude that a fundamental characteristic length around 10–20 kb can be defined. This characteristic length reflects elementary structures involved in the coordination of gene expression, which are present all along the genome of nearly all bacteria. Technically, reaching this conclusion required us to implement methods that are insensitive to the presence of large idiosyncratic genomic features, which may co-exist along these fundamental universal structures.
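To make the notion of a correlation-based characteristic length concrete, the sketch below computes GC content in fixed windows along a sequence and then the autocorrelation of that profile as a function of window lag. This is only a toy illustration under stated assumptions: the function names and the window size are hypothetical, and the abstract notes that the authors needed methods robust to large idiosyncratic genomic features, which this plain autocorrelation is not.

```python
import numpy as np

def gc_profile(seq, window):
    """GC fraction in consecutive, non-overlapping windows of a DNA string."""
    seq = seq.upper()
    n_windows = len(seq) // window
    return np.array([
        sum(base in "GC" for base in seq[i * window:(i + 1) * window]) / window
        for i in range(n_windows)
    ])

def autocorrelation(x, max_lag):
    """Normalized autocorrelation of x for lags 0..max_lag (x must not be constant)."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    var = np.dot(x, x) / len(x)
    return np.array([
        np.dot(x[:len(x) - k], x[k:]) / ((len(x) - k) * var)
        for k in range(max_lag + 1)
    ])

# Toy usage on a random sequence (no real genome is used here).
rng = np.random.default_rng(0)
toy_seq = "".join(rng.choice(list("ACGT"), size=20000))
profile = gc_profile(toy_seq, window=500)
print(np.round(autocorrelation(profile, max_lag=5), 3))
```

The lag at which the autocorrelation decays toward zero, multiplied by the window size, gives a crude estimate of a correlation length in base pairs.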
Small molecules enhance CRISPR genome editing in pluripotent stem cells.

Science.gov (United States)

Yu, Chen; Liu, Yanxia; Ma, Tianhua; Liu, Kai; Xu, Shaohua; Zhang, Yu; Liu, Honglei; La Russa, Marie; Xie, Min; Ding, Sheng; Qi, Lei S

2015-02-05

The bacterial CRISPR-Cas9 system has emerged as an effective tool for sequence-specific gene knockout through non-homologous end joining (NHEJ), but it remains inefficient for precise editing of genome sequences. Here we develop a reporter-based screening approach for high-throughput identification of chemical compounds that can modulate precise genome editing through homology-directed repair (HDR). Using our screening method, we have identified small molecules that can enhance CRISPR-mediated HDR efficiency, 3-fold for large fragment insertions and 9-fold for point mutations. Interestingly, we have also observed that a small molecule that inhibits HDR can enhance frame shift insertion and deletion (indel) mutations mediated by NHEJ. The identified small molecules function robustly in diverse cell types with minimal toxicity. The use of small molecules provides a simple and effective strategy to enhance precise genome engineering applications and facilitates the study of DNA repair mechanisms in mammalian cells. Copyright © 2015 Elsevier Inc. All rights reserved.
Journal of Chemical Sciences | Indian Academy of Sciences

Indian Academy of Sciences (India)

... YU1 JUAN QIN2 JIAQI ZHANG1 FULIN CHENG1 WENLIANG WU1. College of Chemical Engineering, Nanjing Tech University, Nanjing 210009, People's Republic of China; Technology and Finance Service Center of Jiangsu Province, Productivity Center of Jiangsu Province, Nanjing 210042, People's Republic of China ...
RPAN: rice pan-genome browser for ∼3000 rice genomes.

Science.gov (United States)

Sun, Chen; Hu, Zhiqiang; Zheng, Tianqing; Lu, Kuangchen; Zhao, Yue; Wang, Wensheng; Shi, Jianxin; Wang, Chunchao; Lu, Jinyuan; Zhang, Dabing; Li, Zhikang; Wei, Chaochun

2017-01-25

A pan-genome is the union of the gene sets of all the individuals of a clade or a species and it provides a new dimension of genome complexity with the presence/absence variations (PAVs) of genes among these genomes. With the progress of sequencing technologies, pan-genome study is becoming affordable for eukaryotes with large-sized genomes. The Asian cultivated rice, Oryza sativa L., is one of the major food sources for the world and a model organism in plant biology. Recently, the 3000 Rice Genome Project (3K RGP) sequenced more than 3000 rice genomes with a mean sequencing depth of 14.3×, which provided a tremendous resource for rice research. In this paper, we present a genome browser, Rice Pan-genome Browser (RPAN), as a tool to search and visualize the rice pan-genome derived from 3K RGP. RPAN contains a database of the basic information of 3010 rice accessions, including genomic sequences, gene annotations, PAV information and gene expression data of the rice pan-genome. At least 12 000 novel genes absent in the reference genome were included. RPAN also provides multiple search and visualization functions. RPAN can be a rich resource for rice biology and rice breeding. It is available at http://cgm.sjtu.edu.cn/3kricedb/ or http://www.rmbreeding.cn/pan3k. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
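The definition given at the start of this abstract (a pan-genome as the union of gene sets, with presence/absence variations among genomes) can be sketched in a few lines. The accession names and gene identifiers below are invented for illustration and have no connection to the RPAN database or its interface.

```python
def pan_and_core(gene_sets):
    """Return (pan-genome, core genome) for a dict of accession -> gene collection."""
    sets = [set(genes) for genes in gene_sets.values()]
    pan = set().union(*sets)                         # genes present in at least one accession
    core = set.intersection(*sets) if sets else set()  # genes present in every accession
    return pan, core

def pav_matrix(gene_sets):
    """Presence/absence variation table: one 0/1 row per accession, columns sorted by gene."""
    pan, _ = pan_and_core(gene_sets)
    genes = sorted(pan)
    rows = {acc: [int(g in set(present)) for g in genes]
            for acc, present in gene_sets.items()}
    return genes, rows

# Hypothetical accessions and gene IDs.
accessions = {
    "acc1": {"geneA", "geneB", "geneC"},
    "acc2": {"geneB", "geneC", "geneD"},
    "acc3": {"geneB", "geneE"},
}
pan, core = pan_and_core(accessions)
genes, rows = pav_matrix(accessions)
print(sorted(pan), sorted(core))
for acc, row in rows.items():
    print(acc, row)
```

Genes that are in the pan-genome but absent from a chosen reference accession correspond to the "novel genes absent in the reference genome" mentioned above.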
Identification of genomic sites for CRISPR/Cas9-based genome editing in the Vitis vinifera genome

Science.gov (United States)

CRISPR/Cas9 has been recently demonstrated as an effective and popular genome editing tool for modifying genomes of human, animals, microorganisms, and plants. Success of such genome editing is highly dependent on the availability of suitable target sites in the genomes to be edited. Many specific t...
Genomics Portals: integrative web-platform for mining genomics data.

Science.gov (United States)

Shinde, Kaustubh; Phatak, Mukta; Johannes, Freudenberg M; Chen, Jing; Li, Qian; Vineet, Joshi K; Hu, Zhen; Ghosh, Krishnendu; Meller, Jaroslaw; Medvedovic, Mario

2010-01-13

A large amount of experimental data generated by modern high-throughput technologies is available through various public repositories. Our knowledge about molecular interaction networks, functional biological pathways and transcriptional regulatory modules is rapidly expanding, and is being organized in lists of functionally related genes. Jointly, these two sources of information hold a tremendous potential for gaining new insights into functioning of living systems. Genomics Portals platform integrates access to an extensive knowledge base and a large database of human, mouse, and rat genomics data with basic analytical visualization tools. It provides the context for analyzing and interpreting new experimental data and the tool for effective mining of a large number of publicly available genomics datasets stored in the back-end databases. The uniqueness of this platform lies in the volume and the diversity of genomics data that can be accessed and analyzed (gene expression, ChIP-chip, ChIP-seq, epigenomics, computationally predicted binding sites, etc), and the integration with an extensive knowledge base that can be used in such analysis. The integrated access to primary genomics data, functional knowledge and analytical tools makes Genomics Portals platform a unique tool for interpreting results of new genomics experiments and for mining the vast amount of data stored in the Genomics Portals backend databases. Genomics Portals can be accessed and used freely at http://GenomicsPortals.org.
i-Genome: A database to summarize oligonucleotide data in genomes

Directory of Open Access Journals (Sweden)

Chang Yu-Chung

2004-10-01

Background: Information on the occurrence of sequence features in genomes is crucial to comparative genomics, evolutionary analysis, the analyses of regulatory sequences and the quantitative evaluation of sequences. Computing the frequencies and the occurrences of a pattern in complete genomes is time-consuming. Results: The proposed database provides information about sequence features generated by exhaustively computing the sequences of the complete genome. The repetitive elements in the eukaryotic genomes, such as LINEs, SINEs, Alu and LTR, are obtained from Repbase. The database supports various complete genomes including human, yeast, worm, and 128 microbial genomes. Conclusions: This investigation presents and implements an efficient computational approach to accumulate the occurrences of oligonucleotides or patterns in complete genomes. A database is established to maintain the information of the sequence features, including the distributions of oligonucleotides, the gene distribution, the distribution of repetitive elements in genomes and the occurrences of the oligonucleotides. The database can provide a more effective and efficient way to access the repetitive features in genomes.
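The kind of computation this record describes, counting oligonucleotide frequencies and locating pattern occurrences in a sequence, can be sketched as follows. This is a naive in-memory illustration and not the i-Genome method; the function names are hypothetical, and a genome-scale tool would precompute and index these counts rather than scan on demand.

```python
from collections import Counter

def count_oligos(seq, k):
    """Count every length-k oligonucleotide in seq, skipping windows with non-ACGT symbols."""
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    return counts

def occurrences(seq, pattern):
    """Return the 0-based start positions of all (possibly overlapping) matches of pattern."""
    seq, pattern = seq.upper(), pattern.upper()
    hits = []
    start = seq.find(pattern)
    while start != -1:
        hits.append(start)
        start = seq.find(pattern, start + 1)
    return hits

toy = "ACGTACGTNACG"
print(count_oligos(toy, 3).most_common(3))
print(occurrences(toy, "ACG"))
```

Precomputing such counts once per genome and storing them is what makes later queries fast, which is the motivation stated in the abstract.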
The Perennial Ryegrass GenomeZipper: Targeted Use of Genome Resources for Comparative Grass Genomics

Science.gov (United States)

Pfeifer, Matthias; Martis, Mihaela; Asp, Torben; Mayer, Klaus F.X.; Lübberstedt, Thomas; Byrne, Stephen; Frei, Ursula; Studer, Bruno

2013-01-01

Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous assignment of 3,315 out of 8,876 previously unmapped genes to the respective chromosomes. In total, the GenomeZipper incorporates 4,035 conserved grass gene loci, which were used for the first genome-wide sequence divergence analysis between perennial ryegrass, barley, Brachypodium, rice, and sorghum. The perennial ryegrass GenomeZipper is an ordered, information-rich genome scaffold, facilitating map-based cloning and genome assembly in perennial ryegrass and closely related Poaceae species. It also represents a milestone in describing synteny between perennial ryegrass and fully sequenced model grass genomes, thereby increasing our understanding of genome organization and evolution in the most important temperate forage and turf grass species. PMID:23184232
Implementing genomics and pharmacogenomics in the clinic: The National Human Genome Research Institute’s genomic medicine portfolio

Science.gov (United States)

Manolio, Teri A.

2016-01-01

Increasing knowledge about the influence of genetic variation on human health and growing availability of reliable, cost-effective genetic testing have spurred the implementation of genomic medicine in the clinic. As defined by the National Human Genome Research Institute (NHGRI), genomic medicine uses an individual’s genetic information in his or her clinical care, and has begun to be applied effectively in areas such as cancer genomics, pharmacogenomics, and rare and undiagnosed diseases. In 2011 NHGRI published its strategic vision for the future of genomic research, including an ambitious research agenda to facilitate and promote the implementation of genomic medicine. To realize this agenda, NHGRI is consulting and facilitating collaborations with the external research community through a series of “Genomic Medicine Meetings,” under the guidance and leadership of the National Advisory Council on Human Genome Research. These meetings have identified and begun to address significant obstacles to implementation, such as lack of evidence of efficacy, limited availability of genomics expertise and testing, lack of standards, and difficulties in integrating genomic results into electronic medical records. The six research and dissemination initiatives comprising NHGRI’s genomic research portfolio are designed to speed the evaluation and incorporation, where appropriate, of genomic technologies and findings into routine clinical care. Actual adoption of successful approaches in clinical care will depend upon the willingness, interest, and energy of professional societies, practitioners, patients, and payers to promote their responsible use and share their experiences in doing so. PMID:27612677
Specialized microbial databases for inductive exploration of microbial genome sequences

Directory of Open Access Journals (Sweden)

Cabau Cédric

2005-02-01

Background: The enormous amount of genome sequence data calls for user-oriented databases to manage sequences and annotations. Queries must include search tools permitting function identification through exploration of related objects. Methods: The GenoList package for collecting and mining microbial genome databases has been rewritten using MySQL as the database management system. Functions that were not available in MySQL, such as nested subqueries, have been implemented. Results: Inductive reasoning in the study of genomes starts from "islands of knowledge", centered around genes with some known background. With this concept of "neighborhood" in mind, a modified version of the GenoList structure has been used for organizing sequence data from prokaryotic genomes of particular interest in China. GenoChore (http://bioinfo.hku.hk/genochore.html), a set of 17 specialized end-user-oriented microbial databases (including one instance of Microsporidia, Encephalitozoon cuniculi, a member of Eukarya), has been made publicly available. These databases allow the user to browse genome sequence and annotation data using standard queries. In addition, they provide a weekly update of searches against the world-wide protein sequence data libraries, allowing one to monitor annotation updates on genes of interest. Finally, they allow users to search for patterns in DNA or protein sequences, taking into account a clustering of genes into formal operons, as well as providing extra facilities to query sequences using predefined sequence patterns. Conclusion: This growing set of specialized microbial databases organizes data created by the first Chinese bacterial genome programs (ThermaList, Thermoanaerobacter tengcongensis; LeptoList, with two different genomes of Leptospira interrogans; and SepiList, Staphylococcus epidermidis) associated to related organisms for comparison.
Comparing Mycobacterium tuberculosis genomes using genome topology networks.

Science.gov (United States)

Jiang, Jianping; Gu, Jianlei; Zhang, Liang; Zhang, Chenyi; Deng, Xiao; Dou, Tonghai; Zhao, Guoping; Zhou, Yan

2015-02-14

Over the last decade, emerging research methods, such as comparative genomic analysis and phylogenetic study, have yielded new insights into genotypes and phenotypes of closely related bacterial strains. Several findings have revealed that genomic structural variations (SVs), including gene gain/loss, gene duplication and genome rearrangement, can lead to different phenotypes among strains, and an investigation of genes affected by SVs may extend our knowledge of the relationships between SVs and phenotypes in microbes, especially in pathogenic bacteria. In this work, we introduce a 'Genome Topology Network' (GTN) method based on gene homology and gene locations to analyze genomic SVs and perform phylogenetic analysis. Furthermore, the concept of 'unfixed ortholog' has been proposed, whose members are affected by SVs in genome topology among close species. To improve the precision of 'unfixed ortholog' recognition, a strategy to detect annotation differences and complete gene annotation was applied. To assess the GTN method, a set of thirteen complete M. tuberculosis genomes was analyzed as a case study. GTNs with two different gene homology-assigning methods were built, the Clusters of Orthologous Groups (COG) method and the orthoMCL clustering method, and two phylogenetic trees were constructed accordingly, which may provide additional insights into whole genome-based phylogenetic analysis. We obtained 24 unfixable COG groups, of which most members were related to immunogenicity and drug resistance, such as PPE-repeat proteins (COG5651) and transcriptional regulator TetR gene family members (COG1309). The GTN method has been implemented in PERL and released on our website. The tool can be downloaded from http://homepage.fudan.edu.cn/zhouyan/gtn/ , and allows re-annotating the 'lost' genes among closely related genomes, analyzing genes affected by SVs, and performing phylogenetic analysis. With this tool, many immunogenic-related and drug resistance-related genes
A universal genomic coordinate translator for comparative genomics.

Science.gov (United States)

Zamani, Neda; Sundström, Görel; Meadows, Jennifer R S; Höppner, Marc P; Dainat, Jacques; Lantz, Henrik; Haas, Brian J; Grabherr, Manfred G

2014-06-30

Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. Unambiguously identifying the orthology relationship between copies across multiple genomes can be resolved by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, the number of which increases as N^2 with the number of available genomes, N. Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes. Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across
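The idea of reaching a target genome indirectly, through a chain of pairwise maps rather than an all-to-all alignment, can be illustrated with a minimal sketch. This is not Kraken's implementation: the genome names, the block format (ungapped, same-strand blocks) and the breadth-first search are all assumptions made for illustration.

```python
from collections import deque

# Hypothetical pairwise synteny maps: (source, target) -> list of
# (source_start, source_end, target_start) ungapped, same-strand blocks.
PAIRWISE = {
    ("human", "mouse"): [(100, 200, 1100)],
    ("mouse", "rat"): [(1000, 1300, 5000)],
}

def translate_interval(blocks, start, end):
    """Map [start, end) through the first block that fully contains it."""
    for s, e, t in blocks:
        if s <= start and end <= e:
            offset = t - s
            return start + offset, end + offset
    return None

def find_route(source, target):
    """Breadth-first search over genomes linked by pairwise maps."""
    queue = deque([[source]])
    seen = {source}
    while queue:
        path = queue.popleft()
        if path[-1] == target:
            return path
        for a, b in PAIRWISE:
            if a == path[-1] and b not in seen:
                seen.add(b)
                queue.append(path + [b])
    return None

def translate(source, target, start, end):
    """Translate an interval by composing the maps along the route."""
    route = find_route(source, target)
    if route is None:
        return None
    interval = (start, end)
    for a, b in zip(route, route[1:]):
        interval = translate_interval(PAIRWISE[(a, b)], *interval)
        if interval is None:
            return None
    return interval

print(translate("human", "rat", 120, 150))
```

Only N-1 pairwise maps are needed to connect N genomes in this arrangement, which is the intuition behind avoiding the quadratic all-to-all cost.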
Functional Genome Mining for Metabolites Encoded by Large Gene Clusters through Heterologous Expression of a Whole-Genome Bacterial Artificial Chromosome Library in Streptomyces spp.

Science.gov (United States)

Xu, Min; Wang, Yemin; Zhao, Zhilong; Gao, Guixi; Huang, Sheng-Xiong; Kang, Qianjin; He, Xinyi; Lin, Shuangjun; Pang, Xiuhua; Deng, Zixin

2016-01-01

ABSTRACT Genome sequencing projects in the last decade revealed numerous cryptic biosynthetic pathways for unknown secondary metabolites in microbes, revitalizing drug discovery from microbial metabolites by approaches called genome mining. In this work, we developed a heterologous expression and functional screening approach for genome mining from genomic bacterial artificial chromosome (BAC) libraries in Streptomyces spp. We demonstrate mining from a strain of Streptomyces rochei, which is known to produce streptothricins and borrelidin, by expressing its BAC library in the surrogate host Streptomyces lividans SBT5, and screening for antimicrobial activity. In addition to the successful capture of the streptothricin and borrelidin biosynthetic gene clusters, we discovered two novel linear lipopeptides and their corresponding biosynthetic gene cluster, as well as a novel cryptic gene cluster for an unknown antibiotic from S. rochei. This high-throughput functional genome mining approach can be easily applied to other streptomycetes, and it is very suitable for the large-scale screening of genomic BAC libraries for bioactive natural products and the corresponding biosynthetic pathways. IMPORTANCE Microbial genomes encode numerous cryptic biosynthetic gene clusters for unknown small metabolites with potential biological activities. Several genome mining approaches have been developed to activate and bring these cryptic metabolites to biological tests for future drug discovery. Previous sequence-guided procedures relied on bioinformatic analysis to predict potentially interesting biosynthetic gene clusters. In this study, we describe an efficient approach based on heterologous expression and functional screening of a whole-genome library for the mining of bioactive metabolites from Streptomyces. The usefulness of this function-driven approach was demonstrated by the capture of four large biosynthetic gene clusters for metabolites of various chemical types, including
Evidence-based design and evaluation of a whole genome sequencing clinical report for the reference microbiology laboratory

Science.gov (United States)

Crisan, Anamaria; McKee, Geoffrey; Munzner, Tamara

2018-01-01

Background Microbial genome sequencing is now being routinely used in many clinical and public health laboratories. Understanding how to report complex genomic test results to stakeholders who may have varying familiarity with genomics—including clinicians, laboratorians, epidemiologists, and researchers—is critical to the successful and sustainable implementation of this new technology; however, there are no evidence-based guidelines for designing such a report in the pathogen genomics domain. Here, we describe an iterative, human-centered approach to creating a report template for communicating tuberculosis (TB) genomic test results. Methods We used Design Study Methodology—a human centered approach drawn from the information visualization domain—to redesign an existing clinical report. We used expert consults and an online questionnaire to discover various stakeholders’ needs around the types of data and tasks related to TB that they encounter in their daily workflow. We also evaluated their perceptions of and familiarity with genomic data, as well as its utility at various clinical decision points. These data shaped the design of multiple prototype reports that were compared against the existing report through a second online survey, with the resulting qualitative and quantitative data informing the final, redesigned, report. Results We recruited 78 participants, 65 of whom were clinicians, nurses, laboratorians, researchers, and epidemiologists involved in TB diagnosis, treatment, and/or surveillance. Our first survey indicated that participants were largely enthusiastic about genomic data, with the majority agreeing on its utility for certain TB diagnosis and treatment tasks and many reporting some confidence in their ability to interpret this type of data (between 58.8% and 94.1%, depending on the specific data type). When we compared our four prototype reports against the existing design, we found that for the majority (86.7%) of design
Evidence-based design and evaluation of a whole genome sequencing clinical report for the reference microbiology laboratory

Directory of Open Access Journals (Sweden)

Anamaria Crisan

2018-01-01

Background Microbial genome sequencing is now being routinely used in many clinical and public health laboratories. Understanding how to report complex genomic test results to stakeholders who may have varying familiarity with genomics (including clinicians, laboratorians, epidemiologists, and researchers) is critical to the successful and sustainable implementation of this new technology; however, there are no evidence-based guidelines for designing such a report in the pathogen genomics domain. Here, we describe an iterative, human-centered approach to creating a report template for communicating tuberculosis (TB) genomic test results. Methods We used Design Study Methodology, a human-centered approach drawn from the information visualization domain, to redesign an existing clinical report. We used expert consults and an online questionnaire to discover various stakeholders’ needs around the types of data and tasks related to TB that they encounter in their daily workflow. We also evaluated their perceptions of and familiarity with genomic data, as well as its utility at various clinical decision points. These data shaped the design of multiple prototype reports that were compared against the existing report through a second online survey, with the resulting qualitative and quantitative data informing the final, redesigned, report. Results We recruited 78 participants, 65 of whom were clinicians, nurses, laboratorians, researchers, and epidemiologists involved in TB diagnosis, treatment, and/or surveillance. Our first survey indicated that participants were largely enthusiastic about genomic data, with the majority agreeing on its utility for certain TB diagnosis and treatment tasks and many reporting some confidence in their ability to interpret this type of data (between 58.8% and 94.1%, depending on the specific data type). When we compared our four prototype reports against the existing design, we found that for the majority (86.7%) of
Genomic Sequencing of Single Microbial Cells from Environmental Samples

Energy Technology Data Exchange (ETDEWEB)

Ishoey, Thomas; Woyke, Tanja; Stepanauskas, Ramunas; Novotny, Mark; Lasken, Roger S.

2008-02-01

Recently developed techniques allow genomic DNA sequencing from single microbial cells [Lasken RS: Single-cell genomic sequencing using multiple displacement amplification, Curr Opin Microbiol 2007, 10:510-516]. Here, we focus on research strategies for putting these methods into practice in the laboratory setting. An immediate consequence of single-cell sequencing is that it provides an alternative to culturing organisms as a prerequisite for genomic sequencing. The microgram amounts of DNA required as template are amplified from a single bacterium by a method called multiple displacement amplification (MDA) avoiding the need to grow cells. The ability to sequence DNA from individual cells will likely have an immense impact on microbiology considering the vast numbers of novel organisms, which have been inaccessible unless culture-independent methods could be used. However, special approaches have been necessary to work with amplified DNA. MDA may not recover the entire genome from the single copy present in most bacteria. Also, some sequence rearrangements can occur during the DNA amplification reaction. Over the past two years many research groups have begun to use MDA, and some practical approaches to single-cell sequencing have been developed. We review the consensus that is emerging on optimum methods, reliability of amplified template, and the proper interpretation of 'composite' genomes which result from the necessity of combining data from several single-cell MDA reactions in order to complete the assembly. Preferred laboratory methods are considered on the basis of experience at several large sequencing centers where >70% of genomes are now often recovered from single cells. Methods are reviewed for preparation of bacterial fractions from environmental samples, single-cell isolation, DNA amplification by MDA, and DNA sequencing.
Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes Fungi

Energy Technology Data Exchange (ETDEWEB)

Ohm, Robin A.; Feau, Nicolas; Henrissat, Bernard; Schoch, Conrad L.; Horwitz, Benjamin A.; Barry, Kerrie W.; Condon, Bradford J.; Copeland, Alex C.; Dhillon, Braham; Glaser, Fabien; Hesse, Cedar N.; Kosti, Idit; LaButti, Kurt; Lindquist, Erika A.; Lucas, Susan; Salamov, Asaf A.; Bradshaw, Rosie E.; Ciuffetti, Lynda; Hamelin, Richard C.; Kema, Gert H. J.; Lawrence, Christopher; Scott, James A.; Spatafora, Joseph W.; Turgeon, B. Gillian; de Wit, Pierre J. G. M.; Zhong, Shaobin; Goodwin, Stephen B.; Grigoriev, Igor V.

2012-03-13

The class of Dothideomycetes is one of the largest and most diverse groups of fungi. Many are plant pathogens and pose a serious threat to agricultural crops grown for biofuel, food or feed. Most Dothideomycetes have only a single host and related species can have very diverse host plants. Eighteen genomes of Dothideomycetes have currently been sequenced by the Joint Genome Institute and other sequencing centers. Here we describe the results of comparative analyses of the fungi in this group.
Innovations in Undergraduate Chemical Biology Education.

Science.gov (United States)

Van Dyke, Aaron R; Gatazka, Daniel H; Hanania, Mariah M

2018-01-19

Chemical biology derives intellectual vitality from its scientific interface: applying chemical strategies and perspectives to biological questions. There is a growing need for chemical biologists to synergistically integrate their research programs with their educational activities to become holistic teacher-scholars. This review examines how course-based undergraduate research experiences (CUREs) are an innovative method to achieve this integration. Because CUREs are course-based, the review first offers strategies for creating a student-centered learning environment, which can improve students' outcomes. Exemplars of CUREs in chemical biology are then presented and organized to illustrate the five defining characteristics of CUREs: significance, scientific practices, discovery, collaboration, and iteration. Finally, strategies to overcome common barriers in CUREs are considered as well as future innovations in chemical biology education.

Genomics Portals: integrative web-platform for mining genomics data

Directory of Open Access Journals (Sweden)

Ghosh Krishnendu

2010-01-01

Background A large amount of experimental data generated by modern high-throughput technologies is available through various public repositories. Our knowledge about molecular interaction networks, functional biological pathways and transcriptional regulatory modules is rapidly expanding, and is being organized in lists of functionally related genes. Jointly, these two sources of information hold a tremendous potential for gaining new insights into the functioning of living systems. Results The Genomics Portals platform integrates access to an extensive knowledge base and a large database of human, mouse, and rat genomics data with basic analytical visualization tools. It provides the context for analyzing and interpreting new experimental data and the tool for effective mining of a large number of publicly available genomics datasets stored in the back-end databases. The uniqueness of this platform lies in the volume and the diversity of genomics data that can be accessed and analyzed (gene expression, ChIP-chip, ChIP-seq, epigenomics, computationally predicted binding sites, etc.), and the integration with an extensive knowledge base that can be used in such analysis. Conclusion The integrated access to primary genomics data, functional knowledge and analytical tools makes the Genomics Portals platform a unique tool for interpreting results of new genomics experiments and for mining the vast amount of data stored in the Genomics Portals back-end databases. Genomics Portals can be accessed and used freely at http://GenomicsPortals.org.
Comparative genomic analysis of single-molecule sequencing and hybrid approaches for finishing the Clostridium autoethanogenum JA1-1 strain DSM 10061 genome

Energy Technology Data Exchange (ETDEWEB)

Brown, Steven D [ORNL; Nagaraju, Shilpa [LanzaTech; Utturkar, Sagar M [ORNL; De Tissera, Sashini [LanzaTech; Segovia, Simón [LanzaTech; Mitchell, Wayne [LanzaTech; Land, Miriam L [ORNL; Dassanayake, Asela [LanzaTech; Köpke, Michael [LanzaTech

2014-01-01

Background Clostridium autoethanogenum strain JA1-1 (DSM 10061) is an acetogen capable of fermenting CO, CO2 and H2 (e.g. from syngas or waste gases) into biofuel ethanol and commodity chemicals such as 2,3-butanediol. A draft genome sequence consisting of 100 contigs has been published. Results A closed, high-quality genome sequence for C. autoethanogenum DSM 10061 was generated using only the latest single-molecule DNA sequencing technology and without the need for manual finishing. It is assigned to the most complex genome classification based upon genome features such as repeats, prophage, and nine copies of the rRNA gene operons. It has a low G + C content of 31.1%. Illumina, 454, and Illumina/454 hybrid assemblies were generated and then compared to the draft and PacBio assemblies using summary statistics, the CGAL, QUAST and REAPR bioinformatics tools, and comparative genomic approaches. Assemblies based upon shorter-read DNA technologies were confounded by the large number of repeats and their size, which in the case of the rRNA gene operons was ~5 kb. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) systems among biotechnologically relevant Clostridia were classified and related to plasmid content and prophages. Potential associations between plasmid content and CRISPR systems may have implications for historical industrial scale Acetone-Butanol-Ethanol (ABE) fermentation failures and future large scale bacterial fermentations. While C. autoethanogenum contains an active CRISPR system, no such system is present in the closely related Clostridium ljungdahlii DSM 13528. A common prophage inserted into the Arg-tRNA shared between the strains suggests a common ancestor. However, C. ljungdahlii contains several additional putative prophages and it has more than double the amount of prophage DNA compared to C. autoethanogenum. Other differences include important metabolic genes for central metabolism (such as an additional hydrogenase and the absence of a
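Two of the simplest assembly summary statistics are G + C content and contig N50. The sketch below shows how they are commonly computed. The abstract does not say which summary statistics were used, so N50 is an assumption, and the sequence and contig lengths are made-up toy values.

```python
def gc_content(sequence):
    """Fraction of G and C bases among the A/C/G/T bases of a sequence."""
    seq = sequence.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / counted

def n50(contig_lengths):
    """Length L such that contigs of length >= L cover at least half the assembly."""
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0

# Toy inputs, not data from the study.
print(gc_content("ATGCATAT"))
print(n50([10, 8, 5, 3, 2]))
```

A closed genome assembled into a single contig has an N50 equal to the genome length, which is one reason fragmented short-read assemblies compare poorly on this measure.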
Genome-scale metabolic models as platforms for strain design and biological discovery.

Science.gov (United States)

Mienda, Bashir Sajo

2017-07-01

Genome-scale metabolic models (GEMs) have been developed and used in guiding systems metabolic engineering strategies for strain design and development. This strategy has been used in fermentative production of bio-based industrial chemicals and fuels from alternative carbon sources. However, computer-aided hypothesis building using established algorithms and software platforms for biological discovery can be integrated into the pipeline for strain design strategy to create superior strains of microorganisms for targeted biosynthetic goals. Here, I describe an integrated workflow strategy using GEMs for strain design and biological discovery. Specific case studies of strain design and biological discovery using an Escherichia coli genome-scale model are presented and discussed. The integrated workflow presented herein, when applied carefully, would help guide future design strategies for high-performance microbial strains that have existing and forthcoming genome-scale metabolic models.
Correcting Inconsistencies and Errors in Bacterial Genome Metadata Using an Automated Curation Tool in Excel (AutoCurE).

Science.gov (United States)

Schmedes, Sarah E; King, Jonathan L; Budowle, Bruce

2015-01-01

Whole-genome data are invaluable for large-scale comparative genomic studies. Current sequencing technologies have made it feasible to sequence entire bacterial genomes relatively easily and quickly, with a substantially reduced cost per nucleotide and hence per genome. More than 3,000 bacterial genomes have been sequenced and are available at the finished status. Publicly available genomes can be readily downloaded; however, there are challenges to verify the specific supporting data contained within the download and to identify errors and inconsistencies that may be present within the organizational data content and metadata. AutoCurE, an automated tool for bacterial genome database curation in Excel, was developed to facilitate local database curation of supporting data that accompany downloaded genomes from the National Center for Biotechnology Information. AutoCurE provides an automated approach to curate local genomic databases by flagging inconsistencies or errors by comparing the downloaded supporting data to the genome reports to verify genome name, RefSeq accession numbers, the presence of archaea, BioProject/UIDs, and sequence file descriptions. Flags are generated for nine metadata fields if there are inconsistencies between the downloaded genomes and genome reports and if erroneous or missing data are evident. AutoCurE is an easy-to-use tool for local database curation for large-scale genome data prior to downstream analyses.
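The flagging step described above, comparing downloaded supporting data against a genome report field by field, can be sketched as follows. AutoCurE itself is an Excel tool; this Python version is only an illustration, and the field names and record values are hypothetical placeholders rather than its actual fields or real accessions.

```python
# Hypothetical field names chosen for illustration only.
FIELDS = ("genome_name", "refseq_accession", "bioproject")

def flag_record(downloaded, report, fields=FIELDS):
    """Return (field, reason) flags for missing or inconsistent metadata."""
    flags = []
    for field in fields:
        got = downloaded.get(field)
        expected = report.get(field)
        if not got:
            flags.append((field, "missing"))
        elif got != expected:
            flags.append((field, "mismatch"))
    return flags

# Made-up example records, not real database entries.
report = {
    "genome_name": "Example bacterium strain X",
    "refseq_accession": "ACC_0001",
    "bioproject": "PRJ_0001",
}
downloaded = {
    "genome_name": "Example bacterium strain X",
    "refseq_accession": "ACC_0002",
    "bioproject": "",
}

for field, reason in flag_record(downloaded, report):
    print(f"{field}: {reason}")
```

Returning flags instead of silently correcting values leaves the curation decision to the user, in line with the flag-based approach the abstract describes.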
Genomic selection: genome-wide prediction in plant improvement.

Science.gov (United States)

Desta, Zeratsion Abera; Ortiz, Rodomiro

2014-09-01

Association analysis is used to measure relations between markers and quantitative trait loci (QTL). Its estimation ignores genes with small effects that underpin quantitative traits. By contrast, genome-wide selection estimates marker effects across the whole genome on the target population based on a prediction model developed in the training population (TP). Whole-genome prediction models estimate all marker effects in all loci and capture small QTL effects. Here, we review several genomic selection (GS) models with respect to both the prediction accuracy and genetic gain from selection. Phenotypic selection or marker-assisted breeding protocols can be replaced by selection based on whole-genome predictions, in which phenotyping updates the model to build up the prediction accuracy. Copyright © 2014 Elsevier Ltd. All rights reserved.
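One common way to estimate all marker effects at once is ridge regression, which shrinks every effect toward zero rather than selecting a few significant markers. The sketch below is an illustration only, not a model taken from the review. It assumes centered phenotypes, numerically coded genotypes, and a penalty `lam` chosen by the user, and it solves (X^T X + lam*I) beta = X^T y on toy data.

```python
def ridge_marker_effects(X, y, lam):
    """Solve (X^T X + lam*I) beta = X^T y by Gauss-Jordan elimination."""
    n, p = len(X), len(X[0])
    # Augmented system [A | b] with A = X^T X + lam*I and b = X^T y.
    A = [[sum(X[k][i] * X[k][j] for k in range(n)) + (lam if i == j else 0.0)
          for j in range(p)]
         + [sum(X[k][i] * y[k] for k in range(n))]
         for i in range(p)]
    for col in range(p):
        # Partial pivoting for numerical stability.
        pivot = max(range(col, p), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        pv = A[col][col]
        A[col] = [v / pv for v in A[col]]
        for r in range(p):
            if r != col:
                factor = A[r][col]
                A[r] = [vr - factor * vc for vr, vc in zip(A[r], A[col])]
    return [A[i][p] for i in range(p)]

def predict(X, beta):
    """Genomic predictions: sum of marker codes times estimated effects."""
    return [sum(x * b for x, b in zip(row, beta)) for row in X]

# Toy training population: 3 individuals, 4 markers coded -1/0/1.
X_train = [[1, 0, -1, 1], [0, 1, 1, -1], [-1, -1, 0, 0]]
y_train = [1.5, -0.5, -1.0]
beta = ridge_marker_effects(X_train, y_train, lam=1.0)
print(predict([[1, 1, 0, -1]], beta))
```

With `lam` greater than zero the system stays solvable even when markers outnumber individuals, which is the usual situation in genomic selection.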
Complete Genome Sequence of the Probiotic Lactic Acid Bacterium Lactobacillus Rhamnosus

Directory of Open Access Journals (Sweden)

Samat Kozhakhmetov

2014-01-01

Introduction: Lactobacilli are bacteria commonly found in the gastrointestinal tract. Some species of this genus have probiotic properties. The most common of these is Lactobacillus rhamnosus, a microorganism generally regarded as safe (GRAS). It is also a homofermentative L-(+)-lactic acid producer. The genus Lactobacillus is characterized by an extraordinary degree of phenotypic and genotypic diversity. However, studies of the genus were conducted mostly with an unequally distributed, non-random choice of species for sequencing; thus, there is only one representative genome from the Lactobacillus rhamnosus clade available to date. The aim of this study was to characterize the genome sequences of selected strains of Lactobacilli. Methods: 109 samples were isolated from national domestic dairy products in the laboratory of the Center for Life Sciences. After screening isolates for probiotic properties, a highly active Lactobacillus spp. strain was chosen. Genomic DNA was extracted according to the manufacturer's protocol (Wizard® Genomic DNA Purification Kit). The highly active Lactobacillus strain was identified as Lactobacillus rhamnosus according to its morphological, cultural, physiological, and biochemical properties, and a genotypic analysis. Results: The genome of Lactobacillus rhamnosus was sequenced using the Roche 454 GS FLX (454 GS FLX) platform. The initial draft assembly was prepared from 14 large contigs (20 contigs in total) by the Newbler gsAssembler 2.3 (454 Life Sciences, Branford, CT). Conclusion: Full genome sequencing of selected strains of lactic acid bacteria was carried out during the study.
Genome-wide mapping of furfural tolerance genes in Escherichia coli.

Science.gov (United States)

Glebes, Tirzah Y; Sandoval, Nicholas R; Reeder, Philippa J; Schilling, Katherine D; Zhang, Min; Gill, Ryan T

2014-01-01

Advances in genomics have improved the ability to map complex genotype-to-phenotype relationships, like those required for engineering chemical tolerance. Here, we have applied the multiSCale Analysis of Library Enrichments (SCALEs; Lynch et al. (2007) Nat. Method.) approach to map, in parallel, the effect of increased dosage for >10^5 different fragments of the Escherichia coli genome onto furfural tolerance (furfural is a key toxin of lignocellulosic hydrolysate). Only 268 of >4,000 E. coli genes (∼6%) were enriched after growth selections in the presence of furfural. Several of the enriched genes were cloned and tested individually for their effect on furfural tolerance. Overexpression of thyA, lpcA, or groESL individually increased growth in the presence of furfural. Overexpression of lpcA, but not groESL or thyA, resulted in increased furfural reduction rate, a previously identified mechanism underlying furfural tolerance. We additionally show that plasmid-based expression of functional LpcA or GroESL is required to confer furfural tolerance. This study identifies new furfural tolerance genes, which can be applied in future strain design efforts focused on the production of fuels and chemicals from lignocellulosic hydrolysate.
Genome-wide mapping of furfural tolerance genes in Escherichia coli.

Directory of Open Access Journals (Sweden)

Tirzah Y Glebes

Advances in genomics have improved the ability to map complex genotype-to-phenotype relationships, like those required for engineering chemical tolerance. Here, we have applied the multiSCale Analysis of Library Enrichments (SCALEs; Lynch et al. (2007) Nat. Method.) approach to map, in parallel, the effect of increased dosage for >10^5 different fragments of the Escherichia coli genome onto furfural tolerance (furfural is a key toxin of lignocellulosic hydrolysate). Only 268 of >4,000 E. coli genes (∼6%) were enriched after growth selections in the presence of furfural. Several of the enriched genes were cloned and tested individually for their effect on furfural tolerance. Overexpression of thyA, lpcA, or groESL individually increased growth in the presence of furfural. Overexpression of lpcA, but not groESL or thyA, resulted in increased furfural reduction rate, a previously identified mechanism underlying furfural tolerance. We additionally show that plasmid-based expression of functional LpcA or GroESL is required to confer furfural tolerance. This study identifies new furfural tolerance genes, which can be applied in future strain design efforts focused on the production of fuels and chemicals from lignocellulosic hydrolysate.
University of Texas MD Anderson Cancer Center: High-Throughput Screening Identifying Driving Mutations in Endometrial Cancer | Office of Cancer Genomics

Science.gov (United States)

Recent advances in next-generation sequencing technology have enabled the unprecedented characterization of a full spectrum of somatic alterations in cancer genomes. Given the large numbers of somatic mutations typically detected by this approach, a key challenge in the downstream analysis is to distinguish “drivers” that functionally contribute to tumorigenesis from “passengers” that occur as the consequence of genomic instability.
OryzaGenome: Genome Diversity Database of Wild Oryza Species

KAUST Repository

Ohyanagi, Hajime

2015-11-18

The species in the genus Oryza, encompassing nine genome types and 23 species, are a rich genetic resource and may have applications in deeper genomic analyses aiming to understand the evolution of plant genomes. With the advancement of next-generation sequencing (NGS) technology, a flood of Oryza species reference genomes and genomic variation information has become available in recent years. This genomic information, combined with the comprehensive phenotypic information that we are accumulating in our Oryzabase, can serve as an excellent genotype-phenotype association resource for analyzing rice functional and structural evolution, and the associated diversity of the Oryza genus. Here we integrate our previous and future phenotypic/habitat information and newly determined genotype information into a united repository, named OryzaGenome, providing the variant information with hyperlinks to Oryzabase. The current version of OryzaGenome includes genotype information of 446 O. rufipogon accessions derived by imputation and of 17 accessions derived by imputation-free deep sequencing. Two variant viewers are implemented: SNP Viewer as a conventional genome browser interface and Variant Table as a text-based browser for precise inspection of each variant one by one. Portable VCF (variant call format) file or tab-delimited file download is also available. Following these SNP (single nucleotide polymorphism) data, reference pseudomolecules/scaffolds/contigs and genome-wide variation information for almost all of the closely and distantly related wild Oryza species from the NIG Wild Rice Collection will be available in future releases. All of the resources can be accessed through http://viewer.shigen.info/oryzagenome/.
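A downloaded VCF file can be inspected with a few lines of code. The sketch below reads the standard fixed VCF columns (CHROM, POS, ID, REF, ALT) and keeps single-nucleotide variants only. The sample lines are invented for illustration and do not come from OryzaGenome; real files may need additional handling, for example of symbolic alleles.

```python
def parse_vcf_line(line):
    """Parse the first five fixed, tab-separated VCF columns of a data line."""
    fields = line.rstrip("\n").split("\t")
    chrom, pos, variant_id, ref, alt = fields[:5]
    return {
        "chrom": chrom,
        "pos": int(pos),
        "id": variant_id,
        "ref": ref,
        "alt": alt.split(","),  # ALT may list several alleles
    }

def is_snp(record):
    """True if REF and every ALT allele are single bases."""
    bases = set("ACGT")
    return record["ref"] in bases and all(a in bases for a in record["alt"])

def read_snps(lines):
    """Yield SNP records, skipping header lines (starting with '#') and blanks."""
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue
        record = parse_vcf_line(line)
        if is_snp(record):
            yield record

# Invented example lines in VCF layout.
example = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "chr01\t1203\t.\tA\tG\t50\tPASS\t.",
    "chr01\t2210\t.\tAT\tA\t50\tPASS\t.",
]
for snp in read_snps(example):
    print(snp["chrom"], snp["pos"], snp["ref"], snp["alt"])
```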
Using Partial Genomic Fosmid Libraries for Sequencing Complete Organellar Genomes

Energy Technology Data Exchange (ETDEWEB)

McNeal, Joel R.; Leebens-Mack, James H.; Arumuganathan, K.; Kuehl, Jennifer V.; Boore, Jeffrey L.; dePamphilis, Claude W.

2005-08-26

Organellar genome sequences provide numerous phylogenetic markers and yield insight into organellar function and molecular evolution. These genomes are much smaller in size than their nuclear counterparts; thus, their complete sequencing is much less expensive than total nuclear genome sequencing, making broader phylogenetic sampling feasible. However, for some organisms it is challenging to isolate plastid DNA for sequencing using standard methods. To overcome these difficulties, we constructed partial genomic libraries from total DNA preparations of two heterotrophic and two autotrophic angiosperm species using fosmid vectors. We then used macroarray screening to isolate clones containing large fragments of plastid DNA. A minimum tiling path of clones comprising the entire genome sequence of each plastid was selected, and these clones were shotgun-sequenced and assembled into complete genomes. Although this method worked well for both heterotrophic and autotrophic plants, nuclear genome size had a dramatic effect on the proportion of screened clones containing plastid DNA and, consequently, the overall number of clones that must be screened to ensure full plastid genome coverage. This technique makes it possible to determine complete plastid genome sequences for organisms that defy other available organellar genome sequencing methods, especially those for which limited amounts of tissue are available.
Zebrafish embryos as models for embryotoxic and teratological effects of chemicals.

NARCIS (Netherlands)

Yang, Lixin; Ho, Nga Yu; Alshut, Rüdiger; Legradi, J.B.; Weiss, Carsten; Reischl, Markus; Mikut, Ralf; Liebel, Urban; Müller, Ferenc; Strähle, Uwe

2009-01-01

The experimental virtues of the zebrafish embryo, such as small size, development outside of the mother, and cheap maintenance of the adult, have made the zebrafish an excellent model for phenotypic genetic and, more recently, also chemical screens. The availability of a genome sequence and several thousand
Grass genomes

OpenAIRE

Bennetzen, Jeffrey L.; SanMiguel, Phillip; Chen, Mingsheng; Tikhonov, Alexander; Francki, Michael; Avramova, Zoya

1998-01-01

For the most part, studies of grass genome structure have been limited to the generation of whole-genome genetic maps or the fine structure and sequence analysis of single genes or gene clusters. We have investigated large contiguous segments of the genomes of maize, sorghum, and rice, primarily focusing on intergenic spaces. Our data indicate that much (>50%) of the maize genome is composed of interspersed repetitive DNAs, primarily nested retrotransposons that in...
Goodbye genome paper, hello genome report: the increasing popularity of 'genome announcements' and their impact on science.

Science.gov (United States)

Smith, David Roy

2017-05-01

Next-generation sequencing technologies have revolutionized genomics and altered the scientific publication landscape. Life-science journals abound with genome papers: peer-reviewed descriptions of newly sequenced chromosomes. Although they once filled the pages of Nature and Science, genome papers are now mostly relegated to journals with low impact factors. Some have forecast the death of the genome paper and argued that they are using up valuable resources and not advancing science. However, the publication rate of genome papers is on the rise. This increase is largely because some journals have created a new category of manuscript called genome reports, which are short, fast-tracked papers describing a chromosome sequence(s), its GenBank accession number and little else. In 2015, for example, more than 2000 genome reports were published, and 2016 is poised to bring even more. Here, I highlight the growing popularity of genome reports and discuss their merits, drawbacks and impact on science and the academic publication infrastructure. Genome reports can be excellent assets for the research community, but they are also being used as quick and easy routes to a publication, and in some instances they are not peer reviewed. One of the best arguments for genome reports is that they are a citable, user-generated genomic resource providing essential methodological and biological information, which may not be present in the sequence database. But they are expensive and time-consuming avenues for achieving such a goal. © The Author 2016. Published by Oxford University Press.
Towards a Life Cycle Based Chemical Alternative Assessment (LCAA)

DEFF Research Database (Denmark)

Jolliet, O.; Huang, L.; Overcash, Michael

2017-01-01

There is a need for an operational quantitative screening-level assessment of alternatives that is life-cycle based and able to serve both Life Cycle Assessment (LCA) and chemical alternatives assessment (CAA). This presentation therefore aims to develop and illustrate a new approach called “Life Cycle Based Chemical Alternative Assessment (LCAA)” that will quantify exposure and life cycle impacts consistently and efficiently over the main life cycle stages. The new LCAA approach is illustrated through a proof-of-concept case study of alternative plasticizers in vinyl flooring. The proposed LCAA approach combines the following elements: a) The manufacturing phase chemical inventory is based on the environmental genome of industrial products database, ensuring mass and energy balance, b) near-field exposure to consumer products during the use phase is determined based on the mass of chemical...
Tennessee Valley Authority National Fertilizer and Environmental Research Center

International Nuclear Information System (INIS)

Gautney, J.

1991-01-01

The National Fertilizer and Environmental Research Center (NFERC) is a unique part of the Tennessee Valley Authority (TVA), a government agency created by an Act of Congress in 1933. The Center, located in Muscle Shoals, Alabama, is a national laboratory for research, development, education and commercialization for fertilizers and related agricultural chemicals including their economic and environmentally safe use, renewable fuel and chemical technologies, alternatives for solving environmental/waste problems, and technologies which support national defense. NFERC projects in the pesticide waste minimization/treatment/disposal areas include "Model Site Demonstrations and Site Assessments," "Development of Waste Treatment and Site Remediation Technologies for Fertilizer/Agrichemical Dealers," "Development of a Dealer Information/Education Program," and "Constructed Wetlands."
Detection of genomic deletions in rice using oligonucleotide microarrays

Directory of Open Access Journals (Sweden)

Bordeos Alicia

2009-03-01

Background: The induction of genomic deletions by physical or chemical agents is an easy and inexpensive means to generate a genome-saturating collection of mutations. Different mutagens can be selected to ensure a mutant collection with a range of deletion sizes. This would allow identification of mutations in single genes or, alternatively, a deleted group of genes that might collectively govern a trait (e.g., quantitative trait loci, QTL). However, deletion mutants have not been widely used in functional genomics, because the mutated genes are not tagged and are therefore difficult to identify. Here, we present a microarray-based approach to identify deleted genomic regions in rice mutants selected from a large collection generated by gamma ray or fast neutron treatment. Our study focuses not only on the utility of this method for forward genetics, but also its potential as a reverse genetics tool through accumulation of hybridization data for a collection of deletion mutants harboring multiple genetic lesions. Results: We demonstrate that hybridization of labeled genomic DNA directly onto the Affymetrix Rice GeneChip® allows rapid localization of deleted regions in rice mutants. Deletions ranged in size from one gene model to ~500 kb and were predicted on all 12 rice chromosomes. The utility of the technique as a tool in forward genetics was demonstrated in combination with an allelic series of mutants to rapidly narrow the genomic region, and eventually identify a candidate gene responsible for a lesion mimic phenotype. Finally, the positions of mutations in 14 mutants were aligned onto the rice pseudomolecules in a user-friendly genome browser to allow for rapid identification of untagged mutations (http://irfgc.irri.org/cgi-bin/gbrowse/IR64_deletion_mutants/). Conclusion: We demonstrate the utility of oligonucleotide arrays to discover deleted genes in rice. The density and distribution of deletions suggests the feasibility of a
Materials Genome Initiative

Science.gov (United States)

Vickers, John

2015-01-01

The Materials Genome Initiative (MGI) project element is a cross-Center effort that is focused on the integration of computational tools to simulate manufacturing processes and materials behavior. These computational simulations will be utilized to gain understanding of processes and materials behavior to accelerate process development and certification to more efficiently integrate new materials in existing NASA projects and to lead to the design of new materials for improved performance. This NASA effort looks to collaborate with efforts at other government agencies and universities working under the national MGI. MGI plans to develop integrated computational/experimental/processing methodologies for accelerating discovery and insertion of materials to satisfy NASA's unique mission demands. The challenges include validated design tools that incorporate materials properties, processes, and design requirements; and materials process control to rapidly mature emerging manufacturing methods and develop certified manufacturing processes.
Dr. Janie Merkel is interviewed by Ryan Blum and Janice Friend.

Science.gov (United States)

Merkel, Janie

2007-12-01

Dr. Janie Merkel is the director of Yale's Chemical Genomics Screening Facility, a high-throughput screening laboratory that is part of the Yale University Center for Genomics and Proteomics. The Screening Facility connects Yale researchers with industry-quality robotic machinery and a diverse group of compound libraries, which have been used successfully to link therapeutic targets with potential therapies.
Probing Chromatin-modifying Enzymes with Chemical Tools

KAUST Repository

Fischle, Wolfgang

2016-02-04

Chromatin is the universal template of genetic information in all eukaryotic organisms. Chemical modifications of the DNA-packaging histone proteins and the DNA bases are crucial signaling events in directing the use and readout of eukaryotic genomes. The enzymes that install and remove these chromatin modifications as well as the proteins that bind these marks govern information that goes beyond the sequence of DNA. Therefore, these so-called epigenetic regulators are intensively studied and represent promising drug targets in modern medicine. We summarize and discuss recent advances in the field of chemical biology that have provided chromatin research with sophisticated tools for investigating the composition, activity, and target sites of chromatin modifying enzymes and reader proteins.

Plants' essential chemical elements

Science.gov (United States)

Kevin T. Smith

2007-01-01

Every garden center and hardware store sells fertilizer guaranteed to "feed" plants. In a strict sense, we can't feed plants. Food contains an energy source. Green plants capture solar energy and make their own food through photosynthesis! Photosynthesis and other metabolic processes require chemical elements in appropriate doses for plants to survive...
Impact of Genomics Platform and Statistical Filtering on Transcriptional Benchmark Doses (BMD) and Multiple Approaches for Selection of Chemical Point of Departure (PoD).

Directory of Open Access Journals (Sweden)

A Francina Webster

Many regulatory agencies are exploring ways to integrate toxicogenomic data into their chemical risk assessments. The major challenge lies in determining how to distill the complex data produced by high-content, multi-dose gene expression studies into quantitative information. It has been proposed that benchmark dose (BMD) values derived from toxicogenomics data be used as point of departure (PoD) values in chemical risk assessments. However, there is limited information regarding which genomics platforms are most suitable and how to select appropriate PoD values. In this study, we compared BMD values modeled from RNA sequencing-, microarray-, and qPCR-derived gene expression data from a single study, and explored multiple approaches for selecting a single PoD from these data. The strategies evaluated include several that do not require prior mechanistic knowledge of the compound for selection of the PoD, thus providing approaches for assessing data-poor chemicals. We used RNA extracted from the livers of female mice exposed to non-carcinogenic (0, 2 mg/kg/day, mkd) and carcinogenic (4, 8 mkd) doses of furan for 21 days. We show that transcriptional BMD values were consistent across technologies and highly predictive of the two-year cancer bioassay-based PoD. We also demonstrate that filtering data based on statistically significant changes in gene expression prior to BMD modeling creates more conservative BMD values. Taken together, this case study on mice exposed to furan demonstrates that high-content toxicogenomics studies produce robust data for BMD modelling that are minimally affected by inter-technology variability and highly predictive of cancer-based PoD doses.
Genome update: the 1000th genome - a cautionary tale

DEFF Research Database (Denmark)

Lagesen, Karin; Ussery, David; Wassenaar, Gertrude Maria

2010-01-01

There are now more than 1000 sequenced prokaryotic genomes deposited in public databases and available for analysis. Currently, although the sequence databases GenBank, DNA Database of Japan and EMBL are synchronized continually, there are slight differences in content at the genomes level for a variety of logistical reasons, including differences in format and loading errors, such as those caused by file transfer protocol interruptions. This means that the 1000th genome will be different in the various databases. Some of the data on the highly accessed web pages are inaccurate, leading to false conclusions, for example about the largest bacterial genome sequenced. Biological diversity is far greater than many have thought. For example, analysis of multiple Escherichia coli genomes has led to an estimate of around 45 000 gene families, more genes than are recognized in the human genome. Moreover...
Pre-genomic, genomic and post-genomic study of microbial communities involved in bioenergy.

Science.gov (United States)

Rittmann, Bruce E; Krajmalnik-Brown, Rosa; Halden, Rolf U

2008-08-01

Microorganisms can produce renewable energy in large quantities and without damaging the environment or disrupting food supply. The microbial communities must be robust and self-stabilizing, and their essential syntrophies must be managed. Pre-genomic, genomic and post-genomic tools can provide crucial information about the structure and function of these microbial communities. Applying these tools will help accelerate the rate at which microbial bioenergy processes move from intriguing science to real-world practice.
Chromosome-level genome map provides insights into diverse defense mechanisms in the medicinal fungus Ganoderma sinense

Science.gov (United States)

Zhu, Yingjie; Xu, Jiang; Sun, Chao; Zhou, Shiguo; Xu, Haibin; Nelson, David R.; Qian, Jun; Song, Jingyuan; Luo, Hongmei; Xiang, Li; Li, Ying; Xu, Zhichao; Ji, Aijia; Wang, Lizhi; Lu, Shanfa; Hayward, Alice; Sun, Wei; Li, Xiwen; Schwartz, David C.; Wang, Yitao; Chen, Shilin

2015-01-01

Fungi have evolved powerful genomic and chemical defense systems to protect themselves against genetic destabilization and other organisms. However, the precise molecular basis involved in fungal defense remains largely unknown in Basidiomycetes. Here the complete genome sequence, as well as DNA methylation patterns and small RNA transcriptomes, was analyzed to provide a holistic overview of secondary metabolism and defense processes in the model medicinal fungus, Ganoderma sinense. We reported the 48.96 Mb genome sequence of G. sinense, consisting of 12 chromosomes and encoding 15,688 genes. More than thirty gene clusters involved in the biosynthesis of secondary metabolites, as well as a large array of genes responsible for their transport and regulation were highlighted. In addition, components of genome defense mechanisms, namely repeat-induced point mutation (RIP), DNA methylation and small RNA-mediated gene silencing, were revealed in G. sinense. Systematic bioinformatic investigation of the genome and methylome suggested that RIP and DNA methylation combinatorially maintain G. sinense genome stability by inactivating invasive genetic material and transposable elements. The elucidation of the G. sinense genome and epigenome provides an unparalleled opportunity to advance our understanding of secondary metabolism and fungal defense mechanisms. PMID:26046933
Development of a Batch Fabrication Process for Chemical Nanosensors: Recent Advancements at NASA Glenn Research Center

Science.gov (United States)

Biaggi-Labiosa, Azlin M.

2014-01-01

A major objective in aerospace sensor development is to produce sensors that are small in size, easy to batch fabricate and low in cost, and have low power consumption. Chemical sensors involving nanostructured materials can provide these characteristics as well as the potential for the development of sensor systems with unique properties and improved performance. However, the fabrication and processing of nanostructures for sensor applications currently is limited by the ability to control their location on the sensor platform, which in turn hinders the progress for batch fabrication. This presentation will discuss the following: the development of a novel room temperature methane (CH4) sensor fabricated using porous tin oxide (SnO2) nanorods as the sensing material, the advantages of using nanomaterials in sensor designs, the challenges encountered with the integration of nanostructures into microsensor devices, and the different methods that have been attempted to address these challenges. An approach for the mass production of sensors with nanostructures using a method developed by our group at the NASA Glenn Research Center to control the alignment of nanostructures onto a sensor platform will also be described.
The Chthonomonas calidirosea Genome Is Highly Conserved across Geographic Locations and Distinct Chemical and Microbial Environments in New Zealand's Taupō Volcanic Zone.

Science.gov (United States)

Lee, Kevin C; Stott, Matthew B; Dunfield, Peter F; Huttenhower, Curtis; McDonald, Ian R; Morgan, Xochitl C

2016-06-15

Chthonomonas calidirosea T49(T) is a low-abundance, carbohydrate-scavenging, and thermophilic soil bacterium with a seemingly disorganized genome. We hypothesized that the C. calidirosea genome would be highly responsive to local selection pressure, resulting in the divergence of its genomic content, genome organization, and carbohydrate utilization phenotype across environments. We tested this hypothesis by sequencing the genomes of four C. calidirosea isolates obtained from four separate geothermal fields in the Taupō Volcanic Zone, New Zealand. For each isolation site, we measured physicochemical attributes and defined the associated microbial community by 16S rRNA gene sequencing. Despite their ecological and geographical isolation, the genome sequences showed low divergence (maximum, 1.17%). Isolate-specific variations included single-nucleotide polymorphisms (SNPs), restriction-modification systems, and mobile elements but few major deletions and no major rearrangements. The 50-fold variation in C. calidirosea relative abundance among the four sites correlated with site environmental characteristics but not with differences in genomic content. Conversely, the carbohydrate utilization profiles of the C. calidirosea isolates corresponded to the inferred isolate phylogenies, which only partially paralleled the geographical relationships among the sample sites. Genomic sequence conservation does not entirely parallel geographic distance, suggesting that stochastic dispersal and localized extinction, which allow for rapid population homogenization with little restriction by geographical barriers, are possible mechanisms of C. calidirosea distribution. This dispersal and extinction mechanism is likely not limited to C. calidirosea but may shape the populations and genomes of many other low-abundance free-living taxa. This study compares the genomic sequence variations and metabolisms of four strains of Chthonomonas calidirosea, a rare thermophilic bacterium from
Visualization of RNA structure models within the Integrative Genomics Viewer.

Science.gov (United States)

Busan, Steven; Weeks, Kevin M

2017-07-01

Analyses of the interrelationships between RNA structure and function are increasingly important components of genomic studies. The SHAPE-MaP strategy enables accurate RNA structure probing and realistic structure modeling of kilobase-length noncoding RNAs and mRNAs. Existing tools for visualizing RNA structure models are not suitable for efficient analysis of long, structurally heterogeneous RNAs. In addition, structure models are often advantageously interpreted in the context of other experimental data and gene annotation information, for which few tools currently exist. We have developed a module within the widely used and well supported open-source Integrative Genomics Viewer (IGV) that allows visualization of SHAPE and other chemical probing data, including raw reactivities, data-driven structural entropies, and data-constrained base-pair secondary structure models, in context with linear genomic data tracks. We illustrate the usefulness of visualizing RNA structure in the IGV by exploring structure models for a large viral RNA genome, comparing bacterial mRNA structure in cells with its structure under cell- and protein-free conditions, and comparing a noncoding RNA structure modeled using SHAPE data with a base-pairing model inferred through sequence covariation analysis. © 2017 Busan and Weeks; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Genome Surfing As Driver of Microbial Genomic Diversity.

Science.gov (United States)

Choudoir, Mallory J; Panke-Buisse, Kevin; Andam, Cheryl P; Buckley, Daniel H

2017-08-01

Historical changes in population size, such as those caused by demographic range expansions, can produce nonadaptive changes in genomic diversity through mechanisms such as gene surfing. We propose that demographic range expansion of a microbial population capable of horizontal gene exchange can result in genome surfing, a mechanism that can cause widespread increase in the pan-genome frequency of genes acquired by horizontal gene exchange. We explain that patterns of genetic diversity within Streptomyces are consistent with genome surfing, and we describe several predictions for testing this hypothesis both in Streptomyces and in other microorganisms. Copyright © 2017 Elsevier Ltd. All rights reserved.
Genome and transcriptome analyses of the mountain pine beetle-fungal symbiont Grosmannia clavigera, a lodgepole pine pathogen.

Science.gov (United States)

DiGuistini, Scott; Wang, Ye; Liao, Nancy Y; Taylor, Greg; Tanguay, Philippe; Feau, Nicolas; Henrissat, Bernard; Chan, Simon K; Hesse-Orce, Uljana; Alamouti, Sepideh Massoumi; Tsui, Clement K M; Docking, Roderick T; Levasseur, Anthony; Haridas, Sajeet; Robertson, Gordon; Birol, Inanc; Holt, Robert A; Marra, Marco A; Hamelin, Richard C; Hirst, Martin; Jones, Steven J M; Bohlmann, Jörg; Breuil, Colette

2011-02-08

In western North America, the current outbreak of the mountain pine beetle (MPB) and its microbial associates has destroyed wide areas of lodgepole pine forest, including more than 16 million hectares in British Columbia. Grosmannia clavigera (Gc), a critical component of the outbreak, is a symbiont of the MPB and a pathogen of pine trees. To better understand the interactions between Gc, MPB, and lodgepole pine hosts, we sequenced the ∼30-Mb Gc genome and assembled it into 18 supercontigs. We predict 8,314 protein-coding genes, and support the gene models with proteome, expressed sequence tag, and RNA-seq data. We establish that Gc is heterothallic, and report evidence for repeat-induced point mutation. We report insights, from genome and transcriptome analyses, into how Gc tolerates conifer-defense chemicals, including oleoresin terpenoids, as it colonizes a host tree. RNA-seq data indicate that terpenoids induce a substantial antimicrobial stress in Gc, and suggest that the fungus may detoxify these chemicals by using them as a carbon source. Terpenoid treatment strongly activated a ∼100-kb region of the Gc genome that contains a set of genes that may be important for detoxification of these host-defense chemicals. This work is a major step toward understanding the biological interactions within the tripartite MPB/fungus/forest system.
Complete chloroplast genome sequence of a major allogamous forage species, perennial ryegrass (Lolium perenne L.).

Science.gov (United States)

Diekmann, Kerstin; Hodkinson, Trevor R; Wolfe, Kenneth H; van den Bekerom, Rob; Dix, Philip J; Barth, Susanne

2009-06-01

Lolium perenne L. (perennial ryegrass) is globally one of the most important forage and grassland crops. We sequenced the chloroplast (cp) genome of Lolium perenne cultivar Cashel. The L. perenne cp genome is 135 282 bp with a typical quadripartite structure. It contains genes for 76 unique proteins, 30 tRNAs and four rRNAs. As in other grasses, the genes accD, ycf1 and ycf2 are absent. The genome is of average size within its subfamily Pooideae and of medium size within the Poaceae. Genome size differences are mainly due to length variations in non-coding regions. However, considerable length differences of 1-27 codons in comparison of L. perenne to other Poaceae and 1-68 codons among all Poaceae were also detected. Within the cp genome of this outcrossing cultivar, 10 insertion/deletion polymorphisms and 40 single nucleotide polymorphisms were detected. Two of the polymorphisms involve tiny inversions within hairpin structures. By comparing the genome sequence with RT-PCR products of transcripts for 33 genes, 31 mRNA editing sites were identified, five of them unique to Lolium. The cp genome sequence of L. perenne is available under Accession number AM777385 at the European Molecular Biology Laboratory, National Center for Biotechnology Information and DNA DataBank of Japan.
Comparative genomic analysis of the thermophilic biomass-degrading fungi Myceliophthora thermophila and Thielavia terrestris

Energy Technology Data Exchange (ETDEWEB)

Berka, Randy M.; Grigoriev, Igor V.; Otillar, Robert; Salamov, Asaf; Grimwood, Jane; Reid, Ian; Ishmael, Nadeeza; John, Tricia; Darmond, Corinne; Moisan, Marie-Claude; Henrissat, Bernard; Coutinho, Pedro M.; Lombard, Vincent; Natvig, Donald O.; Lindquist, Erika; Schmutz, Jeremy; Lucas, Susan; Harris, Paul; Powlowski, Justin; Bellemare, Annie; Taylor, David; Butler, Gregory; de Vries, Ronald P.; Allijn, Iris E.; van den Brink, Joost; Ushinsky, Sophia; Storms, Reginald; Powell, Amy J.; Paulsen, Ian T.; Elbourne, Liam D. H.; Baker, Scott E.; Magnuson, Jon; LaBoissiere, Sylvie; Clutterbuck, A. John; Martinez, Diego; Wogulis, Mark; Lopez de Leon, Alfredo; Rey, Michael W.; Tsang, Adrian

2011-05-16

Thermostable enzymes and thermophilic cell factories may afford economic advantages in the production of many chemicals and biomass-based fuels. Here we describe and compare the genomes of two thermophilic fungi, Myceliophthora thermophila and Thielavia terrestris. To our knowledge, these genomes are the first described for thermophilic eukaryotes and the first complete telomere-to-telomere genomes for filamentous fungi. Genome analyses and experimental data suggest that both thermophiles are capable of hydrolyzing all major polysaccharides found in biomass. Examination of transcriptome data and secreted proteins suggests that the two fungi use shared approaches in the hydrolysis of cellulose and xylan but distinct mechanisms in pectin degradation. Characterization of the biomass-hydrolyzing activity of recombinant enzymes suggests that these organisms are highly efficient in biomass decomposition at both moderate and high temperatures. Furthermore, we present evidence suggesting that aside from representing a potential reservoir of thermostable enzymes, thermophilic fungi are amenable to manipulation using classical and molecular genetics.
Scientist, Single Cell Analysis Facility | Center for Cancer Research

Science.gov (United States)

The Cancer Research Technology Program (CRTP) develops and implements emerging technology, cancer biology expertise and research capabilities to accomplish NCI research objectives. The CRTP is an outward-facing, multi-disciplinary hub purposed to enable the external cancer research community and provides dedicated support to NCI’s intramural Center for Cancer Research (CCR). The dedicated units provide electron microscopy, protein characterization, protein expression, optical microscopy and next-generation sequencing. These research efforts are an integral part of CCR at the Frederick National Laboratory for Cancer Research (FNLCR). CRTP scientists also work collaboratively with intramural NCI investigators to provide research technologies and expertise. Key roles and responsibilities: We are seeking a highly motivated Scientist II to join the newly established Single Cell Analysis Facility (SCAF) of the Center for Cancer Research (CCR) at NCI. The SCAF will house state-of-the-art single cell sequencing technologies including 10x Genomics Chromium, BD Genomics Rhapsody, DEPArray, and other emerging single cell technologies. The Scientist will interact with close to 200 laboratories within the CCR to design and carry out single cell experiments for cancer research; will work on single cell isolation/preparation from various tissues and cells and related next-generation sequencing library preparation; and is expected to author publications in peer-reviewed scientific journals.
Interactive or static reports to guide clinical interpretation of cancer genomics.

Science.gov (United States)

Gray, Stacy W; Gagan, Jeffrey; Cerami, Ethan; Cronin, Angel M; Uno, Hajime; Oliver, Nelly; Lowenstein, Carol; Lederman, Ruth; Revette, Anna; Suarez, Aaron; Lee, Charlotte; Bryan, Jordan; Sholl, Lynette; Van Allen, Eliezer M

2018-05-01

Misinterpretation of complex genomic data presents a major challenge in the implementation of precision oncology. We sought to determine whether interactive genomic reports with embedded clinician education and optimized data visualization improved genomic data interpretation. We conducted a randomized, vignette-based survey study to determine whether exposure to interactive reports for a somatic gene panel, as compared to static reports, improves physicians' genomic comprehension and report-related satisfaction (overall scores calculated across 3 vignettes, range 0-18 and 1-4, respectively, higher score corresponding with improved endpoints). One hundred and five physicians at a tertiary cancer center participated (29% participation rate): 67% medical, 20% pediatric, 7% radiation, and 7% surgical oncology; 37% female. Prior to viewing the case-based vignettes, 34% of the physicians reported difficulty making treatment recommendations based on the standard static report. After vignette/report exposure, physicians' overall comprehension scores did not differ by report type (mean score: interactive 11.6 vs static 10.5, difference = 1.1, 95% CI, -0.3, 2.5, P = .13). However, physicians exposed to the interactive report were more likely to correctly assess sequencing quality (P < .001) and understand when reports needed to be interpreted with caution (eg, low tumor purity; P = .02). Overall satisfaction scores were higher in the interactive group (mean score 2.5 vs 2.1, difference = 0.4, 95% CI, 0.2-0.7, P = .001). Interactive genomic reports may improve physicians' ability to accurately assess genomic data and increase report-related satisfaction. Additional research in users' genomic needs and efforts to integrate interactive reports into electronic health records may facilitate the implementation of precision oncology.
Deep whole-genome sequencing of 90 Han Chinese genomes.

Science.gov (United States)

Lan, Tianming; Lin, Haoxiang; Zhu, Wenjuan; Laurent, Tellier Christian Asker Melchior; Yang, Mengcheng; Liu, Xin; Wang, Jun; Wang, Jian; Yang, Huanming; Xu, Xun; Guo, Xiaosen

2017-09-01

Next-generation sequencing provides a high-resolution insight into human genetic information. However, the focus of previous studies has primarily been on low-coverage data due to the high cost of sequencing. Although the 1000 Genomes Project and the Haplotype Reference Consortium have both provided powerful reference panels for imputation, low-frequency and novel variants remain difficult to discover and call with accuracy on the basis of low-coverage data. Deep sequencing provides an optimal solution for the problem of these low-frequency and novel variants. Although whole-exome sequencing is also a viable choice for exome regions, it cannot account for noncoding regions, sometimes resulting in the absence of important, causal variants. For Han Chinese populations, the majority of variants have been discovered based upon low-coverage data from the 1000 Genomes Project. However, high-coverage, whole-genome sequencing data are limited for any population, and a large amount of low-frequency, population-specific variants remain uncharacterized. We have performed whole-genome sequencing at a high depth (∼×80) of 90 unrelated individuals of Chinese ancestry, collected from the 1000 Genomes Project samples, including 45 Northern Han Chinese and 45 Southern Han Chinese samples. Eighty-three of these 90 have been sequenced by the 1000 Genomes Project. We have identified 12 568 804 single nucleotide polymorphisms, 2 074 210 short InDels, and 26 142 structural variations from these 90 samples. Compared to the Han Chinese data from the 1000 Genomes Project, we have found 7 000 629 novel variants with low frequency (defined as minor allele frequency genome. Compared to the 1000 Genomes Project, these Han Chinese deep sequencing data enhance the characterization of a large number of low-frequency, novel variants. This will be a valuable resource for promoting Chinese genetics research and medical development. Additionally, it will provide a valuable supplement to the 1000
Reconstruction and analysis of a genome-scale metabolic model for Scheffersomyces stipitis

Directory of Open Access Journals (Sweden)

Balagurunathan Balaji

2012-02-01

Background: Fermentation of xylose, the major component in hemicellulose, is essential for economic conversion of lignocellulosic biomass to fuels and chemicals. The yeast Scheffersomyces stipitis (formerly known as Pichia stipitis) has the highest known native capacity for xylose fermentation and possesses several genes for lignocellulose bioconversion in its genome. Understanding the metabolism of this yeast at a global scale, by reconstructing the genome-scale metabolic model, is essential for manipulating its metabolic capabilities and for successful transfer of its capabilities to other industrial microbes. Results: We present a genome-scale metabolic model for Scheffersomyces stipitis, a native xylose-utilizing yeast. The model was reconstructed based on genome sequence annotation, detailed experimental investigation and known yeast physiology. Macromolecular composition of Scheffersomyces stipitis biomass was estimated experimentally and its ability to grow on different carbon, nitrogen, sulphur and phosphorus sources was determined by phenotype microarrays. The compartmentalized model, developed based on an iterative procedure, accounted for 814 genes, 1371 reactions, and 971 metabolites. In silico computed growth rates were compared with high-throughput phenotyping data and the model could predict the qualitative outcomes in 74% of substrates investigated. Model simulations were used to identify the biosynthetic requirements for anaerobic growth of Scheffersomyces stipitis on glucose and the results were validated with published literature. The bottlenecks in the Scheffersomyces stipitis metabolic network for xylose uptake and nucleotide cofactor recycling were identified by in silico flux variability analysis. The scope of the model in enhancing the mechanistic understanding of microbial metabolism is demonstrated by identifying a mechanism for mitochondrial respiration and oxidative phosphorylation. Conclusion: The genome
Comparative Genome Analysis and Genome Evolution

NARCIS (Netherlands)

Snel, Berend

2002-01-01

This thesis describes a collection of bioinformatic analyses on complete genome sequence data. We have studied the evolution of gene content and find that vertical inheritance dominates over horizontal gene transfer, even to the extent that we can use the gene content to make genome phylogenies.
Malaria Parasite Metabolic Pathways (MPMP) Upgraded with Targeted Chemical Compounds

KAUST Repository

Ginsburg, Hagai

2015-10-31

Malaria Parasite Metabolic Pathways (MPMP) is the website for the functional genomics of intraerythrocytic Plasmodium falciparum. All the published information about targeted chemical compounds has now been added. Users can find the drug target and publication details linked to a drug database for further information about the medicinal properties of each compound.
Malaria Parasite Metabolic Pathways (MPMP) Upgraded with Targeted Chemical Compounds

KAUST Repository

Ginsburg, Hagai; Abdel-Haleem, Alyaa M.

2015-01-01

Malaria Parasite Metabolic Pathways (MPMP) is the website for the functional genomics of intraerythrocytic Plasmodium falciparum. All the published information about targeted chemical compounds has now been added. Users can find the drug target and publication details linked to a drug database for further information about the medicinal properties of each compound.
Genome projects and the functional-genomic era.

Science.gov (United States)

Sauer, Sascha; Konthur, Zoltán; Lehrach, Hans

2005-12-01

The problems we face today in public health as a result of the -- fortunately -- increasing age of people and the requirements of developing countries create an urgent need for new and innovative approaches in medicine and in agronomics. Genomic and functional genomic approaches have a great potential to at least partially solve these problems in the future. Important progress has been made by procedures to decode genomic information of humans, but also of other key organisms. The basic comprehension of genomic information (and its transfer) should now give us the possibility to pursue the next important step in life science eventually leading to a basic understanding of biological information flow; the elucidation of the function of all genes and correlative products encoded in the genome, as well as the discovery of their interactions in a molecular context and the response to environmental factors. As a result of the sequencing projects, we are now able to ask important questions about sequence variation and can start to comprehensively study the function of expressed genes on different levels such as RNA, protein or the cell in a systematic context including underlying networks. In this article we review and comment on current trends in large-scale systematic biological research. A particular emphasis is put on technology developments that can provide means to accomplish the tasks of future lines of functional genomics.

GAAP: Genome-organization-framework-Assisted Assembly Pipeline for prokaryotic genomes.

Science.gov (United States)

Yuan, Lina; Yu, Yang; Zhu, Yanmin; Li, Yulai; Li, Changqing; Li, Rujiao; Ma, Qin; Siu, Gilman Kit-Hang; Yu, Jun; Jiang, Taijiao; Xiao, Jingfa; Kang, Yu

2017-01-25

Next-generation sequencing (NGS) technologies have greatly promoted the genomic study of prokaryotes. However, highly fragmented assemblies due to short reads from NGS are still a limiting factor in gaining insights into the genome biology. Reference-assisted tools are promising in genome assembly, but tend to result in false assembly when the assigned reference has extensive rearrangements. Herein, we present GAAP, a genome assembly pipeline for scaffolding based on core-gene-defined Genome Organizational Framework (cGOF) described in our previous study. Instead of assigning references, we use the multiple-reference-derived cGOFs as indexes to assist in order and orientation of the scaffolds and build a skeleton structure, and then use read pairs to extend scaffolds, called local scaffolding, and distinguish between true and chimeric adjacencies in the scaffolds. In our performance tests using both empirical and simulated data of 15 genomes in six species with diverse genome size, complexity, and all three categories of cGOFs, GAAP outcompetes or achieves comparable results when compared to three other reference-assisted programs, AlignGraph, Ragout and MeDuSa. GAAP uses both cGOF and pair-end reads to create assemblies in genomic scale, and performs better than the currently available reference-assisted assembly tools as it recovers more assemblies and makes fewer false locations, especially for species with extensive rearranged genomes. Our method is a promising solution for reconstruction of genome sequence from short reads of NGS.
Genetic gatekeepers: regulating direct-to-consumer genomic services in an era of participatory medicine.

Science.gov (United States)

Palmer, Jessica Elizabeth

2012-01-01

Should consumers be able to obtain information about their own bodies, even if it has no proven medical value? Direct-to-consumer ("DTC") genomic companies offer consumers two services: generation of the consumer's personal genetic sequence, and interpretation of that sequence in light of current research. Concerned that consumers will misunderstand genomic information and make ill-advised health decisions, regulators, legislators and scholars have advocated restricted access to DTC genomic services. The Food and Drug Administration, which has historically refrained from regulating most genetic tests, has announced its intent to treat DTC genomic services as medical devices because they make "medical claims." This Article argues that FDA regulation of genomic services as medical devices would be counterproductive. Clinical laboratories conducting genetic tests are already overseen by a federal regime administered by the Centers for Medicare and Medicaid Services. While consumers and clinicians would benefit from clearer communication of test results and their health implications, FDA's gatekeeping framework is ill-suited to weigh the safety and efficacy of genomic information that is not medically actionable in traditional ways. Playing gatekeeper would burden FDA's resources, conflict with the patient-empowering policies promoted by personalized medicine initiatives, impair individuals' access to information in which they have powerful autonomy interests, weaken novel participatory research infrastructures, and set a poor precedent for the future regulation of medical information. Rather than applying its risk-based regulatory framework to genetic information, FDA should ameliorate regulatory uncertainty by working with the Federal Trade Commission and Centers for Medicare and Medicaid Services to ensure that DTC genomic services deliver analytically valid data, market and implement their services in a truthful manner, and fully disclose the limitations of their
The Perennial Ryegrass GenomeZipper – Targeted Use of Genome Resources for Comparative Grass Genomics

DEFF Research Database (Denmark)

Pfeiffer, Matthias; Martis, Mihaela; Asp, Torben

2013-01-01

Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous...
76 FR 38399 - Assessing the Current Research, Policy, and Practice Environment in Public Health Genomics

Science.gov (United States)

2011-06-30

... DEPARTMENT OF HEALTH AND HUMAN SERVICES Centers for Disease Control and Prevention [Docket Number CDC-2011-0008] Assessing the Current Research, Policy, and Practice Environment in Public Health... information helpful to assess the current research, policy, and practice environment in public health genomics...
Population Genomics of Infectious and Integrated Wolbachia pipientis Genomes in Drosophila ananassae

Science.gov (United States)

Choi, Jae Young; Bubnell, Jaclyn E.; Aquadro, Charles F.

2015-01-01

Coevolution between Drosophila and its endosymbiont Wolbachia pipientis has many intriguing aspects. For example, Drosophila ananassae hosts two forms of W. pipientis genomes: One being the infectious bacterial genome and the other integrated into the host nuclear genome. Here, we characterize the infectious and integrated genomes of W. pipientis infecting D. ananassae (wAna), by genome sequencing 15 strains of D. ananassae that have either the infectious or integrated wAna genomes. Results indicate evolutionarily stable maternal transmission for the infectious wAna genome suggesting a relatively long-term coevolution with its host. In contrast, the integrated wAna genome showed pseudogene-like characteristics accumulating many variants that are predicted to have deleterious effects if present in an infectious bacterial genome. Phylogenomic analysis of sequence variation together with genotyping by polymerase chain reaction of large structural variations indicated several wAna variants among the eight infectious wAna genomes. In contrast, only a single wAna variant was found among the seven integrated wAna genomes examined in lines from Africa, south Asia, and south Pacific islands suggesting that the integration occurred once from a single infectious wAna genome and then spread geographically. Further analysis revealed that for all D. ananassae we examined with the integrated wAna genomes, the majority of the integrated wAna genomic regions is represented in at least two copies suggesting a double integration or single integration followed by an integrated genome duplication. The possible evolutionary mechanism underlying the widespread geographical presence of the duplicate integration of the wAna genome is an intriguing question remaining to be answered. PMID:26254486
The Genome and Methylome of a Subsocial Small Carpenter Bee, Ceratina calcarata.

Science.gov (United States)

Rehan, Sandra M; Glastad, Karl M; Lawson, Sarah P; Hunt, Brendan G

2016-05-13

Understanding the evolution of animal societies, considered to be a major transition in evolution, is a key topic in evolutionary biology. Recently, new gateways for understanding social evolution have opened up due to advances in genomics, allowing for unprecedented opportunities in studying social behavior on a molecular level. In particular, highly eusocial insect species (caste-containing societies with nonreproductives that care for siblings) have taken center stage in studies of the molecular evolution of sociality. Despite advances in genomic studies of both solitary and eusocial insects, we still lack genomic resources for early insect societies. To study the genetic basis of social traits requires comparison of genomes from a diversity of organisms ranging from solitary to complex social forms. Here we present the genome of a subsocial bee, Ceratina calcarata. This study begins to address the types of genomic changes associated with the earliest origins of simple sociality using the small carpenter bee. Genes associated with lipid transport and DNA recombination have undergone positive selection in C. calcarata relative to other bee lineages. Furthermore, we provide the first methylome of a noneusocial bee. Ceratina calcarata contains the complete enzymatic toolkit for DNA methylation. As in the honey bee and many other holometabolous insects, DNA methylation is targeted to exons. The addition of this genome allows for new lines of research into the genetic and epigenetic precursors to complex social behaviors. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Prediction of maize phenotype based on whole-genome single nucleotide polymorphisms using deep belief networks

Science.gov (United States)

Rachmatia, H.; Kusuma, W. A.; Hasibuan, L. S.

2017-05-01

Selection in plant breeding could be more effective and more efficient if it were based on genomic data. Genomic selection (GS) is a new approach to plant-breeding selection that exploits genomic data through a mechanism called genomic prediction (GP). Most GP models use linear methods that ignore the effects of interactions among genes and of higher-order nonlinearities. The deep belief network (DBN), one of the architectures used in deep learning, is able to model data at a high level of abstraction that captures nonlinear effects in the data. This study implemented a DBN to develop a GP model using whole-genome single nucleotide polymorphisms (SNPs) as training and testing data. The case study was a set of traits in maize. The maize dataset was acquired from the Global Maize program of CIMMYT (International Maize and Wheat Improvement Center). Based on Pearson correlation, DBN outperformed the other methods, namely reproducing kernel Hilbert space (RKHS) regression, Bayesian LASSO (BL), and best linear unbiased predictor (BLUP), for traits presumed to be non-additive. DBN achieved a correlation of 0.579 on a scale of -1 to 1.
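
The abstract above scores genomic prediction models by the Pearson correlation between observed and predicted trait values. The following is a minimal sketch of that metric only, not the authors' evaluation code; the function name `pearson_r` and the sample values are hypothetical.

```python
import math


def pearson_r(observed, predicted):
    """Pearson correlation between observed and predicted phenotype values."""
    if len(observed) != len(predicted) or len(observed) < 2:
        raise ValueError("need two equally long sequences with at least 2 values")
    n = len(observed)
    mean_obs = sum(observed) / n
    mean_pred = sum(predicted) / n
    # Sum of cross-products and sums of squared deviations from the means.
    cov = sum((o - mean_obs) * (p - mean_pred) for o, p in zip(observed, predicted))
    var_obs = sum((o - mean_obs) ** 2 for o in observed)
    var_pred = sum((p - mean_pred) ** 2 for p in predicted)
    if var_obs == 0 or var_pred == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_obs * var_pred)


# Hypothetical trait values for five maize lines (illustrative, not from the study).
observed = [5.1, 6.3, 4.8, 7.0, 5.9]
predicted = [5.4, 6.0, 5.0, 6.5, 6.1]
print(round(pearson_r(observed, predicted), 3))
```

The result always lies between -1 and 1, which is the range the abstract refers to when it reports 0.579.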
Dramatic improvement in genome assembly achieved using doubled-haploid genomes.

Science.gov (United States)

Zhang, Hong; Tan, Engkong; Suzuki, Yutaka; Hirose, Yusuke; Kinoshita, Shigeharu; Okano, Hideyuki; Kudoh, Jun; Shimizu, Atsushi; Saito, Kazuyoshi; Watabe, Shugo; Asakawa, Shuichi

2014-10-27

Improvement in de novo assembly of large genomes is still to be desired. Here, we improved draft genome sequence quality by employing doubled-haploid individuals. We sequenced wildtype and doubled-haploid Takifugu rubripes genomes, under the same conditions, using the Illumina platform and assembled contigs with SOAPdenovo2. We observed 5.4-fold and 2.6-fold improvement in the sizes of the N50 contig and scaffold of doubled-haploid individuals, respectively, compared to the wildtype, indicating that the use of a doubled-haploid genome aids in accurate genome analysis.
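
The N50 statistic quoted above is the length of the shortest contig (or scaffold) in the smallest set of longest sequences that together cover at least half of the assembly. A minimal sketch of that standard definition follows; the function name `n50` and the example lengths are hypothetical and are not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical contig lengths; total is 32, and the two longest contigs cover 16.
contigs = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(contigs))  # prints 8
```

A fold improvement such as the 5.4-fold figure in the abstract is then the ratio of the two N50 values being compared.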
MIPS: analysis and annotation of genome information in 2007.

Science.gov (United States)

Mewes, H W; Dietmann, S; Frishman, D; Gregory, R; Mannhaupt, G; Mayer, K F X; Münsterkötter, M; Ruepp, A; Spannagl, M; Stümpflen, V; Rattei, T

2008-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) combines automatic processing of large amounts of sequences with manual annotation of selected model genomes. Due to the massive growth of the available data, the depth of annotation varies widely between independent databases. Also, the criteria for the transfer of information from known to orthologous sequences are diverse. Coping with the task of global in-depth genome annotation has become unfeasible. Therefore, our efforts are dedicated to three levels of annotation: (i) the curation of selected genomes, in particular from fungal and plant taxa (e.g. CYGD, MNCDB, MatDB), (ii) the comprehensive, consistent, automatic annotation employing exhaustive methods for the computation of sequence similarities and sequence-related attributes as well as the classification of individual sequences (SIMAP, PEDANT and FunCat) and (iii) the compilation of manually curated databases for protein interactions based on scrutinized information from the literature to serve as an accepted set of reliable annotated interaction data (MPACT, MPPI, CORUM). All databases and tools described as well as the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de).
Dana-Farber Cancer Institute: Identification of Therapeutic Targets Across Cancer Types | Office of Cancer Genomics

Science.gov (United States)

The Dana Farber Cancer Institute CTD2 Center focuses on the use of high-throughput genetic and bioinformatic approaches to identify and credential oncogenes and co-dependencies in cancers. This Center aims to provide the cancer research community with information that will facilitate the prioritization of targets based on both genomic and functional evidence, inform the most appropriate genetic context for downstream mechanistic and validation studies, and enable the translation of this information into therapeutics and diagnostics.
Cancer genomics

DEFF Research Database (Denmark)

Norrild, Bodil; Guldberg, Per; Ralfkiær, Elisabeth Methner

2007-01-01

Almost all cells in the human body contain a complete copy of the genome with an estimated number of 25,000 genes. The sequences of these genes make up about three percent of the genome and comprise the inherited set of genetic information. The genome also contains information that determines whe...
Molecular Assemblies, Genes and Genomics Integrated Efficiently (MAGGIE)

Energy Technology Data Exchange (ETDEWEB)

Baliga, Nitin S

2011-05-26

when applied to the manually curated training set. Applying this method to the data representing around a quarter of the fraction space for water soluble proteins in D. vulgaris, we obtained 854 reliable pair wise interactions. Further, we have developed algorithms to analyze and assign significance to protein interaction data from bait pull-down experiments and integrate these data with other systems biology data through associative biclustering in a parallel computing environment. We will 'fill-in' missing information in these interaction data using a 'Transitive Closure' algorithm and subsequently use 'Between Commonality Decomposition' algorithm to discover complexes within these large graphs of protein interactions. To characterize the metabolic activities of proteins and their complexes we are developing algorithms to deconvolute pure mass spectra, estimate chemical formula for m/z values, and fit isotopic fine structure to metabolomics data. We have discovered that in comparison to isotopic pattern fitting methods restricting the chemical formula by these two dimensions actually facilitates unique solutions for chemical formula generators. To understand how microbial functions are regulated we have developed complementary algorithms for reconstructing gene regulatory networks (GRNs). Whereas the network inference algorithms cMonkey and Inferelator developed enable de novo reconstruction of predictive models for GRNs from diverse systems biology data, the RegPrecise and RegPredict framework developed uses evolutionary comparisons of genomes from closely related organisms to reconstruct conserved regulons. We have integrated the two complementary algorithms to rapidly generate comprehensive models for gene regulation of understudied organisms. Our preliminary analyses of these reconstructed GRNs have revealed novel regulatory mechanisms and cis-regulatory motifs, as well as others that are conserved across species. Finally, we are
Large-scale chromosome folding versus genomic DNA sequences: A discrete double Fourier transform technique.

Science.gov (United States)

Chechetkin, V R; Lobzin, V V

2017-08-07

The use of state-of-the-art techniques combining imaging methods and high-throughput genomic mapping tools has led to significant progress in detailing the chromosome architecture of various organisms. However, a gap still remains between the rapidly growing structural data on the chromosome folding and the large-scale genome organization. Could a part of information on the chromosome folding be obtained directly from underlying genomic DNA sequences abundantly stored in the databanks? To answer this question, we developed an original discrete double Fourier transform (DDFT). DDFT serves for the detection of large-scale genome regularities associated with domains/units at the different levels of hierarchical chromosome folding. The method is versatile and can be applied to both genomic DNA sequences and corresponding physico-chemical parameters such as base-pairing free energy. The latter characteristic is closely related to the replication and transcription and can also be used for the assessment of temperature or supercoiling effects on the chromosome folding. We tested the method on the genome of E. coli K-12 and found good correspondence with the annotated domains/units established experimentally. As a brief illustration of further abilities of DDFT, the study of large-scale genome organization for bacteriophage PHIX174 and bacterium Caulobacter crescentus was also added. The combined experimental, modeling, and bioinformatic DDFT analysis should yield more complete knowledge on the chromosome architecture and genome organization. Copyright © 2017 Elsevier Ltd. All rights reserved.
Detection of genomic instability in normal human bronchial epithelial cells exposed to 238Pu

International Nuclear Information System (INIS)

Kennedy, C.H.; Fukushima, N.H.; Neft, R.E.; Lechner, J.F.

1994-01-01

Alpha particle-emitting radon daughters constitute a risk for development of lung cancer in humans. The development of this disease involves multiple genetic alterations. These changes and the time course they follow are not yet defined despite numerous in vitro endeavors to transform human lung cells with various physical or chemical agents. However, genomic instability, characterized both by structural and numerical chromosomal aberrations and by elevated rates of point mutations, is a common feature of tumor cells. Further, both types of genomic instability have been reported in the noncancerous progeny of normal murine hemopoietic cells exposed in vitro to α-particles. The purpose of this investigation was to determine if genomic instability is also a prominent feature of normal human bronchial epithelial cells exposed to α-particle irradiation from the decay of inhaled radon daughters
Personal genomics services: whose genomes?

Science.gov (United States)

Gurwitz, David; Bregman-Eschet, Yael

2009-07-01

New companies offering personal whole-genome information services over the internet are dynamic and highly visible players in the personal genomics field. For fees currently ranging from US$399 to US$2500 and a vial of saliva, individuals can now purchase online access to their individual genetic information regarding susceptibility to a range of chronic diseases and phenotypic traits based on a genome-wide SNP scan. Most of the companies offering such services are based in the United States, but their clients may come from nearly anywhere in the world. Although the scientific validity, clinical utility and potential future implications of such services are being hotly debated, several ethical and regulatory questions related to direct-to-consumer (DTC) marketing strategies of genetic tests have not yet received sufficient attention. For example, how can we minimize the risk of unauthorized third parties submitting other people's DNA for testing? Another pressing question concerns the ownership of (genotypic and phenotypic) information, as well as the unclear legal status of customers regarding their own personal information. Current legislation in the US and Europe falls short of providing clear answers to these questions. Until the regulation of personal genomics services catches up with the technology, we call upon commercial providers to self-regulate and coordinate their activities to minimize potential risks to individual privacy. We also point out some specific steps, along the trustee model, that providers of DTC personal genomics services as well as regulators and policy makers could consider for addressing some of the concerns raised below.
Photoconversion of F+ centers in neutron-irradiated MgO

International Nuclear Information System (INIS)

Monge, M.A.; Gonzalez, R.; Munoz Santiuste, J.E.; Pareja, R.; Chen, Y.; Kotomin, E.A.; Popov, A.I.

2000-01-01

In neutron-irradiated MgO crystals, experiments and theory demonstrate that photon excitation of the positively charged anion vacancies (F+ centers) at 5.0 eV releases holes that are subsequently trapped at V-type centers, which are cation vacancies charge-compensated by impurities, such as Al3+, F-, and OH- ions. A photoconversion mechanism occurs very likely via electron transfer to F+ centers from the quasi-local states which are induced in the valence band. INDO quantum chemical simulations of F+ centers confirmed the appearance of two induced quasi-local states located at 1.2 and 2.0 eV below the top of the valence band
Haplotype assembly in polyploid genomes and identical by descent shared tracts.

Science.gov (United States)

Aguiar, Derek; Istrail, Sorin

2013-07-01

Genome-wide haplotype reconstruction from sequence data, or haplotype assembly, is at the center of major challenges in molecular biology and life sciences. For complex eukaryotic organisms like humans, the genome is vast and the population samples are growing so rapidly that algorithms processing high-throughput sequencing data must scale favorably in terms of both accuracy and computational efficiency. Furthermore, current models and methodologies for haplotype assembly (i) do not consider individuals sharing haplotypes jointly, which reduces the size and accuracy of assembled haplotypes, and (ii) are unable to model genomes having more than two sets of homologous chromosomes (polyploidy). Polyploid organisms are increasingly becoming the target of many research groups interested in the genomics of disease, phylogenetics, botany and evolution but there is an absence of theory and methods for polyploid haplotype reconstruction. In this work, we present a number of results, extensions and generalizations of compass graphs and our HapCompass framework. We prove the theoretical complexity of two haplotype assembly optimizations, thereby motivating the use of heuristics. Furthermore, we present graph theory-based algorithms for the problem of haplotype assembly using our previously developed HapCompass framework for (i) novel implementations of haplotype assembly optimizations (minimum error correction), (ii) assembly of a pair of individuals sharing a haplotype tract identical by descent and (iii) assembly of polyploid genomes. We evaluate our methods on 1000 Genomes Project, Pacific Biosciences and simulated sequence data. HapCompass is available for download at http://www.brown.edu/Research/Istrail_Lab/. Supplementary data are available at Bioinformatics online.
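
The minimum error correction (MEC) objective mentioned above is commonly defined as the number of allele calls in the reads that must be flipped so that every read agrees with one of the candidate haplotypes. The sketch below scores a given set of haplotypes under that textbook definition; it is not the HapCompass implementation, and the names `mismatches` and `mec_score` and the toy reads are hypothetical.

```python
def mismatches(read, haplotype):
    """Count positions where a read disagrees with a haplotype.

    `read` maps SNP index -> allele; `haplotype` is a string of alleles.
    """
    return sum(1 for pos, allele in read.items() if haplotype[pos] != allele)


def mec_score(reads, haplotypes):
    """Sum, over all reads, of the distance to the closest haplotype."""
    return sum(min(mismatches(read, hap) for hap in haplotypes) for read in reads)


# Toy diploid example with four SNP sites (illustrative data only).
haplotypes = ["0101", "1010"]
reads = [
    {0: "0", 1: "1"},  # matches the first haplotype exactly
    {1: "0", 2: "1"},  # matches the second haplotype exactly
    {2: "0", 3: "0"},  # one error against either haplotype
]
print(mec_score(reads, haplotypes))  # prints 1
```

Because `haplotypes` can hold any number of strings, the same scoring function applies to the polyploid case; finding the haplotypes that minimize the score is the hard part the abstract addresses with heuristics.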
Insights into structural variations and genome rearrangements in prokaryotic genomes.

Science.gov (United States)

Periwal, Vinita; Scaria, Vinod

2015-01-01

Structural variations (SVs) are genomic rearrangements that affect fairly large fragments of DNA. Most of the SVs such as inversions, deletions and translocations have been largely studied in context of genetic diseases in eukaryotes. However, recent studies demonstrate that genome rearrangements can also have profound impact on prokaryotic genomes, leading to altered cell phenotype. In contrast to single-nucleotide variations, SVs provide a much deeper insight into organization of bacterial genomes at a much better resolution. SVs can confer change in gene copy number, creation of new genes, altered gene expression and many other functional consequences. High-throughput technologies have now made it possible to explore SVs at a much refined resolution in bacterial genomes. Through this review, we aim to highlight the importance of the less explored field of SVs in prokaryotic genomes and their impact. We also discuss its potential applicability in the emerging fields of synthetic biology and genome engineering where targeted SVs could serve to create sophisticated and accurate genome editing. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
LLNL Genomic Assessment: Viral and Bacterial Sequencing Needs for TMTI, Task 1.4.2 Report

Energy Technology Data Exchange (ETDEWEB)

Slezak, T; Borucki, M; Lam, M; Lenhoff, R; Vitalis, E

2010-01-26

Good progress has been made on both bacterial and viral sequencing by the TMTI centers. While access to appropriate samples is a limiting factor to throughput, excellent progress has been made with respect to getting agreements in place with key sources of relevant materials. Sharing of sequenced genomes funded by TMTI has been extremely limited to date. The April 2010 exercise should force a resolution to this, but additional managerial pressures may be needed to ensure that rapid sharing of TMTI-funded sequencing occurs, regardless of collaborator constraints concerning ultimate publication(s). Policies to permit TMTI-internal rapid sharing of sequenced genomes should be written into all TMTI agreements with collaborators now being negotiated. TMTI needs to establish a Web-based system for tracking samples destined for sequencing. This includes metadata on sample origins and contributor, information on sample shipment/receipt, prioritization by TMTI, assignment to one or more sequencing centers (including possible TMTI-sponsored sequencing at a contributor site), and status history of the sample sequencing effort. While this system could be a component of the AFRL system, it is not part of any current development effort. Policy and standardized procedures are needed to ensure appropriate verification of all TMTI samples prior to the investment in sequencing. PCR, arrays, and classical biochemical tests are examples of potential verification methods. Verification is needed to detect mislabeled, degraded, mixed or contaminated samples. Regular QC exercises are needed to ensure that the TMTI-funded centers are meeting all standards for producing quality genomic sequence data.
Limits of variation, specific infectivity, and genome packaging of massively recoded poliovirus genomes.

Science.gov (United States)

Song, Yutong; Gorbatsevych, Oleksandr; Liu, Ying; Mugavero, JoAnn; Shen, Sam H; Ward, Charles B; Asare, Emmanuel; Jiang, Ping; Paul, Aniko V; Mueller, Steffen; Wimmer, Eckard

2017-10-10

Computer design and chemical synthesis generated viable variants of poliovirus type 1 (PV1), whose ORF (6,189 nucleotides) carried up to 1,297 "Max" mutations (excess of overrepresented synonymous codon pairs) or up to 2,104 "SD" mutations (randomly scrambled synonymous codons). "Min" variants (excess of underrepresented synonymous codon pairs) are nonviable except for P2 Min, a variant temperature-sensitive at 33 and 39.5 °C. Compared with WT PV1, P2 Min displayed a vastly reduced specific infectivity (si) (WT, 1 PFU/118 particles vs. P2 Min, 1 PFU/35,000 particles), a phenotype that will be discussed broadly. Si of haploid PV presents cellular infectivity of a single genotype. We performed a comprehensive analysis of sequence and structures of the PV genome to determine if evolutionary conserved cis-acting packaging signal(s) were preserved after recoding. We showed that conserved synonymous sites and/or local secondary structures that might play a role in determining packaging specificity do not survive codon pair recoding. This makes it unlikely that numerous "cryptic, sequence-degenerate, dispersed RNA packaging signals mapping along the entire viral genome" [Patel N, et al. (2017) Nat Microbiol 2:17098] play the critical role in poliovirus packaging specificity. Considering all available evidence, we propose a two-step assembly strategy for +ssRNA viruses: step I, acquisition of packaging specificity, either (a) by specific recognition between capsid protein(s) and replication proteins (poliovirus), or (b) by the high affinity interaction of a single RNA packaging signal (PS) with capsid protein(s) (most +ssRNA viruses so far studied); step II, cocondensation of genome/capsid precursors in which an array of hairpin structures plays a role in virion formation.
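
The two specific-infectivity figures quoted above (1 PFU per 118 particles for WT and 1 PFU per 35,000 particles for P2 Min) can be turned into a fold reduction with simple arithmetic. The short sketch below works only from those two numbers; the variable names are hypothetical.

```python
# Particles needed per plaque-forming unit (PFU), as quoted in the abstract.
wt_particles_per_pfu = 118
p2min_particles_per_pfu = 35_000

# Specific infectivity is PFU per particle, so it is the reciprocal.
wt_si = 1 / wt_particles_per_pfu
p2min_si = 1 / p2min_particles_per_pfu

# Fold reduction in specific infectivity of P2 Min relative to WT.
fold_reduction = wt_si / p2min_si
print(f"{fold_reduction:.0f}-fold")  # prints 297-fold
```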

Parasite Genome Projects and the Trypanosoma cruzi Genome Initiative

Directory of Open Access Journals (Sweden)

Wim Degrave

1997-11-01

Since the start of the human genome project, a great number of genome projects on other "model" organisms have been initiated, some of them already completed. Several initiatives have also been started on parasite genomes, mainly through support from WHO/TDR, involving North-South and South-South collaborations, and there are great hopes that these initiatives will lead to new tools for disease control and prevention, as well as to the establishment of genomic research technology in developing countries. The Trypanosoma cruzi genome project, using the clone CL-Brener as starting point, has made considerable progress through the concerted action of more than 20 laboratories, most of them in the South. A brief overview of the current state of the project is given.
Short and long-term genome stability analysis of prokaryotic genomes.

Science.gov (United States)

Brilli, Matteo; Liò, Pietro; Lacroix, Vincent; Sagot, Marie-France

2013-05-08

Gene organization dynamics is actively studied because it provides useful evolutionary information, makes functional annotation easier and often enables the characterization of pathogens. There is therefore a strong interest in understanding the variability of this trait and the possible correlations with life-style. Two kinds of events affect genome organization: on one hand translocations and recombinations change the relative position of genes shared by two genomes (i.e. the backbone gene order); on the other, insertions and deletions leave the backbone gene order unchanged but they alter the gene neighborhoods by breaking the syntenic regions. A complete picture of genome organization evolution therefore requires accounting for both kinds of events. We developed an approach where we model chromosomes as graphs on which we compute different stability estimators; we consider genome rearrangements as well as the effect of gene insertions and deletions. In the first part of the paper, we fit a measure of backbone gene order conservation (hereinafter called backbone stability) against phylogenetic distance for over 3000 genome comparisons, improving existing models for the divergence in time of backbone stability. Intra- and inter-specific comparisons were treated separately to focus on different time-scales. The use of multiple genomes of the same species allowed us to identify genomes with diverging gene order with respect to their conspecifics. The inter-species analysis indicates that pathogens are more often unstable than non-pathogens. In the second part of the text, we show that in pathogens, gene content dynamics (insertions and deletions) have a much more dramatic effect on genome organization stability than backbone rearrangements. In this work, we studied genome organization divergence taking into account the contribution of both genome order rearrangements and genome content dynamics. By studying species with multiple sequenced genomes available, we were
Mapping Second Chromosome Mutations to Defined Genomic Regions in Drosophila melanogaster.

Science.gov (United States)

Kahsai, Lily; Cook, Kevin R

2018-01-04

Hundreds of Drosophila melanogaster stocks are currently maintained at the Bloomington Drosophila Stock Center with mutations that have not been associated with sequence-defined genes. They have been preserved because they have interesting loss-of-function phenotypes. The experimental value of these mutations would be increased by tying them to specific genomic intervals so that geneticists can more easily associate them with annotated genes. Here, we report the mapping of 85 second chromosome complementation groups in the Bloomington collection to specific, small clusters of contiguous genes or individual genes in the sequenced genome. This information should prove valuable to Drosophila geneticists interested in processes associated with particular phenotypes and those searching for mutations affecting specific sequence-defined genes. Copyright © 2018 Kahsai, Cook.
Mapping Second Chromosome Mutations to Defined Genomic Regions in Drosophila melanogaster

Directory of Open Access Journals (Sweden)

Lily Kahsai

2018-01-01

Hundreds of Drosophila melanogaster stocks are currently maintained at the Bloomington Drosophila Stock Center with mutations that have not been associated with sequence-defined genes. They have been preserved because they have interesting loss-of-function phenotypes. The experimental value of these mutations would be increased by tying them to specific genomic intervals so that geneticists can more easily associate them with annotated genes. Here, we report the mapping of 85 second chromosome complementation groups in the Bloomington collection to specific, small clusters of contiguous genes or individual genes in the sequenced genome. This information should prove valuable to Drosophila geneticists interested in processes associated with particular phenotypes and those searching for mutations affecting specific sequence-defined genes.
Development of radiation-induced mutation techniques and functional genomics studies

Energy Technology Data Exchange (ETDEWEB)

Kim, Dong Sub; Kang, Si Yong; Kim, Jin Baek [KAERI, Daejeon (Korea, Republic of); and others

2012-01-15

This project has been performed to develop plant genetic resources using radiation (gamma-rays, ion-beam, space environments), to conduct functional genomics studies with mutant resources, and to develop new radiation plant breeding techniques using various radiation sources during 3 years. In the first section, we developed flower genetic resources, functional crop resources, and bio-industrial plant resources. In the second section, we cloned several mutated genes and studied mechanisms of gene expression and genetic diversity of mutations induced by gamma-rays. In the third section, we developed new plant breeding techniques using gamma-phytotron, heavy ion-beam, and space environments. Based on these results, a total of 8 cultivars containing Chrysanthemum, Hibiscus, kenaf, rice, and soybean were applied for plant variety protection (PVP) and a total of 4 cultivars were registered for PVP. Also, license agreement for the dwarf type Hibiscus mutant 'Ggoma' was conducted with Supro co. and the manufacturing technology for natural antioxidant pear-grape vinegar was transferred into Enzenic co. Also, 8 gene sequences, such as F3'H and LDOX genes associated with flower color in Chrysanthemum and EPSPS gene from Korean lawn grass, were registered in the database of National Center for Biotechnology Information (NCBI). In the future study, we will develop new radiation mutation breeding techniques through the mutation spectrum induced by various radiation sources, the studies for mechanism of the cellular response to radiation, and the comparative, structural, and functional genomics studies for useful traits.
Genome-Based Studies of Marine Microorganisms to Maximize the Diversity of Natural Products Discovery for Medical Treatments

Directory of Open Access Journals (Sweden)

Xin-Qing Zhao

2011-01-01

Marine microorganisms are a rich source of natural products, which play important roles in the pharmaceutical industry. Over the past decade, genome-based studies of marine microorganisms have unveiled the tremendous diversity of the producers of natural products and have also helped to harness the strain diversity, chemical diversity, and genetic diversity of marine microorganisms more efficiently for the rapid discovery and generation of new natural products. In the meantime, genomic information retrieved from marine symbiotic microorganisms can also be employed for the discovery of new medical molecules from yet-unculturable microorganisms. In this paper, the recent progress in the genomic research of marine microorganisms is reviewed; new tools of genome mining as well as advances in the activation of orphan pathways and metagenomic studies are summarized. Genome-based research of marine microorganisms will maximize the biodiscovery process and solve the problems of supply and sustainability of drug molecules for medical treatments.
PATtyFams: Protein families for the microbial genomes in the PATRIC database

Directory of Open Access Journals (Sweden)

James J Davis

2016-02-01

The ability to build accurate protein families is a fundamental operation in bioinformatics that influences comparative analyses, genome annotation and metabolic modeling. For several years we have been maintaining protein families for all microbial genomes in the PATRIC database (Pathosystems Resource Integration Center, patricbrc.org) in order to drive many of the comparative analysis tools that are available through the PATRIC website. However, due to the burgeoning number of genomes, traditional approaches for generating protein families are becoming prohibitive. In this report, we describe a new approach for generating protein families, which we call PATtyFams. This method uses the k-mer-based function assignments available through RAST (Rapid Annotation using Subsystem Technology) to rapidly guide family formation, and then differentiates the function-based groups into families using a Markov Cluster algorithm (MCL). This new approach for generating protein families is rapid, scalable and has properties that are consistent with alignment-based methods.
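The Markov Cluster step mentioned in this abstract can be illustrated with a minimal plain-Python sketch. It is not the PATRIC implementation: the function names, the inflation value, the convergence tolerance and the toy similarity graph are all assumptions chosen for illustration. The sketch alternates the two generic MCL operations, expansion (squaring the column-stochastic matrix) and inflation (element-wise powering followed by renormalization), and then reads clusters off the rows of the converged matrix.

```python
def normalize_columns(m):
    """Scale each column so that it sums to 1 (column-stochastic matrix)."""
    n = len(m)
    sums = [sum(m[r][c] for r in range(n)) for c in range(n)]
    return [[m[r][c] / sums[c] if sums[c] else 0.0 for c in range(n)]
            for r in range(n)]


def mcl(adjacency, inflation=2.0, max_iter=100, tol=1e-9):
    """Cluster an undirected similarity graph given as a square matrix.

    `inflation`, `max_iter` and `tol` are illustrative defaults (assumptions).
    Returns a sorted list of clusters, each a sorted list of node indices.
    """
    n = len(adjacency)
    # Add self-loops so that every node keeps some probability mass on itself.
    m = [[float(adjacency[r][c]) + (1.0 if r == c else 0.0) for c in range(n)]
         for r in range(n)]
    m = normalize_columns(m)
    for _ in range(max_iter):
        # Expansion: one matrix squaring (two steps of the random walk).
        expanded = [[sum(m[r][k] * m[k][c] for k in range(n)) for c in range(n)]
                    for r in range(n)]
        # Inflation: raise entries to a power, then renormalize the columns.
        inflated = normalize_columns(
            [[value ** inflation for value in row] for row in expanded])
        delta = max(abs(inflated[r][c] - m[r][c])
                    for r in range(n) for c in range(n))
        m = inflated
        if delta < tol:
            break
    # Rows with mass on the diagonal are attractors; the nonzero columns of
    # such a row form one cluster. Duplicate clusters collapse in the set.
    clusters = set()
    for r in range(n):
        if m[r][r] > tol:
            clusters.add(frozenset(c for c in range(n) if m[r][c] > tol))
    return sorted(sorted(cluster) for cluster in clusters)


# Hypothetical toy graph: proteins 0-2 and 3-5 form two separate triangles.
toy_graph = [
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
]
families = mcl(toy_graph)
```

In a setting like the one described, the graph would presumably be built only among proteins that share a function assignment, which keeps each matrix small; that reading is an inference from the abstract, not a documented detail.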
The genomes and comparative genomics of Lactobacillus delbrueckii phages.

Science.gov (United States)

Riipinen, Katja-Anneli; Forsman, Päivi; Alatossava, Tapani

2011-07-01

Lactobacillus delbrueckii phages are a great source of genetic diversity. Here, the genome sequences of Lb. delbrueckii phages LL-Ku, c5 and JCL1032 were analyzed in detail, and the genetic diversity of Lb. delbrueckii phages belonging to different taxonomic groups was explored. The lytic isometric group b phages LL-Ku (31,080 bp) and c5 (31,841 bp) showed a minimum nucleotide sequence identity of 90% over about three-fourths of their genomes. The genomic locations of their lysis modules were unique, and the genomes featured several putative overlapping transcription units of genes. LL-Ku and c5 virions displayed peptidoglycan hydrolytic activity associated with a ~36-kDa protein similar in size to the endolysin. Unexpectedly, the 49,433-bp genome of the prolate phage JCL1032 (temperate, group c) revealed a conserved gene order within its structural genes. Lb. delbrueckii phages representing groups a (a phage LL-H), b and c possessed only limited protein sequence homology. Genomic comparison of LL-Ku and c5 suggested that diversification of Lb. delbrueckii phages is mainly due to insertions, deletions and recombination. For the first time, the complete genome sequences of group b and c Lb. delbrueckii phages are reported.
Vacancy-impurity centers in diamond: prospects for synthesis and applications

Science.gov (United States)

Ekimov, E. A.; Kondrin, M. V.

2017-06-01

The bright luminescence of impurity-vacancy complexes, combined with high chemical and radiation resistance, makes diamond an attractive platform for the production of single-photon emitters and luminescent biomarkers for applications in nanoelectronics and medicine. Two representatives of this kind of defects in diamond, silicon-vacancy (SiV) and germanium-vacancy (GeV) centers, are discussed in this review; their similarities and differences are demonstrated in terms of the more thoroughly studied nitrogen-vacancy (NV) complexes. The recent discovery of GeV luminescent centers opens a unique opportunity for the controlled synthesis of single-photon emitters in nanodiamonds. We demonstrate prospects for the high-pressure high-temperature (HPHT) technique to create single-photon emitters, not only as an auxiliary to chemical vapor deposition (CVD) and ion-implantation methods but also as a primary synthesis tool for producing color centers in nanodiamonds. Besides practical applications, comparative studies of these two complexes, which belong to the same structural class of defects, have a fundamental importance for deeper understanding of shelving levels, the electronic structure, and optical properties of these centers. In conclusion, we discuss several open problems regarding the structure, charge state, and practical application of these centers, which still require a solution.
The Current State of Poison Control Centers in Pakistan and the Need for Capacity Building

Directory of Open Access Journals (Sweden)

Nadeem Khan

2014-03-01

Background: Chemical exposure is a major health problem globally. Poison control centers (PCCs) play a leading role both in developed and developing countries in the prevention and control of poisonous chemical exposures. In this study, we aimed to assess the current state of PCCs in Pakistan and highlight capacity building needs in these centers. Methods: A cross-sectional survey of the two registered PCCs was done during August to December 2011. Necessary services of the PCCs were evaluated and the data were recorded on a predesigned checklist. Results: Both PCCs are affiliated to a tertiary care hospital. Clinical services to poisoned patients were available 24 hours a day / 7 days a week. Information on common local products was available to poison center staff. Both centers were involved in undergraduate and post graduate teaching. Telephone poison information service was not available in either center. There was a limited capacity for qualitative and analytical toxicology. Common antidotes were available. There were limited surveillance activities to capture toxic risks existing in the community and also a deficiency was observed in chemical disaster planning. Conclusion: PCCs in Pakistan need capacity building for specialized training in toxicology, toxicovigilance, chemical disaster planning, analytical laboratory tests and telephone service for consultation in poisoning cases. How to cite this article: Khan NU, Mir MU, Khan UR, Khan AR, Ara J, Raja K, et al. The Current State of Poison Control Centers in Pakistan and the Need for Capacity Building. Asia Pac J Med Toxicol 2014;3:31-5.
Genome-wide characterization of centromeric satellites from multiple mammalian genomes.

Science.gov (United States)

Alkan, Can; Cardone, Maria Francesca; Catacchio, Claudia Rita; Antonacci, Francesca; O'Brien, Stephen J; Ryder, Oliver A; Purgato, Stefania; Zoli, Monica; Della Valle, Giuliano; Eichler, Evan E; Ventura, Mario

2011-01-01

Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.
COMPUTATIONAL SCIENCE CENTER

Energy Technology Data Exchange (ETDEWEB)

DAVENPORT, J.

2005-11-01

The Brookhaven Computational Science Center brings together researchers in biology, chemistry, physics, and medicine with applied mathematicians and computer scientists to exploit the remarkable opportunities for scientific discovery which have been enabled by modern computers. These opportunities are especially great in computational biology and nanoscience, but extend throughout science and technology and include, for example, nuclear and high energy physics, astrophysics, materials and chemical science, sustainable energy, environment, and homeland security. To achieve our goals we have established a close alliance with applied mathematicians and computer scientists at Stony Brook and Columbia Universities.
Engineering yeast metabolism for production of fuels and chemicals

DEFF Research Database (Denmark)

Nielsen, Jens

2016-01-01

Metabolic engineering relies on the Design-Build-Test cycle. This cycle includes technologies like mathematical modeling of metabolism, genome editing and advanced tools for phenotypic characterization. In recent years there have been advances in several of these technologies, which has enabled faster development of metabolically engineered strains that can be used for production of fuels and chemicals. The yeast Saccharomyces cerevisiae is widely used for production of fuels, chemicals, pharmaceuticals and materials. Through metabolic engineering of this yeast a number of novel industrial ... as for metabolic design. In this lecture it will be demonstrated how the Design-Build-Test cycle of metabolic engineering has allowed for development of yeast cell factories for production of a range of different fuels and chemicals. Some examples of different technologies will be presented together with examples ...
Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium

DEFF Research Database (Denmark)

Machado, Henrique; Gram, Lone

2017-01-01

Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationships using several analyses (16S rRNA, MLSA, fur, amino-acid usage, ANI), which allowed us to identify two ... was widespread and abundant in the genus, suggesting a role in genomic evolution. The high genetic variability and indications of genetic exchange make it difficult to elucidate genome evolutionary paths and raise the awareness of the roles of foreign DNA in the genomic evolution of environmental organisms.
Gene Composer in a structural genomics environment

International Nuclear Information System (INIS)

Lorimer, Don; Raymond, Amy; Mixon, Mark; Burgin, Alex; Staker, Bart; Stewart, Lance

2011-01-01

For structural biology applications, protein-construct engineering is guided by comparative sequence analysis and structural information, which allow the researcher to better define domain boundaries for terminal deletions and nonconserved regions for surface mutants. A database software application called Gene Composer has been developed to facilitate construct design. The structural genomics effort at the Seattle Structural Genomics Center for Infectious Disease (SSGCID) requires the manipulation of large numbers of amino-acid sequences and the underlying DNA sequences which are to be cloned into expression vectors. To improve efficiency in high-throughput protein structure determination, a database software package, Gene Composer, has been developed which facilitates the information-rich design of protein constructs and their underlying gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bioinformatics steps used in modern structure-guided protein engineering and synthetic gene engineering. An example of the structure determination of H1N1 RNA-dependent RNA polymerase PB2 subunit is given.
Detailed analysis of putative genes encoding small proteins in legume genomes

Directory of Open Access Journals (Sweden)

Gabriel Guillén

2013-06-01

Diverse plant genome sequencing projects coupled with powerful bioinformatics tools have facilitated massive data analysis to construct specialized databases classified according to cellular function. However, there are still a considerable number of genes encoding proteins whose function has not yet been characterized. Included in this category are small proteins (SPs, 30-150 amino acids) encoded by short open reading frames (sORFs). SPs play important roles in plant physiology, growth, and development. Unfortunately, protocols focused on the genome-wide identification and characterization of sORFs are scarce or remain poorly implemented. As a result, these genes are underrepresented in many genome annotations. In this work, we exploited publicly available genome sequences of Phaseolus vulgaris, Medicago truncatula, Glycine max and Lotus japonicus to analyze the abundance of annotated SPs in legumes. Our strategy to uncover bona fide sORFs at the genome level was centered on bioinformatics analysis of characteristics such as evidence of expression (transcription), presence of known protein regions or domains, and identification of orthologous genes in the genomes explored. We collected 6170, 10461, 30521, and 23599 putative sORFs from P. vulgaris, G. max, M. truncatula, and L. japonicus genomes, respectively. Expressed sequence tags (ESTs) available in the DFCI Gene Index database provided evidence that ~one-third of the predicted legume sORFs are expressed. Most potential SPs have a counterpart in a different plant species and counterpart regions or domains in larger proteins. Potential functional sORFs were also classified according to a reduced set of GO categories, and the expression of 13 of them during P. vulgaris nodule ontogeny was confirmed by qPCR. This analysis provides a collection of sORFs that potentially encode meaningful SPs, and offers the possibility of their further functional evaluation.
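As an illustration of the size criterion in this abstract (SPs of 30-150 amino acids), the following is a minimal sketch of a forward-strand ORF scan. It is not the authors' pipeline, which additionally relied on expression evidence, known domains and orthology; the function name, the coordinate convention and the restriction to the forward strand are assumptions made to keep the example short.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_sorfs(seq, min_aa=30, max_aa=150):
    """Return (start, end, frame) tuples for forward-strand ORFs whose
    encoded peptide has between min_aa and max_aa residues.

    `start` is the 0-based index of the ATG and `end` is the index just
    past the stop codon. Only the first ATG after each stop is used.
    """
    seq = seq.upper()
    hits = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # open a candidate ORF at the first in-frame ATG
            elif start is not None and codon in STOP_CODONS:
                n_aa = (i - start) // 3  # residues including the initial Met
                if min_aa <= n_aa <= max_aa:
                    hits.append((start, i + 3, frame))
                start = None  # close the ORF and look for the next ATG
    return hits


# Hypothetical test sequences: a 40-residue ORF and a 6-residue ORF.
long_enough = "CC" + "ATG" + "GCT" * 39 + "TAA"
too_short = "ATG" + "GCT" * 5 + "TAA"
sorfs = find_sorfs(long_enough)
```

A genome-wide scan would also need the reverse strand and some handling of overlapping or nested ORFs; those choices are left out here because the abstract does not specify them.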
Self-assembling nano-diameter needlelike pinning centers in YBCO, utilizing a foreign element dopant

Energy Technology Data Exchange (ETDEWEB)

Sawh, Ravi-Persad [Texas Center for Superconductivity and Physics Department, University of Houston, 632 Science and Research Bldg 1, Houston Texas 77204-5005 (United States); Weinstein, Roy [Texas Center for Superconductivity and Physics Department, University of Houston, 632 Science and Research Bldg 1, Houston Texas 77204-5005 (United States); Obot, Victor [Department of Mathematics, Texas Southern University, 3100 Cleburne St, Houston Texas 77004-4597 (United States); Parks, Drew [Texas Center for Superconductivity and Physics Department, University of Houston, 632 Science and Research Bldg 1, Houston Texas 77204-5005 (United States); Gandini, Alberto [Texas Center for Superconductivity and Physics Department, University of Houston, 632 Science and Research Bldg 1, Houston Texas 77204-5005 (United States); Skorpenske, Harley [Texas Center for Superconductivity and Physics Department, University of Houston, 632 Science and Research Bldg 1, Houston Texas 77204-5005 (United States)

2006-06-01

Although pinning centers created by irradiation presently produce the highest Jc, it is probable that ultimately these will be emulated by chemical pinning centers. The best pinning centers produced by irradiation nevertheless provide guidelines for desirable morphology of chemical pinning structures. The highest Jc produced earlier in textured HTS was obtained using isotropic high-energy ions produced by fission of ²³⁵U. This so-called U/n process produces pinning centers of diameter ≤ 4.5 nm, with an effective length of ~2.7 µm. Maximum Jc occurs for pinning center density of ~10¹⁰ cm⁻³. We use this as a model for desired chemical pinning centers. Our approach to introducing chemical pinning centers has been to produce precipitates within the HTS containing elements not native to the HTS, and to seek needlelike (columnar) deposits of small diameter. We report here on the formation of needlelike or columnar deposits in textured Y123 containing a dopant foreign to Y123. It serves as a demonstration that self-assembling nanometer diameter columns utilizing a dopant foreign to the HTS system are a feasible goal. These deposits, however, do not fully meet the ultimate requirements of pinning centers because the desired deposits should be smaller. The self-assembling columns formed contain titanium, are ~500 nm in diameter, and up to 10 µm long. The size and morphology of the deposits vary with the mass of admixed Ti dopant. Jc is decreased for small dopant mass. At larger dopant masses needlelike precipitates form, and Jc increases again. A small range of mass of admixed Ti exists in which Jc is enhanced by pinning. In the range of admixed Ti mass studied in these experiments there is a negligible effect on Tc. Magnetization studies of Jc are also reported.
Biotechnology for Chemical Production: Challenges and Opportunities.

Science.gov (United States)

Burk, Mark J; Van Dien, Stephen

2016-03-01

Biotechnology offers a new sustainable approach to manufacturing chemicals, enabling the replacement of petroleum-based raw materials with renewable biobased feedstocks, thereby reducing greenhouse gas (GHG) emissions, toxic byproducts, and the safety risks associated with traditional petrochemical processing. Development of such bioprocesses is enabled by recent advances in genomics, molecular biology, and systems biology, and will continue to accelerate as access to these tools becomes faster and cheaper. Copyright © 2015 Elsevier Ltd. All rights reserved.
Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes

Energy Technology Data Exchange (ETDEWEB)

Kuo, Alan; Grigoriev, Igor

2009-04-17

Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~10k genes. ~12% of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~9% rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also, >90% of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. We conclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.
Genome-wide analysis of Tol2 transposon reintegration in zebrafish.

Science.gov (United States)

Kondrychyn, Igor; Garcia-Lecea, Marta; Emelyanov, Alexander; Parinov, Sergey; Korzh, Vladimir

2009-09-08

Tol2, a member of the hAT family of transposons, has become a useful tool for genetic manipulation of model animals, but information about its interactions with vertebrate genomes is still limited. Furthermore, published reports on Tol2 have mainly been based on random integration of the transposon system after co-injection of a plasmid DNA harboring the transposon and a transposase mRNA. It is important to understand how Tol2 would behave upon activation after integration into the genome. We performed a large-scale enhancer trap (ET) screen and generated 338 insertions of the Tol2 transposon-based ET cassette into the zebrafish genome. These insertions were generated by remobilizing the transposon from two different donor sites in two transgenic lines. We found that 39% of Tol2 insertions occurred in transcription units, mostly into introns. Analysis of the transposon target sites revealed no strict specificity at the DNA sequence level. However, Tol2 was prone to target AT-rich regions with weak palindromic consensus sequences centered at the insertion site. Our systematic analysis of sequential remobilizations of the Tol2 transposon from two independent sites within a vertebrate genome has revealed properties such as a tendency to integrate into transcription units and into AT-rich palindrome-like sequences. This information will influence the development of various applications involving DNA transposons and Tol2 in particular.
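The two sequence properties reported in this abstract, AT-richness and a weak palindromic pattern centered at the insertion site, can each be expressed as a simple score. The sketch below is only an illustration: the function names, the window sizes and the toy sequence are assumptions, and the abstract does not state how the authors actually quantified either property.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def at_fraction(seq, site, flank=10):
    """Fraction of A/T bases in the window [site - flank, site + flank)."""
    window = seq[max(0, site - flank):site + flank].upper()
    if not window:
        return 0.0
    return sum(base in "AT" for base in window) / len(window)


def palindrome_score(seq, site, flank=4):
    """Fraction of mirrored positions around `site` that are complementary.

    Position site-1-k is compared with position site+k for k < flank;
    a perfect reverse-complement palindrome centered at `site` scores 1.0.
    """
    seq = seq.upper()
    pairs = 0
    matches = 0
    for offset in range(flank):
        left, right = site - 1 - offset, site + offset
        if left < 0 or right >= len(seq):
            break  # ran off the end of the sequence
        pairs += 1
        if COMPLEMENT.get(seq[left]) == seq[right]:
            matches += 1
    return matches / pairs if pairs else 0.0


# Hypothetical example: insertion in the middle of the palindrome CGAATTCG.
example = "CCGAATTCGG"
insertion_site = 5  # the insertion falls between example[4] and example[5]
local_at = at_fraction(example, insertion_site, flank=2)
mirror = palindrome_score(example, insertion_site, flank=4)
```

Scores like these would only become meaningful when compared against randomly sampled genomic positions, since a genome with high overall AT content raises the baseline.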

Mojo Hand, a TALEN design tool for genome editing applications.

Science.gov (United States)

Neff, Kevin L; Argue, David P; Ma, Alvin C; Lee, Han B; Clark, Karl J; Ekker, Stephen C

2013-01-16

Recent studies of transcription activator-like (TAL) effector domains fused to nucleases (TALENs) demonstrate enormous potential for genome editing. Effective design of TALENs requires a combination of selecting appropriate genetic features, finding pairs of binding sites based on a consensus sequence, and, in some cases, identifying endogenous restriction sites for downstream molecular genetic applications. We present the web-based program Mojo Hand for designing TAL and TALEN constructs for genome editing applications (http://www.talendesign.org). We describe the algorithm and its implementation. The features of Mojo Hand include (1) automatic download of genomic data from the National Center for Biotechnology Information, (2) analysis of any DNA sequence to reveal pairs of binding sites based on a user-defined template, (3) selection of restriction-enzyme recognition sites in the spacer between the TAL monomer binding sites including options for the selection of restriction enzyme suppliers, and (4) output files designed for subsequent TALEN construction using the Golden Gate assembly method. Mojo Hand enables the rapid identification of TAL binding sites for use in TALEN design. The assembly of TALEN constructs is also simplified by using the TAL-site prediction program in conjunction with a spreadsheet management aid of reagent concentrations and TALEN formulation. Mojo Hand enables scientists to more rapidly deploy TALENs for genome editing applications.
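Feature (2), finding pairs of binding sites from a user-defined template, can be sketched as a scan over a sequence. The code below is not Mojo Hand's algorithm, which the abstract does not detail. The template used here (fixed arm length, a spacer length range, a T immediately 5' of the left arm and an A immediately 3' of the right arm on the top strand) is an assumed, simplified design rule, and all names and default values are hypothetical.

```python
def find_talen_pairs(seq, arm_len=15, spacer_min=14, spacer_max=18):
    """Return (left_start, spacer_len, right_start) for candidate site pairs.

    Assumed template: the left arm is preceded by T on the top strand, and
    the right arm is followed by A on the top strand (a T on the bottom
    strand, 5' of the right-hand binding site). Indices are 0-based.
    """
    seq = seq.upper()
    pairs = []
    for left_start in range(1, len(seq)):
        if seq[left_start - 1] != "T":
            continue  # the left arm must be preceded by a T
        for spacer in range(spacer_min, spacer_max + 1):
            right_start = left_start + arm_len + spacer
            right_end = right_start + arm_len  # index just past the right arm
            if right_end >= len(seq):
                break  # longer spacers cannot fit either
            if seq[right_end] == "A":
                pairs.append((left_start, spacer, right_start))
    return pairs


# Hypothetical target: T + 15-nt left arm + 15-nt spacer + 15-nt right arm + A.
target = "T" + "C" * 15 + "G" * 15 + "C" * 15 + "A"
candidates = find_talen_pairs(target)
```

A fuller tool would then search each spacer for restriction-enzyme recognition sites, as feature (3) describes; that step is omitted because it depends on an enzyme list the abstract does not give.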
Mojo Hand, a TALEN design tool for genome editing applications

Directory of Open Access Journals (Sweden)

Neff Kevin L

2013-01-01

Abstract Background: Recent studies of transcription activator-like (TAL) effector domains fused to nucleases (TALENs) demonstrate enormous potential for genome editing. Effective design of TALENs requires a combination of selecting appropriate genetic features, finding pairs of binding sites based on a consensus sequence, and, in some cases, identifying endogenous restriction sites for downstream molecular genetic applications. Results: We present the web-based program Mojo Hand for designing TAL and TALEN constructs for genome editing applications (http://www.talendesign.org). We describe the algorithm and its implementation. The features of Mojo Hand include (1) automatic download of genomic data from the National Center for Biotechnology Information, (2) analysis of any DNA sequence to reveal pairs of binding sites based on a user-defined template, (3) selection of restriction-enzyme recognition sites in the spacer between the TAL monomer binding sites including options for the selection of restriction enzyme suppliers, and (4) output files designed for subsequent TALEN construction using the Golden Gate assembly method. Conclusions: Mojo Hand enables the rapid identification of TAL binding sites for use in TALEN design. The assembly of TALEN constructs is also simplified by using the TAL-site prediction program in conjunction with a spreadsheet management aid of reagent concentrations and TALEN formulation. Mojo Hand enables scientists to more rapidly deploy TALENs for genome editing applications.
Building a genome database using an object-oriented approach.

Science.gov (United States)

Barbasiewicz, Anna; Liu, Lin; Lang, B Franz; Burger, Gertraud

2002-01-01

GOBASE is a relational database that integrates data associated with mitochondria and chloroplasts. The most important data in GOBASE, i.e., molecular sequences and taxonomic information, are obtained from the public sequence data repository at the National Center for Biotechnology Information (NCBI), and are validated by our experts. Maintaining a curated genomic database comes with a towering labor cost, due to the sheer volume of available genomic sequences and the plethora of annotation errors and omissions in records retrieved from public repositories. Here we describe our approach to increase automation of the database population process, thereby reducing manual intervention. As a first step, we used Unified Modeling Language (UML) to construct a list of potential errors. Each case was evaluated independently, and an expert solution was devised and represented as a diagram. Subsequently, the UML diagrams were used as templates for writing object-oriented automation programs in the Java programming language.
Novel genomes and genome constitutions identified by GISH and 5S rDNA and knotted1 genomic sequences in the genus Setaria.

Science.gov (United States)

Zhao, Meicheng; Zhi, Hui; Doust, Andrew N; Li, Wei; Wang, Yongfang; Li, Haiquan; Jia, Guanqing; Wang, Yongqiang; Zhang, Ning; Diao, Xianmin

2013-04-11

The Setaria genus is increasingly of interest to researchers, as its two species, S. viridis and S. italica, are being developed as models for understanding C4 photosynthesis and plant functional genomics. The genome constitution of Setaria species has been studied in the diploid species S. viridis, S. adhaerens and S. grisebachii, where three genomes A, B and C were identified respectively. Two allotetraploid species, S. verticillata and S. faberi, were found to have AABB genomes, and one autotetraploid species, S. queenslandica, with an AAAA genome, has also been identified. The genomes and genome constitutions of most other species remain unknown, even though it was thought there are approximately 125 species in the genus distributed world-wide. GISH was performed to detect the genome constitutions of the Eurasian species S. glauca, S. plicata, and S. arenaria, with the known A, B and C genomes as probes. No or very poor hybridization signal was detected, indicating that their genomes are different from those already described. GISH was also performed reciprocally between S. glauca, S. plicata, and S. arenaria genomes, but no hybridization signals between each other were found. The two sets of chromosomes of S. lachnea both gave strong hybridization signals with only the known C genome of S. grisebachii. Chromosomes of Qing 9, an accession formerly considered as S. viridis, gave a strong hybridization signal only with the B genome of S. adhaerens. Phylogenetic trees constructed with 5S rDNA and knotted1 markers clearly classify the samples in this study into six clusters, matching the GISH results, and suggesting that the F genome of S. arenaria is basal in the genus. Three novel genomes in the Setaria genus were identified and designated as genome D (S. glauca), E (S. plicata) and F (S. arenaria) respectively. The genome constitution of tetraploid S. lachnea is putatively CCC'C'. Qing 9 is a B genome species indigenous to China and is hypothesized to be a newly identified species. The
The chemical heritage of Aspergillus flavus in A. oryzae RIB 40

DEFF Research Database (Denmark)

Rank, Christian; Klejnstrup, Marie Louise; Petersen, Lene Maj

Aspergillus oryzae is a very important species in biotechnology and has been used for centuries in traditional Asian fermentation. The RIB40 strain is particularly interesting as it was one of the first genome-sequenced Aspergilli, together with A. flavus, a prominent food and feed contaminant...... capable of producing aflatoxin. These species can be perceived as ecotypes. We have analyzed A. oryzae RIB40 and found that the chemical potential could be enhanced significantly under certain conditions. Delicate analysis of their metabolic profiles allows for chemical insight at the transcription level...
Genome-derived vaccines.

Science.gov (United States)

De Groot, Anne S; Rappuoli, Rino

2004-02-01

Vaccine research entered a new era when the complete genome of a pathogenic bacterium was published in 1995. Since then, more than 97 bacterial pathogens have been sequenced and at least 110 additional projects are now in progress. Genome sequencing has also dramatically accelerated: high-throughput facilities can draft the sequence of an entire microbe (two to four megabases) in 1 to 2 days. Vaccine developers are using microarrays, immunoinformatics, proteomics and high-throughput immunology assays to reduce the truly unmanageable volume of information available in genome databases to a manageable size. Vaccines composed of novel antigens discovered from genome mining are already in clinical trials. Within 5 years we can expect to see a novel class of vaccines composed of genome-predicted, assembled and engineered T- and B-cell epitopes. This article addresses the convergence of three forces--microbial genome sequencing, computational immunology and new vaccine technologies--that are shifting genome mining for vaccines onto the forefront of immunology research.
Molecular epidemiology of Staphylococcus aureus bacteremia in a single large Minnesota medical center in 2015 as assessed using MLST, core genome MLST and spa typing.

Directory of Open Access Journals (Sweden)

Kyung-Hwa Park

Staphylococcus aureus is a leading cause of bacteremia in hospitalized patients. Whether or not S. aureus bacteremia (SAB) is associated with clonality, implicating potential nosocomial transmission, has not, however, been investigated. Herein, we examined the epidemiology of SAB using whole genome sequencing (WGS). 152 SAB isolates collected over the course of 2015 at a single large Minnesota medical center were studied. Staphylococcus protein A (spa) typing was performed by PCR/Sanger sequencing; multilocus sequence typing (MLST) and core genome MLST (cgMLST) were determined by WGS. Forty-eight isolates (32%) were methicillin-resistant S. aureus (MRSA). The isolates encompassed 66 spa types, clustered into 11 spa clonal complexes (CCs) and 10 singleton types. 88% of 48 MRSA isolates belonged to spa CC-002 or -008. Methicillin-susceptible S. aureus (MSSA) isolates were more genotypically diverse, with 61% distributed across four spa CCs (CC-002, CC-012, CC-008 and CC-084). By MLST, there were 31 sequence types (STs), including 18 divided into 6 CCs and 13 singleton STs. Amongst MSSA isolates, the common MLST clones were CC5 (23%), CC30 (19%), CC8 (15%) and CC15 (11%). Common MRSA clones were CC5 (67%) and CC8 (25%); there were no MRSA isolates in CC45 or CC30. By cgMLST analysis, there were 9 allelic differences between two isolates, with the remaining 150 isolates differing from each other by over 40 alleles. The two isolates were retroactively epidemiologically linked by medical record review. Overall, cgMLST analysis resulted in higher resolution epidemiological typing than did multilocus sequence or spa typing.
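The allelic-difference comparison reported in this record can be sketched in a few lines of Python. This is an illustrative sketch only, not the study's pipeline: the profile layout (one allele identifier per core-genome locus, with None for an uncalled locus), the function names, and the cutoff passed to the hypothetical `closely_related` helper are assumptions.

```python
from itertools import combinations


def allelic_distance(profile_a, profile_b):
    """Count loci where both isolates have a called allele and the alleles differ."""
    return sum(
        1
        for a, b in zip(profile_a, profile_b)
        if a is not None and b is not None and a != b
    )


def pairwise_distances(profiles):
    """Map each isolate pair (sorted by name) to its allelic distance."""
    return {
        (i, j): allelic_distance(profiles[i], profiles[j])
        for i, j in combinations(sorted(profiles), 2)
    }


def closely_related(profiles, max_diff):
    """Return isolate pairs whose allelic distance is at most max_diff."""
    return [
        pair
        for pair, dist in pairwise_distances(profiles).items()
        if dist <= max_diff
    ]


if __name__ == "__main__":
    # Toy profiles over four loci; real cgMLST schemes use far more loci.
    toy = {
        "iso1": [1, 2, 3, 4],
        "iso2": [1, 2, 3, 5],
        "iso3": [9, 8, 7, None],
    }
    print(pairwise_distances(toy))
    print(closely_related(toy, max_diff=1))
```

Ignoring loci that are uncalled in either isolate is one common convention; counting them as differences would inflate distances for lower-quality assemblies.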
JGI Fungal Genomics Program

Energy Technology Data Exchange (ETDEWEB)

Grigoriev, Igor V.

2011-03-14

Genomes of energy and environment fungi are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 50 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such 'parts' suggested by comparative genomics and functional analysis in these areas are presented here
Genomic Encyclopedia of Fungi

Energy Technology Data Exchange (ETDEWEB)

Grigoriev, Igor

2012-08-10

Genomes of fungi relevant to energy and environment are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 150 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.
The Genomic Code: Genome Evolution and Potential Applications

KAUST Repository

Bernardi, Giorgio

2016-01-25

The genome of metazoans is organized according to a genomic code which comprises three laws: 1) Compositional correlations hold between contiguous coding and non-coding sequences, as well as among the three codon positions of protein-coding genes; these correlations are the consequence of the fact that the genomes under consideration consist of fairly homogeneous, long (≥200Kb) sequences, the isochores; 2) Although isochores are defined on the basis of purely compositional properties, GC levels of isochores are correlated with all tested structural and functional properties of the genome; 3) GC levels of isochores are correlated with chromosome architecture from interphase to metaphase; in the case of interphase the correlation concerns isochores and the three-dimensional “topological associated domains” (TADs); in the case of mitotic chromosomes, the correlation concerns isochores and chromosomal bands. Finally, the genomic code is the fourth and last pillar of molecular biology, the first three pillars being 1) the double helix structure of DNA; 2) the regulation of gene expression in prokaryotes; and 3) the genetic code.
Genomic damage in children accidentally exposed to ionizing radiation

DEFF Research Database (Denmark)

Fucic, A; Brunborg, G; Lasan, R

2007-01-01

During the last decade, our knowledge of the mechanisms by which children respond to exposures to physical and chemical agents present in the environment, has significantly increased. Results of recent projects and programmes focused on children's health underline a specific vulnerability of chil...... and efficient preventive measures, by means of a better knowledge of the early and delayed health effects in children resulting from radiation exposure....... of children to environmental genotoxicants. Environmental research on children predominantly investigates the health effects of air pollution while effects from radiation exposure deserve more attention. The main sources of knowledge on genome damage of children exposed to radiation are studies performed...... after the Chernobyl nuclear plant accident in 1986. The present review presents and discusses data collected from papers analyzing genome damage in children environmentally exposed to ionizing radiation. Overall, the evidence from the studies conducted following the Chernobyl accident, nuclear tests...
NeisseriaBase: a specialised Neisseria genomic resource and analysis platform.

Science.gov (United States)

Zheng, Wenning; Mutha, Naresh V R; Heydari, Hamed; Dutta, Avirup; Siow, Cheuk Chuen; Jakubovics, Nicholas S; Wee, Wei Yee; Tan, Shi Yang; Ang, Mia Yang; Wong, Guat Jah; Choo, Siew Woh

2016-01-01

Background. The gram-negative Neisseria is associated with two of the most potent human epidemic diseases: meningococcal meningitis and gonorrhoea. In both cases, disease is caused by bacteria colonizing human mucosal membrane surfaces. Overall, the genus shows great diversity and genetic variation mainly due to its ability to acquire and incorporate genetic material from a diverse range of sources through horizontal gene transfer. Although a number of databases exist for the Neisseria genomes, they are mostly focused on the pathogenic species. In this present study we present the freely available NeisseriaBase, a database dedicated to the genus Neisseria encompassing the complete and draft genomes of 15 pathogenic and commensal Neisseria species. Methods. The genomic data were retrieved from National Center for Biotechnology Information (NCBI) and annotated using the RAST server which were then stored into the MySQL database. The protein-coding genes were further analyzed to obtain information such as calculation of GC content (%), predicted hydrophobicity and molecular weight (Da) using in-house Perl scripts. The web application was developed following the secure four-tier web application architecture: (1) client workstation, (2) web server, (3) application server, and (4) database server. The web interface was constructed using PHP, JavaScript, jQuery, AJAX and CSS, utilizing the model-view-controller (MVC) framework. The in-house developed bioinformatics tools implemented in NeisseriaBase were developed using Python, Perl, BioPerl and R languages. Results. Currently, NeisseriaBase houses 603,500 Coding Sequences (CDSs), 16,071 RNAs and 13,119 tRNA genes from 227 Neisseria genomes. The database is equipped with interactive web interfaces. Incorporation of the JBrowse genome browser in the database enables fast and smooth browsing of Neisseria genomes. NeisseriaBase includes the standard BLAST program to facilitate homology searching, and for Virulence Factor
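The GC content (%) calculation mentioned in the Methods of this record can be illustrated with a short Python sketch. The authors report using in-house Perl scripts, which are not shown in the abstract; the function below is an assumed, minimal equivalent, and the choice to ignore ambiguous bases such as N is an assumption of this sketch.

```python
def gc_content(seq):
    """Return GC content as a percentage of unambiguous bases (A, C, G, T).

    Ambiguous symbols such as N are ignored; an input with no
    unambiguous bases yields 0.0.
    """
    called = [base for base in seq.upper() if base in "ACGT"]
    if not called:
        return 0.0
    gc = sum(1 for base in called if base in "GC")
    return 100.0 * gc / len(called)


if __name__ == "__main__":
    # Hypothetical coding sequences keyed by made-up gene identifiers.
    cds = {"geneA": "ATGGCGCTGA", "geneB": "ATGAAATTTTAA"}
    for name, seq in cds.items():
        print(name, round(gc_content(seq), 1))
```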
NeisseriaBase: a specialised Neisseria genomic resource and analysis platform

Directory of Open Access Journals (Sweden)

Wenning Zheng

2016-03-01

Background. The gram-negative Neisseria is associated with two of the most potent human epidemic diseases: meningococcal meningitis and gonorrhoea. In both cases, disease is caused by bacteria colonizing human mucosal membrane surfaces. Overall, the genus shows great diversity and genetic variation mainly due to its ability to acquire and incorporate genetic material from a diverse range of sources through horizontal gene transfer. Although a number of databases exist for the Neisseria genomes, they are mostly focused on the pathogenic species. In this present study we present the freely available NeisseriaBase, a database dedicated to the genus Neisseria encompassing the complete and draft genomes of 15 pathogenic and commensal Neisseria species. Methods. The genomic data were retrieved from National Center for Biotechnology Information (NCBI) and annotated using the RAST server which were then stored into the MySQL database. The protein-coding genes were further analyzed to obtain information such as calculation of GC content (%), predicted hydrophobicity and molecular weight (Da) using in-house Perl scripts. The web application was developed following the secure four-tier web application architecture: (1) client workstation, (2) web server, (3) application server, and (4) database server. The web interface was constructed using PHP, JavaScript, jQuery, AJAX and CSS, utilizing the model-view-controller (MVC) framework. The in-house developed bioinformatics tools implemented in NeisseriaBase were developed using Python, Perl, BioPerl and R languages. Results. Currently, NeisseriaBase houses 603,500 Coding Sequences (CDSs), 16,071 RNAs and 13,119 tRNA genes from 227 Neisseria genomes. The database is equipped with interactive web interfaces. Incorporation of the JBrowse genome browser in the database enables fast and smooth browsing of Neisseria genomes. NeisseriaBase includes the standard BLAST program to facilitate homology searching, and for Virulence
PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes

Directory of Open Access Journals (Sweden)

Wasnick Michael

2008-03-01

Background: The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT) is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes. Results: PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function. Conclusion: PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any
The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes.

Science.gov (United States)

Mao, Qing; Ciotlos, Serban; Zhang, Rebecca Yu; Ball, Madeleine P; Chin, Robert; Carnevali, Paolo; Barua, Nina; Nguyen, Staci; Agarwal, Misha R; Clegg, Tom; Connelly, Abram; Vandewege, Ward; Zaranek, Alexander Wait; Estep, Preston W; Church, George M; Drmanac, Radoje; Peters, Brock A

2016-10-11

Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced. A stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology. In addition, much of the genomic data is not available to the public and lacks phenotypic information. As part of the Personal Genome Project, blood samples from 184 participants were collected and processed using Complete Genomics' Long Fragment Read technology. Here, we present the experimental whole genome haplotyping and sequencing of these samples to an average read coverage depth of 100X. This is approximately three-fold higher than the read coverage applied to most whole human genome assemblies and ensures the highest quality results. Currently, 114 genomes from this dataset are freely available in the GigaDB repository and are associated with rich phenotypic data; the remaining 70 should be added in the near future as they are approved through the PGP data release process. For reproducibility analyses, 20 genomes were sequenced at least twice using independent LFR barcoded libraries. Seven genomes were also sequenced using Complete Genomics' standard non-barcoded library process. In addition, we report 2.6 million high-quality, rare variants not previously identified in the Single Nucleotide Polymorphisms database or the 1000 Genomes Project Phase 3 data. These genomes represent a unique source of haplotype and phenotype data for the scientific community and should help to expand our understanding of human genome evolution and function.
Macronuclear genome sequence of the ciliate Tetrahymena thermophila, a model eukaryote.

Directory of Open Access Journals (Sweden)

Jonathan A Eisen

2006-09-01

The ciliate Tetrahymena thermophila is a model organism for molecular and cellular biology. Like other ciliates, this species has separate germline and soma functions that are embodied by distinct nuclei within a single cell. The germline-like micronucleus (MIC) has its genome held in reserve for sexual reproduction. The soma-like macronucleus (MAC), which possesses a genome processed from that of the MIC, is the center of gene expression and does not directly contribute DNA to sexual progeny. We report here the shotgun sequencing, assembly, and analysis of the MAC genome of T. thermophila, which is approximately 104 Mb in length and composed of approximately 225 chromosomes. Overall, the gene set is robust, with more than 27,000 predicted protein-coding genes, 15,000 of which have strong matches to genes in other organisms. The functional diversity encoded by these genes is substantial and reflects the complexity of processes required for a free-living, predatory, single-celled organism. This is highlighted by the abundance of lineage-specific duplications of genes with predicted roles in sensing and responding to environmental conditions (e.g., kinases), using diverse resources (e.g., proteases and transporters), and generating structural complexity (e.g., kinesins and dyneins). In contrast to the other lineages of alveolates (apicomplexans and dinoflagellates), no compelling evidence could be found for plastid-derived genes in the genome. UGA, the only T. thermophila stop codon, is used in some genes to encode selenocysteine, thus making this organism the first known with the potential to translate all 64 codons in nuclear genes into amino acids. We present genomic evidence supporting the hypothesis that the excision of DNA from the MIC to generate the MAC specifically targets foreign DNA as a form of genome self-defense. The combination of the genome sequence, the functional diversity encoded therein, and the presence of some pathways missing from
The phase-resolved photoacoustic method to indicate chemical assignments of paracetamol

Science.gov (United States)

Camilotti, J. G.; Somer, A.; Costa, G. F.; Ribeiro, M. A.; Bonardi, C.; Cruz, G. K.; Gómez, S. L.; Beltrame, F. L.; Medina, A. N.; Sato, F.; Astrath, N. G. C.; Novatski, A.

2014-03-01

In this work, the phase-resolved photoacoustic method was applied to provide specific information on the chemical assignments of paracetamol in the near-infrared region. Two broad bands, centered at 1370 and 1130 nm, were well-resolved using this method, making it possible to assign the peaks centered at 1398, 1355 and 1295 nm to a C-H combination from a CH3 structure and the peak at 1305 nm to a C-H combination from the aromatic ring. This information represents a new finding in chemical studies regarding this medicament.
The Genomics Education Partnership: Successful Integration of Research into Laboratory Classes at a Diverse Group of Undergraduate Institutions

Science.gov (United States)

Shaffer, Christopher D.; Alvarez, Consuelo; Bailey, Cheryl; Barnard, Daron; Bhalla, Satish; Chandrasekaran, Chitra; Chandrasekaran, Vidya; Chung, Hui-Min; Dorer, Douglas R.; Du, Chunguang; Eckdahl, Todd T.; Poet, Jeff L.; Frohlich, Donald; Goodman, Anya L.; Gosser, Yuying; Hauser, Charles; Hoopes, Laura L.M.; Johnson, Diana; Jones, Christopher J.; Kaehler, Marian; Kokan, Nighat; Kopp, Olga R.; Kuleck, Gary A.; McNeil, Gerard; Moss, Robert; Myka, Jennifer L.; Nagengast, Alexis; Morris, Robert; Overvoorde, Paul J.; Shoop, Elizabeth; Parrish, Susan; Reed, Kelynne; Regisford, E. Gloria; Revie, Dennis; Rosenwald, Anne G.; Saville, Ken; Schroeder, Stephanie; Shaw, Mary; Skuse, Gary; Smith, Christopher; Smith, Mary; Spana, Eric P.; Spratt, Mary; Stamm, Joyce; Thompson, Jeff S.; Wawersik, Matthew; Wilson, Barbara A.; Youngblom, Jim; Leung, Wilson; Buhler, Jeremy; Mardis, Elaine R.; Lopatto, David

2010-01-01

Genomics is not only essential for students to understand biology but also provides unprecedented opportunities for undergraduate research. The goal of the Genomics Education Partnership (GEP), a collaboration between a growing number of colleges and universities around the country and the Department of Biology and Genome Center of Washington University in St. Louis, is to provide such research opportunities. Using a versatile curriculum that has been adapted to many different class settings, GEP undergraduates undertake projects to bring draft-quality genomic sequence up to high quality and/or participate in the annotation of these sequences. GEP undergraduates have improved more than 2 million bases of draft genomic sequence from several species of Drosophila and have produced hundreds of gene models using evidence-based manual annotation. Students appreciate their ability to make a contribution to ongoing research, and report increased independence and a more active learning approach after participation in GEP projects. They show knowledge gains on pre- and postcourse quizzes about genes and genomes and in bioinformatic analysis. Participating faculty also report professional gains, increased access to genomics-related technology, and an overall positive experience. We have found that using a genomics research project as the core of a laboratory course is rewarding for both faculty and students. PMID:20194808
Brief Guide to Genomics: DNA, Genes and Genomes

Science.gov (United States)

... clinic. Most new drugs based on genome-based research are estimated to be at least 10 to 15 years away, though recent genome-driven efforts in lipid-lowering therapy have considerably shortened that interval. According ...
MIPS plant genome information resources.

Science.gov (United States)

Spannagl, Manuel; Haberer, Georg; Ernst, Rebecca; Schoof, Heiko; Mayer, Klaus F X

2007-01-01

The Munich Institute for Protein Sequences (MIPS) has been involved in maintaining plant genome databases since the Arabidopsis thaliana genome project. Genome databases and analysis resources have focused on individual genomes and aim to provide flexible and maintainable data sets for model plant genomes as a backbone against which experimental data, for example from high-throughput functional genomics, can be organized and evaluated. In addition, model genomes also form a scaffold for comparative genomics, and much can be learned from genome-wide evolutionary studies.

Genomic interrogation of mechanism(s) underlying cellular responses to toxicants

International Nuclear Information System (INIS)

Amin, Rupesh P.; Hamadeh, Hisham K.; Bushel, Pierre R.; Bennett, Lee; Afshari, Cynthia A.; Paules, Richard S.

2002-01-01

Assessment of the impact of xenobiotic exposure on human health and disease progression is complex. Knowledge of mode(s) of action, including mechanism(s) contributing to toxicity and disease progression, is valuable for evaluating compounds. Toxicogenomics, the subdiscipline which merges genomics with toxicology, holds the promise of contributing significantly toward the goal of elucidating mechanism(s) by studying genome-wide effects of xenobiotics. Global gene expression profiling, revolutionized by microarray technology and a crucial aspect of a toxicogenomic study, allows measuring transcriptional modulation of thousands of genes following exposure to a xenobiotic. We use our results from previous studies on compounds representing two different classes of xenobiotics (barbiturate and peroxisome proliferator) to discuss the application of computational approaches for analyzing microarray data to elucidate mechanism(s) underlying cellular responses to toxicants. In particular, our laboratory demonstrated that chemical-specific patterns of gene expression can be revealed using cDNA microarrays. Transcript profiling provides discrimination between classes of toxicants, as well as genome-wide insight into mechanism(s) of toxicity and disease progression. Ultimately, the expectation is that novel approaches for predicting xenobiotic toxicity in humans will emerge from such information.
Malaria Parasite Metabolic Pathways (MPMP) Upgraded with Targeted Chemical Compounds.

Science.gov (United States)

Ginsburg, Hagai; Abdel-Haleem, Alyaa M

2016-01-01

Malaria Parasite Metabolic Pathways (MPMP) is the website for the functional genomics of intraerythrocytic Plasmodium falciparum. All the published information about targeted chemical compounds has now been added. Users can find the drug target and publication details linked to a drug database for further information about the medicinal properties of each compound. Copyright © 2015 Elsevier Ltd. All rights reserved.
Tumor Genomic Profiling in Breast Cancer Patients Using Targeted Massively Parallel Sequencing

Science.gov (United States)

2016-03-01

2015 “Cancer Care as a Model for Precision Medicine” MIT Collaborative Series Massachusetts Institute of Technology Invited Talk 2016 “Cancer...Precision Medicine” MIT -CHIEF Series Massachusetts Institute of Technology Invited Talk National 2013 “CanSeq: The Use of Whole Exome Sequencing To...Pennsylvania Philadelphia, PA Invited Talk 2014 “Clinical Genomics and Precision Cancer Medicine” Center for Molecular Oncology Memorial Sloan
The AOP framework and causality: Meeting chemical risk ...

Science.gov (United States)

Chemical safety assessments are expanding from a focus on a few chemicals (or chemical mixtures) to the broader “universe” of thousands, if not hundreds of thousands of substances that potentially could impact humans or the environment. This is exemplified in regulatory activities such as the REACH program in Europe, or the recent reauthorization of TSCA in the US, which require consideration of the potential impacts of a much greater number of chemicals than in the past. The data needed to address these types of legislated mandates cannot realistically be obtained solely through using the whole animal testing approaches historically employed for chemical risk assessment. Rather, there needs to be an increased emphasis on cost-effective tools that enable robust prediction of potential chemical impacts when empirical data are lacking. Concurrent with the realization that predictive methods will need to play an increasingly prominent role in regulatory toxicology has been the recent explosion in technology in the biological sciences enabling collection of large amounts of pathway-based molecular and biochemical data. For example, genomic techniques and high-throughput (robotic-based) in vitro testing enable the generation of knowledge concerning the effects of chemical perturbation on biological systems in an increasingly efficient and rapid manner. However, a pressing need stemming from these technological advances is the ability to actually apply th
Ensembl Genomes 2013: scaling up access to genome-wide data.

Science.gov (United States)

Kersey, Paul Julian; Allen, James E; Christensen, Mikkel; Davis, Paul; Falin, Lee J; Grabmueller, Christoph; Hughes, Daniel Seth Toney; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Langridge, Nicholas; McDowall, Mark D; Maheswari, Uma; Maslen, Gareth; Nuhn, Michael; Ong, Chuang Kee; Paulini, Michael; Pedro, Helder; Toneva, Iliana; Tuli, Mary Ann; Walts, Brandon; Williams, Gareth; Wilson, Derek; Youens-Clark, Ken; Monaco, Marcela K; Stein, Joshua; Wei, Xuehong; Ware, Doreen; Bolser, Daniel M; Howe, Kevin Lee; Kulesha, Eugene; Lawson, Daniel; Staines, Daniel Michael

2014-01-01

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies for genome annotation, analysis and dissemination, developed in the context of the vertebrate-focused Ensembl project, and provides a complementary set of resources for non-vertebrate species through a consistent set of programmatic and interactive interfaces. These provide access to data including reference sequence, gene models, transcriptional data, polymorphisms and comparative analysis. This article provides an update to the previous publications about the resource, with a focus on recent developments. These include the addition of important new genomes (and related data sets) including crop plants, vectors of human disease and eukaryotic pathogens. In addition, the resource has scaled up its representation of bacterial genomes, and now includes the genomes of over 9000 bacteria. Specific extensions to the web and programmatic interfaces have been developed to support users in navigating these large data sets. Looking forward, analytic tools to allow targeted selection of data for visualization and download are likely to become increasingly important in future as the number of available genomes increases within all domains of life, and some of the challenges faced in representing bacterial data are likely to become commonplace for eukaryotes in future.
Recombination and its impact on the genome of the haplodiploid parasitoid wasp Nasonia.

Directory of Open Access Journals (Sweden)

Oliver Niehuis

2010-01-01

Full Text Available Homologous meiotic recombination occurs in most sexually reproducing organisms, yet its evolutionary advantages are elusive. Previous research explored recombination in the honeybee, a eusocial hymenopteran with an exceptionally high genome-wide recombination rate. A comparable study in a non-social member of the Hymenoptera that would disentangle the impact of sociality from Hymenoptera-specific features such as haplodiploidy on the evolution of the high genome-wide recombination rate in social Hymenoptera is missing. Utilizing single-nucleotide polymorphisms (SNPs) between two Nasonia parasitoid wasp genomes, we developed a SNP genotyping microarray to infer a high-density linkage map for Nasonia. The map comprises 1,255 markers with an average distance of 0.3 cM. The mapped markers enabled us to arrange 265 scaffolds of the Nasonia genome assembly 1.0 on the linkage map, representing 63.6% of the assembled N. vitripennis genome. We estimated a genome-wide recombination rate of 1.4-1.5 cM/Mb for Nasonia, which is less than one tenth of the rate reported for the honeybee. The local recombination rate in Nasonia is positively correlated with the distance to the center of the linkage groups, GC content, and the proportion of simple repeats. In contrast to the honeybee genome, gene density in the parasitoid wasp genome is positively associated with the recombination rate; regions of low recombination are characterized by fewer genes with larger introns and by a greater distance between genes. Finally, we found that genes in regions of the genome with a low recombination frequency tend to have a higher ratio of non-synonymous to synonymous substitutions, likely due to the accumulation of slightly deleterious non-synonymous substitutions. These findings are consistent with the hypothesis that recombination reduces interference between linked sites and thereby facilitates adaptive evolution and the purging of deleterious mutations. Our results imply
Toward genome-enabled mycology.

Science.gov (United States)

Hibbett, David S; Stajich, Jason E; Spatafora, Joseph W

2013-01-01

Genome-enabled mycology is a rapidly expanding field that is characterized by the pervasive use of genome-scale data and associated computational tools in all aspects of fungal biology. Genome-enabled mycology is integrative and often requires teams of researchers with diverse skills in organismal mycology, bioinformatics and molecular biology. This issue of Mycologia presents the first complete fungal genomes in the history of the journal, reflecting the ongoing transformation of mycology into a genome-enabled science. Here, we consider the prospects for genome-enabled mycology and the technical and social challenges that will need to be overcome to grow the database of complete fungal genomes and enable all fungal biologists to make use of the new data.
Gain and loss of phototrophic genes revealed by comparison of two Citromicrobium bacterial genomes.

Directory of Open Access Journals (Sweden)

Qiang Zheng

Full Text Available Proteobacteria are thought to have diverged from a phototrophic ancestor, according to the scattered distribution of phototrophy throughout the proteobacterial clade, and so the occurrence of numerous closely related phototrophic and chemotrophic microorganisms may be the result of the loss of genes for phototrophy. A widespread form of bacterial phototrophy is based on the photochemical reaction center, encoded by puf and puh operons that typically are in a 'photosynthesis gene cluster' (abbreviated as the PGC) with pigment biosynthesis genes. Comparison of two closely related Citromicrobial genomes (98.1% sequence identity of complete 16S rRNA genes), Citromicrobium sp. JL354, which contains two copies of reaction center genes, and Citromicrobium strain JLT1363, which is chemotrophic, revealed evidence for the loss of phototrophic genes. However, evidence of horizontal gene transfer was found in these two bacterial genomes. An incomplete PGC (pufLMC-puhCBA) in strain JL354 was located within an integrating conjugative element, which indicates a potential mechanism for the horizontal transfer of genes for phototrophy.
Paramagnetic centers in nanocrystalline TiC/C system

International Nuclear Information System (INIS)

Guskos, N.; Bodziony, T.; Maryniak, M.; Typek, J.; Biedunkiewicz, A.

2008-01-01

Electron paramagnetic resonance is applied to study the defect centers in nanocrystalline titanium carbide dispersed in carbon matrix (TiCx/C) synthesized by the non-hydrolytic sol-gel process. The presence of Ti3+ paramagnetic centers is identified below 120 K along with a minor contribution from localized defect spins coupled with the conduction electron system in the carbon matrix. The temperature dependence of the resonance intensity of the latter signal indicates weak antiferromagnetic interactions. The presence of paramagnetic centers connected with trivalent titanium is suggested to be the result of chemical disorder, which can be further related to the observed anomalous behavior of conductivity, hardness, and corrosion resistance of nanocrystalline TiCx/C.
Routine Whole-Genome Sequencing for Outbreak Investigations of Staphylococcus aureus in a National Reference Center

Directory of Open Access Journals (Sweden)

Geraldine Durand

2018-03-01

Full Text Available The French National Reference Center for Staphylococci currently uses DNA arrays and spa typing for the initial epidemiological characterization of Staphylococcus aureus strains. We here describe the use of whole-genome sequencing (WGS) to investigate retrospectively four distinct and virulent S. aureus lineages [clonal complexes (CCs): CC1, CC5, CC8, CC30] involved in hospital and community outbreaks or sporadic infections in France. We used a WGS bioinformatics pipeline based on de novo assembly (reference-free approach), single nucleotide polymorphism analysis, and on the inclusion of epidemiological markers. We examined the phylogeographic diversity of the French dominant hospital-acquired CC8-MRSA (methicillin-resistant S. aureus) Lyon clone through WGS analysis, which did not demonstrate evidence of large-scale geographic clustering. We analyzed sporadic cases along with two outbreaks of a CC1-MSSA (methicillin-susceptible S. aureus) clone containing the Panton–Valentine leukocidin (PVL), and results showed that two sporadic cases were closely related. We investigated an outbreak of PVL-positive CC30-MSSA in a school environment and were able to reconstruct the transmission history between eight families. We explored different outbreaks among newborns due to the CC5-MRSA Geraldine clone and we found evidence of an unsuspected link between two otherwise distinct outbreaks. Here, WGS provides the resolving power to disprove transmission events indicated by conventional methods (same sequence type, spa type, toxin profile, and antibiotic resistance profile) and, most importantly, WGS can reveal unsuspected transmission events. Therefore, WGS makes it possible to better describe and understand outbreaks and the (inter)national dissemination of S. aureus lineages. Our findings underscore the importance of adding WGS for (inter)national surveillance of infections caused by virulent clones of S. aureus but also substantiate the fact that technological optimization at
Plantagora: modeling whole genome sequencing and assembly of plant genomes.

Directory of Open Access Journals (Sweden)

Roger Barthelson

Full Text Available BACKGROUND: Genomics studies are being revolutionized by the next generation sequencing technologies, which have made whole genome sequencing much more accessible to the average researcher. Whole genome sequencing with the new technologies is a developing art that, despite the large volumes of data that can be produced, may still fail to provide a clear and thorough map of a genome. The Plantagora project was conceived to address specifically the gap between having the technical tools for genome sequencing and knowing precisely the best way to use them. METHODOLOGY/PRINCIPAL FINDINGS: For Plantagora, a platform was created for generating simulated reads from several different plant genomes of different sizes. The resulting read files mimicked either 454 or Illumina reads, with varying paired end spacing. Thousands of datasets of reads were created, most derived from our primary model genome, rice chromosome one. All reads were assembled with different software assemblers, including Newbler, Abyss, and SOAPdenovo, and the resulting assemblies were evaluated by an extensive battery of metrics chosen for these studies. The metrics included both statistics of the assembly sequences and fidelity-related measures derived by alignment of the assemblies to the original genome source for the reads. The results were presented in a website, which includes a data graphing tool, all created to help the user compare rapidly the feasibility and effectiveness of different sequencing and assembly strategies prior to testing an approach in the lab. Some of our own conclusions regarding the different strategies were also recorded on the website. CONCLUSIONS/SIGNIFICANCE: Plantagora provides a substantial body of information for comparing different approaches to sequencing a plant genome, and some conclusions regarding some of the specific approaches. Plantagora also provides a platform of metrics and tools for studying the process of sequencing and assembly
Genomes in turmoil: quantification of genome dynamics in prokaryote supergenomes.

Science.gov (United States)

Puigbò, Pere; Lobkovsky, Alexander E; Kristensen, David M; Wolf, Yuri I; Koonin, Eugene V

2014-08-21

Genomes of bacteria and archaea (collectively, prokaryotes) appear to exist in incessant flux, expanding via horizontal gene transfer and gene duplication, and contracting via gene loss. However, the actual rates of genome dynamics and relative contributions of different types of event across the diversity of prokaryotes are largely unknown, as are the sizes of microbial supergenomes, i.e. pools of genes that are accessible to the given microbial species. We performed a comprehensive analysis of the genome dynamics in 35 groups (34 bacterial and one archaeal) of closely related microbial genomes using a phylogenetic birth-and-death maximum likelihood model to quantify the rates of gene family gain and loss, as well as expansion and reduction. The results show that loss of gene families dominates the evolution of prokaryotes, occurring at approximately three times the rate of gain. The rates of gene family expansion and reduction are typically seven and twenty times less than the gain and loss rates, respectively. Thus, the prevailing mode of evolution in bacteria and archaea is genome contraction, which is partially compensated by the gain of new gene families via horizontal gene transfer. However, the rates of gene family gain, loss, expansion and reduction vary within wide ranges, with the most stable genomes showing rates about 25 times lower than the most dynamic genomes. For many groups, the supergenome estimated from the fraction of repetitive gene family gains includes about tenfold more gene families than the typical genome in the group although some groups appear to have vast, 'open' supergenomes. Reconstruction of evolution for groups of closely related bacteria and archaea reveals an extremely rapid and highly variable flux of genes in evolving microbial genomes, demonstrates that extensive gene loss and horizontal gene transfer leading to innovation are the two dominant evolutionary processes, and yields robust estimates of the supergenome size.
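The ratios quoted in this abstract can be turned into a small worked example. The sketch below is only an illustration, not the authors' maximum likelihood model: it assumes "seven and twenty times less" means expansion = gain/7 and reduction = loss/20, and it normalizes the gain rate to 1. The function name `relative_rates` is hypothetical.

```python
def relative_rates(gain=1.0):
    """Relative event rates implied by the ratios quoted in the abstract.

    Assumptions (not taken from the paper's model): rates are expressed
    relative to the gene family gain rate, which is normalized to `gain`.
    """
    loss = 3.0 * gain        # loss occurs at roughly three times the rate of gain
    expansion = gain / 7.0   # expansion is about seven times less than gain
    reduction = loss / 20.0  # reduction is about twenty times less than loss
    return {"gain": gain, "loss": loss,
            "expansion": expansion, "reduction": reduction}


rates = relative_rates()
# A negative net flux of gene families corresponds to genome contraction.
net_family_flux = rates["gain"] - rates["loss"]
print(rates, net_family_flux)
```

Under these assumed ratios the net flux of gene families is negative, which matches the abstract's statement that contraction is the prevailing mode and is only partially compensated by gain.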
Origins of chemical diversity of back-arc basin basalts: A segment-scale study of the Eastern Lau Spreading Center

Science.gov (United States)

Bézos, Antoine; Escrig, Stéphane; Langmuir, Charles H.; Michael, Peter J.; Asimow, Paul D.

2009-06-01

We report major, trace, and volatile element data on basaltic glasses from the northernmost segment of the Eastern Lau Spreading Center (ELSC1) in the Lau back-arc basin to further test and constrain models of back-arc volcanism. The zero-age samples come from 47 precisely collected stations from an 85 km length spreading center. The chemical data covary similarly to other back-arc systems but with tighter correlations and well-developed spatial systematics. We confirm a correlation between volatile content and apparent extent of melting of the mantle source but also show that the data cannot be reproduced by the model of isobaric addition of water that has been broadly applied to back-arc basins. The new data also confirm that there is no relationship between mantle temperature and the wet melting productivity. Two distinct magmatic provinces can be identified along the ELSC1 axis, a southern province influenced by a "wet component" with strong affinities to arc volcanism and a northern province influenced by a "damp component" intermediate between enriched mid-ocean ridge basalts (E-MORB) and arc basalts. High-field strength elements and rare earth elements are all mobilized to some extent by the wet component, and the detailed composition of this component is determined. It differs in significant ways from the Mariana component reported by E. Stolper and S. Newman (1994), particularly by having lower abundances of most elements relative to H2O. The differences can be explained if the slab temperature is higher for the Mariana and the source from which the fluid is derived is more enriched. The ELSC1 damp component is best explained by mixing between the wet component and an E-MORB-like component. We propose that mixing between water-rich fluids and low-degree silicate melts occurs at depth in the subduction zone to generate the chemical diversity of the ELSC1 subduction components. These modified sources then rise independently to the surface and melt, and these
A Web-Based Comparative Genomics Tutorial for Investigating Microbial Genomes

Directory of Open Access Journals (Sweden)

Michael Strong

2009-12-01

Full Text Available As the number of completely sequenced microbial genomes continues to rise at an impressive rate, it is important to prepare students with the skills necessary to investigate microorganisms at the genomic level. As a part of the core curriculum for first-year graduate students in the biological sciences, we have implemented a web-based tutorial to introduce students to the fields of comparative and functional genomics. The tutorial focuses on recent computational methods for identifying functionally linked genes and proteins on a genome-wide scale and was used to introduce students to the Rosetta Stone, Phylogenetic Profile, conserved Gene Neighbor, and Operon computational methods. Students learned to use a number of publicly available web servers and databases to identify functionally linked genes in the Escherichia coli genome, with emphasis on genome organization and operon structure. The overall effectiveness of the tutorial was assessed based on student evaluations and homework assignments. The tutorial is available to other educators at http://www.doe-mbi.ucla.edu/~strong/m253.php.
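Of the methods named in this abstract, the Phylogenetic Profile method is the easiest to illustrate in a few lines. The sketch below is a minimal toy version, assuming each gene is described by a presence/absence vector across a set of genomes and that genes with matching vectors are candidates for a functional link. The gene names, profiles, and the `max_mismatches` parameter are hypothetical and are not taken from the tutorial.

```python
from itertools import combinations


def hamming(a, b):
    """Number of genomes in which two presence/absence profiles differ."""
    return sum(x != y for x, y in zip(a, b))


def linked_pairs(profiles, max_mismatches=0):
    """Return gene pairs whose profiles differ in at most `max_mismatches` genomes."""
    pairs = []
    for g1, g2 in combinations(sorted(profiles), 2):
        if hamming(profiles[g1], profiles[g2]) <= max_mismatches:
            pairs.append((g1, g2))
    return pairs


# Hypothetical profiles: 1 = a homolog is present in that genome, 0 = absent.
profiles = {
    "geneA": (1, 0, 1, 1, 0),
    "geneB": (1, 0, 1, 1, 0),
    "geneC": (0, 1, 1, 0, 1),
    "geneD": (1, 0, 1, 0, 0),
}

print(linked_pairs(profiles))                     # exact profile matches only
print(linked_pairs(profiles, max_mismatches=1))   # tolerate one differing genome
```

Real analyses use many more genomes and a statistical score rather than a fixed mismatch threshold; the threshold here only keeps the example short.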
Dana-Farber Cancer Institute: Identification of Therapeutic Targets in KRAS Driven Lung Cancer | Office of Cancer Genomics

Science.gov (United States)

The CTD2 Center at Dana Farber Cancer Institute focuses on the use of high-throughput genetic and bioinformatic approaches to identify and credential oncogenes and co-dependencies in cancers. This Center aims to provide the cancer research community with information that will facilitate the prioritization of targets based on both genomic and functional evidence, inform the most appropriate genetic context for downstream mechanistic and validation studies, and enable the translation of this information into therapeutics and diagnostics.
Providing guidance for genomics-based cancer treatment decisions: insights from stakeholder engagement for post-prostatectomy radiation therapy.

Science.gov (United States)

Abe, James; Lobo, Jennifer M; Trifiletti, Daniel M; Showalter, Timothy N

2017-08-24

Despite the emergence of genomics-based risk prediction tools in oncology, there is not yet an established framework for communication of test results to cancer patients to support shared decision-making. We report findings from a stakeholder engagement program that aimed to develop a framework for using Markov models with individualized model inputs, including genomics-based estimates of cancer recurrence probability, to generate personalized decision aids for prostate cancer patients faced with radiation therapy treatment decisions after prostatectomy. We engaged a total of 22 stakeholders, including: prostate cancer patients, urological surgeons, radiation oncologists, genomic testing industry representatives, and biomedical informatics faculty. Slides were presented at each meeting to provide background information regarding the analytical framework. Participants were invited to provide feedback during the meeting, including revising the overall project aims. Stakeholder meeting content was reviewed and summarized by stakeholder group and by theme. The majority of stakeholder suggestions focused on aspects of decision aid design and formatting. Stakeholders were enthusiastic about the potential value of using decision analysis modeling with personalized model inputs for cancer recurrence risk, as well as competing risks from age and comorbidities, to generate a patient-centered tool to assist decision-making. Stakeholders did not view privacy considerations as a major barrier to the proposed decision aid program. A common theme was that decision aids should be portable across multiple platforms (electronic and paper), should allow for interaction by the user to adjust model inputs iteratively, and should be available to patients both before and during consult appointments. Emphasis was placed on the challenge of explaining the model's composite result of quality-adjusted life years. A range of stakeholders provided valuable insights regarding the design of a personalized decision
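The abstract does not describe the structure of its Markov models, so the following is only a generic sketch of how a Markov cohort model can turn an individualized recurrence probability into quality-adjusted life years. The three states (well, recurred, dead), the function name `expected_qalys`, and every numeric value are hypothetical, and no discounting is applied.

```python
def expected_qalys(p_recur, p_die_well, p_die_recurred,
                   u_well, u_recurred, cycles):
    """Expected QALYs per patient from a three-state Markov cohort model.

    Hypothetical states: well, recurred, dead. The cohort starts in "well";
    each cycle is one year and probabilities are per-cycle transitions.
    """
    well, recurred = 1.0, 0.0
    total = 0.0
    for _ in range(cycles):
        # Accumulate the utility-weighted time spent alive in this cycle.
        total += well * u_well + recurred * u_recurred
        # Move the cohort forward one cycle.
        new_well = well * (1.0 - p_recur - p_die_well)
        new_recurred = well * p_recur + recurred * (1.0 - p_die_recurred)
        well, recurred = new_well, new_recurred
    return total


# Illustrative inputs only: two patients who differ in recurrence probability.
low_risk = expected_qalys(0.05, 0.02, 0.10, 0.9, 0.6, cycles=10)
high_risk = expected_qalys(0.10, 0.02, 0.10, 0.9, 0.6, cycles=10)
print(round(low_risk, 2), round(high_risk, 2))
```

Changing `p_recur` for an individual patient is the kind of iterative input adjustment the stakeholders asked for; the single QALY figure that results is the composite result the abstract describes as hard to explain.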
Reduced representation approaches to interrogate genome diversity in large repetitive plant genomes.

Science.gov (United States)

Hirsch, Cory D; Evans, Joseph; Buell, C Robin; Hirsch, Candice N

2014-07-01

Technology and software improvements in the last decade now provide methodologies to access the genome sequence of not only a single accession, but also multiple accessions of plant species. This provides a means to interrogate species diversity at the genome level. Ample diversity among accessions in a collection of species can be found, including single-nucleotide polymorphisms, insertions and deletions, copy number variation and presence/absence variation. For species with small, non-repetitive rich genomes, re-sequencing of query accessions is robust, highly informative, and economically feasible. However, for species with moderate to large sized repetitive-rich genomes, technical and economic barriers prevent en masse genome re-sequencing of accessions. Multiple approaches to access a focused subset of loci in species with larger genomes have been developed, including reduced representation sequencing, exome capture and transcriptome sequencing. Collectively, these approaches have enabled interrogation of diversity on a genome scale for large plant genomes, including crop species important to worldwide food security. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
The CRISPR-Cas9 technology: Closer to the ultimate toolkit for targeted genome editing.

Science.gov (United States)

Quétier, Francis

2016-01-01

The first period of plant genome editing was based on Agrobacterium; chemical mutagenesis by EMS (ethyl methanesulfonate) and ionizing radiations; each of these technologies led to randomly distributed genome modifications. The second period is associated with the discoveries of homing and meganuclease enzymes during the 80s and 90s, which were then engineered to provide efficient tools for targeted editing. From 2006 to 2012, a few crop plants were successfully and precisely modified using zinc-finger nucleases. A third wave of improvement in genome editing, which led to a dramatic decrease in off-target events, was achieved in 2009-2011 with the TALEN technology. The latest revolution surfaced in 2013 with the CRISPR-Cas9 system, whose high efficiency and technical ease of use is really impressive; scientists can use in-house kits or commercially available kits; the only two requirements are to carefully choose the location of the DNA double strand breaks to be induced and then to order an oligonucleotide. While this close-to-ultimate toolkit for targeted editing of genomes represents dramatic scientific progress which allows the development of more complex useful agronomic traits through synthetic biology, the social acceptance of genome editing remains regularly questioned by anti-GMO citizens and organizations. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Between Two Fern Genomes

Science.gov (United States)

2014-01-01

Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969
Exploring Other Genomes: Bacteria.

Science.gov (United States)

Flannery, Maura C.

2001-01-01

Points out the importance of genomes other than the human genome project and provides information on the identified bacterial genomes Pseudomonas aeruginosa, Leprosy, Cholera, Meningitis, Tuberculosis, Bubonic Plague, and plant pathogens. Considers the computer's use in genome studies. (Contains 14 references.) (YDS)

Comparative Pan-Genome Analysis of Piscirickettsia salmonis Reveals Genomic Divergences within Genogroups

Directory of Open Access Journals (Sweden)

Guillermo Nourdin-Galindo

2017-10-01

Full Text Available Piscirickettsia salmonis is the etiological agent of salmonid rickettsial septicemia, a disease that seriously affects the salmonid industry. Despite efforts to genomically characterize P. salmonis, functional information on the life cycle, pathogenesis mechanisms, diagnosis, treatment, and control of this fish pathogen remains lacking. To address this knowledge gap, the present study conducted an in silico pan-genome analysis of 19 P. salmonis strains from distinct geographic locations and genogroups. Results revealed an expected open pan-genome of 3,463 genes and a core-genome of 1,732 genes. Two marked genogroups were identified, as confirmed by phylogenetic and phylogenomic relationships to the LF-89 and EM-90 reference strains, as well as by assessments of genomic structures. Different structural configurations were found for the six identified copies of the ribosomal operon in the P. salmonis genome, indicating translocation throughout the genetic material. Chromosomal divergences in genomic localization and quantity of genetic cassettes were also found for the Dot/Icm type IVB secretion system. To determine divergences between core-genomes, additional pan-genome descriptions were compiled for the so-termed LF and EM genogroups. Open pan-genomes composed of 2,924 and 2,778 genes and core-genomes composed of 2,170 and 2,228 genes were respectively found for the LF and EM genogroups. The core-genomes were functionally annotated using the Gene Ontology, KEGG, and Virulence Factor databases, revealing the presence of several shared groups of genes related to basic function of intracellular survival and bacterial pathogenesis. Additionally, the specific pan-genomes for the LF and EM genogroups were defined, resulting in the identification of 148 and 273 exclusive proteins, respectively. Notably, specific virulence factors linked to adherence, colonization, invasion factors, and endotoxins were established. The obtained data suggest that these
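The pan-genome, core-genome, and genogroup-exclusive counts reported in this abstract correspond to simple set operations once genes have been clustered into families. The sketch below assumes that clustering has already been done and that each strain is represented by a set of gene family identifiers. The strain names, gene identifiers, and function names are hypothetical and do not reproduce the study's pipeline.

```python
def pan_and_core(strains):
    """Return (pan-genome, core-genome) for a dict of strain -> set of gene families."""
    gene_sets = list(strains.values())
    pan = set().union(*gene_sets)         # families present in at least one strain
    core = set.intersection(*gene_sets)   # families present in every strain
    return pan, core


def group_exclusive(strains, group, others):
    """Families found in at least one strain of `group` and in no strain of `others`."""
    in_group = set().union(*(strains[s] for s in group))
    in_others = set().union(*(strains[s] for s in others))
    return in_group - in_others


# Toy data: two hypothetical strains per genogroup.
strains = {
    "LF_1": {"g1", "g2", "g3", "g4"},
    "LF_2": {"g1", "g2", "g3", "g5"},
    "EM_1": {"g1", "g2", "g6"},
    "EM_2": {"g1", "g2", "g6", "g7"},
}

pan, core = pan_and_core(strains)
lf_only = group_exclusive(strains, ["LF_1", "LF_2"], ["EM_1", "EM_2"])
em_only = group_exclusive(strains, ["EM_1", "EM_2"], ["LF_1", "LF_2"])
print(len(pan), sorted(core), sorted(lf_only), sorted(em_only))
```

A pan-genome is usually called open when the union keeps growing as strains are added; that would be assessed by repeating the union over increasing subsets of strains, which this sketch does not do.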
Excitation of surface plasmon polariton modes with multiple nitrogen vacancy centers in single nanodiamonds

DEFF Research Database (Denmark)

Kumar, Shailesh; Lausen, Jens L.; Garcia-Ortiz, Cesar E.

2016-01-01

) are especially useful as biological fluorophores due to their chemical neutrality, brightness and room-temperature photostability. Furthermore, NDs containing multiple NV centers also have potential in high-precision magnetic field and temperature sensing. Coupling NV centers to propagating surface plasmon...
A Genome-Wide Landscape of Retrocopies in Primate Genomes.

Science.gov (United States)

Navarro, Fábio C P; Galante, Pedro A F

2015-07-29

Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old World Monkeys, including humans), but a surprisingly large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and show no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
First genome report on novel sequence types of Neisseria meningitidis: ST12777 and ST12778.

Science.gov (United States)

Veeraraghavan, Balaji; Lal, Binesh; Devanga Ragupathi, Naveen Kumar; Neeravi, Iyyan Raj; Jeyaraman, Ranjith; Varghese, Rosemol; Paul, Miracle Magdalene; Baskaran, Ashtawarthani; Ranjan, Ranjini

2018-03-01

Neisseria meningitidis is an important causative agent of meningitis and/or sepsis with high morbidity and mortality. Baseline genome data on N. meningitidis, especially from developing countries such as India, are lacking. This study aimed to investigate the whole genome sequences of N. meningitidis isolates from a tertiary care centre in India. Whole-genome sequencing was performed using an Ion Torrent™ Personal Genome Machine™ (PGM) with 400-bp chemistry. Data were assembled de novo using SPAdes Genome Assembler v.5.0.0.0. Sequence annotation was performed through PATRIC, RAST and the NCBI PGAAP server. Downstream analysis of the isolates was performed using the Center for Genomic Epidemiology databases for antimicrobial resistance genes and sequence types. Virulence factors and CRISPR were analysed using the PubMLST database and CRISPRFinder, respectively. This study reports the whole genome shotgun sequences of eight N. meningitidis isolates from bloodstream infections. The genome data revealed two novel sequence types (ST12777 and ST12778), along with ST11, ST437 and ST6928. The virulence profile of the isolates matched their sequence types. All isolates were negative for plasmid-mediated resistance genes. To the best of our knowledge, this is the first report of ST11 and ST437 N. meningitidis isolates in India along with two novel sequence types (ST12777 and ST12778). These results indicate that the sequence types circulating in India are diverse and require continuous monitoring. Further studies strengthening the genome data on N. meningitidis are required to understand the prevalence, spread, exact resistance and virulence mechanisms along with serotypes. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
Journal of Chemical Sciences | Indian Academy of Sciences

Indian Academy of Sciences (India)

Journal of Chemical Sciences; Volume 126; Issue 1 ... Centre for Nanotechnology Research, VIT University, Vellore 632 014, India; Department of ... Nissan Technology & Business Center India (P) Ltd., Chennai 603002, India ...
CRISPR–Cas system enables fast and simple genome editing of industrial Saccharomyces cerevisiae strains

Directory of Open Access Journals (Sweden)

Vratislav Stovicek

2015-12-01

Full Text Available There is a demand to develop 3rd generation biorefineries that integrate energy production with the production of higher value chemicals from renewable feedstocks. Here, robust and stress-tolerant industrial strains of Saccharomyces cerevisiae will be suitable production organisms. However, their genetic manipulation is challenging, as they are usually diploid or polyploid. Therefore, there is a need to develop more efficient genetic engineering tools. We applied a CRISPR–Cas9 system for genome editing of different industrial strains, and show simultaneous disruption of two alleles of a gene in several unrelated strains with the efficiency ranging between 65% and 78%. We also achieved simultaneous disruption and knock-in of a reporter gene, and demonstrate the applicability of the method by designing lactic acid-producing strains in a single transformation event, where insertion of a heterologous gene and disruption of two endogenous genes occurred simultaneously. Our study provides a foundation for efficient engineering of industrial yeast cell factories. Keywords: CRISPR–Cas9, Genome editing, Industrial yeast, Biorefineries, Chemical production
Comparative genomics of the marine bacterial genus Glaciecola reveals the high degree of genomic diversity and genomic characteristic for cold adaptation.

Science.gov (United States)

Qin, Qi-Long; Xie, Bin-Bin; Yu, Yong; Shu, Yan-Li; Rong, Jin-Cheng; Zhang, Yan-Jiao; Zhao, Dian-Li; Chen, Xiu-Lan; Zhang, Xi-Ying; Chen, Bo; Zhou, Bai-Cheng; Zhang, Yu-Zhong

2014-06-01

To what extent the genomes of different species belonging to one genus can be diverse and the relationship between genomic differentiation and environmental factor remain unclear for oceanic bacteria. With many new bacterial genera and species being isolated from marine environments, this question warrants attention. In this study, we sequenced all the type strains of the published species of Glaciecola, a recently defined cold-adapted genus with species from diverse marine locations, to study the genomic diversity and cold-adaptation strategy in this genus. The genome size diverged widely from 3.08 to 5.96 Mb, which can be explained by massive gene gain and loss events. Horizontal gene transfer and new gene emergence contributed substantially to the genome size expansion. The genus Glaciecola had an open pan-genome. Comparative genomic research indicated that species of the genus Glaciecola had high diversity in genome size, gene content and genetic relatedness. This may be prevalent in marine bacterial genera considering the dynamic and complex environments of the ocean. Species of Glaciecola had some common genomic features related to cold adaptation, which enable them to thrive and play a role in biogeochemical cycle in the cold marine environments.
Under pressure: evolutionary engineering of yeast strains for improved performance in fuels and chemicals production

NARCIS (Netherlands)

Mans, R.; Daran, J.G.; Pronk, J.T.

2018-01-01

Evolutionary engineering, which uses laboratory evolution to select for industrially relevant traits, is a popular strategy in the development of high-performing yeast strains for industrial production of fuels and chemicals. By integrating whole-genome sequencing, bioinformatics, classical
MicroScope: a platform for microbial genome annotation and comparative genomics.

Science.gov (United States)

Vallenet, D; Engelen, S; Mornico, D; Cruveiller, S; Fleury, L; Lajus, A; Rouy, Z; Roche, D; Salvignol, G; Scarpelli, C; Médigue, C

2009-01-01

The initial outcome of genome sequencing is the creation of long text strings written in a four letter alphabet. The role of in silico sequence analysis is to assist biologists in the act of associating biological knowledge with these sequences, allowing investigators to make inferences and predictions that can be tested experimentally. A wide variety of software is available to the scientific community, and can be used to identify genomic objects, before predicting their biological functions. However, only a limited number of biologically interesting features can be revealed from an isolated sequence. Comparative genomics tools, on the other hand, by bringing together the information contained in numerous genomes simultaneously, allow annotators to make inferences based on the idea that evolution and natural selection are central to the definition of all biological processes. We have developed the MicroScope platform in order to offer a web-based framework for the systematic and efficient revision of microbial genome annotation and comparative analysis (http://www.genoscope.cns.fr/agc/microscope). Starting with the description of the flow chart of the annotation processes implemented in the MicroScope pipeline, and the development of traditional and novel microbial annotation and comparative analysis tools, this article emphasizes the essential role of expert annotation as a complement of automatic annotation. Several examples illustrate the use of implemented tools for the review and curation of annotations of both new and publicly available microbial genomes within MicroScope's rich integrated genome framework. The platform is used as a viewer in order to browse updated annotation information of available microbial genomes (more than 440 organisms to date), and in the context of new annotation projects (117 bacterial genomes). The human expertise gathered in the MicroScope database (about 280,000 independent annotations) contributes to improve the quality of
Ebolavirus comparative genomics

DEFF Research Database (Denmark)

Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat

2015-01-01

The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms...
75 FR 81625 - Government-Owned Inventions; Availability for Licensing

Science.gov (United States)

2010-12-28

... of Federally-funded research and development. Foreign patent applications are filed on selected... of inflammation and improves blood vessel relaxation, lipid cholesterol profiles, and [email protected] . Collaborative Research Opportunity: The NIH Chemical Genomics Center (NCGC) is seeking...
Using Drosophila melanogaster as a model for genotoxic chemical mutational studies with a new program, SnpSift

Directory of Open Access Journals (Sweden)

Douglas Mark Ruden

2012-03-01

Full Text Available This paper describes a new program SnpSift for filtering differential DNA sequence variants between two or more experimental genomes after genotoxic chemical exposure. Here, we illustrate how SnpSift can be used to identify candidate phenotype-relevant variants including single nucleotide polymorphisms (SNPs), multiple nucleotide polymorphisms (MNPs), insertions and deletions (InDels) in mutant strains isolated from genome-wide chemical mutagenesis of Drosophila melanogaster. First, the genomes of two independently-isolated mutant fly strains that are allelic for a novel recessive male-sterile locus generated by genotoxic chemical exposure were sequenced using the Illumina next-generation DNA sequencer to obtain 20- to 29-fold coverage of the euchromatic sequences. The sequencing reads were processed and variants were called using standard bioinformatic tools. Next, SnpEff was used to annotate all sequence variants and their potential mutational effects on associated genes. Then, SnpSift was used to filter and select differential variants that potentially disrupt a common gene in the two allelic mutant strains. The potential causative DNA lesions were partially validated by capillary sequencing of PCR-amplified DNA in the genetic interval as defined by meiotic mapping and deletions that remove defined regions of the chromosome. Of the five candidate genes located in the genetic interval, the Pka-like gene CG12069 was found to carry a separate premature stop codon mutation in each of the two allelic mutants whereas the other 4 candidate genes within the interval have wild-type sequences. The Pka-like gene is therefore a strong candidate gene for the male-sterile locus. These results demonstrate that combining SnpEff and SnpSift can expedite the identification of candidate phenotype-causative mutations in chemically-mutagenized Drosophila strains. This technique can also be used to characterize the variety of mutations generated by genotoxic
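The filtering step described in this abstract (keep only genes in the mapped interval that carry a potentially disruptive variant in each allelic mutant strain) can be sketched in a few lines of Python. This is an illustration of the logic only, not SnpSift's implementation or command syntax. The record layout, the effect labels in DISRUPTIVE, the strain names and the gene names other than CG12069 are assumptions made for the example.

```python
from collections import defaultdict

# Assumed effect labels treated as gene-disrupting for this illustration.
DISRUPTIVE = {"stop_gained", "frameshift_variant", "splice_donor_variant"}


def candidate_genes(variants_by_strain, interval_genes):
    """Return genes in the interval with a disruptive variant in every strain."""
    strains_hit = defaultdict(set)
    for strain, variants in variants_by_strain.items():
        for variant in variants:
            if variant["gene"] in interval_genes and variant["effect"] in DISRUPTIVE:
                strains_hit[variant["gene"]].add(strain)
    all_strains = set(variants_by_strain)
    return sorted(g for g, hit in strains_hit.items() if hit == all_strains)


# Hypothetical annotated variants for two allelic mutant strains.
example_variants = {
    "mutant_1": [
        {"gene": "CG12069", "pos": 1001, "effect": "stop_gained"},
        {"gene": "geneA", "pos": 2050, "effect": "synonymous_variant"},
    ],
    "mutant_2": [
        {"gene": "CG12069", "pos": 1342, "effect": "stop_gained"},
        {"gene": "geneB", "pos": 3077, "effect": "stop_gained"},
    ],
}
example_interval = {"CG12069", "geneA", "geneB"}
example_result = candidate_genes(example_variants, example_interval)
```

Requiring a hit in every strain, rather than an identical variant, mirrors the abstract's finding of a separate premature stop codon in each of the two mutants.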
Comparative genomic analysis of multiple strains of two unusual plant pathogens: Pseudomonas corrugata and Pseudomonas mediterranea

Directory of Open Access Journals (Sweden)

Emmanouil A Trantas

2015-08-01

Full Text Available The non-fluorescent pseudomonads, Pseudomonas corrugata (Pcor) and P. mediterranea (Pmed), are closely related species that cause pith necrosis, a disease of tomato that causes severe crop losses. However, they also show strong antagonistic effects against economically important pathogens, demonstrating their potential for utilization as biological control agents. In addition, their metabolic versatility makes them attractive for the production of commercial biomolecules and bioremediation. An extensive comparative genomics study is required to dissect the mechanisms that Pcor and Pmed employ to cause disease, prevent disease caused by other pathogens, and to mine their genomes for commercially significant chemical pathways. Here, we present the draft genomes of nine Pcor and Pmed strains from different geographical locations. This analysis covered significant genetic heterogeneity and allowed in-depth genomic comparison. All examined strains were able to trigger symptoms in tomato plants but not all induced a hypersensitive-like response in Nicotiana benthamiana. Genome-mining revealed the absence of a type III secretion system and of known type III effectors from all examined Pcor and Pmed strains. The lack of a type III secretion system appears to be unique among the plant pathogenic pseudomonads. Several gene clusters coding for type VI secretion system were detected in all genomes.
A novel genomic alteration of LSAMP associates with aggressive prostate cancer in African American men

Directory of Open Access Journals (Sweden)

Gyorgy Petrovics

2015-12-01

Full Text Available Evaluation of cancer genomes in global context is of great interest in light of changing ethnic distribution of the world population. We focused our study on men of African ancestry because of their disproportionately higher rate of prostate cancer (CaP) incidence and mortality. We present systematic whole genome analyses, revealing alterations that differentiate African American (AA) and Caucasian American (CA) CaP genomes. We discovered a recurrent deletion on chromosome 3q13.31 centering on the LSAMP locus that was prevalent in tumors from AA men (cumulative analyses of 435 patients: whole genome sequence, 14; FISH evaluations, 101; and SNP array, 320 patients). Notably, carriers of this deletion experienced more rapid disease progression. In contrast, PTEN and ERG common driver alterations in CaP were significantly lower in AA prostate tumors compared to prostate tumors from CA. Moreover, the frequency of inter-chromosomal rearrangements was significantly higher in AA than CA tumors. These findings reveal differentially distributed somatic mutations in CaP across ancestral groups, which have implications for precision medicine strategies.
SIGMA: A System for Integrative Genomic Microarray Analysis of Cancer Genomes

Directory of Open Access Journals (Sweden)

Davies Jonathan J

2006-12-01

Full Text Available Abstract Background The prevalence of high resolution profiling of genomes has created a need for the integrative analysis of information generated from multiple methodologies and platforms. Although the majority of data in the public domain are gene expression profiles, and expression analysis software is available, the increase of array CGH studies has enabled integration of high throughput genomic and gene expression datasets. However, tools for direct mining and analysis of array CGH data are limited. Hence, there is a great need for analytical and display software tailored to cross platform integrative analysis of cancer genomes. Results We have created a user-friendly Java application to facilitate sophisticated visualization and analysis such as cross-tumor and cross-platform comparisons. To demonstrate the utility of this software, we assembled array CGH data representing Affymetrix SNP chip, Stanford cDNA arrays and whole genome tiling path array platforms for cross comparison. This cancer genome database contains 267 profiles from commonly used cancer cell lines representing 14 different tissue types. Conclusion In this study we have developed an application for the visualization and analysis of data from high resolution array CGH platforms that can be adapted for analysis of multiple types of high throughput genomic datasets. Furthermore, we invite researchers using array CGH technology to deposit both their raw and processed data, as this will be a continually expanding database of cancer genomes. This publicly available resource, the System for Integrative Genomic Microarray Analysis (SIGMA) of cancer genomes, can be accessed at http://sigma.bccrc.ca.
Genome-wide analysis of Tol2 transposon reintegration in zebrafish

Directory of Open Access Journals (Sweden)

Parinov Sergey

2009-09-01

Full Text Available Abstract Background Tol2, a member of the hAT family of transposons, has become a useful tool for genetic manipulation of model animals, but information about its interactions with vertebrate genomes is still limited. Furthermore, published reports on Tol2 have mainly been based on random integration of the transposon system after co-injection of a plasmid DNA harboring the transposon and a transposase mRNA. It is important to understand how Tol2 would behave upon activation after integration into the genome. Results We performed a large-scale enhancer trap (ET) screen and generated 338 insertions of the Tol2 transposon-based ET cassette into the zebrafish genome. These insertions were generated by remobilizing the transposon from two different donor sites in two transgenic lines. We found that 39% of Tol2 insertions occurred in transcription units, mostly into introns. Analysis of the transposon target sites revealed no strict specificity at the DNA sequence level. However, Tol2 was prone to target AT-rich regions with weak palindromic consensus sequences centered at the insertion site. Conclusion Our systematic analysis of sequential remobilizations of the Tol2 transposon from two independent sites within a vertebrate genome has revealed properties such as a tendency to integrate into transcription units and into AT-rich palindrome-like sequences. This information will influence the development of various applications involving DNA transposons and Tol2 in particular.
Genomics With Cloud Computing

OpenAIRE

Sukhamrit Kaur; Sandeep Kaur

2015-01-01

Abstract Genomics is the study of genomes, which produces large amounts of data requiring substantial storage and computing power. Cloud computing addresses these needs by providing various cloud platforms for genomics. These platforms offer users many services, such as easy access to data, easy sharing and transfer, storage in the hundreds of terabytes, and more computational power. Some cloud platforms are Google Genomics, DNAnexus and Globus Genomics. Various features of cloud computin...
Genomics technologies to study structural variations in the grapevine genome

Directory of Open Access Journals (Sweden)

Cardone Maria Francesca

2016-01-01

Full Text Available Grapevine is one of the most important crop plants in the world. Recently there has been a great expansion of genomics resources for the grapevine genome, supporting increased efforts in molecular breeding. Current cultivars display a great level of inter-specific differentiation that needs to be investigated to reach a comprehensive understanding of the genetic basis of phenotypic differences, and to find responsible genes selected by cross breeding programs. While there have been significant advances in resolving the pattern and nature of single nucleotide polymorphisms (SNPs) on plant genomes, few data are available on copy number variation (CNV). Furthermore, association between structural variations and phenotypes has been described in only a few cases. We combined high throughput biotechnologies and bioinformatics tools to reveal the first inter-varietal atlas of structural variation (SV) for the grapevine genome. We sequenced and compared four table grape cultivars with the Pinot noir inbred line PN40024 genome as the reference. We detected roughly 8% of the grapevine genome affected by genomic variations. Taking into account the phenotypic differences among the studied varieties, we compared SVs among them and the reference, and then performed an in-depth analysis of the gene content of polymorphic regions. This allowed us to identify genes showing differences in copy number as putative functional candidates for important traits in grapevine cultivation.
Journal of Chemical Sciences | Indian Academy of Sciences

Indian Academy of Sciences (India)

Home; Journals; Journal of Chemical Sciences; Volume 119; Issue 5 ... We present here results of ab-initio studies of structures and interaction energies of ... Center for Computational Natural Sciences and Bioinformatics, International Institute ...
Journal of Chemical Sciences | Indian Academy of Sciences

Indian Academy of Sciences (India)

1 V Ferretti2. Department of Chemistry and Centre of Advanced Studies in Chemistry, Panjab University, Chandigarh 160 014, India; Center for Structural Diffractometry and Department of Chemical and Pharmaceutical Sciences, University of ...

The genome portal of the Department of Energy Joint Genome Institute: 2014 updates

Energy Technology Data Exchange (ETDEWEB)

Nordberg, Henrik [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Cantor, Michael [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Dusheyko, Serge [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Hua, Susan [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Poliakov, Alexander [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Shabalov, Igor [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Smirnova, Tatyana [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Grigoriev, Igor V. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Dubchak, Inna [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States)

2013-11-12

The U.S. Department of Energy (DOE) Joint Genome Institute (JGI), a national user facility, serves the diverse scientific community by providing integrated high-throughput sequencing and computational analysis to enable system-based scientific approaches in support of DOE missions related to clean energy generation and environmental characterization. The JGI Genome Portal (http://genome.jgi.doe.gov) provides unified access to all JGI genomic databases and analytical tools. The JGI maintains extensive data management systems and specialized analytical capabilities to manage and interpret complex genomic data. A user can search, download and explore multiple data sets available for all DOE JGI sequencing projects including their status, assemblies and annotations of sequenced genomes. In this paper, we describe major updates of the Genome Portal in the past 2 years with a specific emphasis on efficient handling of the rapidly growing amount of diverse genomic data accumulated in JGI.
Herbarium genomics

DEFF Research Database (Denmark)

Bakker, Freek T.; Lei, Di; Yu, Jiaying

2016-01-01

Herbarium genomics is proving promising as next-generation sequencing approaches are well suited to deal with the usually fragmented nature of archival DNA. We show that routine assembly of partial plastome sequences from herbarium specimens is feasible, from total DNA extracts and with specimens...... up to 146 years old. We use genome skimming and an automated assembly pipeline, Iterative Organelle Genome Assembly, that assembles paired-end reads into a series of candidate assemblies, the best one of which is selected based on likelihood estimation. We used 93 specimens from 12 different...... correlation between plastome coverage and nuclear genome size (C value) in our samples, but the range of C values included is limited. Finally, we conclude that routine plastome sequencing from herbarium specimens is feasible and cost-effective (compared with Sanger sequencing or plastome...
New Vistas in Chemical Product and Process Design

DEFF Research Database (Denmark)

Zhang, Lei; Babi, Deenesh Kavi; Gani, Rafiqul

2016-01-01

Design of chemicals-based products is broadly classified into those that are process centered and those that are product centered. In this article, the designs of both classes of products are reviewed from a process systems point of view; developments related to the design of the chemical product......, its corresponding process, and its integration are highlighted. Although significant advances have been made in the development of systematic model-based techniques for process design (also for optimization, operation, and control), much work is needed to reach the same level for product design....... Timeline diagrams illustrating key contributions in product design, process design, and integrated product-process design are presented. The search for novel, innovative, and sustainable solutions must be matched by consideration of issues related to the multidisciplinary nature of problems, the lack...
Recognizing genes and other components of genomic structure

Energy Technology Data Exchange (ETDEWEB)

Burks, C. (Los Alamos National Lab., NM (USA)); Myers, E. (Arizona Univ., Tucson, AZ (USA). Dept. of Computer Science); Stormo, G.D. (Colorado Univ., Boulder, CO (USA). Dept. of Molecular, Cellular and Developmental Biology)

1991-01-01

The Aspen Center for Physics (ACP) sponsored a three-week workshop, with 26 scientists participating, from 28 May to 15 June, 1990. The workshop, entitled Recognizing Genes and Other Components of Genomic Structure, focussed on discussion of current needs and future strategies for developing the ability to identify and predict the presence of complex functional units on sequenced, but otherwise uncharacterized, genomic DNA. We addressed the need for computationally-based, automatic tools for synthesizing available data about individual consensus sequences and local compositional patterns into the composite objects (e.g., genes) that are -- as composite entities -- the true object of interest when scanning DNA sequences. The workshop was structured to promote sustained informal contact and exchange of expertise between molecular biologists, computer scientists, and mathematicians. No participant stayed for less than one week, and most attended for two or three weeks. Computers, software, and databases were available for use as "electronic blackboards" and as the basis for collaborative exploration of ideas being discussed and developed at the workshop. 23 refs., 2 tabs.
Effects of sample treatments on genome recovery via single-cell genomics

Energy Technology Data Exchange (ETDEWEB)

Clingenpeel, Scott [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Schwientek, Patrick [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Hugenholtz, Philip [Univ. of Queensland, Brisbane (Australia); Woyke, Tanja [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States)

2014-06-13

It is known that single-cell genomics is a powerful tool for accessing genetic information from uncultivated microorganisms. Methods of handling samples before single-cell genomic amplification may affect the quality of the genomes obtained. Using three bacterial strains we demonstrate that, compared to cryopreservation, lower-quality single-cell genomes are recovered when the sample is preserved in ethanol or if the sample undergoes fluorescence in situ hybridization, while sample preservation in paraformaldehyde renders it completely unsuitable for sequencing.
Developing Navy Capability to Recover Forces in Chemical, Biological, and Radiological Hazard Environments

Science.gov (United States)

2013-01-01

damage control; LHD flight deck and well deck operations; fleet surgical team; Afloat Training Group; Assault Craft Unit; Naval Surface Warfare Center ...Biological, Radiological and Nuclear School, and U.S. Army Edgewood Chemical Biological Center , Guidelines for Mass Casualty Decontamination During a HAZMAT...Policy Center of the RAND National Defense Research Institute, a federally funded research and development center sponsored by OSD, the Joint Staff
Efficient coupling of a single diamond color center to propagating plasmonic gap modes

DEFF Research Database (Denmark)

Kumar, Shailesh; Huck, Alexander; Andersen, Ulrik L

2013-01-01

We report on coupling of a single nitrogen-vacancy (NV) center in a nanodiamond to the propagating gap mode of two parallel placed chemically grown silver nanowires. The coupled NV-center nanowire system is made by manipulating nanodiamonds and nanowires with the tip of an atomic force microscope...
An integrative omics perspective for the analysis of chemical signals in ecological interactions.

Science.gov (United States)

Brunetti, A E; Carnevale Neto, F; Vera, M C; Taboada, C; Pavarini, D P; Bauermeister, A; Lopes, N P

2018-03-05

All living organisms emit, detect, and respond to chemical stimuli, thus creating an almost limitless number of interactions by means of chemical signals. Technological and intellectual advances in the last two decades have enabled chemical signals analyses at several molecular levels, including gene expression, molecular diversity, and receptor affinity. These advances have also deepened our understanding of nature to encompass interactions at multiple organism levels across different taxa. This tutorial review describes the most recent analytical developments in 'omics' technologies (i.e., genomics, transcriptomics, proteomics, and metabolomics) and provides recent examples of their application in studies of chemical signals. We highlight how studies have integrated an enormous amount of information generated from different omics disciplines into one publicly available platform. In addition, we stress the importance of considering different signal modalities and an evolutionary perspective to establish a comprehensive understanding of chemical communication.
Informational laws of genome structures

Science.gov (United States)

Bonnici, Vincenzo; Manca, Vincenzo

2016-06-01

In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = lg2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined.
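As a rough illustration of the quantities this abstract refers to, the sketch below picks k from the genome length and computes the Shannon entropy of the empirical k-mer distribution. It assumes that lg2 denotes the base-2 logarithm and that k is rounded to the nearest integer. The function names are hypothetical, and the code does not reproduce the specific informational indexes defined in the paper.

```python
import math
from collections import Counter


def choose_k(genome_length):
    """Pick k as log2 of the genome length, rounded (rounding is an assumption)."""
    return max(1, round(math.log2(genome_length)))


def kmer_entropy(genome, k):
    """Shannon entropy, in bits, of the empirical k-mer distribution."""
    counts = Counter(genome[i:i + k] for i in range(len(genome) - k + 1))
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Toy sequence; real genomes are millions of bases long.
toy_genome = "ACGTACGTTTGACCA" * 4
toy_k = choose_k(len(toy_genome))
toy_entropy = kmer_entropy(toy_genome, toy_k)
```

The entropy is bounded above by log2 of the number of k-mer positions, which is the kind of reference value that a comparison with random sequences of the same length would approach.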
Center for Advanced Space Propulsion Second Annual Technical Symposium Proceedings

Science.gov (United States)

1990-01-01

The proceedings for the Center for Advanced Space Propulsion Second Annual Technical Symposium are divided as follows: Chemical Propulsion, CFD; Space Propulsion; Electric Propulsion; Artificial Intelligence; Low-G Fluid Management; and Rocket Engine Materials.
How genome complexity can explain the difficulty of aligning reads to genomes.

Science.gov (United States)

Phan, Vinhthuy; Gao, Shanshan; Tran, Quang; Vo, Nam S

2015-01-01

Although it is frequently observed that aligning short reads to genomes becomes harder if they contain complex repeat patterns, there has not been much effort to quantify the relationship between complexity of genomes and difficulty of short-read alignment. Existing measures of sequence complexity seem unsuitable for the understanding and quantification of this relationship. We investigated several measures of complexity and found that length-sensitive measures of complexity had the highest correlation to accuracy of alignment. In particular, the rate of distinct substrings of length k, where k is similar to the read length, correlated very highly to alignment performance in terms of precision and recall. We showed how to compute this measure efficiently in linear time, making it useful in practice to estimate quickly the difficulty of alignment for new genomes without having to align reads to them first. We showed how the length-sensitive measures could provide additional information for choosing aligners that would align consistently accurately on new genomes. We formally established a connection between genome complexity and the accuracy of short-read aligners. The relationship between genome complexity and alignment accuracy provides additional useful information for selecting suitable aligners for new genomes. Further, this work suggests that the complexity of genomes sometimes should be thought of in terms of specific computational problems, such as the alignment of short reads to genomes.
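The length-sensitive measure mentioned above, the rate of distinct substrings of length k, can be illustrated with a short Python sketch. The normalization used here (distinct k-mers divided by the number of k-mer positions) is an assumption, and the set-based approach is a simple stand-in rather than the linear-time method the abstract refers to. The function name is hypothetical.

```python
def distinct_substring_rate(genome, k):
    """Fraction of length-k windows in the genome that are distinct substrings."""
    windows = len(genome) - k + 1
    if k <= 0 or windows <= 0:
        raise ValueError("k must be between 1 and the genome length")
    # Collect every length-k substring; duplicates collapse in the set.
    distinct = {genome[i:i + k] for i in range(windows)}
    return len(distinct) / windows


# A periodic sequence repeats its k-mers; a homopolymer has a single k-mer.
periodic_rate = distinct_substring_rate("ACGTACGT", 4)
homopolymer_rate = distinct_substring_rate("AAAAAAAA", 4)
```

Under this definition, a value near 1 means almost every window of read length is unique, while low values indicate repeats, which the abstract associates with harder short-read alignment.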
Phytozome Comparative Plant Genomics Portal

Energy Technology Data Exchange (ETDEWEB)

Goodstein, David; Batra, Sajeev; Carlson, Joseph; Hayes, Richard; Phillips, Jeremy; Shu, Shengqiang; Schmutz, Jeremy; Rokhsar, Daniel

2014-09-09

The Dept. of Energy Joint Genome Institute is a genomics user facility supporting DOE mission science in the areas of Bioenergy, Carbon Cycling, and Biogeochemistry. The Plant Program at the JGI applies genomic, analytical, computational and informatics platforms and methods to: (1) understand and accelerate the improvement (domestication) of bioenergy crops; (2) characterize and moderate plant response to climate change; (3) use comparative genomics to identify constrained elements and infer gene function; (4) build high quality genomic resource platforms of JGI Plant Flagship genomes for functional and experimental work; (5) expand functional genomic resources for Plant Flagship genomes.
A Taste of Algal Genomes from the Joint Genome Institute

Energy Technology Data Exchange (ETDEWEB)

Kuo, Alan; Grigoriev, Igor

2012-06-17

Algae play profound roles in aquatic food chains and the carbon cycle, can impose health and economic costs through toxic blooms, provide models for the study of symbiosis, photosynthesis, and eukaryotic evolution, and are candidate sources for bio-fuels; all of these research areas are part of the mission of DOE's Joint Genome Institute (JGI). To date JGI has sequenced, assembled, annotated, and released to the public the genomes of 18 species and strains of algae, sampling almost all of the major clades of photosynthetic eukaryotes. With more algal genomes currently undergoing analysis, JGI continues its commitment to driving forward basic and applied algal science. Among these ongoing projects are the pan-genome of the dominant coccolithophore Emiliania huxleyi, the interrelationships between the 4 genomes in the nucleomorph-containing Bigelowiella natans and Guillardia theta, and the search for symbiosis genes of lichens.
Structural changes in amorphous organic compounds and their role during chemical transformations

International Nuclear Information System (INIS)

Gusakovskaya, I.G.

1994-01-01

Using butanediol vinylacetate and dimethacrylate as examples, it is shown that structural changes of the amorphous-liquid substance play an important part in chemical transformations of amorphous compounds, and that the chemical reaction rate is a function of local order. Treating the amorphous polymer as a system of multiple transformations, each of which gives rise to a definite local order, the recombination reaction of active centers accumulated during irradiation of the polymer at 77 K is calculated. The concentration of recombined centers rises steeply near each transformation T_k.
A plant-based chemical genomics screen for the identification of flowering inducers

NARCIS (Netherlands)

Fiers, Martijn; Hoogenboom, Jorin; Brunazzi, Alice; Wennekes, Tom; Angenent, Gerco C; Immink, Richard G H

2017-01-01

BACKGROUND: Floral timing is a carefully regulated process, in which the plant determines the optimal moment to switch from the vegetative to reproductive phase. While there are numerous genes known that control flowering time, little information is available on chemical compounds that are able to
Exploiting Chemical Libraries, Structure, and Genomics in the Search for Kinase Inhibitors

NARCIS (Netherlands)

Gray, Nathanael S.; Wodicka, Lisa; Thunnissen, Andy-Mark W.H.; Norman, Thea C.; Kwon, Soojin; Espinoza, F. Hernan; Morgan, David O.; Barnes, Georjana; LeClerc, Sophie; Meijer, Laurent; Kim, Sung-Hou; Lockhart, David J.; Schultz, Peter G.

1998-01-01

Selective protein kinase inhibitors were developed on the basis of the unexpected binding mode of 2,6,9-trisubstituted purines to the adenosine triphosphate-binding site of the human cyclin-dependent kinase 2 (CDK2). By iterating chemical library synthesis and biological screening, potent inhibitors
PGSB/MIPS Plant Genome Information Resources and Concepts for the Analysis of Complex Grass Genomes.

Science.gov (United States)

Spannagl, Manuel; Bader, Kai; Pfeifer, Matthias; Nussbaumer, Thomas; Mayer, Klaus F X

2016-01-01

PGSB (Plant Genome and Systems Biology; formerly MIPS-Munich Institute for Protein Sequences) has been involved in developing, implementing and maintaining plant genome databases for more than a decade. Genome databases and analysis resources have focused on individual genomes and aim to provide flexible and maintainable datasets for model plant genomes as a backbone against which experimental data, e.g., from high-throughput functional genomics, can be organized and analyzed. In addition, genomes from both model and crop plants form a scaffold for comparative genomics, assisted by specialized tools such as the CrowsNest viewer to explore conserved gene order (synteny) between related species on macro- and micro-levels. The genomes of many economically important Triticeae plants such as wheat, barley, and rye present a great challenge for sequence assembly and bioinformatic analysis due to their enormous complexity and large genome size. Novel concepts and strategies have been developed to deal with these difficulties and have been applied to the genomes of wheat, barley, rye, and other cereals. This includes the GenomeZipper concept, reference-guided exome assembly, and "chromosome genomics" based on flow cytometry sorted chromosomes.
Genomic treasure troves: complete genome sequencing of herbarium and insect museum specimens.

Science.gov (United States)

Staats, Martijn; Erkens, Roy H J; van de Vossenberg, Bart; Wieringa, Jan J; Kraaijeveld, Ken; Stielow, Benjamin; Geml, József; Richardson, James E; Bakker, Freek T

2013-01-01

Unlocking the vast genomic diversity stored in natural history collections would create unprecedented opportunities for genome-scale evolutionary, phylogenetic, domestication and population genomic studies. Many researchers have been discouraged from using historical specimens in molecular studies because of both generally limited success of DNA extraction and the challenges associated with PCR-amplifying highly degraded DNA. In today's next-generation sequencing (NGS) world, opportunities and prospects for historical DNA have changed dramatically, as most NGS methods are actually designed for taking short fragmented DNA molecules as templates. Here we show that using a standard multiplex and paired-end Illumina sequencing approach, genome-scale sequence data can be generated reliably from dry-preserved plant, fungal and insect specimens collected up to 115 years ago, and with minimal destructive sampling. Using a reference-based assembly approach, we were able to produce the entire nuclear genome of a 43-year-old Arabidopsis thaliana (Brassicaceae) herbarium specimen with high and uniform sequence coverage. Nuclear genome sequences of three fungal specimens of 22-82 years of age (Agaricus bisporus, Laccaria bicolor, Pleurotus ostreatus) were generated with 81.4-97.9% exome coverage. Complete organellar genome sequences were assembled for all specimens. Using de novo assembly we retrieved between 16.2-71.0% of coding sequence regions, and hence remain somewhat cautious about prospects for de novo genome assembly from historical specimens. Non-target sequence contaminations were observed in 2 of our insect museum specimens. We anticipate that future museum genomics projects will perhaps not generate entire genome sequences in all cases (our specimens contained relatively small and low-complexity genomes), but at least generating vital comparative genomic data for testing (phylo)genetic, demographic and genetic hypotheses, that become increasingly more horizontal
Next-Generation Genomics Facility at C-CAMP: Accelerating Genomic Research in India

Science.gov (United States)

S, Chandana; Russiachand, Heikham; H, Pradeep; S, Shilpa; M, Ashwini; S, Sahana; B, Jayanth; Atla, Goutham; Jain, Smita; Arunkumar, Nandini; Gowda, Malali

2014-01-01

Next-Generation Sequencing (NGS; http://www.genome.gov/12513162) is a recent life-sciences technological revolution that allows scientists to decode genomes or transcriptomes at a much faster rate and at a lower cost. Genomics-based studies are proceeding at a relatively slow pace in India due to the non-availability of genomics experts, trained personnel and dedicated service providers. NGS offers a lot of potential to study India's national diversity (of all kinds). We at the Centre for Cellular and Molecular Platforms (C-CAMP) have launched the Next Generation Genomics Facility (NGGF) to provide genomics services to scientists, to train researchers and also to work on national and international genomic projects. We have a HiSeq1000 from Illumina and a GS-FLX Plus from Roche 454. The long reads from the GS-FLX Plus and the high sequence depth from the HiSeq1000 together form an ideal hybrid approach for de novo sequencing and re-sequencing of genomes and transcriptomes. At our facility, we have sequenced around 70 different organisms comprising more than 388 genomes and 615 transcriptomes – prokaryotes and eukaryotes (fungi, plants and animals). In addition, we have optimized other unique applications such as small RNA (miRNA, siRNA etc.), long Mate-pair sequencing (2 to 20 Kb), Coding sequences (Exome), Methylome (ChIP-Seq), Restriction Mapping (RAD-Seq), Human Leukocyte Antigen (HLA) typing, mixed genomes (metagenomes) and target amplicons, etc. Translating DNA sequence data from an NGS sequencer into meaningful information is an important exercise. Under NGGF, we have bioinformatics experts and high-end computing resources to dissect NGS data, covering genome assembly and annotation, gene expression, target enrichment, variant calling (SSR or SNP), comparative analysis etc. Our services (sequencing and bioinformatics) have been utilized by more than 45 organizations (academia and industry) both within India and outside, resulting in several publications in peer-reviewed journals and several genomic
Extreme genomes

OpenAIRE

DeLong, Edward F

2000-01-01

The complete genome sequence of Thermoplasma acidophilum, an acid- and heat-loving archaeon, has recently been reported. Comparative genomic analysis of this 'extremophile' is providing new insights into the metabolic machinery, ecology and evolution of thermophilic archaea.

The SGC beyond structural genomics: redefining the role of 3D structures by coupling genomic stratification with fragment-based discovery.

Science.gov (United States)

Bradley, Anthony R; Echalier, Aude; Fairhead, Michael; Strain-Damerell, Claire; Brennan, Paul; Bullock, Alex N; Burgess-Brown, Nicola A; Carpenter, Elisabeth P; Gileadi, Opher; Marsden, Brian D; Lee, Wen Hwa; Yue, Wyatt; Bountra, Chas; von Delft, Frank

2017-11-08

The ongoing explosion in genomics data has long since outpaced the capacity of conventional biochemical methodology to verify the large number of hypotheses that emerge from the analysis of such data. In contrast, it is still a gold-standard for early phenotypic validation towards small-molecule drug discovery to use probe molecules (or tool compounds), notwithstanding the difficulty and cost of generating them. Rational structure-based approaches to ligand discovery have long promised the efficiencies needed to close this divergence; in practice, however, this promise remains largely unfulfilled, for a host of well-rehearsed reasons and despite the huge technical advances spearheaded by the structural genomics initiatives of the noughties. Therefore the current, fourth funding phase of the Structural Genomics Consortium (SGC), building on its extensive experience in structural biology of novel targets and design of protein inhibitors, seeks to redefine what it means to do structural biology for drug discovery. We developed the concept of a Target Enabling Package (TEP) that provides, through reagents, assays and data, the missing link between genetic disease linkage and the development of usefully potent compounds. There are multiple prongs to the ambition: rigorously assessing targets' genetic disease linkages through crowdsourcing to a network of collaborating experts; establishing a systematic approach to generate the protocols and data that comprise each target's TEP; developing new, X-ray-based fragment technologies for generating high quality chemical matter quickly and cheaply; and exploiting a stringently open access model to build multidisciplinary partnerships throughout academia and industry. By learning how to scale these approaches, the SGC aims to make structures finally serve genomics, as originally intended, and demonstrate how 3D structures systematically allow new modes of druggability to be discovered for whole classes of targets. © 2017 The
GenomePeek—an online tool for prokaryotic genome and metagenome analysis

Directory of Open Access Journals (Sweden)

Katelyn McNair

2015-06-01

As more and more prokaryotic sequencing takes place, a method to quickly and accurately analyze these data is needed. Previous tools are mainly designed for metagenomic analysis and have limitations, such as long runtimes and significant false positive error rates. The online tool GenomePeek (edwards.sdsu.edu/GenomePeek) was developed to analyze both single genome and metagenome sequencing files, quickly and with low error rates. GenomePeek uses a sequence assembly approach in which reads matching a set of conserved genes are extracted, assembled and then aligned against a highly specific reference database. GenomePeek was found to be faster than traditional approaches while still keeping error rates low, as well as offering unique data visualization options.
Amineborane Based Chemical Hydrogen Storage - Final Report

International Nuclear Information System (INIS)

Sneddon, Larry G.

2011-01-01

The development of efficient and safe methods for hydrogen storage is a major hurdle that must be overcome to enable the use of hydrogen as an alternative energy carrier. The objectives of this project in the DOE Center of Excellence in Chemical Hydride Storage were both to develop new methods for on-demand, low temperature hydrogen release from chemical hydrides and to design high-conversion off-board methods for chemical hydride regeneration. Because of their reactive protic (N-H) and hydridic (B-H) hydrogens and high hydrogen contents, amineboranes such as ammonia borane, NH3BH3 (AB), 19.6-wt% H2, and ammonia triborane NH3B3H7 (AT), 17.7-wt% H2, were initially identified by the Center as promising, high-capacity chemical hydrogen storage materials with the potential to store and deliver molecular hydrogen through dehydrogenation and hydrolysis reactions. In collaboration with other Center partners, the Penn project focused both on new methods to induce amineborane H2-release and on new strategies for the regeneration of the amineborane spent-fuel materials. The Penn approach to improving amineborane H2-release focused on the use of ionic liquids, base additives and metal catalysts to activate AB dehydrogenation and these studies successfully demonstrated that in ionic liquids the AB induction period that had been observed in the solid-state was eliminated and both the rate and extent of AB H2-release were significantly increased. These results have clearly shown that, while improvements are still necessary, many of these systems have the potential to achieve DOE hydrogen-storage goals. The high extent of their H2-release, the tunability of both their H2 materials weight-percents and release rates, and their product control that is attained by either trapping or suppressing unwanted volatile side products, such as borazine, continue to make AB/ionic-liquid based systems attractive candidates for chemical hydrogen storage applications. These
Amineborane Based Chemical Hydrogen Storage - Final Report

Energy Technology Data Exchange (ETDEWEB)

Sneddon, Larry G.

2011-04-21

The development of efficient and safe methods for hydrogen storage is a major hurdle that must be overcome to enable the use of hydrogen as an alternative energy carrier. The objectives of this project in the DOE Center of Excellence in Chemical Hydride Storage were both to develop new methods for on-demand, low temperature hydrogen release from chemical hydrides and to design high-conversion off-board methods for chemical hydride regeneration. Because of their reactive protic (N-H) and hydridic (B-H) hydrogens and high hydrogen contents, amineboranes such as ammonia borane, NH3BH3 (AB), 19.6-wt% H2, and ammonia triborane NH3B3H7 (AT), 17.7-wt% H2, were initially identified by the Center as promising, high-capacity chemical hydrogen storage materials with the potential to store and deliver molecular hydrogen through dehydrogenation and hydrolysis reactions. In collaboration with other Center partners, the Penn project focused both on new methods to induce amineborane H2-release and on new strategies for the regeneration of the amineborane spent-fuel materials. The Penn approach to improving amineborane H2-release focused on the use of ionic liquids, base additives and metal catalysts to activate AB dehydrogenation and these studies successfully demonstrated that in ionic liquids the AB induction period that had been observed in the solid-state was eliminated and both the rate and extent of AB H2-release were significantly increased. These results have clearly shown that, while improvements are still necessary, many of these systems have the potential to achieve DOE hydrogen-storage goals. The high extent of their H2-release, the tunability of both their H2 materials weight-percents and release rates, and their product control that is attained by either trapping or suppressing unwanted volatile side products, such as borazine, continue to make AB/ionic-liquid based systems attractive candidates for chemical hydrogen storage applications. These studies also
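The hydrogen weight percentages quoted for ammonia borane and ammonia triborane follow from simple mass arithmetic. The sketch below is illustrative only: the function name and the rounded atomic masses are assumptions, not taken from the report, and the results should be read as approximately matching the quoted values, since they depend on the masses and rounding used.

```python
# Rounded standard atomic masses in g/mol (assumed values from common tables).
ATOMIC_MASS = {"H": 1.008, "B": 10.81, "N": 14.007}


def hydrogen_weight_percent(composition):
    """Return the hydrogen mass fraction, in percent, of a compound.

    composition maps element symbols to atom counts, e.g. {"N": 1, "B": 1, "H": 6}.
    """
    total_mass = sum(ATOMIC_MASS[element] * count for element, count in composition.items())
    hydrogen_mass = ATOMIC_MASS["H"] * composition.get("H", 0)
    return 100.0 * hydrogen_mass / total_mass


ammonia_borane = {"N": 1, "B": 1, "H": 6}       # NH3BH3 (AB)
ammonia_triborane = {"N": 1, "B": 3, "H": 10}   # NH3B3H7 (AT)

for name, formula in [("AB", ammonia_borane), ("AT", ammonia_triborane)]:
    print(name, round(hydrogen_weight_percent(formula), 1), "wt% H")
```

This counts every hydrogen atom in the formula, so it gives the theoretical maximum content rather than the amount released in practice.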
Impact of phenotype definition on genome-wide association signals: empirical evaluation in human immunodeficiency virus type 1 infection

DEFF Research Database (Denmark)

Evangelou, Evangelos; Fellay, Jacques; Colombo, Sara

2011-01-01

Discussion on improving the power of genome-wide association studies to identify candidate variants and genes is generally centered on issues of maximizing sample size; less attention is given to the role of phenotype definition and ascertainment. The authors used genome-wide data from patients...... infected with human immunodeficiency virus type 1 (HIV-1) to assess whether differences in type of population (622 seroconverters vs. 636 seroprevalent subjects) or the number of measurements available for defining the phenotype resulted in differences in the effect sizes of associations between single...... available, particularly among seroconverters and for variants that achieved genome-wide significance. Differences in phenotype definition and ascertainment may affect the estimated magnitude of genetic effects and should be considered in optimizing power for discovering new associations....
Improved genome-scale multi-target virtual screening via a novel collaborative filtering approach to cold-start problem.

Science.gov (United States)

Lim, Hansaim; Gray, Paul; Xie, Lei; Poleksic, Aleksandar

2016-12-13

The conventional one-drug-one-gene approach has had limited success in modern drug discovery. Polypharmacology, which focuses on searching for multi-targeted drugs to perturb disease-causing networks instead of designing selective ligands to target individual proteins, has emerged as a new drug discovery paradigm. Although many methods for single-target virtual screening have been developed to improve the efficiency of drug discovery, few of these algorithms are designed for polypharmacology. Here, we present a novel theoretical framework and a corresponding algorithm for genome-scale multi-target virtual screening based on the one-class collaborative filtering technique. Our method overcomes the sparseness of the protein-chemical interaction data by means of interaction matrix weighting and dual regularization from both chemicals and proteins. While the statistical foundation behind our method is general enough to encompass genome-wide drug off-target prediction, the program is specifically tailored to find protein targets for new chemicals with little to no available interaction data. We extensively evaluate our method using a number of the most widely accepted gene-specific and cross-gene family benchmarks and demonstrate that our method outperforms other state-of-the-art algorithms for predicting the interaction of new chemicals with multiple proteins. Thus, the proposed algorithm may provide a powerful tool for multi-target drug design.
GRAbB : Selective Assembly of Genomic Regions, a New Niche for Genomic Research

NARCIS (Netherlands)

Brankovics, Balázs; Zhang, Hao; van Diepeningen, Anne D; van der Lee, Theo A J; Waalwijk, Cees; de Hoog, G Sybren

GRAbB (Genomic Region Assembly by Baiting) is a new program dedicated to assembling specific genomic regions from NGS data. This approach is especially useful when dealing with multicopy regions, such as the mitochondrial genome and the rDNA repeat region, parts of the genome that are often
Comparative Genome Viewer

International Nuclear Information System (INIS)

Molineris, I.; Sales, G.

2009-01-01

The amount of information about genomes, both in the form of complete sequences and annotations, has been exponentially increasing in the last few years. As a result there is the need for tools providing a graphical representation of such information that should be comprehensive and intuitive. Visual representation is especially important in the comparative genomics field since it should provide a combined view of data belonging to different genomes. We believe that existing tools are limited in this respect as they focus on a single genome at a time (conservation histograms) or compress alignment representation to a single dimension. We have therefore developed a web-based tool called Comparative Genome Viewer (Cgv): it integrates a bidimensional representation of alignments between two regions, both at small and big scales, with the richness of annotations present in other genome browsers. We give access to our system through a web-based interface that provides the user with an interactive representation that can be updated in real time using the mouse to move from region to region and to zoom in on interesting details.
Genomics With Cloud Computing

Directory of Open Access Journals (Sweden)

Sukhamrit Kaur

2015-04-01

Genomics is the study of genomes, which produces large amounts of data that require substantial storage and computational power. Cloud computing addresses these needs by providing various cloud platforms for genomics. These platforms provide many services to users, such as easy access to data, easy sharing and transfer, storage in the hundreds of terabytes, and more computational power. Some cloud platforms are Google Genomics, DNAnexus and Globus Genomics. Benefits of cloud computing for genomics include easy access to and sharing of data, data security, and lower resource costs, but there are still drawbacks, such as the long time needed to transfer data and limited network bandwidth.
IMA Genome-F 5G

OpenAIRE

Wingfield, Brenda D.; Barnes, Irene; Wilhelm de Beer, Z.; De Vos, Lieschen; Duong, Tuan A.; Kanzi, Aquillah M.; Naidoo, Kershney; Nguyen, Hai D.T.; Santana, Quentin C.; Sayari, Mohammad; Seifert, Keith A.; Steenkamp, Emma T.; Trollip, Conrad; van der Merwe, Nicolaas A.; van der Nest, Magriet A.

2015-01-01

The genomes of Ceratocystis eucalypticola, Chrysoporthe cubensis, Chrysoporthe deuterocubensis, Davidsoniella virescens, Fusarium temperatum, Graphilbum fragrans, Penicillium nordicum and Thielaviopsis musarum are presented in this genome announcement. These seven genomes are from plant pathogens and otherwise economically important fungal species. The genome sizes range from 28 Mb in the case of T. musarum to 45 Mb for Fusarium temperatum. These genomes include the first reports of genomes f...
Experimental Induction of Genome Chaos.

Science.gov (United States)

Ye, Christine J; Liu, Guo; Heng, Henry H

2018-01-01

Genome chaos, or karyotype chaos, represents a powerful survival strategy for somatic cells under high levels of stress/selection. Since the genome context, not the gene content, encodes the genomic blueprint of the cell, stress-induced rapid and massive reorganization of genome topology functions as a very important mechanism for genome (karyotype) evolution. In recent years, the phenomenon of genome chaos has been confirmed by various sequencing efforts, and many different terms have been coined to describe different subtypes of the chaotic genome including "chromothripsis," "chromoplexy," and "structural mutations." To advance this exciting field, we need an effective experimental system to induce and characterize the karyotype reorganization process. In this chapter, an experimental protocol to induce chaotic genomes is described, following a brief discussion of the mechanism and implication of genome chaos in cancer evolution.
Genome Sequences of Oryza Species

KAUST Repository

Kumagai, Masahiko

2018-02-14

This chapter summarizes recent data obtained from genome sequencing, annotation projects, and studies on the genome diversity of Oryza sativa and related Oryza species. O. sativa, commonly known as Asian rice, is the first monocot species whose complete genome sequence was deciphered based on physical mapping by an international collaborative effort. This genome, along with its accurate and comprehensive annotation, has become an indispensable foundation for crop genomics and breeding. With the development of innovative sequencing technologies, genomic studies of O. sativa have dramatically increased; in particular, a large number of cultivars and wild accessions have been sequenced and compared with the reference rice genome. Since de novo genome sequencing has become cost-effective, the genome of African cultivated rice, O. glaberrima, has also been determined. Comparative genomic studies have highlighted the independent domestication processes of different rice species, but it also turned out that Asian and African rice share a common gene set that has experienced similar artificial selection. An international project aimed at constructing reference genomes and examining the genome diversity of wild Oryza species is currently underway, and the genomes of some species are publicly available. This project provides a platform for investigations such as the evolution, development, polyploidization, and improvement of crops. Studies on the genomic diversity of Oryza species, including wild species, should provide new insights to solve the problem of growing food demands in the face of rapid climatic changes.
Genome Sequences of Oryza Species

KAUST Repository

Kumagai, Masahiko; Tanaka, Tsuyoshi; Ohyanagi, Hajime; Hsing, Yue-Ie C.; Itoh, Takeshi

2018-01-01

This chapter summarizes recent data obtained from genome sequencing, annotation projects, and studies on the genome diversity of Oryza sativa and related Oryza species. O. sativa, commonly known as Asian rice, is the first monocot species whose complete genome sequence was deciphered based on physical mapping by an international collaborative effort. This genome, along with its accurate and comprehensive annotation, has become an indispensable foundation for crop genomics and breeding. With the development of innovative sequencing technologies, genomic studies of O. sativa have dramatically increased; in particular, a large number of cultivars and wild accessions have been sequenced and compared with the reference rice genome. Since de novo genome sequencing has become cost-effective, the genome of African cultivated rice, O. glaberrima, has also been determined. Comparative genomic studies have highlighted the independent domestication processes of different rice species, but it also turned out that Asian and African rice share a common gene set that has experienced similar artificial selection. An international project aimed at constructing reference genomes and examining the genome diversity of wild Oryza species is currently underway, and the genomes of some species are publicly available. This project provides a platform for investigations such as the evolution, development, polyploidization, and improvement of crops. Studies on the genomic diversity of Oryza species, including wild species, should provide new insights to solve the problem of growing food demands in the face of rapid climatic changes.
Wei Xiong | NREL

Science.gov (United States)

-7965 Research Interests Production of fuels and value-added chemicals from photosynthetic , National Renewable Energy Laboratory, Biosciences Center, 2013-present Research Associate, Arizona State Physiology (2015) Illustration of a genome-based metabolic network of a lipid-producing green alga Chlorella
Patient-controlled encrypted genomic data: an approach to advance clinical genomics

Directory of Open Access Journals (Sweden)

Trakadis Yannis J

2012-07-01

Background: The revolution in DNA sequencing technologies over the past decade has made it feasible to sequence an individual’s whole genome at a relatively low cost. The potential value of the information generated by genomic technologies for medicine and society is enormous. However, in order for exome sequencing, and eventually whole genome sequencing, to be implemented clinically, a number of major challenges need to be overcome. For instance, obtaining meaningful informed-consent, managing incidental findings and the great volume of data generated (including multiple findings with uncertain clinical significance), re-interpreting the genomic data and providing additional counselling to patients as genetic knowledge evolves are issues that need to be addressed. It appears that medical genetics is shifting from the present “phenotype-first” medical model to a “data-first” model which leads to multiple complexities. Discussion: This manuscript discusses the different challenges associated with integrating genomic technologies into clinical practice and describes a “phenotype-first” approach, namely, “Individualized Mutation-weighed Phenotype Search”, and its benefits. The proposed approach allows for a more efficient prioritization of the genes to be tested in a clinical lab based on both the patient’s phenotype and his/her entire genomic data. It simplifies “informed-consent” for clinical use of genomic technologies and helps to protect the patient’s autonomy and privacy. Overall, this approach could potentially render widespread use of genomic technologies, in the immediate future, practical, ethical and clinically useful. Summary: The “Individualized Mutation-weighed Phenotype Search” approach allows for an incremental integration of genomic technologies into clinical practice. It ensures that we do not over-medicalize genomic data but, rather, continue our current medical model which is based on serving
EUPAN enables pan-genome studies of a large number of eukaryotic genomes.

Science.gov (United States)

Hu, Zhiqiang; Sun, Chen; Lu, Kuang-Chen; Chu, Xixia; Zhao, Yue; Lu, Jinyuan; Shi, Jianxin; Wei, Chaochun

2017-08-01

Pan-genome analyses are routinely carried out for bacteria to interpret the within-species gene presence/absence variations (PAVs). However, pan-genome analyses are rare for eukaryotes due to the large sizes and higher complexities of their genomes. Here we proposed EUPAN, a eukaryotic pan-genome analysis toolkit, enabling automatic large-scale eukaryotic pan-genome analyses and detection of gene PAVs at a relatively low sequencing depth. In the previous studies, we demonstrated the effectiveness and high accuracy of EUPAN in the pan-genome analysis of 453 rice genomes, in which we also revealed widespread gene PAVs among individual rice genomes. Moreover, EUPAN can be directly applied to the current re-sequencing projects primarily focusing on single nucleotide polymorphisms. EUPAN is implemented in Perl, R and C++. It is supported under Linux and preferred for a computer cluster with an LSF or SLURM job scheduling system. EUPAN together with its standard operating procedure (SOP) is freely available for non-commercial use (CC BY-NC 4.0) at http://cgm.sjtu.edu.cn/eupan/index.html. Contact: ccwei@sjtu.edu.cn or jianxin.shi@sjtu.edu.cn. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
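To make the idea of gene presence/absence variation concrete, the following is a minimal sketch of calling genes present or absent from per-sample coverage and then splitting them into core and distributed sets. It is not EUPAN's implementation: the function names, the toy data and the 0.8 coverage cutoff are all hypothetical choices made for illustration.

```python
def call_presence(coverage, min_fraction=0.8):
    """Convert per-gene coverage fractions into presence/absence calls.

    coverage maps gene -> {sample -> fraction of the gene covered by reads}.
    min_fraction is a hypothetical cutoff, not a value taken from EUPAN.
    """
    return {
        gene: {sample: fraction >= min_fraction for sample, fraction in per_sample.items()}
        for gene, per_sample in coverage.items()
    }


def classify_genes(presence):
    """Split genes into core (present in all samples) and distributed (present in some)."""
    core, distributed = [], []
    for gene, calls in presence.items():
        if all(calls.values()):
            core.append(gene)
        elif any(calls.values()):
            distributed.append(gene)
    return core, distributed


# Toy coverage table for three genes across three samples (made-up numbers).
coverage = {
    "geneA": {"s1": 0.99, "s2": 0.97, "s3": 0.95},
    "geneB": {"s1": 0.98, "s2": 0.10, "s3": 0.91},
    "geneC": {"s1": 0.05, "s2": 0.00, "s3": 0.12},
}

presence = call_presence(coverage)
core, distributed = classify_genes(presence)
print("core:", core)
print("distributed:", distributed)
```

Genes absent from every sample, such as geneC in the toy table, fall into neither list.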
Quantum measurement corrections to CIDNP in photosynthetic reaction centers

International Nuclear Information System (INIS)

Kominis, Iannis K

2013-01-01

Chemically induced dynamic nuclear polarization is a signature of spin order appearing in many photosynthetic reaction centers. Such polarization, significantly enhanced above thermal equilibrium, is known to result from the nuclear spin sorting inherent in the radical pair mechanism underlying long-lived charge-separated states in photosynthetic reaction centers. We will show here that the recently understood fundamental quantum dynamics of radical-ion-pair reactions open up a new and completely unexpected pathway toward obtaining chemically induced dynamic nuclear polarization signals. The fundamental decoherence mechanism inherent in the recombination process of radical pairs is shown to produce nuclear spin polarizations of the order of 10^4 times (or more) higher than the thermal equilibrium value at the Earth's magnetic field relevant to natural photosynthesis. This opens up the possibility of a fundamentally new exploration of the biological significance of high nuclear polarizations in photosynthesis. (paper)
The Drosophila genome nexus: a population genomic resource of 623 Drosophila melanogaster genomes, including 197 from a single ancestral range population.

Science.gov (United States)

Lack, Justin B; Cardeno, Charis M; Crepeau, Marc W; Taylor, William; Corbett-Detig, Russell B; Stevens, Kristian A; Langley, Charles H; Pool, John E

2015-04-01

Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets. Copyright © 2015 by the Genetics Society of America.
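The step of incorporating substitutions and short indels into an updated reference before a second round of mapping can be illustrated with a small sketch. This is an assumption-laden toy, not the authors' pipeline: the function name, the 0-based `(position, ref_allele, alt_allele)` tuple format and the example sequence are hypothetical.

```python
def apply_variants(reference, variants):
    """Build an updated reference by applying non-overlapping variants.

    variants is an iterable of (position, ref_allele, alt_allele) tuples with
    0-based positions. Substitutions, deletions and insertions are all
    expressed as replacing ref_allele with alt_allele at position.
    """
    pieces = []
    cursor = 0
    for position, ref_allele, alt_allele in sorted(variants):
        if position < cursor:
            raise ValueError(f"variant at {position} overlaps a previous variant")
        if reference[position:position + len(ref_allele)] != ref_allele:
            raise ValueError(f"reference allele mismatch at {position}")
        pieces.append(reference[cursor:position])  # unchanged sequence before the variant
        pieces.append(alt_allele)                  # substituted, shortened or lengthened allele
        cursor = position + len(ref_allele)
    pieces.append(reference[cursor:])              # unchanged tail
    return "".join(pieces)


reference = "ACGTACGTAC"
variants = [
    (2, "G", "T"),     # substitution
    (4, "AC", "A"),    # 1-bp deletion
    (8, "A", "ATT"),   # 2-bp insertion
]
updated = apply_variants(reference, variants)
print(updated)  # ACTTAGTATTC
```

In a two-round strategy of the kind described, reads would then be realigned to `updated`, and variant coordinates would need to be translated back to the original reference; that bookkeeping is omitted here.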
JICST Factual Database: JICST Chemical Substance Safety Regulation Database

Science.gov (United States)

Abe, Atsushi; Sohma, Tohru

JICST Chemical Substance Safety Regulation Database is based on the Database of Safety Laws for Chemical Compounds constructed by Japan Chemical Industry Ecology-Toxicology & Information Center (JETOC) sponsored by the Science and Technology Agency in 1987. JICST modified the JETOC database system, added data, and started the online service through JOIS-F (JICST Online Information Service-Factual database) in January 1990. The JICST database comprises eighty-three laws and fourteen hundred compounds. The authors outline the database, data items, files and search commands. An example of an online session is presented.
GI-POP: a combinational annotation and genomic island prediction pipeline for ongoing microbial genome projects.

Science.gov (United States)

Lee, Chi-Ching; Chen, Yi-Ping Phoebe; Yao, Tzu-Jung; Ma, Cheng-Yu; Lo, Wei-Cheng; Lyu, Ping-Chiang; Tang, Chuan Yi

2013-04-10

Sequencing of microbial genomes is important because microbes carry antibiotic and pathogenetic activities. However, even with the help of new assembling software, finishing a whole genome is a time-consuming task. In most bacteria, pathogenetic or antibiotic genes are carried in genomic islands. Therefore, a quick genomic island (GI) prediction method is useful for genomes whose sequencing is still ongoing. In this work, we built a Web server called GI-POP (http://gipop.life.nthu.edu.tw) which integrates a sequence assembling tool, a functional annotation pipeline, and a high-performance GI predicting module, in a support vector machine (SVM)-based method called genomic island genomic profile scanning (GI-GPS). The draft genomes of the ongoing genome projects in contigs or scaffolds can be submitted to our Web server, and it provides the functional annotation and highly probable GI-predicting results. GI-POP is a comprehensive annotation Web server designed for ongoing genome project analysis. Researchers can perform annotation and obtain pre-analytic information, including possible GIs, coding/non-coding sequences and functional analysis, from their draft genomes. This pre-analytic system can provide useful information for finishing a genome sequencing project. Copyright © 2012 Elsevier B.V. All rights reserved.

The Amaranth Genome: Genome, Transcriptome, and Physical Map Assembly

Directory of Open Access Journals (Sweden)

J. W. Clouse

2016-03-01

Amaranth ( L. is an emerging pseudocereal native to the New World that has garnered increased attention in recent years because of its nutritional quality, in particular its seed protein and more specifically its high levels of the essential amino acid lysine. It belongs to the Amaranthaceae family, is an ancient paleopolyploid that shows disomic inheritance (2n = 32), and has an estimated genome size of 466 Mb. Here we present a high-quality draft genome sequence of the grain amaranth. The genome assembly consisted of 377 Mb in 3518 scaffolds with an N50 of 371 kb. Repetitive element analysis predicted that 48% of the genome is comprised of repeat sequences, of which -like elements were the most commonly classified retrotransposon. A de novo transcriptome consisting of 66,370 contigs was assembled from eight different amaranth tissue and abiotic stress libraries. Annotation of the genome identified 23,059 protein-coding genes. Seven grain amaranths (, , and and their putative progenitor ( were resequenced. A single nucleotide polymorphism (SNP) phylogeny supported the classification of as the progenitor species of the grain amaranths. Lastly, we generated a de novo physical map for using the BioNano Genomics’ Genome Mapping platform. The physical map spanned 340 Mb and a hybrid assembly using the BioNano physical maps nearly doubled the N50 of the assembly to 697 kb. Moreover, we analyzed synteny between amaranth and sugar beet ( L. and estimated, using analysis, the age of the most recent polyploidization event in amaranth.
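Scaffold contiguity figures like those quoted in this abstract are conventionally reported as N50: the scaffold length at which the cumulative length of scaffolds, sorted from longest to shortest, first reaches half of the total assembly. The following is a minimal sketch of that calculation; the function name `n50` and the toy lengths are assumptions for illustration and are not taken from the amaranth assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # First scaffold at which the running sum covers half the assembly.
        if running >= half_total:
            return length
    raise ValueError("no scaffold lengths given")


toy_scaffolds_kb = [100, 80, 60, 40, 20]  # made-up values, in kb
print(n50(toy_scaffolds_kb))
```

Joining scaffolds, as a hybrid assembly with physical maps does, shifts more of the total length into fewer and longer pieces, which is why the statistic rises.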
Whole genome sequencing and bioinformatics analysis of two Egyptian genomes.

Science.gov (United States)

ElHefnawi, Mahmoud; Jeon, Sungwon; Bhak, Youngjune; ElFiky, Asmaa; Horaiz, Ahmed; Jun, JeHoon; Kim, Hyunho; Bhak, Jong

2018-05-15

We report two Egyptian male genomes (EGP1 and EGP2) sequenced at ~30× sequencing depth. EGP1 had 4.7 million variants, of which 198,877 were novel, while EGP2 had 209,109 novel variants out of 4.8 million variants. The mitochondrial haplogroups of the two individuals were identified as H7b1 and L2a1c, respectively. We also identified the Y haplogroup of EGP1 (R1b) and EGP2 (J1a2a1a2 > P58 > FGC11). EGP1 had a mutation in the NADH gene of the mitochondrial genome ND4 (m.11778 G > A) that causes Leber's hereditary optic neuropathy. Some SNPs shared by the two genomes were associated with an increased level of cholesterol and triglycerides, probably related to obesity in Egyptians. Comparison of these genomes with African and Western-Asian genomes can provide insights into Egyptian ancestry and genetic history. This resource can be used to further understand genomic diversity and functional classification of variants as well as human migration and evolution across Africa and Western Asia. Copyright © 2017. Published by Elsevier B.V.
Integration of Genomic, Biologic, and Chemical Approaches to Target p53 Loss and Gain-of-Function in Triple Negative Breast Cancer

Science.gov (United States)

2016-09-01

in this progress report: p53 triple-negative breast cancer subtypes gene expression somatic cell genetics CRISPR / Cas 3. ACCOMPLISHMENTS Major...report, we described the creation of an isogenic p53 mutant TNBC cell line panel using CRISPR / Cas -mediated genome editing8 and the resultant...LOF null state. To validate that mutant p53 is directly responsible for this altered transcription, we will use the same CRISPR -mediated genome
St2-80: a new FISH marker for St genome and genome analysis in Triticeae.

Science.gov (United States)

Wang, Long; Shi, Qinghua; Su, Handong; Wang, Yi; Sha, Lina; Fan, Xing; Kang, Houyang; Zhang, Haiqin; Zhou, Yonghong

2017-07-01

The St genome is one of the most fundamental genomes in Triticeae. Repetitive sequences are widely used to distinguish different genomes or species. The primary objectives of this study were to (i) screen a new sequence that could easily distinguish the chromosome of the St genome from those of other genomes by fluorescence in situ hybridization (FISH) and (ii) investigate the genome constitution of some species that remain uncertain and controversial. We used degenerated oligonucleotide primer PCR (Dop-PCR), Dot-blot, and FISH to screen for a new marker of the St genome and to test the efficiency of this marker in the detection of the St chromosome at different ploidy levels. Signals produced by a new FISH marker (denoted St2-80) were present on the entire arm of chromosomes of the St genome, except in the centromeric region. On the contrary, St2-80 signals were present in the terminal region of chromosomes of the E, H, P, and Y genomes. No signal was detected in the A and B genomes, and only weak signals were detected in the terminal region of chromosomes of the D genome. St2-80 signals were obvious and stable in chromosomes of different genomes, whether diploid or polyploid. Therefore, St2-80 is a potential and useful FISH marker that can be used to distinguish the St genome from those of other genomes in Triticeae.
The effects of environmental chemicals on renal function.

Science.gov (United States)

Kataria, Anglina; Trasande, Leonardo; Trachtman, Howard

2015-10-01

The global incidence of chronic kidney disease (CKD) is increasing among individuals of all ages. Despite advances in proteomics, genomics and metabolomics, there remains a lack of safe and effective drugs to reverse or stabilize renal function in patients with glomerular or tubulointerstitial causes of CKD. Consequently, modifiable risk factors that are associated with a progressive decline in kidney function need to be identified. Numerous reports have documented the adverse effects that occur in response to graded exposure to a wide range of environmental chemicals. This Review summarizes the effects of such chemicals on four aspects of cardiorenal function: albuminuria, glomerular filtration rate, blood pressure and serum uric acid concentration. We focus on compounds that individuals are likely to be exposed to as a consequence of normal consumer activities or medical treatment, namely phthalates, bisphenol A, polyfluorinated alkyl acids, dioxins and furans, polycyclic aromatic hydrocarbons and polychlorinated biphenyls. Environmental exposure to these chemicals during everyday life could have adverse consequences on renal function and might contribute to progressive cumulative renal injury over a lifetime. Regulatory efforts should be made to limit individual exposure to environmental chemicals in an attempt to reduce the incidence of cardiorenal disease.
DOE Human Genome Program: Contractor-Grantee Workshop IV, November 13--17, 1994, Santa Fe, New Mexico

Energy Technology Data Exchange (ETDEWEB)

1994-10-01

This volume contains the proceedings of the fourth Contractor-Grantee Workshop for the Department of Energy (DOE) Human Genome Program. Of the 204 abstracts in this book, some 200 describe the genome research of DOE-funded grantees and contractors located at the multidisciplinary centers at Lawrence Berkeley Laboratory, Lawrence Livermore National Laboratory, and Los Alamos National Laboratory; other DOE-supported laboratories; and more than 54 universities, research organizations, and companies in the United States and abroad. Included are 16 abstracts from ongoing projects in the Ethical, Legal, and Social Issues (ELSI) component, an area that continues to attract considerable attention from a wide variety of interested parties. Three abstracts summarize work in the new Microbial Genome Initiative launched this year by the Office of Health and Environmental Research (OHER) to provide genome sequence and mapping data on industrially important microorganisms and those that live under extreme conditions. Many of the projects will be discussed at plenary sessions held throughout the workshop, and all are represented in the poster sessions.
Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium

OpenAIRE

Henrique Machado; Lone Gram

2017-01-01

Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationship...
Whole-genome sequence of Escherichia coli serotype O157:H7 strain B6914-ARS

Science.gov (United States)

Escherichia coli serotype O157:H7 strain B6914-MS1 is a Shiga toxin-deficient human fecal isolate obtained by the Centers for Disease Control and Prevention that has been used extensively in applied research studies. Here we report the genome sequence of strain B6914-ARS, a B6914-MS1 clone that has ...
Incidence and characteristics of chemical burns.

Science.gov (United States)

Koh, Dong-Hee; Lee, Sang-Gil; Kim, Hwan-Cheol

2017-05-01

Chemical burns can lead to serious health outcomes. Previous studies of chemical burns have been based on burn center data, so they have provided limited information about the incidence of chemical burns at the national level. The aim of this study was to evaluate the incidence and characteristics of chemical burns using nationwide databases. A cohort representing the Korean population, which was established using a national health insurance database, and a nationwide workers' compensation database were used to evaluate the incidence and characteristics of chemical burns. Characteristics of the affected body region, depth of burns, industry, task, and causative agents were analyzed from the two databases. The incidence of chemical burns was calculated according to employment status. The most common regions involving chemical burns with hospital visits were the skin followed by the eyes. For skin lesions, the hands and wrists were the most commonly affected regions. Second degree burns were the most common in terms of depth of skin lesions. The hospital visit incidence was 1.96 per 10,000 person-years in the general population. The compensated chemical burns incidence was 0.17 per 10,000 person-years. Employees and the self-employed showed a significantly increased risk of chemical burns requiring hospital visits compared to their dependents. Chemical burns on the skin and eyes are almost equally prevalent. The working environment was associated with increased risk of chemical burns. Our results may aid in estimating the size of the problem and prioritizing prevention of chemical burns. Copyright © 2016 Elsevier Ltd and ISBI. All rights reserved.
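For readers unfamiliar with the unit, an incidence per 10,000 person-years is the number of cases divided by the total follow-up time and scaled by 10,000. The sketch below shows the arithmetic only; the function name `incidence_per_10k` is an assumption, and the case and person-year counts are made up so that they reproduce the reported rate of 1.96. They are not the study's underlying data.

```python
def incidence_per_10k(cases, person_years):
    """Return the incidence rate per 10,000 person-years."""
    if person_years <= 0:
        raise ValueError("person_years must be positive")
    return cases / person_years * 10_000


# Hypothetical counts chosen to match the reported rate, not study data.
rate = incidence_per_10k(cases=196, person_years=1_000_000)
print(round(rate, 2))
```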
Baculovirus Genomics

NARCIS (Netherlands)

Oers, van M.M.; Vlak, J.M.

2007-01-01

Baculovirus genomes are covalently closed circles of double-stranded DNA varying in size between 80 and 180 kilobase pairs. The genomes of more than forty-one baculoviruses have been sequenced to date. The majority of these (37) are pathogenic to lepidopteran hosts; three infect sawflies
Molluscan Evolutionary Genomics

Energy Technology Data Exchange (ETDEWEB)

Simison, W. Brian; Boore, Jeffrey L.

2005-12-01

In the last 20 years there have been dramatic advances in techniques of high-throughput DNA sequencing, most recently accelerated by the Human Genome Project, a program that has determined the three billion base pair code on which we are based. Now this tremendous capability is being directed at other genome targets that are being sampled across the broad range of life. This opens up opportunities as never before for evolutionary and organismal biologists to address questions of both processes and patterns of organismal change. We stand at the dawn of a new 'modern synthesis' period, paralleling that of the early 20th century when the fledgling field of genetics first identified the underlying basis for Darwin's theory. We must now unite the efforts of systematists, paleontologists, mathematicians, computer programmers, molecular biologists, developmental biologists, and others in the pursuit of discovering what genomics can teach us about the diversity of life. Genome-level sampling for mollusks to date has mostly been limited to mitochondrial genomes and it is likely that these will continue to provide the best targets for broad phylogenetic sampling in the near future. However, we are just beginning to see an inroad into complete nuclear genome sequencing, with several mollusks and other eutrochozoans having been selected for work about to begin. Here, we provide an overview of the state of molluscan mitochondrial genomics, highlight a few of the discoveries from this research, outline the promise of broadening this dataset, describe upcoming projects to sequence whole mollusk nuclear genomes, and challenge the community to prepare for making the best use of these data.
Molecular cytogenetic and genomic analyses reveal new insights into the origin of the wheat B genome.

Science.gov (United States)

Zhang, Wei; Zhang, Mingyi; Zhu, Xianwen; Cao, Yaping; Sun, Qing; Ma, Guojia; Chao, Shiaoman; Yan, Changhui; Xu, Steven S; Cai, Xiwen

2018-02-01

This work pinpointed the goatgrass chromosomal segment in the wheat B genome using modern cytogenetic and genomic technologies, and provided novel insights into the origin of the wheat B genome. Wheat is a typical allopolyploid with three homoeologous subgenomes (A, B, and D). The donors of subgenomes A and D have been identified, but the donor of subgenome B has not. The goatgrass Aegilops speltoides (genome SS) has been controversially considered a possible candidate for the donor of the wheat B genome. However, the relationship of the Ae. speltoides S genome with the wheat B genome remains largely obscure. The present study assessed the homology of the B and S genomes using an integrative cytogenetic and genomic approach, and revealed the contribution of Ae. speltoides to the origin of the wheat B genome. We discovered noticeable homology between wheat chromosome 1B and Ae. speltoides chromosome 1S, but not between other chromosomes in the B and S genomes. An Ae. speltoides-originated segment spanning a genomic region of approximately 10.46 Mb was detected on the long arm of wheat chromosome 1B (1BL). The Ae. speltoides-originated segment on 1BL was found to co-evolve with the rest of the B genome. Evidently, Ae. speltoides was involved in the origin of the wheat B genome, but should not be considered the exclusive donor of this genome. The wheat B genome might have a polyphyletic origin with multiple ancestors involved, including Ae. speltoides. These novel findings will facilitate genome studies in wheat and other polyploids.
RadGenomics project

Energy Technology Data Exchange (ETDEWEB)

Iwakawa, Mayumi; Imai, Takashi; Harada, Yoshinobu [National Inst. of Radiological Sciences, Chiba (Japan). Frontier Research Center] [and others]

2002-06-01

Human health is determined by a complex interplay of factors, predominantly between genetic susceptibility, environmental conditions and aging. The ultimate aim of the RadGenomics (Radiation Genomics) project is to understand the implications of heterogeneity in responses to ionizing radiation arising from genetic variation between individuals in the human population. The rapid progression of the human genome sequencing and the recent development of new technologies in molecular genetics are providing us with new opportunities to understand the genetic basis of individual differences in susceptibility to natural and/or artificial environmental factors, including radiation exposure. The RadGenomics project will inevitably lead to improved protocols for personalized radiotherapy and reductions in the potential side effects of such treatment. The project will contribute to future research into the molecular mechanisms of radiation sensitivity in humans and will stimulate the development of new high-throughput technologies for a broader application of biological and medical sciences. The staff members are specialists in a variety of fields, including genome science, radiation biology, medical science, molecular biology, and informatics, and have joined the RadGenomics project from various universities, companies, and research institutes. The project started in April 2001. (author)
Journal of Chemical Sciences | Indian Academy of Sciences

Indian Academy of Sciences (India)

Journal of Chemical Sciences; Volume 119; Issue 5. Controlling dynamics in diatomic systems ... Department of Chemistry, Panjab University, Chandigarh 160 014; Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad 500 032 ...
Engineering and Evolution of Saccharomyces cerevisiae to Produce Biofuels and Chemicals.

Science.gov (United States)

Turner, Timothy L; Kim, Heejin; Kong, In Iok; Liu, Jing-Jing; Zhang, Guo-Chang; Jin, Yong-Su

To mitigate global climate change caused partly by the use of fossil fuels, the production of fuels and chemicals from renewable biomass has been attempted. The conversion of various sugars from renewable biomass into biofuels by engineered baker's yeast (Saccharomyces cerevisiae) is one major direction which has grown dramatically in recent years. As well as shifting away from fossil fuels, the production of commodity chemicals by engineered S. cerevisiae has also increased significantly. The traditional approaches of biochemical and metabolic engineering to develop economic bioconversion processes in laboratory and industrial settings have been accelerated by rapid advancements in the areas of yeast genomics, synthetic biology, and systems biology. Together, these innovations have resulted in rapid and efficient manipulation of S. cerevisiae to expand fermentable substrates and diversify value-added products. Here, we discuss recent and major advances in rational (relying on prior experimentally-derived knowledge) and combinatorial (relying on high-throughput screening and genomics) approaches to engineer S. cerevisiae for producing ethanol, butanol, 2,3-butanediol, fatty acid ethyl esters, isoprenoids, organic acids, rare sugars, antioxidants, and sugar alcohols from glucose, xylose, cellobiose, galactose, acetate, alginate, mannitol, arabinose, and lactose.
Rumen microbial genomics

International Nuclear Information System (INIS)

Morrison, M.; Nelson, K.E.

2005-01-01

Improving microbial degradation of plant cell wall polysaccharides remains one of the highest priority goals for all livestock enterprises, including the cattle herds and draught animals of developing countries. The North American Consortium for Genomics of Fibrolytic Ruminal Bacteria was created to promote the sequencing and comparative analysis of rumen microbial genomes, offering the potential to fully assess the genetic potential in a functional and comparative fashion. It has been found that the Fibrobacter succinogenes genome encodes many more endoglucanases and cellodextrinases than previously isolated, and several new processive endoglucanases have been identified by genome and proteomic analysis of Ruminococcus albus, in addition to a variety of strategies for its adhesion to fibre. The ramifications of acquiring genome sequence data for rumen microorganisms are profound, including the potential to elucidate and overcome the biochemical, ecological or physiological processes that are rate limiting for ruminal fibre degradation. (author)
Atlas2 Cloud: a framework for personal genome analysis in the cloud.

Science.gov (United States)

Evani, Uday S; Challis, Danny; Yu, Jin; Jackson, Andrew R; Paithankar, Sameer; Bainbridge, Matthew N; Jakkamsetti, Adinarayana; Pham, Peter; Coarfa, Cristian; Milosavljevic, Aleksandar; Yu, Fuli

2012-01-01

Until recently, sequencing has primarily been carried out in large genome centers which have invested heavily in developing the computational infrastructure that enables genomic sequence analysis. The recent advancements in next generation sequencing (NGS) have led to a wide dissemination of sequencing technologies and data to highly diverse research groups. It is expected that clinical sequencing will become part of diagnostic routines shortly. However, limited accessibility to computational infrastructure and high quality bioinformatic tools, and the demand for personnel skilled in data analysis and interpretation, remain serious bottlenecks. Cloud computing and Software-as-a-Service (SaaS) technologies can help address these issues. We successfully enabled the Atlas2 Cloud pipeline for personal genome analysis on two different cloud service platforms: a community cloud via the Genboree Workbench, and a commercial cloud via the Amazon Web Services using a Software-as-a-Service model. We report a case study of personal genome analysis using our Atlas2 Genboree pipeline. We also outline a detailed cost structure for running Atlas2 Amazon on whole exome capture data, providing cost projections in terms of storage, compute and I/O when running Atlas2 Amazon on a large data set. We find that providing a web interface and an optimized pipeline clearly facilitates usage of cloud computing for personal genome analysis, but for it to be routinely used for large scale projects there needs to be a paradigm shift in the way we develop tools, in standard operating procedures, and in funding mechanisms.
Comparative genomics reveals insights into avian genome evolution and adaptation

Science.gov (United States)

Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun

2015-01-01

Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712
The Arabidopsis lyrata genome sequence and the basis of rapid genome size change

Energy Technology Data Exchange (ETDEWEB)

Hu, Tina T.; Pattyn, Pedro; Bakker, Erica G.; Cao, Jun; Cheng, Jan-Fang; Clark, Richard M.; Fahlgren, Noah; Fawcett, Jeffrey A.; Grimwood, Jane; Gundlach, Heidrun; Haberer, Georg; Hollister, Jesse D.; Ossowski, Stephan; Ottilar, Robert P.; Salamov, Asaf A.; Schneeberger, Korbinian; Spannagl, Manuel; Wang, Xi; Yang, Liang; Nasrallah, Mikhail E.; Bergelson, Joy; Carrington, James C.; Gaut, Brandon S.; Schmutz, Jeremy; Mayer, Klaus F. X.; Van de Peer, Yves; Grigoriev, Igor V.; Nordborg, Magnus; Weigel, Detlef; Guo, Ya-Long

2011-04-29

In our manuscript, we present a high-quality genome sequence of the Arabidopsis thaliana relative, Arabidopsis lyrata, produced by dideoxy sequencing. We have performed the usual types of genome analysis (gene annotation, dN/dS studies, etc.), but this is relegated to the Supporting Information. Instead, we focus on what was a major motivation for sequencing this genome, namely to understand how A. thaliana lost half its genome in a few million years and lived to tell the tale. The rather surprising conclusion is that there is not a single genomic feature that accounts for the reduced genome, but that every aspect (centromeres, intergenic regions, transposable elements, gene family number) is affected through hundreds of thousands of cuts. This strongly suggests that overall genome size in itself is what has been under selection, a suggestion that is strongly supported by our demonstration (using population genetics data from A. thaliana) that new deletions seem to be driven to fixation.

Revisiting the chlorophyll biosynthesis pathway using genome scale metabolic model of Oryza sativa japonica

Science.gov (United States)

Chatterjee, Ankita; Kundu, Sudip

2015-01-01

Chlorophyll is one of the most important pigments present in green plants and rice is one of the major food crops consumed worldwide. We curated the existing genome scale metabolic model (GSM) of rice leaf by incorporating a new compartment, reactions and transporters. We used this modified GSM to elucidate how chlorophyll is synthesized in a leaf through a series of biochemical reactions spanning different organelles, using inorganic macronutrients and light energy. We predicted the essential reactions and the associated genes of chlorophyll synthesis and validated them against the existing experimental evidence. Further, ammonia is known to be the preferred source of nitrogen in rice paddy fields. The ammonia entering the plant is assimilated in the root and leaf. The present work focuses on rice leaf metabolism. We studied the relative importance of ammonia transporters through the chloroplast and the cytosol and their interlink with other intracellular transporters. Ammonia assimilation in the leaves is carried out by the enzyme glutamine synthetase (GS), which is present in the cytosol (GS1) and chloroplast (GS2). Our results provide a possible explanation for why GS2 mutants show normal growth under minimum photorespiration and appear chlorotic when exposed to air. PMID:26443104
CTEPP NC DATA COLLECTED ON FORM 05: CHILD DAY CARE CENTER PRE-MONITORING QUESTIONNAIRE

Science.gov (United States)

This data set contains data concerning the potential sources of pollutants at the day care center including the chemicals that have been applied in the past at the day care center by staff members or by commercial contractors. The day care teacher was asked questions related to t...
CTEPP-OH DATA COLLECTED ON FORM 05: CHILD DAY CARE CENTER PRE-MONITORING QUESTIONNAIRE

Science.gov (United States)

This data set contains data for CTEPP-OH concerning the potential sources of pollutants at the day care center including the chemicals that have been applied in the past at the day care center by staff members or by commercial contractors. The day care teacher was asked questions...
Genome Imprinting

Indian Academy of Sciences (India)

the cell nucleus (mitochondrial and chloroplast genomes), and. (3) traits governed ... tively good embryonic development but very poor development of membranes and ... Human homologies for the type of situation described above are naturally ..... imprint; (b) New modifications of the paternal genome in germ cells of each ...
The synergistic effect of chemical carcinogens enhances Epstein-Barr virus reactivation and tumor progression of nasopharyngeal carcinoma cells.

Science.gov (United States)

Fang, Chih-Yeu; Huang, Sheng-Yen; Wu, Chung-Chun; Hsu, Hui-Yu; Chou, Sheng-Ping; Tsai, Ching-Hwa; Chang, Yao; Takada, Kenzo; Chen, Jen-Yang

2012-01-01

Seroepidemiological studies imply a correlation between Epstein-Barr virus (EBV) reactivation and the development of nasopharyngeal carcinoma (NPC). N-nitroso compounds, phorbols, and butyrates are chemicals found in food and herb samples collected from NPC high-risk areas. These chemicals have been reported to be risk factors contributing to the development of NPC, however, the underlying mechanism is not fully understood. We have demonstrated previously that low dose N-methyl-N'-nitro-N-nitrosoguanidine (MNNG, 0.1 µg/ml) had a synergistic effect with 12-O-tetradecanoylphorbol-13-acetate (TPA) and sodium butyrate (SB) in enhancing EBV reactivation and genome instability in NPC cells harboring EBV. Considering that residents in NPC high-risk areas may contact regularly with these chemical carcinogens, it is vital to elucidate the relation between chemicals and EBV and their contributions to the carcinogenesis of NPC. In this study, we constructed a cell culture model to show that genome instability, alterations of cancer hallmark gene expression, and tumorigenicity were increased after recurrent EBV reactivation in NPC cells following combined treatment of TPA/SB and MNNG. NPC cells latently infected with EBV, NA, and the corresponding EBV-negative cell, NPC-TW01, were periodically treated with MNNG, TPA/SB, or TPA/SB combined with MNNG. With chemically-induced recurrent reactivation of EBV, the degree of genome instability was significantly enhanced in NA cells treated with a combination of TPA/SB and MNNG than those treated individually. The Matrigel invasiveness, as well as the tumorigenicity in mouse, was also enhanced in NA cells after recurrent EBV reactivation. Expression profile analysis by microarray indicates that many carcinogenesis-related genes were altered after recurrent EBV reactivation, and several aberrations observed in cell lines correspond to alterations in NPC lesions. These results indicate that cooperation between chemical carcinogens can
Genomes to Proteomes

Energy Technology Data Exchange (ETDEWEB)

Panisko, Ellen A. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Grigoriev, Igor [USDOE Joint Genome Inst., Walnut Creek, CA (United States); Daly, Don S. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Webb-Robertson, Bobbie-Jo [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Baker, Scott E. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States)

2009-03-01

Biologists are awash with genomic sequence data. In large part, this is due to the rapid acceleration in the generation of DNA sequence that occurred as public and private research institutes raced to sequence the human genome. In parallel with the large human genome effort, mostly smaller genomes of other important model organisms were sequenced. Projects following on these initial efforts have made use of technological advances and the DNA sequencing infrastructure that was built for the human and other organism genome projects. As a result, the genome sequences of many organisms are available in high quality draft form. While in many ways this is good news, there are limitations to the biological insights that can be gleaned from DNA sequences alone; genome sequences offer only a bird's eye view of the biological processes endemic to an organism or community. Fortunately, the genome sequences now being produced at such a high rate can serve as the foundation for other global experimental platforms such as proteomics. Proteomic methods offer a snapshot of the proteins present at a point in time for a given biological sample. Current global proteomics methods combine enzymatic digestion, separations, mass spectrometry and database searching for peptide identification. One key aspect of proteomics is the prediction of peptide sequences from mass spectrometry data. Global proteomic analysis uses computational matching of experimental mass spectra with predicted spectra based on databases of gene models that are often generated computationally. Thus, the quality of gene models predicted from a genome sequence is crucial in the generation of high quality peptide identifications. Once peptides are identified they can be assigned to their parent protein. Proteins identified as expressed in a given experiment are most useful when compared to other expressed proteins in a larger biological context or biochemical pathway. In this chapter we will discuss the automatic
Genome and transcriptome analysis of the food-yeast Candida utilis.

Directory of Open Access Journals (Sweden)

Yasuyuki Tomita

The industrially important food-yeast Candida utilis is a Crabtree effect-negative yeast used to produce valuable chemicals and recombinant proteins. In the present study, we conducted whole genome sequencing and phylogenetic analysis of C. utilis, which showed that this yeast diverged long before the formation of the CUG and Saccharomyces/Kluyveromyces clades. In addition, we performed comparative genome and transcriptome analyses using next-generation sequencing, which resulted in the identification of genes important for characteristic phenotypes of C. utilis such as those involved in nitrate assimilation, in addition to the gene encoding the functional hexose transporter. We also found that an antisense transcript of the alcohol dehydrogenase gene, which in silico analysis did not predict to be a functional gene, was transcribed in the stationary-phase, suggesting a novel system of repression of ethanol production. These findings should facilitate the development of more sophisticated systems for the production of useful reagents using C. utilis.
Functional Toxicogenomic Assessment of Triclosan in Human HepG2 Cells Using Genome-Wide CRISPR-Cas9 Screening.

Science.gov (United States)

Xia, Pu; Zhang, Xiaowei; Xie, Yuwei; Guan, Miao; Villeneuve, Daniel L; Yu, Hongxia

2016-10-04

There are thousands of chemicals used by humans and detected in the environment for which limited or no toxicological data are available. Rapid and cost-effective approaches for assessing the toxicological properties of chemicals are needed. We used CRISPR-Cas9 functional genomic screening to identify the potential molecular mechanism of a widely used antimicrobial triclosan (TCS) in HepG2 cells. Resistant genes at IC50 (the concentration causing a 50% reduction in cell viability) were significantly enriched in the adherens junction pathway, MAPK signaling pathway, and PPAR signaling pathway, suggesting a potential role in the molecular mechanism of TCS-induced cytotoxicity. Evaluation of the top-ranked resistant genes, FTO (encoding an mRNA demethylase) and MAP2K3 (a MAP kinase kinase family gene), revealed that their loss conferred resistance to TCS. In contrast, sensitive genes at IC10 and IC20 were specifically enriched in pathways involved with immune responses, which was concordant with transcriptomic profiling of TCS at concentrations of CRISPR-Cas9 fingerprint may reveal the patterns of TCS toxicity at low concentration levels. Moreover, we retrieved the potential connection between CRISPR-Cas9 fingerprint and disease terms, obesity, and breast cancer from an existing chemical-gene-disease database. Overall, CRISPR-Cas9 functional genomic screening offers an alternative approach for chemical toxicity testing.
Genome Improvement at JGI-HAGSC

Energy Technology Data Exchange (ETDEWEB)

Grimwood, Jane; Schmutz, Jeremy J.; Myers, Richard M.

2012-03-03

Since the completion of the sequencing of the human genome, the Joint Genome Institute (JGI) has rapidly expanded its scientific goals in several DOE mission-relevant areas. At the JGI-HAGSC, we have kept pace with this rapid expansion of projects with our focus on assessing, assembling, improving and finishing eukaryotic whole genome shotgun (WGS) projects for which the shotgun sequence is generated at the Production Genomic Facility (JGI-PGF). We follow this by combining the draft WGS with genomic resources generated at JGI-HAGSC or in collaborator laboratories (including BAC end sequences, genetic maps and FLcDNA sequences) to produce an improved draft sequence. For eukaryotic genomes important to the DOE mission, we then add further information from directed experiments to produce reference genomic sequences that are publicly available for any scientific researcher. Also, we have continued our program for producing BAC-based finished sequence, both for adding information to JGI genome projects and for small BAC-based sequencing projects proposed through any of the JGI sequencing programs. We have now built our computational expertise in WGS assembly and analysis and have moved eukaryotic genome assembly from the JGI-PGF to JGI-HAGSC. We have concentrated our assembly development work on large plant genomes and complex fungal and algal genomes.
A comprehensive crop genome research project: the Superhybrid Rice Genome Project in China.

Science.gov (United States)

Yu, Jun; Wong, Gane Ka-Shu; Liu, Siqi; Wang, Jian; Yang, Huanming

2007-06-29

In May 2000, the Beijing Institute of Genomics formally announced the launch of a comprehensive crop genome research project on rice genomics, the Chinese Superhybrid Rice Genome Project. SRGP is not simply a sequencing project targeted to a single rice (Oryza sativa L.) genome, but a full-swing research effort with an ultimate goal of providing inclusive basic genomic information and molecular tools not only to understand biology of the rice, both as an important crop species and a model organism of cereals, but also to focus on a popular superhybrid rice landrace, LYP9. We have completed the first phase of SRGP and provide the rice research community with a finished genome sequence of an indica variety, 93-11 (the paternal cultivar of LYP9), together with ample data on subspecific (between subspecies) polymorphisms, transcriptomes and proteomes, useful for within-species comparative studies. In the second phase, we have acquired the genome sequence of the maternal cultivar, PA64S, together with the detailed catalogues of genes uniquely expressed in the parental cultivars and the hybrid as well as allele-specific markers that distinguish parental alleles. Although SRGP in China is not an open-ended research programme, it has been designed to pave a way for future plant genomics research and application, such as to interrogate fundamentals of plant biology, including genome duplication, polyploidy and hybrid vigour, as well as to provide genetic tools for crop breeding and to carry along a social burden: leading a fight against the world's hunger. It began with genomics, the newly developed and industry-scale research field, and from the world's most populous country. In this review, we summarize our scientific goals and noteworthy discoveries that exploit new territories of systematic investigations on basic and applied biology of rice and other major cereal crops.
Lactobacillus paracasei comparative genomics: towards species pan-genome definition and exploitation of diversity.

Directory of Open Access Journals (Sweden)

Tamara Smokvina

Lactobacillus paracasei is a member of the normal human and animal gut microbiota and is used extensively in the food industry in starter cultures for dairy products or as probiotics. With the development of low-cost, high-throughput sequencing techniques it has become feasible to sequence many different strains of one species and to determine its "pan-genome". We have sequenced the genomes of 34 different L. paracasei strains, and performed a comparative genomics analysis. We analysed genome synteny and content, focussing on the pan-genome, core genome and variable genome. Each genome was shown to contain around 2800-3100 protein-coding genes, and comparative analysis identified over 4200 ortholog groups that comprise the pan-genome of this species, of which about 1800 ortholog groups make up the conserved core. Several factors previously associated with host-microbe interactions such as pili, cell-envelope proteinase, hydrolases p40 and p75 or the capacity to produce short branched-chain fatty acids (bkd operon) are part of the L. paracasei core genome present in all analysed strains. The variome consists mainly of hypothetical proteins, phages, plasmids, transposon/conjugative elements, and known functions such as sugar metabolism, cell-surface proteins, transporters, CRISPR-associated proteins, and EPS biosynthesis proteins. An enormous variety and variability of sugar utilization gene cassettes were identified, with each strain harbouring between 25 and 53 cassettes, reflecting the high adaptability of L. paracasei to different niches. A phylogenomic tree was constructed based on total genome contents, and together with an analysis of horizontal gene transfer events we conclude that evolution of these L. paracasei strains is complex and not always related to niche adaptation. The results of this genome content comparison were used, together with high-throughput growth experiments on various carbohydrates, to perform gene-trait matching analysis
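The pan-genome, core genome and variable genome named in this record can be illustrated with plain set operations: the pan-genome is the union of the ortholog groups found in any strain, the core genome is their intersection, and the variable genome is the difference. The following is a minimal sketch under stated assumptions, not the authors' pipeline; the function name, strain labels and ortholog group identifiers are hypothetical.

```python
from typing import Dict, Set, Tuple


def pan_core_variable(
    strains: Dict[str, Set[str]],
) -> Tuple[Set[str], Set[str], Set[str]]:
    """Split ortholog groups into pan-genome, core genome and variable genome.

    `strains` maps a strain label to the set of ortholog group IDs it carries.
    """
    if not strains:
        raise ValueError("at least one strain is required")
    contents = list(strains.values())
    # Pan-genome: every ortholog group seen in at least one strain.
    pan = set().union(*contents)
    # Core genome: ortholog groups present in all strains.
    core = set(contents[0]).intersection(*contents[1:])
    # Variable genome: groups missing from at least one strain.
    variable = pan - core
    return pan, core, variable


# Hypothetical toy data (three strains, five ortholog groups).
toy_strains = {
    "strain_A": {"OG1", "OG2", "OG3"},
    "strain_B": {"OG1", "OG2", "OG4"},
    "strain_C": {"OG1", "OG2", "OG3", "OG5"},
}
pan, core, variable = pan_core_variable(toy_strains)
print(len(pan), len(core), len(variable))
```

With real data the ortholog groups would first have to be inferred by a clustering step that this sketch does not cover.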
Northeastern Center for Chemical Energy Storage (NECCES)

Energy Technology Data Exchange (ETDEWEB)

Whittingham, M. Stanley [Stony Brook Univ., NY (United States)

2015-07-31

The chemical reactions that occur in batteries are complex, spanning a wide range of time and length scales from atomic jumps to the entire battery structure. The NECCES team of experimentalists and theorists made use of, and developed new methodologies to determine how model compound electrodes function in real time, as batteries are cycled. The team determined that kinetic control of intercalation reactions (reactions in which the crystalline structure is maintained) can be achieved by control of the materials morphology and explains and allows for the high rates of many intercalation reactions where the fundamental properties might indicate poor behavior in a battery application. The small overvoltage required for kinetic control is technically effective and economically feasible. A wide range of state-of-the-art operando techniques was developed to study materials under realistic battery conditions, which are now available to the scientific community. The team also investigated the key reaction steps in conversion electrodes, where the crystal structure is destroyed on reaction with lithium and rebuilt on lithium removal. These so-called conversion reactions have in principle much higher capacities, but were found to form very reactive discharge products that reduce the overall energy efficiency on cycling. It was found that by mixing either the anion, as in FeOF, or the cation, as in Cu1-yFeyF2, the capacity on cycling could be improved. The fundamental understanding of the reactions occurring in electrode materials gained in this study will allow for the development of much improved battery systems for energy storage. This will benefit the public in longer lived electronics, higher electric vehicle ranges at lower costs, and improved grid storage that also enables renewable energy supplies such as wind and solar.
Engineering propionibacteria as versatile cell factories for the production of industrially important chemicals: advances, challenges, and prospects.

Science.gov (United States)

Guan, Ningzi; Zhuge, Xin; Li, Jianghua; Shin, Hyun-Dong; Wu, Jing; Shi, Zhongping; Liu, Long

2015-01-01

Propionibacteria are actinobacteria consisting of two principal groups: cutaneous and dairy. Cutaneous propionibacteria are considered primary pathogens to humans, whereas dairy propionibacteria are widely used in the food and pharmaceutical industries. Increasing attention has been focused on improving the performance of dairy propionibacteria for the production of industrially important chemicals, and significant advances have been made through strain engineering and process optimization in the production of flavor compounds, nutraceuticals, and antimicrobial compounds. In addition, genome sequencing of several propionibacteria species has been completed, deepening understanding of the metabolic and physiological features of these organisms. However, the metabolic engineering of propionibacteria still faces several challenges owing to the lack of efficient genome manipulation tools and the existence of various types of strong restriction-modification systems. The emergence of systems and synthetic biology provides new opportunities to overcome these bottlenecks. In this review, we first introduce the major species of propionibacteria and their properties and provide an overview of their functions and applications. We then discuss advances in the genome sequencing and metabolic engineering of these bacteria. Finally, we discuss systems and synthetic biology approaches for engineering propionibacteria as efficient and robust cell factories for the production of industrially important chemicals.
A Trichosporonales genome tree based on 27 haploid and three evolutionarily conserved 'natural' hybrid genomes.

Science.gov (United States)

Takashima, Masako; Sriswasdi, Sira; Manabe, Ri-Ichiroh; Ohkuma, Moriya; Sugita, Takashi; Iwasaki, Wataru

2018-01-01

To construct a backbone tree consisting of basidiomycetous yeasts, draft genome sequences from 25 species of Trichosporonales (Tremellomycetes, Basidiomycota) were generated. In addition to the hybrid genomes of Trichosporon coremiiforme and Trichosporon ovoides that we described previously, we identified an interspecies hybrid genome in Cutaneotrichosporon mucoides (formerly Trichosporon mucoides). This hybrid genome had a gene retention rate of ~55%, and its closest haploid relative was Cutaneotrichosporon dermatis. After constructing the C. mucoides subgenomes, we generated a phylogenetic tree using genome data from the 27 haploid species and the subgenome data from the three hybrid genome species. It was a high-quality tree with 100% bootstrap support for all of the branches. The genome-based tree provided superior resolution compared with previous multi-gene analyses. Although our backbone tree does not include all Trichosporonales genera (e.g. Cryptotrichosporon), it will be valuable for future analyses of genome data. Interest in interspecies hybrid fungal genomes has recently increased because they may provide a basis for new technologies. The three Trichosporonales hybrid genomes described in this study are different from well-characterized hybrid genomes (e.g. those of Saccharomyces pastorianus and Saccharomyces bayanus) because these hybridization events probably occurred in the distant evolutionary past. Hence, they will be useful for studying genome stability following hybridization and speciation events. Copyright © 2017 John Wiley & Sons, Ltd.
The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes

Science.gov (United States)

Liu, Shengyi; Liu, Yumei; Yang, Xinhua; Tong, Chaobo; Edwards, David; Parkin, Isobel A. P.; Zhao, Meixia; Ma, Jianxin; Yu, Jingyin; Huang, Shunmou; Wang, Xiyin; Wang, Junyi; Lu, Kun; Fang, Zhiyuan; Bancroft, Ian; Yang, Tae-Jin; Hu, Qiong; Wang, Xinfa; Yue, Zhen; Li, Haojie; Yang, Linfeng; Wu, Jian; Zhou, Qing; Wang, Wanxin; King, Graham J; Pires, J. Chris; Lu, Changxin; Wu, Zhangyan; Sampath, Perumal; Wang, Zhuo; Guo, Hui; Pan, Shengkai; Yang, Limei; Min, Jiumeng; Zhang, Dong; Jin, Dianchuan; Li, Wanshun; Belcram, Harry; Tu, Jinxing; Guan, Mei; Qi, Cunkou; Du, Dezhi; Li, Jiana; Jiang, Liangcai; Batley, Jacqueline; Sharpe, Andrew G; Park, Beom-Seok; Ruperao, Pradeep; Cheng, Feng; Waminal, Nomar Espinosa; Huang, Yin; Dong, Caihua; Wang, Li; Li, Jingping; Hu, Zhiyong; Zhuang, Mu; Huang, Yi; Huang, Junyan; Shi, Jiaqin; Mei, Desheng; Liu, Jing; Lee, Tae-Ho; Wang, Jinpeng; Jin, Huizhe; Li, Zaiyun; Li, Xun; Zhang, Jiefu; Xiao, Lu; Zhou, Yongming; Liu, Zhongsong; Liu, Xuequn; Qin, Rui; Tang, Xu; Liu, Wenbin; Wang, Yupeng; Zhang, Yangyong; Lee, Jonghoon; Kim, Hyun Hee; Denoeud, France; Xu, Xun; Liang, Xinming; Hua, Wei; Wang, Xiaowu; Wang, Jun; Chalhoub, Boulos; Paterson, Andrew H

2014-01-01

Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear. Brassica is an ideal model to increase knowledge of polyploid evolution. Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes. Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B. oleracea. This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus. PMID:24852848
Clinical genomics, big data, and electronic medical records: reconciling patient rights with research when privacy and science collide.

Science.gov (United States)

Kulynych, Jennifer; Greely, Henry T

2017-04-01

Widespread use of medical records for research, without consent, attracts little scrutiny compared to biospecimen research, where concerns about genomic privacy prompted recent federal proposals to mandate consent. This paper explores an important consequence of the proliferation of electronic health records (EHRs) in this permissive atmosphere: with the advent of clinical gene sequencing, EHR-based secondary research poses genetic privacy risks akin to those of biospecimen research, yet regulators still permit researchers to call gene sequence data 'de-identified', removing such data from the protection of the federal Privacy Rule and federal human subjects regulations. Medical centers and other providers seeking to offer genomic 'personalized medicine' now confront the problem of governing the secondary use of clinical genomic data as privacy risks escalate. We argue that regulators should no longer permit HIPAA-covered entities to treat dense genomic data as de-identified health information. Even with this step, the Privacy Rule would still permit disclosure of clinical genomic data for research, without consent, under a data use agreement, so we also urge that providers give patients specific notice before disclosing clinical genomic data for research, permitting (where possible) some degree of choice and control. To aid providers who offer clinical gene sequencing, we suggest both general approaches and specific actions to reconcile patients' rights and interests with genomic research.
Clinical genomics, big data, and electronic medical records: reconciling patient rights with research when privacy and science collide

Science.gov (United States)

Greely, Henry T.

2017-01-01

Widespread use of medical records for research, without consent, attracts little scrutiny compared to biospecimen research, where concerns about genomic privacy prompted recent federal proposals to mandate consent. This paper explores an important consequence of the proliferation of electronic health records (EHRs) in this permissive atmosphere: with the advent of clinical gene sequencing, EHR-based secondary research poses genetic privacy risks akin to those of biospecimen research, yet regulators still permit researchers to call gene sequence data ‘de-identified’, removing such data from the protection of the federal Privacy Rule and federal human subjects regulations. Medical centers and other providers seeking to offer genomic ‘personalized medicine’ now confront the problem of governing the secondary use of clinical genomic data as privacy risks escalate. We argue that regulators should no longer permit HIPAA-covered entities to treat dense genomic data as de-identified health information. Even with this step, the Privacy Rule would still permit disclosure of clinical genomic data for research, without consent, under a data use agreement, so we also urge that providers give patients specific notice before disclosing clinical genomic data for research, permitting (where possible) some degree of choice and control. To aid providers who offer clinical gene sequencing, we suggest both general approaches and specific actions to reconcile patients’ rights and interests with genomic research. PMID:28852559
Mining genome sequencing data to identify the genomic features linked to breast cancer histopathology

Science.gov (United States)

Ping, Zheng; Siegal, Gene P.; Almeida, Jonas S.; Schnitt, Stuart J.; Shen, Dejun

2014-01-01

Background: Genetics and genomics have radically altered our understanding of breast cancer progression. However, the genomic basis of various histopathologic features of breast cancer is not yet well-defined. Materials and Methods: The Cancer Genome Atlas (TCGA) is an international database containing a large collection of human cancer genome sequencing data. cBioPortal is a web tool developed for mining these sequencing data. We performed mining of TCGA sequencing data in an attempt to characterize the genomic features correlated with breast cancer histopathology. We first assessed the quality of the TCGA data using a group of genes with known alterations in various cancers. Both genome-wide gene mutation and copy number changes as well as a group of genes with a high frequency of genetic changes were then correlated with various histopathologic features of invasive breast cancer. Results: Validation of TCGA data using a group of genes with known alterations in breast cancer suggests that the TCGA has accurately documented the genomic abnormalities of multiple malignancies. Further analysis of TCGA breast cancer sequencing data shows that accumulation of specific genomic defects is associated with higher tumor grade, larger tumor size and receptor negativity. Distinct groups of genomic changes were found to be associated with the different grades of invasive ductal carcinoma. The mutator role of the TP53 gene was validated by genomic sequencing data of invasive breast cancer and TP53 mutation was found to play a critical role in defining high tumor grade. Conclusions: Data mining of the TCGA genome sequencing data is an innovative and reliable method to help characterize the genomic abnormalities associated with histopathologic features of invasive breast cancer. PMID:24672738
Mining genome sequencing data to identify the genomic features linked to breast cancer histopathology

Directory of Open Access Journals (Sweden)

Zheng Ping

2014-01-01

Background: Genetics and genomics have radically altered our understanding of breast cancer progression. However, the genomic basis of various histopathologic features of breast cancer is not yet well-defined. Materials and Methods: The Cancer Genome Atlas (TCGA) is an international database containing a large collection of human cancer genome sequencing data. cBioPortal is a web tool developed for mining these sequencing data. We performed mining of TCGA sequencing data in an attempt to characterize the genomic features correlated with breast cancer histopathology. We first assessed the quality of the TCGA data using a group of genes with known alterations in various cancers. Both genome-wide gene mutation and copy number changes as well as a group of genes with a high frequency of genetic changes were then correlated with various histopathologic features of invasive breast cancer. Results: Validation of TCGA data using a group of genes with known alterations in breast cancer suggests that the TCGA has accurately documented the genomic abnormalities of multiple malignancies. Further analysis of TCGA breast cancer sequencing data shows that accumulation of specific genomic defects is associated with higher tumor grade, larger tumor size and receptor negativity. Distinct groups of genomic changes were found to be associated with the different grades of invasive ductal carcinoma. The mutator role of the TP53 gene was validated by genomic sequencing data of invasive breast cancer and TP53 mutation was found to play a critical role in defining high tumor grade. Conclusions: Data mining of the TCGA genome sequencing data is an innovative and reliable method to help characterize the genomic abnormalities associated with histopathologic features of invasive breast cancer.
Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes.

Science.gov (United States)

Sun, Yan-Bo; Xiong, Zi-Jun; Xiang, Xue-Yan; Liu, Shi-Ping; Zhou, Wei-Wei; Tu, Xiao-Long; Zhong, Li; Wang, Lu; Wu, Dong-Dong; Zhang, Bao-Lin; Zhu, Chun-Ling; Yang, Min-Min; Chen, Hong-Man; Li, Fang; Zhou, Long; Feng, Shao-Hong; Huang, Chao; Zhang, Guo-Jie; Irwin, David; Hillis, David M; Murphy, Robert W; Yang, Huan-Ming; Che, Jing; Wang, Jun; Zhang, Ya-Ping

2015-03-17

The development of efficient sequencing techniques has resulted in large numbers of genomes being available for evolutionary studies. However, only one genome is available for all amphibians, that of Xenopus tropicalis, which is distantly related to the majority of frogs. More than 96% of frogs belong to the Neobatrachia, and no genome exists for this group. This dearth of amphibian genomes greatly restricts genomic studies of amphibians and, more generally, our understanding of tetrapod genome evolution. To fill this gap, we provide the de novo genome of a Tibetan Plateau frog, Nanorana parkeri, and compare it to that of X. tropicalis and other vertebrates. This genome encodes more than 20,000 protein-coding genes, a number similar to that of Xenopus. Although the genome size of Nanorana is considerably larger than that of Xenopus (2.3 vs. 1.5 Gb), most of the difference is due to the respective number of transposable elements in the two genomes. The two frogs exhibit considerable conserved whole-genome synteny despite having diverged approximately 266 Ma, indicating a slow rate of DNA structural evolution in anurans. Multigenome synteny blocks further show that amphibians have fewer interchromosomal rearrangements than mammals but have a comparable rate of intrachromosomal rearrangements. Our analysis also identifies 11 Mb of anuran-specific highly conserved elements that will be useful for comparative genomic analyses of frogs. The Nanorana genome offers an improved understanding of evolution of tetrapod genomes and also provides a genomic reference for other evolutionary studies.

The complete mitochondrial genome of Gossypium hirsutum and evolutionary analysis of higher plant mitochondrial genomes.

Science.gov (United States)

Liu, Guozheng; Cao, Dandan; Li, Shuangshuang; Su, Aiguo; Geng, Jianing; Grover, Corrinne E; Hu, Songnian; Hua, Jinping

2013-01-01

Mitochondria are the main manufacturers of cellular ATP in eukaryotes. The plant mitochondrial genome contains a large amount of foreign DNA and repeated sequences that undergo frequent intramolecular recombination. Upland cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant. Sequencing of the cotton mitochondrial (mt) genome could be helpful for research on the evolution of plant mt genomes. We sequenced the Gossypium hirsutum mt genome using 454 technology combined with screening of a Fosmid library and sequencing of positive clones, and conducted a series of evolutionary analyses on the mt genomes of Cycas taitungensis and 24 angiosperms. After data assembly and contig joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The completed G. hirsutum mt genome is 621,884 bp in length and contains 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are found conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes, and closely related species share the most homologous sequences. For species that have both mt and chloroplast genome sequences available, we checked the location of cp-like migration and found several fragments closely linked with mitochondrial genes. The G. hirsutum mt genome possesses most of the common characters of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes, suggests that evolution of mt genomes is consistent with plant taxonomy but independent among different species.
Microbial Genomes Multiply

Science.gov (United States)

Doolittle, Russell F.

2002-01-01

The publication of the first complete sequence of a bacterial genome in 1995 was a signal event, underscored by the fact that the article has been cited more than 2,100 times during the intervening seven years. It was a marvelous technical achievement, made possible by automatic DNA-sequencing machines. The feat is the more impressive in that complete genome sequencing has now been adopted in many different laboratories around the world. Four years ago in these columns I examined the situation after a dozen microbial genomes had been completed. Now, with upwards of 60 microbial genome sequences determined and twice that many in progress, it seems reasonable to assess just what is being learned. Are new concepts emerging about how cells work? Have there been practical benefits in the fields of medicine and agriculture? Is it feasible to determine the genomic sequence of every bacterial species on Earth? The answers to these questions may be Yes, Perhaps, and No, respectively.
Cyclohexanecarbonitriles: Assigning Configurations at Quaternary Centers From 13C NMR CN Chemical Shifts

Science.gov (United States)

Wei, Guoqing

2009-01-01

13C NMR chemical shifts of the nitrile carbon in cyclohexanecarbonitriles directly correlate with the configuration of the quaternary, nitrile-bearing stereocenter. Comparing 13C NMR chemical shifts for over 200 cyclohexanecarbonitriles reveals that equatorially oriented nitriles resonate 3.3 ppm downfield, on average, from their axial counterparts. Pairs of axial/equatorial diastereomers varying only at the nitrile-bearing carbon consistently exhibit downfield shifts of δ 0.4–7.2 for the equatorial nitrile carbon, even in angularly substituted decalins and hydrindanes. PMID:19348434
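The assignment rule in this abstract can be illustrated with a minimal sketch. It assumes a pair of diastereomers that differ only at the nitrile-bearing carbon; the function names and the example shift values are hypothetical and do not come from the paper. Only the direction of the rule (the equatorial nitrile carbon is downfield) and the reported 0.4 to 7.2 ppm range are taken from the abstract.

```python
from typing import Tuple

# Range of downfield shifts reported in the abstract for equatorial vs. axial
# nitrile carbons in diastereomer pairs (ppm).
REPORTED_MIN_PPM = 0.4
REPORTED_MAX_PPM = 7.2


def assign_nitrile_orientation(shift_a_ppm: float, shift_b_ppm: float) -> Tuple[str, str]:
    """Assign orientations to two diastereomers (hypothetical helper).

    The isomer whose nitrile carbon resonates further downfield (larger
    chemical shift) is labelled equatorial, the other axial.
    """
    if shift_a_ppm == shift_b_ppm:
        raise ValueError("identical shifts: the rule cannot distinguish the pair")
    if shift_a_ppm > shift_b_ppm:
        return ("equatorial", "axial")
    return ("axial", "equatorial")


def within_reported_range(shift_a_ppm: float, shift_b_ppm: float) -> bool:
    """Check whether the shift difference lies in the range quoted in the abstract."""
    difference = abs(shift_a_ppm - shift_b_ppm)
    return REPORTED_MIN_PPM <= difference <= REPORTED_MAX_PPM


# Hypothetical example values, chosen only to illustrate the rule.
labels = assign_nitrile_orientation(126.0, 122.7)
print(labels, within_reported_range(126.0, 122.7))
```

A difference outside the quoted range does not by itself invalidate an assignment; the check is only a reminder of the span of values the abstract reports.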
An orthologous transcriptional signature differentiates responses towards closely related chemicals in Arabidopsis thaliana and Brassica napus

Science.gov (United States)

Herbicides are structurally diverse chemicals that inhibit plant-specific targets; however, their off-target and potentially differentiating side-effects are less well defined. In this study, genome-wide expression profiling based on Affymetrix AtH1 arrays was used to identify dis...
Efficient Breeding by Genomic Mating.

Science.gov (United States)

Akdemir, Deniz; Sánchez, Julio I

2016-01-01

Selection in breeding programs can be done by using phenotypes (phenotypic selection), pedigree relationships (breeding value selection) or molecular markers (marker assisted selection or genomic selection). All these methods are based on truncation selection, focusing on the best performance of parents before mating. In this article we proposed an approach to breeding, named genomic mating, which focuses on mating instead of truncation selection. Genomic mating uses information in a similar fashion to genomic selection but includes information on complementation of parents to be mated. Following the efficiency frontier surface, genomic mating uses concepts of estimated breeding values, risk (usefulness) and coefficient of ancestry to optimize mating between parents. We used a genetic algorithm to find solutions to this optimization problem, and the results from our simulations comparing genomic selection, phenotypic selection and the mating approach indicate that the current approach for breeding complex traits is more favorable than phenotypic and genomic selection. Genomic mating is similar to genomic selection in terms of estimating marker effects, but in genomic mating the genetic information and the estimated marker effects are used to decide which genotypes should be crossed to obtain the next breeding population.
The life cycle of a genome project: perspectives and guidelines inspired by insect genome projects.

Science.gov (United States)

Papanicolaou, Alexie

2016-01-01

Many research programs on non-model species biology have been empowered by genomics. In turn, genomics is underpinned by a reference sequence and ancillary information created by so-called "genome projects". The most reliable genome projects are the ones created as part of an active research program and designed to address specific questions, but their life extends past publication. In this opinion paper I outline four key insights that have facilitated maintaining genomic communities: the key role of computational capability, the iterative process of building genomic resources, the value of community participation and the importance of manual curation. Taken together, these ideas can and do ensure the longevity of genome projects, and the growing non-model species community can use them to focus a discussion with regard to its future genomic infrastructure.
Genomic prediction using subsampling

OpenAIRE

Xavier, Alencar; Xu, Shizhong; Muir, William; Rainey, Katy Martin

2017-01-01

Background: Genome-wide assisted selection is a critical tool for the genetic improvement of plants and animals. Whole-genome regression models in a Bayesian framework represent the main family of prediction methods. Fitting such models with a large number of observations involves a prohibitive computational burden. We propose the use of subsampling bootstrap Markov chain in genomic prediction. This method consists of fitting whole-genome regression models by subsampling observations in each rou...
Ginseng Genome Database: an open-access platform for genomics of Panax ginseng.

Science.gov (United States)

Jayakodi, Murukarthick; Choi, Beom-Soon; Lee, Sang-Choon; Kim, Nam-Hoon; Park, Jee Young; Jang, Woojong; Lakshmanan, Meiyappan; Mohan, Shobhana V G; Lee, Dong-Yup; Yang, Tae-Jin

2018-04-12

Ginseng (Panax ginseng C.A. Meyer) is a perennial herbaceous plant that has been used in traditional oriental medicine for thousands of years. Ginsenosides, which have significant pharmacological effects on human health, are the foremost bioactive constituents in this plant. Given the importance of this plant to humans, an integrated omics resource is indispensable for facilitating genomic research, molecular breeding and pharmacological study of this herb. The first draft genome sequences of P. ginseng cultivar "Chunpoong" were reported recently. Here, using the draft genome, transcriptome, and functional annotation datasets of P. ginseng, we have constructed the Ginseng Genome Database (http://ginsengdb.snu.ac.kr/), the first open-access platform to provide comprehensive genomic resources of P. ginseng. The current version of this database provides the most up-to-date draft genome sequence (approximately 3000 Mbp of scaffold sequences) along with the structural and functional annotations for 59,352 genes and digital expression of genes based on transcriptome data from different tissues, growth stages and treatments. In addition, tools for visualization and the genomic data from various analyses are provided. All data in the database were manually curated and integrated within a user-friendly query page. This database provides valuable resources for a range of research fields related to P. ginseng and other species belonging to the Apiales order as well as for plant research communities in general. The Ginseng Genome Database can be accessed at http://ginsengdb.snu.ac.kr/.
Genome Modeling System: A Knowledge Management Platform for Genomics.

Directory of Open Access Journals (Sweden)

Malachi Griffith

2015-07-01

In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms.
Ancient genomes

OpenAIRE

Hoelzel, A Rus

2005-01-01

Ever since its invention, the polymerase chain reaction has been the method of choice for work with ancient DNA. In an application of modern genomic methods to material from the Pleistocene, a recent study has instead undertaken to clone and sequence a portion of the ancient genome of the cave bear.
Governance in genomics: a conceptual challenge for public health genomics law

Directory of Open Access Journals (Sweden)

Tobias Schulte in den Bäumen

2006-12-01

Increasing levels of genomic knowledge have led to awareness that new governance issues need to be taken into consideration. While some countries have created new statutory laws in the last 10 years, science supports the idea that genomic data should be treated like other medical data. In this article we discuss the three core models of governance in medical law on a conceptual level. The three models, the Medical, Public Health and Fundamental Rights Models, stress different values or, in legal terms, serve different principles. The Medical Model stands for expert knowledge and the standardisation of quality in healthcare. The Public Health Model fosters a social point of view as it advocates distributive justice in healthcare and an awareness of healthcare as a broader concept. The Fundamental Rights Model focuses on individual rights such as the right to privacy and autonomy. We argue that none of the models can be used in a purist fashion, as governance in genomics should enable society and individuals to protect individual rights, to strive for distributive justice and to ensure the quality of genomic services in one coherent process. Thus, governance in genomics requires procedural law and a set of applicable principles. The principle which underlies all three models is the principle of medical beneficence. Therefore genomic governance should refer to it as a key principle when conflicting rights of individuals or communities need to be balanced.
Genetical Genomics for Evolutionary Studies

NARCIS (Netherlands)

Prins, J.C.P.; Smant, G.; Jansen, R.C.

2012-01-01

Genetical genomics combines acquired high-throughput genomic data with genetic analysis. In this chapter, we discuss the application of genetical genomics for evolutionary studies, where new high-throughput molecular technologies are combined with mapping quantitative trait loci (QTL) on the genome
Human social genomics.

Directory of Open Access Journals (Sweden)

Steven W Cole

2014-08-01

A growing literature in human social genomics has begun to analyze how everyday life circumstances influence human gene expression. Social-environmental conditions such as urbanity, low socioeconomic status, social isolation, social threat, and low or unstable social status have been found to associate with differential expression of hundreds of gene transcripts in leukocytes and diseased tissues such as metastatic cancers. In leukocytes, diverse types of social adversity evoke a common conserved transcriptional response to adversity (CTRA) characterized by increased expression of proinflammatory genes and decreased expression of genes involved in innate antiviral responses and antibody synthesis. Mechanistic analyses have mapped the neural "social signal transduction" pathways that stimulate CTRA gene expression in response to social threat and may contribute to social gradients in health. Research has also begun to analyze the functional genomics of optimal health and thriving. Two emerging opportunities now stand to revolutionize our understanding of the everyday life of the human genome: network genomics analyses examining how systems-level capabilities emerge from groups of individual socially sensitive genomes and near-real-time transcriptional biofeedback to empirically optimize individual well-being in the context of the unique genetic, geographic, historical, developmental, and social contexts that jointly shape the transcriptional realization of our innate human genomic potential for thriving.
Reference-quality genome sequence of Aegilops tauschii, the source of wheat D genome, shows that recombination shapes genome structure and evolution

Science.gov (United States)

Aegilops tauschii is the diploid progenitor of the D genome of hexaploid wheat and an important genetic resource for wheat. A reference-quality sequence for the Ae. tauschii genome was produced with a combination of ordered-clone sequencing, whole-genome shotgun sequencing, and BioNano optical geno...
Genomic taxonomy of vibrios

Directory of Open Access Journals (Sweden)

Iida Tetsuya

2009-10-01

Background: Vibrio taxonomy has been based on a polyphasic approach. In this study, we retrieve useful taxonomic information (i.e. data that can be used to distinguish different taxonomic levels, such as species and genera) from 32 genome sequences of different vibrio species. We use a variety of tools to explore the taxonomic relationship between the sequenced genomes, including Multilocus Sequence Analysis (MLSA), supertrees, Average Amino Acid Identity (AAI), genomic signatures, and Genome BLAST atlases. Our aim is to analyse the usefulness of these tools for species identification in vibrios. Results: We have generated four new genome sequences of three Vibrio species, i.e., V. alginolyticus 40B, V. harveyi-like 1DA3, and V. mimicus strains VM573 and VM603, and present a broad analysis of these genomes along with other sequenced Vibrio species. The genome atlas and pangenome plots provide a tantalizing image of the genomic differences that occur between closely related sister species, e.g. V. cholerae and V. mimicus. The vibrio pangenome contains around 26504 genes. The V. cholerae core genome and pangenome consist of 1520 and 6923 genes, respectively. Pangenomes might allow different strains of V. cholerae to occupy different niches. MLSA and supertree analyses resulted in a similar phylogenetic picture, with a clear distinction of four groups (Vibrio core group, V. cholerae-V. mimicus, Aliivibrio spp., and Photobacterium spp.). A Vibrio species is defined as a group of strains that share > 95% DNA identity in MLSA and supertree analysis, > 96% AAI, ≤ 10 genome signature dissimilarity, and > 61% proteome identity. Strains of the same species and species of the same genus will form monophyletic groups on the basis of MLSA and supertree. Conclusion: The combination of different analytical and bioinformatics tools will enable the most accurate species identification through genomic computational analysis. This endeavour will culminate in
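The species thresholds quoted in this abstract can be sketched as a simple predicate. This is an illustration only: the class and function names are hypothetical, the sketch assumes that all four criteria must hold at once, and it assumes that a single identity figure stands in for the MLSA and supertree analyses. How the metrics themselves are computed is outside its scope.

```python
from dataclasses import dataclass


@dataclass
class StrainPairMetrics:
    """Pairwise metrics for two strains (hypothetical container)."""
    mlsa_identity_pct: float        # DNA identity in MLSA/supertree analysis, percent
    aai_pct: float                  # Average Amino Acid Identity, percent
    signature_dissimilarity: float  # genome signature dissimilarity
    proteome_identity_pct: float    # proteome identity, percent


def same_vibrio_species(m: StrainPairMetrics) -> bool:
    """Apply the thresholds quoted in the abstract, combined with logical AND (an assumption)."""
    return (
        m.mlsa_identity_pct > 95.0
        and m.aai_pct > 96.0
        and m.signature_dissimilarity <= 10.0
        and m.proteome_identity_pct > 61.0
    )


# Hypothetical values, not taken from the study.
close_pair = StrainPairMetrics(98.0, 97.5, 4.0, 80.0)
distant_pair = StrainPairMetrics(90.0, 85.0, 25.0, 55.0)
print(same_vibrio_species(close_pair), same_vibrio_species(distant_pair))
```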
Comparative genome analysis of Bacillus cereus group genomes with Bacillus subtilis

Energy Technology Data Exchange (ETDEWEB)

Anderson, Iain; Sorokin, Alexei; Kapatral, Vinayak; Reznik, Gary; Bhattacharya, Anamitra; Mikhailova, Natalia; Burd, Henry; Joukov, Victor; Kaznadzey, Denis; Walunas, Theresa; D'Souza, Mark; Larsen, Niels; Pusch, Gordon; Liolios, Konstantinos; Grechkin, Yuri; Lapidus, Alla; Goltsman, Eugene; Chu, Lien; Fonstein, Michael; Ehrlich, S. Dusko; Overbeek, Ross; Kyrpides, Nikos; Ivanova, Natalia

2005-09-14

Genome features of the Bacillus cereus group genomes (representative strains of Bacillus cereus, Bacillus anthracis and Bacillus thuringiensis subsp. israelensis) were analyzed and compared with the Bacillus subtilis genome. A core set of 1,381 protein families among the four Bacillus genomes, with an additional set of 933 families common to the B. cereus group, was identified. Differences in signal transduction pathways, membrane transporters, cell surface structures, cell wall, and S-layer proteins suggesting differences in their phenotype were identified. The B. cereus group has signal transduction systems including a tyrosine kinase related to two-component system histidine kinases from B. subtilis. A model for regulation of the stress-responsive sigma factor sigmaB in the B. cereus group, different from the well-studied regulation in B. subtilis, has been proposed. Despite a high degree of chromosomal synteny among these genomes, significant differences in cell wall and spore coat proteins that contribute to survival and adaptation in specific hosts have been identified.
What does it mean to be genomically literate?: National Human Genome Research Institute Meeting Report.

Science.gov (United States)

Hurle, Belen; Citrin, Toby; Jenkins, Jean F; Kaphingst, Kimberly A; Lamb, Neil; Roseman, Jo Ellen; Bonham, Vence L

2013-08-01

Genomic discoveries will increasingly advance the science of medicine. Limited genomic literacy may adversely impact the public's understanding and use of the power of genetics and genomics in health care and public health. In November 2011, a meeting was held by the National Human Genome Research Institute to examine the challenge of achieving genomic literacy for the general public, from kindergarten to grade 12 to adult education. The role of the media in disseminating scientific messages and in perpetuating or reducing misconceptions was also discussed. Workshop participants agreed that genomic literacy will be achieved only through active engagement between genomics experts and the varied constituencies that comprise the public. This report summarizes the background, content, and outcomes from this meeting, including recommendations for a research agenda to inform decisions about how to advance genomic literacy in our society.
Ultrafast comparison of personal genomes

OpenAIRE

Mauldin, Denise; Hood, Leroy; Robinson, Max; Glusman, Gustavo

2017-01-01

We present an ultra-fast method for comparing personal genomes. We transform the standard genome representation (lists of variants relative to a reference) into 'genome fingerprints' that can be readily compared across sequencing technologies and reference versions. Because of their reduced size, computation on the genome fingerprints is fast and requires little memory. This enables scaling up a variety of important genome analyses, including quantifying relatedness, recognizing duplicative s...
Bioinformatics decoding the genome

CERN Multimedia

CERN. Geneva; Deutsch, Sam; Michielin, Olivier; Thomas, Arthur; Descombes, Patrick

2006-01-01

Extracting the fundamental genomic sequence from the DNA. From Genome to Sequence: Biology in the early 21st century has been radically transformed by the availability of the full genome sequences of an ever increasing number of life forms, from bacteria to major crop plants and to humans. The lecture will concentrate on the computational challenges associated with the production, storage and analysis of genome sequence data, with an emphasis on mammalian genomes. The quality and usability of genome sequences is increasingly conditioned by the careful integration of strategies for data collection and computational analysis, from the construction of maps and libraries to the assembly of raw data into sequence contigs and chromosome-sized scaffolds. Once the sequence is assembled, a major challenge is the mapping of biologically relevant information onto this sequence: promoters, introns and exons of protein-encoding genes, regulatory elements, functional RNAs, pseudogenes, transposons, etc. The methodological ...
The wolf reference genome sequence (Canis lupus lupus) and its implications for Canis spp. population genomics

DEFF Research Database (Denmark)

Gopalakrishnan, Shyam; Samaniego Castruita, Jose Alfredo; Sinding, Mikkel Holger Strander

2017-01-01

Background An increasing number of studies are addressing the evolutionary genomics of dog domestication, principally through resequencing dog, wolf and related canid genomes. There is, however, only one de novo assembled canid genome currently available against which to map such data - that of a... that regardless of the reference genome choice, most evolutionary genomic analyses yield qualitatively similar results, including those exploring the structure between the wolves and dogs using admixture and principal component analysis. However, we do observe differences in the genomic coverage of re-mapped...
