WorldWideScience

Sample records for genome fragments based

  1. Unsupervised binning of environmental genomic fragments based on an error robust selection of l-mers.

    Science.gov (United States)

    Yang, Bin; Peng, Yu; Leung, Henry Chi-Ming; Yiu, Siu-Ming; Chen, Jing-Chi; Chin, Francis Yuk-Lun

    2010-04-16

    With the rapid development of genome sequencing techniques, traditional research methods based on the isolation and cultivation of microorganisms are being gradually replaced by metagenomics, which is also known as environmental genomics. The first step of metagenomics, which is still a major bottleneck, is the taxonomic characterization of DNA fragments (reads) resulting from sequencing a sample of mixed species. This step is usually referred to as "binning". Existing binning methods are based on supervised or semi-supervised approaches which rely heavily on reference genomes of known microorganisms and phylogenetic marker genes. Due to the limited availability of reference genomes and the bias and instability of marker genes, existing binning methods may not be applicable in many cases. In this paper, we present an unsupervised binning method based on the distribution of a carefully selected set of l-mers (substrings of length l in DNA fragments). From our experiments, we show that our method can accurately bin DNA fragments with various lengths and relative species abundance ratios without using any reference and training datasets. Another feature of our method is its error robustness. The binning accuracy decreases by less than 1% when the sequencing error rate increases from 0% to 5%. Note that the typical sequencing error rate of existing commercial sequencing platforms is less than 2%. We provide a new and effective tool to solve the metagenome binning problem without using any reference datasets or marker information of any known reference genomes (species). The source code of our software tool, the reference genomes of the species for generating the test datasets and the corresponding test datasets are available at http://i.cs.hku.hk/~alse/MetaCluster/.
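
    As a rough illustration of the l-mer distribution idea in this abstract, the sketch below turns one read into a vector of relative l-mer frequencies. It is not the MetaCluster implementation: the function name lmer_frequencies and the default l = 4 are assumptions made here, and the "carefully selected" l-mer subset described by the authors is not reproduced.

```python
from collections import Counter
from itertools import product


def lmer_frequencies(read: str, l: int = 4) -> dict[str, float]:
    """Return the relative frequency of every DNA l-mer in one read.

    Hypothetical helper for illustration; l = 4 is an assumed default.
    """
    read = read.upper()
    counts = Counter(
        read[i:i + l]
        for i in range(len(read) - l + 1)
        if set(read[i:i + l]) <= set("ACGT")  # skip windows containing N etc.
    )
    total = sum(counts.values())
    all_lmers = ("".join(p) for p in product("ACGT", repeat=l))
    # Every possible l-mer gets an entry so that vectors are comparable.
    return {m: (counts[m] / total if total else 0.0) for m in all_lmers}
```

    Reads from the same genome would be expected to give similar vectors, so a generic clustering step could then group them; which l-mers to keep and how to cluster are choices the abstract leaves to the paper.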

  2. An Efficient Genome Fragment Assembling Using GA with Neighborhood Aware Fitness Function

    Directory of Open Access Journals (Sweden)

    Satoko Kikuchi

    2012-01-01

    To decode a long genome sequence, shotgun sequencing is the state-of-the-art technique. It needs to properly sequence a very large number, sometimes as large as millions, of short partially readable strings (fragments). Arranging those fragments in the correct sequence is known as fragment assembling, which is an NP-problem. Presently used methods require enormous computational cost. In this work, we have shown how our modified genetic algorithm (GA) could solve this problem efficiently. In the proposed GA, the length of the chromosome, which represents the volume of the search space, is reduced with advancing generations, thereby improving search efficiency. We also introduced a greedy mutation, by swapping nearby fragments using some heuristics, to improve the fitness of chromosomes. We compared results with Parsons’ algorithm, which is also based on a GA. We used fragments with partial reads on both sides, mimicking fragments in the real genome assembling process. In Parsons’ work the base-pair array of the whole fragment is known. Even then, we could obtain much better results, and we succeeded in restructuring contigs covering 100% of the genome sequences.
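
    The greedy mutation described above can be sketched as follows. This is only a minimal illustration under stated assumptions: fitness here is the plain sum of suffix-prefix overlaps between neighbouring fragments, which is not necessarily the authors' neighborhood-aware fitness function, and the names overlap, fitness and greedy_swap_mutation are hypothetical.

```python
def overlap(a: str, b: str) -> int:
    """Length of the longest suffix of a that is also a prefix of b."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def fitness(order: list[int], frags: list[str]) -> int:
    """Sum of overlaps between consecutive fragments in the layout (assumed fitness)."""
    return sum(overlap(frags[i], frags[j]) for i, j in zip(order, order[1:]))


def greedy_swap_mutation(order: list[int], frags: list[str]) -> list[int]:
    """Swap neighbouring positions, keeping a swap only if fitness improves."""
    best = list(order)
    for p in range(len(best) - 1):
        trial = best[:]
        trial[p], trial[p + 1] = trial[p + 1], trial[p]
        if fitness(trial, frags) > fitness(best, frags):
            best = trial
    return best
```

    In a full GA this mutation would be applied to chromosomes (fragment orderings) alongside selection and crossover; those parts, and the shrinking chromosome length mentioned in the abstract, are omitted here.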

  3. Single-Cell-Based Platform for Copy Number Variation Profiling through Digital Counting of Amplified Genomic DNA Fragments.

    Science.gov (United States)

    Li, Chunmei; Yu, Zhilong; Fu, Yusi; Pang, Yuhong; Huang, Yanyi

    2017-04-26

    We develop a novel single-cell-based platform through digital counting of amplified genomic DNA fragments, named multifraction amplification (mfA), to detect the copy number variations (CNVs) in a single cell. Amplification is required to acquire genomic information from a single cell, while introducing unavoidable bias. Unlike prevalent methods that directly infer CNV profiles from the pattern of sequencing depth, our mfA platform denatures and separates the DNA molecules from a single cell into multiple fractions of a reaction mix before amplification. By examining the sequencing result of each fraction for a specific fragment and applying a segment-merge maximum likelihood algorithm to the calculation of copy number, we digitize the sequencing-depth-based CNV identification and thus provide a method that is less sensitive to the amplification bias. In this paper, we demonstrate a mfA platform through multiple displacement amplification (MDA) chemistry. When performing the mfA platform, the noise of MDA is reduced; therefore, the resolution of single-cell CNV identification can be improved to 100 kb. We can also determine the genomic region free of allelic drop-out with mfA platform, which is impossible for conventional single-cell amplification methods.
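
    The "digital counting" step can be pictured with the short sketch below: instead of using raw sequencing depth, each genomic bin is scored by the number of fractions in which it was detected at all. The data layout, the function name digital_counts and the min_reads threshold are assumptions for illustration; the segment-merge maximum likelihood algorithm from the abstract is not shown.

```python
def digital_counts(depth: list[list[int]], min_reads: int = 1) -> list[int]:
    """Count, per genomic bin, the fractions in which that bin was detected.

    depth[f][b] is the number of reads from fraction f falling in bin b
    (an assumed layout). A bin counts as detected in a fraction when it
    has at least min_reads reads, regardless of how strongly it amplified.
    """
    if not depth:
        return []
    n_bins = len(depth[0])
    return [
        sum(1 for fraction in depth if fraction[b] >= min_reads)
        for b in range(n_bins)
    ]
```

    Because a fraction contributes at most one count per bin, a fragment that amplified 950-fold and one that amplified 3-fold weigh the same, which is the intuition behind the reduced sensitivity to amplification bias.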

  4. The SGC beyond structural genomics: redefining the role of 3D structures by coupling genomic stratification with fragment-based discovery.

    Science.gov (United States)

    Bradley, Anthony R; Echalier, Aude; Fairhead, Michael; Strain-Damerell, Claire; Brennan, Paul; Bullock, Alex N; Burgess-Brown, Nicola A; Carpenter, Elisabeth P; Gileadi, Opher; Marsden, Brian D; Lee, Wen Hwa; Yue, Wyatt; Bountra, Chas; von Delft, Frank

    2017-11-08

    The ongoing explosion in genomics data has long since outpaced the capacity of conventional biochemical methodology to verify the large number of hypotheses that emerge from the analysis of such data. In contrast, it is still a gold-standard for early phenotypic validation towards small-molecule drug discovery to use probe molecules (or tool compounds), notwithstanding the difficulty and cost of generating them. Rational structure-based approaches to ligand discovery have long promised the efficiencies needed to close this divergence; in practice, however, this promise remains largely unfulfilled, for a host of well-rehearsed reasons and despite the huge technical advances spearheaded by the structural genomics initiatives of the noughties. Therefore the current, fourth funding phase of the Structural Genomics Consortium (SGC), building on its extensive experience in structural biology of novel targets and design of protein inhibitors, seeks to redefine what it means to do structural biology for drug discovery. We developed the concept of a Target Enabling Package (TEP) that provides, through reagents, assays and data, the missing link between genetic disease linkage and the development of usefully potent compounds. There are multiple prongs to the ambition: rigorously assessing targets' genetic disease linkages through crowdsourcing to a network of collaborating experts; establishing a systematic approach to generate the protocols and data that comprise each target's TEP; developing new, X-ray-based fragment technologies for generating high quality chemical matter quickly and cheaply; and exploiting a stringently open access model to build multidisciplinary partnerships throughout academia and industry. By learning how to scale these approaches, the SGC aims to make structures finally serve genomics, as originally intended, and demonstrate how 3D structures systematically allow new modes of druggability to be discovered for whole classes of targets.

  5. Non PCR-amplified Transcripts and AFLP fragments as reduced representations of the quail genome for 454 Titanium sequencing

    Directory of Open Access Journals (Sweden)

    Leterrier Christine

    2010-07-01

    Background: SNP (Single Nucleotide Polymorphism) discovery is now routinely performed using high-throughput sequencing of reduced representation libraries. Our objective was to adapt 454 GS FLX based sequencing methodologies in order to obtain the largest possible dataset from two reduced representation libraries, produced by AFLP (Amplified Fragment Length Polymorphism) for genomic DNA, and EST (Expressed Sequence Tag) for the transcribed fraction of the genome. Findings: The expressed fraction was obtained by preparing cDNA libraries without PCR amplification from quail embryo and brain. To optimize the information content for SNP analyses, libraries were prepared from individuals selected in three quail lines and each individual in the AFLP library was tagged. Sequencing runs produced 399,189 sequence reads from cDNA and 373,484 from genomic fragments, covering close to 250 Mb of sequence in total. Conclusions: Both methods used to obtain reduced representations for high-throughput sequencing were successful after several improvements. The protocols may be used for several sequencing applications, such as de novo sequencing, tagged PCR fragments or long fragment sequencing of cDNA.

  6. Nuclear targeting by fragmentation of the Potato spindle tuber viroid genome

    International Nuclear Information System (INIS)

    Abraitiene, Asta; Zhao Yan; Hammond, Rosemarie

    2008-01-01

    Transient expression of engineered reporter RNAs encoding an intron-containing green fluorescent protein (GFP) from a Potato virus X-based expression vector previously demonstrated the nuclear targeting capability of the 359 nucleotide Potato spindle tuber viroid (PSTVd) RNA genome. To further delimit the putative nuclear-targeting signal, PSTVd subgenomic fragments were embedded within the intron, and recombinant reporter RNAs were inoculated onto Nicotiana benthamiana plants. Appearance of green fluorescence in leaf tissue inoculated with PSTVd-fragment-containing constructs indicated shuttling of the RNA into the nucleus by fragments as short as 80 nucleotides in length. Plant-to-plant variation in the timing of intron removal and subsequent GFP fluorescence was observed; however, the earliest and most abundant GFP expression was obtained with constructs containing the conserved hairpin I palindrome structure and embedded upper central conserved region. Our results suggest that this conserved sequence and/or the stem-loop structure it forms is sufficient for import of PSTVd into the nucleus.

  7. ERIC-PCR fingerprinting-based community DNA hybridization to pinpoint genome-specific fragments as molecular markers to identify and track populations common to healthy human guts.

    Science.gov (United States)

    Wei, Guifang; Pan, Li; Du, Huimin; Chen, Junyi; Zhao, Liping

    2004-10-01

    Bacterial populations common to healthy human guts may play important roles in human health. A new strategy for discovering genomic sequences as markers for these bacteria was developed using Enterobacterial Repetitive Intergenic Consensus (ERIC)-PCR fingerprinting. Structural features within microbial communities are compared with ERIC-PCR followed by DNA hybridization to identify genomic fragments shared by samples from healthy human individuals. ERIC-PCR profiles of fecal samples from 12 diseased or healthy human and piglet subjects demonstrated stable, unique banding patterns for each individual tested. Sequence homology of DNA fragments in bands of identical size was examined between samples by hybridization under high stringency conditions with DIG-labeled ERIC-PCR products derived from the fecal sample of one healthy child. Comparative analysis of the hybridization profiles with the original agarose fingerprints identified three predominant bands as signatures for populations associated with healthy human guts with sizes of 500, 800 and 1000 bp. Clone library profiling of the three bands produced 17 genome fragments, three of which showed high similarity only with regions of the Bacteroides thetaiotaomicron genome, while the remainder were orphan sequences. Association of these sequences with healthy guts was validated by sequence-selective PCR experiments, which showed that a single fragment was present in all 32 healthy humans and 13 healthy piglets tested. Two fragments were present in the healthy human group and in 18 children with non-infectious diarrhea but not in eight children with infectious diarrhea. Genome fragments identified with this novel strategy may be used as genome-specific markers for dynamic monitoring and sequence-guided isolation of functionally important bacterial populations in complex communities such as human gut microflora.

  8. Mind the gap; seven reasons to close fragmented genome assemblies.

    Science.gov (United States)

    Thomma, Bart P H J; Seidl, Michael F; Shi-Kunne, Xiaoqian; Cook, David E; Bolton, Melvin D; van Kan, Jan A L; Faino, Luigi

    2016-05-01

    Like other domains of life, research into the biology of filamentous microbes has greatly benefited from the advent of whole-genome sequencing. Next-generation sequencing (NGS) technologies have revolutionized sequencing, making genomic sciences accessible to many academic laboratories including those that study non-model organisms. Thus, hundreds of fungal genomes have been sequenced and are publicly available today, although these initiatives have typically yielded considerably fragmented genome assemblies that often lack large contiguous genomic regions. Many important genomic features are contained in intergenic DNA that is often missing in current genome assemblies, and recent studies underscore the significance of non-coding regions and repetitive elements for the life style, adaptability and evolution of many organisms. The study of particular types of genetic elements, such as telomeres, centromeres, repetitive elements, effectors, and clusters of co-regulated genes, but also of phenomena such as structural rearrangements, genome compartmentalization and epigenetics, greatly benefits from having a contiguous and high-quality, preferably even complete and gapless, genome assembly. Here we discuss a number of important reasons to produce gapless, finished, genome assemblies to help answer important biological questions.

  9. Genome analysis and DNA marker-based characterisation of pathogenic trypanosomes

    NARCIS (Netherlands)

    Agbo, Edwin Chukwura

    2003-01-01

    The advances in genomics technologies and genome analysis methods that offer new leads for accelerating discovery of putative targets for developing overall control tools are reviewed in Chapter 1. In Chapter 2, a PCR typing method based on restriction fragment length polymorphism analysis of the

  10. Physical mapping of 20 unmapped fragments of the btau_4.0 genome assembly in cattle, sheep and river buffalo.

    Science.gov (United States)

    De Lorenzi, L; Genualdo, V; Perucatti, A; Iannuzzi, A; Iannuzzi, L; Parma, P

    2013-01-01

    The recent advances in sequencing technology and bioinformatics have revolutionized genomic research, making the decoding of the genome an easier task. Genome sequences are currently available for many species, including cattle, sheep and river buffalo. The available reference genomes are very accurate, and they represent the best possible order of loci at this time. In cattle, despite the great accuracy achieved, a part of the genome has been sequenced but not yet assembled: these genome fragments are called unmapped fragments. In the present study, 20 unmapped fragments belonging to the Btau_4.0 reference genome have been mapped by FISH in cattle (Bos taurus, 2n = 60), sheep (Ovis aries, 2n = 54) and river buffalo (Bubalus bubalis, 2n = 50). Our results confirm the accuracy of the available reference genome, though there are some discrepancies between the expected localization and the observed localization. Moreover, the available data in the literature regarding genomic homologies between cattle, sheep and river buffalo are confirmed. Finally, the results presented here suggest that FISH was, and still is, a useful technology to validate the data produced by genome sequencing programs.

  11. High-resolution genomic fingerprinting of Campylobacter jejuni and Campylobacter coli by analysis of amplified fragment length polymorphisms

    DEFF Research Database (Denmark)

    Kokotovic, Branko; On, Stephen L.W.

    1999-01-01

    A method for high-resolution genomic fingerprinting of the enteric pathogens Campylobacter jejuni and Campylobacter coli, based on the determination of amplified fragment length polymorphism, is described. The potential of this method for molecular epidemiological studies of these species is evaluated with 50 type, reference, and well-characterised field strains. Amplified fragment length polymorphism fingerprints comprised over 60 bands detected in the size range 35-500 bp. Groups of outbreak strains, replicate subcultures, and 'genetically identical' strains from humans, poultry and cattle, proved indistinguishable by amplified fragment length polymorphism fingerprinting, but were differentiated from unrelated isolates. Previously unknown relationships between three hippurate-negative C. jejuni strains, and two C. coli var. hyoilei strains, were identified. These relationships corresponded...

  12. Genomic Relatedness of Chlamydia Isolates Determined by Amplified Fragment Length Polymorphism Analysis

    OpenAIRE

    Meijer, Adam; Morré, Servaas A.; Van Den Brule, Adriaan J. C.; Savelkoul, Paul H. M.; Ossewaarde, Jacobus M.

    1999-01-01

    The genomic relatedness of 19 Chlamydia pneumoniae isolates (17 from respiratory origin and 2 from atherosclerotic origin), 21 Chlamydia trachomatis isolates (all serovars from the human biovar, an isolate from the mouse biovar, and a porcine isolate), 6 Chlamydia psittaci isolates (5 avian isolates and 1 feline isolate), and 1 Chlamydia pecorum isolate was studied by analyzing genomic amplified fragment length polymorphism (AFLP) fingerprints. The AFLP procedure was adapted from a previously...

  13. Isolation and characterization of reverse transcriptase fragments of LTR retrotransposons from the genome of Chenopodium quinoa (Amaranthaceae).

    Science.gov (United States)

    Kolano, Bozena; Bednara, Edyta; Weiss-Schneeweiss, Hanna

    2013-10-01

    High heterogeneity was observed among conserved domains of reverse transcriptase (rt) isolated from quinoa. Only one Ty1-copia rt was highly amplified. Reverse transcriptase sequences were located predominantly in the pericentromeric region of quinoa chromosomes. The heterogeneity, genomic abundance, and chromosomal distribution of reverse transcriptase (rt)-coding fragments of Ty1-copia and Ty3-gypsy long terminal repeat retrotransposons were analyzed in the Chenopodium quinoa genome. Conserved domains of the rt gene were amplified and characterized using degenerate oligonucleotide primer pairs. Sequence analyses indicated that half of Ty1-copia rt (51%) and 39% of Ty3-gypsy rt fragments contained intact reading frames. High heterogeneity among rt sequences was observed for both Ty1-copia and Ty3-gypsy rt amplicons, with Ty1-copia more heterogeneous than Ty3-gypsy. Most of the isolated rt fragments were present in the quinoa genome in low copy numbers, with only one highly amplified Ty1-copia rt sequence family. The gypsy-like RNase H fragments co-amplified with Ty1-copia-degenerate primers were shown to be highly amplified in the quinoa genome, indicating either higher abundance of some gypsy families of which rt domains could not be amplified, or independent evolution of this gypsy-region in quinoa. Both Ty1-copia and Ty3-gypsy retrotransposons were preferentially located in pericentromeric heterochromatin of quinoa chromosomes. Phylogenetic analyses of newly amplified rt fragments together with well-characterized retrotransposon families from other organisms allowed identification of major lineages of retroelements in the genome of quinoa and provided preliminary insight into their evolutionary dynamics.

  14. Barcode server: a visualization-based genome analysis system.

    Directory of Open Access Journals (Sweden)

    Fenglou Mao

    We have previously developed a computational method for representing a genome as a barcode image, which makes various genomic features visually apparent. We have demonstrated that this visual capability has made some challenging genome analysis problems relatively easy to solve. We have applied this capability to a number of challenging problems, including (a) identification of horizontally transferred genes, (b) identification of genomic islands with special properties and (c) binning of metagenomic sequences, and achieved highly encouraging results. These application results inspired us to develop this barcode-based genome analysis server for public service, which supports the following capabilities: (a) calculation of the k-mer based barcode image for a provided DNA sequence; (b) detection of sequence fragments in a given genome with distinct barcodes from those of the majority of the genome; (c) clustering of provided DNA sequences into groups having similar barcodes; and (d) homology-based search using Blast against a genome database for any selected genomic regions deemed to have interesting barcodes. The barcode server provides a job management capability, allowing processing of a large number of analysis jobs for barcode-based comparative genome analyses. The barcode server is accessible at http://csbl1.bmb.uga.edu/Barcode.
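
    One plausible reading of a "k-mer based barcode image" is a matrix with one row per genome window and one column per k-mer, where each cell holds a relative frequency that could be rendered as a grey level. The sketch below follows that reading only; the function name barcode_matrix, the window size, k = 4 and the use of non-overlapping windows are all assumptions and may differ from what the barcode server actually computes.

```python
from itertools import product


def barcode_matrix(seq: str, k: int = 4, window: int = 1000) -> list[list[float]]:
    """One row per non-overlapping window, one column per k-mer (lexicographic order).

    Hypothetical sketch: k and window are assumed defaults, not server settings.
    """
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    index = {m: c for c, m in enumerate(kmers)}
    rows = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window].upper()
        row = [0.0] * len(kmers)
        n = 0
        for i in range(len(chunk) - k + 1):
            col = index.get(chunk[i:i + k])  # None for k-mers containing N etc.
            if col is not None:
                row[col] += 1
                n += 1
        # Normalise counts to relative frequencies within the window.
        rows.append([v / n if n else 0.0 for v in row])
    return rows
```

    Rows that look unlike the majority of rows would correspond to the "distinct barcode" fragments mentioned in capability (b).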

  15. Molecular and FISH analyses of a 53-kbp intact DNA fragment inserted by biolistics in wheat (Triticum aestivum L.) genome.

    Science.gov (United States)

    Partier, A; Gay, G; Tassy, C; Beckert, M; Feuillet, C; Barret, P

    2017-10-01

    A large, 53-kbp, intact DNA fragment was inserted into the wheat (Triticum aestivum L.) genome. FISH analyses of individual transgenic events revealed multiple insertions of intact fragments. Transferring large intact DNA fragments containing clusters of resistance genes or complete metabolic pathways into the wheat genome remains a challenge. In a previous work, we showed that the use of dephosphorylated cassettes for wheat transformation enabled the production of simple integration patterns. Here, we used the same technology to produce a cassette containing a 44-kb Arabidopsis thaliana BAC, flanked by one selection gene and one reporter gene. This 53-kb linear cassette was integrated in the bread wheat (Triticum aestivum L.) genome by biolistic transformation. Our results showed that transgenic plants harboring the entire cassette were generated. The inheritability of the cassette was demonstrated in the T1 and T2 generation. Surprisingly, FISH analysis performed on T1 progeny of independent events identified double genomic insertions of intact fragments in non-homoeologous positions. Inheritability of these double insertions was demonstrated by FISH analysis of the T1 generation. Relative conclusions that can be drawn from molecular or FISH analysis are discussed along with future prospects of the engineering of large fragments for wheat transformation or genome editing.

  16. Genetic and functional properties of uncultivated thermophilic crenarchaeotes from a subsurface gold mine as revealed by analysis of genome fragments.

    Science.gov (United States)

    Nunoura, Takuro; Hirayama, Hisako; Takami, Hideto; Oida, Hanako; Nishi, Shinro; Shimamura, Shigeru; Suzuki, Yohey; Inagaki, Fumio; Takai, Ken; Nealson, Kenneth H; Horikoshi, Koki

    2005-12-01

    Within the phylum Crenarchaeota, only some members of the hyperthermophilic class Thermoprotei have been cultivated and characterized. In this study, we have constructed a metagenomic library from a microbial mat formation in a subsurface hot water stream of the Hishikari gold mine, Japan, and sequenced genome fragments of two different phylogroups of uncultivated thermophilic Crenarchaeota: (i) hot water crenarchaeotic group (HWCG) I (41.2 kb), and (ii) HWCG III (49.3 kb). The genome fragment of HWCG I contained a 16S rRNA gene, two tRNA genes and 35 genes encoding proteins but no 23S rRNA gene. Among the genes encoding proteins, several genes for putative aerobic-type carbon monoxide dehydrogenase represented a potential clue with regard to the yet unknown metabolism of HWCG I Archaea. The genome fragment of HWCG III contained a 16S/23S rRNA operon and 44 genes encoding proteins. In the 23S rRNA gene, we detected a homing-endonuclease encoding a group I intron similar to those detected in hyperthermophilic Crenarchaeota and Bacteria, as well as eukaryotic organelles. The reconstructed phylogenetic tree based on the 23S rRNA gene sequence reinforced the intermediate phylogenetic affiliation of HWCG III bridging the hyperthermophilic and non-thermophilic uncultivated Crenarchaeota.

  17. Accurate phylogenetic classification of DNA fragments based on sequence composition

    Energy Technology Data Exchange (ETDEWEB)

    McHardy, Alice C.; Garcia Martin, Hector; Tsirigos, Aristotelis; Hugenholtz, Philip; Rigoutsos, Isidore

    2006-05-01

    Metagenome studies have retrieved vast amounts of sequence out of a variety of environments, leading to novel discoveries and great insights into the uncultured microbial world. Except for very simple communities, diversity makes sequence assembly and analysis a very challenging problem. To understand the structure and function of microbial communities, a taxonomic characterization of the obtained sequence fragments is highly desirable, yet currently limited mostly to those sequences that contain phylogenetic marker genes. We show that for clades at the rank of domain down to genus, sequence composition allows the very accurate phylogenetic characterization of genomic sequence. We developed a composition-based classifier, PhyloPythia, for de novo phylogenetic sequence characterization and have trained it on a data set of 340 genomes. By extensive evaluation experiments we show that the method is accurate across all taxonomic ranks considered, even for sequences that originate from novel organisms and are as short as 1 kb. Application to two metagenome datasets obtained from samples of phosphorus-removing sludge showed that the method allows the accurate classification at genus level of most sequence fragments from the dominant populations, while at the same time correctly characterizing even larger parts of the samples at higher taxonomic levels.

  18. Endogenous hepatitis C virus homolog fragments in European rabbit and hare genomes replicate in cell culture.

    Directory of Open Access Journals (Sweden)

    Eliane Silva

    Endogenous retroviruses, non-retroviral RNA viruses and DNA viruses have been found in mammalian genomes. The origin of Hepatitis C virus (HCV), the major cause of chronic hepatitis, liver cirrhosis, and hepatocellular carcinoma in humans, remains unclear since its discovery. Here we show that fragments homologous to HCV structural and non-structural (NS) proteins present in the European rabbit (Oryctolagus cuniculus) and hare (Lepus europaeus) genomes replicate in bovine cell cultures. The HCV genomic homolog fragments were demonstrated by RT-PCR, PCR, mass spectrometry, and replication in bovine cell cultures by immunofluorescence assay (IFA) and immunogold electron microscopy (IEM) using specific MAbs for HCV NS3, NS4A, and NS5 proteins. These findings may lead to novel research approaches on the HCV origin, genesis, evolution and diversity.

  19. Knowledge-based Fragment Binding Prediction

    Science.gov (United States)

    Tang, Grace W.; Altman, Russ B.

    2014-01-01

    Target-based drug discovery must assess many drug-like compounds for potential activity. Focusing on low-molecular-weight compounds (fragments) can dramatically reduce the chemical search space. However, approaches for determining protein-fragment interactions have limitations. Experimental assays are time-consuming, expensive, and not always applicable. At the same time, computational approaches using physics-based methods have limited accuracy. With increasing high-resolution structural data for protein-ligand complexes, there is now an opportunity for data-driven approaches to fragment binding prediction. We present FragFEATURE, a machine learning approach to predict small molecule fragments preferred by a target protein structure. We first create a knowledge base of protein structural environments annotated with the small molecule substructures they bind. These substructures have low-molecular weight and serve as a proxy for fragments. FragFEATURE then compares the structural environments within a target protein to those in the knowledge base to retrieve statistically preferred fragments. It merges information across diverse ligands with shared substructures to generate predictions. Our results demonstrate FragFEATURE's ability to rediscover fragments corresponding to the ligand bound with 74% precision and 82% recall on average. For many protein targets, it identifies high scoring fragments that are substructures of known inhibitors. FragFEATURE thus predicts fragments that can serve as inputs to fragment-based drug design or serve as refinement criteria for creating target-specific compound libraries for experimental or computational screening. PMID:24762971
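
    The 74% precision and 82% recall figures above follow the usual set-based definitions, which the sketch below spells out for a single target. The function name precision_recall and the fragment labels in the usage note are made up for illustration and are not taken from FragFEATURE.

```python
def precision_recall(predicted: set[str], actual: set[str]) -> tuple[float, float]:
    """Precision and recall of predicted fragments against the bound ligand's fragments."""
    true_positives = len(predicted & actual)
    # Precision: share of predicted fragments that really occur in the ligand.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: share of the ligand's fragments that were predicted.
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall
```

    For example, predicting {"benzene", "pyridine", "amide", "phenol"} when the ligand contains {"benzene", "pyridine", "amide", "indole", "urea"} gives three true positives, so precision is 3/4 and recall is 3/5.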

  20. Genome-wide macrosynteny among Fusarium species in the Gibberella fujikuroi complex revealed by amplified fragment length polymorphisms.

    Directory of Open Access Journals (Sweden)

    Lieschen De Vos

    The Gibberella fujikuroi complex includes many Fusarium species that cause significant losses in yield and quality of agricultural and forestry crops. Due to their economic importance, whole-genome sequence information has rapidly become available for species including Fusarium circinatum, Fusarium fujikuroi and Fusarium verticillioides, each of which represents one of the three main clades known in this complex. However, no previous studies have explored the genomic commonalities and differences among these fungi. In this study, a previously completed genetic linkage map for an interspecific cross between Fusarium temperatum and F. circinatum, together with genomic sequence data, was utilized to consider the level of synteny between the three Fusarium genomes. Regions that are homologous amongst the Fusarium genomes examined were identified using in silico and pyrosequenced amplified fragment length polymorphism (AFLP) fragment analyses. Homology was determined using BLAST analysis of the sequences, with 777 homologous regions aligned to F. fujikuroi and F. verticillioides. This also made it possible to assign the linkage groups from the interspecific cross to their corresponding chromosomes in F. verticillioides and F. fujikuroi, as well as to assign two previously unmapped supercontigs of F. verticillioides to probable chromosomal locations. We further found evidence of a reciprocal translocation between the distal ends of chromosomes 8 and 11, which apparently originated before the divergence of F. circinatum and F. temperatum. Overall, a remarkable level of macrosynteny was observed among the three Fusarium genomes, when comparing AFLP fragments. This study not only demonstrates how in silico AFLPs can aid in the integration of a genetic linkage map to the physical genome, but it also highlights the benefits of using this tool to study genomic synteny and architecture.
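
    An in silico AFLP can be approximated by digesting a genome sequence computationally and keeping the fragments an AFLP reaction would amplify. The sketch below assumes the common EcoRI/MseI enzyme pair (the abstract does not say which enzymes were used), ignores selective primer nucleotides, and uses hypothetical names (ENZYMES, cut_positions, in_silico_aflp).

```python
import re

# Assumed enzymes: recognition site and cut offset within the site (top strand).
ENZYMES = {"EcoRI": ("GAATTC", 1), "MseI": ("TTAA", 1)}


def cut_positions(seq: str) -> list[tuple[int, str]]:
    """Sorted (cut position, enzyme name) pairs for every recognition site."""
    cuts = []
    for name, (site, offset) in ENZYMES.items():
        # A lookahead pattern reports overlapping sites as well.
        for match in re.finditer(f"(?={site})", seq):
            cuts.append((match.start() + offset, name))
    return sorted(cuts)


def in_silico_aflp(seq: str) -> list[int]:
    """Lengths of fragments bounded by one EcoRI cut and one MseI cut."""
    cuts = cut_positions(seq.upper())
    return [
        right - left
        for (left, first), (right, second) in zip(cuts, cuts[1:])
        if first != second  # keep only fragments with two different ends
    ]
```

    Predicted fragment lengths from each genome could then be matched against observed AFLP markers on a linkage map, which is the kind of integration the abstract describes.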

  1. Distant homology between yeast photoreactivating gene fragment and human genomic digests

    International Nuclear Information System (INIS)

    Meechan, P.J.; Milam, K.M.; Cleaver, J.E.

    1985-01-01

    Hybridization of DNA coding for the yeast DNA photolyase to human genomic DNA appears to allow one to determine whether a conserved enzyme is coded for in human cells. Under stringent conditions (68 °C), hybridization is not found between the cloned yeast fragment (YEp13-phr1) and human or chick genomic digests. At less stringent conditions (60 °C), hybridization is observed with chick digests, indicating evolutionary divergence even among organisms capable of photo-reactivation. At 50 °C, weak hybridization with human digests was observed, indicating further divergence from the cloned gene. Data concerning the precise extent of homology and methods to clone the chick gene for use as another probe are discussed.

  2. Fragment-based lead generation: identification of seed fragments by a highly efficient fragment screening technology

    Science.gov (United States)

    Neumann, Lars; Ritscher, Allegra; Müller, Gerhard; Hafenbradl, Doris

    2009-08-01

    For the detection of the precise and unambiguous binding of fragments to a specific binding site on the target protein, we have developed a novel reporter displacement binding assay technology. The application of this technology to fragment screening, as well as to the fragment evolution process with a specific modelling-based design strategy, is demonstrated for inhibitors of the protein kinase p38alpha. In a fragment screening approach, seed fragments were identified which were then used to build compounds from the deep pocket towards the hinge binding area of the protein kinase p38alpha based on a modelling approach. BIRB796 was used as a blueprint for the alignment of the fragments. The fragment evolution of these deep-pocket binding fragments towards the fully optimized inhibitor BIRB796 included the modulation of the residence time as well as the affinity. The goal of our study was to evaluate the robustness and efficiency of our novel fragment screening technology at high fragment concentrations, to compare the screening data with biochemical activity data, and to demonstrate in silico the evolution of hit fragments with fast kinetics into slow-kinetics inhibitors.

  3. Missing Fragments: Detecting Cooperative Binding in Fragment-Based Drug Design

    Science.gov (United States)

    2012-01-01

    The aim of fragment-based drug design (FBDD) is to identify molecular fragments that bind to alternate subsites within a given binding pocket leading to cooperative binding when linked. In this study, the binding of fragments to human phenylethanolamine N-methyltransferase is used to illustrate how (a) current protocols may fail to detect fragments that bind cooperatively, (b) theoretical approaches can be used to validate potential hits, and (c) apparent false positives obtained when screening against cocktails of fragments may in fact indicate promising leads. PMID:24900472

  4. Restriction site extension PCR: a novel method for high-throughput characterization of tagged DNA fragments and genome walking.

    Directory of Open Access Journals (Sweden)

    Jiabing Ji

    Full Text Available BACKGROUND: Insertion mutant isolation and characterization are extremely valuable for linking genes to physiological function. Once an insertion mutant phenotype is identified, the challenge is to isolate the responsible gene. Multiple strategies have been employed to isolate unknown genomic DNA that flanks mutagenic insertions; however, all these methods suffer from limitations due to inefficient ligation steps, inclusion of restriction sites within the target DNA, and non-specific product generation. These limitations become close to insurmountable when the goal is to identify insertion sites in a high-throughput manner. METHODOLOGY/PRINCIPAL FINDINGS: We designed a novel strategy called Restriction Site Extension PCR (RSE-PCR) to efficiently conduct large-scale isolation of unknown genomic DNA fragments linked to DNA insertions. The strategy is a modified adaptor-mediated PCR without ligation. An adapter, with complementarity to the 3' overhang of the endonuclease (KpnI, NsiI, PstI, or SacI) restricted DNA fragments, extends the 3' end of the DNA fragments in the first cycle of the primary RSE-PCR. During subsequent PCR cycles and a second semi-nested PCR (secondary RSE-PCR), touchdown and two-step PCR are combined to increase the amplification specificity of target fragments. The efficiency and specificity were demonstrated in our characterization of 37 tex mutants of Arabidopsis. All the steps of RSE-PCR can be executed in a 96-well PCR plate. Finally, RSE-PCR serves as a successful alternative to Genome Walker as demonstrated by gene isolation from maize, a plant with a more complex genome than Arabidopsis. CONCLUSIONS/SIGNIFICANCE: RSE-PCR has high potential application in identifying tagged (T-DNA or transposon) sequence or walking from known DNA toward unknown regions in large-genome plants, with likely application in other organisms as well.

  5. Replication-Coupled PCNA Unloading by the Elg1 Complex Occurs Genome-wide and Requires Okazaki Fragment Ligation

    Directory of Open Access Journals (Sweden)

    Takashi Kubota

    2015-08-01

    Full Text Available The sliding clamp PCNA is a crucial component of the DNA replication machinery. Timely PCNA loading and unloading are central for genome integrity and must be strictly coordinated with other DNA processing steps during replication. Here, we show that the S. cerevisiae Elg1 replication factor C-like complex (Elg1-RLC) unloads PCNA genome-wide following Okazaki fragment ligation. In the absence of Elg1, PCNA is retained on chromosomes in the wake of replication forks, rather than at specific sites. Degradation of the Okazaki fragment ligase Cdc9 leads to PCNA accumulation on chromatin, similar to the accumulation caused by lack of Elg1. We demonstrate that Okazaki fragment ligation is the critical prerequisite for PCNA unloading, since Chlorella virus DNA ligase can substitute for Cdc9 in yeast and simultaneously promotes PCNA unloading. Our results suggest that Elg1-RLC acts as a general PCNA unloader and is dependent upon DNA ligation during chromosome replication.

  6. Fragment-based drug design.

    Science.gov (United States)

    Feyfant, Eric; Cross, Jason B; Paris, Kevin; Tsao, Désirée H H

    2011-01-01

    Fragment-based drug design (FBDD), which comprises both fragment screening and the use of fragment hits to design leads, began more than 15 years ago and has been steadily gaining in popularity and utility. Its origin lies in the fact that the coverage of chemical space and the binding efficiency of hits are directly related to the size of the compounds screened. Nevertheless, FBDD still faces challenges, among them developing fragment screening libraries that ensure optimal coverage of chemical space, physical properties and chemical tractability. Fragment screening also requires sensitive assays, often biophysical in nature, to detect weak binders. In this chapter we will introduce the technologies used to address these challenges and outline the experimental advantages that make FBDD one of the most popular new hit-to-lead processes.

  7. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments.

    Science.gov (United States)

    Dabney, Jesse; Knapp, Michael; Glocke, Isabelle; Gansauge, Marie-Theres; Weihmann, Antje; Nickel, Birgit; Valdiosera, Cristina; García, Nuria; Pääbo, Svante; Arsuaga, Juan-Luis; Meyer, Matthias

    2013-09-24

    Although an inverse relationship is expected in ancient DNA samples between the number of surviving DNA fragments and their length, ancient DNA sequencing libraries are strikingly deficient in molecules shorter than 40 bp. We find that a loss of short molecules can occur during DNA extraction and present an improved silica-based extraction protocol that enables their efficient retrieval. In combination with single-stranded DNA library preparation, this method enabled us to reconstruct the mitochondrial genome sequence from a Middle Pleistocene cave bear (Ursus deningeri) bone excavated at Sima de los Huesos in the Sierra de Atapuerca, Spain. Phylogenetic reconstructions indicate that the U. deningeri sequence forms an early diverging sister lineage to all Western European Late Pleistocene cave bears. Our results prove that authentic ancient DNA can be preserved for hundreds of thousands of years outside of permafrost. Moreover, the techniques presented enable the retrieval of phylogenetically informative sequences from samples in which virtually all DNA is diminished to fragments shorter than 50 bp.

  8. Virtual fragment preparation for computational fragment-based drug design.

    Science.gov (United States)

    Ludington, Jennifer L

    2015-01-01

    Fragment-based drug design (FBDD) has become an important component of the drug discovery process. The use of fragments can accelerate both the search for a hit molecule and the development of that hit into a lead molecule for clinical testing. In addition to experimental methodologies for FBDD such as NMR and X-ray crystallography screens, computational techniques are playing an increasingly important role. The success of the computational simulations is due in large part to how the database of virtual fragments is prepared. In order to prepare the fragments appropriately it is necessary to understand how FBDD differs from other approaches and the issues inherent in building up molecules from smaller fragment pieces. The ultimate goal of these calculations is to link two or more simulated fragments into a molecule that has an experimental binding affinity consistent with the additive predicted binding affinities of the virtual fragments. Computationally predicting binding affinities is a complex process, with many opportunities for introducing error. Therefore, care should be taken with the fragment preparation procedure to avoid introducing additional inaccuracies. This chapter is focused on the preparation process used to create a virtual fragment database. Several key issues of fragment preparation which affect the accuracy of binding affinity predictions are discussed. The first issue is the selection of the two-dimensional atomic structure of the virtual fragment. Although the particular usage of the fragment can affect this choice (i.e., whether the fragment will be used for calibration, binding site characterization, hit identification, or lead optimization), general factors such as synthetic accessibility, size, and flexibility are major considerations in selecting the 2D structure. Other aspects of preparing the virtual fragments for simulation are the generation of three-dimensional conformations and the assignment of the associated atomic point charges.

  9. Fragment-based quantitative structure-activity relationship (FB-QSAR) for fragment-based drug design.

    Science.gov (United States)

    Du, Qi-Shi; Huang, Ri-Bo; Wei, Yu-Tuo; Pang, Zong-Wen; Du, Li-Qin; Chou, Kuo-Chen

    2009-01-30

    In cooperation with the fragment-based design, a new drug design method, the so-called "fragment-based quantitative structure-activity relationship" (FB-QSAR), is proposed. The essence of the new method is that the molecular framework in a family of drug candidates is divided into several fragments according to the substituents being investigated. The bioactivities of molecules are correlated with the physicochemical properties of the molecular fragments through two sets of coefficients in the linear free energy equations. One coefficient set is for the physicochemical properties and the other for the weight factors of the molecular fragments. Meanwhile, an iterative double least square (IDLS) technique is developed to solve the two sets of coefficients in a training data set alternately and iteratively. The IDLS technique is a feedback procedure with machine learning ability. The standard two-dimensional quantitative structure-activity relationship (2D-QSAR) is a special case of FB-QSAR in which the whole molecule is treated as one entity. The FB-QSAR approach can remarkably enhance the predictive power and provide more structural insights into rational drug design. As an example, the FB-QSAR is applied to build a predictive model of neuraminidase inhibitors for drug development against H5N1 influenza virus. (c) 2008 Wiley Periodicals, Inc.

  10. Non-functional plastid ndh gene fragments are present in the nuclear genome of Norway spruce (Picea abies L. Karsch): insights from in silico analysis of nuclear and organellar genomes.

    Science.gov (United States)

    Ranade, Sonali Sachin; García-Gil, María Rosario; Rosselló, Josep A

    2016-04-01

    Many genes have been lost from the prokaryote plastidial genome during the early events of endosymbiosis in eukaryotes. Some of them were definitively lost, but others were relocated and functionally integrated to the host nuclear genomes through serial events of gene transfer during plant evolution. In gymnosperms, plastid genome sequencing has revealed the loss of ndh genes from several species of Gnetales and Pinaceae, including Norway spruce (Picea abies). This study aims to trace the ndh genes in the nuclear and organellar Norway spruce genomes. The plastid genomes of higher plants contain 11 ndh genes which are homologues of mitochondrial genes encoding subunits of the proton-pumping NADH-dehydrogenase (nicotinamide adenine dinucleotide dehydrogenase) or complex I (electron transport chain). Ndh genes encode 11 NDH polypeptides forming the Ndh complex (analogous to complex I) which seems to be primarily involved in chloro-respiration processes. We considered ndh genes from the plastidial genome of four gymnosperms (Cryptomeria japonica, Cycas revoluta, Ginkgo biloba, Podocarpus totara) and a single angiosperm species (Arabidopsis thaliana) to trace putative homologs in the nuclear and organellar Norway spruce genomes using tBLASTn to assess the evolutionary fate of ndh genes in Norway spruce and to address their genomic location(s), structure, integrity and functionality. The results obtained from tBLASTn were subsequently analyzed by performing homology search for finding ndh specific conserved domains using conserved domain search. We report the presence of non-functional plastid ndh gene fragments, excepting ndhE and ndhG genes, in the nuclear genome of Norway spruce. Regulatory transcriptional elements like promoters, TATA boxes and enhancers were detected in the upstream regions of some ndh fragments. We also found transposable elements in the flanking regions of few ndh fragments suggesting nuclear rearrangements in those regions. These evidences

  11. Genome puzzle master (GPM): an integrated pipeline for building and editing pseudomolecules from fragmented sequences.

    Science.gov (United States)

    Zhang, Jianwei; Kudrna, Dave; Mu, Ting; Li, Weiming; Copetti, Dario; Yu, Yeisoo; Goicoechea, Jose Luis; Lei, Yang; Wing, Rod A

    2016-10-15

    Next generation sequencing technologies have revolutionized our ability to rapidly and affordably generate vast quantities of sequence data. Once generated, raw sequences are assembled into contigs or scaffolds. However, these assemblies are mostly fragmented and inaccurate at the whole genome scale, largely due to the inability to integrate additional informative datasets (e.g. physical, optical and genetic maps). To address this problem, we developed a semi-automated software tool, Genome Puzzle Master (GPM), that enables the integration of additional genomic signposts to edit and build 'new-gen-assemblies' that result in high-quality 'annotation-ready' pseudomolecules. With GPM, loaded datasets can be connected to each other via their logical relationships, which accomplishes tasks to 'group,' 'merge,' 'order and orient' sequences in a draft assembly. Manual editing can also be performed with a user-friendly graphical interface. Final pseudomolecules reflect a user's total data package and are available for long-term project management. GPM is a web-based pipeline and an important part of a Laboratory Information Management System (LIMS) which can be easily deployed on local servers for any genome research laboratory. The GPM (with LIMS) package is available at https://github.com/Jianwei-Zhang/LIMS. CONTACTS: jzhang@mail.hzau.edu.cn or rwing@mail.arizona.edu. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  12. Introduction to fragment-based drug discovery.

    Science.gov (United States)

    Erlanson, Daniel A

    2012-01-01

    Fragment-based drug discovery (FBDD) has emerged in the past decade as a powerful tool for discovering drug leads. The approach first identifies starting points: very small molecules (fragments) that are about half the size of typical drugs. These fragments are then expanded or linked together to generate drug leads. Although the origins of the technique date back some 30 years, it was only in the mid-1990s that experimental techniques became sufficiently sensitive and rapid for the concept to become practical. Since that time, the field has exploded: FBDD has played a role in discovery of at least 18 drugs that have entered the clinic, and practitioners of FBDD can be found throughout the world in both academia and industry. Literally dozens of reviews have been published on various aspects of FBDD or on the field as a whole, as have three books (Jahnke and Erlanson, Fragment-based approaches in drug discovery, 2006; Zartler and Shapiro, Fragment-based drug discovery: a practical approach, 2008; Kuo, Fragment based drug design: tools, practical approaches, and examples, 2011). However, this chapter will assume that the reader is approaching the field with little prior knowledge. It will introduce some of the key concepts, set the stage for the chapters to follow, and demonstrate how X-ray crystallography plays a central role in fragment identification and advancement.

  13. An efficient approach to BAC based assembly of complex genomes.

    Science.gov (United States)

    Visendi, Paul; Berkman, Paul J; Hayashi, Satomi; Golicz, Agnieszka A; Bayer, Philipp E; Ruperao, Pradeep; Hurgobin, Bhavna; Montenegro, Juan; Chan, Chon-Kit Kenneth; Staňková, Helena; Batley, Jacqueline; Šimková, Hana; Doležel, Jaroslav; Edwards, David

    2016-01-01

    There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data, which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate 'gold' reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, with variable success. We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes. We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes.

  14. Fragment-based approaches to TB drugs.

    Science.gov (United States)

    Marchetti, Chiara; Chan, Daniel S H; Coyne, Anthony G; Abell, Chris

    2018-02-01

    Tuberculosis is an infectious disease associated with significant mortality and morbidity worldwide, particularly in developing countries. The rise of antibiotic resistance in Mycobacterium tuberculosis (Mtb) urgently demands the development of new drug leads to tackle resistant strains. Fragment-based methods have recently emerged at the forefront of pharmaceutical development as a means to generate more effective lead structures, via the identification of fragment molecules that form weak but high quality interactions with the target biomolecule and subsequent fragment optimization. This review highlights a number of novel inhibitors of Mtb targets that have been developed through fragment-based approaches in recent years.

  15. Fragment informatics and computational fragment-based drug design: an overview and update.

    Science.gov (United States)

    Sheng, Chunquan; Zhang, Wannian

    2013-05-01

    Fragment-based drug design (FBDD) is a promising approach for the discovery and optimization of lead compounds. Despite its successes, FBDD also faces some internal limitations and challenges. FBDD requires a high quality of target protein and good solubility of fragments. Biophysical techniques for fragment screening necessitate expensive detection equipment and the strategies for evolving fragment hits to leads remain to be improved. Regardless, FBDD is necessary for investigating larger chemical space and can be applied to challenging biological targets. In this scenario, cheminformatics and computational chemistry can be used as alternative approaches that can significantly improve the efficiency and success rate of lead discovery and optimization. Cheminformatics and computational tools assist FBDD in a very flexible manner. Computational FBDD can be used independently or in parallel with experimental FBDD for efficiently generating and optimizing leads. Computational FBDD can also be integrated into each step of experimental FBDD and help to play a synergistic role by maximizing its performance. This review will provide critical analysis of the complementarity between computational and experimental FBDD and highlight recent advances in new algorithms and successful examples of their applications. In particular, fragment-based cheminformatics tools, high-throughput fragment docking, and fragment-based de novo drug design will provide the focus of this review. We will also discuss the advantages and limitations of different methods and the trends in new developments that should inspire future research. © 2012 Wiley Periodicals, Inc.

  16. Large scale meta-analysis of fragment-based screening campaigns: privileged fragments and complementary technologies.

    Science.gov (United States)

    Kutchukian, Peter S; Wassermann, Anne Mai; Lindvall, Mika K; Wright, S Kirk; Ottl, Johannes; Jacob, Jaison; Scheufler, Clemens; Marzinzik, Andreas; Brooijmans, Natasja; Glick, Meir

    2015-06-01

    A first step in fragment-based drug discovery (FBDD) often entails a fragment-based screen (FBS) to identify fragment "hits." However, the integration of conflicting results from orthogonal screens remains a challenge. Here we present a meta-analysis of 35 fragment-based campaigns at Novartis, which employed a generic 1400-fragment library against diverse target families using various biophysical and biochemical techniques. By statistically interrogating the multidimensional FBS data, we sought to investigate three questions: (1) What makes a fragment amenable for FBS? (2) How do hits from different fragment screening technologies and target classes compare with each other? (3) What is the best way to pair FBS assay technologies? In doing so, we identified substructures that were privileged for specific target classes, as well as fragments that were privileged for authentic activity against many targets. We also revealed some of the discrepancies between technologies. Finally, we uncovered a simple rule of thumb in screening strategy: when choosing two technologies for a campaign, pairing a biochemical and biophysical screen tends to yield the greatest coverage of authentic hits. © 2014 Society for Laboratory Automation and Screening.
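
    The rule of thumb in this abstract (choose the pair of technologies with the greatest coverage of authentic hits) can be illustrated with a small sketch. The abstract does not give a formula, so defining coverage as the fraction of authentic hits found by at least one technology of the pair is an assumption, and the function name and inputs are hypothetical.

```python
from itertools import combinations


def best_pair(hits_by_tech, authentic):
    """Return (coverage, (tech_a, tech_b)) for the best-covering pair.

    hits_by_tech maps a technology name to the set of fragment IDs it flagged;
    authentic is the set of fragment IDs regarded as true hits.
    Coverage is an assumed definition: the fraction of authentic hits that
    appear in the union of the two technologies' hit sets.
    """
    best = None
    for a, b in combinations(sorted(hits_by_tech), 2):
        covered = (hits_by_tech[a] | hits_by_tech[b]) & authentic
        coverage = len(covered) / len(authentic)
        if best is None or coverage > best[0]:
            best = (coverage, (a, b))
    return best
```

    With per-campaign hit lists in this form, the same function could be applied campaign by campaign and the winning pairs tallied; the data needed for that are not part of this record.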

  17. Assembly of the Complete Sitka Spruce Chloroplast Genome Using 10X Genomics' GemCode Sequencing Data.

    Directory of Open Access Journals (Sweden)

    Lauren Coombe

    Full Text Available The linked read sequencing library preparation platform by 10X Genomics produces barcoded sequencing libraries, which are subsequently sequenced using the Illumina short read sequencing technology. In this new approach, long fragments of DNA are partitioned into separate micro-reactions, where the same index sequence is incorporated into each of the sequencing fragment inserts derived from a given long fragment. In this study, we exploited this property by using reads from index sequences associated with a large number of reads, to assemble the chloroplast genome of the Sitka spruce tree (Picea sitchensis). Here we report on the first Sitka spruce chloroplast genome assembled exclusively from P. sitchensis genomic libraries prepared using the 10X Genomics protocol. We show that the resulting 124,049 base pair long genome shares high sequence similarity with the related white spruce and Norway spruce chloroplast genomes, but diverges substantially from a previously published P. sitchensis-P. thunbergii chimeric genome. The use of reads from high-frequency indices enabled separation of the nuclear genome reads from that of the chloroplast, which resulted in the simplification of the de Bruijn graphs used at the various stages of assembly.
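
    The read-selection idea described here (keep only reads whose index sequence is associated with a large number of reads) might be sketched as follows. The abstract does not state a threshold or a data format, so the (barcode, sequence) tuples, the `min_reads` parameter, and the function name are assumptions made for illustration.

```python
from collections import Counter


def reads_from_frequent_barcodes(reads, min_reads):
    """Keep (barcode, sequence) reads whose barcode occurs at least min_reads times.

    min_reads is a hypothetical cutoff; the abstract does not state one.
    """
    reads = list(reads)
    counts = Counter(barcode for barcode, _ in reads)
    return [(barcode, seq) for barcode, seq in reads if counts[barcode] >= min_reads]
```

    The retained reads would then go to an assembler; that step, and the choice of cutoff, depend on details not given in the abstract.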

  18. High Efficiency Hydrodynamic DNA Fragmentation in a Bubbling System.

    Science.gov (United States)

    Li, Lanhui; Jin, Mingliang; Sun, Chenglong; Wang, Xiaoxue; Xie, Shuting; Zhou, Guofu; van den Berg, Albert; Eijkel, Jan C T; Shui, Lingling

    2017-01-18

    DNA fragmentation down to a precise fragment size is important for biomedical applications, disease determination, gene therapy and shotgun sequencing. In this work, a cheap, easy to operate and high efficiency DNA fragmentation method is demonstrated based on hydrodynamic shearing in a bubbling system. We expect that hydrodynamic forces generated during the bubbling process shear the DNA molecules, extending and breaking them at the points where shearing forces are larger than the strength of the phosphate backbone. Factors of applied pressure, bubbling time and temperature have been investigated. Genomic DNA could be fragmented down to controllable 1-10 Kbp fragment lengths with a yield of 75.30-91.60%. We demonstrate that the ends of the genomic DNAs generated from hydrodynamic shearing can be ligated by T4 ligase and the fragmented DNAs can be used as templates for polymerase chain reaction. Therefore, in the bubbling system, DNAs could be hydrodynamically sheared to achieve smaller pieces in dsDNAs available for further processes. It could potentially serve as a DNA sample pretreatment technique in the future.

  19. Genomic diversity among Danish field strains of Mycoplasma hyosynoviae assessed by amplified fragment length polymorphism analysis

    DEFF Research Database (Denmark)

    Kokotovic, Branko; Friis, Niels F.; Nielsen, Elisabeth O.

    2002-01-01

    Genomic diversity among strains of Mycoplasma hyosynoviae isolated in Denmark was assessed by using amplified fragment length polymorphism (AFLP) analysis. Ninety-six strains, obtained from different specimens and geographical locations during 30 years and the type strain of M. hyosynoviae S16(T......) were concurrently examined for variance in BglII-MfeI and EcoRI-Csp6I-A AFLP markers. A total of 56 different genomic fingerprints having an overall similarity between 77 and 96% were detected. No correlation between AFLP variability and period of isolation or anatomical site of isolation could...

  20. Modeling the integration of bacterial rRNA fragments into the human cancer genome.

    Science.gov (United States)

    Sieber, Karsten B; Gajer, Pawel; Dunning Hotopp, Julie C

    2016-03-21

    Cancer is a disease driven by the accumulation of genomic alterations, including the integration of exogenous DNA into the human somatic genome. We previously identified in silico evidence of DNA fragments from a Pseudomonas-like bacteria integrating into the 5'-UTR of four proto-oncogenes in stomach cancer sequencing data. The functional and biological consequences of these bacterial DNA integrations remain unknown. Modeling of these integrations suggests that the previously identified sequences cover most of the sequence flanking the junction between the bacterial and human DNA. Further examination of these reads reveals that these integrations are rich in guanine nucleotides and the integrated bacterial DNA may have complex transcript secondary structures. The models presented here lay the foundation for future experiments to test if bacterial DNA integrations alter the transcription of the human genes.

  1. Fragment approaches in structure-based drug discovery

    International Nuclear Information System (INIS)

    Hubbard, Roderick E.

    2008-01-01

    Fragment-based methods are successfully generating novel and selective drug-like inhibitors of protein targets, with a number of groups reporting compounds entering clinical trials. This paper summarizes the key features of the approach as one of the tools in structure-guided drug discovery. There has been considerable interest recently in what is known as 'fragment-based lead discovery'. The novel feature of the approach is to begin with small low-affinity compounds. The main advantage is that a larger potential chemical diversity can be sampled with fewer compounds, which is particularly important for new target classes. The approach relies on careful design of the fragment library, a method that can detect binding of the fragment to the protein target, determination of the structure of the fragment bound to the target, and the conventional use of structural information to guide compound optimization. In this article the methods are reviewed, and experiences in fragment-based discovery of lead series of compounds against kinases such as PDK1 and ATPases such as Hsp90 are discussed. The examples illustrate some of the key benefits and issues of the approach and also provide anecdotal examples of the patterns seen in selectivity and the binding mode of fragments across different protein targets

  2. Fragment-based discovery of a potent NAMPT inhibitor.

    Science.gov (United States)

    Korepanova, Alla; Longenecker, Kenton L; Pratt, Steve D; Panchal, Sanjay C; Clark, Richard F; Lake, Marc; Gopalakrishnan, Sujatha M; Raich, Diana; Sun, Chaohong; Petros, Andrew M

    2017-12-12

    NAMPT expression is elevated in many cancers, making this protein a potential target for anticancer therapy. We have carried out both NMR based and TR-FRET based fragment screens against human NAMPT and identified six novel binders with a range of potencies. Co-crystal structures were obtained for two of the fragments bound to NAMPT while for the other four fragments force-field driven docking was employed to generate a bound pose. Based on structural insights arising from comparison of the bound fragment poses to that of bound FK866 we were able to synthetically elaborate one of the fragments into a potent NAMPT inhibitor. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. Efficient clustering aggregation based on data fragments.

    Science.gov (United States)

    Wu, Ou; Hu, Weiming; Maybank, Stephen J; Zhu, Mingliang; Li, Bing

    2012-06-01

    Clustering aggregation, known as clustering ensembles, has emerged as a powerful technique for combining different clustering results to obtain a single better clustering. Existing clustering aggregation algorithms are applied directly to data points, in what is referred to as the point-based approach. The algorithms are inefficient if the number of data points is large. We define an efficient approach for clustering aggregation based on data fragments. In this fragment-based approach, a data fragment is any subset of the data that is not split by any of the clustering results. To establish the theoretical bases of the proposed approach, we prove that clustering aggregation can be performed directly on data fragments under two widely used goodness measures for clustering aggregation taken from the literature. Three new clustering aggregation algorithms are described. The experimental results obtained using several public data sets show that the new algorithms have lower computational complexity than three well-known existing point-based clustering aggregation algorithms (Agglomerative, Furthest, and LocalSearch); nevertheless, the new algorithms do not sacrifice the accuracy.
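
    The definition of a data fragment given in this abstract (a subset of the data that no input clustering splits) can be made concrete with a short sketch. It computes the maximal fragments by grouping points that carry the same label in every clustering; the function name and input layout are hypothetical, and the aggregation algorithms themselves are not reproduced here.

```python
from collections import defaultdict


def data_fragments(clusterings):
    """Group point indices that share the same label in every clustering.

    clusterings is a list of label lists, one per clustering, all of equal
    length. Each returned group is a maximal data fragment: no input
    clustering separates any two of its points.
    """
    groups = defaultdict(list)
    for index, signature in enumerate(zip(*clusterings)):
        groups[signature].append(index)
    return list(groups.values())
```

    An aggregation algorithm can then treat each fragment as a single weighted item instead of iterating over individual points, which is the source of the efficiency gain the abstract reports when fragments are far fewer than points.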

  4. A Survey of 6,300 Genomic Fragments for cis-Regulatory Activity in the Imaginal Discs of Drosophila melanogaster

    Directory of Open Access Journals (Sweden)

    Aurélie Jory

    2012-10-01

    Over 6,000 fragments from the genome of Drosophila melanogaster were analyzed for their ability to drive expression of GAL4 reporter genes in the third-instar larval imaginal discs. About 1,200 reporter genes drove expression in the eye, antenna, leg, wing, haltere, or genital imaginal discs. The patterns ranged from large regions to individual cells. About 75% of the active fragments drove expression in multiple discs; 20% were expressed in ventral, but not dorsal, discs (legs, genital, and antenna), whereas ∼23% were expressed in dorsal but not ventral discs (wing, haltere, and eye). Several patterns, for example, within the leg chordotonal organ, appeared a surprisingly large number of times. Unbiased searches for DNA sequence motifs suggest candidate transcription factors that may regulate enhancers with shared activities. Together, these expression patterns provide a valuable resource to the community and offer a broad overview of how transcriptional regulatory information is distributed in the Drosophila genome.

  5. Gene prediction in metagenomic fragments: A large scale machine learning approach

    Directory of Open Access Journals (Sweden)

    Morgenstern Burkhard

    2008-04-01

    Background: Metagenomics is an approach to the characterization of microbial genomes via the direct isolation of genomic sequences from the environment without prior cultivation. The amount of metagenomic sequence data is growing fast while computational methods for metagenome analysis are still in their infancy. In contrast to genomic sequences of single species, which can usually be assembled and analyzed by many available methods, a large proportion of metagenome data remains as unassembled anonymous sequencing reads. One of the aims of all metagenomic sequencing projects is the identification of novel genes. The short length of the fragments (Sanger sequencing, for example, yields fragments of 700 bp on average) and the unknown phylogenetic origin of most fragments require approaches to gene prediction that are different from the currently available methods for genomes of single species. In particular, the large size of metagenomic samples requires fast and accurate methods with small numbers of false positive predictions. Results: We introduce a novel gene prediction algorithm for metagenomic fragments based on a two-stage machine learning approach. In the first stage, we use linear discriminants for monocodon usage, dicodon usage and translation initiation sites to extract features from DNA sequences. In the second stage, an artificial neural network combines these features with open reading frame length and fragment GC-content to compute the probability that this open reading frame encodes a protein. This probability is used for the classification and scoring of gene candidates. With large scale training, our method provides fast single fragment predictions with good sensitivity and specificity on artificially fragmented genomic DNA. Additionally, this method is able to predict translation initiation sites accurately and distinguishes complete from incomplete genes with high reliability. Conclusion: Large scale machine learning methods are well-suited for gene

  6. Fragmentation based

    Directory of Open Access Journals (Sweden)

    Shashank Srivastava

    2014-01-01

    Building on an understanding of mobile agent architecture and its security concerns, this paper proposes a security protocol that addresses security at a reduced computational cost. The protocol combines self-decryption, co-operation, and obfuscation techniques. To circumvent the risk of malicious code execution in an attacking environment, we propose a fragmentation-based encryption technique. The technique suits the general mobile agent size and provides strong obfuscation that increases the attacker's challenge, while also performing better in computational cost than existing AES encryption.

  7. [Fragment-based drug discovery: concept and aim].

    Science.gov (United States)

    Tanaka, Daisuke

    2010-03-01

    Fragment-Based Drug Discovery (FBDD) has been recognized as a newly emerging lead discovery methodology that involves biophysical fragment screening and chemistry-driven fragment-to-lead stages. Although fragments, defined as structurally simple and small compounds (typically FBDD primarily turns our attention to weakly but specifically binding fragments (hit fragments) as the starting point of medicinal chemistry. Hit fragments are then promoted to more potent lead compounds through linking or merging with another hit fragment and/or attaching functional groups. Another positive aspect of FBDD is ligand efficiency. Ligand efficiency is a useful guide in screening hit selection and hit-to-lead phases to achieve lead-likeness. Owing to these features, a number of successful applications of FBDD to "undruggable targets" (where HTS and other lead identification methods failed to identify useful lead compounds) have been reported. As a result, FBDD is now expected to complement more conventional methodologies. This review, as an introduction of the following articles, will summarize the fundamental concepts of FBDD and will discuss its advantages over other conventional drug discovery approaches.
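
    The abstract does not define ligand efficiency. A widely used definition is the binding free energy per non-hydrogen (heavy) atom, LE = -ΔG / N_heavy with ΔG = RT ln(Kd). The sketch below assumes that definition, a temperature of 298.15 K, and a hypothetical fragment (Kd = 1 mM, 12 heavy atoms); none of these values comes from the review.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def ligand_efficiency(kd_molar, heavy_atoms, temperature=298.15):
    """Return ligand efficiency in kcal/mol per heavy atom.

    Uses delta_G = R * T * ln(Kd), which is negative for Kd below 1 M,
    and divides its magnitude by the heavy-atom count.
    """
    if kd_molar <= 0 or heavy_atoms <= 0:
        raise ValueError("Kd and heavy-atom count must be positive")
    delta_g = R_KCAL * temperature * math.log(kd_molar)
    return -delta_g / heavy_atoms


# Hypothetical weakly binding fragment: Kd = 1 mM, 12 heavy atoms.
le_fragment = ligand_efficiency(1e-3, 12)
# Hypothetical larger hit: Kd = 1 uM, 35 heavy atoms.
le_hit = ligand_efficiency(1e-6, 35)
print(round(le_fragment, 2), round(le_hit, 2))
```

    In this toy comparison the millimolar fragment has the higher efficiency per atom even though its absolute affinity is a thousand times weaker, which is the sense in which weak but specific fragments can be attractive starting points.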

  8. NMR screening in fragment-based drug design: a practical guide.

    Science.gov (United States)

    Kim, Hai-Young; Wyss, Daniel F

    2015-01-01

    Fragment-based drug design (FBDD) comprises both fragment-based screening (FBS) to find hits and elaboration of these hits to lead compounds. Typical fragment hits have lower molecular weight (FBDD since it identifies and localizes the binding site of weakly interacting hits on the target protein. Here we describe ligand-based NMR methods for hit identification from fragment libraries and for functional cross-validation of primary hits.

  9. Amplified-fragment length polymorphism fingerprinting of Mycoplasma species

    DEFF Research Database (Denmark)

    Kokotovic, Branko; Friis, N.F.; Jensen, J.S.

    1999-01-01

    Amplified-fragment length polymorphism (AFLP) is a whole-genome fingerprinting method based on selective amplification of restriction fragments. The potential of the method for the characterization of mycoplasmas was investigated in a total of 50 strains of human and animal origin, including...... Mycoplasma genitalium (n = 11), Mycoplasma pneumoniae (n = 5), Mycoplasma hominis (n = 5), Mycoplasma hyopneumoniae (n = 9), Mycoplasma flocculare (n = 5), Mycoplasma hyosynoviae (n = 10), and Mycoplasma dispar (n = 5). AFLP templates were prepared by the digestion of mycoplasmal DNA with BglII and Mfe...... to discriminate the analyzed strains at species and intraspecies levels as well. Each of the tested Mycoplasma species developed a banding pattern entirely different from those obtained from other species under analysis. Subtle intraspecies genomic differences were detected among strains of all of the Mycoplasma...

  10. Experiences in fragment-based drug discovery.

    Science.gov (United States)

    Murray, Christopher W; Verdonk, Marcel L; Rees, David C

    2012-05-01

    Fragment-based drug discovery (FBDD) has become established in both industry and academia as an alternative approach to high-throughput screening for the generation of chemical leads for drug targets. In FBDD, specialised detection methods are used to identify small chemical compounds (fragments) that bind to the drug target, and structural biology is usually employed to establish their binding mode and to facilitate their optimisation. In this article, we present three recent and successful case histories in FBDD. We then re-examine the key concepts and challenges of FBDD with particular emphasis on recent literature and our own experience from a substantial number of FBDD applications. Our opinion is that careful application of FBDD is living up to its promise of delivering high quality leads with good physical properties and that in future many drug molecules will be derived from fragment-based approaches. Copyright © 2012 Elsevier Ltd. All rights reserved.

  11. Binding-site assessment by virtual fragment screening.

    Directory of Open Access Journals (Sweden)

    Niu Huang

    2010-04-01

    The accurate prediction of protein druggability (propensity to bind high-affinity drug-like small molecules) would greatly benefit the fields of chemical genomics and drug discovery. We have developed a novel approach to quantitatively assess protein druggability by computationally screening a fragment-like compound library. In analogy to NMR-based fragment screening, we dock approximately 11,000 fragments against a given binding site and compute a computational hit rate based on the fraction of molecules that exceed an empirically chosen score cutoff. We perform a large-scale evaluation of the approach on four datasets, totaling 152 binding sites. We demonstrate that computed hit rates correlate with hit rates measured experimentally in a previously published NMR-based screening method. Secondly, we show that the in silico fragment screening method can be used to distinguish known druggable and non-druggable targets, including both enzymes and protein-protein interaction sites. Finally, we explore the sensitivity of the results to different receptor conformations, including flexible protein-protein interaction sites. Besides its original aim to assess druggability of different protein targets, this method could be used to identify druggable conformations of flexible binding sites for lead discovery, and to suggest strategies for growing or joining initial fragment hits to obtain more potent inhibitors.
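
    The computational hit rate described above reduces to a simple fraction. The sketch below assumes that docking scores are energy-like, so a more negative score is better and a fragment counts as a hit when its score falls below the cutoff. The scores and the cutoff are invented for illustration, and `hit_rate` is a hypothetical helper, not part of the published method.

```python
def hit_rate(scores, cutoff):
    """Fraction of docked fragments whose score is better than the cutoff.

    Assumes lower (more negative) scores indicate better predicted binding.
    """
    if not scores:
        raise ValueError("at least one docking score is required")
    hits = sum(1 for score in scores if score < cutoff)
    return hits / len(scores)


# Hypothetical docking scores for five fragments against one binding site.
scores = [-32.1, -18.4, -25.0, -40.2, -12.7]
print(hit_rate(scores, cutoff=-24.0))  # 0.6
```

    Comparing this fraction across binding sites, with the same library and cutoff, is what would allow sites to be ranked by predicted druggability.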

  12. Metagenome Fragment Classification Using N-Mer Frequency Profiles

    Directory of Open Access Journals (Sweden)

    Gail Rosen

    2008-01-01

    A vast amount of microbial sequencing data is being generated through large-scale projects in ecology, agriculture, and human health. Efficient high-throughput methods are needed to analyze the large amounts of metagenomic data, that is, all DNA present in an environmental sample. A major obstacle in metagenomics is the inability to obtain accuracy using technology that yields short reads. We construct the unique N-mer frequency profiles of 635 microbial genomes publicly available as of February 2008. These profiles are used to train a naive Bayes classifier (NBC) that can be used to identify the genome of any fragment. We show that our method is comparable to BLAST for small 25 bp fragments but does not have the ambiguity of BLAST's tied top scores. We demonstrate that this approach is scalable to identify any fragment from hundreds of genomes. It also performs quite well at the strain, species, and genera levels and achieves strain resolution despite classifying ubiquitous genomic fragments (gene and nongene regions). Cross-validation analysis demonstrates that species-accuracy achieves 90% for highly-represented species containing an average of 8 strains. We demonstrate that such a tool can be used on the Sargasso Sea dataset, and our analysis shows that NBC can be further enhanced.
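
    The idea of classifying a fragment by N-mer frequency profiles with a naive Bayes classifier can be sketched as follows. This is a simplified illustration under stated assumptions: add-one smoothing over the 4^N possible N-mers, a uniform prior over genomes, and two invented toy "genomes". The function names are hypothetical and the code is not the authors' NBC implementation.

```python
import math
from collections import Counter


def nmer_counts(seq, n):
    """Count every overlapping N-mer in a sequence."""
    return Counter(seq[i:i + n] for i in range(len(seq) - n + 1))


def train(genomes, n):
    """Build per-genome log-probabilities of N-mers with add-one smoothing."""
    vocab_size = 4 ** n
    model = {}
    for name, seq in genomes.items():
        counts = nmer_counts(seq, n)
        total = sum(counts.values()) + vocab_size
        seen = {kmer: math.log((c + 1) / total) for kmer, c in counts.items()}
        unseen = math.log(1 / total)  # log-probability of an unobserved N-mer
        model[name] = (seen, unseen)
    return model


def classify(fragment, model, n):
    """Return the genome with the highest log-likelihood (uniform prior)."""
    frag_counts = nmer_counts(fragment, n)

    def log_likelihood(name):
        seen, unseen = model[name]
        return sum(c * seen.get(kmer, unseen) for kmer, c in frag_counts.items())

    return max(model, key=log_likelihood)


# Hypothetical toy genomes; real profiles would come from full genomes.
genomes = {"A": "ACGTACGTACGTACGTACGT", "B": "TTGGCCAATTGGCCAATTGG"}
model = train(genomes, n=3)
print(classify("GTACGTAC", model, 3))  # A
```

    Because the classifier sums log-probabilities, every fragment receives one score per genome, and exact ties are less common than with alignment scores, although the argmax here would still resolve a tie arbitrarily.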

  13. Fragment-based approaches to the discovery of kinase inhibitors.

    Science.gov (United States)

    Mortenson, Paul N; Berdini, Valerio; O'Reilly, Marc

    2014-01-01

    Protein kinases are one of the most important families of drug targets, and aberrant kinase activity has been linked to a large number of disease areas. Although eminently targetable using small molecules, kinases present a number of challenges as drug targets, not least obtaining selectivity across such a large and relatively closely related target family. Fragment-based drug discovery involves screening simple, low-molecular weight compounds to generate initial hits against a target. These hits are then optimized to more potent compounds via medicinal chemistry, usually facilitated by structural biology. Here, we will present a number of recent examples of fragment-based approaches to the discovery of kinase inhibitors, detailing the construction of fragment-screening libraries, the identification and validation of fragment hits, and their optimization into potent and selective lead compounds. The advantages of fragment-based methodologies will be discussed, along with some of the challenges associated with using this route. Finally, we will present a number of key lessons derived both from our own experience running fragment screens against kinases and from a large number of published studies.

  14. Prediction of Protein-Protein Interactions by NanoLuc-Based Protein-Fragment Complementation Assay | Office of Cancer Genomics

    Science.gov (United States)

    The CTD2 Center at Emory has developed a new NanoLuc®-based protein-fragment complementation assay (NanoPCA) which allows the detection of novel protein-protein interactions (PPI). NanoPCA allows the study of PPI dynamics with reversible interactions.

  15. NMR-Fragment Based Virtual Screening: A Brief Overview.

    Science.gov (United States)

    Singh, Meenakshi; Tam, Benjamin; Akabayov, Barak

    2018-01-25

    Fragment-based drug discovery (FBDD) using NMR has become a central approach over the last twenty years for development of small molecule inhibitors against biological macromolecules, to control a variety of cellular processes. Yet, several considerations should be taken into account for obtaining a therapeutically relevant agent. In this review, we aim to list the considerations that make NMR fragment screening a successful process for yielding potent inhibitors. Factors that may govern the competence of NMR in fragment based drug discovery are discussed, as well as later steps that involve optimization of hits obtained by NMR-FBDD.

  16. NMR-Fragment Based Virtual Screening: A Brief Overview

    Directory of Open Access Journals (Sweden)

    Meenakshi Singh

    2018-01-01

    Fragment-based drug discovery (FBDD) using NMR has become a central approach over the last twenty years for development of small molecule inhibitors against biological macromolecules, to control a variety of cellular processes. Yet, several considerations should be taken into account for obtaining a therapeutically relevant agent. In this review, we aim to list the considerations that make NMR fragment screening a successful process for yielding potent inhibitors. Factors that may govern the competence of NMR in fragment based drug discovery are discussed, as well as later steps that involve optimization of hits obtained by NMR-FBDD.

  17. A probabilistic fragment-based protein structure prediction algorithm.

    Directory of Open Access Journals (Sweden)

    David Simoncini

    Conformational sampling is one of the bottlenecks in fragment-based protein structure prediction approaches. They generally start with a coarse-grained optimization where mainchain atoms and centroids of side chains are considered, followed by a fine-grained optimization with an all-atom representation of proteins. It is during this coarse-grained phase that fragment-based methods intensely sample the conformational space. If the native-like region is sampled more, the accuracy of the final all-atom predictions may be improved accordingly. In this work we present EdaFold, a new method for fragment-based protein structure prediction based on an Estimation of Distribution Algorithm. Fragment-based approaches build protein models by assembling short fragments from known protein structures. Whereas the probability mass functions over the fragment libraries are uniform in the usual case, we propose an algorithm that learns from previously generated decoys and steers the search toward native-like regions. A comparison with the Rosetta AbInitio protocol shows that EdaFold is able to generate models with lower energies and to enhance the percentage of near-native coarse-grained decoys on a benchmark of [Formula: see text] proteins. The best coarse-grained models produced by both methods were refined into all-atom models and used in molecular replacement. All-atom decoys produced out of EdaFold's decoy set reach high enough accuracy to solve the crystallographic phase problem by molecular replacement for some test proteins. EdaFold showed a higher success rate in molecular replacement when compared to Rosetta. Our study suggests that improving low resolution coarse-grained decoys allows computational methods to avoid subsequent sampling issues during all-atom refinement and to produce better all-atom models. EdaFold can be downloaded from http://www.riken.jp/zhangiru/software.html [corrected].
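
    The general idea of an Estimation of Distribution Algorithm over fragment libraries can be sketched in a few lines. The code below is a generic illustration, not EdaFold's actual update rule: it assumes one probability mass function per sequence position, a made-up energy function in which lower fragment indices are better, and a simple blend of the old distribution with the fragment frequencies seen in the lowest-energy decoys. All names and parameters are hypothetical.

```python
import random


def eda_fragment_search(n_positions, n_fragments, energy, iterations=10,
                        decoys_per_iter=300, elite_fraction=0.2,
                        learning_rate=0.5, seed=0):
    """Learn per-position fragment probabilities from low-energy decoys."""
    rng = random.Random(seed)
    # Start from uniform probability mass functions, one per position.
    pmfs = [[1.0 / n_fragments] * n_fragments for _ in range(n_positions)]
    for _ in range(iterations):
        # Sample decoys: one fragment index per position, drawn from the pmfs.
        decoys = [
            [rng.choices(range(n_fragments), weights=pmf)[0] for pmf in pmfs]
            for _ in range(decoys_per_iter)
        ]
        # Keep the lowest-energy decoys as the elite set.
        decoys.sort(key=energy)
        elite = decoys[: max(1, int(elite_fraction * decoys_per_iter))]
        # Shift each pmf toward the fragment frequencies observed in the elite.
        for pos, pmf in enumerate(pmfs):
            for frag in range(n_fragments):
                freq = sum(1 for d in elite if d[pos] == frag) / len(elite)
                pmf[frag] = (1 - learning_rate) * pmf[frag] + learning_rate * freq
    return pmfs


def toy_energy(decoy):
    """Hypothetical energy: fragment 0 is the best choice at every position."""
    return sum(decoy)


pmfs = eda_fragment_search(n_positions=5, n_fragments=4, energy=toy_energy)
for pmf in pmfs:
    print([round(p, 3) for p in pmf])
```

    With a uniform distribution every fragment is equally likely at every step, whereas here the sampling mass drifts toward fragments that appeared in low-energy decoys, which is the steering effect the abstract describes.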

  18. Restricted fragmentation of poliovirus type 1, 2, and 3 RNAs by ribonuclease III

    Energy Technology Data Exchange (ETDEWEB)

    Nomoto, A. (State Univ. of New York, Stony Brook); Lee, Y.F.; Babich, A.; Jacobson, A.; Dunn, J.J.; Wimmer, E.

    1979-01-01

    Cleavage of the genome RNAs of poliovirus type 1, 2, and 3 with the ribonuclease III of Escherichia coli has been investigated with the following results: (1) at or above physiological salt concentration, the RNAs are completely resistant to the action of the enzyme, an observation suggesting that the RNAs lack primary cleavage sites; (2) lowering the salt concentration to 0.1 M or below allows RNase III to cleave the RNAs at secondary sites. Both large and small fragments can be obtained in a reproducible manner depending on salt conditions chosen for cleavage. Fingerprints of three large fragments of poliovirus type 2 RNA show that they originate from unique segments and represent most if not all sequences of the genome. Based upon binding to poly(U) filters of poly(A)-linked fragments, a physical map of the large fragments of poliovirus type 2 RNA was constructed. The data suggest that RNase III cleavage of single-stranded RNA provides a useful method to fragment the RNA for further studies.

  19. Towards novel therapeutics for HIV through fragment-based screening and drug design.

    Science.gov (United States)

    Tiefendbrunn, Theresa; Stout, C David

    2014-01-01

    Fragment-based drug discovery has been applied with varying levels of success to a number of proteins involved in the HIV (Human Immunodeficiency Virus) life cycle. Fragment-based approaches have led to the discovery of novel binding sites within protease, reverse transcriptase, integrase, and gp41. Novel compounds that bind to known pockets within CCR5 have also been identified via fragment screening, and a fragment-based approach to target the TAR-Tat interaction was explored. In the context of HIV-1 reverse transcriptase (RT), fragment-based approaches have yielded fragment hits with mid-μM activity in an in vitro activity assay, as well as fragment hits that are active against drug-resistant variants of RT. Fragment-based drug discovery is a powerful method to elucidate novel binding sites within proteins, and the method has had significant success in the context of HIV proteins.

  20. Fragment-based drug discovery using rational design.

    Science.gov (United States)

    Jhoti, H

    2007-01-01

    Fragment-based drug discovery (FBDD) is established as an alternative approach to high-throughput screening for generating novel small molecule drug candidates. In FBDD, relatively small libraries of low molecular weight compounds (or fragments) are screened using sensitive biophysical techniques to detect their binding to the target protein. A lower absolute affinity of binding is expected from fragments, compared to much higher molecular weight hits detected by high-throughput screening, due to their reduced size and complexity. Through the use of iterative cycles of medicinal chemistry, ideally guided by three-dimensional structural data, it is often then relatively straightforward to optimize these weakly binding fragment hits into potent and selective lead compounds. As with most other lead discovery methods, there are two key components of FBDD: the detection technology and the compound library. In this review, I outline the two main approaches used for detecting the binding of low affinity fragments and also some of the key principles that are used to generate a fragment library. In addition, I describe an example of how FBDD has led to the generation of a drug candidate that is now being tested in clinical trials for the treatment of cancer.

  1. Herbarium genomics

    DEFF Research Database (Denmark)

    Bakker, Freek T.; Lei, Di; Yu, Jiaying

    2016-01-01

    Herbarium genomics is proving promising as next-generation sequencing approaches are well suited to deal with the usually fragmented nature of archival DNA. We show that routine assembly of partial plastome sequences from herbarium specimens is feasible, from total DNA extracts and with specimens...... up to 146 years old. We use genome skimming and an automated assembly pipeline, Iterative Organelle Genome Assembly, that assembles paired-end reads into a series of candidate assemblies, the best one of which is selected based on likelihood estimation. We used 93 specimens from 12 different...... correlation between plastome coverage and nuclear genome size (C value) in our samples, but the range of C values included is limited. Finally, we conclude that routine plastome sequencing from herbarium specimens is feasible and cost-effective (compared with Sanger sequencing or plastome...

  2. Computational Fragment-Based Drug Design: Current Trends, Strategies, and Applications.

    Science.gov (United States)

    Bian, Yuemin; Xie, Xiang-Qun Sean

    2018-04-09

    Fragment-based drug design (FBDD) has been an effective methodology for drug development for decades. Successful applications of this strategy have brought both opportunities and challenges to the field of Pharmaceutical Science. Recent progress in computational fragment-based drug design provides an additional approach for future research in a time- and labor-efficient manner. Combining multiple in silico methodologies, computational FBDD offers flexibility in fragment library selection, protein model generation, and fragment/compound docking mode prediction. These characteristics give computational FBDD an advantage in designing novel and potential compounds for a certain target. The purpose of this review is to discuss the latest advances, ranging from commonly used strategies to novel concepts and technologies in computational fragment-based drug design. In particular, specifications and advantages are compared between experimental and computational FBDD, and limitations and future prospects are discussed and emphasized.

  3. Functional genomics of tomato

    Indian Academy of Sciences (India)

    2014-10-20

    Oct 20, 2014 ... 1Repository of Tomato Genomics Resources, Department of Plant Sciences, School .... Due to its position at the crossroads of Sanger's sequencing .... replacement for the microarray-based expression profiling. .... during RNA fragmentation step prior to library construction, ...... tomato pollen as a test case.

  4. Comprehensive genomic characterization of campylobacter genus reveals some underlying mechanisms for its genomic diversification.

    Directory of Open Access Journals (Sweden)

    Yizhuang Zhou

    Campylobacter species are phenotypically diverse in many aspects including host habitats and pathogenicities, which demands comprehensive characterization of the entire Campylobacter genus to study their underlying genetic diversification. Up to now, 34 Campylobacter strains have been sequenced and published in public databases, providing a good opportunity to systematically analyze their genomic diversities. In this study, we first conducted genomic characterization, which includes genome-wide alignments, pan-genome analysis, and phylogenetic identification, to depict the genetic diversity of Campylobacter genus. Afterward, we improved the tetranucleotide usage pattern-based naïve Bayesian classifier to identify the abnormal composition fragments (ACFs, fragments with significantly different tetranucleotide frequency profiles from its genomic tetranucleotide frequency profiles), including horizontal gene transfers (HGTs), to explore the mechanisms for the genetic diversity of this organism. Finally, we analyzed the HGTs transferred via bacteriophage transductions. To our knowledge, this study is the first to use single nucleotide polymorphism information to construct liable microevolution phylogeny of 21 Campylobacter jejuni strains. Combined with the phylogeny of all the collected Campylobacter species based on genome-wide core gene information, comprehensive phylogenetic inference of all 34 Campylobacter organisms was determined. It was found that C. jejuni harbors a high fraction of ACFs possibly through intraspecies recombination, whereas other Campylobacter members possess numerous ACFs possibly via intragenus recombination. Furthermore, some Campylobacter strains have undergone significant ancient viral integration during their evolution process. The improved method is a powerful tool for bacterial genomic analysis. Moreover, the findings would provide useful information for future research on Campylobacter genus.

  5. In silico fragment-based drug design.

    Science.gov (United States)

    Konteatis, Zenon D

    2010-11-01

    In silico fragment-based drug design (FBDD) is a relatively new approach inspired by the success of the biophysical fragment-based drug discovery field. Here, we review the progress made by this approach in the last decade and showcase how it complements and expands the capabilities of biophysical FBDD and structure-based drug design to generate diverse, efficient drug candidates. Advancements in several areas of research that have enabled the development of in silico FBDD and some applications in drug discovery projects are reviewed. The reader is introduced to various computational methods that are used for in silico FBDD, the fragment library composition for this technique, special applications used to identify binding sites on the surface of proteins and how to assess the druggability of these sites. In addition, the reader will gain insight into the proper application of this approach from examples of successful programs. In silico FBDD captures a much larger chemical space than high-throughput screening and biophysical FBDD increasing the probability of developing more diverse, patentable and efficient molecules that can become oral drugs. The application of in silico FBDD holds great promise for historically challenging targets such as protein-protein interactions. Future advances in force fields, scoring functions and automated methods for determining synthetic accessibility will all aid in delivering more successes with in silico FBDD.

  6. Genome chaos: survival strategy during crisis.

    Science.gov (United States)

    Liu, Guo; Stevens, Joshua B; Horne, Steven D; Abdallah, Batoul Y; Ye, Karen J; Bremer, Steven W; Ye, Christine J; Chen, David J; Heng, Henry H

    2014-01-01

    Genome chaos, a process of complex, rapid genome re-organization, results in the formation of chaotic genomes, which is followed by the potential to establish stable genomes. It was initially detected through cytogenetic analyses, and recently confirmed by whole-genome sequencing efforts which identified multiple subtypes including "chromothripsis", "chromoplexy", "chromoanasynthesis", and "chromoanagenesis". Although genome chaos occurs commonly in tumors, both the mechanism and detailed aspects of the process are unknown due to the inability of observing its evolution over time in clinical samples. Here, an experimental system to monitor the evolutionary process of genome chaos was developed to elucidate its mechanisms. Genome chaos occurs following exposure to chemotherapeutics with different mechanisms, which act collectively as stressors. Characterization of the karyotype and its dynamic changes prior to, during, and after induction of genome chaos demonstrates that chromosome fragmentation (C-Frag) occurs just prior to chaotic genome formation. Chaotic genomes seem to form by random rejoining of chromosomal fragments, in part through non-homologous end joining (NHEJ). Stress induced genome chaos results in increased karyotypic heterogeneity. Such increased evolutionary potential is demonstrated by the identification of increased transcriptome dynamics associated with high levels of karyotypic variance. In contrast to impacting on a limited number of cancer genes, re-organized genomes lead to new system dynamics essential for cancer evolution. Genome chaos acts as a mechanism of rapid, adaptive, genome-based evolution that plays an essential role in promoting rapid macroevolution of new genome-defined systems during crisis, which may explain some unwanted consequences of cancer treatment.

  7. Advances in fragment-based drug discovery platforms.

    Science.gov (United States)

    Orita, Masaya; Warizaya, Masaichi; Amano, Yasushi; Ohno, Kazuki; Niimi, Tatsuya

    2009-11-01

    Fragment-based drug discovery (FBDD) has been established as a powerful alternative and complement to traditional high-throughput screening techniques for identifying drug leads. At present, this technique is widely used among academic groups as well as small biotech and large pharmaceutical companies. In recent years, > 10 new compounds developed with FBDD have entered clinical development, and more and more attention in the drug discovery field is being focused on this technique. Under the FBDD approach, a fragment library of relatively small compounds (molecular mass = 100 - 300 Da) is screened by various methods and the identified fragment hits which normally weakly bind to the target are used as starting points to generate more potent drug leads. Because FBDD is still a relatively new drug discovery technology, further developments and optimizations in screening platforms and fragment exploitation can be expected. This review summarizes recent advances in FBDD platforms and discusses the factors important for the successful application of this technique. Under the FBDD approach, both identifying the starting fragment hit to be developed and generating the drug lead from that starting fragment hit are important. Integration of various techniques, such as computational technology, X-ray crystallography, NMR, surface plasmon resonance, isothermal titration calorimetry, mass spectrometry and high-concentration screening, must be applied in a situation-appropriate manner.

  8. DNA fragments assembly based on nicking enzyme system.

    Directory of Open Access Journals (Sweden)

    Rui-Yan Wang

    A couple of DNA ligation-independent cloning (LIC) methods have been reported to meet various requirements in metabolic engineering and synthetic biology. The principle of LIC is the assembly of multiple overlapping DNA fragments by single-stranded (ss) DNA overlaps annealing. Here we present a method to generate single-stranded DNA overlaps based on Nicking Endonucleases (NEases) for LIC; the method was termed NE-LIC. Factors related to cloning efficiency were optimized in this study. This NE-LIC allows generating 3'-end or 5'-end ss DNA overlaps of various lengths for fragments assembly. We demonstrated that the 10 bp/15 bp overlaps had the highest DNA fragments assembling efficiency, while 5 bp/10 bp overlaps showed the highest efficiency when T4 DNA ligase was added. Its advantage over Sequence and Ligation Independent Cloning (SLIC) and Uracil-Specific Excision Reagent (USER) was obvious. The mechanism can be applied to many other LIC strategies. Finally, the NEases based LIC (NE-LIC) was successfully applied to assemble a pathway of six gene fragments responsible for synthesizing microbial poly-3-hydroxybutyrate (PHB).

  9. Performances of Different Fragment Sizes for Reduced Representation Bisulfite Sequencing in Pigs.

    Science.gov (United States)

    Yuan, Xiao-Long; Zhang, Zhe; Pan, Rong-Yang; Gao, Ning; Deng, Xi; Li, Bin; Zhang, Hao; Sangild, Per Torp; Li, Jia-Qi

    2017-01-01

    Reduced representation bisulfite sequencing (RRBS) has been widely used to profile genome-scale DNA methylation in mammalian genomes. However, the applications and technical performance of RRBS with different fragment sizes have not been systematically reported in pigs, which serve as one of the important biomedical models for humans. The aim of this study was to evaluate the capacity of RRBS libraries with different fragment sizes to characterize the porcine genome. We found that the Msp I-digested segments between 40 and 220 bp harbored a high distribution peak at 74 bp; these segments overlapped heavily with repetitive elements and might reduce the unique mapping alignment. The RRBS library of 110-220 bp fragment size had the highest unique mapping alignment and the lowest multiple alignment. The cost-effectiveness of the 40-110 bp, 110-220 bp and 40-220 bp fragment sizes might decrease when the dataset size was more than 70, 50 and 110 million reads, respectively. Given a 50-million-read dataset, the average sequencing depth of the detected CpG sites in the 110-220 bp fragment size appeared to be deeper than in the 40-110 bp and 40-220 bp fragment sizes, and these detected CpG sites were located differently in gene- and CpG island-related regions. Our results demonstrated that the selection of fragment sizes could affect the numbers and sequencing depth of detected CpG sites as well as the cost-efficiency. No single solution of RRBS is optimal in all circumstances for investigating genome-scale DNA methylation. This work provides useful knowledge on designing and executing RRBS for investigating genome-wide DNA methylation in tissues from pigs.
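
    The size-selection step described in this abstract can be sketched in silico. The snippet below is a minimal illustration, not the authors' pipeline: it assumes the standard Msp I recognition site CCGG with the cut after the first C, keeps only fragments bounded by two cut sites, and uses hypothetical function names (`mspi_fragments`, `size_select`).

```python
import re


def mspi_fragments(seq: str) -> list[str]:
    """Return fragments lying between consecutive Msp I cut sites (C^CGG)."""
    seq = seq.upper()
    # The enzyme cuts one base into each CCGG site.
    cuts = [m.start() + 1 for m in re.finditer(r"(?=CCGG)", seq)]
    # Terminal pieces are dropped because they are not bounded by two sites.
    return [seq[a:b] for a, b in zip(cuts, cuts[1:])]


def size_select(fragments: list[str], lo: int, hi: int) -> list[str]:
    """Keep fragments whose length falls in the inclusive window [lo, hi]."""
    return [f for f in fragments if lo <= len(f) <= hi]


if __name__ == "__main__":
    toy = "AACCGGTTTTCCGGAA"  # toy sequence, not porcine data
    frags = mspi_fragments(toy)
    print(frags, size_select(frags, 40, 220))
```

    On a real genome the same two calls would be applied per chromosome with windows such as 40-110, 110-220 or 40-220 bp; the toy sequence here is far too short to yield fragments in those ranges.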

  10. Process Fragment Libraries for Easier and Faster Development of Process-based Applications

    Directory of Open Access Journals (Sweden)

    David Schumm

    2011-01-01

    Full Text Available The term “process fragment” has recently been gaining momentum in business process management research. We understand a process fragment as a connected and reusable process structure, which has relaxed completeness and consistency criteria compared to executable processes. We claim that process fragments allow for an easier and faster development of process-based applications. As evidence for this claim we present a process fragment concept and show a sample collection of concrete, real-world process fragments. We present advanced application scenarios for using such fragments in the development of process-based applications. Process fragments are typically managed in a repository, forming a process fragment library. Building on a process fragment library from previous work, we discuss the potential impact of using process fragment libraries in cross-enterprise collaboration and application integration.

  11. Leveraging structure determination with fragment screening for infectious disease drug targets: MECP synthase from Burkholderia pseudomallei

    Energy Technology Data Exchange (ETDEWEB)

    Begley, Darren W.; Hartley, Robert C.; Davies, Douglas R.; Edwards, Thomas E.; Leonard, Jess T.; Abendroth, Jan; Burris, Courtney A.; Bhandari, Janhavi; Myler, Peter J.; Staker, Bart L.; Stewart, Lance J. (UWASH); (Emerald)

    2011-09-28

    As part of the Seattle Structural Genomics Center for Infectious Disease, we seek to enhance structural genomics with ligand-bound structure data which can serve as a blueprint for structure-based drug design. We have adapted fragment-based screening methods to our structural genomics pipeline to generate multiple ligand-bound structures of high priority drug targets from pathogenic organisms. In this study, we report fragment screening methods and structure determination results for 2C-methyl-D-erythritol-2,4-cyclo-diphosphate (MECP) synthase from Burkholderia pseudomallei, the gram-negative bacterium which causes melioidosis. Screening by nuclear magnetic resonance spectroscopy as well as crystal soaking followed by X-ray diffraction led to the identification of several small molecules which bind this enzyme in a critical metabolic pathway. A series of complex structures obtained with screening hits reveal distinct binding pockets and a range of small molecules which form complexes with the target. Additional soaks with these compounds further demonstrate a subset of fragments to only bind the protein when present in specific combinations. This ensemble of fragment-bound complexes illuminates several characteristics of MECP synthase, including a previously unknown binding surface external to the catalytic active site. These ligand-bound structures now serve to guide medicinal chemists and structural biologists in rational design of novel inhibitors for this enzyme.

  12. Process of Fragment-Based Lead Discovery—A Perspective from NMR

    Directory of Open Access Journals (Sweden)

    Rongsheng Ma

    2016-07-01

    Full Text Available Fragment-based lead discovery (FBLD) has proven fruitful during the past two decades for a variety of targets, even challenging protein–protein interaction (PPI) systems. Nuclear magnetic resonance (NMR) spectroscopy plays a vital role, from initial fragment-based screening to lead generation, because of its power to probe the intrinsically weak interactions between targets and low-molecular-weight fragments. Here, we review the NMR FBLD process from initial library construction to lead generation. We describe technical aspects regarding fragment library design, ligand- and protein-observed screening, and protein–ligand structure model generation. For weak binders, the initial hit-to-lead evolution can be guided by structural information retrieved from NMR spectroscopy, including chemical shift perturbation, transferred pseudocontact shifts, and paramagnetic relaxation enhancement. This perspective examines structure-guided optimization from weak fragment screening hits to potent leads for challenging PPI targets.

  13. Fragment Linking and Optimization of Inhibitors of the Aspartic Protease Endothiapepsin : Fragment-Based Drug Design Facilitated by Dynamic Combinatorial Chemistry

    NARCIS (Netherlands)

    Mondal, Milon; Radeva, Nedyalka; Fanlo-Virgos, Hugo; Otto, Sijbren; Klebe, Gerhard; Hirsch, Anna K. H.

    2016-01-01

    Fragment-based drug design (FBDD) affords active compounds for biological targets. While there are numerous reports on FBDD by fragment growing/optimization, fragment linking has rarely been reported. Dynamic combinatorial chemistry (DCC) has become a powerful hit-identification strategy for

  14. OCCURRENCE OF SMALL HOMOLOGOUS AND COMPLEMENTARY FRAGMENTS IN HUMAN VIRUS GENOMES AND THEIR POSSIBLE ROLE

    Directory of Open Access Journals (Sweden)

    E. P. Kharchenko

    2017-01-01

    Full Text Available Using computer analysis, the occurrence of small homologous and complementary fragments (21 nucleotides in length) has been studied in the genomes of 14 human viruses causing the most dangerous infections. The sample includes viruses with (+) and (–) single-stranded RNA and the DNA-containing hepatitis B virus. Analysis of the occurrence of homologous sequences has shown the existence of two extreme situations. On the one hand, the same virus may contain sequences homologous to almost all other viruses (for example, Ebola virus, severe acute respiratory syndrome-related coronavirus, and mumps virus), and numerous sequences homologous to the same other virus (especially in severe acute respiratory syndrome-related coronavirus to Dengue virus and in Ebola virus to poliovirus). On the other hand, homologous sequences are rare and few in the genomes of other viruses (rubella virus, hepatitis A virus, and hepatitis B virus). A similar situation exists for the occurrence of complementary sequences. Rubella virus, whose genome has a high content of guanine and cytosine, has no complementary sequences to almost all other viruses. Most viruses have a moderate level of occurrence of homologous and complementary sequences. Autocomplementary sequences are numerous in most viruses, and one may suggest that the genome of single-stranded RNA viruses has a branched secondary structure. In addition to a possible role in recombination among strains, autocomplementary sequences could be regulators of the translation rate of virus proteins and determine its optimal proportion in virion assembly with genome and mRNA folding. The occurrence of small homologous and complementary sequences in RNA- and DNA-containing viruses may be the result of multiple recombinations in the past and the present and determine their adaptation and variability. Recombination may take place in coinfection of human and/or common hosts. Inclusion of homologous and complementary sequences into genome could not

  15. The ways and means of fragment-based drug design.

    Science.gov (United States)

    Doak, Bradley C; Norton, Raymond S; Scanlon, Martin J

    2016-11-01

    Fragment-based drug design (FBDD) has emerged as a mainstream approach for the rapid and efficient identification of building blocks that can be used to develop high-affinity ligands against protein targets. One of the strengths of FBDD is the relative ease and low cost of the primary screen to identify fragments that bind. However, the fragments that emerge from primary screens often have low affinities, with K D values in the high μM to mM range, and a significant challenge for FBDD is to develop the initial fragments into more potent ligands. Successful fragment elaboration often requires co-structures of the fragments bound to their target proteins, as well as a range of biophysical and biochemical assays to track potency and efficacy. These challenges have led to the development of specific chemical strategies for the elaboration of weakly-binding fragments into more potent "hits" and lead compounds. In this article we review different approaches that have been employed to meet these challenges and describe some of the strategies that have resulted in several fragment-derived compounds entering clinical trials. Copyright © 2016 Elsevier Inc. All rights reserved.

  16. Evolutions in fragment-based drug design: the deconstruction–reconstruction approach

    Science.gov (United States)

    Chen, Haijun; Zhou, Xiaobin; Wang, Ailan; Zheng, Yunquan; Gao, Yu; Zhou, Jia

    2014-01-01

    Recent advances in the understanding of molecular recognition and protein–ligand interactions have facilitated rapid development of potent and selective ligands for therapeutically relevant targets. Over the past two decades, a variety of useful approaches and emerging techniques have been developed to promote the identification and optimization of leads that have high potential for generating new therapeutic agents. Intriguingly, the innovation of a fragment-based drug design (FBDD) approach has enabled rapid and efficient progress in drug discovery. In this critical review, we focus on the construction of fragment libraries and the advantages and disadvantages of various fragment-based screening (FBS) for constructing such libraries. We also highlight the deconstruction–reconstruction strategy by utilizing privileged fragments of reported ligands. PMID:25263697

  17. Fragment based drug discovery: practical implementation based on ¹⁹F NMR spectroscopy.

    Science.gov (United States)

    Jordan, John B; Poppe, Leszek; Xia, Xiaoyang; Cheng, Alan C; Sun, Yax; Michelsen, Klaus; Eastwood, Heather; Schnier, Paul D; Nixey, Thomas; Zhong, Wenge

    2012-01-26

    Fragment based drug discovery (FBDD) is a widely used tool for discovering novel therapeutics. NMR is a powerful means for implementing FBDD, and several approaches have been proposed utilizing ¹H-¹⁵N heteronuclear single quantum coherence (HSQC) as well as one-dimensional ¹H and ¹⁹F NMR to screen compound mixtures against a target of interest. While proton-based NMR methods of fragment screening (FBS) have been well documented and are widely used, the use of ¹⁹F detection in FBS has been only recently introduced (Vulpetti et al. J. Am. Chem. Soc. 2009, 131 (36), 12949-12959) with the aim of targeting "fluorophilic" sites in proteins. Here, we demonstrate a more general use of ¹⁹F NMR-based fragment screening in several areas: as a key tool for rapid and sensitive detection of fragment hits, as a method for the rapid development of structure-activity relationship (SAR) on the hit-to-lead path using in-house libraries and/or commercially available compounds, and as a quick and efficient means of assessing target druggability.

  18. The heat is on: thermodynamic analysis in fragment-based drug discovery

    NARCIS (Netherlands)

    Edink, E.S.; Jansen, C.J.W.; Leurs, R.; De Esch, I.J.

    2010-01-01

    Thermodynamic analysis provides access to the determinants of binding affinity, enthalpy and entropy. In fragment-based drug discovery (FBDD), thermodynamic analysis provides a powerful tool to discriminate fragments based on their potential for successful optimization. The thermodynamic data

  19. ACFIS: a web server for fragment-based drug discovery

    Science.gov (United States)

    Hao, Ge-Fei; Jiang, Wen; Ye, Yuan-Nong; Wu, Feng-Xu; Zhu, Xiao-Lei; Guo, Feng-Biao; Yang, Guang-Fu

    2016-01-01

    In order to foster innovation and improve the effectiveness of drug discovery, there is a considerable interest in exploring unknown ‘chemical space’ to identify new bioactive compounds with novel and diverse scaffolds. Hence, fragment-based drug discovery (FBDD) was developed rapidly due to its advanced expansive search for ‘chemical space’, which can lead to a higher hit rate and ligand efficiency (LE). However, computational screening of fragments is always hampered by the promiscuous binding model. In this study, we developed a new web server Auto Core Fragment in silico Screening (ACFIS). It includes three computational modules, PARA_GEN, CORE_GEN and CAND_GEN. ACFIS can generate core fragment structure from the active molecule using fragment deconstruction analysis and perform in silico screening by growing fragments to the junction of core fragment structure. An integrated energy calculation rapidly identifies which fragments fit the binding site of a protein. We constructed a simple interface to enable users to view top-ranking molecules in 2D and the binding mode in 3D for further experimental exploration. This makes the ACFIS a highly valuable tool for drug discovery. The ACFIS web server is free and open to all users at http://chemyang.ccnu.edu.cn/ccb/server/ACFIS/. PMID:27150808

  20. Value-based genomics.

    Science.gov (United States)

    Gong, Jun; Pan, Kathy; Fakih, Marwan; Pal, Sumanta; Salgia, Ravi

    2018-03-20

    Advancements in next-generation sequencing have greatly enhanced the development of biomarker-driven cancer therapies. The affordability and availability of next-generation sequencers have allowed for the commercialization of next-generation sequencing platforms that have found widespread use for clinical-decision making and research purposes. Despite the greater availability of tumor molecular profiling by next-generation sequencing at our doorsteps, the achievement of value-based care, or improving patient outcomes while reducing overall costs or risks, in the era of precision oncology remains a looming challenge. In this review, we highlight available data through a pre-established and conceptualized framework for evaluating value-based medicine to assess the cost (efficiency), clinical benefit (effectiveness), and toxicity (safety) of genomic profiling in cancer care. We also provide perspectives on future directions of next-generation sequencing from targeted panels to whole-exome or whole-genome sequencing and describe potential strategies needed to attain value-based genomics.

  1. Fragment-Based Protein-Protein Interaction Antagonists of a Viral Dimeric Protease.

    Science.gov (United States)

    Gable, Jonathan E; Lee, Gregory M; Acker, Timothy M; Hulce, Kaitlin R; Gonzalez, Eric R; Schweigler, Patrick; Melkko, Samu; Farady, Christopher J; Craik, Charles S

    2016-04-19

    Fragment-based drug discovery has shown promise as an approach for challenging targets such as protein-protein interfaces. We developed and applied an activity-based fragment screen against dimeric Kaposi's sarcoma-associated herpesvirus protease (KSHV Pr) using an optimized fluorogenic substrate. Dose-response determination was performed as a confirmation screen, and NMR spectroscopy was used to map fragment inhibitor binding to KSHV Pr. Kinetic assays demonstrated that several initial hits also inhibit human cytomegalovirus protease (HCMV Pr). Binding of these hits to HCMV Pr was also confirmed by NMR spectroscopy. Despite the use of a target-agnostic fragment library, more than 80 % of confirmed hits disrupted dimerization and bound to a previously reported pocket at the dimer interface of KSHV Pr, not to the active site. One class of fragments, an aminothiazole scaffold, was further explored using commercially available analogues. These compounds demonstrated greater than 100-fold improvement of inhibition. This study illustrates the power of fragment-based screening for these challenging enzymatic targets and provides an example of the potential druggability of pockets at protein-protein interfaces. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. Telomere Restriction Fragment (TRF) Analysis.

    Science.gov (United States)

    Mender, Ilgen; Shay, Jerry W

    2015-11-20

    restriction enzyme recognition sites within TTAGGG tandem telomeric repeats, therefore digestion of genomic DNA, not telomeric DNA, with a combination of 6 base restriction endonucleases reduces genomic DNA size to less than 800 bp.

  3. Genome-Wide Single-Nucleotide Polymorphisms Discovery and High-Density Genetic Map Construction in Cauliflower Using Specific-Locus Amplified Fragment Sequencing

    Science.gov (United States)

    Zhao, Zhenqing; Gu, Honghui; Sheng, Xiaoguang; Yu, Huifang; Wang, Jiansheng; Huang, Long; Wang, Dan

    2016-01-01

    Molecular markers and genetic maps play an important role in plant genomics and breeding studies. Cauliflower is an important and distinctive vegetable; however, very few molecular resources have been reported for this species. In this study, a novel, specific-locus amplified fragment (SLAF) sequencing strategy was employed for large-scale single nucleotide polymorphism (SNP) discovery and high-density genetic map construction in a double-haploid, segregating population of cauliflower. A total of 12.47 Gb raw data containing 77.92 M pair-end reads were obtained after processing and 6815 polymorphic SLAFs between the two parents were detected. The average sequencing depths reached 52.66-fold for the female parent and 49.35-fold for the male parent. Subsequently, these polymorphic SLAFs were used to genotype the population and further filtered based on several criteria to construct a genetic linkage map of cauliflower. Finally, 1776 high-quality SLAF markers, including 2741 SNPs, constituted the linkage map with average data integrity of 95.68%. The final map spanned a total genetic length of 890.01 cM with an average marker interval of 0.50 cM, and covered 364.9 Mb of the reference genome. The markers and genetic map developed in this study could provide an important foundation not only for comparative genomics studies within Brassica oleracea species but also for quantitative trait loci identification and molecular breeding of cauliflower. PMID:27047515
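
    The summary statistics quoted in this abstract can be cross-checked with simple arithmetic. The sketch below assumes the average marker interval is the total map length divided by the number of markers, which is one common convention and may not be exactly how the authors computed it; the function name `mean_interval` is hypothetical.

```python
def mean_interval(total_length_cm: float, n_markers: int) -> float:
    """Average spacing in cM, assuming total length / marker count."""
    return total_length_cm / n_markers


if __name__ == "__main__":
    # Figures taken from the abstract above.
    total_cm, markers, snps = 890.01, 1776, 2741
    print(f"average interval: {mean_interval(total_cm, markers):.2f} cM")
    print(f"SNPs per SLAF marker: {snps / markers:.2f}")
```

    Under that assumption the computed interval rounds to the 0.50 cM reported in the abstract.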

  4. Fragment-based drug discovery and protein–protein interactions

    Directory of Open Access Journals (Sweden)

    Turnbull AP

    2014-09-01

    Full Text Available Andrew P Turnbull,1 Susan M Boyd,2 Björn Walse3. 1CRT Discovery Laboratories, Department of Biological Sciences, Birkbeck, University of London, London, UK; 2IOTA Pharmaceuticals Ltd, Cambridge, UK; 3SARomics Biostructures AB, Lund, Sweden. Abstract: Protein–protein interactions (PPIs) are involved in many biological processes, with an estimated 400,000 PPIs within the human proteome. There is significant interest in exploiting the relatively unexplored potential of these interactions in drug discovery, driven by the need to find new therapeutic targets. Compared with classical drug discovery against targets with well-defined binding sites, developing small-molecule inhibitors against PPIs where the contact surfaces are frequently more extensive and comparatively flat, with most of the binding energy localized in “hot spots”, has proven far more challenging. However, despite the difficulties associated with targeting PPIs, important progress has been made in recent years with fragment-based drug discovery playing a pivotal role in improving their tractability. Computational and empirical approaches can be used to identify hot-spot regions and assess the druggability and ligandability of new targets, whilst fragment screening campaigns can detect low-affinity fragments that either directly or indirectly perturb the PPI. Once fragment hits have been identified and confirmed using biochemical and biophysical approaches, three-dimensional structural data derived from nuclear magnetic resonance or X-ray crystallography can be used to drive medicinal chemistry efforts towards the development of more potent inhibitors. A small-scale comparison presented in this review of “standard” fragments with those targeting PPIs has revealed that the latter tend to be larger, be more lipophilic, and contain more polar (acid/base) functionality, whereas three-dimensional descriptor data indicate that there is little difference in their three

  5. Magnetic bead purification of labeled DNA fragments for high-throughput capillary electrophoresis sequencing

    Energy Technology Data Exchange (ETDEWEB)

    Elkin, Christopher; Kapur, Hitesh; Smith, Troy; Humphries, David; Pollard, Martin; Hammon, Nancy; Hawkins, Trevor

    2001-09-15

    We have developed an automated purification method for terminator sequencing products based on a magnetic bead technology. This 384-well protocol generates labeled DNA fragments that are essentially free of contaminants for less than $0.005 per reaction. In comparison to laborious ethanol precipitation protocols, this method increases the phred20 read length by forty bases with various DNA templates such as PCR fragments, plasmids, cosmids and RCA products. Our method eliminates centrifugation and is compatible with both the MegaBACE 1000 and ABI Prism 3700 capillary instruments. As of September 2001, this method has produced over 1.6 million samples, with 93 percent averaging 620 phred20 bases, as part of the Joint Genome Institute's production process.

  6. First insight into the genome of an uncultivated crenarchaeote from soil

    DEFF Research Database (Denmark)

    Quaiser, Achim; Ochsenreiter, Torsten; Klenk, Hans-Peter

    2002-01-01

    RNA genes and of several protein encoding genes (e.g. DNA polymerase, FixAB, glycosyl transferase) confirmed the specific affiliation of the genomic fragment with the non-thermophilic clade of the crenarchaeota. Content and structure of the genomic fragment indicated that the archaea from soil differ......Molecular phylogenetic surveys based on the characterization of 16S rRNA genes have revealed that soil is an environment particularly rich in microbial diversity. A clade of crenarchaeota (archaea) has frequently been detected among many other novel lineages of uncultivated bacteria. In this study...... we have initiated a genomic approach for the characterization of uncultivated microorganisms from soil. We have developed a procedure based on a two-phase electrophoresis technique that allows the fast and reliable purification of concentrated and clonable, high molecular weight DNA. From this DNA we...

  7. Genomic DNA fingerprinting of clinical Haemophilus influenzae isolates by polymerase chain reaction amplification: comparison with major outer-membrane protein and restriction fragment length polymorphism analysis

    NARCIS (Netherlands)

    van Belkum, A.; Duim, B.; Regelink, A.; Möller, L.; Quint, W.; van Alphen, L.

    1994-01-01

    Non-capsulate strains of Haemophilus influenzae were genotyped by analysis of variable DNA segments obtained by amplification of genomic DNA with the polymerase chain reaction (PCR fingerprinting). Discrete fragments of 100-2000 bp were obtained. The reproducibility of the procedure was assessed by

  8. GENOMIC DNA-FINGERPRINTING OF CLINICAL HAEMOPHILUS-INFLUENZAE ISOLATES BY POLYMERASE CHAIN-REACTION AMPLIFICATION - COMPARISON WITH MAJOR OUTER-MEMBRANE PROTEIN AND RESTRICTION-FRAGMENT-LENGTH-POLYMORPHISM ANALYSIS

    NARCIS (Netherlands)

    VANBELKUM, A; DUIM, B; REGELINK, A; MOLLER, L; QUINT, W; VANALPHEN, L

    Non-capsulate strains of Haemophilus influenzae were genotyped by analysis of variable DNA segments obtained by amplification of genomic DNA with the polymerase chain reaction (PCR fingerprinting). Discrete fragments of 100-2000 bp were obtained. The reproducibility of the procedure was assessed by

  9. Fragment Linking and Optimization of Inhibitors of the Aspartic Protease Endothiapepsin: Fragment-Based Drug Design Facilitated by Dynamic Combinatorial Chemistry.

    Science.gov (United States)

    Mondal, Milon; Radeva, Nedyalka; Fanlo-Virgós, Hugo; Otto, Sijbren; Klebe, Gerhard; Hirsch, Anna K H

    2016-08-01

    Fragment-based drug design (FBDD) affords active compounds for biological targets. While there are numerous reports on FBDD by fragment growing/optimization, fragment linking has rarely been reported. Dynamic combinatorial chemistry (DCC) has become a powerful hit-identification strategy for biological targets. We report the synergistic combination of fragment linking and DCC to identify inhibitors of the aspartic protease endothiapepsin. Based on X-ray crystal structures of endothiapepsin in complex with fragments, we designed a library of bis-acylhydrazones and used DCC to identify potent inhibitors. The most potent inhibitor exhibits an IC50 value of 54 nM, which represents a 240-fold improvement in potency compared to the parent hits. Subsequent X-ray crystallography validated the predicted binding mode, thus demonstrating the efficiency of the combination of fragment linking and DCC as a hit-identification strategy. This approach could be applied to a range of biological targets, and holds the potential to facilitate hit-to-lead optimization. © 2016 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.
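
    As a rough reading aid, the fold-improvement figure in this abstract implies an approximate potency for the parent hits. The sketch below simply multiplies the reported IC50 by the reported fold change; it assumes the comparison is a plain IC50 ratio, which the abstract does not spell out, and the function name `implied_parent_ic50_nm` is hypothetical.

```python
def implied_parent_ic50_nm(lead_ic50_nm: float, fold_improvement: float) -> float:
    """Parent IC50 in nM implied by a lead IC50 and a fold improvement."""
    return lead_ic50_nm * fold_improvement


if __name__ == "__main__":
    parent_nm = implied_parent_ic50_nm(54, 240)  # values from the abstract
    print(f"implied parent IC50: about {parent_nm / 1000:.1f} µM")
```

    Under that assumption the parent hits would sit in the low micromolar range, roughly 13 µM.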

  10. ACFIS: a web server for fragment-based drug discovery.

    Science.gov (United States)

    Hao, Ge-Fei; Jiang, Wen; Ye, Yuan-Nong; Wu, Feng-Xu; Zhu, Xiao-Lei; Guo, Feng-Biao; Yang, Guang-Fu

    2016-07-08

    In order to foster innovation and improve the effectiveness of drug discovery, there is a considerable interest in exploring unknown 'chemical space' to identify new bioactive compounds with novel and diverse scaffolds. Hence, fragment-based drug discovery (FBDD) was developed rapidly due to its advanced expansive search for 'chemical space', which can lead to a higher hit rate and ligand efficiency (LE). However, computational screening of fragments is always hampered by the promiscuous binding model. In this study, we developed a new web server Auto Core Fragment in silico Screening (ACFIS). It includes three computational modules, PARA_GEN, CORE_GEN and CAND_GEN. ACFIS can generate core fragment structure from the active molecule using fragment deconstruction analysis and perform in silico screening by growing fragments to the junction of core fragment structure. An integrated energy calculation rapidly identifies which fragments fit the binding site of a protein. We constructed a simple interface to enable users to view top-ranking molecules in 2D and the binding mode in 3D for further experimental exploration. This makes the ACFIS a highly valuable tool for drug discovery. The ACFIS web server is free and open to all users at http://chemyang.ccnu.edu.cn/ccb/server/ACFIS/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Certain amplified genomic-DNA fragments (AGFs) may be involved in cell cycle progression and chloroquine is found to induce the production of cell-cycle-associated AGFs (CAGFs) in Plasmodium falciparum

    OpenAIRE

    Li, Gao-De

    2015-01-01

    It is well known that cyclins are a family of proteins that control cell-cycle progression by activating cyclin-dependent kinase. Based on our experimental results, we propose here a novel hypothesis that certain amplified genomic-DNA fragments (AGFs) may also be required for the cell cycle progression of eukaryotic cells and thus can be named as cell-cycle-associated AGFs (CAGFs). Like fluctuation in cyclin levels during cell cycle progression, these CAGFs are amplified and degraded at diffe...

  12. DNA methylation alteration is a major consequence of genome doubling in autotetraploid Brassica rapa

    Directory of Open Access Journals (Sweden)

    Xu Yanhao

    2017-01-01

    Full Text Available Polyploids are typically classified as autopolyploids or allopolyploids based on the origin of their chromosome sets. Autopolyploidy is much more common than traditionally believed. Allopolyploidization, accompanied by genomic and transcriptomic changes, has been well investigated. In this study, genetic, DNA methylation and gene expression changes in autotetraploid Brassica rapa were investigated. No genetic alteration was detected using an amplified fragment length polymorphism (AFLP) approach. Using a cDNA-AFLP approach, approximately 0.58% of fragments showed changes in gene expression in autotetraploid B. rapa. The methylation-sensitive amplification polymorphism (MSAP) analysis showed that approximately 1.7% of the fragments underwent DNA methylation changes upon genome doubling, with hypermethylation and demethylation changes occurring equally often. Fragments displaying changes in gene expression and methylation status were isolated, sequenced and characterized. This study showed that variation in cytosine methylation is a major consequence of genome doubling in autotetraploid Brassica rapa.

  13. Repetitive elements may comprise over two-thirds of the human genome.

    Directory of Open Access Journals (Sweden)

    A P Jason de Koning

    2011-12-01

    Full Text Available Transposable elements (TEs) are conventionally identified in eukaryotic genomes by alignment to consensus element sequences. Using this approach, about half of the human genome has been previously identified as TEs and low-complexity repeats. We recently developed a highly sensitive alternative de novo strategy, P-clouds, that instead searches for clusters of high-abundance oligonucleotides that are related in sequence space (oligo "clouds"). We show here that P-clouds predicts >840 Mbp of additional repetitive sequences in the human genome, thus suggesting that 66%-69% of the human genome is repetitive or repeat-derived. To investigate this remarkable difference, we conducted detailed analyses of the ability of both P-clouds and a commonly used conventional approach, RepeatMasker (RM), to detect different sized fragments of the highly abundant human Alu and MIR SINEs. RM can have surprisingly low sensitivity for even moderately long fragments, in contrast to P-clouds, which has good sensitivity down to small fragment sizes (∼25 bp). Although short fragments have a high intrinsic probability of being false positives, we performed a probabilistic annotation that reflects this fact. We further developed "element-specific" P-clouds (ESPs) to identify novel Alu and MIR SINE elements, and using it we identified ∼100 Mb of previously unannotated human elements. ESP estimates of new MIR sequences are in good agreement with RM-based predictions of the amount that RM missed. These results highlight the need for combined, probabilistic genome annotation approaches and suggest that the human genome consists of substantially more repetitive sequence than previously believed.
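
    The idea of starting from high-abundance oligonucleotides can be illustrated with a toy k-mer count. This is not the P-clouds algorithm, which additionally groups related oligos into "clouds" in sequence space; it is only a minimal sketch of the counting step, and the function name `abundant_kmers`, the word length and the threshold are all assumptions made for illustration.

```python
from collections import Counter


def abundant_kmers(seq: str, k: int, min_count: int) -> dict[str, int]:
    """Return every k-mer that occurs at least min_count times in seq."""
    seq = seq.upper()
    # Slide a window of width k across the sequence and tally each word.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return {kmer: n for kmer, n in counts.items() if n >= min_count}


if __name__ == "__main__":
    toy = "ACGTACGTACGT"  # toy repeat, not genomic data
    print(abundant_kmers(toy, k=4, min_count=3))
```

    A de novo repeat finder would then need a further step that clusters abundant words differing by a few substitutions, which is where the approach described in the abstract departs from plain counting.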

  14. Fragment-based screening by protein crystallography: successes and pitfalls.

    Science.gov (United States)

    Chilingaryan, Zorik; Yin, Zhou; Oakley, Aaron J

    2012-10-08

    Fragment-based drug discovery (FBDD) concerns the screening of low-molecular weight compounds against macromolecular targets of clinical relevance. These compounds act as starting points for the development of drugs. FBDD has evolved and grown in popularity over the past 15 years. In this paper, the rationale and technology behind the use of X-ray crystallography in fragment based screening (FBS) will be described, including fragment library design and use of synchrotron radiation and robotics for high-throughput X-ray data collection. Some recent uses of crystallography in FBS will be described in detail, including interrogation of the drug targets β-secretase, phenylethanolamine N-methyltransferase, phosphodiesterase 4A and Hsp90. These examples provide illustrations of projects where crystallography is straightforward or difficult, and where other screening methods can help overcome the limitations of crystallography necessitated by diffraction quality.

  15. Fragment-Based Screening by Protein Crystallography: Successes and Pitfalls

    Directory of Open Access Journals (Sweden)

    Aaron J. Oakley

    2012-10-01

    Full Text Available Fragment-based drug discovery (FBDD) concerns the screening of low-molecular weight compounds against macromolecular targets of clinical relevance. These compounds act as starting points for the development of drugs. FBDD has evolved and grown in popularity over the past 15 years. In this paper, the rationale and technology behind the use of X-ray crystallography in fragment based screening (FBS) will be described, including fragment library design and use of synchrotron radiation and robotics for high-throughput X-ray data collection. Some recent uses of crystallography in FBS will be described in detail, including interrogation of the drug targets β-secretase, phenylethanolamine N-methyltransferase, phosphodiesterase 4A and Hsp90. These examples provide illustrations of projects where crystallography is straightforward or difficult, and where other screening methods can help overcome the limitations of crystallography necessitated by diffraction quality.

  16. VirSorter: mining viral signal from microbial genomic data

    Directory of Open Access Journals (Sweden)

    Simon Roux

    2015-05-01

    Full Text Available Viruses of microbes impact all ecosystems where microbes drive key energy and substrate transformations including the oceans, humans and industrial fermenters. However, despite this recognized importance, our understanding of viral diversity and impacts remains limited by too few model systems and reference genomes. One way to fill these gaps in our knowledge of viral diversity is through the detection of viral signal in microbial genomic data. While multiple approaches have been developed and applied for the detection of prophages (viral genomes integrated in a microbial genome), new types of microbial genomic data are emerging that are more fragmented and larger scale, such as Single-cell Amplified Genomes (SAGs) of uncultivated organisms or genomic fragments assembled from metagenomic sequencing. Here, we present VirSorter, a tool designed to detect viral signal in these different types of microbial sequence data in both a reference-dependent and reference-independent manner, leveraging probabilistic models and extensive virome data to maximize detection of novel viruses. Performance testing shows that VirSorter’s prophage prediction capability compares to that of available prophage predictors for complete genomes, but is superior in predicting viral sequences outside of a host genome (i.e., from extrachromosomal prophages, lytic infections, or partially assembled prophages). Furthermore, VirSorter outperforms existing tools for fragmented genomic and metagenomic datasets, and can identify viral signal in assembled sequence (contigs) as short as 3kb, while providing near-perfect identification (>95% Recall and 100% Precision) on contigs of at least 10kb. Because VirSorter scales to large datasets, it can also be used in “reverse” to more confidently identify viral sequence in viral metagenomes by sorting away cellular DNA whether derived from gene transfer agents, generalized transduction or contamination. Finally, VirSorter is made

  17. VirSorter: mining viral signal from microbial genomic data

    Science.gov (United States)

    Roux, Simon; Enault, Francois; Hurwitz, Bonnie L.

    2015-01-01

    Viruses of microbes impact all ecosystems where microbes drive key energy and substrate transformations including the oceans, humans and industrial fermenters. However, despite this recognized importance, our understanding of viral diversity and impacts remains limited by too few model systems and reference genomes. One way to fill these gaps in our knowledge of viral diversity is through the detection of viral signal in microbial genomic data. While multiple approaches have been developed and applied for the detection of prophages (viral genomes integrated in a microbial genome), new types of microbial genomic data are emerging that are more fragmented and larger scale, such as Single-cell Amplified Genomes (SAGs) of uncultivated organisms or genomic fragments assembled from metagenomic sequencing. Here, we present VirSorter, a tool designed to detect viral signal in these different types of microbial sequence data in both a reference-dependent and reference-independent manner, leveraging probabilistic models and extensive virome data to maximize detection of novel viruses. Performance testing shows that VirSorter’s prophage prediction capability compares to that of available prophage predictors for complete genomes, but is superior in predicting viral sequences outside of a host genome (i.e., from extrachromosomal prophages, lytic infections, or partially assembled prophages). Furthermore, VirSorter outperforms existing tools for fragmented genomic and metagenomic datasets, and can identify viral signal in assembled sequence (contigs) as short as 3kb, while providing near-perfect identification (>95% Recall and 100% Precision) on contigs of at least 10kb. Because VirSorter scales to large datasets, it can also be used in “reverse” to more confidently identify viral sequence in viral metagenomes by sorting away cellular DNA whether derived from gene transfer agents, generalized transduction or contamination. Finally, VirSorter is made available through the i

  18. Broad genomic and transcriptional analysis reveals a highly derived genome in dinoflagellate mitochondria

    Directory of Open Access Journals (Sweden)

    Keeling Patrick J

    2007-09-01

    Full Text Available Background: Dinoflagellates comprise an ecologically significant and diverse eukaryotic phylum that is sister to the phylum containing apicomplexan endoparasites. The mitochondrial genome of apicomplexans is uniquely reduced in gene content and size, encoding only three proteins and two ribosomal RNAs (rRNAs) within a highly compacted 6 kb DNA. Dinoflagellate mitochondrial genomes have been comparatively poorly studied: limited available data suggest some similarities with apicomplexan mitochondrial genomes but an even more radical type of genomic organization. Here, we investigate structure, content and expression of dinoflagellate mitochondrial genomes. Results: From two dinoflagellates, Crypthecodinium cohnii and Karlodinium micrum, we generated over 42 kb of mitochondrial genomic data that indicate a reduced gene content paralleling that of mitochondrial genomes in apicomplexans, i.e., only three protein-encoding genes and at least eight conserved components of the highly fragmented large and small subunit rRNAs. Unlike in apicomplexans, dinoflagellate mitochondrial genes occur in multiple copies, often as gene fragments, and in numerous genomic contexts. Analysis of cDNAs suggests several novel aspects of dinoflagellate mitochondrial gene expression. Polycistronic transcripts were found, standard start codons are absent, and oligoadenylation occurs upstream of stop codons, resulting in the absence of termination codons. Transcripts of at least one gene, cox3, are apparently trans-spliced to generate full-length mRNAs. RNA substitutional editing, a process previously identified for mRNAs in dinoflagellate mitochondria, is also implicated in rRNA expression. Conclusion: The dinoflagellate mitochondrial genome shares the same gene complement and fragmentation of rRNA genes with its apicomplexan counterpart. However, it also exhibits several unique characteristics. Most notable are the expansion of gene copy numbers and their arrangements

  19. When fragments link : a bibliometric perspective on the development of fragment-based drug discovery

    NARCIS (Netherlands)

    Romasanta, A.K.S.; van der Sijde, P.C.; Hellsten, I.; Hubbard, Roderick E.; Keseru, Gyorgy M.; van Muijlwijk-Koezen, Jacqueline E.; de Esch, I.J.P.

    2018-01-01

    Fragment-based drug discovery (FBDD) is a highly interdisciplinary field, rich in ideas integrated from pharmaceutical sciences, chemistry, biology, and physics, among others. To enrich our understanding of the development of the field, we used bibliometric techniques to analyze 3642 publications in

  20. Site-specific genomic (SSG) and random domain-localized (RDL) mutagenesis in yeast

    Directory of Open Access Journals (Sweden)

    Honigberg Saul M

    2004-04-01

    Full Text Available Background: A valuable weapon in the arsenal available to yeast geneticists is the ability to introduce specific mutations into the yeast genome. In particular, methods have been developed to introduce deletions into the yeast genome using PCR fragments. These methods are highly efficient because they do not require cloning in plasmids. Results: We have modified the existing method for introducing deletions in the yeast (S. cerevisiae) genome using PCR fragments in order to target point mutations to this genome. We describe two PCR-based methods for directing point mutations into the yeast genome such that the final product contains no other disruptions. In the first method, site-specific genomic (SSG) mutagenesis, a specific point mutation is targeted into the genome. In the second method, random domain-localized (RDL) mutagenesis, a mutation is introduced at random within a specific domain of a gene. Both methods require two sequential transformations: the first transformation integrates the URA3 marker into the targeted locus, and the second transformation replaces URA3 with a PCR fragment containing one or a few mutations. This PCR fragment is synthesized using a primer containing a mutation (SSG mutagenesis) or is synthesized by error-prone PCR (RDL mutagenesis). In SSG mutagenesis, mutations that are proximal to the URA3 site are incorporated at higher frequencies than distal mutations; however, mutations can be introduced efficiently at distances of at least 500 bp from the URA3 insertion. In RDL mutagenesis, to ensure that incorporation of mutations occurs at approximately equal frequencies throughout the targeted region, this region is deleted at the same time URA3 is integrated. Conclusion: SSG and RDL mutagenesis allow point mutations to be easily and efficiently incorporated into the yeast genome without disrupting the native locus.

  1. The rise of fragment-based drug discovery.

    Science.gov (United States)

    Murray, Christopher W; Rees, David C

    2009-06-01

    The search for new drugs is plagued by high attrition rates at all stages in research and development. Chemists have an opportunity to tackle this problem because attrition can be traced back, in part, to the quality of the chemical leads. Fragment-based drug discovery (FBDD) is a new approach, increasingly used in the pharmaceutical industry, for reducing attrition and providing leads for previously intractable biological targets. FBDD identifies low-molecular-weight ligands (∼150 Da) that bind to biologically important macromolecules. The three-dimensional experimental binding mode of these fragments is determined using X-ray crystallography or NMR spectroscopy, and is used to facilitate their optimization into potent molecules with drug-like properties. Compared with high-throughput-screening, the fragment approach requires fewer compounds to be screened, and, despite the lower initial potency of the screening hits, offers more efficient and fruitful optimization campaigns. Here, we review the rise of FBDD, including its application to discovering clinical candidates against targets for which other chemistry approaches have struggled.

  2. A Web-Based Comparative Genomics Tutorial for Investigating Microbial Genomes

    Directory of Open Access Journals (Sweden)

    Michael Strong

    2009-12-01

    Full Text Available As the number of completely sequenced microbial genomes continues to rise at an impressive rate, it is important to prepare students with the skills necessary to investigate microorganisms at the genomic level. As a part of the core curriculum for first-year graduate students in the biological sciences, we have implemented a web-based tutorial to introduce students to the fields of comparative and functional genomics. The tutorial focuses on recent computational methods for identifying functionally linked genes and proteins on a genome-wide scale and was used to introduce students to the Rosetta Stone, Phylogenetic Profile, conserved Gene Neighbor, and Operon computational methods. Students learned to use a number of publicly available web servers and databases to identify functionally linked genes in the Escherichia coli genome, with emphasis on genome organization and operon structure. The overall effectiveness of the tutorial was assessed based on student evaluations and homework assignments. The tutorial is available to other educators at http://www.doe-mbi.ucla.edu/~strong/m253.php.

  3. In silico fragment-based drug discovery: setup and validation of a fragment-to-lead computational protocol using S4MPLE.

    Science.gov (United States)

    Hoffer, Laurent; Renaud, Jean-Paul; Horvath, Dragos

    2013-04-22

    This paper describes the use and validation of S4MPLE in Fragment-Based Drug Design (FBDD)--a strategy to build drug-like ligands starting from small compounds called fragments. S4MPLE is a conformational sampling tool based on a hybrid genetic algorithm that is able to simulate one (conformer enumeration) or more molecules (docking). The goal of the current paper is to show that due to the judicious design of genetic operators, S4MPLE may be used without any specific adaptation as an in silico FBDD tool. Such fragment-to-lead evolution involves either growing of one or linking of several fragment-like binder(s). The native ability to specifically "dock" a substructure that is covalently anchored to its target (here, some prepositioned fragment formally part of the binding site) enables it to act like dedicated de novo builders and differentiates it from most classical docking tools, which may only cope with non-covalent interactions. Besides, S4MPLE may address growing/linking scenarios involving protein site flexibility, and it might also suggest "growth" moves by bridging the ligand to the site via water-mediated interactions if H2O molecules are simply appended to the input files. Therefore, the only development overhead required to build a virtual fragment→ligand growing/linking strategy based on S4MPLE were two chemoinformatics programs meant to provide a minimalistic management of the linker library. The first creates a duplicate-free library by fragmenting a compound database, whereas the second builds new compounds, attaching chemically compatible linkers to the starting fragments. S4MPLE is subsequently used to probe the optimal placement of the linkers within the binding site, with initial restraints on atoms from initial fragments, followed by an optimization of all kept poses after restraint removal. Ranking is mainly based on two criteria: force-field potential energy and RMSD shifts of the original fragment moieties. This strategy was applied to

  4. Lessons from hot spot analysis for fragment-based drug discovery

    Science.gov (United States)

    Hall, David R.; Vajda, Sandor

    2015-01-01

    Analysis of binding energy hot spots at protein surfaces can provide crucial insights into the prospects for successful application of fragment-based drug discovery (FBDD), and whether a fragment hit can be advanced into a high affinity, druglike ligand. The key factor is the strength of the top ranking hot spot, and how well a given fragment complements it. We show that published data are sufficient to provide a sophisticated and quantitative understanding of how hot spots derive from protein three-dimensional structure, and how their strength, number and spatial arrangement govern the potential for a surface site to bind to fragment-sized and larger ligands. This improved understanding provides important guidance for the effective application of FBDD in drug discovery. PMID:26538314

  5. GenColors-based comparative genome databases for small eukaryotic genomes.

    Science.gov (United States)

    Felder, Marius; Romualdi, Alessandro; Petzold, Andreas; Platzer, Matthias; Sühnel, Jürgen; Glöckner, Gernot

    2013-01-01

    Many sequence data repositories can give a quick and easily accessible overview on genomes and their annotations. Less widespread is the possibility to compare related genomes with each other in a common database environment. We have previously described the GenColors database system (http://gencolors.fli-leibniz.de) and its applications to a number of bacterial genomes such as Borrelia, Legionella, Leptospira and Treponema. This system has an emphasis on genome comparison. It combines data from related genomes and provides the user with an extensive set of visualization and analysis tools. Eukaryote genomes are normally larger than prokaryote genomes and thus pose additional challenges for such a system. We have, therefore, adapted GenColors to also handle larger datasets of small eukaryotic genomes and to display eukaryotic gene structures. Further recent developments include whole genome views, genome list options and, for bacterial genome browsers, the display of horizontal gene transfer predictions. Two new GenColors-based databases for two fungal species (http://fgb.fli-leibniz.de) and for four social amoebas (http://sacgb.fli-leibniz.de) were set up. Both new resources open up a single entry point for related genomes for the amoebozoa and fungal research communities and other interested users. Comparative genomics approaches are greatly facilitated by these resources.

  6. Extensive error in the number of genes inferred from draft genome assemblies.

    Directory of Open Access Journals (Sweden)

    James F Denton

    2014-12-01

    Full Text Available Current sequencing methods produce large amounts of data, but genome assemblies based on these data are often woefully incomplete. These incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. In this paper we investigate the magnitude of the problem, both in terms of total gene number and the number of copies of genes in specific families. To do this, we compare multiple draft assemblies against higher-quality versions of the same genomes, using several new assemblies of the chicken genome based on both traditional and next-generation sequencing technologies, as well as published draft assemblies of chimpanzee. We find that upwards of 40% of all gene families are inferred to have the wrong number of genes in draft assemblies, and that these incorrect assemblies both add and subtract genes. Using simulated genome assemblies of Drosophila melanogaster, we find that the major cause of increased gene numbers in draft genomes is the fragmentation of genes onto multiple individual contigs. Finally, we demonstrate the usefulness of RNA-Seq in improving the gene annotation of draft assemblies, largely by connecting genes that have been fragmented in the assembly process.

  7. GAAP: Genome-organization-framework-Assisted Assembly Pipeline for prokaryotic genomes.

    Science.gov (United States)

    Yuan, Lina; Yu, Yang; Zhu, Yanmin; Li, Yulai; Li, Changqing; Li, Rujiao; Ma, Qin; Siu, Gilman Kit-Hang; Yu, Jun; Jiang, Taijiao; Xiao, Jingfa; Kang, Yu

    2017-01-25

    Next-generation sequencing (NGS) technologies have greatly promoted the genomic study of prokaryotes. However, highly fragmented assemblies due to short reads from NGS are still a limiting factor in gaining insights into the genome biology. Reference-assisted tools are promising in genome assembly, but tend to result in false assembly when the assigned reference has extensive rearrangements. Herein, we present GAAP, a genome assembly pipeline for scaffolding based on core-gene-defined Genome Organizational Framework (cGOF) described in our previous study. Instead of assigning references, we use the multiple-reference-derived cGOFs as indexes to assist in order and orientation of the scaffolds and build a skeleton structure, and then use read pairs to extend scaffolds, called local scaffolding, and distinguish between true and chimeric adjacencies in the scaffolds. In our performance tests using both empirical and simulated data of 15 genomes in six species with diverse genome size, complexity, and all three categories of cGOFs, GAAP outcompetes or achieves comparable results when compared to three other reference-assisted programs, AlignGraph, Ragout and MeDuSa. GAAP uses both cGOF and pair-end reads to create assemblies in genomic scale, and performs better than the currently available reference-assisted assembly tools as it recovers more assemblies and makes fewer false locations, especially for species with extensive rearranged genomes. Our method is a promising solution for reconstruction of genome sequence from short reads of NGS.

  8. Target Immobilization as a Strategy for NMR-Based Fragment Screening: Comparison of TINS, STD, and SPR for Fragment Hit Identification

    NARCIS (Netherlands)

    Kobayashi, M.; Retra, K.; Figaroa, F.; Hollander, J.G.; Ab, E.; Heetebrij, R.J.; Irth, H.; Siegal, G.

    2010-01-01

    Fragment-based drug discovery (FBDD) has become a widely accepted tool that is complementary to high-throughput screening (HTS) in developing small-molecule inhibitors of pharmaceutical targets. Because a fragment campaign can only be as successful as the hit matter found, it is critical that the

  9. Mutant DNA quantification by digital PCR can be confounded by heating during DNA fragmentation.

    Science.gov (United States)

    Kang, Qing; Parkin, Brian; Giraldez, Maria D; Tewari, Muneesh

    2016-04-01

    Digital PCR (dPCR) is gaining popularity as a DNA mutation quantification method for clinical specimens. Fragmentation prior to dPCR is required for non-fragmented genomic DNA samples; however, the effect of fragmentation on DNA analysis has not been well-studied. Here we evaluated three fragmentation methods for their effects on dPCR point mutation assay performance. Wild-type (WT) human genomic DNA was fragmented by heating, restriction digestion, or acoustic shearing using a Covaris focused-ultrasonicator. dPCR was then used to determine the limit of blank (LoB) by quantifying observed WT and mutant allele counts of the proto-oncogenes KRAS and BRAF in the WT DNA sample. DNA fragmentation by heating to 95°C, while the simplest and least expensive method, produced a high background mutation frequency for certain KRAS mutations relative to the other methods. This was due to heat-induced mutations, specifically affecting dPCR assays designed to interrogate guanine to adenine (G>A) mutations. Moreover, heat-induced fragmentation overestimated gene copy number, potentially due to denaturation and partition of single-stranded DNA into different droplets. Covaris acoustic shearing and restriction enzyme digestion showed similar LoBs and gene copy number estimates to one another. It should be noted that moderate heating, commonly used in genomic DNA extraction protocols, did not significantly increase observed KRAS mutation counts.
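
The proposed explanation for copy number overestimation can be made concrete with the standard Poisson correction used in dPCR, where the mean number of target copies per partition is estimated from the fraction of positive partitions as λ = −ln(1 − p). The sketch below is illustrative only: the function names and the numbers are hypothetical, and it assumes that complete denaturation turns each duplex into two independently partitioned, individually detectable strands.

```python
import math


def expected_positive_fraction(mean_copies):
    """Fraction of partitions expected to hold at least one target,
    assuming targets are Poisson-distributed across partitions."""
    return 1.0 - math.exp(-mean_copies)


def copies_per_partition(positive_fraction):
    """Poisson-corrected mean target copies per partition."""
    if not 0.0 <= positive_fraction < 1.0:
        raise ValueError("positive_fraction must be in [0, 1)")
    return -math.log(1.0 - positive_fraction)


# Hypothetical sample: 0.1 duplex molecules per partition on average.
duplex_mean = 0.1

# Intact duplexes: each molecule occupies one partition.
intact = copies_per_partition(expected_positive_fraction(duplex_mean))

# Assumed worst case after heating: every duplex is split into two
# single strands that partition independently and are both detected.
denatured = copies_per_partition(expected_positive_fraction(2 * duplex_mean))

print(f"apparent copies per partition, intact:    {intact:.3f}")
print(f"apparent copies per partition, denatured: {denatured:.3f}")
```

Under these assumptions the apparent copy number doubles, even though the Poisson correction itself is applied correctly; partial denaturation would give an inflation factor between one and two.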

  10. Ionization and fragmentation of DNA-RNA bases: a density functional theory study

    International Nuclear Information System (INIS)

    Sadr-Arani, Leila

    2014-01-01

    Ionizing radiation (IR) crosses human tissue, deposits energy, and dissipates it by fragmenting molecules. The resulting fragments can be detected by mass spectrometry. Despite the amount of information obtained experimentally by interpreting the mass spectrum, experiment alone cannot answer all the questions about the fragmentation mechanism of DNA/RNA bases, and a theoretical study complements this information. A theoretical study reveals the weakest bonds in the molecule during ionization and thus may help to identify dissociation mechanisms and the fragments produced. The purpose of this work, using DFT with the PBE functional, is to study the ionization and fragmentation mechanisms of DNA/RNA bases (Uracil, Cytosine, Adenine and Guanine) and to identify the cations corresponding to each peak in mass spectra. For all RNA bases, the retro Diels-Alder reaction (elimination of HNCO or NCO*) is a major dissociation route, with the exception of adenine, which has no oxygen atom in its structure. Loss of an NH3 (NH2*) molecule is another route common to all bases that contain an amine group. The possibility of the loss of hydrogen from the cations is also investigated, as well as the dissociation of dehydrogenated cations and protonated uracil. This work shows the value of DFT calculations in the interpretation of mass spectra of DNA bases. (author)

  11. Lessons from Hot Spot Analysis for Fragment-Based Drug Discovery.

    Science.gov (United States)

    Hall, David R; Kozakov, Dima; Whitty, Adrian; Vajda, Sandor

    2015-11-01

    Analysis of binding energy hot spots at protein surfaces can provide crucial insights into the prospects for successful application of fragment-based drug discovery (FBDD), and whether a fragment hit can be advanced into a high-affinity, drug-like ligand. The key factor is the strength of the top ranking hot spot, and how well a given fragment complements it. We show that published data are sufficient to provide a sophisticated and quantitative understanding of how hot spots derive from a protein 3D structure, and how their strength, number, and spatial arrangement govern the potential for a surface site to bind to fragment-sized and larger ligands. This improved understanding provides important guidance for the effective application of FBDD in drug discovery. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. Origin and differentiation of a special fragment from Capra hircus ...

    African Journals Online (AJOL)

    Administrator

    2011-09-07

    Sep 7, 2011 ... regions of the special fragment in the GenBank of NCBI. A total number of 80 fragments with identity ... recombined during the long period of evolution within and among species, and might be related to ..... and their association to coat color phenotypes in horses (Equus caballus). Mammalian Genome, 12: ...

  13. WormBase: Annotating many nematode genomes.

    Science.gov (United States)

    Howe, Kevin; Davis, Paul; Paulini, Michael; Tuli, Mary Ann; Williams, Gary; Yook, Karen; Durbin, Richard; Kersey, Paul; Sternberg, Paul W

    2012-01-01

    WormBase (www.wormbase.org) has been serving the scientific community for over 11 years as the central repository for genomic and genetic information for the soil nematode Caenorhabditis elegans. The resource has evolved from its beginnings as a database housing the genomic sequence and genetic and physical maps of a single species, and now represents the breadth and diversity of nematode research, currently serving genome sequence and annotation for around 20 nematodes. In this article, we focus on WormBase's role of genome sequence annotation, describing how we annotate and integrate data from a growing collection of nematode species and strains. We also review our approaches to sequence curation, and discuss the impact on annotation quality of large functional genomics projects such as modENCODE.

  14. Fragment-based drug discovery and its application to challenging drug targets.

    Science.gov (United States)

    Price, Amanda J; Howard, Steven; Cons, Benjamin D

    2017-11-08

    Fragment-based drug discovery (FBDD) is a technique for identifying low molecular weight chemical starting points for drug discovery. Since its inception 20 years ago, FBDD has grown in popularity to the point where it is now an established technique in industry and academia. The approach involves the biophysical screening of proteins against collections of low molecular weight compounds (fragments). Although fragments bind to proteins with relatively low affinity, they form efficient, high quality binding interactions with the protein architecture as they have to overcome a significant entropy barrier to bind. Of the biophysical methods available for fragment screening, X-ray protein crystallography is one of the most sensitive and least prone to false positives. It also provides detailed structural information of the protein-fragment complex at the atomic level. Fragment-based screening using X-ray crystallography is therefore an efficient method for identifying binding hotspots on proteins, which can then be exploited by chemists and biologists for the discovery of new drugs. The use of FBDD is illustrated here with a recently published case study of a drug discovery programme targeting the challenging protein-protein interaction Kelch-like ECH-associated protein 1:nuclear factor erythroid 2-related factor 2. © 2017 The Author(s). Published by Portland Press Limited on behalf of the Biochemical Society.

  15. ALIS-FLP: Amplified ligation selected fragment-length polymorphism method for microbial genotyping

    DEFF Research Database (Denmark)

    Brillowska-Dabrowska, A.; Wianecka, M.; Dabrowski, Slawomir

    2008-01-01

    A DNA fingerprinting method known as ALIS-FLP (amplified ligation selected fragment-length polymorphism) has been developed for selective and specific amplification of restriction fragments from TspRI restriction endonuclease digested genomic DNA. The method is similar to AFLP, but differs...

  16. Applicability of SCAR markers to food genomics: olive oil traceability.

    Science.gov (United States)

    Pafundo, Simona; Agrimonti, Caterina; Maestri, Elena; Marmiroli, Nelson

    2007-07-25

    DNA analysis with molecular markers has opened a shortcut toward a genomic comprehension of complex organisms. The availability of micro-DNA extraction methods, coupled with selective amplification of the smallest extracted fragments with molecular markers, could equally bring a breakthrough in food genomics: the identification of original components in food. Amplified fragment length polymorphisms (AFLPs) have been instrumental in plant genomics because they may allow rapid and reliable analysis of multiple and potentially polymorphic sites. Nevertheless, their direct application to the analysis of DNA extracted from food matrixes is complicated by the low quality of the extracted DNA: its high degradation and the presence of inhibitors of enzymatic reactions. The conversion of an AFLP fragment to a robust and specific single-locus PCR-based marker, therefore, could extend the use of molecular markers to large-scale analysis of complex agro-food matrixes. The present study reports the development of sequence characterized amplified regions (SCARs) starting from AFLP profiles of monovarietal olive oils analyzed on agarose gel; one of these was used to identify differences among 56 olive cultivars. All the developed markers were purposefully amplified in olive oils to apply them to olive oil traceability.

  17. Physics-Based Fragment Acceleration Modeling for Pressurized Tank Burst Risk Assessments

    Science.gov (United States)

    Manning, Ted A.; Lawrence, Scott L.

    2014-01-01

    As part of comprehensive efforts to develop physics-based risk assessment techniques for space systems at NASA, coupled computational fluid and rigid body dynamic simulations were carried out to investigate the flow mechanisms that accelerate tank fragments in bursting pressurized vessels. Simulations of several configurations were compared to analyses based on the industry-standard Baker explosion model, and were used to formulate an improved version of the model. The standard model, which neglects an external fluid, was found to agree best with simulation results only in configurations where the internal-to-external pressure ratio is very high and fragment curvature is small. The improved model introduces terms that accommodate an external fluid and better account for variations based on circumferential fragment count. Physics-based analysis was critical in increasing the model's range of applicability. The improved tank burst model can be used to produce more accurate risk assessments of space vehicle failure modes that involve high-speed debris, such as exploding propellant tanks and bursting rocket engines.

  18. Site Identification by Ligand Competitive Saturation (SILCS) Simulations for Fragment-Based Drug Design

    OpenAIRE

    Faller, Christina E.; Raman, E. Prabhu; MacKerell, Alexander D.; Guvench, Olgun

    2015-01-01

    Fragment-based drug design (FBDD) involves screening low molecular weight molecules (“fragments”) that correspond to functional groups found in larger drug-like molecules to determine their binding to target proteins or nucleic acids. Based on the principle of thermodynamic additivity, two fragments that bind non-overlapping nearby sites on the target can be combined to yield a new molecule whose binding free energy is the sum of those of the fragments. Experimental FBDD approaches, like NMR ...

  19. Novel approach of fragment-based lead discovery applied to renin inhibitors.

    Science.gov (United States)

    Tawada, Michiko; Suzuki, Shinkichi; Imaeda, Yasuhiro; Oki, Hideyuki; Snell, Gyorgy; Behnke, Craig A; Kondo, Mitsuyo; Tarui, Naoki; Tanaka, Toshimasa; Kuroita, Takanobu; Tomimoto, Masaki

    2016-11-15

    A novel approach was conducted for fragment-based lead discovery and applied to renin inhibitors. The biochemical screening of a fragment library against renin provided the hit fragment which showed a characteristic interaction pattern with the target protein. The hit fragment bound only to the S1, S3, and S3 SP (S3 subpocket) sites without any interactions with the catalytic aspartate residues (Asp32 and Asp215 (pepsin numbering)). Prior to making chemical modifications to the hit fragment, we first identified its essential binding sites by utilizing the hit fragment's substructures. Second, we created a new and smaller scaffold, which better occupied the identified essential S3 and S3 SP sites, by utilizing library synthesis with high-throughput chemistry. We then revisited the S1 site and efficiently explored a good building block attaching to the scaffold with library synthesis. In the library syntheses, the binding modes of each pivotal compound were determined and confirmed by X-ray crystallography and the library was strategically designed by structure-based computational approach not only to obtain a more active compound but also to obtain informative Structure Activity Relationship (SAR). As a result, we obtained a lead compound offering synthetic accessibility as well as the improved in vitro ADMET profiles. The fragments and compounds possessing a characteristic interaction pattern provided new structural insights into renin's active site and the potential to create a new generation of renin inhibitors. In addition, we demonstrated our FBDD strategy integrating highly sensitive biochemical assay, X-ray crystallography, and high-throughput synthesis and in silico library design aimed at fragment morphing at the initial stage was effective to elucidate a pocket profile and a promising lead compound. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. qPCR-based mitochondrial DNA quantification: Influence of template DNA fragmentation on accuracy

    International Nuclear Information System (INIS)

    Jackson, Christopher B.; Gallati, Sabina; Schaller, André

    2012-01-01

    Highlights: ► Serial qPCR accurately determines fragmentation state of any given DNA sample. ► Serial qPCR demonstrates different preservation of the nuclear and mitochondrial genome. ► Serial qPCR provides a diagnostic tool to validate the integrity of bioptic material. ► Serial qPCR excludes degradation-induced erroneous quantification. -- Abstract: Real-time PCR (qPCR) is the method of choice for quantification of mitochondrial DNA (mtDNA) by relative comparison of a nuclear to a mitochondrial locus. Quantitatively abnormal mtDNA content is indicative of mitochondrial disorders and is mostly confined in a tissue-specific manner. Thus handling of degradation-prone bioptic material is inevitable. We established a serial qPCR assay based on increasing amplicon size to measure degradation status of any DNA sample. Using this approach we can exclude erroneous mtDNA quantification due to degraded samples (e.g. long post-excision time, autolytic processes, freeze–thaw cycles) and ensure abnormal DNA content measurements (e.g. depletion) in non-degraded patient material. By preparation of degraded DNA under controlled conditions using sonication and DNaseI digestion we show that erroneous quantification is due to the different preservation qualities of the nuclear and the mitochondrial genome. This disparate degradation of the two genomes results in over- or underestimation of mtDNA copy number in degraded samples. Moreover, as analysis of defined archival tissue would help to specify the molecular pathomechanism of mitochondrial disorders presenting with abnormal mtDNA content, we compared fresh frozen (FF) with formalin-fixed paraffin-embedded (FFPE) skeletal muscle tissue of the same sample. By extrapolation of measured decay constants for nuclear DNA (λ_nDNA) and mtDNA (λ_mtDNA) we present an approach to possibly correct measurements in degraded samples in the future. To our knowledge this is the first time different degradation impact of the two

  1. qPCR-based mitochondrial DNA quantification: Influence of template DNA fragmentation on accuracy

    Energy Technology Data Exchange (ETDEWEB)

    Jackson, Christopher B., E-mail: Christopher.jackson@insel.ch [Division of Human Genetics, Departments of Pediatrics and Clinical Research, Inselspital, University of Berne, Freiburgstrasse, CH-3010 Berne (Switzerland); Gallati, Sabina, E-mail: sabina.gallati@insel.ch [Division of Human Genetics, Departments of Pediatrics and Clinical Research, Inselspital, University of Berne, Freiburgstrasse, CH-3010 Berne (Switzerland); Schaller, Andre, E-mail: andre.schaller@insel.ch [Division of Human Genetics, Departments of Pediatrics and Clinical Research, Inselspital, University of Berne, Freiburgstrasse, CH-3010 Berne (Switzerland)

    2012-07-06

    Highlights: ► Serial qPCR accurately determines fragmentation state of any given DNA sample. ► Serial qPCR demonstrates different preservation of the nuclear and mitochondrial genome. ► Serial qPCR provides a diagnostic tool to validate the integrity of bioptic material. ► Serial qPCR excludes degradation-induced erroneous quantification. -- Abstract: Real-time PCR (qPCR) is the method of choice for quantification of mitochondrial DNA (mtDNA) by relative comparison of a nuclear to a mitochondrial locus. Quantitatively abnormal mtDNA content is indicative of mitochondrial disorders and is mostly confined in a tissue-specific manner. Thus handling of degradation-prone bioptic material is inevitable. We established a serial qPCR assay based on increasing amplicon size to measure degradation status of any DNA sample. Using this approach we can exclude erroneous mtDNA quantification due to degraded samples (e.g. long post-excision time, autolytic processes, freeze-thaw cycles) and ensure abnormal DNA content measurements (e.g. depletion) in non-degraded patient material. By preparation of degraded DNA under controlled conditions using sonication and DNaseI digestion we show that erroneous quantification is due to the different preservation qualities of the nuclear and the mitochondrial genome. This disparate degradation of the two genomes results in over- or underestimation of mtDNA copy number in degraded samples. Moreover, as analysis of defined archival tissue would help to specify the molecular pathomechanism of mitochondrial disorders presenting with abnormal mtDNA content, we compared fresh frozen (FF) with formalin-fixed paraffin-embedded (FFPE) skeletal muscle tissue of the same sample. By extrapolation of measured decay constants for nuclear DNA (λ_nDNA) and mtDNA (λ_mtDNA) we present an approach to possibly correct measurements in
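
    The decay-constant idea in this record can be illustrated with a minimal sketch. It assumes that the number of amplifiable copies falls off exponentially with amplicon length, N(L) = N0 * exp(-lambda * L), so that a log-linear fit over a series of amplicon sizes yields a decay constant and an extrapolated zero-length copy number. The exponential form, the function names, and the data values are all assumptions made for illustration; none of them is taken from the paper.

```python
import math

def fit_decay(amplicon_lengths, copies):
    """Least-squares fit of ln(copies) = ln(N0) - lam * length.

    Returns (N0, lam). The exponential model is an assumption of this sketch.
    """
    n = len(amplicon_lengths)
    ys = [math.log(c) for c in copies]
    mean_x = sum(amplicon_lengths) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in amplicon_lengths)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(amplicon_lengths, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope

def corrected_ratio(lengths, mt_copies, n_copies):
    """Extrapolate both loci to zero amplicon length and return the
    mtDNA/nDNA ratio together with the two fitted decay constants."""
    mt0, lam_mt = fit_decay(lengths, mt_copies)
    n0, lam_n = fit_decay(lengths, n_copies)
    return mt0 / n0, lam_mt, lam_n

# Hypothetical serial-qPCR readings at four amplicon sizes (bp).
lengths = [100, 200, 400, 800]
mt = [1000 * math.exp(-0.001 * L) for L in lengths]   # slower decay
nuc = [10 * math.exp(-0.003 * L) for L in lengths]    # faster decay
ratio, lam_mt, lam_n = corrected_ratio(lengths, mt, nuc)
```

    In this made-up data set the nuclear locus decays faster than the mitochondrial one, so an uncorrected ratio taken at a long amplicon would overestimate mtDNA copy number; extrapolating both loci to zero length removes that bias under the stated model.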

  2. Phylogenetic analysis of Gossypium L. using restriction fragment length polymorphism of repeated sequences.

    Science.gov (United States)

    Zhang, Meiping; Rong, Ying; Lee, Mi-Kyung; Zhang, Yang; Stelly, David M; Zhang, Hong-Bin

    2015-10-01

    Cotton is the world's leading textile fiber crop and is also grown as a bioenergy and food crop. Knowledge of the phylogeny of closely related species and the genome origin and evolution of polyploid species is significant for advanced genomics research and breeding. We have reconstructed the phylogeny of the cotton genus, Gossypium L., and deciphered the genome origin and evolution of its five polyploid species by restriction fragment analysis of repeated sequences. Nuclear DNA of 84 accessions representing 35 species and all eight genomes of the genus were analyzed. The phylogenetic tree of the genus was reconstructed using the parsimony method on 1033 polymorphic repeated sequence restriction fragments. The genome origin of its polyploids was determined by calculating the diploid-polyploid restriction fragment correspondence (RFC). The tree is consistent with the morphological classification, genome designation and geographic distribution of the species at subgenus, section and subsection levels. Gossypium lobatum (D7) was unambiguously shown to have the highest RFC with the D-subgenomes of all five polyploids of the genus, while the common ancestor of Gossypium herbaceum (A1) and Gossypium arboreum (A2) likely contributed to the A-subgenomes of the polyploids. These results provide a comprehensive phylogenetic tree of the cotton genus and new insights into the genome origin and evolution of its polyploid species. The results also further demonstrate a simple, rapid and inexpensive method suitable for phylogenetic analysis of closely related species, especially congeneric species, and the inference of genome origin of polyploids that constitute over 70 % of flowering plants.

  3. Fragment virtual screening based on Bayesian categorization for discovering novel VEGFR-2 scaffolds.

    Science.gov (United States)

    Zhang, Yanmin; Jiao, Yu; Xiong, Xiao; Liu, Haichun; Ran, Ting; Xu, Jinxing; Lu, Shuai; Xu, Anyang; Pan, Jing; Qiao, Xin; Shi, Zhihao; Lu, Tao; Chen, Yadong

    2015-11-01

    The discovery of novel scaffolds against a specific target has long been one of the most significant but challenging goals in discovering lead compounds. A scaffold that binds in important regions of the active pocket is more favorable as a starting point because scaffolds generally possess greater optimization possibilities. However, due to the lack of sufficient chemical space diversity of the databases and the ineffectiveness of the screening methods, it still remains a great challenge to discover novel active scaffolds. Given the strengths and weaknesses of both fragment-based drug design and traditional virtual screening (VS), we proposed a fragment VS concept based on Bayesian categorization for the discovery of novel scaffolds. This work investigated the proposal through an application on the VEGFR-2 target. Firstly, scaffold and structural diversity of chemical space for 10 compound databases were explicitly evaluated. Simultaneously, a robust Bayesian classification model was constructed for screening not only compound databases but also their corresponding fragment databases. Although analysis of the scaffold diversity demonstrated a very uneven distribution of scaffolds over molecules, results showed that our Bayesian model behaved better in screening fragments than molecules. Through a retrospective literature search, several generated fragments with relatively high Bayesian scores were indeed found to exhibit VEGFR-2 biological activity, which strongly supports the effectiveness of fragment VS based on Bayesian categorization models. This investigation of Bayesian-based fragment VS can further emphasize the necessity for enrichment of compound databases employed in lead discovery by amplifying the diversity of databases with novel structures.

  4. Assessment of Dengue virus helicase and methyltransferase as targets for fragment-based drug discovery.

    Science.gov (United States)

    Coutard, Bruno; Decroly, Etienne; Li, Changqing; Sharff, Andrew; Lescar, Julien; Bricogne, Gérard; Barral, Karine

    2014-06-01

    Seasonal and pandemic flaviviruses continue to be leading global health concerns. With the view to help drug discovery against Dengue virus (DENV), a fragment-based experimental approach was applied to identify small molecule ligands targeting two main components of the flavivirus replication complex: the NS3 helicase (Hel) and the NS5 mRNA methyltransferase (MTase) domains. A library of 500 drug-like fragments was first screened by thermal-shift assay (TSA) leading to the identification of 36 and 32 fragment hits binding Hel and MTase from DENV, respectively. In a second stage, we set up a fragment-based X-ray crystallographic screening (FBS-X) in order to provide both validated fragment hits and structural binding information. No fragment hit was confirmed for DENV Hel. In contrast, a total of seven fragments were identified as DENV MTase binders and structures of MTase-fragment hit complexes were solved at a resolution of 2.0 Å or better. All fragment hits identified contain either a five- or six-membered aromatic ring or both, and three novel binding sites were located on the MTase. To further characterize the fragment hits identified by TSA and FBS-X, we performed enzymatic assays to assess their inhibition effect on the N7- and 2'-O-MTase enzymatic activities: five of these fragment hits inhibit at least one of the two activities with IC50 ranging from 180 μM to 9 mM. This work validates the FBS-X strategy for identifying new anti-flaviviral hits targeting MTase, while Hel might not be an amenable target for fragment-based drug discovery (FBDD). This approach proved to be a fast and efficient screening method for FBDD target validation and discovery of starting hits for the development of higher affinity molecules that bind to novel allosteric sites. Copyright © 2014 Elsevier B.V. All rights reserved.

  5. Site Identification by Ligand Competitive Saturation (SILCS) simulations for fragment-based drug design.

    Science.gov (United States)

    Faller, Christina E; Raman, E Prabhu; MacKerell, Alexander D; Guvench, Olgun

    2015-01-01

    Fragment-based drug design (FBDD) involves screening low molecular weight molecules ("fragments") that correspond to functional groups found in larger drug-like molecules to determine their binding to target proteins or nucleic acids. Based on the principle of thermodynamic additivity, two fragments that bind nonoverlapping nearby sites on the target can be combined to yield a new molecule whose binding free energy is the sum of those of the fragments. Experimental FBDD approaches, like NMR and X-ray crystallography, have proven very useful but can be expensive in terms of time, materials, and labor. Accordingly, a variety of computational FBDD approaches have been developed that provide different levels of detail and accuracy. The Site Identification by Ligand Competitive Saturation (SILCS) method of computational FBDD uses all-atom explicit-solvent molecular dynamics (MD) simulations to identify fragment binding. The target is "soaked" in an aqueous solution with multiple fragments having different identities. The resulting computational competition assay reveals what small molecule types are most likely to bind which regions of the target. From SILCS simulations, 3D probability maps of fragment binding called "FragMaps" can be produced. Based on the probabilities relative to bulk, SILCS FragMaps can be used to determine "Grid Free Energies (GFEs)," which provide per-atom contributions to fragment binding affinities. For essentially no additional computational overhead relative to the production of the FragMaps, GFEs can be used to compute Ligand Grid Free Energies (LGFEs) for arbitrarily complex molecules, and these LGFEs can be used to rank-order the molecules in accordance with binding affinities.
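
    The step from probabilities relative to bulk to per-atom free energies and a ligand ranking can be sketched as follows. The sketch assumes a Boltzmann inversion, GFE = -RT ln(occupancy / bulk occupancy), and a plain sum over atoms for the LGFE. The function names, the occupancy values, and the ligand names are hypothetical, and the real method works on 3D voxel maps per atom type rather than on the flat lists used here.

```python
import math

R_KCAL = 0.0019872  # gas constant in kcal/(mol*K)

def grid_free_energy(occupancy, bulk_occupancy, temperature=298.0):
    """Boltzmann inversion of an occupancy relative to bulk (assumed form)."""
    return -R_KCAL * temperature * math.log(occupancy / bulk_occupancy)

def ligand_grid_free_energy(atom_occupancies, bulk_occupancy,
                            temperature=298.0):
    """Sum of per-atom GFEs; each entry is the map occupancy at the voxel
    where one ligand atom sits (hypothetical inputs)."""
    return sum(grid_free_energy(o, bulk_occupancy, temperature)
               for o in atom_occupancies)

# Hypothetical occupancies sampled at the atom positions of three ligands.
ligands = {
    "lig_a": [4.0, 2.0, 1.0],
    "lig_b": [8.0, 4.0, 2.0],
    "lig_c": [0.5, 1.0, 1.0],
}
bulk = 1.0

# More negative LGFE means more favorable predicted binding.
ranked = sorted(ligands,
                key=lambda name: ligand_grid_free_energy(ligands[name], bulk))
```

    Because the LGFE is a plain sum, the same additivity that motivates fragment linking carries over: combining atoms that sit in favorable, non-overlapping regions lowers the total score.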

  6. Fragment-based approaches to anti-HIV drug discovery: state of the art and future opportunities.

    Science.gov (United States)

    Huang, Boshi; Kang, Dongwei; Zhan, Peng; Liu, Xinyong

    2015-12-01

    The search for additional drugs to treat HIV infection is a continuing effort due to the emergence and spread of HIV strains resistant to nearly all current drugs. The recent literature reveals that fragment-based drug design/discovery (FBDD) has become an effective alternative to conventional high-throughput screening strategies for drug discovery. In this critical review, the authors describe the state of the art in FBDD strategies for the discovery of anti-HIV drug-like compounds. The article focuses on fragment screening techniques, direct fragment-based design and early hit-to-lead progress. Rapid progress in biophysical detection and in silico techniques has greatly aided the application of FBDD to discover candidate agents directed at a variety of anti-HIV targets. Growing evidence suggests that structural insights on key proteins in the HIV life cycle can be applied in the early phase of drug discovery campaigns, providing valuable information on the binding modes and efficiently prompting fragment hit-to-lead progression. The combination of structural insights with improved methodologies for FBDD, including the privileged fragment-based reconstruction approach, fragment hybridization based on crystallographic overlays, fragment growth exploiting dynamic combinatorial chemistry, and high-speed fragment assembly via diversity-oriented synthesis followed by in situ screening, offers the possibility of more efficient and rapid discovery of novel drugs for HIV-1 prevention or treatment. Though the use of FBDD in anti-HIV drug discovery is still in its infancy, it is anticipated that anti-HIV agents developed via fragment-based strategies will be introduced into the clinic in the future.

  7. When fragments link: a bibliometric perspective on the development of fragment-based drug discovery.

    Science.gov (United States)

    Romasanta, Angelo K S; van der Sijde, Peter; Hellsten, Iina; Hubbard, Roderick E; Keseru, Gyorgy M; van Muijlwijk-Koezen, Jacqueline; de Esch, Iwan J P

    2018-05-05

    Fragment-based drug discovery (FBDD) is a highly interdisciplinary field, rich in ideas integrated from pharmaceutical sciences, chemistry, biology, and physics, among others. To enrich our understanding of the development of the field, we used bibliometric techniques to analyze 3642 publications in FBDD, complementing accounts by key practitioners. Mapping its core papers, we found the transfer of knowledge from academia to industry. Co-authorship analysis showed that university-industry collaboration has grown over time. Moreover, we show how ideas from other scientific disciplines have been integrated into the FBDD paradigm. Keyword analysis showed that the field is organized into four interconnected practices: library design, fragment screening, computational methods, and optimization. This study highlights the importance of interactions among various individuals and institutions from diverse disciplines in newly emerging scientific fields. Copyright © 2018. Published by Elsevier Ltd.

  8. Anti-fouling properties of Fab’ fragments immobilized on silane-based adlayers

    International Nuclear Information System (INIS)

    Crivianu-Gaita, Victor; Romaschin, Alexander; Thompson, Michael

    2015-01-01

    Highlights: • Simple and mixed adlayers formed with Fab’ linker and/or spacers. • Binding of Fab’ fragments through TUBTS linker resulted in oriented immobilization. • Immobilized Fab’ fragments have inherent anti-fouling character. • Up to 80% fouling reduction when Fab’ fragments introduced to surfaces. • Used the minimally fouling surfaces to detect a cancer biomarker (PTHrP) in serum. - Graphical abstract: Biosensors require surfaces that are highly specific towards the target analyte and that are minimally fouling. However, surface tuning to minimize fouling is a difficult task. The last decade has seen an increase in the use of immobilized antigen-binding antibody fragments (Fab’) in biosensors. One Fab’ linker compound S-(11-trichlorosilyl-undecanyl)-benzothiosulfonate (TUBTS) and three spacers were used to create the silane-based adlayers. The ultra-high frequency electromagnetic piezoelectric acoustic sensor (EMPAS) was used to gauge the fouling properties of the various surfaces using bovine serum albumin (BSA), goat IgG, and mouse serum. X-ray photoelectron spectroscopy (XPS), contact angle, and atomic force microscopy (AFM) were employed to characterize the surfaces. It was discovered that immobilized oriented Fab’ fragments reduced the fouling levels of surfaces up to 80% compared to the surfaces without fragments. An explanation for this phenomenon is that the antibody fragments increase the hydration of the surfaces and aid in the formation of an anti-fouling water barrier. The anti-fouling effect of the Fab’ fragments is at its maximum when there is an even distribution of fragments across the surfaces. Finally, using Fab’-covered surfaces, a cancer biomarker was detected from serum, showing the applicability of this work to the field of biodetection.

  9. Anti-fouling properties of Fab’ fragments immobilized on silane-based adlayers

    Energy Technology Data Exchange (ETDEWEB)

    Crivianu-Gaita, Victor [Department of Chemistry, University of Toronto, Toronto, ON M5S 3H6 (Canada); Romaschin, Alexander [Clinical Biochemistry, St. Michael's Hospital, Toronto, ON M5B 1W8 (Canada); Thompson, Michael, E-mail: mikethom@chem.utoronto.ca [Department of Chemistry, University of Toronto, Toronto, ON M5S 3H6 (Canada)

    2015-12-30

    Highlights: • Simple and mixed adlayers formed with Fab’ linker and/or spacers. • Binding of Fab’ fragments through TUBTS linker resulted in oriented immobilization. • Immobilized Fab’ fragments have inherent anti-fouling character. • Up to 80% fouling reduction when Fab’ fragments introduced to surfaces. • Used the minimally fouling surfaces to detect a cancer biomarker (PTHrP) in serum. - Graphical abstract: Biosensors require surfaces that are highly specific towards the target analyte and that are minimally fouling. However, surface tuning to minimize fouling is a difficult task. The last decade has seen an increase in the use of immobilized antigen-binding antibody fragments (Fab’) in biosensors. One Fab’ linker compound S-(11-trichlorosilyl-undecanyl)-benzothiosulfonate (TUBTS) and three spacers were used to create the silane-based adlayers. The ultra-high frequency electromagnetic piezoelectric acoustic sensor (EMPAS) was used to gauge the fouling properties of the various surfaces using bovine serum albumin (BSA), goat IgG, and mouse serum. X-ray photoelectron spectroscopy (XPS), contact angle, and atomic force microscopy (AFM) were employed to characterize the surfaces. It was discovered that immobilized oriented Fab’ fragments reduced the fouling levels of surfaces up to 80% compared to the surfaces without fragments. An explanation for this phenomenon is that the antibody fragments increase the hydration of the surfaces and aid in the formation of an anti-fouling water barrier. The anti-fouling effect of the Fab’ fragments is at its maximum when there is an even distribution of fragments across the surfaces. Finally, using Fab’-covered surfaces, a cancer biomarker was detected from serum, showing the applicability of this work to the field of biodetection.

  10. Construction of a genomic library of the human cytomegalovirus genome and analysis of late transcription of its inverted internal repeat region

    International Nuclear Information System (INIS)

    Silva, K.F.S.T.

    1989-01-01

    The investigations described in this dissertation were designed to determine the transcriptionally active DNA sequences of IIR region and to identify the viral mRNA transcribed from the transcriptionally most active DNA sequences of that region during late phase of HCMV Towne infection. Preliminary transcriptional studies which included the hybridization of a Southern blot of XbaI digested entire HCMV genome to 32P-labelled late phase infected cell A+ RNA, indicated that late viral transcripts homologous to XbaI Q fragment of IIR region were very highly abundant while XbaI Q fragment showed a very low transcriptional activity. To facilitate further analysis of late transcription of IIR region, the entire DNA sequences of IIR region were molecularly cloned as U, S, and H BamHI fragments in pACYC-184 plasmid vector. In addition, to be used in future studies on other regions of the genome, except for y and c' smaller fragments the entire 240 kb HCMV genome was cloned as BamHI fragments in the same vector. Furthermore, the U, S, and H BamHI fragments were mapped with six other restriction enzymes in order to use that mapping data in subsequent transcriptional analysis of the IIR region. Further localization of transcriptionally active DNA sequences within IIR region was achieved by hybridization of Southern blots of restricted U, S, and H BamHI fragments with 3' 32P-labelled infected cell late A+ RNA. The 1.5 kb EcoRI subfragments of S BamHI fragment and the adjoining 0.72 kb XhoI subfragment of H BamHI fragment revealed the highest level of transcription, although the remainder of the S fragment was also transcribed at a substantial level. The U fragment and the remainder of the H fragment were transcribed at a very low level

  11. The multiple roles of computational chemistry in fragment-based drug design

    Science.gov (United States)

    Law, Richard; Barker, Oliver; Barker, John J.; Hesterkamp, Thomas; Godemann, Robert; Andersen, Ole; Fryatt, Tara; Courtney, Steve; Hallett, Dave; Whittaker, Mark

    2009-08-01

    Fragment-based drug discovery (FBDD) represents a change in strategy from the screening of molecules with higher molecular weights and physical properties more akin to fully drug-like compounds, to the screening of smaller, less complex molecules. This is because it has been recognised that fragment hit molecules can be efficiently grown and optimised into leads, particularly after the binding mode to the target protein has been first determined by 3D structural elucidation, e.g. by NMR or X-ray crystallography. Several studies have shown that medicinal chemistry optimisation of an already drug-like hit or lead compound can result in a final compound with too high molecular weight and lipophilicity. The evolution of a lower molecular weight fragment hit therefore represents an attractive alternative approach to optimisation as it allows better control of compound properties. Computational chemistry can play an important role both prior to a fragment screen, in producing a target focussed fragment library, and post-screening in the evolution of a drug-like molecule from a fragment hit, both with and without the available fragment-target co-complex structure. We will review many of the current developments in the area and illustrate with some recent examples from successful FBDD discovery projects that we have conducted.

  12. A Ligand-observed Mass Spectrometry Approach Integrated into the Fragment Based Lead Discovery Pipeline

    Science.gov (United States)

    Chen, Xin; Qin, Shanshan; Chen, Shuai; Li, Jinlong; Li, Lixin; Wang, Zhongling; Wang, Quan; Lin, Jianping; Yang, Cheng; Shui, Wenqing

    2015-01-01

    In fragment-based lead discovery (FBLD), a cascade combining multiple orthogonal technologies is required for reliable detection and characterization of fragment binding to the target. Given the limitations of the mainstream screening techniques, we presented a ligand-observed mass spectrometry approach to expand the toolkits and increase the flexibility of building a FBLD pipeline especially for tough targets. In this study, this approach was integrated into a FBLD program targeting the HCV RNA polymerase NS5B. Our ligand-observed mass spectrometry analysis resulted in the discovery of 10 hits from a 384-member fragment library through two independent screens of complex cocktails and a follow-up validation assay. Moreover, this MS-based approach enabled quantitative measurement of weak binding affinities of fragments which was in general consistent with SPR analysis. Five out of the ten hits were then successfully translated to X-ray structures of fragment-bound complexes to lay a foundation for structure-based inhibitor design. With distinctive strengths in terms of high capacity and speed, minimal method development, easy sample preparation, low material consumption and quantitative capability, this MS-based assay is anticipated to be a valuable addition to the repertoire of current fragment screening techniques. PMID:25666181

  13. Critical Evaluation of Native Electrospray Ionization Mass Spectrometry for Fragment-Based Screening.

    Science.gov (United States)

    Göth, Melanie; Badock, Volker; Weiske, Jörg; Pagel, Kevin; Kuropka, Benno

    2017-08-08

    Fragment-based screening presents a promising alternative to high-throughput screening and has gained great attention in recent years. So far, only a few studies have discussed mass spectrometry as a screening technology for fragments. Herein, we report the application of native electrospray ionization mass spectrometry (MS) for screening defined sets of fragments against four different target proteins. Fragments were selected from a primary screening conducted with a thermal shift assay (TSA) and represented different binding categories. Our data indicated that, beside specific complex formation, many fragments show extensive multiple binding and also charge-state shifts. Both of these factors complicate automated data analysis and decrease the attractiveness of native MS as a primary screening tool for fragments. A comparison of the hits identified by native MS and TSA showed good agreement for two of the proteins. Furthermore, we discuss general challenges, including the determination of an optimal fragment concentration and the question of how to rank fragment hits according to their affinity. In conclusion, we consider native MS to be a highly valuable tool for the validation and deeper investigation of promising fragment hits rather than a method for primary screening. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Base-By-Base: single nucleotide-level analysis of whole viral genome alignments.

    Science.gov (United States)

    Brodie, Ryan; Smith, Alex J; Roper, Rachel L; Tcherepanov, Vasily; Upton, Chris

    2004-07-14

    With ever increasing numbers of closely related virus genomes being sequenced, it has become desirable to be able to compare two genomes at a level more detailed than gene content because two strains of an organism may share the same set of predicted genes but still differ in their pathogenicity profiles. For example, detailed comparison of multiple isolates of the smallpox virus genome (each approximately 200 kb, with 200 genes) is not feasible without new bioinformatics tools. A software package, Base-By-Base, has been developed that provides visualization tools to enable researchers to 1) rapidly identify and correct alignment errors in large, multiple genome alignments; and 2) generate tabular and graphical output of differences between the genomes at the nucleotide level. Base-By-Base uses detailed annotation information about the aligned genomes and can list each predicted gene with nucleotide differences, display whether variations occur within promoter regions or coding regions and whether these changes result in amino acid substitutions. Base-By-Base can connect to our mySQL database (Virus Orthologous Clusters; VOCs) to retrieve detailed annotation information about the aligned genomes or use information from text files. Base-By-Base enables users to quickly and easily compare large viral genomes; it highlights small differences that may be responsible for important phenotypic differences such as virulence. It is available via the Internet using Java Web Start and runs on Macintosh, PC and Linux operating systems with the Java 1.4 virtual machine.
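    The nucleotide-level comparison described above can be illustrated with a minimal sketch. It is not Base-By-Base code; it only assumes two rows of an existing alignment (equal length, "-" for gaps) and lists the columns where they differ. The function name and the example sequences are hypothetical.

```python
def alignment_differences(ref, other):
    """Return (column, ref_base, other_base) for every differing column.

    Columns are 1-based. Both inputs must be rows of the same alignment,
    so they must have equal length; '-' marks a gap.
    """
    if len(ref) != len(other):
        raise ValueError("aligned sequences must have equal length")
    diffs = []
    for col, (a, b) in enumerate(zip(ref.upper(), other.upper()), start=1):
        if a != b:
            diffs.append((col, a, b))
    return diffs


# Hypothetical aligned rows: one substitution and one single-base indel.
ref_aln = "ATGCC-GTA"
other_aln = "ATGTCAGTA"
diffs = alignment_differences(ref_aln, other_aln)
for col, a, b in diffs:
    print(f"column {col}: {a} -> {b}")
```

    A real tool would additionally map each column back to genome coordinates and annotated features; that step is omitted here.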

  15. Incorporation of rapid thermodynamic data in fragment-based drug discovery.

    Science.gov (United States)

    Kobe, Akihiro; Caaveiro, Jose M M; Tashiro, Shinya; Kajihara, Daisuke; Kikkawa, Masato; Mitani, Tomoya; Tsumoto, Kouhei

    2013-03-14

    Fragment-based drug discovery (FBDD) has enjoyed increasing popularity in recent years. We introduce SITE (single-injection thermal extinction), a novel thermodynamic methodology that selects high-quality hits early in FBDD. SITE is a fast calorimetric competitive assay suitable for automation that captures the essence of isothermal titration calorimetry but using significantly fewer resources. We describe the principles of SITE and identify a novel family of fragment inhibitors of the enzyme ketosteroid isomerase displaying high values of enthalpic efficiency.

  16. A simple strategy for subcloning and amplifying random multimegabase subchromosomal acentric DNA fragments as double minute chromosomes

    International Nuclear Information System (INIS)

    Hahn, P.J.; Giddings, L.; Lane, M.J.

    1989-01-01

    Restriction mapping of relatively large genomes (e.g. human) utilizing randomly generated DNA segments requires high mapping redundancy to successfully organize 'contigs' to represent the entire genome. The number of independent DNA segment maps required is dependent on the average size of a mapping segment; the larger the segment, the fewer required. The authors have developed a strategy for subcloning intact multimegabase subchromosomal fragments as double minute chromosomes. Such fragments could serve as primary mapping elements or as adjunct (linking) fragments to rapidly connect already existent contigs generated using yeast artificial chromosomes or cosmids. They present several lines of evidence supporting the viability of this approach. (1) X-ray treated EMT-6 mouse cells (7.5 Gr.) which are selected over several months with increasing levels of methotrexate (MTX) contain highly amplified circular DNA molecules (double minutes) which include the dihydrofolate reductase (DHFR) gene in a size range between 1,000 and 3,500 kilobases as determined by pulsed-field gel electrophoresis, and these acentric chromosomal fragments have been stably maintained in culture for at least a year. (2) Preliminary data based on experiments involving fusion of X-irradiated Chinese Hamster Ovary (CHO DG44) cells containing randomly inserted cotransfected Neomycin resistance and DHFR genes to mouse EMT-6 cells show that the linked genes can be readily cotransferred as acentric subchromosomal fragment(s) suitable for gene amplification. (3) The studies of CHO cells with cell fusion transferred X-ray induced chromosomal fragments containing the natural CHO DHFR gene suggest that transferred chromosome fragments undergo gene amplification much more readily than nonfragmented endogenous DHFR genes.

  17. Controlled isotropic fission fragment sources on the base of nuclear-physical facilities

    International Nuclear Information System (INIS)

    Sevast'yanov, V.D.; Maslov, G.N.

    1995-01-01

    Isotropic fission fragment sources (IFFS) have been developed on the basis of a neutron generator and a pulsed fast reactor. The IFFS make it possible to calibrate fission fragment detectors. They consist of radiators containing 235U, which are placed in the thermal neutron field of the neutron generator or at the center of the reactor core. The fragment activity is monitored by the readings of an α-particle counter or of a monitor of energy release in the core. 14 refs.; 1 fig.; 1 tab.

  18. Base-By-Base: Single nucleotide-level analysis of whole viral genome alignments

    Directory of Open Access Journals (Sweden)

    Tcherepanov Vasily

    2004-07-01

    Background: With ever increasing numbers of closely related virus genomes being sequenced, it has become desirable to be able to compare two genomes at a level more detailed than gene content because two strains of an organism may share the same set of predicted genes but still differ in their pathogenicity profiles. For example, detailed comparison of multiple isolates of the smallpox virus genome (each approximately 200 kb, with 200 genes) is not feasible without new bioinformatics tools. Results: A software package, Base-By-Base, has been developed that provides visualization tools to enable researchers to 1) rapidly identify and correct alignment errors in large, multiple genome alignments; and 2) generate tabular and graphical output of differences between the genomes at the nucleotide level. Base-By-Base uses detailed annotation information about the aligned genomes and can list each predicted gene with nucleotide differences, display whether variations occur within promoter regions or coding regions and whether these changes result in amino acid substitutions. Base-By-Base can connect to our mySQL database (Virus Orthologous Clusters; VOCs) to retrieve detailed annotation information about the aligned genomes or use information from text files. Conclusion: Base-By-Base enables users to quickly and easily compare large viral genomes; it highlights small differences that may be responsible for important phenotypic differences such as virulence. It is available via the Internet using Java Web Start and runs on Macintosh, PC and Linux operating systems with the Java 1.4 virtual machine.

  19. Fragment-based screening in tandem with phenotypic screening provides novel antiparasitic hits.

    Science.gov (United States)

    Blaazer, Antoni R; Orrling, Kristina M; Shanmugham, Anitha; Jansen, Chimed; Maes, Louis; Edink, Ewald; Sterk, Geert Jan; Siderius, Marco; England, Paul; Bailey, David; de Esch, Iwan J P; Leurs, Rob

    2015-01-01

    Methods to discover biologically active small molecules include target-based and phenotypic screening approaches. One of the main difficulties in drug discovery is elucidating and exploiting the relationship between drug activity at the protein target and disease modification, a phenotypic endpoint. Fragment-based drug discovery is a target-based approach that typically involves the screening of a relatively small number of fragment-like (molecular weight <300) molecules that efficiently cover chemical space. Here, we report a fragment screening on TbrPDEB1, an essential cyclic nucleotide phosphodiesterase (PDE) from Trypanosoma brucei, and human PDE4D, an off-target, in a workflow in which fragment hits and a series of close analogs are subsequently screened for antiparasitic activity in a phenotypic panel. The phenotypic panel contained T. brucei, Trypanosoma cruzi, Leishmania infantum, and Plasmodium falciparum, the causative agents of human African trypanosomiasis (sleeping sickness), Chagas disease, leishmaniasis, and malaria, respectively, as well as MRC-5 human lung cells. This hybrid screening workflow has resulted in the discovery of various benzhydryl ethers with antiprotozoal activity and low toxicity, representing interesting starting points for further antiparasitic optimization. © 2014 Society for Laboratory Automation and Screening.

  20. Universal elements of fragmentation

    International Nuclear Information System (INIS)

    Yanovsky, V. V.; Tur, A. V.; Kuklina, O. V.

    2010-01-01

    A fragmentation theory is proposed that explains the universal asymptotic behavior of the fragment-size distribution in the large-size range, based on simple physical principles. The basic principles of the theory are the total mass conservation in a fragmentation process and a balance condition for the energy expended in increasing the surface of fragments during their breakup. A flux-based approach is used that makes it possible to supplement the basic principles and develop a minimal theory of fragmentation. Such a supplementary principle is that of decreasing fragment-volume flux with increasing energy expended in fragmentation. It is shown that the behavior of the decreasing flux is directly related to the form of a power-law fragment-size distribution. The minimal theory is used to find universal asymptotic fragment-size distributions and to develop a natural physical classification of fragmentation models. A more general, nonlinear theory of strong fragmentation is also developed. It is demonstrated that solutions to a nonlinear kinetic equation consistent with both basic principles approach a universal asymptotic size distribution. Agreement between the predicted asymptotic fragment-size distributions and experimental observations is discussed.

  1. Fragment-Based Drug Discovery of Potent Protein Kinase C Iota Inhibitors.

    Science.gov (United States)

    Kwiatkowski, Jacek; Liu, Boping; Tee, Doris Hui Ying; Chen, Guoying; Ahmad, Nur Huda Binte; Wong, Yun Xuan; Poh, Zhi Ying; Ang, Shi Hua; Tan, Eldwin Sum Wai; Ong, Esther Hq; Nurul Dinie; Poulsen, Anders; Pendharkar, Vishal; Sangthongpitag, Kanda; Lee, May Ann; Sepramaniam, Sugunavathi; Ho, Soo Yei; Cherian, Joseph; Hill, Jeffrey; Keller, Thomas H; Hung, Alvin W

    2018-05-24

    Protein kinase C iota (PKC-ι) is an atypical kinase implicated in the promotion of different cancer types. A biochemical screen of a fragment library has identified several hits from which an azaindole-based scaffold was chosen for optimization. Driven by a structure-activity relationship and supported by molecular modeling, a weakly bound fragment was systematically grown into a potent and selective inhibitor against PKC-ι.

  2. Highly sensitive strain sensors based on fragmentized carbon nanotube/polydimethylsiloxane composites.

    Science.gov (United States)

    Gao, Yang; Fang, Xiaoliang; Tan, Jianping; Lu, Ting; Pan, Likun; Xuan, Fuzhen

    2018-06-08

    Wearable strain sensors based on nanomaterial/elastomer composites have potential applications in flexible electronic skin, human motion detection, human-machine interfaces, etc. In this research, a type of high-performance strain sensor has been developed using fragmentized carbon nanotube/polydimethylsiloxane (CNT/PDMS) composites. The CNT/PDMS composites were ground into fragments, and a liquid-induced densification method was used to fabricate the strain sensors. The strain sensors showed high sensitivity, with gauge factors (GFs) larger than 200, and a broad strain detection range up to 80%, much higher than those of strain sensors based on unfragmentized CNT/PDMS composites (GF [...]). The sensitivity of the sensors is ascribed to the sliding of individual fragmentized-CNT/PDMS-composite particles during mechanical deformation, which causes a significant resistance change in the strain sensors. The strain sensors can differentiate mechanical stimuli and monitor various human body motions, such as bending of the fingers, human breathing, and blood pulsing.
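    For readers unfamiliar with the figure of merit quoted above, the gauge factor of a resistive strain sensor is conventionally defined as the relative resistance change divided by the applied strain, GF = (ΔR/R0)/ε. The sketch below only illustrates that standard definition; the resistance and strain values are hypothetical and are not taken from the paper.

```python
def gauge_factor(r0, r, strain):
    """Gauge factor GF = ((R - R0) / R0) / strain.

    r0     : unstrained resistance (ohms), must be > 0
    r      : resistance under strain (ohms)
    strain : applied strain as a fraction (0.5 means 50%), must be > 0
    """
    if r0 <= 0 or strain <= 0:
        raise ValueError("r0 and strain must be positive")
    return ((r - r0) / r0) / strain


# Hypothetical reading: resistance rises from 10 ohms to 110 ohms at 50% strain.
gf = gauge_factor(10.0, 110.0, 0.5)
print(gf)
```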

  3. Molecular markers. Amplified fragment length polymorphism

    Directory of Open Access Journals (Sweden)

    Pržulj Novo

    2005-01-01

    Amplified Fragment Length Polymorphism molecular markers (AFLPs) have been developed by combining the procedures of RFLP and RAPD molecular markers, i.e. the first step is restriction digestion of the genomic DNA, which is followed by selective amplification of the restricted fragments. The advantage of the AFLP technique is that it allows rapid generation of a large number of reproducible markers. The reproducibility of AFLP markers is assured by the use of restriction site-specific adapters and adapter-specific primers for the PCR reaction. Only fragments containing the restriction site sequence plus the additional selective nucleotides will be amplified, and the more selective nucleotides are added to the primer sequence, the fewer fragments are amplified by PCR. The amplified products are normally separated on a sequencing gel and visualized after exposure to X-ray film or by using fluorescently labeled primers. AFLPs have proven to be extremely proficient in revealing diversity below the species level. A disadvantage of the AFLP technique is that AFLPs are essentially a dominant marker system and are not able to identify heterozygotes.
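    The inverse relationship between selective nucleotides and the number of amplified fragments can be made concrete with a back-of-the-envelope sketch. It assumes, as a simplification not stated in the abstract, that the four bases are equally frequent and independent, so each selective nucleotide keeps roughly one quarter of the candidate fragments. The function name and the fragment count are hypothetical.

```python
def expected_amplified(n_fragments, selective_a, selective_b):
    """Expected number of fragments amplified in a selective PCR.

    n_fragments : restriction fragments carrying both adapters
    selective_a : selective nucleotides on the first primer
    selective_b : selective nucleotides on the second primer

    Assumes equal, independent base frequencies, so each selective
    nucleotide retains 1/4 of the fragments (an idealization).
    """
    if min(n_fragments, selective_a, selective_b) < 0:
        raise ValueError("arguments must be non-negative")
    return n_fragments * 0.25 ** (selective_a + selective_b)


# Hypothetical digest with 100,000 adapter-ligated fragments.
for n_sel in range(4):
    count = expected_amplified(100_000, n_sel, n_sel)
    print(f"+{n_sel}/+{n_sel} primers: ~{count:.0f} fragments")
```

    Real genomes deviate from this estimate because base composition is uneven, but the fourfold reduction per added nucleotide is a useful first approximation when choosing primer combinations.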

  4. Fragmentation of the large subunit ribosomal RNA gene in oyster mitochondrial genomes

    Directory of Open Access Journals (Sweden)

    Milbury Coren A

    2010-09-01

    Background: Discontinuous genes have been observed in bacteria, archaea, and eukaryotic nuclei, mitochondria and chloroplasts. Gene discontinuity occurs in multiple forms: the two most frequent forms result from introns that are spliced out of the RNA and the resulting exons are spliced together to form a single transcript, and fragmented gene transcripts that are not covalently attached post-transcriptionally. Within the past few years, fragmented ribosomal RNA (rRNA) genes have been discovered in bilateral metazoan mitochondria, all within a group of related oysters. Results: In this study, we have characterized this fragmentation with comparative analysis and experimentation. We present secondary structures, modeled using comparative sequence analysis of the discontinuous mitochondrial large subunit rRNA genes of the cupped oysters C. virginica, C. gigas, and C. hongkongensis. Comparative structure models for the large subunit rRNA in each of the three oyster species are generally similar to those for other bilateral metazoans. We also used RT-PCR and analyzed ESTs to determine if the two fragmented LSU rRNAs are spliced together. The two segments are transcribed separately, and not spliced together although they still form functional rRNAs and ribosomes. Conclusions: Although many examples of discontinuous ribosomal genes have been documented in bacteria and archaea, as well as the nuclei, chloroplasts, and mitochondria of eukaryotes, oysters are some of the first characterized examples of fragmented bilateral animal mitochondrial rRNA genes. The secondary structures of the oyster LSU rRNA fragments have been predicted on the basis of previous comparative metazoan mitochondrial LSU rRNA structure models.

  5. Investigation on energetics of ex-vessel vapor explosion based on spontaneous nucleation fragmentation

    International Nuclear Information System (INIS)

    Liu, Jie; Koshizuka, Seiichi; Oka, Yoshiaki

    2002-01-01

    A computer code, PROVER-I, is developed for the propagation phase of vapor explosion. A new thermal fragmentation model is proposed with three kinds of time scale for modeling instant fragmentation, spontaneous nucleation fragmentation and normal boiling fragmentation. The energetics of ex-vessel vapor explosion is investigated based on different fragmentation models. A higher pressure peak and a larger mechanical energy conversion ratio are obtained by spontaneous nucleation fragmentation. A smaller energy conversion ratio results from normal boiling fragmentation. When the delay time in the thermal fragmentation model is near 0.0 ms, the pressure propagation behavior tends to be analogous to that in hydrodynamic fragmentation. If the delay time is longer, pressure attenuation occurs at the shock front. A high energy conversion ratio (>4%) is obtained at a small vapor volume fraction together with spontaneous nucleation fragmentation. These results are consistent with fuel-coolant interaction experiments with alumina melt. However, under larger vapor volume fraction conditions (α_v > 0.3), the vapor explosion is weak. For corium melt, a coarse mixture with a void fraction of more than 30% can be generated in the pre-mixing process because of its physical properties. In a mixture with such a high void fraction, an energetic vapor explosion hardly takes place. (author)

  6. Gene calling and bacterial genome annotation with BG7.

    Science.gov (United States)

    Tobes, Raquel; Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Kovach, Evdokim; Alekhin, Alexey; Pareja, Eduardo

    2015-01-01

    New massive sequencing technologies are providing many bacterial genome sequences from diverse taxa but a refined annotation of these genomes is crucial for obtaining scientific findings and new knowledge. Thus, bacterial genome annotation has emerged as a key point to investigate in bacteria. Any efficient tool designed specifically to annotate bacterial genomes sequenced with massively parallel technologies has to consider the specific features of bacterial genomes (absence of introns and scarcity of nonprotein-coding sequence) and of next-generation sequencing (NGS) technologies (presence of errors and not perfectly assembled genomes). These features make it convenient to focus on coding regions and, hence, on protein sequences, which are the elements directly related to biological functions. In this chapter we describe how to annotate bacterial genomes with BG7, an open-source tool based on a protein-centered gene calling/annotation paradigm. BG7 is specifically designed for the annotation of bacterial genomes sequenced with NGS. The tool tolerates sequence errors and retains its capabilities when annotating highly fragmented genomes or mixed sequences coming from several genomes (such as those obtained from metagenomics samples). BG7 has been designed with scalability as a requirement, with a computing infrastructure completely based on cloud computing (Amazon Web Services).

  7. Bootstrap embedding: An internally consistent fragment-based method

    Energy Technology Data Exchange (ETDEWEB)

    Welborn, Matthew; Tsuchimochi, Takashi; Van Voorhis, Troy [Department of Chemistry, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139 (United States)]

    2016-08-21

    Strong correlation poses a difficult problem for electronic structure theory, with computational cost scaling quickly with system size. Fragment embedding is an attractive approach to this problem. By dividing a large complicated system into smaller manageable fragments “embedded” in an approximate description of the rest of the system, we can hope to ameliorate the steep cost of correlated calculations. While appealing, these methods often converge slowly with fragment size because of small errors at the boundary between fragment and bath. We describe a new electronic embedding method, dubbed “Bootstrap Embedding,” a self-consistent wavefunction-in-wavefunction embedding theory that uses overlapping fragments to improve the description of fragment edges. We apply this method to the one dimensional Hubbard model and a translationally asymmetric variant, and find that it performs very well for energies and populations. We find Bootstrap Embedding converges rapidly with embedded fragment size, overcoming the surface-area-to-volume-ratio error typical of many embedding methods. We anticipate that this method may lead to a low-scaling, high accuracy treatment of electron correlation in large molecular systems.

  8. The Complete Mitochondrial Genome of the Foodborne Parasitic Pathogen Cyclospora cayetanensis.

    Directory of Open Access Journals (Sweden)

    Hediye Nese Cinar

    Cyclospora cayetanensis is a human-specific coccidian parasite responsible for several food and water-related outbreaks around the world, including the most recent ones involving over 900 persons in 2013 and 2014 outbreaks in the USA. Multicopy organellar DNA such as mitochondrion genomes have been particularly informative for detection and genetic traceback analysis in other parasites. We sequenced the C. cayetanensis genomic DNA obtained from stool samples from patients infected with Cyclospora in Nepal using the Illumina MiSeq platform. By bioinformatically filtering out the metagenomic reads of non-coccidian origin sequences and concentrating the reads by targeted alignment, we were able to obtain contigs containing Eimeria-like mitochondrial, apicoplastic and some chromosomal genomic fragments. A mitochondrial genomic sequence was assembled and confirmed by cloning and sequencing targeted PCR products amplified from Cyclospora DNA using primers based on our draft assembly sequence. The results show that the C. cayetanensis mitochondrion genome is 6274 bp in length, with 33% GC content, and likely exists in concatemeric arrays as in Eimeria mitochondrial genomes. Phylogenetic analysis of the C. cayetanensis mitochondrial genome places this organism in a tight cluster with Eimeria species. The mitochondrial genome of C. cayetanensis contains three protein coding genes, cytochrome b (cytb), cytochrome C oxidase subunit 1 (cox1), and cytochrome C oxidase subunit 3 (cox3), in addition to 14 large subunit (LSU) and nine small subunit (SSU) fragmented rRNA genes.

  9. Genomic variations of Mycoplasma capricolum subsp capripneumoniae detected by amplified fragment length polymorphism (AFLP) analysis

    DEFF Research Database (Denmark)

    Kokotovic, Branko; Bolske, G.; Ahrens, Peter

    2000-01-01

    The genetic diversity of Mycoplasma capricolum subsp. capripneumoniae strains based on determination of amplified fragment length polymorphisms (AFLP) is described. AFLP fingerprints of 38 strains derived from different countries in Africa and the Middle East consisted of over 100 bands in the size...

  10. Progress towards construction of a total restriction fragment map of a human chromosome.

    NARCIS (Netherlands)

    H. Vissing; F.G. Grosveld (Frank); E. Solomon; G. Moore; N. Lench; N. Shennan; R. Williamson

    1987-01-01

    We present an approach to the construction of an overlapping restriction fragment map of a single human chromosome. A genomic cosmid library genome was constructed from a mouse-human hybrid cell line containing chromosome 17 as its only human genetic component. Cosmids containing human

  11. A comprehensive evaluation of rodent malaria parasite genomes and gene expression

    KAUST Repository

    Otto, Thomas D

    2014-10-30

    Background: Rodent malaria parasites (RMP) are used extensively as models of human malaria. Draft RMP genomes have been published for Plasmodium yoelii, P. berghei ANKA (PbA) and P. chabaudi AS (PcAS). Although availability of these genomes made a significant impact on recent malaria research, these genomes were highly fragmented and were annotated with little manual curation. The fragmented nature of the genomes has hampered genome wide analysis of Plasmodium gene regulation and function. Results: We have greatly improved the genome assemblies of PbA and PcAS, newly sequenced the virulent parasite P. yoelii YM genome, sequenced additional RMP isolates/lines and have characterized genotypic diversity within RMP species. We have produced RNA-seq data and utilized it to improve gene-model prediction and to provide quantitative, genome-wide, data on gene expression. Comparison of the RMP genomes with the genome of the human malaria parasite P. falciparum and RNA-seq mapping permitted gene annotation at base-pair resolution. Full-length chromosomal annotation permitted a comprehensive classification of all subtelomeric multigene families including the 'Plasmodium interspersed repeat genes' (pir). Phylogenetic classification of the pir family, combined with pir expression patterns, indicates functional diversification within this family. Conclusions: Complete RMP genomes, RNA-seq and genotypic diversity data are excellent and important resources for gene-function and post-genomic analyses and to better interrogate Plasmodium biology. Genotypic diversity between P. chabaudi isolates makes this species an excellent parasite to study genotype-phenotype relationships. The improved classification of multigene families will enhance studies on the role of (variant) exported proteins in virulence and immune evasion/modulation.

  12. Fragment-based drug discovery as alternative strategy to the drug development for neglected diseases.

    Science.gov (United States)

    Mello, Juliana da Fonseca Rezende E; Gomes, Renan Augusto; Vital-Fujii, Drielli Gomes; Ferreira, Glaucio Monteiro; Trossini, Gustavo Henrique Goulart

    2017-12-01

    Neglected diseases (NDs) affect large populations and almost whole continents, representing 12% of the global health burden. In contrast, the treatment available today is limited and sometimes ineffective. In this scenario, Fragment-Based Drug Discovery has emerged as one of the most promising alternatives to the traditional methods of drug development. This method allows new lead compounds to be obtained from smaller fragment libraries. Despite the wide success of Fragment-Based Drug Discovery in yielding new effective therapeutic agents against different diseases, few studies have so far applied this approach to NDs. In this article, we discuss the basic Fragment-Based Drug Discovery process, briefly review successful general applications, and present a landscape of its use in NDs, encouraging the implementation of this strategy as an interesting way to optimize the development of new drugs for NDs. © 2017 John Wiley & Sons A/S.

  13. Current perspectives in fragment-based lead discovery (FBLD)

    Science.gov (United States)

    Lamoree, Bas; Hubbard, Roderick E.

    2017-01-01

    It is over 20 years since the first fragment-based discovery projects were disclosed. The methods are now mature for most ‘conventional’ targets in drug discovery such as enzymes (kinases and proteases) but there has also been growing success on more challenging targets, such as disruption of protein–protein interactions. The main application is to identify tractable chemical startpoints that non-covalently modulate the activity of a biological molecule. In this essay, we overview current practice in the methods and discuss how they have had an impact in lead discovery – generating a large number of fragment-derived compounds that are in clinical trials and two medicines treating patients. In addition, we discuss some of the more recent applications of the methods in chemical biology – providing chemical tools to investigate biological molecules, mechanisms and systems. PMID:29118093

  14. Differentiation and diagnosis of Pseudocercosporella herpotrichoides (Fron) Deighton with genomic DNA probes

    DEFF Research Database (Denmark)

    Frei, U; Wenzel, G.

    1993-01-01

    Repetitive genomic clones were used to differentiate between varieties within the species Pseudocercosporella herpotrichoides. From 21 clones tested 13 revealed restriction fragment length polymorphisms among isolates. Cluster analysis was performed based on these data. Differentiation of isolate...

  15. Ultrasensitive, Stretchable Strain Sensors Based on Fragmented Carbon Nanotube Papers

    KAUST Repository

    Zhou, Jian; Yu, Hu; Xu, Xuezhu; Han, Fei; Lubineau, Gilles

    2017-01-01

    The development of strain sensors featuring both ultra high sensitivity and high stretchability is still a challenge. We demonstrate that strain sensors based on fragmented single-walled carbon nanotube (SWCNT) paper embedded in poly

  16. Enhanced resolution of DNA restriction fragments: A procedure by two-dimensional electrophoresis and double-labeling

    International Nuclear Information System (INIS)

    Yi, M.; Au, L.C.; Ichikawa, N.; Ts'o, P.O.

    1990-01-01

    A probe-free method was developed to detect DNA rearrangement in bacteria based on the electrophoretic separation of twice-digested restriction fragments of genomic DNA into a two-dimensional (2-D) pattern. The first restriction enzyme digestion was done in solution, followed by electrophoresis of the restriction fragments in one dimension. A second restriction enzyme digestion was carried out in situ in the gel, followed by electrophoresis in a second dimension perpendicular to the first electrophoresis. The 2-D pattern provides for the resolution of 300-400 spots, which are defined and indexed by an x,y coordinate system with size markers. This approach has greatly increased the resolving power over conventional one-dimensional (1-D) electrophoresis. To study DNA rearrangement, a 2-D pattern from a test strain was compared with the 2-D pattern from a reference strain. After the first digestion, genomic DNA fragments from the test strain were labeled with 35S, while those from the reference strain were labeled with 32P. This was done to utilize the difference in the energy emission of the 35S and 32P isotopes for autoradiography when two x-ray films were exposed simultaneously on top of the gel after the 2-D electrophoresis. The radiation from the decay of 35S exposed only the lower film, whereas the radiation from the decay of 32P exposed both the lower and upper films. DNA fragments that differ between the test DNA and the reference DNA can be identified unambiguously from the two different 2-D patterns produced on the two films by the 35S- and 32P-labeled fragments in the same gel. An appropriate photographic procedure further simplified the process, allowing only the difference in DNA fragments between these two patterns to be shown in the map.

  17. Ultrasensitive, Stretchable Strain Sensors Based on Fragmented Carbon Nanotube Papers

    KAUST Repository

    Zhou, Jian

    2017-01-17

    The development of strain sensors featuring both ultra high sensitivity and high stretchability is still a challenge. We demonstrate that strain sensors based on fragmented single-walled carbon nanotube (SWCNT) paper embedded in poly(dimethylsiloxane) (PDMS) can sustain their sensitivity even at very high strain levels (with a gauge factor of over 10^7 at 50% strain). This record sensitivity is ascribed to the low initial electrical resistance (5-28 Ω) of the SWCNT paper and the wide change in resistance (up to 10^6 Ω) governed by the percolated network of SWCNT in the cracked region. The sensor response remains nearly unchanged after 10 000 strain cycles at 20%, proving the robustness of this technology. This fragmentation-based sensing system brings opportunities to engineer highly sensitive stretchable sensors.

  18. Genomic signal processing methods for computation of alignment-free distances from DNA sequences.

    Science.gov (United States)

    Borrayo, Ernesto; Mendizabal-Ruiz, E Gerardo; Vélez-Pérez, Hugo; Romo-Vázquez, Rebeca; Mendizabal, Adriana P; Morales, J Alejandro

    2014-01-01

    Genomic signal processing (GSP) refers to the use of digital signal processing (DSP) tools for analyzing genomic data such as DNA sequences. A possible application of GSP that has not been fully explored is the computation of the distance between a pair of sequences. In this work we present GAFD, a novel GSP alignment-free distance computation method. We introduce a DNA sequence-to-signal mapping function based on the employment of doublet values, which increases the number of possible amplitude values for the generated signal. Additionally, we explore the use of three DSP distance metrics as descriptors for categorizing DNA signal fragments. Our results indicate the feasibility of employing GAFD for computing sequence distances and the use of descriptors for characterizing DNA fragments.
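    The idea of a doublet-based sequence-to-signal mapping followed by a DSP distance can be illustrated with a minimal sketch. The amplitude table (`DOUBLET_VALUE`), the use of a plain DFT magnitude spectrum, and the Euclidean distance are all assumptions made for illustration; they are not the mapping or the metrics defined for GAFD.

```python
import cmath
from itertools import product

# Hypothetical amplitude table: each of the 16 doublets gets a distinct
# integer 0..15 (AA=0, AC=1, ..., TT=15). Not the GAFD mapping.
DOUBLET_VALUE = {"".join(p): i for i, p in enumerate(product("ACGT", repeat=2))}


def doublet_signal(seq):
    """Map a DNA string to a signal with one sample per overlapping doublet."""
    seq = seq.upper()
    return [DOUBLET_VALUE[seq[i:i + 2]] for i in range(len(seq) - 1)]


def magnitude_spectrum(signal):
    """Normalised DFT magnitudes, computed naively in O(n^2)."""
    n = len(signal)
    return [
        abs(sum(x * cmath.exp(-2j * cmath.pi * k * t / n)
                for t, x in enumerate(signal))) / n
        for k in range(n)
    ]


def spectral_distance(seq_a, seq_b):
    """Euclidean distance between the spectra of two sequences.

    Both sequences are truncated to the shorter length so that the
    spectra have the same number of bins (a simplifying assumption).
    """
    n = min(len(seq_a), len(seq_b))
    spec_a = magnitude_spectrum(doublet_signal(seq_a[:n]))
    spec_b = magnitude_spectrum(doublet_signal(seq_b[:n]))
    return sum((a - b) ** 2 for a, b in zip(spec_a, spec_b)) ** 0.5
```

    Using doublets rather than single bases gives 16 possible amplitude values instead of 4, which is the property the abstract highlights; no alignment is needed at any point.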

  19. Robust Object Tracking Using Valid Fragments Selection.

    Science.gov (United States)

    Zheng, Jin; Li, Bo; Tian, Peng; Luo, Gang

    Local features are widely used in visual tracking to improve robustness in cases of partial occlusion, deformation and rotation. This paper proposes a local fragment-based object tracking algorithm. Unlike many existing fragment-based algorithms that allocate the weights to each fragment, this method firstly defines discrimination and uniqueness for local fragment, and builds an automatic pre-selection of useful fragments for tracking. Then, a Harris-SIFT filter is used to choose the current valid fragments, excluding occluded or highly deformed fragments. Based on those valid fragments, fragment-based color histogram provides a structured and effective description for the object. Finally, the object is tracked using a valid fragment template combining the displacement constraint and similarity of each valid fragment. The object template is updated by fusing feature similarity and valid fragments, which is scale-adaptive and robust to partial occlusion. The experimental results show that the proposed algorithm is accurate and robust in challenging scenarios.

  20. Getting complete genomes from complex samples using nanopore sequencing

    DEFF Research Database (Denmark)

    Kirkegaard, Rasmus Hansen; Karst, Søren Michael; Albertsen, Mads

    Background: Short read DNA sequencing and metagenomic binning workflows have made it possible to extract bacterial genome bins from environmental microbial samples containing hundreds to thousands of different species. However, these genome bins often do not represent complete genomes, as they are mostly fragmented, incomplete and often contaminated with foreign DNA. These `draft genomes` have limited lasting value to the scientific community, as gene synteny is broken and there is some uncertainty about what is missing. The genetic material most often missed is important multi-copy and/or conserved marker genes such as the 16S rRNA gene, as sequence micro-heterogeneity prevents assembly of these genes in the de novo assembly. However, long read sequencing technologies are emerging, promising an end to fragmented genome assemblies. Experimental design: We extracted DNA from a full...

  1. Dhajala meteorite shower: atmospheric fragmentation and ablation based on cosmic ray track studies

    Energy Technology Data Exchange (ETDEWEB)

    Bagolia, C; Doshi, N; Gupta, S K; Kumar, S; Lal, D; Trivedi, J R [Physical Research Lab., Ahmedabad (India)]

    1977-06-01

    Cosmic-ray track studies have been carried out in more than 250 fragments of the Dhajala meteorite, comprising greater than 70% of the recovered mass. In the case of larger fragments (namely, those with mass exceeding 250 g), several faces of each fragment have been analysed for track densities. Track densities are low, and fall generally in the range 10^3 to 10^5 cm^-2, indicating appreciable ablation losses, since the cosmic ray exposure age of Dhajala is about 7 m.y. (track measurements were confined to large olivine grains to minimize contributions to tracks due to the fission of uranium and extinct radionuclides). Attempts have been made to deduce information about fragmentation dynamics and the preatmospheric mass/radius of Dhajala, based on the present comprehensive study of track densities in the fragments. Correlations between the position of a fragment in the strewnfield and its track density have provided an approximate scenario for the fragmentation/ablation of the meteorite during its atmospheric flight. Observation of the minimum track density in the fragments leads to a value of (38 ± 2) cm for the preatmospheric radius of the meteorite. It is estimated from these data that the collection of fragments was made with an overall efficiency of approximately 60% and that the ablation amounts to (86.7 ± 2.1)%. Estimated amounts of ablation for shells of different radii are also presented.

  2. The heterothallic sugarbeet pathogen Cercospora beticola contains exon fragments of both MAT genes that are homogenized by concerted evolution.

    Science.gov (United States)

    Bolton, Melvin D; de Jonge, Ronnie; Inderbitzin, Patrik; Liu, Zhaohui; Birla, Keshav; Van de Peer, Yves; Subbarao, Krishna V; Thomma, Bart P H J; Secor, Gary A

    2014-01-01

    Dothideomycetes is one of the most ecologically diverse and economically important classes of fungi. Sexual reproduction in this group is governed by mating type (MAT) genes at the MAT1 locus. Self-sterile (heterothallic) species contain one of two genes at MAT1 (MAT1-1-1 or MAT1-2-1) and only isolates of opposite mating type are sexually compatible. In contrast, self-fertile (homothallic) species contain both MAT genes at MAT1. Knowledge of the reproductive capacities of plant pathogens is of particular interest because recombining populations tend to be more difficult to manage in agricultural settings. In this study, we sequenced MAT1 in the heterothallic Dothideomycete fungus Cercospora beticola to gain insight into the reproductive capabilities of this important plant pathogen. In addition to the expected MAT gene at MAT1, each isolate contained fragments of both MAT1-1-1 and MAT1-2-1 at ostensibly random loci across the genome. When MAT fragments from each locus were manually assembled, they reconstituted MAT1-1-1 and MAT1-2-1 exons with high identity, suggesting a retroposition event occurred in a homothallic ancestor in which both MAT genes were fused. The genome sequences of related taxa revealed that the MAT gene fragment pattern of Cercospora zeae-maydis was analogous to that of C. beticola. In contrast, the genome of the more distantly related Mycosphaerella graminicola did not contain MAT fragments. Although fragments occurred in syntenic regions of the C. beticola and C. zeae-maydis genomes, each MAT fragment was more closely related to the intact MAT gene of the same species. Taken together, these data suggest MAT genes fragmented after divergence of M. graminicola from the remaining taxa, and concerted evolution functioned to homogenize MAT fragments and MAT genes in each species. Published by Elsevier Inc.

  3. EDF fragment relocation model based on the displacement of rigid bodies

    International Nuclear Information System (INIS)

    Callu, C.; Baron, D.; Ruck, J.M.

    1997-01-01

    In order to relax the restrictive conditions imposed on reactor operations with regard to PCMI (Pellet-Cladding Mechanical Interaction), the simulation of fuel rod thermomechanical behavior has to be improved. The computer code has to cope with increasingly sophisticated mathematical models, driven by the complexity and interdependence of the phenomena. Therefore, EDF has been developing a new code, CYRANO3, since 1990, with an emphasis on its capacity to evolve. Concerning the PCMI simulation more precisely, pellet fragmentation and fragment relocation are among the major aspects that must be accounted for. Thanks to recent analytical experiments, EDF developed a new model based on the displacement of rigid bodies and on the calculation of the interaction forces between the fragments. This paper presents the basis of the model, its introduction within the CYRANO3 code and its calibration on a specific analytical experiment. The model is then tested against PWR fuel rod deformations from the EDF database. The results are presented and discussed. (author)

  4. Generating "fragment-based virtual library" using pocket similarity search of ligand-receptor complexes.

    Science.gov (United States)

    Khashan, Raed S

    2015-01-01

    As the number of available ligand-receptor complexes is increasing, researchers are becoming more dedicated to mine these complexes to aid in the drug design and development process. We present free software which is developed as a tool for performing similarity search across ligand-receptor complexes for identifying binding pockets which are similar to that of a target receptor. The search is based on 3D-geometric and chemical similarity of the atoms forming the binding pocket. For each match identified, the ligand's fragment(s) corresponding to that binding pocket are extracted, thus forming a virtual library of fragments (FragVLib) that is useful for structure-based drug design. The program provides a very useful tool to explore available databases.

  5. Computational fragment-based screening using RosettaLigand: the SAMPL3 challenge

    Science.gov (United States)

    Kumar, Ashutosh; Zhang, Kam Y. J.

    2012-05-01

    The SAMPL3 fragment-based virtual screening challenge provides a valuable opportunity for researchers to test their programs, methods and screening protocols in a blind testing environment. We participated in the SAMPL3 challenge and evaluated our virtual fragment screening protocol, which involves RosettaLigand as the core component, by screening a 500-fragment Maybridge library against bovine pancreatic trypsin. Our study reaffirmed that the real test for any virtual screening approach would be in a blind testing environment. The analyses presented in this paper also showed that virtual screening performance can be improved if a set of known active compounds is available and parameters and methods that yield better enrichment are selected. Our study also highlighted that, to achieve accurate orientation and conformation of ligands within a binding site, selecting an appropriate method to calculate partial charges is important. Another finding is that using multiple receptor ensembles in docking does not always yield better enrichment than individual receptors. On the basis of our results and retrospective analyses from the SAMPL3 fragment screening challenge, we anticipate that chances of success in a fragment screening process could be increased significantly with careful selection of receptor structures, protein flexibility, sufficient conformational sampling within the binding pocket and accurate assignment of ligand and protein partial charges.

  6. The complete mitochondrial genome sequence of Eimeria innocua (Eimeriidae, Coccidia, Apicomplexa).

    Science.gov (United States)

    Hafeez, Mian Abdul; Vrba, Vladimir; Barta, John Robert

    2016-07-01

    The complete mitochondrial genome of Eimeria innocua KR strain (Eimeriidae, Coccidia, Apicomplexa) was sequenced. This coccidium infects turkeys (Meleagris gallopavo), Bobwhite quails (Colinus virginianus), and Grey partridges (Perdix perdix). Genome organization and gene contents were comparable with other Eimeria spp. infecting galliform birds. The circular-mapping mt genome of E. innocua is 6247 bp in length with three protein-coding genes (cox1, cox3, and cytb), 19 gene fragments encoding large subunit (LSU) rRNA and 14 gene fragments encoding small subunit (SSU) rRNA. Like other Apicomplexa, no tRNA was encoded. The mitochondrial genome of E. innocua confirms its close phylogenetic affinities to Eimeria dispersa.

  7. Systematic evaluation of bias in microbial community profiles induced by whole genome amplification.

    Science.gov (United States)

    Direito, Susana O L; Zaura, Egija; Little, Miranda; Ehrenfreund, Pascale; Röling, Wilfred F M

    2014-03-01

    Whole genome amplification methods facilitate the detection and characterization of microbial communities in low biomass environments. We examined the extent to which the actual community structure is reliably revealed and factors contributing to bias. One widely used [multiple displacement amplification (MDA)] and one new primer-free method [primase-based whole genome amplification (pWGA)] were compared using a polymerase chain reaction (PCR)-based method as control. Pyrosequencing of an environmental sample and principal component analysis revealed that MDA impacted community profiles more strongly than pWGA and indicated that this related to species GC content, although an influence of DNA integrity could not be excluded. Subsequently, biases by species GC content, DNA integrity and fragment size were separately analysed using defined mixtures of DNA from various species. We found significantly less amplification of species with the highest GC content for MDA-based templates and, to a lesser extent, for pWGA. DNA fragmentation also interfered severely: species with more fragmented DNA were less amplified with MDA and pWGA. pWGA was unable to amplify low molecular weight DNA (microbial communities in low-biomass environments and for currently planned astrobiological missions to Mars. © 2013 Society for Applied Microbiology and John Wiley & Sons Ltd.

  8. AKT1, LKB1, and YAP1 revealed as MYC interactors with NanoLuc-based protein-fragment complementation assay.

    Science.gov (United States)

    The c-Myc (MYC) transcription factor is a major cancer driver and a well-validated therapeutic target. However, directly targeting MYC has been challenging. Thus, identifying proteins that interact with and regulate MYC may provide alternative strategies to inhibit its oncogenic activity. Here we report the development of a NanoLuc®-based protein-fragment complementation assay (NanoPCA) and mapping of the MYC protein interaction hub in live mammalian cells.

  9. Computational medicinal chemistry in fragment-based drug discovery: what, how and when.

    Science.gov (United States)

    Rabal, Obdulia; Urbano-Cuadrado, Manuel; Oyarzabal, Julen

    2011-01-01

    The use of fragment-based drug discovery (FBDD) has increased in the last decade due to the encouraging results obtained to date. In this scenario, computational approaches, together with experimental information, play an important role to guide and speed up the process. By default, FBDD is generally considered as a constructive approach. However, such additive behavior is not always present, therefore, simple fragment maturation will not always deliver the expected results. In this review, computational approaches utilized in FBDD are reported together with real case studies, where applicability domains are exemplified, in order to analyze them, and then, maximize their performance and reliability. Thus, a proper use of these computational tools can minimize misleading conclusions, keeping the credit on FBDD strategy, as well as achieve higher impact in the drug-discovery process. FBDD goes one step beyond a simple constructive approach. A broad set of computational tools: docking, R group quantitative structure-activity relationship, fragmentation tools, fragments management tools, patents analysis and fragment-hopping, for example, can be utilized in FBDD, providing a clear positive impact if they are utilized in the proper scenario - what, how and when. An initial assessment of additive/non-additive behavior is a critical point to define the most convenient approach for fragments elaboration.

  10. A new crystal structure fragment-based pharmacophore method for G protein-coupled receptors

    DEFF Research Database (Denmark)

    Fidom, Kimberley; Isberg, Vignir; Hauser, Alexander Sebastian

    2015-01-01

    and receptor residue pairs, from crystal structure complexes. We describe the procedure to collect a library with more than 250 fragments covering 29 residue positions within the generic transmembrane binding pocket. We describe how the library fragments are recombined and inferred to build pharmacophores for new targets. A validating retrospective virtual screening of histamine H1 and H3 receptor pharmacophores yielded area-under-the-curves of 0.88 and 0.82, respectively. The fragment-based method has the unique advantage that it can be applied to targets for which no (homologous) crystal structures or ligands are known. 47% of the class A G protein-coupled receptors can be targeted with at least four-element pharmacophores. The fragment libraries can also be used to grow known ligands or for rotamer refinement of homology models. Researchers can download the complete fragment library or a subset...

  11. Genome-wide engineering of an infectious clone of herpes simplex virus type 1 using synthetic genomics assembly methods.

    Science.gov (United States)

    Oldfield, Lauren M; Grzesik, Peter; Voorhies, Alexander A; Alperovich, Nina; MacMath, Derek; Najera, Claudia D; Chandra, Diya Sabrina; Prasad, Sanjana; Noskov, Vladimir N; Montague, Michael G; Friedman, Robert M; Desai, Prashant J; Vashee, Sanjay

    2017-10-17

    Here, we present a transformational approach to genome engineering of herpes simplex virus type 1 (HSV-1), which has a large DNA genome, using synthetic genomics tools. We believe this method will enable more rapid and complex modifications of HSV-1 and other large DNA viruses than previous technologies, facilitating many useful applications. Yeast transformation-associated recombination was used to clone 11 fragments comprising the HSV-1 strain KOS 152 kb genome. Using overlapping sequences between the adjacent pieces, we assembled the fragments into a complete virus genome in yeast, transferred it into an Escherichia coli host, and reconstituted infectious virus following transfection into mammalian cells. The virus derived from this yeast-assembled genome, KOS^YA, replicated with kinetics similar to wild-type virus. We demonstrated the utility of this modular assembly technology by making numerous modifications to a single gene, making changes to two genes at the same time and, finally, generating individual and combinatorial deletions to a set of five conserved genes that encode virion structural proteins. While the ability to perform genome-wide editing through assembly methods in large DNA virus genomes raises dual-use concerns, we believe the incremental risks are outweighed by potential benefits. These include enhanced functional studies, generation of oncolytic virus vectors, development of delivery platforms of genes for vaccines or therapy, as well as more rapid development of countermeasures against potential biothreats.

  12. Characterization of large-insert DNA libraries from soil for environmental genomic studies of Archaea

    DEFF Research Database (Denmark)

    Treusch, Alexander H; Kletzin, Arnulf; Raddatz, Guenter

    2004-01-01

    Complex genomic libraries are increasingly being used to retrieve complete genes, operons or large genomic fragments directly from environmental samples, without the need to cultivate the respective microorganisms. We report on the construction of three large-insert fosmid libraries in total...... (approximately 1% each) have been captured in our libraries. The diversity of putative protein-encoding genes, as reflected by their distribution into different COG clusters, was comparable to that encoded in complete genomes of cultivated microorganisms. A huge variety of genomic fragments has been captured...

  13. Algorithms and Complexity Results for Genome Mapping Problems.

    Science.gov (United States)

    Rajaraman, Ashok; Zanetti, Joao Paulo Pereira; Manuch, Jan; Chauve, Cedric

    2017-01-01

    Genome mapping algorithms aim at computing an ordering of a set of genomic markers based on local ordering information such as adjacencies and intervals of markers. In most genome mapping models, markers are assumed to occur uniquely in the resulting map. We introduce algorithmic questions that consider repeats, i.e., markers that can have several occurrences in the resulting map. We show that, provided with an upper bound on the copy number of repeated markers and with intervals that span full repeat copies, called repeat spanning intervals, the problem of deciding if a set of adjacencies and repeat spanning intervals admits a genome representation is tractable if the target genome can contain linear and/or circular chromosomal fragments. We also show that extracting a maximum cardinality or weight subset of repeat spanning intervals given a set of adjacencies that admits a genome realization is NP-hard but fixed-parameter tractable in the maximum copy number and the number of adjacent repeats, and tractable if intervals contain a single repeated marker.

  14. LRSim: A Linked-Reads Simulator Generating Insights for Better Genome Partitioning

    Directory of Open Access Journals (Sweden)

    Ruibang Luo

    Linked-read sequencing, using highly-multiplexed genome partitioning and barcoding, can span hundreds of kilobases to improve de novo assembly, haplotype phasing, and other applications. Based on our analysis of 14 datasets, we introduce LRSim, which simulates linked-reads by emulating the library preparation and sequencing process with fine control over variants, linked-read characteristics, and the short-read profile. From the phasing and assembly of multiple datasets, we derive recommendations on coverage, fragment length, and partitioning when sequencing genomes of different sizes and complexities. These optimizations improve results by orders of magnitude, and enable the development of novel methods. LRSim is available at https://github.com/aquaskyline/LRSIM. Keywords: Linked-read, Molecular barcoding, Reads partitioning, Phasing, Reads simulation, Genome assembly, 10X Genomics

  15. Complete mitochondrial genome and phylogeny of Pleistocene mammoth Mammuthus primigenius.

    Directory of Open Access Journals (Sweden)

    Evgeny I Rogaev

    2006-03-01

    Phylogenetic relationships between the extinct woolly mammoth (Mammuthus primigenius), and the Asian (Elephas maximus) and African savanna (Loxodonta africana) elephants remain unresolved. Here, we report the sequence of the complete mitochondrial genome (16,842 base pairs) of a woolly mammoth extracted from permafrost-preserved remains from the Pleistocene epoch--the oldest mitochondrial genome sequence determined to date. We demonstrate that well-preserved mitochondrial genome fragments, as long as approximately 1,600-1,700 base pairs, can be retrieved from pre-Holocene remains of an extinct species. Phylogenetic reconstruction of the Elephantinae clade suggests that M. primigenius and E. maximus are sister species that diverged soon after their common ancestor split from the L. africana lineage. Low nucleotide diversity found between independently determined mitochondrial genomic sequences of woolly mammoths separated geographically and in time suggests that north-eastern Siberia was occupied by a relatively homogeneous population of M. primigenius throughout the late Pleistocene.

  16. Discovery of novel dengue virus NS5 methyltransferase non-nucleoside inhibitors by fragment-based drug design.

    Science.gov (United States)

    Benmansour, Fatiha; Trist, Iuni; Coutard, Bruno; Decroly, Etienne; Querat, Gilles; Brancale, Andrea; Barral, Karine

    2017-01-05

    To aid drug discovery against dengue virus (DENV), a fragment-based drug design approach was applied to identify ligands targeting a main component of the DENV replication complex: the NS5 AdoMet-dependent mRNA methyltransferase (MTase) domain, which plays an essential role in the RNA capping process. Herein, we describe the identification of new inhibitors developed using fragment-based, structure-guided linking and optimization techniques. A thermal-shift assay followed by a fragment-based X-ray crystallographic screening led to the identification of three fragment hits binding DENV MTase. We considered linking two of them, which bind to proximal sites of the AdoMet binding pocket, in order to improve their potency. X-ray crystallographic structures and computational docking were used to guide the fragment linking, ultimately leading to two novel series of non-nucleoside inhibitors of flavivirus MTase, namely N-phenyl-[(phenylcarbamoyl)amino]benzene-1-sulfonamide and phenyl [(phenylcarbamoyl)amino]benzene-1-sulfonate derivatives, that show a 10-100-fold stronger inhibition of 2'-O-MTase activity compared to the initial fragments. Copyright © 2016 Elsevier Masson SAS. All rights reserved.

  17. Identification of genomic insertion and flanking sequence of G2-EPSPS and GAT transgenes in soybean using whole genome sequencing method

    Directory of Open Access Journals (Sweden)

    Bingfu Guo

    2016-07-01

    Molecular characterization of sequences flanking exogenous fragment insertions is essential for safety assessment and labeling of genetically modified organisms (GMOs). In this study, the T-DNA insertion sites and flanking sequences were identified in two newly developed transgenic glyphosate-tolerant soybeans, GE-J16 and ZH10-6, using a whole genome sequencing (WGS) method. About 21 Gb of sequence data (~21× coverage) for each line was generated on the Illumina HiSeq 2500 platform. The junction reads mapped to the boundaries of T-DNA and flanking sequences in these two events were identified by comparing all sequencing reads with the soybean reference genome and the sequence of the transgenic vector. The putative insertion loci and flanking sequences were further confirmed by PCR amplification, Sanger sequencing, and co-segregation analysis. All these analyses supported that exogenous T-DNA fragments were integrated at positions Chr19: 50543767-50543792 and Chr17: 7980527-7980541 in these two transgenic lines. Identification of the genomic insertion sites of the G2-EPSPS and GAT transgenes will facilitate the use of their glyphosate-tolerant traits in soybean breeding programs. These results also demonstrated that WGS is a cost-effective and rapid method of identifying sites of T-DNA insertions and flanking sequences in soybean.
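    The relation between the sequencing yield and the coverage quoted in this abstract is simple arithmetic: average depth is the number of sequenced bases divided by the genome length. The helper names below are hypothetical and the sketch ignores read filtering and mapping losses.

```python
def sequencing_depth(total_bases, genome_size):
    """Average depth of coverage = sequenced bases / genome length."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return total_bases / genome_size


def implied_genome_size(total_bases, depth):
    """Genome length implied by a given yield and average depth."""
    if depth <= 0:
        raise ValueError("depth must be positive")
    return total_bases / depth
```

    With the figures from the abstract, `implied_genome_size(21e9, 21)` gives a genome of about 1 Gb, which is the order of magnitude one would expect for soybean.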

  18. The use of many-body expansions and geometry optimizations in fragment-based methods.

    Science.gov (United States)

    Fedorov, Dmitri G; Asada, Naoya; Nakanishi, Isao; Kitaura, Kazuo

    2014-09-16

    Conspectus Chemists routinely work with complex molecular systems: solutions, biochemical molecules, and amorphous and composite materials provide some typical examples. The questions one often asks are what are the driving forces for a chemical phenomenon? How reasonable are our views of chemical systems in terms of subunits, such as functional groups and individual molecules? How can one quantify the difference in physicochemical properties of functional units found in a different chemical environment? Are various effects on functional units in molecular systems additive? Can they be represented by pairwise potentials? Are there effects that cannot be represented in a simple picture of pairwise interactions? How can we obtain quantitative values for these effects? Many of these questions can be formulated in the language of many-body effects. They quantify the properties of subunits (fragments), referred to as one-body properties, pairwise interactions (two-body properties), couplings of two-body interactions described by three-body properties, and so on. By introducing the notion of fragments in the framework of quantum chemistry, one obtains two immense benefits: (a) chemists can finally relate to quantum chemistry, which now speaks their language, by discussing chemically interesting subunits and their interactions and (b) calculations become much faster due to a reduced computational scaling. For instance, the somewhat academic sounding question of the importance of three-body effects in water clusters is actually another way of asking how two hydrogen bonds affect each other, when they involve three water molecules. One aspect of this is the many-body charge transfer (CT), because the charge transfers in the two hydrogen bonds are coupled to each other (not independent). In this work, we provide a generalized view on the use of many-body expansions in fragment-based methods, focusing on the general aspects of the property expansion and a contraction of a
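    For orientation, the generic many-body expansion that the one-, two- and three-body language above refers to can be written as follows. This is the textbook form for fragments I, J, K with energies E_I, E_IJ, E_IJK; it is given here as a sketch and omits any embedding or method-specific terms the authors may use.

```latex
\begin{align}
E &= \sum_{I} E_I
   + \sum_{I>J} \Delta E_{IJ}
   + \sum_{I>J>K} \Delta E_{IJK} + \cdots \\
\Delta E_{IJ}  &= E_{IJ} - E_I - E_J \\
\Delta E_{IJK} &= E_{IJK} - E_I - E_J - E_K
                  - \Delta E_{IJ} - \Delta E_{IK} - \Delta E_{JK}
\end{align}
```

    Truncating after the two-body sum corresponds to assuming that pairwise interactions are additive; the three-body term measures how much two pair interactions, such as two hydrogen bonds sharing a water molecule, affect each other.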

  19. An Enumerative Combinatorics Model for Fragmentation Patterns in RNA Sequencing Provides Insights into Nonuniformity of the Expected Fragment Starting-Point and Coverage Profile.

    Science.gov (United States)

    Prakash, Celine; Haeseler, Arndt Von

    2017-03-01

    RNA sequencing (RNA-seq) has emerged as the method of choice for measuring the expression of RNAs in a given cell population. In most RNA-seq technologies, sequencing the full length of RNA molecules requires fragmentation into smaller pieces. Unfortunately, the issue of nonuniform sequencing coverage across a genomic feature has been a concern in RNA-seq and is attributed to biases for certain fragments in RNA-seq library preparation and sequencing. To investigate the expected coverage obtained from fragmentation, we develop a simple fragmentation model that is independent of bias from the experimental method and is not specific to the transcript sequence. Essentially, we enumerate all configurations for maximal placement of a given fragment length, F, on transcript length, T, to represent every possible fragmentation pattern, from which we compute the expected coverage profile across a transcript. We extend this model to incorporate general empirical attributes such as read length, fragment length distribution, and number of molecules of the transcript. We further introduce the fragment starting-point, fragment coverage, and read coverage profiles. We find that the expected profiles are not uniform and that factors such as fragment length to transcript length ratio, read length to fragment length ratio, fragment length distribution, and number of molecules influence the variability of coverage across a transcript. Finally, we explore a potential application of the model where, with simulations, we show that it is possible to correctly estimate the transcript copy number for any transcript in the RNA-seq experiment.
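    Why coverage from fragmentation alone is not uniform can be seen in a much simpler setting than the enumeration described above. The sketch below assumes a single fragment of length F whose start is equally likely at every valid position of a transcript of length T; this is a deliberately reduced model for illustration, not the authors' maximal-placement enumeration, and `expected_coverage` is a hypothetical name.

```python
def expected_coverage(T, F):
    """Probability that each transcript position is covered by one fragment.

    Assumes one fragment of length F with a start drawn uniformly from
    the T - F + 1 valid start positions (0-based coordinates).
    """
    if not 1 <= F <= T:
        raise ValueError("require 1 <= F <= T")
    n_starts = T - F + 1
    coverage = []
    for p in range(T):
        first_start = max(0, p - F + 1)   # earliest start that still covers p
        last_start = min(p, T - F)        # latest start that covers p
        coverage.append((last_start - first_start + 1) / n_starts)
    return coverage
```

    For T = 5 and F = 3 the profile rises from 1/3 at the ends to 1 in the middle, so even this bias-free toy model gives lower coverage near transcript ends, and the shape depends on the ratio of F to T.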

  20. Tropical forest fragmentation affects floral visitors but not the structure of individual-based palm-pollinator networks.

    Science.gov (United States)

    Dáttilo, Wesley; Aguirre, Armando; Quesada, Mauricio; Dirzo, Rodolfo

    2015-01-01

    Despite increasing knowledge about the effects of habitat loss on pollinators in natural landscapes, information is very limited regarding the underlying mechanisms of forest fragmentation affecting plant-pollinator interactions in such landscapes. Here, we used a network approach to describe the effects of forest fragmentation on the patterns of interactions involving the understory dominant palm Astrocaryum mexicanum (Arecaceae) and its floral visitors (including both effective and non-effective pollinators) at the individual level in a Mexican tropical rainforest landscape. Specifically, we asked: (i) Does fragment size affect the structure of individual-based plant-pollinator networks? (ii) Does the core of highly interacting visitor species change along the fragmentation size gradient? (iii) Does forest fragment size influence the abundance of effective pollinators of A. mexicanum? We found that fragment size did not affect the topological structure of the individual-based palm-pollinator network. Furthermore, while the composition of peripheral non-effective pollinators changed depending on fragment size, effective core generalist species of pollinators remained stable. We also observed that both abundance and variance of effective pollinators of male and female flowers of A. mexicanum increased with forest fragment size. These findings indicate that the presence of effective pollinators in the core of all forest fragments could keep the network structure stable along the gradient of forest fragmentation. In addition, pollination of A. mexicanum could be more effective in larger fragments, since the greater abundance of pollinators in these fragments may increase the amount of pollen and diversity of pollen donors between flowers of individual plants. Given the prevalence of fragmentation in tropical ecosystems, our results indicate that the current patterns of land use will have consequences on the underlying mechanisms of pollination in remnant forests.

  1. Tropical forest fragmentation affects floral visitors but not the structure of individual-based palm-pollinator networks.

    Directory of Open Access Journals (Sweden)

    Wesley Dáttilo

    Full Text Available Despite increasing knowledge about the effects of habitat loss on pollinators in natural landscapes, information is very limited regarding the underlying mechanisms of forest fragmentation affecting plant-pollinator interactions in such landscapes. Here, we used a network approach to describe the effects of forest fragmentation on the patterns of interactions involving the understory dominant palm Astrocaryum mexicanum (Arecaceae) and its floral visitors (including both effective and non-effective pollinators) at the individual level in a Mexican tropical rainforest landscape. Specifically, we asked: (i) Does fragment size affect the structure of individual-based plant-pollinator networks? (ii) Does the core of highly interacting visitor species change along the fragmentation size gradient? (iii) Does forest fragment size influence the abundance of effective pollinators of A. mexicanum? We found that fragment size did not affect the topological structure of the individual-based palm-pollinator network. Furthermore, while the composition of peripheral non-effective pollinators changed depending on fragment size, effective core generalist species of pollinators remained stable. We also observed that both abundance and variance of effective pollinators of male and female flowers of A. mexicanum increased with forest fragment size. These findings indicate that the presence of effective pollinators in the core of all forest fragments could keep the network structure stable along the gradient of forest fragmentation. In addition, pollination of A. mexicanum could be more effective in larger fragments, since the greater abundance of pollinators in these fragments may increase the amount of pollen and diversity of pollen donors between flowers of individual plants. Given the prevalence of fragmentation in tropical ecosystems, our results indicate that the current patterns of land use will have consequences on the underlying mechanisms of pollination in

  2. The complete mitochondrial genomes of five Eimeria species infecting domestic rabbits.

    Science.gov (United States)

    Liu, Guo-Hua; Tian, Si-Qin; Cui, Ping; Fang, Su-Fang; Wang, Chun-Ren; Zhu, Xing-Quan

    2015-12-01

    Rabbit coccidiosis caused by members of the genus Eimeria can cause enormous economic impact worldwide, but the genetics, epidemiology and biology of these parasites remain poorly understood. In the present study, we sequenced and annotated the complete mitochondrial (mt) genomes of five Eimeria species that commonly infect domestic rabbits. The complete mt genomes of Eimeria intestinalis, Eimeria flavescens, Eimeria media, Eimeria vejdovskyi and Eimeria irresidua were 6261 bp, 6258 bp, 6168 bp, 6254 bp and 6259 bp in length, respectively. All of the mt genomes consist of 3 genes for proteins (cytb, cox1, and cox3), 14 gene fragments for the large subunit (LSU) rRNA and 11 gene fragments for the small subunit (SSU) rRNA, but no transfer RNA (tRNA) genes. The gene order of the mt genomes is similar to that of Plasmodium, but distinct from Haemosporida and Theileria. Phylogenetic analyses based on full nucleotide sequences using Bayesian analysis revealed that the monophyly of the Eimeria of rabbits was strongly statistically supported by Bayesian posterior probabilities. These data provide novel mtDNA markers for studying the population genetics and molecular epidemiology of the Eimeria species, and should have implications for the molecular diagnosis, prevention and control of coccidiosis in rabbits. Copyright © 2015 Elsevier Inc. All rights reserved.

  3. The Office Software Learning and Examination System Design Based on Fragmented Learning Idea

    Directory of Open Access Journals (Sweden)

    Xu Ling

    2016-01-01

    Full Text Available Fragmented learning segments learning content or learning time so that learners can use fragments of time to study fragmented content; it is characterized by flexible timing, targeted learning and high learning efficiency. Based on the idea of fragmented learning, combined with the teaching ideas of micro classes and interactive teaching, and making comprehensive use of Flash animation design software, the .NET development platform, VSTO technology, multimedia development technology and so on, a system integrating learning, practice and examination for Office software was designed and developed. The system is conducive not only to effective and personalized learning by students, but also to teachers’ understanding of their students’ situation; it frees teachers from heavy mechanical labor so that they can focus on promoting the formation of students’ knowledge systems.

  4. Insights into structural variations and genome rearrangements in prokaryotic genomes.

    Science.gov (United States)

    Periwal, Vinita; Scaria, Vinod

    2015-01-01

    Structural variations (SVs) are genomic rearrangements that affect fairly large fragments of DNA. Most of the SVs such as inversions, deletions and translocations have been largely studied in context of genetic diseases in eukaryotes. However, recent studies demonstrate that genome rearrangements can also have profound impact on prokaryotic genomes, leading to altered cell phenotype. In contrast to single-nucleotide variations, SVs provide a much deeper insight into organization of bacterial genomes at a much better resolution. SVs can confer change in gene copy number, creation of new genes, altered gene expression and many other functional consequences. High-throughput technologies have now made it possible to explore SVs at a much refined resolution in bacterial genomes. Through this review, we aim to highlight the importance of the less explored field of SVs in prokaryotic genomes and their impact. We also discuss its potential applicability in the emerging fields of synthetic biology and genome engineering where targeted SVs could serve to create sophisticated and accurate genome editing. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  5. Using Partial Genomic Fosmid Libraries for Sequencing Complete Organellar Genomes

    Energy Technology Data Exchange (ETDEWEB)

    McNeal, Joel R.; Leebens-Mack, James H.; Arumuganathan, K.; Kuehl, Jennifer V.; Boore, Jeffrey L.; dePamphilis, Claude W.

    2005-08-26

    Organellar genome sequences provide numerous phylogenetic markers and yield insight into organellar function and molecular evolution. These genomes are much smaller in size than their nuclear counterparts; thus, their complete sequencing is much less expensive than total nuclear genome sequencing, making broader phylogenetic sampling feasible. However, for some organisms it is challenging to isolate plastid DNA for sequencing using standard methods. To overcome these difficulties, we constructed partial genomic libraries from total DNA preparations of two heterotrophic and two autotrophic angiosperm species using fosmid vectors. We then used macroarray screening to isolate clones containing large fragments of plastid DNA. A minimum tiling path of clones comprising the entire genome sequence of each plastid was selected, and these clones were shotgun-sequenced and assembled into complete genomes. Although this method worked well for both heterotrophic and autotrophic plants, nuclear genome size had a dramatic effect on the proportion of screened clones containing plastid DNA and, consequently, the overall number of clones that must be screened to ensure full plastid genome coverage. This technique makes it possible to determine complete plastid genome sequences for organisms that defy other available organellar genome sequencing methods, especially those for which limited amounts of tissue are available.
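
    The selection of a minimum tiling path described above can be illustrated with a greedy interval cover. This is only a sketch under stated assumptions, not the authors' procedure: the clone names and coordinates are invented, the plastid genome is treated as linearized, and the function name `minimal_tiling_path` is hypothetical.

```python
def minimal_tiling_path(clones, genome_length):
    """Greedy interval cover over a linearized genome.

    clones: list of (name, start, end) with half-open coordinates.
    Returns the smallest set of clones (in order) that covers
    positions 0..genome_length, or raises ValueError on a gap.
    """
    ordered = sorted(clones, key=lambda c: c[1])
    path = []
    covered_to = 0
    i = 0
    while covered_to < genome_length:
        best = None
        # Among clones starting inside the covered region,
        # keep the one that reaches furthest to the right.
        while i < len(ordered) and ordered[i][1] <= covered_to:
            if best is None or ordered[i][2] > best[2]:
                best = ordered[i]
            i += 1
        if best is None or best[2] <= covered_to:
            raise ValueError(f"gap in coverage at position {covered_to}")
        path.append(best)
        covered_to = best[2]
    return path


# Hypothetical clone coordinates, for illustration only.
example_clones = [
    ("c1", 0, 40), ("c2", 10, 35), ("c3", 30, 90),
    ("c4", 35, 70), ("c5", 80, 120), ("c6", 85, 150),
]
tiling = minimal_tiling_path(example_clones, 150)
print([name for name, _, _ in tiling])
```

    A circular genome would need one extra step (choosing a starting clone and wrapping coordinates), which is left out here to keep the sketch short.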

  6. Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health.

    Science.gov (United States)

    Hazkani-Covo, Einat; Martin, William F

    2017-05-01

    Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
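
    The overestimation problem described above (one insertion later split into several BLAST hits) can be illustrated by merging nearby hits before counting. This is a minimal sketch under assumptions: the hit coordinates, the `max_gap` threshold of 1000 bp and the function name `count_insertion_events` are all hypothetical, and a real analysis would also check that merged hits are collinear on the organelle genome.

```python
def count_insertion_events(hits, max_gap=1000):
    """Merge organelle-like hits that lie close together on a chromosome.

    hits: list of (chrom, start, end) in nuclear coordinates.
    Hits on the same chromosome separated by at most max_gap bases
    are treated as fragments of one insertion event.
    Returns the merged events as (chrom, start, end) tuples.
    """
    events = []
    for chrom, start, end in sorted(hits):
        if events and events[-1][0] == chrom and start - events[-1][2] <= max_gap:
            # Extend the previous event instead of counting a new one.
            prev = events[-1]
            events[-1] = (chrom, prev[1], max(prev[2], end))
        else:
            events.append((chrom, start, end))
    return events


# Hypothetical hits: the first two are fragments of one insertion.
example_hits = [
    ("chr1", 100, 400), ("chr1", 900, 1500),
    ("chr1", 50000, 50300), ("chr2", 200, 700),
]
merged = count_insertion_events(example_hits)
print(len(example_hits), "hits ->", len(merged), "events")
```

    The choice of `max_gap` directly controls the estimate: a value that is too small overcounts fragmented insertions, while a value that is too large merges independent ones.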

  7. CyanoBase: the cyanobacteria genome database update 2010

    OpenAIRE

    Nakao, Mitsuteru; Okamoto, Shinobu; Kohara, Mitsuyo; Fujishiro, Tsunakazu; Fujisawa, Takatomo; Sato, Shusei; Tabata, Satoshi; Kaneko, Takakazu; Nakamura, Yasukazu

    2009-01-01

    CyanoBase (http://genome.kazusa.or.jp/cyanobase) is the genome database for cyanobacteria, which are model organisms for photosynthesis. The database houses cyanobacteria species information, complete genome sequences, genome-scale experiment data, gene information, gene annotations and mutant information. In this version, we updated these datasets and improved the navigation and the visual display of the data views. In addition, a web service API now enables users to retrieve the data in var...

  8. Fragment-based virtual screening approach and molecular dynamics simulation studies for identification of BACE1 inhibitor leads.

    Science.gov (United States)

    Manoharan, Prabu; Ghoshal, Nanda

    2018-05-01

    Traditional structure-based virtual screening to identify drug-like small molecules for BACE1 has so far been unsuccessful. The location of BACE1, poor Blood Brain Barrier permeability and P-glycoprotein (Pgp) susceptibility of the inhibitors make it even more difficult. The fragment-based drug design method is suitable for efficient optimization of initial hit molecules for a target like BACE1. We have developed a fragment-based virtual screening approach to identify/optimize the fragment molecules as a starting point. This method combines the shape, electrostatic, and pharmacophoric features of known fragment molecules, bound to protein conjugate crystal structure, and aims to identify both chemically and energetically feasible small fragment ligands that bind to the BACE1 active site. The two top-ranked fragment hits were subjected to a 53 ns MD simulation. Principal component analysis and free energy landscape analysis reveal that the new ligands show the characteristic features of established BACE1 inhibitors. The potent method employed in this study may serve for the development of potential lead molecules for BACE1-directed Alzheimer's disease therapeutics.

  9. Homology of yeast photoreactivating gene fragment with human genomic digests

    International Nuclear Information System (INIS)

    Meechan, P.J.; Milam, K.M.; Cleaver, J.E.

    1984-01-01

    Enzymatic photoreactivation of UV-induced DNA lesions has been demonstrated for a variety of prokaryotic and eukaryotic organisms. Its presence in placental mammals, however, has not been clearly established. The authors attempted to resolve this question by assaying for the presence (or absence) of sequences in human DNA complementary to a fragment of the photoreactivating gene from S. cerevisiae that has recently been cloned. In another study, DNA from human, chick, E. coli and yeast cells was digested with either HindIII or BglII, electrophoresed on a 0.5% agarose gel, transferred (Southern blot) to a nylon membrane and probed for homology against a Sau3A restriction fragment from S. cerevisiae that complements phr⁻ cells. Hybridization to human DNA digests was observed only under relatively non-stringent conditions, indicating the gene is not conserved in placental mammals. These results are correlated with current literature data concerning photoreactivating enzymes

  10. Screening and identification of male-specific DNA fragments in common carps Cyprinus carpio using suppression subtractive hybridization.

    Science.gov (United States)

    Chen, J J; Du, Q Y; Yue, Y Y; Dang, B J; Chang, Z J

    2010-08-01

    In this study, a sex subtractive genomic DNA library was constructed using suppression subtractive hybridization (SSH) between male and female Cyprinus carpio. Twenty-two clones with distinguishable hybridization signals were selected and sequenced. The specific primers were designed based on the sequence data. Those primers were then used to amplify the sex-specific fragments from the genomic DNA of male and female carp. The amplified fragments from two clones showed specificity to males but not to females, and were named Ccmf2 [387 base pairs (bp)] and Ccmf3 (183 bp), respectively. The sex-specific pattern was analysed in a total of 40 individuals from three other C. carpio stocks and grass carp Ctenopharyngodon idella using Ccmf2 and Ccmf3 as dot-blotting probes. The results revealed that molecular diversity exists on the Y chromosome of C. carpio. No hybridization signals, however, were detected from individuals of C. idella, suggesting that the two sequences are specific to C. carpio. No significant homologous sequences of Ccmf2 and Ccmf3 were found in GenBank. Therefore, the results were interpreted as indicating that Ccmf2 and Ccmf3 are two novel male-specific sequences, and that both fragments could be used as markers to rapidly and accurately identify the genetic sex of part of C. carpio. This may provide a very efficient selective tool for practically breeding monosex female populations in aquacultural production.

  11. Recent progress on perturbative QCD fragmentation functions

    International Nuclear Information System (INIS)

    Cheung, K.

    1995-05-01

    The recent development of perturbative QCD (PQCD) fragmentation functions has a strong impact on quarkonium production. I shall summarize B_c meson production based on these PQCD fragmentation functions, as well as the highlights of some recent activities on applying these PQCD fragmentation functions to explain anomalous J/ψ and ψ' production at the Tevatron. Finally, I discuss a fragmentation model based on the PQCD fragmentation functions for heavy quarks fragmenting into heavy-light mesons

  12. Mass spectrometry for fragment screening.

    Science.gov (United States)

    Chan, Daniel Shiu-Hin; Whitehouse, Andrew J; Coyne, Anthony G; Abell, Chris

    2017-11-08

    Fragment-based approaches in chemical biology and drug discovery have been widely adopted worldwide in both academia and industry. Fragment hits tend to interact weakly with their targets, necessitating the use of sensitive biophysical techniques to detect their binding. Common fragment screening techniques include differential scanning fluorimetry (DSF) and ligand-observed NMR. Validation and characterization of hits is usually performed using a combination of protein-observed NMR, isothermal titration calorimetry (ITC) and X-ray crystallography. In this context, MS is a relatively underutilized technique in fragment screening for drug discovery. MS-based techniques have the advantage of high sensitivity, low sample consumption and being label-free. This review highlights recent examples of the emerging use of MS-based techniques in fragment screening. © 2017 The Author(s). Published by Portland Press Limited on behalf of the Biochemical Society.

  13. CyanoBase: the cyanobacteria genome database update 2010.

    Science.gov (United States)

    Nakao, Mitsuteru; Okamoto, Shinobu; Kohara, Mitsuyo; Fujishiro, Tsunakazu; Fujisawa, Takatomo; Sato, Shusei; Tabata, Satoshi; Kaneko, Takakazu; Nakamura, Yasukazu

    2010-01-01

    CyanoBase (http://genome.kazusa.or.jp/cyanobase) is the genome database for cyanobacteria, which are model organisms for photosynthesis. The database houses cyanobacteria species information, complete genome sequences, genome-scale experiment data, gene information, gene annotations and mutant information. In this version, we updated these datasets and improved the navigation and the visual display of the data views. In addition, a web service API now enables users to retrieve the data in various formats with other tools, seamlessly.

  14. FragIt: a tool to prepare input files for fragment based quantum chemical calculations.

    Directory of Open Access Journals (Sweden)

    Casper Steinmann

    Full Text Available Near linear scaling fragment based quantum chemical calculations are becoming increasingly popular for treating large systems with high accuracy and are an active field of research. However, it remains difficult to set up these calculations without expert knowledge. To facilitate the use of such methods, software tools need to be available to support these methods and help to set up reasonable input files which will lower the barrier of entry for usage by non-experts. Previous tools rely on specific annotations in structure files for automatic and successful fragmentation, such as residues in PDB files. We present a general fragmentation methodology and accompanying tools called FragIt to help set up these calculations. FragIt uses the SMARTS language to locate chemically appropriate fragments in large structures and is applicable to fragmentation of any molecular system given suitable SMARTS patterns. We present SMARTS patterns of fragmentation for proteins, DNA and polysaccharides, specifically for D-galactopyranose for use in cyclodextrins. FragIt is used to prepare input files for the Fragment Molecular Orbital method in the GAMESS program package, but can be extended to other computational methods easily.

  15. A role for fragment-based drug design in developing novel lead compounds for central nervous system targets

    Directory of Open Access Journals (Sweden)

    Michael J. Wasko

    2015-09-01

    Full Text Available Hundreds of millions of U.S. dollars are invested in the research and development of a single drug. Lead compound development is an area ripe for new design strategies. Therapeutic lead candidates have been traditionally found using high-throughput in vitro pharmacologic screening, a costly method for assaying thousands of compounds. This approach has recently been augmented by virtual screening, which employs computer models of the target protein to narrow the search for possible leads. A variant of virtual screening is fragment-based drug design, an emerging in silico lead discovery method that introduces low molecular weight fragments, rather than intact compounds, into the binding pocket of the receptor model. These fragments serve as starting points for growing the lead candidate. Current efforts in virtual fragment-based drug design within central nervous system (CNS) targets are reviewed, as is a recent rule-based optimization strategy in which new molecules are generated within a 3D receptor binding pocket using the fragment as a scaffold. This process places special emphasis on creating synthesizable molecules but also exposes computational questions worth addressing. Fragment-based methods provide a viable, relatively low-cost alternative for therapeutic lead discovery and optimization that can be applied to CNS targets to augment current design strategies.

  16. Why close a bacterial genome? The plasmid of Alteromonas macleodii HOT1A3 is a vector for inter-specific transfer of a flexible genomic island

    Directory of Open Access Journals (Sweden)

    Eduard Fadeev

    2016-03-01

    Full Text Available Genome sequencing is rapidly becoming a staple technique in environmental and clinical microbiology, yet computational challenges still remain, leading to many draft genomes which are typically fragmented into many contigs. We sequenced and completely assembled the genome of a marine heterotrophic bacterium, Alteromonas macleodii HOT1A3, and compared its full genome to several draft genomes obtained using different reference-based and de-novo methods. In general, the de-novo assemblies clearly outperformed the reference-based or hybrid ones, covering >99% of the genes and representing essentially all of the gene functions. However, only the fully closed genome (~4.5 Mbp) allowed us to identify the presence of a large, 148 kbp plasmid, pAM1A3. While HOT1A3 belongs to Alteromonas macleodii, typically found in surface waters (surface ecotype), this plasmid consists of an almost complete flexible genomic island, containing many genes involved in metal resistance previously identified in the genomes of Alteromonas mediterranea (deep ecotype). Indeed, similar to A. mediterranea, A. macleodii HOT1A3 grows at concentrations of zinc, mercury and copper that are inhibitory for other A. macleodii strains. The presence of a plasmid encoding almost an entire flexible genomic island suggests that wholesale genomic exchange between heterotrophic marine bacteria belonging to related but ecologically different populations is not uncommon.

  17. Generation of Polar Semi-Saturated Bicyclic Pyrazoles for Fragment-Based Drug Discovery Campaigns.

    Science.gov (United States)

    Luise, Nicola; Wyatt, Paul

    2018-05-07

    Synthesising polar semi-saturated bicyclic heterocycles can lead to better starting points for fragment-based drug discovery (FBDD) programs. This communication highlights the application of diverse chemistry to construct bicyclic systems from a common intermediate, where pyrazole, a privileged heteroaromatic able to bind effectively to biological targets, is fused to diverse saturated counterparts. The generated fragments can be further developed either after confirmation of their binding pose or early in the process, as their synthetic intermediates. Essential quality control (QC) for selection of small molecules to add to a fragment library is discussed. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  18. Genomic treasure troves: complete genome sequencing of herbarium and insect museum specimens.

    Science.gov (United States)

    Staats, Martijn; Erkens, Roy H J; van de Vossenberg, Bart; Wieringa, Jan J; Kraaijeveld, Ken; Stielow, Benjamin; Geml, József; Richardson, James E; Bakker, Freek T

    2013-01-01

    Unlocking the vast genomic diversity stored in natural history collections would create unprecedented opportunities for genome-scale evolutionary, phylogenetic, domestication and population genomic studies. Many researchers have been discouraged from using historical specimens in molecular studies because of both generally limited success of DNA extraction and the challenges associated with PCR-amplifying highly degraded DNA. In today's next-generation sequencing (NGS) world, opportunities and prospects for historical DNA have changed dramatically, as most NGS methods are actually designed for taking short fragmented DNA molecules as templates. Here we show that using a standard multiplex and paired-end Illumina sequencing approach, genome-scale sequence data can be generated reliably from dry-preserved plant, fungal and insect specimens collected up to 115 years ago, and with minimal destructive sampling. Using a reference-based assembly approach, we were able to produce the entire nuclear genome of a 43-year-old Arabidopsis thaliana (Brassicaceae) herbarium specimen with high and uniform sequence coverage. Nuclear genome sequences of three fungal specimens of 22-82 years of age (Agaricus bisporus, Laccaria bicolor, Pleurotus ostreatus) were generated with 81.4-97.9% exome coverage. Complete organellar genome sequences were assembled for all specimens. Using de novo assembly we retrieved between 16.2-71.0% of coding sequence regions, and hence remain somewhat cautious about prospects for de novo genome assembly from historical specimens. Non-target sequence contaminations were observed in 2 of our insect museum specimens. We anticipate that future museum genomics projects will perhaps not generate entire genome sequences in all cases (our specimens contained relatively small and low-complexity genomes), but at least generating vital comparative genomic data for testing (phylo)genetic, demographic and genetic hypotheses, that become increasingly more horizontal

  19. Detection of Alicyclobacillus species in fruit juice using a random genomic DNA microarray chip.

    Science.gov (United States)

    Jang, Jun Hyeong; Kim, Sun-Joong; Yoon, Bo Hyun; Ryu, Jee-Hoon; Gu, Man Bock; Chang, Hyo-Ihl

    2011-06-01

    This study describes a method using a DNA microarray chip to rapidly and simultaneously detect Alicyclobacillus species in orange juice based on the hybridization of genomic DNA with random probes. Three food spoilage bacteria were used in this study: Alicyclobacillus acidocaldarius, Alicyclobacillus acidoterrestris, and Alicyclobacillus cycloheptanicus. The three Alicyclobacillus species were adjusted to 2 × 10^3 CFU/ml and inoculated into pasteurized 100% pure orange juice. Cy5-dCTP labeling was used for reference signals, and Cy3-dCTP was used to label target genomic DNA. A 1:1 molar ratio of Cy3-dCTP to Cy5-dCTP was used. DNA microarray chips were fabricated using randomly fragmented DNA of Alicyclobacillus spp. and were hybridized with genomic DNA extracted from Bacillus spp. Genomic DNA extracted from Alicyclobacillus spp. showed a significantly higher hybridization rate compared with DNA of Bacillus spp., thereby distinguishing Alicyclobacillus spp. from Bacillus spp. The results showed that the microarray DNA chip containing randomly fragmented genomic DNA was specific and clearly identified specific food spoilage bacteria. This microarray system is a good tool for rapid and specific detection of thermophilic spoilage bacteria, mainly Alicyclobacillus spp., and is useful and applicable to the fruit juice industry.

  20. Toward a physical map of the genome of the nematode Caenorhabditis elegans

    International Nuclear Information System (INIS)

    Coulson, A.; Sulston, J.; Brenner, S.; Karn, J.

    1986-01-01

    A technique for digital characterization and comparison of DNA fragments, using restriction enzymes, is described. The technique is being applied to fragments from the nematode Caenorhabditis elegans (i) to facilitate cross-indexing of clones emanating from different laboratories and (ii) to construct a physical map of the genome. Eight hundred sixty clusters of clones, from 35 to 350 kilobases long and totaling about 60% of the genome, have been characterized
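
    The idea of characterizing and comparing DNA fragments digitally by restriction digestion can be sketched in a few lines. This is a toy illustration, not the authors' protocol: it digests a linear sequence in silico at the HindIII recognition site (AAGCTT, cut after the first A), records fragment lengths as a "fingerprint", and counts lengths shared between two fingerprints. The function names, the `tolerance` parameter and the example sequence are assumptions.

```python
def digest(seq, site="AAGCTT", cut_offset=1):
    """Return sorted fragment lengths after cutting a linear sequence.

    The sequence is cut cut_offset bases into every occurrence of
    the recognition site (HindIII cuts A^AGCTT, so the offset is 1).
    """
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return sorted(b - a for a, b in zip(bounds, bounds[1:]))


def shared_bands(lengths_a, lengths_b, tolerance=0):
    """Count fragment lengths that match between two fingerprints."""
    a, b = sorted(lengths_a), sorted(lengths_b)
    i = j = shared = 0
    while i < len(a) and j < len(b):
        if abs(a[i] - b[j]) <= tolerance:
            shared += 1
            i += 1
            j += 1
        elif a[i] < b[j]:
            i += 1
        else:
            j += 1
    return shared


# Made-up sequence containing two HindIII sites.
example_seq = "GG" + "AAGCTT" + "CCCC" + "AAGCTT" + "TT"
fingerprint = digest(example_seq)
print(fingerprint, shared_bands(fingerprint, [7, 10, 15]))
```

    Two clones that share many fragment lengths are candidates for overlap; in practice the measurement error of gel-based sizing would be reflected in a non-zero `tolerance`.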

  1. Reconstruction of Banknote Fragments Based on Keypoint Matching Method.

    Science.gov (United States)

    Gwo, Chih-Ying; Wei, Chia-Hung; Li, Yue; Chiu, Nan-Hsing

    2015-07-01

    Banknotes may be shredded by a scrap machine, ripped up by hand, or damaged in accidents. This study proposes an image registration method for reconstruction of multiple sheets of banknotes. The proposed method first constructs different scale spaces to identify keypoints in the underlying banknote fragments. Next, the features of those keypoints are extracted to represent their local patterns around keypoints. Then, similarity is computed to find the keypoint pairs between the fragment and the reference banknote. The banknote fragments can determine the coordinate and amend the orientation. Finally, an assembly strategy is proposed to piece multiple sheets of banknote fragments together. Experimental results show that the proposed method causes, on average, a deviation of 0.12457 ± 0.12810° for each fragment while the SIFT method deviates 1.16893 ± 2.35254° on average. The proposed method not only reconstructs the banknotes but also decreases the computing cost. Furthermore, the proposed method can estimate relatively precisely the orientation of the banknote fragments to assemble. © 2015 American Academy of Forensic Sciences.

  2. GI-SVM: A sensitive method for predicting genomic islands based on unannotated sequence of a single genome.

    Science.gov (United States)

    Lu, Bingxin; Leong, Hon Wai

    2016-02-01

    Genomic islands (GIs) are clusters of functionally related genes acquired by lateral genetic transfer (LGT), and they are present in many bacterial genomes. GIs are extremely important for bacterial research, because they not only promote genome evolution but also contain genes that enhance adaptation and enable antibiotic resistance. Many methods have been proposed to predict GIs, but most of them rely on either annotations or comparisons with other closely related genomes. Hence these methods cannot be easily applied to new genomes. As the number of newly sequenced bacterial genomes rapidly increases, there is a need for methods to detect GIs based solely on sequences of a single genome. In this paper, we propose a novel method, GI-SVM, to predict GIs given only the unannotated genome sequence. GI-SVM is based on a one-class support vector machine (SVM), utilizing composition bias in terms of k-mer content. From our evaluations on three real genomes, GI-SVM can achieve higher recall compared with current methods, without much loss of precision. Besides, GI-SVM allows flexible parameter tuning to get optimal results for each genome. In short, GI-SVM provides a more sensitive method for researchers interested in a first-pass detection of GIs in newly sequenced genomes.
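
    The notion of composition bias in k-mer content can be illustrated with a small sliding-window sketch. Note that this does not implement GI-SVM: the one-class SVM is replaced here by a plain Euclidean distance between each window's k-mer profile and the genome-wide profile, and the window size, step, k and function names are assumptions chosen for a toy sequence.

```python
from collections import Counter
from itertools import product
import math


def kmer_profile(seq, k=2):
    """Normalized frequency vector over all 4**k DNA k-mers."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[m] for m in kmers) or 1  # avoid division by zero
    return [counts[m] / total for m in kmers]


def window_scores(genome, window=50, step=50, k=2):
    """Score each window by its distance from the genome-wide profile.

    Returns a list of (window_start, distance); larger distances mean
    a more atypical k-mer composition.
    """
    background = kmer_profile(genome, k)
    scores = []
    for start in range(0, len(genome) - window + 1, step):
        profile = kmer_profile(genome[start:start + window], k)
        scores.append((start, math.dist(profile, background)))
    return scores


# Toy "genome": AT-rich background with a GC-rich block at position 200.
toy_genome = "AT" * 100 + "GC" * 25 + "AT" * 100
scores = window_scores(toy_genome)
most_atypical = max(scores, key=lambda s: s[1])
print(most_atypical[0])
```

    A one-class classifier, as used by GI-SVM, would learn the boundary of "typical" composition from the windows themselves instead of relying on a single distance to the mean profile.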

  3. Visualization of genome signatures of eukaryote genomes by batch-learning self-organizing map with a special emphasis on Drosophila genomes.

    Science.gov (United States)

    Abe, Takashi; Hamano, Yuta; Ikemura, Toshimichi

    2014-01-01

    A strategy of evolutionary studies that can compare vast numbers of genome sequences is becoming increasingly important with the remarkable progress of high-throughput DNA sequencing methods. We previously established a sequence alignment-free clustering method "BLSOM" for di-, tri-, and tetranucleotide compositions in genome sequences, which can characterize sequence characteristics (genome signatures) of a wide range of species. In the present study, we generated BLSOMs for tetra- and pentanucleotide compositions in approximately one million sequence fragments derived from 101 eukaryotes, for which almost complete genome sequences were available. BLSOM recognized phylotype-specific characteristics (e.g., key combinations of oligonucleotide frequencies) in the genome sequences, permitting phylotype-specific clustering of the sequences without any information regarding the species. In our detailed examination of 12 Drosophila species, the correlation between their phylogenetic classification and the classification on the BLSOMs was observed to visualize oligonucleotides diagnostic for species-specific clustering.

  4. Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology

    DEFF Research Database (Denmark)

    Cao, Hongzhi; Hastie, Alex R.; Cao, Dandan

    2014-01-01

    mutations; however, none of the current detection methods are comprehensive, and currently available methodologies are incapable of providing sufficient resolution and unambiguous information across complex regions in the human genome. To address these challenges, we applied a high-throughput, cost......-effective genome mapping technology to comprehensively discover genome-wide SVs and characterize complex regions of the YH genome using long single molecules (>150 kb) in a global fashion. RESULTS: Utilizing nanochannel-based genome mapping technology, we obtained 708 insertions/deletions and 17 inversions larger...... fosmid data. Of the remaining 270 SVs, 260 are insertions and 213 overlap known SVs in the Database of Genomic Variants. Overall, 609 out of 666 (90%) variants were supported by experimental orthogonal methods or historical evidence in public databases. At the same time, genome mapping also provides...

  5. Genomic DNA sequence and cytosine methylation changes of adult rice leaves after seeds space flight

    Science.gov (United States)

    Shi, Jinming

    In this study, cytosine methylation at CCGG sites and genomic DNA sequence changes in adult rice leaves after seed space flight were detected by the methylation-sensitive amplification polymorphism (MSAP) and amplified fragment length polymorphism (AFLP) techniques, respectively. Rice seeds were planted in the trial field after a 4-day space flight on China's Shenzhou-6 spaceship. Adult leaves of space-treated rice, including 8 plants chosen randomly and 2 plants with phenotypic mutations, were used for AFLP and MSAP analysis. Polymorphisms in both DNA sequence and cytosine methylation were detected. For MSAP analysis, the average polymorphic frequencies of the on-ground controls, space-treated plants and mutants were 1.3%, 3.1% and 11%, respectively. For AFLP analysis, the average polymorphic frequencies were 1.4%, 2.9% and 8%, respectively. In total, 27 and 22 polymorphic fragments were cloned and sequenced from the MSAP and AFLP analyses, respectively. Nine of the 27 fragments from MSAP analysis show homology to coding sequence. Of the 22 polymorphic fragments from AFLP analysis, none shows homology to mRNA sequence and eight show homology to repeat regions or retrotransposon sequences. These results suggest that although both genomic DNA sequence and cytosine methylation status can be affected by space flight, the genomic regions homologous to the fragments from the DNA sequence and cytosine methylation analyses were different.

  6. The evolutionary value of recombination is constrained by genome modularity.

    Directory of Open Access Journals (Sweden)

    Darren P Martin

    2005-10-01

    Genetic recombination is a fundamental evolutionary mechanism promoting biological adaptation. Using engineered recombinants of the small single-stranded DNA plant virus, Maize streak virus (MSV), we experimentally demonstrate that fragments of genetic material only function optimally if they reside within genomes similar to those in which they evolved. The degree of similarity necessary for optimal functionality is correlated with the complexity of intragenomic interaction networks within which genome fragments must function. There is a striking correlation between our experimental results and the types of MSV recombinants that are detectable in nature, indicating that obligatory maintenance of intragenome interaction networks strongly constrains the evolutionary value of recombination for this virus and probably for genomes in general.

  7. Highly sensitive strain sensors based on fragmentized carbon nanotube/polydimethylsiloxane composites

    Science.gov (United States)

    Gao, Yang; Fang, Xiaoliang; Tan, Jianping; Lu, Ting; Pan, Likun; Xuan, Fuzhen

    2018-06-01

    Wearable strain sensors based on nanomaterial/elastomer composites have potential applications in flexible electronic skin, human motion detection, human–machine interfaces, etc. In this research, a type of high-performance strain sensor has been developed using fragmentized carbon nanotube/polydimethylsiloxane (CNT/PDMS) composites. The CNT/PDMS composites were ground into fragments, and a liquid-induced densification method was used to fabricate the strain sensors. The strain sensors showed high sensitivity with gauge factors (GFs) larger than 200 and a broad strain detection range up to 80%, much higher than those of strain sensors based on unfragmentized CNT/PDMS composites (GF composite particles during mechanical deformation, which causes significant resistance change in the strain sensors. The strain sensors can differentiate mechanical stimuli and monitor various human body motions, such as bending of the fingers, human breathing, and blood pulsing.

  8. Human Contamination in Public Genome Assemblies.

    Science.gov (United States)

    Kryukov, Kirill; Imanishi, Tadashi

    2016-01-01

    Contamination in a genome assembly can lead to wrong or confusing results when such a genome is used as a reference in sequence comparison. Although bacterial contamination is well known, the problem of human-originated contamination has received little attention. In this study we surveyed 45,735 available genome assemblies for evidence of human contamination. We used lineage specificity to distinguish between contamination and conservation. We found that 154 genome assemblies contain fragments that, with high confidence, originate as contamination from human DNA. The majority of contaminating human sequences were present in the reference human genome assembly for over a decade. We recommend that existing contaminated genomes be revised to remove contaminated sequence, and that new assemblies be thoroughly checked for the presence of human DNA before they are submitted to public databases.

  9. DNABIT Compress - Genome compression algorithm.

    Science.gov (United States)

    Rajarajeswari, Pothuraju; Apparao, Allam

    2011-01-22

    Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, "DNABIT Compress", for DNA sequences, based on a novel scheme of assigning binary bits to smaller segments of DNA bases in order to compress both repetitive and non-repetitive DNA sequences. Our proposed algorithm achieves the best compression ratio for DNA sequences of larger genomes. Significantly better compression results show that the "DNABIT Compress" algorithm outperforms the other compression algorithms. While achieving the best compression ratios for DNA sequences (genomes), our new DNABIT Compress algorithm significantly improves on the running time of all previous DNA compression programs. Assigning binary bits (a unique bit code) to exact-repeat and reverse-repeat fragments of a DNA sequence is also a concept introduced in this algorithm for the first time in DNA compression. The proposed algorithm achieves a compression ratio as low as 1.58 bits/base, whereas the existing best methods could not achieve a ratio below 1.72 bits/base.
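
    To put the reported figures in context, the sketch below shows the plain fixed-width baseline of 2 bits per base, against which ratios such as 1.58 or 1.72 bits/base represent a gain. It is not the DNABIT Compress algorithm, whose bit codes for repeats are not given in the abstract; the code table and function names are assumptions for illustration.

```python
# Fixed 2-bit code per base (an assumed, conventional assignment).
CODE = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
BASE = "ACGT"

def pack(seq):
    """Pack an ACGT string into bytes, four bases per byte."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODE[base]
        byte <<= 2 * (4 - len(chunk))  # left-align a short final chunk
        out.append(byte)
    return bytes(out)

def unpack(data, length):
    """Recover the first `length` bases from packed bytes."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BASE[(byte >> shift) & 0b11])
    return "".join(bases[:length])

seq = "ACGTTGCAAC"
packed = pack(seq)
print(len(seq), "bases ->", len(packed), "bytes")
print(unpack(packed, len(seq)) == seq)
```

    The original length has to be stored alongside the packed bytes, because the final byte may contain padding. Going below 2 bits per base requires exploiting redundancy such as repeats, which is the part this sketch leaves out.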

  10. Comparative genome analysis of Pseudogymnoascus spp. reveals primarily clonal evolution with small genome fragments exchanged between lineages.

    Science.gov (United States)

    Leushkin, Evgeny V; Logacheva, Maria D; Penin, Aleksey A; Sutormin, Roman A; Gerasimov, Evgeny S; Kochkina, Galina A; Ivanushkina, Natalia E; Vasilenko, Oleg V; Kondrashov, Alexey S; Ozerskaya, Svetlana M

    2015-05-21

    Pseudogymnoascus spp. is a wide group of fungal lineages in the family Pseudorotiaceae, including an aggressive pathogen of bats, P. destructans. Although several lineages of P. spp. were shown to produce ascospores in culture, the vast majority of P. spp. demonstrate no evidence of sexual reproduction. P. spp. can tolerate a wide range of temperatures and salinities and can survive even in the permafrost layer. The adaptability of P. spp. to different environments is accompanied by extremely variable morphology and physiology. We sequenced genotypes of 14 strains of P. spp., 5 of which were extracted from permafrost, 1 from a cryopeg (a layer of unfrozen ground in permafrost), and 8 from temperate surface environments. All sequenced genotypes are haploid. Nucleotide diversity among these genomes is very high, with a typical evolutionary distance at synonymous sites dS ≈ 0.5, suggesting that the last common ancestor of these strains lived >50 Mya. The strains extracted from permafrost do not form a separate clade. Instead, each permafrost strain has close relatives from temperate environments. We observed a strictly clonal population structure with no conflicting topologies for ~99% of genome sequences. However, there are a number of short (~100-10,000 nt) genomic segments, with a total length of 67.6 Kb, which possess phylogenetic patterns strikingly different from the rest of the genome. The most remarkable case is the MAT-locus, which has 2 distinct alleles interspersed along the whole-genome phylogenetic tree. The predominantly clonal structure of genome sequences is consistent with the observation that sexual reproduction is rare in P. spp. The small number of regions with noncanonical phylogenies seems to arise from recombination events between derived lineages of P. spp., with the MAT-locus being transferred on multiple occasions. All sequenced strains have a heterothallic configuration of the MAT-locus.

  11. Monte Carlo simulation as a tool to predict blasting fragmentation based on the Kuz Ram model

    Science.gov (United States)

    Morin, Mario A.; Ficarazzo, Francesco

    2006-04-01

    Rock fragmentation is considered the most important aspect of production blasting because of its direct effects on the costs of drilling and blasting and on the economics of the subsequent operations of loading, hauling and crushing. Over the past three decades, significant progress has been made in the development of new technologies for blasting applications. These technologies include increasingly sophisticated computer models for blast design and blast performance prediction. Rock fragmentation depends on many variables, such as rock mass properties, site geology, in situ fracturing and blasting parameters, and as such has no complete theoretical solution for its prediction. However, empirical models for the estimation of the size distribution of rock fragments have been developed. In this study, a Monte Carlo blast fragmentation simulator, based on the Kuz-Ram fragmentation model, has been developed to predict the entire fragmentation size distribution, taking into account intact rock and joint properties, the type and properties of explosives and the drilling pattern. Results produced by this simulator were quite favorable when compared with real fragmentation data obtained from a blast quarry. It is anticipated that the use of Monte Carlo simulation will increase our understanding of the effects of rock mass and explosive properties on rock fragmentation by blasting, as well as increase our confidence in these empirical models. This understanding will translate into improvements in blasting operations, their corresponding costs and the overall economics of open pit mines and rock quarries.
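
    A Monte Carlo treatment of a fragmentation size distribution can be sketched as follows. The Kuz-Ram model, as it is usually described, pairs an empirical estimate of the median fragment size with a Rosin-Rammler cumulative distribution; only the Rosin-Rammler part is shown here. The input ranges for the median size x50 and the uniformity index n are hypothetical placeholders, not values from the study, and the simulator's actual sampling scheme is not described in the abstract.

```python
import math
import random

def rosin_rammler_passing(x, x50, n):
    """Fraction of material finer than size x (same units as x50)."""
    return 1.0 - math.exp(-math.log(2.0) * (x / x50) ** n)

def simulate_passing(x, trials=10000, seed=0):
    """Propagate uncertain x50 and n into the passing fraction at size x."""
    rng = random.Random(seed)
    results = []
    for _ in range(trials):
        x50 = rng.uniform(0.20, 0.40)  # hypothetical median size range, metres
        n = rng.uniform(1.0, 2.0)      # hypothetical uniformity index range
        results.append(rosin_rammler_passing(x, x50, n))
    results.sort()
    mean = sum(results) / trials
    p10 = results[int(0.10 * trials)]
    p90 = results[int(0.90 * trials)]
    return mean, p10, p90

mean, p10, p90 = simulate_passing(0.5)
print(f"fraction passing 0.5 m: mean={mean:.3f}, p10={p10:.3f}, p90={p90:.3f}")
```

    Repeating the call over a grid of sizes x gives a band of plausible size-distribution curves rather than a single curve, which is the kind of output a Monte Carlo simulator adds over a point estimate.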

  12. GeNemo: a search engine for web-based functional genomic data.

    Science.gov (United States)

    Zhang, Yongqing; Cao, Xiaoyi; Zhong, Sheng

    2016-07-08

    A set of new data types emerged from functional genomic assays, including ChIP-seq, DNase-seq, FAIRE-seq and others. The results are typically stored as genome-wide intensities (WIG/bigWig files) or functional genomic regions (peak/BED files). These data types present new challenges to big data science. Here, we present GeNemo, a web-based search engine for functional genomic data. GeNemo searches user-input data against online functional genomic datasets, including the entire collection of ENCODE and mouse ENCODE datasets. Unlike text-based search engines, GeNemo's searches are based on pattern matching of functional genomic regions. This distinguishes GeNemo from text or DNA sequence searches. The user can input any complete or partial functional genomic dataset, for example, a binding intensity file (bigWig) or a peak file. GeNemo reports any genomic regions, ranging from hundred bases to hundred thousand bases, from any of the online ENCODE datasets that share similar functional (binding, modification, accessibility) patterns. This is enabled by a Markov Chain Monte Carlo-based maximization process, executed on up to 24 parallel computing threads. By clicking on a search result, the user can visually compare her/his data with the found datasets and navigate the identified genomic regions. GeNemo is available at www.genemo.org. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. Note: Primer Amysat 001; Fragment size is 211bp

    Indian Academy of Sciences (India)

    Renuka

    Bhandara : Lanes 1–14 represent different strains of Bhandara Ecorace. Note: Primer Amysat 001; Fragment size is 211bp. Fig. 1. SSR profiles generated from genomic DNA of 16 strains from different individuals of (A.L, D. TV, D. BV, Modal, Sukinda, Raily, Bhandara) ecoraces of tasar silk worm, Antheraea mylitta using the.

  14. Brute-Force Approach for Mass Spectrometry-Based Variant Peptide Identification in Proteogenomics without Personalized Genomic Data

    Science.gov (United States)

    Ivanov, Mark V.; Lobas, Anna A.; Levitsky, Lev I.; Moshkovskii, Sergei A.; Gorshkov, Mikhail V.

    2018-02-01

    In a proteogenomic approach based on tandem mass spectrometry analysis of proteolytic peptide mixtures, customized exome or RNA-seq databases are employed for identifying protein sequence variants. However, the problem of variant peptide identification without personalized genomic data is important for a variety of applications. Following the recent proposal by Chick et al. (Nat. Biotechnol. 33, 743-749, 2015) on the feasibility of such variant peptide search, we evaluated two available approaches based on the previously suggested "open" search and the "brute-force" strategy. To improve the efficiency of these approaches, we propose an algorithm for exclusion of false variant identifications from the search results involving analysis of modifications mimicking single amino acid substitutions. Also, we propose a de novo based scoring scheme for assessment of identified point mutations. In the scheme, the search engine analyzes y-type fragment ions in MS/MS spectra to confirm the location of the mutation in the variant peptide sequence.
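
    How y-type fragment ions can localize a single amino acid substitution is sketched below. This is an illustration under stated assumptions, not the authors' scoring scheme: it computes theoretical singly charged y-ion m/z values from standard monoisotopic residue masses and lists which y ions differ between a reference and a variant peptide. The example peptides and the function names are hypothetical.

```python
# Standard monoisotopic residue masses (Da).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565
PROTON = 1.007276

def y_ions(peptide):
    """Singly charged y-ion m/z values, y1 (C-terminal residue) first."""
    ions = []
    running = WATER + PROTON
    for residue in reversed(peptide):
        running += MONO[residue]
        ions.append(running)
    return ions

def shifted_y_ions(reference, variant):
    """Indices (1-based) of y ions whose m/z differs between two peptides."""
    ref, var = y_ions(reference), y_ions(variant)
    return [i + 1 for i, (a, b) in enumerate(zip(ref, var)) if abs(a - b) > 1e-6]

reference = "PEPTIDEK"  # hypothetical reference peptide
variant = "PEPTVDEK"    # hypothetical variant with an I-to-V substitution
print(shifted_y_ions(reference, variant))
```

    Every y ion that contains the substituted residue shifts by the same mass difference, while shorter y ions stay unchanged, so the first shifted ion marks the position of the substitution counted from the C-terminus.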

  15. A first generation BAC-based physical map of the rainbow trout genome

    Directory of Open Access Journals (Sweden)

    Thorgaard Gary H

    2009-10-01

    Background: Rainbow trout (Oncorhynchus mykiss) are the most widely cultivated cold freshwater fish in the world and an important model species for many research areas. Coupling great interest in this species as a research model with the need for genetic improvement of aquaculture production efficiency traits justifies the continued development of genomics research resources. Many quantitative trait loci (QTL) have been identified for production and life-history traits in rainbow trout. A bacterial artificial chromosome (BAC) physical map is needed to facilitate fine mapping of QTL and the selection of positional candidate genes for incorporation in marker-assisted selection (MAS) for improving rainbow trout aquaculture production. This resource will also facilitate efforts to obtain and assemble a whole-genome reference sequence for this species. Results: The physical map was constructed from DNA fingerprinting of 192,096 BAC clones using the 4-color high-information content fingerprinting (HICF) method. The clones were assembled into physical map contigs using the finger-printing contig (FPC) program. The map is composed of 4,173 contigs and 9,379 singletons. The total number of unique fingerprinting fragments (consensus bands) in contigs is 1,185,157, which corresponds to an estimated physical length of 2.0 Gb. The map assembly was validated by (1) comparison with probe hybridization results and agarose gel fingerprinting contigs; and (2) anchoring large contigs to the microsatellite-based genetic linkage map. Conclusion: The production and validation of the first BAC physical map of the rainbow trout genome is described in this paper. We are currently integrating this map with the NCCCWA genetic map using more than 200 microsatellites isolated from BAC end sequences and by identifying BACs that harbor more than 300 previously mapped markers. The availability of an integrated physical and genetic map will enable detailed comparative genome

  16. Design Principles for Fragment Libraries: Maximizing the Value of Learnings from Pharma Fragment-Based Drug Discovery (FBDD) Programs for Use in Academia.

    Science.gov (United States)

    Keserű, György M; Erlanson, Daniel A; Ferenczy, György G; Hann, Michael M; Murray, Christopher W; Pickett, Stephen D

    2016-09-22

    Fragment-based drug discovery (FBDD) is well suited for discovering both drug leads and chemical probes of protein function; it can cover broad swaths of chemical space and allows the use of creative chemistry. FBDD is widely implemented for lead discovery in industry but is sometimes used less systematically in academia. Design principles and implementation approaches for fragment libraries are continually evolving, and the lack of up-to-date guidance may prevent more effective application of FBDD in academia. This Perspective explores many of the theoretical, practical, and strategic considerations that occur within FBDD programs, including the optimal size, complexity, physicochemical profile, and shape profile of fragments in FBDD libraries, as well as compound storage, evaluation, and screening technologies. This compilation of industry experience in FBDD will hopefully be useful for those pursuing FBDD in academia.

  17. CRISPR/Cas9 based genome editing of Penicillium chrysogenum

    NARCIS (Netherlands)

    Pohl, Carsten; Kiel, Jan A K W; Driessen, Arnold J M; Bovenberg, Roel A L; Nygård, Yvonne

    2016-01-01

    CRISPR/Cas9 based systems have emerged as versatile platforms for precision genome editing in a wide range of organisms. Here we have developed powerful CRISPR/Cas9 tools for marker-based and marker-free genome modifications in Penicillium chrysogenum, a model filamentous fungus and industrially

  18. A web-based multi-genome synteny viewer for customized data

    Directory of Open Access Journals (Sweden)

    Revanna Kashi V

    2012-08-01

    Background: Web-based synteny visualization tools are important for sharing data and revealing patterns of complicated genome conservation and rearrangements. Such tools should allow biologists to upload genomic data for their own analysis. This requirement is critical because individual biologists are generating large amounts of genomic sequence, which quickly overwhelm the capacity of any centralized web resource to collect and display all those data. Recently, we published a web-based synteny viewer, GSV, which was designed to satisfy the above requirement. However, GSV can only compare two genomes at a given time. Extending the functionality of GSV to visualize multiple genomes is important to meet the increasing demand of the research community. Results: We have developed a multi-Genome Synteny Viewer (mGSV). Similar to GSV, mGSV is a web-based tool that allows users to upload their own genomic data files for visualization. Multiple genomes can be presented in a single integrated view with an enhanced user interface. Users can navigate through all the selected genomes in either pairwise or multiple viewing mode to examine conserved genomic regions as well as the accompanying genome annotations. Besides serving users who manually interact with the web server, mGSV also provides Web Services for machine-to-machine communication to accept data sent by other remote resources. The entire mGSV package can also be downloaded for easy local installation. Conclusions: mGSV significantly enhances the original functionalities of GSV. A web server hosting mGSV is provided at http://cas-bioinfo.cas.unt.edu/mgsv.

  19. Genomic comparisons of Brucella spp. and closely related bacteria using base compositional and proteome based methods

    DEFF Research Database (Denmark)

    Bohlin, Jon; Snipen, Lars; Cloeckaert, Axel

    2010-01-01

    BACKGROUND: Classification of bacteria within the genus Brucella has been difficult due in part to considerable genomic homogeneity between the different species and biovars, in spite of clear differences in phenotypes. Therefore, many different methods have been used to assess Brucella taxonomy....... In the current work, we examine 32 sequenced genomes from genus Brucella representing the six classical species, as well as more recently described species, using bioinformatical methods. Comparisons were made at the level of genomic DNA using oligonucleotide based methods (Markov chain based genomic signatures...... between the oligonucleotide based methods used. Whilst the Markov chain based genomic signatures grouped the different species in genus Brucella according to host preference, the codon and amino acid frequencies based methods reflected small differences between the Brucella species. Only minor differences...

  20. The complete mitochondrial genome of the enigmatic bigheaded turtle (Platysternon): description of unusual genomic features and the reconciliation of phylogenetic hypotheses based on mitochondrial and nuclear DNA

    Energy Technology Data Exchange (ETDEWEB)

    Parham, James F.; Feldman, Chris R.; Boore, Jeffrey L.

    2005-12-28

    The big-headed turtle (Platysternon megacephalum) from east Asia is the sole living representative of a poorly-studied turtle lineage (Platysternidae). It has no close living relatives, and its phylogenetic position within turtles is one of the outstanding controversies in turtle systematics. Platysternon was traditionally considered to be close to snapping turtles (Chelydridae) based on some studies of its morphology and mitochondrial (mt) DNA, however, other studies of morphology and nuclear (nu) DNA do not support that hypothesis. We sequenced the complete mt genome of Platysternon and the nearly complete mt genomes of two other relevant turtles and compared them to turtle mt genomes from the literature to form the largest molecular dataset used to date to address this issue. The resulting phylogeny robustly rejects the placement of Platysternon with Chelydridae, but instead shows that it is a member of the Testudinoidea, a diverse, nearly globally-distributed group that includes pond turtles and tortoises. We also discovered that Platysternon mtDNA has large-scale gene rearrangements and possesses two, nearly identical, control regions, features that distinguish it from all other studied turtles. Our study robustly determines the phylogenetic placement of Platysternon and provides a well-resolved outline of major turtle lineages, while demonstrating the significantly greater resolving power of comparing large amounts of mt sequence over that of short fragments. Earlier phylogenies placing Platysternon with chelydrids required a temporal gap in the fossil record that is now unnecessary. The duplicated control regions and gene rearrangements of the Platysternon mt DNA probably resulted from the duplication of part of the genome and then the subsequent loss of redundant genes. Although it is possible that having two control regions may provide some advantage, explaining why the control regions would be maintained while some of the duplicated genes were eroded

  1. Morphological, Genome and Gene Expression Changes in Newly Induced Autopolyploid Chrysanthemum lavandulifolium (Fisch. ex Trautv.) Makino

    Directory of Open Access Journals (Sweden)

    Ri Gao

    2016-10-01

    Autopolyploidy is widespread in higher plants and plays an important role in the process of evolution. The present study successfully induced autotetraploids from Chrysanthemum lavandulifolium using colchicine. The morphological, genomic, transcriptomic, and epigenetic changes between tetraploid and diploid plants were investigated. The ligulate flowers, tubular flowers and leaves of tetraploid plants were larger than those of the diploid plants. Compared with diploid plants, the genome of tetraploid plants changed as a consequence of polyploidization: 1.1% of fragments were lost and 1.6% novel fragments appeared. In addition, DNA methylation increased after genome doubling in tetraploid plants. Among 485 common transcript-derived fragments (TDFs), which existed in both the tetraploids and their diploid progenitors, 62 were differentially expressed: 6.8% of TDFs were up-regulated in the tetraploid plants and 6.0% were down-regulated. The present study provides a reference for further study of the role of autopolyploidization in the evolution of C. lavandulifolium. In conclusion, the autopolyploid C. lavandulifolium showed a global change in morphology, genome and gene expression compared with the corresponding diploid.

  2. [Genome editing of industrial microorganism].

    Science.gov (United States)

    Zhu, Linjiang; Li, Qi

    2015-03-01

    Genome editing is defined as highly effective and precise modification of the cellular genome on a large scale. In recent years, such genome-editing methods have developed rapidly in the field of industrial strain improvement. These quickly evolving methods thoroughly change the old mode of inefficient genetic modification, which is "one modification, one selection marker, and one target site". Highly effective modes of genome editing have been developed, including simultaneous modification of multiple genes; highly effective insertion, replacement, and deletion of target genes at the genome scale; and cut-and-paste of large DNA fragments. These new tools for microbial genome editing will certainly be applied widely, increase the efficiency of industrial strain improvement, and promote the transformation of the traditional fermentation industry and the rapid development of novel industrial biotechnology such as the production of biofuels and biomaterials. The technological principles of these genome-editing methods and their applications are summarized in this review, which can benefit the engineering and construction of industrial microorganisms.

  3. AutoDrug: fully automated macromolecular crystallography workflows for fragment-based drug discovery

    International Nuclear Information System (INIS)

    Tsai, Yingssu; McPhillips, Scott E.; González, Ana; McPhillips, Timothy M.; Zinn, Daniel; Cohen, Aina E.; Feese, Michael D.; Bushnell, David; Tiefenbrunn, Theresa; Stout, C. David; Ludaescher, Bertram; Hedman, Britt; Hodgson, Keith O.; Soltis, S. Michael

    2013-01-01

    New software has been developed for automating the experimental and data-processing stages of fragment-based drug discovery at a macromolecular crystallography beamline. A new workflow-automation framework orchestrates beamline-control and data-analysis software while organizing results from multiple samples. AutoDrug is software based upon the scientific workflow paradigm that integrates the Stanford Synchrotron Radiation Lightsource macromolecular crystallography beamlines and third-party processing software to automate the crystallography steps of the fragment-based drug-discovery process. AutoDrug screens a cassette of fragment-soaked crystals, selects crystals for data collection based on screening results and user-specified criteria and determines optimal data-collection strategies. It then collects and processes diffraction data, performs molecular replacement using provided models and detects electron density that is likely to arise from bound fragments. All processes are fully automated, i.e. are performed without user interaction or supervision. Samples can be screened in groups corresponding to particular proteins, crystal forms and/or soaking conditions. A single AutoDrug run is only limited by the capacity of the sample-storage dewar at the beamline: currently 288 samples. AutoDrug was developed in conjunction with RestFlow, a new scientific workflow-automation framework. RestFlow simplifies the design of AutoDrug by managing the flow of data and the organization of results and by orchestrating the execution of computational pipeline steps. It also simplifies the execution and interaction of third-party programs and the beamline-control system. Modeling AutoDrug as a scientific workflow enables multiple variants that meet the requirements of different user groups to be developed and supported. A workflow tailored to mimic the crystallography stages comprising the drug-discovery pipeline of CoCrystal Discovery Inc. has been deployed and successfully

  4. Ricebase: a breeding and genetics platform for rice, integrating individual molecular markers, pedigrees and whole-genome-based data.

    Science.gov (United States)

    Edwards, J D; Baldo, A M; Mueller, L A

    2016-01-01

    Ricebase (http://ricebase.org) is an integrative genomic database for rice (Oryza sativa) with an emphasis on combining datasets in a way that maintains the key links between past and current genetic studies. Ricebase includes DNA sequence data, gene annotations, nucleotide variation data and molecular marker fragment size data. Rice research has benefited from early adoption and extensive use of simple sequence repeat (SSR) markers; however, the majority of rice SSR markers were developed prior to the latest rice pseudomolecule assembly. Interpretation of new research using SNPs in the context of literature citing SSRs requires a common coordinate system. A new pipeline, using a stepwise relaxation of stringency, was used to map SSR primers onto the latest rice pseudomolecule assembly. The SSR markers and experimentally assayed amplicon sizes are presented in a relational database with a web-based front end, and are available as a track loaded in a genome browser with links connecting the browser and database. The combined capabilities of Ricebase link genetic markers, genome context, allele states across rice germplasm and potentially user curated phenotypic interpretations as a community resource for genetic discovery and breeding in rice. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the United States.

  5. A Trichosporonales genome tree based on 27 haploid and three evolutionarily conserved 'natural' hybrid genomes.

    Science.gov (United States)

    Takashima, Masako; Sriswasdi, Sira; Manabe, Ri-Ichiroh; Ohkuma, Moriya; Sugita, Takashi; Iwasaki, Wataru

    2018-01-01

    To construct a backbone tree consisting of basidiomycetous yeasts, draft genome sequences from 25 species of Trichosporonales (Tremellomycetes, Basidiomycota) were generated. In addition to the hybrid genomes of Trichosporon coremiiforme and Trichosporon ovoides that we described previously, we identified an interspecies hybrid genome in Cutaneotrichosporon mucoides (formerly Trichosporon mucoides). This hybrid genome had a gene retention rate of ~55%, and its closest haploid relative was Cutaneotrichosporon dermatis. After constructing the C. mucoides subgenomes, we generated a phylogenetic tree using genome data from the 27 haploid species and the subgenome data from the three hybrid genome species. It was a high-quality tree with 100% bootstrap support for all of the branches. The genome-based tree provided superior resolution compared with previous multi-gene analyses. Although our backbone tree does not include all Trichosporonales genera (e.g. Cryptotrichosporon), it will be valuable for future analyses of genome data. Interest in interspecies hybrid fungal genomes has recently increased because they may provide a basis for new technologies. The three Trichosporonales hybrid genomes described in this study are different from well-characterized hybrid genomes (e.g. those of Saccharomyces pastorianus and Saccharomyces bayanus) because these hybridization events probably occurred in the distant evolutionary past. Hence, they will be useful for studying genome stability following hybridization and speciation events. Copyright © 2017 John Wiley & Sons, Ltd.

  6. [Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].

    Science.gov (United States)

    Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y

    2017-08-01

    The aims were to sequence the whole human mitochondrial DNA (mtDNA) genome on the Ion Torrent PGM™ platform and to study differences in mtDNA sequence among tissues. Samples were collected from 6 unrelated individuals at forensic postmortem examination and included chest blood, hair, costal cartilage, nail, skeletal muscle and oral epithelium. The whole mtDNA genome was amplified with 4 pairs of primers. Libraries were constructed with the Ion Shear™ Plus Reagents kit and the Ion Plus Fragment Library kit. Whole genome sequencing of mtDNA was performed on the Ion Torrent PGM™ platform. Sanger sequencing was used to determine the heteroplasmy positions and the mutation positions in the HVⅠ region. The whole mtDNA genome was amplified successfully from all samples. The six unrelated individuals belonged to 6 different haplotypes. Different tissues from the same individual showed differences in heteroplasmy. The heteroplasmy positions and the mutation positions in the HVⅠ region were verified by Sanger sequencing. A consistency check by the Kappa method showed that the mtDNA sequencing results were highly consistent across tissues. The method used in the present study for sequencing the whole human mtDNA genome can detect heteroplasmy differences between tissues, and its results are highly consistent. The results provide guidance for further applications of mtDNA in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine

  7. The diploid genome sequence of an individual human.

    Directory of Open Access Journals (Sweden)

    Samuel Levy

    2007-09-01

    Presented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2-206 bp), 292,102 heterozygous insertion/deletion events (indels) (1-571 bp), 559,473 homozygous indels (1-82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor; however, these events involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.

  8. Characterizing GEO Titan IIIC Transtage Fragmentations Using Ground-based and Telescopic Measurements

    Science.gov (United States)

    Cowardin, H.; Anz-Meador, P.; Reyes, J. A.

    In a continued effort to better characterize the geosynchronous orbit (GEO) environment, NASA’s Orbital Debris Program Office (ODPO) utilizes various ground-based optical assets to acquire photometric and spectral data of known debris associated with fragmentations in or near GEO. The Titan IIIC Transtage upper stage is known to have fragmented four times. Two of the four fragmentations were in GEO, while the Transtage fragmented a third time in GEO transfer orbit. The fourth fragmentation occurred in low Earth orbit. To better assess and characterize these fragmentations, the NASA ODPO acquired a Titan Transtage test and display article previously in the custody of the 309th Aerospace Maintenance and Regeneration Group (AMARG) in Tucson, Arizona. After initial inspections at AMARG demonstrated that it was of sufficient fidelity to be of interest, the test article was brought to NASA Johnson Space Center (JSC) to continue material analysis and historical documentation. The Transtage has undergone two separate spectral measurement campaigns to characterize the reflectance spectroscopy of historical aerospace materials. These data have been incorporated into the NASA Spectral Database, with the goal of using telescopic data comparisons for potential material identification. A Light Detection and Ranging (LIDAR) system scan also has been completed and a scale model has been created for use in the Optical Measurement Center (OMC) for photometric analysis of an intact Transtage, including bidirectional reflectance distribution function (BRDF) measurements. An historical overview of the Titan IIIC Transtage, the analysis that has been done to date, and the future work to be completed in support of characterizing the GEO and near GEO orbital debris environment will be discussed in the subsequent presentation.

  9. Regulation of Cre recombinase by ligand-induced complementation of inactive fragments.

    Science.gov (United States)

    Jullien, Nicolas; Sampieri, François; Enjalbert, Alain; Herman, Jean-Paul

    2003-11-01

    Cre recombinase is extensively used to engineer the genome of experimental animals. However, its usefulness is still limited by the lack of an efficient temporal control over its activity. To overcome this, we have developed DiCre, a regulatable fragment complementation system for Cre. The enzyme was split into two moieties that were fused to FKBP12 (FK506-binding protein) and FRB (binding domain of the FKBP12-rapamycin-associated protein), respectively. These can be efficiently heterodimerized by rapamycin. Several variants, based on splitting Cre at different sites and using different linker peptides, were tested in an indicator cell line. The fusion proteins, taken separately, had no recombinase activity. Stable transformants, co-expressing complementing fragments based on splitting Cre between Asn59 and Asn60, displayed low background activity affecting 0.05-0.4% of the cells. Rapamycin induced a rapid recombination, reaching 100% by 48-72 h, with an EC50 of 0.02 nM. Thus, ligand-induced dimerization can efficiently regulate Cre, and should be useful to achieve a tight temporal control of its activity, such as in the case of the creation of conditional knock-out animals.

  10. HpBase: A genome database of a sea urchin, Hemicentrotus pulcherrimus.

    Science.gov (United States)

    Kinjo, Sonoko; Kiyomoto, Masato; Yamamoto, Takashi; Ikeo, Kazuho; Yaguchi, Shunsuke

    2018-04-01

    To understand the mystery of life, it is important to accumulate genomic information for various organisms because the whole genome encodes the commands for all the genes. Since the genome of Strongylocentrotus purpuratus was sequenced in 2006 as the first sequenced genome in echinoderms, the genomic resources of other North American sea urchins have gradually been accumulated, but no sea urchin genomes are available in other areas, where many scientists have used the local species and reported important results. In this manuscript, we report a draft genome of the sea urchin Hemicentrotus pulcherrimus because this species has a long history as the target of developmental and cell biology in East Asia. The genome of H. pulcherrimus was assembled into 16,251 scaffold sequences with an N50 length of 143 kbp, and approximately 25,000 genes were identified in the genome. The size of the genome and the sequencing coverage were estimated to be approximately 800 Mbp and 100×, respectively. To provide these data and information of annotation, we constructed a database, HpBase (http://cell-innovation.nig.ac.jp/Hpul/). In HpBase, gene searches, genome browsing, and blast searches are available. In addition, HpBase includes the "recipes" for experiments from each lab using H. pulcherrimus. These recipes will continue to be updated according to the circumstances of individual scientists and can be powerful tools for experimental biologists and for the community. HpBase is a suitable dataset for evolutionary, developmental, and cell biologists to compare H. pulcherrimus genomic information with that of other species and to isolate gene information. © 2018 Japanese Society of Developmental Biologists.
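
    The N50 length quoted above is a standard assembly summary statistic: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the assembled bases. The sketch below is only an illustration of that definition; the function name and the toy scaffold lengths are hypothetical and are not taken from HpBase.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    total = sum(lengths)
    running = 0
    # Walk from the longest scaffold down until half the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy data (hypothetical lengths, not from the H. pulcherrimus assembly).
toy_scaffolds = [80, 70, 50, 40, 30, 20]
print(n50(toy_scaffolds))
```

    Comparing `2 * running` with `total` keeps the arithmetic in integers, so no rounding issue arises when the total length is odd.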

  11. Quantum mechanical fragment methods based on partitioning atoms or partitioning coordinates.

    Science.gov (United States)

    Wang, Bo; Yang, Ke R; Xu, Xuefei; Isegawa, Miho; Leverentz, Hannah R; Truhlar, Donald G

    2014-09-16

    atoms for capping dangling bonds, and we have shown that they can greatly improve the accuracy. Finally we present a new approach that goes beyond QM/MM by combining the convenience of molecular mechanics with the accuracy of fitting a potential function to electronic structure calculations on a specific system. To make the latter practical for systems with a large number of degrees of freedom, we developed a method to interpolate between local internal-coordinate fits to the potential energy. A key issue for the application to large systems is that rather than assigning the atoms or monomers to fragments, we assign the internal coordinates to reaction, secondary, and tertiary sets. Thus, we make a partition in coordinate space rather than atom space. Fits to the local dependence of the potential energy on tertiary coordinates are arrayed along a preselected reaction coordinate at a sequence of geometries called anchor points; the potential energy function is called an anchor points reactive potential. Electrostatically embedded fragment methods and the anchor points reactive potential, because they are based on treating an entire system by quantum mechanical electronic structure methods but are affordable for large and complex systems, have the potential to open new areas for accurate simulations where combined QM/MM methods are inadequate.

  12. Acidobacteria form a coherent but highly diverse group within the bacterial domain: evidence from environmental genomics

    DEFF Research Database (Denmark)

    Quaiser, Achim; Ochsenreiter, Torsten; Lanz, Christa

    2003-01-01

    fragments differed between 2.3% and 19.9% and were placed into two different subgroups of Acidobacteria (groups III and V). Although partial co-linearity was found between genomic fragments, the gene content around the rRNA operons was generally not conserved. Phylogenetic reconstructions with orthologues......Acidobacteria have been established as a novel phylum of Bacteria that is consistently detected in many different habitats around the globe by 16S rDNA-based molecular surveys. The phylogenetic diversity, ubiquity and abundance of this group, particularly in soil habitats, suggest an important...... palustris and Bradyrhizobium japonicum, including a conserved two-component system. Phylogenetic analysis of the putative response regulator confirmed that this similarity between Rhizobiales and Acidobacteria might be due to a horizontal gene transfer. In total, our data give first insight into the genome...

  13. Microarray-based ultra-high resolution discovery of genomic deletion mutations

    Science.gov (United States)

    2014-01-01

    Background: Oligonucleotide microarray-based comparative genomic hybridization (CGH) offers an attractive possible route for the rapid and cost-effective genome-wide discovery of deletion mutations. CGH typically involves comparison of the hybridization intensities of genomic DNA samples with microarray chip representations of entire genomes, and has widespread potential application in experimental research and medical diagnostics. However, the power to detect small deletions is low. Results: Here we use a graduated series of Arabidopsis thaliana genomic deletion mutations (of sizes ranging from 4 bp to ~5 kb) to optimize CGH-based genomic deletion detection. We show that the power to detect smaller deletions (4, 28 and 104 bp) depends upon oligonucleotide density (essentially the number of genome-representative oligonucleotides on the microarray chip), and determine the oligonucleotide spacings necessary to guarantee detection of deletions of specified size. Conclusions: Our findings will enhance a wide range of research and clinical applications, and in particular will aid in the discovery of genomic deletions in the absence of a priori knowledge of their existence. PMID:24655320

  14. Molecular analysis of Leptospira spp. isolated from humans by restriction fragment length polymorphism, real-time PCR and pulsed-field gel electrophoresis.

    Science.gov (United States)

    Turk, Nenad; Milas, Zoran; Mojcec, Vesna; Ruzic-Sabljic, Eva; Staresina, Vilim; Stritof, Zrinka; Habus, Josipa; Postic, Daniele

    2009-11-01

    A total of 17 Leptospira clinical strains isolated from humans in Croatia were serologically and genetically analysed. For serovar identification, the microscopic agglutination test (MAT) and pulsed-field gel electrophoresis (PFGE) were used. To identify isolates on genomic species level, PCR-based restriction fragment length polymorphism (RFLP) and real-time PCR were performed. MAT revealed the following serogroup affinities: Grippotyphosa (seven isolates), Icterohaemorrhagiae (eight isolates) and Javanica (two isolates). RFLP of PCR products from a 331-bp-long fragment of rrs (16S rRNA gene) digested with endonucleases MnlI and DdeI and real-time PCR revealed three Leptospira genomic species. Grippotyphosa isolates belonged to Leptospira kirschneri, Icterohaemorrhagiae isolates to Leptospira interrogans and Javanica isolates to Leptospira borgpetersenii. Genomic DNA from 17 leptospiral isolates was digested with NotI and SgrAI restriction enzymes and analysed by PFGE. Results showed that seven isolates have the same binding pattern to serovar Grippotyphosa, eight isolates to serovar Icterohaemorrhagiae and two isolates to serovar Poi. Results demonstrate the diversity of leptospires circulating in Croatia. We point out the usefulness of a combination of PFGE, RFLP and real-time PCR as appropriate molecular methods in molecular analysis of leptospires.

  15. A Rapid and Reproducible Genomic DNA Extraction Protocol for Sequence-Based Identification of Archaea, Bacteria, Cyanobacteria, Diatoms, Fungi, and Green Algae

    Directory of Open Access Journals (Sweden)

    Farkhondeh Saba

    2017-01-01

    Background: Sequence-based identification of various microorganisms including Archaea, Bacteria, Cyanobacteria, Diatoms, Fungi, and green algae necessitates an efficient and reproducible genome extraction procedure through which pure template DNA is obtained that can be used in polymerase chain reactions (PCR). Considering the fact that DNA extraction from these microorganisms is time consuming and laborious, we developed and standardized a safe, rapid and inexpensive miniprep protocol. Methods: According to our results, amplification of various genomic regions including SSU, LSU, ITS, β-tubulin, actin, RPB2, and EF-1 resulted in a reproducible and efficient DNA extraction from a wide range of microorganisms yielding adequate pure genomic material for reproducible PCR-amplifications. Results: This method relies on a temporary shock of increased concentrations of detergent which can be applied concomitant with multiple freeze-thaws to yield a sufficient amount of DNA for PCR amplification of multiple or single fragment(s) of the genome. As an advantage, the recipe seems very flexible; thus, various optional steps can be included depending on the samples used. Conclusion: Having the needed flexibility in each step, this protocol is applicable to a very wide range of samples. Hence, various steps can be included depending on the desired quantity and quality.

  16. A Rapid and Reproducible Genomic DNA Extraction Protocol for Sequence-Based Identification of Archaea, Bacteria, Cyanobacteria, Diatoms, Fungi, and Green Algae

    Directory of Open Access Journals (Sweden)

    Farkhondeh Saba

    2016-09-01

    Background: Sequence-based identification of various microorganisms including Archaea, Bacteria, Cyanobacteria, Diatoms, Fungi, and green algae necessitates an efficient and reproducible genome extraction procedure through which pure template DNA is obtained that can be used in polymerase chain reactions (PCR). Considering the fact that DNA extraction from these microorganisms is time consuming and laborious, we developed and standardized a safe, rapid and inexpensive miniprep protocol. Methods: According to our results, amplification of various genomic regions including SSU, LSU, ITS, β-tubulin, actin, RPB2, and EF-1 resulted in a reproducible and efficient DNA extraction from a wide range of microorganisms yielding adequate pure genomic material for reproducible PCR-amplifications. Results: This method relies on a temporary shock of increased concentrations of detergent which can be applied concomitant with multiple freeze-thaws to yield a sufficient amount of DNA for PCR amplification of multiple or single fragment(s) of the genome. As an advantage, the recipe seems very flexible; thus, various optional steps can be included depending on the samples used. Conclusion: Having the needed flexibility in each step, this protocol is applicable to a very wide range of samples. Hence, various steps can be included depending on the desired quantity and quality.

  17. Organization of the mitochondrial genomes of whiteflies, aphids, and psyllids (Hemiptera, Sternorrhyncha)

    Directory of Open Access Journals (Sweden)

    Baumann Paul

    2004-08-01

    Background: With some exceptions, mitochondria within the class Insecta have the same gene content and, generally, a similar gene order, allowing the proposal of an ancestral gene order. The principal exceptions are several orders within the Hemipteroid assemblage, including the order Thysanoptera, a sister group of the order Hemiptera. Within the Hemiptera, a number of completely sequenced mitochondrial genomes are available that have a gene order similar to that of the proposed ancestor. None, however, are available from the suborder Sternorrhyncha, which includes whiteflies, psyllids and aphids. Results: We have determined the complete nucleotide sequence of the mitochondrial genomes of six species of whiteflies, one psyllid and one aphid. Two species of whiteflies, one psyllid and one aphid have mitochondrial genomes with a gene order very similar to that of the proposed insect ancestor. The remaining four species of whiteflies had variations in the gene order. In all cases, there was the excision of a DNA fragment encoding cytochrome oxidase subunit III (COIII)-tRNAgly-NADH dehydrogenase subunit 3 (ND3)-tRNAala-tRNAarg-tRNAasn from the ancestral position between the genes for ATP synthase subunit 6 and NADH dehydrogenase subunit 5. Based on the position in which all or part of this fragment was inserted, the mitochondria could be subdivided into four different gene arrangement types. PCR amplification spanning from COIII to genes outside the inserted region, and sequence determination of the resulting fragments, indicated that different whitefly species could be placed into one of these arrangement types. A phylogenetic analysis of 19 whitefly species based on genes for mitochondrial cytochrome b, NADH dehydrogenase subunit 1, and 16S ribosomal DNA as well as cospeciating endosymbiont 16S and 23S ribosomal DNA indicated a clustering of species that corresponded to the gene arrangement types.
Conclusions: In whiteflies, the region of the

  18. A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.

    Science.gov (United States)

    Luo, Li; Zhu, Yun; Xiong, Momiao

    2012-06-01

    Genome-wide association studies (GWAS) designed for next-generation sequencing data involve testing association of genomic variants, including common, low frequency, and rare variants. The current strategies for association studies are well developed for identifying association of common variants with common diseases, but may be ill-suited when large amounts of allelic heterogeneity are present in sequence data. Recently, group tests that analyze collective frequency differences between cases and controls have shifted the current variant-by-variant analysis paradigm for GWAS of common variants to the collective test of multiple variants in the association analysis of rare variants. However, group tests ignore differences in genetic effects among SNPs at different genomic locations. As an alternative to group tests, we developed a novel genome-information content-based statistic for testing association of the entire allele frequency spectrum of genomic variation with diseases. To evaluate the performance of the proposed statistic, we use large-scale simulations based on whole genome low coverage pilot data in the 1000 Genomes Project to calculate the type 1 error rates and power of seven alternative statistics: a genome-information content-based statistic, the generalized T(2), collapsing method, multivariate and collapsing (CMC) method, individual χ(2) test, weighted-sum statistic, and variable threshold statistic. Finally, we apply the seven statistics to a published resequencing dataset from the ANGPTL3, ANGPTL4, ANGPTL5, and ANGPTL6 genes in the Dallas Heart Study. We report that the genome-information content-based statistic has significantly improved type 1 error rates and higher power than the other six statistics in both simulated and empirical datasets.

  19. Enrichment of true positives from structural alerts through the use of novel atomic fragment based descriptors

    DEFF Research Database (Denmark)

    Long, A.; Rydberg, Patrik

    2013-01-01

    To enhance the discrimination rate for methods applying structural alerts and biotransformation rules in the prediction of toxicity and drug metabolism we have developed a set of novel fragment based atomic descriptors. These atomic descriptors encode the properties of the fragments separating an...

  20. Prokaryotic Phylogenies Inferred from Whole-Genome Sequence and Annotation Data

    Directory of Open Access Journals (Sweden)

    Wei Du

    2013-01-01

    Phylogenetic trees are used to represent the evolutionary relationship among various groups of species. In this paper, a novel method for inferring prokaryotic phylogenies using multiple genomic information is proposed. The method is called CGCPhy and is based on the distance matrix of orthologous gene clusters between whole-genome pairs. CGCPhy comprises four main steps. First, orthologous genes are determined by sequence similarity, genomic function, and genomic structure information. Second, genes involving potential HGT events are eliminated, since such genes are considered to be the highly conserved genes across different species and the genes located on fragments with abnormal genome barcode. Third, we calculate the distance of the orthologous gene clusters between each genome pair in terms of the number of orthologous genes in conserved clusters. Finally, the neighbor-joining method is employed to construct phylogenetic trees across different species. CGCPhy has been examined on different datasets from 617 complete single-chromosome prokaryotic genomes and achieved applicative accuracies on different species sets in agreement with Bergey's taxonomy in quartet topologies. Simulation results show that CGCPhy achieves high average accuracy and has a low standard deviation on different datasets, so it has an applicative potential for phylogenetic analysis.

  1. A Saccharomyces cerevisiae mitochondrial DNA fragment activates Reg1p-dependent glucose-repressible transcription in the nucleus.

    Science.gov (United States)

    Santangelo, G M; Tornow, J

    1997-12-01

    As part of an effort to identify random carbon-source-regulated promoters in the Saccharomyces cerevisiae genome, we discovered that a mitochondrial DNA fragment is capable of directing glucose-repressible expression of a reporter gene. This fragment (CR24) originated from the mitochondrial genome adjacent to a transcription initiation site. Mutational analyses identified a GC cluster within the fragment that is required for transcriptional induction. Repression of nuclear CR24-driven transcription required Reg1p, indicating that this mitochondrially derived promoter is a member of a large group of glucose-repressible nuclear promoters that are similarly regulated by Reg1p. In vivo and in vitro binding assays indicated the presence of factors, located within the nucleus and the mitochondria, that bind to the GC cluster. One or more of these factors may provide a regulatory link between the nucleus and mitochondria.

  2. Changing Histopathological Diagnostics by Genome-Based Tumor Classification

    Directory of Open Access Journals (Sweden)

    Michael Kloth

    2014-05-01

    Traditionally, tumors are classified by histopathological criteria, i.e., based on their specific morphological appearances. Consequently, current therapeutic decisions in oncology are strongly influenced by histology rather than underlying molecular or genomic aberrations. The increase of information on molecular changes, however, enabled by the Human Genome Project and the International Cancer Genome Consortium as well as the manifold advances in molecular biology and high-throughput sequencing techniques, inaugurated the integration of genomic information into disease classification. Furthermore, in some cases it became evident that former classifications needed major revision and adaption. Such adaptations are often required by understanding the pathogenesis of a disease from a specific molecular alteration, using this molecular driver for targeted and highly effective therapies. Altogether, reclassifications should lead to higher information content of the underlying diagnoses, reflecting their molecular pathogenesis and resulting in optimized and individual therapeutic decisions. The objective of this article is to summarize some particularly important examples of genome-based classification approaches and associated therapeutic concepts. In addition to reviewing disease specific markers, we focus on potentially therapeutic or predictive markers and the relevance of molecular diagnostics in disease monitoring.

  3. Photon-hadron fragmentation: theoretical situation

    International Nuclear Information System (INIS)

    Peschanski, R.

    1983-07-01

    Using a selection of new experimental results, models of hadronic fragmentation and their phenomenological comparison are presented. Indeed, a convenient theory of hadronic fragmentation (for instance, one based on Q.C.D.) does not exist: low transverse momentum fragmentation involves the badly known hadronic long-range forces. Models should clarify the situation in the prospect of an eventual future theory.

  4. DNABIT Compress – Genome compression algorithm

    Science.gov (United States)

    Rajarajeswari, Pothuraju; Apparao, Allam

    2011-01-01

    Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, “DNABIT Compress”, for DNA sequences based on a novel algorithm of assigning binary bits for smaller segments of DNA bases to compress both repetitive and non-repetitive DNA sequences. Our proposed algorithm achieves the best compression ratio for DNA sequences for larger genomes. Significantly better compression results show that the “DNABIT Compress” algorithm is the best among the remaining compression algorithms. While achieving the best compression ratios for DNA sequences (genomes), our new DNABIT Compress algorithm significantly improves the running time of all previous DNA compression programs. Assigning binary bits (Unique BIT CODE) for (Exact Repeats, Reverse Repeats) fragments of DNA sequence is also a unique concept introduced in this algorithm for the first time in DNA compression. This proposed new algorithm could achieve the best compression ratio as much as 1.58 bits/base where the existing best methods could not achieve a ratio less than 1.72 bits/base. PMID:21383923
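
    For orientation, the figures quoted above can be read against the naive fixed-width baseline of 2 bits per base, since four symbols need two bits each. The sketch below shows only that baseline. It is not the DNABIT Compress algorithm, and the names `pack` and `unpack` and the A/C/G/T code assignment are assumptions made for illustration.

```python
# Hypothetical fixed 2-bit code; any one-to-one assignment would do.
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
BASES = "ACGT"


def pack(seq):
    """Pack an A/C/G/T string into bytes, four bases per byte."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODE[base]
        byte <<= 2 * (4 - len(chunk))  # left-align a short final chunk
        out.append(byte)
    return bytes(out)


def unpack(data, n):
    """Recover the first n bases from bytes produced by pack()."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BASES[(byte >> shift) & 3])
    return "".join(bases[:n])


seq = "ACGTTGCAAC"
packed = pack(seq)
print(len(packed), unpack(packed, len(seq)) == seq)
```

    The sequence length has to be stored separately because the last byte may be padded. Any scheme reporting fewer than 2 bits per base must exploit redundancy such as repeats, which this baseline ignores.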

  5. Climate-driven range shifts of the king penguin in a fragmented ecosystem

    Science.gov (United States)

    Cristofari, Robin; Liu, Xiaoming; Bonadonna, Francesco; Cherel, Yves; Pistorius, Pierre; Le Maho, Yvon; Raybaud, Virginie; Stenseth, Nils Christian; Le Bohec, Céline; Trucchi, Emiliano

    2018-03-01

    Range shift is the primary short-term species response to rapid climate change, but it is often hampered by natural or anthropogenic habitat fragmentation. Different critical areas of a species' niche may be exposed to heterogeneous environmental changes and modelling species response under such complex spatial and ecological scenarios presents well-known challenges. Here, we use a biophysical ecological niche model validated through population genomics and palaeodemography to reconstruct past range shifts and identify future vulnerable areas and potential refugia of the king penguin in the Southern Ocean. Integrating genomic and demographic data at the whole-species level with specific biophysical constraints, we present a refined framework for predicting the effect of climate change on species relying on spatially and ecologically distinct areas to complete their life cycle (for example, migratory animals, marine pelagic organisms and central-place foragers) and, in general, on species living in fragmented ecosystems.

  6. A photoactivatable Cre-loxP recombination system for optogenetic genome engineering.

    Science.gov (United States)

    Kawano, Fuun; Okazaki, Risako; Yazawa, Masayuki; Sato, Moritoshi

    2016-12-01

    Genome engineering techniques represented by the Cre-loxP recombination system have been used extensively for biomedical research. However, powerful and useful techniques for genome engineering that have high spatiotemporal precision remain elusive. Here we develop a highly efficient photoactivatable Cre recombinase (PA-Cre) to optogenetically control genome engineering in vivo. PA-Cre is based on the reassembly of split Cre fragments by light-inducible dimerization of the Magnet system. PA-Cre enables sharp induction (up to 320-fold) of DNA recombination and is efficiently activated even by low-intensity illumination (∼0.04 W m⁻²) or short periods of pulsed illumination (∼30 s). We demonstrate that PA-Cre allows for efficient DNA recombination in an internal organ of living mice through noninvasive external illumination using a LED light source. The present PA-Cre provides a powerful tool to greatly facilitate optogenetic genome engineering in vivo.

  7. New Genome Similarity Measures based on Conserved Gene Adjacencies.

    Science.gov (United States)

    Doerr, Daniel; Kowada, Luis Antonio B; Araujo, Eloi; Deshpande, Shachi; Dantas, Simone; Moret, Bernard M E; Stoye, Jens

    2017-06-01

    Many important questions in molecular biology, evolution, and biomedicine can be addressed by comparative genomic approaches. One of the basic tasks when comparing genomes is the definition of measures of similarity (or dissimilarity) between two genomes, for example, to elucidate the phylogenetic relationships between species. The power of different genome comparison methods varies with the underlying formal model of a genome. The simplest models impose the strong restriction that each genome under study must contain the same genes, each in exactly one copy. More realistic models allow several copies of a gene in a genome. One speaks of gene families, and comparative genomic methods that allow this kind of input are called gene family-based. The most powerful-but also most complex-models avoid this preprocessing of the input data and instead integrate the family assignment within the comparative analysis. Such methods are called gene family-free. In this article, we study an intermediate approach between family-based and family-free genomic similarity measures. Introducing this simpler model, called gene connections, we focus on the combinatorial aspects of gene family-free genome comparison. While in most cases, the computational costs to the general family-free case are the same, we also find an instance where the gene connections model has lower complexity. Within the gene connections model, we define three variants of genomic similarity measures that have different expression powers. We give polynomial-time algorithms for two of them, while we show NP-hardness for the third, most powerful one. We also generalize the measures and algorithms to make them more robust against recent local disruptions in gene order. Our theoretical findings are supported by experimental results, proving the applicability and performance of our newly defined similarity measures.
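
    As a point of reference for the simplest model mentioned above, in which every genome contains the same genes, each in exactly one copy, a basic similarity measure is the number of conserved gene adjacencies. The sketch below illustrates only that baseline for unsigned linear gene orders. It is not the gene connections model of the article, and the function names and toy gene orders are hypothetical.

```python
def adjacencies(genome):
    """Return the set of unordered neighbouring gene pairs in a linear gene order."""
    return {frozenset(pair) for pair in zip(genome, genome[1:])}


def conserved_adjacencies(genome_a, genome_b):
    """Count the adjacencies present in both gene orders."""
    return len(adjacencies(genome_a) & adjacencies(genome_b))


# Toy gene orders over the same five single-copy genes (hypothetical data).
g1 = ["a", "b", "c", "d", "e"]
g2 = ["a", "b", "d", "c", "e"]
print(conserved_adjacencies(g1, g2))
```

    Using `frozenset` for each pair treats an adjacency as unordered, so a local inversion such as `c d` becoming `d c` still counts as conserved; gene orientation and duplicated genes are deliberately left out of this sketch.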

  8. Separating metagenomic short reads into genomes via clustering

    Directory of Open Access Journals (Sweden)

    Tanaseichuk Olga

    2012-09-01

    Background. The metagenomics approach allows the simultaneous sequencing of all genomes in an environmental sample. This results in high complexity datasets, where in addition to repeats and sequencing errors, the number of genomes and their abundance ratios are unknown. Recently developed next-generation sequencing (NGS) technologies significantly improve the sequencing efficiency and cost. On the other hand, they result in shorter reads, which makes the separation of reads from different species harder. Among the existing computational tools for metagenomic analysis, there are similarity-based methods that use reference databases to align reads and composition-based methods that use composition patterns (i.e., frequencies of short words or l-mers) to cluster reads. Similarity-based methods are unable to classify reads from unknown species without close references (which constitute the majority of reads). Since composition patterns are preserved only in significantly large fragments, composition-based tools cannot be used for very short reads, which becomes a significant limitation with the development of NGS. A recently proposed algorithm, AbundanceBin, introduced another method that bins reads based on predicted abundances of the genomes sequenced. However, it does not separate reads from genomes of similar abundance levels. Results. In this work, we present a two-phase heuristic algorithm for separating short paired-end reads from different genomes in a metagenomic dataset. We use the observation that most of the l-mers belong to unique genomes when l is sufficiently large. The first phase of the algorithm results in clusters of l-mers each of which belongs to one genome. During the second phase, clusters are merged based on l-mer repeat information. These final clusters are used to assign reads. The algorithm can handle very short reads and sequencing errors. It is initially designed for genomes with similar abundance levels and then
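    The observation that long l-mers are mostly unique to one genome can be illustrated with a toy grouping of reads by shared l-mers. The sketch below is an assumption-laden illustration, not the authors' two-phase algorithm: the function names, the union-find grouping, and the example reads are all hypothetical, and sequencing errors, paired ends, and repeat-based merging are ignored.

```python
from collections import defaultdict


def lmers(read, l):
    """Return the set of all substrings of length l in a read."""
    return {read[i:i + l] for i in range(len(read) - l + 1)}


def cluster_reads_by_shared_lmers(reads, l):
    """Group reads linked, directly or transitively, by a shared l-mer.

    Hypothetical helper: uses a simple union-find over read indices.
    """
    parent = list(range(len(reads)))

    def find(x):
        # Follow parent links to the root, compressing the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    first_seen = {}  # l-mer -> index of the first read that contained it
    for idx, read in enumerate(reads):
        for mer in lmers(read, l):
            if mer in first_seen:
                parent[find(idx)] = find(first_seen[mer])
            else:
                first_seen[mer] = idx

    clusters = defaultdict(list)
    for idx in range(len(reads)):
        clusters[find(idx)].append(idx)
    return sorted(clusters.values())


# Made-up reads: the first two overlap, as do the last two.
example_reads = ["ACGTACGTAA", "GTACGTAACC", "TTTTGGGGCC", "TGGGGCCAAT"]
example_clusters = cluster_reads_by_shared_lmers(example_reads, 6)
```

    With l chosen too small, unrelated reads would share l-mers by chance and collapse into one cluster, which is why the abstract stresses a sufficiently large l.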

  9. An efficient and high fidelity method for amplification, cloning and sequencing of complete tospovirus genomic RNA segments

    Science.gov (United States)

    Amplification and sequencing of the complete M- and S-RNA segments of Tomato spotted wilt virus and Impatiens necrotic spot virus as a single fragment is useful for whole genome sequencing of tospoviruses co-infecting a single host plant. It avoids issues associated with overlapping amplicon-based ...

  10. A simple and inexpensive method for genomic restriction mapping analysis

    International Nuclear Information System (INIS)

    Huang, C.H.; Lam, V.M.S.; Tam, J.W.O.

    1988-01-01

    The Southern blotting procedure for the transfer of DNA fragments from agarose gels to nitrocellulose membranes has revolutionized nucleic acid detection methods, and it forms the cornerstone of research in molecular biology. Basically, the method involves the denaturation of DNA fragments that have been separated on an agarose gel, the immobilization of the fragments by transfer to a nitrocellulose membrane, and the identification of the fragments of interest through hybridization to ³²P-labeled probes and autoradiography. While the method is sensitive and applicable to both genomic and cloned DNA, it suffers from the disadvantages of being time consuming and expensive, and fragments of greater than 15 kb are difficult to transfer. Moreover, although theoretically the nitrocellulose membrane can be washed and hybridized repeatedly using different probes, in practice, the membrane becomes brittle and difficult to handle after a few cycles. A direct hybridization method for pure DNA clones was developed in 1975 but has not been widely exploited. The authors report here a modification of their procedure as applied to genomic DNA. The method is simple, rapid, and inexpensive, and it does not involve transfer to nitrocellulose membranes.

  11. Genomic prediction in families of perennial ryegrass based on genotyping-by-sequencing

    DEFF Research Database (Denmark)

    Ashraf, Bilal

    In this thesis we investigate the potential for genomic prediction in perennial ryegrass using genotyping-by-sequencing (GBS) data. An association method based on family-based breeding systems was developed, and genomic heritabilities, genomic prediction accuracies and effects of some key factors wer...... explored. Results show that low sequencing depth caused underestimation of allele substitution effects in GWAS and overestimation of genomic heritability in prediction studies. Other factors such as SNP marker density, population structure and size of training population influenced accuracy of genomic...... prediction. Overall, GBS allows for genomic prediction in breeding families of perennial ryegrass and holds good potential to expedite genetic gain and encourage the application of genomic prediction...

  12. Fragment-Based Drug Discovery in the Bromodomain and Extra-Terminal Domain Family.

    Science.gov (United States)

    Radwan, Mostafa; Serya, Rabah

    2017-08-01

    Bromodomain and extra-terminal domain (BET) inhibition has emerged recently as a potential therapeutic target for the treatment of many human disorders such as atherosclerosis, inflammatory disorders, chronic obstructive pulmonary disease (COPD), some viral infections, and cancer. Since the discovery of the two potent inhibitors, I-BET762 and JQ1, different research groups have used different techniques to develop novel potent and selective inhibitors. In this review, we will be concerned with the trials that used fragment-based drug discovery (FBDD) approaches to discover or optimize BET inhibitors, also showing fragments that can be further optimized in future projects to reach novel potent BET inhibitors. © 2017 Deutsche Pharmazeutische Gesellschaft.

  13. Keeping it complicated: Mitochondrial genome plasticity across diplonemids.

    Science.gov (United States)

    Valach, Matus; Moreira, Sandrine; Hoffmann, Steve; Stadler, Peter F; Burger, Gertraud

    2017-10-26

    Chromosome rearrangements are important drivers in genome and gene evolution, with implications ranging from speciation to development to disease. In the flagellate Diplonema papillatum (Euglenozoa), mitochondrial genome rearrangements have resulted in nearly a hundred chromosomes and a systematic dispersal of gene fragments across the multipartite genome. Maturation into functional RNAs involves separate transcription of gene pieces, joining of precursor RNAs via trans-splicing, and RNA editing by substitution and uridine additions, both reconstituting crucial coding sequence. How widespread these unusual features are across diplonemids is unclear. We have analyzed the mitochondrial genomes and transcriptomes of four species from the Diplonema/Rhynchopus clade, revealing a considerable genomic plasticity. Although gene breakpoints, and thus the total number of gene pieces (~80), are essentially conserved across this group, the number of distinct chromosomes varies by a factor of two, with certain chromosomes combining up to eight unrelated gene fragments. Several internal protein-coding gene pieces overlap substantially, resulting, for example, in a stretch of 22 identical amino acids in cytochrome c oxidase subunit 1 and NADH dehydrogenase subunit 5. Finally, the variation of post-transcriptional editing patterns across diplonemids indicates compensation of two adverse trends: rapid sequence evolution and loss of genetic information through unequal chromosome segregation.

  14. Predicting "Hot" and "Warm" Spots for Fragment Binding.

    Science.gov (United States)

    Rathi, Prakash Chandra; Ludlow, R Frederick; Hall, Richard J; Murray, Christopher W; Mortenson, Paul N; Verdonk, Marcel L

    2017-05-11

    Computational fragment mapping methods aim to predict hotspots on protein surfaces where small fragments will bind. Such methods are popular for druggability assessment as well as structure-based design. However, to date researchers developing or using such tools have had no clear way of assessing the performance of these methods. Here, we introduce the first diverse, high quality validation set for computational fragment mapping. The set contains 52 diverse examples of fragment binding "hot" and "warm" spots from the Protein Data Bank (PDB). Additionally, we describe PLImap, a novel protocol for fragment mapping based on the Protein-Ligand Informatics force field (PLIff). We evaluate PLImap against the new fragment mapping test set, and compare its performance to that of simple shape-based algorithms and fragment docking using GOLD. PLImap is made publicly available from https://bitbucket.org/AstexUK/pli .

  15. A Role for Fragment-Based Drug Design in Developing Novel Lead Compounds for Central Nervous System Targets.

    Science.gov (United States)

    Wasko, Michael J; Pellegrene, Kendy A; Madura, Jeffry D; Surratt, Christopher K

    2015-01-01

    Hundreds of millions of U.S. dollars are invested in the research and development of a single drug. Lead compound development is an area ripe for new design strategies. Therapeutic lead candidates have been traditionally found using high-throughput in vitro pharmacological screening, a costly method for assaying thousands of compounds. This approach has recently been augmented by virtual screening (VS), which employs computer models of the target protein to narrow the search for possible leads. A variant of VS is fragment-based drug design (FBDD), an emerging in silico lead discovery method that introduces low-molecular weight fragments, rather than intact compounds, into the binding pocket of the receptor model. These fragments serve as starting points for "growing" the lead candidate. Current efforts in virtual FBDD within central nervous system (CNS) targets are reviewed, as is a recent rule-based optimization strategy in which new molecules are generated within a 3D receptor-binding pocket using the fragment as a scaffold. This process not only places special emphasis on creating synthesizable molecules but also exposes computational questions worth addressing. Fragment-based methods provide a viable, relatively low-cost alternative for therapeutic lead discovery and optimization that can be applied to CNS targets to augment current design strategies.

  16. Genomic applications in forensic medicine

    DEFF Research Database (Denmark)

    Børsting, Claus; Morling, Niels

    2016-01-01

    Since the 1980s, advances in DNA technology have revolutionized the scope and practice of forensic medicine. From the days of restriction fragment length polymorphisms (RFLPs) to short tandem repeats (STRs), the current focus is on the next generation genome sequencing. It has been almost a decad...

  17. Using DNase Hi-C techniques to map global and local three-dimensional genome architecture at high resolution.

    Science.gov (United States)

    Ma, Wenxiu; Ay, Ferhat; Lee, Choli; Gulsoy, Gunhan; Deng, Xinxian; Cook, Savannah; Hesson, Jennifer; Cavanaugh, Christopher; Ware, Carol B; Krumm, Anton; Shendure, Jay; Blau, C Anthony; Disteche, Christine M; Noble, William S; Duan, ZhiJun

    2018-06-01

    The folding and three-dimensional (3D) organization of chromatin in the nucleus critically impacts genome function. The past decade has witnessed rapid advances in genomic tools for delineating 3D genome architecture. Among them, chromosome conformation capture (3C)-based methods such as Hi-C are the most widely used techniques for mapping chromatin interactions. However, traditional Hi-C protocols rely on restriction enzymes (REs) to fragment chromatin and are therefore limited in resolution. We recently developed DNase Hi-C for mapping 3D genome organization, which uses DNase I for chromatin fragmentation. DNase Hi-C overcomes RE-related limitations associated with traditional Hi-C methods, leading to improved methodological resolution. Furthermore, combining this method with DNA capture technology provides a high-throughput approach (targeted DNase Hi-C) that allows for mapping fine-scale chromatin architecture at exceptionally high resolution. Hence, targeted DNase Hi-C will be valuable for delineating the physical landscapes of cis-regulatory networks that control gene expression and for characterizing phenotype-associated chromatin 3D signatures. Here, we provide a detailed description of method design and step-by-step working protocols for these two methods. Copyright © 2018 Elsevier Inc. All rights reserved.

  18. Draft genome sequence of ramie, Boehmeria nivea (L.) Gaudich.

    Science.gov (United States)

    Luan, Ming-Bao; Jian, Jian-Bo; Chen, Ping; Chen, Jun-Hui; Chen, Jian-Hua; Gao, Qiang; Gao, Gang; Zhou, Ju-Hong; Chen, Kun-Mei; Guang, Xuan-Min; Chen, Ji-Kang; Zhang, Qian-Qian; Wang, Xiao-Fei; Fang, Long; Sun, Zhi-Min; Bai, Ming-Zhou; Fang, Xiao-Dong; Zhao, Shan-Cen; Xiong, He-Ping; Yu, Chun-Ming; Zhu, Ai-Guo

    2018-05-01

    Ramie, Boehmeria nivea (L.) Gaudich, family Urticaceae, is a plant native to eastern Asia, and one of the world's oldest fibre crops. It is also used as animal feed and for the phytoremediation of heavy metal-contaminated farmlands. Thus, the genome sequence of ramie was determined to explore the molecular basis of its fibre quality, protein content and phytoremediation. To further understand the ramie genome, different paired-end and mate-pair libraries were combined to generate 134.31 Gb of raw DNA sequences using the Illumina whole-genome shotgun sequencing approach. The highly heterozygous B. nivea genome was assembled using the Platanus Genome Assembler, which is an effective tool for the assembly of highly heterozygous genome sequences. The final length of the draft genome of this species was approximately 341.9 Mb (contig N50 = 22.62 kb, scaffold N50 = 1,126.36 kb). Based on ramie genome annotations, 30,237 protein-coding genes were predicted, and the repetitive element content was 46.3%. The completeness of the final assembly was evaluated by benchmarking universal single-copy orthologous genes (BUSCO); 90.5% of the 1,440 expected embryophytic genes were identified as complete, and 4.9% were identified as fragmented. Phylogenetic analysis based on single-copy gene families and one-to-one orthologous genes placed ramie with mulberry and cannabis, within the clade of urticalean rosids. Genome information of ramie will be a valuable resource for the conservation of endangered Boehmeria species and for future studies on the biogeography and characteristic evolution of members of Urticaceae. © 2018 John Wiley & Sons Ltd.
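    The contig and scaffold N50 values quoted above follow the usual definition: the length L such that sequences of length at least L together cover at least half of the assembly. A minimal sketch of that calculation is given below; the function name and the example lengths are illustrative assumptions and are unrelated to the ramie data.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    The N50 is the length of the sequence at which the running total,
    taken from longest to shortest, first reaches half the assembly size.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("lengths must be non-empty")


# Made-up contig lengths (in kb) purely for illustration.
toy_contigs = [2, 3, 4, 5, 6, 7, 8, 9, 10]
toy_n50 = n50(toy_contigs)
```

    Because N50 depends only on the length distribution, it says nothing about correctness of the assembly, which is why the abstract also reports a BUSCO completeness estimate.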

  19. Mass distribution of fission fragments using SSNTDs based image analysis system

    International Nuclear Information System (INIS)

    Kolekar, R.V.; Sharma, D.N.

    2006-01-01

    A Lexan polycarbonate track detector was used to obtain the mass distribution of fission fragments from a 252Cf planchette source. Normally, if the fission fragments are incident perpendicular to the Lexan surface, the track diameter of the heavy fragment is greater than that of the lighter fragment. In practical problems, fission fragments are incident on the detector at all angles. So, in the present experiment, the Lexan detector was exposed to the 252Cf planchette source in 2π geometry, with fission fragments incident on the detector at various angles. The projected fission track length for fragments of the same energy therefore differs with the angle of incidence. Image analysis software was used to measure the projected track length. However, for fission fragments with a greater angle of incidence, the entire track length is not in focus at the surface, so a reduced track length is measured. This problem was solved by taking two images, one at the surface and one at the tip of the track, and then overlapping the two images using the image analysis software. The projected track length and the depth of the track were used to obtain the angle of incidence. Fission track lengths were measured for the same angle of incidence. In all, 500 track lengths were measured, and a plot of the mass distribution of the fission fragments was obtained. (author)
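    The geometric step, recovering the angle of incidence from the projected track length and the track depth, can be sketched as a right-triangle calculation. This is an assumption about the geometry rather than the authors' stated procedure: the track is treated as a straight line, the angle is taken relative to the detector surface (so a steeper track has a larger angle and a larger depth), and the function name and units are hypothetical.

```python
import math


def track_geometry(projected_length, depth):
    """Return (true_track_length, angle_from_surface_in_degrees).

    Assumes a straight track whose projection on the detector surface has
    length `projected_length` and whose tip lies `depth` below the surface.
    """
    true_length = math.hypot(projected_length, depth)
    angle_from_surface = math.degrees(math.atan2(depth, projected_length))
    return true_length, angle_from_surface


# Illustrative numbers only (arbitrary length units).
length, angle = track_geometry(3.0, 4.0)
```

    If the angle is instead wanted relative to the surface normal, it is the complement, 90 degrees minus the value returned here.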

  20. Methylation-sensitive amplified polymorphism-based genome-wide analysis of cytosine methylation profiles in Nicotiana tabacum cultivars.

    Science.gov (United States)

    Jiao, J; Wu, J; Lv, Z; Sun, C; Gao, L; Yan, X; Cui, L; Tang, Z; Yan, B; Jia, Y

    2015-11-26

    This study aimed to investigate cytosine methylation profiles in different tobacco (Nicotiana tabacum) cultivars grown in China. Methylation-sensitive amplified polymorphism was used to analyze genome-wide global methylation profiles in four tobacco cultivars (Yunyan 85, NC89, K326, and Yunyan 87). Amplicons with methylated C motifs were cloned by reamplified polymerase chain reaction, sequenced, and analyzed. The results show that geographical location had a greater effect on methylation patterns in the tobacco genome than did sampling time. Analysis of the CG dinucleotide distribution in methylation-sensitive polymorphic restriction fragments suggested that a CpG dinucleotide cluster-enriched area is a possible site of cytosine methylation in the tobacco genome. The sequence alignments of the Nia1 gene (that encodes nitrate reductase) in Yunyan 87 in different regions indicate that a C-T transition might be responsible for the tobacco phenotype. T-C nucleotide replacement might also be responsible for the tobacco phenotype and may be influenced by geographical location.

  1. Binning of shallowly sampled metagenomic sequence fragments reveals that low abundance bacteria play important roles in sulfur cycling and degradation of complex organic polymers in an acid mine drainage community

    Science.gov (United States)

    Dick, G. J.; Andersson, A.; Banfield, J. F.

    2007-12-01

    Our understanding of environmental microbiology has been greatly enhanced by community genome sequencing of DNA recovered directly from the environment. Community genomics provides insights into the diversity, community structure, metabolic function, and evolution of natural populations of uncultivated microbes, thereby revealing dynamics of how microorganisms interact with each other and their environment. Recent studies have demonstrated the potential for reconstructing near-complete genomes from natural environments while highlighting the challenges of analyzing community genomic sequence, especially from diverse environments. A major challenge of shotgun community genome sequencing is identification of DNA fragments from minor community members for which only low coverage of genomic sequence is present. We analyzed community genome sequence retrieved from biofilms in an acid mine drainage (AMD) system in the Richmond Mine at Iron Mountain, CA, with an emphasis on identification and assembly of DNA fragments from low-abundance community members. The Richmond mine hosts an extensive, relatively low diversity subterranean chemolithoautotrophic community that is sustained entirely by oxidative dissolution of pyrite. The activity of these microorganisms greatly accelerates the generation of AMD. Previous and ongoing work in our laboratory has focused on reconstructing genomes of dominant community members, including several bacteria and archaea. We binned contigs from several samples (including one new sample and two that had been previously analyzed) by tetranucleotide frequency with clustering by Self-Organizing Maps (SOM). The binning, evaluated by comparison with information from the manually curated assembly of the dominant organisms, was found to be very effective: fragments were correctly assigned with 95% accuracy. Improperly assigned fragments often contained sequences that are either evolutionarily constrained (e.g. 16S rRNA genes) or mobile elements that are
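    The feature used for binning here, tetranucleotide frequency, is a 256-element vector per contig. The sketch below shows one plain way to compute it; it is an illustration under stated assumptions, not the authors' pipeline. The function name is hypothetical, windows containing ambiguous bases are simply skipped, reverse complements are not merged, and the SOM clustering step is not shown.

```python
from collections import Counter
from itertools import product

# All 256 tetranucleotides in a fixed (lexicographic) order.
KMERS = ["".join(p) for p in product("ACGT", repeat=4)]


def tetranucleotide_frequencies(sequence):
    """Return a 256-element list of tetranucleotide frequencies.

    Overlapping windows are counted; windows containing characters other
    than A, C, G, T are ignored. An all-zero vector is returned when no
    valid window exists.
    """
    seq = sequence.upper()
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    total = sum(counts[k] for k in KMERS)
    if total == 0:
        return [0.0] * len(KMERS)
    return [counts[k] / total for k in KMERS]


# Made-up contig for illustration.
toy_vector = tetranucleotide_frequencies("ACGTACGT")
```

    Vectors like this, one per contig, are what a clustering method such as a Self-Organizing Map would then group into bins.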

  2. Diffusion mediated coagulation and fragmentation based study of domain formation in lipid bilayer membrane

    Energy Technology Data Exchange (ETDEWEB)

    Rao, Laxminarsimha V., E-mail: laxman@iitk.ac.in [Mechanics and Applied Mathematics Group, Department of Mechanical Engineering, Indian Institute of Technology Kanpur, Kanpur 208016 (India); Roy, Subhradeep [Department of Biomedical Engineering and Mechanics (MC 0219), Virginia Tech, 495 Old Turner Street, Blacksburg, VA 24061 (United States); Das, Sovan Lal [Mechanics and Applied Mathematics Group, Department of Mechanical Engineering, Indian Institute of Technology Kanpur, Kanpur 208016 (India)

    2017-01-15

    We estimate the equilibrium size distribution of cholesterol rich micro-domains on a lipid bilayer by solving Smoluchowski equation for coagulation and fragmentation. Towards this aim, we first derive the coagulation kernels based on the diffusion behaviour of domains moving in a two dimensional membrane sheet, as this represents the reality better. We incorporate three different diffusion scenarios of domain diffusion into our coagulation kernel. Subsequently, we investigate the influence of the parameters in our model on the coagulation and fragmentation behaviour. The observed behaviours of the coagulation and fragmentation kernels are also manifested in the equilibrium domain size distribution and its first moment. Finally, considering the liquid domains diffusing in a supported lipid bilayer, we fit the equilibrium domain size distribution to a benchmark solution.
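    The abstract does not reproduce the governing equation. For orientation, the discrete coagulation-fragmentation (Smoluchowski) equation is commonly written as below; the notation is an assumption chosen here, not taken from the paper: n_k is the number density of domains of size k, K_{ij} the coagulation kernel for domains of sizes i and j, and F_{ij} the rate at which a domain of size i+j splits into pieces of sizes i and j.

```latex
\frac{\mathrm{d}n_k}{\mathrm{d}t}
  = \frac{1}{2}\sum_{i+j=k}\left( K_{ij}\, n_i n_j - F_{ij}\, n_k \right)
  - \sum_{j \ge 1}\left( K_{kj}\, n_k n_j - F_{kj}\, n_{k+j} \right),
\qquad
M_1 = \sum_{k \ge 1} k\, n_k .
```

    The equilibrium size distribution corresponds to setting the left-hand side to zero for every k, and M_1 is the first moment referred to in the abstract.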

  3. Development and assessment of microarray-based DNA fingerprinting in Eucalyptus grandis.

    Science.gov (United States)

    Lezar, Sabine; Myburg, A A; Berger, D K; Wingfield, M J; Wingfield, B D

    2004-11-01

    Development of improved Eucalyptus genotypes involves the routine identification of breeding stock and superior clones. Currently, microsatellites and random amplified polymorphic DNA markers are the most widely used DNA-based techniques for fingerprinting of these trees. While these techniques have provided rapid and powerful fingerprinting assays, they are constrained by their reliance on gel or capillary electrophoresis, and therefore, relatively low throughput of fragment analysis. In contrast, recently developed microarray technology holds the promise of parallel analysis of thousands of markers in plant genomes. The aim of this study was to develop a DNA fingerprinting chip for Eucalyptus grandis and to investigate its usefulness for fingerprinting of eucalypt trees. A prototype chip was prepared using a partial genomic library from total genomic DNA of 23 E. grandis trees, of which 22 were full siblings. A total of 384 cloned genomic fragments were individually amplified and arrayed onto glass slides. DNA fingerprints were obtained for 17 individuals by hybridizing labeled genome representations of the individual trees to the 384-element chip. Polymorphic DNA fragments were identified by evaluating the binary distribution of their background-corrected signal intensities across full-sib individuals. Among 384 DNA fragments on the chip, 104 (27%) were found to be polymorphic. Hybridization of these polymorphic fragments was highly repeatable (R² > 0.91) within the E. grandis individuals, and they allowed us to identify all 17 full-sib individuals. Our results suggest that DNA microarrays can be used to effectively fingerprint large numbers of closely related Eucalyptus trees.

  4. Fragment library design: using cheminformatics and expert chemists to fill gaps in existing fragment libraries.

    Science.gov (United States)

    Kutchukian, Peter S; So, Sung-Sau; Fischer, Christian; Waller, Chris L

    2015-01-01

    Fragment based screening (FBS) has emerged as a mainstream lead discovery strategy in academia, biotechnology start-ups, and large pharma. As a prerequisite of FBS, a structurally diverse library of fragments is desirable in order to identify chemical matter that will interact with the range of diverse target classes that are prosecuted in contemporary screening campaigns. In addition, it is also desirable to offer synthetically amenable starting points to increase the probability of a successful fragment evolution through medicinal chemistry. Herein we describe a method to identify biologically relevant chemical substructures that are missing from an existing fragment library (chemical gaps), and organize these chemical gaps hierarchically so that medicinal chemists can efficiently navigate the prioritized chemical space and subsequently select purchasable fragments for inclusion in an enhanced fragment library.

  5. GFVO: the Genomic Feature and Variation Ontology

    KAUST Repository

    Baran, Joachim

    2015-05-05

    Falling costs in genomic laboratory experiments have led to a steady increase of genomic feature and variation data. Multiple genomic data formats exist for sharing these data, and whilst they are similar, they address slightly different data viewpoints and are consequently not fully compatible with each other. The fragmentation of data format specifications makes it hard to integrate and interpret data for further analysis with information from multiple data providers. As a solution, a new ontology is presented here for annotating and representing genomic feature and variation dataset contents. The Genomic Feature and Variation Ontology (GFVO) specifically addresses genomic data as it is regularly shared using the GFF3 (incl. FASTA), GTF, GVF and VCF file formats. GFVO simplifies data integration and enables linking of genomic annotations across datasets through common semantics of genomic types and relations. Availability and implementation. The latest stable release of the ontology is available via its base URI; previous and development versions are available at the ontology’s GitHub repository: https://github.com/BioInterchange/Ontologies; versions of the ontology are indexed through BioPortal (without external class-/property-equivalences due to BioPortal release 4.10 limitations); examples and reference documentation are provided on a separate web-page: http://www.biointerchange.org/ontologies.html. GFVO version 1.0.2 is licensed under the CC0 1.0 Universal license (https://creativecommons.org/publicdomain/zero/1.0) and therefore de facto within the public domain; the ontology can be appropriated without attribution for commercial and non-commercial use.

  6. Detection of fission fragments by secondary emission; Detection des fragments de fission par emission secondaire

    Energy Technology Data Exchange (ETDEWEB)

    Audias, A [Commissariat a l' Energie Atomique, Saclay (France). Centre d' Etudes Nucleaires

    1965-07-01

    This fission fragment detecting apparatus is based on the principle that fragments traversing a thin foil will cause emission of secondary electrons. These electrons are then accelerated (10 kV) and directly detected by means of a plastic scintillator and associated photomultiplier. Some of the advantages of such a detector are its rapidity, its discriminating power between alpha particles and fission fragments, its small energy loss in detecting the fragments and the relatively great amount of fissionable material which it can contain. This paper is subdivided as follows: a) theoretical considerations, b) constructional details of apparatus and some experimental details, and c) a study of the secondary emission effect itself. (author) [French abstract, in translation] The fission fragment detector that we have built is based on the principle of secondary emission produced by fission fragments traversing a thin foil: the emitted secondary electrons are accelerated to voltages (of the order of 10 kV) such that they can be detected directly by a plastic scintillator coupled to a photomultiplier. The advantages of such a detector are its speed, its very good alpha/fission discrimination, the possibility of detecting fission fragments with an energy loss that can remain relatively small, and the possibility of introducing larger quantities of fissile material than in other types of detector. This work comprises: a bibliographic overview of the theory of the phenomenon; the construction and adjustment of the detector, with an experimental study of some parameters involved in secondary emission; a study of secondary emission (on the exit face of the fission fragments) as a function of the fragment energy and of the thickness of material traversed before secondary emission; and a comparative study of secondary emission on the entrance face and on the exit face of the fragments of

  7. Elucidating the triplicated ancestral genome structure of radish based on chromosome-level comparison with the Brassica genomes.

    Science.gov (United States)

    Jeong, Young-Min; Kim, Namshin; Ahn, Byung Ohg; Oh, Mijin; Chung, Won-Hyong; Chung, Hee; Jeong, Seongmun; Lim, Ki-Byung; Hwang, Yoon-Jung; Kim, Goon-Bo; Baek, Seunghoon; Choi, Sang-Bong; Hyung, Dae-Jin; Lee, Seung-Won; Sohn, Seong-Han; Kwon, Soo-Jin; Jin, Mina; Seol, Young-Joo; Chae, Won Byoung; Choi, Keun Jin; Park, Beom-Seok; Yu, Hee-Ju; Mun, Jeong-Hwan

    2016-07-01

    This study presents a chromosome-scale draft genome sequence of radish that is assembled into nine chromosomal pseudomolecules. A comprehensive comparative genome analysis with the Brassica genomes provides genomic evidences on the evolution of the mesohexaploid radish genome. Radish (Raphanus sativus L.) is an agronomically important root vegetable crop and its origin and phylogenetic position in the tribe Brassiceae is controversial. Here we present a comprehensive analysis of the radish genome based on the chromosome sequences of R. sativus cv. WK10039. The radish genome was sequenced and assembled into 426.2 Mb spanning >98 % of the gene space, of which 344.0 Mb were integrated into nine chromosome pseudomolecules. Approximately 36 % of the genome was repetitive sequences and 46,514 protein-coding genes were predicted and annotated. Comparative mapping of the tPCK-like ancestral genome revealed that the radish genome has intermediate characteristics between the Brassica A/C and B genomes in the triplicated segments, suggesting an internal origin from the genus Brassica. The evolutionary characteristics shared between radish and other Brassica species provided genomic evidences that the current form of nine chromosomes in radish was rearranged from the chromosomes of hexaploid progenitor. Overall, this study provides a chromosome-scale draft genome sequence of radish as well as novel insight into evolution of the mesohexaploid genomes in the tribe Brassiceae.

  8. Viral Genome DataBase: storing and analyzing genes and proteins from complete viral genomes.

    Science.gov (United States)

    Hiscock, D; Upton, C

    2000-05-01

    The Viral Genome DataBase (VGDB) contains detailed information of the genes and predicted protein sequences from 15 completely sequenced genomes of large (>100 kb) viruses (2847 genes). The data that is stored includes DNA sequence, protein sequence, GenBank and user-entered notes, molecular weight (MW), isoelectric point (pI), amino acid content, A + T%, nucleotide frequency, dinucleotide frequency and codon use. The VGDB is a mySQL database with a user-friendly JAVA GUI. Results of queries can be easily sorted by any of the individual parameters. The software and additional figures and information are available at http://athena.bioc.uvic.ca/genomes/index.html .
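    Several of the per-gene statistics listed above (A + T%, dinucleotide frequency) are simple functions of the DNA sequence. The sketch below shows how such values could be derived; it is a hypothetical illustration, not VGDB code, and the function names and example sequences are assumptions.

```python
from collections import Counter


def at_percent(dna):
    """Return the percentage of A and T bases in a DNA sequence."""
    seq = dna.upper()
    if not seq:
        return 0.0
    return 100.0 * (seq.count("A") + seq.count("T")) / len(seq)


def dinucleotide_frequencies(dna):
    """Return a dict mapping each observed dinucleotide to its frequency.

    Overlapping windows are counted, so a sequence of length n
    contributes n - 1 dinucleotides.
    """
    seq = dna.upper()
    counts = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Made-up sequences for illustration only.
toy_at = at_percent("AATTGC")
toy_dinucs = dinucleotide_frequencies("ACAC")
```

    Storing such derived values alongside each gene is what makes query results sortable by any individual parameter, as the abstract describes.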

  9. NeisseriaBase: a specialised Neisseria genomic resource and analysis platform.

    Science.gov (United States)

    Zheng, Wenning; Mutha, Naresh V R; Heydari, Hamed; Dutta, Avirup; Siow, Cheuk Chuen; Jakubovics, Nicholas S; Wee, Wei Yee; Tan, Shi Yang; Ang, Mia Yang; Wong, Guat Jah; Choo, Siew Woh

    2016-01-01

    Background. The gram-negative Neisseria is associated with two of the most potent human epidemic diseases: meningococcal meningitis and gonorrhoea. In both cases, disease is caused by bacteria colonizing human mucosal membrane surfaces. Overall, the genus shows great diversity and genetic variation mainly due to its ability to acquire and incorporate genetic material from a diverse range of sources through horizontal gene transfer. Although a number of databases exist for the Neisseria genomes, they are mostly focused on the pathogenic species. In this study we present the freely available NeisseriaBase, a database dedicated to the genus Neisseria encompassing the complete and draft genomes of 15 pathogenic and commensal Neisseria species. Methods. The genomic data were retrieved from the National Center for Biotechnology Information (NCBI), annotated using the RAST server and then stored in the MySQL database. The protein-coding genes were further analyzed to obtain information such as GC content (%), predicted hydrophobicity and molecular weight (Da) using in-house Perl scripts. The web application was developed following the secure four-tier web application architecture: (1) client workstation, (2) web server, (3) application server, and (4) database server. The web interface was constructed using PHP, JavaScript, jQuery, AJAX and CSS, utilizing the model-view-controller (MVC) framework. The in-house developed bioinformatics tools implemented in NeisseriaBase were developed using Python, Perl, BioPerl and R languages. Results. Currently, NeisseriaBase houses 603,500 Coding Sequences (CDSs), 16,071 RNAs and 13,119 tRNA genes from 227 Neisseria genomes. The database is equipped with interactive web interfaces. Incorporation of the JBrowse genome browser in the database enables fast and smooth browsing of Neisseria genomes. NeisseriaBase includes the standard BLAST program to facilitate homology searching, and for Virulence Factor

  10. NeisseriaBase: a specialised Neisseria genomic resource and analysis platform

    Directory of Open Access Journals (Sweden)

    Wenning Zheng

    2016-03-01

    Background. The gram-negative Neisseria is associated with two of the most potent human epidemic diseases: meningococcal meningitis and gonorrhoea. In both cases, disease is caused by bacteria colonizing human mucosal membrane surfaces. Overall, the genus shows great diversity and genetic variation mainly due to its ability to acquire and incorporate genetic material from a diverse range of sources through horizontal gene transfer. Although a number of databases exist for the Neisseria genomes, they are mostly focused on the pathogenic species. In this present study we present the freely available NeisseriaBase, a database dedicated to the genus Neisseria encompassing the complete and draft genomes of 15 pathogenic and commensal Neisseria species. Methods. The genomic data were retrieved from National Center for Biotechnology Information (NCBI) and annotated using the RAST server which were then stored into the MySQL database. The protein-coding genes were further analyzed to obtain information such as calculation of GC content (%), predicted hydrophobicity and molecular weight (Da) using in-house Perl scripts. The web application was developed following the secure four-tier web application architecture: (1) client workstation, (2) web server, (3) application server, and (4) database server. The web interface was constructed using PHP, JavaScript, jQuery, AJAX and CSS, utilizing the model-view-controller (MVC) framework. The in-house developed bioinformatics tools implemented in NeisseriaBase were developed using Python, Perl, BioPerl and R languages. Results. Currently, NeisseriaBase houses 603,500 Coding Sequences (CDSs), 16,071 RNAs and 13,119 tRNA genes from 227 Neisseria genomes. The database is equipped with interactive web interfaces. Incorporation of the JBrowse genome browser in the database enables fast and smooth browsing of Neisseria genomes. NeisseriaBase includes the standard BLAST program to facilitate homology searching, and for Virulence

  11. Properties of promoters cloned randomly from the Saccharomyces cerevisiae genome.

    Science.gov (United States)

    Santangelo, G M; Tornow, J; McLaughlin, C S; Moldave, K

    1988-01-01

    Promoters were isolated at random from the genome of Saccharomyces cerevisiae by using a plasmid that contains a divergently arrayed pair of promoterless reporter genes. A comprehensive library was constructed by inserting random (DNase I-generated) fragments into the intergenic region upstream from the reporter genes. Simple in vivo assays for either reporter gene product (alcohol dehydrogenase or beta-galactosidase) allowed the rapid identification of promoters from among these random fragments. Poly(dA-dT) homopolymer tracts were present in three of five randomly cloned promoters. With two exceptions, each RNA start site detected was 40 to 100 base pairs downstream from a TATA element. All of the randomly cloned promoters were capable of activating reporter gene transcription bidirectionally. Interestingly, one of the promoter fragments originated in a region of the S. cerevisiae rDNA spacer; regulated divergent transcription (presumably by RNA polymerase II) initiated in the same region. PMID:2847031

  12. Gene fragmentation: a key to mitochondrial genome evolution in Euglenozoa?

    Czech Academy of Sciences Publication Activity Database

    Flegontov, Pavel; Gray, M.W.; Burger, G.; Lukeš, Julius

    2011-01-01

    Vol. 57, No. 4 (2011), pp. 225-232. ISSN 0172-8083. Institutional research plan: CEZ:AV0Z60220518. Keywords: Euglena * Diplonema * Mitochondrial genome * RNA editing * Constructive neutral evolution. Subject RIV: EB - Genetics; Molecular Biology. Impact factor: 2.556, year: 2011

  13. Time-zero fission-fragment detector based on low-pressure multiwire proportional chambers

    International Nuclear Information System (INIS)

    Assamagan, K.; Baker, K.; Bayatyan, G.; Carlini, R.; Danagoulian, S.; Eden, T.; Egiyan, K.; Ent, R.; Fenker, H.; Gan, L.; Gasparian, A.; Grigoryan, N.; Greenwood, Z.; Gueye, P.; Hashimoto, O.; Johnston, K.; Keppel, C.; Knyazyan, S.; Majewski, S.; Margaryan, A.; Margaryan, Yu.; Marikyan, G.; Martoff, J.; Mkrtchyan, H.; Parlakyan, L.; Sato, Y.; Sawafta, R.; Simicevic, N.; Tadevosyan, V.; Takahashi, T.; Tang, L.; Vartanyan, G.; Vulcan, W.; Wells, S.; Wood, S.

    1999-01-01

    A time-zero fission fragment (FF) detector, based on the technique of low-pressure multiwire proportional chambers (LPMWPC), has been designed and constructed for the heavy hypernuclear lifetime experiment (E95-002) at Thomas Jefferson National Accelerator Facility. Its characteristics and the method of time-zero reconstruction were investigated using fission fragments from a 252Cf spontaneous fission source. The influence of the ionization energy loss was also studied. It is shown that heptane, hexane, and isobutane gases at a pressure of 1-2 Torr are all suitable for such an FF detector. As desired by the experiment, a timing resolution of about 200 ps (FWHM) for a chamber size of 21×21 cm² was achieved.

  14. Native Mass Spectrometry in Fragment-Based Drug Discovery

    Directory of Open Access Journals (Sweden)

    Liliana Pedro

    2016-07-01

    The advent of native mass spectrometry (MS) in 1990 led to the development of new mass spectrometry instrumentation and methodologies for the analysis of noncovalent protein–ligand complexes. Native MS has matured to become a fast, simple, highly sensitive and automatable technique with well-established utility for fragment-based drug discovery (FBDD). Native MS has the capability to directly detect weak ligand binding to proteins, to determine stoichiometry, relative or absolute binding affinities and specificities. Native MS can be used to delineate ligand-binding sites, to elucidate mechanisms of cooperativity and to study the thermodynamics of binding. This review highlights key attributes of native MS for FBDD campaigns.

  15. Native Mass Spectrometry in Fragment-Based Drug Discovery.

    Science.gov (United States)

    Pedro, Liliana; Quinn, Ronald J

    2016-07-28

    The advent of native mass spectrometry (MS) in 1990 led to the development of new mass spectrometry instrumentation and methodologies for the analysis of noncovalent protein-ligand complexes. Native MS has matured to become a fast, simple, highly sensitive and automatable technique with well-established utility for fragment-based drug discovery (FBDD). Native MS has the capability to directly detect weak ligand binding to proteins, to determine stoichiometry, relative or absolute binding affinities and specificities. Native MS can be used to delineate ligand-binding sites, to elucidate mechanisms of cooperativity and to study the thermodynamics of binding. This review highlights key attributes of native MS for FBDD campaigns.

  16. Developments in SPR Fragment Screening.

    Science.gov (United States)

    Chavanieu, Alain; Pugnière, Martine

    2016-01-01

    Fragment-based approaches have played an increasing role alongside high-throughput screening in drug discovery for 15 years. The label-free biosensor technology based on surface plasmon resonance (SPR) is now sensitive and informative enough to serve during primary screens and validation steps. In this review, the authors discuss the role of SPR in fragment screening. After a brief description of the underlying principles of the technique and main device developments, they evaluate the advantages and adaptations of SPR for fragment-based drug discovery. SPR can also be applied to challenging targets such as membrane receptors and enzymes. The high-level of immobilization of the protein target and its stability are key points for a relevant screening that can be optimized using oriented immobilized proteins and regenerable sensors. Furthermore, to decrease the rate of false negatives, a selectivity test may be performed in parallel on the main target bearing the binding site mutated or blocked with a low-off-rate ligand. Fragment-based drug design, integrated in a rational workflow led by SPR, will thus have a predominant role for the next wave of drug discovery which could be greatly enhanced by new improvements in SPR devices.

  17. Organization and evolution of primate centromeric DNA from whole-genome shotgun sequence data.

    Directory of Open Access Journals (Sweden)

    Can Alkan

    2007-09-01

    The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%-5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Here, we develop computational methods to rapidly recover and categorize alpha-satellite sequences from previously uncharacterized whole-genome shotgun sequence data. We present an algorithm to computationally predict potential higher-order array structure based on paired-end sequence data and then experimentally validate its organization and distribution by experimental analyses. Using whole-genome shotgun data from the human, chimpanzee, and macaque genomes, we examine the phylogenetic relationship of these sequences and provide further support for a model for their evolution and mutation over the last 25 million years. Our results confirm fundamental differences in the dispersal and evolution of centromeric satellites in the Old World monkey and ape lineages of evolution.

  18. Organization and evolution of primate centromeric DNA from whole-genome shotgun sequence data.

    Science.gov (United States)

    Alkan, Can; Ventura, Mario; Archidiacono, Nicoletta; Rocchi, Mariano; Sahinalp, S Cenk; Eichler, Evan E

    2007-09-01

    The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%-5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Here, we develop computational methods to rapidly recover and categorize alpha-satellite sequences from previously uncharacterized whole-genome shotgun sequence data. We present an algorithm to computationally predict potential higher-order array structure based on paired-end sequence data and then experimentally validate its organization and distribution by experimental analyses. Using whole-genome shotgun data from the human, chimpanzee, and macaque genomes, we examine the phylogenetic relationship of these sequences and provide further support for a model for their evolution and mutation over the last 25 million years. Our results confirm fundamental differences in the dispersal and evolution of centromeric satellites in the Old World monkey and ape lineages of evolution.

  19. DNA Length Modulates the Affinity of Fragments of Genomic DNA for the Nuclear Matrix In Vitro.

    Science.gov (United States)

    García-Vilchis, David; Aranda-Anzaldo, Armando

    2017-12-01

    Classical observations have shown that during the interphase the chromosomal DNA of metazoans is organized in supercoiled loops attached to a compartment known as the nuclear matrix (NM). Fragments of chromosomal DNA able to bind the isolated NM in vitro are known as matrix associated/attachment/addressed regions or MARs. No specific consensus sequence or motif has been found that may constitute a universal, defining feature of MARs. On the other hand, high-salt resistant DNA-NM interactions in situ define true DNA loop anchorage regions or LARs, that might correspond to a subset of the potential MARs but are not necessarily identical to MARs characterized in vitro, since there are several examples of MARs able to bind the NM in vitro but which are not actually bound to the NM in situ. In the present work we assayed the capacity of two LARs, as well as of shorter fragments within such LARs, for binding to the NM in vitro. Paradoxically the isolated (≈2 kb) LARs cannot bind to the NM in vitro while their shorter (≈300 bp) sub-fragments and other non-related but equally short DNA fragments bind to the NM in a high-salt resistant fashion. Our results suggest that the ability of a given DNA fragment to bind to the NM in vitro primarily depends on the length of the fragment, suggesting that binding to the NM is modulated by the local topology of the DNA fragment in suspension, which is known to depend on the DNA length. J. Cell. Biochem. 118: 4487-4497, 2017. © 2017 Wiley Periodicals, Inc.

  20. YersiniaBase: a genomic resource and analysis platform for comparative analysis of Yersinia.

    Science.gov (United States)

    Tan, Shi Yang; Dutta, Avirup; Jakubovics, Nicholas S; Ang, Mia Yang; Siow, Cheuk Chuen; Mutha, Naresh Vr; Heydari, Hamed; Wee, Wei Yee; Wong, Guat Jah; Choo, Siew Woh

    2015-01-16

    Yersinia is a genus of Gram-negative bacteria that includes serious pathogens such as Yersinia pestis, which causes plague, Yersinia pseudotuberculosis and Yersinia enterocolitica. The remaining species are generally considered non-pathogenic to humans, although there is evidence that at least some of these species can cause occasional infections using distinct mechanisms from the more pathogenic species. With the advances in sequencing technologies, many genomes of Yersinia have been sequenced. However, there is currently no specialized platform to hold the rapidly-growing Yersinia genomic data and to provide analysis tools particularly for comparative analyses, which are required to provide improved insights into their biology, evolution and pathogenicity. To facilitate the ongoing and future research of Yersinia, especially those generally considered non-pathogenic species, a well-defined repository and analysis platform is needed to hold the Yersinia genomic data and analysis tools for the Yersinia research community. Hence, we have developed the YersiniaBase, a robust and user-friendly Yersinia resource and analysis platform for the analysis of Yersinia genomic data. YersiniaBase has a total of twelve species and 232 genome sequences, of which the majority are Yersinia pestis. In order to smooth the process of searching genomic data in a large database, we implemented an Asynchronous JavaScript and XML (AJAX)-based real-time searching system in YersiniaBase. Besides incorporating existing tools, which include JavaScript-based genome browser (JBrowse) and Basic Local Alignment Search Tool (BLAST), YersiniaBase also has in-house developed tools: (1) Pairwise Genome Comparison tool (PGC) for comparing two user-selected genomes; (2) Pathogenomics Profiling Tool (PathoProT) for comparative pathogenomics analysis of Yersinia genomes; (3) YersiniaTree for constructing phylogenetic tree of Yersinia. We ran analyses based on the tools and genomic data in YersiniaBase and the

  1. CRISPR/Cas9 Based Genome Editing of Penicillium chrysogenum.

    Science.gov (United States)

    Pohl, C; Kiel, J A K W; Driessen, A J M; Bovenberg, R A L; Nygård, Y

    2016-07-15

    CRISPR/Cas9 based systems have emerged as versatile platforms for precision genome editing in a wide range of organisms. Here we have developed powerful CRISPR/Cas9 tools for marker-based and marker-free genome modifications in Penicillium chrysogenum, a model filamentous fungus and industrially relevant cell factory. The developed CRISPR/Cas9 toolbox is highly flexible and allows editing of new targets with minimal cloning efforts. The Cas9 protein and the sgRNA can be either delivered during transformation, as preassembled CRISPR-Cas9 ribonucleoproteins (RNPs) or expressed from an AMA1 based plasmid within the cell. The direct delivery of the Cas9 protein with in vitro synthesized sgRNA to the cells allows for a transient method for genome engineering that may rapidly be applicable for other filamentous fungi. The expression of Cas9 from an AMA1 based vector was shown to be highly efficient for marker-free gene deletions.

  2. Contributions of computational chemistry and biophysical techniques to fragment-based drug discovery.

    Science.gov (United States)

    Gozalbes, Rafael; Carbajo, Rodrigo J; Pineda-Lucena, Antonio

    2010-01-01

    In the last decade, fragment-based drug discovery (FBDD) has evolved from a novel approach in the search of new hits to a valuable alternative to the high-throughput screening (HTS) campaigns of many pharmaceutical companies. The increasing relevance of FBDD in the drug discovery universe has been concomitant with an implementation of the biophysical techniques used for the detection of weak inhibitors, e.g. NMR, X-ray crystallography or surface plasmon resonance (SPR). At the same time, computational approaches have also been progressively incorporated into the FBDD process and nowadays several computational tools are available. These stretch from the filtering of huge chemical databases in order to build fragment-focused libraries comprising compounds with adequate physicochemical properties, to more evolved models based on different in silico methods such as docking, pharmacophore modelling, QSAR and virtual screening. In this paper we will review the parallel evolution and complementarities of biophysical techniques and computational methods, providing some representative examples of drug discovery success stories by using FBDD.

  3. KGCAK: a K-mer based database for genome-wide phylogeny and complexity evaluation.

    Science.gov (United States)

    Wang, Dapeng; Xu, Jiayue; Yu, Jun

    2015-09-16

    The K-mer approach, treating genomic sequences as simple characters and counting the relative abundance of each string upon a fixed K, has been extensively applied to phylogeny inference for genome assembly, annotation, and comparison. To meet increasing demands for comparing large genome sequences and to promote the use of the K-mer approach, we develop a versatile database, KGCAK ( http://kgcak.big.ac.cn/KGCAK/ ), containing ~8,000 genomes that include genome sequences of diverse life forms (viruses, prokaryotes, protists, animals, and plants) and cellular organelles of eukaryotic lineages. It builds phylogeny based on genomic elements in an alignment-free fashion and provides in-depth data processing enabling users to compare the complexity of genome sequences based on K-mer distribution. We hope that KGCAK becomes a powerful tool for exploring relationship within and among groups of species in a tree of life based on genomic data.
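
    The K-mer approach described in this record (counting the relative abundance of each length-K string and comparing sequences without alignment) can be sketched in a few lines. This is an illustrative sketch only: the function names are hypothetical, and the Euclidean distance between profiles is an assumed choice, not necessarily the measure KGCAK uses.

```python
from collections import Counter
from math import sqrt


def kmer_profile(seq: str, k: int) -> dict:
    """Relative abundance of every overlapping k-mer in seq (hypothetical helper)."""
    seq = seq.upper()
    if k <= 0 or len(seq) < k:
        raise ValueError("k must be positive and no longer than the sequence")
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {kmer: count / total for kmer, count in counts.items()}


def profile_distance(p: dict, q: dict) -> float:
    """Euclidean distance between two k-mer profiles (assumed metric)."""
    keys = set(p) | set(q)
    return sqrt(sum((p.get(x, 0.0) - q.get(x, 0.0)) ** 2 for x in keys))


# Toy sequences, made up for the example.
a = kmer_profile("ACGTACGT", 2)
b = kmer_profile("ACGTTCGT", 2)
print(profile_distance(a, b))
```

    A matrix of such pairwise distances could then be passed to any standard tree-building method; which K and which distance work best depends on genome size and is left open here.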

  4. Fragment-based modelling of single stranded RNA bound to RNA recognition motif containing proteins

    Science.gov (United States)

    de Beauchene, Isaure Chauvot; de Vries, Sjoerd J.; Zacharias, Martin

    2016-01-01

    Protein-RNA complexes are important for many biological processes. However, structural modeling of such complexes is hampered by the high flexibility of RNA. Particularly challenging is the docking of single-stranded RNA (ssRNA). We have developed a fragment-based approach to model the structure of ssRNA bound to a protein, based on only the protein structure, the RNA sequence and conserved contacts. The conformational diversity of each RNA fragment is sampled by an exhaustive library of trinucleotides extracted from all known experimental protein–RNA complexes. The method was applied to ssRNA with up to 12 nucleotides which bind to dimers of the RNA recognition motifs (RRMs), a highly abundant eukaryotic RNA-binding domain. The fragment based docking allows a precise de novo atomic modeling of protein-bound ssRNA chains. On a benchmark of seven experimental ssRNA–RRM complexes, near-native models (with a mean heavy-atom deviation of <3 Å from experiment) were generated for six out of seven bound RNA chains, and even more precise models (deviation < 2 Å) were obtained for five out of seven cases, a significant improvement compared to the state of the art. The method is not restricted to RRMs but was also successfully applied to Pumilio RNA binding proteins. PMID:27131381

  5. From Protein Structure to Small-Molecules: Recent Advances and Applications to Fragment-Based Drug Discovery.

    Science.gov (United States)

    Ferreira, Leonardo G; Andricopulo, Adriano D

    2017-01-01

    Fragment-based drug discovery (FBDD) is a broadly used strategy in structure-guided ligand design, whereby low-molecular weight hits move from lead-like to drug-like compounds. Over the past 15 years, an increasingly important role of the integration of these strategies into industrial and academic research platforms has been successfully established, allowing outstanding contributions to drug discovery. One important factor for the current prominence of FBDD is the better coverage of the chemical space provided by fragment-like libraries. The development of the field relies on two features: (i) the growing number of structurally characterized drug targets and (ii) the enormous chemical diversity available for experimental and virtual screenings. Indeed, fragment-based campaigns have contributed to address major challenges in lead optimization, such as the appropriate physicochemical profile of clinical candidates. This perspective paper outlines the usefulness and applications of FBDD approaches in medicinal chemistry and drug design. Copyright © Bentham Science Publishers.

  6. WormBase 2016: expanding to enable helminth genomic research.

    Science.gov (United States)

    Howe, Kevin L; Bolt, Bruce J; Cain, Scott; Chan, Juancarlos; Chen, Wen J; Davis, Paul; Done, James; Down, Thomas; Gao, Sibyl; Grove, Christian; Harris, Todd W; Kishore, Ranjana; Lee, Raymond; Lomax, Jane; Li, Yuling; Muller, Hans-Michael; Nakamura, Cecilia; Nuin, Paulo; Paulini, Michael; Raciti, Daniela; Schindelman, Gary; Stanley, Eleanor; Tuli, Mary Ann; Van Auken, Kimberly; Wang, Daniel; Wang, Xiaodong; Williams, Gary; Wright, Adam; Yook, Karen; Berriman, Matthew; Kersey, Paul; Schedl, Tim; Stein, Lincoln; Sternberg, Paul W

    2016-01-04

    WormBase (www.wormbase.org) is a central repository for research data on the biology, genetics and genomics of Caenorhabditis elegans and other nematodes. The project has evolved from its original remit to collect and integrate all data for a single species, and now extends to numerous nematodes, ranging from evolutionary comparators of C. elegans to parasitic species that threaten plant, animal and human health. Research activity using C. elegans as a model system is as vibrant as ever, and we have created new tools for community curation in response to the ever-increasing volume and complexity of data. To better allow users to navigate their way through these data, we have made a number of improvements to our main website, including new tools for browsing genomic features and ontology annotations. Finally, we have developed a new portal for parasitic worm genomes. WormBase ParaSite (parasite.wormbase.org) contains all publicly available nematode and platyhelminth annotated genome sequences, and is designed specifically to support helminth genomic research. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Dissecting fragment-based lead discovery at the von Hippel-Lindau protein:hypoxia inducible factor 1α protein-protein interface.

    Science.gov (United States)

    Van Molle, Inge; Thomann, Andreas; Buckley, Dennis L; So, Ernest C; Lang, Steffen; Crews, Craig M; Ciulli, Alessio

    2012-10-26

    Fragment screening is widely used to identify attractive starting points for drug design. However, its potential and limitations to assess the tractability of often challenging protein:protein interfaces have been underexplored. Here, we address this question by means of a systematic deconstruction of lead-like inhibitors of the pVHL:HIF-1α interaction into their component fragments. Using biophysical techniques commonly employed for screening, we could only detect binding of fragments that violate the Rule of Three, are more complex than those typically screened against classical druggable targets, and occupy two adjacent binding subsites at the interface rather than just one. Analyses based on ligand and group lipophilicity efficiency of anchored fragments were applied to dissect the individual subsites and probe for binding hot spots. The implications of our findings for targeting protein interfaces by fragment-based approaches are discussed. Copyright © 2012 Elsevier Ltd. All rights reserved.

  8. Benchmark fragment-based 1H, 13C, 15N and 17O chemical shift predictions in molecular crystals

    Science.gov (United States)

    Hartman, Joshua D.; Kudla, Ryan A.; Day, Graeme M.; Mueller, Leonard J.; Beran, Gregory J. O.

    2016-01-01

    The performance of fragment-based ab initio 1H, 13C, 15N and 17O chemical shift predictions is assessed against experimental NMR chemical shift data in four benchmark sets of molecular crystals. Employing a variety of commonly used density functionals (PBE0, B3LYP, TPSSh, OPBE, PBE, TPSS), we explore the relative performance of cluster, two-body fragment, and combined cluster/fragment models. The hybrid density functionals (PBE0, B3LYP and TPSSh) generally out-perform their generalized gradient approximation (GGA)-based counterparts. 1H, 13C, 15N, and 17O isotropic chemical shifts can be predicted with root-mean-square errors of 0.3, 1.5, 4.2, and 9.8 ppm, respectively, using a computationally inexpensive electrostatically embedded two-body PBE0 fragment model. Oxygen chemical shieldings prove particularly sensitive to local many-body effects, and using a combined cluster/fragment model instead of the simple two-body fragment model decreases the root-mean-square errors to 7.6 ppm. These fragment-based model errors compare favorably with GIPAW PBE ones of 0.4, 2.2, 5.4, and 7.2 ppm for the same 1H, 13C, 15N, and 17O test sets. Using these benchmark calculations, a set of recommended linear regression parameters for mapping between calculated chemical shieldings and observed chemical shifts are provided and their robustness assessed using statistical cross-validation. We demonstrate the utility of these approaches and the reported scaling parameters on applications to 9-tertbutyl anthracene, several histidine co-crystals, benzoic acid and the C-nitrosoarene SnCl2(CH3)2(NODMA)2. PMID:27431490
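
    The abstract mentions linear regression parameters for mapping calculated chemical shieldings to observed chemical shifts, and reports accuracy as root-mean-square errors. A minimal sketch of such a mapping, assuming the usual linear form shift = A * shielding + B fitted by ordinary least squares, is given below. The function names are hypothetical, and the numbers are synthetic values placed exactly on a line so the fit can be checked by hand; they are not the paper's data or its recommended parameters.

```python
from math import sqrt


def fit_line(x, y):
    """Ordinary least-squares slope and intercept for y = slope * x + intercept."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("all x values are identical")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def rmse(predicted, observed):
    """Root-mean-square error between predicted and observed values."""
    return sqrt(sum((p - o) ** 2 for p, o in zip(predicted, observed)) / len(observed))


# Synthetic illustration only: shifts generated from shift = -1.0 * shielding + 31.0.
shieldings = [20.0, 25.0, 30.0]
shifts = [11.0, 6.0, 1.0]

slope, intercept = fit_line(shieldings, shifts)
predicted = [slope * s + intercept for s in shieldings]
print(slope, intercept, rmse(predicted, shifts))
```

    With real data the fit would be done separately for each nucleus, and the error would ideally be estimated on held-out crystals, in the spirit of the cross-validation the abstract describes.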

  9. Benchmark fragment-based (1)H, (13)C, (15)N and (17)O chemical shift predictions in molecular crystals.

    Science.gov (United States)

    Hartman, Joshua D; Kudla, Ryan A; Day, Graeme M; Mueller, Leonard J; Beran, Gregory J O

    2016-08-21

    The performance of fragment-based ab initio (1)H, (13)C, (15)N and (17)O chemical shift predictions is assessed against experimental NMR chemical shift data in four benchmark sets of molecular crystals. Employing a variety of commonly used density functionals (PBE0, B3LYP, TPSSh, OPBE, PBE, TPSS), we explore the relative performance of cluster, two-body fragment, and combined cluster/fragment models. The hybrid density functionals (PBE0, B3LYP and TPSSh) generally out-perform their generalized gradient approximation (GGA)-based counterparts. (1)H, (13)C, (15)N, and (17)O isotropic chemical shifts can be predicted with root-mean-square errors of 0.3, 1.5, 4.2, and 9.8 ppm, respectively, using a computationally inexpensive electrostatically embedded two-body PBE0 fragment model. Oxygen chemical shieldings prove particularly sensitive to local many-body effects, and using a combined cluster/fragment model instead of the simple two-body fragment model decreases the root-mean-square errors to 7.6 ppm. These fragment-based model errors compare favorably with GIPAW PBE ones of 0.4, 2.2, 5.4, and 7.2 ppm for the same (1)H, (13)C, (15)N, and (17)O test sets. Using these benchmark calculations, a set of recommended linear regression parameters for mapping between calculated chemical shieldings and observed chemical shifts are provided and their robustness assessed using statistical cross-validation. We demonstrate the utility of these approaches and the reported scaling parameters on applications to 9-tert-butyl anthracene, several histidine co-crystals, benzoic acid and the C-nitrosoarene SnCl2(CH3)2(NODMA)2.

  10. DebriSat - A Planned Laboratory-Based Satellite Impact Experiment for Breakup Fragment Characterizations

    Science.gov (United States)

    Liou, Jer-Chyi; Clark, S.; Fitz-Coy, N.; Huynh, T.; Opiela, J.; Polk, M.; Roebuck, B.; Rushing, R.; Sorge, M.; Werremeyer, M.

    2013-01-01

    The goal of the DebriSat project is to characterize fragments generated by a hypervelocity collision involving a modern satellite in low Earth orbit (LEO). The DebriSat project will update and expand upon the information obtained in the 1992 Satellite Orbital Debris Characterization Impact Test (SOCIT), which characterized the breakup of a 1960s US Navy Transit satellite. There are three phases to this project: the design and fabrication of DebriSat - an engineering model representing a modern, 60-cm/50-kg class LEO satellite; conduction of a laboratory-based hypervelocity impact to catastrophically break up the satellite; and characterization of the properties of breakup fragments down to 2 mm in size. The data obtained, including fragment size, area-to-mass ratio, density, shape, material composition, optical properties, and radar cross-section distributions, will be used to supplement the DoD's and NASA's satellite breakup models to better describe the breakup outcome of a modern satellite.

  11. Genome-wide mapping of autonomous promoter activity in human cells.

    Science.gov (United States)

    van Arensbergen, Joris; FitzPatrick, Vincent D; de Haas, Marcel; Pagie, Ludo; Sluimer, Jasper; Bussemaker, Harmen J; van Steensel, Bas

    2017-02-01

    Previous methods to systematically characterize sequence-intrinsic activity of promoters have been limited by relatively low throughput and the length of the sequences that could be tested. Here we present 'survey of regulatory elements' (SuRE), a method that assays more than 10^8 DNA fragments, each 0.2-2 kb in size, for their ability to drive transcription autonomously. In SuRE, a plasmid library of random genomic fragments upstream of a 20-bp barcode is constructed, and decoded by paired-end sequencing. This library is used to transfect cells, and barcodes in transcribed RNA are quantified by high-throughput sequencing. When applied to the human genome, we achieve 55-fold genome coverage, allowing us to map autonomous promoter activity genome-wide in K562 cells. By computational modeling we delineate subregions within promoters that are relevant for their activity. We show that antisense promoter transcription is generally dependent on the sense core promoter sequences, and that most enhancers and several families of repetitive elements act as autonomous transcription initiation sites.

  12. A linear mitochondrial genome of Cyclospora cayetanensis (Eimeriidae, Eucoccidiorida, Coccidiasina, Apicomplexa) suggests the ancestral start position within mitochondrial genomes of eimeriid coccidia.

    Science.gov (United States)

    Ogedengbe, Mosun E; Qvarnstrom, Yvonne; da Silva, Alexandre J; Arrowood, Michael J; Barta, John R

    2015-05-01

    The near complete mitochondrial genome for Cyclospora cayetanensis is 6184 bp in length with three protein-coding genes (Cox1, Cox3, CytB) and numerous lsrDNA and ssrDNA fragments. Gene arrangements were conserved with other coccidia in the Eimeriidae, but the C. cayetanensis mitochondrial genome is not circular-mapping. Terminal transferase tailing and nested PCR completed the 5'-terminus of the genome starting with a 21 bp A/T-only region that forms a potential stem-loop. Regions homologous to the C. cayetanensis mitochondrial genome 5'-terminus are found in all eimeriid mitochondrial genomes available and suggest this may be the ancestral start of eimeriid mitochondrial genomes. Copyright © 2015 Australian Society for Parasitology Inc. All rights reserved.

  13. Study on the Mitochondrial Genome of Sea Island Cotton (Gossypium barbadense) by BAC Library Screening

    Institute of Scientific and Technical Information of China (English)

    SU Ai-guo; LI Shuang-shuang; LIU Guo-zheng; LEI Bin-bin; KANG Ding-ming; LI Zhao-hu; MA Zhi-ying; HUA Jin-ping

    2014-01-01

    The plant mitochondrial genome displays complex features, particularly in terms of cytoplasmic male sterility (CMS). Therefore, research on the cotton mitochondrial genome may provide important information for analyzing genome evolution and exploring the molecular mechanism of CMS. In this paper, we present a preliminary study on the mitochondrial genome of sea island cotton (Gossypium barbadense) based on positive clones from the bacterial artificial chromosome (BAC) library. Thirty-five primers designed with the conserved sequences of functional genes and exons of mitochondria were used to screen positive clones in the genome library of the sea island cotton variety called Pima 90-53. Ten BAC clones were obtained and verified for further study. A contig was obtained based on six overlapping clones and subsequently laid out primarily on the mitochondrial genome. One BAC clone, clone 6, which harbored an insert of approximately 115 kb of mtDNA sequence and from which more than 10 primer fragments could be amplified, was sequenced and assembled using the Solexa strategy. Fifteen mitochondrial functional genes were revealed in clone 6 by gene annotation. The characteristics of the syntenic gene/exon of the sequences and RNA editing were preliminarily predicted.

  14. The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes.

    Science.gov (United States)

    Mao, Qing; Ciotlos, Serban; Zhang, Rebecca Yu; Ball, Madeleine P; Chin, Robert; Carnevali, Paolo; Barua, Nina; Nguyen, Staci; Agarwal, Misha R; Clegg, Tom; Connelly, Abram; Vandewege, Ward; Zaranek, Alexander Wait; Estep, Preston W; Church, George M; Drmanac, Radoje; Peters, Brock A

    2016-10-11

    Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced. A stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology. In addition, much of the genomic data is not available to the public and lacks phenotypic information. As part of the Personal Genome Project, blood samples from 184 participants were collected and processed using Complete Genomics' Long Fragment Read technology. Here, we present the experimental whole genome haplotyping and sequencing of these samples to an average read coverage depth of 100X. This is approximately three-fold higher than the read coverage applied to most whole human genome assemblies and ensures the highest quality results. Currently, 114 genomes from this dataset are freely available in the GigaDB repository and are associated with rich phenotypic data; the remaining 70 should be added in the near future as they are approved through the PGP data release process. For reproducibility analyses, 20 genomes were sequenced at least twice using independent LFR barcoded libraries. Seven genomes were also sequenced using Complete Genomics' standard non-barcoded library process. In addition, we report 2.6 million high-quality, rare variants not previously identified in the Single Nucleotide Polymorphisms database or the 1000 Genomes Project Phase 3 data. These genomes represent a unique source of haplotype and phenotype data for the scientific community and should help to expand our understanding of human genome evolution and function.

  15. Time-zero fission-fragment detector based on low-pressure multiwire proportional chambers

    CERN Document Server

    Assamagan, Ketevi A; Bayatyan, G L; Carlini, R; Danagulyan, S; Eden, T; Egiyan, K; Ent, R; Fenker, H; Gan, L; Gasparian, A; Grigoryan, N K; Greenwood, Z; Gueye, P; Hashimoto, O; Johnston, K; Keppel, C; Knyazyan, S; Majewski, S; Margaryan, A; Margaryan, Yu L; Marikian, G G; Martoff, J; Mkrtchyan, H G; Parlakyan, L; Sato, Y; Sawafta, R; Simicevic, N; Tadevosyan, V; Takahashi, T; Tang, L; Vartanian, G S; Vulcan, W; Wells, S; Wood, S

    1999-01-01

    A time-zero fission fragment (FF) detector, based on the technique of low-pressure multiwire proportional chambers (LPMWPC), has been designed and constructed for the heavy hypernuclear lifetime experiment (E95-002) at Thomas Jefferson National Accelerator Facility. Its characteristics and the method of time-zero reconstruction were investigated using fission fragments from a 252Cf spontaneous fission source. The influence of the ionization energy loss was also studied. It is shown that Heptane, Hexane, and Isobutane gases at a pressure of 1-2 Torr are all suitable for such a FF detector. As desired by experiment, a timing resolution of about 200 ps (FWHM) for a chamber size of 21x21 cm^2 was achieved.

  16. The effect of using genealogy-based haplotypes for genomic prediction.

    Science.gov (United States)

    Edriss, Vahid; Fernando, Rohan L; Su, Guosheng; Lund, Mogens S; Guldbrandtsen, Bernt

    2013-03-06

    Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individual markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy.

  17. Integrated Genome-Based Studies of Shewanella Ecophysiology

    Energy Technology Data Exchange (ETDEWEB)

    Margrethe H. Serres

    2012-06-29

    Shewanella oneidensis MR-1 is a motile, facultative γ-Proteobacterium with remarkable respiratory versatility; it can utilize a range of organic and inorganic compounds as terminal electron acceptors for anaerobic metabolism. The ability to effectively reduce nitrate, S0, polyvalent metals and radionuclides has established MR-1 as an important model dissimilatory metal-reducing microorganism for genome-based investigations of biogeochemical transformation of metals and radionuclides that are of concern to the U.S. Department of Energy (DOE) sites nationwide. Metal-reducing bacteria such as Shewanella also have a highly developed capacity for extracellular transfer of respiratory electrons to solid phase Fe and Mn oxides as well as directly to anode surfaces in microbial fuel cells. More broadly, Shewanellae are recognized free-living microorganisms and members of microbial communities involved in the decomposition of organic matter and the cycling of elements in aquatic and sedimentary systems. To function and compete in environments that are subject to spatial and temporal environmental change, Shewanella must be able to sense and respond to such changes and therefore require relatively robust sensing and regulation systems. The overall goal of this project is to apply the tools of genomics, leveraging the availability of genome sequence for 18 additional strains of Shewanella, to better understand the ecophysiology and speciation of respiratory-versatile members of this important genus. To understand these systems we propose to use genome-based approaches to investigate Shewanella as a system of integrated networks; first describing key cellular subsystems - those involved in signal transduction, regulation, and metabolism - then building towards understanding the function of whole cells and, eventually, cells within populations. 
As a general approach, this project will employ complementary "top-down" - bioinformatics-based genome functional predictions, high

  18. Genomic resources for gene discovery, functional genome annotation, and evolutionary studies of maize and its close relatives.

    Science.gov (United States)

    Wang, Chao; Shi, Xue; Liu, Lin; Li, Haiyan; Ammiraju, Jetty S S; Kudrna, David A; Xiong, Wentao; Wang, Hao; Dai, Zhaozhao; Zheng, Yonglian; Lai, Jinsheng; Jin, Weiwei; Messing, Joachim; Bennetzen, Jeffrey L; Wing, Rod A; Luo, Meizhong

    2013-11-01

    Maize is one of the most important food crops and a key model for genetics and developmental biology. A genetically anchored and high-quality draft genome sequence of maize inbred B73 has been obtained to serve as a reference sequence. To facilitate evolutionary studies in maize and its close relatives, much like the Oryza Map Alignment Project (OMAP) (www.OMAP.org) bacterial artificial chromosome (BAC) resource did for the rice community, we constructed BAC libraries for maize inbred lines Zheng58, Chang7-2, and Mo17 and maize wild relatives Zea mays ssp. parviglumis and Tripsacum dactyloides. Furthermore, to extend functional genomic studies to maize and sorghum, we also constructed binary BAC (BIBAC) libraries for the maize inbred B73 and the sorghum landrace Nengsi-1. The BAC/BIBAC vectors facilitate transfer of large intact DNA inserts from BAC clones to the BIBAC vector and functional complementation of large DNA fragments. These seven Zea Map Alignment Project (ZMAP) BAC/BIBAC libraries have average insert sizes ranging from 92 to 148 kb, organellar DNA from 0.17 to 2.3%, empty vector rates between 0.35 and 5.56%, and genome equivalents of 4.7- to 8.4-fold. The usefulness of the Parviglumis and Tripsacum BAC libraries was demonstrated by mapping clones to the reference genome. Novel genes and alleles present in these ZMAP libraries can now be used for functional complementation studies and positional or homology-based cloning of genes for translational genomics.

  19. Designer genes. Recombinant antibody fragments for biological imaging

    Energy Technology Data Exchange (ETDEWEB)

    Wu, A.M.; Yazaki, P.J. [Beckman Research Institute of the City of Hope, Duarte, CA (United States). Dept. of Molecular Biology

    2000-09-01

    Monoclonal antibodies (MAbs), with high specificity and high affinity for their target antigens, can be utilized for delivery of agents such as radionuclides, enzymes, drugs or toxins in vivo. However, the implementation of radiolabeled antibodies as magic bullets for detection and treatment of diseases such as cancer has required addressing several shortcomings of murine MAbs. These include their immunogenicity, sub-optimal targeting and pharmacokinetic properties, and practical issues of production and radiolabeling. Genetic engineering provides a powerful approach for redesigning antibodies for use in oncologic applications in vivo. Recombinant fragments have been produced that retain high affinity for target antigens, and display a combination of rapid, high-level tumor targeting with concomitant clearance from normal tissues and the circulation in animal models. An important first step was cloning and engineering of antibody heavy and light chain variable domains into single-chain Fvs (molecular weight, 25-17 kDa), in which the variable regions are joined via a synthetic linker peptide sequence. Although scFvs themselves showed limited tumor uptake in preclinical and clinical studies, they provide a useful building block for intermediate sized recombinant fragments. Covalently linked dimers or non-covalent dimers of scFvs (also known as diabodies) show improved targeting and clearance properties due to their higher molecular weight (55 kDa) and increased avidity. Further gains can be made by generation of larger recombinant fragments, such as the minibody, an scFv-CH3 fusion protein that self-assembles into a bivalent dimer of 80 kDa. A systematic evaluation of scFv, diabody, minibody, and intact antibody (based on comparison of tumor uptakes, tumor:blood activity ratios, and calculation of an Imaging Figure of Merit) can form the basis for selection of combinations of recombinant fragments and radionuclides for imaging applications. 
Ease of engineering

  20. Designer genes. Recombinant antibody fragments for biological imaging

    International Nuclear Information System (INIS)

    Wu, A.M.; Yazaki, P.J.

    2000-01-01

    Monoclonal antibodies (MAbs), with high specificity and high affinity for their target antigens, can be utilized for delivery of agents such as radionuclides, enzymes, drugs or toxins in vivo. However, the implementation of radiolabeled antibodies as magic bullets for detection and treatment of diseases such as cancer has required addressing several shortcomings of murine MAbs. These include their immunogenicity, sub-optimal targeting and pharmacokinetic properties, and practical issues of production and radiolabeling. Genetic engineering provides a powerful approach for redesigning antibodies for use in oncologic applications in vivo. Recombinant fragments have been produced that retain high affinity for target antigens, and display a combination of rapid, high-level tumor targeting with concomitant clearance from normal tissues and the circulation in animal models. An important first step was cloning and engineering of antibody heavy and light chain variable domains into single-chain Fvs (molecular weight, 25-17 kDa), in which the variable regions are joined via a synthetic linker peptide sequence. Although scFvs themselves showed limited tumor uptake in preclinical and clinical studies, they provide a useful building block for intermediate sized recombinant fragments. Covalently linked dimers or non-covalent dimers of scFvs (also known as diabodies) show improved targeting and clearance properties due to their higher molecular weight (55 kDa) and increased avidity. Further gains can be made by generation of larger recombinant fragments, such as the minibody, an scFv-CH3 fusion protein that self-assembles into a bivalent dimer of 80 kDa. A systematic evaluation of scFv, diabody, minibody, and intact antibody (based on comparison of tumor uptakes, tumor:blood activity ratios, and calculation of an Imaging Figure of Merit) can form the basis for selection of combinations of recombinant fragments and radionuclides for imaging applications. Ease of engineering and

  1. Medicinal chemistry inspired fragment-based drug discovery.

    Science.gov (United States)

    Lanter, James; Zhang, Xuqing; Sui, Zhihua

    2011-01-01

    Lead generation can be a very challenging phase of the drug discovery process. The two principal methods for this stage of research are blind screening and rational design. Among the rational or semirational design approaches, fragment-based drug discovery (FBDD) has emerged as a useful tool for the generation of lead structures. It is particularly powerful as a complement to high-throughput screening approaches when the latter failed to yield viable hits for further development. Engagement of medicinal chemists early in the process can accelerate the progression of FBDD efforts by incorporating drug-friendly properties in the earliest stages of the design process. Medium-chain acyl-CoA synthetase 2b and ketohexokinase are chosen as examples to illustrate the importance of close collaboration of medicinal chemists, crystallography, and modeling. Copyright © 2011 Elsevier Inc. All rights reserved.

  2. Genomic localization, sequence analysis, and transcription of the putative human cytomegalovirus DNA polymerase gene

    International Nuclear Information System (INIS)

    Heilbronn, T.; Jahn, G.; Buerkle, A.; Freese, U.K.; Fleckenstein, B.; Zur Hausen, H.

    1987-01-01

    The human cytomegalovirus (HCMV)-induced DNA polymerase has been well characterized biochemically and functionally, but its genomic location has not yet been assigned. To identify the coding sequence, cross-hybridization with the herpes simplex virus type 1 (HSV-1) polymerase gene was used, as suggested by the close similarity of the herpes group virus-induced DNA polymerases to the HCMV DNA polymerase. A cosmid and plasmid library of the entire HCMV genome was screened with the BamHI Q fragment of HSV-1 at different stringency conditions. One PstI-HincII restriction fragment of 850 base pairs mapping within the EcoRI M fragment of HCMV cross-hybridized at Tm - 25 °C. Sequence analysis revealed one open reading frame spanning the entire sequence. The amino acid sequence showed a highly conserved domain of 133 amino acids shared with the HSV and putative Epstein-Barr virus polymerase sequences. This domain maps within the C-terminal part of the HSV polymerase gene, which has been suggested to contain part of the catalytic center of the enzyme. Transcription analysis revealed one 5.4-kilobase early transcript in the sense orientation with respect to the open reading frame identified. This transcript appears to code for the 140-kilodalton HCMV polymerase protein.

  3. Restriction map of the single-stranded DNA genome of Kilham rat virus strain 171, a nondefective parvovirus

    International Nuclear Information System (INIS)

    Banerjee, P.T.; Rathrock, R.; Mitra, S.

    1981-01-01

    A physical map of Kilham rat virus strain 171 DNA was constructed by analyzing the sizes and locations of restriction endonuclease-generated fragments of the replicative-form viral DNA synthesized in vitro. BglI, KpnI, BamHI, SmaI, XhoI, and XorII did not appear to have any cleavage sites, whereas 11 other enzymes cleaved the genome at one to eight sites, and AluI generated more than 12 distinct fragments. The 30 restriction sites that were mapped were distributed randomly in the viral genome. A comparison of the restriction fragments of in vivo- and in vitro-replicated replicative-form DNAs showed that these DNAs were identical except in the size or configuration of the terminal fragments.

  4. Binding thermodynamics discriminates fragments from druglike compounds: a thermodynamic description of fragment-based drug discovery.

    Science.gov (United States)

    Williams, Glyn; Ferenczy, György G; Ulander, Johan; Keserű, György M

    2017-04-01

    Small is beautiful - reducing the size and complexity of chemical starting points for drug design allows better sampling of chemical space, reveals the most energetically important interactions within protein-binding sites and can lead to improvements in the physicochemical properties of the final drug. The impact of fragment-based drug discovery (FBDD) on recent drug discovery projects and our improved knowledge of the structural and thermodynamic details of ligand binding has prompted us to explore the relationships between ligand-binding thermodynamics and FBDD. Information on binding thermodynamics can give insights into the contributions to protein-ligand interactions and could therefore be used to prioritise compounds with a high degree of specificity in forming key interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.

  5. Comparative Genomics of Carp Herpesviruses

    Science.gov (United States)

    Kurobe, Tomofumi; Gatherer, Derek; Cunningham, Charles; Korf, Ian; Fukuda, Hideo; Hedrick, Ronald P.; Waltzek, Thomas B.

    2013-01-01

    Three alloherpesviruses are known to cause disease in cyprinid fish: cyprinid herpesviruses 1 and 3 (CyHV1 and CyHV3) in common carp and koi and cyprinid herpesvirus 2 (CyHV2) in goldfish. We have determined the genome sequences of CyHV1 and CyHV2 and compared them with the published CyHV3 sequence. The CyHV1 and CyHV2 genomes are 291,144 and 290,304 bp, respectively, in size, and thus the CyHV3 genome, at 295,146 bp, remains the largest recorded among the herpesviruses. Each of the three genomes consists of a unique region flanked at each terminus by a sizeable direct repeat. The CyHV1, CyHV2, and CyHV3 genomes are predicted to contain 137, 150, and 155 unique, functional protein-coding genes, respectively, of which six, four, and eight, respectively, are duplicated in the terminal repeat. The three viruses share 120 orthologous genes in a largely colinear arrangement, of which up to 55 are also conserved in the other member of the genus Cyprinivirus, anguillid herpesvirus 1. Twelve genes are conserved convincingly in all sequenced alloherpesviruses, and two others are conserved marginally. The reference CyHV3 strain has been reported to contain five fragmented genes that are presumably nonfunctional. The CyHV2 strain has two fragmented genes, and the CyHV1 strain has none. CyHV1, CyHV2, and CyHV3 have five, six, and five families of paralogous genes, respectively. One family unique to CyHV1 is related to cellular JUNB, which encodes a transcription factor involved in oncogenesis. To our knowledge, this is the first time that JUNB-related sequences have been reported in a herpesvirus. PMID:23269803

  6. A BAC-based physical map of the Drosophila buzzatii genome

    Energy Technology Data Exchange (ETDEWEB)

    Gonzalez, Josefa; Nefedov, Michael; Bosdet, Ian; Casals, Ferran; Calvete, Oriol; Delprat, Alejandra; Shin, Heesun; Chiu, Readman; Mathewson, Carrie; Wye, Natasja; Hoskins, Roger A.; Schein, Jacqueline E.; de Jong, Pieter; Ruiz, Alfredo

    2005-03-18

    Large-insert genomic libraries facilitate cloning of large genomic regions, allow the construction of clone-based physical maps and provide useful resources for sequencing entire genomes. Drosophila buzzatii is a representative species of the repleta group in the Drosophila subgenus, which is being widely used as a model in studies of genome evolution, ecological adaptation and speciation. We constructed a Bacterial Artificial Chromosome (BAC) genomic library of D. buzzatii using the shuttle vector pTARBAC2.1. The library comprises 18,353 clones with an average insert size of 152 kb and a ~18X expected representation of the D. buzzatii euchromatic genome. We screened the entire library with six euchromatic gene probes and estimated the actual genome representation to be ~23X. In addition, we fingerprinted by restriction digestion and agarose gel electrophoresis a sample of 9,555 clones, and assembled them using Finger Printed Contigs (FPC) software and manual editing into 345 contigs (mean of 26 clones per contig) and 670 singletons. Finally, we anchored 181 large contigs (containing 7,788 clones) to the D. buzzatii salivary gland polytene chromosomes by in situ hybridization of 427 representative clones. The BAC library and a database with all the information regarding the high coverage BAC-based physical map described in this paper are available to the research community.
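
    The expected representation quoted in this abstract follows from simple arithmetic: number of clones times mean insert size, divided by genome size. The sketch below reproduces the ~18X figure and adds the standard Clarke-Carbon estimate of the chance that a given locus is present in the library. The function names are hypothetical, and the euchromatic genome size of 155,000 kb is an assumption back-calculated from the abstract's figures; the abstract itself does not state it.

```python
def library_coverage(n_clones, mean_insert_kb, genome_size_kb):
    """Expected genome equivalents: N * I / G."""
    return n_clones * mean_insert_kb / genome_size_kb


def prob_locus_present(n_clones, mean_insert_kb, genome_size_kb):
    """Clarke-Carbon estimate: P = 1 - (1 - I/G)^N."""
    return 1.0 - (1.0 - mean_insert_kb / genome_size_kb) ** n_clones


N_CLONES = 18353          # from the abstract
MEAN_INSERT_KB = 152      # from the abstract
GENOME_KB = 155_000       # ASSUMPTION: not given in the abstract

coverage = library_coverage(N_CLONES, MEAN_INSERT_KB, GENOME_KB)
p_present = prob_locus_present(N_CLONES, MEAN_INSERT_KB, GENOME_KB)
print(f"expected coverage: {coverage:.1f}X")
print(f"P(locus present): {p_present:.6f}")
```

    The empirical ~23X estimate from probe screening differs from the expected value, which is one reason the genome size used here should be read as a rough placeholder rather than a measured quantity.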

  7. Evolutionary analysis of whole-genome sequences confirms inter-farm transmission of Aleutian mink disease virus

    DEFF Research Database (Denmark)

    Hagberg, Emma Elisabeth; Pedersen, Anders Gorm; Larsen, Lars E

    2017-01-01

    Aleutian mink disease virus (AMDV) is a frequently encountered pathogen associated with mink farming. Previous phylogenetic analyses of AMDV have been based on shorter and more conserved parts of the genome, e.g. the partial NS1 gene. Such fragments are suitable for detection but are less useful...... direction of spread. It was however impossible to infer transmission pathways from the partial NS1 gene tree, since all samples from the case farms branched out from a single internal node. A sliding window analysis showed that there were no shorter genomic regions providing the same phylogenetic resolution...

  8. Flexibility and symmetry of prokaryotic genome rearrangement reveal lineage-associated core-gene-defined genome organizational frameworks.

    Science.gov (United States)

    Kang, Yu; Gu, Chaohao; Yuan, Lina; Wang, Yue; Zhu, Yanmin; Li, Xinna; Luo, Qibin; Xiao, Jingfa; Jiang, Daquan; Qian, Minping; Ahmed Khan, Aftab; Chen, Fei; Zhang, Zhang; Yu, Jun

    2014-11-25

    among isolates but also functionally essential for a given species and to further evaluate the stability or flexibility of such genome structures across lineages are of importance. Based on a large number of multi-isolate pangenomic data, our analysis reveals that a subset of core genes is organized into a core-gene-defined genome organizational framework, or cGOF. Furthermore, the lineage-associated cGOFs among Gram-positive and Gram-negative bacteria behave differently: the former, composed of 2 to 4 segments, have their fragments symmetrically rearranged around the origin-terminus axis, whereas the latter show more complex segmentation and are partitioned asymmetrically into chromosomal structures. The definition of cGOFs provides new insights into prokaryotic genome organization and efficient guidance for genome assembly and analysis. Copyright © 2014 Kang et al.

  9. Genome-Based Microbial Taxonomy Coming of Age.

    Science.gov (United States)

    Hugenholtz, Philip; Skarshewski, Adam; Parks, Donovan H

    2016-06-01

    Reconstructing the complete evolutionary history of extant life on our planet will be one of the most fundamental accomplishments of scientific endeavor, akin to the completion of the periodic table, which revolutionized chemistry. The road to this goal is via comparative genomics because genomes are our most comprehensive and objective evolutionary documents. The genomes of plant and animal species have been systematically targeted over the past decade to provide coverage of the tree of life. However, multicellular organisms only emerged in the last 550 million years of more than three billion years of biological evolution and thus comprise a small fraction of total biological diversity. The bulk of biodiversity, both past and present, is microbial. We have only scratched the surface in our understanding of the microbial world, as most microorganisms cannot be readily grown in the laboratory and remain unknown to science. Ground-breaking, culture-independent molecular techniques developed over the past 30 years have opened the door to this so-called microbial dark matter with an accelerating momentum driven by exponential increases in sequencing capacity. We are on the verge of obtaining representative genomes across all life for the first time. However, historical use of morphology, biochemical properties, behavioral traits, and single-marker genes to infer organismal relationships mean that the existing highly incomplete tree is riddled with taxonomic errors. Concerted efforts are now needed to synthesize and integrate the burgeoning genomic data resources into a coherent universal tree of life and genome-based taxonomy. Copyright © 2016 Cold Spring Harbor Laboratory Press; all rights reserved.

  10. Sequence based polymorphic (SBP) marker technology for targeted genomic regions: its application in generating a molecular map of the Arabidopsis thaliana genome

    Directory of Open Access Journals (Sweden)

    Sahu Binod B

    2012-01-01

    Background: Molecular markers facilitate both genotype identification, essential for modern animal and plant breeding, and the isolation of genes based on their map positions. Advancements in sequencing technology have made possible the identification of single nucleotide polymorphisms (SNPs) for any genomic regions. Here a sequence based polymorphic (SBP) marker technology for generating molecular markers for targeted genomic regions in Arabidopsis is described. Results: A ~3X genome coverage sequence of the Arabidopsis thaliana ecotype Niederzenz (Nd-0) was obtained by applying Illumina's sequencing by synthesis (Solexa) technology. Comparison of the Nd-0 genome sequence with the assembled Columbia-0 (Col-0) genome sequence identified putative single nucleotide polymorphisms (SNPs) throughout the entire genome. Multiple 75 base pair Nd-0 sequence reads containing SNPs and originating from individual genomic DNA molecules were the basis for developing co-dominant SBP markers. SNPs containing Col-0 sequences, supported by transcript sequences or sequences from multiple BAC clones, were compared to the respective Nd-0 sequences to identify possible restriction endonuclease enzyme site variations. Small amplicons, PCR amplified from both ecotypes, were digested with suitable restriction enzymes and resolved on a gel to reveal the sequence based polymorphisms. By applying this technology, 21 SBP markers for the marker poor regions of the Arabidopsis map representing polymorphisms between Col-0 and Nd-0 ecotypes were generated. Conclusions: The SBP marker technology described here allowed the development of molecular markers for targeted genomic regions of Arabidopsis. It should facilitate isolation of co-dominant molecular markers for targeted genomic regions of any animal or plant species, whose genomic sequences have been assembled. 
This technology will particularly facilitate the development of high density molecular marker maps, essential for

  11. Discovery of potent, reversible MetAP2 inhibitors via fragment based drug discovery and structure based drug design-Part 2.

    Science.gov (United States)

    McBride, Christopher; Cheruvallath, Zacharia; Komandla, Mallareddy; Tang, Mingnam; Farrell, Pamela; Lawson, J David; Vanderpool, Darin; Wu, Yiqin; Dougan, Douglas R; Plonowski, Artur; Holub, Corine; Larson, Chris

    2016-06-15

    Methionine aminopeptidase-2 (MetAP2) is an enzyme that cleaves an N-terminal methionine residue from a number of newly synthesized proteins. This step is required before they will fold or function correctly. Pre-clinical and clinical studies with a MetAP2 inhibitor suggest that they could be used as a novel treatment for obesity. Herein we describe the discovery of a series of pyrazolo[4,3-b]indoles as reversible MetAP2 inhibitors. A fragment-based drug discovery (FBDD) approach was used, beginning with the screening of fragment libraries to generate hits with high ligand-efficiency (LE). An indazole core was selected for further elaboration, guided by structural information. SAR from the indazole series led to the design of a pyrazolo[4,3-b]indole core and accelerated knowledge-based fragment growth resulted in potent and efficient MetAP2 inhibitors, which have shown robust and sustainable body weight loss in DIO mice when dosed orally. Copyright © 2016 Elsevier Ltd. All rights reserved.

  12. Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins

    Directory of Open Access Journals (Sweden)

    Vassilev Boris

    2010-04-01

    Full Text Available Abstract Background A large proportion of an organism's genome encodes membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology. Results We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain 213 statistically significant, non-redundant, and novel motifs that are highly specific to α-helical transmembrane proteins. Of these 213 motifs, 58 were assigned to function and checked in the scientific literature for a biological assessment. Seventy percent of the motifs are found in co-factor, ligand, and ion binding sites, 30% at protein interaction interfaces, and 12% bind specific lipids such as glycerol or cardiolipins. The vast majority of motifs (94%) appear across evolutionarily unrelated families, highlighting the modularity of functional design in membrane proteins. We describe three novel motifs in detail: (1) a dimer interface motif found in voltage-gated chloride channels, (2) a proton transfer motif found in heme-copper oxidases, and (3) a convergently evolved interface helix motif found in an aspartate symporter, a serine protease, and cytochrome b. Conclusions Our findings suggest that functional modules exist in membrane proteins, and that they occur in completely different evolutionary contexts and cover different binding sites. Structural fragment clustering allows us to link sequence motifs to function through clusters of structural fragments. The sequence motifs can be applied to identify and characterize membrane proteins in novel genomes.

  13. Small molecules enhance CRISPR genome editing in pluripotent stem cells.

    Science.gov (United States)

    Yu, Chen; Liu, Yanxia; Ma, Tianhua; Liu, Kai; Xu, Shaohua; Zhang, Yu; Liu, Honglei; La Russa, Marie; Xie, Min; Ding, Sheng; Qi, Lei S

    2015-02-05

    The bacterial CRISPR-Cas9 system has emerged as an effective tool for sequence-specific gene knockout through non-homologous end joining (NHEJ), but it remains inefficient for precise editing of genome sequences. Here we develop a reporter-based screening approach for high-throughput identification of chemical compounds that can modulate precise genome editing through homology-directed repair (HDR). Using our screening method, we have identified small molecules that can enhance CRISPR-mediated HDR efficiency, 3-fold for large fragment insertions and 9-fold for point mutations. Interestingly, we have also observed that a small molecule that inhibits HDR can enhance frame shift insertion and deletion (indel) mutations mediated by NHEJ. The identified small molecules function robustly in diverse cell types with minimal toxicity. The use of small molecules provides a simple and effective strategy to enhance precise genome engineering applications and facilitates the study of DNA repair mechanisms in mammalian cells. Copyright © 2015 Elsevier Inc. All rights reserved.

  14. Route to three-dimensional fragments using diversity-oriented synthesis.

    Science.gov (United States)

    Hung, Alvin W; Ramek, Alex; Wang, Yikai; Kaya, Taner; Wilson, J Anthony; Clemons, Paul A; Young, Damian W

    2011-04-26

    Fragment-based drug discovery (FBDD) has proven to be an effective means of producing high-quality chemical ligands as starting points for drug-discovery pursuits. The increasing number of clinical candidate drugs developed using FBDD approaches is a testament to the efficacy of this approach. The success of fragment-based methods is highly dependent on the identity of the fragment library used for screening. The vast majority of FBDD has centered on the use of sp(2)-rich aromatic compounds. An expanded set of fragments that possess more 3D character would provide access to a larger chemical space of fragments than those currently used. Diversity-oriented synthesis (DOS) aims to efficiently generate a set of molecules diverse in skeletal and stereochemical properties. Molecules derived from DOS have also displayed significant success in the modulation of function of various "difficult" targets. Herein, we describe the application of DOS toward the construction of a unique set of fragments containing highly sp(3)-rich skeletons for fragment-based screening. Using cheminformatic analysis, we quantified the shapes and physical properties of the new 3D fragments and compared them with a database containing known fragment-like molecules.

  15. RESEARCH NOTE Genome-based exome-sequencing analysis ...

    Indian Academy of Sciences (India)

    Navya

    2017-02-22

    Feb 22, 2017 ... Genome-based exome-sequencing analysis identifies GYG1, DIS3L, DDRGK1 genes ... Cardiology Division, Department of Internal Medicine, Severance .... with p values of <0.05 by analyzing differences in allele distribution.

  16. Construction of a 3D-shaped, natural product like fragment library by fragmentation and diversification of natural products.

    Science.gov (United States)

    Prescher, Horst; Koch, Guido; Schuhmann, Tim; Ertl, Peter; Bussenault, Alex; Glick, Meir; Dix, Ina; Petersen, Frank; Lizos, Dimitrios E

    2017-02-01

    A fragment library consisting of 3D-shaped, natural product-like fragments was assembled. Library construction was mainly performed by natural product degradation and natural product diversification reactions and was complemented by the identification of 3D-shaped, natural product-like fragments available from commercial sources. In addition, during the course of these studies, novel rearrangements were discovered for Massarigenin C and Cytochalasin E. The obtained fragment library has an excellent 3D-shape and natural product likeness, covering a novel, unexplored and underrepresented chemical space in fragment-based drug discovery (FBDD). Copyright © 2016 Elsevier Ltd. All rights reserved.

  17. CrusView: a Java-based visualization platform for comparative genomics analyses in Brassicaceae species.

    Science.gov (United States)

    Chen, Hao; Wang, Xiangfeng

    2013-09-01

    In plants and animals, chromosomal breakage and fusion events based on conserved syntenic genomic blocks lead to conserved patterns of karyotype evolution among species of the same family. However, karyotype information has not been well utilized in genomic comparison studies. We present CrusView, a Java-based bioinformatic application utilizing Standard Widget Toolkit/Swing graphics libraries and a SQLite database for performing visualized analyses of comparative genomics data in Brassicaceae (crucifer) plants. Compared with similar software and databases, one of the unique features of CrusView is its integration of karyotype information when comparing two genomes. This feature allows users to perform karyotype-based genome assembly and karyotype-assisted genome synteny analyses with preset karyotype patterns of the Brassicaceae genomes. Additionally, CrusView is a local program, which gives its users high flexibility when analyzing unpublished genomes and allows users to upload self-defined genomic information so that they can visually study the associations between genome structural variations and genetic elements, including chromosomal rearrangements, genomic macrosynteny, gene families, high-frequency recombination sites, and tandem and segmental duplications between related species. This tool will greatly facilitate karyotype, chromosome, and genome evolution studies using visualized comparative genomics approaches in Brassicaceae species. CrusView is freely available at http://www.cmbb.arizona.edu/CrusView/.

  18. Post processing of protein-compound docking for fragment-based drug discovery (FBDD): in-silico structure-based drug screening and ligand-binding pose prediction.

    Science.gov (United States)

    Fukunishi, Yoshifumi

    2010-01-01

    For fragment-based drug development, both hit (active) compound prediction and docking-pose (protein-ligand complex structure) prediction of the hit compound are important, since chemical modification (fragment linking, fragment evolution) subsequent to the hit discovery must be performed based on the protein-ligand complex structure. However, the naïve protein-compound docking calculation shows poor accuracy in terms of docking-pose prediction. Thus, post-processing of the protein-compound docking is necessary. Recently, several methods for the post-processing of protein-compound docking have been proposed. In FBDD, the compounds are smaller than those for conventional drug screening. This makes it difficult to perform the protein-compound docking calculation. A method to avoid this problem has been reported. Protein-ligand binding free energy estimation is useful to reduce the procedures involved in the chemical modification of the hit fragment. Several prediction methods have been proposed for high-accuracy estimation of protein-ligand binding free energy. This paper summarizes the various computational methods proposed for docking-pose prediction and their usefulness in FBDD.

  19. Reframing landscape fragmentation's effects on ecosystem services.

    Science.gov (United States)

    Mitchell, Matthew G E; Suarez-Castro, Andrés F; Martinez-Harms, Maria; Maron, Martine; McAlpine, Clive; Gaston, Kevin J; Johansen, Kasper; Rhodes, Jonathan R

    2015-04-01

    Landscape structure and fragmentation have important effects on ecosystem services, with a common assumption being that fragmentation reduces service provision. This is based on fragmentation's expected effects on ecosystem service supply, but ignores how fragmentation influences the flow of services to people. Here we develop a new conceptual framework that explicitly considers the links between landscape fragmentation, the supply of services, and the flow of services to people. We argue that fragmentation's effects on ecosystem service flow can be positive or negative, and use our framework to construct testable hypotheses about the effects of fragmentation on final ecosystem service provision. Empirical efforts to apply and test this framework are critical to improving landscape management for multiple ecosystem services. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. Genome-Wide Analysis of Microsatellite Markers Based on Sequenced Database in Chinese Spring Wheat (Triticum aestivum L.).

    Directory of Open Access Journals (Sweden)

    Bin Han

    Full Text Available Microsatellites or simple sequence repeats (SSRs) are distributed across both prokaryotic and eukaryotic genomes and have been widely used for genetic studies and molecular marker-assisted breeding in crops. Although an ordered draft sequence of hexaploid bread wheat has been announced, a systematic analysis of SSRs in wheat has not yet been reported. In the present study, we identified 364,347 SSRs from among 10,603,760 sequences of the Chinese spring wheat (CSW) genome, which were present at a density of 36.68 SSR/Mb. In total, we detected 488 types of motifs ranging from di- to hexanucleotides, among which dinucleotide repeats dominated, accounting for approximately 42.52% of the genome. The density of tri- to hexanucleotide repeats was 24.97%, 4.62%, 3.25% and 24.65%, respectively. AG/CT, AAG/CTT, AGAT/ATCT, AAAAG/CTTTT and AAAATT/AATTTT were the most frequent repeats among di- to hexanucleotide repeats. Among the 21 chromosomes of CSW, the density of repeats was highest on chromosome 2D and lowest on chromosome 3A. The proportions of di-, tri-, tetra-, penta- and hexanucleotide repeats on each chromosome, and even on the whole genome, were almost identical. In addition, 295,267 SSR markers were successfully developed from the 21 chromosomes of CSW, which cover the entire genome at a density of 29.73 per Mb. All of the SSR markers were validated by reverse electronic-Polymerase Chain Reaction (re-PCR); 70,564 (23.9%) were found to be monomorphic and 224,703 (76.1%) were found to be polymorphic. A total of 45 monomorphic markers were selected randomly for validation purposes; 24 (53.3%) amplified one locus, 8 (17.8%) amplified multiple identical loci, and 13 (28.9%) did not amplify any fragments from the genomic DNA of CSW. Then a dendrogram was generated based on the 24 monomorphic SSR markers among 20 wheat cultivars and three species of its diploid ancestors showing that monomorphic SSR markers represented a promising

  1. PCR-based detection of a rare linear DNA in cell culture

    Directory of Open Access Journals (Sweden)

    Saveliev Sergei V.

    2002-01-01

    Full Text Available The described method allows for detection of rare linear DNA fragments generated during genomic deletions. The predicted limit of the detection is one DNA molecule per 10^7 or more cells. The method is based on anchor PCR and involves gel separation of the linear DNA fragment and chromosomal DNA before amplification. The detailed chemical structure of the ends of the linear DNA can be defined with the use of additional PCR-based protocols. The method was applied to study the short-lived linear DNA generated during programmed genomic deletions in a ciliate. It can be useful in studies of spontaneous DNA deletions in cell culture or for tracking intracellular modifications at the ends of transfected DNA during gene therapy trials.

  2. PCR-based detection of a rare linear DNA in cell culture.

    Science.gov (United States)

    Saveliev, Sergei V.

    2002-11-11

    The described method allows for detection of rare linear DNA fragments generated during genomic deletions. The predicted limit of the detection is one DNA molecule per 10(7) or more cells. The method is based on anchor PCR and involves gel separation of the linear DNA fragment and chromosomal DNA before amplification. The detailed chemical structure of the ends of the linear DNA can be defined with the use of additional PCR-based protocols. The method was applied to study the short-lived linear DNA generated during programmed genomic deletions in a ciliate. It can be useful in studies of spontaneous DNA deletions in cell culture or for tracking intracellular modifications at the ends of transfected DNA during gene therapy trials.

  3. Double-strand breaks in genome-sized DNA caused by mechanical stress under mixing: Quantitative evaluation through single-molecule observation

    Science.gov (United States)

    Kikuchi, Hayato; Nose, Keiji; Yoshikawa, Yuko; Yoshikawa, Kenichi

    2018-06-01

    It is becoming increasingly apparent that changes in the higher-order structure of genome-sized DNA molecules larger than several tens of kbp play important roles in the self-control of genome activity in living cells. Unfortunately, it has been rather difficult to prepare genome-sized DNA molecules without damage or fragmentation. Here, we evaluated the degree of double-strand breaks (DSBs) caused by mechanical mixing by single-molecule observation with fluorescence microscopy. The results show that DNA breaks are most significant for the first second after the initiation of mechanical agitation. Based on this observation, we propose a novel mixing procedure to significantly decrease DSBs.

  4. SPR-based fragment screening with neurotensin receptor 1 generates novel small molecule ligands

    Science.gov (United States)

    Huber, Sylwia; Casagrande, Fabio; Hug, Melanie N.; Wang, Lisha; Heine, Philipp; Kummer, Lutz; Plückthun, Andreas; Hennig, Michael

    2017-01-01

    The neurotensin receptor 1 represents an important drug target involved in various diseases of the central nervous system. So far, the full exploitation of potential therapeutic activities has been compromised by the lack of compounds with favorable physicochemical and pharmacokinetic properties which efficiently penetrate the blood-brain barrier. Recent progress in the generation of stabilized variants of solubilized neurotensin receptor 1 and its subsequent purification and successful structure determination presents a solid starting point to apply the approach of fragment-based screening to extend the chemical space of known neurotensin receptor 1 ligands. In this report, surface plasmon resonance was used as primary method to screen 6369 compounds. Thereby 44 hits were identified and confirmed in competition as well as dose-response experiments. Furthermore, 4 out of 8 selected hits were validated using nuclear magnetic resonance spectroscopy as orthogonal biophysical method. Computational analysis of the compound structures, taking the known crystal structure of the endogenous peptide agonist into consideration, gave insight into the potential fragment-binding location and interactions and inspires chemistry efforts for further exploration of the fragments. PMID:28510609

  5. IDENTIFICATION OF AVIAN-SPECIFIC FECAL METAGENOMIC SEQUENCES USING GENOME FRAGMENT ENRICHMENTS

    Science.gov (United States)

    Sequence analysis of microbial genomes has provided biologists the opportunity to compare genetic differences between closely related microorganisms. While random sequencing has also been used to study natural microbial communities, metagenomic comparisons via sequencing analysis...

  6. Fragment-Based Discovery of Pyrimido[1,2-b]indazole PDE10A Inhibitors.

    Science.gov (United States)

    Chino, Ayaka; Seo, Ryushi; Amano, Yasushi; Namatame, Ichiji; Hamaguchi, Wataru; Honbou, Kazuya; Mihara, Takuma; Yamazaki, Mayako; Tomishima, Masaki; Masuda, Naoyuki

    2018-01-01

    In this study, we report the identification of potent pyrimidoindazoles as phosphodiesterase10A (PDE10A) inhibitors by using the method of fragment-based drug discovery (FBDD). The pyrazolopyridine derivative 2 was found to be a fragment hit compound which could occupy a part of the binding site of PDE10A enzyme by using the method of the X-ray co-crystal structure analysis. On the basis of the crystal structure of compound 2 and PDE10A protein, a number of compounds were synthesized and evaluated, by means of structure-activity relationship (SAR) studies, which culminated in the discovery of a novel pyrimidoindazole derivative 13 having good physicochemical properties.

  7. Genomics-based plant germplasm research (GPGR)

    Institute of Scientific and Technical Information of China (English)

    Jizeng Jia; Hongjie Li; Xueyong Zhang; Zichao Li; Lijuan Qiu

    2017-01-01

    Plant germplasm underpins much of crop genetic improvement. Millions of germplasm accessions have been collected and conserved ex situ and/or in situ, and the major challenge is now how to exploit and utilize this abundant resource. Genomics-based plant germplasm research (GPGR) or "Genoplasmics" is a novel cross-disciplinary research field that seeks to apply the principles and techniques of genomics to germplasm research. We describe in this paper the concept, strategy, and approach behind GPGR, and summarize current progress in the areas of the definition and construction of core collections, enhancement of germplasm with core collections, and gene discovery from core collections. GPGR is opening a new era in germplasm research. The contribution, progress and achievements of GPGR in the future are predicted.

  8. Optimizing virtual fragment screening for GPCRs: Identification of novel ligands for the histamine H3 receptor using ligand- and structure-based molecular fingerprints

    NARCIS (Netherlands)

    Sirci, F.; Istyastono, E.P.; Vischer, H.F.; Nijmeijer, S.; Kuijer, M.; Kooistra, A.J.; Wijtmans, M.; Mannhold, R.; Leurs, R.; de Esch, I.J.P.; de Graaf, C.

    2012-01-01

    Virtual fragment screening (VFS) is a promising new method that uses computer models to identify small, fragment-like biologically active molecules as useful starting points for fragment-based drug discovery (FBDD). Training sets of true active and inactive fragment-like molecules to construct and

  9. Analysis of DNA restriction fragments greater than 5.7 Mb in size from the centromeric region of human chromosomes.

    Science.gov (United States)

    Arn, P H; Li, X; Smith, C; Hsu, M; Schwartz, D C; Jabs, E W

    1991-01-01

    Pulsed electrophoresis was used to study the organization of the human centromeric region. Genomic DNA was digested with rare-cutting enzymes. DNA fragments from 0.2 to greater than 5.7 Mb were separated by electrophoresis and hybridized with alphoid and simple DNA repeats. Rare-cutting enzymes (Mlu I, Nar I, Not I, Nru I, Sal I, Sfi I, Sst II) demonstrated fewer restriction sites at centromeric regions than elsewhere in the genome. The enzyme Not I had the fewest restriction sites at centromeric regions. As much as 70% of these sequences from the centromeric region are present in Not I DNA fragments greater than 5.7 and estimated to be as large as 10 Mb in size. Other repetitive sequences such as short interspersed repeated segments (SINEs), long interspersed repeated segments (LINEs), ribosomal DNA, and mini-satellite DNA that are not enriched at the centromeric region, are not enriched in Not I fragments of greater than 5.7 Mb in size.

  10. A complete mitochondrial genome sequence of Asian black bear Sichuan subspecies (Ursus thibetanus mupinensis)

    Science.gov (United States)

    Hou, Wan-ru; Chen, Yu; Wu, Xia; Hu, Jin-chu; Peng, Zheng-song; Yang, Jung; Tang, Zong-xiang; Zhou, Cai-Quan; Li, Yu-ming; Yang, Shi-kui; Du, Yu-jie; Kong, Ling-lu; Ren, Zheng-long; Zhang, Huai-yu; Shuai, Su-rong

    2007-01-01

    We obtained the complete mitochondrial genome of U. thibetanus mupinensis by DNA sequencing based on the PCR fragments of 18 primers we designed. The results indicate that the mtDNA is 16 868 bp in size, encodes 13 protein genes, 22 tRNA genes, and 2 rRNA genes, with an overall H-strand base composition of 31.2% A, 25.4% C, 15.5% G and 27.9% T. The control region (CR), located between tRNA-Pro and tRNA-Phe, is 1422 bp in size, constitutes 8.43% of the whole genome, has a GC content of 51.9%, and contains one 6 bp tandem repeat and two 10 bp tandem repeats identified using Tandem Repeats Finder. The U. thibetanus mupinensis mitochondrial genome shares high similarity with those of three other Ursidae: U. americanus (91.46%), U. arctos (89.25%) and U. maritimus (87.66%). PMID:17205108
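
    The figures quoted in this abstract can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the cited work: the constant and function names are made up for this example, and the numbers are copied from the abstract as given.

```python
# Figures as quoted in the abstract above (names are illustrative only).
GENOME_BP = 16868          # total mtDNA length in base pairs
CONTROL_REGION_BP = 1422   # control region length in base pairs
H_STRAND_PERCENT = {"A": 31.2, "C": 25.4, "G": 15.5, "T": 27.9}


def percent_of_genome(region_bp: int, genome_bp: int) -> float:
    """Return the share of the genome occupied by a region, in percent."""
    return 100.0 * region_bp / genome_bp


# Share of the genome taken up by the control region.
cr_percent = percent_of_genome(CONTROL_REGION_BP, GENOME_BP)

# The four base percentages should add up to (about) 100.
composition_total = sum(H_STRAND_PERCENT.values())

print(f"control region: {cr_percent:.2f}% of genome")
print(f"base composition total: {composition_total:.1f}%")
```

    Under these assumptions the control region share rounds to the 8.43% stated in the abstract, and the four base percentages add up to 100%.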

  11. phiGENOME: an integrative navigation throughout bacteriophage genomes.

    Science.gov (United States)

    Stano, Matej; Klucar, Lubos

    2011-11-01

    phiGENOME is a web-based genome browser generating dynamic and interactive graphical representation of phage genomes stored in the phiSITE, database of gene regulation in bacteriophages. phiGENOME is an integral part of the phiSITE web portal (http://www.phisite.org/phigenome) and it was optimised for visualisation of phage genomes with the emphasis on the gene regulatory elements. phiGENOME consists of three components: (i) genome map viewer built using Adobe Flash technology, providing dynamic and interactive graphical display of phage genomes; (ii) sequence browser based on precisely formatted HTML tags, providing detailed exploration of genome features on the sequence level and (iii) regulation illustrator, based on Scalable Vector Graphics (SVG) and designed for graphical representation of gene regulations. Bringing 542 complete genome sequences accompanied with their rich annotations and references, makes phiGENOME a unique information resource in the field of phage genomics. Copyright © 2011 Elsevier Inc. All rights reserved.

  12. Targeting lysine specific demethylase 4A (KDM4A) tandem TUDOR domain - A fragment based approach.

    Science.gov (United States)

    Upadhyay, Anup K; Judge, Russell A; Li, Leiming; Pithawalla, Ron; Simanis, Justin; Bodelle, Pierre M; Marin, Violeta L; Henry, Rodger F; Petros, Andrew M; Sun, Chaohong

    2018-06-01

    The tandem TUDOR domains present in the non-catalytic C-terminal half of the KDM4A, 4B and 4C enzymes play important roles in regulating their chromatin localizations and substrate specificities. They achieve this regulatory role by binding to different tri-methylated lysine residues on histone H3 (H3-K4me3, H3-K23me3) and histone H4 (H4-K20me3) depending upon the specific chromatin environment. In this work, we have used a 2D-NMR based fragment screening approach to identify a novel fragment (1a), which binds to the KDM4A-TUDOR domain and shows modest competition with H3-K4me3 binding in biochemical as well as in vitro cell based assays. A co-crystal structure of KDM4A TUDOR domain in complex with 1a shows that the fragment binds stereo-specifically to the methyl lysine binding pocket forming a network of strong hydrogen bonds and hydrophobic interactions. We anticipate that the fragment 1a can be further developed into a novel allosteric inhibitor of the KDM4 family of enzymes through targeting their C-terminal tandem TUDOR domain. Copyright © 2018 Elsevier Ltd. All rights reserved.

  13. In search of new lead compounds for trypanosomiasis drug design: A protein structure-based linked-fragment approach

    Science.gov (United States)

    Verlinde, Christophe L. M. J.; Rudenko, Gabrielle; Hol, Wim G. J.

    1992-04-01

    A modular method for pursuing structure-based inhibitor design in the framework of a design cycle is presented. The approach entails four stages: (1) a design pathway is defined in the three-dimensional structure of a target protein; (2) this pathway is divided into subregions; (3) complementary building blocks, also called fragments, are designed in each subregion; complementarity is defined in terms of shape, hydrophobicity, hydrogen bond properties and electrostatics; and (4) fragments from different subregions are linked into potential lead compounds. Stages (3) and (4) are qualitatively guided by force-field calculations. In addition, the designed fragments serve as entries for retrieving existing compounds from chemical databases. This linked-fragment approach has been applied in the design of potentially selective inhibitors of triosephosphate isomerase from Trypanosoma brucei, the causative agent of sleeping sickness.

  14. Diversity of chloroplast genome among local clones of cocoa (Theobroma cacao, L.) from Central Sulawesi

    Science.gov (United States)

    Suwastika, I. Nengah; Pakawaru, Nurul Aisyah; Rifka, Rahmansyah, Muslimin, Ishizaki, Yoko; Cruz, André Freire; Basri, Zainuddin; Shiina, Takashi

    2017-02-01

    Chloroplast genomes typically range in size from 120 to 170 kilobase pairs (kb) and are relatively conserved among plant species. Recent evaluations in several species have shown that certain unique regions are highly variable and can be used in phylogenetic analysis. Many fragments of coding regions, introns, and intergenic spacers, such as atpB-rbcL, ndhF, rbcL, rpl16, trnH-psbA, trnL-F, trnS-G, etc., have been used for phylogenetic reconstructions at various taxonomic levels. On that basis, we set out to analyze the diversity of the chloroplast genome within local cacao (Theobroma cacao L.) from Central Sulawesi. Our recent data showed that there are more than 20 clones from local farming in Central Sulawesi, which can be distinguished by phenotypic and nuclear-genome-based characterization with RAPD (Random Amplified Polymorphic DNA) and SSR (Simple Sequence Repeat) markers. In developing DNA markers for this local cacao, we also included analysis based on variation in the chloroplast genome. At least a few regions, such as rpl32-trnL, can be considered as chloroplast markers for our local clones of cocoa. Furthermore, we could develop phylogenetic analysis among clones of cocoa.

  15. The complete mitochondrial genome sequence of Eimeria magna (Apicomplexa: Coccidia).

    Science.gov (United States)

    Tian, Si-Qin; Cui, Ping; Fang, Su-Fang; Liu, Guo-Hua; Wang, Chun-Ren; Zhu, Xing-Quan

    2015-01-01

    In the present study, we determined the complete mitochondrial DNA (mtDNA) sequence of Eimeria magna from rabbits for the first time, and compared its gene content and genome organization with those of seven Eimeria spp. from domestic chickens. The size of the complete mt genome sequence of E. magna is 6249 bp, which consists of 3 protein-coding genes (cytb, cox1 and cox3), 12 gene fragments for the large subunit (LSU) rRNA, and 7 gene fragments for the small subunit (SSU) rRNA, without transfer RNA genes, in accordance with that of Eimeria spp. from chickens. The putative direction of translation for the three genes (cytb, cox1 and cox3) was the same as in Eimeria species from domestic chickens. The content of A + T is 65.16% for the E. magna mt genome (29.73% A, 35.43% T, 17.09% G and 17.75% C). The E. magna mt genome sequence provides novel mtDNA markers for studying the molecular epidemiology and population genetics of Eimeria spp. and has implications for the molecular diagnosis and control of rabbit coccidiosis.
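
    As a hedged illustration of how an A + T content figure like the one above is obtained, the sketch below counts bases in a sequence string and sums the A and T percentages. The function names and the toy sequence are hypothetical and are not taken from the cited study; only the four reported percentages come from the abstract.

```python
from collections import Counter


def base_percentages(sequence: str) -> dict:
    """Return the percentage of A, C, G and T among the unambiguous bases."""
    counts = Counter(base for base in sequence.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


def at_content(percentages: dict) -> float:
    """A + T content is the sum of the A and T percentages."""
    return percentages["A"] + percentages["T"]


# Toy sequence for illustration only (4 A, 4 T, 1 G, 1 C).
toy_sequence = "ATTATGCATA"
toy_at = at_content(base_percentages(toy_sequence))

# Percentages as reported in the abstract for the E. magna mt genome.
reported = {"A": 29.73, "T": 35.43, "G": 17.09, "C": 17.75}
reported_at = at_content(reported)

print(f"toy A+T: {toy_at:.1f}%")
print(f"reported A+T: {reported_at:.2f}%")
```

    With the reported percentages the sum reproduces the 65.16% A + T content quoted in the abstract; in practice the input would be the full genome sequence rather than a toy string.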

  16. Generation of a BAC-based physical map of the melon genome

    Directory of Open Access Journals (Sweden)

    Puigdomènech Pere

    2010-05-01

    Full Text Available Abstract Background Cucumis melo (melon) belongs to the Cucurbitaceae family, whose economic importance among horticulture crops is second only to Solanaceae. Melon has high intra-specific genetic variation, morphologic diversity and a small genome size (450 Mb), which make this species suitable for a great variety of molecular and genetic studies that can lead to the development of tools for breeding varieties of the species. A number of genetic and genomic resources have already been developed, such as several genetic maps and BAC genomic libraries. These tools are essential for the construction of a physical map, a valuable resource for map-based cloning, comparative genomics and assembly of whole genome sequencing data. However, no physical map of any Cucurbitaceae has yet been developed. A project has recently been started to sequence the complete melon genome following a whole-genome shotgun strategy, which makes use of massive sequencing data. A BAC-based melon physical map will be a useful tool to help assemble and refine the draft genome data that is being produced. Results A melon physical map was constructed using a 5.7 × BAC library and a genetic map previously developed in our laboratories. High-information-content fingerprinting (HICF) was carried out on 23,040 BAC clones, digesting with five restriction enzymes and SNaPshot labeling, followed by contig assembly with FPC software. The physical map has 1,355 contigs and 441 singletons, with an estimated physical length of 407 Mb (0.9 × coverage of the genome) and the longest contig being 3.2 Mb. The anchoring of 845 BAC clones to 178 genetic markers (100 RFLPs, 76 SNPs and 2 SSRs) also allowed the genetic positioning of 183 physical map contigs/singletons, representing 55 Mb (12% of the melon genome), to individual chromosomal loci. The melon FPC database is available for download at http://melonomics.upv.es/static/files/public/physical_map/. Conclusions Here we report the construction

  17. Fragment Linking and Optimization of Inhibitors of the Aspartic Protease Endothiapepsin: Fragment‐Based Drug Design Facilitated by Dynamic Combinatorial Chemistry

    Science.gov (United States)

    Mondal, Milon; Radeva, Nedyalka; Fanlo‐Virgós, Hugo; Otto, Sijbren; Klebe, Gerhard

    2016-01-01

    Fragment‐based drug design (FBDD) affords active compounds for biological targets. While there are numerous reports on FBDD by fragment growing/optimization, fragment linking has rarely been reported. Dynamic combinatorial chemistry (DCC) has become a powerful hit‐identification strategy for biological targets. We report the synergistic combination of fragment linking and DCC to identify inhibitors of the aspartic protease endothiapepsin. Based on X‐ray crystal structures of endothiapepsin in complex with fragments, we designed a library of bis‐acylhydrazones and used DCC to identify potent inhibitors. The most potent inhibitor exhibits an IC50 value of 54 nM, which represents a 240‐fold improvement in potency compared to the parent hits. Subsequent X‐ray crystallography validated the predicted binding mode, thus demonstrating the efficiency of the combination of fragment linking and DCC as a hit‐identification strategy. This approach could be applied to a range of biological targets, and holds the potential to facilitate hit‐to‐lead optimization. PMID:27400756

  18. Fracture mechanics model of fragmentation

    International Nuclear Information System (INIS)

    Glenn, L.A.; Gommerstadt, B.Y.; Chudnovsky, A.

    1986-01-01

    A model of the fragmentation process is developed, based on the theory of linear elastic fracture mechanics, which predicts the average fragment size as a function of strain rate and material properties. This approach permits a unification of previous results, yielding Griffith's solution in the low-strain-rate limit and Grady's solution at high strain rates.

  19. High-Resolution Amplified Fragment Length Polymorphism Typing of Lactococcus lactis Strains Enables Identification of Genetic Markers for Subspecies-Related Phenotypes

    Science.gov (United States)

    Kütahya, Oylum Erkus; Starrenburg, Marjo J. C.; Rademaker, Jan L. W.; Klaassen, Corné H. W.; van Hylckama Vlieg, Johan E. T.; Smid, Eddy J.; Kleerebezem, Michiel

    2011-01-01

    A high-resolution amplified fragment length polymorphism (AFLP) methodology was developed to achieve the delineation of closely related Lactococcus lactis strains. The differentiation depth of 24 enzyme-primer-nucleotide combinations was experimentally evaluated to maximize the number of polymorphisms. The resolution depth was confirmed by performing diversity analysis on 82 L. lactis strains, including both closely and distantly related strains with dairy and nondairy origins. Strains clustered into two main genomic lineages of L. lactis subsp. lactis and L. lactis subsp. cremoris type-strain-like genotypes and a third novel genomic lineage rooted from the L. lactis subsp. lactis genomic lineage. Cluster differentiation was highly correlated with small-subunit rRNA homology and multilocus sequence analysis (MLSA) studies. Additionally, the selected enzyme-primer combination generated L. lactis subsp. cremoris phenotype-specific fragments irrespective of the genotype. These phenotype-specific markers allowed the differentiation of L. lactis subsp. lactis phenotype from L. lactis subsp. cremoris phenotype strains within the same L. lactis subsp. cremoris type-strain-like genomic lineage, illustrating the potential of AFLP for the generation of phenotype-linked genetic markers. PMID:21666014

  20. Hot spot analysis for driving the development of hits into leads in fragment based drug discovery

    Science.gov (United States)

    Hall, David R.; Ngan, Chi Ho; Zerbe, Brandon S.; Kozakov, Dima; Vajda, Sandor

    2011-01-01

    Fragment based drug design (FBDD) starts with finding fragment-sized compounds that are highly ligand efficient and can serve as a core moiety for developing high affinity leads. Although the core-bound structure of a protein facilitates the construction of leads, effective design is far from straightforward. We show that protein mapping, a computational method developed to find binding hot spots and implemented as the FTMap server, provides information that complements the fragment screening results and can drive the evolution of core fragments into larger leads with a minimal loss or, in some cases, even a gain in ligand efficiency. The method places small molecular probes, the size of organic solvents, on a dense grid around the protein, and identifies the hot spots as consensus clusters formed by clusters of several probes. The hot spots are ranked based on the number of probe clusters, which predicts the binding propensity of the subsites and hence their importance for drug design. Accordingly, with a single exception the main hot spot identified by FTMap binds the core compound found by fragment screening. The most useful information is provided by the neighboring secondary hot spots, indicating the regions where the core can be extended to increase its affinity. To quantify this information, we calculate the density of probes from mapping, which describes the binding propensity at each point, and show that the change in the correlation between a ligand position and the probe density upon extending or repositioning the core moiety predicts the expected change in ligand efficiency. PMID:22145575

  1. Transposon domestication versus mutualism in ciliate genome rearrangements.

    Directory of Open Access Journals (Sweden)

    Alexander Vogt

    Ciliated protists rearrange their genomes dramatically during nuclear development via chromosome fragmentation and DNA deletion to produce a trimmer and highly reorganized somatic genome. The deleted portion of the genome includes potentially active transposons or transposon-like sequences that reside in the germline. Three independent studies recently showed that transposase proteins of the DDE/DDD superfamily are indispensable for DNA processing in three distantly related ciliates. In the spirotrich Oxytricha trifallax, high copy-number germline-limited transposons mediate their own excision from the somatic genome but also contribute to programmed genome rearrangement through a remarkable transposon mutualism with the host. By contrast, the genomes of two oligohymenophorean ciliates, Tetrahymena thermophila and Paramecium tetraurelia, encode homologous PiggyBac-like transposases as single-copy genes in both their germline and somatic genomes. These domesticated transposases are essential for deletion of thousands of different internal sequences in these species. This review contrasts the events underlying somatic genome reduction in three different ciliates and considers their evolutionary origins and the relationships among their distinct mechanisms for genome remodeling.

  2. Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field.

    Science.gov (United States)

    Xu, Dong; Zhang, Yang

    2012-07-01

    Ab initio protein folding is one of the major unsolved problems in computational biology owing to the difficulties in force field design and conformational search. We developed a novel program, QUARK, for template-free protein structure prediction. Query sequences are first broken into fragments of 1-20 residues, where multiple fragment structures are retrieved at each position from unrelated experimental structures. Full-length structure models are then assembled from fragments using replica-exchange Monte Carlo simulations, which are guided by a composite knowledge-based force field. A number of novel energy terms and Monte Carlo movements are introduced, and their particular contributions to enhancing the efficiency of both the force field and the search engine are analyzed in detail. The QUARK prediction procedure is depicted and tested on the structure modeling of 145 nonhomologous proteins. Although no global templates are used and all fragments from experimental structures with template modeling score >0.5 are excluded, QUARK can successfully construct 3D models of correct folds in one-third of the cases for short proteins of up to 100 residues. In the ninth community-wide Critical Assessment of protein Structure Prediction experiment, the QUARK server outperformed the second and third best servers by 18 and 47%, based on the cumulative Z-score of global distance test-total scores in the FM category. Although ab initio protein folding remains a significant challenge, these data demonstrate new progress toward the solution of the most important problem in the field. Copyright © 2012 Wiley Periodicals, Inc.
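
    The replica-exchange Monte Carlo search named in this abstract can be illustrated with a minimal sketch. It is not QUARK's implementation: the one-dimensional toy energy, the temperature ladder and all function names are assumptions chosen only to show the standard Metropolis criterion for swapping configurations between neighbouring temperatures.

```python
import math
import random


def swap_probability(energy_i, energy_j, temp_i, temp_j, k_b=1.0):
    """Standard acceptance probability for exchanging two replicas:
    min(1, exp((1/kT_i - 1/kT_j) * (E_i - E_j)))."""
    delta = (1.0 / (k_b * temp_i) - 1.0 / (k_b * temp_j)) * (energy_i - energy_j)
    return 1.0 if delta >= 0 else math.exp(delta)


def toy_energy(x):
    """Hypothetical double-well energy, for illustration only."""
    return (x * x - 1.0) ** 2


def replica_exchange(temps, steps=2000, step_size=0.5, seed=0):
    """Run one replica per temperature and return the final states."""
    rng = random.Random(seed)
    states = [rng.uniform(-2.0, 2.0) for _ in temps]
    for _ in range(steps):
        # Local Metropolis move inside each replica.
        for r, temp in enumerate(temps):
            trial = states[r] + rng.uniform(-step_size, step_size)
            d_e = toy_energy(trial) - toy_energy(states[r])
            if d_e <= 0 or rng.random() < math.exp(-d_e / temp):
                states[r] = trial
        # Attempt one swap between a random pair of neighbouring temperatures.
        r = rng.randrange(len(temps) - 1)
        p = swap_probability(toy_energy(states[r]), toy_energy(states[r + 1]),
                             temps[r], temps[r + 1])
        if rng.random() < p:
            states[r], states[r + 1] = states[r + 1], states[r]
    return states


final_states = replica_exchange([0.1, 0.5, 1.0])
```

    The swap step is what lets a configuration trapped at low temperature escape through a hotter replica; a real structure-prediction code would replace the toy energy with its force field and the random displacement with fragment-based moves.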

  3. Mechanisms of Base Substitution Mutagenesis in Cancer Genomes

    Directory of Open Access Journals (Sweden)

    Albino Bacolla

    2014-03-01

    Cancer genome sequence data provide an invaluable resource for inferring the key mechanisms by which mutations arise in cancer cells, favoring their survival, proliferation and invasiveness. Here we examine recent advances in understanding the molecular mechanisms responsible for the predominant type of genetic alteration found in cancer cells, somatic single base substitutions (SBSs). Cytosine methylation, demethylation and deamination, charge transfer reactions in DNA, DNA replication timing, chromatin status and altered DNA proofreading activities are all now known to contribute to the mechanisms leading to base substitution mutagenesis. We review current hypotheses as to the major processes that give rise to SBSs and evaluate their relative relevance in the light of knowledge acquired from cancer genome sequencing projects and the study of base modifications, DNA repair and lesion bypass. Although gene expression data on APOBEC3B enzymes provide support for a role in cancer mutagenesis through U:G mismatch intermediates, the enzyme preference for single-stranded DNA may limit its activity genome-wide. For SBSs at both CG:CG and YC:GR sites, we outline evidence for a prominent role of damage by charge transfer reactions that follow interactions of the DNA with reactive oxygen species (ROS) and other endogenous or exogenous electron-abstracting molecules.

  4. Mechanisms of base substitution mutagenesis in cancer genomes.

    Science.gov (United States)

    Bacolla, Albino; Cooper, David N; Vasquez, Karen M

    2014-03-05

    Cancer genome sequence data provide an invaluable resource for inferring the key mechanisms by which mutations arise in cancer cells, favoring their survival, proliferation and invasiveness. Here we examine recent advances in understanding the molecular mechanisms responsible for the predominant type of genetic alteration found in cancer cells, somatic single base substitutions (SBSs). Cytosine methylation, demethylation and deamination, charge transfer reactions in DNA, DNA replication timing, chromatin status and altered DNA proofreading activities are all now known to contribute to the mechanisms leading to base substitution mutagenesis. We review current hypotheses as to the major processes that give rise to SBSs and evaluate their relative relevance in the light of knowledge acquired from cancer genome sequencing projects and the study of base modifications, DNA repair and lesion bypass. Although gene expression data on APOBEC3B enzymes provide support for a role in cancer mutagenesis through U:G mismatch intermediates, the enzyme preference for single-stranded DNA may limit its activity genome-wide. For SBSs at both CG:CG and YC:GR sites, we outline evidence for a prominent role of damage by charge transfer reactions that follow interactions of the DNA with reactive oxygen species (ROS) and other endogenous or exogenous electron-abstracting molecules.

  5. The dual role of fragments in fragment-assembly methods for de novo protein structure prediction

    Science.gov (United States)

    Handl, Julia; Knowles, Joshua; Vernon, Robert; Baker, David; Lovell, Simon C.

    2013-01-01

    In fragment-assembly techniques for protein structure prediction, models of protein structure are assembled from fragments of known protein structures. This process is typically guided by a knowledge-based energy function and uses a heuristic optimization method. The fragments play two important roles in this process: they define the set of structural parameters available, and they also assume the role of the main variation operators that are used by the optimiser. Previous analysis has typically focused on the first of these roles. In particular, the relationship between local amino acid sequence and local protein structure has been studied by a range of authors. The correlation between the two has been shown to vary with the window length considered, and the results of these analyses have informed directly the choice of fragment length in state-of-the-art prediction techniques. Here, we focus on the second role of fragments and aim to determine the effect of fragment length from an optimization perspective. We use theoretical analyses to reveal how the size and structure of the search space changes as a function of insertion length. Furthermore, empirical analyses are used to explore additional ways in which the size of the fragment insertion influences the search both in a simulation model and for the fragment-assembly technique, Rosetta. PMID:22095594

  6. Evidence-based gene models for structural and functional annotations of the oil palm genome.

    Science.gov (United States)

    Chan, Kuang-Lim; Tatarinova, Tatiana V; Rosli, Rozana; Amiruddin, Nadzirah; Azizi, Norazah; Halim, Mohd Amin Ab; Sanusi, Nik Shazana Nik Mohd; Jayanthi, Nagappan; Ponomarenko, Petr; Triska, Martin; Solovyev, Victor; Firdaus-Raih, Mohd; Sambanthamurthi, Ravigadevi; Murphy, Denis; Low, Eng-Ti Leslie

    2017-09-08

    Oil palm is an important source of edible oil. The importance of the crop, as well as its long breeding cycle (10-12 years), led to the sequencing of its genome in 2013 to pave the way for genomics-guided breeding. Nevertheless, the first set of gene predictions, although useful, had many fragmented genes. Classification and characterization of genes associated with traits of interest, such as those for fatty acid biosynthesis and disease resistance, were also limited. Lipid-, especially fatty acid (FA)-related genes are of particular interest for the oil palm as they specify oil yields and quality. This paper presents the characterization of the oil palm genome using different gene prediction methods and comparative genomics analysis, identification of FA biosynthesis and disease resistance genes, and the development of an annotation database and bioinformatics tools. Using two independent gene-prediction pipelines, Fgenesh++ and Seqping, 26,059 oil palm genes with transcriptome and RefSeq support were identified from the oil palm genome. These coding regions of the genome have a characteristic broad distribution of GC3 (fraction of cytosine and guanine in the third position of a codon), with over half the GC3-rich genes (GC3 ≥ 0.75286) being intronless. In comparison, only one-seventh of the oil palm genes identified are intronless. Using comparative genomics analysis, characterization of conserved domains and active sites, and expression analysis, 42 key genes involved in FA biosynthesis in oil palm were identified. For three of them, namely EgFABF, EgFABH and EgFAD3, segmental duplication events were detected. Our analysis also identified 210 candidate resistance genes in six classes, grouped by their protein domain structures. We present an accurate and comprehensive annotation of the oil palm genome, focusing on analysis of important categories of genes (GC3-rich and intronless), as well as those associated with important functions, such as FA

  7. A Near-Complete Haplotype-Phased Genome of the Dikaryotic Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici Reveals High Interhaplotype Diversity

    Directory of Open Access Journals (Sweden)

    Benjamin Schwessinger

    2018-02-01

    A long-standing biological question is how evolution has shaped the genomic architecture of dikaryotic fungi. To answer this, high-quality genomic resources that enable haplotype comparisons are essential. Short-read genome assemblies for dikaryotic fungi are highly fragmented and lack haplotype-specific information due to the high heterozygosity and repeat content of these genomes. Here, we present a diploid-aware assembly of the wheat stripe rust fungus Puccinia striiformis f. sp. tritici based on long reads using the FALCON-Unzip assembler. Transcriptome sequencing data sets were used to infer high-quality gene models and identify virulence genes involved in plant infection, referred to as effectors. This represents the most complete Puccinia striiformis f. sp. tritici genome assembly to date (83 Mb, 156 contigs, N50 of 1.5 Mb) and provides phased haplotype information for over 92% of the genome. Comparisons of the phase blocks revealed high interhaplotype diversity of over 6%. More than 25% of all genes lack a clear allelic counterpart. When we investigated genome features that potentially promote the rapid evolution of virulence, we found that candidate effector genes are spatially associated with conserved genes commonly found in basidiomycetes. Yet, candidate effectors that lack an allelic counterpart are more distant from conserved genes than allelic candidate effectors and are less likely to be evolutionarily conserved within the P. striiformis species complex and Pucciniales. In summary, this haplotype-phased assembly enabled us to discover novel genome features of a dikaryotic plant-pathogenic fungus previously hidden in collapsed and fragmented genome assemblies.
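
    Since this abstract summarises assembly contiguity with an N50 value, a short sketch of how N50 is conventionally computed may be useful. The function name and the toy contig lengths are illustrative assumptions, not data from the study.

```python
def n50(contig_lengths):
    """Return the N50: the largest length L such that contigs of length >= L
    together cover at least half of the total assembly length."""
    if not contig_lengths:
        raise ValueError("contig_lengths must not be empty")
    total = sum(contig_lengths)
    running = 0
    # Walk from the longest contig down until half the assembly is covered.
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


toy_contigs = [80, 70, 50, 40, 30, 20, 10]  # hypothetical lengths, total 300
toy_n50 = n50(toy_contigs)                  # 80 + 70 = 150 reaches half of 300
```

    In this toy case the two longest contigs already cover half of the total, so the N50 is 70. A fragmented short-read assembly would need many more contigs to reach the halfway point and would therefore report a much smaller value.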

  8. A fragment-based approach towards ab-initio treatment of polymeric ...

    Indian Academy of Sciences (India)

    Reshma S Pingale

    2017-06-20

    Jun 20, 2017 ... Keywords. π-Conjugated polymer; divide and conquer; ab-initio; fragmentation. PACS Nos 31.15.A−; 36.20. ... cut the parent system into a set of overlapping small fragments and .... some oligomers, we approached the problem by increas- ..... Financial support of DST, Govt. of India, New Delhi, in the form of ...

  9. Effective progression of nuclear magnetic resonance-detected fragment hits.

    Science.gov (United States)

    Eaton, Hugh L; Wyss, Daniel F

    2011-01-01

    Fragment-based drug discovery (FBDD) has become increasingly popular over the last decade as an alternate lead generation tool to HTS approaches. Several compounds have now progressed into the clinic which originated from a fragment-based approach, demonstrating the utility of this emerging field. While fragment hit identification has become much more routine and may involve different screening approaches, the efficient progression of fragment hits into quality lead series may still present a major bottleneck for the broadly successful application of FBDD. In our laboratory, we have extensive experience in fragment-based NMR screening (SbN) and the subsequent iterative progression of fragment hits using structure-assisted chemistry. To maximize impact, we have applied this approach strategically to early- and high-priority targets, and those struggling for leads. Its application has yielded a clinical candidate for BACE1 and lead series in about one third of the SbN/FBDD projects. In this chapter, we will give an overview of our strategy and focus our discussion on NMR-based FBDD approaches. Copyright © 2011 Elsevier Inc. All rights reserved.

  10. Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models

    Directory of Open Access Journals (Sweden)

    Surovcik Katharina

    2006-03-01

    Background: Horizontal gene transfer (HGT) is considered a strong evolutionary force shaping the content of microbial genomes in a substantial manner. It is the difference in speed enabling the rapid adaptation to changing environmental demands that distinguishes HGT from gene genesis, duplications or mutations. For a precise characterization, algorithms are needed that identify transfer events with high reliability. Frequently, the transferred pieces of DNA have a considerable length, comprise several genes and are called genomic islands (GIs) or, more specifically, pathogenicity or symbiotic islands. Results: We have implemented the program SIGI-HMM that predicts GIs and the putative donor of each individual alien gene. It is based on the analysis of codon usage (CU) of each individual gene of a genome under study. CU of each gene is compared against a carefully selected set of CU tables representing microbial donors or highly expressed genes. Multiple tests are used to identify putatively alien genes, to predict putative donors and to mask putatively highly expressed genes. Thus, we determine the states and emission probabilities of an inhomogeneous hidden Markov model working on gene level. For the transition probabilities, we draw upon classical test theory with the intention of integrating a sensitivity controller in a consistent manner. SIGI-HMM was written in JAVA and is publicly available. It accepts as input any file created according to the EMBL-format. It generates output in the common GFF format readable for genome browsers. Benchmark tests showed that the output of SIGI-HMM is in agreement with known findings. Its predictions were both consistent with annotated GIs and with predictions generated by different methods. Conclusion: SIGI-HMM is a sensitive tool for the identification of GIs in microbial genomes. It allows users to interactively analyze genomes in detail and to generate or to test hypotheses about the origin of acquired

  11. Genomic profiling of oral squamous cell carcinoma by array-based comparative genomic hybridization.

    Directory of Open Access Journals (Sweden)

    Shunichi Yoshioka

    We designed a study to investigate genetic relationships between primary tumors of oral squamous cell carcinoma (OSCC) and their lymph node metastases, and to identify genomic copy number aberrations (CNAs) related to lymph node metastasis. For this purpose, we collected a total of 42 tumor samples from 25 patients and analyzed their genomic profiles by array-based comparative genomic hybridization. We then compared the genetic profiles of metastatic primary tumors (MPTs) with their paired lymph node metastases (LNMs), and also those of LNMs with non-metastatic primary tumors (NMPTs). Firstly, we found that although there were some distinctive differences in the patterns of genomic profiles between MPTs and their paired LNMs, the paired samples shared similar genomic aberration patterns in each case. Unsupervised hierarchical clustering analysis grouped together 12 of the 15 MPT-LNM pairs. Furthermore, similarity scores between paired samples were significantly higher than those between non-paired samples. These results suggested that MPTs and their paired LNMs are composed predominantly of genetically clonal tumor cells, while minor populations with different CNAs may also exist in metastatic OSCCs. Secondly, to identify CNAs related to lymph node metastasis, we compared CNAs between grouped samples of MPTs and LNMs, but were unable to find any CNAs that were more common in LNMs. Finally, we hypothesized that subpopulations carrying metastasis-related CNAs might be present in both the MPT and LNM. Accordingly, we compared CNAs between NMPTs and LNMs, and found that gains of 7p, 8q and 17q were more common in the latter than in the former, suggesting that these CNAs may be involved in lymph node metastasis of OSCC. In conclusion, our data suggest that in OSCCs showing metastasis, the primary and metastatic tumors share similar genomic profiles, and that cells in the primary tumor may tend to metastasize after acquiring metastasis-associated CNAs.

  12. GenomeRNAi: a database for cell-based RNAi phenotypes.

    Science.gov (United States)

    Horn, Thomas; Arziman, Zeynep; Berger, Juerg; Boutros, Michael

    2007-01-01

    RNA interference (RNAi) has emerged as a powerful tool to generate loss-of-function phenotypes in a variety of organisms. Combined with the sequence information of almost completely annotated genomes, RNAi technologies have opened new avenues to conduct systematic genetic screens for every annotated gene in the genome. As increasingly large datasets of RNAi-induced phenotypes become available, an important challenge remains the systematic integration and annotation of functional information. Genome-wide RNAi screens have been performed both in Caenorhabditis elegans and Drosophila for a variety of phenotypes, and several RNAi libraries have become available to assess phenotypes for almost every gene in the genome. These screens were performed using different types of assays, from visible phenotypes to focused transcriptional readouts, and provide a rich data source for functional annotation across different species. The GenomeRNAi database provides access to published RNAi phenotypes obtained from cell-based screens and maps them to their genomic locus, including possible non-specific regions. The database also gives access to sequence information of RNAi probes used in various screens. It can be searched by phenotype, by gene, by RNAi probe or by sequence and is accessible at http://rnai.dkfz.de.

  13. Identification of genomic sites for CRISPR/Cas9-based genome editing in the Vitis vinifera genome

    Science.gov (United States)

    CRISPR/Cas9 has been recently demonstrated as an effective and popular genome editing tool for modifying genomes of human, animals, microorganisms, and plants. Success of such genome editing is highly dependent on the availability of suitable target sites in the genomes to be edited. Many specific t...

  14. Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes.

    Science.gov (United States)

    Cabal, Adriana; Jun, Se-Ran; Jenjaroenpun, Piroon; Wanchai, Visanu; Nookaew, Intawat; Wongsurawat, Thidathip; Burgess, Mary J; Kothari, Atul; Wassenaar, Trudy M; Ussery, David W

    2018-02-14

    Infections due to Clostridioides difficile (previously known as Clostridium difficile) are a major problem in hospitals, where cases can be caused by community-acquired strains as well as by nosocomial spread. Whole genome sequences from clinical samples contain a lot of information, but it needs to be analyzed and compared in such a way that the outcome is useful for clinicians or epidemiologists. Here, we compare 663 publicly available complete genome sequences of C. difficile using average amino acid identity (AAI) scores. This analysis revealed that most of these genomes (640, 96.5%) clearly belong to the same species, while the remaining 23 genomes produce four distinct clusters within the Clostridioides genus. The main C. difficile cluster can be further divided into sub-clusters, depending on the chosen cutoff. We demonstrate that MLST, either based on partial or full gene-length, results in biased estimates of genetic differences and does not capture the true degree of similarity or differences of complete genomes. Presence of genes coding for C. difficile toxins A and B (ToxA/B), as well as the binary C. difficile toxin (CDT), was deduced from their unique PfamA domain architectures. Out of the 663 C. difficile genomes, 535 (80.7%) contained at least one copy of ToxA or ToxB, while these genes were missing from 128 genomes. Although some clusters were enriched for toxin presence, these genes are variably present in a given genetic background. The CDT genes were found in 191 genomes, which were restricted to a few clusters only, and only one cluster lacked the toxin A/B genes consistently. A total of 310 genomes contained ToxA/B without CDT (47%). Further, published metagenomic data from stools were used to assess the presence of C. difficile sequences in blinded cases of C. difficile infection (CDI) and controls, to test if metagenomic analysis is sensitive enough to detect the pathogen, and to establish strain relationships between cases from the same

  15. The complete mitochondrial genome of Gossypium hirsutum and evolutionary analysis of higher plant mitochondrial genomes.

    Science.gov (United States)

    Liu, Guozheng; Cao, Dandan; Li, Shuangshuang; Su, Aiguo; Geng, Jianing; Grover, Corrinne E; Hu, Songnian; Hua, Jinping

    2013-01-01

    Mitochondria are the main manufacturers of cellular ATP in eukaryotes. The plant mitochondrial genome contains a large amount of foreign DNA and repeated sequences that undergo frequent intramolecular recombination. Upland Cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant in the world. Sequencing of the cotton mitochondrial (mt) genome could be helpful for research on the evolution of plant mt genomes. We used 454 technology for sequencing, combined with screening of a Fosmid library of the Gossypium hirsutum mt genome and sequencing of positive clones, and conducted a series of evolutionary analyses on the mt genomes of Cycas taitungensis and 24 angiosperms. After data assembly and contig joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The completed G. hirsutum mt genome is 621,884 bp in length and contains 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are found conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes, and closely related species share the most homologous sequences. For species that have both mt and chloroplast genome sequences available, we checked the location of cp-like migration and found several fragments closely linked with mitochondrial genes. The G. hirsutum mt genome possesses most of the common characters of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes, suggests that evolution of mt genomes is consistent with plant taxonomy but independent among different species.

  16. Fragment-Based Screening of a Natural Product Library against 62 Potential Malaria Drug Targets Employing Native Mass Spectrometry

    Science.gov (United States)

    2018-01-01

    Natural products are well known for their biological relevance, high degree of three-dimensionality, and access to areas of largely unexplored chemical space. To shape our understanding of the interaction between natural products and protein targets in the postgenomic era, we have used native mass spectrometry to investigate 62 potential protein targets for malaria using a natural-product-based fragment library. We reveal here 96 low-molecular-weight natural products identified as binding partners of 32 of the putative malarial targets. Seventy-nine (79) fragments have direct growth inhibition on Plasmodium falciparum at concentrations that are promising for the development of fragment hits against these protein targets. This adds a fragment library to the published HTS active libraries in the public domain. PMID:29436819

  17. Immunogenic properties of Streptococcus agalactiae FbsA fragments.

    Directory of Open Access Journals (Sweden)

    Salvatore Papasergi

    Several species of Gram-positive bacteria can avidly bind soluble and surface-associated fibrinogen (Fng), a property that is considered important in the pathogenesis of human infections. To gain insights into the mechanism by which group B Streptococcus (GBS), a frequent neonatal pathogen, interacts with Fng, we have screened two phage-displayed genomic GBS libraries. All of the Fng-binding phage clones contained inserts encoding fragments of FbsA, a protein displaying multiple repeats. Since the functional role of this protein is only partially understood, representative fragments were recombinantly expressed and analyzed for Fng binding affinity and ability to induce immune protection against GBS infection. Maternal immunization with 6pGST, a fragment containing five repeats, significantly protected mouse pups against lethal GBS challenge, and these protective effects could be recapitulated by administration of anti-6pGST serum from adult animals. Notably, a monoclonal antibody that was capable of neutralizing Fng binding by 6pGST, but not a non-neutralizing antibody, could significantly protect pups against lethal GBS challenge. These data suggest that FbsA-Fng interaction promotes GBS pathogenesis and that blocking such interaction is a viable strategy to prevent or treat GBS infections.

  18. Microarray MAPH: accurate array-based detection of relative copy number in genomic DNA

    Directory of Open Access Journals (Sweden)

    Chan Alan

    2006-06-01

    Full Text Available Abstract. Background: Current methods for measurement of copy number do not combine all the desirable qualities of convenience, throughput, economy, accuracy and resolution. In this study, to improve the throughput associated with Multiplex Amplifiable Probe Hybridisation (MAPH), we aimed to develop a modification based on the 3-Dimensional, Flow-Through Microarray Platform from PamGene International. In this new method, electrophoretic analysis of amplified products is replaced with photometric analysis of a probed oligonucleotide array. Copy number analysis of hybridised probes is based on a dual-label approach by comparing the intensity of Cy3-labelled MAPH probes amplified from test samples co-hybridised with similarly amplified Cy5-labelled reference MAPH probes. The key feature of using a hybridisation-based end point with MAPH is that discrimination of amplified probes is based on sequence and not fragment length. Results: In this study we showed that microarray MAPH measurement of PMP22 gene dosage correlates well with PMP22 gene dosage determined by capillary MAPH and that copy number was accurately reported in analyses of DNA from 38 individuals, 12 of which were known to have Charcot-Marie-Tooth disease type 1A (CMT1A). Conclusion: Measurement of microarray-based endpoints for MAPH appears to be of comparable accuracy to electrophoretic methods, and holds the prospect of fully exploiting the potential multiplicity of MAPH. The technology has the potential to simplify copy number assays for genes with a large number of exons, or of expanded sets of probes from dispersed genomic locations.

  19. Microarray MAPH: accurate array-based detection of relative copy number in genomic DNA.

    Science.gov (United States)

    Gibbons, Brian; Datta, Parikkhit; Wu, Ying; Chan, Alan; Al Armour, John

    2006-06-30

    Current methods for measurement of copy number do not combine all the desirable qualities of convenience, throughput, economy, accuracy and resolution. In this study, to improve the throughput associated with Multiplex Amplifiable Probe Hybridisation (MAPH) we aimed to develop a modification based on the 3-Dimensional, Flow-Through Microarray Platform from PamGene International. In this new method, electrophoretic analysis of amplified products is replaced with photometric analysis of a probed oligonucleotide array. Copy number analysis of hybridised probes is based on a dual-label approach by comparing the intensity of Cy3-labelled MAPH probes amplified from test samples co-hybridised with similarly amplified Cy5-labelled reference MAPH probes. The key feature of using a hybridisation-based end point with MAPH is that discrimination of amplified probes is based on sequence and not fragment length. In this study we showed that microarray MAPH measurement of PMP22 gene dosage correlates well with PMP22 gene dosage determined by capillary MAPH and that copy number was accurately reported in analyses of DNA from 38 individuals, 12 of which were known to have Charcot-Marie-Tooth disease type 1A (CMT1A). Measurement of microarray-based endpoints for MAPH appears to be of comparable accuracy to electrophoretic methods, and holds the prospect of fully exploiting the potential multiplicity of MAPH. The technology has the potential to simplify copy number assays for genes with a large number of exons, or of expanded sets of probes from dispersed genomic locations.

  20. Random Tagging Genotyping by Sequencing (rtGBS), an Unbiased Approach to Locate Restriction Enzyme Sites across the Target Genome.

    Directory of Open Access Journals (Sweden)

    Elena Hilario

    Full Text Available Genotyping by sequencing (GBS) is a restriction enzyme based targeted approach developed to reduce the genome complexity and discover genetic markers when a priori sequence information is unavailable. Sufficient coverage at each locus is essential to distinguish heterozygous from homozygous sites accurately. The number of GBS samples able to be pooled in one sequencing lane is limited by the number of restriction sites present in the genome and the read depth required at each site per sample for accurate calling of single-nucleotide polymorphisms. Loci bias was observed using a slight modification of the Elshire et al. method: some restriction enzyme sites were represented in higher proportions while others were poorly represented or absent. This bias could be due to the quality of genomic DNA, the endonuclease and ligase reaction efficiency, the distance between restriction sites, the preferential amplification of small library restriction fragments, or bias towards cluster formation of small amplicons during the sequencing process. To overcome these issues, we have developed a GBS method based on randomly tagging genomic DNA (rtGBS). By randomly landing on the genome, we can, with less bias, find restriction sites that are far apart and undetected by the standard GBS (stdGBS) method. The study comprises two types of biological replicates: six different kiwifruit plants and two independent DNA extractions per plant; and three types of technical replicates: four samples of each DNA extraction, stdGBS vs. rtGBS methods, and two independent library amplifications, each sequenced in separate lanes. A statistically significant unbiased distribution of restriction fragment size by rtGBS showed that this method targeted 49% (39,145) of BamH I sites shared with the reference genome, compared to only 14% (11,513) by stdGBS.

  1. Fragment-based ¹³C nuclear magnetic resonance chemical shift predictions in molecular crystals: An alternative to planewave methods

    Energy Technology Data Exchange (ETDEWEB)

    Hartman, Joshua D.; Beran, Gregory J. O., E-mail: gregory.beran@ucr.edu [Department of Chemistry, University of California, Riverside, California 92521 (United States)]; Monaco, Stephen; Schatschneider, Bohdan [The Pennsylvania State University, The Eberly Campus, 2201 University Dr, Lemont Furnace, Pennsylvania 15456 (United States)]

    2015-09-14

    We assess the quality of fragment-based ab initio isotropic ¹³C chemical shift predictions for a collection of 25 molecular crystals with eight different density functionals. We explore the relative performance of cluster, two-body fragment, combined cluster/fragment, and the planewave gauge-including projector augmented wave (GIPAW) models relative to experiment. When electrostatic embedding is employed to capture many-body polarization effects, the simple and computationally inexpensive two-body fragment model predicts both isotropic ¹³C chemical shifts and the chemical shielding tensors as well as both cluster models and the GIPAW approach. Unlike the GIPAW approach, hybrid density functionals can be used readily in a fragment model, and all four hybrid functionals tested here (PBE0, B3LYP, B3PW91, and B97-2) predict chemical shifts in noticeably better agreement with experiment than the four generalized gradient approximation (GGA) functionals considered (PBE, OPBE, BLYP, and BP86). A set of recommended linear regression parameters for mapping between calculated chemical shieldings and observed chemical shifts is provided based on these benchmark calculations. Statistical cross-validation procedures are used to demonstrate the robustness of these fits.

  2. Evaluation of genetic diversity in jackfruit (Artocarpus heterophyllus Lam.) based on amplified fragment length polymorphism markers.

    Science.gov (United States)

    Shyamalamma, S; Chandra, S B C; Hegde, M; Naryanswamy, P

    2008-07-22

    Artocarpus heterophyllus Lam., commonly called jackfruit, is a medium-sized evergreen tree that bears high yields of the largest known edible fruit. Yet, it has been little explored commercially due to wide variation in fruit quality. The genetic diversity and genetic relatedness of 50 jackfruit accessions were studied using amplified fragment length polymorphism markers. Of 16 primer pairs evaluated, eight were selected for screening of genotypes based on the number and quality of polymorphic fragments produced. These primer combinations produced 5976 bands, 1267 (22%) of which were polymorphic. Among the jackfruit accessions, the similarity coefficient ranged from 0.137 to 0.978; the accessions also shared a large number of monomorphic fragments (78%). Cluster analysis and principal component analysis grouped all jackfruit genotypes into three major clusters. Cluster I included the genotypes grown in a jackfruit region of Karnataka, called Tamaka, with very dry conditions; cluster II contained the genotypes collected from locations having medium to heavy rainfall in Karnataka; cluster III grouped the genotypes in distant locations with different environmental conditions. Strong coincidence of these amplified fragment length polymorphism-based groupings with geographical localities as well as morphological characters was observed. We found moderate genetic diversity in these jackfruit accessions. This information should be useful for tree breeding programs, as part of our effort to popularize jackfruit as a commercial crop.
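    The two summary statistics quoted in this abstract, the share of polymorphic bands and the pairwise similarity coefficient, can be illustrated with a small sketch. The abstract does not say which similarity coefficient was used; the Jaccard coefficient is assumed here purely for illustration, and the accession names and band scores are hypothetical toy data, not values from the study.

```python
def jaccard_similarity(profile_a, profile_b):
    """Jaccard similarity between two binary band profiles (1 = band present).

    Assumption: Jaccard is used only as an example; the study's coefficient is not stated.
    """
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must score the same set of bands")
    shared = sum(1 for a, b in zip(profile_a, profile_b) if a == 1 and b == 1)
    either = sum(1 for a, b in zip(profile_a, profile_b) if a == 1 or b == 1)
    return shared / either if either else 0.0


def polymorphic_fraction(profiles):
    """Fraction of bands whose presence/absence differs between accessions."""
    n_bands = len(profiles[0])
    polymorphic = sum(
        1 for i in range(n_bands) if len({p[i] for p in profiles}) > 1
    )
    return polymorphic / n_bands


# Hypothetical toy band scores for three accessions (six scored bands each).
accessions = {
    "acc1": [1, 1, 0, 1, 1, 0],
    "acc2": [1, 1, 1, 1, 0, 0],
    "acc3": [1, 0, 0, 1, 1, 1],
}

similarity = jaccard_similarity(accessions["acc1"], accessions["acc2"])
share = polymorphic_fraction(list(accessions.values()))
print(f"similarity(acc1, acc2) = {similarity:.3f}")
print(f"polymorphic bands = {share:.1%}")
```

    A matrix of such pairwise values is the usual input to the cluster analysis mentioned in the abstract.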

  3. CrusView: A Java-Based Visualization Platform for Comparative Genomics Analyses in Brassicaceae Species

    Science.gov (United States)

    Chen, Hao; Wang, Xiangfeng

    2013-01-01

    In plants and animals, chromosomal breakage and fusion events based on conserved syntenic genomic blocks lead to conserved patterns of karyotype evolution among species of the same family. However, karyotype information has not been well utilized in genomic comparison studies. We present CrusView, a Java-based bioinformatic application utilizing Standard Widget Toolkit/Swing graphics libraries and a SQLite database for performing visualized analyses of comparative genomics data in Brassicaceae (crucifer) plants. Compared with similar software and databases, one of the unique features of CrusView is its integration of karyotype information when comparing two genomes. This feature allows users to perform karyotype-based genome assembly and karyotype-assisted genome synteny analyses with preset karyotype patterns of the Brassicaceae genomes. Additionally, CrusView is a local program, which gives its users high flexibility when analyzing unpublished genomes and allows users to upload self-defined genomic information so that they can visually study the associations between genome structural variations and genetic elements, including chromosomal rearrangements, genomic macrosynteny, gene families, high-frequency recombination sites, and tandem and segmental duplications between related species. This tool will greatly facilitate karyotype, chromosome, and genome evolution studies using visualized comparative genomics approaches in Brassicaceae species. CrusView is freely available at http://www.cmbb.arizona.edu/CrusView/. PMID:23898041

  4. Genome-based comparative analyses of Antarctic and temperate species of Paenibacillus.

    Directory of Open Access Journals (Sweden)

    Melissa Dsouza

    Full Text Available Antarctic soils represent a unique environment characterised by extremes of temperature, salinity, elevated UV radiation, low nutrient and low water content. Despite the harshness of this environment, members of 15 bacterial phyla have been identified in soils of the Ross Sea Region (RSR). However, the survival mechanisms and ecological roles of these phyla are largely unknown. The aim of this study was to investigate whether strains of Paenibacillus darwinianus owe their resilience to substantial genomic changes. For this, genome-based comparative analyses were performed on three P. darwinianus strains, isolated from gamma-irradiated RSR soils, together with nine temperate, soil-dwelling Paenibacillus spp. The genome of each strain was sequenced to over 1,000-fold coverage, then assembled into contigs totalling approximately 3 Mbp per genome. Based on the occurrence of essential, single-copy genes, genome completeness was estimated at approximately 88%. Genome analysis revealed between 3,043 and 3,091 protein-coding sequences (CDSs), primarily associated with two-component systems, sigma factors, transporters, sporulation and genes induced by cold-shock, oxidative and osmotic stresses. These comparative analyses provide an insight into the metabolic potential of P. darwinianus, revealing potential adaptive mechanisms for survival in Antarctic soils. However, a large proportion of these mechanisms were also identified in temperate Paenibacillus spp., suggesting that these mechanisms are beneficial for growth and survival in a range of soil environments. These analyses have also revealed that the P. darwinianus genomes contain significantly fewer CDSs and have a lower paralogous content. Notwithstanding the incompleteness of the assemblies, the large differences in genome sizes, determined by the number of genes in paralogous clusters and the CDS content, are indicative of genome content scaling. Finally, these sequences are a resource for further

  5. Genomic-based-breeding tools for tropical maize improvement.

    Science.gov (United States)

    Chakradhar, Thammineni; Hindu, Vemuri; Reddy, Palakolanu Sudhakar

    2017-12-01

    Maize has traditionally been the main staple in Southern Asia and Sub-Saharan Africa and is widely grown by millions of resource-poor small-scale farmers. Approximately 35.4 million hectares are sown to tropical maize, constituting around 59% of the developing worlds. Tropical maize encounters tremendous challenges besides poor agro-climatic situations, with recorded average yields of <3 tonnes/hectare, far less than the average of developed countries. In contrast to the poor yields, the demand for maize as food, feed, and fuel is continuously increasing in these regions. Heterosis breeding introduced in the early 90s improved maize yields significantly, but genetic gain is still a mirage, particularly for crops growing under marginal environments. Application of molecular markers has accelerated the pace of maize breeding to some extent. The availability of an array of sequencing and genotyping technologies offers unrivalled service to improve precision in maize-breeding programs through modern approaches such as genomic selection, genome-wide association studies, bulk segregant analysis-based sequencing approaches, etc. Superior alleles underlying complex traits can easily be identified and introgressed efficiently using these sequence-based approaches. Integration of genomic tools and techniques with advanced genetic resources such as nested association mapping and backcross nested association mapping could certainly address the genetic issues in maize improvement programs in developing countries. The huge diversity in tropical maize and its inherent capacity for doubled haploid technology offer an advantage in applying next-generation genomic tools to accelerate production in marginal environments of the tropical and subtropical world. Precision in phenotyping is the key to the success of any molecular-breeding approach. This article reviews genomic technologies and their application to improving agronomic traits in tropical maize breeding.

  6. e-Drug3D: 3D structure collections dedicated to drug repurposing and fragment-based drug design.

    Science.gov (United States)

    Pihan, Emilie; Colliandre, Lionel; Guichou, Jean-François; Douguet, Dominique

    2012-06-01

    In the drug discovery field, new uses for old drugs, selective optimization of side activities and fragment-based drug design (FBDD) have proved to be successful alternatives to high-throughput screening. e-Drug3D is a database of 3D chemical structures of drugs that provides several collections of ready-to-screen SD files of drugs and commercial drug fragments. They are natural inputs in studies dedicated to drug repurposing and FBDD. e-Drug3D collections are freely available at http://chemoinfo.ipmc.cnrs.fr/e-drug3d.html either for download or for direct in silico web-based screenings.

  7. Progress of CRISPR-Cas Based Genome Editing in Photosynthetic Microbes.

    Science.gov (United States)

    Naduthodi, Mihris Ibnu Saleem; Barbosa, Maria J; van der Oost, John

    2018-02-03

    The carbon footprint caused by unsustainable development and its environmental and economic impact has become a major concern in the past few decades. Photosynthetic microbes such as microalgae and cyanobacteria are capable of accumulating value-added compounds from carbon dioxide, and have been regarded as environmentally friendly alternatives to reduce the usage of fossil fuels, thereby contributing to reducing the carbon footprint. This light-driven generation of green chemicals and biofuels has triggered the research for metabolic engineering of these photosynthetic microbes. CRISPR-Cas systems are successfully implemented across a wide range of prokaryotic and eukaryotic species for efficient genome editing. However, the inception of this genome editing tool in microalgal and cyanobacterial species took off rather slowly due to various complications. In this review, we elaborate on the established CRISPR-Cas based genome editing in various microalgal and cyanobacterial species. The complications associated with CRISPR-Cas based genome editing in these species are addressed along with possible strategies to overcome these issues. It is anticipated that in the near future this will result in improving and expanding the microalgal and cyanobacterial genome engineering toolbox. © 2018 The Authors. Biotechnology Journal Published by Wiley-VCH Verlag GmbH & Co. KGaA.

  8. Genome-reconstruction for eukaryotes from complex natural microbial communities.

    Science.gov (United States)

    West, Patrick T; Probst, Alexander J; Grigoriev, Igor V; Thomas, Brian C; Banfield, Jillian F

    2018-04-01

    Microbial eukaryotes are integral components of natural microbial communities, and their inclusion is critical for many ecosystem studies, yet the majority of published metagenome analyses ignore eukaryotes. In order to include eukaryotes in environmental studies, we propose a method to recover eukaryotic genomes from complex metagenomic samples. A key step for genome recovery is separation of eukaryotic and prokaryotic fragments. We developed a k-mer-based strategy, EukRep, for eukaryotic sequence identification and applied it to environmental samples to show that it enables genome recovery, genome completeness evaluation, and prediction of metabolic potential. We used this approach to test the effect of addition of organic carbon on a geyser-associated microbial community and detected a substantial change of the community metabolism, with selection against almost all candidate phyla bacteria and archaea and for eukaryotes. Near complete genomes were reconstructed for three fungi placed within the Eurotiomycetes and an arthropod. While carbon fixation and sulfur oxidation were important functions in the geyser community prior to carbon addition, the organic carbon-impacted community showed enrichment for secreted proteases, secreted lipases, cellulose targeting CAZymes, and methanol oxidation. We demonstrate the broader utility of EukRep by reconstructing and evaluating relatively high-quality fungal, protist, and rotifer genomes from complex environmental samples. This approach opens the way for cultivation-independent analyses of whole microbial communities. © 2018 West et al.; Published by Cold Spring Harbor Laboratory Press.
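    A k-mer-based strategy of this kind starts by turning each sequence fragment into a vector of k-mer frequencies, which a classifier can then use to separate eukaryotic from prokaryotic fragments. The sketch below shows only that feature-extraction step. It is not the EukRep implementation: the function name, the default k = 5, and the handling of ambiguous bases are assumptions made for illustration, and no classifier is included.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(sequence, k=5):
    """Return normalized k-mer frequencies in a fixed lexicographic order.

    k-mers containing characters other than A, C, G, T (for example N)
    are skipped. The default k=5 is an illustrative assumption.
    """
    sequence = sequence.upper()
    counts = Counter()
    for i in range(len(sequence) - k + 1):
        kmer = sequence[i:i + k]
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    total = sum(counts.values())
    all_kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    return [counts[m] / total if total else 0.0 for m in all_kmers]


# Hypothetical fragment; real contigs would be thousands of bases long.
vector = kmer_frequencies("ACGTACGTNACG", k=2)
print(len(vector), round(sum(vector), 6))
```

    Because every fragment maps to a vector of the same length (4^k entries), fragments of different sizes can be compared directly or passed to any standard classifier.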

  9. Kernel-based whole-genome prediction of complex traits: a review.

    Science.gov (United States)

    Morota, Gota; Gianola, Daniel

    2014-01-01

    Prediction of genetic values has been a focus of applied quantitative genetics since the beginning of the 20th century, with renewed interest following the advent of the era of whole genome-enabled prediction. Opportunities offered by the emergence of high-dimensional genomic data fueled by post-Sanger sequencing technologies, especially molecular markers, have driven researchers to extend Ronald Fisher and Sewall Wright's models to confront new challenges. In particular, kernel methods are gaining consideration as a regression method of choice for genome-enabled prediction. Complex traits are presumably influenced by many genomic regions working in concert with others (clearly so when considering pathways), thus generating interactions. Motivated by this view, a growing number of statistical approaches based on kernels attempt to capture non-additive effects, either parametrically or non-parametrically. This review centers on whole-genome regression using kernel methods applied to a wide range of quantitative traits of agricultural importance in animals and plants. We discuss various kernel-based approaches tailored to capturing total genetic variation, with the aim of arriving at an enhanced predictive performance in the light of available genome annotation information. Connections between prediction machines born in animal breeding, statistics, and machine learning are revisited, and their empirical prediction performance is discussed. Overall, while some encouraging results have been obtained with non-parametric kernels, recovering non-additive genetic variation in a validation dataset remains a challenge in quantitative genetics.
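    As a rough illustration of what whole-genome regression with a kernel looks like, the sketch below fits a kernel ridge regression with a Gaussian kernel to marker genotypes and predicts the phenotype of a new individual. It is a generic textbook formulation, not a specific method from the review: the 0/1/2 marker coding, the bandwidth, the ridge penalty, and the toy data are all assumptions chosen for illustration.

```python
import math


def gaussian_kernel(x, y, bandwidth):
    """Gaussian (RBF) kernel between two marker vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def fit_kernel_ridge(markers, phenotypes, bandwidth=1.0, ridge=0.1):
    """Return dual coefficients alpha solving (K + ridge * I) alpha = y."""
    n = len(markers)
    k = [[gaussian_kernel(markers[i], markers[j], bandwidth) for j in range(n)]
         for i in range(n)]
    for i in range(n):
        k[i][i] += ridge
    return solve_linear_system(k, phenotypes)


def predict(markers, alpha, new_marker, bandwidth=1.0):
    """Predict a phenotype as a kernel-weighted sum over training individuals."""
    return sum(a * gaussian_kernel(x, new_marker, bandwidth)
               for a, x in zip(alpha, markers))


# Hypothetical toy data: three individuals, three markers coded 0/1/2.
train_markers = [[0, 1, 2], [2, 1, 0], [1, 1, 1]]
train_phenotypes = [1.0, 3.0, 2.0]
alpha = fit_kernel_ridge(train_markers, train_phenotypes)
print(predict(train_markers, alpha, [1, 1, 2]))
```

    The kernel depends on all markers jointly, which is how such models can pick up non-additive effects without specifying interactions explicitly. In practice the bandwidth and ridge penalty would be tuned by cross-validation.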

  10. Kernel-based whole-genome prediction of complex traits: a review

    Directory of Open Access Journals (Sweden)

    Gota Morota

    2014-10-01

    Full Text Available Prediction of genetic values has been a focus of applied quantitative genetics since the beginning of the 20th century, with renewed interest following the advent of the era of whole genome-enabled prediction. Opportunities offered by the emergence of high-dimensional genomic data fueled by post-Sanger sequencing technologies, especially molecular markers, have driven researchers to extend Ronald Fisher and Sewall Wright's models to confront new challenges. In particular, kernel methods are gaining consideration as a regression method of choice for genome-enabled prediction. Complex traits are presumably influenced by many genomic regions working in concert with others (clearly so when considering pathways), thus generating interactions. Motivated by this view, a growing number of statistical approaches based on kernels attempt to capture non-additive effects, either parametrically or non-parametrically. This review centers on whole-genome regression using kernel methods applied to a wide range of quantitative traits of agricultural importance in animals and plants. We discuss various kernel-based approaches tailored to capturing total genetic variation, with the aim of arriving at an enhanced predictive performance in the light of available genome annotation information. Connections between prediction machines born in animal breeding, statistics, and machine learning are revisited, and their empirical prediction performance is discussed. Overall, while some encouraging results have been obtained with non-parametric kernels, recovering non-additive genetic variation in a validation dataset remains a challenge in quantitative genetics.

  11. About human genome Acerca del genoma humano

    Directory of Open Access Journals (Sweden)

    Mojica Tobias

    2000-12-01

    Full Text Available The sequence of the human genome, an undertaking of advanced countries, is nearly complete. In fact, the Human Genome Project has around 85% of the genome sequenced 4 times on average, with an accuracy of roughly 1 in 1000 nucleotides. Celera Genomics, on the other hand, has 99% of the sequence of one person, with an accuracy of slightly less than 1 in 100. The Human Genome Project strives to produce a physical map for public consumption following a step-by-step strategy, in which the researcher sequences short DNA fragments belonging to larger fragments of known relative position. Celera Genomics wants to rapidly obtain a physical map that can be quickly used to develop genetic tests and drugs, which can later be sold. We feel that the sequence of the human genome is something that will widen the gap between advanced and backward countries. This article reviews the events surrounding the sequencing of the human genome that have caused so much excitement in the news media and in academia in recent months. It explains the strategies that have given us two different but complementary drafts: the taxpayer-funded strategy, which consists of establishing the order of large DNA fragments before they are sequenced, and the strategy funded by private industry, which aims to exploit for profit the knowledge derived from the human genome. As of mid-2000, the human genome is an incomplete draft that covers around 85% of the sequence with an accuracy of one error in 1000, and 99% of the sequence with an accuracy of less than 1 in 100 nucleotides. Also discussed are some of the possible avenues

  12. Nanobody®-based chromatin immunoprecipitation/micro-array analysis for genome-wide identification of transcription factor DNA binding sites

    Science.gov (United States)

    Nguyen-Duc, Trong; Peeters, Eveline; Muyldermans, Serge; Charlier, Daniel; Hassanzadeh-Ghassabeh, Gholamreza

    2013-01-01

    Nanobodies® are single-domain antibody fragments derived from camelid heavy-chain antibodies. Because of their small size, straightforward production in Escherichia coli, easy tailoring, high affinity, specificity, stability and solubility, nanobodies® have been exploited in various biotechnological applications. A major challenge in the post-genomics and post-proteomics era is the identification of regulatory networks involving nucleic acid–protein and protein–protein interactions. Here, we apply a nanobody® in chromatin immunoprecipitation followed by DNA microarray hybridization (ChIP-chip) for genome-wide identification of DNA–protein interactions. The Lrp-like regulator Ss-LrpB, arguably one of the best-studied specific transcription factors of the hyperthermophilic archaeon Sulfolobus solfataricus, was chosen for this proof-of-principle nanobody®-assisted ChIP. Three distinct Ss-LrpB-specific nanobodies®, each interacting with a different epitope, were generated for ChIP. Genome-wide ChIP-chip with one of these nanobodies® identified the well-established Ss-LrpB binding sites and revealed several unknown target sequences. Furthermore, these ChIP-chip profiles revealed auxiliary operator sites in the open reading frame of Ss-lrpB. Our work introduces nanobodies® as a novel class of affinity reagents for ChIP. Taking into account the unique characteristics of nanobodies®, in particular, their short generation time, nanobody®-based ChIP is expected to further streamline ChIP-chip and ChIP-Seq experiments, especially in organisms with no (or limited) possibility of genetic manipulation. PMID:23275538

  13. SPY: a new scission-point model based on microscopic inputs to predict fission fragment properties

    Energy Technology Data Exchange (ETDEWEB)

    Panebianco, Stefano; Lemaître, Jean-Francois; Sida, Jean-Luc [CEA Centre de Saclay, Gif-sur-Yvette (France)]; Dubray, Noël [CEA, DAM, DIF, Arpajon (France)]; Goriely, Stephane [Institut d'Astronomie et d'Astrophysique, Universite Libre de Bruxelles, Brussels (Belgium)]

    2014-07-01

    Despite the difficulty in describing the whole fission dynamics, the main fragment characteristics can be determined in a static approach based on a so-called scission-point model. Within this framework, a new Scission-Point model for the calculations of fission fragment Yields (SPY) has been developed. This model, initially based on the approach developed by Wilkins in the late seventies, consists in performing a static energy balance at scission, where the two fragments are supposed to be completely separated so that their macroscopic properties (mass and charge) can be considered as fixed. Given the knowledge of the system state density, averaged quantities such as mass and charge yields, mean kinetic and excitation energy can then be extracted in the framework of a microcanonical statistical description. The main advantage of the SPY model is the introduction of one of the most up-to-date microscopic descriptions of the nucleus for the individual energy of each fragment and, in the future, for their state density. These quantities are obtained in the framework of HFB calculations using the Gogny nucleon-nucleon interaction, ensuring an overall coherence of the model. Starting from a description of the SPY model and its main features, a comparison between the SPY predictions and experimental data will be discussed for some specific cases, from light nuclei around mercury to major actinides. Moreover, extensive predictions over the whole chart of nuclides will be discussed, with particular attention to their implication in stellar nucleosynthesis. Finally, future developments, mainly concerning the introduction of microscopic state densities, will be briefly discussed. (author)

  14. Targeting Ligandable Pockets on Plant Homeodomain (PHD) Zinc Finger Domains by a Fragment-Based Approach.

    Science.gov (United States)

    Amato, Anastasia; Lucas, Xavier; Bortoluzzi, Alessio; Wright, David; Ciulli, Alessio

    2018-04-20

    Plant homeodomain (PHD) zinc fingers are histone reader domains that are often associated with human diseases. Despite this, they constitute a poorly targeted class of readers, suggesting low ligandability. Here, we describe a successful fragment-based campaign targeting PHD fingers from the proteins BAZ2A and BAZ2B as model systems. We validated a pool of in silico fragments both biophysically and structurally and solved the first crystal structures of PHD zinc fingers in complex with fragments bound to an anchoring pocket at the histone binding site. The best-validated hits were found to displace a histone H3 tail peptide in competition assays. This work identifies new chemical scaffolds that provide suitable starting points for future ligand optimization using structure-guided approaches. The demonstrated ligandability of the PHD reader domains could pave the way for the development of chemical probes to drug this family of epigenetic readers.

  15. Toward The Reconstitution of the Maturation of Okazaki Fragments Multiprotein Complex in Human At The Single Molecule Level

    KAUST Repository

    Joudeh, Luay

    2017-01-01

    The maturation of Okazaki fragments on the lagging strand in eukaryotes is mediated by a highly coordinated multistep process involving several proteins that ensure the accurate and efficient replication of genomic DNA. Human proliferating cell

  16. Comparison by restriction fragment pattern analyses and molecular characterization of some European isolates of Suid herpesvirus 1: A contribution to strain differentiation of European isolates

    DEFF Research Database (Denmark)

    Christensen, Laurids Siig

    1988-01-01

    Eleven European isolates of Suid herpesvirus type 1 (SHV-1) were compared by restriction fragment pattern analyses and Southern blot hybridization using different genomic probes. The presence of strain discriminative 4 major genome types and several subtypes as well as the molecular distinctions...

  17. High-throughput fragment screening by affinity LC-MS.

    Science.gov (United States)

    Duong-Thi, Minh-Dao; Bergström, Maria; Fex, Tomas; Isaksson, Roland; Ohlson, Sten

    2013-02-01

    Fragment screening, an emerging approach for hit finding in drug discovery, has recently been proven effective by its first approved drug, vemurafenib, for cancer treatment. Techniques such as nuclear magnetic resonance, surface plasmon resonance, and isothermal titration calorimetry, with their own pros and cons, have been employed for screening fragment libraries. As an alternative approach, screening based on high-performance liquid chromatography separation has been developed. In this work, we present weak affinity LC/MS as a method to screen fragments under high-throughput conditions. Affinity-based capillary columns with immobilized thrombin were used to screen a collection of 590 compounds from a fragment library. The collection was divided into 11 mixtures (each containing 35 to 65 fragments) and screened by MS detection. The primary screening was performed at a rate corresponding to 3500 fragments per day. Thirty hits were defined, which subsequently entered a secondary screening using an active site-blocked thrombin column for confirmation of specificity. One hit showed selective binding to thrombin with an estimated dissociation constant (K_D) in the 0.1 mM range. This study shows that affinity LC/MS is characterized by high throughput, ease of operation, and low consumption of target and fragments, and therefore it promises to be a valuable method for fragment screening.

  18. An open source GIS-based tool to integrate the fragmentation mechanism in rockfall propagation

    Science.gov (United States)

    Matas, Gerard; Lantada, Nieves; Gili, Josep A.; Corominas, Jordi

    2015-04-01

    Rockfalls are frequent instability processes in road cuts, open pit mines and quarries, steep slopes and cliffs. Even though the stability of rock slopes can be determined using analytical approaches, the assessment of large rock cliffs requires simplifying assumptions due to the difficulty of working with a large number of joints and the scattering of both their orientations and strength parameters. The attitude and persistency of joints within the rock mass define the size of kinematically unstable rock volumes. Furthermore, the rock block will eventually split into several fragments during its propagation downhill due to its impact with the ground surface. Knowledge of the size, energy, trajectory… of each block resulting from fragmentation is critical in determining the vulnerability of buildings and protection structures. The objective of this contribution is to present a simple and open source tool to simulate the fragmentation mechanism in rockfall propagation models and in the calculation of impact energies. This tool includes common modes of motion for falling boulders based on the previous literature. The final tool is being implemented in a GIS (Geographic Information Systems) using open source Python programming. The tool under development will be simple, modular, compatible with any GIS environment, open source, and able to model rockfall phenomena correctly. It could be used in any area susceptible to rockfalls with a previous adjustment of the parameters. After the adjustment of the model parameters to a given area, a simulation could be performed to obtain maps of kinetic energy, frequency, stopping density and passing heights. This GIS-based tool and the analysis of the fragmentation laws using data collected from recent rockfalls have been developed within the RockRisk Project (2014-2016). This project is funded by the Spanish Ministerio de Economía y Competitividad and entitled "Rockfalls in cliffs: risk quantification and its prevention" (BIA2013-42582-P).

  19. A Fragment-Based Method of Creating Small-Molecule Libraries to Target the Aggregation of Intrinsically Disordered Proteins.

    Science.gov (United States)

    Joshi, Priyanka; Chia, Sean; Habchi, Johnny; Knowles, Tuomas P J; Dobson, Christopher M; Vendruscolo, Michele

    2016-03-14

    The aggregation process of intrinsically disordered proteins (IDPs) has been associated with a wide range of neurodegenerative disorders, including Alzheimer's and Parkinson's diseases. Currently, however, no drug in clinical use targets IDP aggregation. To facilitate drug discovery programs in this important and challenging area, we describe a fragment-based approach of generating small-molecule libraries that target specific IDPs. The method is based on the use of molecular fragments extracted from compounds reported in the literature to inhibit the aggregation of IDPs. These fragments are used to screen existing large generic libraries of small molecules to form smaller libraries specific for given IDPs. We illustrate this approach by describing three distinct small-molecule libraries to target Aβ, tau, and α-synuclein, which are three IDPs implicated in Alzheimer's and Parkinson's diseases. The strategy described here offers novel opportunities for the identification of effective molecular scaffolds for drug discovery for neurodegenerative disorders and provides insights into the mechanism of small-molecule binding to IDPs.

  20. Fenton reaction induced cancer in wild type rats recapitulates genomic alterations observed in human cancer.

    Directory of Open Access Journals (Sweden)

    Shinya Akatsuka

    Iron overload has been associated with carcinogenesis in humans. Intraperitoneal administration of ferric nitrilotriacetate initiates a Fenton reaction in renal proximal tubules of rodents that ultimately leads to a high incidence of renal cell carcinoma (RCC) after repeated treatments. We performed high-resolution microarray comparative genomic hybridization to identify characteristics in the genomic profiles of these oxidative stress-induced rat RCCs. The results revealed extensive large-scale genomic alterations with a preference for deletions. Deletions and amplifications were numerous and sometimes fragmented, demonstrating that a Fenton reaction is a cause of such genomic alterations in vivo. Frequency plotting indicated that two of the most commonly altered loci corresponded to a Cdkn2a/2b deletion and a Met amplification. Tumor sizes were proportionally associated with Met expression and/or amplification, and clustering analysis confirmed our results. Furthermore, we developed a procedure to compare whole genomic patterns of the copy number alterations among different species based on chromosomal syntenic relationship. Patterns of the rat RCCs showed the strongest similarity to the human RCCs among five types of human cancers, followed by human malignant mesothelioma, an iron overload-associated cancer. Therefore, an iron-dependent Fenton chemical reaction causes large-scale genomic alterations during carcinogenesis, which may result in distinct genomic profiles. Based on the characteristics of extensive genome alterations in human cancer, our results suggest that this chemical reaction may play a major role during human carcinogenesis.

  1. Towards the Genomic Basis of Local Adaptation in Landraces

    Directory of Open Access Journals (Sweden)

    Giandomenico Corrado

    2017-11-01

    Landraces are key elements of agricultural biodiversity that have long been considered a source of useful traits. Their importance goes beyond subsistence agriculture and the essential need to preserve genetic diversity, because landraces are farmer-developed populations that are often adapted to environmental conditions of significance to tackle environmental concerns. It is therefore increasingly important to identify adaptive traits in crop landraces and understand their molecular basis. This knowledge is potentially useful for promoting more sustainable agricultural techniques, reducing the environmental impact of high-input cropping systems, and diminishing the vulnerability of agriculture to global climate change. In this review, we present an overview of the opportunities and limitations offered by landraces’ genomics. We discuss how rapid advances in DNA sequencing techniques, plant phenotyping, and recombinant DNA-based biotechnology encourage both the identification and the validation of the genomic signature of local adaptation in crop landraces. The integration of ‘omics’ sciences, molecular population genetics, and field studies can provide information inaccessible with earlier technological tools. Although empirical knowledge on the genetic and genomic basis of local adaptation is still fragmented, it is predicted that genomic scans for adaptation will unlock an intraspecific molecular diversity that may be different from that of modern varieties.

  2. Identification of Ohnolog Genes Originating from Whole Genome Duplication in Early Vertebrates, Based on Synteny Comparison across Multiple Genomes.

    Science.gov (United States)

    Singh, Param Priya; Arora, Jatin; Isambert, Hervé

    2015-07-01

    Whole genome duplications (WGD) have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined 'ohnologs' after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at http://ohnologs.curie.fr/. In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases.

  3. GenomeVx: simple web-based creation of editable circular chromosome maps.

    Science.gov (United States)

    Conant, Gavin C; Wolfe, Kenneth H

    2008-03-15

    We describe GenomeVx, a web-based tool for making editable, publication-quality maps of mitochondrial and chloroplast genomes and of large plasmids. These maps show the location of genes and chromosomal features as well as a position scale. The program takes as input either raw feature positions or GenBank records. In the latter case, features are automatically extracted and colored, an example of which is given. Output is in the Adobe Portable Document Format (PDF) and can be edited by programs such as Adobe Illustrator. GenomeVx is available at http://wolfe.gen.tcd.ie/GenomeVx

  4. Dual Fragment Impact of PBX Charges

    Science.gov (United States)

    Haskins, Peter; Briggs, Richard; Leeming, David; White, Nathan; Cheese, Philip; DE&S MoD UK Team; Ordnance Test Solutions Ltd Team

    2017-06-01

    Fragment impact can pose a significant hazard to many systems containing explosives or propellants. Testing for this threat is most commonly carried out using a single fragment. However, it can be argued that an initial fragment strike (or strikes) could sensitise the energetic material to subsequent impacts, which may then lead to a more violent reaction than would have been predicted based upon single fragment studies. To explore this potential hazard we have developed the capability to launch 2 fragments from the same gun at a range of velocities, and achieve impacts on an acceptor charge with good control over the spatial and temporal separation of the strikes. In this paper we will describe in detail the experimental techniques we have used, both to achieve the dual fragment launch and observe the acceptor charge response. In addition, we will describe the results obtained against PBX filled explosive targets; discuss the mechanisms controlling the target response and their significance for vulnerability assessment. Results of these tests have clearly indicated the potential for detonation upon the second strike, at velocities well below those needed for shock initiation by a single fragment.

  5. The anti-CMS technique for genome-wide mapping of 5-hydroxymethylcytosine.

    Science.gov (United States)

    Huang, Yun; Pastor, William A; Zepeda-Martínez, Jorge A; Rao, Anjana

    2012-10-01

    5-Hydroxymethylcytosine (5hmC) is a recently discovered base in the mammalian genome, produced upon oxidation of 5-methylcytosine (5mC) in a process catalyzed by TET proteins. The biological functions of 5hmC and further oxidation products of 5mC are under intense investigation, as they are likely intermediates in DNA demethylation pathways. Here we describe a novel protocol to profile 5hmC at a genome-wide scale. This approach is based on sodium bisulfite-mediated conversion of 5hmC to cytosine-5-methylenesulfonate (CMS); CMS-containing DNA fragments are then immunoprecipitated using a CMS-specific antiserum. The anti-CMS technique is highly specific with a low background, and is much less dependent on 5hmC density than anti-5hmC immunoprecipitation (IP). Moreover, it does not enrich for CA and CT repeats, as noted for 5hmC DNA IP using antibodies to 5hmC. The anti-CMS protocol takes 3 d to complete.

  6. Accurate DNA assembly and genome engineering with optimized uracil excision cloning

    DEFF Research Database (Denmark)

    Cavaleiro, Mafalda; Kim, Se Hyeuk; Seppala, Susanna

    2015-01-01

    Simple and reliable DNA editing by uracil excision (a.k.a. USER cloning) has been described by several research groups, but the optimal design of cohesive DNA ends for multigene assembly remains elusive. Here, we use two model constructs based on expression of gfp and a four-gene pathway that produces β-carotene to optimize assembly junctions and the uracil excision protocol. By combining uracil excision cloning with a genomic integration technology, we demonstrate that up to six DNA fragments can be assembled in a one-tube reaction for direct genome integration with high accuracy, greatly facilitating the advanced engineering of robust cell factories.

  7. Fragmentation of a 500 MeV/nucleon 86Kr beam, investigated at the GSI projectile fragment separator

    International Nuclear Information System (INIS)

    Weber, M.; Donzaud, C.; Geissel, H.; Grewe, A.; Lewitowicz, M.; Magel, A.; Mueller, A.C.; Nickel, F.; Pfuetzner, M.; Piechaczek, A.; Pravikoff, M.; Roeckl, E.; Rykaczewski, K.; Saint-Laurent, M.G.; Schall, I.; Stephan, C.; Tassan-Got, L.; Voss, B.

    1993-10-01

    Production cross-sections and longitudinal momentum distributions have been investigated for reactions between a 500 MeV/nucleon 86Kr beam and beryllium, copper and tantalum targets. Fragments in a wide A/Z range were studied at the projectile-fragment separator FRS at GSI. The experimental production cross-sections have been used for testing the predictions obtained from a semi-empirical parameterization, a statistical abrasion model and an intranuclear-cascade model. The present study allows the production cross-sections to be extrapolated towards very neutron-rich isotopes such as the doubly magic nucleus 78Ni. For fragments close to the projectile the measured longitudinal momentum distributions agree qualitatively with a semi-empirical parameterization, which is based on the two-step picture of the fragmentation process. The momentum widths of lighter fragments, however, show deviations from this simple picture. (orig.)

  8. An improved algorithm for MFR fragment assembly

    International Nuclear Information System (INIS)

    Kontaxis, Georg

    2012-01-01

    A method for generating protein backbone models from backbone only NMR data is presented, which is based on molecular fragment replacement (MFR). In a first step, the PDB database is mined for homologous peptide fragments using experimental backbone-only data i.e. backbone chemical shifts (CS) and residual dipolar couplings (RDC). Second, this fragment library is refined against the experimental restraints. Finally, the fragments are assembled into a protein backbone fold using a rigid body docking algorithm using the RDCs as restraints. For improved performance, backbone nuclear Overhauser effects (NOEs) may be included at that stage. Compared to previous implementations of MFR-derived structure determination protocols this model-building algorithm offers improved stability and reliability. Furthermore, relative to CS-ROSETTA based methods, it provides faster performance and straightforward implementation with the option to easily include further types of restraints and additional energy terms.

  9. Study in mutation of alfalfa genome DNA due to low energy N+ implantation using RAPD

    International Nuclear Information System (INIS)

    Chen Roulei; Song Daojun; Yu Zengliang; Li Yufeng; Liang Yunzhang

    2001-01-01

    After implantation with N+ beams at various doses, the germination rate of alfalfa seeds follows a saddle-shaped curve as the dose increases. The authors studied the mutation of genomic DNA caused by low-energy N+ implantation and found that 8 of 100 primers (S41, S42, S45, S46, S50, S52, S56, S58) amplified 30 differential DNA fragments; moreover, the number of differential DNA fragments between the control (CK) and the treatments increases with dose. Consequently, low-energy ion implantation can cause mutation of alfalfa genomic DNA, and the higher the dose, the more mutations occur.

  10. The ethical introduction of genome-based information and technologies into public health.

    Science.gov (United States)

    Howard, H C; Swinnen, E; Douw, K; Vondeling, H; Cassiman, J-J; Cambon-Thomsen, A; Borry, P

    2013-01-01

    With the human genome project running from 1989 until its completion in 2003, and the incredible advances in sequencing technology and in bioinformatics during the last decade, there has been a shift towards an increased focus on studying common complex disorders which develop due to the interplay of many different genes as well as environmental factors. Although some susceptibility genes have been identified in some populations for disorders such as cancer, diabetes and cardiovascular diseases, the integration of this information into the health care system has proven to be much more problematic than for single gene disorders. Furthermore, with the $1000 genome supposedly just around the corner, and whole genome sequencing gradually being integrated into research protocols as well as in the clinical context, there is a strong push for the uptake of additional genomic testing. Indeed, the advent of public health genomics, wherein genomics would be integrated in all aspects of health care and public health, should be taken seriously. Although laudable, these advances also bring with them a slew of ethical and social issues that challenge the normative frameworks used in clinical genetics until now. With this in mind, we highlight herein 5 principles that are used as a primer to discuss the ethical introduction of genome-based information and genome-based technologies into public health.

  11. Deciphering the hybridisation history leading to the Lager lineage based on the mosaic genomes of Saccharomyces bayanus strains NBRC1948 and CBS380.

    Directory of Open Access Journals (Sweden)

    Huu-Vang Nguyen

    Saccharomyces bayanus is a yeast species described as one of the two parents of the hybrid brewing yeast S. pastorianus. Strains CBS380(T) and NBRC1948 have been retained successively as pure-line representatives of S. bayanus. In the present study, sequence analyses confirmed and upgraded our previous finding: S. bayanus type strain CBS380(T) harbours a mosaic genome. The genome of strain NBRC1948 was also revealed to be mosaic. Both genomes were characterized by amplification and sequencing of different markers, including genes involved in maltotriose utilization or genes detected by array-CGH mapping. Sequence comparisons with public Saccharomyces spp. nucleotide sequences revealed that the CBS380(T) and NBRC1948 genomes are composed of: a predominant non-cerevisiae genetic background belonging to S. uvarum, a second unidentified species provisionally named S. lagerae, and several introgressed S. cerevisiae fragments. The largest cerevisiae-introgressed DNA common to both genomes totals 70 kb in length and is distributed in three contigs, cA, cB and cC. These vary in terms of length and presence of MAL31 or MTY1 (maltotriose-transporter gene). In NBRC1948, two additional cerevisiae-contigs, cD and cE, totaling 12 kb in length, as well as several smaller cerevisiae fragments were identified. All of these contigs were partially detected in the genomes of S. pastorianus lager strains CBS1503 (S. monacensis) and CBS1513 (S. carlsbergensis), explaining the noticeable common ability of S. bayanus and S. pastorianus to metabolize maltotriose. NBRC1948 was shown to be inter-fertile with S. uvarum CBS7001. The cross involving these two strains produced F1 segregants resembling the strains CBS380(T) or NRRLY-1551. This demonstrates that these S. bayanus strains were the offspring of a cross between S. uvarum and a strain similar to NBRC1948. 
Phylogenies established with selected cerevisiae and non-cerevisiae genes allowed us to decipher the complex hybridisation

  12. HANDS: a tool for genome-wide discovery of subgenome-specific base-identity in polyploids.

    KAUST Repository

    Mithani, Aziz; Belfield, Eric J; Brown, Carly; Jiang, Caifu; Leach, Lindsey J; Harberd, Nicholas P

    2013-01-01

    The analysis of polyploid genomes is problematic because homeologous subgenome sequences are closely related. This relatedness makes it difficult to assign individual sequences to the specific subgenome from which they are derived, and hinders the development of polyploid whole genome assemblies. We here present a next-generation sequencing (NGS)-based approach for assignment of subgenome-specific base-identity at sites containing homeolog-specific polymorphisms (HSPs): 'HSP base Assignment using NGS data through Diploid Similarity' (HANDS). We show that HANDS correctly predicts subgenome-specific base-identity at >90% of assayed HSPs in the hexaploid bread wheat (Triticum aestivum) transcriptome, thus providing a substantial increase in accuracy versus previous methods for homeolog-specific base assignment. We conclude that HANDS enables rapid and accurate genome-wide discovery of homeolog-specific base-identity, a capability having multiple applications in polyploid genomics.

  13. HANDS: a tool for genome-wide discovery of subgenome-specific base-identity in polyploids.

    KAUST Repository

    Mithani, Aziz

    2013-09-24

    The analysis of polyploid genomes is problematic because homeologous subgenome sequences are closely related. This relatedness makes it difficult to assign individual sequences to the specific subgenome from which they are derived, and hinders the development of polyploid whole genome assemblies. We here present a next-generation sequencing (NGS)-based approach for assignment of subgenome-specific base-identity at sites containing homeolog-specific polymorphisms (HSPs): 'HSP base Assignment using NGS data through Diploid Similarity' (HANDS). We show that HANDS correctly predicts subgenome-specific base-identity at >90% of assayed HSPs in the hexaploid bread wheat (Triticum aestivum) transcriptome, thus providing a substantial increase in accuracy versus previous methods for homeolog-specific base assignment. We conclude that HANDS enables rapid and accurate genome-wide discovery of homeolog-specific base-identity, a capability having multiple applications in polyploid genomics.

  14. HAL: a hierarchical format for storing and analyzing multiple genome alignments.

    Science.gov (United States)

    Hickey, Glenn; Paten, Benedict; Earl, Dent; Zerbino, Daniel; Haussler, David

    2013-05-15

    Large multiple genome alignments and inferred ancestral genomes are ideal resources for comparative studies of molecular evolution, and advances in sequencing and computing technology are making them increasingly obtainable. These structures can provide a rich understanding of the genetic relationships between all subsets of species they contain. Current formats for storing genomic alignments, such as XMFA and MAF, are all indexed or ordered using a single reference genome, however, which limits the information that can be queried with respect to other species and clades. This loss of information grows with the number of species under comparison, as well as their phylogenetic distance. We present HAL, a compressed, graph-based hierarchical alignment format for storing multiple genome alignments and ancestral reconstructions. HAL graphs are indexed on all genomes they contain. Furthermore, they are organized phylogenetically, which allows for modular and parallel access to arbitrary subclades without fragmentation because of rearrangements that have occurred in other lineages. HAL graphs can be created or read with a comprehensive C++ API. A set of tools is also provided to perform basic operations, such as importing and exporting data, identifying mutations and coordinate mapping (liftover). All documentation and source code for the HAL API and tools are freely available at http://github.com/glennhickey/hal. Contact: hickey@soe.ucsc.edu or haussler@soe.ucsc.edu. Supplementary data are available at Bioinformatics online.

  15. Identification of potential glutaminyl cyclase inhibitors from lead-like libraries by in silico and in vitro fragment-based screening.

    Science.gov (United States)

    Szaszkó, Mária; Hajdú, István; Flachner, Beáta; Dobi, Krisztina; Magyar, Csaba; Simon, István; Lőrincz, Zsolt; Kapui, Zoltán; Pázmány, Tamás; Cseh, Sándor; Dormán, György

    2017-02-01

    A glutaminyl cyclase (QC) fragment library was in silico selected by disconnection of the structure of known QC inhibitors and by lead-like 2D virtual screening of the same set. The resulting fragment library (204 compounds) was acquired from commercial suppliers and pre-screened by differential scanning fluorimetry followed by functional in vitro assays. In this way, 10 fragment hits were identified (approximately 5 % hit rate, best inhibitory activity: 16 [Formula: see text]). The in vitro hits were then docked to the active site of QC, and the best scoring compounds were analyzed for binding interactions. Two fragments bound to different regions in a complementary manner, and thus, linking those fragments offered a rational strategy to generate novel QC inhibitors. Based on the structure of the virtual linked fragment, a 77-membered QC target focused library was selected from vendor databases and docked to the active site of QC. A PubChem search confirmed that the best scoring analogues are novel, potential QC inhibitors.

  16. Design and synthesis of dihydroisoquinolones for fragment-based drug discovery (FBDD).

    Science.gov (United States)

    Palmer, Nick; Peakman, Torren M; Norton, David; Rees, David C

    2016-02-07

    This study describes general synthesis aspects of fragments for FBDD, as illustrated by the dihydroisoquinolones 1-3. Previous Rh(III) methodology is extended to incorporate amines, heteroatoms (N and S), and substituents (halogen, ester) as potential binding groups and/or synthetic growth points for fragment-to-lead elaboration.

  17. Modelling of the PELE fragmentation dynamics

    Science.gov (United States)

    Verreault, J.

    2014-05-01

    The Penetrator with Enhanced Lateral Effect (PELE) is a type of explosive-free projectile that undergoes radial fragmentation upon an impact with a target plate. This type of projectile is composed of a brittle cylindrical shell (the jacket) filled in its core with a material characterized with a large Poisson's ratio. Upon an impact with a target, the axial compression causes the filling to expand in the radial direction. However, due to the brittleness of the jacket material, very little radial deformation can occur which creates a radial stress between the two materials and a hoop stress in the jacket. Fragmentation of the jacket occurs if the hoop stress exceeds the material's ultimate stress. The PELE fragmentation dynamics is explored via Finite-Element Method (FEM) simulations using the Autodyn explicit dynamics hydrocode. The numerical results are compared with an analytical model based on wave interactions, as well as with the experimental investigation of Paulus and Schirm (1996). The comparison is based on the mechanical stress in the filling and the qualitative fragmentation of the jacket.
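
    The failure criterion stated in this abstract (the jacket fragments once its hoop stress exceeds the ultimate stress of the material) can be illustrated with a rough static estimate. The sketch below is not the wave-interaction model or the FEM simulation described above: it substitutes the textbook thin-walled cylinder relation sigma = p * r / t, and every numerical value and function name in it is a hypothetical placeholder.

```python
def hoop_stress(pressure_pa: float, inner_radius_m: float,
                wall_thickness_m: float) -> float:
    """Thin-walled cylinder estimate of hoop stress: sigma = p * r / t.

    Assumption: static internal pressure and a wall much thinner than the
    radius. The abstract's own models are dynamic; this is a simplification.
    """
    return pressure_pa * inner_radius_m / wall_thickness_m


def jacket_fragments(pressure_pa: float, inner_radius_m: float,
                     wall_thickness_m: float,
                     ultimate_stress_pa: float) -> bool:
    """Apply the criterion from the abstract: hoop stress > ultimate stress."""
    sigma = hoop_stress(pressure_pa, inner_radius_m, wall_thickness_m)
    return sigma > ultimate_stress_pa


# Hypothetical jacket: 10 mm inner radius, 1 mm wall, 1.5 GPa ultimate stress.
for pressure in (100e6, 200e6):  # hypothetical filling pressures in Pa
    breaks = jacket_fragments(pressure, 10e-3, 1e-3, 1.5e9)
    print(f"p = {pressure / 1e6:.0f} MPa -> fragmentation: {breaks}")
```

    With these placeholder values the lower pressure stays below the ultimate stress and the higher one exceeds it, which is the threshold behaviour the criterion describes.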

  18. Modelling of the PELE fragmentation dynamics

    International Nuclear Information System (INIS)

    Verreault, J

    2014-01-01

    The Penetrator with Enhanced Lateral Effect (PELE) is a type of explosive-free projectile that undergoes radial fragmentation upon an impact with a target plate. This type of projectile is composed of a brittle cylindrical shell (the jacket) filled in its core with a material characterized with a large Poisson's ratio. Upon an impact with a target, the axial compression causes the filling to expand in the radial direction. However, due to the brittleness of the jacket material, very little radial deformation can occur which creates a radial stress between the two materials and a hoop stress in the jacket. Fragmentation of the jacket occurs if the hoop stress exceeds the material's ultimate stress. The PELE fragmentation dynamics is explored via Finite-Element Method (FEM) simulations using the Autodyn explicit dynamics hydrocode. The numerical results are compared with an analytical model based on wave interactions, as well as with the experimental investigation of Paulus and Schirm (1996). The comparison is based on the mechanical stress in the filling and the qualitative fragmentation of the jacket.

  19. High efficiency hydrodynamic DNA fragmentation in a bubbling system

    NARCIS (Netherlands)

    Li, Lanhui; Jin, Mingliang; Sun, Chenglong; Wang, Xiaoxue; Xie, Shuting; Zhou, Guofu; Van Den Berg, Albert; Eijkel, Jan C.T.; Shui, Lingling

    2017-01-01

    DNA fragmentation down to a precise fragment size is important for biomedical applications, disease determination, gene therapy and shotgun sequencing. In this work, a cheap, easy to operate and high efficiency DNA fragmentation method is demonstrated based on hydrodynamic shearing in a bubbling

  20. Jet fragmentation

    International Nuclear Information System (INIS)

    Saxon, D.H.

    1985-10-01

    The paper reviews studies on jet fragmentation. The subject is discussed under the topic headings: fragmentation models, charged particle multiplicity, Bose-Einstein correlations, identified hadrons in jets, heavy quark fragmentation, baryon production, gluon and quark jets compared, the string effect, and two successful models. (U.K.)

  1. Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures.

    Science.gov (United States)

    Pride, David T; Schoenfeld, Thomas

    2008-09-17

    Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant GenBank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC), where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses. From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw, a finding consistent with those of GenBank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of the Octopus and Bear Paw metagenomic contigs
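
    The idea of comparing a fragment's oligonucleotide signature against reference signatures can be illustrated with a minimal sketch. This is not the GSPC implementation: the choice of k-mer length, the Euclidean distance, and the function names `signature`, `distance` and `classify` are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product
import math


def signature(seq, k=4):
    """Return the normalized k-mer frequency vector of a DNA sequence."""
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Only k-mers made of A, C, G, T are counted; anything else is ignored.
    total = sum(counts[m] for m in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[m] / total for m in kmers]


def distance(sig_a, sig_b):
    """Euclidean distance between two signatures (an assumed metric)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(sig_a, sig_b)))


def classify(fragment, references, k=4):
    """Return the name of the reference whose signature is closest."""
    frag_sig = signature(fragment, k)
    return min(
        references,
        key=lambda name: distance(frag_sig, signature(references[name], k)),
    )


# Hypothetical toy references, not real genomes.
references = {
    "at_rich": "ATATATTAATAT" * 20,
    "gc_rich": "GCGCGGCCGCGC" * 20,
}
print(classify("ATTATATAAT" * 5, references, k=2))
```

    A real classifier would use far longer sequences and a curated reference database; the sketch only shows the compare-to-nearest-signature step.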

  2. Genome-based prediction of common diseases: Methodological considerations for future research

    NARCIS (Netherlands)

    A.C.J.W. Janssens (Cécile); P. Tikka-Kleemola (Päivi)

    2009-01-01

    The translation of emerging genomic knowledge into public health and clinical care is one of the major challenges for the coming decades. At the moment, genome-based prediction of common diseases, such as type 2 diabetes, coronary heart disease and cancer, is still not informative. Our

  3. Getting complete genomes from complex samples using nanopore sequencing

    DEFF Research Database (Denmark)

    Kirkegaard, Rasmus Hansen; Karst, Søren Michael; Albertsen, Mads

    Short read sequencing and metagenomic binning workflows have made it possible to extract bacterial genome bins from environmental microbial samples containing hundreds to thousands of different species. However, these genome bins often do not represent complete genomes, as they are mostly...... fragmented, incomplete and often contaminated with foreign DNA, and there are no robust strategies to validate their quality. These `draft genomes` have limited lasting value to the scientific community, as gene synteny is broken and it is uncertain what is missing. The genetic material most often...... missed is important multi-copy and/or conserved marker genes such as the 16S rRNA gene, as sequence micro-heterogeneity prevents assembly of these genes in the de novo assembly. We demonstrate that using nanopore long reads it is now possible to overcome these issues and make complete genomes from...

  4. Exploring Lactobacillus plantarum genome diversity by using microarrays

    NARCIS (Netherlands)

    Molenaar, D.; Bringel, F.; Schuren, F.H.; Vos, de W.M.; Siezen, R.J.; Kleerebezem, M.

    2005-01-01

    Lactobacillus plantarum is a versatile and flexible species that is encountered in a variety of niches and can utilize a broad range of fermentable carbon sources. To assess if this versatility is linked to a variable gene pool, microarrays containing a subset of small genomic fragments of L.

  5. Annotation-Based Whole Genomic Prediction and Selection

    DEFF Research Database (Denmark)

    Kadarmideen, Haja; Do, Duy Ngoc; Janss, Luc

    Genomic selection is widely used in both animal and plant species; however, it is performed with no input from the known genomic or biological roles of genetic variants and is therefore a black-box approach in a genomic era. This study investigated the role of different genomic regions and detected QTLs...... in their contribution to estimated genomic variances and in prediction of genomic breeding values by applying SNP annotation approaches to feed efficiency. Ensembl Variant Predictor (EVP) and Pig QTL database were used as the source of genomic annotation for 60K chip. Genomic prediction was performed using the Bayes...... classes. Predictive accuracy was 0.531, 0.532, 0.302, and 0.344 for DFI, RFI, ADG and BF, respectively. The contribution per SNP to total genomic variance was similar among annotated classes across different traits. Predictive performance of SNP classes did not significantly differ from randomized SNP...

  6. Insights from the Genome Sequence of Acidovorax citrulli M6, a Group I Strain of the Causal Agent of Bacterial Fruit Blotch of Cucurbits.

    Science.gov (United States)

    Eckshtain-Levi, Noam; Shkedy, Dafna; Gershovits, Michael; Da Silva, Gustavo M; Tamir-Ariel, Dafna; Walcott, Ron; Pupko, Tal; Burdman, Saul

    2016-01-01

    Acidovorax citrulli is a seedborne bacterium that causes bacterial fruit blotch of cucurbit plants including watermelon and melon. A. citrulli strains can be divided into two major groups based on DNA fingerprint analyses and biochemical properties. Group I strains have been generally isolated from non-watermelon cucurbits, while group II strains are closely associated with watermelon. In the present study, we report the genome sequence of M6, a group I model A. citrulli strain, isolated from melon. We used comparative genome analysis to investigate differences between the genome of strain M6 and the genome of the group II model strain AAC00-1. The draft genome sequence of A. citrulli M6 harbors 139 contigs, with an overall approximate size of 4.85 Mb. The genome of M6 is ∼500 Kb shorter than that of strain AAC00-1. Comparative analysis revealed that this size difference is mainly explained by eight fragments, ranging from ∼35-120 Kb and distributed throughout the AAC00-1 genome, which are absent in the M6 genome. In agreement with this finding, while AAC00-1 was found to possess 532 open reading frames (ORFs) that are absent in strain M6, only 123 ORFs in M6 were absent in AAC00-1. Most of these M6 ORFs are hypothetical proteins and most of them were also detected in two group I strains that were recently sequenced, tw6 and pslb65. Further analyses by PCR assays and coverage analyses with other A. citrulli strains support the notion that some of these fragments or significant portions of them are discriminative between groups I and II strains of A. citrulli. Moreover, GC content, effective number of codon values and cluster of orthologs' analyses indicate that these fragments were introduced into group II strains by horizontal gene transfer events. Our study reports the genome sequence of a model group I strain of A. citrulli, one of the most important pathogens of cucurbits. It also provides the first comprehensive comparison at the genomic level between the

  7. Analysis Of Segmental Duplications In The Pig Genome Based On Next-Generation Sequencing

    DEFF Research Database (Denmark)

    Fadista, João; Bendixen, Christian

    Segmental duplications are >1kb segments of duplicated DNA present in a genome with high sequence identity (>90%). They are associated with genomic rearrangements and provide a significant source of gene and genome evolution within mammalian genomes. Although segmental duplications have been...... extensively studied in other organisms, their analysis in pig has been hampered by the lack of a complete pig genome assembly. By measuring the depth of coverage of Illumina whole-genome shotgun sequencing reads of the Tabasco animal aligned to the latest pig genome assembly (Sus scrofa 10 – based also...... and their associated copy number alterations, focusing on the global organization of these segments and their possible functional significance in porcine phenotypes. This work provides insights into mammalian genome evolution and generates a valuable resource for porcine genomics research...
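
    Because the abstract is elided, the authors' actual procedure is not recoverable here. As a rough sketch of the general depth-of-coverage idea only, the following flags fixed-size windows whose mean read depth is well above the genome-wide median. The window size, the 1.5x threshold, and the name `flag_high_coverage` are assumptions, not values from the study.

```python
from statistics import median


def flag_high_coverage(depths, window=1000, ratio=1.5):
    """Return (start, end, mean_depth) for windows with elevated depth.

    depths: per-base read depth along one sequence.
    window: window length in bases (assumed value).
    ratio:  a window is flagged when its mean depth exceeds
            ratio * median of all window means (assumed threshold).
    """
    means = []
    for start in range(0, len(depths), window):
        chunk = depths[start:start + window]
        means.append((start, start + len(chunk), sum(chunk) / len(chunk)))
    if not means:
        return []
    baseline = median(m for _, _, m in means)
    return [(s, e, m) for s, e, m in means if baseline > 0 and m > ratio * baseline]


# Synthetic depth profile: one 1 kb region at 2.5x the background depth.
depths = [10] * 3000 + [25] * 1000 + [10] * 2000
print(flag_high_coverage(depths))
```

    Real pipelines also correct for GC content and mappability before calling duplications; none of that is modeled here.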

  8. Single-molecule optical genome mapping of a human HapMap and a colorectal cancer cell line.

    Science.gov (United States)

    Teo, Audrey S M; Verzotto, Davide; Yao, Fei; Nagarajan, Niranjan; Hillmer, Axel M

    2015-01-01

    Next-generation sequencing (NGS) technologies have changed our understanding of the variability of the human genome. However, the identification of genome structural variations based on NGS approaches with read lengths of 35-300 bases remains a challenge. Single-molecule optical mapping technologies allow the analysis of DNA molecules of up to 2 Mb and as such are suitable for the identification of large-scale genome structural variations, and for de novo genome assemblies when combined with short-read NGS data. Here we present optical mapping data for two human genomes: the HapMap cell line GM12878 and the colorectal cancer cell line HCT116. High molecular weight DNA was obtained by embedding GM12878 and HCT116 cells, respectively, in agarose plugs, followed by DNA extraction under mild conditions. Genomic DNA was digested with KpnI and 310,000 and 296,000 DNA molecules (≥ 150 kb and 10 restriction fragments), respectively, were analyzed per cell line using the Argus optical mapping system. Maps were aligned to the human reference by OPTIMA, a new glocal alignment method. Genome coverage of 6.8× and 5.7× was obtained, respectively; 2.9× and 1.7× more than the coverage obtained with previously available software. Optical mapping allows the resolution of large-scale structural variations of the genome, and the scaffold extension of NGS-based de novo assemblies. OPTIMA is an efficient new alignment method; our optical mapping data provide a resource for genome structure analyses of the human HapMap reference cell line GM12878, and the colorectal cancer cell line HCT116.

  9. Identification of the major structural and nonstructural proteins encoded by human parvovirus B19 and mapping of their genes by procaryotic expression of isolated genomic fragments

    Energy Technology Data Exchange (ETDEWEB)

    Cotmore, S.F.; McKie, V.C.; Anderson, L.J.; Astell, C.R.; Tattersall, P.

    1986-11-01

    Plasma from a child with homozygous sickle-cell disease, sampled during the early phase of an aplastic crisis, contained human parvovirus B19 virions. Plasma taken 10 days later (during the convalescent phase) contained both immunoglobulin M and immunoglobulin G antibodies directed against two viral polypeptides with apparent molecular weights of 83,000 and 58,000 which were present exclusively in the particulate fraction of the plasma taken during the acute phase. These two protein species comigrated at 110S on neutral sucrose velocity gradients with the B19 viral DNA and thus appear to constitute the viral capsid polypeptides. The B19 genome was molecularly cloned into a bacterial plasmid vector. Two expression constructs containing B19 sequences from different halves of the viral genome were obtained, which directed the synthesis, in bacteria, of segments of virally encoded protein. These polypeptide fragments were then purified and used to immunize rabbits. Antibodies against a protein sequence specified between nucleotides 2897 and 3749 recognized both the 83- and 58-kilodalton capsid polypeptides in aplastic plasma taken during the acute phase and detected similar proteins in the tissues of a stillborn fetus which had been infected transplacentally with B19. Antibodies against a protein sequence encoded in the other half of the B19 genome (nucleotides 1072 through 2044) did not react specifically with any protein in plasma taken during the acute phase but recognized three nonstructural polypeptides of 71, 63, and 52 kilodaltons present in the liver and, at lower levels, in some other tissues of the transplacentally infected fetus.

  10. Identification of the major structural and nonstructural proteins encoded by human parvovirus B19 and mapping of their genes by procaryotic expression of isolated genomic fragments

    International Nuclear Information System (INIS)

    Cotmore, S.F.; McKie, V.C.; Anderson, L.J.; Astell, C.R.; Tattersall, P.

    1986-01-01

    Plasma from a child with homozygous sickle-cell disease, sampled during the early phase of an aplastic crisis, contained human parvovirus B19 virions. Plasma taken 10 days later (during the convalescent phase) contained both immunoglobulin M and immunoglobulin G antibodies directed against two viral polypeptides with apparent molecular weights of 83,000 and 58,000 which were present exclusively in the particulate fraction of the plasma taken during the acute phase. These two protein species comigrated at 110S on neutral sucrose velocity gradients with the B19 viral DNA and thus appear to constitute the viral capsid polypeptides. The B19 genome was molecularly cloned into a bacterial plasmid vector. Two expression constructs containing B19 sequences from different halves of the viral genome were obtained, which directed the synthesis, in bacteria, of segments of virally encoded protein. These polypeptide fragments were then purified and used to immunize rabbits. Antibodies against a protein sequence specified between nucleotides 2897 and 3749 recognized both the 83- and 58-kilodalton capsid polypeptides in aplastic plasma taken during the acute phase and detected similar proteins in the tissues of a stillborn fetus which had been infected transplacentally with B19. Antibodies against a protein sequence encoded in the other half of the B19 genome (nucleotides 1072 through 2044) did not react specifically with any protein in plasma taken during the acute phase but recognized three nonstructural polypeptides of 71, 63, and 52 kilodaltons present in the liver and, at lower levels, in some other tissues of the transplacentally infected fetus

  11. Microarray-based whole-genome hybridization as a tool for determining procaryotic species relatedness

    Energy Technology Data Exchange (ETDEWEB)

    Wu, L.; Liu, X.; Fields, M.W.; Thompson, D.K.; Bagwell, C.E.; Tiedje, J. M.; Hazen, T.C.; Zhou, J.

    2008-01-15

    The definition and delineation of microbial species are of great importance and challenge due to the extent of evolution and diversity. Whole-genome DNA-DNA hybridization is the cornerstone for defining procaryotic species relatedness, but obtaining pairwise DNA-DNA reassociation values for a comprehensive phylogenetic analysis of procaryotes is tedious and time consuming. A previously described microarray format containing whole-genomic DNA (the community genome array or CGA) was rigorously evaluated as a high-throughput alternative to the traditional DNA-DNA reassociation approach for delineating procaryotic species relationships. DNA similarities for multiple bacterial strains obtained with the CGA-based hybridization were comparable to those obtained with various traditional whole-genome hybridization methods (r=0.87, P<0.01). Significant linear relationships were also observed between the CGA-based genome similarities and those derived from small subunit (SSU) rRNA gene sequences (r=0.79, P<0.0001), gyrB sequences (r=0.95, P<0.0001) or REP- and BOX-PCR fingerprinting profiles (r=0.82, P<0.0001). The CGA hybridization-revealed species relationships in several representative genera, including Pseudomonas, Azoarcus and Shewanella, were largely congruent with previous classifications based on various conventional whole-genome DNA-DNA reassociation, SSU rRNA and/or gyrB analyses. These results suggest that CGA-based DNA-DNA hybridization could serve as a powerful, high-throughput format for determining species relatedness among microorganisms.

  12. IMG 4 version of the integrated microbial genomes comparative analysis system

    Science.gov (United States)

    Markowitz, Victor M.; Chen, I-Min A.; Palaniappan, Krishna; Chu, Ken; Szeto, Ernest; Pillay, Manoj; Ratner, Anna; Huang, Jinghua; Woyke, Tanja; Huntemann, Marcel; Anderson, Iain; Billis, Konstantinos; Varghese, Neha; Mavromatis, Konstantinos; Pati, Amrita; Ivanova, Natalia N.; Kyrpides, Nikos C.

    2014-01-01

    The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG’s data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG’s annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Different IMG datamarts provide support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu). PMID:24165883

  13. IMG 4 version of the integrated microbial genomes comparative analysis system

    Energy Technology Data Exchange (ETDEWEB)

    Markowitz, Victor M. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Chen, I-Min A. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Palaniappan, Krishna [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Chu, Ken [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Szeto, Ernest [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Pillay, Manoj [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Ratner, Anna [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Huang, Jinghua [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Woyke, Tanja [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Huntemann, Marcel [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Anderson, Iain [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Billis, Konstantinos [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Varghese, Neha [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Mavromatis, Konstantinos [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Pati, Amrita [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Ivanova, Natalia N. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Kyrpides, Nikos C. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program

    2013-10-27

    The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG’s data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG’s annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Finally, different IMG datamarts provide support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu).

  14. Molecular cloning and restriction analysis of EcoRI-fragments of Vicia faba rDNA

    International Nuclear Information System (INIS)

    Yakura, Kimitaka; Tanifuji, Shigeyuki.

    1983-01-01

    EcoRI fragments of Vicia faba rDNA were cloned in plasmid pBR325. Southern blot hybridization of BamHI digests of these cloned plasmids and of Vicia genomic DNA led to the determination of the relative positions of BamHI sites in the rDNA and to the correction of the physical map that had been tentatively made. (author)

  15. Excited nuclei fragmentation

    International Nuclear Information System (INIS)

    Ngo, C.

    1986-11-01

    Experimental indications suggesting the fragmentation of highly excited nuclei are summarized. The theoretical approaches used to explain the phenomenon are briefly described, showing that they are based on a minimum information principle. This model is based on a time-dependent Thomas-Fermi calculation, which allows the description of mean field effects, and on a site-bond percolation model, which allows the description of fluctuations [fr

  16. Sub-fragmentation of structural-reactive-material casings under explosion

    Science.gov (United States)

    Zhang, Fan

    2015-06-01

    The sub-fragmentation of structural reactive material (SRM) thick casings is intended to generate fine fragments during casing fragmentation under explosive loading, so that their efficient energy release enhances air blast. This has been investigated using a cylindrical casing made from either rich Al-MoO3 or Al-W-based granular composites. The former composite was used to study the concept of reactive hot spots, where the reaction of reactive particles, which were distributed into the base SRM in a fuel-rich equivalence ratio, created heat and gas products during SRM fragmentation. The expansion of these distributed hot spots initiated local fractures of the casing, leading to fine fragments. The Al-W-based composite was used to investigate the concept of impedance mismatch, where shock dynamics at the interfaces of ingredients with different impedances resulted in non-uniform, high local temperatures and stresses, and at late times the dissimilar inertia resulted in different accelerations, leading to material separation and fine fragments. The casings were manufactured through both hot isostatic pressing and cold gas dynamic spray deposition. Explosion experiments were conducted in a 3 m diameter, 23 m³ cylindrical chamber for these cased charges at a casing-to-explosive mass ratio of 1.75. The results demonstrated the presence of fine fragments and more efficient fragment combustion, compared with previous results, and indicated the effectiveness of both concepts. This work was jointly funded by Defence R&D Canada and the Advanced Energetics Program of DTRA (Dr. William H. Wilson).

  17. Fragmentation of Ceramics in Rapid Expansion Mode

    Science.gov (United States)

    Maiti, Spandan; Geubelle, Philippe H.; Rangaswamy, Krishnan

    The study of the fragmentation process goes back more than a century, motivated primarily by problems related to mining and ore handling (Grady and Kipp, 1985). Various theories have been proposed to predict the fragmentation stress and the fragment size and distribution. But the investigations are generally case specific and relate to only a narrow set of fragmentation processes. A number of theoretical studies of dynamic fragmentation in a rapidly expanding body can be found in the literature. For example, the study summarized in (Grady, 1982) presents a model based on a simple energy balance concept between the surface energy released due to fracture and the kinetic energy of the fragments. Subsequent refinements of the energy balance model have been proposed by (Glenn and Chudnovsky, 1986), which take into account the strain energy of the fragments and specify a threshold stress below which no fragmentation occurs. These models assume that the fracture events are instantaneous and occur simultaneously. Evidently, these assumptions are quite restrictive, and these models cannot take into account the transient nature of the fragmentation process after the onset of fracture in the material. A more recent model proposed by (Miller et al., 1999), however, takes into account this time-dependent nature of the fragmentation event and the distribution of flaws of various strengths in the original material.

  18. Practical application of in silico fragmentation based residue screening with ion mobility high-resolution mass spectrometry.

    Science.gov (United States)

    Kaufmann, Anton; Butcher, Patrick; Maden, Kathry; Walker, Stephan; Widmer, Mirjam

    2017-07-15

    A screening concept for residues in complex matrices based on liquid chromatography coupled to ion mobility high-resolution mass spectrometry (LC/IMS-HRMS) is presented. The comprehensive four-dimensional data (chromatographic retention time, drift time, mass-to-charge and ion abundance) obtained in data-independent acquisition (DIA) mode was used for data mining. An in silico fragmenter utilizing a molecular structure database was used for suspect screening, instead of targeted screening with reference substances. The utilized data-independent acquisition mode relies on the MS^E concept, where two constantly alternating HRMS scans (low and high fragmentation energy) are acquired. Peak deconvolution and drift time alignment of ions from the low (precursor ion) and high (product ion) energy scan result in relatively clean product ion spectra. A bond dissociation in silico fragmenter (MassFragment) supplied with mol files of compounds of interest was used to explain the observed product ions of each extracted candidate component (chromatographic peak). Two complex matrices (fish and bovine liver extract) were fortified with 98 veterinary drugs. Out of 98 screened compounds 94 could be detected with the in silico based screening approach. The high correlation among drift time and m/z value of equally charged ions was utilized for an orthogonal filtration (ranking). Such an orthogonal ion mobility based filter removes multiply charged ions (e.g. peptides and proteins from the matrix) as well as noise and artefacts. Most significantly, this filtration dramatically reduces false positive findings but hardly increases false negative findings. The proposed screening approach may offer new possibilities for applications where reference compounds are hardly or not at all commercially available. Such areas may be the analysis of metabolites of drugs, pyrrolizidine alkaloids, marine toxins, derivatives of sildenafil or novel designer drugs (new psychoactive substances

  19. DebriSat - A Planned Laboratory-Based Satellite Impact Experiment for Breakup Fragment Characterization

    Science.gov (United States)

    Liou, J.-C.; Fitz-Coy, N.; Werremeyer, M.; Huynh, T.; Voelker, M.; Opiela, J.

    2012-01-01

    DebriSat is a planned laboratory-based satellite hypervelocity impact experiment. The goal of the project is to characterize the orbital debris that would be generated by a hypervelocity collision involving a modern satellite in low Earth orbit (LEO). The DebriSat project will update and expand upon the information obtained in the 1992 Satellite Orbital Debris Characterization Impact Test (SOCIT), which characterized the breakup of a 1960's US Navy Transit satellite. There are three phases to this project: the design and fabrication of an engineering model representing a modern, 50-cm/50-kg class LEO satellite known as DebriSat; conduction of a laboratory-based hypervelocity impact to catastrophically break up the satellite; and characterization of the properties of breakup fragments down to 2 mm in size. The data obtained, including fragment size, area-to-mass ratio, density, shape, material composition, optical properties, and radar cross-section distributions, will be used to supplement the DoD's and NASA's satellite breakup models to better describe the breakup outcome of a modern satellite. Updated breakup models will improve mission planning, environmental models, and event response. The DebriSat project is sponsored by the Air Force's Space and Missile Systems Center and the NASA Orbital Debris Program Office. The design and fabrication of DebriSat is led by University of Florida with subject matter experts' support from The Aerospace Corporation. The major milestones of the project include the complete fabrication of DebriSat by September 2013, the hypervelocity impact of DebriSat at the Air Force's Arnold Engineering Development Complex in early 2014, and fragment characterization and data analyses in late 2014.

  20. Visualization for genomics: the Microbial Genome Viewer.

    NARCIS (Netherlands)

    Kerkhoven, R.; Enckevort, F.H.J. van; Boekhorst, J.; Molenaar, D; Siezen, R.J.

    2004-01-01

    SUMMARY: A Web-based visualization tool, the Microbial Genome Viewer, is presented that allows the user to combine complex genomic data in a highly interactive way. This Web tool enables the interactive generation of chromosome wheels and linear genome maps from genome annotation data stored in a

  1. GENOMEPOP: A program to simulate genomes in populations

    Directory of Open Access Journals (Sweden)

    Carvajal-Rodríguez Antonio

    2008-04-01

    Background: There are several situations in population biology research where simulating DNA sequences is useful. Simulation of biological populations under different evolutionary genetic models can be undertaken using backward or forward strategies. Backward simulations, also called coalescent-based simulations, are computationally efficient. The reason is that they are based on the history of lineages with surviving offspring in the current population. On the contrary, forward simulations are less efficient because the entire population is simulated from past to present. However, the coalescent framework imposes some limitations that forward simulation does not. Hence, there is an increasing interest in forward population genetic simulation and efficient new tools have been developed recently. Software tools that allow efficient simulation of large DNA fragments under complex evolutionary models will be very helpful when trying to better understand the trace left on the DNA by the different interacting evolutionary forces. Here I will introduce GenomePop, a forward simulation program that fulfills the above requirements. The use of the program is demonstrated by studying the impact of intracodon recombination on global and site-specific dN/dS estimation. Results: I have developed algorithms and written software to efficiently simulate, forward in time, different Markovian nucleotide or codon models of DNA mutation. Such models can be combined with recombination, at inter and intra codon levels, fitness-based selection and complex demographic scenarios. Conclusion: GenomePop has many interesting characteristics for simulating SNPs or DNA sequences under complex evolutionary and demographic models. These features make it unique with respect to other simulation tools. Namely, the possibility of forward simulation under General Time Reversible (GTR) mutation or GTR×MG94 codon models with intra-codon recombination, arbitrary, user
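
    To make the forward strategy concrete, here is a minimal sketch of a forward-in-time simulation in which every individual of every generation is simulated explicitly. It is not GenomePop's algorithm: it assumes a neutral Wright-Fisher population of constant size, haploid sequences, and an equal-rate (Jukes-Cantor-like) mutation model rather than GTR, with no recombination or selection. The function name `simulate_forward` and all parameter values are hypothetical.

```python
import random


def simulate_forward(pop_size, seq_len, generations, mu, seed=0):
    """Simulate DNA sequences forward in time.

    pop_size:    constant number of haploid individuals.
    seq_len:     sequence length in nucleotides.
    generations: number of discrete generations to simulate.
    mu:          per-site, per-generation mutation probability.
    """
    rng = random.Random(seed)
    # Every individual starts from the same ancestral sequence.
    population = ["A" * seq_len for _ in range(pop_size)]
    for _ in range(generations):
        next_gen = []
        for _ in range(pop_size):
            # Neutral reproduction: each child picks a parent uniformly.
            child = list(rng.choice(population))
            for i in range(seq_len):
                if rng.random() < mu:
                    # Equal-rate mutation to one of the three other bases.
                    child[i] = rng.choice([b for b in "ACGT" if b != child[i]])
            next_gen.append("".join(child))
        population = next_gen
    return population


final = simulate_forward(pop_size=20, seq_len=30, generations=10, mu=0.01, seed=1)
print(len(set(final)), "distinct sequences in the final generation")
```

    The cost grows with population size times sequence length times generations, which is the inefficiency relative to coalescent simulation that the abstract refers to.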

  2. Analysis of human blood plasma cell-free DNA fragment size distribution using EvaGreen chemistry based droplet digital PCR assays.

    Science.gov (United States)

    Fernando, M Rohan; Jiang, Chao; Krzyzanowski, Gary D; Ryan, Wayne L

    2018-04-12

    Plasma cell-free DNA (cfDNA) fragment size distribution provides important information required for diagnostic assay development. We have developed and optimized droplet digital PCR (ddPCR) assays that quantify short and long DNA fragments. These assays were used to analyze plasma cfDNA fragment size distribution in human blood. Assays were designed to amplify 76, 135, 490 and 905 base pair fragments of the human β-actin gene. These assays were used for fragment size analysis of plasma cell-free, exosome and apoptotic body DNA obtained from normal and pregnant donors. The relative percentages for 76, 135, 490 and 905 bp fragments from non-pregnant plasma and exosome DNA were 100%, 39%, 18%, 5.6% and 100%, 40%, 18%, 3.3%, respectively. The relative percentages for pregnant plasma and exosome DNA were 100%, 34%, 14%, 23%, and 100%, 30%, 12%, 18%, respectively. The relative percentages for non-pregnant plasma pellet (obtained after 2nd centrifugation step) were 100%, 100%, 87% and 83%, respectively. Non-pregnant plasma cell-free and exosome DNA share a unique fragment distribution pattern which is different from pregnant donor plasma and exosome DNA fragment distribution, indicating the effect of physiological status on cfDNA fragment size distribution. Fragment distribution pattern for plasma pellet that includes apoptotic bodies and nuclear DNA was greatly different from plasma cell-free and exosome DNA. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
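
    The reported values read as percentages relative to the 76 bp assay, which is always 100%. Assuming that is how they were computed (the abstract does not state the formula), the arithmetic can be sketched as below. The function name `relative_percentages` is hypothetical, and the input concentrations are invented purely so that the output reproduces the 100%, 39%, 18%, 5.6% pattern quoted for non-pregnant plasma; they are not measured data.

```python
def relative_percentages(concentrations, reference=76):
    """Express each assay's concentration as a percentage of the reference assay.

    concentrations: mapping of amplicon size (bp) to measured concentration,
                    in any consistent unit (for example copies per microliter).
    reference:      amplicon size used as the 100% baseline (assumed: 76 bp).
    """
    ref = concentrations[reference]
    if ref <= 0:
        raise ValueError("reference concentration must be positive")
    return {size: 100.0 * conc / ref for size, conc in concentrations.items()}


# Invented illustrative concentrations, not values from the study.
example = {76: 200.0, 135: 78.0, 490: 36.0, 905: 11.2}
for size, pct in relative_percentages(example).items():
    print(f"{size} bp: {pct:.1f}%")
```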

  3. Humidity Effects on Fragmentation in Plasma-Based Ambient Ionization Sources.

    Science.gov (United States)

    Newsome, G Asher; Ackerman, Luke K; Johnson, Kevin J

    2016-01-01

    Post-plasma ambient desorption/ionization (ADI) sources are fundamentally dependent on surrounding water vapor to produce protonated analyte ions. There are two reports of humidity effects on ADI spectra. However, it is unclear whether humidity will affect all ADI sources and analytes, and by what mechanism humidity affects spectra. Flowing atmospheric pressure afterglow (FAPA) ionization and direct analysis in real time (DART) mass spectra of various surface-deposited and gas-phase analytes were acquired at ambient temperature and pressure across a range of observed humidity values. A controlled humidity enclosure around the ion source and mass spectrometer inlet was used to create programmed humidity and temperatures. The relative abundance and fragmentation of molecular adduct ions for several compounds consistently varied with changing ambient humidity and also were controlled with the humidity enclosure. For several compounds, increasing humidity decreased protonated molecule and other molecular adduct ion fragmentation in both FAPA and DART spectra. For others, humidity increased fragment ion ratios. The effects of humidity on molecular adduct ion fragmentation were caused by changes in the relative abundances of different reagent protonated water clusters and, thus, a change in the average difference in proton affinity between an analyte and the population of water clusters. Control of humidity in ambient post-plasma ion sources is needed to create spectral stability and reproducibility.

  4. Phylogenetic signal from rearrangements in 18 Anopheles species by joint scaffolding extant and ancestral genomes.

    Science.gov (United States)

    Anselmetti, Yoann; Duchemin, Wandrille; Tannier, Eric; Chauve, Cedric; Bérard, Sèverine

    2018-05-09

    Genome rearrangements carry valuable information for phylogenetic inference or the elucidation of molecular mechanisms of adaptation. However, the detection of genome rearrangements is often hampered by current deficiencies in data and methods: genomes obtained from short sequence reads generally have very fragmented assemblies, and comparing multiple gene orders generally leads to computationally intractable algorithmic questions. We present a computational method, ADSEQ, which, by combining ancestral gene order reconstruction, comparative scaffolding and de novo scaffolding methods, overcomes these two caveats. ADSEQ simultaneously provides improved assemblies and ancestral genomes, with statistical supports on all local features. Compared to previous comparative methods, it runs in polynomial time, it samples solutions in a probabilistic space, and it can handle a significantly larger gene complement from the considered extant genomes, with complex histories including gene duplications and losses. We use ADSEQ to provide improved assemblies and a genome history made of duplications, losses, gene translocations and rearrangements of 18 complete Anopheles genomes, including several important malaria vectors. We also provide additional support for a differentiated mode of evolution of the sex chromosome and of the autosomes in these mosquito genomes. We demonstrate the method's ability to improve extant assemblies accurately through a procedure simulating realistic assembly fragmentation. We study a debated issue regarding the phylogeny of the Gambiae complex group of Anopheles genomes in the light of the evolution of chromosomal rearrangements, suggesting that the phylogenetic signal they carry can differ from the phylogenetic signal carried by gene sequences, which are more prone to introgression.

  5. The effect of genealogy-based haplotypes on genomic prediction

    DEFF Research Database (Denmark)

    Edriss, Vahid; Fernando, Rohan L.; Su, Guosheng

    2013-01-01

    on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. Methods: A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using...... local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (pi) of the haplotype covariates had zero effect......, i.e. a Bayesian mixture method. Results: About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individual markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some...

  6. Evolution of linear chromosomes and multipartite genomes in yeast mitochondria

    Science.gov (United States)

    Valach, Matus; Farkas, Zoltan; Fricova, Dominika; Kovac, Jakub; Brejova, Brona; Vinar, Tomas; Pfeiffer, Ilona; Kucsera, Judit; Tomaska, Lubomir; Lang, B. Franz; Nosek, Jozef

    2011-01-01

    Mitochondrial genome diversity in closely related species provides an excellent platform for investigation of chromosome architecture and its evolution by means of comparative genomics. In this study, we determined the complete mitochondrial DNA sequences of eight Candida species and analyzed their molecular architectures. Our survey revealed a puzzling variability of genome architecture, including circular- and linear-mapping and multipartite linear forms. We propose that the arrangement of large inverted repeats identified in these genomes plays a crucial role in alterations of their molecular architectures. In specific arrangements, the inverted repeats appear to function as resolution elements, allowing genome conversion among different topologies, eventually leading to genome fragmentation into multiple linear DNA molecules. We suggest that molecular transactions generating linear mitochondrial DNA molecules with defined telomeric structures may parallel the evolutionary emergence of linear chromosomes and multipartite genomes in general and may provide clues for the origin of telomeres and pathways implicated in their maintenance. PMID:21266473

  7. StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform.

    Directory of Open Access Journals (Sweden)

    Wenning Zheng

    The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my.

  8. StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform.

    Science.gov (United States)

    Zheng, Wenning; Tan, Tze King; Paterson, Ian C; Mutha, Naresh V R; Siow, Cheuk Chuen; Tan, Shi Yang; Old, Lesley A; Jakubovics, Nicholas S; Choo, Siew Woh

    2016-01-01

    The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my.

  9. Fragmentation of Continental United States Forests

    Science.gov (United States)

    Kurt H. Riitters; James D. Wickham; Robert V. O' Neill; K. Bruce Jones; Elizabeth R. Smith; John W. Coulston; Timothy G. Wade; Jonathan H. Smith

    2002-01-01

    We report a multiple-scale analysis of forest fragmentation based on 30-m (0.09 ha pixel⁻¹) land-cover maps for the conterminous United States. Each 0.09-ha unit of forest was classified according to fragmentation indexes measured within the surrounding landscape, for five landscape sizes including 2.25, 7.29, 65.61, 590.49, and 5314.41 ha....
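    The five landscape sizes quoted in this record are consistent with square windows of 5, 9, 27, 81 and 243 pixels on a side, given 30-m pixels of 0.09 ha each. The window side lengths are an inference from the arithmetic, not something stated in the abstract; the short check below is a sketch under that assumption.

```python
PIXEL_SIDE_M = 30.0      # pixel edge length stated in the abstract
M2_PER_HA = 10_000.0     # square metres per hectare

def window_area_ha(side_pixels):
    """Area in hectares of a square window `side_pixels` pixels on a side."""
    pixel_area_ha = PIXEL_SIDE_M ** 2 / M2_PER_HA  # 0.09 ha per pixel
    return side_pixels ** 2 * pixel_area_ha

# Window sides inferred from the reported areas (assumption).
for side in (5, 9, 27, 81, 243):
    print(f"{side} x {side} pixels: {window_area_ha(side):.2f} ha")
```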

  10. Relationship between metabolic and genomic diversity in sesame (Sesamum indicum L.)

    Directory of Open Access Journals (Sweden)

    Karlovsky Petr

    2008-05-01

    Background: Diversity estimates in cultivated plants provide a rationale for conservation strategies and support the selection of starting material for breeding programs. Diversity measures applied to crops usually have been limited to the assessment of genome polymorphism at the DNA level. Occasionally, selected morphological features are recorded and the content of key chemical constituents determined, but unbiased and comprehensive chemical phenotypes have not been included systematically in diversity surveys. Our objective in this study was to assess metabolic diversity in sesame by nontargeted metabolic profiling and elucidate the relationship between metabolic and genome diversity in this crop. Results: Ten sesame accessions were selected that represent most of the genome diversity of sesame grown in India, Western Asia, Sudan and Venezuela based on previous AFLP studies. Ethanolic seed extracts were separated by HPLC, metabolites were ionized by positive and negative electrospray and ions were detected with an ion trap mass spectrometer in full-scan mode for m/z from 50 to 1000. Genome diversity was determined by Amplified Fragment Length Polymorphism (AFLP) using eight primer pair combinations. The relationship between biodiversity at the genome and at the metabolome levels was assessed by correlation analysis and multivariate statistics. Conclusion: Patterns of diversity at the genomic and metabolic levels differed, indicating that selection played a significant role in the evolution of metabolic diversity in sesame. This result implies that when used for the selection of genotypes in breeding and conservation, diversity assessment based on neutral DNA markers should be complemented with metabolic profiles. We hypothesize that this applies to all crops with a long history of domestication that possess commercially relevant traits affected by chemical phenotypes.

  11. Genomic clones of bovine parvovirus: Construction and effect of deletions and terminal sequence inversions on infectivity

    International Nuclear Information System (INIS)

    Shull, B.C.; Chen, K.C.; Lederman, M.; Stout, E.R.; Bates, R.C.

    1988-01-01

    Genomic clones of the autonomous parvovirus bovine parvovirus (BPV) were constructed by blunt-end ligation of reannealed virion plus and minus DNA strands into the plasmid pUC8. These clones were stable during propagation in Escherichia coli JM107. All clones tested were found to be infectious by the criteria of plaque titer and progressive cytopathic effect after transfection into bovine fetal lung cells. Sequencing of the recombinant plasmids demonstrated that all of the BPV inserts had left-end (3')-terminal deletions of up to 34 bases. Defective genomes could also be detected in the progeny DNA even though the infection was initiated with homogeneous, cloned DNA. Full-length genomic clones with 3' flip and 3' flop conformations were constructed and were found to have equal infectivity. Expression of capsid proteins from transfected genomes was demonstrated by hemagglutination, indirect immunofluorescence, and immunoprecipitation of [35S]methionine-labeled cell lysates. Use of appropriate antiserum for immunoprecipitation showed the synthesis of BPV capsid and noncapsid proteins after transfection. Independently, a series of genomic clones with increasingly larger 3'-terminal deletions was prepared from separately subcloned 3'-terminal fragments. Transfection of these clones into bovine fetal lung cells revealed that deletions of up to 34 bases at the 3' end lowered but did not abolish infectivity, while deletions of greater than 52 bases were lethal. End-label analysis showed that the 34-base deletion was repaired to wild-type length in the progeny virus.

  12. Quantitative metagenomic analyses based on average genome size normalization

    DEFF Research Database (Denmark)

    Frank, Jeremy Alexander; Sørensen, Søren Johannes

    2011-01-01

    provide not just a census of the community members but direct information on metabolic capabilities and potential interactions among community members. Here we introduce a method for the quantitative characterization and comparison of microbial communities based on the normalization of metagenomic data...... marine sources using both conventional small-subunit (SSU) rRNA gene analyses and our quantitative method to calculate the proportion of genomes in each sample that are capable of a particular metabolic trait. With both environments, to determine what proportion of each community they make up and how......). These analyses demonstrate how genome proportionality compares to SSU rRNA gene relative abundance and how factors such as average genome size and SSU rRNA gene copy number affect sampling probability and therefore both types of community analysis....

  13. Split photosystem protein, linear-mapping topology, and growth of structural complexity in the plastid genome of chromera velia

    KAUST Repository

    Janouškovec, Jan

    2013-08-22

    The canonical photosynthetic plastid genomes consist of a single circular-mapping chromosome that encodes a highly conserved protein core, involved in photosynthesis and ATP generation. Here, we demonstrate that the plastid genome of the photosynthetic relative of apicomplexans, Chromera velia, departs from this view in several unique ways. Core photosynthesis proteins PsaA and AtpB have been broken into two fragments, which we show are independently transcribed, oligoU-tailed, translated, and assembled into functional photosystem I and ATP synthase complexes. Genome-wide transcription profiles support expression of many other highly modified proteins, including several that contain extensions amounting to hundreds of amino acids in length. Canonical gene clusters and operons have been fragmented and reshuffled into novel putative transcriptional units. Massive genomic coverage by paired-end reads, coupled with pulsed-field gel electrophoresis and polymerase chain reaction, consistently indicate that the C. velia plastid genome is linear-mapping, a unique state among all plastids. Abundant intragenomic duplication probably mediated by recombination can explain protein splits, extensions, and genome linearization and is perhaps the key driving force behind the many features that defy the conventional ways of plastid genome architecture and function. © The Author 2013.

  14. Functional role of a highly repetitive DNA sequence in anchorage of the mouse genome.

    Science.gov (United States)

    Neuer-Nitsche, B; Lu, X N; Werner, D

    1988-09-12

    The major portion of the eukaryotic genome consists of various categories of repetitive DNA sequences which have been studied with respect to their base compositions, organizations, copy numbers, transcription and species specificities; their biological roles, however, are still unclear. A novel quality of a highly repetitive mouse DNA sequence is described which points to a functional role: All copies (approximately 50,000 per haploid genome) of this DNA sequence reside on genomic Alu I DNA fragments each associated with nuclear polypeptides that are not released from DNA by proteinase K, SDS and phenol extraction. By this quality the repetitive DNA sequence is classified as a member of the sub-set of DNA sequences involved in tight DNA-polypeptide complexes which have been previously shown to be components of the subnuclear structure termed 'nuclear matrix'. From these results it has to be concluded that the repetitive DNA sequence characterized in this report represents or comprises a signal for a large number of site specific attachment points of the mouse genome in the nuclear matrix.

  15. Using nanopore sequencing to get complete genomes from complex samples

    DEFF Research Database (Denmark)

    Kirkegaard, Rasmus Hansen; Karst, Søren Michael; Nielsen, Per Halkjær

    The advantages of “next generation sequencing” have come at the cost of genome finishing. The dominant sequencing technology provides short reads of 150-300 bp, which has made genome assembly very difficult as the reads do not span important repeat regions. Genomes have thus been added...... to the databases as fragmented assemblies and not as finished contigs that resemble the chromosomes in which the DNA is organised within the cells. This is especially troublesome for genomes derived from complex metagenome sequencing. Databases with incomplete genomes can lead to false conclusions about...... the absence of genes and functional predictions of the organisms. Furthermore, it is common that repetitive elements and marker genes such as the 16S rRNA gene are missing completely from these genome bins. Using nanopore long reads, we demonstrate that it is possible to span these regions and make complete...

  16. Structures of endothiapepsin-fragment complexes from crystallographic fragment screening using a novel, diverse and affordable 96-compound fragment library.

    Science.gov (United States)

    Huschmann, Franziska U; Linnik, Janina; Sparta, Karine; Ühlein, Monika; Wang, Xiaojie; Metz, Alexander; Schiebel, Johannes; Heine, Andreas; Klebe, Gerhard; Weiss, Manfred S; Mueller, Uwe

    2016-05-01

    Crystallographic screening of the binding of small organic compounds (termed fragments) to proteins is increasingly important for medicinal chemistry-oriented drug discovery. To enable such experiments in a widespread manner, an affordable 96-compound library has been assembled for fragment screening in both academia and industry. The library is selected from already existing protein-ligand structures and is characterized by a broad ligand diversity, including buffer ingredients, carbohydrates, nucleotides, amino acids, peptide-like fragments and various drug-like organic compounds. When applied to the model protease endothiapepsin in a crystallographic screening experiment, a hit rate of nearly 10% was obtained. In comparison to other fragment libraries and considering that no pre-screening was performed, this hit rate is remarkably high. This demonstrates the general suitability of the selected compounds for an initial fragment-screening campaign. The library composition, experimental considerations and time requirements for a complete crystallographic fragment-screening campaign are discussed as well as the nine fully refined obtained endothiapepsin-fragment structures. While most of the fragments bind close to the catalytic centre of endothiapepsin in poses that have been observed previously, two fragments address new sites on the protein surface. ITC measurements show that the fragments bind to endothiapepsin with millimolar affinity.

  17. Structures of endothiapepsin–fragment complexes from crystallographic fragment screening using a novel, diverse and affordable 96-compound fragment library

    Science.gov (United States)

    Huschmann, Franziska U.; Linnik, Janina; Sparta, Karine; Ühlein, Monika; Wang, Xiaojie; Metz, Alexander; Schiebel, Johannes; Heine, Andreas; Klebe, Gerhard; Weiss, Manfred S.; Mueller, Uwe

    2016-01-01

    Crystallographic screening of the binding of small organic compounds (termed fragments) to proteins is increasingly important for medicinal chemistry-oriented drug discovery. To enable such experiments in a widespread manner, an affordable 96-compound library has been assembled for fragment screening in both academia and industry. The library is selected from already existing protein–ligand structures and is characterized by a broad ligand diversity, including buffer ingredients, carbohydrates, nucleotides, amino acids, peptide-like fragments and various drug-like organic compounds. When applied to the model protease endothiapepsin in a crystallographic screening experiment, a hit rate of nearly 10% was obtained. In comparison to other fragment libraries and considering that no pre-screening was performed, this hit rate is remarkably high. This demonstrates the general suitability of the selected compounds for an initial fragment-screening campaign. The library composition, experimental considerations and time requirements for a complete crystallographic fragment-screening campaign are discussed as well as the nine fully refined obtained endothiapepsin–fragment structures. While most of the fragments bind close to the catalytic centre of endothiapepsin in poses that have been observed previously, two fragments address new sites on the protein surface. ITC measurements show that the fragments bind to endothiapepsin with millimolar affinity. PMID:27139825

  18. Whole genome PCR scanning reveals the syntenic genome structure of toxigenic Vibrio cholerae strains in the O1/O139 population.

    Directory of Open Access Journals (Sweden)

    Bo Pang

    Vibrio cholerae is commonly found in estuarine water systems. Toxigenic O1 and O139 V. cholerae strains have caused cholera epidemics and pandemics, whereas the nontoxigenic strains within these serogroups only occasionally lead to disease. To understand the differences in the genome and clonality between the toxigenic and nontoxigenic strains of V. cholerae serogroups O1 and O139, we employed a whole genome PCR scanning (WGPScanning) method, an rrn operon-mediated fragment rearrangement analysis and comparative genomic hybridization (CGH) to analyze the genome structure of different strains. WGPScanning in conjunction with CGH revealed that the genomic contents of the toxigenic strains were conserved, except for a few indels located mainly in mobile elements. Minor nucleotide variation in orthologous genes appeared to be the major difference between the toxigenic strains. rrn operon-mediated rearrangements were infrequent in El Tor toxigenic strains tested using I-CeuI digested pulsed-field gel electrophoresis (PFGE) analysis and PCR analysis based on flanking sequence of rrn operons. Using these methods, we found that the genomic structures of toxigenic El Tor and O139 strains were syntenic. The nontoxigenic strains exhibited more extensive sequence variations, but toxin coregulated pilus positive (TCP+) strains had a similar structure. TCP+ nontoxigenic strains could be subdivided into multiple lineages according to the TCP type, suggesting the existence of complex intermediates in the evolution of toxigenic strains. The data indicate that toxigenic O1 El Tor and O139 strains were derived from a single lineage of intermediates from complex clones in the environment. The nontoxigenic strains with non-El Tor type TCP may yet evolve into new epidemic clones after attaining toxigenic attributes.

  19. A Bioinorganic Approach to Fragment-Based Drug Discovery Targeting Metalloenzymes.

    Science.gov (United States)

    Cohen, Seth M

    2017-08-15

    Metal-dependent enzymes (i.e., metalloenzymes) make up a large fraction of all enzymes and are critically important in a wide range of biological processes, including DNA modification, protein homeostasis, antibiotic resistance, and many others. Consequently, metalloenzymes represent a vast and largely untapped space for drug development. The discovery of effective therapeutics that target metalloenzymes lies squarely at the interface of bioinorganic and medicinal chemistry and requires expertise, methods, and strategies from both fields to mount an effective campaign. In this Account, our research program that brings together the principles and methods of bioinorganic and medicinal chemistry is described, in an effort to bridge the gap between these fields and address an important class of medicinal targets. Fragment-based drug discovery (FBDD) is an important drug discovery approach that is particularly well suited for metalloenzyme inhibitor development. FBDD uses relatively small but diverse chemical structures that allow for the assembly of privileged molecular collections that focus on a specific feature of the target enzyme. For metalloenzyme inhibition, the specific feature is rather obvious, namely, a metal-dependent active site. Surprisingly, prior to our work, the binding of diverse molecular fragments to the metal active sites of metalloenzymes was largely unexplored. By assembling a modest library of metal-binding pharmacophores (MBPs), we have been able to find lead hits for many metalloenzymes and, from these hits, develop inhibitors that act via novel mechanisms of action. A specific case study on the use of this strategy to identify a first-in-class inhibitor of zinc-dependent Rpn11 (a component of the proteasome) is highlighted. The application of FBDD for the development of metalloenzyme inhibitors has raised several other compelling questions, such as how the metalloenzyme active site influences the coordination chemistry of bound

  20. Searching Fragment Spaces with feature trees.

    Science.gov (United States)

    Lessel, Uta; Wellenzohn, Bernd; Lilienthal, Markus; Claussen, Holger

    2009-02-01

    Virtual combinatorial chemistry easily produces billions of compounds, for which conventional virtual screening cannot be performed even with the fastest methods available. An efficient solution for such a scenario is the generation of Fragment Spaces, which encode huge numbers of virtual compounds by their fragments/reagents and rules of how to combine them. Similarity-based searches can be performed in such spaces without ever fully enumerating all virtual products. Here we describe the generation of a huge Fragment Space encoding about 5 × 10^11 compounds based on established in-house synthesis protocols for combinatorial libraries, i.e., we encode practically evaluated combinatorial chemistry protocols in a machine-readable form, rendering them accessible to in silico search methods. We show how such searches in this Fragment Space can be integrated as a first step in an overall workflow. It reduces the huge number of virtual products by several orders of magnitude so that the resulting list of molecules becomes more manageable for further, more elaborate and time-consuming analysis steps. Results of a case study are presented and discussed, which lead to some general conclusions for an efficient expansion of the chemical space to be screened in pharmaceutical companies.
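    The point that a Fragment Space encodes far more products than it stores can be shown with simple counting. The sketch below is not the feature-tree search method of this record: it only counts the products implied by a set of combinatorial protocols, assuming every reagent combination within a protocol is valid and that protocols share no products. The protocol names and reagent counts are hypothetical.

```python
from math import prod

def virtual_library_size(reagent_counts):
    """Number of products of one protocol: the product of its reagent-pool sizes."""
    return prod(reagent_counts)

def fragment_space_size(protocols):
    """Total number of encoded products, summed over all protocols."""
    return sum(virtual_library_size(counts) for counts in protocols.values())

def stored_fragments(protocols):
    """Number of reagents that actually need to be stored."""
    return sum(sum(counts) for counts in protocols.values())

# Hypothetical protocols: name -> size of each reagent pool.
protocols = {
    "two_component_coupling": (2000, 3000),
    "three_component_reaction": (500, 400, 600),
}
print("stored fragments:", stored_fragments(protocols))
print("encoded products:", fragment_space_size(protocols))
```

    With these made-up numbers, 6,500 stored reagents stand for 126 million virtual products, which is why searching the space by fragments rather than by enumerated products is attractive.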

  1. ChIP on SNP-chip for genome-wide analysis of human histone H4 hyperacetylation

    Directory of Open Access Journals (Sweden)

    Porter Christopher J

    2007-09-01

    Background: SNP microarrays are designed to genotype Single Nucleotide Polymorphisms (SNPs). These microarrays report hybridization of DNA fragments and therefore can be used for the purpose of detecting genomic fragments. Results: Here, we demonstrate that a SNP microarray can be effectively used in this way to perform chromatin immunoprecipitation (ChIP) on chip as an alternative to tiling microarrays. We illustrate this novel application by mapping whole genome histone H4 hyperacetylation in human myoblasts and myotubes. We detect clusters of hyperacetylated histone H4, often spanning up to 300 kilobases of genomic sequence. Using complementary genome-wide analyses of gene expression by DNA microarray we demonstrate that these clusters of hyperacetylated histone H4 tend to be associated with expressed genes. Conclusion: The use of a SNP array for a ChIP-on-chip application (ChIP on SNP-chip) will be of great value to laboratories whose interest is the determination of general rules regarding the relationship of specific chromatin modifications to transcriptional status throughout the genome and to examine the asymmetric modification of chromatin at heterozygous loci.

  2. Analysis of cis-elements that facilitate extrachromosomal persistence of human papillomavirus genomes

    International Nuclear Information System (INIS)

    Pittayakhajonwut, Daraporn; Angeletti, Peter C.

    2008-01-01

    Human papillomaviruses (HPVs) are maintained latently in dividing epithelial cells as nuclear plasmids. Two virally encoded proteins, E1, a helicase, and E2, a transcription factor, are important players in replication and stable plasmid maintenance in host cells. Recent experiments in yeast have demonstrated that viral genomes retain replication and maintenance function independently of E1 and E2 [Angeletti, P.C., Kim, K., Fernandes, F.J., and Lambert, P.F. (2002). Stable replication of papillomavirus genomes in Saccharomyces cerevisiae. J. Virol. 76(7), 3350-8; Kim, K., Angeletti, P.C., Hassebroek, E.C., and Lambert, P.F. (2005). Identification of cis-acting elements that mediate the replication and maintenance of human papillomavirus type 16 genomes in Saccharomyces cerevisiae. J. Virol. 79(10), 5933-42]. Flow cytometry studies of EGFP-reporter vectors containing subgenomic HPV fragments with or without a human ARS (hARS) revealed that six fragments located in the E6-E7, E1-E2, L1, and L2 regions showed a capacity for plasmid stabilization in the absence of E1 and E2 proteins. Interestingly, four fragments within E7, the 3' end of L2, and the 5' end of L1 exhibited stability in plasmids that lacked an hARS, indicating that they possess both replication and maintenance functions. Two fragments lying in E1-E2 and the 3' region of L1 were stable only in the presence of hARS, indicating that they contained only maintenance function. Mutational analyses of HPV16-GFP reporter constructs provided evidence that genomes lacking E1 and E2 could replicate to an extent similar to wild type HPV16. Together these results support the concept that cellular factors influence HPV replication and maintenance, independently, and perhaps in conjunction with E1 and E2, suggesting a role in the persistent phase of the viral lifecycle.

  3. A critique of race-based and genomic medicine.

    Science.gov (United States)

    Meier, Robert J

    2012-03-01

    Now that a composite human genome has been sequenced (HGP), research has accelerated to discover precise genetic bases of several chronic health issues, particularly in the realms of cancer and cardiovascular disease. It is anticipated that in the future it will be possible and cost effective to regularly sequence individual genomes, and thereby produce a DNA profile that potentially can be used to assess the health risks for each person with respect to certain genetically predisposed conditions. Coupled with that enormous diagnostic power, it will then depend upon equally rapid research efforts to develop personalized courses of treatment, including that of pharmaceutical therapy. Initial treatment attempts have been made to match drug efficacy and safety to individuals of assigned or self-identified groups according to their genetic ancestry or presumed race. A prime example is that of BiDil, which was the first drug approved by the US FDA for the explicit treatment of heart patients of African American ancestry. This race-based approach to medicine has been met with justifiable criticism, notably on ethical grounds that have long plagued historical applications and misuses of human race classification, and also on questionable science. This paper will assess race-based medical research and practice in light of a more thorough understanding of human genetic variability. Additional concerns will be expressed with regard to the rapidly developing area of pharmacogenomics, promoted to be the future of personalized medicine. Genomic epidemiology will be discussed with several examples of on-going research that hopefully will provide a solid scientific grounding for personalized medicine to build upon.

  4. Clustering document fragments using background color and texture information

    Science.gov (United States)

    Chanda, Sukalpa; Franke, Katrin; Pal, Umapada

    2012-01-01

    Forensic analysis of questioned documents sometimes can be extensively data intensive. A forensic expert might need to analyze a heap of document fragments and in such cases to ensure reliability he/she should focus only on relevant evidences hidden in those document fragments. Relevant document retrieval needs finding of similar document fragments. One notion of obtaining such similar documents could be by using document fragment's physical characteristics like color, texture, etc. In this article we propose an automatic scheme to retrieve similar document fragments based on visual appearance of document paper and texture. Multispectral color characteristics using biologically inspired color differentiation techniques are implemented here. This is done by projecting document color characteristics to Lab color space. Gabor filter-based texture analysis is used to identify document texture. It is desired that document fragments from same source will have similar color and texture. For clustering similar document fragments of our test dataset we use a Self Organizing Map (SOM) of dimension 5×5, where the document color and texture information are used as features. We obtained an encouraging accuracy of 97.17% from 1063 test images.

  5. Towards a population synthesis model of self-gravitating disc fragmentation and tidal downsizing II: the effect of fragment-fragment interactions

    Science.gov (United States)

    Forgan, D. H.; Hall, C.; Meru, F.; Rice, W. K. M.

    2018-03-01

    It is likely that most protostellar systems undergo a brief phase where the protostellar disc is self-gravitating. If these discs are prone to fragmentation, then they are able to rapidly form objects that are initially of several Jupiter masses and larger. The fate of these disc fragments (and the fate of planetary bodies formed afterwards via core accretion) depends sensitively not only on the fragment's interaction with the disc, but also with its neighbouring fragments. We return to and revise our population synthesis model of self-gravitating disc fragmentation and tidal downsizing. Amongst other improvements, the model now directly incorporates fragment-fragment interactions while the disc is still present. We find that fragment-fragment scattering dominates the orbital evolution, even when we enforce rapid migration and inefficient gap formation. Compared to our previous model, we see a small increase in the number of terrestrial-type objects being formed, although their survival under tidal evolution is at best unclear. We also see evidence for disrupted fragments with evolved grain populations - this is circumstantial evidence for the formation of planetesimal belts, a phenomenon not seen in runs where fragment-fragment interactions are ignored. In spite of intense dynamical evolution, our population is dominated by massive giant planets and brown dwarfs at large semimajor axis, which direct imaging surveys should, but only rarely, detect. Finally, disc fragmentation is shown to be an efficient manufacturer of free-floating planetary mass objects, and the typical multiplicity of systems formed via gravitational instability will be low.

  6. Full mitochondrial genome sequences of two endemic Philippine hornbill species (Aves: Bucerotidae) provide evidence for pervasive mitochondrial DNA recombination.

    Science.gov (United States)

    Sammler, Svenja; Bleidorn, Christoph; Tiedemann, Ralph

    2011-01-14

    Although nowadays it is broadly accepted that mitochondrial DNA (mtDNA) may undergo recombination, the frequency of such recombination remains controversial. Its estimation is not straightforward, as recombination under homoplasmy (i.e., among identical mt genomes) is likely to be overlooked. In species with tandem duplications of large mtDNA fragments the detection of recombination can be facilitated, as it can lead to gene conversion among duplicates. Although the mechanisms for concerted evolution in mtDNA are not fully understood yet, recombination rates have been estimated from "one per speciation event" down to 850 years or even "during every replication cycle". Here we present the first complete mt genome of the avian family Bucerotidae, i.e., that of two Philippine hornbills, Aceros waldeni and Penelopides panini. The mt genomes are characterized by a tandemly duplicated region encompassing part of cytochrome b, 3 tRNAs, NADH6, and the control region. The duplicated fragments are identical to each other except for a short section in domain I and for the length of repeat motifs in domain III of the control region. Due to the heteroplasmy with regard to the number of these repeat motifs, there is some size variation in both genomes; with around 21,657 bp (A. waldeni) and 22,737 bp (P. panini), they significantly exceed the hitherto longest known avian mt genomes, that of the albatrosses. We discovered concerted evolution between the duplicated fragments within individuals. The existence of differences between individuals in coding genes as well as in the control region, which are maintained between duplicates, indicates that recombination apparently occurs frequently, i.e., in every generation. The homogenised duplicates are interspersed by a short fragment which shows no sign of recombination. We hypothesize that this region corresponds to the so-called Replication Fork Barrier (RFB), which has been described from the chicken mitochondrial genome. As this RFB ...

  7. Full mitochondrial genome sequences of two endemic Philippine hornbill species (Aves: Bucerotidae) provide evidence for pervasive mitochondrial DNA recombination

    Directory of Open Access Journals (Sweden)

    Bleidorn Christoph

    2011-01-01

    Background: Although nowadays it is broadly accepted that mitochondrial DNA (mtDNA) may undergo recombination, the frequency of such recombination remains controversial. Its estimation is not straightforward, as recombination under homoplasmy (i.e., among identical mt genomes) is likely to be overlooked. In species with tandem duplications of large mtDNA fragments the detection of recombination can be facilitated, as it can lead to gene conversion among duplicates. Although the mechanisms for concerted evolution in mtDNA are not fully understood yet, recombination rates have been estimated from "one per speciation event" down to 850 years or even "during every replication cycle". Results: Here we present the first complete mt genome of the avian family Bucerotidae, i.e., that of two Philippine hornbills, Aceros waldeni and Penelopides panini. The mt genomes are characterized by a tandemly duplicated region encompassing part of cytochrome b, 3 tRNAs, NADH6, and the control region. The duplicated fragments are identical to each other except for a short section in domain I and for the length of repeat motifs in domain III of the control region. Due to the heteroplasmy with regard to the number of these repeat motifs, there is some size variation in both genomes; with around 21,657 bp (A. waldeni) and 22,737 bp (P. panini), they significantly exceed the hitherto longest known avian mt genomes, that of the albatrosses. We discovered concerted evolution between the duplicated fragments within individuals. The existence of differences between individuals in coding genes as well as in the control region, which are maintained between duplicates, indicates that recombination apparently occurs frequently, i.e., in every generation. Conclusions: The homogenised duplicates are interspersed by a short fragment which shows no sign of recombination. We hypothesize that this region corresponds to the so-called Replication Fork Barrier (RFB), which has been ...

  8. The Vigna Genome Server, 'VigGS': A Genomic Knowledge Base of the Genus Vigna Based on High-Quality, Annotated Genome Sequence of the Azuki Bean, Vigna angularis (Willd.) Ohwi & Ohashi.

    Science.gov (United States)

    Sakai, Hiroaki; Naito, Ken; Takahashi, Yu; Sato, Toshiyuki; Yamamoto, Toshiya; Muto, Isamu; Itoh, Takeshi; Tomooka, Norihiko

    2016-01-01

    The genus Vigna includes legume crops such as cowpea, mungbean and azuki bean, as well as >100 wild species. A number of the wild species are highly tolerant to severe environmental conditions including high-salinity, acid or alkaline soil; drought; flooding; and pests and diseases. These features of the genus Vigna make it a good target for investigation of genetic diversity in adaptation to stressful environments; however, a lack of genomic information has hindered such research in this genus. Here, we present a genome database of the genus Vigna, Vigna Genome Server ('VigGS', http://viggs.dna.affrc.go.jp), based on the recently sequenced azuki bean genome, which incorporates annotated exon-intron structures, along with evidence for transcripts and proteins, visualized in GBrowse. VigGS also facilitates user construction of multiple alignments between azuki bean genes and those of six related dicot species. In addition, the database displays sequence polymorphisms between azuki bean and its wild relatives and enables users to design primer sequences targeting any variant site. VigGS offers a simple keyword search in addition to sequence similarity searches using BLAST and BLAT. To incorporate up to date genomic information, VigGS automatically receives newly deposited mRNA sequences of pre-set species from the public database once a week. Users can refer to not only gene structures mapped on the azuki bean genome on GBrowse but also relevant literature of the genes. VigGS will contribute to genomic research into plant biotic and abiotic stresses and to the future development of new stress-tolerant crops. © The Author 2015. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  9. Application of the fragment molecular orbital method analysis to fragment-based drug discovery of BET (bromodomain and extra-terminal proteins) inhibitors.

    Science.gov (United States)

    Ozawa, Motoyasu; Ozawa, Tomonaga; Ueda, Kazuyoshi

    2017-06-01

    The molecular interactions of inhibitors of bromodomains (BRDs) were investigated. BRDs are protein interaction modules that recognize ε-N-acetyl-lysine (εAc-Lys) motifs found in histone tails and are promising protein-protein interaction (PPI) targets. First, we analyzed a peptide ligand containing εAc-Lys to evaluate native PPIs. We then analyzed tetrahydroquinazoline-6-yl-benzensulfonamide derivatives found by fragment-based drug design (FBDD) and examined their interactions with the protein compared with the peptide ligand in terms of the inter-fragment interaction energy. In addition, we analyzed benzodiazepine derivatives that are high-affinity ligands for BRDs and examined differences in the CH/π interactions of the amino acid residues. We further surveyed changes in the charges of the amino acid residues among individual ligands, performed pair interaction energy decomposition analysis and estimated the water profile within the ligand binding site. Thus, useful insights for drug design were provided. Through these analyses and considerations, we show that the FMO method is a useful drug design tool to evaluate the process of FBDD and to explore PPI inhibitors. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. Target immobilization as a strategy for NMR-based fragment screening: comparison of TINS, STD, and SPR for fragment hit identification.

    Science.gov (United States)

    Kobayashi, Masakazu; Retra, Kim; Figaroa, Francis; Hollander, Johan G; Ab, Eiso; Heetebrij, Robert J; Irth, Hubertus; Siegal, Gregg

    2010-09-01

    Fragment-based drug discovery (FBDD) has become a widely accepted tool that is complementary to high-throughput screening (HTS) in developing small-molecule inhibitors of pharmaceutical targets. Because a fragment campaign can only be as successful as the hit matter found, it is critical that the first stage of the process be optimized. Here the authors compare the 3 most commonly used methods for hit discovery in FBDD: high concentration screening (HCS), solution ligand-observed nuclear magnetic resonance (NMR), and surface plasmon resonance (SPR). They selected the commonly used saturation transfer difference (STD) NMR spectroscopy and the proprietary target immobilized NMR screening (TINS) as representative of the array of possible NMR methods. Using a target typical of FBDD campaigns, the authors find that HCS and TINS are the most sensitive to weak interactions. They also find a good correlation between TINS and STD for tighter binding ligands, but the ability of STD to detect ligands with affinity weaker than 1 mM K(D) is limited. Similarly, they find that SPR detection is most suited to ligands that bind with K(D) better than 1 mM. However, the good correlation between SPR and potency in a bioassay makes this a good method for hit validation and characterization studies.

  11. Measuring the temperature of hot nuclear fragments

    International Nuclear Information System (INIS)

    Wuenschel, S.; Bonasera, A.; May, L.W.; Souliotis, G.A.; Tripathi, R.; Galanopoulos, S.; Kohley, Z.; Hagel, K.; Shetty, D.V.; Huseman, K.; Soisson, S.N.; Stein, B.C.; Yennello, S.J.

    2010-01-01

    A new thermometer based on fragment momentum fluctuations is presented. This thermometer exhibited residual contamination from the collective motion of the fragments along the beam axis. For this reason, the transverse direction has been explored. Additionally, a mass dependence was observed for this thermometer. This mass dependence may be the result of the Fermi momentum of nucleons or the different properties of the fragments (binding energy, spin, etc.) which might be more sensitive to different densities and temperatures of the exploding fragments. We expect some of these aspects to be smaller for protons (and/or neutrons); consequently, the proton transverse momentum fluctuations were used to investigate the temperature dependence of the source.
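
    A minimal sketch of how a momentum-fluctuation thermometer of this kind can work. The abstract does not state the estimator, so everything here is an assumption: a classical Maxwell-Boltzmann source, a transverse quadrupole observable Q = px^2 - py^2, and the resulting relation Var(Q) = 4 m^2 T^2. The function names and the proton mass value are illustrative only, not taken from the paper.

```python
import math
import random


def sample_transverse_momenta(n, mass, temperature, seed=0):
    """Draw n (px, py) pairs from a classical thermal source.

    Assumption: each momentum component is Gaussian with variance m*T
    (units: mass in MeV/c^2, temperature in MeV, momentum in MeV/c).
    """
    rng = random.Random(seed)
    sigma = math.sqrt(mass * temperature)
    px = [rng.gauss(0.0, sigma) for _ in range(n)]
    py = [rng.gauss(0.0, sigma) for _ in range(n)]
    return px, py


def quadrupole_temperature(px, py, mass):
    """Estimate T from the variance of Q = px^2 - py^2.

    For Gaussian components with variance m*T, Var(Q) = 4 m^2 T^2,
    so T = sqrt(Var(Q)) / (2 m).
    """
    q = [x * x - y * y for x, y in zip(px, py)]
    mean_q = sum(q) / len(q)
    var_q = sum((v - mean_q) ** 2 for v in q) / len(q)
    return math.sqrt(var_q) / (2.0 * mass)


if __name__ == "__main__":
    proton_mass = 938.272  # MeV/c^2
    px, py = sample_transverse_momenta(100_000, proton_mass, 5.0, seed=1)
    print(quadrupole_temperature(px, py, proton_mass))
```

    Using only transverse components mirrors the abstract's remark that the beam axis is contaminated by collective motion; any quantum (Fermi momentum) correction is outside this classical sketch.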

  12. Cloud-based interactive analytics for terabytes of genomic variants data.

    Science.gov (United States)

    Pan, Cuiping; McInnes, Gregory; Deflaux, Nicole; Snyder, Michael; Bingham, Jonathan; Datta, Somalee; Tsao, Philip S

    2017-12-01

    Large scale genomic sequencing is now widely used to decipher questions in diverse realms such as biological function, human diseases, evolution, ecosystems, and agriculture. With the quantity and diversity these data harbor, a robust and scalable data handling and analysis solution is desired. We present interactive analytics using a cloud-based columnar database built on Dremel to perform information compression, comprehensive quality controls, and biological information retrieval in large volumes of genomic data. We demonstrate such Big Data computing paradigms can provide orders of magnitude faster turnaround for common genomic analyses, transforming long-running batch jobs submitted via a Linux shell into questions that can be asked from a web browser in seconds. Using this method, we assessed a study population of 475 deeply sequenced human genomes for genomic call rate, genotype and allele frequency distribution, variant density across the genome, and pharmacogenomic information. Our analysis framework is implemented in Google Cloud Platform and BigQuery. Codes are available at https://github.com/StanfordBioinformatics/mvp_aaa_codelabs. Contact: cuiping@stanford.edu or ptsao@stanford.edu. Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2017. This work is written by US Government employees and is in the public domain in the US.

  13. A polymer, random walk model for the size-distribution of large DNA fragments after high linear energy transfer radiation

    Science.gov (United States)

    Ponomarev, A. L.; Brenner, D.; Hlatky, L. R.; Sachs, R. K.

    2000-01-01

    DNA double-strand breaks (DSBs) produced by densely ionizing radiation are not located randomly in the genome: recent data indicate DSB clustering along chromosomes. Stochastic DSB clustering at large scales, from > 100 Mbp down to [...], is modeled using simulations and analytic equations. A random-walk, coarse-grained polymer model for chromatin is combined with a simple track structure model in Monte Carlo software called DNAbreak and is applied to data on alpha-particle irradiation of V-79 cells. The chromatin model neglects molecular details but systematically incorporates an increase in average spatial separation between two DNA loci as the number of base-pairs between the loci increases. Fragment-size distributions obtained using DNAbreak match data on large fragments about as well as distributions previously obtained with a less mechanistic approach. Dose-response relations, linear at small doses of high linear energy transfer (LET) radiation, are obtained. They are found to be non-linear when the dose becomes so large that there is a significant probability of overlapping or close juxtaposition, along one chromosome, for different DSB clusters from different tracks. The non-linearity is more evident for large fragments than for small. The DNAbreak results furnish an example of the RLC (randomly located clusters) analytic formalism, which generalizes the broken-stick fragment-size distribution of the random-breakage model that is often applied to low-LET data.
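
    For orientation, the random-breakage ("broken-stick") baseline mentioned at the end of the abstract can be sketched in a few lines. This is not DNAbreak and it contains no clustering or chromatin geometry: it assumes breaks fall independently and uniformly along one chromosome. The function names and the chromosome length used in the example are hypothetical.

```python
import random


def random_breakage_fragments(length_mbp, n_breaks, rng):
    """Cut a chromosome of length_mbp at n_breaks uniform random positions.

    Returns the fragment sizes (in Mbp) in order along the chromosome.
    """
    cuts = sorted(rng.uniform(0.0, length_mbp) for _ in range(n_breaks))
    edges = [0.0] + cuts + [length_mbp]
    return [right - left for left, right in zip(edges, edges[1:])]


def mean_fragment_size(length_mbp, n_breaks):
    """n breaks always give n + 1 fragments, so the mean size is L / (n + 1)."""
    return length_mbp / (n_breaks + 1)


if __name__ == "__main__":
    rng = random.Random(0)
    fragments = random_breakage_fragments(100.0, 9, rng)
    print(len(fragments), sum(fragments), mean_fragment_size(100.0, 9))
```

    Clustered breaks, as described for high-LET radiation, would produce more small fragments than this baseline for the same number of breaks, which is the deviation the RLC formalism is meant to capture.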

  14. A recombinant estrogen receptor fragment-based homogeneous fluorescent assay for rapid detection of estrogens.

    Science.gov (United States)

    Wang, Dan; Xie, Jiangbi; Zhu, Xiaocui; Li, Jinqiu; Zhao, Dongqin; Zhao, Meiping

    2014-05-15

    In this work, we demonstrate a novel estrogenic receptor fragment-based homogeneous fluorescent assay which enables rapid and sensitive detection of 17β-estradiol (E2) and other highly potent estrogens. A modified human estrogenic receptor fragment (N-His × 6-hER270-595-C-Strep tag II) has been constructed that contains amino acids 270-595 of wild-type human estrogenic receptor α (hER270-595) and two specific tags (6 × His and Strep tag II) fused to the N and C terminus, respectively. The designed receptor protein fragment could be easily produced by prokaryotic expression with high yield and high purity. The obtained protein exhibits high binding affinity to E2 and the two tags greatly facilitate the application of the recombinant protein. Taking advantage of the unique spectroscopic properties of coumestrol (CS), a fluorescent phytoestrogen, a CS/hER270-595-based fluorescent assay has been developed which can sensitively respond to E2 within 1.0 min with a linear working range from 0.1 to 20 ng/mL and a limit of detection of 0.1 ng/mL. The assay was successfully applied for rapid detection of E2 in the culture medium of rat hippocampal neurons. The method also holds great potential for high-throughput monitoring of the variation of estrogen levels in complex biological fluids, which is crucial for investigation of the molecular basis of various estrogen-involved processes. Copyright © 2013 Elsevier B.V. All rights reserved.

  15. Marker-based estimation of genetic parameters in genomics.

    Directory of Open Access Journals (Sweden)

    Zhiqiu Hu

    Linear mixed model (LMM) analysis has been recently used extensively for estimating additive genetic variances and narrow-sense heritability in many genomic studies. While the LMM analysis is computationally less intensive than the Bayesian algorithms, it remains infeasible for large-scale genomic data sets. In this paper, we advocate the use of a statistical procedure known as symmetric differences squared (SDS) as it may serve as a viable alternative when the LMM methods have difficulty or fail to work with large datasets. The SDS procedure is a general and computationally simple method based only on the least squares regression analysis. We carry out computer simulations and empirical analyses to compare the SDS procedure with two commonly used LMM-based procedures. Our results show that the SDS method is not as good as the LMM methods for small data sets, but it becomes progressively better and can match well with the precision of estimation by the LMM methods for data sets with large sample sizes. Its major advantage is that with larger and larger samples, it continues to work with the increasing precision of estimation while the commonly used LMM methods are no longer able to work under our current typical computing capacity. Thus, these results suggest that the SDS method can serve as a viable alternative particularly when analyzing 'big' genomic data sets.

  16. Comprehensive preimplantation genetic screening and sperm deoxyribonucleic acid fragmentation from three males carrying balanced chromosome rearrangements.

    Science.gov (United States)

    Ramos, Laia; Daina, Gemma; Del Rey, Javier; Ribas-Maynou, Jordi; Fernández-Encinas, Alba; Martinez-Passarell, Olga; Boada, Montserrat; Benet, Jordi; Navarro, Joaquima

    2015-09-01

    To assess whether preimplantation genetic screening can successfully identify cytogenetically normal embryos in couples carrying balanced chromosome rearrangements in addition to increased sperm DNA fragmentation. Comprehensive preimplantation genetic screening was performed on three couples carrying chromosome rearrangements. Sperm DNA fragmentation was assessed for each patient. Academic center. One couple with the male partner carrying a chromosome 2 pericentric inversion and two couples with the male partners carrying a Robertsonian translocation (13:14 and 14:21, respectively). A single blastomere from each of the 18 cleavage-stage embryos obtained was analysed by metaphase comparative genomic hybridization. Single- and double-strand sperm DNA fragmentation was determined by the alkaline and neutral Comet assays. Single- and double-strand sperm DNA fragmentation values and incidence of chromosome imbalances in the blastomeres were analyzed. The obtained values of single-strand sperm DNA fragmentation were between 47% and 59%, and the double-strand sperm DNA fragmentation values were between 43% and 54%. No euploid embryos were observed in the couple showing the highest single-strand sperm DNA fragmentation. However, euploid embryos were observed in the other two couples: embryo transfer was performed, and pregnancy was achieved by the couple showing the lowest sperm DNA fragmentation values. Preimplantation genetic screening enables the detection of euploid embryos in couples affected by balanced chromosome rearrangements and increased sperm DNA fragmentation. Even though sperm DNA fragmentation may potentially have clinical consequences on fertility, comprehensive preimplantation genetic screening allows for the identification and transfer of euploid embryos. Copyright © 2015. Published by Elsevier Inc.

  17. Comparative chloroplast genomes of eleven Schima (Theaceae) species: Insights into DNA barcoding and phylogeny.

    Science.gov (United States)

    Yu, Xiang-Qin; Drew, Bryan T; Yang, Jun-Bo; Gao, Lian-Ming; Li, De-Zhu

    2017-01-01

    Schima is an ecologically and economically important woody genus in the tea family (Theaceae). Unresolved species delimitations and phylogenetic relationships within Schima limit our understanding of the genus and hinder utilization of the genus for economic purposes. In the present study, we conducted comparative analysis among the complete chloroplast (cp) genomes of 11 Schima species. Our results indicate that Schima cp genomes possess a typical quadripartite structure, with conserved genomic structure and gene order. The size of the Schima cp genome is about 157 kilo base pairs (kb). They consistently encode 114 unique genes, including 80 protein-coding genes, 30 tRNAs, and 4 rRNAs, with 17 duplicated in the inverted repeat (IR). These cp genomes are highly conserved and do not show obvious expansion or contraction of the IR region. The percent variability of the 68 coding and 93 noncoding (>150 bp) fragments is consistently less than 3%. The seven most widely touted DNA barcode regions as well as one promising barcode candidate showed low sequence divergence. Eight mutational hotspots were identified from the 11 cp genomes. These hotspots may potentially be useful as specific DNA barcodes for species identification of Schima. The 58 cpSSR loci reported here are complementary to the microsatellite markers identified from the nuclear genome, and will be leveraged for further population-level studies. Phylogenetic relationships among the 11 Schima species were resolved with strong support based on the cp genome data set, which corresponds well with the species distribution pattern. The data presented here will serve as a foundation to facilitate species identification, DNA barcoding and phylogenetic reconstructions for future exploration of Schima.

  18. Heavy-Quark Production in the Target Fragmentation Region

    CERN Document Server

    Graudenz, Dirk

    1997-01-01

    Fixed-target experiments permit the study of hadron production in the target fragmentation region. It is expected that the tagging of specific particles in the target fragments can be employed to introduce a bias in the hard scattering process towards a specific flavour content. The case of hadrons containing a heavy quark is particularly attractive because of the clear experimental signatures and the applicability of perturbative QCD. The standard approach to one-particle inclusive processes based on fragmentation functions is valid in the current fragmentation region and for large transverse momenta $p_T$ in the target fragmentation region, but it fails for particle production at small $p_T$ in the target fragmentation region. A collinear singularity, which cannot be absorbed in the standard way into the phenomenological distribution functions, prohibits the application of this procedure. This situation is remedied by the introduction of a new set of distribution functions, the target fragmentation function...

  19. Young, intact and nested retrotransposons are abundant in the onion and asparagus genomes.

    Science.gov (United States)

    Vitte, C; Estep, M C; Leebens-Mack, J; Bennetzen, J L

    2013-09-01

    Although monocotyledonous plants comprise one of the two major groups of angiosperms and include >65 000 species, comprehensive genome analysis has been focused mainly on the Poaceae (grass) family. Due to this bias, most of the conclusions that have been drawn for monocot genome evolution are based on grasses. It is not known whether these conclusions apply to many other monocots. To extend our understanding of genome evolution in the monocots, Asparagales genomic sequence data were acquired and the structural properties of asparagus and onion genomes were analysed. Specifically, several available onion and asparagus bacterial artificial chromosomes (BACs) with contig sizes >35 kb were annotated and analysed, with a particular focus on the characterization of long terminal repeat (LTR) retrotransposons. The results reveal that LTR retrotransposons are the major components of the onion and garden asparagus genomes. These elements are mostly intact (i.e. with two LTRs), have mainly inserted within the past 6 million years and are piled up into nested structures. Analysis of shotgun genomic sequence data and the observation of two copies for some transposable elements (TEs) in annotated BACs indicates that some families have become particularly abundant, as high as 4-5 % (asparagus) or 3-4 % (onion) of the genome for the most abundant families, as also seen in large grass genomes such as wheat and maize. Although previous annotations of contiguous genomic sequences have suggested that LTR retrotransposons were highly fragmented in these two Asparagales genomes, the results presented here show that this was largely due to the methodology used. In contrast, this current work indicates an ensemble of genomic features similar to those observed in the Poaceae.

  20. HETC-3STEP included fragmentation process

    Energy Technology Data Exchange (ETDEWEB)

    Shigyo, Nobuhiro; Iga, Kiminori; Ishibashi, Kenji [Kyushu Univ., Fukuoka (Japan). Faculty of Engineering

    1997-03-01

    High Energy Transport Code (HETC) based on the cascade-evaporation model is modified to calculate the fragmentation cross section. For the cascade process, nucleon-nucleon cross sections are used for collision computation; effective in-medium-corrected cross sections are adopted instead of the original free-nucleon collision. The exciton model is adopted for improvement of backward nucleon-emission cross section for low-energy nucleon-incident events. The fragmentation reaction is incorporated into the original HETC as a subroutine set by the use of the systematics of the reaction. The modified HETC (HETC-3STEP/FRG) reproduces experimental fragment yields to a reasonable degree. (author)

  1. Occlusion-Aware Fragment-Based Tracking With Spatial-Temporal Consistency.

    Science.gov (United States)

    Sun, Chong; Wang, Dong; Lu, Huchuan

    2016-08-01

    In this paper, we present a robust tracking method by exploiting a fragment-based appearance model with consideration of both temporal continuity and discontinuity information. From the perspective of probability theory, the proposed tracking algorithm can be viewed as a two-stage optimization problem. In the first stage, by adopting the estimated occlusion state as a prior, the optimal state of the tracked object can be obtained by solving an optimization problem, where the objective function is designed based on the classification score, occlusion prior, and temporal continuity information. In the second stage, we propose a discriminative occlusion model, which exploits both foreground and background information to detect the possible occlusion, and also models the consistency of occlusion labels among different frames. In addition, a simple yet effective training strategy is introduced during the model training (and updating) process, with which the effects of spatial-temporal consistency are properly weighted. The proposed tracker is evaluated by using the recent benchmark data set, on which the results demonstrate that our tracker performs favorably against other state-of-the-art tracking algorithms.

  2. Detection of ligand binding hot spots on protein surfaces via fragment-based methods: application to DJ-1 and glucocerebrosidase

    Energy Technology Data Exchange (ETDEWEB)

    Landon, Melissa R.; Lieberman, Raquel L.; Hoang, Quyen Q.; Ju, Shulin; Caaveiro, Jose M.M.; Orwig, Susan D.; Kozakov, Dima; Brenke, Ryan; Chuang, Gwo-Yu; Beglov, Dmitry; Vajda, Sandor; Petsko, Gregory A.; Ringe, Dagmar; (BU-M); (Brandeis); (GIT)

    2010-08-04

    The identification of hot spots, i.e., binding regions that contribute substantially to the free energy of ligand binding, is a critical step for structure-based drug design. Here we present the application of two fragment-based methods to the detection of hot spots for DJ-1 and glucocerebrosidase (GCase), targets for the development of therapeutics for Parkinson's and Gaucher's diseases, respectively. While the structures of these two proteins are known, binding information is lacking. In this study we employ the experimental multiple solvent crystal structures (MSCS) method and computational fragment mapping (FTMap) to identify regions suitable for the development of pharmacological chaperones for DJ-1 and GCase. Comparison of data derived via MSCS and FTMap also shows that FTMap, a computational method for the identification of fragment binding hot spots, is an accurate and robust alternative to the performance of expensive and difficult crystallographic experiments.

  3. Combining NMR and X-ray crystallography in fragment-based drug discovery: discovery of highly potent and selective BACE-1 inhibitors.

    Science.gov (United States)

    Wyss, Daniel F; Wang, Yu-Sen; Eaton, Hugh L; Strickland, Corey; Voigt, Johannes H; Zhu, Zhaoning; Stamford, Andrew W

    2012-01-01

    Fragment-based drug discovery (FBDD) has become increasingly popular over the last decade. We review here how we have used highly structure-driven fragment-based approaches to complement more traditional lead discovery to tackle high priority targets and those struggling for leads. Combining biomolecular nuclear magnetic resonance (NMR), X-ray crystallography, and molecular modeling with structure-assisted chemistry and innovative biology as an integrated approach for FBDD can solve very difficult problems, as illustrated in this chapter. Here, a successful FBDD campaign is described that has allowed the development of a clinical candidate for BACE-1, a challenging CNS drug target. Crucial to this achievement were the initial identification of a ligand-efficient isothiourea fragment through target-based NMR screening and the determination of its X-ray crystal structure in complex with BACE-1, which revealed an extensive H-bond network with the two active site aspartate residues. This detailed 3D structural information then enabled the design and validation of novel, chemically stable and accessible heterocyclic acylguanidines as aspartic acid protease inhibitor cores. Structure-assisted fragment hit-to-lead optimization yielded iminoheterocyclic BACE-1 inhibitors that possess desirable molecular properties as potential therapeutic agents to test the amyloid hypothesis of Alzheimer's disease in a clinical setting.

  4. Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity

    Science.gov (United States)

    Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. Here we ...

  5. Integrated genome-based studies of Shewanella Ecophysiology

    Energy Technology Data Exchange (ETDEWEB)

    Tiedje, James M. [Michigan State Univ., East Lansing, MI (United States)]; Konstantinidis, Kostas [Michigan State Univ., East Lansing, MI (United States)]; Worden, Mark [Michigan State Univ., East Lansing, MI (United States)]

    2014-01-08

    The aim of the work reported is to study Shewanella population genomics, and to understand the evolution, ecophysiology, and speciation of Shewanella. The tasks supporting this aim are: to study genetic and ecophysiological bases defining the core and diversification of Shewanella species; to determine gene content patterns along redox gradients; and to investigate the evolutionary processes, patterns and mechanisms of Shewanella.

  6. Congruent Deep Relationships in the Grape Family (Vitaceae) Based on Sequences of Chloroplast Genomes and Mitochondrial Genes via Genome Skimming.

    Science.gov (United States)

    Zhang, Ning; Wen, Jun; Zimmer, Elizabeth A

    2015-01-01

    Vitaceae is well-known for having one of the most economically important fruits, i.e., the grape (Vitis vinifera). The deep phylogeny of the grape family was not resolved until a recent phylogenomic analysis of 417 nuclear genes from transcriptome data. However, it has been reported extensively that topologies based on nuclear and organellar genes may be incongruent due to differences in their evolutionary histories. Therefore, it is important to reconstruct a backbone phylogeny of the grape family using plastomes and mitochondrial genes. In this study, next-generation sequencing data sets of 27 species were obtained using genome skimming with total DNAs from silica-gel preserved tissue samples on an Illumina NextSeq 500 instrument [corrected]. Plastomes were assembled using the combination of de novo and reference genome (of V. vinifera) methods. Sixteen mitochondrial genes were also obtained via genome skimming using the reference genome of V. vinifera. Extensive phylogenetic analyses were performed using maximum likelihood and Bayesian methods. The topology based on either plastome data or mitochondrial genes is congruent with the one using hundreds of nuclear genes, indicating that the grape family did not exhibit significant reticulation at the deep level. The results showcase the power of genome skimming in capturing extensive phylogenetic data, especially from chloroplast and mitochondrial DNAs.

  7. Congruent Deep Relationships in the Grape Family (Vitaceae) Based on Sequences of Chloroplast Genomes and Mitochondrial Genes via Genome Skimming.

    Directory of Open Access Journals (Sweden)

    Ning Zhang

    Vitaceae is well-known for having one of the most economically important fruits, i.e., the grape (Vitis vinifera). The deep phylogeny of the grape family was not resolved until a recent phylogenomic analysis of 417 nuclear genes from transcriptome data. However, it has been reported extensively that topologies based on nuclear and organellar genes may be incongruent due to differences in their evolutionary histories. Therefore, it is important to reconstruct a backbone phylogeny of the grape family using plastomes and mitochondrial genes. In this study, next-generation sequencing data sets of 27 species were obtained using genome skimming with total DNAs from silica-gel preserved tissue samples on an Illumina NextSeq 500 instrument [corrected]. Plastomes were assembled using the combination of de novo and reference genome (of V. vinifera) methods. Sixteen mitochondrial genes were also obtained via genome skimming using the reference genome of V. vinifera. Extensive phylogenetic analyses were performed using maximum likelihood and Bayesian methods. The topology based on either plastome data or mitochondrial genes is congruent with the one using hundreds of nuclear genes, indicating that the grape family did not exhibit significant reticulation at the deep level. The results showcase the power of genome skimming in capturing extensive phylogenetic data, especially from chloroplast and mitochondrial DNAs.

  8. Fragment-assisted hit investigation involving integrated HTS and fragment screening: Application to the identification of phosphodiesterase 10A (PDE10A) inhibitors.

    Science.gov (United States)

    Varnes, Jeffrey G; Geschwindner, Stefan; Holmquist, Christopher R; Forst, Janet; Wang, Xia; Dekker, Niek; Scott, Clay W; Tian, Gaochao; Wood, Michael W; Albert, Jeffrey S

    2016-01-01

    Fragment-based drug design (FBDD) relies on direct elaboration of fragment hits and typically requires high resolution structural information to guide optimization. In fragment-assisted drug discovery (FADD), fragments provide information to guide selection and design but do not serve as starting points for elaboration. We describe FADD and high-throughput screening (HTS) campaign strategies conducted in parallel against PDE10A where fragment hit co-crystallography was not available. The fragment screen led to prioritized fragment hits (IC50's ∼500μM), which were used to generate a hypothetical core scaffold. Application of this scaffold as a filter to HTS output afforded a 4μM hit, which, after preparation of a small number of analogs, was elaborated into a 16nM lead. This approach highlights the strength of FADD, as fragment methods were applied despite the absence of co-crystallographical information to efficiently identify a lead compound for further optimization. Copyright © 2015 Elsevier Ltd. All rights reserved.

  9. The isolation and localization of arbitrary restriction fragment length polymorphisms in Southern African populations

    International Nuclear Information System (INIS)

    Conn, V.

    1987-01-01

    The main aim of this study was to contribute to the mapping of the human genome by searching for and characterizing a number of RFLPs (restriction fragment length polymorphisms) in the human genome. The more specific aims of this study were: 1. To isolate single-copy human DNA sequences from a human genomic library. 2. To use these single-copy sequences as DNA probes to search for polymorphic variation among Caucasoid individuals. 3. To show by means of family studies that the RFLPs were inherited in a co-dominant Mendelian fashion. 4. To determine the population frequencies of these RFLPs in Southern African Populations, namely the Bantu-speaking Negroids and the San. 5. To assign these RFLP-detecting DNA sequences to human chromosomes using somatic cell hybrid lines. In this study DNA was labelled with Phosphorus 32

  10. Oral lead bullet fragment exposure in northern bobwhite (Colinus virginianus).

    Science.gov (United States)

    Kerr, Richard; Holladay, Jeremy; Holladay, Steven; Tannenbaum, Lawrence; Selcer, Barbara; Meldrum, Blair; Williams, Susan; Jarrett, Timothy; Gogal, Robert

    2011-11-01

    Lead (Pb) is a worldwide environmental contaminant known to adversely affect multiple organ systems in both mammalian and avian species. In birds, a common route of exposure is via oral ingestion of lead particles. Data are currently lacking for the retention and clearance of Pb bullet fragments in the gastrointestinal (GI) tract of birds while linking toxicity with blood Pb levels. In the present study, northern bobwhite quail fed a seed-based diet were orally gavaged with Pb bullet fragments (zero, one or five fragments/bird) and evaluated for rate of fragment clearance, and changes in peripheral blood, renal, immune, and gastrointestinal parameters. Based on radiographs, the majority of the birds cleared or absorbed the fragments by seven days, with the exception of one five-fragment bird which took between 7 and 14 days. Blood Pb levels were higher in males than females, which may be related to egg production in females. In males but not females, feed consumption, body weight gain, packed cell volume (PCV), plasma protein concentration, and δ-aminolevulinic acid dehydratase (δ-ALAD) activity were all adversely affected by five Pb fragments. Birds of both sexes that received a single Pb fragment displayed depressed δ-ALAD, suggesting altered hematologic function, while all birds dosed with five bullet fragments exhibited greater morbidity.

  11. Detection of bacterial contaminants and hybrid sequences in the genome of the kelp Saccharina japonica using Taxoblast

    Directory of Open Access Journals (Sweden)

    Simon M. Dittami

    2017-11-01

    Modern genome sequencing strategies are highly sensitive to contamination, making the detection of foreign DNA sequences an important part of analysis pipelines. Here we use Taxoblast, a simple pipeline with a graphical user interface, for the post-assembly detection of contaminating sequences in the published genome of the kelp Saccharina japonica. Analyses were based on multiple blastn searches with short sequence fragments. They revealed a number of probable bacterial contaminations as well as hybrid scaffolds that contain both bacterial and algal sequences. This or similar types of analysis, in combination with manual curation, may thus constitute a useful complement to standard bioinformatics analyses prior to submission of genomic data to public repositories. Our analysis pipeline is open-source and freely available at http://sdittami.altervista.org/taxoblast and via SourceForge (https://sourceforge.net/projects/taxoblast).
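
    The fragment-based search described above can be illustrated with a minimal Python sketch that cuts a scaffold into short pieces and formats them as FASTA records ready for a blastn query. This is not Taxoblast's own code: the function names, the 500-bp fragment length, and the header format are assumptions chosen for illustration, and the sketch stops before any BLAST call.

```python
def fragment_scaffold(name, seq, frag_len=500):
    """Yield (header, fragment) pairs covering seq in consecutive windows.

    frag_len is a hypothetical default; the abstract does not state
    the fragment length that the pipeline uses.
    """
    for start in range(0, len(seq), frag_len):
        piece = seq[start:start + frag_len]
        # Header records 1-based start and end so hits can be mapped back.
        yield f"{name}_{start + 1}_{start + len(piece)}", piece


def to_fasta(records):
    """Format (header, sequence) pairs as FASTA text."""
    return "\n".join(f">{header}\n{seq}" for header, seq in records)


if __name__ == "__main__":
    print(to_fasta(fragment_scaffold("scaffold1", "ACGTACGTAC", frag_len=4)))
```

    Keeping the coordinates in each header lets per-fragment taxonomic hits be laid back along the scaffold, which is how a hybrid scaffold (bacterial hits in one stretch, algal hits in another) would become visible.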

  12. Genomic and gene variation in Mycoplasma hominis strains

    DEFF Research Database (Denmark)

    Christiansen, Gunna; Andersen, H; Birkelund, Svend

    1987-01-01

    DNAs from 14 strains of Mycoplasma hominis isolated from various habitats, including strain PG21, were analyzed for genomic heterogeneity. DNA-DNA filter hybridization values were from 51 to 91%. Restriction endonuclease digestion patterns, analyzed by agarose gel electrophoresis, revealed no identity or cluster formation between strains. Variation within M. hominis rRNA genes was analyzed by Southern hybridization of EcoRI-cleaved DNA hybridized with a cloned fragment of the rRNA gene from the mycoplasma strain PG50. Five of the M. hominis strains showed identical hybridization patterns. These hybridization patterns were compared with those of 12 other mycoplasma species, which showed a much more complex band pattern. Cloned nonribosomal RNA gene fragments of M. hominis PG21 DNA were analyzed, and the fragments were used to demonstrate heterogeneity among the strains. A monoclonal antibody against...

  13. Dynameomics: Data-driven methods and models for utilizing large-scale protein structure repositories for improving fragment-based loop prediction

    Science.gov (United States)

    Rysavy, Steven J; Beck, David AC; Daggett, Valerie

    2014-01-01

    Protein function is intimately linked to protein structure and dynamics, yet experimentally determined structures frequently omit regions within a protein due to indeterminate data, which is often due to protein dynamics. We propose that atomistic molecular dynamics simulations provide a diverse sampling of biologically relevant structures for these missing segments (and beyond) to improve structural modeling and structure prediction. Here we make use of the Dynameomics data warehouse, which contains simulations of representatives of essentially all known protein folds. We developed novel computational methods to efficiently identify, rank and retrieve small peptide structures, or fragments, from this database. We also created a novel data model to analyze and compare large repositories of structural data, such as contained within the Protein Data Bank and the Dynameomics data warehouse. Our evaluation compares these structural repositories for improving loop predictions and analyzes the utility of our methods and models. Using a standard set of loop structures, containing 510 loops, 30 for each loop length from 4 to 20 residues, we find that the inclusion of Dynameomics structures in fragment-based methods improves the quality of the loop predictions without being dependent on sequence homology. Depending on loop length, ∼25–75% of the best predictions came from the Dynameomics set, resulting in lower main chain root-mean-square deviations for all fragment lengths using the combined fragment library. We also provide specific cases where Dynameomics fragments provide better predictions for NMR loop structures than fragments from crystal structures. Online access to these fragment libraries is available at http://www.dynameomics.org/fragments. PMID:25142412
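
    The ranking criterion mentioned above, main chain root-mean-square deviation (RMSD), can be sketched in plain Python. The sketch assumes the candidate fragment and the reference loop are already superimposed and list the same atoms in the same order; the function names are hypothetical and no structural alignment step is included, so this is only an illustration of the metric, not the authors' pipeline.

```python
import math


def rmsd(coords_a, coords_b):
    """RMSD between two equal-length lists of (x, y, z) coordinates.

    Assumes both coordinate sets are already in the same frame;
    no superposition is performed here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


def best_fragment(reference, candidates):
    """Return (index, rmsd) of the candidate closest to the reference."""
    scored = [(rmsd(reference, cand), i) for i, cand in enumerate(candidates)]
    best_rmsd, best_index = min(scored)
    return best_index, best_rmsd
```

    Pooling candidates from two libraries into one `candidates` list and recording which library the winner came from is one simple way to reproduce the kind of comparison the abstract reports.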

  14. Dynameomics: data-driven methods and models for utilizing large-scale protein structure repositories for improving fragment-based loop prediction.

    Science.gov (United States)

    Rysavy, Steven J; Beck, David A C; Daggett, Valerie

    2014-11-01

    Protein function is intimately linked to protein structure and dynamics, yet experimentally determined structures frequently omit regions within a protein due to indeterminate data, which is often due to protein dynamics. We propose that atomistic molecular dynamics simulations provide a diverse sampling of biologically relevant structures for these missing segments (and beyond) to improve structural modeling and structure prediction. Here we make use of the Dynameomics data warehouse, which contains simulations of representatives of essentially all known protein folds. We developed novel computational methods to efficiently identify, rank and retrieve small peptide structures, or fragments, from this database. We also created a novel data model to analyze and compare large repositories of structural data, such as contained within the Protein Data Bank and the Dynameomics data warehouse. Our evaluation compares these structural repositories for improving loop predictions and analyzes the utility of our methods and models. Using a standard set of loop structures, containing 510 loops, 30 for each loop length from 4 to 20 residues, we find that the inclusion of Dynameomics structures in fragment-based methods improves the quality of the loop predictions without being dependent on sequence homology. Depending on loop length, ∼25-75% of the best predictions came from the Dynameomics set, resulting in lower main chain root-mean-square deviations for all fragment lengths using the combined fragment library. We also provide specific cases where Dynameomics fragments provide better predictions for NMR loop structures than fragments from crystal structures. Online access to these fragment libraries is available at http://www.dynameomics.org/fragments. © 2014 The Protein Society.

  15. Fragment-based discovery of potent inhibitors of the anti-apoptotic MCL-1 protein.

    Science.gov (United States)

    Petros, Andrew M; Swann, Steven L; Song, Danying; Swinger, Kerren; Park, Chang; Zhang, Haichao; Wendt, Michael D; Kunzer, Aaron R; Souers, Andrew J; Sun, Chaohong

    2014-03-15

    Apoptosis is regulated by the BCL-2 family of proteins, which is comprised of both pro-death and pro-survival members. Evasion of apoptosis is a hallmark of malignant cells. One way in which cancer cells achieve this evasion is through overexpression of the pro-survival members of the BCL-2 family. Overexpression of MCL-1, a pro-survival protein, has been shown to be a resistance factor for Navitoclax, a potent inhibitor of BCL-2 and BCL-XL. Here we describe the use of fragment screening methods and structural biology to drive the discovery of novel MCL-1 inhibitors from two distinct structural classes. Specifically, cores derived from a biphenyl sulfonamide and salicylic acid were uncovered in an NMR-based fragment screen and elaborated using high throughput analog synthesis. This culminated in the discovery of selective and potent inhibitors of MCL-1 that may serve as promising leads for medicinal chemistry optimization efforts. Copyright © 2014 Elsevier Ltd. All rights reserved.

  16. Development of swine-specific DNA markers for biosensor-based halal authentication.

    Science.gov (United States)

    Ali, M E; Hashim, U; Kashif, M; Mustafa, S; Che Man, Y B; Abd Hamid, S B

    2012-06-29

    The pig (Sus scrofa) mitochondrial genome was targeted to design short (15-30 nucleotides) DNA markers that would be suitable for biosensor-based hybridization detection of target DNA. Short DNA markers are reported to survive harsh conditions in which longer ones are degraded into smaller fragments. The whole swine mitochondrial-genome was in silico digested with AluI restriction enzyme. Among 66 AluI fragments, five were selected as potential markers because of their convenient lengths, high degree of interspecies polymorphism and intraspecies conservatism. These were confirmed by NCBI blast analysis and ClustalW alignment analysis with 11 different meat-providing animal and fish species. Finally, we integrated a tetramethyl rhodamine-labeled 18-nucleotide AluI fragment into a 3-nm diameter citrate-tannate coated gold nanoparticle to develop a swine-specific hybrid nanobioprobe for the determination of pork adulteration in 2.5-h autoclaved pork-beef binary mixtures. This hybrid probe detected as low as 1% pork in deliberately contaminated autoclaved pork-beef binary mixtures and no cross-species detection was recorded, demonstrating the feasibility of this type of probe for biosensor-based detection of pork adulteration of halal and kosher foods.
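
    The in silico digestion step can be illustrated with a short Python sketch. AluI recognizes AGCT and cuts between G and C, leaving blunt ends; everything else here (function names, treating the sequence as linear, and using the 15-30 nucleotide range from the abstract as a length filter) is an assumption for illustration and not the authors' workflow. A real mitochondrial genome is circular, so a site spanning the sequence origin would be missed by this simplification.

```python
ALUI_SITE = "AGCT"   # AluI recognition sequence
ALUI_CUT_OFFSET = 2  # blunt cut between G and C: AG^CT


def alui_digest(seq):
    """Return the fragments produced by cutting a linear sequence with AluI."""
    seq = seq.upper()
    cuts = []
    pos = seq.find(ALUI_SITE)
    while pos != -1:
        cuts.append(pos + ALUI_CUT_OFFSET)
        pos = seq.find(ALUI_SITE, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


def candidate_markers(fragments, min_len=15, max_len=30):
    """Keep fragments whose length falls in the short-marker range."""
    return [f for f in fragments if min_len <= len(f) <= max_len]
```

    Length is only the first filter; the interspecies polymorphism and intraspecies conservation checks described in the abstract would still require alignment against other species.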

  17. Exploration of the Germline Genome of the Ciliate Chilodonella uncinata through Single-Cell Omics (Transcriptomics and Genomics)

    Directory of Open Access Journals (Sweden)

    Xyrus X. Maurer-Alcalá

    2018-01-01

    Separate germline and somatic genomes are found in numerous lineages across the eukaryotic tree of life, often separated into distinct tissues (e.g., in plants, animals, and fungi) or distinct nuclei sharing a common cytoplasm (e.g., in ciliates and some foraminifera). In ciliates, germline-limited (i.e., micronuclear-specific) DNA is eliminated during the development of a new somatic (i.e., macronuclear) genome in a process that is tightly linked to large-scale genome rearrangements, such as deletions and reordering of protein-coding sequences. Most studies of germline genome architecture in ciliates have focused on the model ciliates Oxytricha trifallax, Paramecium tetraurelia, and Tetrahymena thermophila, for which the complete germline genome sequences are known. Outside of these model taxa, only a few dozen germline loci have been characterized from a limited number of cultivable species, which is likely due to difficulties in obtaining sufficient quantities of “purified” germline DNA in these taxa. Combining single-cell transcriptomics and genomics, we have overcome these limitations and provide the first insights into the structure of the germline genome of the ciliate Chilodonella uncinata, a member of the understudied class Phyllopharyngea. Our analyses reveal the following: (i) large gene families contain a disproportionate number of genes from scrambled germline loci; (ii) germline-soma boundaries in the germline genome are demarcated by substantial shifts in GC content; (iii) single-cell omics techniques provide large-scale quality germline genome data with limited effort, at least for ciliates with extensively fragmented somatic genomes. Our approach provides an efficient means to understand better the evolution of genome rearrangements between germline and soma in ciliates.
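
    The second finding, that boundaries are marked by shifts in GC content, suggests a simple sliding-window scan. The Python sketch below is an illustration only: the window and step sizes, the function names, and the "largest jump between adjacent windows" heuristic are assumptions, not the method used in the study.

```python
def gc_content(seq):
    """Fraction of G and C bases in seq (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def windowed_gc(seq, window, step):
    """Return (start, gc_fraction) for each full window along seq."""
    return [
        (start, gc_content(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]


def largest_gc_shift(seq, window, step):
    """Return (start, delta) for the window with the biggest GC change
    relative to the window before it."""
    profile = windowed_gc(seq, window, step)
    if len(profile) < 2:
        raise ValueError("sequence too short for two windows")
    k = max(range(1, len(profile)),
            key=lambda i: abs(profile[i][1] - profile[i - 1][1]))
    return profile[k][0], profile[k][1] - profile[k - 1][1]
```

    On real data the window size trades resolution against noise, so a candidate boundary found this way would need confirmation against mapped somatic sequence.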

  18. CRISPR-Cas9-Based Genome Editing of Human Induced Pluripotent Stem Cells.

    Science.gov (United States)

    Giacalone, Joseph C; Sharma, Tasneem P; Burnight, Erin R; Fingert, John F; Mullins, Robert F; Stone, Edwin M; Tucker, Budd A

    2018-02-28

    Human induced pluripotent stem cells (hiPSCs) are the ideal cell source for autologous cell replacement. However, for patients with Mendelian diseases, genetic correction of the original disease-causing mutation is likely required prior to cellular differentiation and transplantation. The emergence of the CRISPR-Cas9 system has revolutionized the field of genome editing. By introducing inexpensive reagents that are relatively straightforward to design and validate, it is now possible to correct genetic variants or insert desired sequences at any location within the genome. CRISPR-based genome editing of patient-specific iPSCs shows great promise for future autologous cell replacement therapies. One caveat, however, is that hiPSCs are notoriously difficult to transfect, and optimized experimental design considerations are often necessary. This unit describes design strategies and methods for efficient CRISPR-based genome editing of patient-specific iPSCs. Additionally, it details a flexible approach that utilizes positive selection to generate clones with a desired genomic modification, Cre-lox recombination to remove the integrated selection cassette, and negative selection to eliminate residual hiPSCs with intact selection cassettes. © 2018 by John Wiley & Sons, Inc.

  19. Solution-based targeted genomic enrichment for precious DNA samples

    Directory of Open Access Journals (Sweden)

    Shearer Aiden

    2012-05-01

    Background: Solution-based targeted genomic enrichment (TGE) protocols permit selective sequencing of genomic regions of interest on a massively parallel scale. These protocols could be improved by: 1) modifying or eliminating time-consuming steps; 2) increasing yield to reduce input DNA and excessive PCR cycling; and 3) enhancing reproducibility. Results: We developed a solution-based TGE method for downstream Illumina sequencing in a non-automated workflow, adding standard Illumina barcode indexes during the post-hybridization amplification to allow for sample pooling prior to sequencing. The method utilizes Agilent SureSelect baits, primers and hybridization reagents for the capture, off-the-shelf reagents for the library preparation steps, and adaptor oligonucleotides for Illumina paired-end sequencing purchased directly from an oligonucleotide manufacturing company. Conclusions: This solution-based TGE method for Illumina sequencing is optimized for small- or medium-sized laboratories and addresses the weaknesses of standard protocols by reducing the amount of input DNA required, increasing capture yield, optimizing efficiency, and improving reproducibility.

  20. MaRaCluster: A Fragment Rarity Metric for Clustering Fragment Spectra in Shotgun Proteomics.

    Science.gov (United States)

    The, Matthew; Käll, Lukas

    2016-03-04

    Shotgun proteomics experiments generate large amounts of fragment spectra as primary data, normally with high redundancy between and within experiments. Here, we have devised a clustering technique to identify fragment spectra stemming from the same species of peptide. This is a powerful alternative method to traditional search engines for analyzing spectra, specifically useful for larger scale mass spectrometry studies. As an aid in this process, we propose a distance calculation relying on the rarity of experimental fragment peaks, following the intuition that peaks shared by only a few spectra offer more evidence than peaks shared by a large number of spectra. We used this distance calculation and a complete-linkage scheme to cluster data from a recent large-scale mass spectrometry-based study. The clusterings produced by our method have up to 40% more identified peptides for their consensus spectra compared to those produced by the previous state-of-the-art method. We see that our method would advance the construction of spectral libraries as well as serve as a tool for mining large sets of fragment spectra. The source code and Ubuntu binary packages are available at https://github.com/statisticalbiotechnology/maracluster (under an Apache 2.0 license).
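
    The intuition stated above, that peaks shared by few spectra carry more evidence than peaks shared by many, can be sketched in Python. This is not the MaRaCluster distance itself: the unit-width binning, the negative-log rarity weight, the weighted-Jaccard distance, and all function names are assumptions made for illustration. Only the complete-linkage rule (cluster distance equals the largest pairwise distance) follows the abstract's wording directly.

```python
import math
from collections import Counter


def peak_bins(mz_values, bin_width=1.0):
    """Discretize fragment m/z values into a set of integer bins."""
    return {int(mz // bin_width) for mz in mz_values}


def rarity_weights(spectra):
    """Weight each bin by -log of the fraction of spectra containing it.

    The +1 in the denominator keeps every weight strictly positive.
    """
    n = len(spectra)
    counts = Counter(b for bins in spectra for b in bins)
    return {b: -math.log(c / (n + 1)) for b, c in counts.items()}


def rarity_distance(a, b, weights):
    """1 - (weight of shared bins) / (weight of all bins in either spectrum)."""
    union = sum(weights[x] for x in a | b)
    if union == 0:
        return 0.0
    return 1.0 - sum(weights[x] for x in a & b) / union


def complete_linkage(cluster_a, cluster_b, weights):
    """Distance between clusters: the largest pairwise spectrum distance."""
    return max(rarity_distance(a, b, weights)
               for a in cluster_a for b in cluster_b)
```

    With this weighting, two spectra that share only a ubiquitous peak stay far apart, while sharing a peak seen in few other spectra pulls them together.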

  1. Integration of fragment screening and library design.

    Science.gov (United States)

    Siegal, Gregg; Ab, Eiso; Schultz, Jan

    2007-12-01

    With more than 10 years of practical experience and theoretical analysis, fragment-based drug discovery (FBDD) has entered the mainstream of the pharmaceutical and biotech industries. An array of biophysical techniques has been used to detect the weak interaction between a fragment and the target. Each technique presents its own requirements regarding the fragment collection and the target; therefore, in order to optimize the potential of FBDD, the nature of the target should be a driving factor for simultaneous development of both the library and the screening technology. A roadmap is now available to guide fragment-to-lead evolution when structural information is available. The next challenge is to apply FBDD to targets for which high-resolution structural information is not available.

  2. Genome sequencing of bacteria: sequencing, de novo assembly and rapid analysis using open source tools.

    Science.gov (United States)

    Kisand, Veljo; Lettieri, Teresa

    2013-04-01

    De novo genome sequencing of previously uncharacterized microorganisms has the potential to open up new frontiers in microbial genomics by providing insight into both functional capabilities and biodiversity. Until recently, Roche 454 pyrosequencing was the NGS method of choice for de novo assembly because it generates hundreds of thousands of long reads (tools for processing NGS data are increasingly free and open source and are often adopted for both their high quality and role in promoting academic freedom. The error rate of pyrosequencing the Alcanivorax borkumensis genome was such that thousands of insertions and deletions were artificially introduced into the finished genome. Despite a high coverage (~30 fold), it did not allow the reference genome to be fully mapped. Reads from regions with errors had low quality, low coverage, or were missing. The main defect of the reference mapping was the introduction of artificial indels into contigs through lower than 100% consensus and distracting gene calling due to artificial stop codons. No assembler was able to perform de novo assembly comparable to reference mapping. Automated annotation tools performed similarly on reference mapped and de novo draft genomes, and annotated most CDSs in the de novo assembled draft genomes. Free and open source software (FOSS) tools for assembly and annotation of NGS data are being developed rapidly to provide accurate results with less computational effort. Usability is not high priority and these tools currently do not allow the data to be processed without manual intervention. Despite this, genome assemblers now readily assemble medium short reads into long contigs (>97-98% genome coverage). A notable gap in pyrosequencing technology is the quality of base pair calling and conflicting base pairs between single reads at the same nucleotide position. Regardless, using draft whole genomes that are not finished and remain fragmented into tens of contigs allows one to characterize

  3. incaRNAfbinv: a web server for the fragment-based design of RNA sequences

    Science.gov (United States)

    Drory Retwitzer, Matan; Reinharz, Vladimir; Ponty, Yann; Waldispühl, Jérôme; Barash, Danny

    2016-01-01

    In recent years, new methods for computational RNA design have been developed and applied to various problems in synthetic biology and nanotechnology. Lately, there is considerable interest in incorporating essential biological information when solving the inverse RNA folding problem. Correspondingly, RNAfbinv aims at including biologically meaningful constraints and is the only program to date that performs a fragment-based design of RNA sequences. In doing so it allows the design of sequences that do not necessarily exactly fold into the target, as long as the overall coarse-grained tree graph shape is preserved. Augmented by the weighted sampling algorithm of incaRNAtion, our web server called incaRNAfbinv implements the method devised in RNAfbinv and offers an interactive environment for the inverse folding of RNA using a fragment-based design approach. It takes as input: a target RNA secondary structure; optional sequence and motif constraints; optional target minimum free energy, neutrality and GC content. In addition to the design of synthetic regulatory sequences, it can be used as a pre-processing step for the detection of novel naturally occurring RNAs. The two complementary methodologies RNAfbinv and incaRNAtion are merged together and fully implemented in our web server incaRNAfbinv, available at http://www.cs.bgu.ac.il/incaRNAfbinv. PMID:27185893

  4. A new in silico classification model for ready biodegradability, based on molecular fragments.

    Science.gov (United States)

    Lombardo, Anna; Pizzo, Fabiola; Benfenati, Emilio; Manganaro, Alberto; Ferrari, Thomas; Gini, Giuseppina

    2014-08-01

    Regulations such as the European REACH (Registration, Evaluation, Authorization and restriction of Chemicals) often require chemicals to be evaluated for ready biodegradability, to assess the potential risk for environmental and human health. Because not all chemicals can be tested, there is an increasing demand for tools for quick and inexpensive biodegradability screening, such as computer-based (in silico) theoretical models. We developed an in silico model starting from a dataset of 728 chemicals with ready biodegradability data (MITI-test Ministry of International Trade and Industry). We used the novel software SARpy to automatically extract, through a structural fragmentation process, a set of substructures statistically related to ready biodegradability. Then, we analysed these substructures in order to build some general rules. The model consists of a rule-set made up of the combination of the statistically relevant fragments and of the expert-based rules. The model gives good statistical performance with 92%, 82% and 76% accuracy on the training, test and external set respectively. These results are comparable with other in silico models like BIOWIN developed by the United States Environmental Protection Agency (EPA); moreover this new model includes an easily understandable explanation. Copyright © 2014 Elsevier Ltd. All rights reserved.

  5. Advanced Whole-Genome Sequencing and Analysis of Fetal Genomes from Amniotic Fluid.

    Science.gov (United States)

    Mao, Qing; Chin, Robert; Xie, Weiwei; Deng, Yuqing; Zhang, Wenwei; Xu, Huixin; Zhang, Rebecca Yu; Shi, Quan; Peters, Erin E; Gulbahce, Natali; Li, Zhenyu; Chen, Fang; Drmanac, Radoje; Peters, Brock A

    2018-04-01

    Amniocentesis is a common procedure, the primary purpose of which is to collect cells from the fetus to allow testing for abnormal chromosomes, altered chromosomal copy number, or a small number of genes that have small single- to multibase defects. Here we demonstrate the feasibility of generating an accurate whole-genome sequence of a fetus from either the cellular or cell-free DNA (cfDNA) of an amniotic sample. cfDNA and DNA isolated from the cell pellet of 31 amniocenteses were sequenced to approximately 50× genome coverage by use of the Complete Genomics nanoarray platform. In a subset of the samples, long fragment read libraries were generated from DNA isolated from cells and sequenced to approximately 100× genome coverage. Concordance of variant calls between the 2 DNA sources and with parental libraries was >96%. Two fetal genomes were found to harbor potentially detrimental variants in chromodomain helicase DNA binding protein 8 (CHD8) and LDL receptor-related protein 1 (LRP1), variations of which have been associated with autism spectrum disorder and keratosis pilaris atrophicans, respectively. We also discovered drug sensitivities and carrier information of fetuses for a variety of diseases. We were able to elucidate the complete genome sequence of 31 fetuses from amniotic fluid and demonstrate that the cfDNA or DNA from the cell pellet can be analyzed with little difference in quality. We believe that current technologies could analyze this material in a highly accurate and complete manner and that analyses like these should be considered for addition to current amniocentesis procedures. © 2018 American Association for Clinical Chemistry.

  6. Complete genome sequence of Yersinia pestis strain 91001, an isolate avirulent to humans

    DEFF Research Database (Denmark)

    Song, Yajun; Tong, Zongzhong; Wang, Jin

    2004-01-01

    pseudo-genes. Due to the rearrangements mediated by insertion elements, the structure of the 91001 chromosome shows dramatic differences compared with CO92 and KIM. Based on the analysis of plasmids and chromosome architectures, pseudogene distribution, nitrate reduction negative mechanism and gene...... comparison, we conclude that strain 91001 and other strains isolated from M. brandti might have evolved from ancestral Y. pestis in a different lineage. The large genome fragment deletions in the 91001 chromosome and some pseudogenes may contribute to its unique nonpathogenicity to humans and host...

  7. Fragment-based drug design and identification of HJC0123, a novel orally bioavailable STAT3 inhibitor for cancer therapy

    Science.gov (United States)

    Chen, Haijun; Yang, Zhengduo; Ding, Chunyong; Chu, Lili; Zhang, Yusong; Terry, Kristin; Liu, Huiling; Shen, Qiang; Zhou, Jia

    2013-01-01

    Fragment-based drug design (FBDD) is a promising approach for the generation of lead molecules with enhanced activity and especially drug-like properties against therapeutic targets. Herein, we report the fragment-based drug design, systematic chemical synthesis and pharmacological evaluation of novel scaffolds as potent anticancer agents by utilizing six privileged fragments from known STAT3 inhibitors. Several new molecules such as compounds 5, 12, and 19 that may act as advanced chemical leads have been identified. The most potent compound 5 (HJC0123) has been shown to inhibit STAT3 promoter activity, downregulate phosphorylation of STAT3, increase the expression of cleaved caspase-3, inhibit cell cycle progression and promote apoptosis in breast and pancreatic cancer cells with low micromolar to nanomolar IC50 values. Furthermore, compound 5 significantly suppressed estrogen receptor (ER)-negative breast cancer MDA-MB-231 xenograft tumor growth in vivo (p.o.), indicating its great potential as an efficacious and orally bioavailable drug candidate for human cancer therapy. PMID:23416191

  8. Hands as markers of fragmentation

    Directory of Open Access Journals (Sweden)

    A. Barnard

    2005-07-01

    Margaret Atwood is an internationally read, translated, and critiqued writer whose novels have established her as one of the most esteemed authors in English (McCombs & Palmer, 1991:1). Critical studies of her work deal mainly with notions of identity from psychoanalytical perspectives. This study has identified a gap in current critical studies on Atwood’s works, namely the challenging of textual unity which is paralleled in the challenging of the traditional (single) narrative voice. The challenging of textual unity and the single narrative voice brings about the fragmentation of both. This article will focus on the role that hands play as markers of fragmentation in “The Blind Assassin” (2000). In the novel, the writing hand destabilises the narrative voice, since it is not connected to the voice of a single author. If the author of the text – the final signified – is eliminated, the text becomes fragmentary and open, inviting the reader to contribute to the creation of meaning. Hands play a significant role in foregrounding the narrator’s fragmented identity, and consequently, the fragmentation of the text. We will investigate this concept in the light of Roland Barthes’ notion of the scriptor, whose hand is metaphorically severed from his or her “voice”. Instead of the text being a unified entity, it becomes unstable and it displays the absence of hierarchical textual levels. Based mainly on Barthes’ writings, this article concludes that hands foreground the narrator’s fragmented identity, which is paralleled in the fragmented text.

  9. Genome Engineering and Modification Toward Synthetic Biology for the Production of Antibiotics.

    Science.gov (United States)

    Zou, Xuan; Wang, Lianrong; Li, Zhiqiang; Luo, Jie; Wang, Yunfu; Deng, Zixin; Du, Shiming; Chen, Shi

    2018-01-01

    Antibiotic production is often governed by large gene clusters composed of genes related to antibiotic scaffold synthesis, tailoring, regulation, and resistance. With the expansion of genome sequencing, a considerable number of antibiotic gene clusters have been isolated and characterized. Emerging genome engineering techniques make more efficient engineering of antibiotics possible. In addition to genomic editing, multiple synthetic biology approaches have been developed for the exploration and improvement of antibiotic natural products. Here, we review the progress in the development of these genome editing techniques used to engineer new antibiotics, focusing on three aspects of genome engineering: direct cloning of large genomic fragments, genome engineering of gene clusters, and regulation of gene cluster expression. This review will not only summarize the current uses of genomic engineering techniques for cloning and assembly of antibiotic gene clusters or for altering antibiotic synthetic pathways but will also provide perspectives on the future directions of rebuilding biological systems for the design of novel antibiotics. © 2017 Wiley Periodicals, Inc.

  10. MobilomeFINDER: web-based tools for in silico and experimental discovery of bacterial genomic islands

    Science.gov (United States)

    Ou, Hong-Yu; He, Xinyi; Harrison, Ewan M.; Kulasekara, Bridget R.; Thani, Ali Bin; Kadioglu, Aras; Lory, Stephen; Hinton, Jay C. D.; Barer, Michael R.; Rajakumar, Kumar

    2007-01-01

    MobilomeFINDER (http://mml.sjtu.edu.cn/MobilomeFINDER) is an interactive online tool that facilitates bacterial genomic island or ‘mobile genome’ (mobilome) discovery; it integrates the ArrayOme and tRNAcc software packages. ArrayOme utilizes a microarray-derived comparative genomic hybridization input data set to generate ‘inferred contigs’ produced by merging adjacent genes classified as ‘present’. Collectively these ‘fragments’ represent a hypothetical ‘microarray-visualized genome (MVG)’. ArrayOme permits recognition of discordances between physical genome and MVG sizes, thereby enabling identification of strains rich in microarray-elusive novel genes. Individual tRNAcc tools facilitate automated identification of genomic islands by comparative analysis of the contents and contexts of tRNA sites and other integration hotspots in closely related sequenced genomes. Accessory tools facilitate design of hotspot-flanking primers for in silico and/or wet-science-based interrogation of cognate loci in unsequenced strains and analysis of islands for features suggestive of foreign origins; island-specific and genome-contextual features are tabulated and represented in schematic and graphical forms. To date we have used MobilomeFINDER to analyse several Enterobacteriaceae, Pseudomonas aeruginosa and Streptococcus suis genomes. MobilomeFINDER enables high-throughput island identification and characterization through increased exploitation of emerging sequence data and PCR-based profiling of unsequenced test strains; subsequent targeted yeast recombination-based capture permits full-length sequencing and detailed functional studies of novel genomic islands. PMID:17537813
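
    The 'inferred contig' step described for ArrayOme can be illustrated with a minimal sketch. It is not the ArrayOme implementation: the function names, the boolean present/absent encoding, and the use of summed gene lengths as a stand-in for the size of the microarray-visualized genome (MVG) are all assumptions made for illustration.

```python
from typing import List, Tuple


def inferred_contigs(calls: List[bool]) -> List[Tuple[int, int]]:
    """Merge runs of adjacent 'present' genes into inclusive (start, end) index pairs.

    `calls` is a hypothetical per-gene present/absent classification,
    ordered by position on the reference genome.
    """
    contigs = []
    start = None
    for i, present in enumerate(calls):
        if present and start is None:
            start = i                       # a new run of present genes begins
        elif not present and start is not None:
            contigs.append((start, i - 1))  # an absent gene closes the run
            start = None
    if start is not None:                   # close a run that reaches the last gene
        contigs.append((start, len(calls) - 1))
    return contigs


def mvg_size(contigs: List[Tuple[int, int]], gene_lengths: List[int]) -> int:
    """Sum gene lengths over all inferred contigs (an assumed proxy for MVG size)."""
    return sum(sum(gene_lengths[s:e + 1]) for s, e in contigs)


# Illustrative data only: nine genes with made-up calls and lengths (bp).
calls = [True, True, False, True, False, False, True, True, True]
lengths = [900, 1200, 800, 600, 1000, 700, 1500, 300, 450]

contigs = inferred_contigs(calls)
print(contigs)                     # [(0, 1), (3, 3), (6, 8)]
print(mvg_size(contigs, lengths))  # 4950
```

    Comparing such an MVG size with a physically measured genome size is the kind of discordance check the abstract mentions; a large shortfall would hint at genes the microarray cannot detect.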

  11. Controlled fragmentation

    International Nuclear Information System (INIS)

    Arnold, Werner

    2002-01-01

    Contrary to natural fragmentation, controlled fragmentation offers the possibility to adapt fragment parameters like size and mass to the performance requirements in a very flexible way. Known mechanisms like grooves inside the casing weaken the structure. This is, however, excluded for applications with high accelerations during launch or piercing requirements, for example on a semi-armor-piercing penetrator. Another method to achieve controlled fragmentation with an additional grid layer is presented, with which the required grooves are produced 'just in time' inside the casing during detonation of the high explosive. The process of generating the grooves aided by the grid layer was studied using the hydrocode HULL with respect to varying grid designs and material combinations. Subsequent to this, a large range of these theoretically investigated combinations was examined in extensive experimental tests. With an optimised grid design and a suitable material selection, controlled fragmentation allows a very flexible adaptation to the set requirements. Additional advantages like the increase of perforation performance or incendiary amplification can be realized with the grid layer.

  12. DFAST and DAGA: web-based integrated genome annotation tools and resources.

    Science.gov (United States)

    Tanizawa, Yasuhiro; Fujisawa, Takatomo; Kaminuma, Eli; Nakamura, Yasukazu; Arita, Masanori

    2016-01-01

    Quality assurance and correct taxonomic affiliation of data submitted to public sequence databases have been an everlasting problem. The DDBJ Fast Annotation and Submission Tool (DFAST) is a newly developed genome annotation pipeline with quality and taxonomy assessment tools. To enable annotation of ready-to-submit quality, we also constructed curated reference protein databases tailored for lactic acid bacteria. DFAST was developed so that all the procedures required for DDBJ submission could be done seamlessly online. The online workspace would be especially useful for users not familiar with bioinformatics skills. In addition, we have developed a genome repository, DFAST Archive of Genome Annotation (DAGA), which currently includes 1,421 genomes covering 179 species and 18 subspecies of two genera, Lactobacillus and Pediococcus, obtained from both DDBJ/ENA/GenBank and Sequence Read Archive (SRA). All the genomes deposited in DAGA were annotated consistently and assessed using DFAST. To assess the taxonomic position based on genomic sequence information, we used the average nucleotide identity (ANI), which showed high discriminative power to determine whether two given genomes belong to the same species. We corrected mislabeled or misidentified genomes in the public database and deposited the curated information in DAGA. The repository will improve the accessibility and reusability of genome resources for lactic acid bacteria. By exploiting the data deposited in DAGA, we found intraspecific subgroups in Lactobacillus gasseri and Lactobacillus jensenii, whose variation between subgroups is larger than the well-accepted ANI threshold of 95% to differentiate species. DFAST and DAGA are freely accessible at https://dfast.nig.ac.jp.
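
    The ANI-based species check mentioned above can be illustrated with a toy sketch. It is not DFAST's procedure: real ANI tools fragment one genome and align the pieces against the other, whereas this example assumes the fragment pairs are already aligned, gap-free and of equal length. The function names and the choice of `>=` at the 95% cutoff are likewise assumptions.

```python
from typing import List, Tuple


def percent_identity(a: str, b: str) -> float:
    """Percentage of matching positions between two pre-aligned, equal-length fragments."""
    if not a or len(a) != len(b):
        raise ValueError("fragments must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def average_nucleotide_identity(pairs: List[Tuple[str, str]]) -> float:
    """Mean percent identity over all aligned fragment pairs (simplified ANI)."""
    if not pairs:
        raise ValueError("at least one fragment pair is required")
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)


def same_species(ani: float, threshold: float = 95.0) -> bool:
    """Apply the 95% ANI species threshold quoted in the abstract."""
    return ani >= threshold


# Illustrative fragments only: one identical pair and one pair with a single mismatch.
pairs = [
    ("ACGTACGTAC", "ACGTACGTAC"),
    ("ACGTACGTAC", "ACGTACGTAA"),
]
ani = average_nucleotide_identity(pairs)
print(ani, same_species(ani))
```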

  13. Diversity-Oriented Synthesis as a Strategy for Fragment Evolution against GSK3β

    Science.gov (United States)

    2016-01-01

    Traditional fragment-based drug discovery (FBDD) relies heavily on structural analysis of the hits bound to their targets. Herein, we present a complementary approach based on diversity-oriented synthesis (DOS). A DOS-based fragment collection was able to produce initial hit compounds against the target GSK3β, allow the systematic synthesis of related fragment analogues to explore fragment-level structure–activity relationship, and finally lead to the synthesis of a more potent compound. PMID:27660690

  14. [Fingerprints identification of Gynostemma pentaphyllum by RAPD and cloning and analysis of its specific DNA fragment].

    Science.gov (United States)

    Jiang, Jun-fu; Li, Xiong-ying; Wu, Yao-sheng; Luo, Yu; Zhao, Rui-qiang; Lan, Xiu-wan

    2009-02-01

    To identify Gynostemma pentaphyllum and its adulterant Cayratia japonica at the DNA level. Two screened random primers (WGS001, WGS004) were used for random amplification of genomic DNA extracted from Gynostemma pentaphyllum and Cayratia japonica collected from different habitats. After amplification with WGS004, one characteristic fragment of about 500 bp, common to all Gynostemma pentaphyllum samples studied but absent from Cayratia japonica, was cloned and sequenced. The sequences obtained were analyzed for identity and compared against GenBank with the Blastn program. The two primers amplified clearly different bands in the genomic DNA fingerprints of the two species, and these bands clearly distinguished Gynostemma pentaphyllum from Cayratia japonica. Sequence alignment of seven cloned bands showed identities ranging from 45.7% to 94.5%. No similar sequences were found in GenBank, indicating that these seven DNA fragments had not been reported before and are likely new sequences. The RAPD technique can be used for accurate identification of Gynostemma pentaphyllum and its adulterant Cayratia japonica. In addition, the Gynostemma pentaphyllum-specific DNA sequences from this study are useful for further research on species identification and assisted selection breeding in Gynostemma pentaphyllum.

  15. Impact of genome assembly status on ChIP-Seq and ChIP-PET data mapping

    Directory of Open Access Journals (Sweden)

    Sachs Laurent

    2009-12-01

    Background: ChIP-Seq and ChIP-PET can potentially be used with any genome for genome-wide profiling of protein-DNA interaction sites. Unfortunately, it is probable that most genome assemblies will never reach the quality of the human genome assembly. Therefore, it remains to be determined whether ChIP-Seq and ChIP-PET are practicable with genome sequences other than a few (e.g. human and mouse). Findings: Here, we used in silico simulations to assess the impact of completeness or fragmentation of genome assemblies on ChIP-Seq and ChIP-PET data mapping. Conclusions: Most currently published genome assemblies are suitable for mapping the short sequence tags produced by ChIP-Seq or ChIP-PET.

  16. Framing Fragmentation

    DEFF Research Database (Denmark)

    Bundgaard, Charlotte

    2009-01-01

    Contemporary industrialized architecture based on advanced information technology and highly technological production processes, implies a radically different approach to architecture than what we have experienced in the past. Works of architecture composed of prefabricated building components......, contain distinctive architectural traits, not only based on rational repetition, but also supporting composition and montage as dynamic concepts. Prefab architecture is an architecture of fragmentation, individualization and changeability, and this sets up new challenges for the architect. This paper...... tries to develop a strategy for the architect dealing with industrially based architecture; a strategy which exploits architectural potentials in industrial building, which recognizes the rules of mass production and which redefines the architect’s position among the agents of building. If recent...

  17. Rationalizing fragment based drug discovery for BACE1: insights from FB-QSAR, FB-QSSR, multi objective (MO-QSPR) and MIF studies.

    Science.gov (United States)

    Manoharan, Prabu; Vijayan, R S K; Ghoshal, Nanda

    2010-10-01

    The ability to identify fragments that interact with a biological target is a key step in FBDD. To date, the concept of fragment based drug design (FBDD) is increasingly driven by bio-physical methods. To expand the boundaries of the QSAR paradigm, and to rationalize FBDD using an in silico approach, we propose a fragment based QSAR methodology referred to herein as FB-QSAR. The FB-QSAR methodology was validated on a dataset consisting of 52 hydroxyethylamine (HEA) inhibitors, disclosed by GlaxoSmithKline Pharmaceuticals as potential anti-Alzheimer agents. To address the issue of target selectivity, a major confounding factor in the development of selective BACE1 inhibitors, FB-QSSR models were developed using the reported off-target activity values. A heat map was constructed based on the activity and selectivity profile of the individual R-group fragments, and was in turn used to identify superior R-group fragments. Further, simultaneous optimization of multiple properties, an issue encountered in real-world drug discovery scenarios and often overlooked in QSAR approaches, was addressed using a Multi Objective (MO-QSPR) method that balances properties based on the defined objectives. MO-QSPR was implemented using the Derringer and Suich desirability algorithm to identify the optimal level of independent variables (X) that could confer a trade-off between selectivity and activity. The results obtained from FB-QSAR were further substantiated using MIF (Molecular Interaction Fields) studies. To exemplify the potential of FB-QSAR and MO-QSPR in a pragmatic fashion, the insights gleaned from the MO-QSPR study were reverse engineered using Inverse-QSAR in a combinatorial fashion to enumerate some prospective novel, potent and selective BACE1 inhibitors.
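
    The Derringer and Suich desirability approach named above can be sketched as follows. This is a generic illustration, not the authors' MO-QSPR code: it shows only the "larger is better" individual desirability and the geometric-mean aggregation. The function names, the property bounds and the compound values are hypothetical.

```python
import math
from typing import Sequence


def desirability_larger_is_better(y: float, low: float, target: float,
                                  weight: float = 1.0) -> float:
    """Map a response y onto [0, 1]: 0 at or below `low`, 1 at or above `target`."""
    if y <= low:
        return 0.0
    if y >= target:
        return 1.0
    return ((y - low) / (target - low)) ** weight


def overall_desirability(ds: Sequence[float]) -> float:
    """Geometric mean of individual desirabilities; any zero makes the overall score zero."""
    if not ds:
        raise ValueError("at least one desirability value is required")
    return math.prod(ds) ** (1.0 / len(ds))


# Hypothetical compound: predicted activity (pIC50) and a selectivity score,
# each with made-up lower bounds and targets.
d_activity = desirability_larger_is_better(7.5, low=6.0, target=9.0)
d_selectivity = desirability_larger_is_better(1.5, low=1.0, target=2.0)
print(d_activity, d_selectivity, overall_desirability([d_activity, d_selectivity]))
```

    The geometric mean is what enforces the trade-off: a candidate that fails one objective entirely scores zero overall, however well it does on the other.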

  18. Rationalizing fragment based drug discovery for BACE1: insights from FB-QSAR, FB-QSSR, multi objective (MO-QSPR) and MIF studies

    Science.gov (United States)

    Manoharan, Prabu; Vijayan, R. S. K.; Ghoshal, Nanda

    2010-10-01

    The ability to identify fragments that interact with a biological target is a key step in FBDD. To date, the concept of fragment based drug design (FBDD) is increasingly driven by bio-physical methods. To expand the boundaries of the QSAR paradigm, and to rationalize FBDD using an in silico approach, we propose a fragment based QSAR methodology referred to herein as FB-QSAR. The FB-QSAR methodology was validated on a dataset consisting of 52 hydroxyethylamine (HEA) inhibitors, disclosed by GlaxoSmithKline Pharmaceuticals as potential anti-Alzheimer agents. To address the issue of target selectivity, a major confounding factor in the development of selective BACE1 inhibitors, FB-QSSR models were developed using the reported off-target activity values. A heat map was constructed based on the activity and selectivity profile of the individual R-group fragments, and was in turn used to identify superior R-group fragments. Further, simultaneous optimization of multiple properties, an issue encountered in real-world drug discovery scenarios and often overlooked in QSAR approaches, was addressed using a Multi Objective (MO-QSPR) method that balances properties based on the defined objectives. MO-QSPR was implemented using the Derringer and Suich desirability algorithm to identify the optimal level of independent variables (X) that could confer a trade-off between selectivity and activity. The results obtained from FB-QSAR were further substantiated using MIF (Molecular Interaction Fields) studies. To exemplify the potential of FB-QSAR and MO-QSPR in a pragmatic fashion, the insights gleaned from the MO-QSPR study were reverse engineered using Inverse-QSAR in a combinatorial fashion to enumerate some prospective novel, potent and selective BACE1 inhibitors.

  19. Brief Guide to Genomics: DNA, Genes and Genomes

    Science.gov (United States)

    ... clinic. Most new drugs based on genome-based research are estimated to be at least 10 to 15 years away, though recent genome-driven efforts in lipid-lowering therapy have considerably shortened that interval. According ...

  20. Fragmentation cross sections outside the limiting-fragmentation regime

    CERN Document Server

    Sümmerer, K

    2003-01-01

    The empirical parametrization of fragmentation cross sections, EPAX, has been successfully applied to estimate fragment production cross sections in reactions of heavy ions at high incident energies. It is checked whether a similar parametrization can be found for proton-induced spallation around 1 GeV, the range of interest for ISOL-type RIB facilities. The validity of EPAX for medium-energy heavy-ion induced reactions is also checked. Only a few datasets are available, but in general EPAX predicts the cross sections rather well, except for fragments close to the projectile, where the experimental cross sections are found to be larger.